WO2005011366A1 - Polynucleotides, polypeptides and their use in the production of plants with altered condensed tannins - Google Patents

Polynucleotides, polypeptides and their use in the production of plants with altered condensed tannins Download PDF

Info

Publication number
WO2005011366A1
WO2005011366A1 PCT/NZ2004/000171 NZ2004000171W WO2005011366A1 WO 2005011366 A1 WO2005011366 A1 WO 2005011366A1 NZ 2004000171 W NZ2004000171 W NZ 2004000171W WO 2005011366 A1 WO2005011366 A1 WO 2005011366A1
Authority
WO
WIPO (PCT)
Prior art keywords
seq
plant
polypeptide
polynucleotide
sequence
Prior art date
Application number
PCT/NZ2004/000171
Other languages
French (fr)
Inventor
David John Whittaker
Kieran Michael Elborough
Colin Robert South
Original Assignee
Fonterra Co-Operative Group Limited
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fonterra Co-Operative Group Limited filed Critical Fonterra Co-Operative Group Limited
Publication of WO2005011366A1 publication Critical patent/WO2005011366A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8241Phenotypically and genetically modified plants via recombinant DNA technology
    • C12N15/8242Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
    • C12N15/8243Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine

Definitions

  • the present invention relates to compositions and methods for producing and selecting plants with altered condensed tannin production.
  • CTs Proanthocyanidins or condensed tanmns
  • CTs When present in forage crops, CTs, bind to forage proteins, prevent their rapid digestion and thereby reduce the potential for bloating and the need for bloat- preventing drenches.
  • an increase in CT-bound protein by-passing the rumen results in improved animal nutrition due to increased availability of nitrogen in the latter stages of the digestive process (McMahon et al, 2000, Can J Plant Sci 80,469; Barry and McNabb, 1999, Rr. J. Nutr. 81,263).
  • CTs are absent or at sub-optimal levels in the leaves of certain forage crops such as ryegrass and alfalfa. It would thus be advantageous to increase the content of CTs in appropriate tissues of such crops. In some cases however high levels of CT can reduce the palatability of forage crops (Kumar and Singh, 1984, JAgric Food Chem. 32, 447) and reduction of CT content in such forage crops would be beneficial. CTs have also received attention due to their powerful antioxidant properties and associated beneficial effects on human health and immunity (Ross and Kasum, 2002, Annu Rev Nutr 22,19; Bagchi et al, 2000, Toxicology 148, 87; Lin et al, 2002, JN ⁇ t Prod 65, 505). Of particular interest are the CTs from fruit such as cranberry and grape (Foo et al, 2000, Phytochemistry 54, 173 ; Pataki et al, 2002, Am J Clin Nutr 75, 894)
  • the flavonoids are secondary metabolites derived from products of the phenylpropanoid biosynthetic pathway and the Krebs cycle.
  • the final flavonoid compounds include flavonols (colorless pigments), anthocyanins (pink and red pigments), and condensed tannins (colorless pigments that turn brown on oxidation).
  • the sub-pathways for each of these flavonoid compounds share common initial biosynthesis steps.
  • the second enzyme which appears specific to condensed tannin biosynthesis is anthocyanidin reductase, encoded by the BANYULS (BAN) gene of Arabidopsis thaliana and Medicago truncatula (Xie et al, 2003, Science, 299, 396).
  • BANYULS cDNAs from Arabidopsis and Medicago were expressed in Escherichia coli, and the recombinant proteins were shown to convert anthocyanidins to their corresponding 2,3 -cis- flavan-3-ols. This activity has yet to be been confirmed in plant extracts.
  • SN appears to be orthologous to tt8, one of the regulatory genes characterized in Arabidopsis. Like tt8, the SN gene appears to up-regulate CTs only in competent tissues. Plants with altered CT content may also have altered nutritional value, increased pest and disease resistance or improved nutritional qualities (WO 98/07836). Plants with altered CT content may also lead to improved silage storage through reduced bacterial and fungal activity, reduced methane production in cattle grazed up such plants, reduced parasite load in the intestines of animals grazed on such plants, improved resistance to insects and birds, improved quality of oilseeds and grains and altered flavour, colour and taste characteristics in fruit, fruit juice and wine (WO 02/10412).
  • WO 02/10412 is directed towards novel regulatory and biosynthetic genes and encoded proteins which can be used in methods to alter tannin production in plants for a variety of agricultural benefits. No examples describing such plants are provided.
  • WO 98/07836 is directed towards sequences encoding leucoanthocyanidin reductase (LAR) and to use of such sequences to regulate condensed tannin biosynthesis in plants.
  • the sequences are stated to be useful in the production of bloat-sage forage crops and other crops with altered nutritional value, increased disease resistance and pest resistance or improved malting qualities. Prophetic examples of such crops are provided.
  • the invention provides a method of altering condensed tannin production in a plant comprising transformation of a plant cell or plant with a genetic construct comprising a sequence selected from SEQ ID NO:2, SEQ ID NO:3 and SEQ ID NO:4 or a functionally equivalent variant thereof, wherein said genetic construct encodes a polypeptide involved in condensed tannin production in plants.
  • the invention provides a transgenic plant having altered condensed tannin content comprising a genetic construct comprising a sequence selected from SEQ ID NO:2, SEQ ID NO:3 and SEQ ID NO:4 or a functionally equivalent variant thereof, wherein said genetic construct encodes a polypeptide involved in condensed tannin production in plants.
  • the polynucleotide encodes a polypeptide comprising the amino acid sequence: S Xi X 2 C C N X 3 T I X 4 D X 5 F X 6 X 7 X 8 X 9 F E X 10 L F X u X 12 R N (SEQ ID NO:l) wherein Xi to X 12 are as herein defined, which encodes a polypeptide involved in condensed tannin production in plants.
  • the polynucleotide comprises a sequence selected from a group of related sequences (SEQ ID NOs: 2, 3 and 4) or a variant thereof, wherein the polynucleotide encodes a polypeptide involved in condensed tannin production in plants.
  • the polynucleotide comprises SEQ ID NO:2 or a variant thereof, and the polynucleotide encodes a polypeptide involved in condensed tannin production in plants.
  • the polynucleotide comprises SEQ ID NO: 3 or a variant thereof, and the polynucleotide encodes a polypeptide involved in condensed tannin production in plants.
  • a preferred variant is the polypeptide encoded by F286 (see US Published Patent Application 2004/0049808, paragraph 67).
  • the polynucleotide comprises SEQ ID NO:4 or a variant thereof, and the polynucleotide encodes a polypeptide involved in condensed tannin production in plants.
  • the polynucleotide is selected from naturally occurring polynucleotide orthologue of SEQ ID NOs: 2, 3 and 4, and variants of such orthologues, wherein the polynucleotide orthologues and variants encode polypeptides involved in condensed tannin production in plants.
  • Exemplary orthologues are disclosed herein and identified as SEQ ID NOs: 5-9 of the sequence listing.
  • Fragments of SEQ ID NOs: 2, 3 and 4 are useful as probes, primers, gene silencing molecules, and in polynucleotide-based selection methods of the invention.
  • Polynucleotides comprising fragments of orthologues and/or variants are useful in the invention.
  • Isolated polynucleotides are useful in methods for altering condensed tannin production in plants and in traditional breeding programs for selection of individuals with altered CT content.
  • the invention provides a plant cell which comprises one or more of the genetic constructs comprising a polynucleotide as described above.
  • the invention provides methods for selecting plants with altered condensed tannin content comprising the steps of: (a) contacting a sample from a plant with a polynucleotide selected from SEQ ID NO:2,
  • SEQ ID NO:3, SEQ ID NO:4 and orthologues thereof, or a fragment of one of these, as a probe or primer (b) determining presence/content of complementary sequences to assess altered expression of a polypeptide involved in condensed tannin production in a plant; and (c) selection of a plant showing such altered expression.
  • the invention provides methods for selecting plants with altered condensed tannin content comprising the steps of: (a) contacting a plant sample or plant antibody raised against a polypeptide coded by a nucleotide sequence comprising SEQ ID NO:2, SEQ ID NO:3 and SEQ ID NO:4 or a naturally occurring orthologue thereof to assess altered expression of a polypeptide involved in condensed tannin production in a plant; and (b) selection of a plant shown such altered expression.
  • the invention provides plants selected by a method of the invention.
  • plant selected by a method of the invention is intended to include any whole plant, part, portion propagules or progeny of a plant selected by a method of the invention.
  • transgenic plant is intended to include a whole plant or any part of a plant, propagules and progeny of a plant.
  • 'propagule' means any part of a plant that may be used in reproduction or propagation, either sexual or asexual, including seeds and cuttings.
  • polynucleotide(s), means a single or double-stranded deoxyribonucleotide or ribonucleotide polymer of any length, and include as non-limiting examples, coding and non-coding sequences of a gene, sense and antisense sequences, exons, introns, genomic DNA, cDNA, pre-mRNA, mRNA, rRNA, siRNA, miRNA, tRNA, ribozymes, recombinant polynucleotides, isolated and purified naturally occurring DNA or RNA sequences, synthetic RNA and DNA sequences, nucleic acid probes, primers, fragments, genetic constructs, vectors and modified polynucleotides.
  • a "fragment" of a polynucleotide sequence provided herein is a subsequence of contiguous nucleotides that is capable of specific hybridization to a target of interest, e.g., a sequence that is at least 15 nucleotides in length.
  • a fragment of a polynucleotide sequence can be used in antisense, gene silencing, triple helix or ribozyme technology, or as a primer, a probe, included in a microarray, or used in polynucleotide-based selection methods of the invention.
  • primer refers to a short polynucleotide, usually having a free 3 'OH group, that is hybridized to a template and used for priming polymerization of a polynucleotide complementary to the target.
  • probe refers to a short polynucleotide that is used to detect a polynucleotide sequence that is complementary to the probe in a hybridization-based assay.
  • polypeptide encompasses amino acid chains of any length, including full-length proteins, in which amino acid residues are linked by covalent peptide bonds. Polypeptides may be purified natural products, or may be produced partially or wholly using recombinant or synthetic techniques.
  • a "fragment" of a polypeptide is a subsequence of the polypeptide that performs a function that is required for the biological activity and/or provides three dimensional structure of the polypeptide.
  • isolated as applied to the polynucleotide or polypeptide sequences disclosed herein is used to refer to sequences that are removed from their natural cellular environment.
  • An isolated molecule may be obtained by any method or combination of methods including biochemical, recombinant, and synthetic techniques.
  • recombinant refers to a polynucleotide sequence that is removed from sequences that surround it in its natural context and/or is recombined with sequences that are not present in its natural context.
  • a “recombinant” polypeptide sequence is produced by translation from a “recombinant” polynucleotide sequence.
  • variant refers to polynucleotide or polypeptide sequences different from the specifically identified sequences, wherein one or more nucleotides or amino acid residues is deleted, substituted, or added. Variants may be naturally occurring allelic variants, or non-naturally occurring variants. Variants may be from the same or from other species and encompass homologous, paralogues and orthologues. The term “variant” encompasses both cDNA and genomic sequences.
  • Variant sequences preferably exhibit at least 50%, more preferably at least 70%, more preferably at least 80%, more preferably at least 90%, more preferably at least 95%, more preferably at least 98%, and most preferably at least 99% identity to a sequences of the present invention.
  • the term variant, with reference to polynucleotides, also encompasses sequences complementary to any of the polynucleotides used in the invention.
  • percent identity or “percent identical” when used in connection with the polynucleotide molecules and polypeptides, is defined as the percentage of nucleic acid residues in a candidate polynucleotide sequence, or the percentage of amino acid residues in a candidate polypeptide sequence, that are identical with a subject polynucleotide sequence or polypeptide molecule sequence (such as the polynucleotide sequence of SEQ ID NO: 2), after aligning the candidate and subject sequences to achieve the maximum percent identity, and not considering any nucleic acid residue substitutions as part of the polynucleotide sequence identity.
  • the candidate polynucleotide sequence or polypeptide sequence (which may be a portion of a larger polynucleotide sequence or polypeptide sequence) is the same length as the subject polynucleotide sequence or polypeptide sequence, and no gaps are introduced into the candidate polynucleotide sequence or polypeptide sequence in order to achieve the best alignment.
  • Polynucleotide sequence identity can be determined in the following manner.
  • the subject polynucleotide sequence is used to search a polynucleotide sequence database, such as the GenBank database (accessible at web site http ://www. nobi. nln. nih. gov/blast/), using the program BLASTN version 2.2.5 (based on Altschul et al, 1997, Nucleic Acids Res, 25, 3389).
  • the programs are used in the ungapped mode. Default filtering is used to remove sequence homologies due to regions of low complexity.
  • the default parameters of BLASTN, BLASTX and tBLASTX are utilized.
  • Polypeptide sequence identity can be determined in the following manner.
  • the subject polypeptide sequence is used to search a polypeptide sequence database, such as the GenBank database (accessible at web site http ://www. nobi. nln. nih. gov/blast/), using the BLASTP program.
  • the program is used in the ungapped mode.
  • Default filtering is used to remove sequence homologies due to regions of low complexity.
  • the default parameters of BLASTP are utilized.
  • variant polynucleotides and polypeptides with reference to each of the polynucleotides and polypeptides of the present invention, produce an E value of 0.01 or less when compared to the polynucleotide or polypeptide of the present invention using BLASTN, FASTA or BLASTP algorithms set to default parameters. That is, a variant polynucleotide or polypeptide is any sequence that has at least a 99% probability of being the same as the polynucleotide or polypeptide of the present invention.
  • a variant polynucleotide is a sequence having the same number or fewer nucleic acids than a polynucleotide of the present invention that has at least a 99% probability of being the same as the polynucleotide of the present invention, measured as having an E value of 0.01 or less using the BLASTN or FASTA algorithms set at parameters described above.
  • a variant polypeptide is a sequence having the same number or fewer amino acids than a polypeptide of the present invention that has at least a 99% probability of being the same as a polypeptide of the present invention, measured as having an E value of 0.01 or less using the BLASTP algorithm set at the parameters described above.
  • variant polynucleotides of the present invention hybridize to the polynucleotide sequences recited in SEQ ID NO: 2-9, or complements thereof, antisense sequences and complements thereof, under stringent conditions.
  • stringent conditions refers to hybridization conditions such as prewashing in a solution of 6X SSC, 0.2% SDS; hybridizing at 65°C, 6X SSC, 0.2% SDS overnight; followed by two washes of 30 minutes each in IX SSC, 0.1% SDS at 65° C and two washes of 30 minutes each in 0.2X SSC, 0.1% SDS at 65°C.
  • Variant polynucleotides of the present invention also encompasses polynucleotides that differ from the sequences of the invention but that, as a consequence of the degeneracy of the genetic code, encode a polypeptide having similar activity to a polypeptide encoded by a polynucleotide of the present invention.
  • a sequence alteration that does not change the amino acid sequence of the polypeptide is a "silent variation". Except for ATG (methionine) and TGG (tryptophan), other codons for the same amino acid may be changed by art recognized techniques, e.g., to optimize codon expression in a particular host organism.
  • polynucleotides comprising sequences that differ from the polynucleotide sequences identified above as a result of deletions and/or insertions totaling less than 10% of the total sequence length are also useful in methods and products of the present invention.
  • polypeptides comprising sequences that differ from the polypeptide sequences identified as a result of amino acid substitutions, insertions, and/or deletions totaling less than 10% of the total sequence length are useful in methods and products of the present invention.
  • variants of the inventive polypeptides and polynucleotides possess biological activities that are the same or similar to those of the inventive polypeptides or polynucleotides.
  • hybridize under stringent conditions refers to the ability of a polynucleotide molecule to hybridize to a target polynucleotide molecule (such as a target polynucleotide molecule immobilized on a DNA or RNA blot, such as a Southern blot or Northern blot) under defined conditions of temperature and salt concentration.
  • a target polynucleotide molecule such as a target polynucleotide molecule immobilized on a DNA or RNA blot, such as a Southern blot or Northern blot
  • the ability to hybridize under stringent hybridization conditions can be determined by initially hybridizing under less stringent conditions then increasing the stringency to the desired stringency.
  • Tm melting temperature
  • exemplary stringent hybridization conditions are 5 to 10° C below Tm.
  • Tm the Tm of a polynucleotide molecule of length less than 100 bp is reduced by approximately (500/oligonucleotide length) 0 C.
  • PNA's peptide nucleic acids
  • Tm values are higher than those for DNA-DNA or DNA-RNA hybrids, and can be calculated using the formula described in Giesen et al, Nucleic Acids Res. 1998 Nov l;26(21):5004-6.
  • Exemplary stringent hybridization conditions for a DNA-PNA hybrid having a length less than 100 bases are 5 to 10° C below the Tm.
  • the term "genetic construct” refers to a polynucleotide molecule, usually double-stranded DNA, which may have inserted into it another polynucleotide molecule (the insert polynucleotide molecule) such as, but not limited to, a cDNA molecule.
  • a genetic construct may contain the necessary elements that permit transcribing the insert polynucleotide molecule, and, optionally, translating the transcript into a polypeptide.
  • the insert polynucleotide molecule may be derived from the host cell, or may be derived from a different cell or organism and/or may be a recombinant polynucleotide. Once inside the host cell the genetic construct may become integrated in the host chromosomal DNA.
  • the genetic construct may be linked to a vector.
  • vector refers to a polynucleotide molecule, usually double stranded DNA, which is used to transport the genetic construct into a host cell.
  • the vector may be capable of replication in at least one host system, such as E. coli.
  • expression construct refers to a genetic construct that includes the necessary elements that permit transcribing the insert polynucleotide molecule, and, optionally, translating the transcript into a polypeptide.
  • An expression construct typically comprises in a 5' to 3' direction: a) a promoter functional in the host cell into which the construct will be transformed, b) the polynucleotide to be expressed, and c) a terminator functional in the host cell into which the construct will be transformed.
  • coding region or "open reading frame” (ORF) refers to the sense strand of a genomic DNA sequence or a cDNA sequence that is capable of producing a transcription product and/or a polypeptide under the control of appropriate regulatory sequences.
  • the coding sequence is identified by the presence of a 5' translation start codon and a 3' translation stop codon.
  • a "coding sequence” is capable of being expressed when it is operably linked to promoter and terminator sequences.
  • “Operably-linked” means that the coding sequence is in the correct reading frame for translation and its expression is placed under the control of regulatory elements that include promoters, tissue-specific regulatory elements, temporal regulatory elements, enhancers, silencers and terminators.
  • noncoding region refers to untranslated sequences that are upstream of the translational start site and downstream of the translational stop site. These sequences are also referred to respectively as the 5' UTR and the 3' UTR. These regions include elements required for transcription initiation and termination and for regulation of translation efficiency.
  • Terminators are sequences, which terminate transcription, and are found in the 3' untranslated ends of genes downstream of the translated sequence. Terminators are important determinants of mRNA stability and in some cases have been found to have spatial regulatory functions.
  • promoter refers to nontranscribed cis-regulatory elements upstream of the coding region that regulate gene transcription. Promoters comprise cis-initiator elements which specify the transcription initiation site and conserved boxes such as the TATA box, and motifs that are bound by transcription factors.
  • transgene is a polynucleotide that is taken from one organism and introduced into a different organism by transformation.
  • the transgene may be derived from the same species or from a different species as the species of the organism into which the transgene is introduced.
  • a "transgenic plant” refers to a plant which contains new genetic material as a result of genetic manipulation or transformation.
  • the new genetic material may be derived from a plant of the same species as the resulting transgenic plant or from a different species.
  • An "inverted repeat” is a sequence that is repeated, where the second half of the repeat is in the complementary strand, e.g., (5')GATCTA TAGATC(3') (3 CTAGAT ATCTAG(5 ')
  • Read-through transcription will produce a transcript that undergoes complementary base-pairing to form a hairpin structure provided that there is a 3-5 bp spacer between the repeated regions.
  • the term "involved in condensed tannin production” means that such polynucleotides and polypeptides may function to activate, inhibit or otherwise modulate CT production in a plant. Such modulation is intended to include alteration of the type of CT monomer/s present and alteration of the level of polymerization of CTs.
  • Such polynucleotides and polypeptide may act within the CT biosynthetic pathway or may function in regulation of the CT biosynthetic pathway.
  • polypeptide involved in condensed tannin production refers to a polypeptide, an aggregate of a polypeptide such as a dimer or other multimer, a fusion polypeptide, a polypeptide fragment, a polypeptide variant, or derivative thereof capable of affecting the production of CT in a plant.
  • the applicants have identified polynucleotides involved in condensed tannin biosynthesis in plants.
  • This cotton polynucleotide sequence was also used to identify and isolate further orthologous polynucleotides from Arabidopsis thaliana, Lolium perenne (ryegrass) and other species. Where available, 'in-silico' data indicated that the orthologous polynucleotides are expressed in tissues where condensed tannins are expected to be produced.
  • the applicants further identified a consensus polypeptide sequence present in all of the polypeptides encoded by the orthologous polynucleotide sequences, as shown in Figure 3.
  • the invention provides transgenic plants with altered CT content, including plants with both increased and decreased levels of CTs and methods for the production of such plants.
  • the invention further provides methods for selection of plants with altered condensed tannin content, as compared to the majority of similar wild-type plants. Such methods based on use of polynucleotides or polypeptides described herein may be used in traditional breeding programs in the development of varieties with altered condensed tannin content.
  • the isolated polynucleotide of this aspect encodes a polypeptide involved in condensed tannin production in plants.
  • the invention further uses an isolated polynucleotide comprising a sequence selected from a group of related sequences derived from Gossypium hirsutum (cotton), Arabidopsis thaliana and Lolium perenne (ryegrass) (SEQ ID NOs: 2, 3 and 4 respectively) or a variant thereof, wherein the polynucleotide encodes a polypeptide involved in condensed tannin production in plants.
  • the present invention also encompasses use of polynucleotide variants of SEQ ID NOs: 2, 3 and 4.
  • polynucleotide variants preferably exhibit at least 50%, more preferably at least 70%, more preferably at least 80%, more preferably at least 90%, more preferably at least 95%, more preferably at least 98%, and most preferably at least 99% identity to any one of the sequences set forth in SEQ ID NOs: 2, 3, or 4.
  • the invention further provides naturally occurring polynucleotide orthologues of SEQ ID NOs: 2, 3 or 4, and variants of such orthologues.
  • Exemplary orthologues are disclosed herein and identified as SEQ ID NO: 5-9 of the sequence listing.
  • polynucleotide molecules can be isolated by using a variety of techniques known to those of ordinary skill in the art.
  • such polynucleotides can be isolated through use of the polymerase chain reaction (PCR) described in Mullis et al, Eds. 1994 The Polymerase Chain Reaction, Birkhauser, incorporated herein by reference.
  • PCR polymerase chain reaction
  • the isolated polynucleotides can be amplified using primers, as defined herein, derived from the polynucleotide sequences . Variant polynucleotides may also be identified using such primers. Typically, the polynucleotide sequence of a primer, useful to amplify variants of polynucleotide molecules by PCR, may be based on a conserved region of amino acid sequence of the polypeptides
  • inverse PCR permits acquisition of unknown sequences, flanking the polynucleotide sequences disclosed herein, starting with primers based on a known region (Triglia et al, 1998, Nucleic Acids Res 16, 8186, incorporated herein by reference).
  • the method uses several restriction enzymes to generate a suitable fragment in the known region of a gene. The fragment is then circularized by intramolecular ligation and used as a PCR template. Divergent primers are designed from the known region.
  • Further methods for identifying variant polynucleotides by methods of the invention include use of all, or portions of, the polynucleotides having the sequence set forth in SEQ ID NO : 2-4 as hybridization probes to screen a plant genomic or cDNA libraries.
  • the technique of hybridizing 5 labelled polynucleotide probes to polynucleotides immobilized on nitrocellulose filters or nylon membranes can be used to screen the genomic or cDNA libraries.
  • Exemplary hybridization and wash conditions are : hybridization for 20 hours at 65°C in 5. 0 X SSC, 0. 5% sodium dodecyl sulfate, 1 X Denhardt's solution ; washing (three washes of twenty minutes each at 55°C) in 1.
  • An optional further wash (for twenty minutes) can be conducted under conditions of 0. I X SSC, 1% (w/v) sodium dodecyl sulfate, at 60°C.
  • variant sequences of the invention can also be identified by computer-based methods, using 15 public domain sequence alignment algorithms and sequence similarity search tools to search sequence databases (public domain databases include Genbank, EMBL, Swiss-Prot, PIR and others). See, e.g., Nucleic Acids Res. 29: 1-10 and 11-16, 2001 for examples of online resources. Similarity searches retrieve and align sequences for comparison with a target sequence to be analyzed (i.e., a query sequence). Sequence comparison algorithms use scoring matrices to 20 assign an overall score to each of the alignments.
  • Polynucleotide and polypeptide sequences may be aligned, and percentage of identical residues in a specified region may be determined against other polynucleotide and polypeptide sequences, using computer algorithms that are publicly available. Two exemplary algorithms for aligning
  • polynucleotide sequences 25 and identifying the similarity of polynucleotide sequences are the BLASTN and FASTA algorithms.
  • Polynucleotides may also be analyzed using the BLASTX algorithm, which compares the six-frame conceptual translation products of a nucleotide query sequence (both strands) against a protein sequence database.
  • the similarity of polypeptide sequences may be examined using the BLASTP algorithm.
  • BLASTP algorithm is preferred for use in the determination of polypeptide variants used in the present invention.
  • the following running parameters are preferred for determination of alignments and similarities using BLASTP that contribute to the E values and percentage identity of polypeptide sequences: blastall -p blastp -d swissprottrembledb -e 10 -G 0 -E 0 -v 30 -b 30 -i queryseq -o results; the parameters are: -p Program Name [String]; -d Database [String]; -e Expectation value (E) [Real]; -G Cost to open a gap (zero invokes default behavior) [Integer]; -E Cost to extend a gap (zero invokes default behavior) [Integer]; -v Number of one-line descriptions (v) [Integer]; -b Number of alignments to show (b) [Integer]; -I Query File [File In]; -o
  • BLAST family of algorithms, including BLASTN, BLASTP, and BLASTX, is described in the publication of Altschul et al, Nucleic Acids Res. 25: 3389-3402, 1997.
  • the BLASTN software is available on the NCBI server (http://ncbi.nlm.nih.gov) under /blast/executables/ and is available from the National Center for Biotechnology Information (NCBI), National Library of Medicine, Building 38A, Room 8N805, Bethesda, MD 20894 USA.
  • NCBI National Center for Biotechnology Information
  • the FASTA software package is available from the University of Virginia (University of Virginia, PO Box 9025, Charlottesville, VA 22906-9025). Version 2.0u4, February 1996, set to the default parameters described in the documentation and distributed with the algorithm, may be used in the determination of variants used in the present invention.
  • the use of the FASTA algorithm is described in Pearson and Lipman, 1988, Proc Natl Acad Sci USA, 85, 2444; and Pearson, 1990, Methods in Enzymol. 183, 63.
  • the "hits" to one or more database sequences by a queried sequence produced by BLASTN, FASTA, BLASTX, BLASTP or a similar algorithm align and identify similar portions of sequences.
  • the hits are arranged in order of the degree of similarity and the length of sequence overlap. Hits to a database sequence generally represent an overlap over only a fraction of the sequence length of the queried sequence.
  • the BLASTN, FASTA and BLASTP algorithms also produce "Expect" values for alignments.
  • the Expect value (E) indicates the number of hits one can "expect” to see over a certain number of contiguous sequences by chance when searching a database of a certain size.
  • the Expect value is used as a significance threshold for determining whether the hit to a database, such as the preferred EMBL database, indicates true similarity. For example, an E value of 0.1 assigned to a polynucleotide hit is interpreted as meaning that in a database of the size of the EMBL database, one might expect to see 0.1 matches over the aligned portion of the sequence with a similar score simply by chance.
  • the aligned and matched portions of the polynucleotide sequences then have a probability of 90% of being the same.
  • the probability of finding a match by chance in the EMBL database is 1% or less using the BLASTN, FASTA, BLASTX or BLASTP algorithm.
  • Pattern recognition software applications are available for finding motifs or signature sequences.
  • MEME Multiple Em for Motif Elicitation
  • MAST Motif Alignment and Search Tool
  • the MAST results are provided as a series of alignments with appropriate statistical data and a visual overview of the motifs found.
  • MEME and MAST were developed at the University of California, San Diego.
  • PROSITE (Bairoch and Bucher, 1994, Nucleic Acids Res. 22, 3583; Hofinann et al, 1999, Nucleic Acids Res. 27, 215) is a method of identifying the functions of uncharacterized proteins translated from genomic or cDNA sequences.
  • the PROSITE database www.expasy.org/prosite
  • Prosearch is a tool that can search SWISS-PROT and EMBL databases with a given sequence pattern or signature.
  • Proteins can be classified according to their sequence relatedness to other proteins in the same genome (paralogues) or a different genome (orthologues).
  • Orthologous genes are genes that evolved by speciation from a common ancestral gene. These genes normally retain the same function as they evolve.
  • Paralogous genes are genes that are duplicated within a genome. These genes may acquire new specificities or modified functions which may be related to the original one. Phylogenetic analysis methods are reviewed in Tatusov et al, 1997, Science 278, 631- 637,).
  • a partial cDNA sequence may be used, in methods well-known in the art to identify the corresponding full length polynucleotide sequence containing an open reading frame (ORF) necessary for over-expression of the encoded polypeptide in transgenic organisms or cell/s of a such organisms.
  • Such organisms would include bacteria, yeast, fungi, insects, mammals and plants.
  • Such methods would include PCR-based methods, hybridization- based method, computer/database -based methods.
  • standard molecular biology approaches can be utilized (Sambrook et al, Molecular Cloning: A Laboratory Manual, 2nd Ed. Cold Spring Harbor Press, 1987).
  • the polypeptide is involved in condensed tannin production in plants.
  • the isolated polypeptide may be selected from a group of related polypeptides comprising sequences derived from Gossypium hirsutum (cotton), Arabidopsis thaliana and Lolium perenne (SEQ ID NO's: 10, 11 and 12 respectively) or a variant thereof, wherein the polypeptide is involved in condensed tannin production in plant.
  • Such variants preferably exhibit at least 50%, more preferably at least 70%, more preferably at least 80%, more preferably at least 90%, more preferably at least 95%, more preferably at least 98%, and most preferably at least 99% identity to any one of the sequences set forth in SEQ ID NOs: 10, 11 and
  • polypeptide orthologues of SEQ ID NO's 10, 11 and 12, and variants of such orthologues.
  • Exemplary orthologues are disclosed herein and identified as SEQ ID NOs: 13-17 of the sequence listing.
  • the polypeptides may be prepared using peptide synthesis methods well known in the art such as direct peptide synthesis using solid phase techniques (e.g. Stewart et al, 1969, in Solid-Phase Peptide Synthesis, WH Freeman Co, San Francisco California , or automated synthesis, for example using an Applied Biosystems 431 A Peptide Synthesizer (Foster City, California). Mutated forms of the polypeptides may also be produced during such syntheses.
  • polypeptides may also be purified from natural sources using a variety of techniques that are well known in the art (e.g. Deutscher, 1990, Ed, Methods in Enzymology, Vol. 182, Guide to Protein Purification ⁇ ).
  • polypeptides may be expressed recombinantly in suitable host cells and separated from the cells as discussed below.
  • the polypeptide variants may be identified by screening expression libraries using antibodies raised against previously identified polypeptides . Such antibodies may also be used in the isolation of polypeptide variants .
  • the DNA sequence of polynucleotides identified through expression screening, as well as with the DNA sequences of the polynucleotides already identified may be used in computer based approaches as described above to identify polynucleotide sequences encoding polypeptide variants. Such polynucleotide sequences may be utilized for the recombinant production of polypeptide variants of the invention. Such procedures are standard practice for art skilled workers (Sambrook et al, Molecular Cloning: A Laboratory Manual, 2nd Ed. Cold Spring Harbor Press, 1987).
  • the genetic constructs used in the present invention comprise one or more polynucleotide sequences of the invention and/or polynucleotides encoding polypeptides of the invention, and may be useful for transforming, for example, bacterial, fungal, insect, mammalian or plant organisms.
  • the genetic constructs are intended to include expression constructs as herein defined.
  • a host cell which comprises a genetic construct or vector comprising a polynucleotide as described above may be used in the invention.
  • Host cells may be derived from, for example, bacterial, fungal, insect, mammalian or plant organisms.
  • Host cells comprising genetic constructs, such as expression constructs, of the invention are useful in methods well known in the art (e.g. Sambrook et al, Molecular Cloning : A Laboratory Manual, 2nd Ed. Cold Spring Harbor Press, 1987 ; Ausubel et al, Current Protocols in Molecular Biology, Greene Publishing, 1987) for recombinant production of polypeptides described herein.
  • Such methods may involve the culture of host cells in an appropriate medium in conditions suitable for or conducive to expression of a polypeptide of the invention.
  • the expressed recombinant polypeptide which may optionally be secreted into the culture, may then be separated from the medium, host cells or culture medium by methods well known in the art (e.g. Deutscher, Ed, 1990, Methods in Enzymology, Vol 182, Guide to Protein Purification).
  • the host cells may also be useful in methods for production of an enzymic product generated by an expressed polypeptide of the invention. Such methods may involve culturing the host cells of the invention in a medium suitable for expression of a recombinant polypeptide of the invention, optionally in the presence of additional enzymic substrate for the expressed polypeptide of the invention. The enzymic product produced may then be separated from the host cells or medium by a variety of art standard methods.
  • Plant cells may be transformed to comprise a genetic construct of the invention, and plant cells modified to alter expression of a polynucleotide or polypeptide of the invention.
  • Condensed tannin production in a plant may also be altered through methods of the invention.
  • Such methods may involve the transformation of plant cells and plants, with a construct of the invention designed to alter expression of a polynucleotide or polypeptide involved in condensed tanin production in such plant cells and plants.
  • Such methods also include the transformation of plant cells and plants with a combination of the construct of the invention and one or more other constructs designed to alter expression of one or more polynucleotides or polypeptides known to be involved in condensed tanin production in such plant cells and plants.
  • a number of plant transformation strategies are available (e.g. Birch, 1997, Ann Rev Plant Phys Plant Mol Biol, 48, 297).
  • strategies may be designed to increase expression of a polynucleotide/polypeptide in a plant cell tissue or organ in which it is normally expressed or to ectopically express a polynucleotide/polypeptide is a cell tissue or organ which it is not normally expressed.
  • the expressed polynucleotide/polypeptide may be dervied from the plant species to be transformed or may be derived from a different plant species.
  • strategies may be designed to reduce expression of a polynucleotide/polypeptide in a plant cell, tissue or organ in which it is normally expressed. Such strategies are known as gene silencing strategies.
  • Temporal regulation of expression of a polynucleotide/polypeptide may also be achieved in the above strategies, through use of a suitable promoter.
  • Genetic constructs for expression of genes in transgenic plants typically include promoters for driving the expression of one or more cloned polynucleotide, terminators and selectable marker sequences.
  • the promoters suitable for use in the constructs of this invention are functional in a cell, tissue or organ of a monocot or dicot plant and include cell-, tissue- and organ-specific promoters, cell cycle specific promoters, temporal promoters, inducible promoters, constitutive promoters that are active in most plant tissues, and recombinant promoters. Choice of promoter will depend upon the temporal and spatial expression of the cloned polynucleotide, so desired.
  • the promoters may be those normally associated with a transgene of interest, or heterologous promoters which are derived from genes of other plants, viruses, and plant pathogenic bacteria and fungi.
  • promoters that are suitable for use in modifying and modulating plant traits using genetic constructs comprising the polynucleotide sequences of the invention.
  • constitutive plant promoters include the CaMV 35S promoter, the nopaline synthase promoter and the octopine synthase promoter, and the Ubi 1 promoter from maize.
  • Specific plant promoters which are active in specific tissues, respond to internal developmental signals or external abiotic or biotic stresses are described in the scientific literature. Exemplary promoters are described, e.g., in WO 02/00894, which is herein incorporated by reference.
  • Exemplary terminators that are commonly used in plant transformation genetic construct include, e.g., the cauliflower mosaic virus (CaMV) 35S terminator, the Agrobacterium tumefaciens nopaline synthase or octopine synthase terminators, the Zea mays zin gene terminator, the Oryza sativa ADP-glucose pyrophosphorylase terminator and the Solanum tuberosum PI- II terminator.
  • CaMV cauliflower mosaic virus
  • Agrobacterium tumefaciens nopaline synthase or octopine synthase terminators the Zea mays zin gene terminator
  • the Oryza sativa ADP-glucose pyrophosphorylase terminator the Solanum tuberosum PI- II terminator.
  • NPT II neomycin phophotransfenrse II gene
  • aadA gene which confers spectinomycin and streptomycin resistance
  • phosphinothricin acetyl transferase bar gene
  • Ignite AgrEvo
  • Basta Basta
  • hygromycin phosphotransferase gene hpt
  • reporter genes coding sequences which express an activity that is foreign to the host, usually an enzymatic activity and/or a visible signal (e.g., luciferase, GUS, GFP) which may be used for promoter expression analysis in plants and plant tissues are also contemplated.
  • a visible signal e.g., luciferase, GUS, GFP
  • the reporter gene literature is reviewed in Herrera-Estrella et al, 1993, Nature 303, 209, and Schrott, 1995, In: Gene Transfer to Plants (Potrykus, T., Spangenbert. Eds) Springer Verlag. Berline, pp. 325-336.
  • Gene silencing strategies may be focused on the gene itself or regulatory elements which effect expression of the encoded polypeptide. "Regulatory elements is used here in the widest possible sense and includes other genes which interact with the gene of interest.
  • Genetic constructs designed to decrease or silence the expression of a polynucleotide/polypeptide using the invention may include an antisense copy of the polynucleotide to be silenced.
  • the polynucleotide is in an antisense orientation with respect to the promoter and terminator.
  • an “antisense” polynucleotide is obtained by inverting a polynucleotide or a segment of the polynucleotide so that the transcript produced will be complementary to the mRNA transcript of the gene, e.g., 5OATCTA 3' (coding strand) 3'CTAGAT 5' (antisense strand)
  • Genetic constructs designed for gene silencing may also include an inverted repeat.
  • An 'inverted repeat' is a sequence that is repeated where the second half of the repeat is in the complementary strand, e.g.,
  • the transcript formed may undergo complementary base pairing to form a hairpin structure provided there is a spacer of at least 3-5 bp between the repeated regions.
  • Another silencing approach involves the use of a small antisense RNA targeted to the transcript equivalent to an miRNA (Llave et al, 2002, Science 297, 2053). Use of such small antisense RNA corresponding to polynucleotides described herein is expressly contemplated.
  • the term genetic construct as used herein also includes such small antisense RNAs.
  • Transformation with an expression construct, as herein defined, may also result in gene silencing through a process known as sense suppression (e.g. Napoli et al, 1990, Plant Cell 2, 279; de Carvalho Niebel et al, 1995, Plant Cell, 7, 347).
  • sense suppression may involve over-expression of the whole or a partial coding sequence but may also involve expression of non-coding region of the gene, such as an intron or a 5' or 3' untranslated region (UTR).
  • Chimeric partial sense constructs can be used to coordinately silence multiple genes (Abbott et al, 2002, Plant Physiol. 128(3): 844-53; Jones et al, 1998, Planta 204: 499-505).
  • the use of such sense suppression strategies to silence the expression of a polynucleotide described above is also contemplated.
  • Pre-transcriptional silencing may be brought about through mutation of the gene itself or its regulatory elements.
  • Such mutations may include point mutations, frameshifts, insertions, deletions and substitutions.
  • transgenic cells and plants of the invention may be from any species.
  • the transgenic cells and plants of the invention are derived from forage plant species from a group comprising but not limited to the following genera: Lolium, Festuca, Dactylis, Bromus, Trifolium, Medicago, Pheleum, Phalaris, Holcus, Lotus, Plantain and Cichorium.
  • the transgenic cells and plants are from the genera Lolium and Trifolium. Particularly preferred are the species Lolium perenne and Trifolium repens.
  • Methods are also provided for selecting plants with altered levels of condensed tannin production.
  • a polynucleotide such as a messenger RNA
  • exemplary methods for measuring the expression of a polynucleotide include but are not limited to Northern analysis, RT-PCR and dot-blot analysis (Sambrook et al, Molecular Cloning : A Laboratory Manual, 2nd Ed. Cold Spring Harbor Press, 1987).
  • Polynucleotides or portions of the polynucleotides of the invention are thus useful as probes or primers, as herein defined, in methods for the identification of plants with altered levels of polypeptides involved in condensed tannin production.
  • An altered level in a plant, of a polypeptide involved in condensed tannin biosynthesis may be used as an indicator of altered condensed tannin biosynthesis in such a plant.
  • the polynucleotides described above may be used as probes in hybridization experiments, or as primers in PCR based experiments, designed to identify such plants.
  • antibodies may be raised against the polypeptides described above.
  • Methods for raising and using antibodies are standard in the art (see for example: Antibodies, A Laboratory Manual, Harlow A Lane, Eds, Cold Spring Harbour Laboratory, 1998).
  • Such antibodies may be used in methods to detect altered expression of polypeptides involved in condensed tannin production in plants. Such methods may include ELISA (Kemeny, 1991, A Practical Guid to ELISA, NY Pergamon Press) and Western analysis (Towbin & Gordon, 1994, J Immunol Methods, 72, 313).
  • the invention further provides plants selected by a method of the invention.
  • the selected plants may be from any plant species.
  • the selected plants of the invention are from forage plant species from a group comprising but not limited to the following genera: Lolium, Festuca, Dactylis, Bromus, Trifolium, Medicago, Pheleum, Phalaris, Holcus, Lotus, Plantain and Cichorium.
  • the selected plants are from the genera Lolium and Trifolium. Particularly preferred are the species Lolium perenne and Trifolium repens.
  • the plants of the invention with altered condensed tannin production may be transgenic plants or those selected based on analysis of polynucleotide/polypeptide expression, as described above.
  • the plants of the invention may be grown and either selfed or crossed with a different plant strain and the resulting hybrids, with the desired phenotypic characteristics, may be identified. Two or more generations may be grown to ensure that the subject phenotypic characteristics are stably maintained and inherited. Plants resulting from such standard breeding approaches also form an aspect of the present invention.
  • Example 1 Exemplary methods for measuring condensed tannin content in transgenic plants of the invention or plants selected through methods of the invention are provided in Example 1 below.
  • Figure 1 shows the known steps in the biosynthetic pathway for flavonoids. The figure is from Tanner et al, Proanthocyanidin biosynthesis in plants: Purification of legume leucoanthocyanidin reductase and molecular cloning of its cDNA J. Biol. Chem., In press (June 2003).
  • Figure 2 shows the hydrophobicity profile (Persson, B. & Argos, P. (1994) Prediction of transmembrane segments in proteins utilsing multiple sequence alignments J. Mol. Biol. 237, 182-192.) for the polypeptide represented in SEQ ID NO: 11.
  • Figure 3 shows a multiple sequence alignment of polypeptides, from several species, orthologous to those presented in SEQ ID NOs 2, 3 and 4 and illustrates a region (underlined) containing a consensus sequence, identified by the applicants, which is present in all of the sequences.
  • Figure 4 shows LC-MS traces for 70% acetone fractions from mature silique from transgenic plants (A) and control plants (B).
  • the transgenic plants were transformed with an RNAi; construct containing a sequence corresponding to SEQ ID NO:3.
  • the peak labeled [K-3r, 7r]-l represents kaempferol dirhamnoside.
  • the numbers -1 and -2 indicate the change state in the mass spectrometer.
  • Figure 5 shows Proanthocyanidin levels normalized against dimer for transgenic and pHex control plants.
  • the transgenic plants were transformed with an RNAi construct containing a sequence corresponding to SEQ ID NO:3. ⁇
  • Example 1 Identification and isolation of polynucleotides involved in condensed tannin biosynthesis.
  • High-CT (SAl) and low-CT (DPL90) lines of cotton were grown in the field at USDA-ARS SRRC, New La. Fiber was harvested from both lines at 14 days post-anthesis (DP A), removed from the seeds and frozen in liquid nitrogen immediately.
  • Fiber samples were extracted with aqueous acetone (70% v/v) and the extracts separated into two fractions by LH-20 sephadex chromatography and elution with 50% methanol and 70% acetone.
  • the fractions were examined by HPLC on a C18 column with an acetonitrile/formic acid gradient, with PDA-UV and mass spectrometric detection.
  • Monomeric CT precursors were eluted in the 50% methanol fraction while CT oligomers and higher polymers were eluted in the 70% acetone fraction.
  • cDNA was prepared from the SAl and DPL90 fiber mRNA samples and subtraction performed in both directions using the suppression subtractive hybridization (SSH) technique (Gurskaya et al, 1996, Anal Biochem, 240, 90-97; Diachenko et al, 1996, Proc Nat Acad Sci USA, 93, 6025).
  • SSH suppression subtractive hybridization
  • MOS mirror orientation selection
  • Fiber samples were harvested from SAl and DPL90 lines of Gossypium hirsutum at 14 DP A. Chemical analysis of fiber samples confirmed a significant difference in proanthocyanidin levels between SAl and DPL90, and also indicated differences in chemical composition of the predominant proanthocyanodin oligomers (see supplemental report).
  • RNA was isolated from the SAl and DPL90 fiber samples with ConcertTM Plant RNA reagent (Invitrogen) using an extraction protocol modified for cotton tissue, and subtracted cDNA libraries were constructed using standard SSH and MOS techniques, as described in methods. Both a forward-subtracted library (enriched for genes upregulated in SAl) and a reverse-subtracted library (enriched for genes upregulated in SAl DPL90) were constructed. Both libraries were screened for differentially expressed genes by hybridization to labeled cDNA samples from SAl and DPL90 14 DPA fibers.
  • CT biosynthetic genes are abundant in the differentially expressed cloned from high-CT ( SAl) cotton fiber.
  • sequence of this polynucleotide from cotton is presented in SEQ ID NO: 2 and is predicted to encode the polypeptide sequence presented in SEQ ID NO: 10.
  • Orthologous sequences from Arabidopsis and ryegrass are presented in SEQ ID NOs 3 and 4 respectively and are predicted to encode polypeptides presented in SEQ ID NOs 11 and 12 respectively.
  • Table 1 shows "in-silico” expression data for sequences orthologous to that presented in SEQ ID NOs 2, 3 and 4.
  • Table 1 Identification of expressed genes orthologous to polypeptides involved in condensed tannin biosynthesis
  • NCBI EST sequence database (2003-May-09), comprising unannotated polynucleotide sequence data for genes expressed in specified plant species and tissues, was searched for sequences orthologous to those presented in SEQ ID NOs 2, 3 and 4.
  • Orthologue accession/ID numbers are listed, together with e values indicating the confidence level for each sequence match.
  • Proposed orthologues in Arabidopsis thaliana identified by searching the GenBank, EMBL, DDBJ and PDB annotated polynucleotide sequence databases are also listed, with e values.
  • sequence F286 from Gossypium hirsutum (US Published Patent Application 2004/0049808).
  • the sequence encodes a polypeptide having a deduced amino acid sequence including the partial deduced sequence of SEQ ID NO: 10.
  • Plant material Vegetative tissues were harvested from Arabidopsis thaliana, ecotype Columbia.
  • PCR primers were designed to amplify products from a single exon in genomic sequence corresponding to the sequence in SEQ ID NO: 3. Prior to amplification, the anticipated PCR product sequences were used to perform BLASTN searches (Basic local alignment search tool (1990). Altschul SF; Gish W; Miller W; Myers EW; Lipman DJ J Mol Biol 215: 403-10 ) of Arabidopsis cDNA databases to ensure that they were specific to the sequence in SEQ ID NO: 3, and did not contain stretches of more than 25 nucleotides of perfect homology to any other Arabidopsis sequence.
  • RNAi constructs were amplified from Arabidopsis thaliana genomic DNA using High Fidelity Expand Taq DNA polymerase (Roche Molecular Biochemicals) according to the manufacturers instructions.
  • the primers used to amplify an insert corresponding to sequence in SEQ ID NO: 3 were as follows:
  • the attB-flanked PCR products amplified above were recombined into attP sites of the GATEWAY donor vector pDONR201 (Invitrogen) to generate a pENTRY clone with the target sequence flanked by attL sites.
  • the pEntry clone containing the attL-flanked target sequence was recombined with the attR sites of pTKO2 (HortResearch NZ Ltd) to generate the final destination vector with an inverted repeat of target sequence either side of an intron and under control of CaMV 35S promoter.
  • the destination vector is capable of replication in both Agrobacterium tumefaciens and E.coli, and is maintained by Spectinomycin (lOO ⁇ g/ml) selection.
  • the right and left T-DNA borders delineate the region transferred to the plant genome during Agrobacterium-mediated transformation and include the CaMV35S driven inverted repeat of the target gene and the pNos- NptII-Nos3' region for kanamycin selection of transgenic plants.
  • Dry seed was harvested and plated to select for Ti transformants using antibiotic or herbicide selectable marker on 0.5X MS/0.8% tissue culture Agar plates with 50 ug/ml Kanamycin.
  • Putative transformants were transferred to soil and grown to produce T 2 seed.
  • T 3 generation and control plants transformed with the RNAi vector only
  • Mature siliques were collected from the primary inflorescences of approximately 300 plants transformed with the RNAi construct and approximately 300 control plants, at the onset of yellowing of the most mature silique. All green siliques below the 4 th silique from the top of each primary inflorescence were collected, frozen and pooled.
  • Flavonoids were extracted from the silique pools by standard protocols, purified by liquid chromatography using an LH20 column, and characterized by HPLC/PDA/ESI-MS.
  • the ESI- MS chromatograms are shown in figure 4. Peaks for epicatechin (EC) monomer, dimer and higher oligomers are indicated and in figure 4A. The designations -1 and -2 indicate the ion charge state.
  • the peak labelled [K-3r,7r]-l represents kaempferol dirhamnoside.
  • the numbers in the upper left of the chromatograms are the m/z ratios of the single-ion channels.
  • the peak area for epicatechin monomer, dimer and higher oligomers is listed in table 2.
  • the steady-state concentration of each flavonoid species may be considered proportional to the peak areas.
  • quantification of pentamer and hexamer levels is impaired by interference from co- eluting components and the pentamer and hexamer peak areas are therefore not considered reliable.
  • Total proanthocyanidin levels were approximately 2-fold lower in plants transformed with the RNAi construct compared to the control (table 2), indicating a partial disruption of CT accumulation.
  • each of the peak areas was normalized against the value for dimer. Normalised peak area values are listed in table 3 and plotted in figure 5.
  • the monomer/dimer value is more than 2-fold higher in the RNAi plants compared to controls, whereas trimer/dimer and tetramer/dimer values show no change.
  • the relative abundance of monomer in plants transformed with the RNAi construct indicates a slower rate of polymerization in the RNAi plants, and implies that the target gene is involved in CT polymerization. The low rate of polymerization in the RNAi plants may be due to incomplete gene silencing, or functional redundancy.
  • SEQ ID NO:l polypeptide, consensus
  • SEQ ID NO:2 polynucleotide, Gossipium hirsuitum, EGHSA1S005D10/5814R
  • SEQ ID NO:3 polynucleotide, Arabidopsis thaliana, AF412071/ AT3gl6920
  • SEQ ID NO:4 polynucleotide, Lolium per enne, CLP0001001520-cE2
  • SEQ ID NO:5 polynucleotide, Arabidopsis thaliana, AY034935/ Atlg05850
  • SEQ ID NO:6 polynucleotide, Lolium per enne, FLPB002449F07-bOFSP
  • SEQ ID NO:7 polynucleotide, Musa acuminata, Z99959
  • SEQ ID NO:8 polynucleotide, Zea mays, AY105990
  • SEQ ID NO:9 polynucleotide, Oryza sativa, AP003865
  • SEQ ID NO:10 polypeptide, Gossipium hirsuitum, EGHSA1S005D10/5814R
  • SEQ ID NO:l 1 polypeptide, Arabidopsis thaliana, AT3gl6920
  • SEQ ID NO:12 polypeptide, Lolium per enne, CLP0001001520-cE2
  • SEQ ID NO:13 polypeptide, Arabidopsis thaliana, Atlg05850
  • SEQ ID NO: 14 polypeptide, Lolium perenne, FLPB002449F07- bOFSP_20011203_fgenesh_l_pep
  • SEQ ID NO: 15 polypeptide, Musa acuminata, Z99959
  • SEQ ID NO: 16 polypeptide, Zea mays, AY 105990
  • SEQ ID NO:17 polypeptide, Oryza sativa, BAC55635.1

Landscapes

  • Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Biotechnology (AREA)
  • Wood Science & Technology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Organic Chemistry (AREA)
  • Chemical & Material Sciences (AREA)
  • Biomedical Technology (AREA)
  • Zoology (AREA)
  • General Engineering & Computer Science (AREA)
  • Molecular Biology (AREA)
  • Microbiology (AREA)
  • Plant Pathology (AREA)
  • Nutrition Science (AREA)
  • Biophysics (AREA)
  • Physics & Mathematics (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Cell Biology (AREA)
  • Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
  • Peptides Or Proteins (AREA)

Abstract

The invention provides a method of altering condensed tannin production in a plant involving transforming a plant or a plant cell with a genetic construct to alter the expression of a polypeptide involved in condensed tannin production in the plant. The invention also provides transformed plants and plant cells and methods for selecting plants with altered condensed tannin content based on determining presence/content of the polypeptide or nucleotide sequences encoding it. The polypeptide has sequence homology with a chitinase or chitinase-like protein.

Description

POLYNUCLEOTIDES, POLYPEPTIDES AND THEIR USE IN THE PRODUCTION OF PLANTS WITH ALTERED CONDENSED TANNINS
TECHNICAL FIELD
The present invention relates to compositions and methods for producing and selecting plants with altered condensed tannin production.
BACKGROUND ART
Proanthocyanidins or condensed tanmns (CTs) are flavonoid oligomers which accumulate in vegetative, fruit and seed tissues of many plants (Perter 1998, In Harborne, JB (Ed) The Flavonoids, Advances in Research Science since 1980, Chapman and Hall, New York, p21). CTs possess many hydrophobic aromatic rings and hydroxyl groups that are able to interact with biological molecules, especially proteins, via hydrophobic interactions and hydrogen bonds. Because of their polymeric nature, the interaction of CTs with protein is considerably stronger than that of protein with other monomeric flavonoids. This is presumably because polymeric CTs are able to interact with large proteins at multiple sites, thus increasing the strength of interaction and minimizing dissociation of formed complexes (Fersht 1985, Chemical catalysis, In Enzyme Structure and Mechanism, WH Freeman and Company, New York, ρ47). The nature of interaction of CT with proteins forms the basis of some of the beneficial properties of CTs.
Cattle and sheep suffer from a potentially lethal condition called "bloat" when proteins are digested too rapidly in the rumen. This leads to the production of a highly stable foam which results in bloat symptoms. When present in forage crops, CTs, bind to forage proteins, prevent their rapid digestion and thereby reduce the potential for bloating and the need for bloat- preventing drenches. In addition, an increase in CT-bound protein by-passing the rumen, results in improved animal nutrition due to increased availability of nitrogen in the latter stages of the digestive process (McMahon et al, 2000, Can J Plant Sci 80,469; Barry and McNabb, 1999, Rr. J. Nutr. 81,263). CTs are absent or at sub-optimal levels in the leaves of certain forage crops such as ryegrass and alfalfa. It would thus be advantageous to increase the content of CTs in appropriate tissues of such crops. In some cases however high levels of CT can reduce the palatability of forage crops (Kumar and Singh, 1984, JAgric Food Chem. 32, 447) and reduction of CT content in such forage crops would be beneficial. CTs have also received attention due to their powerful antioxidant properties and associated beneficial effects on human health and immunity (Ross and Kasum, 2002, Annu Rev Nutr 22,19; Bagchi et al, 2000, Toxicology 148, 87; Lin et al, 2002, JNαt Prod 65, 505). Of particular interest are the CTs from fruit such as cranberry and grape (Foo et al, 2000, Phytochemistry 54, 173 ; Pataki et al, 2002, Am J Clin Nutr 75, 894)
Biosynthesis of CTs occurs in the flavonoid biosynthetic pathway (Figure 1). The flavonoids are secondary metabolites derived from products of the phenylpropanoid biosynthetic pathway and the Krebs cycle. The final flavonoid compounds include flavonols (colorless pigments), anthocyanins (pink and red pigments), and condensed tannins (colorless pigments that turn brown on oxidation). The sub-pathways for each of these flavonoid compounds share common initial biosynthesis steps.
Only two biochemical steps specific to CT biosynthesis have been demonstrated in vitro. The best characterized step is the reduction of (+)3, 4-cis-leucocyanidin to catechin, catalysed by leucoanthocyanidin reductase (LAR). The LAR enzyme was purified from tissues of the CT-rich legume Desmodium uncinatum and the corresponding cDΝA was cloned (Tanner et al, 2003 JBC, in press). Activity of the recombinant LAR enzyme in Escherichia coli and plants confirmed the identity of the protein. The second enzyme which appears specific to condensed tannin biosynthesis is anthocyanidin reductase, encoded by the BANYULS (BAN) gene of Arabidopsis thaliana and Medicago truncatula (Xie et al, 2003, Science, 299, 396). BANYULS cDNAs from Arabidopsis and Medicago were expressed in Escherichia coli, and the recombinant proteins were shown to convert anthocyanidins to their corresponding 2,3 -cis- flavan-3-ols. This activity has yet to be been confirmed in plant extracts.
Genetic and molecular studies indicate the regulation of CT biosynthesis in the model plant Arabidopsis thaliana is complex. In Arabidopsis CTs specifically accumulate in the endothelium during early seed development. Several genes involved in the regulation of CT biosynthesis in Arabidopsis seed have been characterized, but unidentified tissue-specific factors also appear necessary. (Nesi et al, 2000, Plant Cell 12, 1863; Nesi et al, 2001, Plant Cell 13, 2099; Nesi et al, 2002, Plant Cell 14, 2463). The maize SN gene has also been implicated in the regulation of CT biosynthesis (Robbins et al, 2003, J Exp Bot. 54, 239). SN appears to be orthologous to tt8, one of the regulatory genes characterized in Arabidopsis. Like tt8, the SN gene appears to up-regulate CTs only in competent tissues. Plants with altered CT content may also have altered nutritional value, increased pest and disease resistance or improved nutritional qualities (WO 98/07836). Plants with altered CT content may also lead to improved silage storage through reduced bacterial and fungal activity, reduced methane production in cattle grazed up such plants, reduced parasite load in the intestines of animals grazed on such plants, improved resistance to insects and birds, improved quality of oilseeds and grains and altered flavour, colour and taste characteristics in fruit, fruit juice and wine (WO 02/10412).
As the content of CTs in dietary plants can impart both positive and negative effects, an important objective for agriculture is to be able to manipulate the level of CTs in plants. There is thus a need for compositions and methods for developing both forage and human crop varieties with altered CT content.
WO 02/10412 is directed towards novel regulatory and biosynthetic genes and encoded proteins which can be used in methods to alter tannin production in plants for a variety of agricultural benefits. No examples describing such plants are provided.
Over-expression of BAN in tobacco petals and Arabidopsis leaves leads to loss of anthocyanin and accumulation of CTs (Xie et a.,l 2003, Science, 299, 396).
WO 98/07836 is directed towards sequences encoding leucoanthocyanidin reductase (LAR) and to use of such sequences to regulate condensed tannin biosynthesis in plants. The sequences are stated to be useful in the production of bloat-sage forage crops and other crops with altered nutritional value, increased disease resistance and pest resistance or improved malting qualities. Prophetic examples of such crops are provided.
It is an object of the invention to provide plants with altered production of CTs or at least to provide the public with a useful choice.
DISCLOSURE OF THE INVENTION
In one aspect the invention provides a method of altering condensed tannin production in a plant comprising transformation of a plant cell or plant with a genetic construct comprising a polynucleotide that alters the expression of a polypeptide comprising S Xϋ X2 C C N X3 T I D X5 F X6 X7 X8 X9 F E X10 L F X„ X12 R N (SEQ ID NO:l) involved in condensed tannin production in the plant wherein X^K, E, I or R, X2=Y or F, X3=H, Q, D or G, XpS or T, X5=Y or F, X6=Q, E or K, X7=T, V or A, X8=Y or H, X9=Q, K or H, X10= D, N, Q or E, Xπ=A, S or P and X12=K, R or H and wherein the polynucleotide encodes the polypeptide or a fragment thereof or polynucleotide is complementary to said polynucleotide encoding the polypeptide or fragment.
In another aspect the invention provides a method of altering condensed tannin production in a plant comprising transformation of a plant cell or plant with a genetic construct comprising a sequence selected from SEQ ID NO:2, SEQ ID NO:3 and SEQ ID NO:4 or a functionally equivalent variant thereof, wherein said genetic construct encodes a polypeptide involved in condensed tannin production in plants.
In a further aspect the invention provides a transgenic plant having altered condensed tannin content comprising a polynucleotide that alters the expression of a polypeptide comprising S Xι X2 C C N X3 T I X4 D X5 F X6 X7 X8 X9 F E X10 L F Xπ X12 R N (SEQ ID NO:l) involved in condensed tannin production in the plant wherein Xi=K, E, I or R, X2=Y or F, X3=H, Q, D or G, X4=S or T, X5=Y or F, X6=Q, E or K, X7=T, V or A, X8-Y or H, X9=Q, K or H, X10= D, N, Q or E, Xπ=A, S or P and Xι2=K, R or H and wherein the polynucleotide encodes the polypeptide or a fragment thereof or the polynucleotide is complementary to said polynucleotide encoding the polypeptide or fragment.
In a further aspect the invention provides a transgenic plant having altered condensed tannin content comprising a genetic construct comprising a sequence selected from SEQ ID NO:2, SEQ ID NO:3 and SEQ ID NO:4 or a functionally equivalent variant thereof, wherein said genetic construct encodes a polypeptide involved in condensed tannin production in plants.
In preferred embodiments the polynucleotide encodes a polypeptide comprising the amino acid sequence: S Xi X2 C C N X3 T I X4 D X5 F X6 X7 X8 X9 F E X10 L F Xu X12 R N (SEQ ID NO:l) wherein Xi to X12 are as herein defined, which encodes a polypeptide involved in condensed tannin production in plants.
In preferred embodiments the polynucleotide comprises a sequence selected from a group of related sequences (SEQ ID NOs: 2, 3 and 4) or a variant thereof, wherein the polynucleotide encodes a polypeptide involved in condensed tannin production in plants. In one embodiment the invention the polynucleotide comprises SEQ ID NO:2 or a variant thereof, and the polynucleotide encodes a polypeptide involved in condensed tannin production in plants.
In a further embodiment the invention the polynucleotide comprises SEQ ID NO: 3 or a variant thereof, and the polynucleotide encodes a polypeptide involved in condensed tannin production in plants. A preferred variant is the polypeptide encoded by F286 (see US Published Patent Application 2004/0049808, paragraph 67).
In a further embodiment the invention the polynucleotide comprises SEQ ID NO:4 or a variant thereof, and the polynucleotide encodes a polypeptide involved in condensed tannin production in plants.
In further embodiments, the polynucleotide is selected from naturally occurring polynucleotide orthologue of SEQ ID NOs: 2, 3 and 4, and variants of such orthologues, wherein the polynucleotide orthologues and variants encode polypeptides involved in condensed tannin production in plants. Exemplary orthologues are disclosed herein and identified as SEQ ID NOs: 5-9 of the sequence listing.
Fragments of SEQ ID NOs: 2, 3 and 4 are useful as probes, primers, gene silencing molecules, and in polynucleotide-based selection methods of the invention. Polynucleotides comprising fragments of orthologues and/or variants are useful in the invention.
Isolated polynucleotides are useful in methods for altering condensed tannin production in plants and in traditional breeding programs for selection of individuals with altered CT content.
In a further aspect the invention provides a plant cell which comprises one or more of the genetic constructs comprising a polynucleotide as described above.
In a further aspect the invention provides methods for selecting plants with altered condensed tannin content comprising the steps of: (a) contacting a sample from a plant with a polynucleotide selected from SEQ ID NO:2,
SEQ ID NO:3, SEQ ID NO:4 and orthologues thereof, or a fragment of one of these, as a probe or primer (b) determining presence/content of complementary sequences to assess altered expression of a polypeptide involved in condensed tannin production in a plant; and (c) selection of a plant showing such altered expression.
In a further aspect the invention provides methods for selecting plants with altered condensed tannin content comprising the steps of: (a) contacting a plant sample or plant antibody raised against a polypeptide coded by a nucleotide sequence comprising SEQ ID NO:2, SEQ ID NO:3 and SEQ ID NO:4 or a naturally occurring orthologue thereof to assess altered expression of a polypeptide involved in condensed tannin production in a plant; and (b) selection of a plant shown such altered expression.
In a further aspect the invention provides plants selected by a method of the invention.
The term "plant selected by a method of the invention" is intended to include any whole plant, part, portion propagules or progeny of a plant selected by a method of the invention.
Definitions
The term "transgenic plant" is intended to include a whole plant or any part of a plant, propagules and progeny of a plant.
The term 'propagule' means any part of a plant that may be used in reproduction or propagation, either sexual or asexual, including seeds and cuttings.
The term "polynucleotide(s)," as used herein, means a single or double-stranded deoxyribonucleotide or ribonucleotide polymer of any length, and include as non-limiting examples, coding and non-coding sequences of a gene, sense and antisense sequences, exons, introns, genomic DNA, cDNA, pre-mRNA, mRNA, rRNA, siRNA, miRNA, tRNA, ribozymes, recombinant polynucleotides, isolated and purified naturally occurring DNA or RNA sequences, synthetic RNA and DNA sequences, nucleic acid probes, primers, fragments, genetic constructs, vectors and modified polynucleotides. A "fragment" of a polynucleotide sequence provided herein is a subsequence of contiguous nucleotides that is capable of specific hybridization to a target of interest, e.g., a sequence that is at least 15 nucleotides in length. A fragment of a polynucleotide sequence can be used in antisense, gene silencing, triple helix or ribozyme technology, or as a primer, a probe, included in a microarray, or used in polynucleotide-based selection methods of the invention.
The term "primer" refers to a short polynucleotide, usually having a free 3 'OH group, that is hybridized to a template and used for priming polymerization of a polynucleotide complementary to the target.
The term "probe" refers to a short polynucleotide that is used to detect a polynucleotide sequence that is complementary to the probe in a hybridization-based assay.
The term "polypeptide", as used herein, encompasses amino acid chains of any length, including full-length proteins, in which amino acid residues are linked by covalent peptide bonds. Polypeptides may be purified natural products, or may be produced partially or wholly using recombinant or synthetic techniques.
A "fragment" of a polypeptide is a subsequence of the polypeptide that performs a function that is required for the biological activity and/or provides three dimensional structure of the polypeptide.
The term "isolated" as applied to the polynucleotide or polypeptide sequences disclosed herein is used to refer to sequences that are removed from their natural cellular environment. An isolated molecule may be obtained by any method or combination of methods including biochemical, recombinant, and synthetic techniques.
The term "recombinant" refers to a polynucleotide sequence that is removed from sequences that surround it in its natural context and/or is recombined with sequences that are not present in its natural context.
A "recombinant" polypeptide sequence is produced by translation from a "recombinant" polynucleotide sequence. As used herein, the term "variant" refers to polynucleotide or polypeptide sequences different from the specifically identified sequences, wherein one or more nucleotides or amino acid residues is deleted, substituted, or added. Variants may be naturally occurring allelic variants, or non-naturally occurring variants. Variants may be from the same or from other species and encompass homologous, paralogues and orthologues. The term "variant" encompasses both cDNA and genomic sequences. Variant sequences (polynucleotide or polypeptide) preferably exhibit at least 50%, more preferably at least 70%, more preferably at least 80%, more preferably at least 90%, more preferably at least 95%, more preferably at least 98%, and most preferably at least 99% identity to a sequences of the present invention. The term variant, with reference to polynucleotides, also encompasses sequences complementary to any of the polynucleotides used in the invention.
The term "percent identity" or "percent identical" when used in connection with the polynucleotide molecules and polypeptides, is defined as the percentage of nucleic acid residues in a candidate polynucleotide sequence, or the percentage of amino acid residues in a candidate polypeptide sequence, that are identical with a subject polynucleotide sequence or polypeptide molecule sequence (such as the polynucleotide sequence of SEQ ID NO: 2), after aligning the candidate and subject sequences to achieve the maximum percent identity, and not considering any nucleic acid residue substitutions as part of the polynucleotide sequence identity. When making the comparison, the candidate polynucleotide sequence or polypeptide sequence (which may be a portion of a larger polynucleotide sequence or polypeptide sequence) is the same length as the subject polynucleotide sequence or polypeptide sequence, and no gaps are introduced into the candidate polynucleotide sequence or polypeptide sequence in order to achieve the best alignment.
Polynucleotide sequence identity can be determined in the following manner. The subject polynucleotide sequence is used to search a polynucleotide sequence database, such as the GenBank database (accessible at web site http ://www. nobi. nln. nih. gov/blast/), using the program BLASTN version 2.2.5 (based on Altschul et al, 1997, Nucleic Acids Res, 25, 3389). The programs are used in the ungapped mode. Default filtering is used to remove sequence homologies due to regions of low complexity. The default parameters of BLASTN, BLASTX and tBLASTX are utilized.
Polypeptide sequence identity can be determined in the following manner. The subject polypeptide sequence is used to search a polypeptide sequence database, such as the GenBank database (accessible at web site http ://www. nobi. nln. nih. gov/blast/), using the BLASTP program. The program is used in the ungapped mode. Default filtering is used to remove sequence homologies due to regions of low complexity. The default parameters of BLASTP are utilized.
According to one embodiment, "variant" polynucleotides and polypeptides, with reference to each of the polynucleotides and polypeptides of the present invention, produce an E value of 0.01 or less when compared to the polynucleotide or polypeptide of the present invention using BLASTN, FASTA or BLASTP algorithms set to default parameters. That is, a variant polynucleotide or polypeptide is any sequence that has at least a 99% probability of being the same as the polynucleotide or polypeptide of the present invention. According to a preferred embodiment, a variant polynucleotide is a sequence having the same number or fewer nucleic acids than a polynucleotide of the present invention that has at least a 99% probability of being the same as the polynucleotide of the present invention, measured as having an E value of 0.01 or less using the BLASTN or FASTA algorithms set at parameters described above. Similarly, according to a preferred embodiment, a variant polypeptide is a sequence having the same number or fewer amino acids than a polypeptide of the present invention that has at least a 99% probability of being the same as a polypeptide of the present invention, measured as having an E value of 0.01 or less using the BLASTP algorithm set at the parameters described above.
Alternatively, variant polynucleotides of the present invention hybridize to the polynucleotide sequences recited in SEQ ID NO: 2-9, or complements thereof, antisense sequences and complements thereof, under stringent conditions. As used herein, "stringent conditions" refers to hybridization conditions such as prewashing in a solution of 6X SSC, 0.2% SDS; hybridizing at 65°C, 6X SSC, 0.2% SDS overnight; followed by two washes of 30 minutes each in IX SSC, 0.1% SDS at 65° C and two washes of 30 minutes each in 0.2X SSC, 0.1% SDS at 65°C.
Variant polynucleotides of the present invention also encompasses polynucleotides that differ from the sequences of the invention but that, as a consequence of the degeneracy of the genetic code, encode a polypeptide having similar activity to a polypeptide encoded by a polynucleotide of the present invention. A sequence alteration that does not change the amino acid sequence of the polypeptide is a "silent variation". Except for ATG (methionine) and TGG (tryptophan), other codons for the same amino acid may be changed by art recognized techniques, e.g., to optimize codon expression in a particular host organism. Conservative substitutions of one or several amino acids of a described polypeptide sequence without significantly altering its biological activity are also included in the invention. A skilled artisan will be aware of methods for making phenotypically silent amino acid substitutions (see, e.g., Bowie et al, 1990, Science 247, 1306).
Additionally, polynucleotides comprising sequences that differ from the polynucleotide sequences identified above as a result of deletions and/or insertions totaling less than 10% of the total sequence length are also useful in methods and products of the present invention. Similarly, polypeptides comprising sequences that differ from the polypeptide sequences identified as a result of amino acid substitutions, insertions, and/or deletions totaling less than 10% of the total sequence length are useful in methods and products of the present invention. In certain embodiments, variants of the inventive polypeptides and polynucleotides possess biological activities that are the same or similar to those of the inventive polypeptides or polynucleotides.
The term "hybridize under stringent conditions", and grammatical equivalents thereof, refers to the ability of a polynucleotide molecule to hybridize to a target polynucleotide molecule (such as a target polynucleotide molecule immobilized on a DNA or RNA blot, such as a Southern blot or Northern blot) under defined conditions of temperature and salt concentration. The ability to hybridize under stringent hybridization conditions can be determined by initially hybridizing under less stringent conditions then increasing the stringency to the desired stringency.
With respect to polynucleotide molecules greater than about 100 bases in length, typical stringent hybridization conditions are no more than 25 to 30° C (for example, 10° C) below the melting temperature (Tm) of the native duplex (see generally, Sambrook et al, Eds, 1987, Molecular Cloning, A Laboratory Manual, 2nd Ed. Cold Spring Harbor Press; Ausubel et al, 1987, Current Protocols in Molecular Biology, Greene Publishing,). Tm for polynucleotide molecules greater than about 100 bases can be calculated by the formula Tm = 81. 5 + 0. 41% (G + C-log (Na+).( Sambrook et al, Eds, 1987, Molecular Cloning, A Laboratory Manual, 2nd Ed. Cold Spring Harbor Press; Bolton and McCarthy, 1962, PNAS 84:1390)
With respect to polynucleotide molecules having a length less than 100 bases, exemplary stringent hybridization conditions are 5 to 10° C below Tm. On average, the Tm of a polynucleotide molecule of length less than 100 bp is reduced by approximately (500/oligonucleotide length)0 C. With respect to the DNA mimics known as peptide nucleic acids (PNA's) (Nielsen et al, Science. 1991 Dec 6;254(5037): 1497-500) Tm values are higher than those for DNA-DNA or DNA-RNA hybrids, and can be calculated using the formula described in Giesen et al, Nucleic Acids Res. 1998 Nov l;26(21):5004-6. Exemplary stringent hybridization conditions for a DNA-PNA hybrid having a length less than 100 bases are 5 to 10° C below the Tm.
The term "genetic construct" refers to a polynucleotide molecule, usually double-stranded DNA, which may have inserted into it another polynucleotide molecule (the insert polynucleotide molecule) such as, but not limited to, a cDNA molecule. A genetic construct may contain the necessary elements that permit transcribing the insert polynucleotide molecule, and, optionally, translating the transcript into a polypeptide. The insert polynucleotide molecule may be derived from the host cell, or may be derived from a different cell or organism and/or may be a recombinant polynucleotide. Once inside the host cell the genetic construct may become integrated in the host chromosomal DNA. The genetic construct may be linked to a vector.
The term "vector" refers to a polynucleotide molecule, usually double stranded DNA, which is used to transport the genetic construct into a host cell. The vector may be capable of replication in at least one host system, such as E. coli.
The term "expression construct" refers to a genetic construct that includes the necessary elements that permit transcribing the insert polynucleotide molecule, and, optionally, translating the transcript into a polypeptide. An expression construct typically comprises in a 5' to 3' direction: a) a promoter functional in the host cell into which the construct will be transformed, b) the polynucleotide to be expressed, and c) a terminator functional in the host cell into which the construct will be transformed.
The term "coding region" or "open reading frame" (ORF) refers to the sense strand of a genomic DNA sequence or a cDNA sequence that is capable of producing a transcription product and/or a polypeptide under the control of appropriate regulatory sequences. The coding sequence is identified by the presence of a 5' translation start codon and a 3' translation stop codon. When inserted into a genetic construct, a "coding sequence" is capable of being expressed when it is operably linked to promoter and terminator sequences. "Operably-linked" means that the coding sequence is in the correct reading frame for translation and its expression is placed under the control of regulatory elements that include promoters, tissue-specific regulatory elements, temporal regulatory elements, enhancers, silencers and terminators.
The term "noncoding region" refers to untranslated sequences that are upstream of the translational start site and downstream of the translational stop site. These sequences are also referred to respectively as the 5' UTR and the 3' UTR. These regions include elements required for transcription initiation and termination and for regulation of translation efficiency.
Terminators are sequences, which terminate transcription, and are found in the 3' untranslated ends of genes downstream of the translated sequence. Terminators are important determinants of mRNA stability and in some cases have been found to have spatial regulatory functions.
The term "promoter" refers to nontranscribed cis-regulatory elements upstream of the coding region that regulate gene transcription. Promoters comprise cis-initiator elements which specify the transcription initiation site and conserved boxes such as the TATA box, and motifs that are bound by transcription factors.
A "transgene" is a polynucleotide that is taken from one organism and introduced into a different organism by transformation. The transgene may be derived from the same species or from a different species as the species of the organism into which the transgene is introduced.
A "transgenic plant" refers to a plant which contains new genetic material as a result of genetic manipulation or transformation. The new genetic material may be derived from a plant of the same species as the resulting transgenic plant or from a different species.
An "inverted repeat" is a sequence that is repeated, where the second half of the repeat is in the complementary strand, e.g., (5')GATCTA TAGATC(3') (3 CTAGAT ATCTAG(5 ')
Read-through transcription will produce a transcript that undergoes complementary base-pairing to form a hairpin structure provided that there is a 3-5 bp spacer between the repeated regions. The term "involved in condensed tannin production" means that such polynucleotides and polypeptides may function to activate, inhibit or otherwise modulate CT production in a plant. Such modulation is intended to include alteration of the type of CT monomer/s present and alteration of the level of polymerization of CTs. Such polynucleotides and polypeptide may act within the CT biosynthetic pathway or may function in regulation of the CT biosynthetic pathway.
The term "polypeptide involved in condensed tannin production" refers to a polypeptide, an aggregate of a polypeptide such as a dimer or other multimer, a fusion polypeptide, a polypeptide fragment, a polypeptide variant, or derivative thereof capable of affecting the production of CT in a plant.
The applicants have identified polynucleotides involved in condensed tannin biosynthesis in plants. A polynucleotide sequence, up-regulated in a naturally occurring Gossypium hirsutum (cotton) mutant, which produces unusually high levels of CTs, relative to wild-type plants, was identified by subtractive methods. This cotton polynucleotide sequence was also used to identify and isolate further orthologous polynucleotides from Arabidopsis thaliana, Lolium perenne (ryegrass) and other species. Where available, 'in-silico' data indicated that the orthologous polynucleotides are expressed in tissues where condensed tannins are expected to be produced. The applicants further identified a consensus polypeptide sequence present in all of the polypeptides encoded by the orthologous polynucleotide sequences, as shown in Figure 3.
The invention provides transgenic plants with altered CT content, including plants with both increased and decreased levels of CTs and methods for the production of such plants. The invention further provides methods for selection of plants with altered condensed tannin content, as compared to the majority of similar wild-type plants. Such methods based on use of polynucleotides or polypeptides described herein may be used in traditional breeding programs in the development of varieties with altered condensed tannin content.
In a first aspect the invention uses an isolated polynucleotide encoding a polypeptide comprising the amino acid sequence: S Xx X2 C C N X3 T I t D X5 F X6 X7 X8 X9 F E X10 L F Xn X12 RN (SEQ ID NO:l) wherein X^K, E, I or R, X2=Y or F, X3=H, Q, D or G, r=S or T, X5=Y or F, X6=Q, E or K, X7=T, V or A, X8=Y or H, X9=Q, K or H, X10= D, N, Q or E, ~X Ϋ= , S or P and X12=K, R or H. The isolated polynucleotide of this aspect encodes a polypeptide involved in condensed tannin production in plants.
The invention further uses an isolated polynucleotide comprising a sequence selected from a group of related sequences derived from Gossypium hirsutum (cotton), Arabidopsis thaliana and Lolium perenne (ryegrass) (SEQ ID NOs: 2, 3 and 4 respectively) or a variant thereof, wherein the polynucleotide encodes a polypeptide involved in condensed tannin production in plants.
The present invention also encompasses use of polynucleotide variants of SEQ ID NOs: 2, 3 and 4. Such polynucleotide variants preferably exhibit at least 50%, more preferably at least 70%, more preferably at least 80%, more preferably at least 90%, more preferably at least 95%, more preferably at least 98%, and most preferably at least 99% identity to any one of the sequences set forth in SEQ ID NOs: 2, 3, or 4.
The invention further provides naturally occurring polynucleotide orthologues of SEQ ID NOs: 2, 3 or 4, and variants of such orthologues. Exemplary orthologues are disclosed herein and identified as SEQ ID NO: 5-9 of the sequence listing.
The polynucleotide molecules can be isolated by using a variety of techniques known to those of ordinary skill in the art. By way of example, such polynucleotides can be isolated through use of the polymerase chain reaction (PCR) described in Mullis et al, Eds. 1994 The Polymerase Chain Reaction, Birkhauser, incorporated herein by reference.
The isolated polynucleotides can be amplified using primers, as defined herein, derived from the polynucleotide sequences . Variant polynucleotides may also be identified using such primers. Typically, the polynucleotide sequence of a primer, useful to amplify variants of polynucleotide molecules by PCR, may be based on a conserved region of amino acid sequence of the polypeptides
Further, by way of example, inverse PCR permits acquisition of unknown sequences, flanking the polynucleotide sequences disclosed herein, starting with primers based on a known region (Triglia et al, 1998, Nucleic Acids Res 16, 8186, incorporated herein by reference). The method uses several restriction enzymes to generate a suitable fragment in the known region of a gene. The fragment is then circularized by intramolecular ligation and used as a PCR template. Divergent primers are designed from the known region. Further methods for identifying variant polynucleotides by methods of the invention include use of all, or portions of, the polynucleotides having the sequence set forth in SEQ ID NO : 2-4 as hybridization probes to screen a plant genomic or cDNA libraries. The technique of hybridizing 5 labelled polynucleotide probes to polynucleotides immobilized on nitrocellulose filters or nylon membranes can be used to screen the genomic or cDNA libraries. Exemplary hybridization and wash conditions are : hybridization for 20 hours at 65°C in 5. 0 X SSC, 0. 5% sodium dodecyl sulfate, 1 X Denhardt's solution ; washing (three washes of twenty minutes each at 55°C) in 1. 0 X SSC, 1% (w/v) sodium dodecyl sulfate, and optionally one wash (for twenty minutes) in 0. 5 10 X SSC, 1% (w/v) sodium dodecyl sulfate, at 60°C. An optional further wash (for twenty minutes) can be conducted under conditions of 0. I X SSC, 1% (w/v) sodium dodecyl sulfate, at 60°C.
The variant sequences of the invention can also be identified by computer-based methods, using 15 public domain sequence alignment algorithms and sequence similarity search tools to search sequence databases (public domain databases include Genbank, EMBL, Swiss-Prot, PIR and others). See, e.g., Nucleic Acids Res. 29: 1-10 and 11-16, 2001 for examples of online resources. Similarity searches retrieve and align sequences for comparison with a target sequence to be analyzed (i.e., a query sequence). Sequence comparison algorithms use scoring matrices to 20 assign an overall score to each of the alignments.
Polynucleotide and polypeptide sequences may be aligned, and percentage of identical residues in a specified region may be determined against other polynucleotide and polypeptide sequences, using computer algorithms that are publicly available. Two exemplary algorithms for aligning
25 and identifying the similarity of polynucleotide sequences are the BLASTN and FASTA algorithms. Polynucleotides may also be analyzed using the BLASTX algorithm, which compares the six-frame conceptual translation products of a nucleotide query sequence (both strands) against a protein sequence database. The similarity of polypeptide sequences may be examined using the BLASTP algorithm. The BLASTN algorithm Version 2.0.11 [Jan-20-2000]
30 set to the default parameters described in the documentation and distributed with the algorithm, are preferred for use in the determination of polynucleotide variants used in the present invention. The following parameters are preferred for determination of alignments and similarities using the BLASTX program version 2.0.11 [Jan-20-2000] Unix running command: blastall -p blastn -d swissprottrembledb -e 10 -G 0 -E 0 -v 30 -b 30 -i queryseq -o results. The
35. BLASTP algorithm, is preferred for use in the determination of polypeptide variants used in the present invention. The following running parameters are preferred for determination of alignments and similarities using BLASTP that contribute to the E values and percentage identity of polypeptide sequences: blastall -p blastp -d swissprottrembledb -e 10 -G 0 -E 0 -v 30 -b 30 -i queryseq -o results; the parameters are: -p Program Name [String]; -d Database [String]; -e Expectation value (E) [Real]; -G Cost to open a gap (zero invokes default behavior) [Integer]; -E Cost to extend a gap (zero invokes default behavior) [Integer]; -v Number of one-line descriptions (v) [Integer]; -b Number of alignments to show (b) [Integer]; -I Query File [File In]; -o BLAST report Output File [File Out] Optional.
The use of the BLAST family of algorithms, including BLASTN, BLASTP, and BLASTX, is described in the publication of Altschul et al, Nucleic Acids Res. 25: 3389-3402, 1997. The BLASTN software is available on the NCBI server (http://ncbi.nlm.nih.gov) under /blast/executables/ and is available from the National Center for Biotechnology Information (NCBI), National Library of Medicine, Building 38A, Room 8N805, Bethesda, MD 20894 USA.
The FASTA software package is available from the University of Virginia (University of Virginia, PO Box 9025, Charlottesville, VA 22906-9025). Version 2.0u4, February 1996, set to the default parameters described in the documentation and distributed with the algorithm, may be used in the determination of variants used in the present invention. The use of the FASTA algorithm is described in Pearson and Lipman, 1988, Proc Natl Acad Sci USA, 85, 2444; and Pearson, 1990, Methods in Enzymol. 183, 63.
The "hits" to one or more database sequences by a queried sequence produced by BLASTN, FASTA, BLASTX, BLASTP or a similar algorithm, align and identify similar portions of sequences. The hits are arranged in order of the degree of similarity and the length of sequence overlap. Hits to a database sequence generally represent an overlap over only a fraction of the sequence length of the queried sequence.
The BLASTN, FASTA and BLASTP algorithms also produce "Expect" values for alignments. The Expect value (E) indicates the number of hits one can "expect" to see over a certain number of contiguous sequences by chance when searching a database of a certain size. The Expect value is used as a significance threshold for determining whether the hit to a database, such as the preferred EMBL database, indicates true similarity. For example, an E value of 0.1 assigned to a polynucleotide hit is interpreted as meaning that in a database of the size of the EMBL database, one might expect to see 0.1 matches over the aligned portion of the sequence with a similar score simply by chance. By this criterion, the aligned and matched portions of the polynucleotide sequences then have a probability of 90% of being the same. For sequences having an E value of 0.01 or less over aligned and matched portions, the probability of finding a match by chance in the EMBL database is 1% or less using the BLASTN, FASTA, BLASTX or BLASTP algorithm.
Multiple sequence alignments of a group of related sequences can be carried out with PILEUP, which uses progressive, pairwise alignments. (Feng and Doolittle, 1987, J. Mol. Evol. 25, 351).
Pattern recognition software applications are available for finding motifs or signature sequences. For example, MEME (Multiple Em for Motif Elicitation) finds motifs and signature sequences in a set of sequences, and MAST (Motif Alignment and Search Tool) uses these motifs to identify similar or the same motifs in query sequences. The MAST results are provided as a series of alignments with appropriate statistical data and a visual overview of the motifs found. MEME and MAST were developed at the University of California, San Diego.
PROSITE (Bairoch and Bucher, 1994, Nucleic Acids Res. 22, 3583; Hofinann et al, 1999, Nucleic Acids Res. 27, 215) is a method of identifying the functions of uncharacterized proteins translated from genomic or cDNA sequences. The PROSITE database (www.expasy.org/prosite) contains biologically significant patterns and profiles and is designed so that it can be used with appropriate computational tools to assign a new sequence to a known family of proteins or to determine which known domain(s) are present in the sequence (Falquet et al, 2002, Nucleic Acids Res. 30, 235). Prosearch is a tool that can search SWISS-PROT and EMBL databases with a given sequence pattern or signature.
Proteins can be classified according to their sequence relatedness to other proteins in the same genome (paralogues) or a different genome (orthologues). Orthologous genes are genes that evolved by speciation from a common ancestral gene. These genes normally retain the same function as they evolve. Paralogous genes are genes that are duplicated within a genome. These genes may acquire new specificities or modified functions which may be related to the original one. Phylogenetic analysis methods are reviewed in Tatusov et al, 1997, Science 278, 631- 637,). A partial cDNA sequence may be used, in methods well-known in the art to identify the corresponding full length polynucleotide sequence containing an open reading frame (ORF) necessary for over-expression of the encoded polypeptide in transgenic organisms or cell/s of a such organisms. Such organisms would include bacteria, yeast, fungi, insects, mammals and plants. Such methods would include PCR-based methods, hybridization- based method, computer/database -based methods. In order to physically assemble full-length clones, standard molecular biology approaches can be utilized (Sambrook et al, Molecular Cloning: A Laboratory Manual, 2nd Ed. Cold Spring Harbor Press, 1987).
The involvement of variant polynucleotides in condensed tannin production may be assessed by the transgenic methodologies described herein together with analysis of condensed tannin content of the resulting transgenic plants as described in Example 1.
Described herein is an isolated polypeptide comprising the amino acid sequence: S Xj X2 C C N X3 T I t D X5 F X6 X7 X8 X9 F E X10 L F Xπ X12 R N (SEQ ID NO:l) wherein
Figure imgf000019_0001
E or K, X7=T,V or A, X8=Y or H, X9=Q, K or H, X10= D, N, Q or E, Xπ=A, S or P and X12=K, R or H. The polypeptide is involved in condensed tannin production in plants.
The isolated polypeptide may be selected from a group of related polypeptides comprising sequences derived from Gossypium hirsutum (cotton), Arabidopsis thaliana and Lolium perenne (SEQ ID NO's: 10, 11 and 12 respectively) or a variant thereof, wherein the polypeptide is involved in condensed tannin production in plant.
Also described herein are polypeptide variants of SEQ ID NOs: 10, 11 and 12. Such variants preferably exhibit at least 50%, more preferably at least 70%, more preferably at least 80%, more preferably at least 90%, more preferably at least 95%, more preferably at least 98%, and most preferably at least 99% identity to any one of the sequences set forth in SEQ ID NOs: 10, 11 and
12.
Also described herein are naturally occurring polypeptide orthologues of SEQ ID NO's 10, 11 and 12, and variants of such orthologues. Exemplary orthologues are disclosed herein and identified as SEQ ID NOs: 13-17 of the sequence listing. The polypeptides may be prepared using peptide synthesis methods well known in the art such as direct peptide synthesis using solid phase techniques (e.g. Stewart et al, 1969, in Solid-Phase Peptide Synthesis, WH Freeman Co, San Francisco California , or automated synthesis, for example using an Applied Biosystems 431 A Peptide Synthesizer (Foster City, California). Mutated forms of the polypeptides may also be produced during such syntheses.
The polypeptides may also be purified from natural sources using a variety of techniques that are well known in the art (e.g. Deutscher, 1990, Ed, Methods in Enzymology, Vol. 182, Guide to Protein Purification^).
Alternatively the polypeptides may be expressed recombinantly in suitable host cells and separated from the cells as discussed below.
The polypeptide variants may be identified by screening expression libraries using antibodies raised against previously identified polypeptides . Such antibodies may also be used in the isolation of polypeptide variants . The DNA sequence of polynucleotides identified through expression screening, as well as with the DNA sequences of the polynucleotides already identified may be used in computer based approaches as described above to identify polynucleotide sequences encoding polypeptide variants. Such polynucleotide sequences may be utilized for the recombinant production of polypeptide variants of the invention. Such procedures are standard practice for art skilled workers (Sambrook et al, Molecular Cloning: A Laboratory Manual, 2nd Ed. Cold Spring Harbor Press, 1987).
The genetic constructs used in the present invention comprise one or more polynucleotide sequences of the invention and/or polynucleotides encoding polypeptides of the invention, and may be useful for transforming, for example, bacterial, fungal, insect, mammalian or plant organisms. The genetic constructs are intended to include expression constructs as herein defined.
Methods for producing and using genetic constructs and vectors are well known in the art and are described generally in Sambrook et al, Molecular Cloning: A Laboratory Manual, 2nd Ed. Cold Spring Harbor Press, 1987 ; Ausubel et al, Current Protocols in Molecular Biology, Greene Publishing, 1987) A host cell which comprises a genetic construct or vector comprising a polynucleotide as described above may be used in the invention. Host cells may be derived from, for example, bacterial, fungal, insect, mammalian or plant organisms.
Host cells comprising genetic constructs, such as expression constructs, of the invention are useful in methods well known in the art (e.g. Sambrook et al, Molecular Cloning : A Laboratory Manual, 2nd Ed. Cold Spring Harbor Press, 1987 ; Ausubel et al, Current Protocols in Molecular Biology, Greene Publishing, 1987) for recombinant production of polypeptides described herein. Such methods may involve the culture of host cells in an appropriate medium in conditions suitable for or conducive to expression of a polypeptide of the invention. The expressed recombinant polypeptide, which may optionally be secreted into the culture, may then be separated from the medium, host cells or culture medium by methods well known in the art (e.g. Deutscher, Ed, 1990, Methods in Enzymology, Vol 182, Guide to Protein Purification).
The host cells may also be useful in methods for production of an enzymic product generated by an expressed polypeptide of the invention. Such methods may involve culturing the host cells of the invention in a medium suitable for expression of a recombinant polypeptide of the invention, optionally in the presence of additional enzymic substrate for the expressed polypeptide of the invention. The enzymic product produced may then be separated from the host cells or medium by a variety of art standard methods.
Plant cells may be transformed to comprise a genetic construct of the invention, and plant cells modified to alter expression of a polynucleotide or polypeptide of the invention.
Condensed tannin production in a plant may also be altered through methods of the invention. Such methods may involve the transformation of plant cells and plants, with a construct of the invention designed to alter expression of a polynucleotide or polypeptide involved in condensed tanin production in such plant cells and plants. Such methods also include the transformation of plant cells and plants with a combination of the construct of the invention and one or more other constructs designed to alter expression of one or more polynucleotides or polypeptides known to be involved in condensed tanin production in such plant cells and plants.
Methods for transforming plant cells, plants and portions thereof with polynucleotides are described in Draper et al, 1988, Plant Genetic Transformation and Gene Expression. A Laboratory Manual Blackwell Sci. Pub. Oxford, p. 365; Potrykus and Spangenburg, 1995, Gene Transfer to Plants. Springer- Verlag, Berlin.; and Gelvin et al, 1993, Plant Molecular Biol. Manual. Kluwer Acad. Pub. Dordrecht. A review of transgenic plants, including transformation techniques, is provided in Galun and Breiman,1997, Transgenic Plants. Imperial College Press, London.
A number of plant transformation strategies are available (e.g. Birch, 1997, Ann Rev Plant Phys Plant Mol Biol, 48, 297). For example, strategies may be designed to increase expression of a polynucleotide/polypeptide in a plant cell tissue or organ in which it is normally expressed or to ectopically express a polynucleotide/polypeptide is a cell tissue or organ which it is not normally expressed. The expressed polynucleotide/polypeptide may be dervied from the plant species to be transformed or may be derived from a different plant species.
Alternatively, strategies may be designed to reduce expression of a polynucleotide/polypeptide in a plant cell, tissue or organ in which it is normally expressed. Such strategies are known as gene silencing strategies.
Temporal regulation of expression of a polynucleotide/polypeptide may also be achieved in the above strategies, through use of a suitable promoter.
Genetic constructs for expression of genes in transgenic plants typically include promoters for driving the expression of one or more cloned polynucleotide, terminators and selectable marker sequences.
The promoters suitable for use in the constructs of this invention are functional in a cell, tissue or organ of a monocot or dicot plant and include cell-, tissue- and organ-specific promoters, cell cycle specific promoters, temporal promoters, inducible promoters, constitutive promoters that are active in most plant tissues, and recombinant promoters. Choice of promoter will depend upon the temporal and spatial expression of the cloned polynucleotide, so desired. The promoters may be those normally associated with a transgene of interest, or heterologous promoters which are derived from genes of other plants, viruses, and plant pathogenic bacteria and fungi. Those skilled in the art will, without undue experimentation, be able to select promoters that are suitable for use in modifying and modulating plant traits using genetic constructs comprising the polynucleotide sequences of the invention. Examples of constitutive plant promoters include the CaMV 35S promoter, the nopaline synthase promoter and the octopine synthase promoter, and the Ubi 1 promoter from maize. Specific plant promoters which are active in specific tissues, respond to internal developmental signals or external abiotic or biotic stresses are described in the scientific literature. Exemplary promoters are described, e.g., in WO 02/00894, which is herein incorporated by reference.
Exemplary terminators that are commonly used in plant transformation genetic construct include, e.g., the cauliflower mosaic virus (CaMV) 35S terminator, the Agrobacterium tumefaciens nopaline synthase or octopine synthase terminators, the Zea mays zin gene terminator, the Oryza sativa ADP-glucose pyrophosphorylase terminator and the Solanum tuberosum PI- II terminator.
Selectable markers commonly used in plant transformation include the neomycin phophotransfenrse II gene (NPT II) which confers kanamycin resistance, the aadA gene, which confers spectinomycin and streptomycin resistance, the phosphinothricin acetyl transferase (bar gene) for Ignite (AgrEvo) and Basta (Hoechst) resistance, and the hygromycin phosphotransferase gene ( hpt) for hygromycin resistance.
Use of genetic constructs comprising reporter genes (coding sequences which express an activity that is foreign to the host, usually an enzymatic activity and/or a visible signal (e.g., luciferase, GUS, GFP) which may be used for promoter expression analysis in plants and plant tissues are also contemplated. The reporter gene literature is reviewed in Herrera-Estrella et al, 1993, Nature 303, 209, and Schrott, 1995, In: Gene Transfer to Plants (Potrykus, T., Spangenbert. Eds) Springer Verlag. Berline, pp. 325-336.
Gene silencing strategies may be focused on the gene itself or regulatory elements which effect expression of the encoded polypeptide. "Regulatory elements is used here in the widest possible sense and includes other genes which interact with the gene of interest.
Genetic constructs designed to decrease or silence the expression of a polynucleotide/polypeptide using the invention may include an antisense copy of the polynucleotide to be silenced. In such constructs the polynucleotide is in an antisense orientation with respect to the promoter and terminator.
An "antisense" polynucleotide is obtained by inverting a polynucleotide or a segment of the polynucleotide so that the transcript produced will be complementary to the mRNA transcript of the gene, e.g., 5OATCTA 3' (coding strand) 3'CTAGAT 5' (antisense strand)
3'CUAGAU 5' mRNA 5'GAUCUCG 3' antisense RNA
Genetic constructs designed for gene silencing may also include an inverted repeat. An 'inverted repeat' is a sequence that is repeated where the second half of the repeat is in the complementary strand, e.g.,
5 '-GATCTA TAGATC-3 '
3 '-CTAGAT ATCTAG-5 '
The transcript formed may undergo complementary base pairing to form a hairpin structure provided there is a spacer of at least 3-5 bp between the repeated regions.
Another silencing approach involves the use of a small antisense RNA targeted to the transcript equivalent to an miRNA (Llave et al, 2002, Science 297, 2053). Use of such small antisense RNA corresponding to polynucleotides described herein is expressly contemplated. The term genetic construct as used herein also includes such small antisense RNAs.
Transformation with an expression construct, as herein defined, may also result in gene silencing through a process known as sense suppression (e.g. Napoli et al, 1990, Plant Cell 2, 279; de Carvalho Niebel et al, 1995, Plant Cell, 7, 347). In some cases sense suppression may involve over-expression of the whole or a partial coding sequence but may also involve expression of non-coding region of the gene, such as an intron or a 5' or 3' untranslated region (UTR). Chimeric partial sense constructs can be used to coordinately silence multiple genes (Abbott et al, 2002, Plant Physiol. 128(3): 844-53; Jones et al, 1998, Planta 204: 499-505). The use of such sense suppression strategies to silence the expression of a polynucleotide described above is also contemplated.
Other gene silencing strategies include the dominant negative approach and the use of ribozyme constructs (Mclntyre, 1996, Transgenic Res, 5, 257)
Pre-transcriptional silencing may be brought about through mutation of the gene itself or its regulatory elements. Such mutations may include point mutations, frameshifts, insertions, deletions and substitutions.
The transgenic cells and plants of the invention may be from any species. In a preferred embodiment the transgenic cells and plants of the invention are derived from forage plant species from a group comprising but not limited to the following genera: Lolium, Festuca, Dactylis, Bromus, Trifolium, Medicago, Pheleum, Phalaris, Holcus, Lotus, Plantain and Cichorium.
In a more preferred embodiment of the invention the transgenic cells and plants are from the genera Lolium and Trifolium. Particularly preferred are the species Lolium perenne and Trifolium repens.
The following are representative publications disclosing genetic transformation protocols that can be used to genetically transform the following plant species: Rice (Alam et al, 1999, Plant Cell Rep. 18, 572) ; maize (U. S. Patent Serial Nos. 5, 177, 010 and 5, 981, 840) ; wheat (Ortiz et al, 1996, Plant Cell Rep. 15, 1996, 877) ; tomato (U. S. Patent Serial No. 5, 159, 135) ; potato (Kumar et al, 1996 Plant J. 9, : 821) ; cassava (Li et al, 1996 Nat. Biotechnology 14, 736) ; lettuce (Michelmore et al, 1987, Plant Cell Rep. 6, 439) ; tobacco (Horsch et al, 1985, Science 227, 1229) ; cotton (U. S. Patent Serial Nos. 5, 846, 797 and 5, 004, 863) ; grasses (U. S. Patent Nos. 5, 187, 073 and 6. 020, 539) ; peppermint (Niu et al, 1998, Plant Cell Rep. 17, 165) ; citrus plants (Pena et al, 1995, Plant Sci.104, 183) ; caraway (Krens et al, 1997, Plant Cell Rep, 17, 39) ; banana (U. S. Patent Serial No. 5, 792, 935) ; soybean (U. S. Patent Nos. 5, 416, 011 ; 5, 569, 834 ; 5, 824, 877 ; 5, 563, 04455 and 5, 968, 830) ; pineapple (U. S. Patent Serial No. 5, 952, 543) ; poplar (U. S. Patent No. 4, 795, 855) ; monocots in general (U. S. Patent Nos. 5, 591, 616 and 6, 037, 522) ; brassica (U. S. Patent Nos. 5, 188, 958 ; 5, 463, 174 and 5, 750, 871) ; and cereals (U. S. Patent No. 6, 074, 877).
Methods are also provided for selecting plants with altered levels of condensed tannin production.
The expression of a polynucleotide, such as a messenger RNA, is often used as an indicator of expression of a corresponding polypeptide. Exemplary methods for measuring the expression of a polynucleotide include but are not limited to Northern analysis, RT-PCR and dot-blot analysis (Sambrook et al, Molecular Cloning : A Laboratory Manual, 2nd Ed. Cold Spring Harbor Press, 1987). Polynucleotides or portions of the polynucleotides of the invention are thus useful as probes or primers, as herein defined, in methods for the identification of plants with altered levels of polypeptides involved in condensed tannin production. An altered level in a plant, of a polypeptide involved in condensed tannin biosynthesis may be used as an indicator of altered condensed tannin biosynthesis in such a plant. The polynucleotides described above may be used as probes in hybridization experiments, or as primers in PCR based experiments, designed to identify such plants.
Alternatively antibodies may be raised against the polypeptides described above. Methods for raising and using antibodies are standard in the art (see for example: Antibodies, A Laboratory Manual, Harlow A Lane, Eds, Cold Spring Harbour Laboratory, 1998). Such antibodies may be used in methods to detect altered expression of polypeptides involved in condensed tannin production in plants. Such methods may include ELISA (Kemeny, 1991, A Practical Guid to ELISA, NY Pergamon Press) and Western analysis (Towbin & Gordon, 1994, J Immunol Methods, 72, 313).
These approaches for analysis of polynucleotide or polypeptide expression and the selection of plants with altered expression are useful in conventional breeding programs designed to produce varieties with altered condensed tannin content.
The invention further provides plants selected by a method of the invention. The selected plants may be from any plant species. In a preferred embodiment of the invention, the selected plants of the invention are from forage plant species from a group comprising but not limited to the following genera: Lolium, Festuca, Dactylis, Bromus, Trifolium, Medicago, Pheleum, Phalaris, Holcus, Lotus, Plantain and Cichorium.
In a more preferred embodiment of the invention the selected plants are from the genera Lolium and Trifolium. Particularly preferred are the species Lolium perenne and Trifolium repens.
The plants of the invention with altered condensed tannin production, may be transgenic plants or those selected based on analysis of polynucleotide/polypeptide expression, as described above. The plants of the invention may be grown and either selfed or crossed with a different plant strain and the resulting hybrids, with the desired phenotypic characteristics, may be identified. Two or more generations may be grown to ensure that the subject phenotypic characteristics are stably maintained and inherited. Plants resulting from such standard breeding approaches also form an aspect of the present invention.
Exemplary methods for measuring condensed tannin content in transgenic plants of the invention or plants selected through methods of the invention are provided in Example 1 below. BRIEF DESCRIPTION OF THE DRAWINGS
The present invention will be better understood with reference to the accompanying drawings in which:
Figure 1 shows the known steps in the biosynthetic pathway for flavonoids. The figure is from Tanner et al, Proanthocyanidin biosynthesis in plants: Purification of legume leucoanthocyanidin reductase and molecular cloning of its cDNA J. Biol. Chem., In press (June 2003).
Figure 2 shows the hydrophobicity profile (Persson, B. & Argos, P. (1994) Prediction of transmembrane segments in proteins utilsing multiple sequence alignments J. Mol. Biol. 237, 182-192.) for the polypeptide represented in SEQ ID NO: 11.
Figure 3 shows a multiple sequence alignment of polypeptides, from several species, orthologous to those presented in SEQ ID NOs 2, 3 and 4 and illustrates a region (underlined) containing a consensus sequence, identified by the applicants, which is present in all of the sequences.
Figure 4 shows LC-MS traces for 70% acetone fractions from mature silique from transgenic plants (A) and control plants (B). The transgenic plants were transformed with an RNAi; construct containing a sequence corresponding to SEQ ID NO:3. The peak labeled [K-3r, 7r]-l represents kaempferol dirhamnoside. The numbers -1 and -2 indicate the change state in the mass spectrometer.
Figure 5 shows Proanthocyanidin levels normalized against dimer for transgenic and pHex control plants. The transgenic plants were transformed with an RNAi construct containing a sequence corresponding to SEQ ID NO:3. ι
EXAMPLES
The invention will now be illustrated with reference to the following non-limiting examples. Example 1. Identification and isolation of polynucleotides involved in condensed tannin biosynthesis.
The applicants identified a mutant (SAl) of cotton (Gossypium hirsutum) that produces high levels of CTs in the fiber. In all commercial lines of cotton CTs are either undetectable in fiber or present at very low levels. Since cotton fibers are single-cell trichomes and fiber development in each boll is synchronous, high-CT fibers represent a uniquely uniform CT-producing tissue. The applicants have compared genes expressed during early stages of fiber development in mutant and control lines to identify genes involved in CT biosynthesis.
Plant material
High-CT (SAl) and low-CT (DPL90) lines of cotton (Gossypium hirsutum) were grown in the field at USDA-ARS SRRC, New Orleans, La. Fiber was harvested from both lines at 14 days post-anthesis (DP A), removed from the seeds and frozen in liquid nitrogen immediately.
Chemical analysis of fiber CTs
Fiber samples were extracted with aqueous acetone (70% v/v) and the extracts separated into two fractions by LH-20 sephadex chromatography and elution with 50% methanol and 70% acetone. The fractions were examined by HPLC on a C18 column with an acetonitrile/formic acid gradient, with PDA-UV and mass spectrometric detection. Monomeric CT precursors were eluted in the 50% methanol fraction while CT oligomers and higher polymers were eluted in the 70% acetone fraction.
RNA extraction
Total RNA was prepared from fiber samples harvested from SAl and DPL90 lines at 14 DP A, using Concert ■ TM Plant RNA reagent (Invitrogen) with a modified extraction protocol. Cotton fiber was ground under liquid N2, and added to Concert™ reagent pre-equilibrated at 37°C (lOmL Concert™ reagent/g tissue). The mixture was homogenized for 3 minutes using a polytron homogenizer (speed 20 000 rpm) and incubated 5 minutes at room temperature. RNA was then extracted from the homogenate as described by the manufacturer. The purified RNA was extracted a second time with Concert™ reagent (4mL Concert™ reagent/mg RNA) and mRNA was purified using Oligotex resin (Qiagen). Construction and Screening of Subtracted cDNA libraries
cDNA was prepared from the SAl and DPL90 fiber mRNA samples and subtraction performed in both directions using the suppression subtractive hybridization (SSH) technique (Gurskaya et al, 1996, Anal Biochem, 240, 90-97; Diachenko et al, 1996, Proc Nat Acad Sci USA, 93, 6025). To reduce the background of non-differentially expressed clones, a mirror orientation selection (MOS) step (Rebrikov et al, 2000, Nucl Acid Res, 28 20) was carried out after subtractive hybridization. The cDNAs were ligated into a plasmid vector (pAL9, Euregene) and transformed into Escherichia coli. Individual clones were plated in 96-well plates; six plates were prepared for the forward subtracted (SAl -specific) library and one plate for the reverse- subtracted (DPL90-specific) library. Differential screening of plated clones was performed using forward-subtracted cDNA and reverse-subtracted cDNA samples as probes.
Chemical analysis and construction of subtracted cDNA libraries
Fiber samples were harvested from SAl and DPL90 lines of Gossypium hirsutum at 14 DP A. Chemical analysis of fiber samples confirmed a significant difference in proanthocyanidin levels between SAl and DPL90, and also indicated differences in chemical composition of the predominant proanthocyanodin oligomers (see supplemental report). RNA was isolated from the SAl and DPL90 fiber samples with Concert™ Plant RNA reagent (Invitrogen) using an extraction protocol modified for cotton tissue, and subtracted cDNA libraries were constructed using standard SSH and MOS techniques, as described in methods. Both a forward-subtracted library (enriched for genes upregulated in SAl) and a reverse-subtracted library (enriched for genes upregulated in SAl DPL90) were constructed. Both libraries were screened for differentially expressed genes by hybridization to labeled cDNA samples from SAl and DPL90 14 DPA fibers.
Sequencing and analysis of genes induced in high-CT ( SAl) cotton fiber
Differentially expressed clones induced in high-CT ( SAl) cotton fiber were randomly selected and sequenced in one direction. A total of 163 partial cDNA sequences were obtained. The sequence data was clustered and the Genbank NR database was queried using the BLASTX heuristic search algorithm (Altschul et al, 1990, JMol Biol 215, 403). Most genes (156 in total, or 96%) showed high similarity to functionally annotated plants genes in the Genbank database (Table 1.). There were seven genes (4%) that appeared orthologous to unknown or putative polypeptides.
Known CT biosynthetic genes are abundant in the differentially expressed cloned from high-CT ( SAl) cotton fiber.
Many of the genes with functionally annotated orthologs (65%) showed high sequence similarity to biosynthetic genes thought to be involved in the synthesis of CT precursors (leucoanthocyanadins and catechins) or related flavonoids. A smaller number (17%) appeared orthologous to genes considered unlikely to be involved in flavonoid biosynthesis. The remaining 29 genes (19%) appeared orthologous to genes that were functionally annotated but not highly characterized.
Identification of orthologous sequences and "in-silico" expression characterisation
The high abundance of biosynthetic genes for proanthocyanidin precursors in the SAl -specific library, together with the high levels of proanthocyanidins detected in SAl fiber indicates that many of the unknown, putative or poorly characterized SAl -specific genes are likely to be novel genes necessary for condensed tannin biosynthesis. Publicly available EST databases and an in house ryegrass EST database was searched for expressed genes orthologous to cotton genes thought likely to perform novel functions in proanthocyanidin biosynthesis. Several cotton genes with likely orthologs expressed in species and tissues known to produce CTs were selected as the best candidates for genes involved in CT production. The orthologues of one such gene appears in libraries prepared from tissues known to produce CTs as shown in table 1 consistent with its role in CT production. The sequence of this polynucleotide from cotton is presented in SEQ ID NO: 2 and is predicted to encode the polypeptide sequence presented in SEQ ID NO: 10. Orthologous sequences from Arabidopsis and ryegrass are presented in SEQ ID NOs 3 and 4 respectively and are predicted to encode polypeptides presented in SEQ ID NOs 11 and 12 respectively.
Table 1 shows "in-silico" expression data for sequences orthologous to that presented in SEQ ID NOs 2, 3 and 4. Table 1. Identification of expressed genes orthologous to polypeptides involved in condensed tannin biosynthesis
The NCBI EST sequence database (2003-May-09), comprising unannotated polynucleotide sequence data for genes expressed in specified plant species and tissues, was searched for sequences orthologous to those presented in SEQ ID NOs 2, 3 and 4. Orthologue accession/ID numbers are listed, together with e values indicating the confidence level for each sequence match. Proposed orthologues in Arabidopsis thaliana identified by searching the GenBank, EMBL, DDBJ and PDB annotated polynucleotide sequence databases are also listed, with e values.
Figure imgf000031_0001
Subsequently published was a sequence F286 from Gossypium hirsutum (US Published Patent Application 2004/0049808). The sequence encodes a polypeptide having a deduced amino acid sequence including the partial deduced sequence of SEQ ID NO: 10.
Since condensed tannins are sequestered in the plant vacuole, it is likely that the biosynthetic enzymes and transport proteins required for CT production will be targeted to the vacuole or the vacuolar membrane. The polypeptide sequence encoded by candidate polynucleotides were analysed for the presence of potential membrane-spanning regions and N or C-terminal vacuolar targeting signals.
The hydrophobicity profile (Persson, B. & Argos, P. (1994) Prediction of transmembrane segments in proteins utilising multiple sequence alignments J. Mol. Biol. 237, 182-192.) of the polypeptide presented in SEQ ID NO: 11 is shown in Figure 2. As shown, this polypeptide possesses a hydrophobic N-terminus which could potentially function as a vacuolar targeting signal. Example 2. Reduction of condensed tannin concentration and polymerization rate by RNAi in transgenic plants
Plant material Vegetative tissues were harvested from Arabidopsis thaliana, ecotype Columbia.
Extraction of genomic DNA
Genomic DNA was extracted using standard procedures.
Selection of sequences for inserts in RNAi constructs
PCR primers were designed to amplify products from a single exon in genomic sequence corresponding to the sequence in SEQ ID NO: 3. Prior to amplification, the anticipated PCR product sequences were used to perform BLASTN searches (Basic local alignment search tool (1990). Altschul SF; Gish W; Miller W; Myers EW; Lipman DJ J Mol Biol 215: 403-10 ) of Arabidopsis cDNA databases to ensure that they were specific to the sequence in SEQ ID NO: 3, and did not contain stretches of more than 25 nucleotides of perfect homology to any other Arabidopsis sequence.
Amplification of inserts
Inserts for the RNAi constructs were amplified from Arabidopsis thaliana genomic DNA using High Fidelity Expand Taq DNA polymerase (Roche Molecular Biochemicals) according to the manufacturers instructions. The primers used to amplify an insert corresponding to sequence in SEQ ID NO: 3 were as follows:
attBl-TACGTTAGCGTTTCAAGCGG (SEQ ID NO: 18)
attB2- ACCAGAACTCGGAGGAGCAG (SEQ ID NO: 19)
Polynucleotide sequence of amplified insert is shown below:
TACGTTAGCGTTTCAAGCGGCGATTTGGCGGTGGATGACGCCACCGAAGAAGCATTT ACCGTCGGCGCACGATGTTTTCGTCGGAAAATGGAAGCCAACAAAGAATGACACTG CTGCAAAACGAACTCCGGGGTTTGGAGCCACCATAAACGTTCTTTACGGCGATCAA ATCTGCAACAGCGGATTCGATAACGATGAGATGAACAACATTGTTTCTCATTATTTG TATTATCTTGATCTGATAGGTGTTGGACGAGAAGAAGCTGGTCCTCACGAGAAGCTG
TCATGCGCAGATCAAGAACCTTTTTCTTCTTCCTCCTCTGCTCCTCCGAGTTCTGGT
(SEQ ID NO.20)
Cloning of sequences corresponding to the sequence in SEQ ID NO: 3 into RNAi vector
The attB-flanked PCR products amplified above were recombined into attP sites of the GATEWAY donor vector pDONR201 (Invitrogen) to generate a pENTRY clone with the target sequence flanked by attL sites.
The pEntry clone containing the attL-flanked target sequence was recombined with the attR sites of pTKO2 (HortResearch NZ Ltd) to generate the final destination vector with an inverted repeat of target sequence either side of an intron and under control of CaMV 35S promoter.
The destination vector is capable of replication in both Agrobacterium tumefaciens and E.coli, and is maintained by Spectinomycin (lOOμg/ml) selection. The right and left T-DNA borders delineate the region transferred to the plant genome during Agrobacterium-mediated transformation and include the CaMV35S driven inverted repeat of the target gene and the pNos- NptII-Nos3' region for kanamycin selection of transgenic plants.
Production of transgenic -Arabidopsis plants
Transformation of Arabidopsis with the construct described above was by the floral dipping method of Clough and Bent (Clough and Bent, 1998, Plant J 16, 735).
Healthy Arabidopsis plants were grown under long days in pots in soil. Once flowering, the first bolts were clipped to encourage proliferation of many secondary bolts. Plants were the transformed approximately 4-6 days after clipping.
The destination vector described above was transformed into Agrobacterium tumefaciens strain GV3101 by standard procedures and cultured for 16 hrs at 28°C in LB broth with antibiotics for selection of the destination vector. This culture was then added to 200 ml LB. The Agrobacterium cells were then pelleted (by standard procedures) and resuspend to OD600 = 0.8 in 5% Sucrose solution, 200 ml for each two or three small pots to be dipped. Prior to floral dipping, Silwet L-77 (Setre Chemical Company) was added to a concentration of 0.05% (500 ul/L) in the Agrobacterium solution. The above-ground parts of plants were dipped in Agrobacterium solution for 20 to 30 seconds, with gentle agitation. Dipped plants were placed under a cover for 16 to 24 hours to maintain high humidity. Plants were watered and grown normally until seeds became mature then watering ceased.
Dry seed was harvested and plated to select for Ti transformants using antibiotic or herbicide selectable marker on 0.5X MS/0.8% tissue culture Agar plates with 50 ug/ml Kanamycin.
Putative transformants were transferred to soil and grown to produce T2 seed.
Chemical analysis of flavonoid content in Arabidopsis plants transformed with RNAI construct targeting gene At3gl6920 (Seq. ID 2)
Plant growth and silique collection
Arabidopsis plants transformed with an RNAi construct containing an insert sequence corresponding to the sequence in SEQ ID NO: 2. (T3 generation) and control plants (transformed with the RNAi vector only) were grown in a controlled environment under standard long day conditions (22 °C, 70% relative humidity, and a 14 hour photoperiod at an irradiance of 650 μmol m"2 s"1). Mature siliques were collected from the primary inflorescences of approximately 300 plants transformed with the RNAi construct and approximately 300 control plants, at the onset of yellowing of the most mature silique. All green siliques below the 4th silique from the top of each primary inflorescence were collected, frozen and pooled.
Chemical analysis of flavonoids
Flavonoids were extracted from the silique pools by standard protocols, purified by liquid chromatography using an LH20 column, and characterized by HPLC/PDA/ESI-MS. The ESI- MS chromatograms are shown in figure 4. Peaks for epicatechin (EC) monomer, dimer and higher oligomers are indicated and in figure 4A. The designations -1 and -2 indicate the ion charge state. The peak labelled [K-3r,7r]-l represents kaempferol dirhamnoside. The numbers in the upper left of the chromatograms are the m/z ratios of the single-ion channels. The peak area for epicatechin monomer, dimer and higher oligomers is listed in table 2. The steady-state concentration of each flavonoid species may be considered proportional to the peak areas. However quantification of pentamer and hexamer levels is impaired by interference from co- eluting components and the pentamer and hexamer peak areas are therefore not considered reliable.
Total proanthocyanidin levels were approximately 2-fold lower in plants transformed with the RNAi construct compared to the control (table 2), indicating a partial disruption of CT accumulation. To compare the relative levels of epicatechin monomer and oligomers in both RNAi and control plants, each of the peak areas was normalized against the value for dimer. Normalised peak area values are listed in table 3 and plotted in figure 5. The monomer/dimer value is more than 2-fold higher in the RNAi plants compared to controls, whereas trimer/dimer and tetramer/dimer values show no change. The relative abundance of monomer in plants transformed with the RNAi construct indicates a slower rate of polymerization in the RNAi plants, and implies that the target gene is involved in CT polymerization. The low rate of polymerization in the RNAi plants may be due to incomplete gene silencing, or functional redundancy.
Table 2
70% acetone fractions, peak areas
Figure imgf000035_0001
Table 3 70% acetone fractions, peak areas normalised against dimer
Figure imgf000035_0002
The above examples illustrate practice of the invention. It will be appreciated by those skilled in the art that numerous variations and modifications may be made without departing from the spirit and scope of the invention.
Brief description of sequences in sequence listing
SEQ ID NO:l (polypeptide, consensus)
SEQ ID NO:2 (polynucleotide, Gossipium hirsuitum, EGHSA1S005D10/5814R)
SEQ ID NO:3 (polynucleotide, Arabidopsis thaliana, AF412071/ AT3gl6920) SEQ ID NO:4 (polynucleotide, Lolium per enne, CLP0001001520-cE2)
SEQ ID NO:5 (polynucleotide, Arabidopsis thaliana, AY034935/ Atlg05850)
SEQ ID NO:6 (polynucleotide, Lolium per enne, FLPB002449F07-bOFSP
SEQ ID NO:7 (polynucleotide, Musa acuminata, Z99959)
SEQ ID NO:8 (polynucleotide, Zea mays, AY105990) SEQ ID NO:9 (polynucleotide, Oryza sativa, AP003865)
SEQ ID NO:10 (polypeptide, Gossipium hirsuitum, EGHSA1S005D10/5814R)
SEQ ID NO:l 1 (polypeptide, Arabidopsis thaliana, AT3gl6920)
SEQ ID NO:12 (polypeptide, Lolium per enne, CLP0001001520-cE2)
SEQ ID NO:13 (polypeptide, Arabidopsis thaliana, Atlg05850) SEQ ID NO: 14 (polypeptide, Lolium perenne, FLPB002449F07- bOFSP_20011203_fgenesh_l_pep)
SEQ ID NO: 15 (polypeptide, Musa acuminata, Z99959) SEQ ID NO: 16 (polypeptide, Zea mays, AY 105990)
SEQ ID NO:17 (polypeptide, Oryza sativa, BAC55635.1)
SEQ ID NO: 18 (PRIMER) SEQ ID NO: 19 (PRIMER)
SEQ ID NO:20 (Arabidopsis thaliana)

Claims

CLAIMS:
1. A method of altering condensed tannin production in a plant comprising transformation of a plant cell or plant with a genetic construct comprising a polynucleotide that alters the expression of a polypeptide comprising S X! X2 C C N X3 T I * D X5 F X6 X7 X8 X9 F E X10 L F X„ X12 R N (SEQ ID NO:l) involved in condensed tannin production in the plant wherein X^K, E, I or R, X2=Y or F, X3=H, Q, D or G, X^S or T, X5=Y or F, X6=Q, E or K, X7=T, V or A, X8=Y or H, X9=Q, K or H, X10= D, N, Q or E, Xπ=A, S or P and X12=K, R or H and wherein the polynucleotide encodes the polypeptide or a fragment thereof or the polynucleotide is complementary to said polynucleotide encoding the polypeptide or fragment.
2. A method of altering condensed tannin production in a plant comprising transformation of a plant cell or plant with a genetic construct comprising a sequence selected from SEQ ID NO:2, SEQ ID NO:3 and SEQ ID NO:4 or a functionally equivalent variant thereof, wherein said genetic construct encodes a polypeptide involved in condensed tannin production in plants.
3. A method as claimed in claim 2 wherein said sequence comprises SEQ ID NO:2 or a variant SEQ ID NO:2 having at least 70% homology with SEQ ID NO:2.
4. A method as claimed in claim 2 wherein said sequence comprises SEQ ID NO:3 or a variant of SEQ ID NO:3 having at least 70% homology with SEQ ID NO:3.
5. A method as claimed in claim 2 wherein said sequence comprises SEQ ID NO:4 or a variant of SEQ ID NO:4 having at least 70% homology with SEQ ID NO:4.
6. A method as claimed in claim 2 wherein the genetic construct comprises a sequence selected from SEQ ID NO: 2, SEQ ID NO:3 and SEQ ID NO:4 or a naturally occurring polynucleotide orthologue thereof wherein said genetic construct encodes a polypeptide involved in condensed tannin production in plants.
7. A method as claimed in claim 2 wherein said genetic construct comprises a sequence selected from SEQ ID NO:2, SEQ ID NO:3 and SEQ ID NO:4 or a functionally equivalent variant wherein said variant has at least 90% homology with one of SEQ ID NO:2, SEQ ID NO:3 and SEQ ID NO:4.
8. A transgenic plant having altered condensed tannin content comprising a polynucleotide that alters the expression of a polypeptide comprising S X\ X2 C C N X3 T I X4 D X5 F X6 X7 X8 X F E X10 L F Xπ X1 R N (SEQ ID NO:l) involved in condensed tannin production in the plant wherein X^K, E, I or R, X2=Y or F, X3=H, Q, D or G, X4=S or T, X5=Y or F, X6=Q, E or K, X7=T, V or A, X8=Y or H, X9=Q, K or H, X10= D, N, Q or E, Xj j=A, S or P and X12=K, R or H and wherein the polynucleotide encodes the polypeptide or a fragment thereof or polynucleotide is complementary to said polynucleotide encoding the polypeptide or fragment.
9. A transgenic plant having altered condensed tannin content comprising a genetic construct comprising a sequence selected from SEQ ID NO:2, SEQ ID NO:3 and SEQ ID NO:4 or a functionally equivalent variant thereof, wherein said genetic construct encodes a polypeptide involved in condensed tannin production in plants.
10. A plant as claimed in claim 9 or claim 10 wherein said sequence comprises SEQ ID NO:2 or a variant thereof having at least 70% homology with SEQ ID NO:2.
11. A plant as claimed in claim 10 said sequence comprises SEQ ID NO:3 or a variant thereof having at least 70% homology with SEQ ID NO:3.
12. A plant as claimed in claim 10 said sequence comprises SEQ ID NO:4 or a variant thereof having at least 70% homology with SEQ ID NO:4.
13. A plant as claimed in claim 9 or claim 10 wherein the genetic construct comprises a sequence selected from SEQ ID NO: 2, SEQ ID NO:3 and SEQ ID NO:4 or a naturally occurring polynucleotide orthologue thereof wherein said genetic construct includes a polypeptide involved in condensed tannin production in plants.
14. A plant as claimed in claim 9 or claim 10 wherein said genetic construct comprises a sequence selected from SEQ ID NO:2, SEQ ID NO:3 and SEQ ID NO:4 or a functionally equivalent variant wherein said variant has at least 90% homology with one of SEQ ID NO:2, SEQ ID NO:3 and SEQ ID NO:4.
15. A plant as claimed in any one of claims 8-14 wherein said plant is a forage plant.
16. A plant as claimed in claim 15 wherein said plant is a plant from a genus selected from Lolium, Festuca, Dactylis, Bromus, Trifolium, Medicago, Pheleum, Phalaris, Holcus, Lotus, Plantain and Cichorium.
17. A plant as claimed in claim 16 wherein said plant is selected from the species Lolium perennej d Trifolium repens.
18. A plant as claimed in claim 17 wherein the plant of the species Lolium per enne.
19. A method for selecting plants with altered condensed tannin content comprising the steps of
(a) contacting a sample from a plant with a polynucleotide selected from SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4 and orthologues thereof, or a fragment of one of these, as a probe or primer
(b) determining presence/content of complementary sequences to assess altered expression of a polypeptide involved in condensed tannin production in a plant; and (c) selection of a plant showing such altered expression.
20. A method for selecting plants with altered condensed tannin content comprising the steps of: (a) contacting a plant sample or plant with an antibody raised against a polypeptide encoded by a nucleotide sequence comprising SEQ ID NO:2, SEQ ID NO:3 and SEQ ID NO:4 or a naturally occurring orthologue thereof to assess altered expression of a polypeptide involved in condensed tannin production in a plant; and (b) selection of a plant shown such altered expression.
21. A plant selected by the method of claim 20 or claim 21.
22. A method as claimed in claim 1 or claim 2 wherein the genetic construct comprises a polynucleotide encoding the polypeptide or a functionally equivalent fragment thereof and the transformed plant overexpresses the polypeptide.
23 A plant as claimed in claim 8 or claim 9 wherein the genetic construct comprises a polynucleotide encoding the polypeptide or a functionally equivalent fragment thereof and the transformed plant overexpresses the polypeptide
24. A plant as claimed in claim 8 or claim 9 wherein the genetic construct comprises a polynucleotide which causes a reduction of the expression of the polypeptide
PCT/NZ2004/000171 2003-08-01 2004-08-02 Polynucleotides, polypeptides and their use in the production of plants with altered condensed tannins WO2005011366A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
NZ52732603 2003-08-01
NZ527326 2003-08-01

Publications (1)

Publication Number Publication Date
WO2005011366A1 true WO2005011366A1 (en) 2005-02-10

Family

ID=34114482

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/NZ2004/000171 WO2005011366A1 (en) 2003-08-01 2004-08-02 Polynucleotides, polypeptides and their use in the production of plants with altered condensed tannins

Country Status (1)

Country Link
WO (1) WO2005011366A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014164389A1 (en) * 2013-03-11 2014-10-09 Pioneer Hi-Bred International, Inc. Compositions and methods to enhance mechanical stalk strength in plants

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1033405A2 (en) * 1999-02-25 2000-09-06 Ceres Incorporated Sequence-determined DNA fragments and corresponding polypeptides encoded thereby
WO2000056908A2 (en) * 1999-03-24 2000-09-28 Pioneer Hi-Bred International, Inc. Maize chitinases and their use in enhancing disease resistance in crop plants
US20020040489A1 (en) * 2000-01-27 2002-04-04 Jorn Gorlach Expressed sequences of arabidopsis thaliana
US20030106097A1 (en) * 2001-07-30 2003-06-05 Haigler Candace H. Chitinase encoding DNA molecule from cotton expressed preferentially in fibers during secondary cell wall deposition and the corresponding promoter
US20040049808A1 (en) * 2001-07-30 2004-03-11 Haigler Candace H. Chitinase encoding DNA molecules from cotton expressed preferentially in secondary walled cells during secondary wall deposition and a corresponding promoter

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1033405A2 (en) * 1999-02-25 2000-09-06 Ceres Incorporated Sequence-determined DNA fragments and corresponding polypeptides encoded thereby
WO2000056908A2 (en) * 1999-03-24 2000-09-28 Pioneer Hi-Bred International, Inc. Maize chitinases and their use in enhancing disease resistance in crop plants
US20020040489A1 (en) * 2000-01-27 2002-04-04 Jorn Gorlach Expressed sequences of arabidopsis thaliana
US20030106097A1 (en) * 2001-07-30 2003-06-05 Haigler Candace H. Chitinase encoding DNA molecule from cotton expressed preferentially in fibers during secondary cell wall deposition and the corresponding promoter
US20040049808A1 (en) * 2001-07-30 2004-03-11 Haigler Candace H. Chitinase encoding DNA molecules from cotton expressed preferentially in secondary walled cells during secondary wall deposition and a corresponding promoter

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014164389A1 (en) * 2013-03-11 2014-10-09 Pioneer Hi-Bred International, Inc. Compositions and methods to enhance mechanical stalk strength in plants

Similar Documents

Publication Publication Date Title
AU2004228984B2 (en) Chalcone synthase dihydroflavonol 4-reductase and leucoanthocyanidine reductase from clover, medic ryegrass or fescue
AU2009255796B2 (en) Novel genes involved in biosynthesis
US5908973A (en) DNA encoding fruit-ripening-related proteins, DNA constructs, cells and plants derived therefrom
CA2690783A1 (en) Secondary wall forming genes from maize and uses thereof
MX2011003616A (en) Manipulation of glutamine synthetases (gs) to improve nitrogen use efficiency and grain yield in higher plants.
NZ514547A (en) Plastid alph-amylase protein and nucleic acids encoding same and methods for altering the starch content of a plant
US7199284B2 (en) Chlorophyllases
WO2014164014A1 (en) Genes for improving nutrient uptake and abiotic stress tolerance in plants
US20170037424A1 (en) Maize genes for controlling plant growth and organ size and their use in improving crop plants
MX2011005719A (en) Methods and compositions for the improvement of plant tolerance to environmental stresses.
WO2009061214A1 (en) Compositions and methods for modulating pigment production in plants
WO2004067723A2 (en) Floral development genes
CN101784656B (en) Nitrate reductases from porphyra, compositions and methods of use thereof
US20020133850A1 (en) Melon promoters for expression of transgenes in plants
WO2005011366A1 (en) Polynucleotides, polypeptides and their use in the production of plants with altered condensed tannins
WO2002005627A1 (en) Ctr1 homologue from melon
CA2572305A1 (en) Cell number polynucleotides and polypeptides and methods of use thereof
JP3987931B2 (en) Polynucleotides as gene fragments involved in biosynthesis of isoquinoline alkaloids, full-length genes and proteins involved in biosynthesis of isoquinoline alkaloids, and transformants thereof
US7179957B1 (en) Method for altering nitrogen or oil content of seeds by down regulating AGL11 expression or activity
AU2013202732B2 (en) Manipulation of soluble carbohydrates (4)
US6680185B1 (en) Plant polyphenol oxidase homologs
AU2012200604B2 (en) Manipulation of flowering and plant architecture (3)
WO2006011815A2 (en) Phosphatases, polynucleotides encoding these and uses thereof
JP2008528025A (en) Polypeptide having pyridoxine biosynthesis-related function, polynucleotide encoding the same, and use thereof
WO2011053167A1 (en) Transcription factor polynucleotides and their use

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
122 Ep: pct application non-entry in european phase