WO1999000501A1 - Materials and methods relating to a plant regulatory protein - Google Patents

Materials and methods relating to a plant regulatory protein Download PDF

Info

Publication number
WO1999000501A1
WO1999000501A1 PCT/GB1998/001861 GB9801861W WO9900501A1 WO 1999000501 A1 WO1999000501 A1 WO 1999000501A1 GB 9801861 W GB9801861 W GB 9801861W WO 9900501 A1 WO9900501 A1 WO 9900501A1
Authority
WO
WIPO (PCT)
Prior art keywords
nucleic acid
plant
sequence
ttgi
ttgl
Prior art date
Application number
PCT/GB1998/001861
Other languages
French (fr)
Inventor
John Clinton Gray
Amanda Ruth Walker
Original Assignee
Plant Bioscience Limited
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Plant Bioscience Limited filed Critical Plant Bioscience Limited
Priority to AU81224/98A priority Critical patent/AU8122498A/en
Priority to EP98930949A priority patent/EP1002086A1/en
Publication of WO1999000501A1 publication Critical patent/WO1999000501A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8241Phenotypically and genetically modified plants via recombinant DNA technology
    • C12N15/8261Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/415Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8241Phenotypically and genetically modified plants via recombinant DNA technology
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A40/00Adaptation technologies in agriculture, forestry, livestock or agroalimentary production
    • Y02A40/10Adaptation technologies in agriculture, forestry, livestock or agroalimentary production in agriculture
    • Y02A40/146Genetically Modified [GMO] plants, e.g. transgenic plants

Definitions

  • This invention relates to materials and methods relating to a plant regulatory protein. More particularly, the invention relates to the cloning and expression of the TTGI gene of Arabidopsis thaliana, and homologues from other species, and manipulation and use of the gene in plants.
  • TTGI transparent testa, glabra locus
  • TTGI locus plays a central role in many pathways in Arabidopsis thaliana . Many of these pathways are confined to effects on the epidermal cell layer of different tissues. Mutations at the TTGI locus have a large range of pleiotropic effects (Koornneef (1981) Arabid Inform. Serv. 18 45-51) . It is known that ttgl mutants have a glabrous phenotype with no leaf or stem hairs (trichomes) which are normally derived from the LI layer of cells, the outer single layer of cells covering the meristem that differentiates into all epidermal cells of the leaf.
  • anthocyanin pigments are present in the seed coat leading to the yellow cotyledons being visible through the transparent testa.
  • anthocyanins are present in the hypocotyl of seedlings and in the stem and leaves of plants as they age, and are inducible by many forms of stress including by high light, poor nutrients or water stress.
  • Mutants of the t tgl locus completely lack anthocyanins in the epidermis of leaves and stems (Koornneef (1981) Arabid Inform. Serv. 18 45-51) .
  • Tufts of mucilage are absent from ttgl mutant seeds and the seeds show no secretion of mucilage on imbibing, unlike wild-type plants (Koornneef (1981) Arabid Inform. Serv. 18 45-51) .
  • root hairs extend from root epidermal cells only in files of cells that contact two underlying cortical cells, whereas in ttgl mutants extra root hairs occur in the atrichoblast cell files (Galway et al (1994) Dev. Biol. 166 740-754) .
  • Seeds of ⁇ tgl mutant plants do not require drying and cold treatments to germinate and exhibit an altered seed dormancy compared to ecotypes such as Landsberg erecta (Koornneef et al (1982) Theoret Appl Genet 61 385-393) .
  • GL1 is a MYB transcription factor (Marks and Feld ann (1989) Plant Cell 1 1043-1050, Oppenheimer et al (1991) Cell 67 483-493) .
  • Mutations at another glabra locus GL2 have some features in common with ttgl , although gl2 mutants have normal anthocyanin content and have rudimentary trichomes suggesting the fate of these cells has already been determined
  • the GL2 gene is expressed in developing trichomes (Rerie et al (1994) Genes Dev. 8 1388-1389) and in the atrichoblast cell files of the roots (Masucci et al (1996) Development 122 1253-1260) . Transcript levels of GL2 are greatly reduced in ttgl mutants, suggesting that the TTGI gene product is required for normal expression of GL2 (Cristina et al (1996) Plant J 10 393-402) .
  • Delila and R are both MYC-like transcription factors (Ludwig et al (1989) Proc Natl Acad Sci USA 86 7092-7096, Goodrich et al (1992) Cell 68 955- 964) .
  • R has been shown to activate directly the transcription of several genes encoding anthocyanin biosynthetic enzymes in conjunction with a MYB transcription factor encoded by the Cl gene in maize (Goff et al (1992) Genes Dev. 6 864-875) .
  • Complementation of a ttgl mutant by cauliflower mosaic virus 35S promoter-i? constructs (Lloyd et al (1992) Science 258
  • TTGI might encode an Arabidopsis R homologue.
  • a further characterised transcription factor (Caprice [CPC] - see Wada et al 1997, Science 277, 1113-1116) may act in the opposite way to TTGI in promoting root hair development, and possible reducing the trichome number.
  • the TTGI locus has been broadly mapped by Koornneef (Koornneef et al (1982) Theoret Appl Genet 61 385-393) to chromosome 5 between msl and ga3 . At position 31.5 the ttgl locus has been used as a phenotypic marker in many crosses . Summary of the invention
  • the present inventors have identified the TTGI locus as a gene encoding a D40 repeat protein by complementation of a ttgl mutant with genomic DNA and by sequencing the gene in several ttgl mutant alleles.
  • the TTGI gene has now been cloned and sequenced and the inventors have demonstrated that it encodes a WD40 repeat protein with 7 repeat units .
  • the 1.6 kb transcript is present in all major organs.
  • the identification of the product of the TTGI locus as a D40 repeat protein rules out the possibility that the protein acts as a transcription factor, unlike the products of the other genes, GL1 and GL2, affecting trichome development. Additionally the TTGI protein bears no resemblance to the maize R gene.
  • the D40 repeat protein is a component of a signal transduction pathway which regulates expression or action of downstream transcription factors, and in particular that TTGI acts upstream of an Arabidopsis R homologue in the pathways leading to trichome differentiation and anthocyanin synthesis.
  • the TTGI gene has a novel sequence. No Arabidopsis genes showing significant homology to TTGI were identified in public databases. A protein of unknown function showing 61% amino acid identity is encoded on chromosome 3 of Arabidopsis (orflO in database accession number X98130) , but transcripts of this gene do not cross hybridise with TTGI at high stringency on Northern blots. However, a region of the TTGI protein showed homology to an Expressed Sequence Tag (EST) of unknown function. The EST came from a cell suspension culture from Eco type Columbia (clone library AC16H) .
  • EST Expressed Sequence Tag
  • TTGI gene shows 87.5% similarity to the anil gene from Petunia. This gene is discussed by Vetten et al (1997) Genes & Development 11: 1422-1434,
  • anil locus is described as controlling anthocyanin pigmentation and hence flower colour - but apparently does not exert the pleiotropic effects (e.g. trichomes, anthocyanin in other parts of the plant) of the TTGI gene which forms the basis of the present invention.
  • a genomic sequence encompassing Arabidopsis TTGI has recently (after the priority date of the present invention) been put on a database under accession number AB010068.
  • a nucleic acid molecule including a nucleotide sequence encoding a polypeptide with TTGI function.
  • TTGI function may be used to refer to the ability to manipulate the phenotypic characteristics of plants as described below when its expression is altered like the TTGI gene of Arabidopsis thaliana.
  • Manipulation of the phenotypic characteristics of plants may be achieved by altering the expression of the TTGI gene (by increasing/decreasing expression or by mutation) or by interfering with the normal function of the TTGI protein. Further, manipulation may be achieved by providing for the expression of a further homologous transcript which is able to interact with the expression of the TTGI gene in such a way as to either prevent translation of the transcript occurring or to boost the levels of transcripts being translated.
  • Trichomes hairs on aerial parts of plants: trichomes have a number of functions and the present invention provides a way to increase and decrease the number of trichomes on different organs to enhance their effectiveness.
  • the increase or decrease in the number of trichomas may be utilized in:
  • insect protection due to mechanical effects and to chemicals from glandular hairs . These could be increased, to increase insect protection, on leaves or on cotyledons which often do not have hairs. Protection of cotyledons from insect attack may allow faster seedling growth.
  • glandular trichomes are involved in producing pheromones, antifeedants and other chemicals, including essential oils, which may be increased if the number of trichomes is increased.
  • Hairs may also provide shade and protection for meristem in young seedlings, allowing faster seedling growth.
  • salt removal from leaves the presence of salt glands would allow trichomes to sequester or secrete salt . (Relatives of rice have microtrichomes of 2 cells which secrete salt.)
  • cotton fibres it may be advantageous to increase the number of cotton fibres per boll, and at the same time decrease leaf trichomes to prevent insects hiding and prevent contamination of bolls .
  • ornamental plants it may be preferred to decrease the number of hairs on hairy and glabrous varieties of a range of garden plants.
  • Trichomes on roots manipulation of the number of root hairs may affect water and nutrient absorption (crop nutrient use efficiency) by the plants. Root hairs are also involved in anchoring the plant in the soil, particularly sandy soils, and allow better root penetration.
  • Seed mucilage manipulation may lead to better seed germination in dry soils, due to maintenance of moisture around the seed.
  • Seed dormancy alteration of seed dormancy may allow quicker, or slower (if viviparous) , germination of seeds after harvest. This may lead to faster cycling of crops.
  • Anthocyanin pigments have a range of functions in the plant, and manipulation may alter pigmentation of seeds, leaves, flowers and fruit. Such manipulation may lead to:
  • UV-B protection of plants mainly in leaves, but anthocyanins are produced in plants under a wide range of stresses, including water stress, light stress, increased sugars. These stresses lead to decreased photosynthesis and susceptibility to photoxidation.
  • Altered flower and leaf colour in ornamentals and food crops eg broccoli; altered fruit and seed colour in food, eg aubergines and grains (maize, rice, etc) .
  • Condensed tannins produced by the polymerisation of anthocyanin precursors, are found in many plants and are responsible in part for the taste characteristics of a range of fruits and vegetables, such as apple, kiwifruit, gooseberry, redcurrant and banana. Condensed tannins produce characteristic astringent properties in tea, coffee, wine, spices and fruit juices. Tannins also have important effects in animal feedstuffs. In monogastric animals, such as pigs and chickens, tannins limit the use of potential feedstuffs such as faba beans and sorghum. In ruminants, moderate levels of tannins are beneficial and may improve retention of dietary nitrogen, but higher levels reduce the nutritive value of foliage and feedstuffs. Manipulation of TTGI may alter the levels of condensed tannins in these plants.
  • the present invention provides a nucleic acid isolate encoding a polypeptide including the amino acid sequence shown in Figure 3 (SEQ ID No. 2) or homologues thereof, which may include the coding sequence shown in Figure 3 which is that of the TTGI gene of Arabidopsis thaliana, and/or other transcribed parts of the gene e.g. as shown in Figure 3 or Figure 5.
  • Nucleic acid according to the present invention may have the sequence of an TTGI gene of Arabidopsis thaliana , or be a mutant, variant, derivative or allele or a homologue of the sequence provided.
  • Preferred mutants, variants, derivatives and alleles are those which encode a protein which retains a functional characteristic of the protein encoded by the wild-type gene, especially the ability to affect a physical characteristic of a plant, such as the phenotypic characteristics outlined above.
  • a mutant, variant, derivative or allele in accordance with the present invention may have the ability to affect a physical characteristic of a plant, particularly a phenotypic characteristic identified above.
  • a mutant, variant, derivative or allele may decrease the amount of anthocyanins in the epidermis of leaves and stems compared with wild-type on expression in a plant, e.g. compared with the effect obtained using a gene sequence expressing the polynucleotide sequence of Figure 3.
  • a mutant, variant, derivative or allele increases or decreases the number of trichomes on different organs compared with wild-type on expression in a plant, e.g. compared with the effect obtained using a gene sequence expressing the polynucleotide sequence of Figure 3.
  • Down-regulation of TTGI activity may be achieved by mutant nucleic acids (e.g through co-suppression) or by mutant polypeptides, which may compete for receptors or other binding sites for TTGI, without triggering appropriate effects.
  • nucleic acid according to the present invention may be used in the production of a wide variety of plants and for influencing a phenotypic characteristic thereof.
  • Changes to a sequence, to produce a mutant, variant or derivative may be by one or more of addition, insertion, deletion or substitution of one or more nucleotides in the nucleic acid, leading to the addition, insertion, deletion or substitution of one or more amino acids in the encoded polypeptide. Further, it may lead to the creation of stop codons resulting truncated polypeptide; removal of stop codons resulting in extended polypeptides; or a frameshift resulting in a polypeptide lacking TTGI function. Of course, changes to the nucleic acid which make no difference to the encoded amino acid sequence are included.
  • a preferred nucleic acid sequence for an TTGI gene is shown as the coding sequence within Figure 3/SEQ ID No. 1, alongside the predicted amino acid sequence of a polypeptide according to the present invention which has TTGI function (SEQ ID No. 2) .
  • Particular mutant alleles of the nucleic acid according to the present invention include : a) ttgl.10 (SEQ ID No. 3) which contains a point mutation (G to A) in the 5' untranslated part of the TTGI sequence (see Fig 3) ; b) ttgl.19 (SEQ ID No. 4) which results in the introduction of a stop codon at codon 183; c) ttgl.l (SEQ ID No . 5 - formerly designated ttgl.21) which results in the introduction of a stop codon at codon 317; d) ttgl.20 (SEQ ID No.
  • ttgl.9 has more anthocyanin present than ttgl.l (reference allele) while ttgl.10 has more trichomes, different seed mucilage and less anthocyanin than ttgl.9 (see Larkin et al, Plant Cell 6, 1065-1076 for an analysis) .
  • a mutant, allele, variant or derivative amino acid sequence in accordance with the present invention may include within the sequence shown in Figure 3, a single amino acid change with respect to the sequence shown in Figure 3, or 2, 3, 4, 5, 6, 7, 8, or 9 changes, about 10, 15, 20, 30, 40 or 50 changes, or greater than about 50, 60, 70, 80 or 90 changes.
  • a mutant, allele, variant or derivative amino acid sequence may include additional amino acids at the C-terminus and/or N-terminus.
  • a sequence related to a sequence specifically disclosed herein shares homology with that sequence.
  • Homology may be at the nucleotide sequence and/or amino acid sequence level.
  • the nucleic acid and/or amino acid sequence shares homology with the nucleotide sequence of Figure 3, or the amino acid sequence encoded thereby.
  • the homology is at least about 50%, or 60%, or 70%, or 80% homology, most preferably at least about 90%, 95%, 96%, 97%, 98% or 99% homology.
  • homology at the amino acid level is generally in terms of amino acid similarity or identity. Similarity allows for "conservative variation", i.e. substitution of one hydrophobic residue such as isoleucine, valine, leucine or methionine for another, or the substitution of one polar residue for another, such as arginine for lysine, glutamic for aspartic acid, or glutamine for asparagine . Similarity may be as defined and determined by the TBLASTN program, of Altschul et al . (1990) J. Mol . Biol . 215: 403-10, which is in standard use in the art, or, and this may be preferred, the standard program BestFit, which is part of the Wisconsin Package, Version 8, September 1994,
  • BestFit makes an optimal alignment of the best segment of similarity between two sequences. Optimal alignments are found by inserting gaps to maximize the number of matches using the local homology algorithm of Smith and Waterman.
  • Homology may be over the full-length of the relevant sequence shown herein, or may more preferably be over a contiguous sequence of about or greater than about 20, 25, 30, 33, 40, 50, 67, 133, 167, 200, 233, 267, 300, 333 or more amino acids or codons, compared with the relevant amino acid sequence or nucleotide sequence as the case may be .
  • nucleic acid including or consisting essentially of a sequence of nucleotides complementary to a nucleotide sequence with any sequence provided herein. Further, there is provided nucleic acid including or consisting essentially of a sequence of nucleotides complementary to a nucleotide sequence hybridisable with any encoding sequence provided herein. Another way of looking at this would be for nucleic acid according to this aspect to be hybridisable with a nucleotide sequence complementary to any encoding sequence provided herein.
  • DNA is generally double-stranded and blotting techniques such as Southern hybridisation are often performed following separation of the strands without a distinction being drawn between which of the strands is hybridising.
  • the hybridisable nucleic acid or its complement encode a product able to influence a physical characteristic of a plant, particularly a phenotypic characteristic as described above.
  • Preferred conditions for hybridisation are familiar to those skilled in the art, but are generally stringent enough for there to be positive hybridisation between the sequences of interest to the exclusion of other sequences .
  • SSC Standard Saline Citrate
  • the screening is carried out at about 37°C, a formamide concentration of about 20%, and a salt concentration of about 5 X SSC, or a temperature of about 50°C and a salt concentration of about 2 X SSPE.
  • These conditions will allow the identification of sequences which have a substantial degree of homology (similarity, identity) with the probe sequence, without requiring the perfect homology for the identification of a stable hybrid. Suitable conditions include, e.g.
  • nucleic acid for detection of sequences that are about 80-90% identical, hybridization overnight at 42°C in 0.25M Na 2 HP0 4 , pH 7.2, 6.5% SDS, 10% dextran sulfate and a final wash at 55°C in 0. IX SSC, 0.1% SDS.
  • suitable conditions include hybridization overnight at 65°C in 0.25M Na 2 HP0 4 , pH 7.2, 6.5% SDS, 10% dextran sulfate and a final wash at 60°C in 0.1X SSC, 0.1% SDS.
  • the nucleic acid which may contain, for example,
  • DNA encoding the amino acid sequence of Figure 3 may be in the form of a recombinant and preferably replicable vector, for example a plasmid, cosmid, phage or AgroJbacterium binary vector.
  • the nucleic acid may be under the control of an appropriate promoter or other regulatory elements for expression in a host cell such as a microbial, e.g. bacterial, or plant cell.
  • a host cell such as a microbial, e.g. bacterial, or plant cell.
  • genomic DNA this may contain its own promoter or other regulatory elements and in the case of cDNA this may be under the control of an appropriate promoter or other regulatory elements for expression in the host cell.
  • a vector including nucleic acid according to the present invention need not include a promoter or other regulatory sequence, particularly if the vector is to be used to introduce the nucleic acid into cells for recombination into the genome.
  • Suitable vectors can be chosen or constructed, containing appropriate regulatory sequences, including promoter sequences, terminator fragments, polyadenylation sequences, enhancer sequences, marker genes and other sequences as appropriate.
  • appropriate regulatory sequences including promoter sequences, terminator fragments, polyadenylation sequences, enhancer sequences, marker genes and other sequences as appropriate.
  • Molecular Cloning a Laboratory- Manual : 2nd edition, Sambrook et al , 1989, Cold Spring Harbor Laboratory Press.
  • Many known techniques and protocols for manipulation of nucleic acid for example in preparation of nucleic acid constructs, mutagenesis, sequencing, introduction of DNA into cells and gene expression, and analysis of proteins, are described in detail in Current Protocols in Molecular Biology, Second Edition, Ausubel et al .
  • Nucleic acid molecules and vectors according to the present invention may be provided isolated and/or purified from their natural environment, in substantially pure or homogeneous form, or free or substantially free of nucleic acid or genes of the species of interest or origin other than the sequence encoding a polypeptide with the required function.
  • Nucleic acid according to the present invention may include cDNA, RNA, genomic DNA and may be wholly or partially synthetic. The term "isolate" encompasses all these possibilities. Where a DNA sequence is specified, e.g. with reference to a figure, unless context requires otherwise, the RNA equivalent, with U substituted for T where it occurs, is encompassed.
  • nucleic acid to be inserted should be assembled within a construct which contains effective regulatory elements which will drive transcription. There must be available a method of transporting the construct into the cell. Once the construct is within the cell membrane, integration into the endogenous chromosomal material either will or will not occur. Finally, as far as plants are concerned the target cell type must be such that cells can be regenerated into whole plants .
  • Plants transformed with the DNA segment containing the sequence may be produced by standard techniques which are already known for the genetic manipulation of plants. Plants can be transformed with DNA using any suitable technology, such as a disarmed Ti-plasmid vector carried by Agrobacterium exploiting its natural gene transfer ability (EP-A-270355, EP-A-0116718 , NAR 12(22) 8711 -
  • Agrobacterium transformation is widely used by those skilled in the art to transform dicotyledonous species . Recently, there has been substantial progress towards the routine production of stable, fertile transgenic plants in almost all economically relevant monocot plants (Toriyama, et al . (1988) Bio/Technology 6, 1072-1074; Zhang, et al . (1988) Plant Cell Rep . 7, 379-384; Zhang, et al . (1988) Theor Appl Genet 76, 835-840; Shimamoto, et al . (1989) Nature 338, 274-276; Datta, et al . (1990) Bio/Technology 8, 736-740; Christou, et al .
  • Agrobacterium mediated transformation is now emerging also as an highly efficient alternative transformation method in monocots (Hiei et al . (1994) The Plant Journal 6, 271-282).
  • a combination of different techniques may be employed to enhance the efficiency of the transformation process, eg bombardment with Agrobacterium coated microparticles (EP-A-486234) or microprojectile bombardment to induce wounding followed by co-cultivation with Agrobacterium (EP-A-486233) .
  • a plant may be regenerated, e.g. from single cells, callus tissue or leaf discs, as is standard in the art. Almost any plant can be entirely regenerated from cells, tissues and organs of the plant . Available techniques are reviewed in Vasil et al . , Cell Cul ture and Somatic Cell Genetics of Plants, Vol I, II and III, Laboratory Procedures and Their Applications , Academic Press, 1984, and Weissbach and Weissbach, Methods for Plant Molecular Biology, Academic Press, 1989.
  • a TTGI gene and modified versions thereof may be used to affect a physical characteristic, such as hairs on roots and aerial parts of plants and anthocyanin pigments characteristics, in plants.
  • nucleic acid such as a vector as described herein may be used for the production of a transgenic plant.
  • Such a plant may possess an altered phenotype as described above compared with wild-type (that is to say a plant that is wild-type for TTGI or the relevant homologue thereof) .
  • the invention further encompasses a host cell transformed with nucleic acid or a vector according to the present invention, especially a plant or a microbial cell.
  • a host cell such as a plant cell, including heterologous nucleic acid according to the present invention is provided.
  • the nucleic acid may be incorporated within the chromosome . There may be more than one heterologous nucleotide sequence per haploid genome .
  • a plant cell having incorporated into its genome nucleic acid, particularly heterologous nucleic acid, as provided by the present invention, under operative control of a regulatory sequence for control of expression.
  • the coding sequence may be operably linked to one or more regulatory sequences which may be heterologous or foreign to the gene, such as not naturally associated with the gene for its expression.
  • the nucleic acid according to the invention may be placed under the control of an externally inducible gene promoter to place expression under the control of the user.
  • a suitable inducible promoter is the GST-II-27 gene promoter which has been shown to be induced by certain chemical compounds which can be applied to growing plants.
  • the promoter is functional in both monocotyledons and dicotyledons. It can therefore be used to control gene expression in a variety of genetically modified plants, including field crops such as canola, sunflower, tobacco, sugarbeet, cotton; cereals such as wheat, barley, rice, maize, sorghum; fruit such as tomatoes, mangoes, peaches, apples, pears, strawberries, bananas, and melons; and vegetables such as carrot, lettuce, cabbage and onion.
  • the GST-II-27 promoter is also suitable for use in a variety of tissues, including roots, leaves, stems and reproductive tissues .
  • a further aspect of the present invention provides a method of making such a plant cell involving introduction of nucleic acid or a suitable vector including the sequence of nucleotides into a plant cell and causing or allowing recombination between the vector and the plant cell genome to introduce the sequence of nucleotides into the genome.
  • the invention extends to plant cells containing nucleic acid according to the invention as a result of introduction of the nucleic acid into an ancestor cell.
  • heterologous may be used to indicate that the gene/sequence of nucleotides in question have been introduced into said cells of the plant or an ancestor thereof, using genetic engineering, i.e. by human intervention.
  • a transgenic plant cell i.e. transgenic for the nucleic acid in question, may be provided.
  • the transgene may be on an extra-genomic vector or incorporated, preferably stably, into the genome.
  • a heterologous gene may replace an endogenous equivalent gene, i.e. one which normally performs the same or a similar function, or the inserted sequence may be additional to the endogenous gene or other sequence.
  • nucleic acid heterologous, or exogenous or foreign, to a plant cell may be non-naturally occurring in cells of that type, variety or species.
  • nucleic acid may include a coding sequence of or derived from a particular type of plant cell or species or variety of plant, placed within the context of a plant cell of a different type or species or variety of plant.
  • a further possibility is for a nucleic acid sequence to be placed within a cell in which it or a homologue is found naturally, but wherein the nucleic acid sequence is linked and/or adjacent to nucleic acid which does not occur naturally within the cell, or cells of that type or species or variety of plant, such as operably linked to one or more regulatory sequences, such as a promoter sequence, for control of expression.
  • a sequence within a plant or other host cell may be identifiably heterologous, exogenous or foreign.
  • Plants which include a plant cell according to the invention are also provided, along with any part or propagule thereof, seed, selfed or hybrid progeny and descendants.
  • a plant according to the present invention may be one which does not breed true in one or more properties.
  • Plant varieties may be excluded, particularly registrable plant varieties according to Plant Breeders' Rights. It is noted that a plant need not be considered a "plant variety” simply because it contains stably within its genome a transgene, introduced into a cell of the plant or an ancestor thereof.
  • the present invention provides any clone of such a plant, seed, selfed or hybrid progeny and descendants, and any part of any of these, such as cuttings, seed.
  • the invention provides any plant propagule, that is any part which may be used in reproduction or propagation, sexual or asexual, including cuttings, seed and so on.
  • the invention further provides a method of influencing or affecting a physical characteristic e.g. hairs on roots and aerial parts of plants and/or the presence or absence of anthocyanin pigment, including causing or allowing expression of a heterologous nucleic acid sequence as discussed within cells of the plant.
  • a physical characteristic e.g. hairs on roots and aerial parts of plants and/or the presence or absence of anthocyanin pigment
  • the invention further provides a method including expression from nucleic acid encoding the amino acid sequence of Figure 3 , or a mutant, variant, allele or derivative of the sequence, within cells of a plant (thereby producing an encoded polypeptide) , following an earlier step of introduction of the nucleic acid into a cell of the plant or an ancestor thereof .
  • a method may influence or affect a phenotypic characteristic of the plant, such as those mentioned above. This may be used in combination with any other gene, such as transgenes involved in any other phenotypic trait or desirable property.
  • the present invention also encompasses the expression product of the nucleic acid sequence disclosed and methods of making the expression product by expression from nucleic acid encoding therefor under appropriate conditions, which may be in appropriate host cells. Following expression, the product may be isolated from the expression system and may be used as desired, for instance in formulation of a composition including at least one additional component.
  • Purified TTGI protein or a variant thereof, e.g. produced recombinantly by expression from encoding nucleic acid therefor, may be used to raise antibodies employing techniques which are standard in the art.
  • Antibodies and polypeptides comprising antigen-binding fragments of antibodies may be used in identifying homologues from other plant species.
  • Methods of producing antibodies include immunising a mammal with the protein or a fragment thereof.
  • Antibodies may be obtained from immunised animals using any of a variety of techniques known in the art, and might be screened, preferably using binding of antibody to antigen of interest.
  • antibodies with appropriate binding specificity may be obtained from a recombinantly produced library of expressed immunoglobulin variable domains, e.g. using lambda bacteriophage or filamentous bacteriophage which display functional immunoglobulin binding domains on their surfaces; for instance see WO92/01047.
  • a further aspect of the present invention provides a method of identifying and cloning TTGI homologues from plant species other than Arabidopsis thaliana which method employs a nucleotide sequence obtainable from that shown in Figure 3.
  • Such a method may include the steps of preparing nucleic acid from plant cells under test, providing a nucleic acid molecule having a nucleotide sequence shown in Figure 3 or complementary to a nucleic acid sequence shown in Figure 3, contacting nucleic acid in said preparation with said nucleic acid molecule under conditions for hybridization of said nucleic acid molecule to any said gene or homologue in said preparation, and identifying said gene or homologue if present by its hybridization with said nucleic acid molecule .
  • nucleotide sequence information may be used in a data-base search to find homologous sequences, expression products of which can be tested for ability to influence characteristics described above. These may have TTGI function or the ability to modify characteristics including hairs on roots and aerial parts of plants and anthocyanin pigments.
  • nucleic acid libraries may be screened using techniques well known to those skilled in the art and homologous sequences thereby identified then tested for requisite functionality.
  • nucleotide sequences obtained from that shown in figure 3 may be used to isolate TTGI homologous from other species of plants by techniques such as hybridization and polymerase chain reaction (PCR) .
  • PCR polymerase chain reaction
  • oligonucleotide probes or primers as well as the full-length sequence (and mutants, alleles, variants and derivatives) are also useful in identifying homologous sequences.
  • the present invention also extends to oligonucleotide probes or primers for amplification and/or identification which are obtainable by use of the sequence shown in Figure 3, optionally by selecting regions which are conserved with other sequences e.g. from the prior art. Alternatively it may be desirable to generate more specific primers by selecting regions of the TTGI which are not homologous to other proteins such as anil.
  • oligonucleotides according to the present invention that are fragments of the sequence shown in Figure 3, or any mutant, allele, variant, or derivatives thereof, are at least 10 nucleotides in length, more preferably at least 15 nucleotides in length, more preferably at least 20 nucleotides in length.
  • Such fragments themselves individually represent aspects of the present invention. Techniques corresponding to those above may also be used for ascertaining the genotype of mutant plants having altered phenotypes corresponding to TTGI activities (e.g which lack trichomes or anthocyanin) i.e. the probes and primers of the present invention can be used for diagnosing mutations in such plants, or as markers for these traits.
  • TTGI activities e.g which lack trichomes or anthocyanin
  • the present invention also extends to nucleic acid encoding an TTGI homologue obtained using a nucleotide sequence derived from that shown in Figure 3.
  • nucleic acid according to the present invention encodes a polypeptide which has homology with all or part of the amino acid sequence shown in Figure 3, in the terms discussed already above (e.g. for length), which homology is greater over the length of the relevant part (i.e. fragment) (the relevant part being greater than 110 amino acids in length, preferably greater than 200 amino acids and even more preferably greater than 300 amino acids in length) than the homology shared between a respective part of the amino acid sequence of Figure 3 and the EST sequence, and may be greater than about 5% greater, more preferably greater than about 10% greater, more preferably greater than about 20% greater, and more preferably greater than about 30% greater.
  • nucleic acid according to certain embodiments of the present invention may have homology with all or part of the nucleotide sequence shown in Figure 3, in the terms discussed already above (e.g. for length) , which homology is greater over the length of the relevant part (i.e. fragment) (the relevant part being greater than 350 nucleotide in length, preferably greater than 400 and even more preferably greater than 500 nucleotide in length) than the homology shared between a respective part of the nucleotide sequence of Figure 3 and may be greater than about 5% greater, more preferably greater than about 10% greater, more preferably greater than about 20% greater, and more preferably greater than about 30% greater.
  • nucleic acid may be provided in accordance with the present invention wherein the nucleotide sequence includes a contiguous sequence of about 350 nucleotides which has greater homology with a contiguous sequence of 350 nucleotides within the nucleotide sequence of Figure 3 than any contiguous sequence of 331 nucleotides of an EST sequence, preferably greater than about 5% greater homology, and so on.
  • sequence information for the TTGI gene of Arabidopsis thaliana enables the obtention of homologous sequences from other plant species.
  • TTGI homologues from related, commercially important Brassica species (e.g. Brassica nigra, Brassica napus, Brassica campestris and Brassica oleracea) .
  • Examples of homologues from Matthiola incana (ten week stock) , Nicotiana tobaccum var Samsum (tobacco) and Gossypium hirsutum cv. Siokva 1-4 (cotton) are disclosed in the Examples below.
  • nucleic acid molecules which encode amino acid sequences which are homologues of TTGI of
  • Arabidopsis thaliana Homology may be at the nucleotide sequence and/or amino acid sequence level, as has already been discussed above.
  • a homologue from a species other than Arabidopsis thaliana encodes a product which causes a phenotype similar to that caused by the Arabidopsis thaliana TTGI gene, generally including the ability to influence a phenotypic characteristic, particularly a phenotypic characteristic as described above.
  • mutants, derivatives or alleles of these genes may alter such characteristics compared with wild-type.
  • TTGI gene homologues may also be identified from economically important monocotyledonous crop plants such as rice and maize. Although genes encoding the same protein in monocotyledonous and dicotyledonous plants show relatively little homology at the nucleotide level, amino acid sequences are conserved. Therefore it is possible to use public sequence databases to identify
  • Arabidopsis, rice or maize cDNA clone sequences that were obtained in random sequencing programmes and share homology to the gene of interest, as has been done for flowering time genes isolated from Arabidopsis (e.g. CO; WO 96/14414) .
  • Nucleic acid according to the invention may be used to modify the characteristics of a plant . This may be achieved by modification of expression of the nucleic acid according to the present invention or by interfering with the normal function of the protein encoded by the nucleic acid according to the present invention.
  • nucleic acid according to the present invention may be used to increase or decrease the number of trichomes on different organs to enhance their effectiveness. Further, it may be used to alter the pigmentation of seeds, leaves, flowers and fruit for UV protection and/or colour for presentation reasons or for ornamental plants. This may involve use of anti-sense or sense regulation, discussed further below.
  • other physical characteristics of plants may be affected by means of expression from nucleic acid according to the present invention.
  • Nucleic acid according to the invention such as an TTGI gene or homologue, may be placed under the control of an externally inducible gene promoter to place the timing of altering the characteristics of the plant under the control of the user.
  • An advantage of introduction of a heterologous gene into a plant cell, particularly when the cell is comprised in a plant, is the ability to place expression of the gene under the control of a promoter of choice, in order to be able to influence gene expression, and therefore characteristic modification, according to preference.
  • mutants and derivatives of the wild-type gene eg with higher or lower activity than wild-type, may be used in place of the endogenous gene.
  • over-expression may be achieved by introduction of the nucleotide sequence in a sense orientation.
  • the present invention provides a method of influencing a physical e.g. a phenotypic characteristic described above such as, an increase or decrease in trichomes, characteristic of a plant, the method including causing or allowing expression of the product (polypeptide or nucleic acid transcript) encoded by heterologous nucleic acid according to the invention from that nucleic acid within cells of the plant.
  • a physical e.g. a phenotypic characteristic described above such as, an increase or decrease in trichomes, characteristic of a plant
  • the method including causing or allowing expression of the product (polypeptide or nucleic acid transcript) encoded by heterologous nucleic acid according to the invention from that nucleic acid within cells of the plant.
  • Down-regulation of expression of a target gene may be achieved using anti-sense technology or "sense regulation" ("co-suppression”) .
  • a nucleotide sequence is placed under the control of a promoter in a "reverse orientation" such that transcription yields RNA which is complementary to normal mRNA transcribed from the "sense" strand of the target gene.
  • a promoter in a "reverse orientation” such that transcription yields RNA which is complementary to normal mRNA transcribed from the "sense" strand of the target gene.
  • Antisense technology is also reviewed in Bourque, (1995) , Plant Science 105, 125-149, and Flavell, (1994) PNAS USA 91, 3490-3496.
  • An alternative is to use a copy of all or part of the target gene inserted in sense, that is the same, orientation as the target gene, to achieve reduction in expression of the target gene by co-suppression.
  • van der Krol et al . (1990) The Plant Cell 2 , 291-299; Napoli et al . , (1990) The Plant Cell 2 , 279- 289; Zhang et al . , (1992) The Plant Cell 4, 1575-1588, and US-A-5,231,020.
  • the complete sequence corresponding to the coding sequence need not be used. For example fragments of sufficient length may be used. It is a routine matter for the person skilled in the art to screen fragments of various sizes and from various parts of the coding sequence to optimise the level of anti-sense inhibition. It may be advantageous to include the initiating methionine ATG codon, and perhaps one or more nucleotides upstream of the initiating codon. A further possibility is to target a conserved sequence of a gene, e.g. a sequence that is characteristic of one or more genes, such as a regulatory sequence .
  • the sequence employed may be about 500 nucleotides or less, possibly about 400 nucleotides, about 300 nucleotides, about 200 nucleotides, or about 100 nucleotides. It may be possible to use oligonucleotides of much shorter lengths, 14-23 nucleotides, although longer fragments, and generally even longer than about 500 nucleotides are preferable where possible, such as longer than about 600 nucleotides, than about 700 nucleotides, than about 800 nucleotides, than about 1000 nucleotides or more.
  • a sequence employed in a down-regulation of gene expression in accordance with the present invention may be a wild-type sequence (e.g. gene) selected from those available, or a mutant, derivative, variant or allele, by way of insertion, addition, deletion or substitution of one or more nucleotides, of such a sequence.
  • the sequence need not include an open reading frame or specify an RNA that would be translatable. It may be preferred for there to be sufficient homology for the respective anti-sense and sense RNA molecules to hybridise. There may be down regulation of gene expression even where there is about 5%, 10%, 15% or 20% or more mismatch between the sequence used and the target gene .
  • the transcribed nucleic acid may represent a fragment of an TTGI gene, such as including a nucleotide sequence shown in Figure 3, or the complement thereof, or may be a mutant, derivative, variant or allele thereof, in similar terms as discussed above in relation to alterations being made to an TTGI coding sequence and the homology of the altered sequence.
  • the homology may be sufficient for the transcribed anti-sense RNA to hybridise with nucleic acid within cells of the plant, though irrespective of whether hybridisation takes place the desired effect is down-regulation of gene expression.
  • the present invention also provides a method of influencing a characteristic of a plant such as any one of those described above, the method including causing or allowing anti-sense transcription from heterologous nucleic acid according to the invention within cells of the plant.
  • the present invention further provides the use of the nucleotide sequence of Figure 3 or a fragment, mutant, derivative, allele, variant or homologue thereof for down-regulation of gene expression, particularly down-regulation of expression of an TTGI gene or homologue thereof, preferably in order to influence a physical characteristic of a plant, especially a phenotypic characteristic such as an increase or decrease of trichomes on different organs and/or and increase or decrease in anthocyanin pigments.
  • a range of phenotypes is produced which includes individuals where over-expression occurs and some where under-expression of protein from the target gene occurs .
  • ribozymes e.g. hammerhead ribozymes, which can catalyse the site-specific cleavage of RNA, such as mRNA (see e.g. Jaeger (1997) "The new world of ribozymes” Curr Opin Struct Biol 7:324-335, or Gibson & Shillitoe (1997) "Ribozymes : their functions and strategies form their use” Mol Biotechnol 7: 242-251.).
  • nucleic acids enconding non- functional or partially functional mutant proteins e.g. encoded by the mutant alleles of the present invention, or as produced by mutagenesis
  • TTGI proteins e.g. receptors or other binding partners
  • the present invention also provides a method of influencing a phenotypic characteristic of a plant, the method including causing or allowing expression from nucleic acid according to the invention within cells of the plant.
  • This may be used to suppress activity of a product with ability to influence a phenotypic characteristic as described above.
  • the activity of the product is preferably suppressed as a result of under-expression within the plant cells.
  • TTGI gene also embraced within the present invention are untranscribed parts of the TTGI gene.
  • a nucleic acid molecule encoding the promoter of the TTGI gene . Owing to the widespread presence of the TTGI transcript in the plant it is believed that this promoter is constitutive or essentially constitutive, and thus may have utility in producing constructs for the expression of genes in plants .
  • Variant promoters having promoter activity are also embraced by the present invention.
  • restriction enzyme or nucleases may be used to digest a nucleic acid molecule comprising the 5' region of Seq ID No 1, or mutagenesis may be employed, followed by an appropriate assay (for example using a reporter gene such as luciferase operably linked to the restricted sequence) .
  • Methods for promoter identification may employed without burden by those skilled in the art in the light of the sequence data disclosed herein.
  • YAC end probes are indicated by boxes; and B) a fine scale map of the TTGI region determined by RFLP mapping. End probes generated from YACs are indicated as boxes.
  • the position of the cosmid g4556 is shown in relation to the genomic lambda isolated from this region. The recombination points are marked with a cross.
  • Figure 2 This shows a map of genomic clones used to complement ttgl mutants . It shows the genomic fragments that have been used to complement the ttgl mutant phenotype . Fragment (a) represents the genomic 13.8kb insert in pB8 with the EcoRI (E) and Xbal (X) restriction sites some of which are used to create the deletion.
  • Fragment (b) represents pB8DXl
  • (c) represents pB8DE2
  • (d) represents pB8DE3. Deletions are indicated by the dotted lines. Fragments (a) and (d) both gave transformed plants with trichomes and anthocyanin. Transformants of (b) and (c) lacked both trichomes and anthocyanin.
  • TTGI locus SEQ ID No. 1
  • the intron coding sequence
  • SEQ ID No. 2 the predicted amino acid sequence
  • Five identified mutations are shown (ttgl.10, ttgl.19, ttgl.20, ttgl.l, ttgl .9 - SEQ ID No's. 3 to 7 respectively) .
  • Sixty bases are shown per line.
  • TTGI DNA sequence shown in Fig 3 including promoter region, full cDNA sequence, and coding sequence (which is aligned with the amino acid sequence) .
  • Recombinants between the ttgl . l and MSI loci generated in a cross between Landsberg erecta carrying ttgl and msl and Ws ecotypes were analysed using RFLPs (restriction fragment length polymorphism) between these parents with probes already mapped to this region by Nam et al (Nam et al (1989) Plant Cell 1 699-705.).
  • Recombinants on the distal side of ttgl . l were selected from a cross of Landsberg erecta carrying the ttgl . l mutation and ga3 and ch5 and the RLD1 ecotype which was wildtype for these loci.
  • End probes from YACs that had been mapped to the region between msl and ga3 were also utilized to map the location of ttgl.
  • RFLPs generated by the cosmid g4556 could not be separated from ttgl with the exception of one recombinant called Dennis 19 (on the msl side of ttgl) suggesting that g4556 was very close to the mutation in ttgl . l .
  • a YAC EG20H2 that hybridised to g4556 and the cosmid were used to isolate overlapping genomic lambda clones.
  • FIG. 1 shows the large scale and fine scale maps from the TTGI region that were derived from the analysis of the recombinants on both sides of the ttgl . l mutation.
  • Figure IB the position of several recombination events between msl and ttgl have been indicated. On the distal side of ttgl no nearby recombination events could be mapped due to lack of RFLPs between the ecotypes used and lack of success in isolating clones from this region from three different libraries in lambda or cosmid vectors using a variety of probes .
  • Agro acteriuiT! tumefaciens strain AGL1 (Lazo et al (1991) Biotechnology 9 963-967) and then used to transform Arabidopsis ecotype Landsberg erecta carrying the ttgl . l mutation by co-cultivation with root explants (Valvekens et al (1988) Proc Natl Acad Sci USA 85 5536-5540) . Only a small number of kanamycin-resistant transformants were obtained, but one plant from pB8, derived from lambda 8, had trichomes but failed to set seed. Transformants from pBl.lA did not have trichomes.
  • kanamycin-resistant shootlets appeared to be escapes due to prolonged exposure to kanamycin in the callus stage.
  • Several deletions were made of pB8 utilizing restriction sites within the genomic sequence and the polylinker of the vector. These deletion constructs (shown in Figure 2) were used to transform Arabidopsis ecotype Columbia carrying the ttgl . 9 mutation via vacuum infiltration (Bechtold et al (1993) Compt Rend Acad Sci Ill-Life Sci 316 1194-1199) .
  • Sixty transformants from pB8DE3 (indicated as construct d) produced trichomes, although one transformant from pB8DE3 showed kanamycin resistance but had no trichomes.
  • transformants bearing trichomes exhibited other wild-type characteristics of brown seed, seed mucilage, purple colouring of the plant and normal root hair numbers indicating that the other ttgl mutant phenotypes had also been complemented.
  • Thirty-six kanamycin-resistant transformants from pB8DE2 (construct c) and 19 transformants from pB8DXl (construct b) failed not produce trichomes, suggesting that TTGI was located in the regions deleted in these constructs .
  • the positional cloning of the TTGI locus has provided information about the order of, and distances between, a number of RFLP markers which may be used to isolate nearby genes. This information is complementary to the data given in the physical maps of the region (Schmidt et al . , 1997 Plant J 11: 563-572; Thorlby et al., 1997 Plant J 12, 471-479). Although the present inventors analysed about 400 recombinants within a 14 map unit region, they were still unable to find breakpoints very close to the TTGI locus . This suggests that recombination rates are reduced close to this gene. Recombination frequencies are known to vary along chromosomes in many species (Livier and Goldman, 1995 Annu Rev Genet 29: 445-476 for a review) .
  • the other predicted gene corresponded to an Arabidopsis EST (F20055, F20056) , indicating that the gene was functional and expressed.
  • the predicted protein sequence of 341 amino acids shows sequence similarity (about 45 %) to a large and diverse group of proteins with WD40 repeat motifs. There are seven WD40 repeats with a short N-terminal region. The first two repeats contain a proline-rich region - the second repeat having 8/23 amino acids which are proline. Three possible TATA boxes have been identified 133, 189 and 216 bases upstream of the predicted start of translation. Comparison of the genomic and EST sequences indicated the presence of a single intron 3' of the termination codon. The sequence including the promoter region is shown in Figure 3. Primer extension experiments indicated that the start of transcription is 109 bases 5' of the start of translation (i.e. 23 bases from a TATA box) .
  • the present inventors examined the nucleotide sequence of this region of a number of ttgl mutant alleles to determine whether the gene encoding the WD40 repeat protein was likely to be the TTGI locus. PCR products from the region were generated with primers designed to give overlapping fragments of about 700 bp. These PCR products were obtained from the ttgl mutants ttgl . 9, ttgl . 10, ttgl . 19 and ttgl . l and from their parental wild-type alleles using genomic DNA as the template. The PCR products were gel-purified and sequenced using both oligonucleotides designed as primers for PCR.
  • a point mutation in ttgl . 20 resulted in the change of serine to cysteine at codon 30. This allele, plus also alleles ttgl . 15-18 contained a premature stop codon at position 310.
  • a mutation in ttgl . 9 resulted in the change of serine to phenylalanine at codon 282.
  • the mutant allele ttgl . 10 contained a point mutation (G to A) in the 5' untranslated leader sequence, which may affect the translation of the RNA. These changes in the gene encoding the WD40 repeat protein confirm its identity as TTGI .
  • mutants have a lower level of transcript detected by northern analysis and compared with message levels from the wildtype parent (results not shown) .
  • the reduction in message level in mutants such as ttgl -9 could be due to nonsense-mediated mRNA decay which has been shown to occur in plants as well as in other organisms (Dickey et al . , 1994 Plant Cell 6, 1171-1176; van Hoof and Green, 1996 Plant J 10: 415-424).
  • TTGI effect on stomata appears to be analogous to control of root hairs (Berger et al, 1998, Dev Biol 194: 226-234) .
  • the table below shows a comparison between the stomatal numbers on hypocotyls in ttgl mutants compared to wild-type in air and at elevated C0 2 concentrations.
  • RNA blot was hybridized with a TTGI probe.
  • the resulting band was measured to be 1.35 kb in length and present in all organs tested (roots, rosette leaves, leaf buds, stems, cauline leaves, siliques, flowers, floral buds) .
  • a surprise result was that it was highly expressed in floral meristems where there are only a few trichomes on the sepals and no anthocyanin in the flower petals.
  • the high level of transcripts of the TTGI locus suggests two possible points of regulation of the gene. The first is that the TTGI protein is present in many tissues where it requires a partner for activation. Another possibility is that regulation of this gene occurs a posttranscriptional stage with protein only being present in those cells that require functional TTGI protein.
  • the present inventors have identified the TTGI locus as a gene encoding a WD40 repeat protein by complementation of a ttgl mutant with genomic DNA and by sequencing the gene in several ttgl mutant alleles. Two of the mutant alleles contained stop codons that would result in the production of truncated proteins, and two others contained point mutations that would change serine residues, to a cysteine and phenylalanine residues.
  • TTGI protein bears no resemblance to the maize R gene product which was able to complement the ttgl mutant phenotype in Arabidopsis and anthocyanin pigment in tobacco flowers (Lloyd et al (1992) Science 258 1773- 1775) . This suggests that TTGI acts upstream of an Arabidopsis R homologue in the pathways leading to trichome differentiation and anthocyanin synthesis . The involvement of other WD40 proteins in signal transduction pathways (see below) suggests that TTGI is involved in a pathway, or pathways, regulating the expression or action of downstream transcription factors.
  • WD40 repeat proteins are involved in a number of different types of regulatory roles, such as signalling (eg. G ⁇ subunit of heterotrimeric G proteins) , cell cycle regulation (eg CDC20 and CDC4) , transcriptional repression (eg yeast TUP1, Drosophila extra sex combs) , vesicular trafficking (eg SEC13) and RNA processing (Neer et al (1994) Nature 371 297-300) .
  • the TTGI protein shows the highest sequence similarity to G 0 subunits, which are the best characterised of the WD40 repeat proteins.
  • the G ⁇ subunit contains 7 repeats of the WD40 motif and has a structure resembling a seven-bladed propeller, based on its crystal structure with its partner G ⁇ (Sondek et al (1996) Nature 379 369-374) .
  • Each blade is composed of 4 ⁇ -sheets, and an N-terminal amphipathic o;-helix interacts closely with the G ⁇ subunit which is required for correct folding and function of the G ⁇ subunit.
  • computer- aided modelling of TTGI in collaboration with N.
  • TTGI represents a separate class of WD40 repeat protein.
  • the TTGI protein does not act directly as a transcription factor but binds to other proteins to promote the initiation of trichomes in leaves and stems.
  • the TTGI protein may act as part of a DNA binding complex to regulate transcription.
  • the amino acid sequence contains no recognizable nuclear localization signal from computer analysis, although a cryptic site might be present.
  • Another possibility is that another protein is required to form a complex for nuclear import as is the case with AP3 and PI from Arabidopsis (McGonigle et al . , 1996 Genes Dev 10, 1812-1821).
  • Another possibility is that the TTGI protein is only located in the cytoplasm and acts as part of a signal transduction pathway.
  • GUS-TTG1 and TTG1-GFP fusion proteins appear to be cytoplasmically located.
  • One sequence is the result of a genomic sequencing project in Arabidopsis and has a 85% similarity to TTGI and is located on chromosome 3.
  • Gel blots hybridized and washed at low stringency show at least three bands in Arabidopsis and maize suggesting that TTGI could belong to a class of proteins .
  • Two C. elegans genes arrayed in tandem have greater similarity to TTGI than any locus from yeast Saccaromyces cerevici . This is surprising as yeast is more closely related to Arabidopsis than C. elegans is.
  • TTGI plays a role in defining functions in epidermal cells, this function may also be required in other multicellular organisms .
  • TTGI might regulate MYC transcription factors, determined by overexpression of the maize R gene in Arabidopsis (Lloyd et al . , 1992 Proc Nat Acad Sci USA 86: 7092-7096).
  • the identification of two WD40 repeat proteins which regulate anthocyanins and, in the case of the TTGI protein, many other pathways suggests that this class of protein may be involved in regulating developmental pathways in other organisms .
  • Primers for degenerate PCR were designed by comparison of the TTGI sequence with the anil sequence (see Fig 4) . Primers were based on the sequence encoding amino acids 74-85, and 296-307.
  • Primer sequences were :
  • Amplification conditions were ascertained using a temperature gradient in a Robocycler (Stratagene) . lng of genomic DNA template was amplified with 5pmol of each primer using Taq DNA polymerase (Qiagen) with the addition of QX (Qiagen) for stabilising the DNA-primer complex.
  • Gossypium hirsutum cv. Siokva 1-4 (cotton) was used as a template in 50ul reactions with the temperature ramped between annealing and extension to 15 degrees per minute.
  • the amplified bands were size-fractioned and extracted from a gel.
  • the Matthiola DNA was polished with Klenow enzyme to make a blunt end ligated into the EcoRV site of pBluescript .
  • Three independent constructs were sequenced.
  • the gel purified PCR product was used as a probe on a Southern blot to verify that the product originated from Matthiola. At high stringency (65°C in 0.1XSSC+1% SDS) the PCR product cross hybridises to the TTGI gene of Arabidopsis.
  • Tobacco sequences (tobacco 1 and 2) were obtained from two constructs (pTOBl and pT0B2) using T-vectors bases on pBluescript. Each corresponds to one of the genomic sequences found in tobacco which is an allotetraploid species. The two sequences are 95% identical at both the nucleotide and the amino acid level .
  • pTOBl hybridises to both tobacco genes but only weakly to TTGI at high stringency. The cotton gene is currently being sequenced.
  • Garden blots reveal the presence of several similar sequences in Arabidopsis, tobacco, Petunia and zea mays when hybridised at 50°C in 5M NaCl and washed at 50°C in IXSSC+1% SDS.
  • the insert in construct pTOBl was removed using Sad and EcoRV and ligated into the Sad and Smal sites in the pROK2 vector. This gives an antisense construct with the 35S promoter driving a transcript from the complementary strand of the TOB1-TTG1 gene. Constructs containing the TOBl sequence may be placed into the Agrobacterium strain LBA4404 for transfer into tobacco plants.
  • Candidates described as glabrous and having yellow seeds were grown and crossed to ttgl.l mutants.
  • the F2 generation was examined for segregating phenotypes. All were examined for seed mucilage, anthocyanin in the plant and the testa and for leaf hairs.
  • Plant material was treated as in Dellaporta et al. (1983) Plant Mol Biol Rep 1,4, 19-21, followed by CsCl banding to remove RNA and polysaccharides (Walker et al, 1997 Photosyn Res 54, 155-163) so as to be able to detect small band shifts on DNA gel blots.
  • the pBIN19 vector was modified to contain a Notl site in the polylinker.
  • pBIN19 was restricted with Notl, treated with Klenow and dNTPs to fill in the site, ligated in a large volume, restricted again with Notl and transfected into E.coli strain TGI. Plasmid DNA was isolated from resulting colonies to check that the original Notl site no longer existed.
  • the vector was restricted with Xbal and Asp718.
  • Two annealed oligonucleotides Nl: GTACCGCGGCCGCAT AND N2 : CTAGATGCGGCCGCG
  • the ligated DNA was restricted with BamHI to remove parental molecules. In effect, the BamHI site in the polylinker of pBIN19 has been replaced with a unique Notl site and the altered vector called pBINNOT.
  • Agrobacterium strain Agll (Lazo et al . , 1991 Biotechnology 9, 963-967) was transformed with constructs based on pBINNOT vector by electroporation. Using vacuum infiltration (Bechtold et al , 1993 Compt Rend Acad Sci III Life Sci 316, 1194-1199) . DNA containing genomic fragments were introduced into ttgl.9 mutant plants.

Abstract

Disclosed are nucleic acids encoding TTG1 from Arabidopsis, which is believed to act upstream of an Arabidopsis R homologue in the pathways leading, inter alia, to trichome differentiation and anthocyanin synthesis. Also disclosed are variants and mutants of said sequence, vectors, host cells, transgenic plants, polypeptide expression products and other related materials, plus also methods of manipulating the phenotypic characteristics associated with TTG1.

Description

Materials and Methods relating to a Plant Regulatory Protein
Field of the invention This invention relates to materials and methods relating to a plant regulatory protein. More particularly, the invention relates to the cloning and expression of the TTGI gene of Arabidopsis thaliana, and homologues from other species, and manipulation and use of the gene in plants.
Background of the invention
The protein encoded by the TTGI (transparent testa, glabra) locus plays a central role in many pathways in Arabidopsis thaliana . Many of these pathways are confined to effects on the epidermal cell layer of different tissues. Mutations at the TTGI locus have a large range of pleiotropic effects (Koornneef (1981) Arabid Inform. Serv. 18 45-51) . It is known that ttgl mutants have a glabrous phenotype with no leaf or stem hairs (trichomes) which are normally derived from the LI layer of cells, the outer single layer of cells covering the meristem that differentiates into all epidermal cells of the leaf. No purple anthocyanin pigments are present in the seed coat leading to the yellow cotyledons being visible through the transparent testa. In the wild-type plant, anthocyanins are present in the hypocotyl of seedlings and in the stem and leaves of plants as they age, and are inducible by many forms of stress including by high light, poor nutrients or water stress. Mutants of the t tgl locus completely lack anthocyanins in the epidermis of leaves and stems (Koornneef (1981) Arabid Inform. Serv. 18 45-51) . Tufts of mucilage are absent from ttgl mutant seeds and the seeds show no secretion of mucilage on imbibing, unlike wild-type plants (Koornneef (1981) Arabid Inform. Serv. 18 45-51) . In wild-type plants, root hairs extend from root epidermal cells only in files of cells that contact two underlying cortical cells, whereas in ttgl mutants extra root hairs occur in the atrichoblast cell files (Galway et al (1994) Dev. Biol. 166 740-754) . Seeds of ϋtgl mutant plants do not require drying and cold treatments to germinate and exhibit an altered seed dormancy compared to ecotypes such as Landsberg erecta (Koornneef et al (1982) Theoret Appl Genet 61 385-393) .
Several genetic loci involved in trichome, or leaf hair, differentiation and development have been described from Arabidopsis (Koornneef (1981) Arabid Inform. Serv. 18 45-51, Hϋlskamp et al . (1994) Cell 76 555-566). Three loci that play a role in the initiation of trichomes have been identified; these are GL1 (glabra 1) , TTGI (transparent testa glabra) and TRY (triptychon) .
Mutations at the GL1 locus lead to hairless plants whereas the TRY locus affects the spacing of trichomes, which form clumps in try mutant plants (Hύlskamp et al . (1994) Cell 76 555-566) . GL1 is a MYB transcription factor (Marks and Feld ann (1989) Plant Cell 1 1043-1050, Oppenheimer et al (1991) Cell 67 483-493) . Mutations at another glabra locus GL2 have some features in common with ttgl , although gl2 mutants have normal anthocyanin content and have rudimentary trichomes suggesting the fate of these cells has already been determined
(Koornneef (1981) Arabid Inform. Serv. 18 45-51) . They have an increased number of ectopic root hairs, although the atrichoblast cells resemble wild-type cells more than in the ttgl mutant indicating a role later in the development of the root epidermal cells (Masucci et al (1996) Development 122 1253-1260) . The effects on the seed coat and mucilage are similar to that of the ttgl mutants (Koornneef (1981) Arabid Inform. Serv. 18 45-51) . The GL2 locus encodes another transcription factor, a homeodomain protein with a leucine zipper domain. By in si tu hybridization, the GL2 gene is expressed in developing trichomes (Rerie et al (1994) Genes Dev. 8 1388-1389) and in the atrichoblast cell files of the roots (Masucci et al (1996) Development 122 1253-1260) . Transcript levels of GL2 are greatly reduced in ttgl mutants, suggesting that the TTGI gene product is required for normal expression of GL2 (Cristina et al (1996) Plant J 10 393-402) .
In ttgl mutants the anthocyanin biosynthetic pathway is blocked at the dihydroflavonol-4 -reductase (DFR) step because no DFR message is detected in these mutants (Shirley et al (1995) Plant J 8 659-671) whereas transcripts encoding chalcone synthase and chalcone isomerase are unaffected. This resembles the effect of Delila mutants in Antirrhinum (Martin et al (1991) Plant J 1 37-49) . In maize the equivalent locus called the R gene affects the whole pathway from chalcone synthase (CS) onwards. Delila and R are both MYC-like transcription factors (Ludwig et al (1989) Proc Natl Acad Sci USA 86 7092-7096, Goodrich et al (1992) Cell 68 955- 964) . R has been shown to activate directly the transcription of several genes encoding anthocyanin biosynthetic enzymes in conjunction with a MYB transcription factor encoded by the Cl gene in maize (Goff et al (1992) Genes Dev. 6 864-875) . Complementation of a ttgl mutant by cauliflower mosaic virus 35S promoter-i? constructs (Lloyd et al (1992) Science 258
1773-1775) was used to suggest that TTGI might encode an Arabidopsis R homologue. A further characterised transcription factor (Caprice [CPC] - see Wada et al 1997, Science 277, 1113-1116) may act in the opposite way to TTGI in promoting root hair development, and possible reducing the trichome number.
The TTGI locus has been broadly mapped by Koornneef (Koornneef et al (1982) Theoret Appl Genet 61 385-393) to chromosome 5 between msl and ga3 . At position 31.5 the ttgl locus has been used as a phenotypic marker in many crosses . Summary of the invention
The present inventors have identified the TTGI locus as a gene encoding a D40 repeat protein by complementation of a ttgl mutant with genomic DNA and by sequencing the gene in several ttgl mutant alleles. The TTGI gene has now been cloned and sequenced and the inventors have demonstrated that it encodes a WD40 repeat protein with 7 repeat units . The 1.6 kb transcript is present in all major organs. The identification of the product of the TTGI locus as a D40 repeat protein rules out the possibility that the protein acts as a transcription factor, unlike the products of the other genes, GL1 and GL2, affecting trichome development. Additionally the TTGI protein bears no resemblance to the maize R gene. The present inventors propose that the D40 repeat protein is a component of a signal transduction pathway which regulates expression or action of downstream transcription factors, and in particular that TTGI acts upstream of an Arabidopsis R homologue in the pathways leading to trichome differentiation and anthocyanin synthesis.
The TTGI gene has a novel sequence. No Arabidopsis genes showing significant homology to TTGI were identified in public databases. A protein of unknown function showing 61% amino acid identity is encoded on chromosome 3 of Arabidopsis (orflO in database accession number X98130) , but transcripts of this gene do not cross hybridise with TTGI at high stringency on Northern blots. However, a region of the TTGI protein showed homology to an Expressed Sequence Tag (EST) of unknown function. The EST came from a cell suspension culture from Eco type Columbia (clone library AC16H) .
Additionally the TTGI gene shows 87.5% similarity to the anil gene from Petunia. This gene is discussed by Vetten et al (1997) Genes & Development 11: 1422-1434,
Pub. Cold Spring Harbor Laboratory Press, which may have been published before the claimed priority date of the present invention. Interestingly, the anil locus is described as controlling anthocyanin pigmentation and hence flower colour - but apparently does not exert the pleiotropic effects (e.g. trichomes, anthocyanin in other parts of the plant) of the TTGI gene which forms the basis of the present invention.
A genomic sequence encompassing Arabidopsis TTGI has recently (after the priority date of the present invention) been put on a database under accession number AB010068.
Thus according to a first aspect of the present invention there is provided a nucleic acid molecule including a nucleotide sequence encoding a polypeptide with TTGI function. Those skilled in the art will appreciate that " TTGI function" may be used to refer to the ability to manipulate the phenotypic characteristics of plants as described below when its expression is altered like the TTGI gene of Arabidopsis thaliana.
Manipulation of the phenotypic characteristics of plants may be achieved by altering the expression of the TTGI gene (by increasing/decreasing expression or by mutation) or by interfering with the normal function of the TTGI protein. Further, manipulation may be achieved by providing for the expression of a further homologous transcript which is able to interact with the expression of the TTGI gene in such a way as to either prevent translation of the transcript occurring or to boost the levels of transcripts being translated.
Examples of phenotypic characteristics that may be manipulated in accordance with the present invention are given below. Preferably at least 2, 3, 4, 5 or 6 or more of these characteristics are manipulated: 1. Trichomes (hairs) on aerial parts of plants: trichomes have a number of functions and the present invention provides a way to increase and decrease the number of trichomes on different organs to enhance their effectiveness. The increase or decrease in the number of trichomas may be utilized in:
(i) insect protection: due to mechanical effects and to chemicals from glandular hairs . These could be increased, to increase insect protection, on leaves or on cotyledons which often do not have hairs. Protection of cotyledons from insect attack may allow faster seedling growth.
(ii) chemical production: glandular trichomes are involved in producing pheromones, antifeedants and other chemicals, including essential oils, which may be increased if the number of trichomes is increased.
(iii) protection in hot, dry climates: hairs form boundary layers for decreased water loss . Hairier plants may have an advantage in warmer climates.
Hairs may also provide shade and protection for meristem in young seedlings, allowing faster seedling growth.
(iv) salt removal from leaves: the presence of salt glands would allow trichomes to sequester or secrete salt . (Relatives of rice have microtrichomes of 2 cells which secrete salt.)
(v) cotton fibres: it may be advantageous to increase the number of cotton fibres per boll, and at the same time decrease leaf trichomes to prevent insects hiding and prevent contamination of bolls . (vi) ornamental plants: it may be preferred to decrease the number of hairs on hairy and glabrous varieties of a range of garden plants. 2. Trichomes on roots: manipulation of the number of root hairs may affect water and nutrient absorption (crop nutrient use efficiency) by the plants. Root hairs are also involved in anchoring the plant in the soil, particularly sandy soils, and allow better root penetration.
3. Seed mucilage: manipulation may lead to better seed germination in dry soils, due to maintenance of moisture around the seed.
4. Seed dormancy: alteration of seed dormancy may allow quicker, or slower (if viviparous) , germination of seeds after harvest. This may lead to faster cycling of crops. 5. Anthocyanin pigments: have a range of functions in the plant, and manipulation may alter pigmentation of seeds, leaves, flowers and fruit. Such manipulation may lead to:
(i) UV-B protection of plants, mainly in leaves, but anthocyanins are produced in plants under a wide range of stresses, including water stress, light stress, increased sugars. These stresses lead to decreased photosynthesis and susceptibility to photoxidation. (ii) Altered flower and leaf colour in ornamentals and food crops, eg broccoli; altered fruit and seed colour in food, eg aubergines and grains (maize, rice, etc) .
6. Condensed tannins, produced by the polymerisation of anthocyanin precursors, are found in many plants and are responsible in part for the taste characteristics of a range of fruits and vegetables, such as apple, kiwifruit, gooseberry, redcurrant and banana. Condensed tannins produce characteristic astringent properties in tea, coffee, wine, spices and fruit juices. Tannins also have important effects in animal feedstuffs. In monogastric animals, such as pigs and chickens, tannins limit the use of potential feedstuffs such as faba beans and sorghum. In ruminants, moderate levels of tannins are beneficial and may improve retention of dietary nitrogen, but higher levels reduce the nutritive value of foliage and feedstuffs. Manipulation of TTGI may alter the levels of condensed tannins in these plants.
7. Stomata on hypocotyls : increases in the number stomata may result in faster seedling growth under ideal conditions, such as optimum water and C02 availability. The present invention provides a nucleic acid isolate encoding a polypeptide including the amino acid sequence shown in Figure 3 (SEQ ID No. 2) or homologues thereof, which may include the coding sequence shown in Figure 3 which is that of the TTGI gene of Arabidopsis thaliana, and/or other transcribed parts of the gene e.g. as shown in Figure 3 or Figure 5.
Nucleic acid according to the present invention may have the sequence of an TTGI gene of Arabidopsis thaliana , or be a mutant, variant, derivative or allele or a homologue of the sequence provided. Preferred mutants, variants, derivatives and alleles are those which encode a protein which retains a functional characteristic of the protein encoded by the wild-type gene, especially the ability to affect a physical characteristic of a plant, such as the phenotypic characteristics outlined above.
A mutant, variant, derivative or allele in accordance with the present invention may have the ability to affect a physical characteristic of a plant, particularly a phenotypic characteristic identified above. Thus, a mutant, variant, derivative or allele may decrease the amount of anthocyanins in the epidermis of leaves and stems compared with wild-type on expression in a plant, e.g. compared with the effect obtained using a gene sequence expressing the polynucleotide sequence of Figure 3.
Alternatively or in addition, a mutant, variant, derivative or allele increases or decreases the number of trichomes on different organs compared with wild-type on expression in a plant, e.g. compared with the effect obtained using a gene sequence expressing the polynucleotide sequence of Figure 3. Down-regulation of TTGI activity may be achieved by mutant nucleic acids (e.g through co-suppression) or by mutant polypeptides, which may compete for receptors or other binding sites for TTGI, without triggering appropriate effects.
Comparison of effect on the increase or decrease of trichomas or other characteristics may be performed in Arabidopsis thaliana, although nucleic acid according to the present invention may be used in the production of a wide variety of plants and for influencing a phenotypic characteristic thereof.
Changes to a sequence, to produce a mutant, variant or derivative, may be by one or more of addition, insertion, deletion or substitution of one or more nucleotides in the nucleic acid, leading to the addition, insertion, deletion or substitution of one or more amino acids in the encoded polypeptide. Further, it may lead to the creation of stop codons resulting truncated polypeptide; removal of stop codons resulting in extended polypeptides; or a frameshift resulting in a polypeptide lacking TTGI function. Of course, changes to the nucleic acid which make no difference to the encoded amino acid sequence are included.
A preferred nucleic acid sequence for an TTGI gene is shown as the coding sequence within Figure 3/SEQ ID No. 1, alongside the predicted amino acid sequence of a polypeptide according to the present invention which has TTGI function (SEQ ID No. 2) .
Particular mutant alleles of the nucleic acid according to the present invention include : a) ttgl.10 (SEQ ID No. 3) which contains a point mutation (G to A) in the 5' untranslated part of the TTGI sequence (see Fig 3) ; b) ttgl.19 (SEQ ID No. 4) which results in the introduction of a stop codon at codon 183; c) ttgl.l (SEQ ID No . 5 - formerly designated ttgl.21) which results in the introduction of a stop codon at codon 317; d) ttgl.20 (SEQ ID No. 6) which contains a point mutation (S to C) at codon 30, plus introduction of a stop codon at codon 310; e) ttgl.9 (SEQ ID No.7) which contains a point mutation (S to F) at codon 282. f) ttgl.15, ttgl.16, ttgl.17, ttgl.18 which all result in a stop codon at codon 310 (via a substitution of 2 different bases - TCGGCT to TAGACT - this sequence is designated SEQ ID No. 13) . Mutant alleles b) to f) contain point mutations that result in changes to the protein product. Mutations in the 5 'untranslated leader sequences may affect the translation of the RNA.
Interestingly the present inventors have established that these mutations can lead to quite different phenotypes for the plants expressing them. For instance ttgl.9 has more anthocyanin present than ttgl.l (reference allele) while ttgl.10 has more trichomes, different seed mucilage and less anthocyanin than ttgl.9 (see Larkin et al, Plant Cell 6, 1065-1076 for an analysis) .
Thus it is clear that, given the sequence information disclosed herein, it will be possible for the skilled person, if desried, to generate ttgl mutant having some (but not all) of the pleiotropic effects of the wild-type/ecotype TTGI gene.
It will be appreciated by the skilled person that the above exemplified point mutations may be present individually or in combination with other point mutations. In other words, a mutant, allele, variant or derivative amino acid sequence in accordance with the present invention may include within the sequence shown in Figure 3, a single amino acid change with respect to the sequence shown in Figure 3, or 2, 3, 4, 5, 6, 7, 8, or 9 changes, about 10, 15, 20, 30, 40 or 50 changes, or greater than about 50, 60, 70, 80 or 90 changes. In addition to one or more changes within the amino acid sequence shown in Figure 3, a mutant, allele, variant or derivative amino acid sequence may include additional amino acids at the C-terminus and/or N-terminus.
A sequence related to a sequence specifically disclosed herein shares homology with that sequence. Homology may be at the nucleotide sequence and/or amino acid sequence level. Preferably, the nucleic acid and/or amino acid sequence shares homology with the nucleotide sequence of Figure 3, or the amino acid sequence encoded thereby. Preferably the homology is at least about 50%, or 60%, or 70%, or 80% homology, most preferably at least about 90%, 95%, 96%, 97%, 98% or 99% homology.
As is well-understood, homology at the amino acid level is generally in terms of amino acid similarity or identity. Similarity allows for "conservative variation", i.e. substitution of one hydrophobic residue such as isoleucine, valine, leucine or methionine for another, or the substitution of one polar residue for another, such as arginine for lysine, glutamic for aspartic acid, or glutamine for asparagine . Similarity may be as defined and determined by the TBLASTN program, of Altschul et al . (1990) J. Mol . Biol . 215: 403-10, which is in standard use in the art, or, and this may be preferred, the standard program BestFit, which is part of the Wisconsin Package, Version 8, September 1994,
(Genetics Computer Group, 575 Science Drive, Madison, Wisconsin, USA, Wisconsin 53711) . BestFit makes an optimal alignment of the best segment of similarity between two sequences. Optimal alignments are found by inserting gaps to maximize the number of matches using the local homology algorithm of Smith and Waterman.
Homology may be over the full-length of the relevant sequence shown herein, or may more preferably be over a contiguous sequence of about or greater than about 20, 25, 30, 33, 40, 50, 67, 133, 167, 200, 233, 267, 300, 333 or more amino acids or codons, compared with the relevant amino acid sequence or nucleotide sequence as the case may be .
Also provided by an aspect of the present invention is nucleic acid including or consisting essentially of a sequence of nucleotides complementary to a nucleotide sequence with any sequence provided herein. Further, there is provided nucleic acid including or consisting essentially of a sequence of nucleotides complementary to a nucleotide sequence hybridisable with any encoding sequence provided herein. Another way of looking at this would be for nucleic acid according to this aspect to be hybridisable with a nucleotide sequence complementary to any encoding sequence provided herein. Of course, DNA is generally double-stranded and blotting techniques such as Southern hybridisation are often performed following separation of the strands without a distinction being drawn between which of the strands is hybridising. Preferably the hybridisable nucleic acid or its complement encode a product able to influence a physical characteristic of a plant, particularly a phenotypic characteristic as described above. Preferred conditions for hybridisation are familiar to those skilled in the art, but are generally stringent enough for there to be positive hybridisation between the sequences of interest to the exclusion of other sequences . For instance, screening may initially be carried out under conditions, which comprise a temperature of about 37°C or less, a formamide concentration of less than about 50%, and a moderate to low salt (e.g. Standard Saline Citrate ( SSC) = 0.15 M sodium chloride; 0.15 M sodium citrate; pH 7) concentration.
Alternatively, a temperature of about 50°C or less and a high salt (e.g. •χSSPE'= 0.180 mM sodium chloride; 9 mM disodium hydrogen phosphate; 9 mM sodium dihydrogen phosphate; 1 mM sodium EDTA; pH 7.4) . Preferably the screening is carried out at about 37°C, a formamide concentration of about 20%, and a salt concentration of about 5 X SSC, or a temperature of about 50°C and a salt concentration of about 2 X SSPE. These conditions will allow the identification of sequences which have a substantial degree of homology (similarity, identity) with the probe sequence, without requiring the perfect homology for the identification of a stable hybrid. Suitable conditions include, e.g. for detection of sequences that are about 80-90% identical, hybridization overnight at 42°C in 0.25M Na2HP04, pH 7.2, 6.5% SDS, 10% dextran sulfate and a final wash at 55°C in 0. IX SSC, 0.1% SDS. For detection of sequences that are greater than about 90% identical, suitable conditions include hybridization overnight at 65°C in 0.25M Na2HP04, pH 7.2, 6.5% SDS, 10% dextran sulfate and a final wash at 60°C in 0.1X SSC, 0.1% SDS. The nucleic acid, which may contain, for example,
DNA encoding the amino acid sequence of Figure 3 , as genomic or cDNA, may be in the form of a recombinant and preferably replicable vector, for example a plasmid, cosmid, phage or AgroJbacterium binary vector. The nucleic acid may be under the control of an appropriate promoter or other regulatory elements for expression in a host cell such as a microbial, e.g. bacterial, or plant cell. In the case of genomic DNA, this may contain its own promoter or other regulatory elements and in the case of cDNA this may be under the control of an appropriate promoter or other regulatory elements for expression in the host cell.
A vector including nucleic acid according to the present invention need not include a promoter or other regulatory sequence, particularly if the vector is to be used to introduce the nucleic acid into cells for recombination into the genome.
Those skilled in the art are well able to construct vectors and design protocols for recombinant gene expression. Suitable vectors can be chosen or constructed, containing appropriate regulatory sequences, including promoter sequences, terminator fragments, polyadenylation sequences, enhancer sequences, marker genes and other sequences as appropriate. For further details see, for example, Molecular Cloning: a Laboratory- Manual : 2nd edition, Sambrook et al , 1989, Cold Spring Harbor Laboratory Press. Many known techniques and protocols for manipulation of nucleic acid, for example in preparation of nucleic acid constructs, mutagenesis, sequencing, introduction of DNA into cells and gene expression, and analysis of proteins, are described in detail in Current Protocols in Molecular Biology, Second Edition, Ausubel et al . eds . , John Wiley & Sons, 1992. The disclosures of Sambrook et al . and Ausubel et al . are incorporated herein by reference. Specific procedures and vectors previously used with wide success upon plants are described by Bevan (Nucl. Acids Res. 12, 8711-8721 (1984)) and Guerineau and Mullineaux (1993) (Plant transformation and expression vectors. In: Plant Molecular Biology Labfax (Croy RRD ed) Oxford, BIOS Scientific Publishers, pp 121-148) . Selectable genetic markers may be used consisting of chimeric genes that confer selectable phenotypes such as resistance to antibiotics such as kanamycin, hygromycin, phosphinotricin, chlorsulfuron, methotrexate, gentamycin, spectinomycin, imidazolinones and glyphosate. Nucleic acid molecules and vectors according to the present invention may be provided isolated and/or purified from their natural environment, in substantially pure or homogeneous form, or free or substantially free of nucleic acid or genes of the species of interest or origin other than the sequence encoding a polypeptide with the required function. Nucleic acid according to the present invention may include cDNA, RNA, genomic DNA and may be wholly or partially synthetic. The term "isolate" encompasses all these possibilities. Where a DNA sequence is specified, e.g. with reference to a figure, unless context requires otherwise, the RNA equivalent, with U substituted for T where it occurs, is encompassed.
When introducing a chosen gene construct into a cell, certain considerations must be taken into account, well known to those skilled in the art. The nucleic acid to be inserted should be assembled within a construct which contains effective regulatory elements which will drive transcription. There must be available a method of transporting the construct into the cell. Once the construct is within the cell membrane, integration into the endogenous chromosomal material either will or will not occur. Finally, as far as plants are concerned the target cell type must be such that cells can be regenerated into whole plants .
Plants transformed with the DNA segment containing the sequence may be produced by standard techniques which are already known for the genetic manipulation of plants. Plants can be transformed with DNA using any suitable technology, such as a disarmed Ti-plasmid vector carried by Agrobacterium exploiting its natural gene transfer ability (EP-A-270355, EP-A-0116718 , NAR 12(22) 8711 -
87215 1984) , particle or microprojectile bombardment (US 5100792, EP-A-444882, EP-A-434616) microinjection (WO 92/09696, WO 94/00583, EP 331083, EP 175966, Green et al . (1987) Plant Tissue and Cell Cul ture, Academic Press) , electroporation (EP 290395, WO 8706614 Gelvin Debeyser - see attached) other forms of direct DNA uptake (DE 4005152, WO 9012096, US 4684611), liposome mediated DNA uptake (e.g. Freeman et al . Plant Cell Physiol . 29: 1353 (1984)), or the vortexing method (e.g. Kindle, PNAS U. S . A . 87: 1228 (1990d) Physical methods for the transformation of plant cells are reviewed in Oard, 1991, Biotech . Adv. 9: 1-11.
Agrobacterium transformation is widely used by those skilled in the art to transform dicotyledonous species . Recently, there has been substantial progress towards the routine production of stable, fertile transgenic plants in almost all economically relevant monocot plants (Toriyama, et al . (1988) Bio/Technology 6, 1072-1074; Zhang, et al . (1988) Plant Cell Rep . 7, 379-384; Zhang, et al . (1988) Theor Appl Genet 76, 835-840; Shimamoto, et al . (1989) Nature 338, 274-276; Datta, et al . (1990) Bio/Technology 8, 736-740; Christou, et al . (1991) Bio/Technology 9, 957-962; Peng, et al . (1991) International Rice Research Institute, Manila, Philippines 563-574; Cao, et al . (1992) Plant Cell Rep . 11, 585-591; Li, et al . (1993) Plant Cell Rep . 12, 250- 255; Rathore, et al . (1993) Plant Molecular Biology 21, 871-884; Fromm, et al . (1990) Bio/Technology 8, 833-839; Gordon-Kamm, et al . (1990) Plant Cell 2 , 603-618; D'Halluin, et al . (1992) Plant Cell 4, 1495-1505; Walters, et al . (1992) Plant Molecular Biology 18, 189- 200; Koziel, et al . (1993) Biotechnology 11, 194-200;
Vasil, I. K. (1994) Plant Molecular Biology 25, 925-937; Weeks, et al . (1993) Plant Physiology 102, 1077-1084; Somers, et al . (1992) Bio/Technology 10, 1589-1594; W092/14828) . In particular, Agrobacterium mediated transformation is now emerging also as an highly efficient alternative transformation method in monocots (Hiei et al . (1994) The Plant Journal 6, 271-282).
The generation of fertile transgenic plants has been achieved in the cereals rice, maize, wheat, oat, and barley (reviewed in Shimamoto, K. (1994) Current Opinion in Biotechnology 5, 158-162.; Vasil, et al . (1992) Bio /Technology 10, 667-674; Vain et al . , 1995, Biotechnology Advances 13 (4) : 653-671; Vasil, 1996, .Nature Biotechnology 14 page 702) . Microprojectile bombardment, electroporation and direct DNA uptake are preferred where Agrobacterium is inefficient or ineffective. Alternatively, a combination of different techniques may be employed to enhance the efficiency of the transformation process, eg bombardment with Agrobacterium coated microparticles (EP-A-486234) or microprojectile bombardment to induce wounding followed by co-cultivation with Agrobacterium (EP-A-486233) .
Following transformation, a plant may be regenerated, e.g. from single cells, callus tissue or leaf discs, as is standard in the art. Almost any plant can be entirely regenerated from cells, tissues and organs of the plant . Available techniques are reviewed in Vasil et al . , Cell Cul ture and Somatic Cell Genetics of Plants, Vol I, II and III, Laboratory Procedures and Their Applications , Academic Press, 1984, and Weissbach and Weissbach, Methods for Plant Molecular Biology, Academic Press, 1989.
The particular choice of a transformation technology will be determined by its efficiency to transform certain plant species as well as the experience and preference of the person practising the invention with a particular methodology of choice. It will be apparent to the skilled person that the particular choice of a transformation system to introduce nucleic acid into plant cells is not essential to or a limitation of the invention, nor is the choice of technique for plant regeneration. A TTGI gene and modified versions thereof (alleles, mutants, variants and derivatives thereof), may be used to affect a physical characteristic, such as hairs on roots and aerial parts of plants and anthocyanin pigments characteristics, in plants. For this purpose nucleic acid such as a vector as described herein may be used for the production of a transgenic plant. Such a plant may possess an altered phenotype as described above compared with wild-type (that is to say a plant that is wild-type for TTGI or the relevant homologue thereof) . The invention further encompasses a host cell transformed with nucleic acid or a vector according to the present invention, especially a plant or a microbial cell. Thus, a host cell, such as a plant cell, including heterologous nucleic acid according to the present invention is provided. Within the cell, the nucleic acid may be incorporated within the chromosome . There may be more than one heterologous nucleotide sequence per haploid genome .
Also according to the invention there is provided a plant cell having incorporated into its genome nucleic acid, particularly heterologous nucleic acid, as provided by the present invention, under operative control of a regulatory sequence for control of expression. The coding sequence may be operably linked to one or more regulatory sequences which may be heterologous or foreign to the gene, such as not naturally associated with the gene for its expression. The nucleic acid according to the invention may be placed under the control of an externally inducible gene promoter to place expression under the control of the user.
A suitable inducible promoter is the GST-II-27 gene promoter which has been shown to be induced by certain chemical compounds which can be applied to growing plants. The promoter is functional in both monocotyledons and dicotyledons. It can therefore be used to control gene expression in a variety of genetically modified plants, including field crops such as canola, sunflower, tobacco, sugarbeet, cotton; cereals such as wheat, barley, rice, maize, sorghum; fruit such as tomatoes, mangoes, peaches, apples, pears, strawberries, bananas, and melons; and vegetables such as carrot, lettuce, cabbage and onion. The GST-II-27 promoter is also suitable for use in a variety of tissues, including roots, leaves, stems and reproductive tissues .
A further aspect of the present invention provides a method of making such a plant cell involving introduction of nucleic acid or a suitable vector including the sequence of nucleotides into a plant cell and causing or allowing recombination between the vector and the plant cell genome to introduce the sequence of nucleotides into the genome. The invention extends to plant cells containing nucleic acid according to the invention as a result of introduction of the nucleic acid into an ancestor cell.
The term "heterologous" may be used to indicate that the gene/sequence of nucleotides in question have been introduced into said cells of the plant or an ancestor thereof, using genetic engineering, i.e. by human intervention. A transgenic plant cell, i.e. transgenic for the nucleic acid in question, may be provided. The transgene may be on an extra-genomic vector or incorporated, preferably stably, into the genome. A heterologous gene may replace an endogenous equivalent gene, i.e. one which normally performs the same or a similar function, or the inserted sequence may be additional to the endogenous gene or other sequence. An advantage of introduction of a heterologous gene is the ability to place expression of a sequence under the control of a promoter of choice, in order to be able to influence expression according to preference. Furthermore, mutants, variants and derivatives of the wild-type gene, e.g. with higher or lower activity than wild-type, may be used in place of the endogenous gene. Nucleic acid heterologous, or exogenous or foreign, to a plant cell may be non-naturally occurring in cells of that type, variety or species. Thus, nucleic acid may include a coding sequence of or derived from a particular type of plant cell or species or variety of plant, placed within the context of a plant cell of a different type or species or variety of plant. A further possibility is for a nucleic acid sequence to be placed within a cell in which it or a homologue is found naturally, but wherein the nucleic acid sequence is linked and/or adjacent to nucleic acid which does not occur naturally within the cell, or cells of that type or species or variety of plant, such as operably linked to one or more regulatory sequences, such as a promoter sequence, for control of expression. A sequence within a plant or other host cell may be identifiably heterologous, exogenous or foreign. Plants which include a plant cell according to the invention are also provided, along with any part or propagule thereof, seed, selfed or hybrid progeny and descendants. A plant according to the present invention may be one which does not breed true in one or more properties. Plant varieties may be excluded, particularly registrable plant varieties according to Plant Breeders' Rights. It is noted that a plant need not be considered a "plant variety" simply because it contains stably within its genome a transgene, introduced into a cell of the plant or an ancestor thereof.
In addition to a plant, the present invention provides any clone of such a plant, seed, selfed or hybrid progeny and descendants, and any part of any of these, such as cuttings, seed. The invention provides any plant propagule, that is any part which may be used in reproduction or propagation, sexual or asexual, including cuttings, seed and so on. Also encompassed by the invention is a plant which is a sexually or asexually propagated off-spring, clone or descendant of such a plant, or any part or propagule of said plant, offspring, clone or descendant.
The invention further provides a method of influencing or affecting a physical characteristic e.g. hairs on roots and aerial parts of plants and/or the presence or absence of anthocyanin pigment, including causing or allowing expression of a heterologous nucleic acid sequence as discussed within cells of the plant.
The invention further provides a method including expression from nucleic acid encoding the amino acid sequence of Figure 3 , or a mutant, variant, allele or derivative of the sequence, within cells of a plant (thereby producing an encoded polypeptide) , following an earlier step of introduction of the nucleic acid into a cell of the plant or an ancestor thereof . Such a method may influence or affect a phenotypic characteristic of the plant, such as those mentioned above. This may be used in combination with any other gene, such as transgenes involved in any other phenotypic trait or desirable property. The present invention also encompasses the expression product of the nucleic acid sequence disclosed and methods of making the expression product by expression from nucleic acid encoding therefor under appropriate conditions, which may be in appropriate host cells. Following expression, the product may be isolated from the expression system and may be used as desired, for instance in formulation of a composition including at least one additional component.
Purified TTGI protein, or a variant thereof, e.g. produced recombinantly by expression from encoding nucleic acid therefor, may be used to raise antibodies employing techniques which are standard in the art.
Antibodies and polypeptides comprising antigen-binding fragments of antibodies may be used in identifying homologues from other plant species.
Methods of producing antibodies include immunising a mammal with the protein or a fragment thereof.
Antibodies may be obtained from immunised animals using any of a variety of techniques known in the art, and might be screened, preferably using binding of antibody to antigen of interest. As an alternative or supplement to immunising a mammal, antibodies with appropriate binding specificity may be obtained from a recombinantly produced library of expressed immunoglobulin variable domains, e.g. using lambda bacteriophage or filamentous bacteriophage which display functional immunoglobulin binding domains on their surfaces; for instance see WO92/01047.
A further aspect of the present invention provides a method of identifying and cloning TTGI homologues from plant species other than Arabidopsis thaliana which method employs a nucleotide sequence obtainable from that shown in Figure 3. Such a method may include the steps of preparing nucleic acid from plant cells under test, providing a nucleic acid molecule having a nucleotide sequence shown in Figure 3 or complementary to a nucleic acid sequence shown in Figure 3, contacting nucleic acid in said preparation with said nucleic acid molecule under conditions for hybridization of said nucleic acid molecule to any said gene or homologue in said preparation, and identifying said gene or homologue if present by its hybridization with said nucleic acid molecule . Sequences derived from these may themselves be used in identifying and in cloning other sequences. The nucleotide sequence information provided herein, or any part thereof, may be used in a data-base search to find homologous sequences, expression products of which can be tested for ability to influence characteristics described above. These may have TTGI function or the ability to modify characteristics including hairs on roots and aerial parts of plants and anthocyanin pigments. Alternatively, nucleic acid libraries may be screened using techniques well known to those skilled in the art and homologous sequences thereby identified then tested for requisite functionality.
Further, nucleotide sequences obtained from that shown in figure 3 may be used to isolate TTGI homologous from other species of plants by techniques such as hybridization and polymerase chain reaction (PCR) .
PCR techniques for the amplification/identification of nucleic acid are described in US Patent No. 4,683,195. The nucleic acid sequence provided herein readily allows the skilled person to design PCR primers.
Such oligonucleotide probes or primers, as well as the full-length sequence (and mutants, alleles, variants and derivatives) are also useful in identifying homologous sequences. Further, the present invention also extends to oligonucleotide probes or primers for amplification and/or identification which are obtainable by use of the sequence shown in Figure 3, optionally by selecting regions which are conserved with other sequences e.g. from the prior art. Alternatively it may be desirable to generate more specific primers by selecting regions of the TTGI which are not homologous to other proteins such as anil. In some preferred embodiments, oligonucleotides according to the present invention that are fragments of the sequence shown in Figure 3, or any mutant, allele, variant, or derivatives thereof, are at least 10 nucleotides in length, more preferably at least 15 nucleotides in length, more preferably at least 20 nucleotides in length.
Such fragments themselves individually represent aspects of the present invention. Techniques corresponding to those above may also be used for ascertaining the genotype of mutant plants having altered phenotypes corresponding to TTGI activities (e.g which lack trichomes or anthocyanin) i.e. the probes and primers of the present invention can be used for diagnosing mutations in such plants, or as markers for these traits.
As described above, the present invention also extends to nucleic acid encoding an TTGI homologue obtained using a nucleotide sequence derived from that shown in Figure 3.
In certain embodiments, nucleic acid according to the present invention encodes a polypeptide which has homology with all or part of the amino acid sequence shown in Figure 3, in the terms discussed already above (e.g. for length), which homology is greater over the length of the relevant part (i.e. fragment) (the relevant part being greater than 110 amino acids in length, preferably greater than 200 amino acids and even more preferably greater than 300 amino acids in length) than the homology shared between a respective part of the amino acid sequence of Figure 3 and the EST sequence, and may be greater than about 5% greater, more preferably greater than about 10% greater, more preferably greater than about 20% greater, and more preferably greater than about 30% greater.
Similarly, nucleic acid according to certain embodiments of the present invention may have homology with all or part of the nucleotide sequence shown in Figure 3, in the terms discussed already above (e.g. for length) , which homology is greater over the length of the relevant part (i.e. fragment) (the relevant part being greater than 350 nucleotide in length, preferably greater than 400 and even more preferably greater than 500 nucleotide in length) than the homology shared between a respective part of the nucleotide sequence of Figure 3 and may be greater than about 5% greater, more preferably greater than about 10% greater, more preferably greater than about 20% greater, and more preferably greater than about 30% greater. Thus, to exemplify with reference to one embodiment, nucleic acid may be provided in accordance with the present invention wherein the nucleotide sequence includes a contiguous sequence of about 350 nucleotides which has greater homology with a contiguous sequence of 350 nucleotides within the nucleotide sequence of Figure 3 than any contiguous sequence of 331 nucleotides of an EST sequence, preferably greater than about 5% greater homology, and so on.
The provision of sequence information for the TTGI gene of Arabidopsis thaliana enables the obtention of homologous sequences from other plant species. In particular, it should be possible to easily isolate TTGI homologues from related, commercially important Brassica species (e.g. Brassica nigra, Brassica napus, Brassica campestris and Brassica oleracea) . Examples of homologues from Matthiola incana (ten week stock) , Nicotiana tobaccum var Samsum (tobacco) and Gossypium hirsutum cv. Siokva 1-4 (cotton) are disclosed in the Examples below.
Thus, included within the scope of the present invention are nucleic acid molecules which encode amino acid sequences which are homologues of TTGI of
Arabidopsis thaliana . Homology may be at the nucleotide sequence and/or amino acid sequence level, as has already been discussed above. A homologue from a species other than Arabidopsis thaliana encodes a product which causes a phenotype similar to that caused by the Arabidopsis thaliana TTGI gene, generally including the ability to influence a phenotypic characteristic, particularly a phenotypic characteristic as described above. In addition, mutants, derivatives or alleles of these genes may alter such characteristics compared with wild-type. TTGI gene homologues may also be identified from economically important monocotyledonous crop plants such as rice and maize. Although genes encoding the same protein in monocotyledonous and dicotyledonous plants show relatively little homology at the nucleotide level, amino acid sequences are conserved. Therefore it is possible to use public sequence databases to identify
Arabidopsis, rice or maize cDNA clone sequences that were obtained in random sequencing programmes and share homology to the gene of interest, as has been done for flowering time genes isolated from Arabidopsis (e.g. CO; WO 96/14414) .
Nucleic acid according to the invention may be used to modify the characteristics of a plant . This may be achieved by modification of expression of the nucleic acid according to the present invention or by interfering with the normal function of the protein encoded by the nucleic acid according to the present invention. For example, nucleic acid according to the present invention may be used to increase or decrease the number of trichomes on different organs to enhance their effectiveness. Further, it may be used to alter the pigmentation of seeds, leaves, flowers and fruit for UV protection and/or colour for presentation reasons or for ornamental plants. This may involve use of anti-sense or sense regulation, discussed further below. As noted above, other physical characteristics of plants may be affected by means of expression from nucleic acid according to the present invention. Nucleic acid according to the invention, such as an TTGI gene or homologue, may be placed under the control of an externally inducible gene promoter to place the timing of altering the characteristics of the plant under the control of the user. An advantage of introduction of a heterologous gene into a plant cell, particularly when the cell is comprised in a plant, is the ability to place expression of the gene under the control of a promoter of choice, in order to be able to influence gene expression, and therefore characteristic modification, according to preference. Furthermore, mutants and derivatives of the wild-type gene, eg with higher or lower activity than wild-type, may be used in place of the endogenous gene. In the present invention, over-expression may be achieved by introduction of the nucleotide sequence in a sense orientation. Thus, the present invention provides a method of influencing a physical e.g. a phenotypic characteristic described above such as, an increase or decrease in trichomes, characteristic of a plant, the method including causing or allowing expression of the product (polypeptide or nucleic acid transcript) encoded by heterologous nucleic acid according to the invention from that nucleic acid within cells of the plant.
Down-regulation of expression of a target gene may be achieved using anti-sense technology or "sense regulation" ("co-suppression") .
In using anti-sense genes or partial gene sequences to down-regulate gene expression, a nucleotide sequence is placed under the control of a promoter in a "reverse orientation" such that transcription yields RNA which is complementary to normal mRNA transcribed from the "sense" strand of the target gene. See, for example, Smith et al , (1988) Nature 334, 724-726; Zhang et al , (1992) The Plant Cell 4, 1575-1588, English et al . , (1996) The Plant Cell 8, 179-188. Antisense technology is also reviewed in Bourque, (1995) , Plant Science 105, 125-149, and Flavell, (1994) PNAS USA 91, 3490-3496. An alternative is to use a copy of all or part of the target gene inserted in sense, that is the same, orientation as the target gene, to achieve reduction in expression of the target gene by co-suppression. See, for example, van der Krol et al . , (1990) The Plant Cell 2 , 291-299; Napoli et al . , (1990) The Plant Cell 2 , 279- 289; Zhang et al . , (1992) The Plant Cell 4, 1575-1588, and US-A-5,231,020.
The complete sequence corresponding to the coding sequence (in reverse orientation for anti-sense) need not be used. For example fragments of sufficient length may be used. It is a routine matter for the person skilled in the art to screen fragments of various sizes and from various parts of the coding sequence to optimise the level of anti-sense inhibition. It may be advantageous to include the initiating methionine ATG codon, and perhaps one or more nucleotides upstream of the initiating codon. A further possibility is to target a conserved sequence of a gene, e.g. a sequence that is characteristic of one or more genes, such as a regulatory sequence .
The sequence employed may be about 500 nucleotides or less, possibly about 400 nucleotides, about 300 nucleotides, about 200 nucleotides, or about 100 nucleotides. It may be possible to use oligonucleotides of much shorter lengths, 14-23 nucleotides, although longer fragments, and generally even longer than about 500 nucleotides are preferable where possible, such as longer than about 600 nucleotides, than about 700 nucleotides, than about 800 nucleotides, than about 1000 nucleotides or more.
It may be preferable that there is complete sequence identity in the sequence used for down-regulation of expression of a target sequence, and the target sequence, though total complementarity or similarity of sequence is not essential. One or more nucleotides may differ in the sequence used from the target gene. Thus, a sequence employed in a down-regulation of gene expression in accordance with the present invention may be a wild-type sequence (e.g. gene) selected from those available, or a mutant, derivative, variant or allele, by way of insertion, addition, deletion or substitution of one or more nucleotides, of such a sequence. The sequence need not include an open reading frame or specify an RNA that would be translatable. It may be preferred for there to be sufficient homology for the respective anti-sense and sense RNA molecules to hybridise. There may be down regulation of gene expression even where there is about 5%, 10%, 15% or 20% or more mismatch between the sequence used and the target gene .
Generally, the transcribed nucleic acid may represent a fragment of an TTGI gene, such as including a nucleotide sequence shown in Figure 3, or the complement thereof, or may be a mutant, derivative, variant or allele thereof, in similar terms as discussed above in relation to alterations being made to an TTGI coding sequence and the homology of the altered sequence. The homology may be sufficient for the transcribed anti-sense RNA to hybridise with nucleic acid within cells of the plant, though irrespective of whether hybridisation takes place the desired effect is down-regulation of gene expression.
Thus, the present invention also provides a method of influencing a characteristic of a plant such as any one of those described above, the method including causing or allowing anti-sense transcription from heterologous nucleic acid according to the invention within cells of the plant.
The present invention further provides the use of the nucleotide sequence of Figure 3 or a fragment, mutant, derivative, allele, variant or homologue thereof for down-regulation of gene expression, particularly down-regulation of expression of an TTGI gene or homologue thereof, preferably in order to influence a physical characteristic of a plant, especially a phenotypic characteristic such as an increase or decrease of trichomes on different organs and/or and increase or decrease in anthocyanin pigments. When additional copies of the target gene are inserted in sense, that is the same, orientation as the target gene, a range of phenotypes is produced which includes individuals where over-expression occurs and some where under-expression of protein from the target gene occurs . When the inserted gene is only part of the endogenous gene the number of under-expressing individuals in the transgenic population increases. The mechanism by which sense regulation occurs, particularly down-regulation, is not well-understood. However, this technique is also well-reported in scientific and patent literature and is used routinely for gene control. See, for example, van der Krol et al . , (1990) The Plant Cell 2, 291-229; Napoli et al . , (1990) The Plant Cell 2, 279-289; Zhang et al , 1992 The Plant Cell 4 , 1575-1588. Again, fragments, mutants and so on may be used in similar terms as described above for use in anti-sense regulation.
Further options for down regulation of gene expression include the use of ribozymes, e.g. hammerhead ribozymes, which can catalyse the site-specific cleavage of RNA, such as mRNA (see e.g. Jaeger (1997) "The new world of ribozymes" Curr Opin Struct Biol 7:324-335, or Gibson & Shillitoe (1997) "Ribozymes : their functions and strategies form their use" Mol Biotechnol 7: 242-251.). Another option is the use of nucleic acids enconding non- functional or partially functional mutant proteins (e.g. encoded by the mutant alleles of the present invention, or as produced by mutagenesis) which, when expressed in a plant, may compete with functional TTGI proteins for e.g. receptors or other binding partners thereby reducing the effectiveness of those proteins.
Thus, the present invention also provides a method of influencing a phenotypic characteristic of a plant, the method including causing or allowing expression from nucleic acid according to the invention within cells of the plant. This may be used to suppress activity of a product with ability to influence a phenotypic characteristic as described above. Here the activity of the product is preferably suppressed as a result of under-expression within the plant cells.
Also embraced within the present invention are untranscribed parts of the TTGI gene. Thus in a further aspect of the present invention there is disclosed a nucleic acid molecule encoding the promoter of the TTGI gene . Owing to the widespread presence of the TTGI transcript in the plant it is believed that this promoter is constitutive or essentially constitutive, and thus may have utility in producing constructs for the expression of genes in plants . Variant promoters having promoter activity are also embraced by the present invention. To find homologous promoters, or the minimal elements or motifs responsible for promoter activity, restriction enzyme or nucleases may be used to digest a nucleic acid molecule comprising the 5' region of Seq ID No 1, or mutagenesis may be employed, followed by an appropriate assay (for example using a reporter gene such as luciferase operably linked to the restricted sequence) . Methods for promoter identification may employed without burden by those skilled in the art in the light of the sequence data disclosed herein. Once characterised the promoters of the present invention may incorporated into vectors.
Aspects and embodiments of the present invention will now be illustrated, by way of example, with reference to the accompanying figures . Further aspects and embodiments will be apparent to those skilled in the art. All documents mentioned in this text are incorporated herein by reference. Brief description of the drawings
Figure 1
This shows A) a large scaled map (not to scale) of the TTGI region showing the relationship between probes used in the region. YAC end probes are indicated by boxes; and B) a fine scale map of the TTGI region determined by RFLP mapping. End probes generated from YACs are indicated as boxes. The position of the cosmid g4556 is shown in relation to the genomic lambda isolated from this region. The recombination points are marked with a cross.
Figure 2 This shows a map of genomic clones used to complement ttgl mutants . It shows the genomic fragments that have been used to complement the ttgl mutant phenotype . Fragment (a) represents the genomic 13.8kb insert in pB8 with the EcoRI (E) and Xbal (X) restriction sites some of which are used to create the deletion.
Fragment (b) represents pB8DXl, (c) represents pB8DE2 and (d) represents pB8DE3. Deletions are indicated by the dotted lines. Fragments (a) and (d) both gave transformed plants with trichomes and anthocyanin. Transformants of (b) and (c) lacked both trichomes and anthocyanin.
Figure 3A and 3B
This shows the sequence of TTGI locus (SEQ ID No. 1) . The intron (coding sequence) is in italics and the predicted amino acid sequence (SEQ ID No. 2) is shown by the single letter code under the nucleotide sequence. Five identified mutations are shown (ttgl.10, ttgl.19, ttgl.20, ttgl.l, ttgl .9 - SEQ ID No's. 3 to 7 respectively) . Sixty bases are shown per line.
Figure 4
This shows an alignment of TTGI, AN11 from petunia and the partial sequences from Matthiola (Seq ID No 8) and tobacco (Seq ID Nos 9 and 10) .
Figure 5
Shows the predicted cDNA sequence of TTGI . This corresponds to the region shown in capitals in Fig 3, plus a further 10 nucleotides which were subsequently mapped to the end of the transcript by primer extension studies .
Summary of sequence ID Nos
1: Full length TTGI DNA sequence shown in Fig 3, including promoter region, full cDNA sequence, and coding sequence (which is aligned with the amino acid sequence) . 2 : TTGI amino acid sequence in Fig 3 and 4.
3: DNA sequence ttgl.10
4: DNA sequence ttgl.19
5 : DNA sequence ttgl .1
6: DNA sequence ttgl.20 7: DNA sequence ttgl .
8 : Partial amino acid sequence of Matthiola TTGI homologue
9: Partial amino acid sequence of first tobacco TTGI homologue . 10 : Partial amino acid sequence of second tobacco TTGI homologue .
11 and 12 : Degenerate primers for cloning TTGI homologues
(see Examples below) .
13: DNA sequence of ttgl.15, ttgl.16, ttgl .17 and ttgl.18.
Detailed description.
Molecular Mapping of ttgl.l
Recombinants between the ttgl . l and MSI loci generated in a cross between Landsberg erecta carrying ttgl and msl and Ws ecotypes were analysed using RFLPs (restriction fragment length polymorphism) between these parents with probes already mapped to this region by Nam et al (Nam et al (1989) Plant Cell 1 699-705.). Recombinants on the distal side of ttgl . l were selected from a cross of Landsberg erecta carrying the ttgl . l mutation and ga3 and ch5 and the RLD1 ecotype which was wildtype for these loci. End probes from YACs that had been mapped to the region between msl and ga3 (Schmidt et al (Schmidt et al (1997) Plant J 11 563-572)) were also utilized to map the location of ttgl. RFLPs generated by the cosmid g4556 could not be separated from ttgl with the exception of one recombinant called Dennis 19 (on the msl side of ttgl) suggesting that g4556 was very close to the mutation in ttgl . l . A YAC EG20H2 that hybridised to g4556 and the cosmid were used to isolate overlapping genomic lambda clones. The lambda clones were ordered using restriction mapping and hybridization techniques and then used as probes for RFLPs amongst the recombinants. Figure 1 shows the large scale and fine scale maps from the TTGI region that were derived from the analysis of the recombinants on both sides of the ttgl . l mutation. In Figure IB the position of several recombination events between msl and ttgl have been indicated. On the distal side of ttgl no nearby recombination events could be mapped due to lack of RFLPs between the ecotypes used and lack of success in isolating clones from this region from three different libraries in lambda or cosmid vectors using a variety of probes .
Complementation of the ttgl mutation
The complete genomic inserts from overlapping lambda clones 1.1A, 3.1A, 8 and X6 marked in Figure IB were subcloned into pBinNOT using the flanking Notl restriction sites in the lambda vector, giving a pB series of binary vectors. These were transferred into
Agro acteriuiT! tumefaciens strain AGL1 (Lazo et al (1991) Biotechnology 9 963-967) and then used to transform Arabidopsis ecotype Landsberg erecta carrying the ttgl . l mutation by co-cultivation with root explants (Valvekens et al (1988) Proc Natl Acad Sci USA 85 5536-5540) . Only a small number of kanamycin-resistant transformants were obtained, but one plant from pB8, derived from lambda 8, had trichomes but failed to set seed. Transformants from pBl.lA did not have trichomes. Other kanamycin-resistant shootlets appeared to be escapes due to prolonged exposure to kanamycin in the callus stage. Several deletions were made of pB8 utilizing restriction sites within the genomic sequence and the polylinker of the vector. These deletion constructs (shown in Figure 2) were used to transform Arabidopsis ecotype Columbia carrying the ttgl . 9 mutation via vacuum infiltration (Bechtold et al (1993) Compt Rend Acad Sci Ill-Life Sci 316 1194-1199) . Sixty transformants from pB8DE3 (indicated as construct d) produced trichomes, although one transformant from pB8DE3 showed kanamycin resistance but had no trichomes. The transformants bearing trichomes exhibited other wild-type characteristics of brown seed, seed mucilage, purple colouring of the plant and normal root hair numbers indicating that the other ttgl mutant phenotypes had also been complemented. Thirty-six kanamycin-resistant transformants from pB8DE2 (construct c) and 19 transformants from pB8DXl (construct b) failed not produce trichomes, suggesting that TTGI was located in the regions deleted in these constructs .
The positional cloning of the TTGI locus has provided information about the order of, and distances between, a number of RFLP markers which may be used to isolate nearby genes. This information is complementary to the data given in the physical maps of the region (Schmidt et al . , 1997 Plant J 11: 563-572; Thorlby et al., 1997 Plant J 12, 471-479). Although the present inventors analysed about 400 recombinants within a 14 map unit region, they were still unable to find breakpoints very close to the TTGI locus . This suggests that recombination rates are reduced close to this gene. Recombination frequencies are known to vary along chromosomes in many species (Lichten and Goldman, 1995 Annu Rev Genet 29: 445-476 for a review) .
Subcloning of pB8DE3 and sequencing
Restriction fragments of the insert in pB8DE3 were ligated into pBluescript, sequenced using fluorescent dideoxynucleotides and the sequences compiled and analysed using the GCG package. Sequence analysis with Genmark and Netgene revealed two possible genes in the 5777 bp insert in pB8DE3.
One of these genes revealed no consistent homology to any known protein, was not similar to any EST clone in data-bases, and did not appear to code a long ORF.
The other predicted gene corresponded to an Arabidopsis EST (F20055, F20056) , indicating that the gene was functional and expressed. The predicted protein sequence of 341 amino acids shows sequence similarity (about 45 %) to a large and diverse group of proteins with WD40 repeat motifs. There are seven WD40 repeats with a short N-terminal region. The first two repeats contain a proline-rich region - the second repeat having 8/23 amino acids which are proline. Three possible TATA boxes have been identified 133, 189 and 216 bases upstream of the predicted start of translation. Comparison of the genomic and EST sequences indicated the presence of a single intron 3' of the termination codon. The sequence including the promoter region is shown in Figure 3. Primer extension experiments indicated that the start of transcription is 109 bases 5' of the start of translation (i.e. 23 bases from a TATA box) .
Sequence analysis of ttcrl mutants
The present inventors examined the nucleotide sequence of this region of a number of ttgl mutant alleles to determine whether the gene encoding the WD40 repeat protein was likely to be the TTGI locus. PCR products from the region were generated with primers designed to give overlapping fragments of about 700 bp. These PCR products were obtained from the ttgl mutants ttgl . 9, ttgl . 10, ttgl . 19 and ttgl . l and from their parental wild-type alleles using genomic DNA as the template. The PCR products were gel-purified and sequenced using both oligonucleotides designed as primers for PCR.
Four of the mutant alleles contained point mutations that would result in changes to the protein product (Figure 3) . Point mutations in ttgl . 19 and ttgl . l resulted in the introduction of stop codons at codons 183 and 317, respectively.
A point mutation in ttgl . 20 resulted in the change of serine to cysteine at codon 30. This allele, plus also alleles ttgl . 15-18 contained a premature stop codon at position 310. A mutation in ttgl . 9 resulted in the change of serine to phenylalanine at codon 282.
The mutant allele ttgl . 10 contained a point mutation (G to A) in the 5' untranslated leader sequence, which may affect the translation of the RNA. These changes in the gene encoding the WD40 repeat protein confirm its identity as TTGI .
Several of the mutants have a lower level of transcript detected by northern analysis and compared with message levels from the wildtype parent (results not shown) . The reduction in message level in mutants such as ttgl -9 could be due to nonsense-mediated mRNA decay which has been shown to occur in plants as well as in other organisms (Dickey et al . , 1994 Plant Cell 6, 1171-1176; van Hoof and Green, 1996 Plant J 10: 415-424).
In summary the phenotypes (all were ttgl-like) and mutations are as follows: Nasc No Parent Jasc Decription Allele Mutation
N300 An-1 pale, branched ttgl.15 s310 -> *
N319 En-2 dwarf ttgl.16 s310 -> *
N339 En-2 pale ttgl.17 s310 -> *
N372 En-2 upright rosette ttgl.16 s310 -> *
N406 En-2 ttgl.19 wl83 -> *
N420 En-2 early flowering ttgl.20 s30 -> c s310 -> *
N447 En-2 dwarf ttgl.l (as ttgl.l)
Effect of TTGI on stomata of Arabidopsis
TTGI effect on stomata appears to be analogous to control of root hairs (Berger et al, 1998, Dev Biol 194: 226-234) .
The table below shows a comparison between the stomatal numbers on hypocotyls in ttgl mutants compared to wild-type in air and at elevated C02 concentrations.
Air C02
Landsberg erecta: 23.2+3.2 8.5+1.9 ttgl mutant: 22.8±1.4 16.0±1.6
Analysis of the expression of the TTGI gene
To determine the length of the TTGI transcript and to see if the expression of the gene was confined to some organs, an RNA blot was hybridized with a TTGI probe. The resulting band was measured to be 1.35 kb in length and present in all organs tested (roots, rosette leaves, leaf buds, stems, cauline leaves, siliques, flowers, floral buds) . A surprise result was that it was highly expressed in floral meristems where there are only a few trichomes on the sepals and no anthocyanin in the flower petals.
The high level of transcripts of the TTGI locus suggests two possible points of regulation of the gene. The first is that the TTGI protein is present in many tissues where it requires a partner for activation. Another possibility is that regulation of this gene occurs a posttranscriptional stage with protein only being present in those cells that require functional TTGI protein.
Structure and function of the TTGI locus The present inventors have identified the TTGI locus as a gene encoding a WD40 repeat protein by complementation of a ttgl mutant with genomic DNA and by sequencing the gene in several ttgl mutant alleles. Two of the mutant alleles contained stop codons that would result in the production of truncated proteins, and two others contained point mutations that would change serine residues, to a cysteine and phenylalanine residues. The TTGI protein bears no resemblance to the maize R gene product which was able to complement the ttgl mutant phenotype in Arabidopsis and anthocyanin pigment in tobacco flowers (Lloyd et al (1992) Science 258 1773- 1775) . This suggests that TTGI acts upstream of an Arabidopsis R homologue in the pathways leading to trichome differentiation and anthocyanin synthesis . The involvement of other WD40 proteins in signal transduction pathways (see below) suggests that TTGI is involved in a pathway, or pathways, regulating the expression or action of downstream transcription factors.
Computer modelling of TTGI
WD40 repeat proteins are involved in a number of different types of regulatory roles, such as signalling (eg. Gø subunit of heterotrimeric G proteins) , cell cycle regulation (eg CDC20 and CDC4) , transcriptional repression (eg yeast TUP1, Drosophila extra sex combs) , vesicular trafficking (eg SEC13) and RNA processing (Neer et al (1994) Nature 371 297-300) . The TTGI protein shows the highest sequence similarity to G0 subunits, which are the best characterised of the WD40 repeat proteins. The Gβ subunit contains 7 repeats of the WD40 motif and has a structure resembling a seven-bladed propeller, based on its crystal structure with its partner Gγ (Sondek et al (1996) Nature 379 369-374) . Each blade is composed of 4 β-sheets, and an N-terminal amphipathic o;-helix interacts closely with the Gγ subunit which is required for correct folding and function of the G^ subunit. However, computer- aided modelling of TTGI (in collaboration with N.
Srinivasan and T.L. Blundell) suggests that proline-rich regions of the first two WD40 repeats may disrupt the folding of the 3-sheets essential for the structure of each blade. In addition, amino acid residues identified as interacting with Ga and Gγ subunits are not well conserved in TTGI. This may suggest that TTGI represents a separate class of WD40 repeat protein. Genes encoding several WD40 proteins, including G^ subunits and the COP1 protein, have been isolated from plants (Ma (1994) Plant Mol Biol 26 1611-1634) , but none of these proteins closely resembles TTGI other than anil.
Thus studies of other WD40 proteins make it probable that the TTGI protein does not act directly as a transcription factor but binds to other proteins to promote the initiation of trichomes in leaves and stems. The TTGI protein may act as part of a DNA binding complex to regulate transcription. However the amino acid sequence contains no recognizable nuclear localization signal from computer analysis, although a cryptic site might be present. Another possibility is that another protein is required to form a complex for nuclear import as is the case with AP3 and PI from Arabidopsis (McGonigle et al . , 1996 Genes Dev 10, 1812-1821). Another possibility is that the TTGI protein is only located in the cytoplasm and acts as part of a signal transduction pathway. GUS-TTG1 and TTG1-GFP fusion proteins appear to be cytoplasmically located. There are several sequences of unknown function that show higher similarity to TTGI than the G^ subunit from either plants or animals. One sequence is the result of a genomic sequencing project in Arabidopsis and has a 85% similarity to TTGI and is located on chromosome 3. Gel blots hybridized and washed at low stringency show at least three bands in Arabidopsis and maize suggesting that TTGI could belong to a class of proteins . Two C. elegans genes arrayed in tandem have greater similarity to TTGI than any locus from yeast Saccaromyces cerevici . This is surprising as yeast is more closely related to Arabidopsis than C. elegans is. However if TTGI plays a role in defining functions in epidermal cells, this function may also be required in other multicellular organisms .
Transcription factors like those from maize and Antirrhinum have been identified in Petunia where they regulate flower colour (reviewed in Mol et al . , 1996). JAF13 is similar to the R gene and Delila and AN2 encodes a MYB factor like Cl from maize. These are thought to act together in a similar way to R and Cl to positively regulate the anthocyanin pathway. AW1I from Petunia controls anthocyanin biosynthesis in flowers (de Vetten et al., 1997 Genes Dev 11, 1422-1434) possibly by regulating AN2. This is contrary to the evidence that
TTGI might regulate MYC transcription factors, determined by overexpression of the maize R gene in Arabidopsis (Lloyd et al . , 1992 Proc Nat Acad Sci USA 86: 7092-7096). The identification of two WD40 repeat proteins which regulate anthocyanins and, in the case of the TTGI protein, many other pathways suggests that this class of protein may be involved in regulating developmental pathways in other organisms .
Cloning of the Matthiola incana, Nicotiana tobaccum and cotton homologues
Primers for degenerate PCR were designed by comparison of the TTGI sequence with the anil sequence (see Fig 4) . Primers were based on the sequence encoding amino acids 74-85, and 296-307.
Primer sequences were :
5' (TL3) TTYGAICAYCCITAYCCICCIACIAARYTIATGTT (Seq ID No 11)
3' (TL4) CATIGGRTCRATICCRTTIGGICCIGCIACIGTNGG (Seq ID No 12)
Amplification conditions were ascertained using a temperature gradient in a Robocycler (Stratagene) . lng of genomic DNA template was amplified with 5pmol of each primer using Taq DNA polymerase (Qiagen) with the addition of QX (Qiagen) for stabilising the DNA-primer complex.
Once an annealing temperature of 42°C was established, genomic DNA from Matthiola incana (ten week stock) . Nicotiana tobaccum var Samsum (tobacco) and
Gossypium hirsutum cv. Siokva 1-4 (cotton) was used as a template in 50ul reactions with the temperature ramped between annealing and extension to 15 degrees per minute. The amplified bands were size-fractioned and extracted from a gel. The Matthiola DNA was polished with Klenow enzyme to make a blunt end ligated into the EcoRV site of pBluescript . Three independent constructs were sequenced. The gel purified PCR product was used as a probe on a Southern blot to verify that the product originated from Matthiola. At high stringency (65°C in 0.1XSSC+1% SDS) the PCR product cross hybridises to the TTGI gene of Arabidopsis.
Tobacco sequences (tobacco 1 and 2) were obtained from two constructs (pTOBl and pT0B2) using T-vectors bases on pBluescript. Each corresponds to one of the genomic sequences found in tobacco which is an allotetraploid species. The two sequences are 95% identical at both the nucleotide and the amino acid level . pTOBl hybridises to both tobacco genes but only weakly to TTGI at high stringency. The cotton gene is currently being sequenced. Garden blots reveal the presence of several similar sequences in Arabidopsis, tobacco, Petunia and zea mays when hybridised at 50°C in 5M NaCl and washed at 50°C in IXSSC+1% SDS.
Use of TTGI anti-sense constructs
The insert in construct pTOBl was removed using Sad and EcoRV and ligated into the Sad and Smal sites in the pROK2 vector. This gives an antisense construct with the 35S promoter driving a transcript from the complementary strand of the TOB1-TTG1 gene. Constructs containing the TOBl sequence may be placed into the Agrobacterium strain LBA4404 for transfer into tobacco plants.
General methods General methods were perfomed in accordance with
Sambrook et al (1989) discussed above.
Plant material
Ecotypes Landsberg erecta, Columbia, RLD1, Ws (Wassilewskija) were supplied by the Nottingham
Arabidopsis Stock Centre. The Landsberg erecta line carrying msl and ttgl . l was from . The line containing ttgl . l ga3 ch5 in Landsberg erecta background was a gift.
Growing plants a) for crosses and seed in the 'Arabicon system', 3:2:1 soil vermiculite perlite was used with 16 hours light. b) for material in trays, 12 hours light was used. c) for transformation, peat-based soil with 5-10 plants in 4 inch pots in a glasshouse with supplementary lighting was used. Pots were covered in muslin, and seed mixed with sand was sprinkled on top. Plants were thinned to 10 per pot at 2-3 weeks. Bolts were cut back once and allowed to reemerge for several days before infiltration. All three types of plants received a weekly feed of macro nutrients . d) in culture 1/2 MS + 0.8% agar 16 hours light was used, e) for root material, plants were grown on plates containing 1/2MS +1.2% phytogel in a nearly vertical position.
Searching the Kranz collection for more ttgl alleles.
Candidates described as glabrous and having yellow seeds were grown and crossed to ttgl.l mutants. The F2 generation was examined for segregating phenotypes. All were examined for seed mucilage, anthocyanin in the plant and the testa and for leaf hairs.
DNA extractions
Plant material was treated as in Dellaporta et al. (1983) Plant Mol Biol Rep 1,4, 19-21, followed by CsCl banding to remove RNA and polysaccharides (Walker et al, 1997 Photosyn Res 54, 155-163) so as to be able to detect small band shifts on DNA gel blots.
Library screens Genomic library in lambda Dashll (Stratagene) from
Landsberg erecta (Boyce et al, 1994 Plant Physiol 106, 1691) distributed by EEC-BRIDGE Arabidopsis DNA Stock Centre .
Construction of the pBINNOT vector
The pBIN19 vector was modified to contain a Notl site in the polylinker. To remove the original Notl site, pBIN19 was restricted with Notl, treated with Klenow and dNTPs to fill in the site, ligated in a large volume, restricted again with Notl and transfected into E.coli strain TGI. Plasmid DNA was isolated from resulting colonies to check that the original Notl site no longer existed. The vector was restricted with Xbal and Asp718. Two annealed oligonucleotides (Nl: GTACCGCGGCCGCAT AND N2 : CTAGATGCGGCCGCG) containing a Notl site were ligated into the vector to reconstitute the Xbal and Asp718 sites. The ligated DNA was restricted with BamHI to remove parental molecules. In effect, the BamHI site in the polylinker of pBIN19 has been replaced with a unique Notl site and the altered vector called pBINNOT.
Plant transformation
Agrobacterium strain Agll (Lazo et al . , 1991 Biotechnology 9, 963-967) was transformed with constructs based on pBINNOT vector by electroporation. Using vacuum infiltration (Bechtold et al , 1993 Compt Rend Acad Sci III Life Sci 316, 1194-1199) . DNA containing genomic fragments were introduced into ttgl.9 mutant plants.

Claims

Claims
1. An isolated nucleic acid molecule encoding a polypeptide with TTGI function.
2. A nucleic acid as claimed in claim 1 wherein the polypeptide includes the amino acid sequence SEQ ID No. 2
3. A nucleic acid as claimed in claim 2 comprising a nucleotide sequence having SEQ ID NO 1 or the coding sequence shown therein
4. An isolated nucleic acid comprising a nucleic acid sequence which shares at least 50%; 60%; 70%; 80%; 90%;
95%; 96%; 97%; 98%; 99% sequence identity with the nucleic acid of claim 2 or claim 3.
5. A nucleic acid as claimed in claim 4 which is a mutant , variant, derivative of any one of the nucleic acid sequences of claim 2 or claim 3 by way of addition, insertion, deletion or substitution of one or more nucleotides in the nucleic acid.
6. A nucleic acid as claimed in claim 4 which is a TTGI homologue from a species other than Arabidopsis thaliana
7. A nucleic acid as claimed in claim 6 which is a homologue from nicotiana or matthiola .
8. A nucleic acid as claimed in claim 7 which encodes a polypeptide comprising amino acid Seq ID No 8 , 9 or 10.
9. A nucleic acid as claimed in claim 4 which is an allele of the nucleic acid sequences of claim 2 or claim 3.
10. A nucleic acid as claimed in any one of claims 4 to 9 which encodes a polypeptide with TTGI function.
11. A nucleic acid as claimed in any one of claims 1 to 3 or claim 10 wherein the TTGI function comprises the ability to alter two or more of the following phenotypic characteristics of a plant into which said polypeptide is introduced: number of trichomes on the aerial parts of the plant; number of trichomes on the roots hairs; mucilage of the seeds; dormancy of the seeds; anthocyanin pigmentation; condensation of the tannins; number of stomata on hypocotyls .
12. A nucleic acid as claimed in claim 9 which comprises the nucleotide sequence of an allele selected from: ttgl.10, ttgl.19, ttgl.l, ttgl.20, ttgl.9.
13. A nucleic acid including or consisting essentially of a sequence of nucleotides complementary to the nucleotide sequence of a nucleic acid as claimed in any one of the preceding claims.
14. A method of identifying a homologue or allele as claimed in any one of claims 6 to 9 which method employs a nucleotide sequence obtainable from that shown in Seq ID No 1, said method including the steps of: (i) preparing nucleic acid from plant cells under test, (ii) providing a nucleic acid molecule which is a probe or primer having a nucleotide comprising all or part of a nucleotide sequence as claimed in claim 2 or claim 3, or complementary to that sequence
(iii) contacting nucleic acid in said preparation with said probe or primer under conditions for hybridization, and (iv) identifying said gene or homologue if present by its hybridization with said nucleic acid molecule.
15. A method as claimed in claim 14 further comprising the step of testing the homologue or allele for TTGI function.
16. A method as claimed in claim 14 or claim 15 wherein the plant cell nucleic acid is obtained from a plant species other than Arabidopsis thaliana .
17. An oligonucleotide for use as a nucleic acid probe or primer for use in the method as claimed in any one of claims 14 to 16, said oligonucleotide comprising: (i) a nucleotide sequence encoding an amino acid sequence which is conserved between TTGI and the polypeptide encoded by the nucleic acid of any one of claims 6 to 8, or
(ii) a nucleotide sequence which is complementary to said sequence .
18. An oligonucleotide as claimed in claim 17 comprising at least about 10; 15; 20; 25; 30 or 35 nucleotides in length.
19. A recombinant vector comprising the nucleic acid of any one of claims 1 to 13.
20. A vector as claimed in claim 19 wherein the nucleic acid is under the control of a promoter.
21. A vector as claimed in claim 20 wherein the promoter is an inducible promoter.
22. A vector as claimed in any one of claims 19 to 21 further comprising one or more of: a terminator sequence; a polyadenylation sequence; an enhancer sequence; a marker gene.
23. A host cell comprising the nucleic acid of any one of claims 1 to 13.
24. A host cell transformed with nucleic acid of any one of claims 1 to 13 or a vector of any one of claims 19 to 22.
25. A host cell having incorporated into its genome heterologous nucleic acid as claimed in any one of claims 1 to 13.
26. A host cell as claimed in any one of claims 23 to 25 which is a plant cell.
27. A method of making the plant cell of claim 26, the method comprising the steps of :
(i) introducing a vector as claimed in any one of claims 19 to 22 into the plant cell,
(ii) causing or allowing recombination between the vector and the plant cell genome to introduce a nucleic acid as claimed in any one of claims 1 to 13 into the genome.
28. A plant which has been regenerated from the plant cell of claim 26.
29. A plant as claimed in claim 28 including the plant cell of claim 26.
30. A plant as claimed in claim 29 which is a clone; selfed or hybrid progeny, or other offspring or descendant of the plant of claim 28.
31. A cutting, part, or seed or other propagule of a plant as claimed in any one of claims 28 to 30.
32. A polypeptide expression product of any one of the nucleic acids of claims 1 to 11 which has TTGI activity.
33. A polypeptide as claimed in claim 32 comprising the amino acid sequence SEQ ID No 2.
34. A method of making the polypeptide of claim 32 or claim 33 by causing or allowing expression from a nucleic acid encoding the polypeptide, following an earlier step of introduction of the nucleic acid into a cell of a plant or an ancestor thereof .
35. Use of a polypeptide of claim 32 or claim 33 or a nucleic acid of any of claims 1 to 13 to regulate the expression or action of a transcription factor.
36. Use of a polypeptide of claim 32 or claim 33 to raise an antibody.
37. An antibody having specific binding affinity for the polypeptide claimed in claim 32 or claim 33.
38. A polypeptide comprising the antigen-binding site of the antibody of claim 37.
39. A method of influencing or affecting a physical characteristic of a plant comprising causing or allowing expression of a heterologous nucleic acid sequence as claimed in any one of claims 1 to 13 within the cells of the plant, following an earlier step of introducing the nucleic acid into a cell of the plant or an ancestor thereof .
40. A method as claimed in claim 39 wherein the nucleic acid is expressed under the control of an inducible promoter.
41. A method for downwardy modulating the expression of a nucleic acid as claimed in any one of claims 1 to 12 in a plant, the method comprising any of the following: (i) causing or allowing transcription from a nucleic acid as claimed in claim 13 in the plant;
(ii) causing or allowing transcription from a nucleic acid as claimed in claims 1 to 12 or a part thereof in the plant such as to reduce expression by co-suppression; (iii) use of nucleic acid encoding a ribozyme specific for a nucleic acid as claimed in any one of claims 1 to 12.
42. A method of influencing any one or more of the following a phenotypic characteristics of a plant: insect protection; chemical production; climate tolerance; salt removal; fibre production; ornamental value; water and nutrient absorption; initiation of seed germination; pigmentation; taste; speed of seedling growth; the method comprising a method as claimed in any one of claims 39 to 41.
43. An isolated nucleic acid molecule comprising a sequence encoding the promoter sequence of the TGG1 gene, or a mutant, variant, derivative or other homolog thereof .
44. A vector comprising the promoter sequence of claim 43.
PCT/GB1998/001861 1997-06-25 1998-06-25 Materials and methods relating to a plant regulatory protein WO1999000501A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
AU81224/98A AU8122498A (en) 1997-06-25 1998-06-25 Materials and methods relating to a plant regulatory protein
EP98930949A EP1002086A1 (en) 1997-06-25 1998-06-25 Materials and methods relating to a plant regulatory protein

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
GB9713430.8 1997-06-25
GBGB9713430.8A GB9713430D0 (en) 1997-06-25 1997-06-25 Materials and methods relating to a plant regulatory protein

Publications (1)

Publication Number Publication Date
WO1999000501A1 true WO1999000501A1 (en) 1999-01-07

Family

ID=10814921

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/GB1998/001861 WO1999000501A1 (en) 1997-06-25 1998-06-25 Materials and methods relating to a plant regulatory protein

Country Status (4)

Country Link
EP (1) EP1002086A1 (en)
AU (1) AU8122498A (en)
GB (1) GB9713430D0 (en)
WO (1) WO1999000501A1 (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2348208A (en) * 1999-01-27 2000-09-27 Danisco Plants producing seeds with a transparent seed coat
WO2001002590A2 (en) * 1999-07-02 2001-01-11 MAX-PLANCK-Gesellschaft zur Förderung der Wissenschaften e.V. Plants with modified gene expression
WO2004046336A2 (en) * 2002-11-18 2004-06-03 Monsanto Technology, Llc Production of increased oil and protein in plants by the disruption of the phenylpropanoid pathway
WO2008101342A1 (en) * 2007-02-21 2008-08-28 University Of Manitoba Brassica rapa transparent testa genes for controlling seed colour in brassica species
EP2410056A1 (en) * 2005-07-20 2012-01-25 Agriculture Victoria Services Pty Ltd Modification of flavonoid biosynthesis in plants with transparent testa glabra 1
AU2012213953B2 (en) * 2005-07-20 2015-04-16 Agriculture Victoria Services Pty Ltd Modification of flavonoid biosynthesis in plants with transparent testa glabra 1
WO2017181018A1 (en) * 2016-04-14 2017-10-19 Ebbu, LLC Enhanced cannabis plants and methods of making and using the same
CN109593768A (en) * 2019-01-07 2019-04-09 华中农业大学 Application of the Platanus acerifolia PaGL1 gene in regulation plant epidermal hair
CN111788216A (en) * 2017-12-18 2020-10-16 森正之 Agent for controlling formation of vacuolar cells and plant body introduced with the agent
CN113699164A (en) * 2021-07-28 2021-11-26 山东师范大学 Application of Limonium bicolor gene LbTTG1

Non-Patent Citations (7)

* Cited by examiner, † Cited by third party
Title
COOKE R. ET AL.: "AC F20055", EMBL DATABASE, 5 March 1996 (1996-03-05), HEIDELBERG, XP002082374 *
COOKE R. ET AL.: "AC F20056", EMBL DATABASE, 5 March 1996 (1996-03-05), Heidelberg, XP002082375 *
DE VETTEN N. ET AL.: "The an11 locus controlling flower pigmentation in petunia encodes a novel WD-repeat protein conserved in yeast, plants, and animals", GENES & DEVELOPMENT, vol. 11, no. 11, 1 June 1997 (1997-06-01), pages 1422 - 1434, XP002082376 *
GALWAY M. ET AL.: "The TTG gene is required to specify epidermal cell fate and cell patterning in the Arabidopsis root", DEVELOPMENTAL BIOLOGY, vol. 166, no. 2, December 1994 (1994-12-01), pages 740 - 754, XP002082378 *
LARKIN J C ET AL: "Roles of the GLABROUS1 and TRANSPARENT TESTA GLABRA genes in Arabidopsis trichome development.", PLANT CELL 6 (8). 1994. 1065-1076. ISSN: 1040-4651, XP002004687 *
LLOYD A. ET AL.: "Arabidopsis and Nicotiana anthocyanin production activated by maize regulators R and C1", SCIENCE, vol. 258, 11 December 1992 (1992-12-11), pages 1773 - 1775, XP002003019 *
SHIRLEY B. ET AL.: "Analysis of Arabidopsis mutants deficient in flavonoid biosynthesis", THE PLANT JOURNAL, vol. 8, no. 5, November 1995 (1995-11-01), pages 659 - 671, XP002082377 *

Cited By (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2348208A (en) * 1999-01-27 2000-09-27 Danisco Plants producing seeds with a transparent seed coat
GB2348208B (en) * 1999-01-27 2001-07-18 Danisco Plants producing seeds with a transparent seed coat
WO2001002590A2 (en) * 1999-07-02 2001-01-11 MAX-PLANCK-Gesellschaft zur Förderung der Wissenschaften e.V. Plants with modified gene expression
WO2001002590A3 (en) * 1999-07-02 2001-08-09 Max Planck Gesellschaft Plants with modified gene expression
WO2004046336A2 (en) * 2002-11-18 2004-06-03 Monsanto Technology, Llc Production of increased oil and protein in plants by the disruption of the phenylpropanoid pathway
WO2004046336A3 (en) * 2002-11-18 2004-12-16 Monsanto Technology Llc Production of increased oil and protein in plants by the disruption of the phenylpropanoid pathway
US7268276B2 (en) 2002-11-18 2007-09-11 Monsanto Technology Llc Production of increased oil and protein in plants by the disruption of the phenylpropanoid pathway
US8519222B2 (en) 2005-07-20 2013-08-27 Agriculture Victoria Services Pty Ltd Modification of flavonoid biosynthesis in plants
EP2410056A1 (en) * 2005-07-20 2012-01-25 Agriculture Victoria Services Pty Ltd Modification of flavonoid biosynthesis in plants with transparent testa glabra 1
AU2012213953B2 (en) * 2005-07-20 2015-04-16 Agriculture Victoria Services Pty Ltd Modification of flavonoid biosynthesis in plants with transparent testa glabra 1
US9567600B2 (en) 2005-07-20 2017-02-14 Agriculture Victoria Servies PTY LTD Modification of flavonoid biosynthesis in plants
US9567599B2 (en) 2005-07-20 2017-02-14 Agriculture Victoria Services Pty Ltd Modification of flavonoid biosynthesis in plants
US20100287666A1 (en) * 2007-02-21 2010-11-11 Jiefu Zhang Brassica Rapa Transparent Testa Genes for Controlling Seed Colour in Brassica Species
WO2008101342A1 (en) * 2007-02-21 2008-08-28 University Of Manitoba Brassica rapa transparent testa genes for controlling seed colour in brassica species
WO2017181018A1 (en) * 2016-04-14 2017-10-19 Ebbu, LLC Enhanced cannabis plants and methods of making and using the same
CN109312356A (en) * 2016-04-14 2019-02-05 埃布股份有限公司 Enhanced cannabis plants and its preparation and application
CN111788216A (en) * 2017-12-18 2020-10-16 森正之 Agent for controlling formation of vacuolar cells and plant body introduced with the agent
CN111788216B (en) * 2017-12-18 2023-06-23 森正之 Agent for controlling formation of salt vesicle cell and plant body into which the agent is introduced
CN109593768A (en) * 2019-01-07 2019-04-09 华中农业大学 Application of the Platanus acerifolia PaGL1 gene in regulation plant epidermal hair
CN109593768B (en) * 2019-01-07 2020-12-15 华中农业大学 Application of Pagl1 gene in regulating and controlling plant epidermal hair
CN113699164A (en) * 2021-07-28 2021-11-26 山东师范大学 Application of Limonium bicolor gene LbTTG1
CN113699164B (en) * 2021-07-28 2023-02-28 山东师范大学 Application of Limonium bicolor gene LbTTG1

Also Published As

Publication number Publication date
EP1002086A1 (en) 2000-05-24
AU8122498A (en) 1999-01-19
GB9713430D0 (en) 1997-08-27

Similar Documents

Publication Publication Date Title
US6265637B1 (en) Genetic control of flowering
AU717860B2 (en) Flowering genes
US10538779B2 (en) Constitutive photomorphogenesis 1 (COP1) nucleic acid sequence from Zea mays and its use thereof
AU738652B2 (en) Genetic control of plant growth and development
WO1999000501A1 (en) Materials and methods relating to a plant regulatory protein
KR100360305B1 (en) How to shorten the node spacing of flowering following the introduction of SPEL 2 to the warrior petpet obtained from Petunia
US7560612B2 (en) Early-inflorescence-preferred regulatory elements and uses thereof
US20020199219A1 (en) Modulation of abscisic acid signal transduction in plants
US7199285B2 (en) Modulation of abscisic acid signal transduction in plants
KR100432534B1 (en) Asexually producible plant for shortening internode of inflorescence by introducing gene for petunia transcription factor PetSPL2
Bradley Flowering genes

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GE GH GM GW HU ID IL IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG US UZ VN YU ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW SD SZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN ML MR NE SN TD TG

DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 1998930949

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: JP

Ref document number: 1999505378

Format of ref document f/p: F

WWE Wipo information: entry into national phase

Ref document number: 09446810

Country of ref document: US

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

WWP Wipo information: published in national office

Ref document number: 1998930949

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: CA

WWW Wipo information: withdrawn in national office

Ref document number: 1998930949

Country of ref document: EP