WO2003067965A2 - Identification and characterization of an glutamine dumper 1 phenotype (gdui1) in arabidopsis - Google Patents

Identification and characterization of an glutamine dumper 1 phenotype (gdui1) in arabidopsis Download PDF

Info

Publication number
WO2003067965A2
WO2003067965A2 PCT/US2003/004120 US0304120W WO03067965A2 WO 2003067965 A2 WO2003067965 A2 WO 2003067965A2 US 0304120 W US0304120 W US 0304120W WO 03067965 A2 WO03067965 A2 WO 03067965A2
Authority
WO
WIPO (PCT)
Prior art keywords
gdul
sequence
plant
plants
polypeptide
Prior art date
Application number
PCT/US2003/004120
Other languages
French (fr)
Other versions
WO2003067965A3 (en
Inventor
Guillaume Leonard Pilot
Stephen Aaron Lee
Pilar Puente
Jennifer Lee Rhein
Philip Reid Timmons
Michael David Tomalski
Vincent Paul Mary Wingate
Original Assignee
Agrinomics, Llc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Agrinomics, Llc filed Critical Agrinomics, Llc
Priority to AU2003216244A priority Critical patent/AU2003216244A1/en
Publication of WO2003067965A2 publication Critical patent/WO2003067965A2/en
Publication of WO2003067965A3 publication Critical patent/WO2003067965A3/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/415Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8216Methods for controlling, regulating or enhancing expression of transgenes in plant cells
    • C12N15/8222Developmentally regulated expression systems, tissue, organ specific, temporal or spatial regulation
    • C12N15/8223Vegetative tissue-specific promoters
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8241Phenotypically and genetically modified plants via recombinant DNA technology
    • C12N15/8242Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
    • C12N15/8243Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine
    • C12N15/8251Amino acid content, e.g. synthetic storage proteins, altering amino acid biosynthesis

Definitions

  • the present invention relates to a plant phenotype, designated GLUTAMINE
  • GDU1 DUMPER (GDU1), together with DNA and polypeptide sequences associated with the same.
  • Activation tagging is a method by which genes are randomly and strongly up- regulated on a genome-wide scale, after which specific phenotypes are screened for and selected. Isolation of mutants by activation tagging has been reported (Hayashi et al, 1992). An activation T-DNA tagging construct was used to activate genes in tobacco cell culture allowing the cells to grow in the absence of plant growth hormones (Walden et al. , 1994). Genes have been isolated from plant genomic sequences flanking the T-DNA tag and putatively assigned to plant growth hormone responses. (See, e.g., Miklashevichs et al. 1997, Harling et al, 1997; Walden et. al., 1994; and Schell et al, 1998, which discusses related studies.)
  • the first gene characterized in Arabidopsis using activation tagging was a gene encoding the histone kinase involved in the cytokinin signal transduction pathway.
  • the gene sequence was isolated from plant genomic DNA by plasmid rescue and the role of the gene, CAT/1, in cytokinin responses in plants was confirmed by re-introduction into Arabidopsis (Kakimoto, 1996). This was followed by reports of several dominant mutants such as TINY, LHY and SHI using a similar approach along with the Ds transposable element (Wilson et al, 1996, Schaffer et al, 1998, Fridborg et al, 1999). In a more recent report, activation T-DNA tagging and screening plants for an early flowering phenotype led to the isolation of the FT gene (Kardailsky et al., 1999).
  • the invention provides nucleic acid and amino acid sequences associated with the
  • GDU1 GLUTAMINE DUMPER 1
  • the invention provides one or more isolated GDU1 nucleic acid sequences comprising a nucleic acid sequence that encodes or is complementary to a sequence that encodes a GDU1 polypeptide having at least 70%, 80%, 90% or more sequence identity to the amino acid sequence presented as SEQ ID NO:2.
  • the polynucleotide comprises a nucleic acid sequence that hybridizes, under high, medium, or low stringency conditions to the nucleic acid sequence, or fragment thereof, presented as SEQ ID NO:l, or the complement thereof.
  • expression of one or more of such GDU1 polynucleo tides in a plant is associated with the GDU1 phenotype.
  • the invention further provides plant transformation vectors, plant cells, plant parts and plants comprising a GDU1 nucleic acid sequence.
  • GDU1 nucleic acid sequence in a plant is associated with the GDU1 phenotype, presented as a glutamine secretion phenotype.
  • a GDU1 nucleic acid sequence may be modified in ornamental plants, fruit and vegetable-producing plants, grain-producing plants, oil-producing plants and nut-producing plants, as well as other crop plants, resulting in the GDU1 phenotype.
  • the invention provides a method of modifying the glutamine secretion in a plant by introducing a GDU1 nucleic acid sequence into plant progenitor cells and growing the cells to produce a transgenic plant.
  • Figure 1A is a schematic representation of a first T-DNA insertion in the genome, which depicts the probable genomic rearrangement. Portions of chromosome 4 and chromosome 5 are indicated “Ch 4" and "Ch5,” respectively.
  • Figure IB is a schematic representation of a second T-DNA insertion in the genome, which was likely associated with a duplication of some of the sequences in the ACTTAG vector.
  • the black arrow represents the gene designated GDUl.
  • White arrows designate other predicted genes.
  • the directions of the arrows designate direction of transcription.
  • the gray arrow designates the BAR gene, and the striped boxes represent the multimerized 35S enhancer.
  • vector refers to a nucleic acid construct designed for transfer between different host cells.
  • expression vector refers to a vector that has the ability to incorporate and express heterologous DNA fragments in a foreign cell. Many prokaryotic and eukaryotic expression vectors are commercially available. Selection of appropriate expression vectors is within the knowledge of those having skill in the art.
  • a “heterologous” nucleic acid construct or sequence has a portion of the sequence which is not native to the plant cell in which it is expressed.
  • Heterologous, with respect to a control sequence refers to a control sequence (i.e. promoter or enhancer) that does not function in nature to regulate the same gene the expression of which it is currently regulating.
  • heterologous nucleic acid sequences are not endogenous to the cell or part of the genome in which they are present, and have been added to the cell, by infection, transfection, microinjection, electroporation, or the like.
  • a “heterologous” nucleic acid construct may contain a control sequence/DNA coding sequence combination that is the same as, or different from a control sequence/DNA coding sequence combination found in the native plant.
  • the term "gene” means the segment of DNA involved in producing a polypeptide chain, which may or may not include regions preceding and following the coding region, e.g. 5' untranslated (5' UTR) or “leader” sequences and 3' UTR or “trailer” sequences, as well as intervening sequences (introns) between individual coding segments (exons).
  • 5' UTR 5' untranslated
  • leader leader
  • 3' UTR or “trailer” sequences as well as intervening sequences (introns) between individual coding segments (exons).
  • percent (%) sequence identity with respect to a subject sequence, or a specified portion of a subject sequence, is defined as the percentage of nucleotides or amino acids in the candidate derivative sequence identical with the nucleotides or amino acids in the subject sequence (or specified portion thereof), after aligning the sequences and introducing gaps, if necessary to achieve the maximum percent sequence identity, as generated by the program WU-BLAST-2.0al9 (Altschul et al, J. Mol. Biol. (1997) 215:403-410; blast.wustl.edu/blast/README.html website) with all the search parameters set to default values.
  • the HSP S and HSP S2 parameters are dynamic values and are established by the program itself depending upon the composition of the particular sequence and composition of the particular database against which the sequence of interest is being searched.
  • a % identity value is determined by the number of matching identical nucleotides or amino acids divided by the sequence length for which the percent identity is being reported.
  • Percent (%) amino acid sequence similarity is determined by doing the same calculation as for determining % amino acid sequence identity, but including conservative amino acid substitutions in addition to identical amino acids in the computation.
  • % homology is used interchangeably herein with the term "% identity.”
  • a nucleic acid sequence is considered to be "selectively hybridizable" to a reference nucleic acid sequence if the two sequences specifically hybridize to one another under moderate to high stringency hybridization and wash conditions.
  • Hybridization conditions are based on the melting temperature (Tm) of the nucleic acid binding complex or probe.
  • Tm melting temperature
  • maximum stringency typically occurs at about Tm-5°C (5° below the Tm of the probe); “high stringency” at about 5-10° below the Tm; “intermediate stringency” at about 10-20° below the Tm of the probe; and “low stringency” at about 20- 25° below the Tm.
  • maximum stringency conditions may be used to identify sequences having strict identity or near-strict identity with the hybridization probe; while high stringency conditions are used to identify sequences having about 80% or more sequence identity with the probe.
  • Moderate and high stringency hybridization conditions are well known in the art (see, for example, Sambrook, et al, 1989, Chapters 9 and 11, and in Ausubel, F.M., et al, 1993, expressly incorporated by reference herein).
  • An example of high stringency conditions includes hybridization at about 42°C in 50% formamide, 5X SSC, 5X Denhardt's solution, 0.5% SDS and 100 ⁇ g/ml denatured carrier DNA followed by washing two times in 2X SSC and 0.5% SDS at room temperature and two additional times in 0.1X SSC and 0.5% SDS at 42°C.
  • recombinant includes reference to a cell or vector, that has been modified by the introduction of a heterologous nucleic acid sequence or that the cell is derived from a cell so modified.
  • recombinant cells express genes that are not found in identical form within the native (non-recombinant) form of the cell or express native genes that are otherwise abnormally expressed, under expressed or not expressed at all as a result of deliberate human intervention.
  • the terms “transformed”, “stably transformed” or “transgenic” with reference to a plant cell means the plant cell has a non-native (heterologous) nucleic acid sequence integrated into its genome which is maintained through two or more generations.
  • expression refers to the process by which a polypeptide is produced based on the nucleic acid sequence of a gene.
  • the process includes both transcription and translation.
  • the term "introduced” in the context of inserting a nucleic acid sequence into a cell means “transfection”, or “transformation” or “transduction” and includes reference to the incorporation of a nucleic acid sequence into a eukaryotic or prokaryotic cell where the nucleic acid sequence may be incorporated into the genome of the cell (for example, chromosome, plasmid, plastid, or mitochondrial DNA), converted into an autonomous replicon, or transiently expressed (for example, transfected mRNA).
  • a "plant cell” refers to any cell derived from a plant, including cells from undifferentiated tissue (e.g., callus) as well as plant seeds, pollen, progagules and embryos.
  • the terms “native” and “wild-type” relative to a given plant trait or phenotype refers to the form in which that trait or phenotype is found in the same variety of plant in nature.
  • the term “modified” regarding a plant trait refers to a change in the phenotype of a transgenic plant relative to a non-transgenic plant, as it is found in nature.
  • Ti refers to the generation of plants from the seed of To plants.
  • the Ti generation is the first set of transformed plants that can be selected by application of a selection agent, e.g., an antibiotic or herbicide, for which the transgenic plant contains the corresponding resistance gene.
  • a selection agent e.g., an antibiotic or herbicide
  • T refers to the generation of plants by self-fertilization of the flowers of Ti plants, previously selected as being transgenic.
  • plant part includes any plant organ or tissue including, without limitation, seeds, embryos, meristematic regions, callus tissue, leaves, roots, shoots, gametophytes, sporophytes, pollen, and microspores.
  • Plant cells can be obtained from any plant organ or tissue and cultures prepared therefrom.
  • the class of plants which can be used in the methods of the present invention is generally as broad as the class of higher plants amenable to transformation techniques, including both monocotyledenous and dicotyledenous plants.
  • transgenic plant includes reference to a plant that comprises within its genome a heterologous polynucleotide.
  • the heterologous polynucleotide is stably integrated within the genome such that the polynucleotide is passed on to successive generations.
  • the heterologous polynucleotide may be integrated into the genome alone or as part of a recombinant expression cassette.
  • Transgenic is used herein to include any cell, cell line, callus, tissue, plant part or plant, the genotype of which has been altered by the presence of heterologous nucleic acid including those transgenics initially so altered as well as those created by sexual crosses or asexual propagation from the initial transgenic.
  • heterologous polynucleotide a plant having within its cells a heterologous polynucleotide is referred to herein as a "transgenic plant".
  • the heterologous polynucleotide can be either stably integrated into the genome, or can be extra-chromosomal.
  • the polynucleotide of the present invention is stably integrated into the genome such that the polynucleotide is passed on to successive generations.
  • the polynucleotide is integrated into the genome alone or as part of a recombinant expression cassette.
  • Transgenic is used herein to include any cell, cell line, callus, tissue, plant part or plant, the genotype of which has been altered by the presence of heterologous nucleic acids including those transgenics initially so altered as well as those created by sexual crosses or asexual reproduction of the initial transgenics.
  • a plant cell, tissue, organ, or plant into which the recombinant DNA constructs containing the expression constructs have been introduced is considered “transformed”, “transfected", or “transgenic”.
  • a transgenic or transformed cell or plant also includes progeny of the cell or plant and progeny produced from a breeding program employing such a transgenic plant as a parent in a cross and exhibiting an altered phenotype resulting from the presence of a recombinant nucleic acid sequence.
  • a plant of the invention will include any plant which has a cell containing a construct with introduced nucleic acid sequences, regardless of whether the sequence was introduced into the directly through transformation means or introduced by generational transfer from a progenitor cell which originally received the construct by direct transformation.
  • GLUTAMINE DUMPER 1 and "GDUl”, as used herein encompass native GLUTAMINE DUMPER 1 (GDUl) nucleic acid and amino acid sequences, homologues, variants and fragments thereof.
  • an "isolated" GDUl nucleic acid molecule is a GDUl nucleic acid molecule that is identified and separated from at least one contaminant nucleic acid molecule with which it is ordinarily associated in the natural source of the GDUl nucleic acid.
  • An isolated GDUl nucleic acid molecule is other than in the form or setting in which it is found in nature.
  • an isolated GDUl nucleic acid molecule includes GDUl nucleic acid molecules contained in cells that ordinarily express GDUl where, for example, the nucleic acid molecule is in a chromosomal location different from that of natural cells.
  • mutant with reference to a polynucleotide sequence or gene differs from the corresponding wild type polynucleotide sequence or gene either in terms of sequence or expression, where the difference contributes to a modified plant phenotype or trait.
  • mutant refers to a plant or plant line which has a modified plant phenotype or trait, where the modified phenotype or trait is associated with the modified expression of a wild type polynucleotide sequence or gene.
  • a "variant" polynucleotide sequence encodes a "variant” amino acid sequence that is altered by one or more amino acids from the reference polypeptide sequence.
  • the variant polynucleotide sequence may encode a variant amino acid sequence having "conservative” or “non-conservative” substitutions.
  • Variant polynucleotides may also encode variant amino acid sequences having amino acid insertions or deletions, or both.
  • the term “phenotype” may be used interchangeably with the term “trait”. The terms refer to a plant characteristic which is readily observable or measurable and results from the interaction of the genetic make-up of the plant with the environment in which it develops. Such a phenotype includes chemical changes in the plant make-up resulting from enhanced gene expression which may or may not result in morphological changes in the plant, but which are measurable using analytical techniques known to those of skill in the art.
  • the term "interesting phenotype" with reference to a plant produced by the methods described herein refers to a readily observable or measurable phenotype demonstrated by a T ⁇ and/or subsequent generation plant, which is not displayed by a plant that has not been so transformed (and/or is not the progeny of a plant that has been so transformed) and represents an improvement in the plant.
  • An “improvement” is a feature that may enhance the utility of a plant species or variety by providing the plant with a unique quality. By unique quality is meant a novel feature or a change to an existing feature of the plant species which is a quantitative change (increase or decrease) or a qualitative change in a given feature or trait.
  • Activation tagging is a process by which a heterologous nucleic acid construct comprising a nucleic acid control sequence, e.g. an enhancer, is inserted into a plant genome.
  • the enhancer sequences act to enhance transcription of a one or more native plant genes (See, e.g., Walden R, et al, 1994; Weigel D et al. 2000).
  • the activation tagging vector pSKI015 (Weigel et al, 2000), which comprises a T-DNA (i.e., the sequence derived from the Ti plasmid of Agrobacterium tumifaciens that are transferred to a plant cell host during Agrobacterium infection), an enhancer element and a selectable marker gene.
  • the enhancer element can result in up-regulation genes in the vicinity of the T-DNA insertion, generally within 5-10 kilobase (kb) of the insertion.
  • plants were exposed to the selective agent in order to specifically recover those plants that expressed the selectable marker and therefore harbored insertions of the activation-tagging vector.
  • Transformed plants were observed for interesting phenotypes, which are generally identified at the Ti, T 2 and/or T 3 generations.
  • interesting phenotypes may be identified based on morphology, a biochemical screen, herbicide tolerance testing, herbicide target identification, fungal or bacterial resistance testing, insect or nematode resistance testing, screening for stress tolerance, such as drought, salt or antibiotic tolerance, and output traits, such as oil, starch, pigment, or vitamin composition.
  • Genomic sequence surrounding the T-DNA insertion is analyzed in order to identify genes responsible for the interesting phenotypes. Genes responsible for causing such phenotypes are identified as attractive targets for manipulation for agriculture, food, ornamental plant, and/or pharmaceutical industries.
  • the phenotype is dominant.
  • the enhanced expression of a given native plant gene or a fragment thereof may result in decreased expression or inactivation of its homologue or another native plant gene, which results in the interesting phenotype.
  • the T-DNA insertion may also result in disruption ("loss-of-function") of a native plant gene, in which case the phenotype is generally recessive.
  • the present invention provides a morphological phenotype, identified in Arabidopsis where T 2 plants were observed as having white flecks at the extremity of the major veins of the leaves. These flecks are water-soluble and seem to be excreted by the plant. Mass Spectrometry and NMR were used to determine that the secretion is primarily glutamine. The phenotype was the same in T 3 Arabidopsis plants and has been designated GLUTAMINE DUMPER 1 ("GDUl").
  • the invention also provides a newly identified and isolated nucleic acid sequence that was identified by analysis of the genomic DNA sequence surrounding the T-DNA insertion correlating with the GDUl phenotype.
  • the open reading frame of the GDUl gene which is specifically overexpressed in plants having the GDUl phenotype, and which is provided in SEQ ID NO: 1.
  • a detailed description of the isolation and characterization of GDUl is set forth in the Examples.
  • the GDUl gene may be used in the development of transgenic plants having a desired phenotype. This may be accomplished using the native GDUl coding sequence, a variant GDUl sequence or a homologue or fragment thereof. This may also be accomplished using the native GDUl regulatory ("promoter") sequence, or a variant, fragment or homologue thereof, used to direct expression of a gene of interest (e.g., heterologous coding sequences) in a plant.
  • promoter native GDUl regulatory
  • a GDUl nucleic acid sequence of this invention may be a DNA or RNA sequence, derived from genomic DNA, cDNA or mRNA.
  • the nucleic acid sequence may be cloned, for example, by isolating genomic DNA from an appropriate source, and amplifying and cloning the sequence of interest using PCR.
  • nucleic acid sequence may be synthesized, either completely or in part, especially where it is desirable to provide plant- preferred sequences.
  • all or a portion of the desired structural gene that portion of the gene which encodes a polypeptide or protein
  • the invention provides a polynucleotide comprising a nucleic acid sequence which encodes or is complementary to a sequence which encodes a GDUl polypeptide having the amino acid sequence presented in SEQ ID NO:2 and a polynucleotide sequence identical over its entire length to the GDUl nucleic acid sequence presented SEQ ID NO:l.
  • the invention also provides the coding sequence for the mature GDUl polypeptide, a variant or fragment thereof, as well as the coding sequence for the mature polypeptide or a fragment thereof in a reading frame with other coding sequences, such as those encoding a leader or secretory sequence, a pre-, pro-, or prepro- protein sequence.
  • An GDUl coding polynucleotide can also include non-coding sequences, including for example, but not limited to, non-coding 5' and 3' sequences, such as the transcribed, untranslated sequences, termination signals, ribosome binding sites, sequences that stabilize mRNA, introns, polyadenylation signals, and additional coding sequence that encodes additional amino acids.
  • non-coding sequences including for example, but not limited to, non-coding 5' and 3' sequences, such as the transcribed, untranslated sequences, termination signals, ribosome binding sites, sequences that stabilize mRNA, introns, polyadenylation signals, and additional coding sequence that encodes additional amino acids.
  • a marker sequence can be included to facilitate the purification of the fused polypeptide.
  • Polynucleotides of the present invention also include polynucleotides comprising a structural gene and the naturally associated sequences that control gene expression.
  • the invention further provides non-coding regulatory sequences associated with GDUl, as presented in SEQ ID NO:3.
  • GDUl promoter sequences are operably linked to heterologous coding sequences (i.e., from genes other than GDUl) and are used in transgenic constructs to direct expression of these sequences in a tissue-specific pattern in a plant.
  • heterologous coding sequences i.e., from genes other than GDUl
  • GDUl polynucleotide and GDUl nucleic acid may encompass coding sequences, non-coding sequences, or both, as will be apparent from the context.
  • an isolated polynucleotide of the invention comprises a GDUl nucleic acid sequence flanked by non- GDUl nucleic acid sequence
  • the total length of the combined polynucleotide is typically less than 25 kb, and usually less than 20kb, or 15 kb, and in some cases less than 10 kb, or 5 kb.
  • GDUl variants can be prepared. GDUl variants can be prepared by introducing appropriate nucleotide changes into the GDUl nucleic acid sequence; by synthesis of the desired GDUl polypeptide or by altering the expression level of the GDUl gene in plants. Those skilled in the art will appreciate that amino acid changes may alter post-translational processing of the GDUl polypeptide, such as changing the number or position of glycosylation sites or altering the membrane anchoring characteristics.
  • preferred GDUl coding sequences include a polynucleotide comprising a nucleic acid sequence which encodes or is complementary to a sequence which encodes a GDUl polypeptide having at least 50%, 60%, 70%, 75%, 80%, 85%, 90%, 95% or more sequence identity to the amino acid sequence presented in SEQ ID NO:2.
  • preferred variants include a GDUl polynucleotide sequence that is at least 50% to 60% identical over its entire length to the GDUl nucleic acid sequence presented as SEQ ID NO:l or SEQ ID NO:3, and nucleic acid sequences that are complementary to such GDUl sequences. More preferable are GDUl polynucleotide sequences comprise a region having at least 70%, 80%, 85%, 90% or 95% or more sequence identity to the GDUl sequence presented as SEQ ID NO: 1.
  • preferred variants include polynucleotides that are be "selectively hybridizable" to the GDUl polynucleotide sequence presented as SEQ ID NO:l or SEQ ID NO:3.
  • Sequence variants also include nucleic acid molecules that encode the same polypeptide as encoded by the GDUl coding polynucleotide sequence described herein.
  • the coding frame of an identified nucleic acid molecules is known, for example by homology to known genes or by extension of the sequence, it is appreciated that as a result of the degeneracy of the genetic code, a number of coding sequences can be produced.
  • the triplet CGT encodes the amino acid arginine.
  • Arginine is alternatively encoded by CGA, CGC, CGG, AGA, and AGG. Therefore it is appreciated that such substitutions in the coding region fall within the sequence variants that are covered by the present invention. Any and all of these sequence variants can be utilized in the same way as described herein for the identified GDUl parent sequence, SEQ ID NO: 1.
  • sequence variants may or may not selectively hybridize to the parent sequence. This would be possible, for example, when the sequence variant includes a different codon for each of the amino acids encoded by the parent nucleotide. Such variants are, nonetheless, specifically contemplated and encompassed by the present invention. In accordance with the present invention, also encompassed are sequences that at least 70% identical to such degeneracy-derived sequence variants.
  • GDUl nucleotide sequence variants are preferably capable of hybridizing to the nucleotide sequences recited herein under conditions of moderately high or high stringency, there are, in some situations, advantages to using variants based on the degeneracy of the code, as described above.
  • codons may be selected to increase the rate at which expression of the peptide occurs in a particular prokaryotic or eukaryotic organism, in accordance with the optimum codon usage dictated by the particular host organism.
  • Variations in the native full-length GDUl nucleic acid sequences described herein may be made, for example, using any of the techniques and guidelines for conservative and non-conservative mutations, as generally known in the art, oligonucleotide-mediated (site-directed) mutagenesis, alanine scanning, and PCR mutagenesis.
  • Site-directed mutagenesis Kelkel TA et al, 1991
  • cassette mutagenesis (Crameri A et al., 1995); restriction selection mutagenesis (Haught C et al, 1994), or other known techniques can be performed on the cloned DNA to produce nucleic acid sequences encoding GDUl variants.
  • the gene sequences associated with the GDUl phenotype may be synthesized, either completely or in part, especially where it is desirable to provide host-preferred sequences.
  • all or a portion of the desired structural gene may be synthesized using codons preferred by a selected host.
  • Host-preferred codons may be determined, for example, from the codons used most frequently in the proteins expressed in a desired host species.
  • a GDUl coding polynucleotide encodes a GDUl polypeptide that retains substantially the same biological function or activity as the mature GDUl polypeptide encoded by the polynucleotide set forth as SEQ ID NO: 1 (i.e. results in a GDUl phenotype when overexpressed in a plant).
  • Variants also include fragments of the GDUl polynucleotides of the invention, which can be used to synthesize a full-length GDUl polynucleotide.
  • Preferred embodiments include polynucleotides encoding polypeptide variants wherein 5 to 10, 1 to 5, 1 to 3, 2, 1 or no amino acid residues of a GDUl polypeptide sequence of the invention are substituted, added or deleted, in any combination. Particularly preferred are substitutions, additions, and deletions that are silent such that they do not alter the properties or activities of the polynucleotide or polypeptide.
  • a nucleotide sequence encoding a GDUl polypeptide can also be used to construct hybridization probes for further genetic analysis. Screening of a cDNA or genomic library with the selected probe may be conducted using standard procedures, such as described in Sambrook et al, 1989). Hybridization conditions, including moderate stringency and high stringency, are provided in Sambrook et al., supra.
  • the probes or portions thereof may also be employed in PCR techniques to generate a pool of sequences for identification of closely related GDUl sequences.
  • GDUl sequences are intended for use as probes, a particular portion of a GDUl encoding sequence, for example a highly conserved portion of the coding sequence may be used.
  • a GDUl nucleotide sequence may be used as a hybridization probe for a cDNA library to isolate genes, for example, those encoding naturally-occurring variants of GDUl from other plant species, which have a desired level of sequence identity to the GDUl nucleotide sequence disclosed in SEQ ID NO:l.
  • Exemplary probes have a length of about 20 to about 50 bases.
  • nucleic acid encoding a GDUl polypeptide may be obtained by screening selected cDNA or genomic libraries using the deduced amino acid sequence disclosed herein, and, if necessary, using conventional primer extension procedures as described in Sambrook et al., supra, to detect GDUl precursors and processing intermediates of mRNA that may not have been reverse-transcribed into cDNA.
  • nucleic acid sequences of this invention may include genomic, cDNA or mRNA sequence.
  • encoding is meant that the sequence corresponds to a particular amino acid sequence either in a sense or anti-sense orientation.
  • extrachromosomal is meant that the sequence is outside of the plant genome of which it is naturally associated.
  • recombinant is meant that the sequence contains a genetically engineered modification through manipulation via mutagenesis, restriction enzymes, and the like.
  • flanking regions may be subjected to resection, mutagenesis, etc.
  • transitions, transversions, deletions, and insertions may be performed on the naturally occurring sequence.
  • the desired form of the GDUl nucleic acid sequence, homologue, variant or fragment thereof may be incorporated into a plant expression vector for transformation of plant cells.
  • the invention provides a GDUl polypeptide, having a native mature or full-length GDUl polypeptide sequence comprising the sequence presented in SEQ ID NO:2.
  • a GDUl polypeptide of the invention can be the mature GDUl polypeptide, part of a fusion protein or a fragment or variant of the GDUl polypeptide sequence presented in SEQ ID NO:2.
  • a GDUl polypeptide of the invention has at least 50% to 60% identity to a GDUl amino acid sequence over its entire length. More preferable are GDUl polypeptide sequences that comprise a region having at least 70%, 80%, 85%, 90% or 95% or more sequence identity to the GDUl polypeptide sequence of SEQ ID NO:2.
  • a fragment is a variant polypeptide that has an amino acid sequence that is entirely the same as part but not all of the amino acid sequence of the previously described polypeptides.
  • Exemplary fragments comprises at least 10, 20, 30, 40, 50, 75, or 100 contiguous amino acids of SEQ ID NO:2.
  • the fragments can be "free-standing" or comprised within a larger polypeptide of which the fragment forms a part or a region, most preferably as a single continuous region.
  • Preferred fragments are biologically active fragments, which are those fragments that mediate activities of the polypeptides of the invention, including those with similar activity or improved activity or with a decreased activity. Also included are those fragments that antigenic or immunogenic in an animal, particularly a human.
  • an GDUl polypeptide comprises a "VIMAG domain,” as further described in the examples, with at least 70%, 80%, 90% or 95% sequence identity to the region defined by amino acids 92-110 of SEQ ID NO:2.
  • GDUl polypeptides of the invention also include polypeptides that vary from the GDUl polypeptide sequence of SEQ ID NO:2. These variants may be substitutional, insertional or deletional variants. The variants typically exhibit the same qualitative biological activity as the naturally occurring analogue, although variants can also be selected which have modified characteristics as further described below.
  • substitution results from the replacement of one or more nucleotides or amino acids by different nucleotides or amino acids, respectively.
  • insertion or “addition” is that change in a nucleotide or amino acid sequence which has resulted in the addition of one or more nucleotides or amino acid residues, respectively, as compared to the naturally occurring sequence.
  • a “deletion” is defined as a change in either nucleotide or amino acid sequence in which one or more nucleotides or amino acid residues, respectively, are absent.
  • Amino acid substitutions are typically of single residues; insertions usually will be on the order of from about 1 to 20 amino acids, although considerably larger insertions may be tolerated. Deletions range from about 1 to about 20 residues, although in some cases deletions may be much larger.
  • substitutions, deletions, insertions or any combination thereof may be used to arrive at a final derivative. Generally these changes are done on a few amino acids to minimize the alteration of the molecule. However, larger changes may be tolerated in certain circumstances.
  • Amino acid substitutions can be the result of replacing one amino acid with another amino acid having similar structural and/or chemical properties, such as the replacement of a leucine with a serine, i.e., conservative amino acid replacements. Insertions or deletions may optionally be in the range of 1 to 5 amino acids.
  • substitutions are generally made in accordance with known "conservative substitutions".
  • a “conservative substitution” refers to the substitution of an amino acid in one class by an amino acid in the same class, where a class is defined by common physicochemical amino acid side chain properties and high substitution frequencies in homologous proteins found in nature (as determined, e.g., by a standard Dayhoff frequency exchange matrix or BLOSUM matrix). (See generally, Doolittle, R.F., 1986.)
  • non-conservative substitution refers to the substitution of an amino acid in one class with an amino acid from another class.
  • GDUl polypeptide variants typically exhibit the same qualitative biological activity as the naturally occurring analogue, although variants also are selected to modify the characteristics of the GDUl polypeptide, as needed. For example, glycosylation sites, and more particularly one or more O-linked or N-linked glycosylation sites may be altered or removed.
  • amino acid changes may alter post- translational processes of the GDUl polypeptide, such as changing the number or position of glycosylation sites or altering the membrane anchoring characteristics.
  • the variations can be made using methods known in the art such as oligonucleotide- mediated (site-directed) mutagenesis, alanine scanning, and PCR mutagenesis.
  • Site-directed mutagenesis [Carter et al. , 1986; Zoller et /., 1987], cassette mutagenesis [Wells et al, 1985], restriction selection mutagenesis [Wells et al, 1986] or other known techniques can be performed on the cloned DNA to produce the GDUl polypeptide-encoding variant DNA.
  • GDUl polypeptides are also included within the definition of GDUl polypeptides.
  • probe or degenerate PCR primer sequences may be used to find other related polypeptides.
  • Useful probe or primer sequences may be designed to all or part of the GDUl polypeptide sequence, or to sequences outside the coding region.
  • preferred PCR primers are from about 15 to about 35 nucleotides in length, with from about 20 to about 30 being preferred, and may contain inosine as needed.
  • the conditions for the PCR reaction are generally known in the art.
  • GDUl polypeptides that are a mature protein and may comprise additional amino or carboxyl-terminal amino acids, or amino acids within the mature polypeptide (for example, when the mature form of the protein has more than one polypeptide chain).
  • Such sequences can, for example, play a role in the processing of a protein from a precursor to a mature form, allow protein transport, shorten or lengthen protein half-life, or facilitate manipulation of the protein in assays or production.
  • cellular enzymes can be used to remove any additional amino acids from the mature protein. [See, e.g. , Creighton, TE, 1983].
  • overexpression of a GDUl polypeptide or variant thereof is associated with the GDUl phenotype.
  • the present invention further provides anti-GDUl polypeptide antibodies.
  • the antibodies may be polyclonal, monoclonal, humanized, bispecific or heteroconjugate antibodies. Methods of preparing polyclonal antibodies are known to the skilled artisan. Such polyclonal antibodies can be produced in a mammal, for example, following one or more injections of an immunizing agent, and preferably, an adjuvant. Typically, the immunizing agent and/or adjuvant will be injected into the mammal by a series of subcutaneous or intraperitoneal injections.
  • the immunizing agent may include a GDUl polypeptide or a fusion protein thereof. It may be useful to conjugate the antigen to a protein known to be immunogenic in the mammal being immunized.
  • the immunization protocol may be determined by one skilled in the art based on standard protocols or by routine experimentation.
  • the anti-GDUl polypeptide antibodies may be monoclonal antibodies.
  • Monoclonal antibodies may be produced by hybridomas, wherein a mouse, hamster, or other appropriate host animal, is immunized with an immunizing agent to elicit lymphocytes that produce or are capable of producing antibodies that will specifically bind to the immunizing agent [Kohler et ⁇ l, 1975].
  • Monoclonal antibodies may also be made by recombinant DNA methods, such as those described in U.S. Patent No. 4,816,567.
  • the anti-GDUl polypeptide antibodies of the invention may further comprise humanized antibodies or human antibodies.
  • humanized antibody refers to humanized forms of non-human (e.g., murine) antibodies that are chimeric antibodies, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab') 2 or other antigen- binding partial sequences of antibodies) which contain some portion of the sequence derived from non-human antibody.
  • Methods for humanizing non-human antibodies are well known in the art, as further detailed in Jones et ⁇ l, 1986; Riechmann et ⁇ l, 1988; and Verhoeyen et ⁇ l, 1988.
  • Methods for producing human antibodies are also known in the art. See, e.g., Jakobovits, A, et ⁇ l, 1995; Jakobovits, A, 1995.
  • anti-GDUl polyclonal antibodies are used for gene isolation.
  • Western blot analysis may be conducted to determine that GDUl or a related protein is present in a crude extract of a particular plant species.
  • genes encoding the related protein may be isolated by screening expression libraries representing the particular plant species.
  • Expression libraries can be constructed in a variety of commercially available vectors, including lambda gtll, as described in Sambrook, et al. , 1989.
  • the GDUl promoter which was shown to be a vascular tissue specific promoter, has utility in the specific expression of heterologous transgenes in crop plants, due to its pattern of expression and its regulation.
  • the promoter could be used for the expression of insecticidal proteins that are toxic to aphids in the phloem. Promoters with similar expression patterns have been described (Barker et al., 2000; Lacombe et al., 2000; Truenit and Sauer, 1995).
  • GDUl promoter sequences are operably linked to heterologous coding sequences (i.e., from genes other than GDUl) and are used in transgenic constructs to direct expression of these sequences in a tissue-specific pattern in a plant (the vascular tissue specific expression pattern associated with the GDUl promoter is detailed below).
  • the term "operably linked” refers to the situation where the GDUl promoter and heterologous coding sequences are present on the same DNA molecule and where the GDUl promoter directs expression of these coding sequences.
  • the GDUl promoter is typically upstream of the 5' end of the coding sequences. In some cases, the GDUl promoter sequence is directly fused to the coding sequences.
  • the GDUl promoter and the coding sequences are separated by intervening sequence, which is typically less than 5kb, preferably less than 1 kb, more preferably less than 500 bases, and more preferably less than 100 bases.
  • intervening sequence typically less than 5kb, preferably less than 1 kb, more preferably less than 500 bases, and more preferably less than 100 bases.
  • the entire GDUl promoter sequence presented in SEQ ID NO:3 may be used.
  • a variant or fragment GDUl promoter that retains the ability to direct vascular tissue specific expression may be used.
  • a reporter construct e.g., containing beta-glucuronidase, GFP or other fluorescent proteins, luciferase, beta-galactosidase, etc.
  • these methods are exemplified by the process described in the Examples to characterize the GDUl promoter sequence presented in SEQ ID NO:3.
  • GDUl coding nucleotide and protein sequences there is further utility in the GDUl coding nucleotide and protein sequences, in the modulated expression of the GDUl protein, and in the mutant GDUl plants.
  • the GDUl mutant may be used as a tool to identify genes involved in amino acid metabolism and transport in plants (for instance, using transcriptional profiling and comparing wild type and GDUl mutant plants).
  • the GDUl coding polynucleotides may also be expressed in plants under control of tissue- and/or temporal-specific promoters to modify a plants utilization of amino acids and/or nitrogen.
  • the GDUl gene and/or promoter may be useful in the development of a system for secretion of commercially important transgenic proteins from the hydathodes of plants.
  • the GDUl phenotype and modified GDUl expression, as well as expression directed by the GDUl promoter are generally applicable to any type of plant
  • the methods described herein are generally applicable to all plants. Although activation tagging and gene identification is carried out in Arabidopsis, following identification of a nucleic acid sequence and associated phenotype, the selected gene, a homologue, variant or fragment thereof, may be expressed in any type of plant. In one aspect, the invention is directed to fruit- and vegetable-bearing plants.
  • the invention is directed to the cut flower industry, grain-producing plants, oil-producing plants and nut-producing plants, as well as other crops including, but not limited to, cotton (Gossypium), alfalfa (Medicago sativa), flax (Linum usitatissimum), tobacco (Nicotiana), turfgrass (Poaceae family), and other forage crops.
  • crops including, but not limited to, cotton (Gossypium), alfalfa (Medicago sativa), flax (Linum usitatissimum), tobacco (Nicotiana), turfgrass (Poaceae family), and other forage crops.
  • the constructs can be introduced in a variety of forms including, but not limited to as a strand of DNA, in a plasmid, or in an artificial chromosome.
  • the introduction of the constructs into the target plant cells can be accomplished by a variety of techniques, including, but not limited to Agr ⁇ b ⁇ cte ⁇ -mediated transformation, electroporation, microinjection, microprojectile bombardment calcium-phosphate-DNA co-precipitation or liposome-mediated transformation of a heterologous nucleic acid construct comprising the GDUl coding sequence.
  • the transformation of the plant is preferably permanent, i.e. by integration of the introduced expression constructs into the host plant genome, so that the introduced constructs are passed onto successive plant generations.
  • binary Ti-based vector systems may be used to transfer and confirm the association between enhanced expression of an identified gene with a particular plant trait or phenotype.
  • Standard Agrobacterium binary vectors are known to those of skill in the art and many are commercially available, such as pBI121 (Clontech Laboratories, Palo Alto, CA). The optimal procedure for transformation of plants with Agrobacterium vectors will vary with the type of plant being transformed.
  • Exemplary methods for Agr ⁇ b ⁇ cterw -mediated transformation include transformation of explants of hypocotyl, shoot tip, stem or leaf tissue, derived from sterile seedlings and/or plantlets. Such transformed plants may be reproduced sexually, or by cell or tissue culture.
  • Agrobacterium transformation has been previously described for a large number of different types of plants and methods for such transformation may be found in the scientific literature.
  • a heterologous nucleic acid construct may be made which comprises a nucleic acid sequence associated with the GDUl phenotype, and which encodes the entire protein, or a biologically active portion thereof for transformation of plant cells and generation of transgenic plants.
  • a GDUl nucleic acid sequence or a homologue, variant or fragment thereof may be carried out under the control of a constitutive, inducible or regulatable promoter. In some cases expression of the GDUl nucleic acid sequence or homologue, variant or fragment thereof may regulated in a developmental stage or tissue- associated or tissue-specific manner. Accordingly, expression of the nucleic acid coding sequences described herein may be regulated with respect to the level of expression, the tissue type(s) where expression takes place and/or developmental stage of expression leading to a wide spectrum of applications wherein the expression of a GDUl coding sequence is modulated in a plant.
  • GDUl nucleic acid sequence or homologue, variant or fragment thereof may also be controlled at the level of transcription, by the use of cell type specific promoters or promoter elements in the plant expression vector.
  • promoters useful for heterologous gene expression are available.
  • Exemplary constitutive promoters include the raspberry E4 promoter (U.S. Patent Nos. 5,783,393 and 5,783,394), the 35S CaMV (Jones JD et al, 1992), the CsVMV promoter (Verdaguer B et al, 1998) and the melon actin promoter.
  • Exemplary tissue-specific promoters include the tomato E4 and E8 promoters (U.S. Patent No. 5,859,330) and the tomato 2AU gene promoter (Van Haaren MJJ et al, 1993).
  • GDUl sequences When GDUl sequences are intended for use as probes, a particular portion of a GDUl encoding sequence, for example a highly conserved portion of a coding sequence may be used.
  • exemplary methods for practicing this aspect of the invention include, but are not limited to antisense suppression (Smith, et ⁇ /.,1988); co-suppression (Napoli, et ⁇ /.,1989); ribozymes (PCT Publication WO 97/10328); and combinations of sense and antisense (Waterhouse, et al, 1998).
  • Methods for the suppression of endogenous sequences in a host cell typically employ the transcription or transcription and translation of at least a portion of the sequence to be suppressed. Such sequences may be homologous to coding as well as non-coding regions of the endogenous sequence.
  • GDUl nucleotide sequence it may be desirable to inhibit expression of the GDUl nucleotide sequence. This may be accomplished using procedures generally employed by those of skill in the art together with the GDUl nucleotide sequence provided herein. Standard molecular and genetic tests may be performed to analyze the association between a cloned gene and an observed phenotype. A number of other techniques that are useful for determining (predicting or confirming) the function of a gene or gene product in plants are described below.
  • DNA/RNA analysis DNA taken form a mutant plant may be sequenced to identify the mutation at the nucleotide level.
  • the mutant phenotype may be rescued by overexpressing the wild type (WT) gene.
  • WT wild type
  • the stage- and tissue-specific gene expression patterns in mutant vs. WT lines, for instance, by in situ hybridization, may be determined. Analysis of the methylation status of the gene, especially flanking regulatory regions, may be performed.
  • Other suitable techniques include overexpression, ectopic expression, expression in other plant species and gene knock-out (reverse genetics, targeted knock-out, viral induced gene silencing (VIGS, see Baulcombe D, 1999).
  • microarray analysis also known as expression profiling or transcript profiling, is used to simultaneously measure differences or induced changes in the expression of many different genes.
  • Techniques for microarray analysis are well known in the art (Schena M et al, Science (1995) 270:467-470; Baldwin D et al, 1999; Dangond F, Physiol Genomics (2000) 2:53-58; van Hal NL et al, J Biotechnol (2000) 78:271-280; Richmond T and Somerville S, Curr Opin Plant Biol (2000) 3:108-116).
  • Microarray analysis of individual tagged lines may be carried out, especially those from which genes have been isolated. Such analysis can identify other genes that are coordinately regulated as a consequence of the overexpression of the gene of interest, which may help to place an unknown gene in a particular pathway.
  • Gene Product Analysis Analysis of gene products may include recombinant protein expression, antisera production, immunolocalization, biochemical assays for catalytic or other activity, analysis of phosphorylation status, and analysis of interaction with other proteins via yeast two-hybrid assays.
  • Pathway analysis may include placing a gene or gene product within a particular biochemical or signaling pathway based on its overexpression phenotype or by sequence homology with related genes. Alternatively, analysis may comprise genetic crosses with WT lines and other mutant lines (creating double mutants) to order the gene in a pathway, or determining the effect of a mutation on expression of downstream "reporter" genes in a pathway.
  • EXAMPLE 1 Generation of Plants with a GDUl Phenotype by Transformation with an Activation Tagging Construct A. Agrobacterium vector preparation.
  • Transformed E. coli colonies and cultures containing the pSKI015 activation tagging construct was confirmed by selection on media containing 100 ⁇ g/ml ampicillin. Agrobacterium colonies and cultures were grown in selective media containing 100 ⁇ g/ml carbenicillin. The presence of the pSKI015 construct was verified in colonies by PCRprimers that span the ocs terminator in the BAR selection cassette under the following PCR conditions: 30 cycles of 94°C 30 seconds; 63°C 40 seconds; 72°C 120 seconds. For long-term storage, PCR-positive colonies were grown in selective media, glycerol added to a final concentration of 30% and cultures quick frozen then stored at - 80°C.
  • Liquid cultures were grown at 28°C, on an orbital shaker at 200 rpm, in LBB with Carbenicillin (Cb) at lOOmg/1 to select for the plasmid, with 50mg l Kanamycin (Kan) added to select for the helper plasmid. After 2 days, this small culture was used to inoculate 6-8 liters (L) of LBB with Cb lOOmg/1 and Kan 50mg/l, IL each in 2000ml Erienmeyer flasks. Cultures are placed on a shaker for 2-3 days, checked for cell concentration by evaluating the OD 60 o (visible light at 600nm) using a spectrophotometer with an OD 600 reading for between 1.5- 2.5 preferred.
  • Cb Carbenicillin
  • Kan Kanamycin
  • Arabidopsis plants were grown in Premier HP soil which contains peat moss and perlite, using a minimal amount of N-P-K (171-2-133) fertilizer diluted to 1/10 the strength, with sub-irrigation, as needed and a n 18 hr day length using natural light supplemented by high pressure sodium lamps at a temperature of 20-25° C. Seeds were sown under humidity domes for the first 4-7 days, then transferred to a greenhouse having approximately 70% humidity.
  • Plants began flowering after about 3-4 weeks, with watering and fertilizing continued as needed until a majority of the siliques turned yellow/brown. Then plants were then left to dry out and seed collected by breaking open siliques to release the seed. Seed was stored at room temperature for a few days, then stored at 4°C in an airtight container with desiccant. Plants are monitored for pests and pathogens, particularly, fungus gnats, white flies, and aphids, with pest control applied as needed, e.g., application of Talstar and Azatin for whitefly, thrips and fungus gnats; application of Gnatrol for fungus gnats, biological control (e.g. mites, for gnat larvae) and safer soap.
  • pests and pathogens particularly, fungus gnats, white flies, and aphids
  • pest control applied as needed, e.g., application of Talstar and Azatin for whitefly,
  • Transformation was accomplished via a floral dip method wherein floral tissues were dipped into a solution containing Agrobacterium tumefaciens, 5% sucrose and a surfactant Silwet L-77, as described in Cough, SJ and Bent, AF, 1998.
  • a second dunking was carried out 6 days after removing the humidity domes, as described above. Plants were watered regularly until seeds were mature, at which time watering was stopped.
  • Dry T ⁇ seed was harvested from transformed plants and stored at 4°C in Eppendorf tubes with desiccant. Transformants were selected at the Ti stage by sprinkling Ti seed on a flat, cold treating the flats for 2 to 3 days and spraying plants as soon as they germinated with Finale (Basta, glufosinate ammonium), diluted at 1 : 1000 of an 11.33% solution, followed by subsequent sprayings a day or two apart.
  • non-transgenic seedlings produced chlorotic primary leaves and their hypocotyls dehydrated and collapsed, killing the plant.
  • the survivors were counted and segregation data calculated after the non-transgenic plants had died (within two-three weeks following the sprayings). Survivors were transplanted into individual pots for further monitoring.
  • the ACTTAGTM line, W000012492 (“GDUl”) was originally identified as having white flecks at the extremity of the major veins of the leaves in the T 2 plants. These flecks are water-soluble and seem to be excreted by the plant.
  • the same phenotype was identified in the T 3 plants. In the T 3 generation, 75% of plants displayed the secretion phenotype and were resistant to BASTA, the selective agent. The 25% that appeared wild-type were also susceptible to BASTA, the selective agent, indicating that they did not have the T-DNA insertion. These results indicated that the GDUl phenotype was dominant.
  • NucleonTM PhytoPureTM systems from AmershamTM were used to extract genomic DNA from T 2 plant tissue. l.Og of fresh plant tissue was ground in liquid nitrogen to yield a free flowing powder, then transferred to a 15-ml polypropylene centrifuge tube. 4.6 ml of Reagent 1 from the Nucleon Phytopure kit was added with thorough mixing followed by addition of 1.5 ml of Reagent 2 from the Nucleon Phytopure kit, with inversion until a homogeneous mixture was obtained. The mixture was incubated at 65°C in a shaking water bath for 10 minutes, and placed on ice for 20 minutes.
  • the samples were removed from the ice, 2 ml of - 20°C chloroform added, mixed and centrifuged at 1300g for 10 minutes. The supernatant was transferred to a fresh tube, 2 ml cold chloroform, 200 ⁇ l of Nucleon PhytoPure DNA extraction resin suspension added and the mixture shaken on a tilt shaker for 10 minutes at room temperature, then centrifuged at 1300g for 10 minutes.
  • the upper DNA containing phase was transferred to a fresh tube, centrifuged at 9500 rpm for 30 minutes to clarify the transferred aqueous phase, an equal volume of cold isopropanol added, the tube gently inverted until the DNA precipitated and then it was pelleted by centrifugation, washed with cold 70% ethanol, pelleted again and air-dried.
  • DNA extracted from plants with the GDUl phenotype (GDUl) and from wild type plants (COL-0) was PCR amplified using primers that amplify a 35S enhancer sequence, and primers that amplify a region of the pBluescript vector sequence in pSKI015. Amplification using primers that span the 35S enhancer region resulted in a ladder of products, indicating that all four copies of the 35S enhancer were present.
  • Amplification using primers to the pBluescript vector was done primarily to detect the T-DNA insert(s) in transformed plants and has been optimized for the following conditions: annealing temp: 57°C, 30 cycles [94°C, 30sec; 57°C, 1 min; 72°C, 1 rnin] 1 cycle [72°C, 7 min].
  • the ACTTAGTM line, W000012492 (GDUl) was confirmed as positive for the presence of 35S enhancer and pSKI015 vector sequences by PCR, and as positive for Southern hybridization verifying genomic integration of the ACTTAG DNA and showing the presence of a single T-DNA insertion in the transgenic line.
  • Genomic DNA from T 2 plants of insertion line, W000012492 (“GDUl”) was digested by restriction enzymes. The restriction fragments were self-ligated and used to transform the E. coli cells. The plasmids that contained a full-length pBluescript vector, 4X 35S enhancer, and a right border T-DNA flanking genomic DNA fragment were rescued.
  • genomic DNA was digested with Pst I, EcoR I, BamH I, Spe I, Hind UI and/or Xho I under standard reaction conditions at 37°C overnight. Briefly, each restriction enzyme was heat inactivated at 65°C for 20 minutes, phenol/ chloroform and chloroform isoamyl (24:1) extracted once with each, and the ligation reactions were set up containing the reagents set forth below and left at 16°C overnight.
  • the ligated DNA was precipitated, resuspended in ddH2O and used to transform E. coli SURE cells (Stratagene) via electroporation, with 10 pg of pUC18 plasmid as a control.
  • the transformation mixture was spread on two LB-plates containing 100 ⁇ g/ml ampicillin and incubated overnight at 37°C. Single colonies were picked from the plates and used to start a 5 ml LB-ampicillin broth culture from each colony by culturing overnight at 37°C. The plasmid was extracted from the culture and restriction digested to confirm the size of genomic insertion.
  • the left ends of plasmids rescued were sequenced across the right T-DNA border.
  • the rescued sequence was subjected to a basic BLASTN search using the sequence comparison program available at the www.ncbi.nlm.nih.gov/BLAST website and a search of the Arabidopsis Information Resource (TAIR) database, available at the Arabidopsis.org website, which revealed sequence identity to BAC clones MUL3 and F28M20.
  • TAIR Arabidopsis Information Resource
  • the ACTTAG vector joins chromosome 5 and the translocated part of chromosome 4.
  • the insertion placed the 35S enhancer approximately 3 kb from the 5' end of GDUl coding transcriptional start site.
  • the scheme of the insertion is depicted in Figure 1A.
  • Semi -quantitative, real-time RT-PCR was performed using forward and reverse primers specific to the gene designated GDUl (SEQ ID NO:l), to other genes in the vicinity, and to constitutively expressed control genes SKOR (GI 3641844) and AKT2 (GI 1100897). The results showed that plants displaying the GDUl phenotype over-expressed the mRNA for Gene 1, indicating the enhanced expression of Gene 1 is correlated with the GDUl phenotype.
  • Quantitative real-time PCR was performed and showed that ninety-fold more GDUl mRNA accumulated in the mutant than in the wild type.
  • the amino acid sequence predicted from the GDUl nucleic acid sequence was determined using GENSCAN and is presented in SEQ ID NO:2.
  • a second ACTTAG mutant allele was isolated.
  • the insertion apparently resulted in a duplication of the BAR gene and 35S enhancer, such that the right T-DNA is flanked by the 35S enhancer and the BAR gene respectively at its left and right borders.
  • the 35S enhancer closer to GDUl was approximately 1.8kb from the 5' end of GDUl coding transcriptional start site.
  • GDUl belongs to a conserved family of proteins.
  • sequence analysis software including BLAST (Altschul, et al, 1997) and GENSCAN (Burge and Karlin, 1997) analyses.
  • SEQ ID NOs: 4, 6, 8, 10 and 12 Corresponding protein sequences are presented in, respectively, SEQ ED NOs: 5, 7, 9, 11 and 13.
  • SEQ ID NO:4 comprises a novel cDNA sequence, of which a partial sequence was identified in GI 18420860.
  • SEQ ID NO: 11 comprises the corresponding novel protein sequence, of which a partial sequence was identified as GI 15238648.
  • SEQ ID NOs: 12 and 13 provide novel nucleic and protein sequences. Overall percent identities with the GDUl protein are shown in Table 1. Multiple sequence alignment using CLUSTAL (Thompson JD et al, 1994,) showed that two domains are particularly conserved: a hydrophobic domain, which is thought to be a transmembrane domain, and a region with a highly conserved "VIMAG" sequence motif, designated the VIMAG domain. The amino acid positions of the conserved regions are indicated in Table 1. Table 1. Sequence characteristics of GDUl homologs.
  • GDUl and related proteins are transmembrane proteins, with the carboxy-terminal region, including the VIMAG domain, positioned in the cytoplasm.
  • BLAST searches of EST databases identified similar sequences present in plants (e.g.
  • the dominant inheritance pattern of the GDUl phenotype was confirmed through genetic analysis. Thirty-six progeny of a plant designated A000267007 were analyzed for segregation of the GDUl phenotype, and their seeds were analyzed for GDUl phenotype and sensitivity BASTA (BASTA S ). Plant A000267007 was determined to be heterozygous for the mutant GDUl, as 10 out of 36 progeny plants displayed a wild type phenotype. The phenotypes of the 36 progeny plants were recorded, and their seeds were sown on petri dishes containing BASTA (15 mg/1) to determine the ratio of BASTA- resistant seedling. Seeds were also sown in soil for phenotypic characterization.
  • the 36 plants were divided in three groups (A, B, C) according to the segregation of the BASTA resistance and the GDUl phenotype (Table 2).
  • Group A comprised plants with GDUl phenotype, which were determined to be homozygous for BASTA resistance and for the GDUl mutation.
  • Group B comprised plants with the GDUl phenotype, which were determined to be heterozygous for BASTA resistance and for the GDUl mutation.
  • Group C comprised plants with wild type appearance, which did not carry BASTA resistance.
  • the BASTA resistance locus was always associated with the GDUl mutation; no plants having mixed phenotypes (BASTA resistant and wild type appearance or BASTA susceptible and GDUl phenotype) were found.
  • the material secreted by the GDUl was collected and subjected to Mass Spectrometry and NMR study.
  • the content in cations (K + , Mg 2+ , Ca 2+ and Na + ) was determined by flame spectrometry (MS).
  • the composition in carbon, hydrogen and nitrogen was determined by combustion. Analyses determined the following composition (% by weight):
  • the NMR study produced a clean proton spectrum, with a peak shape and integration suggesting a single compound.
  • the peak pattern and shift were identical to proton spectrum of pure glutamine.
  • the spectrum is also composed of numerous other peaks (> 146), with many of these multiples of 146 (2 x, 3 x, 4 x).
  • the secretion is composed of 18 % Na and 82 % glutamine
  • the theoretical C, N and H percentages should be, respectively, 34.4, 5.6 and 15.7. This fits correctly with the observed composition of the secretion.
  • the differences observed for C and H may come from contaminating organic molecules collected on the leaf (such as dust and/or soil particles).
  • the secretion was thus determined to be composed of about 18 % Na and 82 % glutamine.
  • GDUl gene presented in SEQ ID NO: 1 a genomic fragment comprising the GDUl cDNA (the gene has no introns), as provided in SEQ ID NO:l, was over-expressed in wild type Arabidopsis plants.
  • the GDUl fragment was amplified by PCR and cloned into Kpnl and BamHl sites in the binary vector pAG2370.
  • pAG2370 whose sequence is provided in SEQ ID NO:4, comprises the vector backbone from the vector pBIN19
  • the GDUl fragment was inserted between the CsVMV promoter region, proximal to the 5' end of genomic fragment, and the Nos termination sequence, proximal to the 3' end of the genomic fragment.
  • the construct was designated pAG2370-G C/i.
  • Another GDUl construct was generated from the plasmid obtained from plasmid rescue using the Pst I enzyme.
  • a fragment was obtained by complete digest with Kpnl and partial digest with HindHI, which contained the pBS backbone, the 4 x 35S enhancer, and a genomic DNA that extended from, upstream, the border of the T-DNA insertion to, downstream, a Malawi site distal to the 3' end of the GDUl gene.
  • This fragment was cloned into the pAG2370 vector from which the CsVMV promoter had been removed by digestion with Kpn I and Hind UJ, and the resulting construct was designated pAG2370- GDU1-VR.
  • Wild type Arabidopsis (COL-0) plants were transformed with ⁇ AG2310-GDU1 and pAG2370-GZ)cH-PR using standard vacuum infiltration methods. All infiltrated seeds were plated in selective media (approximately 60 ⁇ g/ml kanamycin), and kanamycin-resistant Ti plants were transplanted to flats.
  • the transformation process generated 31 independent To events from ⁇ AG2310-GDU1 , 23 of which showed the GDUl phenotype of white leaf secretions. From transformation with pAG2370-G.DC/i -PR, 25 independent To lines were generated, 8 of which showed the GDUl phenotype. Control constructs that did not contain the GDUl gene had no noticeable phenotype. These results showed that GDUl overexpression is responsible of the GDUl phenotype.
  • the expression pattern directed by the GDUl promoter was characterized using a beta-glucuronidase (GUS) reporter construct.
  • GUS beta-glucuronidase
  • the GDUl promoter comprising approximately 2.7 kb of the genomic sequence directly upstream of the GDUl start codon, was amplified by PCR and ligated to the E. coli GUS coding sequence.
  • the fusion product was inserted into the pPZ212 vector backbone (GI 506657; Hajdukiewicz et al., 1994).
  • the sequence of the resulting reporter construct, designated GDU1-GOS is presented in SEQ ID NO:5.
  • GDUl -GUS was introduced into Arabidopsis by Agrob cte ⁇ wm-mediated transformation and GUS expression was analyzed in several Ti and T 2 plants.
  • GUS expression analysis revealed vascular-specific activity of the GDUl promoter.
  • All lines studied showed GUS expression in the root stele, and, more precisely in the stele of the root up to the region where the cells are dividing, co-localizing with the phloem. All lines also showed strong GUS expression in the phloem and xylem pole of the stem vascular bundle. Over 50% of the lines studied showed GDUl promoter activity in the aerial part vasculature and/or in the hydathodes. GDUl promoter activity was sometimes observed in the bases of the developing siliques, and in the vasculature of the anthers.
  • a profile of GDUl expression was also obtained by performing RT-PCR of GDUl coding sequences using mRNA derived from various organs of the plant. The expression observed using the reporter construct was in accordance with the pattern determined from RT-PCR. GDUl mRNA was shown to be most abundant in the roots and in the stem.
  • 3w shoot of 3-week old plantlet ; 4w: shoots of 4-week old plant; rL: rosette leaves ; cL: caulinary leaves ; ySt: young stem ; oSt: old stem ; FI: flowers ; ySi: young siliques ; oSi: old siliques; R: roots. Values are expressed as a percentage of the maximum expression level.
  • the amino acid content of mutants plants shown to overexpress GDUl was analyzed.
  • Free amino acid was extracted from ground plant tissues by either ethanol or NaCitrate extraction.
  • ethanol extraction 500 mg of fresh leaves were ground in 2 ml 100 % ethanol, and centrifuged. The pellets were successively resuspended in 80% ethanol, 60% ethanol, and water, and were centrifuged following each resuspension. The supernatants from successive resuspension steps were pooled and assayed for amino acid content.
  • 500 mg of leaves are ground in Li citrate, pH 2.2, centrifuged and the supernatant was assayed for amino acids.
  • each of these amino acids is greater in the mutant than in the wild type, and the amounts of serine, threonine and glutamate are more than doubled.

Abstract

The present invention is directed to a novel plant phenotype, designated GLUTAMINE DUMPER 1 (GDU1), a nucleic acid sequence expressed in plants demonstrating the GDUI1 phenotype and the corresponding amino acid sequence. Also provided are plant cells and plants that exhibit modified GDUI1 expression. The invention is further directed to the GDUI1 promoter, which is capable of directing vascular tissue specific expression of heterologous coding sequences.

Description

IDENTIFICATION AND CHARACTERIZATION OF AN GLUTAMINE DUMPER 1 PHENOTYPE (GDU1) IN ARABIDOPSIS
REFERENCE TO RELATED APPLICATIONS
This application claims priority to US provisional application US 60/356,962 filed 2/13/2002. The content of the prior application is hereby incorporated in its entirety.
FIELD OF THE INVENTION The present invention relates to a plant phenotype, designated GLUTAMINE
DUMPER (GDU1), together with DNA and polypeptide sequences associated with the same.
BACKGROUND OF THE INVENTION The traditional methods for gene discovery, including chemical mutagenesis, irradiation and T-DNA insertion, used to screen loss of function mutants have limitations. Mutagenic methods such as these rarely identify genes that are redundant in the genome, and gene characterization is time-consuming and laborious.
Activation tagging is a method by which genes are randomly and strongly up- regulated on a genome-wide scale, after which specific phenotypes are screened for and selected. Isolation of mutants by activation tagging has been reported (Hayashi et al, 1992). An activation T-DNA tagging construct was used to activate genes in tobacco cell culture allowing the cells to grow in the absence of plant growth hormones (Walden et al. , 1994). Genes have been isolated from plant genomic sequences flanking the T-DNA tag and putatively assigned to plant growth hormone responses. (See, e.g., Miklashevichs et al. 1997, Harling et al, 1997; Walden et. al., 1994; and Schell et al, 1998, which discusses related studies.)
The first gene characterized in Arabidopsis using activation tagging was a gene encoding the histone kinase involved in the cytokinin signal transduction pathway. The gene sequence was isolated from plant genomic DNA by plasmid rescue and the role of the gene, CAT/1, in cytokinin responses in plants was confirmed by re-introduction into Arabidopsis (Kakimoto, 1996). This was followed by reports of several dominant mutants such as TINY, LHY and SHI using a similar approach along with the Ds transposable element (Wilson et al, 1996, Schaffer et al, 1998, Fridborg et al, 1999). In a more recent report, activation T-DNA tagging and screening plants for an early flowering phenotype led to the isolation of the FT gene (Kardailsky et al., 1999).
The potential application of activation tagging as a high through put technology for gene discovery has been demonstrated based on screening of several dominant mutant genes involved in photoreceptor, brassinosteroid, gibberellin and flowering signal pathways, as well as disease resistance. (See, e.g., Weigel et al., 2000, Christensen et al, 1998; Kardailsky et al, 1999).
SUMMARY OF THE INVENTION. The invention provides nucleic acid and amino acid sequences associated with the
GLUTAMINE DUMPER 1 ("GDU1") phenotype in plants, identified for its altered glutamine secretion relative to wild-type Arabidopsis plants.
In one aspect, the invention provides one or more isolated GDU1 nucleic acid sequences comprising a nucleic acid sequence that encodes or is complementary to a sequence that encodes a GDU1 polypeptide having at least 70%, 80%, 90% or more sequence identity to the amino acid sequence presented as SEQ ID NO:2.
In another aspect, the polynucleotide comprises a nucleic acid sequence that hybridizes, under high, medium, or low stringency conditions to the nucleic acid sequence, or fragment thereof, presented as SEQ ID NO:l, or the complement thereof. In a related aspect, expression of one or more of such GDU1 polynucleo tides in a plant is associated with the GDU1 phenotype.
The invention further provides plant transformation vectors, plant cells, plant parts and plants comprising a GDU1 nucleic acid sequence.
Expression of such a GDU1 nucleic acid sequence in a plant is associated with the GDU1 phenotype, presented as a glutamine secretion phenotype.
The expression of a GDU1 nucleic acid sequence may be modified in ornamental plants, fruit and vegetable-producing plants, grain-producing plants, oil-producing plants and nut-producing plants, as well as other crop plants, resulting in the GDU1 phenotype. In a further aspect the invention provides a method of modifying the glutamine secretion in a plant by introducing a GDU1 nucleic acid sequence into plant progenitor cells and growing the cells to produce a transgenic plant. BRIEF DESCRIPTION OF THE FIGURES
Figure 1A is a schematic representation of a first T-DNA insertion in the genome, which depicts the probable genomic rearrangement. Portions of chromosome 4 and chromosome 5 are indicated "Ch 4" and "Ch5," respectively. Figure IB is a schematic representation of a second T-DNA insertion in the genome, which was likely associated with a duplication of some of the sequences in the ACTTAG vector. In both Figures 1A and IB, the black arrow represents the gene designated GDUl. White arrows designate other predicted genes. The directions of the arrows designate direction of transcription. Within the region representing the ACTTAG vector, the gray arrow designates the BAR gene, and the striped boxes represent the multimerized 35S enhancer.
DETAILED DESCRIPTION OF THE INVENTION I. Definitions.
Unless otherwise indicated, all technical and scientific terms used herein have the same meaning as they would to one skilled in the art of the present invention.
Practitioners are particularly directed to Sambrook et al, 1989, and Ausubel FM et al., 1993, for definitions and terms of the art. It is to be understood that this invention is not limited to the particular methodology, protocols, and reagents described, as these may vary. All publications cited herein, and listed below immediately after the examples, are expressly incorporated herein by reference for the purpose of describing and disclosing compositions and methodologies that might be used in connection with the invention.
As used herein, the term "vector" refers to a nucleic acid construct designed for transfer between different host cells. An "expression vector" refers to a vector that has the ability to incorporate and express heterologous DNA fragments in a foreign cell. Many prokaryotic and eukaryotic expression vectors are commercially available. Selection of appropriate expression vectors is within the knowledge of those having skill in the art.
A "heterologous" nucleic acid construct or sequence has a portion of the sequence which is not native to the plant cell in which it is expressed. Heterologous, with respect to a control sequence refers to a control sequence (i.e. promoter or enhancer) that does not function in nature to regulate the same gene the expression of which it is currently regulating. Generally, heterologous nucleic acid sequences are not endogenous to the cell or part of the genome in which they are present, and have been added to the cell, by infection, transfection, microinjection, electroporation, or the like. A "heterologous" nucleic acid construct may contain a control sequence/DNA coding sequence combination that is the same as, or different from a control sequence/DNA coding sequence combination found in the native plant.
As used herein, the term "gene" means the segment of DNA involved in producing a polypeptide chain, which may or may not include regions preceding and following the coding region, e.g. 5' untranslated (5' UTR) or "leader" sequences and 3' UTR or "trailer" sequences, as well as intervening sequences (introns) between individual coding segments (exons).
As used herein, "percent (%) sequence identity" with respect to a subject sequence, or a specified portion of a subject sequence, is defined as the percentage of nucleotides or amino acids in the candidate derivative sequence identical with the nucleotides or amino acids in the subject sequence (or specified portion thereof), after aligning the sequences and introducing gaps, if necessary to achieve the maximum percent sequence identity, as generated by the program WU-BLAST-2.0al9 (Altschul et al, J. Mol. Biol. (1997) 215:403-410; blast.wustl.edu/blast/README.html website) with all the search parameters set to default values. The HSP S and HSP S2 parameters are dynamic values and are established by the program itself depending upon the composition of the particular sequence and composition of the particular database against which the sequence of interest is being searched. A % identity value is determined by the number of matching identical nucleotides or amino acids divided by the sequence length for which the percent identity is being reported. "Percent (%) amino acid sequence similarity" is determined by doing the same calculation as for determining % amino acid sequence identity, but including conservative amino acid substitutions in addition to identical amino acids in the computation. The term "% homology" is used interchangeably herein with the term "% identity."
A nucleic acid sequence is considered to be "selectively hybridizable" to a reference nucleic acid sequence if the two sequences specifically hybridize to one another under moderate to high stringency hybridization and wash conditions. Hybridization conditions are based on the melting temperature (Tm) of the nucleic acid binding complex or probe. For example, "maximum stringency" typically occurs at about Tm-5°C (5° below the Tm of the probe); "high stringency" at about 5-10° below the Tm; "intermediate stringency" at about 10-20° below the Tm of the probe; and "low stringency" at about 20- 25° below the Tm. Functionally, maximum stringency conditions may be used to identify sequences having strict identity or near-strict identity with the hybridization probe; while high stringency conditions are used to identify sequences having about 80% or more sequence identity with the probe.
Moderate and high stringency hybridization conditions are well known in the art (see, for example, Sambrook, et al, 1989, Chapters 9 and 11, and in Ausubel, F.M., et al, 1993, expressly incorporated by reference herein). An example of high stringency conditions includes hybridization at about 42°C in 50% formamide, 5X SSC, 5X Denhardt's solution, 0.5% SDS and 100 μg/ml denatured carrier DNA followed by washing two times in 2X SSC and 0.5% SDS at room temperature and two additional times in 0.1X SSC and 0.5% SDS at 42°C. As used herein, "recombinant" includes reference to a cell or vector, that has been modified by the introduction of a heterologous nucleic acid sequence or that the cell is derived from a cell so modified. Thus, for example, recombinant cells express genes that are not found in identical form within the native (non-recombinant) form of the cell or express native genes that are otherwise abnormally expressed, under expressed or not expressed at all as a result of deliberate human intervention.
As used herein, the terms "transformed", "stably transformed" or "transgenic" with reference to a plant cell means the plant cell has a non-native (heterologous) nucleic acid sequence integrated into its genome which is maintained through two or more generations.
As used herein, the term "expression" refers to the process by which a polypeptide is produced based on the nucleic acid sequence of a gene. The process includes both transcription and translation.
The term "introduced" in the context of inserting a nucleic acid sequence into a cell, means "transfection", or "transformation" or "transduction" and includes reference to the incorporation of a nucleic acid sequence into a eukaryotic or prokaryotic cell where the nucleic acid sequence may be incorporated into the genome of the cell (for example, chromosome, plasmid, plastid, or mitochondrial DNA), converted into an autonomous replicon, or transiently expressed (for example, transfected mRNA).
As used herein, a "plant cell" refers to any cell derived from a plant, including cells from undifferentiated tissue (e.g., callus) as well as plant seeds, pollen, progagules and embryos.
As used herein, the terms "native" and "wild-type" relative to a given plant trait or phenotype refers to the form in which that trait or phenotype is found in the same variety of plant in nature. As used herein, the term "modified" regarding a plant trait, refers to a change in the phenotype of a transgenic plant relative to a non-transgenic plant, as it is found in nature.
As used herein, the term "Ti" refers to the generation of plants from the seed of To plants. The Ti generation is the first set of transformed plants that can be selected by application of a selection agent, e.g., an antibiotic or herbicide, for which the transgenic plant contains the corresponding resistance gene.
As used herein, the term "T " refers to the generation of plants by self-fertilization of the flowers of Ti plants, previously selected as being transgenic.
As used herein, the term "plant part" includes any plant organ or tissue including, without limitation, seeds, embryos, meristematic regions, callus tissue, leaves, roots, shoots, gametophytes, sporophytes, pollen, and microspores. Plant cells can be obtained from any plant organ or tissue and cultures prepared therefrom. The class of plants which can be used in the methods of the present invention is generally as broad as the class of higher plants amenable to transformation techniques, including both monocotyledenous and dicotyledenous plants.
As used herein, "transgenic plant" includes reference to a plant that comprises within its genome a heterologous polynucleotide. Generally, the heterologous polynucleotide is stably integrated within the genome such that the polynucleotide is passed on to successive generations. The heterologous polynucleotide may be integrated into the genome alone or as part of a recombinant expression cassette. "Transgenic" is used herein to include any cell, cell line, callus, tissue, plant part or plant, the genotype of which has been altered by the presence of heterologous nucleic acid including those transgenics initially so altered as well as those created by sexual crosses or asexual propagation from the initial transgenic. Thus a plant having within its cells a heterologous polynucleotide is referred to herein as a "transgenic plant". The heterologous polynucleotide can be either stably integrated into the genome, or can be extra-chromosomal. Preferably, the polynucleotide of the present invention is stably integrated into the genome such that the polynucleotide is passed on to successive generations. The polynucleotide is integrated into the genome alone or as part of a recombinant expression cassette. "Transgenic" is used herein to include any cell, cell line, callus, tissue, plant part or plant, the genotype of which has been altered by the presence of heterologous nucleic acids including those transgenics initially so altered as well as those created by sexual crosses or asexual reproduction of the initial transgenics. A plant cell, tissue, organ, or plant into which the recombinant DNA constructs containing the expression constructs have been introduced is considered "transformed", "transfected", or "transgenic". A transgenic or transformed cell or plant also includes progeny of the cell or plant and progeny produced from a breeding program employing such a transgenic plant as a parent in a cross and exhibiting an altered phenotype resulting from the presence of a recombinant nucleic acid sequence. Hence, a plant of the invention will include any plant which has a cell containing a construct with introduced nucleic acid sequences, regardless of whether the sequence was introduced into the directly through transformation means or introduced by generational transfer from a progenitor cell which originally received the construct by direct transformation.
The terms " GLUTAMINE DUMPER 1 " and "GDUl", as used herein encompass native GLUTAMINE DUMPER 1 (GDUl) nucleic acid and amino acid sequences, homologues, variants and fragments thereof.
An "isolated" GDUl nucleic acid molecule is a GDUl nucleic acid molecule that is identified and separated from at least one contaminant nucleic acid molecule with which it is ordinarily associated in the natural source of the GDUl nucleic acid. An isolated GDUl nucleic acid molecule is other than in the form or setting in which it is found in nature. However, an isolated GDUl nucleic acid molecule includes GDUl nucleic acid molecules contained in cells that ordinarily express GDUl where, for example, the nucleic acid molecule is in a chromosomal location different from that of natural cells.
As used herein, the term "mutant" with reference to a polynucleotide sequence or gene differs from the corresponding wild type polynucleotide sequence or gene either in terms of sequence or expression, where the difference contributes to a modified plant phenotype or trait. Relative to a plant or plant line, the term "mutant" refers to a plant or plant line which has a modified plant phenotype or trait, where the modified phenotype or trait is associated with the modified expression of a wild type polynucleotide sequence or gene.
Generally, a "variant" polynucleotide sequence encodes a "variant" amino acid sequence that is altered by one or more amino acids from the reference polypeptide sequence. The variant polynucleotide sequence may encode a variant amino acid sequence having "conservative" or "non-conservative" substitutions. Variant polynucleotides may also encode variant amino acid sequences having amino acid insertions or deletions, or both. As used herein, the term "phenotype" may be used interchangeably with the term "trait". The terms refer to a plant characteristic which is readily observable or measurable and results from the interaction of the genetic make-up of the plant with the environment in which it develops. Such a phenotype includes chemical changes in the plant make-up resulting from enhanced gene expression which may or may not result in morphological changes in the plant, but which are measurable using analytical techniques known to those of skill in the art.
As used herein, the term "interesting phenotype" with reference to a plant produced by the methods described herein refers to a readily observable or measurable phenotype demonstrated by a T\ and/or subsequent generation plant, which is not displayed by a plant that has not been so transformed (and/or is not the progeny of a plant that has been so transformed) and represents an improvement in the plant. An "improvement" is a feature that may enhance the utility of a plant species or variety by providing the plant with a unique quality. By unique quality is meant a novel feature or a change to an existing feature of the plant species which is a quantitative change (increase or decrease) or a qualitative change in a given feature or trait.
II. The Identified GDUl Phenotype and Gene.
The gene and phenotype of this invention were identified in a large-scale screen using activation tagging. Activation tagging is a process by which a heterologous nucleic acid construct comprising a nucleic acid control sequence, e.g. an enhancer, is inserted into a plant genome. The enhancer sequences act to enhance transcription of a one or more native plant genes (See, e.g., Walden R, et al, 1994; Weigel D et al. 2000).
Briefly, a large number of Arabidopsis plants were transformed with the activation tagging vector pSKI015 (Weigel et al, 2000), which comprises a T-DNA (i.e., the sequence derived from the Ti plasmid of Agrobacterium tumifaciens that are transferred to a plant cell host during Agrobacterium infection), an enhancer element and a selectable marker gene. Following random insertion of pSKI015 into the genome of transformed plants, the enhancer element can result in up-regulation genes in the vicinity of the T-DNA insertion, generally within 5-10 kilobase (kb) of the insertion. In the Ti generation, plants were exposed to the selective agent in order to specifically recover those plants that expressed the selectable marker and therefore harbored insertions of the activation-tagging vector. Transformed plants were observed for interesting phenotypes, which are generally identified at the Ti, T2 and/or T3 generations. Interesting phenotypes may be identified based on morphology, a biochemical screen, herbicide tolerance testing, herbicide target identification, fungal or bacterial resistance testing, insect or nematode resistance testing, screening for stress tolerance, such as drought, salt or antibiotic tolerance, and output traits, such as oil, starch, pigment, or vitamin composition. Genomic sequence surrounding the T-DNA insertion is analyzed in order to identify genes responsible for the interesting phenotypes. Genes responsible for causing such phenotypes are identified as attractive targets for manipulation for agriculture, food, ornamental plant, and/or pharmaceutical industries.
It will be appreciated that in most cases when a modified phenotype results from the enhanced expression of a tagged gene, the phenotype is dominant. In some cases, the enhanced expression of a given native plant gene or a fragment thereof may result in decreased expression or inactivation of its homologue or another native plant gene, which results in the interesting phenotype. The T-DNA insertion may also result in disruption ("loss-of-function") of a native plant gene, in which case the phenotype is generally recessive.
The present invention provides a morphological phenotype, identified in Arabidopsis where T2 plants were observed as having white flecks at the extremity of the major veins of the leaves. These flecks are water-soluble and seem to be excreted by the plant. Mass Spectrometry and NMR were used to determine that the secretion is primarily glutamine. The phenotype was the same in T3 Arabidopsis plants and has been designated GLUTAMINE DUMPER 1 ("GDUl").
The invention also provides a newly identified and isolated nucleic acid sequence that was identified by analysis of the genomic DNA sequence surrounding the T-DNA insertion correlating with the GDUl phenotype. In particular, we have identified and characterized the open reading frame of the GDUl gene, which is specifically overexpressed in plants having the GDUl phenotype, and which is provided in SEQ ID NO: 1. A detailed description of the isolation and characterization of GDUl is set forth in the Examples. We have further identified and characterized regulatory sequences upstream of the open reading frame, which have been found to direct expression of downstream genes in plant vascular tissues. III. Compositions of the Invention
A. GDUl Nucleic acids
The GDUl gene may be used in the development of transgenic plants having a desired phenotype. This may be accomplished using the native GDUl coding sequence, a variant GDUl sequence or a homologue or fragment thereof. This may also be accomplished using the native GDUl regulatory ("promoter") sequence, or a variant, fragment or homologue thereof, used to direct expression of a gene of interest (e.g., heterologous coding sequences) in a plant.
A GDUl nucleic acid sequence of this invention may be a DNA or RNA sequence, derived from genomic DNA, cDNA or mRNA. The nucleic acid sequence may be cloned, for example, by isolating genomic DNA from an appropriate source, and amplifying and cloning the sequence of interest using PCR. Alternatively, nucleic acid sequence may be synthesized, either completely or in part, especially where it is desirable to provide plant- preferred sequences. Thus, all or a portion of the desired structural gene (that portion of the gene which encodes a polypeptide or protein) may be synthesized using codons preferred by a selected host.
The invention provides a polynucleotide comprising a nucleic acid sequence which encodes or is complementary to a sequence which encodes a GDUl polypeptide having the amino acid sequence presented in SEQ ID NO:2 and a polynucleotide sequence identical over its entire length to the GDUl nucleic acid sequence presented SEQ ID NO:l. The invention also provides the coding sequence for the mature GDUl polypeptide, a variant or fragment thereof, as well as the coding sequence for the mature polypeptide or a fragment thereof in a reading frame with other coding sequences, such as those encoding a leader or secretory sequence, a pre-, pro-, or prepro- protein sequence. An GDUl coding polynucleotide can also include non-coding sequences, including for example, but not limited to, non-coding 5' and 3' sequences, such as the transcribed, untranslated sequences, termination signals, ribosome binding sites, sequences that stabilize mRNA, introns, polyadenylation signals, and additional coding sequence that encodes additional amino acids. For example, a marker sequence can be included to facilitate the purification of the fused polypeptide. Polynucleotides of the present invention also include polynucleotides comprising a structural gene and the naturally associated sequences that control gene expression.
The invention further provides non-coding regulatory sequences associated with GDUl, as presented in SEQ ID NO:3. In a preferred embodiment, GDUl promoter sequences are operably linked to heterologous coding sequences (i.e., from genes other than GDUl) and are used in transgenic constructs to direct expression of these sequences in a tissue-specific pattern in a plant. As used hereinafter, the terms "GDUl polynucleotide" and "GDUl nucleic acid" may encompass coding sequences, non-coding sequences, or both, as will be apparent from the context.
When an isolated polynucleotide of the invention comprises a GDUl nucleic acid sequence flanked by non- GDUl nucleic acid sequence, the total length of the combined polynucleotide is typically less than 25 kb, and usually less than 20kb, or 15 kb, and in some cases less than 10 kb, or 5 kb. In addition to the GDUl nucleic acid and corresponding polypeptide sequences described herein, it is contemplated that GDUl variants can be prepared. GDUl variants can be prepared by introducing appropriate nucleotide changes into the GDUl nucleic acid sequence; by synthesis of the desired GDUl polypeptide or by altering the expression level of the GDUl gene in plants. Those skilled in the art will appreciate that amino acid changes may alter post-translational processing of the GDUl polypeptide, such as changing the number or position of glycosylation sites or altering the membrane anchoring characteristics.
In one aspect, preferred GDUl coding sequences include a polynucleotide comprising a nucleic acid sequence which encodes or is complementary to a sequence which encodes a GDUl polypeptide having at least 50%, 60%, 70%, 75%, 80%, 85%, 90%, 95% or more sequence identity to the amino acid sequence presented in SEQ ID NO:2.
In another aspect, preferred variants include a GDUl polynucleotide sequence that is at least 50% to 60% identical over its entire length to the GDUl nucleic acid sequence presented as SEQ ID NO:l or SEQ ID NO:3, and nucleic acid sequences that are complementary to such GDUl sequences. More preferable are GDUl polynucleotide sequences comprise a region having at least 70%, 80%, 85%, 90% or 95% or more sequence identity to the GDUl sequence presented as SEQ ID NO: 1.
In a related aspect, preferred variants include polynucleotides that are be "selectively hybridizable" to the GDUl polynucleotide sequence presented as SEQ ID NO:l or SEQ ID NO:3.
Sequence variants also include nucleic acid molecules that encode the same polypeptide as encoded by the GDUl coding polynucleotide sequence described herein. Thus, where the coding frame of an identified nucleic acid molecules is known, for example by homology to known genes or by extension of the sequence, it is appreciated that as a result of the degeneracy of the genetic code, a number of coding sequences can be produced. For example, the triplet CGT encodes the amino acid arginine. Arginine is alternatively encoded by CGA, CGC, CGG, AGA, and AGG. Therefore it is appreciated that such substitutions in the coding region fall within the sequence variants that are covered by the present invention. Any and all of these sequence variants can be utilized in the same way as described herein for the identified GDUl parent sequence, SEQ ID NO: 1.
It is further appreciated that such sequence variants may or may not selectively hybridize to the parent sequence. This would be possible, for example, when the sequence variant includes a different codon for each of the amino acids encoded by the parent nucleotide. Such variants are, nonetheless, specifically contemplated and encompassed by the present invention. In accordance with the present invention, also encompassed are sequences that at least 70% identical to such degeneracy-derived sequence variants.
Although GDUl nucleotide sequence variants are preferably capable of hybridizing to the nucleotide sequences recited herein under conditions of moderately high or high stringency, there are, in some situations, advantages to using variants based on the degeneracy of the code, as described above. For example, codons may be selected to increase the rate at which expression of the peptide occurs in a particular prokaryotic or eukaryotic organism, in accordance with the optimum codon usage dictated by the particular host organism. Alternatively, it may be desirable to produce RNA having longer half lives than the mRNA produced by the recited sequences.
Variations in the native full-length GDUl nucleic acid sequences described herein, may be made, for example, using any of the techniques and guidelines for conservative and non-conservative mutations, as generally known in the art, oligonucleotide-mediated (site-directed) mutagenesis, alanine scanning, and PCR mutagenesis. Site-directed mutagenesis (Kunkel TA et al, 1991); cassette mutagenesis (Crameri A et al., 1995); restriction selection mutagenesis (Haught C et al, 1994), or other known techniques can be performed on the cloned DNA to produce nucleic acid sequences encoding GDUl variants. It is contemplated that the gene sequences associated with the GDUl phenotype may be synthesized, either completely or in part, especially where it is desirable to provide host-preferred sequences. Thus, all or a portion of the desired structural gene (that portion of the gene which encodes the protein) may be synthesized using codons preferred by a selected host. Host-preferred codons may be determined, for example, from the codons used most frequently in the proteins expressed in a desired host species.
It is preferred that a GDUl coding polynucleotide encodes a GDUl polypeptide that retains substantially the same biological function or activity as the mature GDUl polypeptide encoded by the polynucleotide set forth as SEQ ID NO: 1 (i.e. results in a GDUl phenotype when overexpressed in a plant).
Variants also include fragments of the GDUl polynucleotides of the invention, which can be used to synthesize a full-length GDUl polynucleotide. Preferred embodiments include polynucleotides encoding polypeptide variants wherein 5 to 10, 1 to 5, 1 to 3, 2, 1 or no amino acid residues of a GDUl polypeptide sequence of the invention are substituted, added or deleted, in any combination. Particularly preferred are substitutions, additions, and deletions that are silent such that they do not alter the properties or activities of the polynucleotide or polypeptide.
A nucleotide sequence encoding a GDUl polypeptide can also be used to construct hybridization probes for further genetic analysis. Screening of a cDNA or genomic library with the selected probe may be conducted using standard procedures, such as described in Sambrook et al, 1989). Hybridization conditions, including moderate stringency and high stringency, are provided in Sambrook et al., supra.
The probes or portions thereof may also be employed in PCR techniques to generate a pool of sequences for identification of closely related GDUl sequences. When GDUl sequences are intended for use as probes, a particular portion of a GDUl encoding sequence, for example a highly conserved portion of the coding sequence may be used.
For example, a GDUl nucleotide sequence may be used as a hybridization probe for a cDNA library to isolate genes, for example, those encoding naturally-occurring variants of GDUl from other plant species, which have a desired level of sequence identity to the GDUl nucleotide sequence disclosed in SEQ ID NO:l. Exemplary probes have a length of about 20 to about 50 bases.
In another exemplary approach, a nucleic acid encoding a GDUl polypeptide may be obtained by screening selected cDNA or genomic libraries using the deduced amino acid sequence disclosed herein, and, if necessary, using conventional primer extension procedures as described in Sambrook et al., supra, to detect GDUl precursors and processing intermediates of mRNA that may not have been reverse-transcribed into cDNA. As discussed above, nucleic acid sequences of this invention may include genomic, cDNA or mRNA sequence. By "encoding" is meant that the sequence corresponds to a particular amino acid sequence either in a sense or anti-sense orientation. By "extrachromosomal" is meant that the sequence is outside of the plant genome of which it is naturally associated. By "recombinant" is meant that the sequence contains a genetically engineered modification through manipulation via mutagenesis, restriction enzymes, and the like.
Once the desired form of a GDUl nucleic acid sequence, homologue, variant or fragment thereof, is obtained, it may be modified in a variety of ways. Where the sequence involves non-coding flanking regions, the flanking regions may be subjected to resection, mutagenesis, etc. Thus, transitions, transversions, deletions, and insertions may be performed on the naturally occurring sequence.
With or without such modification, the desired form of the GDUl nucleic acid sequence, homologue, variant or fragment thereof, may be incorporated into a plant expression vector for transformation of plant cells.
B. GDUl Polvpeptides
In one preferred embodiment, the invention provides a GDUl polypeptide, having a native mature or full-length GDUl polypeptide sequence comprising the sequence presented in SEQ ID NO:2. A GDUl polypeptide of the invention can be the mature GDUl polypeptide, part of a fusion protein or a fragment or variant of the GDUl polypeptide sequence presented in SEQ ID NO:2.
Ordinarily, a GDUl polypeptide of the invention has at least 50% to 60% identity to a GDUl amino acid sequence over its entire length. More preferable are GDUl polypeptide sequences that comprise a region having at least 70%, 80%, 85%, 90% or 95% or more sequence identity to the GDUl polypeptide sequence of SEQ ID NO:2.
Fragments and variants of the GDUl polypeptide sequence of SEQ ID NO:2, are also considered to be a part of the invention. A fragment is a variant polypeptide that has an amino acid sequence that is entirely the same as part but not all of the amino acid sequence of the previously described polypeptides. Exemplary fragments comprises at least 10, 20, 30, 40, 50, 75, or 100 contiguous amino acids of SEQ ID NO:2. The fragments can be "free-standing" or comprised within a larger polypeptide of which the fragment forms a part or a region, most preferably as a single continuous region. Preferred fragments are biologically active fragments, which are those fragments that mediate activities of the polypeptides of the invention, including those with similar activity or improved activity or with a decreased activity. Also included are those fragments that antigenic or immunogenic in an animal, particularly a human.
In one embodiment, an GDUl polypeptide comprises a "VIMAG domain," as further described in the examples, with at least 70%, 80%, 90% or 95% sequence identity to the region defined by amino acids 92-110 of SEQ ID NO:2.
GDUl polypeptides of the invention also include polypeptides that vary from the GDUl polypeptide sequence of SEQ ID NO:2. These variants may be substitutional, insertional or deletional variants. The variants typically exhibit the same qualitative biological activity as the naturally occurring analogue, although variants can also be selected which have modified characteristics as further described below.
A "substitution" results from the replacement of one or more nucleotides or amino acids by different nucleotides or amino acids, respectively.
An "insertion" or "addition" is that change in a nucleotide or amino acid sequence which has resulted in the addition of one or more nucleotides or amino acid residues, respectively, as compared to the naturally occurring sequence.
A "deletion" is defined as a change in either nucleotide or amino acid sequence in which one or more nucleotides or amino acid residues, respectively, are absent.
Amino acid substitutions are typically of single residues; insertions usually will be on the order of from about 1 to 20 amino acids, although considerably larger insertions may be tolerated. Deletions range from about 1 to about 20 residues, although in some cases deletions may be much larger.
Substitutions, deletions, insertions or any combination thereof may be used to arrive at a final derivative. Generally these changes are done on a few amino acids to minimize the alteration of the molecule. However, larger changes may be tolerated in certain circumstances.
Amino acid substitutions can be the result of replacing one amino acid with another amino acid having similar structural and/or chemical properties, such as the replacement of a leucine with a serine, i.e., conservative amino acid replacements. Insertions or deletions may optionally be in the range of 1 to 5 amino acids.
Substitutions are generally made in accordance with known "conservative substitutions". A "conservative substitution" refers to the substitution of an amino acid in one class by an amino acid in the same class, where a class is defined by common physicochemical amino acid side chain properties and high substitution frequencies in homologous proteins found in nature (as determined, e.g., by a standard Dayhoff frequency exchange matrix or BLOSUM matrix). (See generally, Doolittle, R.F., 1986.)
A "non-conservative substitution" refers to the substitution of an amino acid in one class with an amino acid from another class. GDUl polypeptide variants typically exhibit the same qualitative biological activity as the naturally occurring analogue, although variants also are selected to modify the characteristics of the GDUl polypeptide, as needed. For example, glycosylation sites, and more particularly one or more O-linked or N-linked glycosylation sites may be altered or removed. Those skilled in the art will appreciate that amino acid changes may alter post- translational processes of the GDUl polypeptide, such as changing the number or position of glycosylation sites or altering the membrane anchoring characteristics.
The variations can be made using methods known in the art such as oligonucleotide- mediated (site-directed) mutagenesis, alanine scanning, and PCR mutagenesis. Site-directed mutagenesis [Carter et al. , 1986; Zoller et /., 1987], cassette mutagenesis [Wells et al, 1985], restriction selection mutagenesis [Wells et al, 1986] or other known techniques can be performed on the cloned DNA to produce the GDUl polypeptide-encoding variant DNA.
Also included within the definition of GDUl polypeptides are other related GDUl polypeptides. Thus, probe or degenerate PCR primer sequences may be used to find other related polypeptides. Useful probe or primer sequences may be designed to all or part of the GDUl polypeptide sequence, or to sequences outside the coding region. As is generally known in the art, preferred PCR primers are from about 15 to about 35 nucleotides in length, with from about 20 to about 30 being preferred, and may contain inosine as needed. The conditions for the PCR reaction are generally known in the art.
Covalent modifications of GDUl polypeptides are also included within the scope of this invention. For example, the invention provides GDUl polypeptides that are a mature protein and may comprise additional amino or carboxyl-terminal amino acids, or amino acids within the mature polypeptide (for example, when the mature form of the protein has more than one polypeptide chain). Such sequences can, for example, play a role in the processing of a protein from a precursor to a mature form, allow protein transport, shorten or lengthen protein half-life, or facilitate manipulation of the protein in assays or production. It is contemplated that cellular enzymes can be used to remove any additional amino acids from the mature protein. [See, e.g. , Creighton, TE, 1983].
In a preferred embodiment, overexpression of a GDUl polypeptide or variant thereof is associated with the GDUl phenotype. C. Antibodies.
The present invention further provides anti-GDUl polypeptide antibodies. The antibodies may be polyclonal, monoclonal, humanized, bispecific or heteroconjugate antibodies. Methods of preparing polyclonal antibodies are known to the skilled artisan. Such polyclonal antibodies can be produced in a mammal, for example, following one or more injections of an immunizing agent, and preferably, an adjuvant. Typically, the immunizing agent and/or adjuvant will be injected into the mammal by a series of subcutaneous or intraperitoneal injections. The immunizing agent may include a GDUl polypeptide or a fusion protein thereof. It may be useful to conjugate the antigen to a protein known to be immunogenic in the mammal being immunized. The immunization protocol may be determined by one skilled in the art based on standard protocols or by routine experimentation.
Alternatively, the anti-GDUl polypeptide antibodies may be monoclonal antibodies. Monoclonal antibodies may be produced by hybridomas, wherein a mouse, hamster, or other appropriate host animal, is immunized with an immunizing agent to elicit lymphocytes that produce or are capable of producing antibodies that will specifically bind to the immunizing agent [Kohler et αl, 1975]. Monoclonal antibodies may also be made by recombinant DNA methods, such as those described in U.S. Patent No. 4,816,567. The anti-GDUl polypeptide antibodies of the invention may further comprise humanized antibodies or human antibodies. The term "humanized antibody" refers to humanized forms of non-human (e.g., murine) antibodies that are chimeric antibodies, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab')2 or other antigen- binding partial sequences of antibodies) which contain some portion of the sequence derived from non-human antibody. Methods for humanizing non-human antibodies are well known in the art, as further detailed in Jones et αl, 1986; Riechmann et αl, 1988; and Verhoeyen et αl, 1988. Methods for producing human antibodies are also known in the art. See, e.g., Jakobovits, A, et αl, 1995; Jakobovits, A, 1995.
In one exemplary approach, anti-GDUl polyclonal antibodies are used for gene isolation. Western blot analysis may be conducted to determine that GDUl or a related protein is present in a crude extract of a particular plant species. When reactivity is observed, genes encoding the related protein may be isolated by screening expression libraries representing the particular plant species. Expression libraries can be constructed in a variety of commercially available vectors, including lambda gtll, as described in Sambrook, et al. , 1989.
IV. Utility Of the GDUl Promoter, Coding Sequences, and Phenotype The GDUl promoter, which was shown to be a vascular tissue specific promoter, has utility in the specific expression of heterologous transgenes in crop plants, due to its pattern of expression and its regulation. For instance, the promoter could be used for the expression of insecticidal proteins that are toxic to aphids in the phloem. Promoters with similar expression patterns have been described (Barker et al., 2000; Lacombe et al., 2000; Truenit and Sauer, 1995).
In a preferred embodiment, GDUl promoter sequences are operably linked to heterologous coding sequences (i.e., from genes other than GDUl) and are used in transgenic constructs to direct expression of these sequences in a tissue-specific pattern in a plant (the vascular tissue specific expression pattern associated with the GDUl promoter is detailed below). The term "operably linked" refers to the situation where the GDUl promoter and heterologous coding sequences are present on the same DNA molecule and where the GDUl promoter directs expression of these coding sequences. The GDUl promoter is typically upstream of the 5' end of the coding sequences. In some cases, the GDUl promoter sequence is directly fused to the coding sequences. In other cases, the GDUl promoter and the coding sequences are separated by intervening sequence, which is typically less than 5kb, preferably less than 1 kb, more preferably less than 500 bases, and more preferably less than 100 bases. In some cases, the entire GDUl promoter sequence presented in SEQ ID NO:3 may be used. In other cases, a variant or fragment GDUl promoter that retains the ability to direct vascular tissue specific expression may be used. Methods of using a reporter construct (e.g., containing beta-glucuronidase, GFP or other fluorescent proteins, luciferase, beta-galactosidase, etc.) to characterize the expression pattern directed by a promoter sequence are well known in the art (e.g., Truenit and Sauer, 1995); these methods are exemplified by the process described in the Examples to characterize the GDUl promoter sequence presented in SEQ ID NO:3. There is further utility in the GDUl coding nucleotide and protein sequences, in the modulated expression of the GDUl protein, and in the mutant GDUl plants. The GDUl mutant may be used as a tool to identify genes involved in amino acid metabolism and transport in plants (for instance, using transcriptional profiling and comparing wild type and GDUl mutant plants). The GDUl coding polynucleotides may also be expressed in plants under control of tissue- and/or temporal-specific promoters to modify a plants utilization of amino acids and/or nitrogen. The GDUl gene and/or promoter may be useful in the development of a system for secretion of commercially important transgenic proteins from the hydathodes of plants. In practicing the invention, the GDUl phenotype and modified GDUl expression, as well as expression directed by the GDUl promoter, are generally applicable to any type of plant
The methods described herein are generally applicable to all plants. Although activation tagging and gene identification is carried out in Arabidopsis, following identification of a nucleic acid sequence and associated phenotype, the selected gene, a homologue, variant or fragment thereof, may be expressed in any type of plant. In one aspect, the invention is directed to fruit- and vegetable-bearing plants. In a related aspect, the invention is directed to the cut flower industry, grain-producing plants, oil-producing plants and nut-producing plants, as well as other crops including, but not limited to, cotton (Gossypium), alfalfa (Medicago sativa), flax (Linum usitatissimum), tobacco (Nicotiana), turfgrass (Poaceae family), and other forage crops.
The skilled artisan will recognize that a wide variety of transformation techniques exist in the art, and new techniques are continually becoming available. Any technique that is suitable for the target host plant can be employed within the scope of the present invention. For example, the constructs can be introduced in a variety of forms including, but not limited to as a strand of DNA, in a plasmid, or in an artificial chromosome. The introduction of the constructs into the target plant cells can be accomplished by a variety of techniques, including, but not limited to Agrøbαcteπα -mediated transformation, electroporation, microinjection, microprojectile bombardment calcium-phosphate-DNA co-precipitation or liposome-mediated transformation of a heterologous nucleic acid construct comprising the GDUl coding sequence. The transformation of the plant is preferably permanent, i.e. by integration of the introduced expression constructs into the host plant genome, so that the introduced constructs are passed onto successive plant generations. In one embodiment, binary Ti-based vector systems may be used to transfer and confirm the association between enhanced expression of an identified gene with a particular plant trait or phenotype. Standard Agrobacterium binary vectors are known to those of skill in the art and many are commercially available, such as pBI121 (Clontech Laboratories, Palo Alto, CA). The optimal procedure for transformation of plants with Agrobacterium vectors will vary with the type of plant being transformed. Exemplary methods for Agrøbαcterw -mediated transformation include transformation of explants of hypocotyl, shoot tip, stem or leaf tissue, derived from sterile seedlings and/or plantlets. Such transformed plants may be reproduced sexually, or by cell or tissue culture.
Agrobacterium transformation has been previously described for a large number of different types of plants and methods for such transformation may be found in the scientific literature.
Depending upon the intended use, a heterologous nucleic acid construct may be made which comprises a nucleic acid sequence associated with the GDUl phenotype, and which encodes the entire protein, or a biologically active portion thereof for transformation of plant cells and generation of transgenic plants.
The expression of a GDUl nucleic acid sequence or a homologue, variant or fragment thereof may be carried out under the control of a constitutive, inducible or regulatable promoter. In some cases expression of the GDUl nucleic acid sequence or homologue, variant or fragment thereof may regulated in a developmental stage or tissue- associated or tissue-specific manner. Accordingly, expression of the nucleic acid coding sequences described herein may be regulated with respect to the level of expression, the tissue type(s) where expression takes place and/or developmental stage of expression leading to a wide spectrum of applications wherein the expression of a GDUl coding sequence is modulated in a plant.
Strong promoters with enhancers may result in a high level of expression. When a low level of basal activity is desired, a weak promoter may be a better choice. Expression of GDUl nucleic acid sequence or homologue, variant or fragment thereof may also be controlled at the level of transcription, by the use of cell type specific promoters or promoter elements in the plant expression vector.
Numerous promoters useful for heterologous gene expression are available. Exemplary constitutive promoters include the raspberry E4 promoter (U.S. Patent Nos. 5,783,393 and 5,783,394), the 35S CaMV (Jones JD et al, 1992), the CsVMV promoter (Verdaguer B et al, 1998) and the melon actin promoter. Exemplary tissue-specific promoters include the tomato E4 and E8 promoters (U.S. Patent No. 5,859,330) and the tomato 2AU gene promoter (Van Haaren MJJ et al, 1993). When GDUl sequences are intended for use as probes, a particular portion of a GDUl encoding sequence, for example a highly conserved portion of a coding sequence may be used.
In yet another aspect, in some cases it may be desirable to inhibit the expression of endogenous GDUl sequences in a host cell. Exemplary methods for practicing this aspect of the invention include, but are not limited to antisense suppression (Smith, et α/.,1988); co-suppression (Napoli, et α/.,1989); ribozymes (PCT Publication WO 97/10328); and combinations of sense and antisense (Waterhouse, et al, 1998). Methods for the suppression of endogenous sequences in a host cell typically employ the transcription or transcription and translation of at least a portion of the sequence to be suppressed. Such sequences may be homologous to coding as well as non-coding regions of the endogenous sequence. In some cases, it may be desirable to inhibit expression of the GDUl nucleotide sequence. This may be accomplished using procedures generally employed by those of skill in the art together with the GDUl nucleotide sequence provided herein. Standard molecular and genetic tests may be performed to analyze the association between a cloned gene and an observed phenotype. A number of other techniques that are useful for determining (predicting or confirming) the function of a gene or gene product in plants are described below.
1. DNA/RNA analysis DNA taken form a mutant plant may be sequenced to identify the mutation at the nucleotide level. The mutant phenotype may be rescued by overexpressing the wild type (WT) gene. The stage- and tissue-specific gene expression patterns in mutant vs. WT lines, for instance, by in situ hybridization, may be determined. Analysis of the methylation status of the gene, especially flanking regulatory regions, may be performed. Other suitable techniques include overexpression, ectopic expression, expression in other plant species and gene knock-out (reverse genetics, targeted knock-out, viral induced gene silencing (VIGS, see Baulcombe D, 1999).
In a preferred application, microarray analysis, also known as expression profiling or transcript profiling, is used to simultaneously measure differences or induced changes in the expression of many different genes. Techniques for microarray analysis are well known in the art (Schena M et al, Science (1995) 270:467-470; Baldwin D et al, 1999; Dangond F, Physiol Genomics (2000) 2:53-58; van Hal NL et al, J Biotechnol (2000) 78:271-280; Richmond T and Somerville S, Curr Opin Plant Biol (2000) 3:108-116). Microarray analysis of individual tagged lines may be carried out, especially those from which genes have been isolated. Such analysis can identify other genes that are coordinately regulated as a consequence of the overexpression of the gene of interest, which may help to place an unknown gene in a particular pathway.
2. Gene Product Analysis Analysis of gene products may include recombinant protein expression, antisera production, immunolocalization, biochemical assays for catalytic or other activity, analysis of phosphorylation status, and analysis of interaction with other proteins via yeast two-hybrid assays.
3. Pathway Analysis Pathway analysis may include placing a gene or gene product within a particular biochemical or signaling pathway based on its overexpression phenotype or by sequence homology with related genes. Alternatively, analysis may comprise genetic crosses with WT lines and other mutant lines (creating double mutants) to order the gene in a pathway, or determining the effect of a mutation on expression of downstream "reporter" genes in a pathway.
4. Other Analyses
Other analyses may be performed to determine or confirm the participation of the isolated gene and its product in a particular metabolic or signaling pathway, and to help determine gene function. All publications, patents and patent applications are herein expressly incorporated by reference in their entirety.
While the invention has been described with reference to specific methods and embodiments, it will be appreciated that various modifications and changes may be made without departing from the invention.
EXAMPLES
EXAMPLE 1 Generation of Plants with a GDUl Phenotype by Transformation with an Activation Tagging Construct A. Agrobacterium vector preparation.
Mutants were generated using the activation tagging "ACTTAG" vector, pSKI015 (GenBank Identifier [GI] 6537289; Weigel D et al, 2000).
Transformed E. coli colonies and cultures containing the pSKI015 activation tagging construct was confirmed by selection on media containing 100 μg/ml ampicillin. Agrobacterium colonies and cultures were grown in selective media containing 100 μg/ml carbenicillin. The presence of the pSKI015 construct was verified in colonies by PCRprimers that span the ocs terminator in the BAR selection cassette under the following PCR conditions: 30 cycles of 94°C 30 seconds; 63°C 40 seconds; 72°C 120 seconds. For long-term storage, PCR-positive colonies were grown in selective media, glycerol added to a final concentration of 30% and cultures quick frozen then stored at - 80°C. For the initiation of dense Agrobacterium cultures for plant transformation, stock cultures were grown in selective media, glycerol added to a final concentration of 30%, and a number of 20 μl aliquots quick frozen in liquid nitrogen and stored at -80°C. pSKI015 was maintained in Agrobacterium GV3101 without the helper plasmid and in Agrobacterium strain EHA 105. An Agrobacterium culture was prepared by starting a 50 ml culture 4-5 days prior to plant transformation (e.g., by "dunking"). Liquid cultures were grown at 28°C, on an orbital shaker at 200 rpm, in LBB with Carbenicillin (Cb) at lOOmg/1 to select for the plasmid, with 50mg l Kanamycin (Kan) added to select for the helper plasmid. After 2 days, this small culture was used to inoculate 6-8 liters (L) of LBB with Cb lOOmg/1 and Kan 50mg/l, IL each in 2000ml Erienmeyer flasks. Cultures are placed on a shaker for 2-3 days, checked for cell concentration by evaluating the OD60o (visible light at 600nm) using a spectrophotometer with an OD600 reading for between 1.5- 2.5 preferred. The cultures were then centrifuged at 4,500 RCF for 15 minutes at room temperature (18-22° C), the bacteria resuspended to approximately OD60o=0.8 with about 500 ml per dunking vessel. Approximately 15-20 liters were prepared for 200 pots, and 20-30 plants dunked at a time.
B. Growth and Selection of Arabidopsis thaliana Plants Arabidopsis plants were grown in Premier HP soil which contains peat moss and perlite, using a minimal amount of N-P-K (171-2-133) fertilizer diluted to 1/10 the strength, with sub-irrigation, as needed and a n 18 hr day length using natural light supplemented by high pressure sodium lamps at a temperature of 20-25° C. Seeds were sown under humidity domes for the first 4-7 days, then transferred to a greenhouse having approximately 70% humidity.
Healthy Arabidopsis plants were grown from wild type Arabidopsis seed, Ecotype: Col-0, under long days (16 hrs) in pots in soil covered with bridal veil or window screen, until they flowered.
Plants began flowering after about 3-4 weeks, with watering and fertilizing continued as needed until a majority of the siliques turned yellow/brown. Then plants were then left to dry out and seed collected by breaking open siliques to release the seed. Seed was stored at room temperature for a few days, then stored at 4°C in an airtight container with desiccant. Plants are monitored for pests and pathogens, particularly, fungus gnats, white flies, and aphids, with pest control applied as needed, e.g., application of Talstar and Azatin for whitefly, thrips and fungus gnats; application of Gnatrol for fungus gnats, biological control (e.g. mites, for gnat larvae) and safer soap.
Transformation was accomplished via a floral dip method wherein floral tissues were dipped into a solution containing Agrobacterium tumefaciens, 5% sucrose and a surfactant Silwet L-77, as described in Cough, SJ and Bent, AF, 1998.
Briefly, above-ground parts of 2,000-3,000 plants were dipped (dunked) into an Agrobacterium culture (GV3101 with pMP90RK, helper plasmid) carrying ACTTAG (binary plasmid pSKI.015), 2-3 days after clipping for 15 minutes, with gentle agitation, then placing plants on their sides under a humidity dome or cover for 16-24 hours to maintain high humidity.
A second dunking was carried out 6 days after removing the humidity domes, as described above. Plants were watered regularly until seeds were mature, at which time watering was stopped.
C. Selection Of Transgenic Plants
Dry T\ seed was harvested from transformed plants and stored at 4°C in Eppendorf tubes with desiccant. Transformants were selected at the Ti stage by sprinkling Ti seed on a flat, cold treating the flats for 2 to 3 days and spraying plants as soon as they germinated with Finale (Basta, glufosinate ammonium), diluted at 1 : 1000 of an 11.33% solution, followed by subsequent sprayings a day or two apart.
Following sprayings, non-transgenic seedlings produced chlorotic primary leaves and their hypocotyls dehydrated and collapsed, killing the plant. The survivors were counted and segregation data calculated after the non-transgenic plants had died (within two-three weeks following the sprayings). Survivors were transplanted into individual pots for further monitoring.
Images of each pool of plants were recorded using a Digital camera (DC-260), and moφhology observations were taken from plants that exhibited an interesting phenotype. These plants were grown until seed was produced, which was collected and sown to yield T2 plants.
The ACTTAG™ line, W000012492 ("GDUl") was originally identified as having white flecks at the extremity of the major veins of the leaves in the T2 plants. These flecks are water-soluble and seem to be excreted by the plant. The same phenotype was identified in the T3 plants. In the T3 generation, 75% of plants displayed the secretion phenotype and were resistant to BASTA, the selective agent. The 25% that appeared wild-type were also susceptible to BASTA, the selective agent, indicating that they did not have the T-DNA insertion. These results indicated that the GDUl phenotype was dominant.
EXAMPLE 2 Characterization of Plants That Exhibit the GDUl Phenotype. A. Genomic DNA Extraction and Analysis.
Nucleon™ PhytoPure™ systems from Amersham™ were used to extract genomic DNA from T2 plant tissue. l.Og of fresh plant tissue was ground in liquid nitrogen to yield a free flowing powder, then transferred to a 15-ml polypropylene centrifuge tube. 4.6 ml of Reagent 1 from the Nucleon Phytopure kit was added with thorough mixing followed by addition of 1.5 ml of Reagent 2 from the Nucleon Phytopure kit, with inversion until a homogeneous mixture was obtained. The mixture was incubated at 65°C in a shaking water bath for 10 minutes, and placed on ice for 20 minutes. The samples were removed from the ice, 2 ml of - 20°C chloroform added, mixed and centrifuged at 1300g for 10 minutes. The supernatant was transferred to a fresh tube, 2 ml cold chloroform, 200 μl of Nucleon PhytoPure DNA extraction resin suspension added and the mixture shaken on a tilt shaker for 10 minutes at room temperature, then centrifuged at 1300g for 10 minutes. Without disturbing the Nucleon resin suspension layer, the upper DNA containing phase was transferred to a fresh tube, centrifuged at 9500 rpm for 30 minutes to clarify the transferred aqueous phase, an equal volume of cold isopropanol added, the tube gently inverted until the DNA precipitated and then it was pelleted by centrifugation, washed with cold 70% ethanol, pelleted again and air-dried.
DNA extracted from plants with the GDUl phenotype (GDUl) and from wild type plants (COL-0) was PCR amplified using primers that amplify a 35S enhancer sequence, and primers that amplify a region of the pBluescript vector sequence in pSKI015. Amplification using primers that span the 35S enhancer region resulted in a ladder of products, indicating that all four copies of the 35S enhancer were present. Amplification using primers to the pBluescript vector was done primarily to detect the T-DNA insert(s) in transformed plants and has been optimized for the following conditions: annealing temp: 57°C, 30 cycles [94°C, 30sec; 57°C, 1 min; 72°C, 1 rnin] 1 cycle [72°C, 7 min]. The ACTTAG™ line, W000012492 (GDUl), was confirmed as positive for the presence of 35S enhancer and pSKI015 vector sequences by PCR, and as positive for Southern hybridization verifying genomic integration of the ACTTAG DNA and showing the presence of a single T-DNA insertion in the transgenic line.
B. Plasmid Rescue
Genomic DNA from T2 plants of insertion line, W000012492 ("GDUl"), was digested by restriction enzymes. The restriction fragments were self-ligated and used to transform the E. coli cells. The plasmids that contained a full-length pBluescript vector, 4X 35S enhancer, and a right border T-DNA flanking genomic DNA fragment were rescued.
More specifically, genomic DNA was digested with Pst I, EcoR I, BamH I, Spe I, Hind UI and/or Xho I under standard reaction conditions at 37°C overnight. Briefly, each restriction enzyme was heat inactivated at 65°C for 20 minutes, phenol/ chloroform and chloroform isoamyl (24:1) extracted once with each, and the ligation reactions were set up containing the reagents set forth below and left at 16°C overnight.
Digested Genomic DNA 40 μl
5X Ligation Buffer 50 μl
Ligase (Gibcol, lU/μl) 10 μl ddH2O 150 μl
The ligated DNA was precipitated, resuspended in ddH2O and used to transform E. coli SURE cells (Stratagene) via electroporation, with 10 pg of pUC18 plasmid as a control.
The transformation mixture was spread on two LB-plates containing 100 μg/ml ampicillin and incubated overnight at 37°C. Single colonies were picked from the plates and used to start a 5 ml LB-ampicillin broth culture from each colony by culturing overnight at 37°C. The plasmid was extracted from the culture and restriction digested to confirm the size of genomic insertion. C. Sequencing Of Rescued Plasmids
The left ends of plasmids rescued were sequenced across the right T-DNA border. The rescued sequence was subjected to a basic BLASTN search using the sequence comparison program available at the www.ncbi.nlm.nih.gov/BLAST website and a search of the Arabidopsis Information Resource (TAIR) database, available at the Arabidopsis.org website, which revealed sequence identity to BAC clones MUL3 and F28M20. These BACs are mapped to chromosomes 5 and 4, respectively. The insertion of the ACTTAG vector probably induced a genomic rearrangement, leading to the reciprocal translocation of portions of chromosomes 4 and 5. The ACTTAG vector joins chromosome 5 and the translocated part of chromosome 4. The insertion placed the 35S enhancer approximately 3 kb from the 5' end of GDUl coding transcriptional start site. The scheme of the insertion is depicted in Figure 1A.
Using GENSCAN, several predicted genes were identified in the vicinity of the GDUl insertion, as depicted in Figure 1A.
RNA was extracted from tissues derived from plants exhibiting the GDUl phenotype and from wild type COL-0 plants. Semi -quantitative, real-time RT-PCR was performed using forward and reverse primers specific to the gene designated GDUl (SEQ ID NO:l), to other genes in the vicinity, and to constitutively expressed control genes SKOR (GI 3641844) and AKT2 (GI 1100897). The results showed that plants displaying the GDUl phenotype over-expressed the mRNA for Gene 1, indicating the enhanced expression of Gene 1 is correlated with the GDUl phenotype. Quantitative real-time PCR was performed and showed that ninety-fold more GDUl mRNA accumulated in the mutant than in the wild type.
The amino acid sequence predicted from the GDUl nucleic acid sequence was determined using GENSCAN and is presented in SEQ ID NO:2. A Basic BLASTP 2.0.11 search using the ncbi.nlm.nih.gov/BLAST website, with the predicted amino acid sequence for GDUl, presented in SEQ ID NO:2, revealed that the GDUl gene encodes a protein previously characterized as a hypothetical protein (GI 3281855) that has homology to five putative Arabidopsis genes, on BAC clones F14M19, MIRl, F27A10, F6A4 and T4A2, as further described below. The BLAST search results suggest that GDUl represents a newly discovered phenotype and function associated with a DNA sequence found in the Arabidopsis BAC clone F28M20. D. Characterization of a second GDUl ACTTAG allele
A second ACTTAG mutant allele was isolated. In this line, as depicted in Figure IB, the insertion apparently resulted in a duplication of the BAR gene and 35S enhancer, such that the right T-DNA is flanked by the 35S enhancer and the BAR gene respectively at its left and right borders. The 35S enhancer closer to GDUl was approximately 1.8kb from the 5' end of GDUl coding transcriptional start site.
The same approach as described above was used to determine the association between the T-DNA insertion corresponding to the second allele and GDUl overexpression. Briefly, RT-PCR using RNA from the second allele showed that expression of GDUl was increased 30-fold in lines comprising this allele. Further experimentation, as described below, was conducted using the original GDUl allele, which causes approximately 90-fold upregulation of GDUl RNA.
EXAMPLE 3 Sequence and Homology Analysis
Using sequence analysis software, including BLAST (Altschul, et al, 1997) and GENSCAN (Burge and Karlin, 1997) analyses, we determined that GDUl belongs to a conserved family of proteins. We identified five related cDNA sequences, presented as SEQ ID NOs: 4, 6, 8, 10 and 12. Corresponding protein sequences are presented in, respectively, SEQ ED NOs: 5, 7, 9, 11 and 13. Of these, the following sequences have been previously identified in GenBank: SEQ ID NO:4 (GI 18416613), SEQ ID NO:5 (GI 18416614), SEQ ID NO:6 (in GI 3702736, complement of 14265-13819; gene_id is MRI1.4), SEQ ID NO:7 (GI 9759265), SEQ ID NO:8 (GI 18400627), and SEQ ID NO:9 (GI 18400628). SEQ ID NO: 10 comprises a novel cDNA sequence, of which a partial sequence was identified in GI 18420860. SEQ ID NO: 11 comprises the corresponding novel protein sequence, of which a partial sequence was identified as GI 15238648. SEQ ID NOs: 12 and 13 provide novel nucleic and protein sequences. Overall percent identities with the GDUl protein are shown in Table 1. Multiple sequence alignment using CLUSTAL (Thompson JD et al, 1994,) showed that two domains are particularly conserved: a hydrophobic domain, which is thought to be a transmembrane domain, and a region with a highly conserved "VIMAG" sequence motif, designated the VIMAG domain. The amino acid positions of the conserved regions are indicated in Table 1. Table 1. Sequence characteristics of GDUl homologs.
Figure imgf000030_0001
Using protein structure prediction software (including TMHMM [Moller et al, 2001] and TmPRED[Hofmann and Stoffel, 1993]), we predicted that GDUl and related proteins are transmembrane proteins, with the carboxy-terminal region, including the VIMAG domain, positioned in the cytoplasm. BLAST searches of EST databases identified similar sequences present in plants (e.g. Medicago truncatula GI 7204832 [48 % identity], Lycopersicon esculentum GI 16239823 [49 % identity], Glycine max GI 7591453 [61 % identity], Lutus japonicus GI 7778290 [58 % identity] and Sorghum bicolor GI 9300482 [55 % identity]) and fungus (GI 16506755 [32 % identity]), but neither in animals nor prokaryotes.
EXAMPLE 4 Confirmation of Phenotype/Genotype Association
The dominant inheritance pattern of the GDUl phenotype was confirmed through genetic analysis. Thirty-six progeny of a plant designated A000267007 were analyzed for segregation of the GDUl phenotype, and their seeds were analyzed for GDUl phenotype and sensitivity BASTA (BASTAS). Plant A000267007 was determined to be heterozygous for the mutant GDUl, as 10 out of 36 progeny plants displayed a wild type phenotype. The phenotypes of the 36 progeny plants were recorded, and their seeds were sown on petri dishes containing BASTA (15 mg/1) to determine the ratio of BASTA- resistant seedling. Seeds were also sown in soil for phenotypic characterization. The 36 plants were divided in three groups (A, B, C) according to the segregation of the BASTA resistance and the GDUl phenotype (Table 2). Group A comprised plants with GDUl phenotype, which were determined to be homozygous for BASTA resistance and for the GDUl mutation. Group B comprised plants with the GDUl phenotype, which were determined to be heterozygous for BASTA resistance and for the GDUl mutation. Group C comprised plants with wild type appearance, which did not carry BASTA resistance. The BASTA resistance locus was always associated with the GDUl mutation; no plants having mixed phenotypes (BASTA resistant and wild type appearance or BASTA susceptible and GDUl phenotype) were found.
The analysis showed that the GDUl mutation was dominant - since the heterozygous plants presented the GDUl phenotype - and was associated with the BASTA resistance gene of the T-DNA.
Table 2. Segregation of GDUl and BASTAR
Progeny
Group Phenotype
(number of plants) of the plants BASTAS (%) GDUl (%)
A (10) GDUl 0 100 B (16) GDUl 24.8 74.7 C (10) WT 100 0
EXAMPLE 5 Analysis of Leaf Secretions
The material secreted by the GDUl was collected and subjected to Mass Spectrometry and NMR study. The content in cations (K+, Mg2+, Ca2+ and Na+) was determined by flame spectrometry (MS). The composition in carbon, hydrogen and nitrogen was determined by combustion. Analyses determined the following composition (% by weight):
Sodium 18.33 %
Calcium 0.16 %
Potassium < 0.054 % Magnesium < 0.022 %
Carbon 35.58 %
Hydrogen 6.75 %
Nitrogen 15.66 %
The NMR study produced a clean proton spectrum, with a peak shape and integration suggesting a single compound. The peak pattern and shift were identical to proton spectrum of pure glutamine. The MS analysis produced a spectrum composed of one major peak (m/z = 146) that fragments into 3 daughter ions. The spectrum is also composed of numerous other peaks (> 146), with many of these multiples of 146 (2 x, 3 x, 4 x). This pattern is similar to the one obtained with pure glutamine (molecular mass = 146). Assuming that the secretion is composed of 18 % Na and 82 % glutamine, the theoretical C, N and H percentages should be, respectively, 34.4, 5.6 and 15.7. This fits correctly with the observed composition of the secretion. The differences observed for C and H may come from contaminating organic molecules collected on the leaf (such as dust and/or soil particles). The secretion was thus determined to be composed of about 18 % Na and 82 % glutamine.
EXAMPLE 6 Confirmation of Phenotype/Genotvpe Association in Arabidopsis In order to further confirm the association between the GDUl phenotype and the
GDUl gene presented in SEQ ID NO: 1, a genomic fragment comprising the GDUl cDNA (the gene has no introns), as provided in SEQ ID NO:l, was over-expressed in wild type Arabidopsis plants. The GDUl fragment was amplified by PCR and cloned into Kpnl and BamHl sites in the binary vector pAG2370. pAG2370, whose sequence is provided in SEQ ID NO:4, comprises the vector backbone from the vector pBIN19
(GI1256363), T-DNA left and right border fragments, and, between border fragments, the CsVMV promoter sequence and a Nos termination sequence for controlling expression of the inserted gene, and the neomycin phosphotransferase (NPTIJ) gene, which confers kanamycin resistance, whose expression is controlled by the RE4 promoter (US Patent No. 6054635) and the G7 termination sequence. The GDUl fragment was inserted between the CsVMV promoter region, proximal to the 5' end of genomic fragment, and the Nos termination sequence, proximal to the 3' end of the genomic fragment. The construct was designated pAG2370-G C/i.
Another GDUl construct was generated from the plasmid obtained from plasmid rescue using the Pst I enzyme. A fragment was obtained by complete digest with Kpnl and partial digest with HindHI, which contained the pBS backbone, the 4 x 35S enhancer, and a genomic DNA that extended from, upstream, the border of the T-DNA insertion to, downstream, a Hindu site distal to the 3' end of the GDUl gene. This fragment was cloned into the pAG2370 vector from which the CsVMV promoter had been removed by digestion with Kpn I and Hind UJ, and the resulting construct was designated pAG2370- GDU1-VR.
These constructs, as well as control constructs containing another gene in the vicinity of the T-DNA insertion, with and without the GDUl gene, were transformed into Agrobacterium tumefaciens by electroporation
Wild type Arabidopsis (COL-0) plants were transformed with ΌAG2310-GDU1 and pAG2370-GZ)cH-PR using standard vacuum infiltration methods. All infiltrated seeds were plated in selective media (approximately 60 μg/ml kanamycin), and kanamycin-resistant Ti plants were transplanted to flats. The transformation process generated 31 independent To events from ΌAG2310-GDU1 , 23 of which showed the GDUl phenotype of white leaf secretions. From transformation with pAG2370-G.DC/i -PR, 25 independent To lines were generated, 8 of which showed the GDUl phenotype. Control constructs that did not contain the GDUl gene had no noticeable phenotype. These results showed that GDUl overexpression is responsible of the GDUl phenotype.
EXAMPLE 7
Analysis of the GDUl expression in plant tissues
The expression pattern directed by the GDUl promoter, presented in SEQ ID NO:3, was characterized using a beta-glucuronidase (GUS) reporter construct. The GDUl promoter, comprising approximately 2.7 kb of the genomic sequence directly upstream of the GDUl start codon, was amplified by PCR and ligated to the E. coli GUS coding sequence. The fusion product was inserted into the pPZ212 vector backbone (GI 506657; Hajdukiewicz et al., 1994). The sequence of the resulting reporter construct, designated GDU1-GOS, is presented in SEQ ID NO:5. GDUl -GUS was introduced into Arabidopsis by Agrob cteπwm-mediated transformation and GUS expression was analyzed in several Ti and T2 plants.
While there was some variability to the pattern and intensity of GUS staining, expression analysis revealed vascular-specific activity of the GDUl promoter. All lines studied showed GUS expression in the root stele, and, more precisely in the stele of the root up to the region where the cells are dividing, co-localizing with the phloem. All lines also showed strong GUS expression in the phloem and xylem pole of the stem vascular bundle. Over 50% of the lines studied showed GDUl promoter activity in the aerial part vasculature and/or in the hydathodes. GDUl promoter activity was sometimes observed in the bases of the developing siliques, and in the vasculature of the anthers. A profile of GDUl expression was also obtained by performing RT-PCR of GDUl coding sequences using mRNA derived from various organs of the plant. The expression observed using the reporter construct was in accordance with the pattern determined from RT-PCR. GDUl mRNA was shown to be most abundant in the roots and in the stem.
Results of RT-PCR experiments are presented in Table 3. Expression levels are shown in relation to levels in the caulinary leaves and young stem, which were normalized to 100%. Table 3. GDUl expression in Arabidopsis tissues.
Figure imgf000034_0001
3w: shoot of 3-week old plantlet ; 4w: shoots of 4-week old plant; rL: rosette leaves ; cL: caulinary leaves ; ySt: young stem ; oSt: old stem ; FI: flowers ; ySi: young siliques ; oSi: old siliques; R: roots. Values are expressed as a percentage of the maximum expression level.
EXAMPLE 8
Analysis of amino acid content of GDUl mutants plants
The amino acid content of mutants plants shown to overexpress GDUl was analyzed. Free amino acid was extracted from ground plant tissues by either ethanol or NaCitrate extraction. For the ethanol extraction, 500 mg of fresh leaves were ground in 2 ml 100 % ethanol, and centrifuged. The pellets were successively resuspended in 80% ethanol, 60% ethanol, and water, and were centrifuged following each resuspension. The supernatants from successive resuspension steps were pooled and assayed for amino acid content. For the Citrate extraction, 500 mg of leaves are ground in Li citrate, pH 2.2, centrifuged and the supernatant was assayed for amino acids. Before tissue collection, the plants were washed twice with distilled water to remove the secreted glutamine on the leaves. The percent increase, +/- standard deviations, in the quantity of 18 amino acids in GDUl as compared to wild-type Arabidopsis plants is presented in Table 4. For both GDUl and wild-type tissue, the values of (mg-amino acid)/(g-fresh weight) were determined. The percent increase was calculated by taking the ratio of these values for GDUl and wild- type. Table 4. Amino acid content of GDUl mutant plants
Figure imgf000035_0001
The content of each of these amino acids is greater in the mutant than in the wild type, and the amounts of serine, threonine and glutamate are more than doubled.
REFERENCES
Altschul, S.F. et al, J. Mol. Biol. 215:403-410, 1990.
Altschul, S.F. et al, Nucleic Acids Res. 25:3389-3402, 1997. Ausubel FM et al. Current Protocols in Molecular Biology, John Wiley & Sons,
New York, N.Y., 1993.
Baldwin D et al, Cur Opin Plant Biol. 2(2):96-103, 1999.
Barker, L. et al. (2000), The Plant Cell, 12: 1153-1164.
Baulcombe D, Arch Virol Suppl 15:189-201, 1999. Behringer and Medford, Plant Mol. Biol. Rep. 10(2): 190-198, 1992.
Burge and Karlin, 1997, J Mol Biol 268:78-94.
Carter et al, Nucl. Acids Res. 13:4331, 1986. Christensen S et al., 9th International Conference on Arabidopsis Research. Univ. of Wisconsin-Madison, June 24-28, 1998. Abstract 165.
Cough, SJ and Bent, AF, the Plant Journal 16(6): 735-743, 1998.
Crameri A and Stemmer WP, Bio Techniques 18(2): 194-6, 1995. Creighton, T.E., PROTEINS: STRUCTURE AND MOLECULAR PROPERTIES. W.H.
Freeman & Co., San Francisco, pp. 79-86, 1983.
Doolittle, R.F., OF URFS and ORFS (University Science Books, CA, 1986.)
Fang G et al, Plant Cell, l(l):141-50, 1989.
Feldman et al, Science 243: 1351-1354, 1989. Fridborg I et al, Plant Cell 11: 1019-1032, 1999.
Geest AH and Hall TC, Plant Mol Biol 32(4):579-88, 1996).
Gelvin, S. B., Schilperoort, R. A., Varma, D. P. S., eds. Plant Molecular Biology Manual 1990.
Glick, BR and Thompson, JE, Eds. METHODS IN PLANT MOLECULAR BIOLOGY AND BIOTECHNOLOGY, p. 213-221, CRC Press, 1993.
Hajdukiewicz et al., Plant Mol. Biol. 25:989-994, 1994.
Harling et al, EMBO J. 16: 5855-66, 1997.
Haught C et al BioTechniques 16(l):47-48, 1994.
Hayashi H et al, Science 258: 1350-1353, 1992. Hofmann and Stoffel, 1993, Biol. Chem. Hoppe-Seyler 374,166.
Jakobovits, A, et al, Ann N Y Acad Sci 764:525-35, 1995.
Jakobovits, A, Curr Opin Biotechnol 6(5):561-6, 1995.
Jensen, L.G., et al, Proc. Natl. Acad. Sci. USA 93:3487-3491, 1996.
Jones et al, Nature 321:522-525, 1986. Jones JD et al, Transgenic Res 1 :285-297 1992.
Kakimoto, Science 274: 982-5, 1996.
Kardailsky, I et al, Science 286: 1962-1965, 1998.
Kardailsky et al, Science 286: 1962-5, 1999.
Kohler and Milstein, Nature 256:495, 1975. Kunkel TA et al, Methods Enzymol. 204:125-39, 1991.
Lacombe, B. et al. (2000), The Plant Cell 12: 837-851
Liu et al. Plant Journal 8(3) 457-463, 1995.
Maniatis, et al, MOLECULAR CLONING: A LABORATORY MANUAL, 2d Edition, 1989.
Marks and Feldman, Plant Cell 1:1053-1050, 1989. Miklashevichs et al Plant J. 12: 489-98, 1997.
Moller et al, 2001, Bioinformatics, 17:646-653.
Napoli, et al, Plant Cell 2:279-289, 1989.
Novak, J and Novak, L, Promega Notes Magazine Number 61:27, 1997. Omirulleh et al, Plant Mol Biol. 21(3):415-28, 1993.
Riechmann et al, Nature 332:323-327, 1988.
Sambrook et al. Molecular Cloning: A Laboratory Manual (Second Edition), Cold Spring Harbor Press, Plainview, N. Y.,1989.
Schaffer R, et αZ., Cell 93: 1219-1229, 1998. Schell et al, Trends Plant Sci. 3: 130, 1998.
Smith, et al, Nature 334:724-726, 1988.
Thompson JD et al, 1994, Nucleic Acids Res 22:4673-4680.
Truenit, E. Sauer, N. (1995), Planta 196: 564
Van Haaren MJJ et al, Plant Mol Bio 21:625-640, 1993. Verdaguer B et al, Plant Mol Biol 37:1055-1067, 1998.
Verhoeyen et al, Science 239:1534-1536, 1988.
Walden et. al, EMBO J. 13: 4729-36, 1994.
Walden et al, Plant Mol. Biol. 26: 1521-8, 1994.
Waterhouse, et al, Proc. Natl. Acad. Sci. USA 95:13959-13964, 1998. Wells et al, Gene 34:315, 1985.
Wells et al, Philos. Trans. R. Soc. London SerA 317:415, 1986.
Weigel D, et al, Plant Physiology, 122:1003-1013, 2000.
Wilson K et al, Plant Cell 8: 659-671, 1996.
Xu YL, et al, Plant Cell, 11: 927-36, 1999. Zoller et al, Nucl. Acids Res. 10:6487, 1987.

Claims

IT IS CLAIMED:
1. A method of producing a glutamine secretion phenotype in a plant, said method comprising: a) introducing into progenitor cells of the plant a plant transformation vector comprising a nucleotide sequence that encodes or is complementary to a sequence that encodes a GDUl polypeptide, and b) growing the transformed progenitor cells to produce a transgenic plant, wherein said polynucleotide sequence is expressed and said transgenic plant exhibits a glutamine secretion phenotype.
2. The method of Claim 1 wherein the GDUl polypeptide has at least 50% sequence identity to the amino acid sequence presented as SEQ ID NO:2 and wherein the polypeptide comprises a VIMAG domain comprising at least 80% sequence identity to the sequence presented as amino acids 92-110 of SEQ ID NO:2.
3. The method of Claim 1 wherein the GDUl polypeptide has at least 80% sequence identity to the amino acid sequence presented as SEQ ID NO:2.
4. The method of Claim 1 wherein the GDUl polypeptide has at least 90% sequence identity to the amino acid sequence presented as SEQ ID NO:2.
5. The method of Claim 1 wherein the GDUl polypeptide has the amino acid sequence presented as SEQ ID NO: 2.
6. The method of Claim 1 wherein a GDUl polypeptide is over-expressed in the transgenic plant.
7. A plant obtained by a method of claim 1.
8. A plant part obtained from a plant according to Claim 1.
9. A method of expressing a coding sequence in the vascular tissue of a plant comprising: a) introducing into progenitor cells of the plant a plant transformation vector comprising said coding sequence and an GDUl promoter sequence operably linked to said coding sequence, and b) growing the transformed progenitor cells to produce a transgenic plant, wherein said transgenic plant expresses said coding sequences in vascular tissue.
10. The method of claim 9 wherein the coding sequence is expressed in a tissue selected from the group consisting of xylem, phloem, leaf vasculature, anther vasculature, silique vasculature, and hydathode.
11. The method of claim 9 wherein the GDUl promoter comprises a nucleic acid with the sequence presented in SEQ ID NO: 3 or a fragment thereof that directs vascular tissue specific expression.
12. An isolated nucleic acid molecule that comprises an GDUl promoter operably linked to a heterologous coding sequence.
13. The isolated nucleic acid molecule of claim 12 wherein the GDUl promoter comprises a nucleic acid with the sequence presented in SEQ ID NO: 3 or a fragment thereof that directs vascular tissue specific expression.
PCT/US2003/004120 2002-02-13 2003-02-11 Identification and characterization of an glutamine dumper 1 phenotype (gdui1) in arabidopsis WO2003067965A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2003216244A AU2003216244A1 (en) 2002-02-13 2003-02-11 Identification and characterization of an glutamine dumper 1 phenotype (gdui1) in arabidopsis

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US35696202P 2002-02-13 2002-02-13
US60/356,962 2002-02-13

Publications (2)

Publication Number Publication Date
WO2003067965A2 true WO2003067965A2 (en) 2003-08-21
WO2003067965A3 WO2003067965A3 (en) 2003-12-31

Family

ID=27734711

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2003/004120 WO2003067965A2 (en) 2002-02-13 2003-02-11 Identification and characterization of an glutamine dumper 1 phenotype (gdui1) in arabidopsis

Country Status (2)

Country Link
AU (1) AU2003216244A1 (en)
WO (1) WO2003067965A2 (en)

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
DATABASE GENBANK [Online] November 1998 BEVAN ET AL., XP002970703 Retrieved from NCBI Database accession no. (AL031004) *
DATABASE GENBANK [Online] October 2000 PERNA ET AL., XP002970704 Retrieved from NCBI Database accession no. (AAG59477) *

Also Published As

Publication number Publication date
AU2003216244A8 (en) 2003-09-04
AU2003216244A1 (en) 2003-09-04
WO2003067965A3 (en) 2003-12-31

Similar Documents

Publication Publication Date Title
EP1941045B1 (en) Use of a nucleic acid sequence for the generation of a transgenic plant having enhanced drought tolerance
JP5779619B2 (en) Plant height regulatory genes and uses thereof
US7034138B2 (en) Identification and characterization of an anthocyanin mutant (ANT1) in tomato
JP4987734B2 (en) Transformed plant introduced with stress responsive gene
EP1499175A2 (en) Identification and characterization of an anthocyanin mutant (ant1) in tomato
KR100860199B1 (en) the production of fruit tree for transforming tree form by using MADS-Box gene
CN112961842B (en) Soybean phytochrome chromophore synthesis gene GmHY2 and encoding protein and application thereof
WO2001094588A2 (en) Gene (cur) related to a curly phenotype in plants
CN110627887B (en) Application of SlTLFP8 protein and related biological material thereof in regulation and control of tomato drought resistance
US7560612B2 (en) Early-inflorescence-preferred regulatory elements and uses thereof
AU3681999A (en) Gene for floral regulation and methods for controlling of flowering
US6509191B2 (en) Identification and characterization of a PAGODA phenotype (PGD) in plants
WO2003067965A2 (en) Identification and characterization of an glutamine dumper 1 phenotype (gdui1) in arabidopsis
CN113528533B (en) Limonium bicolor gene LbRSG and application thereof
US6534695B2 (en) Identification and characterization of a Dwarf and Late Flowering 2 phenotype (DLF2) in Arabidopsis
CN114230649B (en) Tn1 protein related to rice tillering force, related biological material and application thereof
AU2002245189B8 (en) Identification and characterization of an anthocyanin mutant (ANT1) in tomato
AU2002245189B2 (en) Identification and characterization of an anthocyanin mutant (ANT1) in tomato
AU768139B2 (en) Plant lesion formation suppressing gene, Sp17 and use thereof
WO2002070661A2 (en) Identification and characterization of a waxy curled leaf phenotype (wcl1) in arabidopsis
KR100682131B1 (en) GENE RELATED TO SEED DEVELOPMENT ISOLATED FROM Oryza sativa EXPRESSION VECTOR CONTAINING THE GENE TRANSFORMANT TRANSFORMED BY THE VECTOR AND METHOD FOR PREPARATION OF THE TRANSFORMANT
JP3564537B2 (en) Plant Ran gene mutant and method for promoting plant flowering time using the mutant
AU2002245189A1 (en) Identification and characterization of an anthocyanin mutant (ANT1) in tomato

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SC SD SE SG SK SL TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase in:

Ref country code: JP

WWW Wipo information: withdrawn in national office

Country of ref document: JP