EP1620557A2 - Novel glyphosate-n-acetyltransferase (gat) genes - Google Patents

Novel glyphosate-n-acetyltransferase (gat) genes

Info

Publication number
EP1620557A2
EP1620557A2 EP04775913A EP04775913A EP1620557A2 EP 1620557 A2 EP1620557 A2 EP 1620557A2 EP 04775913 A EP04775913 A EP 04775913A EP 04775913 A EP04775913 A EP 04775913A EP 1620557 A2 EP1620557 A2 EP 1620557A2
Authority
EP
European Patent Office
Prior art keywords
amino acid
acid residue
positions
seq
group
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP04775913A
Other languages
German (de)
French (fr)
Inventor
Linda A. Castle
Dan Siehl
Lorraine J. Giver
Jeremy Minshull
Christina Ivy
Yong Hong Chen
Phillip A. Patten
Rebecca Gorton
Nicholas B. Duck
Billy Fred Mccutchen
Roger Kemble
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Pioneer Hi Bred International Inc
Verdia LLC
EIDP Inc
Original Assignee
Pioneer Hi Bred International Inc
EI Du Pont de Nemours and Co
Verdia LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
First worldwide family litigation filed litigation Critical https://patents.darts-ip.com/?family=34118130&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=EP1620557(A2) "Global patent litigation dataset” by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.
Priority claimed from US10/427,692 external-priority patent/US7462481B2/en
Application filed by Pioneer Hi Bred International Inc, EI Du Pont de Nemours and Co, Verdia LLC filed Critical Pioneer Hi Bred International Inc
Priority to EP11153914A priority Critical patent/EP2322629A3/en
Priority to EP12179273.3A priority patent/EP2535414B1/en
Publication of EP1620557A2 publication Critical patent/EP1620557A2/en
Withdrawn legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N9/00Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
    • C12N9/10Transferases (2.)
    • C12N9/1025Acyltransferases (2.3)
    • C12N9/1029Acyltransferases (2.3) transferring groups other than amino-acyl groups (2.3.1)
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/63Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
    • C12N15/79Vectors or expression systems specially adapted for eukaryotic hosts
    • C12N15/82Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
    • C12N15/8241Phenotypically and genetically modified plants via recombinant DNA technology
    • C12N15/8261Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
    • C12N15/8271Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance
    • C12N15/8274Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for herbicide resistance
    • C12N15/8275Glyphosate

Definitions

  • Crop selectivity to specific herbicides can be conferred by engineering genes into crops which encode appropriate herbicide metabolizing enzymes, some cases these enzymes, and the nucleic acids that encode them, originate in a plant, h other cases, they are derived from other organisms, such as microbes.
  • transgenic plants have been engineered to express a variety of herbicide tolerance/metabolizing genes, from a variety of organisms. For example, acetohydroxy acid synthase, which has been found to make plants that express this enzyme resistant to multiple types of herbicides, has been introduced into a variety of plants (see, e.g., Hattori et al.
  • genes that confer tolerance to herbicides include: a gene encoding a chimeric protein of rat cytochrome P4507A1 and yeast NADPH-cytochrome P450 oxidoreductase (Shiota et al. (1994) Plant Physiol. 106: 17), genes for glutathione reductase and superoxide dismutase (Aono et al. (1995) Plant Cell Physiol. 36: 1687, and genes for various phosphotransferases (Datta et al. (1992) Plant Mol. Biol. 20: 619).
  • N-phosphonomethylglycine commonly referred to as glyphosate.
  • Glyphosate is the top selling herbicide in the world, with sales projected to reach $5 billion by 2003. It is a broad spectrum herbicide that kills both broadleaf and grass-type plants.
  • a successful mode of commercial level glyphosate resistance in transgenic plants is by introduction of a modified Agrobacterium CP4 5- enolpyruvylshikimate-3 -phosphate synthase (hereinafter referred to as EPSP synthase or EPSPS) gene.
  • EPSP synthase enolpyruvylshikimate-3 -phosphate synthase
  • the transgene is targeted to the chloroplast where it is capable of continuing to synthesize EPSP from phosphoenolpyruvic acid (PEP) and shikimate-3 -phosphate in the presence of glyphosate.
  • PEP phosphoenolpyruvic acid
  • shikimate-3 -phosphate in the presence of glyphosate.
  • the native EPSP synthase is inhibited by glyphosate.
  • plants sprayed with glyphosate quickly die due to inhibition of EPSP synthase which halts the downstream pathway needed for aromatic amino acid, hormone, and vitamin biosynthesis.
  • the CP4 glyphosate-resistant soybean transgenic plants are marketed, e.g., by Monsanto under the name "Round UP ReadyTM.”
  • the predominant mechanism by which glyphosate is degraded is through soil microflora metabolism.
  • the primary metabolite of glyphosate in soil has been identified as aminomethylphosphonic acid (AMP A), which is ultimately converted into ammonia, phosphate and carbon dioxide.
  • AMP A aminomethylphosphonic acid
  • the proposed metabolic scheme that describes the degradation of glyphosate in soil through the AMPA pathway is shown in Fig. 8.
  • sarcosine pathway An alternative metabolic pathway for the breakdown of glyphosate by certain soil bacteria, the sarcosine pathway, occurs via initial cleavage of the C-P bond to give inorganic phosphate and sarcosine, as depicted in Fig. 9.
  • Another successful herbicide/transgenic crop package is glufosinate
  • Glufosinate is also a broad spectrum herbicide. Its target is the glutamate synthase enzyme of the chloroplast. Resistant plants carry the bar gene from Streptomyces hygroscopicus and achieve resistance by the N-acetylation activity of bar, which modifies and detoxifies glufosinate.
  • An enzyme capable of acetylating the primary amine of AMP A is reported in PCT Application No. WO00/29596. The enzyme was not described as being able to acetylate a compound with a secondary amine (e.g., glyphosate). While a variety of herbicide resistance strategies are available as noted above, additional approaches would have considerable commercial value.
  • the present invention provides novel polynucleotides and polypeptides for conferring herbicide tolerance, as well as numerous other benefits as will become apparent during review of the disclosure.
  • the present invention provides methods and reagents for rendering an organism, such as a plant, resistant to glyphosate by one or more of the embodiments described below.
  • One embodiment of the invention provides novel polypeptides referred to herein as glyphosate-N-acetyltransferase ("GAT") polypeptides.
  • GAT polypeptides are characterized by their structural similarity to one another, e.g., in terms of sequence similarity when the GAT polypeptides are aligned with one another.
  • GAT polypeptides of the present invention possess glyphosate-N- acetyltransferase activity, i.e., the ability to catalyze the acetylation of glyphosate. These GAT polypeptides transfer the acetyl group from acetyl CoA to the N of glyphosate. In addition, some GAT polypeptides transfer the propionyl group of propionyl CoA to the N of glyphosate. Some GAT polypeptides are also capable of catalyzing the acetylation of glyphosate analogs and/or glyphosate metabolites, e.g., aminomethylphosphonic acid.
  • GAT polypeptides correspond to SEQ ID NO: 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 621, 623, 625, 627, 629, 631, 633, 635, 637, 639, 641, 643, 645, 647, 649, 651, 653, 655, 657, 659, 661, 663, 665, 667, 669, 671, 673, 675, 677, 679, 681, 683,
  • GAT polynucleotides e.g., SEQ ID NO: 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565 ' , 566, 567, 620, 622, 624, 626, 628, 630, 632, 634, 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666,
  • GAT polynucleotides are characterized by their ability to encode GAT polypeptides.
  • a GAT polynucleotide is engineered for better plant expression by replacing one or more parental codons with a synonymous codon that is preferentially used in plants relative to the parental codon.
  • a GAT polynucleotide is modified by the introduction of a nucleotide sequence encoding an N-terminal chloroplast transit peptide.
  • a GAT polynucleotide is modified by the insertion of one or more G+C containing codons (such as GCG or GCT) immediately downstream of and adjacent to the initiating Met codon.
  • GAT polypeptides GAT polynucleotides and glyphosate-N- acetyltransferase activity are described in more detail below.
  • the invention further includes certain fragments of the GAT polypeptides and GAT polynucleotides described herein.
  • the invention includes non-native variants of the polypeptides and polynucleotides described herein, wherein one or more amino acid of the encoded polypeptide has been mutated. h certain preferred embodiments, the GAT polypeptides of the present invention are characterized as follows.
  • the invention further provides an isolated or recombinant polypeptide comprising an amino acid sequence selected from the groups consisting of: (a) an amino acid sequence that is at least 98% identical to SEQ ID NO:577; (b) an amino acid sequence that is at least 97% identical to SEQ ID NO:578; (c) an amino acid sequence that is at least 97% identical to SEQ LD NO:621; (d) an amino acid sequence that is at least 98% identical to SEQ ID NO:579; (e) an amino acid sequence that is at least 98% identical to SEQ ID NO:602; (f) an amino acid sequence that is at least 95% identical to SEQ LD NO:697; (g) an amino acid sequence that is at least 96% identical to SEQ ID NO:712; (h) an amino acid sequence that is at least 97% identical to SEQ ID NO:613; (i) an amino acid sequence that is at least 89% identical to SEQ LD NO:677; (j) an amino acid sequence that is at least 96% identical to
  • the invention further provides an isolated or recombinant polypeptide comprismg an amino acid sequence selected from the groups consisting of: (a) an amino acid sequence that is at least 96% identical to positions 2-146 of SEQ ID NO:919 (such as, for example, SEQ ID NO:917, 919, 921, 923, 925, 927, 833, 835, 839, 843, 845, 859, 863, 873, 877, 891, 895, 901, 905, 907, 913, 915, or 950); (b) an amino acid sequence that is at least 97% identical to positions 2-146 of SEQ ID NO:929 (such as, for example, SEQ LD NO:929, 931, 835, 843, 849, or 867); (c) an amino acid sequence that is at least 98% identical to positions 2-146 of SEQ ID NO:847 (such as, for example, SEQ ID NO:845 or 847); (d) an amino acid sequence that is at least 98% identical to positions
  • the amino acid sequence of the polypeptide comprises Met, Met- Ala, or Met- Ala- Ala on the N-terminal side of the amino acid corresponding to position 2 of the reference amino acid sequence.
  • the invention further provides an isolated or recombinant polypeptide comprising an amino acid sequence that is at least 95% identical to positions 2-146 of SEQ LD NO:929 and which comprises a Gly or an Asn residue at the amino acid position corresponding to position 33 of SEQ LD NO:929 (such as, for example, SEQ LD NO:837, 849, 893, 897, 905, 921, 927, 929 or 931).
  • the amino acid sequence of the polypeptide comprises Met, Met- Ala, or Met- Ala- Ala on the N-terminal side of the amino acid corresponding to position 2 of the reference amino acid sequence.
  • the invention further provides a nucleic acid construct comprising a polynucleotide of the invention.
  • the construct can be a vector, such as a plant transformation vector, hi some aspects a vector of the invention will comprise a T- DNA sequence.
  • the construct can optionally include a regulatory sequence (e.g., a promoter) operably linked to a GAT polynucleotide, where the promoter is heterologous with respect to the polynucleotide and effective to cause sufficient expression of the encoded polypeptide to enhance the glyphosate tolerance of a plant cell transformed with the nucleic acid construct.
  • a GAT polynucleotide functions as a selectable marker, e.g., in a plant, bacteria, actinomycete, yeast, algae or other fungi.
  • an organism that has been transformed with a vector including a GAT polynucleotide selectable marker can be selected based on its ability to grow in the presence of glyphosate.
  • a GAT marker gene can be used for selection or screening for transformed cells expressing the gene.
  • the invention further provides vectors with stacked traits, i.e., vectors that encode a GAT polypeptide and that also include a second polynucleotide sequence encoding a second polypeptide that confers a detectable phenotypic trait upon a - ll CTTi/xgcuxism expressing the second poiypepti ⁇ e at an effective ievei, tor example disease resistance or pest resistance.
  • the detectable phenotypic trait can also function as a selectable marker, e.g., by conferring herbicide resistance or by providing some sort of visible marker.
  • the invention provides a composition comprising two or more polynucleotides of the invention.
  • the GAT polynucleotides encode GAT polypeptides having different kinetic parameters, i.e., a GAT variant having a lower K m can be combined with one having a higher k cat .
  • compositions containing two or more GAT polynucleotides or encoded polypeptides are a feature of the invention, h some cases, these compositions are libraries of nucleic acids containing, e.g., at least 3 or more such nucleic acids.
  • compositions produced by digesting the nucleic acids of the invention with a restriction endonuclease, a DNAse or an RNAse, or otherwise fragmenting the nucleic acids, e.g., mechanical shearing, chemical cleavage, etc. are also a feature of the invention, as are compositions produced by incubating a nucleic acid of the invention with deoxyribonucleotide triphosphates and a nucleic acid polymerase, such as a thermostable nucleic acid polymerase.
  • Cells transduced by a vector of the invention, or which otherwise incorporate a nucleic acid of the invention are an aspect of the invention.
  • the cells express a polypeptide encoded by the nucleic acid of the invention.
  • the cells incorporating the nucleic acids of the invention are plant cells.
  • Transgenic plants, transgenic plant cells, and transgenic plant explants incorporating the nucleic acids of the invention are also a feature of the invention.
  • the transgenic plants, transgenic plant cells, or transgenic plant explants express an exogenous polypeptide with glyphosate-N- acetyltransferase activity encoded by the nucleic acid of the invention.
  • the invention also provides transgenic seeds produced by the transgenic plants of the iiivt-ii ⁇ ii.
  • the invention further provides transgenic plants, transgenic plant cells, transgenic plant explants, or transgenic seeds having enhanced tolerance to glyphosate due to the expression of a polypeptide with glyphosate-N- acetyltransferase activity and a polypeptide that imparts glyphosate tolerance by another mechanism, such as a glyphosate-tolerant 5-enolpyruvylshikimate-3- phosphate synthase and/or a glyphosate-tolerant glyphosate oxido-reductase.
  • a polypeptide with glyphosate-N- acetyltransferase activity and a polypeptide that imparts glyphosate tolerance by another mechanism, such as a glyphosate-tolerant 5-enolpyruvylshikimate-3- phosphate synthase and/or a glyphosate-tolerant glyphosate oxido-reductase.
  • the invention provides transgenic plants or transgenic plant explants having enhanced tolerance to glyphosate, as well as tolerance to an additional herbicide due to the expression of a polypeptide with glyphosate-N- acetyltransferase activity, a polypeptide that imparts glyphosate tolerance by another mechanism, such as a glyphosate-tolerant 5-enolpyruvylshikimate-3- phosphate synthase and/or a glyphosate-tolerant glyphosate oxido-reductase and a polypeptide imparting tolerance to the additional herbicide, such as a mutated hydroxyphenylpyruvatedioxygenase, a sulfonamide-tolerant acetolactate synthase, a sulfonamide-tolerant acetohydroxy acid synthase, an imidazolinone-tolerant acetolactate synthase, an imidazolinone-tolerant acetohydroxyhydroxy acetohydroxy
  • the invention also provides transgenic plants, transgenic plant cells, transgenic plant explants, or transgenic seeds having enhanced tolerance to glyphosate, as well as tolerance to an additional herbicide due to the expression of a polypeptide with glyphosate-N-acetyltransferase activity and a polypeptide imparting tolerance to an additional herbicide, such as, a mutated hydroxyphenylpyruvatedioxygenase, a sulfonamide-tolerant acetolactate synthase, a sulfonamide-tolerant acetohydroxy acid synthase, an imidazolinone-tolerant acetolactate synthase, an imidazolinone-tolerant acetohydroxy acid synthase, a phosphinothricin acetyltransferase and a mutated protopo hyrinogen oxidase.
  • the invention also provides transgenic plants, transgenic plant cells, transgenic plant explants, or transgenic seeds having enhanced tolerance to glyphosate as well as additional desirable traits which may be conferred by one or more additional transgenes.
  • Methods of producing the polypeptides of the invention by introducing the nucleic cids cnco ⁇ ing ⁇ nem into celis and then expressing and optionally recovering them from the cells or culture medium are a feature of the invention.
  • the cells expressing the polypeptides of the invention are transgenic plant cells.
  • G/C-rich codons such as GCG or GCT
  • Another aspect of the invention relates to methods of polynucleotide diversification to produce novel GAT polynucleotides and polypeptides by recombining or mutating the nucleic acids of the invention in vitro ox in vivo.
  • the recombination produces at least one library of recombinant GAT polynucleotides.
  • the libraries so produced are embodiments of the invention, as are cells comprising the libraries.
  • methods of producing a modified GAT polynucleotide by mutating a nucleic acid of the invention are embodiments of the invention.
  • Recombinant and mutant GAT polynucleotides and polypeptides produced by the methods of the invention are also embodiments of the invention.
  • diversification is achieved by using recursive recombination, which can be accomplished in vitro, in vivo, in silico, or a combination thereof.
  • recursive recombination which can be accomplished in vitro, in vivo, in silico, or a combination thereof.
  • Some examples of diversification methods described in more detail below are family shuffling methods and synthetic shuffling methods.
  • the invention provides methods for producing a glyphosate-resistant transgenic plant or plant cell that involve transforming a plant or plant cell with a polynucleotide encoding a glyphosate-N-acetyltransferase, and optionally regenerating a transgenic plant from the transformed plant cell.
  • the polynucleotide is a GAT polynucleotide, optionally a GAT polynucleotide derived from a bacterial source.
  • the method can comprise growing the transformed plant or plant cell in a concentration of glyphosate that inhibits the growth of a wild-type plant of the same species without inhibiting the growth of the transformed plant.
  • the method can comprise growing the transformed plant or plant cell or progeny of the plant or plant cell in increasing concentrations of glyphosate and/or in a concentration of glyphosate that is lethal to a wild-type plant or plant cell of the same species.
  • a glyphosate-resistant transgenic plant produced by this method can be propagated, for example by crossing it with a second plant, such that at least some progeny of the cross display glyphosate tolerance.
  • the invention further provides methods for selectively controlling weeds in a field containing a crop that involve planting the field with crop seeds or plants which are glyphosate-tolerant as a result of being transformed with a gene encoding a glyphosate N-acetyltransferase, and applying to the crop and weeds in the field a sufficient amount of glyphosate to control the weeds without significantly affecting the crop.
  • the invention further provides methods for controlling weeds in a field and preventing the emergence of glyphosate-resistant weeds in a field containing a crop which involve planting the field with crop seeds or plants that are glyphosate- tolerant as a result of being transformed with a gene encoding a glyphosate-N- acetyltransferase and a gene encoding a polypeptide imparting glyphosate tolerance by another mechanism, such as a glyphosate-tolerant 5- enolpyruvylshikimate-3 -phosphate synthase and/or a glyphosate-tolerant glyphosate oxido-reductase and applying to the crop and the weeds in the field a sufficient amount of glyphosate to control the weeds without significantly affecting the crop.
  • the invention provides methods for controlling weeds in a field and preventing the emergence of herbicide resistant weeds in a field containing a crop which involve planting the field with crop seeds or plants that are glyphosate-tolerant as a result of being transformed with a gene encoding a glyphosate-N-acetyltransferase, a gene encoding a polypeptide imparting glyphosate tolerance by another mechanism, such as a glyphosate-tolerant 5- enolpyruvylshikimate-3 -phosphate synthase and/or a glyphosate-tolerant glyphosate oxido-reductase and a gene encoding a polypeptide imparting tolerance to an additional herbicide, such as a mutated hydroxyphenylpyruvatedioxygenase, a sulfonamide-tolerant acetolactate synthase, a sulfonamide-tolerant acetohydroxy acid synth
  • the invention further provides methods for controlling weeds in a field and preventing the emergence of herbicide resistant weeds in a field containing a crop which involve planting the field with crop seeds or plants that are glyphosate- tolerant as a result of being transformed with a gene encoding a glyphosate-N- acetyltransferase and a gene encoding a polypeptide imparting tolerance to an additional herbicide, such as a mutated hydroxyphenylpyruvatedioxygenase, a sulfonamide-tolerant acetolactate synthase, a sulfonamide-tolerant acetohydroxy acid synthase, an imidazolinone-tolerant acetolactate synthase, an imidazolinone- tolerant acetohydroxy acid synthase, a phosphinothricin acetyltransferase and a mutated protoporphyrinogen oxidase and applying to
  • the invention further provides methods for producing a genetically transformed plant that is tolerant to glyphosate that involve inserting into the genome of a plant cell a recombinant, double-stranded DNA molecule comprising: (i) a promoter which functions in plant cells to cause the production of an RNA sequence; (ii) a structural DNA sequence that causes the production of an RNA sequence which encodes a GAT; and (iii) a 3' non-translated region which functions in plant cells to cause the addition of a stretch of polyadenyl nucleotides to the 3' end of the RNA sequence; where the promoter is heterologous with respect to the structural DNA sequence and adapted to cause sufficient expression of the encoded polypeptide to enhance the glyphosate tolerance of a plant cell transformed with the DNA molecule; obtaimng a transformed plant cell; and regenerating from the transformed plant cell a genetically transformed plant which has increased tolerance to glyphosate.
  • the invention further provides methods for producing a crop that involve growing a crop plant that is glyphosate-tolerant as a result of being transformed with a gene encoding a glyphosate N-acetyltransferase, under conditions such that the crop plant produces a crop; and harvesting a crop from the crop plant.
  • These methods often include applying glyphosate to the crop plant at a concentration effective to control weeds.
  • Exemplary crop plants include cotton, corn, and soybean.
  • the invention also provides computers, computer readable medium and integrated systems, including databases that are composed of sequence records including character strings corresponding to SEQ ID NO: 1-10, 16, 48, 190, 193, 196, 202, 205, 268, 300, 442, 445, 448, 454, 457, 515-830 and 832-972.
  • integrated systems optionally include one or more instruction set for selecting, aligning, translating, reverse-translating or viewing any one or more character strings corresponding to SEQ LD NO: 1-10, 16, 48, 190, 193, 196, 202, 205, 268, 300, 442, 445, 448, 454, 457, 515-830 and 832-972, with each other and/or with any additional nucleic acid or amino acid sequence.
  • Figure 1 depicts the N-acetylation of glyphosate catalyzed by a glyphosate- N-acetyltransferase ("GAT").
  • Figure 2 illustrates mass spectroscopic detection of N-acetylglyphosate produced by an exemplary Bacillus culture expressing a native GAT activity.
  • Figure 3 is a table illustrating the relative identity between GAT sequences isolated from different strains of bacteria and yitl from. Bacillus subtilis.
  • Figure 4 is a map of the plasmid pMAXY2120 for expression and purification of the GAT enzyme from E. coli cultures.
  • Figure 5 is a mass spectrometry output showing increased N- acetylglyphosate production over time in a typical GAT enzyme reaction mix.
  • Figure 6 is a plot of the kinetic data of a GAT enzyme from which a K M of 2.9 mM for glyphosate was calculated.
  • Figure 7 is a plot of the kinetic data taken from the data of Figure 6 from which a K M of 2 ⁇ M was calculated for Acetyl CoA.
  • Figure 8 is a scheme that describes the degradation of glyphosate in soil through the AMPA pathway.
  • Figure 9 is a scheme that describes the sarcosine pathway of glyphosate degradation.
  • Figure 10 is the BLOSUM62 matrix.
  • Figure 11 is a map of the plasmid pMAXY2190.
  • Figure 12 depicts a T-DNA construct with gat selectable marker.
  • Figure 13 depicts a yeast expression vector with gat selectable marker.
  • Figure 14 illustrates effect of glyphosate on plant height at tasseling.
  • Figures 15A and 15B provide a comparison of the kinetic parameters K m and k cat /Km, respectively, for various GAT enzymes assayed in either the absence of added KC1 (unshaded bars) or in the presence of 20 mM KC1 (shaded bars) as described in Example 18. Error bars represent the standard deviation of multiple assays, where available.
  • Figures 16A, 16B and 16C provide a comparison of the kinetic parameters
  • the present invention relates to a novel class of enzymes exhibiting N- acetyltransferase activity.
  • the invention relates to a novel class of enzymes capable of acetylating glyphosate and glyphosate analogs, e.g., enzymes possessing glyphosate-N-acetyltransferase ("GAT") activity.
  • GAT glyphosate-N-acetyltransferase
  • Such enzymes are characterized by the ability to acetylate the secondary amine of a compound.
  • this compound is an herbicide, e.g., glyphosate, as illustrated schematically in Figure 1.
  • This compound can also be a glyphosate analog or a metabolic product of glyphosate degradation, e.g., aminomethylphosphonic acid.
  • glyphosate analog or a metabolic product of glyphosate degradation, e.g., aminomethylphosphonic acid.
  • acetylation of glyphosate is a key catalytic step in one metabolic pathway for catabolism of glyphosate
  • the enzymatic acetylation of glyphosate by naturally-occurring, isolated, or recombinant enzymes has not been previously described.
  • the nucleic acids and polypeptides of the invention provide a new biochemical pathway for engineering herbicide resistance.
  • the invention provides novel genes encoding GAT polypeptides.
  • Isolated and recombinant GAT polynucleotides corresponding to naturally occurring polynucleotides, as well as recombinant and engineered, e.g., diversified, GAT polynucleotides are a feature of the invention.
  • GAT polynucleotides are exemplified by SEQ LD NO: 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565, 566, 567, 620, 622, 624, 626, 628, 630, 632, 634, 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, 668, 670, 672, 674, 676, 678,
  • GAT polynucleotide and polypeptide sequences are provided as examples to help illustrate the invention, and are not intended to limit the scope of the genus of GAT polynucleotides and polypeptides described and/or claimed herein.
  • the invention also provides methods for generating and selecting diversified libraries to produce additional GAT polynucleotides, including polynucleotides encoding GAT polypeptides with improved and/or enhanced characteristics, e.g., altered K m for glyphosate, increased rate of catalysis, increased stability, etc., based upon selection of a polynucleotide constituent of the library for the new or improved activities described herein.
  • Such polynucleotides are especially favorably employed in the production of glyphosate-resistant transgenic plants.
  • the GAT polypeptides of the invention exhibit a novel enzymatic activity. Specifically, the enzymatic acetylation of the synthetic herbicide glyphosate has not been recognized prior to the present invention.
  • polypeptides herein described e.g., as exemplified by SEQ LD NO: 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 621, 623, 625, 627, 629, 631, 633, 635, 637, 639, 641, 643, 645, 647, 649, 651, 653, 655, 657, 659, 661, 663, 665, 667, 669, 671, 673,
  • glyphosate should be considered to include any herbicidally effective form of N- phosphonomethylglycme (including any salt thereof) and other forms which result in the production of the glyphosate anion inplanta.
  • glyphosate analog refers to any structural analog of glyphosate that has the ability to inhibit EPSPS at levels such that the glyphosate analog is herbicidally effective.
  • glyphosate-N-acetyltransferase activity or "GAT activity” refers to the ability to catalyze the acetylation of the secondary amine group of glyphosate, as illustrated, for example, in Figure 1.
  • a "glyphosate-N- acetyltransferase” or “GAT” is an enzyme that catalyzes the acetylation of the amine group of glyphosate, a glyphosate analog, and/or a glyphosate primary metabolite (i.e., AMPA or sarcosine).
  • a GAT is able to transfer the acetyl group from Acetyl CoA to the secondary amine of glyphosate and the primary amine of AMPA.
  • some GATs are also able to transfer the propionyl group of propionyl CoA to glyphosate, indicating that GAT is also an acyltransferase.
  • the exemplary GATs described herein are active from about pH 5-9, with optimal activity in the range of about pH 6.5-8.0. Activity can be quantified using various kinetic parameters which are well known in the art, e.g., k cat , K M , and k ca t M .
  • polynucleotide “nucleotide sequence,” and “nucleic acid” are used to refer to a polymer of nucleotides (A, C, T, U, G, etc. or naturally occurring or artificial nucleotide analogues), e.g., DNA or RNA, or a representation thereof, e.g., a character string, etc., depending on the relevant context.
  • a given polynucleotide or complementary polynucleotide can be determined from any specified nucleotide sequence.
  • amino acid sequence is a polymer of amino acids (a protein, polypeptide, etc) or a character string representing an amino acid polymer, depending on context.
  • protein polypeptide
  • peptide a polypeptide of amino acids
  • a polynucleotide, polypeptide, or other component is “isolated” when it is partially or completely separated from components with which it is normally associated (other proteins, nucleic acids, cells, synthetic reagents, etc).
  • a nucleic acid or polypeptide is "recombinant” when it is artificial or engineered, or derived from an artificial or engineered protein or nucleic acid.
  • a polynucleotide that is inserted into a vector or any other heterologous location, e.g., in a genome of a recombinant organism, such that it is not associated with nucleotide sequences that normally flank the polynucleotide as it is found in nature is a recombinant polynucleotide.
  • a protein expressed in vitro or in vivo from a recombinant polynucleotide is an example of a recombinant polypeptide.
  • glyphosate-N-acetyltransferase polypeptide and “GAT polypeptide” are used interchangeably to refer to any of a family of novel polypeptides provided herein.
  • glyphosate-N-acetyltransferase polynucleotide and “GAT polynucleotide” are used interchangeably to refer to a polynucleotide that encodes a GAT polypeptide.
  • a "subsequence” or “fragment” is any portion of an entire sequence.
  • a vector is a composition for facilitating cell transduction/transformation by a selected nucleic acid, or expression of the nucleic acid in the cell.
  • Vectors include, e.g., plasmids, cosmids, viruses, YACs, bacteria, poly-lysine, chromosome integration vectors, episomal vectors, etc.
  • substantially an entire length of a polynucleotide or amino acid sequence refers to at least about 70%, generally at least about 80%, or typically about 90% or more of a sequence.
  • an “antibody” refers to a protein comprising one or more polypeptides substantially or partially encoded by immunoglobulin genes or fragments of immunoglobulin genes.
  • the recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, epsilon and mu constant region genes, as well as myriad immunoglobulin variable region genes. Light chains are classified as either kappa or lambda.
  • Heavy chains are classified as gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, IgM, IgA, IgD and IgE, respectively.
  • a typical immunoglobulin (antibody) structural unit comprises a tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each pair having one "light” (about 25 kD) and one "heavy” chain (about 50-70 kD). The N-terminus of each chain defines a variable region of about 100 to 110 or more amino acids primarily responsible for antigen recognition.
  • the terms variable light chain (V L ) and variable heavy chain (V H ) refer to these light and heavy chains respectively.
  • Antibodies exist as intact immunoglobulins or as a number of well characterized fragments produced by digestion with various peptidases.
  • pepsin digests an antibody below the disulfide linkages in the hinge region to produce F(ab)'2, a dimer of Fab which itself is a light chain joined to VH-CH1 by a disulfide bond.
  • the F(ab)'2 may be reduced under mild conditions to break the disulfide linkage in the hinge region thereby converting the (Fab')2 dimer into an Fab' monomer.
  • the Fab' monomer is essentially a Fab with part of the hinge region (see, Paul, ed.
  • Antibodies include single chain antibodies, including single chain Fv (sFv) antibodies in which a variable heavy and a variable light chain are joined together (directly or through a peptide linker) to form a continuous polypeptide.
  • sFv single chain Fv
  • chloroplast transit peptide is an amino acid sequence which is translated in conjunction with a protein and directs the protein to the chloroplast or other plastid types present in the cell in which the protein is made.
  • Chloroplast transit sequence refers to a nucleotide sequence that encodes a chloroplast transit peptide.
  • a “signal peptide” is an amino acid sequence which is translated in conjunction with a protein and directs the protein to the secretory system (Clirispeels (1991) Ann. Rev. Plant Phys. Plant Mol. Biol. 42: 21-53).
  • a vacuolar targeting signal can further be added, or if to the endoplasmic reticulum, an endoplasmic reticulum retention signal may be added.
  • any signal peptide present should be removed and instead a nuclear localization signal . included (Raikhel. (1992) Plant Phys : 100: 1627-1632).
  • the terms "diversification” and "diversity,” as applied to a polynucleotide, refers to generation of a plurality of modified forms of a parental polynucleotide, or plurality of parental polynucleotides.
  • the polynucleotide encodes a polypeptide
  • diversity in the nucleotide sequence of the polynucleotide can result in diversity in the corresponding encoded polypeptide, e.g. a diverse pool of polynucleotides encoding a plurality of polypeptide variants.
  • this sequence diversity is exploited by screening/selecting a library of diversified polynucleotides for variants with desirable functional attributes, e.g., a polynucleotide encoding a GAT polypeptide with enhanced functional characteristics.
  • encoding refers to the ability of a nucleotide sequence to code for one or more amino acids.
  • an amino acid sequence can be encoded in any one of six different reading frames provided by a polynucleotide sequence and its complement.
  • the term “artificial variant” refers to a polypeptide having GAT activity, which is encoded by a modified GAT polynucleotide, e.g., a modified form ofany one ofSEQ LD NO: 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565, 566, 5
  • nucleic acid construct or “polynucleotide construct” means a nucleic acid molecule, either single-stranded or double-stranded, which is isolated from a naturally occurring gene or which has been modified to contain segments of nucleic acids in a manner that would not otherwise exist in nature.
  • nucleic acid construct is synonymous with the term "expression cassette" when the nucleic acid construct contains the control sequences required for expression of a coding sequence of the present invention.
  • control sequences is defined herein to include all components, which are necessary or advantageous for the expression of a polypeptide of the present invention.
  • Each control sequence may be native or foreign to the nucleotide sequence encoding the polypeptide.
  • control sequences include, but are not limited to, a leader sequence, polyadenylation sequence, propeptide sequence, promoter sequence, signal peptide sequence, and transcription terminator sequence.
  • the control sequences include a promoter and transcriptional and translational stop signals.
  • the control sequences may be provided with linkers for the purpose of introducing specific restriction sites facilitating ligation of the control sequences with the coding region of the nucleotide sequence encoding a polypeptide.
  • operably linked is defined herein as a configuration in which a control sequence is appropriately placed at a position relative to the coding sequence of the DNA sequence such that the control sequence directs the expression of a polypeptide.
  • coding sequence is intended to cover a nucleotide sequence, which directly specifies the amino acid sequence of its protein product. The boundaries of the coding sequence are generally determined by an open reading frame, which usually begins with the ATG start codon.
  • the coding sequence typically includes a DNA, cDNA, and/or recombinant nucleotide sequence.
  • the term "expression” includes any step involved in the production of the polypeptide including, but not limited to, transcription, post- transcriptional modification, translation, post-translational modification, and secretion.
  • expression vector covers a DNA molecule, linear or circular, that comprises a segment encoding a polypeptide of the invention, and which is operably linked to additional segments that provide for its transcription.
  • host cell includes any cell type which is susceptible to transformation with a nucleic acid construct.
  • plant includes whole plants, shoot vegetative organs/structures (e.g., leaves, stems and tubers), roots, flowers and floral organs/structures (e.g., bracts, sepals, petals, stamens, carpels, anthers and ovules), seed (including embryo, endosperm, and seed coat) and fruit (the mature ovary), plant tissue (e.g., vascular tissue, ground tissue, and the like) and cells (e.g., guard cells, egg cells, trichomes and the like), and progeny of same.
  • shoot vegetative organs/structures e.g., leaves, stems and tubers
  • roots e.g., flowers and floral organs/structures (e.g., bracts, sepals, petals, stamens, carpels, anthers and ovules)
  • seed including embryo, endosperm, and seed coat
  • fruit the mature ovary
  • plant tissue e.g., vascular tissue, ground tissue
  • the class of plants that can be used in the method of the invention is generally as broad as the class of higher and lower plants amenable to transformation techniques, including angiosperms (monocotyledonous and dicotyledonous plants), gymnosperms, ferns, and multicellular algae. It includes plants of a variety of ploidy levels, including aneuploid, polyploid, diploid, haploid and hemizygous.
  • ploidy levels including aneuploid, polyploid, diploid, haploid and hemizygous.
  • heterologous as used herein describes a relationship between two or more elements which indicates that the elements are not normally found in proximity to one another in nature.
  • a polynucleotide sequence is "heterologous to" an organism or a second polynucleotide sequence if it originates from a foreign species, or, if from the same species, is modified from its original form.
  • a promoter operably linked to a heterologous coding sequence refers to a coding sequence from a species different from that from wliich the promoter was derived, or, if from the same species, a coding sequence which is not naturally associated with the promoter (e.g., a genetically engineered coding sequence or an allele from a different ecotype or variety).
  • heterologous polypeptide is a polypeptide expressed from a recombinant polynucleotide in a transgenic organism.
  • Heterologous polynucleotides and polypeptides are forms of recombinant molecules. A variety of additional terms are defined or otherwise characterized herein.
  • GLYPHOSATE-N-ACETYLTRANSFERASES provides a novel family of isolated or recombinant enzymes referred to herein as "glyphosate-N-acetyltransferases," "GATs,” or “GAT enzymes.”
  • GATs are enzymes that have GAT activity, preferably sufficient activity to confer some degree of glyphosate tolerance upon a transgenic plant engineered to express the GAT.
  • GATs include GAT polypeptides, described in more detail below.
  • GAT-mediated glyphosate tolerance is a complex function of GAT activity,
  • One of skill in the art can determine without undue experimentation the level of GAT activity required to effect glyphosate tolerance in a particular context.
  • GAT activity can be characterized using the conventional kinetic parameters k oat , K M , and k cat /K M - k cat can be thought of as a measure of the rate of acetylation, particularly at high substrate concentrations, K M is a measure of the affinity of the GAT for its substrates (e.g., acetyl CoA, propionyl CoA and glyphosate), and k cat /K M is a measure of catalytic efficiency that takes both substrate affinity and catalytic rate into account.
  • K M is a measure of the affinity of the GAT for its substrates (e.g., acetyl CoA, propionyl CoA and glyphosate)
  • k cat /K M is a measure of catalytic efficiency that takes both substrate affinity and catalytic rate into account.
  • k cat /K m is particularly important in the situation where the concentration of a substrate is at least partially rate- limiting, h general, a GAT with a higher kcat or k cat /K is a more efficient catalyst than another GAT with lower k cat or k cai IKu. A GAT with a lower KM is a more efficient catalyst than another GAT with a higher KM- Thus, to determine whether one GAT is more effective than another, one can compare kinetic parameters for the two enzymes. The relative importance of k cat , k cat K M and K will vary depending upon the context in which the GAT will be expected to function, e.g., the anticipated effective concentration of glyphosate relative to the KM for glyphosate. GAT activity can also be characterized in terms of any of a number of functional characteristics, including but not limited to stability, susceptibility to inhibition, or activation by other molecules.
  • GLYPHOSATE-N-ACETYLTRANSFERASE POLYPEPTIDES h one aspect, the invention provides a novel family of isolated or recombinant polypeptides referred to herein as "glyphosate-N-acetyltransferase polypeptides" or "GAT polypeptides.”
  • GAT polypeptides are characterized by their structural similarity to a novel family of GATs. Many but not all GAT polypeptides are GATs. The distinction is that GATs are defined in terms of function, whereas GAT polypeptides are defined in terms of structure.
  • GAT polypeptides A subset of the GAT polypeptides consists of those GAT polypeptides that have GAT activity, preferably at a level that will function to confer glyphosate resistance upon a transgenic plant expressing the protein at an effective level.
  • Some preferred GAT polypeptides for use in conferring glyphosate tolerance have a k cat of at least 1 min " l , ox more preferably at least 10 min "1 , 100 min "1 or 1000 min "1 .
  • Other preferred GAT polypeptides for use in conferring glyphosate tolerance have a K M no greater than 100 mM, or more preferably no greater than 10 mM, 1 mM, or 0.1 mM.
  • GAT polypeptides for use in conferring glyphosate tolerance have a cat K M of at least 1 rnM ' n "1 or more, preferably at least 10 mM ' Vin "1 , 100 mM " 1 min '1 , 1000 mM ⁇ min "1 , or 10,000 mM ' "1 .
  • Exemplary GAT polypeptides have been isolated and characterized from a variety of bacterial strains.
  • One example of a monomeric GAT polypeptide that has been isolated and characterized has a molecular radius of approximately 17 kD.
  • licheniformis SEQ LD NO:7, exhibits a K m for glyphosate of approximately 2.9 mM and a K m for acetyl CoA of approximately 2 ⁇ M, with a k cat equal to 6/minute.
  • GAT polypeptide refers to any polypeptide comprising an amino acid sequence that can be optimally aligned with an amino acid sequence selected from the group consisting of SEQ LD NO:300, 445, and 457 to generate a similarity score of at least 460 using the BLOSUM62 matrix, a gap existence penalty of 11, and a gap extension penalty of 1, wherein at least one of the following positions conform to the following restrictions: (i) at positions 18 and 38, there is a Z5 amino acid residue; (ii) at position 62, there is a ZI amino acid residue; (iii) at position 124, there is a Z6 amino acid residue; and (iv) at position 144, there is a Z2 amino acid residue, wherein: ZI is an amino acid residue selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid residue selected from the group consisting of F, W, and Y; Z5 is an amino acid residue selected from the group consisting of D and E; and Z6
  • GAT polypeptides comprising an amino acid sequence that can be optimally aligned with an amino acid sequence selected from the group consisting of SEQ LD NO: 300, 445, and 457 to generate a similarity score of at least 440, 445, 450, 455, 460, 465, 470, 475, 480, 485, 490, 495, 500, 505, 510, 515, 520, 525, 530, 535, 540, 545, 550, 555, 560, 565, 570, 575, 580, 585, 590, 595, 600, 605, 610, 615, 620, 625, 630, 635, 640, 645, 650, 655, 660, 665, 670, 675, 680, 685, 690, 695, 700, 705, 710, 715, 720, 725, 730, 735, 740, 745, 750, 755, or 760 using the BLOSUM62
  • Two sequences are "optimally aligned” when they are aligned for similarity scoring using a defined amino acid substitution matrix (e.g., BLOSUM62), gap existence penalty and gap extension penalty so as to arrive at the highest score possible for that pair of sequences.
  • Amino acid substitution matrices and their use in quantifying the similarity between two sequences are well-known in the art and described, e.g., in Dayhoff et al. (1978) "A model of evolutionary change in proteins” in "Atlas of Protein Sequence and Structure," Vol. 5, Suppl. 3 (ed. M.O. Dayhoff), pp. 345-352. Natl. Biomed. Res. Found., Washington, DC and Henikoff et al. (1992) Proc.
  • the BLOSUM62 matrix (Fig. 10) is often used as a default scoring substitution matrix in sequence alignment protocols such as Gapped BLAST 2.0.
  • the gap existence penalty is imposed for the introduction of a single amino acid gap in one of the aligned sequences, and the gap extension penalty is imposed for each additional empty amino acid position inserted into an already opened gap.
  • the alignment is defined by the amino acids positions of each sequence at which the alignment begins and ends, and optionally by the insertion of a gap or multiple gaps in one or both sequences so as to arrive at the highest possible score.
  • BLAST 2.0 a computer-implemented alignment algorithm
  • NCBI National Center for Biotechnology Information
  • Optimal alignments, including multiple alignments, can be prepared using, e.g., PSI-BLAST, available through the NCBI website and described by Altschul et al. (1997) Nucl. Acids Res. 25:3389-3402.
  • an amino acid residue “corresponds to” the position in the reference sequence with which the residue is paired in the alignment.
  • the "position” is denoted by a number that sequentially identifies each amino acid in the reference sequence based on its position relative to the N-terminus. For example, in SEQ LD NO:300, position 1 is M, position 2 is I, position 3 is E, etc.
  • a residue in the test sequence that aligns with the E at position 3 is said to "correspond to position 3" of SEQ LD NO:300.
  • the amino acid residue number in a test sequence as determined by simply counting from the N-terminal will not necessarily be the same as the number of its corresponding position in the reference sequence.
  • the amino acid residue number in a test sequence as determined by simply counting from the N-terminal will not necessarily be the same as the number of its corresponding position in the reference sequence.
  • that insertion will not correspond to any amino acid position in the reference sequence.
  • GAT polypeptide further refers to any polypeptide comprising an amino acid sequence selected from the group consisting of: (a) an amino acid sequence that is at least 98% identical to SEQ LD NO:577 (b) an amino acid sequence that is at least 97% identical to SEQ LD NO:578 (c) an amino acid sequence that is at least 97% identical to SEQ LD NO:621 (d) an amino acid sequence that is at least 98% identical to SEQ LD NO:579 (e) an amino acid sequence that is at least 98% identical to SEQ LD NO:602 (f) an amino acid sequence that is at least 95% identical to SEQ LD NO:697 (g) an amino acid sequence that is at least 96% identical to SEQ LD NO:721 (h) an amino acid sequence that is at least 97% identical to SEQ LD
  • GAT polypeptide further refers to any polypeptide comprising an amino acid sequence having at least 89% sequence identity with residues 1-96 of the amino acid sequence of SEQ LD NO:677; an amino acid sequence having at least 95% sequence identity with residues 1-96 of the amino acid sequence of SEQ LD NO: 697; an amino acid sequence having at least 96% sequence identity with residues 1-96 of the amino acid sequence selected from the group consisting of SEQ LD NO:584, 612, and 721; an amino acid sequence having at least 97% sequence identity with residues 1-96 of the amino acid sequence selected from the group consisting of SEQ LD NO:578, 613, and 621; an amino acid sequence having at least 98% sequence identity with residues 1-96 of the amino acid sequence selected from the group consisting of SEQ LD NO:577, 579, 590, 602, 616, and 707.
  • GAT polypeptide further refers to any polypeptide comprising an amino acid sequence having at least 89%) sequence identity with residues 51- 146 of the amino acid sequence of SEQ LD NO: 677; an amino acid sequence having at least 95% sequence identity with residues 51-146 of the amino acid sequence of SEQ LD NO:697; an amino acid sequence having at least 96% sequence identity with residues 51-146 of the amino acid sequence selected from the group consisting of SEQ LD NO:584, 612, and 721; an amino acid sequence having at least 97% sequence identity with residues 51-146 of the amino acid sequence selected from the group consisting of SEQ LD NO:578, 613, and 621; an amino acid sequence having at least 98% sequence identity with residues 51-146 of the amino acid sequence selected from the group consisting of SEQ LD NO:577, 579, 590, 602, 616, and 707.
  • GAT polypeptide further refers to any polypeptide comprising an amino acid sequence selected from the group consisting of: (a) an amino acid sequence that is at least 96% identical to residues 2-146 of SEQ LD NO:919; (b) an amino acid sequence that is at least 97% identical to residues 2-146 of SEQ LD;
  • GAT polypeptide further refers to any polypeptide comprising an amino acid sequence that is at least 95% identical to residues 2-146 of SEQ LD NO:929 and which comprises a Gly or an Asn residue at the amino acid position corresponding to position 33 of SEQ LD NO:929.
  • GAT polypeptide further refers to any polypeptide comprising an amino acid sequence that shares at least 60%, 65%, 70%, 75%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%), 98%, 99%, or more sequence identity with an exemplary GAT polypeptide disclosed herein.
  • GAT polypeptides of the invention include polypeptides comprising an amino acid sequence that shares at least 60%, 65%, 70%, 75%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or more sequence identity with any of SEQ LD NO: 953, 954, 955, 956, 957, 958, 959, 960, 961, 962, 963, 964, 965, 966, 967, 968, 969, 970, 971, and 972.
  • the invention provides an isolated or recombinant polypeptide that comprises at least 20, or alternatively, at least 50, at least 75, at least 100, at least 125, at least 130, at least 135, at least 140, at least 141, at least 142, at least 143, at least 144 or at least 145 contiguous amino acids of an amino acid sequence selected from the group consisting of: (a) an amino acid sequence that is at least 98% identical to SEQ LD NO:577; (b) an amino acid sequence that is at least 97% identical to SEQ LD NO:578; (c) an amino acid sequence that is at least 97% identical to SEQ LD NO:621; (d) an amino acid sequence that is at least 98% identical to SEQ LD NO:579; (e) an amino acid sequence that is at least 98% identical to SEQ LD NO:602; (f) an amino acid sequence that is at least 95% identical to SEQ LD NO:697; (g) an amino acid sequence that is at least 96%
  • the amino acid sequence of the polypeptide comprises Met, Met- Ala, or Met- Ala- Ala on the N-terminal side of the amino acid corresponding to position 2 of the reference amino acid sequence.
  • Some preferred GAT polypeptides of the invention can be optimally aligned with a reference amino acid sequence selected from the group consisting of SEQ LD NO:300, 445, and 457 to generate a similarity score of at least 460 using the BLOSUM62 matrix, a gap existence penalty of 11, and a gap extension penalty of 1, wherein at least one of the following positions conforms to the following restrictions: (i) at positions 18 and 38, there is a Z5 amino acid residue; (ii) at position 62, there is a ZI amino acid residue; (iii) at position 124, there is a Z6 amino acid residue; and (iv) at position 144, there is a Z2 amino acid residue, wherein: ZI is an amino acid residue selected from the group consisting of A, I, L, M
  • the amino acid residue is Bl; and (b) at positions 3, 5, 8, 10, 11, 14, 17, 24, 27, 32, 37, 47, 48, 49, 52, 57, 58, 61, 63, 68, 69, 79, 80, 82, 83, 89, 92, 100, 101, 104, 119, 120, 125, 126, 128, 131, and/or 143 the amino acid residue is B2; wherein Bl is an amino acid selected from the group consisting of A, I, L, M, F, W, Y, and V; and B2 is an amino acid selected from the group consisting of R, N, D, C, Q, E, G, H, K, P, S, and T.
  • Some preferred GAT polypeptides of the invention can be optimally aligned with a reference amino acid sequence selected from the group consisting of SEQ ID NO: 300, 445, and 457 to generate a similarity score of at least 460 using the BLOSUM62 matrix, a gap existence penalty of 11, and a gap extension penalty of 1, wherein at least one of the following positions conforms to the following restrictions: (i) at positions 18 and 38, there is a Z5 amino acid residue; (ii) at position 62, there is a ZI amino acid residue; (iii) at position 124, there is a Z6 amino acid residue; and (iv) at position 144, there is a Z2 amino acid residue, wherein: ZI is an amino acid residue selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid residue selected from the group consisting of F, W, and Y; Z5 is an amino acid residue selected from the group consisting of D and E; and Z6 is an amino acid residue selected from the group
  • the amino acid residue is ZI; (b) at positions 31 and/or 45 the amino acid residue is Z2; (c) at position 8 the amino acid residue is Z3; (d) at position 89 the amino acid residue is Z3 or Z6; (e) at positions 82, 92, 101 and/or 120 the amino acid residue is Z4; (f) at positions 3, 11, 27 and/or 79 the amino acid residue is Z5; (g) at position 18 the amino acid residue is Z4 or Z5; (h) at position 123 the amino acid residue is ZI or Z2; (i) at positions 12, 33, 35, 39, 53, 59, 112, 132, 135, 140, and/or 146 the amino acid residue is ZI or Z3; (j) at position 30 the amino acid residue is ZI; (k) at position 6 the amino acid residue is Z6; (1) at position 81 the amino acid residue is Z2 or Z4; (m) at position 113 the amino acid residue is Z3
  • Some preferred GAT polypeptides of the invention further comprise the amino acid residues in the amino acid sequence that correspond to the positions specified in (a)-(am), wherein at least 90% conform to the amino acid residue restrictions specified in (a)-(am).
  • Some preferred GAT polypeptides of the invention additionally comprise amino acid residues in the amino acid sequence that correspond to the following positions, wherein at least 90% conform to the following restrictions: (a) at positions 1, 7, 9, 13, 20, 36, 42, 46, 50, 56, 64, 70, 72, 75, 76, 78, 94, 98, 107, 110, 117, 118, 121, and/or 141 the amino acid residue is Bl; and (b) at positions 16, 21, 22, 23, 25, 29, 34, 41, 43, 44, 55, 66, 71, 73, 74, 77, 85, 87, 88, 95, 99, 102, 108, 109, 111, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid
  • Some preferred GAT polypeptides of the invention additionally comprise amino acid residues in the amino acid sequence that correspond to the following positions, wherein at least 90% conform to the following restrictions: (a) at positions 1, 7, 9, 13, 20, 42, 46, 50, 56, 64, 70, 72, 75, 76, 78, 94, 98, 107, 110, 117, 118, 121, and/or 141 the amino acid residue is Bl; and (b) at positions 16, 21, 22, 23, 25, 29, 34, 36, 41, 43, 44, 55, 66, 71, 73, 74, 77, 85, 87, 88, 95, 99,102, 108, 109, 111, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is B2; wherein Bl is an amino acid selected from the group consisting of A, I, L, M, F, W, Y, and V; and B2 is an amino acid selected from the group consisting of R, N, D, C, Q
  • Some preferred GAT polypeptides of the invention additionally comprise amino acid residues in the amino acid sequence that correspond to the following positions, wherein at least 90% conform to the following restrictions: (a) at positions 1, 7, 9, 20, 42, 50, 72, 75, 76, 78, 94, 98, 110, 121, and/or 141 the amino acid residue is ZI; (b) at positions 13, 46, 56, 70, 107, 117, and/or 118 the amino acid residue is Z2; (c) at positions 23, 55, 71, 77, 88, and/or 109 the amino acid residue is Z3; (d) at positions 16, 21, 41, 73, 85, 99, and/or 111 the amino acid residue is Z4; (e) at positions 34 and/or 95 the amino acid residue is Z5; (f) at position 22, 25, 29, 43, 44, 66, 74, 87, 102, 108, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is Z6; where
  • Some preferred GAT polypeptides of the invention further comprise an amino acid residue at position 36 which is selected from the group consisting of ZI and Z3. Some preferred GAT polypeptides of the invention further comprise an amino acid residue at position 64 which is selected from the group consisting of ZI and Z2.
  • Some preferred GAT polypeptides of the invention further comprise amino acid residues in the amino acid sequence that correspond to the following positions, wherein at least 80% conform to the following restrictions: (a) at position 2 the amino acid residue is I or L; (b) at position 3 the amino acid residue is E; (c) at position 4 the amino acid residue is V or I; (d) at position 5 the amino acid residue is K; (e) at position 6 the amino acid residue is P; (f) at position 8 the amino acid residue is N; (g) at position 10 the amino acid residue is E; (h) at position 11 the amino acid residue is D or E; (i) at position 12 the amino acid residue is T; (j) at position 14 the amino acid residue is E or D; (k) at position 15 the amino acid residue is L; (1) at position 17 the amino acid residue is H; (m) at position 18 the amino acid residue is R, E or K; (n) at position 19 the amino acid residue is I or V; (o) at position 24 the amino acid residue is Q; (p) at position 26
  • Some preferred GAT polypeptides of the invention further comprise amino acid residues in the amino acid sequence that correspond to the positions specified in (a) - (ch) above, wherein at least 90% conform to the amino acid residue restrictions specified in (a) - (ch).
  • Some preferred GAT polypeptides of the invention can be optimally aligned with a reference amino acid sequence selected from the group consisting of SEQ LD NO: 300, 445, and 457 to generate a similarity score of at least 460 using the BLOSUM62 matrix, a gap existence penalty of 11, and a gap extension penalty of 1, wherein at least one of the following positions conforms to the following restrictions: (i) at positions 18 and 38, there is a Z5 amino acid residue; (ii) at position 62, there is a ZI amino acid residue; (iii) at position 124, there is a Z6 amino acid residue; and (iv) at position 144, there is a Z2 amino acid residue; wherein: ZI is an amino acid residue selected from the group consisting
  • Some preferred GAT polypeptides of the invention further comprise amino acid sequences wherein the amino acid residues meet at least one of the following restrictions: (a) at position 36 the amino acid residue is M, L, or T; (b) at position 72 the amino acid residue is L or I; (c) at position 75 the amino acid residue is M or V; (d) at position 64 the amino acid residue is L, I, or F; (e) at position 88 the amino acid residue is T or S; and (f) at position 117 the amino acid residue is Y or F.
  • Some preferred GAT polypeptides of the invention comprise an amino acid sequence wherein the amino acid residues meet at least one of the following additional restrictions: (a) at position 14 the amino acid residue is D; (b) at position 18 the amino acid residue is E; (c) at position 26 the amino acid residue is M or V; (e) at position 30 the amino acid residue is I; (f) at position 32 the amino acid residue is D; (g) at position 36 the amino acid residue is M or T; (i) at position 37 the amino acid residue is C; (j) at position 38 the amino acid residue is D; (j) at position 53 the amino acid residue is V; (k) at position 58 the amino acid residue is R; (1) at position 61 the amino acid residue is R; (m) at position 62 the amino acid residue is L; (n) at position 64 the amino acid residue is I or F; (o) at position 65 the amino acid residue is P; (p) at position 72 the amino acid residue is I; (q) at position 75 the amino acid residue is V; (r) at position
  • Some preferred GAT polypeptides of the invention comprise an amino acid sequence wherein of the amino acid residues that correspond to the positions specified in (a) through (ab) as described above, at least 80% conform to the amino acid residue restrictions specified in (a) through (ab).
  • Some preferred GAT polypeptides of the invention have an amino acid sequence that comprises amino acid residues at least one of which meets the following additional restrictions: (a) at position 41 the amino acid residue is H; (b) at position 138 the amino acid residue is H; (c) at position 34 the amino acid residue is N; and (d) at position 55 the amino acid residue is S.
  • Some preferred GAT polypeptides of the invention comprise an amino acid sequence selected from the group consisting of: (a) an amino acid sequence that is at least 98% identical to SEQ LD NO:577; (b) an amino acid sequence that is at least 97%o identical to SEQ LD NO: 578; (c) an amino acid sequence that is at least 97% identical to SEQ LD NO:621 ; (d) an amino acid sequence that is at least 98% identical to SEQ LD NO:579; (e) an amino acid sequence that is at least 98% identical to SEQ LD NO:602; (f) an amino acid sequence that is at least 95% identical to SEQ LD NO:697; (g) an amino acid sequence that is at least 96% identical to SEQ LD NO: 721; (h) an amino acid sequence that is at least 97% identical to SEQ LD NO:613; (i) an amino acid sequence that is at least 89% identical to SEQ LD NO: 677; (j) an amino acid sequence that is at least 9
  • Some preferred GAT polypeptides of the invention comprise an amino acid sequence selected from the group consisting of: (a) an amino acid sequence that is at least 98% identical to SEQ LD NO:577; (b) an amino acid sequence that is at least 97% identical to SEQ LD NO:578; (c) an amino acid sequence that is at least 97% identical to SEQ LD NO:621; (d) an amino acid sequence that is at least 98% identical to SEQ LD NO: 579; (e) an amino acid sequence that is at least 98% identical to SEQ LD NO:602; (f) an amino acid sequence that is at least 95% identical to SEQ LD NO: 697; (g) an amino acid sequence that is at least 96% identical to SEQ ED NO: 721 ; (h) an amino acid sequence that is at least 97% identical to SEQ LD NO:613; (i) an amino acid sequence that is at least 89% identical to SEQ LD NO: 677; (j) an amino acid sequence that is at least 96%
  • Some preferred GAT polypeptides of the invention comprise an amino acid sequence selected from the group consisting of: (a) an amino acid sequence that is at least 98% identical to SEQ LD NO: 577; (b) an amino acid sequence that is at least 97% identical to SEQ LD NO:578; (c) an amino acid sequence that is at least 97% identical to SEQ LD NO:621; (d) an amino acid sequence that is at least 98% identical to SEQ LD NO:579; (e) an amino acid sequence that is at least 98% identical to SEQ LD NO:602; (f) an amino acid sequence that is at least 95% identical to SEQ LD NO:697; (g) an amino acid sequence that is at least 96% identical to SEQ LD NO: 721; (h) an amino acid sequence that is at least 97% identical tq SEQ LD NO:613; (i) an amino acid sequence that is at least 89% identical to SEQ LD NO: 677; (j) an amino acid sequence that is at least 9
  • Some preferred GAT polypeptides of the invention comprise an amino acid sequence selected from the group consisting of: (a) an amino acid sequence that is at least 98% identical to SEQ ED NO:577; (b) an amino acid sequence that is at least 97% identical to SEQ LD NO:578; (c) an amino acid sequence that is at least 97% identical to SEQ LD NO:621; (d) an amino acid sequence that is at least 98% identical to SEQ LD NO:579; (e) an amino acid sequence that is at least 98% identical to SEQ ED NO:602; (f) an amino acid sequence that is at least 95% identical to SEQ LD NO: 697; (g) an amino acid sequence that is at least 96% identical to SEQ LD NO: 721; (h) an amino acid sequence that is at least 97% identical to SEQ LD NO:613; (i) an amino acid sequence that is at least 89% identical to SEQ LD NO:677; (j) an amino acid sequence that is at least 96% identical
  • Some preferred GAT polypeptides of the invention further comprise amino acid residues in the amino acid sequence that correspond to the positions specified in (a)-(am), wherein at least 90% conform to the amino acid residue restrictions specified in (a)-(am).
  • Some preferred GAT polypeptides of the invention comprise amino acid residues in the amino acid sequence that correspond to the following positions wherein at least 90% conform to the following additional restrictions: (a) at positions 1, 7, 9, 13, 20, 36, 42, 46, 50, 56, 64, 70, 72, 75, 76, 78, 94, 98, 107, 110, 117, 118, 121, and/or 141 the amino acid residue is Bl; and (b) at positions 16, 21, 22, 23, 25, 29, 34, 41, 43, 44, 55, 66, 71, 73, 74, 77, 85, 87, 88, 95, 99, 102, 108, 109, 111, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is
  • Some preferred GAT polypeptides of the invention comprise amino acid residues in the amino acid sequence that correspond to the following positions wherein at least 90% conform to the following additional restrictions: (a) at positions 1, 7, 9, 13, 20, 42, 46, 50, 56, 64, 70, 72, 75, 76, 78, 94, 98, 107, 110, 117, 118, 121, and/or 141 the amino acid residue is Bl; and (b) at positions 16, 21, 22, 23, 25, 29, 34, 36, 41, 43, 44, 55, 66, 71, 73, 74, 77, 85, 87, 88, 95, 99,102, 108, 109, 111, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is B2; wherein Bl is an amino acid selected from the group consisting of A, I, L, M, F, W, Y, and V; and B2 is an amino acid selected from the group consisting of R, N, D, C, Q,
  • Some preferred GAT polypeptides of the invention comprise amino acid residues in the amino acid sequence that correspond to the following positions wherein at least 90% conform to the following additional restrictions: (a) at positions 1, 7, 9, 20, 42, 50, 72, 75, 76, 78, 94, 98, 110, 121, and/or 141 the amino acid residue is ZI; (b) at positions 13, 46, 56, 70, 107, 117, and/or 118 the amino acid residue is Z2; (c) at positions 23, 55, 71, 77, 88, and/or 109 the amino acid residue is Z3; (d) at positions 16, 21, 41, 73, 85, 99, and/or 111 the amino acid residue is Z4; (e) at positions 34 and/or 95 the amino acid residue is Z5; (f) at position 22, 25, 29, 43, 44, 66, 74, 87, 102, 108, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is Z6; wherein
  • Some preferred GAT polypeptides of the invention further comprise an amino acid sequence wherein the amino acid residue at position 36 is selected from the group consisting of ZI and Z3. Some preferred GAT polypeptides of the invention further comprise an amino acid sequence wherein the amino acid residue at position 64 is selected from the group consisting of ZI and Z2.
  • Some preferred GAT polypeptides of the invention comprise an amino acid sequence wherein of the amino acid residues that correspond to the following positions, at least 80% conform to the following additional restrictions: (a) at position 2 the amino acid residue is I or L; (b) at position 3 the amino acid residue is E; (c) at position 4 the amino acid residue is V or I; (d) at position 5 the amino acid residue is K; (e) at position 6 the amino acid residue is P; (f) at position 8 the amino acid residue is N; (g) at position 10 the amino acid residue is E; (h) at position 11 the amino acid residue is D or E; (i) at position 12 the amino acid residue is T; (j) at position 14 the amino acid residue is E or D; (k) at position 15 the amino acid residue is L; (1) at position 17 the amino acid residue is H; (m) at position 18 the amino acid residue is R, E or K; (n) at position 19 the amino acid residue is I or V; (o) at position 24 the amino acid residue is Q; (p) at position
  • Some preferred GAT polypeptides of the invention comprise an amino acid sequence in which of the residues that correspond to the positions specified in (a) - (ch) above, at least 90% conform to the amino acid residue restrictions specified in (a) - (ch).
  • Some preferred GAT polypeptides of the invention comprise an amino acid sequence selected from the group consisting of: (a) an amino acid sequence that is at least 98% identical to SEQ ID NO:577; (b) an amino acid sequence that is at least 97% identical to SEQ LD NO:578; (c) an amino acid sequence that is at least 97% identical to SEQ LD NO:621; (d) an amino acid sequence that is at least 98% identical to SEQ LD NO:579; (e) an amino acid sequence that is at least 98% identical to SEQ LD NO:602; (f) an amino acid sequence that is at least 95% identical to SEQ LD NO:697; (g) an amino acid sequence that is at least 96% identical to SEQ LD NO: 721; (h
  • Some preferred GAT polypeptides of the invention further comprise an amino acid sequence in which at least one of the following criteria is met: (a) at position 14 the amino acid residue is D; (b) at position 18 the amino acid residue is E; (c) at position 26 the amino acid residue is M or V; (e) at position 30 the amino acid residue is I; (f) at position 32 the amino acid residue is D; (g) at position 36 the amino acid residue is M or T; (i) at position 37 the amino acid residue is C; (j) at position 38 the amino acid residue is D; (j) at position 53 the amino acid residue is V; (k) at position 58 the amino acid residue is R; (1) at position 61 the amino acid residue is R; (m) at position 62 the amino acid residue is L; (n) at position 64 the amino acid residue is I or F; (o) at position 65 the amino acid residue is P; (p) at position 72 the amino acid residue is I; (q) at position 75 the amino acid residue is V; (r) at position 88 the amino acid
  • Some preferred GAT polypeptides of the invention further comprise an amino acid sequence wherein of the amino acid residues that correspond to the positions specified in (a) through (ab) as described above, at least 80% conform to the amino acid residue restrictions specified in (a) through (ab).
  • Some preferred GAT polypeptides of the invention further comprise an amino acid sequence wherein the following conditions are also met: (a) at position 41 the amino acid residue is H; (b) at position 138 the amino acid residue is H; (c) at position 34 the amino acid residue is N; and (d) at position 55 the amino acid residue is S.
  • Some preferred GAT polypeptides of the invention when optimally aligned with a reference amino acid sequence selected from the group consisting of SEQ LD NO: 300, 445, and 457 to generate a similarity score of at least 460 using the BLOSUM62 matrix, a gap existence penalty of 11, and a gap extension penalty of 1, have amino acid sequences such that one or more of the following positions conform to the following restrictions: (i) at positions 18 and 38, there is a Z5 amino acid residue; (ii) at position 62, there is a ZI amino acid residue; (iii) at position 124, there is a Z6 amino acid residue; and (iv) at position 144, there is a Z2 amino acid residue, wherein: ZI is an amino acid residue selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid residue selected from the group consisting of F, W, and Y; Z5 is an amino acid residue selected from the group consisting of D and E; and Z6 is an amino acid residue
  • the amino acid residue in the polypeptide corresponding to position 28 is V, I or A.
  • Valine or isoleucine at position 28 generally correlates with reduced K
  • alanine at that position generally correlates with increased k ca t.
  • Threonine at position 89 and arginine at position 58 generally correlates with reduced K M -
  • Other preferred GAT polypeptides are characterized by having 127 (i.e., an I at position 27), M30, D34, S35, R37, S39, H41, G48, K49, N57, Q58, P62, T62, Q65, Q67, K68, V75, E83, S89, A96, E96, R101, T112, A114, K119, K120, E128, V129, D131, T131, V132, V134, V135, H138, R144, 1145, or T146, or any combination thereof.
  • GAT polypeptides of the invention comprise an amino acid sequence selected from the group consisting of SEQ LD NO: 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 621, 623, 625, 627, 629, 631, 633, 635, 637, 639, 641, 643, 645, 647, 649, 651, 653, 655, 657, 659, 661, 663, 665, 667, 669, 671, 673, 6
  • the invention provides an isolated or recombinant ' polypeptide that comprises at least 20, or alternatively, at least 50, at least 75, at least 100, at least 125, at least 130, at least 135, at least 140, at least 141, at least 142, at least 143, at least 144 or at least 145 contiguous amino acids of an amino acid sequence selected from the groups consisting of: (a) an amino acid sequence that is at least 96% identical to SEQ LD NO:919 (such as, for example, SEQ LD NO:917, 919, 921, 923, 925, 927, 833, 835, 839, 843, 845, 859, 863, 873, 877, 891, 895, 901, 905, 905, 907, 913, 915, or 950); (b) an amino acid sequence that is at least 97% identical to SEQ LD NO:929 (such as, for example, SEQ LD NO:929, 931, 835, 843, 849
  • SEQ LD NO: 837 (u) an amino acid sequence that is at least 99% identical to SEQ ED NO: 841; (v) an amino acid sequence that is at least 99% identical to SEQ LD NO:865; (w) an amino acid sequence that is at least 99% identical to SEQ LD NO:869; and (x) an amino acid sequence that is at least 99% identical to SEQ LD NO:879.
  • the invention provides an isolated or recombinant polypeptide that comprises at least 20, or alternatively, at least 50, at least 75, at least 100, at least 125, at least 130, at least 135, at least 140, at least 141, at least 142, at least 143, at least 144 or at least 145 contiguous amino acids of an amino acid sequence that is at least 95% identical to SEQ LD NO:929 and which comprises a Gly or an Asn residue at the amino acid position corresponding to position 33 of SEQ ED NO:929 (such as, for example, SEQ LD NO:837, 849, 893, 897, 905, 921, 927, 929 or 931).
  • the invention provides a polypeptide comprising residues 2-146 of an amino acid sequence selected from the group consisting of SEQ LD NO: 833, 835, 837, 839, 841, 843, 845, 847, 849, 851, 853, 855, 857, 859, 861, 863, 865, 867, 869, 871, 873, 875, 877, 879, 881, 883, 885, 887, 889, 891, 893, 895, 897, 899, 901, 903, 905, 907, 909, 911, 913, 915, 917, 919, 921, 923, 925, 927, 929, 931, 953, 954, 955, 956, 957, 958, 959, 960, 961, 962, 963, 964, 965, 966, 967, 968, 969, 970, 971, and 972.
  • the amino acid sequence of the polypeptide comprises Met, Met- Ala, or Met- Ala- Ala on the N-terminal side of the amino acid corresponding to position 2 of the reference amino acid sequence.
  • Some preferred GAT polypeptides of the invention comprise an amino acid sequence selected from the group consisting of SEQ LD NO: 833, 835, 837, 839, 841, 843, 845, 847, 849, 851, 853, 855, 857, 859, 861, 863, 865, 867, 869, 871, 873, 875, 877, 879, 881, 883, 885, 887, 889, 891, 893, 895, 897, 899, 901, 903, 905, 907, 909, 911, 913, 915, 917, 919, 921, 923, 925, 927, 929, 931, 946, 948, and 950.
  • the invention further provides preferred GAT polypeptides that are characterized by a combination of the foregoing amino acid residue position restrictions.
  • the invention provides GAT polynucleotides encoding the preferred GAT polypeptides described above, and complementary nucleotide sequences thereof. Some aspects of the invention pertain particularly to the subset ofany of the above-described categories of GAT polypeptides having GAT activity, as described herein. These GAT polypeptides are preferred, for example, for use as agents for conferring glyphosate resistance upon a plant. Examples of desired levels of GAT activity are described herein.
  • the GAT polypeptides comprise an amino acid sequence encoded by a recombinant or isolated form of naturally occurring nucleic acids isolated from a natural source, e.g., a bacterial strain. Wild-type polynucleotides encoding such GAT polypeptides may be specifically screened for by standard techniques known in the art.
  • the invention also includes isolated or recombinant polypeptides which are encoded by an isolated or recombinant polynucleotide comprising a nucleotide sequence which hybridizes under stringent conditions over substantially the entire length of a nucleotide sequence selected from the group consisting of SEQ LD NO: 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565, 566, 567, 620, 622, 624, 626, 628, 630, 632, 634, 636
  • the invention also includes isolated or recombinant polypeptides which are encoded by an isolated or recombinant polynucleotide comprising a nucleotide sequence which hybridizes under stringent conditions over substantially the entire length of a nucleotide sequence selected from the group consisting of SEQ LD NO: 832, 834, 836, 838, 840, 842, 844, 846, 848, 850, 852, 854, 856, 858, 860, 862, 864, 866, 868, 870, 872, 874, 876, 878, 880, 882, 884, 886, 888, 890, 892, 894, 896, 898, 900, 902, 904, 906, 908, 910, 912, 914, 916, 918, 920, 922, 924, 926, 928, and 930, their complements, and nucleotide sequences encoding an amino acid sequence selected from the group consisting of SEQ
  • the invention further includes any polypeptide having GAT activity that is encoded by a fragment of any of the GAT-encoding polynucleotides described herein.
  • the invention also provides fragments of GAT polypeptides that can be spliced together to form a functional GAT polypeptide. Splicing can be accomplished in vitro or in vivo, and can involve cis- or trans-splicing (i.e., intramolecular or intermolecular splicing). The fragments themselves can, but need not, have GAT activity. For example, two or more segments of a GAT polypeptide can be separated by inteins; removal of the intein sequence by cis- splicing results in a functional GAT polypeptide.
  • an encrypted GAT polypeptide can be expressed as two or more separate fragments; trans-splicing of these segments results in recovery of a functional GAT polypeptide.
  • Various aspects of cis- and trans-splicing, gene encryption, and introduction of intervening sequences are described in more detail in U.S. Patent Application Serial Nos. 09/517,933 and 09/710,686, both of which are incorporated by reference herein in their entirety.
  • the invention includes any polypeptide encoded by a modified GAT polynucleotide derived by mutation, recursive sequence recombination, and/or diversification of the polynucleotide sequences described herein.
  • a GAT polypeptide is modified by single or multiple amino acid substitutions, a deletion, an insertion, or a combination of one or more of these types of modifications. Substitutions can be conservative or non- conservative, can alter function or not, and can add new function.
  • Insertions and deletions can be substantial, such as the case of a truncation of a substantial fragment of the sequence, or in the fusion of additional sequence, either internally or at N or C terminal, h some embodiments of the invention, a GAT polypeptide is part of a fusion protein comprising a functional addition such as, for example, a secretion signal, a chloroplast transit peptide, a purification tag, or any of the numerous other functional groups that will be apparent to the skilled artisan and which are described in more detail elsewhere in this specification. Polypeptides of the invention may contain one or more modified amino acid.
  • modified amino acids may be advantageous in, for example, (a) increasing polypeptide in vivo half-life, (b) reducing or increasing polypeptide antigenicity, and (c) increasing polypeptide storage stability.
  • Amino acid(s) are modified, for example, co-translationally or post-translationally during recombinant production (e.g., N-linked glycosylation at N-X-S/T motifs during expression in mammalian cells) or modified by synthetic means.
  • Non-limiting examples of a modified amino acid include a glycosylated amino acid, a sulfated amino acid, a prenlyated (e.g., farnesylated, geranylgeranylated) amino acid, an acetylated amino acid, an acylated amino acid, a PEG-ylated amino acid, a biotinylated amino acid, a carboxylated amino acid, a phosphorylated amino acid, and the like.
  • the polypeptides may be produced by direct peptide synthesis using solid-phase techniques (e.g., Stewart et al. (1969) Solid-Phase Peptide Synthesis (WH Freeman Co, San Francisco); and Merrifield (1963) J. Am. Chem. Soc. 85: 2149-2154). Peptide synthesis may be performed using manual techniques or by automation. Automated synthesis may be achieved, for example, using Applied Biosystems 431 A Peptide Synthesizer (Perkin Elmer, Foster City, CA) in accordance with the instructions provided by the manufacturer.
  • solid-phase techniques e.g., Stewart et al. (1969) Solid-Phase Peptide Synthesis (WH Freeman Co, San Francisco); and Merrifield (1963) J. Am. Chem. Soc. 85: 2149-2154.
  • Peptide synthesis may be performed using manual techniques or by automation. Automated synthesis may be achieved, for example, using Applied Biosystems 431 A Peptide Synthesizer (Perkin El
  • subsequences may be chemically synthesized separately and combined using chemical methods to provide full-length GAT polypeptides.
  • Peptides can also be ordered from a variety of sources.
  • a GAT polypeptide of the invention is used to produce antibodies which have, e.g., diagnostic uses, for example, related to the activity, distribution, and expression of GAT polypeptides, for example, in various tissues of a transgenic plant.
  • GAT homologue polypeptides for antibody induction do not require biological activity; however, the polypeptide or oligopeptide must be antigenic.
  • Peptides used to induce specific antibodies may have an amino acid sequence consisting of at least 10 amino acids, preferably at least 15 or 20 amino acids.
  • GAT polypeptide Short stretches of a GAT polypeptide may be fused with another protein, such as keyhole limpet hemocyaniii, and an antibody produced against the chimeric molecule.
  • Another protein such as keyhole limpet hemocyaniii
  • Methods of producing polyclonal and monoclonal antibodies are known to those of skill in the art, and many antibodies are available. See, e.g., Coligan (1991) Current Protocols in Immunology (Wiley/Greene, NY); Harlow and Lane (1989) Antibodies: A Laboratory Manual (Cold Spring Harbor Press, NY); Stites et al. (eds.) Basic and Clinical Immunology, 4th ed.
  • Specific monoclonal and polyclonal antibodies and antisera will usually bind with a K D of at least about 0.1 ⁇ M, preferably at least about 0.01 ⁇ M or better, and most typically and preferably, 0.001 ⁇ M or better. Additional details antibody of production and engineering techniques can be found in Borrebaeck, ed. (1995) Antibody Engineering, 2 nd ed. (Freeman and Company, NY); McCafferty et al. (1996) Antibody Engineering, A Practical Approach (IRL at Oxford Press, Oxford, England); and Paul (1995) Antibody Engineering Protocols (Humana Press, Towata, NJ).
  • GAT polypeptides of the present invention include conservatively modified variations of the sequences disclosed herein as SEQ LD NO: 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 621, 623, 625, 627, 629, 631, 633, 635, 637, 639, 641, 643, 645, 647, 649, 651, 653, 655, 657, 659, 661, 663, 665, 667, 669, 671,
  • Such conservatively modified variations comprise substitutions, additions or deletions which alter, add or delete a single amino acid or a small percentage of amino acids (typically less than about 5%, more typically less than about 4%, 2%, or 1%) in any of SEQ LD NO: 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 621, 623, 625, 627, 629, 631, 633, 635, 637, 639, 641, 643, 645, 647, 6
  • a conservatively modified variation (e.g., deletion) of the 146 amino acid polypeptide identified herein as SEQ LD NO:6 will have a length of at least 140 amino acids, preferably at least 141 amino acids, more preferably at least 144 amino acids, and still more preferably at least 145 amino acids, corresponding to a deletion of less than about 5%, 4%, 2% or about 1%, or less of the polypeptide sequence.
  • Another example of a conservatively modified variation e.g., a
  • “conservatively substituted variation”) of the polypeptide identified herein as SEQ ED NO:6 will contain “conservative substitutions,” accordmg to the six substitution groups set forth in Table 2, in up to about 7 residues (i.e., less than about 5%) of the 146 amino acid polypeptide.
  • the GAT polypeptide sequence homologues of the invention can be present as part of larger polypeptide sequences such as occur in a GAT polypeptide, in a GAT fusion with a signal sequence, e.g., a chloroplast targeting sequence, or upon the addition of one or more domains for purification of the protein (e.g., poly his segments, FLAG tag segments, etc.). In the latter case, the additional functional domains have little or no effect on the activity of the GAT portion of the protein, or where the additional domains can be removed by post synthesis processing steps such as by treatment with a protease.
  • polypeptides of the invention provide a new class of enzymes with a defined activity, i.e., the acetylation and acylation of glyphosate, the polypeptides also provide new structural features which can be recognized, e.g., in immunological assays.
  • the generation of antisera which specifically binds the polypeptides of the invention, as well as the polypeptides which are bound by such antisera, are a feature of the invention.
  • the invention includes GAT polypeptides that specifically bind to or that are specifically immunoreactive with an antibody or antisera generated against an immunogen comprising an amino acid sequence selected from one or more of SEQ LD NO: 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 621, 623, 625, 627, 629, 631, 633, 635, 637, 639, 641, 643, 645, 647, 649, 651, 653, 655, 657, 659
  • the antibody or antisera is subtracted with available related proteins, such as those represented by the proteins or peptides corresponding to GenBank accession numbers available as of the filing date of this application, and exemplified by CAA70664, Z99109 and Y09476.
  • accession number corresponds to a nucleic acid
  • a polypeptide encoded by the nucleic acid is generated and used for antibody/antisera subtraction purposes.
  • Figure 3 tabulates the relative identity between exemplary GAT sequences and the most closely related sequence available in Genbank, Yitl. The function of native Yitl has yet to be elucidated, but the enzyme has been shown to possess detectable GAT activity.
  • the immunoassay uses a polyclonal antiserum which was raised against one or more polypeptides comprismg one or more of the sequences corresponding to one or more of SEQ ID NO: 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 621, 623, 625, 627, 629, 631, 633, 635, 637, 639, 641, 643, 645, 647, 649, 651, 653, 655, 657,
  • one or more of the immunogenic polypeptide(s) is produced and purified as described herein.
  • recombinant protein may be produced in a bacterial cell line.
  • An inbred strain of mice (used in this assay because results are more reproducible due to the virtual genetic identity of the mice) is immunized with the immunogenic polypeptide(s) in combination with a standard adjuvant, such as Freund's adjuvant, using a standard mouse immunization protocol (see, Harlow and Lane (1988) Antibodies, A Laboratory Manual (Cold Spring Harbor Publications, New York), for a standard description of antibody generation, immunoassay formats and conditions that can be used to determine specific immunoreactivity).
  • one or more synthetic or recombinant polypeptides derived from the sequences disclosed herein is conjugated to a carrier protein and used as an immunogen.
  • Polyclonal sera are collected and titered against the immunogenic polypeptide(s) in an immunoassay, for example, a solid phase immunoassay with one or more of the immunogenic proteins immobilized on a solid support.
  • Polyclonal antisera with a titer of 10 6 or greater are selected, pooled and subtracted with related polypeptides, e.g., those identified from GENBANK as noted, to produce subtracted, pooled, titered polyclonal antisera.
  • the subtracted, pooled, titered polyclonal antisera are tested for cross reactivity against the related polypeptides.
  • at least two of the immunogenic GATs are used in this determination, preferably in conjunction with at least two related polypeptides, to identify antibodies which are specifically bound by the immunogenic polypeptide(s).
  • discriminatory binding conditions are determined for the subtracted, titered polyclonal antisera which result in at least about a 5-10 fold higher signal to noise ratio for binding of the titered polyclonal antisera to the immunogenic GAT polypeptides as compared to binding to the related polypeptides.
  • the stringency of the binding reaction is adjusted by the addition of non-specific competitors such as albumin or non-fat dry milk, or by adjusting salt conditions, temperature, or the like.
  • binding conditions are used in subsequent assays for determining whether a test polypeptide is specifically bound by the pooled, subtracted polyclonal antisera.
  • a test polypeptide which shows at least a 2-5 fold higher signal to noise ratio than the control polypeptide under discriminatory binding conditions, and at least about a V signal to noise ratio as compared to the immunogemc polypeptide(s), shares substantial structural similarity with the immunogenic polypeptide(s) as compared to known GAT, and is, therefore a polypeptide of the invention.
  • immunoassays in the competitive binding format are used for the detection of a test polypeptide.
  • cross-reacting antibodies are removed from the pooled antisera mixture by immunoabsorption with the control GAT polypeptides.
  • the immunogenic polypeptide(s) are then immobilized to a solid support which is exposed to the subtracted pooled antisera.
  • Test proteins are added to the assay to compete for binding to the pooled, subtracted antisera.
  • test protein(s) The ability of the test protein(s) to compete for binding to the pooled, subtracted antisera as compared to the immobilized protein(s) is compared to the ability of the immunogenic polypeptide(s) added to the assay to compete for binding (the immunogenic polypeptide(s) compete effectively with the immobilized immunogenic polypeptide(s) for binding to the pooled antisera).
  • the percent cross-reactivity for the test proteins is calculated, using standard calculations.
  • the ability of the control proteins to compete for binding to the pooled, subtracted antisera is optionally determined as compared to the ability of the immunogenic polypeptide(s) to compete for binding to the antisera.
  • the percent cross-reactivity for the control polypeptides is calculated, using standard calculations. Where the percent cross-reactivity is at least 5-10x higher for the test polypeptides, the test polypeptides are said to specifically bind the pooled, subtracted antisera.
  • the immunoabsorbed and pooled antisera can be used in a competitive binding immunoassay as described herein to compare any test polypeptide to the immunogenic polyp eptide(s). In order to make this comparison, the two polypeptides are each assayed at a wide range of concentrations and the amount of each polypeptide required to inhibit 50% of the binding of the subtracted antisera to the immobilized protein is determined using standard techniques.
  • the test polypeptide is said to specifically bind to an antibody generated to the immunogenic polypeptide(s), provided the amount is at least about 5-1 Ox higher for a control polypeptide.
  • the pooled antisera is optionally fully immunosorbed with the immunogenic polypeptide(s) (rather than the control polypeptides) until little or no binding of the subtracted, pooled antisera to the immunogenic polypeptide(s) is detectable. This fully immunosorbed antisera is then tested for reactivity with the test polypeptide.
  • test polypeptide is specifically bound by the antisera elicited by the immunogemc polypeptide(s).
  • GLYPHOSATE-N-ACETYLTRANSFERASE POLYNUCLEOTIDES hi one aspect, the invention provides a novel family of isolated or recombinant polynucleotides referred to herein as "glyphosate-N-acetyltransferase polynucleotides" or "GAT polynucleotides.” GAT polynucleotide sequences are characterized by the ability to encode a GAT polypeptide.
  • the invention includes any nucleotide sequence that encodes any of the novel GAT polypeptides described herein, h some aspects of the invention, a GAT polynucleotide that encodes a GAT polypeptide with GAT activity is preferred, h one aspect, the GAT polynucleotides comprise recombinant or isolated forms of naturally occurring nucleic acids isolated from an organism, e.g., a bacterial strain. Exemplary GAT polynucleotides, e.g., SEQ ED NO:l - 5, were discovered by expression cloning of sequences from Bacillus strains exhibiting GAT activity.
  • the product of the reaction was positively identified as N- acetylglyphosate by comparing the mass spectrometry profile of the reaction mix to an N-acetylglyphosate standard as shown in Figure 2.
  • Product detection was dependent on inclusion of both substrates (acetyl CoA and glyphosate) and was abolished by heat denaturing the bacterial cells.
  • Individual GAT polynucleotides were then cloned from the identified strains by functional screening. Genomic DNA was prepared and partially digested with Sau3Al enzyme. Fragments of approximately 4 kb were cloned into an E. coli expression vector and transformed into electrocompetent E. coli.
  • GAT polynucleotides are produced by diversifying, e.g., recombining and/or mutating one or more naturally occurring, isolated, or recombinant GAT polynucleotides.
  • GAT polynucleotides encoding GAT polypeptides with superior functional attributes, e.g., increased catalytic function, increased stability, or higher expression level, than a GAT polynucleotide used as a substrate or parent in the diversification process.
  • the polynucleotides of the invention have a variety of uses in, for example: recombinant production (i.e., expression) of the GAT polypeptides of the invention; as transgenes (e.g., to confer herbicide resistance in transgenic plants); as selectable markers for transformation and plasmid maintenance; as immunogens; as diagnostic probes for the presence of complementary or partially complementary nucleic acids (including for detection of natural GAT coding nucleic acids); as substrates for further diversity generation, e.g., recombination reactions or mutation reactions to produce new and/or improved GAT homologues, and the like.
  • GAT polynucleotides that do not encode active enzymes can be valuable sources of parental polynucleotides for use in diversification procedures to arrive at GAT polynucleotide variants, or non-GAT polynucleotides, with desirable functional properties (e.g., high k cat or k cat /K m , low K m , high stability towards heat or other environmental factors, high transcription or translation rates, resistance to proteolytic cleavage, reducing antigenicity, etc.).
  • desirable functional properties e.g., high k cat or k cat /K m , low K m , high stability towards heat or other environmental factors, high transcription or translation rates, resistance to proteolytic cleavage, reducing antigenicity, etc.
  • nucleotide sequences encoding protease variants with little or no detectable activity have been used as parent polynucleotides in DNA shuffling experiments to produce progeny encoding highly active proteases (Ness et al. (1999) Nature Biotech. 17:893-96).
  • Polynucleotide sequences produced by diversity generation methods or recursive sequence recombination ("RSR") methods are a feature of the invention.
  • RSR recursive sequence recombination
  • Mutation and recombination methods using the nucleic acids described herein are a feature of the invention.
  • one method of the invention includes recursively recombining one or more nucleotide sequences of the invention as described above and below with one or more additional nucleotides.
  • the recombining steps are optionally performed in vivo, ex vivo, in silico or in vitro.
  • This diversity generation or recursive sequence recombination produces at least one library of recombinant modified GAT polynucleotides. Polypeptides encoded by members of this library are included in the invention.
  • polynucleotides also referred to herein as oligonucleotides, typically having at least 12 bases, preferably at least 15, more preferably at least 20, 30, or 50 or more bases, which hybridize under stringent or highly stringent conditions to a GAT polynucleotide sequence.
  • the polynucleotides may be used as probes, primers, sense and antisense agents, and the like, according to methods as noted herein.
  • GAT polynucleotides including nucleotide sequences that encode GAT polypeptides, fragments of GAT polypeptides, related fusion proteins, or functional equivalents thereof, are used in recombinant DNA molecules that direct the expression of the GAT polypeptides in appropriate host cells, such as bacterial or plant cells. Due to the inherent degeneracy of the genetic code, other nucleic acid sequences which encode substantially the same or a functionally equivalent amino acid sequence can also be used to clone and express the GAT polynucleotides.
  • the invention provides GAT polynucleotides that encode transcription and/or translation products that are subsequently spliced to ultimately produce functional GAT polypeptides.
  • Splicing can be accomplished in vitro or in vivo, and can involve cis- or trans-splicing.
  • the substrate for splicing can be polynucleotides (e.g., RNA transcripts) or polypeptides.
  • cis- splicing of a polynucleotide is where an intron inserted into a coding sequence is removed and the two flanking exon regions are spliced to generate a GAT polypeptide encoding sequence.
  • trans-splicing would be where a GAT polynucleotide is encrypted by separating the coding sequence into two or more fragments that can be separately transcribed and then spliced to form the full- length GAT encoding sequence.
  • splicing enhancer sequence (which can be introduced into a construct of the invention) can facilitate splicing either in cis or trans. Cis- and trans-splicing of polypeptides are described in more detail elsewhere herein and in U.S. Patent Application Serial Nos. 09/517,933 and 09/710,686. Thus, some GAT polynucleotides do not directly encode a full-length GAT polypeptide, but rather encode a fragment or fragments of a GAT polypeptide.
  • GAT polynucleotides can be used to express a functional GAT polypeptide through a mechanism involving splicing, where splicing can occur at the level of polynucleotide (e.g., intron/exon) and/or polypeptide (e.g., intein/extein). This can be useful, for example, in controlling expression of GAT activity, since functional GAT polypeptide will only be expressed if all required fragments are expressed in an environment that permits splicing processes to generate functional product.
  • polynucleotide e.g., intron/exon
  • polypeptide e.g., intein/extein
  • introduction of one or more insertion sequences into a GAT polynucleotide can facilitate recombination with a low homology polynucleotide; use of an intron or intein for the insertion sequence facilitates the removal of the intervening sequence, thereby restoring function of the encoded variant.
  • it can be advantageous to modify a coding sequence to enhance its expression in a particular host.
  • the genetic code is redundant with 64 possible codons, but most organisms preferentially use a subset of these codons.
  • codons that are utilized most often in a species are called optimal codons, and those not utilized very often are classified as rare or low-usage codons (see, e.g., Zhang et al. (1991) Gene 105:61- 72). Codons can be substituted to reflect the preferred codon usage of the host, a process sometimes called "codon optimization” or "controlling for species codon bias.” Optimized coding sequences containing codons preferred by a particular prokaryotic or eukaryotic host (see also, Murray et al. (1989) Nucl. Acids Res.
  • RNA transcripts can be prepared, for example, to increase the rate of translation or to produce recombinant RNA transcripts having desirable properties, such as a longer half-life, as compared with transcripts produced from a non-optimized sequence.
  • Translation stop codons can also be modified to reflect host preference. For example, preferred stop codons for S. cerevisiae and mammals are UAA and UGA, respectively. The preferred stop codon for monocotyledonous plants is UGA, whereas insects and E. coli prefer to use UAA as the stop codon (Dalphin et al. (1996) Nucl. Acids Res. 24: 216-218).
  • Methodology for optimizing a nucleotide sequence for expression in a plant is provided, for example, in U.S.
  • One embodiment of the invention includes a GAT polynucleotide having optimal codons for expression in a relevant host, e.g., a transgenic plant host. This is particularly desirable when a GAT polynucleotide of bacterial origin is introduced into a transgenic plant, e.g., to confer glyphosate resistance to the plant.
  • the polynucleotide sequences of the present invention can be engineered in order to alter a GAT polynucleotide for a variety of reasons, including but not limited to alterations which modify the cloning, processing and/or expression of the gene product.
  • the polynucleotides of the invention include sequences which encode novel GAT polypeptides and sequences complementary to the coding sequences, and novel fragments of coding sequences and complements thereof.
  • the polynucleotides can be in the form of RNA or in the form of DNA, and include mRNA, cRNA, synthetic RNA and DNA, genomic DNA and cDNA.
  • the polynucleotides can be double-stranded or single-stranded, and if single-stranded, can be the coding strand or the non-coding (anti-sense, complementary) strand.
  • the polynucleotides optionally include the coding sequence of a GAT polypeptide (i) in isolation, (ii) in combination with an additional coding sequence, so as to encode, e.g., a fusion protein, a pre-protein, a prepro-protein, or the like, (iii) in combination with non-coding sequences, such as introns or inteins, control elements such as a promoter, an enhancer, a terminator element, or 5' and/or 3' untranslated regions effective for expression of the coding sequence in a suitable host, and/or (iv) in a vector or host environment in wliich the GAT polynucleotide is a heterologous gene.
  • Sequences can also be found in combination with typical compositional formulations of nucleic acids, including in the presence of carriers, buffers, adjuvants, excipients and the like.
  • Polynucleotides and oligonucleotides of the invention can be prepared by standard solid-phase methods, according to known synthetic methods. Typically, fragments of up to about 100 bases are individually synthesized, then joined (e.g., by enzymatic or chemical ligation methods, or polymerase mediated methods) to form essentially any desired continuous sequence.
  • polynucleotides and oligonucleotides of the invention can be prepared by chemical synthesis using, e.g., the classical phosphoramidite method described by Beaucage et al.
  • oligonucleotides are synthesized, e.g., in an automatic DNA synthesizer, purified, annealed, ligated and cloned in appropriate vectors.
  • essentially any nucleic acid can be custom ordered from any of a variety of commercial sources, such as The Midland Certified Reagent Company (mcrc@oligos.com), The Great American Gene Company (www.genco.com), ExpressGen Inc.
  • peptides and antibodies can be custom ordered from any of a variety of sources, such as PeptidoGenic (pkim@ccnet.com), HTI Bio-products, h e. (www.htibio.com), BMA Biomedicals Ltd (U.K.), Bio. Synthesis, Inc., and many others.
  • Polynucleotides may also be synthesized by well-known techniques as described in the technical literature. See, e.g., Carruthers et al, Cold Spring Harbor Symp. Quant. Biol. 47: 411-418 (1982), and Adams et al. (1983) J Am. Chem.
  • Double stranded DNA fragments may then be obtained either by synthesizing the complementary strand and annealing the strands together under appropriate conditions, or by adding the complementary strand using DNA polymerase with an appropriate primer sequence.
  • General texts which describe molecular biological techniques useful herein, including mutagenesis include Berger and Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology, Volume 152 (Academic Press, Inc., San Diego, CA); Sambrook et al. (1989) Molecular Cloning - A Laboratory Manual, 2nd ed., Volumes 1-3 (Cold Spring Harbor Laboratory, Cold Spring Harbor, New York); and Ausubel et al., eds.
  • PCR polymerase chain reaction
  • LCR ligase chain reaction
  • Q ⁇ -replicase amplification RNA polymerase mediated techniques
  • NASBA RNA polymerase mediated techniques
  • RNA can be converted into a double stranded DNA suitable for restriction digestion, PCR expansion and sequencing using reverse transcriptase and a polymerase. . See, Ausbel, Sambrook and Berger, all supra.
  • One aspect of the invention provides an isolated or recombinant polynucleotide selected from the group consisting of SEQ ED NO: 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565, 566, 567, 620, 622, 624, 626
  • Preferred polynucleotides of the present invention include an isolated or recombinant polynucleotide sequence encoding and amino acid sequence that can be optimally aligned with a reference amino acid sequence selected from the group consisting of SEQ LD NO: 300, 445, and 457 to generate a similarity score of at least 460 using the BLOSUM62 matrix, a gap existence penalty of 11, and a gap extension penalty of 1, wherein one or more of the following positions conform to the following restrictions: (i) at positions 18 and 38, there is a Z5 amino acid residue; (ii) at position 62, there is a ZI amino acid residue; (iii) at position 124, there is a Z6 amino acid residue; and (iv) at position 144, there is a Z2 amino acid residue, wherein: ZI is an amino acid residue selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid residue selected from the group consisting of F, W, and Y; Z5 is
  • Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence such that when the sequence is optimally aligned with a reference amino acid sequence selected from the group consisting of SEQ ED NO: 300, 445, and 457 to generate a similarity score of at least 460 using the BLOSUM62 matrix, a gap existence penalty of 11, and a gap extension penalty of 1, one or more of the following positions conform to the following restrictions: (i) at positions 18 and 38, there is a Z5 amino acid residue; (ii) at position 62, there is a ZI amino acid residue; (iii) at position 124, there is a Z6 amino acid residue; and (iv) at position 144, there is a Z2 amino acid residue, wherein: ZI is an amino acid residue selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid residue selected from the group consisting of F, W, and Y; Z5 is an amino acid residue selected from the group consisting of D
  • Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence further comprising the amino acid residues in the amino acid sequence that correspond to the positions specified in (a)-(am), at least 90% conform to the amino acid residue restrictions specified in(a)-(am).
  • Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence such that when the sequence is optimally aligned with SEQ LD NO: 300, 445, or 457, at least 90% of the amino acid residues in the amino acid sequence conform to the following restrictions: (a) at positions 1, 7, 9, 13, 20, 36, 42, 46, 50, 56, 64, 70, 72, 75, 76, 78, 94, 98, 107, 110, 117, 118, 121, and/or 141 the amino acid residue is Bl; and (b) at positions 16, 21, 22, 23, 25, 29, 34, 41, 43, 44, 55, 66, 71, 73, 74, 77, 85, 87, 88, 95, 99, 102, 108, 109, 111, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is B2; wherein Bl is an amino acid selected from the group consisting of A, I, L, M, F
  • Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence such that when the sequence is optimally aligned with SEQ ED NO: 300, 445, or 457, at least 90% of the amino acid residues in the amino acid sequence conform to the following restrictions: (a) at positions 1, 7, 9, 13, 20, 42, 46, 50, 56, 64, 70, 72, 75, 76, 78, 94, 98, 107, 110, 117, 118, 121, and/or 141 the amino acid residue is Bl; and (b) at positions 16, 21, 22, 23, 25, 29, 34, 36, 41, 43, 44, 55, 66, 71, 73, 74, 77, 85, 87, 88, 95, 99,102, 108, 109, 111, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is B2; wherein Bl is an amino acid selected from the group consisting of A, I, L, M, F,
  • Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence such that when the sequence is optimally aligned with SEQ ED NO: 300, 445, or 457, at least 90% of the amino acid residues in the amino acid sequence conform to the following restrictions: (a) at positions 1, 7, 9, 20, 42, 50, 72, 75, 76, 78, 94, 98, 110, 121, and/or 141 the amino acid residue is ZI; (b) at positions 13, 46, 56, 70, 107, 117, and/or 118 the amino acid residue is Z2; (c) at positions 23, 55, 71, 77, 88, and/or 109 the amino acid residue is Z3; (d) at positions 16, 21, 41, 73, 85, 99, and/or 111 the amino acid residue is Z4; (e) at positions 34 and/or 95 the amino acid residue is Z5; (f) at position 22, 25, 29, 43, 44, 66, 74, 87, 102, 108,
  • Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence further comprising at position 36 an amino acid residue selected from the group consisting of ZI and Z3. Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence further comprising at position 64 an amino acid residue selected from the group consisting of ZI and Z2.
  • Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence such that when the sequence is optimally aligned with SEQ ED NO: 300, 445, or 457, at least 80% of the amino acid residues in the amino acid sequence conform to the following restrictions: (a) at position 2 the amino acid residue is I or L; (b) at position 3 the amino acid residue is E; (c) at position 4 the amino acid residue is V or I; (d) at position 5 the amino acid residue is K; (e) at position 6 the amino acid residue is P; (f) at position 8 the amino acid residue is N; (g) at position 10 the amino acid residue is E; (h) at position 11 the amino acid residue is D or E; (i) at position 12 the amino acid residue is T; (j) at position 14 the amino acid residue is E or D; (k) at position 15 the amino acid residue is L; (1) at position 17 the amino acid residue is H; (m) at position 18 the amino acid residue is R, E or K; (n)
  • Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence such that when the sequence is optimally aligned with SEQ ED NO: 300, 445, or 457, at least 90% of the amino acid residues in the amino acid sequence conform to the amino acid residue restrictions specified in (a) - (ch) above.
  • Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence that when optimally aligned with a reference amino acid sequence selected from the group consisting of SEQ LD NO: 300, 445, and 457 to generate a similarity score of at least 460 using the BLOSUM62 matrix, a gap existence penalty of 11, and a gap extension penalty of 1, one or more of the following positions conform to the following restrictions: (i) at positions 18 and 38, there is a Z5 amino acid residue; (ii) at position 62, there is a ZI amino acid residue; (iii) at position 124, there is a Z6 amino acid residue; and (iv) at position 144, there is a Z2 amino acid residue, wherein: ZI is an amino acid residue selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid residue selected from the group consisting of F, W, and Y; Z5 is an amino acid residue selected from the group consisting of D and E; and
  • Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence which conforms to at least one of the following additional restrictions: (a) at position 36 the amino acid residue is M, L, or T; (b) at position 72 the amino acid residue is L or I; (c) at position 75 the amino acid residue is M or V; (d) at position 64 the amino acid residue is L, I, or F; (e) at position 88 the amino acid residue is T or S; (f) at position 117 the amino acid residue is Y or F.
  • Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence in which at least one of the following additional conditions is met: (a) at position 14 the amino acid residue is D; (b) at position 18 the amino acid residue is E; (c) at position 26 the amino acid residue is M or V; (e) at position 30 the amino acid residue is I; (f) at position 32 the amino acid residue is D; (g) at position 36 the amino acid residue is M or T; (i) at position 37 the amino acid residue is C; (j) at position 38 the amino acid residue is D; (j) at position 53 the amino acid residue is V; (k) at position 58 the amino acid residue is R; (1) at position 61 the amino acid residue is R; (m) at position 62 the amino acid residue is L; (n) at position 64 the amino acid residue is I or F; (o) at position 65 the amino acid residue is P; (p) at position 72 the amino acid residue is I; (q) at position 75 the amino acid residue is V; (r
  • Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence wherein, of the amino acid residues in the amino acid sequence that correspond to the positions specified in (a) through (ab) as described above, at least 80% conform to the amino acid residue restrictions specified in (a) through (ab).
  • Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence which conforms to at least one of the following additional restrictions: (a) at position 41 the amino acid residue is H; (b) at position 138 the amino acid residue is H; (c) at position 34 the amino acid residue is N; and (d) at position 55 the amino acid residue is S.
  • Some preferred isolated or recombinant polynucleotides of the invention are selected from the group consisting of: (a) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ LD NO: 577; (b) a nucleotide sequence encoding an amino acid sequence that is at least 97% identical to SEQ ED NO:578; (c) a nucleotide sequence encoding an amino acid sequence that is at least 97% identical to SEQ ED NO: 621; (d) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO: 579; (e) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ LD NO:602; (X) a nucleotide sequence encoding an amino acid sequence that is at least 95 % identical to SEQ LD NO: 697; (g) a nucleotide sequence encoding an
  • Some preferred isolated or recombinant polynucleotides of the invention are selected from the group consisting of: (a) a nucleotide sequence encoding an amino acid sequence that is at least 98%> identical to SEQ ED NO:577; (b) a nucleotide sequence encoding an amino acid sequence that is at least 97% identical to SEQ LD NO:578; (c) a nucleotide sequence encoding an amino acid sequence that is at least 97% identical to SEQ ED NO:621; (d) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:579; (e) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ LD NO: 602; (f) a nucleotide sequence encoding an amino acid sequence that is at least 95% identical to SEQ ED NO:697; (g) a nucleotide sequence encoding an
  • Some preferred isolated or recombinant polynucleotides of the invention are selected from the group consisting of: (a) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:577; (b) a nucleotide sequence encoding an amino acid sequence that is at least 97% identical to SEQ ED NO:578; (c) a nucleotide sequence encoding an amino acid sequence that is at least 97% identical to SEQ LD NO:621; (d) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:579; (e) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:602; (f) a nucleotide sequence encoding an amino acid sequence that is at least 95% identical to SEQ ED NO:697; (g) a nucleotide sequence encoding an amino
  • Some preferred isolated or recombinant polynucleotides of the invention are selected from the group consisting of: (a) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:577; (b) a nucleotide sequence encoding an amino acid sequence that is at least 97% identical to SEQ ED NO:578; (c) a nucleotide sequence encoding an amino acid sequence that is at least 97% identical to SEQ ED NO:621; (d) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:579; (e) a nucleotide sequence encoding an amino acid sequence that is at least 98%> identical to SEQ ED NO:602; (f) a nucleotide sequence encoding an amino acid sequence that is at least 95% identical to SEQ ED NO:697; (g) a nucleotide sequence encoding an
  • Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence further wherein of the amino acid residues in the amino acid sequence that correspond to the positions specified in (a)-(am), at least 90% conform to the amino acid residue restrictions specified in(a)-(am).
  • Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence in which of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 90% conform to the following additional restrictions: (a) at positions 1, 7, 9, 13, 20, 36, 42, 46, 50, 56, 64, 70, 72, 75, 76, 78, 94, 98, 107, 110, 117, 118, 121, and or 141 the amino acid residue is Bl; and (b) at positions 16, 21, 22, 23, 25, 29, 34, 41, 43, 44, 55, 66, 71, 73, 74, 77, 85, 87, 88, 95, 99, 102, 108, 109, 111, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is B2; wherein Bl is an amino acid selected from the group consisting of A, I, L, M, F, W, Y, and V; and B2 is an amino acid selected from
  • Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence such that when the sequence is optimally aligned with SEQ LD NO: 300, 445, or 457, at least 90% of the amino acid residues in the amino acid sequence conform to the following restrictions: (a) at positions 1, 7, 9, 13, 20, 42, 46, 50, 56, 64, 70, 72, 75, 76, 78, 94, 98, 107, 110, 117, 118, 121, and/or 141 the amino acid residue is Bl; and (b) at positions 16, 21, 22, 23, 25, 29, 34, 36, 41, 43, 44, 55, 66, 71, 73, 74, 77, 85, 87, 88, 95, 99,102, 108, 109, 111, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is B2; wherein Bl is an amino acid selected from the group consisting of A, I, L, M, F,
  • Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence such that when the sequence is optimally aligned with SEQ ED NO: 300, 445, or 457, at least 90%> of the amino acid residues in the amino acid sequence conform to the following restrictions: (a) at positions 1, 7, 9, 20, 42, 50, 72, 75, 76, 78, 94, 98, 110, 121, and or 141 the amino acid residue is ZI; (b) at positions 13, 46, 56, 70, 107, 117, and/or 118 the amino acid residue is Z2; (c) at positions 23, 55, 71, 77, 88, and/or 109 the amino acid residue is Z3; (d) at positions 16, 21, 41, 73, 85, 99, and/or 111 the amino acid residue is Z4; (e) at positions 34 and/or 95 the amino acid residue is Z5; (f) at position 22, 25, 29, 43, 44, 66, 74, 87, 102, 108,
  • Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence further comprising at position 36 an amino acid residue selected from the group consisting of ZI and Z3. Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence further comprising at position 64 an amino acid residue selected from the group consisting of ZI and Z2.
  • Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence such that when the sequence is optimally aligned with SEQ ED NO: 300, 445, or 457, at least 80% of the amino acid residues in the amino acid sequence conform to the following restrictions: (a) at position 2 the amino acid residue is I or L; (b) at position 3 the amino acid residue is E; (c) at position 4 the amino acid residue is V or I; (d) at position 5 the amino acid residue is K; (e) at position 6 the amino acid residue is P; (f) at position 8 the amino acid residue is N; (g) at position 10 the amino acid residue is E; (h) at position 11 the amino acid residue is D or E; (i) at position 12 the amino acid residue is T; (j) at position 14 the amino acid residue is E or D; (k) at position 15 the amino acid residue is L; (1) at position 17 the amino acid residue is H; (m) at position 18 the amino acid residue is R, E or K; (n)
  • Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence such that when the sequence is optimally aligned with SEQ ED NO: 300, 445, or 457, at least 90% of the amino acid residues in the amino acid sequence conform to the amino acid residue restrictions specified in (a) - (ch) above.
  • Some preferred isolated or recombinant polynucleotides of the invention are selected from the group consisting of: (a) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:577; (b) a nucleotide sequence encoding an amino acid sequence that is at least 97% identical to SEQ ED NO:578; (c) a nucleotide sequence encoding an amino acid sequence that is at least 97% identical to SEQ ED NO:621; (d) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:579; (e) a nucleotide sequence encoding an amino acid sequence that is at least 98 > identical to SEQ ED NO:602; (f) a nucleotide sequence encoding an amino acid sequence that is at least 95% identical to SEQ ED NO: 697; (g) a nucleotide sequence encoding an
  • Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence further comprising at least one amino acid residue that meets the following criteria: (a) at position 14 the amino acid residue is D; (b) at position 18 the amino acid residue is E; (c) at position 26 the amino acid residue is M or V; (e) at position 30 the amino acid residue is I; (f) at position 32 the amino acid residue is D; (g) at position 36 the amino acid residue is M or T; (i) at position 37 the amino acid residue is C; (j) at position 38 the amino acid residue is D; (j) at position 53 the amino acid residue is V; (k) at position 58 the amino acid residue is R; (1) at position 61 the amino acid residue is R; (m) at position 62 the amino acid residue is L; (n) at position 64 the amino acid residue is I or F; (o) at position 65 the amino acid residue is P; (p) at position 72 the amino acid residue is I; (q) at position 75 the amino acid residue is V;
  • Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence such that when the sequence is optimally aligned with SEQ LD NO: 300, 445, or 457, at least 80% of the amino acid residues in the amino acid sequence conform to the amino acid residue restrictions specified in (a) through (ab) above.
  • Some preferred isolated or recombinant polynucleotides of the invention comprise a nucleotide sequence which encodes an amino acid sequence selected from the group consisting of: (a) an amino acid sequence that is at least 96%o identical to SEQ ED NO:919 (such as, for example, a nucleotide sequence which encodes SEQ ED NO:917, 919, 921, 923, 925, 927, 833, 835, 839, 843, 845, 859, 863, 873, 877, 891, 895, 901, 905, 907, 913, 915, or 950); (b) an amino acid sequence that is at least 97% identical to SEQ ED NO:929 (such as, for example, a nucleotide sequence which encodes SEQ LD NO:929, 931, 835, 843, 849, or 867);
  • amino acid sequence that is at least 98% identical to SEQ ED NO:847 (such as, for example, a nucleotide sequence which encodes SEQ ED NO:845 or 847);
  • Some preferred isolated or recombinant polynucleotides of the invention are selected from the group consisting of: (a) a nucleotide sequence encoding an amino acid sequence that is at least 96% identical to SEQ ID NO:919 (for example, a nucleotide sequence such as SEQ ED NO:916, 918, 920, 922, 924, 926, 832, 834, 838, 842, 844, 858, 862, 872, 876, 890, 894, 900, 904, 906, 912, 914, 939, 940, 941, 942, 943, 944, 949, 951 or 952); (b) a nucleotide sequence encoding an amino acid sequence that is at least 97% identical to SEQ ED NO:929 (for example, a nucleotide sequence such as SEQ ED NO:928, 930, 834, 842, 848, 866, 936 or 937); (c) a nu
  • Some preferred isolated or recombinant polynucleotides of the invention comprise a nucleotide sequence encoding an amino acid sequence that is at least 95% identical to SEQ ED NO:929 and which comprises a Gly or an Asn residue at the amino acid position corresponding to position 33 of SEQ ED NO:929 (such as, for example, a nucleotide sequence which encodes SEQ ED NO:837, 849, 893, 897, 905, 921, 927, 929 or 931).
  • Some preferred isolated or recombinant polynucleotides of the invention comprise a nucleotide sequence encoding an amino acid sequence that is at least 95% identical to SEQ ED NO:929 and which comprises a Gly or an Asn residue at the amino acid position corresponding to position 33 of SEQ ED NO:929 (for example, a nucleotide sequence such as SEQ LD NO:836, 848, 892, 896, 904, 920, 926, 928, 930, 938).
  • Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence which further comprises one or more amino acid residues meeting the following criteria: (a) at position 41 the amino acid residue is H; (b) at position 138 the amino acid residue is H; (c) at position 34 the amino acid residue is N; and (d) at position 55 the amino acid residue is S. While description of the polypeptides of the invention is sometimes expressed herein as a list of possible restrictions on what amino acid residues are found at particular positions, in some embodiments, a polypeptide of the invention meets all of a particular set of possible restrictions.
  • codon table 1 For instance, inspection of the codon table (Table 1) shows that codons AGA, AGG, CGA, CGC, CGG, and CGU all encode the amino acid arginine.
  • the codon can be altered to any of the corresponding codons described above without altering the encoded polypeptide. It is understood that U in an RNA sequence corresponds to T in a DNA sequence.
  • nucleic acid sequence corresponding to nucleotides 1-15 of SEQ ID NO:l (ATG ATT GAA GTC AAA (SEQ ED NO : 826)), a silent variation of this sequence includes AGT ATC GAG GTG AAG (SEQ LD NO: 827); both sequences encode the amino acid sequence MIEVK (SEQ ED NO: 828), which corresponds to amino acids 1-5 of SEQ ED NO:6.
  • silent variations are one species of "conservatively modified variations," as discussed below.
  • each codon in a nucleic acid except AUG, which is ordinarily the only codon for methionine
  • each silent variation of a nucleic acid which encodes a polypeptide is implicit in any described sequence.
  • the invention provides each and every possible variation of nucleic acid sequence encoding a polypeptide of the invention that could be made by selecting combinations based on possible codon choices. These combinations are made in accordance with the standard triplet genetic code (e.g., as set forth in Table 1) as applied to the nucleic acid sequence encoding a GAT homologue polypeptide of the invention. All such variations of every nucleic acid herein are specifically provided and described by consideration of the sequence in combination with the genetic code. Any variant can be produced as noted herein.
  • a GAT polynucleotide is engineered for optimized codon usage in a desired host organism, for example a plant host.
  • the term “optimized” or “optimal” are not meant to be restricted to the very best possible combination of codons, but simply indicates that the coding sequence as a whole possesses an improved usage of codons relative to a precursor polynucleotide from which it was derived.
  • the invention provides a method for producing a GAT polynucleotide variant by replacing at least one parental codon in a nucleotide sequence with a synonymous codon that is preferentially used in a desired host organism, e.g., a plant, relative to the parental codon.
  • a desired host organism e.g., a plant
  • Constantly modified variations or, simply, “conservative variations” of a particular nucleic acid sequence refers to those nucleic acids which encode identical or essentially identical amino acid sequences, or, where the nucleic acid does not encode an amino acid sequence, to essentially identical sequences.
  • a conservatively substituted variation of a polypeptide of the invention can contain 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 substitutions with a conservatively substituted variation of the same conservative substitution group.
  • a conservatively substituted variation of the polypeptide identified herein as SEQ ED NO:6 will contain "conservative substitutions" according to the six groups defined above, in up to 7 residues (i.e., 5% of the amino acids) in the 146 amino acid polypeptide.
  • RPN QPL EAC M examples include: KPQ QPV ESC M (SEQ ED NO:830) and KPN NPL DAC V (SEQ ED NO:831) and the like, in accordance with the conservative substitutions listed in Table 2 (in the above example, conservative substitutions are underlined).
  • Table 2 examples of conservative substitutions listed in Table 2 (in the above example, conservative substitutions are underlined).
  • sequences which do not alter the encoded activity of a nucleic acid molecule is a conservative variation of the basic nucleic acid.
  • sequences which do not alter the encoded activity of a nucleic acid molecule is a conservative variation of the basic nucleic acid.
  • conservative variations of the nucleic acid constructs which are disclosed yield a functionally identical construct.
  • "silent substitutions" i.e., substitutions in a nucleic acid sequence which do not result in an alteration in an encoded polypeptide
  • substitutions are an implied feature of every nucleic acid sequence which encodes an amino acid.
  • “conservative amino acid substitutions,” in one or a few amino acids in an amino acid sequence are substituted with different amino acids with highly similar properties, are also readily identified as being highly similar to a disclosed construct. Such conservative variations of each disclosed sequence are a feature of the present invention.
  • Non-conservative modifications of a particular nucleic acid are those which substitute any amino acid not characterized as a conservative substitution. For example, any substitution which crosses the bounds of the six groups set forth in Table 2. These include substitutions of basic or acidic amino acids for neutral amino acids, (e.g., Asp, Glu, Asn, or Gin for Val, He, Leu or Met), aromatic amino acid for basic or acidic amino acids (e.g.
  • Nucleic Acid Hybridization Nucleic acids "hybridize" when they associate, typically in solution.
  • Nucleic acids hybridize due to a variety of well-characterized physico-chemical forces, such as hydrogen bonding, solvent exclusion, base stacking and the like.
  • highly stringent hybridization and wash conditions are selected to be about 5°C or less lower than the thermal melting point (T m ) for the specific sequence at a defined ionic strength and pH (as noted below, highly stringent conditions can also be referred to in comparative terms).
  • the T m is the temperature (under defined ionic strength and pH) at which 50%> of the test sequence hybridizes to a perfectly matched probe.
  • Very stringent conditions are selected to be equal to the T m for a particular probe.
  • the T m of a nucleic acid duplex indicates the temperature at which the duplex is 50% > denatured under the given conditions and its represents a direct measure of the stability of the nucleic acid hybrid.
  • the T m corresponds to the temperature corresponding to the midpoint in transition from helix to random coil and it depends on length, nucleotide composition, and ionic strength for long stretches of nucleotides.
  • unhybridized nucleic acid material can be removed by a series of washes, the stringency of which can be adjusted depending upon the desired results.
  • Low stringency washing conditions e.g., using higher salt and lower temperature
  • Higher stringency conditions e.g., using lower salt and higher temperature that is closer to the hybridization temperature
  • lowers the background signal typically with only the specific signal remaining. See Rapley, R. and Walker, J.M.
  • (%G + C) is the percentage of guanosine (G) and cytosine (C) nucleotides
  • (%f) is the percentage of formalize
  • n is the number of nucleotide bases (i.e., length) of the hybrid. See Rapley and Walker, supra.
  • Equations 1 and 2 are typically accurate only for hybrid duplexes longer than about 100-200 nucleotides. Id.
  • An example of stringent hybridization conditions for hybridization of complementary nucleic acids which have more than 100 complementary residues on a filter in a Southern or northern blot is 50% formalin with 1 mg of heparin at 42°C, with the hybridization being carried out overnight.
  • stringent wash conditions is a 0.2x SSC wash at 65°C for 15 minutes (see Sambrook, supra for a description of SSC buffer). Often the high stringency wash is preceded by a low stringency wash to remove background probe signal.
  • An example low stringency wash is 2x SSC at 40°C for 15 minutes. h general, a signal to noise ratio of 2.5x-5x (or higher) than that observed for an unrelated probe in the particular hybridization assay indicates detection of a specific hybridization. Detection of at least stringent hybridization between two sequences in the context of the present invention indicates relatively strong structural similarity or homology to, e.g., the nucleic acids of the present invention provided in the sequence listings herein.
  • “highly stringent” conditions are selected to be about 5° C or less lower than the thermal melting point (T m ) for the specific sequence at a defined ionic strength and pH.
  • Target sequences that are closely related or identical to the nucleotide sequence of interest e.g., "probes”
  • T m thermal melting point
  • Lower stringency conditions are appropriate for sequences that are less complementary. See, e.g., Rapley and Walker, supra.
  • Comparative hybridization can be used to identify nucleic acids of the invention, and this comparative hybridization method is a preferred method of distinguishing nucleic acids of the invention.
  • Detection of highly stringent hybridization between two nucleotide sequences in the context of the present invention indicates relatively strong structural similarity/homology to, e.g., the nucleic acids provided in the sequence listing herein.
  • Highly stringent hybridization between two nucleotide sequences demonstrates a degree of similarity or homology of structure, nucleotide base composition, arrangement or order that is greater than that detected by stringent hybridization conditions.
  • detection of highly stringent hybridization in the context of the present invention indicates strong structural similarity or structural homology (e.g., nucleotide structure, base composition, arrangement or order) to, e.g., the nucleic acids provided in the sequence listings herein.
  • one measure of stringent hybridization is the ability to hybridize to one of the listed nucleic acids (e.g., nucleic acid sequences SEQ ED NO: 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565, 566, 567, 620, 622, 624, 626, 628, 630, 632, 634, 636, 638, 640, 64
  • Stringent hybridization (as well as highly stringent, ultra-high stringency, or ultra-ultra high stringency hybridization conditions) and wash conditions can easily be determined empirically for any test nucleic acid. For example, in determining highly stringent hybridization and wash conditions, the hybridization and wash conditions are gradually increased (e.g., by increasing temperature, decreasing salt concentration, increasing detergent concentration and/or increasing the concentration of organic solvents, such as formalin, in the hybridization or wash), until a selected set of criteria are met.
  • the hybridization and wash conditions are gradually increased until a probe comprising one or more nucleic acid sequences selected from SEQ ID NO: 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565, 566, 567, 620, 622, 624, 626, 628, 630, 632, 634, 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, 668, 670, 6
  • the unmatched target is a nucleic acid corresponding to a nucleic acid (other than those in the accompanying sequence listing) that is present in a public database such as GenBankTM at the time of filing of the subject application.
  • GenBankTM a public database
  • Such sequences can be identified in GenBank by one of skill. Examples include Accession Nos. Z99109 and Y09476. Additional such sequences can be identified in e.g., GenBank, by one of ordinary skill in the art.
  • a test nucleic acid is said to specifically hybridize to a probe nucleic acid when it hybridizes at least Vi as well to the probe as to the perfectly matched complementary target, i.e., with a signal to noise ratio at least l A as high as hybridization of the probe to the target under conditions in which the perfectly matched probe binds to the perfectly matched complementary target with a signal to noise ratio that is at least about 2x-10x, and occasionally 20x, 5 Ox or greater than that observed for hybridization to any of the unmatched polynucleotides of Accession Nos. Z99109 and Y09476.
  • Ultra high-stringency hybridization and wash conditions are those in wliich the stringency of hybridization and wash conditions are increased until the signal to noise ratio for binding of the probe to the perfectly matched complementary target nucleic acid is at least lOx as high as that observed for hybridization to any of the unmatched target nucleic acids of Genbank Accession numbers Z99109 and Y09476.
  • a target nucleic acid which hybridizes to a probe under such conditions, with a signal to noise ratio of at least Vi that of the perfectly matched complementary target nucleic acid is said to bind to the probe under ultra- high stringency conditions.
  • even higher levels of stringency can be determined by gradually increasing the hybridization and/or wash conditions of the relevant hybridization assay.
  • a target nucleic acid which hybridizes to a probe under such conditions, with a signal to noise ratio of at least Vi that of the perfectly matched complementary target nucleic acid is said to bind to the probe under ultra-ultra- high stringency conditions.
  • Target nucleic acids which hybridize to the nucleic acids represented by SEQ ED NO: 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558, 05/012515
  • nucleic acids examples include those with one or a few silent or conservative nucleic acid substitutions as compared to a given nucleic acid sequence. Nucleic acids which do not hybridize to each other under stringent conditions are still substantially identical if the polypeptides which they encode are substantially identical.
  • the invention provides a nucleic acid which comprises a unique subsequence in a nucleic acid selected from SEQ LD NO: 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532,
  • the unique subsequence is unique as compared to a nucleic acid corresponding to any of Genbank Accession numbers Z99109 and Y09476.
  • Such unique subsequences can be determined by aligning any of SEQ ID NO: 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565, 566, 567, 620, 622, 624, 626, 628, 630, 632, 634, 636, 638, 640, 642, 644, 646, 648, 650
  • Alignment can be perfonned using the BLAST algorithm set to default parameters. Any unique subsequence is useful, e.g., as a probe to identify the nucleic acids of the invention.
  • the invention includes a polypeptide which comprises a unique subsequence in a polypeptide selected from: SEQ ID NO: 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 621, 623, 625, 627, 629, 631, 633,
  • the unique subsequence is unique as compared to a polypeptide corresponding to that of GenBank accession number CAA70664.
  • the polypeptide is aligned against the sequences represented by accession number CAA70664. Note that if the sequence corresponds to a non-translated sequence such as a pseudo gene, the corresponding polypeptide is generated simply by in silico translation of the nucleic acid sequence into an amino acid sequence, where the reading frame is selected to correspond to the reading frame of homologous GAT polynucleotides.
  • the invention also provides for target nucleic acids which hybridize under stringent conditions to a unique coding ohgonucleotide which encodes a unique subsequence in a polypeptide selected from SEQ ED NO: 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 621, 623, 625, 627, 629, 631, 633, 635, 637, 639, 641, 643, 645, 647, 649, 651, 653, 655
  • the stringent conditions are selected such that a perfectly complementary ohgonucleotide to the coding ohgonucleotide hybridizes to the coding ohgonucleotide with at least about a 2.5x-10x higher, preferably at least about a 5-1 Ox higher signal to noise ratio than for hybridization of the perfectly complementary ohgonucleotide to a control nucleic acid corresponding to any of the control polypeptides.
  • Conditions can be selected such that higher ratios of signal to noise are observed in the particular assay which is used, e.g., about 15x, 20x, 3 Ox, 50x or more.
  • the target nucleic acid hybridizes to the unique coding ohgonucleotide with at least a 2x higher signal to noise ratio as compared to hybridization of the control nucleic acid to the coding ohgonucleotide.
  • higher signal to noise ratios can be selected, e.g., about 2.5x, 5x, lOx, 20x, 30x, 50x or more.
  • the particular signal will depend on the label used in the relevant assay, e.g., a fluorescent label, a colorimetric label, a radioactive label, or the like.
  • Vectors, Promoters and Expression Systems The present invention also includes recombinant constructs comprising one or more of the nucleic acid sequences as broadly described above.
  • the constructs comprise a vector, such as, a plasmid, a cosmid, a phage, a virus, a bacterial artificial chromosome (BAC), a yeast artificial chromosome (YAC), or the like, into which a nucleic acid sequence of the invention has been inserted, in a forward or reverse orientation, h a preferred aspect of this embodiment, the construct further comprises regulatory sequences, including, for example, a promoter, operably linked to the sequence. Large numbers of suitable vectors and promoters are known to those of skill in the art, and are commercially available.
  • PCR polymerase chain reaction
  • LCR ligase chain reaction
  • NASBA RNA polymerase mediated techniques
  • the present invention also relates to engineered host cells that are transduced (transformed or transfected) with a vector of the invention (e.g., an. invention cloning vector or an invention expression vector), as well as the production of polypeptides of the invention by recombinant techniques.
  • a vector of the invention e.g., an. invention cloning vector or an invention expression vector
  • the vector may be, for example, a plasmid, a viral particle, a phage, etc.
  • the engineered host cells can be cultured in conventional nutrient media modified as appropriate for activating promoters, selecting transformants, or amplifying the GAT homologue gene.
  • Polynucleotides of the present invention can be incorporated into any one of a variety of expression vectors suitable for expressing a polypeptide.
  • Suitable vectors include chromosomal, nonchromosomal and synthetic DNA sequences, e.g., derivatives of SV40; bacterial plasmids; phage DNA; baculovirus; yeast plasmids; vectors derived from combinations of plasmids and phage DNA, viral DNA such as vaccinia, adenovirus, fowl pox virus, pseudorabies, adenovirus, adeno-associated viruses, retroviruses and many others. Any vector that transduces genetic material into a cell, and, if replication is desired, which is replicable and viable in the relevant host can be used.
  • a polynucleotide of the invention When incorporated into an expression vector, a polynucleotide of the invention is operatively linked to an appropriate transcription control sequence (promoter) to direct mRNA synthesis.
  • transcription control sequences particularly suited for use in transgenic plants include the cauliflower mosaic virus (CaMV), figwort mosaic virus (FMV) and strawberry vein banding virus (SVBV) promoters, described in U.S. Provisional Application No. 60/245,354.
  • Other promoters known to control expression of genes in prokaryotic or eukaryotic cells or their viruses and which can be used in some embodiments of the invention include S V40 promoter, E. coli lac or trp promoter, and the phage lambda P L promoter.
  • An expression vector optionally contains a ribosome binding site for translation initiation, and a transcription terminator, such as Pinll.
  • the vector also optionally includes appropriate sequences for amplifying expression, e.g., an enhancer.
  • the expression vectors of the present invention optionally contain one or more selectable marker genes to provide a phenotypic trait for selection of transformed host cells.
  • the selectable marker gene will encode antibiotic or herbicide resistance.
  • Suitable genes include those coding for resistance to the antibiotic spectinomycin or streptomycin (e.g., the aada gene), the streptomycin phosphotransferase (SPT) gene coding for streptomycin resistance, the neomycin phosphotransferase (NPTII) gene encoding kanamycin or geneticin resistance, the hygromycin phosphotransferase (HPT) gene coding for hygromycin resistance.
  • Additional selectable marker genes include dihydrofolate reductase or neomycin resistance for eukaryotic cell culture, and tetracycline or ampicillin resistance in E. coli.
  • Suitable genes coding for resistance to herbicides include those which act to inhibit the action of acetolactate synthase (ALS), in particular the sulfonylurea-type herbicides (e.g., the acetolactate synthase (ALS) gene containing mutations leading to such resistance in particular the S4 and/or Hra mutations), those which act to inhibit the action of glutamine synthase, such as phosphinothricin or basta (e.g., the bar gene), or other such genes known in the art.
  • the bar gene encodes resistance to the herbicide basta and the ALS gene encodes resistance to the herbicide chlorsulfuron.
  • the modified GAT genes are used as selectable markers.
  • Vectors of the present invention can be employed to transform an appropriate host to permit the host to express an inventive protein or polypeptide.
  • appropriate expression hosts include: bacterial cells, such as E. coli, B. subtilis, Streptomyces, and Salmonella typhimurium; fungal cells, such as Saccharomyces cerevisiae, Pichia pastoris, and Neurospora crassa; insect cells such as Drosophila and Spodoptera frugiperda; mammalian cells such as CHO, COS, BHK, HEK 293 or Bowes melanoma; or plant cells or explants, etc.
  • GAT polypeptides are not all cells or cell lines need to be capable of producing fully functional GAT polypeptides; for example, antigenic fragments of a GAT polypeptide may be produced.
  • the present invention is not limited by the host cells employed.
  • a number of expression vectors may be selected depending upon the use intended for the GAT polypeptide. For example, when large quantities of GAT polypeptide or fragments thereof are needed for commercial production or for induction of antibodies, vectors which direct high level expression of fusion proteins that are readily purified can be desirable.
  • Such vectors include, but are not limited to, multifunctional E.
  • coli cloning and expression vectors such as BLU ⁇ SCR ⁇ PT (Stratagene), in which the GAT polypeptide coding sequence may be ligated into the vector in-frame with sequences for the amino-terminal Met and the subsequent 7 residues of beta- galactosidase so that a hybrid protein is produced; p ⁇ N vectors (Van Heeke & Schuster (1989) J Biol. Chem. 264: 5503-5509); p ⁇ T vectors (Novagen, Madison WI); and the like.
  • BLU ⁇ SCR ⁇ PT Stratagene
  • p ⁇ N vectors Vector Heeke & Schuster (1989) J Biol. Chem. 264: 5503-5509
  • p ⁇ T vectors Novagen, Madison WI
  • yeast Saccharomyces cerevisiae a number of vectors containing constitutive or inducible promoters such as alpha factor, alcohol oxidase and PGH may be used for production of the GAT polypeptides of the invention.
  • constitutive or inducible promoters such as alpha factor, alcohol oxidase and PGH
  • PGH protein oxidase
  • a coding sequence e.g., of a GAT polypeptide
  • an adenovirus transcription translation complex consisting of the late promoter and tripartite leader sequence. Insertion of a GAT polypeptide coding region into a nonessential ⁇ l or ⁇ 3 region of the viral genome will result in a viable virus capable of expressing a GAT in infected host cells (Logan and Shenk (1984) Proc. Nat'l Acad. Sci. USA 81:3655-3659).
  • transcription enhancers such as the rous sarcoma virus (RSV) enhancer
  • RSV rous sarcoma virus
  • expression can be driven from a transgene integrated into a plant chromosome, or cytoplasmically from an episomal or viral nucleic acid.
  • stably integrated transgenes it is often desirable to provide sequences capable of driving constitutive or inducible expression of the GAT polynucleotides of the invention, for example, using viral, e.g., CaMV, or plant derived regulatory sequences.
  • Numerous plant derived regulatory sequences have been described, including sequences which direct expression in a tissue specific manner, e.g., TobRB7, patatin B33, GRP gene promoters, the rbcS-3A 5/012515
  • Typical vectors useful for expression of nucleic acids in higher plants are well known in the art and include vectors derived from the tumor-inducing (Ti) plasmid of Agrobacterium tumefaciens described by Rogers et al. (1987) Meth. Enzymol 153: 253-277.
  • Exemplary y tumefaciens vectors useful herein are plasmids pKYLX6 and pKYLX7 of Schardl et al. (1987) Gene 61 : 1 - 11 and Berger et al. (1989) Proc. Natl. Acad. Sci. U.S.A. 86: 8402-8406.
  • Another useful vector herein is plasmid pBI101.2 that is available from Clontech Laboratories, Inc. (Palo Alto, CA).
  • a variety of plant viruses that can be employed as vectors are known in the art and include cauliflower mosaic virus (CaMV), geminivirus, brome mosaic virus, and tobacco mosaic virus.
  • a GAT polynucleotide construct suitable for transformation of plant cells is prepared.
  • a desired GAT polynucleotide can be incorporated into a recombinant expression cassette to facilitate introduction of the gene into a plant and subsequent expression of the encoded polypeptide.
  • An expression cassette will typically comprise a GAT polynucleotide, or functional fragment thereof, operably linked to a promoter sequence and other transcriptional and translational initiation regulatory sequences which will direct expression of the sequence in the intended tissues (e.g., entire plant, leaves, seeds) of the transformed plant.
  • a strongly or weakly constitutive plant promoter can be employed which will direct expression of the GAT polypeptide in all tissues of a plant.
  • Such promoters are active under most environmental conditions and states of development or cell differentiation.
  • constitutive promoters include the cauliflower mosaic virus (CaMV) 35S transcription initiation region, the 1'- or 2'- promoter derived from T-DNA of Agrobacterium tumefaciens, the ubiquitin 1 promoter, the Smas promoter, the cinnamyl alcohol dehydrogenase promoter (U.S. Patent No.
  • the Nos promoter the pEmu promoter, the rubisco promoter, the GRPl-8 promoter and other transcription initiation regions from various plant genes known to those of skill, h situations in which over expression of a GAT polynucleotide is detrimental to the plant or otherwise undesirable, one of skill, upon review of this disclosure, will recognize that weak constitutive promoters can be used for low-levels of expression.
  • a strong promoter e.g., a t-RNA or other pol III promoter, or a strong pol II promoter, such as the cauliflower mosaic virus promoter
  • a plant promoter may be under environmental control.
  • inducible promoters are referred to here as "inducible" promoters.
  • inducible promoters include pathogen attack, anaerobic conditions, or the presence of light, hi particular, examples of inducible promoters are the Adhl promoter which is inducible by hypoxia or cold stress, the Hsp70 promoter which is inducible by heat stress, and the PPDK promoter which is inducible by light. Also useful are promoters which are chemically inducible.
  • the promoters used in the present invention can be "tissue-specific" and, as such, under developmental control in that the polynucleotide is expressed only in certain tissues, such as leaves, roots, fruit, flowers and/or seeds.
  • An exemplary promoter is the anther specific promoter 5126 (U.S. Patent Nos. 5,689,049 and 5,689,051).
  • seed-preferred promoters include, but are not limited to, 27 kD gamma zein promoter and waxy promoter, Boronat et al. 1986) Plant Sci. 47, 95-102; Reina et al. (1990) Nucleic Acids Res. 18 (21): 6426; and Kloesgen et al. (1986) Mol. Gen. Genet. 203: 237-244. Promoters that express in the embryo, pericarp, and endosperm are disclosed in U.S. Patent Application Ser. Nos.
  • the endogenous promoters (or variants thereof) from these genes can be employed for directing expression of the genes in the transfected plant.
  • Tissue-specific promoters can also be used to direct expression of heterologous polynucleotides.
  • the particular promoter used in the expression cassette in plants depends on the intended application.
  • Either heterologous or non-heterologous (i.e., endogenous) promoters can be employed to direct expression of the nucleic acids of the present invention.
  • promoters can also be used, for example, in expression cassettes to drive expression of antisense nucleic acids to reduce, increase, or alter the concentration and/or composition of the proteins of the present invention in a desired tissue.
  • Any of a number of promoters which direct transcription in plant cells are suitable.
  • the promoter can be either constitutive or inducible.
  • promoters of bacterial origin which operate in plants include the octopine synthase promoter, the nopaline synthase promoter and other promoters derived from native Ti plasmids (see, Herrara-Estrella et al. (1983) Nature 303: 209-213).
  • Viral promoters include the 35S and 19S RNA promoters of cauliflower mosaic virus (Odell et al. (1985) Nature 313: 810-812).
  • Other plant promoters include the ribulose-1,3- bisphosphate carboxylase small subunit promoter and the phaseolin promoter.
  • the promoter sequence from the E8 gene and other genes may also be used. The isolation and sequence of the E8 promoter is described in detail in Deikman and Fischer (1988) EMBOJ. 7: 3315-3327. To identify candidate promoters, the 5' portions of a genomic clone is analyzed for sequences characteristic of promoter sequences.
  • promoter sequence elements include the TATA box consensus sequence (TATAAT), which is usually 20 to 30 base pairs upstream of the transcription start site, hi plants, further upstream from the TATA box, at positions -80 to -100, there is typically a promoter element with a series of adenines surrounding the trinucleotide G (or T) as described by Messing et al. (1983) Genetic Engineering in Plants, eds. Kosage, et al, pp. 221-227.
  • sequences other than the promoter and the cojoined polynucleotide can also be employed.
  • a polyadenylation region at the 3 '-end of a GAT-encoding region can be included.
  • the polyadenylation region can be derived, for example, from a variety of plant genes, or from T-DNA.
  • the 3' end sequence to be added can be derived from, for example, the nopaline synthase or octopine synthase genes, or alternatively from another plant gene, or less preferably from any other eukaryotic gene.
  • An intron sequence can be added to the 5' untranslated region of the coding sequence or the partial coding sequence to increase the amount of the mature message that accumulates. See for example Bucliman and Berg (1988) Mol. Cell Biol.
  • the marker may encode biocide tolerance, particularly antibiotic tolerance, such as tolerance to kanamycin, G418, bleomycin, hygromycin, or herbicide tolerance, such as tolerance to chlorsulfuron, or phosphinothricin (the active ingredient in the herbicides bialaphos and Basta).
  • antibiotic tolerance such as tolerance to kanamycin, G418, bleomycin, hygromycin
  • herbicide tolerance such as tolerance to chlorsulfuron, or phosphinothricin (the active ingredient in the herbicides bialaphos and Basta).
  • Specific initiation signals can aid in efficient translation of a GAT polynucleo tide-encoding sequence of the present invention. These signals can include, e.g., the ATG initiation codon and adjacent sequences. In cases where a GAT polypeptide-encoding sequence, its initiation codon and upstream sequences are inserted into an appropriate expression vector, no additional translational control signals may be needed.
  • exogenous transcriptional control signals including the initiation codon must be provided.
  • the initiation codon must be in the correct reading frame to ensure transcription of the entire insert.
  • Exogenous transcriptional elements and initiation codons can be of various origins, both natural and synthetic. The efficiency of expression may be enhanced by the inclusion of enhancers appropriate to the cell system in use (Scharf et al. (1994) Results Probl Cell Differ. 20: 125-62 and Bittner et al. (1987) Methods in Enzymol 153: 516-544).
  • Polynucleotides of the invention can also be fused, for example, in-frame to nucleic acids encoding a secretion localization sequence, to target polypeptide expression to a desired cellular compartment, membrane, or organelle of a host cell, or to direct polypeptide secretion to the periplasmic space or into the cell culture media.
  • sequences are known to those of skill, and include secretion leader peptides, organelle targeting sequences (e.g., nuclear localization sequences, ER retention signals, mitochondrial transit sequences, and chloroplast transit sequences), membrane localization/anchor sequences (e.g., stop transfer sequences, GPI anchor sequences), and the like.
  • a polynucleotide of the invention is fused in frame with an N-terminal chloroplast transit sequence (or chloroplast transit peptide sequence) derived from a gene encoding a polypeptide that is normally targeted to the chloroplast.
  • chloroplast transit sequence or chloroplast transit peptide sequence
  • Such sequences are typically rich in serine and threonine; are deficient in aspartate, glutamate, and tyrosine; and generally have a central domain rich in positively charged amino acids.
  • Expression Hosts hi a further embodiment, the present invention relates to host cells containing the above-described constructs.
  • the host cell can be a eukaryotic cell, such as a mammalian cell, a yeast cell, or a plant cell, or the host cell can be a prokaryotic cell, such as a bacterial cell.
  • Introduction of the construct into the host cell can be effected by calcium phosphate transfection, DEAE-Dextran mediated transfection, electroporation, or other common techniques (Davis et al, Basic Methods in Molecular Biology).
  • a host cell is optionally chosen for its ability to modulate the expression of the inserted sequences or to process the expressed protein in the desired fashion. Such modifications of the protein include, but are not limited to, acetylation, carboxylation, glycosylation, phosphorylation, lipidation and acylation.
  • Post- translational processing that cleaves a "pre” or a "prepro” form of the protein may also be important for correct insertion, folding and/or function.
  • Different host cells such as E. coli, Bacillus sp., yeast or mammalian cells such as CHO, HeLa, BHK, MDCK, 293, WI38, etc. have specific cellular machinery and characteristic mechanisms, e.g. , for post-translational activities and may be chosen to ensure the desired modification and processing of the introduced, foreign protein.
  • stable expression systems can be used.
  • plant cells, explants or tissues e.g.
  • shoots, or leaf discs, which stably express a polypeptide of the invention are transduced using expression vectors wliich contain viral origins of replication or endogenous expression elements and a selectable marker gene.
  • cells may be allowed to grow for a period determined to be appropriate for the cell type, e.g., 1 or more hours for bacterial cells, 1-4 days for plant cells, 2-4 weeks for some plant explants, in an enriched media before they are switched to selective media.
  • the purpose of the selectable marker is to confer resistance to selection, and its presence allows growth and recovery of cells which successfully express the introduced sequences.
  • transgenic plants expressing the polypeptides of the invention can be selected directly for resistance to the herbicide, glyphosate.
  • Resistant embryos derived from stably transformed explants can be proliferated, e.g., using tissue culture techniques appropriate to the cell type.
  • Host cells transformed with a nucleotide sequence encoding a polypeptide of the invention are optionally cultured under conditions suitable for the expression and recovery of the encoded protein from cell culture.
  • the protein or fragment thereof produced by a recombinant cell may be secreted, membrane-bound, or contained intracellularly, depending on the sequence and/or the vector used.
  • expression vectors containing GAT polynucleotides of the invention can be designed with signal sequences which direct secretion of the mature polypeptides through a prokaryotic or eukaryotic cell membrane.
  • Polynucleotides of the present invention may also comprise a coding sequence fused in- frame to a marker sequence that, e.g., facilitates purification of the encoded polypeptide.
  • purification facilitating domains include, but are not limited to, metal chelating peptides such as histidine-tryptophan modules that allow purification on immobilized metals, a sequence which binds glutathione (e.g., GST), a hemagglutinin (HA) tag (corresponding to an epitope derived from the influenza hemagglutinin protein; Wilson et al.
  • One expression vector contemplated for use in the compositions and methods described herein provides for expression of a fusion protein comprising a polypeptide of the invention fused to a polyhistidine region separated by an enterokinase cleavage site.
  • the histidine residues facilitate purification on EMIAC (immobilized metal ion affinity chromatography, as described in Porath et al.
  • pGEX vectors may also be used to express foreign polypeptides as fusion proteins with glutathione S-transferase (GST).
  • GST glutathione S-transferase
  • fusion proteins are soluble and can easily be purified from lysed cells by adsorption to ligand-agarose beads (e.g., glutathione-agarose in the case of GST-fusions) followed by elution in the presence of free ligand.
  • the selected promoter is induced by appropriate means (e.g., temperature shift or chemical induction) and cells are cultured for an additional period.
  • Cells are typically harvested by centrifugation, disrupted by physical or chemical means, and the resulting crude extract retained for further purification.
  • Microbial cells employed in the expression of proteins can be disrupted by any convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing agents, or other methods, which are well known to those skilled in the art.
  • many references are available for the culture and production of many cells, including cells of bacterial, plant, animal (especially mammalian) and archebacterial origin.
  • Polypeptides of the invention can be recovered and purified from recombinant cell cultures by any of a number of methods well known in the art, including ammonium sulfate or ethanol precipitation, acid extraction, anion or cation exchange chromatography, phosphocellulose chromatography, hydrophobic interaction chromatography, affinity chromatography (e.g., using any of the tagging systems noted herein), hydroxylapatite chromatography, and lectin chromatography. Protein refolding steps can be used, as desired, in completing the configuration of the mature protein. Finally, high performance liquid chromatography (HPLC) can be employed in the final purification steps.
  • HPLC high performance liquid chromatography
  • GAT polypeptide of the invention in a large scale suitable for industrial and/or commercial applications, hi such cases bulk fermentation procedures are employed.
  • a GAT polynucleotide e.g.
  • a polynucleotide comprising any one of SEQ ED NO: 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565, 566, 567, 620, 622, 624, 626, 628, 630, 632, 634, 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, 668, 670, 672, 674, 676, 678, 6
  • U.S. Patent No. 5,955,310 to Widner et al. describes a vector with tandem promoters, and stabilizing sequences operably linked to a polypeptide encoding sequence.
  • the vector After inserting the polynucleotide of interest into a vector, the vector is transformed into a bacterial, e.g., a Bacillus subtilis strain PL1801IEE (amyE, apr, npr, s ⁇ oIEE::Tn917) host.
  • an expression vector into a Bacillus cell may, for instance, be effected by protoplast transformation (see, e.g., Chang and Cohen (l979)_Mol Gen. Genet. 168:111), by using competent cells (see, e.g., Young and Spizizin (1961) J. Bacteriol. 81:823, or Dubnau and Davidoff-Abelson (197L ) J Mol. Biol. 56: 209), by electroporation (see, e.g., Shigekawa and Dower (1988) Biotechniques 6: 742), or by conjugation (see, e.g., Koehler and Thorne (1987) J Bacteriol.
  • protoplast transformation see, e.g., Chang and Cohen (l979)_Mol Gen. Genet. 168:111
  • competent cells see, e.g., Young and Spizizin (1961) J. Bacteriol. 81:823, or Dubnau and Davidoff-Abelson (197L
  • the transformed cells are cultivated in a nutrient medium suitable for production of the polypeptide using methods that are known in the art.
  • the cell may be cultivated by shake flask cultivation, small-scale or large- scale fermentation (including continuous, batch, fed-batch, or solid state fermentations) in laboratory or industrial fermentors performed in a suitable medium and under conditions allowing the polypeptide to be expressed and/or isolated.
  • the cultivation takes place in a suitable nutrient medium comprising carbon and nitrogen sources and inorganic salts, using procedures known in the art. Suitable media are available from commercial suppliers or may be prepared according to published compositions (e.g., in catalogues of the American Type Culture Collection).
  • the secreted polypeptide can be recovered directly from the medium.
  • the resulting polypeptide may be isolated by methods known in the art.
  • the polypeptide may be isolated from the nutrient medium by conventional procedures including, but not limited to, centrifugation, filtration, extraction, spray-drying, evaporation, or precipitation.
  • the isolated polypeptide may then be further purified by a variety of procedures known in the art including, but not limited to, chromatography (e.g., ion exchange, affinity, hydrophobic, chromatofocusing, and size exclusion), electrophoretic procedures (e.g., preparative isoelectric focusing), differential solubility (e.g., ammonium sulfate precipitation), or extraction (see, e.g., Bollag et al. (1996) Protein Methods, 2 nd ed. (Wiley-Liss, NY) and Walker (1996) The Protein Protocols Handbook (Humana Press, NJ).
  • Cell-free transcription/translatioii systems can also be employed to produce polypeptides using DNAs or RNAs of the present invention. Several such systems are commercially available. A general guide to in vitro transcription and translation protocols is found in Tymms (1995) In vitro Transcription and Translation Protocols: Methods in Molecular Biology (Garland Publishing, NY), vol. 37.
  • the polynucleotides of the invention are optionally used as substrates for a variety of diversity generating procedures, e.g., mutation, recombination and recursive recombination reactions, in addition to their use in standard cloning methods as set forth in, e.g., Ausubel, Berger and Sambrook, to produce additional GAT polynucleotides and polypeptides with desired properties.
  • diversity generating protocols are available and described in the art.
  • the procedures can be used separately, and/or in combination to produce one or more variants of a polynucleotide or set of polynucleotides, as well variants of encoded proteins.
  • any of the diversity generating procedures described herein can be the generation of one or more polynucleotides, which can be selected or screened for polynucleotides that encode proteins with or which confer desirable properties.
  • any polynucleotides that are produced can be selected for a desired activity or property, e.g. altered K m for glyphosate, altered K m for acetyl CoA, use of alternative cofactors (e.g., propionyl CoA) 05/012515
  • GAT homologs with increased specific activity can be detected by assaying the conversion of glyphosate to N-acetylglyphosate, e.g., by mass spectrometry.
  • improved ability to confer resistance to glyphosate can be assayed by growing bacteria transformed with a nucleic acid of the invention on agar containing increasing concentrations of glyphosate or by spraying transgenic plants incorporating a nucleic acid of the invention with glyphosate.
  • Mutational methods of generating diversity include, for example, site- directed mutagenesis (Ling et al. (1997) "Approaches to DNA mutagenesis: an overview” Anal Biochem. 254(2): 157-178; Dale et al. (1996) “Oligonucleotide- directed random mutagenesis using the phosphorothioate method” Methods Mol. Biol. 57:369-374; Smith (1985) "In vitro mutagenesis” Aim. Rev. Genet.
  • FRAGMENT ISOLATION by Affholter (USSN 60/186,482, filed March 2, 2000).
  • sequence modification methods such as mutation, recombination, etc. are applicable to the present invention and set forth in the references above. That is, alterations to the component nucleic acid sequences to produced modified gene fusion constructs can be performed by any number of the protocols described, either before cojoining of the sequences, or after the cojoining step.
  • the following exemplify some of the different types of preferred formats for diversity generation in the context of the present invention, including, e.g. , certain recombination based diversity generation formats.
  • Nucleic acids can be recombined in vitro by any of a variety of techniques discussed in the references above, including e.g., DNAse digestion of nucleic acids to be recombined followed by ligation and/or PCR reassembly of the nucleic acids.
  • DNAse digestion of nucleic acids to be recombined followed by ligation and/or PCR reassembly of the nucleic acids.
  • sexual PCR mutagenesis can be used in which random (or pseudo random, or even non-random) fragmentation of the DNA molecule is followed by recombination, based on sequence similarity, between DNA molecules with different but related DNA sequences, in vitro, followed by fixation of the crossover by extension in a polymerase chain reaction.
  • This process and many process variants is described in several of the references above, e.g., in Stemmer (1994) Proc.
  • nucleic acids can be recursively recombined in vivo, e.g., by allowing recombination to occur between nucleic acids in cells.
  • Many such in vivo recombination formats are set forth in the references noted above. Such formats optionally provide direct recombination between nucleic acids of interest, or provide recombination between vectors, viruses, plasmids, etc., comprising the nucleic acids of interest, as well as other formats. Details regarding such procedures are found in the references noted above.
  • Whole genome recombination methods can also be used in which whole genomes of cells or other organisms are recombined, optionally including spiking of the genomic recombination mixtures with desired library components (e.g., genes corresponding to the pathways of the present invention). These methods have many applications, including those in which the identity of a target gene is not known. Details on such methods are found, e.g., in WO 98/31837 by del Cardayre et al.
  • Synthetic recombination methods can also be used, in which oligonucleotides corresponding to targets of interest are synthesized and reassembled in PCR or ligation reactions which include oligonucleotides which correspond to more than one parental nucleic acid, thereby generating new recombined nucleic acids.
  • Oligonucleotides can be made by standard nucleotide addition methods, or can be made, e.g., by tri-nucleotide synthetic approaches.
  • the resulting recombined sequence strings are optionally converted into nucleic acids by synthesis of nucleic acids which correspond to the recombined sequences, e.g., in concert with ohgonucleotide synthesis gene reassembly techniques. This approach can generate random, partially random or designed variants.
  • This methodology is generally applicable to the present invention in providing for recombination of nucleic acid sequences and/or gene fusion constructs encoding proteins involved in various metabolic pathways (such as, for example, carotenoid biosynthetic pathways, ectoine biosynthetic pathways, polyhydroxyalkanoate biosynthetic pathways, aromatic polyketide biosynthetic pathways, and the like) in silico and/or the generation of corresponding nucleic acids or proteins.
  • metabolic pathways such as, for example, carotenoid biosynthetic pathways, ectoine biosynthetic pathways, polyhydroxyalkanoate biosynthetic pathways, aromatic polyketide biosynthetic pathways, and the like.
  • the fragment population derived from the genomic libraryries is annealed with partial, or, often approximately full length ssDNA or RNA corresponding to the opposite strand. Assembly of complex chimeric genes from this population is then mediated by nuclease-base removal of non-hybridizing fragment ends, polymerization to fill gaps between such fragments and subsequent single stranded ligation.
  • the parental polynucleotide strand can be removed by digestion (e.g., if RNA or uracil-containing), magnetic separation under denaturing conditions (if labeled in a manner conducive to such separation) and other available se aration/purification methods.
  • the parental strand is optionally co-purified with the chimeric strands and removed during subsequent screening and processing steps. Additional details regarding this approach are found, e.g., in "Single-Stranded Nucleic Acid Template-Mediated Recombination and Nucleic Acid Fragment Isolation" by Affholter, WO 01/64864.
  • single-stranded molecules are converted to double-stranded DNA (dsDNA) and the dsDNA molecules are bound to a solid support by ligand-mediated binding. After separation of unbound DNA, the selected DNA molecules are released from the support and introduced into a suitable host cell to generate a library of enriched sequences which hybridize to the probe.
  • a library produced in this manner provides a desirable substrate for further diversification using any of the procedures described herein. Any of the preceding general recombination formats can be practiced in a reiterative fashion (e.g., one or more cycles of mutation/recombination or other diversity generation methods, optionally followed by one or more selection methods) to generate a more diverse set of recombinant nucleic acids.
  • Mutagenesis employing polynucleotide chain termination methods have also been proposed (.see e.g., U.S. Patent No. 5,965,408, "Method of DNA reassembly by interrupting synthesis” to Short, and the references above), and can be applied to the present invention.
  • double stranded DNAs corresponding to one or more genes sharing regions of sequence similarity are combined and denatured, in the presence or absence of primers specific for the gene.
  • the single stranded polynucleotides are then annealed and incubated in the presence of a polymerase and a chain terminating reagent (e.g.
  • partial duplex molecules e.g., containing partially extended chains, are then denatured and reannealed in subsequent rounds of replication or partial replication resulting in polynucleotides which share varying degrees of sequence similarity and which are diversified with respect to the starting population of DNA molecules.
  • the products, or partial pools of the products can be amplified at one or more stages in the process.
  • Polynucleotides produced by a chain termination method, such as described above, are suitable substrates for any other described recombination format.
  • Diversity also can be generated in nucleic acids or populations of nucleic acids using a recombinational procedure termed “incremental truncation for the creation of hybrid enzymes" (“ITCHY”) described in Ostermeier et al. (1999) "A combinatorial approach to hybrid enzymes independent of DNA homology" Nature Biotech 17:1205. This approach can be used to generate an initial library of variants which can optionally serve as a substrate for one or more in vitro ox in vivo recombination methods.
  • mutagenesis methods are found in the above-cited references; additional details regarding mutagenesis methods can be found in following, which can also be applied to the present invention.
  • error-prone PCR can be used to generate nucleic acid variants. Using this technique, PCR is performed under conditions where the copying fidelity of the DNA polymerase is low, such that a high rate of point mutations is obtained along the entire length of the PCR product. Examples of such techniques are found in the references above and, e.g., in Leung et al (1989) Technique 1 : 11-15 and Caldwell et al (1992) PCR Methods Applic. 2: 28-33.
  • assembly PCR can be used, in a process which involves the assembly of a PCR product from a mixture of small DNA fragments.
  • a large number of different PCR reactions can occur in parallel in the same reaction mixture, with the products of one reaction priming the products of another reaction.
  • Ohgonucleotide directed mutagenesis can be used to introduce site- specific mutations in a nucleic acid sequence of interest. Examples of such techniques are found in the references above and, e.g., in Reidhaar-Olson et al. (1988) Science 241 :53-57.
  • cassette mutagenesis can be used in a process that replaces a small region of a double stranded DNA molecule with a synthetic ohgonucleotide cassette that differs from the native sequence.
  • the ohgonucleotide can contain, e.g., completely and/or partially randomized native sequence(s).
  • Recursive ensemble mutagenesis is a process in which an algorithm for protein mutagenesis is used to produce diverse populations of phenotypically related mutants, members of which differ in amino acid sequence. This method uses a feedback mechanism to monitor successive rounds of combinatorial cassette mutagenesis. Examples of this approach are found in Arkin & Youvan (1992) Proc. Natl Acad. Sci. USA 89:7811-7815.
  • Exponential ensemble mutagenesis can be used for generating combinatorial libraries with a high percentage of unique and functional mutants. Small groups of residues in a sequence of interest are randomized in parallel to identify, at each altered position, amino acids which lead to functional proteins. Examples of such procedures are found in Delegrave & Youvan (1993) Biotech. Res. 11:1548-1552.
  • In vivo mutagenesis can be used to generate random mutations in any cloned DNA of interest by propagating the DNA, e.g., in a strain of E. coli that carries mutations in one or more of the DNA repair pathways. These "mutator" strains have a higher random mutation rate than that of a wild-type parent.
  • Transformation of a suitable host with such multimers consisting of genes that are divergent with respect to one another, (e.g., derived from natural diversity or through application of site directed mutagenesis, error prone PCR, passage through mutagenic bacterial strains, and the like), provides a source of nucleic acid diversity for DNA diversification, e.g., by an in vivo recombination process as indicated above.
  • a multiplicity of monomeric polynucleotides sharing regions of partial sequence similarity can be transformed into a host species and recombined in vivo by the host cell.
  • Subsequent rounds of cell division can be used to generate libraries, members of which, include a single, homogenous population, or pool of monomeric polynucleotides.
  • the monomeric nucleic acids can be recovered by standard techniques, e.g., PCR and/or cloning, and recombined in any of the recombination formats, including recursive recombination formats, described above.
  • Methods for generating multispecies expression libraries have been described (in addition to the references noted above, see, e.g., Peterson et ⁇ l (1998) U.S. Pat. No.
  • Multispecies expression libraries include, in general, libraries comprising cDNA or genomic sequences from a plurality of species or strains, operably linked to appropriate regulatory sequences, in an expression cassette.
  • the cDNA and/or genomic sequences are optionally randomly ligated to further enhance diversity.
  • the vector can be a shuttle vector suitable for transformation and expression in more than one species of host organism, e.g., bacterial species or eukaryotic cells;
  • the library is biased by preselecting sequences which encode a protein of interest, or which hybridize to a nucleic acid of interest. Any such libraries can be provided as substrates for any of the methods herein described. The above described procedures have been largely directed to increasing nucleic acid and/or encoded protein diversity.
  • recombined CDRs derived from B cell cDNA libraries can be amplified and assembled into framework regions (e.g., Jirholt et al. (1998) "Exploiting sequence space: shuffling in vivo formed complementarity determining regions into a master framework” Gene 215 : 471) prior to diversifying according to any of the methods described herein. Libraries can be biased towards nucleic acids which encode proteins with desirable enzyme activities.
  • the clone can be mutagenized using any known method for introducing DNA alterations.
  • a library comprising the mutagenized homologues is then screened for a desired activity, which can be the same as or different from the initially specified activity.
  • Desired activities can be identified by any method known in the art.
  • WO 99/10539 proposes that gene libraries can be screened by combining extracts from the gene library with components obtained from metabolically rich cells and identifying combinations which exhibit the desired activity. It has also been proposed (e.g., WO 98/58085) that clones with desired activities can be identified by inserting bioactive substrates into samples of the library, and detecting bioactive fluorescence corresponding to the product of a desired activity using a fluorescent analyzer, e.g., a flow cytometry device, a CCD, a fluorometer, or a spectrophotometer. Libraries can also be biased towards nucleic acids which have specified characteristics, e.g., hybridization to a selected nucleic acid probe.
  • a fluorescent analyzer e.g., a flow cytometry device, a CCD, a fluorometer, or a spectrophotometer.
  • Libraries can also be biased towards nucleic acids which have specified characteristics, e.g., hybridization to a selected nucleic acid probe.
  • WO 99/10539 proposes that polynucleotides encoding a desired activity (e.g., an enzymatic activity, for example: a lipase, an esterase, a protease, a glycosidase, a glycosyl transferase, a phosphatase, a kinase, an oxygenase, a peroxidase, a hydrolase, a hydratase, a nitrilase, a transaminase, an amidase or an acylase) can be identified from among genomic DNA sequences.
  • a desired activity e.g., an enzymatic activity, for example: a lipase, an esterase, a protease, a glycosidase, a glycosyl transferase, a phosphatase, a kinase, an oxygenase, a peroxidase, a hydrolase, a
  • single stranded DNA molecules from a population of genomic DNA are hybridized to a ligand-conjugated probe.
  • the genomic DNA can be derived from either a cultivated or uncultivated microorganism, or from an environmental sample.
  • the genomic DNA can be derived from a multicellular organism, or a tissue derived therefrom.
  • Second strand synthesis can be conducted directly from the hybridization probe used in the capture, with or without prior release from the capture medium or by a wide variety of other strategies known in the art.
  • the isolated single-stranded genomic DNA population can be fragmented without further cloning and used directly in, e.g., a recombination- based approach, that employs a single-stranded template, as described above.
  • “Non-stochastic" methods of generating nucleic acids and polypeptides are described in Short "Non-Stochastic Generation of Genetic
  • Vaccines and Enzymes WO 00/46344. These methods, including proposed non- stochastic polynucleotide reassembly and site-saturation mutagenesis methods can be applied to the present invention as well. Random or semi-random mutagenesis using doped or degenerate oligonucleotides is also described in, e.g., Arkin and Youvan (1992) "Optimizing nucleotide mixtures to encode specific subsets of amino acids for semi-random mutagenesis" Biotechnology 10:297-300; Reidhaar- Olson et al. (1991) "Random mutagenesis of protein sequences using ohgonucleotide cassettes” Methods Enzymol.
  • any of the above described methods can be practiced recursively or in combination to alter nucleic acids, e.g., GAT encoding polynucleotides.
  • Kits for mutagenesis, library construction and other diversity generation methods are also commercially available.
  • kits are available from, e.g., Stratagene (e.g., QuickChangeTM site-directed mutagenesis kit; and ChameleonTM double-stranded, site-directed mutagenesis kit); Bio/Can Scientific, Bio-Rad (e.g., using the Kunkel method described above); Boehringer Mannheim Corp.; Clonetech Laboratories; DNA Technologies; Epicentre Technologies (e.g., 5 prime 3 prime kit); Genpak Inc.; Lemargo Inc.; Life Technologies (Gibco BRL); New England Biolabs; Pharmacia Biotech; Promega Corp.; Quantum Biotechnologies; Amersham International pic (e.g., using the Eckstein method above); and Boothn Biotechnology Ltd (e.g., using the Carter/Winter method above).
  • Stratagene e.g., QuickChangeTM site-directed mutagenesis kit; and ChameleonTM double-stranded, site-directed mutagenesis kit
  • Bio/Can Scientific, Bio-Rad e.g.,
  • nucleic acids of the present invention can be recombined (with each other, or with related (or even unrelated) sequences) to produce a diverse set of recombinant nucleic acids for use in the gene fusion constructs and modified gene fusion constructs of the present invention, including, e.g., sets of homologous nucleic acids, as well as corresponding polypeptides.
  • modified polynucleotides generate a large number of diverse variants of a parental sequence or sequences.
  • the modification technique e.g., some form of shuffling
  • a library of variants that is then screened for a modified polynucleotide or pool of modified polynucleotides encoding some desired functional attribute, e.g., improved GAT activity.
  • exemplary enzymatic activities that can be screened for include catalytic rates (conventionally characterized in terms of kinetic constants such as k cat and K M ), substrate specificity, and susceptibility to activation or inhibition by substrate, product or other molecules (e.g., inhibitors or activators).
  • One example of selection for a desired enzymatic activity entails growing host cells under conditions that inhibit the growth and/or survival of cells that do not sufficiently express an enzymatic activity of interest, e.g. the GAT activity. Using such a selection process can eliminate from consideration all modified polynucleotides except those encoding a desired enzymatic activity. For example, in some embodiments of the invention host cells are maintained under conditions that inhibit cell growth or survival in the absence of sufficient levels of GAT, e.g., a concentration of glyphosate that is lethal or inhibits the growth of a wild-type plant of the same variety that either lacks or does not express a GAT polynucleotide.
  • a host cell harboring a modified nucleic acid that encodes enzymatic activity or activities able to catalyze production of sufficient levels of the product will survive and grow.
  • Some embodiments of the invention employ multiple rounds of screening at increasing concentrations of glyphosate or a glyphosate analog.
  • mass spectrometry is used to detect the acetylation of glyphosate, or a glyphosate analog or metabolite. The use of mass spectrometry is described in more detail in the Examples below. For convenience and high throughput it will often be desirable to screen/select for desired modified nucleic acids in a microorganism, e.g., a bacteria such as E. coli.
  • screening in plant cells or plants can in some cases be preferable where the ultimate aim is to generate a modified nucleic acid for expression in a plant system.
  • some preferred embodiments of the invention throughput is increased by screening pools of host cells expressing different modified nucleic acids, either alone or as part of a gene fusion construct. Any pools showing significant activity can be deconvoluted to identify single clones expressing the desirable activity.
  • the skilled artisan will recognize that the relevant assay, screening or selection method will vary depending upon the desired host organism and other parameters known in the art. It is normally advantageous to employ an assay that can be practiced in a high-throughput format. In high-throughput assays, it is possible to screen up to several thousand different variants in a single day.
  • each well of a microtiter plate can be used to run a separate assay, or, if concentration or incubation time effects are to be observed, every 5-10 wells can test a single variant.
  • fluidic approaches it is possible, as mentioned above, simply to grow cells on media plates that select for the desired enzymatic or metabolic function. This approach offers a simple and high-throughput screening method.
  • robotic systems have also been developed for solution phase chemistries useful in assay systems. These systems include automated workstations like the automated synthesis apparatus developed by Takeda Chemical Industries, LTD.
  • Optical images viewed (and, optionally, recorded) by a camera or other recording device are optionally further processed in any of the embodiments herein, e.g., by digitizing the image and/or storing and analyzing the image on a computer.
  • a variety of commercially available peripheral equipment and software is available for digitizing, storing and analyzing a digitized video or digitized optical image, e.g., using PC (Intel x86 or Pentium chip compatible DOSTM, OSTM WINDOWSTM, WINDOWS NTTM or WINDOWS 95TM based machines), MACE TOSHTM, or UNIX based (e.g., SUNTM work station) computers.
  • a CCD camera includes an array of picture elements (pixels).
  • the light from the specimen is imaged on the CCD.
  • Particular pixels corresponding to regions of the specimen e.g., individual hybridization sites on an array of biological polymers
  • Multiple pixels are processed in parallel to increase speed.
  • the apparatus and methods of the invention are easily used for viewing any sample, e.g. by fluorescent or dark field microscopic techniques.
  • OTHER POLYNUCLEOTIDE COMPOSITIONS The invention also includes compositions comprising two or more polynucleotides of the invention (e.g., as substrates for recombination).
  • composition can comprise a library of recombinant nucleic acids, where the library contains at least 2, 3, 5, 10, 20, or 50 or more polynucleotides.
  • the polynucleotides are optionally cloned into expression vectors, providing expression libraries.
  • the invention also includes compositions produced by digesting one or more polynucleotide of the invention with a restriction endonuclease, an RNAse, or a DNAse (e.g., as is performed in certain of the recombination formats noted above); and compositions produced by fragmenting or shearing one or more polynucleotide of the invention by mechanical means (e.g., sonication, vortexing, and the like), which can also be used to provide substrates for recombination in the methods above.
  • compositions comprising sets of oligonucleotides corresponding to more than one nucleic acid of the invention are useful as recombination substrates and are a feature of the invention.
  • fragmented nucleic acid sets For convenience, these fragmented, sheared, or ohgonucleotide synthesized mixtures are referred to as fragmented nucleic acid sets. Also included in the invention are compositions produced by incubating one or more of the fragmented nucleic acid sets in the presence of ribonucleotide- or deoxyribonucelotide triphosphates and a nucleic acid polymerase. This resulting composition forms a recombination mixture for many of the recombination formats noted above.
  • the nucleic acid polymerase may be an RNA polymerase, a DNA polymerase, or an RNA-directed DNA polymerase (e.g., a "reverse transcriptase"); the polymerase can be, e.g., a thermostable DNA polymerase (such as, VENT, TAQ, or the like).
  • the present invention provides computers, computer readable media and integrated systems comprising character strings corresponding to the sequence information herein for the polypeptides and nucleic acids herein, including, e.g., those sequences listed herein and the various silent substitutions and conservative substitutions thereof.
  • various methods and genetic algorithms (GAs) known in the art can be used to detect homology or similarity between different character strings, or can be used to perform other desirable functions such as to control output files, provide the basis for making presentations of information including the sequences and the like. Examples include BLAST, discussed supra. Thus, different types of homology and similarity of various stringency and length can be detected and recognized in the integrated systems described herein.
  • homology determination methods have been designed for comparative analysis of sequences of biopolymers, for spell-checking in word processing, and for data retrieval from various databases.
  • models that simulate annealing of complementary homologous polynucleotide strings can also be used as a foundation of sequence alignment or other operations typically performed on the character strings corresponding to the sequences herein (e.g., word-processing manipulations, construction of figures comprising sequence or subsequence character strings, output tables, etc.).
  • BLAST An example of a software package with GAs for calculating sequence similarity is BLAST, which can be adapted to the present invention by inputting character strings corresponding to the sequences herein.
  • standard desktop applications such as word processing software (e.g., Microsoft WordTM or Corel WordPerfectTM) and database software (e.g., spreadsheet software such as Microsoft ExcelTM, Corel Quattro ProTM, or database programs such as Microsoft AccessTM or ParadoxTM) can be adapted to the present invention by inputting a character string corresponding to the GAT homologues of the invention (either nucleic acids or proteins, or both).
  • the integrated systems can include the foregoing software having the appropriate character string information, e.g., used in conjunction with a user interface (e.g., a GUI in a standard operating system such as a Windows, Macintosh or LINUX system) to manipulate strings of characters.
  • a user interface e.g., a GUI in a standard operating system such as a Windows, Macintosh or LINUX system
  • specialized alignment programs such as BLAST can also be incorporated into the systems of the invention for alignment of nucleic acids or proteins (or corresponding character strings).
  • Integrated systems for analysis in the present invention typically include a digital computer with GA software for aligning sequences, as well as data sets entered into the software system comprising any of the sequences herein.
  • the computer can be, e.g., a PC (Intel x86 or Pentium chip compatible DOSTM, OS2TM WINDOWSTM WINDOWS NTTM, WENDOWS95TM, WE DOWS98TM LINUX based machine, a MACINTOSHTM, Power PC, or a UNLX based (e.g., SUNTM work station) machine) or other commercially common computer which is known to one of skill.
  • Software for aligning or otherwise manipulating sequences is available, or can easily be constructed by one of skill using a standard programming language such as Visualbasic, Fortran, Basic, Java, or the like.
  • Any controller or computer optionally includes a monitor which is often a cathode ray tube ("CRT") display, a flat panel display (e.g., active matrix liquid crystal display, liquid crystal display), or others.
  • Computer circuitry is often placed in a box wliich includes numerous integrated circuit chips, such as a microprocessor, memory, interface circuits, and others.
  • the box also optionally includes a hard disk drive, a floppy disk drive, a high capacity removable drive such as a writeable CD-ROM, and other common peripheral elements.
  • Inputting devices such as a keyboard or mouse optionally provide for input from a user and for user selection of sequences to be compared or otherwise manipulated in the relevant computer system.
  • the computer typically includes appropriate software for receiving user instructions, either in the form of user input into set parameter fields, e.g., in a GUI, or in the form of preprogrammed instructions, e.g., preprogrammed for a variety of different specific operations.
  • the software then converts these instructions to appropriate language for instructing the operation of the fluid direction and transport controller to carry out the desired operation.
  • the software can also include output elements for controlling nucleic acid synthesis (e.g., based upon a sequence or an alignment of a sequences herein) or other operations which occur downstream from an alignment or other operation performed using a character string corresponding to a sequence herein.
  • Nucleic acid synthesis equipment can, accordingly, be a component in one or more integrated systems herein.
  • kits embodying the methods, composition, systems and apparatus herein optionally comprise one or more of the following: (1) an apparatus, system, system component or apparatus component as described herein; (2) instructions for practicing the methods described herein, and/or for operating the apparatus or apparatus components herein and/or for using the compositions herein; (3) one or more GAT compositions or components; (4) a container for holding components or compositions, and, (5) packaging materials.
  • the present invention provides for the use of any apparatus, apparatus component, composition or kit herein, for the practice of any method or assay herein, and/or for the use ofany apparatus or kit to practice any assay or method herein.
  • the host cell can be eukaryotic, for example, a eukaryotic cell, a plant cell, an animal cell, a protoplast, or a tissue culture cell.
  • the host cell optionally comprises a plurality of cells, for example, an organism.
  • the host cell can be prokaryotic including, but not limited to, bacteria (i.e., gram positive bacteria, purple bacteria, green sulfur bacteria, green non-sulfur bacteria, cyanobacteria, spirochetes, thermatogales, flavobacteria, and bacteroides) and archaebacteria (i.e., Korarchaeota, The ⁇ noproteus, Pyrodictium, Thermococcales, Methanogens, Archaeoglobus, and extreme Halophiles).
  • bacteria i.e., gram positive bacteria, purple bacteria, green sulfur bacteria, green non-sulfur bacteria, cyanobacteria, spirochetes, thermatogales, flavobacteria, and bacteroides
  • archaebacteria i.e., Korarchaeota, The ⁇ noproteus, Pyrodictium, Thermococcales, Methanogens, Archaeoglobus, and
  • the transformation of plant cells and protoplasts can be carried out in essentially any of the various ways known to those skilled in the art of plant molecular biology, including, but not limited to, the methods described herein. See, in general, Methods in Enzymology, Vol. 153 (Recombinant DNA PartD) Wu and Grossman (eds.) 1987, Academic Press; and Weising et al, Ann. Rev. Genet. 22: 421-477 (1988), incorporated herein by reference.
  • the DNA construct may be introduced directly into the genomic DNA of the plant cell using techniques such as electroporation, PEG-mediated transfection, particle bombardment, silicon fiber delivery, or microinjection of plant cell protoplasts or embryogenic callus.
  • Agrobacterium tumefaciens host vector T-DNA flanking regions and introduced into a conventional Agrobacterium tumefaciens host vector.
  • the virulence functions of the Agrobacterium tumefaciens host will direct the insertion of the construct and adjacent marker into the plant cell DNA when the cell is infected by the bacteria. See, U.S. Patent No. 5,591,616.
  • Agrobacterium tumefaciens-xxisdiXaXed transformation techniques are well described in the scientific literature. See, for example Horsch et ⁇ l. (1984) Science 233: 496-498, and Fraley et ⁇ l. (1983) Proc. N ⁇ tl Ac ⁇ d. Sci. 80:4803.
  • Agrobacterium transformation of maize is described in U.S. Patent Nos. 5,550,318 and 5,981,840.
  • Other methods of transformation include (1) Agrobacterium rhizogenes '-mediated transformation (see, e.g., Lichtenstein and Fuller hi: Genetic Engineering, Vol. 6, PWJ Rigby, ed., London, Academic Press, 1987; Lichtenstein, C. P., and Draper, J,. In: DNA Cloning, Vol. II, D. M. Glover, Ed., Oxford, ERI Press, 1985;WO 88/02405 describes the use of A.
  • Biol. Reporter 6:165 Expression of polypeptide coding nucleic acids can be obtained by injection of the DNA into reproductive organs of a plant as described by Pena et al. (1987) Nature 325:274. DNA can also be injected directly into the cells of immature embryos and the rehydration of desiccated embryos as described by Neuhaus et al. (1987) Tlieor. Appl. Genet. 75: 30; and Benbrook et al. (1986) in Proceedings Bio Expo 1986, Butterworth, Stoneham, Mass., pp. 27-54. Animal and lower eukaryotic (e.g., yeast) host cells are competent or rendered competent for transfection by various means. There are several well- known methods of introducing DNA into animal cells.
  • transfected cells are cultured by means well known in the art. See, Kuchler, R.J. (1977) Biochemical Methods in Cell Culture and Virology (Dowden, Hutchinson and Ross, Inc.).
  • transformation means alteration of the genotype of a host plant by the introduction of a nucleic acid sequence, e.g., a "heterologous” or "foreign" nucleic acid sequence.
  • heterologous nucleic acid sequence need not necessarily originate from a different source but it will, at some point, have been external to the cell into winch is introduced.
  • useful general references for plant cell cloning, culture and regeneration include Jones, ed. (1995) Plant Gene Transfer and Expression Protocols— Methods in Molecular Biology, volume 49 (Humana Press, Towata, NJ); Payne et al. (1992) Plant Cell and Tissue Culture in Liquid Systems (John Wiley & Sons, Inc. New York, NY) ("Payne”); and Gamborg and Phillips, eds.
  • recombinant vectors including one or more GAT polynucleotides, suitable for the transformation of plant cells are prepared.
  • an expression cassette will typically comprise a selected GAT polynucleotide operably linked to a promoter sequence and other transcriptional and translational initiation regulatory sequences which are sufficient to direct the transcription of the GAT sequence in the intended tissues (e.g., entire plant, leaves, roots, etc.) of the transformed plant.
  • a number of promoters can be used in the practice of the present invention.
  • the promoters can be selected based on the desired outcome. That is, the nucleic acids can be combined with constitutive, tissue-preferred, or other promoters for expression in plants.
  • Constitutive promoters include, for example, the core promoter of the Rsyn7 promoter and other constitutive promoters disclosed in WO 99/43838 and U.S. Patent No. 6,072,050; the core CaMV 35S promoter (Odell et al. (1985) Nature 313:810-812); rice actin (McElroy et al. (1990) Plant Cell 2:163-171); ubiquitin (Christensen et al. (1989) Plant Mol. Biol. 12:619-632 and Christensen et al. (1992) Plant Mol.
  • Chemical-regulated promoters can be used to modulate the expression of a gene in a plant through the application of an exogenous chemical regulator.
  • the promoter may be a chemical- inducible promoter, where application of the chemical induces gene expression, or a chemical-repressible promoter, where application of the chemical represses gene expression.
  • Chemical-inducible promoters are known in the art and include, but are not limited to, the maize In2-2 promoter, which is activated by benzene sulfonamide herbicide safeners; the maize GST promoter, which is activated by hydrophobic electrophilic compounds that are used as pre-emergent herbicides; and the tobacco PR- la promoter, which is activated by salicylic acid.
  • promoters of interest include steroid-responsive promoters. See, for example, the glucocorticoid-inducible promoter in Schena et al. (1991) Proc. Natl. Acad. Sci. USA 88:10421-10425 and McNellis et al. (1998) Plant J. 14(2):247-257 and the tetracycline-inducible and tetracycline-repressible promoters for example, Gatz et al. (1991) Mol. Gen. Genet. 227:229-237, and U.S. Patent Nos. 5,814,618 and 5,789,156, herein incorporated by reference.
  • Tissue-preferred promoters can also be utilized to target GAT expression within a particular plant tissue.
  • Tissue-preferred promoters include those disclosed in Yamamoto et al. (1997) Plant J. 12(2):255-265; Kawamata et al. (1997) Plant Cell Physiol. 38(7):792-803; Hansen et al. (1997) Mol. Gen Genet. 254(3):337-343; Russell et al. (1997) Transgenic Res. 6(2).T57-168; Rinehart et al. (1996) Plant Physiol. 112(3):1331-1341; Van Camp et al. (1996) Plant Physiol. 112(2):525-535; Canevascini et al. (1996) Plant Physiol.
  • Root-preferred promoters are known and can be selected from the many available from the literature or isolated de novo from various compatible species. See, for example, Hire et al. (1992) Plant Mol Biol. 20(2):207-218
  • Plant Cell 2(7):633-641 discloses two root-specific promoters isolated from hemoglobin genes from the nitrogen- fixing nonlegume Parasponia andersonii and the related non-nitrogen-fixing nonlegume Trema tomentosa.
  • the promoters of these genes were linked to a 13- glucuronidase reporter gene and introduced into both the nonlegume Nicotiana tabacum and the legume Lotus corniculatus, and in both instances root-specific promoter activity was preserved. Leach et al.
  • Patent No. 5,633,636 the EFS1 promoter (U.S. Patent Application Serial No. 10/104,706) and the rolB promoter (Capana et al. (1994) Plant Mol Biol. 25(4):681-691). See also U.S. Patent Nos. 5,837,876; 5,750,386; 5,459,252; 5,401,836; 5,110,732; and 5,023,179.
  • "Seed-preferred" promoters include both "seed-specific" promoters (those promoters active during seed development such as promoters of seed storage proteins) as well as “seed-germinating” promoters (those promoters active during seed germination). See Thompson et al.
  • Such seed-preferred promoters include, but are not limited to, Ciml (cytokinin-induced message); cZ19Bl (maize 19 kDa zein); milps (myo-inositol-1 -phosphate synthase); and celA (cellulose synthase) (see U.S. Patent No. 6,225,529, herein incorporated by reference).
  • Gamma-zein is an endosperm-specific promoter.
  • Glob-1 is an embryo-specific promoter.
  • seed-specific promoters include, but are not limited to, bean -phaseolin, napin, ⁇ - conglycinin, soybean lectin, cruciferin, and the like.
  • seed-specific promoters include, but are not limited to, maize 15 kDa zein, 22 kDa zein, 27 kDa zein, g-zein, waxy, shrunken 1, shrunken 2, globulin 1, etc. See also WO 00/12733, which discloses seed-preferred promoters from endl and end2 genes; herein incorporated by reference.
  • a strongly or weakly constitutive plant promoter that directs expression of a GAT nucleic acid in all tissues of a plant can be favorably employed.
  • Such promoters are active under most environmental conditions and states of development or cell differentiation.
  • examples of constitutive promoters include the 1 '- or 2 '- promoter i Agrobacterium tumefaciens, and other transcription initiation regions from various plant genes known to those of skill. Where over expression of a GAT polypeptide of the invention is detrimental to the plant, one of skill will recognize that weak constitutive promoters can be used for low-levels of expression.
  • weak promoter a promoter that drives expression of a coding sequence at a low level is intended.
  • low level levels from about 1/1000 transcripts to about 1/100,000 transcripts, to about as low as 1/500,000 transcripts per cell are intended.
  • weak promoters also include promoters that are expressed in only a few cells and not in others to give a total low level of expression. Where a promoter is expressed at unacceptably high levels, portions of the promoter sequence can be deleted or modified to decrease expression levels.
  • a strong promoter e.g., a t-RNA, or other pol III promoter, or a strong pol ⁇ promoter, (e.g., the cauliflower mosaic virus promoter, CaMV, 35S promoter) can be used.
  • a plant promoter can be under environmental control. Such promoters are referred to as "inducible" promoters.
  • promoters that are "tissue-specific" and/or are under developmental control such that the GAT polynucleotide is expressed only in certain tissues or stages of development, e.g., leaves, roots, shoots, etc.
  • Endogenous promoters of genes related to herbicide tolerance and related phenotypes are particularly useful for driving expression of GAT nucleic acids, e.g., P450 monooxygenases, glutathione-S-transferases, homoglutathione-S- transferases, glyphosate oxidases and 5-enolpyruvylslnkimate-2-phosphate synthases.
  • Tissue specific promoters can also be used to direct expression of heterologous structural genes, including the GAT polynucleotides described herein.
  • the promoters can be used in recombinant expression cassettes to drive expression ofany gene whose expression is desirable in the transgenic plants of the invention, e.g., GAT and/or other genes conferring herbicide resistance or tolerance, genes which influence other useful characteristics, e.g., heterosis.
  • enhancer elements e.g., derived from the 5' regulatory sequences or intron of a heterologous gene, can also be used to improve expression of a heterologous structural gene, such as a GAT polynucleotide.
  • the particular promoter used in the expression cassette in plants depends on the intended application. Any of a number of promoters which direct transcription in plant cells can be suitable.
  • the promoter can be either constitutive or inducible.
  • promoters of bacterial origin which operate in plants include the octopine synthase promoter, the nopaline synthase promoter and other promoters derived from Ti plasmids. See, Herrera-Estrella et al. (1983) Nature 303:209. Viral promoters include the 35S and 19S RNA promoters of CaMV. See, Odell et al. (1985) Nature 313:810. Other plant promoters include the ribulose-l,3-bisphosphate carboxylase small subunit promoter and the phaseolin promoter.
  • promoter sequence from the E8 gene (see, Deikman and Fischer (1988) EMBO J7:3315) and other genes are also favorably used. Promoters specific for monocotyledonous species are also considered (McElroy and Brettell (1994) "Foreign gene expression in transgenic cereals" Trends Biotech. 12:62-68.)
  • novel promoters with useful characteristics can be identified from any viral, bacterial, or plant source by methods, including sequence analysis, enhancer or promoter trapping, and the like, known in the art.
  • sequences other than the promoter and the GAT encoding gene are also favorably used. If proper polypeptide expression is desired, a polyadenylation region can be derived from the natural gene, from a variety of other plant genes, or from T-DNA.
  • the vector comprising the GAT polynucleotide also can include a marker gene which confers a selectable phenotype on plant cells.
  • the marker may encode biocide tolerance, particularly antibiotic tolerance, such as tolerance to kanamycin, G418, bleomycin, hygromycin, or herbicide tolerance, such as tolerance to chlorosulfuron, or phophinothricin.
  • Reporter genes which are used to monitor gene expression and protein localization via visualizable reaction products (e.g., beta-glucuronidase, beta-galactosidase, and chloramphenicol acetyltransferase) or by direct visualization of the gene product itself (e.g. , green fluorescent protein, GFP; Sheen et al. (1995) The Plant Journal 8:777) can be used for, e.g., monitoring transient gene expression in plant cells.
  • Transient expression systems can be employed in plant cells, for example, in screening plant cell cultures for herbicide tolerance activities.
  • Chloroplasts are a site of action of some herbicide tolerance activities, and, in some instances, the GAT polynucleotide is fused to a chloroplast transit sequence peptide to facilitate translocation of the gene products into the chloroplasts. In these cases, it can be advantageous to transform the GAT polynucleotide into the chloroplasts of the plant host cells. Numerous methods are available in the art to accomplish chloroplast transformation and expression (e.g.,
  • the expression construct comprises a transcriptional regulatory sequence functional in plants operably linked to a polynucleotide encoding the GAT polypeptide.
  • Expression cassettes that are designed to function in chloroplasts include the sequences necessary to ensure expression in chloroplasts.
  • the coding sequence is flanked by two regions of homology to the chloroplastid genome to effect a homologous recombination with the chloroplast genome; often a selectable marker gene is also present within the flanking plastid DNA sequences to facilitate selection of genetically stable transformed chloroplasts in the resultant transplastonic plant cells (see, e.g., Maliga (1993) and Daniell (1998) supra, and references cited therein).
  • General transformation methods DNA constructs of the invention can be introduced into the genome of the desired plant host by a variety of conventional techniques. Techniques for transforming a wide variety of higher plant species are well known and described in the technical and scientific literature. See, e.g., Payne, Gamborg, Croy, Jones, etc.
  • Transformation protocols as well as protocols for introducing nucleotide sequences into plants may vary depending on the type of plant or plant cell, i.e., monocot or dicot, targeted for transformation. Suitable methods of introducing nucleotide sequences into plant cells and subsequent insertion into the plant genome include microinjection (Crossway et al.
  • D ⁇ As can be introduced directly into the genomic D ⁇ A of a plant cell using techniques such as electroporation and microinjection of plant cell protoplasts, or the D ⁇ A constructs can be introduced directly to plant tissue using ballistic methods, such as D ⁇ A particle bombardment.
  • the D ⁇ A constructs can be combined with suitable T-D ⁇ A flanking regions and introduced into a conventional Agrobacterium tumefaciens host vector. The virulence functions of the Agrobacterium host will direct the insertion of the construct and adjacent marker into the plant cell D ⁇ A when the plant cell is infected by the bacteria.
  • Microinjection techniques are known in the art and well described in the scientific and patent literature.
  • Agrobacterium mediated transformation techniques are used to transfer the GAT sequences of the invention to transgenic plants.
  • Agrobacterium-mediated transformation is widely used for the transformation of dicots, however, certain monocots can also be transformed by Agrobacterium.
  • Agrobacterium transformation of rice is described by Hiei et al. (1994) Plant J. 6 :271 ; US Patent No. 5, 187,073 ; US Patent No. 5,591,616; Li et al. (1991) Science in China 34:54; and Raineri et al. (1990) Bio/Technology 8: 33.
  • Transformed maize, barley, triticale and asparagus by Agrobacterium mediated transformation have also been described (Xu et al. (1990) C me.se J.90t 2:81).
  • Agrobacterium mediated transformation techniques take advantage of the ability of the tumor-inducing (Ti) plasmid of A.
  • tumefaciens to integrate into a plant cell genome, to co-transfer a nucleic acid of interest into a plant cell.
  • an expression vector is produced wherein the nucleic acid of interest, such as a GAT polynucleotide of the invention, is ligated into an autonomously replicating plasmid which also contains T-DNA sequences.
  • T-DNA sequences typically flank the expression cassette nucleic acid of interest and comprise the integration sequences of the plasmid.
  • T- DNA also typically includes a marker sequence, e.g., antibiotic resistance genes.
  • the plasmid with the T-DNA and the expression cassette are then transfected into Agrobacterium cells. Typically, for effective transformation of plant cells, the A.
  • tumefaciens bacterium also possesses the necessary vir regions on a plasmid, or integrated into its chromosome.
  • Agrobacterium mediated transformation see, Firoozabady and Kuehnle, (1995) in Plant Cell Tissue and Organ Culture Fundamental Methods, eds. Gamborg and Phillips.
  • the polynucleotides of the present invention can be stacked with any combination of polynucleotide sequences of interest in order to create plants with a desired phenotype.
  • polynucleotides of the present invention may be stacked with any other polynucleotides encoding polypeptides having pesticidal and/or insecticidal activity, such as Bacillus thuringiensis toxic proteins (described in U.S. Patent Nos. 5,366,892; 5,747,450; 5,737,514; 5,723,756; 5,593,881; and Geiser et al. (1986) Gene 48:109), lectins (Van Damme et al. (1994) Plant Mol. Biol. 24:825, pentin (described in U.S.
  • the combinations generated can also include multiple copies ofany one of the polynucleotides of interest.
  • the polynucleotides of the present invention can also be stacked with any other gene or combination of genes to produce plants with a variety of desired trait combinations including, but not limited to, traits desirable for animal feed such as high oil genes (e.g., U.S.
  • polynucleotides of the present invention can also be stacked with traits desirable for disease or herbicide resistance (e.g., fumonisin detoxification genes (U.S. Patent No. 5,792,931); avirulence and disease resistance genes (Jones et al. (1994) Science 266:789; Martin et al. (1993) Science 262:1432; Mindrinos et al.
  • fumonisin detoxification genes U.S. Patent No. 5,792,931
  • avirulence and disease resistance genes Jones et al. (1994) Science 266:789; Martin et al. (1993) Science 262:1432; Mindrinos et al.
  • acetolactate synthase (ALS) mutants that lead to herbicide resistance such as the S4 and/or Hra mutations
  • inhibitors of glutamine synthase such as phosphinothricin or basta (e.g., bar gene); and glyphosate resistance (EPSPS gene)
  • traits desirable for processing or process products such as high oil (e.g., U.S. Patent No. 6,232,529 ); modified oils (e.g., fatty acid desaturase genes (U.S. Patent No.
  • modified starches e.g., ADPG pyrophosphorylases (AGPase), starch synthases (SS), starch branching enzymes (SBE), and starch debranching enzymes (SDBE)
  • polymers or bioplastics e.g., U.S. Patent No. 5.602,321; beta-ketothiolase, polyhydroxybutyrate synthase, and acetoacetyl-CoA reductase (Schubert et al (1988) J Bacteriol. 170:5837- 5847) facilitate expression of polyhydroxyalkanoates (PHAs)); the disclosures of which are herein incorporated by reference.
  • polynucleotides of the present invention could also combine with polynucleotides providing agronomic traits such as male sterility (e.g., see U.S. Patent No. 5.583,210), stalk strength, flowering time, or transformation technology traits such as cell cycle regulation or gene targeting (e.g., WO 99/61619, WO 00/17364, and WO 99/25821); the disclosures of which are herein incorporated by reference.
  • stacked combinations can be created by any method including, but not limited to, cross-breeding plants by any conventional or TopCross methodology, or genetic transformation.
  • the polynucleotide sequences of interest can be combined at any time and in any order.
  • a transgenic plant comprising one or more desired traits can be used as the target to introduce further traits by subsequent transformation.
  • the traits can be introduced simultaneously in a co- transformation protocol with the polynucleotides of interest provided by any combination of transformation cassettes.
  • the two sequences can be contained in separate transformation cassettes (trans) or contained on the same transformation cassette (cis). Expression of the sequences can be driven by the same promoter or by different promoters, h certain cases, it may be desirable to introduce a transformation cassette that will suppress the expression of the polynucleotide of interest.
  • polynucleotide sequences can be stacked at a desired genomic location using a site- specific recombination system. See, for example, WO99/25821, WO99/25854, WO99/25840, WO99/25855, and WO99/25853, all of which are herein incorporated by reference.
  • Transformed plant cells which are derived by plant transformation techniques, including those discussed above, can be cultured to regenerate a whole plant which possesses the transformed genotype (i.e., a GAT polynucleotide), and thus the desired phenotype, such as acquired resistance (i.e., tolerance) to glyphosate or a glyphosate analog.
  • Such regeneration techniques rely on manipulation of certain phytohormones in a tissue culture growth medium, typically relying on a biocide and/or herbicide marker which has been introduced together with the desired nucleotide sequences.
  • a biocide and/or herbicide marker which has been introduced together with the desired nucleotide sequences.
  • selection for glyphosate resistance conferred by the GAT polynucleotide of the invention can be performed.
  • Plant regeneration from cultured protoplasts is described in Evans et al. (1983) Protoplasts Isolation and Culture, Handbook of Plant Cell Culture, pp 124- 176, Macmillan Publishing Company, New York; and Binding (1985) Regeneration of Plants, Plant Protoplasts pp 21-73, CRC Press, Boca Raton. Regeneration can also be obtained from plant callus, explants, organs, or parts thereof. Such regeneration techniques are described generally in Klee et al. (1987) Ann Rev of Plant Phys 38:467. See also, e.g., Payne and Gamborg.
  • Transformed plant cells, calli or explant can be cultured on regeneration medium in the dark for several weeks, generally about 1 to 3 weeks to allow the somatic embryos to mature.
  • Preferred regeneration media include media containing MS salts.
  • the plant cells, calli or explant are then typically cultured on rooting medium in a light/dark cycle until shoots and roots develop.
  • Methods for plant regeneration are known in the art and preferred methods are provided by Kamo et al, (Bo Gaz. 146(3):324-334, 1985); West et al, (The Plant Cell 5:1361-1369, 1993); and Duncan et al. ( lanta 165:322-332, 1985). Small plantlets can then be transferred to tubes containing rooting medium and allowed to grow and develop more roots for approximately another week.
  • the plants can then be transplanted to soil mixture in pots in the greenhouse.
  • the regeneration of plants containing the foreign gene introduced by Agrobacterium can be achieved as described by Horsch et al, Science, 227:1229-1231 (1985) and Fraley et al, Proc. Natl. Acad. Sci. U.S.A., 80:4803 (1983). This procedure typically produces shoots within two to four weeks and these transformant shoots are then transferred to an appropriate root-inducing medium containing the selective agent and an antibiotic to prevent bacterial growth.
  • Transgenic plants of the present invention may be fertile or sterile. Regeneration can also be obtained from plant callus, explants, organs, or parts thereof. Such regeneration techniques are described generally in Klee et al, Ann. Rev.
  • the selection medium depends on the selectable marker that was co-transfected into the explants. After a suitable length of time, transformants will begin to form shoots. After the shoots are about 1-2 cm in length, the shoots should be transferred to a suitable root and shoot medium. Selection pressure should be maintained in the root and shoot medium. Typically, the transformants will develop roots in about 1-2 weeks and form plantlets. After the plantlets are about 3-5 cm in height, they are placed in sterile soil in fiber pots. Those of skill in the art will realize that different acclimation procedures are used to obtain transformed plants of different species. For example, after developing a root and shoot, cuttings, as well as somatic embryos of transformed plants, are transferred to medium for establishment of plantlets.
  • transgenic seed are produced without the need for tissue culture.
  • Agrobacterium-mediated transformation protocols have yet to be developed.
  • successful tissue transformation coupled with regeneration of the transformed tissue to produce a transgenic plant has not been reported for some of the most commercially relevant cotton cultivars.
  • an approach that can be used with these plants involves stably introducing the polynucleotide into a related plant variety via Agrobacterium-mediated transformation, confirming operability, and then transferring the transgene to the desired commercial strain using standard sexual crossing or back-crossing techniques.
  • Agrobacterium can be used to transform a Coker line of Gossypium hirustum (e.g., Coker lines 310, 312, 5110 Deltapine 61 or StoneviUe 213), and then the transgene can be introduced into another more commercially relevant G. hirustum cultivar by back-crossing.
  • the transgenic plants of this invention can be characterized either genotypically or phenotypically to determine the presence of the GAT polynucleotide of the invention. Genotypic analysis can be performed by any of a number of well-known techniques, including PCR amplification of genomic DNA and hybridization of genomic DNA with specific labeled probes.
  • Phenotypic analysis includes, e.g., survival of plants or plant tissues exposed to a selected herbicide such as glyphosate.
  • a selected herbicide such as glyphosate.
  • One of skill will recognize that after the expression cassette containing the GAT gene is stably incorporated in transgenic plants and confirmed to be operable, it can be introduced into other plants by sexual crossing. Any of a number of standard breeding techniques can be used, depending upon the species to be crossed. In vegetatively propagated crops, mature transgenic plants can be propagated by the taking of cuttings or by tissue culture techniques to produce multiple identical plants. Selection of desirable transgenics is made and new varieties are obtained and propagated vegetatively for commercial use. h seed propagated crops, mature transgenic plants can be self crossed to produce a homozygous inbred plant.
  • the inbred plant produces seed containing the newly introduced heterologous nucleic acid. These seeds can be grown to produce plants that would produce the selected phenotype. Parts obtained from the regenerated plant, such as flowers, seeds, leaves, branches, fruit, and the like are included in the invention, provided that these parts comprise cells comprising the isolated GAT nucleic acid. Progeny and variants, and mutants of the regenerated plants are also included within the scope of the invention, provided that these parts comprise the introduced nucleic acid sequences. Transgenic plants expressing a selectable marker can be screened for transmission of the GAT nucleic acid, for example, by standard immunoblot and DNA detection techniques. Transgenic lines are also typically evaluated on levels of expression of the heterologous nucleic acid.
  • RNA expression at the RNA level can be determined initially to identify and quantitate expression-positive plants.
  • Standard techniques for RNA analysis can be employed and include PCR amplification assays using ohgonucleotide primers designed to amplify only the heterologous RNA templates and solution hybridization assays using heterologous nucleic acid-specific probes.
  • the RNA-positive plants can then be analyzed for protein expression by Western immunoblot analysis using the specifically reactive antibodies of the present invention.
  • in situ hybridization and immunocytochemistry according to standard protocols can be done using heterologous nucleic acid specific polynucleotide probes and antibodies, respectively, to localize sites of expression within transgenic tissue.
  • transgenic lines are usually screened for the incorporated nucleic acid to identify and select plants with the most appropriate expression profiles.
  • a preferred embodiment is a transgenic plant that is homozygous for the added heterologous nucleic acid; i.e., a transgenic plant that contains two added nucleic acid sequences, one gene at the same locus on each chromosome of a chromosome pair.
  • a homozygous transgenic plant can be obtained by sexually mating (selfing) a heterozygous transgenic plant that contains a single added heterologous nucleic acid, germinating some of the seed produced and analyzing the resulting plants produced for altered cell division relative to a control plant (i.e., native, non-transgenic).
  • any plant can be transformed with the GAT polynucleotides of the invention.
  • Suitable plants for the transformation and expression of the novel GAT polynucleotides of this invention include agronomically and horticulturally important species.
  • Such species include, but are not restricted to members of the families: Graminae (including corn, rye, triticale, barley, millet, rice, wheat, oats, etc); Leguminosae (including pea, beans, lentil, peanut, yam bean, cowpeas, velvet beans, soybean, clover, alfalfa, lupine, vetch, lotus, sweet clover, wisteria, and sweetpea); Compositae (the largest family of vascular plants, including at least 1,000 genera, including important commercial crops such as sunflower); and Rosaciae (including raspberry, apricot, almond, peach, rose, etc.); as well as nut plants (including, walnut, pecan, hazelnut, etc.); and forest trees (including Pinus, Quercus, Pseutotsuga, Sequoia, Populus.etc.) Additional targets for modification by the GAT polynucleotides of the invention, as well as those specified above, include plants from the genera: Agro
  • plants in the family Graminae are particularly desirable target plants for the methods of the invention.
  • Common crop plants which are targets of the present invention include corn, rice, triticale, rye, cotton, soybean, sorghum, wheat, oats, barley, millet, sunflower, canola, peas, beans, lentils, peanuts, yam beans, cowpeas, velvet beans, clover, alfalfa, lupine, vetch, lotus, sweet clover, wisteria, sweetpea and nut plants (e.g., walnut, pecan, etc).
  • the invention provides a method for producing a crop by growing a crop plant that is glyphosate-tolerant as a result of being transformed with a gene encoding a glyphosate N-acetyltransferase, under conditions such that the crop plant produces a crop, and harvesting the crop.
  • glyphosate is applied to the plant, or in the vicinity of the plant, at a concentration effective to control weeds without preventing the transgenic crop plant from growing and producing the crop.
  • the application of glyphosate can be before planting, or at any time after planting up to and including the time of harvest.
  • Glyphosate can be applied once or multiple times.
  • the timing of glyphosate application, amount applied, mode of application, and other parameters will vary based upon the specific nature of the crop plant and the growing environment, and can be readily determined by one of skill in the art.
  • the invention further provides a crop produced by this method.
  • the invention provides for the propagation of a plant containing a GAT polynucleotide transgene.
  • the plant can be, for example, a monocot or a dicot.
  • propagation entails crossing a plant containing a GAT polynucleotide transgene with a second plant, such that at least some progeny of the cross display glyphosate tolerance.
  • the invention provides a method for selectively controlling weeds in a field where a crop is being grown.
  • the method involves planting crop seeds or plants that are glyphosate-tolerant as a result of being transformed with a gene encoding a GAT, e.g., a GAT polynucleotide, and applying to the crop and any weeds a sufficient amount of glyphosate to control the weeds without a significant adverse impact on the crop. It is important to note that it is not necessary for the crop to be totally insensitive to the herbicide, so long as the benefit derived from the inhibition of weeds outweighs any negative impact of the glyphosate or glyphosate analog on the crop or crop plant. h another aspect, the invention provides for use of a GAT polynucleotide as a selectable marker gene.
  • the presence of the GAT polynucleotide in a cell or organism confers upon the cell or organism the detectable phenotypic trait of glyphosate resistance, thereby allowing one to select for cells or organisms that have been transformed with a gene of interest linked to the GAT polynucleotide.
  • the GAT polynucleotide can be introduced into a nucleic acid construct, e.g., a vector, thereby allowing for the identification of a host (e.g.
  • a GAT polynucleotide can be used as a selectable marker in a wide variety of hosts that are sensitive to glyphosate, including plants, most bacteria (including E. coli), actinomycete, yeasts, algae and fungi.
  • herbicide resistance is that it obviates the concern of some members of the public that antibiotic resistance might escape into the environment.
  • GAT polynucleotides conferring enhanced glyphosate resistance in transgenic plants Selection of GAT polynucleotides conferring enhanced glyphosate resistance in transgenic plants.
  • Libraries of GAT encoding nucleic acids diversified according to the methods described herein can be selected for the ability to confer resistance to glyphosate in transgenic plants.
  • the modified GAT genes can be used as a selection marker to facilitate the production and evaluation of transgenic plants and as a means of conferring herbicide resistance in experimental or agricultural plants.
  • an initial functional evaluation can be performed by expressing the library of GAT encoding sequences in E. coli.
  • the expressed GAT polypeptides can be purified, or partially purified as described above, and screened for improved kinetics by mass spectrometry. Following one or more preliminary rounds of diversification and selection, the polynucleotides encoding improved GAT polypeptides are cloned into a plant expression vector, operably linked to, e.g., a strong constitutive promoter, such as the CaMV 35S promoter.
  • the expression vectors comprising the modified GAT nucleic acids are transformed, typically by Agrobacterium mediated transformation, into Arabidopsis thaliana host plants. For example, Arabidopsis hosts are readily transformed by dipping inflorescences into solutions of
  • Agrobacterium and allowing them to grow and set seed Thousands of seeds are recovered in approximately 6 weeks. The seeds are then collected in bulk from the dipped plants and germinated in soil, h this manner it is possible to generate several thousand independently transformed plants for evaluation, constituting a high throughput (HTP) plant transformation format. Bulk grown seedlings are sprayed with glyphosate and surviving seedlings exhibiting glyphosate resistance survive the selection process, whereas non-transgenic plants and plants incorporating less favorably modified GAT nucleic acids are damaged or killed by the herbicide treatment.
  • HTP high throughput
  • the GAT encoding nucleic acids conferring improved resistance to glyphosate are recovered, e.g., by PCR amplification using T-DNA primers flanking the library inserts, and used in further diversification procedures or to produce additional transgenic plants of the same or different species. If desired, additional rounds of diversification and selection can be performed using increasing concentrations of glyphosate in each subsequent selection. In this manner, GAT polynucleotides and polypeptides conferring resistance to concentrations of glyphosate useful in field conditions can be obtained.
  • Herbicide Resistance provides a composition comprising two or more polynucleotides of the invention.
  • the GAT polynucleotides encode GAT polypeptides having different kinetic parameters, i.e., a GAT variant having a lower K m can be combined with one having a higher k ca t.
  • the different GAT polynucleotides may be coupled to a chloroplast transit sequence or other signal sequence thereby providing GAT polypeptide expression in different cellular compartments, organelles or secretion of one or more of the GAT polypeptides.
  • the mechanism of glyphosate resistance of the present invention can be combined with other modes of glyphosate resistance known in the art to produce plants and plant explants with superior glyphosate resistance.
  • glyphosate-tolerant plants can be produced by inserting into the genome of the plant the capacity to produce a higher level of 5-enolpyruvylshikimate-3- phosphate synthase (EPSPS) as more fully described in U.S. Patent Nos. 6,248,876 Bl; 5,627,061; 5,804,425; 5,633,435; 5,145,783; 4,971,908; 5,312,910; 5,188,642; 4,940,835; 5,866,775; 6,225,114 Bl; 6,130,366; 5,310,667; 4,535,060; 4,769,061; 5,633,448; 5,510,471; Re.
  • EPSPS 5-enolpyruvylshikimate-3- phosphate synthase
  • Glyphosate resistance is also imparted to plants that express a gene that encodes a glyphosate oxido-reductase enzyme as described more fully in U.S. Patent Nos. 5,776,760 and 5,463,175, which are incorporated herein by reference in their entireties for all purposes.
  • the mechanism of glyphosate resistance of the present invention may be combined with other modes of herbicide resistance to provide plants and plant explants that are resistant to glyphosate and one or more other herbicides.
  • the hydroxyphenylpyruvatedioxygenases are enzymes that catalyze the reaction in which para-hydroxyphenylpyruvate (HPP) is transformed into homogentisate.
  • Molecules which inhibit this enzyme, and which bind to the enzyme in order to inhibit transformation of the HPP into homogentisate are useful as herbicides. Plants more resistant to certain herbicides are described in U.S. Patent Nos. 6,245,968 Bl; 6,268,549; and 6,069,115; and international publication WO 99/23886, which are incorporated herein by reference in their entireties for all purposes. Sulfonylurea and imidazolinone herbicides also inhibit growth of higher plants by blocking acetolactate synthase (ALS) or acetohydroxy acid synthase (AHAS). The production of sulfonylurea and imidazolinone tolerant plants is described more fully in U.S. Patent Nos.
  • Glutamine synthetase appears to be an essential enzyme necessary for the development and life of most plant cells. Inhibitors of GS are toxic to plant cells. Glufosinate herbicides have been developed based on the toxic effect due to the inhibition of GS in plants. These herbicides are non-selective. They inhibit growth of all the different species of plants present, causing their total destruction.
  • the invention provides methods for selectively controlling weeds in a field containing a crop that involve planting the field with crop seeds or plants which are glyphosate-tolerant as a result of being transformed with a gene encoding a glyphosate N-acetyltransferase, and applying to the crop and weeds in the field a sufficient amount of glyphosate to control the weeds without significantly affecting the crop.
  • the invention further provides methods for controlling weeds in a field and preventing the emergence of glyphosate-resistant weeds in a field containing a crop which involve planting the field with crop seeds or plants that are glyphosate-tolerant as a result of being transformed with a gene encoding a glyphosate-N-acetyltransferase and a gene encoding a polypeptide imparting glyphosate tolerance by another mechanism, such as, a glyphosate-tolerant 5- enolpyruvylsl ikimate-3 -phosphate synthase and/or a glyphosate-tolerant glyphosate oxido-reductase and applying to the crop and the weeds in the field a sufficient amount of glyphosate to control the weeds without significantly affecting the crop.
  • the invention provides methods for controlling weeds in a field and preventing the emergence of herbicide resistant weeds in a field containing a crop which involve planting the field with crop seeds or plants that are glyphosate-tolerant as a result of being transformed with a gene encoding a glyphosate-N-acetyltransferase, a gene encoding a polypeptide imparting glyphosate tolerance by another mechanism, such as, a glyphosate- tolerant 5-enolpyruvylshikimate-3-phosphate synthase and/or a glyphosate-tolerant glyphosate oxido-reductase and a gene encoding a polypeptide imparting tolerance to an additional herbicide, such as, a mutated hydroxyphenylpyruvatedioxygenase, a sulfonamide-tolerant acetolactate synthase, a sulfonamide-tolerant acetohydroxy acid
  • the invention further provides methods for controlling weeds in a field and preventing the emergence of herbicide resistant weeds in a field containing a crop which involve planting the field with crop seeds or plants that are glyphosate-tolerant as a result of being transformed with a gene encoding a glyphosate-N-acetyltransferase and a gene encoding a polypeptide imparting tolerance to an additional herbicide, such as, a mutated hydroxyphenylpyruvatedioxygenase, a sulfonamide-tolerant acetolactate synthasej a sulfonamide-tolerant acetohydroxy acid synthase, an imidazolinone-tolerant acetolactate synthase, an imidazolinone-tolerant acetohydroxy acid synthase, a phosphinothricin acetyltransferase and a mutated protoporphyrinogen oxidase and applying to the crop
  • EXAMPLE 1 ISOLATING NOVEL NATIVE GAT POLYNUCLEOTIDES
  • Five native GAT polynucleotides i.e., GAT polynucleotides that occur naturally in a non-genetically modified organism
  • Their nucleotide sequences were determined and are provided herein as SEQ ID NO:l - 5.
  • SEQ ID NO:l - 5 a collection of approximately 500 Bacillus and Pseudomonas strains were screened for native ability to N-acetylate glyphosate.
  • the product of the reaction was positively identified as N-acetylglyphosate by comparing the mass spectrometry profile of the reaction mix to an N- acetylglyphosate standard as shown in Figure 2.
  • Product detection was dependent on inclusion of both substrates (acetyl CoA and glyphosate) and was abolished by heat denaturing the bacterial cells.
  • Individual GAT polynucleotides were then cloned from the identified strains by functional screening. Genomic DNA was prepared and partially digested with Sau3Al enzyme. Fragments of approximately 4 Kb were cloned into an E. coli expression vector and transformed into electrocompetent E. coli.
  • GAT activity Individual clones exhibiting GAT activity were identified by mass spectrometry following a reaction as described previously except that the toluene wash was replaced by permeabilization with PMBS. Genomic fragments were sequenced and the putative GAT polypeptide-encoding open reading frame identified. Identity of the GAT gene was confirmed by expression of the open reading frame inE. coli and detection of high levels of N-acetylglyphosate produced from reaction mixtures.
  • EXAMPLE 2 CHARACTERIZATION OF A GAT POLYPEPTIDE ISOLATED FROM B.LICHENIFORMIS STRAIN B6.
  • Genomic DNA from B. licheniformis strain B6 was purified, partially digested with Sau3Al and fragments of 1-10 Kb were cloned into an E. coli expression vector.
  • a clone with a 2.5 kb insert conferred the glyphosate-N- acetyltransferase (GAT) activity on the E. coli host as determined with mass spectrometry analysis. Sequencing of the insert revealed a single complete open reading frame of 441 base pairs. Subsequent cloning of this open reading frame confirmed that it encoded the GAT enzyme.
  • GAT glyphosate-N- acetyltransferase
  • a plasmid, pMAXY2120, is shown in Figure 4.
  • the gene encoding the GAT enzyme of B6 was transformed into E. coli strain XL1 Blue.
  • a 10% innoculum of a saturated culture was added to Luria broth, and the culture was incubated at 37° C for 1 hr.
  • Expression of GAT was induced by the addition of EPTG at a concentration of 1 mM.
  • the culture was incubated a further 4 hrs, following which, cells were harvested by centrifugation and the cell pellet stored at -80° C.
  • Lysis of the cells was effected by the addition of 1 ml of the following buffer to 0.2 g of cells: 25 mM HEPES, pH 7.3, 100 mM KC1 and 10% methanol (HKM) plus 0.1 mM EDTA, 1 mM DTT, 1 mg/ml chicken egg lysozyme, and a protease inhibitor cocktail obtained from Sigma and used according to the manufacturer's recommendations. After 20 minutes incubation at room temperature (e.g., 22-25° C), lysis was completed with brief sonication. The lysate was centrifuged and the supernatant was desalted by passage through Sephadex G25 equilibrated with HKM.
  • Partial purification was obtained by affinity chromatography on CoA Agarose (Sigma). The column was equilibrated with HKM and the clarified extract was allowed to pass through under hydrostatic pressure. Non-binding proteins were removed by washing the column with HKM, and GAT was eluted with HKM containing 1 mM Coenzyme A. This procedure provided 4-fold purification. At this stage, approximately 65% of the protein staining observed on an SDS polyacrylamide gel loaded with crude lysate was due to GAT, with another 20% due to chloramphenicol acetyltransferase encoded by the vector. Purification to homogeneity was obtained by gel filtration of the partially purified protein through Superdex 75 (Pharmacia).
  • the mobile phase was HKM, in which GAT activity eluted at a volume corresponding to a molecular radius of 17 kD.
  • This material was homogeneous as judged by Coomassie staining of a 3 ⁇ g sample of GAT subjected to SDS polyacrylamide gel electrophoresis on a 12% acrylamide gel, 1 mm thickness. Purification was achieved with a 6-fold increase in specific activity.
  • the apparent K M for glyphosate was determined on reaction mixtures containing saturating (200 ⁇ M) Acetyl CoA, varying concentrations of glyphosate, and 1 ⁇ M purified GAT in buffer containing 5 mM morpholine adjusted to pH 7.7 with acetic acid and 20 % ethylene glycol.
  • the apparent K for Acetyl CoA was determined on reaction mixtures containing 5 mM glyphosate, varying concentrations of Acetyl CoA, and 0.19 ⁇ M GAT in buffer containing 5 mM morpholine adjusted to pH 7.7 with acetic acid and 50% methanol.
  • Initial reaction rates were determined using mass spectrometric detection of N-acetyl glyphosate. Five ⁇ l were repeatedly injected into the instrument and reaction rates were obtained by plotting reaction time vs area of the integrated peak (Figure 6). Hyperbolic saturation kinetics were observed (Figure 7), from which an apparent K M of 2 ⁇ M was derived. From values for V max obtained at a known concentration of enzyme, a k cat of 6/min was calculated.
  • EXAMPLE 3 MASS SPECTROMETRY (MS) SCREENING PROCESS Sample (5 ⁇ l) was drawn from a 96-well microtiter plate at a speed of one sample every 26 seconds and injected into the mass spectrometer (Micromass Quattro LC, triple quadrapole mass spectrometer) without any separation. The sample was carried into the mass spectrometer by a mobile phase of water/methanol (50:50) at a flow rate of 500 Ul/min.
  • MS MASS SPECTROMETRY
  • Each injected sample was ionized by a negative electrospray ionization process (needle voltage, -3.5 KV; cone voltage, 20 V; source temperature, 120° C; desolvation temperature, 250° C; cone gas flow, 90 L/Hr; and desolvation gas flow, 600 L/Hr).
  • the molecular ions (m/z 210) formed during this process were selected by the first quadrapole for performing collision induced dissociation (CED) in the second quadrapole, where the pressure was set at 5 x 10 "4 mBar and the collision energy was adjusted to 20 Ev.
  • CED collision induced dissociation
  • the third quadrapole was set for only allowing one of the daughter ions (m z 124) produced from the parent ions (m/z 210) to get into the detector for signal recording.
  • the first and third quadrapoles were set at unit resolution, while the photomultiplier was operated at 650 V. Pure N-acetylglyphosate standards were used for comparison and peak integration was used to estimate concentrations. It was possible to detect less than 200 Nm N-acetylglyphosate by this method.
  • EXAMPLE 4 DETECTION OF NATIVE OR LOW ACTIVITY GAT ENZYMES Native or low activity GAT enzymes typically have a k cat of approximately 1 min '1 and a K M for glyphosate of 1.5-10 Mm. K M for acetyl CoA was typically less than 25 ⁇ M.
  • Bacterial cultures were grown in rich medium in deep 96-well plates and 0.5 ml stationary phase cells were harvested by centrifugation, washed with 5 mM morpholine acetate pH 8, and resuspended in 0.1 ml reaction mix containing 200 ⁇ M ammonium acetyl CoA, 5 mM ammonium glyphosate, and 5 ⁇ g/ml PMBS (Sigma) in 5 mM morpholine acetate, pH 8.
  • the PMBS permeabilizes the cell membrane allowing the substrates and products to move from the cells to the buffer without releasing the entire cellular contents. Reactions were carried out at 25-37°C for 1-48 hours.
  • EXAMPLE 5 DETECTION OF HIGH ACTIVITY GAT ENZYMES High activity GAT enzymes typically have a k cat up to 400 min "1 and a K below 0.1 mM glyphosate.
  • Genes coding for GAT enzymes were cloned into E. coli expression vector pQE80 (Qiagen) and introduced into E. coli strain XL1 Blue (Stratagene). Cultures were grown in 150 ul rich medium (LB with 50 ug/ml carbenicllin) in shallow U-bottom 96-well polystyrene plates to late-log phase and diluted 1 :9 with fresh medium containing 1 mM EPTG (USB).
  • EXAMPLE 6 PURIFICATION OF GAT ENZYMES Enzyme purification was achieved by affinity chromatography of cell lysates on CoA-agarose and gel-filtration on Superdex-75. Quantities of purified GAT enzyme up to 10 mg were obtained as follows: A 100-ml culture of E. coli carrying a GAT polynucleotide on a pQE80 vector and grown overnight in LB containing 50 ug/ml carbenicillin was used to inoculate 1 L of LB plus 50 ug/ml carbenicillin. After 1 hr, EPTG was added to 1 mM, and the culture was grown a further 6 hr. Cells were harvested by centrifugation.
  • Lysis was effected by suspending the cells in 25 mM HEPES (pH 7.2), 100 mM KC1, 10% methanol (HKM), 0.1 mM EDTA, 1 mM DTT, protease inhibitor cocktail supplied by Sigma-Aldrich and 1 mg/ml of chicken egg lysozyme. After 30 minutes at room temperature, the cells were briefly sonicated. Particulate material was removed by centrifugation, and the lysate was passed through a bed of coenzyme A-Agarose. The column was washed with several bed volumes of HKM and GAT was eluted in 1.5 bed volumes of HKM containing 1 mM acetyl CoA.
  • GAT in the eluate was concentrated by its retention above a Centricon YM 50 ultrafiltration membrane. Further purification was obtained by passing the protein through a Superdex 75 column through a series of 0.6-ml injections. The peak of GAT activity eluted at a volume corresponding to a molecular weight of 17 kD. This method resulted in purification of GAT enzyme to homogeneity with >85% recovery. A similar procedure was used to obtain 0.1 to 0.4 mg quantities of up to 96 shuffled variants at a time.
  • the volume of induced culture was reduced to 1 to 10 ml, coenzyme A- Agarose affinity chromatography was performed in 0.15-ml columns packed in an MAHV filter plate (Millipore) and Superdex 75 chromatography was omitted.
  • EXAMPLE 7 STANDARD PROTOCOL FOR DETERMINATION OF KPAT AND K M k cat and K M for glyphosate of purified protein were determined using a continuous spectrophotometric assay, in which hydrolysis of the sulfoester bond of Acetyl CoA was monitored at 235 mn. Reactions were performed at ambient temperature (about 23 °C) in the wells of a 96-well assay plate, with the following components present in a final volume of 0.3 ml: 20 mM HEPES, pH 6.8, 10%) ethylene glycol, 0.2 mM acetyl CoA, and various concentrations of ammonium glyphosate.
  • both enzymes were assayed under the same conditions, e.g., both at 23°C.
  • k cat was calculated from V max and the enzyme concentration, determined by Bradford assay.
  • KM was calculated from the initial reaction rates obtained from concentrations of glyphosate ranging from 0.125 to 10 mM, using the Lineweaver-Burke transformation of the Michaelis-Menten equation.
  • k cat /K was determined by dividing the value determined for k cat by the value determined for K M - Using this methodology, kinetic parameters for a number of GAT polypeptides exemplified herein were determined.
  • the k cat , K M and kc a t/KM for the GAT polypeptide corresponding to SEQ ED NO:445 have been determined to be 322 min "1 , 0.5 mM and 660 mM ⁇ min "1 , respectively, using the assay conditions described above.
  • the k cat , K M and IO ⁇ /K M for the GAT polypeptide corresponding to SEQ ED NO:457 have been determined to be 118 min "1 , 0.1 mM and 1184 mM ' n "1 , respectively, using the assay conditions described above.
  • the kcat, K M and k cat !Ku for the GAT polypeptide corresponding to SEQ ED NO:300 have been determined to be 296 min "1 , 0.65 mM and 456 mM " 1 min "1 , respectively, using the assay conditions described above.
  • One of skill in the art can use these numbers to confirm that a GAT activity assay is generating kinetic parameters for a GAT suitable for comparison with the values given herein.
  • the conditions used to compare the activity of GATs should yield the same kinetic constants for SEQ ID NO: 300, 445, and 457 (within normal experimental variance) as those reported herein, when the conditions are used to compare a test GAT with the GAT polypeptides exemplified herein.
  • K M for Acetyl CoA was measured using the mass spectrometry method with repeated sampling during the reaction.
  • AcetylCoA and glyphosate (ammonium salts) were placed as 50-fold-concentrated stock solutions into a well of a mass spectrometry sample plate. Reactions were initiated with the addition of enzyme appropriately diluted in a volatile buffer such as mo ⁇ holine acetate or ammonium carbonate, pH 6.8 or 7.7. The sample was repeatedly injected into the instrument and initial rates were calculated from plots of retention time and peak area. K M was calculated as for glyphosate.
  • EXAMPLE 8 SELECTION OF TRANSFORMED E. COLI
  • An evolved GAT gene (a chimera with a native B. licheniformis ribosome binding site (AACTGAAGGAGGAATCTC; SEQ ID NO:515) attached directly to the 5' end of the GAT coding sequence) was cloned into the expression vector pQE80 (Qiagen) between the EcoRI and Hindlll sites, resulting in the plasmid pMAXY2190 ( Figure 11). This eliminated the His tag domain from the plasmid and retained the B-lactamase gene conferring resistance to the antibiotics ampicillin and carbenicillin.
  • pMAXY2190 was electroporated (BioRad Gene Pulser) into XL1 Blue (Stratagene) E. coli cells. The cells were suspended in SOC rich medium and allowed to recover for one hour. The cells were then gently pelleted, washed one time with M9 minimal media lacking aromatic amino acids (12.8 g/L Na2HPO4.7 H2O, 3.0 g/L KH2PO4, 0.5 g/L NaCl, 1.0 g/L NH4C1, 0.4% glucose, 2 mM MgSO4, 0.1 mM CaC12, 10 mg/L thiamine, 10 mg/L proline, 30 mg/L carbenicillin), and resuspended in 20 ml of the same M9 medium.
  • M9 minimal media lacking aromatic amino acids (12.8 g/L Na2HPO4.7 H2O, 3.0 g/L KH2PO4, 0.5 g/L NaCl, 1.0 g/L NH4C1, 0.4% glucose, 2 m
  • EXAMPLE 9 SELECTION OF TRANSFORMED PLANT CELLS Agrobacterium-mediated transformation of plant cells occurs at low efficiencies. To allow propagation of transformed cells while inhibiting proliferation of non-transformed cells, a selectable marker is needed. Antibiotic markers for kanamycin and hygromycin and the herbicide modifying gene bar, which detoxifies the herbicidal compound phosphinothricin, are examples of selectable markers used in plants (Methods in Molecular Biology, 1995, 49:9-18). Here we demonstrate that GAT activity serves as an efficient selectable marker for plant transformation.
  • a screenable GUS marker was present in the T-DNA to allow confirmation of transformation.
  • Transgenic tobacco shoots were generated using glyphosate as the only selecting agent. Axillary buds o ⁇ Nicotiana tabacum L.
  • Xanthi were subcultured on half-strength MS medium with sucrose (1.5 %) and Gelrite (0.3 %) under 16-h light (35-42 ⁇ Einsteins m "2 s "1 , cool white fluorescent lamps) at 24 °C every 2-3 weeks. Young leaves were excised from plants after 2-3 weeks subculture and were cut into 3 3 mm segments.
  • BBI basal shoot induction
  • EXAMPLE 10 GLYPHOSATE SELECTION OF TRANSFORMED YEAST CELLS Selection markers for yeast transformation are usually auxotrophic genes that allow growth of transformed cells on a medium lacking the specific amino acid or nucleotide. Because Saccharomyces cerevisiae is sensitive to glyphosate, GAT can also be used as a selectable marker.
  • an evolved GAT gene (0_6D10), SEQ ED NO: 196, is cloned from the T-DNA vector pMAXY3793 (as shown in Example 9) as a Pstl-Clal fragment containing the entire coding region and ligated into Pstl-Clal digested p424TEF (Gene, 1995, 156:119-122) as shown in Figure 13.
  • This plasmid contains an E. coli origin of replication and a gene conferring carbenicillin resistance as well as a TRP1, tryptophan auxotroph selectable marker for yeast transformation.
  • the GAT containing construct is transformed into E.
  • Plasmid DNA is prepared and used to transform yeast strain YPH499 (Stratagene) using a transformation kit (Bio 101). Equal amounts of transformed cells are plated on CSM-YNB-glucose medium (BiolOl) lacking all aromatic amino acids (tryptophan, tyrosine, and phenylalanine) with added glyphosate. For comparison, p424TEF lacking the GAT gene is also introduced into YPH499 and plated as described. The results demonstrate that GAT activity function will as an efficient selectable marker. The presence of the GAT containing vector in glyphosate selected colonies can be confirmed by re-isolation of the plasmid and restriction digest analysis.
  • EXAMPLE 11 HERBICIDE SPRAY TESTS OF GAT EXPRESSING TOBACCO PLANTS Tobacco shoots generated as described in EXAMPLE 9 were excised from the explants and transferred to basal root induction (BRI) medium composed of half-strength Murashige and Skoog (MS) medium, pH 5.7, with 1.5% sucrose,
  • BBI basal root induction
  • Rooted plants and axillary shoots were clonally propagated by cutting the stem and transferring it to fresh BRI medium until the desired number of clones was obtained. Rooted plants were carefully removed from the solid medium. Prior to placing the plants into small pots of soil, the roots were washed to remove any remaining Gelrite. A protective plastic cover was kept over the plants for at least one week until the plants were well established. To determine if GAT expressing tobacco plants could tolerate simulated field rate sprays of glyphosate, clonal lines of several events per GAT variant were tested.
  • a typical test was set up as follows: One clone from each event was sprayed with 1 ml of solution containing the isopropylamine salt of glyphosate (Sigma P5671) and 0.125% Triton X-100, pH 6.8 such that the amount of active ingredient sprayed was equivalent to that present in commercial glyphosate products.
  • IX oz/acre
  • ai active ingredient
  • 2.4 ul of 40% ai formulation was diluted into 1 ml water and sprayed on a plant in a 4-inch square pot (16 in 2 ).
  • a mock application (OX) with surfactant only was also included. In some cases a second spray was applied 1-4 weeks later.
  • Plants were kept in controlled growth rooms at 25 °C and 70% humidity with 16 hr light.
  • 10 events confirmed positive for GAT0_6D10 (SEQ ID NO: 196), ten for GAT0_5D3 (SEQ LD NO: 193), 8 events for GAT0_5B8 (SEQ ED NO: 190), and plants transformed with the vector only (no GAT) were clonally propagated, transferred to soil and sprayed when plants had an average of 5 leaves. Seed-grown wild type plants were also sprayed. After two weeks, the vector only and seed grown plants sprayed with 0.5, 2 or 4X glyphosate stopped growing, wilted, and turned brown.
  • transgenic GAT plants survived the spraying procedure without signs of glyphosate damage such as chlorosis, leaf elongation, stunting, or browning. All OX plants were healthy, including the non- GAT control plants. Three weeks later all of the surviving plants were sprayed with an 8X dose. The OX control plants died within two weeks. Again, all GAT plants survived. Tobacco plants transformed with GAT and selected on glyphosate were fertile. Flowering and seed set were not detectably different from wild type plants.
  • Ratios near 3 : 1 indicate a single segregating dominant event. Ratios greater than 3:1 indicate several segregating inserts. Ratios less than 3:1 can be due to small sample size effects, incomplete dominance, or position effects that render expression too low to confer herbicide tolerance. Compared to the controls, it was clear that the GAT gene was transmitted to the Tl generation and conferred glyphosate tolerance.
  • EXAMPLE 13 PRODUCTION OF GLYPHOSATE-RESISTANT MAIZE EXPRESSING GAT TRANSGENES
  • Maize plants expressing GAT variant transgenes were produced using the methods described in U.S. Patent No. 5,981,849, which is incorporated herein by reference.
  • Agrobacterium tumefaciens vectors were constructed according to methods known in the art. Each vector contained an insert having an ubiquitin promoter and intron, a GAT variant and a PinXI terminator.
  • Maize immature embryos were excised and infected with an Agrobacterium tumefaciens vector containing the GAT variant of interest. After infection, embryos were transferred and cultured in co-cultivation medium.
  • the infected immature embryos were transferred onto media containing 1.0 mM glyphosate (Roundup ULTRA MAXTM). This selection lasted until actively growing putative transgenic calli were identified.
  • the putative transgenic callus tissues were sampled for PCR and Western assay (data not shown) to confirm the presence of the GAT gene.
  • the putative transgenic callus tissues were maintained on 1.0 mM glyphosate selection media for further growth and selection before plant regeneration.
  • callus tissue confirmed to be transgenic were transferred onto maturation medium containing 0.1 mM glyphosate and cultured for somatic embryo maturation. Mature embryos were then transferred onto regeneration medium containing 0.1 mM glyphosate for shoot and root formation.
  • Plant height measurements before spraying with glyphosate after spraying with glyphosate at 1, 2, 3 and 4 weeks mature plants (at tasseling) Two plants were sent to the greenhouse from each event (independent transgenic callus) listed in Table 6. Plant 1 was kept for seed production and was not sprayed with glyphosate. Plant 2 was sprayed at 4x glyphosate (lx glyphosate 26 ounces/acre) at 14 days after transplanting. The TO plant discoloration scores with 4x spray at 7 and 14 days after the spray are shown in Tables 6 and 7. Height data at tasseling is shown in Figure 14. An additional experiment was performed in which TO plants were sprayed with 6x glyphosate. The TO plant discoloration scores with 6x spray at 10 days after spray are shown in Table 8.
  • EXAMPLE 15 Tl STUDIES OF GLYPHOSATE-RESISTANT MAIZE EXPRESSING GAT TRANSGENES Maize plants expressing GAT variant transgenes 18-28D9b (SEQ ED).
  • Tl plants were used for the generation of glyphosate field tolerance data.
  • the Tl plants were treated in the field with four different glyphosate spray treatments (OX, 4X, 8X, and 4X + 4X) for each event.
  • the plants were sprayed at V3 and V8. Plants were scored 10 days after treatment for leaf discoloration and plant height comparisons as described in Example 13.
  • the Tl field spray data correlated well with the results previously obtained in the greenhouse as reported in Example 13. T2 seeds were collected for further studies.
  • Tl plants were grown in cool/cold (day 14° C, night 8° C), warm (day 28° C, night 20° C), and hot (day 37° C, night 20° C) conditions. Tl plants were sprayed at V2 with four different glyphosate spray treatments (OX, 4X, 6X, and 8X). Plants were scored at 5 and 14 days after treatment for leaf discoloration and plant height comparisons as described in Example 13. Visual observations indicated that glyphosate tolerance is not adversely effected by the range of temperatures tested.
  • GAT polypeptides that are not native to any organism appeared stable at temperatures in the range of about 47°C to about 54°C.
  • the half lives of several GAT polypeptides were also measured at 37.5°C according to the following procedure.
  • GAT polypeptides 401 SEQ LD NO: 6
  • 17- 15H3 SEQ ED NO: 601
  • 20-8H12 SEQ ID NO: 739
  • 22-13B12 SEQ ED NO: 781
  • 22-15B4 SEQ ED NO: 946
  • 22-18C5 SEQ ID NO: 795
  • EXAMPLE 17 PRODUCTION OF GLYPHOSATE-RESISTANT SOYBEAN EXPRESSING GAT TRANSGENES Soybean plants expressing GAT variant transgenes were produced using the method of particle gun bombardment (see Klein et al. (1987) Nature 327:70-73) using a DuPont Biolistic PDSIOOO/He instrument. The selection agent used during the transformation process was hygromycin. Either the hygromycin selectable marker gene remained in the transgenic events or the hygromycin gene was excised by methods known in the art. DNA fragments were prepared with a synthetic constitutive promoter, a GAT variant and PinEI terminator.
  • the selectable marker gene comprising the 35S CaMV promoter, HPT gene and NOS terminator, was cobombarded with the GAT gene variant as described above. Bombarded soybean embryogenic suspension tissue was cultured for one week in the absence of selection agent. Embryogenic suspension tissue was placed in liquid selection medium for 6 weeks. Putative transgenic suspension tissue was sampled for PCR analysis to determine the presence of the GAT gene. Putative transgenic suspension culture tissue was maintained in selection medium for 3 weeks to obtain enough tissue for plant regeneration. Suspension tissue was matured for 4 weeks using standard procedures; matured somatic embryos were desiccated for 4-7 days and then placed on germination induction medium for 2-4 weeks.
  • EXAMPLE 18 EFFECT OF SALT ON GAT KINETICS To better approximate the physiological conditions under which the GAT enzymes of the invention are intended to be used (e.g., plant cells), the activities of some GAT enzymes of the invention were re-evaluated in the presence of added salt.
  • Figures 15A and 15B provide a comparison of the kinetic parameters K m and k 0 at/K m , respectively, for native GAT enzymes GAT401 (SEQ ED NO:6), B6 (SEQ ED NO:7), and DS3 (SEQ LD NO:8), and evolved GAT enzymes 0_6D10 (SEQ ED NO.-448), 10_4F2 (SEQ ID NO:454), 18-28D9 (SEQ ED NO:618), 17-15H3 (SEQ ED NO:601), 17-10B3 (SEQ LD NO:592), 20-8H12 (SEQ ED NO:739), 20-16A3 (SEQ ED NO:639), and 20-30C6 (SEQ ED NO:683), assayed in either the absence of added KCl (unshaded bars) or in the presence of 20 mM KCl (shaded bars).
  • GAT401 SEQ ED NO:6
  • B6 SEQ
  • the k c t value remains relatively unchanged or increases slightly, the net result being a lower observed k ca t/K m value for GAT enzymes assayed in the presence of 20 mM KCl than in the absence of added KCl ( Figure 15B).
  • EXAMPLE 19 FURTHER EVOLVED GAT GENES ENCODING GAT ENZYMES WITH EXTREMELY HIGH ACTIVITY Additional iterations of directed molecular evolution yielded further evolved gat genes encoding GAT enzymes exhibiting extremely high GAT activity, e.g. exhibiting one or more improved property such as reduced K m for glyphosate, increased k cat , or increased k cat /K m compared to previously-described GAT enzymes.
  • the further evolved gat genes were first selected for growth in E. coli in minimal M9 medium as described in Example 8, except that 5 mM rather than 1 mM glyphosate was used in the selection. Proteins were purified as described in Example 6 above.
  • Exemplary further evolved gat coding sequences comprise nucleic acids sequences identified herein as SEQ LD NOs: 832, 834, 836, 838, 840, 842, 844, 846, 848, 850, 852, 854, 856, 858, 860, 862, 864, 866, 868, 870, 872, 874, 876, 878, 880, 882, 884, 886, 888, 890, 892, 894, 896, 898, 900, 902, 904, 906, 908, 910, 912, 914, 916, 918, 920, 922, 924, 926, 928, and 930, which encode further evolved GAT enzymes comprising amino acid sequences identified herein as SEQ ED NOs: 833, 835, 837, 839, 841, 843, 845, 847, 849, 851, 853, 855, 857, 859, 861, 863, 865, 867, 869, 8
  • Some such further evolved GAT enzymes exhibit extremely high GAT activity, in that they exhibit one or more improved property such as reduced K m for glyphosate, increased c a t, or increased k cat /K m , compared to previously-described GAT enzymes assayed under the same conditions.
  • Figures 16 A, 16B and 16C provide a comparison of the kinetic parameters K m , k cat , and k cat /K m , respectively, of several previously-described GAT enzymes (unshaded bars) to the kinetic parameters of some further evolved GAT enzymes of the invention (shaded bars), assayed using the continuous spectrophotometric assay in the presence of 20 mM KCl with protein quantified via UV absorbance as described above. Error bars represent the standard deviation of multiple assays, where available.
  • native GAT enzyme GAT401 (SEQ ID NO:6) exhibited a K m for glyphosate of about 4 mM, a k cat of about 5.4 1 1 1 min " , and a k Cat /K m of about 1.35 mM “ min " .
  • some further evolved GAT enzymes of the invention exhibit a range of K m values for glyphosate of less than about 0.4 mM (such as, between about 0.4 mM and 0.1 mM), k ⁇ t values of at least about 1000 min “1 (such as, between about 1000 min “1 and about 2500 min “1 ), and k cat /K m values of at least about 4800 mM “1 min “1 (such as, between about 4800 mM "1 min “1 and about 8000 mM “ min “1 ).
  • some further evolved GAT enzymes of the invention exhibit at least about a 7000-fold increase in k cat /K m over native GAT enzyme GAT401 under these assay conditions.
  • Some further evolved GAT enzymes of the invention comprise one or more amino acid residue positions not observed in previously described GAT polypeptides and GAT enzymes, such as, at position 27, a Bl, ZI or A amino acid residue; at position 33, an N or G amino acid residue; at position 46, a B2, Z4, or H amino acid residue; and at position 93, an R amino acid residue;
  • Bl is an amino acid selected from the group consisting of A, I, L, M, F, W, Y and V
  • B2 is an amino acid selected from the group consisting of R, N, D, C, Q, E, G, H, K, P, S, and T
  • ZI is an amino acid selected from the group consisting of A, I, L, M and V
  • Z4 is an amino acid selected from the group consisting of R, H and K
  • some further evolved GAT enzymes of the invention comprise one or more of: an Ala at position 27 (i.e., Ala27); an Asn or a Gly at position 33 (i.e., Asn33 or Gly33); a His at position 46 (i.e., His46); and an Arg at position 93 (i.e., Arg93), with sequence numbering corresponding to that of, e.g., SEQ ED NO: 907. Sequence/activity analyses were performed to identify amino acid residues which correlate positively with a high k cat /K m (as manifested by a high k oat , a low K m , or both).
  • Amino acid residues wliich appear to correlate positively with a high k ca /K m include Glul4, Asp32, Asn33, Gly38, and Thr62 (sequence numbering corresponding to that of SEQ ED NO:907). Additional GAT enzymes may be constructed by substituting codons for one or more of these residues into the appropriate position(s) of a coding sequence of a template GAT polypeptide.
  • additional GAT enzymes were generated by substituting one or more of codons encoding Glu at codon position 14, Asp at position 32, Asn at position 33, Gly at position 38, and Thr at position 62, into a nucleic acid sequence encoding a template polypeptide, such as GAT 24-5H5 (SEQ ID NO:845) or GAT 25-8H7 (SEQ ED NO:907), two of the further evolved GAT enzymes exhibiting extremely high activity as described above.
  • a template polypeptide such as GAT 24-5H5 (SEQ ID NO:845) or GAT 25-8H7 (SEQ ED NO:907)
  • Exemplary further evolved GAT enzymes generated in this manner identified herein as R12G1 (SEQ ED NOs917), R12G2 (SEQ ED NO:919), R12G3 (SEQ ED NO:921), R12G4 (SEQ LD NO:923), R12G5 (SEQ ED NO:925), R12G6 (SEQ ED NO:927), R12G7 (SEQ LD NO:929), and R12G8 (SEQ ED NO:931), encoded by nucleic acids identified as SEQ ED NOs: 916, 918, 920, 922, 924, 926, 928, and 930, respectively, exhibited extremely high GAT activities comparable to those of the template polypeptides.
  • EXAMPLE 20 AMINO ACEDS THAT CORRELATE WITH HIGH GAT ACTIVITY
  • the amino acids aspartic acid (Asp, D), histidine (His, H) and cysteine (Cys, C) are known to be associated with the active sites of various acetyltransferase enzymes.
  • All D, C, and H residues of GAT20-30C6 were individually mutated to alanine (Ala, A) and the mutated enzymes assayed for N- acetylglyphosate activity.
  • Variants containing the substitutions D34A and H41A retained only about 2%>-3% of the activity of the unmodified enzyme, while the variant containing the substitution H138A exhibited essentially no measurable GAT activity.
  • variants containing the substitutions H138R and H138S retained low but measurable GAT activity (particularly at pHs greater than 6.8), suggesting that His (and nominally Arg and Ser) at position 138 may serve as an active-site base.
  • EXAMPLE 21 IMPROVING GAT EXPRESSION EN PLANTS Plants, animals, and microbes are known to have specific codon preferences that affect the efficiency of amino acid incorporation during translation of gene transcripts. Rare codons could cause problems with tRNA recruitment during translation, which could then lead to lower accumulation of the encoded protein.
  • the original parental gat genes were from bacteria such as Bacillus licheniformis, and, as such, may not have an optimal codon distribution for expression in plants.
  • Evolved gat genes of the invention have successfully been expressed in plants (see, e.g., Examples 9, 11, 13, and 17, above), yet an opportunity exists to improve protein production by increasing the translation efficiency in plants.
  • One way to accomplish this is by substituting one or more codons in the gat coding sequence which are used infrequently in plants for codons for the same amino acid(s) which are more frequently used in plants, thereby generating silent mutations in the gat coding sequence with an unchanged sequence of the encoded protein.
  • Tables showing the frequency of codon usage in corn, cotton and soybeans (available, for example, from the website maintained by the Kazusa DNA Research Eistitute, Chiba, Japan) were compared to generate the following table (Table 13) showing codons wliich are, in general, more frequently or less frequently utilized in either monocot or dicot plants.
  • a second way to increase plant expression of microbial genes is to increase 10 the G+C content near the initiating methionine residue.
  • Naturally-occurring coding sequences in plants tend to contain two or three G and/or C residues immediately downstream of the ATG initiation codon (Joshi et ⁇ /.(1997) Plant Mol. Biol. 35:993-1001). Introducing into the gat coding sequence one or two CG-rich codons immediately downstream of the ATG intitiation codon may create 15 a more plant-like coding sequence and thus may enhance its expression in plants.
  • Evolved gat coding sequences may be altered by replacing codons less frequently utilized in plants for codons more frequently utilized in plants, for example according to the table above. Codons less frequently utilized in plants (e.g., according to the above table) should generally be avoided, hi this manner at least one codon (such as, at least three codons, at least five codons, or at least least ten codons), may be changed in the gat coding sequence from codon(s) less frequently utitlized in plants to codon(s) more frequently utitlized in plants.
  • the codons which are replaced may be located at the 5' end of the coding sequence (e.g., within the first 10 codons, within the first 20 codons, within the first 50 codons, or within the first 100 codons) of the gat coding sequence.
  • the codons which are replaced may be located throughout the gat coding sequence.
  • the more frequently utilized codons furthermore may be chosen to avoid more than about 5- 10 (such as, e.g., more than about 5, more than about 6, more than about 7, more than about 8, more than about 9 or more than about 10) consecutive occurrences of G+C or ofA+T within the coding sequence.
  • the coding sequence may also be altered to contain one or two CG-rich codons immediately downstream of the ATG intitiation codon, such as, for example, by inserting an Ala codon (e.g., a frequently utilized Ala codon) immediately downstream of and adjacent to the initiating Met codon of the gat coding sequence.
  • Ala codon e.g., a frequently utilized Ala codon
  • Table 14 provides exemplary gat coding sequences altered as described above.
  • This bacterial suspension was forced or "infiltrated” into the intercellular spaces of Nicotiana benthamiana leaves using a 1 ml plastic 10 syringe (with no needle). By infiltrating 200-300 ⁇ l of bacterial suspension into each spot (typically 3-4cm 2 in infiltrated area), 4 or more spots could be arranged on a single leaf still attached to the plant.
  • the g ⁇ t-containing Agrobacterium strain was diluted 5:1 or 10:1 with a second Agrobacterium strain lacking gat prior to infiltration. This dilution step has the effect of reducing the 15 overall expression of the gat gene in the plant cells, thereby preventing saturation and allowing easier visualization of expression differences between variants and constructs.
  • EXAMPLE 22 Tl STUDIES OF GLYPHOSATE-RESISTANT SOYBEAN EXPRESSLNG GAT TRANSGENES Soybean plants expressing GAT transgene 18-28D9c (SEQ ED NO:824) were produced using the methods described in Example 17.
  • Tl seed was collected from glyphosate sprayed TO plants.
  • Tl seed were germinated under greenhouse conditions in RediEarth R 360 medium, available from Scotts, Marysville, OH, and sprayed at V2-V3 stage with either 2X or 4X Glyphosate (RoundUp ULTRA MAXTM, available from Monsanto, St. Louise, MO) as per methods described in Example 17. Plants were scored after 10 days and leaf discoloration scores taken as described in Example 17.
  • the Tl greenhouse spray data correlated well with previous greenhouse results at the TO plant stage.
  • T2 seed was collected for further studies.
  • EXAMPLE 23 PRODUCTION OF GLYPHOSATE AND SULFONAMIDE RESISTANT SOYBEANS EXPRESSING GAT AND HRA TRANSGENES Soybean plants expressing GAT & HRA, high resistance allele of acetolactate synthase (U.S. Patent Nos. 5,605,011, 5,378,824, 5,141,870, and 5013,659), genes were produced using the methods described in Example 17.
  • the HRA gene was used as selectable marker gene for transformation.
  • the selection agent was chlorsulfuron at a concentration of 100 ng/ml.
  • the selectable marker gene was comprised of the S-adenosyl-L-methionine synthetase (SAMS) promoter from Gly cine max (U.S. 2003/226166), HRA coding sequence from Gly cine max and acetolactate synthase terminator from Glycine max.
  • SAMS S-adenosyl-L-methionine synthetase
  • HRA HRA coding sequence from Gly cine max
  • acetolactate synthase terminator from Glycine max.
  • the selectable marker gene was either linked to or co-bombarded with a GAT construct consisting of a synthetic constitutive promoter (U.S. Patent Nos. 6,072,050 and 6,555,673) or the maize Histone 2B promoter (U.S. Patent No.
  • EXAMPLE 24 Tl PRE-EMERGENCE STUDIES OF SOYBEANS EXPRESSING GAT AND HRA TRANSGENES
  • Tl seed generated from experiments as described in Example 17 were planted in pots of Tama Silt loam in the greenhouse. Pots were immediately sprayed with a pre-emergence application of chlorimuron, rimsulfuron or tribenuron at a rate of 70 gms a.i./hectare. Germinating plants were evaluated 10 days post spray application based on plant discoloration scores described in Example 17. All HRA and GAT events survived all pre-emergence spray applications with a rating of 9 (uninjured). These results demonstrate pre- emergence resistance to sulfonamide chemistry in soybeans.
  • EXAMPLE 25 Tl POST-EMERGENCE STUDIES OF SOYBEANS EXPRESSING GAT AND HRA TRANSGENES Tl seed generated from experiments as described in Example 17 were germinated in RediEarth R 360 medium in the greenhouse. Plants were sprayed at the V2-V3 stage (14 days after potting) with thifensulfuron, chlorimuron, rimsulfuron or tribenuron (70, 70, 35, 35 gm a.i./hectare, respectively). Plants were evaluated 10 days post application based on plant discoloration scores described in Example 17. Results are shown in Table 16.
  • Thifensulfuron 70 gm a.i./ha 7.81
  • EXAMPLE 26 T3 STUDIES OF GLYPHOSATE-RESISTANT MAIZE PLANTS EXPRESSING GAT TRANSGENES Maize plants expressing GAT transgenes 20-H812 (SEQ LD NO:738) and
  • 20-16A3 (SEQ ED NO:638) were produced using the methods described in Example 13. Plants were scored after 10 days and leaf discoloration scores taken as described in Example 13. Specifically, plants were sprayed at V4 leaf stage. The plants were thinned to equal spacing and stand counts after application of spray treatments. Commercially available NK603 (Monsanto, St. Louis, MO) was used as a control. Resistance scores are shown in Table 18. Plant height measurements were also taken 10 days after treatment and are shown in Table 18.
  • EXAMPLE 27 T3 YIELD STUDIES OF GLYPHOSATE-RESISTANT MAIZE EXPRESSING GAT TRANSGENES
  • T3 seed from Example 15 was used to generate T3 plants for the generation of glyphosate field tolerance data on hybrids. The experiment was conducted at 15 Viluco, Chile with four (4) replications using a split-plot design. Specifically, 3 entries were included. Two of the entries comprised maize plants expressing GAT variant transgenes 17-15H3 (SEQ ED NO:549). A glyphosate-resistant control NK603, which is commercially available from Monsanto, was the third entry.
  • EXAMPLE 28 T2 STUDIES OF GLYPHOSATE-RESISTANT MAIZE EXPRESSING GAT TRANSGENES
  • T2 plants were examined.
  • GAT positive T2 plants were sprayed at V4 with IX (26 oz/A ULTRA MAXTM).
  • GAT negative plants were PCR sampled at V4. GAT positive plants were removed from the row.
  • EXAMPLE 29 AMINO ACED SUBSTRATES OF GAT POLYPEPTIDES
  • GAT activity of several GAT polypeptides of the present invention was evaluated with respect to a number of amino acid substrates.
  • the GAT polypeptide, AcCoA and amino substrate were incubated in 25 mM Hepes, pH 6.8, 10%) ethylene glycol in the wells of a 96-well polystyrene plate. After 30 minutes, the reactions were stopped by the addition of 30 ⁇ l of 10 mM 5, 5'-dithiobis-2- nitrobenzoate (DTNB) in 500 mM Tris, pH 7.5.
  • DTNB 5'-dithiobis-2- nitrobenzoate
  • native GAT polypeptide 401 (SEQ ED NO: 6) (or B6 (SEQ ED NO: 7), in the case of phosphoserine) exhibited detectable activity with 12 amino acids.
  • the native GAT polypeptide was about as active with L- aspartate, about 4.7 times more active with L-serine, and about 2 times more active with phospho-L-serine than with glyphosate.
  • non-native GAT polypeptides 17-15H3 SEQ ID NO: 601
  • 25- 8H7 SEQ ED NO: 907
  • EXAMPLE 30 EFFECT OF PH ON GAT ACTIVITY
  • the pH optima of k ca t and K M for wild-type enzyme B6 (SEQ ED NO: 7) and GAT polypeptide 17-15H3 (SEQ ED NO: 601) were determined using the spectrophotometric assay described in Example 7 except that assay buffer was 50 mM Hepes and 10%> ethylene glycol, titrated to a range of pH values. Protein concentrations were determined by the UV absorbance assay described in Example 19. The effect of pH on KM and Kcat is shown in Figure 18 for clones B6 (SEQ ED NO: 7) and 17-15H3 (SEQ ED NO: 601).

Abstract

Novel proteins are provided herein, including proteins capable of catalyzing the acetylation of glyphosate and other structurally related proteins. Also provided are novel polynucleotides capable of encoding these proteins, compositions that include one or more of these novel proteins and/or polynucleotides, recombinant cells and transgenic plants comprising these novel compounds, diversification methods involving the novel compounds, and methods of using the compounds. Some of the novel methods and compounds provided herein can be used to render an organism, such as a plant, resistant to glyphosate.

Description

NOVEL GLYPHOSATE-N-ACETYLTRANSFERASE (GAT) GENES
COPYRIGHT NOTIFICATION PURSUANT TO 37 C.F.R. § 1.71(E) A portion of the disclosure of this patent document contains material which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure, as it appears in the Patent and Trademark Office patent file or records, but otherwise reserves all copyright rights whatsoever.
BACKGROUND OF THE INVENTION Crop selectivity to specific herbicides can be conferred by engineering genes into crops which encode appropriate herbicide metabolizing enzymes, some cases these enzymes, and the nucleic acids that encode them, originate in a plant, h other cases, they are derived from other organisms, such as microbes.
See, e.g., Padgette et al. (1996) "New weed control opportunities: Development of soybeans with a Round UP Ready™ gene" and Vasil (1996) "Phosphinothricin- resistant crops", both in Herbicide-Resistant Crops, ed. Duke (CRC Press, Boca Raton, Florida) pp.54-84 and pp. 85-91. Indeed, transgenic plants have been engineered to express a variety of herbicide tolerance/metabolizing genes, from a variety of organisms. For example, acetohydroxy acid synthase, which has been found to make plants that express this enzyme resistant to multiple types of herbicides, has been introduced into a variety of plants (see, e.g., Hattori et al. (1995) Mol. Gen. Genet. 246: 419). Other genes that confer tolerance to herbicides include: a gene encoding a chimeric protein of rat cytochrome P4507A1 and yeast NADPH-cytochrome P450 oxidoreductase (Shiota et al. (1994) Plant Physiol. 106: 17), genes for glutathione reductase and superoxide dismutase (Aono et al. (1995) Plant Cell Physiol. 36: 1687, and genes for various phosphotransferases (Datta et al. (1992) Plant Mol. Biol. 20: 619). One herbicide which is the subject of much investigation in this regard is N-phosphonomethylglycine, commonly referred to as glyphosate. Glyphosate is the top selling herbicide in the world, with sales projected to reach $5 billion by 2003. It is a broad spectrum herbicide that kills both broadleaf and grass-type plants. A successful mode of commercial level glyphosate resistance in transgenic plants is by introduction of a modified Agrobacterium CP4 5- enolpyruvylshikimate-3 -phosphate synthase (hereinafter referred to as EPSP synthase or EPSPS) gene. The transgene is targeted to the chloroplast where it is capable of continuing to synthesize EPSP from phosphoenolpyruvic acid (PEP) and shikimate-3 -phosphate in the presence of glyphosate. In contrast, the native EPSP synthase is inhibited by glyphosate. Without the transgene, plants sprayed with glyphosate quickly die due to inhibition of EPSP synthase which halts the downstream pathway needed for aromatic amino acid, hormone, and vitamin biosynthesis. The CP4 glyphosate-resistant soybean transgenic plants are marketed, e.g., by Monsanto under the name "Round UP Ready™." hi the environment, the predominant mechanism by which glyphosate is degraded is through soil microflora metabolism. The primary metabolite of glyphosate in soil has been identified as aminomethylphosphonic acid (AMP A), which is ultimately converted into ammonia, phosphate and carbon dioxide. The proposed metabolic scheme that describes the degradation of glyphosate in soil through the AMPA pathway is shown in Fig. 8. An alternative metabolic pathway for the breakdown of glyphosate by certain soil bacteria, the sarcosine pathway, occurs via initial cleavage of the C-P bond to give inorganic phosphate and sarcosine, as depicted in Fig. 9. Another successful herbicide/transgenic crop package is glufosinate
(phosphinothricin) and the Liberty Link™ trait marketed, e.g., by Aventis. Glufosinate is also a broad spectrum herbicide. Its target is the glutamate synthase enzyme of the chloroplast. Resistant plants carry the bar gene from Streptomyces hygroscopicus and achieve resistance by the N-acetylation activity of bar, which modifies and detoxifies glufosinate. An enzyme capable of acetylating the primary amine of AMP A is reported in PCT Application No. WO00/29596. The enzyme was not described as being able to acetylate a compound with a secondary amine (e.g., glyphosate). While a variety of herbicide resistance strategies are available as noted above, additional approaches would have considerable commercial value. The present invention provides novel polynucleotides and polypeptides for conferring herbicide tolerance, as well as numerous other benefits as will become apparent during review of the disclosure.
SUMMARY OF THE INVENTION The present invention provides methods and reagents for rendering an organism, such as a plant, resistant to glyphosate by one or more of the embodiments described below. One embodiment of the invention provides novel polypeptides referred to herein as glyphosate-N-acetyltransferase ("GAT") polypeptides. GAT polypeptides are characterized by their structural similarity to one another, e.g., in terms of sequence similarity when the GAT polypeptides are aligned with one another. GAT polypeptides of the present invention possess glyphosate-N- acetyltransferase activity, i.e., the ability to catalyze the acetylation of glyphosate. These GAT polypeptides transfer the acetyl group from acetyl CoA to the N of glyphosate. In addition, some GAT polypeptides transfer the propionyl group of propionyl CoA to the N of glyphosate. Some GAT polypeptides are also capable of catalyzing the acetylation of glyphosate analogs and/or glyphosate metabolites, e.g., aminomethylphosphonic acid. Exemplary GAT polypeptides correspond to SEQ ID NO: 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 621, 623, 625, 627, 629, 631, 633, 635, 637, 639, 641, 643, 645, 647, 649, 651, 653, 655, 657, 659, 661, 663, 665, 667, 669, 671, 673, 675, 677, 679, 681, 683, 685, 687, 689, 691, 693, 695, 697, 699,701, 703, 705, 707, 709, 711, 713, 715, 717, 719,721, 723, 725, 727, 729, 731, 733, 735, 737, 739, 741, 743, 745, 747, 749, 751, 753, 755, 757, 759, 761, 763, 765, 767, 769, 771, 773, 775, 777, 779, 781, 783, 785, 787, 789, 791, 793, 795, 797, 799, 801, 803, 805, 807, 809, 811, 813, 815, 817, 819, 821, 823, 825, 833, 835, 837, 839, 841, 843, 845, 847, 849, 851, 853, 855, 857, 859, 861, 863, 865, 867, 869, 871, 873, 875, 877, 879, 881, 883, 885, 887, 889, 891, 893, 895, 897, 899, 901, 903, 905, 907, 909, 911, 913, 915, 917, 919, 921, 923, 925, 927, 929, 931, 953, 954, 955, 956, 957, 958, 959, 960, 961, 962, 963, 964, 965, 966, 967, 968, 969, 970, 971, and 972. Also provided are novel polynucleotides referred to herein as GAT polynucleotides, e.g., SEQ ID NO: 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565', 566, 567, 620, 622, 624, 626, 628, 630, 632, 634, 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, 668, 670, 672, 674, 676, 678, 680, 682, 684, 686, 688, 690, 692, 694, 696, 698, 700, 702, 704, 706, 708, 710, 712, 714, 716, 718, 720, 722, 724, 726, 728, 730, 732, 734, 736, 738, 740, 742, 744, 746, 748, 750, 752, 754, 756, 758, 760, 762, 764, 768, 770, 772, 774, 776, 778, 780, 782, 784, 786, 788, 790, 792, 794, 796, 798, 800, 802, 804, 806, 808, 810, 812, 814, 816, 818, 820, 822, 824, 832, 834, 836, 838, 840, 842, 844, 846, 848, 850, 852, 854, 856, 858, 860, 862, 864, 866, 868, 870, 872, 874, 876, 878, 880, 882, 884, 886, 888, 890, 892, 894, 896, 898, 900, 902, 904, 906, 908, 910, 912, 914, 916, 918, 920, 922, 924, 926, 928, 930, 932, 933, 934, 935, 936, 937, 938, 939, 940, 941, 942, 943, 944, 945, 947, 949, 951, and 952. GAT polynucleotides are characterized by their ability to encode GAT polypeptides. hi some embodiments of the invention, a GAT polynucleotide is engineered for better plant expression by replacing one or more parental codons with a synonymous codon that is preferentially used in plants relative to the parental codon. In other embodiments, a GAT polynucleotide is modified by the introduction of a nucleotide sequence encoding an N-terminal chloroplast transit peptide. hi other embodiments, a GAT polynucleotide is modified by the insertion of one or more G+C containing codons (such as GCG or GCT) immediately downstream of and adjacent to the initiating Met codon. GAT polypeptides, GAT polynucleotides and glyphosate-N- acetyltransferase activity are described in more detail below. The invention further includes certain fragments of the GAT polypeptides and GAT polynucleotides described herein. The invention includes non-native variants of the polypeptides and polynucleotides described herein, wherein one or more amino acid of the encoded polypeptide has been mutated. h certain preferred embodiments, the GAT polypeptides of the present invention are characterized as follows. When optimally aligned with a reference amino acid sequence selected from the group consisting of SEQ ID NO: 300, 445, and 457 to generate a similarity score of at least 460 using the BLOSUM62 matrix, a gap existence penalty of 11, and a gap extension penalty of 1, one or more of the following positions conform to the following restrictions: (i) at positions 18 and 38, a Z5 amino acid residue; (ii) at position 62, a ZI amino acid residue; (iii) at position 124, a Z6 amino acid residue; and (iv) at position 144, a Z2 amino acid residue, wherein: ZI is an amino acid residue selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid residue selected from the group consisting of F, W, and Y; Z5 is an amino acid residue selected from the group consisting of D and E; and Z6 is an amino acid residue selected from the group consisting of C, G, and P. The invention further provides an isolated or recombinant polypeptide comprising an amino acid sequence selected from the groups consisting of: (a) an amino acid sequence that is at least 98% identical to SEQ ID NO:577; (b) an amino acid sequence that is at least 97% identical to SEQ ID NO:578; (c) an amino acid sequence that is at least 97% identical to SEQ LD NO:621; (d) an amino acid sequence that is at least 98% identical to SEQ ID NO:579; (e) an amino acid sequence that is at least 98% identical to SEQ ID NO:602; (f) an amino acid sequence that is at least 95% identical to SEQ LD NO:697; (g) an amino acid sequence that is at least 96% identical to SEQ ID NO:712; (h) an amino acid sequence that is at least 97% identical to SEQ ID NO:613; (i) an amino acid sequence that is at least 89% identical to SEQ LD NO:677; (j) an amino acid sequence that is at least 96% identical to SEQ ID NO:584; (k) an amino acid sequence that is at least 98% identical to SEQ ID NO: 707; (1) an amino acid sequence that is at least 98% identical to SEQ ID NO:616; (m) an amino acid sequence that is at least 96% identical to SEQ LD NO:612;and (n) an amino acid sequence that is at least 98% identical to SEQ LD NO:590. The invention further provides an isolated or recombinant polypeptide comprismg an amino acid sequence selected from the groups consisting of: (a) an amino acid sequence that is at least 96% identical to positions 2-146 of SEQ ID NO:919 (such as, for example, SEQ ID NO:917, 919, 921, 923, 925, 927, 833, 835, 839, 843, 845, 859, 863, 873, 877, 891, 895, 901, 905, 907, 913, 915, or 950); (b) an amino acid sequence that is at least 97% identical to positions 2-146 of SEQ ID NO:929 (such as, for example, SEQ LD NO:929, 931, 835, 843, 849, or 867); (c) an amino acid sequence that is at least 98% identical to positions 2-146 of SEQ ID NO:847 (such as, for example, SEQ ID NO:845 or 847); (d) an amino acid sequence that is at least 98% identical to positions 2-146 of SEQ LD NO:851; (e) an amino acid sequence that is at least 98% identical to positions 2-146 of SEQ LD NO:853; (f) an amino acid sequence that is at least 98% identical to positions 2- 146 of SEQ LD NO:855 (such as, for example, SEQ LD NO:835 or 855); (g) an amino acid sequence that is at least 98% identical to positions 2-146 of SEQ LD NO:857; (h) an amino acid sequence that is at least 98% identical to positions 2- 146 of SEQ LD NO:861 (such as, for example, SEQ LD NO:839, 861, or 883); (i) an amino acid sequence that is at least 98% identical to positions 2-146 of SEQ LD NO:871; (j) an amino acid sequence that is at least 98% identical to positions 2- 146 of SEQ LD NO:875; (k) an amino acid sequence that is at least 98% identical to positions 2-146 of SEQ LD NO:881; (1) an amino acid sequence that is at least 98% identical to positions 2-146 of SEQ LD NO:885 (such as, for example, SEQ LD NO:845 or 885); (m) an amino acid sequence that is at least 98% identical to positions 2-146 of SEQ LD NO:887; (n) an amino acid sequence that is at least 98% identical to positions 2-146 of SEQ LD NO:889 (such as, for example, SEQ LD NO: 863, 889, 891, or 903); (o) an amino acid sequence that is at least 98% identical to positions 2-146 of SEQ LD NO:893; (p) an amino acid sequence that is at least 98% identical to positions 2-146 of SEQ LD NO:897; (q) an amino acid sequence that is at least 98% identical to positions 2-146 of SEQ LD NO:899; (r) an amino acid sequence that is at least 98% identical to positions 2-146 of SEQ D NO:909 (such as, for example, SEQ LD NO:883 or 909); (s) an amino acid sequence that is at least 98% identical to positions 2-146 of SEQ LD NO:911; (t) an amino acid sequence that is at least 99% identical to positions 2-146 of SEQ LD NO:837; (u) an amino acid sequence that is at least 99% identical to positions 2- 146 of SEQ ID NO: 841; (v) an amino acid sequence that is at least 99% identical to positions 2-146 of SEQ LD NO:865; (w) an amino acid sequence that is at least 99% identical to positions 2-146 of SEQ LD NO:869; and (x) an amino acid sequence that is at least 99% identical to positions 2-146 of SEQ LD NO:879. h some embodiments of the invention, the amino acid sequence of the polypeptide comprises Met, Met- Ala, or Met- Ala- Ala on the N-terminal side of the amino acid corresponding to position 2 of the reference amino acid sequence. The invention further provides an isolated or recombinant polypeptide comprising an amino acid sequence that is at least 95% identical to positions 2-146 of SEQ LD NO:929 and which comprises a Gly or an Asn residue at the amino acid position corresponding to position 33 of SEQ LD NO:929 (such as, for example, SEQ LD NO:837, 849, 893, 897, 905, 921, 927, 929 or 931). hi some embodiments of the invention, the amino acid sequence of the polypeptide comprises Met, Met- Ala, or Met- Ala- Ala on the N-terminal side of the amino acid corresponding to position 2 of the reference amino acid sequence. The invention further provides a nucleic acid construct comprising a polynucleotide of the invention. The construct can be a vector, such as a plant transformation vector, hi some aspects a vector of the invention will comprise a T- DNA sequence. The construct can optionally include a regulatory sequence (e.g., a promoter) operably linked to a GAT polynucleotide, where the promoter is heterologous with respect to the polynucleotide and effective to cause sufficient expression of the encoded polypeptide to enhance the glyphosate tolerance of a plant cell transformed with the nucleic acid construct. h some aspects of the invention, a GAT polynucleotide functions as a selectable marker, e.g., in a plant, bacteria, actinomycete, yeast, algae or other fungi. For example, an organism that has been transformed with a vector including a GAT polynucleotide selectable marker can be selected based on its ability to grow in the presence of glyphosate. A GAT marker gene can be used for selection or screening for transformed cells expressing the gene. The invention further provides vectors with stacked traits, i.e., vectors that encode a GAT polypeptide and that also include a second polynucleotide sequence encoding a second polypeptide that confers a detectable phenotypic trait upon a - ll CTTi/xgcuxism expressing the second poiypeptiαe at an effective ievei, tor example disease resistance or pest resistance. The detectable phenotypic trait can also function as a selectable marker, e.g., by conferring herbicide resistance or by providing some sort of visible marker. ' h one embodiment, the invention provides a composition comprising two or more polynucleotides of the invention. Preferably, the GAT polynucleotides encode GAT polypeptides having different kinetic parameters, i.e., a GAT variant having a lower Km can be combined with one having a higher kcat. In a further embodiment, the different GAT polynucleotides may be coupled to a chloroplast transit sequence or other signal sequence thereby providing GAT polypeptide expression in different cellular compartments, organelles or secretion of one or more of the GAT polypeptides. Accordingly, compositions containing two or more GAT polynucleotides or encoded polypeptides are a feature of the invention, h some cases, these compositions are libraries of nucleic acids containing, e.g., at least 3 or more such nucleic acids. Compositions produced by digesting the nucleic acids of the invention with a restriction endonuclease, a DNAse or an RNAse, or otherwise fragmenting the nucleic acids, e.g., mechanical shearing, chemical cleavage, etc., are also a feature of the invention, as are compositions produced by incubating a nucleic acid of the invention with deoxyribonucleotide triphosphates and a nucleic acid polymerase, such as a thermostable nucleic acid polymerase. Cells transduced by a vector of the invention, or which otherwise incorporate a nucleic acid of the invention, are an aspect of the invention. In a preferred embodiment, the cells express a polypeptide encoded by the nucleic acid of the invention. In some embodiments, the cells incorporating the nucleic acids of the invention are plant cells. Transgenic plants, transgenic plant cells, and transgenic plant explants incorporating the nucleic acids of the invention are also a feature of the invention. In some embodiments, the transgenic plants, transgenic plant cells, or transgenic plant explants express an exogenous polypeptide with glyphosate-N- acetyltransferase activity encoded by the nucleic acid of the invention. The invention also provides transgenic seeds produced by the transgenic plants of the iiivt-ii υii. The invention further provides transgenic plants, transgenic plant cells, transgenic plant explants, or transgenic seeds having enhanced tolerance to glyphosate due to the expression of a polypeptide with glyphosate-N- acetyltransferase activity and a polypeptide that imparts glyphosate tolerance by another mechanism, such as a glyphosate-tolerant 5-enolpyruvylshikimate-3- phosphate synthase and/or a glyphosate-tolerant glyphosate oxido-reductase. In a further embodiment, the invention provides transgenic plants or transgenic plant explants having enhanced tolerance to glyphosate, as well as tolerance to an additional herbicide due to the expression of a polypeptide with glyphosate-N- acetyltransferase activity, a polypeptide that imparts glyphosate tolerance by another mechanism, such as a glyphosate-tolerant 5-enolpyruvylshikimate-3- phosphate synthase and/or a glyphosate-tolerant glyphosate oxido-reductase and a polypeptide imparting tolerance to the additional herbicide, such as a mutated hydroxyphenylpyruvatedioxygenase, a sulfonamide-tolerant acetolactate synthase, a sulfonamide-tolerant acetohydroxy acid synthase, an imidazolinone-tolerant acetolactate synthase, an imidazolinone-tolerant acetohydroxy acid synthase, a phosphinothricin acetyltransferase and a mutated protoporphyrinogen oxidase. The invention also provides transgenic plants, transgenic plant cells, transgenic plant explants, or transgenic seeds having enhanced tolerance to glyphosate, as well as tolerance to an additional herbicide due to the expression of a polypeptide with glyphosate-N-acetyltransferase activity and a polypeptide imparting tolerance to an additional herbicide, such as, a mutated hydroxyphenylpyruvatedioxygenase, a sulfonamide-tolerant acetolactate synthase, a sulfonamide-tolerant acetohydroxy acid synthase, an imidazolinone-tolerant acetolactate synthase, an imidazolinone-tolerant acetohydroxy acid synthase, a phosphinothricin acetyltransferase and a mutated protopo hyrinogen oxidase. The invention also provides transgenic plants, transgenic plant cells, transgenic plant explants, or transgenic seeds having enhanced tolerance to glyphosate as well as additional desirable traits which may be conferred by one or more additional transgenes. Methods of producing the polypeptides of the invention by introducing the nucleic cids cncoαing τnem into celis and then expressing and optionally recovering them from the cells or culture medium are a feature of the invention. In preferred embodiments, the cells expressing the polypeptides of the invention are transgenic plant cells. Methods of increasing the expression level of a polypeptide of the invention in a plant or plant cell by inserting into the polypeptide coding sequence one or two G/C-rich codons (such as GCG or GCT) immediately adjacent to and downstream of the initiating methionine ATG codon, and/or substituting in the polypeptide coding sequence one or more codons which are less frequently utilized in plants for codons encoding the same amino acid(s) which are more frequently utilized in plants, and introducing the modified coding sequence into a plant or plant cell and expressing the modified coding sequence, are also a feature of the invention. Polypeptides that are specifically bound by a polyclonal antisera that reacts against an antigen derived from SEQ LD NO: 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 621, 623, 625, 627, 629, 631, 633, 635, 637, 639, 641, 643, 645, 647, 649, 651, 653, 655, 657, 659, 661, 663, 665, 667, 669, 671, 673, 675, 677, 679, 681, 683, 685, 687, 689,
691, 693, 695, 697, 699,701, 703, 705, 707, 709, 711, 713, 715, 717, 719,721, 723, 725, 727, 729, 731, 733, 735, 737, 739, 741, 743, 745, 747, 749, 751, 753, 755, 757, 759, 761, 763, 765, 767, 769, 771, 773, 775, 777, 779, 781, 783, 785, 787, 789, 791, 793, 795, 797, 799, 801, 803, 805, 807, 809, 811, 813, 815, 817, 819, 821, 823, 825, 833, 835, 837, 839, 841, 843, 845, 847, 849, 851, 853, 855, 857, 859, 861, 863, 865, 867, 869, 871, 873, 875, 877, 879, 881, 883, 885, 887, 889, 891, 893, 895, 897, 899, 901, 903, 905, 907, 909, 911, 913, 915, 917, 919, 921, 923, 925, 927, 929, 931, 953, 954, 955, 956, 957, 958, 959, 960, 961, 962, 963, 964, 965, 966, 967, 968, 969, 970, 971, and 972 but not to a naturally occurring related sequence, e.g., such as a peptide represented by a subsequence of those of GenBank accession number CAA70664, as well as antibodies which are produced by administering an antigen derived from any one or more of SEQ LD NO: 568, 5657370, 57i, 5727 /;>, 3 / - D, :> /O, 5 /7, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 621, 623, 625, 627, 629, 631, 633, 635, 637, 639, 641, 643, 645, 647, 649, 651, 653, 655, 657, 659, 661, 663, 665, 667, 669, 671, 673, 675, 677, 679, 681, 683, 685, 687, 689, 691, 693, 695, 697, 699,701, 703, 705, 707, 709, 711, 713, 715, 717, 719,721, 723, 725, 727, 729, 731, 733, 735, 737, 739, 741, 743, 745, 747, 749, 751, 753, 755, 757, 759, 761, 763, 765, 767, 769, 771, 773, 775, 777, 779, 781, 783, 785, 787, 789, 791, 793, 795, 797, 799, 801, 803, 805, 807, 809, 811, 813, 815, 817, 819, 821, 823, 825, 833, 835, 837, 839, 841, 843, 845, 847, 849, 851, 853, 855, 857, 859, 861, 863, 865, 867, 869, 871, 873, 875, 877, 879, 881, 883, 885, 887, 889, 891, 893, 895, 897, 899, 901, 903, 905, 907, 909, 911, 913, 915, 917, 919, 921, 923, 925, 927, 929, 931, 953, 954, 955, 956, 957, 958, 959, 960, 961, 962, 963, 964, 965, 966, 967, 968, 969, 970, 971, and 972 and/or which bind specifically to such antigens and which do not specifically bind to a naturally occurring polypeptide corresponding to those of GenBank accession number CAA70664, are all features of the invention. Another aspect of the invention relates to methods of polynucleotide diversification to produce novel GAT polynucleotides and polypeptides by recombining or mutating the nucleic acids of the invention in vitro ox in vivo. In an embodiment, the recombination produces at least one library of recombinant GAT polynucleotides. The libraries so produced are embodiments of the invention, as are cells comprising the libraries. Furthermore, methods of producing a modified GAT polynucleotide by mutating a nucleic acid of the invention are embodiments of the invention. Recombinant and mutant GAT polynucleotides and polypeptides produced by the methods of the invention are also embodiments of the invention. In some aspects of the invention, diversification is achieved by using recursive recombination, which can be accomplished in vitro, in vivo, in silico, or a combination thereof. Some examples of diversification methods described in more detail below are family shuffling methods and synthetic shuffling methods. The invention provides methods for producing a glyphosate-resistant transgenic plant or plant cell that involve transforming a plant or plant cell with a polynucleotide encoding a glyphosate-N-acetyltransferase, and optionally regenerating a transgenic plant from the transformed plant cell. In some aspects the polynucleotide is a GAT polynucleotide, optionally a GAT polynucleotide derived from a bacterial source. In some aspects of the invention, the method can comprise growing the transformed plant or plant cell in a concentration of glyphosate that inhibits the growth of a wild-type plant of the same species without inhibiting the growth of the transformed plant. The method can comprise growing the transformed plant or plant cell or progeny of the plant or plant cell in increasing concentrations of glyphosate and/or in a concentration of glyphosate that is lethal to a wild-type plant or plant cell of the same species. A glyphosate-resistant transgenic plant produced by this method can be propagated, for example by crossing it with a second plant, such that at least some progeny of the cross display glyphosate tolerance. The invention further provides methods for selectively controlling weeds in a field containing a crop that involve planting the field with crop seeds or plants which are glyphosate-tolerant as a result of being transformed with a gene encoding a glyphosate N-acetyltransferase, and applying to the crop and weeds in the field a sufficient amount of glyphosate to control the weeds without significantly affecting the crop. The invention further provides methods for controlling weeds in a field and preventing the emergence of glyphosate-resistant weeds in a field containing a crop which involve planting the field with crop seeds or plants that are glyphosate- tolerant as a result of being transformed with a gene encoding a glyphosate-N- acetyltransferase and a gene encoding a polypeptide imparting glyphosate tolerance by another mechanism, such as a glyphosate-tolerant 5- enolpyruvylshikimate-3 -phosphate synthase and/or a glyphosate-tolerant glyphosate oxido-reductase and applying to the crop and the weeds in the field a sufficient amount of glyphosate to control the weeds without significantly affecting the crop. In a further embodiment the invention provides methods for controlling weeds in a field and preventing the emergence of herbicide resistant weeds in a field containing a crop which involve planting the field with crop seeds or plants that are glyphosate-tolerant as a result of being transformed with a gene encoding a glyphosate-N-acetyltransferase, a gene encoding a polypeptide imparting glyphosate tolerance by another mechanism, such as a glyphosate-tolerant 5- enolpyruvylshikimate-3 -phosphate synthase and/or a glyphosate-tolerant glyphosate oxido-reductase and a gene encoding a polypeptide imparting tolerance to an additional herbicide, such as a mutated hydroxyphenylpyruvatedioxygenase, a sulfonamide-tolerant acetolactate synthase, a sulfonamide-tolerant acetohydroxy acid synthase, an imidazolinone-tolerant acetolactate synthase, an imidazolinone- tolerant acetohydroxy acid synthase, a phosphinothricin acetyltransferase and a mutated protoporphyrinogen oxidase and applying to the crop and the weeds in the field a sufficient amount of glyphosate and an additional herbicide, such as, a hydroxyphenylpyruvatedioxygenase inhibitor, sulfonamide, imidazolinone, bialaphos, phosphinothricin, azafenidin, butafenacil, sulfosate, glufosinate, and a protox inhibitor to control the weeds without significantly affecting the crop. The invention further provides methods for controlling weeds in a field and preventing the emergence of herbicide resistant weeds in a field containing a crop which involve planting the field with crop seeds or plants that are glyphosate- tolerant as a result of being transformed with a gene encoding a glyphosate-N- acetyltransferase and a gene encoding a polypeptide imparting tolerance to an additional herbicide, such as a mutated hydroxyphenylpyruvatedioxygenase, a sulfonamide-tolerant acetolactate synthase, a sulfonamide-tolerant acetohydroxy acid synthase, an imidazolinone-tolerant acetolactate synthase, an imidazolinone- tolerant acetohydroxy acid synthase, a phosphinothricin acetyltransferase and a mutated protoporphyrinogen oxidase and applying to the crop and the weeds in the field a sufficient amount of glyphosate and an additional herbicide, such as a hydroxyphenylpyruvatedioxygenase inhibitor, sulfonamide, imidazolinone, bialaphos, phosphinothricin, azafenidin, butafenacil, sulfosate, glufosinate, and a protox inhibitor to control the weeds without significantly affecting the crop. The invention further provides methods for producing a genetically transformed plant that is tolerant to glyphosate that involve inserting into the genome of a plant cell a recombinant, double-stranded DNA molecule comprising: (i) a promoter which functions in plant cells to cause the production of an RNA sequence; (ii) a structural DNA sequence that causes the production of an RNA sequence which encodes a GAT; and (iii) a 3' non-translated region which functions in plant cells to cause the addition of a stretch of polyadenyl nucleotides to the 3' end of the RNA sequence; where the promoter is heterologous with respect to the structural DNA sequence and adapted to cause sufficient expression of the encoded polypeptide to enhance the glyphosate tolerance of a plant cell transformed with the DNA molecule; obtaimng a transformed plant cell; and regenerating from the transformed plant cell a genetically transformed plant which has increased tolerance to glyphosate. The invention further provides methods for producing a crop that involve growing a crop plant that is glyphosate-tolerant as a result of being transformed with a gene encoding a glyphosate N-acetyltransferase, under conditions such that the crop plant produces a crop; and harvesting a crop from the crop plant. These methods often include applying glyphosate to the crop plant at a concentration effective to control weeds. Exemplary crop plants include cotton, corn, and soybean. The invention also provides computers, computer readable medium and integrated systems, including databases that are composed of sequence records including character strings corresponding to SEQ ID NO: 1-10, 16, 48, 190, 193, 196, 202, 205, 268, 300, 442, 445, 448, 454, 457, 515-830 and 832-972. Such integrated systems optionally include one or more instruction set for selecting, aligning, translating, reverse-translating or viewing any one or more character strings corresponding to SEQ LD NO: 1-10, 16, 48, 190, 193, 196, 202, 205, 268, 300, 442, 445, 448, 454, 457, 515-830 and 832-972, with each other and/or with any additional nucleic acid or amino acid sequence. BRIEF DESCRIPTION OF THE FIGURES Figure 1 depicts the N-acetylation of glyphosate catalyzed by a glyphosate- N-acetyltransferase ("GAT"). Figure 2 illustrates mass spectroscopic detection of N-acetylglyphosate produced by an exemplary Bacillus culture expressing a native GAT activity. Figure 3 is a table illustrating the relative identity between GAT sequences isolated from different strains of bacteria and yitl from. Bacillus subtilis. Figure 4 is a map of the plasmid pMAXY2120 for expression and purification of the GAT enzyme from E. coli cultures. Figure 5 is a mass spectrometry output showing increased N- acetylglyphosate production over time in a typical GAT enzyme reaction mix. Figure 6 is a plot of the kinetic data of a GAT enzyme from which a KM of 2.9 mM for glyphosate was calculated. Figure 7 is a plot of the kinetic data taken from the data of Figure 6 from which a KM of 2 μM was calculated for Acetyl CoA. Figure 8 is a scheme that describes the degradation of glyphosate in soil through the AMPA pathway. Figure 9 is a scheme that describes the sarcosine pathway of glyphosate degradation. Figure 10 is the BLOSUM62 matrix. Figure 11 is a map of the plasmid pMAXY2190. Figure 12 depicts a T-DNA construct with gat selectable marker. Figure 13 depicts a yeast expression vector with gat selectable marker. Figure 14 illustrates effect of glyphosate on plant height at tasseling. Figures 15A and 15B provide a comparison of the kinetic parameters Km and kcat/Km, respectively, for various GAT enzymes assayed in either the absence of added KC1 (unshaded bars) or in the presence of 20 mM KC1 (shaded bars) as described in Example 18. Error bars represent the standard deviation of multiple assays, where available. Figures 16A, 16B and 16C provide a comparison of the kinetic parameters
Km, kcat, and kcat/Km , respectively, of various GAT enzymes of the invention (unshaded bars) to the kinetic parameters of some further evolved GAT enzymes of the invention (shaded bars), as described in Example 19. Error bars represent the standard deviation of multiple assays, where available. Figure 17 depicts remaining GAT activity after incubation at various temperatures as described in Example 16. Figure 18 depicts the effect of pH on Kcat and KM as described in Example 30.
DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a novel class of enzymes exhibiting N- acetyltransferase activity. In one aspect, the invention relates to a novel class of enzymes capable of acetylating glyphosate and glyphosate analogs, e.g., enzymes possessing glyphosate-N-acetyltransferase ("GAT") activity. Such enzymes are characterized by the ability to acetylate the secondary amine of a compound. In some aspects of the invention, this compound is an herbicide, e.g., glyphosate, as illustrated schematically in Figure 1. This compound can also be a glyphosate analog or a metabolic product of glyphosate degradation, e.g., aminomethylphosphonic acid. Although the acetylation of glyphosate is a key catalytic step in one metabolic pathway for catabolism of glyphosate, the enzymatic acetylation of glyphosate by naturally-occurring, isolated, or recombinant enzymes has not been previously described. Thus, the nucleic acids and polypeptides of the invention provide a new biochemical pathway for engineering herbicide resistance. In one aspect, the invention provides novel genes encoding GAT polypeptides. Isolated and recombinant GAT polynucleotides corresponding to naturally occurring polynucleotides, as well as recombinant and engineered, e.g., diversified, GAT polynucleotides are a feature of the invention. GAT polynucleotides are exemplified by SEQ LD NO: 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565, 566, 567, 620, 622, 624, 626, 628, 630, 632, 634, 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, 668, 670, 672, 674, 676, 678, 680, 682, 684, 686, 688, 690, 692, 694, 696, 698, 700, 702, 704, 706, 708, 710, 712, 714, 716, 718, 720, 722, 724, 726, 728, 730, 732, 734, 736, 738, 740, 742, 744, 746, 748, 750, 752, 754, 756, 758, 760, 762, 764, 768, 770, 772, 774, 776, 778, 780, 782, 784, 786, 788, 790, 792, 794, 796, 798, 800, 802, 804, 806, 808, 810, 812, 814, 816, 818, 820, 822, 824, 832, 834, 836, 838, 840, 842, 844, 846, 848, 850, 852, 854, 856, 858, 860, 862, 864, 866, 868, 870, 872, 874, 876, 878, 880, 882, 884, 886, 888, 890, 892, 894, 896, 898, 900, 902, 904, 906, 908, 910, 912, 914, 916, 918, 920, 922, 924, 926, 928, 930, 932, 933, 934, 935, 936, 937, 938, 939, 940, 941, 942, 943, 944, 945, 947, 949, 951, and 952. Specific GAT polynucleotide and polypeptide sequences are provided as examples to help illustrate the invention, and are not intended to limit the scope of the genus of GAT polynucleotides and polypeptides described and/or claimed herein. The invention also provides methods for generating and selecting diversified libraries to produce additional GAT polynucleotides, including polynucleotides encoding GAT polypeptides with improved and/or enhanced characteristics, e.g., altered Km for glyphosate, increased rate of catalysis, increased stability, etc., based upon selection of a polynucleotide constituent of the library for the new or improved activities described herein. Such polynucleotides are especially favorably employed in the production of glyphosate-resistant transgenic plants. The GAT polypeptides of the invention exhibit a novel enzymatic activity. Specifically, the enzymatic acetylation of the synthetic herbicide glyphosate has not been recognized prior to the present invention. Thus, the polypeptides herein described, e.g., as exemplified by SEQ LD NO: 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 621, 623, 625, 627, 629, 631, 633, 635, 637, 639, 641, 643, 645, 647, 649, 651, 653, 655, 657, 659, 661, 663, 665, 667, 669, 671, 673, 675, 677, 679, 681, 683, 685, 687, 689, 691, 693, 695, 697, 699,701, 703, 705, 707, 709, 711, 713, 715, 717, 719,721, 723, 725, 727, 729, 731, 733, 735, 737, 739, 741, 743, 745, 747, 749, 751, 753, 755, 757, 759, 761, 763, 765, 767, 769, 771, 773, 775, 777, 779, 781, 783, 785, 787, 789, 791, 793, 795, 797, 799, 801, 803, 805, 807, 809, 811, 813, 815, 817, 819, 821, 823, 825, 833, 835, 837, 839, 841, 843, 845, 847, 849, 851, 853, 855, 857, 859, 861, 863, 865, 867, 869, 871, 873, 875, 877, 879, 881, 883, 885, 887, 889, 891, 893, 895, 897, 899, 901, 903, 905, 907, 909, 911, 913, 915, 917, 919, 921, 923, 925, 927, 929, 931,946, 948, 950, 953, 954, 955, 956, 957, 958, 959, 960, 961, 962, 963, 964, 965, 966, 967, 968, 969, 970, 971, and 972 define a novel biochemical pathway for the detoxification of glyphosate that is functional in vivo, e.g., in plants. Accordingly, the nucleic acids and polypeptides of the invention are of significant utility in the generation of glyphosate-resistant plants by providing new nucleic acids, polypeptides and biochemical pathways for the engineering of herbicide selectivity in transgenic plants.
DEFINITIONS Before describing the present invention in detail, it is to be understood that this invention is not limited to particular compositions or biological systems, which can, of course, vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to be limiting. As used in this specification and the appended claims, the singular forms "a," "an," and "the" include plural referents unless the content clearly dictates otherwise. Thus, for example, reference to "a device" includes a combination of two or more such devices, reference to "a gene fusion construct" includes mixtures of constructs, and the like. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the invention pertains. Although any methods and materials similar or equivalent to those described herein can be used in the practice for testing of the present invention, specific examples of appropriate materials and methods are described herein. h describing and claiming the present invention, the following terminology will be used in accordance with the definitions set out below. Accordingly, for purposes of the present invention, the term "glyphosate" should be considered to include any herbicidally effective form of N- phosphonomethylglycme (including any salt thereof) and other forms which result in the production of the glyphosate anion inplanta. The term "glyphosate analog" refers to any structural analog of glyphosate that has the ability to inhibit EPSPS at levels such that the glyphosate analog is herbicidally effective. As used herein, the term "glyphosate-N-acetyltransferase activity" or "GAT activity" refers to the ability to catalyze the acetylation of the secondary amine group of glyphosate, as illustrated, for example, in Figure 1. A "glyphosate-N- acetyltransferase" or "GAT" is an enzyme that catalyzes the acetylation of the amine group of glyphosate, a glyphosate analog, and/or a glyphosate primary metabolite (i.e., AMPA or sarcosine). In some preferred embodiments of the invention, a GAT is able to transfer the acetyl group from Acetyl CoA to the secondary amine of glyphosate and the primary amine of AMPA. hi addition, some GATs are also able to transfer the propionyl group of propionyl CoA to glyphosate, indicating that GAT is also an acyltransferase. The exemplary GATs described herein are active from about pH 5-9, with optimal activity in the range of about pH 6.5-8.0. Activity can be quantified using various kinetic parameters which are well known in the art, e.g., kcat, KM, and kcat M. These ldnetic parameters can be determined as described below in Example 7 or Example 19. The terms "polynucleotide," "nucleotide sequence," and "nucleic acid" are used to refer to a polymer of nucleotides (A, C, T, U, G, etc. or naturally occurring or artificial nucleotide analogues), e.g., DNA or RNA, or a representation thereof, e.g., a character string, etc., depending on the relevant context. A given polynucleotide or complementary polynucleotide can be determined from any specified nucleotide sequence. Similarly, an "amino acid sequence" is a polymer of amino acids (a protein, polypeptide, etc) or a character string representing an amino acid polymer, depending on context. The terms "protein," "polypeptide," and "peptide" are used interchangeably herein. A polynucleotide, polypeptide, or other component is "isolated" when it is partially or completely separated from components with which it is normally associated (other proteins, nucleic acids, cells, synthetic reagents, etc). A nucleic acid or polypeptide is "recombinant" when it is artificial or engineered, or derived from an artificial or engineered protein or nucleic acid. For example, a polynucleotide that is inserted into a vector or any other heterologous location, e.g., in a genome of a recombinant organism, such that it is not associated with nucleotide sequences that normally flank the polynucleotide as it is found in nature is a recombinant polynucleotide. A protein expressed in vitro or in vivo from a recombinant polynucleotide is an example of a recombinant polypeptide.
Likewise, a polynucleotide sequence that does not appear in nature, for example a variant of a naturally occurring gene, is recombinant. The terms "glyphosate-N-acetyltransferase polypeptide" and "GAT polypeptide" are used interchangeably to refer to any of a family of novel polypeptides provided herein. The terms "glyphosate-N-acetyltransferase polynucleotide" and "GAT polynucleotide" are used interchangeably to refer to a polynucleotide that encodes a GAT polypeptide. A "subsequence" or "fragment" is any portion of an entire sequence. Numbering of an amino acid or nucleotide polymer corresponds to numbering of a selected amino acid polymer or nucleic acid when the position of a given monomer component (amino acid residue, incorporated nucleotide, etc) of the polymer corresponds to the same residue position in a selected reference polypeptide or polynucleotide. A vector is a composition for facilitating cell transduction/transformation by a selected nucleic acid, or expression of the nucleic acid in the cell. Vectors include, e.g., plasmids, cosmids, viruses, YACs, bacteria, poly-lysine, chromosome integration vectors, episomal vectors, etc. "Substantially an entire length of a polynucleotide or amino acid sequence" refers to at least about 70%, generally at least about 80%, or typically about 90% or more of a sequence. As used herein, an "antibody" refers to a protein comprising one or more polypeptides substantially or partially encoded by immunoglobulin genes or fragments of immunoglobulin genes. The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, epsilon and mu constant region genes, as well as myriad immunoglobulin variable region genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, IgM, IgA, IgD and IgE, respectively. A typical immunoglobulin (antibody) structural unit comprises a tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each pair having one "light" (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus of each chain defines a variable region of about 100 to 110 or more amino acids primarily responsible for antigen recognition. The terms variable light chain (VL) and variable heavy chain (VH) refer to these light and heavy chains respectively. Antibodies exist as intact immunoglobulins or as a number of well characterized fragments produced by digestion with various peptidases. Thus, for example, pepsin digests an antibody below the disulfide linkages in the hinge region to produce F(ab)'2, a dimer of Fab which itself is a light chain joined to VH-CH1 by a disulfide bond. The F(ab)'2 may be reduced under mild conditions to break the disulfide linkage in the hinge region thereby converting the (Fab')2 dimer into an Fab' monomer. The Fab' monomer is essentially a Fab with part of the hinge region (see, Paul, ed. (1998) Fundamental Lmmunology (4th Edition, Raven Press, NY), for a more detailed description of other antibody fragments). While various antibody fragments are defined in terms of the digestion of an intact antibody, one of skill will appreciate that such Fab' fragments may be synthesized de novo either chemically or by utilizing recombinant DNA methodology. Thus, the term antibody as used herein also includes antibody fragments either produced by the modification of whole antibodies or synthesized de novo using recombinant DNA methodologies. Antibodies include single chain antibodies, including single chain Fv (sFv) antibodies in which a variable heavy and a variable light chain are joined together (directly or through a peptide linker) to form a continuous polypeptide. A "chloroplast transit peptide" is an amino acid sequence which is translated in conjunction with a protein and directs the protein to the chloroplast or other plastid types present in the cell in which the protein is made. "Chloroplast transit sequence" refers to a nucleotide sequence that encodes a chloroplast transit peptide. A "signal peptide" is an amino acid sequence which is translated in conjunction with a protein and directs the protein to the secretory system (Clirispeels (1991) Ann. Rev. Plant Phys. Plant Mol. Biol. 42: 21-53). If the protein is to be directed to a vacuole, a vacuolar targeting signal can further be added, or if to the endoplasmic reticulum, an endoplasmic reticulum retention signal may be added. If the protein is to be directed to the nucleus, any signal peptide present should be removed and instead a nuclear localization signal . included (Raikhel. (1992) Plant Phys : 100: 1627-1632). The terms "diversification" and "diversity," as applied to a polynucleotide, refers to generation of a plurality of modified forms of a parental polynucleotide, or plurality of parental polynucleotides. In the case where the polynucleotide encodes a polypeptide, diversity in the nucleotide sequence of the polynucleotide can result in diversity in the corresponding encoded polypeptide, e.g. a diverse pool of polynucleotides encoding a plurality of polypeptide variants. In some embodiments of the invention, this sequence diversity is exploited by screening/selecting a library of diversified polynucleotides for variants with desirable functional attributes, e.g., a polynucleotide encoding a GAT polypeptide with enhanced functional characteristics. The term "encoding" refers to the ability of a nucleotide sequence to code for one or more amino acids. The term does not require a start or stop codon. An amino acid sequence can be encoded in any one of six different reading frames provided by a polynucleotide sequence and its complement. When used herein, the term "artificial variant" refers to a polypeptide having GAT activity, which is encoded by a modified GAT polynucleotide, e.g., a modified form ofany one ofSEQ LD NO: 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565, 566, 567, 620, 622, 624, 626, 628, 630, 632, 634, 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, 668, 670, 672, 674, 676, 678, 680, 682, 684, 686, 688, 690, 692, 694, 696, 698, 700, 702, 704, 706, 708, 710, 712, 714, 716, 718, 720, 722, 724, 726, 728, 730, 732, 734, 736, 738, 740, 742, 744, 746, 748, 750, 752, 754, 756, 758, 760, 762, 764, 768, 770, 772, 774, 776, 778, 780, 782, 784, 786, 788, 790, 792, 794, 796, 798, 800, 802, 804, 806, 808, 810, 812, 814, 816, 818, 820, 822, 824, 832, 834, 836, 838, 840, 842, 844, 846, 848, 850, 852, 854, 856, 858, 860, 862, 864, 866, 868, 870, 872, 874, 876, 878, 880, 882, 884, 886, 888, 890, 892, 894, 896, 898, 900, 902, 904, 906, 908, 910, 912, 914, 916, 918, 920, 922, 924, 926, 928, 930, 932, 933, 934, 935, 936, 937, 938, 939, 940, 941, 942, 943, 944, 945, 947, 949, 951, and 952 or of a naturally occurring GAT polynucleotide isolated from an organism. The modified polynucleotide, from which an artificial variant is produced when expressed in a suitable host, is obtained through human intervention by modification of a GAT polynucleotide. The term "nucleic acid construct" or "polynucleotide construct" means a nucleic acid molecule, either single-stranded or double-stranded, which is isolated from a naturally occurring gene or which has been modified to contain segments of nucleic acids in a manner that would not otherwise exist in nature. The term nucleic acid construct is synonymous with the term "expression cassette" when the nucleic acid construct contains the control sequences required for expression of a coding sequence of the present invention. The term "control sequences" is defined herein to include all components, which are necessary or advantageous for the expression of a polypeptide of the present invention. Each control sequence may be native or foreign to the nucleotide sequence encoding the polypeptide. Such control sequences include, but are not limited to, a leader sequence, polyadenylation sequence, propeptide sequence, promoter sequence, signal peptide sequence, and transcription terminator sequence. At a minimum, the control sequences include a promoter and transcriptional and translational stop signals. The control sequences may be provided with linkers for the purpose of introducing specific restriction sites facilitating ligation of the control sequences with the coding region of the nucleotide sequence encoding a polypeptide. The term "operably linked" is defined herein as a configuration in which a control sequence is appropriately placed at a position relative to the coding sequence of the DNA sequence such that the control sequence directs the expression of a polypeptide. When used herein the term "coding sequence" is intended to cover a nucleotide sequence, which directly specifies the amino acid sequence of its protein product. The boundaries of the coding sequence are generally determined by an open reading frame, which usually begins with the ATG start codon. The coding sequence typically includes a DNA, cDNA, and/or recombinant nucleotide sequence. hi the present context, the term "expression" includes any step involved in the production of the polypeptide including, but not limited to, transcription, post- transcriptional modification, translation, post-translational modification, and secretion. In the present context, the term "expression vector" covers a DNA molecule, linear or circular, that comprises a segment encoding a polypeptide of the invention, and which is operably linked to additional segments that provide for its transcription. The term "host cell", as used herein, includes any cell type which is susceptible to transformation with a nucleic acid construct. The term "plant" includes whole plants, shoot vegetative organs/structures (e.g., leaves, stems and tubers), roots, flowers and floral organs/structures (e.g., bracts, sepals, petals, stamens, carpels, anthers and ovules), seed (including embryo, endosperm, and seed coat) and fruit (the mature ovary), plant tissue (e.g., vascular tissue, ground tissue, and the like) and cells (e.g., guard cells, egg cells, trichomes and the like), and progeny of same. The class of plants that can be used in the method of the invention is generally as broad as the class of higher and lower plants amenable to transformation techniques, including angiosperms (monocotyledonous and dicotyledonous plants), gymnosperms, ferns, and multicellular algae. It includes plants of a variety of ploidy levels, including aneuploid, polyploid, diploid, haploid and hemizygous. The term "heterologous" as used herein describes a relationship between two or more elements which indicates that the elements are not normally found in proximity to one another in nature. Thus, for example, a polynucleotide sequence is "heterologous to" an organism or a second polynucleotide sequence if it originates from a foreign species, or, if from the same species, is modified from its original form. For example, a promoter operably linked to a heterologous coding sequence refers to a coding sequence from a species different from that from wliich the promoter was derived, or, if from the same species, a coding sequence which is not naturally associated with the promoter (e.g., a genetically engineered coding sequence or an allele from a different ecotype or variety). An example of a heterologous polypeptide is a polypeptide expressed from a recombinant polynucleotide in a transgenic organism. Heterologous polynucleotides and polypeptides are forms of recombinant molecules. A variety of additional terms are defined or otherwise characterized herein.
GLYPHOSATE-N-ACETYLTRANSFERASES In one aspect, the invention provides a novel family of isolated or recombinant enzymes referred to herein as "glyphosate-N-acetyltransferases," "GATs," or "GAT enzymes." GATs are enzymes that have GAT activity, preferably sufficient activity to confer some degree of glyphosate tolerance upon a transgenic plant engineered to express the GAT. Some examples of GATs include GAT polypeptides, described in more detail below. GAT-mediated glyphosate tolerance is a complex function of GAT activity,
GAT expression levels in the transgenic plant, the particular plant, and numerous other factors, including but not limited to the nature and timing of herbicide application. One of skill in the art can determine without undue experimentation the level of GAT activity required to effect glyphosate tolerance in a particular context. GAT activity can be characterized using the conventional kinetic parameters koat, KM, and kcat /KM- kcat can be thought of as a measure of the rate of acetylation, particularly at high substrate concentrations, KM is a measure of the affinity of the GAT for its substrates (e.g., acetyl CoA, propionyl CoA and glyphosate), and kcat/KM is a measure of catalytic efficiency that takes both substrate affinity and catalytic rate into account. kcat/Km is particularly important in the situation where the concentration of a substrate is at least partially rate- limiting, h general, a GAT with a higher kcat or kcat/K is a more efficient catalyst than another GAT with lower kcat or kcaiIKu. A GAT with a lower KM is a more efficient catalyst than another GAT with a higher KM- Thus, to determine whether one GAT is more effective than another, one can compare kinetic parameters for the two enzymes. The relative importance of kcat, kcat KM and K will vary depending upon the context in which the GAT will be expected to function, e.g., the anticipated effective concentration of glyphosate relative to the KM for glyphosate. GAT activity can also be characterized in terms of any of a number of functional characteristics, including but not limited to stability, susceptibility to inhibition, or activation by other molecules.
GLYPHOSATE-N-ACETYLTRANSFERASE POLYPEPTIDES h one aspect, the invention provides a novel family of isolated or recombinant polypeptides referred to herein as "glyphosate-N-acetyltransferase polypeptides" or "GAT polypeptides." GAT polypeptides are characterized by their structural similarity to a novel family of GATs. Many but not all GAT polypeptides are GATs. The distinction is that GATs are defined in terms of function, whereas GAT polypeptides are defined in terms of structure. A subset of the GAT polypeptides consists of those GAT polypeptides that have GAT activity, preferably at a level that will function to confer glyphosate resistance upon a transgenic plant expressing the protein at an effective level. Some preferred GAT polypeptides for use in conferring glyphosate tolerance have a kcat of at least 1 min" l, ox more preferably at least 10 min"1, 100 min"1 or 1000 min"1. Other preferred GAT polypeptides for use in conferring glyphosate tolerance have a KM no greater than 100 mM, or more preferably no greater than 10 mM, 1 mM, or 0.1 mM. Still other preferred GAT polypeptides for use in conferring glyphosate tolerance have a cat KM of at least 1 rnM' n"1 or more, preferably at least 10 mM'Vin"1, 100 mM" 1min'1, 1000 mM^min"1, or 10,000 mM' "1. Exemplary GAT polypeptides have been isolated and characterized from a variety of bacterial strains. One example of a monomeric GAT polypeptide that has been isolated and characterized has a molecular radius of approximately 17 kD. An exemplary GAT enzyme isolated from a strain of B. licheniformis, SEQ LD NO:7, exhibits a Km for glyphosate of approximately 2.9 mM and a Km for acetyl CoA of approximately 2 μM, with a kcat equal to 6/minute. The term "GAT polypeptide" refers to any polypeptide comprising an amino acid sequence that can be optimally aligned with an amino acid sequence selected from the group consisting of SEQ LD NO:300, 445, and 457 to generate a similarity score of at least 460 using the BLOSUM62 matrix, a gap existence penalty of 11, and a gap extension penalty of 1, wherein at least one of the following positions conform to the following restrictions: (i) at positions 18 and 38, there is a Z5 amino acid residue; (ii) at position 62, there is a ZI amino acid residue; (iii) at position 124, there is a Z6 amino acid residue; and (iv) at position 144, there is a Z2 amino acid residue, wherein: ZI is an amino acid residue selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid residue selected from the group consisting of F, W, and Y; Z5 is an amino acid residue selected from the group consisting of D and E; and Z6 is an amino acid residue selected from the group consisting of C, G, and P. Some aspects of the invention pertain to GAT polypeptides comprising an amino acid sequence that can be optimally aligned with an amino acid sequence selected from the group consisting of SEQ LD NO: 300, 445, and 457 to generate a similarity score of at least 440, 445, 450, 455, 460, 465, 470, 475, 480, 485, 490, 495, 500, 505, 510, 515, 520, 525, 530, 535, 540, 545, 550, 555, 560, 565, 570, 575, 580, 585, 590, 595, 600, 605, 610, 615, 620, 625, 630, 635, 640, 645, 650, 655, 660, 665, 670, 675, 680, 685, 690, 695, 700, 705, 710, 715, 720, 725, 730, 735, 740, 745, 750, 755, or 760 using the BLOSUM62 matrix, a gap existence penalty of 11, and a gap extension penalty of 1, wherein one or more of the following positions conform to the following restrictions: (i) at positions 18 and 38, a Z5 amino acid residue; (ii) at position 62, a ZI amino acid residue; (iii) at position 124, a Z6 amino acid residue; and (iv) at position 144, a Z2 amino acid residue, wherein: ZI is an amino acid residue selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid residue selected from the group consisting of F, W, and Y; Z5 is an amino acid residue selected from the group consisting of D and E; and Z6 is an amino acid residue selected from the group consisting of C, G, and P. Two sequences are "optimally aligned" when they are aligned for similarity scoring using a defined amino acid substitution matrix (e.g., BLOSUM62), gap existence penalty and gap extension penalty so as to arrive at the highest score possible for that pair of sequences. Amino acid substitution matrices and their use in quantifying the similarity between two sequences are well-known in the art and described, e.g., in Dayhoff et al. (1978) "A model of evolutionary change in proteins" in "Atlas of Protein Sequence and Structure," Vol. 5, Suppl. 3 (ed. M.O. Dayhoff), pp. 345-352. Natl. Biomed. Res. Found., Washington, DC and Henikoff et al. (1992) Proc. Natl Acad. Sci. USA 89: 10915-10919. The BLOSUM62 matrix (Fig. 10) is often used as a default scoring substitution matrix in sequence alignment protocols such as Gapped BLAST 2.0. The gap existence penalty is imposed for the introduction of a single amino acid gap in one of the aligned sequences, and the gap extension penalty is imposed for each additional empty amino acid position inserted into an already opened gap. The alignment is defined by the amino acids positions of each sequence at which the alignment begins and ends, and optionally by the insertion of a gap or multiple gaps in one or both sequences so as to arrive at the highest possible score. While optimal alignment and scoring can be accomplished manually, the process is facilitated by the use of a computer-implemented alignment algorithm, e.g., gapped BLAST 2.0, described in Altschul et al. (1997) Nucl. Acids Res. 25: 3389-3402, and made available to the public at the National Center for Biotechnology Information (NCBI) Website (www.ncbi.nlm.nih.gov). Optimal alignments, including multiple alignments, can be prepared using, e.g., PSI-BLAST, available through the NCBI website and described by Altschul et al. (1997) Nucl. Acids Res. 25:3389-3402. With respect to an amino acid sequence that is optimally aligned with a reference sequence, an amino acid residue "corresponds to" the position in the reference sequence with which the residue is paired in the alignment. The "position" is denoted by a number that sequentially identifies each amino acid in the reference sequence based on its position relative to the N-terminus. For example, in SEQ LD NO:300, position 1 is M, position 2 is I, position 3 is E, etc. When a test sequence is optimally aligned with SEQ LD NO:300, a residue in the test sequence that aligns with the E at position 3 is said to "correspond to position 3" of SEQ LD NO:300. Owing to deletions, insertion, truncations, fusions, etc., that must be taken into account when determining an optimal alignment, in general the amino acid residue number in a test sequence as determined by simply counting from the N-terminal will not necessarily be the same as the number of its corresponding position in the reference sequence. For example, in a case where there is a deletion in an aligned test sequence, there will be no amino acid that corresponds to a position in the reference sequence at the site of deletion. Where there is an insertion in an aligned reference sequence, that insertion will not correspond to any amino acid position in the reference sequence. In the case of truncations or fusions there can be stretches of amino acids in either the reference or aligned sequence that do not correspond to any amino acid in the corresponding sequence. The term "GAT polypeptide" further refers to any polypeptide comprising an amino acid sequence selected from the group consisting of: (a) an amino acid sequence that is at least 98% identical to SEQ LD NO:577 (b) an amino acid sequence that is at least 97% identical to SEQ LD NO:578 (c) an amino acid sequence that is at least 97% identical to SEQ LD NO:621 (d) an amino acid sequence that is at least 98% identical to SEQ LD NO:579 (e) an amino acid sequence that is at least 98% identical to SEQ LD NO:602 (f) an amino acid sequence that is at least 95% identical to SEQ LD NO:697 (g) an amino acid sequence that is at least 96% identical to SEQ LD NO:721 (h) an amino acid sequence that is at least 97% identical to SEQ LD NO.-613 (i) an amino acid sequence that is at least 89% identical to SEQ LD NO:677 (j) an amino acid sequence that is at least 96% identical to SEQ LD NO:584 (k) an amino acid sequence that is at least 98% identical to SEQ LD NO:707 (1) an amino acid sequence that is at least 98% identical to SEQ LD NO:616 (m) an amino acid sequence that is at least 96% identical to SEQ LD NO:612;and (n) an amino acid sequence that is at least 98% identical to SEQ LD NO:590. The term "GAT polypeptide" further refers to any polypeptide comprising an amino acid sequence having at least 89% sequence identity with residues 1-96 of the amino acid sequence of SEQ LD NO:677; an amino acid sequence having at least 95% sequence identity with residues 1-96 of the amino acid sequence of SEQ LD NO: 697; an amino acid sequence having at least 96% sequence identity with residues 1-96 of the amino acid sequence selected from the group consisting of SEQ LD NO:584, 612, and 721; an amino acid sequence having at least 97% sequence identity with residues 1-96 of the amino acid sequence selected from the group consisting of SEQ LD NO:578, 613, and 621; an amino acid sequence having at least 98% sequence identity with residues 1-96 of the amino acid sequence selected from the group consisting of SEQ LD NO:577, 579, 590, 602, 616, and 707. The term "GAT polypeptide" further refers to any polypeptide comprising an amino acid sequence having at least 89%) sequence identity with residues 51- 146 of the amino acid sequence of SEQ LD NO: 677; an amino acid sequence having at least 95% sequence identity with residues 51-146 of the amino acid sequence of SEQ LD NO:697; an amino acid sequence having at least 96% sequence identity with residues 51-146 of the amino acid sequence selected from the group consisting of SEQ LD NO:584, 612, and 721; an amino acid sequence having at least 97% sequence identity with residues 51-146 of the amino acid sequence selected from the group consisting of SEQ LD NO:578, 613, and 621; an amino acid sequence having at least 98% sequence identity with residues 51-146 of the amino acid sequence selected from the group consisting of SEQ LD NO:577, 579, 590, 602, 616, and 707. The term "GAT polypeptide" further refers to any polypeptide comprising an amino acid sequence selected from the group consisting of: (a) an amino acid sequence that is at least 96% identical to residues 2-146 of SEQ LD NO:919; (b) an amino acid sequence that is at least 97% identical to residues 2-146 of SEQ LD
NO:929; (c) an amino acid sequence that is at least 98% identical to residues 2-146 of SEQ LD NO:847; (d) an amino acid sequence that is at least 98% identical to residues 2-146 of SEQ ID NO:851; (e) an amino acid sequence that is at least 98% identical to residues 2-146 of SEQ LD NO: 853; (f) an amino acid sequence that is at least 98% identical to residues 2-146 of SEQ LD NO:855 (such as, for example, SEQ LD NO:835 or 855); (g) an amino acid sequence that is at least 98% identical to residues 2-146 of SEQ LD NO:857; (h) an amino acid sequence that is at least 98% identical to residues 2-146 of SEQ LD NO:861; (i) an amino acid sequence that is at least 98% identical to residues 2-146 of SEQ LD NO:871; (j) an amino acid sequence that is at least 98% identical to residues 2-146 of SEQ LD NO:875; (k) an amino acid sequence that is at least 98% identical to residues 2-146 of SEQ LD NO:881; (1) an amino acid sequence that is at least 98% identical to residues 2- 146 of SEQ LD NO: 885; (m) an amino acid sequence that is at least 98% identical to residues 2-146 of SEQ LD NO:887; (n) an amino acid sequence that is at least 98% identical to residues 2-146 of SEQ LD NO:889; (o) an amino acid sequence that is at least 98% identical to residues 2-146 of SEQ LD NO: 893; (p) an amino acid sequence that is at least 98% identical to residues 2-146 of SEQ LD NO:897; (q) an amino acid sequence that is at least 98% identical to residues 2-146 of SEQ LD NO:899; (r) an amino acid sequence that is at least 98% identical to residues 2- 146 of SEQ LD NO:909; (s) an amino acid sequence that is at least 98% identical to residues 2-146 of SEQ LD NO:911; (t) an amino acid sequence that is at least 99% identical to residues 2-146 of SEQ LD NO:837; (u) an amino acid sequence that is at least 99% identical to residues 2-146 of SEQ LD NO: 841; (v) an amino acid sequence that is at least 99% identical to residues 2-146 of SEQ LD NO:865; (w) an amino acid sequence that is at least 99% identical to residues 2-146 of SEQ LD NO:869; and (x) an amino acid sequence that is at least 99% identical to residues 2-146 of SEQ LD NO:879. The term "GAT polypeptide" further refers to any polypeptide comprising an amino acid sequence that is at least 95% identical to residues 2-146 of SEQ LD NO:929 and which comprises a Gly or an Asn residue at the amino acid position corresponding to position 33 of SEQ LD NO:929. The term "GAT polypeptide" further refers to any polypeptide comprising an amino acid sequence that shares at least 60%, 65%, 70%, 75%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%), 98%, 99%, or more sequence identity with an exemplary GAT polypeptide disclosed herein. Thus, for example, GAT polypeptides of the invention include polypeptides comprising an amino acid sequence that shares at least 60%, 65%, 70%, 75%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or more sequence identity with any of SEQ LD NO: 953, 954, 955, 956, 957, 958, 959, 960, 961, 962, 963, 964, 965, 966, 967, 968, 969, 970, 971, and 972. As used herein, the term "identity" or "percent identity" when used with respect to a particular pair of aligned amino acid sequences refers to the percent amino acid sequence identity that is obtained by ClustalW analysis (version W 1.8 available from European Bioinformatics Institute, Cambridge, UK), counting the number of identical matches in the alignment and dividing such number of identical matches by the greater of (i) the length of the aligned sequences, and (ii) 96, and using the following default ClustalW parameters to achieve slow/accurate pairwise alignments - Gap Open Penalty:10; Gap Extension Penalty:0.10; Protein weight matrix: Gonnet series; DNA weight matrix: IUB; Toggle Slow/Fast pairwise alignments = SLOW or FULL Alignment. In another aspect, the invention provides an isolated or recombinant polypeptide that comprises at least 20, or alternatively, at least 50, at least 75, at least 100, at least 125, at least 130, at least 135, at least 140, at least 141, at least 142, at least 143, at least 144 or at least 145 contiguous amino acids of an amino acid sequence selected from the group consisting of: (a) an amino acid sequence that is at least 98% identical to SEQ LD NO:577; (b) an amino acid sequence that is at least 97% identical to SEQ LD NO:578; (c) an amino acid sequence that is at least 97% identical to SEQ LD NO:621; (d) an amino acid sequence that is at least 98% identical to SEQ LD NO:579; (e) an amino acid sequence that is at least 98% identical to SEQ LD NO:602; (f) an amino acid sequence that is at least 95% identical to SEQ LD NO:697; (g) an amino acid sequence that is at least 96% identical to SEQ LD NO: 721; (h) an amino acid sequence that is at least 97% identical to SEQ LD NO:613; (i) an amino acid sequence that is at least 89% identical to SEQ LD NO: 677; (j) an amino acid sequence that is at least 96% identical to SEQ LD NO:584; (k) an amino acid sequence that is at least 98% identical to SEQ LD NO:707; (1) an amino acid sequence that is at least 98% identical to SEQ LD NO:616; (m) an amino acid sequence that is at least 96% identical to SEQ LD NO:612;and (n) an amino acid sequence that is at least 98% identical to SEQ LD NO:590. hi another aspect, the invention provides a polypeptide comprising residues
2-146 of an amino acid sequence selected from the group consisting of SEQ LD NO: 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 621, 623, 625, 627, 629, 631, 633, 635, 637, 639, 641, 643, 645, 647, 649, 651, 653, 655, 657, 659, 661, 663, 665, 667, 669, 671, 673, 675, 677, 679, 681, 683, 685, 687, 689, 691, 693, 695, 697, 699,701, 703, 705, 707, 709, 711, 713, 715, 717, 719,721, 723, 725, 727, 729, 731, 733, 735, 737, 739, 741, 743, 745, 747, 749, 751, 753, 755, 757, 759, 761, 763, 765, 767, 769, 771, 773, 775, 777, 779, 781, 783, 785, 787, 789, 791, 793, 795, 797, 799, 801, 803, 805, 807, 809, 811, 813, 815, 817, 819, 821, 823, and 825. In some embodiments of the invention, the amino acid sequence of the polypeptide comprises Met, Met- Ala, or Met- Ala- Ala on the N-terminal side of the amino acid corresponding to position 2 of the reference amino acid sequence. Some preferred GAT polypeptides of the invention can be optimally aligned with a reference amino acid sequence selected from the group consisting of SEQ LD NO:300, 445, and 457 to generate a similarity score of at least 460 using the BLOSUM62 matrix, a gap existence penalty of 11, and a gap extension penalty of 1, wherein at least one of the following positions conforms to the following restrictions: (i) at positions 18 and 38, there is a Z5 amino acid residue; (ii) at position 62, there is a ZI amino acid residue; (iii) at position 124, there is a Z6 amino acid residue; and (iv) at position 144, there is a Z2 amino acid residue, wherein: ZI is an amino acid residue selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid residue selected from the group consisting of F, W, and Y; Z5 is an amino acid residue selected from the group consisting of D and E; and Z6 is an amino acid residue selected from the group consisting of C, G, and P, and further wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 90% conform to the following restrictions: (a) at positions 2, 4, 15, 19, 26, 28, 31, 45, 51, 54, 86, 90, 91, 97, 103,
105, 106, 114, 123, 129, 139, and/or 145 the amino acid residue is Bl; and (b) at positions 3, 5, 8, 10, 11, 14, 17, 24, 27, 32, 37, 47, 48, 49, 52, 57, 58, 61, 63, 68, 69, 79, 80, 82, 83, 89, 92, 100, 101, 104, 119, 120, 125, 126, 128, 131, and/or 143 the amino acid residue is B2; wherein Bl is an amino acid selected from the group consisting of A, I, L, M, F, W, Y, and V; and B2 is an amino acid selected from the group consisting of R, N, D, C, Q, E, G, H, K, P, S, and T. When used to specify an amino acid or amino acid residue, the single letter designations A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, and Y have their standard meaning as used in the art and as provided in Table 1 herein. Some preferred GAT polypeptides of the invention can be optimally aligned with a reference amino acid sequence selected from the group consisting of SEQ ID NO: 300, 445, and 457 to generate a similarity score of at least 460 using the BLOSUM62 matrix, a gap existence penalty of 11, and a gap extension penalty of 1, wherein at least one of the following positions conforms to the following restrictions: (i) at positions 18 and 38, there is a Z5 amino acid residue; (ii) at position 62, there is a ZI amino acid residue; (iii) at position 124, there is a Z6 amino acid residue; and (iv) at position 144, there is a Z2 amino acid residue, wherein: ZI is an amino acid residue selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid residue selected from the group consisting of F, W, and Y; Z5 is an amino acid residue selected from the group consisting of D and E; and Z6 is an amino acid residue selected from the group consisting of C, G, and P, and further wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 80% conform to the following restrictions: (a) at positions 2, 4, 15, 19, 26, 28, 51, 54, 86, 90, 91, 97, 103, 105,
106, 114, 129, 139, and/or 145 the amino acid residue is ZI; (b) at positions 31 and/or 45 the amino acid residue is Z2; (c) at position 8 the amino acid residue is Z3; (d) at position 89 the amino acid residue is Z3 or Z6; (e) at positions 82, 92, 101 and/or 120 the amino acid residue is Z4; (f) at positions 3, 11, 27 and/or 79 the amino acid residue is Z5; (g) at position 18 the amino acid residue is Z4 or Z5; (h) at position 123 the amino acid residue is ZI or Z2; (i) at positions 12, 33, 35, 39, 53, 59, 112, 132, 135, 140, and/or 146 the amino acid residue is ZI or Z3; (j) at position 30 the amino acid residue is ZI; (k) at position 6 the amino acid residue is Z6; (1) at position 81 the amino acid residue is Z2 or Z4; (m) at position 113 the amino acid residue is Z3; (n) at position 138 the amino acid residue is Z4; (o) at position 142 the amino acid residue is Z2; (p) at positions 57 and/or 126 the amino acid residue is Z3 or Z4; (q) at position 5, 17, and 61 the amino acid residue is Z4; (r) at position 24 the amino acid residue is Z3; (s) at position 104 the amino acid residue is Z5; (t) at positions 52, and/or 69 the amino acid residue is Z3; (u) at positions 14 and/or 119 the amino acid residue is Z5; (v) at positions 10, 32, 63, and/or 83 the amino acid residue is Z5; (w) at positions 48 and/or 80 the amino acid residue is Z6; (x) at position 40 the amino acid residue is ZI or Z2; (y) at position 96 the amino acid residue is Z3 or Z5; (z) at position 65 the amino acid residue is Z3, Z4, or Z6; (aa) at positions 84 and/or 115 the amino acid residue is Z3; (ab) at position 93 the amino acid residue is Z4; (ac) at position 130 the amino acid residue is Z2; (ad) at position 58 the amino acid residue is Z3, Z4 or Z6; (ae) at position 47 the amino acid residue is Z4 or Z6; (at) at positions 49 and/or 100 the amino acid residue is Z3 or Z4; (ag) at position 68 the amino acid residue is Z4 or Z5; (ah) at position 143 the amino acid residue is Z4; (ai) at position 131 the amino acid residue is Z5; (aj) at positions 125 and/or 128 the amino acid residue is Z5; (ak) at position 67 the amino acid residue is Z3 or Z4; (al) at position 60 the amino acid residue is Z5; and (am) at position 37 the amino acid residue is Z4 or Z6; wherein ZI is an amino acid selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid selected from the group consisting of F, W, and Y; Z3 is an amino acid selected from the group consisting of N, Q, S, and T; Z4 is an amino acid selected from the group consisting of R, H, and K; Z5 is an amino acid selected from the group consisting of D and E; and Z6 is an amino acid selected from the group consisting of C, G, and P. Some preferred GAT polypeptides of the invention further comprise the amino acid residues in the amino acid sequence that correspond to the positions specified in (a)-(am), wherein at least 90% conform to the amino acid residue restrictions specified in (a)-(am). Some preferred GAT polypeptides of the invention additionally comprise amino acid residues in the amino acid sequence that correspond to the following positions, wherein at least 90% conform to the following restrictions: (a) at positions 1, 7, 9, 13, 20, 36, 42, 46, 50, 56, 64, 70, 72, 75, 76, 78, 94, 98, 107, 110, 117, 118, 121, and/or 141 the amino acid residue is Bl; and (b) at positions 16, 21, 22, 23, 25, 29, 34, 41, 43, 44, 55, 66, 71, 73, 74, 77, 85, 87, 88, 95, 99, 102, 108, 109, 111, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is B2; wherein Bl is an amino acid selected from the group consisting of A, I, L, M, F, W, Y, and V; and B2 is an amino acid selected from the group consisting of R, N, D, C, Q, E, G, H, K, P, S, and T. Some preferred GAT polypeptides of the invention additionally comprise amino acid residues in the amino acid sequence that correspond to the following positions, wherein at least 90% conform to the following restrictions: (a) at positions 1, 7, 9, 13, 20, 42, 46, 50, 56, 64, 70, 72, 75, 76, 78, 94, 98, 107, 110, 117, 118, 121, and/or 141 the amino acid residue is Bl; and (b) at positions 16, 21, 22, 23, 25, 29, 34, 36, 41, 43, 44, 55, 66, 71, 73, 74, 77, 85, 87, 88, 95, 99,102, 108, 109, 111, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is B2; wherein Bl is an amino acid selected from the group consisting of A, I, L, M, F, W, Y, and V; and B2 is an amino acid selected from the group consisting of R, N, D, C, Q, E, G, H, K, P, S, and T. Some preferred GAT polypeptides of the invention additionally comprise amino acid residues in the amino acid sequence that correspond to the following positions, wherein at least 90% conform to the following restrictions: (a) at positions 1, 7, 9, 20, 42, 50, 72, 75, 76, 78, 94, 98, 110, 121, and/or 141 the amino acid residue is ZI; (b) at positions 13, 46, 56, 70, 107, 117, and/or 118 the amino acid residue is Z2; (c) at positions 23, 55, 71, 77, 88, and/or 109 the amino acid residue is Z3; (d) at positions 16, 21, 41, 73, 85, 99, and/or 111 the amino acid residue is Z4; (e) at positions 34 and/or 95 the amino acid residue is Z5; (f) at position 22, 25, 29, 43, 44, 66, 74, 87, 102, 108, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is Z6; wherein ZI is an amino acid selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid selected from the group consisting of F, W, and Y; Z3 is an amino acid selected from the group consisting of N, Q, S, and T; Z4 is an amino acid selected from the group consisting of R, H, and K; Z5 is an amino acid selected from the group consisting of D and E; and Z6 is an amino acid selected from the group consisting of C, G, and P. Some preferred GAT polypeptides of the invention further comprise an amino acid residue at position 36 which is selected from the group consisting of ZI and Z3. Some preferred GAT polypeptides of the invention further comprise an amino acid residue at position 64 which is selected from the group consisting of ZI and Z2. Some preferred GAT polypeptides of the invention further comprise amino acid residues in the amino acid sequence that correspond to the following positions, wherein at least 80% conform to the following restrictions: (a) at position 2 the amino acid residue is I or L; (b) at position 3 the amino acid residue is E; (c) at position 4 the amino acid residue is V or I; (d) at position 5 the amino acid residue is K; (e) at position 6 the amino acid residue is P; (f) at position 8 the amino acid residue is N; (g) at position 10 the amino acid residue is E; (h) at position 11 the amino acid residue is D or E; (i) at position 12 the amino acid residue is T; (j) at position 14 the amino acid residue is E or D; (k) at position 15 the amino acid residue is L; (1) at position 17 the amino acid residue is H; (m) at position 18 the amino acid residue is R, E or K; (n) at position 19 the amino acid residue is I or V; (o) at position 24 the amino acid residue is Q; (p) at position 26 the amino acid residue is M, L, V or I; (q) at position 27 the amino acid residue is E; (r) at position 28 the amino acid residue is A or V; (s) at position 30 the amino acid residue is M; (t) at position 31 the amino acid residue is Y or F; (u) at position 32 the amino acid residue is E or D; (v) at position 33 the amino acid residue is T or S; (w) at position 35 the amino acid residue is L; (x) at position 37 the amino acid residue is R, G, E or Q; (y) at position 39 the amino acid residue is A or S; (z) at position 40 the amino acid residue is F or L; (aa) at position 45 the amino acid residue is Y or F; (ab) at position 47 the amino acid residue is R or G; (ac) at position 48 the amino acid residue is G; (ad) at position 49 the amino acid residue is K, R, or Q; (ae) at position 51 the amino acid residue is I or V; (af) at position 52 the amino acid residue is S; (ag) at position 53 the amino acid residue is I or V; (ah) at position 54 the amino acid residue is A; (ai) at position 57 the amino acid residue is H or N; (aj) at position 58 the amino acid residue is Q, K, R or P; (ak) at position 59 the amino acid residue is A; (al) at position 60 the amino acid residue is E; (am) at position 61 the amino acid residue is H or R; (an) at position 63 the amino acid residue is E or D; (ao) at position 65 the amino acid residue is E, P or Q; (ap) at position 67 the amino acid residue is Q or R; (aq) at position 68 the amino acid residue is K or E; (ar) at position 69 the amino acid residue is Q; (as) at position 79 the amino acid residue is E; (at) at position 80 the amino acid residue is G; (au) at position 81 the amino acid residue is Y, H or F; (av) at position 82 the amino acid residue is R; (aw) at position 83 the amino acid residue is E or D; (ax) at position 84 the amino acid residue is Q; (ay) at position 86 the amino acid residue is A; (az) at position 89 the amino acid residue is G, T or S; (ba) at position 90 the amino acid residue is L; (bb) at position 91 the amino acid residue is L, I or V; (be) at position 92 the amino acid residue is R or K; (bd) at position 93 the amino acid residue is H; (be) at position 96 the amino acid residue is E or Q; (bf) at position 97 the amino acid residue is I; (bg) at position 100 the amino acid residue is K or N ; (bh) at position 101 the amino acid residue is K or R; (bi) at position 103 the amino acid residue is A or V; (bj) at position 104 the amino acid residue is D; (bk) at position 105 the amino acid residue is M, L or I; (bl) at position 106 the amino acid residue is L; (bm) at position 112 the amino acid residue is T or A; (bn) at position 113 the amino acid residue is S or T; (bo) at position 114 the amino acid residue is A; (bp) at position 115 the amino acid residue is S; (bq) at position 119 the amino acid residue is K or R; (br) at position 120 the amino acid residue is K or R; (bs) at position 123 the amino acid residue is F or L; (bt) at position 125 the amino acid residue is E; (bu) at position 126 the amino acid residue is Q or H; (bv) at position 128 the amino acid residue is E or D; (bw) at position 129 the amino acid residue is V or I; (bx) at position 130 the amino acid residue is F; (by) at position 131 the amino acid residue is D or E; (bx) at position 132 the amino acid residue is T; (ca) at position 135 the amino acid residue is V; (cb) at position 138 the amino acid residue is H; (cc) at position 139 the amino acid residue is I; (cd) at position 140 the amino acid residue is L or M; (ce) at position 142 the amino acid residue is Y; (cf) at position 143 the amino acid residue is K or R; (eg) at position 145 the amino acid residue is L or I; and (ch) at position 146 the amino acid residue is T. Some preferred GAT polypeptides of the invention further comprise amino acid residues in the amino acid sequence that correspond to the positions specified in (a) - (ch) above, wherein at least 90% conform to the amino acid residue restrictions specified in (a) - (ch). Some preferred GAT polypeptides of the invention can be optimally aligned with a reference amino acid sequence selected from the group consisting of SEQ LD NO: 300, 445, and 457 to generate a similarity score of at least 460 using the BLOSUM62 matrix, a gap existence penalty of 11, and a gap extension penalty of 1, wherein at least one of the following positions conforms to the following restrictions: (i) at positions 18 and 38, there is a Z5 amino acid residue; (ii) at position 62, there is a ZI amino acid residue; (iii) at position 124, there is a Z6 amino acid residue; and (iv) at position 144, there is a Z2 amino acid residue; wherein: ZI is an amino acid residue selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid residue selected from the group consisting of F, W, and Y; Z5 is an amino acid residue selected from the group consisting of D and E; and Z6 is an amino acid residue selected from the group consisting of C, G, and P, and further wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 80% conform to the following restrictions: (a) at positions 9, 76, 94 and 110 the amino acid residue is A; (b) at positions 29 and 108 the amino acid residue is C; (c) at position 34 the amino acid residue is D; (d) at position 95 the amino acid residue is E; (e) at position 56 the amino acid residue is F; (f) at positions 43, 44, 66, 74, 87, 102, 116, 122, 127 and 136 the amino acid residue is G; (g) at position 41 the amino acid residue is H; (h) at position 7 the amino acid residue is I; (i) at position 85 the amino acid residue is K; (j) at positions 20, 42, 50, 78 and 121 the amino acid residue is L; (k) at positions 1 and 141 the amino acid residue is M; (1) at positions 23 and 109 the amino acid residue is N; (m) at positions 22, 25, 133, 134 and 137 the amino acid residue is P; (n) at position 71 the amino acid residue is Q; (o) at positions 16, 21, 73, 99 and 111 the amino acid residue is R; (p) at position 55 the amino acid residue is S; (q) at position 77 the amino acid residue is T; (r) at position 107 the amino acid residue is W; and (s) at position 13, 46, 70 and 118 the amino acid residue is Y. Some preferred GAT polypeptides of the invention further comprise amino acid sequences wherein the amino acid residues meet at least one of the following restrictions: (a) at position 36 the amino acid residue is M, L, or T; (b) at position 72 the amino acid residue is L or I; (c) at position 75 the amino acid residue is M or V; (d) at position 64 the amino acid residue is L, I, or F; (e) at position 88 the amino acid residue is T or S; and (f) at position 117 the amino acid residue is Y or F. Some preferred GAT polypeptides of the invention comprise an amino acid sequence wherein the amino acid residues meet at least one of the following additional restrictions: (a) at position 14 the amino acid residue is D; (b) at position 18 the amino acid residue is E; (c) at position 26 the amino acid residue is M or V; (e) at position 30 the amino acid residue is I; (f) at position 32 the amino acid residue is D; (g) at position 36 the amino acid residue is M or T; (i) at position 37 the amino acid residue is C; (j) at position 38 the amino acid residue is D; (j) at position 53 the amino acid residue is V; (k) at position 58 the amino acid residue is R; (1) at position 61 the amino acid residue is R; (m) at position 62 the amino acid residue is L; (n) at position 64 the amino acid residue is I or F; (o) at position 65 the amino acid residue is P; (p) at position 72 the amino acid residue is I; (q) at position 75 the amino acid residue is V; (r) at position 88 the amino acid residue is T; (s) at position 89 the amino acid residue is G; (t) at position 91 the amino acid residue is L; (u) at position 98 the amino acid residue is I; (v) at position 105 the amino acid residue I; (w) at position 112 the amino acid residue is A; (x) at position 124 the amino acid residue is G or C; (y) at position 128 the amino acid residue is D; (z) at position 140 the amino acid residue is M; (aa) at position 143 the amino acid residue is R; and (ab) at position 144 the amino acid residue is W. Some preferred GAT polypeptides of the invention comprise an amino acid sequence wherein of the amino acid residues that correspond to the positions specified in (a) through (ab) as described above, at least 80% conform to the amino acid residue restrictions specified in (a) through (ab). Some preferred GAT polypeptides of the invention have an amino acid sequence that comprises amino acid residues at least one of which meets the following additional restrictions: (a) at position 41 the amino acid residue is H; (b) at position 138 the amino acid residue is H; (c) at position 34 the amino acid residue is N; and (d) at position 55 the amino acid residue is S. Some preferred GAT polypeptides of the invention comprise an amino acid sequence selected from the group consisting of: (a) an amino acid sequence that is at least 98% identical to SEQ LD NO:577; (b) an amino acid sequence that is at least 97%o identical to SEQ LD NO: 578; (c) an amino acid sequence that is at least 97% identical to SEQ LD NO:621 ; (d) an amino acid sequence that is at least 98% identical to SEQ LD NO:579; (e) an amino acid sequence that is at least 98% identical to SEQ LD NO:602; (f) an amino acid sequence that is at least 95% identical to SEQ LD NO:697; (g) an amino acid sequence that is at least 96% identical to SEQ LD NO: 721; (h) an amino acid sequence that is at least 97% identical to SEQ LD NO:613; (i) an amino acid sequence that is at least 89% identical to SEQ LD NO: 677; (j) an amino acid sequence that is at least 96% identical to SEQ LD NO:584; (k) an amino acid sequence that is at least 98% identical to SEQ LD NO:707; (1) an amino acid sequence that is at least 98% identical to SEQ LD NO:616; (m) an amino acid sequence that is at least 96% identical to SEQ LD NO:612;and (n) an amino acid sequence that is at least 98% identical to SEQ LD NO:590. Some preferred GAT polypeptides of the invention comprise an amino acid sequence selected from the group consisting of: (a) an amino acid sequence that is at least 98% identical to SEQ LD NO:577; (b) an amino acid sequence that is at least 97% identical to SEQ LD NO:578; (c) an amino acid sequence that is at least 97% identical to SEQ LD NO:621; (d) an amino acid sequence that is at least 98% identical to SEQ LD NO: 579; (e) an amino acid sequence that is at least 98% identical to SEQ LD NO:602; (f) an amino acid sequence that is at least 95% identical to SEQ LD NO: 697; (g) an amino acid sequence that is at least 96% identical to SEQ ED NO: 721 ; (h) an amino acid sequence that is at least 97% identical to SEQ LD NO:613; (i) an amino acid sequence that is at least 89% identical to SEQ LD NO: 677; (j) an amino acid sequence that is at least 96% identical to SEQ LD NO:584; (k) an amino acid sequence that is at least 98% identical to SEQ ED NO:707; (1) an amino acid sequence that is at least 98% identical to SEQ ID NO:616; (m) an amino acid sequence that is at least 96% identical to SEQ LD NO:612;and (n) an amino acid sequence that is at least 98% identical to SEQ LD NO: 590, wherein at least one of the following positions further conforms to the following restrictions: (i) at positions 18 and 38, there is a Z5 amino acid residue; (ii) at position 62, there is a ZI amino acid residue; (iii) at position 124, there is a Z6 amino acid residue; and (iv) at position 144, there is a Z2 amino acid residue, wherein: ZI is an amino acid residue selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid residue selected from the group consisting of F, W, and Y; Z5 is an amino acid residue selected from the group consisting of D and E; and Z6 is an amino acid residue selected from the group consisting of C, G, and P. Some preferred GAT polypeptides of the invention comprise an amino acid sequence selected from the group consisting of: (a) an amino acid sequence that is at least 98% identical to SEQ LD NO: 577; (b) an amino acid sequence that is at least 97% identical to SEQ LD NO:578; (c) an amino acid sequence that is at least 97% identical to SEQ LD NO:621; (d) an amino acid sequence that is at least 98% identical to SEQ LD NO:579; (e) an amino acid sequence that is at least 98% identical to SEQ LD NO:602; (f) an amino acid sequence that is at least 95% identical to SEQ LD NO:697; (g) an amino acid sequence that is at least 96% identical to SEQ LD NO: 721; (h) an amino acid sequence that is at least 97% identical tq SEQ LD NO:613; (i) an amino acid sequence that is at least 89% identical to SEQ LD NO: 677; (j) an amino acid sequence that is at least 96% identical to SEQ LD NO:584; (k) an amino acid sequence that is at least 98% identical to SEQ LD NO:707; (1) an amino acid sequence that is at least 98% identical to SEQ LD NO:616; (m) an amino acid sequence that is at least 96% identical to SEQ LD NO:612;and (n) an amino acid sequence that is at least 98% identical to SEQ LD NO:590, wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 90% conform to the following additional restrictions: (a) at positions 2, 4, 15, 19, 26, 28, 31, 45, 51, 54, 86, 90, 91, 97, 103, 105, 106, 114, 123, 129, 139, and/or 145 the amino acid residue is Bl; and (b) at positions 3, 5, 8, 10, 11, 14, 17, 24, 27, 32, 37, 47, 48, 49, 52, 57, 58, 61, 63, 68, 69, 79, 80, 82, 83, 89, 92, 100, 101, 104, 119, 120, 125, 126, 128, 131, and/or 143 the amino acid residue is B2; wherein Bl is an amino acid selected from the group consisting of A, I, L, M, F, W, Y, and V; and B2 is an amino acid selected from the group consisting of R, N, D, C, Q, E, G, H, K, P, S, and T. Some preferred GAT polypeptides of the invention comprise an amino acid sequence selected from the group consisting of: (a) an amino acid sequence that is at least 98% identical to SEQ ED NO:577; (b) an amino acid sequence that is at least 97% identical to SEQ LD NO:578; (c) an amino acid sequence that is at least 97% identical to SEQ LD NO:621; (d) an amino acid sequence that is at least 98% identical to SEQ LD NO:579; (e) an amino acid sequence that is at least 98% identical to SEQ ED NO:602; (f) an amino acid sequence that is at least 95% identical to SEQ LD NO: 697; (g) an amino acid sequence that is at least 96% identical to SEQ LD NO: 721; (h) an amino acid sequence that is at least 97% identical to SEQ LD NO:613; (i) an amino acid sequence that is at least 89% identical to SEQ LD NO:677; (j) an amino acid sequence that is at least 96% identical to SEQ LD NO:584; (k) an amino acid sequence that is at least 98% identical to SEQ LD NO:707; (1) an amino acid sequence that is at least 98% identical to SEQ LD NO:616; (m) an amino acid sequence that is at least 96% identical to SEQ ED NO:612;and (n) an amino acid sequence that is at least 98% identical to SEQ LD NO: 590, wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 80% conform to the following additional restrictions: (a) at positions 2, 4, 15, 19, 26, 28, 51, 54, 86, 90, 91, 97, 103, 105, 106, 114, 129, 139, and/or 145 the amino acid residue is ZI; (b) at positions 31 and/or 45 the amino acid residue is Z2; (c) at position 8 the amino acid residue is Z3; (d) at position 89 the amino acid residue is Z3 or Z6; (e) at positions 82, 92, 101 and/or 120 the amino acid residue is Z4; (f) at positions 3, 11, 27 and/or 79 the amino acid residue is Z5; (g) at position 18 the amino acid residue is Z4 or Z5; (h) at position 123 the amino acid residue is ZI or Z2; (i) at positions 12, 33, 35, 39, 53, 59, 112, 132, 135, 140, and/or 146 the amino acid residue is ZI or Z3; (j) at position 30 the amino acid residue is ZI; (k) at position 6 the amino acid residue is Z6; (1) at position 81 the amino acid residue is Z2 or Z4; (m) at position 113 the amino acid residue is Z3; (n) at position 138 the amino acid residue is Z4; (o) at position 142 the amino acid residue is Z2; (p) at positions 57 and/or 126 the amino acid residue is Z3 or Z4; (q) at position 5, 17, and 61 the amino acid residue is Z4; (r) at position 24 the amino acid residue is Z3; (s) at position 104 the amino acid residue is Z5; (t) at positions 52, and/or 69 the amino acid residue is Z3; (u) at positions 14 and/or 119 the amino acid residue is Z5; (v) at positions 10, 32, 63, and/or 83 the amino acid residue is Z5; (w) at positions 48 and/or 80 the amino acid residue is Z6; (x) at position 40 the amino acid residue is ZI or Z2; (y) at position 96 the amino acid residue is Z3 or Z5; (z) at position 65 the amino acid residue is Z3, Z4, or Z6; (aa) at positions 84 and/or 115 the amino acid residue is Z3; (ab) at position 93 the amino acid residue is Z4; (ac) at position 130 the amino acid residue is Z2; (ad) at position 58 the amino acid residue is Z3, Z4 or Z6; (ae) at position 47 the amino acid residue is Z4 or Z6; (af) at positions 49 and/or 100 the amino acid residue is Z3 or Z4; (ag) at position 68 the amino acid residue is Z4 or Z5; (ah) at position 143 the amino acid residue is Z4; (ai) at position 131 the amino acid residue is Z5; (aj) at positions 125 and/or 128 the amino acid residue is Z5; (ak) at position 67 the amino acid residue is Z3 or Z4; (al) at position 60 the amino acid residue is Z5; and (am) at position 37 the amino acid residue is Z4 or Z6; wherein ZI is an amino acid selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid selected from the group consisting of F, W, and Y; Z3 is an amino acid selected from the group consisting of N, Q, S, and T; Z4 is an amino acid selected from the group consisting of R, H, and K; Z5 is an amino acid selected from the group consisting of D and E; and Z6 is an amino acid selected from the group consisting of C, G, and P. Some preferred GAT polypeptides of the invention further comprise amino acid residues in the amino acid sequence that correspond to the positions specified in (a)-(am), wherein at least 90% conform to the amino acid residue restrictions specified in (a)-(am). Some preferred GAT polypeptides of the invention comprise amino acid residues in the amino acid sequence that correspond to the following positions wherein at least 90% conform to the following additional restrictions: (a) at positions 1, 7, 9, 13, 20, 36, 42, 46, 50, 56, 64, 70, 72, 75, 76, 78, 94, 98, 107, 110, 117, 118, 121, and/or 141 the amino acid residue is Bl; and (b) at positions 16, 21, 22, 23, 25, 29, 34, 41, 43, 44, 55, 66, 71, 73, 74, 77, 85, 87, 88, 95, 99, 102, 108, 109, 111, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is B2; wherein Bl is an amino acid selected from the group consisting of A, I, L, M, F, W, Y, and V; and B2 is an amino acid selected from the group consisting of R, N, D, C, Q, E, G, H, K, P, S, and T. Some preferred GAT polypeptides of the invention comprise amino acid residues in the amino acid sequence that correspond to the following positions wherein at least 90% conform to the following additional restrictions: (a) at positions 1, 7, 9, 13, 20, 42, 46, 50, 56, 64, 70, 72, 75, 76, 78, 94, 98, 107, 110, 117, 118, 121, and/or 141 the amino acid residue is Bl; and (b) at positions 16, 21, 22, 23, 25, 29, 34, 36, 41, 43, 44, 55, 66, 71, 73, 74, 77, 85, 87, 88, 95, 99,102, 108, 109, 111, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is B2; wherein Bl is an amino acid selected from the group consisting of A, I, L, M, F, W, Y, and V; and B2 is an amino acid selected from the group consisting of R, N, D, C, Q, E, G, H, K, P, S, and T. Some preferred GAT polypeptides of the invention comprise amino acid residues in the amino acid sequence that correspond to the following positions wherein at least 90% conform to the following additional restrictions: (a) at positions 1, 7, 9, 20, 42, 50, 72, 75, 76, 78, 94, 98, 110, 121, and/or 141 the amino acid residue is ZI; (b) at positions 13, 46, 56, 70, 107, 117, and/or 118 the amino acid residue is Z2; (c) at positions 23, 55, 71, 77, 88, and/or 109 the amino acid residue is Z3; (d) at positions 16, 21, 41, 73, 85, 99, and/or 111 the amino acid residue is Z4; (e) at positions 34 and/or 95 the amino acid residue is Z5; (f) at position 22, 25, 29, 43, 44, 66, 74, 87, 102, 108, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is Z6; wherein ZI is an amino acid selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid selected from the group consisting of F, W, and Y; Z3 is an amino acid selected from the group consisting of N, Q, S, and T; Z4 is an amino acid selected from the group consisting of R, H, and K; Z5 is an amino acid selected from the group consisting of D and E; and Z6 is an amino acid selected from the group consisting of C, G, and P. Some preferred GAT polypeptides of the invention further comprise an amino acid sequence wherein the amino acid residue at position 36 is selected from the group consisting of ZI and Z3. Some preferred GAT polypeptides of the invention further comprise an amino acid sequence wherein the amino acid residue at position 64 is selected from the group consisting of ZI and Z2. Some preferred GAT polypeptides of the invention comprise an amino acid sequence wherein of the amino acid residues that correspond to the following positions, at least 80% conform to the following additional restrictions: (a) at position 2 the amino acid residue is I or L; (b) at position 3 the amino acid residue is E; (c) at position 4 the amino acid residue is V or I; (d) at position 5 the amino acid residue is K; (e) at position 6 the amino acid residue is P; (f) at position 8 the amino acid residue is N; (g) at position 10 the amino acid residue is E; (h) at position 11 the amino acid residue is D or E; (i) at position 12 the amino acid residue is T; (j) at position 14 the amino acid residue is E or D; (k) at position 15 the amino acid residue is L; (1) at position 17 the amino acid residue is H; (m) at position 18 the amino acid residue is R, E or K; (n) at position 19 the amino acid residue is I or V; (o) at position 24 the amino acid residue is Q; (p) at position 26 the amino acid residue is M, L, V or I; (q) at position 27 the amino acid residue is E; (r) at position 28 the amino acid residue is A or V; (s) at position 30 the amino acid residue is M; (t) at position 31 the amino acid residue is Y or F; (u) at position 32 the amino acid residue is E or D; (v) at position 33 the amino acid residue is T or S; (w) at position 35 the amino acid residue is L; (x) at position 37 the amino acid residue is R, G, E or Q; (y) at position 39 the amino acid residue is A or S; (z) at position 40 the amino acid residue is F or L; (aa) at position 45 the amino acid residue is Y or F; (ab) at position 47 the amino acid residue is R or G; (ac) at position 48 the amino acid residue is G; (ad) at position 49 the amino acid residue is K, R, or Q; (ae) at position 51 the amino acid residue is I or V; (af) at position 52 the amino acid residue is S; (ag) at position 53 the amino acid residue is I or V; (ah) at position 54 the amino acid residue is A; (ai) at position 57 the amino acid residue is H or N; (aj) at position 58 the amino acid residue is Q, K, R or P; (ak) at position 59 the amino acid residue is A; (al) at position 60 the amino acid residue is E; (am) at position 61 the amino acid residue is H or R; (an) at position 63 the amino acid residue is E or D; (ao) at position 65 the amino acid residue is E, P or Q; (ap) at position 67 the amino acid residue is Q or R; (aq) at position 68 the amino acid residue is K or E; (ar) at position 69 the amino acid residue is Q; (as) at position 79 the amino acid residue is E; (at) at position 80 the amino acid residue is G; (au) at position 81 the amino acid residue is Y, H or F; (av) at position 82 the amino acid residue is R; (aw) at position 83 the amino acid residue is E or D; (ax) at position 84 the amino acid residue is Q; (ay) at position 86 the amino acid residue is A; (az) at position 89 the amino acid residue is G, T or S; (ba) at position 90 the amino acid residue is L; (bb) at position 91 the amino acid residue is L, I or V; (be) at position 92 the amino acid residue is R or K; (bd) at position 93 the amino acid residue is H; (be) at position 96 the amino acid residue is E or Q; (bf) at position 97 the amino acid residue is I; (bg) at position 100 the amino acid residue is K or N ; (bh) at position 101 the amino acid residue is K or R; (bi) at position 103 the amino acid residue is A or V; (bj) at position 104 the amino acid residue is D; (bk) at position 105 the amino acid residue is M, L or I; (bl) at position 106 the amino acid residue is L; (bm) at position 112 the amino acid residue is T or A; (bn) at position 113 the amino acid residue is S or T; (bo) at position 114 the amino acid residue is A; (bp) at position 115 the amino acid residue is S; (bq) at position 119 the amino acid residue is K or R; (br) at position 120 the amino acid residue is K or R; (bs) at position 123 the amino acid residue is F or L; (bt) at position 125 the amino acid residue is E; (bu) at position 126 the amino acid residue is Q or H; (bv) at position 128 the amino acid residue is E or D; (bw) at position 129 the amino acid residue is V or I; (bx) at position 130 the amino acid residue is F; (by) at position 131 the amino acid residue is D or E; (bx) at position 132 the amino acid residue is T; (ca) at position 135 the amino acid residue is V; (cb) at position 138 the amino acid residue is H; (cc) at position 139 the amino acid residue is I; (cd) at position 140 the amino acid residue is L or M; (ce) at position 142 the amino acid residue is Y; (cf) at position 143 the amino acid residue is K or R; (eg) at position 145 the amino acid residue is L or I; and (ch) at position 146 the amino acid residue is T. Some preferred GAT polypeptides of the invention comprise an amino acid sequence in which of the residues that correspond to the positions specified in (a) - (ch) above, at least 90% conform to the amino acid residue restrictions specified in (a) - (ch). Some preferred GAT polypeptides of the invention comprise an amino acid sequence selected from the group consisting of: (a) an amino acid sequence that is at least 98% identical to SEQ ID NO:577; (b) an amino acid sequence that is at least 97% identical to SEQ LD NO:578; (c) an amino acid sequence that is at least 97% identical to SEQ LD NO:621; (d) an amino acid sequence that is at least 98% identical to SEQ LD NO:579; (e) an amino acid sequence that is at least 98% identical to SEQ LD NO:602; (f) an amino acid sequence that is at least 95% identical to SEQ LD NO:697; (g) an amino acid sequence that is at least 96% identical to SEQ LD NO: 721; (h) an amino acid sequence that is at least 97% identical to SEQ LD NO:613; (i) an amino acid sequence that is at least 89% identical to SEQ ED NO: 677; (j) an amino acid sequence that is at least 96% identical to SEQ LD NO: 584; (k) an amino acid sequence that is at least 98% identical to SEQ ED NO:707; (1) an amino acid sequence that is at least 98% identical to SEQ LD NO:616; (m) an amino acid sequence that is at least 96% identical to SEQ LD NO:612;and (n) an amino acid sequence that is at least 98% identical to SEQ LD NO:590, further wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 80% conform to the following restrictions: (a) at positions 9, 76, 94 and 110 the amino acid residue is A; (b) at positions 29 and 108 the amino acid residue is C; (c) at position 34 the amino acid residue is D; (d) at position 95 the amino acid residue is E; (e) at position 56 the amino acid residue is F; (f) at positions 43, 44, 66, 74, 87, 102, 116, 122, 127 and 136 the amino acid residue is G; (g) at position 41 the amino acid residue is H; (h) at position 7 the amino acid residue is I; (i) at position 85 the amino acid residue is K; (j) at positions 20, 42, 50, 78 and 121 the amino acid residue is L; (k) at positions 1 and 141 the amino acid residue is M; (1) at positions 23 and 109 the amino acid residue is N; (m) at positions 22, 25, 133, 134 and 137 the amino acid residue is P; (n) at position 71 the amino acid residue is Q; (o) at positions 16, 21, 73, 99 and 111 the amino acid residue is R; (p) at position 55 the amino acid residue is S; (q) at position 77 the amino acid residue is T; (r) at position 107 the amino acid residue is W; and (s) at position 13, 46, 70 and 118 the amino acid residue is Y. Some preferred GAT polypeptides of the invention further comprise an amino acid sequence in which at least one of the following criteria is met: (a) at position 14 the amino acid residue is D; (b) at position 18 the amino acid residue is E; (c) at position 26 the amino acid residue is M or V; (e) at position 30 the amino acid residue is I; (f) at position 32 the amino acid residue is D; (g) at position 36 the amino acid residue is M or T; (i) at position 37 the amino acid residue is C; (j) at position 38 the amino acid residue is D; (j) at position 53 the amino acid residue is V; (k) at position 58 the amino acid residue is R; (1) at position 61 the amino acid residue is R; (m) at position 62 the amino acid residue is L; (n) at position 64 the amino acid residue is I or F; (o) at position 65 the amino acid residue is P; (p) at position 72 the amino acid residue is I; (q) at position 75 the amino acid residue is V; (r) at position 88 the amino acid residue is T; (s) at position 89 the amino acid residue is G; (t) at position 91 the amino acid residue is L; (u) at position 98 the amino acid residue is I; (v) at position 105 the amino acid residue I; (w) at position 112 the amino acid residue is A; (x) at position 124 the amino acid residue is G or C; (y) at position 128 the amino acid residue is D; (z) at position 140 the amino acid residue is M; (aa) at position 143 the amino acid residue is R; and (ab) at position 144 the amino acid residue is W. Some preferred GAT polypeptides of the invention further comprise an amino acid sequence wherein of the amino acid residues that correspond to the positions specified in (a) through (ab) as described above, at least 80% conform to the amino acid residue restrictions specified in (a) through (ab). Some preferred GAT polypeptides of the invention further comprise an amino acid sequence wherein the following conditions are also met: (a) at position 41 the amino acid residue is H; (b) at position 138 the amino acid residue is H; (c) at position 34 the amino acid residue is N; and (d) at position 55 the amino acid residue is S. Some preferred GAT polypeptides of the invention when optimally aligned with a reference amino acid sequence selected from the group consisting of SEQ LD NO: 300, 445, and 457 to generate a similarity score of at least 460 using the BLOSUM62 matrix, a gap existence penalty of 11, and a gap extension penalty of 1, have amino acid sequences such that one or more of the following positions conform to the following restrictions: (i) at positions 18 and 38, there is a Z5 amino acid residue; (ii) at position 62, there is a ZI amino acid residue; (iii) at position 124, there is a Z6 amino acid residue; and (iv) at position 144, there is a Z2 amino acid residue, wherein: ZI is an amino acid residue selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid residue selected from the group consisting of F, W, and Y; Z5 is an amino acid residue selected from the group consisting of D and E; and Z6 is an amino acid residue selected from the group consisting of C, G, and P. h certain of the aforementioned GAT polypeptides, the amino acid residue in the polypeptide corresponding to position 28 is V, I or A. Valine or isoleucine at position 28 generally correlates with reduced K , while alanine at that position generally correlates with increased kcat. Threonine at position 89 and arginine at position 58 generally correlates with reduced KM- Other preferred GAT polypeptides are characterized by having 127 (i.e., an I at position 27), M30, D34, S35, R37, S39, H41, G48, K49, N57, Q58, P62, T62, Q65, Q67, K68, V75, E83, S89, A96, E96, R101, T112, A114, K119, K120, E128, V129, D131, T131, V132, V134, V135, H138, R144, 1145, or T146, or any combination thereof. Some preferred GAT polypeptides of the invention comprise an amino acid sequence selected from the group consisting of SEQ LD NO: 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 621, 623, 625, 627, 629, 631, 633, 635, 637, 639, 641, 643, 645, 647, 649, 651, 653, 655, 657, 659, 661, 663, 665, 667, 669, 671, 673, 675, 677, 679, 681, 683, 685, 687, 689, 691, 693, 695, 697, 699, 701, 703, 705, 707, 709, 711, 713, 715, 717, 719,721, 723, 725, 727, 729, 731, 733, 735, 737, 739, 741, 743, 745, 747, 749, 751, 753, 755, 757, 759, 761, 763, 765, 767, 769, 771, 773, 775, 777, 779, 781, 783, 785, 787, 789, 791, 793, 795, 797, 799, 801, 803, 805, 807, 809, 811, 813, 815, 817, 819, 821, 823 and 825. h another aspect, the invention provides an isolated or recombinant ' polypeptide that comprises at least 20, or alternatively, at least 50, at least 75, at least 100, at least 125, at least 130, at least 135, at least 140, at least 141, at least 142, at least 143, at least 144 or at least 145 contiguous amino acids of an amino acid sequence selected from the groups consisting of: (a) an amino acid sequence that is at least 96% identical to SEQ LD NO:919 (such as, for example, SEQ LD NO:917, 919, 921, 923, 925, 927, 833, 835, 839, 843, 845, 859, 863, 873, 877, 891, 895, 901, 905, 907, 913, 915, or 950); (b) an amino acid sequence that is at least 97% identical to SEQ LD NO:929 (such as, for example, SEQ LD NO:929, 931, 835, 843, 849, or 867); (c) an amino acid sequence that is at least 98% identical to SEQ ED NO:847 (such as, for example, SEQ ED NO:845 or 847); (d) an amino acid sequence that is at least 98 > identical to SEQ LD NO: 851; (e) an amino acid sequence that is at least 98% identical to SEQ LD NO:853; (f) an amino acid sequence that is at least 98% identical to SEQ LD NO:855 (such as, for example, SEQ LD NO:835 or 855); (g) an amino acid sequence that is at least 98%) identical to SEQ ED NO:857; (h) an amino acid sequence that is at least 98% identical to SEQ LD NO:861 (such as, for example, SEQ LD NO:839, 861, or 883); (i) an amino acid sequence that is at least 98% identical to SEQ LD NO:871; (j) an amino acid sequence that is at least 98% identical to SEQ LD NO: 875; (k) an amino acid sequence that is at least 98% identical to SEQ LD NO: 881; (1) an amino acid sequence that is at least 98% identical to SEQ LD NO: 885 (such as, for example, SEQ LD NO:845 or 885); (m) an amino acid sequence that is at least 98%> identical to SEQ LD NO:887; (n) an amino acid sequence that is at least 98% identical to SEQ LD NO:889 (such as, for example, SEQ LD NO: 863, 889, 891, or 903); (o) an amino acid sequence that is at least 98% identical to SEQ LD NO: 893; (p) an amino acid sequence that is at least 98% identical to SEQ LD NO:897; (q) an amino acid sequence that is at least 98% identical to SEQ LD NO: 899; (r) an amino acid sequence that is at least 98% identical to SEQ LD NO:909 (such as, for example, SEQ LD NO:883 or 909); (s) an amino acid sequence that is at least 98% identical to SEQ LD NO:911; (t) an amino acid sequence that is at least 99% identical to
SEQ LD NO: 837; (u) an amino acid sequence that is at least 99% identical to SEQ ED NO: 841; (v) an amino acid sequence that is at least 99% identical to SEQ LD NO:865; (w) an amino acid sequence that is at least 99% identical to SEQ LD NO:869; and (x) an amino acid sequence that is at least 99% identical to SEQ LD NO:879. In another aspect, the invention provides an isolated or recombinant polypeptide that comprises at least 20, or alternatively, at least 50, at least 75, at least 100, at least 125, at least 130, at least 135, at least 140, at least 141, at least 142, at least 143, at least 144 or at least 145 contiguous amino acids of an amino acid sequence that is at least 95% identical to SEQ LD NO:929 and which comprises a Gly or an Asn residue at the amino acid position corresponding to position 33 of SEQ ED NO:929 (such as, for example, SEQ LD NO:837, 849, 893, 897, 905, 921, 927, 929 or 931). In another aspect, the invention provides a polypeptide comprising residues 2-146 of an amino acid sequence selected from the group consisting of SEQ LD NO: 833, 835, 837, 839, 841, 843, 845, 847, 849, 851, 853, 855, 857, 859, 861, 863, 865, 867, 869, 871, 873, 875, 877, 879, 881, 883, 885, 887, 889, 891, 893, 895, 897, 899, 901, 903, 905, 907, 909, 911, 913, 915, 917, 919, 921, 923, 925, 927, 929, 931, 953, 954, 955, 956, 957, 958, 959, 960, 961, 962, 963, 964, 965, 966, 967, 968, 969, 970, 971, and 972. h some embodiments of the invention, the amino acid sequence of the polypeptide comprises Met, Met- Ala, or Met- Ala- Ala on the N-terminal side of the amino acid corresponding to position 2 of the reference amino acid sequence. Some preferred GAT polypeptides of the invention comprise an amino acid sequence selected from the group consisting of SEQ LD NO: 833, 835, 837, 839, 841, 843, 845, 847, 849, 851, 853, 855, 857, 859, 861, 863, 865, 867, 869, 871, 873, 875, 877, 879, 881, 883, 885, 887, 889, 891, 893, 895, 897, 899, 901, 903, 905, 907, 909, 911, 913, 915, 917, 919, 921, 923, 925, 927, 929, 931, 946, 948, and 950. The invention further provides preferred GAT polypeptides that are characterized by a combination of the foregoing amino acid residue position restrictions. In addition, the invention provides GAT polynucleotides encoding the preferred GAT polypeptides described above, and complementary nucleotide sequences thereof. Some aspects of the invention pertain particularly to the subset ofany of the above-described categories of GAT polypeptides having GAT activity, as described herein. These GAT polypeptides are preferred, for example, for use as agents for conferring glyphosate resistance upon a plant. Examples of desired levels of GAT activity are described herein. h one aspect, the GAT polypeptides comprise an amino acid sequence encoded by a recombinant or isolated form of naturally occurring nucleic acids isolated from a natural source, e.g., a bacterial strain. Wild-type polynucleotides encoding such GAT polypeptides may be specifically screened for by standard techniques known in the art. The polypeptides defined by SEQ LD NO:6 - 10, for example, were discovered by expression cloning of sequences from Bacillus strains exhibiting GAT activity, as described in more detail below. The invention also includes isolated or recombinant polypeptides which are encoded by an isolated or recombinant polynucleotide comprising a nucleotide sequence which hybridizes under stringent conditions over substantially the entire length of a nucleotide sequence selected from the group consisting of SEQ LD NO: 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565, 566, 567, 620, 622, 624, 626, 628, 630, 632, 634, 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, 668, 670, 672, 674, 676, 678, 680, 682, 684, 686, 688, 690, 692, 694, 696, 698, 700, 702, 704, 706, 708, 710, 712, 714, 716, 718, 720, 722, 724, 726, 728, 730, 732, 734, 736, 738, 740, 742, 744, 746, 748, 750, 752, 754, 756, 758, 760, 762, 764, 768, 770, 772, 774, 776, 778, 780, 782, 784, 786, 788, 790, 792, 794, 796, 798, 800, 802, 804, 806, 808, 810, 812, 814, 816, 818, 820, 822, and 824, their complements, and nucleotide sequences encoding an amino acid sequence selected from the group consisting of SEQ LD NO: 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 621, 623, 625, 627, 629, 631, 633, 635, 637, 639, 641, 643, 645, 647, 649, 651, 653, 655, 657, 659, 661, 663, 665, 667, 669, 671, 673, 675, 677, 679, 681, 683, 685, 687, 689, 691, 693, 695, 697, 699,701, 703, 705, 707, 709, 711, 713, 715, 717, 719,721, 723, 725, 727, 729, 731, 733, 735, 737, 739, 741, 743, 745, 747, 749, 751, 753, 755, 757, 759, 761, 763, 765, 767, 769, 771, 773, 775, 777, 779, 781, 783, 785, 787, 789, 791, 793, 795, 797, 799, 801, 803, 805, 807, 809, 811, 813, 815, 817, 819, 821, 823, and 825, including their complements. The invention also includes isolated or recombinant polypeptides which are encoded by an isolated or recombinant polynucleotide comprising a nucleotide sequence which hybridizes under stringent conditions over substantially the entire length of a nucleotide sequence selected from the group consisting of SEQ LD NO: 832, 834, 836, 838, 840, 842, 844, 846, 848, 850, 852, 854, 856, 858, 860, 862, 864, 866, 868, 870, 872, 874, 876, 878, 880, 882, 884, 886, 888, 890, 892, 894, 896, 898, 900, 902, 904, 906, 908, 910, 912, 914, 916, 918, 920, 922, 924, 926, 928, and 930, their complements, and nucleotide sequences encoding an amino acid sequence selected from the group consisting of SEQ LD NO: 833, 835, 837, 839, 841, 843, 845, 847, 849, 851, 853, 855, 857, 859, 861, 863, 865, 867, 869, 871, 873, 875, 877, 879, 881, 883, 885, 887, 889, 891, 893, 895, 897, 899, 901, 903, 905, 907, 909, 911, 913, 915, 917, 919, 921, 923, 925, 927, 929, 931, 953, 954, 955, 956, 957, 958, 959, 960, 961, 962, 963, 964, 965, 966, 967, 968, 969, 970, 971, and 972. The invention further includes any polypeptide having GAT activity that is encoded by a fragment of any of the GAT-encoding polynucleotides described herein. The invention also provides fragments of GAT polypeptides that can be spliced together to form a functional GAT polypeptide. Splicing can be accomplished in vitro or in vivo, and can involve cis- or trans-splicing (i.e., intramolecular or intermolecular splicing). The fragments themselves can, but need not, have GAT activity. For example, two or more segments of a GAT polypeptide can be separated by inteins; removal of the intein sequence by cis- splicing results in a functional GAT polypeptide. In another example, an encrypted GAT polypeptide can be expressed as two or more separate fragments; trans-splicing of these segments results in recovery of a functional GAT polypeptide. Various aspects of cis- and trans-splicing, gene encryption, and introduction of intervening sequences are described in more detail in U.S. Patent Application Serial Nos. 09/517,933 and 09/710,686, both of which are incorporated by reference herein in their entirety. hi general, the invention includes any polypeptide encoded by a modified GAT polynucleotide derived by mutation, recursive sequence recombination, and/or diversification of the polynucleotide sequences described herein. In some aspects of the invention, a GAT polypeptide is modified by single or multiple amino acid substitutions, a deletion, an insertion, or a combination of one or more of these types of modifications. Substitutions can be conservative or non- conservative, can alter function or not, and can add new function. Insertions and deletions can be substantial, such as the case of a truncation of a substantial fragment of the sequence, or in the fusion of additional sequence, either internally or at N or C terminal, h some embodiments of the invention, a GAT polypeptide is part of a fusion protein comprising a functional addition such as, for example, a secretion signal, a chloroplast transit peptide, a purification tag, or any of the numerous other functional groups that will be apparent to the skilled artisan and which are described in more detail elsewhere in this specification. Polypeptides of the invention may contain one or more modified amino acid. The presence of modified amino acids may be advantageous in, for example, (a) increasing polypeptide in vivo half-life, (b) reducing or increasing polypeptide antigenicity, and (c) increasing polypeptide storage stability. Amino acid(s) are modified, for example, co-translationally or post-translationally during recombinant production (e.g., N-linked glycosylation at N-X-S/T motifs during expression in mammalian cells) or modified by synthetic means. Non-limiting examples of a modified amino acid include a glycosylated amino acid, a sulfated amino acid, a prenlyated (e.g., farnesylated, geranylgeranylated) amino acid, an acetylated amino acid, an acylated amino acid, a PEG-ylated amino acid, a biotinylated amino acid, a carboxylated amino acid, a phosphorylated amino acid, and the like. References adequate to guide one of skill in the modification of amino acids are replete throughout the literature. Example protocols are found in Walker (1998) Protein Protocols on CD-ROM (Humana Press, Towata, NJ). Recombinant methods for producing and isolating GAT polypeptides of the invention are described herein, hi addition to recombinant production, the polypeptides may be produced by direct peptide synthesis using solid-phase techniques (e.g., Stewart et al. (1969) Solid-Phase Peptide Synthesis (WH Freeman Co, San Francisco); and Merrifield (1963) J. Am. Chem. Soc. 85: 2149-2154). Peptide synthesis may be performed using manual techniques or by automation. Automated synthesis may be achieved, for example, using Applied Biosystems 431 A Peptide Synthesizer (Perkin Elmer, Foster City, CA) in accordance with the instructions provided by the manufacturer. For example, subsequences may be chemically synthesized separately and combined using chemical methods to provide full-length GAT polypeptides. Peptides can also be ordered from a variety of sources. In another aspect of the invention, a GAT polypeptide of the invention is used to produce antibodies which have, e.g., diagnostic uses, for example, related to the activity, distribution, and expression of GAT polypeptides, for example, in various tissues of a transgenic plant. GAT homologue polypeptides for antibody induction do not require biological activity; however, the polypeptide or oligopeptide must be antigenic. Peptides used to induce specific antibodies may have an amino acid sequence consisting of at least 10 amino acids, preferably at least 15 or 20 amino acids. Short stretches of a GAT polypeptide may be fused with another protein, such as keyhole limpet hemocyaniii, and an antibody produced against the chimeric molecule. Methods of producing polyclonal and monoclonal antibodies are known to those of skill in the art, and many antibodies are available. See, e.g., Coligan (1991) Current Protocols in Immunology (Wiley/Greene, NY); Harlow and Lane (1989) Antibodies: A Laboratory Manual (Cold Spring Harbor Press, NY); Stites et al. (eds.) Basic and Clinical Immunology, 4th ed. (Lange Medical Publications, Los Altos, CA), and references cited therein; Goding (1986) Monoclonal Antibodies: Principles and Practice, 2d ed. (Academic Press, New York, NY); and Kohler and Milstein (1975) Nature 256: 495-497. Other suitable techniques for antibody preparation include selection of libraries of recombinant antibodies in phage or similar vectors. See, Huse et al. (1989) Science 246: 1275-1281; and Ward et al. (1989) Nature 341: 544-546. Specific monoclonal and polyclonal antibodies and antisera will usually bind with a KD of at least about 0.1 μM, preferably at least about 0.01 μM or better, and most typically and preferably, 0.001 μM or better. Additional details antibody of production and engineering techniques can be found in Borrebaeck, ed. (1995) Antibody Engineering, 2nd ed. (Freeman and Company, NY); McCafferty et al. (1996) Antibody Engineering, A Practical Approach (IRL at Oxford Press, Oxford, England); and Paul (1995) Antibody Engineering Protocols (Humana Press, Towata, NJ). Sequence Variations GAT polypeptides of the present invention include conservatively modified variations of the sequences disclosed herein as SEQ LD NO: 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 621, 623, 625, 627, 629, 631, 633, 635, 637, 639, 641, 643, 645, 647, 649, 651, 653, 655, 657, 659, 661, 663, 665, 667, 669, 671, 673, 675, 677, 679, 681, 683, 685, 687, 689, 691, 693, 695, 697, 699,701, 703, 705, 707, 709, 711, 713, 715, 717, 719,721, 723, 725, 727, 729, 731, 733, 735, 737, 739, 741, 743, 745, 747, 749, 751, 753, 755, 757, 759, 761, 763, 765, 767, 769, 771, 773, 775, 777, 779, 781, 783, 785, 787, 789, 791, 793, 795, 797, 799, 801, 803, 805, 807, 809, 811, 813, 815, 817, 819, 821, 823, 825, 833, 835, 837, 839, 841, 843, 845, 847, 849, 851, 853, 855, 857, 859, 861, 863, 865, 867, 869, 871, 873, 875, 877, 879, 881, 883, 885, 887, 889, 891, 893, 895, 897, 899, 901, 903, 905, 907, 909, 911, 913, 915, 917, 919, 921, 923, 925, 927, 929, 931, 953, 954, 955, 956, 957, 958, 959, 960, 961, 962, 963, 964, 965, 966, 967, 968, 969, 970, 971, and 972. Such conservatively modified variations comprise substitutions, additions or deletions which alter, add or delete a single amino acid or a small percentage of amino acids (typically less than about 5%, more typically less than about 4%, 2%, or 1%) in any of SEQ LD NO: 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 621, 623, 625, 627, 629, 631, 633, 635, 637, 639, 641, 643, 645, 647, 649, 651, 653, 655, 657, 659, 661, 663, 665, 667, 669, 671, 673, 675, 677, 679, 681, 683, 685, 687, 689, 691, 693, 695, 697, 699,701, 703, 705, 707, 709, 711, 713, 715, 717, 719,721, 723, 725, 727, 729, 731, 733, 735, 737, 739, 741, 743, 745, 747, 749, 751, 753, 755, 757, 759, 761, 763, 765, 767, 769, 771, 773, 775, 777, 779, 781, 783, 785, 787, 789, 791, 793, 795, 797, 799, 801, 803, 805, 807, 809, 811, 813, 815, 817, 819, 821, 823, 825, 833, 835, 837, 839, 841, 843, 845, 847, 849, 851, 853, 855, 857, 859, 861, 863, 865, 867, 869, 871, 873, 875, 877, 879, 881, 883, 885, 887, 889, 891, 893, 895, 897, 899, 901, 903, 905, 907, 909, 911, 913, 915, 917, 919, 921, 923, 925, 927, 929, 931, 953, 954, 955, 956, 957, 958, 959, 960, 961, 962, 963, 964, 965, 966, 967, 968, 969, 970, 971, and 972. For example, a conservatively modified variation (e.g., deletion) of the 146 amino acid polypeptide identified herein as SEQ LD NO:6 will have a length of at least 140 amino acids, preferably at least 141 amino acids, more preferably at least 144 amino acids, and still more preferably at least 145 amino acids, corresponding to a deletion of less than about 5%, 4%, 2% or about 1%, or less of the polypeptide sequence. Another example of a conservatively modified variation (e.g., a
"conservatively substituted variation") of the polypeptide identified herein as SEQ ED NO:6 will contain "conservative substitutions," accordmg to the six substitution groups set forth in Table 2, in up to about 7 residues (i.e., less than about 5%) of the 146 amino acid polypeptide. The GAT polypeptide sequence homologues of the invention, including conservatively substituted sequences, can be present as part of larger polypeptide sequences such as occur in a GAT polypeptide, in a GAT fusion with a signal sequence, e.g., a chloroplast targeting sequence, or upon the addition of one or more domains for purification of the protein (e.g., poly his segments, FLAG tag segments, etc.). In the latter case, the additional functional domains have little or no effect on the activity of the GAT portion of the protein, or where the additional domains can be removed by post synthesis processing steps such as by treatment with a protease.
Defining; Polypeptides by Immunoreactivity Because the polypeptides of the invention provide a new class of enzymes with a defined activity, i.e., the acetylation and acylation of glyphosate, the polypeptides also provide new structural features which can be recognized, e.g., in immunological assays. The generation of antisera which specifically binds the polypeptides of the invention, as well as the polypeptides which are bound by such antisera, are a feature of the invention. The invention includes GAT polypeptides that specifically bind to or that are specifically immunoreactive with an antibody or antisera generated against an immunogen comprising an amino acid sequence selected from one or more of SEQ LD NO: 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 621, 623, 625, 627, 629, 631, 633, 635, 637, 639, 641, 643, 645, 647, 649, 651, 653, 655, 657, 659, 661, 663, 665, 667, 669, 671, 673, 675, 677, 679, 681, 683, 685, 687, 689, 691, 693, 695, 697, 699,701, 703, 705, 707, 709, 711, 713, 715, 717, 719,721, 723, 725, 727, 729, 731, 733, 735, 737, 739, 741, 743, 745, 747, 749, 751, 753, 755, 757, 759, 761, 763, 765, 767, 769, 771, 773, 775, 777, 779, 781, 783, 785, 787, 789, 791, 793, 795, 797, 799, 801, 803, 805, 807, 809, 811, 813, 815, 817, 819, 821, 823, 825, 833, 835, 837, 839, 841, 843, 845, 847, 849, 851, 853, 855, 857, 859, 861, 863, 865, 867, 869, 871, 873, 875, 877, 879, 881, 883, 885, 887, 889, 891, 893, 895, 897, 899, 901, 903, 905, 907, 909, 911, 913, 915, 917, 919, 921, 923, 925, 927, 929, 931, 953, 954, 955, 956, 957, 958, 959, 960, 961, 962, 963, 964, 965, 966, 967, 968, 969, 970, 971, and 972. To eliminate cross-reactivity with other GAT homologues, the antibody or antisera is subtracted with available related proteins, such as those represented by the proteins or peptides corresponding to GenBank accession numbers available as of the filing date of this application, and exemplified by CAA70664, Z99109 and Y09476. Where the accession number corresponds to a nucleic acid, a polypeptide encoded by the nucleic acid is generated and used for antibody/antisera subtraction purposes. Figure 3 tabulates the relative identity between exemplary GAT sequences and the most closely related sequence available in Genbank, Yitl. The function of native Yitl has yet to be elucidated, but the enzyme has been shown to possess detectable GAT activity. In one typical format, the immunoassay uses a polyclonal antiserum which was raised against one or more polypeptides comprismg one or more of the sequences corresponding to one or more of SEQ ID NO: 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 621, 623, 625, 627, 629, 631, 633, 635, 637, 639, 641, 643, 645, 647, 649, 651, 653, 655, 657, 659, 661, 663, 665, 667, 669, 671, 673, 675, 677, 679, 681, 683, 685, 687, 689, 691, 693, 695, 697, 699, 701, 703, 705, 707, 709, 711, 713, 715, 717, 719,721, 723, 725, 727, 729, 731, 733, 735, 737, 739, 741, 743, 745, 747, 749, 751, 753, 755, 757, 759, 761, 763, 765, 767, 769, 771, 773, 775, 777, 779, 781, 783, 785, 787, 789, 791, 793, 795, 797, 799, 801, 803, 805, 807, 809, 811, 813, 815, 817, 819, 821, 823, 825, 833, 835, 837, 839, 841, 843, 845, 847, 849, 851, 853, 855, 857, 859, 861, 863, 865, 867, 869, 871, 873, 875, 877, 879, 881, 883, 885, 887, 889, 891, 893, 895, 897, 899, 901, 903, 905, 907, 909, 911, 913, 915, 917, 919, 921, 923, 925, 927, 929, 931, 953, 954, 955, 956, 957, 958, 959, 960, 961, 962, 963, 964, 965, 966, 967, 968, 969, 970, 971, and 972, or a substantial subsequence thereof (i.e., at least about 30%) of the full length sequence provided). The full set of potential polypeptide immunogens derived from SEQ LD NO: 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 621, 623, 625, 627, 629, 631, 633, 635, 637, 639, 641, 643, 645, 647, 649, 651, 653, 655, 657, 659, 661, 663, 665, 667, 669, 671, 673, 675, 677, 679, 681, 683, 685, 687, 689, 691, 693, 695, 697, 699,701, 703, 705, 707, 709, 711, 713, 715, 717, 719,721, 723, 725, 727, 729, 731, 733, 735, 737, 739, 741, 743, 745, 747, 749, 751, 753, 755, 757, 759, 761, 763, 765, 767, 769, 771, 773, 775, 777, 779, 781, 783, 785, 787, 789, 791, 793, 795, 797, 799, 801, 803, 805, 807, 809, 811, 813, 815, 817, 819, 821, 823, 825, 833, 835, 837, 839, 841, 843, 845, 847, 849, 851, 853, 855, 857, 859, 861, 863, 865, 867, 869, 871, 873, 875, 877, 879, 881, 883, 885, 887, 889, 891, 893, 895, 897, 899, 901, 903, 905, 907, 909, 911, 913, 915, 917, 919, 921, 923, 925, 927, 929, 931, 953, 954, 955, 956, 957, 958, 959, 960, 961, 962, 963, 964, 965, 966, 967, 968, 969, 970, 971, and 972 are collectively referred to below as "the immunogenic polypeptide(s)." The resulting antisera is optionally selected to have low cross-reactivity against other related sequences and any such cross-reactivity is removed by immunoabsorbtion with one or more of the related sequences, prior to use of the polyclonal antiserum in the immunoassay. In order to produce antisera for use in an immunoassay, one or more of the immunogenic polypeptide(s) is produced and purified as described herein. For example, recombinant protein may be produced in a bacterial cell line. An inbred strain of mice (used in this assay because results are more reproducible due to the virtual genetic identity of the mice) is immunized with the immunogenic polypeptide(s) in combination with a standard adjuvant, such as Freund's adjuvant, using a standard mouse immunization protocol (see, Harlow and Lane (1988) Antibodies, A Laboratory Manual (Cold Spring Harbor Publications, New York), for a standard description of antibody generation, immunoassay formats and conditions that can be used to determine specific immunoreactivity).
Alternatively, one or more synthetic or recombinant polypeptides derived from the sequences disclosed herein is conjugated to a carrier protein and used as an immunogen. Polyclonal sera are collected and titered against the immunogenic polypeptide(s) in an immunoassay, for example, a solid phase immunoassay with one or more of the immunogenic proteins immobilized on a solid support. Polyclonal antisera with a titer of 106 or greater are selected, pooled and subtracted with related polypeptides, e.g., those identified from GENBANK as noted, to produce subtracted, pooled, titered polyclonal antisera. The subtracted, pooled, titered polyclonal antisera are tested for cross reactivity against the related polypeptides. Preferably at least two of the immunogenic GATs are used in this determination, preferably in conjunction with at least two related polypeptides, to identify antibodies which are specifically bound by the immunogenic polypeptide(s). In this comparative assay, discriminatory binding conditions are determined for the subtracted, titered polyclonal antisera which result in at least about a 5-10 fold higher signal to noise ratio for binding of the titered polyclonal antisera to the immunogenic GAT polypeptides as compared to binding to the related polypeptides. That is, the stringency of the binding reaction is adjusted by the addition of non-specific competitors such as albumin or non-fat dry milk, or by adjusting salt conditions, temperature, or the like. These binding conditions are used in subsequent assays for determining whether a test polypeptide is specifically bound by the pooled, subtracted polyclonal antisera. In particular, a test polypeptide which shows at least a 2-5 fold higher signal to noise ratio than the control polypeptide under discriminatory binding conditions, and at least about a V signal to noise ratio as compared to the immunogemc polypeptide(s), shares substantial structural similarity with the immunogenic polypeptide(s) as compared to known GAT, and is, therefore a polypeptide of the invention. In another example, immunoassays in the competitive binding format are used for the detection of a test polypeptide. For example, as noted, cross-reacting antibodies are removed from the pooled antisera mixture by immunoabsorption with the control GAT polypeptides. The immunogenic polypeptide(s) are then immobilized to a solid support which is exposed to the subtracted pooled antisera. Test proteins are added to the assay to compete for binding to the pooled, subtracted antisera. The ability of the test protein(s) to compete for binding to the pooled, subtracted antisera as compared to the immobilized protein(s) is compared to the ability of the immunogenic polypeptide(s) added to the assay to compete for binding (the immunogenic polypeptide(s) compete effectively with the immobilized immunogenic polypeptide(s) for binding to the pooled antisera). The percent cross-reactivity for the test proteins is calculated, using standard calculations. In a parallel assay, the ability of the control proteins to compete for binding to the pooled, subtracted antisera is optionally determined as compared to the ability of the immunogenic polypeptide(s) to compete for binding to the antisera. Again, the percent cross-reactivity for the control polypeptides is calculated, using standard calculations. Where the percent cross-reactivity is at least 5-10x higher for the test polypeptides, the test polypeptides are said to specifically bind the pooled, subtracted antisera. In general, the immunoabsorbed and pooled antisera can be used in a competitive binding immunoassay as described herein to compare any test polypeptide to the immunogenic polyp eptide(s). In order to make this comparison, the two polypeptides are each assayed at a wide range of concentrations and the amount of each polypeptide required to inhibit 50% of the binding of the subtracted antisera to the immobilized protein is determined using standard techniques. If the amount of the test polypeptide required is less than twice the amount of the immunogenic polypeptide(s) that is required, then the test polypeptide is said to specifically bind to an antibody generated to the immunogenic polypeptide(s), provided the amount is at least about 5-1 Ox higher for a control polypeptide. As a final determination of specificity, the pooled antisera is optionally fully immunosorbed with the immunogenic polypeptide(s) (rather than the control polypeptides) until little or no binding of the subtracted, pooled antisera to the immunogenic polypeptide(s) is detectable. This fully immunosorbed antisera is then tested for reactivity with the test polypeptide. If little or no reactivity is observed (i.e., no more than 2x the signal to noise ratio observed for binding of the fully immunosorbed antisera to the immunogenic polypeptide(s)), then the test polypeptide is specifically bound by the antisera elicited by the immunogemc polypeptide(s). GLYPHOSATE-N-ACETYLTRANSFERASE POLYNUCLEOTIDES hi one aspect, the invention provides a novel family of isolated or recombinant polynucleotides referred to herein as "glyphosate-N-acetyltransferase polynucleotides" or "GAT polynucleotides." GAT polynucleotide sequences are characterized by the ability to encode a GAT polypeptide. In general, the invention includes any nucleotide sequence that encodes any of the novel GAT polypeptides described herein, h some aspects of the invention, a GAT polynucleotide that encodes a GAT polypeptide with GAT activity is preferred, h one aspect, the GAT polynucleotides comprise recombinant or isolated forms of naturally occurring nucleic acids isolated from an organism, e.g., a bacterial strain. Exemplary GAT polynucleotides, e.g., SEQ ED NO:l - 5, were discovered by expression cloning of sequences from Bacillus strains exhibiting GAT activity. Briefly, a collection of approximately 500 Bacillus and Pseudomonas strains were screened for native ability to N-acetylate glyphosate. Strains were grown in LB overnight, harvested by centrifugation, permeabilized in dilute toluene, and then washed and resuspended in a reaction mix containing buffer, 5 mM glyphosate, and 200 μM acetyl-CoA. The cells were incubated in the reaction mix for between 1 and 48 hours, at which time an equal volume of methanol was added to the reaction. The cells were then pelleted by centrifugation and the supernatant was filtered before analysis by parent ion mode mass spectrometry. The product of the reaction was positively identified as N- acetylglyphosate by comparing the mass spectrometry profile of the reaction mix to an N-acetylglyphosate standard as shown in Figure 2. Product detection was dependent on inclusion of both substrates (acetyl CoA and glyphosate) and was abolished by heat denaturing the bacterial cells. Individual GAT polynucleotides were then cloned from the identified strains by functional screening. Genomic DNA was prepared and partially digested with Sau3Al enzyme. Fragments of approximately 4 kb were cloned into an E. coli expression vector and transformed into electrocompetent E. coli. Individual clones exlubiting GAT activity were identified by mass spectrometry following a reaction as described previously except that the toluene wash was replaced by permeabilization with PMBS. Genomic fragments were sequenced and the putative GAT polypeptide-encoding open reading frame was identified. Identity of the GAT gene was confirmed by expression of the open reading frame in E. coli and detection of high levels of N-acetylglyphosate produced from reaction mixtures. In another aspect of the invention, GAT polynucleotides are produced by diversifying, e.g., recombining and/or mutating one or more naturally occurring, isolated, or recombinant GAT polynucleotides. As described in more detail elsewhere herein, it is often possible to generate diversified GAT polynucleotides encoding GAT polypeptides with superior functional attributes, e.g., increased catalytic function, increased stability, or higher expression level, than a GAT polynucleotide used as a substrate or parent in the diversification process. The polynucleotides of the invention have a variety of uses in, for example: recombinant production (i.e., expression) of the GAT polypeptides of the invention; as transgenes (e.g., to confer herbicide resistance in transgenic plants); as selectable markers for transformation and plasmid maintenance; as immunogens; as diagnostic probes for the presence of complementary or partially complementary nucleic acids (including for detection of natural GAT coding nucleic acids); as substrates for further diversity generation, e.g., recombination reactions or mutation reactions to produce new and/or improved GAT homologues, and the like. It is important to note that certain specific, substantial and credible utilities of GAT polynucleotides do not require that the polynucleotide encode a polypeptide with substantial GAT activity. For example, GAT polynucleotides that do not encode active enzymes can be valuable sources of parental polynucleotides for use in diversification procedures to arrive at GAT polynucleotide variants, or non-GAT polynucleotides, with desirable functional properties (e.g., high kcat or kcat/Km, low Km, high stability towards heat or other environmental factors, high transcription or translation rates, resistance to proteolytic cleavage, reducing antigenicity, etc.). For example, nucleotide sequences encoding protease variants with little or no detectable activity have been used as parent polynucleotides in DNA shuffling experiments to produce progeny encoding highly active proteases (Ness et al. (1999) Nature Biotech. 17:893-96). Polynucleotide sequences produced by diversity generation methods or recursive sequence recombination ("RSR") methods (e.g., DNA shuffling) are a feature of the invention. Mutation and recombination methods using the nucleic acids described herein are a feature of the invention. For example, one method of the invention includes recursively recombining one or more nucleotide sequences of the invention as described above and below with one or more additional nucleotides. The recombining steps are optionally performed in vivo, ex vivo, in silico or in vitro. This diversity generation or recursive sequence recombination produces at least one library of recombinant modified GAT polynucleotides. Polypeptides encoded by members of this library are included in the invention. Also contemplated are uses of polynucleotides, also referred to herein as oligonucleotides, typically having at least 12 bases, preferably at least 15, more preferably at least 20, 30, or 50 or more bases, which hybridize under stringent or highly stringent conditions to a GAT polynucleotide sequence. The polynucleotides may be used as probes, primers, sense and antisense agents, and the like, according to methods as noted herein. In accordance with the present invention, GAT polynucleotides, including nucleotide sequences that encode GAT polypeptides, fragments of GAT polypeptides, related fusion proteins, or functional equivalents thereof, are used in recombinant DNA molecules that direct the expression of the GAT polypeptides in appropriate host cells, such as bacterial or plant cells. Due to the inherent degeneracy of the genetic code, other nucleic acid sequences which encode substantially the same or a functionally equivalent amino acid sequence can also be used to clone and express the GAT polynucleotides. The invention provides GAT polynucleotides that encode transcription and/or translation products that are subsequently spliced to ultimately produce functional GAT polypeptides. Splicing can be accomplished in vitro or in vivo, and can involve cis- or trans-splicing. The substrate for splicing can be polynucleotides (e.g., RNA transcripts) or polypeptides. An example of cis- splicing of a polynucleotide is where an intron inserted into a coding sequence is removed and the two flanking exon regions are spliced to generate a GAT polypeptide encoding sequence. An example of trans-splicing would be where a GAT polynucleotide is encrypted by separating the coding sequence into two or more fragments that can be separately transcribed and then spliced to form the full- length GAT encoding sequence. The use of a splicing enhancer sequence (which can be introduced into a construct of the invention) can facilitate splicing either in cis or trans. Cis- and trans-splicing of polypeptides are described in more detail elsewhere herein and in U.S. Patent Application Serial Nos. 09/517,933 and 09/710,686. Thus, some GAT polynucleotides do not directly encode a full-length GAT polypeptide, but rather encode a fragment or fragments of a GAT polypeptide. These GAT polynucleotides can be used to express a functional GAT polypeptide through a mechanism involving splicing, where splicing can occur at the level of polynucleotide (e.g., intron/exon) and/or polypeptide (e.g., intein/extein). This can be useful, for example, in controlling expression of GAT activity, since functional GAT polypeptide will only be expressed if all required fragments are expressed in an environment that permits splicing processes to generate functional product. In another example, introduction of one or more insertion sequences into a GAT polynucleotide can facilitate recombination with a low homology polynucleotide; use of an intron or intein for the insertion sequence facilitates the removal of the intervening sequence, thereby restoring function of the encoded variant. As will be understood by those of skill in the art, it can be advantageous to modify a coding sequence to enhance its expression in a particular host. The genetic code is redundant with 64 possible codons, but most organisms preferentially use a subset of these codons. The codons that are utilized most often in a species are called optimal codons, and those not utilized very often are classified as rare or low-usage codons (see, e.g., Zhang et al. (1991) Gene 105:61- 72). Codons can be substituted to reflect the preferred codon usage of the host, a process sometimes called "codon optimization" or "controlling for species codon bias." Optimized coding sequences containing codons preferred by a particular prokaryotic or eukaryotic host (see also, Murray et al. (1989) Nucl. Acids Res. 17:477-508) can be prepared, for example, to increase the rate of translation or to produce recombinant RNA transcripts having desirable properties, such as a longer half-life, as compared with transcripts produced from a non-optimized sequence. Translation stop codons can also be modified to reflect host preference. For example, preferred stop codons for S. cerevisiae and mammals are UAA and UGA, respectively. The preferred stop codon for monocotyledonous plants is UGA, whereas insects and E. coli prefer to use UAA as the stop codon (Dalphin et al. (1996) Nucl. Acids Res. 24: 216-218). Methodology for optimizing a nucleotide sequence for expression in a plant is provided, for example, in U.S. Patent No. 6,015,891, and the references cited therein. One embodiment of the invention includes a GAT polynucleotide having optimal codons for expression in a relevant host, e.g., a transgenic plant host. This is particularly desirable when a GAT polynucleotide of bacterial origin is introduced into a transgenic plant, e.g., to confer glyphosate resistance to the plant. The polynucleotide sequences of the present invention can be engineered in order to alter a GAT polynucleotide for a variety of reasons, including but not limited to alterations which modify the cloning, processing and/or expression of the gene product. For example, alterations may be introduced using techniques that are well known in the art, e.g., site-directed mutagenesis, to insert new restriction sites, alter glycosylation patterns, change codon preference, introduce splice sites, etc. As described in more detail herein, the polynucleotides of the invention include sequences which encode novel GAT polypeptides and sequences complementary to the coding sequences, and novel fragments of coding sequences and complements thereof. The polynucleotides can be in the form of RNA or in the form of DNA, and include mRNA, cRNA, synthetic RNA and DNA, genomic DNA and cDNA. The polynucleotides can be double-stranded or single-stranded, and if single-stranded, can be the coding strand or the non-coding (anti-sense, complementary) strand. The polynucleotides optionally include the coding sequence of a GAT polypeptide (i) in isolation, (ii) in combination with an additional coding sequence, so as to encode, e.g., a fusion protein, a pre-protein, a prepro-protein, or the like, (iii) in combination with non-coding sequences, such as introns or inteins, control elements such as a promoter, an enhancer, a terminator element, or 5' and/or 3' untranslated regions effective for expression of the coding sequence in a suitable host, and/or (iv) in a vector or host environment in wliich the GAT polynucleotide is a heterologous gene. Sequences can also be found in combination with typical compositional formulations of nucleic acids, including in the presence of carriers, buffers, adjuvants, excipients and the like. Polynucleotides and oligonucleotides of the invention can be prepared by standard solid-phase methods, according to known synthetic methods. Typically, fragments of up to about 100 bases are individually synthesized, then joined (e.g., by enzymatic or chemical ligation methods, or polymerase mediated methods) to form essentially any desired continuous sequence. For example, polynucleotides and oligonucleotides of the invention can be prepared by chemical synthesis using, e.g., the classical phosphoramidite method described by Beaucage et al. (1981) Tetrahedron Letters 22:1859-69, or the method described by Matthes et al. (1984) EMBO J. 3: 801-05, e.g., as is typically practiced in automated synthetic methods. According to the phosphoramidite method, oligonucleotides are synthesized, e.g., in an automatic DNA synthesizer, purified, annealed, ligated and cloned in appropriate vectors. In addition, essentially any nucleic acid can be custom ordered from any of a variety of commercial sources, such as The Midland Certified Reagent Company (mcrc@oligos.com), The Great American Gene Company (www.genco.com), ExpressGen Inc. (www.expressgen.com), Operon Technologies Inc. (Alameda, CA) and many others. Similarly, peptides and antibodies can be custom ordered from any of a variety of sources, such as PeptidoGenic (pkim@ccnet.com), HTI Bio-products, h e. (www.htibio.com), BMA Biomedicals Ltd (U.K.), Bio. Synthesis, Inc., and many others. Polynucleotides may also be synthesized by well-known techniques as described in the technical literature. See, e.g., Carruthers et al, Cold Spring Harbor Symp. Quant. Biol. 47: 411-418 (1982), and Adams et al. (1983) J Am. Chem. Soc. 105: 661. Double stranded DNA fragments may then be obtained either by synthesizing the complementary strand and annealing the strands together under appropriate conditions, or by adding the complementary strand using DNA polymerase with an appropriate primer sequence. General texts which describe molecular biological techniques useful herein, including mutagenesis, include Berger and Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology, Volume 152 (Academic Press, Inc., San Diego, CA); Sambrook et al. (1989) Molecular Cloning - A Laboratory Manual, 2nd ed., Volumes 1-3 (Cold Spring Harbor Laboratory, Cold Spring Harbor, New York); and Ausubel et al., eds. (2000) Current Protocols in Molecular Biology (Greene Publishing Associates, Inc. and John Wiley & Sons, Inc.). Examples of techniques sufficient to direct persons of skill through in vitro amplification methods, including the polymerase chain reaction (PCR), the ligase chain reaction (LCR), Qβ-replicase amplification and other RNA polymerase mediated techniques (e.g., NASBA) are found in Berger, Sambrook, and Ausubel, as well as in Mullis et al. (1987) U.S. Patent No. 4,683,202; Innis et al, eds. (1990) PCR Protocols: A Guide to Methods and Applications (Academic Press hie. San Diego, CA); Arnheim & Levinson (October 1, 1990) Chemical and Engineering News 36- 47; The Journal Of NIH Research (1991) 3: 81-94; Kwoh et al. (1989) Proc. Natl Acad. Sci. USA 86: 1173; Guatelli et al. (1990) Proc. Natl Acad. Sci. USA 87: 1874; Lomell et al. (1989) J Clin. Chem. 35: 1826; Landegren et al. (1988) Science 241 : 1077-1080; Van Brunt (1990) Biotechnology 8: 291-294; Wu and
Wallace (1989) Gene 4: 560; Barringer et al. (1990) Gene 89: 117, and Sooknanan and Malek (1995) Biotechnology 13: 563-564. Improved methods of cloning in vitro amplified nucleic acids are described in Wallace et al. U.S. Pat. No. 5,426,039. Improved methods of amplifying large nucleic acids by PCR are summarized in Cheng et al. (1994) Nature 369: 684-685 and the references cited therein, in which PCR amplicons of up to 40kb are generated. One of skill will appreciate that essentially any RNA can be converted into a double stranded DNA suitable for restriction digestion, PCR expansion and sequencing using reverse transcriptase and a polymerase.. See, Ausbel, Sambrook and Berger, all supra. One aspect of the invention provides an isolated or recombinant polynucleotide selected from the group consisting of SEQ ED NO: 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565, 566, 567, 620, 622, 624, 626, 628, 630, 632, 634, 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, 668, 670, 672, 674, 676, 678, 680, 682, 684, 686, 688, 690, 692, 694, 696, 698, 700, 702, 704, 706, 708, 710, 712, 714, 716, 718, 720, 722, 724, 726, 728, 730, 732, 734, 736, 738, 740, 742, 744, 746, 748, 750, 752, 754, 756, 758, 760, 762, 764, 768, 770, 772, 774, 776, 778, 780, 782, 784, 786, 788, 790, 792, 794, 796, 798, 800, 802, 804, 806, 808, 810, 812, 814, 816, 818, 820, 822, 824, 832, 834, 836, 838, 840, 842, 844, 846, 848, 850, 852, 854, 856, 858, 860, 862, 864, 866, 868, 870, 872, 874, 876, 878, 880, 882, 884, 886, 888, 890, 892, 894, 896, 898, 900, 902, 904, 906, 908, 910, 912, 914, 916, 918, 920, 922, 924, 926, 928, 930, 932, 933, 934, 935, 936, 937, 938, 939, 940, 941, 942, 943, 944, 945, 947, 949, 951, and 952. Preferred polynucleotides of the present invention include an isolated or recombinant polynucleotide sequence encoding and amino acid sequence that can be optimally aligned with a reference amino acid sequence selected from the group consisting of SEQ LD NO: 300, 445, and 457 to generate a similarity score of at least 460 using the BLOSUM62 matrix, a gap existence penalty of 11, and a gap extension penalty of 1, wherein one or more of the following positions conform to the following restrictions: (i) at positions 18 and 38, there is a Z5 amino acid residue; (ii) at position 62, there is a ZI amino acid residue; (iii) at position 124, there is a Z6 amino acid residue; and (iv) at position 144, there is a Z2 amino acid residue, wherein: ZI is an amino acid residue selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid residue selected from the group consisting of F, W, and Y; Z5 is an amino acid residue selected from the group consisting of D and E; and Z6 is an amino acid residue selected from the group consisting of C, G, and P, and further wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 90% conform to the following restrictions: (a) at positions 2, 4, 15, 19, 26, 28, 31, 45, 51, 54, 86, 90, 91, 97, 103, 105, 106, 114, 123, 129, 139, and/or 145 the amino acid residue is Bl; and (b) at positions 3, 5, 8, 10, 11, 14, 17, 24, 27, 32, 37, 47, 48, 49, 52, 57, 58, 61, 63, 68, 69, 79, 80, 82, 83, 89, 92, 100, 101, 104, 119, 120, 125, 126, 128, 131, and/or 143 the amino acid residue is B2; wherein Bl is an amino acid selected from the group consisting of A, I, L, M, F, W, Y, and V; and B2 is an amino acid selected from the group consisting of R, N, D, C, Q, E, G, H, K, P, S, and T. When used to specify an amino acid or amino acid residue, the single letter designations A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, and Y have their standard meaning as used in the art and as provided in Table 1 herein. Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence such that when the sequence is optimally aligned with a reference amino acid sequence selected from the group consisting of SEQ ED NO: 300, 445, and 457 to generate a similarity score of at least 460 using the BLOSUM62 matrix, a gap existence penalty of 11, and a gap extension penalty of 1, one or more of the following positions conform to the following restrictions: (i) at positions 18 and 38, there is a Z5 amino acid residue; (ii) at position 62, there is a ZI amino acid residue; (iii) at position 124, there is a Z6 amino acid residue; and (iv) at position 144, there is a Z2 amino acid residue, wherein: ZI is an amino acid residue selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid residue selected from the group consisting of F, W, and Y; Z5 is an amino acid residue selected from the group consisting of D and E; and Z6 is an amino acid residue selected from the group consisting of C, G, and P, and further wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 80% conform to the following restrictions: (a) at positions 2, 4, 15, 19, 26, 28, 51, 54, 86, 90, 91, 97, 103, 105, 106, 114, 129, 139, and/or 145 the amino acid residue is ZI; (b) at positions 31 and/or 45 the amino acid residue is Z2; (c) at position 8 the amino acid residue is Z3; (d) at position 89 the amino acid residue is Z3 or Z6; (e) at positions 82, 92, 101 and/or 120 the amino acid residue is Z4; (f) at positions 3, 11, 27 and/or 79 the amino acid residue is Z5; (g) at position 18 the amino acid residue is Z4 or Z5; (h) at position 123 the amino acid residue is ZI or Z2; (i) at positions 12, 33, 35, 39, 53, 59, 112, 132, 135, 140, and/or 146 the amino acid residue is ZI or Z3; (j) at position 30 the amino acid residue is ZI; (k) at position 6 the amino acid residue is Z6; (1) at position 81 the amino acid residue is Z2 or Z4; (m) at position 113 the amino acid residue is Z3; (n) at position 138 the amino acid residue is Z4; (o) at position 142 the amino acid residue is Z2; (p) at positions 57 and/or 126 the amino acid residue is Z3 or Z4; (q) at position 5, 17, and 61 the amino acid residue is Z4; (r) at position 24 the amino acid residue is Z3; (s) at position 104 the amino acid residue is Z5; (t) at positions 52 and/or 69 the amino acid residue is Z3; (u) at positions 14 and/or 119 the amino acid residue is Z5; (v) at positions 10, 32, 63, and/or 83 the amino acid residue is Z5; (w) at positions 48 and/or 80 the amino acid residue is Z6; (x) at position 40 the amino acid residue is ZI or Z2; (y) at position 96 the amino acid residue is Z3 or Z5; (z) at position 65 the amino acid residue is Z3, Z4, or Z6; (aa) at positions 84 and/or 115 the amino acid residue is Z3; (ab) at position 93 the amino acid residue is Z4; (ac) at position 130 the amino acid residue is Z2; (ad) at position 58 the amino acid residue is Z3, Z4, or Z6; (ae) at position 47 the amino acid residue is Z4 or Z6; (af) at positions 49 and/or 100 the amino acid residue is Z3 or Z4; (ag) at position 68 the amino acid residue is Z4 or Z5; (ah) at position 143 the amino acid residue is Z4; (ai) at position 131 the amino acid residue is Z5; (aj) at positions 125 and/or 128 the amino acid residue is Z5; (ak) at position 67 the amino acid residue is Z3 or Z4;.(al) at position 60 the amino acid residue is Z5; and (am) at position 37 the amino acid residue is Z4 or Z6; wherein ZI is an amino acid selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid selected from the group consisting of F, W, and Y; Z3 is an amino acid selected from the group consisting of N, Q, S, and T; Z4 is an amino acid selected from the group consisting of R, H, and K; Z5 is an amino acid selected from the group consisting of D and E; and Z6 is an amino acid selected from the group consisting of C, G, and P. Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence further comprising the amino acid residues in the amino acid sequence that correspond to the positions specified in (a)-(am), at least 90% conform to the amino acid residue restrictions specified in(a)-(am). Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence such that when the sequence is optimally aligned with SEQ LD NO: 300, 445, or 457, at least 90% of the amino acid residues in the amino acid sequence conform to the following restrictions: (a) at positions 1, 7, 9, 13, 20, 36, 42, 46, 50, 56, 64, 70, 72, 75, 76, 78, 94, 98, 107, 110, 117, 118, 121, and/or 141 the amino acid residue is Bl; and (b) at positions 16, 21, 22, 23, 25, 29, 34, 41, 43, 44, 55, 66, 71, 73, 74, 77, 85, 87, 88, 95, 99, 102, 108, 109, 111, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is B2; wherein Bl is an amino acid selected from the group consisting of A, I, L, M, F, W, Y, and V; and B2 is an amino acid selected from the group consisting of R, N, D, C, Q, E, G, H, K, P, S, and T. Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence such that when the sequence is optimally aligned with SEQ ED NO: 300, 445, or 457, at least 90% of the amino acid residues in the amino acid sequence conform to the following restrictions: (a) at positions 1, 7, 9, 13, 20, 42, 46, 50, 56, 64, 70, 72, 75, 76, 78, 94, 98, 107, 110, 117, 118, 121, and/or 141 the amino acid residue is Bl; and (b) at positions 16, 21, 22, 23, 25, 29, 34, 36, 41, 43, 44, 55, 66, 71, 73, 74, 77, 85, 87, 88, 95, 99,102, 108, 109, 111, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is B2; wherein Bl is an amino acid selected from the group consisting of A, I, L, M, F, W, Y, and V; and B2 is an amino acid selected from the group consisting of R, N, D, C, Q, E, G, H, K, P, S, and T. Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence such that when the sequence is optimally aligned with SEQ ED NO: 300, 445, or 457, at least 90% of the amino acid residues in the amino acid sequence conform to the following restrictions: (a) at positions 1, 7, 9, 20, 42, 50, 72, 75, 76, 78, 94, 98, 110, 121, and/or 141 the amino acid residue is ZI; (b) at positions 13, 46, 56, 70, 107, 117, and/or 118 the amino acid residue is Z2; (c) at positions 23, 55, 71, 77, 88, and/or 109 the amino acid residue is Z3; (d) at positions 16, 21, 41, 73, 85, 99, and/or 111 the amino acid residue is Z4; (e) at positions 34 and/or 95 the amino acid residue is Z5; (f) at position 22, 25, 29, 43, 44, 66, 74, 87, 102, 108, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is Z6; wherein ZI is an amino acid selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid selected from the group consisting of F, W, and Y; Z3 is an amino acid selected from the group consisting of N, Q, S, and T; Z4 is an amino acid selected from the group consisting of R, H, and K; Z5 is an amino acid selected from the group consisting of D and E; and Z6 is an amino acid selected from the group consisting of C, G, and P. Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence further comprising at position 36 an amino acid residue selected from the group consisting of ZI and Z3. Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence further comprising at position 64 an amino acid residue selected from the group consisting of ZI and Z2. Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence such that when the sequence is optimally aligned with SEQ ED NO: 300, 445, or 457, at least 80% of the amino acid residues in the amino acid sequence conform to the following restrictions: (a) at position 2 the amino acid residue is I or L; (b) at position 3 the amino acid residue is E; (c) at position 4 the amino acid residue is V or I; (d) at position 5 the amino acid residue is K; (e) at position 6 the amino acid residue is P; (f) at position 8 the amino acid residue is N; (g) at position 10 the amino acid residue is E; (h) at position 11 the amino acid residue is D or E; (i) at position 12 the amino acid residue is T; (j) at position 14 the amino acid residue is E or D; (k) at position 15 the amino acid residue is L; (1) at position 17 the amino acid residue is H; (m) at position 18 the amino acid residue is R, E or K; (n) at position 19 the amino acid residue is I or V; (o) at position 24 the amino acid residue is Q; (p) at position 26 the amino acid residue is M, L, V or I; (q) at position 27 the amino acid residue is E; (r) at position 28 the amino acid residue is A or V; (s) at position 30 the amino acid residue is M; (t) at position 31 the amino acid residue is Y or F; (u) at position 32 the amino acid residue is E or D; (v) at position 33 the amino acid residue is T or S; (w) at position 35 the amino acid residue is L; (x) at position 37 the amino acid residue is R, G, E or Q; (y) at position 39 the amino acid residue is A or S; (z) at position 40 the amino acid residue is F or L ; (aa) at position 45 the amino acid residue is Y or F; (ab) at position 47 the amino acid residue is R or G; (ac) at position 48 the amino acid residue is G; (ad) at position 49 the amino acid residue is K, R, or Q; (ae) at position 51 the amino acid residue is I or V; (at) at position 52 the amino acid residue is S; (ag) at position 53 the amino acid residue is I or V; (ah) at position 54 the amino acid residue is A; (ai) at position 57 the amino acid residue is H or N; (aj) at position 58 the amino acid residue is Q, K, R or P; (ak) at position 59 the amino acid residue is A; (al) at position 60 the amino acid residue is E; (am) at position 61 the amino acid residue is H or R; (an) at position 63 the amino acid residue is E or D; (ao) at position 65 the amino acid residue is E, P or Q; (ap) at position 67 the amino acid residue is Q or R; (aq) at position 68 the amino acid residue is K or E; (ar) at position 69 the amino acid residue is Q; (as) at position 79 the amino acid residue is E; (at) at position 80 the amino acid residue is G; (au) at position 81 the amino acid residue is Y, H or F; (av) at position 82 the amino acid residue is R; (aw) at position 83 the amino acid residue is E or D; (ax) at position 84 the amino acid residue is Q; (ay) at position 86 the amino acid residue is A; (az) at position 89 the amino acid residue is G, T or S; (ba) at position 90 the amino acid residue is L; (bb) at position 91 the amino acid residue is L, I or V; (be) at position 92 the amino acid residue is R or K; (bd) at position 93 the amino acid residue is H; (be) at position 96 the amino acid residue is E or Q; (bf) at position 97 the amino acid residue is I; (bg) at position 100 the amino acid residue is K or N ; (bh) at position 101 the amino acid residue is K or R; (bi) at position 103 the amino acid residue is A or V; (bj) at position 104 the amino acid residue is D; (bk) at position 105 the amino acid residue is M, L or I; (bl) at position 106 the amino acid residue is L; (bm) at position 112 the amino acid residue is T or A; (bn) at position 113 the amino acid residue is S or T; (bo) at position 114 the amino acid residue is A; (bp) at position 115 the amino acid residue is S; (bq) at position 119 the amino acid residue is K or R; (br) at position 120 the amino acid residue is K or R; (bs) at position 123 the amino acid residue is F or L; (bt) at position 125 the amino acid residue is E; (bu) at position 126 the amino acid residue is Q or H; (bv) at position 128 the amino acid residue is E or D; (bw) at position 129 the amino acid residue is V or I; (bx) at position 130 the amino acid residue is F; (by) at position 131 the amino acid residue is D or E; (bx) at position 132 the amino acid residue is T; (ca) at position 135 the amino acid residue is V; (cb) at position 138 the amino acid residue is H; (cc) at position 139 the amino acid residue is I; (cd) at position 140 the amino acid residue is L or M; (ce) at position 142 the amino acid residue is Y; (cf) at position 143 the amino acid residue is K or R; (eg) at position 145 the amino acid residue is L or I; and (ch) at position 146 the amino acid residue is T. Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence such that when the sequence is optimally aligned with SEQ ED NO: 300, 445, or 457, at least 90% of the amino acid residues in the amino acid sequence conform to the amino acid residue restrictions specified in (a) - (ch) above. Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence that when optimally aligned with a reference amino acid sequence selected from the group consisting of SEQ LD NO: 300, 445, and 457 to generate a similarity score of at least 460 using the BLOSUM62 matrix, a gap existence penalty of 11, and a gap extension penalty of 1, one or more of the following positions conform to the following restrictions: (i) at positions 18 and 38, there is a Z5 amino acid residue; (ii) at position 62, there is a ZI amino acid residue; (iii) at position 124, there is a Z6 amino acid residue; and (iv) at position 144, there is a Z2 amino acid residue, wherein: ZI is an amino acid residue selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid residue selected from the group consisting of F, W, and Y; Z5 is an amino acid residue selected from the group consisting of D and E; and Z6 is an amino acid residue selected from the group consisting of C, G, and P, further wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 80% conform to the following restrictions: (a) at positions 9, 76, 94 and 110 the amino acid residue is A; (b) at positions 29 and 108 the amino acid residue is C; (c) at position 34 the amino acid residue is D; (d) at position 95 the amino acid residue is E; (e) at position 56 the amino acid residue is F; (f) at positions 43, 44, 66, 74, 87, 102, 116, 122, 127 and 136 the amino acid residue is G; (g) at position 41 the amino acid residue is H; (h) at position 7 the amino acid residue is I; (i) at position 85 the amino acid residue is K; (j) at positions 20, 42, 50, 78 and 121 the amino acid residue is L; (k) at positions 1 and 141 the amino acid residue is M; (1) at positions 23 and 109 the amino acid residue is N; (m) at positions 22, 25, 133, 134 and 137 the amino acid residue is P; (n) at position 71 the amino acid residue is Q; (o) at positions 16, 21, 73, 99 and 111 the amino acid residue is R; (p) at position 55 the amino acid residue is S; (q) at position 77 the amino acid residue is T; (r) at position 107 the amino acid residue is W; and (s) at position 13, 46, 70 and 118 the amino acid residue is Y. Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence which conforms to at least one of the following additional restrictions: (a) at position 36 the amino acid residue is M, L, or T; (b) at position 72 the amino acid residue is L or I; (c) at position 75 the amino acid residue is M or V; (d) at position 64 the amino acid residue is L, I, or F; (e) at position 88 the amino acid residue is T or S; (f) at position 117 the amino acid residue is Y or F. Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence in which at least one of the following additional conditions is met: (a) at position 14 the amino acid residue is D; (b) at position 18 the amino acid residue is E; (c) at position 26 the amino acid residue is M or V; (e) at position 30 the amino acid residue is I; (f) at position 32 the amino acid residue is D; (g) at position 36 the amino acid residue is M or T; (i) at position 37 the amino acid residue is C; (j) at position 38 the amino acid residue is D; (j) at position 53 the amino acid residue is V; (k) at position 58 the amino acid residue is R; (1) at position 61 the amino acid residue is R; (m) at position 62 the amino acid residue is L; (n) at position 64 the amino acid residue is I or F; (o) at position 65 the amino acid residue is P; (p) at position 72 the amino acid residue is I; (q) at position 75 the amino acid residue is V; (r) at position 88 the amino acid residue is T; (s) at position 89 the amino acid residue is G; (t) at position 91 the amino acid residue is L; (u) at position 98 the amino acid residue is I; (v) at position 105 the amino acid residue I; (w) at position 112 the amino acid residue is A; (x) at position 124 the amino acid residue is G or C; (y) at position 128 the amino acid residue is D; (z) at position 140 the amino acid residue is M; (aa) at position 143 the amino acid residue is R; and (ab) at position 144 the amino acid residue is W. Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence wherein, of the amino acid residues in the amino acid sequence that correspond to the positions specified in (a) through (ab) as described above, at least 80% conform to the amino acid residue restrictions specified in (a) through (ab). Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence which conforms to at least one of the following additional restrictions: (a) at position 41 the amino acid residue is H; (b) at position 138 the amino acid residue is H; (c) at position 34 the amino acid residue is N; and (d) at position 55 the amino acid residue is S. Some preferred isolated or recombinant polynucleotides of the invention are selected from the group consisting of: (a) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ LD NO: 577; (b) a nucleotide sequence encoding an amino acid sequence that is at least 97% identical to SEQ ED NO:578; (c) a nucleotide sequence encoding an amino acid sequence that is at least 97% identical to SEQ ED NO: 621; (d) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO: 579; (e) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ LD NO:602; (X) a nucleotide sequence encoding an amino acid sequence that is at least 95 % identical to SEQ LD NO: 697; (g) a nucleotide sequence encoding an amino acid sequence that is at least 96% identical to SEQ ED NO:721; (h) a nucleotide sequence encoding an amino acid sequence that is at least 97% identical to SEQ ID NO:613; (i) a nucleotide sequence encoding an amino acid sequence that is at least 89% identical to SEQ ED NO:677; (j) a nucleotide sequence encoding an amino acid sequence that is at least 96% identical to SEQ ED NO: 584; (k) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:707; (1) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:616; (m) a nucleotide sequence encoding an amino acid sequence that is at least 96% identical to SEQ ED 'NO:612;and (n) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:590. Some preferred isolated or recombinant polynucleotides of the invention are selected from the group consisting of: (a) a nucleotide sequence encoding an amino acid sequence that is at least 98%> identical to SEQ ED NO:577; (b) a nucleotide sequence encoding an amino acid sequence that is at least 97% identical to SEQ LD NO:578; (c) a nucleotide sequence encoding an amino acid sequence that is at least 97% identical to SEQ ED NO:621; (d) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:579; (e) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ LD NO: 602; (f) a nucleotide sequence encoding an amino acid sequence that is at least 95% identical to SEQ ED NO:697; (g) a nucleotide sequence encoding an amino acid sequence that is at least 96% identical to SEQ ED NO:721; (h) a nucleotide sequence encoding an amino acid sequence that is at least 97%> identical to SEQ ED NO:613; (i) a nucleotide sequence encoding an amino acid sequence that is at least 89% identical to SEQ ED NO:677; (j) a nucleotide sequence encoding an amino acid sequence that is at least 96% identical to SEQ ED NO: 584; (k) a nucleotide sequence encoding an amino acid sequence that is at least 98%> identical to SEQ ED NO: 707; (1) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:616; (m) a nucleotide sequence encoding an amino acid sequence that is at least 96% identical to SEQ ED NO:612; and (n) a nucleotide sequence encoding an amino acid sequence that is at least 98%) identical to SEQ ED NO:590, wherein the following positions conform to the following restrictions: (i) at positions 18 and 38, there is a Z5 amino acid residue; (ii) at position 62, there is a ZI amino acid residue; (iii) at position 124, there is a Z6 amino acid residue; and (iv) at position 144, there is a Z2 amino acid residue, wherein: ZI is an amino acid residue selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid residue selected from the group consisting of F, W, and Y; Z5 is an amino acid residue selected from the group consisting of D and E; and Z6 is an amino acid residue selected from the group consisting of C, G, and P. Some preferred isolated or recombinant polynucleotides of the invention are selected from the group consisting of: (a) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:577; (b) a nucleotide sequence encoding an amino acid sequence that is at least 97% identical to SEQ ED NO:578; (c) a nucleotide sequence encoding an amino acid sequence that is at least 97% identical to SEQ LD NO:621; (d) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:579; (e) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:602; (f) a nucleotide sequence encoding an amino acid sequence that is at least 95% identical to SEQ ED NO:697; (g) a nucleotide sequence encoding an amino acid sequence that is at least 96%> identical to SEQ ED NO: 721; (h) a nucleotide sequence encoding an amino acid sequence that is at least 97% identical to SEQ LD NO:613; (i) a nucleotide sequence encoding an amino acid sequence that is at least 89% identical to SEQ LD NO:677; (j) a nucleotide sequence encoding an amino acid sequence that is at least 96%> identical to SEQ ED NO: 584; (k) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:707; (1) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:616; (m) a nucleotide sequence encoding an amino acid sequence that is at least 96%> identical to SEQ ED NO:612; and (n) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:590, further wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 90% conform to the following restrictions: (a) at positions 2, 4, 15, 19, 26, 28, 31, 45, 51, 54, 86, 90, 91, 97, 103, 105, 106, 114, 123, 129, 139, and/or 145 the amino acid residue is Bl; and (b) at positions 3, 5, 8, 10, 11, 14, 17, 24, 27, 32, 37, 47, 48, 49, 52, 57, 58, 61, 63, 68, 69, 79, 80, 82, 83, 89, 92, 100, 101, 104, 119, 120, 125, 126, 128, 131, and/or 143 the amino acid residue is B2; wherein Bl is an amino acid selected from the group consisting of A, I, L, M, F, W, Y, and V; and B2 is an amino acid selected from the group consisting of R, N, D, C, Q, E, G, H, K, P, S, and T. Some preferred isolated or recombinant polynucleotides of the invention are selected from the group consisting of: (a) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:577; (b) a nucleotide sequence encoding an amino acid sequence that is at least 97% identical to SEQ ED NO:578; (c) a nucleotide sequence encoding an amino acid sequence that is at least 97% identical to SEQ ED NO:621; (d) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:579; (e) a nucleotide sequence encoding an amino acid sequence that is at least 98%> identical to SEQ ED NO:602; (f) a nucleotide sequence encoding an amino acid sequence that is at least 95% identical to SEQ ED NO:697; (g) a nucleotide sequence encoding an amino acid sequence that is at least 96% identical to SEQ ED NO:721; (h) a nucleotide sequence encoding an amino acid sequence that is at least 97%) identical to SEQ ED NO:613; (i) a nucleotide sequence encoding an amino acid sequence that is at least 89% identical to SEQ ED NO:677; (j) a nucleotide sequence encoding an amino acid sequence that is at least 96% identical to SEQ ED NO: 584; (k) a nucleotide sequence encoding an amino acid sequence that is at least 98%o identical to SEQ ED NO:707; (1) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:616; (m) a nucleotide sequence encoding an amino acid sequence that is at least 96% identical to SEQ ED NO:612;and (n) a nucleotide sequence encoding an amino acid sequence that is at least 98%o identical to SEQ ED NO:590, and further wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 80% conform to the following restrictions: (a) at positions 2, 4, 15, 19, 26, 28, 51, 54, 86, 90, 91, 97, 103, 105, 106, 114, 129, 139, and/or 145 the amino acid residue is ZI; (b) at positions 31 and/or 45 the amino acid residue is Z2; (c) at position 8 the amino acid residue is Z3; (d) at position 89 the amino acid residue is Z3 or Z6; (e) at positions 82, 92, 101 and/or 120 the amino acid residue is Z4; (f) at positions 3, 11, 27 and/or 79 the amino acid residue is Z5; (g) at position 18 the amino acid residue is Z4 or Z5; (h) at position 123 the amino acid residue is ZI or Z2; (i) at positions 12, 33, 35, 39, 53, 59, 112, 132, 135, 140, and/or 146 the amino acid residue is ZI or Z3; (j) at position 30 the amino acid residue is ZI; (k) at position 6 the amino acid residue is Z6; (1) at position 81 the amino acid residue is Z2 or Z4; (m) at position 113 the amino acid residue is Z3; (n) at position 138 the amino acid residue is Z4; (o) at position 142 the amino acid residue is Z2; (p) at positions 57 and or 126 the amino acid residue is Z3 or Z4; (a at position 5, 17, and 61 the amino acid residue is Z4; (r) at position 24 the amino acid residue is Z3; (s) at position 104 the amino acid residue is Z5; (t) at positions 52, and/or 69 the amino acid residue is Z3; (u) at positions 14 and/or 119 the amino acid residue is Z5; (v) at positions 10, 32, 63, and/or 83 the amino acid residue is Z5 ; (w) at positions 48 and/or 80 the amino acid residue is Z6; (x) at position 40 the amino acid residue is ZI or Z2; (y) at position 96 the amino acid residue is Z3 or Z5; (z) at position 65 the amino acid residue is Z3, Z4, or Z6; (aa) at positions 84 and/or 115 the amino acid residue is Z3; (ab) at position 93 the amino acid residue is Z4; (ac) at position 130 the amino acid residue is Z2; (ad) at position 58 the amino acid residue is Z3, Z4 or Z6; (ae) at position 47 the amino acid residue is Z4 or Z6; (af) at positions 49 and/or 100 the amino acid residue is Z3 or Z4; (ag) at position 68 the amino acid residue is Z4 or Z5; (ah) at position 143 the amino acid residue is Z4; (ai) at position 131 the amino acid residue is Z5; (aj) at positions 125 and/or 128 the amino acid residue is Z5; (ak) at position 67 the amino acid residue is Z3 or Z4; (al) at position 60 the amino acid residue is Z5; and (am) at position 37 the amino acid residue is Z4 or Z6; wherein ZI is an amino acid selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid selected from the group consisting of F, W, and Y; Z3 is an amino acid selected from the group consisting of N, Q, S, and T; Z4 is an amino acid selected from the group consisting of R, H, and K; Z5 is an amino acid selected from the group consisting of D and E; and Z6 is an amino acid selected from the group consisting of C, G, and P. Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence further wherein of the amino acid residues in the amino acid sequence that correspond to the positions specified in (a)-(am), at least 90% conform to the amino acid residue restrictions specified in(a)-(am). Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence in which of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 90% conform to the following additional restrictions: (a) at positions 1, 7, 9, 13, 20, 36, 42, 46, 50, 56, 64, 70, 72, 75, 76, 78, 94, 98, 107, 110, 117, 118, 121, and or 141 the amino acid residue is Bl; and (b) at positions 16, 21, 22, 23, 25, 29, 34, 41, 43, 44, 55, 66, 71, 73, 74, 77, 85, 87, 88, 95, 99, 102, 108, 109, 111, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is B2; wherein Bl is an amino acid selected from the group consisting of A, I, L, M, F, W, Y, and V; and B2 is an amino acid selected from the group consisting of R, N, D, C, Q, E, G, H, K, P, S, and T. Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence such that when the sequence is optimally aligned with SEQ LD NO: 300, 445, or 457, at least 90% of the amino acid residues in the amino acid sequence conform to the following restrictions: (a) at positions 1, 7, 9, 13, 20, 42, 46, 50, 56, 64, 70, 72, 75, 76, 78, 94, 98, 107, 110, 117, 118, 121, and/or 141 the amino acid residue is Bl; and (b) at positions 16, 21, 22, 23, 25, 29, 34, 36, 41, 43, 44, 55, 66, 71, 73, 74, 77, 85, 87, 88, 95, 99,102, 108, 109, 111, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is B2; wherein Bl is an amino acid selected from the group consisting of A, I, L, M, F, W, Y, and V; and B2 is an amino acid selected from the group consisting of R, N, D, C, Q, E, G, H, K, P, S, and T. Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence such that when the sequence is optimally aligned with SEQ ED NO: 300, 445, or 457, at least 90%> of the amino acid residues in the amino acid sequence conform to the following restrictions: (a) at positions 1, 7, 9, 20, 42, 50, 72, 75, 76, 78, 94, 98, 110, 121, and or 141 the amino acid residue is ZI; (b) at positions 13, 46, 56, 70, 107, 117, and/or 118 the amino acid residue is Z2; (c) at positions 23, 55, 71, 77, 88, and/or 109 the amino acid residue is Z3; (d) at positions 16, 21, 41, 73, 85, 99, and/or 111 the amino acid residue is Z4; (e) at positions 34 and/or 95 the amino acid residue is Z5; (f) at position 22, 25, 29, 43, 44, 66, 74, 87, 102, 108, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is Z6; wherein ZI is an amino acid selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid selected from the group consisting of F, W, and Y; Z3 is an amino acid selected from the group consisting of N, Q, S, and T; Z4 is an amino acid selected from the group consisting of R, H, and K; Z5 is an amino acid selected from the group consisting of D and E; and Z6 is an amino acid selected from the group consisting of C, G, and P. Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence further comprising at position 36 an amino acid residue selected from the group consisting of ZI and Z3. Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence further comprising at position 64 an amino acid residue selected from the group consisting of ZI and Z2. Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence such that when the sequence is optimally aligned with SEQ ED NO: 300, 445, or 457, at least 80% of the amino acid residues in the amino acid sequence conform to the following restrictions: (a) at position 2 the amino acid residue is I or L; (b) at position 3 the amino acid residue is E; (c) at position 4 the amino acid residue is V or I; (d) at position 5 the amino acid residue is K; (e) at position 6 the amino acid residue is P; (f) at position 8 the amino acid residue is N; (g) at position 10 the amino acid residue is E; (h) at position 11 the amino acid residue is D or E; (i) at position 12 the amino acid residue is T; (j) at position 14 the amino acid residue is E or D; (k) at position 15 the amino acid residue is L; (1) at position 17 the amino acid residue is H; (m) at position 18 the amino acid residue is R, E or K; (n) at position 19 the amino acid residue is I or V; (o) at position 24 the amino acid residue is Q; (p) at position 26 the amino acid residue is M, L, V or I; (q) at position 27 the amino acid residue is E; (r) at position 28 the amino acid residue is A or V; (s) at position 30 the amino acid residue is M; (t) at position 31 the amino acid residue is Y or F; (u) at position 32 the amino acid residue is E or D; (v) at position 33 the amino acid residue is T or S; (w) at position 35 the amino acid residue is L; (x) at position 37 the amino acid residue is R, G, E or Q; (y) at position 39 the amino acid residue is A or S; (z) at position 40 the amino acid residue is F or L ; (aa) at position 45 the amino acid residue is Y or F; (ab) at position 47 the amino acid residue is R or G; (ac) at position 48 the amino acid residue is G; (ad) at position 49 the amino acid residue is K, R, or Q; (ae) at position 51 the amino acid residue is I or V; (af) at position 52 the amino acid residue is S; (ag) at position 53 the amino acid residue is I or V; (all) at position 54 the amino acid residue is A; (ai) at position 57 the amino acid residue is H or N; (aj) at position 58 the amino acid residue is Q, K, R or P; (ak) at position 59 the amino acid residue is A; (al) at position 60 the amino acid residue is E; (am) at position 61 the amino acid residue is H or R; (an) at position 63 the amino acid residue is E or D; (ao) at position 65 the amino acid residue is E, P or Q; (ap) at position 67 the amino acid residue is Q or R; (aq) at position 68 the amino acid residue is K or E; (ar) at position 69 the amino acid residue is Q; (as) at position 79 the amino acid residue is E; (at) at position 80 the amino acid residue is G; (au) at position 81 the amino acid residue is Y, H or F; (av) at position 82 the amino acid residue is R; (aw) at position 83 the amino acid residue is E or D; (ax) at position 84 the amino acid residue is Q; (ay) at position 86 the amino acid residue is A; (az) at position 89 the amino acid residue is G, T or S; (ba) at position 90 the amino acid residue is L; (bb) at position 91 the amino acid residue is L, I or V; (be) at position 92 the amino acid residue is R or K; (bd) at position 93 the amino acid residue is H; (be) at position 96 the amino acid residue is E or Q; (hi) at position 97 the amino acid residue is I; (bg) at position 100 the amino acid residue is K or N ; (bh) at position 101 the amino acid residue is K or R; (bi) at position 103 the amino acid residue is A or V; (bj) at position 104 the amino acid residue is D; (bk) at position 105 the amino acid residue is M, L or I; (bl) at position 106 the amino acid residue is L; (bm) at position 112 the amino acid residue is T or A; (bn) at position 113 the amino acid residue is S or T; (bo) at position 114 the amino acid residue is A; (bp) at position 115 the amino acid residue is S; (bq) at position 119 the amino acid residue is K or R; (br) at position 120 the amino acid residue is K or R; (bs) at position 123 the amino acid residue is F or L; (bt) at position 125 the amino acid residue is E; (bu) at position 126 the amino acid residue is Q or H; (bv) at position 128 the amino acid residue is E or D; (bw) at position 129 the amino acid residue is V or I; (bx) at position 130 the amino acid residue is F; (by) at position 131 the amino acid residue is D or E; (bx) at position 132 the amino acid residue is T; (ca) at position 135 the amino acid residue is V; (cb) at position 138 the amino acid residue is H; (cc) at position 139 the amino acid residue is I; (cd) at position 140 the amino acid residue is L or M; (ce) at position 142 the amino acid residue is Y; (cf) at position 143 the amino acid residue is K or R; (eg) at position 145 the amino acid residue is L or I; and (ch) at position 146 the amino acid residue is T. Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence such that when the sequence is optimally aligned with SEQ ED NO: 300, 445, or 457, at least 90% of the amino acid residues in the amino acid sequence conform to the amino acid residue restrictions specified in (a) - (ch) above. Some preferred isolated or recombinant polynucleotides of the invention are selected from the group consisting of: (a) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:577; (b) a nucleotide sequence encoding an amino acid sequence that is at least 97% identical to SEQ ED NO:578; (c) a nucleotide sequence encoding an amino acid sequence that is at least 97% identical to SEQ ED NO:621; (d) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:579; (e) a nucleotide sequence encoding an amino acid sequence that is at least 98 > identical to SEQ ED NO:602; (f) a nucleotide sequence encoding an amino acid sequence that is at least 95% identical to SEQ ED NO: 697; (g) a nucleotide sequence encoding an amino acid sequence that is at least 96% identical to SEQ ID NO: 721; (h) a nucleotide sequence encoding an amino acid sequence that is at least 97%) identical to SEQ ED NO:613; (i) a nucleotide sequence encoding an amino acid sequence that is at least 89% identical to SEQ ED NO:677; (j) a nucleotide sequence encoding an amino acid sequence that is at least 96% identical to SEQ ED NO: 584; (k) a nucleotide sequence encoding an amino acid sequence that is at least 98%) identical to SEQ ED NO:707; (1) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ LD NO:616; (m) a nucleotide sequence encoding an amino acid sequence that is at least 96% identical to SEQ ED NO:612; and (n) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:590, and further wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 80% conform to the following restrictions: (a) at positions 9, 76, 94 and 110 the amino acid residue is A; (b) at positions 29 and 108 the amino acid residue is C; (c) at position 34 the amino acid residue is D; (d) at position 95 the amino acid residue is E; (e) at position 56 the amino acid residue is F; (f) at positions 43, 44, 66, 74, 87, 102, 116, 122, 127 and 136 the amino acid residue is G; (g) at position 41 the amino acid residue is H; (h) at position 7 the amino acid residue is I; (i) at position 85 the amino acid residue is K; (j) at positions 20, 42, 50, 78 and 121 the amino acid residue is L; (k) at positions 1 and 141 the amino acid residue is M; (1) at positions 23 and 109 the amino acid residue is N; (m) at positions 22, 25, 133, 134 and 137 the amino acid residue is P; (n) at position 71 the amino acid residue is Q; (o) at positions 16, 21, 73, 99 and 111 the amino acid residue is R; (p) at position 55 the amino acid residue is S; (q) at position 77 the amino acid residue is T; (r) at position 107 the amino acid residue is W; and (s) at position 13, 46, 70 and 118 the amino acid residue is Y. Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence further comprising at least one amino acid residue that meets the following criteria: (a) at position 14 the amino acid residue is D; (b) at position 18 the amino acid residue is E; (c) at position 26 the amino acid residue is M or V; (e) at position 30 the amino acid residue is I; (f) at position 32 the amino acid residue is D; (g) at position 36 the amino acid residue is M or T; (i) at position 37 the amino acid residue is C; (j) at position 38 the amino acid residue is D; (j) at position 53 the amino acid residue is V; (k) at position 58 the amino acid residue is R; (1) at position 61 the amino acid residue is R; (m) at position 62 the amino acid residue is L; (n) at position 64 the amino acid residue is I or F; (o) at position 65 the amino acid residue is P; (p) at position 72 the amino acid residue is I; (q) at position 75 the amino acid residue is V; (r) at position 88 the amino acid residue is T; (s) at position 89 the amino acid residue is G; (t) at position 91 the amino acid residue is L; (u) at position 98 the amino acid residue is I; (v) at position 105 the amino acid residue I; (w) at position 112 the amino acid residue is A; (x) at position 124 the amino acid residue is G or C; (y) at position 128 the amino acid residue is D; (z) at position 140 the amino acid residue is M; (aa) at position 143 the amino acid residue is R; and (ab) at position 144 the amino acid residue is W. Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence such that when the sequence is optimally aligned with SEQ LD NO: 300, 445, or 457, at least 80% of the amino acid residues in the amino acid sequence conform to the amino acid residue restrictions specified in (a) through (ab) above. Some preferred isolated or recombinant polynucleotides of the invention comprise a nucleotide sequence which encodes an amino acid sequence selected from the group consisting of: (a) an amino acid sequence that is at least 96%o identical to SEQ ED NO:919 (such as, for example, a nucleotide sequence which encodes SEQ ED NO:917, 919, 921, 923, 925, 927, 833, 835, 839, 843, 845, 859, 863, 873, 877, 891, 895, 901, 905, 907, 913, 915, or 950); (b) an amino acid sequence that is at least 97% identical to SEQ ED NO:929 (such as, for example, a nucleotide sequence which encodes SEQ LD NO:929, 931, 835, 843, 849, or 867);
(c) an amino acid sequence that is at least 98% identical to SEQ ED NO:847 (such as, for example, a nucleotide sequence which encodes SEQ ED NO:845 or 847);
(d) an amino acid sequence that is at least 98%> identical to SEQ ED NO: 851; (e) an amino acid sequence that is at least 98% identical to SEQ ED NO:853; (f) an amino acid sequence that is at least 98% identical to SEQ ED NO: 855 (such as, for example, a nucleotide sequence which encodes SEQ ED NO:835 or 855); (g) an amino acid sequence that is at least 98% identical to SEQ ED NO: 857; (h) an amino acid sequence that is at least 98 > identical to SEQ LD NO: 861 (such as, for example, a nucleotide sequence which encodes SEQ ED NO:839, 861, or 883); (i) an amino acid sequence that is at least 98% identical to SEQ ED NO:871; (j) an amino acid sequence that is at least 98% identical to SEQ ED NO:875; (k) an amino acid sequence that is at least 98% identical to SEQ LD NO:881; (1) an amino acid sequence that is at least 98% identical to SEQ ED NO:885 (such as, for example, a nucleotide sequence which encodes SEQ ID NO:845 or 885); (m) an amino acid sequence that is at least 98% identical to SEQ ED NO:887; (n) an amino acid sequence that is at least 98% identical to SEQ ED NO:889 (such as, for example, a nucleotide sequence which encodes SEQ ID NO: 863, 889, 891, or 903); (o) an amino acid sequence that is at least 98%> identical to SEQ ED NO:893; (p) an amino acid sequence that is at least 98% identical to SEQ ED NO:897; (q) an amino acid sequence that is at least 98%> identical to SEQ ED NO:899; (r) an amino acid sequence that is at least 98%> identical to SEQ ED NO:909 (such as, for example, a nucleotide sequence which encodes SEQ ED NO: 883 or 909); (s) an amino acid sequence that is at least 98%> identical to SEQ LD NO:911; (t) an amino acid sequence that is at least 99% identical to SEQ ED NO:837; (u) an amino acid sequence that is at least 99% identical to SEQ ED NO: 841; (v) an amino acid sequence that is at least 99%> identical to SEQ ED NO:865; (w) an amino acid sequence that is at least 99% identical to SEQ ID NO:869; and (x) an amino acid sequence that is at least 99% identical to SEQ LD NO:879. Some preferred isolated or recombinant polynucleotides of the invention are selected from the group consisting of: (a) a nucleotide sequence encoding an amino acid sequence that is at least 96% identical to SEQ ID NO:919 (for example, a nucleotide sequence such as SEQ ED NO:916, 918, 920, 922, 924, 926, 832, 834, 838, 842, 844, 858, 862, 872, 876, 890, 894, 900, 904, 906, 912, 914, 939, 940, 941, 942, 943, 944, 949, 951 or 952); (b) a nucleotide sequence encoding an amino acid sequence that is at least 97% identical to SEQ ED NO:929 (for example, a nucleotide sequence such as SEQ ED NO:928, 930, 834, 842, 848, 866, 936 or 937); (c) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO: 847 (for example, a nucleotide sequence such as SEQ ED NO: 844 or 846); (d) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:851 (for example, a nucleotide sequence such as SEQ ED NO:850); (e) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:853 (for example, a nucleotide sequence such as SEQ ID NO:852); (f) a nucleotide sequence encoding an amino acid sequence that is at least 98%> identical to SEQ ED NO:855 (for example, a nucleotide sequence such as SEQ LD NO:834 or 854); (g) a nucleotide sequence encoding an amino acid sequence that is at least 98%> identical to SEQ ED NO:857 (for example, a nucleotide sequence such as SEQ ED NO: 856); (h) a nucleotide sequence encoding an amino acid sequence that is at least 98%o identical to SEQ ED NO:861 (for example, a nucleotide sequence such as SEQ ID NO:838, 860, or 882); (i) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ LD NO:871 (for example, a nucleotide sequence such as SEQ LD NO:870); (j) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:875 (for example, a nucleotide sequence such as SEQ ED NO:874); (k) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:881 (for example, a nucleotide sequence such as SEQ LD NO:880); (1) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:885 (for example, a nucleotide sequence such as SEQ ED NO:844 or 884); (m) a nucleotide sequence encoding an amino acid sequence that is at least 98%o identical to SEQ ED NO: 887 (for example, a nucleotide sequence such as SEQ ED NO:886); (n) a nucleotide sequence encoding an amino acid sequence that is at least 98%) identical to SEQ ED NO:889 (for example, a nucleotide sequence such as SEQ LD NO: 862, 888, 890, or 902); (o) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:893 (for example, a nucleotide sequence such as SEQ ID NO: 892); (p) a nucleotide sequence encoding an amino acid sequence that is at least 98%> identical to SEQ ED NO:897 (for example, a nucleotide sequence such as SEQ ED NO:896); (q) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:899 (for example, a nucleotide sequence such as SEQ ID NO:898); (r) a nucleotide sequence encoding an amino acid sequence that is at least 98%> identical to SEQ ED NO:909 (for example, a nucleotide sequence such as SEQ ID NO:882 or 908); (s) a nucleotide sequence encoding an amino acid sequence that is at least 98%o identical to SEQ ED NO:911 (for example, a nucleotide sequence such as SEQ ED NO:910); (t) a nucleotide sequence encoding an amino acid sequence that is at least 99% identical to SEQ ED NO:837 (for example, a nucleotide sequence such as SEQ ED NO:836); (u) a nucleotide sequence encoding an amino acid sequence that is at least 99% identical to SEQ ED NO: 841 (for example, a nucleotide sequence such as SEQ LD NO: 840); (v) a nucleotide sequence encoding an amino acid sequence that is at least 99%> identical to SEQ ED NO: 865 (for example, a nucleotide sequence such as SEQ ED NO: 864); (w) a nucleotide sequence encoding an amino acid sequence that is at least 99%> identical to SEQ ED NO:869 (for example, a nucleotide sequence such as SEQ ED NO:868); and (x) a nucleotide sequence encoding an amino acid sequence that is at least 99%> identical to SEQ ED NO:879 (for example, a nucleotide sequence such as SEQ ED NO:878). Some preferred isolated or recombinant polynucleotides of the invention comprise a nucleotide sequence encoding an amino acid sequence that is at least 95% identical to SEQ ED NO:929 and which comprises a Gly or an Asn residue at the amino acid position corresponding to position 33 of SEQ ED NO:929 (such as, for example, a nucleotide sequence which encodes SEQ ED NO:837, 849, 893, 897, 905, 921, 927, 929 or 931). Some preferred isolated or recombinant polynucleotides of the invention comprise a nucleotide sequence encoding an amino acid sequence that is at least 95% identical to SEQ ED NO:929 and which comprises a Gly or an Asn residue at the amino acid position corresponding to position 33 of SEQ ED NO:929 (for example, a nucleotide sequence such as SEQ LD NO:836, 848, 892, 896, 904, 920, 926, 928, 930, 938). Some preferred isolated or recombinant polynucleotides of the invention encode an amino acid sequence which further comprises one or more amino acid residues meeting the following criteria: (a) at position 41 the amino acid residue is H; (b) at position 138 the amino acid residue is H; (c) at position 34 the amino acid residue is N; and (d) at position 55 the amino acid residue is S. While description of the polypeptides of the invention is sometimes expressed herein as a list of possible restrictions on what amino acid residues are found at particular positions, in some embodiments, a polypeptide of the invention meets all of a particular set of possible restrictions. That is, in some instances herein, a list of possible restrictions is expressed as a list of options joined by the conjunction "and/or," and in some embodiments, each such conjunction operates as an "and" rather than an "or." hi some embodiments, possible restrictions which are expressed as alternate possibilities are all found in the polypeptide of the invention; this is only true where the alternate possibilities are not mutually exclusive.
Sequence Variations It will be appreciated by those skilled in the art that due to the degeneracy of the genetic code, a multitude of nucleotide sequences encoding GAT polypeptides of the invention may be produced, some of which bear substantial identity to the nucleic acid sequences explicitly disclosed herein. Table 1 Codon Table Amino acids Codon Alanine Ala A GCA GCC GCG GCU Cysteine Cys C UGC UGU Aspartic acid Asp D GAC GAU Glutamic acid Glu E GAA GAG Phenylalanine Phe F UUC uuu Glycine Gly G GGA GGC GGG GGU Histidine His H CAC CAU ' Isoleucine He I AUA AUC AUU Lysine Lys K AAA AAG Amino acids Codon Leucine Leu L UUA UUG CUA CUC CUG CUU Methionine Met M AUG Asparagine Asn N AAC AAU Proline Pro P CCA CCC CCG CCU Glutamine Gin Q CAA CAG Arginine Arg R AGA AGG CGA CGC CGG CGU Serine Ser S AGC AGU UCA UCC UCG UCU Threonine Thr T ACA ACC ACG ACU Valine Val V GUA GUC GUG GUU Tryptophan Trp w UGG Tyrosine Tyr Y UAC UAU
For instance, inspection of the codon table (Table 1) shows that codons AGA, AGG, CGA, CGC, CGG, and CGU all encode the amino acid arginine. Thus, at every position in the nucleic acids of the invention where an arginine is specified by a codon, the codon can be altered to any of the corresponding codons described above without altering the encoded polypeptide. It is understood that U in an RNA sequence corresponds to T in a DNA sequence. Using as an example the nucleic acid sequence corresponding to nucleotides 1-15 of SEQ ID NO:l (ATG ATT GAA GTC AAA (SEQ ED NO : 826)), a silent variation of this sequence includes AGT ATC GAG GTG AAG (SEQ LD NO: 827); both sequences encode the amino acid sequence MIEVK (SEQ ED NO: 828), which corresponds to amino acids 1-5 of SEQ ED NO:6. Such "silent variations" are one species of "conservatively modified variations," as discussed below. One of skill will recognize that each codon in a nucleic acid (except AUG, which is ordinarily the only codon for methionine) can be modified by standard techniques to encode a functionally identical polypeptide. Accordingly, each silent variation of a nucleic acid which encodes a polypeptide is implicit in any described sequence. The invention provides each and every possible variation of nucleic acid sequence encoding a polypeptide of the invention that could be made by selecting combinations based on possible codon choices. These combinations are made in accordance with the standard triplet genetic code (e.g., as set forth in Table 1) as applied to the nucleic acid sequence encoding a GAT homologue polypeptide of the invention. All such variations of every nucleic acid herein are specifically provided and described by consideration of the sequence in combination with the genetic code. Any variant can be produced as noted herein. A group of two or more different codons that, when translated in the same context, all encode the same amino acid, are referred to herein as "synonymous codons." As described herein, in some aspects of the invention a GAT polynucleotide is engineered for optimized codon usage in a desired host organism, for example a plant host. The term "optimized" or "optimal" are not meant to be restricted to the very best possible combination of codons, but simply indicates that the coding sequence as a whole possesses an improved usage of codons relative to a precursor polynucleotide from which it was derived. Thus, in one aspect the invention provides a method for producing a GAT polynucleotide variant by replacing at least one parental codon in a nucleotide sequence with a synonymous codon that is preferentially used in a desired host organism, e.g., a plant, relative to the parental codon. "Conservatively modified variations" or, simply, "conservative variations" of a particular nucleic acid sequence refers to those nucleic acids which encode identical or essentially identical amino acid sequences, or, where the nucleic acid does not encode an amino acid sequence, to essentially identical sequences. One of skill will recognize that individual substitutions, deletions or additions which alter, add or delete a single amino acid or a small percentage of amino acids (typically less than 5%o, more typically less than 4%, 2% or 1%>, or less) in an encoded sequence are "conservatively modified variations" where the alterations result in the deletion of an amino acid, addition of an amino acid, or substitution of an amino acid with a chemically similar amino acid. Conservative substitution tables providing functionally similar amino acids are well known in the art. Table 2 sets forth six groups which contain amino acids that are "conservative substitutions" for one another. Table 2 Conservative Substitution Groups 1 Alanine (A) Serine (S) Threonine (T) 2 Aspartic acid (D) Glutamic acid (E) 3 Asparagine (N) Glutamine (Q) 4 Arginine (R) Lysine (EC) 5 Isoleucine (I) Leucine (L) Methionine (M) Valine (V) 6 Phenylalanine (F) Tyrosine (Y) Tryptophan (W) Thus, "conservatively substituted variations" of a listed polypeptide sequence of the present invention include substitutions of a small percentage, typically less than 5%, more typically less than 2%> and often less than 1%, of the amino acids of the polypeptide sequence, with a conservatively selected amino acid of the same conservative substitution group. Thus, a conservatively substituted variation of a polypeptide of the invention can contain 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 substitutions with a conservatively substituted variation of the same conservative substitution group. For example, a conservatively substituted variation of the polypeptide identified herein as SEQ ED NO:6 will contain "conservative substitutions" according to the six groups defined above, in up to 7 residues (i.e., 5% of the amino acids) in the 146 amino acid polypeptide. In a further example, if four conservative substitutions were localized in the region corresponding to amino acids 21 to 30 of SEQ ED NO:6, examples of conservatively substituted variations of this region, RPN QPL EAC M (SEQ ED NO:829), include: KPQ QPV ESC M (SEQ ED NO:830) and KPN NPL DAC V (SEQ ED NO:831) and the like, in accordance with the conservative substitutions listed in Table 2 (in the above example, conservative substitutions are underlined). The listing of a protein sequence herein, in conjunction with the above substitution table, provides an express listing of all conservatively substituted proteins. Finally, the addition of sequences which do not alter the encoded activity of a nucleic acid molecule, such as the addition of a non-functional or non-coding sequence, is a conservative variation of the basic nucleic acid. One of skill will appreciate that many conservative variations of the nucleic acid constructs which are disclosed yield a functionally identical construct. For example, as discussed above, owing to the degeneracy of the genetic code, "silent substitutions" (i.e., substitutions in a nucleic acid sequence which do not result in an alteration in an encoded polypeptide) are an implied feature of every nucleic acid sequence which encodes an amino acid. Similarly, "conservative amino acid substitutions," in one or a few amino acids in an amino acid sequence are substituted with different amino acids with highly similar properties, are also readily identified as being highly similar to a disclosed construct. Such conservative variations of each disclosed sequence are a feature of the present invention. Non-conservative modifications of a particular nucleic acid are those which substitute any amino acid not characterized as a conservative substitution. For example, any substitution which crosses the bounds of the six groups set forth in Table 2. These include substitutions of basic or acidic amino acids for neutral amino acids, (e.g., Asp, Glu, Asn, or Gin for Val, He, Leu or Met), aromatic amino acid for basic or acidic amino acids (e.g. , Phe, Tyr or Trp for Asp, Asn, Glu or Gin) or any other substitution not replacing an amino acid with a like amino acid. Nucleic Acid Hybridization Nucleic acids "hybridize" when they associate, typically in solution.
Nucleic acids hybridize due to a variety of well-characterized physico-chemical forces, such as hydrogen bonding, solvent exclusion, base stacking and the like.
An extensive guide to the hybridization of nucleic acids is found in Tijssen (1993)
Laboratory Techniques in Biochemistry and Molecular Biology— Hybridization with Nucleic Acid Probes, Part I, Chapter 2, "Overview of principles of hybridization and the strategy of nucleic acid probe assays," (Elsevier, New York ("Tijssen")), as well as in Ausubel, supra, Hames and Higgins (1995) Gene Probes
1, fRL Press at Oxford University Press, Oxford, England ("Hames and Higgins
1") and Hames and Higgins (1995) Gene Probes 2, JJ L Press at Oxford University Press, Oxford, England ("Hames and Higgins 2") and provide details on the synthesis, labeling, detection and quantification of DNA and RNA, including oligonucleotides. "Stringent hybridization wash conditions" in the context of nucleic acid hybridization experiments, such as Southern and northern hybridizations, are sequence dependent, and are different under different environmental parameters. An extensive guide to the hybridization of nucleic acids is found in Tijssen (1993), supra, and in Hames and Higgins 1 and Hames and Higgins 2, supra. For purposes of the present invention, generally, "highly stringent" hybridization and wash conditions are selected to be about 5°C or less lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH (as noted below, highly stringent conditions can also be referred to in comparative terms). The Tm is the temperature (under defined ionic strength and pH) at which 50%> of the test sequence hybridizes to a perfectly matched probe. Very stringent conditions are selected to be equal to the Tm for a particular probe. The Tm of a nucleic acid duplex indicates the temperature at which the duplex is 50%> denatured under the given conditions and its represents a direct measure of the stability of the nucleic acid hybrid. Thus, the Tm corresponds to the temperature corresponding to the midpoint in transition from helix to random coil and it depends on length, nucleotide composition, and ionic strength for long stretches of nucleotides. After hybridization, unhybridized nucleic acid material can be removed by a series of washes, the stringency of which can be adjusted depending upon the desired results. Low stringency washing conditions (e.g., using higher salt and lower temperature) increase sensitivity, but can produce nonspecific hybridization signals and high background signals. Higher stringency conditions (e.g., using lower salt and higher temperature that is closer to the hybridization temperature) lowers the background signal, typically with only the specific signal remaining. See Rapley, R. and Walker, J.M. eds., Molecular Biomethods Handbook (Humana Press, hie. 1998) (hereinafter "Rapley and Walker"), which is incorporated herein by reference in its entirety for all purposes. The Tmof a DNA-DNA duplex can be estimated using Equation 1 as follows: Tm (°C) = 81.5°C + 16.6 (logioM) + 0.41 (%G + C) - 0.72 (%f) - 500/n, where M is the molarity of the monovalent cations (usually Na+),
(%G + C) is the percentage of guanosine (G) and cytosine (C) nucleotides, (%f) is the percentage of formalize and n is the number of nucleotide bases (i.e., length) of the hybrid. See Rapley and Walker, supra. The Tm of an RNA-DNA duplex can be estimated by using Equation 2 as follows: Tm (°C) = 79.8°C + 18.5 (log10M) + 0.58 (%G + C) - 11.8(%G + C)2 - 0.56 (%f) - 820/n, where M is the molarity of the monovalent cations (usually Na+), (%G + C) is the percentage of guanosine (G ) and cytosine (C) nucleotides, (%f) is the percentage of formamide and n is the number of nucleotide bases (i.e., length) of the hybrid. Id. Equations 1 and 2 are typically accurate only for hybrid duplexes longer than about 100-200 nucleotides. Id. The Tm of nucleic acid sequences shorter than 50 nucleotides can be calculated as follows: Tm (°C) = 4(G + C) + 2(A + T), where A (adenine), C, T (thymine), and G are the numbers of the corresponding nucleotides. An example of stringent hybridization conditions for hybridization of complementary nucleic acids which have more than 100 complementary residues on a filter in a Southern or northern blot is 50% formalin with 1 mg of heparin at 42°C, with the hybridization being carried out overnight. An example of stringent wash conditions is a 0.2x SSC wash at 65°C for 15 minutes (see Sambrook, supra for a description of SSC buffer). Often the high stringency wash is preceded by a low stringency wash to remove background probe signal. An example low stringency wash is 2x SSC at 40°C for 15 minutes. h general, a signal to noise ratio of 2.5x-5x (or higher) than that observed for an unrelated probe in the particular hybridization assay indicates detection of a specific hybridization. Detection of at least stringent hybridization between two sequences in the context of the present invention indicates relatively strong structural similarity or homology to, e.g., the nucleic acids of the present invention provided in the sequence listings herein. As noted, "highly stringent" conditions are selected to be about 5° C or less lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. Target sequences that are closely related or identical to the nucleotide sequence of interest (e.g., "probes") can be identified under highly stringent conditions. Lower stringency conditions are appropriate for sequences that are less complementary. See, e.g., Rapley and Walker, supra. Comparative hybridization can be used to identify nucleic acids of the invention, and this comparative hybridization method is a preferred method of distinguishing nucleic acids of the invention. Detection of highly stringent hybridization between two nucleotide sequences in the context of the present invention indicates relatively strong structural similarity/homology to, e.g., the nucleic acids provided in the sequence listing herein. Highly stringent hybridization between two nucleotide sequences demonstrates a degree of similarity or homology of structure, nucleotide base composition, arrangement or order that is greater than that detected by stringent hybridization conditions. In particular, detection of highly stringent hybridization in the context of the present invention indicates strong structural similarity or structural homology (e.g., nucleotide structure, base composition, arrangement or order) to, e.g., the nucleic acids provided in the sequence listings herein. For example, it is desirable to identify test nucleic acids that hybridize to the exemplar nucleic acids herein under stringent conditions. Thus, one measure of stringent hybridization is the ability to hybridize to one of the listed nucleic acids (e.g., nucleic acid sequences SEQ ED NO: 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565, 566, 567, 620, 622, 624, 626, 628, 630, 632, 634, 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, 668, 670, 672, 674, 676, 678, 680, 682, 684, 686, 688, 690, 692, 694, 696, 698, 700, 702, 704, 706, 708, 710, 712, 714, 716, 718, 720, 722, 724, 726, 728, 730, 732, 734, 736, 738, 740, 742, 744, 746, 748, 750, 752, 754, 756, 758, 760, 762, 764, 768, 770, 772, 774, 776, 778, 780, 782, 784, 786, 788, 790, 792, 794, 796, 798, 800, 802, 804, 806, 808, 810, 812, 814, 816, 818, 820, 822, 824, 832, 834, 836, 838, 840, 842, 844, 846, 848, 850, 852, 854, 856, 858, 860, 862, 864, 866, 868, 870, 872, 874, 876, 878, 880, 882, 884, 886, 888, 890, 892, 894, 896, 898, 900, 902, 904, 906, 908, 910, 912, 914, 916, 918, 920, 922, 924, 926, 928, 930, 932, 933, 934, 935, 936, 937, 938, 939, 940, 941, 942, 943, 944, 945, 947, 949, 951, and 952, and complementary polynucleotide sequences thereof), under highly stringent conditions (or very stringent conditions, or ultra-high stringency hybridization conditions, or ultra-ultra high stringency hybridization conditions). Stringent hybridization (as well as highly stringent, ultra-high stringency, or ultra-ultra high stringency hybridization conditions) and wash conditions can easily be determined empirically for any test nucleic acid. For example, in determining highly stringent hybridization and wash conditions, the hybridization and wash conditions are gradually increased (e.g., by increasing temperature, decreasing salt concentration, increasing detergent concentration and/or increasing the concentration of organic solvents, such as formalin, in the hybridization or wash), until a selected set of criteria are met. For example, the hybridization and wash conditions are gradually increased until a probe comprising one or more nucleic acid sequences selected from SEQ ID NO: 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565, 566, 567, 620, 622, 624, 626, 628, 630, 632, 634, 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, 668, 670, 672, 674, 676, 678, 680, 682, 684, 686, 688, 690, 692, 694, 696, 698, 700, 702, 704, 706, 708, 710, 712, 714, 716, 718, 720, 722, 724, 726, 728, 730, 732, 734, 736, 738, 740, 742, 744, 746, 748, 750, 752, 754, 756, 758, 760, 762, 764, 768, 770, 772, 774, 776, 778, 780, 782, 784, 786, 788, 790792, 794, 796, 798
800, 802, 804, 806 808 810, 812, 814, 816, 818, 820 822 824, 832, 834, 836
838, 840, 842, 844 846, 848, 850, 852, 854, 856, 858, 860862, 864, 866, 868 870, 872, 874, 876: 878, 880, 882, 884, 886, 888, 890, 892894, 896, 898, 900 902, 904, 906, 908 910, 912, 914, 916, 918, 920, 922, 924926, 928, 930, 932 933, 934, 935, 936 937, 938, 939, 940, 941, 942, 943, 944945, 947, 949, 951 and 952, and complementary polynucleotide sequences thereof, binds to a perfectly matched complementary target (again, a nucleic acid comprising one or more nucleic acid sequences selected from SEQ ED NO: 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537,
538, 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565, 566, 567, 620, 622, 624, 626, 628, 630, 632, 634, 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, 668, 670, 672, 674, 676, 678, 680, 682, 684, 686, 688, 690, 692, 694, 696, 698, 700, 702, 704, 706, 708, 710, 712, 714, 716, 718, 720, 722, 724, 726, 728, 730, 732, 734, 736, 738, 740, 742, 744, 746, 748, 750, 752, 754, 756, 758, 760, 762, 764, 768, 770, 772, 774, 776, 778, 780, 782, 784, 786, 788, 790, 792, 794, 796, 798, 800, 802, 804, 806, 808, 810, 812, 814, 816, 818, 820, 822, 824, 832, 834, 836, 838, 840, 842, 844, 846, 848, 850, 852, 854, 856, 858, 860, 862, 864, 866, 868, 870, 872, 874, 876, 878, 880, 882, 884, 886, 888, 890, 892, 894, 896, 898, 900, 902, 904, 906, 908, 910, 912, 914, 916, 918, 920, 922, 924, 926, 928, 930, 932, 933, 934, 935, 936, 937, 938, 939, 940, 941, 942, 943, 944, 945, 947, 949, 951, and 952, and complementary polynucleotide sequences thereof), with a signal to noise ratio that is at least about 2.5x, and optionally about 5x or more as high as that observed for hybridization of the probe to an unmatched target. In this case, the unmatched target is a nucleic acid corresponding to a nucleic acid (other than those in the accompanying sequence listing) that is present in a public database such as GenBank™ at the time of filing of the subject application. Such sequences can be identified in GenBank by one of skill. Examples include Accession Nos. Z99109 and Y09476. Additional such sequences can be identified in e.g., GenBank, by one of ordinary skill in the art. A test nucleic acid is said to specifically hybridize to a probe nucleic acid when it hybridizes at least Vi as well to the probe as to the perfectly matched complementary target, i.e., with a signal to noise ratio at least lA as high as hybridization of the probe to the target under conditions in which the perfectly matched probe binds to the perfectly matched complementary target with a signal to noise ratio that is at least about 2x-10x, and occasionally 20x, 5 Ox or greater than that observed for hybridization to any of the unmatched polynucleotides of Accession Nos. Z99109 and Y09476. Ultra high-stringency hybridization and wash conditions are those in wliich the stringency of hybridization and wash conditions are increased until the signal to noise ratio for binding of the probe to the perfectly matched complementary target nucleic acid is at least lOx as high as that observed for hybridization to any of the unmatched target nucleic acids of Genbank Accession numbers Z99109 and Y09476. A target nucleic acid which hybridizes to a probe under such conditions, with a signal to noise ratio of at least Vi that of the perfectly matched complementary target nucleic acid is said to bind to the probe under ultra- high stringency conditions. Similarly, even higher levels of stringency can be determined by gradually increasing the hybridization and/or wash conditions of the relevant hybridization assay. For example, those in which the stringency of hybridization and wash conditions are increased until the signal to noise ratio for binding of the probe to the perfectly matched complementary target nucleic acid is at least lOx, 20x, 5 Ox, lOOx, or 500x or more as high as that observed for hybridization to any of the unmatched target nucleic acids of Genbank Accession numbers Z99109 and Y09476. A target nucleic acid which hybridizes to a probe under such conditions, with a signal to noise ratio of at least Vi that of the perfectly matched complementary target nucleic acid is said to bind to the probe under ultra-ultra- high stringency conditions. Target nucleic acids which hybridize to the nucleic acids represented by SEQ ED NO: 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558, 05/012515
559, 560, 561, 562, 563, 564, 565, 566, 567, 620, 622 628, 630, 632
634, 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 65 666, 668, 670, 672, 674, 676, 678, 680, 682, 684, 686, 68 698, 700, 702, 704, 706, 708, 710, 712, 714, 716, 718, 72 730, 732, 734, 736, 738, 740, 742, 744, 746, 748, 750, 75 762, 764, 768, 770, 772, 774, 776, 778, 780, 782, 784, 78 796, 798, 800, 802, 804, 806, 808, 810, 812, 814, 816, 81 834, 836, 838, 840, 842, 844, 846, 848, 850, 852, 854, 85 866, 868, 870, 872, 874, 876, 878, 880, 882, 884, 886, 88 898, 900, 902, 904, 906, 908, 910, 912, 914, 916, 918, 92 930, 932, 933, 934, 935, 936, 937, 938, 939, 940, 941, 94 949, 951, and 952 under high, ultra-high and ultra-ultra high stringency conditions are a feature of the invention. Examples of such nucleic acids include those with one or a few silent or conservative nucleic acid substitutions as compared to a given nucleic acid sequence. Nucleic acids which do not hybridize to each other under stringent conditions are still substantially identical if the polypeptides which they encode are substantially identical. This occurs, e.g., when a copy of a nucleic acid is created using the maximum codon degeneracy permitted by the genetic code, or when antisera or antiserum generated against one or more of SEQ ID NO: 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 621, 623, 625, 627, 629, 631, 633, 635, 637, 639, 641, 643, 645, 647, 649, 651, 653, 655, 657, 659, 661, 663, 665, 667, 669, 671, 673, 675, 677, 679, 681, 683, 685, 687, 689, 691, 693, 695, 697, 699,701, 703, 705, 707, 709, 711, 713, 715, 717, 719,721, 723, 725, 727, 729, 731, 733, 735, 737, 739, 741, 743, 745, 747, 749, 751, 753, 755, 757, 759, 761, 763, 765, 767, 769, 771, 773, 775, 777, 779, 781, 783, 785, 787, 789, 791, 793, 795, 797, 799, 801, 803, 805, 807, 809, 811, 813, 815, 817, 819, 821, 823, 825, 833, 835, 837, 839, 841, 843, 845, 847, 849, 851, 853, 855, 857, 859, 861, 863, 865, 867, 869, 871, 873, 875, 877, 879, 881, 883, 885, 887, 889, 891, 893, 895, 897, 899, 901, 903, 905, 907, 909, 911, 05/012515
913, 915, 917, 919, 921, 923, 925, 927, 929, 931, 946, 948, 950, 953, 954, 955, 956, 957, 958, 959, 960, 961, 962, 963, 964, 965, 966, 967, 968, 969, 970, 971, and 972, which has been subtracted using the polypeptides encoded by known nucleotide sequences, including those of Genbank Accession number CAA70664. Further details on immunological identification of polypeptides of the invention are found below. Additionally, for distinguishing between duplexes with sequences of less than about 100 nucleotides, a TMACl hybridization procedure known to those of ordinary skill in the art can be used. See, e.g., Sorg, U. et al. Nucleic Acids Res. (Sept. 11, 1991) 19(17), incorporated herein by reference in its entirety for all purposes. In one aspect, the invention provides a nucleic acid which comprises a unique subsequence in a nucleic acid selected from SEQ LD NO: 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532,
533, 534, 535, 536, 537, 538 539 540, 541, 542, 543, 544, 545, 546, 547, 548 549, 550, 551, 552, 553, 554 555556, 557, 558, 559, 560, 561, 562, 563, 564 565, 566, 567, 620, 622, 624626628, 630, 632, 634, 636, 638, 640, 642, 644 646, 648, 650, 652, 654, 656658 660, 662, 664, 666, 668, 670, 672, 674, 676 678, 680, 682, 684, 686, 688690: 692, 694, 696, 698, 700, 702, 704, 706, 708 710,712,714,716,718,720722724, 726, 728, 730, 732, 734, 736, 738, 740 742, 744, 746, 748, 750, 752754, 756, 758, 760, 762, 764, 768, 770, 772, 774 776, 778, 780, 782, 784, 786; 788 790, 792, 794, 796, 798, 800, 802, 804, 806 808,810,812,814,816,818 820.822, 824, 832, 834, 836, 838, 840, 842, 844 846, 848, 850, 852, 854, 856; 858 860, 862, 864, 866, 868, 870, 872, 874, 876 878, 880, 882, 884, 886, 888 890: 892, 894, 896, 898, 900, 902, 904, 906, 908 910,912,914,916,918,920922924, 926, 928, 930, 932, 933, 934, 935, 936 937, 938, 939, 940, 941, 942 943, 944, 945, 947, 949, 951, and 952. The unique subsequence is unique as compared to a nucleic acid corresponding to any of Genbank Accession numbers Z99109 and Y09476. Such unique subsequences can be determined by aligning any of SEQ ID NO: 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565, 566, 567, 620, 622, 624, 626, 628, 630, 632, 634, 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, 668, 670, 672, 674, 676, 678, 680, 682, 684, 686, 688, 690, 692, 694, 696, 698, 700, 702, 704, 706, 708, 710, 712, 714, 716, 718, 720, 722, 724, 726, 728, 730, 732, 734, 736, 738, 740, 742, 744, 746, 748, 750, 752, 754, 756, 758, 760, 762, 764, 768, 770, 772, 774, 776, 778, 780, 782, 784, 786, 788, 790, 792, 794, 796, 798, 800, 802, 804, 806, 808, 810, 812, 814, 816, 818, 820, 822, 824, 832, 834, 836, 838, 840, 842, 844, 846, 848, 850, 852, 854, 856, 858, 860, 862, 864, 866, 868, 870, 872, 874, 876, 878, 880, 882, 884, 886, 888, 890, 892, 894, 896, 898, 900, 902, 904, 906, 908, 910, 912, 914, 916, 918, 920, 922, 924, 926, 928, 930, 932, 933, 934, 935, 936, 937, 938, 939, 940, 941, 942, 943, 944, 945, 947, 949, 951, and 952 against the complete set of nucleic acids represented by GenBank accession numbers Z99109 and Y09476 or other related sequences available in public databases as of the filing date of the subject application. Alignment can be perfonned using the BLAST algorithm set to default parameters. Any unique subsequence is useful, e.g., as a probe to identify the nucleic acids of the invention. Similarly, the invention includes a polypeptide which comprises a unique subsequence in a polypeptide selected from: SEQ ID NO: 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 621, 623, 625, 627, 629, 631, 633, 635, 637, 639, 641, 643, 645, 647, 649, 651, 653, 655, 657, 659, 661, 663, 665, 667, 669, 671, 673, 675, 677, 679, 681, 683, 685, 687, 689, 691, 693, 695, 697, 699,701, 703, 705, 707, 709, 711, 713, 715, 717, 719,721, 723, 725, 727, 729, 731, 733, 735, 737, 739, 741, 743, 745, 747, 749, 751, 753, 755, 757, 759, 761, 763, 765, 767, 769, 771, 773, 775, 777, 779, 781, 783, 785, 787, 789, 791, 793, 795, 797, 799, 801, 803, 805, 807, 809, 811, 813, 815, 817, 819, 821, 823, 825, 833, 835, 837, 839, 841, 843, 845, 847, 849, 851, 853, 855, 857, 859, 861, 863, 865, 867, 869, 871, 873, 875, 877, 879, 881, 883, 885, 887, 889, 891, 893, 895, 897, 899, 901, 903, 905, 907, 909, 911, 913, 915, 917, 919, 921, 923, 925, 927, 929, 931, 946, 948, 950, 953, 954, 955, 956, 957, 958, 959, 960, 961, 962, 963, 964, 965, 966, 967, 968, 969, 970, 971, and 972. Here, the unique subsequence is unique as compared to a polypeptide corresponding to that of GenBank accession number CAA70664. Here again, the polypeptide is aligned against the sequences represented by accession number CAA70664. Note that if the sequence corresponds to a non-translated sequence such as a pseudo gene, the corresponding polypeptide is generated simply by in silico translation of the nucleic acid sequence into an amino acid sequence, where the reading frame is selected to correspond to the reading frame of homologous GAT polynucleotides. The invention also provides for target nucleic acids which hybridize under stringent conditions to a unique coding ohgonucleotide which encodes a unique subsequence in a polypeptide selected from SEQ ED NO: 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 621, 623, 625, 627, 629, 631, 633, 635, 637, 639, 641, 643, 645, 647, 649, 651, 653, 655, 657, 659, 661, 663, 665, 667, 669, 671, 673, 675, 677, 679, 681, 683, 685, 687, 689, 691, 693, 695, 697, 699,701, 703, 705, 707, 709, 711, 713, 715, 717, 719,721, 723, 725, 727, 729, 731, 733, 735, 737, 739, 741, 743, 745, 747, 749, 751, 753, 755, 757, 759, 761, 763, 765, 767, 769, 771, 773, 775, 777, 779, 781, 783, 785, 787, 789, 791, 793, 795, 797, 799, 801, 803, 805, 807, 809, 811, 813,
815,817,819 821 823, 825 833, 835, 837, 839, 841, 843, 845, 847, 849, 851,
853, 855, 857 859 861, 863 865, 867, 869, 871, 873, 875, 877, 879, 881, 883,
885, 887, 889 891 893, 895 897, 899, 901, 903, 905, 907, 909, 911, 913, 915,
917, 919, 921 923 925, 927929, 931, 946, 948, 950, 953, 954, 955, 956, 957,
958, 959, 960 961 962, 963 964, 965, 966, 967, 968, 969, 970, 971, and 972 wherein the unique subsequence is unique as compared to a polypeptide corresponding to any of the control polypeptides. Unique sequences are determined as noted above. h one example, the stringent conditions are selected such that a perfectly complementary ohgonucleotide to the coding ohgonucleotide hybridizes to the coding ohgonucleotide with at least about a 2.5x-10x higher, preferably at least about a 5-1 Ox higher signal to noise ratio than for hybridization of the perfectly complementary ohgonucleotide to a control nucleic acid corresponding to any of the control polypeptides. Conditions can be selected such that higher ratios of signal to noise are observed in the particular assay which is used, e.g., about 15x, 20x, 3 Ox, 50x or more. In this example, the target nucleic acid hybridizes to the unique coding ohgonucleotide with at least a 2x higher signal to noise ratio as compared to hybridization of the control nucleic acid to the coding ohgonucleotide. Again, higher signal to noise ratios can be selected, e.g., about 2.5x, 5x, lOx, 20x, 30x, 50x or more. The particular signal will depend on the label used in the relevant assay, e.g., a fluorescent label, a colorimetric label, a radioactive label, or the like. Vectors, Promoters and Expression Systems, The present invention also includes recombinant constructs comprising one or more of the nucleic acid sequences as broadly described above. The constructs comprise a vector, such as, a plasmid, a cosmid, a phage, a virus, a bacterial artificial chromosome (BAC), a yeast artificial chromosome (YAC), or the like, into which a nucleic acid sequence of the invention has been inserted, in a forward or reverse orientation, h a preferred aspect of this embodiment, the construct further comprises regulatory sequences, including, for example, a promoter, operably linked to the sequence. Large numbers of suitable vectors and promoters are known to those of skill in the art, and are commercially available. As previously discussed, general texts which describe molecular biological techniques useful herein, including the use of vectors, promoters and many other relevant topics, include Berger and Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology Volume 152, (Academic Press, Inc., San Diego, CA) ("Berger"); Sambrook et al, Molecular Cloning - A Laboratory Manual, 2d ed., Vol. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 1989 ("Sambrook") and Current Protocols in Molecular Biology, F.M. Ausubel et al, eds., Current Protocols, a joint venture between Greene Publishing Associates, Inc. and John Wiley & Sons, Inc., (supplemented through 1999) ("Ausubel"). Examples of protocols sufficient to direct persons of skill through in vitro amplification methods, including the polymerase chain reaction (PCR), the ligase chain reaction (LCR), Qβ-replicase amplification and other RNA polymerase mediated techniques (e.g., NASBA), e.g., for the production of the homologous nucleic acids of the invention are found in Berger, Sambrook, and Ausubel, as well as in Mullis et al. (1987) U.S. Patent No. 4,683,202; hinis et al, eds. (1990) PCR Protocols: A Guide to Methods and Applications (Academic Press Inc. San Diego, CA) ("Innis"); Arnheim & Levinson (October 1, 1990)
C&EN 36-47; The Journal Of NIH Research (1991) 3: 81-94; Kwoh et α/. (1989) Proc. Natl. Acad. Sci. USA 86: 1173; Guatelli et al. (1990) Proc. Nat 'I. Acad. Sci. USA 87: 1874; Lomell et al. (1989) J Clin. Chem 35: 1826; Landegren et al. (1988) Science 241: 1077-1080; Van Brunt (1990) Biotechnology 8: 291-294; Wu and Wallace (1989) Gene 4:560; Barringer et al. (1990) Gene 89: 117; and
Sooknanan and Malek (1995) Biotechnology 13: 563-564. Improved methods for cloning in vitro amplified nucleic acids are described in Wallace et al, U.S. Pat. No. 5,426,039. Improved methods for amplifying large nucleic acids by PCR are summarized in Cheng et al. (1994) Nature 369: 684-685 and the references cited therein, in which PCR amplicons of up to 40kb are generated. One of skill will appreciate that essentially any RNA can be converted into a double stranded DNA suitable for restriction digestion, PCR expansion and sequencing using reverse transcriptase and a polymerase. See, e.g., Ausubel, Sambrook and Berger, all supra. The present invention also relates to engineered host cells that are transduced (transformed or transfected) with a vector of the invention (e.g., an. invention cloning vector or an invention expression vector), as well as the production of polypeptides of the invention by recombinant techniques. The vector may be, for example, a plasmid, a viral particle, a phage, etc. The engineered host cells can be cultured in conventional nutrient media modified as appropriate for activating promoters, selecting transformants, or amplifying the GAT homologue gene. Culture conditions, such as temperature, pH and the like, are those previously used with the host cell selected for expression, and will be apparent to those skilled in the art and in the references cited herein, including, e.g., Sambrook, Ausubel and Berger, as well as e.g., Freshney (1994) Culture of Animal Cells: A Manual of Basic Technique, 3rd ed. (Wiley- Liss, New York) and the references cited therein. GAT polypeptides of the invention can be produced in non-animal cells such as plants, yeast, fungi, bacteria and the like. In addition to Sambrook, Berger and Ausubel, details regarding non-animal cell culture can be found in Payne et al. (1992) Plant Cell and Tissue Culture in Liquid Systems (John Wiley & Sons, hie. New York, NY); Gamborg and Phillips, eds. (1995) Plant Cell, Tissue and Organ Culture: Fundamental Methods! Springer Lab Manual (Springer- Verlag, Berlin); and Atlas and Parks, eds., The Handbook of Microbiological Media (1993) CRC Press, Boca Raton, FL. Polynucleotides of the present invention can be incorporated into any one of a variety of expression vectors suitable for expressing a polypeptide. Suitable vectors include chromosomal, nonchromosomal and synthetic DNA sequences, e.g., derivatives of SV40; bacterial plasmids; phage DNA; baculovirus; yeast plasmids; vectors derived from combinations of plasmids and phage DNA, viral DNA such as vaccinia, adenovirus, fowl pox virus, pseudorabies, adenovirus, adeno-associated viruses, retroviruses and many others. Any vector that transduces genetic material into a cell, and, if replication is desired, which is replicable and viable in the relevant host can be used. When incorporated into an expression vector, a polynucleotide of the invention is operatively linked to an appropriate transcription control sequence (promoter) to direct mRNA synthesis. Examples of such transcription control sequences particularly suited for use in transgenic plants include the cauliflower mosaic virus (CaMV), figwort mosaic virus (FMV) and strawberry vein banding virus (SVBV) promoters, described in U.S. Provisional Application No. 60/245,354. Other promoters known to control expression of genes in prokaryotic or eukaryotic cells or their viruses and which can be used in some embodiments of the invention include S V40 promoter, E. coli lac or trp promoter, and the phage lambda PL promoter. An expression vector optionally contains a ribosome binding site for translation initiation, and a transcription terminator, such as Pinll. The vector also optionally includes appropriate sequences for amplifying expression, e.g., an enhancer. hi addition, the expression vectors of the present invention optionally contain one or more selectable marker genes to provide a phenotypic trait for selection of transformed host cells. Usually, the selectable marker gene will encode antibiotic or herbicide resistance. Suitable genes include those coding for resistance to the antibiotic spectinomycin or streptomycin (e.g., the aada gene), the streptomycin phosphotransferase (SPT) gene coding for streptomycin resistance, the neomycin phosphotransferase (NPTII) gene encoding kanamycin or geneticin resistance, the hygromycin phosphotransferase (HPT) gene coding for hygromycin resistance. Additional selectable marker genes include dihydrofolate reductase or neomycin resistance for eukaryotic cell culture, and tetracycline or ampicillin resistance in E. coli. Suitable genes coding for resistance to herbicides include those which act to inhibit the action of acetolactate synthase (ALS), in particular the sulfonylurea-type herbicides (e.g., the acetolactate synthase (ALS) gene containing mutations leading to such resistance in particular the S4 and/or Hra mutations), those which act to inhibit the action of glutamine synthase, such as phosphinothricin or basta (e.g., the bar gene), or other such genes known in the art. The bar gene encodes resistance to the herbicide basta and the ALS gene encodes resistance to the herbicide chlorsulfuron. In some instances, the modified GAT genes are used as selectable markers. Vectors of the present invention can be employed to transform an appropriate host to permit the host to express an inventive protein or polypeptide. Examples of appropriate expression hosts include: bacterial cells, such as E. coli, B. subtilis, Streptomyces, and Salmonella typhimurium; fungal cells, such as Saccharomyces cerevisiae, Pichia pastoris, and Neurospora crassa; insect cells such as Drosophila and Spodoptera frugiperda; mammalian cells such as CHO, COS, BHK, HEK 293 or Bowes melanoma; or plant cells or explants, etc. It is understood that not all cells or cell lines need to be capable of producing fully functional GAT polypeptides; for example, antigenic fragments of a GAT polypeptide may be produced. The present invention is not limited by the host cells employed. In bacterial systems, a number of expression vectors may be selected depending upon the use intended for the GAT polypeptide. For example, when large quantities of GAT polypeptide or fragments thereof are needed for commercial production or for induction of antibodies, vectors which direct high level expression of fusion proteins that are readily purified can be desirable. Such vectors include, but are not limited to, multifunctional E. coli cloning and expression vectors such as BLUΕSCRΕPT (Stratagene), in which the GAT polypeptide coding sequence may be ligated into the vector in-frame with sequences for the amino-terminal Met and the subsequent 7 residues of beta- galactosidase so that a hybrid protein is produced; pΕN vectors (Van Heeke & Schuster (1989) J Biol. Chem. 264: 5503-5509); pΕT vectors (Novagen, Madison WI); and the like. Similarly, in the yeast Saccharomyces cerevisiae a number of vectors containing constitutive or inducible promoters such as alpha factor, alcohol oxidase and PGH may be used for production of the GAT polypeptides of the invention. For reviews, see Ausubel (supra) and Grant et al. (1987) Methods in Enzymology 153:516-544. In mammalian host cells, a variety of expression systems, including viral- based systems, may be utilized. In cases where an adenovirus is used as an expression vector, a coding sequence, e.g., of a GAT polypeptide, is optionally ligated into an adenovirus transcription translation complex consisting of the late promoter and tripartite leader sequence. Insertion of a GAT polypeptide coding region into a nonessential Εl or Ε3 region of the viral genome will result in a viable virus capable of expressing a GAT in infected host cells (Logan and Shenk (1984) Proc. Nat'l Acad. Sci. USA 81:3655-3659). In addition, transcription enhancers, such as the rous sarcoma virus (RSV) enhancer, may be used to increase expression in mammalian host cells. Similarly, in plant cells, expression can be driven from a transgene integrated into a plant chromosome, or cytoplasmically from an episomal or viral nucleic acid. In the case of stably integrated transgenes, it is often desirable to provide sequences capable of driving constitutive or inducible expression of the GAT polynucleotides of the invention, for example, using viral, e.g., CaMV, or plant derived regulatory sequences. Numerous plant derived regulatory sequences have been described, including sequences which direct expression in a tissue specific manner, e.g., TobRB7, patatin B33, GRP gene promoters, the rbcS-3A 5/012515
promoter, and the like. Alternatively, high level expression can be achieved by transiently expressing exogenous sequences of a plant viral vector, e.g., TMV, BMV, etc. Typically, transgenic plants constitutively expressing a GAT polynucleotide of the invention will be preferred, and the regulatory sequences are selected to insure constitutive stable expression of the GAT polypeptide. Typical vectors useful for expression of nucleic acids in higher plants are well known in the art and include vectors derived from the tumor-inducing (Ti) plasmid of Agrobacterium tumefaciens described by Rogers et al. (1987) Meth. Enzymol 153: 253-277. Exemplary y tumefaciens vectors useful herein are plasmids pKYLX6 and pKYLX7 of Schardl et al. (1987) Gene 61 : 1 - 11 and Berger et al. (1989) Proc. Natl. Acad. Sci. U.S.A. 86: 8402-8406. Another useful vector herein is plasmid pBI101.2 that is available from Clontech Laboratories, Inc. (Palo Alto, CA). A variety of plant viruses that can be employed as vectors are known in the art and include cauliflower mosaic virus (CaMV), geminivirus, brome mosaic virus, and tobacco mosaic virus. hi some embodiments of the present invention, a GAT polynucleotide construct suitable for transformation of plant cells is prepared. For example, a desired GAT polynucleotide can be incorporated into a recombinant expression cassette to facilitate introduction of the gene into a plant and subsequent expression of the encoded polypeptide. An expression cassette will typically comprise a GAT polynucleotide, or functional fragment thereof, operably linked to a promoter sequence and other transcriptional and translational initiation regulatory sequences which will direct expression of the sequence in the intended tissues (e.g., entire plant, leaves, seeds) of the transformed plant. For example, a strongly or weakly constitutive plant promoter can be employed which will direct expression of the GAT polypeptide in all tissues of a plant. Such promoters are active under most environmental conditions and states of development or cell differentiation. Examples of constitutive promoters include the cauliflower mosaic virus (CaMV) 35S transcription initiation region, the 1'- or 2'- promoter derived from T-DNA of Agrobacterium tumefaciens, the ubiquitin 1 promoter, the Smas promoter, the cinnamyl alcohol dehydrogenase promoter (U.S. Patent No. 5,683,439), the Nos promoter, the pEmu promoter, the rubisco promoter, the GRPl-8 promoter and other transcription initiation regions from various plant genes known to those of skill, h situations in which over expression of a GAT polynucleotide is detrimental to the plant or otherwise undesirable, one of skill, upon review of this disclosure, will recognize that weak constitutive promoters can be used for low-levels of expression. In those cases where high levels of expression is not harmful to the plant, a strong promoter, e.g., a t-RNA or other pol III promoter, or a strong pol II promoter, such as the cauliflower mosaic virus promoter, can be used. Alternatively, a plant promoter may be under environmental control. Such promoters are referred to here as "inducible" promoters. Examples of environmental conditions that may effect transcription by inducible promoters include pathogen attack, anaerobic conditions, or the presence of light, hi particular, examples of inducible promoters are the Adhl promoter which is inducible by hypoxia or cold stress, the Hsp70 promoter which is inducible by heat stress, and the PPDK promoter which is inducible by light. Also useful are promoters which are chemically inducible. The promoters used in the present invention can be "tissue-specific" and, as such, under developmental control in that the polynucleotide is expressed only in certain tissues, such as leaves, roots, fruit, flowers and/or seeds. An exemplary promoter is the anther specific promoter 5126 (U.S. Patent Nos. 5,689,049 and 5,689,051). Examples of seed-preferred promoters include, but are not limited to, 27 kD gamma zein promoter and waxy promoter, Boronat et al. 1986) Plant Sci. 47, 95-102; Reina et al. (1990) Nucleic Acids Res. 18 (21): 6426; and Kloesgen et al. (1986) Mol. Gen. Genet. 203: 237-244. Promoters that express in the embryo, pericarp, and endosperm are disclosed in U.S. Patent Application Ser. Nos. 60/097,233 filed August 20, 1998 and 60/098,230 filed August 28, 1998. The disclosures each of these are incorporated herein by reference in their entirety. In embodiments in which one or more nucleic acid sequences endogenous to the plant system are incorporated into the construct, the endogenous promoters (or variants thereof) from these genes can be employed for directing expression of the genes in the transfected plant. Tissue-specific promoters can also be used to direct expression of heterologous polynucleotides. hi general, the particular promoter used in the expression cassette in plants depends on the intended application. Either heterologous or non-heterologous (i.e., endogenous) promoters can be employed to direct expression of the nucleic acids of the present invention. These promoters can also be used, for example, in expression cassettes to drive expression of antisense nucleic acids to reduce, increase, or alter the concentration and/or composition of the proteins of the present invention in a desired tissue. Any of a number of promoters which direct transcription in plant cells are suitable. The promoter can be either constitutive or inducible. hi addition to the promoters noted above, promoters of bacterial origin which operate in plants include the octopine synthase promoter, the nopaline synthase promoter and other promoters derived from native Ti plasmids (see, Herrara-Estrella et al. (1983) Nature 303: 209-213). Viral promoters include the 35S and 19S RNA promoters of cauliflower mosaic virus (Odell et al. (1985) Nature 313: 810-812). Other plant promoters include the ribulose-1,3- bisphosphate carboxylase small subunit promoter and the phaseolin promoter. The promoter sequence from the E8 gene and other genes may also be used. The isolation and sequence of the E8 promoter is described in detail in Deikman and Fischer (1988) EMBOJ. 7: 3315-3327. To identify candidate promoters, the 5' portions of a genomic clone is analyzed for sequences characteristic of promoter sequences. For instance, promoter sequence elements include the TATA box consensus sequence (TATAAT), which is usually 20 to 30 base pairs upstream of the transcription start site, hi plants, further upstream from the TATA box, at positions -80 to -100, there is typically a promoter element with a series of adenines surrounding the trinucleotide G (or T) as described by Messing et al. (1983) Genetic Engineering in Plants, eds. Kosage, et al, pp. 221-227. In preparing polynucleotide constructs, e.g., vectors, of the invention, sequences other than the promoter and the cojoined polynucleotide can also be employed. If normal polypeptide expression is desired, a polyadenylation region at the 3 '-end of a GAT-encoding region can be included. The polyadenylation region can be derived, for example, from a variety of plant genes, or from T-DNA. The 3' end sequence to be added can be derived from, for example, the nopaline synthase or octopine synthase genes, or alternatively from another plant gene, or less preferably from any other eukaryotic gene. An intron sequence can be added to the 5' untranslated region of the coding sequence or the partial coding sequence to increase the amount of the mature message that accumulates. See for example Bucliman and Berg (1988) Mol. Cell Biol. 8: 4395-4405 and Callis et al. (1987) Genes Dev. 1: 1183-1200. Use of maize introns Adhl, intron 1, 2, and 6, and the Bronze- 1 intron are known in the art. See generally, Freeling and Walbot, eds. (1994) The Maize Handbook (Springer, New York), chapter 116. The construct can also include a marker gene which confers a selectable phenotype on plant cells. For example, the marker may encode biocide tolerance, particularly antibiotic tolerance, such as tolerance to kanamycin, G418, bleomycin, hygromycin, or herbicide tolerance, such as tolerance to chlorsulfuron, or phosphinothricin (the active ingredient in the herbicides bialaphos and Basta). Specific initiation signals can aid in efficient translation of a GAT polynucleo tide-encoding sequence of the present invention. These signals can include, e.g., the ATG initiation codon and adjacent sequences. In cases where a GAT polypeptide-encoding sequence, its initiation codon and upstream sequences are inserted into an appropriate expression vector, no additional translational control signals may be needed. However, in cases where only the coding sequence (e.g., a mature protein coding sequence), or a portion thereof, is inserted, exogenous transcriptional control signals including the initiation codon must be provided. Furthermore, the initiation codon must be in the correct reading frame to ensure transcription of the entire insert. Exogenous transcriptional elements and initiation codons can be of various origins, both natural and synthetic. The efficiency of expression may be enhanced by the inclusion of enhancers appropriate to the cell system in use (Scharf et al. (1994) Results Probl Cell Differ. 20: 125-62 and Bittner et al. (1987) Methods in Enzymol 153: 516-544). Secretion/Localization Sequences Polynucleotides of the invention can also be fused, for example, in-frame to nucleic acids encoding a secretion localization sequence, to target polypeptide expression to a desired cellular compartment, membrane, or organelle of a host cell, or to direct polypeptide secretion to the periplasmic space or into the cell culture media. Such sequences are known to those of skill, and include secretion leader peptides, organelle targeting sequences (e.g., nuclear localization sequences, ER retention signals, mitochondrial transit sequences, and chloroplast transit sequences), membrane localization/anchor sequences (e.g., stop transfer sequences, GPI anchor sequences), and the like. hi a preferred embodiment, a polynucleotide of the invention is fused in frame with an N-terminal chloroplast transit sequence (or chloroplast transit peptide sequence) derived from a gene encoding a polypeptide that is normally targeted to the chloroplast. Such sequences are typically rich in serine and threonine; are deficient in aspartate, glutamate, and tyrosine; and generally have a central domain rich in positively charged amino acids. Expression Hosts hi a further embodiment, the present invention relates to host cells containing the above-described constructs. The host cell can be a eukaryotic cell, such as a mammalian cell, a yeast cell, or a plant cell, or the host cell can be a prokaryotic cell, such as a bacterial cell. Introduction of the construct into the host cell can be effected by calcium phosphate transfection, DEAE-Dextran mediated transfection, electroporation, or other common techniques (Davis et al, Basic Methods in Molecular Biology). A host cell is optionally chosen for its ability to modulate the expression of the inserted sequences or to process the expressed protein in the desired fashion. Such modifications of the protein include, but are not limited to, acetylation, carboxylation, glycosylation, phosphorylation, lipidation and acylation. Post- translational processing that cleaves a "pre" or a "prepro" form of the protein may also be important for correct insertion, folding and/or function. Different host cells such as E. coli, Bacillus sp., yeast or mammalian cells such as CHO, HeLa, BHK, MDCK, 293, WI38, etc. have specific cellular machinery and characteristic mechanisms, e.g. , for post-translational activities and may be chosen to ensure the desired modification and processing of the introduced, foreign protein. For long-term, high-yield production of recombinant proteins, stable expression systems can be used. For example, plant cells, explants or tissues, e.g. shoots, or leaf discs, which stably express a polypeptide of the invention are transduced using expression vectors wliich contain viral origins of replication or endogenous expression elements and a selectable marker gene. Following the introduction of the vector, cells may be allowed to grow for a period determined to be appropriate for the cell type, e.g., 1 or more hours for bacterial cells, 1-4 days for plant cells, 2-4 weeks for some plant explants, in an enriched media before they are switched to selective media. The purpose of the selectable marker is to confer resistance to selection, and its presence allows growth and recovery of cells which successfully express the introduced sequences. For example, transgenic plants expressing the polypeptides of the invention can be selected directly for resistance to the herbicide, glyphosate. Resistant embryos derived from stably transformed explants can be proliferated, e.g., using tissue culture techniques appropriate to the cell type. Host cells transformed with a nucleotide sequence encoding a polypeptide of the invention are optionally cultured under conditions suitable for the expression and recovery of the encoded protein from cell culture. The protein or fragment thereof produced by a recombinant cell may be secreted, membrane-bound, or contained intracellularly, depending on the sequence and/or the vector used. As will be understood by those of skill in the art, expression vectors containing GAT polynucleotides of the invention can be designed with signal sequences which direct secretion of the mature polypeptides through a prokaryotic or eukaryotic cell membrane. Additional Polypeptide Sequences Polynucleotides of the present invention may also comprise a coding sequence fused in- frame to a marker sequence that, e.g., facilitates purification of the encoded polypeptide. Such purification facilitating domains include, but are not limited to, metal chelating peptides such as histidine-tryptophan modules that allow purification on immobilized metals, a sequence which binds glutathione (e.g., GST), a hemagglutinin (HA) tag (corresponding to an epitope derived from the influenza hemagglutinin protein; Wilson et al. (1984) Cell 37: 767), maltose binding protein sequences, the FLAG epitope utilized in the FLAGS extension/affinity purification system (Immunex Corp, Seattle, WA), and the like. The inclusion of a protease-cleavable polypeptide linker sequence between the purification domain and the GAT homologue sequence is useful to facilitate purification. One expression vector contemplated for use in the compositions and methods described herein provides for expression of a fusion protein comprising a polypeptide of the invention fused to a polyhistidine region separated by an enterokinase cleavage site. The histidine residues facilitate purification on EMIAC (immobilized metal ion affinity chromatography, as described in Porath et al. (1992) Protein Expression and Purification 3: 263-281) while the enterokinase cleavage site provides a means for separating the GAT homologue polypeptide from the fusion protein. pGEX vectors (Promega; Madison, WI) may also be used to express foreign polypeptides as fusion proteins with glutathione S-transferase (GST). In general, such fusion proteins are soluble and can easily be purified from lysed cells by adsorption to ligand-agarose beads (e.g., glutathione-agarose in the case of GST-fusions) followed by elution in the presence of free ligand. Polypeptide Production and Recovery Following transduction of a suitable host and growth of the host cells to an appropriate cell density, the selected promoter is induced by appropriate means (e.g., temperature shift or chemical induction) and cells are cultured for an additional period. Cells are typically harvested by centrifugation, disrupted by physical or chemical means, and the resulting crude extract retained for further purification. Microbial cells employed in the expression of proteins can be disrupted by any convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing agents, or other methods, which are well known to those skilled in the art. As noted, many references are available for the culture and production of many cells, including cells of bacterial, plant, animal (especially mammalian) and archebacterial origin. See e.g., Sambrook, Ausubel, and Berger (all supra), as well as Freshney (1994) Culture of Animal Cells: A Manual of Basic Technique, 3rd ed. (Wiley-Liss, New York) and the references cited therein; Doyle and Griffiths (1997) Mammalian Cell Culture: Essential Techniques (John Wiley and Sons, NY); Humason (1979) Animal Tissue Techniques, 4th ed. (W.H. Freeman and Company); and Ricciardelli, et al. (1989) In vitro Cell Dev. Biol. 25: 1016-1024. For plant cell culture and regeneration see, Payne et al (1992) Plant Cell and Tissue Culture in Liquid Systems (John Wiley & Sons, Luc, New York, NY); Gamborg and Phillips, eds. (1995) Plant Cell, Tissue and Organ Culture: Fundamental Methods! Springer Lab Manual (Springer-Verlag, Berlin); Jones, ed. (1984) Plant Gene Transfer and Expression Protocols (Humana Press, Totowa, New Jersey); and Croy, ed. (1993) Plant Molecular Biology (Bios Scientific
Publishers, Oxford, U.K.), ISBN 0 12 198370 6. Cell culture media in general are set forth in Atlas and Parks, eds. (1993) The Handbook of Microbiological Media (CRC Press, Boca Raton, FL). Additional information for cell culture is found in available commercial literature such as the Life Science Research Cell Culture Catalogues).998) from Sigma-Aldrich, Inc. (St Louis, MO) ("Sigma-LSRCCC") and, e.g., The Plant Culture Catalogu _and supplement (1997) also from Sigma- Aldrich, Inc. (St Louis, MO) ("Sigma-PCCS"). Further details regarding plant cell transformation and transgenic plant production are found below. Polypeptides of the invention can be recovered and purified from recombinant cell cultures by any of a number of methods well known in the art, including ammonium sulfate or ethanol precipitation, acid extraction, anion or cation exchange chromatography, phosphocellulose chromatography, hydrophobic interaction chromatography, affinity chromatography (e.g., using any of the tagging systems noted herein), hydroxylapatite chromatography, and lectin chromatography. Protein refolding steps can be used, as desired, in completing the configuration of the mature protein. Finally, high performance liquid chromatography (HPLC) can be employed in the final purification steps. In addition to the references noted supra, a variety of purification methods are well known in the art, including, e.g., those set forth in Sandana (1997) Bioseparation of Proteins (Academic Press, Inc.; Bollag et al. (1996) Protein Methods, 2nd ed. (Wiley-Liss, NY); Walker (1996) The Protein Protocols Handbook (Humana Press, NJ), Harris and Angal (1990) Protein Purification Applications: A Practical Approach (IRL Press at Oxford, Oxford, England); Harris and Angal Protein Purification Methods: A Practical Approach (ERL Press at Oxford, Oxford, England); Scopes (1993) Protein Purification: Principles and Practice, 3rd ed. (Springer Verlag, NY); Janson and Ryden (1998) Protein Purification: Principles, High Resolution Methods and Applications, 2nd ed. (Wiley- VCH, NY); and Walker (1998) Protein Protocols on CD-ROM (Humana Press, NJ). In some cases, it is desirable to produce the GAT polypeptide of the invention in a large scale suitable for industrial and/or commercial applications, hi such cases bulk fermentation procedures are employed. Briefly, a GAT polynucleotide, e.g. , a polynucleotide comprising any one of SEQ ED NO: 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565, 566, 567, 620, 622, 624, 626, 628, 630, 632, 634, 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, 668, 670, 672, 674, 676, 678, 680, 682, 684, 686, 688, 690, 692, 694, 696, 698, 700, 702, 704, 706, 708, 710, 712, 714, 716, 718, 720, 722, 724, 726, 728, 730, 732, 734, 736, 738, 740, 742, 744, 746, 748, 750, 752, 754, 756, 758, 760, 762, 764, 768, 770, 772, 774, 776, 778, 780, 782, 784, 786, 788, 790, 792, 794, 796, 798, 800, 802, 804, 806, 808, 810, 812, 814, 816, 818, 820, 822, 824, 832, 834, 836, 838, 840, 842, 844, 846, 848, 850, 852, 854, 856, 858, 860, 862, 864, 866, 868, 870, 872, 874, 876, 878, 880, 882, 884, 886, 888, 890, 892, 894, 896, 898, 900, 902, 904, 906, 908, 910, 912, 914, 916, 918, 920, 922, 924, 926, 928, 930, 932, 933, 934, 935, 936, 937, 938, 939, 940, 941, 942, 943, 944, 945, 947, 949, 951, and 952, or other nucleic acids encoding GAT polypeptides of the invention can be cloned into an expression vector. For example, U.S. Patent No. 5,955,310 to Widner et al. "METHODS FOR PRODUCING A POLYPEPTIDE IN A BACILLUS CELL," describes a vector with tandem promoters, and stabilizing sequences operably linked to a polypeptide encoding sequence. After inserting the polynucleotide of interest into a vector, the vector is transformed into a bacterial, e.g., a Bacillus subtilis strain PL1801IEE (amyE, apr, npr, sρoIEE::Tn917) host. The introduction of an expression vector into a Bacillus cell may, for instance, be effected by protoplast transformation (see, e.g., Chang and Cohen (l979)_Mol Gen. Genet. 168:111), by using competent cells (see, e.g., Young and Spizizin (1961) J. Bacteriol. 81:823, or Dubnau and Davidoff-Abelson (197L) J Mol. Biol. 56: 209), by electroporation (see, e.g., Shigekawa and Dower (1988) Biotechniques 6: 742), or by conjugation (see, e.g., Koehler and Thorne (1987) J Bacteriol. 169: 5271), see also, Ausubel, Sambrook and Berger, all supra. The transformed cells are cultivated in a nutrient medium suitable for production of the polypeptide using methods that are known in the art. For example, the cell may be cultivated by shake flask cultivation, small-scale or large- scale fermentation (including continuous, batch, fed-batch, or solid state fermentations) in laboratory or industrial fermentors performed in a suitable medium and under conditions allowing the polypeptide to be expressed and/or isolated. The cultivation takes place in a suitable nutrient medium comprising carbon and nitrogen sources and inorganic salts, using procedures known in the art. Suitable media are available from commercial suppliers or may be prepared according to published compositions (e.g., in catalogues of the American Type Culture Collection). The secreted polypeptide can be recovered directly from the medium. The resulting polypeptide may be isolated by methods known in the art. For example, the polypeptide may be isolated from the nutrient medium by conventional procedures including, but not limited to, centrifugation, filtration, extraction, spray-drying, evaporation, or precipitation. The isolated polypeptide may then be further purified by a variety of procedures known in the art including, but not limited to, chromatography (e.g., ion exchange, affinity, hydrophobic, chromatofocusing, and size exclusion), electrophoretic procedures (e.g., preparative isoelectric focusing), differential solubility (e.g., ammonium sulfate precipitation), or extraction (see, e.g., Bollag et al. (1996) Protein Methods, 2nd ed. (Wiley-Liss, NY) and Walker (1996) The Protein Protocols Handbook (Humana Press, NJ). Cell-free transcription/translatioii systems can also be employed to produce polypeptides using DNAs or RNAs of the present invention. Several such systems are commercially available. A general guide to in vitro transcription and translation protocols is found in Tymms (1995) In vitro Transcription and Translation Protocols: Methods in Molecular Biology (Garland Publishing, NY), vol. 37.
SUBSTRATES AND FORMATS FOR SEQUENCE RECOMBINATION The polynucleotides of the invention are optionally used as substrates for a variety of diversity generating procedures, e.g., mutation, recombination and recursive recombination reactions, in addition to their use in standard cloning methods as set forth in, e.g., Ausubel, Berger and Sambrook, to produce additional GAT polynucleotides and polypeptides with desired properties. A variety of diversity generating protocols are available and described in the art. The procedures can be used separately, and/or in combination to produce one or more variants of a polynucleotide or set of polynucleotides, as well variants of encoded proteins. Individually and collectively, these procedures provide robust, widely applicable ways of generating diversified polynucleotides and sets of polynucleotides (including, e.g., polynucleotide libraries) useful, e.g., for the engineering or rapid evolution of polynucleotides, proteins, pathways, cells and/or organisms with new and/or improved characteristics. The process of altering the sequence can result in, for example, single nucleotide substitutions, multiple nucleotide substitutions, and insertion or deletion of regions of the nucleic acid sequence. While distinctions and classifications are made in the course of the ensuing discussion for clarity, it will be appreciated that the techniques are often not mutually exclusive. Indeed, the various methods can be used singly or in combination, in parallel or in series, to access diverse sequence variants. The result of any of the diversity generating procedures described herein can be the generation of one or more polynucleotides, which can be selected or screened for polynucleotides that encode proteins with or which confer desirable properties. Following diversification by one or more of the methods described herein, or otherwise available to one of skill, any polynucleotides that are produced can be selected for a desired activity or property, e.g. altered Km for glyphosate, altered Km for acetyl CoA, use of alternative cofactors (e.g., propionyl CoA) 05/012515
increased kcat, etc. This can include identifying any activity that can be detected, for example, in an automated or automatable format, by any of the assays in the art. For example, GAT homologs with increased specific activity can be detected by assaying the conversion of glyphosate to N-acetylglyphosate, e.g., by mass spectrometry. Alternatively, improved ability to confer resistance to glyphosate can be assayed by growing bacteria transformed with a nucleic acid of the invention on agar containing increasing concentrations of glyphosate or by spraying transgenic plants incorporating a nucleic acid of the invention with glyphosate. A variety of related (or even unrelated) properties can be evaluated, in serial or in parallel, at the discretion of the practitioner. Additional details regarding recombination and selection for herbicide tolerance can be found, e.g., in "DNA SHUFFLING TO PRODUCE HERBICIDE RESISTANT CROPS" (U.S. Pub. No. 2002/0058249) filed August 12,1999. Descriptions of a variety of diversity generating procedures, including multigene shuffling and methods for generating modified nucleic acid sequences encoding multiple enzymatic domains, are found the following publications and the references cited therein: Soong, N. et al. (2000) Nat. Genet. 25(4): 436-39; Stemmer, et al. (1999) Tumor Targeting 4: 1-4; Ness et al. (1999) Nature Biotech. 17:893-896; Chang etal. (1999) Nature Biotech. 17: 793-797; Minshull and Stemmer (1999) Current Opinion in Chemical Biology 3: 284-290; Christians et al. (1999) Nature Biotech. 17: 259-264; Crameri et al. (1998) Nature 391: 288-291; Crameri et al. (1997) Nature Biotech. 15: 436-438; Zhang et al. (1997) Proc. Natl Acad. Sci. USA 94: 4504-4509; Patten et al. (1997) Current Opinion in Biotech. 8: 724-733; Crameri et al. (1996) Nature Med. 2:100-103; Crameri et al. (1996) Nature Biotech. 14:315-319; Gates et al. (1996) J Mol. Biol. 255: 373-386; Stemmer (1996) "Sexual PCR and Assembly PCR" in The Encyclopedia of Molecular Biology (VCH Publishers, New York) pp.447-457; Crameri and Stemmer (1995) BioTechniques 18: 194-195; Stemmer et al, (1995) Gene 164: 49- 53; Stemmer (1995) Science 270: 1510; Stemmer (1995) Bio/Technology 13: 549- 553; Stemmer (1994) Nature 370: 389-391; and Stemmer (1994) Proc. Nat 1 Acad. Sci. USA 91:10747-10751. Mutational methods of generating diversity include, for example, site- directed mutagenesis (Ling et al. (1997) "Approaches to DNA mutagenesis: an overview" Anal Biochem. 254(2): 157-178; Dale et al. (1996) "Oligonucleotide- directed random mutagenesis using the phosphorothioate method" Methods Mol. Biol. 57:369-374; Smith (1985) "In vitro mutagenesis" Aim. Rev. Genet. 19:423- 462; Botstein & Shortle (1985) "Strategies and applications of zn vitro mutagenesis" Science 229:1193-1201; Carter (1986) "Site-directed mutagenesis" Biochem. J. 237:1-7; and Kunkel (1987) "The efficiency of ohgonucleotide directed mutagenesis" in Nucleic Acids & Molecular Biology (Eckstein, F. and Lilley, D.M.J. eds., Springer Verlag, Berlin)); mutagenesis using uracil containing templates (Kunkel (1985) "Rapid and efficient site-specific mutagenesis without phenotypic selection" Proc. Natl. Acad. Sci. USA 82:488-492; Kunkel et al. (1987) "Rapid and efficient site-specific mutagenesis without phenotypic selection" Methods in Enzymol. 154, 367-382; and Bass et al. (1988) "Mutant Trp repressors with new DNA-binding specificities" Science 242:240-245); oligonucleotide- directed mutagenesis (Methods in Enzymol. 100: 468-500 (1983); Methods in Enzymol. 154: 329-350 (1987); Zoller & Smith (1982) "Oligonucleotide-directed mutagenesis using M13-derived vectors: an efficient and general procedure for the production of point mutations in any DNA fragment" Nucleic Acids Res. 10:6487- 6500; Zoller & Smith (1983) "Oligonucleotide-directed mutagenesis of DNA fragments cloned into Ml 3 vectors" Methods in Enzymol. 100:468-500; and Zoller & Smith (1987) "Oligonucleotide-directed mutagenesis: a simple method using two ohgonucleotide primers and a single-stranded DNA template" Methods in Enzymol. 154:329-350); phosphorothioate-modified DNA mutagenesis (Taylor et al. (1985) "The use of phosphorothioate-modified DNA in restriction enzyme reactions to prepare nicked DNA" Nucl. Acids Res. 13: 8749-8764; Taylor et al. (1985) "The rapid generation of oligonucleotide-directed mutations at high frequency using phosphorothioate-modified DNA" Nucl. Acids Res. 13: 8765- 8787; Nakamaye & Eckstein (1986) "Inhibition of restriction endonuclease Nci I cleavage by phosphorothioate groups and its application to oligonucleotide- directed mutagenesis" Nucl. Acids Res. 14: 9679-9698; Sayers et al. (1988) "Y-T Exonucleases in phosphorofhioate-based oligonucleotide-directed mutagenesis" Nucl. Acids Res. 16:791-802; and Sayers et al. (1988) "Strand specific cleavage of phosphorothioate-containing DNA by reaction with restriction endonucleases in the presence of ethidium bromide" Nucl. Acids Res. 16: 803-814); mutagenesis using gapped duplex DNA (Kramer et al. (1984) "The gapped duplex DNA approach to oligonucleotide-directed mutation construction" Nucl. Acids Res. 12: 9441-9456; Kramer & Fritz (1987) Methods in Enzymol. "Oligonucleotide- directed construction of mutations via gapped duplex DNA" 154:350-367; Kramer et al. (1988) "Improved enzymatic in vitro reactions in the gapped duplex DNA approach to oligonucleotide-directed construction of mutations" Nucl. Acids Res. 16: 7207; and Fritz et al. (1988) "Oligonucleotide-directed construction of mutations: a gapped duplex DNA procedure without enzymatic reactions in vitro" Nucl. Acids Res. 16: 6987-6999). Additional suitable methods include point mismatch repair (Kramer et al. (1984) "Point Mismatch Repair" Cell 38:879-887), mutagenesis using repair- deficient host strains (Carter et al. (1985) "Improved ohgonucleotide site-directed mutagenesis using Ml 3 vectors" Nucl. Acids Res. 13: 4431-4443; and Carter (1987) "Improved oligonucleotide-directed mutagenesis using Ml 3 vectors" Methods in Enzymol. 154: 382-403), deletion mutagenesis (Eghtedarzadeh & Henikoff (1986) "Use of oligonucleotides to generate large deletions" Nucl. Acids Res. 14: 5115), restriction-selection and restriction-purification (Wells et al. (1986) "Importance of hydrogen-bond formation in stabilizing the transition state of subtilisin" Phil. Trans. R. Soc. Lond. A 317: 415-423), mutagenesis by total gene synthesis (Nambiar et al. (1984) "Total synthesis and cloning of a gene coding for the ribonuclease S protein" Science 223: 1299-1301; Sakamar and Khorana (1988) "Total synthesis and expression of a gene for the a-subunit of bovine rod outer segment guanine nucleotide-binding protein (transducin)" Nucl. Acids Res. 14: 6361-6372; Wells et al. (1985) "Cassette mutagenesis: an efficient method for generation of multiple mutations at defined sites" Gene 34:315-323; and Grundstrόm et al. (1985) "Oligonucleotide-directed mutagenesis by microscale 'shot-gun' gene synthesis" Nucl. Acids Res. 13: 3305-3316); double-strand break repair (Mandecki (1986); Arnold (1993) "Protein engineering for unusual environments" Current Opinion in Biotechnology 4:450-455; and "Oligonucleotide-directed double-strand break repair in plasmids of Escherichia coli: a method for site-specific mutagenesis" Proc. Natl. Acad. Sci. USA, 83:7177- 7181). Additional details on many of the above methods can be found in Methods in Enzymology Volume 154, which also describes useful controls for trouble- shooting problems with various mutagenesis methods. Additional details regarding various diversity generating methods can be found in the following U.S. patents, PCT publications, and EPO publications: U.S. Pat. No. 5,605,793 to Stemmer (February 25, 1997), "Methods for In vitro Recombination;" U.S. Pat. No. 5,811,238 to Stemmer etal. (September 22, 1998) "Methods for Generating Polynucleotides having Desired Characteristics by Iterative Selection and Recombination;" U.S. Pat. No. 5,830,721 to Stemmer et al. (November 3, 1998), "DNA Mutagenesis by Random Fragmentation and Reassembly;" U.S. Pat. No. 5,834,252 to Stemmer, et al. (November 10, 1998) "End-Complementary Polymerase Reaction;" U.S. Pat. No. 5,837,458 to Minshull, et al. (November 17, 1998), "Methods and Compositions for Cellular and
Metabolic Engineering;" WO 95/22625, Stemmer and Crameri, "Mutagenesis by Random Fragmentation and Reassembly;" WO 96/33207 by Stemmer and Lipschutz "End Complementary Polymerase Chain Reaction;" WO 97/20078 by Stemmer and Crameri "Methods for Generating Polynucleotides having Desired Characteristics by Iterative Selection and Recombination;" WO 97/35966 by Minshull and Stemmer, "Methods and Compositions for Cellular and Metabolic Engineering;" WO 99/41402 by Punnonen et al "Targeting of Genetic Vaccine Vectors;" WO 99/41383 by Punnonen et al. "Antigen Library Immunization;" WO 99/41369 by Punnonen et al. "Genetic Vaccine Vector Engineering;" WO 99/41368 by Punnonen et al. "Optimization of hnmunomodulatory Properties of Genetic Vaccines;" EP 752008 by Stemmer and Crameri, "DNA Mutagenesis by Random Fragmentation and Reassembly;" EP 0932670 by Stemmer "Evolving Cellular DNA Uptake by Recursive Sequence Recombination;" WO 99/23107 by Stemmer et al, "Modification of Virus Tropism and Host Range by Viral Genome Shuffling;" WO 99/21979 by Apt et al, "Human PapiUomavirus Vectors;" WO 98/31837 by del Cardayre et al. "Evolution of Whole Cells and Organisms by Recursive Sequence Recombination;" WO 98/27230 by Patten and Stemmer, "Methods and Compositions for Polypeptide Engineering;" WO 98/13487 by Stemmer et al, "Methods for Optimization of Gene Therapy by Recursive Sequence Shuffling and Selection;" WO 00/00632, "Methods for Generating Highly Diverse Libraries;" WO 00/09679, "Methods for Obtaining in vitro Recombined Polynucleotide Sequence Banks and Resulting Sequences;" WO 98/42832 by Arnold et al, "Recombination of Polynucleotide Sequences Using Random or Defined Primers;" WO 99/29902 by Arnold et al, "Method for Creating Polynucleotide and Polypeptide Sequences;" WO 98/41653 by Vind, "An in vitro Method for Construction of a DNA Library;" WO 98/41622 by Borchert et al, "Method for Constructing a Library Using DNA Shuffling;" WO 98/42727 by Pati and Zarling, "Sequence Alterations using Homologous Recombination;" WO 00/18906 by Patten et al, "Shuffling of Codon- Altered Genes;" WO 00/04190 by del Cardayre et al. "Evolution of Whole Cells and Organisms by Recursive Recombination;" WO 00/42561 by Crameri et al, "Ohgonucleotide Mediated Nucleic Acid Recombination;" WO 00/42559 by Selifonov and Stemmer
"Methods of Populating Data Structures for Use in Evolutionary Simulations;" WO 00/42560 by Selifonov et al, "Methods for Making Character Strings, Polynucleotides & Polypeptides Having Desired Characteristics;" WO 01/23401 by Welch et al, "Use of Codon-Varied Ohgonucleotide Synthesis for Synthetic Shuffling;" and WO 01/64864 "Single-Stranded Nucleic Acid Template-Mediated Recombination and Nucleic Acid Fragment Isolation" by Affholter. Certain U.S. applications provide additional details regarding various diversity generating methods, including "SHUFFLING OF CODON ALTERED GENES" by Patten et al. filed September 28, 1999, (USSN 09/407,800); "EVOLUTION OF WHOLE CELLS AND ORGANISMS BY
RECURSIVE SEQUENCE RECOMBINATION", by del Cardayre et al. filed July 15, 1998 (USSN 09/166,188), and July 15, 1999 (U.S. Patent No. 6,379,964); "OLIGONUCLEOTEDE MEDIATED NUCLEIC ACED RECOMBINATION" by Crameri et al, filed September 28, 1999 (U.S. Patent No. 6,376,246); "OLIGONUCLEOTEDE MEDIATED NUCLEIC ACED RECOMBINATION" by Crameri et al, filed January 18, 2000 (WO 00/42561); "USE OF CODON-BASED OLIGONUCLEOTEDE SYNTHESIS FOR SYNTHETIC SHUFFLING" by Welch et al, filed September 28, 1999 (U.S. Patent No. 6,436,675); "METHODS FOR MAKING CHARACTER STRINGS, POLYNUCLEOTEDES & POLYPEPTIDES HAVING DESIRED CHARACTERISTICS" by Selifonov et al, filed January 18, 2000, (WO 00/42560); "METHODS FOR MAKING CHARACTER STRINGS, POLYNUCLEOTIDES & POLYPEPTIDES HAVING DESIRED CHARACTERISTICS" by Selifonov et al, filed July 18, 2000 (USSN 09/618,579); "METHODS OF POPULATING DATA STRUCTURES FOR USE EN EVOLUTIONARY SIMULATIONS" by Selifonov and Stemmer (WO 00/42559), filed January 18, 2000; and "SENGLE-STRANDED NUCLEIC ACED TEMPLATE-MEDIATED RECOMBINATION AND NUCLEIC ACED
FRAGMENT ISOLATION" by Affholter (USSN 60/186,482, filed March 2, 2000). In brief, several different general classes of sequence modification methods, such as mutation, recombination, etc. are applicable to the present invention and set forth in the references above. That is, alterations to the component nucleic acid sequences to produced modified gene fusion constructs can be performed by any number of the protocols described, either before cojoining of the sequences, or after the cojoining step. The following exemplify some of the different types of preferred formats for diversity generation in the context of the present invention, including, e.g. , certain recombination based diversity generation formats. Nucleic acids can be recombined in vitro by any of a variety of techniques discussed in the references above, including e.g., DNAse digestion of nucleic acids to be recombined followed by ligation and/or PCR reassembly of the nucleic acids. For example, sexual PCR mutagenesis can be used in which random (or pseudo random, or even non-random) fragmentation of the DNA molecule is followed by recombination, based on sequence similarity, between DNA molecules with different but related DNA sequences, in vitro, followed by fixation of the crossover by extension in a polymerase chain reaction. This process and many process variants is described in several of the references above, e.g., in Stemmer (1994) Proc. Natl Acad. Sci. USA 91:10747-10751. Similarly, nucleic acids can be recursively recombined in vivo, e.g., by allowing recombination to occur between nucleic acids in cells. Many such in vivo recombination formats are set forth in the references noted above. Such formats optionally provide direct recombination between nucleic acids of interest, or provide recombination between vectors, viruses, plasmids, etc., comprising the nucleic acids of interest, as well as other formats. Details regarding such procedures are found in the references noted above. Whole genome recombination methods can also be used in which whole genomes of cells or other organisms are recombined, optionally including spiking of the genomic recombination mixtures with desired library components (e.g., genes corresponding to the pathways of the present invention). These methods have many applications, including those in which the identity of a target gene is not known. Details on such methods are found, e.g., in WO 98/31837 by del Cardayre et al. "Evolution of Whole Cells and Organisms by Recursive Sequence Recombination;" and in, e.g., WO 00/04190 by del Cardayre et al, also entitled "Evolution of Whole Cells and Organisms by Recursive Sequence Recombination." Thus, any of these processes and techniques for recombination, recursive recombination, and whole genome recombination, alone or in combination, can be used to generate the modified nucleic acid sequences and/or modified gene fusion constructs of the present invention. Synthetic recombination methods can also be used, in which oligonucleotides corresponding to targets of interest are synthesized and reassembled in PCR or ligation reactions which include oligonucleotides which correspond to more than one parental nucleic acid, thereby generating new recombined nucleic acids. Oligonucleotides can be made by standard nucleotide addition methods, or can be made, e.g., by tri-nucleotide synthetic approaches. Details regarding such approaches are found in the references noted above, including, e.g., WO 00/42561 by Crameri et al, "Ohgonucleotide Mediated Nucleic Acid Recombination;" WO 01/23401 by Welch et al, "Use of Codon- Varied Ohgonucleotide Synthesis for Synthetic Shuffling;" WO 00/42560 by Selifonov et al, "Methods for Making Character Strings, Polynucleotides and Polypeptides Having Desired Characteristics;" and WO 00/42559 by Selifonov and Stemmer "Methods of Populating Data Structures for Use in Evolutionary Simulations." In silico methods of recombination can be effected in which genetic algorithms are used in a computer to recombine sequence strings which correspond to homologous (or even non-homologous) nucleic acids. The resulting recombined sequence strings are optionally converted into nucleic acids by synthesis of nucleic acids which correspond to the recombined sequences, e.g., in concert with ohgonucleotide synthesis gene reassembly techniques. This approach can generate random, partially random or designed variants. Many details regarding in silico recombination, including the use of genetic algorithms, genetic operators and the like in computer systems, combined with generation of corresponding nucleic acids (and/or proteins), as well as combinations of designed nucleic acids and/or proteins (e.g., based on cross-over site selection) as well as designed, pseudo-random or random recombination methods are described in WO 00/42560 by Selifonov et al, "Methods for Making Character Strings, Polynucleotides and Polypeptides Having Desired Characteristics" and WO 00/42559 by Selifonov and Stemmer "Methods of Populating Data Structures for Use in Evolutionary Simulations." Extensive details regarding in silico recombination methods are found in these applications. This methodology is generally applicable to the present invention in providing for recombination of nucleic acid sequences and/or gene fusion constructs encoding proteins involved in various metabolic pathways (such as, for example, carotenoid biosynthetic pathways, ectoine biosynthetic pathways, polyhydroxyalkanoate biosynthetic pathways, aromatic polyketide biosynthetic pathways, and the like) in silico and/or the generation of corresponding nucleic acids or proteins. Many methods of accessing natural diversity, e.g. , by hybridization of diverse nucleic acids or nucleic acid fragments to single-stranded templates, followed by polymerization and/or ligation to regenerate full-length sequences, optionally followed by degradation of the templates and recovery of the resulting modified nucleic acids can be similarly used. In one method employing a single- stranded template, the fragment population derived from the genomic libraryries) is annealed with partial, or, often approximately full length ssDNA or RNA corresponding to the opposite strand. Assembly of complex chimeric genes from this population is then mediated by nuclease-base removal of non-hybridizing fragment ends, polymerization to fill gaps between such fragments and subsequent single stranded ligation. The parental polynucleotide strand can be removed by digestion (e.g., if RNA or uracil-containing), magnetic separation under denaturing conditions (if labeled in a manner conducive to such separation) and other available se aration/purification methods. Alternatively, the parental strand is optionally co-purified with the chimeric strands and removed during subsequent screening and processing steps. Additional details regarding this approach are found, e.g., in "Single-Stranded Nucleic Acid Template-Mediated Recombination and Nucleic Acid Fragment Isolation" by Affholter, WO 01/64864. In another approach, single-stranded molecules are converted to double-stranded DNA (dsDNA) and the dsDNA molecules are bound to a solid support by ligand-mediated binding. After separation of unbound DNA, the selected DNA molecules are released from the support and introduced into a suitable host cell to generate a library of enriched sequences which hybridize to the probe. A library produced in this manner provides a desirable substrate for further diversification using any of the procedures described herein. Any of the preceding general recombination formats can be practiced in a reiterative fashion (e.g., one or more cycles of mutation/recombination or other diversity generation methods, optionally followed by one or more selection methods) to generate a more diverse set of recombinant nucleic acids. Mutagenesis employing polynucleotide chain termination methods have also been proposed (.see e.g., U.S. Patent No. 5,965,408, "Method of DNA reassembly by interrupting synthesis" to Short, and the references above), and can be applied to the present invention. In this approach, double stranded DNAs corresponding to one or more genes sharing regions of sequence similarity are combined and denatured, in the presence or absence of primers specific for the gene. The single stranded polynucleotides are then annealed and incubated in the presence of a polymerase and a chain terminating reagent (e.g. , ultraviolet, gamma or X-ray irradiation; ethidium bromide or other intercalators; DNA binding proteins, such as single strand binding proteins, transcription activating factors, or histones; polycyclic aromatic hydrocarbons; trivalent chromium or a trivalent chromium salt; or abbreviated polymerization mediated by rapid thermocycling; and the like), resulting in the production of partial duplex molecules. The partial duplex molecules, e.g., containing partially extended chains, are then denatured and reannealed in subsequent rounds of replication or partial replication resulting in polynucleotides which share varying degrees of sequence similarity and which are diversified with respect to the starting population of DNA molecules. Optionally, the products, or partial pools of the products, can be amplified at one or more stages in the process. Polynucleotides produced by a chain termination method, such as described above, are suitable substrates for any other described recombination format. Diversity also can be generated in nucleic acids or populations of nucleic acids using a recombinational procedure termed "incremental truncation for the creation of hybrid enzymes" ("ITCHY") described in Ostermeier et al. (1999) "A combinatorial approach to hybrid enzymes independent of DNA homology" Nature Biotech 17:1205. This approach can be used to generate an initial library of variants which can optionally serve as a substrate for one or more in vitro ox in vivo recombination methods. See, also, Ostermeier et al. (1999) "Combinatorial Protein Engineering by Incremental Truncation," Proc. Natl. Acad. Sci. USA, 96: 3562-67; and Ostermeier et al. (1999), "Incremental Truncation as a Strategy in the Engineering of Novel Biocatalysts," Biological and Medicinal Chemistry, 7: 2139-44. Mutational methods which result in the alteration of individual nucleotides or groups of contiguous or non-contiguous nucleotides can be favorably employed to introduce nucleotide diversity into the nucleic acid sequences and/or gene fusion constructs of the present invention. Many mutagenesis methods are found in the above-cited references; additional details regarding mutagenesis methods can be found in following, which can also be applied to the present invention. For example, error-prone PCR can be used to generate nucleic acid variants. Using this technique, PCR is performed under conditions where the copying fidelity of the DNA polymerase is low, such that a high rate of point mutations is obtained along the entire length of the PCR product. Examples of such techniques are found in the references above and, e.g., in Leung et al (1989) Technique 1 : 11-15 and Caldwell et al (1992) PCR Methods Applic. 2: 28-33. Similarly, assembly PCR can be used, in a process which involves the assembly of a PCR product from a mixture of small DNA fragments. A large number of different PCR reactions can occur in parallel in the same reaction mixture, with the products of one reaction priming the products of another reaction. Ohgonucleotide directed mutagenesis can be used to introduce site- specific mutations in a nucleic acid sequence of interest. Examples of such techniques are found in the references above and, e.g., in Reidhaar-Olson et al. (1988) Science 241 :53-57. Similarly, cassette mutagenesis can be used in a process that replaces a small region of a double stranded DNA molecule with a synthetic ohgonucleotide cassette that differs from the native sequence. The ohgonucleotide can contain, e.g., completely and/or partially randomized native sequence(s). Recursive ensemble mutagenesis is a process in which an algorithm for protein mutagenesis is used to produce diverse populations of phenotypically related mutants, members of which differ in amino acid sequence. This method uses a feedback mechanism to monitor successive rounds of combinatorial cassette mutagenesis. Examples of this approach are found in Arkin & Youvan (1992) Proc. Natl Acad. Sci. USA 89:7811-7815. Exponential ensemble mutagenesis can be used for generating combinatorial libraries with a high percentage of unique and functional mutants. Small groups of residues in a sequence of interest are randomized in parallel to identify, at each altered position, amino acids which lead to functional proteins. Examples of such procedures are found in Delegrave & Youvan (1993) Biotech. Res. 11:1548-1552. In vivo mutagenesis can be used to generate random mutations in any cloned DNA of interest by propagating the DNA, e.g., in a strain of E. coli that carries mutations in one or more of the DNA repair pathways. These "mutator" strains have a higher random mutation rate than that of a wild-type parent. Propagating the DNA in one of these strains will eventually generate random mutations within the DNA. Such procedures are described in the references noted above. Other procedures for introducing diversity into a genome, e.g. a bacterial, fungal, animal or plant genome can be used in conjunction with the above described and/or referenced methods. For example, in addition to the methods above, techniques have been proposed which produce nucleic acid multimers suitable for transformation into a variety of species (see, e.g., Schellenberger U.S. Patent No. 5,756,316 and the references above). Transformation of a suitable host with such multimers, consisting of genes that are divergent with respect to one another, (e.g., derived from natural diversity or through application of site directed mutagenesis, error prone PCR, passage through mutagenic bacterial strains, and the like), provides a source of nucleic acid diversity for DNA diversification, e.g., by an in vivo recombination process as indicated above. Alternatively, a multiplicity of monomeric polynucleotides sharing regions of partial sequence similarity can be transformed into a host species and recombined in vivo by the host cell. Subsequent rounds of cell division can be used to generate libraries, members of which, include a single, homogenous population, or pool of monomeric polynucleotides. Alternatively, the monomeric nucleic acids can be recovered by standard techniques, e.g., PCR and/or cloning, and recombined in any of the recombination formats, including recursive recombination formats, described above. Methods for generating multispecies expression libraries have been described (in addition to the references noted above, see, e.g., Peterson et αl (1998) U.S. Pat. No. 5,783,431 "METHODS FOR GENERATING AND SCREENING NOVEL METABOLIC PATHWAYS;" and Thompson, et αl. (1998) U.S. Pat. No. 5,824,485 METHODS FOR GENERATING AND SCREENING NOVEL METABOLIC PATHWAYS) and their use to identify protein activities of interest has been proposed (in addition to the references noted above, see, Short (1999) U.S. Pat. No. 5,958,672 "PROTEIN ACTIVITY SCREENING OF CLONES HAVENG DNA FROM UNCULTIVATED MICROORGANISMS"). Multispecies expression libraries include, in general, libraries comprising cDNA or genomic sequences from a plurality of species or strains, operably linked to appropriate regulatory sequences, in an expression cassette. The cDNA and/or genomic sequences are optionally randomly ligated to further enhance diversity. The vector can be a shuttle vector suitable for transformation and expression in more than one species of host organism, e.g., bacterial species or eukaryotic cells; In some cases, the library is biased by preselecting sequences which encode a protein of interest, or which hybridize to a nucleic acid of interest. Any such libraries can be provided as substrates for any of the methods herein described. The above described procedures have been largely directed to increasing nucleic acid and/or encoded protein diversity. However, in many cases, not all of the diversity is useful, e.g., functional, and contributes merely to increasing the background of variants that must be screened or selected to identify the few favorable variants. In some applications, it is desirable to preselect or prescreen libraries (e.g., an amplified library, a genomic library, a cDNA library, a normalized library, etc) or other substrate nucleic acids prior to diversification, e.g., by recombination-based mutagenesis procedures, or to otherwise bias the substrates towards nucleic acids that encode functional products. For example, in the case of antibody engineering, it is possible to bias the diversity generating process toward antibodies with functional antigen binding sites by taking advantage of z vivo recombination events prior to manipulation by any of the described methods. For example, recombined CDRs derived from B cell cDNA libraries can be amplified and assembled into framework regions (e.g., Jirholt et al. (1998) "Exploiting sequence space: shuffling in vivo formed complementarity determining regions into a master framework" Gene 215 : 471) prior to diversifying according to any of the methods described herein. Libraries can be biased towards nucleic acids which encode proteins with desirable enzyme activities. For example, after identifying a clone from a library which exhibits a specified activity, the clone can be mutagenized using any known method for introducing DNA alterations. A library comprising the mutagenized homologues is then screened for a desired activity, which can be the same as or different from the initially specified activity. An example of such a procedure is proposed in Short (1999) U.S. Patent No. 5,939,250 for "PRODUCTION OF ENZYMES HAVING DESIRED ACTIVITIES BY MUTAGENESIS." Desired activities can be identified by any method known in the art. For example, WO 99/10539 proposes that gene libraries can be screened by combining extracts from the gene library with components obtained from metabolically rich cells and identifying combinations which exhibit the desired activity. It has also been proposed (e.g., WO 98/58085) that clones with desired activities can be identified by inserting bioactive substrates into samples of the library, and detecting bioactive fluorescence corresponding to the product of a desired activity using a fluorescent analyzer, e.g., a flow cytometry device, a CCD, a fluorometer, or a spectrophotometer. Libraries can also be biased towards nucleic acids which have specified characteristics, e.g., hybridization to a selected nucleic acid probe. For example, WO 99/10539 proposes that polynucleotides encoding a desired activity (e.g., an enzymatic activity, for example: a lipase, an esterase, a protease, a glycosidase, a glycosyl transferase, a phosphatase, a kinase, an oxygenase, a peroxidase, a hydrolase, a hydratase, a nitrilase, a transaminase, an amidase or an acylase) can be identified from among genomic DNA sequences. In particular, single stranded DNA molecules from a population of genomic DNA are hybridized to a ligand-conjugated probe. The genomic DNA can be derived from either a cultivated or uncultivated microorganism, or from an environmental sample. Alternatively, the genomic DNA can be derived from a multicellular organism, or a tissue derived therefrom. Second strand synthesis can be conducted directly from the hybridization probe used in the capture, with or without prior release from the capture medium or by a wide variety of other strategies known in the art. Alternatively, the isolated single-stranded genomic DNA population can be fragmented without further cloning and used directly in, e.g., a recombination- based approach, that employs a single-stranded template, as described above. "Non-stochastic" methods of generating nucleic acids and polypeptides are described in Short "Non-Stochastic Generation of Genetic
Vaccines and Enzymes" WO 00/46344. These methods, including proposed non- stochastic polynucleotide reassembly and site-saturation mutagenesis methods can be applied to the present invention as well. Random or semi-random mutagenesis using doped or degenerate oligonucleotides is also described in, e.g., Arkin and Youvan (1992) "Optimizing nucleotide mixtures to encode specific subsets of amino acids for semi-random mutagenesis" Biotechnology 10:297-300; Reidhaar- Olson et al. (1991) "Random mutagenesis of protein sequences using ohgonucleotide cassettes" Methods Enzymol. 208:564-86; Lim and Sauer (1991) "The role of internal packing interactions in determining the structure and stability of a protein" J Mol. Biol. 219:359-76; Breyer and Sauer (1989) "Mutational analysis of the fine specificity of binding of monoclonal antibody 51F to lambda repressor" J Biol. Chem. 264: 13355-60); "Walk-Through Mutagenesis" (Crea, R; U.S. Patent Nos. 5,830,650 and 5,798,208, and EP Patent 0527809 Bl. It will be readily appreciated that any of the above described techniques suitable for enriching a library prior to diversification can also be used to screen the products, or libraries of products, produced by the diversity generating methods. Any of the above described methods can be practiced recursively or in combination to alter nucleic acids, e.g., GAT encoding polynucleotides. Kits for mutagenesis, library construction and other diversity generation methods are also commercially available. For example, kits are available from, e.g., Stratagene (e.g., QuickChange™ site-directed mutagenesis kit; and Chameleon™ double-stranded, site-directed mutagenesis kit); Bio/Can Scientific, Bio-Rad (e.g., using the Kunkel method described above); Boehringer Mannheim Corp.; Clonetech Laboratories; DNA Technologies; Epicentre Technologies (e.g., 5 prime 3 prime kit); Genpak Inc.; Lemargo Inc.; Life Technologies (Gibco BRL); New England Biolabs; Pharmacia Biotech; Promega Corp.; Quantum Biotechnologies; Amersham International pic (e.g., using the Eckstein method above); and Anglian Biotechnology Ltd (e.g., using the Carter/Winter method above). The above references provide many mutational formats, including recombination, recursive recombination, recursive mutation and combinations of recombination with other forms of mutagenesis, as well as many modifications of these formats. Regardless of the diversity generation format that is used, the nucleic acids of the present invention can be recombined (with each other, or with related (or even unrelated) sequences) to produce a diverse set of recombinant nucleic acids for use in the gene fusion constructs and modified gene fusion constructs of the present invention, including, e.g., sets of homologous nucleic acids, as well as corresponding polypeptides. Many of the above-described methodologies for generating modified polynucleotides generate a large number of diverse variants of a parental sequence or sequences. In some preferred embodiments of the invention the modification technique (e.g., some form of shuffling) is used to generate a library of variants that is then screened for a modified polynucleotide or pool of modified polynucleotides encoding some desired functional attribute, e.g., improved GAT activity. Exemplary enzymatic activities that can be screened for include catalytic rates (conventionally characterized in terms of kinetic constants such as kcat and KM), substrate specificity, and susceptibility to activation or inhibition by substrate, product or other molecules (e.g., inhibitors or activators). One example of selection for a desired enzymatic activity entails growing host cells under conditions that inhibit the growth and/or survival of cells that do not sufficiently express an enzymatic activity of interest, e.g. the GAT activity. Using such a selection process can eliminate from consideration all modified polynucleotides except those encoding a desired enzymatic activity. For example, in some embodiments of the invention host cells are maintained under conditions that inhibit cell growth or survival in the absence of sufficient levels of GAT, e.g., a concentration of glyphosate that is lethal or inhibits the growth of a wild-type plant of the same variety that either lacks or does not express a GAT polynucleotide. Under these conditions, only a host cell harboring a modified nucleic acid that encodes enzymatic activity or activities able to catalyze production of sufficient levels of the product will survive and grow. Some embodiments of the invention employ multiple rounds of screening at increasing concentrations of glyphosate or a glyphosate analog. hi some embodiments of the invention, mass spectrometry is used to detect the acetylation of glyphosate, or a glyphosate analog or metabolite. The use of mass spectrometry is described in more detail in the Examples below. For convenience and high throughput it will often be desirable to screen/select for desired modified nucleic acids in a microorganism, e.g., a bacteria such as E. coli. On the other hand, screening in plant cells or plants can in some cases be preferable where the ultimate aim is to generate a modified nucleic acid for expression in a plant system. h some preferred embodiments of the invention throughput is increased by screening pools of host cells expressing different modified nucleic acids, either alone or as part of a gene fusion construct. Any pools showing significant activity can be deconvoluted to identify single clones expressing the desirable activity. The skilled artisan will recognize that the relevant assay, screening or selection method will vary depending upon the desired host organism and other parameters known in the art. It is normally advantageous to employ an assay that can be practiced in a high-throughput format. In high-throughput assays, it is possible to screen up to several thousand different variants in a single day. For example, each well of a microtiter plate can be used to run a separate assay, or, if concentration or incubation time effects are to be observed, every 5-10 wells can test a single variant. In addition to fluidic approaches, it is possible, as mentioned above, simply to grow cells on media plates that select for the desired enzymatic or metabolic function. This approach offers a simple and high-throughput screening method. A number of well known robotic systems have also been developed for solution phase chemistries useful in assay systems. These systems include automated workstations like the automated synthesis apparatus developed by Takeda Chemical Industries, LTD. (Osaka, Japan) and many robotic systems utilizing robotic arms (Zymate II, Zymark Corporation, Hopkinton, MA; and Orca, Hewlett-Packard, Palo Alto, CA) which mimic the manual synthetic operations performed by a scientist. Any of the above devices are suitable for application to the present invention. The nature and implementation of modifications to these devices (if any) so that they can operate as discussed herein with reference to the integrated system will be apparent to persons skilled in the relevant art. High-throughput screening systems are commercially available (see, e.g., Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, etc.). These systems typically automate entire procedures including all sample and reagent pipetting, liquid dispensing, timed incubations, and final readings of the microplate in detector(s) appropriate for the particular assay. These configurable systems provide high throughput and rapid start up as well as a high degree of flexibility and customization. The manufacturers of such systems provide detailed protocols for the various high throughput devices. Thus, for example, Zymark Corp. provides technical bulletins describing screening systems for detecting the modulation of gene transcription, ligand binding, and the like. Microfluidic approaches to reagent manipulation have also been developed, e.g., by Caliper Technologies (Mountain View, CA). Optical images viewed (and, optionally, recorded) by a camera or other recording device (e.g., a photodiode and data storage device) are optionally further processed in any of the embodiments herein, e.g., by digitizing the image and/or storing and analyzing the image on a computer. A variety of commercially available peripheral equipment and software is available for digitizing, storing and analyzing a digitized video or digitized optical image, e.g., using PC (Intel x86 or Pentium chip compatible DOS™, OS™ WINDOWS™, WINDOWS NT™ or WINDOWS 95™ based machines), MACE TOSH™, or UNIX based (e.g., SUN™ work station) computers. One conventional system carries light from the assay device to a cooled charge-coupled device (CCD) camera, a common use in the art. A CCD camera includes an array of picture elements (pixels). The light from the specimen is imaged on the CCD. Particular pixels corresponding to regions of the specimen (e.g., individual hybridization sites on an array of biological polymers) are sampled to obtain light intensity readings for each position. Multiple pixels are processed in parallel to increase speed. The apparatus and methods of the invention are easily used for viewing any sample, e.g. by fluorescent or dark field microscopic techniques. OTHER POLYNUCLEOTIDE COMPOSITIONS The invention also includes compositions comprising two or more polynucleotides of the invention (e.g., as substrates for recombination). The composition can comprise a library of recombinant nucleic acids, where the library contains at least 2, 3, 5, 10, 20, or 50 or more polynucleotides. The polynucleotides are optionally cloned into expression vectors, providing expression libraries. The invention also includes compositions produced by digesting one or more polynucleotide of the invention with a restriction endonuclease, an RNAse, or a DNAse (e.g., as is performed in certain of the recombination formats noted above); and compositions produced by fragmenting or shearing one or more polynucleotide of the invention by mechanical means (e.g., sonication, vortexing, and the like), which can also be used to provide substrates for recombination in the methods above. Similarly, compositions comprising sets of oligonucleotides corresponding to more than one nucleic acid of the invention are useful as recombination substrates and are a feature of the invention. For convenience, these fragmented, sheared, or ohgonucleotide synthesized mixtures are referred to as fragmented nucleic acid sets. Also included in the invention are compositions produced by incubating one or more of the fragmented nucleic acid sets in the presence of ribonucleotide- or deoxyribonucelotide triphosphates and a nucleic acid polymerase. This resulting composition forms a recombination mixture for many of the recombination formats noted above. The nucleic acid polymerase may be an RNA polymerase, a DNA polymerase, or an RNA-directed DNA polymerase (e.g., a "reverse transcriptase"); the polymerase can be, e.g., a thermostable DNA polymerase (such as, VENT, TAQ, or the like).
INTEGRATED SYSTEMS The present invention provides computers, computer readable media and integrated systems comprising character strings corresponding to the sequence information herein for the polypeptides and nucleic acids herein, including, e.g., those sequences listed herein and the various silent substitutions and conservative substitutions thereof. For example, various methods and genetic algorithms (GAs) known in the art can be used to detect homology or similarity between different character strings, or can be used to perform other desirable functions such as to control output files, provide the basis for making presentations of information including the sequences and the like. Examples include BLAST, discussed supra. Thus, different types of homology and similarity of various stringency and length can be detected and recognized in the integrated systems described herein. For example, many homology determination methods have been designed for comparative analysis of sequences of biopolymers, for spell-checking in word processing, and for data retrieval from various databases. With an understanding of double-helix pair- wise complement interactions among 4 principal nucleobases in natural polynucleotides, models that simulate annealing of complementary homologous polynucleotide strings can also be used as a foundation of sequence alignment or other operations typically performed on the character strings corresponding to the sequences herein (e.g., word-processing manipulations, construction of figures comprising sequence or subsequence character strings, output tables, etc.). An example of a software package with GAs for calculating sequence similarity is BLAST, which can be adapted to the present invention by inputting character strings corresponding to the sequences herein. Similarly, standard desktop applications such as word processing software (e.g., Microsoft Word™ or Corel WordPerfect™) and database software (e.g., spreadsheet software such as Microsoft Excel™, Corel Quattro Pro™, or database programs such as Microsoft Access™ or Paradox™) can be adapted to the present invention by inputting a character string corresponding to the GAT homologues of the invention (either nucleic acids or proteins, or both). For example, the integrated systems can include the foregoing software having the appropriate character string information, e.g., used in conjunction with a user interface (e.g., a GUI in a standard operating system such as a Windows, Macintosh or LINUX system) to manipulate strings of characters. As noted, specialized alignment programs such as BLAST can also be incorporated into the systems of the invention for alignment of nucleic acids or proteins (or corresponding character strings). Integrated systems for analysis in the present invention typically include a digital computer with GA software for aligning sequences, as well as data sets entered into the software system comprising any of the sequences herein. The computer can be, e.g., a PC (Intel x86 or Pentium chip compatible DOS™, OS2™ WINDOWS™ WINDOWS NT™, WENDOWS95™, WE DOWS98™ LINUX based machine, a MACINTOSH™, Power PC, or a UNLX based (e.g., SUN™ work station) machine) or other commercially common computer which is known to one of skill. Software for aligning or otherwise manipulating sequences is available, or can easily be constructed by one of skill using a standard programming language such as Visualbasic, Fortran, Basic, Java, or the like. Any controller or computer optionally includes a monitor which is often a cathode ray tube ("CRT") display, a flat panel display (e.g., active matrix liquid crystal display, liquid crystal display), or others. Computer circuitry is often placed in a box wliich includes numerous integrated circuit chips, such as a microprocessor, memory, interface circuits, and others. The box also optionally includes a hard disk drive, a floppy disk drive, a high capacity removable drive such as a writeable CD-ROM, and other common peripheral elements. Inputting devices such as a keyboard or mouse optionally provide for input from a user and for user selection of sequences to be compared or otherwise manipulated in the relevant computer system. The computer typically includes appropriate software for receiving user instructions, either in the form of user input into set parameter fields, e.g., in a GUI, or in the form of preprogrammed instructions, e.g., preprogrammed for a variety of different specific operations. The software then converts these instructions to appropriate language for instructing the operation of the fluid direction and transport controller to carry out the desired operation. The software can also include output elements for controlling nucleic acid synthesis (e.g., based upon a sequence or an alignment of a sequences herein) or other operations which occur downstream from an alignment or other operation performed using a character string corresponding to a sequence herein. Nucleic acid synthesis equipment can, accordingly, be a component in one or more integrated systems herein. In an additional aspect, the present invention provides kits embodying the methods, composition, systems and apparatus herein. Kits of the invention optionally comprise one or more of the following: (1) an apparatus, system, system component or apparatus component as described herein; (2) instructions for practicing the methods described herein, and/or for operating the apparatus or apparatus components herein and/or for using the compositions herein; (3) one or more GAT compositions or components; (4) a container for holding components or compositions, and, (5) packaging materials. In a further aspect, the present invention provides for the use of any apparatus, apparatus component, composition or kit herein, for the practice of any method or assay herein, and/or for the use ofany apparatus or kit to practice any assay or method herein.
HOST CELLS AND ORGANISMS The host cell can be eukaryotic, for example, a eukaryotic cell, a plant cell, an animal cell, a protoplast, or a tissue culture cell. The host cell optionally comprises a plurality of cells, for example, an organism. Alternatively, the host cell can be prokaryotic including, but not limited to, bacteria (i.e., gram positive bacteria, purple bacteria, green sulfur bacteria, green non-sulfur bacteria, cyanobacteria, spirochetes, thermatogales, flavobacteria, and bacteroides) and archaebacteria (i.e., Korarchaeota, Theπnoproteus, Pyrodictium, Thermococcales, Methanogens, Archaeoglobus, and extreme Halophiles). Transgenic plants, or plant cells, incorporating the GAT nucleic acids, and/or expressing the GAT polypeptides of the invention are a feature of the invention. The transformation of plant cells and protoplasts can be carried out in essentially any of the various ways known to those skilled in the art of plant molecular biology, including, but not limited to, the methods described herein. See, in general, Methods in Enzymology, Vol. 153 (Recombinant DNA PartD) Wu and Grossman (eds.) 1987, Academic Press; and Weising et al, Ann. Rev. Genet. 22: 421-477 (1988), incorporated herein by reference. For example, the DNA construct may be introduced directly into the genomic DNA of the plant cell using techniques such as electroporation, PEG-mediated transfection, particle bombardment, silicon fiber delivery, or microinjection of plant cell protoplasts or embryogenic callus. See, e.g., Tomes, et al. (1995) "Direct DNA Transfer into Intact Plant Cells Via Microprojectile Bombardment," in Plant Cell, Tissue and Organ Culture, Fundamental Methods, eds. Gamborg and Phillips (Springer- Verlag, Berlin), pp.197-213. Further methods for transforming various host cells are disclosed in Klein et al. (1992) "Transformation of microbes, plants and animals by particle bombardment" Bio/Technol 10 (3): 286-291. The introduction of DNA constructs using polyethylene glycol precipitation is described in Paszkowski et al. (1984) EMBO J. 3:2717-2722. Electroporation techniques are described in Fromm et al. (1985) Proc. Natl. Acad. Sci. 82:5824. Ballistic transformation techniques are described in Klein et al. (1987) Nature 32 \ 0-73. Alternatively, the DNA constructs may be combined with suitable
T-DNA flanking regions and introduced into a conventional Agrobacterium tumefaciens host vector. The virulence functions of the Agrobacterium tumefaciens host will direct the insertion of the construct and adjacent marker into the plant cell DNA when the cell is infected by the bacteria. See, U.S. Patent No. 5,591,616. Agrobacterium tumefaciens-xxisdiXaXed transformation techniques are well described in the scientific literature. See, for example Horsch et αl. (1984) Science 233: 496-498, and Fraley et αl. (1983) Proc. Nαtl Acαd. Sci. 80:4803. For instance, Agrobacterium transformation of maize is described in U.S. Patent Nos. 5,550,318 and 5,981,840. Other methods of transformation include (1) Agrobacterium rhizogenes '-mediated transformation (see, e.g., Lichtenstein and Fuller hi: Genetic Engineering, Vol. 6, PWJ Rigby, ed., London, Academic Press, 1987; Lichtenstein, C. P., and Draper, J,. In: DNA Cloning, Vol. II, D. M. Glover, Ed., Oxford, ERI Press, 1985;WO 88/02405 describes the use of A. rhizogenes strain A4 and its Ri plasmid along with tumefaciens vectors pARC8 or pARC16); (2) liposome- mediated DNA uptake (see, e.g., Freeman et al. (1984) Plant Cell Physiol. 25:1353; (3) the vortexing method (see, e.g., Kindle (1990) Proc. Natl. Acad. Sci. USA 87:1228. DNA can also be introduced into plants by direct DNA transfer into pollen as described by Zhou et al (1983) Methods in Enzymology 101 :433; D. Hess (1987) Intern Rev. Cytol 107:367; and Luo et al. (1988) Plant Mol. Biol. Reporter 6:165. Expression of polypeptide coding nucleic acids can be obtained by injection of the DNA into reproductive organs of a plant as described by Pena et al. (1987) Nature 325:274. DNA can also be injected directly into the cells of immature embryos and the rehydration of desiccated embryos as described by Neuhaus et al. (1987) Tlieor. Appl. Genet. 75: 30; and Benbrook et al. (1986) in Proceedings Bio Expo 1986, Butterworth, Stoneham, Mass., pp. 27-54. Animal and lower eukaryotic (e.g., yeast) host cells are competent or rendered competent for transfection by various means. There are several well- known methods of introducing DNA into animal cells. These methods include: calcium phosphate precipitation; fusion of the recipient cells with bacterial protoplasts containing the DNA; treatment of the recipient cells with liposomes containing the DNA; DEAE dextran; electroporation; biolistics; and micro- injection of the DNA directly into the cells. The transfected cells are cultured by means well known in the art. See, Kuchler, R.J. (1977) Biochemical Methods in Cell Culture and Virology (Dowden, Hutchinson and Ross, Inc.). As used herein, the term "transformation" means alteration of the genotype of a host plant by the introduction of a nucleic acid sequence, e.g., a "heterologous" or "foreign" nucleic acid sequence. The heterologous nucleic acid sequence need not necessarily originate from a different source but it will, at some point, have been external to the cell into winch is introduced. In addition to Berger, Ausubel and Sambrook, useful general references for plant cell cloning, culture and regeneration include Jones, ed. (1995) Plant Gene Transfer and Expression Protocols— Methods in Molecular Biology, volume 49 (Humana Press, Towata, NJ); Payne et al. (1992) Plant Cell and Tissue Culture in Liquid Systems (John Wiley & Sons, Inc. New York, NY) ("Payne"); and Gamborg and Phillips, eds. (1995) Plant Cell, Tissue and Organ Culture; Fundamental Methods/ Springer Lab Manual, (Springer-Verlag, Berlin) ("Gamborg"). A variety of cell culture media are described in Atlas and Parks, eds. The Handbook of Microbiological Media_(CRC Press, Boca Raton, FL) ("Atlas"). Additional information for plant cell culture is found in available commercial literature such as the Life Science Research Cell Culture Catalogue (1998) from Sigma-Aldrich, Inc. (St Louis, MO) (Sigma-LSRCCC) and, e.g., the Plant Culture Catalogue and supplement (1997) also from Sigma-Aldrich, Inc. (St Louis, MO) (Sigma-PCCS). Additional details regarding plant cell culture are found in Croy, ed. (1993) Plant Molecular Biology (Bios Scientific Publishers, Oxford, UK). In an embodiment of this invention, recombinant vectors including one or more GAT polynucleotides, suitable for the transformation of plant cells are prepared. A DNA sequence encoding for the desired GAT polypeptide, e.g., selected from among SEQ LD NO: 516, 517, 518, 519, 520, 521, 52
525, 526 527, 528 529, 530, 531 532, 533 534, 535, 536, 537, 53 541, 542 543, 544545, 546, 547548, 549550, 551, 552, 553, 55 557, 558 559, 560561, 562, 563 564, 565 566, 567, 620, 622, 62 630, 632634, 636638, 640, 642644, 646 648, 650, 652, 654, 65 662, 664666, 668 670, 672, 674676, 678 680, 682, 684, 686, 68 694, 696698, 700702, 704, 706708, 710712,714,716,718,72 726, 728730, 732: 734, 736, 738; 740, 742744, 746, 748, 750, 75 758, 760762, 764768, 770, 772774, 776778, 780, 782, 784, 78
792, 794, 796, 798 800, 802, 804806, 808 810,812,814,816,81 824, 832, 834, 836838, 840, 842844, 846848, 850, 852, 854, 85 862, 864, 866, 868 870, 872, 874, 876, 878 880, 882, 884, 886, 88 894, 896, 898, 900902, 904, 906908, 910, 912,914,916,918,92
926, 928, 930, 932, 933, 934, 935, 936, 937, 938, 939, 940, 941, 94 945, 947, 949, 951, and 952, is conveniently used to construct a recombinant expression cassette which can be introduced into the desired plant. In the context of the present invention, an expression cassette will typically comprise a selected GAT polynucleotide operably linked to a promoter sequence and other transcriptional and translational initiation regulatory sequences which are sufficient to direct the transcription of the GAT sequence in the intended tissues (e.g., entire plant, leaves, roots, etc.) of the transformed plant. A number of promoters can be used in the practice of the present invention. The promoters can be selected based on the desired outcome. That is, the nucleic acids can be combined with constitutive, tissue-preferred, or other promoters for expression in plants. Constitutive promoters include, for example, the core promoter of the Rsyn7 promoter and other constitutive promoters disclosed in WO 99/43838 and U.S. Patent No. 6,072,050; the core CaMV 35S promoter (Odell et al. (1985) Nature 313:810-812); rice actin (McElroy et al. (1990) Plant Cell 2:163-171); ubiquitin (Christensen et al. (1989) Plant Mol. Biol. 12:619-632 and Christensen et al. (1992) Plant Mol. Biol. 18:675-689); pEMU (Last et al. (1991) Theor. Appl. Genet. 81:581-588); MAS (YelXen et al. (1984) EMBO J. 3:2723-2730); ALS promoter (U.S. Patent No. 5,659,026), and the like. Other constitutive promoters include, for example, those disclosed in U.S. Patent Nos. 5,608,149; 5,608,144; 5,604,121; 5,569,597; 5,466,785; 5,399,680; 5,268,463; 5,608,142; and 6,177,611. Chemical-regulated promoters can be used to modulate the expression of a gene in a plant through the application of an exogenous chemical regulator. Depending upon the objective, the promoter may be a chemical- inducible promoter, where application of the chemical induces gene expression, or a chemical-repressible promoter, where application of the chemical represses gene expression. Chemical-inducible promoters are known in the art and include, but are not limited to, the maize In2-2 promoter, which is activated by benzene sulfonamide herbicide safeners; the maize GST promoter, which is activated by hydrophobic electrophilic compounds that are used as pre-emergent herbicides; and the tobacco PR- la promoter, which is activated by salicylic acid. Other chemical-regulated promoters of interest include steroid-responsive promoters. See, for example, the glucocorticoid-inducible promoter in Schena et al. (1991) Proc. Natl. Acad. Sci. USA 88:10421-10425 and McNellis et al. (1998) Plant J. 14(2):247-257 and the tetracycline-inducible and tetracycline-repressible promoters for example, Gatz et al. (1991) Mol. Gen. Genet. 227:229-237, and U.S. Patent Nos. 5,814,618 and 5,789,156, herein incorporated by reference. Tissue-preferred promoters can also be utilized to target GAT expression within a particular plant tissue. Tissue-preferred promoters include those disclosed in Yamamoto et al. (1997) Plant J. 12(2):255-265; Kawamata et al. (1997) Plant Cell Physiol. 38(7):792-803; Hansen et al. (1997) Mol. Gen Genet. 254(3):337-343; Russell et al. (1997) Transgenic Res. 6(2).T57-168; Rinehart et al. (1996) Plant Physiol. 112(3):1331-1341; Van Camp et al. (1996) Plant Physiol. 112(2):525-535; Canevascini et al. (1996) Plant Physiol. 112(2):513-524; Yamamoto et al. (1994) Plant Cell Physiol. 35(5):773-778; Lam (1994) Results Probl Cell Differ. 20:181-196; Orozco et al. (1993) Plant Mol Biol. 23(6):1129- 1138; Matsuoka et al. (1993) Proc Natl. Acad. Sci. USA 90(20):9586-9590; and Guevara-Garcia et al. (1993) Plant J. 4(3):495-505. Such promoters can be modified, if necessary, for weak expression. Leaf-specific promoters are known in the art. See, for example, Yamamoto et al. (1997) Plant J. 12(2):255-265; Kwon et al. (1994) Plant Physiol. 105:357-67; Yamamoto et al. (1994) Plant Cell Physiol. 35(5):773-778; Gotor et al. (1993) Plant J. 3:509-18; Orozco et al. (1993) Plant Mol. Biol. 23(6):1129- 1138; and Matsuoka et al. (1993) Proc. Natl. Acad. Sci. USA 90(20):9586-9590. Root-preferred promoters are known and can be selected from the many available from the literature or isolated de novo from various compatible species. See, for example, Hire et al. (1992) Plant Mol Biol. 20(2):207-218
(soybean root-specific glutamine synthetase gene); Keller et al. (1991) Plant Cell 3(10):1051-1061 (root-specific control element in the GRP 1.8 gene of French bean); Sanger et α/. (1990) Plant Mol. Biol. 14(3):433-443 (root-specific promoter of the mannopine synthase (MAS) gene of Agrobacterium tumefaciens); and Miao et al. (1991) Plant Cell 3(1):11-22 (full-length cDNA clone encoding cytosolic glutamine synthetase (GS), which is expressed in roots and root nodules of soybean). See also Bogusz et al. (1990) Plant Cell 2(7):633-641, which discloses two root-specific promoters isolated from hemoglobin genes from the nitrogen- fixing nonlegume Parasponia andersonii and the related non-nitrogen-fixing nonlegume Trema tomentosa. The promoters of these genes were linked to a 13- glucuronidase reporter gene and introduced into both the nonlegume Nicotiana tabacum and the legume Lotus corniculatus, and in both instances root-specific promoter activity was preserved. Leach et al. (1991) describe their analysis of the promoters of the highly expressed rolC and rolD root-inducing genes of Agrobacterium rhizogenes (see Plant Science (Limerick) 79(l):69-76). They concluded that enhancer and tissue-preferred DNA determinants are dissociated in those promoters. Teeri et al. (1989) EMBO J. 8(2):343-350 used gene fusion to lacZ to show that the Agrobacterium T-DNA gene encoding octopine synthase is especially active in the epidermis of the root tip and that the TR2' gene is root specific in the intact plant and stimulated by wounding in leaf tissue, which is an especially desirable combination of characteristics for use with an insecticidal or larvicidal gene. The TRI' gene, fused to nptll (neomycin phosphotransferase II), showed similar characteristics. Additional root-preferred promoters include the VfENOD-GRP3 gene promoter (Kuster et al. (1995) Plant Mol. Biol. 29(4):759- 772); the ZRP2 promoter (U.S. Patent No. 5,633,636); the EFS1 promoter (U.S. Patent Application Serial No. 10/104,706) and the rolB promoter (Capana et al. (1994) Plant Mol Biol. 25(4):681-691). See also U.S. Patent Nos. 5,837,876; 5,750,386; 5,459,252; 5,401,836; 5,110,732; and 5,023,179. "Seed-preferred" promoters include both "seed-specific" promoters (those promoters active during seed development such as promoters of seed storage proteins) as well as "seed-germinating" promoters (those promoters active during seed germination). See Thompson et al. (1989) BioEssays 10:108, herein incorporated by reference. Such seed-preferred promoters include, but are not limited to, Ciml (cytokinin-induced message); cZ19Bl (maize 19 kDa zein); milps (myo-inositol-1 -phosphate synthase); and celA (cellulose synthase) (see U.S. Patent No. 6,225,529, herein incorporated by reference). Gamma-zein is an endosperm-specific promoter. Glob-1 is an embryo-specific promoter. For dicots, seed-specific promoters include, but are not limited to, bean -phaseolin, napin, β- conglycinin, soybean lectin, cruciferin, and the like. For monocots, seed-specific promoters include, but are not limited to, maize 15 kDa zein, 22 kDa zein, 27 kDa zein, g-zein, waxy, shrunken 1, shrunken 2, globulin 1, etc. See also WO 00/12733, which discloses seed-preferred promoters from endl and end2 genes; herein incorporated by reference. h particular, a strongly or weakly constitutive plant promoter that directs expression of a GAT nucleic acid in all tissues of a plant can be favorably employed. Such promoters are active under most environmental conditions and states of development or cell differentiation. In addition to the promoters mentioned above examples of constitutive promoters include the 1 '- or 2 '- promoter i Agrobacterium tumefaciens, and other transcription initiation regions from various plant genes known to those of skill. Where over expression of a GAT polypeptide of the invention is detrimental to the plant, one of skill will recognize that weak constitutive promoters can be used for low-levels of expression. Generally, by "weak promoter" a promoter that drives expression of a coding sequence at a low level is intended. By "low level" levels from about 1/1000 transcripts to about 1/100,000 transcripts, to about as low as 1/500,000 transcripts per cell are intended. Alternatively, it is recognized that weak promoters also include promoters that are expressed in only a few cells and not in others to give a total low level of expression. Where a promoter is expressed at unacceptably high levels, portions of the promoter sequence can be deleted or modified to decrease expression levels. In those cases where high levels of expression is not harmful to the plant, a strong promoter, e.g., a t-RNA, or other pol III promoter, or a strong pol π promoter, (e.g., the cauliflower mosaic virus promoter, CaMV, 35S promoter) can be used. Alternatively, a plant promoter can be under environmental control. Such promoters are referred to as "inducible" promoters. Examples of enviromnental conditions that may alter transcription by inducible promoters include pathogen attack, anaerobic conditions, or the presence of light, h some cases, it is desirable to use promoters that are "tissue-specific" and/or are under developmental control such that the GAT polynucleotide is expressed only in certain tissues or stages of development, e.g., leaves, roots, shoots, etc. Endogenous promoters of genes related to herbicide tolerance and related phenotypes are particularly useful for driving expression of GAT nucleic acids, e.g., P450 monooxygenases, glutathione-S-transferases, homoglutathione-S- transferases, glyphosate oxidases and 5-enolpyruvylslnkimate-2-phosphate synthases. Tissue specific promoters can also be used to direct expression of heterologous structural genes, including the GAT polynucleotides described herein. Thus the promoters can be used in recombinant expression cassettes to drive expression ofany gene whose expression is desirable in the transgenic plants of the invention, e.g., GAT and/or other genes conferring herbicide resistance or tolerance, genes which influence other useful characteristics, e.g., heterosis. Similarly, enhancer elements, e.g., derived from the 5' regulatory sequences or intron of a heterologous gene, can also be used to improve expression of a heterologous structural gene, such as a GAT polynucleotide. hi general, the particular promoter used in the expression cassette in plants depends on the intended application. Any of a number of promoters which direct transcription in plant cells can be suitable. The promoter can be either constitutive or inducible. h addition to the promoters noted above, promoters of bacterial origin which operate in plants include the octopine synthase promoter, the nopaline synthase promoter and other promoters derived from Ti plasmids. See, Herrera-Estrella et al. (1983) Nature 303:209. Viral promoters include the 35S and 19S RNA promoters of CaMV. See, Odell et al. (1985) Nature 313:810. Other plant promoters include the ribulose-l,3-bisphosphate carboxylase small subunit promoter and the phaseolin promoter. The promoter sequence from the E8 gene (see, Deikman and Fischer (1988) EMBO J7:3315) and other genes are also favorably used. Promoters specific for monocotyledonous species are also considered (McElroy and Brettell (1994) "Foreign gene expression in transgenic cereals" Trends Biotech. 12:62-68.) Alternatively, novel promoters with useful characteristics can be identified from any viral, bacterial, or plant source by methods, including sequence analysis, enhancer or promoter trapping, and the like, known in the art. In preparing expression vectors of the invention, sequences other than the promoter and the GAT encoding gene are also favorably used. If proper polypeptide expression is desired, a polyadenylation region can be derived from the natural gene, from a variety of other plant genes, or from T-DNA.
Signal/localization peptides, which, e.g., facilitate translocation of the expressed polypeptide to internal organelles (e.g., chloroplasts) or extracellular secretion, can also be employed. The vector comprising the GAT polynucleotide also can include a marker gene which confers a selectable phenotype on plant cells. For example, the marker may encode biocide tolerance, particularly antibiotic tolerance, such as tolerance to kanamycin, G418, bleomycin, hygromycin, or herbicide tolerance, such as tolerance to chlorosulfuron, or phophinothricin. Reporter genes, which are used to monitor gene expression and protein localization via visualizable reaction products (e.g., beta-glucuronidase, beta-galactosidase, and chloramphenicol acetyltransferase) or by direct visualization of the gene product itself (e.g. , green fluorescent protein, GFP; Sheen et al. (1995) The Plant Journal 8:777) can be used for, e.g., monitoring transient gene expression in plant cells. Transient expression systems can be employed in plant cells, for example, in screening plant cell cultures for herbicide tolerance activities. PLANT TRANSFORMATION Protoplasts Numerous protocols for establishment of transformable protoplasts from a variety of plant types and subsequent transformation of the cultured protoplasts are available in the art and are incorporated herein by reference. For examples, see, Hashimoto et al. (1990) Plant Physiol. 93: 857; Fowke and
Constabel, eds. (1994) Plant Protoplasts; Saunders et al. (1993) Applications of
Plant In vitro Technology Symposium, UPM 16-18; and Lyznik et al. (1991)
BioTechniques 10:295, each of which is incorporated herein by reference. Chloroplasts Chloroplasts are a site of action of some herbicide tolerance activities, and, in some instances, the GAT polynucleotide is fused to a chloroplast transit sequence peptide to facilitate translocation of the gene products into the chloroplasts. In these cases, it can be advantageous to transform the GAT polynucleotide into the chloroplasts of the plant host cells. Numerous methods are available in the art to accomplish chloroplast transformation and expression (e.g.,
Daniell et al. (1998) Nature Biotech. 16:346; O'Neill et al. (1993) The Plant
Journal 3:729; and Maliga (1993) TIBTECH 11:1). The expression construct comprises a transcriptional regulatory sequence functional in plants operably linked to a polynucleotide encoding the GAT polypeptide. Expression cassettes that are designed to function in chloroplasts (such as an expression cassette including a GAT polynucleotide) include the sequences necessary to ensure expression in chloroplasts. Typically, the coding sequence is flanked by two regions of homology to the chloroplastid genome to effect a homologous recombination with the chloroplast genome; often a selectable marker gene is also present within the flanking plastid DNA sequences to facilitate selection of genetically stable transformed chloroplasts in the resultant transplastonic plant cells (see, e.g., Maliga (1993) and Daniell (1998) supra, and references cited therein). General transformation methods DNA constructs of the invention can be introduced into the genome of the desired plant host by a variety of conventional techniques. Techniques for transforming a wide variety of higher plant species are well known and described in the technical and scientific literature. See, e.g., Payne, Gamborg, Croy, Jones, etc. all supra, as well as, e.g., Weising et al. (1988) Ann. Rev. Genet. 22:421 and U.S. Patent Nos. 5,889,191, 5,889,190, 5,866,785, 5,589,367 and 5,316,931, herein incorporated by reference. A variety of other transformation protocols are contemplated in the present invention. Transformation protocols as well as protocols for introducing nucleotide sequences into plants may vary depending on the type of plant or plant cell, i.e., monocot or dicot, targeted for transformation. Suitable methods of introducing nucleotide sequences into plant cells and subsequent insertion into the plant genome include microinjection (Crossway et al. (1986) Biotechniques 4:320- 334), electroporation (Riggs et al. (1986) Proc. Natl. Acad. Sci. USA 83:5602- 5606), ^groόαcterø#n-mediated transformation (U.S. Patent Nos. 5,563,055 and 5,981,840), direct gene transfer (Paszkowski et al. (1984) EMBO J. 3:2717-2722), and ballistic particle acceleration (see, for example, U.S. Patent Nos. 4,945,050; U.S. Patent No. 5,879,918; 5,886,244; 5,932,782; Tomes et al. (1995) "Direct
DNA Transfer into Intact Plant Cells via Microprojectile Bombardment," in Plant Cell, Tissue, and Organ Culture: Fundamental Methods, Eds., Gamborg and Phillips (Springer- Verlag, Berlin); McCabe et al. (1988) Biotechnology 6:923- 926); and Led transformation (WO 00/28058). See also, Weissinger et αl. (1988) Ann. Rev. Genet. 22:421-477; Sanford et αl. (1987) Particulate Science and Technology 5:27-37 (onion); Christou et al. (1988) Plant Physiol 87:671-674 (soybean); McCabe et al. (1988) Bio/Technology 6:923-926 (soybean); Finer and McMullen (1991) In vitro Cell Dev. Biol. 27P:175-182 (soybean); Singh et al. (1998) Theor. Appl. Genet. 96:319-324 (soybean); Datta et al. (1990) Biotechnology 8:736-740 (rice); Klein et al. (1988) Proc. Natl. Acad. Sci. USA 85:4305-4309 (maize); Klein et al. (1988) Biotechnology 6:559-563 (maize); U.S. Patent Nos. 5,240,855; 5,322,783 and 5,324,646; Klein et al. (1988) Plant Physiol. 91:440-444 (maize); Fromm et al. (1990) Biotechnology 8:833-839 (maize); Hooykaas-Van Slogteren et al. (1984) Nature (London) 311:763-764; U.S. Patent No. 5,736,369 (cereals); Bytebier et al. (1987) Proc. Natl. Acad. Sci. USA 84:5345-5349 (Liliaceae); De Wet et al. (1985) in The Experimental Manipulation of Ovule Tissues, Eds., Chapman et al. (Longman, New York), pp. 197-209
(pollen); Kaeppler et al. (1990) Plant Cell Reports 9:415-418 and Kaeppler et al. (1992) Theor. Appl. Genet. 84:560-566 (whisker-mediated transformation); D'Halluin et al. (1992) Plant Cell 4:1495-1505 (electroporation); Li et al. (1993) Plant Cell Reports 12:250-255 and Christou and Ford (1995) Annals of Botany 75:407-413 (rice); Osjoda et al. (1996) Nature Biotechnology 14:745-750 (maize via Agrobacterium tumefaciens); all of which are herein incorporated by reference. For example, DΝAs can be introduced directly into the genomic DΝA of a plant cell using techniques such as electroporation and microinjection of plant cell protoplasts, or the DΝA constructs can be introduced directly to plant tissue using ballistic methods, such as DΝA particle bombardment. Alternatively, the DΝA constructs can be combined with suitable T-DΝA flanking regions and introduced into a conventional Agrobacterium tumefaciens host vector. The virulence functions of the Agrobacterium host will direct the insertion of the construct and adjacent marker into the plant cell DΝA when the plant cell is infected by the bacteria. Microinjection techniques are known in the art and well described in the scientific and patent literature. The introduction of DΝA constructs using polyethylene glycol precipitation is described in Paszkowski et al (1984) EMBO J 3:2717. Electroporation techniques are described in Fromm et al. (1985) Proc NatT Acad Sci USA 82:5824. Ballistic transformation techniques are described in Klein et al. (1987) Nature 327:70; and Weeks et al. Plant Physiol 102:1077. In some embodiments, Agrobacterium mediated transformation techniques are used to transfer the GAT sequences of the invention to transgenic plants. Agrobacterium-mediated transformation is widely used for the transformation of dicots, however, certain monocots can also be transformed by Agrobacterium. For example, Agrobacterium transformation of rice is described by Hiei et al. (1994) Plant J. 6 :271 ; US Patent No. 5, 187,073 ; US Patent No. 5,591,616; Li et al. (1991) Science in China 34:54; and Raineri et al. (1990) Bio/Technology 8: 33. Transformed maize, barley, triticale and asparagus by Agrobacterium mediated transformation have also been described (Xu et al. (1990) C me.se J.90t 2:81). Agrobacterium mediated transformation techniques take advantage of the ability of the tumor-inducing (Ti) plasmid of A. tumefaciens to integrate into a plant cell genome, to co-transfer a nucleic acid of interest into a plant cell. Typically, an expression vector is produced wherein the nucleic acid of interest, such as a GAT polynucleotide of the invention, is ligated into an autonomously replicating plasmid which also contains T-DNA sequences. T-DNA sequences typically flank the expression cassette nucleic acid of interest and comprise the integration sequences of the plasmid. In addition to the expression cassette, T- DNA also typically includes a marker sequence, e.g., antibiotic resistance genes. The plasmid with the T-DNA and the expression cassette are then transfected into Agrobacterium cells. Typically, for effective transformation of plant cells, the A. tumefaciens bacterium also possesses the necessary vir regions on a plasmid, or integrated into its chromosome. For a discussion of Agrobacterium mediated transformation, see, Firoozabady and Kuehnle, (1995) in Plant Cell Tissue and Organ Culture Fundamental Methods, eds. Gamborg and Phillips. hi certain embodiments the polynucleotides of the present invention can be stacked with any combination of polynucleotide sequences of interest in order to create plants with a desired phenotype. For example, the polynucleotides of the present invention may be stacked with any other polynucleotides encoding polypeptides having pesticidal and/or insecticidal activity, such as Bacillus thuringiensis toxic proteins (described in U.S. Patent Nos. 5,366,892; 5,747,450; 5,737,514; 5,723,756; 5,593,881; and Geiser et al. (1986) Gene 48:109), lectins (Van Damme et al. (1994) Plant Mol. Biol. 24:825, pentin (described in U.S.
Patent No. 5,981,722), and the like. The combinations generated can also include multiple copies ofany one of the polynucleotides of interest. The polynucleotides of the present invention can also be stacked with any other gene or combination of genes to produce plants with a variety of desired trait combinations including, but not limited to, traits desirable for animal feed such as high oil genes (e.g., U.S.
Patent No. 6,232,529); balanced amino acids (e.g., hordothionins (U.S. Patent Nos. 5,990,389; 5,885,801; 5,885,802; and 5,703,409); barley high lysine (Williamson et al. (1987) Eur. J. Biochem. 165:99-106; and WO 98/20122) and high methionine proteins (Pedersen et al. (1986) J Biol. Chem. 261:6279; Kirihara et al. (1988) Gene 71:359; and Musumura et al. (1989) Plant Mol. Biol. 12:123)); increased digestibility (e.g., modified storage proteins (U.S. Application Serial No. 10/053,410, filed November 7, 2001); and thioredoxins (U.S. Application Serial No. 10/005,429, filed December 3, 2001)); the disclosures of which are herein incorporated by reference. The polynucleotides of the present invention can also be stacked with traits desirable for disease or herbicide resistance (e.g., fumonisin detoxification genes (U.S. Patent No. 5,792,931); avirulence and disease resistance genes (Jones et al. (1994) Science 266:789; Martin et al. (1993) Science 262:1432; Mindrinos et al. (1994) Cell 78:1089); acetolactate synthase (ALS) mutants that lead to herbicide resistance such as the S4 and/or Hra mutations; inhibitors of glutamine synthase such as phosphinothricin or basta (e.g., bar gene); and glyphosate resistance (EPSPS gene)); and traits desirable for processing or process products such as high oil (e.g., U.S. Patent No. 6,232,529 ); modified oils (e.g., fatty acid desaturase genes (U.S. Patent No. 5,952,544; WO 94/11516)); modified starches (e.g., ADPG pyrophosphorylases (AGPase), starch synthases (SS), starch branching enzymes (SBE), and starch debranching enzymes (SDBE)); and polymers or bioplastics (e.g., U.S. Patent No. 5.602,321; beta-ketothiolase, polyhydroxybutyrate synthase, and acetoacetyl-CoA reductase (Schubert et al (1988) J Bacteriol. 170:5837- 5847) facilitate expression of polyhydroxyalkanoates (PHAs)); the disclosures of which are herein incorporated by reference. One could also combine the polynucleotides of the present invention with polynucleotides providing agronomic traits such as male sterility (e.g., see U.S. Patent No. 5.583,210), stalk strength, flowering time, or transformation technology traits such as cell cycle regulation or gene targeting (e.g., WO 99/61619, WO 00/17364, and WO 99/25821); the disclosures of which are herein incorporated by reference. These stacked combinations can be created by any method including, but not limited to, cross-breeding plants by any conventional or TopCross methodology, or genetic transformation. If the traits are stacked by genetically transforming the plants, the polynucleotide sequences of interest can be combined at any time and in any order. For example, a transgenic plant comprising one or more desired traits can be used as the target to introduce further traits by subsequent transformation. The traits can be introduced simultaneously in a co- transformation protocol with the polynucleotides of interest provided by any combination of transformation cassettes. For example, if two sequences will be introduced, the two sequences can be contained in separate transformation cassettes (trans) or contained on the same transformation cassette (cis). Expression of the sequences can be driven by the same promoter or by different promoters, h certain cases, it may be desirable to introduce a transformation cassette that will suppress the expression of the polynucleotide of interest. This may be combined with any combination of other suppression cassettes or overexpression cassettes to generate the desired combination of traits in the plant. It is further recognized that polynucleotide sequences can be stacked at a desired genomic location using a site- specific recombination system. See, for example, WO99/25821, WO99/25854, WO99/25840, WO99/25855, and WO99/25853, all of which are herein incorporated by reference. Regeneration of Transgenic Plants Transformed plant cells which are derived by plant transformation techniques, including those discussed above, can be cultured to regenerate a whole plant which possesses the transformed genotype (i.e., a GAT polynucleotide), and thus the desired phenotype, such as acquired resistance (i.e., tolerance) to glyphosate or a glyphosate analog. Such regeneration techniques rely on manipulation of certain phytohormones in a tissue culture growth medium, typically relying on a biocide and/or herbicide marker which has been introduced together with the desired nucleotide sequences. For transformation and regeneration of maize see, Gordon-Kamm et al, The Plant Cell, 2:603-618 (1990). Alternatively, selection for glyphosate resistance conferred by the GAT polynucleotide of the invention can be performed. Plant regeneration from cultured protoplasts is described in Evans et al. (1983) Protoplasts Isolation and Culture, Handbook of Plant Cell Culture, pp 124- 176, Macmillan Publishing Company, New York; and Binding (1985) Regeneration of Plants, Plant Protoplasts pp 21-73, CRC Press, Boca Raton. Regeneration can also be obtained from plant callus, explants, organs, or parts thereof. Such regeneration techniques are described generally in Klee et al. (1987) Ann Rev of Plant Phys 38:467. See also, e.g., Payne and Gamborg. Transformed plant cells, calli or explant can be cultured on regeneration medium in the dark for several weeks, generally about 1 to 3 weeks to allow the somatic embryos to mature. Preferred regeneration media include media containing MS salts. The plant cells, calli or explant are then typically cultured on rooting medium in a light/dark cycle until shoots and roots develop. Methods for plant regeneration are known in the art and preferred methods are provided by Kamo et al, (Bo Gaz. 146(3):324-334, 1985); West et al, (The Plant Cell 5:1361-1369, 1993); and Duncan et al. ( lanta 165:322-332, 1985). Small plantlets can then be transferred to tubes containing rooting medium and allowed to grow and develop more roots for approximately another week. The plants can then be transplanted to soil mixture in pots in the greenhouse. The regeneration of plants containing the foreign gene introduced by Agrobacterium can be achieved as described by Horsch et al, Science, 227:1229-1231 (1985) and Fraley et al, Proc. Natl. Acad. Sci. U.S.A., 80:4803 (1983). This procedure typically produces shoots within two to four weeks and these transformant shoots are then transferred to an appropriate root-inducing medium containing the selective agent and an antibiotic to prevent bacterial growth. Transgenic plants of the present invention may be fertile or sterile. Regeneration can also be obtained from plant callus, explants, organs, or parts thereof. Such regeneration techniques are described generally in Klee et al, Ann. Rev. of Plant Phys. 38:467-486 (1987). The regeneration of plants from either single plant protoplasts or various explants is well known in the art. See, for example, Methods for Plant Molecular Biology, A. Weissbach and H. Weissbach, eds., Academic Press, Inc., San Diego, Calif. (1988). For maize cell culture and regeneration see generally, The Maize Handbook, Freeling and Walbot, eds., Springer, New York (1994); Corn and Corn Improvement, 3rd Ed., Sprague and Dudley eds., American Society of Agronomy, Madison, Wisconsin (1988). After transformation with Agrobacterium, the explants typically are transferred to selection medium. One of skill will realize that the selection medium depends on the selectable marker that was co-transfected into the explants. After a suitable length of time, transformants will begin to form shoots. After the shoots are about 1-2 cm in length, the shoots should be transferred to a suitable root and shoot medium. Selection pressure should be maintained in the root and shoot medium. Typically, the transformants will develop roots in about 1-2 weeks and form plantlets. After the plantlets are about 3-5 cm in height, they are placed in sterile soil in fiber pots. Those of skill in the art will realize that different acclimation procedures are used to obtain transformed plants of different species. For example, after developing a root and shoot, cuttings, as well as somatic embryos of transformed plants, are transferred to medium for establishment of plantlets. For a description of selection and regeneration of transformed plants, see, e.g., Dodds and Roberts (1995) Experiments in Plant Tissue Culture. 3rd Ed., Cambridge University Press. There are also methods for Agrobacterium transformation of Arabidopsis using vacuum infiltration (Bechtold N., Ellis J. and Pelletier G„ 1993, In planta Agrobacterium mediated gene transfer by infiltration of adult Arabidopsis thaliana plants. CR Acad Sci Paris Life Sci 316:1194-1199) and simple dipping of flowering plants (Desfeux, C, Clough S.J., and Bent A.F., 2000, Female reproductive tissues are the primary target of Agrobacterium-mediated transformation by the Arabidopsis floral-dip method. Plant Physiol. 123:895-904). Using these methods, transgenic seed are produced without the need for tissue culture. There are plant varieties for which effective Agrobacterium- mediated transformation protocols have yet to be developed. For example, successful tissue transformation coupled with regeneration of the transformed tissue to produce a transgenic plant has not been reported for some of the most commercially relevant cotton cultivars. Nevertheless, an approach that can be used with these plants involves stably introducing the polynucleotide into a related plant variety via Agrobacterium-mediated transformation, confirming operability, and then transferring the transgene to the desired commercial strain using standard sexual crossing or back-crossing techniques. For example, in the case of cotton, Agrobacterium can be used to transform a Coker line of Gossypium hirustum (e.g., Coker lines 310, 312, 5110 Deltapine 61 or StoneviUe 213), and then the transgene can be introduced into another more commercially relevant G. hirustum cultivar by back-crossing. The transgenic plants of this invention can be characterized either genotypically or phenotypically to determine the presence of the GAT polynucleotide of the invention. Genotypic analysis can be performed by any of a number of well-known techniques, including PCR amplification of genomic DNA and hybridization of genomic DNA with specific labeled probes. Phenotypic analysis includes, e.g., survival of plants or plant tissues exposed to a selected herbicide such as glyphosate. One of skill will recognize that after the expression cassette containing the GAT gene is stably incorporated in transgenic plants and confirmed to be operable, it can be introduced into other plants by sexual crossing. Any of a number of standard breeding techniques can be used, depending upon the species to be crossed. In vegetatively propagated crops, mature transgenic plants can be propagated by the taking of cuttings or by tissue culture techniques to produce multiple identical plants. Selection of desirable transgenics is made and new varieties are obtained and propagated vegetatively for commercial use. h seed propagated crops, mature transgenic plants can be self crossed to produce a homozygous inbred plant. The inbred plant produces seed containing the newly introduced heterologous nucleic acid. These seeds can be grown to produce plants that would produce the selected phenotype. Parts obtained from the regenerated plant, such as flowers, seeds, leaves, branches, fruit, and the like are included in the invention, provided that these parts comprise cells comprising the isolated GAT nucleic acid. Progeny and variants, and mutants of the regenerated plants are also included within the scope of the invention, provided that these parts comprise the introduced nucleic acid sequences. Transgenic plants expressing a selectable marker can be screened for transmission of the GAT nucleic acid, for example, by standard immunoblot and DNA detection techniques. Transgenic lines are also typically evaluated on levels of expression of the heterologous nucleic acid. Expression at the RNA level can be determined initially to identify and quantitate expression-positive plants. Standard techniques for RNA analysis can be employed and include PCR amplification assays using ohgonucleotide primers designed to amplify only the heterologous RNA templates and solution hybridization assays using heterologous nucleic acid-specific probes. The RNA-positive plants can then be analyzed for protein expression by Western immunoblot analysis using the specifically reactive antibodies of the present invention. In addition, in situ hybridization and immunocytochemistry according to standard protocols can be done using heterologous nucleic acid specific polynucleotide probes and antibodies, respectively, to localize sites of expression within transgenic tissue. Generally, a number of transgenic lines are usually screened for the incorporated nucleic acid to identify and select plants with the most appropriate expression profiles. A preferred embodiment is a transgenic plant that is homozygous for the added heterologous nucleic acid; i.e., a transgenic plant that contains two added nucleic acid sequences, one gene at the same locus on each chromosome of a chromosome pair. A homozygous transgenic plant can be obtained by sexually mating (selfing) a heterozygous transgenic plant that contains a single added heterologous nucleic acid, germinating some of the seed produced and analyzing the resulting plants produced for altered cell division relative to a control plant (i.e., native, non-transgenic). Back-crossing to a parental plant and out-crossing with a non-transgenic plant are also contemplated. Essentially any plant can be transformed with the GAT polynucleotides of the invention. Suitable plants for the transformation and expression of the novel GAT polynucleotides of this invention include agronomically and horticulturally important species. Such species include, but are not restricted to members of the families: Graminae (including corn, rye, triticale, barley, millet, rice, wheat, oats, etc); Leguminosae (including pea, beans, lentil, peanut, yam bean, cowpeas, velvet beans, soybean, clover, alfalfa, lupine, vetch, lotus, sweet clover, wisteria, and sweetpea); Compositae (the largest family of vascular plants, including at least 1,000 genera, including important commercial crops such as sunflower); and Rosaciae (including raspberry, apricot, almond, peach, rose, etc.); as well as nut plants (including, walnut, pecan, hazelnut, etc.); and forest trees (including Pinus, Quercus, Pseutotsuga, Sequoia, Populus.etc.) Additional targets for modification by the GAT polynucleotides of the invention, as well as those specified above, include plants from the genera: Agrostis, Allium, Antirrhinum, Apium, Arachis, Asparagus, Atropa, Avena (e.g., oats), Bambusa, Brassica, Bromus, Browaalia, Camellia, Cannabis, Capsicum, Cicer, Chenopodium, Chichorium, Citrus, Coffea, Coix, Cucumis, Curcubita, Cynodon, Dactylis, Datura, Daucus, Digitalis, Dioscorea, Elaeis, Eleusine, Festuca, Fragaria, Geranium, Gossypium, Glycine, Helianthus, Heterocallis, Hevea, Hordeum (e.g., barley), Hyoscyamus, Ipomoea, Lactuca, Lens, Lilium, Linum, Lolium, Lotus, Lycopersicon, Majorana, Malus, Mangifera, Manihot, Medicago, Nemesia, Nicotiana, Onobrychis, Oryza (e.g., rice), Panicum, Pelargonium, Pennisetum (e.g., millet), Petunia, Pisum, Phaseolus, Phleurn, Poa, Prunus, Ranunculus, Raphanus, Ribes, Ricinus, Rubus, Saccharum, Salpiglossis, Secale (e.g., rye), Senecio, Setaria, Sinapis, Solarium, Sorghum, Stenotaphrum, Theobroma, Trifolium, Trigonella, Triticum (e.g., wheat), Vicia, Vigna, Vitis, Zea (e.g., corn), and the Olyreae, the Pharoideae and many others. As noted, plants in the family Graminae are particularly desirable target plants for the methods of the invention. Common crop plants which are targets of the present invention include corn, rice, triticale, rye, cotton, soybean, sorghum, wheat, oats, barley, millet, sunflower, canola, peas, beans, lentils, peanuts, yam beans, cowpeas, velvet beans, clover, alfalfa, lupine, vetch, lotus, sweet clover, wisteria, sweetpea and nut plants (e.g., walnut, pecan, etc). h one aspect, the invention provides a method for producing a crop by growing a crop plant that is glyphosate-tolerant as a result of being transformed with a gene encoding a glyphosate N-acetyltransferase, under conditions such that the crop plant produces a crop, and harvesting the crop. Preferably, glyphosate is applied to the plant, or in the vicinity of the plant, at a concentration effective to control weeds without preventing the transgenic crop plant from growing and producing the crop. The application of glyphosate can be before planting, or at any time after planting up to and including the time of harvest. Glyphosate can be applied once or multiple times. The timing of glyphosate application, amount applied, mode of application, and other parameters will vary based upon the specific nature of the crop plant and the growing environment, and can be readily determined by one of skill in the art. The invention further provides a crop produced by this method. The invention provides for the propagation of a plant containing a GAT polynucleotide transgene. The plant can be, for example, a monocot or a dicot. In one aspect, propagation entails crossing a plant containing a GAT polynucleotide transgene with a second plant, such that at least some progeny of the cross display glyphosate tolerance. hi one aspect, the invention provides a method for selectively controlling weeds in a field where a crop is being grown. The method involves planting crop seeds or plants that are glyphosate-tolerant as a result of being transformed with a gene encoding a GAT, e.g., a GAT polynucleotide, and applying to the crop and any weeds a sufficient amount of glyphosate to control the weeds without a significant adverse impact on the crop. It is important to note that it is not necessary for the crop to be totally insensitive to the herbicide, so long as the benefit derived from the inhibition of weeds outweighs any negative impact of the glyphosate or glyphosate analog on the crop or crop plant. h another aspect, the invention provides for use of a GAT polynucleotide as a selectable marker gene. In this embodiment of the invention, the presence of the GAT polynucleotide in a cell or organism confers upon the cell or organism the detectable phenotypic trait of glyphosate resistance, thereby allowing one to select for cells or organisms that have been transformed with a gene of interest linked to the GAT polynucleotide. Thus, for example, the GAT polynucleotide can be introduced into a nucleic acid construct, e.g., a vector, thereby allowing for the identification of a host (e.g. , a cell or transgenic plant) containing the nucleic acid construct by growing the host in the presence of glyphosate and selecting for the ability to survive and/or grow at a rate that is discernibly greater than a host lacking the nucleic acid construct would survive or grow. A GAT polynucleotide can be used as a selectable marker in a wide variety of hosts that are sensitive to glyphosate, including plants, most bacteria (including E. coli), actinomycete, yeasts, algae and fungi. One benefit of using herbicide resistance as a marker in plants, as opposed to conventional antibiotic resistance, is that it obviates the concern of some members of the public that antibiotic resistance might escape into the environment. Some experimental data from experiments demonstrating the use of a GAT polynucleotide as a selectable marker in diverse host systems are described in the Examples section of this specification.
Selection of GAT polynucleotides conferring enhanced glyphosate resistance in transgenic plants. Libraries of GAT encoding nucleic acids diversified according to the methods described herein can be selected for the ability to confer resistance to glyphosate in transgenic plants. Following one or more cycles of diversification and selection, the modified GAT genes can be used as a selection marker to facilitate the production and evaluation of transgenic plants and as a means of conferring herbicide resistance in experimental or agricultural plants. For example, after diversification ofany one or more of, e.g., SEQ ED NO:l - 5 to produce a library of diversified GAT polynucleotides, an initial functional evaluation can be performed by expressing the library of GAT encoding sequences in E. coli. The expressed GAT polypeptides can be purified, or partially purified as described above, and screened for improved kinetics by mass spectrometry. Following one or more preliminary rounds of diversification and selection, the polynucleotides encoding improved GAT polypeptides are cloned into a plant expression vector, operably linked to, e.g., a strong constitutive promoter, such as the CaMV 35S promoter. The expression vectors comprising the modified GAT nucleic acids are transformed, typically by Agrobacterium mediated transformation, into Arabidopsis thaliana host plants. For example, Arabidopsis hosts are readily transformed by dipping inflorescences into solutions of
Agrobacterium and allowing them to grow and set seed. Thousands of seeds are recovered in approximately 6 weeks. The seeds are then collected in bulk from the dipped plants and germinated in soil, h this manner it is possible to generate several thousand independently transformed plants for evaluation, constituting a high throughput (HTP) plant transformation format. Bulk grown seedlings are sprayed with glyphosate and surviving seedlings exhibiting glyphosate resistance survive the selection process, whereas non-transgenic plants and plants incorporating less favorably modified GAT nucleic acids are damaged or killed by the herbicide treatment. Optionally, the GAT encoding nucleic acids conferring improved resistance to glyphosate are recovered, e.g., by PCR amplification using T-DNA primers flanking the library inserts, and used in further diversification procedures or to produce additional transgenic plants of the same or different species. If desired, additional rounds of diversification and selection can be performed using increasing concentrations of glyphosate in each subsequent selection. In this manner, GAT polynucleotides and polypeptides conferring resistance to concentrations of glyphosate useful in field conditions can be obtained. Herbicide Resistance The present invention provides a composition comprising two or more polynucleotides of the invention. Preferably, the GAT polynucleotides encode GAT polypeptides having different kinetic parameters, i.e., a GAT variant having a lower Km can be combined with one having a higher kcat. hi a further embodiment, the different GAT polynucleotides may be coupled to a chloroplast transit sequence or other signal sequence thereby providing GAT polypeptide expression in different cellular compartments, organelles or secretion of one or more of the GAT polypeptides. The mechanism of glyphosate resistance of the present invention can be combined with other modes of glyphosate resistance known in the art to produce plants and plant explants with superior glyphosate resistance. For example, glyphosate-tolerant plants can be produced by inserting into the genome of the plant the capacity to produce a higher level of 5-enolpyruvylshikimate-3- phosphate synthase (EPSPS) as more fully described in U.S. Patent Nos. 6,248,876 Bl; 5,627,061; 5,804,425; 5,633,435; 5,145,783; 4,971,908; 5,312,910; 5,188,642; 4,940,835; 5,866,775; 6,225,114 Bl; 6,130,366; 5,310,667; 4,535,060; 4,769,061; 5,633,448; 5,510,471; Re. 36,449; RE 37,287 E; and 5,491,288; and international publications WO 97/04103; WO 00/66746; WO 01/66704; and WO 00/66747, which are incorporated herein by reference in their entireties for all purposes.
Glyphosate resistance is also imparted to plants that express a gene that encodes a glyphosate oxido-reductase enzyme as described more fully in U.S. Patent Nos. 5,776,760 and 5,463,175, which are incorporated herein by reference in their entireties for all purposes. Further, the mechanism of glyphosate resistance of the present invention may be combined with other modes of herbicide resistance to provide plants and plant explants that are resistant to glyphosate and one or more other herbicides. For example, the hydroxyphenylpyruvatedioxygenases are enzymes that catalyze the reaction in which para-hydroxyphenylpyruvate (HPP) is transformed into homogentisate. Molecules which inhibit this enzyme, and which bind to the enzyme in order to inhibit transformation of the HPP into homogentisate are useful as herbicides. Plants more resistant to certain herbicides are described in U.S. Patent Nos. 6,245,968 Bl; 6,268,549; and 6,069,115; and international publication WO 99/23886, which are incorporated herein by reference in their entireties for all purposes. Sulfonylurea and imidazolinone herbicides also inhibit growth of higher plants by blocking acetolactate synthase (ALS) or acetohydroxy acid synthase (AHAS). The production of sulfonylurea and imidazolinone tolerant plants is described more fully in U.S. Patent Nos. 5,605,011; 5,013,659; 5,141,870; 5,767,361; 5,731,180; 5,304,732; 4,761,373; 5,331,107; 5,928,937; and 5,378,824; and international publication WO 96/33270, which are incorporated herein by reference in their entireties for all purposes. Glutamine synthetase (GS) appears to be an essential enzyme necessary for the development and life of most plant cells. Inhibitors of GS are toxic to plant cells. Glufosinate herbicides have been developed based on the toxic effect due to the inhibition of GS in plants. These herbicides are non-selective. They inhibit growth of all the different species of plants present, causing their total destruction. The development of plants containing an exogenous phosphinotliricin acetyltransferase is described in U.S. Patent Nos. 5,969,213; 5,489,520; 5,550,318; 5,874,265; 5,919,675; 5,561,236; 5,648,477; 5,646,024; 6,177,616 Bl; and 5,879,903, which are incorporated herein by reference in their entireties for all purposes. Protoporphyrinogen oxidase (protox) is necessary for the production of chlorophyll, which is necessary for all plant survival. The protox enzyme serves as the target for a variety of herbicidal compounds. These herbicides also inhibit growth of all the different species of plants present, causing their total destruction. The development of plants containing altered protox activity which are resistant to these herbicides are described in U.S. Patent Nos. 6,288,306 Bl; 6,282,837 Bl; and 5,767,373; and international publication WO 01/12825, which are incorporated herein by reference in their entireties for all purposes. Accordingly, the invention provides methods for selectively controlling weeds in a field containing a crop that involve planting the field with crop seeds or plants which are glyphosate-tolerant as a result of being transformed with a gene encoding a glyphosate N-acetyltransferase, and applying to the crop and weeds in the field a sufficient amount of glyphosate to control the weeds without significantly affecting the crop. The invention further provides methods for controlling weeds in a field and preventing the emergence of glyphosate-resistant weeds in a field containing a crop which involve planting the field with crop seeds or plants that are glyphosate-tolerant as a result of being transformed with a gene encoding a glyphosate-N-acetyltransferase and a gene encoding a polypeptide imparting glyphosate tolerance by another mechanism, such as, a glyphosate-tolerant 5- enolpyruvylsl ikimate-3 -phosphate synthase and/or a glyphosate-tolerant glyphosate oxido-reductase and applying to the crop and the weeds in the field a sufficient amount of glyphosate to control the weeds without significantly affecting the crop. hi a further embodiment the invention provides methods for controlling weeds in a field and preventing the emergence of herbicide resistant weeds in a field containing a crop which involve planting the field with crop seeds or plants that are glyphosate-tolerant as a result of being transformed with a gene encoding a glyphosate-N-acetyltransferase, a gene encoding a polypeptide imparting glyphosate tolerance by another mechanism, such as, a glyphosate- tolerant 5-enolpyruvylshikimate-3-phosphate synthase and/or a glyphosate-tolerant glyphosate oxido-reductase and a gene encoding a polypeptide imparting tolerance to an additional herbicide, such as, a mutated hydroxyphenylpyruvatedioxygenase, a sulfonamide-tolerant acetolactate synthase, a sulfonamide-tolerant acetohydroxy acid synthase, an imidazolinone-tolerant acetolactate synthase, an imidazolinone- tolerant acetohydroxy acid synthase, a phosphinothricin acetyltransferase and a mutated protoporphyrinogen oxidase and applying to the crop and the weeds in the field a sufficient amount of glyphosate and an additional herbicide, such as, a hydiOxyphenylpyruvatedioxygenase inhibitor, sulfonamide, imidazolinone, bialaphos, phosphinothricin, azafenidin, butafenacil, sulfosate, glufosinate, and a protox inhibitor to control the weeds without significantly affecting the crop. The invention further provides methods for controlling weeds in a field and preventing the emergence of herbicide resistant weeds in a field containing a crop which involve planting the field with crop seeds or plants that are glyphosate-tolerant as a result of being transformed with a gene encoding a glyphosate-N-acetyltransferase and a gene encoding a polypeptide imparting tolerance to an additional herbicide, such as, a mutated hydroxyphenylpyruvatedioxygenase, a sulfonamide-tolerant acetolactate synthasej a sulfonamide-tolerant acetohydroxy acid synthase, an imidazolinone-tolerant acetolactate synthase, an imidazolinone-tolerant acetohydroxy acid synthase, a phosphinothricin acetyltransferase and a mutated protoporphyrinogen oxidase and applying to the crop and the weeds in the field a sufficient amount of glyphosate and an additional herbicide, such as, a hydroxyphenylpyruvatedioxygenase inhibitor, sulfonamide, imidazolinone, bialaphos, phosphinothricin, azafenidin, butafenacil, sulfosate, glufosinate, and a protox inhibitor to control the weeds without significantly affecting the crop.
EXAMPLES The following examples are illustrative and not limiting. One of skill will recognize a variety of non-critical parameters that can be altered to achieve essentially similar results.
EXAMPLE 1 : ISOLATING NOVEL NATIVE GAT POLYNUCLEOTIDES Five native GAT polynucleotides (i.e., GAT polynucleotides that occur naturally in a non-genetically modified organism) were discovered by expression cloning of sequences from Bacillus strains exhibiting GAT activity. Their nucleotide sequences were determined and are provided herein as SEQ ID NO:l - 5. Briefly, a collection of approximately 500 Bacillus and Pseudomonas strains were screened for native ability to N-acetylate glyphosate. Strains were grown in LB overnight, harvested by centrifugation, permeabilized in dilute toluene, and then washed and resuspended in a reaction mix containing buffer, 5 mM glyphosate, and 200 μM acetyl-CoA. The cells were incubated in the reaction mix for between 1 and 48 hours, at which time an equal volume of methanol was added to the reaction. The cells were then pelleted by centrifugation and the supernatant was filtered before analysis by parent ion mode mass spectrometry. The product of the reaction was positively identified as N-acetylglyphosate by comparing the mass spectrometry profile of the reaction mix to an N- acetylglyphosate standard as shown in Figure 2. Product detection was dependent on inclusion of both substrates (acetyl CoA and glyphosate) and was abolished by heat denaturing the bacterial cells. Individual GAT polynucleotides were then cloned from the identified strains by functional screening. Genomic DNA was prepared and partially digested with Sau3Al enzyme. Fragments of approximately 4 Kb were cloned into an E. coli expression vector and transformed into electrocompetent E. coli. Individual clones exhibiting GAT activity were identified by mass spectrometry following a reaction as described previously except that the toluene wash was replaced by permeabilization with PMBS. Genomic fragments were sequenced and the putative GAT polypeptide-encoding open reading frame identified. Identity of the GAT gene was confirmed by expression of the open reading frame inE. coli and detection of high levels of N-acetylglyphosate produced from reaction mixtures.
EXAMPLE 2: CHARACTERIZATION OF A GAT POLYPEPTIDE ISOLATED FROM B.LICHENIFORMIS STRAIN B6. Genomic DNA from B. licheniformis strain B6 was purified, partially digested with Sau3Al and fragments of 1-10 Kb were cloned into an E. coli expression vector. A clone with a 2.5 kb insert conferred the glyphosate-N- acetyltransferase (GAT) activity on the E. coli host as determined with mass spectrometry analysis. Sequencing of the insert revealed a single complete open reading frame of 441 base pairs. Subsequent cloning of this open reading frame confirmed that it encoded the GAT enzyme. A plasmid, pMAXY2120, is shown in Figure 4. The gene encoding the GAT enzyme of B6 was transformed into E. coli strain XL1 Blue. A 10% innoculum of a saturated culture was added to Luria broth, and the culture was incubated at 37° C for 1 hr. Expression of GAT was induced by the addition of EPTG at a concentration of 1 mM. The culture was incubated a further 4 hrs, following which, cells were harvested by centrifugation and the cell pellet stored at -80° C. Lysis of the cells was effected by the addition of 1 ml of the following buffer to 0.2 g of cells: 25 mM HEPES, pH 7.3, 100 mM KC1 and 10% methanol (HKM) plus 0.1 mM EDTA, 1 mM DTT, 1 mg/ml chicken egg lysozyme, and a protease inhibitor cocktail obtained from Sigma and used according to the manufacturer's recommendations. After 20 minutes incubation at room temperature (e.g., 22-25° C), lysis was completed with brief sonication. The lysate was centrifuged and the supernatant was desalted by passage through Sephadex G25 equilibrated with HKM. Partial purification was obtained by affinity chromatography on CoA Agarose (Sigma). The column was equilibrated with HKM and the clarified extract was allowed to pass through under hydrostatic pressure. Non-binding proteins were removed by washing the column with HKM, and GAT was eluted with HKM containing 1 mM Coenzyme A. This procedure provided 4-fold purification. At this stage, approximately 65% of the protein staining observed on an SDS polyacrylamide gel loaded with crude lysate was due to GAT, with another 20% due to chloramphenicol acetyltransferase encoded by the vector. Purification to homogeneity was obtained by gel filtration of the partially purified protein through Superdex 75 (Pharmacia). The mobile phase was HKM, in which GAT activity eluted at a volume corresponding to a molecular radius of 17 kD. This material was homogeneous as judged by Coomassie staining of a 3 μg sample of GAT subjected to SDS polyacrylamide gel electrophoresis on a 12% acrylamide gel, 1 mm thickness. Purification was achieved with a 6-fold increase in specific activity. The apparent KM for glyphosate was determined on reaction mixtures containing saturating (200 μM) Acetyl CoA, varying concentrations of glyphosate, and 1 μM purified GAT in buffer containing 5 mM morpholine adjusted to pH 7.7 with acetic acid and 20 % ethylene glycol. Initial reaction rates were determined by continuous monitoring of the hydrolysis of the thioester bond of Acetyl CoA at 235 nm (E = 3.4 OD/mM/cm). Hyperbolic saturation kinetics were observed (Figure 5), from which an apparent KM of 2.9 ± 0.2 (SD) mM was obtained. The apparent K for Acetyl CoA was determined on reaction mixtures containing 5 mM glyphosate, varying concentrations of Acetyl CoA, and 0.19 μM GAT in buffer containing 5 mM morpholine adjusted to pH 7.7 with acetic acid and 50% methanol. Initial reaction rates were determined using mass spectrometric detection of N-acetyl glyphosate. Five μl were repeatedly injected into the instrument and reaction rates were obtained by plotting reaction time vs area of the integrated peak (Figure 6). Hyperbolic saturation kinetics were observed (Figure 7), from which an apparent KM of 2 μM was derived. From values for Vmax obtained at a known concentration of enzyme, a kcat of 6/min was calculated.
EXAMPLE 3: MASS SPECTROMETRY (MS) SCREENING PROCESS Sample (5 μl) was drawn from a 96-well microtiter plate at a speed of one sample every 26 seconds and injected into the mass spectrometer (Micromass Quattro LC, triple quadrapole mass spectrometer) without any separation. The sample was carried into the mass spectrometer by a mobile phase of water/methanol (50:50) at a flow rate of 500 Ul/min. Each injected sample was ionized by a negative electrospray ionization process (needle voltage, -3.5 KV; cone voltage, 20 V; source temperature, 120° C; desolvation temperature, 250° C; cone gas flow, 90 L/Hr; and desolvation gas flow, 600 L/Hr). The molecular ions (m/z 210) formed during this process were selected by the first quadrapole for performing collision induced dissociation (CED) in the second quadrapole, where the pressure was set at 5 x 10"4 mBar and the collision energy was adjusted to 20 Ev. The third quadrapole was set for only allowing one of the daughter ions (m z 124) produced from the parent ions (m/z 210) to get into the detector for signal recording. The first and third quadrapoles were set at unit resolution, while the photomultiplier was operated at 650 V. Pure N-acetylglyphosate standards were used for comparison and peak integration was used to estimate concentrations. It was possible to detect less than 200 Nm N-acetylglyphosate by this method.
EXAMPLE 4: DETECTION OF NATIVE OR LOW ACTIVITY GAT ENZYMES Native or low activity GAT enzymes typically have a kcat of approximately 1 min'1 and a KM for glyphosate of 1.5-10 Mm. KM for acetyl CoA was typically less than 25 μM. Bacterial cultures were grown in rich medium in deep 96-well plates and 0.5 ml stationary phase cells were harvested by centrifugation, washed with 5 mM morpholine acetate pH 8, and resuspended in 0.1 ml reaction mix containing 200 μM ammonium acetyl CoA, 5 mM ammonium glyphosate, and 5 μg/ml PMBS (Sigma) in 5 mM morpholine acetate, pH 8. The PMBS permeabilizes the cell membrane allowing the substrates and products to move from the cells to the buffer without releasing the entire cellular contents. Reactions were carried out at 25-37°C for 1-48 hours. The reactions were quenched with an equal volume of 100%) ethanol and the entire mixture was filtered on a 0.45 μm MAHV Multiscreen filter plate (Millipore). Samples were analyzed using a mass spectrometer as described above and compared to synthetic N-acetylglyphosate standards.
EXAMPLE 5: DETECTION OF HIGH ACTIVITY GAT ENZYMES High activity GAT enzymes typically have a kcat up to 400 min"1 and a K below 0.1 mM glyphosate. Genes coding for GAT enzymes were cloned into E. coli expression vector pQE80 (Qiagen) and introduced into E. coli strain XL1 Blue (Stratagene). Cultures were grown in 150 ul rich medium (LB with 50 ug/ml carbenicllin) in shallow U-bottom 96-well polystyrene plates to late-log phase and diluted 1 :9 with fresh medium containing 1 mM EPTG (USB). After 4-8 hours induction, cells were harvested, washed with 5mM morpholine acetate pH 6.8 and resuspended in an equal volume of the same morpholine buffer. Reactions were carried out with up to 10 ul of washed cells. At higher activity levels, the cells were first diluted up to 1 :200 and 5 ul was added to 100 ul reaction mix. To measure GAT activity, the same reaction mix as described for low activity was used. However, for detecting highly active GAT enzymes the glyphosate concentration was reduced to 0.15 - 0.5 mM, the pH was reduced to 6.8, and reactions were carried out for 1 hour at 37°C. Reaction workup and MS detection were as described herein.
EXAMPLE 6: PURIFICATION OF GAT ENZYMES Enzyme purification was achieved by affinity chromatography of cell lysates on CoA-agarose and gel-filtration on Superdex-75. Quantities of purified GAT enzyme up to 10 mg were obtained as follows: A 100-ml culture of E. coli carrying a GAT polynucleotide on a pQE80 vector and grown overnight in LB containing 50 ug/ml carbenicillin was used to inoculate 1 L of LB plus 50 ug/ml carbenicillin. After 1 hr, EPTG was added to 1 mM, and the culture was grown a further 6 hr. Cells were harvested by centrifugation. Lysis was effected by suspending the cells in 25 mM HEPES (pH 7.2), 100 mM KC1, 10% methanol (HKM), 0.1 mM EDTA, 1 mM DTT, protease inhibitor cocktail supplied by Sigma-Aldrich and 1 mg/ml of chicken egg lysozyme. After 30 minutes at room temperature, the cells were briefly sonicated. Particulate material was removed by centrifugation, and the lysate was passed through a bed of coenzyme A-Agarose. The column was washed with several bed volumes of HKM and GAT was eluted in 1.5 bed volumes of HKM containing 1 mM acetyl CoA. GAT in the eluate was concentrated by its retention above a Centricon YM 50 ultrafiltration membrane. Further purification was obtained by passing the protein through a Superdex 75 column through a series of 0.6-ml injections. The peak of GAT activity eluted at a volume corresponding to a molecular weight of 17 kD. This method resulted in purification of GAT enzyme to homogeneity with >85% recovery. A similar procedure was used to obtain 0.1 to 0.4 mg quantities of up to 96 shuffled variants at a time. The volume of induced culture was reduced to 1 to 10 ml, coenzyme A- Agarose affinity chromatography was performed in 0.15-ml columns packed in an MAHV filter plate (Millipore) and Superdex 75 chromatography was omitted.
EXAMPLE 7: STANDARD PROTOCOL FOR DETERMINATION OF KPAT AND KM kcat and KM for glyphosate of purified protein were determined using a continuous spectrophotometric assay, in which hydrolysis of the sulfoester bond of Acetyl CoA was monitored at 235 mn. Reactions were performed at ambient temperature (about 23 °C) in the wells of a 96-well assay plate, with the following components present in a final volume of 0.3 ml: 20 mM HEPES, pH 6.8, 10%) ethylene glycol, 0.2 mM acetyl CoA, and various concentrations of ammonium glyphosate. In comparing the kinetics of two GAT enzymes, both enzymes were assayed under the same conditions, e.g., both at 23°C. kcat was calculated from Vmax and the enzyme concentration, determined by Bradford assay. KM was calculated from the initial reaction rates obtained from concentrations of glyphosate ranging from 0.125 to 10 mM, using the Lineweaver-Burke transformation of the Michaelis-Menten equation. kcat/K was determined by dividing the value determined for kcat by the value determined for KM- Using this methodology, kinetic parameters for a number of GAT polypeptides exemplified herein were determined. For example, the kcat, KM and kcat/KM for the GAT polypeptide corresponding to SEQ ED NO:445 have been determined to be 322 min"1, 0.5 mM and 660 mM^min"1, respectively, using the assay conditions described above. The kcat, KM and IO^/KM for the GAT polypeptide corresponding to SEQ ED NO:457 have been determined to be 118 min"1, 0.1 mM and 1184 mM' n"1, respectively, using the assay conditions described above. The kcat, KM and kcat!Ku for the GAT polypeptide corresponding to SEQ ED NO:300 have been determined to be 296 min"1, 0.65 mM and 456 mM" 1min"1, respectively, using the assay conditions described above. One of skill in the art can use these numbers to confirm that a GAT activity assay is generating kinetic parameters for a GAT suitable for comparison with the values given herein. For example, the conditions used to compare the activity of GATs should yield the same kinetic constants for SEQ ID NO: 300, 445, and 457 (within normal experimental variance) as those reported herein, when the conditions are used to compare a test GAT with the GAT polypeptides exemplified herein. KM for Acetyl CoA was measured using the mass spectrometry method with repeated sampling during the reaction. AcetylCoA and glyphosate (ammonium salts) were placed as 50-fold-concentrated stock solutions into a well of a mass spectrometry sample plate. Reactions were initiated with the addition of enzyme appropriately diluted in a volatile buffer such as moφholine acetate or ammonium carbonate, pH 6.8 or 7.7. The sample was repeatedly injected into the instrument and initial rates were calculated from plots of retention time and peak area. KM was calculated as for glyphosate.
EXAMPLE 8: SELECTION OF TRANSFORMED E. COLI An evolved GAT gene (a chimera with a native B. licheniformis ribosome binding site (AACTGAAGGAGGAATCTC; SEQ ID NO:515) attached directly to the 5' end of the GAT coding sequence) was cloned into the expression vector pQE80 (Qiagen) between the EcoRI and Hindlll sites, resulting in the plasmid pMAXY2190 (Figure 11). This eliminated the His tag domain from the plasmid and retained the B-lactamase gene conferring resistance to the antibiotics ampicillin and carbenicillin. pMAXY2190 was electroporated (BioRad Gene Pulser) into XL1 Blue (Stratagene) E. coli cells. The cells were suspended in SOC rich medium and allowed to recover for one hour. The cells were then gently pelleted, washed one time with M9 minimal media lacking aromatic amino acids (12.8 g/L Na2HPO4.7 H2O, 3.0 g/L KH2PO4, 0.5 g/L NaCl, 1.0 g/L NH4C1, 0.4% glucose, 2 mM MgSO4, 0.1 mM CaC12, 10 mg/L thiamine, 10 mg/L proline, 30 mg/L carbenicillin), and resuspended in 20 ml of the same M9 medium. After overnight growth at 37°C at 250 rpm, equal volumes of cells were plated on either M9 medium or M9 plus 1 mM glyphosate medium. pQE80 vector with no GAT gene was similarly introduced into E. coli cells and plated for single colonies for comparison. Table 3 presents a summary of the results, demonstrating that GAT activity allows selection and growth of transformed E. coli cells with less than 1%> background. Note that no IPTG induction was necessary for sufficient GAT activity to allow growth of transformed cells. Transformation was verified by re- isolation of pMAXY2190 from the E. coli cells grown in the presence of glyphosate. Table 3. Glyphosate selection of pMAXY2190 in E. coli
EXAMPLE 9: SELECTION OF TRANSFORMED PLANT CELLS Agrobacterium-mediated transformation of plant cells occurs at low efficiencies. To allow propagation of transformed cells while inhibiting proliferation of non-transformed cells, a selectable marker is needed. Antibiotic markers for kanamycin and hygromycin and the herbicide modifying gene bar, which detoxifies the herbicidal compound phosphinothricin, are examples of selectable markers used in plants (Methods in Molecular Biology, 1995, 49:9-18). Here we demonstrate that GAT activity serves as an efficient selectable marker for plant transformation. An evolved GAT gene (0_5B8), SEQ LD NO:190, was cloned between a plant promoter (enhanced strawberry vein banded virus) and a ubiquinone terminator and introduced into the T-DNA region of the binary vector pMAXY3793 suitable for transformation of plant cells via Agrobacterium tumefaciens EHA105 as shown in Figure 12. A screenable GUS marker was present in the T-DNA to allow confirmation of transformation. Transgenic tobacco shoots were generated using glyphosate as the only selecting agent. Axillary buds oϊNicotiana tabacum L. Xanthi were subcultured on half-strength MS medium with sucrose (1.5 %) and Gelrite (0.3 %) under 16-h light (35-42 μEinsteins m"2 s"1, cool white fluorescent lamps) at 24 °C every 2-3 weeks. Young leaves were excised from plants after 2-3 weeks subculture and were cut into 3 3 mm segments. A. tumefaciens EHA105 was inoculated into LB medium and grown overnight to a density of A600= 1.0. Cells were pelleted at 4,000 rpm for 5 minutes and resuspended in 3 volumes of liquid co-cultivation medium composed of Murashige and Skoog (MS) medium (pH 5.2) with 2 mg/L N6-benzyladenine (BA), 1% glucose and 400 uM acetysyringone. The leaf pieces were then fully submerged in 20 ml of A. tumefaciens in 100 x 25 mm Petri dishes for 30 min, blotted with autoclaved filter paper, then placed on solid co-cultivation medium (0.3% Gelrite) and incubated as described above. After 3 days of co- cultivation, 20-30 segments were transferred to basal shoot induction (BSI) medium composed of MS solid medium (pH 5.7) with 2 mg/L BA, 3% sucrose, 0.3% Gelrite, 0-200 uM glyphosate, and 400 ug/ml Timentin. After 3 weeks, shoots were clearly evident on the explants placed on media with no glyphosate regardless of the presence or absence of the GAT gene. T-DNA transfer from both constracts was confirmed by GUS histochemical staining of leaves from regenerated shoots. Glyphosate concentrations greater than 20 uM completely inhibited any shoot formation from the explants lacking a GAT gene. Explants infected with A. tumefaciens with the GAT construct regenerated shoots at glyphosate concentrations up to 200 uM (the highest level tested). Transformation was confirmed by GUS histochemical staining and by PCR fragment amplification of the GAT gene using primers annealing to the promoter and 3' regions. The results are summarized in Table 4. Table 4. Tobacco shoot regeneration with glyphosate selection.
EXAMPLE 10: GLYPHOSATE SELECTION OF TRANSFORMED YEAST CELLS Selection markers for yeast transformation are usually auxotrophic genes that allow growth of transformed cells on a medium lacking the specific amino acid or nucleotide. Because Saccharomyces cerevisiae is sensitive to glyphosate, GAT can also be used as a selectable marker. To demonstrate this, an evolved GAT gene (0_6D10), SEQ ED NO: 196, is cloned from the T-DNA vector pMAXY3793 (as shown in Example 9) as a Pstl-Clal fragment containing the entire coding region and ligated into Pstl-Clal digested p424TEF (Gene, 1995, 156:119-122) as shown in Figure 13. This plasmid contains an E. coli origin of replication and a gene conferring carbenicillin resistance as well as a TRP1, tryptophan auxotroph selectable marker for yeast transformation. The GAT containing construct is transformed into E. coli XL1 Blue (Statagene) and plated on LB carbenicillin (50 ug/ml) agar medium. Plasmid DNA is prepared and used to transform yeast strain YPH499 (Stratagene) using a transformation kit (Bio 101). Equal amounts of transformed cells are plated on CSM-YNB-glucose medium (BiolOl) lacking all aromatic amino acids (tryptophan, tyrosine, and phenylalanine) with added glyphosate. For comparison, p424TEF lacking the GAT gene is also introduced into YPH499 and plated as described. The results demonstrate that GAT activity function will as an efficient selectable marker. The presence of the GAT containing vector in glyphosate selected colonies can be confirmed by re-isolation of the plasmid and restriction digest analysis.
EXAMPLE 11. HERBICIDE SPRAY TESTS OF GAT EXPRESSING TOBACCO PLANTS Tobacco shoots generated as described in EXAMPLE 9 were excised from the explants and transferred to basal root induction (BRI) medium composed of half-strength Murashige and Skoog (MS) medium, pH 5.7, with 1.5% sucrose,
0.3% Gelrite, 0-200 uM glyphosate and 400 ug/ml Timentin. Rooted plants and axillary shoots were clonally propagated by cutting the stem and transferring it to fresh BRI medium until the desired number of clones was obtained. Rooted plants were carefully removed from the solid medium. Prior to placing the plants into small pots of soil, the roots were washed to remove any remaining Gelrite. A protective plastic cover was kept over the plants for at least one week until the plants were well established. To determine if GAT expressing tobacco plants could tolerate simulated field rate sprays of glyphosate, clonal lines of several events per GAT variant were tested. A typical test was set up as follows: One clone from each event was sprayed with 1 ml of solution containing the isopropylamine salt of glyphosate (Sigma P5671) and 0.125% Triton X-100, pH 6.8 such that the amount of active ingredient sprayed was equivalent to that present in commercial glyphosate products. For example, to achieve 32 oz/acre (IX) of herbicide containing 40% active ingredient ("ai"), 2.4 ul of 40% ai formulation was diluted into 1 ml water and sprayed on a plant in a 4-inch square pot (16 in2). A mock application (OX) with surfactant only was also included. In some cases a second spray was applied 1-4 weeks later. Plants were kept in controlled growth rooms at 25 °C and 70% humidity with 16 hr light. In this example, 10 events confirmed positive for GAT0_6D10 (SEQ ID NO: 196), ten for GAT0_5D3 (SEQ LD NO: 193), 8 events for GAT0_5B8 (SEQ ED NO: 190), and plants transformed with the vector only (no GAT) were clonally propagated, transferred to soil and sprayed when plants had an average of 5 leaves. Seed-grown wild type plants were also sprayed. After two weeks, the vector only and seed grown plants sprayed with 0.5, 2 or 4X glyphosate stopped growing, wilted, and turned brown. Each of the transgenic GAT plants survived the spraying procedure without signs of glyphosate damage such as chlorosis, leaf elongation, stunting, or browning. All OX plants were healthy, including the non- GAT control plants. Three weeks later all of the surviving plants were sprayed with an 8X dose. The OX control plants died within two weeks. Again, all GAT plants survived. Tobacco plants transformed with GAT and selected on glyphosate were fertile. Flowering and seed set were not detectably different from wild type plants.
EXAMPLE 12. MENDELIAN INHERITANCE OF GAT GENE AND GLYPHOSATE-TOLERANT PHENOTYPE Mendelian inheritance of the GAT gene and glyphosate-tolerant phenotype was demonstrated with transformed Arabidopsis. Columbia type Arabidopsis plants were grown and transformed by the dipping method (Clough, SJ and Bent, AF, (1998) Plant J. 16(6):735-43) with a construct containing the GAT variant called chimera (SEQ ED NO: 16). Bulk seed was collected and GAT plants were confirmed by PCR with primers specific to the insert within the T-DNA. Tl seed from individual events were sown on soil with 10-30 seeds per 2-inch square pot. When the first set of true leaves was emerging, pots were sprayed with glyphosate equivalent to 0.5 and IX commercial product (as calculated in EXAMPLE 11). After two weeks, segregation of the transgene and tolerant phenotype was evident as shown in Table 5.
Table 5. Summary of segregation data for 0.5 and IX glyphosate-tolerant Tl Arabidopsis
Ratios near 3 : 1 indicate a single segregating dominant event. Ratios greater than 3:1 indicate several segregating inserts. Ratios less than 3:1 can be due to small sample size effects, incomplete dominance, or position effects that render expression too low to confer herbicide tolerance. Compared to the controls, it was clear that the GAT gene was transmitted to the Tl generation and conferred glyphosate tolerance.
EXAMPLE 13: PRODUCTION OF GLYPHOSATE-RESISTANT MAIZE EXPRESSING GAT TRANSGENES Maize plants expressing GAT variant transgenes were produced using the methods described in U.S. Patent No. 5,981,849, which is incorporated herein by reference. Specifically, Agrobacterium tumefaciens vectors were constructed according to methods known in the art. Each vector contained an insert having an ubiquitin promoter and intron, a GAT variant and a PinXI terminator. Maize immature embryos were excised and infected with an Agrobacterium tumefaciens vector containing the GAT variant of interest. After infection, embryos were transferred and cultured in co-cultivation medium. After co-cultivation, the infected immature embryos were transferred onto media containing 1.0 mM glyphosate (Roundup ULTRA MAX™). This selection lasted until actively growing putative transgenic calli were identified. The putative transgenic callus tissues were sampled for PCR and Western assay (data not shown) to confirm the presence of the GAT gene. The putative transgenic callus tissues were maintained on 1.0 mM glyphosate selection media for further growth and selection before plant regeneration. At regeneration, callus tissue confirmed to be transgenic were transferred onto maturation medium containing 0.1 mM glyphosate and cultured for somatic embryo maturation. Mature embryos were then transferred onto regeneration medium containing 0.1 mM glyphosate for shoot and root formation. After shoots and roots emerged, individual plantlets were transferred into tubes with rooting medium containing 0.1 mM glyphosate. Plantlets with established shoots and roots were transplanted into pots in the greenhouse for further growth, the generation of TO spray data and the production of Tl seed. In order to evaluate the level of glyphosate resistance of the transgenic maize plants expressing the GAT variant transgenes, TO plants were sprayed with glyphosate (Roundup ULTRA MAX™) in the greenhouse. Plant resistance levels were evaluated by plant discoloration scores and plant height measurements. Plant discoloration and plant height were evaluated according to the following scales:
Discoloration score at 1, 2, 3 and 4 weeks after spray with glyphosate 9 = no leaf stem discoloration 7 = minor leaf stem discoloration 5 = worse leaf/stem discoloration 3 = severely discolored plant or dying plant 1 = dead plant
Plant height measurements before spraying with glyphosate after spraying with glyphosate at 1, 2, 3 and 4 weeks mature plants (at tasseling) Two plants were sent to the greenhouse from each event (independent transgenic callus) listed in Table 6. Plant 1 was kept for seed production and was not sprayed with glyphosate. Plant 2 was sprayed at 4x glyphosate (lx glyphosate = 26 ounces/acre) at 14 days after transplanting. The TO plant discoloration scores with 4x spray at 7 and 14 days after the spray are shown in Tables 6 and 7. Height data at tasseling is shown in Figure 14. An additional experiment was performed in which TO plants were sprayed with 6x glyphosate. The TO plant discoloration scores with 6x spray at 10 days after spray are shown in Table 8.
Table 6. Resistance Scores at 7 days after treatment with 4x glyphosate constructs # events tested % events @ %> events @ % events @ <7 with 4x 9 7 18534 169 30% (50) 59% (101) 11% (18) (SEQ ED NO: 196) 18537 72 40% (29) 54% (39) 6% (4) (SEQ ED NO: 193) 18540 111 32% (36) 61% (67) 7% (8) (SEQ ED NO: 190) total 352 33% (115) 59% (207) 8% (30)
Table 7. Resistance Scores at 14 days after treatment with 4x glyphosate constracts # events tested with 4x % events @ 9 18534 169 29%(49) (SEQ ED NO: 196) 18537 72 50%(36) (SEQ LD NO: 193) 18540 111 29%(32) (SEQ ED NO: 190) total 352 33%(117)
Table 8. Resistance Scores at 10 days after treatment with 6X glyphosate constracts # events tested with 6X % events with no damage after glyphosate treatment (score= 9) 19286 (SEQ LD 312 51%(160) NO:814) 19288 (SEQ LD 310 52%(163) NO:549) 19900 (SEQ LD 231 56%(129) NO:738) 19902 (SEQ LD 230 42%(96) NO:638) 21895 (SEQ LD 55 30%(17) NO:848) 21896 (SEQ LD 61 61%(37) NO:912) 21905 (SEQ LD 32 70%(25) NO:906) total 1231 51%(627) EXAMPLE 14: GAT IS ALSO AN ACYLTRANSFERASE The ability of GAT variants (B6 (SEQ ED NO:7), 0_6D10 (SEQ ED NO:448), 17-15H3 (SEQ ED NO:601), and 20-8H12c (SEQ ID NO:8.17)) to transfer the propionyl group from propionyl CoA to glyphosate was tested in reaction mixtures containing 5mM glyphosate or no glyphosate. Propionyl CoA was present at 1 mM. After 30 minutes the reactions were terminated and the presence of free propionyl CoA was determined by the addition of DTNB. All variants showed glyphosate-dependent hydrolysis of propionyl CoA. These results indicate that GAT also functions as an acyltransferase.
EXAMPLE 15: Tl STUDIES OF GLYPHOSATE-RESISTANT MAIZE EXPRESSING GAT TRANSGENES Maize plants expressing GAT variant transgenes 18-28D9b (SEQ ED
NO:814) and 17-15H3 (SEQ LD NO:549) were produced using the methods described in Example 13. Tl plants were used for the generation of glyphosate field tolerance data. The Tl plants were treated in the field with four different glyphosate spray treatments (OX, 4X, 8X, and 4X + 4X) for each event. The plants were sprayed at V3 and V8. Plants were scored 10 days after treatment for leaf discoloration and plant height comparisons as described in Example 13. The Tl field spray data correlated well with the results previously obtained in the greenhouse as reported in Example 13. T2 seeds were collected for further studies.
EXAMPLE 16: THERMOSTABILITY OF GAT POLYPEPTIDES
A. EFFECT OF TEMPERATURE VARIATION ON GLYPHOSATE TOLERANCE OF GLYPHOSATE RESISTANT MAIZE EXPRESSING GAT TRANSGENES Maize plants expressing GAT variant transgenes 10_4F2 (SEQ ED
NO.-203), 17-15H3 (SEQ ED NO:549), and 18-28D9b (SEQ ED NO:814) were produced using the methods described in Example 13. The effect of temperature on glyphosate tolerance was evaluated in Tl plants. The Tl plants were grown in cool/cold (day 14° C, night 8° C), warm (day 28° C, night 20° C), and hot (day 37° C, night 20° C) conditions. Tl plants were sprayed at V2 with four different glyphosate spray treatments (OX, 4X, 6X, and 8X). Plants were scored at 5 and 14 days after treatment for leaf discoloration and plant height comparisons as described in Example 13. Visual observations indicated that glyphosate tolerance is not adversely effected by the range of temperatures tested.
B. EFFECT OF TEMPERATURE VARIATION ON GAT ACTIVITY IN VITRO In vitro thermostability of several GAT polypeptides (DS3 (a native GAT polypeptide corresponding to SEQ ED NO: 8), 6_6D5 (SEQ ED NO: 410), 17-15H3
(SEQ ED NO: 601), 20-8H12 (SEQ ED NO: 739), 22-13B12 (SEQ ID NO: 781) and 401 (a native GAT polypeptide corresponding to SEQ ED NO: 6)) was evaluated in accordance with the following method. The enzymes were distributed to 200 μl strip PCR tubes (VWR, San Francisco, CA) and incubated in a gradient thermocycler (ML Research, Watertown, MA) for 15 minutes at various temperatures between 30°C and 60°C as indicated in Figure 17. Precipitated protein was removed by centrifugation, and surviving enzymatic activity of the remaining soluble protein was measured at 22°C by the continuous spectrophotometric assay, as described in Example 7. Saturating concentrations of glyphosate (10 mM for DS3 (SEQ ED NO: 8), 401 (SEQ ED NO: 6) and 6_6D5 (SEQ ED NO: 410); 5 mM for 17-15H3 (SEQ ED NO: 601), 20-8H12 (SEQ ED NO: 739), and 22-13B12 (SEQ LD NO: 781) and AcCoA (167 μM) were used. The data is depicted in Figure 17. Native (i.e., wild type) GAT polypeptides DS3 (SEQ ED NO: 8) and 401 (SEQ ED NO: 6) appeared stable with respect to activity at temperatures up to about 42 to about 44°C. GAT polypeptides that are not native to any organism (i.e., not wild type) appeared stable at temperatures in the range of about 47°C to about 54°C. The half lives of several GAT polypeptides were also measured at 37.5°C according to the following procedure. GAT polypeptides 401 (SEQ LD NO: 6), 17- 15H3 (SEQ ED NO: 601), 20-8H12 (SEQ ID NO: 739), 22-13B12 (SEQ ED NO: 781), 22-15B4 (SEQ ED NO: 946) and 22-18C5 (SEQ ID NO: 795) were incubated in a matrix of 25 mM Hepes, pH 7.2, 10 mM KCl and 10% methanol ("HKM"). At various timepoints, aliquots were withdrawn and assayed in triplicate at 22°C using the continuous spectrophotometric assay described in Example 7 using saturating concentrations of glyphosate (20 mM for 401, 5 mM for the rest) and AcCoA (167 uM). The standard error at each time point averaged about 2.9%. GAT activity was plotted as a function of incubation time and the data was fitted to a curve for exponential decay (y=e _x), where y is enzyme activity and x is time in hours, from which half life was calculated. The data is shown below in Table 9.
Table 9. Half-lives of GAT polypeptides at 37.5 C
EXAMPLE 17: PRODUCTION OF GLYPHOSATE-RESISTANT SOYBEAN EXPRESSING GAT TRANSGENES Soybean plants expressing GAT variant transgenes were produced using the method of particle gun bombardment (see Klein et al. (1987) Nature 327:70-73) using a DuPont Biolistic PDSIOOO/He instrument. The selection agent used during the transformation process was hygromycin. Either the hygromycin selectable marker gene remained in the transgenic events or the hygromycin gene was excised by methods known in the art. DNA fragments were prepared with a synthetic constitutive promoter, a GAT variant and PinEI terminator. The selectable marker gene, comprising the 35S CaMV promoter, HPT gene and NOS terminator, was cobombarded with the GAT gene variant as described above. Bombarded soybean embryogenic suspension tissue was cultured for one week in the absence of selection agent. Embryogenic suspension tissue was placed in liquid selection medium for 6 weeks. Putative transgenic suspension tissue was sampled for PCR analysis to determine the presence of the GAT gene. Putative transgenic suspension culture tissue was maintained in selection medium for 3 weeks to obtain enough tissue for plant regeneration. Suspension tissue was matured for 4 weeks using standard procedures; matured somatic embryos were desiccated for 4-7 days and then placed on germination induction medium for 2-4 weeks. Germinated plantlets were transferred to soil in cell pack trays for 3 weeks for acclimatization. Plantlets were potted to 10-inch pots in the greenhouse for evaluation of glyphosate resistance. To determine the level of glyphosate resistance of transgenic soybeans expressing the GAT variant transgenes, TO plants were sprayed with glyphosate (Roundup ULTRA MAX™) in the greenhouse. Plant resistance levels were evaluated by plant discoloration scores and plant height measurements. Discoloration score at 2 weeks after spray with glyphosate 9 == no leaf stem discoloration 7 = minor leaf/stem discoloration 5 = worse leaf/stem discoloration 3 = severely discolored plant or dying plant 1 = dead plant One to four plants were sent to the greenhouse from each independent transgenic event. An additional 1-2 plants per event were grown in controlled environment growth chambers for seed production and were not sprayed with glyphosate. The greenhouse plants were sprayed at IX, 2X or 4X glyphosate (IX glyphosate = 26 ounces/acre of RoundUp ULTRA MAX™) 3-4 weeks after transfer to soil. The TO plant discoloration scores with 2X and 4X spray rates are shown in Table 10 and Table 11, respectively. These results show that soybeans are effectively transformed with GAT gene variants as confirmed by PCR analysis. Transgenic soybeans expressing GAT gene variants are resistant to glyphosate at 2X and 4X spray rates. Events surviving the 4X glyphosate spray rate do show some minor leaf discoloration however within 2 weeks of the spray test, plants recover and demonstrate normal leaf morphology. Table 10. Resistance Scores at 10 days after treatment with 2X glyphosate. # EVENTS % EVENTS @ % EVENTS (c TESTED WITH 7-8 3-6 2X SEQ ED NO:193 27 15%(4) 11%(3)
SEQ ED NO:824 38 8%(3) 74%(23)
Table 11. Resistance Scores at 10 days after treatment with 4X glyphosate. # EVENTS % EVENTS @ % EVENTS TESTED WITH 7-8 3-6 4X
SEQ ED NO:824 23 8%(2) 43%(10)
EXAMPLE 18: EFFECT OF SALT ON GAT KINETICS To better approximate the physiological conditions under which the GAT enzymes of the invention are intended to be used (e.g., plant cells), the activities of some GAT enzymes of the invention were re-evaluated in the presence of added salt. Figures 15A and 15B provide a comparison of the kinetic parameters Km and k0at/Km, respectively, for native GAT enzymes GAT401 (SEQ ED NO:6), B6 (SEQ ED NO:7), and DS3 (SEQ LD NO:8), and evolved GAT enzymes 0_6D10 (SEQ ED NO.-448), 10_4F2 (SEQ ID NO:454), 18-28D9 (SEQ ED NO:618), 17-15H3 (SEQ ED NO:601), 17-10B3 (SEQ LD NO:592), 20-8H12 (SEQ ED NO:739), 20-16A3 (SEQ ED NO:639), and 20-30C6 (SEQ ED NO:683), assayed in either the absence of added KCl (unshaded bars) or in the presence of 20 mM KCl (shaded bars). Protein concentrations were determined using the Bradford assay as described in Example 7. Owing to their extremely low Kms for glyphosate in the absence of KCl, the kinetic parameters for evolved GAT enzymes 0_6D10, 18-28D9 and 20- 8H12 were determined in the absence of KCl using the mass spectrometry assay as described in Example 3, while all other kinetic parameters (either in the absence or presence of KCl) were determined using the continuous spectrophotometric assay as described in Example 7. Error bars represent the standard deviation of multiple assays, where available. Figure 15A shows that addition of salt (20 mM KCl) to the assay buffer significantly increases the Km value for glyphosate. The kc t value remains relatively unchanged or increases slightly, the net result being a lower observed kcat/Km value for GAT enzymes assayed in the presence of 20 mM KCl than in the absence of added KCl (Figure 15B).
EXAMPLE 19: FURTHER EVOLVED GAT GENES ENCODING GAT ENZYMES WITH EXTREMELY HIGH ACTIVITY Additional iterations of directed molecular evolution yielded further evolved gat genes encoding GAT enzymes exhibiting extremely high GAT activity, e.g. exhibiting one or more improved property such as reduced Km for glyphosate, increased kcat, or increased kcat/Km compared to previously-described GAT enzymes. The further evolved gat genes were first selected for growth in E. coli in minimal M9 medium as described in Example 8, except that 5 mM rather than 1 mM glyphosate was used in the selection. Proteins were purified as described in Example 6 above. Protein concentrations were determined by UV absorbance at 205 nm. The extinction coefficient was determined by the method described by Scopes (1994; Protein Purification, Principles and Practice, Springer, New York) according to the formula E (mg ml"1 cm"1) = 27 + 120(A280/A205) = 30.5. Prior to quantitation by UV absorbance the protein solution was buffer-exchanged into 50 mM Na2SO4 using a NAP-5 column (Amersham-Pharmacia Biotech). Exemplary further evolved gat coding sequences comprise nucleic acids sequences identified herein as SEQ LD NOs: 832, 834, 836, 838, 840, 842, 844, 846, 848, 850, 852, 854, 856, 858, 860, 862, 864, 866, 868, 870, 872, 874, 876, 878, 880, 882, 884, 886, 888, 890, 892, 894, 896, 898, 900, 902, 904, 906, 908, 910, 912, 914, 916, 918, 920, 922, 924, 926, 928, and 930, which encode further evolved GAT enzymes comprising amino acid sequences identified herein as SEQ ED NOs: 833, 835, 837, 839, 841, 843, 845, 847, 849, 851, 853, 855, 857, 859, 861, 863, 865, 867, 869, 871, 873, 875, 877, 879, 881, 883, 885, 887, 889, 891, 893, 895, 897, 899, 901, 903, 905, 907, 909, 911, 913, 915, 917, 919, 921, 923, 925, 927, 929, and 931, respectively. Some such further evolved GAT enzymes exhibit extremely high GAT activity, in that they exhibit one or more improved property such as reduced Km for glyphosate, increased cat, or increased kcat/Km, compared to previously-described GAT enzymes assayed under the same conditions. Figures 16 A, 16B and 16C provide a comparison of the kinetic parameters Km, kcat, and kcat/Km , respectively, of several previously-described GAT enzymes (unshaded bars) to the kinetic parameters of some further evolved GAT enzymes of the invention (shaded bars), assayed using the continuous spectrophotometric assay in the presence of 20 mM KCl with protein quantified via UV absorbance as described above. Error bars represent the standard deviation of multiple assays, where available. Under these assay conditions, native GAT enzyme GAT401 (SEQ ID NO:6) exhibited a Km for glyphosate of about 4 mM, a kcat of about 5.4 1 1 1 min" , and a kCat/Km of about 1.35 mM" min" . When assayed under these conditions, some further evolved GAT enzymes of the invention (shaded bars) exhibit a range of Km values for glyphosate of less than about 0.4 mM (such as, between about 0.4 mM and 0.1 mM), k^t values of at least about 1000 min"1 (such as, between about 1000 min"1 and about 2500 min"1), and kcat/Km values of at least about 4800 mM"1 min"1 (such as, between about 4800 mM"1 min"1 and about 8000 mM" min"1). For example, some further evolved GAT enzymes of the invention exhibit at least about a 7000-fold increase in kcat/Km over native GAT enzyme GAT401 under these assay conditions. Some further evolved GAT enzymes of the invention comprise one or more amino acid residue positions not observed in previously described GAT polypeptides and GAT enzymes, such as, at position 27, a Bl, ZI or A amino acid residue; at position 33, an N or G amino acid residue; at position 46, a B2, Z4, or H amino acid residue; and at position 93, an R amino acid residue; where Bl is an amino acid selected from the group consisting of A, I, L, M, F, W, Y and V; B2 is an amino acid selected from the group consisting of R, N, D, C, Q, E, G, H, K, P, S, and T; ZI is an amino acid selected from the group consisting of A, I, L, M and V; and Z4 is an amino acid selected from the group consisting of R, H and K. For example, some further evolved GAT enzymes of the invention comprise one or more of: an Ala at position 27 (i.e., Ala27); an Asn or a Gly at position 33 (i.e., Asn33 or Gly33); a His at position 46 (i.e., His46); and an Arg at position 93 (i.e., Arg93), with sequence numbering corresponding to that of, e.g., SEQ ED NO: 907. Sequence/activity analyses were performed to identify amino acid residues which correlate positively with a high kcat/Km (as manifested by a high koat, a low Km, or both). Amino acid residues wliich appear to correlate positively with a high kca/Km include Glul4, Asp32, Asn33, Gly38, and Thr62 (sequence numbering corresponding to that of SEQ ED NO:907). Additional GAT enzymes may be constructed by substituting codons for one or more of these residues into the appropriate position(s) of a coding sequence of a template GAT polypeptide. For example, additional GAT enzymes were generated by substituting one or more of codons encoding Glu at codon position 14, Asp at position 32, Asn at position 33, Gly at position 38, and Thr at position 62, into a nucleic acid sequence encoding a template polypeptide, such as GAT 24-5H5 (SEQ ID NO:845) or GAT 25-8H7 (SEQ ED NO:907), two of the further evolved GAT enzymes exhibiting extremely high activity as described above. Exemplary further evolved GAT enzymes generated in this manner, identified herein as R12G1 (SEQ ED NOs917), R12G2 (SEQ ED NO:919), R12G3 (SEQ ED NO:921), R12G4 (SEQ LD NO:923), R12G5 (SEQ ED NO:925), R12G6 (SEQ ED NO:927), R12G7 (SEQ LD NO:929), and R12G8 (SEQ ED NO:931), encoded by nucleic acids identified as SEQ ED NOs: 916, 918, 920, 922, 924, 926, 928, and 930, respectively, exhibited extremely high GAT activities comparable to those of the template polypeptides.
EXAMPLE 20: AMINO ACEDS THAT CORRELATE WITH HIGH GAT ACTIVITY The amino acids aspartic acid (Asp, D), histidine (His, H) and cysteine (Cys, C) are known to be associated with the active sites of various acetyltransferase enzymes. To determine if any such residues play a role in GAT activity, all D, C, and H residues of GAT20-30C6 (SEQ ED NO:683) were individually mutated to alanine (Ala, A) and the mutated enzymes assayed for N- acetylglyphosate activity. Variants containing the substitutions D34A and H41A retained only about 2%>-3% of the activity of the unmodified enzyme, while the variant containing the substitution H138A exhibited essentially no measurable GAT activity. On the other hand, variants containing the substitutions H138R and H138S retained low but measurable GAT activity (particularly at pHs greater than 6.8), suggesting that His (and nominally Arg and Ser) at position 138 may serve as an active-site base.
Table 12
EXAMPLE 21: IMPROVING GAT EXPRESSION EN PLANTS Plants, animals, and microbes are known to have specific codon preferences that affect the efficiency of amino acid incorporation during translation of gene transcripts. Rare codons could cause problems with tRNA recruitment during translation, which could then lead to lower accumulation of the encoded protein. The original parental gat genes were from bacteria such as Bacillus licheniformis, and, as such, may not have an optimal codon distribution for expression in plants. Evolved gat genes of the invention have successfully been expressed in plants (see, e.g., Examples 9, 11, 13, and 17, above), yet an opportunity exists to improve protein production by increasing the translation efficiency in plants. One way to accomplish this is by substituting one or more codons in the gat coding sequence which are used infrequently in plants for codons for the same amino acid(s) which are more frequently used in plants, thereby generating silent mutations in the gat coding sequence with an unchanged sequence of the encoded protein. Tables showing the frequency of codon usage in corn, cotton and soybeans (available, for example, from the website maintained by the Kazusa DNA Research Eistitute, Chiba, Japan) were compared to generate the following table (Table 13) showing codons wliich are, in general, more frequently or less frequently utilized in either monocot or dicot plants.
Table 13
A second way to increase plant expression of microbial genes is to increase 10 the G+C content near the initiating methionine residue. Naturally-occurring coding sequences in plants tend to contain two or three G and/or C residues immediately downstream of the ATG initiation codon (Joshi et α/.(1997) Plant Mol. Biol. 35:993-1001). Introducing into the gat coding sequence one or two CG-rich codons immediately downstream of the ATG intitiation codon may create 15 a more plant-like coding sequence and thus may enhance its expression in plants. Substitution of the second codon (isoleucine, ATA) for an alanine codon (GCG) resulted in a Ile2Ala variant with reduced kcat compared to the unmodified enzyme. On the other hand, insertion of an alanine codon (either GCG or GCT) between the codons for Met at codon position 1 and He at codon position 2 resulted in a gat coding sequence encoding a GAT enzyme containing an Ala residue inserted between the Met at position 1 and the He at position 2. An exemplary GAT enzyme variant containing two alanines inserted between Metl and Ile2 denoted, identified as 22-15B4 M1MAA (to signify the insertion of two Ala residues immediately following the Met at position 1) and having the protein sequence SEQ ED NO:948, exhibited a reduced kcat compared to the unmodified enzyme 22-15B4 (SEQ ED NO:789). An exemplary GAT enzyme containing one alanine inserted between Metl and Ile2, denoted 22-15B4 MIMA (to signify the insertion of an Ala residue immediately following the Met at position 1), having the protein sequence SEQ ED NO:946, exhibited essentially unaltered kinetics compared to the unmodified enzyme 22-15B4. A general strategy for improving GAT expression in plants was developed.
Evolved gat coding sequences may be altered by replacing codons less frequently utilized in plants for codons more frequently utilized in plants, for example according to the table above. Codons less frequently utilized in plants (e.g., according to the above table) should generally be avoided, hi this manner at least one codon (such as, at least three codons, at least five codons, or at least least ten codons), may be changed in the gat coding sequence from codon(s) less frequently utitlized in plants to codon(s) more frequently utitlized in plants. The codons which are replaced may be located at the 5' end of the coding sequence (e.g., within the first 10 codons, within the first 20 codons, within the first 50 codons, or within the first 100 codons) of the gat coding sequence. Alternatively, the codons which are replaced may be located throughout the gat coding sequence. The more frequently utilized codons furthermore may be chosen to avoid more than about 5- 10 (such as, e.g., more than about 5, more than about 6, more than about 7, more than about 8, more than about 9 or more than about 10) consecutive occurrences of G+C or ofA+T within the coding sequence. The coding sequence may also be altered to contain one or two CG-rich codons immediately downstream of the ATG intitiation codon, such as, for example, by inserting an Ala codon (e.g., a frequently utilized Ala codon) immediately downstream of and adjacent to the initiating Met codon of the gat coding sequence. Table 14 provides exemplary gat coding sequences altered as described above.
Table 14
transformed into competent Agrobacterium tumefaciens strain C58 cells by electroporation (McCormac et al, Mol Biotechnol. 9:155-159, 1998). After growth on LB + 40ug/ml kanamycin plates for 2 days at 28°C, colonies were 5 inoculated into LB + 40ug/ml kanamycin liquid medium and shaken overnight at 28°C. The Agrobacterium cells were collected by centrifugation at 4000 g for 10 minutes and then resuspended in a volume of lOmM MgSO4 equivalent to the initial culture volume. This bacterial suspension was forced or "infiltrated" into the intercellular spaces of Nicotiana benthamiana leaves using a 1 ml plastic 10 syringe (with no needle). By infiltrating 200-300 μl of bacterial suspension into each spot (typically 3-4cm2 in infiltrated area), 4 or more spots could be arranged on a single leaf still attached to the plant. In some cases the gαt-containing Agrobacterium strain was diluted 5:1 or 10:1 with a second Agrobacterium strain lacking gat prior to infiltration. This dilution step has the effect of reducing the 15 overall expression of the gat gene in the plant cells, thereby preventing saturation and allowing easier visualization of expression differences between variants and constructs. After 3 days the leaf material was ground, extracted in aqueous buffer, and centrifuged. The supernatant, containing the soluble proteins, was subjected to SDS-PAGE, and the gel was blotted and probed with an antiGAT polyclonal 20 antibody. The level of GAT protein accumulated in tobacco leaves infiltrated with the GAT4620 gene was comparable to the level of protein accumulated in leaves transformed with the unmodified GAT25-8H7/4618 gene. Tobacco leaves harboring the GAT4621 gene, on the other hand, exhibited about two-fold greater GAT protein accumulation, as a percent of total protein, compared to leaves expressing the unmodified GAT25-8H7/4618 gene.
EXAMPLE 22: Tl STUDIES OF GLYPHOSATE-RESISTANT SOYBEAN EXPRESSLNG GAT TRANSGENES Soybean plants expressing GAT transgene 18-28D9c (SEQ ED NO:824) were produced using the methods described in Example 17. Tl seed was collected from glyphosate sprayed TO plants. Tl seed were germinated under greenhouse conditions in RediEarthR360 medium, available from Scotts, Marysville, OH, and sprayed at V2-V3 stage with either 2X or 4X Glyphosate (RoundUp ULTRA MAX™, available from Monsanto, St. Louise, MO) as per methods described in Example 17. Plants were scored after 10 days and leaf discoloration scores taken as described in Example 17. The Tl greenhouse spray data correlated well with previous greenhouse results at the TO plant stage. T2 seed was collected for further studies.
EXAMPLE 23: PRODUCTION OF GLYPHOSATE AND SULFONAMIDE RESISTANT SOYBEANS EXPRESSING GAT AND HRA TRANSGENES Soybean plants expressing GAT & HRA, high resistance allele of acetolactate synthase (U.S. Patent Nos. 5,605,011, 5,378,824, 5,141,870, and 5013,659), genes were produced using the methods described in Example 17. The HRA gene was used as selectable marker gene for transformation. The selection agent was chlorsulfuron at a concentration of 100 ng/ml. The selectable marker gene was comprised of the S-adenosyl-L-methionine synthetase (SAMS) promoter from Gly cine max (U.S. 2003/226166), HRA coding sequence from Gly cine max and acetolactate synthase terminator from Glycine max. The selectable marker gene was either linked to or co-bombarded with a GAT construct consisting of a synthetic constitutive promoter (U.S. Patent Nos. 6,072,050 and 6,555,673) or the maize Histone 2B promoter (U.S. Patent No. 6,177,611), a GAT variant (18-28D9c (SEQ ED NO:824)) and the Pin E terminator (Gyheung an et al., Plant Cell 1:115:122 (1989)). Transgenic plants were generated as described in Example 17. Levels of glyphosate resistance were determined as described in Example 17 using plant discoloration scores after 2X or 4X glyphosate application rates. The results shown in Table 15 demonstrate that different constitutive promoters driving GAT variant (18-28D9c (SEQ ED NO:824)) confer glyphosate resistance in TO plants.
Table 15. Resistance Scores at 10 days after treatment with 4X glyphosate. # EVENTS % EVENTS %
EVENTS TESTED WITH 4x @ 7-8 SCORE 3-6 SCORE
PHP20163a 58 15.5% (9) 77.6%
(45)
(SEQ LD NO:824)
(SCP1 PROMOTER)
PHP20558a 26 34.6% (9) 42.3%
(11)
(SEQ ID NO:824)
(H2B PROMOTER)
EXAMPLE 24: Tl PRE-EMERGENCE STUDIES OF SOYBEANS EXPRESSING GAT AND HRA TRANSGENES Tl seed generated from experiments as described in Example 17, were planted in pots of Tama Silt loam in the greenhouse. Pots were immediately sprayed with a pre-emergence application of chlorimuron, rimsulfuron or tribenuron at a rate of 70 gms a.i./hectare. Germinating plants were evaluated 10 days post spray application based on plant discoloration scores described in Example 17. All HRA and GAT events survived all pre-emergence spray applications with a rating of 9 (uninjured). These results demonstrate pre- emergence resistance to sulfonamide chemistry in soybeans.
EXAMPLE 25: Tl POST-EMERGENCE STUDIES OF SOYBEANS EXPRESSING GAT AND HRA TRANSGENES Tl seed generated from experiments as described in Example 17 were germinated in RediEarthR 360 medium in the greenhouse. Plants were sprayed at the V2-V3 stage (14 days after potting) with thifensulfuron, chlorimuron, rimsulfuron or tribenuron (70, 70, 35, 35 gm a.i./hectare, respectively). Plants were evaluated 10 days post application based on plant discoloration scores described in Example 17. Results are shown in Table 16.
Table 16. Resistance Scores at 10 days after Post-Emergence treatment with
Sulfonamide Chemistry. Average Resistance scores from GAT (SEQ ID NO:824)/SAMS promoter-HRA events
Unsprayed control 9
Chlorimuron (70gm a.i./ha) 7.75
Rimsulfuron (35gm a.i./ha) 2.21 Tribenuron (35 gm a.i./ha) 3.83
Thifensulfuron (70 gm a.i./ha) 7.81
Events having a plant discoloration rating 7 or 8 after thifensulfuron spray were sprayed with either a 2X or 4X application of glyphosate after 10 days as per methods described in Example 17. Events were evaluated based on discoloration scores described in Example 17. All thifensulfuron tolerant events survived the glyphosate spray with score of 7 or 8 (results not shown). These results demonstrate 100% correlation of thifensulfuron tolerance with glyphosate tolerance under greenhouse conditions conferred by HRA and GAT genes, respectively, at 70 gm a.i./hectare thifensulfuron and 2X glyphosate, respectively.
EXAMPLE 26: T3 STUDIES OF GLYPHOSATE-RESISTANT MAIZE PLANTS EXPRESSING GAT TRANSGENES Maize plants expressing GAT transgenes 20-H812 (SEQ LD NO:738) and
20-16A3 (SEQ ED NO:638) were produced using the methods described in Example 13. Plants were scored after 10 days and leaf discoloration scores taken as described in Example 13. Specifically, plants were sprayed at V4 leaf stage. The plants were thinned to equal spacing and stand counts after application of spray treatments. Commercially available NK603 (Monsanto, St. Louis, MO) was used as a control. Resistance scores are shown in Table 18. Plant height measurements were also taken 10 days after treatment and are shown in Table 18.
Table 18. Resistance Scores at 10 days after treatment with glyphosate 5 Resistance Scores 1-9 scale
Table 19. Plant Height (in inches) 10 days after treatment with glyphosate
10 EXAMPLE 27: T3 YIELD STUDIES OF GLYPHOSATE-RESISTANT MAIZE EXPRESSING GAT TRANSGENES T3 seed from Example 15 was used to generate T3 plants for the generation of glyphosate field tolerance data on hybrids. The experiment was conducted at 15 Viluco, Chile with four (4) replications using a split-plot design. Specifically, 3 entries were included. Two of the entries comprised maize plants expressing GAT variant transgenes 17-15H3 (SEQ ED NO:549). A glyphosate-resistant control NK603, which is commercially available from Monsanto, was the third entry. All entries were treated in the field with four different glyphosate spray treatments (OX, 4X at V4, 8X at V4, and 4X at V4 and 4X at V8) for each event. Plants were scored 10 days after treatment for plant height comparisons as described in Example 13. The T3 field spray data correlated well with the results previously obtained in the field as reported in Example 15. Specifically, all entries sprayed with IX and 4X glyphosate were similar in height to unsprayed controls. At the higher 4X at V4 and 4X at V8 rates, the GAT entries were temporarily set back between 12 and 17 % in height and the NK603 entry was set back 6%; however, later in the season (during reproductive maturity) the height of glyphosate-treated entries was the same as in the unsprayed entries. Moreover, yields among glyphosate-treated entries were neither numerically nor statistically reduced from unsprayed entries (LSD005= 11.8 bu./acre, average yield per entry = 243 bu./acre). Similar results were observed in preliminary agronomic trials with T2 plants of the same events that were planted in Johnston, LA and York, NE (data not shown).
EXAMPLE 28: T2 STUDIES OF GLYPHOSATE-RESISTANT MAIZE EXPRESSING GAT TRANSGENES Experiments were conducted on GAT positive and GAT negative iso-lines. Maize plants expressing GAT transgenes 18-28D9b (SEQ ED NO: 814), 17- 15H3 (SEQ LD NO:549), 20-8H12 (SEQ ID NO:738), 20-16A3 (SEQ ED NO:638), were produced using the methods described in Example 17. T2 plants were examined. GAT positive T2 plants were sprayed at V4 with IX (26 oz/A ULTRA MAX™). GAT negative plants were PCR sampled at V4. GAT positive plants were removed from the row. No glyphosate was applied to the GAT negative plants. Plants were thinned to create equal spacing among plants within each row. Four (4) replications were performed. Grain from five (5) ears harvested from the middle of each row was dried and weighed. As shown in Table 20 no yield reduction was detected for any of the constructs. Table 20. Yield data.
EXAMPLE 29: AMINO ACED SUBSTRATES OF GAT POLYPEPTIDES GAT activity of several GAT polypeptides of the present invention was evaluated with respect to a number of amino acid substrates. The GAT polypeptide, AcCoA and amino substrate were incubated in 25 mM Hepes, pH 6.8, 10%) ethylene glycol in the wells of a 96-well polystyrene plate. After 30 minutes, the reactions were stopped by the addition of 30 μl of 10 mM 5, 5'-dithiobis-2- nitrobenzoate (DTNB) in 500 mM Tris, pH 7.5. After 2 minutes, absorbance was read at 412 nm in a Spectramax Plus plate reader (Molecular Devices, Sunnyvale, CA). In addition to glyphosate, native GAT polypeptide 401 (SEQ ED NO: 6) (or B6 (SEQ ED NO: 7), in the case of phosphoserine) exhibited detectable activity with 12 amino acids. The native GAT polypeptide was about as active with L- aspartate, about 4.7 times more active with L-serine, and about 2 times more active with phospho-L-serine than with glyphosate. When compared to native GAT polypeptide, non-native GAT polypeptides 17-15H3 (SEQ ID NO: 601) and 25- 8H7 (SEQ ED NO: 907) exhibited a 40-fold increase in activity with aspartate, but loss of activity with respect to serine and phosphoserine. In addition to aspartate and serine, activity with native GAT polypeptide at 3% or more of that toward glyphosate when present at 1 mM was observed with the following L-amino acids: histidine (10%>), tyrosine (18%), threonine (250%>), valine (12%), glutamate (51%), asparagine (27%>), glutamine (32%), alanine (33%), glycine (21%>) and cysteine (50%). Activity with the other protein amino acids was either undetected or less than 3%> that of GAT activity towards glyphosate as the substrate. No detectable activity was observed with respect to the native GAT polypeptide on the N-methyl derivatives of L-aspartate (2 mM), L- alanine (10 mM) and glycine (i.e., sarcosine, 10 mM). The percentages refer to percent activity relative to activity of the GAT polypeptide towards the substrate, glyphosate. Some of the data is shown below in Table 21.
EXAMPLE 30: EFFECT OF PH ON GAT ACTIVITY The pH optima of kcat and KM for wild-type enzyme B6 (SEQ ED NO: 7) and GAT polypeptide 17-15H3 (SEQ ED NO: 601) were determined using the spectrophotometric assay described in Example 7 except that assay buffer was 50 mM Hepes and 10%> ethylene glycol, titrated to a range of pH values. Protein concentrations were determined by the UV absorbance assay described in Example 19. The effect of pH on KM and Kcat is shown in Figure 18 for clones B6 (SEQ ED NO: 7) and 17-15H3 (SEQ ED NO: 601).
While the foregoing invention has been described in some detail for purposes of clarity and understanding, it will be clear to one skilled in the art from a reading of this disclosure that various changes in form and detail can be made without departing from the true scope of the invention. For example, all the techniques, methods, compositions, apparatus and systems described above may be used in various combinations. The invention is intended to include all methods and reagents described herein, as well as all polynucleotides, polypeptides, cells, organisms, plants, crops, etc., that are the products of these novel methods and reagents. All publications, patents, patent applications, or other documents cited in this application are incorporated by reference in their entirety for all purposes to the same extent as if each individual publication, patent, patent application, or other document were individually indicated to be incorporated by reference for all purposes.

Claims

WHAT IS CLAIMED IS: 1. An isolated or recombinant polynucleotide comprising:
(a) a nucleotide sequence encoding an amino acid sequence that can be optimally aligned with a sequence selected from the group consisting of SEQ ED NO:300, SEQ LD NO:445 and SEQ ED NO:457 to generate a similarity score of at least 460, using the BLOSUM62 matrix, a gap existence penalty of 11, and a gap extension penalty of 1, wherein the amino acid sequence comprises one or more amino acid residues which meet at least one restriction selected from the group consisting of: (i) at positions 18 and 38, a Z5 amino acid residue; (ii) at position 62, a ZI amino acid residue; (iii) at position 124, a Z6 amino acid residue; and (iv) at position 144, a Z2 amino acid residue, wherein: j ZI is an amino acid residue selected from the group consisting of A, I, , M, and V; Z2 is an amino acid residue selected from the group consisting of F, W, and Y; Z5 is an amino acid residue selected from the group consisting of D and E; Z6 is an amino acid residue selected from the group consisting of C, G, and P; or
(b) a complementary nucleotide sequence thereof.
2. The isolated or recombinant polynucleotide of claim 1, wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 90%> conform to the following restrictions:
(a) at positions 2, 4, 15, 19, 26, 28, 31, 45, 51, 54, 86, 90, 91, 97, 103, 105, 106, 114, 123, 129, 139, and/or 145 the amino acid residue is Bl; and (b) at positions 3, 5, 8, 10, 11, 14, 17, 24, 27, 32, 37, 47, 48, 49, 52, 57, 58, 61, 63, 68, 69, 79, 80, 82, 83, 89, 92, 100, 101, 104, 119, 120, 125, 126, 128, 131, and/or 143 the amino acid residue is B2; wherein Bl is an amino acid selected from the group consisting of A, I, L, M, F, W, Y, and V; and B2 is an amino acid selected from the group consisting of R, N, D, C, Q, E, G, H, K, P, S, and T.
3. The isolated or recombinant polynucleotide of claim 1, wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 80% conform to the following restrictions:
(a) at positions 2, 4, 15, 19, 26, 28, 51, 54, 86, 90, 91, 97, 103, 105, 106, 114, 129, 139, and/or 145 the amino acid residue is ZI;
(b) at positions 31 and/or 45 the amino acid residue is Z2;
(c) at position 8 the amino acid residue is Z3; (d) at position 89 the amino acid residue is Z3 or Z6;
(e) at positions 82, 92, 101 and/or 120 the amino acid residue is Z4;
(f) at positions 3, 11, 27 and/or 79 the amino acid residue is Z5;
(g) at position 18 the amino acid residue is Z4 or Z5; (h) at position 123 the amino acid residue is ZI or Z2; (i) at positions 12, 33, 35, 39, 53, 59, 112, 132, 135, 140, and/or 146 the amino acid residue is ZI or Z3;
(j) at position 30 the amino acid residue is ZI ;
(k) at position 6 the amino acid residue is Z6;
(1) at position 81 the amino acid residue is Z2 or Z4; (m) at position 113 the amino acid residue is Z3;
(n) at position 138 the amino acid residue is Z4;
(o) at position 142 the amino acid residue is Z2;
(p) at positions 57 and/or 126 the amino acid residue is Z3 or Z4;
(q) at position 5, 17, and 61 the amino acid residue is Z4; (r) at position 24 the amino acid residue is Z3;
(s) at position 104 the amino acid residue is Z5;
(t) at positions 52, and/or 69 the amino acid residue is Z3; (u) at positions 14 and/or 119 the amino acid residue is Z5; (v) at positions 10, 32, 63, and/or 83 the amino acid residue is Z5 ; (w) at positions 48 and/or 80 the amino acid residue is Z6; (x) at position 40 the amino acid residue is ZI or Z2 ; (y) at position 96 the amino acid residue is Z3 or Z5; (z) at position 65 the amino acid residue is Z3, Z4, or Z6;
(aa) at positions 84 and/or 115 the amino acid residue is Z3;
(ab) at position 93 the amino acid residue is Z4;
(ac) at position 130 the amino acid residue is Z2; (ad) at position 58 the amino acid residue is Z3, Z4 or Z6;
(ae) at position 47 the amino acid residue is Z4 or Z6;
(af) at positions 49 and/or 100 the amino acid residue is Z3 or Z4;
(ag) at position 68 the amino acid residue is Z4 or Z5; (ah) at position 143 the amino acid residue is Z4; (ai) at position 131 the amino acid residue is Z5;
(aj) at positions 125 and/or 128 the amino acid residue is Z5; (ak) at position 67 the amino acid residue is Z3 or Z4; (al) at position 60 the amino acid residue is Z5; and (am) at position 37 the amino acid residue is Z4 or Z6; wherein ZI is an amino acid selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid selected from the group consisting of F, W, and Y; Z3 is an amino acid selected from the group consisting of N, Q, S, and T; Z4 is an amino acid selected from the group consisting of R, H, and K; Z5 is an amino acid selected from the group consisting of D and E; and Z6 is an amino acid selected from the group consisting of C, G, and P.
4. The isolated or recombinant polynucleotide of claim 3, wherein of the amino acid residues in the amino acid sequence that correspond to the positions specified in (a) - (am), at least 90%> conform to the amino acid residue restrictions specified in (a) - (am).
5. The isolated or recombinant polynucleotide of any of claims 1-4, wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 90% conform to the following restrictions:
(a) at positions 1, 7, 9, 13, 20, 36, 42, 46, 50, 56, 64, 70, 72, 75, 76, 78, 94, 98, 107, 110, 117, 118, 121, and/or 141 the amino acid residue is Bl; and
(b) at positions 16, 21, 22, 23, 25, 29, 34, 41, 43, 44, 55, 66, 71, 73, 74, 77, 85, 87, 88, 95, 99, 102, 108, 109, 111, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is B2; wherein Bl is an amino acid selected from the group consisting of A, I, L, M, F, W, Y, and V; and B2 is an amino acid selected from the group consisting of R, N, D, C, Q, E, G, H, K, P, S, and T.
6. The isolated or recombinant polynucleotide of any of claims 1-4 wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 90%o conform to the following restrictions:
(a) at positions 1, 7, 9, 13, 20, 42, 46, 50, 56, 64, 70, 72, 75, 76, 78, 94, 98, 107, 110, 117, 118, 121, and/or 141 the amino acid residue is Bl; and
(b) at positions 16, 21, 22, 23, 25, 29, 34, 36, 41, 43, 44, 55, 66, 71, 73, 74, 77, 85, 87, 88, 95, 99,102, 108, 109, 111, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is B2; wherein Bl is an amino acid selected from the group consisting of A, I, L, M, F, W, Y, and V; and B2 is an amino acid selected from the group consisting of R, N, D, C, Q, E, G, H, K, P, S, and T. 7. The isolated or recombinant polynucleotide of any of claims 1 -
4 wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 90% conform to the following restrictions: (a) at positions 1,
7, 9, 20, 42, 50, 72, 75, 76, 78, 94, 98, 110, 121, and/or 141 the amino acid residue is ZI; (b) at positions 13, 46, 56, 70, 107, 117, and/or 118 the amino acid residue is Z2;
(c) at positions 23, 55, 71, 77, 88, and/or 109 the amino acid residue is Z3;
(d) at positions 16, 21, 41, 73, 85, 99, and/or 111 the amino acid residue is Z4; (e) at positions 34 and/or 95 the amino acid residue is Z5;
(f) at position 22, 25, 29, 43, 44, 66, 74, 87, 102, 108, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is Z6; wherein ZI is an amino acid selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid selected from the group consisting of F, W, and Y; Z3 is an amino acid selected from the group consisting of N, Q, S, and T; Z4 is an amino acid selected from the group consisting of R, H, and K; Z5 is an amino acid selected from the group consisting of D and E; and Z6 is an amino acid selected from the group consisting of C, G, and P.
8. The isolated or recombinant polynucleotide of claim 7, wherein the amino acid sequence comprises at position 36, an amino acid residue selected from the group consisting of ZI and Z3.
9. The isolated or recombinant polynucleotide of claim 7 or claim 8, wherein the amino acid sequence comprises at position 64, an amino acid residue selected from the group consisting of ZI and Z2.
10. The isolated or recombinant polynucleotide of any claims 1 - 9 wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 80% conform to the following restrictions:
(a) at position 2 the amino acid residue is I or L;
(b) at position 3 the amino acid residue is E;
(c) at position 4 the amino acid residue is V or I; (d) at position 5 the amino acid residue is K;
(e) at position 6 the amino acid residue is P;
(f) at position 8 the amino acid residue is N;
(g) at position 10 the amino acid residue is E;
(h) at position 11 the amino acid residue is D or E; (i) at position 12 the amino acid residue is T;
(j) at position 14 the amino acid residue is E or D; (k) at position 15 the amino acid residue is L; (1) at position 17 the amino acid residue is H;
(m) at position 18 the amino acid residue is R, E or K;
(n) at position 19 the amino acid residue is I or V;
(o) at position 24 the amino acid residue is Q; (p) at position 26 the amino acid residue is M, L, V or I;
(q) at position 27 the amino acid residue is E;
(r) at position 28 the amino acid residue is A or V;
(s) at position 30 the amino acid residue is M;
(t) at position 31 the amino acid residue is Y or F; (u) at position 32 the amino acid residue is E or D;
(v) at position 33 the amino acid residue is T or S;
(w) at position 35 the amino acid residue is L;
(x) at position 37 the amino acid residue is R, G, E or Q;
(y) at position 39 the amino acid residue is A or S; (z) at position 40 the amino acid residue is F or L ;
(aa) at position 45 the amino acid residue is Y or F;
(ab) at position 47 the amino acid residue is R or G;
(ac) at position 48 the amino acid residue is G;
(ad) at position 49 the amino acid residue is K, R, or Q; (ae) at position 51 the amino acid residue is I or V;
(af) at position 52 the amino acid residue is S;
(ag) at position 53 the amino acid residue is I or V; (ah) at position 54 the amino acid residue is A; (ai) at position 57 the amino acid residue is H or N; (aj) at position 58 the amino acid residue is Q, K, R or P;
(ale) at position 59 the amino acid residue is A;
(al) at position 60 the amino acid residue is E;
(am) at position 61 the amino acid residue is H or R;
(an) at position 63 the amino acid residue is E or D; (ao) at position 65 the amino acid residue is E, P or Q;
(ap) at position 67 the amino acid residue is Q or R;
(aq) at position 68 the amino acid residue is K or E; (ar) at position 69 the amino acid residue is Q; (as) at position 79 the amino acid residue is E; (at) at position 80 the amino acid residue is G; (au) at position 81 the amino acid residue is Y, H or F; (av) at position 82 the amino acid residue is R;
(aw) at position 83 the amino acid residue is E or D; (ax) at position 84 the amino acid residue is Q; (ay) at position 86 the amino acid residue is A; (az) at position 89 the amino acid residue is G, T or S; (ba) at position 90 the amino acid residue is L;
(bb) at position 91 the amino acid residue is L, I or V; (be) at position 92 the amino acid residue is R or K;
(bd) at position 93 the amino acid residue is H;
(be) at position 96 the amino acid residue is E or Q; (bf) at position 97 the amino acid residue is I;
(bg) at position 100 the amino acid residue is K or N ;
(bh) at position 101 the amino acid residue is K or R;
(bi) at position 103 the amino acid residue is A or V;
(bj) at position 104 the amino acid residue is D; (bk) at position 105 the amino acid residue is M, L or I;
(bl) at position 106 the amino acid residue is L;
(bm) at position 112 the amino acid residue is T or A;
(bn) at position 113 the amino acid residue is S or T;
(bo) at position 114 the amino acid residue is A; (bp) at position 115 the amino acid residue is S;
(bq) at position 119 the amino acid residue is K or R;
(br) at position 120 the amino acid residue is K or R;
(bs) at position 123 the amino acid residue is F or L;
(bt) at position 125 the amino acid residue is E; (bu) at position 126 the amino acid residue is Q or H;
(bv) at position 128 the amino acid residue is E or D;
(bw) at position 129 the amino acid residue is V or I; (bx) at position 130 the amino acid residue is F; (by) at position 131 the amino acid residue is D or E; (bz) at position 132 the amino acid residue is T; (ca) at position 135 the amino acid residue is V; (cb) at position 138 the amino acid residue is H;
(cc) at position 139 the amino acid residue is I;
(cd) at position 140 the amino acid residue is L or M;
(ce) at position 142 the amino acid residue is Y;
(cf) at position 143 the amino acid residue is K or R; (eg) at position 145 the amino acid residue is L or I; and (ch) at position 146 the amino acid residue is T.
11. The isolated or recombinant polynucleotide of claim 10, wherein of the amino acid residues in the amino acid sequence that correspond to the positions specified in (a) - (ch), at least 90%> conform to the amino acid residue restrictions specified in (a) — (ch).
12. The isolated or recombinant polynucleotide of claim 1, wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 80% conform to the following restrictions:
(a) at positions 9, 76, 94 and 110 the amino acid residue is A;
(b) at positions 29 and 108 the amino acid residue is C;
(c) at position 34 the amino acid residue is D;
(d) at position 95 the amino acid residue is E; (e) at position 56 the amino acid residue is F;
(f) at positions 43, 44, 66, 74, 87, 102, 116, 122, 127 and 136 the amino acid residue is G;
(g) at position 41 the amino acid residue is H; (h) at position 7 the amino acid residue is I; (i) at position 85 the amino acid residue is K;
(j) at positions 20, 42, 50, 78 and 121 the amino acid residue is L; (k) at positions 1 and 141 the amino acid residue is M; (1) at positions 23 and 109 the amino acid residue is N; (m) at positions 22, 25, 133, 134 and 137 the amino acid residue is P; (n) at position 71 the amino acid residue is Q; (o) at positions 16, 21, 73, 99 and 111 the amino acid residue is R; (p) at position 55 the amino acid residue is S; (q) at position 77 the amino acid residue is T; (r) at position 107 the amino acid residue is W; and (s) at position 13, 46, 70 and 118 the amino acid residue is Y.
13. The isolated or recombinant polynucleotide of any of claims
1-12, wherein the amino acid sequence comprises an amino acid residue selected from the group consisting of:
(a) at position 36 the amino acid residue is M, L, or T;
(b) at position 72 the amino acid residue is L or I; (c) at position 75 the amino acid residue is M or V;
(d) at position 64 the amino acid residue is L, I, or F;
(e) at position 88 the amino acid residue is T or S;
(f) at position 117 the amino acid residue is Y or F.
14. The isolated or recombinant polynucleotide of any of claims
1-13, wherein the amino acid sequence comprises one or more amino acid residues selected from the group consisting of:
(a) at position 14 the amino acid residue is D;
(b) at position 18 the amino acid residue is E; (c) at position 26 the amino acid residue is M or V;
(e) at position 30 the amino acid residue is I;
(f) at position 32 the amino acid residue is D;
(g) at position 36 the amino acid residue is M or T; (i) at position 37 the amino acid residue is C; (j) at position 38 the amino acid residue is D;
(j) at position 53 the amino acid residue is V;
(k) at position 58 the amino acid residue is R; (1) at position 61 the amino acid residue is R;
(m) at position 62 the amino acid residue is L;
(n) at position 64 the amino acid residue is I or F;
(o) at position 65 the amino acid residue is P;
(p) at position 72 the amino acid residue is I;
(q) at position 75 the amino acid residue is V;
(r) at position 88 the amino acid residue is T;
(s) at position 89 the amino acid residue is G;
(t) at position 91 the amino acid residue is L;
(u) at position 98 the amino acid residue is I;
(v) at position 105 the amino acid residue I;
(w) at position 112 the amino acid residue is A;
(x) at position 124 the amino acid residue is G or C;
(y) at position 128 the amino acid residue is D;
(z) at position 140 the amino acid residue is M;
(aa) at position 143 the amino acid residue is R; and
(ab) at position 144 the amino acid residue is W.
15. The isolated or recombinant polynucleotide of claim 14, wherein of the amino acid residues in the amino acid sequence that correspond to the positions specified in (a) through (ab), at least 80% conform to the amino acid residue restrictions specified in (a) through (ab).
16. The isolated or recombinant polynucleotide of any of claims 1-15, wherein the amino acid sequence comprises one or more amino acid residues selected from the group consisting of:
(a) at position 41 the amino acid residue is H;
(b) at position 138 the amino acid residue is H;
(c) at position 34 the amino acid residue is N; and (d) at position 55 the amino acid residue is S.
17. An isolated or recombinant polynucleotide comprising a nucleotide sequence selected from the group consisting of: (a) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:577; (b) a nucleotide sequence encoding an amino acid sequence that is at least 97% identical to SEQ ED NO:578; (c) a nucleotide sequence encoding an amino acid sequence that is at least 97% identical to SEQ ED NO:621; (d) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:579; (e) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:602; (f) a nucleotide sequence encoding an amino acid sequence that is at least 95% identical to SEQ ED NO:697; (g) a nucleotide sequence encoding an amino acid sequence that is at least 96% identical to SEQ ED NO:721; (h) a nucleotide sequence encoding an amino acid sequence that is at least 97% identical to SEQ ED NO:613; (i) a nucleotide sequence encoding an amino acid sequence that is at least 89% identical to SEQ LD NO:677; (j) a nucleotide sequence encoding an amino acid sequence that is at least 96% identical to SEQ ED NO:584; (k) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:707; (1) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:616; (m) a nucleotide sequence encoding an amino acid sequence that is at least 96%> identical to SEQ LD NO:612; (n) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:590; (o) a nucleotide sequence that is complementary to any one of (a) through (n).
18. The isolated or recombinant polynucleotide of claim 17, wherein the amino acid sequence comprises one or more amino acid residues selected from the group consisting of: (i) at positions 18 and 38, a Z5 amino acid residue; (ii) at position 62, a ZI amino acid residue; (iii) at position 124, a Z6 amino acid residue; and (iv) at position 144, a Z2 amino acid residue, wherein: ZI is an amino acid residue selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid residue selected from the group consisting of F, W, and Y; Z5 is an amino acid residue selected from the group consisting of D and E; Z6 is an amino acid residue selected from the group consisting of C, G, and P. 19. The isolated or recombinant polynucleotide of claim 17, wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 90%> conform to the following restrictions: (a) at positions 2, 4, 15,
19, 26, 28, 31, 45, 51, 54, 86, 90, 91, 97, 103, 105, 106, 114, 123, 129, 139, and/or 145 the amino acid residue is Bl; and (b) at positions 3, 5, 8, 10, 11, 14, 17, 24, 27, 32, 37, 47, 48, 49, 52, 57, 58, 61, 63, 68, 69, 79, 80, 82, 83, 89, 92, 100, 101, 104, 119, 120, 125, 126, 128, 131, and/or 143 the amino acid residue is B2; wherein Bl is an amino acid selected from the group consisting of A, I, L, M, F, W, Y, and V; and B2 is an amino acid selected from the group consisting of R, N, D, C, Q, E, G, H, K, P, S, and T.
20. The isolated or recombinant polynucleotide of claim 17, wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 80%> conform to the following restrictions:
(a) at positions 2, 4, 15, 19, 26, 28, 51, 54, 86, 90, 91, 97, 103, 105, 106, 114, 129, 139, and/or 145 the amino acid residue is ZI;
(b) at positions 31 and/or 45 the amino acid residue is Z2;
(c) at position 8 the amino acid residue is Z3;
(d) at position 89 the amino acid residue is Z3 or Z6;
(e) at positions 82, 92, 101 and/or 120 the amino acid residue is Z4; (f) at positions 3, 11, 27 and/or 79 the amino acid residue is Z5;
(g) at position 18 the amino acid residue is Z4 or Z5;
(h) at position 123 the amino acid residue is ZI or Z2;
(i) at positions 12, 33, 35, 39, 53, 59, 112, 132, 135, 140, and/or 146 the amino acid residue is ZI or Z3; (j) at position 30 the amino acid residue is ZI;
(k) at position 6 the amino acid residue is Z6;
(1) at position 81 the amino acid residue is Z2 or Z4;
(m) at position 113 the amino acid residue is Z3;
(n) at position 138 the amino acid residue is Z4; (o) at position 142 the amino acid residue is Z2;
(p) at positions 57 and/or 126 the amino acid residue is Z3 or Z4;
(q) at position 5, 17, and 61 the amino acid residue is Z4;
(r) at position 24 the amino acid residue is Z3;
(s) at position 104 the amino acid residue is Z5; (t) at positions 52, and/or 69 the amino acid residue is Z3;
(u) at positions 14 and/or 119 the amino acid residue is Z5;
(v) at positions 10, 32, 63, and/or 83 the amino acid residue is Z5 ;
(w) at positions 48 and/or 80 the amino acid residue is Z6;
(x) at position 40 the amino acid residue is ZI or Z2 ; (y) at position 96 the amino acid residue is Z3 or Z5;
(z) at position 65 the amino acid residue is Z3, Z4, or Z6;
(aa) at positions 84 and/or 115 the amino acid residue is Z3; (ab) at position 93 the amino acid residue is Z4;
(ac) at position 130 the amino acid residue is Z2;
(ad) at position 58 the amino acid residue is Z3, Z4 or Z6;
(ae) at position 47 the amino acid residue is Z4 or Z6; (af) at positions 49 and/or 100 the amino acid residue is Z3 or Z4; (ag) at position 68 the amino acid residue is Z4 or Z5; (ah) at position 143 the amino acid residue is Z4; (ai) at position 131 the amino acid residue is Z5; (aj) at positions 125 and/or 128 the amino acid residue is Z5; (ak) at position 67 the amino acid residue is Z3 or Z4; (al) at position 60 the amino acid residue is Z5; and (am) at position 37 the amino acid residue is Z4 or Z6; wherein ZI is an amino acid selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid selected from the group consisting of F, W, and Y; Z3 is an amino acid selected from the group consisting of N, Q, S, and T; Z4 is an amino acid selected from the group consisting of R, H, and K; Z5 is an amino acid selected from the group consisting of D and E; and Z6 is an amino acid selected from the group consisting of C, G, and P. 21. The isolated or recombinant polynucleotide of claim 20, wherein of the amino acid residues in the amino acid sequence that correspond to the positions specified in (a) — (am), at least 90%> conform to the amino acid residue restrictions specified in (a) — (am). 22. The isolated or recombinant polynucleotide ofany of claims
17-21, wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 90%) conform to the following restrictions: (a) at positions 1, 7, 9, 13, 20, 36, 42, 46, 50, 56, 64, 70, 72, 75, 76, 78, 94, 98, 107, 110, 117, 118, 121, and/or 141 the amino acid residue is Bl; and (b) at positions 16,
21,
22, 23, 25, 29, 34, 41, 43, 44, 55, 66, 71, 73, 74, 77, 85, 87, 88, 95, 99, 102, 108, 109, 111, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is B2; wherein Bl is an amino acid selected from the group consisting of A, I, L, M, F, W, Y, and V; and B2 is an amino acid selected from the group consisting of R, N, D, C, Q, E, G, H, K, P, S, and T.
23. The isolated or recombinant polynucleotide of any of claims 17-22 wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 90%> conform to the following restrictions:
(a) at positions 1, 7, 9, 13, 20, 42, 46, 50, 56, 64, 70, 72, 75, 76, 78, 94, 98, 107, 110, 117, 118, 121, and/or 141 the amino acid residue is Bl; and
(b) at positions 16, 21, 22, 23, 25, 29, 34, 36, 41, 43, 44, 55, 66, 71, 73, 74, 77, 85, 87, 88, 95, 99,102, 108, 109, 111, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is B2; wherein Bl is an amino acid selected from the group consisting of A, I, L, M, F, W, Y, and V; and B2 is an amino acid selected from the group consisting of R, N, D, C, Q, E, G, H, K, P, S, and T.
24. The isolated or recombinant polynucleotide of any of claims 17-22 wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 90%> conform to the following restrictions: (a) at positions 1, 7, 9, 20, 42, 50, 72, 75, 76, 78, 94, 98, 110, 121, and/or 141 the amino acid residue is ZI;
(b) at positions 13, 46, 56, 70, 107, 117, and/or 118 the amino acid residue is Z2;
(c) at positions 23, 55, 71, 77, 88, and/or 109 the amino acid residue is Z3;
(d) at positions 16, 21, 41, 73, 85, 99, and/or 111 the amino acid residue is Z4; (e) at positions 34 and/or 95 the amino acid residue is Z5;
(f) at position 22, 25, 29, 43, 44, 66, 74, 87, 102, 108, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is Z6; wherein ZI is an amino acid selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid selected from the group consisting of F, W, and Y; Z3 is an amino acid selected from the group consisting of N, Q, S, and T; Z4 is an amino acid selected from the group consisting of R, H, and K; Z5 is an amino acid selected from the group consisting of D and E; and Z6 is an amino acid selected from the group consisting of C, G, and P.
25. The isolated or recombinant polynucleotide of claim 24, wherein the amino acid sequence comprises at position 36, an amino acid residue selected from the group consisting of ZI and Z3.
26. The isolated or recombinant polynucleotide of claim 24 or claim 25, wherein the amino acid sequence comprises at position 64, an amino acid residue selected from the group consisting of ZI and Z2.
27. The isolated or recombinant polynucleotide of any claims 17-26 wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 80%> conform to the following restrictions: (a) at position 2 the amino acid residue is I or L;
(b) at position 3 the amino acid residue is E;
(c) at position 4 the amino acid residue is V or I;
(d) at position 5 the amino acid residue is K;
(e) at position 6 the amino acid residue is P; (f) at position 8 the amino acid residue is N;
(g) at position 10 the amino acid residue is E; (h) at position 11 the amino acid residue is D or E; (i) at position 12 the amino acid residue is T; (j) at position 14 the amino acid residue is E or D; (k) at position 15 the amino acid residue is L; (1) at position 17 the amino acid residue is H; (m) at position 18 the amino acid residue is R, E or K; (n) at position 19 the amino acid residue is I or V;
(o) at position 24 the amino acid residue is Q;
(p) at position 26 the amino acid residue is M, L, V or I;
(q) at position 27 the amino acid residue is E; (r) at position 28 the amino acid residue is A or V;
(s) at position 30 the amino acid residue is M;
(t) at position 31 the amino acid residue is Y or F;
(u) at position 32 the amino acid residue is E or D;
(v) at position 33 the amino acid residue is T or S; (w) at position 35 the amino acid residue is L;
(x) at position 37 the amino acid residue is R, G, E or Q; (y) at position 39 the amino acid residue is A or S;
(z) at position 40 the amino acid residue is F or L ;
(aa) at position 45 the amino acid residue is Y or F; (ab) at position 47 the amino acid residue is R or G;
(ac) at position 48 the amino acid residue is G;
(ad) at position 49 the amino acid residue is K, R, or Q;
(ae) at position 51 the amino acid residue is I or V;
(af) at position 52 the amino acid residue is S; (ag) at position 53 the amino acid residue is I or V;
(ah) at position 54 the amino acid residue is A;
(ai) at position 57 the amino acid residue is H or N;
(aj) at position 58 the amino acid residue is Q, K, R or P;
(ak) at position 59 the amino acid residue is A; (al) at position 60 the amino acid residue is E;
(am) at position 61 the amino acid residue is H or R;
(an) at position 63 the amino acid residue is E or D;
(ao) at position 65 the amino acid residue is E, P or Q;
(ap) at position 67 the amino acid residue is Q or R; (aq) at position 68 the amino acid residue is K or E;
(ar) at position 69 the amino acid residue is Q;
(as) at position 79 the amino acid residue is E; (at) at position 80 the amino acid residue is G; (au) at position 81 the amino acid residue is Y, H or F; (av) at position 82 the amino acid residue is R; (aw) at position 83 the amino acid residue is E or D; (ax) at position 84 the amino acid residue is Q; (ay) at position 86 the amino acid residue is A; (az) at position 89 the amino acid residue is G, T or S;
(ba) at position 90 the amino acid residue is L;
(bb) at position 91 the amino acid residue is L, I or V; (be) at position 92 the amino acid residue is R or K;
(bd) at position 93 the amino acid residue is H;
(be) at position 96 the amino acid residue is E or Q;
(bf) at position 97 the amino acid residue is I;
(bg) at position 100 the amino acid residue is K or N ; (bh) at position 101 the amino acid residue is K or R;
(bi) at position 103 the amino acid residue is A or V;
(bj) at position 104 the amino acid residue is D;
(bk) at position 105 the amino acid residue is M, L or I;
(bl) at position 106 the amino acid residue is L; (bm) at position 112 the amino acid residue is T or A;
(bn) at position 113 the amino acid residue is S or T;
(bo) at position 114 the amino acid residue is A;
(bp) at position 115 the amino acid residue is S;
(bq) at position 119 the amino acid residue is K or R; (br) at position 120 the amino acid residue is K or R;
(bs) at position 123 the amino acid residue is F or L;
(bt) at position 125 the amino acid residue is E;
(bu) at position 126 the amino acid residue is Q or H;
(bv) at position 128 the amino acid residue is E or D; (bw) at position 129 the amino acid residue is V or I;
(bx) at position 130 the amino acid residue is F;
(by) at position 131 the amino acid residue is D or E; (bz) at position 132 the amino acid residue is T;
(ca) at position 135 the amino acid residue is V;
(cb) at position 138 the amino acid residue is H;
(cc) at position 139 the amino acid residue is I; (cd) at position 140 the amino acid residue is L or M;
(ce) at position 142 the amino acid residue is Y;
(cf) at position 143 the amino acid residue is K or R; (eg) at position 145 the amino acid residue is L or I; and (ch) at position 146 the amino acid residue is T.
28. The isolated or recombinant polynucleotide of claim 27, wherein of the amino acid residues in the amino acid sequence that correspond to the positions specified in (a) - (ch), at least 90% conform to the amino acid residue restrictions specified in (a) - (ch).
29. The isolated or recombinant polynucleotide of claim 17, wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 80% conform to the following restrictions: (a) at positions 9, 76, 94 and 110 the amino acid residue is A; (b) at positions 29 and 108 the amino acid residue is C;
(c) at position 34 the amino acid residue is D;
(d) at position 95 the amino acid residue is E;
(e) at position 56 the amino acid residue is F;
(f) at positions 43, 44, 66, 74, 87, 102, 116, 122, 127 and 136 the amino acid residue is G;
(g) at position 41 the amino acid residue is H; (h) at position 7 the amino acid residue is I; (i) at position 85 the amino acid residue is K;
(j) at positions 20, 42, 50, 78 and 121 the amino acid residue is L; (k) at positions 1 and 141 the amino acid residue is M; (1) at positions 23 and 109 the amino acid residue is N; (m) at positions 22, 25, 133, 134 and 137 the amino acid residue is P; (n) at position 71 the amino acid residue is Q;
(o) at positions 16, 21, 73, 99 and 111 the amino acid residue is R;
(p) at position 55 the amino acid residue is S;
(q) at position 77 the amino acid residue is T;
(r) at position 107 the amino acid residue is W; and
(s) at position 13, 46, 70 and 118 the amino acid residue is Y.
30. The isolated or recombinant polynucleotide of any of claims 1-29, wherein the amino acid sequence comprises one or more amino acid residues selected from the group consisting of:
(a) at position 14 the amino acid residue is D;
(b) at position 18 the amino acid residue is E;
(c) at position 26 the amino acid residue is M or V; (e) at position 30 the amino acid residue is I; (f) at position 32 the amino acid residue is D;
(g) at position 36 the amino acid residue is M or T;
(i) at position 37 the amino acid residue is C;
(j) at position 38 the amino acid residue is D;
(j) at position 53 the amino acid residue is V; (k) at position 58 the amino acid residue is R;
(1) at position 61 the amino acid residue is R;
(m) at position 62 the amino acid residue is L;
(n) at position 64 the amino acid residue is I or F;
(o) at position 65 the amino acid residue is P; (p) at position 72 the amino acid residue is I;
(q) at position 75 the amino acid residue is V;
(r) at position 88 the amino acid residue is T;
(s) at position 89 the amino acid residue is G;
(t) at position 91 the amino acid residue is L; (u) at position 98 the amino acid residue is I;
(v) at position 105 the amino acid residue I;
(w) at position 112 the amino acid residue is A; (x) at position 124 the amino acid residue is G or C;
(y) at position 128 the amino acid residue is D;
(z) at position 140 the amino acid residue is M;
(aa) at position 143 the amino acid residue is R; and (ab) at position 144 the amino acid residue is W.
31. The isolated or recombinant polynucleotide of claim 30, wherein of the amino acid residues in the amino acid sequence that correspond to the positions specified in (a) through (ab), at least 80% conform to the amino acid residue restrictions specified in (a) through (ab).
32. The isolated or recombinant polynucleotide of any of claims 17-31, wherein the amino acid sequence comprises one or more amino acid residues selected from the group consisting of: (a) at position 41 the amino acid residue is H;
(b) at position 138 the amino acid residue is H;
(c) at position 34 the amino acid residue is N; and (e) at position 55 the amino acid residue is S.
33. An isolated or recombinant polynucleotide encoding a protein having glyphosaterN-acetylfransferase activity, said polynucleotide selected from the group consisting of: SEQ ED NO: 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565, 566, 567, 620, 622, 624, 626, 628, 630, 632, 634, 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, 668, 670, 672, 674, 676, 678, 680, 682, 684, 686, 688, 690, 692, 694, 696, 698, 700, 702, 704, 706, 708, 710, 712, 714, 716, 718, 720, 722, 724, 726, 728, 730, 732, 734, 736, 738, 740, 742, 744, 746, 748, 750, 752, 754, 756, 758, 760, 762, 764, 768, 770, 772, 774, 776, 778, 780, 782, 784, 786, 788, 790, 792, 794, 796, 798, 800, 802, 804, 806, 808, 810, 812, 814, 816, 818, 820, 822, 824, 832, 834, 836, 838, 840, 842, 844, 846, 848, 850, 852, 854, 856, 858, 860, 862, 864, 866, 868, 870, 872, 874, 876, 878, 880, 882, 884, 886, 888, 890, 892, 894, 896, 898, 900, 902, 904, 906, 908, 910, 912, 914, 916, 918, 920, 922, 924, 926, 928, 930, 932, 933, 934, 935, 936, 937, 938, 939, 940, 941, 942, 943, 944, 945, 947, 949, 951, and 952.
34. The isolated or recombinant polynucleotide of claim 1, wherein a polypeptide encoded by said polynucleotide has glyphosate-N-acetyltransferase activity.
35. The isolated or recombinant polynucleotide of claim 1, wherein a polypeptide encoded by said polynucleotide has glyphosate-N-acyltransferase activity.
36. The isolated or recombinant polynucleotide of claim 34, wherein the polypeptide catalyzes the acetylation of glyphosate with a kcat/Km of at least 10 mM"1 min"1 for glyphosate.
37. The isolated or recombinant polynucleotide of claim 34, wherein the polypeptide catalyzes the acetylation of aminomethylphosphonic acid.
38. A nucleic acid construct comprising the isolated or recombinant polynucleotide of claim 1.
39. The nucleic acid construct of claim 38, comprising a promoter operably linked to the isolated or recombinant polynucleotide of claim 1, where the promoter is heterologous with respect to said polynucleotide and effective to cause sufficient expression of the encoded polypeptide to enhance the glyphosate tolerance of a plant cell transformed with said nucleic acid construct.
40. The nucleic acid construct of claim 38, wherein the isolated or recombinant polynucleotide sequence of claim 1 functions as a selectable marker.
41. The nucleic acid construct of claim 38, wherein the construct is a vector.
42. The vector of claim 41 further comprising a second polynucleotide sequence encoding a second polypeptide that confers a detectable phenotypic trait upon a cell or organism expressing the second polypeptide at an effective level.
43. The vector of claim 42, wherein the detectable phenotypic trait functions as selectable marker.
44. The vector of claim 42, wherein the detectable phenotypic trait is selected from the group consisting of herbicide resistance and a visible marker.
45. The vector of claim 41, wherein the vector further comprises a
T-DNA sequence.
46. The vector of claim 41, wherein the isolated or recombinant polynucleotide is operably linked to a regulatory sequence.
47. The vector of claim 41, wherein the vector is a plant transformation vector.
48. A cell comprising at least one isolated or recombinant polynucleotide of claim 1, wherein the polynucleotide is heterologous to the cell.
49. The cell of claim 48, wherein the isolated or recombinant polynucleotide is operably linked to a regulatory sequence.
50. A cell transduced by the vector of claim 41.
51. The cell of claim 49 or 50, wherein the cell is a transgenic plant cell.
52. The transgenic plant cell of claim 51, wherein the transgenic plant cell expresses an exogenous polypeptide with glyphosate-N-acetyltransferase activity.
53. The transgenic plant cell of claim 51, wherein the transgenic plant cell expresses an exogenous polypeptide with glyphosate-N-acyltransferase activity.
54. A transgenic plant or transgenic plant explant comprising the cell of claim 52.
55. A transgenic plant or transgenic plant explant comprising the cell of claim 53.
56. The transgenic plant or transgenic plant explant of claim 54, wherein said plant or plant explant expresses a polypeptide with glyphosate-N- acetyltransferase activity.
57. The transgenic plant or transgenic plant explant of claim 55, wherein said plant or plant explant expresses a polypeptide with glyphosate-N- acyltransferase activity.
58. The transgenic plant or transgenic plant explant of claim 56, wherein said plant or plant explant is a crop plant selected from the group of genera consisting of: Eleusine, Lollium, Bambusa, Brassica, Dactylis, Sorghum, Pennisetum, Zea, Oryza, Triticum, Secale, Avena, Hordeum, Saccharum, Coix, Glycine and Gossypium.
59. The transgenic plant or transgenic plant explant of claim 57, wherein said plant or plant explant is a crop plant selected from the group of genera consisting of: Eleusine, Lollium, Bambusa, Brassica, Dactylis, Sorghum, Pennisetum, Zea, Oryza, Triticum, Secale, Avena, Hordeum, Saccharum, Coix, Glycine and Gossypium.
60. The transgenic plant or transgenic plant explant of claim 56, wherein said plant or plant explant is Arabidopsis.
61. The transgenic plant or transgenic plant explant of claim 57, wherein said plant or plant explant is Arabidopsis.
62. The transgenic plant or transgenic plant explant of claim 56, wherein said plant or plant explant is Gossypium.
63. The transgenic plant or transgenic plant explant of claim 57, wherein said plant or plant explant is Gossypium.
64. The transgenic plant or transgenic plant explant of claim 56, wherein said plant or plant explant exhibits enhanced resistance to glyphosate as compared to a wild type plant of the same species, strain or cultivar.
65. The transgenic plant or transgenic plant explant of claim 57, wherein said plant or plant explant exhibits enhanced resistance to glyphosate as compared to a wild type plant of the same species, strain or cultivar.
66. A seed produced by the plant of claim 56.
67. A seed produced by the plant of claim 57.
68. A transgenic plant which contains a heterologous gene, said gene selected from the group consisting of SEQ ID NO: 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536,
537, 538, 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552,
553, 554, 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565, 566, 567, 620, 622, 624, 626, 628, 630, 632, 634, 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, 668, 670, 672, 674, 676, 678, 680, 682, 684, 686, 688, 690, 692, 694, 696, 698, 700, 702, 704, 706, 708, 710, 712, 714, 716, 718, 720, 722, 724, 726, 728, 730, 732, 734, 736, 738, 740, 742, 744, 746, 748, 750, 752, 754, 756, 758, 760, 762, 764, 768, 770, 772, 774, 776, 778, 780, 782, 784, 786, 788, 790, 792, 794, 796, 798, 800, 802, 804, 806, 808, 810, 812, 814, 816, 818, 820, 822, 824, 832, 834, 836, 838, 840, 842, 844, 846, 848, 850, 852, 854, 856, 858, 860, 862, 864, 866, 868, 870, 872, 874, 876, 878, 880, 882, 884, 886, 888, 890, 892, 894, 896, 898, 900, 902, 904, 906, 908, 910, 912, 914, 916, 918, 920, 922, 924, 926, 928, 930, 932, 933, 934, 935, 936, 937, 938, 939, 940, 941, 942, 943, 944, 945, 947, 949, 951, and 952 which encodes a glyphosate-N- acetyltransferase having a kcat/Km of at least 10 mM"1 min"1 for glyphosate, wherein the plant exhibits tolerance to glyphosate applied at a level effective to inhibit the growth of the same plant lacking the heterologous gene, without significant yield reduction due to herbicide application.
69. The transgenic plant of claim 68, wherein the glyphosate-N- acetyltransferase catalyzes the acetylation of aminomethylphosphonic acid.
70. An isolated or recombinant polypeptide comprising:
(a) an amino acid sequence that can be optimally aligned with a sequence selected from the group consisting of SEQ ID NO:300, SEQ ED NO:445 and SEQ ED NO:457 to generate a similarity score of at least 460, using the BLOSUM62 matrix, a gap existence penalty of 11, and a gap extension penalty of 1, wherein the amino acid sequence comprises one or more amino acid residues selected from the group consisting of: (i) at positions 18 and 38, a Z5 amino acid residue; (ii) at position 62, a ZI amino acid residue; (iii) at position 124, a Z6 amino acid residue; and (iv) at position 144, a Z2 amino acid residue, wherein: ZI is an amino acid residue selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid residue selected from the group consisting of F, W, and Y; Z5 is an amino acid residue selected from the group consisting of D and E; Z6 is an amino acid residue selected from the group consisting of C, G, and P; or (b) a complementary nucleotide sequence thereof.
71. The isolated or recombinant polypeptide of claim 70, wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 90% conform to the following restrictions: (a) at positions 2, 4, 15, 19, 26, 28, 31, 45, 51, 54, 86, 90, 91, 97, 103, 105, 106, 114, 123, 129, 139, and/or 145 the amino acid residue is Bl; and (b) at positions 3, 5, 8, 10, 11, 14, 17, 24, 27, 32, 37, 47, 48, 49, 52, 57, 58, 61, 63, 68, 69, 79, 80, 82, 83, 89, 92, 100, 101, 104, 119, 120, 125, 126, 128, 131, and/or 143 the amino acid residue is B2; wherein Bl is an amino acid selected from the group consisting of A, I, L, M, F, W, Y, and V; and B2 is an amino acid selected from the group consisting of R, N, D, C, Q, E, G, H, K, P, S, and T.
72. The isolated or recombinant polypeptide of claim 70, wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 80% conform to the following restrictions:
(a) at positions 2, 4, 15, 19, 26, 28, 51, 54, 86, 90, 91, 97, 103, 105, 106, 114, 129, 139, and/or 145 the amino acid residue is ZI;
(b) at positions 31 and/or 45 the amino acid residue is Z2; (c) at position 8 the amino acid residue is Z3;
(d) at position 89 the amino acid residue is Z3 or Z6;
(e) at positions 82, 92, 101 and/or 120 the amino acid residue is Z4; (f) at positions 3, 11, 27 and/or 79 the amino acid residue is Z5;
(g) at position 18 the amino acid residue is Z4 or Z5; (h) at position 123 the amino acid residue is ZI or Z2;
(i) at positions 12, 33, 35, 39, 53, 59, 112, 132, 135, 140, and/or 146 the amino acid residue is ZI or Z3;
(j) at position 30 the amino acid residue is ZI ;
(k) at position 6 the amino acid residue is Z6;
(1) at position 81 the amino acid residue is Z2 or Z4;
(m) at position 113 the amino acid residue is Z3; (n) at position 138 the amino acid residue is Z4;
(o) at position 142 the amino acid residue is Z2;
(p) at positions 57 and/or 126 the amino acid residue is Z3 or Z4;
(q) at position 5, 17, and 61 the amino acid residue is Z4;
(r) at position 24 the amino acid residue is Z3; (s) at position 104 the amino acid residue is Z5;
(t) at positions 52, and/or 69 the amino acid residue is Z3;
(u) at positions 14 and/or 119 the amino acid residue is Z5;
(v) at positions 10, 32, 63, and/or 83 the amino acid residue is Z5 ;
(w) at positions 48 and/or 80 the amino acid residue is Z6; (x) at position 40 the amino acid residue is ZI or Z2 ;
(y) at position 96 the amino acid residue is Z3 or Z5;
(z) at position 65 the amino acid residue is Z3, Z4, or Z6;
(aa) at positions 84 and/or 115 the amino acid residue is Z3;
(ab) at position 93. the amino acid residue is Z4; (ac) at position 130 the amino acid residue is Z2;
(ad) at position 58 the amino acid residue is Z3, Z4 or Z6;
(ae) at position 47 the amino acid residue is Z4 or Z6;
(af) at positions 49 and/or 100 the amino acid residue is Z3 or Z4;
(ag) at position 68 the amino acid residue is Z4 or Z5; (ah) at position 143 the amino acid residue is Z4;
(ai) at position 131 the amino acid residue is Z5;
(aj) at positions 125 and/or 128 the amino acid residue is Z5; (ak) at position 67 the amino acid residue is Z3 or Z4;
(al) at position 60 the amino acid residue is Z5; and
(am) at position 37 the amino acid residue is Z4 or Z6; wherein ZI is an amino acid selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid selected from the group consisting of F, W, and Y; Z3 is an amino acid selected from the group consisting of N, Q, S, and T; Z4 is an amino acid selected from the group consisting of R, H, and K; Z5 is an amino acid selected from the group consisting of D and E; and Z6 is an amino acid selected from the group consisting of C, G, and P.
73. The isolated or recombinant polypeptide of claim 72, wherein of the amino acid residues in the amino acid sequence that correspond to the positions specified in (a) - (am), at least 90%> conform to the amino acid residue restrictions specified in (a) - (am).
74. The isolated or recombinant polypeptide of any of claims 70-73, wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 90% conform to the following restrictions: (a) at positions 1, 7, 9, 13, 20, 36, 42, 46, 50, 56, 64, 70, 72, 75, 76, 78, 94, 98, 107, 110, 117, 118, 121, and/or 141 the amino acid residue is Bl; and (b) at positions 16, 21, 22, 23, 25, 29, 34, 41, 43, 44, 55, 66, 71, 73, 74, 77, 85, 87, 88, 95, 99, 102, 108, 109, 111, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is B2; wherein Bl is an amino acid selected from the group consisting of A, I, L, M, F, W, Y, and V; and B2 is an amino acid selected from the group consisting of R, N, D, C, Q, E, G, H, K, P, S, and T.
75. The isolated or recombinant polypeptide of any of claims 70-73, wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 90%> conform to the following restrictions: , (a) at positions 1, 7, 9, 13, 20, 42, 46, 50, 56, 64, 70, 72, 75, 76, 78, 94, 98, 107, 110, 117, 118, 121, and/or 141 the amino acid residue is Bl; and
(b) at positions 16, 21, 22, 23, 25, 29, 34, 36, 41, 43, 44, 55, 66, 71, 73, 74, 77, 85, 87, 88, 95, 99,102, 108, 109, 111, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is B2; wherein Bl is an amino acid selected from the group consisting of A, I, L, M, F, W, Y, and V; and B2 is an amino acid selected from the group consisting of R, N, D, C, Q, E, G, H, K, P, S, and T. 76. The isolated or recombinant polypeptide of any of claims 70-
73, wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 90% conform to the following restrictions: (a) at positions 1, 7, 9, 20, 42, 50, 72, 75,
76, 78, 94, 98, 110, 121, and/or 141 the amino acid residue is ZI; (b) at positions 13, 46, 56, 70, 107, 117, and/or 118 the amino acid residue is Z2;
(c) at positions 23, 55, 71, 77, 88, and/or 109 the amino acid residue is Z3;
(d) at positions 16, 21, 41, 73, 85, 99, and/or 111 the amino acid residue is Z4;
(e) at positions 34 and/or 95 the amino acid residue is Z5;
(f) at position 22, 25, 29, 43, 44, 66, 74, 87, 102, 108, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is Z6; wherein ZI is an amino acid selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid selected from the group consisting of F, W, and Y; Z3 is an amino acid selected from the group consisting of N, Q, S, and T; Z4 is an amino acid selected from the group consisting of R, H, and K; Z5 is an amino acid selected from the group consisting of D and E; and Z6 is an amino acid selected from the group consisting of C, G, and P.
77. The isolated or recombinant polypeptide of claim 76, wherein the amino acid sequence comprises at position 36, an amino acid residue selected from the group consisting of ZI and Z3.
78. The isolated or recombinant polypeptide of claim 76 or claim 77, wherein the amino acid sequence comprises at position 64, an amino acid residue selected from the group consisting of ZI and Z2.
79. The isolated or recombinant polypeptide of any claims 70-
78 wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 80%> conform to the following restrictions:
(a) at position 2 the amino acid residue is I or L;
(b) at position 3 the amino acid residue is E; (c) at position 4 the amino acid residue is V or I;
(d) at position 5 the amino acid residue is K;
(e) at position 6 the amino acid residue is P;
(f) at position 8 the amino acid residue is N;
(g) at position 10 the amino acid residue is E; (h) at position 11 the amino acid residue is D or E;
(i) at position 12 the amino acid residue is T;
(j) at position 14 the amino acid residue is E or D;
(k) at position 15 the amino acid residue is L;
(1) at position 17 the amino acid residue is H; (m) at position 18 the amino acid residue is R, E or K;
(n) at position 19 the amino acid residue is I or V;
(o) at position 24 the amino acid residue is Q;
(p) at position 26 the amino acid residue is M, L, V or I;
(q) at position 27 the amino acid residue is E; (r) at position 28 the amino acid residue is A or V;
(s) at position 30 the amino acid residue is M;
(t) at position 31 the amino acid residue is Y or F;
(u) at position 32 the amino acid residue is E or D;
(v) at position 33 the amino acid residue is T or S; (w) at position 35 the amino acid residue is L;
(x) at position 37 the amino acid residue is R, G, E or Q;
(y) at position 39 the amino acid residue is A or S; (z) at position 40 the amino acid residue is F or L ;
(aa) at position 45 the amino acid residue is Y or F;
(ab) at position 47 the amino acid residue is R or G;
(ac) at position 48 the amino acid residue is G; (ad) at position 49 the amino acid residue is K, R, or Q;
(ae) at position 51 the amino acid residue is I or V;
(af) at position 52 the amino acid residue is S;
(ag) at position 53 the amino acid residue is I or V; (ah) at position 54 the amino acid residue is A; (ai) at position 57 the amino acid residue is H or N;
(aj) at position 58 the amino acid residue is Q, K, R or P;
(ak) at position 59 the amino acid residue is A;
(al) at position 60 the amino acid residue is E;
(am) at position 61 the amino acid residue is H or R; (an) at position 63 the amino acid residue is E or D;
(ao) at position 65 the amino acid residue is E, P or Q;
(ap) at position 67 the amino acid residue is Q or R;
(aq) at position 68 the amino acid residue is K or E;
(ar) at position 69 the amino acid residue is Q; (as) at position 79 the amino acid residue is E;
(at) at position 80 the amino acid residue is G;
(au) at position 81 the amino acid residue is Y, H or F;
(av) at position 82 the amino acid residue is R;
(aw) at position 83 the amino acid residue is E or D; (ax) at position 84 the amino acid residue is Q;
(ay) at position 86 the amino acid residue is A;
(az) at position 89 the amino acid residue is G, T or S;
(ba) at position 90 the amino acid residue is L;
(bb) at position 91 the amino acid residue is L, I or V; (be) at position 92 the amino acid residue is R or K;
(bd) at position 93 the amino acid residue is H;
(be) at position 96 the amino acid residue is E or Q; (bf) at position 97 the amino acid residue is I;
(bg) at position 100 the amino acid residue is K or N ; (bh) at position 101 the amino acid residue is K or R; (bi) at position 103 the amino acid residue is A or V; (bj) at position 104 the amino acid residue is D;
(bk) at position 105 the amino acid residue is M, L or I;
(bl) at position 106 the amino acid residue is L;
(bm) at position 112 the amino acid residue is T or A;
(bn) at position 113 the amino acid residue is S or T; (bo) at position 114 the amino acid residue is A;
(bp) at position 115 the amino acid residue is S;
(bq) at position 119 the amino acid residue is K or R;
(br) at position 120 the amino acid residue is K or R;
(bs) at position 123 the amino acid residue is F or L; (bt) at position 125 the amino acid residue is E;
(bu) at position 126 the amino acid residue is Q or H;
(bv) at position 128 the amino acid residue is E or D;
(bw) at position 129 the amino acid residue is V or I;
(bx) at position 130 the amino acid residue is F; (by) at position 131 the amino acid residue is D or E;
(bz) at position 132 the amino acid residue is T;
(ca) at position 135 the amino acid residue is V;
(cb) at position 138 the amino acid residue is H;
(cc) at position 139 the amino acid residue is I; (cd) at position 140 the amino acid residue is L or M;
(ce) at position 142 the amino acid residue is Y;
(cf) at position 143 the amino acid residue is K or R; (eg) at position 145 the amino acid residue is L or I; and (ch) at position 146 the amino acid residue is T.
80. The isolated or recombinant polypeptide of claim 79, wherein of the amino acid residues in the amino acid sequence that correspond to the positions specified in (a) - (ch), at least 90%> conform to the amino acid residue restrictions specified in (a) - (ch).
81. The isolated or recombinant polypeptide of claim 70, wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 80% conform to the following restrictions:
(a) at positions 9, 76, 94 and 110 the amino acid residue is A;
(b) at positions 29 and 108 the amino acid residue is C;
(c) at position 34 the amino acid residue is D; (d) at position 95 the amino acid residue is E;
(e) at position 56 the amino acid residue is F;
(f) at positions 43, 44, 66, 74, 87, 102, 116, 122, 127 and 136 the amino acid residue is G;
(g) at position 41 the amino acid residue is H; (h) at position 7 the amino acid residue is I;
(i) at position 85 the amino acid residue is K;
(j) at positions 20, 42, 50, 78 and 121 the amino acid residue is L;
(k) at positions 1 and 141 the amino acid residue is M;
(1) at positions 23 and 109 the amino acid residue is N; (m) at positions 22, 25, 133, 134 and 137 the amino acid residue is P;
(n) at position 71 the amino acid residue is Q;
(o) at positions 16, 21, 73, 99 and 111 the amino acid residue is R;
(p) at position 55 the amino acid residue is S;
(q) at position 77 the amino acid residue is T; (r) at position 107 the amino acid residue is W; and
(s) at position 13, 46, 70 and 118 the amino acid residue is Y.
82. The isolated or recombinant polypeptide of any of claims 70-81, wherein the amino acid sequence comprises an amino acid residue selected from the group consisting of:
(a) at position 36 the amino acid residue is M, L, or T;
(b) at position 72 the amino acid residue is L or I; (c) at position 75 the amino acid residue is M or V;
(d) at position 64 the amino acid residue is L, I, or F;
(e) at position 88 the amino acid residue is T or S;
(f) at position 117 the amino acid residue is Y or F.
83. The isolated or recombinant polypeptide of any of claims 70-83, wherein the amino acid sequence comprises one or more amino acid residues selected from the group consisting of:
(a) at position 14 the amino acid residue is D; (b) at position 18 the amino acid residue is E;
(c) at position 26 the amino acid residue is M or V;
(e) at position 30 the amino acid residue is I;
(f) at position 32 the amino acid residue is D;
(g) at position 36 the amino acid residue is M or T; (i) at position 37 the amino acid residue is C;
(j) at position 38 the amino acid residue is D;
(j) at position 53 the amino acid residue is V;
(k) at position 58 the amino acid residue is R;
(1) at position 61 the amino acid residue is R; (m) at position 62 the amino acid residue is L;
(n) at position 64 the amino acid residue is I or F;
(o) at position 65 the amino acid residue is P;
(p) at position 72 the amino acid residue is I;
(q) at position 75 the amino acid residue is V; (r) at position 88 the amino acid residue is T;
(s) at position 89 the amino acid residue is G;
(t) at position 91 the amino acid residue is L;
(u) at position 98 the amino acid residue is I;
(v) at position 105 the amino acid residue I; (w) at position 112 the amino acid residue is A;
(x) at position 124 the amino acid residue is G or C;
(y) at position 128 the amino acid residue is D; (z) at position 140 the amino acid residue is M;
(aa) at position 143 the amino acid residue is R; and
(ab) at position 144 the amino acid residue is W.
84. The isolated or recombinant polypeptide of claim 84, wherein of the amino acid residues in the amino acid sequence that correspond to the positions specified in (a) through (ab), at least 80%o conform to the amino acid residue restrictions specified in (a) through (ab).
85. The isolated or recombinant polypeptide of any of claims 70-84, wherein the amino acid sequence comprises one or more amino acid residues selected from the group consisting of:
(a) at position 41 the amino acid residue is H;
(b) at position 138 the amino acid residue is H; (c) at position 34 the amino acid residue is N; and
(f) at position 55 the amino acid residue is S.
86. An isolated or recombinant polypeptide comprising an amino acid sequence selected from the group consisting of: (a) an amino acid sequence that is at least 98% identical to SEQ ID
NO:577; (b) an amino acid sequence that is at least 97% identical to SEQ BD NO:578; (c) ' an amino acid sequence that is at least 97%> identical to SEQ BD NO:621; (d) an amino acid sequence that is at least 98% identical to SEQ BD NO:579; (e) an amino acid sequence that is at least 98% identical to SEQ BD NO:602; (f) an amino acid sequence that is at least 95%> identical to SEQ ID
NO:697; (g) an amino acid sequence that is at least 96%> identical to SEQ ID NO:721; (h) an amino acid sequence that is at least 97% identical to SEQ BD NO:613; (i) an amino acid sequence that is at least 89%> identical to SEQ BD
NO:677; (j) an amino acid sequence that is at least 96% identical to SEQ BD NO:584; (k) an amino acid sequence that is at least 98%> identical to SEQ ED NO:707; (1) an amino acid sequence that is at least 98% identical to SEQ ED NO:616; (m) an amino acid sequence that is at least 96% identical to SEQ ED NO:612; (n) an amino acid sequence that is at least 98% identical to SEQ ED
NO:590; (o) an amino acid sequence that is complementary to any one of (a) through (n).
87. The isolated or recombinant polypeptide of claim 86, wherein the amino acid sequence comprises one or more amino acid residues selected from the group consisting of: (i) at positions 18 and 38, a Z5 amino acid residue; (ii) at position 62, a ZI amino acid residue; (iii) at position 124, a Z6 amino acid residue; and (iv) at position 144, a Z2 amino acid residue, wherein: ZI is an amino acid residue selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid residue selected from the group consisting of F, W, and Y; Z5 is an amino acid residue selected from the group consisting of D and E; Z6 is an amino acid residue selected from the group consisting of C, G, and P.
88. The isolated or recombinant polypeptide of claim 86, wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 90%> conform to the following restrictions:
(a) at positions 2, 4, 15, 19, 26, 28, 31, 45, 51, 54, 86, 90, 91, 97, 103, 105, 106, 114, 123, 129, 139, and/or 145 the amino acid residue is Bl; and (b) at positions 3, 5, 8, 10, 11, 14, 17, 24, 27, 32, 37, 47, 48, 49, 52, 57, 58, 61, 63, 68, 69, 79, 80, 82, 83, 89, 92, 100, 101, 104, 119, 120, 125, 126, 128, 131, and/or 143 the amino acid residue is B2; wherein Bl is an amino acid selected from the group consisting of A, I, L, M, F, W, Y, and V; and B2 is an amino acid selected from the group consisting of R, N, D, C, Q, E, G, H, K, P, S, and T.
89. The isolated or recombinant polypeptide of claim 86, wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 80% conform to the following restrictions: (a) at positions 2, 4, 15, 19, 26, 28, 51, 54, 86, 90, 91, 97, 103, 105, 106, 114, 129, 139, and/or 145 the amino acid residue is ZI;
(b) at positions 31 and/or 45 the amino acid residue is Z2;
(c) at position 8 the amino acid residue is Z3;
(d) at position 89 the amino acid residue is Z3 or Z6; (e) at positions 82, 92, 101 and/or 120 the amino acid residue is Z4;
(f) at positions 3, 11, 27 and/or 79 the amino acid residue is Z5;
(g) at position 18 the amino acid residue is Z4 or Z5; (h) at position 123 the amino acid residue is ZI or Z2;
(i) at positions 12, 33, 35, 39, 53, 59, 112, 132, 135, 140, and/or 146 the amino acid residue is ZI or Z3;
(j) at position 30 the amino acid residue is ZI; (k) at position 6 the amino acid residue is Z6; (1) at position 81 the amino acid residue is Z2 or Z4;
(m) at position 113 the amino acid residue is Z3;
(n) at position 138 the amino acid residue is Z4;
(o) at position 142 the amino acid residue is Z2; (p) at positions 57 and/or 126 the amino acid residue is Z3 or Z4;
(q) at position 5, 17, and 61 the amino acid residue is Z4;
(r) at position 24 the amino acid residue is Z3;
(s) at position 104 the amino acid residue is Z5;
(t) at positions 52, and/or 69 the amino acid residue is Z3; (u) at positions 14 and/or 119 the amino acid residue is Z5;
(v) at positions 10, 32, 63, and/or 83 the amino acid residue is Z5 ;
(w) at positions 48 and/or 80 the amino acid residue is Z6;
(x) at position 40 the amino acid residue is ZI or Z2 ;
(y) at position 96 the amino acid residue is Z3 or Z5; (z) at position 65 the amino acid residue is Z3, Z4, or Z6;
(aa) at positions 84 and/or 115 the amino acid residue is Z3;
(ab) at position 93 the amino acid residue is Z4;
(ac) at position 130 the amino acid residue is Z2;
(ad) at position 58 the amino acid residue is Z3, Z4 or Z6; (ae) at position 47 the amino acid residue is Z4 or Z6;
(af) at positions 49 and/or 100 the amino acid residue is Z3 or Z4;
(ag) at position 68 the amino acid residue is Z4 or Z5; (ah) at position 143 the amino acid residue is Z4;
(ai) at position 131 the amino acid residue is Z5; (aj) at positions 125 and/or 128 the amino acid residue is Z5; (ak) at position 67 the amino acid residue is Z3 or Z4; (al) at position 60 the amino acid residue is Z5; and (am) at position 37 the amino acid residue is Z4 or Z6; wherein ZI is an amino acid selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid selected from the group consisting of F, W, and Y; Z3 is an amino acid selected from the group consisting of N, Q, S, and T; Z4 is an amino acid selected from the group consisting of R, H, and K; Z5 is an amino acid selected from the group consisting of D and E; and Z6 is an amino acid selected from the group consisting of C, G, and P.
90. The isolated or recombinant polypeptide of claim 89, wherein of the amino acid residues in the amino acid sequence that correspond to the positions specified in (a) - (am), at least 90% conform to the amino acid residue restrictions specified in (a) - (am).
91. The isolated or recombinant polypeptide of any of claims 86-90, wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 90%> conform to the following restrictions:
(a) at positions 1, 7, 9, 13, 20, 36, 42, 46, 50, 56, 64, 70, 72, 75, 76, 78, 94, 98, 107, 110, 117, 118, 121, and/or 141 the amino acid residue is Bl; and
(b) at positions 16, 21, 22, 23, 25, 29, 34, 41, 43, 44, 55, 66, 71, 73, 74, 77, 85, 87, 88, 95, 99, 102, 108, 109, 111, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is B2; wherein Bl is an amino acid selected from the group consisting of A, I, L, M, F, W, Y, and V; and B2 is an amino acid selected from the group consisting of R, N, D, C, Q, E, G, H, K, P, S, and T.
92. The isolated or recombinant polypeptide of any of claims 86-91 wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 90%o conform to the following restrictions: (a) at positions 1, 7, 9, 13, 20, 42, 46, 50, 56, 64, 70, 72, 75, 76, 78, 94, 98, 107, 110, 117, 118, 121, and/or 141 the amino acid residue is Bl; and (b) at positions 16, 21, 22, 23, 25, 29, 34, 36, 41, 43, 44, 55, 66, 71, 73, 74, 77, 85, 87, 88, 95, 99,102, 108, 109, 111, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is B2; wherein Bl is an amino acid selected from the group consisting of A, I, L, M, F, W, Y, and V; and B2 is an amino acid selected from the group consisting of R, N, D, C, Q, E, G, H, K, P, S, and T.
93. The isolated or recombinant polypeptide of any of claims 86-91 wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 90%> conform to the following restrictions: (a) at positions 1, 7, 9, 20, 42, 50, 72, 75, 76, 78, 94, 98, 110, 121, and/or 141 the amino acid residue is ZI;
(b) at positions 13, 46, 56, 70, 107, 117, and/or 118 the amino acid residue is Z2;
(c) at positions 23, 55, 71, 77, 88, and/or 109 the amino acid residue is Z3;
(d) at positions 16, 21, 41, 73, 85, 99, and/or 111 the amino acid residue is Z4; (e) at positions 34 and/or 95 the amino acid residue is Z5;
(f) at position 22, 25, 29, 43, 44, 66, 74, 87, 102, 108, 116, 122, 127, 133, 134,
136, and/or 137 the amino acid residue is Z6; wherein ZI is an amino acid selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid selected from the group consisting of F, W, and Y; Z3 is an amino acid selected from the group consisting of N, Q, S, and T; Z4 is an amino acid selected from the group consisting of R, H, and K; Z5 is an amino acid selected from the group consisting of D and E; and Z6 is an amino acid selected from the group consisting of C, G, and P.
94. The isolated or recombinant polypeptide of claim 93, wherein the amino acid sequence comprises at position 36, an amino acid residue selected from the group consisting of ZI and Z3.
95. The isolated or recombinant polypeptide of claim 93 or claim 94, wherein the amino acid sequence comprises at position 64, an amino acid residue selected from the group consisting of ZI and Z2.
96. The isolated or recombinant polypeptide of any claims 86- 95 wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 80% conform to the following restrictions:
(a) at position 2 the amino acid residue is I or L;
(b) at position 3 the amino acid residue is E; (c) at position 4 the amino acid residue is V or I;
(d) at position 5 the amino acid residue is K;
(e) at position 6 the amino acid residue is P;
(f) at position 8 the amino acid residue is N; (g) at position 10 the amino acid residue is E;
(h) at position 11 the amino acid residue is D or E;
(i) at position 12 the amino acid residue is T;
(j) at position 14 the amino acid residue is E or D;
(k) at position 15 the amino acid residue is L; (1) at position 17 the amino acid residue is H;
(m) at position 18 the amino acid residue is R, E or K;
(n) at position 19 the amino acid residue is I or V;
(o) at position 24 the amino acid residue is Q;
(p) at position 26 the amino acid residue is M, L, V or I; (q) at position 27 the amino acid residue is E;
(r) at position 28 the amino acid residue is A or V;
(s) at position 30 the amino acid residue is M;
(t) at position 31 the amino acid residue is Y or F;
(u) at position 32 the amino acid residue is E or D; (v) at position 33 the amino acid residue is T or S;
(w) at position 35 the amino acid residue is L;
(x) at position 37 the amino acid residue is R, G, E or Q; (y) at position 39 the amino acid residue is A or S;
(z) at position 40 the amino acid residue is F or L ; (aa) at position 45 the amino acid residue is Y or F;
(ab) at position 47 the amino acid residue is R or G;
(ac) at position 48 the amino acid residue is G;
(ad) at position 49 the amino acid residue is K, R, or Q;
(ae) at position 51 the amino acid residue is I or V; (af) at position 52 the amino acid residue is S;
(ag) at position 53 the amino acid residue is I or V; (ah) at position 54 the amino acid residue is A; (ai) at position 57 the amino acid residue is H or N;
(aj) at position 58 the amino acid residue is Q, K, R or P;
(ak) at position 59 the amino acid residue is A;
(al) at position 60 the amino acid residue is E; (am) at position 61 the amino acid residue is H or R;
(an) at position 63 the amino acid residue is E or D;
(ao) at position 65 the amino acid residue is E, P or Q;
(ap) at position 67 the amino acid residue is Q or R;
(aq) at position 68 the amino acid residue is K or E; (ar) at position 69 the amino acid residue is Q;
(as) at position 79 the amino acid residue is E;
(at) at position 80 the amino acid residue is G;
(au) at position 81 the amino acid residue is Y, H or F;
(av) at position 82 the amino acid residue is R; (aw) at position 83 the amino acid residue is E or D;
(ax) at position 84 the amino acid residue is Q;
(ay) at position 86 the amino acid residue is A;
(az) at position 89 the amino acid residue is G, T or S;
(ba) at position 90 the amino acid residue is L; (bb) at position 91 the amino acid residue is L, I or V;
(be) at position 92 the amino acid residue is R or K;
(bd) at position 93 the amino acid residue is H;
(be) at position 96 the amino acid residue is E or Q;
(bf) at position 97 the amino acid residue is I; (bg) at position 100 the amino acid residue is K or N ;
(bh) at position 101 the amino acid residue is K or R;
(bi) at position 103 the amino acid residue is A or V;
(bj) at position 104 the amino acid residue is D;
(bk) at position 105 the amino acid residue is M, L or I; (bl) at position 106 the amino acid residue is L;
(bm) at position 112 the amino acid residue is T or A;
(bn) at position 113 the amino acid residue is S or T; (bo) at position 114 the amino acid residue is A; (bp) at position 115 the amino acid residue is S; (bq) at position 119 the amino acid residue is K or R; (br) at position 120 the amino acid residue is K or R; (bs) at position 123 the amino acid residue is F or L; (bt) at position 125 the amino acid residue is E; (bu) at position 126 the amino acid residue is Q or H; (bv) at position 128 the amino acid residue is E or D; (bw) at position 129 the amino acid residue is V or I; (bx) at position 130 the amino acid residue is F;
(by) at position 131 the amino acid residue is D or E; (bz) at position 132 the amino acid residue is T;
(ca) at position 135 the amino acid residue is V;
(cb) at position 138 the amino acid residue is H; (cc) at position 139 the amino acid residue is I;
(cd) at position 140 the amino acid residue is L or M;
(ce) at position 142 the amino acid residue is Y;
(cf) at position 143 the amino acid residue is K or R; (eg) at position 145 the amino acid residue is L or I; and (ch) at position 146 the amino acid residue is T.
97. The isolated or recombinant polypeptide of claim 96, wherein of the amino acid residues in the amino acid sequence that correspond to the positions specified in (a) - (ch), at least 90% conform to the amino acid residue restrictions specified in (a) - (ch).
98. The isolated or recombinant polypeptide of claim 86, wherein of the amino acid residues in the amino acid sequence that correspond to the following positions, at least 80%> conform to the following restrictions: (a) at positions 9, 76, 94 and 110 the amino acid residue is A;
(b) at positions 29 and 108 the amino acid residue is C;
(c) at position 34 the amino acid residue is D; (d) at position 95 the amino acid residue is E;
(e) at position 56 the amino acid residue is F;
(f) at positions 43, 44, 66, 74, 87, 102, 116, 122, 127 and 136 the amino acid residue is G;
(g) at position 41 the amino acid residue is H; (h) at position 7 the amino acid residue is I; (i) at position 85 the amino acid residue is K;
(j) at positions 20, 42, 50, 78 and 121 the amino acid residue is L;
(k) at positions 1 and 141 the amino acid residue is M;
(1) at positions 23 and 109 the amino acid residue is N;
(m) at positions 22, 25, 133, 134 and 137 the amino acid residue is P;
(n) at position 71 the amino acid residue is Q;
(o) at positions 16, 21, 73, 99 and 111 the amino acid residue is R;
(p) at position 55 the amino acid residue is S;
(q) at position 77 the amino acid residue is T;
(r) at position 107 the amino acid residue is W; and
(s) at position 13, 46, 70 and 118 the amino acid residue is Y.
99. The isolated or recombinant polypeptide of any of claims 70-98, wherein the amino acid sequence comprises one or more amino acid residues selected from the group consisting of:
(a) at position 14 the amino acid residue is D;
(b) at position 18 the amino acid residue is E;
(c) at position 26 the amino acid residue is M or V; (e) at position 30 the amino acid residue is I;
(f) at position 32 the amino acid residue is D;
(g) at position 36 the amino acid residue is M or T; (i) at position 37 the amino acid residue is C;
(j) at position 38 the amino acid residue is D; (j) at position 53 the amino acid residue is V;
(k) at position 58 the amino acid residue is R;
(1) at position 61 the amino acid residue is R; (m) at position 62 the amino acid residue is L;
(n) at position 64 the amino acid residue is I or F;
(o) at position 65 the amino acid residue is P;
(P) at position 72 the amino acid residue is I;
(q) at position 75 the amino acid residue is V;
(r) at position 88 the amino acid residue is T;
(s) at position 89 the amino acid residue is G;
(t) at position 91 the amino acid residue is L;
( ) at position 98 the amino acid residue is I;
(V) at position 105 the amino acid residue I;
(w) at position 112 the amino acid residue is A;
(x) at position 124 the amino acid residue is G or C;
(y) at position 128 the amino acid residue is D;
(z) at position 140 the amino acid residue is M;
(aa) at position 143 the amino acid residue is R; and
(ab) at position 144 the amino acid residue is W.
100. The isolated or recombinant polypeptide of claim 99, wherein of the amino acid residues in the amino acid sequence that correspond to the positions specified in (a) through (ab), at least 80% conform to the amino acid residue restrictions specified in (a) through (ab).
101. The isolated or recombinant polypeptide of any of claims 86-100, wherein the amino acid sequence comprises one or more amino acid residues selected from the group consisting of:
(a) at position 41 the amino acid residue is H;
(b) at position 138 the amino acid residue is H;
(c) at position 34 the amino acid residue is N; and (e) at position 55 the amino acid residue is S.
102. The isolated or recombinant polypeptide of claim 69, wherein the polypeptide catalyzes the acetylation of glyphosate with a kCat/Km of at least 10 mM"1 min"1 for glyphosate.
103. The isolated or recombinant polypeptide of claim 102, wherein the polypeptide catalyzes the acetylation of glyphosate with a kcat/Km of at least 100 mM"1 min" for glyphosate.
104. The isolated or recombinant polypeptide of claim 103, wherein the polypeptide catalyzes the acetylation of aminomethylphosphonic acid.
105. A polypeptide having GAT activity said polypeptide selected from the group consisting of SEQ BD NO: 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 621, 623, 625, 627, 629, 631, 633, 635, 637, 639, 641, 643, 645, 647, 649, 651, 653, 655, 657, 659, 661, 663, 665, 667, 669, 671, 673, 675, 677, 679, 681, 683, 685, 687, 689, 691, 693, 695, 697, 699,701, 703, 705, 707, 709, 711, 713, 715, 717, 719,721, 723, 725, 727, 729, 731, 733, 735, 737, 739, 741, 743, 745, 747, 749, 751, 753, 755, 757, 759, 761, 763, 765, 767, 769, 771, 773, 775, 777, 779, 781, 783, 785, 787, 789, 791, 793, 795, 797, 799, 801, 803, 805, 807, 809, 811, 813, 815, 817, 819, 821, 823, 825, 833, 835, 837, 839, 841, 843, 845, 847, 849, 851, 853, 855, 857, 859, 861, 863, 865, 867, 869, 871, 873, 875, 877, 879, 881, 883, 885, 887, 889, 891, 893, 895, 897, 899, 901, 903, 905, 907, 909, 911, 913, 915, 917, 919, 921, 923, 925, 927, 929, 931, 953, 954, 955, 956, 957, 958, 959, 960, 961, 962, 963, 964, 965, 966, 967, 968, 969, 970, 971, and 972 characterized by:
(a) a Km for glyphosate of at least about 2 mM or less;
(b) a Km for acetyl CoA of at least about 200 μM or less; and (c) a kca equal to at least about 6/minute.
106. An isolated or recombinant polypeptide having glyphosate-N- acyltransferase activity, said polypeptide selected from the group consisting of: SEQ ID NO: 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 621, 623, 625, 627, 629, 631, 633, 635, 637, 639, 641, 643, 645, 647, 649, 651, 653, 655, 657, 659, 661, 663, 665, 667, 669, 671, 673, 675, 677, 679, 681, 683, 685, 687, 689, 691, 693, 695, 697, 699,701, 703, 705, 707, 709, 711, 713, 715, 717, 719,721, 723, 725, 727, 729, 731, 733, 735, 737, 739, 741, 743, 745, 747, 749, 751, 753, 755, 757, 759, 761, 763, 765, 767, 769, 771, 773, 775, 777, 779, 781, 783, 785, 787, 789, 791, 793, 795, 797, 799, 801, 803, 805, 807, 809, 811, 813, 815, 817, 819, 821, 823, 825, 833, 835, 837, 839, 841, 843, 845, 847, 849, 851, 853, 855, 857, 859, 861, 863, 865, 867, 869, 871, 873, 875, 877, 879, 881, 883, 885, 887, 889, 891, 893, 895, 897, 899, 901, 903, 905, 907, 909, 911, 913, 915, 917, 919, 921, 923, 925, 927, 929, 931, 953, 954, 955, 956, 957, 958, 959, 960, 961, 962, 963, 964, 965, 966, 967, 968, 969, 970, 971, and 972.
107. A method of producing a glyphosate-resistant transgenic plant or plant cell comprising:
(a) transforming a plant or plant cell with a polynucleotide encoding a glyphosate- N-acetyltransferase said polynucleotide selected from the group consisting of SEQ ID NO: 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565, 566, 567, 620, 622, 624, 626, 628, 630, 632, 634, 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, 668, 670, 672, 674, 676, 678, 680, 682, 684, 686, 688, 690, 692, 694, 696, 698, 700, 702, 704, 706, 708, 710, 712, 714, 716, 718, 720, 722, 724, 726, 728, 730, 732, 734, 736, 738, 740, 742, 744, 746, 748, 750, 752, 754, 756, 758, 760, 762, 764, 768, 770, 772, 774, 776, 778, 780, 782, 784, 786, 788, 790, 792, 794, 796, 798, 800, 802, 804, 806, 808, 810, 812, 814, 816, 818, 820, 822, 824, 832, 834, 836, 838, 840, 842, 844, 846, 848, 850, 852, 854, 856, 858, 860, 862, 864, 866, 868, 870, 872, 874, 876, 878, 880, 882, 884, 886, 888, 890, 892, 894, 896, 898, 900, 902, 904, 906, 908, 910, 912, 914, 916, 918, 920, 922, 924, 926, 928, 930, 932, 933, 934, 935, 936, 937, 938, 939, 940, 941, 942, 943, 944, 945, 947, 949, 951, and 952; and
(b) optionally regenerating a transgenic plant from the transformed plant cell.
108. A method for selecting a plant or plant cell containing a nucleic acid construct, comprising: (a) providing a transgenic plant or plant cell containing a nucleic acid constract, wherein the nucleic acid construct comprises a nucleotide sequence that encodes a glyphosate-N-acetyltransferase, said nucleotide sequence selected from the group consisting of SEQ ED NO: 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565, 566, 567, 620, 622, 624 626, 628, 630, 632, 634, 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656 658, 660, 662, 664, 666, 668, 670, 672, 674, 676, 678, 680, 682, 684, 686, 688 690, 692, 694, 696, 698, 700, 702, 704, 706, 708, 710, 712, 714, 716, 718, 720 722, 724, 726, 728, 730, 732, 734, 736, 738, 740, 742, 744, 746, 748, 750, 752 754, 756, 758, 760, 762, 764, 768, 770, 772, 774, 776, 778, 780, 782, 784, 786 788, 790, 792, 794, 796, 798, 800, 802, 804, 806, 808, 810, 812, 814, 816, 818 820, 822, 824, 832, 834, 836, 838, 840, 842, 844, 846, 848, 850, 852, 854, 856 858, 860, 862, 864, 866, 868, 870, 872, 874, 876, 878, 880, 882, 884, 886, 888 890, 892, 894, 896, 898, 900, 902, 904, 906, 908, 910, 912, 914, 916, 918, 920 922, 924, 926, 928, 930, 932, 933, 934, 935, 936, 937, 938, 939, 940, 941, 942 943, 944, 945, 947, 949, 951, and 952; and (b) growing the transgenic plant or plant cell in the presence of glyphosate under conditions where the glyphosate-N-acetyltransferase is expressed at an effective level, whereby the fransgenic plant or plant cell grows at a rate that is discernibly greater than the plant or plant cell would grow if it did not contain the nucleic acid constract.
109. A method for selectively controlling weeds in a field containing a crop comprising:
(a) planting a field with crop seeds or plants which are glyphosate-tolerant as a result of being transformed with a gene encoding a glyphosate N- acetyltransferase, said gene selected from the groups consisting of SEQ ID NO 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531 532, 533, 534, 535, 536, 537, 538, 539, 540, 541, 542, 543, 544, 545, 546, 547 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558, 559, 560, 561, 562, 563 564, 565, 566, 567, 620, 622, 624, 626, 628, 630, 632, 634, 636, 638, 640, 642 644, 646, 648, 650, 652, 654, 656, 658, 660, 662, 664, 666, 668, 670, 672, 674 676, 678, 680, 682, 684, 686, 688, 690, 692, 694, 696, 698, 700, 702, 704, 706 708, 710, 712, 714, 716, 718, 720, 722, 724, 726, 728, 730, 732, 734, 736, 738 740, 742, 744, 746, 748, 750, 752, 754, 756, 758, 760, 762, 764, 768, 770, 772 774, 776, 778, 780, 782, 784, 786, 788, 790, 792, 794, 796, 798, 800, 802, 804 806, 808, 810, 812, 814, 816, 818, 820, 822, 824, 832, 834, 836, 838, 840, 842 844, 846, 848, 850, 852, 854, 856, 858, 860, 862, 864, 866, 868, 870, 872, 874 876, 878, 880, 882, 884, 886, 888, 890, 892, 894, 896, 898, 900, 902, 904, 906: 908, 910, 912, 914, 916, 918, 920, 922, 924, 926, 928, 930, 932, 933, 934, 935 936, 937, 938, 939, 940, 941, 942, 943, 944, 945, 947, 949, 951, and 952; and
(b) applying to the crop and weeds in the field a sufficient amount of glyphosate to control the weeds without significantly affecting the crop.
110. The method of claim 109, where the crop is cotton, corn, or soybean.
111. A transgenic plant or transgenic plant explant having an enhanced tolerance to glyphosate, wherein the plant or plant explant expresses a polypeptide with glyphosate-N-acetyltransferase activity, said polypeptide selected from the group consisting of SEQ ED NO: 568, 569, 570", 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 621, 623, 625, 627, 629, 631, 633, 635, 637, 639, 641, 643, 645, 647, 649, 651, 653, 655, 657, 659, 661, 663, 665, 667, 669, 671, 673, 675, 677, 679, 681, 683, 685, 687, 689, 691, 693, 695, 697, 699,701, 703, 705, 707, 709, 711, 713, 715, 717, 719,721, 723, 725, 727, 729, 731, 733, 735, 737, 739, 741, 743, 745, 747, 749, 751, 753, 755, 757, 759, 761, 763, 765, 767, 769, 771, 773, 775, 777, 779, 781, 783, 785, 787, 789, 791, 793, 795, 797, 799, 801, 803, 805, 807, 809, 811, 813, 815, 817, 819, 821, 823, 825, 833, 835, 837, 839, 841, 843, 845, 847, 849, 851, 853, 855, 857, 859, 861, 863, 865, 867, 869, 871, 873, 875, 877, 879, 881, 883, 885, 887, 889, 891, 893, 895, 897, 899, 901, 903, 905, 907, 909, 911, 913, 915, 917, 919, 921, 923, 925, 927, 929, 931, 953, 954, 955, 956, 957, 958, 959, 960, 961, 962, 963, 964, 965, 966, 967, 968, 969, 970, 971, and 972; and at least one polypeptide imparting glyphosate tolerance by an additional mechanism.
112. A transgenic plant or transgenic plant explant, wherein the plant or plant explant expresses a polypeptide with glyphosate-N-acetyltransferase activity, said polypeptide selected from the group consisting of SEQ ID NO: 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 621, 623, 625, 627, 629, 631, 633, 635, 637, 639, 641, 643, 645, 647, 649, 651, 653, 655, 657, 659, 661, 663, 665, 667, 669, 671, 673, 675, 677, 679, 681, 683, 685, 687, 689, 691, 693, 695, 697, 699,701, 703, 705, 707, 709, 711, 713, 715, 717, 719,721, 723, 725, 727, 729, 731, 733, 735, 737, 739, 741, 743, 745, 747, 749, 751, 753, 755, 757, 759, 761, 763, 765, 767, 769, 771, 773, 775, 777, 779, 781, 783, 785, 787, 789, 791, 793, 795, 797, 799, 801, 803, 805, 807, 809, 811, 813, 815, 817, 819, 821, 823, 825, 833, 835, 837, 839, 841, 843, 845, 847, 849, 851, 853, 855, 857, 859, 861, 863, 865, 867, 869, 871, 873, 875, 877, 879, 881, 883, 885, 887, 889, 891, 893, 895, 897, 899, 901, 903, 905, 907, 909, 911, 913, 915, 917, 919, 921, 923, 925, 927, 929, 931, 953, 954, 955, 956, 957, 958, 959, 960, 961, 962, 963, 964, 965, 966, 967, 968, 969, 970, 971, and 972; and at least one polypeptide imparting tolerance to an additional herbicide.
113. A transgenic plant or transgenic plant explant having an enhanced tolerance to glyphosate, wherein the plant or plant explant expresses a polypeptide with glyphosate-N-acetyltransferase activity, said polypeptide selected from the group consisting of SEQ BD NO: 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 621, 623, 625, 627, 629, 631, 633, 635, 637, 639, 641, 643, 645, 647, 649, 651, 653, 655, 657, 659, 661, 663, 665, 667, 669, 671, 673, 675, 677, 679, 681, 683, 685, 687, 689, 691,
693, 695, 697, 699,701, 703, 705, 707, 709, 711, 713, 715, 717, 719,721, 723, 725, 727, 729, 731, 733, 735, 737, 739, 741, 743, 745, 747, 749, 751, 753, 755, 757, 759, 761, 763, 765, 767, 769, 771, 773, 775, 777, 779, 781, 783, 785, 787, 789, 791, 793, 795, 797, 799, 801, 803, 805, 807, 809, 811, 813, 815, 817, 819, 821, 823, 825, 833, 835, 837, 839, 841, 843, 845, 847, 849, 851, 853, 855, 857, 859, 861, 863, 865, 867, 869, 871, 873, 875, 877, 879, 881, 883, 885, 887, 889, 891, 893, 895, 897, 899, 901, 903, 905, 907, 909, 911, 913, 915, 917, 919, 921, 923, 925, 927, 929, 931, 953, 954, 955, 956, 957, 958, 959, 960, 961, 962, 963, 964, 965, 966, 967, 968, 969, 970, 971, and 972; at least one polypeptide imparting glyphosate tolerance by an additional mechanism, and at least one polypeptide imparting tolerance to an additional herbicide.
114. A method for controlling weeds in a field containing a crop comprising: (a) planting the field with crop seeds or plants wliich are transformed with a gene encoding a glyphosate-N-acetyltransferase, said gene being selected from the group consisting of SEQ ED NO: 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556 557, 558, 559, 560, 561, 562, 563, 564, 565, 566, 567, 620, 622, 624, 626, 628 630, 632, 634, 636, 638, 640, 642, 644, 646, 648, 650, 652, 654, 656, 658, 660 662, 664, 666, 668, 670, 672, 674, 676, 678, 680, 682, 684, 686, 688, 690, 692 694, 696, 698, 700, 702, 704, 706, 708, 710, 712, 714, 716, 718, 720, 722, 724 726, 728, 730, 732, 734, 736, 738, 740, 742, 744, 746, 748, 750, 752, 754, 756 758, 760, 762, 764, 768, 770, 772, 774, 776, 778, 780, 782, 784, 786, 788, 790 792, 794, 796, 798, 800, 802, 804, 806, 808, 810, 812, 814, 816, 818, 820, 822 824, 832, 834, 836, 838, 840, 842, 844, 846, 848, 850, 852, 854, 856, 858, 860 862, 864, 866, 868, 870, 872, 874, 876, 878, 880, 882, 884, 886, 888, 890, 892; 894, 896, 898, 900, 902, 904, 906, 908, 910, 912, 914, 916, 918, 920, 922, 924 926, 928, 930, 932, 933, 934, 935, 936, 937, 938, 939, 940, 941, 942, 943, 944, 945, 947, 949, 951, and 952; and at least one gene encoding a polypeptide imparting glyphosate tolerance by an additional mechanism; and (b) applying to the crop and weeds in the field an effective application of glyphosate sufficient to inhibit growth of the weeds in the field without significantly affecting the crop.
115. A method for selectively controlling weeds in a field containing a crop comprising:
(a) planting the field with crop seeds or plants wliich are transformed with a gene encoding a glyphosate-N-acetyltransferase, said gene selected from the group consisting of SEQ ID NO: 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540, 541 542, 543, 544, 545, 546, 547, 548, 549 550, 551, 552, 553, 554 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565 566, 567, 620, 622, 624; 626, 628, 630: 632, 634, 636, 638, 640, 642, 644, 646648, 650, 652, 654, 656658, 660, 662 664, 666, 668, 670, 672, 674, 676, 678 680, 682, 684, 686, 688 690, 692, 694 696, 698, 700, 702, 704, 706, 708, 710712, 714, 716, 718, 720: 722, 724, 726: 728, 730, 732, 734, 736, 738, 740, 742; 744, 746, 748, 750, 752754, 756, 758 760, 762, 764, 768, 770, 772, 774, 776; 778, 780, 782, 784, 786788, 790, 792 794, 796, 798, 800, 802, 804, 806, 808 810, 812, 814,816,818 820, 822, 824: 832, 834, 836, 838, 840, 842, 844, 846 848, 850, 852, 854, 856 858, 860, 862 864, 866, 868, 870, 872, 874, 876, 878 880, 882, 884, 886, 888 890, 892, 894 896, 898, 900, 902, 904, 906, 908, 910, 912, 914, 916, 918, 920; 922, 924, 926 928, 930, 932, 933, 934, 935, 936, 937, 938, 939, 940, 941, 942, 943, 944, 945 947, 949, 951, and 952; and at least one gene encoding a polypeptide imparting tolerance to an additional herbicide, and; (b) applying to the crop and weeds in the field a simultaneous or chronologically staggered application of glyphosate and the additional herbicide which is sufficient to inhibit growth of the weeds in the field without significantly affecting the crop.
116. A method for selectively controlling weeds in a field containing a crop comprising:
(a) planting the field with crop seeds or plants which are transformed with a gene encoding a glyphosate-N-acetyltransferase, said gene selected from the group consisting of SEQ ED NO: 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540, 541, 542, 543, 544, 545, 546: 547 548, 549, 550 551, 552, 553, 554, 555 556, 557, 558, 559, 560, 561 562 563, 564, 565 566, 567, 620, 622, 624 626, 628, 630, 632, 634, 636638 640, 642, 644646, 648, 650, 652, 654 656, 658, 660, 662, 664, 666; 668 670, 672, 674676, 678, 680, 682, 684 686, 688, 690, 692, 694, 696 698 700, 702, 704: 706, 708, 710,712,714 716,718, 720, 722, 724, 726 728 730, 732, 734; 736, 738, 740, 742, 744 746, 748, 750, 752, 754, 756 758 760, 762, 764; 768, 770, 772, 774, 776 778, 780, 782, 784, 786, 788 790: 792, 794, 796 798, 800, 802, 804, 806 808, 810, 812,814,816,818 820, 822, 824, 832 834, 836, 838, 840, 842: 844, 846, 848, 850, 852, 854 856 858, 860, 862 864, 866, 868, 870, 872 874, 876, 878, 880, 882, 884886 888, 890, 892 894, 896, 898, 900, 902 904, 906, 908, 910, 912, 914916 918, 920, 922 924, 926, 928, 930, 932 933, 934, 935, 936, 937, 938 939 940, 941, 942, 943, 944, 945, 947, 949, 951, and 952; at least one gene encoding a polypeptide imparting glyphosate tolerance by an additional mechanism and at least one gene encoding a polypeptide imparting tolerance to an additional herbicide, and; (b) applying to the crop and weeds in the field a simultaneous or chronologically staggered application of glyphosate and the additional herbicide which is sufficient to inhibit growth of the weeds in the field without significantly affecting the crop.
117. An isolated or recombinant polynucleotide comprising a nucleotide sequence selected from the groups consisting of:
(a) a nucleotide sequence encoding an amino acid sequence that is at least 96% identical to SEQ ID NO:919;
(b) a nucleotide sequence encoding an amino acid sequence that is at least 97% identical to SEQ BD NO:929; (c) a nucleotide sequence encoding an amino acid sequence that is at least 98%> identical to SEQ BD NO: 847;
(d) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ BD NO: 851;
(e) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ID NO:853;
(f) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ BD NO:855;
(g) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ BD NO:857; (h) a nucleotide sequence encoding an amino acid sequence that is at least 98%> identical to SEQ BD NO: 861;
(i) a nucleotide sequence encoding an amino acid sequence that is at least 98%> identical to SEQ ID NO: 871;
(j) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ BD NO:875;
(k) a nucleotide sequence encoding an amino acid sequence that is at least 98%> identical to SEQ BD NO: 881;
(1) a nucleotide sequence encoding an amino acid sequence that is at least 98%> identical to SEQ BD NO:885; (m) a nucleotide sequence encoding an amino acid sequence that is at least 98%> identical to SEQ BD NO:887; (n) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO: 889;
(o) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:893; (p) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO: 897;
(q) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO: 899;
(r) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:909;
(s) a nucleotide sequence encoding an amino acid sequence that is at least 98% identical to SEQ ED NO:911;
(t) a nucleotide sequence encoding an amino acid sequence that is at least 99% identical to SEQ ED NO: 837; (u) a nucleotide sequence encoding an amino acid sequence that is at least 99%> identical to SEQ ED NO:841;
(v) a nucleotide sequence encoding an amino acid sequence that is at least 99% identical to SEQ ED NO:865;
(w) a nucleotide sequence encoding an amino acid sequence that is at least 99% identical to SEQ ED NO:869; and
(x) a nucleotide sequence encoding an amino acid sequence that is at least 99% identical to SEQ ED NO: 879.
118. An isolated or recombinant polynucleotide comprising a nucleotide sequence encoding an amino acid sequence that is at least 95% identical to SEQ BD
NO:929 and which comprises a Gly or an Asn residue at the amino acid position corresponding to position 33 of SEQ ID NO:929.
119. An isolated or recombinant polynucleotide encoding a polypeptide having glyphosate-N-acetyltransferase activity, said polynucleotide being selected from the group consisting of: SEQ ED NO: 832, 834, 836, 838, 840, 842, 844, 846, 848, 850, 852, 854, 856, 858, 860, 862, 864, 866, 868, 870, 872, 874, 876, 878, 880, 882, 884, 886, 888, 890, 892, 894, 896, 898, 900, 902, 904, 906, 908, 910, 912, 914, 916, 918, 920, 922, 924, 926, 928, 930,932, 933, 934, 935, 936, 937, 938, 939, 940, 941, 942, 943, 944, 945, 947, 949, 951, and 952.
120. An isolated or recombinant polypeptide comprising an amino acid sequence selected from the groups consisting of:
(a) an amino acid sequence that is at least 96%> identical to SEQ ID NO:919;
(b) an amino acid sequence that is at least 97% identical to SEQ BD NO:929;
(c) an amino acid sequence that is at least 98% identical to SEQ ED NO:847; (d) an amino acid sequence that is at least 98% identical to SEQ BD NO:851;
(e) an amino acid sequence that is at least 98% identical to SEQ ED NO:853;
(f) an amino acid sequence that is at least 98%> identical to SEQ ID NO:855;
(g) an amino acid sequence that is at least 98% identical to SEQ LD NO:857; (h) an amino acid sequence that is at least 98%> identical to SEQ BD NO:861; (i) an amino acid sequence that is at least 98%» identical to SEQ BD NO:871;
(j) an amino acid sequence that is at least 98%> identical to SEQ ED NO: 875;
(k) an amino acid sequence that is at least 98% identical to SEQ ED NO:881;
(1) an amino acid sequence that is at least 98%> identical to SEQ BD NO:885;
(m) an amino acid sequence that is at least 98% identical to SEQ ED NO:887; (n) an amino acid sequence that is at least 98% identical to SEQ ED NO:889;
(o) an amino acid sequence that is at least 98% identical to SEQ ED NO:893;
(p) an amino acid sequence that is at least 98%> identical to SEQ ED NO:897;
(q) an amino acid sequence that is at least 98%> identical to SEQ ED NO:899;
(r) an amino acid sequence that is at least 98% identical to SEQ BD NO:909; (s) an amino acid sequence that is at least 98% identical to SEQ ED NO:911 ;
(t) an amino acid sequence that is at least 99%o identical to SEQ LD NO: 837;
(u) an amino acid sequence that is at least 99%> identical to SEQ ED NO: 841;
(v) an amino acid sequence that is at least 99%> identical to SEQ BD NO:865;
(w) an amino acid sequence that is at least 99% identical to SEQ BD NO:869; and (x) an amino acid sequence that is at least 99% identical to SEQ BD NO:879.
121. An isolated or recombinant polypeptide comprising an amino acid sequence that is at least 95% identical to SEQ ED NO:929 and which comprises a Gly or an Asn residue at the amino acid position corresponding to position 33 of SEQ ED NO:929.
122. An isolated or recombinant polypeptide comprising residues 2-146 of an amino acid sequence selected from the group consisting of: SEQ ID NO: 833, 835, 837, 839, 841, 843, 845, 847, 849, 851, 853, 855, 857, 859, 861, 863, 865, 867, 869, 871, 873, 875, 877, 879, 881, 883, 885, 887, 889, 891, 893, 895, 897, 899, 901, 903, 905, 907, 909, 911, 913, 915, 917, 919, 921, 923, 925, 927, 929, 931, 953, 954, 955, 956, 957, 958, 959, 960, 961, 962, 963, 964, 965, 966, 967, 968, 969, 970, 971, and 972.
123. The isolated or recombinant polypeptide of claim 122, comprising an amino acid sequence selected from the group consisting of: SEQ ED NO: 833,
835, 837, 839, 841, 843, 845, 847, 849, 851, 853, 855, 857, 859, 861, 863, 865, 867, 869, 871, 873, 875, 877, 879, 881, 883, 885, 887, 889, 891, 893, 895, 897, 899, 901, 903, 905, 907, 909, 911, 913, 915, 917, 919, 921, 923, 925, 927, 929, 931, 948, 950, 953, 954, 955, 956, 957, 958, 959, 960, 961, 962, 963, 964, 965, 966, 967, 968, 969, 970, 971, and 972.
124. A method of increasing the expression of a polynucleotide in a plant or plant cell, the method comprising: (a) obtaining a first polynucleotide comprising a nucleic acid sequence encoding a polypeptide comprising an amino acid sequence that can be optimally aligned with a sequence selected from the group consisting of SEQ ID NO:300, SEQ ED NO:445 and SEQ ED NO:457 to generate a similarity score of at least 460 using the BLOSUM62 matrix, a gap existence penalty of 11, and a gap extension penalty of 1, the nucleic acid sequence comprising an initiating methionine ATG codon; (b) inserting one or two GCG or GCT codons into the nucleic acid sequence immediately downstream and adjacent to the initiating methionine ATG codon, resulting in a modified polynucleotide; and (c) transforming the plant or plant cell with the modified polynucleotide, wherein the plant or plant cell transformed with the modified polynucleotide accumulates a larger amount of encoded polypeptide compared to a plant or plant cell transformed with the first polynucleotide.
125. The method of claim 124, further comprising identifying one or more codons in the nucleic acid sequence which are less frequently utilized in plants, and substituting the codons which are less frequently utilized in plants for codons encoding the same amino acids which are more frequently utilized in plants.
EP04775913A 2003-04-29 2004-04-29 Novel glyphosate-n-acetyltransferase (gat) genes Withdrawn EP1620557A2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP11153914A EP2322629A3 (en) 2003-04-29 2004-04-29 Novel glyphosate-n-acetyltransferase (GAT) genes
EP12179273.3A EP2535414B1 (en) 2003-04-29 2004-04-29 Novel glyphosate-n-acetyltransferase (gat) genes

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
CL2003000863 2003-04-29
US10/427,692 US7462481B2 (en) 2000-10-30 2003-04-30 Glyphosate N-acetyltransferase (GAT) genes
PCT/US2004/013145 WO2005012515A2 (en) 2003-04-29 2004-04-29 Novel glyphosate-n-acetyltransferase (gat) genes

Related Child Applications (1)

Application Number Title Priority Date Filing Date
EP12179273.3A Division EP2535414B1 (en) 2003-04-29 2004-04-29 Novel glyphosate-n-acetyltransferase (gat) genes

Publications (1)

Publication Number Publication Date
EP1620557A2 true EP1620557A2 (en) 2006-02-01

Family

ID=34118130

Family Applications (3)

Application Number Title Priority Date Filing Date
EP11153914A Withdrawn EP2322629A3 (en) 2003-04-29 2004-04-29 Novel glyphosate-n-acetyltransferase (GAT) genes
EP04775913A Withdrawn EP1620557A2 (en) 2003-04-29 2004-04-29 Novel glyphosate-n-acetyltransferase (gat) genes
EP12179273.3A Active EP2535414B1 (en) 2003-04-29 2004-04-29 Novel glyphosate-n-acetyltransferase (gat) genes

Family Applications Before (1)

Application Number Title Priority Date Filing Date
EP11153914A Withdrawn EP2322629A3 (en) 2003-04-29 2004-04-29 Novel glyphosate-n-acetyltransferase (GAT) genes

Family Applications After (1)

Application Number Title Priority Date Filing Date
EP12179273.3A Active EP2535414B1 (en) 2003-04-29 2004-04-29 Novel glyphosate-n-acetyltransferase (gat) genes

Country Status (11)

Country Link
EP (3) EP2322629A3 (en)
JP (1) JP2007500514A (en)
CN (1) CN1863914B (en)
AU (1) AU2004260931B9 (en)
BR (1) BRPI0409816B8 (en)
CA (2) CA2662092C (en)
HR (1) HRP20050930A2 (en)
MX (1) MXPA05011585A (en)
SG (1) SG155063A1 (en)
WO (1) WO2005012515A2 (en)
ZA (1) ZA200509602B (en)

Families Citing this family (289)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AR048724A1 (en) 2004-04-30 2006-05-17 Dow Agrosciences Llc GENES WITH RESISTANCE TO HERBICIDES
BRPI0609260B1 (en) 2005-03-04 2020-11-03 Monsanto Technology Llc herbicidal glyphosate composition useful for killing or controlling weed growth in a field containing a crop of glyphosate-tolerant transgenic cotton plants
EA200800622A1 (en) 2005-08-24 2008-08-29 Пайонир Хай-Бред Интернэшнл, Инк. COMPOSITIONS PROVIDING TOLERANCE TO SEVERAL HERBICIDES AND METHODS OF THEIR APPLICATION
US7951995B2 (en) * 2006-06-28 2011-05-31 Pioneer Hi-Bred International, Inc. Soybean event 3560.4.3.5 and compositions and methods for the identification and detection thereof
US7968770B2 (en) 2006-06-28 2011-06-28 Pioneer Hi-Bred International, Inc. Methods for improving yield using soybean event 3560.4.3.5
US7897846B2 (en) 2006-10-30 2011-03-01 Pioneer Hi-Bred Int'l, Inc. Maize event DP-098140-6 and compositions and methods for the identification and/or detection thereof
US7928296B2 (en) 2006-10-30 2011-04-19 Pioneer Hi-Bred International, Inc. Maize event DP-098140-6 and compositions and methods for the identification and/or detection thereof
CL2007003743A1 (en) 2006-12-22 2008-07-11 Bayer Cropscience Ag COMPOSITION THAT INCLUDES FENAMIDONA AND AN INSECTICIDE COMPOUND; AND METHOD TO CONTROL FITOPATOGENOS CULTURES AND INSECTS FACING OR PREVENTIVELY.
CL2007003744A1 (en) 2006-12-22 2008-07-11 Bayer Cropscience Ag COMPOSITION THAT INCLUDES A 2-PYRIDILMETILBENZAMIDE DERIVATIVE AND AN INSECTICIDE COMPOUND; AND METHOD TO CONTROL FITOPATOGENOS CULTURES AND INSECTS FACING OR PREVENTIVELY.
EP1969931A1 (en) 2007-03-12 2008-09-17 Bayer CropScience Aktiengesellschaft Fluoroalkyl phenylamidines and their use as fungicides
EP1969929A1 (en) 2007-03-12 2008-09-17 Bayer CropScience AG Substituted phenylamidines and their use as fungicides
EP1969930A1 (en) 2007-03-12 2008-09-17 Bayer CropScience AG Phenoxy phenylamidines and their use as fungicides
EP1969934A1 (en) 2007-03-12 2008-09-17 Bayer CropScience AG 4-cycloalkyl or 4-aryl substituted phenoxy phenylamidines and their use as fungicides
BRPI0808798A2 (en) 2007-03-12 2014-10-07 Bayer Cropscience Ag 3,5-DISSUBSTITUTED PHENOXYPHENYLAMIDINS AND THEIR USE AS FUNGICIDES
US9199922B2 (en) 2007-03-12 2015-12-01 Bayer Intellectual Property Gmbh Dihalophenoxyphenylamidines and use thereof as fungicides
US8003398B2 (en) 2007-03-27 2011-08-23 E.I. De Pont De Nemours And Company Methods and compositions for detecting glyphosate and metabolites thereof
US7723579B2 (en) 2007-04-12 2010-05-25 Dow Agrosciences Llc Canola cultivar DN040244
US7728195B2 (en) 2007-04-12 2010-06-01 Dow Agrosciences Llc Canola cultivar DN040856
US7723580B2 (en) 2007-04-12 2010-05-25 Dow Agrosciences Llc Canola cultivar DN040844
US7723582B2 (en) 2007-04-12 2010-05-25 Dow Agrosciences Llc Canola cultivar DN041100
US7723577B2 (en) 2007-04-12 2010-05-25 Dow Agrosciences Llc Canola cultivar DN040847
US7723578B2 (en) 2007-04-12 2010-05-25 Dow Agrosciences Llc Canola cultivar DN040839
US7718852B2 (en) 2007-04-12 2010-05-18 Dow Agrosciences Llc Canola cultivar DN040241
US7723581B2 (en) 2007-04-12 2010-05-25 Dow Agrosciences Llc Canola cultivar DN040845
AU2008240142B2 (en) 2007-04-12 2014-11-06 Corteva Agriscience Llc Novel canola cultivars having high yield and stabilized fatty acid profiles
WO2008128639A1 (en) 2007-04-19 2008-10-30 Bayer Cropscience Aktiengesellschaft Thiadiazolyl oxyphenyl amidines and the use thereof as a fungicide
DE102007045922A1 (en) 2007-09-26 2009-04-02 Bayer Cropscience Ag Drug combinations with insecticidal and acaricidal properties
DE102007045956A1 (en) 2007-09-26 2009-04-09 Bayer Cropscience Ag Combination of active ingredients with insecticidal and acaricidal properties
DE102007045955A1 (en) 2007-09-26 2009-04-09 Bayer Cropscience Ag Active agent combination, useful e.g. for combating animal pests and treating seeds of transgenic plants, comprises substituted amino-furan-2-one compound and at least one compound e.g. diazinon, isoxathion, carbofuran or aldicarb
DE102007045920B4 (en) 2007-09-26 2018-07-05 Bayer Intellectual Property Gmbh Synergistic drug combinations
DE102007045957A1 (en) 2007-09-26 2009-04-09 Bayer Cropscience Ag Active agent combination, useful e.g. for combating animal pests e.g. insects and treating seeds of transgenic plants, comprises substituted amino-furan-2-one compound and at least one compound e.g. benzoyl urea, buprofezin and cyromazine
DE102007045953B4 (en) 2007-09-26 2018-07-05 Bayer Intellectual Property Gmbh Drug combinations with insecticidal and acaricidal properties
DE102007045919B4 (en) 2007-09-26 2018-07-05 Bayer Intellectual Property Gmbh Drug combinations with insecticidal and acaricidal properties
EP2090168A1 (en) 2008-02-12 2009-08-19 Bayer CropScience AG Method for improving plant growth
RU2495935C2 (en) 2007-10-05 2013-10-20 ДАУ АГРОСАЙЕНСИЗ ЭлЭлСи Method of transferring molecular substances in plant cells
EP2072506A1 (en) 2007-12-21 2009-06-24 Bayer CropScience AG Thiazolyloxyphenylamidine or thiadiazolyloxyphenylamidine und its use as fungicide
EP2168434A1 (en) 2008-08-02 2010-03-31 Bayer CropScience AG Use of azols to increase resistance of plants of parts of plants to abiotic stress
WO2010015423A2 (en) 2008-08-08 2010-02-11 Bayer Bioscience N.V. Methods for plant fiber characterization and identification
KR20110044900A (en) 2008-08-14 2011-05-02 바이엘 크롭사이언스 아게 Insecticidal 4-phenyl-1H-pyrazole
DE102008041695A1 (en) 2008-08-29 2010-03-04 Bayer Cropscience Ag Methods for improving plant growth
AR074088A1 (en) 2008-11-04 2010-12-22 Dow Agrosciences Llc OBTAINING IMPROVED OILS AND FLOORS FROM MODIFIED BRASSICA JUNCEA SPECIES
EP2201838A1 (en) 2008-12-05 2010-06-30 Bayer CropScience AG Active ingredient-beneficial organism combinations with insecticide and acaricide properties
EP2198709A1 (en) 2008-12-19 2010-06-23 Bayer CropScience AG Method for treating resistant animal pests
EP2223602A1 (en) 2009-02-23 2010-09-01 Bayer CropScience AG Method for improved utilisation of the production potential of genetically modified plants
EP2204094A1 (en) 2008-12-29 2010-07-07 Bayer CropScience AG Method for improved utilization of the production potential of transgenic plants Introduction
EP2381781B1 (en) 2008-12-29 2016-06-08 Bayer Intellectual Property GmbH Method for improved use of the production potential of genetically modified plants
EP2039771A2 (en) 2009-01-06 2009-03-25 Bayer CropScience AG Method for improved utilization of the production potential of transgenic plants
EP2039770A2 (en) 2009-01-06 2009-03-25 Bayer CropScience AG Method for improved utilization of the production potential of transgenic plants
EP2039772A2 (en) 2009-01-06 2009-03-25 Bayer CropScience AG Method for improved utilization of the production potential of transgenic plants introduction
BRPI1006916A8 (en) 2009-01-19 2016-05-03 Bayer Cropscience Ag CYCLIC DIONES AND THEIR USE AS INSECTICIDES, ACARICIDES AND/OR FUNGICIDES
EP2227951A1 (en) 2009-01-23 2010-09-15 Bayer CropScience AG Application of enaminocarbonyl compounds for combating viruses transmitted by insects
BRPI1004930B1 (en) 2009-01-28 2017-10-17 Bayer Intellectual Property Gmbh Compounds, fungicidal composition and method for controlling phytopathogenic fungi of crops.
AR075126A1 (en) 2009-01-29 2011-03-09 Bayer Cropscience Ag METHOD FOR THE BEST USE OF THE TRANSGENIC PLANTS PRODUCTION POTENTIAL
EP2395843B1 (en) 2009-02-13 2017-08-09 Monsanto Technology LLC Encapsulation of herbicides to reduce crop injury
EP2218717A1 (en) 2009-02-17 2010-08-18 Bayer CropScience AG Fungicidal N-((HET)Arylethyl)thiocarboxamide derivatives
US8372982B2 (en) 2009-02-17 2013-02-12 Bayer Cropscience Ag Fungicidal N-(Phenylcycloalkyl)carboxamide, N-(Benzylcycloalkyl)carboxamide and thiocarboxamide derivatives
TW201031331A (en) 2009-02-19 2010-09-01 Bayer Cropscience Ag Pesticide composition comprising a tetrazolyloxime derivative and a fungicide or an insecticide active substance
DE102009001469A1 (en) 2009-03-11 2009-09-24 Bayer Cropscience Ag Improving utilization of productive potential of transgenic plant by controlling e.g. animal pest, and/or by improving plant health, comprises treating the transgenic plant with active agent composition comprising prothioconazole
DE102009001681A1 (en) 2009-03-20 2010-09-23 Bayer Cropscience Ag Improving utilization of production potential of a transgenic plant by controlling animal pests, phytopathogenic fungi, microorganisms and/or improving plant health, comprises treating plant with a drug composition comprising iprovalicarb
DE102009001728A1 (en) 2009-03-23 2010-09-30 Bayer Cropscience Ag Improving the production potential of transgenic plant, by combating e.g. animal pests and/or microorganism, and/or increasing plant health, comprises treating the plants with active agent composition comprising fluoxastrobin
DE102009001730A1 (en) 2009-03-23 2010-09-30 Bayer Cropscience Ag Improving utilization of production potential of a transgenic plant by controlling animal pests, phytopathogenic fungi and/or microorganisms and/or the plant health, comprises treating plant with a drug composition comprising spiroxamine
DE102009001732A1 (en) 2009-03-23 2010-09-30 Bayer Cropscience Ag Improving the production potential of transgenic plant, by combating e.g. animal pests and/or microorganism, and/or increasing plant health, comprises treating the plants with active agent composition comprising trifloxystrobin
EP2410850A2 (en) 2009-03-25 2012-02-01 Bayer Cropscience AG Synergistic combinations of active ingredients
AP3073A (en) 2009-03-25 2014-12-31 Bayer Cropscience Ag Active ingredient combinations with insecticidal and acaricidal properties
BRPI0924986A8 (en) 2009-03-25 2016-06-21 Bayer Cropscience Ag "COMBINATIONS OF ACTIVE SUBSTANCES WITH INSECTICIDE AND ACARICIDE PROPERTIES, THEIR USES AND METHOD FOR THE CONTROL OF ANIMAL PESTS".
US8828906B2 (en) 2009-03-25 2014-09-09 Bayer Cropscience Ag Active compound combinations having insecticidal and acaricidal properties
WO2010108504A1 (en) 2009-03-25 2010-09-30 Bayer Cropscience Ag Active ingredient combinations having insecticidal and acaricidal properties
EP2232995A1 (en) 2009-03-25 2010-09-29 Bayer CropScience AG Method for improved utilisation of the production potential of transgenic plants
EP2239331A1 (en) 2009-04-07 2010-10-13 Bayer CropScience AG Method for improved utilization of the production potential of transgenic plants
JP5771189B2 (en) 2009-05-06 2015-08-26 バイエル・インテレクチュアル・プロパティ・ゲゼルシャフト・ミット・ベシュレンクテル・ハフツングBayer Intellectual Property GmbH Cyclopentanedione compounds and their use as insecticides, acaricides and / or antifungal agents
EP2251331A1 (en) 2009-05-15 2010-11-17 Bayer CropScience AG Fungicide pyrazole carboxamides derivatives
AR076839A1 (en) 2009-05-15 2011-07-13 Bayer Cropscience Ag FUNGICIDE DERIVATIVES OF PIRAZOL CARBOXAMIDAS
EP2255626A1 (en) 2009-05-27 2010-12-01 Bayer CropScience AG Use of succinate dehydrogenase inhibitors to increase resistance of plants or parts of plants to abiotic stress
EP2437595B1 (en) 2009-06-02 2018-10-31 Bayer CropScience AG Use of fluopyram for controlling sclerotinia ssp
EP2451947A1 (en) 2009-07-07 2012-05-16 Linda A. Castle Crystal structure of glyphosate acetyltransferase (glyat) and methods of use
BR112012001080A2 (en) 2009-07-16 2015-09-01 Bayer Cropscience Ag Combinations of synergistic active substances containing phenyltriazoles
WO2011015524A2 (en) 2009-08-03 2011-02-10 Bayer Cropscience Ag Fungicide heterocycles derivatives
EP2292094A1 (en) 2009-09-02 2011-03-09 Bayer CropScience AG Active compound combinations
JP2013507919A (en) 2009-10-16 2013-03-07 ダウ アグロサイエンシィズ エルエルシー Use of dendrimer nanotechnology to deliver biomolecules to plant cells
US8581046B2 (en) 2010-11-24 2013-11-12 Pioneer Hi-Bred International, Inc. Brassica gat event DP-073496-4 and compositions and methods for the identification and/or detection thereof
EP2343280A1 (en) 2009-12-10 2011-07-13 Bayer CropScience AG Fungicide quinoline derivatives
JP5852009B2 (en) 2009-12-23 2016-02-03 バイエル・インテレクチュアル・プロパティ・ゲーエムベーハーBayer Intellectual Property Gmbh Plants resistant to HPPD-inhibiting herbicides
ES2659085T3 (en) 2009-12-23 2018-03-13 Bayer Intellectual Property Gmbh HPPD Inhibitor Herbicide Tolerant Plants
UY33140A (en) 2009-12-23 2011-07-29 Bayer Cropscience Ag TOLERANT PLANTS TO INHIBITING HERBICIDES OF HPPD
CA2785211C (en) 2009-12-23 2018-12-11 Bayer Intellectual Property Gmbh Plants tolerant to hppd inhibitor herbicides
BR112012015690A2 (en) 2009-12-23 2015-08-25 Bayer Intelectual Property Gmbh Herbicide tolerant plants of hppd inhibitors.
CN105399666A (en) 2009-12-28 2016-03-16 拜尔农科股份公司 Fungicide hydroximoyl-heterocycles derivatives
TW201138624A (en) 2009-12-28 2011-11-16 Bayer Cropscience Ag Fungicide hydroximoyl-tetrazole derivatives
EP2519516A2 (en) 2009-12-28 2012-11-07 Bayer CropScience AG Fungicidal hydroximoyl-tetrazole derivatives
WO2011089071A2 (en) 2010-01-22 2011-07-28 Bayer Cropscience Ag Acaricide and/or insecticide active substance combinations
US8378177B2 (en) 2010-02-03 2013-02-19 Dow Agrosciences, Llc Canola cultivar DN051493
ES2523503T3 (en) 2010-03-04 2014-11-26 Bayer Intellectual Property Gmbh 2-Fluoroalkyl-substituted amidobenzimidazoles and their use for increasing stress tolerance in plants
EP2547204A2 (en) 2010-03-18 2013-01-23 Bayer Intellectual Property GmbH Aryl and hetaryl sulfonamides as active agents against abiotic plant stress
KR20130080005A (en) 2010-03-31 2013-07-11 다우 아그로사이언시즈 엘엘씨 Plant peptide gamma-zein for delivery of biomolecules into plant cells
CN102933078A (en) 2010-04-06 2013-02-13 拜耳知识产权有限责任公司 Use of 4-phenylbutyric acid and/or the salts thereof for enhancing the stress tolerance of plants
CA2795838A1 (en) 2010-04-09 2011-10-13 Bayer Intellectual Property Gmbh Use of derivatives of the(1-cyanocyclopropyl)phenylphosphinic acid, the esters thereof and/or the salts thereof for enhancing the tolerance of plants to abiotic stress
WO2011134913A1 (en) 2010-04-28 2011-11-03 Bayer Cropscience Ag Fungicide hydroximoyl-heterocycles derivatives
EP2563772A1 (en) 2010-04-28 2013-03-06 Bayer Cropscience AG Fungicide hydroximoyl-heterocycles derivatives
WO2011134911A2 (en) 2010-04-28 2011-11-03 Bayer Cropscience Ag Fungicide hydroximoyl-tetrazole derivatives
BR112012030580B1 (en) 2010-06-03 2018-06-05 Bayer Cropscience Ag COMPOUND, FUNGICIDE COMPOSITION AND METHOD FOR CONTROLING PHYTOPATHOGENIC CROPS FUNGI
UA110703C2 (en) 2010-06-03 2016-02-10 Байєр Кропсайнс Аг Fungicidal n-[(trisubstitutedsilyl)methyl]carboxamide
JP5730993B2 (en) 2010-06-03 2015-06-10 バイエル・クロップサイエンス・アーゲーBayer Cropscience Ag N-[(Heta) arylalkyl)] pyrazole (thio) carboxamides and their hetero-substituted analogues
AU2011264074B2 (en) 2010-06-09 2015-01-22 Bayer Cropscience Nv Methods and means to modify a plant genome at a nucleotide sequence commonly used in plant genome engineering
CN109504700A (en) 2010-06-09 2019-03-22 拜尔作物科学公司 Plant Genome transformation in commonly on nucleotide sequence modified plant genome Method and kit for
WO2012006443A2 (en) 2010-07-07 2012-01-12 Dow Agrosciences Llc Linear dna molecule delivery using pegylated quantum dots for stable transformation in plants
AR082286A1 (en) 2010-07-20 2012-11-28 Bayer Cropscience Ag BENZOCICLOALQUENOS AS ANTIFUNGIC AGENTS
WO2012021797A1 (en) 2010-08-13 2012-02-16 Pioneer Hi-Bred International, Inc. Methods and compositions for targeting sequences of interest to the chloroplast
AU2011291580B2 (en) 2010-08-18 2015-08-20 Monsanto Technology Llc Early applications of encapsulated acetamides for reduced injury in crops
PL2611300T3 (en) 2010-09-03 2016-10-31 Substituted annelated dihydropyrimidinone compounds
BR112013006611B1 (en) 2010-09-22 2021-01-19 Bayer Intellectual Property Gmbh method for the control of soy cyst nematode (heterodera glycines) by infesting a nematode resistant soy plant comprising the application of n- {2- [3-chloro-5- (trifluoromethyl) -2-pyridinyl] ethyl} -2 - (trifluoromethyl) benzamide (fluoride
EP2460406A1 (en) 2010-12-01 2012-06-06 Bayer CropScience AG Use of fluopyram for controlling nematodes in nematode resistant crops
JP5977242B2 (en) 2010-10-07 2016-08-24 バイエル・クロップサイエンス・アクチェンゲゼルシャフト Bactericidal composition comprising a tetrazolyl oxime derivative and a thiazolyl piperidine derivative
UA107865C2 (en) 2010-10-21 2015-02-25 Байєр Інтелекчуал Проперті Гмбх Heterocyclic carboxamides
US9545105B2 (en) 2010-10-21 2017-01-17 Bayer Intellectual Property Gmbh 1-(heterocyclic carbonyl) piperidines
CN103298802B (en) 2010-11-02 2016-06-08 拜耳知识产权有限责任公司 N-hetervaromatic methyl pyrazolyl carboxylic acid amides
ES2588802T3 (en) 2010-11-10 2016-11-04 Bayer Cropscience Ag HPPD variants and usage procedures
BR112013012080A2 (en) 2010-11-15 2016-07-19 Bayer Ip Gmbh n-aryl pyrazole (thio) carboxamides
MX2013005410A (en) 2010-11-15 2013-07-03 Bayer Ip Gmbh 5-halogenopyrazole(thio)carboxamides.
MX2013005407A (en) 2010-11-15 2013-07-03 Bayer Ip Gmbh 5-halogenopyrazolecarboxamides.
US8575431B2 (en) 2010-11-24 2013-11-05 Pioneer Hi-Bred International, Inc. Brassica GAT event DP-061061-7 and compositions and methods for the identification and/or detection thereof
EP2460407A1 (en) 2010-12-01 2012-06-06 Bayer CropScience AG Agent combinations comprising pyridylethyl benzamides and other agents
KR20130123416A (en) 2010-12-01 2013-11-12 바이엘 인텔렉쳐 프로퍼티 게엠베하 Use of fluopyram for controlling nematodes in crops and for increasing yield
EP2658853A1 (en) 2010-12-29 2013-11-06 Bayer Intellectual Property GmbH Fungicide hydroximoyl-tetrazole derivatives
EP2474542A1 (en) 2010-12-29 2012-07-11 Bayer CropScience AG Fungicide hydroximoyl-tetrazole derivatives
US8530726B2 (en) 2010-12-30 2013-09-10 Agrigenetics, Inc. Canola cultivar G030994
US8563810B2 (en) 2010-12-30 2013-10-22 Agrigenetics, Inc. Canola cultivar DN040244A
US8563811B2 (en) 2010-12-30 2013-10-22 Agrigenetics, Inc. Canola cultivar DN040845A
US8558065B2 (en) 2010-12-30 2013-10-15 Agrigenetics, Inc. Canola cultivar G31064
EP2471363A1 (en) 2010-12-30 2012-07-04 Bayer CropScience AG Use of aryl-, heteroaryl- and benzylsulfonamide carboxylic acids, -carboxylic acid esters, -carboxylic acid amides and -carbonitriles and/or its salts for increasing stress tolerance in plants
US9603322B2 (en) 2010-12-30 2017-03-28 Agrigenetics, Inc. Canola cultivars having high yield and stabilized fatty acid profiles
US8558064B2 (en) 2010-12-30 2013-10-15 Agrigenetics, Inc. Canola cultivar CL31613
US9375025B2 (en) 2011-02-22 2016-06-28 Agrigenetics, Inc. Canola producing dark seeds and low acid detergent fiber
EP2494867A1 (en) 2011-03-01 2012-09-05 Bayer CropScience AG Halogen-substituted compounds in combination with fungicides
US20130345058A1 (en) 2011-03-10 2013-12-26 Wolfram Andersch Use of lipochito-oligosaccharide compounds for safeguarding seed safety of treated seeds
BR112013023502A2 (en) 2011-03-14 2016-08-02 Bayer Ip Gmbh compound (i), fungicidal composition, method for the control of crop phytopathogenic fungi, use of the compounds of formula (i) and process for producing the compositions
WO2012129443A2 (en) 2011-03-23 2012-09-27 Dow Agrosciences Llc Quantum dot carrier peptide conjugates suitable for imaging and delivery applications in plants
WO2012130684A1 (en) 2011-03-25 2012-10-04 Bayer Cropscience Ag Use of n-(1,2,5-oxadiazol-3-yl)benzamides for controlling unwanted plants in areas of transgenic crop plants being tolerant to hppd inhibitor herbicides
EA201391301A1 (en) 2011-03-25 2014-03-31 Байер Интеллектуэль Проперти Гмбх USE OF N- (tetrazol-4-YL) - or N- (triazol-3-YL) arylcarboxamide or salts thereof for controlling unwanted plants in the growing area of the transgenic crop plants resistant to herbicides HPPD-inhibitors
US20140051575A1 (en) 2011-04-08 2014-02-20 Juergen Benting Fungicide hydroximoyl-tetrazole derivatives
EP2511255A1 (en) 2011-04-15 2012-10-17 Bayer CropScience AG Substituted prop-2-in-1-ol and prop-2-en-1-ol derivatives
AR085585A1 (en) 2011-04-15 2013-10-09 Bayer Cropscience Ag VINIL- AND ALQUINILCICLOHEXANOLES SUBSTITUTED AS ACTIVE PRINCIPLES AGAINST STRIPS ABIOTIQUE OF PLANTS
AR085568A1 (en) 2011-04-15 2013-10-09 Bayer Cropscience Ag 5- (BICYCLE [4.1.0] HEPT-3-EN-2-IL) -PENTA-2,4-DIENOS AND 5- (BICYCLE [4.1.0] HEPT-3-EN-2-IL) -PENT- 2-IN-4-INOS REPLACED AS ACTIVE PRINCIPLES AGAINST ABIOTIC STRESS OF PLANTS
AR090010A1 (en) 2011-04-15 2014-10-15 Bayer Cropscience Ag 5- (CICLOHEX-2-EN-1-IL) -PENTA-2,4-DIENOS AND 5- (CICLOHEX-2-EN-1-IL) -PENT-2-EN-4-INOS REPLACED AS ACTIVE PRINCIPLES AGAINST THE ABIOTIC STRESS OF PLANTS, USES AND TREATMENT METHODS
US20140038823A1 (en) 2011-04-22 2014-02-06 Peter Dahmen Active compound combinations comprising a (thio)carboxamide derivative and a fungidical compound
EP2718443B1 (en) 2011-06-06 2017-11-29 Bayer CropScience NV Methods and means to modify a plant genome at a preselected site
JP2014520776A (en) 2011-07-04 2014-08-25 バイエル・インテレクチユアル・プロパテイー・ゲー・エム・ベー・ハー Use of substituted isoquinolinones, isoquinoline diones, isoquinoline triones and dihydroisoquinolinones or their salts in each case as active agents against abiotic stresses in plants
WO2013020985A1 (en) 2011-08-10 2013-02-14 Bayer Intellectual Property Gmbh Active compound combinations comprising specific tetramic acid derivatives
US20140215655A1 (en) 2011-08-12 2014-07-31 Bayer Cropscience Nv Guard cell-specific expression of transgenes in cotton
WO2013026836A1 (en) 2011-08-22 2013-02-28 Bayer Intellectual Property Gmbh Fungicide hydroximoyl-tetrazole derivatives
WO2013026740A2 (en) 2011-08-22 2013-02-28 Bayer Cropscience Nv Methods and means to modify a plant genome
EP2561759A1 (en) 2011-08-26 2013-02-27 Bayer Cropscience AG Fluoroalkyl-substituted 2-amidobenzimidazoles and their effect on plant growth
JP2014530173A (en) 2011-09-09 2014-11-17 バイエル・インテレクチユアル・プロパテイー・ゲー・エム・ベー・ハー Acyl-homoserine lactone derivatives for improving plant yield
BR112014005471A2 (en) 2011-09-12 2017-03-28 Bayer Ip Gmbh compounds of formula (i), (v), (vii), fungicidal composition, method for the control of crop phytopathogenic fungi, use of the compounds of formula (i) and process for the production of compositions for the control of phytopathogenic harmful fungi
CN107897194A (en) 2011-09-16 2018-04-13 拜耳知识产权有限责任公司 5 phenyl or 5 benzyl, 2 isoxazoline, 3 formic acid esters are used for the purposes for improving plant products
AU2012307324A1 (en) 2011-09-16 2014-03-06 Bayer Intellectual Property Gmbh Use of phenylpyrazolin-3-carboxylates for improving plant yield
AR087874A1 (en) 2011-09-16 2014-04-23 Bayer Ip Gmbh USE OF ACILSULPHONAMIDES TO IMPROVE THE PERFORMANCE OF PLANTS
CN103929964A (en) 2011-09-23 2014-07-16 拜耳知识产权有限责任公司 Use of 4-substituted 1-phenyl-pyrazole-3-carboxylic-acid derivatives as agents against abiotic plant stress
CA2844868A1 (en) 2011-10-04 2013-04-11 Bayer Intellectual Property Gmbh Rnai for the control of fungi and oomycetes by inhibiting saccharopine dehydrogenase gene
WO2013050324A1 (en) 2011-10-06 2013-04-11 Bayer Intellectual Property Gmbh Combination, containing 4-phenylbutyric acid (4-pba) or a salt thereof (component (a)) and one or more selected additional agronomically active compounds (component(s) (b)), that reduces abiotic plant stress
US9204602B1 (en) 2011-11-21 2015-12-08 Agrigenetics, Inc. Canola inbred CL77606R
JP2014533666A (en) 2011-11-21 2014-12-15 バイエル・インテレクチユアル・プロパテイー・ゲー・エム・ベー・ハー Bactericide N-[(trisubstituted silyl) methyl] -carboxamide derivatives
US9204601B1 (en) 2011-11-21 2015-12-08 Agrigenetics, Inc. Canola inbred CL60855R
US9210857B1 (en) 2011-11-21 2015-12-15 Agrigenetics, Inc. Canola inbred CL102407R
AR089656A1 (en) 2011-11-30 2014-09-10 Bayer Ip Gmbh DERIVATIVES OF N-BICICLOALQUIL- AND N-TRICICLOALQUIL- (TIO) -CARBOXAMIDA FUNGICIDAS
US9414595B2 (en) 2011-12-19 2016-08-16 Bayer Cropscience Ag Use of anthranilic acid diamide derivatives for pest control in transgenic crops
KR102028903B1 (en) 2011-12-29 2019-10-07 바이엘 인텔렉쳐 프로퍼티 게엠베하 Fungicidal 3-[(pyridin-2-ylmethoxyimino)(phenyl)methyl]-2-substituted-1,2,4-oxadiazol-5(2h)-one derivatives
BR112014015993A8 (en) 2011-12-29 2017-07-04 Bayer Ip Gmbh compound, composition, method for fungal control, use of compounds and process for producing the compositions
NZ628308A (en) 2012-02-22 2017-02-24 Bayer Ip Gmbh Use of succinate dehydrogenase inhibitors (sdhis) for controlling wood diseases in grape.
WO2013127704A1 (en) 2012-02-27 2013-09-06 Bayer Intellectual Property Gmbh Active compound combinations containing a thiazoylisoxazoline and a fungicide
WO2013139949A1 (en) 2012-03-23 2013-09-26 Bayer Intellectual Property Gmbh Compositions comprising a strigolactame compound for enhanced plant growth and yield
CN104245687B (en) 2012-04-12 2016-12-14 拜尔农科股份公司 N-acyl group-2-(ring) alkyl pyrrolidine and piperidines as antifungal
EP2838893B1 (en) 2012-04-20 2019-03-13 Bayer Cropscience AG N-cycloalkyl-n-[(heterocyclylphenyl)methylene]-(thio)carboxamide derivatives
JP2015516396A (en) 2012-04-20 2015-06-11 バイエル・クロップサイエンス・アーゲーBayer Cropscience Ag N-cycloalkyl-N-[(trisubstituted silylphenyl) methylene]-(thio) carboxamide derivatives
AU2013254857B2 (en) 2012-04-23 2018-04-26 Bayer Cropscience Nv Targeted genome engineering in plants
WO2013167545A1 (en) 2012-05-09 2013-11-14 Bayer Cropscience Ag Pyrazole indanyl carboxamides
EP2662363A1 (en) 2012-05-09 2013-11-13 Bayer CropScience AG 5-Halogenopyrazole biphenylcarboxamides
EP2662360A1 (en) 2012-05-09 2013-11-13 Bayer CropScience AG 5-Halogenopyrazole indanyl carboxamides
EP2662362A1 (en) 2012-05-09 2013-11-13 Bayer CropScience AG Pyrazole indanyl carboxamides
EP2662370A1 (en) 2012-05-09 2013-11-13 Bayer CropScience AG 5-Halogenopyrazole benzofuranyl carboxamides
MX2014013489A (en) 2012-05-09 2015-02-12 Bayer Cropscience Ag 5-halogenopyrazole indanyl carboxamides.
EP2662361A1 (en) 2012-05-09 2013-11-13 Bayer CropScience AG Pyrazol indanyl carboxamides
EP2662364A1 (en) 2012-05-09 2013-11-13 Bayer CropScience AG Pyrazole tetrahydronaphthyl carboxamides
AR091104A1 (en) 2012-05-22 2015-01-14 Bayer Cropscience Ag COMBINATIONS OF ACTIVE COMPOUNDS THAT INCLUDE A LIPO-CHYTOOLIGOSACARIDE DERIVATIVE AND A NEMATICIDE, INSECTICIDE OR FUNGICIDE COMPOUND
AU2013271740B2 (en) 2012-06-07 2017-04-13 Corteva Agriscience Llc Construct and method for expressing transgenes using a Brassica bidirectional constitutive promoter
EP2871958A1 (en) 2012-07-11 2015-05-20 Bayer CropScience AG Use of fungicidal combinations for increasing the tolerance of a plant towards abiotic stress
WO2014018512A1 (en) 2012-07-26 2014-01-30 Dow Agrosciences Llc High-throughput dna fragment assembly
AU2013302947B2 (en) 2012-08-17 2017-06-01 Diaa ALABED Use of a maize untranslated region for transgene expression in plants
CA2883574A1 (en) 2012-09-05 2014-03-13 Bayer Cropscience Ag Use of substituted 2-amidobenzimidazoles, 2-amidobenzoxazoles and 2-amidobenzothiazoles or salts thereof as active substances against abiotic plant stress
EP2893006B1 (en) 2012-09-07 2018-08-22 Dow AgroSciences LLC Fad3 performance loci and corresponding target site specific binding proteins capable of inducing targeted breaks
UA118090C2 (en) 2012-09-07 2018-11-26 ДАУ АГРОСАЙЄНСІЗ ЕлЕлСі Fad2 performance loci and corresponding target site specific binding proteins capable of inducing targeted breaks
WO2014043435A1 (en) 2012-09-14 2014-03-20 Bayer Cropscience Lp Hppd variants and methods of use
PL2908640T3 (en) 2012-10-19 2020-06-29 Bayer Cropscience Ag Method of plant growth promotion using carboxamide derivatives
LT2908641T (en) 2012-10-19 2018-04-25 Bayer Cropscience Ag Method for treating plants against fungi resistant to fungicides using carboxamide or thiocarboxamide derivatives
EA026839B1 (en) 2012-10-19 2017-05-31 Байер Кропсайенс Аг Active compound combinations comprising carboxamide compounds
AU2013333846B2 (en) 2012-10-19 2017-04-20 Bayer Cropscience Ag Method for enhancing tolerance to abiotic stress in plants using carboxamide or thiocarboxamide derivatives
EP2735231A1 (en) 2012-11-23 2014-05-28 Bayer CropScience AG Active compound combinations
WO2014079957A1 (en) 2012-11-23 2014-05-30 Bayer Cropscience Ag Selective inhibition of ethylene signal transduction
US9447430B1 (en) 2012-11-29 2016-09-20 Agrigenetics, Inc. Canola inbred line G2X0023AB
US9445564B1 (en) 2012-11-29 2016-09-20 Agrigenetics, Inc. Canola inbred line DN051465A
US9414556B1 (en) 2012-11-29 2016-08-16 Agrigenetics, Inc. Canola inbred restorer line G98014R
CN104994736B (en) 2012-11-30 2018-02-06 拜耳作物科学股份公司 Binary agricultural chemicals and Fungicidal mixture
PL2925134T3 (en) 2012-11-30 2020-06-29 Bayer Cropscience Ag Ternary fungicidal mixtures
JP6367214B2 (en) 2012-11-30 2018-08-01 バイエル・クロップサイエンス・アクチェンゲゼルシャフト Two-component fungicide mixture or two-component pesticide mixture
EA030236B1 (en) 2012-11-30 2018-07-31 Байер Кропсайенс Акциенгезельшафт Ternary fungicidal and pesticidal mixtures
BR112015012057B1 (en) 2012-11-30 2020-05-12 Bayer Cropscience Ag COMPOSITION OF FORMULA (I) AND ITS PREPARATION PROCESS, METHOD TO CONTROL ONE OR MORE HARMFUL MICROORGANISMS AND TO TREAT SEEDS
EP2740356A1 (en) 2012-12-05 2014-06-11 Bayer CropScience AG Substituted (2Z)-5(1-Hydroxycyclohexyl)pent-2-en-4-inic acid derivatives
BR112015012926A2 (en) 2012-12-05 2017-07-11 Bayer Cropscience Ag use of 1- (aryl ethinyl) -, 1- (heteroaryl ethinyl) -, 1- (heterocyclyl ethinyl) substituted and 1- (cycloalkenyl ethinyl) cyclohexanols as active agents against abiotic plant stress
EP2740720A1 (en) 2012-12-05 2014-06-11 Bayer CropScience AG Substituted bicyclic and tricyclic pent-2-en-4-inic acid derivatives and their use for enhancing the stress tolerance in plants
WO2014090765A1 (en) 2012-12-12 2014-06-19 Bayer Cropscience Ag Use of 1-[2-fluoro-4-methyl-5-(2,2,2-trifluoroethylsulfinyl)phenyl]-5-amino-3-trifluoromethyl)-1 h-1,2,4 tfia zole for controlling nematodes in nematode-resistant crops
US20140173775A1 (en) 2012-12-13 2014-06-19 Pioneer Hi-Bred International, Inc. Methods and compositions for producing and selecting transgenic plants
AR093996A1 (en) 2012-12-18 2015-07-01 Bayer Cropscience Ag BACTERICIDAL COMBINATIONS AND BINARY FUNGICIDES
IN2015DN04206A (en) 2012-12-19 2015-10-16 Bayer Cropscience Ag
US20150351390A1 (en) 2012-12-21 2015-12-10 Pioneer Hi-Bred International, Inc. Compositions and methods for auxin-analog conjugation
US20140203176A1 (en) 2013-01-23 2014-07-24 Dow Agrosciences Llc Systems and methods for real-time sampling and analysis of biomolecules beneath the surface of biological tissue
US9426953B1 (en) 2013-02-21 2016-08-30 Agrigenetics, Inc. Canola hybrid cultivar CE216910H
US9538716B1 (en) 2013-02-21 2017-01-10 Agrigenetics, Inc. Canola inbred restorer line CE185942R
US9414557B1 (en) 2013-02-21 2016-08-16 Agrigenetics, Inc. Canola inbred restorer line CE185952R
US20160016944A1 (en) 2013-03-07 2016-01-21 Bayer Cropscience Aktiengesellschaft Fungicidal 3--heterocycle derivatives
WO2014153242A1 (en) 2013-03-14 2014-09-25 Pioneer Hi-Bred International, Inc. Compositions having dicamba decarboxylase activity and methods of use
CA2905595A1 (en) 2013-03-14 2014-09-25 Pioneer Hi-Bred International, Inc. Compositions having dicamba decarboxylase activity and methods of use
CN105121650A (en) 2013-04-02 2015-12-02 拜尔作物科学公司 Targeted genome engineering in eukaryotes
EP2984081B1 (en) 2013-04-12 2017-08-09 Bayer CropScience Aktiengesellschaft Novel triazole derivatives
WO2014167008A1 (en) 2013-04-12 2014-10-16 Bayer Cropscience Ag Novel triazolinthione derivatives
MX358633B (en) 2013-04-19 2018-08-28 Bayer Cropscience Ag Method for improved utilization of the production potential of transgenic plants involving the application of a phthaldiamide derivative.
BR112015025907A2 (en) 2013-04-19 2017-07-25 Bayer Cropscience Ag binary insecticide or pesticide mixture
TW201507722A (en) 2013-04-30 2015-03-01 Bayer Cropscience Ag N-(2-halogen-2-phenethyl)carboxamides as nematicides and endoparasiticides
WO2014177514A1 (en) 2013-04-30 2014-11-06 Bayer Cropscience Ag Nematicidal n-substituted phenethylcarboxamides
US9770022B2 (en) 2013-06-26 2017-09-26 Bayer Cropscience Ag N-cycloalkyl-N-[(bicyclylphenyl)methylene]-(thio)carboxamide derivatives
CA2917559A1 (en) 2013-07-09 2015-01-15 Bayer Cropscience Aktiengesellschaft Use of selected pyridone carboxamides or salts thereof as active substances against abiotic plant stress
EP2837287A1 (en) 2013-08-15 2015-02-18 Bayer CropScience AG Use of prothioconazole for increasing root growth of Brassicaceae
AU2014327258B2 (en) 2013-09-24 2020-05-07 Basf Se Hetero-transglycosylase and uses thereof
WO2015057600A1 (en) 2013-10-18 2015-04-23 E. I. Du Pont De Nemours And Company Glyphosate-n-acetyltransferase (glyat) sequences and methods of use
WO2015066636A2 (en) 2013-11-04 2015-05-07 Dow Agrosciences Llc Optimal maize loci
NZ746567A (en) 2013-11-04 2019-09-27 Dow Agrosciences Llc Optimal soybean loci
US10273493B2 (en) 2013-11-04 2019-04-30 Dow Agrosciences Llc Optimal maize loci
CN105980395A (en) 2013-11-04 2016-09-28 美国陶氏益农公司 Optimal soybean loci
US10070645B2 (en) 2013-12-05 2018-09-11 Bayer Cropscience Aktiengesellschaft N-cycloalkyl-N-{[2-(1-substitutedcycloalkyl)phenyl]methylene}-(thio)carboxamide derivatives
EP3077377B1 (en) 2013-12-05 2020-01-22 Bayer CropScience Aktiengesellschaft N-cycloalkyl-n-{[2-(1-substitutedcycloalkyl)phenyl]methylene}-(thio)carboxamide derivatives
US10683513B2 (en) 2013-12-31 2020-06-16 Dow Agrosciences Llc Tissue-specific expression and hybrid plant production
US9596816B1 (en) 2014-02-28 2017-03-21 Agrigenetics, Inc. Canola inbred restorer line CL215695R
US9554534B1 (en) 2014-02-28 2017-01-31 Agrigenetics, Inc. Canola inbred line CL1992625A
US9854763B1 (en) 2014-02-28 2018-01-02 Agrigenetics, Inc. Canola inbred line CL1992625B
US9844195B1 (en) 2014-02-28 2017-12-19 Agrigenetics, Inc. Canola hybrid cultivar CL2537387H
WO2015138394A2 (en) 2014-03-11 2015-09-17 Bayer Cropscience Lp Hppd variants and methods of use
US10053702B2 (en) 2014-04-22 2018-08-21 E I Du Pont De Nemours And Company Plastidic carbonic anhydrase genes for oil augmentation in seeds with increased DGAT expression
AR101214A1 (en) 2014-07-22 2016-11-30 Bayer Cropscience Ag CIANO-CICLOALQUILPENTA-2,4-DIENOS, CIANO-CICLOALQUILPENT-2-EN-4-INAS, CIANO-HETEROCICLILPENTA-2,4-DIENOS AND CYANO-HETEROCICLILPENT-2-EN-4-INAS REPLACED AS ACTIVE PRINCIPLES PLANTS ABIOTIC
US10165751B2 (en) 2014-12-05 2019-01-01 Agrigenetics, Inc. Canola inbred line G30853A
US9986702B1 (en) 2014-12-05 2018-06-05 Agrigenetics, Inc. Canola inbred restorer line G1934899R
AR103024A1 (en) 2014-12-18 2017-04-12 Bayer Cropscience Ag SELECTED PYRIDONCARBOXAMIDS OR ITS SALTS AS ACTIVE SUBSTANCES AGAINST ABIOTIC PLANTS STRESS
US9968047B2 (en) 2015-03-24 2018-05-15 Agrigenetics, Inc. Canola hybrid cultivar CL2562968H
US10314270B2 (en) 2015-04-01 2019-06-11 Agrigenetics, Inc. Canola hybrid cultivar G3697124H
CN107531676A (en) 2015-04-13 2018-01-02 拜耳作物科学股份公司 N cycloalkyl N (double heterocyclic radical ethylidene) (thio) carboxamide derivative
US10306852B2 (en) 2015-04-15 2019-06-04 Agrigenetics, Inc. Canola inbred line G1992650A
US9968051B2 (en) 2015-04-15 2018-05-15 Agrigenetics, Inc. Canola hybrid cultivar G2537376H
US9974262B2 (en) 2015-04-15 2018-05-22 Agrigenetics, Inc. Canola inbred restorer line CL134904R
US9968050B2 (en) 2015-04-15 2018-05-15 Agrigenetics, Inc. Canola inbred restorer line G175274R
CA3012607A1 (en) 2015-06-18 2016-12-22 The Broad Institute Inc. Crispr enzymes and systems
BR112018004779A8 (en) 2015-09-11 2022-08-09 Bayer Cropscience Lp VARIANTS OF HPPD AND METHODS OF USE
US10633703B2 (en) 2015-11-10 2020-04-28 Dow Agrosciences Llc Methods and systems for predicting the risk of transgene silencing
US20190159451A1 (en) 2016-07-29 2019-05-30 Bayer Cropscience Aktiengesellschaft Active compound combinations and methods to protect the propagation material of plants
US10588281B2 (en) 2016-09-07 2020-03-17 Agrigenetics, Inc. Canola hybrid cultivar G5428584H
US10426110B2 (en) 2016-09-07 2019-10-01 Agrigenetics, Inc. Canola inbred restorer line CL2503899R
US10244716B2 (en) 2016-09-07 2019-04-02 Agrigenetics, Inc. Canola hybrid cultivar CL3701975H
US10463004B2 (en) 2016-09-07 2019-11-05 Agrigenetics, Inc. Canola inbred line G1466454A/B
US10420296B2 (en) 2016-09-07 2019-09-24 Agrigenetics, Inc. Canola inbred restorer line G263532R
BR112019005660A2 (en) 2016-09-22 2019-06-04 Bayer Cropscience Ag new triazole derivatives and their use as fungicides
WO2018054832A1 (en) 2016-09-22 2018-03-29 Bayer Cropscience Aktiengesellschaft Novel triazole derivatives
US20190225974A1 (en) 2016-09-23 2019-07-25 BASF Agricultural Solutions Seed US LLC Targeted genome optimization in plants
MX2019004930A (en) 2016-10-26 2019-06-06 Bayer Cropscience Ag Use of pyraziflumid for controlling sclerotinia spp in seed treatment applications.
JP2020500905A (en) 2016-12-08 2020-01-16 バイエル・クロップサイエンス・アクチェンゲゼルシャフト Use of insecticides to control the harbor bugs
EP3332645A1 (en) 2016-12-12 2018-06-13 Bayer Cropscience AG Use of substituted pyrimidine diones or their salts as agents to combat abiotic plant stress
WO2018108627A1 (en) 2016-12-12 2018-06-21 Bayer Cropscience Aktiengesellschaft Use of substituted indolinylmethyl sulfonamides, or the salts thereof for increasing the stress tolerance of plants
WO2018165091A1 (en) 2017-03-07 2018-09-13 Bayer Cropscience Lp Hppd variants and methods of use
US11591601B2 (en) 2017-05-05 2023-02-28 The Broad Institute, Inc. Methods for identification and modification of lncRNA associated with target genotypes and phenotypes
WO2019025153A1 (en) 2017-07-31 2019-02-07 Bayer Cropscience Aktiengesellschaft Use of substituted n-sulfonyl-n'-aryl diaminoalkanes and n-sulfonyl-n'-heteroaryl diaminoalkanes or salts thereof for increasing the stress tolerance in plants
US11618896B2 (en) 2017-09-21 2023-04-04 The Broad Institute, Inc. Systems, methods, and compositions for targeted nucleic acid editing
WO2019083810A1 (en) 2017-10-24 2019-05-02 Basf Se Improvement of herbicide tolerance to 4-hydroxyphenylpyruvate dioxygenase (hppd) inhibitors by down-regulation of hppd expression in soybean
BR112020008096A2 (en) 2017-10-24 2020-11-03 Basf Se method for checking tolerance to a GM herbicide and soy plant
US10968257B2 (en) 2018-04-03 2021-04-06 The Broad Institute, Inc. Target recognition motifs and uses thereof
US20210323950A1 (en) 2018-06-04 2021-10-21 Bayer Aktiengesellschaft Herbicidally active bicyclic benzoylpyrazoles
EP3833747A1 (en) 2018-06-28 2021-06-16 Pioneer Hi-Bred International, Inc. Methods for selecting transformed plants
CA3107382A1 (en) 2018-07-26 2020-01-30 Bayer Aktiengesellschaft Use of the succinate dehydrogenase inhibitor fluopyram for controlling root rot complex and/or seedling disease complex caused by rhizoctonia solani, fusarium species and pythium species in brassicaceae species
BR112021004933A2 (en) 2018-09-17 2021-06-01 Bayer Aktiengesellschaft use of fluopiram succinate dehydrogenase inhibitor to control claviceps purpurea and reduce sclerotia in cereals
EP3852532A1 (en) 2018-09-17 2021-07-28 Bayer Aktiengesellschaft Use of the fungicide isoflucypram for controlling claviceps purpurea and reducing sclerotia in cereals
AU2019369415A1 (en) 2018-10-31 2021-03-25 Pioneer Hi-Bred International, Inc. Compositions and methods for Ochrobactrum-mediated plant transformation
WO2020131862A1 (en) 2018-12-17 2020-06-25 The Broad Institute, Inc. Crispr-associated transposase systems and methods of use thereof
WO2020160223A1 (en) 2019-01-30 2020-08-06 Monsanto Technology Llc Microencapsulated acetamide herbicides
WO2020185751A1 (en) 2019-03-11 2020-09-17 Pioneer Hi-Bred International, Inc. Methods for clonal plant production
CA3127173A1 (en) 2019-03-28 2020-10-01 Pioneer Hi-Bred International, Inc. Modified agrobacterium strains and use thereof for plant transformation

Family Cites Families (152)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US542901A (en) 1895-07-16 Cabinet
US4535060A (en) 1983-01-05 1985-08-13 Calgene, Inc. Inhibition resistant 5-enolpyruvyl-3-phosphoshikimate synthetase, production and use
US5331107A (en) 1984-03-06 1994-07-19 Mgi Pharma, Inc. Herbicide resistance in plants
US4761373A (en) 1984-03-06 1988-08-02 Molecular Genetics, Inc. Herbicide resistance in plants
US5304732A (en) 1984-03-06 1994-04-19 Mgi Pharma, Inc. Herbicide resistance in plants
US4945050A (en) 1984-11-13 1990-07-31 Cornell Research Foundation, Inc. Method for transporting substances into living cells and tissues and apparatus therefor
ATE93542T1 (en) 1984-12-28 1993-09-15 Plant Genetic Systems Nv RECOMBINANT DNA THAT CAN BE INTRODUCED INTO PLANT CELLS.
US4683202A (en) 1985-03-28 1987-07-28 Cetus Corporation Process for amplifying nucleic acid sequences
US5569597A (en) 1985-05-13 1996-10-29 Ciba Geigy Corp. Methods of inserting viral DNA into plant material
US4940835A (en) 1985-10-29 1990-07-10 Monsanto Company Glyphosate-resistant plants
NZ217113A (en) 1985-08-07 1988-06-30 Monsanto Co Production of eucaryotic plants which are glyphosate resistant, vectors (transformation and expression), chimeric gene and plant cells
ES2018274T5 (en) 1986-03-11 1996-12-16 Plant Genetic Systems Nv VEGETABLE CELLS RESISTANT TO GLUTAMINE SYNTHETASE INHIBITORS, PREPARED BY GENETIC ENGINEERING.
US5187073A (en) 1986-06-30 1993-02-16 The University Of Toledo Process for transforming gramineae and the products thereof
US5273894A (en) 1986-08-23 1993-12-28 Hoechst Aktiengesellschaft Phosphinothricin-resistance gene, and its use
US5605011A (en) * 1986-08-26 1997-02-25 E. I. Du Pont De Nemours And Company Nucleic acid fragment encoding herbicide resistant plant acetolactate synthase
US5013659A (en) 1987-07-27 1991-05-07 E. I. Du Pont De Nemours And Company Nucleic acid fragment encoding herbicide resistant plant acetolactate synthase
US5378824A (en) 1986-08-26 1995-01-03 E. I. Du Pont De Nemours And Company Nucleic acid fragment encoding herbicide resistant plant acetolactate synthase
WO1988002405A1 (en) 1986-10-01 1988-04-07 The Plant Cell Research Institute, Inc. Genetic transformation and controlled regeneration of cucumis sp. plants in vitro
US5268463A (en) 1986-11-11 1993-12-07 Jefferson Richard A Plant promoter α-glucuronidase gene construct
US5608142A (en) 1986-12-03 1997-03-04 Agracetus, Inc. Insecticidal cotton plants
US5312910A (en) 1987-05-26 1994-05-17 Monsanto Company Glyphosate-tolerant 5-enolpyruvyl-3-phosphoshikimate synthase
US5145783A (en) 1987-05-26 1992-09-08 Monsanto Company Glyphosate-tolerant 5-endolpyruvyl-3-phosphoshikimate synthase
US4971908A (en) 1987-05-26 1990-11-20 Monsanto Company Glyphosate-tolerant 5-enolpyruvyl-3-phosphoshikimate synthase
US5316931A (en) 1988-02-26 1994-05-31 Biosource Genetics Corp. Plant viral vectors having heterologous subgenomic promoters for systemic expression of foreign genes
US5990387A (en) 1988-06-10 1999-11-23 Pioneer Hi-Bred International, Inc. Stable transformation of plant cells
US6015891A (en) 1988-09-09 2000-01-18 Mycogen Plant Science, Inc. Synthetic insecticidal crystal protein gene having a modified frequency of codon usage
US5023179A (en) 1988-11-14 1991-06-11 Eric Lam Promoter enhancer element for gene expression in plant roots
US5110732A (en) 1989-03-14 1992-05-05 The Rockefeller University Selective gene expression in plants
US5240855A (en) 1989-05-12 1993-08-31 Pioneer Hi-Bred International, Inc. Particle gun
US5879918A (en) 1989-05-12 1999-03-09 Pioneer Hi-Bred International, Inc. Pretreatment of microprojectiles prior to using in a particle gun
US5310667A (en) 1989-07-17 1994-05-10 Monsanto Company Glyphosate-tolerant 5-enolpyruvyl-3-phosphoshikimate synthases
US5550318A (en) 1990-04-17 1996-08-27 Dekalb Genetics Corporation Methods and compositions for the production of stably transformed, fertile monocot plants and cells thereof
US5322783A (en) 1989-10-17 1994-06-21 Pioneer Hi-Bred International, Inc. Soybean transformation by microparticle bombardment
ATE126535T1 (en) 1990-04-05 1995-09-15 Roberto Crea ''WALK-THROUGH'' MUTAGENesis.
DE69133128T2 (en) 1990-04-12 2003-06-18 Syngenta Participations Ag Tissue-specific promoters
JP3387495B2 (en) 1990-04-26 2003-03-17 プラント・ジエネテイツク・システムズ・エヌ・ベー Novel Bacillus thuringiensis strains and their genes encoding pesticidal toxins
US5498830A (en) 1990-06-18 1996-03-12 Monsanto Company Decreased oil content in plant seeds
DK0536330T3 (en) 1990-06-25 2002-04-22 Monsanto Technology Llc Glyphosate tolerant plants
US5633435A (en) 1990-08-31 1997-05-27 Monsanto Company Glyphosate-tolerant 5-enolpyruvylshikimate-3-phosphate synthases
US5866775A (en) 1990-09-28 1999-02-02 Monsanto Company Glyphosate-tolerant 5-enolpyruvyl-3-phosphoshikimate synthases
US5932782A (en) 1990-11-14 1999-08-03 Pioneer Hi-Bred International, Inc. Plant transformation method using agrobacterium species adhered to microprojectiles
US5277905A (en) 1991-01-16 1994-01-11 Mycogen Corporation Coleopteran-active bacillus thuringiensis isolate
US5459252A (en) 1991-01-31 1995-10-17 North Carolina State University Root specific gene promoter
FR2673643B1 (en) 1991-03-05 1993-05-21 Rhone Poulenc Agrochimie TRANSIT PEPTIDE FOR THE INSERTION OF A FOREIGN GENE INTO A PLANT GENE AND PLANTS TRANSFORMED USING THIS PEPTIDE.
FR2673642B1 (en) 1991-03-05 1994-08-12 Rhone Poulenc Agrochimie CHIMERIC GENE COMPRISING A PROMOTER CAPABLE OF GIVING INCREASED TOLERANCE TO GLYPHOSATE.
USRE36449E (en) 1991-03-05 1999-12-14 Rhone-Poulenc Agro Chimeric gene for the transformation of plants
US5399680A (en) 1991-05-22 1995-03-21 The Salk Institute For Biological Studies Rice chitinase promoter
US5731180A (en) 1991-07-31 1998-03-24 American Cyanamid Company Imidazolinone resistant AHAS mutants
WO1993003154A1 (en) 1991-08-02 1993-02-18 Mycogen Corporation Novel microorganism and insecticide
WO1993004177A1 (en) 1991-08-27 1993-03-04 Agricultural Genetics Company Limited Proteins with insecticidal properties against homopteran insects and their use in plant protection
EP0612208B1 (en) 1991-10-04 2004-09-15 North Carolina State University Pathogen-resistant transgenic plants
AU675923B2 (en) 1991-12-04 1997-02-27 E.I. Du Pont De Nemours And Company Fatty acid desaturase genes from plants
US5324646A (en) 1992-01-06 1994-06-28 Pioneer Hi-Bred International, Inc. Methods of regeneration of Medicago sativa and expressing foreign DNA in same
US5341001A (en) 1992-02-13 1994-08-23 Matsushita Electric Industrial Co., Ltd. Sulfide-selenide manganese-zinc mixed crystal photo semiconductor and laser diode
US5591616A (en) 1992-07-07 1997-01-07 Japan Tobacco, Inc. Method for transforming monocotyledons
US5401836A (en) 1992-07-16 1995-03-28 Pioneer Hi-Bre International, Inc. Brassica regulatory sequence for root-specific or root-abundant gene expression
AU670316B2 (en) 1992-07-27 1996-07-11 Pioneer Hi-Bred International, Inc. An improved method of (agrobacterium)-mediated transformation of cultured soybean cells
DE69333025T2 (en) 1992-11-17 2004-04-29 E.I. Du Pont De Nemours And Co., Wilmington GENES ENCODING FOR MICROSOME DELTA 12 FATTY ACID DESATURASE AND RELATED ENZYMS OF PLANTS
CA2127807A1 (en) 1992-11-20 1994-06-09 John Maliyakal Transgenic cotton plants producing heterologous bioplastic
IL108241A (en) 1992-12-30 2000-08-13 Biosource Genetics Corp Plant expression system comprising a defective tobamovirus replicon integrated into the plant chromosome and a helper virus
WO1994016078A2 (en) 1993-01-13 1994-07-21 Pioneer Hi-Bred International, Inc. High lysine derivatives of alpha-hordothionin
US5583210A (en) 1993-03-18 1996-12-10 Pioneer Hi-Bred International, Inc. Methods and compositions for controlling plant development
US5789156A (en) 1993-06-14 1998-08-04 Basf Ag Tetracycline-regulated transcriptional inhibitors
US5814618A (en) 1993-06-14 1998-09-29 Basf Aktiengesellschaft Methods for regulating gene expression
US5426039A (en) 1993-09-08 1995-06-20 Bio-Rad Laboratories, Inc. Direct molecular cloning of primer extended DNA containing an alkane diol
US5470353A (en) 1993-10-20 1995-11-28 Hollister Incorporated Post-operative thermal blanket
JPH07177130A (en) 1993-12-21 1995-07-14 Fujitsu Ltd Error count circuit
US6117679A (en) 1994-02-17 2000-09-12 Maxygen, Inc. Methods for generating polynucleotides having desired characteristics by iterative selection and recombination
US5834252A (en) 1995-04-18 1998-11-10 Glaxo Group Limited End-complementary polymerase reaction
US5837458A (en) 1994-02-17 1998-11-17 Maxygen, Inc. Methods and compositions for cellular and metabolic engineering
US5605793A (en) 1994-02-17 1997-02-25 Affymax Technologies N.V. Methods for in vitro recombination
US6335160B1 (en) 1995-02-17 2002-01-01 Maxygen, Inc. Methods and compositions for polypeptide engineering
US5593881A (en) 1994-05-06 1997-01-14 Mycogen Corporation Bacillus thuringiensis delta-endotoxin
US5767373A (en) 1994-06-16 1998-06-16 Novartis Finance Corporation Manipulation of protoporphyrinogen oxidase enzyme activity in eukaryotic organisms
US5736369A (en) 1994-07-29 1998-04-07 Pioneer Hi-Bred International, Inc. Method for producing transgenic cereal plants
US5608144A (en) 1994-08-12 1997-03-04 Dna Plant Technology Corp. Plant group 2 promoters and uses thereof
US5792931A (en) 1994-08-12 1998-08-11 Pioneer Hi-Bred International, Inc. Fumonisin detoxification compositions and methods
US5750868A (en) 1994-12-08 1998-05-12 Pioneer Hi-Bred International, Inc. Reversible nuclear genetic system for male sterility in transgenic plants
US5659026A (en) 1995-03-24 1997-08-19 Pioneer Hi-Bred International ALS3 promoter
US5853973A (en) 1995-04-20 1998-12-29 American Cyanamid Company Structure based designed herbicide resistant products
EP0821729B1 (en) 1995-04-20 2006-10-18 Basf Aktiengesellschaft Structure-based designed herbicide resistant products
KR19990008000A (en) 1995-04-24 1999-01-25 로버트 에스. 화이트 헤드 How to create a new metabolic pathway and screen it
AR004938A1 (en) 1995-06-02 1999-04-07 Pioneer Hi Bred Internacional Inc PROTEIN DERIVED FROM ALPHA-HORDOTIONINE WITH HIGH METIONIN CONTENT, NUCLEOTIDE, RNA AND DNA SEQUENCES, EXPRESSION CASSETE, BACTERIAL TRANSFORMATION VECTOR, AND TRANSFORMED VEGETABLE METHODS FROM YIDROID, CELLULA, AND CROPS OF TEID
FR2734842B1 (en) 1995-06-02 1998-02-27 Rhone Poulenc Agrochimie DNA SEQUENCE OF A HYDROXY-PHENYL PYRUVATE DIOXYGENASE GENE AND OBTAINING PLANTS CONTAINING A HYDROXY-PHENYL PYRUVATE DIOXYGENASE GENE, TOLERANT TO CERTAIN HERBICIDES
PL323641A1 (en) 1995-06-02 1998-04-14 Pioneer Hi Bred Int Derivatives of alpha-hordothionine of high threonine content
US5958672A (en) 1995-07-18 1999-09-28 Diversa Corporation Protein activity screening of clones having DNA from uncultivated microorganisms
US6057103A (en) 1995-07-18 2000-05-02 Diversa Corporation Screening for novel bioactivities
FR2736926B1 (en) 1995-07-19 1997-08-22 Rhone Poulenc Agrochimie 5-ENOL PYRUVYLSHIKIMATE-3-PHOSPHATE SYNTHASE MUTEE, CODING GENE FOR THIS PROTEIN AND PROCESSED PLANTS CONTAINING THIS GENE
US5837876A (en) 1995-07-28 1998-11-17 North Carolina State University Root cortex specific gene promoter
US5633636A (en) 1995-10-02 1997-05-27 Analog Devices, Inc. Half-gray digital encoding method and circuitry
US5756316A (en) 1995-11-02 1998-05-26 Genencor International, Inc. Molecular cloning by multimerization of plasmids
US5981722A (en) 1995-11-20 1999-11-09 Board Of Regents For The University Of Oklahoma Trypsin inhibitors with insecticidal properties obtained from PENTACLETHRA MACROLOBA
US5737514A (en) 1995-11-29 1998-04-07 Texas Micro, Inc. Remote checkpoint memory system and protocol for fault-tolerant computer system
US5965408A (en) 1996-07-09 1999-10-12 Diversa Corporation Method of DNA reassembly by interrupting synthesis
US6171820B1 (en) 1995-12-07 2001-01-09 Diversa Corporation Saturation mutagenesis in directed evolution
US5939250A (en) 1995-12-07 1999-08-17 Diversa Corporation Production of enzymes having desired activities by mutagenesis
US20030215798A1 (en) 1997-06-16 2003-11-20 Diversa Corporation High throughput fluorescence-based screening for novel enzymes
US6096548A (en) 1996-03-25 2000-08-01 Maxygen, Inc. Method for directing evolution of a virus
US5783431A (en) 1996-04-24 1998-07-21 Chromaxome Corporation Methods for generating and screening novel metabolic pathways
US6072050A (en) 1996-06-11 2000-06-06 Pioneer Hi-Bred International, Inc. Synthetic promoters
EP0964922A4 (en) 1996-09-27 2000-10-25 Maxygen Inc Methods for optimization of gene therapy by recursive sequence shuffling and selection
JP3441899B2 (en) 1996-11-01 2003-09-02 理化学研究所 How to make a full-length cDNA library
US6232529B1 (en) 1996-11-20 2001-05-15 Pioneer Hi-Bred International, Inc. Methods of producing high-oil seed by modification of starch levels
DE19652284A1 (en) 1996-12-16 1998-06-18 Hoechst Schering Agrevo Gmbh Novel genes encoding amino acid deacetylases with specificity for N-acetyl-L-phosphinothricin, their isolation and use
AU743305C (en) 1997-01-17 2006-03-30 Maxygen, Inc. Evolution of whole cells and organisms by recursive sequence recombination
US6326204B1 (en) 1997-01-17 2001-12-04 Maxygen, Inc. Evolution of whole cells and organisms by recursive sequence recombination
US5981840A (en) 1997-01-24 1999-11-09 Pioneer Hi-Bred International, Inc. Methods for agrobacterium-mediated transformation
NZ337900A (en) 1997-03-18 2001-05-25 Novo Nordisk As DNA shuffling for randomised heterologous and homogenous 3' and 5' primers for construction of DNA libraries for generation of more desirable mutants
AU6611498A (en) 1997-03-18 1998-10-12 Novo Nordisk A/S Method for constructing a library using dna shuffling
US5948653A (en) 1997-03-21 1999-09-07 Pati; Sushma Sequence alterations using homologous recombination
US6153410A (en) 1997-03-25 2000-11-28 California Institute Of Technology Recombination of polynucleotide sequences using random or defined primers
AU1124499A (en) 1997-10-28 1999-05-17 Maxygen, Inc. Human papillomavirus vectors
EP1030861A4 (en) 1997-10-31 2001-09-05 Maxygen Inc Modification of virus tropism and host range by viral genome shuffling
US6245968B1 (en) 1997-11-07 2001-06-12 Aventis Cropscience S.A. Mutated hydroxyphenylpyruvate dioxygenase, DNA sequence and isolation of plants which contain such a gene and which are tolerant to herbicides
US6069115A (en) 1997-11-12 2000-05-30 Rhone-Poulenc Agrochimie Method of controlling weeds in transgenic crops
DE69831265T2 (en) 1997-11-18 2006-06-08 Pioneer Hi-Bred International, Inc. COMPOSITIONS AND METHODS FOR THE GENETIC MODIFICATION OF PLANTS
WO1999025855A1 (en) 1997-11-18 1999-05-27 Pioneer Hi-Bred International, Inc. Mobilization of viral genomes from t-dna using site-specific recombination systems
EP1032692A1 (en) 1997-11-18 2000-09-06 Pioneer Hi-Bred International, Inc. Targeted manipulation of herbicide-resistance genes in plants
DK1032680T3 (en) 1997-11-18 2006-05-22 Pioneer Hi Bred Int A novel method for integrating foreign DNA into eukaryotic genomes
CA2313380C (en) 1997-12-08 2008-12-30 California Institute Of Technology Method for creating polynucleotide and polypeptide sequences
WO1999041402A2 (en) 1998-02-11 1999-08-19 Maxygen, Inc. Targeting of genetic vaccine vectors
WO1999041383A1 (en) 1998-02-11 1999-08-19 Maxygen, Inc. Antigen library immunization
ATE278782T1 (en) 1998-02-26 2004-10-15 Pioneer Hi Bred Int CONSTITUTIVE CORN PROMOTORS
US5955310A (en) 1998-02-26 1999-09-21 Novo Nordisk Biotech, Inc. Methods for producing a polypeptide in a bacillus cell
CA2329056A1 (en) 1998-05-22 1999-12-02 Pioneer Hi-Bred International, Inc. Cell cycle genes, proteins and uses thereof
ATE547532T1 (en) 1998-06-29 2012-03-15 Bristol Myers Squibb Co METHOD FOR CREATING HIGHLY DIVERSE LIBRARIES
IL140442A0 (en) 1998-08-12 2002-02-10 Maxygen Inc Dna shuffling to produce herbicide selective crops
FR2782323B1 (en) 1998-08-12 2002-01-11 Proteus PROCESS FOR THE IN VITRO PRODUCTION OF RECOMBINANT POLYNUCLEOTIDE SEQUENCES, SEQUENCE BANKS AND SEQUENCES THUS OBTAINED
ES2252964T3 (en) 1998-08-20 2006-05-16 Pioneer Hi-Bred International, Inc. PREFERRED SEED PROMOTERS.
US5981849A (en) 1998-08-27 1999-11-09 Monsanto Corporation Soybean cultivar 95-060911
WO2000012733A1 (en) 1998-08-28 2000-03-09 Pioneer Hi-Bred International, Inc. Seed-preferred promoters from end genes
US6518487B1 (en) 1998-09-23 2003-02-11 Pioneer Hi-Bred International, Inc. Cyclin D polynucleotides, polypeptides and uses thereof
JP2002537758A (en) 1998-09-29 2002-11-12 マキシジェン, インコーポレイテッド Shuffling of codon-modified genes
IL142736A0 (en) 1998-11-09 2002-03-10 Pioneer Hi Bred Int Transcriptional activator lec 1 nucleic acids, polypeptides and their uses
CA2351550C (en) 1998-11-17 2013-04-23 Monsanto Company Phosphonate metabolizing plants
US7217858B2 (en) 1998-12-21 2007-05-15 E. I. Du Pont De Nemours And Company S-adenosyl-L-methionine synthetase promoter and its use in expression of transgenic genes in plants
AU2415200A (en) 1999-01-18 2000-08-01 Maxygen, Inc. Methods of populating data structures for use in evolutionary simulations
US6376246B1 (en) 1999-02-05 2002-04-23 Maxygen, Inc. Oligonucleotide mediated nucleic acid recombination
US6436675B1 (en) 1999-09-28 2002-08-20 Maxygen, Inc. Use of codon-varied oligonucleotide synthesis for synthetic shuffling
IL137868A0 (en) 1999-01-19 2001-10-31 Maxygen Inc Oligonucleotide mediated nucleic acid recombination
CZ20013856A3 (en) 1999-04-29 2002-04-17 Syngenta Ltd. Plants resistant to herbicides
MXPA01010921A (en) * 1999-04-29 2003-06-24 Syngenta Ltd Herbicide resistant plants.
WO2000066746A1 (en) 1999-04-29 2000-11-09 Syngenta Limited Herbicide resistant plants
JP2003507019A (en) 1999-08-13 2003-02-25 シンジェンタ パーティシペーションズ アクチェンゲゼルシャフト Herbicide-tolerant protoporphyrinogen oxidase
AU2001241939A1 (en) 2000-02-28 2001-09-12 Maxygen, Inc. Single-stranded nucleic acid template-mediated recombination and nucleic acid fragment isolation
WO2001066704A2 (en) 2000-03-09 2001-09-13 Monsanto Technology Llc Methods for making plants tolerant to glyphosate and compositions thereof
RS32703A (en) * 2000-10-30 2006-12-15 Verdia Inc. Novel glyphosate n-acetyltransferase (gat) genes
AR039501A1 (en) * 2002-04-30 2005-02-23 Verdia Inc N-ACETIL TRANSFERASE GLYPHOSATE GENES (GAT)
FR2948775B1 (en) 2009-07-31 2011-12-02 Horiba Jobin Yvon Sas PLANAR OPTICAL POLYCHROMATIC IMAGING SYSTEM WITH BROAD FIELD OF VISION
US10470602B2 (en) 2011-06-13 2019-11-12 National Presto Industries, Inc. Pump coffee brewer
WO2015170355A1 (en) 2014-05-05 2015-11-12 Filippo Bastianini Apparatus for interrogating distributed optical fibre sensors using a stimulated brillouin scattering optical frequency-domain interferometer
CN105578337A (en) 2014-10-07 2016-05-11 鸿富锦精密工业(深圳)有限公司 Resonant sound equipment

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO2005012515A2 *

Also Published As

Publication number Publication date
CN1863914B (en) 2011-03-09
BRPI0409816B8 (en) 2022-12-06
BRPI0409816A (en) 2006-05-02
AU2004260931B2 (en) 2009-11-19
WO2005012515A2 (en) 2005-02-10
WO2005012515A3 (en) 2005-11-17
CA2521284C (en) 2014-07-08
EP2322629A3 (en) 2011-11-02
AU2004260931B9 (en) 2012-01-19
HRP20050930A2 (en) 2006-03-31
EP2535414A1 (en) 2012-12-19
EP2535414A8 (en) 2013-04-03
JP2007500514A (en) 2007-01-18
CA2662092C (en) 2012-07-17
WO2005012515A8 (en) 2005-05-12
CA2521284A1 (en) 2005-02-10
SG155063A1 (en) 2009-09-30
EP2322629A2 (en) 2011-05-18
CA2662092A1 (en) 2005-02-10
ZA200509602B (en) 2007-04-25
EP2535414B1 (en) 2017-12-13
CN1863914A (en) 2006-11-15
AU2004260931A1 (en) 2005-02-10
BRPI0409816B1 (en) 2019-12-17
MXPA05011585A (en) 2006-05-25

Similar Documents

Publication Publication Date Title
US7666644B2 (en) Glyphosate-N-acetyltransferase (GAT) genes
AU2004260931B2 (en) Novel glyphosate-N-acetyltransferase (GAT) genes
US7462481B2 (en) Glyphosate N-acetyltransferase (GAT) genes
WO2003092360A2 (en) Novel glyphosate-n-acetyltransferase (gat) genes
AU2009201716B2 (en) Novel glyphosate-N-acetyltransferase (GAT) genes

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20051123

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LI LU MC NL PL PT RO SE SI SK TR

AX Request for extension of the european patent

Extension state: AL HR LT LV MK

DAX Request for extension of the european patent (deleted)
RIN1 Information on inventor provided before grant (corrected)

Inventor name: KEMBLE, ROGER

Inventor name: MCCUTCHEN, BILLY, FRED

Inventor name: DUCK, NICHOLAS, B.

Inventor name: GORTON, REBECCA

Inventor name: PATTEN, PHILLIP, A.

Inventor name: CHEN, YONG, HONG

Inventor name: IVY, CHRISTINA

Inventor name: MINSHULL, JEREMY

Inventor name: GIVER, LORRAINE, J.

Inventor name: SIEHL, DAN

Inventor name: CASTLE, LINDA, A.

17Q First examination report despatched

Effective date: 20100805

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 20110216