AU2007224390B2 - Novel glyphosate N-acetyltransferase (GAT) genes - Google Patents

Novel glyphosate N-acetyltransferase (GAT) genes Download PDF

Info

Publication number
AU2007224390B2
AU2007224390B2 AU2007224390A AU2007224390A AU2007224390B2 AU 2007224390 B2 AU2007224390 B2 AU 2007224390B2 AU 2007224390 A AU2007224390 A AU 2007224390A AU 2007224390 A AU2007224390 A AU 2007224390A AU 2007224390 B2 AU2007224390 B2 AU 2007224390B2
Authority
AU
Australia
Prior art keywords
plant
gat
glyphosate
seq
amino acid
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
AU2007224390A
Other versions
AU2007224390A1 (en
Inventor
Linda A. Castle
Yong Hong Chen
Nicholas B. Duck
Lorraine J. Giver
Chritina Ivy
Jeremy Minshull
Dan Siehl
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Pioneer Hi Bred International Inc
Verdia LLC
EIDP Inc
Original Assignee
Pioneer Hi Bred International Inc
EI Du Pont de Nemours and Co
Verdia LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from AU2002220181A external-priority patent/AU2002220181B2/en
Application filed by Pioneer Hi Bred International Inc, EI Du Pont de Nemours and Co, Verdia LLC filed Critical Pioneer Hi Bred International Inc
Priority to AU2007224390A priority Critical patent/AU2007224390B2/en
Publication of AU2007224390A1 publication Critical patent/AU2007224390A1/en
Priority to AU2009201716A priority patent/AU2009201716B2/en
Application granted granted Critical
Publication of AU2007224390B2 publication Critical patent/AU2007224390B2/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Description

-1 AUSTRALIA Patents Act 1990 SPECIFICATION Name of Applicants: Pioneer Hi-Bred Intemational, Inc. and E.l. Du Pont Nemours and Company and Verdia, Inc Actual Inventors: Linda A. Castle, Dan Siehl, Lorraine J. Giver, Jeremy Minshull, Christina Ivy, Yong Hong Chen, Nicholas B. Duck Address for Service: Baldwins Intellectual Property 16 Chisholm Street North Ryde Sydney Invention Title: Novel glyphosate N-acetyltransferase (GAT) genes The following statement is a full description of this invention, including the best method of performing it known to us:- NOVEL GLYPHOSATE N-ACETYLTRANSFERASE (GAT) GENES CROSS-REFERENCE TO RELATED APPLICATIONS This application claims priority to and benefit of U. S. Provisional Patent Application Serial No. 60/244,385 filed October 30,2000, the disclosure of which is incorporated herein by reference in its entirety for all purposes. For clarity and completeness, the description of the invention of AU 2002220181, from which the present application was divided, is retained herein. COPYRIGHT NOTIFICATION PURSUANT TO 37 C.F.R. § 1.71 (E) A portion of the disclosure of this patent document contains material which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure, as it appears in the Patent and Trademark Office patent file or records, but otherwise reserves all copyright rights whatsoever. BACKGROUND OF THE INVENTION Any discussion of the prior art throughout the specification should in no way be considered as an admission that such prior art is widely known or forms part of common general knowledge in the field. Crop selectivity to specific herbicides can be conferred by engineering genes into crops which encode appropriate herbicide metabolizing enzymes. In some cases these enzymes, and the nucleic acids that encode them, originate in a plant. In other cases, they are derived from other organisms, such as microbes. See, e. g., Padgette et al. (1996) "New weed control opportunities: Development of soybeans with a Round UP ReadyTm gene" in Herbicide-Resistant Crops (Duke, ed.), pp54-8 4 , CRC Press, Boca Raton; and Vasil (1996) "Phosphinothricin-resistant crops" in Herbicide-Resistant Crops (Duke, ed.), pp85-91. Indeed, transgenic plants have been engineered to express a variety of herbicide tolerance/metabolizing genes, from a variety of organisms. For example, acetohydroxy acid synthase, which has been found to make plants that express this enzyme resistant to multiple types of herbicides, has been introduced into a variety of plants (see, e.g., Hattori et al. (1995) Mol Gen Genet 246:419. Other genes that confer tolerance to herbicides include: a gene encoding a chimeric protein of rat cytochrome P4507A1 and yeast NADPH cytochrome P450 oxidoreductase (Shiota et al. (1994) Plant Physiol 106:17), genes for glutathione reductase and superoxide dismutase (Aono et al. (1995) Plant Cell Physiol 36:1687, and genes for various phosphotransferases (Datta et al. (1992) Plant Mol Biol 20:619. - la /P----------------11--------- _ ___.
WO 02/36782 PCT/USO1/46227 One herbicide which is the subject of much investigation in this regard is N-phosphonomethylglycine, commonly referred to as glyphosate. Glyphosate is the top selling herbicide in the world, with sales projected to reach $5 billion by 2003. It is a broad spectrum herbicide that kills both broadleaf and grass-type plants. A successful 5 mode of commercial level glyphosate resistance in transgenic plants is by introduction of a modified Agrobacterium CP4 5-enolpyruvylshikimate-3-phosphate synthase (hereinafter referred to as EPSP synthase or EPSPS) gene. The transgene is targeted to the chloroplast where it is capable of continuing to synthesize EPSP from phosphoenolpyruvic acid (PEP) and shikimate-3-phosphate in the presence of glyphosate. In contrast, the native 10 EPSP synthase is inhibited by glyphosate. Without the transgene, plants sprayed with glyphosate quickly die due to inhibition of EPSP synthase which halts the downstream pathway needed for aromatic amino acid, hormone, and vitamin biosynthesis. The CP4 glyphosate-resistant soybean transgenic plants are marketed, e.g., by Monsanto under the name "Round UP Readym." 15 In the environment, the predominant mechanism by which glyphosate is degraded is through soil microflora metabolism. The primary metabolite of glyphosate in soil has been identified as aminomethylphosphonic acid (AMPA), which is ultimately converted into ammonia, phosphate and carbon dioxide. The proposed metabolic scheme that describes the degradation of glyphosate in soil through the AMPA pathway is shown 20 in Fig. 8. An alternative metabolic pathway for the breakdown of glyphosate by certain soil bacteria, the sarcosine pathway, occurs via initial cleavage of the C-P bond to give inorganic phosphate and sarcosine, as depicted in Fig. 9. Another successful herbicide/transgenic crop package is glufosinate (phosphinothricin) and the LibertyLink trait marketed, e.g., by Aventis. Glufosinate is 25 also a broad spectrum herbicide. Its target is the glutamate synthase enzyme of the chloroplast. Resistant plants carry the bar gene from Streptomyces hygroscopicus and achieve resistance by the N-acetylation activity of bar, which modifies and detoxifies glufosinate. An enzyme capable of acetylating the primary amine of AMPA is reported 30 in PCT Application No. WOOO/29596. The enzyme was not described as being able to acetylate a compound with a secondary amine (e.g., glyphosate). While a variety of herbicide resistance strategies are available as noted above, aditional approaches would have considerable commercial value. The present -2- -3 invention provides, e.g., novel polynucleotides and polypeptides for conferring herbicide tolerance, as well as numerous other benefits as will become apparent during review of the disclosure. SUMMARY OF THE INVENTION It is an object of the present invention to provide methods and reagents for rendering an organism, such as a plant, resistant to glyphosate, or to at least provide a useful alternative to known methods and reagents for rendering an organism, such as a plant, resistant to glyphosate. This and other objects of the invention are provided by one or more of the embodiments described below. Unless the context clearly requires otherwise, throughout the description and the claims, the words 'comprise', 'comprising' and the like are to be construed in an inclusive sense as opposed to an exclusive or exhaustive sense; that is to say in the sense of "including but not limited to". One embodiment of the invention provides novel polypeptides referred to herein as GAT polypeptides. GAT polypeptides are characterized by their structural similarity to one another, e.g., in terms of sequence similarity when the GAT polypeptides are aligned with one another. Some GAT polypeptides possess glyphosate N-acetyl transferase activity, i.e., the ability to catalyze the acetylation of glyphosate. Some GAT polypeptides are also capable of catalyzing the acetylation of glyphosate analogs and or glyphosate metabolites, e.g., aminomethylphosphonic acid. Also provided are novel polynucleotides referred to herein as GAT polynucleotides. GAT polynucleotides are characterized by their ability to encode GAT polypeptides. In some embodiments of the invention, a GAT polynucleotide is engineered for better plant expression by replacing one or more parental codons with a synonymous codon that is preferentially used in plants relative to the parental codon. In other -4 embodiments, a GAT polynucleotide is modified by the introduction of a nucleotide sequence encoding an N-terminal chloroplast transit peptide. GAT polypeptides, GAT polynucleotides and glyphosate N-acetyl transferase activity are described in more detail below. The invention further includes certain fragments of the GAT polypeptides and GAT polynucleotides described herein. The invention includes non-native variants of the polypeptides and polynucleotides described herein, wherein one or more amino acids of the encoded polypeptide have been mutated. More specifically, the present invention provides a transgenic plant cell, a transgenic plant, a transgenic seed, or a transgenic explant comprising said transgenic plant cell comprising a heterologous polypeptide with glyphosate N-acetyltransferase activity, wherein said plant cell produces N-acetylglyphosate when treated with glyphosate. The plant cell, plant, plant explant or seed of the present invention exhibits enhanced resistance to glyphosate as compared to a wild type plant cell, plant, or plant extract of the same species, strain or cultivar. In some embodiments of the invention, the transgenic plant cell, transgenic plant, transgenic seed or transgenic plant explant further comprises (a) at least one polypeptide imparting glyphosate tolerance by an additional mechanism; and/or (b) at least one polypeptide imparting tolerance to an additional herbicide. In yet another embodiment, the at least one polypeptide imparting glyphosate tolerance by an additional mechanism is a glyphosate-tolerant 5-enolpyruvylshikimate-3 phosphate synthase or glyphosate-tolerant glyphosate oxido-reductase; and/or the at least one polypeptide imparting tolerance to an additional herbicide is a mutated hydroxyphenylpyruvatedioxygenase, a sulfonamide-tolerant acetolactate synthase, a -5 sulfonamide-tolerant acetohydroxy acid synthase, an imidazolinone-tolerant acetolactate synthase, an imidazolinone-tolerant acetohydroxy acid synthase, a phosphinothricin acetyl transferase or a mutated protoporphyrinogen oxidase. The transgenic plant cell, transgenic plant, transgenic seed or transgenic plant explant is a crop plant selected from among the genera: Eleusine, Lollium, Bambusa, Brassica, Dactylis, Sorghum, Pennisetum, Zea, Oryza, Triticum, Secale, Avena, Hordeum, Saccharum, Coix, Glycine and Gossypium. The invention also provides a method to produce a polypeptide that has glyphosate N acetyltransferase activity which method comprises culturing the transgenic plant cell, the transgenic plant, the transgenic seed or the transgenic plant explant of the invention. Also provided is a method of producing a glyphosate resistant transgenic plant comprising: (a) transforming a plant or plant cell with a polynucleotide encoding a glyphosate N-acetyltransferase; and (b) optionally regenerating a transgenic plant from the transformed plant cell. In one embodiment, the method further comprises growing the transformed plant or plant cell in a concentration of glyphosate that inhibits the growth of a wild-type plant of the same species, which concentration does not inhibit the growth of the transformed plant, wherein said growing is: in increasing concentrations of glyphosate; and/or in a concentration of glyphosate that is lethal to a wild-type plant or plant cell of the same species. The method may further comprise propagating said transgenic plant by crossing said transgenic plant with a second plant, such that at least some progeny of the cross display glyphosate tolerance.
-6 The present invention also provides a method for selectively controlling weeds in a field containing a crop comprising: (a) planting the field with crop seeds or plants which are glyphosate tolerant as a result of being transformed with a polynucleotide encoding a glyphosate N acetyltransferase; and (b) applying to the crop and weeds in the field a sufficient amount of glyphosate to control the weeds without significantly affecting the crop. In yet another embodiment of the invention, there is provided a method for selectively controlling weeds in a field containing a crop comprising: (a) planting the field with crop, seeds or plants which are glyphosate tolerant as a result of being transformed with a polynucleotide encoding glyphosate N acetyltransferase and further comprising (i) at least one polypeptide imparting glyphosate tolerance by an additional mechanism; and/or (ii) at least one polypeptide imparting tolerance to an additional herbicide; and, (b) applying to the crop and weeds in the field a sufficient amount of glyphosate to inhibit growth of with weeds in the field without significantly affecting the crop; and (c) optionally, applying to the crop and weeds in the field a simultaneous or chronologically staggered application of an additional herbicide. When the additional herbicide is applied and the additional herbicide is selected from the group consisting of a hydroxyphenylpyruvatedioxygenase inhibitor, sulfonamide, imidazolinone, bialaphos, phosphinothricin, azafenidin, butafenacil, sulfosate, glufosinate, and a protox inhibitor. The crop plant for use in step (a) of these methods for selectively controlling weeds in a field containing a crop is a transgenic crop plant or transgenic seed of a crop plant selected -7 from among the genera: Eleusine, Lollium, Bambusa, Brassica, Dactylis, Sorghum, Pennisetum, Zea, Oryza, Triticum, Secale, Avena, Hordeum, Saccharum, Coix, Glycine and Gossypium. The invention also provides a plant cell comprising a metabolic product of glyphosate which is N-acetylglyphosate. Also provided is a method for detecting the presence of a GAT polypeptide or evaluating the activity of a GAT polypeptide in plant tissue comprising treating a plant with glyphosate and assaying plant tissue from said plant for the presence of N-acetylglyphosate. The present invention also provides a method for detecting GAT polypeptides comprising analyzing plant tissues using an immunoassay comprising GAT-specific antibody or antibodies. In yet another embodiment, the invention provides a method for detecting the presence of a polynucleotide that encodes a GAT polypeptide comprising assaying plant tissue using PCR amplification. (followed by page 8) -8 BRIEF DESCRIPTION OF THE FIGURES Figure 1 depicts the N-acetylation of glyphosate catalyzed by a glyphosate N acetyltransferase ("GAT"). (followed by page 9) WO 02136782 PCT/USO1/46227 Figure 2 illustrates mass spectroscopic detection of N-acetylglyphosate produced by an exemplary Bacillus culture expressing a native GAT activity. Figure 3 is a table illustrating the relative identity between GAT sequences isolated from different strains of bacteria and yitI from Bacillus subtilis. 5 Figure 4 is a map of the plasmid pMAXY2120 for expression and purification of the GAT enzyme from E. coli cultures. Figure 5 is a mass spectrometry output showing increased N acetylglyphosate production over time in a typical GAT enzyme -reaction mix. Figure 6 is a plot of the kinetic data of a GAT enzyme from which a Km of 10 2.9 mM for glyphosate was calculated. Figure 7 is a plot of the kinetic data taken from the data of Figure 6 from which a Km of 2 pM was calculated for Acetyl CoA. Figure 8 is a scheme that describes the degradation of glyphosate in soil through the AMPA pathway. 15 Figure 9 is a scheme that describes the sarcosine pathway of glyphosate degradation. Figure 10 is the BLOSUM62 matrix. Figure 11 is a map of the plasmid pMAXY2190. Figure 12 depicts a T-DNA construct with gat selectable marker. 20 Figure 13 depicts a yeast expression vector with gat selectable marker. DETAILED DISCUSSION The present invention relates to a novel class of enzymes exhibiting N acetyltransferase activity. In one aspect, the invention relates to a novel class of enzymes capable of acetylating glyphosate and glyphosate analogs, e.g., enzymes possessing 25 glyphosate N-acetyltransferase ("GAT") activity. Such enzymes are characterized by the ability to acetylate the secondary amine of a compound. In some aspects of the invention, the compound is a herbicide, e.g., glyphosate, as illustrated schematically in Figure 1. The compound can also be a glyphosate analog or a metabolic product of glyphosate degradation, e.g, aminomethylphosphonic acid. Although the acetylation of glyphosate is 30 a key catalytic step in one metabolic pathway for catabolism of glyphosate, the enzymatic acetylation of glyphosate by naturally-occutning, isolated, or recombinant enzymes has not been previously described. Thus, the nucleic acids and polypeptides of the invention provide a new biochemical pathway for engineering herbicide resistance. -9- WO 02/36782 PCT/USO1/46227 In one aspect, the invention provides novel genes encoding GAT polypeptides. Isolated and recombinant GAT polynucleotides corresponding to naturally occurring polynucleotides, as well as recombinant and engineered, e.g., diversified, GAT' polynucleotides are a feature of the invention. GAT polynucleotides are exemplified by 5 SEQ ID NOS: 1-5 and 11-262. Specific GAT polynucleotide and polypeptide sequences are provided as examples to help illustrate the invention, and are not intended to limit the scope of the genus of GAT polynucleotides and polypeptides described and/or claimed herein. The invention also provides methods for generating and selecting 10 diversified libraries to produce additional GAT polynucleotides, including polynucleotides encoding GAT polypeptides with improved and/or enhanced characteristics, e.g., altered Km for glyphosate, increased rate of catalysis, increased stability, etc., based upon selection of a polynucleotide constituent of the library for the new or improved activities described herein. Such polynucleotides are especially favorably employed in the 15 production of glyphosate resistant transgenic plants. The GAT polypeptides of the invention exhibit a novel enzymatic activity. Specifically, the enzymatic acetylation of the synthetic herbicide glyphosate has not been recognized prior to the present invention. Thus, the polypeptides herein described, e.g., as exemplified by SEQ ID NOS: 6-10 and 263-514, define a novel biochemical pathway for 20 the detoxification of glyphosate that is functional in vivo, e.g., in plants. Accordingly, the nucleic acids and polypeptides of the invention are of significant utility in the generation of glyphosate resistant plants by providing new nucleic acids, polypeptides and biochemical pathways for the engineering of herbicide selectivity in transgenic plants. 25 DEFINITIONS Before describing the present invention in detail, it is to be understood that this invention is not limited to particular compositions or biological systems, which can, of course, vary. It is also to be understood that the terminology used herein is for the purpose - of describing particular embodiments only, and is not intended to be limiting. As used in 30 this specification and the appended claims, the singular forms "a", "an" and "the" include plural referents unless the content clearly dictates otherwise. Thus, for example, reference to "a device" includes a combination of-two or more such devices, reference to "a gene fusion construct" includes mixtures of constructs, and the like. -10- WO 02136782 PCT/US01/46227 Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the invention pertains. Although any methods and materials similar or equivalent to those described herein can be used in the practice for testing of the present invention, 5 specific examples of appropriate materials and methods are described herein. In describing and claiming the present invention, the following terminology will be used in accordance with the definitions set out below. For purposes of the present invention, the term "glyphosate" should be considered to include any herbicidally effective form of N-phosphonomethylglycine 10 (including any salt thereof) and other forms which result in the production of the glyphosate anion in planta. The term "glyphosate analog" refers to any structural analog of glyphostate that has the ability to inhibit EPSPS at levels such that the glyphosate analog is herbicidally effective. As used herein, the term "glyphosate-N-acetyltransferase activity" or "GAT 15 activity" refers to the ability to catalyze the acetylation of the secondary amine group of glyphosate, as illustrated, for example, in Figure 1. A "glyphosate -N-acetyltransferase" or "GAT" is an enzyme that catalyzes the acetylation of the amine group of glyphosate, a glyphosate analog, and/or a glyphosate primary metabolite (i.e., AMPA or sarcosine). In sorne preferred embodiments of the invention, a GAT is able to transfer the acetyl group 20 from AcetylCoA to the secondary amine of glyphosate and the primary amine of AMPA. The exemplary GATs described herein are active from pH 5-9, with optimal activity in the range of pH 6.5-8.0. Activity can be quantified using various kinetic parameters well know in the art, e.g., k,, Km, and k./ KM. These kinetic parameters can be determined as described below in Example 7. 25 The terms "polynucleotide," "nucleotide sequence," and "nucleic acid" are used to refer to a polymer of nucleotides (AC,T,U,G, etc. or naturally occurring or artificial nucleotide analogues), e.g., DNA or RNA, or a representation thereof, e.g., a character string, etc, depending on the relevant context. A given polynucleotide or complementary polynucleotide can be determined from any specified nucleotide sequence. 30 Similarly, an "amino acid sequence" is a polymer of amino acids (a protein, polypeptide, etc.) or a character string representing an amino acid polymer, depending on context. The terms "protein," "polypeptide," and "peptide" are used interchangeably herein. - 11 - WO 02/36782 PCT/USO1/46227 A polynucleotide, polypeptide or other component is "isolated" when it is partially or completely separated from components with which it is normally associated (other proteins, nucleic acids, cells, synthetic reagents, etc.). A nucleic acid or polypeptide is "recombinant" when it is artificial or engineered, or derived from an artificial or 5 engineered protein or nucleic acid. For example, a polynucleotide that is inserted into a vector or any other heterologous location, e.g, in a genome of a recombinant organism, such that it is not associated with nucleotide sequences that normally flank the polynucleotide as it is found in nature is a recombinant polynucleotide. A protein expressed in vitro or in vivo from a recombinant polynucleotide is an example of a 10 recombinant polypeptide. Likewise, a polynucleotide sequence that does not appear in nature, for example a variant of a naturally occurring gene, is recombinant The terms "glyphosate N-acetyl transferase polypeptide" and "GAT polypeptide" are used interchangeably to refer to any of a family of novel polypeptides provided herein. 15 The terms "glyphosate N-acetyl transferase polynucleotide" and "GAT polynucleotide" are used interchangeably to refer to a polynucleotide that encodes a GAT polypeptide. A "subsequence" or "fragment" is any portion of an entire sequence. Numbering of an amino acid or nucleotide polymer corresponds to 20 numbering of a selected amino acid polymer or nucleic acid when the position of a given monomer component (amino acid residue, incorporated nucleotide, etc.) of the polymer corresponds to the same residue position in a selected reference polypeptide or polynucleotide. A vector is a composition for facilitating cell transduction by a selected 25 nucleic acid, or expression of the nucleic acid in the cell. Vectors include, e.g., plasmids, cosmids, viruses, YACs, bacteria, poly-lysine, chromosome integration vectors, episomal vectors, etc. "Substantially an entire length of a polynucleotide or amino acid sequence" refers to at least about 70%, generally at least about 80%, or typically about 90% or more 30 of a sequence. As used herein, an "antibody" refers to a protein comprising one or more polypeptides substantially or partially encoded by immunoglobulin genes or fragments of immimoglobulin genes. The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, epsilon and mu constant region genes, as well as myriad - 12- WO 02/36782 PCT/USOI/46227 imnnoglobulin variable region genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, IgM, IgA, IgD and IgE, respectively. A typical immnoglobulin (antibody) structural unit comprises a tetramer. Each tetramer is 5 composed of two identical pairs of polypeptide chains, each pair having one "light" (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus of each chain defines a variable region of about 100 to 110 or more amino acids primarily responsible for antigen recognition. The terms variable light chain (VL) and variable heavy chain (VI) refer to these light and heavy chains respectively. Antibodies exist as intact immunoglobulins or 10 as a number of well characterized fragments produced by digestion with various peptidases. Thus, for example, pepsin digests an antibody below the disulfide linkages in the hinge region to produce F(ab)'2, a dimer of Fab which itself is a light chain joined to VH-CH1 by a disulfide bond. The F(ab)'2 may be reduced under mild conditions to break the disulfide linkage in the hinge region thereby converting the (Fab')2 dimer into an Fab' 15 monomer. The Fab' monomer is essentially an Fab with part of the hinge region (see, Fundamental Immunology, 4ak Edition,W.E. Paul (ed.), Raven Press, N.Y. (1998), for a more detailed description of other antibody fragments). While various antibody fragments are defined in terms of the digestion of an intact antibody, one of skill will appreciate that such Fab' fragments may be synthesized de novo either chemically or by utilizing 20 recombinant DNA methodology. Thus, the term antibody, as used herein also includes antibody fragments either produced by the modification of whole antibodies or synthesized de novo using recombinant DNA methodologies. Antibodies include single chain antibodies, including single chain Fv (sFv) antibodies in which a variable heavy and a variable light chain are joined together (directly or through a peptide linker) to form a 25 continuous polypeptide. A "chloroplast transit peptide" is an amino acid sequence which is translated in conjunction with a protein and directs the protein to the chloroplast or other plastid types present in the cell in which the protein is made. "Chloroplast transit sequence" refers to a nucleotide sequence that encodes a chloroplast transit peptide. 30 A "signal peptide" is an amino acid sequence which is translated in conjunction with a protein and directs the protein to the secretory system (Chrispeels, J. J., (1991) Ann. Rev. Plant Phys. Plant Mol. Biol. 42:21-53). If the protein is to be directed to a vacuole, a vacuolar targeting signal (supra) can further be added, or if to the endoplasmic reticulum, an endoplasmic reticulum retention signal (supra) may be added. If the protein - 13 - WO 02/36782 PCT/USO1/46227 is to be directed to the nucleus, any signal peptide present should be removed and instead a nuclear localization signal included (Raikhel, N. (1992) Plant Phys. 100:1627-1632). The terms "diversification" and "diversity," as applied to a polynucleotide, refers to generation of a plurality of modified forms of a parental polynucleotide, or 5 plurality of parental polynucleotides. In the case where the polynucleotide encodes a polypeptide, diversity in the nucleotide sequence of the polynucleotide can result in diversity in the corresponding encoded polypeptide, e.g. a diverse pool of polynucleotides encoding a plurality of polypeptide variants. In some embodiments of the invention, this sequence diversity is exploited by screening/selecting a library of diversified 10 polynucleotides for variants with desirable functional attributes, e.g., a polynucleotide encoding a GAT polypeptide with enhanced functional characteristics. The term "encoding" refers to the ability of a nucleotide sequence to code for one or more amino acids. The term does not require a start or stop codon. An amino acid sequence can be encoded in any one of six different reading frames provided by a 15 polynucleotide sequence and its complement. When used herein, the term "artificial variant" refers to a polypeptide having GAT activity, which is encoded by a modified GAT polynucleotide, e.g., a modified form of any one of SEQ ID NOS: 1-5 and 11-262, or of a naturally-occurring GAT polynucleotide isolated from an organism. The modified polynucleotide, from 20 which an artificial variant is produced when expressed in a suitable host, is obtained through human intervention by modification of a GAT polynucleotide. The term "nucleic acid construct" or "polynucleotide construct" means a nucleic acid molecule, either single- or double-stranded, which is isolated from a naturally occurring gene or which has been modified to contain segments of nucleic acids in a 25 manner that would not otherwise exist in nature. The term nucleic acid construct is synonymous with the term "expression cassette" when the nucleic acid construct contains the control sequences required for expression of a coding sequence of the present invention. The term "control sequences" is defined herein to include all components, 30 which are necessary or advantageous for the expression of a polypeptide of the present invention. Each control sequence may be native or foreign to the nucleotide sequence encoding the polypeptide. Such control sequences include, but are not limited to, a leader, polyadenylation sequence, propeptide sequence, promoter, signal peptide sequence, and transcription terminator. At a minimum, the control sequences include a promoter, and -14- WO 02136782 PCT/US01/46227 transcriptional and translational stop signals. The control sequences may be provided with linkers for the purpose of introducing specific restriction sites facilitating ligation of the control sequences with the coding region of the nucleotide sequence encoding a polypeptide. 5 The term "operably linked" is defined herein as a configuration in which a control sequence is appropriately placed at a position relative to the coding sequence of the DNA sequence such that the control sequence directs the expression of a polypeptide. When used herein the term "coding sequence" is intended to cover a nucleotide sequence, which directly specifies the amino acid sequence of its protein 10 product. The boundaries of the coding sequence are generally determined by an open reading frame, which usually begins with the ATG start codon. The coding sequence typically includes a DNA, cDNA, and/or recombinant nucleotide sequence. In the present context, the term "expression" includes any step involved in the production of the polypeptide including, but not limited to, transcription, post 15 transcriptional modification, translation, post-translational modification, and secretion. In the present context, the term "expression vector" covers a DNA molecule,. linear or circular, that comprises a segment encoding a polypeptide of the invention, and which is operably linked to additional segments that provide for its transcription. 20 The term "host cell", as used herein, includes any cell type which is susceptible to transformation with a nucleic acid construct. The term "plant" includes whole plants, shoot vegetative organs/structures (e.g. leaves, stems and tubers), roots, flowers and floral organs/structures (e.g. bracts, sepals, petals, stamens, carpels, anthers and ovules), seed (including embryo, endosperm, 25 and seed coat) and fruit (the mature ovary), plant tissue (e.g. vascular tissue, ground tissue, and the like) and cells (e.g. guard cells, egg cells, trichomes and the like), and progeny of same. The class of plants that can be used in the method of the invention is generally as broad as the class of higher and lower plants amenable to transformation techniques, including angiosperms (monocotyledonous and dicotyledonous plants), gymnosperms, 30 ferns, and multicellular algae. It includes plants of a variety of ploidy levels, including aneuploid, polyploid, diploid, haploid and hemizygous. The term "heterologous" as used herein describes a relationship between two or more elements which indicates that the elemennts are not normally found in -15- WO 02136782 PCT/USOI/46227 proximity to one another in nature. Thus, for example, a polynucleotide sequence is "heterologous to" an organism or a second polynucleotide sequence if it originates from a foreign species, or, if from the same species, is modified from its original form. For example, a promoter operably linked to a heterologous coding sequence refers to a coding 5 sequence from a species different from that from which the promoter was derived, or, if from the same species, a coding sequence which is not naturally associated with the promoter (e.g. a genetically engineered coding sequence or an allele from a different ecotype or variety). An example of a heterologous polypeptide is a polypeptide expressed from a recombinant polynucleotide in a transgenic organism. Heterologous 10 polynucleotides and polypeptides are forms of recombinant molecules. A variety of additional terms are defined or otherwise characterized herein. GLYPHOSATE N-ACETYLTRANSFERASES In one aspect, the invention provides a novel family of isolated or recombinant enzymes referred to herein as "glyphosate N-acetyltransferases," "GATs ," 15 or "GAT enzymes." GATs are enzymes that have GAT activity, preferably sufficient activity to confer some degree of glyphosate tolerance upon a transgenic plant engineered to express the GAT. Some examples of GATs include GAT polypeptides, described in more detail below. Of course, GAT-mediated glyphosate tolerance is a complex function of 20 GAT activity, GAT expression levels in the transgenic plant, the particular plant, the nature and timing of herbicide application, etc. One of skill in the art can determine without undue experimentation the level of GAT activity required to effect glyphosate tolerance in a particular context GAT activity can be characterized using the conventional kinetic 25 parameters kv, Km, and kt / KM. kt can be thought of as a measure of the rate of acetylation, particularly at high substrate concentrations, KM is a measure of the affinity of the GAT for its substrates (e.g., Acetyl CoA and glyphosate), and kt / KM is a measure of catalytic efficiency that takes both substrate affinity and catalytic rate into account - this parameter is particularly important in the situation where the concentration of a substrate 30 is at least partially rate limiting. In general, a GAT with a higher k. or kt / KM is a more efficient catalyst than another GAT with lower kct or kt / KM. A GAT with a lower Km is a more efficient catalyst than another GAT with a higher Km. Thus, to determine whether one GAT is more effective than another, one can compare kinetic parameters for the two enzymes. The relative importance of kat, kt / Km and Km will vary depending upon the - 16 - WO 02/36782 PCT/USO1/46227 context in which the GAT will be expected to function, e.g., the anticipated effective concentration of glyphosate relative to Km for glyphosate. GAT activity can also be characterized in terms of any of a number of functional characteristics, e.g., stability, susceptibility to inhibition or activation by other molecules, etc. 5 GLYPHOSATE N-ACETYLTRANSFERASE POLYPEPTIDES In one aspect, the invention provides a novel family of isolated or recombinant polypeptides referred to herein as "glyphosate N-acetyltransferase polypeptides" or "GAT polypeptides." GAT polypeptides are characterized by their structural similarity to a novel family of GATs. Many but not all GAT polypeptides are 10 GATs. The distinction is that GATs are defined in terms of function, whereas GAT polypeptides are defined in terms of structure. A subset of the GAT polypeptides consists of those GAT polypeptides that have GAT activity,~preferably at a level that will function to confer glyphosate resistance upon a transgenic plant expressing the protein at an effective level. Some preferred GAT polypeptides for use in conferring glyphosate 15 tolerance have a kat of at least 1 min~, or more preferably at least 10 min-', 100 min 4 or 1000 min 4 . Other preferred GAT polypeptides for use in conferring glyphosate tolerance have a Km no greater than 100 mM, or more preferably no greater than 10 mM, 1 mM, or 0.1 mM. Still other preferred GAT polypeptides for use in conferring glyphosate tolerance have a kg/ KM of at least 1 Mlmin-1 or more, preferably at least 10 mM'min', 100 20 nM'min 4 , 1000 mv 1 'min 1 , or 10,000 mM'min'. Exemplary GAT polypeptides have been isolated and characterized from a variety of bacterial strains. One example of a monomeric GAT polypeptide that has been isolated and characterized has a molecular radius of approximately 17 kD. An exemplary GAT enzyme isolated from a strain of B. licheniformis, SEQ ID NO:7, exhibits a Km for 25 glyphosate of approximately 2.9 mM and a Km for acetyl CoA of approximately 2 iM, with a kcat equal to 6/minute. The term "GAT polypeptide" refers to any polypeptide comprising an amino acid sequence that can be optimally aligned with an amino acid sequence selected from the group consisting of SEQ ID NOS: 6-10 and 263-514 to generate a similarity 30 score of at least 430 using the BLOSLM62 matrix, a gap existence penalty of 11, and a gap extension penalty of 1. Some aspects of the invention pertain to GAT polypeptides comprising an amino acid sequence that can be optimally aligned with an amino acid sequence selected from the group consisting of SEQ ID NOS: 6-10 and 263-514 to generate a similarity score of at least 440, 445, 450, 455, 460, 465, 470, 475, 480, 485, - 17 - WO 02/36782 PCT/USOI/46227 490, 495, 500, 505, 510, 515, 520, 525, 530, 535, 540, 545, 550, 555, 560, 565, 570, 575, 580, 585, 590, 595, 600, 605, 610, 615, 620, 625, 630, 635, 640, 645, 650, 655, 660, 665, 670, 675, 680, 685, 690, 695, 700, 705, 710, 715, 720, 725, 730, 735, 740, 745, 750, 755, or 760 using the BLOSTM62 matrix, a gap existence penalty of 11, and a gap extension 5 penalty of 1. One aspect of the invention pertains to a GAT polypeptide comprising an amino acid sequence that can be optimally aligned with SEQ ID NO. 457 to generate a similarity score of at least 430 using the BLOSUM62 matrix, a gap existence penalty of 11, and a gap extension penalty of 1. Some aspects of the invention pertain to GAT 10 polypeptides comprising an amino acid sequence that can be optimally aligned with SEQ ID NO. 457 to generate a similarity score of at least 440, 445, 450, 455, 460, 465, 470, 475, 480, 485, 490, 495, 500, 505, 510, 515, 520, 525, 530, 535, 540, 545, 550, 555, 560, 565, 570, 575, 580, 585, 590, 595, 600, 605, 610, 615, 620, 625, 630, 635, 640, 645, 650, 655, 660, 665, 670, 675, 680, 685, 690, 695, 700, 705, 710, 715, 720, 725, 730, 735, 740, 15 745, 750, 755, or 760 using the BLOSLM62 matrix, a gap existence penalty of 11, and a gap extension penalty of 1. One aspect of the invention pertains to a GAT polypeptide comprising an amino acid sequence that can be optimally aligned with SEQ ID NO. 445 to generate a similarity score of at least 430 using the BLOSLM62 matrix, a gap existence penalty of 20 11, and a gap extension penalty of 1. Some aspects of the invention pertain to GAT polypeptides comprising an amino acid sequence that can be optimally aligned with SEQ ID NO. 445 to generate a similarity score of at least 440, 445, 450, 455, 460, 465, 470, 475, 480, 485, 490, 495, 500, 505, 510, 515, 520, 525, 530, 535, 540, 545, 550, 555, 560, 565, 570, 575, 580, 585, 590, 595, 600, 605, 610,615, 620, 625, 630, 635, 640, 645, 650, 25 655, 660, 665, 670, 675, 680, 685, 690, 695, 700, 705, 710, 715, 720, 725, 730, 735, 740, 745,750,755, or 760 using the BLOSUM62 matrix, a gap-existence penalty of 11, and a gap extension penalty of 1. One aspect of the invention pertains to a GAT polypeptide comprising an amino acid sequence that can be optimally aligned with SEQ ID NO:300 to generate a 30 similarity score of at least 430 using the BLOSUM62 matrix, a gap existence penalty of 11, and a gap extension penalty of 1. Some aspects of the invention pertain to GAT polypeptides comprising an amino acid sequence that can be optimally aligned with SEQ ID NO: 300 to generate a similarity score of at least 440, 445, 450, 455, 460, 465, 470, 475, 480, 485, 490, 495, 500, 505, 510, 515, 520, 525, 530, 535, 540, 545, 550, 555, 560, -18- WO 02/36782 PCT/US01/46227 565, 570, 575, 580, 585, 590, 595, 600, 605, 610, 615, 620, 625, 630, 635, 640, 645, 650, 655, 660, 665, 670, 675, 680, 685, 690, 695, 700, 705, 710, 715, 720, 725, 730, 735, 740, 745, 750,755, or 760 using the BLOSUM62 matrix, a gap existence penalty of 11, and a gap extension penalty of 1. 5 Two sequences are "optimally aligned" when they are aligned for similarity scoring using a defined amino acid substitution matrix (e.g., BLOSUM62), gap existence penalty and gap extension penalty so as to anive at the highest score possible for that pair of sequences. Amino acids substitution matrices and their use in quantifying the similarity between two sequences are well-known in the art and described, e.g., in Dayhoff et al. 10 (1978) "A model of evolutionary change in proteins." In "Atlas of Protein Sequence and Structure," Vol. 5, Suppl. 3 (ed. M.O. Dayhoff), pp. 345-352. Natl. Biomed. Res. Found., Washington, DC and Henikoff et al. (1992) Proc. NatL Acad. Sci. USA 89:10915-10919. The BLOSUM62 matrix (Fig. 10) is often used as a default scoring substitution matrix in sequence alignment protocols such as Gapped BLAST 2.0. The gap existence penalty is 15 imposed for the introduction of a single amino acid gap in one of the aligned sequences, and the gap extension penalty is imposed for each additional empty amino acid position inserted into an already opened gap. The alignment is defined by the amino acids positions of each sequence at which the alignment begins and ends, and optionally by the insertion of a gap or multiple gaps in one or both sequences, so as to arrive at the highest 20 possible score. While optimal alignment and scoring can be accomplished manually, the process is facilitated by the use of a computer-implemented alignment algorithm, e.g., gapped BLAST 2.0, described in Altschul et al, (1997) Nucleic Acids Res. 25:3389-3402, and made available to the public at the National Center for Biotechnology Information Website (http://www.ncbi.nlm.nih.gov). Optimal alignments, including multiple 25 alignments, can be prepared using, e.g., PSI-BLAST, available through http://www.ncbi.nlm.nih.gov and described by Altschul et al, (1997) Nucleic Acids Res. 25:3389-3402. With respect to an amino acid sequence that is optimally aligned with a reference sequence, an amino acid residue "corresponds to" the position in the reference 30 sequence with which the residue is paired in the alignment The "position" is denoted by a number that sequentially identifies each amino acid in the reference sequence based on its position relative to the N-terminus. For example, in SEQ ID NO:300 position 1 is M, position 2 is I, position 3 is E, etc. When a test sequence is optimally aligned with SEQ ID NO:300, a residue in the test sequence that aligns with the E at position 3 is said to - 19 - WO 02136782 PCT/USOI/46227 "correspond to position 3" of SEQ ID NO:300. Owing to deletions, insertion, truncations, fusions, etc., that must be taken into account when determining an optimal alignment, in general the amino acid residue number in a test sequence as determined by simply counting from the N-terminal will not necessarily be the same as the number of its 5 corresponding position in the reference sequence. For example, in a case where there is a deletion in an aligned test sequence, there will be no amino acid that corresponds to a position in the reference sequence at the site of deletion. Where there is an insertion in an aligned reference sequence, that insertion will not correspond to any amino acid position in the reference sequence. In the case of truncations or fusions there can be stretches of 10 amino acids in either the reference or aligned sequence that do not correspond to any amino acid in the corresponding sequence. The tern "GAT polypeptide" further refers to any polypeptide comprising an amino acid sequence having at least 40% sequence identity with an amino acid sequence selected from the group consisting of SEQ ID NOS: 6-10 and 263-514. Some 15 aspects of the invention pertain to GAT polypeptides comprising an amino acid sequence having at least 60%, 70%, 80%, 90%, 92%, 95%, 96%, 97%, 98%, or 99% sequence identity with an amino acid sequence selected from the group consisting of SEQ ID NOS: 6-10 and 263-514. One aspect of the invention pertains to a GAT polypeptide comprising an 20 amino acid sequence having at least 40% sequence identity with SEQ ID NO. 457. Some aspects of the invention pertain to GAT polypeptides comprising an amino acid sequence having at least 60%, 70%, 80%, 90%, 92%, 95%, 96%, 97%, 98%, or 99% sequence identity with SEQ ID NO. 457. One aspect of the invention pertains to a GAT polypeptide comprising an 25 amino acid sequence having at least 40% sequence identity with SEQ I) NO. 445. Some aspects of the invention pertain to GAT polypeptides comprising an amino acid sequence having at least 60%, 70%, 80%, 90%, 92%, 95%, 96%, 97%, 98%, or 99% sequence identity with SEQ ID NO. 445. One aspect of the invention pertains to a GAT polypeptide comprising an 30 amino acid sequence having at least 40% sequence identity with SEQ ID NO. 300. Some aspects of the invention pertain to GAT polypeptides comprising an amino acid sequence having at least 60%, 70%, 80%, 90%, 92%, 95%, 96%, 97%, 98%, or 99% sequence identity with SEQ ED NO. 300. -20- WO 02/36782 PCT/US01/46227 The term "GAT polypeptide" further refers to any polypeptide comprising an amino acid sequence having at least 40% sequence identity with residues 1-96 of an amino acid sequence selected from the group consisting of SEQ ID NOS: 6-10 and 263 514. Some aspects of the invention pertain to polypeptides comprising an amino acid 5 sequence having at least 60%, 70%, 80%, 90%, 92%, 95%, 96%, 97%, 98%, or 99% sequence identity with residues 1-96 of an amino acid sequence selected from the group consisting of SEQ IID NOS: 6-10 and 263-514. One aspect of the invention pertains to a polypeptide comprising an amino acid sequence having at least 40% sequence identity with residues 1-96 of SEQ ID NO. 10 457. Some aspects of the invention pertain to GAT polypeptides comprising an amino acid sequence having at least 60%, 70%, 80%, 90%, 92%, 95%, 96%, 97%, 98%, or 99% sequence identity with residues 1-96 of SEQ ID NO. 457. One aspect of the invention pertains to a GAT polypeptide comprising an amino acid sequence having at least 40% sequence identity with residues 1-96 of SEQ ID 15 NO. 445. Some aspects of the invention pertain to GAT polypeptides comprising an amino acid sequence having atleast 60%, 70%, 80%, 90%, 92%, 95%, 96%, 97%, 98%, or 99% sequence identity with residues 1-96 of SEQ ID NO. 445. One aspect of the invention pertains to a GAT polypeptide comprising an amino acid sequence having at least 40% sequence identity with residues 1-96 of SEQ ID 20 NO. 300. Some aspects of the invention pertain to GAT polypeptides comprising an amino acid sequence having at least 60%, 70%, 80%, 90%, 92%, 95%, 96%, 97%, 98%, or 99% sequence identity with residues 1-96 of SEQ ID NO. 300. The term "GAT polypeptide" further refers to any polypeptide comprising an amino acid sequence having at least 40% sequence identity with residues 51-146 of an 25 amino acid sequence selected from the group consisting of SEQ ID NOS: 6-10 and 263 514. Some aspects of the invention pertain to polypeptides comprising an amino acid sequence having at least 60%, 70%, 80%, 90%, 92%, 95%, 96%, 97%, 98%, or 99% sequence identity with residues 51-146 of an amino acid sequence selected from the group consisting of SEQ ID NOS: 6-10 and 263-514. 30 One aspect of the invention pertains to a polypeptide comprising an amino acid sequence having at least 40% sequence identity with residues 51-146 of SEQ ID NO. 457. Some aspects of the invention pertain to GAT polypeptides comprising an amino acid sequence having at least 60%, 70%, 80%, 90%, 92%, 95%, 96%, 97%, 98%, or 99% sequence identity with residues 51-146 of SEQ ID NO. 457. -21- WO 02136782 PCT/USO1/46227 One aspect of the invention pertains to a GAT polye copJ ILeaal 41SE amino acid sequence having at least 40% sequence identity with residues 51-146 of SEQ ID NO. 445. Some aspects of the invention pertain to GAT polypeptides comprising an amino acid sequence having at least 60%, 70%, 80%, 90%, 92%, 95%, 96%, 97%, 98%, 5 or 99% sequence identity with residues 51-146 of SEQ ID NO. 445. One aspect of the invention pertains to a GAT polypeptide comprising an amino acid sequence having at least 40% sequence identity with residues 51-146 of SEQ ID NO. 300. Some aspects of the invention pertain to GAT polypeptides comprising an amino acid sequence having at least 60%, 70%, 80%, 90%, 92%, 95%, 96%, 97%, 98%, 10 or 99% sequence identity with residues 51-146 of SEQ ID NO. 300. As used herein, the term "identity" or "percent identity" when used with respect to a particular pair of aligned amino acid sequences, refers to the percent amino acid sequence identity that is obtained by ClustalW analysis (version W 1.8 available from European Bioinformatics Institute, Cambridge, UK), counting the number of identical 15 matches in the alignment and dividing such number of identical matches by the greater of (i) the length of the aligned sequences, and (ii) 96, and using the following default ClustalW parameters to achieve slow/accurate pairwise alignments - Gap Open Penalty:10; Gap Extension Penalty:0.10; Protein weight matrix:Gonnet series; DNA weight matrix: 1UB; Toggle Slow/Fast pairwise alignments = SLOW or FULL Alignment. 20 In another aspect, the invention provides an isolated or recombinant polypeptide that comprises at least 20, or alternatively, 50, 75, 100, 125 or 140 contiguous amino acids of an amino acid sequence selected from the group consisting of SEQ ID NOS: 6-10 and 263-514. In another aspect, the invention provides an isolated or recombinant 25 polypeptide that comprises at least 20, or alternatively, 50, 100 or 140 contiguous amino acids of SEQ ID NO:457. In another aspect, the invention provides an isolated or recombinant polypeptide that comprises at least 20, or alternatively, 50, 100 or 140 contiguous amino acids of SEQ ID NO:445. 30 In another aspect, the invention provides an isolated or recombinant polypeptide that comprises at least 20, or alternatively, 50, 100 or 140 contiguous amino acids of SBQ ID NO:300. -22- WO 02136782 PCT/US01/46227 In another aspect, the invention provides a polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NOS: 6-10 and 263 514. Some preferred GAT polypeptides of the invention are characterized as 5 follows. When optimally aligned with a reference amino acid sequence selected from the group consisting of SEQ ID NO:6-10 and 263-514, at least 90% of the amino acid residues in the polypeptide that correspond to the following positions conform to the following restrictions: (a) at positions 2, 4, 15, 19, 26, 28, 31, 45, 51, 54, 86, 90, 91, 97, 103, 105, 106, 114, 123, 129, 139, and/or 145 the amino acid residue is B1; and (b) at positions 3, 5, 10 8, 10, 11, 14, 17, 18, 24, 27, 32, 37, 38, 47, 48, 49, 52, 57, 58, 61, 62, 63, 68, 69, 79, 80, 82, 83, 89, 92, 100, 101, 104, 119, 120, 124, 125, 126, 128, 131, 143, and/or 144 the amino acid residue is B2; wherein B 1 is an amino acid selected from the group consisting of A, I, L, M, F, W, Y, and V; and B2 is an amino acid selected from the group consisting of R, N, D, C, Q, E, G, H, K, P, S, and T. When used to specify an amino acid or amino 15 acid residue, the single letter designations A, C, D, E, F, G, E, I, K, L, M, N, P, Q, R, S, T, V, W, and Y have their standard meaning as used in the art and as provided in Table 2 herein. Some preferred GAT polypeptides of the invention ate characterized as follows. When optimally aligned with a reference amino acid sequence selected from the 20 group consisting of SEQ ID NO:6-10 and 263-514, at least 80% of the amino acid residues in the polypeptide that correspond to the following positions conform to the following restrictions: (a) at positions 2, 4, 15, 19, 26, 28, 51, 54, 86, 90, 91, 97, 103, 105, 106, 114, 129, 139, and/or 145 the amino acid residue is Zi; (b) at positions 31 and/or 45 the amino acid residue is Z2; (c) at positions 8 and/or 89 the amino acid residue is Z3; (d) at 25 positions 82, 92, 101 and/or 120 the amino acid residue is Z4; (e) at positions 3, 11, 27 and/or 79 the amino acid residue is Z5; (f) at position 123 the amino acid residue is Z1 or Z2; (g) at positions 12, 33, 35, 39, 53, 59, 112, 132, 135, 140, and/or 146 the amino acid residue is Z1 or Z3; (h) at position 30 the amino acid residue is Z1 or Z4; (i) at position 6 the amino acid residue is Z1 or Z6; (j) at positions 81 and/or 113 the amino acid residue is 30 Z2 or Z3; (k) at positions 138 and/or 142 the amino acid residue is Z2 or Z4; (1) at positions 5, 17, 24, 57, 61, 124 and/or 126 the amino acid residue is Z3 or Z4; (m) at position 104 the amino acid residue is Z3 or Z5; (o) at positions 38, 52, 62 and/or 69 the amino acid residue is Z3 or Z6; (p) at positions 14, 119 and/or 144 the amino acid residue is Z4 or Z5; (q) at position 18 the amino acid residue is Z4 or Z6; (r) at positions 10, 32, -23 - WO 02/36782 PCT/USO1/46227 48, 63, 80 and/or 83 the amino acid residue is Z5 or Z6; (s) at position 40 the amino acid residue is Z1, Z2 or Z3; (t) at positions 65 and/or 96 the amino acid residue is Z1, Z3 or Z5; (u) at positions 84 and/or 115 the amino acid residue is Z1, Z3 or Z4; (v) at position 93 the amino acid residue is Z2, Z3 or Z4; (w) at position 130 the amino acid residue is 5 Z2, Z4 or Z6; (x) at positions 47 and/or 58 the amino acid residue is Z3, Z4 or Z6; (y) at positions 49, 68, 100 and/or 143 the amino acid residue is Z3, Z4 or Z5; (z) at position 131 the amino acid residue is Z3, Z5 or Z6; (aa) at positions 125 and/or 128 the amino acid residue is Z4, Z5 or Z6; (ab) at position 67 the amino acid residue is Z1, Z3, Z4 or Z5; (ac) at position 60 the amino acid residue is Z1, Z4, Z5 or Z6; and(ad) at position 37 the amino 10 acid residue is Z3, Z4, Z5 or Z6; wherein Z1 is an amino acid selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid selected from the group consisting of F, W, and Y; Z3 is an amino acid selected from the group consisting of N, Q, S, and T; Z4 is an amino acid selected from the group consisting of R, H, and K; Z5 is an amino acid selected from the group consisting of D and E; and Z6 is an amino acid selected from the 15 group consisting of C, G, and P. Some preferred GAT polypeptides of the invention are characterized as follows. When optimally aligned with a reference amino acid sequence selected from the group consisting of SEQ ID NO:6-10 and 263-514, at least 90% of the amino acid residues in the polypeptide that correspond to the following positions conform to the following 20 restrictions: (a) at positions 1, 7, 9, 13, 20, 36, 42, 46, 50, 56, 64, 70, 72, 75, 76, 78, 94, 98, 107, 110, 117, 118, 121, and/or 141 the amino acid residue is B1; and (b) at positions 16, 21, 22, 23, 25, 29, 34, 41, 43, 44, 55, 66, 71, 73, 74, 77, 85, 87, 88, 95, 99, 102, 108, 109, 111, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is B2; wherein B1 is an amino acid selected from the group consisting of A, I, L, M, F, W, Y, and V; and B2 is 25 an amino acid selected from the group consisting of R, N, D, C, Q, E, G, H, K, P, S, and T. Some preferred GAT polypeptides of the invention are characterized as follows. When optimally aligned with a reference amino acid sequence selected from the group consisting of SEQ ID NO:6-10 and 263-514, at least 90% of the amino acid residues in the polypeptide that correspond to the following positions conform to the following 30 restrictions: (a) at positions 1, 7, 9, 20, 36, 42, 50, 64, 72, 75, 76, 78, 94, 98, 110, 121, and/or 141 the amino acid residue is ZI; (b) at positions 13, 46, 56, 70, 107, 117, and/or 118 the amino acid residue is Z2; (c) at positions 23, 55, 71, 77, 88, and/or 109 the amino acid residue is Z3; (d) at positions 16, 21, 41, 73, 85, 99, and/or 111 the amino acid residue is Z4; (e) at positions 34 and/or 95 the amino acid residue is Z5; (f) at position 22, -24- WO 02136782 PCT/USOI/46227 25, 29, 43, 44, 66, 74, 87, 102, 108, 116, 122, 127, 133, 134, 136, and/or 137 the amino acid residue is Z6; wherein Z1 is an amino acid selected from the group consisting of A, I, L, M, and V; Z2 is an amino acid selected from the group consisting of F, W, and Y; Z3 is an amino acid selected from the group consisting of N, Q, S, and T; Z4 is an amino acid 5 selected from the group consisting of R, H, and K; Z5 is an amino acid selected from the group consisting of D and E; and Z6 is an amino acid selected from the group consisting of C, G, and P. Some preferred GAT polypeptides of the invention are characterized as follows. When optimally aligned with a reference amino acid sequence selected from the 10 group consisting of SEQ ID NO:6-10 and 263-514, at least 80% of the amino acid residues in the polypeptide that correspond to the following positions conform to the following restrictions: (a) at position 2 the amino acid residue is I or L; (b) at position 3 the amino acid residue is E or D; (c) at position 4 the amino acid residue is V, A or I; (d) at position 5 the amino acid residue is K, R or N; (e) at position 6 the amino acid residue is P or L; (f) at 15 position 8 the amino acid residue is N, S or T; (g) at position 10 the amino acid residue is E or G; (h) at position 11 the amino acid residue is D or E; (i) at position 12 the amino acid residue is T or A; (j) at position 14 the amino acid residue is E or K; (k) at position 15 the amino acid residue is I or L; (1) at position 17 the amino acid residue is H or Q; (m) at position 18 the amino acid residue is R, C or K; (n) at position 19 the amino acid residue is 20 I or V; (o) at position 24 the amino acid residue is Q or R; (p) at position 26 the amino acid residue is L or I; (q) at position 27 the amino acid residue is E or D; (r) at position 28 the amino acid residue is A or V; (s) at position 30 the amino acid residue is K, M or R; (t) at position 31 the amino acid residue is Y or F; (u) at position 32 the amino acid residue is E or G; (v) at position 33 the amino acid residue is T, A or S; (w) at position 35 the amino 25 acid residue is L, S or M; (x) at position 37 the amino acid residue is R, G, E or Q; (y) at position 38 the amino acid residue is G or S; (z) at position 39 the amino acid residue is T, A or S; (aa) at position 40 the amino acid residue is F, L or S; (ab) at position 45 the amino acid residue is Yor F; (ac) at position 47 the amino acid residue is R, Q or G; (ad) at position 48 the amino acid residue is G or D; (ae) at position 49 the amino acid residue 30 is K, R, E or Q; (af) at position 51 the amino acid residue is I or V; (ag) at position 52 the amino acid residue is S, C or G; (ah) at position 53 the amino acid residue is I or T; (ai) at position 54 the amino acid residue is A or V; (aj) at position 57 the amino acid residue is H or N; (ak) at position 58 the amino acid residue is Q, K, N or P; (al) at position 59 the amino acid residue is A or S; (am) at position 60 the amino acid residue is E, K, G, V or -25- WO 02/36782 PCT/USOI/46227 D; (an) at position 61 the amino acid residue is H or Q; (ao) at position 62 the amino acid residue is P, S or T; (ap) at position 63 the amino acid residue is E, G or D; (aq) at position 65 the amino acid residue is E, D, V or Q; (ar) at position 67 the amino acid residue is Q, E, R, L, H or K; (as) at position 68 the amino acid residue is K, R, E, or N; (at) at position 5 69 the amino acid residue is Q or P; (au) at position 79 the amino acid residue is E or D; (av) at position 80 the amino acid residue is G or E; (aw) at position 81 the amino acid residue is Y, N or F; (ax) at position 82 the amino acid residue is R or H; (ay) at position 83 the amino acid residue is E, G or D; (az) at position 84 the amino acid residue is Q, R or L; (ba) at position 86 the amino acid residue is A or V; (bb) at position 89 the amino 10 acid residue is T or S; (bc) at position 90 the amino acid residue is L or I; (bd) at position 91 the amino acid residue is I or V; (be) at position 92 the amino acid residue is R or K; (bf) at position 93 the amino acid residue is H, Y or Q; (bg) at position 96 the amino acid residue is E, A or Q; (bh) at position 97 the amino acid residue is L or I; (bi) at position 100 the amino acid residue is K, R, N or E; (bj) at position 101 the amino acid residue is K 15 or R; (bk) at position 103 the amino acid residue is A or V; (b1) at position 104 the amino acid residue is D or N; (bm) at position 105 the amino acid residue is L or M; (bn) at position 106 the amino acid residue is L or I; (bo) at position 112 the amino acid residue is T or I; (bp) at position 113 the amino acid residue is S, T or F; (bq) at position 114 the amino acid residue is A or V; (br) at position 115 the amino acid residue is S, R or A; (bs) 20 at position 119 the amino acid residue is K, E or R; (bt) at position 120 the amino acid residue is K or R; (bu) at position 123 the amino acid residue is F or L; (by) at position 124 the amino acid residue is S or R; (bw) at position 125 the amino acid residue is E, K, G or D; (bx) at position 126 the amino acid residue is Q or H; (by) at position 128 the amino acid residue is E, G or K; (bz) at position 129 the amino acid residue is V, I or A; 25 (ca) at position 130 the amino acid residue is Y, H, F or C; (cb) at position 131 the amino acid residue is D, G, N or E; (cc) at position 132 the amino acid residue is I, T, A, M, V or L; (cd) at position 135 the amino acid residue is V, T, A or I; (ce) at position 138 the amino acid residue is H or Y; (cf) at position 139 the amino acid residue is I or V; (cg) at position 140 the amino acid residue is L or S; (ch) at position 142 the amino acid residue 30 is Y or H; (ci) at position 143 the amino acid residue is K, T or E; (cj) at position 144 the amino acid residue is K, E or R; (ck) at position 145 the amino acid residue is L or I; and (c1) at position 146 the amino acid residue is T or A. Some preferred GAT polypeptides of the invention are characterized as follows. When optimally aligned with a reference amino acid sequence selected from the -26 - WO 02/36782 PCTUSO1/46227 group consisting of SEQ ED NO:6-10 and 263-514, at least 80% of the amino acid residues in the polypeptide that correspond to the following positions conform to the following restrictions: (a) at position 9, 76, 94 and 110 the amino acid residue is A; (b) at position 29 and 108 the amino acid residue is C; (c) at position 34 the amino acid residue is D; (d) at 5 position 95 the amino acid residue is E; (e) at position 56 the amino acid residue is F; (f) at position 43, 44, 66, 74, 87, 102, 116, 122, 127 and 136 the amino acid residue is G; (g) at position 41 the amino acid residue is H; (h) at position 7 the amino acid residue is I; (i) at position 85 the amino acid residue is K; (j) at position 20, 36, 42, 50, 72, 78, 98 and 121 the amino acid residue is L; (k) at position 1, 75 and 141 the amino acid residue is M; (1) at 10 position 23, 64 and 109 the amino acid residue is N; (m) at position 22, 25, 133, 134 and 137 the amino acid residue is P; (n) at position 71 the amino acid residue is Q; (o) at position 16, 21, 73, 99 and 111 the amino acid residue is R; (p) at position 55 and 88 the amino acid residue is S; (q) at position 77 the amino acid residue is T; (r) at position 107 the amino acid residue is W; and (s) at position 13, 46, 70, 117 and 118 the amino acid 15 residue is Y. Some preferred GAT polypeptides of the invention are characterized as follows. When optimally aligned with a reference amino acid sequence selected from the group consisting of SEQ ID NO:6-10 and 263-514, the amino acid residue in the polypeptide that correspond to position 28 is V or A. Valine at the 28 position generally 20 correlates with reduced KM, while alanine at that position generally correlates with increased k,. Other preferred GAT polypeptides are characterized by having 127 (i.e., an I at position 27), M30, S35, R37, S39, G48, K49, N57, Q58, P62, Q65, Q67, K68, E83, S89, A96, E96, R101, T112, A114, K119, K120, E128, V129, D131, T131, V134, R144, 1145, or T146, or any combination thereof. 25 Some preferred GAT polypeptides of the invention comprise an amino acid sequence selected from the group consisting of SEQ ID NOS:6-10 and 263-514. The invention further provides preferred GAT polypeptides that are characterized by a combination of the foregoing amino acid residue position restrictions. In addition, the invention provides GAT polynucleotides encoding the 30 preferred GAT polypeptides described above, and complementary nucleotide sequences thereof. Some aspects of the invention pertain particularly to the subset of any of the above-described categories of GAT polypeptides having GAT activity, as described herein. These GAT polypeptides are preferred, for example, for use as agents for -27- WO 02/36782 PCT/US01/46227 conferring glyphosate resistance upon a plant. Examples of desired levels of GAT activity are described herein. In one aspect, the GAT polypeptides comprise an amino acid sequence encoded by a recombinant or isolated form of naturally occurring nucleic acids isolated 5 from a natural source, e.g., a bacterial strain. Wild-type polynucleotides encoding such GAT polypeptides may be specifically screened for by standard techniques known in the art. The polypeptides defined by SEQ ID NO:6 to SEQ ID NO:10, for example, were discovered by expression cloning of sequences from Bacillus strains exhibiting GAT activity, as described in more detail below. 10 The invention also includes isolated or recombinant polypeptides which are encoded by an isolated or recombinant polynucleotide comprising a nucleotide sequence which hybridizes under stringent conditions over substantially the entire length of a nucleotide sequence selected from the group consisting of SEQ IID NOS: 1-5 and 11-262, their complements, and nucleotide sequences encoding an amino acid sequence selected 15 from the group consisting of SEQ ID NOS: 6-10 and 263-514, including their complements. The invention further includes any polypeptide having GAT activity that is encoded by a fragment of any of the GAT-encoding polynucleotides described herein. The invention also provides fragments of GAT polypeptides that can be 20 spliced together to form a functional GAT polypeptide. Splicing can be accomplished in vitro or in vivo, and can involve cis or trans (i.e., intramolecular or intermolecular) splicing. The fragments themselves can, but need not, have GAT activity. For example, two or more segments of a GAT polypeptide can be separated by inteins; removal of the intein sequence by cis-splicing results in a functional GAT polypeptide. In another 25 example, an encrypted GAT polypeptide can be expressed as two or more separate fragments; trans-splicing of these segments results in recovery of a functional GAT polypeptide. Various aspects of cis and trans splicing, gene encryption, and introduction of intervening sequences are described in more detail in US patent application Nos. 09/517,933 and 09/710,686, both of which are incorporated by reference herein in their 30 entirety. In general, the invention includes any polypeptide encoded by a modified GAT polynucleotide derived by mutation, recursive sequence recombination, and/or diversification of the polynucleotide sequences described herein. In some aspects of the invention, a GAT polypeptide is modified a by single or multiple amino acid substitution, -28 - WO 02/36782 PCT/USO1/46227 a deletion, an insertion, or a combination of one or more of these types of modifications. Substitutions can be conservative, or non-conservative, can alter function or not, and can add new function. Insertions and deletions can be substantial, such as the case of a truncation of a substantial fragment of the sequence, or in the fusion of additional 5 sequence, either internally or at N or C terminal. In some embodiments of the invention, a GAT polypeptide is part of a fusion protein comprising a functional addition such as, for example, a secretion signal, a chloroplast transit peptide, a purification tag, or any of numerous other functional groups that will be apparent to the skilled artisan, and which are described in more detail elsewhere in this specification. 10 Polypeptides of the invention may contain one or more modified amino acid. The presence of modified amino acids may be advantageous in, for example, (a) increasing polypeptide in vivo half-life, (b) reducing or increasing polypeptide antigenicity, (c) increasing polypeptide storage stability. Amino acid(s) are modified, for example, co-translationally or post-translationally during recombinant production (e.g., N 15 linked glycosylation at N-X-S/T motifs during expression in mammalian cells) or modified by synthetic means. Non-limiting examples of a modified amino acid include a glycosylated amino acid, a sulfated amino acid, a prenlyated (e.g., farnesylated, geranylgeranylated) amino acid, an acetylated amino acid, an acylated amino acid, a PEG-ylated amino acid, a 20 biotinylated amino acid, a carboxylated amino acid, a phosphorylated amino acid, and the like. References adequate to guide one of skill in the modification of amino acids are replete throughout the literature. Example protocols are found in Walker (1998) Protein Protocols on CD-ROM Human Press, Towata, NJ. Recombinant methods for producing and isolating GAT polypeptides of the 25 invention are described herein. In addition to recombinant production, the polypeptides may be produced by direct peptide synthesis using solid-phase techniques (e.g., Stewart et al. (1969) Solid-Phase Peptide Synthesis. WH Freeman Co, San Francisco; Merrifield J (1963) . Am. Chem. Soc. 85:2149-2154). Peptide synthesis may be performed using manual techniques or by automation. Automated synthesis may be achieved, for example, 30 using Applied Biosystems 43 1A Peptide Synthesizer (Perkin Elmer, Foster City, Calif.) in accordance with the instructions provided by the manufacturer. For example, subsequences may be chemically synthesized separately and combined using chemical methods to provide full-length GAT polypeptdides. Peptides can also be ordered from a variety of sources. -29 - WO 02136782 PCT/US01/46227 In another aspect of the invention, a GAT polypeptide of the invention is used to produce antibodies which have, e.g., diagnostic uses, for example, related to the activity, distribution, and expression of GAT polypeptides, for example, in various tissues of a transgenic plant 5 GAT homologue polypeptides for antibody induction do not. require biological activity; however, the polypeptide or oligopeptide must be antigenic. Peptides used to induce specific antibodies may have an amino acid sequence consisting of at least 10 amino acids, preferably at least 15 or 20 amino acids. Short stretches of a GAT polypeptide may be fused with another protein, such as keyhole limpet hemocyanin, and 10 antibody produced against the chimeric molecule. Methods of producing polyclonal and monoclonal antibodies are known to those of skill in the art, and many antibodies are available. See, e.g., Coligan (1991) Current Protocols in Immunology Wiley/Greene, NY; and Harlow and Lane (1989) Antibodies: A Laboratory Manual Cold Spring Harbor Press, NY; Stites et al. (eds.) Basic 15 and Clinical Immunology (4th ed.) Lange Medical Publications, Los Altos, CA, and references cited therein; Goding (1986) Monoclonal Antibodies: Principles and Practice (2d ed.) Academic Press, New York, NY; and Kohler and Milstein (1975) Nature 256: 495-497. Other suitable techniques for antibody preparation include selection of libraries of recombinant antibodies in phage or similar vectors. See, Huse et al. (1989) Science 20 246: 1275-1281; and Ward, et al. (1989) Nature 341: 544-546. Specific monoclonal and polyclonal antibodies and antisera will usually bind with a KD of at least about 0.1 p1M, preferably at least about 0.01 FLM or better, and most typically and preferably, 0.001 pM or better. Additional details antibody production and engineering techniques can be 25 found in Borrebaeck (ed) (1995) Antibody Engineering, 2 d Edition Freeman and Company, NY (Borrebaeck); McCafferty et al. (1996) Antibody Engineering, A Practical Approach IRL at Oxford Press, Oxford, England (McCafferty), and Paul (1995) Antibody Enineering Protocols Humana Press, Towata, NJ (Paul). Sequence Variations 30 GAT polypeptides of the present invention include conservatively modified variations of the sequences disclosed herein as SEQ ID NOS: 6-10 and 263-514. Such conservatively modified variations comprise substitutions, additions or deletions which alter, add or delete a single amino acid or a small percentage of amino acids (typically less -30 - WO 02/36782 PCT/US01/46227 than about 5%, more typically less than about 4%, 2%, or 1%) in any of SEQ ID NOS: 6 10 and 263-514. For example, a conservatively modified variation (e.g., deletion) of the 146 amino acid polypeptide identified herein as SEQ ID NO:6 will have a length of at least 5 140 amino acids, preferably at least 141 amino acids, more preferably at least 144 amino acids, and still more preferably at least 146 amino acids, corresponding to a deletion of less than about 5%, 4%, 2% or about 1%, or less of the polypeptide sequence. Another example of a conservatively modified variation (e.g., a conservativelyy substituted variation") of the polypeptide identified herein as SEQ ID 10 NO:6 will contain "conservative substitutions", according to the six substitution groups set forth in Table 2 (infra), in up to about 7 residues (i.e., less than about 5%) of the 146 amino acid polypeptide. The GAT polypeptide sequence homologues of the invention, including conservatively substituted sequences, can be present as part of larger polypeptide 15 sequences such as occur in a GAT polypeptide, in a GAT fusion with a signal sequence, e.g., a chloraplast targeting sequence, or upon the addition of one or more domains for purification of the protein (e.g., poly his segments, FLAG tag segments, etc.). In the latter case, the additional functional domains have little or no effect on the activity of the GAT portion of the protein, or where the additional domains can be removed by post synthesis 20 processing steps such as by treatment with a protease. Defining Polyeptides by Immunoreactivity Because the polypeptides of the invention provide a new class of enzymes with a defined activity, i.e., the acetylation of glyphosate, the polypeptides also provide new structural features which can be recognized, e.g., in immunological assays. The 25 generation of antisera which specifically binds the polypeptides of the invention, as well as the polypeptides which are bound by such antisera, are a feature of the invention. The invention includes GAT polypeptides that specifically bind to or that are specifically immunoreactive with an antibody or antisera generated against an immunogen comprising an amino acid sequence selected from one or more of SEQ ID 30 NO:6 to SEQ ID NO:10. To eliminate cross-reactivity with other GAT homologues, the antibody or antisera is subtracted with available related proteins, such as those represented by the proteins or peptides corresponding to GenBank accession numbers available as of the filing date of this application, and exemplified by CAA70664, Z99109 and Y09476. -31- WO 02136782 PCT/US01/46227 Where the accession number corresponds to a nucleic acid, a polypeptide encoded by the nucleic acid is generated and used for antibody/antisera subtraction purposes. Figure 3 tabulates the relative identity between exemplary GAT polypeptides and the most closely related sequence available in Genbank, Yit. The function of native YitI has yet to be 5 elucidated, but the enzyme has been shown to possess detectable GAT activity. In one typical format, the immunoassay uses a polyclonal antiserum which was raised against one or more polypeptide comprising one or more of the sequences corresponding to one or more of SEQ ID NOS: 6-10 and 263-514, or a substantial subsequence thereof (i.e., at least about 30% of the full length sequence provided). The 10 full set of potential polypeptide immunogens derived from SEQ ID NOS: 6-10 and 263 514 are collectively referred to below as "the immunogenic polypeptides." The resulting antisera is optionally selected to have low cross-reactivity against other related sequences and any such cross-reactivity is removed by immunoabsorbtion with one or more of the related sequences, prior to use of the polyclonal antiserum in the immunoassay. 15 In order to produce antisera for use in an immunoassay, one or more of the immunogenic polypeptides is produced and purified as described herein. For example, recombinant protein may be produced in a bacterial cell line. An inbred strain of mice (used in this assay because results are more reproducible due to the virtual genetic identity of the mice) is immunized with the immunogenic protein(s) in combination with a 20 standard adjuvant, such as Freund's adjuvant, and a standard mouse immunization protocol (see, Harlow and Lane (1988) Antibodies. A Laboory Manual, Cold Spring Harbor Publications, New York, for a standard description of antibody generation, immunoassay formats and conditions that can be used to determine specific immunoreactivity). Alternatively, one or more synthetic or recombinant polypeptide derived from the 25 sequences disclosed herein is conjugated to a carrier protein and used as an immunogen. Polyclonal sera are collected and titered against the immunogenic polypeptide in an immunoassay, for example, a solid phase immunoassay with one or more of the immunogenic proteins immobilized on a solid support. Polyclonal antisera with a titer of 106 or greater are selected, pooled and subtracted with related polypeptides, 30 e.g., those identified from GENBANK as noted, to produce subtracted pooled titered polyclonal antisera. The subtracted pooled titered polyclonal antisera are tested for cross reactivity against the related polypeptides. Preferably at least two of the immunogenic GATs are used in this determination, preferably in conjunction with at least two of related -32- WO 02/36782 PCT/USO1/46227 polypeptides, to identify antibodies which are specifically bound by the immunogenic protein(s). In this comparative assay, discriminatory binding conditions are determined for the subtracted titered polyclonal antisera which result in at least about a 5-10 fold 5 higher signal to noise ratio for binding of the titered polyclonal antisera to the immunogenic GAT polypeptides as compared to binding to the related polypeptides. That is, the stringency of the binding reaction is adjusted by the addition of non-specific competitors such as albumin or non-fat dry milk, or by adjusting salt conditions, temperature, or the like. These binding conditions are used in subsequent assays for 10 determining whether a test polypeptide is specifically bound by the pooled subtracted polyclonal antisera. In particular, test polypeptides which show at least a 2-5x higher signal to noise ratio than the control polypeptides under discriminatory binding conditions, and at least about a 1/2 signal to noise ratio as compared to the immunogenic polypeptide(s), shares substantial structural similarity with the immunogenic polypeptide 15 as compared to known GAT, and is, therefore a polypeptide of the invention. In another example, immunoassays in the competitive binding format are used for detection of a test polypeptide. For example, as noted, cross-reacting antibodies are removed from the pooled antisera mixture by immunoabsorbtion with the control GAT polypeptides. The immunogenic polypeptide(s) are then immobilized to a solid support 20 which is exposed to the subtracted pooled antisera. Test proteins are added to the assay to compete for binding to the pooled subtracted antisera. The ability of the test protein(s) to compete for binding to the pooled subtracted antisera as compared to the immobilized protein(s) is compared to the ability of the immunogenic polypeptide(s) added to -the assay to compete for binding (the immunogenic polypeptides compete effectively with the 25 immobilized immunogenic polypeptides for binding to the pooled antisera). The percent cross-reactivity for the test proteins is calculated, using standard calculations. In a parallel assay, the ability of the control proteins to compete for binding to the pooled subtracted antisera is optionally determined as compared to the ability of the immunogenic polypeptide(s) to compete for binding to the antisera. Again, the percent 30 cross-reactivity for the control polypeptides is calculated, using standard calculations. Where the percent cross-reactivity is at least 5-10x as high for the test polypeptides, the test polypeptides are said to specifically bind the pooled subtracted antisera. In general, the immunoabsorbed and pooled antisera can be used in a competitive binding immunoassay as described herein to compare any test polypeptide to - 33 - WO 02/36782 PCT/USO1/46227 the immunogenic polypeptide(s). In order to make this comparison, the two polypeptides are each assayed at a wide range of concentrations and the amount of each polypeptide required to inhibit 50% of the binding of the subtracted antisera to the immobilized protein is determined using standard techniques. If the amount of the test polypeptide required is 5 less than twice the amount of the immunogenic polypeptide that is required, then the test polypeptide is said to specifically bind to an antibody generated to the immunogenic protein, provided the amount is at least about 5-10x as high as for a control polypeptide. As a final determination of specificity, the pooled antisera is optionally fully immunosorbed with the immunogenic polypeptide(s) (rather than the control 10 polypeptides) until little or no binding of the resulting immunogenic polypeptide subtracted pooled antisera to the immunogenic polypeptide(s) used in the immunosorbtion is detectable. This fully immunosorbed antisera is then tested for reactivity with the test polypeptide. If little or no reactivity is observed (i.e., no more than 2x the signal to noise ratio observed for binding of the fully immunosorbed antisera to the immunogenic 15 polypeptide), then the test polypeptide is specifically bound by the antisera elicited by the immunogenic protein. GLYPHOSATE N-ACETYLTRAISFERASE POLYNUCLLL TIDES In one aspect, the invention provides a novel family of isolated or recombinant polynucleotides referred to herein as "glyphosate N-acetyltransferase 20 polynucleotides" or "GAT polynucleotides." GAT polynucleotide sequences are characterized by the ability to encode a GAT polypeptide. In general, the invention includes any nucleotide sequence that encodes any of the novel GAT polypeptides described herein. In some aspects of the invention, a GAT polynucleotide that encodes a GAT polypeptide with GAT activity is preferred. 25 In one aspect, the GAT polynucleotides comprise recombinant or isolated forms of naturally occurring nucleic acids isolated from an organism, e,g, a bacterial strain. Exemplary GAT polynucleotides, e.g., SEQ ID NO:1 to SEQ ID NO:5, were discovered by expression cloning of sequences from Bacillus strains exhibiting GAT activity. Briefly, a collection of approximately 500 Bacillus and Pseudomonas strains 30 were screened for native ability to N-acetylate glyphosate. Strains were grown in LB overnight, harvested by centrifugation, permeabilizied in dilute toluene, and then washed and resuspended in a reaction mix containing buffer, 5 mM glyphosate, and 200 pM acetyl-CoA. The cells were incubated in the reaction mix for between 1 and 48 hours, at which time an equal volume of methanol was added to the reaction. The cells were then -34- WO 02/36782 PCT/US01/46227 pelleted by centrifugation and the supernatant was filtered before analysis by parent ion mode mass spectrometry. The product of the reaction was positively identified as N acetylglyphosate by comparing the mass spectrometry profile of the reaction mix to an N acetylglyphosate standard as shown in Figure 2. Product detection was dependent on 5 inclusion of both substrates (acetylCoA and glyphosate) and was abolished by heat denaturing the bacterial cells. Individual GAT polynucleotides were then cloned from the identified strains by functional screening. Genomic DNA was prepared and partially digested with Sau3A1 enzyme. Fragments of approximately 4 Kb were cloned into an E. coli expression 10 vector and transformed into electrocompetent E. coli. Individual clones exhibiting GAT activity were identified by mass spectrometry following a reaction as described previously except that the toluene wash was replaced by permeabilization with PMBS. Genomic fragments were sequenced and the putative GAT polypeptide-encoding open reading frame identified. Identity of the GAT gene was confirmed by expression of the open 15 reading frame in E. coli and detection of high levels of N-acetylglyphosate produced from reaction mixtures. In another aspect of the invention, GAT polynucleotides are produced by diversifying, e.g., recombining and/or mutating one or more naturally occurring, isolated, or recombinant GAT polynucleotides. As described in more detail elsewhere herein, it is 20 often possible to generate diversified GAT polynucleotides encoding GAT polypeptides with superior functional attributes, e.g., increased catalytic function, increased stability, higher expression level, than a GAT polynucleotide used as a substrate or parent in the diversification process. The polynucleotides of the invention have a variety of uses in, for example: 25 recombinant production (i.e., expression) of the GAT polypeptides of the invention; as transgenes (e.g., to confer herbicide resistance in transgenic plants); as selectable markers for transformation and plasmid maintenance; as immunogens; as diagnostic probes for the presence of complementary or partially complementary nucleic acids (including for detection of natural GAT coding nucleic acids; as substrates for further diversity 30 generation, e.g., recombination reactions or mutation reactions to produce new and/or improved GAT homologues, and the like. It is important to note that certain specific, substantial and credible utilities of GAT polynucleotides do not require that the polynucleotide encode a polypeptide with substantial GAT activity. For example, GAT polynucleotides that do not encode active -35- WO 02136782 PCT/US01/46227 enzymes can be valuable sources of parental polynucleotides for use in diversification procedures to arrive at GAT polynucleotide variants, or non-GAT polynucleotides, with desirable functional properties (e.g., high kcat or kcat/Km, low Ki, high stability towards heat or other environmental factor, high transcription or translation rates, resistance to 5 proteolytic cleavage, reducing antigenicity, etc.). For example, nucleotide sequences encoding protease variants with little or no detectable activity have been used as parent polynucleotides in DNA shuffling experiments to produce progeny encoding highly active proteases (Ness et al. (1999) Nature Biotechnology 17:893-96). Polynucleotide sequences produced by diversity generation methods or 10 recursive sequence recombination ("RSR") methods (e.g., DNA shuffling) are a feature of the invention. Mutation and recombination methods using the nucleic acids described herein are a feature of the invention. For example, one method of the invention includes recursively recombining one or more nucleotide sequences of the invention as described above and below with one or more additional nucleotides. The recombining steps are 15 optionally performed in vivo, ex vivo, in silico or in vitro. Said diversity generation or recursive sequence recombination produces at least one library of recombinant modified GAT polynucleotides. Polypeptides encoded by members of this library are included in the invention. Also contemplated are uses of polynucleotides, also referred to herein as 20 oligonucleotides, typically having at least 12 bases, preferably at least 15, more preferably at least 20, 30, or 50 or more bases, which hybridize under stringent or highly stringent conditions to a GAT polynucleotide sequence. The polynucleotides may be used as probes, primers, sense and antisense agents, and the like, according to methods as noted herein. 25 In accordance with the present invention, GAT polynucleotides, including nucleotide sequences that encode GAT poolypeptides, fragments of GAT polypeptides, related fusion proteins, or functional equivalents thereof, are used in recombinant DNA molecules that direct the expression of the GAT polypeptides in appropriate host cells, such as bacterial or plant cells. Due to the inherent degeneracy of the genetic code, other 30 nucleic acid sequences which encode substantially the same or a functionally equivalent amino acid sequence can also be used to clone and express the GAT polynucleotides. The invention provides GAT polynucleotides that encode transcription and/or translation product that are subsequently spliced to ultimately produce functional GAT polypeptides. Splicing can be accomplished in vitro or in vivo, and can involve cis - 36 - WO 02136782 PCT/US01/46227 or trans splicing. The substrate for splicing can be polynucleotides (e.g., RNA transcripts) or polypeptides. An example of cis splicing of a polynucleotide is where an intron inserted into a coding sequence is removed and the two flanking exon regions are spliced to generate a GAT polypeptide encoding sequence. An example of trans splicing would 5 be where a GAT polynucleotide is encrypted by separating the coding sequence into two or more fragments that can be separately transcribed and then spliced to form the full length GAT encoding sequence. The use of a splicing enhancer sequence (which can be introduced into a construct of the invention) can facilitate splicing either in cis or trans. Cis and trans splicing of polypeptides are described in more detail elsehwhere herein. 10 More detailed description of cis and trans splicing can be found in US patent application Nos. 09/517,933 and 09/710,686. Thus, some GAT polynucleotides do not directly encode a full-length GAT polypeptide, but rather encode a fragment or fragments of a GAT polypeptide. These GAT polynucleotides can be used to express a functional GAT polypeptide through a 15 mechanism involving splicing, where splicing can occur at the level of polynucleotide (e.g., intron/exon) and/or polypeptide (e.g., intein/extein). This can be useful, for example, in controlling expression of GAT activity, since functional GAT polypeptide will only be expressed if all required fragments are expressed in an environment that permits splicing processes to generate functional product. In another example, introduction of one 20 or more insertion sequences into a GAT polynucleotide can facilitate recombination with a low homology polynucleotide; use of an intron or intein for the insertion sequence facilitates the removal of the intervening sequence, thereby restoring function of the encoded variant As will be understood by those of skill in the art, it can be advantageous to 25 modify a coding sequence to enhance its expression in a particular host. The genetic code is redundant with 64 possible codons, but most organisms preferentially use a subset of these codons. The codons that are utilized most often in a species are called optimal codons, and those not utilized very often are classified as rare or low-usage codons (see, e.g., Zhang SP et al. (1991) Gene 105:61-72). Codons can be substituted to reflect the 30 preferred codon usage of the host, a process sometimes called "codon optimization" or "controlling for species codon bias." Optimized coding sequence containing codons preferred by a particular prokaryotic or eukaryotic host (see also, Murray, E. et al. (1989) Nuc. Acids Res. 17:477 508) can be prepared, for example, to increase the rate of translation or to produce - 37 - WO 02136782 PCT/USO1/46227 recombinant RNA transcripts having desirable properties, such as a longer half-life, as compared with transcripts produced from a non-optimized sequence. Translation stop codons can also be modified to reflect host preference. For example, preferred stop codons for S. cerevisiae and mammals are UAA and UGA respectively. The preferred 5 stop codon for monocotyledonous plants is UGA, whereas insects and E. col prefer to use UAA as the stop codon (Dalphin ME et al. (1996) Nuc. Acids Res. 24: 216-218). Methodology for optimizing a nucleotide sequence for expression in a plant is provided, for example, in U.S. Patent No. 6,015,891, and references cited therein. One embodiment of the invention includes a GAT polynucleotide having 10 optimal codons for expression in a relevant host, e.g., a transgenic plant host. This is particularly desirable when a GAT polynucleotide of bacterial origin is introduced into a transgenic plant, e.g., to confer glyphosate resistance to the plant. The polynucleotide sequences of the present invention can be engineered in order to alter a GAT polynucleotide for a variety of reasons, including but not limited to, 15 alterations which modify the cloning, processing and/or expression of the gene product. For example, alterations may be introduced using techniques that are well known in the art, e.g., site-directed mutagenesis, to insert new restriction sites, alter glycosylation patterns, change codon preference, introduce splice sites, etc. As described in more detail herein, the polynucleotides of the invention 20 include sequences which encode novel GAT polypeptides and sequences complementary to the coding sequences, and novel fragments of coding sequence and complements thereof. The polynucleotides can be in the form of RNA or in the form of DNA, and include mRNA, cRNA, synthetic RNA and DNA, genomic DNA and cDNA. The polynucleotides can be double-stranded or single-stranded, and if single-stranded, can be 25 the coding strand or the non-coding (anti-sense, complementary) strand. The polynucleotides optionally include the coding sequence of a GAT polypeptide (i) in isolation, (ii) in combination with additional coding sequence, so as to encode, e.g., a fusion protein, a pre-protein, a prepro-protein, or the like, (iii) in combination with non coding sequences, such as introns or inteins, control elements such as a promoter, an 30 enhancer, a terminator element, or 5' and/or 3' untranslated regions effective for expression of the coding sequence in a suitable host, and/or (iv) in a vector or host environment in which the GAT polynucleotide is a heterologous gene. Sequences can also be found in combination with typical compositional formulations of nucleic acids, including in the presence of carriers, buffers, adjuvants, excipients and the like. - 38 - WO 02136782 PCTUS01/46227 Polynucleotides and oligonucleotides of the invention can be prepared by standard solid-phase methods, according to known synthetic methods. Typically, fragments of up to about 100 bases are individually synthesized, then joined (e.g., by enzymatic or chemical ligation methods, or polymerase mediated methods) to form 5 essentially any desired continuous sequence. For example, polynucleotides and oligonucleotides of the invention can be prepared by chemical synthesis using, e.g., the classical phosphoramidite method described by Beaucage et al. (1981) Tetrahedron Letters 22:1859-69, or the method described by Matthes et al. (1984) EMBO J. 3: 801-05., e.g., as is typically practiced in automated synthetic methods. According to the 10 phosphoramidite method, oligonucleotides are synthesized, e.g., in an automatic DNA synthesizer, purified, annealed, ligated and cloned in appropriate vectors. In addition, essentially any nucleic acid can be custom ordered from any of a variety of commercial sources, such as The Midland Certified Reagent Company (mcrc@oligos.com), The Great American Gene Company (http://www.genco.com), 15 ExpressGen Inc. (www.expressgen.com), Operon Technologies Inc. (Alameda, CA) and many others. Similarly, peptides and antibodies can be custom ordered from any of a variety of sources, such as PeptidoGenic (pkim@ccnet.com), HTI Bio-products, Inc. (http://www.htibio.com), BMA Biomedicals Ltd (U.K.), Bio.Synthesis, Inc., and many others. 20 Polynucleotides may also be synthesized by well-known techniques as described in the technical literature. See, e.g., Carruthers et al., Cold Spring Harbor Symp. Quant. Biol. 47:411-418 (1982), and Adams et al., J. An. Chem. Soc. 105:661 (1983). Double stranded DNA fragments may then be obtained either by synthesizing the complementary strand and annealing the strands together under appropriate conditions, or 25 by adding the complementary strand using DNA polymerase with an appropriate primer sequence.
WO 02/36782 PCT/USO1/46227 sufficient to direct persons of skill through in vitro amplification methods, including the polymerase chain reaction (PCR) the ligase chain reaction (LCR), QS-replicase amplification and other RNA polymerase mediated techniques (e.g., NASBA) are found in Berger, Sambrook, and Ausubel, as well as Mullis et al., (1987) U.S. Patent No. 5 4,683,202; PCR Protocols A Guide to methods and Aplications (Innis et al., eds.) Academic Press Inc. San Diego, CA (1990); Arnheim & Levinson (October 1, 1990) Chemical and Enzineering News 36-47; The Journal Of NIH Research (1991) 3:81-94; Kwoh et al. (1989) Proc. Natl. Acad. Sci. USA 86:1173; Guatelli et al. (1990) Proc. Natl. Acad. Sci. USA 87:1874; Lomell et al. (1989) J. Clin. Chem. 35:1826; Landegren et al., 10 (1988) Science 241:1077-1080; Van Brunt (1990) Biotechnology 8:291-294; Wu and Wallace, (1989) Gene 4:560; Baninger et al. (1990) Gene 89:117, and Sooknanan and Malek (1995) Biotechnology 13:563-564. Improved methods of cloning in vitro amplified nucleic acids are described in Wallace et al., U.S. Pat. No. 5,426,039. Improved methods of amplifying large nucleic acids by PCR are summarized in Cheng et al. (1994) Nature 15 369:684-685 and the references therein, in which PCR amplicons of up to 40kb are generated. One of skill will appreciate that essentially any RNA can be converted into a double stranded DNA suitable for restriction digestion, PCR expansion and sequencing using reverse transcriptase and a polymerase. See, Ausbel, Sambrook and Berger, all supra. 20 Sequence Variations It will be appreciated by those skilled in the art that due to the degeneracy of the genetic code, a multitude of nucleotide sequences encoding GAT polypeptides of the invention may be produced, some of which bear substantial identity to the nucleic acid sequences explicitly disclosed herein. -40 - WO 02/36782 PCTIUSO1/46227 Table 1 Codon Table Amino acids Codon Alanine Ala A OGCA GCC GCG GCU Cysteine Cys C UGC UGU Aspartic acid Asp D GAC GAU Glutamic acid Glu E GAA GAG Phenylalanine Phe F LTUC UUU Glycine Gly G GGA GGC GGG GGU Histidine His H CAC CAU Isoleucine Ile I AUA AUC AUU Lysine Lys K AAA AAG Leucine Leu L UUA UUG CUA CUC CUG CUU Methionine Met M AUG Asparagine Asn N AAC AAU Proline Pro P CCA CCC CCG CCU Glutamine Gln Q CAA CAG Arginine Arg R AGA AGG CGA CGC CGG CGU Serine Ser S AGC AGU UCA UCC UCG UCU Threonine Thr T ACA ACC ACG ACU Valine Val V GUA GUC GUG GUU Tryptophan Trp W UGG Tyrosine Tyr Y UAC UAU 5 For instance, inspection of the codon table (Table 1) shows that codons AGA, AGG, CGA, CGC, CGG, and CGU all encode the amino acid arginine. Thus, at every position in the nucleic acids of the invention where an arginine is specified by a codon, the codon can be altered to any of the corresponding codons described above without altering the encoded polypeptide. It is understood that U in an RNA sequence 10 corresponds to T in a DNA sequence. Using, as an example, the nucleic acid sequence corresponding to nucleotides 1-15 of SEQ ID NO:1, ATG ATT GAA GTC AAA, a silent variation of this sequence includes AGT ATC GAG GTG AAG, both sequences which encode the amino acid sequence MIEVK, corresponding to amino acids 1-5 of SEQ ID NO:6. 15 Such "silent variations" are one species of "conservatively modified variations", discussed below. One of sId11 will recognize that each codon in a nucleic acid (except AUG, which is ordinarily the only codon for methionine) can be modified by standard techniques to encode a functionally identical polypeptide. Accordingly, each silent variation of a nucleic acid which encodes a polypeptide is implicit in any described 20 sequence. The invention provides each and every possible variation of nucleic acid -41- WO 02/36782 PCT/USO1/46227 sequence encoding a polypeptide of the invention that could be made by selecting combinations based on possible codon choices. These combinations are made in accordance with the standard triplet genetic code (e.g., as set forth in Table 1) as applied to the nucleic acid sequence encoding a GAT homologue polypeptide of the invention. All 5 such variations of every nucleic acid herein are specifically provided and described by consideration of the sequence in combination with the genetic code. Any variant can be produced as noted herein. A group of two or more different codons that, when translated in the same context, all encode the same amino acid, are referred to herein as "synonoumous codons." 10 As described herein, in some aspects of the invention a GAT polynucleotide is engineered for optimized codon usage in a desired host organism, for example a plant host. The term "optimized" or "optimal" are not meant to be restricted to the very best possible combination of codons, but simple indicates that the coding sequence as a whole possesses an improved usage of codons relative to a precursor polynucleotide from which it was 15 derived. Thus, in one aspect the invention provides a method for producing a GAT polynucleotide variant by replacing at least one parental codon in a nucleotide sequence with a synonomous codon that is preferentially used in a desired host organism, e.g., a plant, relative to the parental codon. "Conservatively modified variations" or, simply, "conservative variations" 20 of a particular nucleic acid sequence refers to those nucleic acids which encode identical or essentially identical amino acid sequences, or, where the nucleic acid does not encode an amino acid sequence, to essentially identical sequences. One of skill will recognize that individual substitutions, deletions or additions which alter, add or delete a single amino acid or a small percentage of amino acids (typically less than 5%, more typically 25 less than 4%, 2% or 1%, or less) in an encoded sequence are "conservatively modified variations" where the alterations result in the deletion of an amino acid, addition of an amino acid, or substitution of an amino acid with a chemically similar amino acid. Conservative substitution tables providing functionally similar amino acids are well known in the art. Table 2 sets forth six groups which contain amino acids that are 30 "conservative substitutions" for one another. -42 - WO 02/36782 PCT/USO1/46227 r - / U L Table 2 Conservative Substitution Groups 1 Alanine (A) Serine (S) Threonine (T) 2 Aspartic acid (D) Glutamic acid (E) 3 Asparagine (N) Glutamine (Q) 4 Arginine (R) Lysine (K) 5 Isoleucine M Leucine (L) Methionine (M) Valine (V) 6 Phenylalanine (F) Tyrosine (Y) Tryptophan (W) 5 Thus, "conservatively substituted variations" of a listed polypeptide sequence of the present invention include substitutions of a small percentage, typically less than 5%, more typically less than 2% and often less than 1%, of the amino acids of the polypeptide sequence, with a conservatively selected amino acid of the same conservative substitution group. 10 For example, a conservatively substituted variation of the polypeptide identified herein as SEQ ID NO:6 will contain "conservative substitutions", according to the six groups defined above, in up to 7 residues (i.e., 5% of the amino acids) in the 146 amino acid polypeptide. In a further example, if four conservative substitutions were localized in 15 the region corresponding to amino acids 21 to 30 of SEQ ID NO:6, examples of conservatively substituted variations of this region, RPN QPL EAC M, include: KPQ QPV ESC M and KPN NPL DAC V and the like, in accordance with the conservative substitutions 20 listed in Table 2 (in the above example, conservative substitutions are underlined). Listing of a protein sequence herein, in conjunction with the above substitution table, provides an express listing of all conservatively substituted proteins. Finally, the addition of sequences which do not alter the encoded activity of a nucleic acid molecule, such as the addition of a non-functional or non-coding sequence, 25 is a conservative variation of the basic nucleic acid. One of skill will appreciate that many conservative variations of the nucleic acid constructs which are disclosed yield a functionally identical construct. For example, as discussed above, owing to the degeneracy of the genetic code, "silent substitutions" -43- WO 02/36782 PCT/US01/46Z27 (i.e., substitutions in a nucleic acid sequence which do not result in an alteration in an encoded polypeptide) are an implied feature of every nucleic acid sequence which encodes an amino acid. Similarly, "conservative amino acid substitutions," in one or a few amino acids in an amino acid sequence are substituted with different amino acids with highly 5 similar properties, are also readily identified as being highly similar to a disclosed construct. Such conservative variations of each disclosed sequence are a feature of the present invention. Non-conservative modifications of a particular nucleic acid are those which substitute any amino acid not characterized as a conservative substitution. For example, 10 any substitution which crosses the bounds of the six groups set forth in Table 2. These include substitutions of basic or acidic amino acids for neutral amino acids, (e.g., Asp, Glu, Asn, or Gin for Val, Ile, Leu or Met), aromatic amino acid for basic or acidic amino acids (e.g., Phe, Tyr or Trp for Asp, Asn, Glu or Gln) or any other substitution not replacing an amino acid with a like amino acid. 15 Nucleic Acid Hybridization Nucleic acids "hybridize" when they associate, typically in solution. Nucleic acids hybridize due to a variety of well-characterized physico-chemical forces, such as hydrogen bonding, solvent exclusion, base stacking and the like. An extensive guide to the hybridization of nucleic acids is found in Tijssen (1993) Laboratory 20 Techniques in Biochemistry and Molecular Biology--Hybridization with Nucleic Acid Probes, part I, chapter 2, "Overview of principles of hybridization and the strategy of nucleic acid probe assays," (Elsevier, New York), as well as in Ausubel, supra, Hames and Higgins (1995) Gene Probes 1, IRL Press at Oxford University Press, Oxford, England (Hames and Higgins 1) and Hames and Higgins (1995) Gene Probes 2, IRL Press 25 at Oxford University Press, Oxford, England (Hames and Higgins 2) provide details on the synthesis, labeling, detection and quantification of DNA and RNA, including oligonucleotides. "Stringent hybridization wash conditions" in the context of nucleic acid hybridization experiments, such as Southern and northern hybridizations, are sequence 30 dependent, and are different under different environmental parameters. An extensive guide to the hybridization of nucleic acids is found in Tijssen (1993), supra, and in Hames and Higgins 1 and Hames and Higgins 2, supra. For purposes of the present invention, generally, "highly stringent" hybridization and wash conditions are selected to be about 5*C or less lower than the -44- WO 02/36782 PCT/US01/46227 thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH (as noted below, highly stringent conditions can also be referred to in comparative terms). The Tm is the temperature (under defined ionic strength and pH) at which 50% of the test sequence hybridizes to a perfectly matched probe. Very stringent conditions are selected 5 to be equal to the Tm for a particular probe. The Tm of a nucleic acid duplex indicates the temperature at which the duplex is 50% denatured under the given conditions and its represents a direct measure of the stability of the nucleic acid hybrid. Thus, the Tm corresponds to the temperature corresponding to the midpoint in transition from helix to random coil; it depends on 10 length, nucleotide composition, and ionic strength for long stretches of nucleotides. After hybridization, unhybridized nucleic acid material can be removed by a series of washes, the stringency of which can be adjusted depending upon the desired results. Low stringency washing conditions (e.g., using higher salt and lower temperature) increase sensitivity, but can product nonspecific hybridization signals and high 15 background signals. Higher stringency conditions (e.g., using lower salt and higher temperature that is closer to the hybridization temperature) lowers the background signal, typically with only the specific signal remaining. See Rapley, R. and Walker, J.M. eds., Molecular Biomethods Handbook (Humana Press, Inc. 1998) (hereinafter "Rapley and Walker"), which is incorporated herein by reference in its entirety for all purposes. 20 The Tm of a DNA-DNA duplex can be estimated using Equation 1 as follows: Tm (cC) = 81.5*C + 16.6 (logioM) + 0.41 (%G + C) -0.72 (%f) - 500/n, where M is the molarity of the monovalent cations (usually Na+), (%G + C) is the percentage of guanosine (G) and cystosine (C) nucleotides, (%f) is the percentage 25 of formalize and n is the number of nucleotide bases (i.e., length) of the hybrid. See Rapley and Walker, supra. The Tm of an RNA-DNA duplex can be estimated by using Equation 2 as follows: Tm ("C) = 79.8 0 C + 18.5 (logioM) + 0.58 (%G + C) - 11.8(%G + C) 0.56 30 (%f) - 820/n,where M is the molarity of the monovalent cations (usually Na+), (%G + C)is the percentage of guanosine (G ) and cystosine (C) nucleotides, (%f) is the percentage of formamide and n is the number of nucleotide bases (i.e., length) of the hybrid. Id Equations 1 and 2 are typically accurate only for hybrid duplexes longer than about 100-200 nucleotides. Id. -45- WO 02/36782 PCT/US01/46227 The Tm of nucleic acid sequences shorter than 50 nucleotides can be calculated as follows: Tm (*C) = 4(G + C) + 2(A + T), where A (adenine), C, T (thymine), and G are the numbers of the 5 corresponding nucleotides. An example of stringent hybridization conditions for hybridization of complementary nucleic acids which have more than 100 complementary residues on a filter in a Southern or northern blot is 50% formalin with 1 mg of heparin at 42*C, with the hybridization being carried out overnight. An example of stringent wash conditions is 10 a 0.2x SSC wash at 65*C for 15 minutes (see Sambrook, supra for a description of SSC buffer). Often the high stringency wash is preceded by a low stringency wash to remove background probe signal. An example low stringency wash is 2x SSC at 40*C for 15 minutes. In general, a signal to noise ratio of 2.5x-5x (or higher) than that observed 15 for an unrelated probe in the particular hybridization assay indicates detection of a specific hybridization. Detection of at least stringent hybridization between two sequences in the context of the present invention indicates relatively strong structural similarity or homology to, e.g., the nucleic acids of the present invention provided in the sequence listings herein. 20 As noted, "highly stringent" conditions are selected to be about 5* C or less lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. Target sequences that are closely related or identical to the nucleotide sequence of interest (e.g., "probe") can be identified under highly stringent conditions. Lower stringency conditions are appropriate for sequences that are less complementary. 25 See, e.g., Rapley and Walker, supra. Comparative hybridization can be used to identify nucleic acids of the invention, and this comparative hybridization method is a preferred method of distinguishing nucleic acids of the invention. Detection of highly stringent hybridization between two nucleotide sequences in the context of the present invention indicates 30 relatively strong structural similarity/homology to, e.g., the nucleic acids provided in the sequence listing herein. Highly stringent hybridization between two nucleotide sequences demonstrates a degree of similarity or homology of structure, nucleotide base composition, arrangement or order that is greater than that detected by stringent hybridization -46- WO 02/36782 PCT/US01/46227 conditions. In particular, detection of highly stringent hybridization in the context of the present invention indicates strong structural similarity or structural homology (e.g., nucleotide structure, base composition, arrangement or order) to, e.g., the nucleic acids provided in the sequence listings herein. For example, it is desirable to identify test 5 nucleic acids that hybridize to the exemplar nucleic acids herein under stringent conditions. Thus, one measure of stringent hybridization is the ability to hybridize to one of the listed nucleic acids (e.g., nucleic acid sequences SEQ ID NO: 1 to SEQ ID NO:5 and SEQ ID NO:11 to SEQ I) NO:262, and complementary polynucleotide sequences 10 thereof), under highly stringent conditions (or very stringent conditions, or ultra-high stringency hybridization conditions, or ultra-ultra high stringency hybridization conditions). Stringent hybridization (as well as highly stringent, ultra-high stringency, or ultra-ultra high stringency hybridization conditions) and wash conditions can easily be determined empirically for any test nucleic acid. For example, in determining highly 15 stringent hybridization and wash conditions, the hybridization and wash conditions are gradually increased (e.g., by increasing temperature, decreasing salt concentration, increasing detergent concentration and/or increasing the concentration of organic solvents, such as formalin, in the hybridization or wash), until a selected set of criteria are met. For example, the hybridization and wash conditions are gradually increased until a probe 20 comprising one or more nucleic acid sequences selected from SEQ ID NO: 1 to SEQ ID NO:5 and SEQ ID NO:11 to SEQ ID NO:262, and complementary polynucleotide sequences thereof, binds to a perfectly matched complementary target (again, a nucleic acid comprising one or more nucleic acid sequences selected from SEQ ID NO: 1 to SEQ ID NO:5 and SEQ ID NO: 11 to SEQ ID NO:262, and complementary polynucleotide 25 sequences thereof), with a signal to noise ratio that is at least about 2.5x, and optionally about 5x or more as high as that observed for hybridization of the probe to an unmatched target. In this case, the unmatched target is a nucleic acid corresponding to a nucleic acid (other than those in the accompanying sequence listing) that is present in a public database such as GenBankTM at the time of filing of the subject application. Such sequences can be 30 identified in GenBank by one of skill. Examples include Accession Nos. Z99109 and Y09476. Additional such sequences can be identified in e.g., GenBank, by one of ordinary skill in the art A test nucleic acid is said to specifically hybridize to a probe nucleic acid when it hybridizes at least W as well to the probe as to the perfectly matched -47- WO 02/36782 PCT/USO1/46227 complementary target, i.e., with a signal to noise ratio at least as high as hybridization of the probe to the target under conditions in which the perfectly matched probe binds to the perfectly matched complementary target with a signal to noise ratio that is at least about 2x-10x, and occasionally 20x, 50x or greater than that observed for hybridization to 5 any of the unmatched polynucleotides Accession Nos. Z99109 and Y09476. Ultra high-stringency hybridization and wash conditions are those in which the stringency of hybridization and wash conditions are increased until the signal to noise ratio for binding of the probe to the perfectly matched complementary target nucleic acid is at least 10x as high as that observed for hybridization to any of the unmatched target 10 nucleic acids Genbank Accession numbers Z99109 and Y09476. A target nucleic acid which hybridizes to a probe under such conditions, with a signal to noise ratio of at least /2 that of the perfectly matched complementary target nucleic acid is said to bind to the probe under ultra-high stringency conditions. Similarly, even higher levels of stringency can be determined by gradually 15 increasing the hybridization and/or wash conditions of the relevant hybridization assay. For example, those in which the stringency of hybridization and wash conditions are increased until the signal to noise ratio for binding of the probe to the perfectly matched complementary target nucleic acid is at least 1Ox, 20X, 50X, 100X, or 500X or more as high as that observed for hybridization to any of the unmatched target nucleic acids 20 Genbank Accession numbers Z99109 and Y09476. A target nucleic acid which hybridizes to a probe under such conditions, with a signal to noise ratio of at least that of the perfectly matched complementary target nucleic acid is said to bind to the probe under ultra-ultra-high stringency conditions. Target nucleic acids which hybridize to the nucleic acids represented by 25 SEQ ED NO:1 to SEQ ID NO:5 and SEQ ID NO:11 to SEQ ID NO:262 under high, ultra high and ultra-ultra high stringency conditions are a feature of the invention. Examples of such nucleic acids include those with one or a few silent or conservative nucleic acid substitutions as compared to a given nucleic acid sequence. Nucleic acids which do not hybridize to each other under stringent 30 conditions are still substantially identical if the polypeptides which they encode are substantially identical. This occurs, e.g., when a copy of a nucleic acid is created using the maximum codon degeneracy permitted by the genetic code, or when antisera or antiserum generated against one or more of SEQ ID NO:6 to SEQ ID NO:10 and SEQ ID NO:263 to SEQ ID NO:514, which has been subtracted using the polypeptides encoded by known -48- WO 02/36782 PCT/US01/46227 nucleotide sequences, including Genbank Accession number CAA70664. Further details on immunological identification of polypeptides of the invention are found below. Additionally, for distinguishing between duplexes with sequences of less than about 100 nucleotides, a TMAC1 hybridization procedure known to those of ordinary skill in the art 5 can be used. See, e.g., Sorg, U. et al. 1 Nucleic Acids Res. (Sept 11, 1991) 19(17), incorporated herein by reference in its entirety for all purposes. In one aspect, the invention provides a nucleic acid which comprises a unique subsequence in a nucleic acid selected from SEQ ID NO:1 to SEQ ID NO:5 and SEQ ID NO:11 to SEQ ID NO:262. The unique subsequence is unique as compared to a 10 nucleic acid corresponding to any of Genbank Accession numbers Z99109 and Y09476. Such unique subsequences can be determined by aligning any of SEQ ID NO: 1 to SEQ ID NO:5 and SEQ ID NO: 11 to SEQ ID NO:262 against the complete set of nucleic acids represented by GenBank accession numbers Z99109, Y09476 or other related sequences available in public databases as of the filing date of the subject application. Alignment 15 can be performed using the BLAST algorithm set to default parameters. Any unique subsequence is useful, e.g., as a probe to identify the nucleic acids of the invention. Similarly, the invention includes a polypeptide which comprises a unique subsequence in a polypeptide selected from: SEQ ID NO:6 to SEQ ID NO:10 and SEQ ID NO:263 to SEQ ID NO:514. Here, the unique subsequence is unique as compared to a 20 polypeptide corresponding to GenBank accession number CAA70664. Here again, the polypeptide is aligned against the sequences represented by accession number CAA70664. Note that if the sequence corresponds to a non-translated sequence such as a pseudo gene, the corresponding polypeptide is generated simply by in silico translation of the nucleic acid sequence into an amino acid sequence, where the reading frame is selected to 25 correspond to the reading frame of homologous GAT polynucleotides. The invention also provides for target nucleic acids which hybridizes under stringent conditions to a unique coding oligonucleotide which encodes a unique subsequence in a polypeptide selected from SEQ ID NO:6 to SEQ ID NO:10 and SEQ ID NO:263 to SEQ ID NO:514, wherein the unique subsequence is unique as compared to a 30 polypeptide corresponding to any of the control polypeptides. Unique sequences are determined as noted above. In one example, the stringent conditions are selected such that a perfectly complementary oligonucleotide to the coding oligonucleotide hybridizes to the coding oligonucleotide with at least about a 2.5x-10x higher, preferably at least about a 5-10x -49 - WO 02136782 PCT/US01/46227 higher signal to noise ratio than for hybridization of the perfectly complementary oligonucleotide to a control nucleic acid corresponding to any of the control polypeptides. Conditions can be selected such that higher ratios of signal to noise are observed in the particular assay which is used, e.g., about 15x, 20x, 30x, 50x or more. In this example, the 5 target nucleic acid hybridizes to the unique coding oligonucleotide with at least a 2x higher signal to noise ratio as compared to hybridization of the control nucleic acid to the coding oligonucleotide. Again, higher signal to noise ratios can be selected, e.g., about 2.5x, 5x, 10x, 20x, 30x, 50x or more. The particular signal will depend on the label used in the relevant assay, e.g., a fluorescent label, a colorimetric label, a radioactive label, or 10 the like. Vectors, Promoters and Expression Systems. The present invention also includes recombinant constructs comprising one or more of the nucleic acid sequences as broadly described above. The constructs comprise a vector, such as, a plasmid, a cosmid, a phage, a virus, a bacterial artificial 15 chromosome (BAC), a yeast artificial chromosome (YAC), or the like, into which a nucleic acid sequence of the invention has been inserted, in a forward or reverse orientation. In a preferred aspect of this embodiment, the construct further comprises regulatory sequences, including, for example, a promoter, operably linked to the sequence. Large numbers of suitable vectors and promoters are known to those of skill in the art, and 20 are commercially available. General texts which describe molecular biological techniques useful herein, including the use of vectors, promoters and many other relevant topics, include Berger and Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology volume 152 Academic Press, Inc., San Diego, CA (Berger); Sambrook et al., Molecular Cloning - A 25 Laboratory Manual (2nd Ed.), Vol. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 1989 ("Sambrook") and Current Protocols in Molecular Biology FM. Ausubel et al., eds., Current Protocols, a joint venture between Greene Publishing Associates, Inc. and John Wiley & Sons, Inc., (supplemented through 1999) ("Ausubel"). Examples of protocols sufficient to direct persons of skill through in vitro amplification 30 methods, including the polymerase chain reaction (PCR) the ligase chain reaction (LCR), Qp-replicase amplification and other RNA polymerase mediated techniques (e.g., NASBA), e.g., for the production of the homologous nucleic acids of the invention are found in Berger, Sambrook, and Ausubel, as well as Mullis et al., (1987) U.S. Patent No. 4,683,202; PCR Protocols A Guide to Methods and Applications (Innis et al. eds) -50- WO 02136782 PCT/USO1/46227 Academic Press Inc. San Diego, CA (1990) (Innis); Arnheim & Levinson (October 1, 1990) C&EN 36-47; The Journal Of NIH Research (1991) 3, 81-94; (Kwoh et al. (1989) Proc. Nail. Acad. Sci. USA 86, 1173; Guatelli et al. (1990) Proc. Natl. Acad. Sci. USA 87, 1874; Lomell et al. (1989) 1. Clin. Chem 35, 1826; Landegren et al., (1988) Science 241, 5 1077-1080; Van Brunt (1990) Biotechnologv 8, 291-294; Wu and Wallace, (1989) Gene 4, 560; Baringer et al. (1990) Gene 89, 117, and Sooknanan and Malek (1995) Biotechnology 13: 563-564. Improved methods for cloning in vitro amplified nucleic acids are described in Wallace et al., U.S. Pat. No. 5,426,039. Improved methods for amplifying large nucleic acids by PCR are summarized in Cheng et al. (1994) Nature 369: 10 684-685 and the references cited therein, in which PCR amplicons of up to 40kb are generated. One of skill will appreciate that essentially any RNA can be converted into a double stranded DNA suitable for restriction digestion, PCR expansion and sequencing using reverse transcriptase and a polymerase. See, e.g., Ausubel, Sambrook and Berger, all supra. 15 The present invention also relates to engineered host cells that are transduced (transformed or transfected) with a vector of the invention (e.g., an invention cloning vector or an invention expression vector), as well as the production of polypeptides of the invention by recombinant techniques. The vector may be, for example, a plasmid, a viral particle, a phage, etc. The engineered host cells can be 20 cultured in conventional nutrient media modified as appropriate for activating promoters, selecting transformants, or amplifying the GAT homologue gene. Culture conditions, such as temperature, pH and the like, are those previously used with the host cell selected for expression, and will be apparent to those skilled in the art and in the references cited herein, including, e.g., Sambrook, Ausubel and Berger, as well as e.g., Freshney (1994) 25 Culture of Animal Cells, a Manual of Basic Technique, third edition, Wiley- Liss, New York and the references cited therein. GAT polypeptides of the invention can be produced in non-animal cells such as plants, yeast, fungi, bacteria and the like. In addition to Sambrook, Berger and Ausubel, details regarding non-animal cell culture can be found in Payne et al. (1992) 30 Plant Cell and Tissue Culture in Liquid Systems John Wiley & Sons, Inc. New York, NY; Gamborg and Phillips (eds) (1995) Plant Cell, Tissue and Organ Culture; Fundamental Methods Springer Lab Manual, Springer-Verlag (Berlin Heidelberg New York) and Atlas and Parks (eds) The Handbook of Microbiological Media (1993) CRC Press, Boca Raton, FL. -51- WO 02136782 PCT/USO1/46227 Polynucleotides of the present invention can be incorporated into any one of a variety of expression vectors suitable for expressing a polypeptide. Suitable vectors include chromosomal, nonchromosomal and synthetic DNA sequences, e.g., derivatives of SV40; bacterial plasmids; phage DNA; baculovirus; yeast plasmids; vectors derived from 5 combinations of plasmids and phage DNA, viral DNA such as vaccinia, adenovirus, fowl pox virus, pseudorabies, adenovirus, adeno-associated virus, retroviruses and many others. Any vector that transduces genetic material into a cell, and, if replication is desired, which is replicable and viable in the relevant host can be used. When incorporated into an expression vector, a polynucleotide of the 10 invention is operatively linked to an appropriate transcription control sequence (promoter) to direct mR.NA synthesis. Examples of such transcription control sequences particularly suited for use in transgenic plants include the cauliflower mosaic virus (CaMV), figwort mosaic virus (FMV) and strawberry vein banding virus (SVBV) promoters, described in U.S. Provisional Application No. 60/245,354. Other promoters known to control 15 expression of genes in prokaryotic or eukaryotic cells or their viruses and which can be used in some embodiments of the invention include SV40 promoter, E. coli lac or trp promoter, phage lambda PL promoter. An expression vector optionally contains a ribosome binding site for translation initiation, and a transcription terminator. The vector also optionally includes appropriate sequences for amplifying expression, e.g., an 20 enhancer. In addition, the expression vectors of the present invention optionally contain one or more selectable marker genes to provide a phenotypic trait for selection of transformed host cells, such as dihydrofolate reductase or neomycin resistance for eukaryotic cell culture, or such as tetracycline or ampicillin resistance in E. coli. Vectors of the present invention can be employed to transform an 25 appropriate host to permit the host to express an invention protein or polypeptide. Examples of appropriate expression hosts include: bacterial cells, such as E. coli, H subtilis, Streptomyces, and Salmonella typhimurium; fungal cells, such as Saccharomyces cerevisiae, Pichia pastoris, and Neurospora crassa; insect cells such as Drosophila and Spodopterafrigiperda; mammalian cells such as CHO, COS, BHK, HEK 293 or Bowes 30 melanoma; or plant cells or explants, etc. It is understood that not all cells or cell lines need to be capable of producing fully functional GAT polypeptides; for example, antigenic fragments of a GAT polypeptide may be produced. The invention is not limited by the host cells employed. -52- WO 02/36782 PCT/US01/46227 In bacterial systems, a number of expression vectors may be selected depending upon the use intended for the GAT polypeptide. For example, when large quantities of GAT polypeptide or fragments thereof are needed for commercial production or for induction of antibodies, vectors which direct high level expression of fusion proteins 5 that are readily purified can be desirable. Such vectors include, but are not limited to, multifunctional E. coli cloning and expression vectors such as BLUESCRIPT (Stratagene), in which the GAT polypeptide coding sequence may be ligated into the vector in-frame with sequences for the amino-terminal Met and the subsequent 7 residues of beta-galactosidase so that a hybrid protein is produced; pIN vectors (Van Heeke & 10 Schuster (1989) J Biol Chem 264:5503-5509); pET vectors (Novagen, Madison WI); and the like. Similarly, in the yeast Saccharomyces cerevisiae a number of vectors containing constitutive or inducible promoters such as alpha factor, alcohol oxidase and PGH may be used for production of the GAT polypeptides of the invention. For reviews, 15- see Ausubel et al. (supra) and Grant et al. (1987; Methods in Enzymology 153:516-544). In mammalian host cells, a variety of expression systems, including viral based systems, may be utilized. In cases where an adenovirus is used as an expression vector, a coding sequence, e.g., of a GAT polypeptide, is optionally ligated into an adenovirus transcription/translation complex consisting of the late promoter and tripartite 20 leader sequence. Insertion of a GAT polypeptide coding region into a nonessential El or E3 region of the viral genome will result in a viable virus capable of expressing a GAT in infected host cells (Logan and Shenk (1984) Proc Natl Acad Sci USA 81:3655-3659). In addition, transcription enhancers, such as the rous sarcoma virus (RSV) enhancer, may be used to increase expression in mammalian host cells. 25 Similarly, in plant cells, expression can be driven from a transgene integrated into a plant chromosome, or cytoplasmically from an episomal or viral nucleic acid. In the case of stably integrated transgenes, it is often desirable to provide sequences capable of driving constitutive or inducible expression of the GAT polynucleotides of the invention, for example, using viral, e.g., CaMV, or plant derived regulatory sequences. 30 Numerous plant derived regulatory sequences have been described, including sequences which direct expression in a tissue specific manner, e.g., TobRB7, patatin B33, GRP gene promoters, the rbcS-3A promoter, and the like. Alternatively, high level expression can be achieved by transiently expressing exogenous sequences of a plant viral vector, e.g., TMV, BMV, etc. Typically, transgenic plants constitutively expressing a GAT polynucleotide of - 53 - WO 02/36782 PCT/USO1/46227 the invention will be preferred, and the regulatory sequences selected to insure constitutive stable expression of the GAT polypeptide. In some embodiments of the present invention, a GAT polynucleotide construct suitable for transformation of plant cells is prepared. For example, a desired 5 GAT polynucleotide can be incorporated into a recombinant expression cassette to facilitate introduction of the gene into a plant and subsequent expression of the encoded polypeptide. An expression cassette will typically comprise a GAT polynucleotide, or functional fragment thereof, operably linked to a promoter sequence and other transcriptional and translational initiation regulatory sequences which will direct 10 expression of the sequence in the intended tissues (e.g., entire plant, leaves, seeds) of the transformed plant. For example, a strongly or weakly constitutive plant promoter can be employed which will direct expression of the GAT polypeptide all tissues of a plant. Such promoters are active under most environmental conditions and states of development or 15 cell differentiation. Examples of constitutive promoters include the l'- or 2'- promoter derived from T-DNA of Agrobacterium tumefaciens, and other transcription initiation regions from various plant genes known to those of skill. In situations in which overexpression of a GAT poynucleotide is detrimental to the plant or otherwise undesirable, one of skill, upon review of this disclosure, will recognize that weak 20 constitutive promoters can be used for low-levels of expression. In those cases where high levels of expression is not harmful to the plant, a strong promoter, e.g., a t-RNA or other pol IHI promoter, or a strong pol II promoter, such as the cauliflower mosaic virus promoter, can be used. Alternatively, a plant promoter may be under environmental control. Such 25 promoters are referred to here as "inducible" promoters. Examples of environmental conditions that may effect transcription by inducible promoters include pathogen attack, anaerobic conditions, or the presence of light. The promoters used in the present invention can be "tissue-specific" and, as such, under developmental control in that the polynucleotide is expressed only in certain 30 tissues, such as leaves and seeds. In embodiments in which one or more nucleic acid sequences endogenous to the plant system are incorporated into the construct, the endogenous promoters (or variants thereof) from these genes can be employed for directing expression of the genes in the transfected plant. Tissue-specific promoters can also be used to direct expression of heterologous polynucleotides. -54 - WO 02/36782 PCT/US01/46227 In general, the particular promoter used in the expression cassette in plants depends on the intended application. Any of a number of promoters which direct transcription in plant cells are suitable. The promoter can be either constitutive or inducible. In addition to the promoters noted above, promoters of bacterial origin which 5 operate in plants include the octopine synthase promoter, the nopaline synthase promoter and other promoters derived from native Ti plasmids (see, Herrara-Estrella et aL (1983) Nature 303:209-213). Viral promoters include the 35S and 19S RNA promoters of cauliflower mosaic virus (Odell et al. (1985) Nature 313:810-812). Other plant promoters include the ribulose-1,3-bisphosphate carboxylase small subunit promoter and the 10 phaseolin promoter. The promoter sequence from the E8 gene and other genes may also be used. The isolation and sequence of the E8 promoter is described in detail in Deikman and Fischer (1988) EMBO J. 7:3315-3327. To identify candidate promoters, the 5' portions of a genomic clone is analyzed for sequences characteristic of promoter sequences. For instance, promoter 15 sequence elements include the TATA box consensus sequence (TATAAT), which is usually 20 to 30 base pairs upstream of the transcription start site. In plants, further upstream from the TATA box, at positions -80 to -100, there is typically a promoter element with a series of adenines surrounding the trinucleotide G (or T) as described by Messing et al. (1983) Genetic Engineering in Plants, Kosage, et al. (eds.), pp. 221-227. 20 In preparing polyucleotide constructs, e.g., vectors, of the invention, sequences other than the promoter and the cojoined polynucleotide can also be employed. If normal polypeptide expression is desired, a polyadenylation region at the 3'-end of a GAT-encoding region can be included. The polyadenylation region can be derived, for example, from a variety of plant genes, or from T-DNA. 25 The construct can also include a marker gene which confers a selectable phenotype on plant cells. For example, the marker may encode biocide tolerance, particularly antibiotic tolerance, such as tolerance to kanamycin, G418, bleomycin, hygromycin, or herbicide tolerance, such as tolerance to chlorosluforon, or phosphinothricin (the active ingredient in the herbicides bialaphos and Basta). 30 Specific initiation signals can aid in efficient translation of a GAT polynucleotide-encoding sequence of the present invention. These signals can include, e.g., the ATG initiation codon and adjacent sequences. In cases where a GAT polypeptide-encoding sequence, its initiation codon and upstream sequences are inserted into an appropriate expression vector, no additional translational control signals may be - 55 - WO 02/36782 PCT/US01/46227 needed. However, in cases where only coding sequence (e.g., a mature protein coding sequence), or a portion thereof, is inserted, exogenous transcriptional control signals including the initiation codon must be provided. Furthermore, the initiation codon must be in the correct reading frame to ensure transcription of the entire insert. Exogenous 5 transcriptional elements and initiation codons can be of various origins, both natural and synthetic. The efficiency of expression may be enhanced by the inclusion of enhancers appropriate to the cell system in use (Scharf D et al. (1994) Results Prob1 Cell Differ 20:125-62; Bittner et al. (1987) Methods in Enzymol 153:516-544). Secretion/Localization Seguences 10 Polynucleotides of the invention can also be fused, for example, in-frame to nucleic acids encoding a secretion/localization sequence, to target polypeptide expression to a desired cellular compartment, membrane, or organelle of a mammalian cell, or to direct polypeptide secretion to the periplasmic space or into the cell culture media. Such sequences are known to those of skill, and include secretion leader peptides, organelle 15 targeting sequences (e.g., nuclear localization sequences, ER retention signals, mitochondrial transit sequences, chloroplast transit sequences), membrane localization/anchor sequences (e.g., stop transfer sequences, GPI anchor sequences), and the like. In a preferred embodiment, a polynucleotide of the invention is fused in 20 frame with an N-terminal chloroplast transit sequence (or chloroplast transit peptide sequence) derived from a gene encoding a polypeptide that is normally targeted to the chloroplast. Such sequences are typically rich in serine and threonine; are deficient in aspartate, glutamate, and tyrosine; and generally have a central domain rich in positively charged amino acids. 25 Expression Hosts In a further embodiment, the present invention relates to host cells containing the above-described constructs. The host cell can be a eukaryotic cell, such as a mammalian cell, a yeast cell, or a plant cell, or the host cell can be a prokaryotic cell, such as a bacterial cell. Introduction of the construct into the host cell can be effected by 30 calcium phosphate transfection, DEAE-Dextran mediated transfection, electroporation, or other common techniques (Davis, L., Dibner, M., and Battey, I. (1986) Basic Methods in Molecular Biology). A host cell strain is optionally chosen for its ability to modulate the expression of the inserted sequences or to process the expressed protein in the desired - 56 - WO 02136782 PCT/US01146227 fashion. Such modifications of the protein include, but are not limited to, acetylation, carboxylation, glycosylation, phosphorylation, lipidation and acylation. Post-translational processing that cleaves a "pre" or a "prepro" form of the protein may also be important for correct insertion, folding and/or function. Different host cells such as E. coli, Bacillus sp., 5 yeast or mammalian cells such as CHO, HeLa, BHK, MDCK, 293, WI38, etc. have specific cellular machinery and characteristic mechanisms, e.g., for post-translational activities and may be chosen to ensure the desired modification and processing of the introduced, foreign protein. For long-term, high-yield production of recombinant proteins, stable 10 expression systems can be used. For example, plant cells, explants or tissues, e.g. shoots, leaf discs, which stably express a polypeptide of the invention are transduced using expression vectors which contain viral origins of replication or endogenous expression elements and a selectable marker gene. Following the introduction of the vector, cells may be allowed to grow for a period determined to be appropriate for the cell type, e.g., 1 15 or more hours for bacterial cells, 1-4 days for plant cells, 2-4 weeks for some plant explants, in an enriched media before they are switched to selective media. The purpose of the selectable marker is to confer resistance to selection, and its presence allows growth and recovery of cells which successfully express the introduced sequences. For example, transgenic plants expressing the polypeptides of the invention can be selected directly for 20 resistance to the herbicide, glyphosate. Resistant embryos derived from stably transformed explants can be proliferated, e.g., using tissue culture techniques appropriate to the cell type. Host cells transformed with a nucleotide sequence encoding a polypeptide of the invention are optionally cultured under conditions suitable for the expression and 25 recovery of the encoded protein from cell culture. The protein or fragment thereof produced by a recombinant cell may be secreted, membrane-bound, or contained intracellularly, depending on the sequence and/or the vector used. As will be understood by those of skill in the art, expression vectors containing GAT polynucleotides of the invention can be designed with signal sequences which direct secretion of the mature 30 polypeptides through a prokaryotic or eukaryotic cell membrane. Additional Polvpeptide Sequences Polynucleotides of the present invention may also comprise a coding sequence fused in-frame to a marker sequence that, e.g., facilitates purification of the encoded polypeptide. Such purification facilitating domains include, but are not limited - 57 - WO 02/36782 PCTJUS01/46227 to, metal chelating peptides such as histidine-tryptophan modules that allow purification on immobilized metals, a sequence which binds glutathione (e.g., GST), ahemagglutinin (HA) tag (corresponding to an epitope derived from the influenza hemagglutinin protein; Wilson et al. (1984) Cell 37:767), maltose binding protein sequences, the FLAG epitope 5 utilized in the FLAGS extension/affinity purification system (Immunex Corp, Seattle, WA), and the like. The inclusion of a protease-cleavable polypeptide linker sequence between the purification domain and the GAT homologue sequence is useful to facilitate purification. One expression vector contemplated for use in the compositions and methods described herein provides for expression of a fusion protein comprising a polypeptide of 10 the invention fused to a polyhistidine region separated by an enterokinase cleavage site. The histidine residues facilitate purification on IMIAC (immobilized metal ion affinity chromatography, as described in Porath et al. (1992) Protein Expression and Purification 3:263-281) while the enterokinase cleavage site provides a means for separating the GAT homologue polypeptide from the fusion protein. pGEX vectors (Promega; Madison, WI) 15 may also be used to express foreign polypeptides as fusion proteins with glutathione S transferase (GST). In general, such fusion proteins are soluble and can easily be purified from lysed cells by adsorption to ligand-agarose beads (e.g., glutathione-agarose in the case of GST-fusions) followed by elution in the presence of free ligand. Polypeptide Production and Recovery 20 Following transduction of a suitable host strain and growth of the host strain to an appropriate cell density, the selected promoter is induced by appropriate means (e.g., temperature shift or chemical induction) and cells are cultured for an additional period. Cells are typically harvested by centrifugation, disrupted by physical or chemical means, and the resulting crude extract retained for further purification. Microbial cells 25 employed in expression of proteins can be disrupted by any convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing agents, or other methods, which are well known to those skilled in the art. As noted, many references are available for the culture and production of many cells, including cells of bacterial, plant, animal (especially mammalian) and 30 archebacterial origin. See e.g., Sambrook, Ausubel, and Berger (all supra), as well as Freshney (1994) Culture of Animal Cells, a Manual of Basic Technique third edition, Wiley- Liss, New York and the references cited therein; Doyle and Griffiths (1997) Mammalian Cell Culture: Essential Techniques John Wiley and Sons, NY; Humason (1979) Animal Tissue Techniques, fourth edition W.H. Freeman and Company; and -58- WO 02/36782 PCT/US01/46Z27 Ricciardelli, et al., (1989) In vitro Cell Dev. Biol. 25:1016-1024. For plant cell culture and regeneration, Payne et al. (1992) Plant Cell and Tissue Culture in Liauid Systems John Wiley & Sons, Inc. New York, NY; Gamborg and Phillips (eds) (1995) Plant Cell, Tissue and Organ Culture; Fundamental Methods Springer Lab Manual, Springer-Verlag 5 (Berlin Heidelberg New York); Jones, ed. (1984) Plant Gene Transfer and Expression Protocols, Humana Press, Totowa, New Jersey and Plant Molecular Biolev (1993) R.R.D.Croy, Ed. Bios Scientific Publishers, Oxford, U.K. ISBN 0 12 198370 6. Cell culture media in general are set forth in Atlas and Parks (eds) The Handbook of Microbiological Media (1993) CRC Press, Boca Raton, FL. Additional information for 10 cell culture is found in available commercial literature such as the Life Science Research Cell Culture Catalogue (1998) from Sigma- Aldrich, Inc (St Louis, MO) ("Sigma LSRCCC") and, e.g., The Plant Culture Catalogue and supplement (1997) also from Sigma-Aldrich, Inc (St Louis, MO) ("Sigma-PCCS"). Further details regarding plant cell transformation and transgenic plant production are found below. 15 Polypeptides of the invention can be recovered and purified from recombinant cell cultures by any of a number of methods well known in the art, including ammonium sulfate or ethanol precipitation, acid extraction, anion or cation exchange chromatography, phosphocellulose chromatography, hydrophobic interaction chromatography, affinity chromatography (e.g., using any of the tagging systems noted 20 herein), hydroxylapatite chromatography, and lectin chromatography. Protein refolding steps can be used, as desired, in completing the configuration of the mature protein. Finally, high performance liquid chromatography (HPLC) can be employed in the final purification steps. In addition to the references noted supra, a variety of purification methods are well known in the art, including, e.g., those set forth in Sandana (1997) 25 Bioseparation of Proteins, Academic Press, Inc.; and Bollag et al. (1996) Protein Methods, 2 d Edition Wiley-Liss, NY; Walker (1996) The Protein Protocols Handbook Humana Press, NJ, Harris and Angal (1990) Protein Purification Aplications: A Practical Aporoach IRL Press at Oxford, Oxford, England; Harris and Angal Protein Purification Methods: A Practical Approach IRL Press at Oxford, Oxford, England; Scopes (1993) 30 Protein Purification: Principles and Practice 3d Edition Springer Verlag, NY; Janson and Ryden (1998) Protein Purificatio Priniples. Il Resolution Methods and Aplications. Second Edition Wiley-VCH, NY; and Walker (1998) Protein Protocols on CD-ROM Humana Press, NJ. - 59 - WO 02/36782 PCT/USO1/46227 In some cases, it is desirable to produce the GAT polypeptide of the invention in a large scale suitable for industrial and/or commercial applications. In such cases bulk fermentation procedures are employed. Briefly, a GAT polynucleotide, e.g., a polynucleotide comprising any one of SEQ ID NOS: 1-5 and 11-262. or other nucleic 5 acids encoding GAT polypeptides of the invention can be cloned into an expression vector. For example, U.S. Patent No. 5,955,310 to Widner et al. "METHODS FOR PRODUCING A POLYPEPTIDE IN A BACILLUS CELL," describes a vector with tandem promoters, and stabilizing sequences operably linked to a polypeptide encoding sequence. After inserting the polynucleotide of interest into a vector, the vector is 10 tranformed into a bacterial, e.g., a Bacillus subtilis strain PL1801IIE (amyE, apr, npr, spoIIE::Tn917) host. The introduction of an expression vector into a Bacillus cell may, for instance, be effected by protoplast transformation (see, e.g., Chang and Cohen (1979) Molecular General Genetics 168:111), by using competent cells (see, e.g., Young and Spizizin (1961) Journal of Bacteriology 81:823, or Dubnau and Davidoff-Abelson (1971) 15 Journal of Molecular Bioloay 56:209), by electroporation (see, e.g., Shigekawa and Dower (1988) Biotechniaues 6:742), or by conjugation (see, e.g., Koehler and Thorne (1987) Journal of Bacteriology 169:5271), also Ausubel, Sambrook and Berger, all supra. The transformed cells are cultivated in a nutrient medium suitable for production of the polypeptide using methods that are known in the art. For example, the 20 cell may be cultivated by shake flask cultivation, small-scale or large-scale fermentation (including continuous, batch, fed-batch, or solid state fermentations) in laboratory or industrial fermentors performed in a suitable medium and under conditions allowing the polypeptide to be expressed and/or isolated. The cultivation takes place in a suitable nutrient medium comprising carbon and nitrogen sources and inorganic salts, using 25 procedures known in the art. Suitable media are available from commercial suppliers or may be prepared according to published compositions (e.g., in catalogues of the American Type Culture Collection). The secreted polypeptide can be recovered directly from the medium. The resulting polypeptide may be isolated by methods known in the art. For 30 example, the polypeptide may be isolated from the nutrient medium by conventional procedures including, but not limited to, centrifugation, filtration, extraction, spray-drying, evaporation, or precipitation. The isolated polypeptide may then be further purified by a variety of procedures known in the art including, but not limited to, chromatography (e.g., ion exchange, affinity, hydrophobic, chromatofocusing, and size exclusion), -60- WO 02136782 PCT/US01/46227 electrophoretic procedures (e.g., preparative isoelectric focusing), differential solubility (e.g., ammonium sulfate precipitation), or extraction (see, e.g., Bollag et al. (1996) Protein Methods, 2 nd Edition Wiley-Liss, NY; Walker (1996) The Protein Protocols Handbook Humana Press, NJ; Bollag et al. (1996) Protein Methods, 2d Edition Wiley-Liss, NY; 5 Walker (1996) The Protein Protocols Handbook Humana Press, NJ). Cell-free transcription/translation systems can also be employed to produce polypeptides using DNAs or RNAs of the present invention. Several such systems are commercially available. A general guide to in vitro transcription and translation protocols is found in Tymms (1995) In vitro Transcription and Translation Protocols: Methods in 10 Molecular Biology Volume 37, Garland Publishing, NY. SUBSTRATES AND FORMATS FOR SEQUENCE RECOMBINATION The polynucleotides of the invention are optionally used as substrates for a variety of diversity generating procedures, e.g., mutation, recombination and recursive recombination reactions, in addition to their use in standard cloning methods as set forth 15 in, e.g., Ausubel, Berger and Sambrook, i.e., to produce additional GAT polynucleotides and polypeptides with desired properties. A variety of diversity generating protocols are available and described in the art. The procedures can be used separately, and/or in combination to produce one or more variants of a polynucleotide or set of polynucleotides, as well variants of encoded proteins. Individually and collectively, these procedures 20 provide robust, widely applicable ways of generating diversified polynucleotides and sets of polynucleotides (including, e.g., polynucleotide libraries) useful, e.g., for the engineering or rapid evolution of polynucleotides, proteins, pathways, cells and/or organisms with new and/or improved characteristics. The process of altering the sequence can result in, for example, single nucleotide substitutions, multiple nucleotide 25 substitutions, and insertion or deletion of regions of the nucleic acid sequence. While distinctions and classifications are made in the course of the ensuing discussion for clarity, it will be appreciated that the techniques are often not mutually exclusive. Indeed, the various methods can be used singly or in combination, in parallel or in series, to access diverse sequence variants. 30 The result of any of the diversity generating procedures described herein can be the generation of one or more polynucleotides, which can be selected or screened for polynucleotides that encode proteins with or which confer desirable properties. Following diversification by one or more of the methods herein, or otherwise available to -61- WO 02136782 PCT/USO1/46227 one of skill, -any polynucleotides that are produced can be selected for a desired activity or property, e.g. altered Km for glyphosate, altered Km for acetyl CoA, use of alternative cofactors (e.g., propionyl CoA) increased kcat, etc. This can include identifying any activity that can be detected, for example, in an automated or automatable format, by any 5 of the assays in the art. For example, GAT homologs with increased specific activity can be detected by assaying the conversion of glyphosate to N-acetylglyphosate, e.g., by mass spectrometry. Alternatively, improved ability to confer resistance to glyphosate can be assayed by growing bacteria transformed with a nucleic acid of the invention on agar containing increasing concentrations of glyphosate or by spraying transgenic plants 10 incorporating a nucleic acid of the invention with glyphosate. A variety of related (or even unrelated) properties can be evaluated, in serial or in parallel, at the discretion of the practitioner. Additional details regarding recombination and selection for herbicide tolerance can be found, e.g., in "DNA SHUFFLING TO PRODUCE HERBICIDE RESISTANT CROPS" (USSN 09/373,333) filed August 12,1999. 15 Descriptions of a variety of diversity generating procedures, including family shuffling and methods for generating modified nucleic acid sequences encoding multiple enzymatic domains, are found the following publications and the references cited therein: Soong, N. et al. (2000) "Molecular breeding of viruses" Nat Genet 25(4):436-39; Stemmer, et al. (1999) "Molecular breeding of viruses for targeting and other clinical 20 properties" Tumor Targeting 4:1-4; Ness et al. (1999) "DNA Shuffling of subgenomic sequences of subtilisin" Nature Biotechnology 17:893-896; Chang et al. (1999) "Evolution of a cytokine using DNA family shuffling" Nature Biotechnology 17:793-797; Minshull and Stemmer (1999) "Protein evolution by molecular breeding" Current Opinion in Chemical Biology 3:284-290; Christians et al. (1999) "Directed evolution of thymidine 25 kinase for AZT phosphorylation using DNA family shuffling" Nature Biotechnology 17:259-264; Crameri et al. (1998) "DNA shuffling of a family of genes from diverse species accelerates directed evolution" Nature 391:288-291; Crameri et al. (1997) "Molecular evolution of an arsenate detoxification pathway by DNA shuffling," Nature Biotechnology 15:436-438; Zhang et al. (1997) "Directed evolution of an effective 30 fucosidase from a galactosidase by DNA shuffling and screening" Proc. Nati. Acad. Sci. USA 94:4504-4509; Patten et al. (1997) "Applications of DNA Shuffling to Pharmaceuticals and Vaccines" Current Opinion in Biotechnology 8:724-733; Crameri et al. (1996) "Construction and evolution of antibody-phage libraries by DNA shuffling" Nature Medicine 2:100-103; Crameri et al. (1996) "Improved green fluorescent protein by -62- WO 02/36782 PCT/USOI/46227 molecular evolution using DNA shuffling" Nature Biotechnology 14:315-319; Gates et al. (1996) "Affinity selective isolation of ligands from peptide libraries through display on a lac repressor 'headpiece dimer"' Journal of Molecular Biology 255:373-386; Stemmer (1996) "Sexual PCR and Assembly PCR" In: The Encyclopedia of Molecular Biology. 5 VCH Publishers, New York, pp.447- 4 57 ; Crameri and Stemmer (1995) "Combinatorial multiple cassette mutagenesis creates all the permutations of mutant and wildtype cassettes" BioTechniques 18:194-195; Stemmer et al., (1995) "Single-step assembly of a gene and entire plasmid form large numbers of oligodeoxy-ribonucleotides" Gene, 164:49-53; Stemmer (1995) "The Evolution of Molecular Computation" Science 270: 10 1510; Stemmer (1995) "Searching Sequence Space" Bio/Technology 13:549-553; Stemmer (1994) "Rapid evolution of a protein in vitro by DNA shuffling" Nature 370:389-391; and Stemmer (1994) "DNA shuffling by random fragmentation and reassembly: In vitro recombination for molecular evolution." Proc. Natl. Acad. Sci. USA 91:10747-10751. 15 Mutational methods of generating diversity include, for example, site directed mutagenesis (Ling et al. (1997) "Approaches to DNA mutagenesis: an overview" Anal Biochem. 254(2): 157-178; Dale et al. (1996) "Oligonucleotide-directed random mutagenesis using the phosphorothioate method" Methods Mol. Biol. 57:369-374; Smith (1985) "In vitro mutagenesis" Ann. Rev. Genet. 19:423-462; Botstein & Shortle (1985) 20 "Strategies and applications of in vitro mutagenesis" Science 229:1193-1201; Carter (1986) "Site-directed mutagenesis" Biochem. .. 237:1-7; and Kunkel (1987) "The efficiency of oligonucleotide directed mutagenesis" in Nucleic Acids & Molecular Biology (Eckstein, F. and Lilley, D.M.J. eds., Springer Verlag, Berlin)); mutagenesis. using uracil containing templates (Kunkel (1985) "Rapid and efficient site-specific 25 mutagenesis without phenotypic selection" Proc. Natl. Acad. Sci. USA 82:488-492; Kunkel et al. (1987) "Rapid and efficient site-specific mutagenesis without phenotypic selection" Methods in Enzymol. 154, 367-382; and Bass et al. (1988) "Mutant Trp repressors with new DNA-binding specificities" Science 242:240-245); oligonucleotide directed mutagenesis (Methods in Enzymol. 100: 468-500 (1983); Methods in Enzymol. 30 154: 329-350 (1987); Zoller & Smith (1982) "Oligonucleotide-directed mutagenesis using M13-derived vectors: an efficient and general procedure for the production of point mutations in any DNA fragment" Nucleic Acids Res. 10:6487-6500; Zoller & Smith (1983) "Oligonucleotide-directed mutagenesis of DNA fragments cloned into M13 vectors" Methods in Enzymol. 100:468-500; and Zoller & Smith (1987) "Oligonucleotide - 63 - WO 02136782 PCT/USOI/46Z27 directed mutagenesis: a simple method using two oligonucleotide primers and a single stranded DNA template" Methods in Enzymol. 154:329-350); phosphorothioate-modified DNA mutagenesis (Taylor et al. (1985) "The use of phosphorothioate-modified DNA in restriction enzyme reactions to prepare nicked DNA" Nucl. Acids Res. 13: 8749-8764; 5 Taylor et al. (1985) "The rapid generation of oligonucleotide-directed mutations at high frequency using phosphorothioate-modifled DNA" Nucl. Acids Res. 13: 8765-8787 (1985); Nakamaye & Eckstein (1986) 'Inhibition of restriction endonuclease Nci I cleavage by phosphorothioate groups and its application to oligonucleotide-directed mutagenesis" Nucl. Acids Res. 14: 9679-9698; Sayers et al. (1988) "Y-T Exonucleases in 10 phosphorothioate-based oligonucleotide-directed mutagenesis" Nucl. Acids Res. 16:791 802; and Sayers et al. (1988) "Strand specific cleavage of phosphorothioate-containing DNA by reaction with restriction endonucleases in the presence of ethidium bromide" Nucl. Acids Res. 16: 803-814); mutagenesis using gapped duplex DNA (Kramer et al. (1984) "The gapped duplex DNA approach to oligonucleotide-directed mutation 15 construction" Nucl. Acids Res. 12: 9441-9456; Kramer & Fritz (1987) Methods in Enzymol. "Oligonucleotide-directed construction of mutations via gapped duplex DNA" 154:350-367; Kramer et al. (1988) "Improved enzymatic in vitro reactions in the gapped duplex DNA approach to oligonucleotide-directed construction of mutations" Nucl. Acids Res. 16: 7207; and Fritz et al. (1988) "Oligonucleotide-directed construction of mutations: 20 a gapped duplex DNA procedure without enzymatic reactions in vitro" Nucl. Acids Res. 16: 6987-6999). Additional suitable methods include point mismatch repair (Kramer et al. (1984) "Point Mismatch Repair" Cell 38:879-887), mutagenesis using repair-deficient host strains (Carter et al. (1985) "Improved oligonucleotide site-directed mutagenesis using 25 M13 vectors" Nucl. Acids Res. 13: 4431-4443; and Carter (1987) "Improved oligonucleotide-directed mutagenesis using M13 vectors" Methods in Enzymol. 154: 382 403), deletion mutagenesis (Eghtedarzadeh & Henikoff (1986) "Use of oligonucleotides to generate large deletions" Nucl. Acids Res. 14: 5115), restriction-selection and restriction selection and restriction-purification (Wells et al. (1986) "Importance of hydrogen-bond 30 formation in stabilizing the transition state of subtilisin" Phil. Trans. R. Soc. Lond. A 317: 415-423), mutagenesis by total gene synthesis (Nambiar et al. (1984) "Total synthesis and cloning of a gene coding for the ribonuclease S protein" Science 223: 1299-1301; Sakamar and Khorana (1988) "Total synthesis and expression of a gene for the a-subunit of bovine rod outer segment guanine nucleotide-binding protein (transducin)" Nucl. Acids Res. 14: -64- WO 02136782 PCT/US01/46227 6361-6372; Wells et al. (1985) "Cassette mutagenesis: an efficient method for generation of multiple mutations at defined sites" Gene 34:315-323; and Grundstrom et al. (1985) "Oligonucleotide-directed mutagenesis by microscale 'shot-gun' gene synthesis" Nucl. Acids Res. 13: 3305-3316), double-strand break repair (Mandecki (1986); Arnold (1993) 5 "Protein engineering for unusual environments" Current Opinion in Biotechnology 4:450 455. "Oligonucleotide-directed double-strand break repair in plasmids of Escherichia coli: a method for site-specific mutagenesis" Proc. NatL. Acad. Sci. USA, 83:7177-7181). Additional details on many of the above methods can be found in Methods in Enzymology Volume 154, which also describes useful controls for trouble-shooting problems with 10 various mutagenesis methods. Additional details regarding various diversity generating methods can be found in the following U.S. patents, PCT publications, and EPO publications: U.S. Pat. No. 5,605,793 to Stemmer (February 25, 1997), "Methods for In Vitro Recombination;" U.S. Pat. No. 5,811,238 to Stemmer et al. (September 22, 1998) "Methods for Generating 15 Polynucleotides having Desired Characteristics by Iterative Selection and Recombination;" U.S. Pat. No. 5,830,721 to Stemmer et al. (November 3, 1998), "DNA Mutagenesis by Random Fragmentation and Reassembly;" U.S. Pat. No. 5,834,252 to Stemmer, et al. (November 10, 1998) "End-Complementary Polymerase Reaction;" U.S. Pat. No. 5,837,458 to Minshull, et al. (November 17, 1998), "Methods and Compositions 20 for Cellular and Metabolic Engineering," WO 95/22625, Stemmer and Crameri, "Mutagenesis by Random Fragmentation and Reassembly;" WO 96/33207 by Stemmer and Lipschutz "End Complementary Polymerase Chain Reaction;" WO 97/20078 by Stemmer and Crameri "Methods for Generating Polynucleotides having Desired Characteristics by Iterative Selection and Recombination;" WO 97/3 5966 by Minshull and 25 Stemmer, "Methods and Compositions for Cellular and Metabolic Engineering;" WO 99/41402 by Punnonen et al. "Targeting of Genetic Vaccine Vectors;" WO 99/41383 by Punnonen et al. "Antigen Library Immunization;" WO 99/41369 by Punnonen et al. "Genetic Vaccine Vector Engineering;" WO 99/41368 by Punnonen et al. "Optimization of Immunomodulatory Properties of Genetic Vaccines;" EP 752008 by Stemmer and 30 Crameri, "DNA Mutagenesis by Random Fragmentation and Reassembly;" EP 0932670 by Stemmer "Evolving Cellular DNA Uptake by Recursive Sequence Recombination;" WO 99/23107 by Stemmer et al., "Modification of Virus Tropism and Host Range by Viral Genome Shuffling;" WO 99/21979 by Apt et al., "Human Papillomavirus Vectors;" WO 98/31837 by del Cardayre et al. "Evolution of Whole Cells and Organisms by - 65 - WO 02/36782 PCT/US01/46227 Recursive Sequence Recombination;" WO 98/27230 by Patten and Stemmer, "Methods and Compositions for Polypeptide Engineering;" WO 98/13487 by Stemmer et al., "Methods for Optimization of Gene Therapy by Recursive Sequence Shuffling and Selection," WO 00/00632, "Methods for Generating Highly Diverse Libraries," WO 5 00/09679, "Methods for Obtaining in Vitro Recombined Polynucleotide Sequence Banks and Resulting Sequences," WO 98/42832 by Arnold et al., "Recombination of Polynucleotide Sequences Using Random or Defined Primers," WO 99129902 by Arnold et al., "Method for Creating Polynucleotide and Polypeptide Sequences," WO 98/41653 by Vind, "An in Vitro Method for Construction of a DNA Library," WO 98/41622 by 10 Borchert et al., "Method for Constructing a Library Using DNA Shuffling," and WO 98/42727 by Pati and Zarling, "Sequence Alterations using Homologous Recombination," WO 00/18906 by Patten et al., "Shuffling of Codon-Altered Genes;" WO 00/04190 by del Cardayre et al. "Evolution of Whole Cells and Organisms by Recursive Recombination;" WO 00/42561 by Crameri et al., "Oligonucleotide Mediated Nucleic Acid 15 Recombination;" WO 00/42559 by Selifonov and Stemmer "Methods of Populating Data Structures for Use in Evolutionary Simulations;" WO 00/42560 by Selifonov et al., "Methods for Making Character Strings, Polynucleotides & Polypeptides Having Desired Characteristics;" WO 01/23401 by Welch et al., "Use of Codon-Varied Oligonucleotide Synthesis for Synthetic Shuffling;" and PCTiUS01/06775 "Single-Stranded Nucleic Acid 20 Template-Mediated Recombination and Nucleic Acid Fragment Isolation" by Affholter. Certain U.S. applications provide additional details regarding various diversity generating methods, including "SHUFFLING OF CODON ALTERED GENES" by Patten et al. filed September 28, 1999, (USSN 09/407,800); "EVOLUTION OF WHOLE CELLS AND ORGANISMS BY RECURSIVE SEQUENCE 25 RECOMBIATION", by del Cardayre et al. filed July 15, 1998 (USSN 09/166,188), and July 15, 1999 (USSN 09/354,922); "OLIGONUCLEOTIDE MEDIATED NUCLEIC ACID RECOMBINATION" by Crameri et al., filed September 28, 1999 (USSN 09/408,392), and "OLIGONUCLEOTIDE MEDIATED NUCLEIC ACID RECOMBINATION" by Crameri et al., filed January 18, 2000 (PCT/USOO/01203); "USE 30 OF CODON-BASED OLIGONUCLEOTIDE SYNTHESIS FOR SYNTHETIC SHUFFLING" by Welch et al., filed September 28, 1999 (USSN 09/408,393); "METHODS FOR MAKING CHARACTER STRINGS, POLYNUCLEOTIDES & POLYPEPTIDES HAVING DESIRED CHARACTERISTICS" by Selifonov et al., filed January 18, 2000, (PCT/US00/01202) and, e.g., "METHODS FOR MAKING - 66 - WO 02/36782 PCT/USO1/46227 CHARACTER STRINGS, POLYNUCLEOTIDES & POLYPEPTIDES HAVING DESIRED CHARACTERISTICS" by Selifonov et al., filed July 18, 2000 (USSN 091618,579); "METHODS OF POPULATING DATA STRUCTURES FOR USE IN EVOLUTIONARY SIMULATIONS" by Selifonov and Stemmer (PCTIUS00/01138), 5 filed January 18, 2000; and "SINGLE-STRANDED NUCLEIC ACID TEMPLATE MEDIATED RECOMBINATION AND NUCLEIC ACID FRAGMENT ISOLATION" by Affholter (USSN 60/186,482, filed March 2, 2000). In brief, several different general classes of sequence modification methods, such as mutation, recombination, etc. are applicable to the present invention and 10 set forth, e.g., in the references above. That is, alterations to the component nucleic acid sequences to produced modified gene fusion constructs can be performed by any number of the protocols described, either before cojoining of the sequences, or after the cojoining step. The following exemplify some of the different types of preferred formats for diversity generation in the context of the present invention, including, e.g., certain 15 recombination based diversity generation formats. Nucleic acids can be recombined in vitro by any of a variety of techniques discussed in the references above, including e.g., DNAse digestion of nucleic acids to be recombined followed by ligation and/or PCR reassembly of the nucleic acids. For example, sexual PCR mutagenesis can be used in which random (or pseudo random, or 20 even non-random) fragmentation of the DNA molecule is followed by recombination, based on sequence similarity, between DNA molecules with different but related DNA sequences, in vitro, followed by fixation of the crossover by extension in a polymerase chain reaction. This process and many process variants is described in several of the references above, e.g., in Stemmer (1994) Proc. Natl. Acad. Sci. USA 91:10747-10751. 25 Similarly, nucleic acids can be recursively recombined in vivo, e.g., by allowing recombination to occur between nucleic acids in cells. Many such in vivo recombination formats are set forth in the references noted above. Such formats optionally provide direct recombination between nucleic acids of interest, or provide recombination between vectors, viruses, plasmids, etc., comprising the nucleic acids of 30 interest, as well as other formats. Details regarding such procedures are found in the references noted above. Whole genome recombination methods can also be used in which whole genomes of cells or other organisms are recombined, optionally including spiking of the genomic recombination mixtures with desired library components (e.g., genes - 67 - WO 02/36782 PCT/USOI/46227 corresponding to the pathways of the present invention). These methods have many applications, including those in which the identity of a target gene is not known. Details on such methods are found, e.g., in WO 98/31837 by del Cardayre et al. "Evolution of Whole Cells and Organisms by Recursive Sequence Recombination;" and in, e.g., 5 PCT/US99115972 by del Cardayre et al., also entitled "Evolution of Whole Cells and Organisms by Recursive Sequence Recombination." Thus, any of these processes and techniques for recombination, recursive recombination, and whole genome recombination, alone or in combination, can be used to generate the modified nucleic acid sequences and/or modified gene fusion constructs of the present invention. 10 Synthetic recombination methods can also be used, in which oligonucleotides corresponding to targets of interest are synthesized and reassembled in PCR or ligation reactions which include oligonucleotides which correspond to more than one parental nucleic acid, thereby generating new recombined nucleic acids. Oligonucleotides can be made by standard nucleotide addition methods, or can be made, 15 e.g., by tri-nucleotide synthetic approaches. Details regarding such approaches are found in the references noted above, including, e.g., WO 00/42561 by Crameri et al., "Olgonucleotide Mediated Nucleic Acid Recombination;" WO 01/23401 by Welch et al., "Use of Codon-Varied Oligonucleotide Synthesis for Synthetic Shuffling;" WO 00/42560 by Selifonov et al., "Methods for Making Character Strings, Polynucleotides and 20 Polypeptides Having Desired Characteristics;" and WO 00/42559 by Selifonov and Stemmer "Methods of Populating Data Structures for Use in Evolutionary Simulations." In silico methods of recombination can be effected in which genetic algorithms are used in a computer to recombine sequence strings which correspond to homologous (or even non-homologous) nucleic acids. The resulting recombined sequence 25 stfings are optionally converted into nucleic acids by synthesis of nucleic acids which correspond to the recombined sequences, e.g., in concert with oligonucleotide synthesis/ gene reassembly techniques. This approach can generate random, partially random or designed variants. Many details regarding in silico recombination, including the use of genetic algorithms, genetic operators and the like in computer systems, combined with 30 generation of corresponding nucleic acids (and/or proteins), as well as combinations of designed nucleic acids and/or proteins (e.g., based on cross-over site selection) as well as designed, pseudo-random or random recombination methods are described in WO 00/42560 by Selifonov et al., "Methods for Making Character Strings, Polynucleotides and Polypeptides Having Desired Characteristics" and WO 00/42559 by Selifonov and - 68 - WO 02136782 PCT/US01/46227 Stemmer "Methods of Populating Data Structures for Use in Evolutionary Simulations." Extensive details regarding in silico recombination methods are found in these applications. This methodology is generally applicable to the present invention in providing for recombination of nucleic acid sequences and/or gene fusion constructs 5 encoding proteins involved in various metabolic pathways (such as, for example, carotenoid biosynthetic pathways, ectoine biosynthetic pathways, polyhydroxyalkanoate biosynthetic pathways, aromatic polyketide biosynthetic pathways, and the like) in silico and/ or the generation of conesponding nucleic acids or proteins. Many methods of accessing natural diversity, e.g., by hybridization of 10 diverse nucleic acids or nucleic acid fragments to single-stranded templates, followed by polymerization and/or ligation to regenerate full-length sequences, optionally followed by degradation of the templates and recovery of the resulting modified nucleic acids can be similarly used. In one method employing a single-stranded template, the fragment population derived from the genomic library(ies) is annealed with partial, or, often 15 approximately full length ssDNA or RNA corresponding to the opposite strand. Assembly of complex chimeric genes from this population is then mediated by nuclease-base removal of non-hybridizing fragment ends, polymerization to fill gaps between such fragments and subsequent single stranded ligation. The parental polynucleotide strand can be removed by digestion (e.g., if RNA or uracil-containing), magnetic separation under 20 denaturing conditions (if labeled in a manner conducive to such separation) and other available separation/purification methods. Alternatively, the parental strand is optionally co-purified with the chimeric strands and removed during subsequent screening and processing steps. Additional details regarding this approach are found, e.g., in "Single Stranded Nucleic Acid Template-Mediated Recombination and Nucleic Acid Fragment 25 Isolation" by Affholter, PCT/US01/06775. In another approach, single-stranded molecules are converted to double stranded DNA (dsDNA) and the dsDNA molecules are bound to a solid support by ligand mediated binding. After separation of unbound DNA, the selected DNA molecules are released from the support and introduced into a suitable host cell to generate a library 30 enriched sequences which hybridize to the probe. A library produced in this manner provides a desirable substrate for further diversification using any of the procedures described herein. Any of the preceding general recombination fonnats can be practiced in a reiterative fashion (e.g., one or more cycles of mutation/recombination or other diversity - 69 - WO 02136782 PCT/USO1/46227 generation methods, optionally followed by one or more selection methods) to generate a more diverse set of recombinant nucleic acids. Mutagenesis employing polynucleotide chain termination methods have also been proposed (see e.g., U.S. Patent No. 5,965,408, "Method of DNA reassembly by 5 interrupting synthesis" to Short, and the references above), and can be applied to the present invention. In this approach, double stranded DNAs corresponding to one or more genes sharing regions of sequence similarity are combined and denatured, in the presence or absence of primers specific for the gene. The single stranded polynucleotides are then annealed and incubated in the presence of a polymerase and a chain terminating reagent 10 (e.g., ultraviolet, gamma or X-ray irradiation; ethidium bromide or other intercalators; DNA binding proteins, such as single strand binding proteins, transcription activating factors, or histones; polycyclic aromatic hydrocarbons; trivalent chromium or a trivalent chromium salt; or abbreviated polymerization mediated by rapid thermocycling; and the like), resulting in the production of partial duplex molecules. The partial duplex 15 molecules, e.g., containing partially extended chains, are then denatured and reannealed in subsequent rounds of replication or partial replication resulting in polynucleotides which share varying degrees of sequence similarity and which are diversified with respect to the starting population of DNA molecules. Optionally, the products, or partial pools of the products, can be amplified at one or more stages in the process. Polynucleotides produced 20 by a chain termination method, such as described above, are suitable substrates for any other described recombination format. Diversity also can be generated in nucleic acids or populations of nucleic acids using a recombinational procedure termed "incremental truncation for the creation of hybrid enzymes" ("ITCBY") described in Ostermeier et al. (1999) "A combinatorial 25 approach to hybrid enzymes independent of DNA homology" Nature Biotech 17:1205. This approach can be used to generate an initial a library of variants which can optionally serve as a substrate for one or more in vitro or in vivo recombination methods. See, also, Ostermeier et al. (1999) "Combinatorial Protein Engineering by Incremental Truncation," Proc. Natl. Acad. Sci. USA, 96: 3562-67; Ostermeier et al. (1999), "Incremental 30 Truncation as a Strategy in the Engineering of Novel Biocatalysts," Biological and Medicinal Chemistry, 7: 2139-44. Mutational methods which result in the alteration of individual nucleotides or groups of contiguous or non-contiguous nucleotides can be favorably employed to introduce nucleotide diversity into the nucleic acid sequences and/or gene fusion - 70 - WO 02/36782 PCT/US01/46227 constructs of the present invention. Many mutagenesis methods are found in the above cited references; additional details regarding mutagenesis methods can be found in following, which can also be applied to the present invention. For example, error-prone PCR can be used to generate nucleic acid 5 variants. Using this technique, PCR is performed under conditions where the copying fidelity of the DNA polymerase is low, such that a high rate of point mutations is obtained along the entire length of the PCR product. Examples of such techniques are found in the references above and, e.g., in Leung et al. (1989) Technique 1:11-15 and Caldwell et al. (1992) PCR Methods Applic. 2:28-33. Similarly, assembly PCR can be used, in a process 10 which involves the assembly of a PCR product from a mixture of small DNA fragments. A large number of different PCR reactions can occur in parallel in the same reaction mixture, with the products of one reaction priming the products of another reaction. Oligonucleotide directed mutagenesis can be used to introduce site-specific mutations in a nucleic acid sequence of interest. Examples of such techniques are found in 15 the references above and, e.g., in Reidhaar-Olson et al. (1988) Science, 241:53-57. Similarly, cassette mutagenesis can be used in a process that replaces a small region of a double stranded DNA molecule with a synthetic oligonucleotide cassette that differs from the native sequence. The oligonucleotide can contain, e.g., completely and/or partially randomized native sequence(s). 20 Recursive ensemble mutagenesis is a process in which an algorithm for protein mutagenesis is used to produce diverse populations of phenotypically related mutants, members of which differ in amino acid sequence. This method uses a feedback mechanism to monitor successive rounds of combinatorial cassette mutagenesis. Examples of this approach are found in Arkin & Youvan (1992) Proc. Natl. Acad. Sci. 25 USA 89:7811-7815. Exponential ensemble mutagenesis can be used for generating combinatorial libraries with a high percentage of unique and functional mutants. Small groups of residues in a sequence of interest are randomized in parallel to identify, at each altered position, amino acids which lead to functional proteins. Examples of such 30 procedures are found in Delegrave & Youvan (1993) Biotechnology Research 11:1548 1552. In vivo mutagenesis can be used to generate random mutations in any cloned DNA of interest by propagating the DNA, e.g., in a strain of E. coli that carries mutations in one or more of the DNA repair pathways. These "mutator" strains have a -71- WO 02/36782 PCTIUSO1/46227 higher random mutation rate than that of a wild-type parent. Propagating the DNA in one of these strains will eventually generate random mutations within the DNA. Such procedures are described in the references noted above. Other procedures for introducing diversity into a genome, e.g. a bacterial, 5 fungal, animal or plant genome can be used in conjunction with the above described and/or referenced methods. For example, in addition to the methods above, techniques have been proposed which produce nucleic acid multimers suitable for transformation into a variety of species (see, e.g., Schellenberger U.S. Patent No. 5,756,316 and the references above). Transformation of a suitable host with such multimers, consisting of genes that 10 are divergent with respect to one another, (e.g., derived from natural diversity or through application of site directed mutagenesis, error prone PCR, passage through mutagenic bacterial strains, and the like), provides a source of nucleic acid diversity for DNA diversification, e.g., by an in vivo recombination process as indicated above. Alternatively, a multiplicity of monomeric polynucleotides sharing regions 15 of partial sequence similarity can be transformed into a host species and recombined in vivo by the host cell. Subsequent rounds of cell division can be used to generate libraries, members of which, include a single, homogenous population, or pool of monomeric polynucleotides. Alternatively, the monomeric nucleic acid can be recovered by standard techniques, e.g., PCR and/or cloning, and recombined in any of the recombination 20 formats, including recursive recombination formats, described above. Methods for generating multispecies expression libraries, have been described (in addition to the reference noted above, see, e.g., Peterson et al. (1998) U.S. Pat. No. 5,783,431 "METHODS FOR GENERATING AND SCREENING NOVEL METABOLIC PATHWAYS," and Thompson, et al. (1998) U.S. Pat. No. 5,824,485 25 METHODS FOR GENERATING AND SCREENING NOVEL METABOLIC PATHWAYS) and their use to identify protein activities of interest has been proposed (In addition to the references noted above, see, Short (1999) U.S. Pat. No. 5,958,672 "PROTEIN ACTIVITY SCREENING OF CLONES HAVING DNA FROM UNCULTIVATED MICROORGANISMS"). Multispecies expression libraries include, in 30 general, libraries comprising cDNA or genomic sequences from a plurality of species or strains, operably linked to appropriate regulatory sequences, in an expression cassette. The cDNA and/or genomic sequences are optionally randomly ligated to further enhance diversity. The vector can be a shuttle vector suitable for transformation and expression in more than one species of host organism, e.g., bacterial species, eukaryotic cells. In some -72- WO 02/36782 PCT/US01/46227 cases, the library is biased by preselecting sequences which encode a protein of interest, or which hybridize to a nucleic acid of interest. Any such libraries can be provided as substrates for any of the methods herein described. The above described procedures have been largely directed to increasing 5 nucleic acid and/ or encoded protein diversity. However, in many cases, not all of the diversity is useful, e.g., functional, and contributes merely to increasing the background of variants that must be screened or selected to identify the few favorable variants. In some applications, it is desirable to preselect or prescreen libraries (e.g., an amplified library, a genomic library, a cDNA library, a normalized library, etc.) or other substrate nucleic 10 acids prior to diversification, e.g., by recombination-based mutagenesis procedures, or to otherwise bias the substrates towards nucleic acids that encode functional products. For example, in the case of antibody engineering, it is possible to bias the diversity generating process toward antibodies with functional antigen binding sites by taking advantage of in vivo recombination events prior to manipulation by any of the described methods. For 15 example, recombined CDRs derived from B cell cDNA libraries can be amplified and assembled into framework regions (e.g., Jirholt et al. (1998) "Exploiting sequence space: shuffling in vivo formed complementarity determining regions into a master framework" Gene 215: 471) prior to diversifying according to any of the methods described herein. Libraries can be biased towards nucleic acids which encode proteins with 20 desirable enzyme activities. For example, after identifying a clone from a library which exhibits a specified activity, the clone can be mutagenized using any known method for introducing DNA alterations. A library comprising the mutagenized homologues is then screened for a desired activity, which can be the same as or different from the initially specified activity. An example of such a procedure is proposed in Short (1999) U.S. 25 Patent No. 5,939,250 for "PRODUCTION OF ENZYMES HAVING DESIRED ACTIVITIES BY MUTAGENESIS." Desired activities can be identified by any method known in the art. For example, WO 99/10539 proposes that gene libraries can be screened by combining extracts from the gene library with components obtained from metabolically rich cells and identifying combinations which exhibit the desired activity. It has also been 30 proposed (e.g., WO 98/58085) that clones with desired activities can be identified by inserting bioactive substrates into samples of the library, and detecting bioactive fluorescence corresponding to the product of a desired activity using a fluorescent analyzer, e.g., a flow cytometry device, a CCD, a fluorometer, or a spectrophotometer. - 73 - WO 02136782 PCT/US01/46227 Libraries can also be biased towards nucleic acids which have specified characteristics, e.g., hybridization to a selected nucleic acid probe. For example, application WO 99/10539 proposes that polynucleotides encoding a desired activity (e.g., an enzymatic activity, for example: a lipase, an esterase, a protease, a glycosidase, a 5 glycosyl transferase, a phosphatase, a kinase, an oxygenase, a peroxidase, a hydrolase, a hydratase, a nitrilase, a transaminase, an amidase or an acylase) can be identified from among genomic DNA sequences in the following manner. Single stranded DNA molecules from a population of genomic DNA are hybridized to a ligand-conjugated probe. The genomic DNA can be derived from either a cultivated or uncultivated 10 microorganism, or from an environmental sample. Alternatively, the genomic DNA can be derived from a multicellular organism, or a tissue derived therefrom. Second strand synthesis can be conducted directly from the hybridization probe used in the capture, with or without prior release from the capture medium or by a wide variety of other strategies known in the art. Alternatively, the isolated single-stranded genomic DNA population can 15 be fragmented without further cloning and used directly in, e.g., a recombination-based approach, that employs a single-stranded template, as described above. "Non-Stochastic" methods of generating nucleic acids and polypeptides are alleged in Short "Non-Stochastic Generation of Genetic Vaccines and Enzymes" WO 00/46344. These methods, including proposed non-stochastic polynucleotide reassembly 20 and site-saturation mutagenesis methods be applied to the present invention as well. Random or semi-random mutagenesis using doped or degenerate oligonucleotides is also described in, e.g., Arkin and Youvan (1992) "Optimizing nucleotide mixtures to encode specific subsets of amino acids for semi-random mutagenesis" Biotechnology 10:297-300; Reidhaar-Olson et al. (1991) "Random mutagenesis of protein sequences using 25 oligonucleotide cassettes" Methods Enzymol. 208:564-86; Lim and Sauer (1991) "The role of internal packing interactions in determining the structure and stability of a protein" J. Mol. Biol. 219:359-76; Breyer and Sauer (1989) "Mutational analysis of the fine specificity of binding of monoclonal antibody 5 IF to lambda repressor" J. Biol. Chen. 264:13355-60); and "Walk-Through Mutagenesis" (Crea, R; US Patents 5,830,650 and 30 5,798,208, and EP Patent 0527809 Bl. It will readily be appreciated that any of the above described techniques suitable for enriching a library prior to diversification can also be used to screen the products, or libraries of products, produced by the diversity generating methods. Any of -74- WO 02/36782 PCTIUSO1/46227 the above described methods can be practiced recursively or in combination to alter nucleic acids, e.g., GAT encoding polynucleotides. Kits for mutagenesis, library construction and other diversity generation methods are also commercially available. For example, kits are available from, e.g., 5 Stratagene (e.g., QuickChangeT' site-directed mutagenesis kit; and ChameleonTm double stranded, site-directed mutagenesis kit), Bio/Can Scientific, Bio-Rad (e.g., using the Kunkel method described above), Boehringer Mannheim Corp., Clonetech Laboratories, DNA Technologies, Epicentre Technologies (e.g., 5 prime 3 prime kit); Genpak Inc, Lemargo Inc, Life Technologies (Gibco BRL), New England Biolabs, Pharmacia Biotech, 10 Promega Corp., Quantum Biotechnologies, Amersham International plc (e.g., using the Eckstein method above), and Anglian Biotechnology Ltd (e.g., using the Carter/Winter method above). The above references provide many mutational formats, including recombination, recursive recombination, recursive mutation and combinations or 15 recombination with other forms of mutagenesis, as well as many modifications of these formats. Regardless of the diversity generation format that is used, the nucleic acids of the present invention can be recombined (with each other, or with related (or even unrelated) sequences) to produce a diverse set of recombinant nucleic acids for use in the gene fusion constructs and modified gene fusion constructs of the present invention, including, e.g., 20 sets of homologous nucleic acids, as well as corresponding polypeptides. Many of the above-described methodologies for generating modified polynucleotides generate a large number of diverse variants of a parental sequence or sequences. In some preferred embodiments of the invention the modification technique (e.g., some form of shuffling) is used to generate a library of variants that is then screened 25 for a modified polynucleotide or pool of modified polynucleotides encoding some desired functional attribute, e.g., improved GAT activity. Exemplary enzymatic activities that can be screened for include catalytic rates (conventionally characterized in terms of kinetic constants such as ke. and K), substrate specificity, and susceptibility to activation or inhibition by substrate, product or other molecules (e.g., inhibitors or activators). 30 One example of selection for a desired enzymatic activity entails growing host cells under conditions that inhibit the growth and/or survival of cells that do not sufficiently express an enzymatic activity of interest, e.g. the GAT activity. Using such a selection process can eliminate from consideration all modified polynucleotides except those encoding a desired enzymatic activity. For example, in some embodiments of the -75- WO 02/36782 PCT/USO1/46227 invention host cells are maintained under conditions that inhibit cell growth or survival in the absence of sufficient levels of GAT, e.g., a concentration of glyphosate that is lethal or inhibits the growth of a wild-type plant of the same variety that lack does not express GAT polynucleotide. Under these conditions, only a host cell harboring a modified nucleic acid 5 that encodes enzymatic activity or activities able to catalyze production of sufficient levels of the product will survive and grow. Some embodiments of the invention employ multiples rounds of screening at increasing concentrations of glyphosate or a glyphosate analog. In some embodiments of the invention, mass spectrometry is used to detect 10 the acetylation of glyphosate, or a glyphosate analog or metabolite. The used of mass spectrometry is described in more detail in the Examples below. For convenience and high throughput it will often be desirable to screen/select for desired modified nucleic acids in a microorganism, e.g., a bacteria such as E. coli. On the other hand, screening in plant cells or plants can will in some cases be 15 preferable where the ultimate aim is to generate a modified nucleic acid for expression in a plant system. In some preferred embodiments of the invention throughput is increased by screening pools of host cells expressing different modified nucleic acids, either alone or as part of a gene fusion construct. Any pools showing significant activity can be 20 deconvoluted to identify single clones expressing the desirable activity. The skilled artisan will recognize that the relevant assay, screening or selection method will vary depending upon the desired host organism, etc. It is normally advantageous to employ an assay that can be practiced in a high-throughput format. In high through put assays, it is possible to screen up to several thousand 25 different variants in a single day. For example, each well of a microtiter plate can be used to run a separate assay, or, if concentration or incubation time effects are to be observed, every 5-10 wells can test a single variant In addition to fluidic approaches, it is possible, as mentioned above, simply to grow cells on media plates that select for the desired enzymatic or metabolic function. 30 This approach offers a simple and high-throughput screening method. A number of well known robotic systems have also been developed for solution phase chemistries useful in assay systems. These systems include automated workstations like the automated synthesis apparatus developed by Takeda Chemical Industries, LTD. (Osaka, Japan) and many robotic systems utilizing robotic arms (Zymate - 76 - WO 02/36782 PCT/US01/46227 II, Zymark Corporation, Hopkinton, MA.; Orca, Hewlett-Packard, Palo Alto, CA) which mimic the manual synthetic operations performed by a scientist. Any of the above devices are suitable for application to the present invention. The nature and implementation of modifications to these devices (if any) so that they can operate as discussed herein with 5 reference to the integrated system will be apparent to persons skilled in the relevant art. High throughput screening systems are commercially available (see, e.g., Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, etc.). These systems typically automate entire procedures including all sample and reagent pipetting, 10 liquid dispensing, timed incubations, and final readings of the microplate in detector(s) appropriate for the assay. These configurable systems provide high throughput and rapid start up as well as a high degree of flexibility and customization. The manufacturers of such systems provide detailed protocols for the various high throughput devices. Thus, for example, Zymark Corp. provides technical 15 bulletins describing screening systems for detecting the modulation of gene transcription, ligand binding, and the like. Microfluidic approaches to reagent manipulation have also been developed, e.g., by Caliper Technologies (Mountain View, CA). Optical images viewed (and, optionally, recorded) by a camera or other recording device (e.g., a photodiode and data storage device) are optionally further 20 processed in any of the embodiments herein, e.g., by digitizing the image and/or storing and analyzing the image on a computer. A variety of commercially available peripheral equipment and software is available for digitizing, storing and analyzing a digitized video or digitized optical image, e.g., using PC (Intel x86 or pentium chip compatible DOSTM, OSTM WINDOWSTM, WINDOWS NTrm or WINDOWS 95Tm based machines), 25 MACINTOSHTM, or UNIX based (e.g., SUNTm work station) computers. One conventional system carries light from the assay device to a cooled charge-coupled device (CCD) camera, a common use in the art. A CCD camera includes an array of picture elements (pixels). The light from the specimen is imaged on the CCD. Particular pixels corresponding to regions of the specimen (e.g., individual hybridization 30 sites on an array of biological polymers) are sampled to obtain light intensity readings for each position. Multiple pixels are processed in parallel to increase speed. The apparatus and methods of the invention are easily used for viewing any sample, e.g. by fluorescent or dark field microscopic techniques. - 77- WO 02136782 PCT/USO1/46227 OTHER POLYNUCLBOTIDE COMPOSITIONS The invention also includes compositions comprising two or more polynucleotides of the invention (e.g., as substrates for recombination). The composition can comprise a library of recombinant nucleic acids, where the library contains at least 2, 5 3, 5, 10, 20, or 50 or more polynucleotides. The polynucleotides are optionally cloned into expression vectors, providing expression libraries. The invention also includes compositions produced by digesting one or more polynucleotide of the invention with a restriction endonuclease, an RNAse, or a DNAse (e.g., as is performed in certain of the recombination formats noted above); and 10 compositions produced by fragmenting or shearing one or more polynucleotide of the invention by mechanical means (e.g., sonication, vortexing, and the like), which can also be used to provide substrates for recombination in the methods above. Similarly, compositions comprising sets of oligonucleotides corresponding to more than one nucleic acid of the invention are useful as recombination substrates and are a feature of the 15 invention. For convenience, these fragmented, sheared, or oligonucleotide synthesized mixtures are referred to as fragmented nucleic acid sets. Also included in the invention are compositions produced by incubating one or more of the fragmented nucleic acid sets in the presence of ribonucleotide- or deoxyribonucelotide triphosphates and a nucleic acid polymerase. This resulting 20 composition forms a recombination mixture for many of the recombination formats noted above. The nucleic acid polymerase may be an RNA polymerase, a DNA polymerase, or an RNA-directed DNA polymerase (e.g., a "reverse transcriptase"); the polymerase can be, e.g., a thermostable DNA polymerase (such as, VENT, TAQ, or the like). WTGRATED SYSTEMS 25 The present invention provides computers, computer readable media and integrated systems comprising character strings corresponding to the sequence information herein for the polypeptides and nucleic acids herein, including, e.g., those sequences listed herein and the various silent substitutions and conservative substitutions thereof. For example, various methods and genetic algorithms (GAs) known in the 30 art can be used to detect homology or similarity between different character strings, or can be used to perform other desirable functions such as to control output files, provide the basis for making presentations of information including the sequences and the like. Examples include BLAST, discussed supra. -78- WO 02136782 PCT/US01I/46227 Thus, different types of homology and similarity of various stringency and length can be detected and recognized in the integrated systems herein. For example, many homology determination methods have been designed for comparative analysis of sequences of biopolymers, for spell-checking in word processing, and for data retrieval 5 from various databases. With an understanding of double-helix pair-wise complement interactions among 4 principal nucleobases in natural polynucleotides, models that simulate annealing of complementary homologous polynucleotide strings can also be used as a foundation of sequence alignment or other operations typically performed on the character strings corresponding to the sequences herein (e.g., word-processing 10 manipulations, construction of figures comprising sequence or subsequence character strings, output tables, etc.). An example of a software package with GAs for calculating sequence similarity is BLAST, which can be adapted to the present invention by inputting character strings corresponding to the sequences herein. Similarly, standard desktop applications such as word processing software 15 (e.g., Microsoft WordTM or Corel WordPerfectTM) and database software (e.g., spreadsheet software such as Microsoft ExcelTm, Corel Quattro Pror", or database programs such as Microsoft AccessTm or ParadoxTM) can be adapted to the present invention by inputting a character string corresponding to the GAT homologues of the invention (either nucleic acids or proteins, or both). For example, the integrated systems can include the foregoing 20 software having the appropriate character string information, e.g., used in conjunction with a user interface (e.g., a GUI in a standard operating system such as a Windows, Macintosh or LINUX system) to manipulate strings of characters. As noted, specialized alignment programs such as BLAST can also be incorporated into the systems of the invention for alignment of nucleic acids or proteins (or corresponding character strings). 25 Integrated systems for analysis in the present invention typically include a digital computer with GA software for aligning sequences, as well as data sets entered into the software system comprising any of the sequences herein. The computer can be, e.g., a PC (Intel x86 or Pentium chip- compatible DOSTm, OS2Tm WINDOWSTm WINDOWS NTTm, WINDOWS95Tm, WINDOWS98Tm LINUX based machine, a MACINTOSHTM, 30 Power PC, or a UNIX based (e.g., SUNTm work station) machine) or other commercially common computer which is known to one of skill. Software for aligning or otherwise manipulating sequences is available, or can easily be constructed by one of skill using a standard programming language such as Visualbasic, Fortran, Basic, .ava, or the like. -79- WO 02/36782 PCT/US01/46227 Any controller or computer optionally includes a monitor which is often a cathode ray tube ("CRT") display, a flat panel display (e.g., active matrix liquid crystal display, liquid crystal display), or others. Computer circuitry is often placed in a box which includes numerous integrated circuit chips, such as a microprocessor, memory, 5 interface circuits, and others. The box also optionally includes a hard disk drive, a floppy disk drive, a high capacity removable drive such as a writeable CD-ROM, and other common peripheral elements. Inputting devices such as a keyboard or mouse optionally provide for input from a user and for user selection of sequences to be compared or otherwise manipulated in the relevant computer system. 10 The computer typically includes appropriate software for receiving user instructions, either in the form of user input into a set parameter fields, e.g., in a GUI, or in the form of preprogrammed instructions, e.g., preprogrammed for a variety of different specific operations. The software then converts these instructions to appropriate language for instructing the operation of the fluid direction and transport controller to carry out the 15 desired operation. The software can also include output elements for controlling nucleic acid synthesis (e.g., based upon a sequence or an alignment of a sequences herein) or other operations which occur downstream from an alignment or other operation performed using a character string corresponding to a sequence herein. Nucleic acid synthesis equipment 20 can, accordingly, be a component in one or more integrated systems herein. In an additional aspect, the present invention provides kits embodying the methods, composition, systems and apparatus herein. Kits of the invention optionally comprise one or more of the following: (1) an apparatus, system, system component or apparatus component as described herein; (2) instructions for practicing the methods 25 described herein, and/or for operating the apparatus or apparatus components herein and/or for using the compositions herein; (3) one or more GAT composition or component; (4) a container for holding components or compositions, and, (5) packaging materials. In a further aspect, the present invention provides for the use of any 30 apparatus, apparatus component, composition or kit herein, for the practice of any method or assay herein, and/or for the use of.any apparatus or kit to practice any assay or method herein. - 8o - WO 02/36782 PCT/USO1/46227 HOST C LS AND ORGANISMS The host cell can be eukaryotic, for example, a eukaryotic cell, a plant cell, an animal cell, a protoplast, or a tissue culture. The host cell optionally comprises a plurality of cells, for example, an organism. Alternatively, the host cell can be prokaryotic 5 including, but not limited to, bacteria (i.e., gram positive bacteria, purple bacteria, green sulfur bacteria, green non-sulfur bacteria, cyanobacteria, spirochetes, thermatogales, flavobacteria, and bacteroides) and archaebacteria (i.e., Korarchaeota, Thermoproteus, Pyrodictium, Thermococcales, methanogens, Archaeoglobus, and extreme halophiles). Transgenic plants, or plant cells, incorporating the GAT nucleic acids, 10 and/or expressing the GAT polypeptides of the invention are a feature of the invention. The transformation of plant cells and protoplasts can be carried out in essentially any of the various ways known to those skilled in the art of plant molecular biology, including, but not limited to, the methods described herein. See, in general, Methods in Enzymology, Vol. 153 (Recombinant DNA Part D) Wu and Grossman (eds.) 1987, Academic Press, 15 incorporated herein by reference. As used herein, the term "transformation" means alteration of the genotype of a host plant by the introduction of a nucleic acid sequence, e.g., a "heterologous" or "foreign" nucleic acid sequence. The heterologous nucleic acid sequence need not necessarily originate from a different source but it will, at some point, have been external to the cell into which is introduced. 20 In addition to Berger, Ausubel and Sambrook, useful general references for plant cell cloning, culture and regeneration include Jones (ed) (1995) Plant Gene Transfer and Exression Protocols- Methods in Moecular Biolo Volume 49 Humana Press Towata NJ; Payne et al. (1992) Plant Cell and Tissue Culture in Liquid Systems John Wiley & Sons, Inc. New York, NY (Payne); and Gamborg and Phillips (eds) (1995) Plant 25 Cell. Tissue and Organ Culture: Fundamental Methods Springer Lab Manual, Springer Verlag (Berlin Heidelberg New York) (Gamborg). A variety of cell culture media are described in Atlas and Parks (eds) The Handbook of Microbiological Media (1993) CRC Press, Boca Raton, FL (Atlas). Additional information for plant cell culture is found in available commercial literature such as the Life Science Research Cell Culture Catalogue 30 (1998) from Sigma- Aldrich, Inc (St Louis, MO) (Sigma-LSRCCC) and, e.g., the Plant Culture Catalogue and supplement (1997) also from Sigma-Aldrich, Inc (St Louis, MO) (Sigma-PCCS). Additional details regarding plant cell culture are found in Croy, (ed.) (1993) Plant Molecular Biology Bios Scientific Publishers, Oxford, U.K. - 81 - WO 02136782 PCT/USO1I/46227 In an embodiment of this invention, recombinant vectors including one or more GAT polynucleotides, suitable for the transformation of plant cells are prepared. A DNA sequence encoding for the desired GAT polypeptide, e.g., selected from among SEQ ID NOS: 1-5 and 11-262, is conveniently used to construct a recombinant expression 5 cassette which can be introduced into the desired plant. In the context of the present invention, an expression cassette will typically comprise a selected GAT polynucleotide operably linked to a promoter sequence and other transcriptional and translational initiation regulatory sequences which are sufficient to direct the transcription of the GAT sequence in the intended tissues (e.g., entire plant, leaves, roots, etc.) of the transformed 10 plant. For example, a strongly or weakly constitutive plant promoter that directs expression of a GAT nucleic acid in all tissues of a plant can be favorably employed. Such promoters are active under most environmental conditions and states of development or cell differentiation. Examples of constitutive promoters include the l- or 2'- promoter 15 of Agrobacterium tumefaciens, and other transcription initiation regions from various plant genes known to those of skill. Where overexpression of a GAT polypeptide of the invention is detrimental to the plant, one of skill, will recognize that weak constitutive promoters can be used for low-levels of expression. In those cases where high levels of expression is not harmful to the plant, a strong promoter, e.g., a t-RNA, or other pol I 20 promoter, or a strong pol II promoter, (e.g., the cauliflower mosaic virus promoter, CaMV, 35S promoter) can be used. Alternatively, a plant promoter can be under environmental control. Such promoters are referred to as "inducible" promoters. Examples of environmental conditions that may alter transcription by inducible promoters include pathogen attack, 25 anaerobic conditions, or the presence of light. In some cases, it is desirable to use promoters that are "tissue-specific" and/or are under developmental control such that the GAT polynucleotide is expressed only in certain tissues or stages of development, e.g., leaves, roots, shoots, etc. Endogenous promoters of genes related to herbicide tolerance and related phenotypes are particularly useful for driving expression of GAT nucleic acids, 30 e.g., P450 monooxygenases, glutathione-S-transferases, homoglutathione-S-transferases, glyphosate oxidases and 5-enolpyruvylshikimate-2-phosphate synthases. Tissue specific promoters can also be used to direct expression of heterologous structural genes, including the GAT polynucleotides described herein. Thus the promoters can be used in recombinant expression cassettes to drive expression of any - 82- WO 02/36782 PCT/US01/46227 gene whose expression is desirable in the transgenic plants of the invention, e.g., GAT and/or other genes conferring herbicide resistance or tolerance, genes which influence other useful characteristics, e.g., heterosis. Similarly, -enhancer elements, e.g., derived from the 5' regulatory sequences or intron of a heterologous gene, can also be used to 5 improve expression of a heterologous structural gene, such as a GAT polynucleotide. In general, the particular promoter used in the expression cassette in plants depends on the intended application. Any of a number of promoters which direct transcription in plant cells can be suitable. The promoter can be either constitutive or inducible. In addition to the promoters noted above, promoters of bacterial origin which 10 operate in plants include the octopine synthase promoter, the nopaline synthase promoter and other promoters derived from Ti plasmids. See, Herrera-Estrella et al. (1983) Nature 303:209. Viral promoters include the 35S and 195 RNA promoters of CaMV. See, Odell et al., (1985) Nature 313:810. Other plant promoters include the ribulose-1,3 bisphosphate carboxylase small subunit promoter and the phaseolin promoter. The 15 promoter sequence from the E8 gene (see, Deikman and Fischer (1988) EMBO . 7:3315) and other genes are also favorably used. Promoters specific for monocotyledonous species are also considered (McElroy D., Brettell R.IS. 1994. Foreign gene expression in transgenic cereals. Trends Biotech., 12:62-68.) Alternatively, novel promoters with useful characteristics can be identified from any viral, bacterial, or plant source by 20 methods, including sequence analysis, enhancer or promoter trapping, and the like, known in the art. In preparing expression vectors of the invention, sequences other than the promoter and the GAT encoding gene are also favorably used. If proper polypeptide expression is desired, a polyadenylation region can be derived from the natural gene, from 25 a variety of other plant genes, or from T-DNA. Signal/localization peptides, which, e.g., facilitate translocation of the expressed polypeptide to internal organelles (e.g., chloroplasts) or extracellular secretion, can also be employed. The vector comprising the GAT polynucleotide also can include a marker gene which confers a selectable phenotype on plant cells. For example, the marker may 30 encode biocide tolerance, particularly antibiotic tolerance, such as tolerance to kanamycin, G418, bleomycin, hygromycin, or herbicide tolerance, such as tolerance to chlorosulfuron, or phophinothricin. Reporter genes, which are used to monitor gene expression and protein localization via visualizable reaction products (e.g., beta-glucuronidase, beta - 83 - WO 02/36782 PCT/USO1/46227 galactosidase, and chloramphenicol acetyltransferase) or by direct visualization of the gene product itself (e.g., green fluorescent protein, GFP; Sheen et al. (1995) The Plant Journal 8:777) can be used for, e.g., monitoring transient gene expression in plant cells. Transient expression systems can be employed in plant cells, for example, in screening 5 plant cell cultures for herbicide tolerance activities. PLANT TRANSFORMATION Protoplasts Numerous protocols for establishment of transformable protoplasts from a variety of plant types and subsequent transformation of the cultured protoplasts are 10 available in the art and are incorporated herein by reference. For examples, see, Hashimoto et al. (1990) Plant Physiol. 93:857; Fowke and Constabel (eds)(1994) Plant Protoplasts; Saunders et al. (1993) Applications of Plant In Vitro Technology Symnosium, UPM 16-18; and Lyznik et al. (1991) BioTechniques 10:295, each of which is incorporated herein by reference. 15 Chloroplasts Chloroplasts are a site of action of some herbicide tolerance activities, and, in some instances, the GAT polynucleotide is fused to a chloroplast transit sequence peptide to facilitate translocation of the gene products into the chloroplasts. In these cases, it can be advantageous to transform the GAT polynucleotide into the chloroplasts of the 20 plant host cells. Numerous methods are available in the art to accomplish chloroplast transformation and expression (e.g., Daniell et al. (1998) Nature Biotechnology 16:346; O'Neill et al. (1993) The Plant Journal 3:729; Maliga (1993) TIBTECH 11:1). The expression construct comprises a transcriptional regulatory sequence functional in plants operably linked to a polynucleotide encoding the GAT polypeptide. Expression cassettes 25 that are designed to function in chloroplasts (such as an expression cassette including a GAT polynucleotide) include the sequences necessary to ensure expression in chloroplasts. Typically, the coding sequence is flanked by two regions of homology to the chloroplastid genome to effect a homologous recombination with the chloroplast genome; often a selectable marker gene is also present within the flanking plastid DNA sequences 30 to facilitate selection of genetically stable transformed chloroplasts in the resultant transplastonic plant cells (see, e.g., Maliga (1993) and Daniell (1998), and references cited therein). -84- WO 02/36782 PCT/USO1/46227 General transformation methods DNA constructs of the invention can be introduced into the genome of the desired plant host by a variety of conventional techniques. Techniques for tranforming a wide variety of higher plant species are well known and described in the technical and 5 scientific literature. See, e.g., Payne, Gamborg, Croy, Jones, etc. all supra, as well as, e.g., Weising et al. (1988) Ann. Rev. Genet. 22:421. For example, DNAs can be introduced directly into the genomic DNA of a plant cell using techniques such as electroporation and microinjection of plant cell protoplasts, or the DNA constructs can be introduced directly to plant tissue using ballistic 10 methods, such as DNA particle bombardment. Alternatively, the DNA constructs can be combined with suitable T-DNA flanking regions and introduced into a conventional Agrobacteriun tumefaciens host vector. The virulence functions of the Agrobacterium host will direct the insertion of the construct and adjacent marker into the plant cell DNA when the plant cell is infected by the bacteria. 15 Microinjection techniques are known in the art and well described in the scientific and patent literature. The introduction of DNA constructs using polyethylene glycol precipitation is described in Paszkowski et al (1984) EMBO J 3:2717. Electroporation techniques are described in Fromm et al. (1985) Proc Nat'l Acad Sci USA 82:5824. Ballistic transformation techniques are described in Klein et al. (1987) Nature 20 327:70; and Weeks et al. Plant Physiol 102:1077. In some embodiments, Agrobacterium mediated transformation techniques are used to transfer the GAT sequences of the invention to transgenic plants. Agrobacterium-mediated transformation is widely used for the transformation of dicots, however, certain monocots can also be transformed by Agrobacterium. For example, 25 Agrobacterium transformation of rice is described by Hiei et al. (1994) Plant J. 6:271; US Patent No. 5,187,073; US Patent No. 5,591,616; Li et al. (1991) Science in China34:54; and Raineri et al. (1990) Bio/Technolo2y 8:33. Transformed maize, barley, triticale and asparagus by Agrobacterium mediated transformation have also been described (Xu et al. (1990) Chinese J Bot 2:81). 30 Agrobacterium mediated transformation techniques take advantage of the ability of the tumor-inducing (Ti) plasmid of A tumefaciens to integrate into a plant cell genome, to co-transfer a nucleic acid of interest into a plant cell. Typically, an expression vector is produced wherein the nucleic acid of interest, such as a GAT polynucleotide of the invention, is ligated into an autonomously replicating plasmid which also contains T - 85 - WO 02/36782 PCT/USO1/46227 DNA sequences. T-DNA sequences typically flank the expression casssette nucleic acid of interest and comprise the integration sequences of the plasmid. In addition to the expression cassette, T-DNA also typically include a marker sequence, e.g., antibiotic resistance genes. The plasmid with the T-DNA and the expression cassette are then 5 transfected into Agrobacterin cells. Typically, for effective tranformation of plant cells, the A. tumefaciens bacterium also possesses the necessary vir regions on a plasmid, or integrated into its chromosome. For a discussion of Agrobacterium mediated transformation, see, Firoozabady and Kuehnle, (1995) Plant Cell Tissue and Organ Culture Fundamental Methods, Gamborg and Phillips (eds.). 10 Regeneration of Transgenic Plants Transformed plant cells which are derived by plant transformation techniques, including those discussed above, can be cultured to regenerate a whole plant which possesses the transformed genotype (i.e., a GAT polynucleotide), and thus the desired phenotype, such as acquired resistance (i.e., tolerance) to glyphosate or a 15 glyphosate analog. Such regeneration techniques rely on manipulation of certain phytohormones in a tissue culture growth medium, typically relying on a biocide and/or herbicide marker which has been introduced together with the desired nucleotide sequences. Alternatively, selection for glyphosate resistance conferred by the GAT polynucleotide of the invention can be performed. Plant regeneration from cultured 20 protoplasts is described in Evans et al. (1983) Protoplasts Isolation and Culture. Handbook of Plant Cell Culture, pp 124-176, Macmillan Publishing Company, New York; and Binding (1985) Regeneration of Plants, Plant Protoplasts pp 21-73, CRC Press, Boca Raton. Regeneration can also be obtained from plant callus, explants, organs, or parts thereof. Such regeneration techniques are described generally in Klee et al. (1987) Ann 25 Rev of Plant Phys 38:467. See also, e.g., Payne and Gamborg. After transformation with Agrobacterium, the explants typically are transferred to selection medium One of skill will realize that the selection medium depends on the selectable marker that was co transfected into the explants. After a suitable length of time, transformants will begin to form shoots. After the shoots are about 1-2 cm in length, the shoots should be transferred 30 to a suitable root and shoot medium. Selection pressure should be maintained in the root and shoot medium. Typically, the transformants will develop roots in about 1-2 weeks and form plantlets. After the plantlets are about 3-5 cm in height, they are placed in sterile soil in fiber pots. Those of skill in the art will realize that different acclimation procedures are - 86 - WO 02136782 PCT/USO1/46227 used to obtain transformed plants of different species. For example, after developing a root and shoot, cuttings, as well as somatic embryos of transformed plants, are transferred to medium for establishment of plantlets. For a description of selection and regeneration of transformed plants, see, e.g., Dodds and Roberts (1995) Experiments in Plant Tissue 5 Culture, 3 rd Ed., Cambridge University Press. There are also methods for Agrobacterium transformation of Arabidopsis using vacuum infiltration (Bechtold N., Ellis I. and Pelletier G,, 1993, In planta Agrobacterium mediated gene transfer by infiltration of adult Arabidopsis thaliana plants. CR Acad Sci Paris Life Sci 316:1194-1199) and simple dipping of flowering plants 10 (Desfeux, C., Clough S.I., and Bent A.F., 2000, Female reproductive tissues are the primary target of Agrobacterium-mediated transformation by the Arabidopsis floral-dip method. Plant Physiol. 123:895-904). Using these methods, transgenic seed are produced without the need for tissue culture. There are plant varieties for which effective Agrobacterium-mediated 15 transformation protocols have yet to be developed. For example, successful tissue transformation coupled with regeneration of the transformed tissue to produce a transgenic plant has not been reported for some of the most commercially relevant cotton cultivars. Nevertheless, an approach that can be used with these plants involves stably introducing the polynucleoticle into a related plant variety via Agrobacterium-mediated transformation, 20 confirming operability, and then transferring the transgene to the desired commercial strain using standard sexual crossing or back-crossing techniques. For example, in the case of cotton, Agrobacterium can be used to transform a Coker line of Gossypiui hirustum (e.g., Coker lines 310, 312, 5110 Deltapine 61 or Stoneville 213), and then the transgene can be introduced into another more commercially relevant G. hirustum cultivar 25 by back-crossing. The transgenic plants of this invention can be characterized either genotypically or phenotypically to determine the presence of the GAT polynucleotide of the invention. Genotypic analysis can be performed by any of a number of well-known techniques, including PCR amplification of genomic DNA and hybridization of genomic 30 DNA with specific labeled probes. Phenotypic analysis includes, e.g., survival of plants or plant tissues exposed to a selected herbicide such as glyphosate. Essentially any plant can be transformed with the GAT polynucleotides of the invention. Suitable plants for the transformation and expression of the novel GAT polynucleotides of this invention include agronomically and horticulturally important - 87 - WO 02/36782 PCT/US01/46227 species. Such species include, but are not restricted to members of the families: Graminae (including corn, rye, triticale, barley, millet, rice, wheat, oats, etc.); Leguminosae (including pea, beans, lentil, peanut, yam bean, cowpeas, velvet beans, soybean, clover, alfalfa, lupine, vetch, lotus, sweet clover,. wisteria, and sweetpea); Compositae (the largest 5 family of vascular plants, including at least 1,000 genera, including important commercial crops such as sunflower) and Rosaciae (including raspberry, apricot, almond, peach, rose, etc.), as well as nut plants (including, walnut, pecan, hazelnut, etc.), and forest trees (including Pinus, Quercus, Pseutotsuga, Sequoia, Populusetc.) Additional targets for modification by the GAT polynucleotides of the 10 invention, as well as those specified above, include plants from the genera- Agrostis, Allium, Antirrhinum, Apium, Arachis, Asparagus, Atropa, Avena (e.g., oats), Bambusa, Brassica, Bromus, Browaalia, Camellia, Cannabis, Capsicum, Cicer, Chenopodium, Chichorium, Citrus, Coffea, Coix, Cucumis, Curcubita, Cynodon, Dactylis, Datura, Daucus, Digitalis, Dioscorea, Elaeis, Eleusine, Festuca, Fragaria, Geranium, Gossypium, 15 Glycine, Helianthus, Heterocallis, Hevea, Hordeum (e.g., barley), Hyoscyamus, Ipomoea, Lactuca, Lens, Lilium, Linum, Lolium, Lotus, Lycopersicon, Majorana, Malus, Mangifera, Manihot, Medicago, Nemesia, Nicotiana, Onobrychis, Oryza (e.g., rice), Panicum, Pelargonium, Pennisetum (e.g., millet), Petunia, Pisum, Phaseolus, Phleum, Poa, Prunus, Ranunculus, Raphanus, Ribes, Ricinus, Rubus, Saccharum, Salpiglossis, Secale (e.g., rye), 20 Senecio, Setaria, Sinapis, Solanum, Sorghum, Stenotaphrum, Theobroma, Trifolium, Trigonella, Triticum (e.g., wheat), Vicia, Vigna, Vitis, Zea (e.g., corn), and the Olyreae, the Pharoideae and many others. As noted, plants in the family Graminae are a particularly target plants for the methods of the invention. Common crop plants which are targets of the present invention include 25 com, rice, triticale, rye, cotton, soybean, sorghum, wheat, oats, barley, millet, sunflower, canola, peas, beans, lentils, peanuts, yam beans, cowpeas, velvet beans, clover, alfalfa, lupine, vetch, lotus, sweet clover, wisteria, sweetpea and nut plants (e.g., walnut, pecan, etc). In one aspect, the invention provides a method for producing a crop by 30 growing a crop plant that is glyphosate-tolerant as a result of being transformed with a gene encoding a glyphosate N-acteyltransferase, under conditions such that the crop plant produces a crop, and harvesting the crop. Preferably, glyphosate is applied to the plant, or in the vicinity of the plant, at a concentration effective to control weeds without preventing the transgenic crop plant from growing and producing the crop. The application of -88- WO 02136782 PCTIUSOI/46227 glyphosate can be before planting, or at any time after planting up to and including the time of harvest. Glyphosate can be applied once or multiple times. The timing of glyphosate application, amount applied, mode of application, and other parameters will vary based upon the specific nature of the crop plant and the growing environment, and 5 can be readily determined by one of skill in the art. The invention further provides the crop produced by this method. The invention provides for the propagation of a plant containing a GAT polynucleotide transgene. The plant can be, for example, a monocot or a dicot. In one aspect, propagation entails crossing a plant containing a GAT polynucleotide transgene 10 with a second plant, such that at least some progeny of the cross display glyphosate tolerance. In one aspect, the invention provides a method for selectively controlling weeds in a field where a crop is being grown. The method involves planting crop seeds or plants that are glyphosate-tolerant as a result of being transformed with a gene encoding a 15 GAT, e.g., a GAT polynucleotide, and applying to the crop and any weeds a sufficient amount of glyphosate to control the weeds without a significant adverse impact on the crops. It is important to note that it is not necessary for the crop to be totally insensitive to the herbicide, so long as the benefit derived from the inhibition of weeds outweighs any negative impact of the glyphosate or glyphosate analog on the crop or crop plant 20 In another aspect, the invention provides for use of a GAT polynucleotide as a selectable marker gene. In this embodiment of the invention, the presence of the GAT polynucleotide in a cell or organism confers upon the cell or organism the detectable phenotypic trait of glyphosate resistance, thereby allowing one to select for cells or organisms that have been transformed with a gene of interest linked to the GAT 25 polynucleotide. Thus, for example, the GAT polynucleotide can be introduced into a nucleic acid construct, e.g., a vector, thereby allowing for the identification of a host (e.g., a cell or transgenic plant) containing the nucleic acid construct by growing the host in the presence of glyphosate and selecting for the ability to survive and/or grow at a rate that is discernibly greater than a host lacking the nucleic acid construct would survive or grow. 30 A GAT polynucleotide can be used as a selectable marker in a wide variety of hosts that are sensitive to glyphosate, including plants, most bacteria (including E. coli), actinomycetes, yeasts, algae and fungi. One benefit of using herbicide resistance as a marker in plants, as opposed to conventional antibiotic resistance, is that it obviates the concern of some members of the public that antibiotic resistance might escpe into the - 89 - WO 02136782 PCT/US01/46227 environment. Some experimental data from experiments demonstrating the use of a GAT polynucleotide as a selectable marker in diverse host systems are described in the Examples section of this specification. Selection of aat polynucleotides conferring enhanced glyphosate resistance 5 in transgenic plants. Libraries of GAT encoding nucleic acids diversified according to the methods described herein can be selected for the ability to confer resistance to glyphosate in transgenic plants. Following one or more cycles of diversification and selection, the modified GAT genes can be used as a selection marker to facilitate the production and 10 evaluation of transgenic plants and as a means of conferring herbicide resistance in experimental or agricultural plants. For example, after diversification of any one or more of SEQ ID NO:1 to SEQ ID NO:5 to produce a library of diversified GAT polynucleotides, an initial functional evaluation can be performed by expressing the library of GAT encoding sequences in E. coli. The expressed GAT polypeptides can be 15 purified, or partially purified as described above, and screened for improved kinetics by mass spectrometry. Following one or more preliminary rounds of diversification and selection, the polynucleotides encoding improved GAT polypeptides are cloned into a plant expression vector, operably linked to, e.g., a strong constitutive promoter, such as the CaMV 35S promoter. The expression vectors comprising the modified GAT nucleic acids 20 are transformed, typically by Agrobacterium mediated transformation, into Arabidopsis thaliana host plants. For example, Arabidopsis hosts are readily transformed by dipping inflorescences into solutions of Agrobacterium and allowing them to grow and set seed. Thousands of seeds are recovered in approximately 6 weeks. The seeds are then collected in bulk from the dipped plants and germinated in soil. In this manner it is possible to 25 generate several thousand independently transformed plants for evaluation, constituting a high throughput (TP) plant transformation format. Bulk grown seedlings are sprayed with glyphosate and surviving seedlings exhibiting glyphosate resistance survive the selection process, whereas non-transgenic plants and plants incorporating less favorable modified GAT nucleic acids are damaged or killed by the herbicide treatment Optionally, 30 the GAT encoding nucleic acids conferring improved resistance to glyphosate are recovered, e.g., by PCR amplification using T-DNA primers flanking the library inserts, and used in further diversification procedures or to produce additional transgenic plants of the same or different species. If desired, additional rounds of diversification and selection -90 - WO 02136782 PCT/USO1/46227 can be performed using increasing concentrations of glyphosate in each subsequent selection. In this manner, GAT polynucleotides and polypeptides conferring resistance to concentrations of glyphosate useful in field conditions can be obtained Herbicide Resistance 5 The mechanism of glyphosate resistance of the present invention can be combined with other modes of glyphosate resistance known in the art to produce plants and plant explants with superior glyphosate resistance. For example, glyphosate-tolerant plants can be produced by inserting into the genome of the plant the capacity to produce a higher level of 5-enolpyruvylshikimate-3-phosphate synthase (EPSP) as more fully 10 described in U.S. Patent Nos. 6,248,876 B 1; 5,627,061; 5,804,425; 5,633,435; 5,145,783; 4,971,908; 5,312,910; 5,188,642; 4,940,835; 5,866,775; 6,225,114 B1; 6,130,366; 5,310,667; 4,535,060; 4,769,061; 5,633,448; 5,510,471; Re. 36,449; RE 37,287 E; and 5,491,288; and international publications WO 97/04103; WO 00/66746; WO 01/66704; and WO 00/66747, which are incorporated herein by reference in their entireties for all 15 purposes. Glyphosate resistance is also imparted to plants that express a gene that encodes a glyphosate oxido-reductase enzyme as described more fully in U.S. Patent Nos. 5,776,760 and 5,463,175, which are incorporated herein by reference in their entireties for all purposes. Further, the mechanism of glyphosate resistance of the present invention 20 may be combined with other modes of herbicide resistance to provide plants and plant explants that are resistant to glyphosate and one or more other herbicides. For example, the hydroxyphenylpyrvatdioxygenases are enzymes that catalyze the reaction in which para-hydroxyphenylpyrUvate (HPP) is transformed into homogentisate. Molecules which inhibit this enzyme, and which bind to the enzyme in order to inhibit transformation of the 25 HPP into homogentisate are useful as herbicides. Plants more resistant to certain herbicides are described in U.S Patent Nos. 6,245,968 B1; 6,268,549; and 6,069,115; and international publication WO 99/23886, which are incorporated herein by reference in their entireties for all purposes. Sulfonylurea and imidazolinone herbicides also inhibit growth of higher 30 plants by blocking acetolactate synthase (ALS) or acetohydroxy acid synthase (ABAS). The production of sulfonylurea and imidazolinone tolerant plants is described more fully in U.S Patent Nos. 5,605,011; 5,013,659; 5,141,870; 5,767,361; 5,731,180; 5,304,732; 4,761,373; 5,331,107; 5,928,937; and 5,378,824; and international publication WO 96/33270, which are incorporated herein by reference in their entireties for all purposes. -91- WO 02/36782 PCT/US01/46227 Glutamine synthetase (GS) appears to be an essential enzyme necessary for the development and life of most plant cells. Inhibitors of GS are toxic to plant cells. Glufosinate herbicides have been developed based on the toxic effect due to the inhibition of GS in plants. These herbicides are non-selective. They inhibit growth of all the 5 different species of plants present, causing their total destruction. The development of plants containing an exogenous phosphinothricin acetyl transferase is described in U.S. Patent Nos. 5,969,213; 5,489,520; 5,550,318; 5,874,265; 5,919,675; 5,561,236; 5,648,477; 5,646,024; 6,177,616 B1; and 5,879,903, which are incorporated herein by reference in their entireties for all purposes. 10 Protoporphyrinogen oxidase (protox) is necessary for the production of chlorophyll, which is necessary for all plant survival. The protox enzyme serves as the target for a variety of herbicidal compounds. These herbicides also inhibit growth of all the different species of plants present, causing their total destruction. The development of plants containing altered protox activity which are resistant to these herbicides are 15 described in U.S. Patent Nos. 6,288,306 B1; 6,282,837 B1; and 5,767,373; and international publication WO 01/12825, which are incorporated herein by reference in their entireties for all purposes. EXAMPLES The following examples are illustrative and not limiting. One of skill will 20 recognize a variety of non-critical parameters that can be altered to achieve essentially similar results. EXAMPLE 1: ISOLATING NOVEL NATIVE GAT POLYNUCLEOTIDES Five native GAT polynucleotides (i.e., GAT polynucleotides that occur naturally in a non-genetically modified organism) were discovered by expression cloning 25 of sequences from Bacillus strains exhibiting GAT activity. Their nucleotide sequences were determined and are provided herein as SEQ ID NO:1 to SEQ ID NO:5. Briefly, a collection of approximately 500 Bacillus and Pseudomonas strains were screened for native ability to N-acetylate glyphosate. Strains were grown in LB overnight, harvested by centrifugation, permeabilizied in dilute toluene, and then washed and resuspended in a 30 reaction mix containing buffer, 5 mM glyphosate, and 200 jpM acetyl-CoA. The cells were incubated in the reaction mix for between 1 and 48 hours, at which time an equal volume of methanol was added to the reaction. The cells were then pelleted by centrifugation and the supernatant was filtered before analysis by parent ion mode mass -92 - WO 02/36782 PCT/USO1/46227 spectrometry. The product of the reaction was positively identified as N-acetylglyphosate by comparing the mass spectrometry profile of the reaction mix to an N-acetylglyphosate standard as shown in Figure 2. Product detection was dependent on inclusion of both substrates (acetylCoA and glyphosate) and was abolished by heat denaturing the bacterial 5 cells. Individual GAT polynucleotides were then cloned from the identified strains by functional screening. Genomic DNA was prepared and partially digested with Sau3A1 enzyme. Fragments of approximately 4 Kb were cloned into an E. coli expression vector and transformed into electrocompetent E. coli. Individual clones exhibiting GAT 10 activity were identified by mass spectrometry following a reaction as described previously except that the toluene wash was replaced by permeabilization with PMBS. Genomic fragments were sequenced and the putative GAT polypeptide-encoding open reading frame identified. Identity of the GAT gene was confirmed by expression of the open reading frame in E. coli and detection of high levels of N-acetylglyphosate produced from 15 reaction mixtures. EXAMPLE 2: CHAlACTERIZATION OF A GAT POLYPEPTDE ISOLATED FROM B.LCHENIFORMIS STRAIN B6. Genomic DNA from B. licheniformis strain B6 was purified, partially 20 digested with Sau3Al and fragments of 1-10 Kb were cloned into an E. col expression vector. A clone with a 2.5 kb insert conferred the glyphosate N-acetyltransferase (GAT) activity on the E. coli host as determined with mass spectrometry analysis. Sequencing of the insert revealed a single complete open reading frame of 441 base pairs. Subsequent cloning of this open reading frame confirmed that it encoded the GAT enzyme. A 25 plasmid, pMAXY2120, shown in figure 4, with the gene encoding the GAT enzyme of B6 was transformed into E. coli strain XL1 Blue. A 10% innoculum of a saturated culture was added to Luria broth, and the culture was incubated at 37* C for 1 hr. Expression of GAT was induced by the addition of IPTG at a concentration of 1 mM. The culture was incubated a further 4 hrs, following which, cells were harvested by centrifugation and the 30 cell pellet stored at -80* C. Lysis of the cells was effected by the addition of 1 ml of the following buffer to 0.2 g of cells: 25 mM HEPES, pH 7.3, 100 mM KC1 and 10% methanol (HKM) plus 0.1 mM EDTA, 1 mM DTT, 1 mg/ml chicken egg lysozyme, and a protease inhibitor -93- WO 02/36782 PCT/USO1/46227 cocktail obtained from Sigma and used according to the manufacturer's recommendations. After 20 minutes incubation at room temperature (e.g., 22-25" C), lysis was completed with brief sonication. The lysate was centrifuged and the supernatant was desalted by passage through Sephadex G25 equilibrated with 1K1V Partial purification was obtained 5 by affinity chromatography on CoA Agarose (Sigma). The column was equilibrated with HKM and the clarified extract allowed to pass through under hydrostatic pressure. Non binding proteins were removed by washing the column with HKI, and GAT was eluted with HKM containing 1 mM Coenzyme A. This procedure provided 4-fold purification. At this stage, approximately 65% of the protein staining observed on an SDS 10 polyacrylamide gel loaded with crude lysate was due to GAT, with another 20% due to chloramphenicol acetyltransferase encoded by the vector. Purification to homogeneity was obtained by gel filtration of the partially purified protein through Superdex 75 (Pharmacia). The mobile phase was HKM, in which GAT activity eluted at a volume corresponding to a molecular radius of 17 kD. This 15 material was homogeneous as judged by Coomassie staining of a 3 pg sample of GAT subjected to SDS polyacrylamide gel electrophoresis on a 12% acrylamide gel, 1 mm thickness. Purification was achieved with a 6-fold increase in specific activity. The apparent Km for glyphosate was determined on reaction mixtures containing saturating (200 pM) Acetyl CoA, varying concentrations of glyphosate, and 1 pM purified 20 GAT in buffer containing 5 mM morpholine adjusted to pH 7.7 with acetic acid and 20 % ethylene glycol. Initial reaction rates were determined by continuous monitoring of the hydrolysis of the thioester bond of Acetyl CoA at 235 nm (E = 3.4 OD/mNMcm). Hyperbolic saturation kinetics were observed (Figure 5), from which an apparent Km of 2.9 ± 0.2 (SD) mM was obtained. 25 The apparent Km for AcCoA was determined on reaction mixtures containing 5 mM glyphosate, varying concentrations of Acetyl CoA, and 0.19 pM GAT in buffer containing 5 mM morpholine adjusted to pH 7.7 with acetic acid and 50% methanol. Initial reaction rates were determined using mass spectrometric detection of N acetyl glyphosate. Five pl were repeatedly injected to the instrument and reaction rates 30 were obtained by plotting reaction time vs area of the integrated peak (Figure 6). Hyperbolic saturation kinetics were observed (Figure 7), from which an apparent KM of 2 jM was derived. From values for Vmax obtained at a known concentration of enzyme, a kcat of 6/min was calculated. -94- WO 02/36782 PCT/USO1/46227 EXAMPLE 3: MASS SPECTROMETRY (MS) SCREENING PROCESS Sample (5 ul) is drawn from a 96-well microtiter plate at a speed of one sample every 26 seconds and injected into the mass spectrometer (Micromass Quattro LC, 5 triple quadrupole mass spectrometer) without any separation. The sample is carried into the mass spectrometer by a mobile phase of water/methanol (50:50) at a flow rate of 500 Ul/min. Each injected sample is ionized by negative electrospray ionization process (needle voltage, -3.5 KV; cone voltage, 20 V; source temperature, 120 C; desolvation temperature, 250 C; cone gas flow, 90 IJHr; and desolvation gas flow, 600 IEr). The 10 molecular ions (m/z 210) formed during this process arre selected by the first quadrupole for performing collison induced dissociation (CID) in the second quadrupole, where the pressure is set at 5 x 10-4 mBar and the collision energy is adjusted to 20 Ev. The third quadrupole is set for only allowing one of the daughter ions (m/z 124) produced from the parent ions (m/z 210) to get into the detector for signal recording. The first and third 15 quadupoles are set at unit resolution, while the photomultiplier is operated at 650 V. Pure N-acetylglyphosate standards are used for comparison and peak integration used to estimate concentrations. It is possible to detect less than 200 Nm N-acetylglyphosate by this method. 20 EXAMPLE 4: DETECTION OF NATIVEOR LOW ACTIVITY GAT ENZYMES Native or low activity GAT enzymes typically have Kcat of approximately 1 min' and Km for glyphosate of 1.5-10 Mm. KM for acetylCoA is typically less than 25 pM. Bacterial cultures are grown in rich medium in deep 96-well plates and 0.5 25 ml stationary phase cells are harvested by centrifugation, washed with 5 mM morpholine acetate pH 8, and resuspended in 0.1 ml reaction mix containing 200 pM ammonium acetylCoA, 5 mM ammonium glyphosate, and 5 .g/ml PMBS (Sigma) in 5 mM morpholine acetate, pH 8. The PMBS permeabilizes the cell membrane allowing the substrates and products to move from the cells to the buffer without releasing the entire 30 cellular contents. Reactions are carried out at 25-37*C for 1-48 hours. The reactions are quenched with an equal volume of 100% ethanol and the entire mixture is filtered on a 0.45 pm MARV Multiscreen filter plate (Millipore). Samples are analyzed using a mass spectrometer as desribed above and compared to synthetic N-acetylglyphosate standards. -95 - WO 02/36782 PCT/USO1/46227 EXAMPLE 5: DETBCT[ON OF HIGH ACTIVE GAT ENZYMES High activity GAT enzymes typically have kcat up to 400 min- and Km below 0.1 mM glyphosate. Genes coding for GAT enzymes are cloned into E. coli expression vectors 5 such as pQE8O (Qiagen) and introduced into E coli strains such as XL1 Blue (Stratagene). Cultures are grown in 150 ul rich medium (such as LB with 50 ug/mi carbenicllin) in shallow U-bottom 96-well polystyrene plates to late-log phase and diluted 1:9 with fresh medium containing 1 mM IPTG (TSB). After 4-8 hours induction, cells are harvested, washed with 5mM morpholine acetate pH 6.8 and resuspended in an equal volume of the 10 same morpholine buffer. Reactions are carried out with up to 10 ul of washed cells. At higher activity levels, the cells are first diluted up to 1:200 and 5 ul is added to1O ul reaction mix. To measure GAT activity, the same reaction mix as described for low activity can be used. However, for detecting highly active GAT enzymes the glyphosate concentration is reduced to 0.15 - 0.5 mM, the pH is reduced to 6.8, and reactions are 15 carried out for 1 hour at 37*C. Reaction workup and MS detection are as described herein. EXAMPLE 6: PURIFICATION OF GAT ENZY@E Enzyme purification is achieved by affinity chromatography of cell lysates on CoA-agarose and gel-filtration on Superdex-75. Quantities of purified GAT enzyme up 20 to 10 mg are obtained as follows: A 100-m1 culture of E. coli carrying a GAT polynucleotide on a pQE80 vector and grown overnight in LB containing 50 ug/ml carbenicillin is used to inoculate 1 L of LB plus 50 ug/ml carbenicillin. After 1 hr, IPTG is added to 1 mM, and the culture is grown a further 6 hr. Cells are harvested by centrifugation. Lysis is effected by suspending the cells in 25 mM HEPES (pH 7.2), 100 25 mM KC1, 10% methanol (termed BXM), 0.1 mM EDTA, 1 mM DTT, protease inhibitor cocktail supplied by Sigma-Aldrich and 1 mg/ml of chicken egg lysozyme. After 30 minutes at room temperature, the cells are briefly sonicated. Particulate material is removed by centrifugation, and the lysate is passed through a bed of coenzyme
A
Agarose. The column is washed with several bed volumes of HKM and GAT is eluted in 30 1.5 bed volumes of HKM containing 1 mM acetyl-coenzyme A. GAT in the eluate is concentrated by its retention above a Centricon YM 50 ultrafiltration membrane. Further purification is obtained by passing the protein through a Superdex 75 column through a series of 0.6-ml injections. The peak of GAT activity elutes at a volume corresponding to - 96 - WO 02/36782 PCT/USO1/46227 a molecular weight of 17 kD. This method results in purification of GAT enzyme to homogeneity with >85% recovery. A similar procedure is used to obtain 0.1 to 0.4 mg quantities of up to 96 shuffled variants at a time. The volume of induced culture is reduced to 1 to 10 ml, coenzyme A-Agarose affinity chromatography is performed in 0.15 5 ml columns packed in an MAHV filter plate (Millipore) and Superdex 75 chromatography is omitted. EXAMPLE 7: STANDARD PROTOCOL FOR DETERMNAnON OF AND KM Kt and KM for glyphosate of purified protein are determined using a 10 continuous spectrophotometric assay, in which hydrolysis of the sulfoester bond of AcCoA is monitored at 235 nm. Reactions are performed at ambient temperature (about 23*C) in the wells of a 96-well assay plate, with the following components present in a final volume of 0.3 ml: 20 mM HEPES, pH 6.8, 10% ethylene glycol, 0.2 mM acetyl coenzyme A, and various concentration of ammonium glyphosate. In comparing the 15 kinetics of two GAT enzymes, both enzymes should be assayed under the same condition, e.g:, both at 23*C. Keat is calculated from Vmn and the enzyme concentration, determined by Bradford assay. KM is calculated from the initial reaction rates obtained from concentrations of glyphosate ranging from 0.125 to 10 mM, using the Lineweaver-Burke transformation of the Michaelis-Menten equation. K,./Km is determined by dividing the 20 value determined for K, by the value determined for Km. Using this methodology, kinetic parameters for a number of GAT polypeptides exemplified herein have been determined. For example, the Kc, KM and Kw/Km for the GAT polypeptide corresponding to SEQ ID NO:445 have been determined to be 322 min7', 0.5 mM and 660 mM'min , respectively, using the assay conditions 25 described above. The Kot, Km and K 2 t/KM for the GAT polypeptide corresponding to SEQ ID NO:457 have been determined to be 118 min-', 0.1 mM and 1184 mM 4 mind, respectively, using the assay conditions described above. The Kt, KM and KeatKm for the GAT polypeptide corresponding to SEQ ID NO:300 have been determined to be 296 min-', 0.65 mM and 456 mM'min4, respectively, using the assay conditions described 30 above. One of skill in the art can use these numbers to confirm that a GAT activity assay is generating kinetic parameters for a GAT suitable for comparison with the values given herein. For example, the conditions used to compare the activity of GATs should yield the same kinetic constants for SEQ ID NOS: 300, 445 and 457 (within normal experimental -97- WO 02136782 PCT/USO1/46227 variance) as those reported herein, if the conditions are going to be used to compare a test GAT with the GAT polypeptides exemplified herein. Kinetic parameters for a number of GAT polypeptide variants were determined according to this methodology and are provided in Tables 3, 4 and 5. 5 Table 3. GAT polypeptide krt values SEQ ID NO. Clone ID K..(min ) SEQ ID NO:263 13_1 OF6 48.6 SEQ ID NO:264 13_12G6 52.1 SEQ ID NO:265 14_2A5 280.8 SEQ ID NO:266 14_21 133.4 SEQ .ID NO:267 14 2F11 136.9 SEQ ID NO:268 CHIMERA 155.4 SEQ ID NO:269 10_12D7 77.3 SEQ ID NO:270 101 5F4 37.6 SEQ ID NO:271 10 17D1 176.2 SEQ ID NO:272 10_17F6 47.9 SEQ ID NO:273 1018G9 24 SEQ ID NO:274 10 1H3 762 SEQ ID NO:275 10 20D10 86.2 SEQ ID NO:276 10 23F2 101.3 SEQ ID NO:277 10 2B8 108.4 SEQ ID NO:278 10_2C7 135 SEQ ID NO:279 10_3G5 87.4 SEQ ID NO:280 |104H7 112 SEQ ID NO:281 10_6D1 1 62.4 SEQ ID NO:282 10 8C6 21.7 SEQ ID NO:283 1103 2.8 SEQ ID NO:284 11G3 15.6 SEQ ID NO-285 11H3 1.2 SEQ ID NO:286 12 1F9 80.4 SEQ ID NO:287 12_2G9 151.4 SEQ ID NO:288 12_3F1 44.1 SEQ ID NO:289 12-5010 89.6 SEQ ID NO:290 12_6Al 0 54.7 SEQ ID NO:291 12_6D1 49 SEQ ID NO:292 12_6F9 89.1 SEQ ID NO:293 12_6H6 90.5 SEQ ID NO:294 12_7D6 53.9 SEQ ID NO:295 12 7G11 234.5 SEQ ID NO:296 12F5 3.1 SEQ ID NO:297 12G7 2.3 SEQ ID NO:298 1 2H6 9.3 SEQ ID NO-299 13_12G12 36.1 SEQ ID NO:300 13.6D10 296.5 SEQ ID NO:301 13_7A7 117 SEQ ID NO:302 13_7B1 2 68.9 SEQ ID NO:303 13_7C1 48.1 SEQ ID Nd:304 13"8G6 33.7 SEQ ID NO:305 13_9F6 59 SEQ ID NO:306 14_10C9 127 SEQ ID NO:307 14 10H3 105.2 SEQ ID NO:308 14.10H9 127.2 - 98 - WO 02136782 PCTIUS01/46227 SEQ ID NO:309 14 _ 11C2 108.7 SEQID NO:310 14 _12DB 62-1 SEQ ID NQ:311 14-12H6 91.1 SEQ ID NO:312 14 2B6 34.2 SEQ ID NO:31 3 .1 4 2G1 89.4 SEQ ID NO:314 14 3B2 68.7 SEQ ID NO:315 14 4H8 198.8 SEQ ID NO:316 14 - A8 43.7 SEQ ID NO:317 14 6B1 0 134.7 SEQ ID NO:31 8 14 6D4 256 SEQ ID NO:31 9 14 7A1 1 197.2 SEQ ID NO:320 14 MA 155.8 SEQ ID NQ:321 14 7A9 245.9 FSEQ ID NQ:322 14 7G1 136.7 SEQ ID NO:323 14 7H9 84.4 SEQ ID NO:324 14 BF7 90.5 SEQ ID NO:325 15 1002 69.9 SEQ ID NO:326 151 1D6 67.1 SEQ ID NO:327 15 11 F9 76.4 SEQ ID NO:328 15_ 1 H3 61.9 SEQ ID NQ:329 15 12AB 77.1 SEQ ID NO:330 151_l2D6 148.6 SEQ ID NO:331 15.12D8 59.7 SEQ ID NO-332 15 12D9 59.7 SEQ ID NQ:333 15--3F 10 48.7 SEQ I D NO:334 15 3G 11 71.5 rSEQ ID NO:335 15_4F1 1 80.3 SEQ ID NQ:337 115 6D3 85.9 ___________ SEQ I D NO:349 21 9FE1512 SEQ IDNQ:350 21151. SEQ ID NO:351 2401 1.7 SEQ ID N:32 2406 23.7 SEQ ID NO:33 12E0 83.9 SEQ ID N:354 2_803 24.8 SEQ ID NO:355 19C3 1.1 SEQ ID NO:356 310GB 1.2 SEQ ID NO:37 3BA 100 2.8 SE ID NO:358 3BF 3.7 96 SEQ ID NO:359 23H_121 2.8 SEQ ID N:351 3BC 5.4 SEQ ID N:3521 20 1216.47 ISE Q ID N Q:39 2
.
30~ 1 1H
.
9 SEQ ID NO:33 312H 10 91.1 SEQ ID NO:38 i79H 11.7 SEQ ID NO:365 4A3Bl 23205. SEQ ID NQ:366 4A 102l 20.4 - 99 - WO 02136782 PCTJUSO1/46227 SEQ ID NQ:367 4B 13E1 37.2 SEQ ID NO:368 4Bl 13G1 0 34.9 SEQ ID NQ:369 4B5_ iE1 17 SEQ ID NQ:370 4B_17A1 19.1 SEQ ID NQ:371 4Bil BF1 1 14.6 [SEQ ID NQ:372 4B 19CB 15.9 SEQ ID NO:373 4B-1 G4 3.7 SEQ ID NQ:374 4B 210C6 11.8 SEQ ID NQ:375 4B 2H7 27_____________ SEQ ID NO:376 4B_2H8 B. SEQ ID NQ:377 4B._6DB 2. SEQ ID NO:378 4B_7E8 0. SEQ ID NO:379 40 8.C99 SEQ ID NO:3B0 4H1 . SEQ ID NQ:381 6 14D10 4. SEQ ID NQ:382 6_ 15G7 48.4 SEQ ID NQ:383 616A5 3.2 SEQ ID NQ:384 6_635.2_____________ [SEQ ID NQ:385 161-7C5 35.2 SEQ ID NQ:386 B6- 807 132.2 SEQ ID NO:387 16 18D7 43 SEQ ID NO:3B8 16-19A1 0 86.8 SEQ ID NQ:39 16-19B6 23.9 SEQ ID NQ:390 6 1903 23.1 SEQ ID NQ:391 6_1908 74.8 SEQ ID NO:392 6 20A7 40.4 SEQ ID NO:393 6 20A9 45.1 SEQ ID NO:394 6 0519.5 SEQ ID NO:395 6 21 F4 24.3 SEQ ID NO:396 16-.22C9 47.4 SEQ ID N:397 1622D9 43.9 SEQ ID NO:398 16-22H9 17.4 SEQ ID NO:399 16-23H3 43.9 SEQ ID NQ:4o0 6. ..237 46.2 SEQ ID NO:401 §6.2H1 26.6 SEQ IDM Q4 6. 3D6 41.7 SEQ ID NQ:403 6..3G3 51.9 SEQ ID NO:404 6 3H2 57.2 SEQ ID NO:405 16 A10 55 SEQ ID NQ:406 16-.4M 27 SEQ ID NO:407 6-.5D11 15.2 SEQ ID NO:408 6K.5F1 1 40.1 SEQ ID NQ:409 6. 5G9 35.8 SEQ ID NO:410 6. 6D5 55.3 SEQ ID NO:411 6 D119.7 SEQ ID NO:412 6..8H3 44.7 SQID NO:413 j69G1 1 78.4 SEQ ID NO:414 6F1 10.1 SEQ ID NO:415 7 104 17.4 SEQ ID NO:416 7-2A1 0 1.5 SEQ ID NO:417 7-2A1 1 46.8 SEQ ID N:41 8 7 2D7 54.9 SEQ ID NO:419 7 57 44.7 SEQ ID NO:420 79 99 65 SEQ ID NO:421 91 3F10 3. SEQ ID N:422 9 13F1 131.6 SEQ ID NO:423 19 1 5D5 2. SEQ ID NO:424 9 V15D 0107.3 -100- WO 02/36782 PCTJUSO1/46227 SEQ ID NO:425 915H3 6.7 SEQ ID NO:426 91j 8H2 25 SEQ ID NO:427 9 -20F12 37.8 SEQ ID NQ:428 21828.6 SEQ ID NO:429 I9.2B1 150D.1 SEQ ID N:430 19. 23A1 0 121 SEQ ID NO:431 9G.24F615. SEQ 10 NO:432 194H10I103 SEQ ID NO:433 I9. AH8 147.1 ___________ SEQ ID NO:434 9 S8H1 7. SEQ ID NO:435 19-9H7 28_____________ SEQ ID NO:436 9 06 13_____________ SQID NO:437 19H1 1 4_____________ SEQ ID NO:438 O.AB1O 19 SEQ ID NO:439 I0.5B1 1 219 ____________ SEQ ID NO:440 I0 ..53 143 SE D 4 1 0 5B4 1 80 SQID NO.:4i - 05B8 143 S E Q ID N O :443 0 5 04 22050 8 SEQ ID NO:444 I .5D11 224 SEQ ID NO:445 10 503 322 SEQ ID NO:446 I05D7 244 SE D O44 B4 252 SEQ ID NQ:.448 10 6D1 0 ill SQID NO:449 I0 601 1 212 SEQ 1D NO:450 0 F2 175 SEQ ID NO:451 0 6H9 228 SEQ ID NO:452 10_4010 69.6 SEQ ID NO:45 10_L4D5 82.72 SE ID NO:.454 042210 SEQ ID NQ:455 10 4F9 551.9 SEQ ID NOQ:457 44 118.3 SEQ ID NO:458 11 3A1 55.66 SEQ D NO455 i_3 219.9 SEQ ID NO:456 0 4G 35 194.61 SEQ ID NO:41 i 01 49.0 SEQ ID NO46 ii 303 21.0 SEQ ID NO:463 10 306 1184.44 SEQ ID NO:454 11 -306 55.36 SEQ ID NO:45 1 1G12 21______________ SEQID NO 46 H1____________ SEQ ID NO:467 1 1R H2.6 SEQ ID No:48 11H SEQ ID NO:469 Il -2$12490 SEQ ID NO:470 11 3C3B 214.02 __________ SEQ D NO:473 1 3.064844 SEQ ID NO:472 11D 55D3 SEQ ID NO:473 1-G1 2D44 SEQ ID NO:474 1 2F8 291 ____________ SEQ ID NO:497 1 1216 SEQ ID NO:476 1- .. 394 SEQ ID NO:477 1 2-Al 3D26 SEQ ID NO:478 1 3F3S13 SEQ ID NO:471 1-0 9.q SEQ ID N:480 1-F 405 SEQ ID NO:481 1_4D6B 137 SEQ ID NO:472 1_4H1 236 SEQ~~ ~ ~ 101 -O47 13 ~ 2 WO 02136782 PCT/USO1/46227 SEQ ID NO:483 1_5H5 214 SEQ ID NO:484 1_6F12 209 SEQ ID NO:485 1_6H6 274 SEQ ID NO:486 3_11A10 135.41 SEQ ID NO:487 3_14F6 188.43 SEQ ID NO:488 3,15B2 104.13 SEQ ID NO:489 3_6A1 0 126.48 SEQ ID NO:490 3.6B1 263.08 SEQ ID NO:491 3_F9 193.55 SEQ ID NO:492 3_8G11 99.14 SEQ ID NO:493 4_1B10 77.09 SEQ ID NO:494 5_2B3 56.75 SEQ ID NO:495 5_2D9 75.44 SEQ ID NO:496 5_2F1 0 54.72 SEQ ID NO:497 6_1A11 45.54 SEQ ID NO:498 61 D5 42.92 SEQ ID NO:499 61 F1 1 105.76 SEQ ID NO:500 61 F1 69.81 SEQ ID NO:501 61 H10 17.01 SEQ ID NO:502 6 I H4 85.91 SEQ ID NO:503 8 1F8 8288 SEQ ID NO:504 8_1 G2 67.47 SEQ ID NO:505 _1 G3 18 .9 SEQ ID NO:506 8 1H7 101.24 SEQ ID NO:507 8_1 H9 178.39 SEQ ID NO:508 GAT1_21F12 _5.4 SEQ ID NO:509 GAT124G3 4.9 SEQ ID NO:510 GAT129G1 62 TbElQ 4DGN:11A GAT1_32G1 4.5 SEQ ID NO.512 GAT2-15G8 4.5 SEQ ID NO:513 GAT2_19H _14.1 SEQ ID NO:514 GAT2_21F1 4.2 Table 4. GAT polypeptide (glyphosate) Km values SEQ ID NO. Clone ID KmumM SEQ ID NO:263 13_10F6 1.3 5EQ ID NO:264 13_12G6 1.2 SEQ ID NO:265 14 2AS 1.6 SEQ ID NO:266 |1 4_21 3.1 SEQ ID NO:267 14_2F11 SEQ ID NO:26B CHIMERA 1.3 SEQ ID NO:269 10-12D7 1.8 SEQ ID NO:270 10 15F4 1 SEQ ID NO:271 10_17D1 2.2 SEQ ID NO:272 10_17F6 1.4 SEQ ID NO:273 1 0_1 8G9 1.2 SEQ ID NO:274 10 1H3 1.9 SEQ ID NO:275 10_20D10 1.6 SEQ ID NO-276 10_23F2 0.9 SEQ ID NO:277 10_2B8 1-1 SEQ ID NO:278 10 -2C7 1.4 SEQ ID NO:279 10 3G5 2 SEQ ID NO:280 10_4H7 1.7 SEQ ID NO:281 10_6D11 1.2 SEQ ID NO:282 10_8C6 0.7 SEQ ID NO:283 11C3 3.1 - 102 - WO 02/36782 PCTfUSO1/46227 SEQ ID NQ:284 11 G3 1.7 ____________ SEQ ID NQ:285 11 H31. SEQ ID NO:286 121 1F93 SEQ ID N0287 12 2G9 1.5 _________ SEQ ID NO:288 12 3F10. SEQ ID NO:289 12 50101. SEQ ID NQ:290 12 6AlO 0_____1.1______ SEQ ID NQ:291 12 6D11. SEQ ID N0292 .12 SF91. SEQ ID NO:293 12 6H61 SEQ ID NO:294 12 7DS 1.4 __________ SEQ ID N0:295 12 7G1 12 SEQ ID NO:296 12F5 . SEQ ID NO:297 12G73. SEQ ID N0:298 1 2H6 0.9 ____________ SEQ ID NO:299 13 12G12106 SEQ ID NO:300 13 6D10 06 SEQ ID NO:301 13 7A70. SEQ ID NO:302 .1 37B1 21. SEQ ID NO:303 13_701. SEQ ID NO:304 113-8G6 06 SEQ I D NO:305 13 9FS 1. SEQ ID NO:306 14 - 1C9 1. SEQ ID NO:307 114_10 H3 1. SEQ ID NO:308 1410OH9 ____1.1________ SEQ ID NO:309 14_11021 SEQ ID NQ:31 0 14-12D8 SEQ ID NO.311 14.12H60. SEQ ID NO:312 14 2B6 06 SQ ID NQ:313 14 2 211. [SEQ ID NO:314 14-3B2 0.85 _ __ _ __ _ __ _ SEQ ID NO:315 144H8 SEQ ID NO:31 G 14 6A8 .7 SEQ ID NO;317 114 51101. SEQ ID NO:31 8 14,6D4 I___________ SEQ ID NO:319 14 MA11 3.7 [SEQ ID NO:320 14 43A 1.6 SEQ ID NO:321 14- 7A9 3.2 SEQ-ID NO:322 14-7G 1 0.66 SEQ ID NO:323 14 7H9 1.3 SEQ ID NO:324 14 F7 1.8 SEQ ID NO:325 151002 0.8 SEQ ID NO:326 151D6 1 ____________ SEQ ID NO:327 15_11 F9 1 SEQ ID NQ:328 15 11 H3 1 SEQ ID NO;329 151 1A8 1.6 SEQ ID NO:330 15.12D6 0.74 SEQ ID NO:331 15 12DB 1.3 SEQ ID NO:332 15 12D9 1.4 SEQ ID NO:333 15--3F10 - 0.9 SEQ ID NO:334 15_-3G11 - 1.2 SEQ ID NO:335 15_4F1 1 0.9 SEQ~~~~ IDNO36 54H SEQ ID NO:337 155631. rSEQ ID NO:336 15 _ G1H0. SEQ ID NOM33 15 6 1. SEQ ID NO:340 15SF6. SEQ ID NO:341 15SAI . - 103 - WO 02136782 PCT/USO1/46227 SEQ ID NO:342 16H3 SEQ ID NO:343 17C12 SEQ ID NO:344 1 8D6 SEQ ID NO:345 19C6 SEQ ID NO:346 19D5 SEQ ID NO:347 20A12 SEQ ID NO:348 20F2 SEQ ID NO:349 2.1OE+12 SEQ ID NO:350 23H1 1 SEQ ID NO:351 24C1 SEQ ID NO:352 2406 _1._ SEQ ID NO:353 2.40E+08 0.9 SEQ ID NO:354 2803 1.5 SEQ ID NO:355 2H3 0.9 SEQ ID NO:356 30GB .6 SEQ ID NO:357 3B 10C4. SEQ ID NO:358 3B_1 0G7 SEQ ID NO:359 3B_12B1 SEQ ID NO:360 3B_12D10 SEQ ID NO:361 3B_2E5 _ SEQ ID NO:362 3C_1 DH3 SEQ ID NO:363 3C.12H10 1.2 SEQ ID NO:364 3C_9H8 1 SEQ ID NO:365 4A_1B11 SEQ ID NO:366 4A_1C2 _.2 SEQ ID NO:367 4B 13E1 SEQ ID NO:368 4B_13G10 7.6 SEQ ID NO:369 4B 16El SEQ ID NO:370 4B 17A1 1.1 SEQ ID NO:371 4B_18F1 1 1.7 SEQ ID NO:372 4B_1908 12 SEQ ID NO:373 4B 1 G4 1 SEQ ID NO:374 4B,21C6 0.8 SEQ ID NO:375 4B 2H7 6.2 SEQ ID NO:376 4B_2H8 1.2 SEQ ID NO:377 4B_6DB 1.5 SEQ ID NO:378 4B._7E8 1.2 SEQ ID NO:379 40 809 0.6 SEQ ID NO:380 4H1 4 SEQ ID NO:381 6_14D10 1.5 SEQ ID NO:382 6_15G7 1.3 SEQ ID NO:383 6_16A5 1.1 SEQ ID NO:384 6_16F5 1 SEQ ID NO:385 6_17C5 1.3 SEQ ID NO:386 6_1807 12 SEQ ID NO:387 6..18D7 12 SEQ ID NO:388 6 19A10 1.9 SEQ ID NO:389 6_19B6 SEQ ID NO:390 6_19C3 SEQ ID NO:391 6_1908 SEQ ID NO:392 6_20A7 SEQ ID NO:393 6_20A9 |SEQ ID NO:394 6_20H5 SEQ ID NO:395 621 F4 SEQ ID NO:396 6 22C9 SEQ ID NO:397 6 22D9 1.3 SEQ ID NO:398 6_22H9 - 1.1 SEQ ID NO:399 623H31. -104- WO 02136782 PCT1USI46227 SEQ ID NQ:-400 6..23H7 1.2 SEQ ID NO:401 6-2Hl 0.9 SEQ ID NO:402 6 3D6 1 SEQ ID NQ:403 6..3G3 1 SEQ ID NO:404 16..3H2 -1 SEQ ID NO:405 WAA1O 11.1 SEQ ID NO:406 16AB1 11 FSEQ ID NO:407 16--5D11 1 SEQ ID NO:408 65F1 1 1.9 FSEQ ID NO:409 16-5G9 1.4 SEQ ID NO:41 0 1 1 .6D5 1 SEQ ID NO:41 1 l6JDl 0.5 SEQID NO:412 6 8H3 1 SEQ ID NO:413 6,.9Gl 1 1.8 SEQ ID NO:414 .6F11. SEQ ID NO:415 71 C4 1.1 [SEQ ID NQ:416 7MA10 0.8 SEQ ID NO:417 7T2-A11 1.1 SEQ ID NO:418 172D71. FSEQ iD NO:419 17.5C7_____________ SEQ ID NO:420 7..9C9 SEQ ID NO:421 9 13F1 0 ______________ SEQ ID NO:422 9 13Fl . SE ID NO43 9 15D5 1.2 ____________ SEQ ID NO:424 9-15DB 1.1 ____________ SEQ ID NO:2 .15 1.9 _ __ _ _ _ __ _ _ _ SEQ ID NO:426 19-1. 8H21. SEQ ID NO:427 9 2OF121 SEQ ID NO:428 9-.210 1B SEQ ID) NO:429 9 .4B SEQ ID NO:430 9 23A1 0 SEQ ID NO:431 9 24F6 0.9 _____________ SEQ ID NO:432 1.5 _ __ _ _ _ _ __ _ _ _ _ SEQ ID NO:433 0 H __ _ _ _ ___6__ _ _ _ SEQ ID N:434 9L H1 1.7 _ __ _ _ _ __ _ _ _ SEQ ID NO:435 19-9H7 0.7 __ _ _ __ _ _ __ _ _ SQ ID N:436 1906 __ _ __.5__ _ _ __ _ _ SEQ ID NO:437 19H1 1 2.3 ____________ SEQ ID NO:436 0 -4B10 0._____________ SEQ ID NQ:439 0 -5B1 1 0.54 ____________ SEQ ID NO:440 10-53 0.39 ____________ SEQ ID NO:441 10 5B40. FSEQ ID NO:4 0 B827 SEQ ID NO:443 0_5C4 0.67 SEQ ID NO:444 0..51 0.67 SEQ ID NO:445 0 53 0.5 SEQ ID NQ:446 0 5D71. SEQ ID NO:447 10-64 0 .8s SEQ ID NO:449 0 Q6D1 0.14 SEQ ID NQ:448 10-6D1 110.44 SEQ ID NO:450 10-6F2 0.34 SEQ ID NO:451 0 H9 0.47 SEQ ID NO452 10 Q4C1 0 0.1 SEQ ID NQ:453 10-4D5 10.1 SEQ ID NO:454 110-4F2 10.2 SEQ ID NO:455 10-4F9 0.1 SEQ ID NO:456 10_4G5 0.58 SEQ ID NO:457 10_4H40. - 105 - WO 02/36782 PCTIUSOI/46227 SEQ ID NO:458 11 3Al 0.1 SEQ ID NO:459 ll-3B1 0.63 SEQ ID NO:460 11 3B5 0.26 SEQ ID NO:481 11 3012 0.1 SEQ ID NO:462 11 SC3 0.22 SEQ ID NO:463 11 3C6 0.21 SEQ ID NQ:464 11 -3D6 0.1 SEQ ID NO:465 1_lG12 0.1 SEQ ID NO:466 1 iH1 1.8 SEQ ID NO:467 1i H2 0.44 SEQ ID NO:468 11 H5 1.5 SEQ lb NO:469 1i..2A12 1.3 SEQ ID NO:470 12B6 0.58 SEQ ID NO:471 .12C4 0.8 SEQ ID NO:472 1 2D2- 1.2 SEQ ID NO:473 1 2D4 1.2 SEQ ID NO:474 1~2F8 1.9 SEQ I D NO:475 1-2H8 0.48 SEQ ID NO:476 1--3A2 0.8 SEQ ID NO:477 1 3D6 3.5 SEQ ID NO:478 1 SF3 1.5 SEQ ID NO:479 1 3H2 0.7 SEQ ID NO:480 11 4C5 0.93 SEQ ID NO:481 1 4D6 1.4 SEQ ID NO:482 l-4H1 1.2 SEQ ID NO:483 1 5H5 0.51 SEQ ID NO:484 1 6F12 14.7 SEQ ID NO:485 1--.6H6 1.05 SEQ ID NO:486 3 h1AlO 0.17 SEQ ID NO:487 3 14F6 0.25 SEQ ID NO:488 3 15B2 0.1 SEQ ID NO:4B9 3 .. 6A1 0.6 SEQ ID NO:490 3,_6B1 0.43 SEQ ID NO:491 3 7F9 0.29 SEQ ID NO.492 3 8G1 1 0.1 _____________ SE Q ID NO:493 4 iBlO 0.1 SEQ ID NO:4-94 5 B30. SEQ ID NO:495 5_2D9 1. FSEQ ID NO:496 15-2F1 0 __0. SEQ ID NO:497 16- Al 1_0. SEQ ID NO:498 16_1 D5 0.1 ________ SEQ ID NO.499 16- Fli 1_ _ ___0.1__ _ __ _ SEQ ID NO:500 61 Fl 0.1 _____________ SQ ID NO:501 6_1H1O 0.1 ___________ SEQ ID NO502 6 1 H4 0.1 _ __ _ _ __ _ _ __ _ _ SEQ ID NO:503_ 8_1 F80. SEQ ID NO:504 8_1G20. SEQ ID NO:505 8 1 G3 1. SEQ ID NO:506 .8-1 H7 . SEQ ID NO:507 8 1 H9 0.1 ____________ SEQ ID NO:508 JGAT1.21 F12 4.6 ____________ SEQ ID NO:509 GATL_24G3 3.8 ____________ SEQ ID NO:51 0 GAT1 29G-1 4_____________ SEQ ID NO:511 GATl-32G1 3.3 ____________ SEQ ID NO;512 GAT2j5 G8 _____.8_______ SEQ ID NO:51 3 GAT2J-19H8 2_________8____ SEQ ID NO:514 GAT2_2I Fl _______________ -106- WO 02/36782 PCT/USO1/46227 Table 5. GAT polypepfide ke Km values ISEQ ID NO. Clone ID IKV/K(mWmirn )I SEQ ID N0:263 1)3_1OFS 7. SEQ ID NO:264 1312G6 143.4 SEQ ID NQ:265 14_2A5 175.5 SEQ ID N0:266 14_201 43 SEQ ID NO:267 14 2F1 1 80.6 SEQ ID NQ:268 CHIMERA11. SEQ D N270 0_1F4 37.6 SEQ ID NO:271 110171 80 L. 1 SEQ ID N0:272 110_1 7FS 134.2 SEQ ID NO:273 1101 8G9 20 SEQ ID NO:274 10 1H3 40.1 SEQ ID N0275 10_20DI10 E53.9 [SEQ ID N0:276 110-23F2 112.5 FSEQID NO0277 10- 2B8 98.5 SEQ ID NQ:278 102796.4 SEQ ID NO:279 1 03G5 43.7 FSEQ ID N0:2B0 10Q_4-7 65.9 LSEQ ID NQ:281 10-6D1 1 52 FSEQ ID NO:282 10 806 31 SEQ ID NO:283 1103 0.9 SEQ ID NO:284 11G33 8.9 rSEQ ID NO:285 11H3 0.9 SEQ ID NQ:2B6 1121A F9 26.8 SEQ) ID NQ:287 112 2G9 101 SEQ ID NQ:288 112-3F1 49 SEQ ID N0289 12 -510 59.7 SEQ ID NO:290 12 6AlO0 49.7 SEQ I D NO:291 12 6D1 40.8 SEQ ID NO:292 12-..6F9 46.9 SEQ ID NO:293 12-6H6 56.5 SEQ ID NO:294 12_7D6 38.5 SEQ ID NO:295 112G172 2. ISEQ ID NQ04 3_( 5. SEQ ID N:98 13 2H6 14.3 SEQ ID NO:308 14 1009 124. SEQ ID NO:307 1 1D 03 15.3 SEQ ID NQ:301 14_10H9 115.6 SEQ ID NO:309 14j102 10.7 SEQ ID N0:30 14jI2ID8 32.1 SEQ ID NO:311 112HB6 10.3 SEQ N ID NO:312 14_1B H315.3 I SEQ ID N0:311 14 12H61 49.61 SEQ ID NO:314 14...3B2 80.9 - 107 - WO 02136782 PCTIUS01/46227 SEQ ID NO:315 14_4H8 9. SEQ ID NO:316 14_6A8 5 FSEQ ID NO:31 7 14_6B10 9. SEQ ID NO:318B 14_6D4 256 _________ SEQ ID NO:319 14M 715. SEQ ID NO:320 14 _7A1 7. SEQ ID NO:321 14..7A9 76.9 _________ SEQ ID NQ:322 14 7G1 207.1 ___________ SEQ ID NO:323 14 7H9 49__________5__ SEQ ID NO:324 14 8F7 50____________ SEQ ID NO:325 15 1002 ____7.3________ SEQ ID NO:326 15 1 0D6 6. SEQ ID NO:327 15L 11F9 76.4 __________ [SEQ ID NO:328 15 11 H3 61_______.9___ FSEQ ID NO:329 15- 12A8 8 SEQ ID NO:330 15 12D620. SEQ ID NO:331 15 2DB 45.9 ____________ SEQ ID NO:332 15 12LD9 42.6 ____________ SEQ ID NO:.333 15 3F1 0 54.1 _____________ SEQ ID NO:334 15 3G11 5. SEQ ID NQ:335 15_4F1 SEQ 1D93.36154H SEQ ID NO:337 1 56D3 6. SE I N:38 15 BG 11 4 SEQ ID NO:339 15 9F6 54.2 SEQ ID NO:340 155 0.2 SEQ ID NO:341 1 SAl 3.6 SEQ ID NO:342 16H3 1.2 SEQ ID NO:343 17012 2.3 SEQ ID NO:344 I18D6 8 SEQ ID NQ:345 1906 2 SEQ lb NO:346 19D5 1.3 SEQ ID NQ:347 20A12 2.5 SQ ID NQ:348 20F2 2______________ SEQ ID NO:349 2.1 OE1 ______.2________ SEQ ID NO:350 23H111. SEQ ID NO:351 24012. SEQ ID NO:35q 12.406 . I SEQ ID N Q:3 3 2.10E+12 8. SEQ ID NO:354 2-5; 1603 SEQ ID NQ:355 2H3 17.7 ___________ SEQ ID NO:356 30GB . SEQ ID NO:357 3B 1004 5. SEQ ID NO:358 3B51 G7 19. SEQ ID NO:359 3B_1251 1 SEQ ID NO:360 3B_12D10 SEQ ID NO:361 3 B 2E5 12.6 _ _ _ __ _ _ __ _ _ SEQ ID NO:362 30- 1H3 ___.30.8________ SEQO ID NO:363 30 12H10 7. SEQ ID NO:364 30 9H8 1. SEQ ID NO:365 4A 1B11 15 __________ SEQ ID NO:366 4A 12 1 SEQ ID NO:367 4B_1 3E1 18.6 SEQ ID NO:368 4B 13G10 4.6 SEQ ID NO:369 4B I GEl 17 SEQ) ID NO:70 4B 17A1 17.4 f SEQ -IDNO:.371 4B 1Bl 8 .6 SEQ ID N 0:372 -4B 1908 1324 -109g- WO 02136782 PCTIUSO1I46227 SEQ ID NO:373 4B 1 G4 3.7 SEQ ID NO:374 4B 210C6 14.8 SEQ ID NO:375 4B -2H 4.4 SEQ ID NQ:376 4B 2H8 31.9 SEQ ID NQ:377 14B_6DB 15.2 SEQ ID NO:378 4E 7E8 17.1 SEQ ID NO:379 4C_809 15.1 ___________ SEQ ID NO:380 4H1 0.9 ____________ SEQ ID NO:381 6 14D10 2. SEQ ID NO:3B2 161 5G-7 3. SEQ ID NO:383 16 16A 3A SEQ ID NO:384 6 1 6F535 SEQ ID NO:385 6 17C5 2. SEQ ID NO:386 16-1 8C726 SEQ ID NO:387 61 1BD7 . SEQ ID NO:388 6 .1 9A 014. SEQ ID NO:389 6.19B6 14 SEQ ID NO:390 6 1903 1. SEQ ID NO:391 6 1908G7. SEQ ID NO:392 6-297 4. SEQ ID NO:393 6 20A9 3. SEQ ID NO:394 6 20H5 2. SEQ ID NO:395 6 21F413. SEQ I D NO:396 6-MC 22.9 SEQ ID NO;397 6-22D913. SEQ ID NO:398 -6. 22H9 1. SEQ ID NO:399 6 23H3 3. SEQ ID NO:400 6--23H7 3. SEQ ID NO:401 6-2H1 29.5 SEQ ID NO:402 l6 3D6 41.7 SEQ ID NO:403 6 3G3 51.9 SEQ ID NO:404 &3257.2 SEQ ID NO:405 6 A 0 50 SEQ ID NO:406 6 .AB1 27 SEQ ID NtO:407 6 5D1 1 15.2 SEQ ID NO:408 &5F1 1 21.1 SEQ ID NO:409 16_5G9 25.6 SEQ I D NO:41 0 16_6D5 55.3 SEQ ID NO:411 6 7D1 39.5 SEQ ID NO:412 63-H3 44.7 SEQ ID NO:413 6 9G1 1 60.3 SEQ ID NO:414 SF1 5.6 S-E-Q ID NO:415 17-104 15.9 SEQ ID NO:416 -7 2A1 0 18.2 SEQ ID NQ:417 7- 2A11 42.6 SEQ ID NO:41 8 7-2D7 49.9 SEQ ID NO:419 7-507 44.7 SEQ ID NO:420 7-909 65 SEQ) ID NO:421 9- 3F1 0 49.6 SEQ ID NO:422 9-13F1 28.7 SEQ ID NO:423 9 15D5 23 SEQ ID NO:424 9-15DB 97.6 SEQ IDNO45 9 5H36 SEQ ID ND:426 19..8H2 122.7 SEQ ID NO:427 19-20F2 137.8 SEQ ID NO:428 2 23.8 SEQ ID N:429 9 2B13. SEQ ID NO:430 1923A10 2 - 109 - WO 02/36782 PCT/USO1/46227 SEQ ID NO:431 9,24F6 58.3 SEQ ID NO:432 9_4H10 67.5 SEQ ID NO:433 9_4H8 78.5 SEQ ID NO:434 98H1 44 SEQ ID NO:435 9.9H7 40 SEQ ID NO:436 9C6 5.1 SEQ ID NO:437 9H11 1.7 SEQ ID NO:438 0_4B10 279 SEQ ID NO:439 0_5B11 406 SEQ ID NO:440 0_5B3 367 SEQ ID NO:441 0 5B4 301 SEQ ID NO:442 0 5B8 522 SEQ ID NO:443 0_5C4 306 SEQ ID NO:444 I0_5Dl 1 34 SEQ ID NO:445 |0_5D3 ___ SEQ ID NO:446 10 5D7 222 SEQ ID NO:447 0_6B4 SEQ ID NO:448 O_6D10 1177 SEQ ID NO:449 0_6D11 SEQ ID NO:450 0.6F2 516 SEQ ID NO:451 0_6H9 SEQ ID NO:452 10-4C10 695.98 SEQ ID NO:453 10-4D5 827.16 SEQ ID NO:454 10 4F2 SEQ ID NO:455 10 4F9 SEQ ID NO:456 10_4G5 SEQ ID NO:457 10 4H4 SEQ ID NO:458 11 3A1 1 SEQ ID NO:459 11 3B1 349.17 SEQ ID NO:460 11 3B5 748.49 SEQ ID NO:461 11_3C12 490.67 SEQ ID NO:462 11_3C3 972.81 SEQ ID NO:463 11306 878.27 SEQ ID NO:464 11 3D6 553.01 SEQ ID NO:465 1_1G12 584.79 SEQ ID NO:466 1_1H1 162 SEQ ID NO:467 1_1H2 366 SEQ ID NO:468 1 1H5 63 SEQ ID NO:469 1_2A12 176 SEQ ID NO:470 1 2B6 239 SEQ ID NO:471 1_2C4 242 SEQ ID NO:472 1_ 102D2 SEQ ID NO:473 1 2D4 152 SEQ ID NO:474 1_2F8 SEQ ID NO:475 1 2HB SEQ ID NO:476 13A2 SEQ ID NO:477 1 3D6 -4 SEQ ID NO:478 1_3F3 SEQ ID NO:479 1.3H2 SEQ ID NO:480 1 4C5 SEQ ID NO:481 1_4D6 SEQ ID NO:482 1.4H1 196 SEQ ID NO:483 1 5H5 SEQ ID NO:484 1 6F12 SEQ ID NO:485 1_6H6 SEQ ID NO:486 3_11Al 0796.55 SEQ ID NO:487 3_14F6 -11- .3 0 WO 02136782 PCT/US01/46227 SEQ ID NO:489 3_SA10 191.64 SEQ ID NO:490 3_6B1 611.81 SEQ ID NO:491 3_7F9 667.4 SEQ ID NO:492 3 _G1 1 991.44 SEQ ID NO:493 4_1B10 770.91 SEQ ID NO:494 5_2B3 567.5 SEQ ID NO:495 5_2D9 754.36 SEQ ID NO:496 5_2F10 547.22 SEQ ID NO:497 6,lAll 455.41 SEQ ID NO:498 61 D5 429.16 SEQ ID NO:499 61F11 1057.6 SEQ ID NO:500 61 F1 698.15 SEQ ID NO:501 6_1HiD 170.11 SEQ ID NO:502 61 H4 859.12 SEQ ID NO:503 8_1 F8 828.78 EQ ID NO:504 8_1G2 674.73 SEQ ID NO:505 B_1G3 1088.97 SEQ ID NO:506 8_1 H7 1012.4 SEQ ID NO:507 81 H9 783.89 SEQ ID NO:508 GAT1 21 F1 2 1.2 SEQ ID NO:509 GAT1-24G3 1.3 SEQ ID NO:51 0 GAT1_29G1 1.5 SEQ ID NO:51 1 GAT1 32G1 1.4 SEQ ID NO:512 GAT2_15G8 1.6 SEQ ID NO:513 GAT2_19H8 |1.5 SEQ ID NO:514 GAT2_21F1 _1.4 KM for AcCoA is measured using the mass spectrometry method with repeated sampling during the reaction. Acetyl-coenzyme A and glyphosate (ammonium salts) are placed as 50-fold-concentrated stock solutions into a well of a mass spectrometry 5 sample plate. Reactions are initiated with the addition of enzyme appropriately diluted in a volatile buffer such as morpholine acetate or ammonium carbonate, pH 6.8 or 7.7. The sample is repeatedly injected into the instrument and initial rates are calculated from plots of retention time and peak area. Km is calculated as for glyphosate. 10 EXAMPLE 8: SELECTION OF TRANSFORMED E. COLI An evolved gat gene (a chimera with a native B. licheniformis ribosome binding site (AACTGAAGGAGGAATCTC; SEQ ID NO:515) attached directly to the 5' end of the GAT coding sequence) was cloned into the expression vector pQE80 (Qiagen) between the EcoRI and Hind1Ii sites, resulting in the plasmid pMAXY2190 (Figure 11). 15 This eliminated the His tag domain from the plasmid and retained the B-lactamase gene conferring resistance to the antibiotics ampicillin and carbenicillin. pMAXY2190 was electroporated (BioRad Gene Pulser) into XL1 Blue (Stratagene) E. coli cells. The cells were suspended in SOC rich medium and allowed to recover for one hour. The cells were - 111 - WO 02/36782 PCT/USO1/46227 then gently pelleted, washed one time with M9 minimal media lacking aromatic amino acids (12.8 g/L Na2HPO4.7 H20, 3.0 g/L KH2PO4, 0.5 g/L NaC1, 1.0 g/L NH4C1, 0.4% glucose, 2 mM MgSO4, 0.1 mM CaC12, 10 mg/L thiamine, 10 mg/L proline, 30 mg/L carbenicillin), and resuspended in 20 ml of the same M9 medium. After overnight growth 5 at 37*C at 250 rpm, equal volumes of cells were plated on either M9 medium or M9 plus 1 mM glyphosate medium. pQE80 vector with no gat gene was similarly introduced into E. coli cells and plated for single colonies for comparison. The results are summarized in Table 6 and clearly demonstrate that GAT activity allows selection and growth of transformed E. coli cells with less than 1% background. Note that no IPTG induction was 10 necessary for sufficient GAT activity to allow growth of transformed cells. Transformation was verified by re-isolation of pMAXY2190 from the E. coli cells grown in the presence of glyphosate. Table 6. Glyphosate selection of pMAXY2190 in E. coli 15 Number of colonies Plasmid M9- glyphosate M9 +1 mM glyphosate pMAXY2190 568 512 pQESO 324 3 EXAMPLE 9: SELECTION OF TRANSFORMED PLANT CELLS Agrobacterium-mediated transformation of plant cells occurs at low efficiencies. To allow propagation of transformed cells while inhibiting proliferation of 20 non-transformed cells, a selectable marker is needed. Antibiotic markers for kanamycin and hygromycin and the herbicide modifying gene bar, which detoxifies the herbicidal compound phosphinothricin, are examples of selectable markers used in plants (Methods in Molecular Biology, 1995, 49:9-18). Here we demonstrate that GAT activity serves as an efficient selectable marker for plant transformation. An evolved gat gene (0_5B8) was 25 cloned between a plant promoter (enhanced strawberry vein banded virus) and a ubiquinone terminator and introduced into the T-DNA region of the binary vector pMAXY3793 suitable for transformation of plant cells via Agrobacteriwn tumefaciens EHA105 as shown in Figure 12. A screenable GUS marker was present in the T-DNA to allow confirmation of transformation. Transgenic tobacco shoots were generated using 30 glyphosate as the only selecting agent. Axillary buds of Nicotiana tabacum L Xanthi were subcultured on half strength MS medium with sucrose (1.5 %) and Gelrite (0.3 %) under 16-h light (35-42 -112- WO 02/36782 PCT/USO1/46227 PEinsteins m s-1, cool white fluorescent lamps) at 24 *C every 2-3 weeks. Young leaves were excised from plants after 2-3 weeks subculture and were cut into 3 x 3 mm segments. A tunefaciens EHA105 was inoculated into LB medium and grown overnight to a density of A600= 1.0. Cells were pelleted at 4,000 rpm for 5 minutes and resuspended in 3 5 volumes of liquid co-cultivation medium composed of Murashige and Skoog (MS) medium (pH 5.2) with 2 mg/L N6-benzyladenine (BA), 1% glucose and 400 uM acetysyringone. The leaf pieces were then fully submerged in 20 ml of A tumefaciens in 100 x 25 mm Petri dishes for 30 min, blotted with autoclaved filter paper, then placed on solid co-cultivation medium (0.3% Gelrite) and incubated as described above. After 3 days 10 of co-cultivation, 20-30 segments were transferred to basal shoot induction (BSI) medium composed of MS solid medium (pH 5.7) with 2 mg/L BA, 3% sucrose, 0.3% Geltite, 0 200 uM glyphosate, and 400 ug/ml Timentin. After 3 weeks, shoots were clearly evident on the explants placed on media with no glyphosate regardless of the presence or absence of the gat gene. T-DNA transfer 15 from both constructs was confirmed by GUS histochemical staining of leaves from regenerated shoots. Glyphosate concentrations greater than 20 uM completely inhibited any shoot formation from the explants lacking a gat gene. Explants infected with A tumefaciens with the gat construct regenerated shoots at glyphosate concentrations up to 200 uM (the highest level tested). Transformation was confirmed by GUS histochemical 20 staining and by PCR fragment amplification of the gat gene using primers annealing to the promoter and 3' regions. The results are summarized in Table 7. Table 7. Tobacco shoot regeneration with glyphosate selection. Glyphosate concentration % Shoot Regeneration transferred 0 uM 20 uM 40 uM 80 uM 200 uM genes GUS 100 0 0 0 0 gatand 100 60 30 5 3 GUS 25 -113- WO 02136782 PCT/US01/46227 EXAMPLE 10: GLYPHOSATE SELECTION OF TRANSFORMED YEAST CELLS Selection markers for yeast transformation are usually auxotrophic genes that allow growth of transformed cells on a medium lacking the specific amino acid or nucleotide. Because Saccharomyces cerevisiae is sensitive to glyphosate, GAT can also 5 be used as a selectable marker. To demonstrate this, an evolved gat gene (0L6D10) is cloned from the T-DNA vector pMAXY3793 (as shown in Example 9) as a PstI-ClaI fragment containing the entire coding region and ligated into PstI-ClaI digested p424TEF (Gene, 1995, 156:119-122) as shown in Figure 13. This plasmid contains an E. coli origin of replication and a gene conferring carbenicillin resistance as well as a TRP1, tryptophan 10 auxotroph selectable marker for yeast transformation. The gat containing construct is transformed into E. coli XL1 Blue (Statagene) and plated on LB carbenicillin (50 ug/ml) agar medium. Plasmid DNA is prepared and used to transform yeast strain YPH499 (Stratagene) using a transformation kit (Biol01). Equal amounts of transformed cells are plated on CSM-YNB-glucose 15 medium (Biol0l) lacking all aromatic amino acids (tryptophan, tyrosine, and phenylalanine) with added glyphosate. For comparison, p424TEF lacking the gat gene is also introduced into YPH499 and plated as described. The results demonstrate that GAT activity function will as an efficient selectable marker. The presence of the gat containing vector in glyphosate selected colonies can be confirmed by re-isolation of the plasmid and 20 restriction digest analysis. While the foregoing invention has been described in some detail for purposes of clarity and understanding, it will be clear to one skilled in the art from a reading of this disclosure that various changes in form and detail can be made without 25 departing from the true scope of the invention. For example, all the techniques, methods, compositions, apparatus and systems described above may be used in various combinations. The invention is intended to include all methods and reagents described herein, as well as all polynculeotides, polypeptides, cells, organisms, plants, crops, etc., that are the products of these novel methods and reagents. 30 All publications, patents, patent applications, or other documents cited in this application are incorporated by reference in their entirety for all purposes to the same extent as if each individual publication, patent, patent application, or other document were individually indicated to be incorporated by reference for all purposes. -114- WO 02/36782 PCT/US01/46227 SEQ ID NO. Clone ID Sequence SEQ ID NO: 1 ST401 gat ATGATTGAAGTCAAACCAATAAACGCGGAAGATACGTA TGAGATCAGGCACCGCATTCTCCGGCCGAATCAGCCGC TrGAAGCATGTATGTATGAAACCGATTGCTCGGGGGT GCGTTTCACCTCGGTGGATATTACCGGGGCAAGCTGATC AGCATCGCTTCCTTTCATAAAGCCGAACATrCAGAGCTr GAAGGCGAAGAACAGTATCAGCTGAGAGGGATGGCGA CGCTTGAAGGATACCGTGAGCAAAAAGCGGGAAGCAC GCTCATCCGCCATGCCGAAGAGCTTCTTCGGAAAAAGG GGGCAGACCTITTATGGTGCAATGCCAGGACATCTGTG AGCGGCTACTATGAAAAGCTCGGCTTCAGCGAACAGGG CGAAGTCTACGACATACCGCCGATCGGACCTCATATTTT GATGTATAAGAAATTGACGTAA SEQ ID NO:2 B6 gat ATGATTGAAGTCAAACCTATAAACGCGGAAGATACGTA TGAGATCAGGCACCGCATTCTCCGGCCGAATCAGCCGC TGGAAGCATGCAAGTATGAAACCGATTTGCTCGGGGGC ACGTTTCACCTCGGCGGATATTATCGGGACAGGCTGATC AGCATCGCCTCCTTTCATCAAGCCGAACATTCAGAGCTT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CACTTGAAGGGTACCGCGAGCAAAAAGCGGGCAGTACG CTTATCCGCCATGCCGAAGAGCTTCGGAAAAAAGG CGCAGACCTTTTATGGTGCAACGCCAGGACATCTGTGA GCGGGTACTATAAAAAGCTCGGCTTCAGCGAACAAGGC GGGGTCTACGATATACCGCCGATCGGACCTCATATTTTG ATGTATAAGAAATTGACATAA SEQ ID NO:3 DS3 ATGATTGAAGTCAAACCAATAAACGCGGAAGATACGTA TGAGATCAGGCACCGCATCTCCGGCCGAATCAGCCGC TTGAAGCATGTATGTATGAAACCGATTTGCTCGGGGGT ACGTTTCACCTCGGTGGATATTACCGGGGCAAGCTGATC AGCATCGCTTCCTCATAATGCCGAACATrCAGAGCTT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CGCTTGAAGGGTACCGCGAGCAAAAAGCGGGAAGTAC GCTTATCCGCCATGCCGAAGAGCTTCTTCGAAAAAAAG GCGCGGACcTiTTATGGTGCAACGCCAGGATATCTGTG AGCGGCTACTATGAAAAGCTCGGCTTCAGCGAACAAGG CGGGATCTACGACATACCGCCGATCGGACCTCATATT GATGTATAAGAAATTGGCATAA SEQ ID NO:4 NHA-2 ATGATTGAAGTCAAACCAATAAACGCGGAAGATACGTA TGAGATCAGGcACCGCATTCTCCGGCCGAATCAGCCGC TTAAGCATGTATGTATGAAACCGATTTGCTCGGGGGT ACGTTTCACCTCGGTGGATATTACCGGGGCAAGCTGATC AGCATCGCTTCCTTCATAATGCCGAACATTCAGAGCTT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CGCTTGAAGGGTACCGCGAGCAAAAAGCGGGAAGTAC GCTTATCCGCCATGCCGAAGAGCTTCTTCGAAAAAAAG GCGCGGACCTTTTATGGTGCAACGCCAGGATATCTGTG AGCGGCTACTATGAAAAGCTCGGCCTCAGCGAACAAGG CGGGATCTACGACATACCGCCGATCGGACCTCATATT - 115- WO 02/36782 PCTfUSOI/46227 GATGTATAAGAAATrGGCATAA SEQ ID NQ:5 NHS-2 ATOAITGAAGTCAAACCAATAAACGCGGAAGATACGTA TGAGATCAGACCGCATI7CTCCGGCCGAATCACCC TTGAAGCATGTATGTATGAAACCATTCTCGTT GOGTiTcAccTcGGTGGATA'TrAccAGGGCAAGCTGAC AGCATCCTTCCTITCATAAACCGAACATCAGACT GAGGGC~AA.ACAGTATCAGCTGAAGGAC7GA Cc1cTrAAGGATACCGTCOAGCAAkAAAGCGGAACAC GTCATCCGCCATOCCGkAAOAGCTCTTCGAAAAAG GGGCAGACCmTATGGTGCAATGCCAGGACATCTG AGCGGCTACTATAAAOcTCGcTCACAACAG CGAAGTCTACGACATACCGCCATCGGACCTCATAT= I-IA 'tOTrAT'rA AGlA AA TT ACOrTAA SEQ DD NO:6 ST401 MIEVKApINAEDT~yE1RERLRPNQPLE-ACMYEiDLLCGGAFH GAT LGGYYRGKLISIASFHKAEHISEL-EEQYQLRGM-ATLEGY REQKAosTRnAELRKKGADU-LWCNARTSVSGYE _____________LGFSEQGEVYDIPPIGPHMMYYKKLT SEQ IDNO:7 B6 GAT MVIEVYPNADTYRRRIIRLRPNQPLEACK-YEEDLLGGUII LGGYYDLSIASFHQAEOQKQYQGMATLGY
REQKAGSLHUAELRKGADLLWCNARTSVSGYK
LGFSEQGGVYDIPPIGPHaIAYKKLT SEQ ID NO: 8 DS3 GAT MIEVK2INAEDTiY.E]HILRPNQPLEACMYETDLLGGTFH LOGYYRGIKISASFENAEI-SELBOQKQYQLRGMA ThEGYREQKASThHAlEELRKKGADLLWCNRSVSG YYEKLGFSEQGGIIPIGPHaLMYXK1C~ SEQ ID NO:9 NRA-2 MMVPNAEDTYEL~RPNQPL-4EACMLGGT GAT LGYRTLSAFNESEEQ(YLGALG RBEQKAGiSTLUI TAEP F 1TCADLLWCNARISVSGYYEKL _______GLSEQGIYDTPPTIGrPB1IMYKKLA SEQ ID NE{5-2 MIEVIKPINAEDYRRIIPNQPLEACMTIGAF NO: 10 GAT LGTGYYQKIQSIASFAHSELEGEEQYQIMTLOY REQKAGSTLJIAEELLRKKOADLLWCNARTSVSYYE ______ LFSEQGEYYD]PPIGPH]UAYXKLT SEQ ID 13_106 ATGATIGAAGTCAAACCAATAAACGCGAAGATACGTA NO:M1 TGAGATCAGCACCOCATCTCCCCAATCACG TGGAAGCATGCAATATGACCOTTAGAGGT ACGnTCACCTCGGTGATATACCGGCTGAC AGCATCCCTCCTCATCAAGCCGAACATCCAGACT GAAGCCAAAACAGTATCACTAAAGGCGA CACTCGAAGGATACCTGAGCAAAGAAGAC GCTCATCCOccATCCAACACTCAAAAA GCGCGGACCTTATGGTGCAACGCCAGGACGTG AGCGGGTACTATAAAAAGCTCGGCCAGCAO CGAAGTCTACGACATACCGCCGGTCGACCTCATA = ______ ATO-TATAAGAAATACGTAA SEQ ED 13_12G6 ATGATGAAGTCAAACCAATAAACCGAAGATACGTA NO: 12 TGAGATCAG~cACCGCArCTCCOGCCGATCAGCCC TOG AOCATGCAAGTATGAAACCGITGCTr-AOGGGT 000 ITCACCTCGGTOATATTACCGGGCAACTGGTC ______AGCATCGCCTCTACACOAACAA' -116- WO 02/36782 PCTUSO1/46227 GAGCCAAAGACAGTATCACTGAGAGGATGGA CAcrGAAGGGTACCTGAGCAAGCGGATACG CTTATCCCCAT CCGAAGAGGC~AAACi CGCAGACCTCTTATGGTGCAACGCCAGGACATCMCGA GCGTCAAAACCCYCrACACCG GAAGTCTACGACATACCGCCGACTGGCCCATA = G ATGTATAAIA A AITTGACATA-A SEQ ID 14_2A5 ATGATGAAGTCAAACCAATAAACGG3kGATACGTA NO: 13 TCTAGATCAGCACCGCACTCCCGAATCAGCCGC TGGAAGCATGCAAGTATGAAACCGAGTCA AcGTrcAccTcGGTGGATAAcc~cT~cATGATcI AGACCM=ATAGCACTCGGT GAAGGCCAAAACAGTATCAGCTGAGAATGGA CACTAAGGGTACCGTGAAACGAAGAC GCTATCCGCCATGCCGAAGAGTGGAAAAAAG GCGCGGACCmTATGGTGCAACGCCAACGTG AGCGGTACTATAAAAA~-GCTCAGACAG CG-FAGTCTACGACACACCGCCGGTCGGACCTCATAr= ____GATGT AT A A GAAA-ITGACGTAA SEQ ID 14_2C1 -ArGATAACAAAGGAGTCT NO: 14 TGAGATCAGcAccGcA~cTccGCCGATcA~cc TGGAAGCATCAAGTATGAAACCGGCAGAGGT G-CGTTCACCTCGGTGGATATACCACTGTC AGCATCGG-TccTT~cATcAAGCCG~cATcCAGACT GAAGGCCAAAACAGTATCACTGAGAGGATGGA CACTCAAGATACCGTGAAACGGATACG CTTATCCCCATGCCGAAG CGAAGG CGOGACTATAGCTGAC~CAGCACAGAW GAAGTCTACACACACCGCCGACTGGCCCATA=m GATGTATAArA A ATTGACGTAA SEQ ID 14_2F1l1 ATGATGIAGTCAAACCAATAAACCGAGATACGTA NO: 15 TGAGATCAGCACCGCACTCCGCGAATCAGCCGC TGGAGCATGCAAGTATGAACGTA~ GCG = ACCGTATATTACACTGTC AGCATCGCCTCCTCATCAAGCCAACATCCAGACT GAAGCCAAAAACAGTATCAAGAGGATGGA CACTCGAAGGATACCGTGAAACGGAGTACG C~rATCCGCCATOCCGAACGGAAGG GGCAGACCTCTATGGTCAACGCCAGACATCGGA GCGGGTACTATAAAAAGCTCGGCAGCAGG GAAGTCTACGACACACCGCCGGCCGACCCCATA=m GATGTATAAGAATGACGTA SEQ ID CHIMER ATGIATGAAGTCAAACCAAT(,CGGAGATACGTA NO: 16 TGAGATCAGCACCGCACTCCGATCACG ITGAAGCATGTATGTATGAAACCGAGTCAGGT GcGTrCACCTCGGTGGATATTAC CAGAT AGcATcGcTccTcATACCGA~cAAGAc~ GAAGGCCAAAAACAGTATCAGTGAGAATGGA CACYrGAAGGATACCGCA AA-GGCAGTACG, -117- WO 02/36782 PCTUSOI/462Z7 CTTATCCGCCATGCCGAAGArCTTcAAGG GGCAGACCTTTATGGTGCAACGCCAGGACATCTGCGA GCCGGGTACTATAAAAGCTCGGTcCACGAACAGG GAAGTCTACCGACACACCCCGGTCGGACCTCATA=TG ATGTATAAGTAAATrGACGTAA SEQ ID) 10_12D7 ATGATTGAAGTCAAACCAATAAACGCGGAAGATACGTA NO: 17 TGAGATcAG~cAccGNArcTccGccGAATcA~Cc~ TGGAACATGCAAGTATG ACCGATGGG ACG-(-CACCTCGGTGGATATACCOAACGAT CAGCATCGCrCCTITCATCAAGCCGAACATCCAGAC TGAAGGCCAAAAACAGTATCAGTGAGAcGATG ACACTGAAGAGTACCGGAGAAAGG AC CGCTCATCCGCCATGCC~AAGAGCTCTCGAAA-G 000 3CAGACCTCTTATGGTGCAACGCCAGGACATCTGC GAGCGCGGTACTATAAAG0cCGGcTrCAGCGACAAG GCGAAGTCTACGACATACCGCCGACCGGACCCCATAT 'rrGATGTATAAGAAATTGACGTAA SEQ ID 10_15F4 ATGiATTGAAGTCAAACCAATAAACGCGAAATACGTA NO: 18 TGAGATCAGCACCGCATCCCGCCGATCACG TTGAAGCATGTATGTATGAACCGA = G~CTCAGGGGT ACGT=CACCTCGGTGG0TAA~CGGCAAGC 0 TC AGCATCGCTCCfTrCATCAAGCCGAACATCCAGAGT GAAGGCCAAAACAGTATCAGCTAGAGGA']CGA CA=CTAAGAGTACCGCGAGCAAAGAAGCAC GCTCATCCGCCATGCCGAGArCTCGC 0 0 AA GGGCAGACCTT'ATGGTGCAACGCAACAGC AGcGGGTAcTATAAAAGcTCGGCAGCGAGCA CGGGGTCTACGACATACCGCCGGTCGGACATAF= _______GATGTTATAAGAAKTGPACGTAA SEQ IID 10_17D 1 ATGATrGAAGTCAACAAAACG&ATACGA NO: 19 TGAGATCAGCACCGCATCC0CATA TGGAAGCATCAAGTATGAACC 0ArGTTGTG ACGTCACCTCGGTGGATATTACCGCAGCTATIC AGCATCGTTCC-TCATcAAGGcGAAcATCCAACT GAAGGCCAAAAACAGTATCAGCTGAGAAGGGA CACTAAGGGTACCGCGACAAAAGGAGTACG cTTATCCGccATGGCGAAGAGCTTCTCGAAAAGG CGCAGACCII? ATGGTGCAACGCCAGACATCTCA GCGGGTACTATAAAAAGCTCGGCTCAGCGAACGO GAAGTCTACGACACACCGCCGGTCGGACCTCATAmIT ATGTATAAGAAATI'GACGTAA SEQ ID 10_17F6 AT0ATrGAAGTCAAACCAATAAACGAAATACGTA NO:20 TGACGATCAGCACCGCA-TCXTCCGGCCGAATCAGCCC TGGAAGcATGcAAGTATGAAA CCGArGTC 0 0 GGG ACGITCACCTCGGTGGATA ACCGAGCGTC AGCATCGCTTCCTTCATCAAGCCAACAAGAC~ GAAGGCCAAAAACAGTATCAGCTGAGAGGATGGA GCYTATCCGCCATGCCGAAGAGCTCTI'CGGAAA AGO GCGCAGTACCJTTTATGGTGCAACGCCAGGACATCTGCG -118- WO 02(36782 PCTIUSOI/46227 AGCCGGTACTATAAAAACTCGCTTCGAAGCAG CGAAGTCTACGACATACCCGCGGTCGGACCTCATA=m _____ ATGT-'ATA AGAA ATGACTA A SEQ ID 10_18G9 ATGAITGAAGTCAAACCAATAAACG(CGGuAAGATACGTA NO:21 TGAGATCAGCACC0cA=CCCGGCCGATCACO TGCGAAGCATGCAAGTATGAACTGATTJGCTCGGTG ACGTTCACCTCGTGATATACCGCAGCTGTC AGCATCGTTCC=TATCAAGCGAAcATCAGAC~ GAAGGCCAAAAACAGTATCAGCTAGAGGATGGA CACTTGAAGAGTACCGCGAGCAAAAAGCGKAGAC GCTCATCCGCCATGCCGAAGAGCTCAAAGCT GGGCAGACCTCTTATGGTGCAACGCCAGGACATG AGCGGGTACTATAAAAACTC0CTCAGCAG C0GGTCTACGrACATACCGCCGGTCGACCTCATAr= ________GATGTATAAGAAATrGACGTAA SEQ ID 10_1113 ATG3ATT.AAGTcAAACCAATAAACGCGGAGATACGTA NO:22 TGAGATCAGCACCGCArCTCCGCCGATCACG TGAGAGAGAGAACATGTGGG ACGTrCACCTCGGTATATTATCGGAGTGGTC AGCATCGCrCTCATCAAGCCGAACATCCAACT GAAGGCCGAAACAGTATCAGCTGAGAGGATGGA CACTTC0AAGGGTAccGCGAGCAAAAACGGAGTACG CGCGGACCmTATGGTGCAACCCAGACATCGGA GCGGGTAcTATAAAAAGCTCGGTCAGCGAACAGG GAAGTCTACGACATACCCCGACCGGACCCATAr= ______GATGTATAAGAAATIGACATAA SEQ ID 10_20D10 ATGATT0AAGTCAAACCA ATAAACCrGGAGATACGTA NO:23 TGAGATCAGGACCGCACTCCGGCGAACG TrAAGCATGTATGTATGAAACCATGCTCGGG AcGcTcAccTCGTGGATATACCGTGG0AAc3TAT CAGCATCGCCTCCT1TCATCAAGCCGAAcATCCAGACT TGAAGGCCAAAACAGTATCAGCTGAGA3ATGG ACACTTGAAGAGTACCGCGAGCAAAAAGGGGAGTAC GCTTATCCGCCATGCCAAGAGCTT1TCGGAAG GGOcAGACC-TATGGTGCAACGCCAGGACATI'G C0GCGGTCTACGACATACCGCGGTCGACCCATAT= GATGTATAAGAAATrfGACGTAA SEQ ID 10_23F2 ATGATrGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:24 TGAGATCAGGCACCGCATACTCCGCCGAATCACG TrGAGCATGTATGTATGAAACCATrGCTCvCGG3GC AcGTcAccTcGGTGGATATACCGGG0TGT AGCATCGCTTCCrTCATCAAGCCGAACACCCAGAC'f GAAGGCCAAAAACAGTATCAGCTGAAGGATGGA CACTCGAAGATACCGTGAGCAAAAGGAGTACG c-T~ccA~cAGG~crGAAAG GGCAGACCTCTT'ATGGTGCAACGCCAGGACATCGGA GCGGGTACTATAAAAACTCGGCTCAGCGAACAGG GAAGTCTACGACACACCGCCGGTCGGACCTCATATGj -119- WO 02/36782 PCTTJSOI/46227 _____________ATCGTATAAGAAA1TGACGTAA SEQ ID 10_2B8 ATGrATTGAAGTCAAACCAATAAACGCGGAAQATACGTA NO:25 TGAGATCAGGCACCGCA2TCTCCGiGCCGAATCAGCCGC TGGAAGCATGCAAGTATGAAACCGATTCTCGGGGGT ACGTITCACCTCGGTGGATATTACCGGGGCAAGCTGATC AGCATCGCCTCCT=rATCAAGCCGAACATC-CAGAGCTI GAAGGCCAAAA.ACAGTATCAGCTGAGAGGGATGGCGA CACTI7GAAGAGTACCGCGAGCAAAAAGCGGGCAGTACG CTrCATCCGCCATrGCCGAAGAGC1TCTrCGGAAAAAGG GGCAGACCTCTTATGGTGCAACGCCAGGACATCG~CJGA GCGTGGTACTATAAAAAGCTCGGCTTCAGCGAACAGOGC GAAGTCTACGACACACCGCCGGTCGGACCTCATATI1G ATGTATAAGAAATrGACGTAA SEQ ID 10_2C7 ATGAITGAAGTCAAACCAATAAACGCGGAAGATACGTA NO0:2 6 TGAGATCAGGCACCGCATI7CTCCOGCCGAATCAGCCGC TGGAAGCATGCAAGTATGAAACCGAFTrGCTCAGGGGT GCGITI7CACCTCGGCGGATATrACCGGGGCAAGGTGAT CAGCATCGCCTCCTITATCAAGCCGAACATCCAGAGCT TGAAGGCCAAAAACAGTATCAGCTGAGAGGOGATGGCG ACACTCGAAGGGTACCGTGAGCAAAAAGCGGGAAGCA CGCTCATCCGCCATGCCGAAGAGG'1TCITCGGAAAAAA GGCGCGGACCTI=ATGGTGCAACGCCAGGACATCTGC GAGCGGGTACTATAAAAAGCTCGGCTITCAGCGAACAGG GCGAAGTCTACGACACACCGCCGGTCGGACCTCATAT _____'TGATOTATAAGAAATrGACGTAA SEQ ID 10_3(35 ATGATTCIAAGTCAAACCAATAAACGCGGAAGATACGTA NO:27 TGAGATCAGGCACCGCA'ITCTCCGGCCGAATCAGCCGC TGGrAAGCATGCAAGTATGAAACCGAGCTCGGC~ ACcYTCACCTCGGTGGATATTACCGGGCAAGCTGGTC AGCATCGCTrCCTICATCAAGCCGAACATCCAGAGCTT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CACTrGAAGGGTACCGCGAGCAAAAAGCGGGAAGTAC GCrrATGCcCATGCCGAAGAGCrrCTTCGGAAAAAG GGGCAGACG'1PIATGGTGCAACGCCAGGACATCTGCG AGCGGGTACTATAAAAAGCTCGCTCAGCGAACAGG CGAAGTCTACGACATACCGCCGACCGGACCCCATAm GATGTATAAGAAATTGACGTAA SEQ ID 10_4117 ATGATrGAAGTCAAACCciATAAACGCGGAAGATACGTA NOM2 TGAGATCAGGCACCGCAT CCGGCCGAATCAGCCC * TTGAAGCATGTATGTATGAAACCGATIGCTCGGGCGGC ACGnTACCTCGGTGGATATACCGGGCAAGCrGTC AGCATCGCTCTrCATCAAGCCGAACATCCAGACT GAAGGCCAAAAACAGTATCAGGTGAGAGGGATGCGA CACITGAAGGGTACCGTGAGCAAAAAGOGGGCAGTACG CTTATCCGCCATGCCGAAGAGCrFCT7CGGAAAAAGGG GGCAGACCT1TFATi~GGTGCAACGCCAGGACATCTGCGA GCGGGTACTATAAAAAGCPCGGCTTCAGCGAACAGGTGC GAAGTCTACGACATACCGCCGACCGGACCCCATATTI _____________GATGTATAAGAAATTGACGTAA SEQ ID 110 011 ATGATrGAAGTCAAACCAATAAACGCGGAAGATACGTA - 120 - WO 02136782 PCTIUSOI/46227 NO:29 TGAGATCAGGCACCGATCTCCGCCGAATCACG TGGAAGcATGcAAGTATGAAAccATI~TCGCGC ACGCTTCACCTCGGTGATATACCGGGCAACTGT cAGCATCGC1fTCCT1TCATCAAGCCGAACATCCAGACT TGAAGGCCAAAAACAGTATCAGCTGAGAGGGATGCG ACGC'1GAAGGGTACCGTGAGCAAAAAGCGGGCAGTAC GCTTATCCGCCATGCCGAAGAG=CTTCGGAAAAAG GGGCAGACCTTIATGGTGCAACGCCAGGACATCTG AGCGGGTACTATAAAAAGCTCGGCITCAGCGAACAG CGAAGTCTACGACATACCGCCGGTCGGACCTCATA = GATGTATAAGAA.ATTGACGTAA SEQ ID 10_SC6 ATGMTrGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:30 TGAGATCAGGCACCGCATrCTCCGGCCGAATCAGCCGC TGGAAGGATGCAAGTATGiAAACCGATTGCTCGGGGT GCGTTTCACCTCGGTGGATATrACCGGGGCAAGCTGATC AGCATCGCCTCCflrCATCAAGCCGAACATCCAGAC'f GAAGGCcAAAAAcAGTATCAGCTGAGAGGGATGGCGA CACTCGAAGGATACCGCGAGCAAAAAGCGGCGAAGTAC GCTATCCGCATGCGAAGAGCTCTTCGAAAAAAG GCGCGGACCITPATGGTGCAACGCCAGGACATCTGCX AGCGGGTACTATAAAAAGCTGGCTCAGCGAACAG CGGGGTCTACGACATACCGCCGGTCGGACCTCATAfl= GATGTATAAGAAATGACGTAA SEQ DD 11C0 ATGA'TGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:3 1 TGAGATCAGCACCGCAYCTGCGCGAATCACG TrGAAGCATGAAGTATAACCGATTGCTCGGGT ACGTCACCTCGGTGGATATACCGCAGCTCATW AGCATCGCCTCCTCATCAAGCCGAACATCAGACT GAAGGCCAAAAACAGTATCAGCTGAGAATGGA CGTGAAGGGTACCGCGAGCAAAGXGAGTAC GCrrTATCCGCCATGCGAAGAGCTrCCGGAAA GGGCAGAccTF1ATGGTGCAACGCCAGGACACGTG AGCGGGTATATAAAAAGCTCGG~CAGCA cGGGTcTAcGATATACCCGATCGGACTCATA = ______GATGTATAAGAA[TFGACATAA SEQ ID 1 1G3 ATCTAITGAAGTCAAAccAATAAACGCGGAAGATACGTA NO:32 TGAGATcAGCACCGCATTCCGGCCGAATCACG TrGAAGCGTGTATGTATGAAACCGATGCTCGGG ACGTrrcAcGTcGGcGGATATAcCAOCAGCTGAT CAGCATCGTTCCTTCATCAAGCCGAACAAGAC TGAAGGCCAAAAACAGTATCAGCTGAGAGAG ACGTGAGTCGGGAA AGTAC GCrATCCGCCATGCCGAGAGGTrc1GGAAAGG GGGCAGACCT1''AT'GGTGCAACGCCAGGACATGG AGCGGCTACTATGAAAAGCTCGGTCAGCGAACAG cGGGGTcTACGATATACCGccGATCGGACCTCATAT= ______GATGTATAAGAAATTGOCATAA SEQ ID 1 113 ATGATTGAAGTCAAACCAATAAACGCGAGATACGTA NO:33 TGAGATCAGCACCGCATACTCCGCCGAATCAC-rC ______TGGAAGCATGCAAGTATGAAACCGAGCCACrGT -121 - WO 02136782 PCT/USO1/46227 GCGYI=CACCTCGGTGGATATTACCGGGGCAAG M GAC AGCATCGCCTCCTTCATCAAGCCGACACCCAGACT GAAGGCCAAAAACAGTATCAGCTGAGAGXATGGA CACTrGAAGGTACCGTGAGCAAACGGGAGTACG CTrATCCGCATGCGAAGCGCTTCCAAAAGG cGOGGAcATGGTG~cAcGccAGGAcAcr~GA GCGGGTACTATAAAAGCTCGGCTrAGCGAACACTGG GAAGTCTACGACATACCGCCAACTGGGCCCATA=TG ATGTATAAG AA AMTGACCGTA SEQ ID 12_1F9 ATGAIQA)AGTcAAAccAATAAACGO(3AAGATACGTA NO:34 TGAGATCAGCACCCATACTCCGCCGAATCAGCCGC TGGAAGCATGCAAGTATGACCGATTGCTCGGGG ACGTTCACCTCGTGGATATACCGCAGTGATC AGCATCGCCTCCTCATCAGCCGACACCAGA&T GAAGCTCCAAAAACAGTATCAGTGAGAGGATGGA cAcTGAAGGGTAccGGAGAAGAAAc GCCTCCAGCAGACTTCGAAG GGGCAGACCTJATGGTGCAACGCCAGACATCTG AGGGATTAAGTCGTCGGAAG CGAAGTCCACGACATACCGCGACCGACCCCATA'f T(-TA TGTATAAGAAATTGACGTAA SEQ ID 12_2(39 ATGATGAAGTCAAACCAATAAACGAGATACGTA NO:35 TGAGATCAGCACCGCATCTCCCCGATCACG TGGAGCATGCAAGTATGAACCGAGTCGGGT CAGCATCGCTCCYTCATCAAGCCGAACATCCAGACT TGAAGGCCAAAACAGTATCAGCTGAGAGATG ACAcrGAAGGATACCGTGAGCAAGCGGAGTAC GcT~ccA~cAG~~cicOAAG GG~cAGAccTCTATGGTGcAACCCAGACAWTGG AGCGGGTACTATAAAAAGCTCGCAGCAG cGAAGTCTAcGAcAcAccGCCGGTCGGACCTCATA=m GATGTAT A A ,A AATTGACGTAA SEQ ID 12_3F1 ATGATrGAAGTCAACCAATAAACCGAATACGTA NO: 36 TGAGATCAGGCACCGCATTCTCCCQAATCAGCCC TCGACTCATTAACG~MTGGG ACGTTCACCTCGGTGGATATTACCCAA~rAT'C AGCATCGGTCClrCATCACGAACATCCAGACT GAAGGccAAAAACATATCAGCTGAGAATGCA CACTCAAGGATACCGTGAGCAAACGGAGTAC GCCTCCAGCAAACrCGAAG GGGCAGACCTI ATGGTGcAACCAACATCTG AGGG~TTAAGTCG~CGGAAG CGGGGTCTACGACATACCGCCGGTCGACCCATAY= GATGTATAAGA A A TGACGTAA SEQ ID 12L5CI0 ATGATGAAGTCAAACCAATAACGGGAGATACGTA NO:37 TGAATCAGCACCGATICTGCCGTCACG TGAGAGAGAGACG~MTGGG ACGICACCTCGGTGGATAATCGCAGTGATC I IAGCATCGCTCCTTTCAACACGACACAGACT -122 - WO 02136782 PCTIUSOI/46227 GAAGGCCAAAAACACGTATCAGCTGAGAGiGGATGGCGA CAC=rAAGAGTACCGCGAGCAAAAAGCGGGAAGCAC GCTCATCCGCCATGCCGAAGAGCTCTCGGAAAAAG GGGCAGiACC1TIATGGTGCAACGCCAGGACATCTG AGCG7GGTACTATAAAAAGCTCGGCTCAGCGAACAGG CGiAAGTCTACGACGCACCGCCGACCGGACCTCATAT=r GATGTATAAGAAATTGACGTAA SEQ ID 12_6A10 ATGATrGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:38 TGAGATCAGGCACCGCATTTCCGGCCGAATCAGCCGC TGGAAGCATGCAAGTATGAAACCGATGCTCGGGG ACGTCACCTCGCGGATATACCGGGGCAAGCTGT CAGCATCGCCTCCTrTCATCAAGCCGAACATCCAGAQCT TGAAGGCCAAAAACAGTATCAGCTGAGAGGGATGG ACACT-rGAAGciATACCGTGAGCAAAAAGCGGGCAGTAC GCTTATCCGCCATGCCGAAGAGCTrCTTCGGAAAAAGG GGGCAGACCT1TATGGTGCAACGCCAGGACATCTGCG AGCGGGTACTATAAAAAGCTCGGCTCAGCGAACAAG CGGGGTCTACGACATACCGCCGGTCGGACCTCATAT= ______GATGTATAAGiAAATI'GACGTAA SEQ DD 12_6D 1 ATGAfTGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:39 TGAGATCAGGCACCGCA1TCTCCGGCCGAATCAGCCGC TGGAAGCATGTATGTATGAAACCGATITOGTCGGGCGGC ACGTITACCTCGGTGGATATTACCGGGGCAAGCTGAC AGCATCGTCCTrCATCAAGCCGAACATCCAGACfT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGCGA CAC1TGAAGAGTACCGCGAGCAAAAAGCGGGAACAC GCrCATCCGCCATGCCGAAGAGCTF"ICTTCGCTAAAAAGG GGGCACTACCI FilATGGTOCAACGCCAGGACATCTGCG AGCGGGTACTATAAAAACTCTCCGGCTCAGCGAACAGG CGGGGTCTACGACATACCGCCTGTCGGACCTCATA =~ GATGTATAAGAAAYPGACGTAA SEQ ID 12_69 ATGATTGAAGTCAAACCAATAAACGCGQAAGATACGTA NO:40 TGAGATCAGGCACCGCA1TCTCCGGCCGAATCAGCCC 'ITGAAGCATGTAAGTATGAAACCGATrrGCTCGGGGGT ACGT1TCACCTCGGTGGATATACCGGAGCTGAC AGCATCGCCTCCTrCATCAAGCCGACATCCAGAC~ GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGA CACTCGAAGGATACCGCGAGCAAAAAGCGGAACAC GC17CATCCGCCATGCCGAAGAGCTrCGGAAAAAGG GGGCAGACCTIIATGGTGCAACGCCAGGACATCTGCG AGCGGCTACTATAAAAAGCTCGGCTCAGCGAACAG CGAAGTCTACGACATACCGCCGACCGGACCCATA = ________GATGTATAAGAAATI'GACGTAA SEQ DD 12_6116 ATGAITGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:41 TGAGATCAGGCACCGCATACTCCG*CCGAATCAGCCC TGGAAGCATGCAAGTATGAAACCGATTGCTCGGGGC ACGTTCACCTCGGTGGATATTACCGGGGCACTGT AGCATCGCCTC=flCACCAAGCCGAACATCCAGACT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA - 123 - WO 02/36782 PCTUS01/46227 CTATCCGCCAT0CAAGGCTTC1CGGAAAAGG CGCCGGACC=rTATGGTGCAACGCCAGACATCTiCA GCGGTACTATAAAATCGGCTCAGGAACACTG GAAGTCTACGACATACCGCACCGGACCCATA = GATGTATAAGAAA1TGACATAA SEQ ID 12_7D6 ATGArrGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:42 TGAGATCAGCACCGCAfl7CTCCGGCGAATCACCC TGGAAGCATGCAAGTATGACTGAGCTCGGG ACGTTCACCTCGGTGATAACCGGGAGTGAC AGCATCG-TCCTCATCAAGCGAACACCAGACT GAAGGCCAAAAACAGTATCAGcTGAGAGGGATGGA CACTGAAGGGTACCGGGCAAAAAGCGGGCAGTACG CTATCCGCCATGCCGAAAGCTCTCGGAAAGG cGGcAGAcT1TATGGTGCAACGCCAGGACATCTCGA GCGGGTACTATAAAAAGCTCGGTCAGCAACAGG GGGGTCTACGACATAccGCCGACCGGACCCCATA = GATGTATAAGAAATTGACGTAA SEQ ID 12_7011 ATGAITGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:43 TGAGATcAGGcAccGcATrcTCCGGCCGAATCAGCCC TGGAAGcATGcAAGTATGAAAcCGA =CGGGGC ACGTnTCACCTCGGTGATTTACCGGGAGCTGAT'C A0.CATCG~CCTTCATCAAGCCGAACATCAGAC~ GAAGGCCAAAACATATCAGCTGAGAATGCA CACTTGAAGGATACCGCGAGCAAAAAGCGGAGTACG CTTATCCGCCATGCCAGAGCTCTTCGGAGG GGCAGACC=f1AT0GTGCAACGCCAGGACATCTCGA GCGGGTACTATAAAAACTCGGCAGGAACAGG GAAGTCTACGACACACCGCCGGTCGGACCTCATATG= ______ATGTATAAGAAAITGACGTAA SEQ ID 12F5 ATGATTGAAGTCAAACCAATAAACGCG(AGATACGTA NO:44 TGAGATCAGCACCGCArCTCCGCCGAATCACG TTGAACATGTATGTATGAAACC TCGGT ACGTTCACCTCGGTGGATTTACCAGGCAACGATC ACTCATCGTTCCTTCATAA0CCGAACATCAGAC~ GAGOGCCAAAAACAGTATCACTGAGAATGGA CAC'ITGAAGGGTAccGcGACAAAAACGCAGTACG cTTATCCGccATGGCAGAGA0~cCTCcAAGG G0CAGACC1TATGGTGCAATGCCAGGACATCGTGA GCGGGTACTATAAAAGCTCGGCTCAGCGAACAGC GGGATCTACGACATACCGCCGATCGGACCTCATAm'G _______ATGTATAAGAAATrGACGTAA SEQ ID 12G7 ATGATTOAAGTCAAACCAATAAACGCGGAAGATACGTA NO:45 TGAGATCAGCACCGCATCCGGCCGAATCACG 1TGAAGCATCAAGTATGAAACCGATGCTCGGGT ACGT-TCACcGGTGGATAT1TACCAGGGCAA~GATC AGCATCGCTCCTTCATAAAGCCGAACACAGACT GAAGGCCAAAAACAGTATCAGCTGAAGGATGCGA CGTGAAGGATACCGTGAGCAAAAAGrGAAGCAC ACTCATCCOccATGCCGAAGAGCTCCGGAAAA ________ GCGCAGACCTITAGTGAGCAGCTTT - 124 - WO 02/36782 PCT/US01/46227 AGCGGGTACTATAAAAAGCTCGGCTTCAGCGAACAGG CGAAGTCTACGACATACCGCCGATCGGACCTCATATT GATGTATAAGAAATTGACGTAA SEQ ID 1_2H6 ATGATTGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:46 TGAGATCAGGCACCGCATTCTCCGGCCGAATCAGCCGC TGGAAGCATGTATGTATGAAACCGATTTGCTCGGGGGT GCGTTTCACCTCGGTGGATATTACCGGGGCAAGCTGATC AGCATCGCCTCCTTCATCAAGCCGAACATTCAGAGCTT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CACTTGAAGGGTACCGCGAGCAAAAAGCGGGAAGTAC GCTTATCCGCCATOCCGAAGAGCTrCTTCGGAAAAAGG GGGCAGACCTETATGGTGCAACGCCAGGACATCTGCG AGCGGGTACTATAAAAAGCTCGGCTTCAGCGAACAAGG CGGGGTCTACGACATACCGCCGATCGGACCTCATAT IGATGTATAAGAAATTGACGTAA SEQ ID 13_12G12 ATGATTGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:47 TGAGATCAGCACCGCATrCTCCGGCCGAATCAGCCGC TTGAAGCATGTATGTATGAAACCGATTTGCTCGGGGGT ACGTTTCACCTCGGTGGATATTACCGGGGCAAGCTGATC AGCATCGCTTCCTTTAATCAAGCCGAACATCCAGAGCTT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CACTTGAAGAGTACCGCGAGCAAAAAGCGGGAAGTAC GCTTATCCGCCATGCCGAAGAGCTTTGGAAAAAAG GCGCGGACcTTATGGTGCAACGCCAGGACATCTGCG AGCGGGTACTATAAAAAGCTCGGCTTCAGCGAACAGGG CGAAGTCTACGACATACCGCCGGTCGGACCTCATATT GATGCATAAGAAATTGACGTAA SEQ ID 13_6D10 ATGATTGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:48 TGAGATCAGGCACCGCATTCTCCGGCCGAATCAGCCGC TTGAAGCATGTATGTATGAAACCGATTCGCTCGGAGGC ACGTTTCACCTCGGTGGATATTACCGGGGCAAGCTGATC AGCATCGCTCCTTAATCAAGCCGAACATCCAGAGCTT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CACTCGAAGGGTACCGTGAGCAAAAAGCGGGAAGCAC GCTCATCGCCATGCCGAAGAGCTTCTTCGGAAAAAGG GGGCAGACCTCTTATGGTGCAACGCCAGGACATCTGCG AGCGGGTACTATAAAAAGCTCGGCTTCAGCGAACAGGG CGAAGTCTACGACACACCGCCGGTCGGACCTCATATT GATGTATAAGAAATTGACGTAA SEQ ID 13_7A7 ATGATCGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:49 TGAGATCAGGCACCGCATTCTCCGGCCGAATCAGCCGC TTGAAGCATGTATGTATGAAACCGATTlGCTCAGGAGT GCGTCACCTCGGCGGATATTACCGGGCAAGCTGAT CAGCATCGCCTCCTTTCACCAAGCCGAACATCCAGAGCT TGAAGGCCAAAAACAGTATCAGCTGAGGGGGATGGCG ACACTTGAAGAGTACCGCGAGCAAAAAGCGGGAAGTA CGCTTATCCGCCATGCCGAAGAGCTTCCGGAAAAAG GGGGCAGACCTTTATGGTGCAACGCCAGGACATCTGC GAGCGGGTACTATAAAAAGCTCGGCTTCAGCGAACAGG GCGAAGTCTACGACACACCGCCGGTCGGACCTATATT - 125 - WO 02(36782 PCTJSO1/46227 TTGATGTATAAGAAATTGACGTAA SEQ ID 13_7B 12 ATGATrGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:50 TGAGATCAGCACCGCATCTCCGCCGAATCACG ACGTCACCTCGGTGGA TA1ATACGCAACrGATC AGCATCGGCTCCTCATCAAGCCAACATCCAACTI GAAGGCCAAkAAACAGTATCA0CTGAGAGGATGCGA CACTCGAAGGATACCOGC~AGCAAAAAGCGCAGTAC GC-TEATCCGCATCCGAAGAG-TCTCGAAAAG 0CGCGGACCTI]GTGGTGCAACGCCAGGACATGCG A~CrCGGTACTATAAAAA0cTCGGcTcA~cAAcAc CGAAGTCTACGACATACCGCCGACTGGGCCCATArm ______GATGTATAAGAAGTrGACGTAA SEQ ID 13_7C1 ATGA2[TGAACITCAAACCAATAAATGCGGAAGATACGTA NO:5 1 TGAGATcAG~cACCGCATACTCCGCCGATCACG TTGAAGCATGCAAGTATAAACCATCTCAC-GT GCGTITCACCTCGGTGTATATTACCGGGAGCTGAC AGGATcGccTccTTcATcAAGOCG~cATCcAG~cT GAAGGCAAAAACAGTATCAGCTAGAGGGAGGCGA cACTrT0AAGGATACCGTAGCAAAAGCGGTAGTACG CTTATCCGCCATGCCGAAGAG=CTrCGGAAAAA.IGG CGCOGACC1I"TlGTGGTGCAACGCAGGACATCGGA GAGGGTACTATAAAAAGCTCG TCAGAACGG GAAGTCTAcGACATAcCCCCGACTGGCCCCATA~ ______ATGTATAACGAAATICGACGTAA SEQ ED 13_806 ATGATTGAAGTCAAACCAATAAACGGGGAAGATACGTA NO:52 TGAGATCAGCACCGCAICTCCGCGATCAGCCC TGGAAGCATGCAAGTATGAAACCGATTCGCTCGGGG AcGTT~cAccTcGGGGATATTG~ AAcCrAT cAGcATcGcTTrAATCAAGCCGAACATCCAGACT TGAGGTCAAAAACAGTATCAGCTGAGAGGATGGA cAcTTGAAGGATACCGTGAGCAAAAAGGGGAGTACG CTTATCCGCCATGCCGAAGAGC1TCCGAAAAAG CGCGGACCr1TATcGGcAACGCCAGGACGTCTGCGA GCGGGTACTATAAAAAGTCGGCTCAGCAACCPG GGGTCTACGACATACCGCCGTCGGACCTCATAm=TG _____________ATGTATAAGAAA'TGACGTAA SEQ ED 13_9F6 AT GAT0'GAATCAAACCAATAAACGCGGAGATACGTA NO:53 TGAGATCAGCACCCACC0CCGATCACG TGGAACATGCAAGTATAAACCGATCTGGGG ACGTTICACCTAGGTGGATATACCGAGTAT CAGCATCGCCTCCTTCATCAAGCCGAACATCCAGAC T0AAGGCCAAAAACAGTATCAGCTGAGAAG AcAcTGAAAGTACCGGAGCAAAGAAGTA CGCTATCCGCATGCCGAA AGCTC=GAAAG GGGGCAGACCTITATGGTGCAACGCCAGGACATCTC GA0G0rGTAcTATAAAAAGCTCGGGTCAGCGAACAG GCGAAGTCTACGACATACCGCCGGTCGGACCTCATA= TGATGTATAAGAAATTGACGTA-AI SEQ ID 14 1C9 k TAIGATAACCAATAAACGCGCTAAGATACGTAI - 126 - WO 02/36782 PCT/US01/46227 NO:54 TGAGATCAGGCACCGCATACTCCGGCCGAATCAGCCGC TAGAAGCATGCAAGTATGAAACCGATTTGCTCAGGGGT GCGTTTCACCTCGTGGATATTACCGGGGCAAGCTGATC AGCATCGCTTcCTTTCATCAAGCTGAACATCCAGAGCTT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CACTTGAAGAGTACCGCGAGCAAAAAGCGGGAAGTAC GCTCATCCGCCATGCCGAAGAGCTTCTTCGGAAAAAGG GGGCAGACCTATGGTGCAACGCCAGGACGTCTGCG AGCGGGTACTATAAAAAGCTCGGCTCAGCGAACAGGG CGAAGTCTACGACACACCGCCGGTCGGACCTCATATTT GATGTATAAGAAGTTGACGTAA SEQ ID 14_10H3 ATGATTGAAGTCAAACCAATAAACGCGGA TACGTA NO:55 TGAGATCAGGCACCGCATTCTCCGGCCGAATCAGCCGC TGGAAGCATGCAAGTATGAAACCGATGCTCAGGGGT GCGTTCACCTCGGCGATATTACCGGGGCAAGCTGGT CAGCATCGCCTCCTCATCAAGCCGAACATCCAGAGCT TGAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCG ACACTTGAAGAGTACCGCGAGCAAAAAGCGGGAAGCA CGCTCATCCGCCATGCCGAAGAGCTTCTTGGAAAAAA GGCGCAGACCTTATGGTGCAACGCCAGGACATCTGC GAGCGGGTACTATAAAAAGCTCGGCTTCAGCGAACAGG GCGAAGTCTACGACACACCGCCGGTCGGACCTCATATT , TrGATGTATAAGAAGTTGACGTAA SEQ ID 14_101H9 ATGATTGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:56 TGAGATCAGGCACCGCATACTCCGGCCGAATCAGCCGC TGGAAGCATGCAAGTATGAAACCGATTTGCTCAGGGGT GCGTTTCACCTCGGTGGATATTACCGGGGCAAGCTGGTC AGCATCGCCTCCTTTCATCAAGCCGAACATCCAGAGCTT GAAGGCCAAAACAGTATCAGCTGAGGGATGGCGA CACTTGAAGGATACCGTGAGAAAAAGCGGGCAGTACG CTTATCCGCCATGCCGAAGAGCTTCTAAAAAAGG CGCGGACCTTTTGTGGTGCAACGCCAGGACATCTGCGA GCGGGTACTATAAAAAGCTCOGCTTCAGCGAACAGGGC GAAGTCTACGACACCCGCGGACCTCATATTTTG ATGTATAAGAAATTGACATAA SEQ ID 14_11C2 ATGATTGAAGTCAAACCAATAAACCGGAAGATACGTA NO:57 TGAGATCAGGCACCGCATTCTCCGGCCGAATCAGCCGC TGGAAGCATGCAAGTATGAAACCGATTTGCTCGGGAGC ACGTTTCACCTCGGCGGATATTACCGGGGCAAGCTGGT CAGCATCGCTTCCTTTCATCAAGCCGAACATCCAGAGCT TGAAGGCCAAAAACAGTATCAGCTGAGACGATGGCG ACACTGAAGAGTACCGCGAGCAAAAAGCGGGCAGTAC GCTTATCCGCCATGCCGAAGCGCTTCTTCGGAAAAAGG GGGCAGACCTATGGTGCAACGCCAGGACATCTGCG AGCGGGTACTATAAAAAGCTCGGCTTCAGCGAACAGGG CGAAGTCTACGACACACCGCCGACCGGACCCCATATTT TGATGTATAAGAAATTGACGTAA SEQ ID 14_12D8 ATGATTGAAGTCAAACCAATAAACCGGATACGTA NO:58 TGAGATCAGGCACCGCATTCTCCGGCCGAATCAGCCGC T__ GAAGCATGTAAGTATGAAAC CGAT TlTCTCGGT - 127 - WO 02136782 PCTIUSOI/46227 ACGTTCACCTCGGC-GGATATTACCGQGGCAAGCTGGT CAGCATCGCCTCCTTTATCAAGCCGAACATCCAGAGT TGAAGGC-CAAAAACAGTATCAGCTGAGAGGGATGCG ACACTTGAAGGATACCGTGAGCAAAAAGCTGGCAGTAC CYCTTATCCGCCATCGOCGAACGOGC i"C1.CGGAAAAAkG GCCGGiACC1TIGTCGGTCAACGCCACGGACATCTG ACGGTCCTACTATAAAAACTCTCGGGTCAGGCGAACAAT CGGTGGTCTACCGACATACCGCCTGTCGACCTATAT= GATGTATAAGAAITrGACGTAA SBQ HD 14_12H-6 ATG3ATIfGAAGTCAAACCAATAAACGCGGAAGATAXGTA NO:59 TGAGATCAGGCACCGCAITCTCCGGCCGAATCAGCCC TGGAAGCATGCAAGTATCAAACCGATTGCTCGGT GCGTITACCTCGGTGGATATTACCGGGGCAAGCTGATC AGCATCGCCTCCTITCATCAAGCCGAACATCCAACTT GAAGGCCAAAAACAGTATCAGCTGAGAGC-GATGCGA CAC-fTGAAGAGTACCGCGAGCAAAA-AGCGGGCAGTACG CY[ATCCGCCATGCGAAGAGCT CTCGGAAG CGCGGACCTTTGTGGTGCAACGCCAGGACGTCTCGA GCGGGTACTATAAAAAGCTCGGCTCAGCGAACAGG GAAGTCTACGACATACCGCCGACTGGGCCCCATATIT ________ATGTATAAGAAATrGACGTAA SEQ ED) 14_2B6 ATGA2FPGAACTCAAACCAATAAATGCGGiAAGATACGTA NO:60 TGAGATCAGCACCGCACTCCGCGATCACG TGGAAGCATGCAAGTATGAAACCGATTGTCGGT ACGfTrcAccTCGGTGGATATCGGcAGTGATC AGCATCGCTCCflAATCAAGCCGAACATCCAGA= GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CACTCGAAGGATACCGTGAGCAAAAAGGAGTACG cTATCCGCCATGCCGAAGAGC 1TC1TCAAAAG CGCGGACCT1TATGGTGCAACGCCAGGACGTCTGCGA GCGGGTACTATAAAAAGCTCGGCTCAGCAACGG GGGGTCTACGACATACCGCCGGTCGGACCTCATA =mG ATGTATAAGTAAA'TGCA CGTAA SEQ ID 14_2G1 1 ATGAT27GAAGTCAAAccAATAAATGCGGAAGATACGTA NO:61 TGAGATCAGCACCGCATCTCCGGCCGATCACG TGGAGCATGCAAGTATGAACCGAGCTAGGT GC = CACCTCGGTGGATATrACCGGGGCAAGCTGT AGCATCGCCTCCYICATCAAGCCGAACATCCAGAC2f GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGjGCGA CACTCGAAGGGTACCGTGAGCAAAAAGGAGTACG crrATCCGCCATGCCGAAGAGcTTCTrcGGAAAAGG CGCGGACC 1'TG TGGTGCAACGCCAGGACATCTCGA GTGGGTACTATAAAAAGCTCGGCTTCAGCGAACAGG GAAGTCTACGACATACCGCCGACTGGGCCCCATAmTG ________ATGTATAAGAAATTGACGTAA SEQ ID 14_3B2 ATGAfl7GAAGTCALAACCAATAAACGCG3AAGATACGTA NO:62
TGAGATCAGCACCGCATCTCAGCCGATCA
3 CCC TGGAAGCATGiCAAGTATGAAACCGATLGCTCAGGGGT GCGTCACCTCGGTGGATATTACCGGGCAAGCTGT _____ ACATGCCCCTCTCACCGACACIAGA~r - 128 - WO 02136792 PCTTJSOI/46227 GAAGGCCAAAAACAGTATCACGCTGAGAGG3GATGGCGA CAC'fTGAAGGATACCcITGAGCAAAAAGCGGGAAGCAC GCfTATCCGccATGCCGAAkGcGcTTTrcGAAAA-AAG GCGCGGACCTFIATGGTGCAACGCCAGGxACATCTGCG AGCGGGTACTATAAAAAGCTCGGCTCAGCGAACAG CciiGGTcTrACGACATACCGCCGGCCGGACCTCATA=m ______GATLGTATAAGAAAITGACGTAA SEQ ID 14_4H8 ATGATrGAAC-TCAAACCAATAAACGCc3GAAGATACGTA NO:63 TGAGATCAGGCACCGCATCTCCGGCCGAATCAGCCC TGGAAGCATGCAAGTATGAAACCGATITGCTCGGGAGC ACGT1TACCT'CGGCGGATA'TACCGGGGCAAGGTGAT CAGCATCGCCTCCTITCATCAAGCCGAACATCCAGAGCT TGAAGGCCAAAAACAGTATCAGCTGAGAGGGATGG ACACTCGAAGGGTACCGTGAGCAAAAAGCGGGAAGCA CGCTCATCCGCCATGCCGAAGAGCT'T'TCGGAAAAAA GGCGCGGACCTITGTGGTGC.AACGCCAGGACGTCTGC GAGCGGCTACTATAAAAAGCTCGGCTCAGCGAACAG GCGAAGTCTACGACACACCGCCGGTCGGACCTCATATT TI7GATGTATAAGAAATTGACGTAA SEQ ID 14_6A8 ATGA~rGAAGTCAAACCAATAAACGG:GGAAGATACGTA NO:64 TGAGATCAGGCACCGCACTCCGGCCG)AGCCC TTGAAGCATGTATGTATGAACCGATGCTCGGGT ACGY1TCACCTCGGTGiGATA'fTACCGGGGCAAGCTAGC AGCATCGCTTCGTIAATCAAGCCGAACACCAGACT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CAG=GAAGGYATACCGTGAGCAAAAAGCGGGCAGTACG CTTATCCGCCATGCCGAAGAGCTTITCGGAAAAGG CGCGGACC1TiTiiGTGGTGCAACGCCAGGACATCTCGA GCGGGTACTATAAAAAGCTCGGCT7CAGCGAACAGGGC GAAGTCTACGACACACCGCCGGTCGGACCTCATGT =r ATGTATAAGAATrGACGTAA SEQ ID 14_6B10 ATGA'1TGAAGTCAAACCAATAAACQCGGAAGATACGTA NO:65 TGAGATCAGGCACCGCA1TCTCCGGCCGAATCAGCCC TGGAAGCATGCAAGTATGAAACCGATTTGCTCGGGGT ACGTITCACc'TGGTGGATATTAccGGGGcAAcTGATc AGCATCGC'ITCCfTCATCAAGCCGAACATCCAGACT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGCGA CACTCGAAGGATACCGTGAGCAAAAAGCGGGCAGTACG crrATCCGCCATGCCGAAGAGCTTicGGAAAAAA CGCGGACC=1]ATGGTGCAACGCCAGGACATCTWGA GCGGGTACTATAAAAAGCTCGCTCAGCGAACAAG CGGGGTCTACGACATGCCGCCGGTCGGACTCATAmTG ________ATGTATAAGAAiTGACGTAA SEQ ID 14_6D4 A TG ATI'GAAGTCAAACCAATAAACGCGGAAQATACGTA NO:66 TGAGATrCAGGACCGCATCCGACCGAATCACG TGGAAGCATGCAAGTATGAAACCGATITGCTCGGAGGC ACGTTfC-ACCTCGGTGGATATfTACCGGGGCAAGCTGAWC AGCATCGCTTCCTIAATCAAGCCGAACATCCAGAGCT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA ______CACITGAAGGATACCGTGAGCAAAAAgCgggCAGTACG - 129 - WO 02/36782 PCTIUJSOI/46227 - =rATccGccATGcGAAGcTGcTTC1GGAAGG GGCAGACCTCTATGGTGCAACGCAGGACATCTCGA GCCGGTACTATAAAAAGCTCC7CAGGAACACTG GAAGTCTACGACACACCGCCGCTTCGGACCTCATAmTG ATGTATAAGAAATTGACGTAA SEQ ID 14_7A1 1 ATGATrGAAGITCAAACCAATAA-ACGCGGAGGxATACGTA NO:67 TGAGATCAGCACCGCACTCCGCCGATCACG TGGCAAGCATGCAAGTATGAAACCGATTGCTCAGGT GCGTTCACCTCGGTGGATATTAccGGGGOACAAGT AGCATCGCCTCCTTTCATCAAGCCGAAcATCCAGAGCT GAAGGCCTAAAACAGTATAGcTGAGAGAGGGAC ACTCAAGGTACGTAGCAAAAGCGGAGTACG CTCATCCGCATGCCGAAGACTCITGGAAAAGG GCTCAGACCGTJATGGTGCAACGCCAGGACGTCGGA GCGGGTACTATAAAAGCTCGGTCATCGAACAGG GAAGTCTACGACACACCGCCGACCGACTCATAT= ____GAT-GTATAAG A A A -T-ACCI-TA SEQ ID 14_7A1 TGAFPGAAGTCAAACCAATAAACGCGGAGATACGTA NO:68 TGAGATCAGGCACCGCATTCTCCGCCGAATCACG TGGAAGCATGCAAGTATGAAACCGAGCAGGT GCGfITCACCTCGGTGGATATACCCAGG~TIC AGCATGCCTCCTCATCAGCCGACATCCAGACT GAAGGCCTAAAACAGTATCAGCTGAGAGGGATGGCGAC ACTCGAAGGrGTACCGTGAGCAAAAAGCGAGTACG CTCATCCGCCATGCCGAGAA'=GGCTAA GGCAGACCTTATGGTGOAAcGccAGGACGTTGGA GCGGGTACTATAAAAGCTCGGcTTcAGAACAGG GAAGTCTACGACACACCGCCGACCGGACCTCATATI= ______GATGTATAAGAA.ATG.ACGTAA SEQ IID 14_7A9 ATGATTGAAGTCAAACCAATAAACGCGGAGATACGT NO:69 TGAGATCAGCACCGCATTCTCCGGCCGAATCAGCCC TGGAAGCATGCAAGTATGAAACCGATTGGGT AcGrTCACCTCGGCGGATATGAACGTGG AGCATCGCCTCCTTCATrAACTCCAAAcATcCAGACT GAACGGCCAAAACAGTATCACTGAGAATGGA CAGTCGAAGGGTACCGTGAGCAAAAACGGTAGTACG rATcccGccATGccGAAGAG~cTTcGGCAAAAA CGCGGACCrIT]?1"ATGGTGAACGCCAGGACGTTGGA GCGGGTACTATAAAGcTcGGcTcAGcGAACAGG GAAGTCTACGACACACCGCCGGTCGGACCTCATA~r _____________ATGTATAAGAAA'ITGACGTAA SEQ IID 14_7(11 ATGATrTiAAGTCAAACCAATAAACGCAGAAGATACGTA NO:70 TGAGATcAG~cAcc~CCGCCCCGAAGCCGC TGGAAGCATGCAAGTATGAACCGATITGCTCAGGGGT GCGTITCACCTCGGTGGATATACCGGGGCAATA AGCATCGCTCCTEAATCAAGCCGAACATCCAGATr GAAGGCCAAAAACAGTATAGTTGAGAGATGGA CACTTGAAGAGTACCGTGAGCAAAAAGCGGAAGTACG CTTATCCGCCATGCGAAGCGTCCGAAAGC GCAGACCC=ATGGTCCAACGCCAGGACATCTGCGA -130- WO 02/36782 PCT/USO1/46227 GCGGGTACTATAAAAAGCTCGGCTTCAGCGAACAGGGC GAAGTCTACGACACACCGCCGGTCGGACCTCATATTG A_______ ____AA ATGTATAAGAAATT'GACGTAA SEQ ID 14_7H9 ATGATTGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:71 TGAGATCAGGCACCGCATTCTCCGGCCGAATCAGCCGC TGGAAGCATGCAAGTATGAAACCGATTTGCTCGGGGGT ACGTTTCACCTCGGCGGATATTACCGGGGCAAGGGT CAGCATCGCTTCCTTTCATCAAGCCGAACATCCAGAGCT TGAAGGCCAAAAACAGTATCAGCTGAGAGGGAT GG ACACTTGAAGGATACCGTGAGCAAAGCGGGAAGCA CGCTCcACCGCCATGCCGAAGAG CTTCGGAAAAAA GGCGCGGACCTTGTGGTGCAACGCCAGGACATCTGC GAGCGGGTACTATAAAAAGCTCGGCTCAGCGAACAGG GCGAAGTCTACGACATACCGCCGGTCGGACCTCATATTT TGATGTATAAGAAATTGACGTAA SEQ ID 14_8F7 ATGATrGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:72 TGAGATCAGGCACCGCATTCTCCGGCCGAATCAGCCGC TTGAAGCATGCAAGTATGAAACCGATTGCTCGGGGGT ACGTTTCACCTCGGCGGATATTACCGGGCAAGCTGGT CAGCATCGCCCCTCATCAAGCCGAACATCCAGAGCT TGAAGGCCAAAAACAGTATCAGCTGAGGGATGGCG ACACTTGAAGAGTACCGCGAGCAAAAAGCGGGCAGTAC GCTTATCCGCCATGCTGAAGCGCTTCTTCGGAAAAAAG GCGCGGACCTTGTGGTGCAACGCCAGGACATCTGCA AGCGGGTACTATAAAAAGCTCGGCTTCAGCGAACAGGG CGAAGTCTACGACATACCGCCGACTGGGCCCCATATT I GATGTATAAGAAA TTGACGTAA SEQ ID 15_10C2 ATGATTGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:73 TGAGATCAGGCACCGCATTCTCCGGCCGAATCAGCCGC TGGAAGCATGCAAGTATGAAACCGATTTGCTCAGGGGT GCGTTTCACCTCGGTGGATATTACCGGGCAAGCTGGTC AGCATCGCCTCCTTTCATCAAGCCGAACATCCAGAGCTT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CACTTGAAGGATACCGTGAGCAAAAAGCGGCAAGTACG CTCATCCGCCATGCCGAAGAG TCTTCGGAAAAAGGG GGCAGACCTCTTATGGTGCAACGCCAGGACAACTGCGA GCGGGTACTATAAAAAGCTCGGCTTCAGCGAACAGGGT GAAGTCTTCGACATACCGCCGACCGGACCCCATATTG ATGTATAAGAAATTGACGTAA SEQ ID 15_10D6 ATGATTGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:74 TGAGATCAGGCACCGCATTCTCCGGCCGAATCAGCCGC TTGAAGCATGTATGTATGAAACCGATTTGCTCGGGGGC ACGTTCACCTAGGTGGATATTACCGGGGCAAGCTGGT CAGCATCGCCTCCTTCATCAAGCCGAACATCCAGAGCT TGAAGGCCAAAAACAGTATCAGCTGAGAGGATGGCG ACACTTGAAGAGTACCGCGAGCAAAAAGCGGGAAGCA CGCTCATCCGCCATGCCGAAGAGCTTCTTCGGAAAAAG GGGGCAGACCTCTTATGGTGCAACGCCAGGACATCTGC GAGCGGGTACTATAAAAGCTCGGCTTCAGCGAACAGG 1GCGAAGTCTACGACATA - 131 - WO 02136782 PCT/US01/46227 TGATGTATAAGAAATTGACGTAA SEQ ID 15_11F9 ATGATTGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:75 TGAGATCAGGCACCGCATTCTCCGGCCGAATCAGCCGC TGGAAGCATGCAAGTATGAAACCGATTGCTCAGGGGT GCGTTTCACCTTGGTGGATATTACCGGGGCAAGCTGGTC AGCATCGCCTCCTTAATCAAGCCGAACATCCAGAGCTT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CACTCGAAGGATACCGTGAGCAAAAAGCGGGCAGTACG CTTATCCGCCATGCCGAAGAGCTTCTTCGGAGAAAGGG GGCAGACCTTATGGTGCAACGCCAGGACATCTGCGA GCGGCTACTATAAAAAGCTCGGCTTCAGCGAACAGGGC GAAGTCTACGACATACCGCCGACCGGACCCCATA TT GATGTATAAGAAATTGACGTAA SEQ ID 15_11H3 ATGATTGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:76 TGAGATCAGGCACCGCATACTCCGGCCGAATCAGCCGC TGGAAGCATGCAAGTATGAAACCGATTTGCTCAGGGGT GCGTTTCACCTCGGTGGATATTACCGGGGCAAGCTGATC AGCATCGCCTCCTTTCATCAAGCCGAACACCCAGAGCTT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CACTTGAAGGGTACCGTGAGCAAAAAGCGGGCAGTACG CTTATCCGCCATGCCGAAGCGCTCTTCGGAAAAAAGG CGCGGACCTTTATGGTGCAACGCCAGGACATCTGCGA GCGGGTACTATAAAAAGCTCGGCTTCAGCGAACAGGGC GAAGTCTACGACATACCGCCAACTGGGCCCCATATITG ATGTATAAGAAATTGACGTAA SEQ ID 15_12A8 ATGATrGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:77 TGAGATCAGGCACCGCATTCTCCGCCGAATCAGCCGC TGGAAGCATGCAAGTATGAAACCGATTTGCTCGGGGGT ACGTITCACcTCGGTGGATATTACCGGGGCAAGCTGATC AGCATCGCCTCCTTTCATCAAGCCGAACATCCAGAGCTT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CACTTGAAGGATACCGTGAGCAAAAAGCGGGCAGTACG CTTATCCGCCATGCCGAAGCGCTCTCGGAAAAAGGG GGCAGACCTCTATGGTGCAACGCCAGGACATCTGCGA GCGGGTACTATAAAAAGCTCGGCTTCAGCGAACAGGGC GAAGTCTACGACATACCGCCGACCGGACCCCATATTT GATGTATAAGAAATTGACGTAA SEQ ID 15_12D6 ATGATTGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:78 TGAGATCAGGCACCGCATTCTCCGCCGAATCAGCCGC TGGAAGCATGTATGTATGAAACCGATTTGCTCAGGGGT GCGTTTCACCTCGGCGGATATTACCGGGGCAAGCTGGT CAGCATCGCCTCCTTCATCAAGCCGAACATCCAGAGCT TGAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCG ACACTTGAAGGGTACCGTGAGCAAAAAGCGGGCAGTAC GCTTATCCGCCATGCCGAAGAGCTTCTrCGGAAAAAAG GCGCGGACC'TTATGGTGCAACGCCAGGACATCTGCG AGCGGGTACTATAAAAAGCTCGGCTTCAGCGAACAGGG CGAAGTCTACGACACACCGCCGGTCGGACCTCATATTTT GATGTATAAGAAGTTGACGTAA SE ID 15_12D8 ATGATTGAAGTCAAACCAATAAACGCGGAAGATACGTA -132- WO 02136782 PCTIUSOI/46Z27 NO:79 TGAGATCAGCACCGCATACTCCCCGATCACG TGGAAGCATGCAAGTATGAAACCGACCGGGT ACGITCACTCGGCGATAUGGAAGCTGGT CAGCATCGCCTCCCATCAArCGCATCCAAC TGAAGCAAAAACAGTATCT)GAGGGATGG ACATTGAAGATACCGTGAGAGGGTAGTAC Gc=ATCCGCCATGCCGAAGAGCCAAAAG GCGCGGACCT IAGGTGCAACGCAACGTGO AGCGGTACTATAAAAGCTCGrCAG CAG CAAAGTCTACGACATACCGCCTCGACCATA=m GATGTATAAGAAAITGACGTAA SEQ -ID 15_12D)9 ATGATGAAGTCAACCAATAAACCGAGATACGTA NO: 8 0 TGAGATcAGGcc~ArcccGAcTcGAcGAcc~ TGGAAGCATCAAGTATGAAACCAq-GCCAGGT ACGTITCACCTCGGCGGATATTACCGAAGT CAOCATCGCTCCTTATCAACCGACATCCAGAC TGAAGGCCAAAACAGTAAGAGAATG ACACTCAAGAGTACCGCGAACGAC CGCTCATCCGCCATGCCGAAGATCAAAG GGGCAGACCTGTATGGTGCAACGCCAGACATCTC GAGCGGGTACTATAAAAAGCT CTCAGCAG GCGAAGTCTACGACATACCGCTCGACCTCATA~ TGATGTATAAGAATAAA SEQ D 15_3F10 ATGATrGAAGTCAAACcAATAAACCXGAGATACGTA NO:8 1 TGGTAGAC-CTTTCGCATACG TGAGAGAGAGAACAIGTAGG GcGTrcAccTTGGGATATTAGGcA~GATC AGcATCGTCCTcATcAAG~cGATCCAGAci GAAGGCCAAAAACAGTATCACTGAGAGATGGA CAITGAAOGTACCGTGAAGOGAGACG cTTATcGccATcATGAAGAGrC~AAAAG CCGGACC=ATGGTGCAACGCCAGACATCTGCGA GCGGGTACTATAAAAAGCTCGGCTTCAGCAGG GAAGTCTACGACACACCGCCGCACCATAT= GATGTATACGATCTA SEQ ID 15_3(31 ATGATGAAGTAAACCAATAACGCGAGATACGTA NO:82 TGAGATCAGCACCGCATACTCCCGAATCAGCCGC fTACrAGAGAGAACArGTGGG AcG TcAccTCGGOGGATATTAc GoGAAGCTGGT CAGCATCGCCTCCT-fCATGCGACATCCAGAC TGiAAGGCCAAAAACAGTATCAGAGAAG ACACTGAAGAGTACCCGAAACGGAGTAC GCTTATCCGCCATCCGAAGAG CAAG GCGCGACCTTGTGGTGCAACGCCAACG~rG AGGGATTAAGTCG=CGGAAG CGAAGTCTACGACATACCCGCCTCACCATA=m GATGTATAAG A AATGACGTAA SEQ ID 15_4F1 1 ATGATGAAGTCAAACCAATACGGAGATACGTA NO:83 TAAGTATCAGCACCCATACTCCCGATCACG iGAAGCATGTATTATGAAACcGATGCTcGGC - 133 - WO 02/36782 PCTIUSO1/46227 ACM~cc~AATACGGCACGT AGCATCGCTcCAATcAAGccGAAcACAGACT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGCGA CACTTGAAGGGTACCGTGAGCAAAAAGCGGGCAGTACG CTTATCCGCCATGCCGAAGCGCTCTGAAG CGCGGACCI'1m'ATGGTGCAACGCCAGGACACGA GCGGGTAcTATAAAAAGCTCGGTcAGCGAACAGG GAAGTCTACGACATACCGCCGACCGGACCCCATA=m ________GATGTATAAGAAA'T~IACGTAA SEQ ID 15_4113 ATGA1TGAAGTCAAACCAATAAACGTCGGAAGATACGTA NO:84 TGAGATCAGCACCGCAcTCCGCCGATACG TTGAAGCATGCAATATGAAACG GCTCGGT ACGTITCAccTCG7GCGGATATTACCGGGGCAATG CAGCATCGcTTCCTCATCAAGCCGAACACCAGAC TGAAGGCCAAAACAGTATCAGCTGAGAGAG ACAC = AAGAGTACCGCGAGCAAAAGGAGTA C=TCCAGCAGG CrGAAA GGCGCGGACCI-1lT-ATIGGTGCAACGCCAGGACATCTC GAGCGGGTACTATAAAAAGCTCGGCTTCAGACAG GCGAAGTCTACGAcATACCGCCGACTGGGCCCCATA~r 'TGATGTATAAGAAATTGACGTAA SEQ ID 15_61)3 ATGATGAAGTCAAACCAATAAACCGGAAGATACGTA NO: 85 TGAGATCAGCACCGCATACTCCGCCGAATCACG TGGAGCATGCAGTATGAAACCGAGTCGGGT ACG =CACCTCGGTGGATATACCCAGTATC AQCATCGCTCCTCATCAAGCCGAACACCCAGAC~ GAAGGCCAAAAACAGTATcAGCTGAGAGGAT'GCGA CACTGAGAGTACCGCGAGCAAAAGGAAGAC G-CTCATCCGCCATGCCGAAGAGTGGTCACAG GGGCAGACCTCTrATGTGCAACGCAGGACATTG AGCGGGTACTATAAAAAGCTCGGCTTCAGGACAG CGAAGTCTACGACATACCGCCGACCGGACCCCATA = _____GATGTATAAGAAA'ITGACGTAA T SEQ ID 15_6G1 1 ATGAYTGAAGTCAACCAATAACG~GAATACGT NO: 86 TGAGATCAGCACCCATTCTCCGCGATCAGCCC TGGAAGCATGCAGTATGAAAccGAGTCAGGT GCGTCACCTCGGTGGATATTACCACTGT AGCATCCCCnTCATCAAGCCGAACACCAGAC~ GAAGGTCCAAAAcAGTATcAGcTGAGAGAGGGA CACTTGAAGAGTACCGCGAGCAAAGGAGAC GCTCATCCGCCATGCCGAGAGC~CACAAAG GGGCAGACCTlI,,ATGGTGCAACGCCAGACATCG AGGGATTAAGTCGTCGGAAG CAAAGTCTACGACATACCGCCGGTCGGACCTECATA=m GrATGTAT A A CA AG(TGACGTAA SEQ ID 15_9F6 ATGATGAAGTCAACCAATAACCGGAGATACGTA NO:87 TGAGATAGGCACCGCAccGGccGTCACG TGGAAGCATGCAAGTATGAAACCGAGCTCGGT ACGTCACCTCGGCGGATAcicGGGAAGCGAT CACCATCCCCTATCAAGCCGAACATCCAGACT - 134 - WO 02/36782 PCTUSOI/46227 TGAAGGCCAAAACAGTATCAGCTAGAATG ACACTGAAGAGTACCGCAAAACGGAGTA CGTACGCTCGAAC' CGGA. iGCC GjAGGGGTACTATAAAAACTCGC=TAGCAGG GccGAAGTCTACGACATACCGCTTCGACCTCATAT TGATGTATAArUA ATACGTAA SEQ ID 15F5 ATGATcG~arcAAACCAATAAACCYJAGATACGTA NO: 88 TGAGATCAGCACCGCATCCGCCGTCACG TGGAAGCATGAAGTATGAAACCGATCGGT ACGTCACCTCGGTGGGTACTACCAA~GAT cAGATCG~ccT~cATAACCGiAACArcAGAc TGAGGGAGAACAGTATCAGCTGAGAGGATGC AcGc~rGAAGGATAccGTGAGAAAG~AGTAc GcTTATccGcTATGccGAAGAGTT~cGAACGAG GCGCGGACCIT ATrGGTGCAACGCCAGGACATTG AGCGGGTACTATAAAAAGGTcCAGACAG CGAAGTCTACGACATACCGCGATCACATAT= ______ ATGTATAAGAATGACGTAA SEQ DD 16A1 ATGATrGAAGTCAAACCTATAAACCCGAGATACGTA NO: 89 TGAGATAGGCACCGCACTCCGCGATCACCT TrGAAGCATGTATGTATGAAACCGATTCTCGT ACGTCACCTCGGTGGATATACCACAGCGAT CACTGTCTTAAACCTAATAGC TGAGGGCGAGAACAGTATCACTGAGAGGATG ACGCTCGAGGTACGGAACOGG AGTA CG'TTCCAGCAGC(=rGAAA GGCGCGCGAC=nATGGTGCAATGCCAGGACATT GCGAAGTCTACGACATACCGCGATCGACTCATA'I _____TGATGTATAArTAAATTGACGTAA SEQ ID 16H3 ATGA1rGACGTCAACCTATAACCOGAATACGTA NO:90 TGAGATCAGCACCGCACTCCCGATCACG TGGAAGCATGCAAGTATGAAACCGATGGC~ ACGTCACCTCGGcGGATAITACCACACGAT cAGcATCGccTccTTATcAA~ccGCATCAGAGCT TGAAC.GCCAAAAACAGTATCATGAGAGATGGCG AcAcYrGAAGGGTACCGCGAGC AcGGGTA CGcTcATCCGOcATcCGAAGAGTCGAAAG GGGCTCAGACCTI IlATIGGTGCAATGCCAGACATCTGT GACGGTACTATGAAAGCTcCAGCAGG GCGAAGTCTACGACATACCGCCGATCGGACCCATAI TGATGTATAAGA A A1TGACGTAA SEQ ID 17012 ATGArGAAGTCAAACCAATAAGCCGGAGATACGTA NO91 TGAGATcAGGcAccGcACTCCGGCCGATCACG rNO: 9 1TGGAAGCATGTATGTATGAATCTCGGAGGT GCG-TCACCTCGGTGGATATTACCGCAGTGATC AGCATCGCCTCCTCATCAAGCCAACACAGACT GAGGCCAAAAACAGTATCAGCTGAGAGGGAOWGA t-A0nfCA A trA GOGTACC-AGCGAAA GTAC - 135 - WO 02/36782 PCT/USOI/46227 GCTTATCCGCCATCCGAAGAGCTTCT WGAAAAG GCGCGGACCTIATGGTGCAACGCCAGACAT~rGTG AGcGGTACTATGAAAGCTCGTCAGIACAGG CGAAGTCTACGACATACCGCCGATCACTCATAT= GATrGTATAAGAAMTrGACGTAA SEQ ID 18D6 ATGrATTGAAGTCAAACCAATAAACCGGAGATACGTA NO:92 TGAGATCAGCACCGCACTCCCGTCACG TGGAAGATGCAAGTATGAAACCGA TCGGG ACGTTCACCTCGGTGGATAGGCACGATC AGACCW-TAAAGCACTAACf GAAGGCCAAAACAGTATCAGCTGAGAATCCA CGCITGAAGGATACCGTGAGCAAACGAGTACG CTATCCGCCATGCCGAAGAGCCG cAAAG GGCAGACCT=ATGGTGACGCCAGACATCTGCGA GCGGCTACTATGAAAAGCTCGGC1TACAAGCAGG GAAGTCTACGACATACCGCCGATCGGACCWCATA =mG ATG3TATAAGAA AKTTGCATAA SEQ ID 19C6 ATATAGCACATACCGkGTCT NO:93 TGGTAGACCTCTCGCATACG TGGAAGCATGCAAGTATGAAACCGACTCGGGT ACGTCACCTCGGTGATAACCGCAGTGATC TGCATcGCCTCCTTCATCAAGCCGCATCAGACT GAAGGCCAAAAACAGTATCAGCTGAGAGGATCGA CGCTGAAGGGTACCGCGAGCAAACGGAGTAC GCTTATCCGCCATGCGAAGAGC'IC~GAAAAG GGGCAGACCTITATGGTGCAATGCCAGGACACTG AGAGGcTAcTATGAAAAGCTCrGCGAG~CAG CGGGGTCTACGATATACCGCCGATCGACCTCATAT= GATGTATAAG-AA ArTCGGCGTA'A SEQ ID 19D5 ATGATGAGTCAAACCAATAAACGC~GAATACGTA NO:94 TGo~A~~Tc~c~c~cA~Ac~ TGAACTCATGTATGTATGAAACCGArGTCGGGT AcGTrcAccTCGGTGGATAcAGCCAATGAT'c AGAC~.CTTAAACGAArAACI GAAGGCCAAAAACAGTATCAGCTGAGAGGAGGCGA CGCTGAGGGTACCGCGAGCGGGAGTACG CTACGCTCGAAG~CTGAAAG GGCAGACcnATGGTGCrAATCCAGGACATTGTGA GCGGTACTATAAAAAGCTCGGCCAGCAGC~ GAAGTCTACGACATACCGCCGATCGACCTCATA = G _____ATGTATAAGAAATGACGTA SEQ ID 20A12 ATGATGAATCAACCAATACCIGAGATACGTA NO:95 TGAGATcAG~cAccGcAcTcGccGTCACG TTGAAGCATGTATGTATGAAACCAGTCGGGT ACGTTCACCTCGGTGGATATACCCAGCTGATC AGCATCGCTCCTCATAATGCCGACArCAGAC~ GAAGGCCAAAACAGTATCAGGAGAAGGGA CGCITGAAGGGTACCGTGAGCAAAGGAGAC GcTcATCCGCCATGCCGAAGAGAAAG MnT A G A rCCTATGTGAACCAGACATTG - 136 - WO 02136782 PCT/USO1/46227 AGCGGGTACTATAAAAAGCTCGGCTTCAGCGAACAAGG CGGGATCTACGACATACCGCCGATCGGACCTCATATT _ GATGTATAAGAAATTGQCATAA SEQ ID 20F2 ATGATTGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:96 TGAGATCAGGCACCGCATTCTCCGGCCGAATCAGCCGC TTGAAGCATGTATGTATGAAACCGATTTGCTCGGGGGT ACGTITCACCTCGGTGGATATTACCGGGGCAAGCTGATC AGCATCGCCTTCATCAAGCCGAACATTCAGAGCTT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CACTrGAAGGGTACCGCGAGCAAAAAGCGGGAAGTAC GCTTATCCGCCATGCCGAAGAGCTTCTTCGGAAAAAAG GCGCAGACCITATGGTGCAACGCCAGGACATCTGTG AGCGGCTACTATGAAAAGCTCGGCTTCAGCGAACAGGG CGAAGTCTACGACATACCGCCGATCGGACCTCATATT GATGTATAAGAAATTGACGTAA SEQ ID 2.10E+12 ATGATrGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:97 TGAGATCAGGCACCGCATTCTCCGGCCGAATCAGCCGC TGGAAGCATGCAAGTATGAAACCGATTTGCTCGGGGGT GCGTITCACCTCGGTGGATATTACCAGGGCAAGCTGATC AGCATCGCTTCCTTTCATCAAGCCGAACATTCAGAGCTT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CGCTrGAAGGATACCGTGAGCAAAAAGCGGGAAGCAC GCTCATCCGCCATGCCGAAGAGCTTCTTCGGAAAAAAG GCGCAGACCTTATGGTGCAACGCCAGGACATCTGTG AGCGGGTACTATAAAAAGCTCGGCTTCAGCGAACAGGG CGAAGTCTACGACATACCGCCGATCGGACCTCATAT T GATGTATAAGAAATTGACGTAA SEQ ID 23H11 ATGATTGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:98 TGAGATCAGGCACCGCATrCTCCGGCCGAATCAGCCGC TGGAGGCATGTATGTATGAAACCGATTTGCTCGGGGGT ACGTTTCACCTCGGTGGATATTACCAGGGCAAGCTGATC AGCATCGCTTCTCATAAAGCCGAACATrCAGAGCT GAACGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CGCTrGAAGGGTACCGCGAGCAAAAAGCGGGCAGTACG CTTATCCGCCATGCCGAAGAGCTTCTCCGAAAAAAAGG CGCGGACCTTTATGGTGCAATGCCAGGACATCTGCGA GCGGCTACTATGAAAAGCTCGGCTTCAGCGAACAGGGC GAAGTCTACGACATACCACCGATCGGACCTCATAITTG ATGTATAAGAAATTGGCATAA SEQ ID 24C1 ATGATTGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:99 TGAGATCAGGCACCGCATICTCCGGCCGAATCAGCCGC TGGAAGCATGCAAGTATGAAACCGATTTGCTCGGGGGC ACGTITCACCTCGGCGATATTATCGGGACAGGCTGATC AGCATCGCTTCCTTCATCAAGCCGAACATTCAGAGCTT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CGCTTGAAGGGTACCGCGAGCAAAAAGCGGGAAGCAC GCTCATCCGCCATGCCGAAGAGCTTCTTCGGAAAAAGG GGGCAGACCTTATGGTGCAACGCCAGGACATCTGTG AGCGGGTACTATAAAAAGCTCGGCTTCAGCGAACAGGG CGAAGTCTACGACATACCGCCGATCGGACCTCATAT -137- WO 02/36782 PCTUSOI/46227 _____ GA lt-GTATA AGMA A ACGACTA A SEQ ID 2406 ATG'ATTGAAGTCAAACCTATAAACGCGG~AAGATACGTA NO: 100 TGAGATCAGCACCGCACTCCGCGATCACG TrGAAGCATGTATGTATGAAACCGAGTCGCT ACGTnTCACCTCGGTGGATATTACCGGGGCAAGCTGATC AQOATCGO [TOOTCATCAAGCCGiAACAT1CAGAGT GAAGGCCAAAAACAGTATCAGCTAACATGGA CGCTTGAAGGGTAccGGAGcAAAAAGCGGAGTAC GCTATCCGCCATGCGAAGAGTTCTTCAAAAAG GCCTCGGACCTTATGGTGCAACGCCAGGATAGG AGCGGCTACTATAAAAAGCTCGGCTTCAGGCAG CGGGGTCTACGACATACCGCCGATCGGACCTCATA = GATCTTATAAGAAATrGGCATAA SEQ ID 2.40E+i08 ATGAITGAAGTCAAACCAATAAACGCGGAAGATACGTA NO: 101 TGAGATCAGGCACCGCA1T=CCGCGATCAQOCCQ TGGAGGCATGCAAGTATGAAACCGATCGG ACGTTTCATCTCGGTGGATTACCGGGGCTGAC AGCATCG0rCTCCTTcATAATGCCGAACATCAGAC~ GAAGGCCAAAAACAGTATCAGCTGAGAGGATG.OA CGCTrGAAGGATACCGcGAGCAAAAAGCGGAGTACG CTTATCCGCCATGCCGAAGACG OCAAGG CGGCAGACC=TATGGTGCAATGCCAGGACATGGA GCGGCTACTATGAAAAGCTCGGCCAGGCAGG GAAGTCTACGACATACCGCCGATCG3GACCTCATAmTG ATGTATAAG.AAATI7C30ATAA SEQ ID 2_803 ATGATGAAGTCAAACCAATAAAOCTGGAGATACGTA NO: 102 TGAGATCAGCACCGTATTTCCGCGAQOGCGC TTGAAGCATGTATGTATAACCGATGCTCGGGT ACGTICACCTCGGCGGATATTACCGACACTGAC AGCATCGCTCTTCATCAGCCGAACATTCAGAGCTI' GAAGGCCAAAAACAGTATAGCTGAGAGGATGGCGA CGCTrGAAGGTTACCGGAGCAAAAGGGGATACG CTTATCCGCCATGGGAGAGCTTCAAGGG GGCAGACC=mATGGTGCAACGCCAGACATCTGCGA GCGCGCTACTATGAAAAGCTCGGCCAGCGACAGG GAAGTCTACACATACCGCCGATCGACCTCATA = G ATGTATAAGTAAMTGCACGTAA SEQ ID 2H3 ATATAGrAACAAAGGAGTCT NO: 103 TGAGATCAG-GCACCGCATTCTCCC'CGATCAQOCCQ TGGAAGCATGAAGTATGAAACCGAT CGGGT ACGTCACCTCGGTGGATATTACCACAGCGAWI AGCACCGO TTTCATCAAGCCGGACATTCAGAGOUT GAAGG0ccAAAAcAGTATCAGCTGAGAATGGA cAcTrGAAGGGTAccGGAGcGAAAAGGAATAc GcTcATccGccATG CGAAGAGGAAG GGGCAGACCTTATGGTGCAACGCAGATACTGCG AGCGGGTATATAAAAAGCTCGGO~AGGCAG CGGGGTCTACGATATACCGCGATCGACCCATA = GATGTATAAGIA A ATTGiACGTAA SE ID 130GS IATGATA ATCAAAcCAATAAACGCGGAAGATACGTA - 138 - WO 02136782 PCTIUSOI/46227 NO: 104 TGAGATCAGGCACCGCATTCTCCGGCCGAATCAGCCGC TrGAAGCATGTATGTTTGAAACCGAT'TTGCTCGGGGGTG CGTTrcAccTcGGTGGATAfTAccAGGGcAAGCTGATcA GCATCGTC=fCATCAAGCCGAACArCAGA~GCG AAGOCAAAAACAGTATCAGCTGAGAGGGATGGCGAC GCTTGAAGGGTACCGCGAGCAAAAAGCGGGCAGTACGC rrTATCCGCCATGcCCGAAGAGC ITC I CGGAAAAAAC~ GCAGACC~iTiTiATGGTGCAACGCCAGGACATCTGTGAG CGGGTACTATAAAAAGCTCGGCTCAGCGAACAG~GC AAGTCTACGACATACCGCCGATCGGACCTCATA=F1GA TGTATAAGAAATTGACGTAA SEQ ID 3B_10C4 ATGATrGAAGTCAGACCAATAAACGCGGAAGATACGTA NO: 105 TGAGATCAGGCACCGTAT1CTCCGGCCGAATCAGCCC ITGAAOGATGTATGTATc3AAACCGAYIGCTCGGGGGC AcGITrcAccT.cGGTGGATATTACCGGGGGAAGCTGATC AGCATCGCCTCC=rTATCAACCGACATCAGACL7 GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CACrGAAGGATACCGTGAGCAAAAAGCGGGCAGTACG CTTACGCTCGAACC'TGAAAG GGCAGACCTI=ATGGTGCAACGCCAGGACATCTGCGA GCGGGTACTATAAAAGCTCGGCTCAGCGAACAGGC GAAGCCTACGACATACCOCCGATCGGACCTCATXITIG _____ATGTATAAGAAAITGACGTAA SEQ ID 3B_10G7 ATGATrGAAGTCAAACCAATAAACGCGGAAGATACGTA NO: 106 TGAGATCAGCACCGACTCCGCCGATCACG TrGAAGCATGTATGTATGAAACCGAT1TrCTCGGGGGT ACGrICACCTCGGTGGATA'fTACCGGGGCAAGCTGATC AGCATCGCCTCCTTCATCAAGCCGAACTCAGA= GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGTGCGA CGCTI'GAAGGGTACCOCGAGCAAAAAGCGGGCAGTACG CTTATCCGCCATGCCGAAGAGC2TCTrCGGAAAAAAGG CGCC-GACC=flATGGTGCAACGCCAGGACATCTCGA GCGGGTACTATAAAAAGTCGGCTCAGCGAACAAG GGGGTCTACGACATACCGCCGATCGGACCCCATAm=TG ________ATGTATAAGAAATTGACGTAA SEQ ID 3B_12B 1 ATGATTGAAGTCAAACCAATAAACGCGGAAGATACGTA NO: 107 TGAGATCAGGCACCGCATCTCCGGCCGAATCA(CG TGGAAGCATGTATGTATGAACCGATGCTGG ACGFI-CACCTCGGTGGATATTACCGGGGCAAGCTGATC AGCATCCGTCCTCCTATCAAGCCGAACArCAGA= GAAGGCCAAAAACAGTATCAGCrGAGAGGGATGGCGA CACTTGAAGGGTACCGCGAGCAAAAAGCGGGAAGTAC GCTTATCCGCCATGCCGAAGAGCTCTCGAAAAAG GGGCAGACCYITATGGTGCAACGCCAGGACATCTGCG AGCGGGTACTATAAAAAGCTCGGC'1TAGCGAACAGG CGAAGTCTACGACATACCGCCGATCGGACCTCATAT1 GATGTATAAGAAATI'GACGTAA SEQ ID 3B_12Dlb ATGATTGAAGTCAAACCAATAAACGCGGAAGATACGTA NO: 108 TGAGATCAGGCACCGTA'TCTCCGGCCGAATCAGCCC TOAAGCATGTATGTACAAACCGATITCTCGGGT -1[39 - WO 02136782 PCTIUSOI/46227 GCGTITACCTCGGTGGATATTACCGGGGCAAGCTGATC AGCATCGCCTCCTITATCCAGCCGAACATTCAGAC1T GAAGGCCAAAAACAGTATCAGOTGAGAGiGGATGGOGA CACTrGAAGGATACCGTGAGCAAAAAGCGGGCAGTACG CTTATCCGCCATGCCGAAGAGC'ITCTrCGGAAAAAAGG CGCGGACCITIATGGTGCAACGCCAGGATATCTGCGA GOGGGTACTATGAAAAGCTCGGCTTCAGCGAACAGGGC GAAGTCTACGACATACCCGCCGATCGGACCCCATATJIT ______ATGTATAAGAAATI'GACGTAA SEQ ID 3B_2E5 ATGATTGAAGTCAAACCAATAAACGCGGAAGiATACGTA NO: 109 TGAGATCAGGCACCGCA2TCTCCGGCCGAATCAGCCC TGGAAGCATGTATGTATGAAACCGATITQCTCGGGGGC ACGTTTCACCTCGGTGGATA'TACCGGGGCAAGCTGATC AGCATCGCCTCCYTATCAAGCCGACATCAGAGT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CACTrGAAGGATACCGTGAGCAAAAAGCGGGCAGTACG C1TATCCGCCATGCCGAAGAGC'TCTCGAAAAAGG CGCGGACCTEiTAT'~iGGTGCAACGCcAGGACATCTGCGA GCGGCTACTATGAAAAGCTCGGCTTCAGCAAACAGGGC GAAGTCTACGACATACCGCCGATCGACCTCATA=mG ______ATGTATAAGAAATTGACGTAA SEQ ID) 3C_10113 ATGAITGAAGTCAAACCAATAAACGCGTGAAGATACGTA NO: 110 TGAGATCAGGCACCGTATTCTCCGGCCGAATrCAGCCC rGAAGCATGTATGTATGAAACCGAITTTGCTCGGGGGC ACGTITCACCTCGGTGC-ATATTACCGGGGCAAGCTGATC AGCATCGCCTCCTITATCAAGCCGAACATTCAGAGCTT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CACTPGAAGGATACCGTGAGCAAAAAGCGGGAAGTACG CTTATCCGCCATGCCGAAGACTCTCGGAAAAAGG GGCAGACCT1TATGGTGCAACGCCAGGATATCTCGA GCGGCTACTATAAAAAGCTCGGCTTCAGCGAACAAGGC GGGGTCTACGACATACCGCGGTCGGACCTCATA1'G ______ _____ATGTATAAGAAATI'GACGTAA SEQ ID 3C_121?I110 ATGATrGAAGTCAAACCAATAAACGCGGAAGATACGTA NO: 111 TGAGATCAGGCACCGCAITCTCCGGCCGAATCAGCCGC TrGAAGCATGTATGTATc3AAACCGATITrGCT'CGGGQGC ACG = ACCTCGGTGGATATTACCGGGCAACTGAC AGCATCGCCTCC2FITCATCAAGCCGAACATCAGAGC'fT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CAC~rGAAGGGTACCGTGGGCAAAAAGCGGGCAGTACG CTrATCCGCCATGCCGAAGAGcCTT-CGGAAAAAAGG CGCGGACCm=TATGGTGCAACGCCAGGACATCTGCGA GcGGGTAcTATGAAAAGcTCGcITrCAGCGAACAGGGC GAAGTCTACGACATACCGCCGATCGGACCTCATATITG ________ATGTATAAGAAAITGACGTAA SEQ ID 3C_9118 ATGATTCIAAGTCAAACCAATAAACGCGGAAGATACGTA NO: 112 TGAGATCAGCACCGTArCTCCC-GCCGATCAGCCG TGAAGCATGTATGTATGAAACCGATEJrGCTCGGGGGC ACGTI)CACCTCGGCGGATA'ITATCAGGACAGGCTGATC AGCTCGCTC2FTCACAGCCGAACN[TCAGAGfT - 140 - WO 02/36782 PCTLTSOI/46227 GAAGCCAAAAACAGTATCAGCTGAGAGGGATQGCGA CACTIrGAAGGGTACCGCGAGCAAAAAGCGGGAAGTAC GCITATCCGCTATGCCGAAGAGCTTCTTCGGAAAAAAG GCGCGCTACCITIATGGTGCAACGCCAGGATATGTGCO AGCGGCTACTATGAAAACTCGGCTrCAGCGAACAQGGG CGAACTTCTACGACATACCGCCGATCGGACCTCATATI= GATGTATAAGAAAITGACGTAA SEQ ID 4A-il 1 ATGATrc+AAGTCAAACCTATAAACGCGGAAGATACGTA NO: 113 TGACTATCAGCACCCATACTC7CCGAACA~GACG TTAAGCATGTATGTATGAACCGATTGCTCGGGT AcGTrcAccTCGGTGGATATrAccGGGGcAA~cTGATC AGCATCGCCTCCTTCATCAAGCCGAACATCCAGACI GAAGGTCCAAAAACAGTATCAGCTGAGAGGGATGGOGA CGCTrGAAGGGTACCGCGAGCAAAAAGCGGOCAGTACG CTrATCCGCCATGCCGAAGAGCTrTC OGAAAGG GGCAGACCTTATGGTGCAACGCCAGGACATOTGGA GCGGCTCAGAGTGCTAGAA~ GAAGTCTACGACATACCGCGATCGGACCTCATA=IG ATGTATAAGAAATGAcGTAA SEQIfl 4A_1C2 ATGAFI7GAAGTCAAACCAATAAACGCGGAAGATACGTA NO: 114 TGAGATCAGCACCGCATCCGCCGTCACG TGGAAGCATGCAAGTATGAAACCGAOTCGCGG ACGTrCACcTCGGCGGATATTATCGAGTGATC AGcATGccTccTTcATCAAGCCGAACAICAGAC~ GAAGGCCAAAAACAGTATCAGCTAGAGGATGGA CGCTTGAAGAGTACCGGAGCAAAAACGGGAGTACG CTTATCCCCATGCCGAAGAGCITCCGGAA CGCAGACCT=ATGGTGCACGCCAGACACTGGA GCGGGTACTATAAAAGCTCGGTCAGGAACACrG GAAGTCTACGACATACCGCCGATCGGACCTCATA = G ATGTATAAGA A ATG(ACGTAA SEQ ID 4B-13El A TGATTAAGTCAAACCTATAAACGCGGAAGATACGTA NO: 115 TGAGATCAGCACCGCATCCGGCCGAATCACG TGGAAGCATGCrAAGTATGAAAccGAThrGcTcGGO AcGMTCACCTCGGTGGATAACCAGTGATC AGcATcGcTrcc71TATCAAGCCGAACATCCAGAcT GAAGGOCAAAACAGTATCAGCTGAGAGGGATCGA CACTTGAAGAGTACCGCGAGCAAAACGGGAGTACG cYFATCCGCCATGCCGAAAG~=GGAAAAAAOGG CGCGGACCTTGTGGTGCAACGCCAGGATATTGGA GCGGCTACTATGAAAAGCTCGGCTTCAGCGAACAGG GAAGTCTACGACATACCCCGATCGACCTCATATM~ _______ATGTATAArAAATTrGACGTAA SEQEII 4B_13G10 TTACGTCAATITC'ITATACATCAAAATAT)GAGGTCCGAT NO: 116 cGOcGGTATGTCGTAGACTCGCCCTTCGTGGCC GAGOTT=ATAGTACCCGCTCGCAGATGTCCTG~ GcAccATAAAAGGTCCGCGGCYICCGAAGAGC TTCGGCATGTGCGGATGAGCGTGCTTCCCGCTT=GCTIC GCGGTACCCTTCAAGCGTCGCATCCCTCTACTGATA ______ TGTE1GGCCTTCAACCGAGTGCGT - 141 - WO 02/36782 PCTIUS01/46227 _______-AAAGGAGvGCGATGCTGATCAGCTJ7GCCCCGGTAATATC CACCGAGGTGAAACGTGCCCCCGAGCAAATCAG~CA TACT rCATGCICCAGcGGCTGAT7CGGCCGGAGAATG cGGTGccTATCTcATAcGTATCTCCGGrA~TrG _________ rTrTCAATCAT GAATCA SEQ ID 4B_16E1 ATGATTGAACTTCAA.ACCAATAAACGOGiAATCT NO: 117 TGAGATCAGGCACCCATCTCCGCGATCACG AcGTrcAccTCGGCGGATATACCGGGGCAAGCTGAT cAGGATCGccTCCTrcATcAAGccGAACArCAGAC TGAAGGGjCAAAAACAGTATCAGCTGAGAGGGATGCG ACACTGAGGGTACCGCGACAAAAAGGAGTAC GCITIATCCGCCATGCCGAAGACTTCGGAAAA GGGCAGACC1I I IATGGTGAACGCCAGGACATCTG AGGCTCAAAACTGCrACACAC CGGGGTCTACGATATACCGCCGATCGGACCTCATA =~ ________GATGTATAAGAAATI'GACGTAA SEQ ED 4B_17A1 ATGATTGAAGTCAAACCAATAAACGCG(3AAGATACGTA NO: 118 TGAGATCAGCACCGCATCTCCrGCCGAATCACG TGGAAGCATGCAAGTATGAAACCGCGGiG AcGTrcAccTCGGGGATA1TGG~AAcGAT CAGCATCOCCTrCATAGCGAGATCCAGAGCT TGAAGGCCAAAAACAGTATCAGCTGAGAGATG ACGI(_GAAGAGTACCGCGAGCAAAGGGAGTAC GCTATCCCCATGCCGAAGAGCTTCAAAG GGGCAGACCTITATGGTGGAAcGcCAGGACATCTG AGCGGTACTATGAAAAGCTCGGCAGCAG CGAAGTCTACGACATACCGCCGATCGACCTCATA = ____GATGTATAAGAAA'TGACATAA CT SEQ ID 4B_1 SF11 ATG3AnGAAGTcAATCcAATAAAc~cGGAGATACT NO: 119 TGAGATCAGCACGACTCCGCCGATCAC TrGAAGCATGTATGTATGAAACCGATGCTCGG ACGTTCACCTCGCGGATACGGAAGC~TGAT CAGCATCGTCGTCATAATGCGAACArCAGAC TGATGGCCAAAACAGTATCAGCTGAGAACTGGA CACGAAGGGTACCGCGAGAAAAGGAGAC GCTCATCCCCATGCGAAGAGCTCcGAAAG GCGCAGACCT=lATGGTGCAACGCCAGGACATTGTG AGCGGTCTATATAAGCTCGCAGCAG CGAAGTCTACGACATACCGCCGATCGGACCATA=l _____GATGTATAArA A A'TACGTAA SEQ ID 4B_19C8 ATGAnrGAAGTCAAACCAATAAAcXcGAGATACGTA NO: 120 TGAGATCAGGCACCGCATCccGCGCCGAC TGGAAGCATGCAAGTATGAAACCG~TTGCTCXG ACGTTCACCTCGGCGGATA 1ACCGACGAT CAGCATCGC~Tc=nCATCAAGCCGAACATCCAGACT TrGAAGGCCAAAACAGTATCAGCTGAGAGATGGCG ACGCTGAAGGGTACCGCGAGAAAAGAAGA CGCTCATCCGCCATGCGAAGACCTcTCAAAG ______GGGGAGACCTATGGcTGCkACGCCA(ACAT - 142 - WO 02136782 PCTIUSOI/46227 - GAGCGGGTAcTATAAAAAGCTCGCCAGCG:AACAAG GCGGGTCTACGATATACCGCCGATCGGACCTCATA'1Tr ________TGATGTATAAc3AAATTGGCATAA SEQ ID 4B_1G4 ATGATrGAAGTCAAAccAATAAAcGCGGAAGATAc&TA NO: 121 TGAGATcAGGcAccOcATcTccGCCGAATCACC TGGAAGCATGGAAGTATGAAACCGATITGCTCGGGGGT GCGTITCACCTCGGGATAITACCGGGGCAAGCTGAT CAGCATCGCCTCCT1CATCAATCCGAACATCCAGAGCT TGAAGOCAAAAAcAGTAT)CAOcTGAGAGGGATGGG ACGCTTGAAGGGTACCGOGAGCTAAAAGCGGGAAGTAC GGTrATCCGCCATGCCGAAGAGCTTCGGAAAAAAG GCGcGGAccTIJATOGTGCAACGCCAGGATATCTOCG AGCGGGTACTATAAAAAGCTC-GGTCAGCGAACAGG CGAAGTCTACGACATACCGCCGATCGGACCTCATATI GATGTATAAGAAAITGACGTAA SEQ ID 4B_21C6 ATGATrGAAGTcAAAccAATAAACGCGGAAGATACGTA NO: 122 TGAGATCAGGCACCGCATCTCCGGCCGAATCAGCCC TTGAAGCATGTATGTATGAAACCGATITGCTCGGGGGC ACG'TrrCACCTCGGTGGATATTACCGGGGCAAGCTGATC AGcATcGc~ccTflTcATcAAGccGAAcAYcAGAcT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CACITAAGAGTACCGCGAGCAAAAAGCGGGAAGCAC GCTCATCCGccATGCCGAAGAGC1CTTCGGAAAAAAG GCGCGGACCTEIATGGTGCAACGCCAGGATATCTGCG AGCGGCTACTATAAAAGCTCGGC~rCAGCGAACAAG CGGGGTCTACGATATACCGCCGATCGGACCTCATA =r _____GATGTATAAGAATGACGTAA SEQ ID 4B_2R7 ATGATrGAAGTCAAACCAATAAACGCGGAAGATACGTA NO: 123 TGAGATCAGCACCGCACTCCCCGTCACG TTGAAGCATGTATGTATGAAACCGATTGCTCGGG ACGTrACCTCGGTGGATATTACCGGCAATA AGCATCGCCTCCTCATCAAGCCGAACATCAGACT GAAGGCCAAAAACAGTACCAGCTGAGAGGGATGCGA CGCTrGAAGGGTACCGCGAGCAAAAAGCGCGGCAGTACG CTTATCCGCCATGCCGAAGAGCTTr T~CGG AAGG GGCAGACCF17ATGGTGCAACGCCAGGACATCGGA GCGGCTACTATAAAAAGCTCGCTrCAGCGAACAAGC GGGGTCTACGGCATACCGCCGATCGGACCTCATA = G ATGTATAAGAAATTGACATAA SEQ ID 4B_2118 ATGATI'GAAGCCAAACCAATAAACGCGGAAGATACGTA NO: 124 TGAGATCAG~cAccGcAT~cTCCGGCCGATCACG TGGAAGCATGCAAGTATGAAACTGATGCTCGGGG ACG=CACCTCGGTGGATA-fTACCGGGGCAGCTGAC AGcATccTCCTTTCATCAAGCCGAACATCAGACT GAAGGCCAAAAACAGTATCAGCTGAGAXXATGGA CGC-TGAAGGGTACCGCGAGCAAAAAGCGGGAACAC GCTCATCCGCCATGCCGAAGAGCTCTCGGAAAAG GCGCGGACCITIATGGTGCAACGCCAGGACATCTG AGCGGGTACTATAAAAAGCTCGGCflCAGCGAACAG I ICGAAGTCTACGACATACCGCCGATCGGACCTCATA=' - 143 - WO 02/36782 PCTTJSOI/46227 _______ ATGTATAAGiAAAITGACGTAA SPIQ ID 413_6D8 ATG'ATrGAAGTCAAACCAATAAACGGGAGATACGTA NO: 125 TGAGATcAGGcAccGcATACTCCCCGATCACG ACGTCACCTCGGTGGATA fl~cccGWAC~TGATC AGCATCCCflCATCAAGCCGACATCAGACTr GAAGGccAAAAAcAGTATcAGcGAGAAGGCGA CQCGAAGGGTACCGCGAGCAAAACGGTAGTACG CTTATCCGCCATGCCGAAGAGCGAAAAG GGCAGACCTFATGGTGccAACGCCAGGACATCTCGA GCGGGTACTATAAAAGCTCGGCTCAGCGACATG GAAGTCTACGACATACCGCCGATCGGACCTCATAmTG ____I ATGTATAArU A ATYACCGTAA SEQ ID 4378 A TGATGAAGTcAAACCAATAAACXGAGATACGTA NO: r126 TGAGATCAGCACCGCACTCCCCTCACG TGGAAGATGCATGTATGAAACGATCGG AcGTcAccTCGGTGGAiTAcGTAAc~TGATC GAAGGCCAAAAACAGTATCAGCTGAGAGACA CACTGAAGGGTACCGCGAGCAGCGGAGTACG CTTATCCCCATGcGAAGAGCTrCGAAAGG GCGCAGACC=ATGGTGcAAcGCCAGACATTGTA GCGGGTAcTATAAAAAGCTCGGCTCAGCGCAGG GAAGTCTACGACATACCGCCGATCGGACCTCATAWJG ______ATGTATAAGAAAITGACGTAA SEQ ID 4C_8C9 ATGAfTGAAGTCAAACCAATAAACCGGAGATACGTA NO: 127 TGAGATCAGGCACCGCATCGGCCGAATCAGCCC TrGAAGCATGTATGTATGAAAcArGcAGGGAGT GCGTCACCTCGGTGGATATACCGCAGTGATC GAAGGCCAAAAACAGTATCAGCTGAGAGAGGGA CACTGAAGGATACCGTGAGCAAACGGAGTACG cTTATccGccATGccGAAGACGGAAAG GGCAGACCflT,1,ATGGTGCAACGCAGACATCTG-GA GAGTCTAcGAcATACCGCCGATCGACATA='G _______ATGTATAAGAAATTrA ACATAA SEQ ID 4H1 ATGATGAGGTGAACCGATAACCAGAGAGAC~rA NO: 128 TG3AACTAAGGCATAGGATACTCAGACCACACCACCGA TAGAGGTrGTATGTATGAAACCGA TIACTGTGT CGTCACTTAGOCGACTAGCAAGCTGA CCATAG~t-CATTCCACCAGGCCGACATCCAGACTCC AGGCCAGAAACAATACCAACTCGAGTATGCACC TrGGAAGGTrATCGTGACCAGAAAGCGGGATCGACC AATAACACGCTGAACAGATCCTCGGAAGGG CGGACATGCTATGGTGCAATGCGGACACCG GGCTACTACAAAAGTTAGGCTTCAGCGACAGGAGA GGTATTTGAAACGCCGCCAGTAGGACCTCACATCGTA ______TGTATAAACGCCTCACATAA SE ID 16 14D10 IATGjA-TGTU.CAAACCAATAACCGGAGATACGTA -144- WO 02/36782 PCTIUSO1/46227 NO: 129 TGAGATCAGCACCGCATCTCCGCCGAATCAGCG TGGAAGCATGTATGTATGAACCGA GCTGG AcGTrrcAccTcGGTGGATNFI'AccGAG0cAA~cTGATc AGCATCGCCTCCTCCATCAACCGAACATCAACT GAAGGCCATAAACAGTATCAGCTGAGAGGGATGGCGAC ACITAAGAGTACCGCGAGCAAAAAGCGGGAAGCACG CTCATCCGCCAT0CCGAAGACTCTCGGAAAAGGG GGCAGACC1TIATGGTGCAACGCCAGGACATCTGCGA GCGGCTACTATAAAAAGCTCGGTTCAGGAACAAG GGGGTCTACGACATACCGCCGGTCGGACCTCATA = G ATGTATAAGAAATTGiACGTAA SEQ ID 6_15G7 ATGATFGAAGTCAAAccAATAAACGCGGAAGATACGTA NO: 130 TGAGATCAGGCACCGCATCTCCGCCGATCACG rAAGCATGTAAGTATGAAACCGA=TCGGGGC ACG = TACCTCGGCGGATATTACCGGGGCAAGCTGAT CAGCATCGC=CCTTCATCAAGCCGAACAYI'CAGAGCT TGAAGGCCAAAAACAGTATCAGCTAGAGGATGG ACACTGAAGGGTACCGCACAAAAAGCGGACA CGCTCATCCGCCATGCCGAAGACTCTTCGGAAAAA GGCGcGGACC1TTTi'ATGGTGCAACGCCAGGACATCTC GAGCGGGTACTATAAAAGCTCGGC1CAGCGACAG GCGAAGTCTACGACATACCCCGGTCGGACCTCATAT TGATGTATAAGAAATrGACGTAA SEQ ED 6_16A5 ATGATrGAAGTCAAACCAATAAACGCGGAAGATACGTA NO: 131 TGAGATCAGCACCGCATCTCCGCCGATCACG TGGAAGccATGcAAGTATGAAAccGAiGAGcTCGGT AcGTITCACCTCGGTGGATATTACCGGGCAACGATC AGCATGCCTCCTTCACCAAGCAACATrCAGACT GAAGGCCAAAAACAGTATCAGCTGAGAGGATGGA CACTTGAAGGATACCGTGAGCAAAAAGCGGAGTACG CTATCCGCCATGCCGAAGAGCTTCAAAG GGCAGACCTTIATGGTGCAACGCGAGGACATCTGCGA GCGGGTACTATAAAAAGCrCGGCTCACGAACAGG GGGGTCTACGACATACCGCCGGTCGGACCTCATAmTG ATGTTATA AGAAATTGACGTAA SEQ ID 6_16F5 ATGA'TGAAGTCAAACCAATAAACGCG(GAAGATACGTA NO: 132 TGAGATCAGCACCGCATCTCCGCCGAACACG TTGAGCATGTATGTAT03AAACCGAflGCTCGGGG Ac=CTGTGTTTCGGCACGT AGCATCGCCTTCATCAAGCCGTACAAACT GAAGGCCAAAAACAGTATCAGCTGAGAGGATGGA CACTTGAAGGATACCGTGAGCAAAAAGGAGTACG CTrATCCGCCATGCCGAAAGCTCTCGGAAAG GGCAGACC1TTTATGGTGCAACGCCAGGACATCTGCGA GCGGCTACTATAAAAAGCTCGGCTAGCGAACGG GGGGTCTACGACATACCGCCGGTCGGACCTCATA= ATGTATAAOyAAATTGACGTAA SEQ ID 6_17C5 ATGATTC3AAGTCAAACCAATAAACGCGGAAGATACGTA NO: 133 TGAGATCAGGCACCGCATCTCCGCCGAATCACG TTmA ArGATGCAAGTATGAAGCCGATTGCTCGGG - 145 - WO 02/36782 PCTJUS01/46227 ACGTTCACCTCGTGATATTCCGGAGCGATC AGCATCGc~rccTcATcAAGCCGAGCATCCAGACT GAAGGCCAAAACAGTATCAGCTGAGAGGATGGA CACTrGAACGAAACCGTGAGCAAAAAGCGGGCAGTAC GCTATCCGCCATGCGAAGAGCGGAAAAAG GGGcAGAccTTii~ATGGTGCAACGCCAGGACATCTGG AGCGGGTACTATAAAAAGCT CTCAGACAO CGAAGTCTACGACGTACCGCCGATCACCCATAT= GATGTA.TAACi-AAATACOTAA SEQ ID 6_18C7 ATGATI7GAAGTCAAACCAATAAACGCGGAAGATACGTA NO: 134 TGAGATCAGCACCGCATCTCCGCCGATACG ACGTTCACCTCGGTGGATAATGcAGTGATC AGCATCCTTCCTTATCAAGCCGCACCAGAC'I GAAGGCCAAAAACAGTATCAGCTGAGAATGGA CGCTrGAAGGATACCGTGAGCAAAAAGCGGAGTACG CTATCCGCCATGcGAAGAGGCTTCAAAC-G GCAGACCTT1''ATGGTGACGCCAGGATATCGGA GCCOGGTATATAAAAAGCTCGG~CAGCGCAGG GAAGTIACGACATACCGCCGGTCGGACCTCATATO ATGTATAArGA A AfT7Ar-GTAA SEQ. ID 6_18D7 A TGAnTGAAGTCAAACCAATAAACCGGAAATACGTA NO: 135 TGAGATCAGCMCCGCACTCCCGATCACO TrGAGCATGTATGTATGAACCGATTGCTCGGGT ACGTCACCTCGGTGGATACGGGAAGCGATC AGACC~~ACACG.CTCGGT GAAGGCCAAAACAGTATCAGCTGAGAGGAGGGA CACTGAAGGGTACA-rAGCGGGAAGCAC GCGCGGACCT=ATGGGCAACGCCAGACATCTG AGCGGTACTATAAAAGCGTCAGCGAACA CGGGGTCTACGACATACCGCTCGACCTCATAT= _____GATGTATAAGAAATTrACGrTAA SEQ ID 6_19A10 IATGAYGAAGCCAAACCAATACICGGAGATACGTA NO: 136 TGAGATCAGGACCGCACCCCGATCACG TTGAAGCATGTATGTATGAACCGATCCCYTG ACG1TACCTCGGTGGATAGGAAGC~TGATC GAACGGCCAAAACAGTATCAcTGAGAGGAGC-rGA CACTGAAGCTGTACCGcGAGAACCCTAGTAC GCTTATCCGCCATGCCGAAGAGCGAAAG GGGCAGACcY=ATGGTGCAACGCCAACATTG AGCGGGTACTATAAAAAGCTCAGCAG CGAAGTCTACGACATACCcCGACCGACCCCATA=m ______GATGTATAAGAAAYPGACGTAA SEQ ID 6_19B6 ATGATGAAGTcAAAcCAATACXGAGATACGTA NO: 137 TGAGATCAGCACCGCATCTCCGCGTCACO ,IGAACATGTATGTATGAAACCGATCTCAGGT GCGTTACCTCGGTGGATAGG)AGCTGATC -146- WO 02136782 PCTIUSOI/46227 GAAGGccAAAAAcAGTATcAGcrGAGAGGGTATGcGA CACTCGAAGGATACCGTGAGCAAAAAGCGGCAGTACG CTTATCCGCCATGCCAAGAGCTGTCGGAAAGG CGCAGACCTITATGGTGCAACGCCAGGACATCTGCGA GCGGCTACTATAAAAAGCTCGCTCAGCGAACAGG GAAGTCTACGACATACCGCCGGTCGGACCTCATA1TG _____ATGTATAAGAAATTGACGTAA SEQ ED) 619C3 ATGATAAGTCAAACCAATAAACGCGGAATCT NO: 138 TGAGATCAGGCACCGCATCTCCGCCGAATCACG TGGAAGCATGCAAGTATGAAACCGATGCTCGCFGT ACGTITCACCT'CGGCGGATAITACCGGGGCAAGCTG AT CAGCATCGCCTCCTITATCAAGCCGAACATCCAGACT TGAAGGCCAAAAACAGTATCAGCTGAGAGGGATGG ACAC1GAAGGATACCGTGAGCAAAAAGCGGGCAGTAC GCTFATCCGCCATGCGAAGAGCTrCIGGAAAkG GCOCGGACC11ITATGGTGCAACGCCAGGACATCTG AGCGGGTACTATAAAAAGCTCGGCTTCAGCGAACAG CGAAGTCTACGACATACCGCCGATCGGACTCATAm' GATGTATAAGAAAITGACGTAA SEQ ID 6_19GB ATGAfTGAAGTCAAACCAATAAACGCGGAAGATACGTA NO: 139 TGAGATCAGCACCGCATCTCCGGCCGAATCACG TGGAAGCATGCAAGTATGAAACCGAGTCGCGGT ACGTTACACCT'CGGTGGATATACCGGGGCAACGAT CAGCATCGCTCCITCATCAAGCCGAACACCAGAC TGAAGGCCAAAAACAGTATCAGCTGAGAGATG ACACTIGAAGGATACCGTGAGCAAAAAGCGOAGTAC GCTTATCCGCCAAGCCGAAGAGCTfCAA GGGCAGACGTTATGGTGCAACGCCAGACATCG AGCGGGTACTATAAAAAGCTCGGCTTCAGACAG CGGGGTCTACGACATACCGCCGGTCGGACCTCATAT= _____GATGTATAAGGAAITGACGTAA SEQ IID 6_20A7 ATGATTGAAGTCAAACCAATAAACCrGGAAGATACGTA NO: 140 TGAGATCAGGCACCGCACCGCCGAACG TrGAAGCATGTATGTATGAAACCGAGCTCAGKO ACGTrCACCTCGGCGGATAlACCGGACGAT cAGcATcGCTccTcATcAAGccGAAcArCAGATT TGAAGGCCAAAAACAGTATCAGCTGAGAGATG ACACTGAAGAGTACCGCGAGAGGGAC CGCTCATCCCCATGCCGAAGAGCTCGAAAG GGGGCAGACCTTATGGTCAACGCCAGACATCG GAGCGGGTACTATAAAAAGTCGCGCAACAG GCGAAGTCTACGACATACCGCCGGTCGGACCTCATA'f _____TGiATGTATAAG AA ANTMAC(GTAA SEQ ]I) 6_20A9 ATGAfTGAAGTcAAACCAATAAACCGGAGATACGTA NO: 141 TGAGATCAGGCACCGCATCCCGGCCGATCACG TGGAAGCATGCAAGTATGAAACCGAGCTCGG AcGTTrcAccTcGGTGGATATAccGGGGCAA~GATc AGCATCGCTCCTTCATCAAGCCGACAAGACT GAGGCCAAAAACAGTATCAGCTGAGAGGATGGA CACTTGAACGGGTACCGCGAGCAAAAAGCggAGTACG - 147 - WO 02/36782 PCTIUSO1/46227 CTTATCCGCCATGCCGAAGAGCTTCTACGGAAAAAAGG CGCGGACCTTATGGTGCAACGCCAGGACATCTGCGA GCGGGTACTATAAAAAGCTCGGCTTCAGCGAACAAGGC GGGGTCTACGACATACCGCCGGTCGGACCTCATATTG ATGTATAAGAAATTGACGTAA SEQ ID 6201H5 ATGATTGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:142 TGAGATCAGGCACCGCATTCTCCGGCCGAATCAGCCGC TGGAAGCATCAAGTATGAAACCGATTTGCTCGGGGC ACGTTTCACCTCGGCGGATATTACCGGGGCAAGCTGAT CAGCATCGCCTCCTTTCATCAAGCCGAACATTCAGAGCT TGAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCG ACACTGAAGGATACCGTGAGCAAAGCGGGA.AGTAC GCTrATCCGCCATGCCGAAGAGCTT CGGAAAAAAG GCGCGGACCnTrATGGTGCAACGCCAGGACATCTGCG AGCGGCTACTATAAAAAGCTCGGCTTCAGCGAACAGGG CGAAGTCTACGACATACCGCCGATCGGACCTCATAT GATGTATAAGAAATTACGTAA SEQ ID 6_21F4 ATGATrGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:143 TGAGATCAGGCACCGCGTTCCCGGCCGAATCAGCCGC TGGAAGCATGTATGTATGAAACCGATTTGCTCGGGGGT GCGTCACCTCGGTGGATATACCGGGGCAAGCTGATC AGCATCGCCTCCTTTCATCAAGCCGAACATCCAGAGCT GAAGGCCAAAAACAGTATCAGcTGAGAGGGATGGCGA CACTTGAAGGGTACCGCGAGCAAAAAGCGGGCAGTACG CTrATCCGCCATGCCGAAGACTTCTTCGGAAAAAAGG CGCGGACCTTATGGTGCAACGCCAGGACATCTGCGA GCGGGTACTATAAAAAGCTCGGCTTCAGCGAACAGGGC GAAGTCTACGACGTACCGCCGGTCGGACCTCATATTG ATGTATAAGAAATTGACGTAA SEQ ID 6_22C9 ATGATTGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:144 TGAGATCAGGCACCGCATTCTCCGGCCGAATCGGCCGC TGAAGCATGTATGTATGAAACCGATGCTCGGGGGC ACGTTTCACCTCGGTGGATATTACCGGGGCAAGCTGATC AGCATCGCCTCCTTTCATCAAGCCGAACATCCAGGGC GAAGGCAAAAAACAGTATCAGCTGAGAGGGATGGCGA CACTTGAAGAGTACCGCGAGCAAAAAGCGGGAAGCAC GCTATCCGCCATGCCGAAGAGCTTCTTCGGAAAAAAG GCGCGGACCTTTTATGGTGCAACGCCAGGACTTCCGCG AGCGGGTACTATAAAAAGCTCGGCTTCAGCGAACAAGG AGGGGTCTACGACATACCGCCGGTCGGACCTCATAT GATGTATAAGAAATrGACGTAA SEQ ID 6_22D9 ATGATTGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:145 TGAGATCAGGCACCGTATTCTCCCGAATCAGCCGC TGGAAGCATGCATGTATGAAACCGATTGCTCGAGGGC ACGTITCACCTCGGTGGATATTACCGGGGCAAGCTGATC AGCATCGCCTCCTTTCATCAAGCCGAGCATTCAGAGC GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CACTTGAAGGATACCGTGAGCAAAAAGCGGGCAGTACG CTTATCCGCCATGCCGAAGAGCTTCTTCGGAAAAAAGG CGCGGACCTTTATGGTGCAACGCCAGGACATCTGA - 148 - WO 02136782 PCT1US01/46227 GccGGGTAkcTTAAAAkcirTcGGcTrcAGcGAAcAG~ GAAGTCTACGACATACCCGCCGGTCGGACTCATA="CG ATnTATAAGAAATTGACGTAA SEQ ID 6_22M1 ATG A'TGAAGTEAAACCAATAAACOCGGAAGATACGTA NO: 146 TGGTAGACC~CTCGCATACG fTGAkAGCATGTATGTATGAAACCGA1TGCTCGGGO ACGTTCACTCGTGATATTACCCAGTGATC GAA.GGccAAAAAcAGTATcAGCTGAGAGGGATGCGA CCTTGATGAGTACCGCGAGCAAAAA GGGAGTACG CTACGCTCGAAC=GAAAG COCAGACC ITT1"ATGGTGCAACGCCAGACATCGGA GCGGGTAcTATAAAAAGCTCGGCTCAGGAACAGGC GAAGTCrACGACATACCGCCGATCGGACCCCATAm=TG ATGTATAAGAAATTGACGTAA SEQ ID 6_23113 ATG3ATrGAGTCAAACCAATAACGCGGAAATACGTA NO: 147 TGAGATCAGGCACCGCATTCTCGCGATCAOOCCO TTGAAGCATGTATGTATGAACTOAYOTTCTCGG ACGTTCACCTCGGTGGATAACCGGGCTGAI AGCTGCTrFCCATCAAGCCGCACCAGACIT GAAGGCCAAAAACAGTATC-AGCTGAGAGGAGTCGA CACTTGAAGGTACCGCGAGCAAAAACGGGAGTACG cTATcCGCCAT~CGAAGAGcTGAACYCGG GGCAGAccTTATGGTGCAACGCCAGGACATCTGCGA GCGGGTACTATAAAAAGCTCGGCTCAGGAGG GGGGTCTACGACATACCGCCGGTCGGACCrATAmTG ______ATGTATAAriAAATr7GACGTAA SEQ ID 6_23H17 ATGATTGAAGTCAAACCAATAAACOOOGAGATACGTA NO: 148 TGAGATCAGGCACCGCA =CCGGCGATCAOCCO TTGAAGCATGTATGTATAAACCGATTCGGG ACGTrCACCTCGGTGATATTACCGGAACGAi GAAGGCCAAAAACAGTATCAGCTGAGAAGGCGA CGCTTGAAGGATACCCGAGCAAAAAGGGAGTAC GCTATCCGCCATGCAGAA AGATGAAAG 0000 GAccTC TATGGTGCAACGCCAGGACATCTOG AGCGGGTACTATAAAAAGCTC CACGCAG CGGGGTCTAcGAcATACCGCCGGTCGACCTCATA = GATGTATA ArA A ATrGACGTAA SEQ ID 6_2111 ATGATGAAGTCAAACCAATAACGGAGATACGTA NO: 149 TGAGATrCAGCACCGCCWI'CCCGCGAATAGCCGC TGGAAGCATGTATGTATGAAACCGAGCCGTGC ACGTTCACCTCGGTGGATATACCAGTATC AGCATCCCTCCnCATCAAGCGACATCCAGACTT GAAGCAAAACCGTATCAGTGAGAATGGA CAcrGAAGATACCGAAAAGCGGAGTACG CrrATCCGCATGCGAAGAGCT rCGGAAAAAG CGCGGACCTATGCTTGCAACGCCAGACACTGGA GCGGGTACTATAAAAAGCTCGCCAGACAG GAAATCTACG-AATAccGCCGATCACCTCATAT=T - 149 - WO 02136782 PCT/TSO1/46227 ATGTATAAGiAAATTGiACGTAA SEQ ID 6_3D6 ATGX1TrGAAATCAAACCAATAAACGCGGAAGATACGTA NO: 150 TGAGATCAGGCACCCAfCTCCGCCGATCACG rGAAGCATGTATGTATGAAAccGAT'1GcTcGGGGT ACGTTCACCTCGGTGGATATTCACCGAGGCAACTGATC AGCATCGCCTCCTTCATCAAGCGAACATCCAGACT GAAGGCCAAAAACAGTATCAGCTGAGAGGATGCGA CTCTTrGAAGGATACCGTGIAGCAAAAAGCGGGCAGTACG C=ATCCGCCATGCCGAAGAGCTCTTCGGAAAAAG GGCAGACCTI=ATGTGCAACGCCAGGACATCTGCGA GCGGGTACTATAAAAAGCTCGGCTCAGCAACAGG GAGGTCTACGACATACCGCCGGTCGGACCTCATA~rrTrc SATGTATAAGAAITrGACGTAA SEQ ID 6_3(3 ATGATrGAAGTCAAACCAATAAACGCGGAAGATACGTA NO: 151 TGAGATcAG~cAccGcATCCGGccGAAACG TGGAAGCATGTATGTATGAAACCGA GCTCGGG ACGTTrCACCTCGGTGGATATTACCGGGCAAGCTGATC AGCATCCCTCCTCATCAGCCGAACATCAGACT GAAGGCCAAAA-ACAGTATCAGCTGAGTAGGGATGGCGA CACTrGAAGGATACCGTGAGCAAAA-AGCGGGCAGTACG CTrATCCGCCATGCCGAAGAGCTrCCGGAAG CGCGGACCTTAT' KGGTGCAACGCCAGACATTGCA GCGGCTACTATAAAAAGCTCGGCTTCAGCGAACAGG cTAAGTCTAcGAcATAccGccGGTcGGAccTCATA=~G SATGTATAAGAAAITGACGTAA SEQ ID 6_3H2 ATGATGAAGTC-AAACCAATAAACGCGGAAGATACGTA NO: 152 TGAGATCAGCACCGCATCTCCGCCGAATCACG TGGAAGCATGTATGTATGAAACCGATT GCTCGG ACG=TACCTCGGTGGATATGCACTGAT'C AGCATCGCCTCCT=ATcAAGcCGAACACCAGACT GAAGGCCAAAAACAGTATCAGCTAGAGATGGA cAcTrAAGAGTACCGCGAGCAAAAACGGAAGCAC GCCTCCAGCAGGT=CGAAG GGCAGACCTCfTATGGTGAACGCCAGACATTG AGcGGGTAcTATAAAAAGcTCGGCCAGGCAG cGAGT~cTAcGAcATACCGCCGGCGGACCCATAT GATGTATAAG-rA A A TG(-A CATAA SEQ ID.- 6_4A10 ATGATTGAGTCAAACCAATAACCGAGATACGTA NO: 153 TGAGATCAGCACCGCATCTCCGGCCGATCACG rnAAGCATGTATGTATGAAAccGATGTGCGGX ACGITCACCTCGGTGGATATrAC CAA~GATC AGCATCGcCTCcTTCATCAAGCCGAACATCCAGACT GAAGGCCAAAAACAGTATCACTGAGAGGGAGGGA CGCTGAGGATACCGTGAGAGGAGTACG CTATCCGCCAT GCGAAGAGcCGA AG cGcGGACITiATGGTGCAACGCCAGACATCTGCGA GCGGCTACTATAAAAAGCTCGGCTCACGAACAGGC GAAGTGTAcGAcATAccGGcGGTcGGAcTcATATT=G ATGTATAAGAAAfTGACGTAA FS- ID 6 4B1 IATGArGAGTCAAACCAATAAACGAGATACGTA -150- WO 02/36782 PCT/USOI/46227 NO: 154 TGAGATCAGCACCGCTACTCCGCCGATCACG TrGAAGCATGTATGTATGAAACCGATTGCTGC~ GGCATCGCTCCTCATCAAGCCGAACATCCAGA~f GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGCGA CAG TGAAGGGTACCGCCGAGCAAAAAGGGGCXAGTACG CTTATCGCCTGCCAAGACT~C G~GG GGCAGACCnmT'ATOGTGCAACGCCAGGACATCTCGA GCGGCTACTATGAAAGCTCGGCTrCAGGACAGG GAAGTCTACGACATACCGCCGATCGGACCTCATAmTG ATGTATAkAGAAATrGACATAA SEQ ID) 6_51)11 ATG3ATrGAAGTCAAACCAATAAACCG(AATACGTA NO: 155 TGAGATCAGGCACCGCATCTCCGGCGTCACG TrGAAGCATGTATGTATGAAACCGTTGCTCGGG ACGTTACCTCGGTGGATATACCCGGCAGTGATC AGCATCGTCCTCATCAAGCCGAACATCCAGACT GAACGCCAAAAACAGTATCACTAGAGAGGATGGA CACTAAGAGTACCGCGACr-AAAAACGGGAGTACG cTTccACGccATGAAGAcrGcTTAAGG~ CCGGACCTATGTGCAACGCCAGGACATCTGCGA GCGTCAAAACCG~rACACGG GTAAGTCTACGACATACCGCGATCGGACTCATAmTO ______ _____ATGTATAA(OA AATI'CiACGTAA SEQ ID) 6_SF11 ATGATrGAAGTCAAACCATAAACCGAGATACGTA NO: 156 TGAGATCAGGCACCGCATCTCGCCGATCACG TGAGA~iAGAGAACA GTGGG ACGTCACCTCGGTGGATA ACCCAGTAT AGCATCGTCGTCATCAACCGACATCCAGACT GAAGGCCAAAAACAGTATCAGCTGAGAGAGGGA CACTGAGAGTACCGCGAGCAA GGAGTAC GCTTATCCGCATGCCGAAGAGIC'TCGAAG GCGCGGACCTnATGGTGCAACGCAACATTG AGCGGGTACTATAAAAAGCTCGGAGGGACAG CGAAGTCCACGACATACCGCCGTCGACCTCATAT _____GATGTATAAnAAAITTGACGTAA SEQ ID) 6_5G9 ATGATGAAGTCAACCAATAACCGAGATACGTA NO: 157 TGAGATCAGCACCGATGTCCCGATCACG TGGAAGCATGTATGTATGAAACCGA'fTGTGG ACGTTCACCTCGCGGATATTACCAGC~TA CAGCATCGCTCCTTCATAAGCGCATCAGAC TGAAGGCCAAAAACAGTATCAGCTGAGAGAG ACGCYrGAAGAGTACCGTGAGCAAGGGAGTAC GCTTATCCGCCATGCCGAAGAGCCTTCAAAG GGGCAGACGT1ATGGTGCAACCCAGATATTG AGCGGGTACTATAAAAGTCGGCTCAGACA CGGTCTACGACATACCGCCGGTCGGACCATA=~ GATGTAT A AGA AATTGACGTAA SEQ ID 6_615 ATGATGAAGTcAAAccAATAAAcG(GGAGATOGGTA NO: 158 TGAGATCAGGCACCGCAT = CGOCCGAATCAGCG TTGGA A GATGCAAGTATGAAACTGAMCTTG -151- WO 02136782 PCT/US01/46227 ACGTCACCTCGGCGGATATACCGGGCAAGCGAT CAGCATCGCCTTCATCAGCCGAACATCAGAGCT TGAAGGCCAAAAACAGTATCAGCGAGAGAGC ACACTGAAGGGTACCGCGAGCAAAAAGGAGTAC OGOGGAccTTGTGTGCAACGCCAGGACATT)GG AGOGGGTAcTATAAAAAGcTCGGrCAGCAOGG CGGGGTCTACGACATACCGCCGGTCGGACCATAT= ______ _____GATGTATAAG.AAATTG-ACGTAA SEQ ID 6_71)1 ATGATTGAAGTCAAACCAATAAACGCGGAAGATACGTA NO: 159 TGAGATCAGCACCGCACTCCGCCGATCACG 1TGAAGCATGTATGTATGAAACCGATGCTCAGGT GCc*TITCACCTCGGTGGATATrACCGGGGCAAGCTGATC GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGA CACTTGAAGGATACCGTGAGCAAAAACGGGAGTACG GGCAGACCTTATGGTCAACCCAACACTGGA GCGGGTACTATAAAAAGCTCGC'ITCAGCGAACGG GGGGTCTACGACATACCGCCGGTCGGACCTCATAmTG ATGTATAAr A AA TTCACGTAA SEQ ID 6_8113 ATGATGAATCAACCATAAACGGGAGATACGTA NO: 160 TGAGATcAG~cAccGcACTCCGGCGTCACG AcG = ACcTCGGTGGATATTACCGGCAGCGATC AGCATCGCCTCCTTCATCAAGCCGACATCCAGAC~ GAAGGccAAAAACAGTATCAGCTGAGAGGATGGA CGcTTGAAGGGTACCGCGAGCAAAGGGGAGTAC GCTTATCCGCCATGCCGAAGAGCYJCGGAAAAA GCGCGGACC=mATGGTGCAACGCCAGACATTCG AGCGGGTACTATAAAACGCTTCAGCGAACAOGG CGGGGTCTACGACATACCGCCGGTCGGACrCTATArm G-ATG-TATAAGiAAATTGACGTAA SEQ ID) 6_9G1 1 ATGA1TTGAAGTCAACCAATAGCGAGATACGTA NO: 161: TGAGATCAGCACCGCACTCCCGATCACG TGGAAGCATGCAAGTATGAAACCGACTGGGG ACGGTTCACCTCGGTGGATATTACCGCAGCGAT CAGCATCGCTCCTTCATCAAGCGACATCAGAGCT ITGAAGGCCAAAAACAGTATCAGCTGAGAAG ACGC~rGAAGGTACCGCGAAGCGAAGTA c~TAc~cTcGAG~~c~GAAA GGCGCGGACCTTATGGTGCAACGCCAGACATTG GAGCGGGTACTATAAAAAGCTCACGACAAG GCGAAGTCTACGACATACCGCCGGTCGGACCTCATA~r TGATGTATAAGyAAATrGACGTAA SEQ ID 6F1 ATGATTGAATCAAACCAATACCGAAATACGTA NO: 162 TGAATCAGCACCGCA CTCCGCGACACG TrGAAGCATGUTATGTATGAAACCGATGCTCGOGT ACGTCACCTCGGTOGATAGGAGTGTC TGCATCGGfTCCTATAAAGGCGTCAAGATT -152- WO 02136782 PCTUSOI/46227 - GAAGGCCAAAAAcAGTATCACTGA~AGATGGA CG=CGATGATACCGCGAGCAAAACGAGACG CTCATCCGCCATGCCGAAGAGCTTCGAAAA) ,crcGGAcc1TrATGGTGcAATGcCAGGACATCTGTGA GCGGCTACTATGAAAAGCTCGGGCAGCGCAGO GAAGTCTACGACATACCCCGTCGACCTCATAM I IATGTATAAGAAATrGACGTAA SEQ ID 7_1C4 ATCGAITQAAGTCAAACCAATAAACGCGGAACTATACGTA NO: 163 TGAGATCAGCACCCACTCCGCCGATCACG TGGAAGCATGTATGTATGAAACCGATrGCTCGCTGG ACGTCACCTCGGTcGGATTAccGGGCAAGCTGATC AGcATCGTrccTT~cATcAAGccGAGcAT~ccAGA= GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGA CACTrGAAGAGTACCGCGAGCAAAAAGCGGGCAGTACG CTTATCCGCCATGCCGAAGAGC'TCCGGAAAAAPG CGCc3GACC'TIATGGTGCAACGCCAGGACATCTGCGA GCCGOGTACTATAAAAAGCTCGGC=rAGCGAACAAG GGGGTCTACGATATACCCCGATCGGACCTCATAm=G _____ATGTATAAA A A Tr A CGTAA SEQ IID 7_2A10 ATGATGAAGTCAACCAATAACQC~GAATACGTA NO: 164 TGAGATCAGGACCGCATCCCGCGAATCACG TGGAAGCATGCAAGTATGAAACTGATCGG ACGT1TATCTCGGTGGATAT1?ACCGGGGCAAGCTGATC AGCATCGCTC = CATCAAGCCGAACATCCAGAC~ GAAGGCCAAAAACAGTATCAGCTGAGAGGGATCGA CCcTrGAAGGGTACCGCGAAAAAGAAGCAC GCGCGGACcTIATGGTGCAACGCCAGGACATTG AGCGGTACTATAAAAAGTCGGCCAGACA CGGGTCTACGATATACCCCGATCGACTCATA=l ______GATGTATAAGAATGACGTAA SEQ ID 7_2A1 1 ATGATTGAGTcAACCAATAAACGAGATACGTA NO: 165 TGAGATcAG~cAccGcArcTccG~ccGAATCAcC TrGAAGCATGTATGTATGAACCGATTGCTCGG ACGTTCACCTCGTGGATATTAGCAAGCTGATC AGCATCGT-CCTTCATCAAGCCGAACATTCAGAGC= GAAGGCCAAAAACAGTATCACTGAAGGATCGA CACTGAAGGGTACCGCAGCAAAGGGGAGTAC GCTTATCCGCCATGCCGAAGAGTC1TC1TAAG GGGCAGACCTATGGTGcAACGCCAGACATCTG AGGG~rTAAGTC~GTCGGAAG CGGGGTCTACGACATACCGCCGGTCGGACCTCATATmI ______GATGTATAAGAAATJ7GACGTAA SEQ ID 7_2D7 ATGATrG)AGTCAAACCAATAAACGCCGAAGATACGTA NO:166 TGATAGCATCArAGTATCGGCCGAMAGCGC ACGTCAD'cTCGGTGGATArrACCCGGGCAGTGATC AGCATCGCCTCCTTCATCAGCCGAACATCCAGAC"I GAAGGCCAAAAACAGTATCAGcTGAGAATGGA ______ _____cGcFFGAAGGGTACCGTGACAAAAAGGCTAAGTACG - 153 - WO 02/36782 PCTJIUSO1/46227 - CTCATCCOCATGCCGAGAA=~rGCTCACA-GG GTCTGGTACTATAAAAGCTCGGTCAGGCAGG GAACTTCTACGACATACCCCGTCGGACTCATAmTG ATGTATAAGyA AMACyArGTAA SEQ ID 7_5C7 ATGATGAAGTcAA~ccAATAAACGCGAGATACGTA NO: 167 TGAGATCAGGCACCGATTCTCGCCGATCAOGCGC TrGAAGCATGTATGTATGAACCGATTGCTCGGGC ACGTrCAccTCGGTGGAATTA COGTGATC AGCATCGCTCCATCATCCACACCAGACT GAAGGCCAAAAACATATCACTAAOOGATGCA CACTGAAGGATACCGTGAGCAAAGTAAACG CTCATCCGCCAT CGAAGA CCAAAAG CGCGGACCTTII'ATGGTGCAACGCCAGGACATTOGGA GCGGGTACTATAAACTGGTACG AAAAOGGC GGGGTCTACGATATACCGCCGGTCGGACTCATATG ______ATGTATAAGAA-ATTOACGTAA SEQ ID 7_9C9 ATGATGAAGTCAAACCAATAACCGGAATACGTA NO: 168 TGAAATCAGGCACCGCAITCTCCGGCCGAACAOGCCG TTGAACATTATGTATGAAACCGATCG*GT AGCATCGGTCCTTCATCAAGCCAACATCCAGAC~ GAAGGCCAAAAACAGTATCAGCTGAGAGGGAGGCGA CACTGAAGGTACCGGAAGCGAGTAC GCTCATCCGCCATGCCAAGCCTACAAAG GCGGGACCTATGGTGCAACCCAGACATTG AGCGGGTACTATAAAAAGcGTCAGGCAG CGAAGTCTACACATACCGCCGATCGACCTCATA = rATG.TATAAGAAATTTACTA SEQ ID 9_13F10 ATGATGAGTCAACCAATCCAGATCTA NO: 169 TGAGATCAGCACCGCAICTCCGCCGATCAGCCG TGGAAGCATGCAGTATGACCGATGTCAGGT GcGTT~cAccTrTGGGrcGTAACCGT AGCATCGCCTCCTACATCCGACACAGAC~ GAAGCCAAAAACAGTATCAGGAGAGATGGA CACTOAAGAGTACGCAGCGGG AGAC GCTCATCCGCCATGGAAGAfCTGGCAAAAG GGGCAGACcTTATGGTCAACCCAACATGC AGCGGTACTATAAAAAGCTCICAGACAG CGAAGTTACGACATACCCCGACTGCCCATA'L JGATGTTATAAGAAATrGACGTAA SEQ ID 9_13F1 ATGATGAAGTCAAACTAC~AAAC NO: 170 TGAGATCAGCACCGCATCTCCGGCCOATCAGCCG TG3GAAGCATGCAAGTATGAAACCGAGTCAGGT GGT-iTcAccTTGGGATATTAGCAcTGTc AGGATCGCCTCCTCATCAAGCGACArCAACT GAAGGCCAAAAcAGTATCAGAGAGOATGGCGA CATTGAAGAGTACCGCGAGCGAGAC GCTCATCCGCCAT OCGAAGAICGAAGG~ GGCAGACCTrCrrATGGTGCAACCCAGACATCTOG~g -154- WO 02/36782 PCTIUSOI/46227 AGCGTCAAAACCTMACACCG CCGAAGTCTACGACATACCGCCGACTGGGCCCCATAT= GATGTATAAGAA ATTrGACGTAA SEQ ID 9_15D5 ATCTATTGAAGTCAAACCAATAAACGCGAGATACGTA NO: 171 TGAGATcAGGcAcccAcTccGGATcA~cc TGGTAcGCATGAAGTAT GAA~ccACCrGAGG~T ACGTTCACCTCGTGATATTACCGAGTGAT AGCATCGCCTC=mCATCAAGCCGAACACAGACT GAAGGCCAAAACAGTATCAGCTGAGAAGGGA CACTTCTAGGGTACCGCGAGCAAAAAGGAGTACG CTfTATCC CCTCG AGT TC~ cGCAGAccTCIATGGTGCAACGCCAGGACATCTCGA GCGGGTACTATAAAAAGCTCGCTCAGCGAACAGG GAAGTCTACGACATACCGCCGGTCGGACCTCATATG _____ATGTATAAGA A ATTI'GACCTAA SEQ ID 9_15D9 ATGATGAAGTCAAACCAATAACGCGX LGATACGTA NO; 172 TGAGATCAGGCACCGCATACTCCGC:CGTCACG TTGAAGCATGTATGTATGAAACCGATGTCGGCcGT ACGTTCACCTCGGCGGATAGTYCACGGT CAGCATCGCCTCCT1TCATCAAGCTGAACACCAGACT TGAAGGCCAAAAACAGTATCAGGAGAGATGGCG ACACTIGAAGGGTACCGTGAGCAAAAAGGAGTAC GC1fTATCCGCCATCCGAAGCGCTCTTCGGAAG GCGCGGACTATGGTGCAACGCCAGGACATCTG AGCGGGTACTATAAAAAGCTCGGCTCAGCAG CGAAGTCTACGACACACCCCGGTCGACCCCATA = _____GATGTATAAGAAG'ITGACGTAA SEQ ID 9_15113 ATGATGAAGTCAAGCCAATAAACCGGAGATACGTA NO: 173 TGAGATCAGGCACCGCA CTCCCGTCACG rAAGCATGTATGTATGAACCGATATGCTCAGGGT GcG=TAcc~cTGTGGATfAcGGIA~cGATc AGCATCGCCTCCTCATCAAGCCGACATCCAGAC~ GAAGGCCAAAAACAGTATCAGCTGAGAGGATCTGA CACTGAAGAGTACCACGAGCAAAOAAC-rAC GcTcATccGccATGccGAAGAGTTcGGAAAAG GCGCGGACCTTII'ATGGTGCAACGCCAGGACATGG AGGGATTAA~TcGTTGGAAG CGAAGTCTACAACACAccGccGGrGGAcc~ATA = ______ ATGTATAAaAAATTGACGTAA SEQ ID 9_18112 ATGAIGAAGTCACAATAGCGAGATACGTA NO: 174 TGAGATCAGCACCGCATCCGCCGTCACG TGGAAGCATGTATGTATGAACCGA GCCGGGG ACG1TCACCTCGCGGATA lACCGGGGAACTAT CAGCATCGCCTCCITCATCAAGCCGAACATCCAGAGCT TGTAGGccAAAAAcAGTATCAGTGAGAGGATGGA CACTTGAGGATACCGTGAGCAAAAACKGGAGTACA CITATCCGCCATGCCGAGAGCCflCAAAGG GGcAGACCFT riATGGTGCAACGCCAGGACATTGGA GcGGGTAcTATAAAAACTCGGTCAGCGCAGO GAAGTCTACGACATACCGCCGGTCGGACCTCATATGj -155 - WO 02/36782 PCT/USO1/46227 ATGTATAAAAATGACGTAA SEQ ID 9_20F12 ATGATTGAAGTAAAACCAATAAACGCGGAAGATACGTA NO:175 TGAGATCAGGCACCGCGTTCTCCGGCCGAATCAGCCGC TGGAAGCATGTATGTATGAAACCGATTTGCTCGGGGGC ACGTTCACCTCGGTGGATATrACCGGGGCGAGCTGGTC AGCATCGCTTCCTTTCATCAAGCCGAACATCCAGAGCT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CACTGAAGGGTACCGTGAGCAAAAAGCGGGCAGTACG CTTATCCGCCATGCCGAAGAGCTCGGAAAAA-AGG CGCGGACCTTGTGGTGCAACGCCAGGACATCTGCGA GCGGGTACTATAAAAAGCTCGGCTTCAGCGAACAAGGC GGGGTCTACGACATACCGCCGGTCGGACCTCATATITG ATGTATAAGiAAATTGACGTAA ____ ___ SEQ ID 9_21C8 ATGATTGAAGTCAAACCAATAAACGCGGATACGTA NO:176 TGAGATCAGGCACCGCATTCTCCGGCCGAATCAGCCGC TGGAAGCATGTATGTATGAAACTGATTTGCTCGGGGGC ACGTTTCACCTCGGCGGATATTACCGGGGCAAGCTGAT CAGCATCGCCTCCTTTCATCAAGCCGAACATCCAGAGCT TGAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCG ACACTCGAAGGATACCGCGAGCAAAAAGCGGGCAGTA CGCTAATCCGCCATGCCGAAGAGCTTCTTCGGAAAAAG GGGGCAGACCTCTTATGGTGCAACGCCAGGACATCTGC GAGCGGGTACTATAAAAAGCTCGGCTTCAGCGATCAGG GCGAAGTCTACGACATACCGCCGGTCGGACCTCATATT TGATGTATAAGAAATTGACGTAA SEQ ID 9_22B 1 ATGATTGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:177 TGAGATAAGGCACCGCATCCTCCGGCCGAATCAGCCGC TGGAAGCATGCAAGTATGAAACCGATTTGCTCGGGGGC ACGTTTCACCTCGGTGGATATTACCGGGGCAAGCTGGTC AGCATCGCCTCCTTTCATCAAGCCGAACATCCAGAGCTT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CACTTGAAGGGTACCGTGAGCAAAAAGCGGGCAGTACG CTTATCCGCCATGCCGAAGAGCTTCTTCGGAAAAAGGG GGCAGACCTTATGGTGCAACGCCAGGACATCTGCGA GCGGGTACTATAAAAAGCTCGGCTTCAGCGAACAGGGC GAAGTCTACGACTTACCGCCGACCGGACCCCATATTG ATGTATAAGAAATiTGACGTAA SEQ ID 9_23A10 ATGATTGAAGTCAAACCAATAAACGCGGATACGTA NO:178 TGAGATCAGGCACCGCATTCTCCGGCCGAATCAGCCGC TGGAAGCATGCAAGTATGAAACCGATTTGCTCGGGGGC ACGcTTCACCTCGGTGGATATTACCGGGGCAAGCTGGT CAGCATTGCTTCCTTTCATCAAGCCGAACATCCAGAGCT TGAGGGCCAAAAACAGTATCAGCTGGAAGGGATGGCG ACACTTGAAGGGTACCGCGGGCAAAAAGCGGGCAGTAC GCTTATCCGCCATGCCGAAGAGCTTCTTCGGAAAAAGG GGGCAGACCTTTTATGGTGCAATGCCAGGACATCTGCG AGCGGGTACTATAAAAGCTCGGCTTCAGCGAACAAGG CGGGGTCTACGACATACCGCCGGTCGGACCTCATAT GATGTATAAAAAGACGT SEQID 9_24F6 ATGATTGAAGTCAAACCAATACCGGAGATACGTA - 156 - WO 02136782 PCT/USO1/46227 NO:179 TGAGATCAGG CACTCAGGCCGAATCAGCCGC TAGAAGCATGCAAGTATGAAACCGATTTGCTCAGGGGT GCGTTCACCTCGGTGGATATTACCGGGGCAAGCTGATC AGCATCGCCTCCTCATCAAGCCGACATCAGACT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CACTTGAAGGATACCGTGAGCAAAAAGCGGGCAGTACG CTrATCCGCCATGCCGAAGCGCTT CGGAAAAAAGG CCGGACCTTGTGGTGCAACGCCAGGACGTCTGCGA GCGGGTACTATAAAAAGCTCGGCTCAGCGAACAGGGC GAAGTCTACGACATACCGCCGACCGGACCCCATAT GATGTATAAGAAATTGACGTAA SEQ ID 9_4H10 ATGATTGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:180 TGAGATCAGGCACCGCATTCCGGCCGAATCAGCCGC TGGAAGCATGCAAGTATGAAACTGATTTGCTAGGGGGT ACGCTTCACCTCGGTGGATATTACCGGGGCAAGCTGAT CAGCATCGCCTCCTTTCATCAAGCCGAACATCCAGAGCT TGAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCG ACACTTGAAGGGTACCGTGAGCAAAAAGCGGGCAGTAC GCTTATCCGCCATGCCGAAGAGCTTCTTCGGAAAAAGG GCGCGGACCTTATATGGTGCAACGCCAGGACATCTGCG AGCGGGTACTATAAAAAGCTCGGCTTCAGCGAACAGGG CGAAGTCTACGACATACCGCCGGTCGGACCTCATAT GATGTATAAGAAATTGACATAA SEQ ID 9_4H8 ATGATTGAAGTCAAACCAATAAATGCGGAAGATACGTA NO:181 TGAGATCAGGCACCGCATTCTCCGGCCGAATCAGCCGC TTGAAGCATGTATGTATGAAACCGATTTGCTCGGAGGC ACGTTTCACCTAGGTGGATATTACCGGGGCAAGCTGAT CAGCATCGCTTCCTTAATCAAGCCGAACATCCAGAGCT TGAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCG ACACTTGAAGGGTACCGTGAGCAAAAAGCGGGCAGTAC GCTTATCCGCCATGCCGAAGAGCTCGGAAAAAGG GGGCAGACCTTATGGTGCAACGCCAGGACATCTGCG AGCGGGTACTATAAAAAGCTCGGCTTCAGCGAACAGGG CGAAGTCTACGACATACCGCCGGTCGGACCTCATAT GATGTATAAGAAATTGACATAA SEQ ID 9_811 ATGATTGAAGTCAAACCAATAACCGCGGATACGTA NO:182 TGAGATCAGGCACCGCATTCTCCGGCCGAATCAGCCGC NO:182TGGAAGCATGCAAGTATGAAACCGATTTGCTCGGGGGT ACGTTTCACCTCGGTGGATATTACCGGGGCAAGCTGATC AGCATCGCCTCCTTTCATCAAGCCGAACATCCAGAGCTT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CACTAGAAGGGTACCGCGAGCAAAAAGCGGGCAGTAC GCTCATCCG CCATG CCGAAGAGCTTCGGAAAAAGG GGGCAGACCTATGGTGCAACGCCAGAACATCTGCG AGCGGGTACTATAAAAAGCTCGGCTTCAGCGAACAGGG CGAAGTCTACGACATACCGCCGACCGGACCCCATATT GATGTATAAGAAATTGACGT SEQ ID 9_9H7 ATGATTGAAGTCAAACCAATAAACGCGGAAGATGCGTA NO:183 TGAGATCAGGCACCGCATTCTCCGGCCGAATCAGCCGC TGGAAGCAT TA157 - 157 - WO 02136782 PCTJUS01I46227 ACGiT1TACCTCGGTGGATATTACCGGGGCAAGCTGAC AGCATCGCCTCcT1CATCAAGCCGAACATCCAGAGCT GAAGTGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA GCTTATCCGCCATGCCGAAGAGC'1CTrCGGAAAAAAG GCGCGGACCT1TATGGTGCAACGCCAGGACATCrGCG AGCGGGTACTATAAAAAGCTC-GGCTCAGCGAACAGG CGAAGTCTACGACATACCGCCTGTCGGACCTCATATT= _____________GATGTATAAGAAATrGACGTAA SEQ ID 9C6 ATGATTGAAGTCAAACCAATAAAcGcGGA.AGATAcGTA NO: 184 TGAGATCAGGCACCGCATTCTCCGGCCGAATCAGCCGC TGGAAGCATGCAAGTATGAAACCGATITGCT'CGGGGGT ACGTCACCTCGGTGGATATrACCGGGGCAAGCTGAC TGCATCGCCTCClTCATCAAGCCGAACATCAGACT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CGC-1TGAAGGGTACCGCGAGCAAAAAGCGGGAAGTAC GCTTATCCGCCATGCCGAAGAGCfl'CTTCGGAAAAAGG GGGCAGACCTITATGGTGiCAATGCCAGGACATCTGTG AGAGGCTACTATGAAAAGCTCGGCTCAGCGAACAAG CGGGGTCTACGATATACCGCCGATCGGACCTCATAY= _______GATGTATAAGAAMTrGGGGTAA SEQ ID 91111 ATGATrGAAGTcAAAccAATAAAcGCGGAAGATAcGTA NO: 185 TGAGATCAGGCACCGCATTCTCC3GCCGiAATCAGCCGT TGGAAGCATGCAAGTATGAAACCGATTCGGT ACGTCACCTCGGCGGATATACCGGGGCAACTGAT CAGCATCGCTCCTCATAAAGCCGAACATCAGAGCT TGAGGGCGAGAACAGTATCAGGAGAATG ACGC1TGAAGGATACCGTGACAAAAAGCGAC CGCTCATCCGCCATGCCGAAGAG~CTT TCAAAAG GGGGCAGACCT-=ATGGTGCAATGCCAGGACATCTGT GAGCGGGTACTATAAAAAGCTCGGCTTCAGCGAACAG GCGAAGTCTACGACATACCGCGATCGGACCTCATA M TGATGTATAAGAAATTGACGTAA SEQ ID 0_4B 10 ATGATAGAAGTGAAACCGATAACGCAGAGATACCTA NO: 186 TGAACTAAGGCATAAAATACTCAGACCAAACCAGCOA TAGAAGCGTGTATGTATGAAGCGATACICGGTG CAT TCACTrAGGCGGCYI=ACAGGGGCAAACTGTATIT CCATAGC'CAYJ'CCACCAGGCCGAGCACTCAGACCTCG AAGGCCAGAAACAGTACCAGCTCCGAGGTATGGCTACC TrGGAAGGTTATCGTGATCAGAAAGCGGGATCGACTCT AATrAAAcAcGcTGAAGAAATXICGTAAGAGGGGGG cGGAcATGcn-rGGTGCAATGCGCGGACAACCCCTCA GGCTACTACAAAAAGITAGGCTrCAGCGAGCAGGGAGA GATAT1GATACGCCGCCAGTAGGACCTCACATCCTGAT _____________GTATAAAAGGCTCACATAA SEQ ID 0_5311 ATGATAGAGGTGAAACCGATTAACOCAGAGGATACCTA NO: 187 TGAACTAAGGCATAAAATACTC-AGACCAAACCAGCCGA TAGAAGCGTGTATGTATGAAAGCGATAC=GGTG CAT TCACTAGGCGG ACGGKGAAACTGAI CCAACTACCCAGCAATAATC -158- WO 02136782 PCTUTSOI/46227 AAGGCCAGAAACAGTACCAGCTCCGAGGTATGCTACC 'ITGAAGGrATGTGACAGAAGCGGA cGATGT ATAAcAcGcTGAAcAAc1rTCTCGTAAAGG CGGACATGCTTGGTGCAATGCGCGGACATCCCTA GGCTACTACAAAAAGTAGGCTCAGCGAGCAGGAGA GGTATLTrGAAACGCCGCCAGTAGGACCT)CACATCCTGA _____________TCTATAAAAAGATCACA SEQ ID 0_5B3 ATGCTAGAGGTGAAACCGATJ2AAcGCAGAGGATACCTA NO: 188 TGAACTAAGGCATAGAATACTCAGACCAAACCACCGA TAGAAGGTGTATGTATGAACCGAACrCGTGGTG cATT~cAc-TAGGcGGCT fACAGGGGCAAACTGATJT CCATAGTCATCCACCAGCCGAGCACTCAGAACTCC AAGGCCAGAAACAGTACCAGCTCCGAGGTATGGCTACC TTGCTAAGGTTATCGTGATCACTAAAGCGGGATCGAGTCT AATrAAACACGcTGAACAACTrC=CGTAAGAG~GG cGGAcTTGcTITGGTGcAATGcGCGCTACATCCGCCTCAG GCTACTACAAAAAGTTAGGCTCAGCGAGCAGGGAGAG GTAT1GATACGCCGCCAGTAGGACCTCACATCCTGATG TATAAAAGGATCACA SEQ ID 0_5B4 ATGCTAGAGGTGAAACTGATAACGCAGAGATACCTA NO: 189 TGAACTAAGGCATAGAATACTCAGACCA.ACCACCGT TAGAAGCGTGTATGTATGACCGATTACrCGGTG cAT TCAC=AGGCGGCT flACAQGGGGCAAACTGATIT CCATAGCl7CATrCCACCAGGCCGAGCACTrCAGACCCG AAGGCCAGAAACAGTACCAGCT)CCGAGGTATGGCTACC 1.TGGAAGGTI=CGTGATCACTAAAGCGGGATCGAGTCT AATTAAAcAcGcGAAGATcTCGTAGAGGG CGAACTrGCTrGGTGTAATGCGCGGACATCCCCCAG GCTACTACAAAAAGY]AGG~c1AGCGAGCAGGAGAG GTATITATACGCCGCCAGTAGGACCTCACACTATG TATAAAAGGATCACA SEQ ID 0_538 ATGATAGAGGrTGAAACCGATTAACGCAGAGATACCTA NO: 190 TGAAcTAAGGcATAAAATAcTcAGACCACCAGCCGA TAGAAGCGTGTATGTATGAAAGCGAm~ACLTrCGTGGTG CAflTACTTAGGCGOCTACAGCACTGA= CCATAGCTTCAYJCCACCAGCCGAOCACTCAGACCTCC AAGGccAGAA~cAGTAccAGTCCGAGGTATGCACC TTGGAAGGTTATCGTGATCAGAAAGCGGGATCGAGTCT AATrAGACACGCTGAAcAAATC1CGTAAGAGG CGCGACTTGtflGGTGCAATGCGCGGACACGCTCAG GCTACTACAAAAAGTAGGCTTCAGCGACAGGGAGAG ATA-TTGATACGCCGCCAGTAGGACCTCACATCCTGATG TATAAAAGGCTCACA SEQ ID 0_5C4 ATGATAGrAGGTGAAACCGATrAACGCAGAG3ATACCTA NO: 191 TGAACTAAGGCATAAATACTCAGACCACCAGCCGT TAGAAGCGTGTATGTATGAAACCGATACCGGGT CAT CACTAGGCGGCT11ACAGC1G AAACTGA=I CCATAGTCATCCACCAGCCGAGACAC AAGGCCAGAAACAGTACCAGCTCCGAGGTATGGCTACC TIGGAAGGTrATCGTGAGCAGAAAGCgggATCGAGTAT - 159 - WO 02136782 PCTJUSOI/46227 AAA~DCACGCTGAAGAAATTCTTAAGACTG CGGACCTFGGTGCAATGCGCGACGTCCGCCTCAG GCTACTACAAAAGTAGc1CAGGACAGGAGAG AT~nAAGCCATGACCCATCG TATAAAAGGATGACA SEQ ID 0_51)11 ATGATAGAGGTGAAACCATAACCAGAGATACTA NO: 192 TGAACTAAGGCATAGAATACTCAGACCAACCAGCCGA TAGAAGCGTGTATGTATGAAAGCGAACCGTGTG CCATAGCrCArCCACCAGCCGACACTCAGACC AAGGCCAGAAACAGTACCAGCTCCGAGTATGTACC TrGGAAGGTTATCGTGAGCAGAAAGCGGGATCGACTT AATAGACACGGAACAACTCGTGAGGG cGGAcTT = GGTGCAATGCGGACATCCGCCTCAG GCTACTACAAAGGTAGGCTTCAGCGACAGAGAG GTA=rGATACGCCGCCAGTAGGACCTCACATCCTGATG TATAAAAGGCTCACA SEQ ID 0_51)3 ATGCTAGAGGTGAccGATAAc~cAGAGATAccTA NO: 193 TGAACTAAGGcATAGATAcTcAGACCACCAGCCGA TAGAAGCGTGTATGTATGAGCGAACGTGGTG cATTCAcTAGGCGGCTAACACAACGAT CCATAGCTCATTCACCAGCGACACTCAGAC AAGCCAGAA-ACAGTACCAGCTCCGAGTAGGCACC TTGGAAGGTTATCGTGAGCAGAAAGCGGGATCGAGTCT ATTAACACGCTGAAGAAATCrCGTAAGAGG CGGACYrCTTGGTGTAATCGACATCCCCCAG GCTAcTACAAAAGTAGGTCACGAOCAGGAGAG ATATGAACGCCGCCAGTAGGACCTCACATCCTGAT _____ (TATAAAAGGATCACATAA SEQ ID 0_51)7 ATGATAGAAGTGAACCGArAACCAGAGAGACCTA NO: 194 TGAACTAAGGCATAGATACTCAGACCAACCAGCCGA TAGAACGTGTATGTATGAAACCGA=TACGTGGTG CATTCACTAGGCGG ACAGCACTGA~ CCATAGTCATTCCACCAGCCGACACTCAGACI GAAGGCCAGAAACAGTACCAGCTCCGAGTATGTAC CTTGGAAGGflATcTGATcAGAGGGATCGAGTC TAATTAGAcAcGcGAAc~tcGAA~GAAGG GCGAATATG = GGTGTAATGCG 3CACCCCT AGGCTACTACAAAAGTTACCAGCGACAGAG AGATATTGATAcGccGccAGTAGGACCCACACT ATGTTATAAAAGGATCACA SEQ ID 0_6fl4 ATGCTAGAGGTGAAACGATACCAGAGATACCTA NO: 195 TGACTAGGATAGATACTCAGACCAAACCAGCCGA TAGAAGCGTGTATGTATGGCAACATACGTGT =CCrGC~~7CGGCACGTf ccT~~c~c~cG~c~~~~A~cc AAGCCAGAAACAGTACCAGCTCCGAGTATGCACC TrGGAAGGTTGTGATcAGAAAGGACGAGC CGGAC--TrT TGGTGCAATGCGCGGACAWCCOCCTCAG - 160 - WO 02/36782 PCTIUSOI/46227 CYTCAAAATAGCTAGGATGAA GTATrGATACGCCGCCAGTAGiGACCrCACATCCTGATG TATAAAAGGATCACA SEQ ID 0_6D10 ATGCT.AGAGGTGAAACCGNITAACGCAGAGGATACCTA NO: 196 TGAACTAAGGjcATAAAATACTCAGACCAAACCACCGT TAGAAGTGTGTATGTATGAACCGAITACrGTOGTG cATTcAcTrAGGcGGcT~~'AcAGGGGCACGAM CCATAGCTrcATrccAccAGGCCGAGCACTCAGACCTCC AAGCCAGAAACAGTACCAGCTCCGAGGTAGGTACC TIGjGAAGGTTATCGTGATCAGAAAGCGGGATCGAGTCT CGGACATGGTTGTGCAATGCGACACGCTCA GGCTACTACAAAAAG'fTAGGCTCAGCGACAGAGA GGTA-f-TGAAACGCCGCCAGTAGGACCTCACATCGA TGTATAAAAGGCTCACA SEQ DD 0_6D11 ATGATrGAAGTcAAACCAATAAACGCGGAAGATACGTA NO: 197 TGAGATcAGGcAccOcATcTCCGCGAATCACG TGGAAGCATGCAAGTATGAAACCGAT TGGC~ ACGTCACCTCGTGGATATTACCGA~GT CAGCATCGCTCCT1TATcAAGCCGAACATCCAGAC TGAAGGCCAAAAACAGTATCAGCTGAGAAGG AcGcTrGAAGGGTAccGTGAGcAAAAAcGG~AGTAc GC-fTATCCGCCATGCCGAAGCTTCGAAAAGG GGGCAGACCITATGGTGCAACGCCAGACATCTG AGCGGGTACTATAAAAAGCTCGGCCAGCAG CGAAGTCTACGACATACCGCCGGTCGACCTCATAT= GATGTATAAGIAA AXTGACGTAA SEQ ID 0_6F2 ATGATAGAGGTGAACCGATTAACCAGAGATAC~rA NO: 198 TGAJACTAAGGCATAGAATACTCAGACCAAACCACCGA TAGAAGCGGTATGTATGAGCGAACCGIGTG CAYTCACTAGGCGGCTAflACAGACGAI CCATAG cTAl ccAccAGGccGAGcAcTCAGAACTCc AAGGCCAGAAACAGTACCAGCTCCGAGGTATGTACC TrGGAAGGT1TCGTGAGCAGAAAGCGTGGATCGACT'CT AfTAGACACCTGAACATCTTCGTGAGG CGGAcATGcTTrGGTGcAATGcGcGGAcATcc~~CcA GGCTACTACAAAAGTrAGGCTCAGACAGGAGA GATATCATACGCCGCCAGTAGGACCTCACATCCTIGAT _____GTATAAAAGGATCACA SE-QIDD 0_6119 ATGATAGAGGTGAAACCGATAACCAGAGGATACCTA NO: 199 TGAACTAAGGCATAAAATACTCAGACCAAACCAGCCGA TAGAAGCGTGTATGTATGAACCGATTACflC-GTTG cATrrcAcTTAGGcGGcT IACGGGGGCAAACTGAT1? CCATAGCrCATCCACCAGCCGACA~CAGACCTCG AAGGCCAGAACAGTACCAGCTCCGATATGCACC TTGGAAGCflTATCGTGAGCAGAAAGCGGGATCGACTCT AfTAGACACGCTGAAGAATTCTCGTAGGG CGAACTGcTGG~ATGCAGACATCCGCTCAG GCTACTACAAAAGTAGGTTCAGCACAGAGAG ______GTATTrGACACGCCGCCAGTAGGACCTCACATCCTGATG, _ 161 - WO 02136782 PCTITJSOI46227 TATAAAAGCTCACA SEQ ID 10_4C10 ATGATAGAGGTGAACCGATACGCAGAGATACCTA NO:200 TGAACTAAGGCATAAAATACTCAGACCACCAGCCGT TAGAAGTGTGTATGTATGAAACCGAMACrrCGTGT CATTCACTAG GGTACAGGACTAr CCATAGCrA1CCACCAGGCCGAGCACTCAGAACTCC AAGCCAGAACAGTACCAGCTCCGACTATGTACC TTGGAAGG1TATCGTGATCAGAAAGCGGCTATCGACTC AATA~kCACGCTGAACAAATCTCGTAAGAGGG CGGANTGTGGGCAAGC~CA~GCCTCA GGCTACTACAAAAGTAGGCTCAGCGACAGGAGA GATATGATACGCCGCCAGTAGGACTCACATCCTGAT ______GTATAAAAGC*CTCACATAA SEQ ID 10_4D5 IATGATAGAGGTGAAACCGATTAACCAGAGATACCTA NO:201 TGATAGAAATCTAACACACG TAAGGGAGAGAcGTTcrGGT cATrcAcflAGOcGGCTTAcAGcAAAcTA CCATAGTCATCCACCAGGCGAGACTCAGACCTCC AAGGCCAGAACAGTACCAGTCCGAGTATGTACC TTGGAACTGTATCGTGAGGAGAAGCGCTGATCGATC ATAGACACGCTGAACATCTCGTAACGAGGG CGGAC~rG=TGGTGCAATGGGACATCCGCTCAG GCTACTACAAAAAGTTAGGCTCAGCACAGAGAG GTATrGATACGCCGCCAGTAGACCACATCTATG ________TATAAAAGGATCACATAA SEQ ID 10_4F2 ATGCTAGAGGTGAAACCGATrAACGCAGAGGATACCTA NO:202 TGAACTAAGGCATAGAATACTCAGACCACCAGCCGA TAGAAGCGTGTATGGAAAGCGAATCGTGGTG CAT TCrACTTAGGCGGCITTTACAGG CACTGATF CCATAGCTA~CCACCAGGCGAGACTCAGACTCC AAGGCCAGACAGTACCAGCTCCGAGTATGCACC TTGGAAGGTATCGTGAGCAGAAAGCGGGATCGAGTT AAMTAGACACGCTGAAGAAATCTCGTGAGGG CGGACATG =GGTGTAATGCGACATCCCCCA GGCTACTAcAAAAAGTAGGc~rcAGAcA~GAGA GATA~GAAACGCCGCCAGTAGGACCCACATCCTGA ______TGTATA AAAG7GCTCACATAA SEQ ID 10_4F9 ATGATAGAGGTGAACCGATAACCAGAGATACCTA NO:203 TGAACTAAGGCATAGAATACTCAGACCAACCAGCCGA TAGAAGTGTGTATGTATGCCGAACGTGT CATrCACAGGCGGC TACACAACGAT CCTG'TATCCAGCAC~CGATC AAGGCAGAACAGTACAGCTCCGATATGTAC TrGGAAGGTTCGTGAGCAGAAACGGGATCGAGr AATr AGACACGTGACAA GAGAGGG CGGACTGCTGGTAATCGACATCCCCTAG GCTACTACAAAAGTAGGTTCACGACAGAGAG ATA1GATACGCCGCCAGTAGACCTCACATCCTGATG TATAAAAGGCTCACATAA SE ID 110_4G5 IATGATAAGTGAAACCGTTACCAGAGATACCTA -162- WO 02136782 PCTUSOI/46227 NO:204 TGAACTAAGGCATAGAATACTCAGACCACCAGCGOA TAGAAGCGTGTATGT-1TTAAAGCGATJITACTCGTGGTG CATI-CACTAGGCGGCTAAGCAAACTCAT CCATAGCTTCATTCCACCAGGCCGAOCACTCAGACCTCC AAGGCCAGAAACAGTACCAGCTCCGAGGTATGGCTACC TTGGAAGGTrACCGCGATC-AGAAAGCGGGATCGAGTT ATrAGAcAcGcTGAAAcTCGTAAAGG CGGACTGCnGGTrTTAATGCGGACATCCCCCAG GCTACTACAAAAAGTAGGCTCAGcGAAGAGAG ATATITGIATACGCCGCCAGTAGACCTCACATCTATG _____TATAAAAGGCTC-ACATAA SEQ ID 10_4H4 ATGTAGAGGTAACCCGATAACGCAGAGATACCTA NO:205 TGAACTAAGGCATAAAATACTCAGACCAACCACCGT TAGAAGTGTTATGTATGAAACCATATCGTGGTG CATITCACTAGGCGGC1ACAGAACTGA CCTGTrArCCAGCGGATAACC AAGGCCAGAAkACAGTACCAGCTCCGAGGTATGTACC TrGGAAGGTTATCciTGAGCAGAAAGcGGGATCGAGTCT AATrAAACACGTGAAGAATTCTCGTAAGAGG CGGACrGCT-TGGTGCAATGCGGACATCCCTCAG GCTACTAcAAAAAGn AGG CAGCGAGCAGGGAGAG GTATrrGATACCG.CCGCCAGiTAGGACCTCACATCTGATG TATAAAAGGATCACATAA SEQ ID 11_3A1 1 ATGATAGAAGTGAAACCGATACOGAGAGGATACCTA NO:206 TGAACTGAGGCATAAAATACTCAGACCACCAOGCGA TAGAGTGGTAGTAGAAGGG~AIGTGGTG cATrT-ACTGG=COCTTCAACGA~ CCATAGCGTC.ATCCACCAGGCGC'AGACCCAGACT CAAGGcCAGAAACAGTACCAGCTCCGAGTATGCAC CTGGGGTATCGTGTCAGAAAGGGATCGAGTC TTAAAcACGCTGAACAAACGTAAGAcGOG GCGGA = GC = GGTGCAATGGCGGACATCCCCTA GGTAITJGAAACGCCGCCAGTAGGACCTCACATCCTGA TGTATAAAAGGCTCACATAA AGTCA SEQ ID 11_3B1 ATGCTAGAGGTGAAACCGATTAACGAGOTC NO:207 TGAACTGAGGCATAGAATACTCAGACCAAACCAOGCGA TAGAAGCGTGTATGTTGAACCGA =ACCGTGGTG CATACTTAGGCGG CAGGACAATGAM CCATAGCTCATrCCACCAGGCCGAGATCAGACC AAGGCCAGAACAGTACCAACTCCGAGGTATGGOTACC TrGGAAGGT1TCGTGAGCAGAAAGCGGGATCGACTCT ATAGACACGCTGAAGAATrCCGTGA~GGOG cGGAcITGcTTTGGTGCAATGCGcGGAcATCCOGCCTCAG GCTACTACAAAAGGTAGGCTCAGGACAAGAG ATATIGACACGCCGCCAGTAG3CCT)ACATCCTGATG TATAAAAGGCTCACATAA SEQ ID 11_3B5 ATGATAGAGTGAAACCGAACGAGAGATACCTA NO:208 TGAACTAAGGCATAGAATACTCAGACCAAACCAGCCGA ____TAGAAGCTTATG ,T-TGAAAGCGAATCGTGGTG - 163 - WO 02/36782 PCT/US01/46227 CATTTCACTTAGGCGGCTATTACAGGGGCAAACTGAT CCATAGCGTCATTCCACCAGGCCGAGCACTCGGAACTC CAAGGCCAGAAACAGTACCAGCTCCGAGGTATGGCTAC CTTGGAAGGTTATCGTGATCAGAAAGCGGGATCGAGTC TAATTAGACACGCGAACAAATTCTCGTAAGAGGGG GCGGACATGCTTGGTGCAATGCGCGGACATCCGCCTC AGGCTACTACAAAAAGTTAGGCTCAGCGAGCAGGGAG AGGTATTTGATACGCCGCCAGTAGGACCTCACATCCTG ATGTATAAAAGGATCACATAA SEQ ID 11_3C12 ATGATAGAGGTGAAACCGATTAACGCAGAGGATACCTA NO:209 TGAACTAAGGCATAGAATACTCAGACCAAACCAGCCGT TAGAAGTGTGTATGTATGAAACCGATTTACTTCGTGGTG CATTTCACTTGGGCGGCTTACGGGGGCAAACTGATT CCATAGCGTCATTCCACCAGGCCGAGCACCCAGACCTC CAAGGCCAGAAACAGTACCAGCTCCGAGGTATGGCTAC CTTGGAAGGTTATCGTGATCAGAAAGCGGGATCGAGTC TAATrAGACACGCTGAACAACTTCTTCGTAAGAGGGGG GCGGACTTGCTTTGGTGCAATGCGCGGACATCCGCCTCA GGCTACTACAAAAAGTTAGGCTTCAGCGAGCAGGGAGA GATATrCGAAACGCCGCCAGTAGGACCTCACATCCTGA TGTATAAAAGGATCACATAA SEQ ID 11_3C3 ATGATAGAAGTGAAACCGATTAACGCAGAGGATACCTA NO:210 TGAACTAAGGCATAAAATACTCAGACCAAACCAGCCGA TAGAAGCGTGTATGTATGAAAGCGATTTACTTCGTGGTG CACTTCACTTAGGCGGCTATTACAGGGGCAAACTGAT CCATAGCGTCATTCCACCAGGCCGAGCACTCAGAACTC CAAGGCCAGAAACAGTACCAGCTCCGAGGTATGGCTAC CTrGGAAGGTTATCGTGAGCAGAAAGCGGGATCGAGTC TAATTAAACACGCTGAAGAAATCTCGTAAGAGGGG GCGGACTTGCTTGGTGCAATGCGCGGACATCCGCCTCA GGCTACTACAAAAAGTTAGGCTTCAGCGAGCAGGGAGA GGTATTTGACACGCCGCCAGTAGGACCTCACATCCTGAT GTATAAAAGGATCACATAA SEQ ID 11_3C6 ATGCTAGAGGTGAAACCGATTAACGCAGAGGATACCTA NO:211 TGAACTAAGGCATAAAATACTCAGACCAAACCAGCCGA TAGAAGCGTGTATGTTTGAAAGCGATTTACTTCGTGGTG CATTCACTTAGGCGGCTTIACGGGGGCAAACTGATTT CCATAGCTTCATTCCACCAGGCCGAGCACTCAGACCTCG AAGGCCAGAAACAGTACCAGCTCCGAGGTATGGCTACC TTGGAAGGTTATCGTGAGCAGAAAGCGGGATCGACTCT AATTAGACACGCTGAAGAAATT CGTAAGAGGGGGG CGGACITGCTTTGGTGCAATGCGCGGACATCCGCCTCAG GCTACTACAAAAAGTTAGGCTTCAGCGAGCAGGGAGAG ATATITGATACGCCGCCAGTAGGACCTCACATCCTGATG TATAAAAGGATCACATAA SEQ ID 11_3D6 ATGATAGAGGTGAAACCGATTAACGCAGAGGATACCTA NO:212 TGAACTAAGGCATAGAATACTCAGACCAAACCAGCCGA TAGAAGTGTGTATGTATGAAACCGATTTACTTCGTGGTG CATTTCACTTAGGCGGCTTACAGGGGCAAACTGAT ,CCATAGCTTCATTCCACCAGGCCGAACTCAGACC -164- WO 02/36782 PCT/USO1/46227 AAGGCCAGAAACAGTACCAGCTCCGAGGTATGGCTACC TTGGAAGGTrATCGTGAGCAGAAAGCGGGATCGAGTCT AATTAAACACGCTGAACAAATCTTCGTAAGAGGGGGG CGGACTrGCTTTGGTGCAATGCGCGGACATCCGCCTCAG GCTACTACAAAAAGTTAGGCTCAGCGAGCAGGGAGAG GTATITGATACGCCGCCAGTAGGACCTCACATCCTGATG TATAAAAGGCTCACATAA SEQ ID 1_1G12 ATGCTAGAGGTGAAACCGATTAACGCAGAGGATACCTA NO:213 TGAACTAAGGCATAGAATACTCAGACCAAACCAGCCGA TAGAAGTGTGTATGTATGAAACCGATTTACTTCGTGGTG CATTCACTTAGGCGGCTTACGGGGGCAAACTGATIT CCATAGCTTCATTCCACCAGGCCGAGCACTCAGAACTCC AAGGCCAGAAACAGTACCAGCTCCGAGGTATGGCTACC TTGGAAGGTrATCGTGATCAGAAAGCGGGATCGAGTCT AATTAAACACGCTGAAGAAATrCTTCGTAAGAGGGGGG CGGACTrGCfTGGTGTAATGCGCGGACATCCGCTCAG GCTACTACAAAAAGTTAGGCTTCAGCGAGCAGGGAGAG GTATrTGAAACGCCGCCAGTAGGACCTCACATCCTGAT GTATAAAAGGCTCACATAA SEQ ID 1_11 ATGATAGAAGTGAAACCTATTAACGCAGAGGAGACTTA NO:214 CGAACTTCGACACAAGATCCTGCGCCCTAATCAGCCGA TAGAGGCATGCATGTATGAAAGCGATCTGCTGCGGGGC TCGTTCCATTGGGCGGGTTCTATCGTGGCCAATTGATC TCGATTGCGAGTTTCCACAAAGCTGAACACTCAGAACT GCAAGGGCAAAAGCAGTATCAATTACGAGGGATGGCG ACCCTCGAAGGATTCCGTGAGCAGAAGGCTGGCTCTTC GCTATTAGGCACGCCGAGGAGATACTACGGAATAAAG GGGCAGATCTGCTTTGGTGTAATGCACGCACGACAGCC TCCGGTTACTATAAAAGGCTTGGTAGTGAGCACOGC T=GGTTCGAAACCCCGCCGGTTGGGCCGCACATTCTT ATGTACAAAAGAATCACT SEQ ID 1_1H2 ATGATAGAAGTGAAACCTATTAACGCAGAGGATACTTA NO:215 CGAACTTCGACACAGGATCCTGCGCCCTAATCAGCCGTT AGAGGCATGCATGTATGAAAGCGATCTOCTGCGGGCT CGTTCCATTGGGCGGTCTATCGTGATGATCT CGATTGCGAGTTTCCACCAAGCTGAACACTCAGAACTG GAAGGGCAAAAGCAGTATCAATTACGAGGGATGGCGA CCCTCGAAGGATTCCGTGAGCAGAAGGCTGGCTCTTCG CTTATTAGGCACGCCGAGGAGATACTACGGAAAAGAGG GGCAGATCTGCTTTGGTGTAATGCACGCACGACAGCCG CCGGTTACTATAAAAAGCTGGTTAGTGAGCAGGGC GAAATTCGACACCCCGCCGGTTGGGCCGCACATTC ATGTACAAAAGAATCACT SEQ ID 1_1H5 ATGATAGAAGTGAAACCTATTAACGCAGAGGATACTTA NO:216 CGAAATTCGACACAGGATCCTGCGCCCTAATCAGCCGT TAGAGGCATGCATGTATGAAAGCGATCTGCTGCGGGGC TCGTTCCATTTGGGCGGGTTCTATCGTGGCAAATTGATC TCGATTGCGAGTTTCCACCAAGCTGAACACTCAGACCTG GAAGGGCAAAAGCAGTATCAATTACGAGGATGGCGA CCCTCGAAGGATACCGTGATCAGAAG TQGTCMCG - 165 - WO 02136782 PCT/USO1/46227 CTTATTAGGCACGCCGAGCAGATACTACGGAAAAGAGG GGCAGATCTGCTTTGGTGCAATGCACGCACGACAGCCG CCGGTTACTATAAAAGGCTTGGTTAGTGAGCAGGGC GAAGTTCGACACCCCGCCGGTGGGCCGCACATCT ATGTACAAAAAACTCACT SEQ ID 1_2A12 ATGATAGAAGTGAAACCTATTAACGCAGAGGATACTTA NO:217 CGAACTTCGACACAGGATCCTGCGCCCTAATCAGCCGA TAGAGGCATGCATGTATGAAAGCGATCTOCTGCGGGGC TCGTTCCATTTGGGCGGGTTCTATCGTGGCAAATTGATC TCGATTGCGAGTTTCCACCAAGCTGAACAGTCAGAACT GGAAGGGCAAAAGCAGTATCAATTACGAGGGATGGCG ACCCTCGAAGGATACCGTGATCAGAAGGCTGGCTCTAC GCTTATrAAGCACGCCGAGGAGATACTACGGAAAAAAG GGGCAGATCTGCTTTGGTGCAATGCACGCACGTCAGCC GCCGGTTACTATAAAAGGCTGGTTAGTGAGCAGGG CGAAATTCGACACCCCGCCGGTTGGGCCGCACATTCT TATGTACAAAAGACTCACT SEQ ID 1_2B6 ATGATAGAAGTGAAACCTATfTAACGCAGAGGAGACTTA NO:218 CGAACTTCGACACAAGATCCTGCGCCCTAATCAGCCGTT AGAGGCATGCATGTATGAAACCGATCTGCTGCGGGGCT CGTrCCATTTGGGCGGGTTCTATCGTGGCAAATTGATCT CGATTGCGAGTTCCACCAAGCTGAACACTCAGAACTG GAAGGGCAAAAGCAGTATCAATTACGAGGGATGGCGA CCCTCGAAGGATTCCGTGATCAGAAGGCTGGCTCTTCOC TTATTAAGCACGCCGAGGAGATACTACGGAAAAGAGGG GCAGATCTGCTTGGTGCAATGCACGCACGTCAGCCTCC GGTTACTATAAAAAGCTTGGTTAGTGAGCAGGGCGA AATTCGAAACCCGOCCGGTTGGGCCGCACATTCTTAT GTACAAAAGACTCACT SEQ ID 1_2C4 ATGCTAGAAGTGAAACCTATTAACGCAGAGGAGACTTA NO:219 CGAACTTCGACACAAGATCCTGCGCCCTAATCAGCCGA TAGAGGCATGCATGTATGAAACCGATCTGCTGCTGGGC TCGTTCCATTGGGCGGGTTCTATCGTGGCCAATTGATC TCGATTGCGAGTTTCCACCAAGCTGAACACTCAGACCTG CAAGGGCAAAAGCAGTATCAATTACGAGGGATGGCGAC CCTCGAAGGATACCGTGAGCAGAAGGCTGGCTCTACGC TTATTAAGCACGCCGAGGAGCTACTACGGAAAAAAGGG GCAGATCTGCTTTGGTGCAATGCACGCACGACAGCCGC CGGTTACTATAAAAAGCTTGGT AGTGAGCAGGG AAGTCGACACCCCGCCGGTTGGGCCGCACATTCTTA TGTACAAAAAAATCACT SEQ ID 1_2D2 ATGATAGAAGTGAAACCTATTAACCAGAGGATACTTA NO:220 CGAACTTCGACACAAGATCCTGCGCCCTAATCAGCCGTT AGAGGCATGCATGTATGAAAGCGATCTGCTGCGGAGCG CATrCCATTTGGGCGGGTTCTATCGTGGCAAATTGATCT CGATTGCGAGTTTCCACAAAGCTGAACACTCAGAACTG CAAGGGCAAAAGCAGTATCAATTACGAGGGATGGCGAC CCTCGAAGGATACCGTGATCAGAAGGCTGGCTCTTCGC TTATTAGGCACGCCGAGGAGATACTACGGAAAAGAGGG I GCAGATATGCTTTGGTGCAATGCACACGTCACC -166- WO 02136782 PCTIUSO1/46227 CGGTTACAAAGC GTITGGGA G AAGTTTTCGACACCCCGCCGG1TGGGCCGCACATITCTrA ______TGTACAAAAGAATCAC1TAA SEQ ID 1_2D4 ATG3ATAGAAGTGAAACCTATJ7AACGCAGAGGATACflA NO:221 CGAAG'1TCGACACAGGATCCTGCGCCCTAATCAGCCGA TAGAGGCATGCATGTATGAAAGCGATCTGCTGCGGGGC TCGTTCCATITGGGCGGGTTCTATCGTGGCAAATTGATC TCGATTGCGAGTITCCACCAAGCTGAACACTCAGACCTG CAAGGGCAAAAGCAGTATCAATTACGAGGGATGGGGAC CCTCGAAGGATACCGTGAGCAGAAGGCTGGCTC'=TCGC 'TKTrAAGCACGCCGAGCAGCTACTACGGAAAAAAGGG GCAGATATGCTIGGTGTAATGCACGCACGTCAGCCGC CGGTTACTATAAAAGGCTTGGTTIAGTGAGCACGGG AAAITITGAAACCCCGCCGGTTGGGCCGCACATTCTTA TGTACAAAAGAATCACT SEQ ID 1_2F8 ATGCTAGAAGTGAAACCTMTTAACGCAGAGGATACTTA NO:222 CGAAC'TCGACACAGGATCCTGCGCCCTAATCAc3CCG~r AGAGGCATGCATGTATGAAACCGATCTGCTGCGGGGCT CGTCCAT1rGGGCGGGTTCATCGTGGCAAATTGATCT CGAITrGGAGITICCACCAAGCTGAACATTCAGAACTG GAAGGAAGGTTATAAGGTGA CTCTCGAAGGATACCGTGATCAGAAGGCTGGCTCTTCG CTrArrAGCACGCCGAGGAGATACTACGGAAAAGAGG GGCAGATATGCrGGTGCAATGCACGCACGACAGCCG CCGGTTACTATAAAAAGCTGG =ITAGTGAGCAGGGC GAAATIACGACACCCCGCCGGTTGGGCCGCACATrCTT ATGTACAAAAAACTCACT SEQ ID 1_2H8 ATGATAGAAGTGAAACCTATTAACGCAGAGGAGACT1TA NO:223 CGAACTCGACACAAGATCCTGCGCCTAATCAGCCGTT AGAGGCATGCATGTATGAAACCGAT=TGGTGCGGGGCG CGITCCATIGCGCGGGYICATCGTGGCAAATrGATCT CGAITGCGAGIT]CCACCAAGCTGACCACTC-AGAACTG CAAGGGCAAXAGCAGTATCAATTACGAGGGATGGCGAC CCTCGAAGGATACCGTGAGCAGAAGGCTGGTCTACGC TTAfTAGc3CACGCCGAGCAGATACTACGGAAAAGAGGG GCAGATCTACfl1GGTGCAATGCACGCACGTCAGCCGC CGGTrACTATAAAAAGCTTGGT1TAGTGAGCACCGGCG AAATnTGAAACCCCGCCGYTrGGGCCGCACAiTCTA TGTACAAAACIACTCACTTAA SEQ ID 1_A2 ATGATAGAAGTGAAACCTKFIAACGCAGAGGATACTTA NO:224 CGAAC'ITCGACACAGGATCCTG3CGCCCTAATCAGCCGA TAGAGGCATGCATGTATGAAAGCGATCTGCTGCGGGGC GCGTrCCATTTGGGCGGGITCATCGTGGCAAATTGATC TCGATTGCGAGT'rTCCACCAAGCTGAACACTCAGACCrG CAAGGGCAAAAGCAGTATCAATTACGAGGGATGGCGAC CCTCGAAGGATACCGTGAGCAGAAGGCTGGCTITrCGC rrTATAGGCACGCCGAGGAGATACTACGGAAAAAAGGG GCAciATATGCITGGTGCAATGCACGCACGACAGCCGC CGG'ITACTATAAAAGGCITGGfT1AGTGAGCAGGGG ____________ AGTIrCGCACCCGCGGflX3000CGCACATTCTTA -167- WO 02/36782 PCTUSOI/462Z7 TGTACAAAAGAATCACT SEQ ID 1_3D6 ATGATAGAGCTTGAAACCGA1TAACGCAGAGGATACCTA NO:225 TACAOCTGAATAACACACG TAGAAGTGTGTATGTATGAAACCGAT-ITACTTCGT.GGTG CATTTCACTAGCGGCT1TACAGGGGCAAACTGAI ccATAGcTcAT7ccAccAGGCCGAGCACcTAGACCTCC AAGGCCAGAAAcAGTACCAGCTCCGAQGTATGGCTACC TTGGAAGGITATCGTGAGCAGAAAGCGGGATCGAGTCT AAiTAAAcAcGCTGAACAAATTCT-'CGTAAG-AGOGGGG cGGAcTrGCITGGTGCAATGCGCGGACATCCGCCrCAG GCTACTAcAAAAAGTTAGGCTrCAGCGAGCAGGGAGAG GTAFTGATAcGccGCCAGTAGGACCTcAcATCCTGATG _____TATAAAAGGCTCACATAA SEQ ID 1_33 ATGATAGAAGTGAAACCTATTAACGCAGAGGAGACTrA NO:226 CGAACTTCGAcAGAGGATCCTGCGCCCTAATCAGCCGA TAGAGGCATGCATGTATGAAAGCGATCTGCTGCGGG TCGTccATTTGGGcGGGT~cTATcGTGGCCAAfTGATC TCGATTGCGAGTCCACCAAGCTGAACACTCAAC G3CAAGGGCAAAAGCAGTATCAATIACGAGGGATGGCG ACcCTCGAAGGATACCGTGAGCAGAAGGCTTCAC GCTTATITAAGcAcGccGAGGAGATACTACGGAAAAAAG GGGCAGATCTGCTITGGiTGCAATGCACGCACGTCACC GCCGGTACTATAAAAGGTGGTTAGTGACACG CGAAATTCGACACCCCGCCGGUGCCACATC TATGTACAAAAGAATCACT SEQ ID 1_3H2 ATGATAGAAGTGiAAACCTA[TAACGCAGAGGATACTTA NO:227 CGAACTrcGAcACAGGATCCTGCGCCCTAATCACCGA TAGAGGCATGATGTATGACCGAGTGGGG GCGTrCCATYTGGGCGGGTACTATCGTGGCCATGA TCGATTGcGiAGTTrccACAAAGCTGAACACTCAGAC GCAAGGGCAAAAGCAGTATCAETrACGAGGGATGGCG ACCCTCGAAGGATACCGGAGCAGAAGCCAC GCTTATrAAGCACGCCGAGCAGCTACTACGAAAG GGGCAGATATGCr1TGGTGCAATGCACGCACGTCACC GCCGGTACTATAAAAGGCTGAGTGACAG CGAAGTTCGAc.AccccGccGGGGCGACA~rCT ______TATGTACAAAAAACTCACT SEQ ID 1_4C5 ATGATAGAAGTGAAACCTTTAACGCAGAGGATACTA NO:228 CGAACTCGACACAAGATCCTGCGCCCTAATCACCGA TAGAGGATGCATGTATGAAAGCGATCTGGG TCGTccATGGGcGGGTrCTATCGTGcA GATC TCGATTGCGAGTITCACAAAGCTGAACACTCAGAC GGAAGGGCAAAACCAGTATCAATTACGAGGGATGGCG ACCCTCGAAGGATACCGTAGCAGAAGGCTTCAC GCTTATAGGCACGCCGAGGAGATACTACGGAAAAGAG GGGCAGATATGCT1GGTGCAATGCACGCACGTCAGC TCCGGTTACTATAAAAGGTGGTTAGTGAGCACG GAAAI=rCGACAcccCGCCGGITGGGCCGCACArPTT ATGTACAAAAGACTCACTAA SEQ ID 11 4D6 IATGCTAGAAGTGAAACCTAAACAGAGATATA -168- WO 02136782 PCTJSOI/46227 NO:229 -CGAACITCGACACAGGATCCTGCGCCCTAATCAGCCGA TAGAGGCATGCATGTATGAAACCGATCTGCTGG TCGTrCCATITGGGCGGGTTCTATCGTGGCCAArATC TCGATrGCGAGT1TCCACAAAGCTGAACACTCAGACT GGAAGGGCAAAAGCAGTATCAATrACGAGGGATGGCG ACCCTCGAAGGATACCGTGAGCAGAAGGCTGCTCTAC GCTTATTAGGCACGCCcIAGCAciATACTACGGAAAAGAG GGGGAGATATGCTCTGGTGCAATGCACGCACGTCAGCC GccGGTrAcTATAAAAGGGTrGGTI=AGTGAGcAGGG~ CGAAGTTTrCGAAACCCCGCCGGTrGGGCCGCACA1TCT ______TATGTAC AAAAGACTCACT SEQ ED 1_4111 ATGATAGAAGTGAAACCTATrAACGCAGAGGATATA NO:230 cAc~rcGACACAGGATCCTGCGCCCTAATCACCGT AGAGGCATGCATGTATGAAACCGATCTGCTGCGGT cGTrccATrGGGcGGGTcTATCGTGGcAAATGATT CGATIX3cGAG = cCAccAAGcTGAAcAcTcAGAccTG CAAGGGCAAAAGCAGTATCAATTACGAGGGATGCGAC CCTCGAAGGATACCGTGAGCAGAAGGCTGGCTCTACC rTAGGCACGCCGAGCAGCTACTACGGAAAAGAGG GCAGATCTGCTlGGTGCAATGCACGCACGTCA~CCC GGTrACTATAAAAGGCTTGGT1TAGTGAGCACGCGA AGTTCGACACCCCGCCGGTTGGGCCGCACATCTAT GTACAAAAGACTCACT SEQ ID 1_5H5 ATGCTAGAAGTGAACCTATAACGCAGAG(AGATA NO:23 1 CGAACTCGACACAGATCCTGCCCCTAATCACCGT AGAGGCATGCATGTATGAAAGCGATCTCTGCGGC CG1TCCATITGGGCGGGTACTATCGTGGCCAATIGATCT CGATGCGAGCCACCAAGTGACACAG~ GAAGGGCAAAAGCAGTATCAA'fTACGAGGATGGA CCCTCGAAGGATCCGTAGCAGAAGTGCTTACG CTTATAAGCACGCCGAGCAGATACTACAAGAG GGCAGATAT = GGTGCAATGCACGCACGACC CCGGrTACTATAAAAAGCGGTAGTGACACG GAAATTCGACACCCCGCCGGTGGCCGCACAC~ ______ATC3TACAAAAAACTCACIfTAA SEQ IID 1_6F12 ATGATAGAAGTGAAACCTATAACGCAGAGAGACTTA NO:232 CGAA=CGACACAGGATCCTGCCCCTAACGA TAGAGGCATGCATGTATGAAAGCGATCT TCGGG TcGTrccATTiGGGGGGTcTATcGTGGcAAATGA TCGATITGCGAGTFI'CACCAAGCTGAACACTCAGACCTA GAAGGGCAAAAGCAGTATCAATACGAATGGA CCCTCGAAGGATACCGTGATCAGAAGTGCCTACG C~rAMTAAGCACGCCGAGGAGCTACTACGA'GAGG GGCAGATATGC=lGGTGCAATGCACGCACGTCACCG CCGGTACTATAAAAGGCTGGTAGTGACACG GAAATTAcGAAAccccCGiGCGGAcA~c _______ ______ATGTACA-AAAAAATCACT SEQ IID 1_-616 ATAAAGGACTfTAOAAGTCT NO:233 CGAACTTCGACACAGATCCTGCCCAAGCGA TAGAGGCATGCATGTATGAAAGCGATCGGGGC -169- WO 02136782 PCT/USOI/46227 TCGTrccATTGGOGG1CATCGTGCcAATI'GATc TCGATrGOGAGTICCACCAAGCTGAACACTCAGACTG GAAGOGCAAAAGCAGTATCAATrACGAGTGGATGTGCGA cccTCGAAGGATACCGTGATCAGAAGGCTGCT=CG C=ATrAAGCACGOCCGAGGAGATACTACGGAAAAGAGG GGCAGrATCTGCTITGGTGCAATGCACGCACGTCAGCCG CCGxflTACTATAAAAGGCTGG=flAGTGAGCAGGC GAAMTITGACACCCCGCCGGTTGGGCCGCACA1fTC'T _____ATCITACAALAAAAATCACT SEQ ID 3_1lA1O ATGCTAGAGGTGAAACCGATTCAACGCAGAGGATACCTA NO:234 TGAACTAAQGCATAGAATACTCAGACCAAACCAGCCGA TAGAAGCGTGTATGTATGAAAGCGATrTACTTCGTIGGTG CATITCACTTAGGCGGCTAT1rACAGGGGCAAACTGAT ccATAcT~CATrccAccAGGccGAGcAcTcAGAACTCC AAGGCCAGAAACAGTACCAGCTCCGAGGT.ATGGCTACC TrGGAAGGTTATCGTGAGCAGAAAGCGGGATCGAGTCT AGTrAAAcAcGcTGAAGAAATTCGTAAGAGGGGG CGGACTITGCTrrGGTGTAATGCGCGGACATCCGCCTCAG GCTACTACAAAAAGITAGGCTTCAGCGAGCAGGGAGAG ATATJTGAAACGCCQGCAGTAGGACCTCACATCCTGAT _____GTATAAAAGGATCACATAA SEQ ID 3_14F6 ATGCTAGAGGTGAAACCGATrAACGCAGAGGATACCTA NO:235 TGAACTAAGGCATAGAATACTCAGACCAAACCAGCCGA TAGAAGCGTGTATGTATGAAAGCATPfTACTCGTGTG CAT1ITCACTAGGCGGC1TfACAGGGGCAAACTGA'f ccATAGcTCATTCcACCAGGCCGAGCACTCAGAACTCC AAGGCCAGAAACAGTACCAGCTCCGAGGTATGG-CTACC rGAAGcYETATCGTGAGCAGAAAGCGGGATCGAGTCT AATTAAACACGCTGAAGAAAYJ'C1CGTAAGAGGGGG CGGACTGCTGGTGTAATGCGCGCTACGTCCC~rAG GcTAcTAcAAAAAG~rAGGC1CAGCGAGCAGGAGAG ATATITGAAACGCCGCCAGTAGGACCTCACATCCTGAT GTATAAAAC3GCTCACATAA SEQ ID 3_15B2 ATGCTAGAGGTGAAACCGAfTAACGCAGAGGATACCTA NO:236 TGAACTAAGGCATAAAATACTCAGACCAAACCACCGT TAGAAGTGTGTATGTATGAAACCGATTAC'CGTGTG CATITCACTTAGGCGGCTATTACGGGGGCAAACTGAI CCATAGCTTCATrCCACCAGGCCGAGCACTCAGACTC AAGCCAGAACAGTACCAGTCCGAGGTATGCACC TrGGAAGGTTATCGTGAGCAGAAAGCGGGATCGAGTT AATTAAACACGcTGAAGAATCTCGTAAGAGGG CGGACTTO =TGGTGTAATGCGCGGACATCCCCCAG GCTACTACAAAAAGTTAGGCTCAGCGAGCAGGAGAG ATATTTGAAAcGccoccAGTAGGACCTCACATCCGAT GTATAAAAGGATCACATAA SEQ ID 3_6A10 ATGATAGAAGTGAAACCGATrAACGCAGAGGATACCTA NO:237 TGAACTAGGCATAGAATACTCAGACCAAACCACCA TAGAAGCGTGTATGTATGAAAGCGATACF=CGTGT CAII7ACrAGGCGGCTATTACAGGGGCAAACTGA= -170- WO 02/36782 PCT/USO1/46227 AAGGCCAGAAACAGTACCAGCTCCGAGGTATGGCTACC TrGGAAGGTTATCGTGAGCAGAAAGCGGGATCGAGTCT AATTAAACACGCTGAAGAAATTCTTCGTAAGAGGGGGG CGGACTrGCITTGGTGTAATGCGCGGACATCCGCCTCAG GCTACTACAAAAAGTTAGGCTTCAGCGAGCAGGGAGAG ATATTTGAAACGCCGCCAGTAGGACCTCACATCCTGAT GTATAAAAGGATCACATAA SEQ ID 3_6B 1 ATGCTAGAGGTGAAACCGATTAACGCAGAGGATACCTA NO:238 TGAACTAAGOCATAGAATACTCAGACCAAACCAGCCGA TAGAAGCGTGTATGTATGAAAGCGATTTACTTCGTGGTG CATTTCACTTAGGCGGCTATTACAGGGGCAAACTGATIT CCATAGCTTCATTCCACCAGGCCGAGCACCCAGAACTC CAAGGCCAGAAACAGTACCAGCTCCGAGGTATGGCTAC CTrGGAAGGTTATCGTGAGCAGAAAGCGGGATCGAGTC TAATTAAACACGCTGAAGAAATTCTTCGTAAGAGGGGG CCGGACTTGCTITGGTGTAATGCGCGGACATCCGCCTCA GGCTACTACAAAAAGTTAGGCTTCAGCGAGCAGGGAGA GGTATITGAAACGCCGCCAGTAGGACCTCACATCCTGA TGTATAAAAGGATCACATAA SEQ ID 3_7F9 ATGCTAGAGGTGAAACCGATTAACGCAGAGGATACCTA NO:239 TGAACTAAGGCATAGAATACTCAGACCAAACCAGCCGA TAGAAGCGTGTATGTATGAAAGCGATTTACTTCGTGGTG CATTTCACTTAGGCGGCTATTACGGGGGCAAACTGATIT CCATAGCTTCATTCCACCAGGCCGAGCACTCAGACCTCC AAGGCCAGAAACAGTACCAGCTCCGAGGTATGGCTACC TTGGAAGGTTATCGTGAGCAGAAAGCGGGATCGAGTCT AATTAAACACGCTGAAGAAATTCTTCGTAAGAGGGGGG CGGACTTGCTTGGTGTAATGCGCGGACATCCGCCTCAG GCTACTACAAAAAGTTAGGCITCAGCGAGCAGGGAGAG ATATTTGAAACGCCGCCAGTAGGACCTCACATCCTGAT GTATAAAAGGATCACATAA SEQ ID 3_8G11 ATGCTAGAGGTGAAACCGATTAACGCAGAGGATACCTA NO:240 TGAACTAAGGCATAGAATACTCAGACCCAACCAGCCGA TAGAAGTGTGTATGTATGAAAGCGATTACTTCGTGGTG CATTTCACTTAGGCGGCTATTACAGGGGCAAACTGATIT CCATAGCTCATTCCACCAGGCCGAGCACTCAGAACTCC AAGGCCAGAAACAGTACCAGCTCCGAGGTATGGCTACC TTGGAAGGTTATCGTGAGCAGAAAGCGGGATCGAGTCT AATTAAACACGCTGAAGAAATTCTCGTAAGAGOGGGG CGGACTTGCTTTGGTGTAATGCGCGGACATCCGCCTCAG GCTACTACAAAAAGTTAGGCTTCAGCGAGCAGGGAGAG ATATTTGAAACGCCGCCAGTAGGACCTCACATCCTGAT GTATAAAAGGATCACATAA SEQ ID 4_1B10 ATGATAGAAGTGAAACCTATTAACGCAGAGGATACCTA NO:241 TGAACTAAGGCATAGAATACTCAGACCAAACCAGCCGA TAGAAGTGTGTATGTATGAAACCGATTTACTTCGTGGTG CATTCACTTAGGCGGCTTACGGGGGCAAACTGATT CCATAGcTTCATTCCACCAGGCCGAGCACTCAGACCTCC AAGGCCAGAAACAGTACCAGCTCCGAGGTATGGCTACC TTGGAAGGTTATCGTGATCAGAAAGCGGATCGAGTCT - 171 - WO 02/36782 PCTIUS03J46227 A[TAGACACGCTGAACAAAITCIGTAAGAGGGGG CGGAATGCITGGGCAAGOGGGACA CC-TCA GGCTACTACAAAAAGrTAGGi1TAGCGAGCAGGGAGA GATAT1TAAACGCCGCCAGTAGGAC4CTCACATCCTGA ________TaTATAAAAGGATCACATAA SEQ ID 5_2B3 ATGATAGAAGTGAAACCTATrAACGCAGAGGATACCTA NO:242 TGAACTAAGGCATAGAATACTCAGACCAAACCAGCCGT TAGAAGTGTGTATGTATGAAACCGATTTACTTCGTGGTG CATrICACTTCAGGCGGCTFIACGGGGGCAAACGATIT CCATAGITrCA1TCCACCAGGCCGAGCACTCAGACCTCC AAGGCCAGAAACAGTACCAGCTCCGAGGTATGGCTACC TTGGAAGGTTATCGTGATCAGAAAGCGGGATCGAGTCT AAITAGACACGCTGAACAAATTC1TCGTAAGAGGGGGG cGGAcATGcTITGGTGTAATGcGcGGAcATCCGccTcA GGCTACTACAAAAAGTTAGGC1TCAGCGAGCAGGGAGA GATATI=GAAACGCCGCCAGTAGGACCTCACATCCTGA ______TGTATAAAAGGATCACATAA SEQ ID 5_219 ATGCTAGANGTGAAACCGAITAACGCAGAGGATACCTA NO:243 TGAACTAAGGCATAAAATACTCAGACCAAACCAGCCGN TAGAAGTGTGTATGTATGAAANCGATITACTTCGTGGTG CAT rAC'fTAGGCGGC'IrACAGGCGGCAAACTGATfIT CCATAGCITCATTCCACCAGGCCGAGCACTCAGACCTCC AAGGCCAGAAACAGTACCAGCTCCGAGGTATGGCTACC TTGGAAGcr~rA'rCGTGATCAGAAAGCGGGATCGAGTG AATrAAACACGCTGAACAAATI'CTrCGTGAGAGGGGGG CGGACATGCT1TGGTGCAATGCGCGGACATCCGCCTCA GGCTACTACAAAAAGTfTAGGCTTCAGCGAGCAGGGAGA GGTATIfTGACACGCCGCCAGTAGGACCTCACAT)CCTGAT ______ _____GTATAAAAGGCTCACATAA SEQ ID 5_2F10 ATGCTAGAAGTGAAACCTATTAACGCAGAGGATACCTA NO:244 TGAACTAAGGCATAAAATACTCAGACCAAACCAGCCGA TAGAAGTGTGTATGTATGAAACCGATTACTCGTGTG CAT TAC=rACGGCGCTACGGGOCrAAACTGA1TI CCATAGCTTCATTCCACCAGGCCGAGCACTCAGACCTCC AAGGCCAGAAACAGTACCAGCTCCGAGGTATGGCTACC TrGGAAGGTTATCGTGATCAGAAAGCGGGAT'CGAGTCT AA[TAGACACGCTGAACAATTCTCGTAAGAGGGG cGGAcATGcTITGGTGcAAToGcGGGAcAT~ccGccTcA GGCTACTACAAAAAG1TAGGCICAGCGAGCAGGGAGA GATAT1TGAAACGCCGCCAGTAGGACCTCCACATCCTGA ______TGTATAAAAGGCTCACATAA SEQ ID 6_lAl 1 ATGCTAGAGGTGAAACCGATTAACGCAGAGGATACCTA NO:245 TGAACTAAGGCATAAAATACTCAGACCAAACCACCGT TAAATTTTTTAACATACTGGT CATrCAC'TAGGCGGCmTACAGGGGCAAACTGATFIT CCATAGCGTCAYI'CCACCAGGCCGCACTCAGACCTC CAAGGCCAGAAACAGTACCAGCTCCGAGGTATGGCTAC CTTGGAAGG'LTATCGTGATCAGAAAGCGGGATCGAGTC TAA TAGACACGCTGAACAA TCTTCGTAAGAGGGGG ________GCGGACATGCTTI'GTGCAATGCGCGGACATCCGCCTC - 172 - WO 02/36782 PCTUSO1/46227 AGGCTACTACAGAAGITAGCTCAGGGACAGGGAG AGGTATITGAAACGCCGGrCAGTAGGACCTCACATCCTCT ATC*TATAAAAGGCTCACATAA SEQ ID 6_1D5 ATGATAGAGGTGAAACCGALITAACGCAGAGGATACCTA NO :246 TGAACTAAGGATAAAATACTCAGiACCAAACCAGCCGT TAGAAGTGTGTATGTATGAAACCATTATCGTGT CATITCACTTAGGCGGCTI=ACAGGGCAAACTGATTT CCATAGCITCITrCCACCAGGCCGAGCACTCAGACCTCC AAGGCCAGAAkACAGTACCAGCTCCGAGGTATGGCTACC TroGAAGGiTATcGTGATcAGTAAAQcGGGATcGTAGTcT AATTAGACACGCTGAACAAAJTCTTCGTAAGAGOGGGGG CGGACATGCT=TOGTGCAATGCGCGGACATCCGCCTCA GGCTACTACAAAAAGTrAGGCTTCAGoGAGCAGGGGA GGTATTTGAAACGCCGCCAGTAGGACCTCACATCCmGA _____________TGTATAAAAGGATCACATAA SEQ ID 6_-iFi 1 ATGATAGAGGTGAAACCGALTAACGCAGAGGATACTA NO:247 TGAACTAAGGCATAAAATACTCAGACCAAACCACCGT TAGAAGTGTGTATGTATGAACCGAmAC2ITCGTGGTG CATITCACTTAGGCGGC'11ACAGGGCAAACTGAI CCATAGCTTCATTCCACCAGGCCGAGCACTCAGACTCC AAGOCCAGAAACAGTACCAGCTCCGAGGTATGGCTACC TTGGAAGG'TATCGTGAGCAGAAAGCOGGATCGAGTCT AArAGACACGCTGAACAATC1TCGTAAGAGGGGG CGGACATG = GGTGCAATCGCGGACATCCCCCA GGCTACTACAAAAAGTrAOGCTCAGOGACAGGAGA GGTATITGAAACGCCGCCAGTAGGACCTCACATCCTGA TGTATAAAAGGCTCACATAA SEQ ID 6_IFi ATGATAGAGGTGAACCGATTAACGCAGAGATACCTA NO:248 TGAACTAAGGCATAAAATACTCAGACCAAACCACCGT TAGAAGTGTGTATGTATGAAACCGAmTAC1TCGTGGTG cATTcACTTAGGGG CTr iACAGGAAACGA CCATAGCTTCATTCCACCAGGCCGAGCACTCAGACCTCC AAGGCCAGAACAGTACCAGCTCCGAGGTATrACC TI2GGAAGGTrATCGTGAGCAGAAAOCGGGATCGAGTCT ATAGACACCTGAACArCTCGTAAGAGGG CGGACATGCTTGGTGCAATGCGCGGACACGCTCA GGCTACTACAAAAAGTAGGC'1TCAGCGAGCAGGAGA GGTATITGAA.ACGCCGCCAGTAGGACCTCACATCCTGA _____________TCTATAAAAGGCTCACATAA SEQ ID 6_iHiD ATGCTAOTAGGTCIAAACCGATTAACGCAGAGGATACCTA NO:249 TGAACTAAGGCATAAAATACTCAGACCAAACCACCGT TAGAAGTGTGTATGTATAAACCGATTACrCGTGTG CAlTcAcTrAGGcGGcTI=AcGGGGGCAAAcTGATT'T CCATAGCTCATCCACCAGGCCAGCArCGACCWCC AAGGCCAGAAACAGTACCAGCTCCGAGGTATOTACC TrGGAAGGTrATCGTGATCAGAAAGCGGGATCGAGTCT AATTAGACACGCTGAGAAATCTCGTAAGAGG CGGACATGCTTGGTGCAATGCGCGGACACGCTCA GGCTACTACAAAAAGTrAGGCTTCAGOGAGCAGGGAGA GGTATrTGACACGCCGCCAGTAGGACCTCACATCCTGATI _ 173 - WO 02136782 PCTIUSO1/46227 GTATAAAAAGATCACATAA SEQ ID 6_1114 ATGCTAGAAGTGAAACCGATTAACGCAGAGGATACCTA NO:250 TGAACTAAGGCATAAAATACTCAGACCAAACCAGOCGT TAGAAGTGTGTATGTATGAAACCGATfrTACTFCGTGGTG CATIrAC=AGGCGGGTT=ACGGOGGCAAACTOATIT CCATAGCTCATCCACCAGGCCGACACCAGACCTCC AAGGGCAGAAACAGTACCAGCTCCGAGGTATGGCTACC TrGGAAGGTrATCGTGATCAGAAAGCGGGATCGACTCT AATTAAACACGCTGAACAAAITCTrCGTAAGAGQGGGG CGGACATGCTrrGGTGCAATGCGCGGACATCCGCCTCA GGCTACTACAAAAAG'TAGGCICAGCGAQCAGGGAGA GGTATITGAAACGCCGCCAGTAGGACCTCACATCCTGA ______TGTATAAAAGGCTCACATAA SEQ ID 8-1FS ATGATAGAGGTGAAACCGATTAACGCAGAGGATACCTA NQ:25 1 TGAAGTAAGGCATAGAATACTCAGACCAAACCAGCCGT TAGAAGTGTGTATGTATGAAACCGATTJACTCGTGGTG CAI=rACrrAGGCGGCT1ACAGGGGAAACTGAT'T CCATAGC2ITCATTCCACCAGGCCGAGCACrCAGACCTCC AAGGCCAGAAACAGTACCAGCTCCGAGGTATGGCTACC TTGGAAGGTrATCGTGAGCAGAAAGCGGGATCGAGTCT AATTAAACACGCTGAAGA-AATTCTrCGTAAGAGGGGGG CGGACTTGCTTTGGTGTAATG~CGCGGACATCCGCCTCAG GCTACTACAAAAAG'TAGGCTrCAGCGAGCAGGGAGAG ATA=jTATACGCCGCCAGTAGGACCTCACATCCTGATG ______TATAAAAGGATCACATAA SEQ ID S_1G2 ATGATAGAGGTGAAACCGATTAACGCAGAGGATACCTA NO:252 TGAAGTAAGGCATAGAGTACTCAGACCAAACCAGCCGT TAGAAGTGTG3TATGTATGAAACCGATrrACTT'CGTGGTG CATITCAC'TAGGCGGCTAITACAGCOOOCAAACTGAYT CCATAGCTI7CATJ7CCACCAGGCCGAGCACTCAGAACTCC AAGGCCAGAAACAGTACCAGGTCCGAGGTATGGCTACC TTGGAAGGTTATCGTGAGCAGAAAGCGGGATCGAGT'CT AATTAAACACGCTGAAGAAAYFCTrCGTAAGAGGGGGG CGGACTrCTITGGTGCAATGCGCGGACATCCGCCTCAG G-CTACTACAAAAAGTTAGGCITCAGCGAGCAGGGAGAG GTA =TrAGACGCCGCCAGTAGGACCTCACATCCTGAT ______GTATAAAAGGCTCACGTAA SEQ ID S_1G3 ATGCTAGAGGTGAAACCGATTAACGCAGAQGATACTrA NO:253 CGAACTAAGGCATAAAATACTCAGACCAAACCAGGCGA TAGAAGTGTGTATGTATGAAACCGATITACrTCGTGGTG CATITCAC'FTAGGCGGCTATTACAG%300CAAACTGATI'1 CCATAGCITCATTCCACCAGGCCGAGCACTCAGAACTCC AAGGCCAGAAACAGTACCAGCTCCGAGGTATGGCTACC TrGGAAGGTTATCGTGAGCAGAAAGCGGGATCGAGTCT' AATTAGACACGCTciAAGAAATTC=CGTAAGAGGGOGG CGGACTTrGCTTTGGTGTAAT)GCGCGGACATCCGCCTCAG GCTACTACAAAAAc3TrAGGCTrCAGCGAGCAGGGAGAG ATATITGATACGCCGCCAGTAGGACCTCACAT'CCTGATG _____________TATAAAAGGATCACGTAA SEQ ID 8S1117 1ATGCTAGAGGTGAAACCGMTTAACGCAGAGGATACCTA - 174- WO 02/36782 PCTIUSOI/46227 NO :254 TGAACTAAGGCATAGAATACTCAGACCAAACCAGCCGA TAGAAGTGTGTATGTATGAAACCGATTTACTrCGTGGTG CATITCACITAGGCGGC=fIACAGGGGCAAACTGATJT CCATAGCITCA'ITCCACCAGGCCGAGCACTCAGAACTCC AAGGCCAGAAkACAGTACCAGCTCCGAGGTATGGCTACC 'TJGAAGGTrATCGTGAGCAGAAAGCGGGATCGAGTCT AATfTAAACACGCTGAAGAAATTTCGTAAGAGGGGGG CGGACATGCTIGGTrOCAATGCGCGGACATCCGCCTCA GGCTACTACAAAAAGTTAGGCTrCAGCGAGCAGGGAGA GATATITGAAACGCCGCCAGTAGGACGTCACATCCTGA ______TGTATAAAAGGCTCACATAA SEQ ID 8_1119 ATGCTAGAGGTGAAACCGAITAACGCAGAGGATACCTA NO:255 TGAACTAAGGCATAAAATACTCAGACCAAACCACCGT TAGAAGTGTGTATGTATGAAACCGATTTACICGTGGTG CATTTCACTAGGCGGCTATI'ACAGGGGCAA.ACTGA'II CCATAGCITCAFPCCACCAGGCCGAGCACTCAGACCTCC AAOCCAGAAACAGTACCAGCTCCGAGGTATGGCTACC TIrGGAAGGTrATCGTGAGCAGAAAGCGGGATCGAGTCT AArAGACACGCTGAAGAATCfTCGTAAGAGGGG3G CGciACTrGCTfITGGTGTAATGCGCGGACATCCGCCTCAG GCTACTACAAAAAGTTAGGC'ITCAGCGAGCAGGGAGAG GTAT1TATACGCCGCCAGTAGGACCTCACATCCTGATG _____TATAAAAGGCTCACATAA SEQ ID GAT1_21F ATGATGAAGTCAAACCTATAAACGCGGAAGATACGTA NO:256 12 TGAGATCAGCACCGCACTCCGCCGATAC TGGAAGCATGCAAGTATGAAACCGATITGCTCGGGGW ACGTITCACCTCGGCGGATATACCGGGGAACGAT CAGCATCGG1TICTCATAATGCCGAACATCAGACT TGAAGGCCAAAAACAGTATCAGCTGAGAGAG ACGCTTGAAGGATACCGTGAGCAAAAAGCA cGcTcATCCGccATGccGAAGAG rCr~mGAAAA GGCGCGGACCTTiTiiATGGTGCAACGCCAGGACATCTGT GAGCOGGTACTATAAAAAGCTCGGCCAGCAG GCGAAGTCTACGACATACCGCCGATCGGACCTCATATIT TGATGTATAAGAAAITGACGTAA SEQ ID GATI-24G ATGAT[GAAGTCAAACCAATAAACGCGGAAGATACGTA NO:257 3 TGAGATCAGGCACCGCACTCCGCCGATCACG TTGAAGCATGTATGTATACCGAGCTCGG ACG = ACCTCGGTGGATATACCGCAGCTGAC AGcATcGccTccTTATcAAGccGAAcAFcAGAcfl GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CACTTGAAGGGTACCGCGAGCAAAGGAGTAG CTrATCCGCCATGCCGAAGAG TC1TCGGAAAG GGCAGACCfl-'i'ATGCGTGCAATGCCAGGACATA GCGGTTACTATGAAAAGICGGT1rCAGCGAACAGG GAAGTCTACGACATACCGCCGATCGGACC~FATATfl ATGTAT'AryA A A TGACATAA SEQ ID GATi_29G A TGATTGAAGTCAAACCAATAA-ACGCGGAAGATACGTA NO:258 1 TGAGATCAGGCACCGCATCTCCGGCCGAATCACG TTGAAGATAGAAACGTTcT - 175 - WO 02/36782 PCTUSOI/46227 AC~fTCACTCGTGGAMTACGGGCAGTGATC AGCATCGCfTC=ATCAAGCCGAACA1?CAGAGCT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CACTTGAACGGGTACCGCGAGCAAAAAGCGGGTAGTACG CrrATCCGCCATGCCGAAGAGCTrCTCGGAAAAAO GGCAGACCITJATGGTGCAACGOCAGGACATCTGTGA GCC-GGTACTATAAAAAGTCGGCTCAGGAACAAGC GGTCTGCGATATACCGCCGATCGGACCTCATATEIT ATGTATAAGAAA'TGGCATAA SEQ DD GATi_32Cr ATGAITGAAGTCAAACCAATAAACGCGGiAAGATACGTA NQ:259 1 TGAGATCAGGCACCGCATACTCCGCCGAATCAGCCGC TrGAAGCATGTATGTATGAAACCGATrrGCTCGGGGGC ACGYIfTCACCTCGGTGGATATfTACCGGGGCAAGCTGATC AGCATCGCTTCCTITCATCAAGCGAACATCCAGA= GAAGCCAAAAACAGTATCAGCTGAGAGGGATGGCGA CAC'ITGAAGGGTACCGCGAGCAAAAAGCGGGCAGTACG CTrATCCGCCATGCCGAAGAGCTCTCGGAAAAAMG CGCAGACGITIATGGTGCAACGCCAGGACATCTGTGA GCGGCTACTATGAAAGCTCGGCTTCAGGAACAGG GAAGTCTACGACATACCGCGATCGGACCTCATAT=G ________ATGTATAAGAAATTGACATAA SEQ ED GAT2_-15Cr ATGATTGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:260 8 TGAGATCAG GAC CAUCTCCGGCCGAATCAGCCGC TGGAACATGCAAGTATGAAACCGATGCTCGGC~ ACGTTrCACCTCGGTGGATATrACCGGGGCAAGWTGAC AGCATCWTrCCTTCATAATGCCGAACA2rCAGACT GAAGGCCAAAAACAGTATCAGTGAGAGGGATGGA cG vTGAAGGGTACCGCGAGCAAAAAGCGGGAACAC GGTcATccGccATGcGAAGAAGTcTCGGAAAAG wCGAGACC1TTTi'ATGGTGCAACGCCAGGACATCTGTG AGCGGGTACTATAAAAAGTCGCCAGCGACAG CGAAGTCTACGACATACCGCGATCGGACCTCATAT= ______GATGTATAAGAAATGACGTAA SEQ DD GAT2_19H ATGATTGAAGTCAAACCAATAAACGCGGAAGATACGTA NO:261 8 TGAGATCAGCACCGCATACTCCGCGTCACW TPGAAGCATGTATGTATGAAACCGAT1TGT0GG AcGTcAccTcGGTGATAACCGGAGTGA~c AGCATCGCTCCTTCATCAAGCCGAACATCCAGATT GAAGGCCAAAAACAGTATCAGCTGAGAGGGATGGA CAcTAAGGGTACCGGAGCAAAAAGGAGTACG cTTATccwcCATGccGAAGAGcTcTrcGGAAAAAAG CGCAGACCITTTTATGGTGCAACGCCAGGACATCTGTGA GCGGCTACTATGAAAAGTCGGCTCAGGAACAGG GAAGTCT'GCGACATACCGCGATCGGACCTCATAmTG ATGTATAArGA A A TGAkCATAA SEQ ID) GAT2_-21F ATGATTGAAGTCAAACCAATAAACGCGGAAGATACGTA NO: 262 1 TGAGATCAGGCACCGCATCTCCGGCGAATCACG TIGAAGCATGTATGTATGAAACCGATTGCTCGGGO AcGTTcAccTCGGTGGATATrAcGGGA~c AGATCGCTCCTTCATCAACGACArCAGATT - 176 - WO 02/36782 PCTIUSOI/46227 GAACGGCCAAAAACAGTATCAGCTGAGAGGGATGGA CACTI'GAAGGATACCGTGAGCAAAAAGCGGGCAGTACG CITTCCCCAGCCAAGGCICTTGGAAAGG GGCAGACCTTfTATGGTGCAACGCCAGGACATCTGTGA GCGGGTACTATAAAAAGCTCGGCY1CAGCGAACAAGC GGGGTCTACGATATACCGCCGATCGGTACCTCATATlTF _____ATGTATAAGAAATI'GACGTAA SEQ ID 13_10F6 M1EVKPINAIDTY 131RBRLRPNQPLE-ACKYETDLLRGWFH NO:263 LGGYYPGKLISIASFHQAFI{PELEGQKQYQLPGMATLERGY RBKAGSTRMHAEELLRKKGADULWCNARTSASGYKK LGFSEQGEVYDIPPVGPH)LmyYLT SEQ ID 13_12G6 MIEVKPINAED I R1nU 1LPNQPLEIACKYE'IDLLPGAFH NO:264 LGGYYPGKLVSIASFHQAEHPELEGQRQYQLRGMATLEG YREQKAGsThTRHNEELLRKKGADLLWCNARTSASGYYK IKLG-TFQGEVYDIPPTGPI~LMvYXXLT SEQ ID 14_2A5 MIEVKPINAEDTYEIURRRI[PNQPLACKYETDLLGSTFHL NO:265 GGYYRGKLISIASFNQAEHPELEGQKQYQLRGMATLEGYR, EQKAGS Ii 11AFFT iRXKGADLLWCNAPRTSASGYYXXL ________GFSEQEYTPPVGPBILMfYKKLT SEQ ID 14_-2C 1mEEVKPNADTYEInRPNQPLEACYETDLLRGAFHI NQ:266 LGGYYRGKLVSIASFHQAEHELEGQKQYQLRGMATLEG YRBQKAGS1 TR~t-AEEL IRTKTGADLLWCNARTSASGYYK KLGFSEQGEVDTPPTGPHaIMYKKLT SEQ ID 14_Fi 1 MIEVIKPIINAEDTYEIULRPNQPLE-ACKYEfDLTLRGAFH NO:267 LGGYYRGKLVSIASFHQABPELEGQKQYQLRGMATLEG yREQyKAGST ji TT~AALLRKK(TGADLLWCNARTSASGYYK ______ _____KLGFSEQGTEVYDTPPATPHJLMYKXLT SEQ ID CHERA ENIAEDTYEBI~LRPNQPLEACMYD IRGAF NO:268 LGGYYRGKLTSIASFHQARHSELEGQKQYQLRGMATLEGY REQKAGSTURHAEELLRKKGADLLWCNARTSASGYYKK _____LGFSEQG3EVYDTPPVGPHJLMYKKLT SEQ ID 1012D7 NuVKPIIAEDTYEm1EURPNQPLE CKYDLLGTL NQ:269 LGGYYRGKISIASFHQAEPELEGQKQYQLRGMEE REQXAG3STLJRHAEBLRKKGADLLWCNARTSASGYY-KK _______LGFSEQGEVYDIPPT)GPH'EIMYKKLT SEQ ID 10_15F4 MIEVIKPINAED)TYEIRILRPNQPLEACMYETi)LLRGTFH NO:270 LG RGY VSIASFHQAEIPaLEQKQYQLRGMATLEE YREQXAGS'TL-TAPPI TiRYTKGADILWCNARTSASGYYK KLGFSEQ-GGVYDIPPVGPIULMYKKT SEQ HD 10_171)1 MIEVKYPINAEDTIYEIhR]L.RPNQPLEACKYE-ITLLGGTFH NO:27 1 I.GGYYRGKLISIASFHQAEKPELEGQKQYQLRGMATLhEGY REQKAGcSTLiRAEEIRKCKGADLLWCNARTSASGYrYKK _______LGFSEQG--EVYDTPPVGPH]IMYKYLT SEQ ID 10_-17F6 MVKPhINAEIYEThR]LRPNQPLEACKYETDLLGGTTH NQ:272 LGGYYRGxLvsi~sHASEEGQKQYQL 0TLE YREQKAGSI T-API 1~CGADLLWCNARTSASGYYK KLGFSE-QGTE)YIPVGPHLmyKLT SEQ ID 10_18(39 MIEV2Th1AEDTYEIRHRUPNQPLEACKYEIDLLGGTH NO:273 ILGGYYRGIKLVSIASFHQAEHSELEGQKQYQLRGMATLEE NO:273 QKAciSTLIRI3AEELLRKKGADLLWCNARTSASGYYK - 177 - WO 02/36782 PCTIUSOI46227 KLGFSEQGGVYDII'PVGPHLIVIYKEXT SEQ ED 10_1H3 MLEVYKPNAEDT1Y.E]RHR1RPNQPLEACKYETDUGTH NQ:274 LGGYYRGIKLVSIASFHQAFBPELBMiPKQYQLRGMATLEG yREQKAGSTLpiRAEELLRKGADLLwCNARTSASGYYK ______KLGFSBQGEVYDIIPTGPBU-MY=XT SEQ IID 10_2D10 MIEVKPINABD I YhaUMUR]NNQPLEACMYETDTLGGTLI NO:275 LGGYYRGKUSIASFHQAEHPELBGQKQYQLRGMATLEE-Y RBQKAGS Ti R1zABE~I 1R WGADLLWCNARTSASGYYKI( LrGFSEQ0GVypDIPPVGPBIUMYXKLT SEQ ID 10_23F2 MEPIhrABDTiYEmBRRILPNQPLEACMYEDLLGGTFH NQ:276 LGGYYRGIKLVSIASFIHQAEHPIELEGQKQYQLRGMAThE-G YRBQKAGSTLIRHAEFLLRKXGADLLWCNARTSASGYYK ______KL01~sEQGEVYDTPPVGPBIMYEELT SEQ DD 10_238 MEEVKPINAEDTYEIRRRIRPNQPLE-ACKYBIDLLGGTFH NO:277 LGGYYGKUJSIASFHQAEHPELEGQKQYQLRCTMATLEEY RBQKAGS Mj 1FT-APFnI 1f GADLLWCNAR.TSASGYYKK _______ 10 LGFsEQGEvYDTPPVGPHIIVYKXKLT OCEDLG SEQ ID 102C M]BVPIIAED 1 1hRUPfL~RPNQPLEACXEDLGF NO:278 LGGYYRGKLIIASFHQAIHPaLEGQKQYQLRGMA=hEY REQIKAGS ~TRHLAPFJ LiRKTGADLLWCNARTSASGYYKK LGFsEQGEvYDTPPVGPHIMYK:KLT SEQ ID 10_305 MEV<PINAEDTYERHJLRFNQPL-ACKYETLLGGTFH NO:279 LCGYYRGLVSIASFHQABHPELBGQKQYQtRGMATLJ-G YREQKAGSThIRHIAEE.LKGADLLWCNARTSASGYY KLGFsEQGEVYDIPPTGPBILMYKXLT SEQ ID 10_4H7 MEEVKpJNAIDTIYEJRH]RPNQPLE-ACMYE'=LLGGTFH NO:280 LGGYYRKLSIASFHQAELGQKQYQI-RGMTLG ,yRBQKAGS j RT-AEPK iKrGADLLWCNARTSASG=Y IKLGFSEOGEVYDIPPTGPEILMYKKLT SEQ ID 10_6D1 1 ME~VKPNAEDTYEmRILRPNQPLE-AGKYeTDLLGiJLH NO:28 1 LGGYYRGLVSIASFHQABEQKQYQI-RGMALG YRBPQYAGSTLIRHAEELLRKI(GADLLWCNARTSASGYYK ______ _____KLGFSEQEVDPPVI'ILMYKKLT SEQ ID 10_8C6 M1E NAEDTYERNQPNEACKYETLLGGAFH~ NO:Z82 LGGYYRGKISIASFBiQAEBPELEGQKQYQLRGMA=GY RBQKAGS ~ITWFTAP1ET P KGADLLWCNARTSASGYYKK LciFsEQGvyD]PPVGP13IJMYKIKLT SEQ ID 1 10 MM]pNETEBIRNPFAKEDLGF NO:283 LGGYYQGKLISIASFHQAESSELGQKQYQLRGMATLPGY REQKAGS I1TLMAPPT I.PT(TGADJLWCNARTSVSGYYKK LGFSEQGGVYDIPPIGPHJJMYKKLT kMEDLGF SEQ ID 1 1G3 MEIPNETERURNP NO:284 LGGYYQGKUISIASFHQAEHSFIEGQKQYQI-RGMATEGY REQKAGSThTRRAEELLRKKGADLLwCNARTSASGYYEK LGFSEQGGVYDIPPIGPIMYKYKLA SEQ ID 11113 MEVKNAEDTYERHR-RPNQPL-ACMYETDLLGTGAFH NO:285 LGGYYQGKJisiASFHAEHSELEGQKQYQLRGMALGY REQKAGSThIRHAEELLRKKGADLIWCNARTSVRGYYK _____________LCFSEQGvyDIPPIGPBUJLMYKKLT SEQID 12 1f9 M]VNAEDTY -RNQLECYETDLGuiF -178- WO 02136782 PCT[USO1146227 NO:286 LGGYYRGKUSIASFHQAEHPELEGQKQYQLRGiMATLEGY REQKACGSTLRRAEELLRXXGADL.LWCNARTSASGYYKK LGFSEQGEVHDIPPTGPHaln MYLT SEQ ID 12_209 MIEVIKPflNAEDTYERHRILRPNQPLEACKYETDUJ-GUlTH NO:287 LGGYYRGYLVSLASFHQARHPELEGQKQYQJLRGMATLEG yRBQKAGSThTRHAEELLRKI(GADTILWCNARTSASGYYK ______KLGFSEQOEYYDTPPVIPHIVMKKT SEQ ID 12_3F NMVKPINAEDI.i. 1FTT[ .PNQPLEACK-YETDTLGG1FH NO:288 LGGGKLISIASFHQAEPELBGQQYQLRGMATLBGY REQKAGSTLhtRRAEELLRXKKGADLLWCNARTSASGYYKK ______LGiFSEQ(GGVYDIPPVGPHLYKKLT SEQ ID 12_5C10 MIVKIADYR~fRNPFKEDLGF NO:289 LGGYYRGKLTSIASFHQAEHiPELEQKQYQLRGMAThEEY REQK.AGS 11 R HAEPI-IaGKTADILWCNARTSASGYYKK LGFSEQGEVYDAPPTGPHUIAYKXLT SEQ ED 12_6A10 MIEVKPINAED)TYEnIRPJRPNQPLE.A.CKYETDLLGTH NO:290 LGGYYRGKLVSIASFHQABHPELBGQKQYQLRGMAThE-G yREQKAGSTIRHAEELRKKGADLLWCNARTSASGYK ICLOFSEQGGVYDII'PVGOHLYKKLT SEQ ]D 12_6D1 MEVIKNAEDTYEflRPNQPLEAC MYEDLLGTF NO:291 LGGYYRGKLISIASFHQAEHPELEGQKQYQLRGMATLEEY REQKAGSThHAEELLRKKGADULWCNARTSASGYK _____LGFSEQGGVYDIPPVGPHILMYKIMT SEQ ID 12_6F9 MIVPNET QPFAKEDLGF NO:292 LCGGYYRGKLISIASFHQAEBPEIEGQKQYQLRGMATLhEGY REQKAGS Ti RkR KKTPI CTADLLWCNARTSASGYYKK LGFSEQ(3EVYDIP''PT-M M2VYKYIT SEQ ID 12_6116 MEVNAEDTlYEIRRP~RNQPLBACKYEDLLGTI NO:293 LGGYYRGILVSIASFHQAHELC3QKQYQLRGMATLE-G yRBQKAGSThtRRkALALRKGADLWCNARTSASGY _______KGIFSEQGEVYDIPPTGPHIMXL SEQ ID 12_7D6 MIVPNE--.RBURNPFAKEDLGF NO:294 LGGYYRGTLSIASFHQABEBGQKQYQRGMTLEY NO:94REQKAGTLTHABELRKKGADIWCNARTSASGYX LGFS QGGVYDIPPTGPHILMYKXKLT SEQ ID 12_7011 IEVKPNADTYEITRPNQPL EACKKY=LGTF NO:295 LGGYY GKLTI6SFHQAEHSELEGQKQYQLRGMATLEGY RBQKAGS Ti J AEELL RKGAD.LWCNARTSASGYYKE( _______LGFSEQG--EVYDTPPVGPBUEMYKKLT SEQ ID 12F5 MIEVIKPINAEDTYEIRRRJLRPNQPL-ACMY=rLLGGTFH NO:296 LGGYYQGKLISIASFKAEHSELEGQKQYQILRGMATLEGY REQKAGSTI,.TRHAERLLRXKGADLLWCNARTSVSGYY'KK LGFSEQGGIYDIPPIGPHELMYEKLT SEQ ID 1207 NDVKIAE~~uu~pQL~KEDiuF NQ:297 LGGYYQOUSIASFKA=ELSEGQKQYQLRGATLGY REKAST1R1{AEELRKADLLWCNARTSVSYYXK _______ ______LGFSEQGEVYDIPPIGPHflUMYKYLT SEQ ID) 1_216 MIEVKPINABD~TYEBILRPNQPLE-ACMYETDLLGGAFH NO:298 LGGYYRGIASFHQAEELGQKQYQRGMALEGY REQ AGS- 179
-
WO 02136782 PCTJUSOI/46227 _______LGFSE013GVYI)IPPIGPHIMYKXXLT SEQ ID 13_12G12 MIVPNETEUMPNPFAMEDLGF NO:299 LGGYYRGLS1SFNQAELGQKQYQLRGMTLEY REQKAGSTL2RIAEEULRKKGADLLWCNARTSASGYYKK ______LGFSEQGEVYDIPPVGPHMEHKKLT SEQ ED 13_6D10 1vEVIKPIhAEDYERRR]RTNQPLAC1YE-fSLGGTF NO:300 LGGYYRGfLTSIASFNQAEEPELEGQKQYQLRGMALGY REQKAGSTj L(AUikT I 1RWTGADLLWCNA1RTSASGYYK _____LGFSEQGEVYDTPPVGPHILJVIKKT SEQ ID 13_7A7 MIEVNAEDTYERBJLRNQPL-ACMYETDURSAFH NO:30 1 LGGYYRGICLISLASFHQAEHPELEOQKQYQLRGMATLEE RLEQKAGS 1TRPITAEELLIRKKGADLLWCNARTSASGYYKK LGFEQEVYD'ITPVGPIEILMYKIKLT SEQ ID 13_7B12 MIEV2KPNAEDYERJLRPNQPLACYT)LLSTFHL NO:302 GGYYRGKSIASFHQAEEGQKQYQLRGMTLGGY EQKAGS i1 1M1APPTT 1 WTGADLLWCNARTSASGYYKKL _____________GFSEQGEVYD]IPGHIMKL SEQ ED 13_7C1 MIBVPflADTY UE.PNQPLE-ACKYEILGF NO:303 LGGYYRGKLUSIASFHQAEEPFEGQKQYQLRGMALGY REQKAGSTLMHALRKGAD1LWCNARTSGYK LGFSEQGiEVYDIPPTGPBILMYKXLT SEQ ID 13_806 MIEVYINAEDTYT RLpNQLCKYET1DSLjGTFH NO:304 IJJOYYRCTKLISIASFNQAEBPELEGQKQYQLRGMAThE-GY REQKAGS 1T1ART CGLLWCNARTSASGYYKT( ______LOPSEQ(GGVYDIPPVGPHELMYKCKLT SEQ ID) 13_9F6 MEVPINAETYE RRPQPLEACKYTGGTF NO:305 LGGYYRGKISIASFHQAEEEGQKQYQIRGM'LEY REQKAGSil T-AP CGDLCNARTSASGYYEX ______LGFSE OEVYDIPPVGPT-JLMYKKLT SEQ ID 14_10C9 MIfEVKPINAFDTYEIRR RQPLEACKYEfl)LLRGA.FH NO:306 J)OYYRGiKUSIASFHQAEBPELEGQKQYQlRGMATBEFY REQIAGST'IRIHAPJR 3 GADLLWCNARTSASGYYKK ______ _____LGFSE GEVYDTPPVGPHIIMYKK<LT SEQ ID 14_10H13 MIEVT PINAEDTYIRIRNPECKYETD.LLLRGAF NO :307 LGGYYRGKINSIASFHIQAEELEGQKQYQLRGMAkTLEE ThIRE3EELLRKGADLLWCNARTSASGYYK _____KLGPSEQGEVYDTPPVGPHILMYKKLT SEQ ID 14_10119 BivyPNAETYERPNQPLF-,KEACKLRG NO:308 LGGYYRGKLVSIASFHQAEBPELEGQKQYQLRGMATLEG YREQKAGSTLTRHAEELLRKKGADL-LWCNARTSASGYYK SKLGFSEQGEVYDfl'PVGPBREMYKKLT SEQ ID 14_1 1C2 MMYIAT~HMNPECYTLGTE NO:309 GGYYRGKLVSIASFHQAELGQKQYQLRGTEFY RBQEAGS 11.111AE T RGADLLWCNARTSASGY=K LGFSE EVYDT PPPTGPHJMYKKLT SEQ ID 14_12D8 MJEVKPINAEDTYEIRHRILRPNQPLEACKYEfl)LLGGTFH NO:3 10 LGGYYRGLVSIAS HQABLEQKQYQGALG yRBQKAGSTh1RHAEALLRKLGADLLWCNARTSASGY KLGFRBQGGVYD]FPVGPHJLMYICKLT SEQ ID 14= 1211H6 -180DT-lBJLP nLAKYTLIGF WO 02/36782 PCTIUSOI/46227 NO:3 11 LGGYYRGKLISIASFHQAEBPELE-GQK-QYQLRGMATLEEY REQKAGSTL'I'iAEELLIRKciGADILLwcNARTsASGYYKK LGFsEQGEvYDIPPTGPHILMYXKLT SEQ ID 14_2B6 MMIEKP]NAED) i YxMIFTR PNQPILEACKYFETDILGGTFH NO32REQKIAGST ji T-APPI LTRGADLLWCNARTSASGYYIQ( ______LGFSEQG3GV'YDIPPVGPH[LMYKYKLT SEQ ID 14_2G11 MIEVIKPINAEDTYER11R]LRPNQPLE-ACKYETfDTLRGAFH NQ:3 13 LGGYYRGY[NSIASFHQAEHIPELEGQKQYQLRGMATLEG yREQKAGSTRHAEELLRxKGADL~wcNARTsASGYYK ________ KLGFSEQGEVYDI'PTGPHILIVYKXLT SEQ ID 14_3B2 MEEVypINAEDTYEIRHILRNQpLEAiCKyETDLLRGAFH NO:3 14 LGGYYRGKLVSIASFHQAEBPELEGQKQYQLRGMATLEG yRBQKAGSTLIRFIAEAILRKKGADL LWCNARTSASGY YK KLGFSEQGGVYDIPPAGPHJLMYKKLT LGTB SEQ ID) 14_4118 M]EVPINAEDTYBIRPNQPLEACKYETDLS~~ NO:3 15 GGYYRGK1IASFQAEBGQKQYQLRGMATGY EQKAGSTLmHAkELLRKcKGADILLwCNARTSASGYYKKL ___ _GFpsEQciEvYDT1PPVGPIDLMYKKLT DLGF SEQ ID) 14_6A8 MIEVYPINAED)TYEJERILRPNQPLACMYBTD NO:3 16 LGGYYRGKLVSIASFNQAEEIPELEGQKQYQLRGM-ATLEG yREQXAGSThERHAEELLRXKGADLLWCNARTSASGYYK KLJGFSEQGEVYD)TPPVGPHVLMYKKLT SEQ ID 14_6B10 MIEPINAED TYEM3iP -R PNQPLEFACKYEIDLLGGTFHI NO:3 17 LGGYRGKJSIASFHQAELEaGQKQYQIRGMALGY REQKAO-s j.11IkA1M IR T(GA)ILWCNARTSASGYYKK _______LGFsEQGGVYDMPGPBILMYKIKLT SEQ ]D 14_6D4 MIVKIAETEHIRNQIFAKEDJI~F NO:3 18 LGGYYRGK1TSIASFNQABELEGQKQYQL-RGMALGY RBQKAGS iT TTAEALLRKKCTGALLwcNARTsASGYYKK LGFSEQGEVYIDTPPVGPHRIMYKKLT SEQ ID 14_7A1l 1-MEVKPINABD I YiEJiRI 11RPNQPI-EACKYBTDLLRGAFH NO:319 LGGYYRGLVSIASFHQAEGLKQYQRGMATEG NO: 19YREQYKAcSTLERHAEELLRKKGAIiWCNARTSASGYYK ______ _____KLG-FSE GEVYD)TPPTGPH11IMYKKLT SEQ ID 14_7A1 MIEVYPINAEDTYEIRR]RPNQPLE-ACMYBTDLiRGTFPH NO:320 LGGYYRGCLVSIASFHQAEPELGQKQYQGTLE yREQKAGSTLnUHAEELLRKKGAD)LLWCNARTSASGYYK ______KLGFSEQGEYI)TPPAGPBIAMYKKILT SEQ ID 14_7A9 M[EvKp]NAEDTYEIRRRLRPNQPLACKYETDUGTH NO:321LGGYYRGKCLVSIASFHQAKBPELEGQKQYQL-RGMA'ILEG NO:21YREQKAGSTLTHAEELRKGA)LLWCNARTSASGY KLGFSEQGEVYDTPPVGPHELMYKIKLT ELRGH SEQ ID 14_701 MIEVKPINAEDTYEM RU-RPNQPL'ACKYTDLRAF NO:322 LGGYYRGKTSIASFNQABELBGQKQYQRGMALEEY RBQKACGSTI 1ETAPAT I PT(GADU.LWCNARTSASGYYK _________ LGFBQGBVYTPPVGEMEJ~ SEQ ID 14_7H19 MCVyflITAEDTYEER1RILRPNQPLEACKYEI1)LLCTGTFH NO:323 IGGYYRGLVSIASFQAELGQKQYQGATL ______ YREQKAsTRAK DLWNSSG -181- WO 02/36782 PCTIUSOI/46227 _____________KLGFSEQGEVYDIPPVGPHMIMYXXLT SEQ ID 14_SF7 MEVIKPINAED'YERBRH11RPNQPLE.ACKY='TLLGGTFHi NO:324 LGGYYRGKLVSIASFHQAEHPELEQQKQYQLRGMATLEE YREQKAGS .I1TTRA1RAT DLRKGADI.LWCNARTSASGYYK XLGFSBQGEYIPGBIMKL SEQ ED 15_10C2 MIVKPINAEDTYEZRBPLRNQPLEACKYE'IDLLRGAFH NO:325 LGGYYRGKLVSIASFPHQABUPELBGQKQYQLRGTMATLE-G YREQKAGSThIRRAEELLREXGADILLWCNARTTASGYYK KLOFSEQGVDPTP]MKL SEQ IID 15_10D6MJVPADITl4T1 PQLAYTLGTF NO:326 LGGYYRGKLVSIASFHQAEHPELEGQKQYQLRGMATLEE, YREQKAGSTLHIRAEELLRKKGADLLWCNARTSASGYYK ______KLGPSEQEY PGPILMNfYKIKLT SEQ IID 15_11F9 MIEVIKANA.EDTYEIRHIRPNQPLE-ACKYETDLLRGAFH NO:327 LGGYYRGKLVSIASFNQAEBPELBGQKQYQLRGMATLEG YREQKAGSnZHI{AEELULRRKGADLLWCNARTSASGYYK ______ LGFSBQGEVYDIPPTGPIULMYKIKLT SEQ IOD 15_11113 MIEVKPTNABDTYEIRHR1LRPNQPLEACKYETDULRGAFH NO:328 LGGYYRGKLISIASFHQAEHPELEGQKQYQL-RGMAThEGY RLBQKAGSTLTR11AEALLRKLKGADLLWCNARTSASGYYKK LGFSBQGEVYDIPPTGPBLmyKIKLT SEQ ID 15_12A8 MIEVIKPNAEDi )T1MH11 RPNQPL.FACKYETDTLLGGThH NO:329 LGGYYRGKUISIASFHQAEEPELEGQKQYQLRGMATLEGY RBQKAGSTLIRRHABALLRIKKGADLLWCNARTSASGYYKK LGFSEQG7EVYDIPPTGPEIMYKKLT SEQ ID 15_12D6 MIEVIKPNAEDTjY frjR1TTR1NQPLE-ACMYETDU..RGAFH NO:330 LGGYYRGIKLVSIASFHQAEPQKQYQLRGMATLEG yREQKAGSTLtRHAEELLRKKGAD-J.WCNARTSASGYYK ________KLGiPSEQGEVYDTPPVGPHILMYKKLT SEQ ID 15_12D8 MIEVXPTNAEDTYEIRBJLRPNQPLEACKYETDLLGGTFH NO:33 1 LGGYYRGKLVSIASFHQAEBPELEGQKQYQLRGMAThEG yREQKAGSThRzHAEELLRXICGADULWCNARTSASGYYK ______KLGFSEQGiKVYDIPPVGPEILAMYKKLT SEQ ID 15_12D9 MIEVKPflNAEDi 'Id-TR 11 LRPNQPLEACKYETDLLRGTFfI NO:332 LGGYYRGKLVSIASFHQAEHPELBEGQKQYQILRGMAThEE yREQKAGSTURRiAEELLRKKGADLLWCNARTSASGYYK KLGFSEQGEVYDIPPVGPH[lAYKKLT SEQ ID 15_3F10 MIEVKPINAEDTYElR1U .1RPNQPLEACKYETDLLRGAFH NO:333 LGGYYRGKLIIVSFHQAEEPELEGQKQYQI.RGMATLEGY REQKAGSThIRHABELRKKGADILWCNARTSASGYYKK ______LGFSEQGVYPAGPIJLMYTL(LT SEQ ID 15_3(311 M1EVKINAEDTYEIR1LRPNQPLBACKYET1)LLGGTFH NO:334
LCGGYYRGKLVSIASFHQAEBPELEGQKQYQLRGMATLEE
YREQKAGSii 1MHAEELLP 1KKGADLLWCNARTSASGYYK KLGFsEQGEvYDIPPVGPILMYKIKLT SEQ ID 15_4F11 MIEVKPIh1AEDTYIMRIRPNQPLE-ACMYEDLLGGTFH NO:335 LGGYYRGYVSIASFNQAEBELEGQKQYQLRGMALG yREQ]KAGsTLniRLAEALLRXXGADILWCNARTSASGYYK _______ KLGSEQGEVYDIPPTGPIELMYKLKLT S ID 115_413 IMEVKPINAEDTYEIRHRILRPNQP LEACKYETDLLGTGTFH - 182- WO 02/36782 PCT/USO1/46227 NO :336 LGGYYRGKLVSIASFHQAEHPELEGQKQYQILRGMATLEE YREQKAGSThTRHAEELLRXKGADL.LWCNARTSASGYYK ________KLGFSEQGEVYDIPPTGPHILMYKKLT SEQ ]D 15_6D3 MVIEVKPINABD Ix EiP-l1 11RPNQPLEACKYETLnGTFlH NO:337 LGGYYRGKUISIASFHQAEBPELEGQKQYQLRGMATLEEY RBQKAGS j JR 1AE1R K GADL-LWCNARTSASGYYKI( ______LGFSEQGEVYDIPPTCIPHIUVYKIKLT SEQ ID 15_iG1l 1 IKPINAED1L'RRnURPNQPLEACKYETDLL-RGAFPH NO:338 LGGYYRGIKLVSIASFHQAEHPELEGQKQYQLRGMATLEE yRBQKAGSThIRRAEELLRXKGADLLWCNARTSASGYYK _____KLGFSEQGKVYDIPPVGPHMhMYXIIT SEQ ID 15_9F6 M[EvKpIAED .. zTR1~ 1 nu RPNQPLE-ACKYEII)LLGGTFH NO:339 LGGYYRGKLISIASFHQAEHPELEGQKQYQLRGMATLHEY RBQKAcISThIRHAEELLRRKGADL.LWCNARTSASGYYKK LGFSEQG--EVYDIPPVGPHLMYKXKLT SEQ ID 15F5 MEVKNAEDYRLPQLAKED~LG NO:340 L.GGYYRGKLISIASFHKAE2ESELEGEEQ YQLRGMATLEGY REQKAGSTLIRyAEELLRKKGADLLWCNARTSVSGYYKK _______LGFSEQ(3EVYDIPPIGPBILMYKIKLT SEQ ID 16A1 MIEVKPINAEDTIY.EnU~RLRNQPLE.ACMYETDLLGGThH NO:341 LGGYYQGKLISIASFHKAEHSGLEGEEQYQLRGMATLEGY' R]BQKAGSThUiRAEELLRKGADL.LWCNARTSVSGYYEK ______ _____LGFSEQGEVYDIPPIGPEILMYKYLT SEQ ID 16H3 MD[VKPINAEDTYEIRIRPNQPLACKYETDLLGGWFH NO:342 LGGYYQGKLTSIASFHQAEHSELEGQKQYQLRGMATLEGY REQKAGSTh1RHAEELLRKKGADT-JWCNARTSVSGYYEK LGrFSEQG3EVYDIPPIGPHMUYKIKLT SEQ ID 17C12 EVKPISAETERRLPQLAM~lILGF NO:343 LGGYYQGKLISIASFHQAFMSELEGQKQYQLRGMATLhEGY RpBQKAGSThTRHAEELLRXXGADLLWCNARTSVSGYYEK _____LGFSEQ YIPGPHILMfYLT SEQ I[) 1 8D6 NfvrA)NAEDTiY hnZHRILRPNQPLEACKYETDLLGGTFH NO:344 LGGYYRGKLISIASFEKAEHSELEGQKQYQLRGMATLEGY REQKAGSThTRHAkEELLRKKGADL.LWCNARTSASGYYEK LGFSEQGEVYDIPPIGPHIVYKKLA SEQ ID 19C6 MJEVKIINAEDTYEIRR]RPNQPLB-ACKYEDIIGGTFH NO :345 LGGYYRGKELJCIASFHQAEHSELEGQKQYQLRGMATLEGY REQKAGSTh1RHAEELLRKKGADL-LWCNARTSVRGYYE _________LGFSEQOGVYDIPPIGiPHJMYKKLA SEQ ID 19D5 MY KPNAE DTYEJRCIUPNQPLE-ACMYEIDLLGGTFH NO :346 LGGiYYQGKLISIASFHKAEHSELEGQKQYQLRGMATLEGY RBQKAGSTTRHAEELLRKKGADLLWCNARTSVSGYYKK LGFSEQGEVYDIPPIGPILNYKKLT SEQ ID 20A12 MIEVKPINAEDTYE]RR1RPNQPLEACMYETDL)LGGTFH NO:347 LGGYYQGKISIASFHNAEHSELBGQKQYQLRGMATLEGY REQKAGSTL1RHAEELLRKKGVDI-LWCNARTSVSGYYKK LGFSEQQIGIYDIPPIGPHJLMY.ELA SEQ ID 20F2 MIEVKPIhTAED I Y bRURURPNQPLEACMYETD)LLGGTFH NO:348 LGGYYRGKLISIASFHQAEHELEGQKQYQLRGMATLRGY _____________REQKAGSTh1RHAEELLR.IKKGADLLWCNAR.TSVSGYYEK -183- WO 02136782 PCTUSOI/46227 ________LGFSBQGEVYDIIPPIGPH11MYKKLT SEQ ID 2.10E+12 IEVKNAEDTYRR RPNQPLEACKYILGGAFH NO:349 LGGYYQGKLIIASFPHQAFBSELEGQKQYQLRGMATLEGY RIBQKAGTS iT-TAEEI LLRcKGADLLWCNARTSVSGYYKK SLGFSEQG3EVYDIPPIGPEMYXLT SEQ I[) 23H1 1 M[EVYPIAETYEBRPNQPLACMYETDJLGGTFH NO:350 LGCTYYQGKLIIASFHKAEHSEGQKQYQLRGMATLEGY RBQyKACSThJRRAkEELLRKKGADLLWCNARTSASGYYEK _______LGFSEQGEVYDIPPIGPHLMYKLEA SEQ U) 24C1 MIJEVKPINAEDTIY ERRIRPNQPLE-AC:KYEITLGGTFH NO :351 LGGYYRDRISIASFHQAEE{SELEGQKQYQLRCGMATLEGY REQKACGSThTRRAEELLRKKIADLLWCNARTSVSGYYKK LGFSEQG:EVYDIPPIGPHLLMYKIKLT SEQ ID 24C6 MIEVKINAEDTlYERBRTLRPNQPLB-ACMYETDLLGGI4 NO:352 LGGYYRCTKLSIASF.HQAEEHSELEGQKQYQILRGMATLhEGY REQK-AGSThTPLHAEELLRKKGADLLWCNARISVSGYYKI GFSEQCGGVYDIPPIGPBILMYKKLA SEQ IID 2.40E+08 MIEVKPINAEDTYEIRHRIRPNQPLE-ACKYETDLLGF NO :353 LGGYYR.GKLISIASFHNAERSELEGQKQYQLRCTMAThEGY REQKAGSTLTRBAEELLRKKGADILLWCNARTSASGYYEK LGFSEQGEVYDIPPIGPHJLMYKKLA SEQ ID 2_8SC3 MJEVYNAEDTYEIRRLRPNQPLB-ACMYETDLLGGTFH NO:354 LGGYYRDRLISIASFHQAEHSEL'EGQKQYQLRGMATLEGY REQKAGSThTRHAEELLRKKGADLLWCNARTSASGYYEK ______LGFSEQG3EVYDIPPIGPHJIMYKKLT SEQ ID 2H3 MMKPINhAED [Xfr)T-U IRPNQPLEACKYETDLLGGTFH NO:355 LGGYYQGEJISTASFHQAGHSELEGQKQYQL-RGMATLEG YRERKAGS TR T4AEELL ~RKGADLWCNARISASGYYKK ______LGFSEQGGVYDIPPIGHILMYKKLT SEQ ID 30G8 MIEVK2INAIEDTYELRBRILRPNQPLACMI=LLGG3AFH NO:356 LGGYYQGKLISIASFHQAFEHSELEGQKQYQLRGMATLEGY RBQKAGSTLIHAEELLRICKGADL.LWCNARTSVSGYYKK LGFSEQ(EVYDIPPIGPHJLMYKKLT SEQ ID 3B_10C4 MIEVRPNAEDTYEIBRRRIRPNQPLACMYELGTH NO:357 LGGYYRGKLISIASFHQAEHSETFBGQKQYQLRGMATLEGY RpQKAGSThIRHIAEELLRKKGADL.LWCNARTSASGYYKK LGFsEQGE-AyDiPPIGPBIMYKXLT SEQ ID 3B_10G7 M]EVYPINAIEDTYEIRUH1RPNQPLE-ACMYETDLLGGTFH NO:358 LGGYYRGKUSIASFHQAEHSELEGQKQYQL-RGMATLhEGY REQKAGS'II IRTAPPI 1RKKGADLLWCNARTSASGYYKK SLGFSEQGGVYDIPPIGPUJLMfYKKLT SEQ ID 3B_12B 1 M]EVXINAEDTYEIRHR]LRPNQPLE-ACMYETLLGGTFH NO:359 LGGYYRGKICSIASFHQAEHSELEGQKQYiQLRGMAThEGY REQKAGSTnHAELRKGADLLWCNARTSASGYYIK ______LGFSEQGVYIPPIGPHnMKKLT SEQ ID 3B_12Dlb MVKPINADTYERHIRNQPLEACMYETDLLGGAPH NO:360 LOGYYRGKISIASFBPAEEOEQKQYQLRGMA'LGY REQKAGSThuHAELLRKKGADLLWCNAR.ISASGYYEL SGPSEQGE.vyDippiGPHaIMYKKLT SE ID 3B -2E5 MTVPINAEDTYEIRHILRPNQPL-ACMYETDLLGGTFH - 184- WO 02136782 PCT11JS01/46227 NO:361 LQ3GYYRGKLISIASFHQAEHSELEGQKQYQLRGMATLEGY REQKAGSuITRMFJ T 1?TCGADLLWCNARTSASGYYEK LGFSKQGVDPGPIIJKL SEQ IID 3C_10113 M]EVNKP]NAED 1TYELF1R ITRPNQPLE-ACMYEIDUJIGTFH NO:362 LGGYYRGK[ISIASFHQABHSELEGQKQYQILRGMAThEGY REQKAGS ~ TRT-APFI J ] XGADI.LWCNARISASGYYKXKL ______GFSEQGGOVYDIPPVGPI3IlMYKKLT SEQ ID 3C_121110 MIKPINAED i ii~-Il 11RPNQPILEACMYETDLLGGTFH NO:363 LGGYYRGKLISIASFHQAFBSELEGQKQYQLRGMATLB-GY RGQKAGS 1TTAFPa RTCGADILWCNARTSASGYYEK ________LGFSEQ(3EVYDIPPIGPEJLMYKKLT SEQ ]ID 3C_9H8 MIPINAEDTYEIRBR]RPNQPLEACMYEPDLLGGTFH NO:364 LGGYYQDRUISIASFHQAEE1SELEGQKQYQLRGMATLE-GY REQKAGSTLIRYAEELLRKKGA-DLLWCNAPJSASGYYEKL GFSEQGIEVYDIPPIGPHILMYKKLT SEQ ID 4A-lB 11 MIEVIKPINAEDi YEUM R.~TRPNQPLE-ACNMYTDILGGTFH NQ:365 LGGYYRGKLISIASFHQAEE-PELE-GQKQYQLRGMATLE-GY RBQKAGsThuiRAEELLRKKGADLLWCNARTSASGYYEK _____________LGFSEQGEVYDIPPIGPILMfYKKLT SEQ ED 4A_1C2 MIEVICPINAEDTY iniP RnlRPNQPLBACKYETDLLGGTFH NO:366 LGGYYRGKLISIASFHQAEHSELBGQKQYQLRGMATLEEY REQ]KAGSThTRHAEELLRKKGADLLWCNARTSASGYYKK ______LGFSEQGEVYD]PPIGPHILMYKKLT SEQ ID) 43_13E1 MIEVIKP]NAED i x frEiRfl11RPNQPLEACKYE1TDLLGGTFH NO:367 LGGYYRGKULSIASFHQAEHFELEGQKQYQL-RGMATLEEY RRQKAGSTh1RHAEELLRKKGADLLWCNARISASGYYEK[L _______GFSEQGE--VYDTPPIGPH]LMYXKLT SEQ ID 43_13(110 MIEVKPINAEDTYEIRHILnRPNQPLEACMVYETDLLGGTFH NQ:368 LGGYYRGIKLIIASFHQAEHSELEGQKQYQLRGMATLEGY REQKAGSThIRRAELLRKKGADILLWCNARTSASGYY'KK ________LGFSEQGGVYDII'PIGPY]JIMYKKLT SEQ H) 4B_16E1 MIEVYPNAEDTYERHPJLRPNQPL-ACKYETDILIGGTFH NO:369 LGGYYRGKLJIASFHQAEHSELEGQKQYQL-RGMATLE-GY REQKAGSTLJRHAEELIRKKGADLLWCNARTSASGYYKK LGFSEQGGVYDIPPIGPBILMYXKLT SEQ ID 4B_17A1 MIEVKPJNABDTYEIRBPLRPNQPLEACKYETDLLCGTFH NO:370 LGGYYRGKLISIASFHQAEBPELR-GQKQYQLRGMATLEEY REQKAGSTh~iRAEEU-LRXKGADLLWCNARTSASGYYEK LGFSEQG3EVYDIPPIGPH]LMYKKLT SEQ4 I 4_18SF1 1 MJEVNp]NABEDTI RHR1RPNQPLEACMYETDLLGGTSH NO:371 LGGYYRGELTSIASFHNAEHSELDGQKQYQT-RGMATLEGY REQKAGSThRiA-EELLRKKGADLLWCNARTSVSGYYEK LC}FSEQG3EVYDIPPIGPHISMYKKLT SEQ ID 43_19C8 MIdEVKP]NAEDTIYEUIR1R1RPNQPLEACKYETDLLGGTFHI NO:372 LGGyyRGKL*siAsFHQAEEIpBLEGQKQYQLRGMATLEGY REQKAGSThaiHAEELLRKKQADLLWCNARTSASGYYEX LGTFSEQGGVYDII'PIGPHILMYKKL-A SEQ ID 43_1(14 M]BVKPINAIEDTYEIRHRJLRPNQPLR-ACKYEI1)LLGGAFH NO:373 L)GGYYRGKLISIASFHQSEHPLEGQKQYQLRGMATLEGY ______RELKAG5TIEFLSRKADLcATAGYK - 185- WO 02136782 PCT/USOI/46227 _____________GFSEQGEVYDIPPIGKMTMYKXKLT SEQ ID 4B_21ZC6 MIEVXKPIhAEDTYEEHRLRPNQPLE-ACMYETDILLGGTFH NO:374 LGGYYRGKUISIASFHQAEHSELEGQKQYQILRGMAThEEY REQKAOSTIUtRABELLRKKGADLWCNARISASGYYKXXL ________ 0SEQ0GVYDIPPIGPH11VYKYLT SEQ ID) 4B_2W7 YMVKNAD-1YERMRPNQPLEACMYETDLGTF NO:375 JGYYRGiKLSIAFQAEHSELBOQKQYQLRGMATLBGY REQKAGSTi 1APJEF IRTCGADLLWCNARTSASGYYKK _______LGFSEQ(GGVYGIPPIGPHIMYKIKLT SEQ I[) 4B_-218 MMAPNAEDTIYEREIRPNQPLEACKYTDLLGTF NO:376 LGGYYRGKITSIASFHQAEHSELEGQKQYQLRGMALGY RBQKAGS I 1T-APPE I1R TGADLLWCNARTSASGYYKK ________LGFSEQGEVYD]PPIGPBIAYKYLT SEQ ID 4B_6iD8 MIEVKPINAED .rTEIRHRT RPNQPLEACKYETDILGGTFH NO:377 LAGGYYRGKLISIASFHQAEISELEGQKQYQLRGMALGY RBQKAGSTLRHIAELRKKGADLLWCNARTSASGYXK ______LGFSEEHGEVYIPPIGPEaLYXXLT SEQ ID) 4B_7E8 MIEVKPINAIDTYE MiI1T1U RPNQPLE-ACMYETDLLGGTFH NO:378 LGGYYRGKLISIASFHQAEffSELEGQKQYQLRGMATEGY RBQKAGSTL1RHAEELLRKKGADLLWCNARTSVSGYYK ______LGFSEQG--EVYDIPPIGPHILMYKKLT SEQ ED 4C_8C9 MIEVKPINABDTiYEIRBULRPNQPLEACM-YETDLLRGAFH NO:379 LGGYYRGKISIASFHQAB-IELBOQKQYQLRGMAThGY REQKAGS~hIHAIERXKGADILWCNARTSASGYYEK ______LGFSEQGEVYDIPPIGPHnLMYKYLT SEQ ID) 4111 MIEVIKPINAEDTYE]RHRJLRPNQPL-A CMYETT)LLGGAFII NO:380 LGGYYQGKSIASHQAVHSELGQKQYQLRGMTLG YREQKAGSTLEHiAEELLRKKGADLLWCNA(TSVSGY _______KL-GPsEQGGvyDippiGpHaiLYKKLT SEQ ID 6_-14D10 m~vY]NAEDYE1RLRNQPLEACMYEfl)LLGGTFlH NO:38 1 LGGYYRGLSIASFHQAESEGQYQLRGLEY REQyAGsTLHuAELRKGADLLWCNARTSASGYKK LGFSEQGGVYDIPPVGPUJLMYKRLT SEQ IID 6_15G7 MIE VKPINAEE)TYEIRHRIRPNQPLEACKYETDLLGGTFH NO:382 LGGYYRGKLISIASFHQAEE{ELEGQKQYQLRGMATLEGY REQKAGsThRHAEEjLLRXGADILWCNARTSASYYK LGiFSEQ(7EVYDIPPVGPBILMYKIKLT SEQ IID 6_16A5 M]EvIK2INAEDTYIImHR]LRPNQPLACKYETD-JGGTFH NO:383 LGGYRGLSIASFQAEHSEEGQKQYQMALGY RBQKAGS ~1TR1~iAPJE I RCGADLWCNARTSASGYYKK LGFSEQC)GVYDIPPVCGPHflMYKYJLT SEBQ ID 6_16F5 M]EVKPINAEDTYEaRRLRNQPLEACNMEIILLGGTFlH NO:384 LGGYYRGKLISIASFHQAVHSEEGQKQYQLRGMTLFY REQKAGST.IRH~1AEELLR1KKCFGAD.LWCNARTSASGYYKK ______LGFSEQGGVYDIPPVGPH)UlvYKKLT SEQ ID 6_17C5 MIEVKPIhAEDTYEIRHILRNQPLE-ACKYEADILGGIFH NO:385 LGGYYRGKLISIASFHQA EBGZCQKQYQLRGMATLE-GN REQKAGST' 1MHA1EEr Ii- TRGADLWCNARTSASGYYKK LCGFSEQGEVYDVPPIGPHMFMYKK1T FSE ID 61C I VPNAEDTYE]R RNQLEACRYBETDLL GTFI -186- WO 02/36792 PCTITJSO1/46227 NO:386 LGGYYRt2KLISIASFHQAEEHPELEGQKQYQIRGMATLEGY RBQKAGS i TRT-AFPJJ RTCGADLLWCNARISASGYYKIKL _____ FSQEYDPVGHLMKL SEQ ID 6_10D7 MEVKPNAEDTYtPXR1RPNQPLEACMYEIDLLGGIFH NO:387 LGGYYRGKLISIASFHQAFHPELGQKQYQLRGMATLEGY REQKAGSTRMAElRKKGADLLWCNARTSASGYYKK LGFSEQ(GYYDIPPVGPHILMfYKKLT SEQ ID 6_19A10 MEEAKPINAEDTYERR3LPNQPLE-ACMYETDLLGGTFH NO:388 LGGYYRGKLISIASEEIQAEBPELBGQKQYQLRGMATLEGY REQKAGS ji RT-AEJPI T .1TGADILWCNARTSASGYYK LGFSIB GEVYD]PPTGPHLYKKLT SEQ ID 6_19B6 MIEVKPINAEDI YI.ERRUIRPNQPLEACMYE'TDLLRGAFH NO:389 LGGYYRCCLISIASFHQAESELEGQKQYQLRGMATLEGY REQKAGSTaHIRAEELLRKE(GADLLWCNARTSASGYYKK LGFSEQG7EVYDIPPVGPEILMYKKLT SEQ ID 6_19C3 NEEVKPNAEDTEHRUQPACY3)LGF NQ:390 LGGYYRGKLISIASFHQAEHELEGQKQYQLRGMATLEGY RBQKAGSTLdiRAEELLRKKGADLLWCNARTSASGYY'KK ________LGFSEQG3EVYDIPPIGPHJ1MYKKLT SEQ ID 6_19C8 MIEVKPINAEDTYZIHR]L.RPNQPLE-ACKYETDLLGGThH NO:391 LGGYYRGKLIIASFHQAEBPELEGQK1QYQLRGMATLERGY REQKAcISThERQAEELLRKKGADLLWCNARTSASGYYKK LGFSEQGGVYDIPPVGPHILMYYELT SEQ ID 6_-20A7 MIEKPIAEDTYETRHRIRPNQPLEACMYETDLLRGTFH NO:392 LGGYYRGJSIASFHQAEHSDLEGQKQYQLRGMATESY REQKAGSTURRAERKKGADIiWCNARTSASGYYK ______LGFSEQC3-EVYDIPPVGPHMIMYKKLT SEQ ID) 6_20A9 MI~fEVPNAGD I xLtFPTF PNQPLEACKYETDLLGGTFH NQ:393 LGGYYRGKUSIASFHQAEHSELEGQKQYQIRGMATLEGY REQKAGSTh]RHAEELLRKKGADLLWCNARTSASGYYKK LCTFSEQ(3GVYDII'PVGPHLMYKKLT SEQ ID 6_20H15 MEVWA)NAEDTYEJR1PNQPLE-ACKYETDILGGTFH NO:394 LGGYYRGULSIASEE1QAHSEEGQKQYQLRGMTLGY RBQKAGSThHAELRKKGADULWCNARTSASGYK ______LGFSEQ(3EYYDIPPIGPILMIYKKLT SEQ ID 6_21F4 MIEVKYINAEDTYERIRVLRFPNQPLB-ACMYETDL~LGGAF NO:395 HLGGYRGKLSIASFHQAEHPELEQQKQYQLRGMATLEG YREQXAGSThIRHAEELLRKKGADLLWCNARTSASGYYK KLGFSEQGEVYDVPPVGPHILMYKYKLT SEQ ID 6_22C9 MEEVKPJNAEDY El ~fMTR1 RPNRPLEACMYE1TLLGGTFH NO:396 LGGYYRGKLISIASFHQA BGLEGKKQYQLPGMATLEEY REQKAG5ThRHAEELLRXKGADLLWCNARTSASGYYKK LGPSEQGGVDJ PPVGPHELMYKIKLT SEQ ID 6_22D9 NEEVKPINAEDTyBIRRPJLRNQPLEACMYEfl)LLGTFH NQ:397 LGGYYRGKLISIASFHQAEHSELEGQKQYQLPGMAThEGY REQKAGS TRHAEELT 1LRXfJADULWCNAPRTSASGYYKK LGFSEQG--EVYDIPPVGPHILmyKXLT SEQ ID 6_22119 M[EVPNAEDTYEJRIPNQPLACMYElTDILGTF NO:398 LGGYYGLSIASFHQAESEGQKQYGALOEY ______ ____REQKAGSI1 1-1AP1' CRiDLCARSSY -187- WO 02(36782 PCTIUSOI/46227 LGPSEQGEVYD]PPIGPHRIMYKXLT SEQ ID 6_23H13 hMVKPINAEDTYEIHR]RPNQPLEACMYGTDLLQGTFH NQ:399 LGGYYRGICLISIASFHQAEQPELBGQKQYQLRGMAThBGY REQKAGSThIRALkLRKI(GADTLWCNARTSASGYYKK LGFSEQ(GGVYD]PPVGPHILMYKCLT SEQ ID 6_23I17 MIEVKPINAEDTYBIRRIRPNQPLEACMYETDLLGGTFH NO:400 LG3GYYRGKLISIASFHQAEHSELBGQKQYQLRGMATLBGY REQKAGST~ LMHFAPF1 .1 TGADILLWCNARTSASGYYKKI GFSEQGGVYDIPPVGPHIIMYKYIT SEQ ID) 6_2111H MIEVKPIAEDTYEIRHRVLRPNQPLE-ACMYETDLLGGTF NO:401 BLC3GYYRGKLISIASFHQAEBHPELGQKPYQLRGMATLEG YREQKGSTLHAEELRKKADLLWCNARTSASGYYK SEQ ]ID 6_ 3D6 MEPINAEDYIHRILRPNQPLACMYETDLGGTFHL NQ:402 GGYYRGKUIASFHQAEBPETLBGQKQYQLRGMATLEGYR SEQ ID 6_303 MIEviNAEDTy1nuiRRpNQPLACMYEIDTLLGGTFH NO :403 LGGYYRGKLISIASFHQAEHSELEGQKQYQLRGMATLEGiY REQKAGS iTRT4APPT I 1KGADLLWCNARTSASGYYXE( _____________LGFSEQGEVYDIPPVGPEILMYKIKLT SEQ ID 6_3H2 miEviKpINAEDTyEIRBR]LRPNQPLEAcMYBITDLLGGTFH NO:404 LGGYYRGKLISIASFHQAEHEEFGQKQYQIRGMAThEEY REQKAGSTLIRHAEELLRKGADLLWCNARTSASGYYKK ______LGFSEQG3EVYDIPPVGPHM)MYE-KLT SEQ ID 6_4A10 MEVKPIAEDYI-ZILRQPLACMYE'IVLLGGF NO:405 IGGYYRGKLISLASFHQAEHPELEGQKQYQLRGMATLEGY RBQKAGS ~ T-IHAEEJ 1RKKGADLWCNARTSASGYYKK ________LGFSEQGEVYDIPPVGPHJLMYKYLT SEQ ID 6_4B 1 MIEVYNADTYIUVLRPNQPLEACMYETDlLLGGTF NO:406 BLGGYYRGKLIGIASFHQAEBPELBGQKQYQLRGMATLE GyREQKAGSTLIRHAEL-RXKGADILWCNARTSASGYY E=LFSGQGEV-YDIPPIGPH)UMYKKLT SEQ ID 6_5D1 1 M]EVIK1)NAEDTYRTRERILRPNQPLFA-CMYETD1LGGTFH NO:407 LGGYYRGKLSIASFHQAE1PEGQKQYQLRGMATLEY REQKAGSTh1RHABELLRKcKGADLLWCNARTSASGYYK ________LGFSEQGEVYDIPPIGPHELNYKKLT SEQ ID 6_5F1 1 1EVINADTYEII~JRPNQPLF-ACMYETLLG1TFH NO:408 LGGiYYRGKLISIASFHQABHPELEC3QKQYQLRGiMATWEEY REQKAGSThJRHAEELLRKKGADLLWCNARTSASGYYK LGFSEO(GEVBDIPPVGPH]LMYEKKLT SEQ ID) 6_5(9 MJBVYPINAEDTYEaIRRRPNQPLEACMYELTLLGGTH NO:409 L.GGYYRGKLIIASFHQAEIISEEGQKQYQLRGMATLEEY RBQKAG3ST ji T-AEPJTILT(TGADLLWCNARISASGYYKKL ________ GFS QGYIPGPBILMYKIKLT SEQ ID 6_6iD5 NMEVKPIAEDAYEIRHRILRPNQPLFEACKYETDLLGGTFH NO:410 LGGYYRGISIASFHQAE1SELEGQKQYQLRGMALFGY REQKAGSThTRHAEELLRKKGADLLWCNARTSASGYYKK ______LGFEQGGVYDIPPVGPHLvrYKKLT SEQ ID j6 7Dl IEVKPINAEDTYERHRLRPNQPLEACMETDL-RGAFH -188- WO 02136782 PCTTJS01146227 NQ:41 1 LGGYYRGKLTSIASFHQAEHSELEGQKQYQLRGMATLEGY SEQ ID 6_8113 MTEVKPINABD Y IR~H1l ILR IfQPLEACMYETDIWLOTFH NQ:412 LGGYYRGKUSIASFHQAEH3PELEGQKQYQLRGMATLEGY RBQyXAGSTL'iJtHAPI IRTCKGAD...WCNARTSASGYYKK LGIPSEQ(GGVYDIPPVGPHJLMYKKLT SEQ ID 6_9G1 1 MIEVKPIhTAEDTYEI]LRPNQPLEACKYETDLLGGTLH NO:413 LGGYYkGKLJIIASFHQAEHSELEGQKQYQ-RGMATLEGY RBQKAGSIhIRRiAEELRKKGADLLWCNARTSASGYYKK ________ LGFSEIPPVGPILMYXKLT MEDLGF SEQ ID 6F1 MIEVPINADTYhIHURULRPNQPLAVYDLG'h NO:414 LGGyyRGyLVCIASFHXAEEHsELEGQKQYQ'LRGMATIDG YREQKAGSTaHIEAELLRIKKGADLLWCNARTSVSGYYE _______ ______ KGFSEQGEvYDEIPPGPBHIYKIKLT SEQ ID 7_1 C4 MJBYKPIhTAEDTIYkE1IT-RTJ PNQPLE-ACMYE'TDLLGCTTFH NO:415 LGGyyRGLCLISIASFHFQAEHPELBGQKQYQLRGMATLEEY REQKAGSTLTRRJAEELLRKGADLWCNARTSASGYYK _________LGFSEQGGVYDIPPIGPHELMYKKLT SEQ ID 7_2A10 MIEVKCP]NABDTYEIRHRILRPNQPLE-ACKYETDLLGGTFHI NO:416 LGGiYYRGKUSIASFHQAEBPELBGQKQYQLRGMATLEGY REQKAGSTL]RHABELLRXKGADLLWCNARTSASGYYKK ______LGFsEQ:GvyDiPPIGPHILMYKKLT SEQ ID 7_2A11 M Ev1KNAETYHIRPNQPLEACETAA NO:417 LGGYYRGKSIASFHQAERSEEGQKQYQLRGMALGY REQKAGSTLIRHAEELU-RXKGADLLWCNARTSASGYK LGiFSEQG3GVYDIPPVGPBU-MYKIKLT SEQ ID) 7_2D7 MMBv~NAED-YERHRURPNQPLACKYET)U-.GGTFH NO:418 LGGYYRGKISIASFHQAEEGQKQYQILRGMALEGY RBEQKAGSTLIRHAEELLRKKGADILWCNARTSASGYY'KK ______LCTFSEQGEVYDII'PVGPIUL1MYICKLT SEQ ID 7_5C7 MI~cjAE)Yn-R RNPFAMEDLGF NO:419 LGGYYRGKUJSIASFHQAEHPELEGQKQYQLRGMATLEGY REQKVGSTUIRIAEELLRXKGADILLWCNARTSASGYYKK _____________LGFSEQ(3GVYDIPPVG-PT-LMYKKLT SEQ ID 7_9C9 IrEEvKpINAEDTYERHR1RPNQPLE-ACMYETI)LLGGTFH NO:420 ILGGYYRGKLISIASFHQAEEPELEGQKQYQLRGMALGY RBQKAGS ~ITL-TAPaRK TKGADLLWCNARTSASGYYKK LGFSEQ(3EVYDIPPIG~PHIUV1 YKXLT EDIGF SEQ ID 9_13F10 MIVPNETEMIRNPJ-CY NO:421 LGGYYRGiKLVSIASFHQAERSELE-GQKQYQLRGMAILEE _____________KLGIFSEQGEVYDIPPTGPBIMYKKLT SEQ ID 9_13F1 MIEAKPINAEDTYEIRPJRPNQPLEFACMYFPDLLGGTFH NQ:422 LGGYYRGKLVSIASFHQAELEQQKQYQLGMTLE YREQKGSTLTHfABEU..RKKGADLLWCNARTSASGYYK KLGFSEQGEVYDIPPVGPHILMYKKLTA SEQ ID 9_15D5 IMEVKNAEDTYEaRRPNQPDACKYEI)LLCGTFH NO:423 LGGYYRGKISIASFHQABRELGQKQYQLRCJMAGY ______ EQKAGSTLnURHAM T DVrIKAM T WCARTSASGYYKK -189- WO 02136782 PCTUSO1/46227 ______LGFSIEQGEVYDII)PVGPHfflAYKKLT SEQ ID 9_15D8 MIEVYPINAEDTYEHRII-RPNQPLEACVYE'IT)LLiGGTFH NO:424 LOGYYRGIKLVSIASFHQAEBPELGQKQYQIRGMATE yREQ]KAGS jI RH-A1EALLRKK T<GADILWCNARTSASGYYK _______ KLEI EQEV YDTPPVGPHILMYKKLT SEQ ID 9_15H13 NEVKIADYEBIRPQL-(M NLG NO:425 LGGYYRGKLISIASFHQAFRPEEGQKQ YQLRGMATLEEY HBQKAGSTLJRTIAEFJ L]RTCTGADLWCNARTSASGYYIK ______LGTFSEQGEvYNTPPvGPHIMYKKLT SEQ ID 9_1812 MIEVKpINAEDTYETRIHRILRPNQPLEACMYE1TLLG3GTFH NO:426 LGGYYRKSIASFHQAEPEVCQKQYQGMTLEY RBQKACGS ~ TRHFAEP I1UGADLLWCNARTSASGYYIK _______LGiFSEQGIEVYDIPPVGPBILMYKIKLT SEQ ID 9_20F12 VPNAEDTYERRVLRPNQPLEACMYETDILGGTF NO:427HLG3GYYRGELVSIASFHQABPELEGQKQYQLRGMA'ILE NQ47GyRQKAGSThH~lAEELLRKKGADI.WNARTSASGY ______KKLGFSEQGGVYDIPPVGPBILMYKIKLT SEQ ID 9_21C8 MEVPNAEDTY~flRPNQPLMEADCYTF NO:428 LGYYRGKL-ISIASFHQAEHPELEGQKQYQLRGMATLEGY REQKAGSTLERHAEELRKGADLLWCNARTSASGY'YKK LGFSDQG3EVYDETPvPVGPHMYKKLT SEQ ID 9_22Bl 1 EVKPNAEDTYEIIP=NQPLACKYDIIIGTI NO :429 LGGYYRGKLVSIASFHQABEEGQKQYQLRGATLG YREQKAGS ~RAELKGDLcNARSASGYYK _______ KGFSEQGEVYDIPPTr-PTTTMYKKLT SEQ ID 9_23A10 MVKPNAEDTYEIRPNQPLCKEACOLGT NO:430 LGGYYRGKLVSIASFHQAPELBGQQYQLRGALFG yRGQKAGSThXHIAEELLRKKXGADLWCNARTSASGY ______KLGFsEoGGvyDippvGpBJLMYrKKLT SEQ ID 9_24F6 IaVKpNAETZERiLuRpNQPLEACK-YETLIRGAFH NO:431 LGGYYRGKLISLASFHQAEHSELEGQ!KQYQLRGMAThE-GY NO:3 1REQKAGSLUHAEALLRKKGADLLWCNARTSASGY'K ______LGFSEQGIEVYDIPPTG-PTI-T MYXKUT SEQ ID 9_4H110 NEEVKYpINAE[DTYEMIRR1RPNQPLE-ACKYETDLLGGThH NO:432 LGGyyRGKLUSIASFHQAPELEGQQYQGATGY REQKAG5ThIRHAEELLRKGADLIWCNARTSASGYYKKL GFSEQGEYIPVIILYKT SEQ ID 9_4H8 MBVPNADTYBRILRPNQPLMEACLLmrGF NO:433 LCGGYYRGKIASFNQAEEGQKQYQMALGY NQ43REQKAGSTLJAELRKKGADLLWCNARTSASGYE ________LGFSEQGEVYDII)PVGPHMhMYKKLT SEQ ID 9_811 MN VX~prrAED)TyEJERRLpNQPLE-AcKYEDiL LGTFBL NO:434 GGyyRGKISIASFQAEPELGQKQYQGALGYG EQKAGST H R-AFaRT KKTRGADLWCNARTSASGYYKKL ______GFSEQGEVYDIPPTr-PTflMYKELT SEQ ID 9_9H7 MEVPNAEDAYERFLRNQPLEACKYELLSF NO:435 LG-GYYRGKIASFQAELGQKQYQATEE REQKAGS jITR1TA1~PI TP TCTADLWCNARTSASGYYKK ______ _____LGFSBQGEVYDIPPVrGPT-ILMYKKLT SE M ID 9 C6 ME~NETYERNQP EACM=TLLGTF -190- WO 02136782 PCT/USOI/46227 NO:436 LGGYYQGKLISIASFHNAEHISELEGQKQYQLRGMATLEGY REQKAGSThTRHARELLRKKGADLWCNARTSVSGYYK ________LGFSEQGEVYDIPPVGPHJLMYKKLA SEQ ID 91111 MHVINADTYEEIIIRPNQPLE-ACKEDLJUT NO:437 LGGYYRGKLISIASFBXAEHSEGEEQYQLRGMATLEGY REQKAGsTh1RHAEELLRKXGADLLWCNARTSVSGYYK ________ LGFSEQGEVYDIPPIGPJLMYXELT SEQ ID 0_4B 10 MIEVYPINAEDTYELRHECERPNQP1EACMYESDU-RGAFH NO: 438 LGGFYRGKLISIASFHQAEHSDLEGQKQYQLRGMATLBGY RDQKAGSTLhHABEELRXKRGADMiLWCNARTTASGYYKK~ LGFSBQGEEFDTPPVGPHILMYKRLT SEQ ID Q0B11 MIEVKPNAEDTYELRRIGLRPNQPIEACMYESDIRGAFH NO:439 LGGFYGGalISIASFHQAEIHSDLEGQKQYQLRGMATLEGY RDQKAGSTLXBAEQLLRKRGADMLWCNARTSASGYYK _______ _____ LGPSEQGEVFE'I?PVGPHJLMYKKIT SEQ ID 0_5B3 MLVKPIhTAEDTYELRHRLRPNQPIEACMYEITDLLRGAFH NQ:440 LGGFYRCTKLISIASFHEQAEhSELQGQKQYQLRGMAThBGY RDQKACGSSLtKHABEQULRKRGADLLWCNARTSASGYYKK ________LGFSEQG3EVFDTPPVGPHLMYKRI1T SEQ ID 0_5B4 NHEKIADYLHIRNPECYTLRA NQ:441 HLC3GFYRGLSIASFHQAEHSDLGQKQYQLRGMALEG FRDQKAGSST .WTP vf RGAN~LLWCNARTSASGYYKK LGFSE GEVFDTI>PVGPHILMYKRIT SEQ ID 0_538 MIEVKPINAEDTYELRBKILRPNQPIEACMfYEDLLRGAFEH NO:442 LGGRGSIASFHQAEISDLQGQKQYQRGATGY RDQKAGSSLIRHIABQ]LRXGADLLWCNARTSASGYYKK ________LGFSEQC3E[FDTPPVGPBILMY-KRLT SEQ ID 0_5C4 MIEVKPINABDTYELREXTLRPNQPLEAGMTYETDLLRGAF NO:443 HLGGFYRGKUSIASFHQAEHSGLQGQKQYQLRGMALG YREQKAGSSTEKAEEIRKKGADLWCNARTSASGYK ________LGFSEQGEIFDTPPVGPEJLMYKRIT SEQ ID 0_51)11 MIEVKPINAEDTYELR1IRJLRPNQPIEACMYESDILLRGAFH NO:444 LGGFYRGKISMIASFHQAEHSDLQGQKQYQLRGMATLEGY REQKAGSTLH~AQULRGADLLWCNARTSASGY]K LGFSEQCTEVFDTPPVGPHHIMYERLT SEQ ID 0_51)3 IvIIVKPINAEDTYELRHRLRPNQPIEACMYESDLLRGAFH NO:445 LGGYRGKLISIASFHQAHSEQGQKQYQLRGMTLGY RBQKAGSST 1XT-A1EiTR1YGADLLWCNARTSASGYYKKL _____GFSEQcE1ETPPVGPBILMYKRIrr SEQ ID 0_51)7 MIVKPNAETYELRRLRPNQPIEACMYZIDLLRGAFH NO:446
LGGFYRGKLIIASFIIQAEHSELEGQKQYQLRGMATLE(
3 Y RDQKAGSSIRHAEQUKGNMLWCNARflGY ________ KLGFSEQGEIFTPPVGPI3JMYKRrr SEQ ID 0_64 Mg-VKINAEDTYELRRLRPNQPIEACMYESDL.LRGAL NO:447LGGFYRGKLIIASFHQAEHSDLQGQKQYQLRGMAThE-GF NO:47RDQKAGssLHAQIIEGADILWCNARTSASGYKK _______ ______LOFSEQGiKVEIDTPPVGPHLMYKRI1T SEQ ID 0_61)10 MLEVKPINAEDTYELRHK]LPNQPL-EVCMYETDLLRGAF NO:448 BIrCGFYRGKLISIASFHQAEHSDLQGQKQYQLRGMATLEG NO:48YRDQKAGSSLTRIAQ]LP-RGADMLWCNARTSASGY -191- WO 02/36782 PCTUSOI/46227 ______ LGFSEV-L-x=F-TVGPHEMYKRLVWTT SEQ ID) 0_6D11 IIEVKP]NAEDTY siRT-R U RPNQP]EACMYESDLLRGTAFH NO: 449 LGGYYRGSIASFHQAEHSDLQGQKQYQLRGATLGF RDQKAGssLzHAEQIIYGADILWCNARTSASGYK LGFSEQ(3EVFETPPVGPHI1MYKRIT SEQ ]D 0_6F2 MIEVKPINAEDTYEL.RRTLRPNQPIEACMYESDLLRGAFH NO:450 LGGYYRCTKLSIASFHQAHSEQGQKQYQLRGMBCF REQKAGSflHU~AEQIIRRGADNLWCNARTSASGYIE LGOF GEPTPPVGPHILMYKRJT SEQ H) 0_6119 MBVIKPIAEDTjYBt.iRMIWIT PNQPIEACMYETI)LLRGAFH NO:45 1 LGGFYGG1LSIASFHQAEHSDLEGQKQYQLGMATEY RBQKAGS i1T-THAPJFF MTRKKANLLWCNARTSASGYYKKL ________ FSEQGEVFDTPPVGPHILMYKRLT SEQ ID 10_4C10 MIEVIKPIhTAEDTYELBKI - PNQPLEVCMYETDLLRGAF NO:452 BIGGXRGKSIASFHQAEHSELQGQKQYQLRGMTLG yRDQKAGSSLuKELABQRKRGADxLWCNARTSASGYK ______ LGFS QEFTPPVGPH1LMYKRLT SEQ ID 10_4D5 NUVPNETERHLPQIVMED.RAF NO:453 LGGFYRGKLJSIASFHQAEHSDLQGQKQYQLRGMATLEGY RBQKAGSTaHn1ABQIRKRGADLLWCNARTSASGYYKKL ______GFSBQGEVFDTPPVGPfl]IAYKRrr SEQ ID) 10_4F2 N-VINAEDTYELRHRILRPNQPIEACMDLLRGAFH NO:454 LGGFYRGLSIASFHQAEHELQGQKQYQRGATGY REQKAGSSLHAEURGADMLWCNARTSASGYKK LGFSEQ( IEITPPVGPBOL-MYXRLT SEQ ID 10_4F9 NMVKIAEDTYELRRRNQPEVCYETDLLRGAFH NO:455 LGGFYRGiKUSIASFHQAEHSELQGQKQYQLRGMAThEGF ________ (FSEQCIEIFDTPPVGPHJLMYKRLT SEQ ID) 10_4G5 MEVYNAEDTYELRRmIRPNQPIACMDISDLLRGF NO:456 LGOGYYRCTKLSIASFHQAEHSDLQGQKQYQILRGTMATLEG YRDQKAGssijRHAEQ]LRKRGADULWCNARTSASGYYK IKLGFSEQGEIFDTPPVGPRLLMYKRLT SEQ BD 10_4H4 MLEVKPIAEDTYELRHKTLRPNQPLEVCMYETDLLRGAF NO:457 HLGYCKISHAE-SQQQQRMTE yREQKAGSSLfAEEEIRXRGADLLWCNARTSASGYK ________ LFSEQGEVFDTPPVGPHILMIRI SEQ ID 11_3A1 1 MIEVXFINAEDTYELRHKILRPNQPIEVCMYESDU-RGAIH NO:458 LGCJFYRGKLISIASFHQAEBPDLQGQKQYQL-RGMATLEGY RDQKAGSSLUHRAQILR'RGADULWCNARTSASGYK ______LGFSEQGEVFETPPVGPHILMYKRLT SEQ ID 11_3Bl vLEVKPNAEDTYELRHPJLRPNQPIEACMEII)DLLRGAFPH NO:459 LGGFYRGKUSIASFBQAEEISDLQGQKQYQLRGMALGF ________GFsEQcyEIDTPPVGPHJLmyKRLT SEQ ID 11_3B5 mi~EvKPNAEDTYELRHRLRPNQPIEACESDLRAFH NO:460 LGGYYRGKJ.IIASFHQAEHiSELQGQKQYQLRGMATLEGY RDQKAGSSLIAEQURUGADMLWCNARTSASGYK LGFSEQGEVFDflPPYGPEUYKRIT S ID 111 3C12 IMEVKPINABD IR-R PQLVMEDLGF - 192 WO 02136782 PCTf[USO1I46227 NO :461 LGGFYGGKLISIASFHQAEBPDLQGQKQYQLRGMATLEGY RDQKAGSSURRAEQLLRKRGADU-LWCNARTSASGYYKK ______LGFSEQGEIRTPPVGPILMYKRIT SEQ ID 11_C3 MIEVKY]NARDTYELRIHCILPNQP]BACMYEDLLRGALH NO:462 LGGYYRGKLISIASFHQAEEHSELQGQKQYQILRGMATLEGY RBQKAGSSI ]1KT-TAV~fl ER GADL.LWCNARTSASGYYXXL ________GFS QEFTPGPHILMYKRI1T SEQ ID 11_3C6 MLEVKA)NAEDTYBLRH~tLRPNQPIEACMTIESDILRGAFH NO:463 LGGFYGGKLTIASFHQABHSDLEGQKQYQLRGMATLEGY REQKAGSTLHTRAEEILRKRGADLLWCNARTSASGYYKIKL GFSQGEFTnPVGPHII2VYKR1 SEQ ID 11_3D6 MIBVKPINABDTYELRHRILRPNQPIEV CMYETDLLRGAFH NO :464 LGC*FYRGKLISIASFHQAERSDLQGQKQYQLRGMATLBGY SEQ ID U1_i2 MLVK)INAEDTYELRRILJRPNQPIEVCMYETDLLRGAFH NO:465 LGGTFYGGKLJSIASFHQAEHSELQGQKQYQLRGMATLEGY RDQKAGSSLHXAEEILRKGADLLWCNARTSASGYYIL GFSEQGIEVFETPPVGPHaIMYKRLT SEQ ID 1_HI MIEVKPINAEEI YE1 RB-TWT .INQPIEACM-YESDLIRGiSFH NO:466 LGGFYRGQLISIASFBKAEHSELQGQKQYQLRGMAThP-F REQKAGSSILHEEIL2KGADLLWCNARflrASGYYKRL CIFSEkHGEVFETPPVGPBILMYKR1 SEQ ID 1_H2 MgVKPNAEDTYELRRI.RPNQPLEACMYESDLLRGSFH NO:467 LGGFYRGKLSIASFHQAFSELEGQKQYQLRGMAThBC-F REQKAGSSL1HEELKGADLLWCNAR1TAAGYYKK LGFSEQGEBWDTPPVGrPIIMYKRIT SEQ ID 1_1H-5 MIEKPNAED YIRLRNQPLEACMYSDLTLRGSFH NO:468 LGGFYRGKLISIASFIIQAEHSDLEGQKQYQLRGMAThGY RDQKAGSSLaHIAEQILRKRGADILWCNARTTAAGYYKR _______LGFSEQGEVEDTPPVGPELYEIKLT SEQ ID 1_2A12 miEVIKPINAEDTYBLRHPJLRPNQPIEACMYESDUJ-RGSFH NO:469 LGGFYRGKLUSIASFHQAEQEEGQKQYQLRGMALGY RDQICAGSTHAELRKKGADLLWCNARTSAAGYK LFEGEIFDTPPVGPEILMfYKRLT SEQ ID 1_2B6 MBEVNAEETYELRBICRNQPLEACM4YEILGSFH NO:470 LGGFYRCTKLISIASFHQAEHSELB GQKQYQLRGMATLEGF RDQKAGSSIL AE LRKRGADLLWCNARTSASGYYKKL CIFSEQGiEIFETPPVGPBILMY]KRLT SEQ ID 1_,2C4 MLEVIKPINAETE I(L~NPECYTLLRGSFH NO:47 1 LG)GFYRGQLISIASFHQAEE{SDLQGQKQYQLRGMATLEGY RBQKAGS ITCRLA~RKKR GADLLWCNARTrAAGYYKK LGFSEQGEVIEDTPPVGPHMMYK91T SEQ ID 1_2D2 IEEPINAED I xELR.IEKT 1PNQPLEACMYESDILRSAFH NO:472 LGGFYRGKE.ISIASFBXAEHISELQGQKQYQLRGMATLEGY RDQKAGSSLIRAEEILRKRGADMLWCNARTSAAGYYKR ________LGFSEQGEVFIX[?PVGPHHELMK SEQ ID 1_2D4 MIEVIKPINAEDTYELRHJLRNQPIBACMYEDLJRGSFH NO:473 LGCJFYRGKLSIASFHQAEH-SDLQGQKQYQLRGMA'TLEGY ___________REQKAGSSLIAEQLLRKKGADMLWCNARTSAAGYYK - 193 - WO 02/36782 PCTIUSOI/46227 SEQ H) 1 RLGFSEflGEIFETPVGPIILMYKr= JTDLR SEQ ID 12F8 MVKPIhTAEDTYELRBRLRPNQPLEACMYTLRS NO:474 HLGGFYRGKLISIASFHQARHSELE3Q.KQYQLRGMATLEBG YRDQKAGSST LM-AETF ,R1GADMLWCNARTrAAGYYK KLGFSEQGEIYDTPPVGPHIlAVYCKLT SEQ ID 1_2H8 MEV-KPINAEEYELRHRPNQPLF-ACMYE~T)ILRGA 1 H NO :475 LGGFYRGKLISIASFHQADHSELQGQKQYQLRGMATLEGY REQKAGSThIRHAEQLRXRGADLLWCNARTSAAGYYXX LGFSEHGEJFETPVGPILMfYRLT SEQ ID 1_3A2 MIEVKPINAEDTYELRHRILRPNQP]EACMYESDLLRGAFH NO:476 LGGFYRGKLISIASFHQAEHSDLQGQKQYQLRGMATLBGY REQKAGSSIIRRAERKKGAD%iLWCNARTfTAAGYYIR LGFSEQGEVFI)TPPVGPHILMYKRIT SEQ ID 1_3D6 MIEVKPINAEDTYELRHKILRPNQP]EACMYESDLLQGSFH NO:477 LGGFYRGQUISIASFHQAEHSDLQGQKQYQLRGMALGF REQKAGsnhIKHAEE]LRKKGADLLWCNARTSAAGYYK LGFSEHGEIF-DTPPAGPH]LMYIKLT SEQ ID 1_3F3 !MdEVYPNAEETYELRQRILRPNQPIEACMYESDIJLRGSFBL NO: 478 GGFYRGQLISIASFHQAEHSELQGQKQYQLRGMATLEGYR EQXAGSTL1.WIAEEF11 PEXADl.LWCNARTSAAGYYKRL GFSERGEIFDTPPVGPHIAYKRIT SEQ IID 1_3H2 NEVKIAD~RM"QIAMFUG NO:479 LGGYYRGQUTSIASFHXAEHSELQGQKQYQLRGMATLEGY REQIAGSTLHMAQREKGAD1MLWCNARTSAAGYK RLGiFEQGVFDTPPYGPMIMYKCLT SEQ IID 1_4C5 MIEVKINABIJTYELRHI(IRPNQPIEACMYESDLLRGSFH NO:480 LGGFYRGKLTSIASFBICAEHSDLEGQNQYQLRGMAThEGY REQYKAGSTLflILEELRKRGADMLWCNARTSASGYYKR ______LGFSEHGEIEDTPPvGpiialMYKRLT SEQ ID 1_4D6 MLUVYPINAEDTYELRERILRPNQPIEACMYETDU-RGSFH NO:48 1 LGGFRGQLSIASFHKABHSDLBGQKQYQLRGMATLEGY REQKAGSTLhIEQflRKRGADMLWCNARTSAAGYY LQFSEQGEVFETPVGPEJLMYKRLT SEQ ID 1-4M MIEVyp]NAEDTyELRE{RuLRPNQPLEACMYETDLLRGSFH NO:482 LG3GFYRGKLIIASFHQAESDLQGQKQYQLRGMALFGY REQKAGSThJIAEQLLRKRGADLLWCNARTSASGYK LGFSEHG3EVFDTPPVGPHJMYKfRLT SEQ ID 1_5H5 MLVPNA~rBLRKJLRPNQPL-ACMYESDLLRGSFH NO:483 LGGYYRGQLISIASFHQAEHSELEGQKQYQLRGMAThEGF REQKAr-STLIUcAEQ]LRKRGADMLWCNARTSAAGYYKK _____________LGTFSEBGEIFDTPPVGPHILMYKKLT SEQ ID 1_6F12 MMKIAEYLRIRNPECYSLRSB NO:484 GGFYRGKLSIASFHQAEHSDLEGQKQYQL-RGMATLGYR DQKAGSTHABKYRGADMLWCNTSAGY _______ ______ GFSEHGEYETPVGPHJLMfYKKIT SEQ ID 1_6H6 M[EVYPINAIEDTYELRHKIRPNQPIEACMYESDLLRGSFH NO:485 LGGFYRGQLISIASFHQABHDLEGQKQYQGATGY RDQKAGSST .WT-AEERI XGcADLLWCNAR.TSAAGYYKR LGFS GEIFDTPPVGPHMU~YKKJ SEQ ID 13 -194-
GF
WO 02/36782 PCTfUS01J46227 NO:486 LGGYYRGKLISFHQAEHSELQGQKQYQI.RGMATTLEGY REpQ]KAGsSLVKHIAFRII RRGADLLWCNARTSASGYYKK ______LGFSEQciEIfETPPVGPHILMYKRIT SEQ ID 3_46 NLEVKYINAED j.Y HRFUUI~ RFNQPIEACMYESDLLRGAFH NO:487 LGGFYRGKLISIASFHQAEHISELQGQKQYQLRGMATLRGY RBQKAGSSLHaAEEILRKRGADLWCNARTSASGYYK _______ _____GFSEQCTEEPPVGPHLVYKRLT SEQ ID 3_15B2 MLEVKCPTNAED i x HR1WTI PNQPLEVCMYETD)I1RGAIF NO:488 BLGGYYGGKLISIASFHQAEHSELQGQKQYQLRGMATLE GYREQKAGSST LkEIAITRTR GADLLWCNARTSASGYYK YLGFSEQGEEI=?PVGPBILMYKRIT SEQ ID 3_6A10 m-VKPINAED j Y H RHR ITRPNQPIEAC-MYESDLLRGAFH NO :489 LGGYYRGKLISIASFHQAEHSELQGQKQYQLRGMATLEGY REQKAGSSL XTaPPIF RTCRG(ADL.LWCNARTSASGYYKIKL OFSEQGIM=TPVGPHILMYKRIT SEQ ID 3_6BI InEVKPINAEDTYELRBPJLRPNQPIEACMYESDLLRGAFH NO:490 LGGYYRGKSIASFHQAEBELQGQKQYQLRGMATEGY REQKAGSSI iTAEFIT RRGADL-LWCNARTSASGYYKXL ______GFSEQGEVYFPvGPH]LMyxR.I SEQ ID 3_79 MLVKPhIAEDTYELRRIRPNQPIEACMYESDLLRGAFH NO:49 1 LAGGYYGGKLTSIASFHQAEHSDLQGQKQYQ1LRGMATLEG YREQKAGSSLIKHAEEILR]KRGADLLWCNARTSASGYYKK ________ LGFSEQCIE]FETPVGPILYERI1T SEQ ID 3_8011 MLEVKPINAEDTYBLRIHRLRFNQPIEVCMYEDLLRGAFH NO:492 LGGYYRGTKLJIASFHQALEHSELQGQKQYQLRGMATLEGY REQKAGSST .WT-TE1TR IR GADLLWCNARTSASGYYKKL GFSEQGEH=I'PVGPHEhMYKRIT SEQ HD 4_lB 10 MIEVKPINAEDTYELRBPLRPNQPIEVCMYETI)LLRGAFH NO:493 LGTGFYGGSASFQASDLQGQKQYQM, Y RDQKAGSSLABQURRGADMLWCNARTSASGK LGFSEQE TPGPHELMYKRIT SE D 52B3 MMVKPNADT ERRPNQP CIEVCM-RGF NO:494 LGGFYGGULSL4SFHQAHSDLQGQKQYQG~fGY RDQKAGSSLMHAIZQLGADMLWCNARTSASGYK _____LGFSEQGEIPPVGPIMYKR --. J SEQ ID 5_2D9 MLfXV\TNAEDTYERK]RPNQPXEVCMYEXDU-RGAF; NO:495 BLGGFYRGKLSIASHQAEHSDLQGQKQYQLRGATLG yRDQKAGssLEHABEQILERGADMLWCNARTSASGY ________ KLGrFSEQGrEVEDTPPVGPIMKL SEQ ID 5_2F10 M1RVKINlAEDTYELRHKELRPNQPIEVCMYETDLLRGAF NO :496 HLGGFYGGKJUISIASFHQAMEDLQGQKQYQLRGMATLEG yRDQKAGSSLRAQURGADLWCNARTSSGY ______KLGpSEQGEHZETPPVGPBJUVIYKRLT SEQ ID 6_lAl 1 MLEKPNAE TYEMLRIEVPNQPL\-AG NO:497 HGGFYRGKLSIASFHQAEHSDLQGQKQYQRGALG NO :497YRDQKAGSSLUIAEQn]LRGADMLWCNARTSASGYYR _____________KLGFSEQUEVFETPPVGPHI1MYYKRLT SEQ ID 6_1D5 NJBVKPINAEDTYELRI]XJLRNQPLEVCMYETDLLRGAF NO:498 HLGGFYRGUSIASFHQAEHSDLQGQKQYQLRMALFG YRDQKAGSSLTR1AQ]RKRADMN'LWCNARTSASGY - 195 - WO 02/36782 PCTUJSOI/462Z7 ________ KbcFSBQGEVFEI'PVGPH!LMYKRIT SEQ ID 6_IFi11 MUEVIKPINAEDTYEBICKLRNQPI.EVCMYETDiLGAF NO:499 HLGGIFYRGICLISIASFHQAEHSDLQGQKQYQLRGMATLEG YREQKAGSSLTRHABQLRYRGADMLWCNARTSASGYYK KWFSEQGEVFEPPPVGPHELMYKERLT SEQ HD 6lIl MLEVKPTNAEDTYELRHKILRPNQPLEVCMYBTDLLRGAFI NO:500 F±CJGFYRGTKLISIASFHQABHSELQGQKQYQLRGMATLEG YRDQKAGSSLIRHABQ]LRXRGADIMWCNARTSASGYYK ICLGFSEQGEVFETPPVGPHIL~ymYLT SEQ ID 6_11110 1%4LVP]NAEDTYEL RTWTF PNQPLEVCMYETDLLRGAF NO:501 IBLGGFYGGKLIIASFHQAEHSDLQGQKQYQLRGMATLEG YRDQKAGSST 11HEiU R1GADMLWCNARTSASGYYK KLGPSEQG-EVFDTPPVGPILMYKK1 SEQ ID 6_114 MLfEVYPINAEDTYELRHiKllRPNQPLEVCnMYDJJRGAF NQ:502 BLGGFYGGKLISIASFHQAEHSDLQGQ KQYQLRGMATLEG YRDQKAGSTh1XHAEQIUMKGADMLWCNARTSASGYYK ______KLGFSEQCYEVFETPVGPHLIVYKRLT SEQ ID 8_iFS MIE-KPINAEDTYELRHRILRPNQPLEVCMThILRGAFH NO:503 LGGFYRGKLISIASFHQAESDLQGQKQYQLRGMALGY REQKAGSSL .FCAFJFTF TRGADIiWCNARTSASGYYKYKL GFSEQGEIFDTPVGPHLJYXR SEQ ID 8_1G2 MIEYKPINAEDTYELRHRVLRPNQPL.EVCMYETDLLRGAF NO:504 HLGGYYRGKLISIASFHQAEHSELQGQKQYQIRGMAThE-G YREQKAGSSLHABEIIR RGADLLWCNARTSASGYK _____LGFSEQGEVTPVGPBTLMYKRLT SEQ ID S_1(33 MLfEVKYINAEDTYELREI(]LRPNQPIE VCMYETLJRGAF NO:50 ELGYYRKLISIASFHQAESELQGQKQYQLRGMATLEG NO:505YREQKAGSSLIRHABURRGADLLWCNARTSASGY'K _______ _____LGFSEQG-EIFDTPPVGPHMJMYKRrr SEQ ID S_1117 NLEYKPINAEDTYELRIRIRPNQPIEVCMYETDLLRGAFH NO: 506 LGGFYRGKLISIASFHQAEHSELQGQKQYQLRGMATEGY REQKAGSSLHAEElRRGADMLWCNARTSASGYYIK LGFSEQGEIFETPPVGPH[LIVYKRLT SEQ ID S_1119 MLEVIKPINAEDTYELREILRPNQPLEVCMYETIDLLRGAF NO:507 IILGGYYRGKLISIASFHQAEHSDLQCTQKQYQLRGMATLE GYRBQXAGSSLIRHAEEJLRKRGADILWCNARTSASGYYJ( KLCGFSEQGEVFDTPVGPBE)LMYKRLT SEQ ID GATi_21F EVIKINAEDTYJTHRLRPNQPLEACKYELGTFH NO:508 12 LGGYYRGKUISIASEHINAEHSELEBGQKQYQLRGMATLEGY REQKAGsThIRHAEELLRKGADLWCNARTSVSGYK LGFSEQGEVYDIPPIGPKTIMYKKLT SEQ ID GATi_24G MIEVyNAEDTyE1pTRPNQPLmECMYEbiTiF NQ:509 3 LGGYYRGKISIASFHQASEPGQKQYQLRGMALGY REQKAGSTLIRRIAEELLRKKGADILLWCNARTFVSGYYEK ________LGFSEQGEYYDIPPIGPYMIMYEKLT SEQ ID) GAT1_290 MEEVKPNAB)TEHRLRPNQPL-ACMTYEDLLGGF NO:5 10 1 LGGYYRGLSIASFHQASBGQKQYQICGMATGY RBQKAGSIITRIIAPYJ I TCGADLLWCNARTSVSGYYKK LGFSEQ CDIPPIGPBU-MYKKLA SRE ID IGATi 20IIVPNMiYEUURNPY C =TLLGF - 196 - WO 02/36782 PCTTJS01146227 NQ:51 1 1 LGGYYRGKLASFHQABHPELEGQKQYQLRGMAThBGY REQKAGsTLdiRAEELLRKGADL-LWCNARTSVSGYYEK _____________LGFSBQGEVYDIPPIGPHamyXXLT SEQ ID GAT2_15G NEVNAEIDTYRPQPECKamLLG NO:512 8 LG3GYYRGKSIASNAEEHSELEG(QKQYQLRGMALGY REQKAGS T nM1APBELIRTCGADLLWCNARTSVSGYYK LGFSEQGEVYDIPPIGPHILMYXXLT SEQ ID GAT2_.1911 NEVKPNAEDTIYE MIIRPNQPLRACMYETD)LLGGTH NO:5 13 8 LGGYYRGKLSIASFHQAEEIPELEGQKQYQLRGMAThB-GY RBQKAGS iTRRAPP II (ADLLWCNARTSVSGYYEK _______ _____LGFSEQG1EVCDIP>PIGPHLIVYEKLT SEQ ID GAT2_2W MVICNADTYEHRRPNQLACMY~fDLGGTFH NO:5 14 1 LOGGYYRGKLISIASFHQAEERSELEGQKQYQLRGMATLFGY REQXAGS ~ TR1AEPJ IRKCGADLLWCNARTSYSGYYKK LGFSEQ(GGVYDIPPIGPIELNTYKIKLT SEQ ID B. AACT)GAAGCGAGGAATCT)C NO:515 licheniform is ribosome _________ I binding site __ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ - 197 -

Claims (28)

1. A transgenic plant cell, a transgenic plant, a transgenic seed, or a transgenic explant comprising said transgenic plant cell comprising a heterologous polypeptide with glyphosate N-acetyltransferase activity, wherein said plant cell produces N-acetylglyphosate when treated with glyphosate.
2. The transgenic plant cell, transgenic plant, transgenic seed or transgenic plant explant of claim 1, wherein the plant cell, plant, plant explant or seed exhibits enhanced resistance to glyphosate as compared to a wild type plant cell, plant, or plant extract of the same species, strain or cultivar.
3. The transgenic plant cell, transgenic plant, transgenic seed or transgenic plant explant of claim 1 or 2 further comprising (a) at least one polypeptide imparting glyphosate tol rance by an additional mechanism; and/or (b) at least one polypeptide imparting tolerance to an additional herbicide.
4. The transgenic plant, transgenic plant cell, transgenic seed or transgenic plant explant of claim 4, wherein (a) the at least one polypeptide imparting glyphosate tolerance by an additional mechanism is a glyphosate-tolerant 5-enolpyruvylshikimate-3 -phosphate synthase or glyphosate-tolerant glyphosate oxido-reductase; and/or (b) the at least one polypeptide imparting tolerance to an additional herbicide is a mutated hydroxyphenylpyruvatedioxygenase, a sulfonamide-tolerant acetolactate synthase, a sulfonamide-tolerant acetohydroxy acid synthase, an imidazolinone tolerant acetolactate synthase, an imidazolinone-tolerant acetohydroxy acid synthase, a phosphinothricin acetyl transferase or a mutated protoporphyrinogen oxidase.
5. The transgenic plant cell, transgenic plant, transgenic seed or transgenic plant explant of any one of claims 1-4, wherein the transgenic plant cell, trasgenic plant, transgenic plant explant, or transgenic seed is a crop plant selected front among the genera: Eleusine, Lollium, Bambusa, Brassica, Dactylis, Sorghum, Pennisetum, Zea, Oryza, Triticum, Secale, Avena, Hordeum, Saccharum, Coix, Glycine and Gossypium. -199
6. A method to produce a polypeptide that has glyphosate N-acetyltransferase activity which method comprises culturing the transgenic plant cell, the transgenic plant, the transgenic seed or the transgenic plant explant of any one of claims 1-5.
7. A method of producing a glyphosate resistant transgenic plant comprising: (a) transforming a plant or plant cell with a polynuc eotide encoding a glyphosate N-acetyltransferase; and (b) optionally regenerating a transgenic plant from the transformed plant cell.
8. The method of claim 7 which further comprises growing the transformed plant or plant cell in a concentration of glyphosate that inhibits the growth of a wild-type plant of the same species, which concentration does not inhibit the growth of tbe transformed plant, wherein said growing is: in increasing concentrations of glyphosate; and/or in a concentration of glyphosate that is lethal to a wild-type pla t or plant cell of the same species.
9. The method of claim 7 or 8, which further comprises pr pagating said transgenic plant by crossing said transgenic plant with a second plant, uch that at least some progeny of the cross display glyphosate tolerance.
10. A method for selectively controlling weeds in a field containing a crop comprising: (a) planting the field with crop seeds or plants which are glyphosate tolerant as a result of being transformed with a polynucleotide encoding a glyphosate N acetyltransferase; and (b) applying to the crop and weeds in the field a suf fcient amount of glyphosate to control the weeds without significantly affecting the crop.
11. A method for selectively controlling weeds in a field containing a crop comprising: - 200 (a) planting the field with crop, seeds or plants which are glyphosate tolerant as a result of being transformed with a polynucleotide encoding glyphosate N acetyltransferase and further comprising (i) at least one polypeptide imparting glyphosate tolerance by an additional mechanism; and/or (ii) at least one polypeptide imparting tolerance to an additional herbicide; and, (b) applying to the crop and weeds in the field a sufficient amount of glyphosate to inhibit growth of with weeds in the field without significantly affecting the crop; and (c) optionally, applying to the crop and weeds in the field a simultaneous or chronologically staggered application of an additional herbicide.
12. The method of claim 11, wherein the additional herbicide is applied and is selected from the group consisting of a hydroxyphenylpyruvatedioxygenase inhibitor, sulfonamide, imidazolinone, bialaphos, phosphinothricin, azafenidin, butafenacil, sulfosate, glufosinate, and a protox inhibitor.
13. The method of any one of claims 10, 11, or 12, wherein the transgenic plant or transgenic seed is a crop plant selected from among the genera: Eleusine, Lollium, Bambusa, Brassica, Dactylis, Sorghum, Pennisetum, Zea, Oryza, Triticum, Secale, Avena, Hordeum, Saccharum, Coix, Glycine and Gossypium.
14. A plant cell comprising a metabolic product of glyphosate which is N acetylglyphosate.
15. A method for detecting the presence of a GAT polypeptide or evaluating the activity of a GAT polypeptide in plant tissue comprising treating a plant with glyphosate and assaying plant tissue from said plant for the presence of N-acetylglyphosate.
16. A method for detecting GAT polypeptides comprising analyzing plant tissues using an immunoassay comprising GAT-specific antibody or antibodies. -201
17. A method for detecting the presence of a polynucleotide that encodes a GAT polypeptide comprising assaying plant tissue using PCR amplification.
18. The transgenic plant cell, transgenic plant, transgenic seed or transgenic plant explant of claim 1, substantially as herein described with reference to any one of the Examples and/or Figures thereof.
19. The transgenic plant cell, transgenic plant, transgenic seed or transgenic plant explant of any one of claims 1 to 5, substantially as herein described.
20. The method of claim 6, substantially as herein described with reference to any one of the Examples and/or Figures thereof.
21. The method of claim 6, substantially as herein described.
22. The method of claim 7, substantially as herein described with reference to any one of the Examples and/or Figures thereof.
23. The method of any one of claims 7 to 9, substantially as herein described.
24. The method of claim 10 or 11, substantially as herein described with reference to any one of the Examples and/or Figures thereof.
25. The method of any one of claims 10 to 13, substantially as herein described.
26. The plant cell of claim 14, substantially as herein described.
27. The method of any one of claims 15 to 17, substantially as herein described with reference to any one of the Examples and/or Figures thereof.
28. The method of any one of claims 15 to 17, substantially as herein described.
AU2007224390A 2000-10-30 2007-10-15 Novel glyphosate N-acetyltransferase (GAT) genes Ceased AU2007224390B2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
AU2007224390A AU2007224390B2 (en) 2000-10-30 2007-10-15 Novel glyphosate N-acetyltransferase (GAT) genes
AU2009201716A AU2009201716B2 (en) 2000-10-30 2009-04-30 Novel glyphosate-N-acetyltransferase (GAT) genes

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US60/244,385 2000-10-30
AU2002220181A AU2002220181B2 (en) 2000-10-30 2001-10-29 Novel glyphosate n-acetyltransferase (gat) genes
AU2007224390A AU2007224390B2 (en) 2000-10-30 2007-10-15 Novel glyphosate N-acetyltransferase (GAT) genes

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
AU2002220181A Division AU2002220181B2 (en) 2000-10-30 2001-10-29 Novel glyphosate n-acetyltransferase (gat) genes

Related Child Applications (1)

Application Number Title Priority Date Filing Date
AU2009201716A Division AU2009201716B2 (en) 2000-10-30 2009-04-30 Novel glyphosate-N-acetyltransferase (GAT) genes

Publications (2)

Publication Number Publication Date
AU2007224390A1 AU2007224390A1 (en) 2007-11-01
AU2007224390B2 true AU2007224390B2 (en) 2011-03-17

Family

ID=38476312

Family Applications (2)

Application Number Title Priority Date Filing Date
AU2007205733A Ceased AU2007205733B2 (en) 2000-10-30 2007-08-10 Novel glyphosate N-acetyltransferase (GAT) genes
AU2007224390A Ceased AU2007224390B2 (en) 2000-10-30 2007-10-15 Novel glyphosate N-acetyltransferase (GAT) genes

Family Applications Before (1)

Application Number Title Priority Date Filing Date
AU2007205733A Ceased AU2007205733B2 (en) 2000-10-30 2007-08-10 Novel glyphosate N-acetyltransferase (GAT) genes

Country Status (1)

Country Link
AU (2) AU2007205733B2 (en)

Also Published As

Publication number Publication date
AU2007224390A1 (en) 2007-11-01
AU2007205733A1 (en) 2007-08-30
AU2007205733B2 (en) 2008-11-20

Similar Documents

Publication Publication Date Title
AU2002220181B2 (en) Novel glyphosate n-acetyltransferase (gat) genes
US7998703B2 (en) Glyphosate-N-acetyltransferase (GAT) genes
US8222489B2 (en) Glyphosate-N-acetyltransferase (GAT) genes
AU2002220181A1 (en) Novel glyphosate n-acetyltransferase (gat) genes
WO2003092360A2 (en) Novel glyphosate-n-acetyltransferase (gat) genes
AU2007224390B2 (en) Novel glyphosate N-acetyltransferase (GAT) genes

Legal Events

Date Code Title Description
FGA Letters patent sealed or granted (standard patent)
MK14 Patent ceased section 143(a) (annual fees not paid) or expired