MXPA01004558A - Virulence genes and proteins, and their use - Google Patents

Virulence genes and proteins, and their use

Info

Publication number
MXPA01004558A
MXPA01004558A MXPA/A/2001/004558A MXPA01004558A MXPA01004558A MX PA01004558 A MXPA01004558 A MX PA01004558A MX PA01004558 A MXPA01004558 A MX PA01004558A MX PA01004558 A MXPA01004558 A MX PA01004558A
Authority
MX
Mexico
Prior art keywords
leu
lys
gly
val
ser
Prior art date
Application number
MXPA/A/2001/004558A
Other languages
Spanish (es)
Inventor
Helen Rachel Crooke
Enda Elizabeth Clarke
Paul Howard Everest
Gordon Dougan
David William Holden
Jacqueline Elizabeth Shea
Robert Graham Feldman
Original Assignee
Enda Elizabeth Clarke
Helen Rachel Crooke
Gordon Dougan
Paul Howard Everest
Robert Graham Feldman
David William Holden
Microscience Limited
Jacqueline Elizabeth Shea
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Enda Elizabeth Clarke, Helen Rachel Crooke, Gordon Dougan, Paul Howard Everest, Robert Graham Feldman, David William Holden, Microscience Limited, Jacqueline Elizabeth Shea filed Critical Enda Elizabeth Clarke
Publication of MXPA01004558A publication Critical patent/MXPA01004558A/en

Links

Abstract

The present invention is based on the identification of a series of virulence genes in E. coli K1, the products of which may be implicated in the pathogenicity of the organism. The identificaiton of the genes allows them, or their expressed products, to be used in a number of ways to treat infection.

Description

GENES AND PROTEINS OF VIRULENCE, AND THEIR USE DESCRIPTION OF THE INVENTION This invention relates to the identification of genes and virulence proteins, and their use. More particularly, it relates to its use in therapy and screening for drugs. E. coli is a member of the Enterobacteria, or enteric bacteria, which are Gram-negative microorganisms that inhabit the intestinal tracts of animals. Other members of this family of bacteria include E-terobacter, Klebsiella, Salmonella, Shigella and Yersinia. Although E. coli is normally found in the human gastrointestinal tract, it has been implicated in human diseases, including septicemia, meningitis, urinary tract infection, wound infection, abscess formation, peritonitis and cholangitis. Disease states caused by E. coli are dependent on certain virulence determinants. For example, E. coli has been implicated in neonatal meningitis and a major determinant of virulence has been identified as the Kl antigen, which is a sialic acid homopolymer. The Kl antigen may have a role in preventing the host's immune system and preventing phagocytosis. The present invention is based on the identification of a series of virulence genes in E. coli Kl, and also relates to organisms, the products of which, may be involved in the pathogenicity of the organism. According to one aspect of the present invention, a peptide is encoded by an operon that includes any of the genes identified herein as mdoG, creC, recG, yggN, tatA, tatB, tatC, tatE, eckl, iroD, iroC, iroE, mtd2 and msl to 16, of E. coli Kl, or a homologue thereof in a Gram negative bacterium, or a functional fragment thereof. Such a peptide is suitable for therapeutic use, for example, when it is isolated. The term "functional fragments" is used herein to define a part of the gene or peptide which retains similar therapeutic utility as the entire gene or peptide. For example, a functional fragment of the peptide can be used as an antigenic determinant, useful in a vaccine or in the production of antibodies. A gene fragment can be used to encode the active peptide. Alternatively, the gene fragment may have utility in gene therapy, targeting the wild-type gene in vivo to exert a therapeutic effect. A peptide according to the present invention can comprise any of the amino acid sequences identified herein as identification sequences No. 2, 5, 7, 9, 11, 12, 13, 14, 16, 23, 24, 25, 26, 28, 31, 29, 32 and 35-48.
The identification of these peptides as virulence determinants allows them to be used in a variety of ways in the treatment of the infection. For example, a host may be transformed to express a peptide according to the invention or modified to alter the expression of the gene encoding the peptide. A vaccine may also comprise a peptide according to the invention, or the means for its expression, for the treatment of the infection. In addition, a vaccine can comprise a microorganism having a virulence gene deletion, wherein the gene encodes a peptide according to the invention. According to another aspect of the invention, the peptides or genes can be used to screen potential antimicrobial drugs or for the suppression of virulence. A further aspect of this invention is the use of any of the products identified herein, for the treatment or prevention of a condition associated with infection by a Gram negative bacterium, in particular E. coli. The present invention has made use of signature-tagged mutagenesis (MEF) (Hensel et al, Science, 1995; 269: 400-403) for screening strain of E. coli Kl RS228 (Pluschke et al, Infection and Immunity 39: 599-608) mini-Tn5 mutant bank for attenuated mutants, to identify virulence genes (and determinants) of virulence) of E. coli. Although E. coli Kl is used as the microorganism to identify virulence genes, corresponding genes in other enteric bacteria are considered to be within the scope of the present invention. For example, corresponding genes or encoded proteins can be found, based on sequence homology, on Enterobacter, Klebsiella and other genera involved in human intestinal diseases, including Salmonella, Shigella and Yersinia. The term "virulence determinant" is used herein to define a product, for example a peptide or protein that may have a role in the maintenance of pathogenic bacteria. In particular, a virulence determinant is a bacterial protein or peptide that is involved in the pathogenicity of the infectious microorganism or that causes the disease. A gene encoding a virulence determinant can be named "a virulence gene". The alteration of a virulence gene by mutation, deletion or insertion form, will result in a reduced level of survival of the bacteria in a host, or a general reduction in the pathogenicity of the microorganism.
Signature-tagged mutagenesis has proven to be a very useful technique for identifying virulence genes and their products. The technique is based on the ability of transposons to insert themselves randomly into the genome of a microorganism, under permissive conditions. Transposons are individually labeled for easy identification, and then separately introduced into a microorganism, resulting in alteration of the genome. The mutated microorganisms with reduced virulence are then detected by negative selection and the genes where insertion inactivation has occurred are identified. A first step in the MEF process is the preparation of suitable transposons or elements similar to transposons. A library of transposons are prepared, each being incorporated into a vector or plasmid to facilitate transfer into the microorganism. The preparation of vectors with suitable transposons will be apparent to a person skilled in the art and is further described in O-A-96/17951. For Gram negative bacteria, for example E. coli, suitable transposons include Tn5 and TnlO. having prepared the transposons, then mutagenesis of a bacterial strain is performed to create a library of individually mutated bacteria. Reserves of the mutated microorganisms are then introduced into a suitable host. After an adequate length of time, microorganisms are recovered from the host and those microorganisms that have survived in the host are identified, so that the mutated strains that fail to survive, that is, avirulent strains, are also identified. The corresponding avirulent strains in a stored library are then used to identify the genes where insertion inactivation occurs. Usually, the insertion site of the transposon is identified by isolating the DNA that flanks the insertion site of the transposon, and this allows the characterization of the genes involved in the virulence. Once the avirulent microorganism has been identified, it is possible to more fully determine the potential role of the mutated gene in virulence by infecting a suitable host animal with a lethal dose of the mutant. The survival time of the infected animal is compared to that of the control infected with the wild-type strain, and those animals that survive for periods larger than the control can be said to be infected with microorganisms that have mutated virulence genes. Alternatively, the potential role in virulence can be investigated by infecting an animal host with a mixture of the wild type and mutant bacteria. After a suitable period of time, the bacteria are harvested from the organs of the host animal and the proportion of wild-type and mutant bacteria is determined. This proportion is divided by the proportion of wild-type mutant bacteria in the inoculum, to determine the competitive index (CI). Mutants that have a competitive index of less than 1 can be said to be avirulent. It is possible that the gene that is inactivated by the transposon insertion may not be a true virulence gene, but may have a polar effect in a downstream (virulence) gene. This can be determined by further experimentation, by placing non-polar mutations in more defined regions of the gene, or by mutating other adjacent genes, and establishing whether the mutant is avirulent or not. Having characterized a virulence gene in E. coli, it is possible to use the gene sequence to establish homologies in other microorganisms. In this way it is possible to determine if other microorganisms have similar virulence determinants. Sequence homologies can be established by searching existing databases, for example EMBL or Genbank. Virulence genes are often agglomerated with each other in different chromosomal regions called islands of pathogenicity. Pathogenicity islands can be recognized as they are usually flanked by repeated sequences, insertion elements or tRNA genes. Also the content of G + C is normally different from the rest of the chromosome, suggesting that they are acquired by horizontal transmission of another organism. For example, the G + C content of the E. coli K12 genome is 52%. Any islands of pathogenicity found in strains of E. coli are likely to have a G + C content that varies from this average. The virulence genes identified are likely to be useful both in generating strains of attenuated vaccines and as a target for antimicrobials. The same can be true for homologues in Gram-negative bacteria in general. For the purpose of this invention, the appropriate degree of homology is typically at least 30%, preferably at least 50%, 60%, or 70%, and more preferably at least 80% or 90% (at the level of amino acid, or nucleotide). The proteins according to the invention can be verified and isolated by methods known in the art. In particular, having identified the gene sequence, it will be possible to use recombinant techniques to express the genes in a suitable host. The fragments and active homologues can be identified and can be useful in therapy. For example, proteins or their active fragments can be used as antigenic determinants in a vaccine, to elicit an immune response. They can also be used in the preparation of antibodies, for passive immunization, or diagnostic applications. Suitable antibodies include monoclonal antibodies, or fragments thereof, including single chain fv fragments. Methods for the preparation of antibodies will be apparent to those skilled in the art. The preparation of vaccines based on attenuated microorganisms is known to those skilled in the art. The vaccine compositions can be formulated with suitable carriers or adjuvants, for example, alum, as needed or desired, and used in therapy, to provide effective immunization against E. coli or other Gram-negative bacteria. The preparation of vaccine formulations will be apparent to the skilled person. More generally, and as is well known to those skilled in the art, a suitable amount of an active component of the invention can be selected for therapeutic use, as well as suitable carriers or excipients, and routes of administration. These factors will be chosen or determined according to known criteria such as nature / severity of the condition to be treated, the type or health of the subject, etc. The following examples illustrate the invention. For the examples, MEF is used to screen a library of E. coli Kl mini-Tn5 mutants for attenuated mutants, using a mouse model of systematic infection. The basic procedure followed is that described in Hensel et al, supra. E. coli Kl containing a mini-Tn5 insert within the virulence gene of mice inoculated with a mixed population of mutants is not recovered and is therefore likely to be attenuated. The region of DNA flanking either side of the mini-Tn5 insertion is cloned by reverse PCR or by rescue of a kanamycin resistance marker. In the latter case, the chromosomal DNA of the MEF-derived mutant is digested with restriction enzymes, ligated into the pUC19 plasmid, and selected kanamycin-resistant clones after transformation become competent E. coli K12 cells. The cloning and subsequent sequencing is then performed and the gene sequences are compared using sequences in publicly available sequence databases (EMBL) to help characterize the putative gene products. Example 1 In a first mutant, two fragments of the cloned DNA are sequenced. The nucleotide sequences are shown as SEQ ID NO. 1 and SEQ ID NO. 3 and a translated region of the DNA of SEQ ID NO. 1 is shown as SEQ ID NO. 2. SEQ ID NO. 1 shows 99.8% identity with the mdoGH region of E. coli K12 (accession number AE000206 from the EMBL database) of nucleotides 2577 to 6908. This DNA fragment encodes the 5 'part of the ymdD gene, the entire mdoG gene and the 5'- part of the mdoH gene. The product of the mdoG gene is of unknown function, but it is believed to be involved in the biosynthesis of membrane derived oligosaccharides. SEQ ID NO. 3 shows 98.3% identity with the 3 'part of the mdoH gene and the gene sequences downstream of E. coli K12. (nucleotides 7187 to 7760). SEQ ID NO. 2 shows 99.6% identity with the mdoG protein of E. coli K12 (accession number P33136 of S iss Prot) at amino acid 1 to 511. The novel gene is tested by virulence attenuation, using mixed infections, in a model of Murine of systematic infection (Achtman et al., Infection and Immunity, 1983; Vol. 39: 315-335), and shows to be attenuated with a competitive index (CI) of 0.38. This confirms that the attenuation of the original transposon mutant is probably due to the alteration of the mdoG gene. The polar and non-polar deletion mutants of mdoG are constructed. The mdoG gene and the flanking regions are amplified by PCR with oligonucleotides 5'-TGCTCTAGAGCCATTACTCAGAATGGG-3 '(SEQ ID NO: 49) and 5' -CGCGAGCTCGACGACTGAATGATCCC-3 '(SEQ ID NO: 50). The product is cloned in pUC19. A PCR product containing the 5 'and 3"terminal fragments of the mdoG and the entire pUC19 sequence is then amplified by inverse PCR with the oligonucleotides 5'-TCCCCCGGGTACTGCAGCACTCAACC-3' (SEQ ID NO.51) and 5'-GATCCCGGGACCACTGAAATGCGTGC -3 '(SEQ ID NO.52) A non-polar kanamycin resistance cassette is inserted {. Aph T) in both orientations between the mdoG sequences to give a polar and non-polar construction mdoG:: aph T fusions they are then transferred to the suicide vector pCDV442. The chromosomal copy of the mdoG is mutated by allelic transfer after conjugation of the pCDV442 constructs in wild type E. coli Kl.The mutants constructed for virulence attenuation are tested in a murine model of infection (Achtman et al., supra) Both polar and non-polar constructs are attenuated in virulence, with competitive indices of 0.37 and 0.35, respectively (mean CI of every three, mice). the attenuation of the original transposon mutant is probably due to the alteration of the mdoG gene. Example 2 A second mutant is identified with a virulence gene having the nucleotide sequence shown in SEQ ID NO. 4 and the translated amino acid sequence shown as SEQ ID NO. 5. The mini-Tn5 transposon inserted in nucleotide 581 (SEQ ID No. 4) and in amino acid 187 (SEQ ID No. 5). These sequences show 97.9% identity for the creC gene of E. coli K12 (accession numbers M13608, AE000510 and U14003 of EMBL and Genbank). The CreC protein of E. coli K12 belongs to the family of histidine kinase proteins as well as the family of proteins consisting of proteins that contain a signal domain. The novel gene is tested for virulence attenuation (Achtman et al., Supra.) And shown to be attenuated with a competitive index of 0.09. Since the creC gene of E. coli K12 is transcribed as part of an operon with the creD gene. It is possible that this attenuation is due to a polar effect in a creD gene of E. coli Kl. Example 3 A third mutant has a nucleotide sequence shown as SEQ ID NO. 6 immediately after mini-Tn5. A translation of this sequence is shown as SEQ ID NO. 7. The nucleotide sequence shows 93.7% identity for the recG gene of E. coli K12, in nucleotides 5-146 (accession numbers P24230 and M64367 of EMBL and Genbank). This shows that the altered gene is at least partially identical to the recG gene of E. coli K12. The recG gene of E. coli K12 encodes a 76.4 kD protein that functions as ATP-dependent helicase DNA, and plays a critical role in DNA repair. In tests for attenuation, the competitive index is shown to be 0.48. The recG gene is transcribed as the terminal gene of an operon, and therefore it is not likely that this attenuation is due to the polar effect in another E. coli Kl gene. Example 4 A fourth mutant has a transposon inserted into the nucleotide sequence shown as SEQ ID NO. 8, with a translation product shown as SEQ ID NO. 9. The mini-Tn5 transposon inserted in the nucleotide 359 and amino acid 80. These sequences show 98.5% sequence identity to the yggN gene of E. coli K12 (accession number EMBL AE000378) at nucleotides 339-1054, and 99.6% identity at the amino acid level. Although the sequence of the yggN gene is known, the function of this encoded protein has not been determined. The novel gene is tested for virulence attenuation, and shown to be attenuated with a competitive index of 0.43.
Example 5 Several mutants are found with a transposon insert within the same region. The cloning and sequencing of the region reveals a nucleotide sequence shown as SEQ ID NO. 10. This sequence has homology with the tatABCD operon of E. coli K12 (accession numbers AJ005830, AE000459 and AE000167 of EMBL and Genbank). This operon encodes proteins of predicted mass 9.6 kD, 18.4 kD, 28.9 kD and 29.5 kD, which function as components of an export path of independent Sec protein. The trajectory allows translocation of fully folded proteins to the periplasm through a gate pro, after the cofactors are joined in the cytoplasm. The translation of the nucleotide sequence reveals a protein corresponding to tatA (SEQ ID No. 11), a sequence corresponding to ta tB (SEQ ID No. 12), a sequence corresponding to tatC (SEQ ID No. 13) and a sequence corresponding to tat (SEQ ID NO.14). The mini-Tn5 transposons are located in the mutants identified by MEF in nucleotides 1429 and 2226 of the SEQ ID DO NOT. 10. These transposon insertions alter the tatB protein sequence at amino acid 50 and the tatC protein sequence at amino acid 143. The tatB and tatC genes are tested for virulence attenuation and shown to be attenuated at competitive rates of 0.0012. and 0.0039, respectively. These genes are also attenuated in virulence when tested in simple infections in the same model of systematic infection. EXAMPLE 6 An additional mutant is inactivated in the form of an insert within a region corresponding to the ta tE gene of E. coli K12, shown as SEQ ID NO. 15. A translation of the sequence is shown as SEQ ID NO. 16. The tatE gene shows 98% identity to that E. coli K12 gene (accession number AE000167) at nucleotides 6719-7306. To establish whether the ta tA, ta tD and ta tE genes are required for virulence, nonpolar suppression mutations are constructed in each. The flanking regions of DNA on either side of the tatA, ta tD and tatE genes are amplified with the following primers: ta tA 5 '-TCG TCT AGA GAT GAT GGT GAT GGA GCG-3' (SEQ ID NO. '-GAA CTG CAG CCA AAT ACT GAT ACC ACC C-3' (SEQ ID NO.54) 5 '-GAA CTG CAG GCT AAA ACA GAA GAC GCG-3' (SEQ ID NO.55) 5 '-CAT GCA TGC ACT CCA TACGAC AAC CGC-3 '(SEQ ID No. 56) The primers SEQ ID NO. 53 and SEQ ID NO. 54 to amplify the DNA sequences upstream of ta tA, primers SEQ ID NO. 55 and SEQ ID NO. 56, to amplify the DNA sequences downstream of tatA. tatD 5 '-TCG TCT AGA ATG AAG CTG CGC ATG AGG-3' (SEQ ID No. 57) 5 '-CCA CTG CAG TCG CAA ATT GCG AAC TGG-3' (SEQ ID NO.58) 5 '-CCA CTG CAG ACC GCA ACT TTT CGA CGC-3"(SEQ ID No. 59) 5 '-CAT GCA TGC CAG TGA GCC ATT GTT CCC- 3" (SEQ ID NO 60) The primers SEQ ID NO. 57 and SEQ ID NO. To amplify the upstream DNA sequences of tatD, the primers SEQ ID NO. 59 and SEQ ID NO. 60 to amplify the DNA sequences downstream of tatD. tatE 5 '-TGC TCT AGA TAC GAC TCT GAC AGG AGG-3' (SEQ ID NO 61) 5 '-TCA GAT ATC AAC TAC CAG CAG TTT GG-3 (SEQ ID NO 62) X' -TCA GAT ATC CAT AAA GAG TGA CGT GGC-3 '(SEQ ID No. 63) 5' -TGC TCT AGA AAA CGT GGC AAC AGA GCG-3 (SEQ ID No. 64) The primers SEQ ID NO. 61 and SEQ ID NO. 62 to amplify the DNA sequences upstream of ta tE, the primers SEQ ID NO. 63 and SEQ ID NO. 64 to amplify the DNA sequences downstream of tatE. After cloning these flanking DNA fragments in pUC19, a kanamycin resistance cassette aph T (Galán et al, J. Bacteriol, 1992) is inserted.; 174: 4338-4349) between the flanking DNA fragments to replace the ta tA, ta tD and ta tE genes. These DNA fragments are then transferred to the suicide vector pCVD442 (Blomfield et al, Mol.Micro., 1991; 5: 1447-1457). The chromosomal copies of the ta tA, ta tD and ta tE genes of E. coli Kl are then mutated by allelic transfer after conjugation of the pCVD442 constructs in wild-type E. coli Kl. Alterations of the ta tA, ta tD and ta tE genes have been tested for virulence attenuation (Achtman et al., Supra). None of the genes is attenuated when it is suppressed in isolation. Genes can still play a role in virulence, and to prove this, mutants are prepared with deletions in both tatA and ta tE genes. The double mutation for virulence attenuation is tested using infections mixed with the wild-type strain and shown to be attenuated with a competitive index of 0.0017. It seems therefore that the ta tA, ta tD and ta tE genes can be used in combination to create aviruletan microorganisms. Given the similarity of the ta tABCE genes of E. coli Kl to predict ta tABC genes present in the S genome. Typhimurium and the Neisseria meningi tidis genome probably seems that the tat system may also be required for virulence in these, and other, organisms. A deletion is constructed in the tatC gene of s. Typhimurium (SEQ ID NO: 17) by amplifying the DNA flanking either side of the tatC gene with the following primers: 5 '-TGC TCT AGA AGG CGT TGT CGA CBT TG-3' (SEQ ID NO.65) 5 '-GAA CTG CAG GAA AAG GCC GAG CAG ACT G-3 '(SEQ ID NO.66) 5' -GAA CTG CAG TAC AGC CAT GTT TAC GGT-3 '(SEQ ID NO.67) 5' -CAT GCA TGC GGT GTA CGA CAG TTT GCG-3 '(SEQ ID NO 68) The primers SEQ ID NO. 65 and SEQ ID NO. 66 to amplify the DNA sequences downstream of the tatC gene of S. typhimurium, the primers SEQ ID NO. 67 and SEQ ID NO. 68 to amplify the DNA sequences upstream of the tatC gene of S. typhimurium. The encoded amino acid sequences for two tatC gene regions are shown as SEQ ID NO. 18 and SEQ ID NO. 19. After cloning these flanking DNA fragments into pUC19, a non-polar kanamycin resistance cassette (aph T) is inserted between the flanking DNA fragments to replace the tatC gene of S. typhimurium. This DNA fragment is then transferred to the suicide vector pCVD442. The chromosomal copy of the tatC gene of S. typhimurium is then mutated by allelic transfer after conjugation of the pCVD442 construct in wildtype TML and SL1344 strains of S. typhimurium. The tatC gene of S is tested. typhimurium for virulence attenuation, using mixed and simple infections in a murine model of systematic infection. For mixed infections, 6-7 week old balbC mice are inoculated intraperitoneally with 10 4 bacterial cells. The competitive indices are calculated after comparing the numbers of mutant and wild-type bacteria present in spleens after three days. For simple infections, mice are inoculated either intraperitoneally or orally with several doses and mouse survival is monitored for 17 days. The strains are attenuated in virulence, the competitive indices of suppression strains of SL1344 taC and TML tatC that are 0.078 and 0.098, respectively. In simple infections, mouse survival is extended compared to wild-type controls. The sequence homology with the tat sequence of Neisseria meningi tidis is also shown. The gene sequence of N. meningitidis is shown as SEQ ID NO. 20 and the amino acid sequence encoded for tatC is shown as SEQ ID NO. 21. To test for virulence, a deletion mutant is created using the following primers: 5 '-TGCTCTAGACACATCATGGGCACACC-3' (SEQ ID NO: 69) 5 '-GAACTGCAGAACCGTCCACATCAGGCG-3' (SEQ ID NO. 5 '-GAACTGCAGACCCTGCTTGCCATTCCG-3' (SEQ ID NO 71) 5 '-GAACTGCAGACCCTGCTTGCCATTCCG-3' (SEQ ID NO 72) The cloning of the DNA fragments and the kanamycin resistance cassette aph T in pUC19 follows the procedure indicated above for S. typhimurium. The chromosomal copy of the tatC gene of N. meningi tidis is mutated by transformation of the pUC19-based constructs into wild type N. meningi tidis cells. Southern analysis of the resulting transformants indicates that all of the transformants are merodiploid and contain both wild type and mutated copies of the tatC gene. This indicates that there is some selection against the isolation of mutants in which the tatC gene has been deleted. Additional studies in polar and non-polar constructions show that the transformants do not grow on selective medium. This suggests that the tatC gene of N. meningi tidis is essential for in vitro growth of this organism. Example 7 An additional mutant is identified with a transposon insert within a nucleotide sequence identified herein as SEQ ID? O. 22, in nucleotide 3981. The sequence defined herein as eckl, shows sequence homology to several glycosyltransferase group 1 of a number of bacteria. Sequence homology is shown for the gnd gene of E. coli K12 (in nucleotides 4197-4604 of SEQ ID? O.22). The translation of the eckl gene of E. coli is shown as SEQ ID? O. 26. The gene has tested for virulence attenuation, as described above, and is shown to be attenuated with a competitive index of 0.025. Several open reading structures (ELA) of the DNA sequence are also identified (SEQ ID No. 22). The first of these is defined herein as MSI and a translation product shown as SEQ ID NO. 25. The amino acid sequence is shown to have 50.3% identity to a putative glycosyl transferase of serotype 011 of E. coli (access number AAD46732 from the TrEMBL database). The amino acid sequence also shows homology with the eckl protein of E. coli Kl and also the TersE protein of Yersinia entercoli tica (access number Q56917 of the TrEMBL database). A second open reading frame identified herein as MS2 has the gene sequence shown as SEQ ID NO. 24. This shows sequence homology to the putative glycosyl transferase TrsC from Yersinia entercoli tica (access number Q56915 from the TrRMBL database), and also the glycosyl transferase bnA from serotype 0113 from E. coli (access number AAD50485 from the base of TrEMBL data). A third open reading structure encodes a product identified herein as MS3 (SEQ ID NO.23). The amino acid sequence shows 30.2% identity to a ramnosyltransferase of Streptoccus mutans. The sequence of genes shown as SEQ ID NO. 22 may be at least part of an island of pathogenicity, with multiple virulence genes that are placed in a glomerulus in the microorganism's genome. Example 8 An additional mutant having a transposon insert within the iroCED operon is identified. The nucleotide sequences flanking either side of the mini-Tn5 insert are shown as SEQ ID NO. 27 and SEQ ID NO. 30. The mini-Tn5 transposon is inserted into the nucleotide 1272 of SEQ ID NO. 27 and in nucleotide 1 of SEQ ID NO. 30, and interrupts the iroD gene. The N terminal region of IroD is shown as SEQ ID NO. 29, and the C terminal region is shown as SEQ ID NO. 31. In addition to iroD, the gene shown as SEQ ID NO. 27 encodes a partial peptide with the amino acid sequence shown in SEQ ID NO. 28. This amino acid sequence shows 70.9% identity to the iroC transporter of the ATP binding cassette of Salmonella typhi. The gene sequence shown as SEQ ID NO. includes an open reading frame that encodes a peptide with the amino acid sequence shown as SEQ.
ID NO. 32 and this has sequence homology to the iroE protein of Salmonella typhi. When testing the genes in a model for virulence attenuation, as described above, it is shown that the iroD gene is attenuated with a competitive index of 0.107. the mini-Tn5 mutation in the iroD gene has been reintroduced into the wild-type E. coli Kl strain by transduction of Pl. The resulting transductant is also attenuated in virulence with a competitive index of 0.1. This indicates that the attenuated phenotype is linked to the insertion within iroD. However, it is possible that the attenuation is due to a polar effect in the iroE gene of E. coli Kl. Example 9 An additional mutant is identified with a transposon insert within the nucleotide sequence shown as SEQ ID NO. 33. The transposon is inserted into nucleotide 2264 of SEQ ID NO. 33. The nucleotide sequence shows sequence homology to the asIA / hem region of E. coli K12 (accession number AE000456 of EMBL). ASIA codes for an arylsubatase homologue whereas hemY is involved in the biosynthesis of protohema IX. This shows that the altered region is at least partially identical to the asIA / hemY region of E. coli K12. The transposon is inserted into nucleotide 2264 of SEQ ID NO. 33. This insertion site has 216 nucleotides downstream of the coding codon of the hemY gene and 472 nucleotides upstream of the start codon of the asIA gene. The novel region has been tested for virulence attenuation, as described above and is attenuated with a competitive index of 0.033. The mini-Tn5 mutation in this region has been reintroduced into the wild-type E. coli Kl strain by transduction of Pl. The resulting transductant is also attenuated in virulence with a competitive index of 0.008. This indicates that the attenuated phenotype is linked to the transposon insert in this region. However, the polar and non-polar deletion mutants of ASIA are constructed and tested for virulence attenuation as described above. Neither the polar nor the non-polar mutants are attenuated in virulence and this demonstrates that the attenuation of the original transposon mutant is not due to a polar effect in the ASIA gene. This indicates that the transposon is altering some other encoded function within the intergenic region between asIA and hemY. For example, it may be some untranslated RNA molecule such as regulatory RNA similar to oxyS (Altuvia et al., Cell, 1997; 90: 43-53), encoded within this region. Alternatively, the transposon may be altering some DNA structure that may, for example, be involved in DNA replication. This DNA region is also present in the pathogen Salmonella typhimurium, suggesting that it may be important for pathogenicity in other organisms. This region (SEQ ID No. 33) can be used as an objective, to identify antimicrobial drugs. Example 10 An additional mutant is identified and the DNA region flanking either side of the mini-Tn5 insert is cloned and has the nucleotide sequence shown as SEQ ID NO. 34. This nucleotide sequence has homology with the mtd2 gene of Herpetosiphon auran tiacus (accession number P25265 of EMBL) with the product of the mtd2 gene that functions as a cytosine-specific methyltransferase. The mtd2 gene is not found in the E. coli K12 genome and may represent an island of pathogenicity. The inserts of the mini-Tn5 transposon are located in nucleotides 4773 and 3764 of SEQ ID NO. 34 and are shown to interrupt the mtd2 gene. The amino acid sequence of the mtd2 gene is shown as SEQ ID NO. 43. The mtd2 gene of E. coli Kl is tested for virulence attenuation, as described above, and shown to be attenuated with a competitive index of 0.073. In addition to the mtd2 gene, a series of open reading structures are also identified with translation products identified herein as Ms4 to MS16, SEQ ID NO. 48-44 and 42-35, respectively. As soon as open reading structures are located on an island of potential pathogenicity, mutations in these genes can also result in attenuation in virulence. In addition, since it is known that E. coli and other bacteria can encode peptides in different forms in the nucleotide sequence, the coding regions of some of these proteins can overlap. In addition, any amino acid sequence shown starting with Val can in fact start with Met.
SEQUENCE LIST < 110 > Microscience Limited < 120 > GENES AND PROTEINS OF VIRULENCE, AND THEIR USE < 130 > REP05921 O < 140 > < 141 > < 160 > 72 < 170 > Patentln Ver. 2. 1 < 210 > 1 < 211 > 4333 < 212 > DNA < 213 > Escherichia coli < 220 > < 221 > CDS < 222 > (1017). . (2549) < 400 > 1 ccattactca gaatgggcgg atacacaata aaaattgttc ttcttattac cgcataaccg 60 atgccgaggc acaaaaaaat caccgatagt tttaccatcg agaatttttt attcgtttta 120 tcagaatttt ctaaattatt tctgatacgt ttgaatatcc agacgcacag cgtcgtcatg 180 accactaaca ccagtaaaaa ccacaggtgt gatattaatt cccaggccaa cgtattatat 240 ttgtcataca atgacagtcc aggccaactt tccgctttcc ctttgacgta ttgcagcata 300 ataaattgcg gcaatgtcag tagggggatg gctgttaaca tcgggatacc tacacgttcg 360 acacgtactt tccaccattt tttcaaggga tagcgtaaaa aaagcatgta ggaaaagtac 420 ccggatataa cgaaaaatac ctgcatgcgg aacgagtgga tgaagtcatt aaaaagggtc 480 agccataatg acggttcggc gctattcaca tgccatgtat ggctcgaata gattaaagaa 540 atatgaaaag ggatccctaa caacatcagc caggcgcgga tggagtcgag gaaatattca 600 cgttgcgcgg gtactgggtt catatatggt taactaatct cggatttttc gtcttatccc 660 tgtcgggtta tgcctttagg cttgttgcca tagcgacacc gacctgaccg cgccaggcgc 720 aggcttcaag gtttttatgc atagcatcat cgctaccact aaccagaatg gaagcgtctg 780 taagacggtt gataaataaa tttgctggca aaccctaca'c gaagtcgatg cttctgtctt 840 taggagaagc acggaaagtg aaaacggttg caatcaggtg cttaatccat gagccagtgt 900 gctgaacgat accgggattc tgttgtcgga atggcaggtt atccattaaa atagatcgga 960 tcgatataag cacacaaagg gggaagtgct tactaattat gaaacataaa ctacaa atg 1019 Met 1 atg aaa atg cgt tgg ttg agt gct gca gta atg tta acc ctg tat here 1067 Met Lys Met Arg Trp Leu Ser Ala Ala Val Met Leu Thr Leu Tyr Thr 5 10 15 tet tea age tgg gct ttc agt att gat gat gtc gca aag ca gct ca 1115 Being Ser Trp Wing Phe Ser lie Asp Asp Val Wing Lys Gln Wing Gln 20 25 30 tec tta gcc ggg aaa ggc tat gag gcg ccc aaa age aac ttg ccc tec 1163 Ser Leu Wing Gly Lys Gly Tyr Glu Wing Pro Lys Ser Asn Leu Pro Ser 35 40 45 gtt ttc cgc gat atg aaa tac gcg gae tat cag cag ate cag ttt aat 1211 Val Phe Arg Asp Met Lys Tyr Wing Asp Tyr Gln Gln He Gln Phe Asn 50 55 60 65 cat gac aaa gcg tac tgg aac aat ctg aag acc cea ttc aaa etc gag 1259 His Asp Lys Wing Tyr Trp Asn Asn Leu Lys Thr Pro Phe Lys Leu Glu 70 75 80 ttc tac cat cag ggt atg tac ttc gat acc ccg gtc aaa ata aat gaa 1307 Phe Tyr His Gln Gly Met Tyr Phe Asp Thr Pro Val Lys He Asn Glu 85 90 95 gtg act gcc acc gca gtc aaa cga ate aaa tac age ccg gat tat ttc 1355 Val Thr Ala Thr Ala Val Lys Arg He Lys Tyr Ser Pro Asp Tyr Phe 100 105 110 act ttc ggc gat gtt cag cat gac aaa gac acg gta aaa gac ctt ggt 1403 Thr Phe Gly Asp Val Gln His Asp Lys Asp Thr Val Lys Asp Leu Gly 115 120 125 ttt gcc ggt ttc aaa gtg ctt tac ccg ate aac age aaa gat aaa aac 1451 Phe Ala Gly Phe Lys Val Leu Tyr Pro He Asn Ser Lys Asp Lys Asn 130 135 140 145 gat gaa ate gtc age atg etc ggg gcc age tat ttc cgc gtg att ggt 1499 A = p Glu He Val Ser Met Leu Gly Ala Ser Tyr Phe Arg Val He Gly 150 155, 160 gca ggt cag gtt tat ggc ctt tet gca cgc ggc ctg gca att gat acc 1547 Wing Gly Gln Val Tyr Gly Leu Ser Wing Arg Gly Leu Wing Asp Thr 165 170 175 gcc .ttg cea tcg ggt gaa gaa ttt cea cgc ttc aaa gag ttc tgg ate 1595 Wing Leu Pro Ser Gly Glu Glu Phe Pro Arg Phe Lys Glu Phe Trp He 180 185 190 gag cgt cea aaa ccg act gat aaa cgt tta acc att tat gca ttg ctt 1643 Glu Arg Pro Lys Pro Thr Asp Lys Arg Leu Thr He Tyr Ala Leu Leu 195 200 205 gac tcg ccg cgc gcg here ggt gct tac aaa ttc gta gtt atg cea gga 1691 Asp Ser Pro Arg Ala Thr Gly Ala Tyr Lys Phe Val Val Met Pro Gly 210 215 220 225 cgt gac acg gtt gtg gat gtg cag tcg aaa ate tat ctg cgc gat aaa 1739 Arg Asp Thr Val Val Asp Val Gln Ser Lys He Tyr Leu Arg Asp Lys 230 235 240 gtc ggc aaa ctg ggg gtt gca ccg tta acc agt atg ttc ctg ttt ggg 1787 Val Gly Lys Leu Gly Val Ala Pro Leu Thr Ser Met Phe Leu Phe Gly 245 250 255 ccg aac cac ccg tcg ect gca aat aac tat cgt ccg gag ttg cae gac 1835 Pro Asn Gln Pro Ser Pro Wing Asn Asn Tyr Arg Pro Glu Leu His Asp 260 265 270 tet aac ggt ctg tet ate cat gct ggt aat ggc gaa tgg ate tgg cgt 1883 Ser Asn Gly Leu Ser He His Wing Gly Asn Gly Glu Trp He Trp Arg 275 280 285 ccg ttg aat aac ccg aaa cat tta gcg gtc age age ttc tcg atg gaa 1931 Pro Leu Asn Asn Pro Lys His Leu Wing Val Being Ser Phe Ser Met Glu 290 295 300 305 aac ccg ca ggc ttc ggt cta tg ggt ggt cgt gat ttc tec cgc 1979 Asn Pro Gln Gly Phe Gly Leu Leu Gln Arg Gly Arg Asp Phe Ser Arg 310 315 320 ttt gaa gat etc. gat gat cgt tac gat ctt cgt cea age gca tgg gtg 2027 Phe Glu Asp Leu Asp Asp Arg Tyr Asp Leu Arg Pro Ser Wing Trp Val 325 330 335 act ccg aaa ggg gag tgg ggc aaa ggc age gtt gag ctg gtg gaa att 2075 Thr Pro Lys Gly Glu Trp Gly Lys Gly Ser Val Glu Leu Val Glu He 340 345 350 cea acc aac gat gaa acc aac gat aac ate gtc gct tac tgg acg ccg 2123 Pro Thr Asn Asp Glu Thr Asn Asp Asn He Val Wing Tyr Trp Thr Pro 355 360 365 gat cag ctg ccg gag ccg ggt aaa gag atg aac ttt aaa tac acc ate 2171 Asp Gln Leu Pro Glu Pro Gly Lys Glu Met Asn Phe Lys Tyr Thr He 370 375 380 385 acc ttc age cgt gat gaa gac aaa ctg cat gcg cea gat aac gca tgg 2219 Thr Phe Ser Arg Asp Glu Asp Lys Leu His Wing Pro Asp Asn Wing Trp 390 395 400 gtg caá cag acg cgt cgt tea acg ggg gat gtg aag cag tcg aac ctg 2267 Val Gln Gln Thr Arg Arg Ser Thr Gly Asp Val Lys Gln Ser Asn Leu 405 410 415 att cgc cag ect gac ggt act ate gcc ttt gtg gtc gat ttt acc ggc 2315 He Arg Gln Pro Asp Gly Thr He Wing Phe Val Val Asp Phe Thr Gly 420 425 430 gct gag atg aaa aaa ctg cea gag gat acc ccg gtc here gcg ca acc 2363 Wing Glu Met Lys Lys Leu Pro Glu Asp Thr Pro Val Thr Ala Gln Thr 435 440 445 age att ggt gat aat ggt gag ata gtt gaa age acg gtg cgt tat aac 2411 Ser He Gly Asp Asn Gly Glu He Val Glu Ser Thr Val Arg Tyr Asn 450 455 460 465 ccg gtt acc aaa ggc tgg cgt ctg gtg atg cgt gtg aaa gtg aaa gat 2459 Pro Val Thr Lys Gly Trp Arg Leu Val Met Arg Val Lys Val Lys Asp 470 475 480 gcc aag aaa ac act gaa atg cgt gct gcg ctg gtg aat gcc gat cag 2507 Wing Lys Lys Thr Thr Glu Met Arg Ala Wing Leu Val Asn Wing Asp Gln 485 490 495 acg ttg agt gaa acc tgg age tac cag tta ect gcc aat gaa 2549 Thr Leu Ser Glu Thr Trp Ser Tyr Gln Leu Pro Wing Asn Glu 500 505 510 taagacaact gagtacattg acgcaatgcc catcgccgca agcgagaaag cggcattgcc 2609 gaagactgat atccgcgccg tteatcaggc gctggatgcc gaacaccgca cctgggcgcg 2669 ggaggatgac tccccgcaag gctcggtaaa ggcgcgtctg gaacaagcct ggccagattc 2729 acttgctgat ggacagttaa ttaaagacga cgaagggcgc gatcagctaa aggcgatgcc 2789 cgctcctcga agaagtaaaa tgtttcccga cccgtggcgt accaacccgg taggccgttt 2849 ctgggatcgc ctgcgtggac gcgatgtgac gccgcgctat ctggctcgtt tgaccaaaga 2909 agagcaggag agtgagcaaa agtggcgtac cgtcggtacc atccgccgtt acattctgtt 2969 gatcctgacg ctcgcgcaaa ctgttgtcgc gacctggtat ttcttcctta atgaagacca 3029 tcaggggtgg gcgctgatta atcctatgga tatggttggt caggatgtgt gggtttcctt 3089 tatgcagctt ctgccttata tgctgcaaac cggtatcctg atcctctttg cggtactgtt 3149 ctgttgggtg tccgccggat tctggaccgg cgttgatggg cttcctgcaa ctgcttattg 3209 atacagtata gtcgcgataa tctgcgtcaa tgaaccatta cagttggcga aacccggagc 3269 atcgcacggc gttgatcatg cctatctgta acgaagacgt gaaccgtgtt tttgctggct 3329 tgcgtgcaac gtgggaatca ccgggaatgc gtaaaagcca caaacatttt gatgtctaca 3389 ttcttagtga cagttataac ccggatatct gcgtcgcaga gcaaaaagcc tggatggagc 3449 ttatc gctga agtcggtggc gaaggtcaga ttttctatcg ccgccgccgc cgtcgcgtga 3509 agcgtaaaag cggtaatatc gatgacttct gccgtcgctg gggcagccag tacagctaca 3569 tggtggtgct ggatgctgac tcggtaatga ccggtgattg tttgtgcggc ctggtgcgcc 3629 tgatggaagc caacccgaac gccgggatca ttcagtcgtc gccgaaagcg tccggcatgg 3689 atacgctgta tgcgcgctgt cagcagttcg cgacccgcgt gtatgggcca ctgtttacag 3749 cttctggcaa ccggtttgca cttggcgagt cgcactactg ggggcataac gcgattatcc 3809 gcgtgaaacc gtttatcgag cactgtgcac tggctccgct gccgggcgaa ggttcttttg 3869 ccggttcaat cctgtcacat gacttcgtgg aagcggcgtt gatgcgccgt gcaggttggg 3929 gggtctggat tgcttacgat ctcccgggtt cttatgaaga attaccgcct aacttgcttg 3989 atgagctaaa acgtgaccgc cgctggtgcc acggtaacct gatgaacttc cgtctgttcc 4049 tggtgaaggg tatgcacccg gttcaccgtg cggtgttcct gacgggcgtg atgtcttatc 4109 tctccgctcc gctgtggttt atgttcctcg cgctctctac tgcattgcag gtagtacatg 4169 cgttgaccga accgcaatac ttcctgcaac cacggcagt't gttcccggta tggccgcagt 4229 ggcgtcctga gctggcgatt gcactttttg cttcgaccat ggtgctgttg ttcctgccga 4289 agctattgag cattttgctt atctggtgca aaggaacgaa agaa 4333 < 210 > 2 < 211 > 511 < 212 > PRT < 213 > Escherichia coli < 400 > 2 Met Met Lys Met Arg Trp Leu Ser Ala Ala Val Met Leu Thr Leu Tyr 1 5 10 15 Thr Ser Ser Ser Trp Wing Phe Ser He Asp Asp Val Wing Lys Gln Wing 20 25 30 Gln Ser Leu Wing Gly Lys Gly Tyr Glu Wing Pro Lys Ser Asn Leu Pro 35 40 45 Ser Val Phe Arg Asp Met Lys Tyr Wing Asp Tyr Gln Gln He Gln Phe 50 55 60 Asn His Asp Lys Wing Tyr Trp Asn Asn Leu Lys Thr Pro Phe Lys Leu 65 70 75 80 Glu Phe Tyr His Gln Gly Met Tyr Phe Asp Thr Pro Val Lys He Asn 85 90 95 Glu Val Thr Ala Thr Ala Val Lys Arg He Lys Tyr Ser Pro Asp Tyr 100 105 110 Phe Thr Phe Gly Asp Val Gln His A = p Lys Asp Thr Val Lys Asp Leu 115 120 125 Gly Phe Wing Gly Phe Lys Val Leu Tyr Pro He Asn Ser Lys Asp Lys 130 135 140 Asn Asp Glu He Val Ser Met Leu Gly Wing Ser Tyr Phe Arg Val He 145 150 155 160 Gly Wing Gly Gln Val Tyr Gly Leu Wing Wing Arg Gly Leu Wing He Asp 165 170 175 Thr Wing Leu Pro Being Gly Glu Glu Phe Pro Arg Phe Lys Glu Phe Trp 180 185 190 He Glu Arg Pro Lys Pro Thr Asp Lys Arg Leu Thr He Tyr Ala Leu 195 200 205 Leu Asp Ser Pro Arg Ala Thr Gly Ala Tyr Lys Phe Val Val Met Pro 210 215 220 Gly Arg Asp Thr Val Val Asp Val Gln Ser Lys He Tyr Leu Arg Asp 225 230 235 240 Lys Val Gly Lys Leu Gly Val Wing Pro Leu Thr Ser Met Phe Leu Phe 245 250 255 Gly Pro Asn Gln Pro Pro Pro Wing Asn Asn Tyr Arg Pro Glu Leu His 260 265 270 Asp Being Asn Gly Leu Being He His Wing Gly Asn Gly Glu Trp He Trp 27S 280 285 Arg Pro Leu Asn Asn Pro Lys His Leu Wing Val Being Ser Phe Ser Met 290 295 300 Glu Asn Pro Gln Gly Phe Gly Leu Leu Gln Arg Gly Arg Asp Phe Ser 305 310 315 320 Arg Phe Glu Asp Leu Asp Asp Arg Tyr Asp Leu Arg Pro Ser Wing Trp 325 330 335 Val Thr Pro Lys Gly Glu Trp Gly Lys Gly Ser Val Glu Leu Val Glu 340 345 350 He Pro Thr Asn Asp Glu Thr Asn Asp Asn He Val Wing Tyr Trp Thr 355 360 365 Pro Asp Gln Leu Pro Glu Pro Gly Lys Glu Met Asn Phe Lys Tyr Thr 370 375 380 He Thr Phe Ser Arg Asp Glu Asp Lys Leu His Wing Pro Asp Asn Wing 385 390 395 400 Trp Val Gln Gln Thr Arg Arg Ser Thr Gly Asp Val Lys Gln Ser Asn 405 410 415 Leu He Arg Gln Pro Asp Gly Thr He Wing Phe Val Val Asp Phe Thr 420 425 430 Gly Wing Glu Met Lys Lys Leu Pro Glu Asp Thr Pro Val Thr Ala Gln 435 440 445 Thr Ser He Gly Asp Asn Gly Glu He Val Glu Ser Thr Val Arg Tyr 450 455 460 Asn Pro Val Thr Lys Gly Trp Arg Leu Val Met Arg Val Lys Val Lys 465 470 475 480 Asp Ala Lys Lys Thr Thr Glu Met Arg Ala Ala Leu Val Asn Ala Asp 485 490 495 Gln Thr Leu Ser Glu Thr Trp Ser Tyr Gln Leu Pro Wing Asn Glu 500 505 510 < 210 > 3 < 211 > 574 < 212 > DNA < 213 > Escherichia coli < 400 > 3 ttcgttgatc ctgtcaccgt ttgttcggtt atttccagcc gtgccaccgt tggtctgcga 60 accaaacgct ggaaactgtt ccctgatccc ggaagagtat tcaccgccgc aggtgetggt 120 tgataccgat cggttccttg agatgaatcg tcaatgctcc cttgatgatg gttttatgca 180 cgcggtgttt aacccgtcat ttaacgctct ggcaaccgca atggcgaccg cgcgtcaccg 240 cgccagcaag gtgctggaaa tcgcccgtga ccgccacgtt gaacaggcgc tgaacgagac 300 gccagagaag ctgaatcgcg atcgtcgcct ggtgctgcta agcgatccgg tgacgatggc 360 ccgtctgcat ttccgcgtct ggaattcccc ggagagatat tcttcatggg tgagttatta 420 cgaagggata aagctcaatc cactggcatt gcgtaaaccg gatgcggctt cgcaataaaa 480 acgtagttgc ctgatgcgct aegettatca ggcctacatc gttcctgcaa tttattgatt 540 ttgcaagatt ttgtaggtcg gataaggcgt tcac 574 < 210 > 4 < 211 > 1478 < 212 > DNA < 213 > Escherichia coli < 220 > < 221 > CDS < 222 > (25) .. (1449) < 400 > 4 gggataatgc ctgaggggcc tgta atg cgt ate ggc atg cgg ttg ttg ctg 51 Met Arg He Gly Met Arg Leu Leu Leu 8 ggc tat ttt tta ctg gtg gcg gtg gcg gcc tgg ttc gta ctg gct att 99 Gly Tyr Phe Leu Leu Val Wing Val Wing Wing Trp Phe Val Leu Wing He 10 15 20 25 ttt gtc aaa gaa gtt aaa ccg ggc gtg cga aga gca acc gag ggg acg 147 Phe Val Lys Glu Val Lys Pro Gly Val Arg Arg Ala Thr Glu Gly Thr 30 35 40 tta ate gac acc gca acg ttg ctg gcg gag ctg gcg cgt ccc gat ttg 195 Leu He Asp Thr Ala Thr Leu Leu Ala Glu Leu Ala Arg Pro Asp Leu 45 50 55 etc. tet ggg gac cea acg cat ggg ca gt cg cg gg cag ttt aat cag 243 Leu Ser Gly Asp Pro Thr His Gly Gln Leu Ala Gln Ala Phe Asn Gln 60 65 70 cta ca cat cat cgc ccg ttt cgc gcc aat ate ggt ggc att aac aaa gtg 291 Leu Gln His Arg Pro Phe Arg Ala Asn He Gly Gly He Asn Lys Val 75 80 85 cgc aac gaa tat cat gtc tat atg acc gat gcg cag ggc aaa gta ttg 339 Arg Asn Glu Tyr His Val Tyr Met Thr Asp Ala Gln Gly Lys Val Leu 90 95 100 105 ttc gat tcg gca aat aaa gcc gtt gga cag gat tat tcg cgc tgg aat 387 Phe Asp Being Wing Asn Lys Wing Val Gly Gln Asp Tyr Being Arg Trp Asn 110 115 120 gac gtc tgg cta acg ttg cgt ggt cag tat ggt gcg cgc age acg ttg 435 Asp Val Trp Leu Thr Leu Arg Gly Gln Tyr Gly Ala Arg Ser Thr Leu 125 130 135 caat aat ect gcc gat ccc gaa agt tet gtg atg tat gtt gcc gcg ccg 483 Gln Asn Pro Wing Asp Pro Glu Ser Ser Val Met Tyr Val Ala Wing Pro 140 145 150 att atg gac ggc tcg cgg ctt att ggc gtt ttg age gta ggc aaa ccg 531 He Met Asp Gly Ser Arg Leu He Gly Val Leu Ser Val Gly Lys Pro 155 160 165 aac gcg gcg atg gct ccg gtc att aag cgt age gag cgg cga att tta 579 Asn Ala Ala Met Ala Pro Val He Lys Arg Ser Glu Arg Arg He Leu 170 175 180 185 tgg gcc age gcc att ttg ttg ggg att gca ctg gtg att ggc gca ggc 627 Trp Wing Being Wing He Leu Leu Gly He Wing Leu Val He Gly Wing Gly 190 195 200 atg gtt tgg tgg ate aac cgc tet att gcc agg etc act cgc tat gct 675 Met Val Trp Trp He Asn Arg Ser He Wing Arg Leu Thr Arg Tyr Wing 205 210 215 gat tec gtc act gac aat aag ccc gtt ect etc ccc gat etc ggt agt 723 Asp Ser Val Thr Asp Asn Lys Pro Val Pro Leu Pro Asp Leu Gly Ser 220 225 230 age gag ttg cgt aaa etc gcg cag gcg ctg gaa agt atg cgc gtg aag 771 Ser Glu Leu Arg Lys Leu Ala Gln Ala Leu Glu Ser Met Arg Val Lys 235 240 245 ctg gaa ggg aaa aac tat att gag cag tat gtt tat gcg tta act cat 819 Leu Glu Gly Lys Asn Tyr He Glu Gln Tyr Val Tyr Ala Leu Thr His 250 255 260 265 gag cta aaa age cea ctg gcg gcg att cgt ggc gcg gcg gaa att tta 867 Glu Leu Lys Ser Pro Leu Wing Wing He Arg Gly Wing Wing Glu He Leu 270 275 280 cgc gaa ggt ccg ccg ccg gaa gtg gtg gct cgt ttt acc gac aac att 915 Arg Glu Gly Pro Pro Pro Glu Val Val Wing Arg Phe Thr Asp Asn He 285 290 295 ctg acg caa aat gcg cga atg cag gca ctg gtg gaa acg tta cta cgc 963 Leu Thr Gln Asn Ala Arg Met Gln Ala Leu Val Glu Thr Leu Leu Arg 300 305 310 cag gca aga ctg gag aat cgt cag gaa gtc gtt ctg act gct gtt gat 1011 Gln Ala Arg Leu Glu Asn Arg Gln Glu Val Val Leu Thr Ala Val Asp 315 320 325 gtg gcg gca tta ttt cgc cgc gtc age gaa gcg cgc acc gtg cag ttg 1059 Val Ala Ala Leu Phe Arg Arg Val Ser Glu Ala Arg Thr Val Gln Leu 330 335 340 345 gca gaa aaa aac ate act ttg cat gtt atg ect act gag gtt aac gtt 1107 Ala Glu Lys Asn He Thr Leu His Val Met Pro Thr Glu Val Asn Val 350 355 360 gct tet gaa ccg gcg tta ctg gg cag gcg ctg ggg aat tta ctg gat 1155 Wing Ser Glu Pro Wing Leu Leu Glu Gln Wing Leu Gly Asn Leu Leu Asp 365 370 375 aac gcc ate gat ttt act ccc gag age ggt tgc ata acg cta age gcc 1203 Asn Ala He Asp Phe Thr Pro Glu Ser Gly Cys He Thr Leu Ser Ala 380 385 390 gaa gtg gat cag gaa tac gtc acc ctt aag gtg ctg gat acc ggt agt 1251 Glu Val Asp Gln Glu Tyr Val Thr Leu Lys Val Leu Asp Thr Gly Ser 395 400 405 ggg att ect gac tac gcg ctg tea cgt att ttt gaa cgc ttt tac tet 1299 Gly He Pro A = p Tyr Ala Leu Ser Arg He Phe Glu Arg Phe Tyr Ser 410 415 420 425 ttg ccg cgt gca aat ggg caa aaa age age ggt ctg ggg ttg gcg ttt 1347 Leu Pro Arg Ala Asn Gly Gln Lys Ser Ser Gly Leu Gly Leu Ala Phe 430 435 440 gtc agt gag gtc gcc cgt ttg ttt aac ggc gaa gtc acg ctg cgc aac 1395 Val Ser Glu Val Wing Arg Leu Phe Asn Gly Glu Val Thr Leu Arg Asn 445 450 455 gtg cag gaa ggt ggc gtg ctg gcc tcg ctt cga ctt fall cgt falls ttc 1443 Val Gln Glu Gly Gly Val Leu Wing Ser Leu Arg Leu His Arg His Phe 460 465 470 here tag cttcaaattc ttcccacata gtcttcgta 1478 Thr 475 < 210 > 5 < 211 > 474 < 212 > PRT < 213 > Escherichia coli < 400 > 5 Met Arg He Gly Met Arg Leu Leu Leu Gly Tyr Phe Leu Leu Val Wing 1 5 10 15 Val Ala Ala Trp Phe Val Leu Ala He Phe Val Lys Glu Val Lys Pro 20 25 30 Gly Val Arg Arg Ala Thr Glu Gly Thr Leu He Asp Thr Ala Thr Leu 35 40 45 Leu Ala Glu Leu Ala Arg Pro Asp Leu Leu Ser Gly Asp Pro Thr His 50 55 60 Gly Gln Leu Wing Gln Wing Phe Asn Gln Leu Gln His Arg Pro Phe Arg 65 70 75 80 11 Wing Asn He Gly Gly He Asn Lys Val Arg Asn Glu Tyr His Val Tyr 85 90 95 Met Thr A = p Ala Gln Gly Lys Val Leu Phe Asp Ser Ala Asn Lys Ala 100 105 110 Val Gly Gln Asp Tyr Ser Arg Trp Asn Asp Val Trp Leu Thr Leu Arg 115 120 125 Gly Gln Tyr Gly Wing Arg Ser Thr Leu Gln Asn Pro Wing Asp Pro Glu 130 135 140 Ser Ser Val Met Tyr Val Wing Wing Pro He Met Asp Gly Ser Arg Leu 145 150 155 160 He Gly Val Leu Ser Val Gly Lys Pro Asn Ala Ala Ala Pro Wing Val 165 170 175 He Lys Arg Ser Glu Arg Arg He Leu Trp Wing Ser Wing He Leu Leu 180 185 190 Gly He Wing Leu Val He Gly Wing Gly Met Val Trp Trp He Asn Arg 195 200 205 Be He Wing Arg Leu Thr Arg Tyr Wing Asp Ser Val Thr Asp Asn Lys 210 215 220 Pro Val Pro Leu Pro Asp Leu Gly Be Ser Glu Leu Arg Lys Leu Ala 225 230 235 240 Gln Ala Leu Glu Ser Met Arg Val Lys Leu Glu Gly Lys Asn Tyr He 245 250 255 Glu Gln Tyr Val Tyr Ala Leu Thr His Glu Leu Lys Ser Pro Leu Ala 260 265 270 Ala He Arg Gly Ala Ala Glu He Leu Arg Glu Gly Pro Pro Pro Glu 275 280 285 Val Val Ala Arg Phe Thr Asp Asn He Leu Thr Gln Asn Ala Arg Met 290 295 300 Gln Ala Leu Val Glu Thr Leu Leu Arg Gln Ala Arg Leu Glu Asn Arg 305 310 315 320 Gln Glu Val Val Leu Thr Ala Val Asp Val Ala Ala Leu Phe Arg Arg 325 330 335 12 Val Ser Glu Ala Arg Thr Val Gln Leu Ala Glu Lys Asn He Thr Leu 340 345 350 His Val Met Pro Thr Glu Val Asn Val Ala Ser Glu Pro Ala Leu Leu 355 360"365 Glu Gln Ala Leu Gly Asn Leu Leu Asp Asn Ala He Asp Phe Thr Pro 370 375 380 Glu Ser Gly Cys He Thr Leu Ser Wing Glu Val Asp Gln Glu Tyr Val 385 390 395 400 Thr Leu Lys Val Leu Asp Thr Gly Ser Gly He Pro Asp Tyr Ala Leu 405 410 415 Being Arg He Phe Glu Arg Phe Tyr Being Leu Pro Arg Wing Asn Gly Gln 420 425 430 Lys Ser Ser Gly Leu Gly Leu Ala Phe Val Ser Glu Val Ala Arg Leu 435 440 445 Phe Asn Gly Glu Val Thr Leu Arg Asn Val Gln Glu Gly Gly Val Leu 450 455 460 Ala Ser Leu Arg Leu His Arg His Phe Thr 465 470 < 210 > 6 < 211 > 128 < 212 > DNA < 213 > Escherichia coli < 220 > < 221 > CDS < 222 > (1) .. (126) < 400 > 6 atg aaa ggt cgc ctg tta gat gct gtc ccg etc agt tec cta acg ggc 48 Met Lys Gly Arg Leu Leu Asp Ala Val Pro Leu Ser Ser Leu Thr Gly 1 5 10 15 gtt ggc gca gcg ctt agt aac aag ctg gcg aaa ate aac ctg cat acc 96 Val Gly Ala Ala Leu Ser Asn Lys Leu Ala Lys He Asn Leu His Thr 20 25 30 gta cag gat tta etc tta falls ctt ect ctg cg 128 13 Val Gln Asp Leu Leu Leu His Leu Pro Leu 35 40 < 210 > 7 < 211 > 42 < 212 > PRT < 213 > Escherichia coli < 400 > 7 Met Lys Gly Arg Leu Leu Asp Ala Val Pro Leu Ser Ser Leu Thr Gly 1 5 10 15 Val Gly Ala Ala Leu Ser Asn Lys Leu Ala Lys He Asn Leu His Thr 20 25 30 Val Gln Asp Leu Leu Leu His Leu Pro Leu 35 40 < 210 > 8 < 211 > 1174 < 212 > DNA < 213 > Escherichia coli < 220 > < 221 > CDS < 222 > (121) .. (837) < 400 > 8 agatgcacga tcgagtaggc cggataaggc gtttacgccg catccagcat ggaaaacgcg 60 cactttgtta tcaatetggg gccagcaaat gctggcctga tttgttcttg agggaagact 120 atg atg cgc aaa atg ctg ctg gcg gca gca ctt tea gtg acg gca atg 168 Met Met Arg Lys Met Leu Leu Ala Ala Ala Leu Ser Val Thr Ala Met 1 5 10 15 acc gct falls gcc gac tac cag tgc age gtc acg ccg cgt gac gat gtg 216 Thr Wing His Wing Asp Tyr Gln Cys Ser Val Thr Pro Arg Asp Asp Val 20 25 30 att gtc age ccg caà acc gtg cag gtg aag ggc gaa aac ggc aat ctg 264 He Val Ser Pro Gln Thr Val Gln Val Lys Gly Glu Asn Gly Asn Leu 35 40 45 gtg ate acg cea gac ggc aac gtg atg tat aac ggt aag caa tat tec 312 14 Val He Thr Pro Asp Gly Asn Val Met Tyr Asn Gly Lys Gln Tyr Ser 50 55 60 ctg aat gcc gcc cag cgc gag cag gcg aag gat tat cag gct gaa cta 360 Leu A = n Wing Wing Gln Arg Glu Gln Wing Lys A = p Tyr Gln Wing Glu Leu 65 70 75 80 cgt age acc ctg ccg tgg att gat gga gcg gcg aaa age cgc gtc gag 408 Arg Ser Thr Leu Pro Trp He Asp Gly Gly Wing Lys Ser Arg Val Glu 85 90 95 aaa gct cgt att gcg ctg gat aaa att ate gtt cag gag atg ggc gaa 456 Lys Wing Arg He Wing Leu Asp Lys He He Val Gln Glu Met Gly Glu 100 105 110 age age aaa atg cgc age cgt ctg acc aaa ctt gat gcg cag ctg aaa 504 Ser Ser Lys Met Arg Ser Arg Leu Thr Lys Leu Asp Ala Gln Leu Lys 115 120 125 gag cag atg aac cgc att ate gaa acg cgc age gat ggc ctg acg ttt 552 Glu Gln Met Asn Arg He He Glu Thr Arg Ser Asp Gly Leu Thr Phe 130 135 140 falls tat aaa gcc att gat cag gtt cgt gcc gaa ggc cag cata tta gtg 600 His Tyr Lys Ala He Asp Gln Val Arg Ala Glu Gly Gln Gln Leu Val 145 150 155 160 aat cag gca atg ggc gga att tta cag gac age att aat gaa atg ggc 648 Asn Gln Wing Met Gly Gly He Leu Gln Asp Ser He Asn Glu Met Gly 165 170 175 gcg aaa gcg gtg ctg aaa age ggc ggt aac cea tta cag aac gtg ctg 696 Wing Lys Wing Val Leu Lys Ser Gly Gly Asn Pro Leu Gln Asn Val Leu 180 185 190 gga age ctg ggc ggc ctg caa tec ate ate ca a acc gag tgg aaa aag 744 Gly Ser Leu Gly Gly Leu Gln Ser Ser He Gln Thr Glu Trp Lys Lys 195 200 205 cag gaa aaa gat ttc cag cag ttt ggc aaa gat gtt tgt age cgc gtt 792 Gln Glu Lys Asp Phe Gln Gln Phe Gly Lys Asp Val cys Ser Arg Val 210 215 220 gtg act ctg gaa gat age cgc aaa gcc ctg gtc ggg aat tta aaa 837 Val Thr Leu Glu Asp Ser Arg Lys Ala Leu Val Gly Asn Leu Lys 225 230 235 taatcctcta ttttaagacg geataatact tttttatgcc gtttaattct tcgttttgtt 897 acctgcctct aactttgtaa gggcgaattc tgcagatatc catcacactg gcggccgctc 957 gagcatgcat ctagagggcc caattcgccc tatagtgagt cgtattacaa ttcactggcc 1017 gtcgttttac aaccgtcgtg actgggaaaa ccctggcgtt acccaactta atcgccttgc 1077 agcacatccc cctttcgcca gctggcgtaa tagcgaaaag gcccgcaccg atcgcccttc 1137 caacagttgc gcacctgatg gccaatggac gcgcctg 1174 < 210 > 9 < 211 > 239 < 212 > PRT < 213 > Escherichia coli < 400 > 9 Met Met Arg Lys Met Leu Leu Ala Ala Ala Leu Ser Val Thr Ala Met 1 5 10 15 Thr Ala His Wing Asp Tyr Gln Cys Ser Val Thr Pro Arg Asp Asp Val 20 25 30 He Val Ser Pro Gln Thr Val Gln Val Lys Gly Glu Asn Gly Asn Leu 35 40 45 Val He Thr Pro Asp Gly Asn Val Met Tyr Asn Gly Lys Gln Tyr Ser 50 55 60 Leu Asn Ala Ala Gln Arg Glu Gln Ala Lys Asp Tyr Gln Ala Glu Leu 65 70 75 80 Arg Ser Thr Leu Pro Trp He Asp Gly Gly Wing Lys Ser Arg Val Glu 85 90 95 Lys Ala Arg He Ala Leu A = p Lys He He Val Gln Glu Met Gly Glu 100 105 110 Be Ser Lys Met Arg Ser Arg Leu Thr Lys Leu Asp Ala Gln Leu Lys 115 120 125 Glu Gln Met Asn Arg He He Gl'u Thr Arg Ser Asp Gly Leu Thr Phe 130 135 140 is Tyr Lys Ala He Asp Gln Val Arg Ala Glu Gly Gln Gln Leu Val 145 150 155 160 16 Asn Gln Wing Met Gly Gly He Leu Gln Asp Ser He Asn Glu Met Gly 165 170 175 Ala Lys Ala Val Leu Lys Ser Gly Gly Asn Pro Leu Gln Asn Val Leu 180 185 190 Gly Be Leu Gly Gly Leu Gln Be Ser He Gln Thr Glu Trp Lys Lys 195 200 205 Gln Glu Lys Asp Phe Gln Gln Phe Gly Lys Asp Val Cys Ser Arg Val 210 215 220 Val Thr Leu Glu Asp Ser Arg Lys Ala Leu Val Gly Asn Leu Lys 225 230 235 < 210 > 10 < 211 > 3406 < 212 > DNA < 213 > Escherichia coli < 220 > < 221 > CDS < 222 > (1007) .. (1276) < 220 > < 221 > CDS < 222 > (1280) .. (1792) < 220 > < 221 > CDS < 222 > (1798) .. (2574) < 220 > < 221 > CDS < 222 > (2604) .. (3398) < 400 > 10 gatgatggtg atggagcgta tttacggcat tccggtgtct gatgttgcga cgctggagaa 60 aaacggcacc aacatgaaat tgctggcgga acgcggcgtg caggtgttct tcactcaggt 120 ctttcgcgac agctttttcc atgetgatat gcaccctggc aacatetteg taagctatga 180 acacccggaa aacccgaaat atatcggcat tgattgcggg attgttggct cgctaaacaa 240 agaagataaa cgctatctgg cggaaaactt tatcgccttc tttaatcgcg actategeaa 300 17 agtggcagag ctacacgtcg attctggttg ggtgccacca gataccaaeg ttgaagagtt 360 egaatttgee attcgtacgg tctgtgaacc tatetttgag aaaccgctgg ccgaaatttc 420 gtttggacat gtactgttaa atctgtttaa tacggcgcgt cgcttcaata tggaagtgca 480 gccgcaactg gtgttactcc agaaaaccct gctctacgtc gaaggggtag gacgccagct 540 ttatccgcaa etegatttat ggaaaacggc gaagcctttc ctggagtcgt ggattaaaga 600 tcaggtcggt attcctgcgc tggtgagagc atttaaagaa aaagcgccgt tctgggtcga 660 aaaaatgcca gaactgcctg aactggttta cgacagtttg cgccagggca agtatttaca 720 gcatagtgtt ggtaagattg cccgcgagct tcagtcaaat catgtacgtc agggacaatt 780 cgcgttattt tctcggaatt ggcgctacgt tagtatttaa gtggcacatt cttgttggtc 840 agccgacctg aatgggggct gatgcccggc tggttaatgg caggtggtct gatcgcctgg 900 tttgtccggt tggcgcaaaa cacgctgatt ttttcatcgc tcaaggcggg ccgtgtaacg 960 tataatgcgg ctttgtttaa teatcateta ccacagagga acatgt atg ggt ggt 1015 Met Gly Gly 1 ate agt att tgg cag tta ttg att att gcc gtc ate gtt gta ctg ctt 1063 He Ser He Trp Gln Leu Leu He He Wing Val He Val Val Leu Leu 5 10 15 ttt ggc acc aaa aag etc ggc tec ate ggt tec gat ctt ggt gcg tcg 1111 Phe Gly Thr Lys Lys Leu Gly Ser He Gly Being Asp Leu Gly Wing Ser 20 25 30 35 ate aaa ggc ttt aaa aaa gca atg age gat gat gaa cea aag cag gat 1159 He Lys Gly Phe Lys Lys Ala Met Ser Asp Asp Glu Pro Lys Gln Asp 40 45 50 aaa acc age cag gat gct gat ttt act gcg aaa act ate gcc gat aag 1207 Lys Thr Ser Gln Asp Wing Asp Phe Thr Wing Lys Thr He Wing Asp Lys 55 60 65 cag gcg gat acg aat cag gaa cag gct aaa ata gaa gac gcg aag cgc 1255 Gln Wing Asp Thr Asn Gln Glu Gln Wing Lys He Glu Asp Wing Lys Arg 70 75 80 falls gat aaa gag cag gtg taa tet gtg ttt gat ate ggt ttt age gaa 1303 18 His Asp Lys Glu Gln Val Val Phe Asp He Gly Phe Ser Glu 85 90 95 ctg cta ttg gtg ttc ate ate ggc etc gtc gtt ctg ggg ccg caga cga 1351 Leu Leu Leu Val Phe He He Gly Leu Val Val Leu Gly Pro Gln Arg 100 105 110 ctg ect gtg gcg gta aaa acg gta gcg ggc tgg att cgc gcg ttg cgt 1399 Leu Pro Val Wing Val Lys Thr Val Wing Gly Trp He Arg Wing Leu Arg 115 120 125 130 tea ctg gcg here acg gtg cag aac gaa ctg acc cag gag tta aaa etc 1447 Ser Leu Ala Thr Thr Val Gln Asn Glu Leu Thr Gln Glu Leu Lys Leu 135 140 145 cag gag ttt cag gac agt ctg aaa aag gtt gaa aag gcg age etc act 1495 Gln Glu Phe Gln Asp Ser Leu Lys Lys Val Glu Lys Ala Ser Leu Thr 150 155 160 aac ctg acg ccc gaa ctg aaa gcg tcg atg gat gaa tta cgc cag gct 1543 Asn Leu Thr Pro Glu Leu Lys Wing Ser Met Asp Glu Leu Arg Gln Wing 165 170 175 gcg gag tcg atg aaa cgt tec tac gtt gca aac gat ect gaa aag gcg 1591 Wing Glu Ser Met Lys Arg Ser Tyr Val Wing Asn Asp Pro Glu Lys Wing 180 185 190 age gat gaa gcg falls acc ate cat aac ccg gtg gtg aaa gac aat gaa 1639 Ser Asp Glu Ala His Thr He His Asn Pro Val Val Lys Asp Asn Glu 195 200 205 210 act gcg cat gaa ggc gta acg ect gct gct gca cag acg cag gcc agt 1687 Thr Ala His Glu Gly Val Thr Pro Ala Ala Ala Gln Thr Gln Ala Ser 215 220 225 tcg ccg gaa cag aag cea gaa acc acg cea gag ccg gtg gta aaa ect 1735 Ser Pro Glu Gln Lys Pro Glu Thr Pro Pro Glu Pro Val Val Lys Pro 230 235 240 gct gcg gac gct gaa ccg aaa acc gct gca ect tec ect tcg tcg agt 1783 Wing Wing Asp Wing Glu Pro Lys Thr Wing Wing Pro Ser Pro Being Ser 245 250 255 gat aaa ccg taaac atg tet gta gaa gat act cac ccg ctt ate acg cat 1833 Asp Lys Pro Met Ser Val Glu Asp Thr Gln Pro Leu He Thr His 260 265 270 ctg att gag ctg cgt aag cgt ctg ctg aac tgc att ate tcg gtg ate 1881 19 Leu He Glu Leu Arg Lys Arg Leu Leu Asn Cys He He Ser Val He 275 280 285 gtg ata ttc ctg tgt ctg gtc tat ttc gcc aat gac ate tat fall ctg 1929 Val He Phe Leu Cys Leu Val Tyr Phe Wing Asn Asp He Tyr His Leu 290 295 300 305 gta tec gcg cea ctg ate aag cag ttg ccg ca ggt tea acg atg ate 1977 Val Ser Ala Pro Leu He Lys Gln Leu Pro Gln Gly Ser Thr Met He 310 315 320 gcc acc gac gtg gcc tcg ccg ttc ttt acg ccg ate aag ctg acc ttt 2025 Wing Thr Asp Val Ala Ser Pro Phe Phe Thr Pro He Lys Leu Thr Phe 325 330 335 atg gtg tcg ctg att ctg tea gcg ccg gtg att etc tat cag gtg tgg 2073 Met Val Ser Leu He Leu Ser Ala Pro Val He Leu Tyr Gln Val Trp 340 345 350 gcg ttt ate gcc cea gcg ctg tat aag cat gaa cgt cgc ctg gtg gtg 2121 Wing Phe He Wing Pro Wing Leu Tyr Lys His Glu Arg Arg Leu Val Val 355 360 365 ccg ctg ctg gtt tec age tet ctg ctg ttt tat ate ggc atg gcg ttc 2169 Pro Leu Leu Val Being Ser Leu Leu Phe Tyr He Gly Met Ala Phe 370 375 380 385 gcc tac ttt gtg gtc ttt ccg ctg gca ttt ggc ttc ctt gcc aat acc 2217 Wing Tyr Phe Val Val Phe Pro Leu Wing Phe Gly Phe Leu Wing Asn Thr 390 395 400 gcg ccg gaa ggg gta cag gta tec acc gac ate gcg age tat tta age 2265 Wing Pro Glu Gly Val Gln Val Ser Thr Asp He Wing Ser Tyr Leu Ser 405 410 415 ttc gtt atg gcg ctg ttt atg gcg ttt ggt gtc tec ttt gaa gtg ccg 2313 Phe Val Met Ala Leu Phe Met Ala Phe Gly Val Ser Phe Glu Val Pro 420 425 430 gtg gca att gtg ctg ctg tgc tgg atg ggg att acc tcg cea gaa gac 2361 Val Ala He Val Leu Leu Cys Trp Met Gly He Thr Ser Pro Glu Asp 435 440 445 tta cgc aaa aaa cgc ccg tat gtg ctg gtt ggt gca tte gtt gtc ggg 2409 Leu Arg Lys Lg Arg Pro Tyr Val Leu Val Gly Ala Phe Val Val Gly 450 455 460 465 atg ttg ctg acg ccg ccg gat gtc ttc tcg caa acg ctg ttg gcg ate 2457 Met Leu Leu Thr Pro Pro Asp Val Phe Ser Gln Thr Leu Leu Ala He 470 475 480 ect atg tac tgc ctg ttt gaa ate ggt gtc ttc ttc tea cgc ttt tac 2505 Pro Met Tyr Cys Leu Phe Glu He Gly Val Phe Phe Ser Arg Phe Tyr 485 490 495 gtt ggt aaa ggg cga aac cgg gaa gag gaa aac gac gct gaa gca gaa 2553 Val Gly Lys Gly Arg Asn Arg Glu Glu Glu A = n Asp Ala Glu Ala Glu 500 505 510 age gaa aaa act gaa gaa taa attcaaccgc ccgtcagggc ggttgtcat atg 2606 Ser Glu Lys Thr Glu Glu Met 515 520 gag tac agg atg ttt gat ate ggc gtt aat ttg acc agt tcg cat ttt 2654 Glu Tyr Arg Met Phe Asp He Gly Val Asn Leu Thr Ser Ser Gln Phe 525 530 535 gcg aaa gac cgt gat gat gtt gta gcg cgc gct ttt gac gcg gga gtt 2702 Wing Lys Asp Arg Asp Asp Val Val Wing Arg Wing Phe Asp Wing Gly Val 540 545 550 aat ggg cta etc ate acc ggt acc aat ctg cgt gaa age cag cag gcg 2750 Asn Gly Leu Leu He Thr Gly Thr Asn Leu Arg Glu Ser Gln Gln Ala 555 560 565 caa aag ctg gcg cgt cag tat tcg tec tgt tgg tea acg gcg ggc gta 2798 Gln Lys Leu Wing Arg Gln Tyr Ser Ser Cys Trp Ser Thr Ala Gly Val 570 575 580 585 cat ect falls gac age age cag tgg ca gct gtg act gaa gac gcg att 2846 His Pro His Asp Ser Ser Gln Trp Gln Ala Val Thr Glu Glu Ala He 590 595 600 att gag ctg gcc gcg cag cea gaa gtg gtg gcg att ggt gaa tgt ggt 2894 He Glu Leu Ala Ala Gln Pro Glu Val Val Ala He Gly Glu Cys Gly 605 610 615 etc gac ttt aac cgc aac ttt tcg acg ccg gaa gag cag gaa cgc gct 2942 Leu Asp Phe Asn Arg Asn Phe Ser Thr Pro Glu Glu Gln Glu Arg Wing 620 625 630 ttt gtt gcc cag cta cgc att gcc gca gaa tta aac atg ccg gta ttt 2990 Phe Val Wing Gln Leu Arg He Wing Wing Glu Leu Asn Met Pro Val Phe 635 640 645 atg falls tgt cgc gat gcc drops gag cgg ttt atg here ttg ctg gag ccg 3038 21 Met His Cys Arg Asp Ala His Glu Arg Phe Met Thr Leu Leu Glu Pro 650 655 660 665 tgg ctg gat aaa ctg ect ggt gcg gtt ctt cat tgc ttt acc ggc here 3086 Trp Leu Asp Lys Leu Pro Gly Ala Val Leu His Cys Phe Thr Gly Thr 670 675 680 cgc gaa gag atg cag gcg tgc gtg gcg tgt gga att tat ate ggc att 3134 Arg Glu Glu Met Gln Wing Cys Val Wing Cys Gly He Tyr He Gly He 685 690 695 acc ggt tgg gtt tgc gat gaa cga cgc ggg ctg gag ctg cgg gaa ttg 3182 Thr Gly Trp Val Cys Asp Glu Arg Arg Gly Leu Glu Leu Arg Glu Leu 700 705 710 ttg ccg ttg att ccg gcg gag aaa ttg ctg ate gaa act gat gcg ccg 3230 Leu Pro Leu He Pro Wing Glu Lys Leu Leu He Glu Thr Asp Wing Pro 715 720 725 tat ctg etc ect cgc gat etc acg cea aag cea tea tec cgg cgc aac 3278 Tyr Leu Leu Pro Arg Asp Leu Thr Pro Lys Pro Ser Ser Arg Arg Asn 730 735 740 745 gag cea gcc cat ctg ccc cat att ttg cag cgt att gcg falls tgg cgt 3326 Glu Pro Wing His Leu Pro His He Leu Gln Arg He Wing His Trp Arg 750 755 760 gga gaa gat gcc gca tgg ctg gct gcc acc acg gat gcc aat gtc aaa 3374 Gly Glu Asp Wing Wing Trp Leu Wing Wing Thr Thr Asp Wing Asn Val Lys 765 770 775 here ctg ttt ggg att gcg ttt tag agtttgcg 3406 Thr Leu Phe Gly He Wing Phe 780 785 < 210 > 11 < 211 > 89 < 212 > PRT < 213 > Escherichia coli < 400 > 11 Met Gly Gly He Ser He Trp Gl'n Leu Leu He He Ala Val He Val 1 5 10 15 Val Leu Leu Phe Gly Thr Lys Lys Leu Gly Ser He Gly Ser Asp Leu 20 25 30 22 Gly Wing Be He Lys Gly Phe Lys Lys Wing Met Ser Asp Asp Glu Pro 40 45 Lys Gln Asp Lys Thr Ser Gln A = p Wing Asp Phe Thr Wing Lys Thr He 50 55 60 Wing Asp Lys Gln Wing Asp Thr Asn Gln Glu Gln Wing Lys He Glu Asp 65 70 75 80 Ala Lys Arg His Asp Lys Glu Gln Val 85 < 210 > 12 < 211 > 171 < 212 > PRT < 213 > Escherichia coli < 400 > 12 Val Phe Asp He Gly Phe Ser Glu Leu Leu Leu Val Phe He He Gly 1 5 10 15 Leu Val Val Leu Gly Pro Gln Arg Leu Pro Val Val Wing Val Lys Thr Val 20 25 30 Wing Gly Trp He Arg Wing Leu Arg Being Leu Wing Thr Thr Val Gln Asn 35 40 45 Glu Leu Thr Gln Glu Leu Lys Leu Gln Glu Phe Gln Asp Ser Leu Lys 50 55 60 Lys Val Glu Lys Wing Ser Leu Thr Asn Leu Thr Pro Glu Leu Lys Wing 65 70 75 80 Being Met Asp Glu Leu Arg Gln Wing Wing Glu Being Met Lys Arg Being Tyr 85 90 95 Val Ala Asn Asp Pro Glu Lys Ala Ser Asp Glu Ala His Thr He His 100 105 105 Asn Pro Val Val Lys Asp Asn Glu Thr Wing His Glu Gly Val Thr Pro 115 120 125 Ala Ala Ala Gln Thr Gln Ala Be Ser Pro Glu Gln Lys Pro Glu Thr 130 135 140 Thr Pro Glu Pro Val Val Lys Pro Ala Wing Asp Ala Glu Pro Lys Thr 145 150 155 160 23 Ala Ala Pro Ser Pro Ser Ser Ser Asp Lys Pro 165 170 < 210 > 13 < 211 > 258 < 212 > PRT < 213 > Escherichia coli < 400 > 13 Met Ser Val Glu A = p Thr Gln Pro Leu He Thr His Leu He Glu Leu 1 5 10 15 Arg Lys Arg Leu Leu Asn Cys He He Ser Val He Val He Phe Leu 20 25 30 Cys Leu Val Tyr Phe Wing Asn Asp He Tyr His Leu Val Ser Wing Pro 35 40 45 Leu He Lys Gln Leu Pro Gln Gly Ser Thr Met He Wing Thr Asp Val 50 55 60 Wing Ser Pro Phe Phe Thr Pro He Lys Leu Thr Phe Met Val Ser Leu 65 70 75 80 He Leu Ser Ala Pro Val He Leu Tyr Gln Val Trp Ala Phe He Ala 85 90 95 Pro Ala Leu Tyr Lys His Glu Arg Arg Leu Val Val Pro Leu Leu Val 100 105 110 Be Being Leu Leu Phe Tyr He Gly Met Ala Phe Ala Tyr Phe Val 115 120 125 Val Phe Pro Leu Wing Phe Gly Phe Leu Wing Asn Thr Wing Pro Glu Gly 130 135 140 Val Gln Val Ser Thr Asp He Wing Ser Tyr Leu Ser Phe Val Met Wing 145 150 155 160 Leu Phe Met Wing Phe Gly Val Ser Phe Glu Val Pro Val Wing He Val 165 170 175 Leu Leu Cys Trp Met Gly He Thr Ser Pro Glu Asp Leu Arg Lys Lys 180 185 190 Arg Pro Tyr Val Leu Val Gly Ala Phe Val Val Gly Met Leu Leu Thr 24 195 200 205 Pro Pro Asp Val Phe Ser Gln Thr Leu Leu Ala He Pro Met Tyr Cys 210 215. 220 Leu Phe Glu He Gly Val Phe Phe Ser Arg Phe Tyr Val Gly Lys Gly 225 230 235 240 Arg Asn Arg Glu Glu Glu Asn Asp Wing Glu Wing Glu Ser Glu Lys Thr 245 250 255 Glu Glu < 210 > 14 < 211 > 264 < 212 > PRT < 213 > Escherichia coli < 400 > 14 Met Glu Tyr Arg Met Phe Asp He Gly Val Asn Leu Thr Ser Ser Gln 1 5 10 15 Phe Ala Lys Asp Arg Asp Asp Val Val Ala Arg Ala Phe Asp Ala Gly 20 25 30 Val Asn Gly Leu Leu He Thr Gly Thr Asn Leu Arg Glu Ser Gln Gln 35 40 45 Wing Gln Lys Leu Wing Arg Gln Tyr Ser Ser Cys Trp Ser Thr Wing Gly 50 55 60 Val His Pro His Asp Ser Ser Gln Trp Gln Ala Val Thr Glu Glu Ala 65 70 75 80 He He Glu Leu Ala Wing Gln Pro Glu Val Val Wing He Gly Glu Cys 85 90 95 Gly Leu Asp Phe Asn Arg Asn Phe Ser Thr Pro Glu Glu Gln Glu Arg 100 105 110 Ala Phe Val Ala Gln Leu Arg He Ala Ala Ala Glu Leu Asn Met Pro Val 115 120 125 Phe Met His Cys Arg Asp Ala His Glu Arg Phe Met Thr Leu Leu Glu 130 135 140 Pro Trp Leu A = p Lys Leu Pro Gly Ala Val Leu His Cys Phe Thr Gly 145 150 155 160 Thr Arg Glu Glu Met Gln Ala Cys Val Ala Cys Gly He Tyr He Gly 165 170. 175 He Thr Gly Trp Val Cys Asp Glu Arg Arg Gly Leu Glu Leu Arg Glu 180 185 190 Leu Leu Pro Leu He Pro Wing Glu Lys Leu Leu He Glu Thr Asp Wing 195 200 205 Pro Tyr Leu Leu Pro Arg Asp Leu Thr Pro Lys Pro Ser Ser Arg Arg 210 215 220 Asn Glu Pro Wing His Leu Pro His He Leu Gln Arg He Wing His Trp 225 230 235 240 Arg Gly Glu Asp Ala Ala Trp Leu Ala Ala Thr Thr Asp Ala Asn Val 245 250 255 Lys Thr Leu Phe Gly He Wing Phe 260 < 210 > 15 < 211 > 586 < 212 > DNA < 213 > Escherichia coli < 220 > < 221 > CDS < 222 > (170) .. (370) < 400 > 15 tcttaaacaa ccgtcgcttt gcgccgccgc aattattatg atgttttttt actcggcgct 60 tgattcacct tgttacagat tgetattgtg tgcgcgcgtc gaatgaccgt taatattetc 120 tggtttttaa ggcgcgttct gttgccggtt atatgtcaag aaggtatct atg ggt gag 178 Met Gly Glu 1 att agt att acc aaa ctg ctg gta gtt gcg gcg ctg gtc gtt ctg ctg 226 He Be He Thr Lys Leu Leu Val Val Ala Wing Leu Val Val Leu Leu 5 10 15 ttt ggg act aag aag tta cgt acg ctg ggc gga gac ctt gga gcg gcc 274 26 Phe Gly Thr Lys Lys Leu Arg Thr Leu Gly Gly Asp Leu Gly Ala Wing 20 25 30 35 att aaa ggg ttc aag aag gcg atg aat gat ga.c gat gct gcg gcg aaa 322 He Lys Gly Phe Lys Lys Wing Met Asn Asp Asp Asp Wing Wing Wing Lys 40 45 50 aaa ggc gca gac gtt gat ctt cag gct gaa aag etc tet cat aaa gag 370 Lys Gly Wing Asp Val Asp Leu Gln Wing Glu Lys Leu Ser His Lys Glu 55 60 65 tgacgtggcg agcaggacgc tccctcaata tcttgttcga tacaaaaacc cgcttcaaaa 430 agegggtttt ttatcagaca gatgtaagta attattacag gattaettaa cttccatccc 490 tttcgcctgc aaatcggcgt ggtaagaaga gcggacaaac ggaccgcatg cagcatgggt 550 aaagcccatc gccagcgctt cgctttcatt tcgtcg 586 < 210 > 16 < 211 > 67 < 212 > PRT < 213 > Escherichia coli < 400 > 16 Met Gly Glu He Ser He Thr Lys Leu Leu Val Val Ala Ala Leu Val 1 5 10 15 Val Leu Leu Phe Gly Thr Lys Lys Leu Arg Thr Leu Gly Gly Asp Leu 20 25 30 Gly Ala Ala He Lys Gly Phe Lys Lys Ala Met Asn Asp Asp Asp Ala 35 40 45 Ala Ala Lys Lys Gly Ala Asp Val Asp Leu Gln Ala Glu Lys Leu Ser 50 55 60 His Lys Glu 65 < 210 > 17 < 211 > 4200 < 212 > DNA < 213 > Salmonella typhimurium 27 < 220 > < 221 > CDS < 222 > (947) .. (1444) < 220 > < 221 > CDS < 222 > (1450) .. (1722) < 400 > 17 cgcaagtcaa tgtcgtcccg gtcgtatgta aaagtatgtg aatagggcgg gcgaaagcgg 60 ctaacaaaga ggcagegtga aggataatgt gtataatgcg gccctaataa ttcatcatct 120 atcacagagg aacatgtatg ggtggtatca gtatttggca gttgttgatt gttgccgtta 180 tcgtcgtact gctgttcggc accaaaaaac tcggttccat cggttccgat cttggcgcgt 240 ctatcaaagg ctttaaaaag gccatgagcg atgatgatgc caaacaggat aaaaccagtc 300 aggacgctga ttttaccgct aaatctatcg cggataagca aggcgaagcg aaaaaggaag 360 acgctaaaag ccaagataaa gagcaggtat aatccgtgtt tgatatcggt tttagcgaac 420 tgetgttagt gttcgttatc ggcctcattg tgttggggcc gcaacgattg ccagtagcgg 480 taaaaacggt agcgggctgg attcgcgcgt tgcggtccct tgcgacaacg gttcagaatg 540 aactgactca ggaactgaaa cttcaggagt tccaggacag tctgaaaaaa gtcgaaaagg 600 cgagcctgga aaatctgact cccgaactga aageatetat ggatgaactg cgtcaggcgg 660 cggagtcgat gaaacgcacc tacagegeta acgatcccga acaagegage gatgaagcgc 720 ataccatr.ca taatccggtg gtaaaaggga acgaaacgca gcatgagggc gtcacccctg 780 ccgccgctga aacacaggcg agcgcgccgg aacaaaagcc ggagcccgtt aaagctaacg 840 tgcctgagtc gacggaaacc gcttccgtag ccacgataga cgccgagaag aaatccgctg 900 cgcctgttgt cgaatcttcc ccctcgtcga gtgataaacc gtaaac atg gct gta 955 Met Ala Val 1 gaa gat act cac ccg ctt ate acg cat ctg ate gag ttg cgt aag cgc 1003 Glu Asp Thr Gln Pro Leu He Thr His Leu He Glu Leu Arg Lys Arg 5 10 15 ctg cta aac tgc ate gtc gca gta ctt ctg att ttt ctg gcg tta att 1051 28 Leu Leu A = n Cys He Val Wing Val Leu Leu He Phe Leu Wing Leu He 20 25 30 35 tat ttc gcc aat gat att tat cat tta gtc gcc gca ccg ctg att aaa 1099 Tyr Phe Wing Asn Asp He Tyr His Leu Val Wing Pro Pro Leu He Lys 40 45 50 cag atg ccg cag ggg gcg here atg att gcg acg gat gtg gcg tcg ccg 1147 Gln Met Pro Gln Gly Ala Thr Met He Ala Thr Asp Val Ala Ser Pro 55 60 65 ttt ttt acg ect ate aaa etc acc ttc atg gtg tet ttg ate tta tec 1195 Phe Phe Thr Pro He Lys Leu Thr Phe Met Val Ser Leu He Leu Ser 70 75 80 gcg ect gtc att ttg tac cag gtt tgg gcc ttt ate gcc ccg gcg ctg 1243 Ala Pro Val He Leu Tyr Gln Val Trp Ala Phe He Ala Pro Ala Leu 85 90 95 tat aag cat gag cgt cgt ctg gtc gta ect ctg ctg gta tec age tcg 1291 Tyr Lys His Glu Arg Arg Leu Val Val Pro Leu Leu Val Ser Ser Ser 100 105 110 115 ctg ctt ttc tat att ggt atg gcc ttc gcc tat ttt gtc gta ttc ect 1339 Leu Leu Phe Tyr He Gly Met Wing Phe Wing Tyr Phe Val Val Phe Pro 120 125 130 ttg gee ttt ggt ttc ctg acg cat acg gcg ccg gaa ggg gta cag gtt 1387 Leu Wing Phe Gly Phe Leu Thr His Thr Wing Pro Glu Gly Val Gln Val 135 140 145 tcg here gat ate gcc age tat ctt age ttt gtc atg gcg ctt ttt atg 1435 Ser Thr Asp He Ala Ser Tyr Leu Ser Phe Val Met Ala Leu Phe Met 150 155 160 gcc ttt gcg tagee ttt gaa gtg ccg gtg gcg att gtg ttg ctg tgc tgg 1485 Wing Phe Wing Phe Glu Val Pro Val Wing He Val Leu Leu Cys Trp 165 170 175 atg ggc ate acc ac cea gaa gat ttg cgt aaa aaa cgg ect tat ate 1533 Met Gly He Thr Thr Pro Glu Asp Leu Arg Lys Lys Arg Pro Tyr He 180 185 190 ctg gtc ggg gca ttc att gtg gga atg ctg ctt acg ccg cea gat gtt 1581 Leu Val Gly Wing Phe He Val Gly Met Leu Leu Thr Pro Pro Asp Val 195 200 205 210 ttc tcg caa acg ttg ctg gcg ata ccg atg tac tcc ctg ttt gaa att 1629 29 Phe Ser Gln Thr Leu Leu Ala He Pro Met Tyr Cys Leu Phe Glu He 215 220 225 ggc gtt ttc tgc tea cgc ttt tat gtc ggt aag cga cgg acg cgc gac 1677 Gly Val Phe Cys Ser Arg Phe Tyr Val Gly Lys Arg Arg Thr Arg Asp 230 235 240 gaa gat aac gag gcc gaa acc gaa aag gcc gag falls act gaa gac 1722 Glu Asp Asn Glu Wing Glu Thr Glu Lys Wing Glu His Thr Glu Asp 245 250 255 taaacacaac cgcccgccag ggcggttgtc atatgggggc aagcatgttt gatattggcg 1782 ttaatttaac cagtagccag tttgcaaaag atcgtgatga tgtggtcgcc cgtgcgtttg 1842 cggcgggagt aaaaggtatg ctactgaccg gaaegaacat ccatgaaagt cagcaggcgt 1902 taaaactggc gcggcgctac ccccattgtt ggtcgacggc tggcgtccat ccccatgaca 1962 gcagtcagtg gtcacccgcg tctgaagacg ccattattgc gctggcgaac cagccggaag 2022 tcgtcgctat cggtgagtgc gggctggatt tcaatcgcaa tttttccacg ccgcaggagc 2082 aggagcgtgc ctttcaggcg cagetacaaa ttgccgccga attgcagata ecaatettta 2142 tgcactgccg ggacgcgcat gagcgatttc tggtattgct tgatccctgg ctggatagtc 2202 ttcctggtgc aatactgcac tgctttaccg gttcacgcca gcaaatgeag gcctgtgtgg 2262 atagagggct ctatatcggt attaccgggt gggtttgcga cgaacgacgc gggcttgagc 2322 tacgtgaact cttaccgttt attccagcgg aaaagctact gatagaaacc gacgcgccgt 2382 atctgttgcc tcgcgatctt acgccgaaac caacgtcacg acgcaacgag cccgcgtatc 2442 tgcctcacat cctggagcgc atagegetat ggcgtggtga agatcegeaa tggttagcgg 2502 cgatgacaga tgccaacgcc agaacettat ttgaggttgt attctgaacg atcgctaaat 2562 cttgcgaaaa ccggtgtttt ttacgctctg cttcacttct ttattgagta aattaagcag 2622 taacategaa cgcgtttcgc catccggttc ggtaaaaatc gctttcagcc cttcaaatgc 2682 gccttccgtg atgatgacgc tatcgccggg atagggggtt tcaggatcga caacgccttc 2742 gggcttgtag atagaaagct gatgaataac gctggaaggc acgatcgcag gatgccgcca 2802 aagcgcacaa aatggetgac gccgcgcgtg gcgttgattg tagtggtatg tatcacttcc 2862 ggatcaaatt caacgaacag ataattagga aagageggtt cgctgacgga ggtacgtttt 2922 ccgcgtacca ttttttccag ggtgatcatc ggtgtcaggc aacttaccgc ttgtctttcg 2982 aggtgttcct gageaegetg aagttgcccg cgtttgcagt acagtaaata ccaggattgc 3042 ataatgaetc ttatccgctt gttcggggcg caageatage aaaagccatg cgcgaagtta 3102 attatacact teatcettta agccgtatct ggattagcgt tggttgccag agtteaeget 3162 aaatacagea aatttaacaa gaacgccgta tcccgatgat cttaccaaga taatgatgcg 3222 gacgccatga ggctacaatg aatatcaega tttacgcgac ttcctgacgc tacttgagca 3282 gcagggggaa ctaaaacgca tcacgctacc tgtggatcct catctggaaa tcacggaaat 3342 cgctgaccgc acgctgcgtg ccggtggacc ggcgttgctg tttgaaaatc ctaaaggtta 3402 cgccatgccg gtgctgtgca cacgccaaaa acctttttgg cgcgtggcga tgggcatggg 3462 gcaggatgat gtttccgcct tacgggaagt gggtaaatta ttagcgtttc ttaaagaacc 3522 tgagccgccg aaagcgtttc gcgatctgtt tgacaagctg ccgcagttta agcaagtgct 3582 gaatatgccg acgaaacggt tacgcggcgc gccttgccag cagaaaatcg cgtctggcga 3642 tgatgtcgat ttaacgcgtc ttcctgtcat gacctgttgg ccggacgacg ccgcgccgct 3702 ga ttacctgg ggactgacgg taacgcgtgg tccgcacaaa gagcggcaaa acctgggcat 3762 ttatcgtcag cagttgatag gtaaaaataa gctgattatg cgctggctgt ctcaccgcgg 3822 cggcgcgctg gattttcagg agtggttagc cgcgcgtccg ggtgaacgtt tcccggtcte 3882 cgtcgcattg ggcgccgatc cggcacgata cttggcgccg tgactcctgt tcccgatact 3942 ctgtcggagt atgcctttgc gggcctgctg cgcggcacga aaactgaagt ggttaatgct 4002 ttetaegate tggagtgctg cagcgcgaga tatcttgaag tacatgagcg gagagatgeg 4062 tgcgatcata cggagacgta cggcatatat gagtgatagc tcgtcttacg teaegeaata 4122 acagcgtaga tgeatetata teactataeg cgcgcatgag ctcgtatagg tgeetcatat 4182 ctcgtctatc tcaaagtc 4200 31 < 210 > 18 < 211 > 166 < 212 > PRT < 213 > Salmonella typhimurium < 400 > 18 Met Wing Val Glu Asp Thr Gln Pro Leu He Thr His Leu He Glu Leu 1 5 10 15 Arg Lys Arg Leu Leu Asn Cys He Val Wing Val Leu Leu He Phe Leu 20 25 30 Ala Leu He Tyr Phe Ala Asn Asp He Tyr His Leu Val Ala Ala Pro 35 40 45 Leu He Lys Gln Met Pro Gln Gly Ala Thr Met He Ala Thr Asp Val 50 55 60 Wing Ser Pro Phe Phe Thr Pro He Lys Leu Thr Phe Met Val Ser Leu 65 70 75 80 He Leu Ser Ala Pro Val He Leu Tyr Gln Val Trp Ala Phe He Ala 85 90 95 Pro Ala Leu Tyr Lys His Glu Arg Arg Leu Val Val Pro Leu Leu Val 100 105 110 Be Being Leu Leu Phe Tyr He Gly Met Ala Phe Ala Tyr Phe Val 115 120 125 Val Phe Pro Leu Wing Phe Gly Phe Leu Thr His Thr Wing Pro Glu Gly 130 135 140 Val Gln Val Ser Thr A = p He Wing Ser Tyr Leu Ser Phe Val Met Wing 145 150 155 160 Leu Phe Met Ala Phe Wing 165 < 210 > 19 < 211 > 91 < 212 > PRT < 213 > Salmonella typhimurium < 400 > 19 Phe Glu Val Pro Val Wing He Val Leu Leu Cys Trp Met Gly He Thr 1 5 10 15 32 atgggcacac ccgtataaac cgttatttca caatcaaccc ctaatactta cttaaggata 660 catcatgggc agtttttctc tgacgcactg gattatcgta ctgattatcg tcgttttgat 720 attcggcacc aaaaaactgc gcaacgtcgg caaagacctc ggcggtgcgg ttcatgactt 780 caaacagggg ctgaacgaag gtacagacgg caaagaagcc caaaaagacg atgtaatcga 840 acacaaaaaa gacgaagaca aagcgtaatt tatgtttgat ttcggtttgg gcgagctggt 900 ttttgtcggc attatcgccc tgattgtcct cggccccgaa cgcctgcccg aggccgcccg 960 caccgccgga cggctcatcg gcaggctgca acgctttgtc ggcagcgtca aacaggaatt 1020 tgacacgcaa atcgaactgg aagaactaag gaaggcaaag caggaatttg aagctgccgc 1080 tgctcaggtt cgagacagcc tcaaagaaac cggtacggat atggagggta atctgcacga 1140 catttccgac ggtctgaagc cttgggaaaa actgcccgaa cagcgcacgc ctgctgattt 1200 cggtgtcgat gaaaacggca atccctttcc cgatgcggca aacaccctat tagacggcat 1260 ttccgacgtt atgccgtccg aacgttccta cgcttccgcc gaaacccttg gggacagcgg 1320 gcaaaccggc agtacagccg aacccgcgga aaccgaccaa gaccgtgcat ggcgggaata 1380 cctgactgct tctgccgccg cacccgtcgt acagaccgtc gaagtcagct atatcgatac 1440 cgctgttgaa acccctgttc cgcataccac ttcgctgcgt aaacaggcaa taagccgcaa 1500 aegegatttg cgtcctaaat cccgcgccaa acctaaattg cgcgtccgta aatcataaag 1560 agggcaatcc g gtg tec gaa here caac aac gaa cac ccc gtc caa ccg ctt 1610 Val Ser Glu Thr Gln Asn Glu Gln Pro Val Gln Pro Leu 1 5 10 gtc gag cat etc ate gag cgc cgc cgc cgc ctg atg tgg acg gtt gtc 1658 Val Glu His Leu He Glu Leu Arg Arg Arg Leu Met Trp Thr Val Val 15 20 25 ggt ate tta gtc tgc ttt tcc ggc cta atg ccg ttt gcc ca ca cac 1706 Gly He Leu Val Cys Phe Phe Gly Leu Met Pro Phe Wing Gln Gln Leu 30 35 40 45 tat act ttt ate gcc gac ccg ctg atg gca aac ctg ccc aaa gac acc 1754 Tyr Thr Phe He Wing Asp Pro Leu Met Wing Asn Leu Pro Lys Asp Thr 50 55 60 34 Thr Pro Glu Asp Leu Arg Lys Lys Arg Pro Tyr He Leu Val Gly Wing 20 25 30 Phe He Val Gly Met Leu Leu Thr Pro Pro Asp Val Phe Ser Gln Thr 35 40 45 Leu Leu Ala He Pro Met Tyr Cys Leu Phe Glu He Gly Val Phe Cys 50 55 60 Be Arg Phe Tyr Val Gly Lys Arg Arg Thr Arg Asp Glu Asp Asn Glu 65 70 75 80 Ala Glu Thr Glu Lys Ala Glu His Thr Glu Asp 85 90 < 210 > 20 < 211 > 2601 < 212 > DNA < 213 > Neisseria meningitidis < 220 > < 221 > CDS < 222 > (1572) .. (2339) < 400 > 20 agacaaaatc ctaaaaaaag tgattgaaga ggcgggcgaa gtgttgatgg catccaaaga 60 caaaaacccg tcccacctgg tttacgaagt tgccgactta tggtttcaca ccatgattct 120 tctgacacac cacgacctga aggcggaaga cgtattggac gaacttgcgc gccgccaagg 180 tttgtcgggc ttggccgaaa aagccgctcg cacagaatct tgaatttata ttaaaatccg 240 cactttccca cattcaatcc gtctgaccgc tgttcagacg gcatcggagc cgttatggac 300 aactgtattt tctgcaaaat cgccgccaaa gacattccgg cgcaaaccgt ctatgaagac 360 ggcgaaatgg tttgtttcaa agacatcaac cccgctgctc cggttcatct gctgctgatt 420 cccaaagtcc atttegatte gttggcacac gccgcgcccg aacatcagcc ccttttggga 480 aaaatgatge tgaaagttec cgaaatcgcc aaageggeag gactggcaga cggcttcaaa 540 accctgatca acaccggaaa aggcggcgga caagaggtct tccacctgca tatacacatc 600 33 age atg att gcc acc gat gtc ate gca cea ttt ttc gtg ccg gtc aaa 1802 Ser Met He Wing Thr Asp Val He Wing Pro Phe Phe Val Pro Val Lys 65 70 75 gtt acc ctg atg gcg gca ttt tta att tcg ctg ccg cat acg etc tac 1850 Val Thr Leu Met Wing Wing Phe Leu He Ser Leu Pro His Thr Leu Tyr 80 85 90 ca ate tgg gca ttc gtc gcc ccc gca etc tac caa aac gaa aaa cgc 1898 Gln He Trp Ala Phe Val Ala Pro Ala Leu Tyr Gln Asn Glu Lys Arg 95 100 105 ctg att acg ccg etc gtc etc tec age gtc age ctg ttt ttc ate ggc 1946 Leu He Thr Pro Leu Val Leu Ser Ser Val Ser Leu Phe Phe He Gly 110 115 120 125 atg gca ttt gcc tac ttt ttg gtt ttc ccc gtc att ttc aaa ttc ctt 1994 Met Ala Phe Ala Tyr Phe Leu Val Phe Pro Val He Phe Lys Phe Leu 130 135 140 gcc age gtt acc ect gtc ggt gtc aat atg gcg here gac ate gac aaa 2042 Wing Being Val Thr Pro Val Gly Val Asn Met Wing Thr Asp He Asp Lys 145 150 155 tac etc tec ttc ate ttg ggg atg ttt gtc gca ttc ggt here acg ttt 2090 Tyr Leu Ser Phe He Leu Gly Met Phe Val Wing Phe Gly Thr Thr Phe 160 165 170 gaa gtc ccc att gtc gtt ate ctg tta acc aaa att ggt gtg gta here 2138 Glu Val Pro He Val Val He Leu Leu Thr Lys He Gly Val Val Thr 175 180 185 acc gaa cag etc aaa ege gcc cgc ccc tat gtg att gtc ggc gcg ttt 2186 Thr Glu Gln Leu Lys Arg Wing Arg Pro Tyr Val He Val Gly Wing Phe 190 195 200 205 gtc att gcc gcc ate ate acg ccg ccc gat gtg att tea cac acc ctg 2234 Val He Ala Ala He He Thr Pro Pro Asp Val He Ser Gln Thr Leu 210 215 220 ctt gcc att ccg ctg att etc tta tac gaa gca ggt att tgg ttc gga 2282 Leu Ala He Pro Leu He Leu Leu Tyr Glu Wing Gly He Trp Phe Gly 225 '230 235 cgc ttt ttc acg cea cgt tea gaa cag gat ggc gac ata cag ccg ect 2330 Arg Phe Phe Thr Pro Arg Ser Glu Gln Asp Gly Asp He Gln Pro Pro 240 245 250 gca here acc tgacactatg ccgtccgaac ctccgcctca taccgccaca 2379 Ala Thr Thr 255 gattaaggaa tacctttgaa taccctctat ttaggttcaa acagcccgcg ccgaatggaa 2439 atcctgacac agttgggcta tcaggtcgtc aagctgcctg ccaacatcga cgaaacggtc 2499 agacagaacg aagaccctgc ccgttacgtt caaaggatgg cagaagaaaa aaaccgaacc 2559 gccctgaccc tcttttgcga aaccaacggc acaatgcccg at 2601 < 210 > 21 < 211 > 256 < 212 > PRT < 213 > Neisseria meningitidis < 400 > 21 Val Ser Glu Thr Gln Asn Glu Gln Pro Val Gln Pro Leu Val Glu His 1 5 10 15 Leu He Glu Leu Arg Arg Arg Leu Met Trp Thr Val Val Gly He Leu 20 25 30 Val Cys Phe Phe Gly Leu Met Pro Phe Wing Gln Gln Leu Tyr Thr Phe 35 40 45 He Wing Asp Pro Leu Met Wing Asn Leu Pro Lys Asp Thr Ser Met He 50 55 60 Wing Thr Asp Val He Wing Pro Phe Phe Val Pro Val Lys Val Thr Leu 65 70 75 80 Met Ala Ala Phe Leu He Ser Leu Pro His Thr Leu Tyr Gln He Trp 85 90 95 Wing Phe Val Wing Pro Wing Leu Tyr Gln Asn Glu Lys Arg Leu He Thr 100 105 110 Pro Leu Val Leu Ser Ser Val Ser Leu Phe Phe He Gly Met Ala Phe 115 120 125 Ala Tyr Phe Leu Val Phe Pro Val He Phe Lys Phe Leu Ala Ser Val 130 135 140 Thr Pro Val Gly Val Asn Met Wing Thr Asp He Asp Lys Tyr Leu Ser 145 150 155 160 36 Phe He Leu Gly Met Phe Val Wing Phe Gly Thr Thr Phe Glu Val Pro 165 170 175 He Val Val He Leu Leu Thr Lys He Gly Val Val Thr Thr Glu Gln 180 185 190 Leu Lys Arg Wing Arg Pro Tyr Val He Val Gly Wing Phe Val He Wing 195 200 205 Ala He He Thr Pro Pro Asp Val He Be Gln Thr Leu Leu Ala He 210 215 220 Pro Leu He Leu Leu Tyr Glu Wing Gly He Trp Phe Gly Arg Phe Phe 225 230 235 240 Thr Pro Arg Ser Glu Gln Asp Gly Asp He Gln Pro Pro Wing Thr Thr 245 250 255 < 210 > 22 < 211 > 4604 < 212 > DNA < 213 > Escherichia coli < 220 > < 221 > CDS < 222 > (2982) .. (4082) < 220 > < 221 > CDS < 222 > (1534) .. (2637) < 220 > < 221 > CDS < 222 > (749) .. (1531) < 220 > < 221 > CDS < 222 > (6) .. (746) < 400 > 22 ggcta gtt gat gat aat ttg aaa ggt caa ggt gca gga aaa aat ttt tta 50 Val Asp Asp Asn Leu Lys Gly Gln Gly Wing Gly Lys Asn Phe Leu 1 5 10 15 tcg ctg ata aag tac age gag here gat tat here att tat tgt gae caa 98 37 Ser Leu He Lys Tyr Ser Glu Thr Asp Tyr Thr He Tyr Cys Asp Gln 20 25 30 gat gat att tgg tta gaa aac aaa ata ttt gaa tta gta aag tat gca 146 Asp Asp He Trp Leu Glu A = n Lys He Phe Glu Leu Val Lys Tyr Ala 35 40 45 aat gaa att aaa ttg aat gta tea gat gcg ect tcg cta gtt tat gct 194 Asn Glu He Lys Leu Asn Val Ser Asp Ala Pro Ser Leu Val Tyr Ala 50 55 60 gat ggc tat gct tat atg gat ggt gag ggt ac ate gat ttt tet ggg 242 Asp Gly Tyr Wing Tyr Met Asp Gly Glu Gly Thr He Asp Phe Ser Gly 65 70 75 ata tet aac aat cat gct gat cata tta aag gat ttt ctt ttt ttt aat 290 He As Asn Asn His Wing Asp Gln Leu Lys Asp Phe Leu Phe Phe Asn 80 85 90 95 ggt gga tac ca gga tgt tet att atg ttc aat cgt gca atg acc aaa 338 Gly Gly Tyr Gln Gly Cys Ser Met Met Phe Asn Arg Ala Met Thr Lys 100 105 110 ttt ctt ctg aat tat cga gga ttt gta tat cta cat gac gat ate here 386 Phe Leu Leu Asn Tyr Arg Gly Phe Val Tyr Leu His Asp Asp He Thr 115 120 125 here tta gct gca tac gct ctt ggt aaa gtt tat ttt etc ccg aaa tac 434 Thr Leu Ala Ala Tyr Ala Leu Gly Lys Val Tyr Phe Leu Pro Lys Tyr 130 135 140 ctt atg tta tat aga cag falls acg aat gcg gta act ggt ate aaa here 482 Leu Met Leu Tyr Arg Gln His Thr Asn Ala Val Thr Gly He Lys Thr 145 150 155 ttc cgc aat gga ttg act tet aaa ttt aaa tea cea gta aac tat ctt 530 Phe Arg Asn Gly Leu Thr Ser Lys Phe Lys Ser Pro Val Asn Tyr Leu 160 165 170 175 tta tea cga aaa cat tat cag gta aaa aaa tet ttt ttt gaa tgt aac 578 Leu Ser Arg Lys His Tyr Gln Val Lys Lys Ser Phe Phe Glu Cys Asn 180 185 190 age tet ate tta tea gag acg aat aaa aaa gtt ttt ttg gat ttt att 626 Ser Ser He Leu Ser Glu Thr Asn Lys Lys Val Phe Leu Asp Phe He 195 200 205 tea ttt tgt gaa tea aat aat aaa ttt here gat ttt ttt aag tta tgg 674 38 Ser Phe Cys Glu Ser Asn Asn Lys Phe Thr Asp Phe Phe Lys Leu Trp 210 215 220 cga ggt ggg ttt aga tta aat aac agt aga act aaa tta tta tta aaa 722 Arg Gly Gly Phe Arg Leu Asn Asn Ser Arg Thr Lys Leu Leu Leu Lys 225 230 235 ttc tta ata cgg aga aaa ttt age ga atg att tea ata ctt here ect 769 Phe Leu He Arg Arg Lys Phe Ser Met He Ser He Leu Thr Pro 240 245 250 act ttt aat cgg caa cat act tta tea agg cta ttc aat tet ctt ata 817 Thr Phe Asn Arg Gln His Thr Leu Ser Arg Leu Phe Asn Ser Leu He 255 260 265 270 tta ca gat act gat aaa gat ttt gag tgg ata att ata gat gat ggt agt 865 Leu Gln Thr A = p Lys Asp Phe Glu Trp He He He Asp Asp Gly Ser 275 280 285 ata gat gca here gcg gta ctt gta gaa gat ttt aga aaa aga tgt gat 913 He Asp Ala Thr Ala Val Leu Val Glu Asp phe Arg Lys Lys Cys Asp 290 295 300 ttt gac ttg att tat tgc tat cag gaa aat aat ggt aag ccc atg gct 961 Phe Asp Leu He Tyr Cys Tyr Gln Glu Asn Asn Gly Lys Pro Met Ala 305 310 315 tta aac gct ggt gtt aaa gct tgt aga ggc gat tat ate ttt att gtt 1009 Leu Asn Wing Gly Val Lys Wing Cys Arg Gly Asp Tyr He Phe He Val 320 325 330 gac agt gat gat gca cta act ccc gat gcc ata aaa tta att aaa gaa 1057 Asp Ser Asp Asp Ala Leu Thr Pro A = p Ala He Lys Leu He Lys Glu 335 340 345 350 tea ata cat gat tgc tta tet gag aag gaa agt ttc age gga gtc ggt 1105 Ser He His Asp Cys Leu Ser Glu Lys Glu Ser Phe Ser Gly Val Gly 355 360. 365 ttt aga aaa gca tat ata aaa ggg ggg att att ggt aat gat tta aat 1153 Phe Arg Lys Wing Tyr He Lys Gly Gly He He Gly Asn Asp Leu Asn 370 375 380 aat tet tea gaa cat ata tac tat tta aat gcg act gag att age aat 1201 Asn Ser Ser Glu His He Tyr Tyr Leu Asn Wing Thr Glu He Ser Asn 385 390 395 tta ata aat ggt gat gtt gca tat tgt ttt aaa aaa gaa agt ttg gta 1249 39 Leu He Asn Gly Asp Val Ala Tyr Cys Phe Lys Lys Glu Ser Leu Val 400 405 410 aaa aat cea tcc ccc cgt ata gaa gat gaa aaa ttt gtt cea gaa tta 1297 Lys Asn Pro Phe Pro Arg He Glu Asp Glu Lys Phe Val Pro Glu Leu 415 420 425 430 tat att tgg aat aaa ata act gac aag gcg aag att cga ttt aac ata 1345 Tyr He Trp Asn Lys He Thr Asp Lys Wing Lys He Arg Phe A = n He 435 440 445 age aaa gtt ata tat ctt tgt gag tat ctt gat gat ggt ctt tet aaa 1393 Ser Lys Val He Tyr Leu Cys Glu Tyr Leu Asp Asp Gly Leu Ser Lys 450 455 460 aat ttc cat aac cag ctt aaa aaa tac cea aag ggg ttt aag att tat 1441 Asn Phe His A = n Gln Leu Lys Lys Tyr Pro Lys Gly Phe Lys He Tyr 465 470 475 tac aaa gat ca aga aga aga cga gag aaa act tat ata aaa aaa here aag 1489 Tyr Lys Asp Gln Arg Lys Arg Glu Lys Thr Tyr He Lys Lys Thr Lys 480 485 490 atg cta att aga tat ttg ca tgt tgt tat tat gag aaa ata aa atg 1536 Met Leu He Arg Tyr Leu Gln Cys Cys Tyr Tyr Glu Lys He Met 495 500 505 aaa ata cta ttt gtc att here ggt tta ggc ctt gga ggt gct gag aag 1584 Lys He Leu Phe Val He Thr Gly Leu Gly Leu Gly Gly Wing Glu Lys 510 515 520 525 cag gtt tgt ctt tta gct gat aaa tta agt tta age ggg falls cat gta 1632 Gln Val Cys Leu Leu Wing Asp Lys Leu Ser Leu Ser Gly His His Val 530 535 540 aa att att tea ctt gga cat atg tet aat aat aaa gtc ttt ect age 1680 Lys He He Ser Leu Gly His Met Ser A = n Asn Lys Val Phe Pro Ser 545 550 555 gaa aat aat gtt aat gtc att aat gta aat atg tea aaa att tet 1728 Glu Asn Asn Val Asn Val He Asn Val Asn Met Ser Lys Asn He Ser 560 565 570 gga gtt ata aaa ggt tgt gtc aga att aga gat gtt ata gct aat ttc 1776 Gly Val He Lys Gly Cys Val Arg He Arg Asp Val He Wing Asn Phe 575 580 585 aaa cea gac att gta falls agt cat atg ttt cat gca aac att ate act 1824 40 Lys Pro Asp He Val His Ser His Met Phe His Wing Asn He He Thr 590 595 600 605 aga ttg tet gta att gga ate aaa aac aga ect: ggt att ata tea act 1872 Arg Leu Ser Val He Gly He Lys Asn Arg Pro Gly He He Ser Thr 610 615 620 gca cat aat aaa aat gaa ggt ggg tat ttc aga atg etc here tat aga 1920 Ala His Asn Lys Asn Glu Gly Gly Tyr Phe Arg Met Leu Thr Tyr Arg 625 630 635 ata acc gat tgt tta agt gat tgt tgt here aat gtt age aaa gaa gca 1968 He Thr Asp Cys Leu Ser Asp Cys Cys Thr Asn Val Ser Lys Glu Ala 640 645 650 gtg gat gag ttt tta cgg ata aaa gcc ttt aat ccc gct aaa gca att 2016 Val A = p Glu Phe Leu Arg He Lys Ala Phe A = n Pro Ala Lys Ala He 655 660 665 act atg tat aat ggg ata gat acc aat aaa ttt aaa ttt gat tta ttg 2064 Thr Met Tyr Asn Gly He Asp Thr Asn Lys Phe Lys Phe Asp Leu Leu 670 675 680 685 gca agg agg gaa att cga gac ggt att aat ata aaa aat gat gat ata 2112 Wing Arg Arg Glu He Arg Asp Gly He Asn He Lys Asn Asp Asp He 690 695 700 tta tta ctt gct gg ggt cgt tta acg tta gct aaa gat tat ect aat 2160 Leu Leu Leu Wing Wing Gly Arg Leu Thr Leu Wing Lys Asp Tyr Pro Asn 705 710 715 tta ttg aat gca atg act ctg ctt ect gaa falls ttt aaa ctt att att 2208 Leu Leu Asn Wing Met Thr Leu Leu Pro Glu His Phe Lys Leu He He 720 725 730 att ggt gat ggt gaa ttg cgt gae gaa att aat atg ctt ata aaa aaa 2256 He Gly Asp Gly Glu Leu Arg Asp Glu He Asn Met Leu He Lys Lys 735 740 745 ttg cata tta tet aat agg gtg tec ttg ttg gga gtt aaa aaa aat att 2304 Leu Gln Leu Ser Asn Arg Val Ser Leu Leu Gly Val Lys Lys Asn He 750 755 760 765 gct ccc tat ttt tet gea tgt gat att ttt gtt etc tet tet cgt tgg 2352 Wing Pro Tyr Phe Be Wing Cys Asp He Phe Val Leu Ser Ser Arg Trp 770 775 780 gaa gga ttt gga tta gtc gtg gca gaa gct atg tea tgt gag cga att 2400 41 Glu Gly Phe Gly Leu Val Val Ala Glu Ala Met Ser Cys Glu Arg He 785 790 795 gtt gtt ggc acg gat tea ggg gga gta aga gaa gtt att ggt gac gat 2448 Val Val Gly Thr Asp Ser Gly Gly Val Arg Glu Val He Gly Asp Asp 800 805 810 gat ttt ctt gta ccc ata tet gat tea here caá ctt gca age aaa att 2496 Asp Phe Leu Val Pro He Ser Asp Ser Thr Gln Leu Ala Ser Lys He 815 820 825 gaa aaa ttg tet ttg age cag ata cgt gat fall att ggt ttt cgg aat 2544 Glu Lys Leu Ser Leu Ser Gln He Arg Asp His He Gly Phe Arg Asn 830 835 840 845 cgt gag cgt att tta aaa aat ttc tea ata gat att act att atg cag 2592 Arg Glu Arg He Leu Lys Asn Phe Ser He Asp Thr He He Met Gln 850 855 860 tgg caa gaa etc tat gga act ata att tgc tea aaa cat gaa agg 2637 Trp Gln Glu Leu Tyr Gly Thr He He Cys Ser Lys His Glu Arg 865 870 875 tagatttata tttggaacgt gtcttttgtt tgaatttaat tcaatetcaa ttgagatttt 2697 tgtatttcaa aaataccatc atagetaaeg atgattggta tttattttaa gatgctttct 2757 ataaatatat tgaegttttt aatgcgccga aacgattggg ctgggaacag agaagtaaaa 2817 ctgttttgag aatgaagagt ttttgagatg tttatggata ttaaaaattg atccagtgaa 2877 ttaattattt ataataaatc aagatttaat gttaatáaat gataatcttt tctgacactc 2937 atattaatta tgagtggtac gtttggtaaa cggtaaacta ttat atg here gct aga 2993 Met Thr Ala Arg 880 here act aaa gtt ttg falls tta cata tta etc cea etc tta agt ggc gtt 3041 Thr Thr Lys Val Leu His Leu Gln Leu Pro Leu Leu Ser Gly Val 885 890 895 cag agg gta here tta aac gaa att agt gcg tta tat act gat gat gat 3089 Gln Arg Val Thr Leu Asn Glu Be Ser Wing Leu Tyr Thr Asp Tyr Asp 900 905 910 tat here cta gtt tgc tea aaa aaa ggt cea cta here aaa gca ttg ctg 3137 Tyr Thr Leu Val Cys Ser Lys Lys Gly Pro Leu Thr Lys Ala Leu Leu 915 920 925 42 gaa tat gat gtc gat tgt cat tgt ate ccc gaa ctt acg aga gaa att 3185 Glu Tyr Asp Val Asp Cys His Cys He Pro Glu Leu Thr Arg Glu He 930 935 940 acc gta aag aat gat ttt aaa gca ttg ttc aag ctt tat aag ttc ata 3233 Thr Val Lys Asn Asp Phe Lys Ala Leu Phe Lys Leu Tyr Lys Phe He 945 950 955 960 aaa aaa gaa aaa ttt gac att gtg cat here cat tet tea aaa here ggt 3281 Lys Lys Glu Lys Phe A = p He Val His Thr His Ser Ser Lys Thr Gly 965 970 975 att ttg ggg cga gtt gct gcc aaa tta gca cgt gtt gga aag gtg ate 3329 He Leu Gly Arg Val Ala Ala Lys Leu Ala Arg Val Gly Lys Val He 980 985 990 falls act gta cat ggt ttt tet ttt cea gcc gca tet agt aaa aaa agt 3377 His Thr Val His Gly Phe Ser Phe Pro Wing Wing Ser Ser Lys Lys Ser 995 1000 1005 tat tac ctt tat ttt ttc atg gaa tgg ata gca aag ttc ttt acg gat 3425 Tyr Tyr Leu Tyr Phe Phe Met Glu Trp He Wing Lys Phe Phe Thr Asp 1010 1015 1020 aag tta ate gtc ttg aat gta gat gat gaa tat ata gca ata aac aaa 3473 Lys Leu He Val Leu Asn Val Asp Asp Glu Tyr He Ala He Asn Lys 1025 1030 1035 1040 tta aaa ttc aag cgg gat aaa gtt ttt tta att ect aat gga gta gac 3521 Leu Lys Phe Lys Arg Asp Lys Val Phe Leu He Pro Asn Gly Val Asp 1045 1050 1055 act gat aag ttt tet ect tta gaa aat aaa att tat agt age acc ttg 3569 Thr Asp Lys Phe Ser Pro Leu Glu Asn Lys He Tyr Ser Ser Thr Leu 1060 1065 1070 aat cta gta atg gtt ggt aga tta tec aag caa aaa gat ect gag here 3617 Asn Leu Val Met Val Gly Arg Leu Ser Lys Gln Lys Asp Pro Glu Thr 1075 1080 1085 tta ttg ctt gtt gtt gaa aaa ctg ctg aat gaa aat gtt aat gtt aag 3665 Leu Leu Leu Ala Val Glu Lys Leu Leu Asn Glu Asn Val Asn Val Lys 1090 1095 1100 ctg here ctt gta gga gat ggt gaa cta aaa gaa cag tta gaa age agg 3713 Leu Thr Leu Val Gly Asp Gly Glu Leu Lys Glu Gln Leu Glu Ser Arg 1105 1110 1H5 1120 43 ttc aaa cgg ca g gat cgt ata att ttt cat gga tgg tea gat aac 3761 Phe Lys Arg Gln Asp Gly Arg He He Phe His Gly Trp Ser Asp Asn 1125 1130. 1135 att gtt aat att tta aaa gtt aat gat ctt ttt ata tta ect tet ctt 3809 He Val Asn He Leu Lys Val Asn Asp Leu Phe He Leu Pro Ser Leu 1140 1145 1150 tgg gag ggt atg cea tta gca att tta gaa gca ttg age tgt gga ctt 3857 Trp Glu Gly Met Pro Leu Ala He Leu Glu Ala Leu Ser Cys Gly Leu 1155 1160 1165 cea tgt ata gtc act aat att cea ggt aat aat age tta ata gaa gat 3905 Pro Cys He Val Thr Asn He Pro Gly Asn Asn Ser Leu He Glu Asp 1170 1175 1180 ggc tat aat ggt tgt ttg ttt gaa att aga gat tgt cag tta tta tet 3953 Gly Tyr Asn Gly Cys Leu Phe Glu He Arg Asp cys Gln Leu Leu Ser 1185 1190 1195 1200 ca aaa ate atg tea tat gtt ggt aag cea gaa ctg att gca cag caa 4001 Gln Lys He Met Ser Tyr Val Gly Lys Pro Glu Leu He Wing Gln Gln 1205 1210 1215 tet acc aat gca cga tea ttt att ctg aaa aat tat gga tta gtt aaa 4049 Ser Thr Asn Ala Arg Ser Phe He Leu Lys Asn Tyr Gly Leu Val Lys 1220 1225 1230 aga aat aat aag gtc aga cag tat tat gat aat taaatgaaac cgaaaagtta 4102 Arg Asn Asn Lys Val Arg Gln Leu Tyr Asp Asn 1235 1240 aaaaagaaca ggtttttcaa agtgaaaata aaattacagt ttttttattg caatgattaa 4162 cgtaacatct gcattacatt caagccgcac aaccccgcgg tgaccacccc tgacaggagt 4222 aaacaatgtc aaagcaacag atcggcgtcg tcggtatggc agtgatggga cgcaacctcg 4282 cgctcaacat cgaaagccgt ggttataccg tctctatttt caaccgttcc cgtgaaaaga 4342 cggaagaagt tattgeegaa aatccaggca agaaactggt teettaetat acggtgaaag 4402 agttcgttga atctcttgaa acgcctcgtc gcatcctgtt aatgggttaa agcaggtgca 4462 ggeaeggatg ctgetattga ttccctgaaa ccatatctcg ataaaggcga tatcatcatt 4522 gatgggtggg taatacette tttcaggaca ccattcgtcg taaccgcgag ctttctgcac 4582 44 aaggctttac ttcatcggta cc 4604 < 210 > 23 < 211 > 247 < 212 > PRT < 213 > Escherichia coli < 400 > 23 Val Asp Asp Asn Leu Lys Gly Gln Gly Wing Gly Lys Asn Phe Leu Ser 1 5 10 15 Leu He Lys Tyr Ser Glu Thr Asp Tyr Thr He Tyr Cys Asp Gln Asp 20 25 30 Asp He Trp Leu Glu Asn Lys He Phe Glu Leu Val Lys Tyr Ala Asn 35 40 45 Glu He Lys Leu Asn Val Ser Asp Ala Pro Ser Leu Val Tyr Ala Asp 50 55 60 Gly Tyr Ala Tyr Met Asp Gly Glu Gly Thr He Asp Phe Ser Gly He 65 70 75 80 Being Asn Asn His Wing Asp Gln Leu Lys Asp Phe Leu Phe Phe Asn Gly 85 90 95 Gly Tyr Gln Gly Cys Ser Met Met Phe Asn Arg Ala Met Thr Lys Phe 100 105 110 Leu Leu Asn Tyr Arg Gly Phe Val Tyr Leu His Asp Asp He Thr Thr 115 120 125 Leu Ala Ala Tyr Ala Leu Gly Lys Val Tyr Phe Leu Pro Lys Tyr Leu 130 135 140 Met Leu Tyr Arg Gln His Thr A = n Wing Val Thr Gly He Lys Thr Phe 145 150 155 160 Arg Asn Gly Leu Thr Ser Lys Phe Lys Ser Pro Val Asn Tyr Leu Leu 165 170 175 Ser Arg Lys His Tyr Gln Val Lys Lys Ser Phe Phe Glu Cys Asn Ser 180 185 190 Ser He Leu Ser Glu Thr Asn Lys Lys Val Phe Leu Asp Phe He Ser 195 200 205 45 Phe Cys Glu Ser Asn Asn Lys Phe Thr Asp Phe Phe Lys Leu Trp Arg 210 215 220 Gly Gly Phe Arg Leu Asn Asn Ser Arg Thr Lys Leu Leu Leu Lys Phe 225 230 235 240 Leu He Arg Arg Lys Phe Ser 245 < 210 > 24 < 211 > 261 < 212 > PRT < 213 > Escherichia coli < 400 > 24 Met He Ser He Leu Thr Pro Thr Phe Asn Arg Gln His Thr Leu Ser 1 5 10 15 Arg Leu Phe Asn Ser Leu He Leu Gln Thr Asp Lys Asp Phe Glu Trp 20 25 30 He He He Asp Asp Gly Ser He Asp Ala Thr Ala Val Leu Val Glu 35 40 45 Asp Phe Arg Lys Lys Cys Asp Phe Asp Leu He Tyr Cys Tyr Gln Glu 50 55 60 Asn Asn Gly Lys Pro Met Wing Leu Asn Wing Gly Val Lys Wing Cys Arg 65 70 75 80 Gly Asp Tyr He Phe He Val Asp Ser Asp Asp Ala Leu Thr Pro Asp 85 90 95 Ala He Lys Leu He Lys Glu Be He His Asp Cys Leu Ser Glu Lys 100 105 110 Glu Ser Phe Ser Gly Val Gly Phe Arg Lys Wing Tyr He Lys Gly Gly 115 120 125 He He Gly Asn Asp Leu Asn Asn Ser Glu His He Tyr Tyr Leu 130 135 140 Asn Ala Thr Glu He Be Asn Leu He Asn Gly Asp Val Ala Tyr Cys 145 150 155 160 Phe Lys Lys Glu Ser Leu Val Lys Asn Pro Phe Pro Arg He Glu Asp 46 165 170 175 Glu Lys Phe Val Pro Glu Leu Tyr He Trp Asn Lys He Thr Asp Lys 180 185 190 Ala Lys He Arg Phe Asn He Ser Lys Val He Tyr Leu Cys Glu Tyr 195 200 205 Leu Asp Asp Gly Leu Ser Lys Asn Phe His Asn Gln Leu Lys Lys Tyr 210 215 220 Pro Lys Gly Phe Lys He Tyr Tyr Lys Asp Gln Arg Lys Arg Glu Lys 225 230 235 240 Thr Tyr He Lys Lys Thr Lys Met Leu He Arg Tyr Leu Gln Cys Cys 245 250 255 Tyr Tyr Glu Lys He 260 < 210 > 25 < 211 > 368 < 212 > PRT < 213 > Escherichia coli < 400 > 25 Met Lys He Leu Phe Val He Thr Gly Leu Gly Leu Gly Gly Ala Glu 1 5 10 15 Lys Gln Val Cys Leu Leu Wing Asp Lys Leu Ser Leu Ser Gly His His 20 25 30 Val Lys He He Ser Leu Gly His Met Ser Asn Asn Lys Val Phe Pro 35 40 45 Ser Glu Asn Asn Val Asn Val He Asn Val Asn Met Ser Lys Asn He 50 55 60 Ser Gly Val He Lys Gly Cys Val Arg He Arg A = p Val He Wing Asn 65 70 75 80 Phe Lys Pro Asp He Val His Ser His Met Phe His Wing Asn He He 85 90 95 Thr Arg Leu Ser Val He Gly He Lys Asn Arg Pro Gly He He Ser 100 105 110 47 Thr Ala His Asn Lys Asn Glu Gly Gly Tyr Phe Arg Met Leu Thr Tyr 115 120 125 Arg He Thr Asp Cys Leu Ser Asp Cys Cys Thr Asn Val Ser Lys Glu 130 135 140 Ala Val Asp Glu Phe Leu Arg He Lys Ala Phe Asn Pro Ala Lys Ala 145 150 155 160 He Thr Met Tyr Asn Gly He Asp Thr Asn Lys Phe Lys Phe Asp Leu 165 170 175 Leu Ala Arg Arg Glu He Arg Asp Gly He Asn He Lys Asn Asp Asp 180 185 190 He Leu Leu Leu Wing Wing Gly Arg Leu Thr Leu Wing Lys Asp Tyr Pro 195 200 205 Asn Leu Leu Asn Wing Met Thr Leu Leu Pro Glu His Phe Lys Leu He 210 215 220 He He Gly Asp Gly Glu Leu Arg Asp Glu He Asn Met Leu He Lys 225 230 235 240 Lys Leu Gln Leu Ser Asn Arg Val Ser Leu Leu Gly Val Lys Lys Asn 245 250 255 He Ala Pro Tyr Phe Be Ala Cys Asp He Phe Val Leu Ser Ser Arg 260 265 270 Trp Glu Gly Phe Gly Leu Val Val Wing Glu Wing Met Ser Cys Glu Arg 275 280 285 He Val Val Gly Thr Asp Ser Gly Gly Val Arg Glu Val He Gly Asp 290 295 300 A = p Asp Phe Leu Val Pro He Ser Asp Ser Thr Gln Leu Ala Ser Lys 305 310 315 320 He Glu Lys Leu Ser Leu Ser Gln He Arg Asp His He Gly Phe Arg 325 330 335 Asn Arg Glu Arg He Leu Lys Asn Phe Ser He Asp Thr He He Met 340 345 350 Gln Trp Gln Glu Leu Tyr Gly Thr He He Cys Ser Lys His Glu Arg 355 360 365 48 < 210 > 26 < 211 > 367 < 212 > PRT < 213 > Escherichia coli < 400 > 26 Met Thr Ala Arg Thr Thr Lys Val Leu His Leu Gln Leu Leu Pro Leu 1 5 10 15 Leu Ser Gly Val Gln Arg Val Thr Leu Asn Glu Be Ser Ala Leu Tyr 20 25 30 Thr Asp Tyr Asp Tyr Thr Leu Val Cys Ser Lys Lys Gly Pro Leu Thr 35 40 45 Lys Ala Leu Leu Glu Tyr Asp Val Asp Cys His Cys He Pro Glu Leu 50 55 60 Thr Arg Glu He Thr Val Lys Asn Asp Phe Lys Ala Leu Phe Lys Leu 65 70 75 80 Tyr Lys Phe He Lys Lys Glu Lys Phe Asp He Val His Thr His Ser 85 90 95 Ser Lys Thr Gly He Leu Gly Arg Val Wing Wing Lys Leu Wing Arg Val 100 105 110 Gly Lys Val He His Thr Val His Gly Phe Ser Phe Pro Wing Ala Ser 115 120 125 Ser Lys Lys Ser Tyr Tyr Leu Tyr Phe Phe Met Glu Trp He Wing Lys 130 135 140 Phe Phe Thr Asp Lys Leu He Val Leu Asn Val Asp Asp Glu Tyr He 145 150 155 160 Ala He Asn Lys Leu Lys Phe Lys Arg Asp Lys Val Phe Leu He Pro 165 170 175 Asn Gly Val Asp Thr Asp Lys Phe Ser Pro Leu Glu Asn Lys He Tyr 180 185 190 Be Ser Thr Leu Asn Leu Val Met Val Gly Arg Leu Ser Lys Gln Lys 195 200 205 sp Pro Glu Thr Leu Leu Leu Wing Val Glu Lys Leu Leu Asn Glu Asn 210 215 220 49 Val Asn Val Lys Leu Thr Leu Val Gly Asp Gly Glu Leu Lys Glu Gln 225 230 235 240 Leu Glu Be Arg Phe Lys Arg Gln Asp Gly Arg He He Phe His Gly 245 250 255 Trp Ser Asp Asn He Val Asn He Leu Lys Val Asn Asp Leu Phe He 260 265 270 Leu Pro Ser Leu Trp Glu Gly Met Pro Leu Ala He Leu Glu Ala Leu 275 280 285 Be Cys Gly Leu Pro Cys He Val Thr Asn He Pro Gly Asn Asn Ser 290 295 300 Leu He Glu Asp Gly Tyr Asn Gly Cys Leu Phe Glu He Arg Asp Cys 305 310 315 320 Gln Leu Leu Ser Gln Lys He Met Ser Tyr Val Gly Lys Pro Glu Leu 325 330 335 He Ala Gln Gln Ser Thr Asn Ala Arg Ser Phe He Leu Lys Asn Tyr 340 345 350 Gly Leu Val Lys Arg Asn Asn Lys Val Arg Gln Leu Tyr Asp Asn 355 360 365 < 210 > 27 < 211 > 1272 < 212 > DNA < 213 > Escherichia coli < 220 > < 221 > CDS < 222 > (319) .. (1269) < 220 > < 221 > CDS < 222 > (3) .. (215) < 400 > 27 cc ggg aag falls tcg gcg ctg att gtt gca cat cgt ctg acc acc gcg 47 Gly Lys His Ser Wing Leu He Val Wing His Arg Leu Thr Thr Wing 1 5 10 15 50 cac cgc tgc gat ctg att gcc gtt att gat aag ggg tta ctt gcg gaa 95 Gln Arg Cys Asp Leu He Wing Val He Asp Lys Gly Leu Leu Wing Glu 20 25 30 tac gga acc falls gaa cag ctg tta tet gcg ggc ggc etc tat acc cgc 143 Tyr Gly Thr His Glu Gln Leu Leu Ser Wing Gly Gly Leu Tyr Thr Arg 35 40 45 tta tgg cat gac age gtc age agt act gct etc cat cgc cag falls aac 191 Leu Trp His Asp Ser Val Ser Ser Thr Ala Leu His Arg Gln His Asn 50 55 60 atg aag gag gaa acc ccg gga tag ttactggaca cgtaatgtat taaaaacaca 245 Met Lys Glu Glu Thr Pro Gly 65 70 gtcagaagcg gcggtaccgt gaatagccgc tttaattatt tatactgaca teettaattt 305 ttaaagagta tga atg ctg aac atg caà caa cat etc tet gct ate gcc 354 Met Leu Asn Met Gln Gln His Leu Ser Ala Ala 75 80 age ctc cgc aac ca gt cg gcg ggc falls att gct aac ctt act gac 402 Ser Leu Arg Asn Gln Leu Wing Wing Gly His He Wing Asn Leu Thr Asp 85 90 95 ttc tgg cgc gaa gct gag tcg ctg aat gtt ect ctt gtg acg cea gtc 450 Phe Trp Arg Glu Ala Glu Ser Leu Asn Val Pro Leu Val Thr Pro Val 100 105 110 115 gaa gga gcg gaa gat gag cga gaa gtg acc ttt ctg tgg cgc gcc cga 498 Glu Gly Wing Glu Asp Glu Arg Glu Val Thr Phe Leu Trp Arg Ala Arg 120 125 130 cat ect ctg cag ggc gtt tat ctg cgt ctg aac cgg gtg acg gat aaa 546 His Pro Leu Gln Gly Val Tyr Leu Arg Leu A = n Arg Val Thr Asp Lys 135 140 145 gag falls gta gaa aaa gga atg atg age gcc ctt ccc gaa acg gat ate 594 Glu His Val Glu Lys Gly Met Met Be Ala Leu Pro Glu Thr Asp He 150 155 160 tgg here ctg here ctg cgt tta ccc gca agt tac tgc ggc tech tat tcg 642 Trp Thr Leu Thr Leu Arg Leu Pro Wing Ser Tyr Cys Gly Ser Tyr Ser 165 170 175 ctg ctg gaa ate ccc ccc ggc act acg gct gag acg att gca ctg tec 690 Leu Leu Glu He Pro Pro Gly Thr Thr Ala Glu Thr He Ala Leu Ser 51 180 185 190 195 gga ggc cgt ttt gcc acc ctt gcc gga aag gcc gat ccg cta aac aaa 738 Gly Gly Arg Phe Wing Thr Leu Wing Gly Lys Wing Asp Pro Leu Asn Lys 200 205 210 atg ccg gag ate aac gtt cgg gga aac gca aag gaa tea gtg ctg here 786 Met Pro Glu He Asn Val Arg Gly Asn Ala Lys Glu Ser Val Leu Thr 215 220 225 ctt gat aaa gct ccc gcc ctg tcg gaa tgg aac ggc ggc ttc falls acc 834 Leu Asp Lys Ala Pro Ala Leu Ser Glu Trp Asn Gly Gly Phe His Thr 230 235 240 gga cat ctg ctt acc tec atg cgc att ate gcc ggg aaa tet cgc cag 882 Gly Gln Leu Leu Thr Ser Met Arg He He Wing Gly Lys Ser Arg Gln 245 250 255 gtt cgg etc tat att ccg gat gtt gat att tet cag ccc etc ggg ctg 930 Val Arg Leu Tyr He Pro Asp Val Asp He Ser Gln Pro Leu Gly Leu 260 265 270 275 gtc gtg ctg ccc gat ggt gaa acc tgg ttt gat fall ctt ggc gta tgc 978 Val Val Leu Pro Asp Gly Glu Thr Trp Phe Asp His Leu Gly Val Cys 280 285 290 gcg gca att gac gcc gcc ata aat aat ggg cgc ate gtg ccc gtg gct 1026 Ala Ala He Asp Ala Ala He Asn Asn Gly Arg He Val Pro Val Ala 295 300 305 gta ctg ggc att gac aac att aat gaa cat gaa cgc act gag ata etc 1074 Val Leu Gly He Asp Asn He Asn Glu His Glu Arg Thr Glu He Leu 310 315 320 ggc ggg cgc age aaa ctg ata aag gat ate gcc gga cat ctg ctg ccg 1122 Gly Gly Arg Ser Lys Leu He Lys Asp He Wing Gly His Leu Leu Pro 325 330 335 atg att cgc gct gaa cac ccg cag cgt cag tgg gca gac cgt tcg cgc 1170 Met He Arg Wing Glu Gln Pro Gln Arg Gln Trp Wing Asp Arg Ser Arg 340 345 350 355 here gtg ctg gcc ggg cag age etc ggc ggg ate agt gcg cta atg ggg 1218 Thr Val Leu Wing Gly Gln Ser Leu Gly Gly He Ser Wing Leu Met Gly 360 365 370 gct cgt tac gca ccg gaa acg ttc ggt ctg gtg etc age falls tet ect 1266 Wing Arg Tyr Wing Pro Glu Thr Phe Gly Leu Val Leu Ser His Ser Pro 52 375 380 385 cag tgc 1272 Gln < 210 > 28 < 211 > 70 < 212 > PRT < 213 > Escherichia coli < 400 > 28 Gly Lys His Ser Wing Leu He Val Wing His Arg Leu Thr Thr Ala Gln 1 5 10 15 Arg Cys A = p Leu He Wing Val He Asp Lys Gly Leu Leu Wing Glu Tyr 20 25 30 Gly Thr His Glu Gln Leu Leu Ser Wing Gly Gly Leu Tyr Thr Arg Leu 35 40 45 Trp His Asp Ser Val Ser Ser Thr Ala Leu His Arg Gln His Asn Met 50 55 60 Lys Glu Glu Thr Pro Gly 65 70 < 210 > 29 < 211 > 317 < 212 > PRT < 213 > Escherichia coli < 400 > 29 Met Leu Asn Met Gln Gln His Leu Ser Ala Ala Be Ser Leu Arg Asn 1 5 10 15 Gln Leu Ala Ala Gly His He Ala Asn Leu Thr Asp Phe Trp Arg Glu 20 25 30 Wing Glu Ser Leu Asn Val Pro Leu Val Thr Pro Val Glu Gly Ala Glu 35 40 45 Asp Glu Arg Glu Val Thr Phe Leu Trp Arg Ala Arg His Pro Leu Gln 50 55 60 Gly Val Tyr Leu Arg Leu Asn Arg Val Thr Asp Lys Glu His Val Glu 65 70 75 80 53 Lys Gly Met Met Be Wing Leu Pro Glu Thr Asp He Trp Thr Leu Thr 85 90 95 Leu Arg Leu Pro Wing Ser Tyr Cys Gly Ser Tyr Ser Leu Leu Glu He 100 105 110 Pro Pro Gly Thr Thr Wing Glu Thr He Wing Leu Ser Gly Gly Arg Phe 115 120 125 Wing Thr Leu Wing Gly Lys Wing Asp Pro Leu Asn Lys Met Pro Glu He 130 135 140 Asn Val Arg Gly Asn Wing Lys Glu Ser Val Leu Thr Leu Asp Lys Wing 145 150 155 160 Pro Ala Leu Ser Glu Trp Asn Gly Gly Phe His Thr Gly Gln Leu Leu 165 170 175 Thr Ser Met Arg He He Wing Gly Lys Ser Arg Gln Val Arg Leu Tyr 180 185 190 He Pro Asp Val Asp He Ser Gln Pro Leu Gly Leu Val Val Leu Pro 195 200 205 Asp Gly Glu Thr Trp Phe Asp His Leu Gly Val Cys Ala Wing He Asp 210 215 220 Ala Ala He Asn Asn Gly Arg He Val Pro Val Ala Val Leu Gly He 225 230 235 240 Asp Asn He Asn Glu His Glu Arg Thr Glu He Leu Gly Gly Arg Ser 245 250 255 Lys Leu He Lys Asp He Wing Gly His Leu Leu Pro Met He Arg Wing 260 265 270 Glu Gln Pro Gln Arg Gln Trp Wing Asp Arg Ser Arg Thr Val Leu Wing 275 280 285 Gly Gln Ser Leu Gly Gly He Ser Ala Leu Met Gly Ala Arg Tyr Ala 290 295 300 Pro Glu Thr Phe Gly Leu Val Leu Ser His Ser Pro Gln 305 310 315 54 < 210 > 30 < 211 > 4039 < 212 > DNA < 213 > Escherichia coli < 220 > < 221 > CDS < 222 > (1) .. (285) < 220 > < 221 > CDS < 222 > (370) .. (1326) < 400 > 30 ect tea atg tgg tgg acg cea gaa aga acc agt cga cea ggc ttg ttc 48 Pro Ser Met Trp Trp Thr Pro Glu Arg Thr Ser Arg Pro Gly Leu Phe 1 5 10 15 age gaa acc gat action tea tgg gtg agt gag cat ctg ctt tet gcc cea 96 Ser Glu Thr Asp Thr Ser Trp Val Ser Glu His Leu Leu Ser Ala Pro 20 25 30 ccg cag ggc gta cgt ate age ctg tgc gtg gga tcg ctg gaa ggt tcg 144 Pro Gln Gly Val Arg He Ser Leu Cys Val Gly Ser Leu Glu Gly Ser 35 40 45 here gtg ect falls gtt cag cag ctt falls cag cgg ctg att acc gct ggc 192 Thr Val Pro His Val Gln Gln Leu His Gln Arg Leu He Thr Ala Gly 50 55 60 gtc gaa age cat tgc gca ate tac acc ggt ggt falls gat tac gca tgg 240 Val Glu Ser His Cys Ala He Tyr Thr Gly Gly His Asp Tyr Ala Trp 65 70 75 80 tgg cgc ggt gca ctg att gac ggg att ggt tta cta cag ggt tga 285 Trp Arg Gly Ala Leu He A = p Gly He Gly Leu Leu Gln Gly 85 90 95 gttgacccac aaacactttc aggaaacggt acagacttcc tgaataaatc aaatagtcac 345 ctgcggaaaa ggaataatca tcag atg tat gcc cgc gag tat cgc tea here 396 Met Tyr Ala Arg Glu Tyr Arg Ser Thr 100 cgc ccg cat aaa gcg att ttc ttt cat ctt tet tgc etc acc ctt ate 444 Arg Pro His Lys Wing He Phe Phe His Leu Ser Cys Leu Thr Leu He 105 110 115 120 55 tgt agt gcg ca gtt tat gcg aag ccg gat atg cgg cea ctg ggg ccg 492 Cys Ser Ala Gln Val Tyr Ala Lys Pro Asp Met Arg Pro Leu Gly Pro 125 130 135 aat ata gcc gat aaa ggc tec gtg ttt tac cat ttc age gtc acc tet 540 Asn He Wing A = p Lys Gly Ser Val Phe Tyr His Phe Ser Val Thr Ser 140 145 150 ttc gac tet gtc gat ggc here cgc cat tat cgg gta tgg acg gcc gtg 588 Phe Asp Ser Val Asp Gly Thr Arg His Tyr Arg Val Trp Thr Wing Val 155 160 165 ccg aat acc acc gca ccg gca tcg ggt tac ccg att tta tat atg ctt 636 Pro Asn Thr Thr Ala Pro Wing Ser Gly Tyr Pro He Leu Tyr Met Leu 170 175 180 gac ggt aac gca gtt atg gat cgc ctg gat gac gaa ctg etc aaa ca 684 Asp Gly Asn Wing Val Met Asp Arg Leu Asp Asp Glu Leu Leu Lys Gln 185 190 195 200 ttg tea gaa aaa here ccg cea gtg ate gtg gct gtc ggg tat cag acc 732 Leu Ser Glu Lys Thr Pro Pro Val He Val Val Wing Gly Tyr Gln Thr 205 210 215 aac etc ect ttc gat etc aac age agg gct tac gac tat acg cea gca 780 Asn Leu Pro Phe Asp Leu Asn Ser Arg Ala Tyr Asp Tyr Thr Pro Wing 220 225 230 gca gaa age aga aaa here gat etc fall tea ggg cgt ttt age cgt aag 828 Wing Glu Ser Arg Lys Thr Asp Leu His Ser Gly Arg Phe Ser Arg Lys 235 240 245 agt ggt ggc age aac aac ttc cgc cag tta ctg gaa acg cgt att gcc 876 Ser Gly Gly Ser Asn Asn Phe Arg Gln Leu Leu Glu Thr Arg He Ala 250 255 260 cea aaa gtg gaa cag gga ctg aat ate gat cgg ca cgc cgc ggc tta 924 Pro Lys Val Glu Gln Gly Leu Asn He Asp Arg Gln Arg Arg Gly Leu 265 270 275 280 tgg ggg ac tcc tac ° ctc ttc tg gat tec tgg ctg tec 972 Trp Gly His Ser Tyr Gly Gly Leu Phe Val Leu Asp Ser Trp Leu Ser 285 290 295 tcc tet tac tcc cgg tcg tac age gcc age ccg tcg tgc ggc aga 1020 Ser Ser Tyr Phe Arg Ser Tyr Tyr Ser Wing Pro Pro Ser Leu Gly Arg 300 305 310 56 ggt tat gat gct ttg cta age cgc gtt acg gcg gtt gag ect ctg caa 1068 Gly Tyr Asp Ala Leu Leu Ser Arg Val Thr Ala Val Glu Pro Leu Gln 315 320 325 ttc tgc gcc aaa falls ctg gcg ata atg gaa ggc tcg gcg here cag ggt 1116 Phe Cys Wing Lys His Leu Wing He Met Glu Gly Being Wing Thr Gln Gly 330 335 340 gat aac cgg gaa acg cat gct gtc ggg gtg ctg tcg aaa att cat acc 1164 Asp Asn Arg Glu Thr His Wing Val Gly Val Leu Ser Lys He His Thr 345 350 355 360 acc etc act ata ctg aaa gat aaa ggc gtc aat gcc gta ttt tgg gat 1212 Thr Leu Thr He Leu Lys Asp Lys Gly Val Asn Wing Val Phe Trp Asp 365 370 375 ttc ccc aac ctg gga drops ggg ccg atg ttc aat gee tcc ttt cgc cag 1260 Phe Pro Asn Leu Gly His Gly Pro Met Phe Asn Wing Ser Phe Arg Gln 380 385 390 gca ctg tta gat ate agt ggt gaa aac gca aat tac here gca ggt tgt 1308 Wing Leu Leu A = p He Ser Gly Glu Asn Wing Asn Tyr Thr Wing Gly Cys 395 400 405 cat gag tta age falls taa acactgcccg cttttacgcg ggcagtacgc 1356 His Glu Leu Ser His 410 ctgaaacact acgatcagaa tgatgcggta actccggcat agtaagcccg gcctggctcg 1416 ttataggtat tcgccccttc agaagategg aagatctgtt tattgaggat attactgacg 1476 cegacattaa gaegeagatt tttattaata tcgtaattga agttcgtccc caccagtgaa 1536 taagcgccca gctctttacc tgacagaccg ccagtatctt cactgcgggt ttccgcatga 1596 gtacgcggtt tttgtctgcc atataacgtc cagttgacgc tggcagaaaa cgcctgggtg 1656 atggtecagt taagcgagtt attgatagta tatttcggga tgaccgacag aggattaccg 1716 gtgtcttttt gctccgaagt gatcatecat gtggcattgg tattecagtt cagaegatet 1776 ttcaccagtg ggaaagacat actggctlcg ataccgtcca ecagagettt cccgccattc 1836 tgccacttga ggatatatgc gcctgaagcg gtttgcccga taacgttatc cccggccacg 1896 atcttattct ggtaatcatt gcggaagtag gtcacacttg cgtggtaatc ttcccaggtg 1956 57 aactccagcc caatttcttt attgacgctg atttccggat cgagatcttt attaccgatc 2016 aggtagcacc cgcctgatgt aatatctttt ggacagccat tgcctttcga gtagagcaga 2076 tagccttcac tggattgata caggtttggg gctttaaagg ttcgggcaac ccctgctttg 2136 actttgaaat aatcgcccaa ttcctgcgaa agattcagac tggggctgaa gttcccgccg 2196 gataatcaaa gagtcgctga gcgcaggccg ggaatgatat tcgtgccagg aaccggctca 2256 atgttatctt caatatacag cgcactgatt tgagaatgat ttttactgct gcgatccgca 2316 gcagagccag aaataccgct gatatcactg tcattcaccg tcaggctggt agaggaagga 2376 tcatcgagct tatcgcggtt ccactctgca ccaacggtca gcgtttgatc aaccatcaca 2436 ttcaaaggaa tattaagctc gccgctggtt cgccaggaac tcaggcgatt ggtcgtaaac 2496 ttttcacccg ctaaaatacg cccttcacca atccttcatt ccgccggata catgcgggta 2556 ttattggttt tctcgtaata aacaccaaag cgactttgtc gataccatta cccagtccca 2616 tgcgtaatgc cataattctg tcggtacagg cggctcgtct cttgtgccgg attttgccag 2676 gcttttcggt aactgcactg gaagaactgt tttgcgtatc gccgcataga tattcccctg 2736 gcggctatat ccggcttcga aatcgagaat ctgctgcgga tttaatttcc acgagacaac 2796 gc cgttaata tctttgttac gtaccccttc atgcccggct gcgtttttcg taccgaccgg 2856 tcccaactgt agaattaata cagcatccgt tttattcaga ttaccataca aacgcgtggt 2916 aagagcatta ccagccagag gcccactaag gctgaaattg gcgcgacgcg tagcgccctc 2976 atcgctactt tccggctgat tggtgtataa cgacagcgaa ccgtgccagt cgttggtggg 3036 atgatgttca acgtttggta ccaccccccc ggctgccccc gaaccgtagc gcgccgccgc 3096 agggccgcgg atcacttcaa tacgctcaac ctgttccggt ggcacccagt tggtgtcacc 3156 gcgggtatca cgctctccac gccagctata acgcacggag ttacgtgacg tcaccggtac 3216 accatcaatt aaaattaagg tgttttccgg ccccatacca cgaatatcga tctggcggtt 3276 gttaccgcgt gtgcccgagg cgctattgcc ggtaagattg acgccaggca ttttacgaat 3336 aatatctgaa aggtegttta ccggaggggt etttttaata tcctcgctgg taataacega 3396 58 cacgcccggc tgetgtttta atacctgctc agcggtggct tccaccacca gagtctcgtc 3456 attatcatcg tcggaggatt tggctactga tacctggcta ttcaacccaa ccaggagcac 3516 agttagcgac cagaggattt tgttaattct cataectatt cectaataaa tgcetaactt 3576 aaaatgtttg atcgttaagc tcacatcctt gccagatatt ttttactgcc attattgttt 3636 ttatataaga atgataatta atatcattta gcaaaagaaa aagcaatccc tcacaagata 3696 aatatatega tttttcataa atatcaaatt gatatataac atatgttttt tatttcattg 3756 tacttcagtc aaataaattt ctgaagcact gctagtagtg ccagttcagc tttctttttg 3816 actcattccg gcaaagtcag taccgttcat cttttgtact gatgttgcca ctggaaaatc 3876 ggtgcgcttg tegatcateg ggaattttgt cacaatttct aacggatagt gttcacattg 3936 tttctaacct gcattttcag acacgggcgc tgcttatgta tataagatca gcatcactag 3996 gtctttctgc aacactactg ctttcaacaa ggtcaggcat ttc 4039 < 210 > 31 < 211 > 94 < 212 > PRT < 213 > Escherichia coli < 400 > 31 Pro Ser Met Trp Trp Thr Pro Glu Arg Thr Ser Arg Pro Gly Leu Phe 1 5 10 15 Ser Glu Thr Asp Thr Ser Trp Val Ser Glu His Leu Leu Ser Wing Pro 20 25 30 Pro Gln Gly Val Arg He Ser Leu Cys Val Gly Ser Leu Glu Gly Ser 35 40 45 Thr Val Pro His Val Gln Gln Leu His Gln Arg Leu He Thr Ala Gly 50 55 60 Val Glu Ser His Cys Wing He Tyr Thr Gly Gly His Asp Tyr Wing Trp 65 70 75 80 Trp Arg Gly Ala Leu He Asp Gly He Gly Leu Leu Gln Gly 85 90 59 < 210 > 32 < 211 > 318 < 212 > PRT < 213 > Escherichia coli < 400 > 32 Met Tyr Ala Arg Glu Tyr Arg Ser Thr Arg Pro His Lys Wing He Phe 1 5 10 15 Phe His Leu Ser Cys Leu Thr Leu He Cys Ser Wing Gln Val Tyr Wing 20 25 30 Lys Pro Asp Met Arg Pro Leu Gly Pro Asn He Wing Asp Lys Gly Ser 35 40 45 Val Phe Tyr His Phe Ser Val Thr Ser Phe Asp Ser Val Asp Gly Thr 50 55 60 Arg His Tyr Arg Val Trp Thr Ala Val Pro A = n Thr Thr Ala Pro Wing 65 70 75 80 Ser Gly Tyr Pro He Leu Tyr Met Leu Asp Gly Asn Wing Val Met Asp 85 90 95 Arg Leu Asp Asp Glu Leu Leu Lys Gln Leu Ser Glu Lys Thr Pro Pro 100 105 110 Val He Val Val Ala Gly Tyr Gln Thr Asn Leu Pro Phe Asp Leu Asn 115 120 125 Ser Arg Ala Tyr Asp Tyr Thr Pro Wing Ala Glu Ser Arg Lys Thr Asp 130 135 140 Leu His Ser Gly Arg Phe Ser Arg Lys Ser Gly Gly Ser Asn Asn Phe 145 150 155 160 Arg Gln Leu Leu Glu Thr Arg He Wing Pro Lys Val Glu Gln Gly Leu 165 170 175 Asn He Asp Arg Gln Arg Arg Gly Leu Trp Gly His Ser Tyr Gly Gly 180 185 190 Leu Phe Val Leu Asp Ser Trp Leu Ser Ser Ser Tyr Phe Arg Ser Tyr 195 200 205 Tyr Ser Ala Ser Pro Ser Leu Gly Arg Gly Tyr Asp Ala Leu Leu Ser 210 215 220 60 Arg Val Thr Ala Val Glu Pro Leu Gln Phe Cys Ala Lys His Leu Ala 225 230 235 240 He Met Glu Gly Be Wing Thr Gln Gly Asp Asn Arg Glu Thr His Wing 245 250 255 Val Gly Val Leu Ser Lys He His Thr Thr Leu Thr He Leu Lys Asp 260 265 270 Lys Gly Val Asn Wing Val Phe Trp Asp Phe Pro Asn Leu Gly His Gly 275 280 285 Pro Met Phe A = n Wing Being Phe Arg Gln Wing Leu Leu Asp He Ser Gly 290 295 300 Glu Asn Wing Asn Tyr Thr Wing Gly Cys His Glu Leu Ser His 305 310 315 < 210 > 33 < 211 > 3292 < 212 > DNA < 213 > Escherichia coli < 400 > 33 ccgctgcggt tgattgccgg atgcggcgtg aacgccttat ccggcctaca atcattgcaa 60 attcaataaa ttgcagcgtt ctgtaggctg gataagatgc gtcagcatcg catccggcaa 120 aggcagatct cagcgatagc gccggcttag tcagatttaa tctgcgcgcg tggtggatat 180 tttttcagga tctccatata cgcgtgcatt tcggtctgta gcggtacacc catcggaata 240 tggcgcacgc cgatagagtc gctttcctgc ggatcggtgt acaggttaaa caccgacgat 300 cccgccgttt gcattactgt gccggtgaat ccaccctgat atccgctctg ggtataagcg 360 taaggttgct gaatcaggac gtgatacttg aactcatcca tacgcacagc agaaagttta 420 ccgttgagga agtagtgctc ggccttacgg ttagactgae catttgttcc caggaagaaa 480 gatgtctggt ccacaccatc gataaaggtg gttttcggta ctaaattcgc tactttcgcc 540 ccaggatgcc ccgccagatc cagcgccgta gggaagagat ccgccagatc gacaatgccg 600 tcagatttac gcggttggat catgccttte cagtaaacga aggttggcac acgaacgccg 660 ccttcccatg tagaaccttt cgcaccgcgg aacggcgtgc gaccgtgcgg cggaacttcg 720 cgttatcaga gcttccggac ggtgaacaca atcagcgtgt tatcaagctg accgtttttc 780 tccagtgctt tatacagatt ggcgaagata tcgttcatct gtcgccgtaa ccaccatgca 840 gaggtgcgcg ccggaga gct acccgcatat ttggcgttcg ggtagttatc ga.agtgacag 900 ccgcgagtgc cgtaatagag gaagaaaggc ttatcactct tcgccatctt gtcgaggaac 960 ttaacgccat attccatcca gcgttgatcc aggtcttcca tatatttcgg cgtaatgtcg 1020 gcaatggctt cctgttcacc gccgcgcacg gcatgaacgt catctttgct gaacggcagt 1080 ttttggatgt attcagaacg agccgggctc agggcgactt ccgggttgac atgaacatca 1140 cgccattcgg tatacatatc ggacaccgag ttaaagccgc ggaaatcatc aaagccaacg 1200 ttctgcggct gcgactcttt gttttcccce atatgecatt tcccgatagc ctgggtgacg 1260 tagccctgat cgtgcagcaa ctgcggcagc gtggttaacc cttgcagacc gcccggttgc 1320 61 ccgtacattg gcggeatcag aatgccgtgg tggatggagt attgtccggt gagaatcgtg 1380 gcgcgggttg gggaagaact cggttgagaa tacgccgaag ttaaaatcag cccctggctg 1440 gcaacggcgt cgatatctgg tgtagggtta cccaccgcca gttaaagcca cgccgccgce 1500 acgtccatcc agcccacatc atccagcaag aaaacaacca cgttcggttt cttaccggtt 1560 tttttctcaa gttccgccag tttctgctgg gtttctttgt cttgtgcagg atgctgcatc 1620 accggcatca tgttgtcggc aatagtggtc gccggtttaa ccagatactg gtttgggtga 1680 tcgtatccgg caaagccttt gcgtgcggtg gcggttgacg gggtatctgc tgcgctggct 1740 atgagaggaa gagctgcggc gacageaaca acaagaagtt tgggtgaaaa cgaaaattcc 1800 atgcaaaatg ctccggtttc atgtcgtcaa aatgatgacg taattaagea ttgataattg 1860 agatccctct ccctgacagg atgattgcat aaataatagt gataaaaata aattatttat 1920 atgaattgaa ttatccagaa aaatcaggag agcattttca atcctacctc tggcgcaggt 1980 gatattgtaa ggcggtgatg ttatatcgcg ttgattattg atgctgtttt tagttttaac 2040 ggeattaata tatatgttat taattgaatg acttttatta ttcattatat atatgtgtag 2100 aattgtgcgc aggagaaata ttcactcagg aagttattac tcaggaagca aagaggatta 2160 cagaattatc tcataacaag tgttaaggga tgttatttcc cggttctctg tggcataata 2220 aacgagtaga tgctcattcc atetettatg ttcgccttag tgcctcataa actccggaat 2280 gaegeagage cgtttacggt gcttatcgtc cactgacaga tgtcgcttat gcctcatcag 2340 acaccatgga cacaacgttg agtgaagcac ccacttgttg tcatacagac ctgttttaac 2400 gcctgctccg taataagage aggcgttttt ttatatatca gaaaggcccc ggaggtgctt 2460 gcctccgggt gagaaagage tactgtggcg ggttgttctg caacgttaae atcaaaccgt 2520 cgcgacgcat cgctgcggct tcttccggct tgtgcagtct gtccagcgcg tcggcaagcc 2580 atgcgtaatc gtaggcgtcc ggacgttgtt tcagcgctgc gcggaaggcg agcgatgctt 2640 cctgcc attc tccgtgtttc atcagcgact gacccagtgt gctccacaac agcgggcgat 2700 cgcccacgtt tttgatctgt tggcgcagca ctttttccaa gctgttcccg ggttaattgg 2760 gttttcagac gccggggatc ggeageagea ggccgaatcg tcatactggc gtttcaggcc 2820 ategatgata atttgctggg cagtatcatg atcgtcacac tcaataagat gttccgccat 2880 tgccacctgc aaggccacct gatgacgcgt tttccggctt tggtttttcc accagttacg 2940 caaaccttca ctaccgttat cggcacgcgc ctgatccatc aggccaatcc atgcctgttg 3000 ttecageatt gcacgatgtt cttcatcacc aacatgggct ttcgccattg atgggatgat 3060 atccagcagc gaactccatg caccagtgcg gatataegee tgttccgcca gacgtaatac 3120 ttccggatgg cgtggcgtaa cttccagcag cttatccacg ccgtgacgtg ctgcatggtt 3180 ttcattacgg gccagttgca gacgtacacg ggtgatttet accggaatgg tgtcattgcc 3240 ggccagctcc gctgcgcgtt ccagatgttg gttggcgcgt gcttcagcat 3292 cc < 210 > 34 < 211 > 11165 < 212 > DNA < 213 > Escherichia coli < 220 > < 221 > CDS < 222 > (3791) .. (4834) < 220 > < 221 > CDS < 222 > (10459) .. (10776) 62 < 220 > < 221 > CDS < 222 > (10134) .. (10427) < 220 > < 221 > CDS < 222 > (9836) .. (10081) < 220 > < 221 > CDS < 222 > (7816) .. (9480) < 220 > < 221 > CDS < 222 > (4878) .. (7802) < 220 > < 221 > CDS < 222 > (3460) .. (3702) < 220 > < 221 > CDS < 222 > (3054) .. (3407) < 220 > < 221 > CDS < 222 > (2613) .. (3041) < 220 > < 221 > CDS < 222 > (2198) .. (2530) < 220 > < 221 > CDS < 222 > (1939) .. (2196) < 220 > < 221 > CDS < 222 > (1573) .. (1893) < 220 > < 221 > CDS < 222 > (1102) .. (1485) < 220 > < 221 > CDS < 222 > (2) .. (1099) 63 < 400 > 34 c age gat atg cag cgc ggt ate cag gct gca acg gct gca ctt cag ggc 49 Being Asp Met Gln Arg Gly He Gln Wing Wing Thr Wing Wing Leu Gln Gly 1 5 10 15 ctg gtg ggc ggc aat atg gca ggc gcg ctg gca ggt gct tea gcg ccg 97 Leu Val Gly Gly Asn Met Wing Gly Wing Leu Wing Gly Wing Being Wing Pro 20 25 30 gag ctg gcg aac ate ate ggt cat fall gcg ggt att gat gac aat here 145 Glu Leu Wing Asn He He Gly His His Wing Gly He Asp Asp Asn Thr 35 40 45 gcg gca aaa gcc att gcc cat gcc att ggt ggt gtg here gca gcc 193 Wing Wing Wing Wing Wing His Wing He Leu Gly Wing Val Thr Wing Wing 50 55 60 ctt cag ggc aac agt gcg gca gca ggc gca att ggt gcg ggt act ggt 241 Leu Gln Gly Asn Ser Ala Ala Ala Gly Ala He Gly Ala Gly Thr Gly 65 70 75 80 gaa gtg ate gcg tea gcc att gcg aaa age etc tac ccg ggc gta gat 289 Glu Val He Wing Wing Wing Wing Wing Lys Ser Leu Tyr Pro Gly Val Asp 85 90 95 ccg tcg aaa ctg here gaa gat cag aag caa act gta age acg ctg gca 337 Pro Ser Lys Leu Thr Glu A = p Gln Lys Gln Thr Val Ser Thr Leu Ala 100 105 110 acg ctg tea gcg ggt atg gcc ggc ggc att gcc agt ggc gat gtg gct 385 Thr Leu Ser Wing Gly Met Wing Gly Gly He Wing Ser Gly Asp Val Wing 115 120 125 ggc acg gct gct gg gct ggt gcc ggg aag aac gtt gtt gag aat aat 433 Gly Wing Wing Wing Gly Wing Gly Wing Gly Lys Asn Val Val Glu Asn Asn 130 135 140 gcg ctg agt ctg gtt gcc aga ggc tgt gcg gtc gca gca ect tgc agg 481 Wing Leu Ser Leu Val Wing Arg Gly Cys Wing Val Wing Wing Pro Cys Arg 145 150 155 160 act aaa gtt gg gag cag ttg cta gaa ate ggg gcg aaa gcg ggc atg 529 Thr Lys Val Wing Glu Gln Leu Leu Glu He Gly Wing Lys Wing Gly Met 165 170 175 gcc ggg ctt gcc ggg gcg gca gtc aag gat atg gcc gac agg atg acc 577 Wing Gly Leu Wing Gly Wing Wing Val Lys Asp Met Wing Asp Arg Met Thr 180 185 190 64 tcc gat gaa ctg gag cat ctg att acc ctg caa atg atg ggt aat gat 625 Being Asp Glu Leu Glu His Leu He Thr Leu Gln Met Met Gly Asn Asp 195 200 205 gag ate act act aag tat etc agt tcg ttg cat gat aag tac ggt tcc 673 Glu He Thr Thr Lys Tyr Leu Ser Ser Leu His Asp Lys Tyr Gly Ser 210 215 220 ggg gct gcc tcg aat ccg aat ate ggt aaa gat ctg acc gat gcg gaa 721 Gly Ala Ala Ser Asn Pro Asn He Gly Lys Asp Leu Thr Asp Ala Glu 225 230 235 240 aaa gta gaa ctg ggc ggt tcc ggc tea gga acc ggt here cea cea cea 769 Lys Val Glu Leu Gly Gly Ser Gly Ser Gly Thr Gly Thr Pro Pro Pro 245 250 255 tcg gaa aat gat ect aag cag aat gaa aaa act gta gat aag ctt 817 Ser Glu Asn Asp Pro Lys Gln Gln Asn Glu Lys Thr Val Asp Lys Leu 260 265 270 aat cag aag caa gaa agt gcg att aag aag ate gat aac ata ata aaa 865 Asn Gln Lys Gln Glu Be Ala He Lys Lys He Asp Asn Thr He Lys 275 '280 285 aat gct ctg aaa gat cat gat att att gga act etc aag gat atg gat 913 Asn Ala Leu Lys Asp His Asp He He Gly Thr Leu Lys Asp Met Asp 290 295 300 ggt aag cea gtt ect aaa gag aat gga gga tat tgg gat cat atg cag 961 Gly Lys Pro Val Pro Lys Glu Asn Gly Gly Tyr Trp Asp His Met Gln 305 310 315 320 gaa atg caa aat acg etc aga gga tta aga aat cat gcg gat acg ttg 1009 Glu Met Gln Asn Thr Leu Arg Gly Leu Arg Asn His Wing Asp Thr Leu 325 330 335 aaa aac gtc aac aat ect gaa gct cag gct gcg tat ggc aga gca here 1057 Lys Asn Val A = n Asn Pro Glu Ala Gln Ala Ala Tyr Gly Arg Ala Thr 340 345 350 gat gct att aat aaa ata gaa tea gcc ttg aaa gga tat gga at atg 1104 Asp Ala He Asn Lys He Glu Be Ala Leu Lys Gly Tyr Gly Met 355 360 365 att acc tta cgt aaa ttg att gga aac ate aat atg here aaa gag ect 1152 He Thr Leu Arg Lys Leu He Gly Asn He Asn Met Thr Lys Glu Pro 370 375 380 65 gag caá tea ccg ctt gaa etc tgg ttc gaa cgt ate ata gtg gat 1200 Glu Gln Gln Ser Pro Leu Glu Leu Trp Phe Glu Arg He He Asp Val 385 390 395 ect ctt gaa aag tta here gtg gaa gat ctt tgc cgc gct ate cga caa 1248 Pro Leu Glu Lys Leu Thr Val Glu Asp Leu Cys Arg Ala He Arg Gln 400 405 410 415 aat tta tgt att gat cag ttg atg cea aga gtg ttg gaa gtt act cta 1296 Asn Leu Cys He Asp Gln Leu Met Pro Arg Val Leu Glu Val Leu Thr 420 425 430 aaa gag ccg tta gcg ggt gaa tat tac gat ggt gaa cta att gca gct 1344 Lys Glu Pro Leu Wing Gly Glu Tyr Tyr Asp Gly Glu Leu He Wing Wing 435 440 445 tta tea acg ata aaa gga gaa gat cta aaa gat cag aaa agt acc ttt 1392 Leu Ser Thr He Lys Gly Glu Asp Leu Lys Asp Gln Lys Ser Thr Phe 450 455 460 acc ata ata agg caa ctt ata aac cag cta gaa ccg tea gat att aac 1440 Thr Gln He Arg Gln Leu He Asn Gln Leu Glu Pro Ser Asp He Asn 465 470 475 gat gat tta aga aaa gat ata ata tta aaa ate aat cag ata att gta 1485 Asp Asp Leu Arg Lys Asp He Leu Lys He Asn Gln He He Val 480 485 490 taactaatec cggccactga gccgagatct tetttgtgtg ccgggcatgt tcagcagctt 1545 gggggtgaaa gtcccctgtc cagectg atg gtg gcg aag gcg ttc gcg tac gca 1599 Met Val Ala Lys Ala Phe Ala Tyr Ala 495 500 ctt aac cag tgg ccg gca ctg acg tac tat gcg aac gat ggc tgg gtg 1647 Leu Asn Gln Trp Pro Wing Leu Thr Tyr Tyr Wing Asn Asp Gly Trp Val 505 510 515 gaa ate gac aac aac ate gct gaa aat gcc ctg cgg gcg gtc agt ctg 1695 Glu He Asp Asn Asn He Wing Glu Asn Wing Leu Arg Wing Val Ser Leu 520 525 530 535 ggt cgt aaa aac ttc ctg ttc ttc ggc tet gac cat ggt ggt gag cgg 1743 Gly Arg Lys Asn Phe Leu Phe Phe Gly Ser Asp His Gly Gly Glu Arg 540 545 550 gga gcg cta ctg tac age ctg ate ggg acg tgc aaa ctg aat gac gtg 1791 66 Gly Ala Leu Leu Tyr Ser Leu He Gly Thr Cys Lys Leu Asn Asp Val 555 560 565 gat cea gaa age tac ctt cgc cat gtg ctt gcc gtc ata gca gac tgg 1839 A = p Pro Glu Ser Tyr Leu Arg His Val Leu Ala Val He Wing Asp Trp 570 575 580 ccg gtc aac cgg gtc age gaa ctg ctt cg tgg cgc ata gca ctg cea 1887 Pro Val Asn Arg Val Ser Glu Leu Leu Pro Trp Arg He Ala Leu Pro 585 590 595 gct gaa taacacatcc ccgtcaatac ggccctcgct gtaegettac agaaa atg ctg 1944 Wing Glu Met Leu 600 atg tet gta cag aaa gaa aag aac gtc gca gag agt gtg gta tet gaa 1992 Met Ser Val Gln Lys Glu Lys Asn Val Ala Glu Ser Val Val Ser Glu 605 610 615 acg cat acc ggc gac age gta tat gct tcc ctg ttt gaa aaa att aac 2040 Thr His Thr Gly Asp Ser Val Tyr Ala Ser Leu Phe Glu Lys He Asn 620 625 630 635 ctg aat ccg gta tet gcc ctg agt gca ctg gat aac ect ttc cgg tea 2088 Leu Asn Pro Val Ser Ala Leu Ser Ala Leu Asp Asn Pro Phe Arg Ser 640 645 650 gca gat aac gcg act ggc aga att acc tcc age ata cac ect gcg gtg 2136 Wing Asp Asn Wing Thr Gly Arg He Thr Ser Ser He Gln Pro Wing Val 655 660 665 cag tgc gca gct gct gca gca act gag ggt tet tgt ccc cgg cca cacc tcc 2184 Gln Cys Ala Ala Ala Ala Ala Thr Glu Gly Ser Cys Pro Arg Gln Ser 670 675 680 ccg tgt tea gga to atg gtg gat aac tgg cag aag agt gta agg agt cgt 2233 Pro Cys Ser Gly Met Val Asp Asn Trp Gln Lys Ser Val Arg Ser Arg 685 690 695 gcg etc ccg gaa gac gcg atg acg ggc tgg aac gaa ggc atg ate cgc 2281 Wing Leu Pro Glu Glu Wing Met Thr Gly Trp Asn Glu Gly Met He Arg 700 705 710 715 tta cag cag ttg gct gag cgc ctg aac cgt cag gat gaa cag cgg gga 2329 Leu Gln Gln Leu Wing Glu Arg Leu Asn Arg Gln Asp Glu Gln Arg Gly 720 725 730 aaa tac atg acg gtc agt gaa ctg aaa acg gag gtg ttt ggc ate atg 2377 67 Lys Tyr Met Thr Val Ser Glu Leu Lys Thr Glu Val Phe Gly He Met 735 740 745 cag gct ttt aac cgg cat ate ccg gcg gaa gag- cag tta cgt cgc tac 2425 Gln Wing Phe Asn Arg His He Pro Wing Glu Glu Gln Leu Arg Arg Tyr 750 755 760 ggt gaa gtc cgt aac cag aat ggc agt gaa cag cag caa aaa cag gct 2473 Gly Glu Val Arg Asn Gln Asn Gly Ser Glu Gln Gln Gln Lys Gln Wing 765 770 775 gaa atg gcg cta aat cag tta att aac cgt tat cag atg ata cgt gca 2521 Glu Met Ala Leu Asn Gln Leu He Asn Arg Tyr Gln Met He Arg Ala 780 785 790 795 ggc aaa caà tagtggtagc cataatgcag gagcaaagcc tgaatcagga 2570 Gly Lys Gln agagttattc tgactgagtt tggttttctg gcgattcttg tg atg gtg gga tgt 2624 Met Val Gly Cys 800 gct tgg tta gct gaa cag gcc ttt tc gac cat gcg ctt tea cea falls 2672 Wing Trp Leu Wing Glu Gln Wing Phe Ser A = p His Wing Leu Ser Pro His 805 810 815 agt gct tgg ccg tac agt gca tcg cgc gat gcc ggg ctg gcc gat acg 2720 Ser Wing Trp Pro Tyr Ser Wing Ser Arg Asp Wing Gly Leu Wing Asp Thr 820 825 830 ggc gcg ggc ggc tat ccc act tgt aaa cag cgg tgg gcc gac gac acc 2768 Gly Wing Gly Gly Tyr Pro Thr Cys Lys Gln Arg Trp Wing Asp Asp Thr 835 840 845 850 gtt ggg ctg aaa gcc cgt cta ct g ca ctt ect gee cta gat ate tgg 2816 Val Gly Leu Lys Wing Arg Leu Leu Gln Leu Pro Wing Leu Asp He Trp 855 860 865 acg gcg ttt aaa aaa ate gae cag tcg cag gta gtg tat gaa gag gcc 2864 Thr Wing Phe Lys Lys He Asp Gln Ser Gln Val Val Tyr Glu Glu Wing 870 875 880 gtg ctg cgc tcg cgg gtc agt gaa cga aat atg cag gta tcg cag aat 2912 Val Leu Arg Ser Arg Val Ser Glu Arg Asn Met Gln Val Ser Gln Asn 885 890 895 ggg cgc gtt tat cea age tat ggc ggt aac gtt gat ggc acc gtc gcc 2960 Gly Arg Val Tyr Pro Ser Tyr Gly Gly Asn Val Asp Gly Thr Val Wing 68 900 905 910 aat gcc gcc acc cgg ttg gca tcc ggc gct aga aat agate etc ggc age 3008 Asn Wing Wing Thr Arg Leu Wing Ser Gly Wing Arg Asn He Leu Gly Ser 915 920 925 930 ata gcg gca tgt acg gca ttc gac age gtg cgt taggcactac cg atg gta 3059 Wing Wing Cys Thr Wing Phe A = p Ser Val Arg Met Val 935 940 cag gcg cag ctg caa ata gcg ctg gtg ate tgt att ccg ctg ata acg 3107 Gln Ala Gln Leu Gln He Ala Leu Val He Cys He Pro Leu He Thr 945 950 955 cte tgt tcg gcg tgg gat gtg aaa gta gtg atg acg ctg acg ttt gtg 3155 Leu Cys Ser Wing Trp Asp Val Lys Val Val Met Thr Leu Thr Phe Val 960 965 970 975 cag ttt gca cta ttt ttc etc acc ttt tgg tgg gag ctg gca cgg tgg 3203 Gln Phe Wing Leu Phe Phe Leu Thr Phe Trp Trp Glu Leu Wing Arg Trp 980 985 990 ctt gat age tgg ctg ctg gat gtg etc tac aac age gat acc falls agt 3251 Leu A = p Ser Trp Leu Leu Asp Val Leu Tyr Asn Ser Asp Thr His Ser 995 1000 1005 age tgg aat tta gcc ggg ate cag aat acg cag gat gac gtg att ate 3299 Ser Trp Asn Leu Wing Gly He Gln Asn Thr Gln Asp Asp Val He He 1010 1015 1020 aat ctg gtg atg agg ttg atg ttt ctg gtg ttg ccg here ttc tgg ctg 3347 Asn Leu Val Met Arg Leu Met Phe Leu Val Leu Pro Thr Phe Trp Leu 1025 1030 1035 ggg gcg atg acg tgg gct gg gg gg ggg ggg gg gg gtg aat gga 3395 Gly Ala Met Thr Trp Gly Wing Val Gg Val Gly Val Wing Ala Leu Asn Gly 1040 1045 1050 1055 gcg ctg gcg gga tgattgggag gtgattcgcc aatctcactt tcctatacac 3447 Ala Leu Ala Gly atataaaatg ta atg aaa tat etc ttt ttt gag aat ata cat tet ata ttt 3498 Met Lys Tyr Leu Phe Phe Glu Asn He His Ser He Phe 1060 1065 1070 tta here ttc agt etc ttc cga here tet gtg tcg ect gat ttc cea atg 3546 Leu Thr Phe Ser Leu Phe Arg Thr Ser Val Pro Pro Asp Phe Pro Met 1075 1080 1085 69 att ttt gca ttg ccc tea ate att tta ggt caa ttt acg acc aac caa 3594 He Phe Ala Leu Pro Ser He He Leu Gly Gln Phe Thr Thr Asn Gln 1090 1095 1100 tta act aac ttt gtg ata tgt atg ggt aac acc gtt gaa cgt cgg ctg 3642 Leu Thr Asn Phe Val He Cys Met Gly Asn Thr Val Glu Arg Arg Leu 1105 1110 1115 1120 gtt gtt gtt cat aat ccc ttt aaa agg tet ggg gat ggc cat gac etc 3690 Gly Val Val His Asn Pro Phe Lys Arg Ser Gly Asp Gly His A = p Leu 1125 '1130 1135 agg gcg gta gcg tgaccaaagt teatatecat accaattatt tttatttaaa 3742 Arg Ala Val Ala 1140 atatcaactt attegagttg ttttatttag ttcaaagaag gtatcaaa ttg ata gtt 3799 Leu He Val ata gat ttt ttt tgt ggc tgt ggt gga gcc agt gaa ggg cta cgt cag 3847 He Asp Phe Phe Cys Gly Cys Gly Gly Wing Ser Glu Gly Leu Arg Gln 1145 1150 1155 gct ggc ttt gat ate gag ctt gga tta gat att gac caá gá tea 3895 Wing Gly Phe Asp He Glu Leu Gly Leu Asp He Asp Gln Gln Wing Ser 1160 1165 1170 1175 gaa here ttt aaa gct aat ttc ect gat gca aaa ttc ate ca a gat gat 3943 Glu Thr Phe Lys Wing Asn Phe Pro Asp Wing Lys Phe He Gln Asp Asp 1180 1185 1190 att agg aaa ate gaa ect caa gat ate tcc gac ate att gat att aaa 3991 He Arg Lys He Glu Pro Gln Asp He Ser Asp He He Asp He Lys 1195 1200 1205 gct aaa cgg ect ttg tta ctg agt gca tgt gca cea tgt caa cea ttt 4039 Wing Lys Arg Pro Leu Leu Leu Wing Wing Cys Wing Pro Cys Gln Pro Phe 1210 1215 1220 tcg caa cag aat aaaaat aaa act agt gac gac tea agg aga aat cta 4087 Ser Gln Gln Asn Lys Asn Lys Thr Ser Asp Asp Ser Arg Arg Asn Leu 1225 1230 1235 cta aat gaa act cat cgt ttt att aga gaa ctt ctt ect gaa tat att 4135 Leu Asn Glu Thr His Arg Phe He Arg Glu Leu Leu Pro Glu Tyr He 1240 1245 1250 1255 70 atg ctt gaa aat gtt ect gga atg caa aaa att gat gaa gaa aaa gaa 4183 Met Leu Glu Asn Val Pro Gly Met Gln Lys He Asp Glu Glu Lys Glu 1260 1265 1270 ggc cea ttt cag gag ttt att aag cta ctt aaa gag tta gag tat aac 4231 Gly Pro Phe Gln Glu Phe He Lys Leu Leu Lys Glu Leu Glu Tyr Asn 1275 1280 1285 tat ata tet ttt ata gcc aat gct gag aac tat ggg att ccc ca aga 4279 Tyr He Ser Phe He Wing Asn Wing Glu Asn Tyr Gly He Pro Gln Arg 1290 1295 1300 aga aga aga etc gtg etc tta gct agt cga gta ggt aaa gtt acc tab 4327 Arg Lys Arg Leu Val Leu Leu Wing Ser Arg Val Gly Lys Val Thr Leu 1305 1310 1315 cea gag ata cat ggt aaa aat aaa ate cea ttc aaa act gta cga 4375 Pro Glu He Thr His Gly Lys Asn Lys He Pro Phe Lys Thr Val Arg 1320 1325 1330 1335 gat tat ate cag gac ttc here aag tta tgt tea gga gaa acc gac ccc 4423 Asp Tyr He Gln Asp Phe Thr Lys Leu Cys Ser Gly Glu Thr Asp Pro 1340 1345 1350 aaa gat ect tta cat agg gct gga here ctg age ect ctt aac cta aaa 4471 Lys Asp Pro Leu His Arg Wing Gly Thr Leu Ser Pro Leu Asn Leu Lys 1355 1360 1365 aga att atg falls act cea gaa gga ggg gat aga aga aga tgg cea gaa 4519 Arg He Met His Thr Pro Glu Gly Gly Asp Arg Arg Asn Trp Pro Glu 1370 1375 1380 gag tta gtt aat aaa tgc cat aaa aat tat gat ggc falls here gat act 4567 Glu Leu Val Asn Lys Cys His Lys Asn Tyr A = p Gly His Thr Asp Thr 1385 1390 1395 tat gga aga atg agt tgg gat aag ect gcg ect here ctt acg acg aaa 4615 Tyr Gly Arg Met Ser Trp Asp Lys Pro Wing Pro Thr Leu Thr Thr Lys 1400 1405 1410 1415 tgt aat agt tac tcc aat ggt cgt ttt ggg cat ect gac ccc act caa 4663 Cys Asn Ser Tyr Ser Asn Gly Arg Phe Gly His Pro Asp Pro Thr Gln 1420 1425 1430 cat aga gca att age ata aga gaa gca tea aga tta caá here ttt ect 4711 His Arg Wing He Ser He Arg Glu Wing Being Arg Leu Gln Thr Phe Pro 1435 1440 1445 71 tta age tat gtt ttt aaa ggt tcg ctg aat tea atg gca aag caa ate 4759 Leu Ser Tyr Val Phe Lys Gly Ser Leu Asn Ser Met Ala Ala Lys Gln He 1450 1455 1460 ggc aat gct gta ect tgc gaa etc gct aga cta ttt ggg cta cat etc 4807 Gly Asn Wing Val Pro Cys Glu Leu Wing Arg Leu Phe Gly Leu His Leu 1465 1470 1475 ata gaa aat tgt act aat aag gat tea tagatatatg gctaaaataa 4854 He Glu Asn Cys Thr Asn Lys Asp Ser 1480 1485 gaacaaaggc tcgagctttg gac atg ctt ggc aga caa caa att gca ggt ata 4907 Met Leu Gly Arg Gln Gln He Wing Gly He 1490 1495 ect act gcc ttg agt gag tta ttt aaa aat gct cat gat gcc tat gct 4955 Pro Thr Ala Leu Ser Glu Leu Phe Lys Asn Ala His Asp Ala Tyr Ala 1500 1505 1510 gat aat gtc gaa gtt gat ttt ttt agg aaa gaa aat ctt ctt ate ttg 5003 Asp Asn Val Glu Val Asp Phe Phe Arg Lys Glu Asn Leu Leu He Leu 1515 1520 1525 1530 aga gat gat gga tta ggt atg here acc gat gaa ttt gaa gag agg tgg 5051 Arg Asp Asp Gly Leu Gly Met Thr Thr Asp Glu Phe Glu Glu Arg Trp 1535 1540 1545 ttg act att gga acc tcc age aaa tta ate gac gat gat gca att aat 5099 Leu Thr He Gly Thr Ser Ser Lys Leu He Asp Asp Asp Ala He Asn 1550 1555 1560 aaa cea gca gtg gat agat aat aaa gcc ttt cgc ect ate atg gga gag 5147 Lys Pro Wing Val Asp Ser Asn Lys Wing Phe Arg Pro He Met Gly Glu 1565 1570 1575 aaa gga ata ggc cgt tta tet ate gca gca att gga cea cag gtg ctg 5195 Lys Gly He Gly Arg Leu Ser He Wing Wing He Gly Pro Gln Val Leu 1580 1585 1590 gtt ctt act agg gcc aaa aga gac aat gag ctt aag cea tta gtt gct 5243 Val Leu Thr Arg Ala Lys Arg Asp Asn Glu Leu Lys Pro Leu Val Wing 1595 1600 1605 1610 gca ttt gtt aat tgg agt tta ttt gct ata cea tea ctt gat ctt gat 5291 Wing Phe Val Asn Trp Ser Leu Phe Wing Pro Pro Leu Asp Leu Asp 1615 1620 1625 72 gat ata gaa ata ata cea att aga aet att ate aac gac gaa tcc tcc act 5339 Asp He Glu He Pro He Arg Thr He He Asn Asp Glu Cys Phe Thr 1630 1635 1640 aaa aaa act ctt gat gag atg att gag ca gca aga aat aat tta gac 5387 Lys Thr Leu Asp Glu Met He Glu Gln Ala Arg Asn Asn Leu Asp 1645 1650 1655 tet tta tea falls aaa ata tea aaa tea aaa gta tea ca ata ata aat here 5435 Ser Leu Ser His Lys He Ser Lys Ser Lys Val Ser Gln He Asn Thr 1660 1665 1670 cata tta tea tet ttt gaa ttt gat ect att cta tgg gaa aaa aaa tta 5483 Gln Leu Ser Ser Phe Glu Phe Asp Pro He Leu Trp Glu Lys Lys Leu 1675 1680 1685 1690 ggt ggg cta aga cta tet gga gat ggg cat gga act falls ttc ata ata 5531 Gly Gly Leu Arg Leu Ser Gly Asp Gly His Gly Thr His Phe He He 1695 1700 1705 atg ect acc gaa gaa ata tta ata gat gac att tcc acg age gat age 5579 Met Pro Thr Glu Glu He Leu He A = p Asp He Ser Thr Ser Asp Ser 1710 1715 1720 aat aaa here tea gag cag tet tet cgc tta gaa aaa gct tta tta ggt 5627 Asn Lys Thr Ser Glu Gln Ser Ser Arg Leu Glu Lys Ala Leu Leu Gly 1725 1730 1735 ttt here aac here atg tac agt gat tea aac ect ect att ata gct cgt 5675 Phe Thr Asn Thr Met Tyr Ser Asp Ser Asn Pro Pro He He Wing Arg 1740 1745 1750 ttt aga gac tat ctg gaa gat ggt gag tgc att gac aga att age gaa 5723 Phe Arg Asp Tyr Leu Glu Asp Gly Glu Cys He Asp Arg He Ser Glu 1755 1760 1765 1770 tea att ttt ttt here ccg caa gaa ttc aat ctt gca gat falls falls att 5771 Ser He Phe Phe Thr Pro Gln Glu Phe Asn Leu Wing Asp His His He 1775 1780 1785 gaa gga tgg ttc aat gaa ttt ggt cat ttc agt gga act gtt tet gtt 5819 Glu Gly Trp Phe Asn Glu Phe Gly Gln Phe Ser Gly Thr Val Ser Val 1790 • - 1795 1800 tat ggt gaa gag cea att cat cat gtc gtg act tgg aaa aat aat aat aat 5867 Tyr Gly Glu Glu Pro He His His Val Val Thr Trp Lys Asn Asn Asn 1805 1810 1815 73 caá tta acc caá tgc ggt cea ttt aaa ata aaa tta gcg tat att cat 5915 Gln Leu Thr Gln Cys Gly Pro Phe Lys He Lys Leu Wing Tyr He His 1820 1825 1830 ggt cgg ctt cgt gat tea cgc tta ccc atg gag ttg tgg gcc ect ctg 5963 Gly Arg Leu Arg Asp Ser Arg Leu Pro Met Glu Leu Trp Wing Pro Leu 1835 1840 1845 1850 aag gag aaa here gat aga tat ggt ggt tta tat ate tat cga gat gga 6011 Lys Glu Lys Thr Asp Arg Tyr Gly Gly Leu Tyr He Tyr Arg A = p Gly 1855 1860 1865 tta aga att ttg ccc tat gga gat tea gat acg gat ttt cta aaa ata 6059 Leu Arg He Leu Pro Tyr Gly Asp Ser Asp Thr Asp Phe Leu Lys He 1870 1875 1880 gaa aag aga aga acg tta tcc gct tet gaa tat ttt ttc tea tat cga 6107 Glu Lys Arg Arg Thr Leu Ser Wing Ser Glu Tyr Phe Phe Ser Tyr Arg 1885 1890 1895 cgt ttg ttt gga gca ata gaa tta here aaa gaa aac aat gct tea tta 6155 Arg Leu Phe Gly Ala He Glu Leu Thr Lys Glu Asn Asn Ala Ser Leu 1900 1905 1910 gtt gaa aaa gct ggg cga gaga gga ttc att gaa aat aag cea tat aaa 6203 Val Glu Lys Ala Gly Arg Glu Gly Phe He Glu Asn Lys Pro Tyr Lys 1915 1920 1925 1930 cag ttt aaa gaa atg ctt gaa aat ttc ttc ate gaa ate gca aga gat 6251 Gln Phe Lys Glu Met Leu Glu Asn Phe Phe He Glu He Wing Arg Asp 1935 1940 1945 ttc ttt aag gac gat ggc gat atg tet gaa tta ttt gtt gag here aag 6299 Phe Phe Lys Asp Asp Gly Asp Met Ser Glu Leu Phe Val Glu Thr Lys 1950 1955 1960 cag cgt aga aat gaa gaa cat gat ttg tta tet aaa aga tet aaa caa 6347 Gln Arg Arg Asn Glu Glu His Asp Leu Leu Ser Lys Arg Ser Lys Gln 1965 1970 1975 act aaa gct aaa aaa gat aga tta aag aaa gat ctg tat gat ttt ttt 6395 Thr Lys Wing Lys Lys Asp Arg Leu Lys Lys Asp Leu Tyr Asp Phe Phe 1980 1985 1990 gat aag tta gat aat gat tac tgg aat att gaa ata aat aag cta ate 6443 Asp Lys Leu Asp Asn Asp Tyr Trp Asn He Glu He Asn Lys Leu He 1995 2000 2005 2010 74 aat aaa aac gag gaa tat ttc tcc agt here gaa ata ata gac acc aat 6491 Asn Lys Asn Glu Glu Tyr Phe Be Ser Thr Glu He Thr Asp Thr Asn 2015 2020 2025 ata gat tat gta tac aat aaa att aaa gaa caat aat gat gct ate att 6539 He Asp Tyr Val Tyr A = n Lys He Lys Glu Gln Asn Asp Ala He He 2030 2035 2040 aaa aat cta cgt aat tet gtg gat ata aag aaa ccc tet gga gtt gga 6587 Lys Asn Leu Arg Asn Ser Val Asp He Lys Lys Pro Ser Gly Val Gly 2045 2050 2055 tta here aaa gag tta tet aat tta .tgg gat aga tat ca ata ata gaa aga 6635 Leu Thr Lys Glu Leu Ser Asn Leu Trp Asp Arg Tyr Gln He Glu Arg 2060 2065 2070 caa aaa ata ctg tta tea cta aat gag cta aaa gat aac gtt gat aga 6683 Gln Lys He Leu Leu Ser Leu Asn Glu Leu Lys Asp Asn Val Asp Arg 2075 2080 2085 2090 aag ctt ata gaa ctg gat aat aaa aat aat gat ttt etc aac tta cgg 6731 Lys Leu He Glu Leu Asp Asn Lys Asn Asn Asp Phe Leu Asn Leu Arg 2095 2100 2105 aag aga ctt gaa gat tet ttg aat cta caa cata agt tac tat gaa aaa 6779 Lys Arg Leu Glu Asp Ser Leu Asn Leu Gln Gln Ser Tyr Tyr Glu Lys 2110 2115 2120 gaa cta here aag tta tat aat gac gct aaa aat gct ttg aaa gat gtg 6827 Glu Leu Thr Lys Leu Tyr Asn Asp Ala Lys Asn Ala Leu Lys Asp Val 2125 2130 2135 caat tet aaa gca aat agg tta att tet gat aat aaaaa aaa cat aag 6875 Gln Ser Lys Wing Asn Arg Leu He As As Asn Lys Lys Lys His Lys 214C 2145 2150 agt gaa cta aaa aac att tet tat gaa ttc caa tea act aat etc aat 6923 Ser Glu Leu Lys Asn He Ser Tyr Glu Phe Gln Ser Thr Asn Leu Asn 2155 2160 2165 2170 ggc aaa gat act gcg tat ata ttg gat gta aaa aga aat cta gaa agt 6971 Gly Lys Asp Thr Wing Tyr He Leu Asp Val Lys Arg Asn Leu Glu Ser 2175 2180 2185 aaa att gag aat act tea aac gaa gtg att aat gaa ata aga aaa cta 7019 Lys He Glu Asn Thr Ser Asn Glu Val He Asn Glu He Arg Lys Leu 2190 2195 2200 75 acc gac cag att gca ata att agt gat agt acc act tet gaa aat tta 7067 Thr Asp Gln He Wing He He Ser Asp Ser Thr Thr Ser Glu Asn Leu 2205 2210 2215 tea tcg gct ca gta act gaa gca ate gaa act gaa ctt gaa cat tta 7115 Ser Ser Wing Gln Val Thr Glu Wing He Glu Thr Glu Leu Glu His Leu 2220 2225 2230 cga gac caá caca gca aat aac gca gag tta ata cta ctt ggc atg gct 7163 Arg Asp Gln Gln Wing Asn Asn Wing Glu Leu He Leu Leu Gly Met Wing 2235 2240 2245 2250 ctt tet gta gta cat cat gaa ttt aat ggt aat att agg gca att aga 7211 Leu Ser Val Val His His Glu Phe Asn Gly Asn He Arg Ala He Arg 2255 2260 2265 agt gcg cta agg gaa tta aaa gca tgg gct gac aga aat ect aag ctt 7259 Be Ala Leu Arg Glu Leu Lys Ala Trp Ala Asp Arg Asn Pro Lys Leu 2270 2275 2280 gat ata ata tac caa aaa ate aga act agt ttt gat fall tta gat ggt 7307 Asp He He Tyr Gln Lys He Arg Thr Ser Phe Asp His Leu Asp Gly 2285 2290 2295 tat tta aaa acc ttt here cea ttg here aga cgt tta agt cgc tet aaa 7355 Tyr Leu Lys Thr Phe Thr Pro Leu Thr Arg Arg Leu Ser Arg Ser Lys 2300 2305 2310 acc aat ata act gga act gcc att tta gaa ttt ate aga gat gta ttc 7403 Thr Asn He Thr Gly Thr Ala He Leu Glu Phe He Arg Asp Val Phe 2315 2320 2325 2330 gat gat cgt ctt gag aaa gaa gga att gaa tta ttc act acc tea aag 7451 Asp Asp Arg Leu Glu Lys Glu Gly He Glu Leu Phe Thr Thr Ser Lys 2335 2340 2345 ttt gtt aat caa gaa att gta act tac here tea att tact ect gtc 7499 Phe Val Asn Gln Glu He Val Thr Tyr Thr Ser Thr He Tyr Pro Val 2350 2355 2360 ttt ata aat cta att gat aac gca ata tac tgg ctt ggg aaa here act 7547 Phe He Asn Leu He Asp Asn Ala He Tyr Trp Leu Gly Lys Thr Thr 2365 2370 2375 gga gaa aaa aga ctt ata ctt gat gct act gaa here gga ttt gtt att 7595 Gly Glu Lys Arg Leu He Leu Asp Wing Thr Glu Thr Gly Phe Val He 2380 2385 2390 76 ggt gat act ggt ccc ggt gtt tea act aga gat gat gat ata ata ttt 7643 Gly Asp Thr Gly Pro Gly Val Ser Thr Arg Asp Arg Asp He He Phe 2395 2400 2405 2410 gat atg gga ttt here cga aaa here gga ggg cgt gga atg gga tta ttc 7691 Asp Met Gly Phe Thr Arg Lys Thr Gly Gly Arg Gly Met Gly Leu Phe 2415 2420 2425 att tcc aaa gag tgt tta tet cga gat gga ttt act ata aga ttg gat 7739 He Ser Lys Glu Cys Leu Ser Arg Asp Gly Phe Thr He Arg Leu Asp 2430 2435 2440 gat tac act ect gaa cag ggt gct. ttc ttt att att gag cea tea gaa 7787 Asp Tyr Thr Pro Glu Gln Gly Wing Phe Phe He He Glu Pro Ser Glu 2445 2450 2455 gaa here agt gaa tag cggatataaa taa atg here age tet act gat ttt 7836 Glu Thr Ser Glu Met Thr Be Ser Thr Asp Phe 2460 2465 2470 cat aaa ctt tet gaa gac tgc gtt cgc cgt ttt tta cat tet gta ttt gta 7884 His Lys Leu Ser Glu Asp Cys Val Arg Arg Phe Leu His Ser Val Val 2475 2480 2485 gct gta gat gac aat atg tet ttt gga ggt agg gat act ttc ect 7932 Wing Val Asp Asp Asn Met Ser Phe Gly Wing Gly Ser Asp Thr Phe Pro 2490 2495 2500 here gac gaa gat att aat gct tta gtt gat ccc gac gat gat ect here 7980 Thr Asp Glu Asp He Asn Wing Leu Val Asp Pro Asp Asp Asp Pro Thr 2505 2510 2515 cea ata ata here gca tea gca tcc cea agg ata gaa tea act aaa tea 8028 Pro He He Thr Wing Being Wing Pro Pro Arg He Glu Ser Thr Lys Ser 2520 2525 2530 aaa gca aag gta aaa aac cat ect ttt gat tac ca gct cta gca gaa 8076 Lys Ala Lys Val Lys Asn His Pro Phe Asp Tyr Gln Ala Leu Ala Glu 2535 2540 2545 2550 gct ttc gcc aaa gat ggt att gct tgt tgc gga tta tta gct aag agt 8124 Wing Phe Wing Lys Asp Gly He Wing Cys Cys Gly Leu Wing Lys Lys Ser 2555. 2560 2565 ttt aat gtt gaa gaa aga gat ata ata here gca tea tcc falls aag gca 8172 Phe Asn Val Glu Glu Arg Asp He He Thr Ala Ser Ser His Lys Ala 2570 2575 2580 77 gat ata ata ata ctt gac tgg gat atg caa age gat agt ggg ca t tt 8220 Asp He Thr He Leu Asp Trp Asp Met Gln Ser Asp Ser Gly Gln Phe 2585 2530 2595 gct att gaa ata ata aaa tcg ata ate gtt tea gat ata aat tet gga 8268 Wing He Glu He He Lys Ser He He Val Ser Asp He Asn Ser Gly 2600 2605 2610 gga cgt tta cgt ctt ctt tet att tat act ggt gaa cat gtt act gct 8316 Gly Arg Leu Arg Leu Leu Ser He Tyr Thr Gly Glu His Val Thr Wing 2615 2620 2625 2630 gtt ata act aag ttg aac aat gag .tta aag aaa here tac cgt age gta 8364 Val He Thr Lys Leu Asn Asn Glu Leu Lys Lys Thr Tyr Arg Ser Val 2635 2640 2645 ata aaa aat gat gat agt att ttt att gaa gat aac tat gca etc gaa 8412 He Lys Asn A = p Asp Ser He Phe He Glu Asp Asn Tyr Ala Leu Glu 2650 2655 2660 cagg tgg tgt ata gtt gtt att agt aaa gac gtt tat gaa aaa gat ctt 8460 Gln Trp Cys He Val Val He Ser Lys Asp Val Tyr Glu Lys Asp Leu 2665 2670 2675 cea aat gtg tta ata aaa aaa ttc act aac ctt here gct ggg ttg cta 8508 Pro Asn Val Leu He Lys Lys Phe Thr Asn Leu Thr Wing Gly Leu Leu 2680 2685 2690 tcc aac gcc gca etc tet tgc att tet gaa ata aga gaa aaa acc cat 8556 Ser Asn Ala Ala Leu Ser Cys He Ser Glu He Arg Glu Lys Thr His 2695 2700 2705 2710 ggg ata tta here aaa tat aat aat aaa tta gac act gca tat gtt tcc 8604 Gly He Leu Thr Lys Tyr Asn Asn Lys Leu Asp Thr Wing Tyr Val Ser 2715 2720 2725 cae ate tta aat tta ata aaa tcc aag gag tea agg gca tat gct tat 8652 His He Leu Asn Leu He Lys Ser Lys Glu Ser Arg Ala Tyr Ala Tyr 2730 2735 2740 gaa aat gct cat gat tat gca gta gat tta att tet gaa gaa ata aga 8700 Glu Asn Wing His Asp Tyr Wing Val Asp Leu He Ser Glu Glu He Arg 2745 2750 2755 tea ata ttg caa ata agt gaa aac tta aag aaa tet cta age aaa aac 8748 Ser He Leu Gln He Ser Glu Asn Leu Lys Lys Ser Leu Ser Lys Asn 2760 2765 2770 78 tcc tta tcc cat tgg ect att ttt fall tat gca aaa aat ggt tgt aag 8796 Ser Leu Ser His Trp Pro He Phe His Tyr Ala Lys Asn Gly Cys Lys 2775 2780 2785 2790 aat ttt cta tta act gga aaa aaa caa aaa gac tta tea gta gaa cat 8844 Asn Phe Leu Leu Thr Gly Lys Lys Gln Lys Asp Leu Ser Val Glu His 2795 2800 2805 cta agg aat ata etc tet gct gat tet tta gaa gaa att caá falls gct 8892 Leu Arg Asn He Leu Ser Wing Asp Ser Leu Glu Glu He Gln His Wing 2810 2815 2820 att gaa falls gca tet tta ggt aa .aag gaa tac tta age ca gt gat ggt 8940 He Glu His Wing Ser Leu Gly Lys Lys Glu Tyr Leu Ser Gln Asp Gly 2825 2830 2835 gaa gaa gat aaa aag tta atg cata tta tgc tet ctg gaa ate acg cgc 8988 Glu Glu Asp Lys Lys Leu Met Gln Leu Cys Ser Leu Glu He Thr Arg 2840 2845 2850 agg agt tta aga tat cat tet cat ata gat aat gtg tcc tta aaa caa 9036 Arg Ser Leu Arg Tyr His Ser His He Asp Asn Val Ser Leu Lys Gln 2855 2860 2865 2870 gga act tta ctt tta gat gca tat aat ttt gtc tat cta tgc ata caa 9084 Gly Thr Leu Leu Leu Asp Ala Tyr Asn Phe Val Tyr Leu Cys He Gln 2875 2880 2885 cea tta tgt gat age gtc aga ttg cat gaa aaa gcc gat ttt tta ttc 9132 Pro Leu Cys Asp Ser Val Arg Leu His Glu Lys Wing Asp Phe Leu Phe 2890 2895 2900 etc agg gga here ctg gac gat aat aat tac aat ttg tta ate gaa gat 9180 Leu Arg Gly Thr Leu Asp A = p Asn Asn Tyr Asn Leu Leu He Glu Asp 2905 2910 2915 gaa tat ggc ggt ttt tat aaa att aaa atg ccg gca aaa gct tet aat 9228 Glu Tyr Gly Gly Phe Tyr Lys He Lys Met Pro Wing Lys Wing Ser Asn 2920 2925 • 2930 att att tea ttt tea ttt gga gtc gaa aat gga aac ggt gtc ate ata 9276 He He Be Ser Phe Ser Phe Gly Val Glu Asn Gly Asn Gly Val He He 2935 2940 2945 2950 ggg aaa aag aac aat cta gtt aat act gac tat ate tea ttc gtt ect 9324 Gly Lys Lys Asn Asn Leu Val Asn Thr Asp Tyr He Ser Phe Val Pro 2955 2960 2965 79 tta etc gtt gaa aaa ata tet act cea aaa gta ttg aaa tgg ate ggg 9372 Leu Leu Val Glu Lys He Ser Thr Pro Lys Val Leu Lys Trp He Gly 2970 2975 2980 gaa ata aaa here acg tac gcg caa aaa ata here act gat gtt gct gct 9420 Glu He Lys Thr Thr Tyr Ala Gln Lys He Thr Thr Asp He Val Wing 2985 2990 2995 aat ctg tea aga ata ggt tta gat cat cat gag tgg tta cga ata aaa 9468 Asn Leu Ser Arg He Gly Leu Asp Gln His Glu Trp Leu Arg He Lys 3000 3005 3010 tea aaa gat ata taaatgatta tatatgccgt cgttttataa aaactggcgg 9520 Ser Lys Asp He 3015 catgtatatc tagttagtcc atcatagaag tcaagaaatt tagtttgccc tatatettat 9580 agaaaatata ttttatatgc ttaaaaaaca ccatctttct aagatggcat ttatgtgctt 9640 tgtttcgatc aattacaact gatatattac catattgatt aattttatgt tatttaccaa 9700 agtaacggca tettaatata tcgtcataat atagtgcgcg ttctgactct aatactgaaa 9760 aatttatttg ttetatttta cacttactgc aaatagcatc cagtttatca tatagtgteg 9820 catcaattgg cgcag atg tea tea cgc ca ate ate ctt gag cat tat aat gct 9871 Met Being Ser Arg Gln He Leu Glu His Tyr Asn Ala 3020 3025 3030 cta here tat ccc cta cat ca ate tea ttg ttg cag ata atg act tcg 9919 Leu Thr Tyr Pro Leu His Gln Ser He Leu Leu Gln He Met Thr Ser 3035 3040 3045 aat ttg tta tea gtt tgc act gga aaa tcc att tac gag gat ate tcc 9967 Asn Leu Leu Ser Val Cys Thr Gly Lys Ser He Tyr Glu Asp He Ser 3050 3055 3060 ggc agt tet tgg aat ate ata fall ttc aat ate ect etc ccc ate tet 10015 Gly Be Ser Trp Asn He He His Phe A = n He Pro Leu Pro He Ser 3065 3070 3075 aga gcg aga ctt 'tcc ata ttt tet tat tgt gtc aga att aaa ect tgg 10063 Arg Ala Arg Leu Ser He Phe Ser Tyr Cys Val Arg He Lys Pro Trp 3080 3085 3090 atg agt atg gat tac atg taaccggctc atttaaaccg tctggtctgt 10111 Met Met Met Asp Tyr Met 80 3095 3100 ttcctccggt tttacaaaaa ta atg tcc ate att ttt aat gga falls tat cgt 10163 Met Ser He He Phe Asn Gly His Tyr Arg 3105 3110 atg aaa falls cgg act tgg ate act gaa gct tta cgt ctt falls ttt gaa 10211 Met Lys His Arg Thr Trp He Thr Glu Ala Leu Arg Leu His Phe Glu 3115 3120 3125 gaa cat tta ccc cag gtt gtg gtc ggg cgt cgc ctg ggc gta cea aaa 10259 Glu His Leu Pro Gln Val Val Val Gly Arg Arg Leu Gly Val Pro Lys 3130 3135 3140 tea here gct tgt ggt atg ttc gtg cgc ttt cgc aaa gct ggc ttt tea 10307 Be Thr Wing Cys Gly Met Phe Val Arg Phe Arg Lys Wing Gly Phe Ser 3145 3150 3155 tgg ect ctg ccc gca ggt atg tcg gag cgg gag ctt gat ggc cgt ctt 10355 Trp Pro Leu Pro Wing Gly Met Ser Glu Arg Glu Leu Asp Gly Arg Leu 3160 3165 3170 tac ggg agt acc tcc here gta ect gtc gta ctt tgt agt gga tcg gta 10403 Tyr Gly Ser Thr Ser Thr Val Pro Val Val Leu Cys Ser Gly Ser Val 3175 3180 3185 3190 att cag gac acc tcg aaa tcc tgt taatgttaaa acagtgaaaa tgaggtgatg 10457 He Gln Asp Thr Ser Lys Ser Cys 3195 c atg ate aaa act cgt cgg act aaa cgt acc ttt ccg gag ttc aag 10506 Met He Lys Thr Arg Arg Thr Lys Arg Thr Phe Ser Pro Glu Phe Lys 3200 3205 3210 ctt gaa gct ttc gag cag gtg gtg gtt aaa tac cag cgt gat gtc aga 10554 Leu Glu Wing Phe Glu Gln Val Val Val Lys Tyr Gln Arg Asp Val Arg 3215 3220 3225 3230 gaa gtc gcg cag gca etc gag etc aac ect gac cat ttg cgt aaa tgg 10602 Glu Val Ala Gln Ala Leu Glu Leu Asn Pro A = p His Leu Arg Lys Trp 3235 3240 3245 ata cgg ttg tat aag cag gaa ctt cag ggt att gag cea gct ggt aat 10650 He Arg Leu Tyr Lys Gln Glu Leu Gln Gly He Glu Pro Wing Gly Asn 3250 3255 3260 gct att acc ect gaa cag cgc gaa att cag cag ctt aaa gcg cag ata 10698 Wing He Thr Pro Glu Gln Arg Glu He Gln Gln Leu Lys Wing Gln He 81 3265 3270 3275 aag cgc gtt gag atg gaa aaa gaa ata cta aag cag gct gcc gtg ctg 10746 Lys Arg Val Glu Met Glu Lys Glu He Leu Lys.Gln Ala Ala Val Leu 3280 3285 3290 atg age gaa ate ccc ggg aag ctg tcg cgc taatcacaca gctgaaagca 10796 Met Ser Glu He Pro Gly Lys Leu Ser Arg 3295 3300 aagtggccag tgtgggttat ttgtcattta ttcggtatta accgtagcgt ttattacgcg 10856 caggtgaagc gtcctgttaa tgtgcaaaga attgaattac gaagccgggt gagggctttc 10916 catgctctca gtcgtggcgc agccgggtag ccgggcaatc agtcagatgt tgcgccagag 10976 tggcgttgat gcaggccggt ggetggeatg aegactgatg cgggaatgag ggctgacaag 11036 tcgacagccg gttaaacatc acaaccgggt aaaegaagac aaaagtecgc cattgccaaa 11096 tttactgaac cggcaatttc accccgccgc accaaactgc gtctggtgcg gcgacatcag 11156 ttttattcg 11165 < 210 > 35 < 211 > 366 < 212 > PRT < 213 > Escherichia coli < 400 > 35 Being Asp Met Gln Arg Gly He Gln Wing Wing Thr Wing Wing Leu Gln Gly 1 5 10 15 Leu Val Gly Gly Asn Met Ala Gly Ala Leu Ala Gly Ala Ser Ala Pro 20 25 30 Glu Leu Wing Asn He He Gly His His Wing Gly He Asp Asp Asn Thr 35 40 45 Ala Ala Lys Ala He Ala His Ala He Leu Gly Gly Val Thr Ala Ala 50 55 60 Leu Gln Gly Asn Be Wing Wing Wing Gly Wing He Gly Wing Gly Thr Gly 65 70 75 80 Glu Val He Ala Wing Wing He Wing Lys Ser Leu Tyr Pro Gly Val Asp 85 90 95 82 Pro Ser Lys Leu Thr Glu Asp Gln Lys Gln Thr Val Ser Thr Leu Ala 100 105 110 Thr Leu Ser Ala Gly Met Ala Gly Gly He Ala Ser Gly Asp Val Ala 115 120 125 Gly Ala Ala Ala Gly Ala Gly Ala Gly Lys Asn Val Val Glu Asn Asn 130 135 140 Ala Leu Ser Leu Val Ala Arg Gly Cys Ala Val Ala Ala Pro Cys Arg 145 150 155 160 Thr Lys Val Wing Glu Gln Leu Leu Glu He Gly Wing Lys Wing Gly Met 165 170 175 Ala Gly Leu Ala Gly Ala Ala Val Lys Asp Met Ala Asp Arg Met Thr 180 185 190 Being Asp Glu Leu Glu His Leu He Thr Leu Gln Met Met Gly Asn Asp 195 200 205 Glu He Thr Thr Lys Tyr Leu Ser Ser Leu His Asp Lys Tyr Gly Ser 210 215 220 Gly Ala Ala Ser Asn Pro Asn He Gly Lys Asp Leu Thr Asp Ala Glu 225 230 235 240 Lys Val Glu Leu Gly Gly Ser Gly Ser Gly Thr Gly Thr Pro Pro Pro 245 250 255 Ser Glu Asn Asp Pro Lys Gln Gln Asn Glu Lys Thr Val Asp Lys Leu 260 265 270 Asn Gln Lys Gln Glu Be Wing He Lys Lys He Asp Asn Thr He Lys 275 280 285 Asn Ala Leu Lys Asp His Asp He He Gly Thr Leu Lys Asp Met Asp 290 295 300 Gly Lys Pro Val Pro Lys Glu Asn Gly Gly Tyr Trp Asp His Met Gln 305 310 315 320 Glu Met Gln Asn Thr Leu Arg Gly Leu Arg Asn His Wing Asp Thr Leu 325 330 335 Lys Asn Val Asn Asn Pro Glu Wing Gln Wing Wing Tyr Gly Arg Wing Thr 340 345 350 83 Asp Ala He Asn Lys He Glu Be Ala Leu Lys Gly Tyr Gly 355 360 365 < 210 > 36 < 211 > 128 < 212 > PRT < 213 > Escherichia coli < 400 > 36 Met He Thr Leu Arg Lys Leu He Gly A = n He Asn Met Thr Lys Glu 1 5 10 15 Pro Glu Gln Gln Ser Pro Leu Glu Leu Trp Phe Glu Arg He He Asp 20 25 30 Val Pro Leu Glu Lys Leu Thr Val Glu A = p Leu Cys Arg Ala He Arg 35 40 45 Gln Asn Leu Cys He Asp Gln Leu Met Pro Arg Val Leu Glu Val Leu 50 55 60 Thr Lys Glu Pro Leu Wing Gly Glu Tyr Tyr Asp Gly Glu Leu He Wing 65 70 75 80 Ala Leu Ser Thr He Lys Gly Glu Asp Leu Lys Asp Gln Lys Ser Thr 85 90 95 Phe Thr Gln He Arg Gln Leu He Asn Gln Leu Glu Pro Ser Asp He 100 105 110 As Asp Asp Leu Arg Lys Asp As Leu As Asn Gln He He Val 115 120 125 < 210 > 37 < 211 > 107 < 212 > PRT < 213 > Escherichia coli < 400 > 37 Met Val Ala Lys Ala Phe Ala Tyr Ala Leu Asn Gln Trp Pro Ala Leu 1 5 10 15 Thr Tyr Tyr Wing Asn Asp Gly Trp Val Glu He Asp Asn Asn He Wing 20 25 30 84 Glu Asn Ala Leu Arg Ala Val Ser Leu Gly Arg Lys Asn Phe Leu Phe 35 40 45 Phe Gly Ser Asp His Gly Gly Glu Arg Gly Wing Leu Leu Tyr Ser Leu 50 55 60 He Gly Thr Cys Lys Leu Asn Asp Val A = p Pro Glu Ser Tyr Leu Arg 65 70 75 80 His Val Leu Ala Val He Ala Asp Trp Pro Val A = n Arg Val Ser Glu 85 90 95 Leu Leu Pro Trp Arg He Ala Leu Pro Ala Glu 100 105 < 210 > 38 < 211 > 86 < 212 > PRT < 213 > Escherichia coli < 400 > 38 Met Leu Met Ser Val Gln Lys Glu Lys Asn Val Ala Glu Ser Val Val 1 5 10 15 Ser Glu Thr His Thr Gly Asp Ser Val Tyr Ala Ser Leu Phe Glu Lys 20 25 30 He Asn Leu Asn Pro Val Ser Ala Leu Ser Ala Leu Asp Asn Pro Phe 35 40 45 Arg Ser Wing Asp Asn Wing Thr Gly Arg He Thr Ser Ser He Gln Pro 50 55 60 Ala Val Gln Cys Ala Ala Ala Ala Ala Thr Glu Gly Ser Cys Pro Arg 65 70 75 80 Gln Ser Pro Cys Ser Gly 85 < 210 > 39 < 211 > 111 < 212 > PRT < 213 > Escherichia coli < 400 > 39 Met Val Asp Asn Trp Gln Lys Ser Val Arg Ser Arg Ala Leu Pro Glu 85 1 5 10 15 Glu Wing Met Thr Gly Trp Asn Glu Gly Met He Arg Leu Gln Gln Leu 20 25 30 Wing Glu Arg Leu Asn Arg Gln Asp Glu Gln Arg Gly Lys Tyr Met Thr 35 40 45 Val Ser Glu Leu Lys Thr Glu Val Phe Gly He Met Gln Wing Phe A = n 50 55 60 Arg His He Pro Wing Glu Glu Gln Leu Arg Arg Tyr Gly Glu Val Arg 65 70 75 80 Asn Gln Asn Gly Ser Glu Gln Gln Gln Lys Gln Wing Glu Met Wing Leu 85 90 95 Asn Gln Leu He Asn Arg Tyr Gln Met He Arg Wing Gly Lys Gln 100 105 110 < 210 > 40 < 211 > 143 < 212 > PRT < 213 > Escherichia coli < 400 > 40 Met Val Gly Cys Ala Trp Leu Ala Glu Gln Ala Phe Ser A = p His Ala 1 5 10 15 Leu Ser Pro His Ser Wing Trp Pro Tyr Ser Wing Ser Arg A = p Wing Gly 20 25 30 Leu Wing Asp Thr Gly Wing Gly Gly Tyr Pro Thr Cys Lys Gln Arg Trp 35 40 45 Wing Asp Asp Thr Val Gly Leu Lys Wing Arg Leu Leu Gln Leu Pro Wing 50 55 60 Leu Asp He Trp Thr Wing Phe Lys Lys He Asp Gln Ser Gln Val Val 65 70 75 80 Tyr Glu Glu Wing Val Leu Arg Ser Arg Val Ser Glu Arg Asn Met Gln 85 90 95 Val Ser Gln Asn Gly Arg Val Tyr Pro Ser Tyr Gly Gly Asn Val Asp 100 105 110 86 Gly Thr Val Ala Asn Ala Ala Thr Arg Leu Ala Ser Gly Ala Arg Asn 115 120 125 He Leu Gly Be He Wing Wing Cys Thr Wing Phe Asp Ser Val Arg 130 135 140 < 210 > 41 < 211 > 118 < 212 > PRT < 213 > Escherichia coli < 400 > 41 Met Val Gln Ala Gln Leu Gln He Ala Leu Val He Cys He Pro Leu 1 5 10 15 He Thr Leu Cys Ser Wing Trp Asp Val Lys Val Val Met Thr Leu Thr 20 25 30 Phe Val Gln Phe Ala Leu Phe Phe Leu Thr Phe Trp Trp Glu Leu Ala 35 40 45 Arg Trp Leu Asp Ser Trp Leu Leu Asp Val Leu Tyr Asn Ser Asp Thr 50 55 60 His Ser Ser Trp Asn Leu Ala Gly He Gln Asn Thr Gln Asp Asp Val 65 70 75 80 He He Asn Leu Val Met Arg Leu Met Phe Leu Val Leu Pro Thr Phe 85 90 95 Trp Leu Gly Wing Met Thr Trp Wing Gly Val Arg Val Gly Val Ala Leu 100 105 110 Asn Gly Ala Leu Ala Gly 115 < 210 > 42 < 211 > 81 < 212 > PRT < 213 > Escherichia coli < 400 > 42 Met Lys Tyr Leu Phe Phe Glu Asn He His Ser He Phe Leu Thr Phe 1 5 10 15 Ser Leu Phe Arg Thr Ser Val Pro Pro Asp Phe Pro Met He Phe Wing 87 20 25 30 Leu Pro Ser He He Leu Gly Gln Phe Thr Thr Asn Gln Leu Thr Asn 35 40 '45 Phe Val He Cys Met Gly Asn Thr Val Glu Arg Arg Leu Gly Val Val 50 55 60 His Asn Pro Phe Lys Arg Ser Gly Asp Gly His Asp Leu Arg Ala Val 65 70 75 80 To < 210 > 43 < 211 > 348 < 212 > PRT < 213 > Escherichia coli < 400 > 43 Leu He Val He Asp Phe Phe Cys Gly Cys Gly Gly Wing Ser Glu Gly 1 5 10 15 Leu Arg Gln Wing Gly Phe Asp He Glu Leu Gly Leu Asp He Asp Gln 20 25 30 Gln Ala Ser Glu Thr Phe Lys Wing Asn Phe Pro Asp Wing Lys Phe He 35 40 45 Gln Asp Asp He Arg Lys He Glu Pro Gln Asp He As Asp He He 50 55 60 Asp He Lys Wing Lys Arg Pro Leu Leu Leu Wing Wing Cys Wing Pro Cys 65 70 75 80 Gln Pro Phe Ser Gln Gln Asn Lys Asn Lys Thr Ser Asp Asp Ser Arg 85 90 95 Arg Asn Leu Leu Asn Glu Thr His Arg Phe He Arg Glu Leu Leu Pro 100 105 110 Glu Tyr He Met Leu Glu Asn Val Pro Gly Met Gln Lys He Asp Glu 115 120 125 Glu Lys Glu Gly Pro Phe Gln Glu Phe He Lys Leu Leu Lys Glu Leu 130 135 140 Glu Tyr Asn Tyr He Ser Phe He Wing A = n Wing Glu Asn Tyr Gly He 88 145 150 155 160 Pro Gln Arg Arg Lys Arg Leu Val Leu Leu? The Ser Arg Val Gly Lys 165 170 175 Val Thr Leu Pro Glu He Thr His Gly Lys Asn Lys He Pro Phe Lys 180 185 190 Thr Val Arg Asp Tyr He Gln Asp Phe Thr Lys Leu Cys Ser Gly Glu 195 200 205 Thr Asp Pro Lys Asp Pro Leu His Arg Wing Gly Thr Leu Ser Pro Leu 210 215 220 Asn Leu Lys Arg He Met His Thr Pro Glu Gly Gly Asp Arg Arg Asn 225 230 235 240 Trp Pro Glu Glu Leu Val Asn Lys Cys His Lys Asn Tyr Asp Gly His 245 250 255 Thr Asp Thr Tyr Gly Arg Met Ser Trp Asp Lys Pro Pro Wing Thr Leu 260 265 270 Thr Thr Lys Cys Asn Ser Tyr Ser Asn Gly Arg Phe Gly His Pro Asp 275 280 285 Pro Thr Gln His Arg Wing He Ser He Arg Glu Wing Ser Arg Leu Gln 290 295 300 Thr Phe Pro Leu Ser Tyr Val Phe Lys Gly Ser Leu Asn Ser Met Ala 305 310 315 320 Lys Gln He Gly Asn Wing Val Pro Cys Glu Leu Wing Arg Leu Phe Gly 325 330 335 Leu His Leu He Glu Asn Cys Thr Asn Lys Asp Ser 340 345 < 210 > 44 < 211 > 974 < 212 > PRT < 213 > Escherichia coli < 400 > 44 Met Leu Gly Arg Gln Gln He Wing Gly He Pro Thr Wing Leu Ser Glu 1 5 10 15 89 Leu Phe Lys Asn Ala His Asp Ala Tyr Ala Asp Asn Val Glu Val Asp 20 25 30 Phe Phe Arg Lys Glu Asn Leu Leu He Leu Arg Asp Asp Gly Leu Gly 35 40 45 Met Thr Thr Asp Glu Phe Glu Glu Arg Trp Leu Thr He Gly Thr Ser 50 55 60 Ser Lys Leu He Asp Asp Asp Wing He Asn Lys Pro Wing Val Asp Ser 65 70 75 80 Asn Lys Wing Phe Arg Pro He Met Gly Glu Lys Gly He Gly Arg Leu 85 90 95 Be He Wing Wing He Gly Pro Gln Val Leu Val Leu Thr Arg Wing Lys 100 105 110 Arg Asp Asn Glu Leu Lys Pro Leu Val Wing Wing Phe Val Asn Trp Ser 115 120 125 Leu Phe Ala He Pro Ser Leu Asp Leu Asp Asp He Glu He Pro He 130 135 140 Arg Thr He He Asn Asp Glu Cys Phe Thr Lys Lys Thr Leu Asp Glu 145 150 155 160 Met He Glu Gln Ala Arg Asn Asn Leu Asp Ser Leu Ser His Lys He 165 170 175 Ser Lys Ser Lys Val Ser Gln He Asn Thr Gln Leu Ser Ser Phe Glu 180 185 190 Phe Asp Pro He Leu Trp Glu Lys Lys Leu Gly Gly Leu Arg Leu Ser 195 200 205 Gly Asp Gly His Gly Thr His Phe He Met Met Pro Thr Glu Glu He 210 215 220 Leu He Asp Asp He Ser Thr Ser Asp Ser Asn Lys Thr Ser Glu Gln 225 230 235 240 Being Ser Arg Leu Glu Lys Ala Leu Leu Gly Phe Thr Asn Thr Met Tyr 245 250 255 Being Asp be Asn Pro Pro He He Wing Arg Phe Arg Asp Tyr Leu Glu 260 265 270 90 Asp Gly Glu Cys He A = p Arg He Ser Glu Be He Phe Phe Thr Pro 275 280 285 Gln Glu Phe A = n Leu Wing Asp His His He Glu Gly Trp Phe Asn Glu 290 295 '300 Phe Gly Gln Phe Ser Gly Thr Val Ser Val Tyr Gly Glu Glu Pro He 305 310 315 320 His His Val Val Thr Trp Lys Asn Asn Asn Gln Leu Thr Gln Cys Gly 325 330 335 Pro Phe Lys He Lys Leu Wing Tyr He His Gly Arg Leu Arg Asp Ser 340 345 350 Arg Leu Pro Met Glu Leu Trp Wing Pro Leu Lys Glu Lys Thr Asp Arg 355 360 365 Tyr Gly Gly Leu Tyr He Tyr Arg Asp Gly Leu Arg He Leu Pro Tyr 370 375 380 Gly Asp Ser Asp Thr Asp Phe Leu Lys He Glu Lys Arg Arg Thr Leu 385 390 395 400 Be Wing Ser Glu Tyr Phe Phe Ser Tyr Arg Arg Leu Phe Gly Wing He 405 410 415 Glu Leu Thr Lys Glu Asn Asn Wing Ser Leu Val Glu Lys Wing Gly Arg 420 425 430 Glu Gly Phe He Glu Asn Lys Pro Tyr Lys Gln Phe Lys Glu Met Leu 435 440 445 Glu Asn Phe Phe He Glu He Wing Arg Asp Phe Phe Lys Asp Asp Gly 450 455 460 Asp Met Ser Glu Leu Phe Val Glu Thr Lys Gln Arg Arg Asn Glu Glu 465 470 475 480 His Asp Leu Leu Ser Lys Arg Ser Lys Gln Thr Lys Wing Lys Lys Asp 485 490 495 Arg Leu Lys Lys Asp Leu Tyr Asp Phe Phe Asp Lys Leu Asp Asn Asp 500 505 510 Tyr Trp Asn He Glu He Asn Lys Leu He Asn Lys Asn Glu Glu Tyr 515 520 525 91 Phe Ser Ser Thr Glu He Thr Asp Thr Asn He Asp Tyr Val Tyr Asn 530 535 540 Lys He Lys Glu Gln Asn Asp Wing He He Lys Asn Leu Arg Asn Ser 545 550 555 560 Val Asp He Lys Lys Pro Ser Gly Val Gly Leu Thr Lys Glu Leu Ser 565 570 575 Asn Leu Trp Asp Arg Tyr Gln He Glu Arg Gln Lys He Leu Leu Ser 580 585 590 Leu Asn Glu Leu Lys Asp Asn Val Asp Arg Lys Leu He Glu Leu Asp 595 600 605 Asn Lys A = n Asn Asp Phe Leu Asn Leu Arg Lys Arg Leu Glu Asp Ser 610 615 620 Leu Asn Leu Gln Gln Ser Tyr Tyr Glu Lys Glu Leu Thr Lys Leu Tyr 625 630 635 640 Asn Asp Ala Lys Asn Ala Leu Lys Asp Val Gln Ser Lys Ala Asn Arg 645 650 655 Leu He Ser Asp Asn Lys Lys Lys His Lys Ser Glu Leu Lys Asn He 660 665 670 Ser Tyr Glu Phe Gln Ser Thr Asn Leu Asn Gly Lys Asp Thr Ala Tyr 675 680 685 He Leu Asp Val Lys Arg Asn Leu Glu Ser Lys He Glu Asn Thr Ser 690 695 700 Asn Glu Val He Asn Glu He Arg Lys Leu Thr Asp Gln He Wing He 705 710 715 720 I Have To Be Asp Being Thr Thr Ser Glu Asn Leu Being Being Wing Gln Val Thr 725 730 735 Glu Ala He Glu Thr Glu Leu Glu His Leu Arg Asp Gln Gln Ala Asn 740 745 750 Asn Ala Glu Leu He Leu Leu Gly Met Ala Leu Ser Val Val His His 755 760 765 Glu Phe Asn Gly Asn He Arg Wing He Arg Be Wing Leu Arg Glu Leu 770 775 780 92 Lys Wing Trp Wing Asp Arg Asn Pro Lys Leu Asp He He Tyr Gln Lys 785 790 795 800 He Arg Thr Ser Phe Asp His Leu Asp Gly Tyr Leu Lys Thr Phe Thr 805 810 815 Pro Leu Thr Arg Arg Leu be Arg Ser Lys Thr Asn He Thr Gly Thr 820 825 830 Ala He Leu Glu Phe He Arg Asp Val Phe Asp Asp Arg Leu Glu Lys 835 840 845 Glu Gly He Glu Leu Phe Thr Thr Ser Lys Phe Val Asn Gln Glu He 850 855 860 Val Thr Tyr Thr Ser Thr He Tyr Pro Val Phe He Asn Leu He Asp 865 870 875 880 Asn Wing He Tyr Trp Leu Gly Lys Thr Thr Gly Glu Lys Arg Leu He 885 890 895 Leu A = p Wing Thr Glu Thr Gly Phe Val He Gly Asp Thr Gly Pro Gly 900 905 910 Val Ser Thr Arg Asp Arg A = p He He Phe Asp Met Gly Phe Thr Arg 915 920 925 Lys Thr Gly Gly Arg Gly Met Gly Leu Phe He Ser Lys Glu Cys Leu 930 935 940 Ser Arg Asp Gly Phe Thr He Arg Leu Asp Asp Tyr Thr Pro Glu Gln 945 950 955 960 Gly Ala Phe Phe He l e Glu Pro Ser Glu Glu Thr Ser Glu 965 970 < 210 > 45 < 211 > 555 < 212 > PRT < 213 > Escherichia coli < 400 > 45 Met Thr Ser Ser Thr Asp Phe His Lys Leu Ser Glu Asp Cys Val Arg 1 5 10 15 Arg Phe Leu His Ser Val Val Ala Val Asp Asp Asn Met Ser Phe Gly 20 25 30 93 Wing Gly Ser Asp Thr Phe Pro Thr Asp Glu Asp He Asn Wing Leu Val 35 40 45 Asp Pro Asp Asp Asp Pro Thr Pro He He Thr Wing Be Wing Ser Pro 50 55 60 Arg He Glu Ser Thr Lys Ser Lys Wing Lys Val Lys Asn His Pro Phe 65 70 75 80 Asp Tyr Gln Ala Leu Ala Glu Ala Phe Ala Lys Asp Gly He Ala Cys 85 90 95 Cys Gly Leu Leu Wing Lys Ser Phe Asn Val Glu Glu Arg A = p He He 100 105 110 Thr Ala Ser Ser His Lys Wing Asp He Thr He Leu Asp Trp Asp Met 115 120 125 Gln Ser Asp Ser Gly Gln Phe Wing He Glu He He Lys be He He 130 135 140 Val Ser Asp He Asn Ser Gly Gly Arg Leu Arg Leu Leu Ser He Tyr 145 150 155 160 Thr Gly Glu His Val Thr Wing Val He Thr Lys Leu Asn Asn Glu Leu 165 170 175 Lys Lys Thr Tyr Arg Ser Val He Lys Asn Asp Asp Ser He He Phe He 180 185 190 Glu Asp Asn Tyr Ala Leu Glu Gln Trp Cys He Val Val He Ser Lys 195 200 205 Asp Val Tyr Glu Lys Asp Leu Pro A = n Val Leu He Lys Lys Phe Thr 210 215 220 Asn Leu Thr Ala Gly Leu Leu Ser Asn Ala Ala Leu Ser Cys He Ser 225 230 235 240 Glu He Arg Glu Lys Thr His Gly He Leu Thr Lys Tyr Asn Asn Lys 245 250 255 Leu Asp Thr Ala Tyr Val Ser His He Leu Asn Leu He Lys Ser Lys 260 265 270 Glu Ser Arg Ala Tyr Ala Tyr Glu Asn Ala His Asp Tyr Ala Val Asp 275 280 285 94 Leu He Ser Glu Glu He Arg Ser He Leu Gln He Ser Glu Asn Leu 290 295 300 Lys Lys Ser Leu Ser Lys Asn Ser Leu Ser His Trp Pro He Phe His 305 310 315 320 Tyr Ala Lys A = n Gly cys Lys Asn Phe Leu Leu Thr Gly Lys Lys Gln 325 330 335 Lys Asp Leu Ser Val Glu His Leu Arg Asn He Leu Ser Wing Asp Ser 340 345 350 Leu Glu Glu He Gln His Wing He Glu His Wing Ser Leu Gly Lys Lys 355 360 365 Glu Tyr Leu Ser Gln Asp Gly Glu Glu Asp Lys Lys Leu Met Gln Leu 370 375 380 Cys Ser Leu Glu He Thr Arg Arg Ser Leu Arg Tyr His Ser His He 385 390 395 400 Asp A = n Val Ser Leu Lys Gln Gly Thr Leu Leu Leu Asp Wing Tyr Asn 405 410 415 Phe Val Tyr Leu Cys He Gln Pro Leu Cys Asp Ser Val Arg Leu His 420 425 430 Glu Lys Wing Asp Phe Leu Phe Leu Arg Gly Thr Leu Asp Asp Asn Asn 435 440 445 Tyr A = n Leu Leu He Glu Asp Glu Tyr Gly Gly Phe Tyr Lys He Lys 450 455 460 Met Pro Ala Lys Ala Ser Asn He He Ser Phe Ser Phe Gly Val Glu 465 470 475 480 Asn Gly Asn Gly Val He He Gly Lys Lys Asn Asn Leu Val Asn Thr 485 490 495 Asp Tyr He Ser Phe Val Pro Leu Leu Val Glu Lys He Ser Thr Pro 500 505 510 Lys Val Leu Lys Trp He Gly Glu He Lys Thr Thr Tyr Wing Gln Lys 515 520 525 He Thr Thr Asp He Val Wing Asn Leu Ser Arg He Gly Leu Asp Gln 530 535 540 95 His Glu Trp Leu Arg He Lys Ser Lys Asp He 545 550 555 < 210 > 46 < 211 > 82 < 212 > PRT < 213 > Escherichia coli < 400 > 46 Met Ser Ser Arg Gln He Leu Glu His Tyr Asn Ala Leu Thr Tyr Pro 1 5 10 15 Leu His Gln Be He Leu Leu Gln He Met Thr Ser Asn Leu Leu Ser 20 25 30 Val Cys Thr Gly Lys Ser He Tyr Glu Asp He Ser Gly Ser Ser Trp 35 40 45 Asn He He His Phe Asn He Pro Pro Leu Pro He Ser Arg Ala Arg Leu 50 55 60 Ser He Phe Ser Tyr Cys Val Arg He Lys Pro Trp Met Ser Met Asp 65 70 75 80 Tyr Met < 210 > 47 < 211 > 98 < 212 > PRT < 213 > Escherichia coli < 400 > 47 Met Ser He He Phe Asn Gly His Tyr Arg Met Lys His Arg Thr Trp 1 5 10 15 He Thr Glu Ala Leu Arg Leu His Phe Glu Glu His Leu Pro Gln Val 20 25 30 Val Val Gly Arg Arg Leu Gly Val Pro Lys Ser Thr Ala Cys Gly Met 35 40 45 Phe Val Arg Phe Arg Lys Wing Gly Phe Ser Trp Pro Leu Pro Wing Gly 50 55 60 Met Ser Glu Arg Glu Leu Asp Gly Axg Leu Tyr Gly Ser Thr Ser Thr 96 65 70 75 80 Val Pro Val Val Leu Cys Ser Gly Ser Val He Gln Asp Thr Ser Lys 85 SO 95 Be cys < 210 > 48 < 211 > 106 < 212 > PRT < 213 > Escherichia coli < 400 > 48 Met He Lys Thr Arg Arg Thr Lys Arg Thr Phe Ser Pro Glu Phe Lys 1 5 10 15 Leu Glu Wing Phe Glu Gln Val Val Val Lys Tyr Gln Arg Asp Val Arg 20 25 30 Glu Val Ala Gln Ala Leu Glu Leu Asn Pro Asp His Leu Arg Lys Trp 35 40 45 He Arg Leu Tyr Lys Gln Glu Leu Gln Gly He Glu Pro Gly Wing Asn 50 55 60 Wing He Thr Pro Glu Gln Arg Glu He Gln Gln Leu Lys Wing Gln He 65 70 75 80 Lys Arg Val Glu Met Glu Lys Glu He Leu Lys Gln Ala Wing Val Leu 85 90 95 Met Ser Glu He Pro Gly Lys Leu Ser Arg 100 105 < 210 > 49 < 211 > 27 < 212 > DNA < 213 > Artificial Sequence < 220 > < 223 > Description of Artificial Sequence: Oligonucleotide < 400 > 49 tgctctagag ccattactca gaatggg 27 97 < 210 > 50 < 211 > 26 < 212 > DNA < 213 > Artificial Sequence < 220 > < 223 > Description of Artificial Sequence: Oligonucleotide < 400 > 50 cgcgagctcg acgactgaat gatccc 26 < 210 > 51 < 211 > 26 < 212 > DNA < 213 > Artificial Sequence < 220 > < 223 > Description of Artificial Sequence: Oligonucleotide < 400 > 51 tcccccgggt actgcagcac tcaacc 6 < 210 > 52 < 211 > 26 < 212 > DNA < 213 > Artificial Sequence < 220 > < 223 > Description of Artificial Sequence: Oligonucleotide < 400 > 52 gatcccggga ccactgaaat gcgtgc 26 < 210 > 53 < 211 > 27 < 212 > DNA < 213 > Artificial Sequence < 220 > < 223 > Description of Artificial Sequence: Oligonucleotide < 400 > 53 tcgtctagag atgatggtga tggagcg 27 98 < 210 > 54 < 211 > 28 < 212 > DNA < 213 > Artificial Sequence < 220 > < 223 > Description of Artificial Sequence.-oligonucleotide < 400 > 54 gaactgcagc caaatactga taccaccc 28 < 210 > 55 < 211 > 27 < 212 > DNA < 213 > Artificial Sequence < 220 > < 223 > Description of Artificial Sequence: 01igonucleotide < 400 > 55 gaactgcagg ctaaaacaga agacgcg 27 < 210 > 56 < 211 > 27 < 212 > DNA < 213 > Artificial Sequence < 220 > < 223 > Description of Artificial Sequence: Oligonucleotide < 400 > 56 catgcatgca ctccatatga caaccgc 27 < 210 > 57 < 211 > 27 < 212 > DNA < 213 > Artificial Sequence < 220 > < 223 > Description of Artificial Sequence: Oligonucleotide < 400 > 57 tcgtctagaa tgaagctgcg catgagg 27 99 < 210 > 58 < 211 > 27 < 212 > DNA < 213 > Artificial Sequence < 220 > < 223 > Description of Artificial Sequence: Oligonucleotide < 400 > 58 caactgcagt cgcaaattgc gaactgg 27 < 210 > 59 < 211 > 27 < 212 > DNA < 213 > Artificial Sequence < 220 > < 223 > Description of Artificial Sequence: Oligonucleotide < 400 > 59 caactgcaga ccgcaacttt tcgacgc 27 < 210 > 60 < 211 > 27 < 212 > DNA < 213 > Artificial Sequence < 220 > < 223 > Description of Artificial Sequence: Oligonucleotide < 400 > 60 catgcatgcc agtgagccat tgttccc 27 < 210 > 61 < 211 > 27 < 212 > DNA < 213 > Artificial Sequence < 220 > < 223 > Description of Artificial Sequence: Oligonucleotide < 400 > 61 tgctctagat acgactctga caggagg 27 100 < 210 > 62 < 211 > 26 < 212 > DNA < 213 > Artificial Sequence < 220 > < 223 > Description of Artificial Sequence: Oligonucleotide < 400 > 62 tcagatatca actaccagca gtttgg 26 < 210 > 63 < 211 > 27 < 212 > DNA < 213 > Artificial Sequence < 220 > < 223 > Description of Artificial Sequence: Oligonucleotide < 400 > 63 tcagatatcc ataaagagtg acgtggc 27 < 210 > 64 < 211 > 27 < 212 > DNA < 213 > Artificial Sequence < 220 > < 223 > Description of Artificial Sequence: Oligonucleotide < 400 > 64 tgctctagaa aacgtggcaa cagagcg 27 < 210 > 65 < 211 > 26 < 212 > DNA < 213 > Artificial Sequence < 220 > < 223 > Description of Artificial Sequence: Oligonucleotide < 400 > 65 tgctctagaa ggcgttgtcg atcctg 26 101 < 210 > 66 < 211 > 28 < 212 > DNA < 213 > Artificial Sequence < 220 > < 223 > Description of Artificial Sequence: Oligonucleotide < 400 > 66 gaactgcagg aaaaggccga gcagactg 28 < 210 > 67 < 211 > 27 < 212 > DNA < 213 > Artificial Sequence < 220 > < 223 > Description of Artificial Sequence: Oligonucleotide < 400 > 67 gaactgcagt acagccatgt ttacggt 27 < 210 > 68 < 211 > 27 < 212 > DNA < 213 > Artificial Sequence < 220 > < 223 > Description of Artificial Sequence: Oligonucleotide < 400 > 68 catgcatgcg gtgtacgaca gtttgcg 27 < 210 > 69 < 211 > 26 < 212 > DNA < 213 > Artificial Sequence < 220 > < 223 > Description of Artificial Sequence: Oligonucleotide < 400 > 69 tgctctagac acatcatggg cacacc 26 102 < 210 > 70 < 211 > 27 < 212 > DNA < 213 > Artificial Sequence < 220 > < 223 > Description of Artificial Sequence: Oligonucleotide < 400 > 70 gaactgcaga accgtccaca tcaggcg 27 < 210 > 71 < 211 > 27 < 212 > DNA < 213 > Artificial Sequence < 220 > < 223 > Description of Artificial Sequence: Oligonucleotide < 400 > 71 gaactgcaga ccctgcttgc cattccg 27 < 210 > 72 < 211 > 27 < 212 > DNA < 213 > Artificial Sequence < 220 > < 223 > Description of Artificial Sequence: Oligonucleotide < 400 > 72 catgcatgca taagcgtcga acaggcg 27 103

Claims (11)

  1. 29 CLAIMS 1. A peptide characterized in that it is encoded by an operon that includes any of the genes identified herein as ta tA, ta tB, ta tC, ta tE, mdoG, creC, recG, yggN, eckl, iroD, iroC, iroE , mtd2 and msl to 16, obtainable from E. coli Kl, or a homologue thereof, in a Gram negative bacterium, having at least 30% homology at the amino acid or nucleotide level, or a functional fragment of the same, for therapeutic use.
  2. 2. A peptide according to claim 1, characterized in that it comprises any of the amino acid sequences identified herein as SEQ ID NO. 2, 5, 7, 9, 11, 12, 13, 14, 16, 18, 19, 21, 23, 24, 25, 26, 28, 29, 31, 32 and 35-48.
  3. 3. A polynucleotide that encodes a peptide characterized in accordance with claim 1 or 2, characterized for therapeutic use.
  4. 4. A transformed host for expressing a peptide according to claim 1 or 2.
  5. 5. A vaccine characterized in that it comprises a peptide according to claim 1 or 2, or the means for its expression.
  6. 6. A vaccine characterized in that it comprises a microorganism having a virulence gene mutation, wherein the gene encodes a peptide according to the invention. claim 1 or 2.
  7. 7. A vaccine according to claim 6, characterized in that it has a suppression of virulence gene in two genes, where one gene encodes ta tA and the other encodes ta tE.
  8. 8. A vaccine according to claim 6, characterized in that the gene is within the pathogenicity island, wherein the island comprises a gene identified herein.
  9. 9. The use of a product according to any of claims 1 to 4, or SEQ ID NO. 33, to screen potential drugs or for the detection of virulence.
  10. The use of a product according to any of claims 1 to 4, characterized for the manufacture of a medicament for use in the treatment or prevention of a condition associated with infection by a Gram negative bacterium.
  11. 11. The use according to claim 10, characterized in that the bacterium is E. coli.
MXPA/A/2001/004558A 1998-11-09 2001-05-04 Virulence genes and proteins, and their use MXPA01004558A (en)

Applications Claiming Priority (10)

Application Number Priority Date Filing Date Title
GB9824569.9 1998-11-09
GB9824570.7 1998-11-09
GB9827818.7 1998-12-17
GB9827815.3 1998-12-17
GB9827816.1 1998-12-17
GB9827814.6 1998-12-17
GB9900710.6 1999-01-13
GB9900708.0 1999-01-13
GB9900711.4 1999-01-13
GB9901915.0 1999-01-28

Publications (1)

Publication Number Publication Date
MXPA01004558A true MXPA01004558A (en) 2002-06-05

Family

ID=

Similar Documents

Publication Publication Date Title
Law Adhesion and its role in the virulence of enteropathogenic Escherichia coli
Agin et al. Identification of a family of intimins common to Escherichia coli causing attaching-effacing lesions in rabbits, humans, and swine
US5631010A (en) Genetically stable cholera vaccines with deletions of ctxA, recA and attRS1
US20050142149A1 (en) Virulence genes and proteins, and their use
Fuller et al. Identification of Actinobacillus pleuropneumoniae virulence genes using signature-tagged mutagenesis in a swine infection model
US20110287052A1 (en) Recombinant bacterium capable of eliciting an immune response against streptococcus pneumoniae
Turner et al. Construction and characterization of genetically defined aro omp mutants of enterotoxigenic Escherichia coli and preliminary studies of safety and immunogenicity in humans
KR20010024650A (en) RECOMBINANT VACCINES COMPRISING IMMUNOGENIC ATTENUATED BACTERIA HAVING RpoS POSITIVE PHENOTYPE
Martin et al. The tryptophanase gene cluster of Haemophilus influenzae type b: evidence for horizontal gene transfer
KR100628657B1 (en) Bacteria attenuated by a non-reverting mutation in each of the AroC, OmpF and OmpC genes, useful as vaccines
KR20080080069A (en) Virulence genes, proteins, and their use
Moxon et al. Type b capsular polysaccharide as a virulence factor of Haemophilus influenzae
JP2000506368A (en) Live vaccine carrier strains useful for high-level expression of atypical O antigen from gram-negative bacteria and their derivatives for use as live vaccines
JP2005519872A (en) Salmonella vaccine
US20090298713A1 (en) Polynucleotides which are of nature b2/d+ a- and which are isolated from e. coli, and biological uses of these polynucleotides and of their polypeptides
Thomas et al. Regulation of tcp genes in classical and E1 Tor strains of Vibrio cholerae O1
MXPA01004558A (en) Virulence genes and proteins, and their use
JP2001522605A (en) Pathogen host membrane receptor, HP90, encoded by the pathogen TIR gene
Clarke et al. Galactose epimeraseless mutants of Salmonella typhimurium as live vaccines for calves.
AU2003236302B2 (en) Virulence genes and proteins, and their use
CA2359469A1 (en) Compositions and methods for treating and preventing pathogenic bacterial infection based on the essential role of dna methylation in bacterial virulence
AU2007200542A1 (en) Virulence genes and proteins, and their use
Fischetti et al. Effect of mucosal antibodies to M protein on colonization by group A streptococci
WO1994001562A1 (en) Bivalent living vaccines against bacterial intestinal pathogenic agents, process for preparing the same, plasmids and strains useful as base material
MXPA00009354A (en) Bacteria attenuated by a non-reverting mutation in each of the aroc, ompf and ompc genes, useful as vaccines