Patents
Search within the title, abstract, claims, or full patent document: You can restrict your search to a specific field using field names.
Use TI= to search in the title, AB= for the abstract, CL= for the claims, or TAC= for all three. For example, TI=(safety belt).
Search by Cooperative Patent Classifications (CPCs): These are commonly used to represent ideas in place of keywords, and can also be entered in a search term box. If you're searching forseat belts, you could also search for B60R22/00 to retrieve documents that mention safety belts or body harnesses. CPC=B60R22 will match documents with exactly this CPC, CPC=B60R22/low matches documents with this CPC or a child classification of this CPC.
Learn MoreKeywords and boolean syntax (USPTO or EPO format): seat belt searches these two words, or their plurals and close synonyms. "seat belt" searches this exact phrase, in order. -seat -belt searches for documents not containing either word.
For searches using boolean logic, the default operator is AND with left associativity. Note: this means safety OR seat belt is searched as (safety OR seat) AND belt. Each word automatically includes plurals and close synonyms. Adjacent words that are implicitly ANDed together, such as (safety belt), are treated as a phrase when generating synonyms.
Learn MoreChemistry searches match terms (trade names, IUPAC names, etc. extracted from the entire document, and processed from .MOL files.)
Substructure (use SSS=) and similarity (use ~) searches are limited to one per search at the top-level AND condition. Exact searches can be used multiple times throughout the search query.
Searching by SMILES or InChi key requires no special syntax. To search by SMARTS, use SMARTS=.
To search for multiple molecules, select "Batch" in the "Type" menu. Enter multiple molecules separated by whitespace or by comma.
Learn MoreSearch specific patents by importing a CSV or list of patent publication or application numbers.
Chlamydia pneumoniae genome sequence
CA2350775A1
Canada
- Other languages
French - Inventor
Richard Stephens Wayne Mitchell Sue Kalman Ronald Davis - Current Assignee
- University of California San Diego UCSD
Description
translated from
COMPREND PLUS D'UN TOME.
CECI EST LE TOME _ ~'DE c1 NOTE. Pour les tomes additionels, veuillez contacter le Bureau canadien des brevets THIS SECTION OF THE APPUCATION/PATENT CONTAINS MORE
THAN ONE VOLUME
THIS IS VOLUME -O>=
- . -WOTE_ For additional volumes please contact'the Canadian Patent Offfice :~. ..
CHLAMYDIA PNEUMONIAE GENOME SEQUENCE
CROSS-REFERENCES TO RELATED APPLICATIONS
The present application is related to 60/128,606, filed April 8, 1999 and 60/108,279, filed November 12, 1998, which are incorporated herein by reference.
STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER
FEDERALLY SPONSORED RESEARCH AND DEVELOPMENT
FIELD OF THE INVENTION
This invention relates to nucleic acids and polypeptides from Chlamydia pneumoniae and to their use in the diagnosis, prevention and treatment of diseases associated with C. pneumoniae.
BACKGROUND OF THE INVENTION
Chlamydiaceae is a family of obligate intracellular parasite with a tropism for epithelial cells lining the mucus membranes. The bacteria have two morphologically distinct forms, "elementary body" and "reticulate body". The elementary body is the infectious form, and has a rigid cell wall, primarily of crass-linked outer membrane proteins. The reticulate body is the intracellular, metabolically active form.
A unique developmental cycle between these two forms characterizes Chlamydia growth.
C. pneumoniae is a human respiratory pathogen that causes acute respiratory disease, and approximately 10% of community-acquired pneumonia.
Antibody prevalence studies have shown that virtually everyone is infected with C.
pneumoniae at some time, and that reinfection is common. In addition to respiratory disease, studies have shown an association of this organism with coronary artery disease.
It has been demonstrated in atherosclerotic lesions of the aorta and coronary arteries by immunocytochemistry and by polymerase chain reaction (Kuo et al. (1993) J
Infect Dis 167(4):841-849).
Recent reports have further demonstrated the presence of C. pneumoniae in the walls of abdominal aortic aneurysms (Juvonen et al. (1997) J Vasc Sure 25(3):499-505). Abdominal aortic aneurysms are frequently associated with atherosclerosis, and inflammation may be an important factor in aneurysmal dilatation.
C. pneumoniae may play a role in maintaining an inflammation and triggering the development of aortic aneurysms.
Muhlestein et al. (1996) JACC 27:1555-61, reported a differential incidence of Chlamydia species within the coronary artery wall of patients with ~ atherosclerosis versus those with other forms of cardiovascular disease. The extremely high rate of possible infection in patients with symptomatic atherosclerotic disease compared to the very low rate in patients with normal coronary arteries or coronary artery disease from chronic transplant rejection provides evidence for a direct link between the atherosclerotic process and Chlamydia infection. Because a history of chlamydial infection is so prevalent in the population, the issue of causality remains.
On a physiologic and pathologic level, abnormal interactions among endothelial cells, platelets, macrophages and lymphocytes may lead to a cascade of events resulting in acute endothelial damage, thrombosis and repair, chronically leading to the development of atheroma in blood vessels.
C. pneumoniae is related to other Chlamydia species, but the level of sequence similarity is relatively low. Very little is known about the biology of this organism, although it appears to be an important human pathogen. Allelic diversity and structural relationships between specific genes of Chlamydial species is described in Kaltenboeck et al. (1993) J Bacteriol 175(2):487-502; Gaydos et al. (1992) Infect Immun 60{12):5319-5323; Everett et al. (1997) Int J Syst Bacteriol 47(2):461-473;
and Pudjiatmoko et al. (1997) Int J Syst Bacteriol 47(2):425-431.
A number of studies have been published describing methods for detection of C. pneumoniae, and for distinguishing between Chlamydial species. Such methods include PCR detection (Rasmussen et al. (1992) ~Vlol Cell Probes 6(5):389-394;
Holland et al. (1990) J Infect Dis 162(4):984-987); a simplified polymerase chain reaction-enzyme immunoassay (Wilson et al. (1996) J Appl Bacteriol 80(4):431-438); sequence determination and restriction endonuclease cleavage (Herrmann et al. (1996) J
lin Micro io134(8):1897-1902).
Antigenic and molecular analyses of different C. pneumoniae strains is described in 3antos et al. (1997) J Clin Microbiol 35(3):620-623. Some genes of C.
pneumoniae have been isolated and sequenced. These include the Gro E operon (Kikuta et al. { i 99I ) Infect Immun 59( 12):4665-4669); the major outer membrane protein Perez et al. ( 1991 ) Infect Immun 59(6):2195-2199; the DnaK protein homolog (Kornak et al.
(1991) Infect Immun 59(2):721-725); as well as a number of ribosomal and other genes.
SUMMARY OF THE IIWENTION
This invention provides the genomic sequence of Chlamydia pneumoniae.
The sequence information is useful for a variety of diagnostic and analytical methods.
The genomic sequence may be embodied in a variety of media, including computer readable forms, or as a nucleic acid comprising a selected fragment of the sequence.
Such fragments generally consist of an open reading frame, transcriptional or translational control elements, or fragments derived therefrom. Proteins encoded by the open reading frames are useful for diagnostic purposes, as well as for their enzymatic or structural activity.
DEFIhIITIONS
The term "amino acid" refers to naturally occurring and synthetic amino acids, as well as amino acid analogs and amino acid mimetics that function in a manner similar to the naturally occurring amino acids. Naturally occurring amino acids are those encoded by the genetic code, as well as those amino acids that are later modified, e.g., hydroxyproline, 'y-carboxyglutamate, and 0-phosphoserine. Amino acid analogs refers to compounds that have the same basic chemical structure as a naturally occurring amino acid, i.e., an a carbon that is bound to a hydrogen, a carboxyl group, an amino group, and an R group., e.g., homoserine, norleucine, methionine sulfoxide, methionine methyl sulfonium Such analogs have modified R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic chemical structure as a naturally occurring amino acid. Amino acid mimetics refers to chemical compounds that have a structure that is different from the general chemical structure of an amino acid, but that functions in a manner similar to a naturally occurring amino acid.
Amino acids may be referred to herein by either their commonly known three letter symbols or by the one-letter symbols recommended by the ILTPAC-ILJB
Biochemical Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly accepted single-letter codes.
"Antibody" refers to an immunoglobulin molecule able to bind to a specific epitope on an antigen. Antibodies can be a polyclonal mixture or monoclonal.
Antibodies can be intact immunoglobulins derived from natural sources or from recombinant sources and can be immunoreactive portions of intact immunoglobulins.
Antibodies may exist in a variety of forms including, for example, Fv, Fab, and F(ab)Z, as well as in single chains. Single-chain antibodies, in which genes for a heavy chain and a light chain are combined into a single coding sequence, may also be used.
An "antigen" is a molecule that is recognized and bound by an antibody, e.g., peptides, carbohydrates, organic molecules, or more complex molecules such as glycolipids and glycoproteins. The part of the antigen that is the target of antibody binding is an antigenic determinant and a small functional group that corresponds to a single antigenic determinant is called a hapten.
"Biological sample" refers to any sample obtained from a living or dead organism. Examples of biological samples include biological fluids and tissue specimens.
Such biological samples can be prepared for analysis of the presence of C.
pneumoniae nucleic acids, proteins, or antibodies specifically reactive with the proteins.
The term "C. pneumoniae gene" shall be intended to mean the open reading frame encoding specific C. pneumoniae polypeptides, as well as adjacent 5' and 3' non-coding nucleotide sequences involved in the regulation of expression, up to about 2 kb beyond the coding region, but possibly further in either direction. The gene may be introduced into an appropriate vector for extrachromosomal maintenance or for integration into a host genome.
"Conservatively modified variants" applies to both amino acid and nucleic acid sequences. With respect to particular nucleic acid sequences, conservatively modified variants refers to those nucleic acids which encode identical or essentially identical amino acid sequences, or where the nucleic acid does not encode an amino acid sequence, to essentially identical sequences. Specifically, degenerate codon substitutions may be achieved by generating sequences in which the third position of one or more selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues (Batter et al., Nucleic Acid Res. 19:5081 (1991); Ohtsuka et al., J. Biol.
Chem. 260:2605-2608 (1985); Rossolini et al., Mol. Cell. Probes 8:91-98 (1994)). Because of the degeneracy of the genetic code, a large number of functionally identical nucleic acids encode any given protein. For instance, the codons GCA, GCC, GCG and GCU all encode the amino acid alanine. Thus, at every position where an alanine is specified by a codon, the codon can be altered to any of the corresponding codons described without altering the encoded polypeptide. Such nucleic acid variations are "silent variations,"
which are one species of conservatively modified variations. Every nucleic acid sequence herein which encodes a polypeptide also describes every possible silent variation of the nucleic acid. One of skill will recognize that each codon in a nucleic acid (except AUG, which is ordinarily the «nly codon for methionine, and TGG, which is ordinarily the only codon for tryptophan) can be modified to yield a functionally identical molecule.
Accordingly, each silen: variation of a nucleic acid which encodes a polypeptide is implicit in each describ :d sequence.
As to amino acid sequences, one of skill will recognize that individual substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein sequence which alters, adds or deletes a single amino acid or a small percentage of amino acids in the encoded sequence is a "conservatively modified variant" where the alteration results in the substitution of an amino acid with a chemically similar amino acid.
Conservative substitution tables providing functionally similar amino acids are well known in the art. Such conservatively modified variants are in addition to and do not exclude polymorphic variants, interspecies homologs, and alleles of the invention.
The following groups each contain amino acids that are conservative substitutions for one another:
1 ) Alanine (A), Glycine (G);
2) Serine (S), Threonine (T);
3) Aspartic acid (D), Glutamic acid (E);
4) Asparagine (N), Glutamine (Q);
see, e.g., Creighton, Proteins (1984)).
The terms "identical" or percent "identity," in the context of two or more nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that are the same or have a specified percentage of amino acid residues or nucleotides that are the same, when compared and aligned for maximum correspondence over a comparison window, as measured using one of the following sequence comparison algorithms or by manual alignment and visual inspection. This definition also refers to the complement of a test sequence, which has a designated percent sequence or subsequence complementarity when the test sequence has a designated or substantial identity to a reference sequence. For example, a designated amino acid percent identity of 95% refers to sequences or subsequences that have at least about 95% amino acid identity when aligned for maximum correspondence over a comparison window as measured using one of the following sequence comparison algorithms or by manual alignment and visual inspection. Such sequences would then be said to have substantial identity, or to be substantially identical to each other. Preferably, sequences have at least about 70% identity, more preferably 80% identity, more preferably 90-95%
identity and above. Preferably, the percent identity exists over a region of the sequence that is at least about 25 amino acids in length, more preferably over a region that is 50-100 amino acids in length.
When percentage of sequence identity is used in reference to proteins or peptides, it is recognized that residue positions that are not identical often differ by conservative amino acid substitutions, where amino acids residues are substituted for other amino acid residues with similar chemical properties (e.g., charge or hydrophobicity) and therefore do not change the functional properties of the molecule.
Where sequences differ in conservative substitutions, the percent sequence identity may be adjusted upwards to correct for the conservative nature of the substitution. Means for making this adjustment are well known to those of skill in the art. Typically this involves scoring a conservative substitution as a partial rather than a full mismatch, thereby increasing the percentage sequence identity. Thus, for example, where an identical amino acid is given a score of l and a non-conservative substitution is given a score of zero, a conservative substitution is given a score between zero and 1. The scoring of conservative substitutions is calculated according to, e.g., the algorithm of Meyers &
Miller, Computer Applic. Biol. Sci. 4:11-17 {1988) e.g., as implemented in the program PCIGENE (Intelligenetics, Mountain View, California, USA)..
For sequence comparison, typically one sequence acts as a reference sequence, to which test sequences are compared. When using a sequence comparison algorithm, test and reference sequences are entered into a computer, subsequence coordinates are designated, if necessary, and sequence algorithm program parameters are designated. Default program parameters can be used, or alternative parameters can be designated. The sequence comparison algorithm then calculates the percent sequence identity for the test sequences) relative to the reference sequence, based on the designated or default program parameters.
A comparison window includes reference to a segment of any one of the number of contiguous positions selected from the group consisting of from 25 to 600, usually about 50 to about 200, more usually about 100 to about 150 in which a sequence may be compared to a reference sequence of the same number of contiguous positions after the two sequences are optimally aligned. Methods of alignment of sequences for comparison are well-known in the art. Optimal alignment of sequences for comparison can be conducted, e.g., by the local homology algorithm of Smith & Watetman, Adv.
Appl. Math. 2:482 ( 1981 ), by the homology alignment algorithm of Needleman &
Wunsch, J. Mol. Biol. 48:443 (1970), by the search for similarity method ofPearson &
Lipman, Proc. Nat'l. Acad. Sci. USA 85:2444 (1988), by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, WI), or by manual alignment and visual inspection (see, e.g., Ausubel et al., supra).
One example of a useful algorithm is PILEUP. PILEUP creates a multiple sequence alignment from a group of related sequences using progressive, patrwise alignments to show relationship and percent sequence identity. It also plots a tree or dendogram showing the clustering relationships used to create the alignment.
PILEUP
uses a simplification of the progressive alignment method of Feng & Doolittle, J. Mol.
Evol. 35:351-360 (1987). The method used is similar to the method described by Higgins & Sharp, CABIOS 5:151-153 (1989). The program can align up to 300 sequences;
each of a maximum length of 5,000 nucleotides or amino acids. The multiple alignment procedure begins with the pairwise alignment of the two most similar sequences, producing a cluster of two aligned sequences. This cluster is then aligned to the next most related sequence or cluster of aligned sequences. Two clusters of sequences are aligned by a simple extension of the pairwise alignment of two individual sequences. The final alignment is achieved by a series of progressive, pairwise alignments.
The program is run by designating specific sequences and their amino acid or nucleotide coordinates for regions of sequence comparison and by designating the program parameters.
Using PILEUP, a reference sequence is compared to other test sequences to determine the percent sequence identity relationship using the following parameters: default gap weight (3.00), default gap length weight (0.10), and weighted end gaps. PILEUP can be obtained from the GCG sequence analysis software package, e.g, version 7.0 (Devereaux et al., Nuc. Acids Res. 12:387-395 (1984).
Another example of algorithm that is suitable for determining percent sequence identity (i.e., substantial similarity or identity) is the BLAST
algorithm, which is described in Altschul et al., J. Mol. Biol. 215:403-410 (1990). Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.govn. This algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short words of length W in the query sequence, which either match or satisfy some positive-valued threshold score T
when aligned with a word of the same length in a database sequence. T is referred to as the neighborhood word score threshold (Altschul et al, supra). These initial neighborhood word hits act as seeds for initiating searches to find longer HSPs containing them. The word hits are then extended in both directions along each sequence for as far as the cumulative alignment score can be increased. Cumulative scores are calculated using, for nucleotide sequences, the parameters M (reward score for a pair of matching residues;
always > 0) and N (penalty score for mismatching residues, always < 0). For amino acid sequences, a scoring matrix is used to calculate the cumulative score.
Extension of the word hits in each direction are halted when: the cumulative alignment score falls off by the quantity X from its maximum achieved value; the cumulative score goes to zero or below, due to the accumulation of one or more negative-scoring residue alignments; or the end of either sequence is reached. The BLAST algorithm parameters W, T, and X
determine the sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) uses as defaults a wordlength (W) of 11, an expectation (E) of 10, M=5, N=4, and a comparison of both strands. For amino acid sequences, the BLASTP
program uses as default parameters a wordlength (W) of 3, an expectation (E) of 10, and the BLOSLTM62 scoring matrix (see Henikoff & Henikoff, Proc. Natl. Acad. Sci.
USA
89:10915 ( 1989)).
The BLAST algorithm also performs a statistical analysis of the similarity between two sequences (see, e.g., Karlin & Altschul, Proc. Nat'l. Acad. Sci.
USA
90:5873-5787 (1993)). One measure of similarity provided by the BLAST
algorithm is the smallest sum probability (P(N)), which provides an indication of the probability by which a match between two nucleotide or amino acid sequences would occur by chance.
For example, a nucleic acid is considered similar to a reference sequence if the smallest sum probability in a comparison of the test nucleic acid to the reference nucleic acid is less than about 0.1, more preferably less than about 0.01, and most preferably less than about 0.001.
L O An indication that two nucleic acid sequences or polypeptides are substantially identical is that the polypeptide encoded by the first nucleic acid is immunologically cross ,-eactive with the antibodies raised against the polypeptide encoded by the second nucleic acid, as described below. Thus, a polypeptide is typically substantially identical to a second polypeptide, for example, where the two peptides differ I S only by conservative suostitutions. Another indication that two nucleic acid sequences are substantially identical is that the two molecules or their complements hybridize to each other under stringent conditions, as described below.
Another indication that polynucleotide sequences are substantially identical is if two molecules hybridize to each other under stringent conditions. Stringent 20 conditions are sequence dependent and will be different in different circumstances.
Generally, stringent conditions are selected to be about 5°C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the temperature (under defined ionic strength and pH) at which 50% of the target sequence hybridizes to a perfectly matched probe. Typically stringent conditions for a 25 Southern blot protocol involve hybridizing in a buffer comprising Sx SSC, 1% SDS at 65°C or hybridizing in a buffer containing Sx SSC and 1% SDS at 42°C and washing at 65°C with a 0.2x SSC, 0.1% SDS wash.
A "label" is a composition detectable by spectroscopic, photochemical, biochemical, immunochemical, or chemical means. For example, useful labels include 30 3zP, Iluorescent dyes, electron-dense reagents, enzymes (e.g., as commonly used in an ELISA), biotin, dioxigenin, or haptens and proteins for which antisera or monoclonal antibodies are available.
Unless otherwise indicated, a particular nucleic acid sequence also implicitly encompasses conservatively modified variants thereof (e.g., degenerate codon substitutions) and complementary sequences, as well as the sequence explicitly indicated.
The term nucleic acid is used interchangeably with gene, cDNA, mRNA, oligonucleotide, and polynucleotide.
As used herein a "nucleic acid probe or oligonucleotide" is defined as a nucleic acid capable of binding to a target nucleic acid of complementary sequence through one or more types of chemical bonds, usually through complementary base pairing, usually through hydrogen bond formation. As used herein, a probe may include natural (i.e., A, G, C, or T) or modified bases (7-deazaguanosine, inosine, etc.). In addition, the bases in a probe may be joined by a linkage other than a phosphodiester bond, so long as it does not interfere with hybridization. Thus, for example, probes may be peptide nucleic acids in which the constituent bases are joined by peptide bonds rather than phosphodiester linkages. It will be understood by one of skill in the art that probes may bind target sequences lacking complete complementarity with the probe sequence depending upon the stringency of the hybridization conditions. The probes are preferably directly labeled as with isotopes, chromophores, lumiphores, chromogens, or indirectly labeled such as with biotin to which a streptavidin complex may later bind. By assaying for the presence or absence of the probe, one can detect the presence or absence of the select sequence or subsequence.
A labeled nucleic acid probe or oligonucleotide is one that is bound, either covalently, through a linker, or through ionic, van der Waals or hydrogen bonds to a label such that the presence of the probe may be detected by detecting the presence of the label bound to the probe.
"Pharmaceutically acceptable" means a material that is not biologically or otherwise undesirable, i.e., the material can be administered to an individual along with a Chlamydia antigen without causing any undesirable biological effects or interacting in a deleterious manner with any of the other components of the pharmaceutical composition.
The terms "polypeptide," "peptide" and "protein" are used interchangeably herein to refer to a polymer of amino acid residues. The terms apply to amino acid polymers in which one or more amino acid residue is an analog or mimetic of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers.
The phrase "specifically or selectively hybridizing to," refers to hybridization between a probe and a target sequence in which the probe binds substantially only to the target sequence, forming a hybridization complex, when the target is in a heterogeneous mixture of polynucleotides and other compounds.
Such hybridization is determinative of the presence of the target sequence.
Although the probe may bind other unrelated sequences, at least 90%, preferably 95% or more of the hybridization complexes formed are with the target sequence.
The term "recombinant" when used with reference to a cell, or nucleic acid, or vector, indicates that the cell, or nucleic acid, or vector, has been modified by the introduction of a heterologous nucleic acid or the alteration of a native nucleic acid, or that the cell is derived from a cell so modified. Thus, for example, recombinant cells express genes that are not found within the native (non-recombinant) form of the cell or express native genes that are otherwise abnormally expressed, under expressed or not expressed at all.
The phrase "specifically immunoreactive with", when referring to a protein or peptide, refers to a binding reaction between the protein and an antibody which is determinative of the presence of the protein in the presence of a heterogeneous population of proteins and other compounds. Thus, under designated immunoassay conditions, the specified antibodies bind to a particular protein and do not bind in a significant amount to other proteins present in the sample. Specific binding to an antibody under such conditions may require an antibody that is selected for its specificity for a particular protein. A variety of immunoassay formats may be used to select antibodies specifically immunoreactive with a particular protein and are described in detail below.
The phrase "substantially pure" or "isolated" when referring to a Chlamydia peptide or protein, means a chemical composition which is free of other subcellular components of the Chlamydia organism. Typically, a monomeric protein is substantially pure when at least about 85% or more of a sample exhibits a single polypeptide backbone. Minor variants or chemical modifications may typically share the same polypeptide sequence. Depending on the purification procedure, purities of 85%, and preferably over 95% pure are possible. Protein purity or homogeneity may be indicated by a number of means well known in the art, such as polyacrylamide gel electrophoresis of a protein sample, followed by visualizing a single polypeptide band on a polyacrylamide gel upon silver staining. For certain purposes high resolution will be needed and HPLC or a similar means for purification utilized.
DETAILED DESCRIPTION
The present invention provides the nucleotide sequence of the C.
pneumoniae genome SEQ ID NO: 1 or a representative fragment thereof, in a form which can be readily used, analyzed, and interpreted by a skilled artisan. As used herein, a "representative fragment" of the nucleotide sequence depicted in SEQ ID NO: 1 refers to any portion which is not presently represented within a publicly available database.
Preferred representative fragments of the present invention are open reading frames, expression modulating fragments, uptake modulating fragments, and fragments which can be used to diagnose the presence of G pneumoniae in sample. Using the information provided in the present application, together with routine cloning and sequencing methods, one of ordinary skill in the art will be able to clone and sequence all "representative fragments" of interest including open reading frames (ORFs) encoding a large variety of C. pneumoniae proteins. A non-limiting identification of such preferred representative fragments is provided in Tables 2 and 3.
Diasnostic use of C pneumoniae nucleic acids Hybridization-based assays Using the nucleic acids disclosed here, one of skill can design nucleic acid hybridization-based assays for the detection of C. pneumoniae. Any of a number of well known techniques for the specific detection of target nucleic acids can be used.
Exemplary hybridization-based assays include, but are not limited to, traditional "direct probe" methods such as Southern Blots, dot blots, in situ hybridization (e.g., FISH), PCR, and the like. The methods can be used in a wide variety of formats including, but not limited to substrate- (e.g. membrane or glass) bound methods or array-based approaches as described below. As noted above, this invention also embraces methods for detecting the presence of Chlamydia DNA or RNA in biological samples. These sequences can be used to detect Chlamydia in biological samples from patients suspected of being infected.
A variety of methods of specific DNA and RNA measurement using nucleic acid hybridization techniques are known to those of skill in the art (see Sambrook et al., supra).
In situ hybridization assays are well known (e.g., Angerer {1987) Meth.
Enrymol 152: 649). Generally, in situ hybridization comprises the following major steps:
(1) fixation of tissue or l;~iological structure to analyzed; (2) prehybridization treatment of the biological structure t ~ increase accessibility of target DNA, and to reduce nonspecific binding; (3) hybridizatic n of the mixture of nucleic acids to the nucleic acid in the biological structure or tissue; (4) post-hybridization washes to remove nucleic acid fragments not bound in the hybridization and (5) detection of the hybridized nucleic acid fragments. The reagent used in each of these steps and the conditions for use vary depending on the particular application.
In a typical in situ hybridization assay, cells are fixed to a solid support, typically a glass slide. If a nucleic acid is to be probed, the cells are typically denatured with heat or alkali. The cells are then contacted with a hybridization solution at a moderate temperature to permit annealing of labeled probes specific to the nucleic acid sequence encoding the protein. The targets (e.g., cells) are then typically washed at a predetermined stringency or at an increasing stringency until an appropriate signal to noise ratio is obtained.
The nucleic acids of this invention are particularly well suited to array-based hybridization formats. Arrays are a multiplicity of different "probe" or "target"
nucleic acids (or other compounds) attached to one or more surfaces (e.g., solid, membrane, or gel). In a preferred embodiment, the multiplicity of nucleic acids (or other moieties) is attached to a single contiguous surface or to a multiplicity of surfaces juxtaposed to each other.
In an array format a large number of different hybridization reactions can be run essentially "in parallel." This provides rapid, essentially simultaneous, evaluation of a number of hybridizations in a single "experiment". Methods of performing hybridization reactions in array based formats are well known to those of skill in the art (see, e.g., Pastinen (1997) Genome Res. 7: 606-614; Jackson (1996) Nature Biotechnology 14:1685; Chee (1995) Science 274: 610; WO 96/17958.
Arrays, particularly nucleic acid arrays can be produced according to a wide variety of methods well known to those of skill in the art. For example, in a simple embodiment, "low density" arrays can simply be produced by spotting (e.g. by hand using a pipette) different nucleic acids at different locations on a solid support (e.g. a glass surface, a membrane, etc.).
This simple spotting, approach has been automated to produce high density spotted arrays (see, e.g., U.S. Patent No: 5,807,522). This patent describes the use of an automated systems that taps a microcapillary against a surface to deposit a small volume of a biological sample. The process is repeated to generate high density arrays.
Arrays can also be produced using oligonucleotide synthesis technology. Thus, for example, U.S. Patent No. 5,143,854 and PCT patent publication Nos. WO 90/15070 and 92/10092 teach the use of light-directed combinatorial synthesis of high density oligonucleotide arrays.
Many methods for immobilizing nucleic acids on a variety of solid surfaces are known in the art. A wide variety of organic and inorganic polymers, as well as other materials, both natural and synthetic, can be employed as the material for the solid surface. Illustrative solid surfaces include, e.g., nitrocellulose, nylon, glass, quartz, diazotized membranes (paper or nylon), silicones, polyformaldehyde, cellulose, and cellulose acetate. In addition, plastics such as polyethylene, polypropylene, polystyrene, and the like can be used. Other materials which may be employed include paper, ceramics, metals, metalloids, semiconductive materials, cermets or the like.
In addition, substances that form gels can be used. Such materials include, e.g., proteins (e.g., gelatins), lipopolysaccharides, silicates, agarose and polyacrylamides. Where the solid surface is porous, various pore sizes may be employed depending upon the nature of the system.
In preparing the surface, a plurality of different materials may be employed, particularly as laminates, to obtain various properties. For example, proteins (e.g., bovine serum albumin) or mixtures of macromolecules (e.g., Denhardt's solution) can be employed to avoid non-specific binding, simplify covalent conjugation, enhance signal detection or the like. If covalent bonding between a compound and the surface is desired, the surface will usually be polyfunctional or be capable of being polyfunctionalized. Functional groups which may be present on the surface and used for linking can include carboxylic acids, aidehydes, amino groups, cyano groups, ethylenic groups, hydroxyl groups, mercapto groups and the like. The manner of linking a wide variety of compounds to various surfaces is well known and is amply illustrated in the literature.
For example, methods for immobilizing nucleic acids by introduction of various functional groups to the molecules is known (see, e.g., Bischoff (1987) Anal.
Biochem., 164: 336-344; Kremsky {1987) Nucl. Acids Res. 15: 2891-2910).
Modified nucleotides can be placed on the target using PCR primers containing the modified nucleotide, or by enzymatic end labeling with modified nucleotides. Use of glass or membrane supports (e.g., nitrocellulose, nylon, polypropylene) for the nucleic acid arrays of the invention is advantageous because of well developed technology employing manual and robotic methods of arraying targets at relatively high element densities. Such membranes are generally available and protocols and equipment for hybridization to membranes is well known.
Target elements of various sizes, ranging from 1 mm diameter down to 1 p,m can be used. Smaller target elements containing low amounts of concentrated, fixed probe DNA are used for high complexity comparative hybridizations since the total amount of sample available for binding to each target element will be limited.
Thus it is advantageous to have small array target elements that contain a small amount of concentrated probe DNA so that the signal that is obtained is highly localized and bright.
Such small array target elements are typically used in arrays with densities greater than 104/cmz. Relatively simple approaches capable of quantitative fluorescent imaging of 1 cmz areas have been described that permit acquisition of data from a large number of target elements in a single image (see, e.g., Wittrup (1994) Cytometry 16:206-213).
If fluorescently labeled nucleic acid samples are used, arrays on solid surface substrates with much lower fluorescence than membranes, such as glass, quartz, or small beads, can achieve much better sensitivity. Substrates such as glass or fused silica are advantageous in that they provide a very low fluorescence substrate, and a highly efficient hybridization environment. Covalent attachment of the target nucleic acids to glass or synthetic fused silica can be accomplished according to a number of known techniques (described above). Nucleic acids can be conveniently coupled to glass using commercially available reagents. For instance, materials for preparation of silanized glass with a number of functional groups are commercially available or can be prepared using standard techniques (see, e.g., Gait ( 1984) Oligonucleotide Synthesis: A
~ Practical Approach, IRL Press, Wash., D.C.). Quartz cover slips, which have at least 10-fold lower autofluorescence than glass, can also be silanized.
Alternatively, probes can also be immobilized on commercially available coated beads or other surfaces. For instance, biotin end-labeled nucleic acids can be bound to commercially available avidin-coated beads. Streptavidin or anti-digoxigenin antibody can also be attached to silanized glass slides by protein-mediated coupling using e.g., protein A following standard protocols (see, e.g., Smith (1992) Science 258: 1122-1126). Biotin or digoxigenin end-labeled nucleic acids can be prepared according to standard techniques. Hybridization to nucleic acids attached to beads is accomplished by suspending them in the hybridization mix, and then depositing them on the glass substrate for analysis after washing. Alternatively, paramagnetic particles, such as ferric oxide particles, with or without avidin coating, can be used.
A variety of other nucleic acid hybridization formats are known to those skilled in the art. For example, common formats include sandwich assays and competition or displacement assays. Hybridization techniques are generally described in Hames and Higgins (1985) Nucleic Acid Hybridization, A Practical Approach, IRL
Press;
Gall and Pardue (1969) Proc. Natl. Acad. Sci. USA 63: 378-383; and John et al.
(1969) Nature 223: 582-587.
Sandwich assays are commercially useful hybridization assays for detecting or isolating nucleic acid sequences. Such assays utilize a "capture"
nucleic acid covalently immobilized to a solid support and a labeled "signal" nucleic acid in solution.
The sample will provide the target nucleic acid. The "capture" nucleic acid and "signal"
nucleic acid probe hybridize with the target nucleic acid to form a "sandwich"
hybridization complex. To be most effective, the signal nucleic acid should not hybridize with the capture nucleic acid.
Detection of a hybridization complex may require the binding of a signal generating complex to a duplex of target and probe polynucleotides or nucleic acids.
Typically, such binding occurs through ligand and anti-ligand interactions as between a ligand-conjugated probe and an anti-ligand conjugated with a signal.
The sensitivity of the hybridization assays may be enhanced through use of a nucleic acid amplification system that multiplies the target nucleic acid being detected.
Examples of such systems include the polymerise chain reaction (PCR) system and the ligase chain reaction (LCR) system. Other methods recently described in the art are the nucleic acid sequence based amplification (NASBAO, Cangene, Mississauga, Ontario) and Q Beta Replicase systems.
Nucleic acid hybridization simply involves providing a denatured probe and target nucleic acid under conditions where the probe and its complementary target can form stable hybrid duplexes through complementary base pairing. The nucleic acids that do not form hybrid duplexes are then washed away leaving the hybridized nucleic acids to be detected, tyl:ically through detection of an attached detectable label. It is generally recognized that nucleic acids are denatured by increasing the temperature or decreasing the salt concentration of the buffer containing the nucleic acids, or in the addition of chemical agents, or the raising of the pH. Under low stringency conditions (e.g., low temperature and/or high salt and/or high target concentration) hybrid duplexes {e.g., DNA:DNA, RNA:RNA, or RNA:DNA) will form even where the annealed sequences are not perfectly complementary. Thus specificity of hybridization is reduced at lower stringency. Conversely, at higher stringency (e.g., higher temperature or lower salt) successful hybridization requires fewer mismatches.
One of skill in the art will appreciate that hybridization conditions may be selected to provide any degree of stringency. In a preferred embodiment, hybridization is performed at low stringency to ensure hybridization and then subsequent washes are performed at higher stringency to eliminate mismatched hybrid duplexes.
Successive washes may be performed at increasingly higher stringency (e.g., down to as low as 0.25 X SSPE-T at 37°C to 70°C) until a desired level of hybridization specificity is obtained.
Stringency can also be increased by addition of agents such as formamide.
Hybridization specificity may be evaluated by comparison of hybridization to the test probes with hybridization to the various controls that can be present.
In general, there is a tradeoff between hybridization specificity (stringency) and signal intensity. Thus, in a preferred embodiment, the wash is performed at the highest stringency that produces consistent results and that provides a signal intensity greater than approximately 10% of the background intensity. Thus, in a preferred embodiment, the hybridized array may be washed at successively higher stringency solutions and read between each wash. Analysis of the data sets thus produced will reveal a wash stringency above which the hybridization pattern is not appreciably altered and which provides adequate signal for the particular probes of interest.
Methods of optimizing hybridization conditions are well known to those of skill in the art (see, e.g., Tijssen (1993) Laboratory Techniques in Biochemistry and Molecular Biology, Vol. 24: Hybridization With Nucleic Acid Probes, Elsevier, N.Y.).
_LabelinQ and detection of nucleic acids.
In a preferred embodiment, the hybridized nucleic acids are detected by detecting one or more labels attached to the sample or probe nucleic acids.
The labels may be incorporated by any of a number of means well known to those of skill in the art.
Means of attaching labels to nucleic acids include, for example nick translation or end-labeling (e.g. with a labeled RNA) by kinasing of the nucleic acid and subsequent attachment (ligation) of a nucleic acid linker joining the sample nucleic acid to a label (e.g., a fluorophore). A wide variety of linkers for the attachment of labels to nucleic acids are also known. In addition, intercalating dyes and fluorescent nucleotides can also be used.
Detectable labels suitable for use in the present invention include any composition detectable by spectroscopic, photochemical, biochemical, immunochemical, electrical, optical or chemical means. Useful labels in the present invention include biotin for staining with labeled streptavidin conjugate, magnetic beads (e.g., Dynabeads~), fluorescent dyes (e.g., fluorescein, texas red, rhodamine, green fluorescent protein, and the like, see, e.g., Molecular Probes, Eugene, Oregon, USA), radiolabels (e.g., 3H, lzsh 355,''~C, or 32P), enzymes (e.g., horse radish peroxidase, alkaline phosphatase and others commonly used in an ELISA), and colorimetric labels such as colloidal gold (e.g., gold particles in the 40 -80 nm diameter size range scatter green light with high efficiency) or colored glass or plastic (e.g., polystyrene, polypropylene, latex, etc.) beads. Patents teaching the use of such labels include U.S. Patent Nos. 3,817,837; 3,850,752;
3,939,350;
3,996,345; 4,277,437; 4,275,149; and 4,366,241.
A fluorescent label is preferred because it provides a very strong signal with low background. It is also optically detectable at high resolution and sensitivity through a quick scanning procedure. The nucleic acid samples can all be labeled with a single label, e.g., a single fluorescent label. Alternatively, in another embodiment, different nucleic acid samples can be simultaneously hybridized where each nucleic acid sample has a different label. For instance, one target could have a green fluorescent label and a second target could have a red fluorescent label. The scanning step will distinguish cites of binding of the red label from those binding the green fluorescent label. Each nucleic acid sample (target nucleic acid) can be analyzed independently from one another.
Suitable chromogens which can be employed include those molecules and compounds which absorb light in a distinctive range of wavelengths so that a color can be observed or, alternatively, which emit light when irradiated with radiation of a particular '- wave length or wave length range, e.g., fluorescers.
Desirably, fluorescers should absorb light above about 300 nm, preferably about 350 nm, and more preferably above about 400 nm, usually emitting at wavelengths greater than about 10 nm higher than the wavelength of the light absorbed. It should be noted that the absorption and emission characteristics of the bound dye can differ from the unbound dye. Therefore, when referring to the various wavelength ranges and characteristics of the dyes, it is intended to indicate the dyes as employed and not the dye which is unconjugated and characterized in an arbitrary solvent.
Fluorescers are generally preferred because by irradiating a fluorescer with light, one can obtain a plurality of emissions. Thus, a single label can provide for a plurality of measurable events.
Detectable signal can also be provided by chemiluminescent and bioluminescent sources. Chemiluminescent sources include a compound which becomes electronically excited by a chemical reaction and can then emit light which serves as the detectable signal or donates energy to a fluorescent acceptor. Alternatively, luciferins can be used in conjunction with luciferase or lucigenins to provide bioluminescence.
Spin labels are provided by reporter molecules with an unpaired electron spin which can be detected by electron spin resonance (ESR) spectroscopy. Exemplary spin labels include organic free radicals, transitional metal complexes, particularly vanadium, copper, iron, and manganese, and the like. Exemplary spin labels include nitroxide free radicals.
The label may be added to the target (sample) nucleic acids) prior to, or after the hybridization. So called "direct labels" are detectable labels that are directly attached to or incorporated into the target (sample) nucleic acid prior to hybridization. In contrast, so called "indirect labels" are joined to the hybrid duplex after hybridization.
Often, the indirect label is attached to a binding moiety that has been attached to the target nucleic acid prior to the hybridization. Thus, for example. the target nucleic acid may be biotinylated before the hybridization. After hybridization, an avidin-conjugated fluorophore will bind the biotin bearing hybrid duplexes providing a label that is easily detected. For a detailed review of methods of labeling nucleic acids and detecting labeled hybridized nucleic acids see Laboratory Techniques in Biochemistry and Molecular Biology, Vol. 24: Hybridization With Nucleic Acid Probes, P. Tijssen, ed.
filsevier, N.Y., ( 1993)).
Fluorescent labels are easily added during an in vitro transcription reaction. Thus, for example, fluorescein labeled UTP and CTP can be incorporated into the RNA produced in an in vitro transcription.
The labels can be attached directly or through a linker moiety. In general, the site of label or linker-label attachment is not limited to any specific position. For example, a label may be attached to a nucleoside, nucleotide, or analogue thereof at any position that does not interfere with detection or hybridization as desired.
For example, certain Label-ON Reagents from Clontech (Palo Alto, CA) provide for labeling interspersed throughout the phosphate backbone of an oligonucleotide and for terminal labeling at the 3' and 5' ends. As shown for example herein, labels can be attached at positions on the ribose ring or the ribose can be modified and even eliminated as desired.
The base moieties of useful labeling reagents can include those that are naturally occurring or modified in a manner that does not interfere with the purpose to which they are put. Modified bases include but are not limited to 7-deaza A and G, 7-deaza-8-aza A
and G, and other heterocyclic moieties.
It will be recognized that fluorescent labels are not to be limited to single species organic molecules, but include inorganic molecules, mufti-molecular mixtures of organic and/or inorganic molecules, crystals, heteropolymers, and the like.
Thus, for example, CdSe-CdS core-shell nanocrystals enclosed in a silica shell can be easily derivatized for coupling to a biological molecule (Bruchez et al. (1998) Science, 281:
2013-2016). Similarly, highly fluorescent quantum dots (zinc sulfide-capped cadmium selenide) have been covalently coupled to biomolecules for use in ultrasensitive biological detection (Warren and Nie (1998) Science, 281: 2016-2018).
AmQ,ification-based assays.
In another embodiment, amplification-based assays can be used to detect nucleic acids. In such amplification-based assays, the nucleic acid sequences act as a template in an amplification reaction (e.g. Polymerase Chain Reaction (PCR).
Detailed protocols for quantitative PCR are provided in Innis et al. ( 1990) PCR
Protocols, A Guide to Methods and Applications, Academic Press, Inc. N.Y.).
Other suitable amplification methods include, but are not limited to ligase chain reaction (LCR) (see Wu and Wallace (1989) Genomics 4: 560, Landegren et al.
(1988) Science 241: 1077, and Barringer et al. (1990) Gene 89: 117, transcription amplification (Kwoh et al. (1989) Proc. Natl. Acad. Sci. USA 86: 1173), and self sustained sequence replication (Guatelli et al. ( 1990) Proc. Nat. Acad. Sci.
USA 87:
1874).
Detection;. of C. pneumoniae gene expression The nucl:;ic acids of the invention can also be used to G pneumoniae detect gene transcripts. Methods of detecting and/or quantifying gene transcripts using nucleic acid hybridization techniques are known to those of skill in the art (see Sambrook et al. supra). For example , a Northern transfer may be used for the detection of the desired mRNA directly. In brief, the mRNA is isolated from a given cell sample using, for example, an acid guanidinium-phenol-chloroform extraction method. The mRNA
is then electrophoresed to separate the mRNA species and the mRNA is transferred from the gel to a nitrocellulose membrane. As with the Southern blots, labeled probes are used to identify and/or quantify the target mRNA.
In another preferred embodiment, the gene transcript can be measured using amplification (e.g. PCR) based methods as described above for directly assessing copy number of the target sequences.
Expression of C. Dneumoniae proteins The nucleic acids disclosed here can be used for recombinant expression of the proteins. In these methods, the nucleic acids encoding the proteins of interest are introduced into suitable host cells, followed by induction of the cells to produce large amounts of the protein. The invention relies on routine techniques in the field of recombinant genetics, well known to those of ordinary skill in the art. A
basic text disclosing the general methods of use in this invention is Sambrook et al., Molecular Cloning, A Laboratory Manual (2nd ed. 1989).
Standard transfection methods are used to produce prokaryotic, mammalian, yeast or insect cell lines which express large quantities of the desired polypeptide, which is then purified using standard techniques (see, e.g., Colley et al., J.
Biol. Chem. 264:17619-17622, 1989; Guide to Protein PuriJZCation, supra).
The nucleotide sequences used to transfect the host cells can be modified to yield Chlamydia polypeptides with a variety of desired properties. For example, the polypeptides can vary from the naturally-occurring sequence at the primary structure level by amino acid, insertions, substitutions, deletions, and the like. These modifications can be used in a number of combinations to produce the final modified protein chain.
The amino acid sequence variants can be prepared with various objectives in mind, including facilitating purification and preparation of the recombinant polypeptide. The modified polypeptides are also useful for modifying plasma half life, improving therapeutic efficacy, and lessening the severity or occurrence of side effects during therapeutic use. The amino acid sequence variants are usually predetermined variants not found in nature but exhibit the same immunogenic activity as naturally occurring protein. In general, modifications of the sequences encoding the polypeptides may be readily accomplished by a variety of well-known techniques, such as site-directed mutagenesis (see Gillman & Smith, Gene 8:81-97 (1979); Roberts et al., Nature 328:731-734 (1987)). One of ordinary skill will appreciate that the effect of many mutations is difficult to predict. Thus, most modifications are evaluated by routine screening in a suitable assay for the desired characteristic. For instance, the effect of various modifications on the ability of the polypeptide to elicit a protective immune response can be easily determined using in vitro assays. For instance, the polypeptides can be tested for their ability to induce lymphoproliferation, T cell cytotoxicity, or cytokine production using standard techniques.
The particular procedure used to introduce the genetic material into the host cell for expression of the polypeptide is not particularly critical. Any of the well known procedures for introducing foreign nucleotide sequences into host cells may be used. These include the use of calcium phosphate transfection, spheroplasts, electroporation, liposomes, microinjection, plasmid vectors, viral vectors and any of the other well known methods for introducing cloned genomic DNA, cDNA, synthetic DNA
or other foreign genetic material into a host cell (see Sambrook et al., supra). It is only necessary that the particular procedure utilized be capable of successfully introducing at least one gene into the host cell which is capable of expressing the gene.
Any of a number of well known cells and cell lines can be used to express the polypeptides of the invention. For instance, prokaryotic cells such as E.
toll can be used. Eukaryotic cells include, yeast, Chinese hamster ovary (CHO) cells, COS
cells, and insect cells.
The particular vector used to transport the genetic information into the cell is also not particularly critical. Any of the conventional vectors used for expression of recombinant proteins in prokaryotic and eukaryotic cells may be used.
Expression - vectors for mammalian cells typically contain regulatory elements from eukaryotic viruses.
The expression vector typically contains a transcription unit or expression cassette that contains all the elements required for the expression of the polypeptide DNA
in the host cells. A typical expression cassette contains a promoter operably linked to the DNA sequence encoding a polypeptide and signals required for efficient polyadenylation of the transcript. The term "operably linked" as used herein refers to linkage of a promoter upstream from a DNA sequence such that the promoter mediates transcription of the DNA sequence. The promoter is preferably positioned about the same distance from the heterologous transcription start site as it is from the transcription start site in its natural setting. As is known in the art, however, some variation in this distance can be accommodated without loss of promoter function.
Following the growth of the recombinant cells and expression of the polypeptide, the culture medium is harvested for purification of the secreted protein. The media are typically clarified by centrifugation or filtration to remove cells and cell debris and the proteins are concentrated by adsorption to any suitable resin or by use of ammonium sulfate fractionation, polyethylene glycol precipitation, or by ultrafiltration.
Other routine means known in the art may be equally suitable. Further purification of the polypeptide can be accomplished by standard techniques, for example, affinity chromatography, ion exchange chromatography, sizing chromatography, HkS6 tagging and Ni-agarose chromatography (as described in Dobeli et al., Mol. and Biochem.
Parasit.
41:259-268 ( 1990)), or other protein purification techniques to obtain homogeneity. The purified proteins are then used to produce pharmaceutical compositions, as described below.
An alternative method of preparing recombinant polypeptides useful as vaccines involves the use of recombinant viruses (e.g., vaccinia). Vaccinia virus is grown in suitable cultured mammalian cells such as the HeLa S3 spinner cells, as described by Mackett et al., in DNA cloning Vol. IL~ A practical approach, pp. 191-211 (Glover, ed.).
Antibod~Production The proteins of the present invention can be used to produce antibodies specifically reactive with C pneumoniae antigens. If isolated proteins are used, they may be recombinantly produced or isolated from Chlamydia cultures. Synthetic peptides made using the protein sequences may also be used.
Methods of production of polyclonal antibodies are known to those of skill in the art. In brief, an immunogen, preferably a purified protein, is mixed with an adjuvant and animals are immunized. When appropriately high titers of antibody to the immunogen are obtained, blood is collected from the animal and antisera is prepared.
Further fractionation of the antisera to enrich for antibodies reactive to Chlamydia proteins can be done if desired (see Harlow & Lane, Antibodies: A Laboratory Manual ( 1988)).
Polyclonal antisera are used to identify and characterize Chlamydia in the tissues of patients using, for instance, in situ techniques and immunoperoxidase test procedures described in Anderson et al. JA VMA 198:241 ( 1991 ) and Barr et al. Vet.
Pathol. 28:110-116 (1991).
Monoclonal antibodies may be obtained by various techniques familiar to those skilled in the art. Briefly, spleen cells from an animal immunized with a desired antigen are immortalized, commonly by fizsion with a myeloma cell (see Kohler &
Milstein, Eur. J. Immunol. 6:511-519 (1976)). Alternative methods of immortalization include transformation with Epstein Barr Virus, oncogenes, or retroviruses, or other methods well known in the art. Colonies arising from single immortalized cells are screened for production of antibodies of the desired specificity and affinity for the antigen, and yield of the monoclonal antibodies produced by such cells may be enhanced by various techniques, including injection into the peritoneal cavity of a vertebrate host.
Monoclonal antibodies produced in such a manner are used, for instance, in ELISA diagnostic tests, immunoperoxidase tests, immunohistochemical tests, for the in vitro evaluation of spirochete invasion, to select candidate antigens for vaccine development, protein isolation, and for screening genomic and cDNA libraries to select appropriate gene sequences.
Immunodiagonostic detection of C. pneumoniae infections The present invention also provides methods for detecting the presence or absence of C. pneumoniae, or antibodies reactive with it, in a biological sample. For instance, antibodies specifically reactive with Chlamydia can be detected using either Chlamydia proteins or the isolates described here. The proteins and isolates can also be used to raise specific antibodies (either monoclonal or polyclonal) to detect the antigen in a sample. In addition, the nucleic acids disclosed and claimed here can be used to detect Chlamydia-specific sequences using standard hybridization techniques.
For a review of immunological and immunoassay procedures in general, see Basic and Clinical.rmmunology (Stites & Terr ed., 7th ed. 1991)). The immunoassays of the present invention can be perfonmed in any of several configurations, which are reviewed extensively in Enzyme Immunoassay (Maggio, ed., 1980); Tijssen, Laboratory Techniques in Biochem.stry and Molecular Biology ( 1985)). For instance, the proteins and antibodies disclose 1 here are conveniently used in ELISA, immunobiot analysis and agglutination assays.
In brief, immunoassays to measure anti-Chlamydia antibodies or antigens can be either competitive or noncompetitive binding assays. In competitive binding assays, the sample analyte (e.g., anti-Chlamydia antibodies) competes with a labeled analyte (e.g., anti-Chlamydia monoclonal antibody) for specific binding sites on a capture agent (e.g., isolated Chlamydia protein) bound to a solid surface. The concentration of labeled analyte bound to the capture agent is inversely proportional to the amount of free analyte present in the sample.
Noncompetitive assays are typically sandwich assays, in which the sample analyze is bound between two analyte-specific binding reagents. One of the binding agents is used as a capture agent and is bound to a solid surface. The second binding agent is labelled and is used to measure or detect the resultant complex by visual or W strument means.
A number of combinations of capture agent and labelled binding agent can be used. For instance, an isolated Chlamydia protein or culture can be used as the capture agent and labelled anti-human antibodies specific for the constant region of human antibodies can be used as the labelled binding agent. Goat, sheep and other non-l.uman antibodies specific for human immunoglobulin constant regions (e.g., y or p.) are well known in the art. Alternatively, the anti-human antibodies can be the capture agent and the antigen can be labelled.
Various components of the assay, including the antigen, anti-Chlamydia antibody, or anti-human antibody, may be bound to a solid surface. Many methods for immobilizing biomolecules to a variety of solid surfaces are known in the art.
For instance, the solid surface may be a membrane (e.g., nitrocellulose), a microtiter dish (e.g., PVC or polystyrene) or a bead. The desired component may be covalently bound or noncovalently attached through nonspecific bonding.
Alternatively, the immunoassay may be carried out in liquid phase and a variety of separation methods may be employed to separate the bound labeled component from the unbound labelled components. These methods are known to those of skill in the art and include immunoprecipitation, column chromatography, adsorption, addition of magnetizable particles coated with a binding agent and other similar procedures.
An immunoassay may also be carried out in liquid phase without a separation procedure. Various homogeneous immunoassay methods are now being applied to immunoassays for protein analytes. In these methods, the binding of the binding agent to the analyte causes a change in the signal emitted by the label, so that binding may be measured without separating the bound from the unbound labelled component.
Western blot (immunoblot) analysis can also be used to detect the presence of antibodies to Chlamydia in the sample. This technique is a reliable method for confirming the presence of antibodies against a particular protein in the sample. The technique generally comprises separating proteins by gel electrophoresis on the basis of molecular weight, transferring the separated proteins to a suitable solid support, (such as a nitrocellulose filter, a nylon filter, or derivatized nylon filter), and incubating the sample with the separated proteins. This causes specific target antibodies present in the sample to bind their respective proteins. Target antibodies are then detected using labeled anti-human antibodies.
The immunoassay formats described above employ labelled assay components. The label may be coupled directly or indirectly to the desired component of the assay according to methods well known in the art. A wide variety of labels may be used. The component may be labelled by any one of several methods.
Traditionally a radioactive label incorporating 3H,'ZSh ass, i4C, or 32P was used. Non-radioactive labels include ligands which bind to labelled antibodies, fluorophores, chemiluminescent agents, enzymes, and antibodies which can serve as specific binding pair members for a labelled ligand. The choice of label depends on sensitivity required, ease of conjugation with the compound, stability requirements, and available instrumentation.
$ Enzymes of interest as labels will primarily be hydrolases, particularly phosphatases, esterases and glycosidases, or oxidoreductases, particularly peroxidases.
Fluorescent compounds include fluorescein and its derivatives, rhodamine and its '- derivatives, dansyl, umbeliiferone, etc. Chemiluminescent compounds include luciferin, and 2,3-dihydrophthalazinediones, e.g., luminol. For a review of various labelling or signal producing systems which may be used, see U.S. Patent No. 4,391,904, which is incorporated herein by reference.
Non-radioactive labels are often attached by indirect means. Generally, a ligand molecule (e.g., biotin) is covalently bound to the molecule. The ligand then binds to an anti-ligand (e.g., streptavidin) molecule which is either inherently detectable or covalently bound to a signal system, such as a detectable enzyme, a fluorescent compound, or a chemiluminescent compound. A number of ligands and anti-ligands can be used. Where a Iigand has a natural anti-ligand, for example, biotin, thyroxine, and cortisol, it can be used in conjunction with the labelled, naturally occurring anti-ligands.
Alternatively, any haptenic or antigenic compound can be used in combination with an antibody.
Some assay formats do not require the use of labelled components. For instance, agglutination assays can be used to detect the presence of the target antibodies.
In this case, antigen-coated particles are agglutinated by samples comprising the target antibodies. In this format, none of the components need be labelled and the presence of the target antibody is detected by simple visual inspection.
Phazmaceutical Compositions The peptides or antibodies (typically monoclonal antibodies) of the present invention and pharmaceutical compositions thereof are useful for administration to mammals, particularly humans, to treat and/or prevent Chlamydia infections.
Suitable formulations are found in Remington's Pharmaceutical Sciences, Mack Publishing Company, Philadelphia, PA, 17th ed. (1985).
The immunogenic peptides or antibodies of the invention are administered prophylactically or to an individual already suffering from the disease. The peptide compositions are administered to a patient in an amount sufficient to elicit an effective immune response to Chlamydia. An effective immune response is one that inhibits infection. An amount adequate to accomplish this is defined as "therapeutically effective dose" or "immunogenically effective dose." Amounts effective for this use will depend on, e.g., the peptide composition, the manner of administration, the stage and severity of the disease being treated, the weight and general state of health of the patient, and the judgment of the prescribing physician, but generally range for the initial immunization (that is for therapeutic or prophylactic administration) from about 0.1 mg to about 1.0 mg per 70 kilogram patient, more commonly from about 0.5 mg to about 0.75 mg per 70 kg of body weight. Boosting dosages are typically from about 0.1 mg to about 0.5 mg of peptide using a boosting regimen over weeks to months depending upon the patient's response and condition. A suitable protocol would include injection at time 0, 4, 2, 6, 10 and 14 weeks, followed by further booster injections at 24 and 28 weeks.
For therapeutic use, administration should begin at the first sign of infection. This is followed by boosting doses until at least symptoms are substantially abated and for a period thereafter. In some circumstances, loading doses followed by boosting doses may be required. The resulting immune response helps to cure or at least partially arrest symptoms and/or complications. Vaccine compositions containing the peptides are administered prophylactically to a patient susceptible to or otherwise at risk of the infection.
The pharmaceutical compositions (containing either peptides or antibodies) are intended for parenteral or oral administration. Preferably, the pharmaceutical compositions are administered parenterally, e.g., subcutaneously, intradermally, or intramuscularly. Thus, the invention provides compositions for parenteral administration which comprise a solution of the immunogenic polypeptides dissolved or suspended in an acceptable carrier, preferably an aqueous carrier. A variety of aqueous carriers may be used, e.g., water, buffered water, 0.4% saline, 0.3% glycine, hyaluronic acid and the like. These compositions may be sterilized by conventional, well known sterilization techniques, or may be sterile filtered. The resulting aqueous solutions may be packaged for use as is, or lyophilized, the lyophilized preparation being combined with a sterile solution prior to administration. The compositions may contain pharmaceutically acceptable auxiliary substances as required to approximate physiological conditions, such as buffering agents, tonicity adjusting agents, wetting agents and the like, for example, sodium acetate, sodium lactate, sodium chloride, potassium chloride, calcium chloride, sorbitan monolaurate, triethanolamine oleate, etc.
The compositions may also comprise carriers to enhance the immune response. Useful carriers are well known in the art, and include, e.g., KLH, thyroglobulin, alburnins such as human serum albumin, tetanus toxoid, poiyamino acids such as poly(lysine:glutamic acid), influenza, hepatitis B virus core protein, hepatitis B
virus recombinant vaccine and the like.
For solid compositions, conventional nontoxic solid carriers may be used which include, for exarr.ple, pharmaceutical grades of mannitol, lactase, starch, magnesium stearate, soc!.ium saccharin, talcum, cellulose, glucose, sucrose, magnesium carbonate, and the like. For oral administration, a pharmaceutically acceptable nontoxic composition is formed Y y incorporating any of the normally employed excipients, such as 1 ~ those carriers previously listed, and generally 10-95% of active ingredient, that is, one or more peptides of the invention, and more preferably at a concentration of 25%-75%.
As noted above, the peptide compositions are intended to induce an immune response to Chlamydia. Thus, compositions and methods of administration suitable for maximizing the immune response are preferred. For instance, peptides may be introduced into a host, including humans, linked to a carrier or as a homopoiymer or heteropolymer of active peptide units from various Chlamydia proteins disclosed here.
Alternatively, a "cocktail" of polypeptides can be used. A mixture of more than one polypeptide has the advantage of increased immunological reaction and, where different peptides are used to make up the polymer, the additional ability to induce antibodies to a number of epitopes.
The compositions also include an adjuvant. As used here, number of adjuvants are well known to one skilled in the art. Suitable adjuvants include incomplete Freund's adjuvant, alum, aluminum phosphate, aluminum hydroxide, N-acetyl-rnuramyl-L-threonyl-D-isoglutamine (thr-MDP), N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine (CGP 11637, referred to as nor-MDP), N-acetylinuramyl-Lalanyl-D-isoglutaminyl-L-alanine-2-{1'-2'-dipalmitoyl-sn-g:ycero-3-hydroxyphosphoryloxy)-ethylamine (CGP 19835A, referred to as MTP-PE), and RIBI, which contains three components extracted from bacteria, monophosphoryl WO 00!17994 PCT/US99/26923 lipid A, trehalose dimycolate and cell wall skeleton (MPL+TDM+CWS) in a 2%
squalenelTween 80 emulsion. The effectiveness of an adjuvant may be determined by measuring the amount of antibodies directed against the immunogenic peptide.
The concentration of immunogenic peptides of the invention in the S pharmaceutical formulations can vary widely, i.e. from less than about 0.1 %, usually at or at least about 2% to as much as 20% to 50% or more by weight, and will be selected primarily by fluid volumes, viscosities, etc., in accordance with the particular mode of administration selected.
The peptides of the invention can also be expressed by attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use of vaccinia virus as a vector to express nucleotide sequences that encode the peptides of the invention. Upon introduction into a host, the recombinant vaccinia virus expresses the immunogenic peptide, and thereby elicits an immune response. Vaccinia vectors and methods useful in immunization protocols are described in, e.g., U.S. Patent No. 4,722,848.
Another vector is BCG (Bacille Calmette Guerin). BCG vectors are described in Stover et aI.
(Nature 351:456-460 (1991)). A wide variety of other vectors useful for therapeutic administration or immunization of the peptides of the invention, e.g., Salmonella typhi vectors and the like, will be apparent to those skilled in the art from the description herein.
The DNA encoding one or more of the peptides of the invention can also be administered to the patient. This approach is described, for instance, in Wolff et. al., Science 247: 1465-1468 (1990) as well as U.S. Patent Nos. 5,580,859 and 5,589,466.
In order to enhance serum half life, the peptides may also be encapsulated, introduced into the lumen of liposomes, prepared as a colloid, or other conventional techniques may be employed which provide an extended serum half life of the peptides.
A variety of methods are available for preparing liposomes, as described in, e.g., Szoka et al., Ann. Rev. Biophys. Bioeng. 9:467 (1980), U.S. Pat. Nos. 4, 235,871, 4,501,728 and 4,837,028.
EXAMPLES
The following examples are offered to illustrate, but no to limit the claimed invention.
Examvle 1:
This example describes comparison of the C. pneumoniae genome disclosed here and the, previously sequenced, C. trachomatis genome (Stephens, et al.
Science 282:754-759 (1998)).
The apparent low level of DNA homology between C. trachomaris and C.
pneumoniae (Campbell, et al., J. Clin. Microbiol. 25:1911-1916 {1987)) yet analogous cell structures and developmental cycles, predicts that comparative analysis of the two genomes will significantly enhance the understanding of both pathogens.
Identification of genes that are present in one species but not the other are of particular importance for the mutually exclusive biological, virulence and pathogenesis capabilities of each.
Identification of genes shared between the two species strongly supports the requirement for these capabilities in a biological system that has, over its long-term association with mammalian host cells, evolved to reduce the metabolic capacities while optimizing survival, growth and transmission of these unique pathogens.
The previously sequenced G trachomatis genome contains 1,042,519 I S nucleotides and 875 likely protein-coding genes. Similarity searching permitted the inferred functional assignment of sequences 636 {60%) genes disclosed here and (23%) are similar to hypothetical genes for other bacterial organisms including those for G trachomatis. The remaining 186 (17%) genes are not homologous to sequences deposited in GenBank.. Seventy C. trachomatis genes are not represented in the C.
pneumoniae genome. These are contained within blocks consisting of 2-17 genes and 19 single genes. Of the 70 G trachomatis genes without homologs in C. pneumoniae, 60 are classified as encoding hypothetical proteins. The remaining genes not represented in C
pneumoniae consist of the tryptophan operon (trpA,B,R), trpC, two predicted thiol protease genes, and 4 genes assigned to the phospholipase-D superfamily.
It is evident that there is a high level of functional conservation between C.
pneumoniae and C. trachomatis as orthologs to C. trachomatis genes were identified for 859 (80%) of the predicted coding sequences for G pneumoniae. The level of similarity for individual encoded proteins spans a wide spectrum (22-95% amino acid identity) with an average of 62% amino acid identity between orthologs from the two species.
The percent amino acid identity between orthologous chlamydial proteins is similar among functional groups with the highest for proteins associated with translation and the lowest for proteins whose function in chlamydiae is uncharacterized and not related to proteins encoded by other organisms. The gene order of the homologous set of genes in C.
WO 00/27994 PCT/US99/2b923 pneumoniae shows reorganization relative to the genome of C. trachomatis;
however, there is a high level of synteny for the gene organization of the two genomes.
We identified thirty-nine blocks of 2 or more genes whose gene organization is colinear with homologs to C. trachomatis, although some of these are inverted. The distribution of genome reorganization is not evenly distributed on the chromosome as the region between G pneumoniae coding sequences 0130-0300 contains substantially more reorganization than other areas of the genome. This region coincides with the predicted chromosome replication terminus.
We identified orthologs of enzymes characterized in other bacteria that account for the essential requirements for DNA replication, repair, transcription and translation including two predicted DNA helicases of the Swi2/Snf2 family found in C.
trachomatis. Similar to G trachomatis, alternative sigma subunits for RNA
polymerase, X28 ~d ~54~ were identified in addition to anti-a~ regulatory system factors RsbV, a RsbW-like single-domain histidine kinase, and a RsbU-like protein phosphatase.
These findings suggest that the fundamental mechanisms of transcriptional regulation are conserved among Chlamydia. The C. trachomatis proteins containing SET and SWIB
domains, and a SWiB domain fused to the C-terminus of the chlamydial topoisomerase I, not identified outside eukaryotes, are found in C. pneumoniae supporting their possible role in the chromatin condensation-decondensation characteristic of the biologically unique chlamydial developmental cycle.
The central metabolic pathways inferred from the G pneumoniae genome sequence are the same as those identified for C. trachomatis G pneumoniae has a glycolytic pathway and a linked tricarboxylic acid cycle, although likely functional, is incomplete as genes for citrate synthase, aconitase, and isocitrate dehydrogenase were not identified. C. pneumoniae has a complete glycogen synthesis and degradation system supporting a role for glycogen synthesis and utilization of glucose-derivatives in chlamydial metabolism. Genes encoding essential functions in aerobic respiration are present and electron flux may be supported by pyruvate, succinate, glycerol-3-phosphate, and NADH dehydrogenases, NADH-ubiquinone oxidoreductase and cytochrome oxidase.
C. pneumoniae also contains the V (vacuolar}-type ATPase operon and the two ATP
translocases found in C trachomatis.
The type-III secretion virulence system required for invasion by several pathogenic bacteria and found in the C. trachomatis genome in three chromosomal locationsis also present in the C. pneumoniae genome. Each of the components is conserved and their relative genomic contexts are conserved. Genes such as a predicted serine/threonine protein kinase and other genes physically linked to genes encoding structural components of the type-III secretion apparatus, but without identified homologs, are also highly similar between the two species suggesting the functional roles in modifying cellular biology are fundamentally conserved.
Chlamydia-encoded proteins that are not found in chlamydial organisms but localized to the intracellular chlamydial inclusion membrane are likely essential for the unique intracellular biology and perhaps differences in inclusion morphology observed between species of Chlamydia. Several such proteins, termed incA,B&C, have been characterized for a !:. psittaci strain (Rockey, et al. Mol. Microbiol.
15:617-626 (1995); Rockey et al. Inf:~ct. Immun. 62:106-112 (1994)). C. pneumoniae and C.
trachomatis encode orthc~logs to C. psittaci Inca and IncC and C. trachomatis also contains an ortholog to LicA. C. pneumoniae contains two genes that encode proteins with similarity to IncA (CPn0186 and CPn0585), although the level of homology is low suggesting analogous but possibily altered functions.
The tryptophan biosynthesis operon (trpA, trpB, trpR) and trpC identified in C. trachomatis is conspicuously missing in the C. pneumoniae genome. This represents the entire repertoire of genes associated with tryptophan biosynthesis identified in C. trachomatis. Seventeen genes adjacent to the C. trachomatis tryptophan operon also were not found in the G pneumoniae genome. This region is the single largest loss of a contiguous genomic segment and includes 4 HKD superfamily encoding genes that encompass a family of proteins related to endonuclease and phospholipase D.
These findings may be important for the ability of Chlamydia to persist in their hosts and cause disease by eliciting potent, focal and persistent inflammatory responses thought to be essential for pathogenesis.
The C. pneumoniae genome contains 187,711 additional nucleotides compared to the C. trachomatis genome, and the 214 coding sequences not found in C.
trachomatis account for most of the increased genome size. Eighty-eight of these genes are found in blocks of >10 genes {11-30 genes/block), 41 are single genes, and the remainder are partnered with at least one other gene. Based upon the observation that ~%U% of all the C. pneumoniae genes have an identifiable homolog in GenBank, exclusive of C. trachomatis, it would be expected that over 150 of the 214 genes should have a homolog in GenBank, many associated with a function. However, only 28 coding sequences have similarity to genes from other organisms. Thus the majority of the genes that are mutually exclusive of C. trachomatis (186 of 214), and the 60 of 70 G
trachomatis genes that lacked an identifiable homolog in C. pneumoniae, do not have detectable homologs to genes from other organisms. We predict that most of the unique genes are essential for specific attributes that define the differential biology, tropism and pathogenesis of C. trachomatis and C. pneumoniae. Moreover, this suggests that C.
pneumoniae has more unique biological (i.e., virulence) capacity than C.
trachomatis.
The ability of C. pneumoniae to be more invasive and survive in a broader range of host cell types than C. trachomatis is consistent with this hypothesis. Not all of the differences in biological capacity may be associated with mutually exclusive genes. One explanation for the significantly lower level of homology between protein sequences assigned as having G pneumoniae and C. trachomatis orthologs but no identifiable orthologs in other organisms is that this set of proteins is not only associated with biological requirements specific for Chlamydia but this polymorphism may account for differential biology between the two species. The determination of the genome sequence from a representative of the C. psittaci group will precisely delineate those genes that are mutually exclusive and specific for each species.
The major functionally identifiable addition to the C. pneumoniae genome is a large expansion of genes encoding a new family of chlamydial polymorphic membrane proteins (Pmp), alone representing 22% of the increased coding capacity.
While the C. trachomatis genome has 9 pmp genes, remarkably the C. pneumoniae genome contains 21 pmp genes. Most of these genes appear to be amplified in two regions of the genome with three stand-alone genes. Interestingly one of the stand-alone genes is most closely related to the C. trachomatis pmpD which is the only stand-alone pmp gene in the C. trachomatis genome and it is located with the same relative genomic context, suggesting an essential and conserved function for this paralog. Six Pmp-coding genes are presumably not functional as five contain predicted coding frame-shifts and one is truncated. The amplification of this gene family and the confidently predicted frame-shifts suggest a specific molecular mechanism to promote functional or antigenic diversity. The biological role of this protein family remains enigmatic, although at least one of the proteins in G psittaci related to this family is exposed on the chlamydial surface.
WO 00/27994 PCT/US99/2b923 While a function could not be assigned for most of the unique G
pneumoniae genes, several have significant similarity to genes from other organisms.
Functional assignments could be made for genes encoding GMP synthetase, IMP
dehydrogenase, (JMP synthase, uridine kinase, biotin svnthase pathway proteins, methylthioadenosine nucleosidase, a DNA glycosylase and aromatic amino acid hydroxylase. Thus a complete pathway was identified for biotin biosynthesis.
The additional purine and pyrimidine salvage pathway genes presumably reflect metabolic ' limitations in one of the cell types that G pneumoniae infects or differences in the ability of C. pneumoniae to transport precursor nucleosides or nucleotides.
The addition of aromatic amino acid hydroxylase in G pneumoniae is intriguing especially in light of the loss of tryptophan biosynthetic genes and the inability to synthesize other amino acids including phenylalanine. Aromatic amino acid hyroxlyases include three distinct enzymes that function to receptively oxidize phenylalanine to tyrosine, tyrosine to Dopa, and tryptophan to 5-hydroxytryptophan and serotonin. Although the chlamydial protein is similar to proteins of this family and incrementally more closely related to tryptophan hydroxyiase, its specific function could not be confidently predicted. We hypothesize that it may be involved in C.
pneumoniae virulence. Tryptophan hydroxylase has not been previously identified in bacteria and the origin of the chlamydial gene appears to be from eukaryotes. The functional role of an aromatic amino acid hydroxyiase for C. pneumoniae is linked to the unique intracellular biology of this organism and may represent a key contribution to C. pneumoniae persistence and pathogenesis.
It is understood that the examples and embodiments described herein are for illustrative purposes only and that various modifications or changes in light thereof will be suggested to persons skilled in the art and are to be included within the spirit and purview of this application and scope of the appended claims. All publications, patents, and patent applications cited herein are hereby incorporated by reference in their entirety for all purposes.
Table 1 provides functional assignments of C. pneumoniae nonprotein-encoding genomic sequences. Table 2 provides functional assignments of protein coding sequences. Table 3 provides the amino acid sequences of the proteins corresponding to the coding sequences.
type SEQ iD N0:1 SEQ tD N0:1 Gene start position end position Ori 841664 841396 (R) Putative Origin of Replica tmRNA 138493 138074 (R) tmRNA
pRNA 607342 607649 Ribonuclease P
RNA
rRNA 1000564 1002115 165 rRNA
rRNA 1002415 1005278 235 rRNA
rRNA 1005393 1005509 5S rRNA
tRNA 269070 269142 Ala tRNA_1 tRNA 164318 164389 Asn tRNA
tRNA 296224 296151 (R) Asp tRNA
tRNA 836191 836119 (R) Ala tRNA_2 tRNA 1030533 1030603 Cys tRNA
tRNA 784896 784822 (R> Glu tRNA
tRNA 781680 781610 (R) Gly tRNA'1 tRNA 961536 961607 Gly tRN~2 tRNA 999949 1000023 His tRNA
tRNA 268992 269065 Ile tRNA
tRNA 672236 672318 Leu tRNA 1 tRNA 680178 680257 Leu tRNA'2 tRNA 715889 715971 Leu tRNF~3 tRNA 739403 739486 Leu tRNPr_4 tRNA 1175863 1175944 Leu tRNA'5 tRNA 784994 784922 (R) Lys tRNA
tRNA 843926, 843999 Pro tRNA_2 tRNA 409922' 409848 (R> Pro tRNA_1 tRNA 631373 631445 Phe tRNA
tRNA 677337 677264 (R) Arg tRNA~,2 tRNA 807413 807341 (R) Arg tRNA_3 tRNA 877473 877400 (R) Arg tRNA_4 tRNA 462141 462214 Arg tRNA_1 tRNA 1085605 . 10.85676 Gln tRNA
tRNA 786780 786708 (R) Thr tRNA_3 tRNA 89728 89657 (R) Thr tRNA_I
tRNA 293477 293405 (R) Thr tRNA'2 tRNA 87522 87450 (R) Met tRNP~l tRNA 199301 199229 (R) Met tRNA_2 tRNA 199390 199317 (R) Met tRNA_3 tRNA 626904 626987 Ser tRNA_1 tRNA 708359 708440 Ser tRNA_2 tRNA 1112034 1142117 Ser tRNA_3 tRNA 1230028 1229945 (R) Ser tRNA_4 tRNA 91070 90999 (R) Trp tRNA
tRNA 293399 293317 (R) Tyr tRNA
tRNA 296147 296075 (R) Val tRNA_1 tRNA 1137389 1137462 Va1 tRNA_2 gacggatttgcactgccggtagaactccgcgaggtcgtccagcctcaggcagcagctgaa2520 ccaactcgcgaggggatcgagcccggggtgggcgaagaactccagcatgagatccccgcg2580 ctggagg ~ana ~rem "2 ctraeW:eee a~~n~'fee ~~ rerheeierfl erheiee iartateaGheaeel Clft0001111 4 R CT001 hypothetical protein CPa0003577 175 t QatC-Glu-CRNA Gla luaidotransterasa tC subunit)-CPn0007195 X770 T aat~-Glu eRNl1 Gln Aaridotransterae-1GT003!
ma)un.t:!ih :~:! t- ,(.~rN.cPe~ll=t ~:In rnrt:, vln Ammto~san.-.:~rrrsr t1 ::uWtniW -t~:T9n.n vPn~0U5ils7 edJ1 F pmp_1-7olymorphic ~uc.rt !(emDrin~ Procmn G Famsly CPn0005~=93 7111 R
CPn00077805 10196 F
CPn000810975 11615 F
CPn000911115 13119 t cPnooloa a 13x6 r s CPa00101379 13746 t frame-shift with 0010 CPn00I11519= 16114 !
CPn001316it4 11x12 !
CPn001311511 =1106 F ymp_1-Polynnrphie Outer Membrane Protein C
lamily CPa0014II392 219x3 r ymD_3-Polyanrphic outer Membrane Protein G
lastly CPn001537.135x174 t' pmp-"3-PNP_3 lirasre-shift with 0011) CPn001614416 26118 t' pmp_t-Polyteosphie Outer Membrane Protein G lastly CPn001726094 x7170 F' pop_t-PMP_< Iiras~-shift with 00161 CPn0018375x2 29007 t pmp_5-Polymorphic outer Membrane Procain G
Pamily cIPn001929007 30356 t pap_S-PMP_S Iirame-shift with 0011) CPa00x031617 30603 P. Predieted OHP (leader (14) pepCide= outer membrane)-ICT7511 CPn00x131410 3x707 R Predicted OHP (Leader 119) prptide)-1CT350) CPn00xx1191 34395 F maL-1CT319) CPa00x336607 3301 F y~ilc/alr-ABC TraasDOrter Protein ATPeee-ICT348) CPn00x137596 36661 F xerC-InteQrsse/reeombinaae-lCT3t7) CPa00x51860 37614 A elaC/atsa-Sulphohydrolase/Glycosuliataae-( Crlt6l CPa00x639625 3176= R GT3t5 hypoe3tetical protein-ICTltS) CPn00x7txx3t 39778 R lon-Lon ATP-dependent Protease-lGT3tl CPn00x813325 txSt3 R
CPa00x93755 43310 R
CPn003013191 415x9 f Qep_1-O-SialoQlycaDroeaia Eadopeptidase_1-ICT343) CPa003114711 44111 ! rs=1-SZ1 Ribosaetal Ptrouin-fCT3t=I
Cln003x4913 46098 T daaJ-Meat Shoek Proteia J-ICT7ti) CPn003346138 8171 P pdhArrB/odbAiodbH-lpyruvatel Oxoisovalerate DehYdroOsasse Alpba i seta Pusioa-ICT3t0) CPa003449457 41=10 R
CPis0035510x9 49569 R CT339 hypothetical protein CPts00365100? 51796 t CT338 hypothetical Droeeia CPn003751792 5x115 F ptsH-PTS Phosphocarrisr Protein Hpr-It-T337) CPn00385x119 63831 F DtsI-PTS PtP Phosphotransierase-ICT336) ePn003954x50 53163 R ybal-ICT335) CPn00405563 St318 R dnaX_1-DN11 Pol III Gamma and Tau_1-fCT3341 CPn004156996 5733 T
CPa004257103 5113 !
CPn00t35847 60372 !
CPn004460419 60771 !
CPn00t561069 6=790 t CPa004661790 61x63 t CPn004761155 63151 ~
T
CPn00486311? 85101. ! yqiP-Is conserved hypothetical IM proesin CPn001966x96 651)7 R
CPn0050B613 66199 R
CPn00s166173 67111 t CPn005261005 6730 R hemC-Porphobilinot:en Oeaaunass-ICTZ991 CPn005369744 67916 A sms-Sau Protein-ICTZ91) CPn0054700:3 69713 R rnc-Ribonucluse III-(CTx97) CPn0055701x9 70590 F CT296 hypothetical proteia ' Cpn00s670953 72746 t .n~rsA-PhosDhoernnomucasr ICTt95) CPn00577=971 73551 F sodM-SuDeroxide Dismucase lMnlIC'T'l94) CPn00587)839 7156= F aec0-AeCoA Casboxylase/Transierase eca-tCT293) ePa0059'1618 71050 F duc-dtlTP Nueleocidohydrolase-ICTZ92) CPn006075055 755x8 F pesN_t-PTS IIA Protein-IC'C:9lf ~Pn005175514 76:08 F DtaN_Z-PTS IIA Protein ttTN DNJ1-lin3inQ
Domain-lGTx90) ~Pn~05274)04 77490 F CT-~9 hypocnecieAl proceln ~:Pn~~b77811: 74':67 F
~Pn00b17N146 78S7b F
~:Pnn9551N9:4 40651 F C't'=99 hyDOther:i,:nl protein ~Pn0055409:5 d=655 F
CPn006782953 8053 F
CPn006884903 81331 R CT360 hypothetical pzocein CPn006985236 87086 F
CPn007087378 87208 R
CPn007188045 87599 R CT325 hypothetical protein CPn007289061 88057 R CT324 hypothetical protein CPn007389356 89574 F infA-Initiation Factar IF-1-ICT323) CPn007d89774 90955 F cufA-Elongation Factor Tu-1CT322) CPn007591102 91350 F secE-preprotein cranslocaseICT321) CPn007691358 91903 F nusG-Transcriptional Anciterminacion-(CT320) CPn007792013 92435 F zlll-L11 Ribosomal Protein-(CT3191 CPn007892465 93160 F zll-L1 Ribosomal Protein-(CT318) CPn007993179 93688 F r110-L10 Ribosomal Protein-(CT317) CPn008093735 91131 F r17-L7/L12 Ribosomal Protein-(CT316>
CPn008194261 98016 F rpoH-RNA Polymerase Heca-(CT315) CPn008298043 102221 F rpoC-RNA Polymeraae Heta~ -ICT314) CPn0083102332 103312 F tal-Transaldolase-(CT313) CPn0084103362 103751 F predicted ferredoxin-ICT312) CPn0085104506 103755 R CT311 hypothetical protein CPn0086104904 105527 F atpE-ATP Synchase Subuait E-(CT310) CPn0087105579 105376 F CT309 hypothetical protein CPn0088106373 108145 F atpA-ATP Syachase Subuait A-(CT308) CPn0089108153 10966 F atpH-ATP Synthase Subunit e-(CT307) ~Pn0090109454 110080 F atpD-ATP Synthase Subunit D-(CT306) CPn009110074 112053 F 1 atpI-ATP Synthase Subunit I-ICT3051 CPn0092112151 112573 F atpK-ATP Synthase Subunit K-(CT304) CPn0093112509 113015 F CT303 hypothetical protein CPn0094113152 115971 F valS-Valyl tRNA Synthetase-ICT302) CPn0095116037 118790 F pfai0-5/T Protein Kinsse-(CT301) CPn0096124314 118837 R uvrA-Excinuclease AeC Subunit A-(CT333) CPn0097124555 126006 F pyk-Pyruvate Kinase-(CT332) CPn0098127491 126091 R htrH-ACyltransferase-ICT010) CPn0099127593 127865 F
CPn0100129141 127882 R CT011 hypothetical protein CPn0101129932 129141 R ybbP family hypothetical protein-ICT012) CPn0102130123 131466 F cydA-Cytochrome Oxidase Subunit I-(CT013) CPn0103131480 132511 F cycle-Cytochrome Oxidase Subunit II-(CT014) CPnOlOd133875 132676 R~ CT017 hypothetical protein CPn0105134847 134029 R CT016 hypothetical protein CPn0106135091 136374 F phoH-ATPase-(CT015) CPn0107137162 136392 R CT058 hypothetical pzotein_1 CPn0108137857 137303 R CT018 CPn0109138655 141783 F ileS-Isoleucyl-tRNA Synthecase-1CT019) CPn01101373 141827 R lepe-Signal Peptidase I-ICT020) CPn011114686 143934 R CT021 hypothetical protein CPn0112144767 145093 F r131-L31 Ribosomal Protein-(CT022) CPn0113145335 146405 F pfrA-Peptide Chain Releasing Factor (RF-1)-(CT0231 CPnOlld146398 147261 F hemK-A/G specific methylase-(CT024) CPa0115147279 148622 F ffh-Signal Recognition Particle GTPase-(CT025) CPn0116148616 148972 F rsl6-516 Ribosomal Protein-(CT026) CPn0117148989 150071 F tzmD-tRNA (guanine N-1)-Methylttansferase-(CT027) CPn0118150102 150464 ~ s119-L19 Ribosomal Protsin-(CT'028) F
CPn0119150523 151164 F rnhe_1-Ribonuclease HII_1-ICT029) CPn0120151164 151778 F gmk-GMP Kinase-(CT030) CPn0121151778 152068 F CT031 hypothetical protein CPn0122152071 153723 F mete-Methionyl-tRNA Synehetase-(CT032) CPa0123155969 153774 R recD_1-Exodeoxyribonuclease V (Alpha Subunit)_1-(CT033) CPn0124156614 158068 F
CPn0125158096 158605 F
CPn0126158809 161085 F
CPn0127162143 161130 R ytfF-Cationic Amino Acid Transporter-(CT034) CPn0128162277 163053 F bpll-Hiotin Protein Lipase-(CT035) CPn0129163717 16306 R similarity to CT036 CPn013016425 163751 R
CPn0131164519 165580 F
CPn0132165587 166561 F
CPn013316733 16656 R CHLPS hypothetical protein-(CT109) CPn0134169098 167467 R groEL_1-HSP-60_1-tCT1101 CPn0135169448 16913 R groES-lOKDa Chaperonin-(CT1111 CPn0136171401 169569 R pepF-Oligopepcidnse-ICT112) CPn0137172254 171502 A ybgI-ACR family-tCT1081 CPn013817019 172700 R hem:..-Glucatrace-1-semialdehyde-2.1-aminomutase-CPn013917465617093 R ypq=-(~j10) CPaoltO175110171173 R yqdi-tCTSlI
CPnoltl175103175110 R splA-Ribose-5-P Isasrrase A-tCTZ131 CP1l01t2176091175116 R
CPn01t317T33s176114 R 'yxjC_Ds_1 ttypothecical Proceia CPn0114177963180560 F elpl-Clp Protease ATPass-tCT1I31 cPaoltsI8o777I1=369 F CTllt hypochetiul protein cPnoltsIaI1131e3o9s r cPetolt7Ia3tI5113171 F
CPn0ltB18316 183702 F plasl-S/T Protein Kinase-tCT115) CPaolt918371517700 F dalJ-DNA LiQase-tCTi46) CPno1501171311911 F CTlt7 hypothetical protein CPn01511911 19=635 R mhpJ~-tioaooxypeeuse-(CT1181 ClnolSl19!=6s19718 R CT119 hypotbetiul prot~ia ~Paol5319533 197113 F leul-Leueyl tRNA 8yaehttass-fCTt09!
ClnolSt197892199301 F pseA-1CD0 Traaslsrase-ICTI08f CPnolSS191691191118 R
CPnol561001171!1770 R
CPao157100713100=98 A
CPtl015820130 100191 R
CPnoi59=01772101167 R
CPno160303791303137 R pfkJ~i-Fructose-b-P Phospdocraasferase_1-lCTI07) CPao161101612303798 R psedietad aeylcraasferase lamily-tGTI06) CPn016220511810803 R
CPa0163308016=06391 !
CPnolbt208198106!98 !
CPnoi65306198207583 P
CPetoi66z07830207963 !
CP~WI67201306107977 R
CPnolBB20161 201417 R
CPno169109501101710 R
CPao170111016110015 R
Clnol7lIi=13621119 R '~faA-Clip 9ynthaas Clnol7l11317721=110 R QuaD/lapD-laosiae 5'-moaophosphass dehydro0anase IC00R-tesa~iaal savior.
only) Clno173113987213715 R
cPnol7tIlass7Iu7It F
CPnol75214198215175 F' Claol76213=86z16318 F CTi53 hypoehatieal protein CPno17721759 116608 R
CPn0178211052317789 R
CPn0179211103218056 R
CPuo180111851218356 R
CPao181219175111777 R
CPn0182110596219331 R aceC-Biocia Carboxylass-tCTiIt) CPn0183111195330695 R ace!-Diocia Carboxyl Carrier Protein-tCTlI3) CPa018t211775231331 R s!p_1-EloaQacion Factor P_I-tCTl3I) CPUo185113151231765 R spe/araD-Ribulose-P Epimsrus-tCT121) CPn0186111199111068 F stmilaricy to Cps IaeJL1-tCT1191 CPnQ117111118213015 F predicted metdylass-fCTi331 CPno188116111111100 F CTI3I hypoebecical prouia CPn0189116100211815 F CSI31 homolo0-(Possible Transmembraas Prouin) ~
Clao19013!!19131271 F
Cla019133199133131 R QlaQ-1180 Amigo Acid Trsasportes ATPase-1CT130) CPao192I3=631131981 R Qln!-A8C Amigo Acid Tsaasporeer Pesmsass-ICT1I9I
CPa0193233126231686 R arQR-ArQiaias Re~tssor CPa019t13311023111 F Qep_I-O-Sialo0lyeoprotsia trrdopsptidase_I-tC?197) CPaoi95231190135786 F oppA_1-Olfpopeptide BindiaQ Procai~l CPa0196336939137519 F app!'.I-Olipopepeids DindiaQ Protais~l-(CT1981 CPnol972375'!8331183 F oppJ~3-Olipopeptlds Diadtap Protsitt-3 Clnol98=79169It07tb F opplL,t-OliQOpsptide Dindta? Psocsl~l CPnoi99ItlOtz31983 F oppD_1-Olipopeptide Pesmsase_1-ICT1l9) CPn0I00111017147868 F opp~i-Olipopeptide Pesmease_1-(CTt00) Cln0I01111161zt371s F oppD-oli0opepclde Transport ATPass-tCTI01) CPnO=02zt1715111500 F oppF-Olipopepcids Trtasporc ATPass-tCTtOI
ClelO=03-'25008 I510z F
craolotztsel7ztlooz F
csoozosztu3 It13z7 F
Clet0I0611610927161 F CTI03 hypothetical psoteia CPt10I07zt7I08111617 F ybhI/sodiT!-OxoQluearacs/Halace Translocatos-tGT20t1 CPa0I08111953z50602 F pi)cJ~.Z-Fructose-b-P Ptsosphotraaatesass_1-tCTI051 CPe0I09251036:51172 F
CPn0210252384 251140 R
CPn0211252756 252463 R
CPn0212254066 252888 A
CPn0213254342 254190 R
CPn0I14255657 254146 R
CPn0215257015 255759 R
CPn0216257608 257174 R
CPn0217257896 258579 F ypdP-(CT140) CPn0218259058 258582 R
CPn0219259357 260472 F tgt-pueuine tANA Ribosyl Transferase-(CT193) CPn0220260696 261238 F
CPn0221261657 262064 F
CPn0222262504 262842 F wak similarity to Hacteriophage CHP1 (Orl4>
CPn0223262956 263333 F
CPn0224263435 263674 !
.
Cpn0225263873 264541 !
CPn0226264566 261967 F
CPn0227265116 265009 R dsb8-Disulfide bond Oxidoreductase-(CT176) CPn0228266110 265412 R dsbG-Disulfide Bond Chaperone-(CT177) CPn0229266328 267560 F CT178 hypothetical protein CPn0230268253 267576 R CT179 hypothetical protein CPn0231268957 268253 R tauH-AHC Transport ATPase (Nitrate/Fe)-(CT180) CPa0232270122 269232 R similarity to 5~-Methylthioadenosine / S-Adenosylhosaeysteine Nucleosidase CPa0233270424 270218 R
CPn0234271240 270548 R CT181 hypothetical protein CPa0235271416 272177 F kdaH-deoxyoetulonosic Acid Syathetase-(CT182) CPn0236272156 273766 F pyre-CTP Synthecase-(CT1831 CPn0237273762 274214 F yggF Family-(CT184) ' CPn0238274303 27$838 F zwf-Glucose-6-P Dehyrogenase-(CT185) CPn0239275899 276672 F devB-Glucose-6-P Dehyrogenase (DevH family)-(CT186) CPa02d0277861 276698 R
CPn0241279354 278203 R
CPn02d2279918 279487 R
CPa02d3280555 280133 R
CPn0244280918 281556 F adk-Adenylate Kinase-(CT128) CPn0215281645 282499 F ydh0-Polysaccharide tiydrolase-Invasin Repeat Family-(CT127) CPn02d6282952 282551 R~ rs9-S9 Ribososial Protein-(CT126) CPn0247283615 282969 R r113-L13 Ribosomal Protein-(CT125) CPa02d8284327 283650 R ycfV/ybbA-AHC Transporter ATPase-(CT152) CPn02d9285841 28333 R CT151 hypothetical protein CPn0250286057 285902 R r133-L33 Ribosomal Protein-(CT1501 CPn0251286060 287559 F eonserved hypothetical protein CPa0252288112 287576 R CT144 hypothetical protein (frame-shift with 0253?) CPn0I5328856 287950 R CT144 hypothetical protein_1 CPn0254289262 288159 R CT143 hypothetical protein'1 CPn0255290165 289329 R CT142 hypothetical protein_1 CPn0256291264 290398 A CTl4d hypothetical protein_2 CPn0257292127 291267 R CTld3 hypothetical proteln,",2 CPn0258292531 292133 R CT142 hypothetical protein (frame-shift with 02591) CPa0259292986 292441 R CTld2 hypothetical protei~2 CPn0260294045 29358 R sec~l-Protein Translocase Subunit_1-(CTidl) CPn0261294302 295033 F ydn0-PP-Loop Superfnmily ATPase-(CT217) CPn0262295091 295933 F surf-Surf-like Aeid Phosphatase-(CT218) CPn0263296249 297136 F yQfU hypothetical protein-ICT221) CPn0264297730 297155 A ubiD-Phenylacrylate Decarboxylase-(CT2201 CPn0265298620 297730 R ubiA-Benzoate Oetaphenyltransferase-(CT219) CPn0266299184 299876 F
CPn0267300122 300910 F
CPn0268300935 301318 F
CPn0269302150 301476 R Dipeptidase-(CT138) CPn0270303325 302468 R ywlC-SuAS Superfamily-related Protein-(CT137) CPn0271303634 301362 F Lysophoepholipase esterase-(CT1361 CPn0272305233 304340 R dnaX_2-DNA Pol III Gamma and Tau_2-(CT187) CPn0273305844 305227 R tdk-Thymidylace Kinase-(CT1881 CPn0274308353 305852 R gyrA_1-DNA Gyrase Subunit A_1-1CT189) CPn0275310786 308372 R gyr8_1-DNA Gyrase Subunit H_1-(CT190) CPn0276311137 310793 R CT191 hypothetical protein CPn0277311910 311104 A
CPn0278312875 312060 R conserved outer membrane lipoprotein protein CPn0279313537 312875 A Posaibls ABC Transporter Pe:mease Protein CPa0280314572 313550 A dppF-Oipeptide Transporter ATPaee-(CT689) CPn0281315057 316103 F dhnA-Predicted 1.6-Fructose Hiphosph..-i Aldolase Idehydrin family)-(CT215) CPn0282316126 317529 F xasA/gadC-Amino Acid Transporter-(CT216) CPn028331897 317532 R
CPn0284319045 318551 R
CPn0285320595 319051 R
CPn018632=059 320650 R mgtE-Mq Transporter ICHS Domain)-(CT194) CPn0287321221 322089 R ' CPn0288325716 321571 R CT195 hypothetical protein CPn0289325812 326996 F aaaT-Neutral Amino Acid lGlutamate) Traruporter-(CTZ30I
CPn0290327042 328523 F Na-dependent Transporter-ICT231) CPn0291321667 3=9191 F incH-Inclusion Membrane Protein H-ICT232) CPn0292329118 329836 F incC-Iaclusioa Membrane Protein C-ICT233) CPn0293329919 332723 F CT234 hypoehecieal proteia CPn0291333092 333502 F eAMP-Dependent Protein Kiaase Regulatory Subuait-fCT=35) CPn0295333863 333627 R aepP-ACyl Carrier Protein-ICT236) CPn0296331765 331022 R labG-Oxoacyl lCarrier Procaia) Reductase-ICT237) CPn0297335697 334771 a fabD-Malonyl Acyl Carrier Tr~sacyclase-fCT238) CPn0298336721 335717 1t fabN-Oxoacyl Carrier Protein Synthase ZZZ-fCT239) CPn0299336816 337115 ) reeR-Recombination Protein-fCTZ40) CPn0300337783 340152 I yaeT-Omp85 Analog-fCT141I
CPn0301340250 340762 I' fCmpH-Like outer Membrane Protein)-fCT242) CPn0302340787 311866 I' lpxD-UDP Glueosamine N-Aryltransferase-fCT243) CPn0303342958 341921 F' CT211 hypothetical protein CPn0304343133 344158 F pdhA/odpA-Pyruvate Dehydrogenase Alpha-!0235) CPn0305341154 345137 I pdhe/odp8-Pyruvace Dehydrogenase Beta-(022461 CPn0306345145 346431 1 pdhC-Dihydrolipoamide Aeetyltra~leraae-102247) CPn0307348986 346515 1: glgP-Glycogen Phosphorylase-!02248) CPn0308349231 349596 F' simflarity to CT249 CPn0309350974 349595 R dnaA_1-Replication Initiation Protein_1-!02250) CPn0310353433 351049 R 60IM-60kDa Inner Membrane Proteia-!02251) CPn0311354438 353575 R lgt-Prolipoprocein Diacylglycerol Transferase-!0225=I
CPn0312354524 354976 F CT101 hypothetical protein CPa0313354990 355355 F acpS-Aeyl-carrier Pzotein Synchase-102100) CPa0314356285 355353 R trxe-Thioredoxin Reduccase-1020991 CPa0315356977 358716 F rsl-51 Ribosomal Protein-102098) CPa0316358820 360121 F nusA-N Utilisation Protein A-(02097) CPn0317360081 362750 F~ infH-Initiation Faecor-2-!02096) CPn0318363767 363126 F rbfA-Ribosome Binding Factor A-102095) CPn0319363175 363879 F truth-tRNA Pseudouridine Synthase-!02091) CPn0320363860 364783 F ribF-FAD Syntluse-(CTD93) CPn0321365858 364767 R ychF-GTP Binding Protein-!02092) CPn0322366219 367328 F yscU-YopS Translocation Protein U -!02091) CPn0323367331 369460 F lcrD- Low Calcium Response D-(02090) CPn032d369492 3.70688F lcrE- Low Calcium Response E-(02089) CPn0325370708 371148 F sycE-Secretion Chaperone-(02088) CPn0326371148 372725 F malQ-Glueanotransferase-102087) CPn0327372915 373211 F r128-L38 Ribosomal Protein-!02086) CPn0328373241 371992 F GT085 hypothetical protein CPn0329375088 376146 F Phopholipase D SuDerf~lY (leader f33) peptide)-!02084) CPn0330376675 376202 R CT083 hypothetical protein CPa0331378437 376701 R CT082 hypothetical protein ~
CPn0332378655 378536 R CNLTR T2 Protein-!02081) CPa0333379090 378800 R ltue-102080) CPn0334379311 379823 F CT079 similarity CPa0335379817 380671 F folD-Methylene Tetrahydrofolate Dehydrogenase-(02078) CPn0336380650 381591 F yojL-1020771 C>?n0337382027 381575 R smp8- Small Protein 8-102076) CPn0338383278 383375 F dnaN-DNA Pol III (beta chain)-iGT075) CPn0339383420 384030 F reeF-ABC superfamily ATPase-!02074) CPn0340383802 '384156F (frame-shift with 0339) CPn0341384160 384195 F (frame-shift with 0340) CPn0342384622 385062 F predicted OMP ;leader 119) peptide)-tCT073) CPn0313:84999 385595 F (frame-shift with 0342?) CPn0341387420 385558 R yaeL-Metalloprocease-(02072) CPn0315388572 387136 R yaeM-IGT071) CPn0346389675 388704 R cro0/ycgD-Integral Membrane Protein-!02070) CPn0317391021 389678 R croC/ytgC-Integral Membrane Protein-102069) CPn0348391803 391027 R troe/ytqH-ABC transporter ATPase-(020681 CPn0349392770 391790 R t:oA/ycgA-Solute Protein Binding Family-(0':067) CPn035J393181 39368 F CT066 ty~ochecscal Drotein CPn0351397888 395132 F adt_1-ADP/ATP Transloease_1-!02065) CPn0352395574 396830 F
CPn0353396893 397135 F
CPn0354397167 398507 F
CPn0355399889 398591 R
CPn0356400459 400109 R
CPn0357401317 400469 R
CPn0358401751 401578 R
CPn0359402012 403817 F lepA-GTPase-ICT064) CPn0360405358 403922 R gnd-6-Phosphogluconace Dehydrogenase-tCT063) CPn0361406647 405382 R tyrS-tyrosyl tRNA Synthecase-ICT062) CPn0362407825 407055 R fliA/rpsD-Sigma-28/WhiG Family-(CT061) CPn0363409688 407943 R flhA-Flagellar Secretion Protein-(CT060) CPn0361409966 410238 F ferd-Ferredoxin IV-(CT059) CPn0365410528 411544 F
CPn0366411976 412440 F
CPn0367413102 413836 F
CPn0368413790 114107 F
CPn0369414351 415562 F CT058 hypothetical protein_2 CPn0370415800 416912 F CT058 hypothetical procein_3 CPn0371417147 417503 F
CPn0372417687 418001 F
CPn0373418380 420218 F gcpE-ICT057) CPn0374420218 420961 F CT056 hypothetical protein CPn0375421121 411615 F
CPn0376421854 422294 F
CPn0377423438 422347 R suc8_1-Dihydrolipoamide Succiayltransferase_1-ICT055) CPn0378426168 423445 R aucA-Oxoglutarate Dehydrogsnase-ICT054) CPn0379426322 426765 F CT053 hypothetical protein CPn03H0426758 427876 F hemN_1-Coproporphyrinoqen III Oxidase_1-ICT052) CPn0381429809 428037 R CT326 similarity CPn0382430719 470036 R yabC/yraL-SAM-Dependent Methytransferase-(CT048) CPn0383431693 430749 R CT047 hypothetical protein CPn0384432377 431862 R hcte-Histone-like Protein 2-(CT016) CPn0385434018 432522 R pepA-Leuryl Aminopeptidase A-fCTOdS) CPn0386434525 434046 R ssb-SS DNA Binding Protein-ICTOd4) CPn0387435196 431699 R CT043 hypothetical protein CPn0388435329 437320 F qlgX-Glycogen Hydrolase Idebranchiag)-ICTOd2) CPn0389438134 437319 R CTOdl hypothetical protein CPn0390439144 438134 R ruvH-HOlliday Junction Helicase-(CTOdO) CPn0391439692 439510 R
CPn0392439811 440383 F dcd-dCTP Deaminase-fCT039) CPn0393440379 440723 F CT038 hypothetical protein CPn0394440736 441968 F tlyC_1-CBS Domain protein (Hemolysin Homolog)_1-fCT256) CPn0395441964 443175 F CT257 hypothetical protein CPn0395444353 443241 R yhf0-NifS-related protein-ICT258) CPn0397445115 444381 R PP2C phosphatase family-tCT259) CPn0398445533 445700 F
CPn0399445879 446523 F CT253 hypothetical protein CPnOd00446536 447306 F CT254 hypothetical protein CPnOd01117881 417195 R CT255 hypothetical protein CPnOd02448994 447888 R mutt-Adenine Glycosylase-fCT1071 CPnOd03449015 419710 F yceC-predicted pseudouridine synthetase ~ family-(CT106) CPnOd04450887 419871 R
CPa0d05451739 450966 R CT105 hypothetical protein CPn0406451969 452865 F fabI-Enoyl-ACyl-Carrier Protein Reductase-fCT104) CPnOd07453742 452858 R HAD superfamily hydrolase/phosphatase(CT103) CPnOd08454105 454581 F CT102 hypothetical protein CPn0109154645 455127 F CT260 hypothetical protein CPn0410455123 455833 F dna~l-DNA Pol III Epsilon Chain_1-ICT261) CPnOdll455833 456609 F CT262 hypothetical protein CPn0412456590 457246 F CT263 hypothetical protein CPn0413459203 457227 R msbA-Transport ATP Binding Protein-ICT264) CPn0414460113 459172 R accA-ACCOA Carboxylase/Transferase Alpha-fCT265) CPn0115461498 160221 R CT266 hypothetical protein CPn0416461856 461557 R himD/ihfA-Integration Host Factor Alphn-tCT267) CPn0117463035 462244 R nmiA-N-Acetylmuramoyl Alanine Amidase-fCT268) CPn0118464401 462953 A murE-N-ACetylmuramoylalanylglutamyl DAP Lipase-tCT269) CPn0419466834 464876 R pbp3- transqlyeolase/transpeptidase-tCT270f CPnOd20467108 466824 R CT271 hypothetical protein CPn0121467998 467108 R yabC-PHP2H Family.methylcransferase-ICT'272) CPn0122db8242 46.8784F CT273 hypochatical protein CPn0423468791 469216 F CT271 hypothetical protein CPn0t2d169612470961 F dnaA_2-Replication Initiation Factor_c-ICT1751 CPn0425470980!71564 F CT276 hypothetical proteins CPn0426472111471536 R CT277 similarity CPn0427472207473715 F nqrZ-NJ1DH fUbiquinonel DehydroQenase-;CTZ781 CPnOt2847372247681 F nqr3-NJ1DH Itlbiquinonel Oxidoreductass, Gamma-tCT2791 CPn0129471681475319 F nqrl-NADH It7biquinonel Reduetase 1-fCT1801 CPn0130475326476093 F nqr5-N1~DH ttlbiquiaonel Reductase 5-ICT281) CPn0131476183176151 R
CPn0t32176816476514 R
CPn0133477273476929 R QesH-Glycine Clsavape System H Protein-ICT2821 CPn0134179462477276 R CT2B3 hypothetical Drotein Cln0t3548090247975 R Phospholipase D superfamily (uncleavable leader peptide)-(CT38t CPn0t36481618180902 R lpl~-LiDoau Protein LiQase-Like Protein-(CT2851 CPn0137481816184350 F clpC-ClpC Protease-ICT1861 CPn0138185116181334 R yebF-PP-loop superfamily aTPase-ICT287) CPn0139485553486077 F
CPn0ta0486105486710 F
CPn04t1486891187838 F CT007 hypothetical protein CPn0t42188013188528 F Ct006 hypothetical protein CPn043!88729189979 F CT005 hypochecieal protein CPn0114190187191507 F mnp_6-POlymorphic Outer liembrasse Protein G/I Family CPn0115194772197579 F pn~_7-Polymorphic outer ltembraae Protein C Family CPn0446197626500115 F pmD_8-Polymosphic Outer Hembrane Protein G Family CPn0147500568503351 F ps~ 9-Poiyaarphic Outer Membrane Protein G/i Family CPn01t8501810503698 R yxjC~s_2 Hypothetical Protein CPn01t9507131505330 R pmp_10-P!!P_10 tlrame-shift with 0151) CPn0150508112507180 R pmp_10-POlyaasphic Outer Membrane Proteia G Family CPn0t51508275511058 F ymp_11-Polyaasphic Outer !lembrane Protein C Family CPa0152511319512860 F pmp_12-POlymorphie Outer Hembrans Protein 11/I Famfly ltruncated) CPn0453513234516152 F pmp_13 -POlymorphic Outer Hembrane Protein C Family CPn015d516182519115 F pmp_14-POlymorphic Outer Membrane Protein H Family CPn0155520348519458 R
CPa0t56521532520337 A
CPn015751386552=120 R
CPn0458526310521136 R
CPn0t59517005526619 R
CPn0460527840526992 R
CPn0461528638527811 R
CPa0t6Z531052519037 R
CPn0463532357531191 R
CPn0t64531842532366 R
CPn0465533212532871 R
CPn0466533724536537 F pa~_15-Polymosphic outer Membrane Protein E Family CPn04b7536633539434 F pop_16-Poiymorphic Outer M~bsane Protein E Family CPn0168539632540132 F pmp_17-Polymorphic Outer Membrane Proteia E Family CPn0t69540399511160 F pmp_17-POlymorphic Outer Membrane Protein (Frame-shift with 01691 CPn0t705!1357512532 P pmp_17-Polymorphic Outer Membrane Proteia (Frame-shift with 01701 CPn0t715!2564515401 F pn~_18-Polymorphic outer Membrane Protela EIF lamily CPn0t72517905515581 R
CPn0473519593548070 R
CPn0171551573519807 R CT365 hypothetical protein CPnOt755538!4551685 ~ Q198-Gluean Hranchir~ Lnzyme-ICT8661 R
CPa0176551844553858 R CT865 hypothetical proteia CPn0t77556106551814 R yqsV_8s Hypothetical Protein CPn0478557615556210 R hilX-GTP 8indiaQ Protein-tCT3791 CPn0179558125557616 R phnP-Metal Dependent Nydrolase-ICT3801 CPa0t80559301558650 R CT383 hypothetical protein CPn0t81560946559339 R
CPa0482561737560961 R artJ-7lrQinina Periplasmic 8indinQ Protein-tCT3811 CPn0t8356183656961 F
CPn0484564970565824 F aroC-Deoxyhepconats Aldolsse-ICT3B21 CPn0t85566038566129 F CT382.1 hypothetical protein CPa0t86567781566105 R hypothetical proline permease CPn0487569740568112 R CT384 hypothetical protein CPa0t88570096569767 R hitA-HIT Family Nydrolase-ICT3851 CPnOt89570965570096 R CT386 hypothetical protein CPn0490571279573333 F CT387 hypothetical pzotein CPn0191571352577336 R CT389 hypothetical Drotein CPa019Z571652571804 F
CPn0193575004571855 R
CPnOtS1575364575146 R
CPn0495575607576793 F aspC-l~spartate Aminotran:ferase-ICT3901 CPn0196576793 57712 F CT391 hypothetical protein CPn0197571069 5771=0R CT388 hyposhscical protein CPn0198579035 5705 R
CPa0199580359 579=05R
CPn0500580559 581363F pros-Prolyl tRNA Synchetass-ICT393) CPn0501SA=57 563550F hreA-HTH Transcrzpcional Repressor-ICT39t1 CPa0502563550 SA1=01F qrpt-HSP-70 Colactor-ICT395) CPa05035613 55113 F dasK-HSP-70-tCT396) CPa050t56587 56151 F vacD-riboaueluse family-ICT397) CPn0505586519 SA9105F 3-aeehyladsnins DNA qlycosylass CPn0506569172 56940 E CT4=1 hypothetical protein CPn0507589961 590112F CT121.1 hypothetical protein CPa050A59012 590300F CTt=1.= hypoctsetical protein CPn0509590335 590108F IDredietsd Metallosazyme)-ICTt33) cPn0510590113 591973F ClyC_3-C8S Damaias tHamolysin homoloq)_2-ICTt33I
CPn051159111 59118 F rsbV_1-Siqsa Rspulatory Factor_1-ICZIII) Cpn051259=553 59113 F CT115 hypothetical pzoteiss CPn0513591517 593753F Fs-8 oxidorsduetass_1-ICT=6) CPn051d5957=9 596!=0F Ct117 hypothetical protein CPn0515595192 597111F obit-Ubiquiaone Mschyltraa:fsrase-ICT138) CPa0516598111 597255R
CPa0517599531 59795 R
CPa0518600103 59933 A CTt29 hypothetical protein CPa051960167 60090 R dap!-Diaminopioulace tpimerass-ICTt30) ~
CPa0520601=18 601616R elpP-CLP Protuss-lCTt31) CPn0511603797 60331 R qlyA-Ssrine Hydsoxymsehylcraasferass-tCTt37) CPa0522503987 601655F CTt33 hypothetical protein CPa0523604733 505052F
CPa051t605103 606179F
CPn0525505532 607=83F CT398 hypothetical protein CPn05Z6601696 607710R yrbH-GutO/lCpsd Tamily Suqar-P Isomerase-ICT399) Cln0527609l0a 607=6 R sucs_Z-Dihydrolit~oasids Succinyltrsnsferase_2-tCTt00) CPa0538611162 509931R qltT-tilutaaate Sympore-tCT101) CPUQ5I961==59 511165R yeah-ATPass-IGTtOt) CPn0530613=51 61160 R spotJ_1rRNA Hstlsylass_1-ICT103) CPn0531511069 613315R S1v!! dependent msthyltransfsrus-1CT101) CPa0532611674 61075 R ribC/risA-Riboflavin Syutbaas-ICT105) CPn0533611930 61335 F~ CTt05 hypothetical protein CPa053t515113 51578 F dksA-Dnalc Suppressor-lCTt07) CPn0535615793 616395F lspA-Lipoprotaia Sisal Peptidase-tCT108) CPn0535616315 617591F daQA_1-D-Ala/Gly Psrmsase_1-tCTt09) CPa0537617633 611169F CTtll.l hypothetical protein Cln0538618212 51511 F C?d1t hypothetical protein CPn0539616705 611515F pmp_19-polysoorphic outer membrane protein A family -ICT112) CPn0510521590 626862F pmp_20-polymorphic outer membrane protein a Tamily-ICT113) CPa05t1617170 6=003 F Solute binding protein I-ysbL-Synschoeyscis Adheein Haeoloq)-tGTtlS) CPa05t2526003 6=737 F JIaC Transporter ATPass-1CT116) CPa0513531735 619603F IMStal Traosporc Protein)-iCTtl7) CPn051a630529 629525A yhbL-GtP binding protein-tCTtlA) CPa05t5630tea 630533R r117-L=7 ribosomal protein-tCTtl9) CPn0516631=Z9 630911R rlll-LZl Ribosaul Protein-IGTt=0) CPn05t7631661 631188~ yqbs family-ICT131) F
CPn0518533=31 631191R eysJ-Sulfite RsductaseICT435) Cln0519633669 ' 53355R rsl0-SIO Ribosomal Protein-ICTt35) CPn0550635561 633560R lusA-tloaqation Factor G-tCTt371 CPn0551638166 635596R rs7-S7 Ribosomal Protein-tCTt381 Cltt0552635587 535=19R rsll-512 Ribosomal Protein-ICT1391 CPn0553537717 53812 R
CPa0551637651 636111F C?tt0 hypoehstieal protein CPn0555531=9B 50211 F tsp-Tail-SDseific Protusr ICTtl1) CPa05566t091~ 610325A cspA-lSkDa Cysteins-Rich Protein-ICTt1=) CPSf055761161 611191R omcD-60kDa Cysceins-Rieh Outer Membrane Complex Protein-tLTtl3) CPn0558613300 613031A omcA-9kDa-Cyscsine-Rich outer Membrane Complex Lipoprotein-ICTttt) CPn0559613712 53927 F CTt41.1 hypothetical prouin CPn0560515612 611098R qlGX-Clutamyl-cRNA Synchetass-ICTtlS) CPn056i6510 6571 R euo-CHLPS too Protein-ICTtt51 CPn056268036 615918R CHLPS t3 k0a prouin honwloq_1 CPn0563650056 611=97A recJ-ssDNA txonucleaas-tCT117) CPn0561651350 650115R seeDisseF-Protein Export Proteins SeeD/SeeF
Itusionl-ICTItB) CPn0565655530 65533 R CTIt9 hypothetical Drocein CPn056665511 656890F yaeS family-tCTt50) CPn0567655191 657817F cdsa-PhospMCidacs Cytidylytransferaes-lC:t51) CPn0568657817 658161 F cdsA-Priosphacidaee cytidylytransierast-lCTt52) Cln05696516 659099 F plat-Glycerol-3-P Aeylcranslesue-ICT153) CPa0570659107 660789 F arg8-Argsnyl tJtNA Transierase-ICT451) CPn0571662122 660719 R musA-tJDP-N-Aeetylglucosamine Transierase-ICT1551 CPn0572662352 661616 F CT156 hypothetical protein CPn0573665101 661191 R yebG lamily-ICT157) CPn057t665915 665391 R
CPa057566619 665182 R YhhY-Amino Group Acetyl Transisrast-(CT58) CPn0576667513 666191 R pri8-Peptide Chain Release Faecor 2 tnacural tTGA irawe-shift )-(CT155 Cla0576657598 667530 R pri8-Inatural UGA trams-shift 1 CPe105776b7195 561155 F SWIG tYH7t) coaoplex protein-ICT601 CPa0578668106 689365 F yaeI-phosphohydrolase-(CTt61) CPn0579bbl361 669993 F ygbP/yaeH-Sugar Nucleotide Phosphorylase-fCTt6Z) CPn0580669993 670793 F truA-Pseudouridylate Syntbase I-ICTt63) CPn0581b7113t 670715 R Phosphoglycolace Phosphatase-(CTt6t) CPa058Z671503 672177 F CT165 hypothetical Drotsln CPn0583671100 671717 F CT166 hypoehetieal protein CPn0584671707 673798 )' aco8/atr8-Z-Component 8ansor-ICTt67) GPa0585675817 673855 F: similarity co Cps laeA_Z
CPa0586676026 677183 F' atoC/ntrC-Z-Component Regulator-fCTtbB) CPn0587677ta1 671121 F yvyD~s conserved hypothetical protein CPa0588678081 6786=6 F' CTt69 hyposbetieal protein CPn0589671610 679795 F CT470 hypothetical proctin CPn0590680112 679516 F CTt71 hypothetical protein CPa0591680373 681010 F yagE family-tCTt7Z) CPn059Z681153 611161 F yidD family-(CTt73) CPn0593682176 681391 F. CTt7t hypothetical protein CPn059468=583 681958 F pheT-phenylalaayl tRNA Synthetase Beta-(CTt751 CPn0595611958 615926 F CT176 hypothetical protsin CPe10596615939 61bt57 F ada-mecbyltraasierase-(CTt77) CPn0597681215 685179 R oppC~-Oligopeptide Psrmeast_Z-(CTt78) Cla0598619697 611=19 R opp8_Z-Oligopepcide Ptsmease_Z-tCTt79) CPn0599691802 681882 R oppl~5-oligopeptide 8indiag Lipoprotein-,5-(CT110) Cln0100693117 691137 R
CP80i01693053 69=736 R CTt83 hypochetieal protein GPn0i02691105 693101 R CTtet hypotheeical protein CPa0603691305 695115 F hmZ-Fsrroehecalase-(CT415) Cla060t695115 615196 A~ iliY-Glucamiae 8lading Procsin-(CTttb) Cla0605691707 696150 R yhbd-Ilethylase -iCT187) CPa0606617111 691707 R CTtlB hypothetical protsin CPn0607698195 697573 A glpC-Olueose-1-P Adenyltransierase-1CT119) CPn0608691615 699016 R -pyre-tJrid3ne 5'-HOnophosplsate Syntbass It)a~ Sy~sthase)-truaeatad7 CPn0609699705 699916 F CTt90 hypochseical protein GPn06i0T01tZ0 700029 R rho-Traascripcioa Terraisucion Factor-fCTt91) CPa0611702025 701120 R yacE-predicisd phosphatase/kinase-(CTt9Z) CPn0612701631 701022 R polA-DNA Polys~srise I-ICTt93) CPa0613705656 701651 R soh8-Proteasr 1CT194) GPa0611707102 705713 R adt~-ADP/ATP Transloease_Z-FCT195) CPn0615701137 707634 R pgsA_1-Glycerol-3-P Phosphatidyltransisrase_1-fCTt96) Cla0516708791 710137 F dnaD-Replieatlw DNA Heliease-ICTtl7) CPn051771081 732316 F gidA-FADdependtac oxidoseduetase-ICTt98) CPn061B711306 713010 F lplA-Lipoace-Protein Lipase A-ICTtl9) Cln0619713114 713013 R ndk-Nucleoside-Z-P Ftinase-(CT300) CPa0620711139 717519 R ruvA-Holliday Junction Heliease-1CT301) CPa06Z1711617 711111 R ruvC-Grosswer Junction Endonuclease-fCT502) CPn05Z2715752 711793 R CT503 hypocheeieal protein CPn0633716993 7161b3 R CTSOt hypothetical protein CPn0631711015 717011 R gapA-Olyeeraldthyds-3-P DehyroQetfase-)C'i"305) CPn06ZS711115 711060 R r117-L17 Ribosomal Procsin-ICT506) CPn0iZ6711616 718495 R rpoA-RNA Polymerase Alpha-(CT507I
CPa06Z7720018 719610 R rsll-S11 Ribosomal Protein-ICT508>
CPit06Z8720128 720063 R rsl3-513 Ribosomal Protein-ICT509) CPa06Z9721157 720117 R seeY-Translocase-ICT510) ' Cla0630:22316 721815 R r115-L15 Ribosomal Protein-(GT511I
C1n0631722106 722312 R rs5-S5 Ribosasul Protein-ICTS1Z) CPn0632723195 721127 R r111-L18 Ribosomal Protein-fCT513) CPn0633723757 733209 A rib-L6 Ribosomal Protein-lCT511) CPn063172115 7=3717 R rs1-S8 Ribosomal Protein-ICT515) CPn0635721715 721206 R rl5-LS Ribosomal Protein-ICTS16) C?n0536725012 721750 A rlZt-L21 Ribosomal Protein-(Cl'S171 CPn0637725161 ~Z1099 R r111-L11 Ribosomal Protein-(C:5-8) CPn053B~Z57t7 725190 R rsl7-817 Ribosomal Procein(C519) CPn0539725958 725743 R r129-L29 Ribosomal Protein-fCT520) CDn0640725377 725961 R r116-L16 Ribosomal Protein-fCT521) CPn0541727077 725109 R rs3S3 Ribosomal Protein-fCT522) CPn0542727428 727096 R r122-L22 Ribosomal Protein-fCT523>
CPn0643727713 727450 R rsl4-519 Ribosomal Protein-fCT521I
CPn05t4728573 727722 R r12-L2 Ribosomal Protein-(CT525) CPn0545728930 728598 R r123-L23 Ribosomal Protein-fCT526>
CPn0546729621 728950 R r14-LI Ribosomal Protein-fCT527) CPn0647730331 729657 R r13-L3 Ribosomal Protein-fCT328) CPn0518731603 730605 R CT529 hypothetical protein CPn0649732572 731710 R fmc-Nechioryl eRNA Fornylcransferase-fCT530) CPn0650733501 731665 R lpx~1-Aey1-Carrier tIDP-GlcNAe -fCT531) CPn0651733975 733317 R fabt-!lyriseoyl-hcyl Carrier Dehydratase-fCT532) CPn0652731835 733990 R lpxC-Myriscoyl GlcNae Deacttylase-fCTS33) Cla0653736490 731868 R eucE-Apolipoprotein N-l~cetyleransferase-fCT534) CPn065t735957 735503 R vdlD/yciA-aeyl-CoA Thiossterase-ICTS35) CPn0655737847 737101 R dnaQ_2-DNA Pol III Lpsilon Chain_2-fCT536) CPn0656737872 738048 F
CPn0657738473 738051 R yjeE (I~TPase or Kinase)-fCT537) CPn065A739168 738455 R CT538 hypothetical proton CPn0559739533 739838 F trxh-Thioredoxin-ICTS39) CPn0660710327 739860 R spoD_2-rRNa Ntthylass_2-fCT540) CPn0661741100 740327 R mip-FKeP-type pepcidyl-prolyl cis-crane isomerase-;CTStl) CPn0662742923 741172 R asps-l~spartyl tRNA Synthetase-fCT5t2) CPn0563744190 742901 R hiss-Hiscidyl tRNR Synthetase-fCT5t3) CPn0664744757 744557 R
CPa0665745001 716365 F uhpC-Hexosphosphate Transport -fCT541) CPn0666746388 750107 F dnaE-DNA Pol III Jllpha-fCT515) CPn0567751058 750177 R predicted 0lIP (leadar f17)-fCT516) CPn0558751209 752162 F CT547 hypothetical protein CPn0559752179 752775 F CT548 hypothetical protein CPn0670732765 753196 F rsbN-sigma regulatory factor-hiscidine kiaase-fCT519) CPn0571753530 753205 R CT550 hypothetical protein CPn0672753741 755018 F dacF(pbp5)-D-hla-D-Ala Caroxypeptidase-fCT551) CPn0673755287 755163 F CT552 hypothetical protein CPn0574755568 755577 R fmu-RN1~ Hechyltransfezase-fCT553) CPn0675757919 756768 R CT69b hypothetical protein CPn0676759217 758051 R~ homologous to CT695 CPn0677750401 759256 R
CPn0678751320 760582 R
CPn0679762930 761725 R pqk-Phosphoglyesrate Kinase-fCT693) CPn0580764248 762971 R yqo4-Phosphate Permeast-ICT692) CPn0681764929 764258 R CT691 hypothetical protein CPn0582761984 765955 F dppD-A8C ATPaee Dipeptide Transport-fCT690) CPn0583765948 766919 F dppF-A8C ATPase Dipeptide Transport-ICT6891 CPn0684768038 767181 R spoJ/par8-Chromosome Partitioning Protein-fCT588) CPn0585768068 768217 F
CPn0686758361 768176 R
CPn0687758564 769214 F CT482 hypothetical protein CPn0688769382 770137 F CT481 hypoehacieal protein CPn0689771104 770187 R yfh0_1-NilS-related Jlminotransferast_1-ICT687) CPn0590772580 771136 R AeC Transporcsr tiembrane Protein-fCT685) ~
CPn0691773452 772685 R abcX-R8C Transporter llTPase-fCT685) CPn0592774912 773161 R J18C Transporter-fCT6Bt) CPn0593776256 775240 R TPR Repeats to-Linked G1CNJIC Tzansferase similarity!-fCT683) CPn0594779599 776330 R pbp2-P8P2-cransqlycolase/cranspepcidase-fCT582) CPn0695780216 781382 F ompA-Major Outer Nambrane Protein-fCT681) CPn0696781769 782599 F rs2-S2 Ribosomal Protein-ICT5801 CPn0697782602 783447 F csf-Elongation Factor TS-ICT679) CPn0698783458 784201 F pyres-UHP Kinase-fCTB79) CPn0599784182 784721 F rrf-Ribosome Releasing Factor-ICT677) CPn0700785097 785609 F CT676 hypothetical protein CPn0701785599 786672 F karG-Arqinine Kinase-fCT675) CPn0702789685 786929 R yscC/qapD-YOp C/Gen Secretion Protein D-fCT67d) CPn0703791190 789685 R pkn5-S/T Protein Kinase-fCT677) CPn0704792321 791209 R fllN- Flaqellar Motor Snitch Domain/YSeQ
family-fCT672) CPn0705793173 792334 R CT671 hypothetical protein CPn0706793683 793180 R CT670 hypothetical protein CPn0707795029 793704 R yscN-Yop N lFlaqellar-Type ATPase)-fGT569) CPn0708795705 795034 R CT668 hypochecicnl protein CPn0709796188 795742 R CT667 hypothetical protein CPn0710796461 796210 R CT666 hypott:ecical protein CPn0711796771 796186 R CT665 hypochecicai protein CPn0712799315 796781 R FICA domain: homology to adenylace eyelase)-fCT664) CPn0713799721 799332 R CTb63 hypothetical protein CPn0714801107 800091 R haM-Glutamyl tRNA Reductase-1CT662) CPn0715801657 803462 F gyre_2-triJA Gyrase Subunic 8_2-1CT661) CPn0716803469 801902 F gyrA_2-DNA Gyrase Subunit A_2-fGT6601 CPn0717805010 805306 F CT656 hypothetical protein CPn0718805309 805626 F CT657 hypothetical protein CPn0719805916 806890 F sth8-lPseudouridine Synthase)-1CT658) CPn0720807003 807236 F CT659 hypothetical protein CPa0721807683 808489 F kdsA-KDO Synthetase-1CT6551 CPa0722808489 808974 F CT654 hypothetical protein CPn0723808984 809703 F yhbG-AHC Transporter ATPase-(CT653) _CPn0724810527 809706 A
CPn0725810811 810387 R C?652.1 hypothetical procsin CPn0726813372 810880 R CT620 hypothetical protein CPn0727813577 816192 F CT619 hypothetical protein CPn0728818477 816525 R CIiLPN 76k0a HomoloQ_1 (CT6221 CPn0T29819857 818592 A CHLPN 76kOa somolog_2 tCT623) CPn0730821603 818963 R mviN-Integral !lembrane Protein-(CT624) CPn0731821587 821760 F
CPn0732822098 822976 F ato-Endonuclease IV-(CT625) CPn0733823727 823101 R rs4-S4 Ribosomal Protein-fCT626) CPa073d823914 824915 F yceA-ICT627) CPn0735825668 825003 R pyrH/udk-Uridine Kinase fUridine lionophosphokinase) (Pyrimidine Ribonucleoside Kfnasel.
CPn0736827686 825992 R ygeD-Lttlux Protein-(CT641) CPn0737827685 830756 F recC-Exodeoxyriboauclease v, Gamma-fCT640) CPn0738830746 833895 F race-Exodeoxyribonucluse V, Heta-(CT639) CPn0739834871 833861 R CT638 hypothetical protein CPn0740836018 031861 R tyr8-Aromatic 871 Aminotransterase-(CTb37) CPt10741838350 836185 R greA-Transcription Elongation Factor-(CT636) CPn0742838463 838888 F CT635 hypothetical protein CPn0743838962 840762 F aqzA-Vbiquinone Oxidoraduccase. Alpha-(CT631) CPa0714841384 840389 R heutB-POZphobilinogen Synchase-(CT633I
CPn0T45841903 841742 R
CPn0T46841975 843567 F CT632 hypothetical protsin CPn074783675 843740 F~ CT631 hypothetical protein CPn0747843725 843910 F CT671 hypothetical protein (frame-ahitt) CPn0748844987 844121 A ispA-Geraryl Transtransterase-(CT628) CPn0719845629 845006 R glsW-VDP-GlcNAC Pyrophosphorylase-ICT629) CPa0750846411 845707 R tctD/epxR-fiTH Transeriptional Regulatory Protein Receiver Doman-CPn0751846606 848434 F CT651 hypothetical protein CPn0752848601 850082 F reeD_2-fxodeoxyribonuelease V, Alpha_2-(CT6521 CPa0753851006 850161 R
CPn075d851336 851040 R rs20-S20 Aibososul Protein-(CT617) CPn0755851597 852799 F CT616 hypothetical protein CPn0756852961 854676 F rpoD-RNA POlymersss Sigma-66 -(CT615) CPn0757854733 855134 F tolX-Dihydroneopterin Aldoiase-(CT614) CPn0758855110 856459 F tolP/dhpS-Dihydropteroate Synthase-ICT613) CPn0759856488 856997 F tolls-Dihydrotolace Reduecase-(CT6121 -CPn0760856957 857694 F CT611 hypothetical protein CPn0761857704 858375 F CT610 hypothetical protein CPn0762859597 858539 R recA-ReG reeos~bination protein-(CT650) CPn0763860511 859972 R ygtA-FOrmyltetrahydrotolace Cycloligase-fCT649) CPn0764861807 860524 R CT648 hypochscical protein CPn0765862382 861801 R CT647 hypothetical protein CPn0766863782 862394 R CT646 hypothetical protein CPn0767863881 864177 F CT645 hypothetical protein CPn0768864159 865163 F yohI/nir3-predicted oxidoreduccase -(CT644) CPn0769867733 865121 R topA-DNA Topoisomerase I-Fused to SWI Domnin-fCT643) CPn0770868340 869131 F CT642 hypothetical protein CPnOT71870163 869144 R rpoN-RNA Polymerase Sigma-54-(CT609) CPn0772872385 870469 R uvrD-DNA Nelicase-fCT608) CPn0T73872188 873195 F ung-Vracil DNA Glyeosylase-fCT607) CPn0774873195 873425 F CT606.1 hypothetical protein CPn0775871031 873414 R yggV family-ICT606) CPn0776874246 875487 F CT605 hypothetical protein CPn0T77875601 877178 F groEL_2-heat shock protein-60 -fCT604) CPn0778877505 878092 F tsa/ahpC-Thio-specific Anuoxidanc (TSA) Peroxidase-(CT6031 CPn0779878481 878095 R CT602 hypothetical protein CPn07A0179205 871591 R papQ/amie-N-ACetylmuramoyl-L-111a Amidaae-CT601) CPn0781879773 179191 A pal-PeDtidoqlycan-Associated Lipoprotein-ICT6001 CPn0782181065 879773 R tolH-polysaccharide transporter-ICTS991 CPn07AJ881115 881100 R CT59A hypothetical protein CPn07B1812296 881892 R exbD-8iopolymer Transport Protein-ICT5971 CPh0785812991 881296 A exb8/tolQ-polysaccharide transporter-GT5961 CPa0786883185 815293 F dsbD/xprA-Thio:disulfide Interchange Protein-CT595) CPa07A7885619 116401 F yabD/ycl:!-PHP superlamily luruse/pyrimidinasel hydrolase-ICT5911 CPa07A8816542 887432 F sdhC-Succinace Dehydroqenase-fCT593!
CPa0789887139 889316 F sdhA-Succinate Dehydroqenase-ICT592f CPn0790889330 890103 F sdhe-Succinace Dehydrogenase-ICT5911 CPn0791893050 190111 R CT590 hypothetical proceia CPn0792894919 893108 R CT5A9 hypothetical protein CPn0793196123 894919 R rbsU-sigma regulaeory family protein-PP2C
phosphatase IRSbW
ancaqoniscl-ICT5881 CPa0791897171 898001 F
Cla0795891128 899195 F
CPn0796899301 901310 F
CPn0797901600 902694 !
CPa0791902116 903156 F
CPa079990916 903910 R
CPn0800906532 905249 R eno-ISSOlase-ICT587) CPa0801908697 906727 R uvrn-Fxiauclease AeC Subunit H-ICT5861 CPn0102909740 908709 R CrpS-Trypeophanyl CRNA Synthetaae-(CT5151 CPn0A03910303 909752 R CT58d hypothetical protein CPa010d911059 910310 R qp6D-CitLTR Plasmid Paraloq-ICT583) CPn0105911831 911067 R miaD-chromosome partitioning ATPase-CHLTR
plasmi.d protein GPSD-ICT5ti2) CPn0106913771 911867 R thrS-Threo~l tRNA Syachecaae-ICT5811 CPn0A07913971 91879 F CTSAO hypothetical proeein CPn010A916287 914956 R CT579 hypothetical protein CPn0A09917785 916307 R CT578 hypothetical protein CPn0110918111 917825 R GT577 hypothetical protein CPn0111918900 918308 R lesti_1-Low Ca Response Proeein H_1-ICT5761 CPa0812919123 910162 F mucL-DNA tdiamstch Repair-ICT5751 CPa.0A13920870 921934 F pepP-Aminopeptidase P-ICTS7df CPn011d922107 933357 F CT573 hypothetical protein CPn0815923361 9=5622 F gspD/pilQ-Gen. Secretion Protsfn D-ICT5721 CPa0A16925615 927102 F~ gspE-Gen. Secretion Protein E-fCT571) CPa0A17927115 928287 F gspF-Gea. Secretion Protein F-ICT570I
CPa081B928314 92868? F predicted OtiP (leader 1161 peptide)-CT5691 CPn0119928619 929132 F CT56A hypothetical protein CPa0820929120 929659 F CT567 hypothetical protein CPn0821929667 930668 F CT566 hypothetical protein CPn0122930756 931229 F CT565 hypothetical protein CPa0823932367 931501 R yscT/spaR-YopT Tranlocation T-ICT5641 CPa0121932662 932378 R yscS/IliQ-YOpS/IliQ Transloeation Protein-fCT563) CPn0A25933594 932677 R yscR-YOp Transloeation R-ICTSB2) CPn0826934310 933612 R yscL-YOp Ti:anslocation L-ICT5611 CPn0127935264 934434 R CT560 hypothetical protein CPa0828936771 935267 R yacJ-Yop Traaslocation J-ICTS59) CPn08299367da 937298 F
CPn0830937441 937959 F
Cln0831938267 938434 F
CPn0132939747 938827 R lipA-Lipoace Synchetase-ICT55A1 CPn0A33941129 939747 R lpdA-Lipoamide Dehydrogenase-ICT557) CPa0A31941553 942014 F CT556 hypoehecieal protein CPn0835915689 962015 R motl_1-SWI/SNF family helicase_1-ICTS551 CPa0A3696879 95722 R brnQ-Amino Acid (Branched) Transport-iCT55d1 CPn0837917771 917115 R nth-F.nodnueluse III-ICT697) CPa0838949106 97781 A thdF-Thiophene/Puran Oxidation Protein-1CT698) CPn0839949257 950159 F psdD-Phosphatidylserine Oecsrboxylase-ICT699) CPa0Ad0950222 951541 F CT700 hypochetlcal protein CPn08d1951771 95640 F secA_2-Translocase SecA_2-ICT701) CPa01d2954883 954710 R CT702 hypothetical prosain Ilrame-spilt with 0843) CPn0813955191 951991 R CT702 hypothetical protein CPn08dd956730 955270 R yphC-CTPase/CTP-binding protein-ICT703) CPn0A15951079 956150 R pene_1-Poly A Polymerase_1-fC:70d1 CPn0816959371 958112 R clp%-CLP Protease ATPase-ICT7051 CPn0817959995 959387 R clpP-CLP Protease subunit-ICT7061 CPa0811961502 960177 R tig/murI-Triqqar factor-pepcidyl-prolyl isomerase-ICT707) CPn0819961781 965285 F ~tl_2-SWI/SNF family heliease_2-ICT7011 CPnC85996529) 966390 F m:eB-Rod Shape Proceirt-sugar %inase-1GT7091 CPn0A51 966396 96A195 F pckA-Phosphoenolpyruvate Carboxykinese-ICT710) CPn0A5Z 968316 970613 F CT711 hypothetical protein CPn0853 970637 971A03 F CT712 hypothetical protein CPn0A54 972837 971806 R ompB-Outez Membrane Protein B-ICT713) CPn0855 973995 972994 R gpdA-Glycerol-3-P Dehydrogenase-fCT711) CPn0856 975377 973995 R Apx-1 Homolog-VDP-Glucose Pyrophoaphorylase-tCT715) CPn0857 975757 975392 R CT716 hypothetical protein CPn0858 977055 975757 R tliI-Flagellum-apeeitic ATP Synthase-(CT717) CPn0A59 977588 977055 R CT7I8 hypothetical protein CPn0A50 978630 977608 R tliF-Flagellar M-Ring Protein-ICT719) CPn0851 979722 97A925 R nitV-NitV-related protein-IC:"720) CPn0862 980873 979722 R yth0_2-Nits-relaeed protain_2-tCT721) CPn0A63 981514 980831 R pgmA-Phosphoglyeerate Mutase-ICT722) CPn0A5d 981670 982374 F yjbC-predicted pseudouridine synthase-1CT7231 CPn0A55 98241A 982912 F CT724 hypothetical protein CPn0866 9A3491 982916 R birA-Biotin Synthetase-ICT725) CPn0867 983t23 984667 F rodA-Rod Shape Protein-1CT726) CPn0868 986613 981670 P. zntA/cadA-Metal Transport P-type ATPase-ICT727) CPn0869 987401 986658 F. CT728 hypothetical protein CPn0870 988728 987!48 F. serS-Seryl cRNA Synthecase_2-ICT7291 CPn0871 988772 989899 F' ribD-Riboflavin Deaminase-ICT730) CPn0872 989963 991216 F' ribA4ribe-('TP Cyclohydratase i DHHP Synthase -ICT731) CPn0873 991233 991694 F ribF:-Ribicyllumazine Synthase-ICT732) CPn0871 993107 991719 F CT733 hypothetical protein CPn0A75 993372 994022 F CT734 hypothetical protein CPn0876 99!144 995517 F dagA_2-D-Alanine/Glycine Permease_2-ICT735) CPn0877 995533 995982 F ybcL family-ICT7361 CPn0878 996654 995992 F SET Domain protein-ICT737) CPn0A79 997439 996645 R yycJ-metal dependent hydrolase-ICT73A) CPn08B0 999A61 9971!! R ttsK-Cell Division Protein FtsK-fCT739) CPn08A1 1005667 1006209 F
CPn0A82 1006268 1007~04 F
CPn0A83 1008865 1007573 R dmpP/nqr6-Phenolhydrolase/NADH ubiquinone oxidoreduetase-(027!0) CPn0A8t 1009359 1009009 R CT7t1 hypothetical protein CPn0885 1010635 1009433 R ygcA-rRNA Methyltransterse-IGT742) CPn08Bb 1011276 1010908 R hetA-Histone-Like Developmental Protein-fCT7t3Y
CPn08A7 1011692 101!157 F CHLTR possible phosphoprotein-ICT7lt) CPa0A88 1015423 1011119 R- hemG-protoporphyrinogen Oxidase-ICT?15) CPn08B9 1016835 I015t62 R hemN_2-Coproporphyrinogen III Oxidase_2-ICT746) CPn0890 1017805 1016819 R hemE-Uroporphyrinogen Decarboxylase-ICT747) CPn0891 1021073 1017A19 R mtd-Transcription-Repair Coupling-ICT71A) CPn0892 1023661 1021016 R alas-Alanyl CRNA Synchecase-ICT719) CPn0893 1023894 1025A88 F cktH-Transkecolase-IGT750) CPn0894 1026766 10258AA R anus-AMP Nucleosidase-fCT751) CPn0A95 1026988 1027557 F efp_2->=longation Factor P_2-fCT752) CPn0896 1027595 1027822 F CT753 hypothetical protein CPn0A97 1028737 1027853 R (possible phosphohydrolasel-ICT75t) CPn0898 1030~60 1028904 R Mitochondrial HSP60 Chaperonin Homolog-ICT7551 CPn0899 1030875 1032215 F murF-MUramoyl-DAP Lipase-fCT756) CPn0900 1032235 1033281 F mraY-MUramoyl-Pentapeptfde Transterase-ICT757I
CPa0901 1033287 1031537 F murD-Muramoylalanine-Glutamate Lipase-ICT7581 CPa0902 1034513 1035211 ~ F nlpD-Muramidase finvasin repeat family>-It:T759) CPn0903 1035263 1036417 F ttsw-Cell Division Protein Ftsw-fCT760) CPn090d 1035326 I037396 F murG-Pepcidoglycan Transterase-ICT761) CPn0905 1037109 1039835 F murCiddlA-Huramace-Ala Lipase 4 D-AJ.a-D-Alum Ligass-fCT762) CPn0906 1040310 1039915 R CT763 hypothetical protein CPn0907 I0407B0 1010!45 R ~cutA Periplasmic Divalsnt Cation Tolerance Protein CutA IC-Type Cytoehrome Biogenesis Procainf CPn0908 1041589 1040780 R CT761 hypothetical protein CPn0909 10!1537 1041966 F rsbV_2-Sigma Factor Regulator_2-fCT765) CPn0910 1041979 1043004 F 'miaA-tRNA Pyrophosphate Transterase-ICT766) CPn0911 10!1043 1012985 R Fe-S cluster oxidoreduetase_2-ICT767) CPn0912 1014129 10<5760 F GT76B hypothetical protein CPn0913 :045760 1015945 F
CPn0914 1045999 1016397 F
CPn0915 1015461 1016817 F ybeH-iojap supertamily ortholog-tCT769) CPn0916 1016837 1018084 F tabF-Acyl Carrier Protein Synthase-ICT7701 CPn0917 10!8090 1018539 F hydzolaseiahosphacase homolog-tCT771I
CPn0918 1049223 1048579 R ppa-Inorganic Pyrophosphatase-tCT773) CPn0919 10!9378 1050430 F ldh-Leuciae Dehydrogenase-tCT777) CPn0920 1051405 1050431 R eys0-Sul:::e Synchesis/biphosphate phosphatase-ICT774) CPn0921 1051535 1052293 F snGlycezoi-3-P Acylczans:erase-fCT775) CPn092210523141053927F ass-ACylplycerophosphoechanolamine Acycransferass-ICT776) CPn092310539841055093F bioF_1-Oxononanoaca Synthase_1-ICT777) CPn092410572741055028R priA-Primosomal Protein N' -fGT7781 CPn092510579001057226R G?779 hypothetical protein CPn092610580601058557F Thioredoxin Disulfide Isomerase-ICT7801 CPa092710598091058670R CItLPS 43 kDa protein homoloQ_2 CPn092810610081059884R CHLPS 43 kDa protein homoloQ_3 CPn092910622921061186A CHLPS 43 kDa protein homoloy_4 CPn093010628571063330F
CPn093110641381065718F lysS-Lysyl tRNA Synthetase-(CT7811 CPn093210671421065721R cysS-Cysteinyl cRNA Synchetase-ICT7821 CPn093310675351068578F predicted disulfide bond isomerase-ICT783) CPn093410689421068526R rnpA-Ribonuclease P Protein Componeat-fCT78d) CPn093510690911068957R rl3d-L34 Ribosomal Proeein-ICT'785) CPn093610693361069470F r136-L36 Ribosomal Proesin-ICT786) CPn0937.10694961069798F raid-514 Ribosomal Protein-ICT787) CPn093810703221069849R CT788 hypothetical protein -(leader 160) peptide-periplasa~fe~
CPn093910707281071195F CT790 hypothetical protein CPn09d010730121071204R uvrC-Excinueluse ABC. Subunft C-fCT791) CPn09d110755011073018R stutS-DNA Mismatch Repair-ICT792) CPn09d210759851077754F dnaC/prf!!-DNA Primsse-(CT7941 CPn094310779781078238F CT794.1 hypothetical protein CPn094d10785121078997F
CPn094510790701079660F C'f795 hypothetical protein CPn09d610827861079745R QlyQ-Glycyl tRNA Synthetase-ICT796) CPn094710834421084059F pQsA_2-Glycerol-3-P-Phosphacydylcransfarase_2-ICT797) CPn09d810854741084047R Q1QA-Glycogen Synthase-(CT798) CPn09d910859291086483F etc-General Stress Protein-ICT799) CPn095010864881087027F pth-Pepcidyl CRNA ttydrolase-ICT8001 CPn095110871221087157F rs6-S6 Ribosomal Protein-ICT8011 CPn095210874781087723F rsl8-518 Ribosomal Protein-fCT802) CPn095310877421088218F r19-L9 Ribososial Protein-ICT8031 CPn095410882861088708P yehe-Predicted Kinase-ICTBOdI
CPn095510886121089175F Iframs-shift with 0951) CPn095610895601090909F CT805 hypothetical proeein CPn095710937881090963R ide/ptr-Insulinase family/Prouase ZII-fCT806) CPn095810947851093793R pls8-Glycerol-3-P Acylcransferase-ICT8071 CPn095910963431094799R~ cafE-Axial Filament Protein-ICT80B) CPn096010967641097102F CT809 hypothetical protein CPn096110971181097297F r132-L32 Ribosomal Procsin-ICT810) CPn096210973161098I75F plsX-FA/Phospholipid Synthesis Protein-ICT811) CPn096310983981103221F pnq~_21Polymorphie Outer Membrane Protein D
Family-(CT812) CPn096d11047581103301R
CPn096511067361104925R lpxe-Lipid A Disaccharide Synthase-(CT411) CPa096611080371106718R pcnH_2-PolyA Polymerase_2-ICT4101 CPn096711085121109885F mrsA/pgm-PhosphoQlueomutase-ICT815) CPn096811098951111721F QlmS-Glucosamine-Fructose-6-P Aminocransferase-ICT816) CPn096911118121112999F 0969-CyrP_1-Tyrosine Transport_1-ICt817) tyrP_1-Tyrosine Transport 1-ICT8I7) CPn097011134611114648! 0970-CyrP_2-Tyrosine Transport_2-ICt818) tyrP_2-Tyrosine Tzansport_2-Irreie) CPn097111147021115115F yeeA-Transport Permease-(CT819) ' CPn097211162991115430A ltsY-Cell Division Protein TtsY-ICT8201 CPn097311163701117527F sucC-Succinyl-CoA Synchacase. Beta-ICT821) CPa097411175411118432F sucD-Succiuyl-CoA Synthecase. Alpha-ICT822) CPn097511191041119637f CPn097611200821121185F .
CPn097711213711122402F
CPn097811226651123693F
CPn097911239801125413F htrA-DO Serine Protease-ICT8231 CPn098011269821125501A similarity to Saccharomyees sersvisiae hypothetical 52.9KD protein CPa098111270311129952F tint Metalloprotease linsulinase family)-ICT814) cPn0982113119a1129962R yipN family-IGT825) CPn098311320001131206R pssA-Glycerol-Serine Phosphacidyltransferase-ICT8261 CPn098411323791135510F nrdA-Ribonucleoside Reduccase. Large Chain-ICT827) CPa098511355341136571F nrd8-Ribonueleoside Reduetase. Small Chain-ICT828) CPn098611367241:37395F Y00H-Dredieted rRNA tdethylase-ICT8291 CPn098711375161138115F ytQB-like predicted rRNA methylase-ICT830) CPn09881138986113805 R murB-UDP-N-AeecylsnolpyruvoylQlucosamine Rsduccase-CPa098911391951139016R CT832 hypothetical protein CPn099011398831140440F iafC-Initiation Factor ~-(CT8331 CPn09911140421111061?F r135-L35 Ribosomal Protein-IC':8341 CPn099:11406341110996F r120-L20 Ribosomal Protein-ICT8351 CPn099311410141112030F pheS-Phenyialanyl tRNA Synthecaee. Alpha-ICTB361 CPn099d11423981141410F CT837 hypothetical protein CPn099511455121111415R CT838 hypotheticnl protein CPn099611165891145519R CT839 hypothetical protein CPn099711467081147664F mssJ-PP-loop superfamily ATPase-ICT8401 CPn099811478551150584F ftsH-ATP-dependent zinc protease-ICT8411 CPn099911538471150766R pnp-Polyribonueleocide Nucieotidyltrnnsferase-fCT8421 CPn100011531571152891R rsl5-S15 Ribosomal Protein-tCT8431 CPn100111534051153869F yfhC-cytosine deaminase-ICT8441 CPn10021153862115089 F CT845 hypothetical protein CPn100311517961154092R CT846 hypothetical protein CPa100d1155397115879 R CT8d7 hypothetical protein CEn100511559331155115R CT818 hypothetical protein CPa100611564721155990R CT819 hypothetical protein CPa100711566891156907F GT819.1 hypothetical protein CPn100811569281158223! CT850 hypothetical protein CPn100911590581158186R map-Hschionine Aminopeptidase-fCT8511 CPn101011596721159067R CT852 hypothetical protein CPn101111603061159902R CT853 hypothetical protein CPn101211621931160421R yzs8-AHC transporter permease-ICT8541 CPn10131162245. 1163624F fuaaC-Fumarats Hydraease-fCT8551 CPn101411654261163732R yehM-Sulfate Transporter-fCT8561 CPn101511656341166893F CT857 hypoctsecical protein !possible I1i proteia) CPn101611670421168898F CT858 hypothetical protein CPn101711690061169935T lytB-Metalloproteass-ICT8591 CPn101811698981170629F
CPn101911721281170638R CT860 hypocheciesl protein CPn102011736791172150R CT861 hypothetical protein CPnI02111742131173698R lcrH_2-Low Calcium Response_2-ICT8621 CPn102211756'731174216R CT863 hypothetical protein CPn102311760351176331F
CPn102411772361176334R xerD-InteQrase/ree~binase-fCT86d1 CPa102511773021178879F pgi-Glucose-6-P Isomsrase-ICT3781 CPa102611789971.179137F ltuA-ICT3771 CPn102711791751180755F
CPn102B11810161181999F s~dhC-palate Dehyropenase-ICT3761 CPn102911820081182844F
CPa103011838861182843R predicted D-amino acid dehyrogenaae-ICT3751 CPn103111855521184098R areD-Arginine/arnithine Antiporter-ICT374) CPn103211861501185566R CT373 hypothetical proesin CPn103311875001186187R CT372 hypothetical protein CPn103411885171187732R Predicted OItP_1 ICT371I (leader f18) peptide]
CPn103511900001188570R AroE-Shikisnace 5-DehyroQenase-(CT3701 CPn103611911351189984R AroB-Dehyroquinate Synthase-IGT3691 CPn103711921991191123R AroC-Chorissiats Synchase-ICT3681 CPa103811927261192199R aroL-Shikimats Xinase II-fCT3671 CPn10391193999119=665R aroA-Phosphoshikimats Vinyltransfsrase-ICT3661 CPn101011947411194073R
CPn104111959941194726R bioA-Adsnosylmtthionine-8-Amino-7-Oxononanoats Aminotrutsferase CPn1042X1965901195934R bioD-dechiobiotin synehecass CPa10431197717119657?R bioF_l-Oxononanoats Synchass_2 ~
CPn104411986911197699R bioH-Biotin Synthase CPn104511995901198901R conserved hypothetical bacterial membrane protein CPn104612006751199590R TSyptophan Hyroxylase CPn104712005521201343F dap8-DihydrodiDicolinace Reduetasa-ICT364f CPn104B12016061202604F asd-ASpartate DehydroQenase-ICT3631 CPn101912025951203914F lysC-ASpartokinass III-fCT3621 CPn105012039261104798F dapA-Dihydrodipieolinace Synthase-ICT3611 CPn105112049621205270F
CPn105212054171206169F
CPn10531=061531206701F
CPn105d12070341209466F
CPn105512096941210521F
CPn105612105271211228F
CPn105712111971213596F CT156 hypothetical protein CPn105812137481214836F CT355 hypothetical protein CPn105912148481215678F kpsA-Dimethyladenosine Transferase-fCT3541 CPn106011176581215727R dxs/tkt-Transketolase-ICT3311 CPn106112179201217666A CT330 hypothetical protein CPn106212198201218159R xseA-Exodoxyribonucluse VII-ICT3I91 CPn106712199511220712F cpiS-Triosephosphate Isomerase-(CT3281 CPn106112=07191=20895F
CPsa105512210951=20928R
CPa106611311351221!88F
CPn1067122173512=2292F def-Polypepcida Deformylase-ICT353) CPn106B12232581222365R rnh8_2-Ribonucleue HII_2-ICT008) CPn106912235131123941F yfp~-HTH Tranacripcional ReQulacor-fCT0091 CPn10701225511122114 R
CPn107112273241225885R
CPn107212279691228835f CPn107312290111229832F Predicted 0!!P_2 -ICT371) Table 2 (Supplemental Data) Functional Assignrxnts of C. pneumonine Coding Sequences. C. trncltomatis genes arc shown in parrntheses.
Amino Acid Blosynthcsis .Iromatic Familv 1039 (CT366)aroAPhosphoshikimate Vinyltransferase 1036 (CT369)aroBDehyroquinau Synthase 1037 (CT368)aroCChorismate Synthase 1 1035 (CT370)aroEShikimate i-Dehyrogenase ~
0486 (CT382)aroGDeoxyheptonate Aldolue 1038 (CT367)aroLShikimate Kinase II
0740 (CT637)tyrBAromatic AA Aminatransfense AsparrateFomily !lysine) 1 1048 (CT363)asd Asp:~ute Dehydrogenase 1050 (CT361dapADihy<bodipicolinate ) Synthasc 1047 (CT364)dapBDihydrodipicolinate Reductasc 0519 (CT430)dapFDian inopirnelate Epimerue 1049 (CT362)IysCAspa zokinase llI
2~ Serint Family 0433 (CT282)gcsfiGiyc ne Cleavage System H Protein 0521 (CT432)glyASerine tiydroxymethyltransfense Base &
Nuclmtidt Metabolism 0171 guaAGMP Synfhase 25 0172 guaBInosine 5'-Monophosphase Dehydrogenase 0608 Utidine S'-Monophosphate Synthase 0735 Uridine Kinase 0244 (CT128)adk Adenylate Kinase 0894 (CT751atnnAMP Nucieosidase ) 3~ 0568 (CT452)cmk CMP Kituue 0392 (CT039)dcd dCTP Deaminue 0059 (CT292)dut dUTP Nucleotidohydrolase OI20 (CT030)gmk GMP Kinase 0619 (CT500)ndk Nucleoside-2-P Kinase 3 0984 (CT827)nrdARibonucleoside Reductase.
5 Large Chain 0985 (CT828)nrdBRibonucieoside Reductase, Small Chain 0236 (CT183)pytGCTP Synthetase 0698 (CT678)pyresUMP Kittase 0271 (CT188)tdk Thymidylate Kinase 0659 (CT539)mtA Thioredoxin 0314 (CT099)trx8Thiorcdoxin Reductase I (CT844)yfhCCytosine Deaminasc OOI
45 Biotin. Lipoate dr Ubiquinone Biosynthesis of Cotacton 1041 bioAAdenosylmethionine-8-Amino-7-Oxononanoate Aminottatufetasc 1044 bioBBiotin Synthase 1042 bioDDethiobiotin Synthetase 0923 (CT777)bioF_IOxononanoate Synthase-1 1043 (C'C777)bioFOxononanoate Synthase-2 0866 (CT725)birABiotin Synthetase 0748 (CT628)ispAGmnyl Tnnsaatuferasc 0832 (CT558)IipALipoate Synthetase 0265 (CT219)ubiA Benzoau Ocnphenyhransiense 0264 (CT220)ubiD Phenylaerylate Decarboxylue OSIS (CT428)ubiE UbiquinoneMethyltransfense Folic Acid 0759 (CT612)folA DihydrofolateReducuse 0335 (CT078)folD Methylene Tcaahydrofolate Dehydrogenase 0758 (CT613)folP Dihydropteroate Synthuc 0757 (CT614)folX Dihydroneopmrin Aldolue 0763 (CT649)ygfA FortnyltetrahydrofolateCycloligase 1 Porphyrin ~
0714 (CT662)hertWGlutamyl tRNA Reducnse 0744 (CT633)hemB Porphobilinogen Synthue OOS2 (CT299)hemC Porphobilinogen Deaminue 0890 (CT747)hemE Uroporphyrinogen Decarboxylase I 0888 (CT74$)hemG protoporphyrinogen S Oxidise 0138 (CT210)hems.Glutamate-1-Semialdehyde-2.1-Aminomutue 0380 (CT052)hemN_ICoproporphyrinogen Ilt Oxidase_I
0889 (CT746)hemlVCoproporphynnogen 2 111 Oxidise 2 _ 0603 (CT485)hemZ Ferrochentue Riboflavin 0872 (CT731nbA&rib8 ) GTP
Cyclohydranse &
DHBP
Synthase 0532 (CT40S)ribC Riboflavin Synthase 0871 (CT730)ribD Riboflavin Deaminue 0877 (CT7J2)ribE Ribiryllumazine Synthue 25 0320 (CT09))ribF FAD Synthase Cell Envelope Forty Acid &
Phospho(ipid Merabolisrn 0161 (CT206) (predicted uyltnnsferase family) 0922 (CT776)au Acylg(yeerophosphoethanolamine Acyhnnsferase 0414 (CT265)accA AcCoA CarboxylasrrTransferrse Alpha 0183 (CT123)accB Biotin Carboxyl Carrier Protein 0182 (CT124)accC BiotinCarboxylase 0058 (CT29J)accD AeCoA Carboxylaseffranafense Ben 35 0295 (CT2Jb)acpP Acy1 Cartier Protein 0313 (CTI00)acpS Acyl-rartier Protein Synthue 0567 (CT451)cdsA Phosphatidate Cytidylytransferasc 0297 (CT238)fabD Malonyl Acyl Cartier Transeyclase 0916 (CT770)fabF Acyl Carrier Protein Synthasc 0296 (CT237)fabG Oxoacyl (Carrier Protein) Reductue .0298(CT239)fabH Oxoacyl Carrier Protein Synthue III
0406 (CTlOa)fabl Enoyl-Acyl-Cartier Protein Reducnsc 0651 (CT532)fabZ Myristoyl-Aeyl Carrier Dehydranse 0098 (CTOIO)hcB Acyltransferue 45 0271 (CTIJ6) LysophoapholipueEsterue 0615 (CT496)pgsA-1Glycerol-3-P Phosphatidyltratufense-I
0947 (CT797)pgsA Glycerol-J-P Phospharydyltransfensse_2 0958 (CT807)plsB Glycerol-3-P Acylcansferase 0569 (CT453)plsC Glycerol3-P Acylaansferau 50 0962 (CT811plsX FA/Phospholipid Synthesis ) Protein 0839 (CT699)psdD Phosphatidylserirte Deearboxylue 0983 (CT826)pssA Glycerol-Serine Phosphatidyltransfecue 0921 (CT775) sttGlyeerol-J-P Acyltraruferase 0654 (CTS35)yciA Acyl-CoA Thioestcrasc S 0877 (C1'736)ybcL CT1J6 Hypothetical Protein LPS
WO 00127994 PCT/US99/Zb923 0154 (CT208)gseAKDO Tnnsfense 0721 (CT655)kdsAKDO Synthetue 0235 (CT182)kds8Deoxyoctutotrosic Aeid Synthetue 0650 (CT531IpxAAcyl-Carrier UDPGIcIvAc ) O-Acyltnnsfensc 0965 (CT411IpxBLipid A Disucharide ) Synthase 0652 (CT533)IpxCMyristoyl GIcNac Deaeetyiau 0302 (CT243)lpxDUDP Glueosamine N-Acyltransferase Membrant Proteins.
Lipoproteins &
Porins 0310 (CT25160IM60kDa lacer Membrane ) Protein 0556 (CT442)crpAISkDa Cysnine-Rich Protein 0653 (CT534)cutEApolipoprotein N-ACetyltnttsferue 031 (CT252)Igt Prolipoprotein Diacylglyeerol I Tnnsfense 0558 (CT444)omcA9kDa-Cysteine-Rich Lipoprotein 0557 (CT443)omcB60kDa Cysteine-Rich OMP
0695 (CT681ompAMajor Outer Membrane ) Protein 0854 (CT713)ompBOuter Memebnne Protein 0781 (CT600)pat Pepddoglyean-Associated Lipoprotein 0300 (CT241yaeTOmp85 Hotnolog ) Peptidoglye:an 0417 (CT268)amiAN-Acetylmuramoyl Alanine Amidue 0780 (CT601amiBN-Acetylmunmoyl-L-Ala ) Amidue 0672 (CT55duF D-Ala-D-Ala Caroxypeptidase t ) 0968 (CT816)glmSGlueoumine-Fructose-6-P
Aminotnnsfense 0749 (CT629)glmUUDP-GIcNAc Pyrophosphorylue 0900 (CT757)mnY MunmoylPennpeptide Tnnxferue 0571 (CT455)murAUDP-N-Acetylg(ucosamine Tnmferue 0988 (CT831)murBUDPN-At:etylenolpyruvoylglucosamineReductue 0905 (CT762)murCdcddlA
Mutartutc-Ala Liguc &
D-AlaD-Alam Ligue 0901 (CT758)murDMunmoylalanine-Glunmate LiBase 0418 (CT269)murEN-Aeetylmunmoylalanyl8lunmyl DAP Ligue 0899 (CT756)murFMuramoylDAP Ligau 0904 (CT761murGPeptidoglyean Tnnsferue ) 0902 (CT759)nlpDMunmidue (invuin repeat family) 0694 (CT682)pbp2PBP2-Tnnsglycoluelfnnspeptidue 0419 (CT270)pbp3Tntesglycoluelfnnsprptidase 0421 (CT272)yabCPBP2B Family Methyltntufensc Cellular Prueeases Ctil Division 0959 (CT808)catEAxial Filament Protein 0880 (CT739)ftsKCell Division Protein FaK
0903 (CT760)fhW Cell Division Protein FtsW
0972 (CT820)ftsYCell Division Pronin FnY
0617 (CT498)gidAFAD-dependerttOxidorcducttue 0805 (CT582)minDChromosatx Partitioning ATPase 0850 (CT'109)mteBRod Shape: ProteinSugar Kinue 0867 (CT726)rodARad Shape Protein 0684 (CT688)parBChromosome Partitioning Protein Deroztijcatioa 5~ 0057 (CT294)sodMSupe:roxideDismunsefMn) 0778 (CT603)ahpCThio-spucifie Antioxidant (TSA) Peroxidase Signal Transduetioa 0148 (CT145) S!T Protein Kinue 0584 (CT467)uoS Two-Component Sensor 0294 (CT235) cAMP-Dependrnt Protein Kinase Regulatory Subunit 0712 (CT664) (FHA domain) 0478 (CT379)h!!XGTP Binding Protein 0703 (CT673) S!C Protein Kinase 0095 (CT301 S!f Protein Kinau ) 0397 (CT259) PP2C Phosphatax Family 0037 (CT337)puH PTS Phosphoeartier Protein Hpr 0038 (CT336)ptslPTS PEP Phosphotnnsferase 0060 (CT29prsN_1PTS IIA Protein_t f ) 0061 (CT290)ptsNPTS IIA Protein 2 r HTH DYA-Binding Dorttain 0262 (CT218)surfSurf-like Acid Phosphatase 0838 (CT698)thdFThiophenelFuran Oxidation Protein 0693 (CT683) TPR Repeats-CT683 Hypothetical Protein 0321 (CT092)ychFGTP Binding Protein 0544 (CT4 yhbZGTP binding protein t 8) 0844 (CT703)yphCGTPaseiGTP-binding protein Smedard Protein Secretion 01 (CT025)fIh Signal Recognition I Particle GTPax S
03b3 (CT060)tlhAFtagellar Secretion Protein 0858 (CT717)ffiIFlagellum-specific ATP Synthax 0704 (CT672)fl(NFlagellu Motor Switch DomainIYseQ family 0815 (CT572)gspDGen. Secrcdon Protein D
0816 (CT571gspEGen. Secretion Protein ) E
0817 (CT570)gspFGen. Secretion Protein F
0359 (CT064)IepAGTPase 0110 (CT020)lepBSignal Peptidue I
0535 (CT408)IspALipoprotein Signal Peptidax 0260 (CT141xeA_IProtein Translocax ) Subunit-1 0841 (CT701secA_2Transloerue SecA-2 ) 0564 (CT448)secD&secF
Protein Export Proteins SecDiSecF
(fusion) 0075 (CT321secEPrcprorcin Transloeax ) 3v 0629 (CT510)xcY Tnnslocase 0848 (CT707)rig Trigger Factor-Peptidyl-prolyl lsomersse Tronsporr-Related Proteins 0486 Hypothetical Praline Permease 0289 (CT230)aaaTNeutral Amino Acid (Glutamate) Tranaponer 3 0691 (CTb85)abcXABC Transporter 5 ATPax 1031 (CT374)arcDArginine/Omithine Antiporter 0482 (CT381artlArginine Periplasmic ) Binding Protein 0836 (CT554)bmQ Amino Acid (Benched) Transpon 0536 (CT409)dagA_ID-Ala/Gly Permcax I
0876 (CT735)dagAD-AlaninelGlycine 2 Permease 2 0682 (CTb90)dppDABC ATPase Dipeptide Transpon Ob83 (CT689)dppFABC ATPase Dipeptide Transport 0280 (CT689)dppFDipeptide Transporter ATPase 0785 (CT596)exbBMuromolecule Transporter 45 0784 (CT597)exbDBiopolymerTansporiProtein 0404 (CT486)OiY Glutatnine Binding Protein 0192 (CT129)glnPABC Amino Acid Trmsporter Permease 0191 (CT130)ginQABC Amino Acid Transporter ATPase 0528 (CT401)gltTGlutamateSymport 028b (CT194)mgtEMg'+Transportt:r(CHS
Domain) 0413 (CT264)msbATransport ATP Binding Protein 0290 (CT231) Na;-dependentTnnsporier 0195 (CT198)oppA_IOligopeptide Binding Protein_1 0196 (CT198)oppA_2Oligopeptide Binding Protein 2 _ 5 0197 (CT139)oppAOligopeptide Binding 3 Protein 3 0198 (CT175)oppAOligopeptide Binding 4 Protein .t 0599 ICT480oppA Oligopeptide Binding 5 Lipoproretn i ) 0199 (CTI99)opp8-1Oligopeptide Pemtesse-:
0598 (CT479)oppB Oligopeptide Permease_2 0200 (CT200)oppC_1Oligopeptide Permeue_1 0597 (CT478)oppC_2Oligopeptide Pemuase_'_ 0201 (CT201oppD Oligopeptide Tnnspon ) ?~TPase 0202 (CT202)oppF Oligopeptide Transport ATPue 0231 (CT180)tauB ABC Tnnspon ATPue f~itntaFe) 0782 (CT599)tolB Macromolecule Transporter 0969 (CT8I7)tyrP_ITyrosine Tnnsport_1 0970 (CTalB)tyrP_2Tyrosine Transport _ 0665 (CT544)uhpC He~osphosphate Transport 0282 (CT216)xuA Amine Acid Transporter 0207 (CT204)ybhl dicarboxylate Tnnslocator 1 0971 (CT819)yccA Tnnspon Permease S
0248 (CT152)ycCV ABC TnnsporterATPase lOt4 (CT856)ychM Sulfa a Tnnsponer 0736 (CT641ygeD fllu:. Protein ) 0680 (CT692)ygo4 Phosp gate Pennease 0723 (CT653)yhbG ABC Tnnsponer ATPue 0023 (CT348)yjjK ABC Transporter Protein ATPue 0127 (CTD34)ytfF Catioi.ie Amino Acid Transporter 0349 (CT067)ytgA Solute Protein Binding Family 0348 (CT068)ytgB ABC ransporter ATPue 0347 (CTD69)ytgC Integrsl Membrane Protein 0346 (CT070)ytg0 Integral Membrane Protein 1012 (CT854)yze8 AHC Tnnsponer Permease 0868 (CT727)znlA Metal Tnnspon P-type .4TPase 0279 Possible ABC Tnnsportcr Pertneue Protein 0543 (CT417) (Metal Tnnspon Protein) 0692 (CT684) ABC Transponer 0542 (CT416) ABC Transporter ATPase 0690 (CT686) ABC Transporter Membrane Protein 0541 (CT415) solute binding protein 3 7yve-msttetro~
0323 (CT090)IcrD Low Caleium Response D
0324 (CT089)IcrE Low Calcium Response E
D8 (CT576)IcrH_tLow Ca Response c Protein H-1 I
1021 (CT862)IcrH Low Calcium Response _ 0325 (CT088)sycE Seerction Chaperone 0702 (CT674)yscC Yop GGen Secretion Protein D
0828 (CT559)yscJ Yop Tnnslocation J
0826 (CT561yscL Yop Tnnsloeation ) L
0707 (CT669)yscN Yop N (Flagellar-Type ATPase) 45 0825 (CT562)yscR Yop Tnnslocadon R
0824 yscS YopS Tnnslocation (CT563) Protein 0823 yscT YopT Tnnloeation (CT564) T
0322 yscU Yop Translocation (CT091 Protein U
) 5o Central Intermediary Metabolism Glycogen Merobofism 0856 (CT715) UDP-Glueose Pyrophosphorylue 0948 (CT798)glgAGlycogen Synthase , 0475 (CT866)glgBGlucan Benching Enzyme JS 0607(CT489)glgCGlucoseI-P Adenyltransferase 0307 (C7-248)glgPGlycogen Phosphorylase 0388 (CT042)glBXGlycogen Hydrolase (debnnching) 0326 (CT087)malQGlucanoesnsfense 0851 (CT710)pckAPhosphoenolpyrovate Carboxykinase Phosphorous Qc Suljur 0548 (CT435)cystSulfite Reductase S 0920 (CT774)cysQSulfite SynthesivBiphosphace Phosphatau 0025 (CT346)actASulphohydrolue 0918 (CT77?)ppaInorganic Pyrophosphacase DNA Replication. Madlfication. Repair & Recombination 1 O DNA Mismareh Repair 0505 3-Methyladenine DNA Glycosylue 0812 (CTS75)mutt DNA Mismatch Repair 0941 (CT792)mutS DNA Mismatch Repair 0402 (CT107)mutt Adenine Glycosyiase S 0732 (CT625)nfo Endonuclease IV
0837 (CT697)nth Enodnucleue 111 DNA
Modification 0596 (CT477)ada Methylmnsferau Ol (CT024)hemK AJG-specific Methylue ZO 0891 (CT748)mfd Tnnscnprion-Repair Coupling 0620 (CT501ruvA Holliday Junction ) Helicue 0390 (CT040)rov8 Holliday Junction Helicue 0621 (CT502)rovC Crossover Junction Endonucleue 0053 (CT298)sms Strn Protein 2S 0771 (CT607)un8 Uncil DNA Glycosylue 1062 (CT329)xseA Exodoxyribonucleue VII
DNA
Recombination 0762 (CT650)recA RecA Recombination Protein 0738 (CT639)recB Exodeoxyribonuclease V. Beu 3O 0737 (CT640)recC Exodeoxyribonucleue V, Gamma 0123 (CT033)rccD_IExodeoxyrtbonuclease V (Alpha Subunit)_I
0752 (CT6S2)reeD Exodeoxyribonuclease 2 V. Alpha 2 _ 0339 (CT074)recF ABC Superfamdy ATPuc 0340 (CT074) (frame-shift with 0339) 3S 0563 (CT.t47)recJ ssDNA Exonucleue 0299 (CT240)rceR Recombination Protein DNA
Replication 0309 (CT2S0)dnaA_IReplication Initiation Protein_I
0424 (CT275)dnaA Replication Initiation 2 Faetor_2 4O 0616 (CT497)dnaB Replicative DNA
Helicue 0666 (CT545)dmE DNA Pol tI1 Alpha 0942 (CT794)druG DNA Primax 0338 (CT075)dnaN DNA Pol III (Beta) 0410 (CT261dnaQ_1DNA Pol III Epsilon ) Chain_1 4S 0655 (CT536)dnsQ DNA Pol III Epsilon 2 Chain_2 0040 (CT334)dnaX_1DNA Pol III Gamma and Tau_l 0272 (CTI87)dnaX DNA Pol III Gamma 2 and Tau_2 _ 0149 (CT146)dnU DNA Ligue 0274 (CT189)ByrA_IDNA Gyrue Subunit A_I
SO 0716 (CT660)gyrA DNA Gyrase Subunit _ 0275 (CTI90)gyrB_IDNA Gynse Subunit 8_I
0715 (CT661gyrB DNA Gynse Subunit ) 2 B_2 0416 (CT267)himD lntegntion Host Factor Alpha 0612 (CT493)polA DNA Polymerise I
S 0924 (CT778)priA Primosomal Protein S N
0386 (CT044)ssb SS DNA Binding Protein 0835 (CT555) SWUSNF family helieue_t 0849 (~7pg) SWUSNF family helicue _ 0769 (CT643)topADNA Topoisomense t-Fused to SWI
Domain 0024 (CT347)xerCIntegruvrecombinue 1024 (CT864)xerDIntegrudrccombinue Eukaryotic-Typt Chromatin Factors 0886 (CT743)hctAHiswne-Like Developmental Protein 0384 (CT046)hct8Histone-like Protein 0878 (CT737) SET Domain protein 0577 (CT460) SWIB (YM74) Complex Protein UVR
Exinutlease Repair System 0096 (CT33;)uvrAExcinueletux ABC
Subunit A
0801 (CT586)uvr8Exinucleue ABC Subunit B
p9a0 (CT791uvrCExcinucleue ABC.
) Subunit C
I 0772 (CT608)uvrDDNA Hclicue S
Energy Metabolism Aerobic 0855 (CT714)gpdA Glycerol-3-P Dehydrogenase 0743 (CT634)nqrA Ubiquinone Oxidorcductue.
Alpha 0427 (CT278)nqr2 NADH (Ubiquinone) Dehydrogenase 0428 (C'C279)nqr3 NADH (Ubiquinone) Oxidorcductase.
Gamma 0429 (CT280)nqr4 NADH (Ubiquinone) Reductue 0430 (CT281nqr5 NADH (Ubiquinone) Redueuse ) 5 25 0883 (CT740)nqr6 PhenolhydrolasdNADH
(Ubiquinone) Oxidoreductase .1 TP
Biogenesis and mttabolistn 0351 (CT065)adt_IADPIATP Traralaeast_1 0614 (CT495)adt ADP/ATP Tnnslocase_2 0088 (CT308)atpA ATP Synthue Subunit A
0089 (CTJ07)atpB ATP SYn~uc Subunit B
0090 (CT306)atpD ATP Synthue Submit D
0086 (CT3I0)atpE ATP Synthue Subunit E
0091 (CT305)atpl ATP Synthase Subunit 0092 (CT304)atpK ATP Synthue Subunit K
35 0860 (CT119)fliF FIageIlar M-Ring Protein Electron Transport Chain 0102 (CT013)cydA Cytochrorne Oxidue Subunit 0103 (CT014)cydB Cytochrome Oxidise Subunit 0364 (CT059) Fertedoxin L~~ 0084 (CT312) Predicted Ferrcdoxin Glyrnlysis & Gluconeogtnesis 0281 (CT215)dhnA Predicted 1.6-Fructose Biphosphate Aldolase OB00 (CT587)erro Enolue 0624 (CT505)gapA Glyceraldehyde-3-P Dehyrogenue 45 0056 (CT295)mrsA PhosDhornannomutue 0967 (CT8I5)pgm Phosphoglucomutue 0160 (C'T207)plkA_IFructose-6-P Phosphotransfense_I
0208 (CT205)ptkA Fructose-6-P Phosphoasnsferue_2 1025 (CT378)pgi Glucose-6-P Isomerase 0679 (CT693)pgk Phoaphoglyeerate Kituse 0863 (CT722)pgrrtAPhosphoglyetnte Mutant 0097 (CT332)pyk Pyruvate Kinase 1063 (CT328)tpiS Triosephosphate Isomertue Ptntose Phosphate Pothway 55 0239 (CTI86)devB Glucose-bP Dehyrogtnast (DevB family) 1060 (CT331)dxs Tnruketolue 0360 ICT063)gnd 6-Phosphogluconate Dehydrogenase 0185 ICTI21)rpe Ribulose-P Epimense 0141 (CT213)tpiARibose-5-P Isomerase A
0083 (Cf313)tal Transaldolue S 0893 IC1-750)UttBTnnsketolue 0238 (CT185)zwf Glucose-6-P Dehyrogenue Pyruvalt Dehydrogenase 0833 (CT557)IDdALipoamitle Dehydrogenue 0436 ICT285)IpIA_ILipoate Protein Ligue-Like Protein 0618 (CT499)IpIALipoam-Protein ? Ligase A
0033 (CT340)ptJhA&BOxoisovalente Dehydrogenase a/(i Fusion 0304 (CTZ45)pdhAPyruvate Dehydrogenase Alpha 0305 (CT246)pdhBPyruvate Dehydrogenue Beta 0306 (CT247)pdhCDihydrolipoamide Acetyltransferase S Cycle 0495 (CT390)aspCAspartam Aminotnnsferase 1013 (CT855)fumCFumarate Hydratue 1028 (CT376)mdhCMalate Dehyrogenase 0789 (CT592)sdhASuccinam Dehydrogenue 0790 (CT591)sdhBSuccinate Dehydrogrnase 0788 (CT593)sdhCSuccinate Dehydrogenue 0378 (CT054)sueAOaoglutarate Dehydrogenue 0377 (CTO55)sucB_1Dihytleolipoamide Succinylnansfetase_I
0527 (CT400)sucBDihydrolipwmide 2 Succinyltratuferue ?
-2 0973 (CT821sucCSuccinyl-CoA Synthetue.
S ) Ben 0974 (CT822)sucDSuecinyl-CoA Synthetase, Alpha Protein Folding, Assembly & Modification Chaperonu 30 0949 (CT799)ctc General Stress Protein 0534 (CT407)dksA DnaK Suppressor 0032 (CT34IdnaJ Heat Shock Protein ) J
0503 (CT396)dnaK Hsp-70 0134 (CT110)groEL_1Hsp-60_1 3 0777 (CT604)groELHsp-60 2 0898 (CT755)groELHsp-60 3 0135 (CTI groESlOKDa Chaperonin ) 0502 (CT395)grpE HSP-70 Cofutor 0661 (CT541mip FKBP-type Peptidyl-prolyl ) CisTrans lsotnerue Prattasts OI44 (CTI clpB CIp Proteue ATPue 13) 0437 (CTt86)clpC CIpC Proteue 0520 (CT431clpP CLP Protease ) 1 0847 (CT706)clpP CLP Protease Subunit 4S 0846 (CT705)dpX CLP Proteue ATPue 0269 (CC138) Dipeptit3ue 0998 (CT841)fliesATPdepmdent Zinc Proteue 0030 (CT343)gcp_1O-Sialoglytoprotein Endopeptidue_I
0194 (CT197)gcp_2OSialoglycoprotein Endopeptidast_2 S~ 0979 (CT823)htrA DO Senne Proteue 0957 (CT806)ide Insulinue family/Proteaae 0027 (CT344)ion Lon ATP-dependent Protect 1017 (CT859)IytB Metalloproceue 1009 (CT85t)trap MethionmeAminopeptitlase S 0185 (CT045)pepA Leucyl Aminopeptidau S A
OI36 (CT113)DepF Oligopeptidase 0813 (CT574)pepPAminopeptidase P
0613 fCT494)soh8Protease 0555 (CT441tsp Tail-Specific ) Protease 0344 (CT072)yaeL~tetal!optoteue 0981 (CT824) Zinc ~tetalloprotease (insu)intue family) Proteinomtrasts ls 0227 (CT176)dsb8Disulfide bond Oxidorcductue 0786 (CT595)dsbDThio:disulfide Interchan8e Protein 0228 fCT177)dsbGDisulfide Bond Chaperone 0933(CT783) Prcdieted Disulfide Bond lsotnerase 0926 (CT780) Thiorcdoxin Disulfide Isomerase Transcription RNA
Degradation 0999 (CT842)pnp Polyribonucleotide ~ueleotidylmnsfense 0054 (CT297)me Ribonuelease III
0119 (CT029)mhB_1Ribonuelesse HII_1 1068 (CT008)mhB Ribonucleax Hlf 0934 (CT784)mpA Ribonueleue P Protein Component 0504 (CT397)vac8Ribonucleue Family ~ Elongation Qe Termination Faetors 0741 (CT636)greATranscription Elongation Factor 0316 (CT097)nuSAN Utilization Protein A
0076 (CT320)nusGTnrueriptional Antitermination 0845 (CT704)pcnB-1Poly A Polymenx_I
I 0966 (CT410)pcnBPolyA Polymerise ?
S
0610 (CT491rho Transcription Termination Factor ) RNA
Merhyiases 0674 (CTSS3)fmu RNA Methyloansferue 1059 (CT3S4)kgsADimethy(adenosine Tnnsfense 2~ 0187 (CT133) PttdictedMethylue 0530 (CT403)spoU_1rRNA Methylue_1 0660 (CTS40)spoUrRNA Methylue_2 0117 (CT027)trmDtRNA (Gtunine N-I )-Methylttansfense 0885 (CT742)ygcArRNA Methyltransferse 25 0986 (CT829)yggHPredicted rRNA Methylue 0987 (CT830)ytg8Predicted rRNA Methylase RNA
Modification 0649 (CTS30)fmt Methionyl tRNA Formyhnnsferase 0910 (CT766)miaAtRNA Pyrophosphate Tnnsferise 30 07t9 (CT658)sthBPredicted Pxudouridine Synthue 0219 (CT193)tgt Queuine tRNA Ribosyl Tnnsfense OS80 (CT463)truAPseudouridylate Synthue I
0319 (CT094)tru8tRNA Pseudouridine Synthue 0401 (CTt06)yceCPredicted Pseudouridine Synthetue Family 3 0864 (CT723)yjbCPredicted Pseudouridine Synthue RNA
Po(ymerote Qc Trantcriprion Rtgulators OS86 (CT4b8)atoCTwo-Component Regulator 0362 (CTObIrpsDSigma-28/WhiG Family ) OS01 (CT394)hrcAHTH Tnnxriptional Repressor 40 0793 (CTS88)rbsUSigma Regulatory Family Protein-PP2C
Phosphanse (RsbW Anngonist) 062b (CTS07)tpoARNA Polymerise Alpha 0081 (CT31rpoBRNA Polymerise Ben S) 0082 (CT314)rpoCRNA Polymerise Ben' 075b (CT61rpoDRNA Polymenx Sigma-66 S) 45 0771 (CT609)rpoNRNA Polyrrrcnse Sigma-S4 OSI1 (CT424)rsbV_ISigtnaRegulatoryFaetor_I
0909 (CT76S)rsbVSigma Factor Regulator 2 0670 (CTS49)rsbWSigma Regulatory Factor-Histidine tCitux 0750 (CT630)tctDHTH Tranuriptional Regulatory Protein Receiver Doman 1069 (CT009)yfgAHTH Tnnscripoonal Regulator Amino Aeyl tRNA Synthesis 0892 (CT749) alas Alanyl tRNA Synthetue 55 0570 (CT454) argS Arginyl tRNA Tnnsfenx 0662 (CT542) asps Aspartyl tRNA Synthense Translation 0932 (CT782)cysSCysuinyl tfL'IA
Synthetue 0003 (CT003)gatAGlu tRNA Gln Amidotnmfertue (A subunit) 0004 (CT0o4)gatesGlu tRNA GIn Amidotnnsfmse (B Subunit) 0002 (CT002)gatCGlu tRNA Gln Amidotnnsfetase (C subunit) 0560 (CT445)gltXGlutamyl-tRNA Synthetue 0946 (CT796)glyQGlycyl tRNA Synthetax 0663 (CT543)hissHisadyl tRNA Syntherase 0109 (CT019)ileSIsoleucyl-tRNA
Synthetax 0153 (CT209)IeuSLeucyltRNA Synthetue 1 0931 (CT781IysSLysyl tRNA Synthetase ~ ) OI22 (CT032)tttetGMeth'ronyl-tRNA
Synthenx 0993 (CT836)pheSPhenylalanyl tRNA
Synthetase, Alpha 0594 (CT475)pheTPhenyla)anyl tRNA
Synthetax Beta 0500 (CT393)prosProlyl tRNA Synthetax I 0870 (CT729)xrS Seryl cRNA Syntherase 0806 (CT581)thrSThrconyltRNA Synthense 0802 (CT585)apS TryptophanyItRNA
Synthetase 0361 (CT062)tyrSTyrosyl tRNA Synthetase 0094 (CT302)vaiSValyl tRNA Synthetue Pepridc Chain Initiation.
Elongation &
Termination 1067 (CT333)def Polypeptide Dcformylase 0184 (CT122)eCp_IElongation Futor 0895 (CT752)efp Elongation Futor 0550 (CT437)CusAElongation Facror G
25 0073 (CT323)inCAInitiation Factor IF-I
0317 (CT096)inf8Initiation Factor-2 0990 (CT$33)infCInitiation Futon 01 (CT023)plrAPeptide Chain Releasing I3 Futon 1 0576 (CT459)prt8Peptide Chain Release Factor 2 3~ 0950 (CT800)pth Peptidyl tRNA Hydrolax 0318 (CT095)rbfARibosome Binding Futon A
0699 (CT677)rrf Ribosome Releasing Factor 0697 (CT679)tsC Elongation Factor TS
t>074(CT322)tufAElongation Factor Tu 35 Ribosomal Prortins 0078 (CT318)rll LI Ribosomal Protein 0644 (CT525)r12 L2 Ribosomal Protein 0647 (CT528)r13 L3 Ribosomal Protein 0646 (CT527)rl4 L4 Ribosomal Protein 0635 (CT516)r15 LS Ribosomal Prouin 0633 (CT514)rl6 L6 Ribosomal Protein 0080 (CT316)r17 L7/LI2 Ribosomal Prouin 0953 (CT803)rl9 L9 Ribosomal Protein 0079 (CT317)r110L10 Ribosomal Protein 45 0077 (CT319)rl L1 f Ribosomal I Prouin 0247 (CT125)r113Ll3 W'b~onui Prouin 0637 (GT518)r114L14 Ribosomal Prouin 0630 (CT511)r115LIS Ribosomal Prouin 0640 (CT521)r116L16 Ribosomal Prouin 0625 (CT506)r117Ll7 Ribosomal Protein 0632 (CT513)rll8Lt8 Ribosomal Prouin 01 (CT028)r119Ll9 Ribosomal Protein l8 0992 (CT835)r120L20 Ribowmal Protein 0546 (CT420)r121L21 Ribosomal Protein 55 0642 (CT5I3)r122L22 Ribosomal Prouin 0643 (CT526)r123L23 Ribosomal Protein 0636(CT517)r124L24 Ribosomal Prouin 0545(CT419)r127427 ribosomal protein 0327(CT086)r128L28 Ribosomal Prouin 0639(CT520)r129L29 Ribosomal Protein 0112(CT022)r131L31 Rbosomal Protein 0961(CT810)r132L32 Ribosomal Prouin 0250(CT150)r133L33 Ribosomal Prouin 0935(CT785)r134L34 Ribosomal Prouin 0991(CT834)r135L35 Ribosomal Prouin 1 0936(CT786)r136L36 Ribosomal ~ Prouin 0315(CT098)rst SI Ribosomal Protein 0696(CT680)rs2 S2 Ribosomal Prouin 0641(CT522)rs3 S3 Ribosomal Prouin 0733(CT626)rs4 S4 Ribosomal Prouin 15 0631(CT512)rs5 S5 Ribosomal Prouin 0951(CT801rs6 S6 Ribosomal ) Prouin 0551(CT438)rs7 S7 Ribosomal Prouin 0634(CT515)rs8 S8 Ribosomal Protein 0246(CT126)rs9 S9 Ribosomal Prouin 0549(CT436)rs10S10 Ribosomal Prouin 0627(CTSOB)rsl1511 Ribosortul Protein 0552(CT439)rsl2SI2 Ribosomal Prouin 0628(CT509)rs13SI3 Ribosomal Prouin 0937(CT787)rs14514 Ribosomal Prouin 25 1000(CT843)rsl5S15 Riboaomal Protein 0116(C'f026)rs16SI6 Ribosomal Protein 0638(CT519)rsl7517 Ribosomal Protein 0952(CT802)rs18SI8 Ribosomal Protein 0643(CT524)rsl9519 Ribosomal Prouin 0754(CT617)rs20S20 Ribosomal Prouin 0031(CT342)rs21521 Ribosomal Protein 35 Other Catc'orica Ch(cmydiaSpccific Proteins 0561 (CT446)Euo CHLPS Euo Prouin 0804 (CT583)Gp6D CHLTR Plasmid Paralog 0186 (CTt SimiLriey to IncA_t t9) 0291 (CT232)ineB Inelmion Membrane Protein B
0292 (CT233)incC Inclusion Membrane Protein C
1026 (CT377) LtuA Prouin 0333 (CTO80) LtuB Protein 0005 (CT871pmp_IPolymorphic Ouur ) Membrane Protein G Family 45 0013 (CT871pmp_2Polymorphie Ouur ) Membrane Prouin G Family 0014 (CT871pmp Polymorphic Ouur ) ~ Membrane Prouin G Family 0015 (CT871pmp_3PMP 3 (frame-shit!
) with 0014) 0016 (CT874)pmp Polymorphic Ouur 4 Membrane Prouin G Family OOi7 (CT871)pmp_4PMP 4(fttune-shiftwith0016) 0018 (CT874)pmp Polymorphic Outer 5 Membrane Protein G Family 0019 (CT87IPmp_5PMP 5 (frame-shift ) with 0018) 0444 (CT871pmp Polymorphie Ouur ) 6 Membrane Prouin G/I Family 0445 (CT871pmp_7Polymorphic Outer ) Membrane Protein G Family 0446 (CT871pmp Polymorphic Outer ) 8 Membrane Protein G Family 55 0447 (CT871pmp Polymorphic Ouur ) 9 Membrane Prouin G/I Family 0450 (CT871pmp_IPolymorphic Ouur ) O Membrane Protein G Family 0449 (CT871DmP_10PMP_l0 (Frame-shift ) with 0450) 0451 (CT87t ) pmp_I I Polymorphic Outer Membrane Protein G Family 0452 (CT874) Potymorphic Outer Membrane pmp_12 Protein (truncated) AEI Family 0453 (CT871) Polyrnorphie Outer Membrane pmp_I3 Protein G Family 0454 fCT872) Polymorphic Outer Membrane pmp_t4 Protein H Family 0466 (CT869)pmp_I5Polymorphic Outer Membrane Protein Family 0467 (CT869)pmp_16Polymorphic Outer Membrane Protein E Family 0468 fCT869)pmp_17Polymorphic Outer Membnnc Protein E Family 0469 (CT869)ptnp_17PMP_t7 (Fame-shift with 0468) 0470 fCT869)prnp_I7PMP_17 (Fame-shill with 0469) 0471 (CT870)pmp_18Polymorphic Outer Membrane Protein FrF Family 0579 fCT412)prrtp_19Polymorphic Membrane Protein A Family 0540 (CT413)pmp Polytrrorphic Membrane 30 Protein B Family 0967 (CT8t2)pmp_21Polymocphic Membrane Protein D Family 0562 CHLPS 47 kDa Protein Hotnolog_I
1 0927 CHLPS 47 kDa Protein S Homolog_2 0928 CHL?S -43 kDa Protein Homolog 3 0929 CHL.'S _43 kDa Protein Homolog 4 0728 (CT622) CHL.'N 76kDa Homolog_I
(CT622) 07.9 (CT623) CHLPN 76kDa Homolog_3 (CT623) 0137 (CTI09) CHLI'S Hypothetical Protein 0332 (CTO81 CHL"'R T2 Protein ) Mistellonmur Err-rymu~Conservtd Prote irtf 0193 argR Possi de Arginine Repressor 106 Arort atie Amino Aeid Hydroxyiase 25 0232 Similarity ro 5'-Methylthioadentnine Nucleosidase 0128 (CT035) Biotin Protein Ligue 0513 (CT426) Fe-S Oxidoreducuse_I
I (CT767) Fe-S Oxidorcductue 2 0373 (CT057)gepE GcpE Protein 30 0407 (CT103)' HAD Superfamily HydrolauJPhosphatue 0917 (CT771) HydrolasdPhosphatue Homolog 0488 (CT385)ycfF HIT Family Hydrolase 070! (CT675)karG Arginine Kinase 0526 (CT399)kpsF GutQ/KpsF Family Sugar-P
Isomense 35 0919 (CT773)Idh Leucine Dehydrogenase 0022 (CT349)maC Mafprotein 0997 (CT840)mes! PP-loop superfamily ATPase OISI (CT148)mhpA Monooxygrnase 0730 (CT624)mviN Integral Membrane Protein 0861 (CT720) NiN-Related Protein 0479 (CT380)phnP Metal Dependent Hydrolase 0106 (CT015)phoH ATPase 0729 (CT084) Phophotipue D Sttperfamily 0435 (CTI84) Phospholipase D Superfamily 45 0581 (CT464) Phosphoglycolate Phosphanse 0897 (CT754) Predicted Phosphohydrolue 0509 (CT422) Predicud Metalloen:yme 1030 (CT375) Pmdicted D-Amino Acid Dehyrogenase 0531 (CT404) SAM Deprndent Methyltramferue 50 0337 (CT076)smp8 Srnatl Protein B
0394 (CT256)t(yC_ICBS Domain Protein (Hemolysin Homolog)_t 0510 (CT423)ttyC_2CHS Domains (Hemolysin Homolog)_2 0382 (CT048)yabC SAM-Dependent Methyarnaferase 0787 (CT594)yabD PHP Superfamity (Urcase/Pyrimidinuc) Hydrolau 55 0611 (CT492)yacE Predicted PhoaphatuelKinue 0579 (CT462)yachtSugar Nucleotide: Phosphorytue OS78 (CT461)yael Phosphohydrolase _ 0145(CT071yaeM CT071 Hypothetical Ptotem ) 0566(CT450)yaeS YaeS family Hypothetical Protein 0591(CT472)yagE YagE family 0039(CT335)ybaB YbaH family Hypothetical Protein OI01(CTOl2)ybbP YbbP family Hypothetical Protein 0915(CT769)ybeB iojap Superfamily Ortholog 0137(CTf08)ybgl ACR family 0529(CT402)ycaH ATPau 0438(CT287)ycbF PP-loop Superfamily ATPase 1 0734(CT627)yceA YceA Hypothetical Protein ~
0954(CT804)ychH Predicted Kinase 0261(CT217)yda0 PPLoop Superfamily ATPase 0245(CT127)ydh0 Polysaccharide Hydrolue-tnvasin Repeat Family 0573(CT457)yebC YebC Family Hypothetical Protein IS 0689(CT687)yfh0_I Nif$-rclatedAminotransfenae_I
0862(CT721yfh0 2 Nits-related Aminomnsfetau-2 ) 0547(CT43t)ygbB YgbB Family Hypothetical Protein 0237(CT184)yggF YggF Ftunily Hypothetical Protein 0775(CT606)yggV YggV Family Hypothetical Promin 0396(CTZ58)yh10,3 NifS-related AminotnnsCense 0605(CT487)yhhf Predicted Methylase 0575(CT458)yhhY Amino Group Acetyl Tnnsfense 0592(CT473)yidD YidD Family 0982(CT825)yigN YigN Family Hypothetical Protein 25 0657(CT537)yjeE YjeE Hypothetical Protein 0768(CT644)yohl Yoht Predicted Oxidoteductue 0336(CT077)yajL YojL Hypothetical Protein 0217(CT140)ypdP YpdP Hypothetical Protein 0140(CT212)yqdE YqdE Hypothetical Protein 0263(CT221yqfiJ YqfU Hypothetical Protein ) 0139(CT211yqgE YqgE Hypothetical Protein ) 0270(CT137)ywlC SuAS Superfamilyrelated Protein 0879(CT738)yyc! Menl Dependent Hydrolase 35 Homologs to CHLTR Hypothetical Caling Genes 0001(CT001CTOOI Hypothetical Protein ) 0020(CT351CT351 Nypothetieal Protein ) 0021(CT350)CT350 Hypothetical Protein 0026(CT345)CT345 Hypothetical Protein 0035(CT339)CT339 Hypothetical Protein 0036(CT338)CT338 Hypothetical Protein 0055(CT296)CT296 Hypothetical Protein 0062(CT289)CT289 Hypothetical Proxin 0065(CTZ88)CT288 Hypothetical Protein 45 0068(CT360)CT360 Hypothetical Protein 0071(CT325)CT325 Hypothetical Protein 0072(CT324)CT324 Hypothetical Protein 0085(CT31CT711 Hypothetical Protein l ) 0087(CT309)CT309 Hypothetical Protein 0093(CT303)CT303 Hypoehstieal Protein 0100(CT011CT011 Hypothetiesl Protein ) 0104(CT017)CT017 Hypothetical Protein 0105(CT016)CT016 Hypothetical Protein 0107(CT058)CT058 Hypothetical Protein_I
55 otoetcrnlg)crolg similarity 011 (CT021CT021 Hypothetical Protein I ) 0121(CT031CT031 Hypothetical Protein ) 0129(CT036tCT036 Similarity 0145(CTt CT114 Hypothetical 14) Protein Ot50(CTI47)CT147 Hypothetical Protein 0152(CTt49)CT149 Hypothetical Protein 0176(CTI53)CT153 Hypothetical Protein 0188(CT132)CT132 Hypothetical Protein 0189(CT131CTl3l Hypothetical ) Protein 0206(CT203)CT203 Hypothetit:al Protein 0229(CT178)CT178 Hypothetical Protein 0230(CT179)CT179 Hypothetical Protein 0234(CT18ICT181 Hypothetical ) Protein 0249(CTI51CTlS t Hypothetical ) Protein - 0253(CT144)CT144 Hypothetical Protein_1 0254(CT143)CT143 HypoUtetical Protein-1 I S 0255(CT142)CT142 Hypothetical Protein_I
0256(CTtaa)CT144 Hypothetical Protein 2 0257(CT143)CT143 Hypothetical Protein 2 0259(CT142)CT142 Hypothetical Protein 2 0276(CT191CT191 Hypothetiesl ) Protein 0288(CT195)CT195 Hypothetical Protein 0293(CT234)CT234 Hypothetical Protein 0301(CT242)CT368 Hypothetical Protein 0303(CT244)CT244 Hypothetical Protein 0308(CT249)CT249 Similuity 25 0312(CT101)CT101 HypothetiealProtein 0328(CTO85)CT085 Hypothetical Proosin 0330(CT083)CT083 Hypothetical Protein 0331(CT082)CT082 Hypothetical Protein 0374(CT079)CT079 Similarity 0342(CT073)CTOT3 Hypothetical Protein 0343(CT073)(hams-ahiR
with 0342?) 0350(CT066)CT066 Hypothetical Protein 0369(CT058)CTO58 Hypothetical Protein 2 0370(CTO58)CT058 Hypothetical Protein 3 35 0374(CT056)CT056 Hypothetical Protein 0379(00053)CT053 Hypothetical Protein 0381(CT326)CT326 Similarity 0383(CT047)CT047 Hypothetical Protein 0387(CT043)CT043 Hypothetical Protein 0389(CT041CT04 t Hypotitetieal ) Protein 0393(CT038)01'038 Hypothetical Protein 0395(t.'C257)CT257 Hypothetical Protein 0399(CT253)CT253 Hypothetical Protein 0400(CT254)CT254 Hypothetical Protein 45 0401(CT255)CT255 Hypothetical Protein 0405(CT10S)CTI05 Hypothetical Protein 0408(CT102)CT102 Hypothetical Protein 0409(CT260)Cf260 Hypotheatal Protein 0411(CT262)CT262 Hypothetical Prooein 0412(CT263)CT'263 Hypothetical Protein 0415(t:T266)CT266 Hypothetiea!
Protein 0420(CT271CT271 Hypothetical ) Protein 0422(CT273)CT273 Hypothetical Protein 0423(CT274)CT274 Hypothetical Protein 55 0425(CT276)CT276 Hypothetical Proteins 0426(CT277)CT277 Similarity 0434(CT283)CT283 Hypothetical Protein 0441ICT007)CT007 Hypothetical Protein 0442(CT006)CT006 Hypothetical Protein 0443(CT003)CT003 Hypothetical Protein 0474(CT363)CT363 Hypothetical Protein 0476(CT863)CT863 Hypothetical Protein 0480(C7383)CT383 Hypothetical Protein 0485(CT382)CT382.1 Hypothetical Protein 0487(CT384)CT384 Hypothetical Protein 0489(CT386)CT386 Hypothetieat Protein 1 0490(CT387)CT387 Hypothetical ~ Proxin 0491(CT389)CT389 Hypothetical Protein 0496(CT791CT391 Hypothetical ) Protein 0497(CT388)CT388 Hypothetical Protein 0506(CT421CT421 Hypothetical ) Protein 1 0507(CT421CT421.1 Hypothetical S ) Protein 0508(CT421CT421.2 Hypothetical ) Protein Osl2(CT423)CT423 Hypothetical Protein 0314(CT427)CT427 Hypothetical Protein 0518(CT429)CT429 Hypothetical Protein 2~ Os22(CT433)CT433 Hypothetical Protein 0525(CT398)CT398 Hypothetical Protein 0533(CT406)CT406 Hypothetical Protein 0537(CT814)CT814.1 Hypothetical Protein 0538(CT814)CT814 Hypothetical Protein 25 oss4(CT440)CT440 Hypothetical Prouin OSS9(CT441)CT441.1 Hypothetical Protein 0363(G?449)CT449 Hypothetical Protein 0372(CT436)CT436 Hypothetical Protein 0382(CT463)CT463 HypotlKtieal Protein 30 0383(CT466)CT466 Hypothetical Protein 0388(CT469)CT469 ~iypothetieal Protein 0589(CT470)CT470 Hypothetical Protein 0390(CT471)CT471 Hypothetical ProOein 0393(CT474)CT474 Hypothetical Protein 35 0393(CT476)CT476 Hypothetical Protein 0601(CT483)CT483 Hypothetical Protein 0602(CT484)CT484 Hypothetical Protein 0606(CT488)CT488 Hypothetical Protein 0609(CT490)CT490 Hypothetical Protein 4U 0622(CT303)CT303 Hypothetical Protein 0623(CTS04)CT304 Hypothetical Protein 0648(CTS29)CTS29 Hypothetical Protein 0658(CTS38)CT338 Hypothetical Protein 0667(CT346)CT346 Hypothetical Protein 45 0668(CTS47)CT347 Hypothetical Protein 0669(CTS48)CT348 Hypothetical Protein 0671(CTS30)CT350 Hypothetical Protein 0673(CT332)CT332 Hypothetical Protein 0673(CT696)CT696 Hypothedeal Protein 0676(CT695)CT693 Similarity 0681(CT691CT691 Hypothetical ) Prooein 0687(CT482)CT482 Hypothetical Protein 0688(CT481CT481 Hypothetical ) Protein 0700(CT676)CT676 Hypothetical Protein 55 0703(CT671)CT671 Hypothetical Protein 0706(CT670)CT670 Hypothetical Protein 0708(CT668)CT668 Hypothetical Protein 0709 vCT667)CT6b7 Hypothetical Prouin 0710 ~CT666)CTb6b Hypothetical Protein 0711 lCTbbS)CT665 Hypothetical Protein 0713 (CTb63)CT663 Hypothetical Prouin 0717 (CT6Sb)CTbSb Hypothetical Prouin 0718 (CT6S7)CT637 Hypothetical Prouin 0720 (CT659)CT659 Hypothetical Prouin 0722 (CTbS4)CTbS4 Hypothetical Prouin 0725 (CTbS2)CT652.1 Hypothetical Prouin 1 0726 i CT620 Hypothetical ~ CT620)Prouin 0727 (CT619)CT619 Hypothetical Ptouin 0739 fCTb38)CT368 Hypothetical Prouin 0742 (CT63S)CT635 Hypothetical Prouin 0746 (CTb32)CT632 Hypothetical Prouin I 0747 (CTb31CT631 Hypothetical S ) Prouin 0751 (CTbSCT65I Hypotheti:at 1 Protein ) 0755 (CT616)CT616 Hypotheti:al Prouin 0760 (CTbII)CT611 Hypotheti:alProuin 07b1 (CT610)CT610 Hypotheti:al Prouin 0764 (CT648)CT648 Hypotheti:al Prouin 0765 (C1'647)CT647 Hypotheti:al Prouin 076b (CT646)CT64b Hypothetic al Prouin 07b7 (CT64S)CT64S Hypothed al Prouin 0770 (CT642)CT642 Hypotheti ;al Protein 25 0774 (CT606)CT60b.1 Hypothetical Prouin 077b (CT605)CT60S Hypothetical Protein 0779 (CT602)CT602 Hypothetical Protein 0783 (CTS98)CTS98 Hypothetical Protein 0791 (CTS90)CT590 Hypothetical Protein 0792 (CTS89)CT589 Hypothet'rcal Protein 0803 (CTS84)CTS84 Hypothetical Prouin 0807 (CTS80)CTS80 Hypothetical Protein 0808 (CTS79)CT579 Hypothetical Prouin 0809 (CTS78)CTS78 Hypothetical Protein 3 0810 (CTS77)CT577 Hypothetical > Protein 0814 (CT573)CTS73 Hypothetical Protein 0818 (CT569)CTS69 Hypothetical Prouin 0819 (CTS68)CTS68 Hypothetical Prouin 0820 (CTSb7)CTSb7 Hypothetical Protein 0821 (CTS66)CTSbb Hypothetical Protein OB22 (CTSbS)CTS65 Hypothetical Protein 0827 (CTS60)CTS60 Hypothetical Prouin 0834 (CTSSb)CTSSb Hypothetical Prouin 0840 (CT700)CT700 Hypothetical Protein 45 0842 (CT702)CT702 Hypothetical Protein 0843 (CT702)CT702 Hypothetical Prouin 0852 (CT711CT71 ! Hypothetical ) Protein 0851 (CT712)CT712 Hypothetical Prouin 0857 (CT716)CT7Ib Hypothetical Prouin OSS9 (CT718)CT718 Hypothetical Prouin 0865 (CT724)CT724 Hypothetical Prouin 0869 (CT728)CT728 Hypothetical Prouin 0874 (Ct'773)CT733 Hypothetical Protein 0875 (CT734)CT734 Hypothetical Protein 55 0884 (CT741)CT741 HypotheticalProuin 0887 (CT744)CHLTR Possible Phosphoprouin 0896 tCT753)CT751 Hypothetical Prouin 0906 (CT7631CT763 Hypothetical Protein 0908 (CT764)CT764 Hypothetical Protein 0912 (CT768)CT768 Hypothetical Protein 0925 (CT779)CT779 Hypothetical Prouin 0938 (CT788)CT78B Hypothetical Protein 0939 (CT790)CT790 Hypothetical Prouin 0943 (CT794)CT794.1 Hypothetical Prouin 0945 (C'f795)CT795 Hypothetical Prouin 0956 (CT805)CTSOS Hypothetical Prouin 1 0960 (CT809)CT809 Hypothetical Prouin ~
0989 (CT832)CT832 Hypothetical Protein 0994 (CT837)CT837 Hypothetical Prouin 0995 (CT838)CT838 Hypothetical Prouin 0996 (CT839)CT839 Hypothetical Prouin I 1002 (CTB45)CT845 Hypothetical Protein S
1003 (CT846)CT846 Hypothetical Protein 1004 (CT847)CT847 Hypothetical Prouin 1005 (CT848)CT848 Hypothetical Prouin 1006 (CT849)CT849 Hypothetical Prouin 1001 (CT849)CT849.1 Hypothetical Protein 1008 (CT850)CT850 Hypothetical Prouin 1010 (CT852)CT852 Hypothetical Prouin 1011 (CT853)CT853 Hypothetical Prouin 1015 (CT857)CT857 Hypothetical Prouin 25 1016 (CT858)CT858 Hypothetical Prouin IOl9 (CT860)CT860 Hypothetical Prooein 1020 (CT861CT861 Hypothetical Prouin ) 1022 {CT863)CT863 Hypothetical Prouin 1032 (CT373)CT373 Hypothetical Prouin 30 IOl3 (CT372)CT372 Hypothetical Prouin 1034 (ty CT371 Hypothetical Protein f37I
) 1057 (CT356)CT356 Hypothetical Prouin 1058 (CT355)CT355 Hypothetical Prouin 1061 (CT330)CT330 Hypothetical Prouin 35 1077 (CT371CT77I Hypothetical Prouin ) Coding Genes Vot in C. trachomaris 0486 Hypothetical Praline Permeau 0279 Possible ABC Transporter Petmease Prouin 0505 3-Methyladenine DNA Glycosylue 0193 argR Similarity to Arginine Reprcswr 1041 bioA Adenosylmethionine-8-Amitto-7-Oxononanoate Aminouatuferue 1044 bioB Biotin Synthase 1042 bioD Dethiobiotin synthetue 45 0585 Similarity to Cps tneA 2 0562 CHIPS 43 kDa Prouin Homolog_I
0927 CHLPS 43 kDa Prouin Homolog_2 0928 CHLPS 43 kDa Prouin Homolog_3 0929 CHLPS 43 kDa Prouin Hornolog 1045 Conxrved Hypothetical Metttbrana fhouin 0251 Conxrved Hypothetical Prouin 0278 Comerved Ouur Membrane Lipoprotein Protein 0907 CutA-like Periplumic Divalent Cation Tolerance Protein 0171 guaA GMP Synthase 55 0172 guaB lnosine 5'-Motwphosphue Dehydrogenase 0608 Uridine 5'-Monophosphate Synthase 0735 Uridine Kinase pgg0 Similar w Sacchnromyces ctrevisiat 52.9KDa Protein 0232 Similarity to 5'Wtethyhhioadanosine Nucleosidue 1046 Tryptophan Hydroxylase 0477 yqeV Conserved Hypothetical Bs Protein 0048 yqfF-Bs Conserved Hypothetical 1\A Protein 0587 yvyD_Bs Conxrved Hypothetical Protein 0143 yxjG Conxrved Hypothetical Bs_l Protein 0448 yxjG
Bs_2 Conserved Hypothetical Protein 0007 oral o4ss o97s ' 0008 0190 0456 1018 - 0010 0204 0458 t027 0042 ozl4 o46s loss olss o3s3 o74s Ols9 0357 0796 0162 o3s8 0797 ~1 r.
CKYFYLR..~YPPPP~rISIA:U. ~'K:.RVL1ITF:::Frlt:.Lf.l.uwl.F:.TL.:LFCiSMLS
Clalssdrdla t~~~nslt 0tnar Beaodw FCLG tCi.~.Ai.CCViJII9GLL :LLVKREI
.roulos P1YRPEEI Pf~V,,'LAPSEEPAiAAACK':
LACL
PKELOpLLTtDLOEVJ:BSI:R~tbSitYBltliilLNDAw(~IVFDEY'.~.~VV
CPn_OOOt )30 4 AOB~IDWFLINCGRSih!!!'AESLSLDLFNVSKRLCTLPSCDVAC~C'klGStlK!'tllJl~1 ~.'l'001 hyp~~hnr ieal Protein SLHCEIHKYAVAFORNSYAhAEKAFAKALuALEESVYRSL?QSYRDKFLESERAKIPNNG
TSLRRKANUGKIIRGLSSLIVLLCAW~GLICITHNKWILAKI~'OCVS'IPPR~RNLCKQSFf' ' KRWCDEIKI ' CCYIIi iIACVICLLS!'CPFC3KK'aRHSHCD5C3SLuCHSHHSOKtIIWLRDDAK.iGCAEKKILi AT HI
Q CPn OOtO.: 11768 15715 q75 ~Pn nn0? 570 :.rm.. ,.:"
~ r nt,r. y ... :,n.vi.i... ~..,., rr~... ...._ ; y'~.h,;vt~:: ~.~r..
'~rl.r': r:\c:x: :~:.r ~:,r--. :: ~\F~:,I:I:I::.r. -w::;. : w \:':" ;
' w ::
T
::
v ' ' .. .. FLKAWRKCAWtT'l'FEK1CF-iKKNWAVEEANARRLKYVROWYDfiEFQKnY:6RLEKWAL
.
.
n :.. _ .
:
.
,..I:.:I L;.F.c.::.:.:.::i .:,.i :. "~.
:,.
HHWNVEDLREDSVTSDttJREEFLRHVPE.iIJGGLVKVPAVIKYP6YSVSIRDiKIQETRSNLEKAYGIEENYRCCVR
OpEIfYiIKEEEKKEAEFK>T~BCIL
Spear ~cer ~pp~',I,pIFSp(ytya',HILKL.OICGTAEVC41CIL$OIIBSRLEIVfm'V
CPn_0003 889 2370 KDIPCRIEEiEKTIJWAG.PLLPTKKAFEKACSOYNSCADILEKVKPYCXCSIaYYISKE
qatA-Glu tRNA Gin NsidotransEerae RLVSLDEDLRRAYFfl~AFOCDSGLESEVRACRlCLRERIQEFL:pGLDL.VDfSLLCVS
~
IDK
KEOALEOAEf .
SRW~II'Z~DCVSCIfKKGPPGKKFYAOYYDEIYRVRVOSIBtIflIISERLK1~VOAC~01LK
KINYRYSALEWtAViIGSLTA'ICVfafFFNRIEEA30VCAPISLC
G
KRSRGEPLGKi.ACVPUCIKLNItM'CLKTCCASRVL>T1YOPPFDATVVERIKI<F~CIILYKEIRKNKEKRLVGTKI
VA'l'QQRIQCFQPSOIVESSNOIVSLIt KI1~EFAMGSTTLYS11FNF17BiP41DLSRVFOGSSCGSAAAVSARFCPVALCSD1GGSIA.
NRI~ftS
QPMFCCWGFKPSYCAVSRYCLVAFJ1SSLDDICPL.AN1'VmVALJB~VfSGADPKD11TSOKIIItFLF
REFFRDSFNSKLS?EVPIfVIGVPRTFLECLRDDIRCNFFSSL1IFDCl~rTHLVDVCLDIL
C' 0011 15877 Iddl1 ' CPf1 'KEVIBIKILIr .
' OatB-IPet1121 Glu tRM1 Gln AmidotransEerase 3HAVSIYYILASAFJ1ATM.IIRFDCVRYGYRSPQ11fI1'ISOLYDISROQ3f1t8 Subunit) ' LYL
P14YSIlfCAAPAIWVSP'IPPEfTItBYIPKDSKSRJ1LGITLLVL'GIIwV1fiG71IVtBGVIS
NYVLSAERQNVYYKKATAVRAKIVKAFRTAFEKCEILAMPVCSSPAFEIGEILDPIr!ELKELQ41'I
QDIYTVM4dLAYLPAIAVPSCFSKF7CLPLGL0IIG00C~DQQVCQVCYSFGEHAOIKOLFK
QIAITFJID
GLSALIVCCLCISTISL ..NVLfVIGLILLLRXRELTLEpIEA
SKRY)1K~S 09TDtSLEKIFiiSRYSDQCf WR711'OKILDLESSLSSITSEFRDLRQLFDEEKIELLBGI
RLLEFIAANLFKOCRDVYIi4GGHLADIRAYIIOPNtMNIWVIEKAKAWHEFIVLT'LlUIR
CPn_0004 233 f~fP
VacB-IPeelllt Glu CRNII Gln AnliriotransEeras 1B Subllnttl ' G
LICQIOCCSRASINSAVYADWESVIGLEVHVGL.NI'ASIa.!'SSAIlJAFGDLPNlZ4IS~C10011 16596 1831?
CPi!
LPGSLPVGNOSAVEKAVLFGCAVECLISLLSRFZMKS7fFYPDgpRHIOLrppplplIl~i' 11 QatB-IPstllZl Clu tRNA Cln Asidotransferass IB Subunitl RTKAIVQGEERYFELAQTHIEDDiiIGNLKHFGEFACVDYIiRAGVPLILIV5KPQ0CPl~GIRVFFLI(NICYCLW(~
4Y00'~A~:RLLYNSVOKSYADRLFSYflITKMMDTPLIPNBE
VAYATSLVS:.LDYIGISDQ~B1EEGSIRFDVNVSVRPIa:SPELRNKVEIKN6~1SFA1NWAK00CAfJ10tAE'LEG
OKILLDYGKSIFWLNENDEINIl4DPWSWCWIIfKTRICVIpEVDDS
LFUfWRQIDEYLNOPt4KDPIG.VIPMTYRWDPEIGIKIYIJ9tLKESAt~YKYFPEPtE.PTD
13~1f~r r rrreSK~~KL(SDLVDRLEDU1K19fFlWKQI~VCIR
LQLTESYIERIRIiTLPF.LPYDKYfOtYIOEYGLSmIASILISDIOQIATFFEV11CKDGC~1F.
~
VKDLKAKYCGTVDPKQL1TF~11QGtVLT.E71.SLETFLDSIESELVOCLEDQDIYfiIt~DVI~L
RSLSlIWIIIYEFGGRCKTLGV10:.PSSGiFPEGVACLVNAIt7pCVIIGKIAKEIA0U11ESPM'1'O~EEODI~~I
"~'WK~'~~IITIZ?BC
VDY10~KTKAiGFLVCOtIBCtT
GKNPEEILIC>XPELI.PNSDE7GELQKIIAEIfVLANPE.STDNKARLKILIfEDITSVLPEIDEIL'TCISLLEi.P
LL.TTRELLTKSYLICFKICSETLiafl'S
AGIUPPKRVNELLLLLDKG
VF~lII7NOEYEVOLONLCFR4CISQKTGKKQDOFAId.EDOVAL4KKRLKEL'1'~'iFCIQ
GFNFIOC~FItIIAAKDLYIRST
AKI~.C$LpI.DBKFi.LOICEIIGCCEIRQI(ltpQRNADRSRfITI'YQKLIIIAEG.ALEL.1UDCI
Outer Nse6sane~Protein AVDfi'S~P. OFPOEIfTPFVKVQAVttARODSFVBLCAISRDFSDSHL.YM.AIPIaIU.IGDtF
t'IdDYTEIdGItGSIECRPNARHmIHCCSKlRPAOpYYHWJWYS CSNLARQAGIYQA9GFRSLGAAA6 CPn_0006 7299 7111 LFrIIICF!'SfRCBSRSYNVDAGSKIKF
No robust homoloQ Dcesent in Gentbenk/EI~L
as of 11/7/98 CRL0011 11365 21922 KQLQEPLRSALLERLSEWLVLtGITSPETTRSTPEKpMpLPKDSRNItTLESLp11~3-POlymorphic Ouur Nalsbsane Protein IONOSIYPI't0C8SFPKFVFSTFAIFPLSNIATETYLDSSASFDCNKNf~IFBVIIESQEDA~D
CPn ' ' ' _ YDAG
No robust houolop Dresent in Cewbank/E'!'Z8LIYAGAAVHSS
as of 11/7/98 KCDLTP1~ISLLF(71 TTYLfKf~VTLENLPGTCL'AITIfSCFNtII
"
KSFRYNLSLIFSFLWIPLTDSTTSSLSI'SLLOECNPOSt9IKLRILAIVLIALSIILIAGIIGSLSLTIWSVCSSAKT
ICRIIIAVLS
WDK.STTFIC!'SSLSFI1LSPCSSITTCKGAVSCS
GWLLTVAIPGLSSVISSPACNGACALGCV!C.AIGIDVLLXXAEVPIVIrISV'lITPG'IIGSPOfU.FH
PRSGISISCADSTIRSLP1'YLLDmQiPQSNRKtRILAIVLIVFSIILIASC<fVLL'M1IP
GLSSVISSP7~tIGACALCCIIHLrILGItNLi.IfJIRIVPIVLASYIT?PCi'GSPRSGLSISGACPn,-0015 DSl'IRSLPTYPGDEGHPOSNRItLRILAIVLIVFSIILIASGWLLTVAIPGLSSIISSPApmD_3-PNP_7 IEramt-shift with 0011 ' FMG71CALGCI.rLAiGIOVLLKKREVPIWPAPIPEEWIDDIDEESIItLQQEAtAALARLSSKKOGAIplSDALTIT
LEFdQiVSLLPSKNFSTDNGC11ITAKTLSLTCTTMSALFSEIJ1 PEE?tSAFECYIKWFSNLI~cSLPYDCHGLEEtcTKtIQIRWRSSLKANVPEFLDIRRIFGNOGEVSFSDN'I'SSDSG
AiIIITEASVTISNNAKVSPIDNKVIGASa~SITCDNStxIIIGY
' ' EEEFF?LSaRKRLIDIATTLVERKILTEQLEW9iLRJtAESYLYQDSIPIOtIItINFEIWAPRGGA
l KTS'fOTIML1'CNOIG.LFSDBI'l'S1TAGGAIYVKKLCLASGCLTLFSRHSVI~G
' WKtTIILSKSICRPTIIFENHEtKIIAKSLLHIQ4AVLLEKIIIYRSLOKSYRDIGNSSAl00CISAKM'ALRSAACMI
YFYD
IAIEDSCELSLSADSCDLVFLGNTYfSTI'PCTNRSSIDLC'1 ' LHCNPFFSLEDNIaTINKtNAEl4<.ESLSSYRKVFLALSOENVVD'1'PSDPKIbrDISCIpCRfEAADSKMTSKLLQ
PVrLS
PITICSSTZYi'DVLKVNCI'PADSJ1LOY1'rNIIFTCICiSE
wILSEISRDEQNOKKAHLKNOESLYTQARDRL'1'DOSSKENOKELEIUWEYISSNERVKK~aTLSGKNGVTLQ'SOAP
'1~OADSRLCHDN'T1'LCPAt7lSTINNLVINISSIDCAKKAKIE
F=IERVQERIIGIOKLYPNILEREEEi'IGOETVTPfVQCZTASSDLTDILGRIEYSSREDTKJ1TSKNLTLSC?ITLL
DPTGTFYCrtIISLRNPQSYDIt.ELNASGT<?STAVTPDPIIbEK
:JQNQESCVKVLRSIffiVEflSirE\IKQEYGPKKKEPQOOMGSLERFFTEHIEELEVL4KDYSKFHYGYQG'1WGPI
VNCEGASTTATFNN'tKTGIfIPNPERIGSLVPNSLNNAPIDISSLNYtJI
HLSYFKKVtINKKEVQYAKFRLKVLESDLfCILAOTESAESLLTQEELPILATItGALCKAVE1'ANEGLQGDRAfWCA
GLSNFFHKDSTKTRRCFRHLSGGYVIGGNLIIECSDKIISAAFCQ
' ' FKGSLCCALASKJ1KPYFEEDPRFQDSLri'QLIGLTLRL.OEAKASLEEEIKRFSNLENDIAEI
LFGRWIDYFVAKtfQCTVYGn'LrfOHI4CTYISLPCKLRPCSLSYYPTEIPVLFSCM.BY
ERRLLKFSKQ1'FERAGL,GVLRELAVESTYDLRSLTN7ylECCPESEKVYFSNYLNYYNEEKH?ONDLKTKYITYPI' VKCSW~IOSFALEPxRAPICLDESALfEQYNPFIOfLpNYANDE
' RRAKTRLVMCORYRDFKM1ILEAl~FNEE71LLDEEL.~aIQAPSEL
riFKEOCTF~1REFCSSRLVNLdLFICIRFDKE80CQ0ATYNL?LGY7If~.VRSNIOCTlI
RISGCFCTNLARQAL\tRACNHFCFNSNFEAFSOFSFELRCSSRHYNVLKGAKYQF
!:Pn_0009 10780 11685 7 CPn 0016 21383 251B8 / 1~lymocphic Outar Nelntcant Pcottin No robusr homoloq present in Genebank/EHBLmp as oL 11/
S
~
A f VLLLTLGIPCLTdGISFCACLGF _ ':KYSYLLNYPPPPRRSLGVSCSKLRSLSITL.LVLu'sL7IFFLLMSVSADAADLTLCSRDSYNCDI'~a'fTEl1'P
K
' RSDFALKRCCHNRSSFSLLLIs3 ' 1KICTLDRLPKE .
PSEEPALEKAQKEPF
MTSDASG1TYILDGDVSI:'r'AI:KQTGLTT.~.CFSNTACNLTFIGNu'FSWFONLIBSTVA
CGG'JL'JISGi.LFLLVRREVPIVRSEEIPRCVSVI
(.DOLCI'YIQEVFACLERLKDPKYfDRCLLTEAKEKLRVFDWEKOMHSEFLDIQRVWEE
AYWEHCODFLENIAYEIFSSQELitDYYCAGYCCYLPSCt)ARADRLKRa'VKEVI~ItFNRV~NTMOCiTIfp3CFST
'..RNiJVIPRTTr:KKGAIKITOf:LVFE.sICtJt~LJDrAB&OJC
TWkWEASVMLOHSYCVARELFKIUVGVLEESVYKILFKSYRI)J1FYDCEKAKIQRDGRFKr.AINTKTGSL'LC.STR
fYAFG:NB3.i0Qa:AIYASCO.."VI~ENACILSFGNNSATTSOGiII
' ' At7~
~nABDNLVTSNNQNtPFDCCK.aTTtIX:AtUNKMIANPDPILTI:ii.NESWFWM
' 1'ST
~.AIYTKKLVLSS-GROCVLF:NNYAMIATPF.GC:AIALLDSCEICLSADtL:NI
IF~tfl <.'Pn 'P:GPA.art'RNAL~15NAKFWLPJiTR:IIY.'ItFYDPITS:X:ATfRIL.~,LttKAOIIL'w70tfiYE
DOU') i1e89 11117 _ ~:YtVF!X:EKL.vEEELKKFONLK.:TF'h~lv/KLNi7ALVLKDf~J717t:1Nr!'ft~iSKVYIID
tNr mMr.~r honnlorr prrrsa:nt in r:aneb.<nk/P11BL na of I1/7/nR
tri':;AIIAF/JRYRDINfxWEDLKQ'l'IFWVf:EIIOCTDLC111RN::CHWLDRYaDKFILREKEEKrY:ITFF.
n\:.AhT:V'Ml4c.l.AINID:.LDt:TtIYAf IKATAA:.KDVAL:x:1'fNI.VLYIOfY'IYYPJIH
Nf.RHELFIIATtIVRKA::f:IIAYAKAKMFEKER::Nt~RY.VKDVEK4iLSIG:U1EFRNpFSRRtlL:~wJVt'f t.ft:1~:A0rlrM!'~rLtltY1'I:If!'ITNIIYr:YQtiWltaJt.'IA:NIn'!k'KtNtICCIIW
AftERLkt3A,"l'LYPEV.SI/F:ERVLERQRTKKVNLFNL'fAD
L EIfKYfOK.'VREOEH'MIfEVFI7K
I ALYRt?%:LKVI::AEEII:a:LLQftLEGt:
Ll:'ISi::KKLTKAE~.'VFE?IKFM\TEKIltIKVLED~
' './'PNRLEIIa:EDAEFI~ItPRIEEIENTLIbAI/ELt'LL.FItKNI'FEKA.SL.vYN.'.t:KENf.AKVEPO.
;~II
':In ~ml'l ..",~
lu :Y.F~PTYR::::("RNLERLN~~fH.~rPn\YTtr.'QERL(t:F::GLE.~.KVRfCRDIILRFJJNKIIFEVOC:f ~uf~1'IMI' I Ilr.nrur-,:hit m.f.
nul~.l ~HFfNRI:W.WW:Af:Lt'IVARLDLVATVPYREPYt.~IYIItItKREKVR::~rMIAKTERYREIROtW!t'fNt:l K~:n:ItVWVf>r\'rAYTKNATI:IWI'K'1':YY.fiJPFHrJ:I'LVtfc:lYl::FVDVR:, Al:r~r;V11KR1'LI.AP:ITtIf*RF.Iri'WIdJ7pOWLl.RDERKNfwRRI.(t'NKIIAMJJhVKf:FIV::II
IIrR::r::::l::::L"1711.YA':i:INit'Idll:IIJKr:NVk::INII::.~.AtiYAll%%SFFTI~:~FFN
r'At'' LL.tt :Yl7KIt111.VAKtIrfIIV'/IJ:AM::YhI
ll a :1~:Y'CI JvK 11.:'r ~I::Ik:l.f'PVtTt1111FAYr:
.:Irr IrrllINbrl'rKY'h:I::INKr:a.A:HLAtt:IF;Y:AI1NVA:%:NR::4N1rlrrlFlN1.P11IYN1() rlnill I f 1'.'.1 11 f:'S
_ rltrt'Kt?%Tt7t:lt:a'V::l'.IA.t'NI
IY, tr.lr,::r lumn.lm pn'.rrnr \'/1~r:IYt'YN.Y::frY:TIIII::IAYI'I'INfRNIrHITITLM
in r:,-rrlr.rnk/EMDI, .r.: .r I1/7/'IH
fYAIOLGGRfCF PSIIITTVDStTlISDSPILG
:'VEKLLDE:EE,~,rEJps't'ERLLpSIG,FILPt.MDtPFF
fEVELRGSSR ' ' ' . l iCLVLTKKCNADILKVSFIpLNKIOVAJIRILA
.L.iRpALLVILU:NHHAFA~NPE. . PClIMP ILI ESCPYIfEVL.KY411t::::OK
.
v~~.,'W.~.tr.'~
LNPICCCSAOVLBSZiEIITAILT~~~
~~
=Pn_OOIN :751! 7'3003 BNALI
LLKLNPLFKBapIFGCH.~.DF'fEPGII1"G~~
ALTT'71 I>eP_5-POlymnrPhu outer Membrane LLKKELDISALOSSINOKIEATITKSOKEPFLKEOLKTIKKEiCLEJICOAJIIDTfKIBER
Protein ' GA33IVLtIMTTPIStPEDGFICDL'~INtFSPKSTCDJ1AG1'lYS
:.YNMKTSVSMLLALLCS SSAEY71ICANYLDWLTLTPWCIQSKCr107LKlGE
rTCF'!TIVGDLTFL~I~tfLKFLSVDAI:ANIAVAHVpCSKNLSLRKRtNPDYAIIiIIIODEIEK:.C'l'LE~.
IAIGG3I' KG~
IIC
CY
C
"RSTAKYLIIAKF
'l P
' ' . a L: ~EVLYIDP ..SKGL
CAYOLODIMWL?SNASVEOCCIITKCNSCLIO L
iPKSIIYRCKGSLVSL G
FTDFL P
iLVITE CI
:
SIv iVIi'llfOtIYGLDEIKORILE:.I,iVGK
i0 tPVIMIDLYOKIG~
KMVDat S
A
. .
. p . l !'fAN ~
:LTiElAltf~fLKFNENKIIVTSGCALDLGAASTI
7f 0 t' FRF3VC~fADFJIEIKGHRRTYICAMPG
~ ~
' _ . . ...lt:\. F
SA ~.. in!~.,. is..
~~ !
.IKtI.;AIFf:OIIt:RKKt . . .... ..
_ rv:.v ICL'.Y
. .. .. . . ::::T~f. ~r ::. '.:;~...: :~..v. WC
''.\:"'9:.'.:\"::1NE.1.:1~T.,....).KfIYLR
_ ' ':dt'.'t.': s ' F r "' ~
~
, ~
. ..
~ .
I:i.:
UCf'J,~WEKPKSKKLTFKI:::iKNWtY:w:KflF:i.iURFYESTP4GVA:IGLiiWfi ~L "f~TL
~ .".: ."IL\TTU
'J:i.:
::.1:.:N..~..:' :!
i~:..'~ 7~Y
:!tl ' TAKYIfCLAASOON
YIESVOVSSLKTDlOILTCOAGEVNKESSCIIWIIYLNSALHRYAPGYTFFPI~pNHINIPE
r:GDLVF~QVTN:APNATCKM1VIHLE.o AASA
LSCSIVFSGERLSI'AFJIIAt7~ILTSRINOPV'I'LVEGSLVL.KOGV'fLSTOG' RINEVS
L SLL.iLLLETPWFNLQ~ETTLTGRVLGVOGIR6KLIAA
J1NOK GJ1TPKDCPSACITfM
fa"OEPESTLL:.Otar~TSL
Ii~tILIFPF~ImRDYEELPA'ILIrCGLItINFYSHY00VLKV11FPKLK
CPr>_0019 19007 30756 CPn pmp_5-PMP_5 Itrtlmesht,Et with 0019)_ O NO robust hanoloA Present in Genebank/EI~OL
' as of 11/7/91 CitaLVNJIOGAtYit IO'IFLOFFHPIVFSOQSLSFLPYIGKSBGI
ASTEDIVITNLSINADTIYGKNPINIV11SA7lNItNITLIIEKCSNIVEHYLHICCDTSVITTGVSCATFL
CHWNIIpVIPCICfQPSQAl6.it'RI
Y
Y
G
8VON71LPISKSEXITKIISYILILPLIIaLfIKIVLRIILFFKYAGLILDVKKEDi.IaTL
O
DYSFVlCLSPGRGCt IITOtUISOKPL6VAPSAPH
RTCYLPNPEROGSLVPNSLHGSFVDORAIOEINVNSSOILCOEAOVWGi1GI11NPI3IRDICI
NEItCYRHSCVCYLVGVGTHAFSOATIN11AFC0LFSRDXO'YWSK1~CCSYSGVVFLEDTLTPOOENLSLPLPSPTTL
KIIINrILIfILVRSGIOfYNELIOECFSFTKITIMr00AP811~DIC
EFRSPQCfYTDSSSE7ICCNpW"I>xIDLSYSHtIldJDtlXTKYTTYPFJ10G&fANDVFGLEFFSYNSLLPNIYFHS
LVSVPNISCEERAIlIYItKEQQEENAVKLKTIpACSFVfASLIIi.PSh GAT'fYYYPNSTFLFDYYSPFLRI4CTYAHOEDFKiI'OOEVRtIFI'SG18.FFE~VPTCVKFEOTKDKKAGFGLL.T
FFPWKIYPL
RFSOCKRGSYEL: JIYVPOVIRKDPKBTATLASGJI'1WSTFI~R~SROCLOLAL~ICLIN
CPlf~0029 13839 13390 PGIE91FSF1GAIL3.ACSSPNYHINLCCKYRF No robust homoloQ present in Genebank/k9'18L
as of 11/7/9L
SNID~tpINHdTYCFNLFRYIRFFR71INIAM4DGLRFCYSYILLRPIdWSSLT.R1~00ELL
CPn_0020 32717 30603 KK10IKLRTZ'STIISSLISLR00LGKAE71TOSDILYGTSRFQYGNSFEIEDPRIPPlNilitQ
Predicted OMP !leader 111) peptide:
outer membrane!
KLWSFIPNt.RIIOCRCFLFLABFVI~fGSSADALTH0EAV10IKNSYLSHFKSVSGIVTIEDCVLOEIIIiSRSVNFL
KIKFYVYLNSERNKTKP
uIIHIaILATCAenIVnrEarlvccss.IavAtK~avMVrnR~LKTLVCDYLEYYEVrI)scLLTNc RFANYPWFLOGSNITLTPETIVIRKClISTSEGPKIfI>LCLSODYLEYSSSLLSIGKTfLCPtL0030 13A10 RVCRTPILFLPPFSiMPMEIPKpPTNFRGGfGGFLGSYLCIISYSPISA1WFSSfFlLDSF0cp-OSialoglycoprocein F.~Opepcidase FKtICVCMGFNLHC~KQVPQ11>fIBOCSYYAt~LJIItkIJIEAHbRYRLNGDFCFTItKHVNfSLKNCTtTIfSLFF
YIKNRANYFYKYVIIDISGYYPFL71CV1R~pQVLFJMSLPIR7PDI.GIVLE
GEYItLSDSWCfVHDIFPNNFML10'tl'CPTRVDCl51tR7lIYFECYLTSSVKVNSFONiWOa.PFLFKSIDd.6F0 0VAVALGPGNFSATAIGISFA0GL71NJIKNVPIdGYSSLITYLLSKDODt YLTLRpYPISIYl~Ti'f.VYLFNIVECG)fLNPIIFSDHIVGFNFSSLRLiWtPKLHKTVPLPIGALIILPLCKPGGY
LTLSSEIPEECLNEKRRGSIGf~LLSYLE71SDYCVAI~Y~NI8PNP0 '."LSS'17.GSSLIYYSDVPEISSRN50LSAKLOLDYRFLLHKSYIORAIIITEPFVTFITLTRLFASSFSDKTTVEE
VAPSVEpIARNVTSOFlISVEIIDItOLSPDYRSYSCIf PLIIQIEOHYIFSIODAFNSLNLLKAGTCfSVLSXIt4PAFPRIN)11Q.F11ZNIL9~TfESKPT
FPKTACELSLPFGKKtIfVSLD~tIWIQOiCWDlB4~tTRNEifIIA'IDNAK1LESTJDtSKYBLCPeL0031 tKCDRENFILINSRPIDQLLDSPLSDNRNLI rs21-521 Ribosomal Protein DTPN
YLEYpNILGT)CTFF2hIQLYGVYERRFaDSAFFFETJC.DKPIDIPPFCMPSVKVRVGEPVDRALAIL10CPCIDKEG
ILKAAKSHRtYOKPSVKKIWf8101ilAKYRBA
0021 34470 32707 CPtL0032 14A81 16098 ~Pn _ dnW-Meat Shook Protein J
Predicted OMP (leader 119) pepeidel CSRSPYPNIETL71AGVF3IRSMGLFNLTLFGLLLCSLPISLV1LKFPESOCHKILYISTOSTSLIGQIVItFVCSVSQ
IDYYSIIGISKTAS71EEIKItAYR7G.71VKYMPDKNPGDR1171iKA1'KE
QOALATYLPaLDJIYGDHDFFVi.AICIC>~YZJCOSZNS80PQ'fRKSTIIG7lGiJ4GSSFaiaVVSEJ1YLYLSDP
OKFDSYDRPGImCPF
' LSQ11!!!1'ADPLOpLLVIS7IVSGltGGKTSDDLLFKALJ1SPYPVTRLE7UlYRLillRi0if1'INIr FFDGLfOCLGE71F'0llRSDPAGARQGABKKVHTNLTFEEAAlIDVAIiLW80YKBCIft~
DHLNSFINKLPEEIf7CLSAAIFi.Rf.E2EESDJ1YIRDLL71J1I0I871IRSATALOIGIYODKROG71VNPGCIK
SCERCKCSGpNVOSRGFFSN7LS11CPECGG8GAII1'DPC8~11000A~II
FLPTLRNLLTSASPQOpEAILYAIGKL1~GQSYYNIKXQLQKPDVDIn'LAAAOALIAIGKRSVNVMIPAGVD
YVFIDVESNlVFp1RG00f.ILdPI
EEDiILPVIK100ALEERPRALYALRfrLPSEIGIPIALPIPLKT1Q'ISFaICLNNALhLL>tLGCCFVDA7Ii~lOO
CEIP1T.LKTDGSCRLTVPEOIOSGTTLKVANOGFPMIIiIOKl311fiDti.VItIS
17T1>XLLEYITERLVOPNYNETLALSFSI~RTLONWKRVNIIVPQDPOEAERLLBTlRGLEVCfP(~1LS880XELLR
TF11STLKAGiFPIO(RSPLDKIKCFFSDF1Y
EpILTFLFRLPKE11YLPCIYKLTr~QKTOLATT11ISFL8lffSHQIALDIi.FQJWIID.PGEP' IIRAyJIDUITyMItICppEtO(RSLHDyA>LIO~fLLF~T~IORpIip~fpyLAYQVTPESCPn_0033 16129 RTRlG.DILETL71TSK8SEDIRLLIQIXf80DAl01FPVLAGLLIKIVEpdhAiB/odbJliodb8-lpyruvacef Oxoisovalerace Dehydsopeaase Alpha i Beta Fusion CPn ERSIk:VtroFIOVISSIRDVLKLVwELR!'AEtIK,C.LLSAOSGSOGTIbLSCIlOIt>Q.i~fL)1G
_ KSLIPGKDiiSFPYYRDOCFPIGIGCDLSEIFASFWAZTPNIISSAAIIIPYIIYrBIC
,Nyt TILQVIStICCiNSNTRSFYSMSLPLViGSSSPRRKTILEKFRVPFTVIPBNPDESKVSYSC085V9CI10!'LOAAGR
7WIAYKfISSADEYVYVBGCOGJITSpGEFNmIIJO!VJItiKaLtLITII
GDPIAYTQEIrI~OKAYAVSELHSPCDCTILTGD1'IVSYDGAItTKPODKADAIQtC.KTLRI~WAISVPfEDOCGiI
DIJISLGRCIIOGLAVYEVDOGTIYTSLTETFSHJIVDOwIbNSVP
NQ1'HITItIiSIAVLNKCKLLiGSET50ISLTNIPDIIAIESYID1YCT'IMiCGAYDUCHGGLALILTDVVRLSSNS
NSDNQEKYRSt_~r rr e(~DPLILLEKE7IIMIfCiSPFIIEEIK11 ZLIDfNtGCVYNVpCLPIOTLKYLLEELHIDLWDYSItIIOEEVRXSCETAEALPFPSKGSTSHEVFSPYTGTLIDYEN
&ESCPKVl4tMI
SE11LVEEMTR054yIVfCEDVACDIIGf.IF4YtRNLTEKFGPpACFNSPLJ1GTITGTAIC
CPrL0023 36657 35011 MALDGTHKP1NEIOFADYIWPGINDLFSEASSIYYRSACiWEVPLVTA71P~T000PY
Y5)K/alr-A8C Transporter Protein HSOSIEGFLAHCPCIKVAYPSNA11D7dULLWl7IIiIDPNPWFLENKJ1T.Y0A1(IFS7IGlVF
ATPase ENRAKLLYSKOHFVt4.S7WSIVLDKIGKBLGTRILFDDVSWFNPGNCYGLTGPNGACICSSNOYVLPFGKAAIVtiPG
KDLTTVSGKRfPLVLSLEYApELASACISTMDLRilNFCDFA
TLLKIINQIIEPTACSISLPKKVGILRONIDBFNDf'M.DCVIFI~TfRIF1t71LQRRONLYTVLKSLEKTGRLLVIH
EASEFCGFGSELVATMSEpCYAYLDAPIRRLOCLNJIPVp)ISKVL
C,QEPTDAIGMELCEIEEIICEFaICYRA0S8AEELLTGIGIPNENFDK101A1fIPIDI4FRVtNEIILPHKESILOA
AKSLAEF
LLCOALFGNPEALLLDEPTrMLDLYSINWfGNFLImYEGTVIWSHDRHFWTITTHIAD
IDYDTIIIYPGNYDOMhI4KTASRDQEK71DIKSKEKKISOLKEtIIAKfC7lGSRASOVOSRCPn_0034 19196 LREIKKL0P0ELKKSNIORPYIRFPLSDKBSGKWL.SLEAITKOYGOHQVIHPFSLETYOCT315 hypothetical Protein CDKLGI1CNNGLGKTTIaOCLLACLVFAPSBGSIKLGHOATCSYFPONFISDVLADCGOE'fLFVNFLLP1'fCRGI4M
EISTPSLPOSSIVSOKTPPVPDPDSSPOHIPTIP3'pAPFKKP~Dt EWLRNRKTGINDOEIASVLGKNLFGGDDAfKOIpALSCCETAALLMAGMILENtOJVLILDETPSSIVNJIIAFAIG7t FLSCIL~GVP'AICLGCSLEITMPLFILTAVFIAFTLLYFINYLEK
EAIdIHLDLESVSALSWATNDYKGTAIFVSHDRCLIODCATKLLIFDKDItITFPOG?MYDYPKIPCPLPTPPPSPfLM
PTLTPIPAPAPGIPLPP'fLPINDRTKLTCNPDIIIYPB'f)IDP
TAGNKQLL
KACFSLLKQLFSLDPETRPEDRKYSNKLASTLLRSKEKSGFRFHCFKCIIPBNOKILNKKS
G11WISSHSSMDFSTTIrGMPAVITCi,QRSCwEKIKNNIPTPEIWLPIG~fSCPNDVEE
CPn_0024 37605 3b661 GAQLYTSHLIVINPPTLETLIKEKMRRAITLIIDFSNKEAFTNLVWYIACFDTCIGI~iLE
xerGIncegrase/recombinase SVOLEVFGLNNLSADOEEFTTWESCCHLAtd.ESVRILLASKEIYALSNVSVN8I8pVPLQ
REVMIAStYSFLDYLKMNtSASPNTLRNYCLDWGLKIFLEERCNLAPSSPLOLATEItRKTACMLFLN
vSELPF3LtTKEHVRHYIAKLtENGKAKRTIKRCLSSIKSFAHYCVIOKILLFIJPAETIH
CPRLPKELPSPMT'fAOVEVLMATPDtSKYHCLR0RCt14ELFYSSGLRISEIVAVNKODfD~'.Pn_0035 51115 LSTHLIRIAGKGKKEAIIPVTSNAtOWIQIYLNHPDRKRLEKOPOAtFt.NRfGRRISTRSCT33:( nyporhecic~l protein IDRSFOEILRASCfsCHITPHTIRHTIATHWLESCMDLKTIOALLGHSSLETTIIIY'fOVSARTTLEF~AGSSLKPLP
IITFPCATALYITHRAERKSEHOMWNRCQVFSSFFFRYPISSfiL.
VKLKKtytH0E7WPHA
IRLRASCECFOORHPIFLCGLYWLAGITSItCHPECSALILIPIGNPLPRNPKONLPLASA
WtISIlfLTPAPFLHDCPISGTFVTHHAOCpCCYYGEA(CTOTPCGKRAHNLSCpILSESR
CPn_0025 38610 37681 LELKKV'IELECTLNHTCOIVFKSNACYKEIPRSRFYIMKEKCRCiSCHFL1RIRPPSSEVC
slat:latsA-9ulptushydrolase/GlyeosulfacasePFAa'SLLLCTPLPONLRDLfROKGi.SHLFAISGWHFBLCATTtiML
CALLPLKIKKILSF
ILMSSRELLIIl.CSSOpPTRTRNpCAYt.FRWNGHGLLFOPGF.CTQROFIFANIAP1"IVNRIVLTSI.ACiFPMSL
:.1IWRSWISVTLLCFSWCF"..CSC~ItIRIGAOFtLCGIFF8PF8PTF
IFVSHFHr,DHCLGLCBMLHRWLDKVSHPIHCYYPASGK1(YFDRLIIYCtIYNETIOWENVLSFLATL(:ILLFFPKI
FSFLYTPW~pFLSPtyR.YPIR'ILANTLAISL3AOLFIVLPT110 PI3EECIVEDFC3FRIEAORLQFIpV01'LL19RLTEPDTIKFLPKELESRGTRCLIIODLIRYffLSLPLE7DLL'lI
iLIVPFTILPIIVFLIATITt.PCCCFTTFJ1LI0CFGSNPIi<JIFIPNILK
DpEISIrI:~TNYGi0V5YVRKGD:aIAtIADTLPCOMIDLAKN.iCMMLCE.3T'lLEOHRNL'rL$FAFVPPWl1LT
41::LILFFTGILRTINSPYIISISAT.iIRPTlTL
AE~IIFHMTAKQAATLJ1KRM't'QKLILTI1F.~'.ARY
WLDDFYKEA;AVFPMMVApEYRSYP
FfIfNPLCtIK nPn ou f4 SOr.i') 5179':
IT'I!H hylx~rh~r i.'.O prr)tsrn I:PIr o02r, sr.:17 )s7o2 AK::I~IG::GftKKMYKI'ONIR:'rfD'/RSFFFFDVLCIEOLFY.El1~~YIF.W:.AKtFRLPpIX(EL
rT(4'. irlpxll.:r.iral prot~tn Mr:L:rKRGRLtIFr:IDIWv:::VG:IEtIKE.:F:ICRFFGLLETIEVYI'IRLEKEP'fQLKIIFYVF
.:NF.1M::IILIt::List)::VT::YfIIKI'r~PIKt)AAFt:K::IIeI~Ir:NtAYLtII(.'yl.V'Pf/LW:
AMLItOraaia~".iRI'fIJ.~I'tWIIIRLPNIf7DRilYEY.FF.~.iNtY:Fr7KWEDf7:IF1'NP.::I
VfIr.~K
'MfII'::Wa19r:1.::::LALLVI.I.:':IF'NfCLINWft:M:K'rKKIAFKIM:C~:UITYaA::RKC~~.~nl 'LV'/MNKN~At:I:Nt.'Y:;11:IF1PYr:IERPFA'l0.~.FFFDPRtRRGLI::Iff/t.LNBE::LE
I! L::1'11111A Ih:l'KlIF I It'fULRK<SVNyN'rNKFK:xI IIt::I.FT I L:L1J
II::Y.::YYI'::f :FE:: FI I I :UE1111:3fRO::KN.~.::EL:IiLKNY1J1:3EGYlNE
I F i::Df:
.:::I'l:llINYAKu:Y.A)119'ATtKl::K'r:.'ftf~:::KKKK%TKII
:1lIRTf~.:: f )IKR.~.APKfMI/P.~.K
KitKINI.I,Y.K')VILII(aH.(:IIp:::x:NF:aU:.;::PPfNqpKAILf4IFy'Kpl'fGF::IyUll:'1 ~1'1(li 'Wll:
la:N 1t:: 1INa:l4e...rc: ~.:r Fr,Irin ~ Nlrr ~'In U()::'1 q.'.~':.'. tv~1'IH KLI~k's:l':LHAI
ILI.:::lal.IItT1'IIVIM.'IMNKfTRTfLE.iEKD1'QOUIFFt:~J191.'1'/KN
2~~T
Lu. L.r. ff'I' 1:(w:rrl.nr FW
t..nr.:AN:IINNf/uNIVlll.t'Ir:I:IY'fiJIIFT'IN:IITTNAN.~.INiLIJIUtAWJt7I3KIl:I1'I
R::VFJI
' CPGPCFDIItFIDtLKtANFE~ 'E?I~CYK:Ef~.GKRCIEKLTf:TPILEKYQRIDDRD
HAILC*h~DAFS'~'~:EL AK tLKOLRAOLLr'~LF3CR..tY~ICA IPV
VLLILL.WfIYCALKAL.: Pfl4:.KSP?IlffGYIA
Y~1~S"
' .
ILTL.iLWCRGTCIE'~'~A'~IMfl~tL3YP8L1!~TAFa.LPIJIi rIKLbCIIAIri~' I~~RE
VFiOi G
' ~:fhytOtH 'W1::' S)dJl A
L7C3WRIOVStJtRViA
OL1~1118WFLSINL
~"1 to hypochec ica: pr~rttrt AL.W10GIESPVYSLITAIa~WALLPVFFJ18F~GASI't~tFBLLTYLSPC~ALLKRLFKt7IPCI
SGl'r?IELMRIACT::Y:f.7IALGKVFFLGT~PLMIRELTLPpEEVEHEINRYYKAtl1~t~~ICADSLYCLVAAHY
NOIGKLINiGFFS~ICIL005t~8L8PL
HD'P Y~~t O
K3DIW1Lt70EVL(7~CLOEVSStLOANLEINKDPLLTFLVVFfttRlCDR10U1YVFSSVECAIMIFIRttIP04vF2 .Att01(GLPESFICVLEEHtIGT5VIR5AYY5NlIV~7PSTGSFDE4.
F:;
FttYSGNKPSSIIETT<'tINIADSFEAASRSLIQIASLPOLQRLID0II0GIti.00COFSCSPIT
LTIJR(TIP.~.'J'IDRVQOIHDh7IRVIGHLCCCNKSSLGE.iDONLIIFSEELTPS
H.KIEE
~
. :,~,hVlr/rr -, ~ .\~. ..p~..,..
. t fr~w AYIRI:FV.'.f.X:AA?:Irt'AtIISRAKSIPYLANISEEii410II1KRYNCKLVLII7GY
.
NAAN
~
r . . . ..
.
. : :~a::
.... ... , ;11...:..,1,,:: ;..;..,.,.I;:,~.:
.. ~~ ,..1,L; , -.
::.:.:.. :'1.f:l.l'f.iv::.Yi'.'I':.:Yfll!F'.
~RY1 :
' "1 :' "' . .
. .:U>_0041 n.
:I No robust homolo0 Pr~~c m Genebank/FltBI, . as of '1!7/91 .
iA
H~t'.~RWLLDIf;tlILEDUWALAKA.itnOCSIKVLIECVSOVSEIIkIIKKKWETIRTRFPItGH
KV;14CMIEFPSAVWNIEEiLPECDFLSIGTNDLVOYTLGISRISALPKHL1JVTLPPAViLKIl~ItVYLLVIIOEIf WL'lt4.HOPYYtutIL~Ni'IYIPGHTNKDSNKLEQK
RtiIHtNIAAANONQVPVSICCEAACOL.iLTPLFIGLCVGt3.SVANPVtNRLRNNIALLELVDFJtPFSLDCFSINF
LIFVSLVPIJ4:.LVRAYOIKKSLDRTIIIQIGYSPSTiCmiI~tEA
N:CLEITEALLQAKTCSEVEELLNRF44KITS
FVNCYCLICISIIl~LCILVPILlLW'LSLLLLGIL31LFSLnYFSIIDtItISIOtICI~ISN
.
CPt~0079 5II56 57967 AT
~T)79 hypochetiul protein KKKKEAICIMEOOFLf~tEASLLEItRY 'UGOA~GLVSWLH~LI~PT00050 66819 66199 resent In Genebank/DlBL as of 11/719A
lo h ISNGSGYA Q p ~ t ~~ ~F atno No robust VSWFPILCtFLAIOIYAKt~aiF~Fi~IVKANLGYLPSTNCKNALCRNSSIILTSSIKlIIGIL
GGCGILLPIP'LLLt~iItsISVLFQLL~G.!!'Rt.CCF71IRpSVSSDIIrINt<LLLLHNZLA
CPn_0040 55677 54318 dnaK-DNA Pol III Caeea and Tau IPYQASSRKYRPQTFREIt.CQSSWAVLIDJALVirNRAAttAYL!'SCIRCPn_0051 66797 67111 cL c in GenebankJENBL as of 11/7/98 p No robust homoloq 0resen AFnHStGYTCf CFAYLIARNIPRtIGMiETYINPGVLPSSNAQDVSRS'IIIYPSRSFiIOIPNtJ~BIFNRVPS
TGKT1'IaRILIKU.yCVHLSEOGEPCNOCFSCKEIASGSSLOVLtIDGASHRGIEDIRO
ItJTVLpTPVKAKFKIYIIDEVIOQ.TKEAFNALLKTLEEPPOFNKTF!'1i''TEINKIPCfIKSSEQiJ4XiNRIPL
KIWLORIPC1C1'ILEKISLMAGDDILIEASOIX.APIARAAOGSLRDAESLYDYVIS
I
"
RC
a a 0052 68008 6730d p CPn LFPKSLSPD'IVAQALCFASQGSLRTLON11ILORDY11TALGIYt'DFLtISGVAPVTFLtIDLr LFYRNLIyTHStTSKFS50YKTE0LLEIIDFLGESAtINtpN1'IFEQTFLETVIINIIRIY.-hvmC-POSphobilinopen Oeemlnase aRPVLSELISSI1ISRGFfGLRNIKEP1'LTQQVSAPQPOZ"~EOSPAAOCKIIKt4.SVrYSDPCLSDFCOQIRPLRI
ASRNSIiIJIICAQVNDCISLLRSInPKLI~ifQLSTi'CL'1' SVEVKSSASIKSAAVOTL:.QFAWEFSCItRQ
G~IPLHLVENSIfFFTOCVDALVH1~VCDLAIHSAKDLPE'1'PSLPVSIAITRCLNPAD
LLVYADNYVNEPLPLSPRtRSSSLRRSAVLICOLFP~00QILDIAGTIFfJtL00Lt0tDttYDA
00d1 SS8A8 57342 IVWIAJLSLRLJILttItAYSILPPPYNALOGSL1ITAKDNAGKwKpLtTPII~NSS
CPn _ No robust homoloQ present in Genebank/TC48L50 67986 as of 11/7/98 HSVCSISSRYKLRVLAITFLVLItr'VLLLISGALFLTLGIPGL?AGySF
P
CKYLYNMSYPP CFeL0057 697 Q
CCVLWSG1.L.~:.VfWEVBXVCPEIPAWPEtTPEDVPVTPFIUPAt~A
GLGIGLSAt . ~'~ Protein QKEpKTOKILDOLPGELDOLORYIOEAFACLGPLKDLKYtDGGTLO~~~KIRNATICIK'1'Ot~IDCGATAPt6iI~p CPGCt04i'INSLVEEYVPQARSCfSSRSS'1'SAW.S
DMIAEFVEi.G0ILC0ECRLLEFVINQTRYIGRIS.FKRt'~SLYKWEyI(l.YLPSGOVRCERSIa.H4ESRIFIW(A
GiDRIiL:OLWRGSLTLIIOGDPCICKSTLLIIOTAERIJLSWLYRVL
LrYREA
LK%SAAEWDRFMRT'tQ4IRItIN4TFDPNVYSVAKTATEICAFGitLETCVYESNRYV~SSTOrSLPAKRLitISSPL
IYLFPITNi.DMIKQOIATLEPDiLIIDSIQIIINPI' FIWVKeDQtIEIDDRIGNSQDISE
?CEYEKAIQ.LCDEEKSAtIAEOAFpDIKNRWBDNImit~PG~T~I~~IICttYlK9GEIAGPRVLaILVDlVLYFICN
RYEflIRITRARWYKVAENGLFNAIT0tV1(DSLRgOJEARV11FEKERSKIT40R~IKKJ~RSNANYRNIRSVRiRFG
PrNt<.LILSNHADGL1LEVSNPSGLFLQBJfTGP'i'iGSIIIIPIItii :.ROLItEGHDOF3.PRAGERLRELOALYPEIAVSYVFJ1RREYASDLEKAtfESIDKHYOSCVRSGALLIBIAALVSS
SPF11NPVRKT11GFDPNRFSLLLJ1VLEKRAQVKZ.F1?mVPLSI'~f.
~~Y
KIICP11110LGALLAVASSLYNRLLPNHSIVICEVGLGGEIRNVAtQ.FRRIK~IItJGFEG
AILPECQISSIPICCIRENFW.GGVKTIImAIRLLL
CPn_004Z 573d6 58112 No robust haaloloQ present in GeneWnk/EI4BL70089 69313 as of 11/7/98 EECEf00EAEFRENGTKIRSNEEYSEYLOQVI~IQLESCSKAL'1'!00"fFlli.CVItLtaKEEIECPti".0054 rttc-Ribotttieleattt III
SII4SDVVNRlE~ILCROIEDFILSRVEEIERIB.RNACLPLLPIKEiItTKAFWttt4&CK>ZIG.TLSFFPPIKIPN
SKFKDGAIiSNNPPIDITAIWILNF!!"IOPKLLEI11LTRPS~IO~iJ1 OSIJaOTIQRAYIIGSQKVSGLESEVRACREGLKDOVROFL~S'ZIaGIG
PYFKESPAYLTSSFRL ' V
. LLFP9IO~TtSTARASLYttAKAGCRYZ
TK VGIEDSERLEFIIrMViCLZVTDG1' C
DYLLIGKG6KIOSERGRLBAYANLFESILGAVYS.DGGLSPARKLTVPLLPPREEILPL18 pGVSLIKEEILt11TS1'FRTKFSYHSFRIJIVPCIOti.YLEYYODID~LERTRAAWNAFIStItYR
t~p~tlQplIJ<J~LVEAQAL~TEYWLYRZER1ISK>Ofitt7N110a.LpOF'IIQKOFRVLPVY05TAVT0AGCNVS
YQIQVLVNQ6ItYtG~I~Y158KKE710CI
....- nn., cem cn177 cPn..ooss 7oo9s 7os9o CT396 hypotMtical protein CIwICYLIRIRIOISALNLOHLRNFIWHaSILFE3JLLTIKDGFLLETKt.ONPIAKASRTID
TVIIMtF~TIFRSNPt:IYTWRKRRLOFFAAF1.VNRPKi.SLVRDLWV!'PCEEILEGEiOCTIL
PLLLSGDRAGSGIFFTGPYP&DLYELEIGCI'1'CLLLAFSSVCIPVI
CPtL..0056 70917 72746 sItCSA_Phosphoaanetosucaee EFLIQ.SiitRISLIIIEYEORIRSLYD11VTA~IICRWLStJDCIaQDNt'fIILWLD'tDPAOLE
pLfGA?LTFG1GCLRSIlIGIGTNRINLtTIRRTZ'OGLVOVLPANLPNPGOPNRWIIOCDT
Rt045IEFA08'fAlNiilt~lCICiVPLFOYPEPL1LVSF7YRYBRAIOCVNITJ18t87PPNYNC
YKVYNASGGpVLPPL00EIVAACSAVNEILSVPSIDtIPNINLIGKEYFaLYRDI'LIIOI4L
YPF)1NRISGRSLSISYSPLIECTCISLVPIIVLIIDWGFLSYNL.VOOD71IWGDFP'I'VOLPNP
CPn_0014 6078 60778 EDPEALTLCI0~4.ANDDDLFIATDPOADRVCVVCI.EOGOPYRFNC~MI~SLLADNILGJ1 f 11/7!98 No robust homolop present 1n Genabtulk/t?18LWSKTRHLGEN~(f.VILSLVTTEFtISAIAKtIYth~.INVG7tGFKYIGEKIFSWANS'1'NK
FVF
as o ' IAKSDCRVWIRiJiSAYKESGKVSSLETEACTYREYLREOWOFETOGVSLIKEELL!'LSS
TLKSKLSYDPLiANIPCFIKtYYCYYDDIOKARAOSRWLEKSERYRNAKRRE'OEIVKI~LFYCYFANKTES
GAEESYCCLYr;fItVF.DKDAIIASALIAFAAt.00Ki.OCKTLCDALLSLYC1 KEAIfPLIDIEEYRLT-0EERSNILEKRLIYMOfAVARpRVGEFESMEIPEWFSAKTDEQEIRKKLSNLEEISSM1FFSGKYOVEKPENYKGGIGF
NLiSItDSYALTLPK
.
TSIC.CYYFSOOGRVIIRPSCfEPKIKFYF~fSTHYPERVTDKEIOKQRFaFSFOH<.ODfI
CPn_0015 60961 62790 CT)45 hypothetical protein CKYTYHPPOLPPDHSVGATSWpPKLRILTITlLYLC'JLLLISGALFLTLGVPCLAAGL5FCPt~0057 7.91) CLGIGLSALOGVLW9CLLFFLIRRGVSKVRPEEIPV1'PSHEAGKIL~_QLPOELDOLDTSsodH-Superoxide Disnwcase lHn1 IDEWSCIGICLKDLKYEDpCLLTEVOLIQ.RVFDFVRKDtIV't'EFLELOOWAOEQOFLDYLILKRYWNSFVPYSLPE
LPYDYDALEPVISSEINILHHOKNNOIYINNtJrMLKRLOAAE
INQVOSISHKLFVPOVNIGAHLAEIGGYLPSGDVRVERLKRSAROWDRF?IRVTCDTRXV'PQpNtllEtIUPU'RfNC
OGHtNHSLFWETL11PLDQCGGOPPKHEIlSLIERF~JCI?ION
' AMAFDENJVCCVAKNAFDKAFCALEECVYKSLTESYREAFYEYEKAKILRNEDVE~R.OOKNGKLPLLL1IDIMfItAY
Y
fLKKLIEVAACVOGSCWAWIGFCPAKOELVLOATANpDPLEPL1 KSARAEORFR6VKWlWEDLKETVFiJVKENGCIDLE'JL?A40L11Pt)ItCPENLIPBIUIRNINfpYtINVRNTnLK
AFPCtItM~ICHItIJNFSEIISSK
NSHKLWFJ1THRFtKGAECTYSV.1RVAFEKDGSR)WQKY.F~EKTKEWLRCLKDLHDOECNRA
0058 77eZ7 74562 CPn RERLAELEALYPEVSVSWETERETKFKLCtAYCNLEERYGStMCDQEDYWKEE4?IKEAE_ ' IceOACCOA Carloxytasv/Transferase Beta tAEFEYTILSDAAN
IRWLVRLI''SYDKPKIKVpKIKADCFSCWLKCNtICNENINANEIGOHYNCCPKCSYNYRiT
FREKGTKVRSPEEWEYLDiLFIJGaEDCSKQLTIAE'~IVhGIIELEA
RLKVtGEDTEDILPRVEEIEINLRIAELPFLPIKOAPI'KAFLpYNSCKDRLAt(VEPYCQEAIERVKLLADKDSWRPL
YTDLKSQDPLEFIDI'DTYANRLEKARKNtI'ESELVIVCICTIC
SVDYKarFRV
IJIPVAGAVNDFNFNAGa'NGAWC89G.TP.LIEFJRCfRLPVI
.~Pn 004e e2775 6)261 tNKTSAAi.AKLNFJVGLPYISVLTNPT~VI'ASFMLGOIIIAEPKALICFAGPRVIIAtW
' ' ' N" robust hnlmlv4t presanc Cn CenebanY./F11BLOCKSKAPRDLSKR
.~s of 11!7/98 l Il~TLLDYFLApfY1 ERf'Q.;LNpOI4NVY0COKATGLF~~~EVSAYRDHLREOITEFEZ'OfiLDVIKEELLFVBSTLLKEIFLLTDOSE
iC$KL..':lDf LiAOIPCNKFYEYYDCIDKARVQ$RWLEY~aERYRKAKKOFQA4LICECLFIfE
4'7 7.1562 75050 ' t>pALY.KALYRt.LREKRFNKF.KLLtCNKIEM(7t,~RVtrEF:P.:Dw:Pn_0 'fut-,111TP Nu,:le,'r t,lohy,irolacr:
t.l ;7 0 )bS~J IKHIfI'A::l.'NIkII
IC'NAIIXIVFCELD:fX3ELPfITI'PI:AN:AOLRANIEEPIALt.i't~RA
1N4'1 'I
, LIPTr:IxJIEtPEr:YEWVRMt:a7:(.ALKINaTVItI:Pt:I'ID.r.DYRi:EiRVILINPC:D'1'FI
.
n ~
JEMBL ns; ~C ll/7/9H
:C tmn.,l"u nw :,!nr tn ~
tunr en w tm y IEPKNFIAOWL:.I'~1i'A1'FWKQE.:LhTARG..IY:Ft:lrl'r:ll;
.
.
.
, m~
:
I:tIF'ItIJ!VTISra~F'ItlVL.t''.J:ILTHYIIFQKIRFfI'LT'I'rJ:F/LNK~WtKhYEL.WFYYt::;C
l'EC
KVY.IIJ!'::::IIF:WI.
~.Pn_Ot,SO 751104 'l5 53H
.'tn 11114a .. ",an ,:SpOI GcaN-VI's I IA Pr.e.rin V'ItF' u:: ,:rncuem~~t hytxrttuti,:.tlFKLPEC/EVLVILEI'AKMI::YtxrFtOqLF::LF::LL:PRLVNFt.GKIICRDCIWUI.Tn t.VDA
IM t.r.rrtin ' ' HKFJkItI::YNttALIIKI.::IKJWVNYFt.YTFlI.'X:::F'IVAIFTFAWf.KVL1'1.'I'EIKN:EISRISl Dt:L139111IDc:
ACIIi.EGY.OAFFDAL~RRf7itlCTR%IC%M'."JAIMK:KLECC:aIFFIAIC:III
I:PAf?41)F::I:.W::NIKFYKHtNIt::E11Ft1KWIII:rL::f~:::l.I::KF7:tlADEKrOYYIPKKJ1A0 ALVRL'JFLIC:GFG4AQAEYLKL4'.TLTL:L.RftECRRqrJt.LUVNTtELIMNVFVt71 1~Lt..'ftIFVU::::I'iK,'LKDLt' IY FPLL.tIKBKKTLE l1 i I: I::NlIr:HV
LAS\:FC:IILK IFLIOE7J
u:flv IIW.I 75iu1 762DN EATYAtORKANKKPtCJtE
:r$'ttFlYTt:!rCK,~,L.W1LIEINRNNC'L'PATJINLGAS
pcsN~PT:: IIA Protein ~ NTII tsNl-BamJnq Oamain UtWUIOPItPYCF:xIPOCC:W
.:.:YL7t11t0'ta'CD:LARADCCLHTt.;'C'I'L9pIKKLPORt It::HOCtt.".:DVKNGLKLDEVA:iLLOVCQRVLOWLK~AIPSY5IlIlICfRF3RECI~ft.L t '~
IItpALHLOERt:EOKEALKDGiLKYSLYKAiNROLYt.CDVWNSKZ~J~4YASKYiAQKFQ
LDE..~VLFEHL:'rIRENLN.."fCLCFZIALPHAKDFLIHAYYDIWPNFt.AEPIEYG7It.OCKP CPn 0073 87153 9757.1 6'GILFFLFAL'ODK~HLNLVNKIVHIGHSLN71RSFFKNt~OL'L'AwKintA-IntCiatlon Factor IF-SNAKKEDTLVLDGKYEELLPGfIHfRV .LENChIPIrfAHLCCKHRHSNIRLi.IC~VT~15 .:Pn_Q057 76251 77690 A~~~R sf4iir~'~' ~:;.,...,:; ,:;-w~l.~,..,:.~:rt~°:.I:;~~.::::: °:--n::r:r:n:::::..:::..:~.r:~.'.. .,r .:... : :. ..,;.._ fWKKPQKt:;iERiIOAKKEPRARKB'fLVPSSRTL'alRat7KMItNSSRIINEISANST CutA-tlunSiecaun ~dctor 'tu PRSVKLRRHKRAEOKMKOGESAPSN<~TLKS~KL'PS1LOKTSIHEREKA?SRtVNfSCL
EDFEHSKET/QRNrIPNINIGTIGHVt#IGKT:'LTAAITAnLSCOGIJISFRDYSSItI~PLE
SSARKRYCTPSSMP;LFLETEIVMWERTKC1QDNEIHIPWQWfNPKLQNI'KZTKQ
MRC1TIN118HVEYEfPNKNYJWVOCPCNADYV14~8ti1G11J101mGilILWSJfI'DGRIIPOT
LASOASIOQSEGTEOSLREI.iICCASLPVLVPSNPEYSYORCKEC3.KCL'VAERIlOCtOIKS
KENILL71R'04GVPYZWE'LN~ISQEDJIF3.a'DLVpIELSELL66~'YI~CPIIIIGi7IL
VROALFJIRSLTKIfVARGGSVTSTLRYDPWU1EIKSRIfNCKVSPtCAREOIDfSSCKRpIIM
K7lLt~DANItIGIVRELHOAVDDNIP?PEREIDKPFLNPIEDVFSI$GwGTWTGRI01GI
NCKOOKTfPSEDASOEEGOTCJ1GLVRKTPKSQVJISIUONFYIWSKH'tNIDSYLTANOIfSC
VKVSDKVOLETIVZCVEtQRKELPEGRJ1GB~RrCLLLRGICKNtaril~lRNCOP
SSEE'fDWPCSSCVSKRRTIOrSISVCTHwt~lIl4lZVCALIIIIWITESdiTSDPTPPIPTP
NBVKPNTKTKSAVYVLOKC~rGRHKPFFSCYRPQFFFRTTLNfCV41'LPt~TfJI~IPCDN
V~V<SLICTVALEEGIWFAIREGCR'tIG7IGTISKiNA
CPn_0063 78109 78267 No robust homoloq presanc in Genebank/!?0L ss of 11/7/98 CPn_0075 91087 91350 pHYANCKrWCLCLYDFSRHRSPPCLPLTFTPPYSFTI~IFLGRCLSTSNIVLL suet-PreDSOCSin cranslocase gRgWpIptapNNRKAtSRKIGTVKKW1KFAGSFLDEIKKIEWVSKHDUOIYIKWLISIFG
CPn_0064 78310 78576 FGFAIYFVDLVLRXSITCLDCITiFLFG
No robust homolog presort in Gensbrnk/EI~L as ot~ 11/7198 LVM'KIOCSApYYRSRPAERAOTPPQPFLARDRRADFiiEAItPRFSJVC~VtJ.t~VU'~L'AL~ CPeL0076 LFLFVIdLPLAAGSYLLAF nueG-Transeriptional Antltesmlnati0a pplCg11K7~I1frMYWOVFTJIpEKKVKKALEDFKESSCIffDFIOEIILPIFJ~MlIVIOK:EH
KyyltJlyIWpGyLLVl40!<.TDESWLYVKSTAGIVEFLOOGVPVAISEDCVRSILTDI~1( S(fWCIfHOFfVGSRVKINDCVFVNFIOtVSEVFtIDKGRLSVMISIFGREl'RYDDLeFWCY
BEVAPGOESE
CPe1_0077 91956 92135 r111-L11 Rlbosowl Protein FP'ygypLFygygQCKVRFSHSVbMXIIKI4IPODKANPAPPIGPJ1LGIYACVNIIGICIc EFTtMLLPVVITVYADKTFTFiTKQPPVSSLIKKTLNLESDSKIPNWiKYCKL
'tpApNEAIAEDKIIXI~IVLf.ESAttANY00TARSlCIDVE
CPr1-0078 92157 93160 rll-L1 Ribosomal Protein SCRIlITKNGKRIRGILKHYDFSKSYSLREAIDILKQCPPVR!'DOTVWSIKLCI1IPIOtBD
GOtRGAVPLPNCiGKTLKILVFASGftKVKE7IViJYGADFMCSI~LVEKiKSQiLEFWAVA
CPn_0066 80916 82655 TPZIIHEVGKGC11VLGPRNLNPfPRTCrVI'tDVIIKAISELRKCKIEF1010R11GVOiWOG
No robust homolosi present in Genebank/Er~LKLSPESSDIKENIPJ1LSS71LIlUlfPPAAICGQYLVSFTI5S1'MGPGISIDfItQJNS
as of 11/7/98 CVYHANR'fQSRPPSPEISICELELOELM:SSNZ'LTISNI'PPPSCMTAEEVSLFILOGRR
NSEDEECP~EVYDVVCITNOGDPESIRDll6VRVN1(INGSCRTaHECILDAIBIiCDZ.PG
a1DYIYLt'8GN 0079 93170 93688 CPrI
EPVRFINNS4'YGLRSGFLCIRNRIPPRIxJVISDAIQA~FFIFA11~111-Yt'GOWITLLiSIRGA?AVlfl~ri' r110-L10 Ribosamel Protein CCLYLQVAGOILSIYSrtItILCVGIGSSYYIOCiIYAVtOiYR~08t0E~LIi.OEY~SAA~'FILLRYLRITMYSRE
IPNSUID11SIICFM.1001IF
LPYADSAEGLFLPSVttCPSYQWALACGEpCLIHtIf~pQVOFRPODSSStIALVVtVLDFNKpHKDBLVFU1DKN~LI
SLS
VSMIO
STWIRLIEWIDRGDSOAVLEfiiPGPSt~RDIJ1LTALYAT'tRISSLiID~L~L~~Vp ' FMLE~~~~DP
GALYEAYAKLPSLKG.RCOWGLFMPHSQWGINNSVLSGVIBCYDQKJIGIQI
RR
FV'1'IrAIVIIGYSIM1'LRYFILLLTNRPOCRRHFRVLRi.MLGL.OStGFLTVLLDttZM~
' V!1 VNRRPPLISVIFCTASFATGSFIYVDLTRNIfTSLRSRI.OL1WRRL11GRGLPLNAV!CPe10080 93720 FHPLIIGFINOLVIQVPAVVIRPN1TAVY~OTSOE~
LITFttO~d r . r17-L7/L12 Ribotamal Protein HLDSLRF~
I
FFFVPSVRIfHI.IDrRPIa VRtIfKVITLBLETLV~ILBNLTVLELSOLKKLLEFJtiIDVTASAPWAVMOOODV1PVM
GDVtAIGO,hIH!'Ii.QIILLVIN1 BPPEFAV'tLmVPJI~DCICVLKWRL~ICLIILJCFJUtE4lt'ODLPIfNKEKTSKSt111m~11I1C
CP1L0067 87910 81053 KI~IIGNIASFI(GL
No robwc haeolop Dresent in Genebsnk/ENBL
as of 11/7/98 ~YSYPDPPNAVEGRVNSSOALNpt7C0Nt~G8IGGLLRCRILSIWAVITFIJILIrV
t CP1L4081 91=19 98016 W
FW rpoe-~ ~lY~rase 8eca LIALTIJLSILTSYPYL7~I,GVFLLIVTIGCiIFAI~C.SEItIKRVPPfPIStd~EIIA
0VLLf'1mN
RSH
FREILBt~ItSRRTRl4JfCPERVSVIGCKEDIPDLPNLIEI01K$YItQFLOIQUJI~tI
KNIDNEKEKEDPEtIFGRTATDIPNRSALOQFNHSCNItIHEBPALTBTY
L
CPYTLPOYfSEEEVLIRSVVGSYLLiEIICVPKYSH<.iDEUNKLK81S6RCCLTIDXKTCLtEYFREIFPIKSYNGTV
LBYLSYtAfGYPKYSPEECIRRGITYSVTLKYRFRLTDQCIG
NDFFTPCHS
Ui R
p IKEEIVYNGTIPtJtfDKr.TFIING71ERVWSOVNR$PGINF6pEKNE100NILFSPRIIPY
~
ORIUSFLlI'OKDLATFFLAYTRVNOGNWPFRJIGAItWILItfYVRLR
TfiDttCDGFLE
CYYARLAFNDTQRLYHOLFNVEKLRSIYAPImKDPLCNPWAPIPIYDLLItpG>RiLEiIIFDINDLIYINIOWtKRRA
KILJ1ITFIRALGYSSDADIIEtCFFTfGttSt~SE
pQE~tEYPSRMQDQFWG
KDFALLVCRILADNIIDEIISSLVYri%71G1~'t'AI4.IDb1U711GI1LSVKiAYD11081a11I
I
104.AXaP'fD'nFJIALtDFYRRLR~EPATWNARSTIIDtLPFDPIOtYtit.CRYGRYKi,~K
CPn_0068 81909 LCFSIDDEALSQtIfLRKEOVIGALKYLIRLIWDDEK7~~CVDDI1111L~RRVR84CELICNQ
CT360 hypothetical Drotein CR9Q.iIRNEKIVRERHNLFDFSSD?LTPCKW5J110LSL.11&VLKDFFGRSDLSOfIDCTNPV
SF11IKKFFIYSLIFSCSFSAPLNGICNEDVSSpSAIIEDPEVLITOLNELILTPIEOGKEI
0AISDGOICSSEEIEESCGTSDSEGLSEKTOKESSNEYVLDFFDSNWRLEGISKHAELTNKRRtSAiGPGGLNRERJ1C
PEVROVtDISNYGRICPIETPECPNICLITSLSSPJ11C
IRNt3 YCTfVIIYJIGE
' ' ' . fEP
COSCQVAGIIDCFNREFDIRNRELELIDIRELEWrrnt.c~SIU7NMKONSRELAFQRADVEEECVIAOII.SASLDEY
ISV
fDEIEYItI
NEPGFIETPYRIVROGIV
AVPLLKT1'JIIIISIC
R
Q
AFEACfSTVTIrIONSPKpLVSTtrZCLIPFLEHDDANRAUIGStJIIp 1'GLflCMAKDSCAIWAEEDGWDPVDGIfKWVMKfDiPTIKATYNUfKFt.RSNSCtCIN
OQPLCAITKCOVI710GPATDRGEL.ALCKNVfNAFNPWYGYNP~JlIiI88KLIRCD
~
CPn_0069 85191 87086 .
AYTSIYIEEFELTARD1'KtGKEEITRDIPNVSDEVLrINLCEDGIIRIGAKVKPGDILVCK
No robust homoloq present in Genebank/EMHLKSDtle as of 11/7/98 F
RK
R
LNFLYVYLLIFNtGIHTTPPPSRSSSPPPYDWILpDLCMT>NtJSSRATPPPPEIIGCELPS
D
Ia ITPKSk'tEUIPEERU.RJIIiGEKAADVKDJ1SLTYPPGT~bWImVKV
LWEKAPMIINRRTACIWIImGLID
A
PYFSJISNiyVIERGAPSLPSPQpLLSLPEYSROPPP.r.YFDETJISITSRTSE~CfLYSTL
LVEFJ1VHLKt%~DKGYKHQVATLKTEYREKIIG
WU~IPNCDt'IGVLtCGL4SDYETAf.~IL6IN1rKT6VlJIIR~OIIIDLDItt OETTERIEGEIX
LCCPANSERDWEDNEVNCIYIAS1'SD'tOLEAVp(xMtIITtLAGEPVRVLYt:TGNLYAFAR, ' GVIAOPKWVASKRKtAVCD101AGRHGNK411VSKIVPRIIONPItt~NCISIVONIIIIPIL~VP
QDI
SRNMAGVLETHLDYAAK'fAGIYVIffPVFEDFPEQRIWDINIOpCLPBDDKSFLYDCIITG
ENTCNSRLEVSH7YRAKrYPYIDRFFSPN4MViCRRFLVFYpCIKiCAYVOAALDSSNN1 'IVLGLSPTVYIRCNIrNVQHYRVRDFWPSCLDSLMGItM'SVLPYCtSSDGIFYPSLFSNERFCNKWICYI1MLKLSH
LIADKIHARSICPYSLV1COPLCCKApIIODORFC0~R1AL
TFDMAIRYCERCLLVCSECMGNLPETCpOTSPLTSLEOGfIEtrALVUIPQpNPEALSLASRQEIL'fVIt5DON5CRT
RIYESIVIIrENLLRSCTPEBFNVLiKAlIxLCLDVR
EAYGVAWIL
I11HEERCCRLESNY?IPGRSSNPFM'fSNYVLVrtINfLIIQIYLHSPYYSFQSNDIVCLIFIS.
.~.MVETV~YLFLTVTDSTCCRRYLRVPRLVCTCLRNLALPiTLLCLLILSYPRSIrLCVPFPNWDA
tJVrtFIIG'fHf.'ITRWFFAWNLILIIWPFIICLRIIGIpLFVNRSI
IfSITtGARITDLTLASHR 0082 97992 102221 I CPrt YAIVFPSIVC~LLTAIJWANiNIt.ALDPYRLIESGDLRRPAPNODEM00~rPWDJIYS_ rpoC-RM Polymarase Bate' (:LVINTCtYMLILFANLiFINYSVRRYNRSRR
CSSYGRRRLKNDVLEKINFCENSRDtCVISKECLFDKLEICIASDITIRDKW&CGKIKKP
.
~Pn_Utl'la 87399 8720A
ETINYRTfICP~OCLFCEKIFCPTKDYIECCCCK'fKKIKHKCtIACDRCGVMLSKVIIRER
NAHICLAVPIVHtwFFKTTPSRtCNVLCN'tA.~.DLERVTYYECYVIIttM~GKT04T100~iJJ
th robust ntsnoloq present in Cenebank/EHBLDAQYREWEKM'.KtHIPVAKMOCPJ1IY0LLK.iEDLQSLL1C0LKERLAKTKSCOMP4ttJlKR
as of 11/7/99 ' LLISFRDTCLKR LKL
IOGFYSSSMIFCWNVLKNtPVVFPDLRPL'/PLOODRFATSDINDLYRRVtNMBIRLK
'fKyrLFNLKNONFFSNOSRTYEORFPKVSPHPESILP.tQSVGFSSOG1 riLYI AIGRtXTPEYIVRHOtRMLOEAVDALPDNDRH
:HPVN17AGNRPLKSLSOXJODKIKIRFRQ
t1U71 N8U~6 8759?
NLiCKRVDY!aCRSIIiVCPELKFNCC~.PKEMALELFEPPIIKRLKOOCiiVYTLRSAKIM
:Fn _ IWKiAPEVWONLEEIIKf;HPVI.WPAM'LF1RIJ:IpAPF.PVLt0t71fAIRINPLVCMFNAD
-f 125 1'y(xxhcc ical protein ' ' IK::LR::ILEPIf FL4IIARt:LKKDNKIIEELFPEPFUYDNLYLKfIIENS:iSRL1/1FOKKRNLf1 F
FGtDl7lIAVIIVPL.aVEA0LFJ1KVUIMf'I~IrFI.P:.'xKPVALP.'.7falfllSLYYIIIApP
'V::IH.YLYEVYQDGILFFFTYTKAtJt.~.fIA.~.LFTI:W.~uCE1'P.STIL1'CKPIFPEFJKY:KTKtFKDE
tEVUtAUJNrr:FIJ717VPt:LPRDL~Ic'.tlc:flllIlP.KIKVRIDr7pIIEIT
YF
NrVVtx . Pt7RVLFNR t VPKEta:F(XJY.~,l11 . ..rP t..EL t t//:YKKVr:1.E11TV1tPLfMfLhDUSP
FYJI<LTh ll_:('~:RIJ4(x:E::LYNRNKO tQATKA
f AVpYLXPP(,T ' ' JAIVKYt,YDDtlttTECERIt:KTI~IWflY:f00luD
ft~Y
At ~Nr:LIrOllrttf'DIK::11ILKDA
'In~ inn/ Ntl5l Na1157 ALriEt::KC~SKNNPLFt?rID:Y:AW7NK:aII.Y~JIt:AI.RIaJIANPNr:AIIF~.PIT.'OJFRE
T:~A hyrrmlirric.~l prntwin ~'LTVLE'l.~.taaK:ARKCLADTAt.KTAD:Y:YI:fIRLVDVApDVItTF.KIX\iTLNIItEI511I
' 'P::YIKEKYI.ILI'fCLLFYFFIfYRIt.TPL.::rf:LCI'L'DDWPOEI,FCDRL%3SRI~rf~:Dlt::f1.IJ
W:a:DVIH:;1/VAF.AIDI'.ltaKTIKLR3 iw:Yh::l'Y tXlt::%EEI.LI'LKpRIY':RTVAK!I
' . f:~
::IE:YIX:NA:Y:IXa'IV::Pr'I::ALVALTDLKLVPYNGtI.iFIihl1'fRLKNAVEKII,LFIpNI~IK'fLT
i:K.':r'Rr7Vl:AKI'Yv:GNLAN:hI.I~:hI~:P:AII~::
tllAtk:lt:EPlfittl.TNRTF111lx:IM
IIItIYALTLTIWLIT/17ILIIGV'/F~IPTATCLDKENKHRNVNSWNL:.Tr'EIITN:;tXSlLYI7ttKJlWIIY
~E~:rINLVInYY~JIIJIWr:DNf:RTWKTKKUl7fK.iIE
IILLfII ' AWALtI
VI' . KW
.
::LKVrPVC4:VKtl.VAIY.TPV:aJYrhI~:FYI:1.rltrtl'Itr:MtfnFIKYKIH.VrJ:I:.TB
.
talc:lUJId~FII:fRUILLA'rNIASI::ALLYAVP57A'/r:LViI:FSIfxIQI:,INfVYCARU.:D
WO 00/27994 PCTNS99i26923 NKHIS:LVELIWONRGfLtIMIAIYDOADL.iEL HIIrMEVI::LG:!' JATPSGAII'llEEf:QRVDPClD.LA
RLPPGA IKTYD ITOCLPRVAELVEMKPEDMDLAK
IDCWOttI(G IOKNKRTLWCDEFff CPn_00t.1 .:~ ~ r~trr~k~~'~"v CCVRELOKYLVNEt/OCVYR
~71EECNLIPLTKNLtVpRCOS'VIKCOOLTDGLVVPNEILET...
IIVIIpNLQKVRL1'DPfiDTI'LLPGEDVHKKLFYCFNRRTCEDGGKPAOAv.\1S-ValYl C1U~IA
fiYnllflC.1 ..
SROIIKFIiLRIt~f'fEDFPIUlNF(;L'fEPL'i'IFWEKNr,7lFKAEA.iaOKPPrSVIN
~fFt f.pCVDINDKHI Yl VPVLLGI'fKA:uLGTESFTrAASF'OD~~'TDAAf'CSt~t~t'' 'I~FK~I~I~ PPPNYI'CVLHNGHALVNTLODVGVPYKRN.'Y:FEt:.'a f Ir:TL'Itlk% IATQAWIRNLOASEG
' ?ETHKRIKOYLEKEODLVFDf VSETECVC :
KRR'IDYSREDFLKNINAWIIEKSEKV':Lat:LAC:G:
~~.JWdIKRIr'1'IIEPLANRAVKfIAFK
FC~ICYtYRCfILVNrDPVWIAt~DE1'EfEEI(LVA'ILYYIR~RMVC"~E.iIWATTRPE
t ~pn X091 102:96 103312 .
t .r,..... . .-cnY,..:~f.".~ ~~' .. . . ....,......'.v.~Vt!
..
. .,.
....
.y.
.
~
.....~... -. F
', :..r.w , .s :. ,.
'~,.. .. ' r'K PKSr '.Y~F'f. :.i ~:' .,.... ::1:
' . ;e:~.
1 ;~~"--rnr, ~
~
4'IEPYL:iKHWFVe'v.::.rm;nu:ia:F.:aL:K:FiK4YVKV'i'u,iw'47iNLRi.ii "
.
. Vs:V::YRsur .
CISROLNMGNAIWWYNIO~CDERYL.'.'C'OI:ErIPEEVACDPDSWYODPDVLD'I~IFSBGWP
.
..
: :1F .
'ELWEAWWGIAONGODLCTLSFLLOKTOVNFALElIKNIPCRISLELDARL1FNVEAM
VOMVFL50LFEAMOGOKKRLLVKIPfsIWECIRAVEFLEAKGI~ftLIFNLVOAIAAALTCLGNPDENSFDLK1IFYPT
ALLY'CCNDILFFWVt'PNVLI.CSSM&GEKPFSMIJIOi.IP
' OIMAASFIt l'OISYKRYHflEGEWSYISGKEKLAYOMCFrILPDL11VAKNCKLSKBKONVIDtLBIIIATYtt:
KAK11TLISPFIGRIYDwMIMYGDF7CYSIDADP41lA5VSNIYAYYKKFGIF1 TKEpVLALAGCDLLTISPK1.LDELKKSONPVKXZLDPAEAKKLOVCPIEL?ESFFRFIl~TIDAWL?I~~IDt'D~~~
F~~tIFGNISDirOGKDLLaGIDCD
EDANATKLAEGIRIFJIGCtOILETAITEFIXOIAAEGASLGfmFYILOGFNOLIHOLEEAYATYAFDKVATL1YEFFR
NDLCSTYICIIKPTLKiKQ
' ~~0~ I~t~t ITESLFLR ODI'hGAi'P~DCDAFI
PYTPODLRESFTLJIOItLVYTIRNIRGEMQLDPRLHL1UMICS
CPn_OOB1 MLPSRAC1IG
predicted fersedoxin 0'1'TCIOSCIPIIpALIGGLESIOLLDIIEPEKGLYSFCYVD1'IRwIFVPEfJILLKCmRGE
SEMKMO'lalfKSOLVFSCPCCCK(TIVCFSVFNLOVIL1'CNVCSSTY'1'FDSVIHNEIROFYAKEAVRLERAVEZi LCRL.LGDEStCOKAHPNLWAKOEALKMJitIELw7GILmG~ISFA
:.CKRIHDANSI'w"NATVSVSVEDN011DIPFOLLFSRFPVVIlILSLDOK1CIAIRFLFD11LN
TSII~sQESDLIS CPn_0095 115956 118790 0085 104512 103'166 pknD-S/T Protein Kinase CPr ACIVCLDRCOORSLERYDTVRIICIC~1CEVY'.aYDPtt~.SRKVALKKIRCP~NPLLKR
_ RFLREARIAADLINP'GWPVY'I'IYSEKDPVYYI2lPYIDGYTLtCrLLKSVNOKESLBKELA
c~311 hypothetical protein .
EKTSVCAFLSIFHKICCTIIYVNSRGIWRDLKPDFIILLCLFSCAVIL.D~.AaVACGCEC
FSMKPFILFILIVAOFPAFSAOPATOVSASHSKpAKARRTSRIRSSMTNASVSRYKTRA
AARKKIGKFflOIPSLSPVOWVRYSCKNYSICfP5L4FOCIDL:K1'QLPEKLDVLLICKGKGNDLLDIDVSIIEEVLS
SRM'IPGRIVC1'PDYMAPERLIGNPJ1SKSTDIYALGVVLYOhLTLS
LTP'fINIAOEITSKSSKEYIEEILAYNKJWEMTLESGIFTOIOSPSCCFTIIK?ESNFPYItR%!~%KIVLDOQRIPS
POEYAFYRCIPP
CRVfCL0A1'lYImiTAYIF'"STATLDDYALSFI'FLKWSSFOIRGGKGTSCDAILEKJ1WNRIQ.AVDPOCItYSSV
TC.1~DI
ESNLI~SPKIiI'L'ITALPPKKSSSWKtirE?ILLS'RIi.~,VSPASIiY$LAISNIESFSil1 LEAt.ONf?1K RLSYTLSKKCWEOFGILLP'ISENAIA7GDFYQGYCiFNW
IKERTLSVSLVIDiSLEZORCs 8 lOS5Z7 ODLFSDKLTFLIAL~00ISLSLIYOGTi~'ILIlOI~tYLPSRSGAIIAIiVRDI~DILEDZCI
SFPCRItOGYCARFRAGITVLCKAS
R
A
CPn_0086 10489 I
E
FESSGSLRVSCLAVPDAFLAmtLYDRALVLYR
acPE-ATP Synthese Subunic E
TDNN~~T11IJ1ZECPSIOi~GSfAIIPLEYLGKALVYORLOEYHEEI1CSLZi.A
NINANLNADG%LKOICDALALDTLKPAEDSAAALfJiNAICEOAKRTI0EA0Et~IRKIT'fTAHESFYIfRDRiJIWF
lLILVLEIAFOAITPCOEEKILVWLKDKSRATLFC
DNVVYRT
EEWItpKIKOGEVAT.S0AG1(PALE71LICOAVFNICIFAESLVEWLEEIV'i'1'DPEYST1G.IOAL.
~ IFRIX
LLDPNLCLIISSIQ~L.FLSYWSCYIPfR3iSLFHRAWDOSDVMLIEIFYVACDIJODAFL
'IOALFaOGVSGN1LTAYIfD0IV5PRAVNELIGKAV'1'1'IC.RIGfSWOGSFVOGYOLKVESSCIDIFKESLEflO
KATE6IVEFSFBWf:AIT.FAIOSITNKCDACIIIFVSNDOLBPILLV
wVt.D<SSSALLEIFTAYL.OKDP'Ra4IlOGS
YIFDLFANRALLE.ROGEAI!'QAL.DLIRSKVPF3IFYItDYLRNHEIRANWCRNFJIALSTIF
ENYT~OL>tDEOY~t'~IOCAF~1AI(OHFDVCRF~RIFPASLLARiIYNAIrGLP
CPn_0087 105510 106376 KDALSYOERRLLLRQKFLYFHCLGt~OtDERDLCQTHYNLLTEEFOL
CT309 hypothetical protein SHCKIFSIFXVWKIOYYFLSSFLPTOLPESVPLFSISDLDDLLYWLSENDLCNYGLLKCPrL0096 134347 AFFDFENFAFFwAGKPIPFSFGEV1'pENVFIl4SSQOi~ISOI~fFKD~IKS50DCT'396 hypocheeieal Protein RWNFSDLFREFLSYttOTNSSKFLODYfRFO00LRWLAGTRARVL~SY~~SCfFLSILRCTfMCSLPVYVSCIKVRNLK
IIVSIHPNSEEtVLLTGVSQSGKSSIAFDTLYA
TLNRALiILYOFHKLECFCSDSYF ' ' DPWLM~IOKDSPNYELPEEFSDIOCVL~YGLLPH GJSN
NCSTI
.nwrsvrcrt.srrlllTITfLPNPKVECItIGLSPTIAIK~7IIFSHYSNA
DL'NyI;,ARCATYMFAIRNSL.ASVCKGACIINHIC%AIlDr1 CPn_0088 106351 10A14S
CT288 hypocMtieal Drocein SYRIAiIOIDtIVSDGTAOGtIVIEAYCRiII.RVRFDCYVROGIYAYVMfDNTNLKAF:YICVAD
OEYINiDYFC>yIpCACRGALVT!'SGHLLF~IF~GPGLLOGIFDGLOFIRL111FU1EDSSfIARG
KNVNAISDNNLNMTPVASVGDI'LRRGDL.IGIY?~RITIIKINVPFSCFOEVTT~~TSE
RWPIKpAFIflGflCIPAHKIMDIit".LrtILDIbIPVL
KOGTFCTPGPFGaGIITVLOHHLSKYAAVDIVIICACGEPAGEVVCVLQEIPHLIDPH1GK
SLNHSI'CIICNTSSlIPVAARESSIYiaVTIAEYYROIr..LDILLLADSTSRWAQA1RCISG
RL6EIpCEGfPAYLRSAIAAFY~QAIT1'I~GSEGSLTICGAVSPA~?1FECPVTOST
L71Vi1CAFCGLSKARADARRYPSIDPLISWSKYL~KNGOTLEEKVSC~GAVI0fAA0tLEII
CSEICKPMCWGEEGVSMEOMEIYLRACLYDFC'1ii00NAPDPVDCYC~ItI.FSLIS
RIFOAKINFDSPDI1ARSFFLELpSKIKTLlICLIIFLSBEYttESKEVIVRLLEKTMV~'IA
CPn_0089 1D8111 109466 CT289 hypothetical protein LDL'WIOLON1IKWRKI7MOTIYTKITDIKGNLITVEAEG71RLGEL1TTTRSDCRSSYASVLAF
DLKIM'LQVPGCrTSGLSTGDHVTFIGRPMEYI'!'GSSLLCARLNCIGKPIDNEGECFGEPI
EIA?PTtIJPVCRIVPRSNVR'1'NIPNIDVFNCLVKSOKIPIFSSSGENNNJILLIOtIAAQTD
ADIWIO~LTF1IOYSFFVECSKKiRFADI~MFIIiKIIVDAPVECVLVPDMALACAEKF
AVEEKKNVLVLLTOMI'AFADALttEISITMDOIPANRGYFGSLYSDLALRYEKAVEIADGC
STTLITVTTMPSDDITHPVPOH1GYITDGOlYLRDEJRIDPFCSLSRLKOLVICKVTREDH
~DLANALIRLYADSRKATCAMAMG!'KLSNNDKKLL71FSELFETRWSLEVNIPLEEALDT
~IIfILAOSPCSCEVCIItAOLINKYWPKACLSK
FKKQAELLIAKGTI1'F'SDLWIDSHPIASSORSDISTYFUiAPSUIAeTwsL7Vwawu~a SSMFSIT~'KOGOCSDCOGLGYOt'1TDRAFYALEKAFCPTCSGFRIOPLAOEVLY~IOtFG
CPn_009t1 109439 110080 ELLNTPTETV11LRFPFIKKIOKPLKALLDIGLGYLPIGOKLSSLSVSEKTAt.KT'AYFLYO
acPD-ATP Synthase Subunic D
TPETPTLFLIDELFSSLDPIKKOHLPEKLRSLTNSGHSVIYIDNDVKLLKSADYLICIGP
P?LKLK1SALLOAEVQNaV%ZMAECDKDYV
FRLE
TYL
' ' IO~IS GSGKOGGKLLFSCSPKDIYASKDSLLKIfYICNEELDS
.Q
.
l KOKLAR
VLAKSMSYpVKL
OAYERIYAFAELFSIPIGTDCVEKSFEIOSIDNDPENTACVt~IPIVREVTLFPASYSLiG
TPtWLDTMLSASKELWKKVNAEVSKCRLKILEEELMVSIRVNLFEKXLIPEZTKILKK
~ 12459 126006 tAVFLSDRSITCriICQVItMAKKKIELRKARGDECV
CPn_009 PYk-Pyruvace Kinale DSMITRTKIICTIGPATNSPF?ILAKLLDACFl~fVAAWFSNCSHETNGOAICFLK6LRE0K
CPn_0091 110071 112053 RVPf.AINLDTKGPELRLCNIPOPISV~Cp%LRLVSSDItX7:aA0DGV&LYPKCIPPPVPC
acpl-ATP Synchase Subunit I
CADVLIDDGYINAVWSSEADSLCLEFMNSGLLK".allKSt.:allKS'VOVALPPNFEKDIADLK
'JRWIHKYLFtGRHKADFFSASRELGWEFISKKCFITTEOGHAFVECLKVPDHLEAEYSFCVEONNDWAaSFVRYGEDI
ETMNKC1.ADLCWPKMPILAKIENRLCVENFSKIAKLaOG
:.EALEFVKDESVSVEDIVSEVLTLNKEIKCLLETVKALRKEIVRVKPLGAFSSSEIAELSIMIARGOLCIELSWEYPN
IQKIOUIKV.iRETGHPCVTATQMLESIIIRNVLpIMCYSDIA
RKTGLiLRFf'YRTHKDNEDLEEDSPNVFYLSTAYNFDYYLVLGVVI1LPRDRYTEIEAPASNAIYOGSSAVhILSGET
ASGANPVAAVY.IMRSVTLETEINIL.iHDSft.KLD~NFnAI4VSPY
VNELOYDWdLOREIRNRSDRLCDLYAYRREVLAGLCNYONEORLH<1AKF:CCEDLFDGKVLSAICLAGIOTAERADAK
ALIVYTEw.~3PMFLJKYRPKFPLIAVTPSTSVYYRLAtiiiG
FAVAfiNLVDRIKELOSLCNRYQIYMERVPVDPDETIPTYLENKDVGMICEDLVOIYDTPVYPMLTOCSDRAVWRHQAC
IYGtEOGIL.:NYDRTLYLSR(:ACMEC'ttaILTLTLVNDILTG
AYSDKDPS'IWVFFAFVLFFSMIVNDwGIfCLLFWSSLLFSNKFRRKNKFSKNLSRMLKMT
AIII~f~ICWCTRTSFFGMSFSKTSVFREYSMTMVIJIi.KKAEYYLOfatPKAYKELINEYSEFPE!
PSLKAIRDPKAFLLATEICSACIESRYWYDKFIDNIWELALFICWHL.iLGMLRYLRY
17194 !huU
RYA:TGWILFMISAYLYVPIYL.CtVSLIHYLFHVPYEt.CCOICYYGMFCCIGWWLAMIrpn 0099 sc hrnrolnq pt'asenc m, r:..n.:rarlnk/t118L
v:: m tt/7J9R
r OA.~.WR~VEELI:VtOVP30VL5YLRIYALGLACA!?fCATFNQMGaItLPMLIGSIVILLGHNa ro m IK:KKFHOtKRTILFJvPLYYLV:7faIlLr:Rlrl'I'R::FI:fi:U:Kt':Ff:FhAFYII::DYRKTAL
t .~JtltIL3tMrJSVIIK:LRWFIEWYHYSFDCCGRPLRPLRKIVC3EDAfALCIHLtINNStV.
TNLALAFPEATFOERYKI ARO::L01IL: ITLLEI.LAIlXIVA':NtINILITIVT::.~.RIIpIOCFS
::FF.V t.~.tfEDt.EETFKNtAEKIY:L
t LF! :1 NoV hlHt.PFI:I ITKNY I',':
t AFAKA I KNORl::K
:1 ayrp r1 l .'.121 l t=57 3 NIFALIEVFIG:KtVI'PKN:;~IFII:Yr:KI:P:tth:f~AtlJl:":Y'1'YM.P't:::PAFT1'FS
.'r.f.N-ATf. ::ynrhasP ::rtbunit fALLr1'IY'1't:PtVIAVNV:Ut~AK~FFYf..::AK1.YANK::1.1.MYF:::VAtIJMIIlMttFLEKCTA
K
~.A
:HAVrA::AfDFI7Ir:KLtf?1 C
A
'txlAf:V\
' .
::UIRrrWIMIHKRNIIIRKI::NVIKKK'lP'Y.:1111.VI"/I~N::::IIF~.:YI~YAIJII:I.'P:x:ITWL
AL
:P
:
LVtGt.AMI(:::AI0.
l1.Y.f:AHt.Y::MIDM::W
I
"J:'t:fOAYA
:1'M~Y:K( r dLLV::
X
~
' t . r :NAII ILF.fLA?t~FPE'l..L It~I.ItNDpL
. I:.AI.I TN "/f ;\ 11~ lnl.Tf INI
: AJI II r YKIIFHY'1'rl'X AVY
MIKNGTLiPV(xIAU:L::
:A
lt:l ItlILIJII
Mf:::X3::I
b:::::a'lr:Kt:Vn\A ( : f Vf'.::F::LFAW::KHYLFY:iLDllt'OAPf.KN::IJt IFY::YIILY.ItKF:IrYNFKVY::Yr:LItl:lYh FALLL.L .
':fv 44rt 11:440 lliUlS
r:rn on'r'1 t:n'.~n :vrr'. I~.~.gLS
r-r:nt hyl.al,.:rtr:.r1 protein Flr. e.dm::r Iw~lnrJ l'trr.:.ml IIAI~V n . n:'rr'Lmk/h'HItI. .n:: ..I
I ll/'//'rN
:KLVFtiLTVI
RYYY
' .
YYwA;:YIf.KI::HFMKIIAf'F:IttMIlJ.::'tl'Tf~rl'9'M't'LLJLKVIfFJth::TfN(JIItMI~IK
.
:
:AI:FKf.ML.DI.NItYMf:::VMDRLGL.IILFIICLLLF1.
:YA::W.
~.RMRKCt.f'VT
vHDE'JYt:WVWISd::LP
:
TEYVr:t'EYaAAA
' :
:
L
:IEO
' .
'1'IAVFAY.::f'AD'IVAI'FALU::1:1.::17Jlil'VLI:.A::NIIIYrJ::IK111KF1Y.TKF' . F
:
O
:
:
It:f.UX:::F:1 :
IL
F
TV:iARl1 ' ' .7:NOE .
AVALC
t:lliN'PI~:FX:KVEKf:I'YL:VNQ::At:IAVYf:LY.f:LEYYEL~:r::f::\
LK'C!'FJGLIAEELMnINV: v!'::F:'."~!'.'KF4FKNI:.:KKV':::K!IKEN:KAL:iELPNN
:
.
ALOKLIOEEIIrVLTIDOPE:n..iJ:00t::'.:PIr'.'DP:Y:\R:::.4.F~J::.DCCLREPLIVE
~.Pn_OlDt1 l3'Jl9S LI788Z
~IARELVNKIN'llIRRNOG:~tQORtALRI,I~EAWIR.~,F.,LOY,I,X.k~,~~rbl 'T'J1l Isypornecac.il DracHm SDFOCQ'81DINQtRIOiLC:.~.'r:rtDO
I'vfILIJrI(75VTITRTLItIVP~
FAf' ' ' a JVULG ' vLFIRNIiPRK
a'fpKKTFILG~.LEINIKFLr SWIXIID
EWISAAMrIF
"
. CPn 0110 :4:'55 14192 JpOLAPSM.
:'JDLHPDpI'VL.c:LOK:.CfGNKKVSLTT1CllIO~II
VNPK
' KPICSPPIICIfEYID LepB-S(9na1 Pepc(daa t KHlILVSVOHEINIRKDIHSVDANDI!'VRLTOY'JTEOTLLTI:
YIiJpKVSGPKEYINALKEOGLELTFNWKLSFEELENNRIAOGSHOEIIFPTPKWIUIILSYPSIIfOtOHYSLJ4K:a PH:LR~I"fKLLK3KKLAHSPADKKCI"ELLEOLEGIFtNDOE
L
fPFFNrFlIDfIJDFOADFLRt.LFLKAEI:TPINLNf.PVFLFFPVtFI0tf8IPLEYSLDPVPP.
ry' ;e...., s zw..'.FAIS'\'\FI:rl~F'rt~S:.:'~Je'!~:~IRP'K
-...
\.
...,...'.r . ..~J._.;- --r-!.i.I: .
_ . . .. r -: .
' . . .. :.v.~.TKSFr::.tn ::
- :.-~
' : -.1 "
':Ir: :: ;::
' ::
., .
' , I ' '.I:F....F:i:T'.afl' ~..;~ 'NF\iLF:.~ 1..~...: W.:l,:i:i.:~I'P....
.\''1. ::ai r .~
:.:Ir ::':........ ;'. Nil. . ...t:.l~KHYiKRI:ML ~ '' "' .
TKTKETTKLYKKEW
GQKTIIDPKOFNOSYGAL:=iy:'ISIIYCOFFOHKFSMQDEPNKLKDPHLSPVSYA<>L!'~Ki NYAlNRILTEHQARTS fG.rplKVYt'EICIiTANLSYpKPLLRNY1~L5PAI0!!8t 0101 129996 117141 TLLPLRKEJILHLIRM4G.'""f' .FIVAOGCAYKYlIOPItIHfSGIAKAYAILLPKVII>4CYClf Cpn _ $KGGYOIGFGEIRYKLILi.:MPLTOLNDKOVIELFNCCINF$SIYNPVV1t11Pts011PLi~YA
ybbP tamilY hYPothecacal Protein ' f~TM~t'I' FTIICIiNL.YI1~SPVFItOtGP'fL.OKI~I'S.~Of%SSE'IOPYIAlYOKGLPPCt>EKT~VE
?S'I'IG!(fgOY'ITOCPSKTNPFDITYYTtPLLEIILIWVMNYt.LK't ' ' fOE
FINHFGIOVPKGtfVLVLv:rITfPNSADSREItGPVPNENL.GSPLCTfS~IPIGWC~.'DCVSA
FID
pFLFLf~IIJIDKLHLPIIRRII2iMllIIAAIWFTIFOPEIRLIILSRIRPtIGKICF
OFVOpLAA5IY0ISER0ICALWLENKDSFDEYLSFSSVKINAT!'SEELLETIFEPSSPLPCTLSGYLVSCIAIJ1TGL
SLICYVYYOKRRRLFPKKEEKNItKK
HDC7IVIWtG0IIJ1YARWLPWiDTfOLSRSttCfPNMAtJGA501IS0ALIITVSEDiCSV
SLSRDGLLTRGVKIDRFKAVLRSIISPKEHKIIIIPLFSWIYBLR
CPn_0111 114761 113934 Ct031 hypothetical Protein ~Pn_Di02 130099 131166 Ot-0NRYPTNPNDSSTYFER-L~OKYLiKK00KTLF..FLFLSfLFSTAFSC:LFASQ!'S$LRT
cydA-Cycochrome Oxidase Subunic IO~I~'S~~~'~P~IEI THFPCIAHKERP$LEOAS~IT
I
FYIOFNKFHDJ1LILSRIOFGLFITFHYLFVPGSNGLSh?Il.VINECLYLV?I~OTYKON7VIIIOLESPSOVFW$LS
SEGSOFFSLIffRTKSLEPVGKSTTVPAFLOIFDLPLSPAPANV
iWJCIFALTFVW'WfCIMOIFSFGSNNANFSEY1GNIF'CCLL35DGVFAFFLESGFLGII1CTIDQIENKPWSPKVSF
EGAPLTSISVNAWOGLWPKDRCPL.S>:fGI:J4Y!'fOPDISVFIL
:.LFGIUiKVSKKNHFFSTCMVAf GAF91SAF~1IICANSWN01'PSGYEMVIOiKOKLIPALTSFNVSIETPKGTSIVR1WDIGHCATSPYVYSLPDSK'It Q
' ' fICAVIVL
WG1VFSP1'fIDRFINAVI:CtWLSGVFLVISVSAYYLWIOtAIOIETAKOIaOtIG
:'L0I1tS71Wt'ARCVAKNOPAKLAAF'ECTFitTEEYTPIWAFGYVOMEKERVIGLPIPGU.S
CPe1 FLVRItIIIKTPVTGLDOTPRDEWPNVOAVFOLYHL3It4.WCVNVALTLISNSAYIfGk1Rw11i.~
VI 9atH-IPetllZl Glu cRNA Cln Amidotransferase t8 Subunitl KPPFLVILTFSVLLPEIC74ECGNCAAMGROPWV~CLLKT'KZ~V$P~SLDSDIGVVIHK>0~71PEYRQVLFtfDSST
CYIfFIICGSTYOSEKTVPEGKEYPV~GYVSVS$S
FSLVFIALL'ILFI'M,CKKIKHGPEEENDLTEFEVKSHPfPTGSKK1VDAEGRVDKFLKRYSINROPAOOPOPEEDAL
PAA10CIOLKWTKRKIt CPn_0103 131465 132511 CPti cydB-Cytochr~le Oxidase Subuaic .
II PtrA-Peptide Chain ReleasieW factor KAKEDROtRILLNSIG IRF-11 NACIf7IELSLTSLLPLAWYVTLLIIAVFAYSFGDGFDLCLCAVYLGP'MD'DIVAEYIiJRLAEVEIKISNPEIFSNS
KEYSALSKENSYL.t.EL10iJ1YDKILIIftkYi.
FSV$
PVWaZiEVWLVIIVGGLFAGFPACYATLLSIFYMPIWILVLLYIFt~SLEFRSKSADOR011LAIEKDPElNVl4.EEG
INENKVCLEItIliKILESLLVPPDPODDtiNI!>:LRAGT
'rJICIFWDIIFICSGTAISFFLGTIVIirILILCI.PLSPtYfSYASLSyIILFFIIPYAALCG11WAIU1ALFVGOL
~IRNYHLYAS$10(rldIfYT:YISASESDLXGYKEYYl4ISG1'f~IIKRLLAYFa "oAFAItiGSCFALI9fTSt>railiARIAOQFPYILSSFLVFM.FIJGASLISIPICtFOAFPfYPGGS
C'fHRVORVpETET00RVSTSAtTIAVLPCPSEECfELLINEKDLCII7TFR71SG71040fIVt0 ..:.ILLIALTSCCCVAAKTSVSKKRYGYAFIYSTIiiLLSLILSAATLTPPNTLLSTVDPOYVT06AVRiTNLPtGVW
fCODERSOfiKNKDKJIIOtILItJIRIRDADIQKRIDiFJISAIWEiIQV
3Y1'IYNSAVZ:TKTLl(S1:LIIVLTGLPFIITY1'CYIYRVFRGKTNPPSIYGSf~SEAIRTYNESONRVfONRICL
TLYNLDKVNOGDLDPITTANV$NAYNOLLIaGi CPS~O1D4 133884 132676 CPlL0114 146371 117261 CTD17 hypothetical protean hamK-AfG sPKitit methylase EIC5~0IStR.ti.At.CfAINSPAIYAJ1DSOSVSFPEQLPSSt7CEIKGMNRl4rtLAPNTVMPTTSYSlR2IKKAI
O1T'AYLDYYOVPLSOCEAT.YII~07LtE1ISSRA10.FI1LVOISlT
OGTIIREFSKGDLYAVIGFS%DYYVISAPPCITGYVFRStVL.ONWOGEQVNVRLEPSTSYRICRIJILIiGORCPTAY
i1JG71VSFIGLRLRVDSRVLIPRTLTELIJ1EYIIbYLLiNB
lu a APVLVRLSRG1'QIOPASOEPtIGKWLWLPSOCVFYVAKlilVANl06PIELYTOR~CI.
.
t EIOTFYDICCCSGCLGLrII>aCSCPINEWLSDVCPOAVAVANBtIJIKS1(OLWKILIG~S
AIAOLINSAWFAHIELEK5Il9EIDLE71IYXKINLVOSEEF!(aVPCIOGLIOKALEEIODAAPYTRPADAFIR:NPP
YLSFNEIINIDPEVRCYEPWKALVOG51CLCFYQAIApdfltlV!
YLSXSLPSONf$IAS$OCSTPIIVSSSIVTTSLLSRNIAKCrAIJffAPLTQCREIG.EYSLFS'fGVIRiLEICSSOG
ESIKNIFSKtIQIYCRLIIQI>L9GRDRIFFLfI~GRDWS>i~Y$
RIWASLtQOGI~HSE11LT0EAFYRAE0K10(OVL71GVLEVYPtIVV10~8(PGDYLLIWOENTIA' FLYCISINLDOW<GKRVTVECLPRPIRMFAFPAYYWGI1IFJ1SCPiL0115 117779 118632 CPiL0105 1318$3 134039 Cfh-Signal RecoOnicitm Particle OTPase IMNVKDFISRVImCIL
V
ll CT016 hypoehetical protein iW
ALJILit MINSLSOKLSSIFSPLVSSRRINEC'ITSESIRE
A0%11TJ1i11aJ1 YVPFRItFSNpNPl2LIYCKIO~ItiiQwPOTAKIRFTPKIAIOMCTNDOLICIPpFISIUtwGELIf001VSP000FI
RCLRIi~.VAFLSt7CREEFTIOKTPSIILf~CGL0II
' SOI71FIESOEGatKDOGTLALfILIOCKIISIPNLDOSIIDIAFOENLLYt~fSO~SAAVOOLKILVAQTKAEFYOSO
ENKPIMIWK71T.~YJI
DYVI10~1K1tAKKVI'WPCtILIOtP
iLD
Rt7DDKLGVGYII4~iVL00ITKGNDIOVLPKM.TSPLFS17TNPIFJ1ILON1'PCNKdlPDAP?NGN1NVILD?AGR
WIt7NELItdELTAI0KVS0ANERLrV!lHAIICQDVLIITV0111001 R$FDPO&IAiRIIGIGO'1'I
~fDGIIARAGAVFSIKHVfGKPIKFDCCCERIO~
tMlJOIADVIRVLSG4NIltLLPRPEPIICOICRVM4EEDTLAVSD~t.TFRIWDIN.
LT4IlIL>i V'fAAf'fYEI7YYKQNKAFMfII3PIJIKLL0IlPOI~AKP
GIOC
EDAEt QSCDKLYIVTNPWPSOOFSVYLCPPIGC'fCGEPNCEHIKJ1VLYT.
.
NIVK~tREYISEE
pITLCDVNOPItIOpIIS
~~ ~
CPn_0106 135073 136371 g ~
phoH-ATPaee EIIVRTOIOIKI2NIGCSVFIYDPEALFSFENTRIIIPFPVIEELF~1FGKFRDESAIOJASRACPn~0116 11!592 148971 eyAKTKVTpGyyLP5GS1LLRIEVApLSNDDRRGKLLTLELLXIIAIaIEPNVFrsl6-S16 Ribosomal Protein .
LSNIRLr~
.
EICJ11RR1C$VALKIRLROOGRRMiVhtALVLADVESPADCKYIELLGWYDPHSSINYOLKS
VTKSLGRRVRAFJILQIESRDYESKRFSFRSLYRGFRELQVSOEDIm~IFYlOdCYLI%.PLDV' VSSPNEYFFIISJIGENHFJ1LGRYYVSECKIIJILKAfmKSVWCIKPGNT00RCALDLLLRDDOREE
EAIFYWLERGAOLSSKAF~LVKOOAfGVYSALISKQEARKLVYR!(KRRIIYItORR$1 VKLtIfLIGOAGSGKTTL71L.AAAIfiIINFDKE1YHKVLVSRPTVPMORDIGFLPGLIIEDKIa!AAIIDiITK
HwIpPTYDNMEYLFSIt~IpNL-ItSSFJILOALNDAKKL6NGLTYIRCRSLPKAFII
' CPn fGNNFNTA _ LTPHEIK'fIISAAGKGTKIVLTGDPt'OtDSLYFDENSNCLTYWGKFttHLJILcrmD-tRiVJ1 I9isanane N-11-Nechylcraneferase TERSEIJVAAAATIL
'IiGMfIDILSLFPGYfOCPiw.'"ISIIGMIKORLLDVOLTNLRDFGLGKWIfQVlM7fPP$OCG
NLIt4AEPYTSAIRSVRIIFSISKYIYLSPOCA1.LTAEK&RELAAASHLILLOf~IYOCIDAIA
CPn _ IESEVDEEISIGDYVLTNGGTAALVLIDAVSRFIPGVLGNQESAERDSLENCLL~POYT
~f05B hypothetical procein _ KKSPPPVTPKEIPfQPKPPIPORPEVSPTPTDHIVPGSIEASPILCKKPSPDSlIVSPLSL
FHKMLLENWTPVEEPFPWPPAEKNOKIFAWALNOSKLIFVSTSCTIIAOPRLVTDSNSIIMITNRDHFKCDKISSNLEV
NKLKRA10IFYCKVFCLDAtISCENKFCLPItEOKTTIwLR6V0AE
VNAANRTNSRDCAC'INOVLSAAVSVDSWGLSORPLNPEROGTPLNOCECPAGMWPNAOGSKKNIVTLSLSLOCACEEC
Fs:YLLARWELFCGKLLiKQADIaIiAVWALAQDLOCNAWIFSWH
NHTGKQf;KPNYLA44LGPKAVDHNNKSpAAFDRC10JAYLNCFSLAQTIGV'IFLOIPLISSRIIK
.'.IYAPPf3dRKKPNSEENKVRMRWIHAVKCALVAAIpEICNEPf.M'DRRNLI'JLTDLKTPA
ITOPKKIL7HL CPn_0118 r119-Lt9 Ribosaalal Protein 010R 177857 137303 KKEH!'RNYIMMLLKELF~
EsQCRNDLPEFHVCCIIRLATKISEOCKERIIOtIFOCfVWIRR
~Pn _ ~ENSLNRVAYGECNEKSFLI~1SPRTVSIEIVKRGKVARARLYYLRCIfI~KAAKVK
Ct'018 KNLFNYIG1ILNSIFNEEVFIISHRHTPIGQTSTALRIfIPLVNPLFIRTNLOAIASYIPIFSEFVCPRSSKK
TFIGIKTLKGIS3LOYSNVLIrfCNFSSVCKTLPCPEIYEELP1NRKEANLEIfGIKALIY
1511si4 9 15~52D
LVL,~VIKIIKLIVRYLCPCCRPPEPREPONPLTPTPLDNGOOIDAIFS1'PTSPT6FKDPF.
CPn_Oll LDDLLOEDKKKAPNL rrihe-Aibofsuclease HII
IMNf.itSEIORPLS?ttAFEKELVSEDFSWAGIDEACIIGPLAGPWASACILPRCKV!'PG
~:~ 0109 138e46 (11783 VNDSKKLSPKQRAQVItDAIdIOOPEVCFI:IGVISVERIL>QVNILEATKGNIQJ1ISSLPTS
ilaS-Isoleucyl-cRNA Syncnecaer PDILLVDCLYLPHDIPCKKIIO~ONISASIAAASILAKEHRDDLNLOLHRLYPEYGFDRH
' RQKIffADEVf:Y113PAKKEEOVGKFWKDNOtFEK.iIANROCKTLYSFYOCPPFATGLPHYGHAKSP.iPIKONCAI
V
KGYriT.~.GNEAIRRY(:p: A
IILLA.:TIKDWr:RYATNOCYYVPRRFfWr DCHGVPVEYEVEK.iLSLTAPGAIED!'GIASFN 1'1125 111779 %
EECRKtVPRYVHFIdEYYINRLGAWVDFS:"ISrK171DJ1S6?IE.fVIIWVFOSLYN'~LVYl7C'fKn ill3D
CI
'NFF.'TAIJ;'fFLONFEJia1\NYKEVDDPCLWRNpLONLL:ASLLtMM'TPkffLP.."lBIAfAVqmk-J.llt' Kin.sur ~ LKLFTIwAPN:Vr:KTTLVRMLEQEPSSAP
' EI:LF:IIKNfJI:YL'fWKtU'~::PF.~.pOIN)ICCI' fEFPFTPf .
DS EFORI.I.DROALLfIIVFLG~f:pCYGT.iMLEIERIW
:I:TLY'l'JRtQIW.K:X:EOWtLSQcxI/OAWF::NPEEFVTLE;:F~KDLVr:RTFRECEVtt:KCYHlV::HF
~ 'CI::V'ITAY
A!
.FTEEIP .
Y.rtEE~\FNVtNL:FVF.E.:OLTCVV111MPAf~CEC:DFLVCKENH1/PLVCPVDA1N:..
' .
'.~EELERRLA
SRt::EFr~.ORKERLFJISL
IIAVAVtDIC~Iw\LPIF::RNP.~.V::IFIAPP
::II:Y
JAVEKTK .
y'ftfYJlIt:HADKEIIKFLKKECRIFYIfCNKIIAYPFt3rATUfCLIYKAVN~iJF.
.
' ' ~ N(HLIsNJ: B: IIfYNpElI fUHa :Rff:K4lLlI:AAISYIAAYRVLK:: t f t Al:f7lAtt I f.
I::RNAYWt.TP t p fYiK::AU:E ILWr'w~I filJlAAlK7F VN I INDDIIw I
ItRLEEL'D:l! ~ t'tU I IIHI IP I
OL'LN I VK0.:XPFIIR I PYVFDCWFD:x:ANPYAr~t iHYPFENOK
I:'fEEAFPADFtAfi:LOCt'rR(~IFYTGTVI::AIC.FORPAFIINAIVfK:ItLAFII:IIKNSKRLN1.'I~r_ Ulal i'i7n:n I'.11.'if NYI".:l'Y.'NLIIfYr:AIMt.RLYt.WC:WIfKAEDLitF.~uGKC:IFYNLKQILt.PLTIIJL:iFFNTYt.'l tlil hYt~rlurt u:.nl srnr..in ' f7lINlKKIN<F'ITII:KLNKI.t".':'.f'F::LVtYIAIKVAKIY
' fAKf:INIt::::NVAIl.'fL.VLLDRErt JTFIDDL .
fN
fTf:I:IV'JTA::ITIJf:Ai:H::l7rrN::lYKftl'::AY'IW::D'/N
Al:I.Y:FDfK:7JDIl:PAYTEtLtWIIL::Nf.Y::VVt:KVHF-':M:aJYIILNNAVEPFI'Y
' t .KVIApFVPFtAI:DIYOY.LKt.EKEPFS .
wYtl.W'NHItlWI:AEDTCMiI4VIF::TLYCVLTVFiJ
VIIIfhf'I! NFrir~K l LPIILEKRMIIDI
AI:IVCtt:II::LRKEIIKI*VAOPI.ANf"f~JJ:3KDAL:i 7$
ALLKNPOf:I~ I KGLKrjFL/: _ ~:.~
_ "f'n; l 1.:.: ' 5!0:7 1 : J7.'.
f mntr,Nt)fntmrYl-CRNA ::YntnrtC.tao....
::ALPYANGPIJIFrfItAGVfLPADYIARFRRLLGODVL'lI~~3DEFCIAit.'Pn 01?~ . ..1.i.61S84 tt nLsG56l , p~8.~
.Dr~rsMtn..lVi..l7MediWflitIBL arG.
IY~ .S
ac ttowo'IuIa ':K1MPOKVL:T .
TLNADRBCU:yOEYVONYHKWKOTFEKLGFALDF!'SRTTNPPHAELVODFYSOLKASGLNo rotm NIEFAPVPHTSYTADR:EDRNACRIpIKLSTLAITSLCYL.ISS~fCIN:O:;.:ISCIVGTY
tFl~KtISEDLYfQEORFtJIDRYVFLRCPRCGFDNARCDCCOSCGADYEAIDLIGPKSKISS
ALVACVfFLYFFYFSSEEPKGASSOEFRFLFIPAWSJ1LRSYEYISODA
FWfitIFSVL
'VELVKKETEILiYFLLDRNKDALLSFIOr'rLYLPDNVRKFWDYTOfVR.iRAITRDLSWGI.
A
INDVIKL9fNOt.~':ILiSLLDPEAPFLEPPYFNSLIVNNSNKEADRLSRGIFLI:.1GEI'ISiK
PVPDPPCKVF'fVWFDAPIJYLi~'NEWAAiOCNPDE~IIfRFNLEDGVLfWQFTrKONLPFHDCETKILPWLKDPNTT
PDGfVfKLLKDNFDLKDFKKRIA'IWIRKJ1YPEIRLPKKNCLDKS:
~NFP1INCLl:OYLD.fKK'JDALYV.'.EFYLLDCROESKSflGN'Nd~KFI-SSYSLDKLRWL. ... "~, fe ,FFPV?11T: a.~...rJf"r"r....P1P1P?~.'!.'!1~R:~F
. -~Y.:....t....
o , ... . E . , . ,.
W.\EY.!1115~~ .. D..:~. . . .
.., ~ 4h:rl!NI ..
. .
' M
. : . .
. 157349 166561 ..y y .!
" .i:RY~\I
' .
r,~~r . ~:'; I?K'': Al4J~:..' '/ALWFl4I::FK5LE?ICNLI7fhfYMGL.iIKEEtLDVINEE
' lPI IPEiA CPn~OIJ3 :.F~:AI:yCpKLLrli.i2 FNLKSPRLLF'TfYE C11LP5 ttYDOCl~lcal protein NRIJflOV1'VCRVSIRTSCIKIRN
if NSSAYNPKLLIOrLfLIrPCCIVGYfWNRitC5IVE0 0123 155775 15377.1 ICItMP4ISERFPYAACIEYADVRtSSISNLLTKOLEISfT.IIiICJINPTIFPYDSNC!1KT
CPn _ NWSLVW!~POK~'P~IDRAPVLIRRCLFLNfRLYGLRANNKDIPNLSVPSLFJNS
recD-EJCOdeoxyribonuelease v tAlpha Subunicl NSNEKICfiYLEOILVEt4KDSGDZTAYIKIPNKT'fPILIKCKLPOPLELGSPIOIYGVWSNNTSSAKE<'P!(LSFJ
II'PSLL'fGAL6ESLYNLNLPCDIIKPLS00ANKNFYSSYPQFODRW
WLDITP DINTPC1'P'fEEIICFIRCLPFH
.iPSM'KYFOIHSYDSPLLYEYRGVFNYL?SKLIKGIGPKI11BKIIEKFOE1(Ti.
"NLSCVSGISE:RCVSTCKOLCEOKILR1CTLL!'LDEYNIPINYQVRIFKIfYOEKSIEKIC
r . . _ _. -. . .
EDPfLLARENECIGFIfPADFIAIOfLGSfPRNSESRLCAGIONSLEELO~YPI1Q.LI
a/VAKLtlJQwFD'tPITLEEID'fpILP810KRKLWI0DI5~1'LHViFfRYWLAEIITIVSD
JIRZLPSSRRIRSIDGEKJ1TAWVEF?1LSIDLAEOORGIKACFSEKLT..I:GCPGTCKST
:'.'QAILKIFEQVrIfIII:LAAP1'GKAAKRNTEITOKHSVTIHAiS.05fOFKTKSFRKNNONP
tDCDLIIVDESGHNDTHGLHNFLKALPDY1?LVFICDINOLPSVCPGNILKDLITSNKMT
'JLRWKIPROVNDSCIVfNJINRVNt7GELPILYSETCRI~tDFLFFOLmDOtF~IiJI!IINLVT
KFVPOKYN1YPODIOVLAPNKKCTfGIYNWKALIUiALNPKKANLNCRfOSYAVGDKVNQ
IRNNYNKEVENCDICYVS:INFEDKAVWRNEG1QIVGYSFSELDDLVLAYATSVNKYOGS
' ESPCIIIPINTSHFNMLYANLLYTAITRGKKLVILVCI'KKAlAIATPfB'tRV0NAC1 =?r>_0124 156575 158068 Genebenk/E!>8L as of 11/7/98 i n No robust tJOmolog present 169N8 169143 IRSKORTVAITLLVLGILLIASGIIFWVAIPGLSSAVALGLGCGMI'AILTVLLTIGLVL
LIRSEKLALEOVEIKOAR'fR~LDOLSOYVFYTEIiVLDNt.KFW&YR~~FVR~OE',;:
EI CPtL0135 proES-LORDS Glapesonin TNLEODIEEIFL:~..RDIRNALDNEEPFNTNAKOCLAOVGFSLP'ODASIDEFIMJWLShIS00ATfLRIKPLCDRIL
VKREEEE71TAR~IILPDTAKKKODMEIILVfGlGKItTODCf ROHLDINDPRWSMITKICVIICIINRIIYVSTNYKQIK9JPDISDfGQLR~4.L~!1ITIEELLPFEVWQOIliiIDII
YACOEITIDDEEYVILOSSEINAVLK
VLYOSFOKGYNRAALLSFJ<TAIINTSSLLIBaEKDEDIDILNIRfIfCASRL.aIFRxPRTLFL
P
GLSEFFnVVIDFTDASG4DCSKLPAKEVPLI)GGKKKLNFKRTFADCQVGDWDRTTSLCPA-0136 171119 OEEDPLDRLImQVEOFATSVLKDODRYWKEIETSFaKFRSLPREOS0IDSIlIItDL
DDHLSVW11NOLSAAEDALIEV:'DVOBH~tRF~iL104I00GLELIEDAVKATLPRVDFIOELpepP-OliQOpepcidase !!i'f>~%TF~PKFICIiDTIflfIfANREEWKKDPDLCSSI~PSPItIPEF
KCVPSI
LEKEELPLVAARMSLENS , SPSIfYQII7NPESLLEIi.BKItFSVOIKLDDLYIYANLINDODITNP00ESDY0SIVYGYTI.
FSOEISWIOPALIALSEEKYAAI3SSSVC~PYRPnF~IIEALSPEfI~'A~KIL~FA
CPn_0125 158072 158605 ALNVSNKAFSSLBDAEIPPGIAKt~NCEatPLSNALASLYhOSPDO'AY
f 11/7/98 No r~usc tromolog present in Genebank/EtIaLYDYR?IfPANLfi' rif'INOAHLFEAKARNYPSCLFJ15LFONNIPrNIfINLINCll00lTSLIN
as o VDLVCICSLLPIr171YV6IL1~L
KISSCAEINSEYKPLFLI~fDSFDIJ1TORFQIIL.It~G.QEOAEIYNEYEI0C111RWNEIKEO~
KDPVIQtCIEDPPARGL417L><rT'f-''TrRDFttD%AKALTSl2IECPCIGtYIfSIN0E1nOR0.
tE RYPNLKIIIaIld.KtPHFYDVYAPISQ1TSIWYBYEE
D
ISlAIWVDRYLf'8IN1WSGIlYSSCCY0SJ1PYILIiiYltFfLYDVSVIJWIJION9O15rFSAP~I
O OPyNpFlpypypyAEIA81'FNDSd~IEAtSRSDO8KED1IIVI
ROERL.QKNAHtYRDCKOVLEAVQVEQKDNISSRWVDDSYtEEAfEEOKVDNRIOITKTLDI'IFATLFRQ'!!TA
AFCYEINSAAmGTPLTEEFGSATYGM4>fEPYOCVVTSDSL.SALEiOUIIPHFYYNFWY
CPtt~0126 158806 161085 pYATCIIAALSFAEKILTDEFGALELYLKFLKSGRSDFPI~IILKKSGLDMI'fSAPLatAF
/98 AFITK1CIDLLSSLLS~
No robust hoalolog prssenc in Gsnebank/EHBL
ae of 11/
~LLLpKIOPK
V
LLLL
L
. 0137 17=263 171502 LLVPSYYCNCLFFFSGAISSCCLLVSLGVGIGLS1LCCPt1 APDLLDLEDASERLRVKASASLJLSL.PKEI~LGRYIRSAANDIaTfIK'LDiPNKDORLVZTV
SRKLERLiIAA0N1MISELCEISEILEEEEIOILILAQESL6I,lIGKSLFSTFLDIIESFWLS~
il NLSEVRPYLAVNDPRLLEITEESWEVtfSHFINVfSAFIOtAQILPXNNHtSPl90~LEb~fOY
YbDI-11CR tas L4&NLETLLSSKIFODYOPNGL-0VGDPO'fPVKIfIAVAVTADLETIK011VJ1I16 ELLFTFIYKSLKRSYRELCCLSEIQBIIINDNPLFPWV000pKYANAIDIEFGEIARCLEEFVC31CNAD
ANVLIVNNOIINKfiItPYPi1'CNIMIRIOLLIEtCfIOLIAYNLPI.DIINPTIGNlBiRIfALDI.
EKTFFNLDEECAISYMOCWDFLNFSIONKXSRV~t01fIS3'ACIAL.K17RARTIfAKVLLEEtiIIWImLKPPGSSL
PYLGVOGSFSPIDIDSFIDLLSOYYQAPLKCSAROGP91l1fiSAfILISO
PTCGG1CIIE.OOIIQRAFEROSOEFYTLENTLTIfVRLCALGOCFSOGREATNVRpVRfTNSECAYREZSSAATSO'1 IDCFI1'G~iFDEPAWSTALESNINFtaf'CIFfATEKVCPKSLAfiILKEE
NANDLICESFEKIDKERVRYOKEORLYWETIL>RNEOELREEICESLRWNRRKCYRA01!DA
GRLKGLLROWKKNLADVEANLEDAThIDFENEVSKSEGCSVRARLEVLEEEWGlLSPKVADFPIS1TFIDTANPF
IEELCSYEERCILPIRENLERIIYLpYNKCSEILSKA1(FFPPEDEOLLVSEANLREVCAQL
KQVpGKCOERAQKFAIFEKHI0E0KSLIKLOVRSFDLAGVGFLKSELLSIACNLYINJ1WCPn_0138 -Glucaleate-1-seaialdehyde-2.1-awinoniucase-'Itemt KESIPVDVPCNOLYYSYYEDNF~IVVRNRLLt~IfERYt7NFKRSLNSIOFNDDVLLRDPWO.
TNSRLFLAIImOLLOI~WKLTKRNl;~ICSNOKF~'VTFEEACOVFPOGVNSPVRIICRSVC
PEGNETALKERELOEZTLSCKKLKVAQDNLSELESRLSRRVTPPIVSS71QCDI!'L~fIfCREFIDFCCDWCALIHGN
SHPKIVKAIOKTALKGTSYCLTSE
EEILFATNLLSSLKLKEHKIRPVSSCTFJ11M'AVRLAf~ITNRSI
CPt>_0127 162152 161130 LCGI57TECTIDNLTSLINtPSPNSLLISLPYNNSQILHHVNEJ~IGPOVIYCIIFEPICAN
ycfF-Cationic Amino Acid TransportertIDIVLPKAEFLDDIIELCKRFCSLSINDEYVfGFRVAFQGAQDIFNLSPDITlYCKILDC
ESFNPPSANOESRTRNVPLGIFtiGLVACLYWGTVPVIPNFLGSFGDLDIVLTRYTIFCIF' SLIACAIKNPSVIIDI'1'PLYIWRKSLLWTLLINPVYYFCITLGIRYVGSAITWIASLAPTIFOACMSGNFWNATGHA
CLPJW1LVONRSILDNI?IPF~'I
' AVLYNSNT'KOKELPYSLLFAISSVIITCVILTHLSAWLPTAASPLYSIILVTAVILSIStNFDEAIOJStrifEICfQ
TfYSEVPONG
LFYSPIEEEIRSOGFPVSLVt~GTIIFSLFFTESAP
LWVIYVIRNQSLLElDfPNLTPD1WSYLICISALITCLPNIIILDLCCITHVTNNLISHTPVYLSPSPLEANFTSSAHT
EENLTYAONIIIDSLIKIFDSSAORFF
PSLOECICIFTIIi.CGSLLCLVLFGRIfVOKSLENSOVSSSNECPn_0139 SPA' KNItLRDIMCIPYARLEKCSLLVASPDINpCVFARSIfIt.LCEHSI14CSFCLIWKTL.C
CPn_0129 16226: 163057 FEISDDLPtFEIfVSNHNLRPCNCCPLOANpt4dLLHSCSEIPEO'CLEICPSWL.~DLPPL
bpll-Bioctn Protein L)gase QEIASSESCPEINLCFGYSG1'lpAGpLEKEFLSNDWFLJIPfBJICL)YVPYSEPEDWALVLKO
EDRCRNLRNt7Vf.WCSECVSPYYLRHTIRFLKWSTODCAFDTIRVDCNFLIIaJPFWEET
TRLLVFPGGADRPYIfRVLHGLCTARTFOYVSECfR,IPLGIC1GAYFCS1WIYFYEPECAPLLCGKYA:aLa'-TVPONLLW
:GARDLCFFPGTAKCFAYRGNFSYVSPSGVRVSPOLFSDFCLGYANFNGCCFFEOSECYP
.~.JMIE.iRYDDLFGKPASIVSRIVSKGWVLSGPHIEfLPHYCRMVKENVOKTREFLORERC~ 0140 TTLDRYCOt4LVQRLRQPAFSKAfIC ~E
PRSNOOKIFCNSLEKELLETPLVLLNF.IKLVSFCNIACNILGTEEKKFAIYGHVSIICOJI
012J 163747 1ti3064 FOCAOTE.HSPORPFAHDLWFVFSCFDIOVLR'NLNDYKDNVFYTRLFLEOKDREFLYV
vfm _ VWDARPSDSTPLALTHKIPILCVICSVFDAWPYEE
similarity co CT036 DEQYILSHIIMDPRIFVrSEPLOKTYOKLQEKHVNNLGIASQVSLTDLONKTQYtTfIJLIE
TTMEITYYFPWIMPDILRSEWDPISNOLYLIFKKFFIHYHNLPSTALfRiJQTLLIDSLCPn 0141 NTG~SNPTARONELL1FLCVFEOLDYNEDEYTIEPRGYFNRFVYKNSOTAPOIOSFCLLHrpiA-Rtte!-S-P
Iicmerafe A
' HSSSAVEYDW W EKKCL1HEAATOVT:
W iR IOfESLAVHA
LTfIS IVLCSP ILYOL ITEFClTKIHADDFOCLII41~.rONa~fALAKOLJ1I PLW
PEKPSSLDLTVDCADEVDPOLRN
I IKCCOGA I FREKILLRA
o f CF?L~YASNN L RN1.
Pn 01)tl 164251 167751 ANRSII:.'JDESKLVFV4:RFRVFLEI3RF~IRa~AIIEEIRNLGYEGSiRWDI'COLFITDS
:' .
::NYIYLLF3PNSYPNPEKDLLKLIOIHr:/IEWSP/LONEtIW:SN.4pGLI::KKYSV
Nu rotxmt hrmwloa pr~senc in uenehfnk/f7tAL
au .~t IL/7/~8 .:::MVKf::::: I I I IENKKP.k't.LFESKF11 I'PKLSL1I L~LFLG IANI: I L I ALSf:LLI'NCLLl 175914 I 4% l:dl' Ir\L:LI::It'Jf::'it:ILf.P:TQt'.::K.~.VQKDEOKPK.:IFPKBfP.:LDPWLWM.KNKIQa:;s CPnIIl u: ur 11 ~nk/f:NflL
rrranc m :wnet mlwl t t FTLLLDff::INLKNtT.i'FN:FEfIiKKIFLKGPDFLIY::ALAN41KILE.
.
, v .,r.:
c No rn ' ' ' ~
~
' .H V. iIIDNA 101( I R: f1 i LKP LAEN
KN.~. L LIfPr.
:HFE t.
.
!:H::Y::Y::Y':'LLEKFIIFK I LvLL..
mCynltl I~.44.1L lu,'.':RO
ftlNfl':FYDLKiDYfKCh:KRFRFLY.~.I':PIfLIIYLWF:IF'IT
t7.. rAar:r Innw.l..t I,r..r,,au in .aenrhmk/FlAlll. .n:. ,r 11/')lnk.fn!)l4: 1.'::47 1'/r.14 ' ' ' . .
:WIIJfT:IV/AQ t IWf>,nttJCt:.il Pr.m.tn .::;:LYKh:Rh::l::1 \IhI.IPFF'l::AYVFP::If:FLt'LFHI7JAH::::::hVYNaB-' "Yxlti ' ' ' ' /FV1.TIALIMI \I::LVLFLLIRSV _ f::VMt.FLU:1A _ LI
tR'fftftf..LKRf'LIf:01FUl1TL~.FLRI'EIILYKTFE:LY.Of:::I::LWLNUIhxJIALUDLIKK
r7/YY
Y.IH:Kt:IC'VLL:.
AFIFCt:\
:I
llEDP:DpM
HALWP
:
' ' ~
~
' n r Mt:L::hI'PU:EhItRA'IwIIYLFHW:F'IY:H:uItFATEtNFFtf.EHANfIJLfIYLTDKf:Y..
YIKV JK
:
~
.:F:K.:\:\IJWIIhIAa LINKIS
rrL
fi.Ll ::l IC:::FNUkLw.
:Ah'fLDDWI:O
PI
"
' ' "
.:
IIIII'hVLIIFY.FYKALEUEFTT\YyTLPAINrFLYrhIIFMIhIfEVTRKFYf~'ELfEDIVA
JI~YY:::HIF:RV::1.I.V
':1.1.1::IWfYI":::VI~VF:ALLI
.:I
~'Ehat:K1 lLf :h'uV::UAMF:IWRY
:M
:
L
Y'fr:Vl' UI
r ' ' .
':YIIKVfIGI:fDKX'ItYl.~tl.ltft:TV't:l:llnFlr'/r:.'Wfr:IDEY.r:II~ULIyOYLLIMJLVIA
D
:
:
:
,:VHIf1411NI:F:I
LYh (NKI
v\HF:IIv:IVUfKIIEfN:
E
L
LIe:AId:l'C::ha.la'I'Itl.l:a'1'Yt:INVIHU
\::Ft?!I'KG1/1U::UF'LW 1'KNVI:Wt:KFI:iAI:I:K
OKYMDILGDAPVSLLYUAFA 1F:.:lHt~:.taAit?IL1AEEEAKRYVEEK(~CSPIT:' Pf'(>OL'PrttiJIVr:Rf~IYH.~.KFFA:x7.:lDFtAKPLF.WEYJ11t0ALFJ1LAAELDOIltNOL'TL:.EC
E:PLANLK:iiFSDLNGRLKVSVEKAiILEEEiO
:'DCfYLEFDNEK iGDF3PLTFI
::f)EK'CIP:II:L'r:SKTPTLENKOEYfARIfIpAAOYLPLERt~iLiPO~FA.iCEtGMLTEEGIOEOY~ImRED
Lt?lt~i"LO~AIDfwI/DKQki~LG[rhCiEE
:
EOWAKVAt:~ItEL iEEVtdK .
~pn ~1t44 17794?. 190560 CPn_0151 1?4179 :?2i25 :100-t:Ip pt~cu.)ae ATPase DAVSFrILEKAFELAK.iSK)fCl'lTi?1HLLL\:S.EMP=:.FYLVIADINCndtpA-lbnoo>NCfanase ~
CY~LPKYPiLVICI1NP'K:L:L1NMLLGHCLSVKVIDNRASPEDPSF'..DCRKLP
M/LGVNFMEKF
' . ..)n~...-.r-- ..,_\rhn... -..,rA.vf,e-..ryr-rt~..._r.,.pr~L..n'.~'Y:
. v REP'N'/t:EIfDPKPSFr:LCTLLRLIAKOEAKTLCD~IISCOH1.(.I.At ~
ftAVKDAL
A:Lf . . . ,.~r.:r. : 'N : :.\.._:rl:Y
. ,.P.:: h ",: . . .. .._.F'::':
N ?:.' d ' .. ..". .~ :.i"."'- \:r\f'.' ~ F
.::... . . : 'Ill i:w :,:rr-.. f ~'1 '~
.. . .
. .
. ..f. .
'::Irl:.N:.: .
._.,,... ~./:-:I:;,... ,...;v.: .
t111 '~
:.'i~~::..:.:'re.'sJ,ii:H:.:.i::KaFI.IFV
P1.
"n:4hE:it' NLDLROLVK:.'r:.~in KQLYVLOMCALIAGAKYRGEFEERLKSVLKInIE.:Gt>:EH11FIDEVHTLYGaCATOCAMDLCLPOGTNSISPKLKS
.KT'I'CfYNLVIa'DENFHIKT3HHAFPPEI~NVLFLGSLSNtLLLS
AANIiJfPAWtGTLt~ICATTLNEYOKYIEKDAALERRFOPIFVTEPSLtDJIVFILRGLRYI14DINtttINAAFHtr IWKLLP~'KK~KNLVITXDGCOf.TIILPYISPT'ItBtAAt~.PFS
EKYEIFNGYRITEGALNMVLLSYRYIPDRFLPDItAIDLIDGASLIRNOIGSLPLPIDERFYTPAI~tYYFLK0.'AAF
HtTCEEYYYPPHQAIJfYASSDIIAMSP00AEIHGPGPG101AI
GWD~.I EEYGLrWIEICNVKEPRIIld.YNJINP
LREELASIJti PDLXEAL
. .Q
KCAELAALIVKDFJIIKRflOSPSYOEEADAMOKSIDADARL~SFLLDPLKSSKNLLIFFKDI
E~DE
KpIgLEa~'ptf'gEgtaIERVADYNRVAELRYSLIPOLCECIKDDEASLNOAI:t'iRLLONSIJIIRPDRYIGYR'1 'Ifl'FKLNELISYLLRIFASEATg RLIAOWAt~t'GIPVOKNLEGF~1EIG.LILEESL6CAWGOPFAV811VSDSiRAARVQ1'tD
PQRPt:CVFLFLGP'1GVGKTELAKALADLLFNKIEAMVRFDNSCYNGDfBIS%LIGSSPGY
CPn 'rGYEEGGSLSEALRRAPYSVVLFDEIEKADKEYIHILLOVPDOCILTDG~.
..
CT119 hYDOthectcal Drotein FIM'SNICSPELADYCSKK~SCL'l7tFJIILSWSPVLKRYLSP6l1l4RIDEILPPVPLTitELIK~~VS~'~AICAS
1NIPVIIVPGFPOIPEDLYOIXTtI7~CPA~ICLAtIfII~D
DLVKIVOIQMtRIAORLKARRINLSWDOSVILFLSEOGYDSAFGARPLttRLI00KVV~DtDtt..IGVI2Ii.PN1'P
TPtOGP'V1WL.F'NGFRGTI~C..'iLAYRKIGRAFAAVGIA2LRYDM
K~1LLKGDIAPD'ISLELTMAKEVLVFKKVETPS AG~DSEf'V, AEEYPIErYLRDAQl'ILF1'VQEHPDIlIAYRL;sISGFSLGCHIAFR.iIKIYN
PRDIXIXALSVWAPIAOOCILLKELYR,tFSKHGECDIISIrCKI~GIGPPPIItIC8CD4I~.
CPn_0145 180717 182369 LIRIWfNTA~LPTKPYILHQOCIODTLVSRTpOTLFKNfAPL~tT!'ISYPM'OttI1t.11T
CT114 hypocnecical Dracein APDLOttILOClIVSNPOATL
tKTYDK1U15 NCAASFIWLNKSSNRNLRSPMFKSFIVRYMIYOGLVSFLLPIPtH.F.CAtil4V
WNLDPYKLESLCAYOYLSSKRIAFlFPQIOKD~PIPATM195130 197892 VISRDLIG.OEDCOKF CPtL0153 :LTLCKVDRGFSPEEISLIOt(LSYPGLSLASLRCS1'EIDPNTtHJ1R11LWSEFSCDI.A~1C4leu8-LaucYl tRNA BYnthetsse RADYYSNCLDILALRIHAERORYLDDSPCVPC1'SEFHKATItAINI'ILFYtFAYRYPSKKt~tYDPNLI1D00a0QF
iiKEHR5F0ItNEDEDINKYYVLDNFPYPSCAGLNVGMLIGY?ATD
EHFSDEFSFLSSVTDRKFGVCL.GVSSLYFSLSORLDLPLEAVTPPCNIYLRYOOGBVNIEIVARYKRAAGPSVLHPMO
WDSFOLPAtOYAIRTGTNPKVTTQ10JIANFKKOLBAI~PSYD
TTAOGRHLPTASYCDCL.DLE'LOVATPEEMIGLT!lINOaSFALOKKIfYKFAtGY~D~EOGREPATBDPGrYNWtOi (LFLPLYOOCLAYMA~IA~PtLCfVt.BNEEVE~.FSImG
YIr'u'DEEWELL.GtVOIt.GGKKKLGASLIGXSPPASORCSVAYDYLIIGAINI?TLALLPSYYPVAtIOQ.RONIL
KITAYA~.LECL0AL0YfPESIVKQL0f0'MIGICS~ALVTPl0.1'~S
PCSNIfEEIASYEEELKKANKSgMPCCDGOARLaSVAFNLGATAFJ1YJILt.EItL~IFDIPNDLLEAIT'I'ALDrL
IGVBPLVIAPENPDa.DSIVSEDOROEV'1'AYVOEBLAXSERDRI8SVA1'K
SLtILRt.CAILCDRN1:YZ7fALKYFIIAERLNEDOCFLKImI~tSFJILtYEVXKII8KVJ1POK'PfyPZCNYJ11 01PIi~.LPVNISDYVVi~YCI'L"VVf4CVPAND~REPA00''SLPINEVI
ANTLLLllESR
IppCI,NGL9G0E11107YVINYLEI4lSIGRAk'tMYRLR~R.FSRORYi~IP
IPIIRPEDG?IOfPLmDE<.PLLPP1'tIDDYRPf7CPCOGPLAKA00WVNIY06ElCRi'DCRI:
0i46 182595 183095 TY'1~ONA0'A~~~K~~~'YIGCUNAYLILLYtRII~Dt CPn _ VPYD~GLYSTPEPFA7CLIN~
W.W15SYAIPGKGYVSIEpNAEENGIWISI'CGEIVt~tO
No robust horaolog present in Cenebank/EM84 as of 11/7/98 IIVGISILSSgEWPOTVtIGLGFCCL55KSVVPFKKBLSDAPRVVCSILVLTLGIGALVCGdK~~POVI'IEEIfGiID
ALRItYAMPSGPLD104K1w5N8GVGOCRRPt181tYDLV
IAI'KWCVPGVIIlIGGICAIVLGAIgLALSLFWLWGt.PSNCCGSRRVLPGEGLLADRT.LDTSBEVODIFDRDGLVL
AIaLVFAITtHIE1048LN1'IPSSFMEFLNDFSALWYBaIALSIt GGF&RAAPSQCLPCDGSPRAS'i'PSCLEEWAEIOAV1'W1IDOMSDDt>DAAWPOIDESYLVAQIYtIVVO'VNCKLx AVRVLEPIAPHISEEt1114VILGNPPGI
_ CPtL0147 183213 183671 No robust hostoloQ Dresent >,n Genabank/EMBL0151 197174 199202 as of 11/7/98 CPtf HCGPMAVOSIKFrIVTSAATgVCCIrtJCSRLAIPAFITZEPRATSIARSVIAAIIAWAISLI',., t 9seA-KDO ?raaslerase GLGLVVLAGCCPiGMAAf:AI1?IZL~fALLAWAILITLRLtNIPKAEIPSPQB,REP~TSEPCPMNLRGVNItIFACT
YWVLVC1WIALPKLLYKlILVYGKYKKSU1VRPGLIOtPIN
SA1'PPLEGGSfAGEAGRGGGSPLTOLDLNSGAGSpGtIGPLVWftIGilgyt;CVRLLLPVLFJtFCEEF1~WRCLY1 'SCTELGYOVABWPIPI~'!V
SILPLaFSIIIABWAKLNPSLWF514CDt.'YtLNFIEFJIItRICAlTLYINGRI8ID88AltF
CPrt0118 183822 185702 APWtt~IYPSPVDGPLLODEYpKOAFLSLGIPF3iRT~tIICCYIfARpTALNL~l1' pknl-S/T Protein Kinase WAI7RLRLPTDSKL.VIIG5141R8DAGf0,ILPVWKLIKt7GVSVLWVPRINP.LTLtDVEiN01 tJItVSSltEBEIfDICAA1IGDYRILYRKGQSIFtSrnrr.acuRp'IAItAYLIRIil.PDtOS000TF~KtO~iiP
LOCE
! LNIPItCLWSRGJ1NFSYVWVVVDEICLLKQLYVAGDL.AF
SOP/fffJlF'NIriNVKLi11G11tfPGILSIENVSFSEGRCFLVTODCDIPILSL'1GYZ.1t8I?RKVPLITGPNI
T80SFZa0ALLL8GACLCLDEIEPIIIriYSPLLt~iQElfuJIYVGI~IOf!\tK
LTILEIVDIVSQiASLt.DYV11g~10EEWtd.DSVYIHILNGVPItVILPDIGFASLIXERAEIABPDRZIfRALItS
YIPLY10'1S
ILDGFISDEINRLSKII(FRVLLNTS~GAED1YA1'GAIlYYLi.FGPLpOGIFPNP81N
FS~IYOND!'LISSCLSCTftEERAKfiGFPLIRIUCTt&EEI4NVVTNCIEBtiLRCVPDPLE
VGEI7NSti00KESAENLEFVLYF~ICSIDEAl~1'AIESt888GVEP.80YSCPeL0155 199697 199488 SSONLPOAVLA /
RYVEAEKEEPKPOPILTEMVLISRGSVEGQADELPVNKVILNo robust tmmolop Dresent in Cewbank/EMBL
V as of 11/
r LALQSLLVREPV IfBDLtGYEDL
S
NSLSFCVPFLEKLKISLIPIEEMRNELFfIKTNNSSSNGFSNOEIOGIRTYI
NSP'FLDVHWZTIP.QFIAYLECCCSEOTHfYYNELIALRDSAIOARSGItLVIEPGYA1D1PW
CVTWYCASGYAEWIGKRLPTEAE~TEIAASGGYAALRYPCGEIEASAANFPTADII'~fMSYFLIAtINPN
YPPNPYGLYIA~1VY&ICOI7WYGYDFYEISAQEPESPOGPAOtittYRVL~~LKaD
CPtL0156 200147 199770 LRCAtpWRM4PGAVNSTYGPRCAtOIIN No robust homolop present in Genebank/EMBL
as of 11/7/9A
IG%QKLLARt~I~AP~TAPPP~PIAQOGVCIPSTICHLITIWYC
CPn_0119 185706 187700 FYIYRAATPQSIriIPDGCCFILLERLKELGAGFFYCDIJtESNTTGFTLFPGGSNKGVLIQIN
dnlJ-DNA Liqase L!'IADE
ERFIOtft?1SOJWYL71LGRLEDHDYSYYVLNRPRISDYEYDNXLRKLLEIERSNPEWRVL
WSPSTRLGDRPSGTFSVVSfIKEPIQ.SIANSYSKEELSEFFBRVFxSLGTSPAY1VELKID
CPfL0157 - 200753 200298 GIAVAIRYE~IVLVOALSRCNCKOCEDITSNIRTIRSLPLRLPEDAPEPIEYRGtVPPSYNo robust homolo0 Present in Genebank/EMBL
ICLL5P0EVAKRKLEISIYNLIAPGDNDgltYE as of 11/7/98 STFQIINEKOQQLEKTIFANPRNAACGTL L
atOfE
. .
NWRCLlS4GFPV~KPRLCSTPEEVISVLKTIETERASLPNEIDG)1VIKVDSLASDRVLGFSFYI'YKEAL'liIY0F5 PGJ1SPNWQAStatAQLNSYFCLGGETVTRIISLAPSGLI
IIA
ATCKNYRWALAYKYAPEEAETLLEDILVOVGR1'GVt.TPVI1KLTPVLLSGSLVSRASLYNEKAWSTAEKILKILSFI
' DEINAKDIRIGD'1YCVAKGGEYIP!(WItVCRCARPEG5E1IWNMPEPCPVGHSNVVRftDRtP
KttPIILYKEAAL'IYSPLFYSLP1G(YOLI9CVf VSVRCVNPECVACAIEKIRFFVGRGALNIDHhGVttVITKLFEIGLVNl'CADLFOL'n'EDL
CPn HQIPGIRERSARNLLESIEOAKNVDLDRFLVALGIPLIGIGVATVL7IGIfFETLDRVISAT_ No robust tlomolo0 Dresent in Genebank/EMBL
as of 11/7/98 FEELISLEGICEKVAHAIAEYFSDSTHLNEIAKMODLGVCISPYNKSGSTCFGAAtVITGPPNLTLSINLDLLLEDLOT
DSLPWPKL'1LSEDFDFAYYPfSKAIID'IYAKLtIaHPGCCP
TrDCMSRLDAETAIRNCOGKVC55VSKOTDYWMGNNPCSKLffKARKIGVSILDOEAITNCLtSKKItJIRYLLEOLFK
LETGtl'IFPTSTIDGCRESFLIEFSNE1'KKPTIMAFIYFYYYN
LIHLE
SNGPKLEKDPKOAGCEVHNRLLM.GLKfRPOAGAONDGRNCGPYGPICFLIVWEENYGSV
~Pn_0150 187759 192141 LKONGFLKON
~f117 hypothetical protein CIYYKFFYSYNCPYFISFFVLLGYNMASSSNNSTKODGIPSWVNPMIOWNRASOVGDOEACPn,-0159 01811 f 11/7/98 MSLTPEAp!fSR.S'WFSDRKHFLEhWgLEEMENNDLKKYSRYKTIILIATLVTVAIi'CIVNo robust hanoloq present )n Cunehsnk/EMBL
as o CCP!OCE1'ATRIF'aMPSGFSLATEK/OVSTAEKVIKILALIFFPIILIAIJ1IRYFfJOtK
PISNVFGIPMWVPCLILFi.JIGLSSAFLSHRWSKCKEIHLRYRAYOtYROOLLgOYPDLRFDRIOCFVLPCD'fPKEL
ELIW1NPOL'JENAAREVHPGFFALPTKYOSMYIO'tSKG
K3TLYKYSITIiVKPKKCFVGKLVENLRPDLNANKD00GAAADSRLDFAGYCVKHYOlDAL
L .V~.lfttSVIYpRLASLIMSVKNOttlIDNCSREPIDFAORSALWSC~DtGGEIOP~I
L CPn 0160 203794 20.127 D
DLSRDILAICCYCMNtlGVE7U(KAiDOYKKWYLNSSTFIAWNPOLPAIAOSYLLE00ANLpfkA-Fructose-ti-P Phosphotransiarase ~ALTTAHG1GOALEDLDSLLCYYDOLIESKCVGEKILASItIpKHLDt.AMOD' 'i KIF
DL
u RIPE
IA
TV6LLSWKSYPEI(NtLRYRPEILTLLETIRSKNtOE't'SSPPSPPPEWKNIPNit O
::cTiOENLKKWSNLYNVFSITiKEFTECKLEONEWSRIORLRGALEKSKCSILCNCItTNA
ElITK3EKKLADYLWIGDREPFLTGMHKAIATCKAIQGKVECSIISONPEIfOIMILPCSIVSLYTEOETSSKPLKICV
LL.iCCOAPr7.HMIVICL.FDALRVFNPKTRLfCFIKGPG~.TR
DL'VIYDYYMACItFDMLSSGREKIKTEEDKKNTtIJtVKOLKLOCLLIIOf~&ft .L'fKtR
Ef!LEL7ALRRE04Jf:AIWKNEDEVL.ALK"TMFaQWf;FItDLVCTiItGKYOEFKKNKi.SINL.
' TDTN1LAEYFIr\HNCKTSVICVPKTICI:OLKNCWIETSIf;FN7".iCRTYgeIICNL1KI1AL
tILFQVTPECLBLL (rTLPNIALt.~nELIATRKISLKOLSODLAI1CLVRRY
tfDFTK::Y;Nt.LNRLEVLHAErrT'DDLVLtIVDRMSEDLKKTIEEIIII.iAKKIHIIFtRLNCOQA::YTTLEf:
r:t ANt:lQcat'TIELL'LIVOEDNRLOEAt.~.~.f'"..VSQGLMLLIt.~.LLt7RDEKtNKNtEiSRKI4LVA.
.
KrCIQf!~fVLLPEGLIEHtFD?RKLILELN\'LI.HHiCD...~.IEK1L:K4iPETLKTFNLFPK
AY')AH::f1\ItNtL.:(r:L\PLIORNR/13L(Ntiti~faflLFtK7SIRNIHALDTETLVATSSNMDIANOLLIA
RD.~.IKxIVItV::KIATEEL.IJWMI'KKEfEKIKMIMEFIC:V::IIFFf:'IFJWAGFP
?tHIiLHHO
DV
~
' .
:iNFri'.N'l(aaLCIt::\I.FLVRGK'Ir:IMfTINN(JW::YTEW01~4\TPLYKIIIHLPIIRv't".CE
.
fNt.t.DVLfIOSKPAPAfMENPLH.P:ALPf6VODAVAE
t'::AMIITFD4INlY
:LLILSC17C
' \LFI
X'tIIL
r II
' .
.
'PfNtYTD.'1't'PK~PAVQ)If.t.OO::D.':rt:/I:~LVNFf'C:f'IlYIFt:KERLtUONPLTL(J!IDpT
..
L
v AUf.KIMt::QWK:iINKY
iIAKAIV<L:fVA
ILFrI
VI.YNfa LIfATY:LPwEFNNKDLNRWY.WDNLNLE
'I'LIFURI::K::KEFEYOVLETAO
~
f 6'~
. IIChPr~Ai.'f::hY:KR::I.
.
.
LL::
.a':f~IWAHNIV::ULE~:ff'TKf:K:a.iCDL'fKEFRRD.~.Yf1I11KRIKRRFKMCf.I:OFJIPWRPT
lI JI/tH~~\tYPAt:LIIRLI.I:IIWKOKEEI::IRCOAL'/'rEPMCLt:LEK;:KYDNP.KNIAAAMT
ILIASN iO Ut'm ' t)161 ,:fK
KK'Ir:KL~M)IUHI.t'KNNLTYVRIOHFFRTLIQEKLGt.tI%rVpEtIriIVKEAKELfIELAAIIYG.
.
S
Nf:X:N::~.K(tIIAKKOt'Kf7firUlIAC:KfiQLEL.LG\'iL:flCA~p:IA:NfK.~MOAwPRERLLLNPIfn f!rliCtwt .ncylCC.uc;tnee.r:m Lun, lyl IIR:::a':RKQENLLI'n:WLt:KFIt7JN'fMt::I:1'I.IlJNFTTFt:ldJtfl'LIIYNPfYtIVILtJIG
It:AKIPa;\t:HTlr\::RKt?1Wt't'LCI':YLTPFVRFs::fF::1'O:Y:YMJII.INREOt.FDIEORLt.IL
:IAtTt:SKR::HVRLIWELTRII:IMLNVIH.It:IK:Dt.D':F:lliUt'::LtNYK~tIINEtIEYT
CLV
:KV7TLMRDIJ1AVF
rP
' ' :
' .
Il::ldlItOL)ERLAIFt::::aLX:I'1.AllJP:.LFFNKtKAir\VWAITf:aa:lWlAKMrYNAPEYI
:
~/NKIIE~.LIV:,9 Yv:LI:A
ILY:~
:
,rlVrt:IV::IItINMV!)AALAA
tTY:VLt'1'RLN(c:l~'I).ItRVtI::VLIISIILRGCD.~.:;tar:IIDWKKLFEt.LNNNI:IWPNOPEC
'I'N::OK~ALTYACNTIIJPDFYTpFLIIIDIYKELHFLTDSNSPELLSEVKFaLK. '" ' NLPPtLYNOGE00LLVSINHRTL
FTFJ1FANCDKP ITILTYPOV1H1AFPFAE.iSALSDL'f'QNLKRELTSCE
CPtf~017!i .. w.,.l.ir927r r :~~~Sld , cr153 Iffooensr:i~ar p~onwoa "..~
..
:~ ~rf~: 2nse7o 2DIeD3 NDDOPI~SDDEFJ1SKDSAfSASFSyEfYKSSTRGKt~fl1''"11TASRTLYILRpOCdYDP
,b rrfousc nowoloq present in CenWank/ENHLRALKVDDEPIIYfiVEKRLDAKNPOSLNAFHKEVG111YVAs'VrYGCTCFpVLRIISYL4VCEL
as of 11/7/98 tI/YTLYN:OSPFRtNKLYSIS50VCl'PWIFOLNSKVDSYLFIGCNRIIfWSIVhpEPNLIEKEKISISVAAASSLLK
SKT~IATEK~SSYQSESSAGIVFt.O~'VL.POLOOIHtLDFKDN
tCKVfIJVRI3TIVKILKTLSPLIFPLLLIALALInFLHAKYANNLLVSKIt.ER11P0YVPLiLPNEPIPLAIr~SIT
CI:IIPELFPSEDJI0VGi0KKSALAxVILNYLLSNKPKE~SP
SE
irR:>r.L.h'A.SHIKLTTLVPV.';01fM4AlICSNPLEVFJULR'I'I'KPSFINPAKYROITISSH.
:...t.,..... ;..Y:1~~ '-''T"r'.!~Yf~"'~~.,......,ir..w....kn~
FYLRF
. _ .
'=:.y~mr'; t.vr:'.r:W '.".~'/ .,.;~~y,.
~ .
.. " ':.:,lm!~-:1:1.:~.'uta~ . ',-.-;r.,... " ..: r rcn:_w'. :.
' ~ :~I 'Ai ..;..;y~KDpc;,~:
' ;n::nrr.....
ri~
, :
:. i tm.Fr,. Lk:Nll~ll'1CDLF lR.ir~LEIR;.iUlNi:i~:iCV
:~;";;.~:..; ":a' Lii:'..dl.::I i i w:.: ~ : i r':;iU.:v :~:~.n!~'.alr F I IT'J
':t;,. .: . . .... L,yl.: 'rrr:x:rl.~.':.:W
PLDEDRC'uCFEILEOLOELCVRFPICPSOCPDNPNFOCFOCIRtYWEDSYDPNKPV
CPn CPtf_016) 205931 206191 _ No robust hateoloq present in Genebenk/fllBL
as of 11/7/98 No robust tfomoloq present in Genebank/EMHLDKRIaTI'KSIIFIFLISCESIOfOPNSLIFSSVCLt~GLCSLbSd~IOKP>WMIiHrI'STSEEF
F
as oC 11/7/98 ?EI(AIVYCIKCKOIIKC'SIItITP'1'PATPILTE('aEIFPGPVDSAIQ~fDLERLLTyIDfRPDNOLPMIPSAFR
TTQIFSEEfHiDPYWAKTDEESRIfINR6IN1~1LICIIfGSYIPI
IIRIYLR~IGOSLV'I'IYPKDGORLRSPEDLRVGDDLVOSYPNHLNAIELDCWIP~LIGASTIfGSLJ41PKSAALTL
KTYRPNPIWINCYERSFNIDTCKYLKEGSRRRT$NDGP10111RVL
TYIITFADFSTYILSLRSYOANSPSD011~lGIWPGSIDDPVOAVISFLKt#IGFALPSTLWM.IKSSGRRGHAICL~f I'EEDFYIJUIRRCGVYSLYWlVCSYPQI?IPFVIAYAIiIA0is11 DPLt.CrNlt CSKLVLPVKCYYSLVi~f~'iVSSSDSLirAFCDSF71~YGRSTFLANCl'SILCVIItSYKRVPP
0161 206141 206998 OP . .
CPn _ No robust tawoloq present in Genebsnk/EHBLCP(L0178 218052 217789 as oC 11/7/98 V
I .
LCFKCIY:KIIFSFLKDLNTRSTIESSDSLCSRSFSOKLSVpTt.IOiICESRiJOCITSLenc in Genebsnk/E1'oiL as of 11/7/91 LTLIVOCALIALAGGOVLSFPLGLII~GSVLVLFSSIYLVSCCKFFlLKaIIXCCSVICS>amoio0 Dres frobusx !
No KICLG
' ' AF _ DLFGEEBCRNOCNRSARNOLFJIILHETDGIILKRYtsOGAK_ _ !
ECQLFIIIIVGKTEPCNC
ESIMICI
~
'KLNIWFEKpPNICDIEKALENP ~~ G~
CPtL0165 206983 207582 CPr~0179 218550 218056 No robust homoloq present in Genebenk/EHHL
as of 1117/98 No robust honaloq presaft in Gerfebank/EI~LPKLWDI'NFETRIGTSVPKFNRRLPKSFHKSGRSSRPSKAL1IANFPN1'TipJIGRSCIIPG
as of 11/7/99 NVLLFNNhfVPKTIDifiIDPESEIDIRKWSCY>Q.IKECQPLFRSLISFLLCVIRCOLRI1.KKIfAILLaiVNDAKT
PNYSCItLSIGFPNEpDLEAQtBJpQAALVRKILICWPNNfLKGLIJ1K
RSKYOmARTVSDEDAPLFCLTRSYYQDGYLTPUWGPRDLINNYIRLRRRENPlOIFFSPLKKDRlQ2LSSLIFiKLSYA
I
KNPCYYARLAFNESVCYYRZ<.FDIERLTKMYVECDYSKEOEKNt4AILSlyK'1'Lt>DGImFLP
IS
LIEHKDTDLIGACFlDVFCT
CPIL,0180 218963 218355 ~ No Irobust haeolo0 Dresetft in GMebank/ENBL
as of 11/7/91 CPn_0166 207591 207962 TSLIHIILOCKYRPYFION'1~ASETYPSOILIU10REVRDiIYFNOADCNPARANOtLGIDtI
No robust tfamotoq present in Genebsnk/ElmI.'IWINL
as of 11/7/98 NCLROYM(SDSD1SESINRSIHLEASTPF!'IKLllnCESRLVItI?SLVISLtaLVGAGVTCLLDVYf~NYS~T~DI' ~R'RFTFVSSKNDIENNGLS?IPLONVLViAMVRR1 ' LAAdCIRNIEWRWCLDLRSOILIS1U.FZKOPOFOSLTEDFVNNS'1'IIOEGRVIpNtNL
D S
LWLF1/AGILPLLPVLILEIILITVLVLLFCLVLEPYLIFxPSKIKELPKVDELSVVETR
' STL S
OEKK
LISLIIZCItCiAVLESE
CPn_0167 208309 207977 CPn_o181 219175 218777 No robust tfomoloq present in Genebenk/O~L
as of 11/7/98 .lo robust homoloq present in Cenebenk/EHBLFYIHnSLNSHNLIXPSSLFJIAVpALDSYIYWOGDITDVL71A
as oC 11/7/98 ELFKIOCVYIfFFIDIFNKL
V
NLwSHFPRGFFNLPFCPTILWCPFIaISENYGLEAL71J1TVD5YF1%.GOSOIYFL.
SKODDD H
DDISREIYCVPRLYIRFWIVSISOSLSRIPWRLKRILLRYCfLRGKYVNPILIKRIJ1ILL
ITVELSaI(1%tKFKP~GSIIiCI'LYTEDPILPAIC'tSFSNCSDIOHRTPISPIHCLIRFSRLRNSNY
CPtfr0168 208715 IOAI17 No robust ffomoloQ preserve in CPn_01s2 220701 219331 Genebank/0~L as oC 11/7/98 SyINLRRREZIpENFlNpGIIpCYYARLtUTIE,gVRIYR1G.PM'AQJIONYGAGDYEOt~aedHlotin Carboocylase LKSILSFVQILDEKDGF11DFLATlIKDr1'FIGROG71DITCSRCZIHDNLIAMtDtIAVRIIMGfIDLCL3TVAVYS
sYLKISNU.A~ICEnGAa~fYHPCYCFLSENtwFASrcESC~.TFIaPSSSSr~IL~cIa ANSLi110CI1ICPVIFOS1~iIIEDGS~IAE1IIG!'PIVIKAVJ1000CROIRItI~FY
CPn ' ' _ IIGNYVIaGLIDCTIGRIUIpNL
No robuse homoloq present in Genebenk/~i.ItZfPRNLEICVI0D1 as of 11/7/98 RAFSA7IRAl:AGGF1~811~NV1fIEK!
SFNIEFTICENNIBe~NCSECSOPLVIdEtM'OPLRNLCESRLVIfII'SFYI~VGGLTLIEETPSPIL.NiI6IRVKV
OLVA
TJ1L8G71GILSfLPWLVL.GIVLVVLCAL.FLLFSYIU'CPINaGVI/Yl~ti'DSDIHQNFDRpRNZTELtrI'CID
LV1CC0INVAlGEDfLPWKa00IEPSGNIIOCRIN11EDPTpiFBPHIORLDFII
LPPAGPSIRVOGACYSCYAIPPYYDS!lIAIfVIAI0G10DlEGIAIisWALIItlNZ~1108'1' K IPFHOFHLDNPKFLFSIiYDINYIDNt.L7IQCNSPFhEP
TNDOVDPVSEDSIRTVISCYIQ.IKACKPEFRSLISELLRAIIQSGIGLLSRCSRYQEMKT
VStIIUSIPLFCPTIISYYRDGYLTPLRAGPRYIINRAI
CPef.0183 231207 220695 0170 311098 210025 aces-Hiocin Grboxyl Grrier Protein CPn _ RRtL~LI00IEKL11IANORHDHfRPAIKAL~.GdLERDTAECSiRpEPVIYDSRLFBGFS
No robust hanloloq Dresent in Genebenk/EMHL
as of 11/7/98 NVRIQetURGE!(YNTCTVIAPVLSMSYInLFKNLLKEDSVHKICNEIFALWRtlTrIACTOERPIPTDPKKD?IKEIT
rENSE't'STCTSSCDFISSPLVIfTFYGSPAPOSPSFVKPODIV
E71IIKNLPKADIHVHLPCTI'rPOLiIWII~GV1044PLKWSYNS~IrNtIRLLSPKNPNKOYSNISED'fIVCIVEJ
U~IKVIHtlYK7l~lSCRVLEVLITNGDPVOFCSKLFRIAI~J1S
FRNFimICKLfOPDLSVIQYttIIIQYDFNSPD1IVNATVOGHRPPPOCIDNEF~LLLIFNNY
LOpCLDprIYYTEVODNIRLANVLYPSLPEKHARl9cFY0ILYRASQTFSIDiGITLRFIlVCCPIL_0184 FNKTFAPOINIDEPAOGtrQWt.OEVDSTFpGLE11GI0SACSESAP011CPKRL71SGYRNkYe!pElonQacion Fattor P
DSGFGCEANAGEGIETRTIFSSAKVNPEGLIEITRVTFSSLKRKOPSSLPIRV'ICpLGOWKIKFt7CCEE1IINVLSS
OLSVCI~IFISrKDCLYKVTSVSKVJ1GPKGLRFIIIVAt.pAADSD
WIDWFKATOEVKGOFCfRTLEYLYLEDESYLfLDiGNYEKLFIPOEIMKI8JJf4FLIU
CPff _ ICDVIKIOfRTCEYIORV' ~OuaA-ONP Synchase IIKLOSJIRtIHLNTIFILDFCSOYZYVLAKOVRKLFVYCEVLP4MISVCCIJCERAPLGIIL
SCCPNSVYENKAPHLDPEIYKLOIPILJ1ICYGNOLtIARDFGG'IYSPG11GEFCYTPIHLY?CPfi_0185 CELFKHIYDCESLD?EIRNSHRI~fVTTIPEDFNVIASTSQCSISGIEIPrICORLYGLOFHPcpe/araD-Ribulose-P Epialerase EVSDSTPl'QJK:L6~.'FVOEICSAPTLWNPLYIODDLVSKIOD'IYIEVFDIYAOSLDVONI.AEVKKQESVWGPSI
NGADLTCLNEAKKLEOAGSDFIHIDIMOOtfFI/PNLTP!CPCIIM
AOLTIYSDVIESSRSGHASEVIKSHHNVGCLPKNLKLKLVEPLRYLPKDEVRILOFa(.CLINRSTDLFLEYNJWIYNP
FEFILSFVRSGADRIIVHFGSEDIKELLSIfIRICGGVpAGLA
SSYLLDRHPFPGPGLTIRVICEILPEYL71ILRMDLIFIEELRKAKLYDKISOAFALfLPFSPDTSTEFLPSPLPFCWV
VLNSVYPCIIGDSFLPN2'IEKIAFARHJIIICriGLKDBCLI
tKSVSVKODCRSYfuY'EIJ1LRAVESTDFNTGRWAYLPCDVLSSCSSRIINEIPEVSRWYDEVOGOIDDOSAPLCRDa GADILVTASYLFEADSIJWEDKILLLRCHrYWIC
ISDNPPATIEtrIE
CPn_0186 :_'3878 221069 0172 213237 312110 'sinilartcY co Cps IncA
CFn _ PIKDKILItSSPVNNI'PSAPNIPIPAP'ITPGIPT1'KPRSSFIEKVIIVAKYILFAIM'1'SO
-fmpD-Inosine 5'-monophosphese detwdroqenase ICOOH-terminal rsqton oniyl ALCrILOLSCALTPOICIALLVIFFVSNVLIGLILKDSLSOGEERRLRECVSRPT80pR
APIGAAICIOPLCISRAHHLVGGANVLVIDTAHANSKGVFt3'lIILELKSOFPOLSLWCNLTVITTTLETEVKDLKAA
KDOLTLEIGFRNENCMLKTTAEIILEEpVSKLSFQLGLiRI
LVTAEAJ1VSL1EICVDAVKVOICPGSICI'1'RIVSGVCYPOITAITNVAKALKNSAVTVTANOLIpANAG0A0EISS
ELKKLt~1DSKWEDINTSIpAiJfVL(GOEiAPQG017IVID1NQ
D,RIRYSCDWKALAAG10CVHLCSLLIGTDGPCDIVSIDEKLFKRYRCMDSLCIWKOCEOIOALOAEIIGIWNDSTAWK
SVFNLLVODQI1LTRWriELLE.iC'DLLS011Cb'ALRQEIE
.~.ADRYFVtOGOKKLVPOCVEGLVAYKCSVHDVLYOILCCIRSGHaYVCAAETLXDLKTNASKLAOHETSLOORIDAN
LAOEONLAEQVTALEKNKOEJIpKAESEFTACVRDRTlORRETPP
F'IRITESGPAESHIHNTYKVOPTItIY P'r'l'PWOCDE~EED~CI'PPVSQPSSPVDRATCDCO
017: 211041 211715 CPn_0197 331218 ..5015 ~.Pn _ Dradiccad methylasa nn rotf.:.~.c tfoaaloq pcesenc fn Genebenk/EMBL as of 11/7/98 TIFDLIYKIDSYKHQQCFMDFSVFPDRFVESTSPSFIEDIDAI(rLVSNCCNYCSRCLFLFVPLTYTRTLPMNSKFI~~
sRRKKN..iHKEET.~.WDr.'LAS.~>yHKfIIODKrHYYIIRETILPOLLP
I::LL SI I
Ir_F::Vlr;l'SCETASLVFCIL.~rLIVLVLLLSLTLOSKSSVLDICCCOt:FLERALPKECRYLI:IDL::.~.FL
IALAY.Y.NIL:VNSIIDPIfVADLS
IECRNRECCRRIS
KRLEFVEPTLFSHAVATL.iGNMEFPt:F.A
1 RNTATLLEPiMFF I VLMIPt'.PR I
PRASSN
:Nn Ot'lA 314215 3L1721 IIYDEtIKIUISRHILIItYL.~.FHIIIPIHAIIt't:OND.':P::TL.:FIIFPL::IWFKELI:.~.IK'.Fi.V
DDL
th. rMoar lu>tnolrul present in EF7dl'::.~.ifT:.'r.KRAKAfN4'RKEFPLFIIII:;rtKtK
~enab.mk/ENOL .i:: or 11/7/99 A'lT I PAOCRRS W
~
'X'mT::L
~
Y I F INF'/RK I V I Lahl Ihrrr L
NSP::PALNPEL::LI FFM'L'J
.. ~.'.~..fuo . W'fsOlu>t ~
.
.
.
'I'Lfi(ILLIF'IIILl:W1'II:.1'FTVIFFLNC:LNLL:."CC::IIG.~.::1'.LIIVGLLFLINCLYFH
::::LO(~:L'Jt:LL~Y.EL::OAEEREEEYIOEIEALR(7AFRAE:a'Tf:::P~IwL~'I'if2 tWP~cIa'.ru:.W pmcr:tn A'rIlrX:INFRKLFPP::KKK'I'a(JY.ORLkNM:LV~I11 tV::IYVLIJItINA::KFAI:VL::YY1ILI.
'Frsnl'/: ':11>I'n. .'.Li275 '15'VIIGVFFLRL::~~IILFTNLNWYtWLtIKF'1IiIY.Kf'I'/AIVFJ1A'IIIATr::~IIr:LVl.l2'w"F
th. r.ds~.-.r tuNnfIW prssnnr m WtY.SrN:ILMLI.::I.FJx:LNKIFPT'..'Wrf'I::LYILV::YI'/iYLV::I'MfYtIVtY::bIIYITO
.;v.nrtamk/r?IfsL .n:: nl IL/')/ny LLLACFYiFt.I.HItPIMEUI'Nr:VLOD~'1'fVLYALN:;FL(RI::WCIfRLGICp::PLEAFNAtA:E!HI'Ir ~'IAKLF::L;.11:1f1'AL1FI::RFVI~YLLI:It.AI.Fir:YA1L.F!YAIf,IK'r::AI.I::I'LII<:
t'rFl:Lr/ITC:FI'LEbVAfI'ILPr:YfIPKFYLSFIDRDUf:I/HYEVLDt.VFLK1YAACLIN:i::VWI'/FW
tAFF::I,~~11::IFM'::FTIt:At.VALI".:FLLt.I.lt'l'rNI'II.FYk:Ai.TFftI~NRIIf.'I' AQFIAK03I(VPIGEVSOC:.DVL E:tFtYYL."YD1IL'fODF:FJ. .'..~.AVDLK'tTYF:
t :L .,. ~ ' .
FLFtGDKILf.~.CYLOLIT.:"."!ILALTTPOi7~tECFn ~)t7N ".L.h,1145. ; .~.9t1,'!1n PVPNPSELTtKDIADKLLHREIfKKfNPOLGTTFtENSFONTfNOA
EKHCfLFPYNFK:YO opM-Q.i t'td~t "0 t~ldmt>s: ~'rf,e IWICfNLTL::EIARRIK ) n (,~~ ,;~
IEYYINKMIRUtPt'LK:L.PNLLPLLLTL3.:CSKGKCEFLGK~~fI1111SHDI~I.~RN
AY4ER~4~'Y~'~~f~~~~P~~rAY
~.Pn OlN7 DFEKSIKQLYfEEFSPSIH'.'~'VIKNSSAIHNAGK.~nLE:wICAtfDCL:v.VITLGOPfP
ols Transnlaalbrane Prxelnl P
ossl YFLTLIARPVf3PVHHTLPE..'YKKCfPPSTYTSNGPFVLKKHEItQHY:.::.CKNPHYYOHE
~:131 aalwtrxt-t .
NpIB(RRs'WLKIiGL~.LI:S.'aL'JIGfLIFL?QLISZ'ESRKnVFSLIHKE3uLiCa'eIEELK-., EPAS "
' '!
;
~e .
--rFT' ,F~~r-:
r -c ~
~
A ~
.GIdSLKIN .
EAKDEVF:>AEKFELDG~LLRLLIYKKPKGITL..
. , ~.- y ....",.. ;-M'ARKIKLTr _ .r .a ~lF~ .;
t .A
.
'j _ ~,t w.
...,,...t.r....,~-._., -, . :'. t' 'v. .
, . ..1:
, t,lr wYf . .
. .... .. ..
...~H......n~~r.~ ' ~ v:"_:1,:::..::.. ::1:'-:.~
n . ~..111.:~i : :'~!
'I '~\F_ tL::
,. 1 ' ' ..t. r.
'f'!'dfstl:~
\
:"
w ' ~
. ...
.\Ft li ~ ilnFr . htlfiirUv\
YHCfLI(KRROGDFFIAT~)IIAEYVSPVAfLSILCNPRDLTQWRNSDYEKTLEKI'YLpHA
r ! : I.
;'.:17: ":
T.FYIVECsSSFIELKPELASALCNGItPLS'fP
IONKLOGHIHK
N
' ' ' .
'IKH~.KRAFMIIEEETPIIFL'fHGKYIYAIHPKI0N1'FCSILSJITDCICiIDILS
ASGDG
O
NOCKT:rtfi ITSKOIHAtYSYAKIPLDITI6dKItIEIT9QAGLpEyAINPKDPM.At.OLI~E
VI?N
DIAYSSSIVIFCASPSHG1JGL.ISIONKKtILTKFRLJ?OIIOLp~I'RAIFPOPF
KF~
K~E
a CPn_0199 211019 :11983 PLOVAYYSLNIF~1"IKNAHLtJIMILONPLS.KISCSM.SG1~N~'FKfNl-011~pe0ctde Psnnease TD1?JLFFPKFSGKITJIRENtLLIEIAKIGSp~pIKP6ITSI
:IaptIPSYAFJJtF~KJI0I opp LItIGOFCSLPLSLVSNHLApFHLKKLTfSFIIfDGGKFVTKGNL4ALIENPDYPOLMJfRIKCLICLSLVF&YILO1R
ILF:ILi.SLIaIVLTLTFLVIOITIPGDPFNC~NLSEEVLOTLK
fV IG
it'I
LDCSSTSPSSKDLKIOGSGEIFSLPLDSITKTYaItOVRLSpYfGSSGDIiJ.
SRYGGDKpLYGCYT'QYLHSIAKLDfCNSLVYKDRKV7NLST.11PISAIiwiCSL
' ' IPDGLLS !It KLTLLSNFKSFaLIGEL%LVlIaFSNIfLSSOICD'tIAWLVSPERYASFFIOiAGGIAI~'IAALE~RRYILG7LSIt .OISIPAFIFA?LL7YVFAVKIPLLPIAL1GP
VNYNPKDON
TILPTLAIJ1YPPN~IIOLTIfSSVSAAt3IKDYVLLAYAK~.SpLKWIKHILPYAIfIII
'CSPItLLHRTANVALDISKISCPEETKGLSCLTLLA71GGLEGSLGTpLIFYDINSKET1 r l' a SY~'T~V~AIFlIIFCIPGIGKWFICSIKQRDYPVAIhiLSVFYGTLfNt~SIZ.
llOC SIIDFQIRYAtiGKF.%%R!I
FIINDfKCSLRAMiLDAKIEYDL.KCSCLApAGDSKT4AE~~SPESRtI
LKAQISSLAGPRINVSItOQAFRTGEGPVDT~S~
PMSPr' ' ' .n OL
aI Q
uu ANYIIHIPSSFIA
Rf?1LTAHLSIfLEDVHKAFL.OEFNpt.L~YSGYPVTLEIIrifONFYLp AEKSILi .
IPLIill 0200 241996 212968 tRPYSFEEFRIQSATLOte;KISIAtnGTMYALfOfLDITOQKOfVE~TpIfF5V01~SCPn IICKRLDALIDRRIRIJILIfGKTDTAHDRLF!!t'LGIDPLVIKKYFIiTSLKTIWffLIKIR..
fSStlrDSTPPPTYHPFPWDCSNFD oDPC-Olipopepcide Pe>:~ase LADKt LTT IILI4If'.J1LLLPWFYQ
VL:
IKSIIpN104 RSI' . .
CSISSPEVDWSSAYAAIALL4SYSLGHPFSS .
r C~OO~I?ISrILSSAPS
ILVSPCSRFPFCTDTLGRCIIFARTLAGLRLStd'IATIATLIDIILIIt'.LWATYAISOGKKI
SIEHK
DPLJ!!R'ITEILFSLPRIPIfILLLVIftdB~.f.PLIfJg!'1'I'KAIIPISRIIYCQFLLLIDiK
PFVGfALIAI~(A.4TFNILKZIIS1'LIFtIPNAIYTCAIISFIGLGIGPPOAS
CPn ' y .S1IG
170 :obusc homolog present in Genebank/tf~LLG'ILVK~INJ1IDYYtsILFFFPSLII4IAf.SISfNLIGEG1KTLCLE~
as of 11/7/99 STSTKKF)1VSKAIQKIIKINCITDPSIlIVETPNAEIGSILQEIKEI
Lf.GIKLNRK1WSFO
KOKLSKQAEDLGLLLILYCSQETLSM.fldINASLKLSIGSVZEt~SLKOLVEESIEtShGCPeI-'0101 212110 IVIOCLLIKfiNPEKSEAASgGIIVOTLL t ATPase pQDpLIGSVLIEISDXFLSSIGEILSLNLOIf' oppD-0119opeDCide Transpor ~SVA~tERCHIt7M~CYRVL~iGEpI~EOiIV
SKADLttDNYLLNIKDL?ITSTNPKRTLI~1LSLGLIt!lIAI~LVG~
e LG ASISSAPGLIDIW
~yS~IAS,, Ci~II7fAILGFLPF3JCLIK1GSILFEDIDITIG.SPKELIKIRGI~CIATIIL
RYVSSGLTIDKVEDKPITKFIRaGKLLYSGGTSt?~ESMP4GL4TSGI9fPlWK
SKt7YLE
TPSCItIGNOIIE?LROHHKtI~tKEEAYWOl110LLTDVCIPNPKYSPJpYPFS.SritilIlORV
SASKSNDGSFPFSALRHKFTFSt7TDCPGITS'1'1'LSGNOAGfY191SLSLKVLVPSIP9IEK
' VIAIAL71SOPKLILIIDEPITALOSNSOAQVLRILRNI00QICW1TILLV114a.9LVKtt~l1 PEVOLSLVYSYEONLPIDNIFIOfSOPRTIPL71LI~Tt4.xDKYDILEL.AAHC'1 SPNCSRFSLOIxOTNOfENS>MIfYtVNAAHSf OICIIKDG1G.IE'1'CI'VEFIfLSPKHPYTLXLIN7NSKIPIAtCtSSPILR~OtiiJI~CG
CPtL0191 231079 271314 gln0-AIiC Jlmt,no Aeid Transporter ATPase CP1L0202 213692 211500 CYDKREGVMI'IRVRNLJ1YSVNlGDIILDGVTFSLERQIITLF9GICSGSGK11IIP-OlipopaPCide ?ramporc ATPase OHHFPVFL o . pp LRALJIGLVDP1GGDTt~tIEGF~IpALVFGQPHLFSfNiIVLGNGTHpDIHIKGRSTEF~AtKAVPT9NEYAAWAlfI
'LLSIK~.SLTIRCKKILNHINLNLIKGSYLTIVGP>Kt~%Si.iLLT
FEf yHLr~IEE51AKNYPDOLSCCDKORVAIVRSLGIDKIITLLFDEPL'SJ1LDPFATASFAH%FTisCi'ITfI~PKIPR
JWNGVIWGDIDSSLNPC?1SIKCZISEPIliIIGTYfKA
ILDLT
LLZTLRDQELTVGL217~tIpFVHSCLORIYLIDpGTVAGVYIGtDGp4~YIHS.
yNyyDI,yNLpKbYWLKInKLSGGGKpRIAIAKALVSKPELLICOB>tL1LDTL
flt;LDLIQTIXKEYGtn'LLFITHIxISAAYYIAOTIAVlmOC81.V0~CitSTPKH
N
. p 'I~DLLDAIPIF6LISTOIZpStCYBLOVASK
CPn_0192 232617 271991 glnP-ABC )uaino Aeid Transporter pernaaae CP1L0303 211966 215802 CVSGIGIICGSIIGLLIGTV'ISLYFPSIG.TKLLIINSYVhomolog Dresene in Genebsnk/t?~L
RGCGYZT as o! 11/7/11 GVpttNWIARLtt E
. No sobusc .
IVPLPpIO'S~TFSPTt4KSFSLFLLEKLDSYFPFOClRI9ILVI?I'L11IALA
V
TVIRG?PLFIOILIIYFGLPEVLPIEPTPLV11GITALSisB'ISAAYLI~NIRGGTNSLSIGD
vIESIWVLCYKKYQIFVYTIYPQVFlf7ILPSLTNEPVSLIKESSILMVVGVpELTKVTiInIAW010CKVSTIEKIIK
ILSFILLPLVIIAFILRYfLHiDtFDKpPLCIPKVIt~I.iG
VSREIJiI~S4YLICAGLYfLMI'SFSCISAISBCRRSYDNSRFQAVEK71VAEISP11FFSIPRKYQLIAIDTPK17D
AP8ILFPIGIEIII.I~CI~I~a ' r NLTLIOtEI07TLGNPEEKaLFDSICSIEK00~1N8LESKKLLI'1'HILIDfWSGIIOWIf CPeL0193 233111 232696 FNp~'IGRGYFSEISTAKIHFHGI~YCPIRSSCPIIOttI
acpR-AS'Qinine Repressor KLtILIlPl00CKVTIDf.V.KEILRLEG7UITOEc'rtavr.f~,FATTOSSVSRWLRKIQAVIN0201 215691 AGOIGARYSLPSSTEKi'I'1'RHLVISIRtOtASLIVIRlYPGSASWIAALLDOGLImEILGT~
No sobuec hoaalog present in Genebank/~tBL
as o! 11/7/99 LaGODTIFVTPIDEGRLPLLNVSIAti4LDVFLDpAaAtNFfNNKYSImPFSSARBIWANPFIbTItHEGNIKIKClIC
IfQIFTRLKt<GItf88 YNSINFNPYFFDEDCIVYwtESOIKSAIADHGILGKCILTFYPNT
CPn,-019 273162 231211 qcp-O-Sialoglycoprocein Entiopeptidase EVPHTIKfi~M'FSNFFIQ.TLGLESSCDLTACAIVNEt>KQI1.ANIIASQDIHASYGGWPECPIL0205 216077 No robust tloeOloQ pree'uIC in GenebenkJF1'~L
as o! 11/7/98 U1SRAHLHIfpQttINItAL.OCANLLIEDImLIAVTGTPGLIGSLSVCVHfCKGIAIGAKKSICDSIKGYGSASIIFt WPpCIi.LKFFLVCEELCILTVATHRALLETPL7ILSFlXG.ATKYV
LIGV1MVEANLYAAYHAAQNWFPJ1LGLWSGAHfAAFfIFlIPI'SYIG.IGKTRI~AIGETYRAKDIIALM'11f10C
pTILh?SPLCS
FD~FIGLPYPAGPLIEKLiILEGSEDSYPFSPAKVt,IfYDFSFSGLKTAVLYAIIttORIS
SPRSfAPEISLEKORDIAASFOKAAC1TIAQKLPTITKEFSCRSILICGGVAINlYtRSA
IQTACNLPVYFPPAKT.CSDNAAMIAGIGGBtFQIQ7SSIPEIRICAttYGWESVSPFSL71SPCT207 hypothetical D~cein IVDAASPACYDSINSDAIGVSLtJ~ISHILF~UIYDt7GILPREJ1IB~11AIVKGNQITpYLL
CPn_0195 231172 :75785 HILNDAInRVPEIVNDGSYOGHLYANYLLAOFRESAALPLTIKLPAPE~TPHAIAGWL
oppA-Oligopepcide BmdinQ Protein TEDLPRILASYC1IDDSLIKELILTPXINPYVIWN1I9GLVTLVCJIGKIPRDKVIRYtAEL
'lSCNSYNRKISW
TCITTLLSLSVVLOCCKSSHSSTSRGELJ1INIRDEPRSLDPROVRLLNYRLEKOPSFAWONLIAu'ICTLYPGELFYP
ISKAFDGGLVDTSFISNEDVCNIINtrfTl1 ' fAED ESCIHTLCSSTELINDTLEEHEKWLEDfPIEP
LSEISLVKHIYEGLVOENNLSGNIEPALAEDYSLSSOGLTYTPKLKSAPNSNGDPL
' fLESPTSH
FTESIdIfGVATGLNSCIYAFAINPIfO~IVRKIQEGHLSII>EtFGVNSPNESTLW
FLNLIaLPVfFPVHKSORTLOSKSLPIASGAFYP1WIKOKQWIIfI3KNPHYYNDSOVEtX
CPn "ITIHPIPOANCAAKLFTaOGtct.NwpGPPwGERIPpETt.SNtASKGHLHSFOtIACTSNLTf_ ybnl/sodiTl-Oxoglucarace/Nelate Translxacor NTNKFPLNNNKLRFaLIsALDKFr\LVSTTFLGPAKTADHLLp'INTHSYPEHOKQGUWROVNKKKRFLSLLFLTAVL:
riTWFSPNPASINStJA4.lOLFAIFTfI'INGIIFQPVPNG11IAII
AYAKKLfKEALEEtAITAKDt.EHIliLTFPVSSSASSLLVQLIR&QIiKFSIGFAIpIVGKEGISTLLLTOTLTLEQG
L:.:fHNpIAWLVfLSFStJIIIGIIKIGi.GIRIAYPFVSAIGKBPL
FALLQADISSCNFSLATCCWFADFADPMAFLTIFAYPSGVPPYAINNKDFLEILQNIEQEGLaI.LVITDFFtrIPAIF
aTARAGGILYPW1'SLSOSPGSSAEIFCCODLICSFLIINAY
ODHpKItSELVSQASLYLETFHIIEPIYHDAFGFAMVKKLSNLGVSP'ICVVDFRYAKFNOSStItTSJWFLTANJ1G1 IFLV 1ALAGHVrIISLS4MWAKAAI
IPCLPSLFfJIpIILYKLYP
PKITrCEEALRSAKLRLKt~tDpLKKEEKTTLtIPFLLV'JWl'P~.LCISA'ITAALIGLS
CPn_0196 235906 237519 LLILTNILOw~COVTANTTANETPTWft:ALIeatASPIIK?LGFIPLVGOSAAALVSG49MC
appA-OW gopepclde Binding Protein ICFPLLFLIYfYSHYLFA.'NI'ANICAIfIPIFWVSISIIUTNPTPAALTLAFASHLFCCLT
KLKSYSKERSFNLRFFAVfISTLWLITS'GCSPSOSSKGIFwHHKEHpRSLDPGKTRLIApAPLYFGSHLVT1'~EWWI
!SGfALifVNIVIWWtCSLMiKA(J~'LI
DO'LUIRHLYEGLVfEHSQNCEIKPALAESYTISEDGTRYTfKTKNILWSNCDPLTAQDFVHYr~
:>SWKEILKDaSSVYLYAFLPIiQJARAIfDOTESPENIOVMLDKIpILEIOLETPCAHFL
IFFPVHETLRNYSTSFEE71PITCCAFRPVSLEICCLRLHLEKNPNYHNKSRVKLHCPr~-0208 :x9935 250(102 E
HFLTLP ttrase KIIVpFIStIAHI'AAILFKHKKLDWOOPPWCEPIPPEISASLHOOOOLfSLpGASI'IWti.fptkA-Fructose-n-P Fho:.photrane SVAVIL19IPLYYDLDTIL''3Y'opPLPKEPOEAA.SLtA'/PDT.SHSKPWPCVKTLFPO~!'1H
NIQKKW1NNAKLRKAL.iLAIDKOMLTKt~YYOCIJ1EPCDHILHPRLYPGTYPERKRONERILpYLKFVQtTEMMITf LKi'':VNF'.X7f:PAPtX.71NVI0liLFNSLKDFHPDSSLVGFYN11r7DG
Lti\OOLFEEALDELCM'fREDLEKETL'tFSTfSFSY.RICpIILREOWKKVLKFTTPIVGOE~.IDITEEFI::KFR
N:::X:FNI:IrTORKKIYI'PEAY.EAt.'LKTAEAGDLODLVIIODD
TtQIK
I
FFTIOrtIFt.G:NY::LTVN~%'fl'AI1FIDCN::YI11IFANPt3GISPYI1LOD::HFDfLLIKITOE.
.
I::aITATAILAEYFaIiARf~T::I'/rVPYTIDr:UL011TFLDLTPGFDTATKFYS3IISNISR
IIKKHLf!tK)LItFALDYLClIf.HtLEpL.'l1I'NLRI\ClIKNTtWFNLFVRR't~DFRFIEKLtlIII:X:KAt IYtIFtKtatt:K: :\:at tALD:ALVI'tN'rllAL Ir:EEIAP.IWLM.KTI
IHKIC:iVIA
I>itAIWEKYY.NILtfF,.a:SStfETItIt.fTfIF.~..L::E'lt~ItI::RL:a~7pRLLKSFPAPI
rays n1.7 W751'1 :'aHR=
IEUfINDRONY.:N1:YSI:~R::."/f.YLI.IIII:I::NIII!)r)YFMNPFNAI::HfIGYt.A'K
..Nlu\ "It.p,lId 1.1. Ititul)n.t 1'I,ftl'Y~:!::IITI<:.4:ILt'I:\il.'.'Y:ff:."PtI:::IJU.'fYftYYIrI.RAIfNVKMFTVK(~A
~M.Q
1'r.,t.ein :KIIKO::LIIPtHDDPV
' Y:
' ' ' ' fta.
1KIYK'ILVDIf:::Pl\F'RI:.'s:.~lYIwAt.HIr.:YPF'll:ltlrtF:rl1'fTlU:l)NFP~LTLLWHN
:
::::
LfLLFI::L
IX:KVIKVItKNF::RWI
rllYlthr.:LDI.KF
:EDFt:::Y'fFFTK0..AL
V
' ' : Fl,IrrPINJ:t:cetrmn'r IT
rIIRk:~NOLRLAfA:at nF::l'Ir~AKILIMUI::IdVI.LFCL:LTHE.
.TP::aNAfPIIILD::PNPDFPKLIJIFp NF' TII'h:F:DIRfIAWI:YAVENSPIIISIfOt:I
:
f .
. ~:IW!l.'.IIV :'l04'l ..'.l'.'.'l W:
D
.
Ah'A ( fKPFNfKLF:?a'1"rLVEYf P1:11NiFILKKNpIffYDYIK'V::IN.i IKLLi t PDIYTAIH
Id.NIK:YVIMVt~I5n1~1:fIWRt.11K0::y1'ItYYTYfNI)t:AfyIIL:I.NMCSpHIJ)DLQNRIIRLAm:v :mt m .:.,n.t.mY/h?Htl. .ts:
.,t Ili'!"IN
~.t tu>Ha,l.m tmtrtt tY
'ri'fI.KP:aII:F'.At.cl:ryH'Af.'I'L:ah:Al'~,pMOYKltrll(TL'ff174:1ILVLTYP::OtLRCO
RfA~
KLWNF .
.
tE:::IItIIHI.KNE1'Y:'.1':"1'!~...yYtR.'a.IIMI:KIdd:Y1'.:h'I:YI!'1'INIAI'l'1'N':
LALAYEGNI
IaLrFY.HIRM4:IId.(t.l:!LfYlI1.l'VNKRKVQDYAfAT)TtNAYYfY:ANLI::fED
rxv rodt.r cuxmUxr G. i. t.~. ';~n~ottu.
EF18L Js ~: 1::~I98-IIL:.7t~IK::.Y'IL3FPRSLLR'.T.'.LWYRF:TNLIf:RY :..~.DDCPTEATKNtr't iK:..:F1.'RDN4Er:.TNPISEIVSET35SI1IDSYGRSL
IF
lA..$
FtI~::IIf.CAR4l:~L:.TDIOdLDCC0C14iWlrLLRLirt, LG
"/ .h[u~st~hllE
~
~
~:Fns021v! ".215 251147 .
noraolog Prea~tnc tn rJenebank/ENHL~
as Uf !1/7(99 ., .
H ..
NLVICFCK
o rodt3c E3674 IEfIEREIFKTIREKEHATISITrLVELE71L.1tREFJWLKDQICPTSDOETTSLYQCLDH
'I
KL
r r.pn_O::S 2b7402 _ O No robes: nomaloQ Presrnc :n GmtbanK/EMBL
:.EFYLLGL::rDKFLKATEDED'JLFESOKALD~AP8L1L.LTKARDYIrGi.GDI~wIIYOTIEFLiras of L1~7/99 ' .NDCVEIAKAKL
YTF10~IPKKNKKMKFNSIIFLENTKHYPOIFRECEYRDRNGUtEASL%JL:.STZTIZRSIL
A'fGSKYNRRAFCiT:iEIHF;.KTAIRDLNAYYLLDPRWPLCKIEEFVWI
' LPS . '~!...:
QEEYOKD -' ~!'FtDtETKEGt7E.:LLREEHANEKCSIODLORKL.."D::IELHDVSLF':FSKrf ' ..
' _ .
.: .
... . .. :.a:~.. . ... ..... ... .
: F:r ~ . . ..
L~~
.'."'r..KI!:~',:.'.~.r'...:_...:
it:; '.MI'r:l:?:.:.. .. =y-.rr: U..'. Loldic :oiaii :Vn _ No coDusc rJamolog present in Gaubmk/t>,8L
as of 11:7(98 CPn_0211 252765 252167 NSRIKFLtXtIDrAINSO'l'1'I'POPNL:'DAEPIASRAQCKSIAYIISLIWQIL.LLGL::I
No robust homolop presort in Gmebmk/DlBLSE
as of 11/7/98 ECVFISYPDISNVQASSiOSALLNKTSDOIOOKRCPKOSTFVTLAVSLYIIGSLFLLAGVALISIPIPGLAAOVALCLG
IVSLILGI1L1NIG1LCLL:.RCKOVPOKPDCLPSESSKOP
K
AGGVGLL'JLF1KSLL
CSTPI'ALPWOAGEFLEKVOVSATPILLPKNKDEELSAKVIOCEGAFItASSTKOAYLCB?E
SLVFCVLGIYLCLLLi LTVP
.
LIDivRKOEESRREiIRKKIVAEEAUURXRI~OOMAaOpErILRKNKELYJ1KRK
I
SHGVL
=Pn_0213 254081 252888 No roausc hrxeolog presort in Genebank/EMHLCPC~032b 261515 264967 as of 11/7/98 ELSYWWSIYSETLSFSELTSC%NSLJPFGPIETASIRINNVPNVtIIVCLI:LCTLFVCNo robust hdsolop presene in Gmebank/ENBL
as of 1117/98 ' LGNVPLGVFSTYLIGNSSMTILLLLISIGIltLLKFKERYCL.EPKELFiYEOGFDK>IG.PSElifK
AIfNRRRNPYYANfLEFIOGTOSLCPLfKYCFVRFIHYIIGOLEIEDASIIDiIDfLEPPB
' 'II~DQTADLARELDLEOKKD'ZLIRD!'SARLIM~SKTEKKOILKIGVPRN(SBIOERLCAVLCIIIGLiIVALILIt I
RTLLAAIPILGSVIGLGRS.FSIWSIREPODSOEYKSIfWtTI
AOEONSILEOCKFJ1LLFRRKS110EIFKKLYDRK)1AFWRSYREDLWCYSEINVSKXALSNLLATFIMANPCLKRYAT
FLFYS
YICDVFEC'I'APFFFIIIEaIYAMCRTJU04L11HYVINCIfEDNRYNEEIWiAKOLSVSELLCCCT
CPn EIyTDLFiETtiLfTSDSEDVLEEYOIFICIRV1TIWALWAIYNDEWSItKPIDTL.-~ d d id OMCYVIff><:'LELEIJ1QLYYDZ.:~.F, Ox MAVEDCZE"Fg< orv uccase dsbB-Disulfide bon ..
KEPPNIFVSCKLIJ(EIf!lINFIRSYALYFAWAISCAG?LISIFYSYIIIiVEPCZilYYOR
ICLFPL'iYILCZSaYREDSSIKLYILPQAVLGIGISIYpvFLpEIPGMOI~IC~CST
CPn _ KIFLFSYVfIPMASWAlGAZVCLLVLTKKYRC
No robust homolog presort in Gatvbenk/EM8L
as of 11/7(98 ILWFSRVIFSYfNQIGIPRLELILPLWKXENDPFCFLFSRVECtF'IIWIK
CPeL0228 266242 26512 0211 255768 25146 dabG-DlsulEide Bond Chaperone CPn ' _ ZNSSL.RCPL111DCILVLCTANPFIY
No robust honalog Dresmc in Gmebank/EtlRLVKDBADTtM.%gKFSCSILKItENAFEFYVFGSIKOL
as of 11/7/98 PLGLIIEDYERPTYCIIPPAPHPQRVDSKGCIAStIVS?<N1JVALEILGIFFLSGSLAPLVtfCFGFLI1lK10fIIL
PPKANIPTNA1WFP?ICNPYAPINTTVfEEPSC571CJ1EF1TNFPLL
TSCCVLIaAALPILCIC'dYL:.JWALIVFLCNKHkI'RODLOIfYDODLDSLVTHIOCEIPNDIKIQIYIDIGEZSFT
LIPVCFIRGSKP11A0ALdICIYIBiDPRQADIDAYIICIfFNRTLTYPI~E
SELRVTFEKLONLFQFHTImFSDLSOELOC1CFINCNERWLTLFDEVIIfFLIVRDIIfLETRCSHWI1TPEYLTXI~I
ECLILINSGRSVNPKGL.EQCIASCQYNDDTKKNNL7fGSOVLOGOLLIT
RNPTI'fGEQVKGI05NIFDLIiEEKSSLYLELYRLtAfDIAVLLt~IFFLLPPGIL1CVDYOLIEPTAWCDYLIEDPT
FHEIEAAIONIROLOJ1YDGDN~
AIKGLFIRLTSRLDIG.DVKAQERIOIFINF715REP'IfLVEKAFDIVDRATKIO:J~RAKKESP
ARLINGRTESLLt?OLIaIEtAL.ID~K)GLDPF1ILSIiFET.FSPYOQLLZLltYLNSIVLHfIYEFCPnL0229 LISCTVTSCLTLEECCRMRAASIIGLNALLVRlG4FR~IKSAYFEKLTEZEKELRSLODCT178 hypothetical P~tein 'lIKSLELE;LIHKIKDIVTLET
NS1D1!'SFLRIEOENFSFK~OfSIILSi'IYNI'ANLTKSTFTFILLLLLRiDIDipCLRt11D8*T
LEMYRHFRYRFLLGltILPAl7.cLLLRCSPNTLNY'1'pVDVIFSDRLCSCLLIFL1IABLT
KRSLLWLGIIPLGIWVCLF11CVAGASP'I'TFANDTLIGF71ILAWCISPTIlP6AZ.6SICPTLP
CPn _ ECPSYNPSA~RRAAYLFLSLLGWL.FARYLTASSLGITSSOSSNFLLLYSSIttEVYSLLV
No robust hotnolo0 Dresmt in Genebrnk/ENHL
as of 11/7/98 LTSSIOCOVNSSAIARDCFPSPSPQPSSTLGVFtPPKYKSLILSVSLfVLGVLLL.CVCFELLI.VLSIaGSERRWHTR
PKIVIIITAIUTGIIIILTLLPIIGHpLRYOCWICIGLTIEPAEJIW
' VNAIFSFSVL'IYGIGCAGVFIGSLLLILGLIFPVSYNRKL8EJ1TRSLLf<.Q4KTLLEYQPWJ1DFGSEYYKIfZLS
IEER'1'Vi.PWKAY>oQiIP~TS
FAYD!<LRATLRYISOFI&DKRALTNAS!
LRKEWEVOWSNFLLDEWEDTKEWAOHKSOFATFECDLLLFGREVCKYI"lIWILELDGRFPINOLWILVA'LVF'V1NN
SSNCLP1'TPRNFWICCIifIIVLFIW11BSLRNLRY1WLI
DVJ1LLTELIDOIWCPLEFLRIfKCDRiOCEIOEQ.RKZ'.lffBMiKSGLXT.ACELTXFKSALImVFSAAILFSPVL
PNIPVESPNFLPTIV1'GLILIILSIGKRRRTIOIKI.
KIEpfxYRDXRKVIIQ.EVFPOGYRRELL.EVLKTRLSVOCEIOLFEEW511!'LEICJISLNA
'CVFSEEELOEAL~tAKAELLDIOVRKSWEDLSCEP'1'LIpYHIiJtL.YE170CRIVtOFLTOCPtL0230 TFSSEpEKVLEEYFU.KARIRKTLiNKLDOVR71NVAFVAS1TDLLSFSESLt%~16VFEDCT179 hypothetical protein p RPIOTALIYMSSOPLYITSSSLSRYWLTGEEKVACYKItAPNHIWN011PAIIL71ML1JIPC
IFCPVLCSILiGAPLEGASILYDVILPWLLPSILVFYLLVLPWIYAYSNfDOpVLJItJIER
CPn_0216 257623 25717 Z1'08I~KEIYDHCEKEKRTPNKKALSLYIESOVLVPEYSKR1SSNTIGKTL1CIIPID~SP
No sobust hoarolog Dresmc in Genebank/Dt9LLSL~DELIOKALiR7IKENZYIB~JDRtKRDERFJUtRGxNIVSK'1NPLW8LiiG't as of 11/7/98 NKJ1RTNNPVTFDRIQVDFIPFDTSLRINSYIVAOGLLIt.CWLSIISYICLDIGLVGLSA
GAAl1'LGLGCLIFALFLFSFSLILLL9QEKRVPDVLSLYLEKEVPOYE'LPLYKEDLEBERCPI~0271 268996 DMSAiSERLGTTEEKLRIAOCFRYSDSVfIf cauB-A8C Transport ATPSSe INierate/Fel POAFVSIOD10GFSMLOAHRLCYSCDt~VILJIDASFOASPCTIT:ILGSSGVGkITLFRLL
CPIt_02I7 257881 258579 l1G!'LPLOEGLLWHGSPWR1~VAYNpOK>JtLLPWRTALKNNTLS'fEIGINTSNE~l7IL~iE
yip RLEEIIMIFDLCQLLDRYPDELSCOORORIALAAQCLSLKPILLLDEPPSSLWLLIC~L
PKCGKLKGFLSVNELIFCFOTFSVWIGV!'FASRCKAWL1GWLSLLSSIHNVFVWCpIHLYQOIVAWtKENICTVLLVT
HDFHDVSCLCDVLYVIKNKTLTPVPLDPSMRPLii4f3LCFIK
WCFEVTSADVYVICLLTCLNYARFJIytKNDINDVIIQ.CSWVISIAFLVLTOLNLFLIPSPNDLIDWLYT
DSSOEHFL1LFSSTPRTWASLVTLIFVOIVDIKLFTFLpRVFSKKYFA!!RS'LISLLFSO
LIDTZIFSFIL3IYGLVSNLCDVHIFAtIt.VKGTVITLATPTL.TVTKAVL~tRSSCPn_0232 270171 siailaricy to 5'-Neehylchioadmosine/S-Adenosylhaaoeyseeine CPn_0218 259061 258582 Nueleosidsse No robust hdtlolog presort in GmeDank/f?1BLKKP'I1BtRFLFLILSSLPLVAFSADNFTILEEKOSPLSRVSIIFALPGYtPVSFDCNCPIP
as of 11/7/98 IFLSKIIVFFESYDFANV115SWPKSLRALVOGRYFVDSELKtTPYRINDFKXTPINHRLYWFSHSKIITLECORIYYS
GDSFGKYFWS11LWPNKVSSAWACtMILKNRVDLZLIIGSCY
RSLPIISTIGCIIRLIEAliSGPIHPRDKNIfYRFEVLQAVIEILCLCYL:LVFDITCCFLASRSODSRfCSVLVSKCY
INYDAONRPFFERFEIPDIKKSVFATSEVHREAILRGCEEFIS
FLVAIILSLLLYCNSTFTCVONLSPTERFII.EGTGEAVNFLATNKOEIEELLKTHCYLKS1TKTEtTI'IJIEGLVAT
GESFANSPNYFLSLOKLYPEIiIGIDSV
SGAYSQVCYEYSIfCLGVNILLPHPLESASNEOWKHLQSE115KIYNDTLLKSVLKtICSS
IF'ti_0219 259319 260172 H
cgc-Oueu:,ne cRNA Ribosyl TransEerase CSSL1LKFHLIHOSKIISOARVGOIETSHGVIDTR1F/PIfATFtGALKGVIDNSDIPLLFCNCPt>_0273 TYHLLLNPGPEAVAItt.~LHOFMCROAPIZTDSGGFOIFSIJIYCSVlrEEIKSCGKNRCMSNo robust homolog present in Genebank/F71BL
as of 11(7(98 SLVKITDECAWFKSYRDCRKLFtSPELSVOAONOt.GADIIIPLDELLPFHTDOEYFLTSCEKARt?tFiGIIVLLFLL
RISRRSYVOEtGIFFHLETPDLKIVt.CAPYSTFLWIIIWSLKN
3RTYVWEKRSLEYHRKDPRHOSMYCVIHCCLOPEQRRIGVRFVEDEPFDGSAfOGSLGRNKGQS
Lpf?ISGWI(I7~SFLSKERPVNLLGICOLPSIYANVN~FGZDSFDSSYFT%AARHGLILSK
aCPIKICOQKYSODSSTLDPSCSCLTCLSCISRAYLPJtLFWREPNM:WASIHNLHHNpCCn_0234 :71216 QVNKEIREAILKDEI
CTIH1 hyPOenecical protein FIML03CKXALLSIWSILAFHPIPCW)VEJ1KSGFLGKVKGWPSKKEIOEEARTLPVKDS
CPn_U220 260660 261236 LSWKRYDYfSs's'GFSVEFPGEPDF15GOIVEVPOSEITIRYDTYVTCLHPONIVYWSVWE
rln robust homolog presene in CeneNnk/EMBL'IPEKVDISRPEWLOEGFSCl410ALPESOVLFNOMOIQGHNALEFWIVCEDVYfRGHt.I
as of 11/7/98 F'fSFGKKKCIFYMSKESIRSYSEISTP1'PZFRETPSKLCVAYKI4LRSPAKOCILRNRVSSVNHTLYQVFtAVYKNK
NPQALpKfYEAFSOSFKITKIREPRTIPSSVIfKKVSL
LKCALLR:iIPFYCf.FLCAICRIHSAWSNCDAPC1'fRVINYLVCCLELLGLG'VVVIaCKVLA
'fALKFLFSKASSKIKOFIKWREKARNLANtDN~S;KEFCSVDLTSCFTRCFRLRNRWEE~Pn_0235 271195 t:A,iENp'NREIIV kda8-deoxyoecutonosic Acld ..".ynchecase VFVfIYLU4KPE;IECf.C I(;y(,PARWN.~..:RYPGKPWCIFIGK.iLIORTYENASOSSLLDItI
t:f n_U221 2ti 1621 262051 WAT~OHI
IOHItTDF~AVMT.iPTr:.:fICTERTCEVARK'IFPKAEI
IVNIOCDEPCWS
Flrr toc.ru:c lwnlolog present E1IVDALVOKL.tL:SPEAELVTT~/ALTTGfEEILTEKKVKCVFD.iECRALYF.iRL:PIPFILX
in anrt'.tnk/EHOL a~ oc lt/7f~9 Tn111RYK'fEZJiOMVNRYK~3AEFF,ADNYYDDfILI'RMr:fKRNLRC:L11'b'ENEVCLFEE?MLKATMrfLHI
t:VYAFKREALFR'IIpH~:."TPL.:DAEDLEOLRFLEIfCA:KINVCIVDAKSPSV
Gl::>1I1N::11'IM:.:LIGtI'.HLII~IIW.':fODFKDSKIIIFIITALGLLLTL.:IC,IIVLLLKITDYFl:
4IAKVFX~YI'h:I::WIYF
t'f ILLILFTrt'.LU.'YPMY.~.MYSDFIIPI
r'r,r nz :o e7a t Iw : mn,;
nlr~ U:I2:.:1474 ~0_:14~ . IHr% 'Z'F ynrhut.r:u wwk ::iortl.ir ity trr OdcihriOpn.al::Il'rfYIMOfKi IFL'TI'l:W.~.::L;KCLTI.A::I,ALLLERORLFNJ1MLKLDrYLJJVDi'(Y114!P
'llfI 1'tt11 .tF:KFI.KIWEKIJ2flJJJIFEt.'IY)PEC'fRNRYN4Ift:.Y:RF'.Tt'f011AKVW:'.YRrVHEASLYEF' F:IK:EfYVfDCtNEfDLDLC'JI'IIIRF.:::AAL.:hIL::.AT7VJIYARVIKREREf:DYLC.,~NO
Fat'FLTI:f1'oflKIlLrr)YI:::LVKIJIWLFLKF.LRKMI::PHKIRYFfiI:A1'f.TK(.ORPHYHLVII~I
IfTFIEIIt.WILDMKCII.~.rTNI.I'fF:Ir7~fI:DIE.:LFFLFJvIRUFRY011,~.EDCWIH
1 J:: MT'NF'f WMVEVK::K r'fVI C: /rJfL1 : i'; I 1 ftM L IJ:R::fYFL'TOEVI(~K
L::LPr:NVFNR
AVFN1/ InVKHT IYE74Tf J1LA(/PJf LAtlf': :F:Kt.YLATVPEtI4DOWKVf.YN(N.uVIX.PKVK
I
'far ~I'.1t .'.r.1rSU ~~, r f n:W:K'IVOHRI1AYK::fFEALT11M4Vf/:IIMI:I
v IC IDAEGF1ILTNEII.fjt:LMt:LVPt7ItFG
IVY LEVMKFEFSVALKYLIPGRGI
AIV'r'LF'rI..LV'.'4tL.ii'.'F:::VIHGLC.~NIEJL
POVDL'YDPE4DYLLPE:
EKIA:
~:' YR(.~IF~:YIArbVtf~CRCOGiI'YfrICII~VLWEn.
.~,NLOOANaLFJ4DPNTPItP ,n LKPCDKAHKAlNCS5LI0ERHRHRYE'ItIPDYIOSLED30LHSPITILPSD1'YY~~rtYOT~NSSL:~T1Y:T
' PLKOCDI~O~'u'~~w'Y ' fPCL F
'MFX'AOf'LYAT'SCFMRLt.A TNFL'fYPSKLSYE~YGPtfD~T~
IIr;LAtyt:fr:I'FOCLCEItEV~DNPirltt.VOFHPEIYSKLISPHPLFIAFIEAALVYSKDA,y~ySip~Kltp YTYIrII~,FYNpGLSPLGfOIZYFIDPDIJIRSiR50 .iHV ~D~,yE
Ta~DYOYFOP
7 ~
4 ~EGId ~I~i~~~.
421 PLVKAImLO~ETfMAFPGONLPNSVHPOAIYFIGLG'i.
.F~_023' 2'3741 2 'fiFAIITLKNtA
Y'fAf FnmtiY .
'.rK,vYtrI~KRI''.LAYAAEPLLLTLP:~NIEJ1GKNLKLiA)tAIJFKACGWIG
..,L~t,IC,IW;Kph .:.\t.' ~ :Pa, W'!~1!'.-r......-PM
' ' -.
.:rAE a7rJ~
.. ' . . .. .. ... , ..::.iii.:.F
.... .;,~., 1.. :.::.:: .. . .
.
.:,...., -, ::
~
~
. luu: -:. .. . yOtlJ c1 . ~~:r..,'..:.._-;..:,f::
,~..a.FYY
ri37-L33 Ribosas~ai Procsin L!'K6M
' 274110 275839 L~RL~~DRKLRRIN
_pn_0239 KDSSNRS10IREIIKLKSSESSONriTfIOtKRK
wf-Giueoss-6-P Dahyra9snse NFLLFVIFASAGtID!EIIOfMVVOETI~I1~ISPRTCPPCILVIFGAT0151 286036 287559 ' CPt ~CMIOKLRDFNPR 4-L'IKECRLSDDFVC'.IGFARRFJCSIt~OENK~VI0FSP5F3.DIKVconseswd hYPothetieai Dsotsin SPDICLPYMSPFIOCTVIiRLICY15FOKE8A?LPT::REPRiTKSLGStNSVIS101KINF
RLFYHRSEFDNLi'ICYTSLKDSLDLDK1IAL~POYFSRII~.NID
F
IEp ISI,fiCSRNLtI~6Vt4iGILLKIVGY11'~STNEIItDADYL.ILN1CAFUtSARDGKTIYI~NL
OQ ~SNINYLiGSGfIVI18ILSAIESRiS~ItIStIKSIfI
KHKLFYKNDODGKPIiSRVITEKP~IDLDSAKOL00CTNflVIHIDfIYLiGK>:NO
KSf:NLRDINONL1110LLC
NIL'I'fRFANt'IFESCWI'tSOYIDHV0I5LSETICICSR~IPFEVTI~~
PYITFW1DEIRKEKIKILQRISPISEGSSIVf10QY0~'VO~~~~~N~A~'~' IIPSIKIDtLRSKPL0p1L1~ERTLYf~BV
:.LTNE KEITLIAODLGDYG~LSTDRE'~OLEE~NQ''~MLYLYP~It~
KDSRVETYVALKTVINNPRWLCVPFYLRACKRLAKKSTDISIIFIOISPYIILtAA~SI~GFPGE
PLtIJGLL::AIOPOEGVALKFNGKVPCTNNIVRWI~tRYDSYIOrT'rPERYERLLCDCSNPKLLPYVDIPLOtII11 D1lILK0!'BIIfIT'SRCQIIGFLEKLMKVPQVYIRSSVIV
EEiiDODSSPSFPNYPJIGSSGPKE~DALIERDQRSWiIDF'TGEOWIDNLGTFLYSO>FN?fPAt\ELPDOIPEKVKP
SRLILILSOIOKRNV
VNASWKLFTPVL TOBEIOEL
' .
IICDRTLITGGDE
pKlOIDKLIGOLIEAVICtIYtIPE114fid.TJIRFYGpAPEVDPCI
IVNEAIa.VSHIGCI~FIE
RPL I~~WB~~
~Pn_0239 275863 276672 . ' .
se-6-P Dehyro0enase IDsvB familyt Gi 01ST 288112 281576 CPtl ueo -devB- CT111 hypocMCieai Ptacein lframle-shift KaISt'toiIGITNATLINFND'PNKLLLTKOPSLFIDI~SKDNIASANOATKDLwith 0257?t SGGKTPLETYKDIVINKDKLTDPSKIFL1WODERIJ1PITSStSNYGOANSILR~I~tIPDEATSTVCAWTZrDnOSlB
7DARSCSFRRACRFriRYWLGGVIRIPNNKFt~tdl'STDSIVINSAI
aIFR!!!:1'EIPDGAtOtYOELIFSBtIPDASFDNI1'G~GLCED.~.YIO8S0''O~~IPRLFRTSIlXIKtIGDNI
DNCfGGELLLVAYW10NPLFPDIR
SLFSNI'SATEBtl~LW N
FNSVPtIhlTF7Df1'LTFPMOOGKNVWYVOGEMOCPILKSVFFSEt.'RECKZ.YPI1~V~DIEiaIIS'fCSGTSYY
RARPIIGNLCSTIYA~~'~'~SFRVpSPBwtIATLPFV
ASPLlWI ISPESYDIMiONISSTYlOmIL
0210 277861 276698 CPtL0253 288171 287950 CPn Cf111 hypothetical Drocsin Ifratae-shift with Ot53?I
F
_ FC"f3CRT!'ISSSIPTCOIfITISIPTFVRFNIESINLTDEQKKTALTTGONIATEtIIWi~GN
No robust homoioq present: in Gsnabenk/F1~L
as of 11/7/98 LVYPNVFSPSSESWKaNSWRSN~~VSPSESTEY!!>fSETM00RVPDIESLfDVDADODLICONtttE~N~P~SGRVNL
SNSPFSYQOSIGMtRQDYI4'tIt~fl RP'fD!!8'ffGFItAAQM.GNLFNSFGILI4lCFSQCKSCOTPGC>El'SATVLCJ1TLLF1<WALIEQPOQYVPY~O
a'TN~RAALSIi~t~SGDI414GE.5lIYLGTSSIKI~I~VO
t,GpTI~,AL,VYCAY1NY'1'LCKtIIYSIliKAItAKVLRHP110ERIFNRARGVATIRSSiEGVK'~
~
I
CPIL-.0251 289368 28859.
itLYKSANIGSLWSLI11SL71LIALTAGIVLVLFFVAPGRAPVITAAM10CCA7100GAI
tDLFLTDCISH
t fItATX CT143 hypothetical Protein SLtGWIAIVNKALDIOLTN~~AVSERLLHDPSNFOATLSVIaNVRi~BJLETRDLKVLLPfTTSPCEFIVIfONIISAt xS
RPSOHYOGSSDYQHRRGINtONFI'GSHFOGOOGFAGSH
YGNLFSNEEVAOLVOGGAPGGGS IPHKTLt3tIlmONLFIDO
' SAAI>ALTFSYYRI(TOCORANLYTYYPGN
fIYC ~T~p~ntSKTDVSOTP4CNNTSDPO
:..AGYPTAPfNPSAPPPFPPPAYD
CYYVAPNL'ttZTHVAATTiKSV&RNRTPDFBAYADIEPWKLfCOVCIYf7Y11'I<u.TRYIBCQ
IATLTINFVSQiIOITLLC1'SD'~GYSSDRTSVAVTAIFSVTILVSSPIYDrPWI
CPt>
_ It~JtSLS~I~PFPSNlV6VD
No robucc hosalop Dresenc in Gsnebank/E!!8L
as of 11/7/98 :FLVKFMSA!lISLSSSHGSTASEfI'pVRDVLVSL~EfYIDREfEILPTKVFLRR01Z.SS
TAIIDDIJIDVVETBIGBHIIFOVYSNTSLR4IYORFFEKIFOICCCFLLLVTDBNIfl'DPOGA
CPel L;TCIIF1111Vh!'1'VCAIVFCPTi.CTLCYSAY1CTY0LTKKISSLSRIIi?Zi~FTNSVOKSDPFI...
AAAAS05TIKACKStFROSTGTFFVt.GLIITISLAALIVGLVFALtTLDPGAPACT12 hypothsciul Procsin V
KNNINIiFBCYFtILDSTVDCDtSaANLKTFtI~AOGISS1'CIFSIIOQItiTPKDO
i A TLLKVIN
HRSG
V8A1GLTSGTI~~ONFTEEOISIDFKfBIRLSNCALPK6DCDPVPANYVRBPY!!CS
I
VIffAANIOCCAAGG'fGILLSVIGFLtabvYSWKSODGVHIQOn'ALLRCIVSNI'IION~Y
LPITPCiI7UfVLTOSIRRYDOFFSDDEYRDIESEVPLNRQZTPPPSYE1'LFHEECSOaSSNKPT.IGDTtuNSO>eS
~T~ETTI'~NVNSTfRTIGWKQSTRIL'NC~I'AZ~T.RA
VIPRCSPPAYS1'IDSSNSPFPSSSPPPYYA I~ELypK7lNptBJNGfIOGRIYINIiDLOCVGC
1STIYSOGCYATICrLCrtTYRASVD
VAPNPNDPNRSDNYNAGI~~IGNYSFSLLYYP~C
CPt~02i2 279975 279487 No robust ttowolt>Q Dreesnc in 0256 291=82 290398 Gsnsbartk/Hlst as of 11/7/98 CP
KSLKYCSLYOFSOKPTVILN71CSIFF1(MSt]f'D,n.
YZmEPLSKKTACLWD?f4.YPVIAWCA CT111 hY9othsclcal Procsin 'NSWLLILKVLFLLLSFPFtQ.CSASSALPCERVSLGSHFKCLYGCCLPYLLitCItIVPVFCGGRIJISERA'PKTKI
SIprIVRFNIOSTNLTF~OKKT'1'FrVCCKSt l'fQ'lIVVR~LrCT
.GTAIt~FI
ISHRTSEDARLSSAIVII~APILOL71015GLIKPDachTCOSt~r:aKD:rITRErsTNSEIVeDCRLN~.sNSPLma ~ISacODTTDraaESSaKP
OEYVPIGYYKRTOIEIIR~ORARN890YVOOGSVPSCSYVPwNKFDOTS'fQICISCfEIYTDP
CPet_0243 280609 280133 E1D~TK<.VE'EVNNKVPKLFET~I~~TLLRANEY00000RINYfDLRN
No robust homolog prn~t in GensGank/EL~LBRGSSYYE'fRPI4YVCVTYYAQ~CYETFOEaRAGGCLRVSFPSwNIVIILPYVL
as of 11/7/98 iNYNIfLVFLLKFVKGRIINACSIGYItLCNANEPDRF5111SINALV11DILLYPFNAVIGWTT
FAVLltWK<.LFL71TKFLVNfCIAACKSRPLPSCKENFOCLFGPK~(PGPSDWLGCLVLIP
CPtL,0257 292136 291267 IIGTLIYSTIITYOSDZi~RLRYFIISPAYInICSTAIINWCT143 hypothetical Drocein _ GVVIBtRRM.OKTGPHASTPSINttAfNtG ~0 ~ ~~T~'N
CPn_0244 280906 281556 ADTTTSPCEFIVODCB.SAESSOFKATTLSKCLBTTSEDOODAVPKPIfN&DPQSPR011LT
' adk-Adettyiate K>,nase GAPLVTKCSVFIIMGPPGSGKC'EDSOYLANRIGLPHISTGDLLRAIIRELTPNBLKAKAYPYI
YNY1IRMLiCOAtNL~SSSQPL'NGKPIETVC~IPNPE'fYRISASAKIYDAVIII!
IIIRESr#ISGLDNPNbYWI3tIGI34KTLTGU~DTRCY~RtRTSIAV
LDKGAFItPSDFI1WEILKEKLDSOACSKGCIIDGFPRTLDQAHLLDSI~i~VNSIVYI'VIFLOFE~GIYOVrIO
TGTFTLTEIVATPPHDIfPNLFLE1TIGIDIKSMSTCVIWFPFOANFJILVD
ISFDEILKRVCSRfLGPSCSRIYM'SOGHTECPDCNVPLIRRSDDTPEIIKERLTKYOE
R'fAPVItIYYDSLGKLCRVSSENKEDLVFEDILKCIYIt CT112 hypothetical protein /frame-shift with 0259?t =Pn_0215 281627 282199 CFSFCRLCSKFEKITLOCKCAIOLLAAGTYILTPTICKRN~WERiL~3GSIRLFBt:KYTGD
ydh0-Polysaccharide Hydrolase-InvasinQMIGGSTV1STI~TAVYRDHSDIDPDPNNPSDKYEB'MFLfYRNCOHSAVIG
Repeat Family TCOKEIMCNIfL.iFSPSADFFSKOCAIETOVLfGERVI:JKGSTCYAYSOLFHNELLWKPYPNYSITLLYFAG~fV
r:115FR5TLVPCTPEFHIHPNVSWSVDAFLDPWOIPLPFGTLLtNNSQNNIFPKDIIJ4it !4'ff IWGSGTPOCDPRHLRRLNYNFFAELLIKDADf3 .IttFPYVWOGRS1MESLEKPCVOCS CPn 0259 293031 292141 CFINILYOAOCtNVPRNAADOYADCHwISSPENLPSCCLIFLYPK6EKRISHVMLKODSSCT142 hypothetieel Drocein /frame-shift vith 0259?I
TLIHASCCGKKVEYFILEpOGKFLDS1YLFFRNEbRORAFfGIPRKRKAFLI
YFYFKRKTYtNFIEM1'I'INNODMIECYFKLDSTVDCDLLASNIOTFOttOAKCISSTETF
3tI00NATFKEIfVSATCLTSASTYKLNATGPAPBSITIDNKNNRtSNWILPKNPCDPVPAN
~P
GTME'DDSSRYLPI1GDCSNYTLYOSSKAGDVPRPVDWOONSKKL
. YVRSPOYFFCAKPIE
n_ HLGLiT4PYNPLtAEPTS
rs9-S9 Ribosomal Protein VvAKSTIQESVATCRRKOAVSSVRLRPCSCKIDVNGYSFEDYFPLEIOATTILSPLKKIT
EDOSQYDLIIRV.iGCCIQGOVIATRLGLARALLKENEENRODLKSCCFLTADPRKKERKK.
CPn YGHKKARKSFOFSKR _ secA-Procetn Trensloease &ubunic AYLDFSKRSCVEEDNVSKKINRE7tK~CPCOSNKKYKOCCLKKEEOTARY7Tl~%PKPSAEV
tp =0247 283130 293969 LSASEOGEIIGONC'MtLtORISOSLTSEOKMVGKFNOITKItKEIMSKKALJIKAQAKE6KL
r11:-L1) Rihooal.~l Protein VTEKLQOfINFEIWICENWPPEIFS1'ATLNOCrNFVUEDFIPTOEDFRISENSOKPPVEE
D::YIINKILRKOTK7TiVK.iSETTKSWYVSIDMGKTtl'ALS:uEVAKILRGKHKVTYTPHVA
McaXNIVINAEKVRLTt'AKKGOKIYRYY7~CYifWF.EIPPE1d94ARKPNYTIENAIKCMMD
I ~HfIetL:KKUIJt::LR I VKGDS IETFE;eKP
ILLDI CFn 0:.'t:l 1'1A27" 3~StW t ~'fw n4>t :x4151 ~dO:Sf! ytl.~-FF-Gx.f. .utlltrt.lmilY A'rPner.
~.FIRIPFIVFN::'1'i.LIIdPMStN:KRLE:LVRKALYTIITtILANIINKIWAIJ#X:KDSLTL
Y
;'~tV/Vt4N1 .\tu' 'Tt.mcpwt.r ATF.~r:.-.
L::K?IOOQN LtJtL.KAI.
.TRC.ftWLDI.ItAVNt~Y:KY~tr.AEVNKPYLTFIt:DUt.CIPFRTIP~WAPETP
te::1Iat.I':ItVIt'/A'rP.:It::FR.~.PA<:KK::RKtL\t:tdtlIFY::RIJ1M::LLIEAKNEr:YPt:
.~.UARRRLLFOAAKER~A::AIAEY:IIIIRDULtIOTALLJit.I.IIKAKFNitLI'VIOINIIF
A
DLKNODL
:f'~ LRFFGY
' ' ' . ?dl'fUiFLIETPEEWIRKFAKtah:FARVTt'tt , IM/::(.Ii::YAKU::IJtt.lB~VFPL7WIWIA
M.LtILtF:fLDVP:~ K
iY:A::t:M:K1 Y
I::I I
:'tJl::If:l'IN::1.::IJINa'I
.nIttKAIV:F'VIy/tll~tLl.l:DVrJLIfNVltfti\LLARY.tIIwYti::PVYTRALELLGLVNLEDKV
1't,l.:::Y.I:r::l?Yn.NtVAIAItALINBI'AtLIrIDRI':aatILEET::EUttINLLLEUA::ALCGIL.
:Q
LAfQEtK::::
!V'1711/KItI\::1':::HI77lI::N:KLFI'IIN::nanVl2i.~ ='t.U'.'. :'n':w t ~
~IH!1 ~1' : NN'it1 :''1111 inllB::4rB~ t ikne AI:1.1 ItN7f:~tl.It.ll:,.
I.IFNINKEItK\t'tL'IIJ~MKALKI IL'1'NUU:I'FAKtxt;;t:t.V.':ALLFNIIt:UtYIMfVAE7U::
'I'1'.I tnYt.n 1.t m.vl tt..t.ein GiLRTLFE3V3PDLVISrINCC FTEPOAV\:LE:.RL'IiLT'::.
.iQK6YF.ELLNKiAYYKCVw"L'ECw:YC:IRNG4C:.
I "
:SP" ' ' ' ~
.. t ~iDOYVKRFIPVKYFKEQRRtIDHCIF:
X fNE:.IfiIT.
1CAS LKNNIIVARRT".tEFDAGPIROrEOII
PYAYFQPVKGWAV
..N(JV
:K.74\L..
LtSOPFP
tlNt':KN\WV:x.TICMKQALYt7raW'NALSOtNMISFFQQDKAPEZUtALVIYP iN t P YW4'IOIWOT.Pt~u .FObD(CACFLKAY'd311K'Illd(Lb ':LTt:IlJINFPT::Pf7CSSwlOL7INLVPPvIDEFFYCEPQYLGSVNKNtIYYVCKISG1I1LICAt~
?~~
PCEELAAlWIK11FI7114GFta~'LA SLt7~'~PIIK
EELACNLENfII:,IIGPIF.ipFt;aPICLM'LCEFOKTQINPFHL4LLSSEL1TKIFHIVSDEOfVMLf'MtGMAVR
FPHfxVRPhCRTARGYRGVSGIQfECOKVVSCQIVCD~SV
LIVCOpCft:KRSLVWfRETFWOGVGVASILINERNCNVIJG11IPYZDHOSILWISSQCQA
:9517 297136 IRIIIIQDVRIMCR..~l~7r:'IRLJHLKEGCALVSFtCKL~SNCJDCeILSCSEEF7CSC:YSLR
Y7i17 hytbchetlr;.tt Protein ~> .
aPRKLRVRPP::LAKYAFRGFRNSHCPRPTKFSFPLYFSKtLSWFIIGGFiJIAC~uV0- ~Jr,01'~4 ~
' .1 , \L. .
.. v:::7.:7iYA:. . ':L~!':F~~:~...-:- ! '_' .. .. ..
.'~I'.'-'FI
.
' . ~.r .. .. .
."(f.Y~:.._;,L~.:
::P1NY::.'.~Cr;;'E:f::fNDPKEKNf;.:~AtTi:.c'v:wn'fRlfRi~tSIYIi:l7lto'C,isiitlLti 6llYLtNiDEANir.lv.i :H:::
.u.Yil;
:IINKKKCYT11GOLILE~INFFIfAiiGIVY%NWHTAFYSFLTYCIATKYNDNVIInCLECCRIDVRILCCCCIVIV>
xIGRwIPIEVNERLSAKOCRlYSALCVVLTVUIAODKTOImSYKV
KSVTI:TSSPRKLGHIIlIEfLGIGLTYIHAIIiCYSCEPRNLLWWEItLCLSOWtEIVNRSOGWICVGV~CVNaLSEI
a.VA'lYP'I(DKKCYONLISKGIPITfPiQYV$VSDROOTiIVFYP
EOPSAFIAIFIfLIIEVINCRRT
DPKI8TC1'P~ISILIOIRLRGJIFLNRGZTZVFEDDAOVfSFDKVTIPYCOGIOSIyfYIH
OFB~L.!'SEPIYICCfRVRDODEIEFW1I4WNSGYBELVYSYJMtIIPI'Rp00TNL'1'GPS
CPn_OZ6< 297770 297155 TALTILVIFT1'YZKAtOHWO'a'tKL7ILTG~IRi~i.TAVISVKVPNPQI~O!'IOQI~NBDVS
ubiD-Phertylaerylau Deeasbo7cYlaseSVAQQV1AC&ILTIFFCZHPQIARNIVDIIVFVAAQARB3IAItKARB.TLRK8ALDSARLIGK
WCZSCASGYIL71VKLIKELVNAKHQVCVIISPSGRKiLYYELGCOSFDALPStEN
M
R
K
LIDCLEKOPEItCQ4IfIV~,DSAGCSJIIIpORORRFQAILPZRGKILNVCKJIRI4KIlONQE
Y ~C
LEYIHTNSIQAIFSSLASCSCPVEIiTIZIPCSllITVAAISIGL710NLLRRVADVALKaR
LMSIHL>DiLLXLSK5Gl1TIIPPNPNWYFIIPO~~L~~~IIOTIZAAIGG:ZGADIfIt4.SKZ.RYMIIINl01101 fDCSNIRTLLLTP/YIUIftALI:
PLILVPREIP
VYZAQPPLYINSKIOtDfRYILSEKDfaSYLT2lLGTNESSILFKSTCRCJICPaLCiTINV
PSDLTKOwSNPE
ILDVCSFIIITLEKKAIPPSE!'LElIYItO'..Iu"YPLYYL71P7f1~f00GRYLYSDtAtCCAL110 EC1?IKP1CIIELYIfVAYFVDI~LOLKKYCLDISSYLIPOKNEIVILRf~SP5CNY8CYTLE
=Pn_0265 298672 297730 EVINYLXNLGRKGZEIDRYKGLGL~Ii100LWD'1'l9NPOQRTLZMVSIJfaA~TaDNZPTIQ.
ubiA-Benzoate OctaphettyltransferaseNGIlYPPRREFIFSHAL4IRIFOILDI
:!IIIVRLYYFI1JLVNTlCYSIFSILFLSAS1Yf11LSINEZSpNLSFICEGFKISVFG71IAFV
FARTTGIWFIQCiDAFTDKIQffRTSKRVLPANLVSLNFAWVLSLICSFLFLFLCKZLRIF
.'4IVYPYMKRVTFFGIRJGIIGLVY1Y11ILlNlCAFAFSCLSIIRLCFLULiIGG9 aLGIASLa CPh~0276 311110 310 . CT191 hypothetical Drotem SVCNVIAaNDIIYAIEDTEFDREDGLRSVPAHYGEK101VEIAKVNLWVSYtJIYIFSGTVGI7NP'LKRKKRDGSQVO
NKRTASPIIDWWYLFOfYLQEI.QKZNiIANPN011IDAWNOVf'ItDKY
SLDKEFYFfAIIPLWILKWRMYSNYSKKDOEGLSKPFLANIAIALSFLVSNTLTWSLSKGMSpAIGFRDHILLVKVYNS
SLYALLI~'1'PONDLINSLYQVASNVpIREIQFLI~r R
C?n_0266 299181 299876 CPIL0277 312003 311104 No robust hosolop Dres~t in GensWnk/ElmL
as of 11/7/98 No robust halsoloQ Drsssnt in Genebehk/ENHLIKHLPPLIFYGYILNZIHVRAtAtGITSVQQPSTN!'OAAIPIL
as of 11/7/98 NISIFYPKYFIEGKCVL
IMALDEINNOF87PSppI115STSOTSKINODR1(TFACTVTLLWATL!(ILSDIVLLtTIGS.
NIVIOCSRZSSTYAEDIEEVAQEIILFJfSTNSKSSTSVM.WJWRVRCIfVEILCOCIVILAL
IGLSVPLSCILCTFAVTVCAVLFZ1CLTILVRKSLGIEQ10~DI3ifT.KIKTPTPPARPLIf VVVVV
F~
t GEC<
SKFSVrCSTTSIVLGNALLIG71WSVFFL'iGYLpLCLCACLVCLG'1'ALTVAGLiIRNSPRS.
C
I
YLCP
G
EITALSILpVIZKLIItCLIDVLCVCLFGLGVCWAIIG71IA
WDOCCSGSADSQSNIVGICEPKAAQDOKWY)0lAIMIG>mGZPTAIIILTPEKPIIVKTi.ISPDKPYPfVVYV
=PtL0267 300122 300910 CPtL0278 712881 312060 No robust homolog Dresent in Genshsnk/EMBLcaausrved oueer eualbrane llpopsoesin as of 11/7/98 SINSWd(TN71LLNQPEPAVCLNAWDPKYINQDRKTFACTVtLLVZATLMILT1GVIVLLR08FBfKICLSLLVCLIi~
fLSSCF(KCWIpMCIRIVJ1SPTPNAELLESL01<aAItDIGIKLKIL
VS:aGTSVITLGTJ1LFIICLVKL210f5L7~WI0YpKYFOE<fVICOKYEPFSPI7CCYRIPNRLLLDKpVDANYPWI
011FLDDECE<tIfDGIOCELWIA1MILCPOAZYSKKNS
.
SLaILKSQKKL.TZAIPVOIITILIQRALNLLF~CGLIVCKCPAMitIffAKDVC~KCiRSZNI
-PKNONVttKLTSG.PSPLDIESPSPEASTPVSIQ.RIACSGYAiVILIVTLLIGAWS~IFFC
ptaLllIGFACLGT71LFVGGLirGLRTNSLIAQGINYLYLTYYLSSALEERNEITIImQLEVSJ1PLLVGSLPDVDAA
VIPGNF'AIMNLSPKKDSLCLEDLSVSK7fIi~ILWIRSCWGS
:~GYL
, P101IKt.QKLFQSPSVQHlFDTKYFK~TIILTIEI~FIC
RNEINTYLTEfl(~tQ01dtL1DILLE
0268 30091 701318 CPn_0279 313516 312875 CPn _ Pwsibls A8C Transporter Peceease No robust homolop Dr~~t in Csnsbank/F~LProtein es of 11/7/98 xawOltSLNSQCOSSSTS"110EWNKStVPFK'R~1PTPPLSPIPSLDEFIL7IYEPtI2PKSDPEKKD~SDLIQIL.L
KETVNI'LYIIVSTAFF1SCAIGCNLGLGLf'C1'SPItBLNPIDfSLYATIS
NAQIIFtPPCI'STPNVFNCIDDtlIPLLGpPNEOFE1JIFBtPGTSCSNPTSLPAPtE~'EtNSNZLSE'LTAIPFAI
LZVILFPITRIdIVGTS1.GP'1'ASIVPLTZGAIPFWTIWD311RNiAL
QECZaGSCN~LIG
NYLC$J1VALCIPKRNILtGILLPESYPQLIFSLKSLWNLISCETL7IGlVOOOOI~pIi.I.
QYCYYRPZ<3iSVTtSVLVITLVLIESVRILfIDIWGRRVLKIfROIL
CPn_0269 302168 301176 CPIL
DipeDtidase -, VATRCVIffII7F0LCIM.LSHPNFGRImPAVRCSPEQLLSOCVRpQVCAIFVPHSRGEPNCDItdppl'CiDSDtids Transporter ATPase pFtSLifSLPNQYPDIGLLSYEEEElIGSSS010CSLSLIRSIEN1l5J1LCODTAPf.C'ILI31KLIKCGWLVSEpH
SPIISVQt7VSKKLGDILILISIfVSPSVYF'CEVFGIVCH9I'~GK1TLLRC
IHLTKOGPIJ1YLGIVWIOGDNRpGOCfF.APIDtLBNDGKVLLDINYELCVPIDLSHCSCKL.ALDPLDNPTSGSISV
AGTDNSLPTQKFSR11NFSKI(VAYISONYGLFSSKl'VFCFIIAYILItI
EDIGDYT11DKLPF7LiIVIII.f' NSTIPRSVGDHRANLVDAHAKiZVRRIOCVIGLNGVRSYtICDSHHSCISKSEYCEOVYI7I'I14FLNLYNRNDJ1YP
GNL.SGGOKQKVAZAR71IVCQPLYVf~GD<I
IGDLEKNVLtIAENiGILSSIVLGSOFPYANFaENIFFFtF.CSSAffJWPVW~OLIHRIFSItGT511LDPKSTENII
ERLLQI11QERGITLVLVSHEID'WfOCZCSHVLVl4fpCAVCELGTIEE
KAESILSSRAOSFLKQ11IVEQVNPKITDVKf.
LFIi4SFiISITNEL!'HEDZNIAJII3SCYFAEDREEYf.RilIPSKELAIQCI
ISKVIQ'fla.VS
INIL~FIINLPAKSPFFGFLI IVLOCEYD~tKKAKELLIE1.G17VIKTPIf CPtL0270 303313 302168 CPn yvlC-SuAS SupatEamily-related Protein_ SIFGVIVPDKIUQITFSLPEVMSAINQGKNALPTD1'VYGFVLSLY11SE71EERLYALKDR-dhnA-Predicted i.6-fructose BiphosDhate Aldolase Idlhydsin EPSIGFALYVNSIEDIENISGYPLSPTAKKLAOLFPGAITLWKfiPNPRFPKLTLiLFAIVEamily7' DNS~ft7REIVINtCGTLZGTSANLSEFPSJ1LTAQEIFADFADHDLCIFDCPCSHGLESTVIIAISLRRHTWtrIIHD
ILGNDt>E2tLL5YQCKNITtmKLTLPSNDFYDKVFCLSDRFB1RVLRS
SDPLYIYREGLISRSVIENI71G'fEIUCIFHRTSHAFSKHIKIYTSIIO~OEQLVSFLSGSLDFLOTNFSI~ftLANS
GYLSILPVDpCIEHSAGASFAINPIYTDPF11IVKWIESOCSAVAST
KCWCENPKPIOIFYTRLREALKKKTPSIVPIYDINt'SDYP>r3.fPFLSPYYIEYCTLSLLSRKYAHKIfFNLKLNHN
ELISYPPKYHpIFFT0VE7N1YSNCAVAVCATWIGS
ETSNEEIVAVSNAFAKAPSLCL71TVLWCYLRNPAFVAFICIfDYfffAADLTGOADHLWTLG
CPn_0271 303628 301362 ADIVKQKLPTCQOCFKAIFtFGICIDIItVYSELSSNHPZDLCRYOVINSYCGKVCLZNSGCP
LysophosDtsolipasa esterase SGKNDFTfAARTAVINKRACQIGLILGRKAFORPLSFGIGLLNLVQDIYLDPNITIA
KLIifDYSFFRRKICNIPJ1IECPCNPQDPIIILCF~YGSL110NLTFFPSICSFSKLRP'lWI
FPNGILPLENDFRGSRACFPLNVLLLpELSRLYAF1GVI07LQEKYDELFDVDLETPKFALECPnr0282 3160A1 ELILNf.HRPYNEIZICGFSOGAIIrITHLVLTSQNPYAGALIFAGARLFNQt2rlEOGLKQCAxasA/gadC-Mino Acid Transporesr QVPP'LQSHCYEDEILPYHLG1W
LNDLLLTKCI4CpFVSf'H~HEIPSWFQIO~VTVPNWIILILQSLNFSKKVETMSHSKP1'KPLCTFT~IiLSLJIW
I:at.RNLPLTAK11DLSTLPPYCL
DPARG
AVICPFIIPYALISAELASFKPpCIYIWlIRDALCKWWGFFAIWNOWFiUlflyIYPAV4AFIA
STIVYKINPELAHNKIYIATVIIJIDFWILTFFNFLGITSs~ALfSSINLIC~LIPCVIW
CPn_0272 305272 301340 SLA4lWIFSGNPIAISLa~FtLLPNFSMISSL
DNVNPRK
dnaK-I711A Pol tII Gases and Tau NYPKAVFICAIA'tLT:L'JLv'SLSIAIVIPKEEISLVSGL'(K'EiTLf'IDKYNL9WIfEGIW
FNRQSI7AT'IATyNMHLEEENQGWFrILLRKVYHQEVPPAILLHGPTLPVLQDKAEOLASEIVMTIAGSIGEit~AWM
FAGTKGLFISTONDCLPRLFKIMt3KNVP'INIJa.PQGIWTIPTL
LLS.iSPCSEHKVSQKIHPDIYQFFPEGKGRLHSjDLPRCIKKQIYISPFPJvNYKIYIIHELFLCLDSADLVYWILT.
1LSVOHYW1NYICLFLAGPILRIKEPRApRLYSVPCKFIGICEF1 ADRNTIJWtSAFLKVFEEPPKHAVIILTTAKVpRLpKTIISRSLSIFIERCEKILCSKETSIU'..ILSCAFALWVaFL
PPRELAQISFX'SKICY7TFLLLAFSLNCLIPFU'IYFTNKRLSK
FSYLPRYAQCEIPVTEVSQIIKESSEl'DKQVLRDKVQRFNEVLLELYRDRY'fW9tGLIUSKS
.\L.NYPEMIKEILQLPLLPLOKVLLIVESACRSIaBJSSSAASVLEWVAIQLVSi7pYKEKEL
vsVSP~COCL.SF7 cPn_0293 318581 317532 Nn cobuac homolaa Dresenc in Genebank/ENBL
es of 11(7/?R
'.Pn_U27f 305853 305227 c:RRL:fFODLIKNAV1KIISFRKSPPNPVItLLIKFAKKGLFlJSSIAPLYEVLLEILF31PG
rdk-Thylsiriyluca Kinase EEILEVLFSLDPNwWtBNLDPKKHSTtGIEIS~aETAETIESCSIGLISINLLLSGLCLRS
.'.aVFt'/IDOCEG:xK:SLAKALGDOLVAQDRIfVLLTREPrY:CLICERLRDLILEPPHLE.~>NDRrQAVKIIQO
Fti'QFS.'EEVQNFVEQRNILTPFWIHLFECDEVALLWQW:LRLOLIV
L~F.CCELFLFIG~RIIpHIQEVIIPALROCYIVICERFHD?fI'IIQCIAEGLGALIFIIADLCPNALYPEPDC:;CW
<,kaNSEItAKt711E00QEDFlIKTKFA(:Y.Ef:LKKLVLPAL~ITSIPQLL
.:Y.VVI:PTFFLPNFVLLLDIPADIGi.QRKHRQKVFDKFEKKPLYNNRIRFf..Ft.iLASADPRARRFf!Q!'w\E
IW~L\IMtKKNKQNPFIFLEALLE::EEF::t.'.'X:KYWIW~1I'IIIL.WOKLWIA
at'(LVLGAPE:U1::L IDK1INLNt'OIw.LCTI Y U:YF71V L ICps :r I ETF~'RRIWIJdPEAFQM
IQOr:f L4:FLFfKNLLD
:an:R;7A 7UN1FR 305952 t.'Pn t72i,4 IInU'i1 IIHSSI
IyrA fAIA ::yr..::.. ::utfunlC N., rr.t.:::a (HNNILW I'r'::unr n :n ::.,nvGW k/F:YGL .u: ..1 ll/'1/rR
:::I'11*tTIKDEIIVFKNLEEFJIKE:u'YLRY:.M.iVII:PALPDIRIX:I.Kf::QRRVLYANKQLfLIMFIIf A('\WfV114?ft1'NNf::::'(r:IL:LK::::LitfIT'I:.IWIUUA'fLdI::VLYFnt:II::
:I~:IV:AI!IIRXti\YIY:DT.'.:I;DYHF1K:E..'VIYPTLVPNAQF1WANRYPLVDCxXINFCSIDIiDV:."
PI'/f(:MLIfL::Vu'::1'la't:lYl~F"(:yJ':::IFKTFVF::IT.':I::VFI'::I1EIU.N(d.L:REF~
::
I'fnANR'ffEAkL'rIC:nNYLMEDLDKDTVDIVPNYDETKHEPVVFPSKFPNLLf:NGSSf:IAV::AfOELI.KNF
PAf71'fItRPItHI.fI::fIFLOFJJ.ftfiIRf:FEED~H'f::Y.lt.
'r:NA'ft7I1141NU:KLIEA'fLLLWNPQASVDEIWVNNIPDFt~IY:CIf~.0 Lpl ICCCEGIRSAYTT
:Rr:KIK'/PAkWIVRFNEDK11R&:IIiTF?1PYNVNK::PLIEQIANLVNRKTIJICI3I>1IROEW rsniH'.
'..m.ul slm'.I
::lrYff:IfNLEIKKr:Fw::EIIINRLYKPTDVQV'fPY:AtMWLINtNLPRTN32HRMI:iAWINm rmtmrr lu~nul.vl fa..,:~m in ym;t.mk/Falf'1. .n:: .n Il/'I/:N
ItIIHYEVIPPf!TRYF.tI7KAETRAHVLECYLKAL:xf.DALVXTfPE~J.TJKEH/ULERIIESFCK4LFT7LFF' P'fJUJK6fT1::111?LIYIvtY,::F::I::f~ITIV:LIAI::V1.1.1.la:VVF'ALVt:'IiVI.
MPIGLL'JW~AASVCS741AIVStJICLYKOGKPVeAttPOependwnr Prac~ W n~xy Reaul.yrL'T\' SNEEKIDPT%DLEIKOPESLKPV .:nIL%1.~.1:
tRNFIIQJLIDMFLLKK'.'IITVSLDNDL:.L'::ADKt~::FKI'C~tVF::::.GIriFSfYI:
tNt~QSLPKERKTI3lItAIfIPSIVWDntPYVIQSrFYHGNKVYSKPIAEpIpSLEIfEITVECYITI3KCKL4PLNR
.IIR~iDCFf'!EKP~iYJn4l::w\~flplW~li~Yy~firf~t EEYA
TLtYDFPRALEE.~SaKS".rGSLLRCVI3EIKNLFLPRFL.iRKVKYSL:ACLRRLGS1ViFLELYAItt~IIIFIIE
W .....: , s SSA.LILLLTKPEPIJ~tM'OOLIJWLNSLKTEKIUILTPIMDKLVISINFNFY41~ISLiE.
tEKIVAYDPM.LTDELIWiLFJICittIVOFLLS!'OSSO10REFRALFP~OELPSAKDUSrf 777866 ?37627 YVPAINSSEYNIfDPKDI-SVLIIl4,"LSERLU'CEKIPSPSSNttPTSSVASHYImFSLL!'fFF'CPIt~0295 'yl ~'~rmer Protein ecDP-At SNppSVILONPFLLtELWENPKGOTFGKCLLEKANPNSNWAAL.FKPNLNCNtISCIJ1NK.
ANSLEDDVtAI~JEvL.."'VOPKEVNFJISSFIEDLNAD~LCLTE:.IM:LEEKFAFEISE~J1 KELICITAEHW. PFKETTpJIIASCKILDLLLONLPDFy.. r7..-..,. . ...y., rrT.~.
. ..,.
... .~,~,~~ .~: t~ , . :". .~; 3a~ :
O mi:'W W':--mgtE-Ng Tcansporcer tCBS Lk7lCiihl CPn_U
AFTCLSIDZHSH av CT296 hypocMCrea: Drocein SCRFSKGKINVGEpNIUtECKLOiAFSSGItJ~SRTSNt~DELSFKLFXKIPIIGIfICNDITLVCNRYIVTCGSROIG
Il..IVKLFLFIJfiADVEINCWEtR00AVIL5I.
DLSICIVIEYNPI~J1YAVSCLPSESRAILYIU~ILSCITAKVAFIINTDSASRWAIFRRLSD
SEVCALIE(xiPPDPJ1VWVLDDZPDRAYRRILELIDSIOtALICZR~.010K'rRNI'kRLNTNETOLOGCVSFJ1RV
CVSNNOCVKDCVOKFLDIUWKIDILVNNACITRpPLLIaIIdC~QSV
V
V
"
I
' FFAf(XETrVKDV$11CIRSNPGIDLTRLVFVLDFKCELOIW'IDRSLIINPPO(SL10QIHASI
AKIwSA
.7pl~lYAAARAGIIAFTRStJ
SSVIRlOIIKARSCSIIN
ISTeR.TSLYYK
' ' ' NpIOOLVLPDIATRECWDLYERYKIJ1J1LPWDCaIFLIGAM'Y~Z~ZJ1DE1'IARSVINONLKAt3~ILKSIPIirR
AGTPm V71RVALfLiIfOL
tt7tfl ItEIfAAPNIRVNCLAPCFIE
tiACITfDUCYQTClIWOR!'LhRAPWLLYILFA~ChZSASYNAYFOKISPAtl.ALIIFFIFLSSYIft110TL.WD~
:LTY
INGItBCItN4'VpCSTILVRBNATCCLSFGRPRETIFK~1SIGLLICVSII&IIICGLWYIJIr FLCiZiIISGOGIOLGVTIIAI'CVIG71SL?ATl'LCYLSPFFFAKI.L1IDPALA90PIYI71t1fabD-Halortyl Acyl Grrier Transeyclase IMSltIIFFLIACCINFLPFN
SHSIt~Nt?OUtRYAELFP00GS0YVCHUpDLYNEYPEVRELFDPANN<<RIRLCFSLTSZI~E
GPEaiJ~IETVHSOLAIYLNSNAWKVL50RSSI0PSLVSGLSLOLYTIIi.VIIS~IISVLDG
CPn _ LELVR>~OLl9~tEACNpSPGAHAALLGLPSEVIEENITSLGOCIWIAHYNAPItOLWAGI
No iooust homolog Dtesenc in Gatebank/E!!BL
as of 11/7/98 RACIIIRSPLPFISSKFAtidQCLODEFSCPEDWDFLFSEIELLA90DEPSt~YiJILSRSAEkYDOAIELFRMACKRA
VRt.IIVSGATMTPLNQY1100GLAPDIYJIt.QIIU7SSLPWSI1V
LLMIlI'tIMiPKVIfKRVIFYGVSYCLIWESMSIFIDVLTYIDFLFEKLGISASDRi.SLCSARVGKSLVNTE~IECL
APC~SPTtWYQSCYHIESEVDEFLELCPGKVLAGINRSIUISItP
TCINFILYSpIGD9ffLSEWDNFRLIE0LL1U01P0L1UaRi~fOIFRZGAIWEEVSLVASITSIGTF710IEKFLSEV
ASVYpAVCRSFIELYtIIWLEISDLAL'GIOtCtJIt.ALDLSPttIAItIHADYAIOGLVYIGTRQG
KSLLIERGIIENFSIfAZFiSFSRDGTrTL11Y0NYRYliYALA'9VKLFDLTY10CEIIFOQANIIfabN-OxoacYl Carrier Protein Synchase IIZ
YQTVpAFPNLSCttMVWGELLIRSGWWSNMCYIEVOLEKLASI4KKTNDPIA4SCLI'ATYTSFFLYIMtfSVNIO'pI
KMIWATOSYLPEKVLSN71DLF1Q4VDTSDEWIVTRZCZK>CRRIA
f'KDSRHRLISANRTFPGtISJILVHAI~IVQLCSJILYIttEDSHPASAI
tU'M
.
GPQEYTSZJ1GAIAAIEINiAGLSEDOIDCIIFSTAAPDYIFPSSGAirIQIINLGItOVPT
GIAILCLYL ' SCFQSCLESiDLDA~~WU.FDAYFSwCIIUUtSNtLd.RIfAVDVASRLCSiJtPPJIILFWSD' ' RGLALKCLAFJ1TIDGAYKEIFLSLSLLJnORANDLSGRLEILELWGOSHYLLAQ-0OSLPGDOQARCV
WIIFI
C11G!
FDOOAAC11CYLYCLSVARAY~S~fONLLIAADIUSSFVDK1 IUESRPUSLEINRLSLGADUIdUELiSLPJYa'sSRCPA~SKLTipSCIUI1IAMEGRtYFRHA
HYDEAYTLLTKVDLTLSSSRVKLILAAVLLG1(GRLL.pIri'DFAEFJWEZLCFLY6YYLEDEVRRI~TAARHSIALA
GIOEEDIDWFVPtIQANERIIOALaKRFEIDESRV!'KSVNKY17N!'A
TSLGCPEAYYTIGKFYAVIImNNIUWG ASSVGZALOFl.VHTESIHI
DDYLLLVAFt~IGULSWG11WL1(QV
<N
HVIRSAQYGVRITEAiIWWDPYLJ1NLREIHAFRL1NFN010GRL1iiGNKTm90 0288 325785 724571 CPn_0299 716726 777115 CPn _ recR-Recaabination Psoteih CT288 hypoducieal Drotsin tt100<Z.VYYSESLY$MM.UPRPECIUiICIHITNTRYPDYLSiILIFFLR>Q.PGIGFKTAAC.A
ISITIREFLFFCFECRAKFYNVIMSCFNLTSTHFSLRPISPKASFPIODaiOSYlRSALRK
HRSOTLSVSYCKVNKYDANLFVRLTVIALAVVGVLILFSItd.ASIQGTLVZTSWPLVTAAFELISWDSEOLKILLi'I
APHIIVASEpSttCPLCFTLKESKEADCHFCItEptlfipSLCIVASP
ILIPIZLLTOGMfILfRhGEXVDVISGVCZPPFSRAGWVPISSSIftLDCFDEKIiIfSACSYKDVFFLQISKVFKGRY
LDISTL.SAOUSCZJU1VYQCPPLLFR11FPCFGIPCANPFVALLPNIYNLZRFLWPPYIIFGDATALIIJtOEL~!'S
VNISRL1IGLPIGLSFDYVDS4TLARAf9GRH8Y
RNIYEHFFCIUD:P>~DRFIYItDVARtcIGRSLJIAFLtIAPFYJI6aC'IIQJIFYSLLDPLiICRV
tiIGSVERDtitIOl~iVZLARS1ISLAtIF~WSLFRFEOGOGR10GIGQHAFYLHLCCpPOSVfLFD
KGEIVSGAttPSIOLPERRCLDTSCRYPHZSVIPDS~iD&AIUIFIV
CP(>_0289 725797 726996 C-1'2A9 Hypothetical Drot:ein NFtdltl'BfKpRSHYKKNNLLLLLSILVGLGLGSVOSPNIVYSAECIANl'FLKFI~Li.SIPL
VFCA1GSTITSIOFtFNflNTLGIUtILYYTLLTTVI11J1SIGLLLFFLLRPONI'1'~ALAT'1' TKCNPLCYLtNLSDTLPtNIFRPFipGNVZSAACL.IwVt.LCSASLFI.Q~~FtfIS
rFSIFUa.~ocLxLLFIAfa,crsYItFxFS.IIDOSNtTIaAaxtscvloun.AOCFIYLP
ILLKINKVSPLKVJ11UNSPALVTAFFSKSSA11TLPLTMELAtODLKIN%NLSRFSFPLCS
VINtIIGCMFILITVLFVATSNI?IIISPIJISI4WIFIATLIIAItRIAGVPlIGCYFLTLSLL
T5181VPLSILGLILPFYTVIIMZlTSLHVWSDCCWSLAN
trtTAEUVPVSERFFt.CCETIVRCIfKSFIICPKYSATFPQOGLSSLLISEEIpYILIBpPl1 CPn _ ISAFYiLDSGFVCLOEYItISLKDLRSSAGFCLRFDVL~OJMfPVHtGFGWPIItPT~ILI~K
Na-dapeldenc Transporter RSALTHNKKHASFSSRLCFIFSNIGIAVGAGtiIWRFPRVM~JOGCAFLILWICFLFLWSIDNSORltFALOQip IPLIIIEISIGKLTKKAPIGJVLIXTAGiUCFAWAGCFZTLVTTCILAYYSTIVCtIICLSYTY
YAVSGKIHIL~DFAXLWTSHYOSSIPLWAHLTSLGLAYLVIRKGIVtIGIEK(ZIKILIPAFCPlL0J01 710167 FLCTIaLLLRAYTLPCAVOGIKOLFSCLnCSCISNYKVWIEJILTp~111WDTf'JV~GLLLYYAfOapIhLike Oucsr Haebrane Proteihl GFASKK1'L1VSNCALTAIGNNLVSLINGIZIFSTCASLDItGI'rpLODDAGI1SSIGITPIIKtX.SKEIF11VFRI
IGFWYPFSIPIU.VQVIt9(%LLFS2FLLVLGS'fSAAHANI.CYVt~It~lC
YLPELFTRLPtxIYLTTLFSSIFFL11FSMJ1ALSSNISHLFLLSpTLAEFGIKPYISEfLALEESDLCKKETEELEAH
KCOFV1UIAEEZZEELTSIYNKt.pDEDYMESLSDSAStG.RIUCF
TI:AFVLGIPSALSLTFFSNpDIVwCVJILIVNGL.IFIYJU1LVYCFPKLKIfEVINAAPGDLEDLSCEYHJ1YOSOY
YOSIt~SNVIfRIQKLIQEVKIAAESVRSK8KLF31IWEGVGAIAP
AWIGFDYIIXYLLPIEGILLL.CiWYFYDCLFPFJ~>GQwWtIPISLYSLCSLVLQWSLCLIILCl'DKTTEZIAII1J
ESFItItON
wxFNKOLYLAFSRYNIiEIL
CPn_0J02 710766 311866 CPn lpxD-UDP Clueosamsne N-Aeylcransterase _ SKFI~FSNSFJtPVYTLKOLAELLQ~00NIETPISGVEDIS0A0PHNI11FGDNEKYSSF
ine8-Inclusion Hembtane Protein B
EKHMSAPIPTPQELSDOITCLNVipYCQYSELARENKCDIECLKTLTAALTADAGIOPSADLKNTKAGAIILSRSQAII
QHAHLKIOJFI.ITNFSPSLTFOKCZELFIEPVT90FPUIHPTAV
EIYSLQ'fJIAALILSASEKPCSCPSGSTECSVTVQSPC%FKKVIJ1WLT:IALIAIAVLIAIHPl7IRIElUVV'!'I
EPYWISpNJItIIGSOTYIGAGSVIGAHSVIGANCLINPKWIRERVL
CIIAACGGFPLLLSaLNLYTICACVSLPII11S'1'SVALICLLTFV1WSLIKPVITVRTfRtGNItWVpPUAYIGSCC
FGYITNAlCNNKPLKHLGYVIVCDDVEIGAM'1'IDRCRFI4PlV
LN>~1'KIDNOtfOVAHNVEICKHSIZVAOAGIACSTKICEHVI
IC1C01'UITCHISIADNVI
CPn_0292 329201 729836 MIA4TCVTKSITSPCIYCWPARPYpE,TIIRLIAKIRNLPKTEERLSKLtIOQVItDLSTPSL
ind-Inclusion Membrane Protean AEIPSEI
C
VKNfl07SDFM1'SPIPPQSSCDASFtJIEOPQQLPSTSESQLVTOLLTMMKHTGALSTVLQ
fJORDRLPTASIILOVCCAP't'OCACJ1PFOPGPADDHHNPIPPPWPApIETEITTIRSELOCPn_0307 )12982 tMRSTLCQSTKGAR10VLWTAILMTISLLAIIIIILAVLGFIGVL.PQVALt.NOCETNLICT303 tWpochaticel Drotein wANVSCSIICFIALIC'tt.CLILTNIUrI'PLPASREOKCLHHNDVSRKINRtITOFYVDSIDCVIKNFDHKPSEDIt sRDtiEELEEKLLTITKRIY
pSApEFQNRItTDSKNYYLKKTOWLPFKNEELEOTKELFANLTStIDIfKIAOLFFYSPOCSS
CPn DWVEFTEVICNLNOSICLGGVLIxCCLFE00CEHVVTVNKKLDLPLLLICTtVVNSLRYYL
_ TYRNISLLNCO~HSELOKELCDVLKQHCVAFTLIFKEIVDIDLLNYVKLIOGLKRSGNIO
CT271 hypochecicel protein VWSNpNVLRLLFNLHHGEEKRAFLFFLIGLVWCICCYCI'LSLAECLFIEKLCSAELPKIYARIYONDVP1'LPSVSSS
PIALRYSLAM'IRCLAt.NVOFSSLKFISPSIL~fENTAKALN
LCGSLILCVLSSLILYNLFKKHISATJ1LF'LIPVSLSILCNFYLtLSSIFAIOPPRSPLFF:,f',CECFIFSNLDEF
Nt~IIKIVtIpLLR'IICKLSPEIWKNIMKILNIKRRVRSLYI
YRiVIWSLTILSYTSFWGf'VDpFPNLODGKRHFCIFNAIIFLCDnIICSv'IIASLVN7IGI
OCILILFTAALVLTFPIVFriSKSLKSISDDHDLFIVK:HPPPLS%ALKLCFYDKYTfYL~Pn 0)04 1.11091 )AI15%1 G:F'tFLtpLLAIATEFNYLKIFEIOFASKEEFELVAHICKCSLWISIGtQICFJ1LFAYSRIpdtlA/t~pA-PYruvar r Wthydrory!n.m., AlDh.t VKRLGYtJNIILFAFLwFLSLFLFWTFK'l'l'LSIAVIJItNVREGV'I'YALDDNNLOLLIYCVPDQKPLPKRLF'Y
%KVMD.~.SAPYNIA::yaEK.~TVFRtLDLYCPA:x'.IKFLKONVGIREFEA
NKIRIIrJIRIWESFIEf'IC:NLVWGLICFL:iSpQWFCLtISLtATILVVLVR~fYAKAILRCEEAYLECLVCI7FY
1L:YAWEAVATMIAN'I'~:LDPWVF.~.;YRt~lfAtltLWIPLDCIM
KN4:ApALpLTRSNC~C~WIK:.K1VKQKRQVELFLLAfILKHPSERHt?TFAFQHLWWSRSVRLII:KE't''l'.At s:RtY:::NHtkic:PtIFHN:FtaVtIYJIPLAA(:AAFT(KY(~KNRV3LCFIC
LP::LLAIN9JKL.iLFN%LKTIF?NIf3SLWAKDFLTLELLKRWTSIFPHFAIIuAIHLYFAEIY7AVA:~:vF'IIt TWFV::L11QL1IatLtIFlJNdd::W?.~,LNRAVAKQIyIAE:.'~I:.~...1'DIRAV
IIDLIJIITIIIAEOLYDT~rt:DRLLMILTVRRUEAYf:PYRDLADKRLKELLNSC/~PEDIVNC'fVN:F'fO.F'N
:'Id.:FHhAYRYM/UfF-':IVt.Vtx.'Ia:::NFRC:It::l::Dl'tll.YN::%RYlIUCLfKIf LTLI.Y.LEKNPONFPiLLDFLNTKNEDtLIV'I't'.KAUIT:SVRANIIKt'YCf'ELLKRLROCSHNUI'tVI.AY
1MLIRLFIfI.1'EF.t:FqFIIRQIY:KTAV1.FJ1F::NAKL::::D1'::YITf.EFI:VYA
f>F.A:X)'f l.I.KT I:: I AL01::F'VKOLf~'I':NI.KNT::R%YAF:AM
A:t:LOKEVaFAFLOVLTDE
:rIiNRChILMML~KI!lNWLLKKIIAYKiVKC%A:;KALFY~YIk:IIYIQKK'ftTINL.:LW~:Pnyn'.
LL11A2 iA~.I :'I
tlflJ7:.T!'YfAEVNFIII::LItat.GSMEHSC1/LIRAL'1";:KNDKIKk~ALFSLEKtF:DSHLF3LtIhli/
rlNIt.lYmv.W OuhylnrpHay::., pr..
L!1F~/tKJt~iII:Y:.EKYYFKC(.1/IPLTLKELWlWI~7:P::::W%LTAQI?WCEEU'YCDFDFNKt"'...Mh YIIK'PIk:INE\LRF\ILRE7A::NGI'NV"IfI:EF:Jt:D'IIY:A1'KVT'Yt:LLIIKYXiPKRV
U::VFRI'fWQKHEDYR'PEE::L'fLl:a'L.1ItUAPI:a:MF'::i:k:tt:AAl.:a:l.l~t'I
IF:YM::YrtIY::F1A11Nl t::IIAAKNIItfflYt:KP::VPI
V F'MTffI:MA1?Y:a:(%I::Ik'VF_':L
lMt t tc A. t 1 1 AI':all'YMYr:I.LK::A
1 HNNNINLPl.EN
ilyn ~l'L'~4 1 f 11177 s 1702 .
F:I.t.YHLYt:EVI'1'EI?YLVPIy:KAIIftYfjFJ:NIU:fI
1'I"/::IMV::I'PKFUIy'::LAKKRWf:LaIEI
"'VRffC:iRCIVIEEC:NYFn oEiIALITEINF05LDAPPLJIVCK'CPI'CKRVIL:KIVKLLP1T'N
(EEGiDC:LIHI:IGISI~tVIDdIVDP.~.IYJNKC:OIVEAIV
LDLRTIKf'LUI~fIL
i . IKNLTNYC:AFVELLPGI6G
.n L1N
V HVNAE
(IEEKYPICL
tlpwOF
ICKDOCKISLGLKOTER
t-:
MP _ ~KETPIrPY:IKtLEOATLPNVNRILCfIEK _ _ _ _ _ KL
P'II ?~C181r~11AV~L~I
VI".1GV1ITK TATQATL.~GiLIIVSALSL1K
CPn_03ne KKVSLSVKEYf.IDNAYDODSKTE4DFK0~CPKERKKKCK
ptltlC-DiMdcol ~poare>,de Aeetyltcansferase .~.KFVI9LLKNPKLSPCNEV ,~T.VKAOIKK.iN00VSlGOVIVEISTDKAiLEIfI'ANEDf~VIR
A ~Pn 0316 359794 3e0121 EILRHECEKtVt4RPIAVL:.'TEANEPFNLEELLPKTEPSN<.F31SPKCSsLLVSPATTPOin A
P
;ATFTAVTFKPEPPL;::PLVFKlIIICT1'!I(ILaPLAROLAKEKNiLIVSSIOCSCPOatIVKroea A nusA-N Ueilixaevon r ~
_...
...
.r-~
r .
M
' ' ~
~
' , "
...,.;..,;-.r,r-i . I..- ;.:\.\Ff!.Iv'.Y.\
...,It,,-;~,m.v../'.' Y, .Y
a , : : :
HF .y.w:
I21 _ , ~ ~
~ v .
~ :\LF:fAA!
YT.
?.
.
.L
:
,v\
.
..
. ". N..F:YF..., ...1~"f;F;.
.... ;~Y.\REYI;i-W':l:r:YMDVPF'.':a:Nf'f7R:\>:i . !:C'4":f':
. w _:
r.,-r.
.,.:... ..~
.
rt, a.;
w ra:..L,..,t :r.r:::r~.f.
;~:-.. ' v:rl ~
r ~ri r L
AAfWttvNYwR141EitLVI'CLtsHtIxVNt:1.w.'YYKHF'NLwiNLILDLGK'lFrli:.PTRFiF
' , , ..
..~:.
.
.
.
.I.
:z:,w :
7lI:iAEIKSW.KaP7N:iLCDTEYK(M:F~JSNLCMTCIT
.rAIPDGIITPtIRCAURKNtl . GAEVIuRSNAEFVKOLFIaEVPELEECSVLIVAIA
EFTAIVNPPCAAILAVCSVTF.OIILVLOCLITICSICNLTL5VD11RVIDCYPAANFNKItLQKTEKHKIGDKIYALL
YE1NESENL.
' RffAGIIRTKiJWRSSD%ITDPVCAFVC~CSRVKNI
IRELNDEKIDIVNIfSPVSI
LLL~IL
KILLAPAVLIlX
LYPILIOKIAILLDDKVIAItNN001DYATYICIO)UiINARLISHILDYLLCVpRNitYtIIL
CPtL0307 31199A 316515 LEIOAi.CLAEFDSPNLOCPLEi4Df3IS1(LVICNGEHACY~i'IARVLLASJINphASVICISL
41QP-Glycogen PhosprorYlase ELAYKILLQVSKYCESXVDLICPLIED
NGCIVBflFBSFDKNKVSVDSH~AILDRLYLSWQSPLSA6PRDIFTAVIUCIVI~tLIUIG
wLKTQNGYytO'Ipv><RVYYLSML.,PttGRSLKSNLti~GILt#.yRKJILlf1'LNYDFONLVA(ECPr1_0317 St7AGLC~CiGRLAACYLDSNATU1VPAYCYCIRYDYGIFD~tIHrCYp~IPI>EWGAYGinfl-Initiation Faecor-2 ' ' NPWLICRGEYLYPVRFYCRVINY?DSRCKOVADLVD'1'OtViJIIIAYDIPIPCYGNQ1YNSL1 lOJLKLKIKNACLTK7iAGLDKLKQKLApAGiSFaKSSSLIIPS
SLLIASLSKSANMCIfVIQ.
RLW(MpSPRCFEF5YFN1K.TIYICAILDL11LILNISRVLYPNDSITLGOLLRLKOtYFiNSAKDfSVKVAi.i~ATS
TPTASAE0A5PLSTSRRIRAK)IRSSFSSSEEESSAIITPVDfSLPAP
ATTUDIIRRYTKTHICLONLADKVWOLNO'1'NPALGLAFJIItILVDRL&LPWDKAWFJIrI'VSIJ1DPEPLLEVVD
EVCDLSPEVIIPVAtVLpEQPVLPETPPCEKELLP1IP11ItPALIAIVV
VIFNYIFRITILPEALERWP4DLFSKLLPRNLLIIYLINSRirG.LKVCSAYPKIFD0101RSLSNIItSKfCPaGIOf INIR.LAKTPK11PAKC0NVAGSK$lxPVAS~CPGKPC'1'SLUGWIIitL
.IVEOGYipKRINIfANLAWGSAKVIGVSSFHSCLIKLriT.FKtfYEFIPGffIINTNGVZ'PRKQFNPANItSPASG
RWIALCIIPRLSKLIXETICDRYII.SLIRSFA~SCFRL11ROLGIrKLttIOC~LTSRIRVYILPKKtiYDGSIORPI
HIKISLPITVItDLAALl9CLKASEVIOKLFIIK?fi'1WNDILO
YNEYCEIVDPNSLTDCHIKRINEYKROLM1ILRVIYVYNOLKLNPNOtHVp'ZVIFBGKASETAVpFICLSFCCTIDID
YSEpDIfLCLSNDTVRDEIpSTDPSKLVIRSPIVAfIOINdI
lGAI'i'QIDICAFCCSTPVGDITILDTPCIIFJ1F811lIMAOAM
GrtEASGIGt~IIKFAIlrGALTIC:'I~DANILNALNIGKPi~tf'IFCLLmOIVOLRREIfCPOTDIWLWIIGDnGI
KmlLFaIENAKAADIAIWAINKCDKpNFNSETIYRQLItLIfi.PC
' ICDKNPKIROVLDLLEQCFFNSNDKDLFKpIVNRLLNFIiDPFFVIJ1DLESYIlUIlILNVNK.LSFLL~Q.ALW1EV
LLLKADPSARARaLVILSfiJO~LOPVA
AHCCS'CVTVNTSJIIITC1L
LPKEPDSWfKISIYNfAfiKsFFSSDRAIQDYARDIWNVPTKSCSGmIITVLIONGSLKLCGLVFIiDGYGKVKTNHNE
IIrIaJICLAOPSIPVLI1GGSDIPAI~DPFF
W104BKTARDI IIJUtSAGOQRFAiAQIGDiPNF0.RM.ONKkTLKLLIKADVOCSI6AWtS
ISKIA30tYDVEILTNSVCEISESDIRLAAASKJIVLIGFH1ICI~lIALPLIItiiaVAV6L
CPn _ FTVIYHAIDAILLIMfSLLDPIAEBItDDGSJ1EIKLIPRSSQVCSiYCCIV'1'DIfANNK
No robust holeolog Present in Genebenk/H~I.
a at 11/7/98 FFlbHffe'(ATVAQTPQTTOPOPSVSHKATHRYCSWVFPICPILVSrr_r.-.rVRVLPNKLILN!(GTLSSLKRVKEDVKEVR10GLLLS7ILLEGYppACIGWI4CY8VIYNPQ
rer.~,LVIA
sGVrrLSICxGTVLAIQIVLaGTaLVLAFNHIROFKOARTALLNSIOOtuAPAAATVOKCKKL
LEr7RrssK
CPtL0318 36270 363176 0309 350977 319595 rbfA-Ribosome ,iA4iaQ Faecor A
CPn _ VIISYNVIRa.SIItOOrIYId.IfYQFI'EiRAIKRVNJILLpEAI111NILImVKIIpKISNNITRt CT309 hypocMtical parocein FNRAWEEFLLLpEKEIGTNTYOKWLRSLKVi.CFDACNLYI,FaQI~FQITIiFELIIIRIDIVKRV$L$ImLNSARVY
VSV11PNENTICEEALBaLINSAGFZJWRASKNWLKYIPGJIFYLDD
SGLVtiTl1ICPIAVNVTSVDKAAPFYAEIO(xlppLKTAYITIIfYCiSVNPQlI'FSNFLVTPLIJIFSPOOYI)~L
IJIOIO
DLPFRVLQ6F17CSPDLt~K~YrFNPIYLtGF~BGIITI~SAISVLRP80CKILY1BSDI.
FTEIG.VSAIRSfiElJC7~RSFYRNiOALFILDIEV!'SCKSATCtArIIRFNSIJIS>rlr~.IVCP1L0319 VSSSYAPVDLVAVtDRLISRFLNCVAIPINPLVp)Z~.RSFUtNQVCRLSIRIC~l'ALOFLcxul-tRNA
PseOdouridine Synthase IYAfS
LLYF.DWRTIdJLDPLEAtIGtVALTPLKItRTIllr~fINI'IKtxllflaLAV~.KIxILLVDKPpCRTSFSLIRAL
TKLIGVIOtIalilOnDP
NVAQY7fCVS0E5ILCR5pSRLYVLPRCVANYFCRQKLBLSYVIIICDVFBRDIISTVISSIRFATBVILVItt.IGRK
LIDpKIE<ISNDIHMAIODISKNLNSUDLSLLFFPSLLItIILSAACYFOCLIQQLPPNFSAKKIrQCKIILYEYARKG
PV11SCSKCrYIRSIAHPZC'1'IGGCCAYLEpLRItLRSGRFSIDLCIL7CNLLCIIPLII'DIfPY
CPt~0310 353173 351019 uL
60IN-60kDa Inner namosane Prxein AKISL CPII
TLA
Q _ O rib!'-FAD Synthase YFOLLSLIFRVY014~IKIlTLi.FVSLIGIAF1ICC0IFFGYDIEFRSCKNLAC
A AVAVCDILLFLLtMGEAAQSVIfSSGLSNSFVONKOC
FDNINLI1LYRCOGSSFNP1'N'fCKVFLpTNIICCLPVLtNEFRHN1C6PLVFLCLYAGCRISN7TPISIFLPTY~IP
NLIAYSLTSSPSVDSV1VCFFDCCHLt'JiSNGLSILTSYBCSiCIIIT
KDSTIFGT11LVP9iRSGSDYIPIGLYDSRLE1C.VSLDLPITMVIT~00DSAKSSDTANHFDENPOTVLSZ11<17~I
NtIQERLOLIATFPII7Wi.CYLTfT%MANpSAE6lLTLLIOINL
YVLlNDYNpINSL85CSILCINLPP11S'1'NMfSIVNEIGFDROLILSflISPEaIit'FGLSSKKCIOtLIIGYDSC
ICK00pSM'LALDTIGKPLGIIYILIPPYAlIDNIW88LAIRp!(iAG
LPL7COQAIDISIGCYYPLLRRGLLSDSKKLLPLEYIIALTNV~RELATRIALRYRVLSYTPNLLCIWAFLCIIPYAIS
CKIT~SGIQGSLGFATINLPREFSLIPL~VYAC6IAYCITI'Cp HSIpLESLDRSVCKVY1C.PLNPLLICpYVFEI'AITLTKE'1'EDVNVtSGVPLVLINSNA811PGVlIILCTAP?FG
RESLYAE11LIIFSPAENLYCKEtISIIPRKFLREEKKFQSKCILIMIIIL
TIKYRVI>nO~GGSLOKVKt.PIfVItEPLAIRROVYPOWILNSNGYFGIILTPLSLIASCYCSDILDApDNPAKGSFN
Y~TA
LYISGSTAPTRLSAISPKNOLYPVSKYPCYESLLPLPKI9A61'NRFLVYACPLAFPTLKVL
I7KTITCEKCP~IPLYLDSISFPGVFAFITAPFAJ1LLFIINKIFIQ.VI~1CISIILLTVFLCPr1..03Z1 36f900 361767 KLLLYPLNAWSIRSIItPNpILSPYIQpICCKYIOrEPKRAONEIMGLYKTNKVNPITOCLPYahr'wGTP
Binding Protein LLIQLPFLIAIffttt~fSSFLLAGILRFIP'G<'tIDNLTAPI1VLFSWpI'SINFICNLFNLLPILYSK1QNIIFIF
RCLNSNTLOGIVGLPNVCKSCLFNAL'SCAQVASCNYPFCTIDPINOIVP
IGIVlffL001NTSLNKKGPVTOQpXCQOVrCFiIIIJIILtTANFYNFPSGLNIYNLSSNILGVIL7ERLEALJ1KIS
NSQKIIYADbCFVDIAGLVKCASL>CACIGNRFLSNIACTIIAIANVVR
W'OG4IITN1(ILDSKHLKNEWIIdnCKHR
CFIL7PDVTHVSGKVNPVF~IEVINLELIFSDFSSAKNIHSKLFJtLAKGKAL~C~LLPIlD
TIIANLEKGLPLATLELTPLOIVALKPYPFLTNKPNFYIANVDLSSLPOI~KIYVMVRL
CPn_0311 351153 353575 vAAIt~ISKWpICVRIELLIVSLPIELRLEFLHStGLEKSGLHRLVIWIYDTt~OLISYiT
CT711 hypothetical protein TGPCLSRAWNVACSSAWEAAGEIHTDIQKCFIRAEVITFFit'IIECpGRAAAREtaKLHI
OFlMIHAVIYWDRSKiVWSFEPWSLNLTWYGVFFTVCIFLJICISMYL71LSYYCLtIDHLSE:CRDYTVCDGLIfIC.
FLNN
FSKSpLRVALFtiFFIYSZLFIVPG7UtLAYVIFYGWSPYi.QNPLLTIOIWfiOGLSSt~OVL
GFLWAAIFSWIYKKKISKLTFLFLT~CGSVFGIAAFFIRLCNFIiNOEIVCTP'fSLPNGCPn_0312 366231 wFSDPMpGVQGVPVIiPVOLYECISYtNVSCILYFLSYKRYIRLCKCYV'ISIACISVAFI' YscU-YOpS
Translocacion Protein U
RFFAEYVIfSHQGIM.AEDCLLTIGQILSIPLFLlG1111LL.IICSLKARRHRSHIs'NI~i4SMGEICfEIUITPKR
LRDARJ<IOCOVAKSODFPSAVTFNSMF'YAFSLSTFPPKIIIGC
FLVSM1.SQAPTRHDPVTTLFYWO~CtJ4<.ILTASLpLIGAVAWCVIVCFLIVCPTFfTN
CPr>_0312 351518 351976 FKPDIKKFNPIF?llltpKFKIKTLIELIKSILKIFGAALILYITIJfiINSLIILTa01IS1I
CT101 hypothetical protein ITACIPKEIFYKAVTSICIFFLIVAILDLVYCRHNFAKEL1WEKPEVKQEFKDIIOtIPLI
CTNARNIKYFLIIFPGILWISACNOILLLKATAIALDPLSSFFTYCLLSMVS1rGLILSLIGIRKCRRRQIJ1G~EIAY
'CLLSKTIRKGL:LSSEFFSpKITWIAYIKOTFISRRFLIININIAFSLVLRRYLSNPOALILDEAEKYGIPIMRNVPL
AHOLLDECKELKFIPESTYFJ1IGEILLYITSWIICNPNNKM' FVIRATVG1(ALIKTAIAYFSKLQNAIXENpEGtiNOPDHI.
CPn_0113 354957 355355 CPn_0323 36731? 369160 acpS-ACyI-carrier Protein SynthaselcrD- Lov Calcium Response D
wKILKEISANSNEIIHIGTDIIEISRIREAIATt~NRLWRIFTF~1ECKYCLEKTDPIPS'SFIMNKLWFVSRTtGf~T
TAWMINKSSDLIWt.wl9~CtMIIIIIPLPPPLVDIJ1ITINL
FACRFACKEAVAKALC'IGICSWAWKDIEVFINStIGPEVLLPSHVYAKICISKVILSISHSISVPLWVALYIPSALOL
SVFPSLLLITTNFRLCINISSSROILLKAYACNVIpAPCDP
CKEYATATAIALA WOGtiYWCFI IFLI ITI
IQFIWTKCAERVAEVAARFRLDIWpGKQNAIDADtJIA~IID
ATCARDKMGLCKESELYCANDPAIIKFTKCDVIACIVISLLNI4CCLTIGV711iKlmIJIO
~:Pn_O31A 156185 355353 AAHVYTLL.SICOGLVSGtPSLLIALTACiIV'1'l'RVSSDKtdINLCKEISTOLVKLPMLIi.It erxl-Thiors~xin Reduetase CAATUiVCPFKGFPLWSPSitJILIFVALCILLLTItKSJIAGKK00GSCJ~tiTNCAAGODM
MINSRLIIIC.SfiP~Y1'MIYASRALLHpLLFECFF:l~I'WOLMI'l1'VENF'PGFPECI'IIfC'.DNPDDYSLT
LPVILEICKDL.iKLI011KTK'.,~CSFVDONIPKNROALYCDICIRYICI
IGPKt)ltifMKEQAVRFC'I'KTLApOtiSVDFSVRPFILKSKEETYSCDACIIATGASJIKttLHVR'I'~>PSLEC
YDYMLLWE1IPYVRCKIPPHHVLTNEVEON4SRYNLPPI'l'YKNAACLPS
BI('CN:rK)EFHOKCVTACAVCI1CA.;pIFKNKDLYVItY7CD.iJILEEALYLTRYC:.~.M/YWIi\WV.~.EDA
KAILEKAAIKYWI'FLE'IIILNL.:YFFIIK.~.SC~EFI!',,IpEVRSNIEFNLRSFPOL
RRDKLRA:IKAMEAPApFAdEKITFLWNGEIVIIISC:OIVR.:VDIKNI/~YfCEITTREAACVFVKE1II'RLIFLQ
KLTEIFKRLVDE9ISIKDt.RTILE::I-.EWJWTEKL1NLLTC(VR3SLKL
r'AiCgIKI?fl'DPIdxDLTLDE;CYLVTEKC;TSIIT~'VFr:VPAACOV~WKYYR0111'1':.ACSCCYI::FI!
F.','QI:Q:~1I3V1'LLDI'EIEENIRC;AIKrT::N::S'llJlfpPD!:VFILILKrhRNI'ITP1' f MLOARRFI.I: PAtTiqFPVLLTA IDVRRWRKLtETEFFDIAV
t:: CpEIL.PEIR IOPLGRiQIF
'Iwy f I'. t5t:977 15H71r: n't'ti rl Sa4 t~tlbfl f'llh:ff!
r::l :a ItiLrANtt.li Prutuin ~"r1:11 hypocMrriurl protein MI*VAF.Y15J~:::KKIIiXJIEC.'LTEDVAEFKDLL1'TNIPIT.~.::EEE:IWEIJFC:ALLII4TWYWNIRRI
eIAA.~r(XTl'GCIL'A.'l'Or:/tILMVhAIJ~AKAOME'NA.~.GEI'(EFNNIOp':.O~.T
nIHKUF'VWIIw:LK::pt,;VIFM::EPtDS.~.EI:LVLuAEJIWLOpAEDEF.:KYIL.:REKATRNI'AMTRTKK
KEEKF~~fLE::RKY.~FJ1C:KAF:YY:a_:fEEYt'f11'DLADKYAa(Ii!;EIv~
,7ItS7Ylh:! I
LAIN'F:h72:IYKt.'(IITAKVKCI:LIVDG7FIRAFLPt::.UIf7FIKKItta.YlaAIt:UI)/I:PEDILAL1 /VEYIKGIAlrr::
IKNLLDYVGKVC: fJ Id7YLVrT'I'PF:7Ir:KLKFr\LiGARNI'rIT
r:F'KILKtrNFRRNtW::RNELLEAPRL~.KKAELIFlrL~.It:LYRKvri'VKNITDFCiVFLDLDFktFr:M'AI
tJIKNILFA:~EYAiYrIlN:a".:ra.f::LYLEVTn7I1P1ITI:ppLL:TILOpRYTYC30 r:IfX:UJII'fI>lrl5dKRIItIIP::PlIVEINOELFNIIL::/CAIRKC:RV\L.:(..I:v'KFJIrIhYIEOt EKMAIV~::FItIKQIATELKItG:Fr/F::Ny,7Wtrrl-rf.rnrrAYL'r::YnyFF.;RVFILLD..rLK
AtYIIr~'fCa)IlIFVKVAE.TIHKIIt)DKFPTJL:KV.CPn 013' % I Ja1575 dNLIC00V0~'f:VWLPFSuR snpB- Snwll ProcW n t QTSSRLF:~ADKROOt!)IWIANALDAVNTNNCDYPKASDFPKPYPw3IEEIFPQIOL'*RLLILUL.RRKtICFLLYW
Fi3PIllclidi KEt ~A
CPrt03:5 J7tJb9A :71119 PlI~'1.'~Y~hI
EAGLVLTCCEIKBLRdIIaQB;IGDJIYILl~tI~EC8iI3180 WRYFLRKLExKI110KQf:LIPIGNF4SRGYVKVRL.xCRGKKAYDKRRTIIERFJIEREV
CT3:5 hypochectcal ororwn MAIIfRRNN
KRLAIpNQYECLLE.iLAPLLNiT'.J1PDYJJNSCLIRFSDTfIVPWIEI9l'~NSGOLAVSTL:.
?LP04VFRER: FKAALO"VNC.:FQSS IK.' ILGYCEYTOQLYLSDIL~fYIl4GEKLFEYL 033P 393373 387375 ' CPn ' ~
fVA _ :NLPDLNVLfa% 11U ...~ty.ry..-. - vil~
LRT
KLFSLIAKIWMFw . ~ ;~.tl' . ... . . rrINKF:-:.:aNr:;. ;r ~.:~v: ...:., ., ' ,, r.~~T :..,-.,.:.,;L...;h; ,.::.vvrv-.ct.r:~::
Z:Y
' ' ' f.E:::EANLtt~.:.:.v.:a;l4>.;i eHLL:iIIEIImfPll:.
LX:.i.~:
AK1.YEK::ai,;IP::KRFtL::/
real0-Glucanocranaarase PDION71LRFSLPAEOLKTi4f~RTSFAVSREESRYYL'1CYLLIIAMiVATII~TJX'nKRIaK
7iLLRRVNVIJfY'tIfHSP5J1NAWNLICfSPKfKiIYLPLFSIHTKNSC~uI~vEFLDLIP' PSClL
r RLLBG
LI~OIfOCFSVIOLLPLND1'CtL?fSPYNSISSVAWPLPLSLSSLRtID?IPNAIDQ.QIDAMLDKSFSGEYIIPIIL
1VEEIIKHCSDEGBr1IIFLL)pOKIAVOCt>MLLIl ' CSTPSVSYIbVKDIKYIAFLREYY01ICCKSSLO(ZISNFSEFLESERYYILYPYCfFRNF$SH$VRFSTLpC$L?LTJ
BiCIRV
OII~ PVIST6SNVKLDL)IREF1.ITLLKOVALII
CPfPDFS
. _ AIKMIUiCEPIHNWPKSLTDOENFPDLZKKEHDEVGiFSYL~OFLC1f00LCEYlIAYAO(RIH~~F~EIAFNPfFFLD
ILKHSKDCLVSIGISDSYNiGIITDSI196WI
VLLi(~LPILISKDSCDVWYfROYFSSSRSVCAPPDLYNS1DLPIYNPSOLJ1I~DYhP~LNt~
LYNBlF3tLRYAONFYSV1IRLOHI IGFFW.YIIitDS>iCRCRFIPONPKDIfIKOOTt;ILS
t~.G 193105 384034 ASSIQ.PICEDLCIIPOWKTTLTNLOIC~'L'RIPRWLRNNG4DSAFIPLXDrNPLSV1S2.SCPn_0339 StIFS7ISIlRINLFI~Y LT339 hypocMcical prauin KTLT'L'ITOIDILIC
PKEAK
fAKFIJtLPt . _ _ O T
Q
THDSDTFAQwWGNS
LAICPDLVSKNLORERINfPCI'ISKKNNSYRVIIPSLEEWIRKKfNGIfIF)IILTGL
~~~~P ~
OlZ7 371937 373311 KaRIl.I5G71PADRALELNLL~OCDNNYTI"~LSYYNRAt~ANU'I'KSKQTStVASml4S
CPn _ ~P~~~~'r' ' r138-L38 Ribosomal Ptocein RIHRKNNSRKGPLTZiKRPRRCYSYT1.RGIAXX100GIGLKVZGKTKRRFFP4llL'HUtLWST
CPf~0310 383843 381156 E~iAFLKLKiSASALRHIDKLGLE%YLERAKSIO~tF1lrarltehitt wieh 0339) PLYPLLIVLSSRSSACICCSLKKOAM1JAGLWDEOLVKHGTYLSIQRfLCSOKLSDLS~L.
CPn _ wSNIGJCEOLALKFKSSLIItNSDISCtAVAEEFHKOLSISLPRDLE
cT085 hypocheeical protein LIfYRCIFNSPLRRNISLFRSQKpLIOVFAPVSPNLEU1EIHRRVILDpCPJILLF10JVIGS
SFPVL,tNLPO'fRNRV00LFSpAPD6ILIJ1RV11NLISSTPKLSSZiIXSRDLLIQtIgSi.CLKItCPI).0311 ARFpRIpFVSNSSVM.ImiLPLLTSWP(FLTLPLVYTGRPTLTTPNI.~IYRWRFNOtfraas-shift with 0310) S1fFL9CNPFLTLSAIAPLPWVSLLLfIITFL
CSZ'SK3PRRtDPLLTNNONPV50FSSG.OKHSLL71ILRL7IflCLYLKpSHNVSPLVCLODI
NI?>DLtlFOI01030(3'OILYC71Ep1~l4LWHAGI~iCRWOLLDPAPTLCOTLITSTHNNGFi.PKTSLYLSIFN7 OG7110.LYKKTHDHPHPLLYDAEFILVCFSPAOKRRPbiGPFODNIGYIISLONDFPCIIIQDC
IYNRImAIYPATVVG1IP'YOIaFYICtiKt.OLYLSPIFPLV1IPGVRRL1ISYDESGFRALTM
CPfL0313 381619 385067 VVK)CRYfdItESLTTALRILGDGpLSLTKFiJIV'tDOEVPGDRTSYVLETILOiLOPORDLIIDtbit:ced ONP
(leader 119) pvptidel fSETIINDTLDYTCPSW10GSKCIFNOIGKAIRLK.Pt~YpOCKIHCVODIAPfC~CLVL$
TSLEDRCIXSLLHNPDIJt541PLIILiIUiLRETIOSEK01WR1'!'1'RCAPAt~LII~JtSNFIBBfKFLLTILFL
)lVri~IPLFSETSVIQTLPSGIOGLICITSKOKEbVVCVIUFLRSYTSLKP
ATNRPNYHFPFVTDAtI9CPSYPKEYIYDPSTKOIfVSEANHAYFPNItEfFYIIARVLEKLtIYWEI4MYlatRKE'I
LEKHAFJILNRLLK1CI11ELKPGVPINfYI7tSI0f.IfIVR
VAWIPDCPEFJ110C>l7D.tStALLRTODW
CPn_0339 375085 376146 Phtapholipasa D Supertanily (leadesCPIf_0313 184999 385595 133) peptide) KNNKROK~CICVIISTLILVGIFAMPRGDTFICffLKSEt)IIIYSNpCIIEOIOtKILCDItratleshltt with 0313?1 AIF11ADEEIFLRIYNLSEPRIQOSLTROAO~RfVTIYY01D'KIPOILXOAbN9TLYE0PLPRRBQKRKAILI471PP
N71CSTLJI7tRYRCVKfVOFVft3GKIGRQLLTYCPTK~NGKLPS
PAGRKIl0I0KALSIOID0371WLGSANYTNLSLRLtadJLIL08lSSG.CDLIITNlSDWSISLDVLIiS~181IFLP
FRLPIfCi00KVCTIETKLD'fPNKAYVINTSN1YII'i~KSLYL
KDQ1GKYPYLPODIIIIIAIOAVLflCI0T11QKTI0VAlDbILTNSbIIQALRO~QIeGIINDI>B~'LICt~ILTPI
IGIVPEMLt~'tIMEDKQKNSRLJIPYPNODIYVINCfG8RPIlNLYGP
IIDRSIIbID:.TFKOLR0I14INImFVSIMAPClLt00CFAYIDNKTLIJIGSZtiiISKGRTSLNPIDMSLi7pKNS
INPEK~R
DESLIIL121LTKOONOKLPID:Ii10~l0iSDIPrV~>I~LLII>''~fSi.PVEGOGA
CPt>'0330 376930 376303 CT083 hypochstical Drouin FISII~Id.SiIOLFSVLPSRIpDLHVYRfKF'SLIQirOFlnTtf100EIWVLiICIKEF~LRA
RIfhPVAKRRIICIYLRIFRVLSRFDVIOtIIWDPYGALSJ~pSIA~6R1711SPLVF3CISE~I
ATNGIRLi(LLAIGDRDQ
00Bl~EIQRWKRSI)M'K1DDOCISLCM~OiIIMMYJ1WVIARNICGVLBTASTLFYOKD
t7J1 CPlL0331 37153 376701 CT083 hypocMCit:ai protein IOAIIN)IVSGf70GVpPSSDPCl011tPALOCD(?AECPSPLKLSIfSETKQASSMIWESLVR
' SGbICNYATESOINKAKYRKAODRSSTSPKSKLKf.TFSK)01ASVOGfl16GF05RASRVSAPiC
LVPPTFLLLIFPIPLTFODLFR!
KIIASOSGAGTSLLP'l'CIDAIALKKQRIISPDIOCFFLD118GNCCbSSDISOLSLGLKBSA
PSCiUtSLSLSSSESSSVJISFGSFOKAICPlISEDIVNAwfVARLOCtE?IVSSLLDPHVLT55CPI>.,0345 LVRRANATOIIEGMIDLSDLCQLEVSTAK1'SPRAVEGKVKVSSSDSPGNPTGIPNSNTLECT715 hypothetical 0rocsin RAEKF?.EKOFSRDpLSEDQN14.ARAH71GLLT~AlIP0EVL5NSV1ISCP51~IFPPPKFSC1'LLKVaCIJWLAVI
~f'wRTOSICRQTL6IVRRYPSEFKIISN71SYGNNLRLffQOLICFAPLM
DKSKHKSPCIF~CSTNttTNFSPLREGTVK511VKSLPHPESMIRfPKDSIVSREEAVYNECVYNF~ICQRPPH110FF
LCQDGL71QLCIND'IYTTWAASSOIPJILPAILA&000CK
PEAWIIFSTAFKNPINSSONfLPIAVESVFPRESCI1~'rAllf'aSDAVS55YHFLAORGVSLLALALJ1NKEILVCA
GELYSKTAKPNOIKVLPIDSEHNALYOCLE1GR'l'It~IIDa.ILTJ190G
APLPI41TDDYKEKLEANIICPGGPPDPLIY9YRNIfAVEPPIVLRSPOPFSGSSRISVOGKPPLLNKSLCELSCVTKO
WWRPIN8~14CSXIIrVI>SSTLVNI(CLIIEJIYN4lCGC~NLIL)1 ' EAASVIiDOGCGGNSGGFSGOpRRCSSCOKASROty00CKKLSTDIF
VIHPOSLIHfBNCELDCSVISIIBIPPDMLFPIOYJILTIIPERfASPRDOIDFSKIOpTL.P~
PVDCeIfPSIRLAppVLEKOCSSCSFfNRANEVLVRAFLCEISWCDILIIIILTTU18CNK
CPn_0332 378676 )79536 VYACHSLF~ILE51DGEAR7(frlOEI
CHLTR T2 Protean : ' ' 0316 389690 388701 CPn ILLHLLAVLCPPISFFTOGVSPCVFfCFLDf _ YLDSRIRVIPU1RQRC 070-eroDlytQD-Intpral lhalbtane Protein CPn KKOSW7sLRPSPYYCVSFfOFfSVfFSRLfSOSLPTCSLYIDDIQIIVfIJIISC9CAFlIG
_ TFLVLRKNAhYANAVSH11'LFCLVCVCLP'CNQLTTLSIGTi.TL.71ANATANLiGFLIY1IR
itu8 VDFFVFVFFMCKPKKSRTDRALAOEIOKKSTEVLKKP11RIKAKNRRKFLIAKEOKTLKHRNTfXVSEESSTALVFSLL
FSISLYLLVPMT10'1JWIGTELVIGNADSLTKEDIFPVTIVIL
ApEIIDDL'IR~LLDSOKKLITDKVLIFNYENCFVFTDICDNFSKYSIRLANAVITIFAFRSWCSSFDSVFASSLCIPI
RLVDYLIIFOLSACLVGAFKAVGVt?UtaF
LI IPSLIIUfVIAKSIRSWAWSLVFSICI'AF4APASSMILSAYDLCLSfSCISWPLTN
CPn_D3)4 379309 379837 N'IIWKFISYFRGYFSKNfEKISERSSOY
CTD79 imilaricy TMSVHITPRKCFItCILs7IFTLPTLFpKAHLILFSPYIVIGFYCFSKDKCLVt.At,CCGVLCPn_0347 )?t078 .~.DIJ1LGSRCVFLLLYPLTALITH)fANLIFSKESKAALVIVNNIFYCVPLLLTIPICCALFC069-traC/yt9C-Inepral ll~abtane Protein HEVRWrIDVLHIPLKCSFLDNLIFTSVIYILPCALNSCLHKMISFFRRLVCYTf?ItIPGLSRK1'IWIVLINLSC11F
SDTLFLaSFLIVTLICN1TALWG1'ILLLSIOOPLLS
ESLSNAS'IPCLLV0AIJIlrY'lvFStQAr~IFWIVLFOCAASVfaYOI
IVFttIKYCKLNKD8J1 379p08 )80671 LCfVLWFFAIGVILASYt'KESSPTLYNRINAYLYQOAATLCfLFJITW1IVICASLIAL
~.Pn 0.35 _ wWWlfR0hM1'fDKDfAVTC'LKTVLYFJ1LSLIPISLVIVSCVRSVCIVLISAMVAPSL
tolD-Nechylene Tecrahydrotolace Oehydrovenaae EICNLLP':LPMEKILJRLKEEISOSPTSPCL.AVVLICNDPASEVYVCMKVKKATEICILGAROISDRL.a"l'ILI4 SAFFwI~.ALC.~>YISVAFTCRJ1IICOQAVPVTLPiGPLWICAG
.~.KJWKLPCDSTLSrVLKLtERLN00PSINCILVOLPLPKHLDSEYILQAISPOKDVDCLHLLAGLCLLFSPKr'.~W
VIRfI'RRKNFSFSKDOEHLLKVFWHISHNRLFNISVRDFVCSYKY
PVN4CKLLtA:NFOGLLFiTPACIIELLNYYEIPLRGRNMIVCRSNIVCKPLMLJaIQKHQE'IfCPKPfPRWRV'OIL
EH11'w"YSIKKEODYYRLTKKCRSEALRLYMHRLWE.9YLVNSLDF
I~ytTX.'"'ffVWt:OSFlILtEILKT.~DLIIAAtGAPLFIKE7?NAPHAVIVDUr7TrRVPADN:~YE..~VNELJ
IEEfEHVLTEELD11TLTEIWDP~!DPIIPr3IIPNKKKEV
AKr;pTLII:DVDFNNVI.TKt.'.IAITfVfCtIVf:PM'VIWLN.3MWRf.'YONFS
CPn_0349 )x1915 l9lD'!.7 "fKS u:sr. irlt)S~a tw1591 Db9-troBlyeWB-ALK: erannpnrewr A'fPase:
.!m 11, tll~ILIIJVYDETtW
:VHNI.SIN'IEtIAAVLY11I::F.~.IGI!r:CLTAIII:PNI)M:KI:TLI*A."LI:
vt'IXCMI:apl::::;:WftHW::IIIa:RY1111XJYNaNLPKFFLYLG:LCIt:::I::X1KTTTIOGEpMfILIY
P.~.Sr.TrIFFNOKPKK1ROPIA'MPQRA.:ILYIAFPNTVLDIJUl4rXVYCYKI71~RL~.S
;"lNIVII.T::I::AKi:1'vl::I,:rf~IpBt:FHKID::LYNNWNPYS6L::rfNRAPADVPITL::VELINnP.
RFJ1FIIILfRVGL.E:3VACRrJIr.'OL;:tXaXyJPAFLAHAU10KADLYU4DELF::ALONAS
:aat.lY1'Ifll'LYKL::17GRFhITtICPLKTWf.IJILK::OTLPPKDVWF.ONYK0W~1~IQHLEFO::FT'TS
V1:'IWELRDOCKTW't'VNHDL::IIVRpC.FFdIWLWKRLIr:Ia:I'TDfX:LttLDfLfQT
trrKTt.IYKN1~IIVrJfDL.I:WKriY\VIX:WEIt:NTFr:PNNYVI:WIX:EIKT::CH11P:~RPWRYra:EIE
LLF70TLKL.~.IK'*VF;:C~:
Ir~:FlV:fILlilIr111AtA'Y:7~IIIIWIYAYf7r:KIYT1IILClfR'R:KII.ELaSYPI0:1V.3WN'T'-, ;.,I
It:r'A'lAhIllA'fVIXfFIC:KIFJIICyWAEEI111IL't'YTt)fl7A.~.'.ir y ':f'n O14'l y ' y I-rr.A! rQA-::slur.. Prnr.
vtr Ismv)tro7 t'.vnily n.
n WILKHA::RFIAWIDIGYIFKVMttWTFrFVACC:TkALItOPEGIPVTr'::...'Wr:~
~KI~fAIDAW:i::'C:..::.:::.A~..ARF~.:::4MEIRDCA
iIANSRPCILS'FTIRNIHDC
LFF71PNDP'VFIDDVFHALYJ1~aKIISItAGGt~ILL::FA.'iKCftFR'.LOhGEIA
'JERIM:NHIr\TAVLIKGSGDPFIAY~IVIttIDKDKIAGSAVIFCNGLGLOtCLaLRKtILEM'lARIKPGTPL
LFMI10CCIIOSAFirniltH AAilPf~faLFO~fIIAIiI~i.R
R LP
I'FI;:VKLV.ERLIARCAFVPLEEOCICDPHIY81DL.~stYlKfaVIEITLIILIEKFPCSiSAEFKAIPCLAAAIT
FIf~:7PR'IABLiNOGGRO~IfCAFCdFERitDR
a;EEL12Fa1SILD~IAKOCL~IIPOiUIYLV~FffiAFSYPfRRYLATPEEVASCAWPSRC
ISPECL.iPEAOI ~VRDINAWDYINENOVSWFPED'I'I1~DALKKIVSSLKKSNLVRLAOK
KPLYaDNVDON'lF.:TFKHNVf:LITELC~IALECOR106b50 40578.
Cpn_0361 SYt .P~ Synchee~se Yr tYrS
~tm o75n 797167 J~7b84 ; ~
T
- ,. .~ ~ .....,... . , ......: .P \ r-:1~\..LT':IIW:-~
, ... I.-: ...... , . :!~r':F'.-~ ~ ',K:'.: ' ny:.ll,~,.
..,....~ .,.,y..14F111i'!': -';'tw: .-:?..:.:..:.':':".',::.
y.. '...
.... YILb\4:
:w. " ' KlU!Y ..
~
, , ., .
, 'CL:JWIAOWWEI.:~...i:w:'w;iKIIthaw:wML':K~:.i::HVH.i::E;;IrYiEF::YL:LOiYD
, DFIRRICGIG
. AYG:
... . :YPLLiNJ
. GIGCiG%TiSG
! ' ...!,;;
.;.,.... ;, :Y:.:':YA . ..tlCG....
ILFSETPRTINPKPW1PR(iSKKRRDFINFTtITDICRYLEIJIROVt70KDLi.
' F:I" O
LLKNLKTF .
ARFSPKKPLTSLKRELIRSIRNGIVSVELWNAYVFaVRAVSSPNLEV1'SPFVDG~'tiTSCI
IO
FYHLFKNYCTILr.Ci..S
PKIARTL2LLSNEEIODI~tRVpTDPVAVKWA
'MiLDSOLTSPFELYOY;.LRLPDDTT
ODILS11IIK:DIGLEtaLSS~TRS~tPt~SGS%~Flt~.!'AG~ASLDKSLVIGIWIiLD
CPn_0351 791861 395133 4FLVIGLCKSKGIIRRLIEOKCVYINNVPiANEHbI7CEE0DIGYCdIYVLLrIQCKIOt%t.VL
adc-ADP/ATP Translouse YW
ROTImTL
TVL
.
KIKVfQRVI~BTfKTEDCPFGKLRSFWPIHTHCLKKVLPNFIJIFFCITIt~iY7 StIIt.SKGALFYAVG'i'PFLIFFALFPT
IYAm IVCApGSGAEAIPFIKFWLWPCAI If?G
. 055 . CPt4.,036Z 107113 10 AAIYVLA~S~t~~'F
W
fF CliA/rpsD-SiOea-1B/WhIG FaailY
VIYPLROVLHPfEFIIDRLQAILPPGLIGLVAILANSINIITE
KRFYALFGIGANISLLASGRAIVNJ1SKLRUYSDCV~WGISLItIi.WVIIi' ANEITK
H
I
LDKJOIIVIITOQ'I'ONIIE1R4NFYWCIOEIEYRDSLIEFYLPLVIISWNRLi&ONI
EA VN
IV9GLVUtASYWWINIfNVLTDPRIYNPEIHOKG100GA1CPIOB4lAmSFLYf.ARSPYiLLLAK7WKL5G
' LVIAYGICINLIEIrIyIKSOLKIQYPN4JDYSEFNCNFSII~IfGWSVLIt4.FVOf:NViRKO
t LI1GAIIDDLRKOOIiVPRS
DLYASGVWLVRAVERYNP6R$RAFEDYAV!
SD~i10101I
VSWtMPS
VMRPAL
. .
FGWL'iGALVTpVNVGLTCIVFF11LVLFRNQASCLVANFG1TPIJG.AWYGIIIGJIW''1ISTI
O
ASLROSLGKE~WI~~~
EFS~IOELEKERKVMALYYYEELYLI~IGKVtaVS
KYALFDSTIIB'IAYIPLDDEOKV1IGKMIOWAARt~CKSQGALI00GLLVICGSICAK1'PYEERIPDE>1A~
SI
FR
LAVILLFIIAIWLVSATKLN%LfLAOSALKEOEVApEDSAPASSt , , .
ESRVSOINSKALLKLPAAL
0357 395178 396130 CPn_0363 109700 107913 ' CPn _ flhA-FlaQellar Secretion Protein No robust homolog present in Genebartk/~I.KIRlfT1118S1t as of 11/7/98 !
WVGIFFINSHFfNSYAFFNOtfVIITVRIfSCf.Tl9CCSPLTLVPNiTLID4DCECHRSCSLKIGLCISFAfSLLT
GVIYSGKKDGVROIIPVPLSILVLIFLPLPOILLD!
~Ri~W~TRWIVSSGTASSLIVSLGSFFSLGSWAATFACL4LF
CLlPPFFLYLC' RTTARLILGLVLALVSJ1LSFVFW1PISYAICGTLAtaAIVTLIITLWALLA1ISKVLPI5J11 1VNFLJIVSKGSDtIAEIIRSRFFLEALPAKQNU.DSDLVSGW1SYR11V11%OICiALI~DF
PNELQKIIYNRYPKEVFYFVIC"HSLTVNELKIFINCWKSCfDLP~LJ~AEAPT3iDILKFSAI~CVFRFVIIGDAiIS
CILLLVNWSVTCLYYTSGYIILOpNMPIVfGpIIt.VSQVPALL
IDIGYTT
' QKVYGCi.GPWI
TSCA71ATLISKI~LSLI31YLFEYYItQLRQHFRVVSLLIFSLCCiPSlPI~PIVLLiISL
SIDLTLFPEFEEILLONCPLYWGSNFIDKTESVAGEIGLMCl IFHSYTRPLLTLISESpYKFLYSKASIC~IQWDSPSVIOCl'CLEIF'~lpEIStIFRI~IIOGIS
OFLFLFFSHGI'MEQAQNIOLINPDNWIC4LC0FDKAGGI~TFOCFIM'L79'~'DPVSL.wLRYRJ~EPIIS~SCIFR
11FSYVOGACPKEOESQFYQVYRAASIEVF~VRLM'tS
tAEVV
KYLCfSERVICIAVED
RNIJI
A
' SNYEPTVNFKIwKE4KVLLEKVKESPMiPASALVOKICttN~tIOIDNLLO~OFVRM'SSOPFJ1VLPFL
Hd II
O
LRIL~tPWLRVFT70NVYLD~1 IVPACISLSSLVVLSRLLVRERVSLRLFPxILiJIVAVYONSGDSLEILitBI(IIRKSfaYWI
art'SSLPOYAFHAOTYKLEKKIESSLPIRSSL
GRSIJtOG100TLEVITIDPNVP~LINSSYSRSNPVlIpt3iVIRRVDSLLBRSVFKDFRAIV'I
CPn_0353 396893 397135 SCLTRF~9flG4.DPHr~CViS~LP~IPISFLGIVSDEVLVP
No robust haooloq present in G~tabsttk/H~L
as of 11/7/98 109951 110378 LRFRNIKKSLIPIKRIAYSOSGKEOKOARPtfIGtSITSSLVILtS.IAIFNH~iFSSII0fC~1CPI1.,0361 FFKa4FIWI0~tTSINRIFVKFT1 Eer1-Ferredotdn IV
KaISNAKLVITSDDL00EFF3.EDNSEiAEPCES?ICI
PFACTF.CVCCTCVIIIIL~RAif.S
CPn",0351 397062 398507 ~W'~~F
No robust hoslolop Dresent in Genebsetk/C~I.Cp~0765 410198 411511 as of 11/7/98 l015T
~TYYF~
. No robuac hotsoioQ Prvsestc in Gafebank/D~L
YKTISIKILKIXTFLLIGF4LNLRYNTOIDEPRRG45NITSPVIOlC4as of 1117/91 'T-'r'-~-rr.IALIGVILGII~ITPNISSItE
'"IHIVISAILLCGALIAFLCIIAAPVSYI!~
.
FKQtQVNSL~TISPISLTVOHPLVCrKIGLRCSNF~CI05RILLITAiIAVWtIOlLI.
pVPPQELVNRIPAIIYPKPVSDFVSGKPN<.I~LISFIDLLNOLNSLYGSS1NYNVSE~.O
OKIDTFEGIARLi04EVRTASLKRLESMSSRPLFPSLPXIIQINPPFPWLGBFISJIGS%VIGLI1111PVIYFL'i'r ISFIAWLSNFILYIIRATTiZICPRACGI~ttilalaV~SS
VEyNRVID(IOGSLF~DLSDYIKP~.PTYWLIPL~'RPTNSSIWLHTLViaRVLTRDVFISIAINRSKPI~LPAPSALL
TDNPYEIWIDIIWSLFSLVSLLPO~dLI
fiKTLLIF1TSGNAFISSYVDTi'PSPKSLLN6JIIOfTRVEINI'tLIAWIO~LY
SAB~
ONLICYAAI14G684iiWISDLNMKOQLFAKYNAAY0SY10i1.60PSLOmEFYNLLLCIFIOf.
WOPDPI%iRVFLPOIPtTPFJIIYOYYYALYV'I'YIOTAIITINt'pIIOIPLYSLJtOQ.YfRC.
RYSWK~ISLIKTVP)1DWEJL.CCLTLDIti'GRPODI~FASLIGTLYTOCLiNKFS%AFLSSPPp9RN00SIJ1NITA
VKY11AELHP6YPLTIACVERSLAQt.POESI~L.B
LTLLSLOQFKTIRROSTNIAIffLBJLA't'ID4STtRSLPPITVNPLKRSVFSOPE~Si'L
IG
CPeLD366 4119'76 113140 CPt~03S5 399955 398591 No robust haooloq Present in Genebank/FJ~L
as o! 11/7/91 ' No robust holeoloq prnenc in Cenebank/EI~LGNOKLIELKGKOOAESSPRTi?SVILEVti.VmC
as of 1117/98 NGYLPVSATDULtISPAAPLINSAt~T1 ' ' IRDPYLHIIY?AFNRSISKELA!lSKfIVPHJIf.P1041tCECNSTFPLSSRTIVRIAIASLFCYKY
ILLVAVIILF'CFLMVPFMIOfI
CLIVLSLL71IRFALOF?LCfGNPIUIIAVLJIVSCI
ICAtaAIGCLiIPPVSYIVCSVLtIFIAFYILSLVIIJILIFTiCIGG.PP'1'PRIIPDRITNVIDVKTVODIfASTI
IISHGQTPTL~'IFSGIVYAE&QAGL
e'rIYGLSISAFVREOQVTLAEFttOFSTALf.CNISPEEKIXOLPSELRSKViSFGISRLAGD
CPn t.FIG4NGIPIF~LLSQTCPLYWLOKFISAGDPOVCRDLCVPREI:YCYYWt.GPt.GIfSTAID1T_ EIfwDTDEVKAIYERIY7TYTAROTLxi'F31GG1.TNo robust howolop present in OetfebanklFiBL
LTKEwLLLxNKAL as of 11/7/98 L
IFCKETHNI
SFPWRYfKfKiTSIPOVHFi~IDSHLSVDERLISFSPVLTKXEVIAKIIKLTALILiII~IIA
DG
O
KETISKELLLLSLHGYSFDOLQLITOLPRD71WDWLCFVDNSTAYM.OICALVGALSSGNL
LDESSIDP'WNLCLYVIODLKFrIVpAFSASDLPIGCtLGKFWtFDSSVSIGtLSSVLROGLVGTAWAGVLtiIPL4t7 tI11TGAALtaAWLSCLLLRRREPSKPTEELLGPOKNVPKDIAJIQ
' HRIALEt~NARARVYOVNPYTCMINRKTSIlP'ImEGDIJ1II
WPSVPi.0Y0KLLRN6Wl'LVDtfLSEINISWCLOOPNORYYVWEtpGAPITLVXI
PRLICtSCRVNiVNAaNSNIOSCCACIWU1ISMTNPTCWNDII'RTSGGKII~1I!GIOGLSVGDC
CPn."0356 400165 100109 No robust homoloq present in Genebank/Et~L
as of 11/7/98 113766 414107 KOVQLFOYtQIESQJDWt.CDFDSOCDGFQLSRLVGLLHS5WJ1LYPJ1KE0FYLPEVSLLISlECPt~0368 No robust homoioq pressnt in Genebank/F~tBL
as of 11/7/98 FyIDpLIsSKpCIWGyAKDLCNVFEKHIpRFRQYLGSLDLI,10RFHJTFLNYpKyNLpRETLAKDIfLWVN71A0HPC
SIETCRINDTNPGFJUiFLAOLLCPKYDCLXANPEKLSN1II%KA
YLNCFDF~IIId~tOA.IJVOVPLISSSIYSPCGKLELEPVNQ'fKPNSSAYKLYHIRT
CPn _ uo robust homoloq present in Genebenk/D~HL
as of 1117/98 YSSF81CASMVNIOPVYRNfQVNYSOATOFSVCOPALSLIIVS17VMVIJIIVIILVCSOSLLCPn_0369 111115 ~IELGTALVLVSLtLFASAMFNIYIMROEPKELLIPKKINELIOENYPSIWDFIRDGEVCR'058 hypochecieai Drocein_3 3LYEIHHLISIWKTNVFDKAPVYI.OEKLLOFGIEKFKDVHPSKLPNPEEILL4HCPIJIWNIKCDSNPLPSYT~'1SL
YRTPAKHSYPIRLPLNRTDRIEKILKIVtLTLALaCJILGFSIA
LCRLYIPMVSOVTPfiCYCYYWCCPLCLYENAPSLFERRSLLLLKKISlGEFALLEDGLKKAGILJWPIFSAVLYTTL1 L11VSLYSLLKKPKLYEILPOIEPESEOSSLSPSPQIP~OD
MWSSSELVOTRONLFTRYYADKEEVDEAELNADYEOFDSLLHLIFSHKLSLPLOIDPLPDPES:1EVSL1DLTTPPEEL
TAITVTPGYCALLEONYIa.LPSLAAVDPSFT
' TMtAL.S
TETPOOPCFLWKLKDSKLIFISTSCOIAVPRIKTpCRVMIVNJUWI71I8RmOG
LATSLOG1INASRL?RAHSRSGSOLOPGECRSAKWFatSDHTSNDN11PGKJ1NFL14~hGPEA
CPn _ AKCtiJDPKOAFE'YSKKAPHM.FOFJIEIICVDVIOLPLIGCNLFAPSRf.WLCKTRAWIE
Flo robust homolog Oresenc in Genebank/E11BL
as of 11/7/98 EEVLSV.SMKLIPTQDSIERE'fDSKROKKIFTIYICSSKVL74GHFFSHLDKHNKIHSTGVAIKLALITSLODFv'nI
E00NlEEDKIIILTOKDOPPIIPPRFOLTTP
~
401991 403117 CPn 0770 415755 416913 ~Prt _ CTO58 hypothetical procein_3 ~e~~.pasa ITLpYILItEYKIFlITRHFSIIAHIDHGKSTIADRLL.a~TSTVEERFytREOLLDStiDGEREKRIFFKLFVFYLKS
FMSTfEPNLTNVNLTNLL3SESMPMIJ15NKLKGLDLVAPILIiGI
R,ITIIVWPYITffYLYECEVYOWLIOTPCHVDFSYEYSRSLSACECALLIVDMQGVOAAVSSG't'MIIIGIPLLFIL
T.1WVLAF.iILLYFLLREPKSPISYMIQPTPTTKDTDLPW
t35tJ1NV'1L\LERDLEIIPVWKtDLPMDPVRIA00IEDYICLDTTNtIACSAK1COCIPPPLALTPVPTEAILEEPP
LFSPRTHOTLLOEIB~JDIttPDLOANTOMPFIMDNO'fOYAYML
atLKAIIDLVPPPKAPAETELKALVFDSHYDPYVCINVYVRTISCELIfI(ODRITFMAAKGKNSNLTLISTIGFIEKP
R1'A"COCTVNLVNAATPF81AMJVK(TSLALAIfAT3VPGIiDISKK
::SFEJII:LCAFLPKATFtECSLRFCQVCFFIANLKKtIKDVKICDTV'fKTKNPAKTPLECFSPOPLRSKOPLC4:E
C:RSAAI~tE'NtlxT1'NAJ:KI1CLPDFLCCLIGPIUSDYNYNPNDAFTF
KEINPV'~FACIYPIDS3DFDTLKDALGRLQWDSALTIEpESSNSLGFCFRCGFLCLWLCROAYLNCWFJIKRRKT'M' CLPLL:.~.fIFW:.~.FKDEETfSLRLOWIOCNKIALIDAIptF
F:IIFERIIREFDI~IiATAt'.~.VIYKWLKNCKVLDIGNPSCIfPOPAIIftIVEEPWVHVNIGf.FJIENONOPWV
T:=TTLViHPLITP
ITfOEl4;N IlMLt'.LDKRC tt.'VKTE34LOpHRLVL1 YELP WEIVSDFHDKLICSVIItGYCS
Yfi'IRf/iG'lI!KC~IIKLCVLtNEEPIDAF3CLVHRDYAESRGR~ICEKLVDVIP00LPKIPCPn 0771 ~lhl4t 417:4:
tJMItIYKVIAREfIRII::KNVTAttCYOf:DfTRKRY.LWEKOKItt3KKRNKEFCKVSIPM'ANo ntGUSC
tuvn..l.xl Vt'.eahnt m W rn:rmnk/F1~IDL .t:: mt tl!'I/'tN
!: ip-Jtyl,p K'MPV3:APLPT~IIRPS:x:NU:WF.C'fC:KAI.YAY11~~DY~'fY.TTKLLVKTLVAILVtEVII:
f IMPFIPrI'PPI::.I IIt;:LILTTIV
VI.Lt:IfNL.If.VIIKTtLTfAF7~~aTKRKI:1'SII3i wttt o:!o au.tt.I 4ut'mz >
'Ts:.u tr/tt.tt.rt.:tt innr.in 'JAfJiftltr:1.f:WVFK:KNLVINNIDtICFSV.'WNRTFEKTRGFLKEYf'NtIRELVI:F~SLEoPt~ Ut7.
'tl'oa 41wn.1 Id~'IFC:LF:HI~ItKIItUIIVN:KfVDp::IIIALLPFLEhifiJIIffIN::YFKDSERFCKELOEKFlo rrtt~u.~.t hom.Uxt 4'tarertu.
W t:.yaa.mk/F:NItI. .t:: nl Il/~//'tN
:It.PIJ:P:I:;t%:EI~I:ARIN:P::IMFt7f:NPRAWfLVAFCFOaIMKVIY:RPrC::'YAT7YX:ACNYRACI
IRHIt:HHII~':PWh:C:.::A::F'/FXrPYI::YFI.F:kI:Y::X:HUIKIAFM:.TALLLW
' !!'I'/YA'/tIN:IF.'ft:AtULU:P.A'h:ILROYLKL:iA'PAVATILKLWM'LELESYLIRLA.~sEVL1'CSV
tHII'FT/Ir:f4.F'LI:::.ILL.AIHt.I::MYKITtit'NVl'1I:N
'CF'V7DIVAIAMIF\
".Pn_0383 1. ~)07is ~'pn 017r IlRlSri 120218 CT017 hypochectsal procetn ' .
V
VODITPLTLPMOIWt.?3iD0w"uWYAEIWBAIA:.dt'eaa4ED01CQ1.
' i ' OGIJIPATLM~ICtPALIOB(t~'hG~CZNY1KPPLA'91(O~iRYfKIfH~P~1 ' ' B
EC1RQ.SMLPSAL,.'LSGFGEiiPADItOICRIIRt.:.LpMERIp:I:rGSC.."LA:ILFLMI~ST
ITLTTDIDST
EpIY
LITPAINSSRRKTNTVRIGNLYIGSDNSIK
OSITS
N
IPEIFN!
ALAFJfNCDIVRVTVOCIKEJ10ACEKIKERLIAIGLNIPLYIIDINPPPOAAl4,VADPADKV
AINPCNYIOKHNMPICGTKIYTEASYAO.r~LLRLEEKFAPLVflCCKRLCKAIOtICVN~SSLSSLPOILSEPOIILL
CSJf~KTSLONSDIKELYVKKEKIaLHKPRDSLIJtRDPV~1100IJIF
ERIMOKYCCtCIIVA.iAIE'IIAVCIOG.NYRDWFSMKSSNPKZl4V1'AYROW(OLOAACLLEDGEDPLG'.ITFLR
l'OCLYCLIISIEEGSKEMIHP1!F'ntYGKERLHOALN~aLIYM:L.:
wLYPLHLCJTEAtA4CV0taIK5AVGICTLLAEGLCD1'IRCSL'iGCPITEIPYCDSGt.RNTIO~BNpOPIVAVta' LVIRMVNL
..... . . :";ICrr/,a--:..r.-': ..~:
Ff.r;.~ : 'r:l' __~",.~..
.;.' _ .
' ,~
~
. , _ .:, .'I w "
....
,.
;
,,.y.,,~l...~;,:~:~ItFr:.Flnlln_YO::,v~I:,FEr~r~:~rr JHt;APtvHFHA50PF1H1'S'il0FFt3cOGNOt:KPTKI:JFSROFDNNEEN:I:.Ia:EFGALL:.ncCU-Htrconw-i:Ke 1rUt1il11 ~
OCiGEJIWLDLPHLPLOWLXIAFCTLOtIANRLVKTEYISCItlICCRTLFDLEEVlTRIRVITCLItIGIKMIGAOKK
OSGIOITA.iMVRKPAKK'JMKR'1S'KKATVItKTAVKKPAVItRTII
KRTpNLPGLKIAIMGCZVNGPGPHAOADFGPVCSK1'Cf4ZDLYVKNTCVKIJtIPIff~AEEEAKKTVAKK1TAXRTV
RKTVJUOIPAYKKVMKRVVKK'.SfAKKT'fAKRAVRK1YA10tpVARK
LIRLLOEf~VWKDPELrTIC.TV
TTVJ110GSPKMMCaLICNKNNKNTS~KRVCSSTATRKNGSKSRVR'1'AtII~IRNtX.I>0!!f SR
CPrt_0374 120109 130961 CT056 hypothetical protein CPrL0385 431011 43252=
VDSlICLSFNTHPL~NYWL'fI~FDGLPIRHCVFSKOKDAEGTYPAAICtPEIASALOSPKYCDpepA-LeucYl Aeinopepcidaee A
LNORNGTSYRMPTSPrYQPIIOGIL'TOSPLLSIJIIRNSDCOAAIFYDREIOD1IANVHSGFLVIIOGt~VH.FItAQ
i~RNRV1U1DJ1IVLPI~IiHPKDA10JAA5FFJ1EFEP5YLPAL~OG
wRGGtGNIYAVTVGTMKIfLFHI'KPQDLtYAIGPSIGPDYAIYPDYATLFPRSFLPF4III~tPKK7GCIELLYSSPM
KWtIVLLCI1GKNEELTSDVVP01'YATLTRYLIIKAKCSTVNIIGPT
PIHfDLRAIARKOLTNLGZSIfDRIFISDLLTYTENDAPFSSRYLI1NNPDPNLtGQH5I0J10JISELItL511E6FL
VCLSSGILSGNYDYPRYMNDHNLETPLSKVTVIGIYP101ApAIfRlIE
WI'AVLLLPRD
MII~iYYLTRDLVNAHADEITPKKLIVRVAI14L~CKEFPSIDTKYI,~DAZAttEIOGLi.Li1 VS10MC1fDPNFIWRYOGRPK$KZ~'VLIGIK'S..
'fFDS~LDLIIPGKStQ.'IlOfJC~IilOC7lT
Cpn_0375 421111 411615 VLGILSALiIYLCLPIHVICIZPATENAZOCILSYlOIGtNYVCtIS"LSVEICSTOi~LIL
Ho robust homolop presort in Genebsak/EMBLAD11ZTYALKYCKPTRIZDF11TLTC11MWSLGEEYAGFPSNNDYLAEDLLE7LSAlTteEPLt4 as of 11/7/98 RLSMKLGASTNHKVHEPVKPKKApLAEIEAtBCTOJITEClLRSKSL71WZARJ1VLYILPMRLPLV)tIfYDKTLN5D
IA0t0QiLCSMUGJ1ITA1LT1.QRFLEESSVAfiAllf,0laC1'AYNZIC
' :lILAJYCITFVTFL71L.CFPLIQAYSIACIITLVGIaICLVLLILSLLPKEDe~1~rr.e~gEDRIIPKYASGFGVR
SILYYLENSLSK
LLPLTIIVIEOOPZTPKPEIPYSYLTKLU.L.TSLfLTLRRSSSORIfIN
CPn_0386 131543 131016 CPn_0376 121680 411191 sab-85 DNA 8lndinq Protein No robust hamoloq present in Genebank/EMNL.KSIE:YL18Q'CNFAGYLCAD ETVNCKCNIt~Y
as of 11/7198 FKV1I1'AKIIPNLTEIROIGARWSLPLLSPLTSMaIOGtXxIIfSAPLIIQL00LiGEEOMfJIMK~M.PIfLbIGSG
YZYAGOISVEBYMSKDGSPOSSLVISYDSLWSPPGRNtsI7SRSPSLED
TKMNSRKIfAGpNAIFNSPTPCVSSTLVWPI'PWGYYDKWODILLR1I9PNSSSL9EKDSKNNp00G7fESVSVGPtI:
PJILMBJ1IKDKaNYACYGQEppYVCEDVPt EFLI04LFVDLLELJGfTSVIfINAEEAFTPLDNTGKPHPKRONVYLPC>Qi.GiILtIPJIAVOAN
vSaDTpFTLFL,TpDECNPPttDltlOiC CPI>r03A7 435229 431699 CT013 hypothetical proceln CI~0377 123111 122317 M~Id.OGDSLMSRONALtZILIQO'AKIffr.RLPDVAFDOMJIICILFVDGEPSLNLTYElQi6D
sue8-Dihydrolipoamida SueciflyltransferaseRLYYYAPLLDGLPON1'OWGJILY6KLL1~SMLCCpINCGGVCV11TKEOLILMNCVLI
lOLY
iM'1TEVRIPNIATSISEVN7LSLLYi'EGALIOB~IpGLLEIESDKVNOLZYAPVSCRI~WEAETIaiJWAOLPZETW
KWNTVCADZCJIGREpSVD'fIIPQMPOGOLiQAPPP1GIM
VSEGDUVPYOL1IVGKIEPAGEGEEtaDSOSKETIEAEIICFPOSGVRQSPPt~C1'tZPLR
DQ!IDOCSpGLSaGORGETRERt!!'SIRKTISRALLSALiiESlltIZ.TTFtIEYYN1'PLFtILAEECPfL038B
131313 4)7320 KOEEPLSRYGVKIGFMSPtYKIIVLEALKAYPAVN11YIDCEEIVYRIIYYDI5I11VGI0MGLqlqX-GlYCOqen Nydsolase Idebranehinq) VVPVIRDCDIG.SrGEIOpKLADLALRARECLIJ1IAELEGf~FTITIK7L11YGSLLSTPI~NSZTIGLVSSYPSVPL
PLGASKISPNRYRP1ILYAliOATEVIL71L1T~18EVIEVPLYPDIMR
pPOUGZLCi9fKZIGtPVVLONEIVIAOtIatYVALSYDNRLIDGKTAVGFLVKVICi7GGENPAiGAIWNIEIEGISD
p88YaFRVIIOPR>O~MpYSFIfLYLRDPYAKFIINSPOS!'GSRKRDOD
SLLDL
YAIrYLItEEPPPt~OpPLtQ.PI~BMIIY1'lBIVRSP1'OSSSSIMIAp~QrPIGIIEKIONL
NKLGINAVCLLPIPt~6171NPf1~KPPYi.CtiYWOYAPIi~IPPSPCRRY11YASDPGPSR
CPn_0378 126195 123115 EPRTLVKTLt~ECILVILDWPNIIiCLOGTICSLPItZDrPSYYILMOGNITNlfS00CET1' suU-Oxoqlucarace DahydroqHaae LIfIIMAP1TOWILDILRIMII>IJQOiVOGPRPtIt.ASVESRGPSCSPLQPAPVLOIIfI~LL
IVPICFNYFIlIDSSEFVCOVIISSIk~WIESMYQRFMNNETLDPSWKYPPI4CYpLGQMSPSEASTKIIA6PWWGGLI
lQ9CYPff1'LSPRfiSEWIiGPYRONV101!'LI~OQNI,IG?FA51lI8Gs ASTKISQLEl'IAFIZ.QCQKSOPLCTIYRYYGYLQSOISTLAP1TDSRPIOEKI71KIDLDDpQDIYPIIOStrITIS
IMfVSCNDGPTLCD'I1RYNIIKiWEANCEONRDGT011NYltYNIOT~IC1' VPS71GLLP1IAQVStfltELIEALKI(CYCCSLTLETLTCTPLLOEFVtIiLIOIKPAEpLI~PGILEYR>OtOLPNP
PLTLMVSOGIPMI0SG0EYAN'1'AF~BEiRMALDSNNfYfIi~/pL
LRSYIIDLCItA?PFEEFLOIIQ'I'GpKRFSLGCGETLVPtC.EI6N11YGSJILGISNYVGD01HTJUIPItJOIPL
CDLZAPRIGtYKTLFNRGFLStRCEISSiVWFMtPM'lliRpGNPLAPICIItiPK
AGRLNVLTNVLCEPrRYVPMePmDP
ANV:vArNVG~oDOLILTLPMSm~fLProIVASSOOCrvPONVATPSw:LOeFn~l'tsus NASIG.ESVOPIVEGV11AAIQI~tAGKEQ&SIJ1ILVHGDAAF90pCVHfCI'LOLSRVPGYMl~.lrl' STEGTLHIWtNYIGPTAVPRESRSTPYtTDZAIO~.GIPVPRVNSCDWACItJIIEYJ1L0 VRFJtf'SGDVSIDLCCYRKYiWBtE8t7DPSV'1'APLLYDOIXRKIfSIREL!'AQYLL~IOFIIDICPIIr0389 SEETL71SIEKEZOESLNREFOvLR;i'DPEPPPK1IECHHCDRLN4GELILNOCDNSt.ONt:TCT011 hypothet:ieal protein LFNNSSRICGfPONFlIPHPKI
11SLLIDGYMLRLTVPNPKRPYOKZ750RQta71'ICLRPPKXTCKELIEPRRRTVKLLKlNLIGLFISNSIBGF
~rODSIAGTFSORNLVFISt7NICD1'IfSPLYHi.SAEpGSVGIYNSPLSEYAIL6!'EYGYAOSEVRVSDTPVKpDT
WEPKIRVLL6N!'.bl'1'ALIPrIKGPYRIYGONVLL,DTJ1I000RL11VN
QALKTLVLWE1IOPGDPANGIVQIIFDQYISSGISDIVGLLPtICYECOGPPJISSSR7lLYmCiZRWGEFYPCIrQCL
KZEPVD02ASLPPNGI0Y0CSLY1MRKDtIICIMVSti6YPIED
IIxlYLQLAANWNFOWLPSTPVOYFRILRENAKRDLSLPLVIPTPKLIiRYPQCVSSIEEYLK&VLSI1IYLEELDREJ
1LSACIILRTALYEKLLiIRNPONFWIMtAEEITiYJIGIIGYtICQ
FTEPCGFRAILEN1DPNYDASILVLCSGILIYYDYAJ''lE.p~tRKDPSCLRIESLYPLALEFYGVEEAIDWCARLWD
SPOGLIIOApMS~pSNVDRIJ1IEGFNARQILEKFYKDVOPVIII
DLVSLIDKYSNLKNFIrt4tpEESI0e1G11Y0YtIPMAL.(lOILPEKLLYICRPRSSSTASCSAKE.S~IIEELDGE
IR
LSRQ!S.V1CMETLFSLR
CPn_0390 139171 438134 CPn_0379 416168 126765 ruv8-NOlliday Junction Nellcase C'I053 hypothetical procetn RKSZI~EGSYMINOVAVL~DKIIFDVSLRPIOGLEI~'IfCQHHLKERLDLPLCMLpRGRVPC
KNKKMLC?CSRIODGNPWMKSeer Yvr seEW)n,~,LVp((,KEISRIIOEEIRILEHHCLFtGPPCLGKTSW1IVAY1'VCKrLVLASGPQLIKPSOLrGLLTSL
Q~ONfPIDEIN
KIYEEKERLOLLKF11GEIEYVfPRRSPAX1'VYPDGPSMSDIEFIrEPTLTEIDZDPCETVRMGKVAEEYLYSAMEDP
KVDITIDSGPGARSVRVDLAPFTLVGATTRSOM.SEpLRARPA
ELF3.T~ECREDCAVEVDYSNEDDEDPFSDRNRWRRGGIIOPDANEttFSARLSYYSOpOLKEILVPSSHLLGIE71DS
SALLEI111fRSRGTPRLAWILLRWVRDPAQI
REGNCINCDVAEKAIat2LII~NCWEIDI1ILLTTZI0YY0GGPHGIKTLSVAVCEDIKT
=Pn_0380 416671 127876 LEDVYEPFLILKCFIKK1'PRGRINVTpIJIYDIiLKRHAIDR.tSLC6CQ
hanN-COproporphyrtnoqen IZI Oxidase KSTIPTICftIKTLSAIAiIIGDIIWSLIPtfLM~CMPL71LYIHIPt'CT1IXCRYCSFYTIPyIICPn_0391 SESVSLYCNAVIQCLRKLAPIQETHFIETVPt~O~TPSLVSPLDLKRILKEL1PNAREINo robust hamoLOq Dresenc in Genebank/EJ~L
as of 11/7/98 TLE71NPFM.TVSYLRQLQE1'pINRISVC1IQTPDDSILOLtGRTNSSSMITALOECpNHGKDOLYKOEKPIPKATIL
SRNLEVtO.DtIPKCKRQTLFLGRTSGRSALY5Y5RRILVLIJIAT
FSNLSIDLIYCLPfpSLEIFLSDWOALTLPITHISLYNLTIDPHTStY%HRKILVPTIANRCP
OEEILJ1F~ISLLJ1ENLLLSOGFORYELASYJ1KPDYPAKHNLYYWfDRPF'LGLGNSASOYLN
CEASKNYSHISHYLRJ1VRKNLPTQtTSEILPKKERIKEALALRLRLLWIDIJVEP'PSl'LTCPn_0392 139914 .~.MLTODVKLpM.FSYfbpGLAWRQGRLPHDTIAEEIMCYSFdcd-dCTP Deaelnase MSIKEDIWIREMAItIrIDMIHPFVNGQVNVNEETGEKLI3YCL.~aSYCYOLRLSREFKVF'M
"Pn 0781 12N836 418037 VYNSWDPKCP'fEDIFISITDONCIVPPNSFALARSVEYFRIPRNVLTMCIGKSTYARCG
aT326 similarity I I VNVI'PPEPEWEONV?IELSN7TPLPAK
IY11HOGIApVLEFFS51TCCV51fA0RKGK7f0 aLPNKFAAWfAPTESRSSPPTLLEETEPLSPNPIPADIOIPRITISPPSLDVSIYASSAKOQGItYPCV
EDI~VFIACCPRSSSSASVASOWELVCLCCGDEDPEPPDSEVRTLYVNGSWOTNQPJ1V0 ELLYIaEVRCFJ1VRLL'INOCSCNSPWPISPCRTLPTLDHPLC(~ALLTVWCpPPSAPEI~NCPn_0173 110129 AEFLVIFYCDIUPYI00ALTQSRHSPRLWVCISPTVPIOf;DFRVfINYRVSGDPPSSLOC<~f03R
hypttheCtc.U protein FGTPAFIICTtLPYS9CLECVFLPSIRCPSFIWAVRPr;EOCLVAF1RCE0VEDRtJCLSppAEKFLTLRNCORXFTII
dct:LpR.,~Y;,LSL'/FPARFtJIOTEKESIKSNM:SPYLVSNVSVRKKN
ASGLPfI:ERDt.AWTDL'TDPSsNSRLVEWWOCSt'SSOMEINPYPORePOVAtSALYAISwCPRLLEEVNIIfSWWV
IF.~,ILII,T',FV'IDRAIOELRTEELHLpSKVSSIC00IVSAQEKOR
~'J::.LSVEWILr4IVHE(a.DWIC'l3LIIl4HTTFAVRYFFLLFTNYt~SRERFRTARIYAQOLOLHLOIfWQD.~
.MI61ALLORII:LtPY..YKY.t.CVSPKpQ.~,f~OID
:r.YI.P.: f LVLVPUCr~JVLRK1,WMPpEILRAIF
tSA::TISGS f VFVt:GTRtM:fa:LRNRVp ..F'w/WVICrt:Ll'Vff:fVRASYROR1K:FIICFLOTVH(y:LYLPV.~.IMILt~IAIOVPRILVI'Pn ilf'~4 4A0717 A41u.q vlaItfCAV'IDUINK.~,aEENW:I::CDVWVI~TWFIta:APVLFVNLWFFVKSVLRH3RRRRRtly:~:R:' ltwnut pfotetnlHr:mvlY::m Iw>arprrll KI:'TMI VftIt.NPPI tt'.FTII'..Jf:FI
:L::y IALF::1.1f:'.f.t::IIYKR::K::KK(><H1VATLLIJIPH
..'I,yt~t 170752 4lnnlo HLLITLLF(.'DIC:WLAfONCPAILFr:LM:l.Mr11"n :l.l'LAITf.tIa:EII.(HCAVALpfNfQ
y.rm:Jyt.U.-::nMfwL.mHnt MrrrttycranatnraaelA::~1IAPLI1...'VTKIFNpLWWtaV~':fsrvw~tltt::Kr.JfOItVIVELKFII
W:x:KOIf:V
' I'VTL
VtIpEE?nLLYt7YL,~aLvDCS:IMERtIC~Ftr~4ll.l"II)IIYPff.ENLY1.LF::K(Hk::NVPIf:Hf7N
ILI.fTITft:fAAVETLI':CVICELVIIRLpf:LIVE.iDI&7CAAFLSLNKIPEVIIKPPWI
I::Y.IIAFL1KANDFYLEfIVKHCEN4~,LI:DAfiLM:IJ1DPCA::LVARAItAIIUiPVMF.SCPIA~NWI:Lt_ 't'AN:a.LlJI0KPl4.~.::DfiLi.lLLYYIyYMI1~.'PI::AKMAI.I'NIIMND6TIJ*ti ".:fTI.AItAL;:(:LI'::~E:F'PF4:YLPSY.:PKERVK:iIKKMT.3KEV:uTS'Vt'fET.~.IRtAIYTFEt >!'Y(i::Itl~.LtTUEDLFEIVN:EIVff,NII1K11.'l'Pf:a:Al,VtIA:7~:pL1:LftRF::F:IYDINL
:LIJ7fl.f'::'IAI:Ia'VA::I)L:.uP:iELVLTAQVYI$yp'~EDLC..~VK7vtTKVPI'IFLFHIPNffNNH
IATGkIrJt.tEyIf:fIPTfrl4Yl :'Wt7flIJ.l'~VIJtAAIYINIftIIVYIIIKI.YIn l, "Pn_If'I'~ 111955 141175 NYrJ7AMEKLLtTDfVT:"' nLDKK:'IERLYA:.FpAW
1(LFFLT.:RYYK1'AAP:.FSD
' .
CT257 hypotnecical 0roceln yEr.ATALFSIESCiIPri .DNY
FOAPYLLCC~ICAfVW::-:::.nLLYSKSLP.:DL:.:::;.x.
' ' CNC?fMSALFI~ttCVNIICIVLOCPYSllfOIACISFNRVRLOYYLTKDF(KKARYINFLZRR:
R:iLKDp,YAEP i7G IAI
YRFSPfPIAODLIfCYVOPRSFPNAIfERELLFE
~LI:FLTIIG1L~ 1&MA
RWiIfDPRx ' l PYRL!'~7JIIGYMIALAVCSESSRNCIfRAIGITPDYAPFTOIFtWIFAf3.LPLTISRIfIr ~falI
f OKELEAOCALTSVJI
SAPE00NH,1DFLAPPJIDIfHCIwAWEALIfItYY00LMSL
DLIEACDFKIVM:
PEKLALWCI1PILY'fSHYIFYPLIOLICSLT.F~.I:IYLIJ'IIRKEKfltSTLSRDEFOK71LCTHr SCDDIWDL
HEED'!11'IATNIF.iw~11TC1100VCQPLEQVTIQ.PSSANVKDFCRTIfO'JfDINFIPVYNK
ARIWVLCIAHPKDFVNKALDEPLINNWSPwFITAKSKLIRILKEFRDNR55VAWWASCPn_0108 ~EPLCIL:iLNAIFKILFNITHIIWIJIPK?ISVIERTFFGFISRIImLOKLLDIOFPOYPVECT10-hYpotheci~al pratetn ~-~
. -"
' "
".v~n ..~." \i.-t .'."... ........... , ...... ... -:h:..":Y1Il,:-: up: ~.vet ~,r, t, , .:.:lr-~
.:a':IJ!!'i F, ..... , .
.: _ .
:
:
' _ ~
. .. ... . .. . ~
.
.711~'v.
~':VWAY!w..
r_Pn_0396 111J1y 447741 EFPPDTUINHi.Wt;tulxv::.:.:.:a':.:~t':..wY:,::~::W(.VI7x.L
yhf0-Nits-related protein LVLEASIRIIfWCVL CPn~0109 151615 1551:7 ' PPERCLLEFLOk?FLIEC:YJ1NPSSVNOLGKILSROCT3(i0 hypotheercal protein YSMIYLCtMRIfl WPE~AC
V
SY YINI
SFpCRVLYTSCATESLNLAIASLPKDSHVITSCSE11PAILEPLKIiSSLSAN
C
' ' VLTIEOZERAVTPKTSAIILC,IVNSE'FGAKADIAAIANFAOEROLOFIVDATANVa%RILT
p~f.PLFFVIASa fiJONNLTKfLKSSDEEPFLERFS
iEfLALICY
O
N'Curt VLPSfrYf!liUlfSCMCf'H71L~IGALLVSFGVKLIiPOLWCOGQOCGLA7IGTFi~tLYI7IASLLLPY~JtESNK
ASTARLLHLLNRDIDIPGFf?IDEEOCLIfYRLVLPCLIRittIICIIi.RIYI
YIFKYLDLHOERISOEILTIIRNGPEKAIIWIIPOVNZIfCADOPAIIMiVSAIJ1FPPLl~EVOn'IffL.VCDSFSH
AICLIS9fA11~ILDCLRA0AL0E00EKRNE
LOIALDIECIJICCYGSACSSCATAPFKSLVSNC1IDEELTLATtRFSPSHLLt.OEWt~IAV
CIIEKVVCRL1045 CPn_0410 155087 155833 dnaQ-OIIA Pol III Epsilaf Chain CPtL0397 145124 441)81 tIVRLFKSWKKMfIISSQI1~VLIFYDTCrCC:OIERDRIIEIMYNSVTDESPLTYV11PEI
PPZC Dhosphacase family PIPDGSKItICIITDAVLSAPKFPGYDCFRKfCGEDSILVAlOS4DCFDFPLLGK1ICRRN
EHPVDfDYFCLSDIGRVRAANEDFWOVNWSQWAIAOGVCCRfl;CDIA50E11VTSLNELSLEPLTNRTIDS4KWAOKY
RPDLPKNNLOYL.RpVYCFAEt4pAHMLDDt.'VIUOIVFTSLI
IDEOQSKLNGYGDDpYKETL10CILLEYNCWYEEK%4EEHf.Oh~'1'fLSFIOFRI~tAWLCDLPPQOVLDLLOOSYN
PKVFxNPFCKYKCOPLVDIPKSYFENLEF~CJ1LOKPEIdIDZKJI
FHVCOSRIYRIROGELRRLTEDHSLf140LKNRYGLPKOSDKVYSYRHILTNVIGSAiYVNAIALLMOPT
PDIANLPCEKEDLYCLCSDGLTMIVPDZDIRDIIafOPATLEER~I71LISLaNI'RIiG'Od~fA
TWLVRIO CPt~0411 155794 156609 CTZ67 hypothetical protein CPn_0398 115518 15700 RHQSRYSSITSTDNILTAAFSPCPNDIFLFRSFLfmPOFRPLLNOVTIADILTI1F1'LU.O
No robust homolog pnstmc in Casebank/ENaI.RRLSLNKFISMLFPLVSDYYNLN09CM'LCYNSCPIVLSLDPECSLDrtaTPCtOfi'1'JWA
as of 11/7/98 IEELPFtQIENSSILFAEWFOCWPiFSVISAPWFLPCxTLIPKEKVTIIVPSOWStSLSOLCKLYYPKJUCLIPMPY~f ILSIIILOCINOCGALINEERFSYDLOLTLRADIGI
p FPLPL~CWIAKYVPM711vDJ1LTAALRKSLZCSLKDPITaGAKAVEYSKNIQAIIVIfUt!'I
CfYIIM~I>!'OLSIITDKKJILIHQ.WlI7INyC(:pY1' CPtt _ CPet_0112 156515 457216 Cf253 hypothetical Drocein YKLGIViIiGKSLNCFSIDLItSKNFPIU1RIFCKISNLA'NIMtKNLVLLASLGLLSpTLSSCT363 hypotheCiul protein rTHLC71SGS7MpKLYTSS~S ZSKATYAS
EPZS'lIIKPfNYLKfGKKi.YICSCRI~HIVNfPKKZLCNADY%ISPLI~TpIN~t EKVFLIKfMASP~FYAPIANRLPETIfEOFLPAEPIVATB.LEOK'IGKF~IGYDSVI'LYSYAC1'DYHLDLYIVIIV
IICSTAVWaLpSYCpAYTDYDWINPGF11CRCSPEIII~OCIf ASVRVRVIDIRHNKZALIYQEIIF7CSOPLTTLVNDYfCtYfIJFISKNFDSTPIGtJbiSRLFRTIDCIANLTfD!'P
PVLSFaPPYIFDALPDSLPKSSLV'tSPVLYftYalfOfIFKII~YA
IASOA71ENNIPCSFLKITSDYTYPGDCPFSRLEEVS01(LTQT~.YE<i.PIGJIfhIJIIPItKLL
LPCP ' CPtL0400 416537 417306 C1'251 hypothetical protein SKS~ISKPILLt.SIGVMtaSKNFFIWPAPSC1(TPL1QRQVLFGG11LLVFSSLVALSVSSO
TABLLS1111CISLAFAFLFYLLFLPKDZTRAILFSCERWXTSWR7IfGSJIIRIMIIIIPV
'1\7LICINIISKFL?LVLPTOEZH1'QEYl'pEVpNSLPI'~NYISNILNtL:VLTPF'CFiIIFFR
GILQl'FIJOJKNTAZMYtCSSIIFSFIHZENSLCSWVFVWLFVFSGSACFLYEI~fIIL
SPIALIGLFNLTSLZ.FLCIK
CPtIr0101 147881' 447195 GT~SS hypothecif:al protein NRDHAlSIQ.I4'IVRANVVECRCPWSiQOSLVSNVEHILCECOEFHEJ1VG.OGKTVOEVaSE
AO~IGTLVLILCFLLEAI7CVi.ivSED17J1HEAFlEfILRRAAPYIFAEDYKPVSIEERDRIJfEL
AIGOtBI~ES'f mutt-lldenine Glyeosylase NPIDtFCNTKI11FSEKAIOJFI?VEAL<CKWFEIOVIfASLPWRDNhfPYSVWVSEYFILpOTRJLEVCP(L-.011! 160103 159172 VIDYFI~t~RFPTIESL71MKEF~IfIKLwF7CUYYSRARHLL7lRMIl~EFIIDKIPDDaeU-AeCOA
CarboJtylase/TransEerase Alpha aISLRQI1~VCPYIIIHAILAFAFKRMMVDCNVLAVLSRIFLZ>:fSIDt.ESl'R'IyRIBRILCLRIVCIID'IILF
IRGENIWELLPN4CQVVEYEKJ1IAEFKEIQIIDLNSLL658LIOID.~I
AQAt.LpNKSPEVIAEALIEIGACI
FVLPVRNAAKKVRLWfiJCEKIYSDLTPWltAII0IC1WPSRPRTVNYIpGlICCEFVELCGOR1'FRDOpAWCDF
IFLNRLVAIVLYt7GSLWEIDtRPKB~41AGLYEFPYZEVEPEDCLQDIDDF'fI~IBLSLESVKZOOORI~.IGOBKG
CDTJ15RWRNlCNLCP~FRKALRLCKLAEKf~CLPVYILVDTPG
'PLEFLGFR.KRHAf'CIMKVHLCPIIFKATSLPOFGEWLLSDZDHU1FSSCNKICIKDJ1LAYPGLTAEERCOfWAIA
iDILFCLSRL11TPVIIWICEGCSGCAtGNAVG0SYA14.lNEYY
LIYtGOVRSRESIGV
SVIBPGGGASIWKDP100'1SF~1ASM..%MICENLKOFCIIDTVIKEPIOCAHHDPALVYSN
VRIFZIQEWLRLKDLIIItELLEKRYEKFRSIGLYEtTSESGPEJ1 CPrL010) 119009 419710 yeeC-predicted pseudouridine syhchecaseCPt>_0115 161522 460221 family NFNpL&NOKRMi.OYFME4F'SWLiLTpVSRLSSFLRSOLPNISKOEILtISIRONRCRVNCFCT266 hypocheclcal protein IERFP.SYKVOPCDRVSLSLIPST100pPSILWEDDYSIIYEIIPPHLTTEQNAHNTRF!'CVFtSOIGFL.PCLTLIF
YIIIVWCNAFLIKLCVINCLOSRLQHCIEVSONSNfOSOVKOFIYAC
RLWOGTSCCLWGKSKOMTELF~LFKQRKINKOYIAFVFCNPKKKFGTVKSYTAPVYAAODKTLROSVLKZFRYNPLLKI
HDIARAVYLLMALEEGEDLGLSFLtiVOpYPSCI1VELFSG
C~vAVIFCAAGPSOGEPtKSAYIfWDCI,IVILLSE4f5TfDLIOJSLPRSSAL55lIL.TPGCFPWIfCLPYPAEHAE
FGLLLLQIAEFYEESOAYVSIOISHFQpAL!'DNOGSVFPSW90E
NSRLLKEKTTLSOSFLFOLCIIOIHPE'ISLEDPALCFWlpRTRSSSANM9CGpS8IG11Y
CPn_0104 150967 449871 SSC09GVIAYCPCSCDISDCYYFCCCCIAKEtyCpKSHpITEISFLTSTCKPHPMPDC15 No robust homolo9 Dresehe in Cenebenk/EMBLYLROSYVHLPIRCKITISDKOYRVHMLAFrITSAMfPSIFCKCNNCQWDDPRLASCSLD
as of 11/7/98 ELEALCOKYCKAVLLIALSELCID'MSLLSCNALEGFPPIAEVNAACDRCSMDFCEILKSSY10CPCNDINILGENDAI
NIVSISPYMEIF7lLpCKEKFWNADFLINIPYK6OGVlILIFEK
QSMDWADMSCVDCLIADPFWSTAIASGIAKSSLQETEP'ECESKVN1.~SSWCEQGAQVCKVTSEXCRFFTKIW
SPFNLERICMSFPSLKVFSLK10JGCENMGIOLariSCWJLWSIFFVATNGCS1'PIWTTKE
NIJIALVIILVLSHYOCYFVPA1'CDPORCNIIIOJPEI1NAILAAGNCNRVDLERKRCCESSSSCPn_0116 Iu1871 161557 RYLELWtCFENSLTKTSLISDAFaAfpERDKCLLONSTSLI~sfrACWWRPPVPTPSGVThim0/lhfA-Ihce<trotion Hosc Faecor Alpha AfiPOPOPOPVVfSOPSGLGaRERSPVSSRCRFPt'VLPLSVISPRSHPCAVERRDLEDEEEFJ1LSNNATMTKKKLtS
TISODHKIHPt81VR1VI0NFLDK.YTDALVKCDRLEIRDfGVf.QV
EVI~ VERKPKVCRNPKNMVPIH
IPARRAVKFTPGKPNKRLIETPlIKHS
CPn_0105 151814 450960 CPn_0417 463017 4ti~~1 CT105 trypochetl.cal protein amiA-N-Acatylmuramoyl Alantne Amidasa Ntf,TfSHSRVLLI(KFSKEF'fIRTYRSLCFTDYLCu~LTNPLCKFPSPONPOWTIApSSITREKCJIKLTKYLNTKO
LRSNISRLFVRY.iLFNSKOLSFFAL~VIGSNPIFAOTPNPPpRVR
PpAVS~uuAWGFLO'fOGAASSTATTTTASCASAi.rL5p0pVOALLTNLLNYGOP$VOQPSTR:uEVIFIDFCFKxK0 0CTAS%ELHYEEKSLTI:
IJ1LTNQ.>ILKPMfiYKPpLTR330VYV0 ACl"'aCA :.~.S.SA~IQQpLLOLI
LDKTTCSCCSSVSSEOLOOLLSLVSOItT?SOCCSOCfOII:KRVAL~,NRCQc:OVF IS
IHCNHSSNAAJIIrCTEVYFYN:I~ICSPTRNRMSEVLGK(iILAA
aCpMSVLL.NLLSATCSAAANPI.CfAAsLAQIIYMVTSPCAK1ITSEfCYNYCCE1'CpGNMEKNCtLKSRCLKTJWF
WIRDTSMPAVLVETf:FISNSFERAAWDARYRMHVAKCIAEf:
~'C(:P1'~CPDCOCCCCCFCRFFCCVWttNCCCLCEC:.OEPAIPLVHNFt-:r;PFOKPKONtAK fRKPOIOild .
<'f9~ 010,, A519b0 4529~s '. Fn-IIIIu .In4111I 16.51 t.rbl&uJyl-ACYI-t:arr(er 1rorein murk rf M:,'rylnnm.umfylnl.~nYL)lutamyl R.ylu~cose DAF Liau:r tX:FfILKIDL'fr:KVANAc:ICD0f7CYl:WCIfAKLLr\L;V7J1TIIVCI54VFIYKIFSO.S~iELCKMIJIJ( ht..IJK:V~/(KIYt:KVRFLEVRNLTROSRCV3VCDiFTNIKrIJPYDt:NIA:AVIL~1.ANC
F'NP':RKf::Nf~fLLEIAKIYCNtM.:FD::FFDVPECIAENKRYKf:ITt:FTi?E'IAfQVKKDFAf\tA::::I
.YNI'Ff::l7VpIfTf'Nf.F.ELFJ,ELSAKYYEYP.~.3FLHTfr:IfCTNiF7~I~fCLI
:IItDLLVIL:LAN::1'Rf::K::Lt.ET.~.RKCYIriAL;.L:::T::N::LL::I1F.:::l!?IRtX:iTI::L
TKALI.U:Y!':KI':a:1.laa'(IvJII4;FJf7VtFfX:ITrtTPALV~FYLATM/It4NRIM'PMGW:.iI
YlJI::MNAVI'tTIW
1X.I::::AKAALE.:DTKTfr\WEh:RFW,tRVNTf:'.w:F4l::RAC7KAICFf':IJI:a:I'.VA'i'ITJI'I
rpAVI:ITIITIJJ111.DFlKTFT'NMItAYLF::LV4f':iteMVIffrpSPYA
F:HFIVDY'ItJFSIAFII'FrVINAI~tV(:AVMFLAiFLI::AITf:ETI.WDIiUANVFK:I(:PF?1FPK::y't F':AKAI'Vf'Plv:ll'_'..\.1V'IPATDfuL:'.:::a.TKYTL'llr:p(jKIA':::::::FIr:KYNVYNL
!>:: LAA I:. I'VIfA;:1 Ja ~I,LF:IH.1.1:K
Ir:IJ.'r,Ff'It :RLDPVLFIfiN:F'll IU'IAIrfFItAIJaiVl.'117L
I IF:1 J ~19~Y 7:It4I W F: ; s Y
.I ntl,k::Y.kYLMA~ WERYCFA'/'lf::ONII::H
FhF:L l'/HP tt:Or:
~'Im 11111'! .1.l~li'f .IV2H5H
I"/::NFI'/F'I6I11<KyAI'I'YAI::111::UFLIVI.IAC:KCHIiAYrJlFKIIrjP/AF'INnYJI~AFYLA
IIAU ::"Wrt.lmfly hV'It'nl.ru./Idw.:ph.ft.u:~::/V
.:KHGPS I::KARTf.NtR.GL
CPn U4L'1 466997 464A~6 r ~
1 ~en ~ Aii~
IIa Z81 ~7~
~9La ~
~w~
~
e t SYRK~IVtGIffALYIILi.VLRYYKIOICEDtMtWAAE.
3VPYPN eswnc FPIIINIwtItPOKKV l"r-QL e c f p .
/
fo No cbuse ho .ILGOiIEFCVROPFRRGTFFAIA'IVRKCDKDLQOPFAVDITKFNtGADPLAIpECHRI~IIKtf4ll'LFKYVPRSR
~IPDTLTFLKRYS.TJLLHSEN:LSYRIPAKYInIIiw?SIJ1VAPAt?
r>.ILpPIDGG"'Y'..~t~LKL~fKSIIYCKLIPLLDVSVIIDRLSLWWKGYATKNRLPZN71LFFLFSCE'L'.iJL
RLCAL'fIGIAL1ICVL:.TIVVYCIA::KIA".'A::KKPPSISRIEIV
ITOYQRSYPF.'KL1JDOVLItTLREIKDCKTGKAFP'fGQttiAYFIB(IL6GOVCERKLLJISPL
4749\7 47 ;514 NRt.D4TIRVIKLPKOt:.'.OiYLTiNPVIt7t'IAI~ELtiRGVL.FaKA~00GRLILINSCICEIG1 epn_Oil_ . .. -..
.., !':a~:: ... ..,rr::,_.::~7f!'i.'7::I"::'1 ~.-=i a:.::YYf .':," :.:.\:IEEw:;.Y..'.:, . ..:: m, ~ . . .. . , . .
~
W .C . , . ..... _ ..i.~...:, ~y:W-.
..Ir : :: ,'-GS~f:.vR:: ~~.
.I'tvrr:rt'i: :'' ;:Ii':::::.%:.::1t:..:~a174\t.:
.. . G' :: ., 1.
' ' ' VAWYQQKLLAL:IPCRlCTCIfLpSFJISGLVPSPNRl7IIt~.SLESrSLS'TPYSLANGYNIU1:a:dvWh ::.,:ML
i F 1.;N:aLr.H I wi'AHla .;LFLIi:aAPLvi..: Lit trlA:?.:
GIpMVOAYAILANCCIIAVRPTLVKKIVSASCEEYHLPTKIDfIRL.FSEEITAEWRAHRFI' TLpOCSGPRASP%HNSSACKl'Ci'fC101IHf'KaCPtA_0433 4773.7 476929 YDKRRHIASFIGfTPVESSP~NFPPLVIG. ' ygIODpEYCLRAOCIKNYNOCRCAAPIFSRyADRTGLyLOII.ppKKLpNCp=p~,AAt~DlLystem N Protein OesN-Glycine Cleavage a YEEAtJRSPKpOGTR
RTFRILYGTLIR'lCSitKV1811YSDYHVWILPVNERWRt~:LTEKL4pKNLCAILNVDL1SVG
SL.CK>'~.EVLVILESSKSAIEVLSPVSCEV:DINLDLVDNPQKINEAPEGEtfIiLilWRt.DQ
CPci_0120 167120 166124 ~P~~~
CT271 hypOChatical protein KSFPtdJNSRFLRLCCCLCFCGSLFYFIfINKONSLTKLRLEIPCLSVRLROLEQOIIISLRFCPeL0134 179471 LIDKIEAP1EiAALPEYQYLEYPSEESISLLSYELPCT213 hypothetical Drocein RPMfRIYOpDLPCRLCRDPAWFFSLLSFTLRFYCLGRGWTLLSFIYtOpKKFICIVIAW
CPn_0121 46A007 167108 CIfSGICVWCRFSRKCSAE'~TSRRI~fPT:ASGIWYVEKDFNAIBUtFPIItIGYPfI'~iPRA
yabC-P8P2B Family mechylcransferaseWNfINIIGLL?DYFLZTRVGOG.FLKVYNPGFJCFSKEKAYpPYRitPOIIpPISS6iVNIt SS
EILNSERAHIPVLVEECi.Ai.FAQRPPt7fFRJ7VTLCAOGNAYAFLF~iYPSLTtYDGSDRDLAPpLLEILKVlOOI
ENPISKFTiFLARAKLFLLERRFPHYVLROfIZ.IYRRQMF11LPPDiAL
QAIJ1IAFJfRLtTFQDRVSFSNIISFEDLANOPLPItLYDGYtaDIGNSStpLDI'LSRGPSFQSRQ~LRLFCY01'I
OOWICOJIYLSAAVSLLIRFIDEpKKVLPRPSKOFaRDDIYdtaKNA
GEKEELDHRI~O'fOELSASIriR.NSLKEffFi.GRIFRE1IGEEPpWKSAAKAVHIFRRIfOCILYT1IISK?MEPS
IJGFCEISTfSYFQFLEISESEFFt'ItYRDILLCKMLti.IQOGVEFDtOPL
SIODVI(F~ILLGVFPIiYRFNRKINPLTLIFOALRVYVNGtDROLKSLLTSAISWi.APOGRL1TFFVGGKDSIQVEF
PRLPKE1ISFKTKQELKAFCVYLIQ,VSLpKSDBt~VPNEILPIRTI
yIISFCSSEDRPVKWPFKEAEASGIGKVITIDCVIOPTYQEVRRNPRSRSAKZ.RCEEK1LS0KAK6PRLVCRRFSIDY
KRVIILODW1TVPNVEVLNYppNSENFQEILOp!'PDVCfCOSYK
p!'pICJCPALRDKISLITRKEILMRPERIL4SL.QpVPK09pEVLLSAGIDdSALPCISOCQ
CPn_0122 161233 1617A1 OLIIKYLLANiYLDLYSODACrYY'CIIVNSSFCKEEVLPYREVWtDIJISOLLTSN01Q.VD
CT273 hypothetical protein 1~RTRY~~~~'~~FSWSL~LKTIER~.
GLANVEIFNYSTSIYEQH715Ht4RIVSt~'RICEIO!!~'.ISIRDVAIDSApILtIOIPKPSALTSPORDRIFSIIt VCOYSSVINSPNDGPCYYOCLSttLLYDRPASV~CL.FIaKSOLDEiLiGS
L:.pTNpKSNWACFSPPNNFYKQRFSTPYLAPSLGSPDOpDEDIDCISSFLIiVLTRG1~'SYYIaRFIEQCWR
RSOITPFLSYKDKEtEF~EDPEi~DPRVOQGKVLLGLDL.fvKSTNVMIDYVISRIFO
gypG CPtL0135 110908 479175 Plfospttolipase D superfamily (uncleavable leader peptitlel CPn_0423 1617A8 169216 GYtI~'RLRFRLMLGIFFILLVPNSVSJNt'1'IVIISIIItOtVCVLVYDNSVp~IiOpILDCIDII
CT271 hypothetical Protein ANPYVCLCPClIIGGRTLKllNDIft.EANI'QfLVPBICSYIIIQPTFTpiIEttLLKAL1~RN
CHLDNEWKAIL~WGDOELEELRISG1ISP'LitQCHYSKAILPFEALVILDPLSIYDNpTLGCPNRPPYV!'ICCPPST
LYLQIGENSOALiWt.DQ7ILRNQCONLPTLISJIITKALPCLCRIEE71TAIATYLSSCPIPAINPRLIVSCVRRpLi i!'RDODIIG.RSTAfGt.OLRECID!<GOPAIWD~YYA101AiPItAG
ANDAFJILIJISYSKATiDD~tIUILVR
ACPPLTLZ'3aAEETVfPGFDIOtEDLVLVDSSKIRIVLCGPItDICpPNPVYOEYLKLICGiIRS
SVIC.iIlOIYFIPKDELi1071LVDVSlI9d~rVIiLSLITNCCIIELSPAITOPYAtIQIItDnPALL
CPJ>_0421 469528 170961 YGI~IfP4WlUd11C61t41CPY~tVSIYEFAIWC1'QIJIKI~NIIDatIPYIGSYtII~miIF
dnaA-Replication Initiation IaeeorCYL4IWIESPIttfAAItAIDfVR'iKDIGLSIPVSNGDIFbwYFHSVNNTLCNL~.TrMPA
SRCHEIFSPSLIK111VDCIWLSFIMCE901LTCNFLNYVKTRCSKTAFHrWISP
IOVLElTQEKIRLEVPH1 ALCf'WAGDDCPSAPVCPIlr0131 I1i33 110902 ASI IEGPSNOpVK871J1VGLAGKPGRSYNPL 1p1J1-Lipoace Proeein LSQUe-Like Protein FIIiOGVCIGKI7fL(JIJIV~WiYVRt:NNNKNLRIIK:IITilIFIN~VYNLItSKSVDKNI0~1FYRSpYVC1llI
KVRIVDfQKSSAASNIWtDRDi.LESLQOGELILHLYt04D1PCSLTYQWl411mt Lt)i.LLVDDI0FL01110NFEEEFCN1'FCfLTt~.~OIVITSIrifPPSOIiC.SiGtIIARI~IIGFLL.S4YAOI
.GLDA11VRP1COGIVPIDOODYAFSVtIISATHPSYSSSVLA~tylflVIISPVAKV
LVAHVCIPDL1C1'RVAILOHKAEpIGLLIPN~IAFIfIADItIYCNVRQL~GAIl'iIG.TIIYCRt.LEIVFRIGQf ZaPI;~iSSSRDSGIiPCNIUITSKYDVLFGDIDCIGr3AAQRKVOpGPIJpGS
FCKSLTE1911RETLKELFRSPTIC~ISVEI'ILKS1fA111FONlCLNDt.KGNSRSKDLVWtQLFL90SSSETYORF
LKP6YLEIIn0I0IHAFFPLCLEA71DEVL,pFJIRQOVKiA/II~IC
IAMYLiINTLITDSLVAIG71APGItTNSTVLY71CKTILt0~l4DlcfLKRpVM.CKNNIVCC~.t.
CPn_0125 170965 171561 CPU,-0437 41110 11350 CT271 hypothetical Dreceins elpC-ClpC Protease FRGCPtffRRTCIIGPFEDVOTLYEEETSSPSSYSPYSRSERPETPPSLFdJPKASE7IRpLNlfplinKPTNRAKQVI
KWDt~IQRLNNNYLGTFJtILLCLLK1.00GVAVNV1~IL4I0PDT
HNLTJ'.R-SSLPpWSSTPRTESLLPLEEPETTLGEDVTFKCEWIIILRI3.RIDCI'FflGILVSK1IROEVICRLIGYGpEIQVYG
OPAL1CRVKlCSFESANBEASLLEIOJYVCTOILLLGILNI~D
GKIIIGPKGSMUDIOLOEAIIEGW6CNITVSCI(VELRGGAIIKGDIOANTLCVDDGVRSVALOVL~ILIfI~RLVRKC
ILKELETFNLOLPPSSSSSSSSSRSNPSSSK6Pi~liStGS
ILGYIrIIACI'1'DItSERZGtDL
DKIIGC.SAIJUIYGYDL?EiNRISKLDPVICRSSEVERLILILCRRRKNNPVLIQ6AGYCK
TAIVdriL710KIILti1VP011LRKlWLITLDL71f1tIAGTKYRCQFEERIKAV10E11RK1A.11I
CPn_0126 172111 171536 LLFIDELNTIVC'.iIWAOGAIMSNILKPAL71RCEI0CIGATTIDEYRbIIEKOMLtiRRF
CT277 similarity QKIVVJtPPSVDLTIBILRGLKKKYECNNNVFITEEALKAAATLSIlQYV11G1lFLPOKAIDL
NVLFSLLFPKLCYGCOAPGAYFCSNCLEKLLVEDREGRCLNCFRYLCSSETRLCSOCSPSLDIfaGARVRVNlIDQPTD
IJOa.EAEIENTKtJIICEQAIGTOEYPXA71GLR~EKKiRERLQ
SQLQAP'SLYLPSOTALSVYARACEQCRPALOFFSKSIAFtG.ASt.DI:TPSCIAYITSTISRSHK~?llKl:<EIIQ
VPVt>EGVAOVVSGOTCZPS71RLTEABSFJG,hKLfp'ILRpKYIOpII
KIWEVAKLEKLLRIPLWPWLPKKRQIEKLPKGEGICFL511YpL~KWMQTIVGGSASPLtlilVT8ICRAIItRSRIGI
KDPNIIPICSFLFtI3PL~\/GKSL~LIIQQIIIIEHF0GA~11LI0~
VSISLFLSQNDQ
SEYI~IfFIUITKM~aSPPGYV~tL~HLTEOVRRAPYCWLFDEIE1WIPDIJ4r0IL
OQGRLTDSFGRKVDTRHAI II4li'SNLGADLIRKSGEIGFGLKSHH9YKVIOEIfIAtAIOtK
CPn_0127 472157 173715 .
HLKPEPINRLD6SVIFRPLEKBSLSEIIHLEINKLDSRW041fpNAtJtIP06VISFLVT10C
nqr2-NAI7H (Ubl quinonel Dehydros:alase NSP~4DMPLRRVIEOYLEDPL
'~YR~'OFJIRKLRATLVFNRVAFEREEEOpEiUIL
aVCYVFERVEASTFLSITHLKKFINSLWKLCpQ0KY0Rf'TPIVDAIDCFCYEPIETPSKPPSMiLPS
PFIRDSVOVKRWIM(.WIALFPATPVAIWNBGLpSIVYSSCNWIJIEOFLtII~3FGSYLS
tvYKEIHIVPILWEGLKIFIPLLTISYVVOLTCI:YLf'AVVRCNKIAGOLLV'l'GILYPLTLCPn_0139 PPTIPYWNAAIGIrIFGIWSKELFCGTGMNIWpALSGRAFLFFTFPAIOt~DVWVCSNPyebF-PF-loop supetEamily ATPasa GVIKI>,SLt41001SSTCKVLIDGFSQSTCLOTLNSTPPSVKRLHVDAIAAI~021tIPHVPirODNLTLPNPP_OVR
EINOOlYIVANSOCVDSSWAYLPKKFTNY1CVIGLFIODafEEDSDOCLC
'JIHSQPSIIrII'ETHPGWVLDNLTLTOLOTFVTAPVAI':OGLGLLPTQFDSAYAITDVIYCIGSSTKDY1CLNERV
CLpLDIPYY1VSFAK6YRERV!'ARFLKEYSLCYTPNPDIi.CNRCIKFD
KFSACNLFwK:NIIGSLGETSTFACLLGAIFLIV1GIASWRTNAAPCICaFLTGWLFKFISLIAKKV:ELCGDYLATGH
YCRLIfI'ELOE'IQLLRGCDPQKDOSIFLSCTPKSALfONLFPL
'ILIVCQNG/1WAPARFFIPAYROLFLGGLrIt~GLV!'NATDPVSSPTItIILCKWIYCFFICFHTCE191KR'F1RJ
IIAApAALPTAEKKOSTGICFIGKRPFKEFLEKFLPNKIGtNIDND~I'KEIV
IVIRLINPAYPBGVMLAILL.GNVFAPLIDYFAVRKYRKtiGV~HOGAN'f'fTICORRCLDLGGSdIPCYVKiINfIE
ENSIYIVRCEOHPpLYLRELTARtIli WFTPPKTCNCSAKVRYiISPDEACTIDYSSCDEVIfVRFSOPVKAVTPOOTIAFY0C0DCL
~Pn_0129 173719 .171481 ~~.lIILVpNIPSEC
-nqr3-NAat (Ubiquinonel OI(idoraduecase.
Cenma-NMSicC;SXHiIIRINQTWYIVSFILCLSLFAGVLLSTI(Yl7t.SPIOEQAATFDRNKpNLLACPn_04:>
IA5523 I8ti077 ARIL.7FKGRFOIOEKKEWVPATFDKKTQLLEYATKKVSEVSYPELELYAERF~IRPLLTDJ1tM rotma hamolog present in Genebank/NBL
as of 11!7/98 QGIfVFSFEEKNWPIEpFEKYQESppCCOSPLP!'YYILEN'I'SRTEHNSCADV11KDLStIrQIiSSNttf.'ILFV
SSTLNCVFPSSLPEESADLFITNKEIVAiGEXCNVFLTHSIPlOIL~UIIT
ALIFPI~t~GLYJGPIHGYLGVKNOCD'IIJLfTAWYGO,'ETPCLCANITNPEWDEQFYGKKILLVIVALA:IAIICL
GCYSCSILLIAVCIVLTLLTLLCipALVGFIKFLROLP00t.IftTf FLpDS.~CT'INFATTDIGLWKCSVRTTLCp$PKrILSAIDCISGATLTCNCVTFJ1YVOSLQFIREK.IRPEs"SLQL
YTNAVRKTTQDTLKLYEELCDL.iOKEFKLQSTLYQKRFEL$INOfC
Av.YROLLINF~tIt.THEKKTCE ' K'fNON
::Pn 942'1 4711:611 .1'571 CFn_0119 INfi9Ht 491:74D
mlr4NADH IllhiquilKmel Rullwtase tM rot'::c llaaalol tr.tctnf in J rayusAink/EMLL nr. of II:7/7R
KRNPFMfwKK::YK::YFFDI'LW.~.WJpILIr\ILCIC;.ALAVT'tTWPAIThK7IA'J.~.IV'~':C::WTIM:
IKMAT::VAP;:fNPF_:::PL:IIATEVIlILf?iAIITQMiPIPMW1ETPR5KLS1'IIN
::FFV::I.LRKFTPtr.'VRtIITQLIIt:LFYIVIDQFLKAFppDL::KTt.VFIK:LtITM'.tVN'fl.c.'FA:
:::.:.LTIt7GTt.'..1~7YY:YTf~IWIIty:Ir:II:fIVLTt.ILALLLAiM.KNKQTIl'KL
~~Ita:LARI1VTPIPAFt.DC:FIISCLt:'riaiVLLVtt:JIFELFY:FI'ftJ4c:FRtIFQFVYA::t:fIUEi :a~:.I:SIC:x:FV(IRYI:U4F.~.t'tY.aVIILtELT1'r~EKTRIUIEtEAKK&:IONLEL
11f'fl:IVil(.::It4Vl ~P!:AFFL4:INIWLI'NIRD:.IfYPYRYfTFI:/i::YIJ1~~K0PKRK:::'.('t:.~.FMP:'.IKIIt::Y.N
F'JILFf%:
.'I~m_nA 111 .I-/512! .t~ln.1)m r:lal 114 4: I>:na7n .IH'/H SN
IHIf: t1At111 (lll.i.llllll<IIIQI T1111'1 :.%tYlfINef 1.:.11 t'fOCbln IkIIL:C.1!:1, 5 t'MwIt:ArIW(lNFr:ILWAAFtONILLWFtt7tiewYt.M:::TRV::fAN:taat::VALVLTVT1!WktU.::4F
KW.t'111Mt'lu:llVl_Tf11'IVyldUy:IIlt:liArJ:HD1'1hIF::A~llnyfl.KVllO
:::(IIWt'VIL\t't'PaKAI:IW(::P:L.A:iVNLt:FLELIiFIWtMFI'Qtt.IiLLLEKV::RNLYAKt'KKL
:fYrl'tI:YRVYt7ITFL('.TLII'rt:II:Y:Lt.Y::Tr:Ytt:AI~IVWKt:::Ltl:a''1'pptOLt:
t::ll:It't.lLtAVIVt:AIItY:VLF(:ITR::YPFIf'M4IF::Il:/VtXV,l~ll~Il.AfYif.ATIKEKLA
1'WATPVL'.'~FYNYVLI_:Lt:AYTL::LKtIWI.4ra If:7a.VIl1KtItF74t:Yt:LYU:YL:x:KYOAT
'tltlYttly;Nra::Fl't'ha.f.WAFN::L'tt7(DI::Kf.':AKIVItAit.t.-fEl/VtllM'N('LKC:~t:Itt::A:F":/TNETt:L110RKTYlII:NSV::YKA'flirt:rtdY:lYl~/tIF::
IGYR:.T::VI:Ntax.AY
:ltKf:F'frXI::hHDISLADDN r:FYAISPNLI~T..~it.,.~ttE
t:R.~YNAD:..':7ltF~F
ft' JVY
' .
.
l:hFICAt NL'PRFRKKLtIYIIHLt~::PI;IFF:f NNNKT:iIITFKT.':AFFTYI::AVIdIF
0llg -~ ~[' 9MA7t X a5(~136M1 p .
Pn _ _ :
~
y7t7Cy9>t.: 'lt0teet Pf01lM w.,l...__ '('~_OId: 4N77h1 1RH5ZN FLOPSRREIHEWK
r...GS.SLRII~LSPlOOPEOCHFDVVC:.FLIIPEuLTMRSMI~R
;.TUUe nvo~chemc.ll Pcol9ln IVYCONRWEDAAIP1JLIKKp'I'IJICLIlf'fDGElIIKYSwDlDIMiCIFICVDIIRA~PE
ARKLKNCAKSYPRTAL1IEVLVSSVt&AL 'IPSPSOltHlJttlAPHLKNTRKFY
tIf:KQr'~L'fLtI.IFPERLL ItKTlEKCNAKAKC
. .
NILFOTt7MClKt tCVYLKDKISVSKHPFIENlEF
LHILTIA'J:ICLVFSLVFI LDLRAPSWf:JDSHdILOEI:
HLASYAIMIft W
SC
. .
O CRL
KvtLtFCA..~TfMLTLPLAALFtIAIKTK PTNOELIDDIVFYfPOVICGLYAAGCRNLOLDDCA
'1T::/'fLFriMKNLFPPYEPPP:iRPHTPPPl710E'NPLISESYFD.._ ...,y_...
.".,.. fi y ,__.....
n , _.-.....
.
.
PPPWFtSG:Li.M. , .
.
, ;r .
~
....
..a.~l.;t..
. ' . :.. ; F w\"" ....-.::',::.......;
;;.:.:.:a:.' .' . . . ,. ...,.
. :.-a_;
... ..;i~:
1~'; i .,; . I;. . WF~ECIN1RK:E'at:''~'r:AFlihE:AKtI',Y,;
cai P=oter,n . .
c ':OOSCnacJ
.
. 507231 505330 VDsIISOPPINPLCOPOVPM~PSrOpsIVKRLKTSSiCLFKRFITIPDKYPKJOtYVYDT
GIIALAAIAILSILLTA,iGNSWt:IALAPAi.AI&ALCVTLLISDILDSPKAKKICIJIITACPIL.0449 LO tlrana-shift vitA 01511 palp LWPIIW1IAAGLIAGAFVJ1SSG114LVlANPMPVMOLL1VCLYtNSLNKLTLDYfRREN_ _ EI\Y'IGFR~ODISFSNNIVQLTI'X~O~IS1LAAGECSLSAEACDITlI~i7IIVAZTPQ
LLRMEKKTOETAEPILVTPSAI>DAItKIAVEKKKDLSASARt~~(TJL9DAQD~AAILL~IL~TTKIWSIDICSI'AI
!:11~.AAISCHSIPIYDPITJ1NI' NPEHRRSFGSLSRIKTKPSD71ASTRPJ15ISPPfI~DIIDPYNlI~LRS$SFVLKI1GVTLDTKGFIQTAGSSVIIDi IO
TAGfa T
TLK
PVIL
' AQGSIFYSSR .
GSGASSAFTPIMPASSRSPNlSiCfVIJtPEPVYPKGGKEPSIPRVSSSSRRSPR~KOS
JADfIL
O
IVFSC6ItLSEDE71K
TTLK7f$TCEYPLT'-'r'SIPVDSLCiflGKKVVIAASAASKNV~PIGLLDIIOGN71YAI~L
OOOONODEEO1CQQSKKK~KSNOSLKTPPPOGKSTANLSPSNPFSOCYOERCKRKHR10ACCKTODlSIVOLSALGCAT
TTDVPAVPNATPTltYCIIOOTWQfIwVODI'JLS't'PXTKTAtt.A
9llMCYLPNlLItOGPLVPNSIiiCSF8DI0AI0GYItRSALTLCSIJRGfwAAGVANIf~
CPn_0144 90365 191507 IOfGZKRKYAMK$GClxI0GAi101'CSCM.ISFAPCQLPCSdfDFLVAIWITd?'1tJ10RlYIO
pmp_6-POlymorphlc Outer Membrane HITLCSGFI~KLPGSWSHKPLVLCGOWYSNVSNDLxfKYTAYPiVKClN
Proce>,n PNL4LPI0Y
SLPWLLTSSALVFSLHPLMMtiTDLSSSi#.tYENDSSGSAAFTAKETSDJ1SY
R
D
F
K
I
OIt fDOSNL
KAFPORHFO(Y S
CTTYTLTSDVSITNVSAITPADKa~CP'TM'CCALSFVGADNSLVIpI'IALTHOG71J1IIM'N.
I
SE
C
E
S
O
M!<GAS11H8YPE1LI~~~IXLM'T
CItRAG6N
RNDPKCTT
ILVISCA
d~ILiW
OtIt LSlSGlSSLLIDSAPATCTSGGKGAICVTNTDGGTATF'fDNASVII~KNCSDfDGAAV.
r T
i KFEKISDCZIDISYDL?LSYVPDLI
NVDLOGKF
!
' TA ~OFJlIYRGSSRIY
SAYSIDLAK'1'1TMLLDCfITSTIO'1GCAL.CSTAN'1'NOCNSCi'V'flSSNfATDKGGGZYSKO
YAPSPNFP.VIL
EKDSTLttilt~lfGVV1'lKSNTAKTCCAWSSDONLALTGNTQVLF0~1KT1GSAA9AN'LPECC
SOAll1 5071A0 GGAIGCYLATATDICfCLAISQNOE?ISF1SNITTANGGItIYAT10C1'LOCMTLTIDGf~IITCPeLOSO
TGP 10PolymorFhic Outes Meabrane Procaia ' PAD
1~NTNLL.FSGNKA _ ACCGCAIYTETEDFSLKGSTCNCISTNTIUtTC~LYSI~SSLSGIMtSQISWLVLSSTLACFTSCSTYF1111TAENIG
PSDSIDDSTNIrrTYTPIC'f!'iTGIDY
SNS&ANOEGtGGAILAFIDSGSVSOK1GLSIAtJ~tOEVSLTSNAATV80GAIYATKCTLTGONLCDSMLTISCI'SDT
IESLSlAGI4r~tSLSPLliIKSfAEGA7ILSVlTOXHi.
' TL1GDITi fN .
NGSLTFDCiPI'AGTSGOAIYTETEDtTLTGSTGTS?ISTl~T1'111ITOOU.YSIt~INSGS~SLIGlSSLTFWtPS
SVI1TPSGKGAYIOCGCOLTFDNNC1'ILFICODIfCEEp10071ISTIC~Ii.
LLFSGNIfATGPSNSSANpF.CCCGAZLSFLESASVSTKKCLWIEQIAJIfSL6GNTATVSOGSL10181IGSISIaiO
tSSATGKXi0G11ICATGTVDIT'tIKTAPrLlSllIIJ1EA710GAIt1$lClt ' ' ' 71G1tIxT
Nl'SLVISDtNfATAGNGG71LSGDiIDVTISGNOSV'1'lSGNQAVANOGJ1IY7110Q.T
IITlSTNS CTI7C
J11~
AIYATKCALHGNfIT.TFDGM'AETACG71IYTETED!'rLTCS't'Oi NGNfSPTKNIGLVFSGNSATATATTfTDOIIL~ISESDIATKSLTLT1~SLSF>
INM'J1KRSGGGIYAPKCVISGSESINIDGNTAf:SCCAIY51GCSITIWGWSP1!$1SCCLRSOOOOVSPFL?I
KGGtIlYIADSGELSLEiIIDGDITFSCNRJITt~TSI'PNSI1QGAGi~ttITKLAAAPGNTIYt' YDPITNIJIPASGGTIEELVINWVKAIVPPPQP~'.PIAsITPVVWJIpANPNIGTIVISgCPJL0451 11FMDJ1C1'TLt:T 10 IFralne-shift: with 01511 pm0L
GKt.PSOW15IPAN1T1'ILNOKINLaGf~NVLK~ITLpVIfSFIGOP031_ BiEGSPYDNPGL .
lt1'ORV!<IKILDSCIVItNi.IYLICIYIDANSSIJOiKSITM1C1'SIPWV4VSSVIJIlBCIR.Q
TTlt~TfDOSII8.101LSVFK.DALDGKRMIT1AVNSTSOGLKISGDLKIf?SL71NEELLSPDDSF7i~tIDSGTIT
PKTS)1TTYSL1'GOVFFYEPGITal'PLSDRC!'KdtTDN
LVAWO
' LVPKVGIIGGKVT
LTFLCiiGNSLTFGFIDAClt1A0J1MS'1'TAN101LTfSClSLLS!'DSSISTIYrT00t3'1'LSS
KAtdin.PFLDLSSTSGTVNLDDFNPIPSSNAAPDYGYOGSWLS'1'IAOOtIIJI'1TJIOA
WITLVPNSLWNAYVNI1t5i00EIATANSDAPSNPDIWIGOIGNtIlIi00KQN
AtGYTPKPEt . Si KtTtAGFRLISRGYIVGGSM:TPOEYTIAVAFSOLFCILSKDYWSDIKSOVYAGSIC710SSAOGVNLlliIRKLVII~
ISTADOGAIKC71SPLLTCTSGDA4T5l 7AICM'KA1DSP
i C
YVIPLHSSLRRHVLSKVLPELPGEIPLVLt(GQVSYGRNNtOd~fl'tIQ.A!>Nl'QGKStRfDBHSJ111RTP
RTJIt~tI;DYVRF1.SNIA&TSOGiIIDOf7GTSILSNNKFLYFEti ELIISNNKTL1FASNVAC1'SOOAIN7FKKLJILSSOGF?EFLANNVSSATII~WISIIfASC
FAVEVGCSLPVDLNYRYLTSYSPYVKIaWSVNOKGEOEVAADPRIIDASIQ.VNVSIPhaELSLSJ1E1'GfIITFVRI
fCLTI'ICS1'D'1'P1QN71INIGSWCBCITCLII~IITIlIYDTTI'SE
LTFKHESAKPPSALLLT'w~YAVDAYR~iPHCLTSLTNGTSWS'fFATNLSRQAlIAEASGHGTSSOVLKIM'~SAL'J
I(~tPY00TILF9GLTLTItOQJIVA~i.K88l1pPViL100KIi.LO
' RYSF
ImV't'LESTSPS01JIGSLtGItDSCrlLSTTAGSTI'ITNf.GINVDSI~L.ICQPV8LTA1~A8N
LKLLIiCLOCIASGSCELRSSSRSYNANCG'1 KVIVSGKLNLIDI~tIYESI$tPBImOLlSI3*ITVOIIDVD'lIiVDISSLIPVPII~IIISE
CPn,-015 YG!'ODO'~L~T~ATA15r1'ILTGIVPSPERKSALVC?11'LwCVITDIRSLOOLV
pmp_7-polyarorphie Outer Ma4brane EIG711~tNOGIWfCS$!ItNILNKTGDH4P1GCFAIiTSOCYVIGGSiUftPKDI7GlTPAICN
Protein tIBESAIEKIPREIPGiILD
FNlLVSKIICLOMKSSVSWLFPSSIPLFSSLSIVAAEVTLDSSNNSYDGSNCiT!'lvlr~n't DMAG?TYSLLS11V5lQNAGALG1PL71&GCFLEAGGDLTPOaIQHAL%FAFINA05871GTW
' LIRRD~CFIALO'~AT~'l~SNTLOPONYLRLG
VOVSlSHSI~CIItYTSLPES' ix.SI110fiIGL~.PIYLSNPNPLIRITIl011I1tC
ONISSD
MnVSOftSFIESSSDGRGlSIGRLLHISIWG71KPVQGDIGDS7fI'YD~OlIVaWYRI$1 VASTSAAf>a(ItLLFNDFSRLSIISCPSLLLSPfGOCALKSVGNL&L1GNSOIIF1tJD
NGGVIHTIOJlLLSC1S01ASFSRNOILF1G1~GVVYATv~TITIINSPGIVSPSONLiIKGS
GGALYSTINCSI?DNlQVIFDCNSAWIN~QAOGGaICC:TfDKTtIfLT~ILSITN~APOSTItTLVNSPDSTiKIROG
tdSROAlLLRGSNNYVYIISNCGJGHYAI~
' LT7~CilISCLKVS1SAGGPTLFOSNISGSSAGOCicGG7IINIASAGEIaLSATSCDITIrHI'ttKI.R!
VG
NQVITiCSTSTRNAINI IDTAKVTSI1W1TGOSIYFYDPITNPG'17N1S'1'OrWLrtLADANS
' CPt>
rlKDLTDS _ EIEYCGAIVpSCEIGS~1IAANVTSTIROPAVL.7IWGpLYt~tDt:YNDmP 12-POlymorJhic Oueer Mrnbrane ID'1'l9DS Protein Itruneatadl TFaADtDIISLSGTIU
.
f'NEEI'KrILPNlLTCSALlLU.PAMQVVYLNESDQYNG11INNKSGEPRITCYP~'ISYI
RILNDGGTTLSAKEIINLSLNGLAVNLSSLOCTNKAALK
FYF1JIOJLKSAS?YPLLELTTAGANCrITLGALSTLTL0EPE1'HYG7t0DNNOLS>ttANIITSS
KIGSINWfNTGYIPSPERKSNLPIiJSt.WGNFIDIRSINOLI>~l'ICSSCEPPERELWLSGIAfLDDVRISNVIWD0 8DJIGVF I$~ICIyIZT.T
' AFTSJ1PLLP00DGaIYStGSVMICJSEIYI'FCGNYSSIiSGSi171IY1'PYLfaSKJ1 LSNf3Yi GKNNGDTYG .
NFtfYRDSMPTAHGFRH15GG1fALCITATTPAEDOLTIAFCOLPARDRNH11SCCIRYL~FRONVISOG7fCCA1ST1 0iLTLITAGPSClC~NtAYNDIItiSl10CJ1IAI
' 6RPSVM
SYLHTDFAIMICCYYTDtJ .
ASLYFNtfIEGLlDIANFL4JGK11TRAPWVLSEISOlIPLSlDAIC!APOGSISISVKSGDLI!'tKiNl'ASOI7t~
f1'ItIZISIHIQ~"IIQlIDJ4RJIVfE6DVYlYDPISH
3IIKGsSNRNDJ1FCJIDi.GASLPIVtSVPYLLItEVEPEYXWYIYAtWODIYFJISEIifIKITDLVINAPECKETY
tJG'rISFSCLCLDDtIEYCAENLTSTILQOV'1'L7100TLSLSD
KSELINVEIPIGVTICRDSKSI(CfYDLTLMYILDAYRtINPKCOTSLIASDANWMAYGTNGVTIALNSFItOGSSTLT
MSPGT1'LLCSCDIIRVpNLttILIED'I'DN!'VPVRIRAmKWILV
tJvRpGFSVRAUitflpVNPIiMEIFGOIAFEVRS6SRNYNfNLOSKFCFSLCKLKVAFFJIYwSVYDFPOIKEAFfIP
LLELLGPSFDSLLIGEI'fl.tRl~V1'1'EIIDAVR
CPn_0116 497602 500415 GFWSISWtEEYPPSLDKaRRITPI'IOn'HFI'TwNPEITSTP
pmp_8-Polymorphie Outer Membrane Protein CPn_0153 513156 516152 LIEPIOtLSHKIPLHKLLISSTLVTPILLSIATYGADASLSPTDSFDGAGCSTF'fPKSTAD
ANCTNYVLSGNVYINDAGKCTALTG:CFTE'i'1'CDLTFfGbGYSFSIIffVOAGSNAGAAASpnp_13 -POlymorphic Oucer Membrane Protein ' YASGKSTLSSACAIiJLTt%IGTILPSOtJVSN6ANNNGfLAPCFASTAl7YEVIMPSENF
TTADI(ALTFTGISNLSFIMPGT NCVLLYLFFYSLSLI:RIIWFHLYVOht(TSIRKFLISi .
DDSSGKIFPYTfLSDPRGTLGI1SGDLYlANLONAISRTSSSCISNRAGAWILG100GVF
CGAlYSSAAASISGN1'GOLVIMINKCi~.'tr'iGGAL
SISGNI'SSITFTSNSAIOC1 ~
A1TTK:'L
.
SFWIRSSAOGMIsSVITONPELCPLSFSGPSOMIlDNCESL?SDTSIIir~JVIPHASAIY
. ' .
.
CFFJ1SSSITa'ISSLFFSCN1'ATDaAGIttxAIYCEKTCETPTLTISCNKSLTPADiSSVI'0 CGAICAHGLDLSAACPTLFSNNRCCNTAAGKCGJ1IAIADSCSLSLSANOGDITFL.GKtLTGSM
ATTPMLFINNDSILFOYNRSdOFCMIRGTSITI>~ffKKSLLINfirIGSI~ICGAL1 ' STSAP'CSTRNAIYLGSSAKITNLRAAQGpSIYFYDPtASNIIGASOVLTINOPDSNSPLDf INLINNSAPVIlSTt411'GIYOGAIYLTGGSMLTSCNLSCYLFVNNSSItS06AIYANONV
' ' ' KOPLAWSCTLALKCNVELDNNGF?OTEGSTLL PPATPPP1GVSLTIS
'fSGTI'/PSCt7tLSADEAtWIDNFISIt dIf FSNNSOLTlOFIlfnlaPONSLPAPTPPPTPPAVTPLLCYCf ' . JtCKGCJ1IAIPESCELSLS71NQG
MOP~fKLKAOTEA1SLTKLWDLSALEGNKSVSIETAGANK1'ITLTSPLVFODSSCNFYECENSVTPLENIAS,OGALY
GKKISIDSNKSTIFII(~fl SHTINpAFfOPLWF?MTAASDIYIDALLTSPVpTPEPNYCYQGHWEATWADTSTAKSCDILFNKHISITSC'.'i""IW
SIHFCKDAKFA?IiGAtpCYTLYFYDPITSDDLSAASAMTW
"
' iONtSIY00RGLtAASGTANPF L1LM~ATLJiIMN
TNTYflITCYNPNPERRACWPDSLWASFTDIRTL00IFlft VNPKA.~sADGAYSC:.'_VFSGETLTATEMTPANATSTII~OKLEL~L
' . fND7ILT
NKt)IISGTh'QAFRNKSItCYIVGGSAEDFSENIFSVAFCOLFGRDKDLFIVENTSNNYLASLFTODEKSWINDA.."
.TLATTNDAtPII'DGAITLNKLVINLDSLDCTKAAVVNIIpS
' 'fLOHAAFtJ'GLPMPSFCSLTLN.ILKDIPLLLNAOL.~>Y~fTKItDND'fRYTSYPEIIOGSWfNNNCYOpSIYCI
fO
I~GTGGLVNNSOIt:7HHGMlNADWpVPILELKATSNTVZTfDISLGI
NP
~
SLALYLPKEAPFF~.'1'FPFLKFOAVYSRQONFKESC:AEattAFDOCDLVNCSIA5AADCED
~
CTWEFTII7lTffftl':GtJNKKTGYLPHPERLAPLIPNSLWJ1NVIDLRAY60 CALELCt' ' . ILINTYTRITPDAALSIGPCQLITIfSKDYL
. CKOLSI1GITNF!!!?.NtfCCDMSYANMOCG
.
PVCIRLEKI~EDEKNNFEISLAY:ODVYRKNPRSRT3LMV:.f:ASWCSLCKNLAROAF'WS
SGEA>YELRG=AHIYNVDCGLRY3F
VGHrHSNVYFAR~.:NITKSLFC.iSRFFSCCfSRVTYSRSNEKVKTSIRKLPKDRCSWSN
~>IHiVEL ' IC
iHLTL
. PECRIFGHGHLLNVA
.
~NL~ELECNLPI'.LSSRILNLKOIIPFVIU1EVAYATWiGIOt:M
.
.
rn VPUrVRFGKN3HNR?DFYTITVA'fAPDVYRNNPDCDTfLPINGJITiiIfSICNNLTRSTfi.V
r 447 SU05.11 503351 ~ OA.iSHTa~VNDVLE:FGHCGCDIAttTSROYTLDIG3KLRF
, tyap_r-Ftlymorpnu outer Membrane Procein F'JKPP IAL'fMKSS W IWFL IESSCaLPLSLNFSAFMWEt:III3I'fNSFSG(~TY'f PPAOT CFn 0154 510179 51911'.
T
TtJAIJ:ff'fNLTf:DV:itTNAi7St'TALTASf:FKETTtaIL:F~NUYOFLLOtJIDAGANCTFl4 W7lymc:.hlr ~utsr MeaICrane ~ PtOteln ' m sNIM~CAW P_ YFf~NF P
NT.WIYt.L::F::f:F:.'YL:'Ltv"l'fNATlC'K:AIIL."ff'.Af:SIOCtIY~.Lt:uA:CAFAETRLIX't IP/PPITIY)GEEILLT::DFVI'.:xJFLf'JL;F
' CMiL:FK::~::F't:L.l .".:t:::.:a.tIPNLTFAKNKATUKA:dL'f.~.ff:taTttAITt.N::A.:f:tSNTAAMICCIIIYTEAS.
'LNFIfJRAIT:.fX3At .':.~.Fltf::al4:L:":f:::LSLTFTCCOAfrfN.~.tr/ALLSMETLTFKNF::::INFT(xJO.~.7ytL
:
:
CNta C
' :
T
' A'f :
A
' C~/LTL
~
~
.
r:CLf'f'fYDIVt'9::IC.~LIFTtNAVAY3PA.TIfTATPAITII/Ttr:A:.At.~~ITIk:I.TVEtJI
.
.
:
.
.
FIMC
V
:
, , AIY
:~:T.
t Y.
::Ft.
.:alY:,I:
a':ALJ.x:DITFEX',ttIWXCAS
l'KNN::AIDTAAPLJ:f'.AIAtAL
'.~::L
::::IXa"rt '/t'fitJL'/L
.
:a)::IYFF(:NIJVJF.::AI::::.FTAWKPiNNTATFI.iF3HtIFTSSGf;I:VIYW
. :::::I.LlIJHC
.
;I:TYALt:::tY:AICIPnI'FELKtd'IOI:IG'PP::YNt:TPNN1G
. :L
':::nJl1TlIJ::INI~.TIfNAKIIILR~::;)r'dlI'tYFYOFI'i'f.~.ITML::DAIJJIlY;PDL.ALTIP
Att ':i VTf' ;
' nf:f:'fF:w:F*L:a'J1F414~\ONLF"PlrxltLTI.NYJAt:LY.:F:VT't.VAK:F~O::fr:.~.TLLr%f.
y .
v:
:
:
IIFTI1N:
LLD::NfMRH(X:AI~:AY'fLNIV:Ix7PtEF::RNRAHKtX:AIFt~:C
'NIVraI~\:~:
AI'fAETr :41rN71-rLFTAIta'fINNLVINV~:'.LKI.-fKNA'fl*.1'frJA::r/~PPI.::::L::LVDP:~Y7NVYED.
'/::WMII~'JY::LL'rI:rAIN~I'ANlllt'IDLrIAUPLF:KtIFIIIW:IrJ:NWAL::WOECffATK.~.KM.
::Vr:f~PAK(rr:.Tt.T::.rI::FY_a'l\FTf~IMLNfYi~:IRNAIT/EMIIIEIV.'II~SA(JtXi:RL'fF
Y
"
'fL'tWrwl;tNl'FlIHPf!r7rt.VANTI.Wt:::FVIriR::I(/~L%ATY'/Rn:I~RTR(aYX:Ef:L:iNFFf ELLLPAtrITfILItTVKtA:7.E
DfITII.~.LifiT.~.1sKK3lTfN.VJ~:A:77::WFT.~.Y~:L.S
' ' TYItIK.:F'ItIII::f4:YVVr:ATPrI
I::DtJLI'1:1AF'r:fjLh:Y.UI1UIIFINKNRA.iAYAA.iLf:AFMVDFTV;KIr\FOM
IIKIx :.ILYRD
l*ITIAIAWfrJII:F.A'~f:::ywLTII::xYTPII:LATP1 ' ' .. fK
(JIUA'rl:::a~::I.1.uYl.n::a:::E:~~PVLI'nAVt::yl'f.':YNTMKTY1"f~LtPKI:E:::I~fYNO
t:CIt:YIfl:E
f11 F'V:;A:~/IW:fYfJW::.r:AL1'LUt:111MPIN.YfB/t.W~I~/AIPIAVFKn:ATV
' ' ' ' . fLVR:1 tnrl:lL:aUr:LF'HAYFI'FIKVF:A::YtilrrL::FKF:Ptri-CL.VK::FD.rI:DLINV~rVPt nl It.LPE
F:ll~:::l TW::RtLI.IFAPUX:FFT:F.:P..AtffLYAVNN::h IA'ft..ll'lt:Y~f:KW:a ' . :7:k .
HYr:(a'/::N::LWI::F':.aY~AF::UIV)INLLIIAlI~:4itTAICAWAWF7rfM!~~:11FI:F
In:t'fF'I:II'::IeNP:VA::Ylu1'fVIS'V:\INYftKPll~l>t~I'ff.t.t.Itlttl":WKTit.'rtJL
.:R(MLiIfsAA
Yrt:YL'AAL::NtfITDHTTII:L::f'V'OLICKTNAHF ....~LI.3FF~;0FPLYIiOKSEA ::Pn (IIA4 S v ',i-i~~~
..I:1YKAAW:'f::KNHLNT'I'ILTIPDKAPK.Spt~WNNNSYYVLLiAEHPPLMrCLLTRPLAOA No roDusc nomoloa Dr:aemt :w:'.rw~dmK EMII;. n: ..
WDL;X:FI3At7~'LOWO~KfTIT'IDLQRSPSRGK.YNVSLPLCC35pWITPF'K1GPSTLTI
HRL~RFTE~ILtRI.IS1M9L~~MILR ~l~f~CKINN1~~
KLAYKPD I YRVNPfIN f',ff VI/:pJDEST3 LiGANLRRHCLFVOINOWDLTECIOAFIi'IYTF
UI:KNCPTNIIItV.'.'".' :LK.: v F
IRCGPF3EDAVPE.iEPFDL.i:'lVIfCDRSI:PGPTKKRS::,i~",C."yE;,PESIYPOSEP'.LM
RPRMLS
'I'n_0455 520 )63 517158 Na mrntsc rtnm"loV Dresant tn GlIteWnk/ENBL5~=all as of 11/7/98 5 -Z~a =~~ 015 .
'ar n .
..;_n...,., ...I,..;;..r;.-.Lc--v:'~:. .
--E.:;~rL:r.~.' .
. ._.. .
~ .
...
' "
;
' ~ , .. r. . :.;r: .1;:
.:_. . :::u':':~ ;..., ;~.~ ~-:. P,.6EAACA': .~. _ ::u:t' .;LF::F~:... ::.l~r:~nl~ .R.rl.ylJ..:.
:.?NFAL :
:
~
' ' ' ' ~"'KWAFUDEHLPWVi:iHIAYAEEIREKOEQ'IFIIXI'.ILTEEOIVA:.i.CMYSTE1WWF.
.''\ :li ::m .,;;i:.ilk.:kVEDILKRQR'...iLEi Ul,~t :.~.i:iv'v:.'!':';:.i.'wi.:.,ii.:i:i:l'.fi~:il.
. RDCCKVHCDLPSAPFF
aLaAVIKOSVNRFRNPDLFAYERCALfJISVTDALVSYVSNLDIfIPYTSSOGIVI~SSIV
K.., CPn_0466 pmp,15-POlymorphic Outtr Mlnrorane Proelin TSIOtFPCf'CIGl.PfTPIrt.ANEGL4LPL.E!'YITLSPEYO~AAPpVCF!'IO~pDtJIIV~IW
;:Pn _ DFILDYKYYRSNGCALTCI~.LISF21ICNVFFEIOIVCPNSGGAiYAApNCTI$XtrpWllAF
No roousc t>omolo0 Drtttnt in CentWnk/E7tBL"' as of 11/7/98 ' IPCTFES%RKFIaffHCLIK:WFSWRHNFVOAFNFSRPLYSRITHPAt.CVIKAIPIVGHLV30CCFV1u1LJ1iIiKt w,ALYTETtQiIl~1 TTNLVSDNPTATJYCSLiGGALFAINCSI?NNG
HGVDNLISHCF6RGVSNPGFPSDIJ1PILKVEKIAGRDtiISRiA'IDLKSLRKTIEtIEDLDKKGPIIIIfQNMLi4S
DSLGGS.LYSO~iSil'IIFGYSG71IOT:SNSIfSTQZLT=SSN
VEICOYpENPYA0M715SEYLKLDK,NiIVSEL.ciKAFSRVRNRITRSYSYAPTPOLDSIaIVGKKLIEIStc)SAFA
NNYGSNFHPGOOCLTTTtTTILMiRtGVLFN~BpSQSI~GtIINBKSI
' tLLVSPEEOENLVRLANEYIOLYPKSKTTLYLLIDFDRaWVGDISSO~OLRSLGLHSELATRGGAL:liLS7YCSGNCS
FILSAONGDIITl4~IVfASIDiAWIPYRN
LIKIIBiWYFLNN
' ' 'IiCCL.SyLEPpGADGEDTKHFDt.IfVOCYGKDSYLREGKILOQAiiGTSLG1YPWDlJPMNTLPAIH$TPNlOJI4 IGARPCYRVLFYDPIFitELPSSFPILFNFE
fGtnG
NLFSGAMR~IIPt ' SRYRSRLSLPINTEIfDICfELYKEISRTHNOLHTIJCMCLGAODSGLLLDRORLW1PL$QGSDI?MFPSYLRNISELR
OGVLaVEDGI1GLACY1CFFOROC.
LliirpGaYITTA4TIP11SST
i.FSAtVDIIKNISKKEL.REVSINFANDTSVECGCAFYFPTZITaSTITI2iHIAIDLPSI'SFQAOAPKIWIYPCK7 GSTYTEDSNPTITISCILTLRNS
HCHSYLADLTHELKI:
.
IBIEDPYDSLDLSNSLEKVPLLYIVDNAAQKINSSQLDLSTS~I9GOlYGYOCIWSfYWVE?
TTITNPTSLL.GWfKHKLLYANWSPLGYRPNPCRRGEFITNAIiASAY:ALiIGLJiSL6SW
=Pn _ DEEKOIiAASLOGIGiI'YHOK~OGFKCFRSIOnGItSA'I'fGTSSOSPNPSG:FAQFISKA
No roousc t>omaloQ present in Gentbsnk/Ct9L
as of 11/7/98 VFLPSRVMASCLSAwFSIVREHFYRAEDFSL?FC11RITEFVLGVIKGIPVVGHIIVGIEwKEHZ~NSTSSNHYFSOMC
IIZICLFKEWIRLSVSLAYMFfSENTIFIMYOCLLE>a~CSF
LVSRYLESPVfXPTFVSDWSLLKTEKVACRDIiIARWETL1CR0RVAVAPIftIF~KVIKiICIHNlft'L71GALSCV
FLPQPIIOfSLQIYPFITA)JIIRf~ILAAFQESOWtAREFSLiRtPL1'0V5 PVHPFOGIO~EVLTLYPEVODATf.GWFSKIRNRVRQAYL.OAFRPItLOKIYIICN~IPLP9CIRASW10~RIHRVPL
VWLTEISYRS1'i.YRODPELHSKLLI50C11~'fQIITWnINIL&
FEVDDFLNLARL.QJtTORLYPDATISLYLTASGGRNAMD)00RICi.SDCELNPKIACLDFIKVlaPl1'IOVFPKVT
L.SLDYS11DISS5TLSHYLNVASRIOtF
tIOCOWKQ11TCDCWHVYNGHDpCfLNOIOEELEILSGECfPNIHVCOKPLSOSLWDTSPP
SSLEMKCDKEItALCYSELtKEOLYSRLVYVGRSSVLSLCIGDSR~ILID)PIDIVNAPLS.CP(~0167 536528 ~HYCHSYL11DLENPGL4K1'IL7U1FI14PKELSS1'ILOPISLNLIWSKTYLRQH!'CFFER
MSRSDR1JVVVWCDSWWCfDWKI3:PSFOHFINLLDGRCYSNFNIFAFRSN6MM.ARIL
NFSSOEKAP'fElIFCEDSVSOGDIRCLHL715f:GMLCOXECYAVWY'fSCCANF1?~tVLTL
ERFSNLWNRKHGLWI(AEVRKpK0EA71LDODESEIYVCNpL?AOpNfACS
CPtI_0468 539608 50132 CPI~0459 527062 526619 prnp_17FOlYlno~hic Ouctr Mtmbrant Proclin No robust hdsolog pnaanc in Gtneberlk/D!lBLLIYKLLDNKLMIFYDKLYFHIMnMFMttPICLSILSTALCCSLSONEVPItL71SC01IS1110 as of 11/7/9A
STKIQMHPGLRNWRTSTNKLREE7CSVSFRtYFMYlICDKIVApICifLFTLDAVIKQAIhIRSAFNTSPSPRiaN1'P
EFLVSSFRP9NLL14GFOHDITODITITGNSI118VIDYlMIYfD00 SOEKU1LFYVESt3ALGREIKVSLEEYIOSMVICILGSOATKKSFKPSVDFTPLEOALQERCILiICKNLt'IS~Rt~I
LSFWJSSfISBGCALYSVRC~'IISi~NYSFISW1J1SLJ1Tl1'L'SO
SSDDI)EDATATSTA:..71TASPTIfIOtI~E
FOGivIHAL~DSYITNNL.CECQFLON11SKNRODAIYVGVSLSITDNLirPIVIKKNO'I'L.tDSS
FOOGIFCRAVNIERNYpNI0IN0NASGOGVVYFLP
CHy0460 527810 526992 No robust ttomolo0 present in fllntbar1k/003LCPIL0169 510399 541160 as of 11/7/98 VIQNLLNFALEETPSISVOYQCOEKLSPCDNSPEIGK>OCRWNKLESFSTYCSLFNSV107Hpmp_17-Polymorphic Ouilr Mlmbran Protein IFrm!-shift with YKIl~IGIONSLSGWLLDPYRVCAPLSSPYSCPSYLLDL~1KELARSLLSTFLDPIOiLTS60169) TFRSVSIHFGEISSFCORWSEELSRVLHDEKEItttVAVIfiElD7IKLL.EECGSPEALSLLCEDLCFA1'~iIFSAL
GVIISSNKEIIEISNNSASSINTASGKLYPOIxCDfCTSLVIt?ItiPIOG
RESGYSYIl'IILSVSPELIISIfV0ER0ILRRDLOGRSF'IIIMITDLPLGSEDIRSIAL71&~tILIFMIIfTAIIL
SGCAIHTRSFIFQEBiGPfAFINNSATSGGALINLSOIOSTPON!'ILBADY
LVSSSLDAADACASGCIfVLVYENPNASWJ10ELF3JFYKQVEAARCDILFNNfIfITSSSPOPCYRNALY11J1PGIN
LKLGAROGYKILFYDPIdIDOIZ'fDPIVFN
YEPtHiLCTYLFSCINVDSffATNPLNFLSKFSNSSRLERGVLAIEDRAAISCRTL.SQlGDI
,0161 528617 527811 LRLGNAALIRTKCPCSSINFNAIAINLPSILOSFJ1SAPKtWIYPI'LTDSTYSBD1'SSTIT
CPn _ LSGPLTFLNDFJJE7JPYDSLDLSEPRRDIPPPLPPRCDCKIOdNYPESHCRSHELR
No roousc hanolov prestnc in Genlbank/F318L
as of 11/7/98 ISIVACPSISSWt'NVRpHFVNAFDFTHPVCSRITNFALGIIKJ1IPVLCHIVlxIEWLIS
wIPRNTVRHCRIFTSt7VSSAIKVEOTRGHNCLAPLEAYL59LRVPISOEDLCKVfIGRTPEDCPn_0170 51357 ?FJDITP1'EIVOLLPDEEL57VDFJIIAGVRSRLTYAYRSVEKPMIODLALVCFCLRD.SADpmp_17-POlymorphic Outer Meiebrant Protein IFrelne-shift vlch LINtVRIJUJGVONHYPHTKVItLYIJIKNLAOVWDCEISEEEKCOLRALGLDPKIESISL'fS04701 ACLPSVPEVATVDFMITCYCKDQEVpDP
ISLHLERZSPLLYLLDVTAKKIDTSNLIVEN9JLDEHYCYOCIWSPYABIET'1'1'1'fSSTVP
F.pTTrlNHROLWDWfPVGYRPNPERHGEFIANTLWOSAYNALIGIRILPp~iLK0i0L'G
~Pn 0462 531121 529037 SGOCLGLLINOHNR)~GRKGFRNtII'IGYMTTSAKTAARHSFSLCFApMFBK'lRER08PST
tlo roousc homoloq present in Gentbank/EMBLTSSHNYFAGLRFDSLLFROFISTCLSLCYSYCDtIHMt.CNYTEILKGSSKAFPNNHTLVAS
as of 11/7/98 LIFYLFLNLYLACVRFHFCCZIFDPNACYISIWISTVICQNFtRAFDFTRPG'rSRITNFALCt.DCTFLPARITRTLE
L.OPFISAIALRCS0ASF0ErC0EtLRKFI1PKHPLTDISSPICFRSE
VIYJ.IPItGCVSIICVSWLVSTCSARRFCKPAFTSDVASIVKIEKTRL'YNPLAWVE0YLR0WKTSHNIPNLWCfEIS
YVPTLYRKNPt3~IFTTLLISNCTWRQATPVSYNS11AAKIKNT50 LRVRLPECDLCKIHCKVSRDYVCDRTPOENLNM/PHOYLGEL.GRAFYC1RNRVTKAYORVLFSRVTLSLDYSAOVSSS
TV~YLKJ1FSNC1'F
TPLEYPCLTLVGFDILDPEDpVNFVRLANGIpTQYPOTQIKLYLISIOKIYAi0COC1'I50 EKEQpLRSLCLDAKIKCVSAPAt.LLpKYLpSENLPSCDLLINYYCKOpSVROVDSIKSLLCPn_0171 5J2561 NL.iSEHIPAtSVTYRPDDPFYSYYFFPGSpOGTAPDORiPWSlOEHLQI'Y1TT.SNPRCDRpmp_11)-POlymorphr,c L'ucer Membrane Protein 'IAVHLGMEDFASC'JFLDPLRVSAPLSCEYSCPSYLLOLKSEELRCFLLSAFIDPNNSGOCTVONNRS4iKSSFFVtG
ALI4:KTTILLNATP~DYFDNpANOLTTLFPLIDTLTNfIPIfS
NPRPMSINFCNSPLCORWSEFLSRVLNDfTEIfHVAVNCMJPOLIKKSFPSHSLSLLHtELNRATLPCVRDCCNODIVL
DH~YJSIESWf'CNFSpOCGA4SCKS41ITNTKNOILFLNSFAI
EEJ;ISYIJ1IVSV~.,pERTCVKERRILSSDPSGRSFTVILTDLPEGSSDIRNL.OLASDRILKRACANYVNCNFDIS
ENHCSIIFSCNLSFPNASNFADTC'~GAVLCSKNVTISKNOCfAY
.".~AIJ7AAL1ACASECKILEYEDPEpEWA00YASFYRNIDRAGDLOROCIPCEPLCVSASTFINNKAKSSCCAIC'A
A)INIKDNTCPCLFFNNAACCTACCIILFAHACRI~HSOPIYFIN
RVJLEKD)VFNLNAVIOI'AMWKFKKRDLPAVESQJILCtIOMARALEGYICSSLLVDCTiOPNOSGI~.AIRVtIC'F
it'.ILTKIrCC:.JIFNEfJFAMEADISANNSSCCAtYCISCSIKDNPCIA
~~/u:NVNV.~.FATLDEAVCrIACDSAQOAPSEENM'DDAFDNNTAARDOC)AID'T0.~.LTIOD~':PWPTNNO?I
110CAIMLRODf'.ACTLPAOOCDIIFY
NNRHFIf flfFe:N11V~17JCTRM::LTN~.A.~.fXat.~.ATFYDP
L LQRYTIONR IOKFNPNPOILC
't'n X14..) x)24911 Slll?1 TILFS:TYIFDT:TVRDDFI;aIFRN111.LYNl:fL.ALEGtAES~IKWKFDOFpC1'L.RLA33M
le> cnlnra. hair'luit t,l:tsenc VP:TI't7l:F~':::::::a'.:::VINIh'NIAINLf.~.(U:NRVAPKLWIRPfI:,:::APY:80lNPIINL
in t:Pnadank/ENDL ai; ut L1/7/98 .::a~YEi!TP.til.l.l:fPNCRTPRVNI:;RK:IPIDETCtIAFV~MMK(X:VCTODAKELYTFL~.R:f:ffta.
t.pDE7JLDfYD'fADLrk;f(AEVPL.LYI.LDVTAK11INTONFYPPIfILM'1V11Yf(YQC
tlt7flnl~'.LWF::f.l:EE4:FLFDEKMLCAFf::EDH'l~tI~YLVDLVDt711LKDLLt.SIIFLDPOVw::fY
WIF'PITT:'.M'.':F.M~MIJIR'JLt::frll'Vft:YYVNf011cf.DlAL::AFYIaiPHNLP
rll::rva:f.IJCV::Itrin:U;:F::PLtjOKDFL:?IVLRDE'f~:KNWWFKI:VLGLPATOVCKLVEE:1'1'L
PYtYtt't>t::]LAI'fl:7:F.f"fet.F"/INJN::NNNAY.r:FIMFJrIY:Y.~.IJ'IT::HTA:aIII:IPf IM
Irc:YD'C:'/LNIF:X:IY:Lw:::hQf.LPRKELECT~:R7FRVhAL'ILt:DTOMRSWf.JIi6RINF:XJLF::N
I.YF::'.IL'.i'N::VA::If~~fIAUJttIIIIWI.()IxIY:.'C.:A.sI.AY::Y.''.NIIIIIKICII:Y:
XrK
'J.~.I!Pk'LLVfd\YAAh~:Kt.LKIDHTNWRP:I'F::RFIADFADAVDV::M:fNSREFKLI'fpAElpCIVfNC
:KC'.Y::'1'fG:.IrIL:Y:::4:IJ,WR::P.ILIIFffFIOAIAVRaNVfAf(7f::X:GeARKF:1111 I1.l:Ja:f.hLt~.:Y.TIWFX:ft.lF!.'DRVTVTRIIFILtILGAAIK0AVH1'fIKtIP::LIDKOCFJ1LDKI
'LYNL'MII'I~:IS'::1WF_':Kfl:f.f'1"IYrtIIKIrIYVtVf.Y~JJFIPEIIIV::LE_::Xl::arld:l .T1 LY'I'~J.'!.l:aV::'il.l:lV'1'N::IIExCT:'.KCfFtpl(EIIAff:.~.PLKCALFIGSDEDVPL'l'sE
DP.'if.AfHAIAfK,:uH~'It'IFIKL::Vfl.liY:J:::V::::::'(TfIlY1J1A4PfPKl' I~IAIP.~.I~IJ:U::
: a'i, o1'I L '..1 ' ' 19 '.1''',n C)~_0458 526314 521236 No robust homoloo oresenc in GentDSnk/t?BL as of 11/7/98 Nn r.arcn trlwoltxt pr~sant In r:enetCrl /fl48L es o: SI;W 99 FYFflA:xilr7::::n.Ln. CLPPKtR.;PSPKIiELCSHCISLPPpENCEEGASCSSHIHS
)3.~.FLPEDU~:OSS~aSAAS.~sPCfP;iRVRSCYCPALKSF~.~.AE$'f"..OARE3'RGAPVRLCpn 01A l .. L....S~' 1. ~. 5'~(L,1 " W . , pMSentN
No robust noACl~o4 s~f, f'wtneb inltL ~
fC;:~~5~;
Y~BNPSOCVPGTSSGPEPORLP$LPSVKKO .
.iKTITAOERROVDSSSAAATPJ1RVAEDAo ..
:iCTJr'RLVOTVRDRtVLPSGAPPTDSEpLSLYELNLRLS5LR0EI~iDIOSNDQLTPECKAE, .
t JCLRIECILMAT~VP4'CSJ:r:GEANSSNERFTERT::RMYYMLVL..'.A~.L:FIAIIIV:
:.TVfIQOLIOITEFOCCYMEATOSSVSLrIfrIRFKCVITSDEINSL.C3Irt.TDPELOCLlSOFPQVCWAWCrFAL
:,C:.:.:.:.LAtVPAV~.,GLVLI;KTLEPSREATPPEIVAVKE
.~.. CCL
':D.~.t4NLL0ETADDLFJ1ALSTft'RLSFSLDDNPTPIDNNPTLISOEEPIYEEIG~MOPORLGNEYWRSELIS:.
F:.R~~LH~:.:S~SIIDR.~.LC:CC::wZFI:;.KLEP:..r:....i.w.KKDMi TRFNWSTRLWNpIRE\L1I3:,;...~tIL.iILGSILHRLRIARHAAACAVGRCC'"CRGEELTSSINI:LHLVROWN
L:~.~IrPE'JTAHAEEL:.LFLiEE?YY3FCTLK:.:RYCMLC~A?SPI?I
'::!1 .~' . . .. '~!!,~'.'f:rl.l.... ~~...:. ... . .. . .-. . ,. ......
...M:W :::'eJl..tty:!':!.ITY!'.-:'~, . . : ~!ir"'~nr c...,.. .
. .:
~
' il': 4' .
ct ' .....~.. :.".111:x . . ...
.tr LV!.~::Hrrr:.-:,:c :~.s.-:,.
F. . 1..:::a:r :::fPw.'rnr-L.......
:;~. .. ':'.~I::de!:-:GDY6YpIT;iA.FP:iKDKNi'MCPRLATPALYDL6'wRFI:S.xSSR:iFSSLRVR9S:iPNRRG'lE:irlN~'NL
.iiN:.h'i:v~:.:':r:u~KH;,e';.i;'i.:.::~i:~inNy:Ni:y:LaKSE:LfaIE.iuFi,ia VPLPPVPSPAMSEECSIY$I7MSGA,SGACESDYECMSRSPSPRGDLDEPIYnWI'PEDNPFTLIEYPLSYL:GWA~..
I.'CVi?;fEi3LEC0ADY'IS:.'.QCLCS14I$OFASRi.t7SGQKt,'IatPR
ORNIDRILOERSGCJ1SASPVEPIYDEIPWItICRPPATLPRPEM'LTNVSLRV$PCFGPNDVLSEOMVMLVHGLMt7C
V3FOCLKALHIfLTAVPORMWL~uAi,PLfESfPVFNRIBfFfT.G
MALLSfSVSAVNVEAESIVPP'tEPCOGESEYLEPLOGLVATTKILGp100WPPGG9NAfSLCD
CPn_073 519602 518070 CPn_0482 561764 560961 No robust: hoteolog present in Oenebsnk/00LattJ-Asnsne Peripiasntse Bsnding as of 11/7/9B Protesn ~$IMAV~OCSRSPSPIPpNRRH56DGKVSPKDM&fJiTVS$$DSSLASOGPTIEUKANtrIYWICI'MIKOIGRfFRAF
IfIMPISLTSCESKICRNRIWit.'t'"N.
ATYPPFfYVOIIOC
OI~GTWCIPLPSVKEPGDSOTSORSGVLQRIWIO(11KEYVGFDID4.AKAISEKLCKOLEVRLFAFDALIIiIWOWRI
DAILAQISITpS110tICe'..~...r rfYfKiCI'pOARPCVSSPRLPSIiVOH
GORLpCLOCfRDRIOKRSENPEADLGKH><RSYSDGDLDRVOtiDSNEDBTEDSRSEOCEPSPYYCOEVOEI14WSKRS
LE':PJLPLTOYSSVAV
.p"~r~.FQEHYLLSOpCICVRSFL>BTi.LSt SKSSSPLSGVItCAVSKVHCJ1LGDIKDKFQRSASE~L'1'I'OOEDS11GDTVKaIR$EGEASIME11RYGKSPVAVL
EPSVGRtIVLKDfpNLVATRLELPPECWt7I~CGL,I11AKDRPECIG?:
SKSSSFLSGVRC71TSTV'pGJlL;.011KEKV$AFGEOMGAIRSAPGNIRTRIQRSSSDWLSOOAITDLKSEGVIQSL
TKkftFILSEVAYE
"NNKAAKIiLRKJILfNLEINApEQVSPEVJ1SRVOSLLARNE0LTN0EPP1YEDLITFVESN
VOSDSVEYASIVPOOGSOAPAET1(FJ1PETGCVLGSAJ1QGJ1WKATJIDfWSIfQAVASffRCPeL,0183 aIJISRLS$ARRESAVDDLASESNTQWFVEpEGVSNPSAAPSLSFAEEIARRAAOiSNRNANo mbusc hanolog present in GeAebank/EFIBL
as of 11:7'98 OSLEKLF~d1V't'DPVIQOCLGLrIitSFAPECOIILIKICRAIfaIIFPIPPPNCPPNNID'NFYHLTfDTIGDPLL
LRILRTIGYVLTJfIIT.GL
C~ 0171 551600 519807 CT365 hypothetical protein LKIIISISFHSTSPISNpPRYLSLSNATEKTSLLtINSRSLSPVPNSLVPSNPEDTCLRKS
IFTHSVTLFAGLWLLVAVSVWV71LTVLAPOVPOAILLGIAISCVOIOGFSIl0f5LVYN
TTL7LSFSIIYIYTKFFRSEKVAKG61Q.TEAETIKE71KKLHYISLSIATIGVCLAViGILIJ1 IAG112i.CG1IPATTAI IL1PPLISICLT1'VLQTILHSSIGKWRAPLLTOEIOIOLFVD'!SL
KDIRLEKZ.PPSEVEESEI'SOSVIEVPDSECIAE1'RIS7IEffIDTRLSLTTRQKYIFALATL
r r~~eIAAfIVrCFOGL7YMpVLLVASVG$AVAS\I1T.PIIVSSCfSYVAY0LItARiI4ISKL
RWKE7UfWOCRVROFLILxGVIASt$IEFNOIIWK'IYYIOCQIOKTDAAIREEVRNFLOmGLVN
SALVCGILL.CVC:R3IIQ3.ALVPAFApIVPGILALCCSTLCIAGSILT~BtTCCtVtiWLYDELVK
LYERRRIiRRELLYGpESKIGtSIATDLWEALA7LSt~HLIDLDGfVDFIDVDVDIDGMEKNOIOFLRJ1TFPNYQLIT
PJIILLDCEIESTPRNGyBIVfLTRI.NVCS1CGSP8$PT)1tS
DOf$KSFLIFGFtl-BIYPKLL4KKTPLJ1ARLDAfOREASHRFTOVKDIa.LLSLKYGFPL11T
CPn_0175 553850 551685 ATINOIfSRAROQLICNLL1DTIV'PAS17GFCRSG1ROSLIGYLHSLBSNELODILOmVlm071 glgB-Gluean Branching En~rme EANDVAAK1TVPLOPFAVCLINSDRD1YSEEFtIENPVJN4iCFt11CISPERDRRIFLIRPP
PSHVDKLIHPWDGDLLVSCRQKDPfIKLLtcILASEDSSDHIVITRPCAHIYAIrrrrnHMIYOCLLpRHPRTCOI~IS
KPDSSNP
aVAYRSCLFfLSVPKGICHCDYRVYlIQNCLL,7IHDpYIIPPPLWGEIDSFLFHR01'hIYRIYB
RNG(IIPNEYOCISGVLFVLWAPHAORVSWGDFIVIWI~LVNPLRKISDOGIIdCLFVpGLGCPn_0184 561931 BGIRYKWEIVT09raiVIV!<TpPYGKSFDPPPQt:1'ARVADSISYSWSOHRT~RRS70pS19Garo0-Deaxyhepconate Aldolase PVTIYMR.CSWOW00GRPLSYS
ITOIPLT4&SlJfrYpVTRSILKTOOLKSLVLHIVLILTF'1'YPLPRTLKOHPDI:'VFfrVpISPMS1G8ID8PILI
AGPC
G1I1D1Pf$RYGTLQEfpYFVDYLHKiNIGIILGWVPCHfWOiIFALASFDOEPLYEYTGHS.TLISYEH'IYSSALTV
IIFaGAQVP'RGSIRKPRTSPFSPOfR4CKECVIiiliK8J108IHOLPII
QALNpNWJI'FTFDYSRHEVTNfLIGSALFWLDKhBCIDCLRVOAVAS6Q.YRDYGREDCWITEVLOVADVEITAf7iV
DILRIGrIiO~IIHM'PI~d,OEVSKSHAPIIL~tSPMTLFJiWLt'.AIIC
PNIYGGKFNLESIEFLKNLNSVIHKBfSGVLTFA&fiSTAPPGVTImVDpOGLCPDIfKiBILYItaS8PSCPCNILCE
RGIRTfFJI51'RY"fLDLNfSIALLKEI$BLPVIVDPBNAilOItRiLV
Ci1ll~fFHllfIBLDpmRKYHOKDLTFSLWYAFOTSFILPLSHDEV111KTKGSLVfBC.PCDTLPL71511GLSVGA
DOiJIIIYHANPEKAt.GDAKpOIT?EELHLFAIDDIFCP$E8R71HAIB
WtRPAOIDtVLLSYQICLPGKKLLF?GGEFCQYGIWSPOItPLOWELI2tdBlYIOCfLRNCVSIt LN71LYIN0PYtidIpCRSQECFHWVDFHDI~7E1VIAYYRTAOSNRSSAiJ.CVIiHfSAS'1'FPCPnL0185 565993 5662=9 $WLRCF7GhWCELLLNfDDESFGGSGKCNRAwVCQDOGVAWCLDIB.PPLATVIYLVTCi381.1 hypothetical Drocein OPIORTPiRVfLWRFHIKOACKFYLLpCLLCALYWLLKYCRKLi.IOGTLIH$IITL,Ypui, $SLIDLLYOLICQLPAP1NE
CPn,-0476 551877 553858 CT865 hypochscical procesn CPeI-0186 56'799 566105 GRGRRADWCDCNIDIIIOHFRPYTMVPGpKLpIPGSLLYAOVFPTLWRLFSSKHEILNF7pThypotheeieal Droline paraease IAVOGPLIOtPAVFQDLHRGGI)1VT$EttYKYYLLPSGDClOSI100KLPSA710AGPLLSL1'ViAOIIRSLLKGNI
FHLGCGVLYE?U~1!'SLFLFPLIAIOGICLYVCRRGSKKVEORfSIffLRGR
HKHADWQNVRCRRDLKEILPLWFRFMM71PKCSYRDLETTaICSLVKTAfIORVLHRE1TESLKIFPLI~f'1'FIATO
IOOGVLLGAAEEAFCYCYGGILYPLGVALGLIfi.011CP010lWEG
IAPALLSIJILRGfSGCFLPRSYDEEFpGILPODCDPEGGVPFELLSYSPGMIODIFLRHpSLTTYVSIF111IfYGSK
KLRKIAFLLSAGSLFFILVAQVIALDRLf55fPFCKYV1'VJIAiI
COLVEILPALPPEfPCGRLIHVALPNILTLSIVWrKIITIRpVELHAEYSGEVFGKFCSSLVLASYtSTGGfRCWRTDV
I01GFLLIAVLVCGVSIhILSVPKSLSVLDPfOSLPC71KLSN
CSARLREWSERALSGSKRLSLGETLEIKAI:TTYLWDCFHKWIINPhLFNLVE0CMV0RCVAdSSpKRLAWMVGAGLVL
LLfNFIPLfLCStGAKACLIG
GCPLIDTIAYFCNPSLAAVNMAICVAILSTADSIl'TtAV50LIAEEIfpTWIPYYRYLVL
CPn_0477 556112 551844 GLJ1VMPLVAIGfTNIVDVLILSYSLSYCCLSVP~CfYLLAPKGRRVSGAAAfiJIGVLYCA
~yQeV_Bs Hypochecscal protein tGYGwVOIVSIJ~ELLAWVCSLVAFSFVGFIEITWIg4KVKi0T
RYMIVAEVKCTFKLVCLGCRVNpYEVpAYRDOL?ILCYQEVLDSEIPADLCIINfCAVIA
SAES$GRHAVROt.CRQt~IPTAHIVVTCCI.GESDKEPF)15LDROCTLV~11C~CSRLIEKIFSCPn_087 YD2TFPEFKIHSFEGKSRAfIKV00GCNSFCSYCIIPYt.I~ItSVSRPAEKILAEIAOWDCT381 hypothetical protein OGYR>;1IVIAGINVGDYCOGERSLASLIEOVDRIPGIERIRISSIDPDDITEDU1MITS5RR1CGISLTYSSFRWASF
RCYSLIFFCfCGSLFCSESLtY0LLI0DfAKVSEECIGLLES
R11'CCpSSHLVLOSGSN$ILKR~TIRKYSRGDFLDCVEI(FRASDPRYAPTI'DVIVOfPGESDKEYSLLOA1ILVLR
AL.apNSSFDI1Y1FRSFKKCOISYPELAHDROVLEEFCIWLR~IQ~IP
ODfEDTLRIIEDVGFIKVtiSFPFSJ1RRRT1UYTFDNQIPNOVIYERKICILAEVAKRV00KSVTVRAVSVIJ1ICLV
TDFRLVPLLLOSCNDDSAIVRSLAt.QVAVNYGSESLKKALVQ.AR
F1BIKRLGE1TEVLVEKV1GQVATGHSPYFENVSFPWCTVAINtLVSVRLDRVEEIxLIGNDDSINVRITAY0W1LLOI
EELLPFLRERAENIILYDSVERREAWKACLELS$0FL6TCV
Eri AKODIDOALFTCEVLANGMLPETTEIFTELLSVEHPEVpESLt.T.SALI1WSHOLQIpIKEFL
SKVRHVNCTSPFAINRFOA 11LLHLHCDPLGRDSLVDCLRSpOpLVCEAABMLC$IGIN
CPn_0179 557640 556210 GVpLNfEHLESL.iSRKMNIL.,'ILLLVSREDIERA~CDVIARYLSNPE?ICWAIEYFGWDiIQ
hfl%-CTP Binding Protein wNLACDTFPLYSDMINREIuKKLIRLLJ(VARYSQNtAVTATfLSGppA0Q4SfF9fiffllE
WHOGPLt)TIDTPCDpOSOSPY,7,tSLCARFDLPRKEGDP$pALAVASYQNKTDSQV'JEEHLDECDVKT3EDLVTDA
CF.IAKt cns: scuOKKDOASLORVSOLYNDSRWODKLAILESV11F
ELISLADSCGISVLETRSWILKTpSASTYINVCKLEEtEEILKEfPSICTLIIDEEI?PSSENLD11VPFLLDCCHHEA
P:LRSAAAGALFSIFK
OORNLtKRG,CLWLaRTELILEZFSSRALTAGNIQVOLApARYLLPRLKRLHK:HLSRQK
SCCCSCGFVKGEGEKOIELDRRHVRERIHKL.SApLKAVIKORAERRKVKSRRGIPTFALICPn_0499 57,1147 ~YTNSGK.a~TLLNLLTAADTYVEDKLFATLDPKTRKC11LPGGRNVLLTCNGFIRKLPHTLttitA-HIT Famlty Hydcatase VMFKSTLEMFIitpVLLHWDASIIpI.ALEHVpI'1'IfDLFQEL.KIEKPRIITVLNKVpRLPRKLPTCFAVNVTRSR
DHMR'FKOIIDGLIDCEI~IFENENFIAIKDRFPOAPVNLLIIPKIt rx;SIPMNt.RLLSPLPVLISAXTCEGIDNLLSLMTEIIOEKSLtM'LNFPYTEYCNFTELCPIPRF7DIPCDFJ4IIl IAFJr,:KTVOELAAEFt:fADGYRWtNNGJI~QpAVFNLHIHT.LGC
DACiWASSRYOEDFLWEAYLPKEIQKKFRpFISIYFPEDCGUDEGRGPVLFSSFGDRPIGALA
~.'Fn_0477 559431 55761ti CPn D49v 5'.IJl7 i')tlllr..
I>ttnP-Mtaal Drlperldenc Ilydrolaas~Z'79') hylxtthrrt t.:.rl prJthln AIC~IVHDtOSESICKLVFLCTC:NPE7CCPVPFCSCR\Y'ONT<;IHRLRSSVLIO'IQtIKTLVIRIVFAIFIdYF.
~.LV'fltEtAJt.FftI'/~aM)fFR::I':"IIDCCFIIN)EVTAf.'N.LItft%.VDI:?llfT
nN.pDFR1'pM,VAGV~ELDrNFLTHPHYDNICIaDLLRN'h'IV1'OR~LPLVf..:A..~TYRFLIR::RDPWL.~.
Kt:F.51\'(~n;lYaEIIKRFDIIIInY::YDt:::W::::NxIILIIYLKRtt:YNOCEE
' NKAKfYLFATPMIFa YIIF'IMffLVINYDE~hFI:RFF:'.Y.Er:F'r.'::F::D(fY.fYNPHEERETFCaJN)!X',ALtIPTIDF
~:LPAVLEFTTI.NEDCl70EEFU:IPYTW~YYQK~CfMr:FRFY7NL
' A
IJ:HLHYYFt)YDHth'IUaVHF:WETFx'Mt:LYF'LPItJIWNENFFFtJX:RKIIT'MF11P'.fP..~f lL'rDL(:::IDAKIF.':YLDNVKTLIL.:AI:F::ECrtFFIf:lili.'..~.IiL'IVFFJUUF)WHACIKN'D
Lf tTtlt::IK:'LFaEHD(tllrtIVTFAY0L71E1IL4ffL.
:~Wtl.Hr:IriNLINtIMIUY1':r~'~FJMN:LfE:KF:L::KV'X:IM.AVFf.'ItKnI.FId:IIWM1RF~.C
Vf4\L.kLTI.LUN.r: l 1 ~.'f~s114rr'1 .5'(375 Sr.H.:SU
~.'I'SHt ttyl>ttrMrri.:.)l t:Ptt 114'ltl :.')., hrr)LRIn '.'/ t 1 f ,1 ::Uh~tl:W:l:~:IIII;RFY::KKt9ppt:NtJ:::Lrt:Yl\:XaIIEEYKNRYF1'IOLCAfWiPYWn.-1'W'1 Ir/Imr(nn r,~.y 1'r.m.:m 1'V i WDVrJ:Arrn:Il.lyVL IH:KGHKF(rIMYNLW INIINlAA::I'1 t :b Lf::I
:L.T'VFH:P IT:a.W ALEpK:M:AIVL&~AMYEICILYYlG:f11 I'IIY:LYL IF71 tfAYFtl:f111.1!
: ~.~' I(,tVNLK:i 'JVHHFY)tr..:IV.:WVF!.f.i~.iPMLIVf:VMI/ttAPLIVa.:AYJVIiR'.tltl:Vl7AILCLFAIL::LW
U:VF'IvVIJ111t.F:LNYA12KF7.FIJl'/Lt'M:9rl.ltA'l'/WLELLI:IY:::FVv:KLYMItIWNI.
' HA VIC:I~.YtNHMF-Mrl'InIC(r::'.t'I:.hFr:Y.Yi.F:IIF'l'I'IJ:TItIGHLWFLI'If.lv71'U"II:F.'fI'Ir:
/In:1:In'lHhi'ILJIIt:IIEYfTOf.'Flt'.ROIONIii(WY.:/ITEYrA'tY'\t~:VFITKLPNGSRR
FLPIIt::K::I:PRrIILKIRKFLPL'IGNVTORPPVPE.W n'70- '' ~ IKTRPLNIRTVFAR71VODLL ~rDC-N:iP-7O ~:ot.le.~c i'QIILpIITMIOILEP11'OFD'17DI'IEFY;..~.'3EPICR:PLEFFTLtPYKEH3FFfYR~Q~E.DVIICDTPP
tI0EI9V ~ .a)~' EKNDKti 9 81,1.
'LGipOyFRVFE.iIPE0E00AANFLaK.SELLEt.~r~ItKPRI3P3DERNARLIOKNOKEROELhOYAL>C~1'".I
JZDR'WPI' AT'CIIn E
IEDQPCFP!'LKANECOHITSO.f'ttLF.TRYFPSA.iLKCNPLSNYSRYYIQNfYFOIPSPT9GItEYSSIGOKtNP
ft.JIEAVQTEFfSEVPFl..'fLEEFAKCYKICERPIRVAKVKVJ1KJ1P
' K~
t v EFFSIRIOR.iFLLDLYFA(:I~OLEvKRLLOYIKRNNKOVCNFVP101QAEDPAGSY!TPKBHIIE
:LI~aSCLI:rCDYQEFLRELLT~.IORL.iODFt'IPEFPPOTPLAIL'fCOCSGAN~JItIRVAT
r r er .
r.iILSCCNL:::'....~'1'CNAYVEaF~SYAIPOLLERQAD~LA~I~~58425 53n31J
tK:SEiIIMNCL!'CLSSAXAGIA CPcy0501 .' ~:.Klr;KMWPVFLIGPVDYWItSKITALYN3NHAVLTI..
'.::If.". : ..cl,...TFir:,~I.. ..ay~....% ..
1~
y~~~NF
FF
~ IP
':Pn_J4'J1 571595 ~.'lJJJ4 CITA
~YI~
~V?EAVI:VPAY
PN~OPASTKDAGRIAGLDVKR
1CAOILlDWKFI
~1389 hypocnee>.cal 0soceJn AAt.AYCIOKVGD~IAVFDLOr'GG1'FDLSILEIt~GYFEVLSTNCOTf.hCGIDIDEIIIIKW
NSCYSWt'~tLFSFLVLFVCCI110CIPLCPOCKYETKSYIJtSDOLPRLKDAAEKAKILL9C1ISSTEINpPPTM011 DCPKHLRLT
AI
ILSSLY7YF11QITAF :p ~RTOWIEIDiRAYLFSLPVDSSLSEAITNIVRDLNIE~EGIDLSKOtlI
' YKELICK
DVLLVG
V
I
A
IGYVOSLL~01FL p DNGNNYENDCYL KCIO
IPIfCGKDCHiLP~TfTLFSPLfADP CNSNIIPA
I
LTRJIpFEK4AASLIJ<R?KSPCIKALSDAKIS
PFICAYEICERPYCECITRSSAERPLLPKEKTCOEPN10CVNP0EWAIGAAIOOOVIGCI:YKDVLLLZNIPLSIGIET
IG~LVINIITIP
~~rILLRLfDVSRF1NDCDPGI00GVF>i~t!'~.DttiVfIDID
ROVINSAGIRFNEKtIVQiIIVOATIlr7 '110KK0IFSTAAONOPAVfIWLOGERPNAKDNKLIGRFDLTDIPPAPR~IPQI
HPNFPRPNLSD~VDLF
PESdIVIiSDFPVAGWSG11Id0t8FRFRW1~.S5H~EFILTANCIFHVSAI~IIASGKEOKIRIEJ1SSCI4EDEIQR
NVRDJ1EINKEEOKRRRF3ISIWlN6A
RLYGCCCYIVSRIILTFPERPFYCEWGAE4RPrIIL.RNOE~AQPIFAIfPETLVKEIEERIENVRHALRDDAPTLlII
KEVTmLSIOt~II
iSFRYTPOI DSNIFRAEMIKDYKEO
WEOQIIFGLDOSYILCNWAIVOEfGRKIMVLEYNQOFSK~OFIRtPCNYYGFRLTYGFce~lo~sAtAAASSAANAIOC
GPNINTEDLKIwsrsTKPPSNHCSSr~HItGCVCIIDeI
CPn_0192 571617 571801 ~ .
t homoioQ present >.n Genebank/FlIBL
as of 11/1/98 586118 588511 l N
xls CPn_0501 o ro vac8-ribonueluse fraily LFSLIFPICEERNSQQTYItHLNVESACFLLFSPLKINWSSPYCFPPPYRROLKL
ATOPTS>"1'IGF:NOCPKL'~9LL~LPKRKPGRR1'YGKSLILIFIPCTLFVNARIOGFOFV
=Pn_0197 57514? 5718s5 SPONPEEYPFDIFVPAR~.RGALOGD1NIVSVLPYPRDCOKLRCTISEYWtCKTI'LVGT
nt in Genebank/F~Olt. as of 11/1/98EDRSPALON
' LSTPPWVDKP
l C
t ~
op prese O
No robust homo Iv SKTEGSHSKTSKGFUCRFVGWIRTf'IGRGSKKRSPSSFSP1'HPYIRGRTYTRSPR09aVE~
IL
ITSLVSFTBALAYTSIISCSOSLIPVBLLPGRTYI
LEFIQITf~IAKAOf0AI0AC1fNL11ELFPPEVIEFaSLFSOKHITOVIJISRKDI~DLLCF':
RKpEOAETSFIETP10GIL10CQ~DPKGKIiVIRJK1MI
HLDKEAAKRCNSIYFf~It IDSgPARDF00AISLTYOIRirNYII .L'NfIJIDVSIiYVTPNB
_ _ 5~~~ ~WI~~I
CPn_0191 575370 515116 ~GOtr3fIPLSKItJtF7at'II~KK1'SDIREERCCIRFVLPSVZfLSL~PVALIItIbTFS
~c in Galebenk/ENHL as of 11/7/98 IDiKi!'DIT!'1'PfOE
l L
No robust homo A
o0 Pr~
HKi.II6FNLKAIIEWAYNI8N0GVSLPFItSHEPPNOENLLaFOE
CSASVGVTS~1~LA
YINIRVNPYGSYRC~RNPSPEDGKKDVPLSCNSRLNRPOOIAR~PDYOnS.fTLTS71GIIPLDOVLHSQFVitSNK1' ASYSTF1~80GN1IGL%IJ7YYTNlTSPIIULYID
SLEKRVlOCISLANIrIC LIVIHtLWNPL3IDQT1Q.EI
IVRAGSTKERVSAKAF1i5F1~il0LTRFIMVLCWP~
NAYIITANHBGLStIIVTEFCN<tGFIAAATi.PICtYSLKKriALPESIPDKIatIGJISIRSrtID
CPn_095 575507 576793 SVNLLTQKIVWSIA1~ICPl~IK>ITPSK1IKDTKK
aspC-ASparcace AlninocransEerase KPOYIISEISIOG.iIGFRK
EMt~tIWPRFSIi . CPtL0505 588471 589106 , ttRLK>Q1QIWAIOKAGAFLRCLPSESRPYL l ETIPEISVIDLSIGDrtOPLCRST?OAIKEFCVSOZILQp~''t'~
YENRISPEEIFISDOAKPDIFRLfSFICSEKfiGL~ODPVYPAYRDIJ1NITGIRDIIPIJICase .
)-alechYladenine ONA 9lYcosY
KRIiRK'KEPIOCCPRNVLO>I~rLSEwTTLl100LLOfiKLITTHOGLITSC1LIVLT6J1YR
RKETdFIPELPt~IQOSLDILCIA:YPNNP3LZVL'1TGOI4ALVNYANOtKTCVLIFDJ171YSAFR
IPEAKYCAIEINSFSKSLGFTGNItL7lG8tVIPKLLTIDIdiEPNINDNKRCPOOIU1CHAYN1IRK7CMIMlKLI(O
OBAYLYRCYGMNNLLNVV1'GPEDIPNAVLIIU1ILP
ISKOIISQI'LT
PAL
VSDPSLPKSIFE Y
OLFPTPPAISLYLTIIAOKWISL><1'MFfV110CDHAPYL'DOGK>ILNIORROtIRDKpIHLLTNGPOKVCOALOIS
LENNRORLNI
FATTPNGASLLNQCAOYYGZ KVLS
: ' . LLSPI9SG
A?11RIGIDY1HOEYRDIiPNRI
WVELPEGISDELAFDFFLttQYNIAVTPGHGPGSCCQGFVRrSALTOPpNIALaCDRTL'1'A
SLKITNVIJv cPtt'OSOS st9ass ss9tlo CPeI"0196 576751 577811 CT131 hypothetical Dsocein CPNEISPIPRRICKSFILNM.>a.YSKETN7WFLISCRRfNKRYFITiit.VILLPLiIf?IAI
CT391 hypoeMSieal Drocein VlIIIItIFLTOPIITZaStFF>DCFSFY17UIMLIJO)VLOfILLP'GLrPIITVLtGFLTIItIII
SCMfILRIUSOYLFFt'SLICSFIYVATCGSOFOSVSSPICIAIFLSFPNVNIIPPPNNiYOCfGWA
D ' ' ' PPLYRITKRN IFCSKECSFKQV
PLLEDCSKSCIE1'LKDf84LPEIWLJiiIf~SIVKARKTARSLFtfOACtWAIVTLGTI11TK1 FKSIiiIYDIIILHRfPIIKTVYKM00VN
GDAPNCC'TO~PLVTVFIPTCPNPTSGFLTLFRKSDIV!'LCNIII6DJ11KYII80DV
VIISNTETORPVIY71l1VPDILESLTLPKNtI4tIYfs'VI81LLDIt7I71ICFAI01WATN710fIVYLPSSPLPD
EiJfOD00S
KPSBPFPSDLOKEIVKKLMSGIEYIEISITSSTFlCfItIR0AI010tPS11IFIPLSPLStOCLSTPNAC
ECfAFLOEILIfIdCIPISTDDfSLISDGKCI11CSVD1fRK8GKQT71KTV~LYN~~BL
0s07 SH198 590122 CPet RKII71QRLSPTi'1'PNEDIIKYLGIKLiDCTDINOpLSE'KSAVS-Cf431.1 hYDOChecical Drocein S'fPYPQFPLSCEIKI~'NIELFlIfRNSKOARRRAKSPKKRKPRYAIVHPAPAPItIVYaJCf CPeL0197 578107 5178/0 NALSTSDSIFIPKIG
CT388 hypothetical protein iPQRWL~SWILCVKVTPKAKtNICIVGFOOQALKVRVTEPPT~OGXANDJ1VISLLtUCAGS590133 590300 l,pKRINTLIAGETSRRKKFLLPNRWDIIFSLNIDV~CPt4..0508 Cl'131.Z hypothetical protein SRINSRNRSYGKSVIICVTKPIIVLIDiFERVEVLR)aGRWNOSTA%KVICLPRTPILK
CPn_0198 579D62 578085 No robust hanoloq present in Genebank/EMBL590808 as of 11/7/98 YCRLRRAPFIBaRRKARWVVALFAffrALISVOGCPWSQAKSRCSI~fYIPWNRt'I'EVCCLCPn_0509 590299 PEAENVEDLIESSSAWVLTPEERFSGELVSICOVKDEIIJ1FYNDLSLLIttICAVPSYSATYIPr~iceed Netalloenzymel NKFVFLYGNFIRV1'QEKIKItIVSNEOTCIPIHLVSVEKLVLTLLF71LKVTlIIEIFIYILE
DCAWPGGPLPALRORLDFLVRENORCVRFKICIVFL.CGERGRYOSIEEDfJiFFDSRYNPFDMtaELHDKVFADPSLT
D'lITLPIDAPCDPAYPtM.CEAFISP0AA4RFLCJISPNDm PC~6IESGNRVTPSSEEEIAKFVWMONLLPMWRDSTSG11RY1'PLLAKPEF~1RWANRKIYEEISRYLVHSILIWLGY
DDTSSEEKRIaIRVKIaIDIIGNLRKlOIALLTA
VI'LLLFRSYQEAFPGRVLFVSSOPFIGLDACRVGQFFKGESYDIJIGPG1AOCVLKYNNAP
RICLIITL7~'ILICETt~CLNISEGCFG
CPn_0510 590801 591971 CPn_0499 580104 579705 clyC-C85 Daalains INenolysin homolopl OLNNLHILWfFCILLFLrItGLTOPSCHGSSKFLIrfItJpRFFKDKGREYPPFPSAPTILA
TLLCILYGALfiTKLYTLLPPK'tJWKDLLIWPLYSLSALIAYOFLPPNISTINPK~iI'IAHL
No robust homoloo present in Genebank/ClaL
as of 11/
LaVYLLIFYF~.FX;STMSSVNOSSGTPNPEEV'fSPESTEFl'IKNWSSDEiI0ATN11VALPIVRFLASVFOLCLFP
LOLLFYRRRPNQpVRSSI'SFOSOLSEALSAF01'R.IVRIVNIPKVDIF
".'OLSLPDGVGTSSEETASNPRVDEIVAEVSSSMVADQISSLVfltVGELLODLt(C710SLFOEALVLVSEEGYSRV
PVYKKNLDNITrILLVKDW.LLYTSSHDLSOPtBSVA
ALPEEITL
TSFOSEL1CJCLPAWKSSTRRL>:T'AGRGONADIARLELERSDIfAVLGNANOFfIGKAHLIL.
SKLTDVNHKLOCLSREDLSIrIFONNDRVLEHt.GSLGI.aVpilECiK'ISLSCERGIPRLVLTAKPPPYAPEIKKAS
SLLOEFROKNRHLAIIVNEYCFTECIATMEDIIEEIICBIAD~W
~aINN
V
DSNLVOIKKVNLPTVEELRTL.OCITESSSDPRVEEStSCCERLLNELRRIbIANIVOFISSFHKVOAVPW
ENTPYKKICSSNIVOGPINISDAEEYFftt*IDNENSYDTIiGGFI
~YONIVEYfJBIIVRRINLLPCLGCLPFIGJPDASOEDORSS~ERSTRRERLSRRSDLSEECNFDIEIITCTERHVCKt JIITPRKRKt?lIS
FlItVMECESINPESPHCDCRNpPSttCDKODSDSEEETEL
CPn_o511 5:3111 592488 ,am_0500 59064? 58236: csbV-9iqma Ratulatoly Facto:
NSDfOKEEHCSTTIFHLNGKLDGISSFLnIOENL~>OSLaAGSKNIILaCAHLDYNSSJYDIR
pcoS-Prolyl cRNA Synchetose Vt.IQ..'YtIpVCOHSGKiVLTTVPKTIEOTLYVTCFLSYFKIFNTVDEAIOTLMfD00 ~PNSNKTSOLF'tKTSKNANKSArIVLSNELLEKdGYLPKVSKGVY'CY'CPLWRWSK?!8'LI
IREEttrAICGOELLLPLLNNAELW~GRWEAFTSEGLLYTLKDRECKSI~LdPrNEbVI
2 5x3538 CSFVAQWLSSKROLPLNLYOIA?KFRDEIRPRFGLIRa~RCLL1IED9YTPSDSPEO~'~OY.
CFn 0 EKLRSAYSKIFDRLCLAWIVTADCCKIGKGKSEEFOVLCStGEDTIGVSOSYCANIEAACT125 hypothetical Droeein GIIPTOWLAPAT
::;iPF011AYDREFLFVEEVATPGITTIEAUWPFSIPLNKILKTLWKLSYSNCEKFIAISLPLTNRRSVCYVNP9IAR
rY:OISTWKFLYSLATPLPAGTKCKFDLAGS
V
~
' xIRGDROVNLVKVAiKWADDIaLrISDEEIERVLGTEKGFICPLNCPIDPFA0E1TSPM'Q
ILTf .sEIIEATAIPVKDHPVPOFEFTLPYCLOVGG
DLSOTRNVIYAENPIrJ
WVIOrK
.
VLrDACWSAOt.FAORRKPFYLYIDP9CE~.NYOEPOVFBI~IR~IVLKKIEIPTPS
St-X:ALINAKDK11WIJVNWORDLLPPQYGDFLWEEGD'1'C'PF1IPGHPYRLYOGIEVAHIFN
:.:TP.'rt'tY.:FEVFIFQDEH~TCQc.IrNCTYCIGVGRT(.AACVEOt.A~RGtVWPKAWPFStRFDf7YRFE0E
FGNL'fLIF:t'EETRtEL~IEItLRE:><.NWpLFIPETGPYILPNLYfNI;PCI
FI::APIKCFADSAFtItJetf:LLHGESERVDSED1ICICIRtYFRDORAL
' ' ' '."tAFtIJ:DTV;.QEIrIE"LYHEW:OCYEFLLODROERLCFKLKD:DLIGIPYF.LILGKSYIRIOLKNL.S
OET
NFYA::.~..~.FEtIQENL::1'DIWKLIN01Y.~.LFNEEDPFTTL'~'I:) JY3GEPNLmVRNIGNIKE
~e::Y:IFEIE:R:.GCKYTV.iPEnlFt"IWC.~tRlLJ1'PKSII::KMKf.YKH I FI 1KLTK:'fVfNIf7N
I:: I i'::FTA.9KEfYiFDFC'IFIfPEFERVVEIYNAN
':R:~1:~1 s~7n50 ~:$::FTfMIIMPFPtCX:KI':x01'Rc.9'VIFl:LKKIILRIY:FVIVSt',LppRCIYKDYPDSPQW
' fn USUI
'l::It:f:fAIItaIKYTItF_'ab't'\Lt'ANllv:'IA'I'It:PPIVL:iFtIITOAPIIf:SELSTOSKPOU1 V
: ' U Rupla~.anr mrctiPtuer ~:l~IfPl1 Tr I
. KAPP
. NRIII:Y:11'/M.TALL~1'VKt flttY:I:VLIPfFFI'D!:IBR.DYEYGONVPLt:VTGKOPN,t rc ' 'a:::F'PIHL:-.~T(VLVC:W~JMIt::KVSKR(Y:KILfSILFATTELYLh"IIOIrI~P'7SKTt.K
ILLTF
. lYNfiIN
t.ay:::h4a'1'fIHNYF'AliI.PJlI:u:F1*KNIIT:XX:RI(?DWLRIIWIfItyI:FI:PFJ1EI.:APtVFY
YLItVi'c~AIILWW::.:I
NI.K I:YJId ::E::I<N l t KISt hKA'f EL4:EILD4PTFF.~.::ARPFNO::VTN ~'tny'.1 f ~.t'.1'. '.v'.7'::
f Q I1'G'/MIORAYt' ' ~1:7LYNEV rvlr.r..Amn.n::.
:L.:CEFP~It'Pfff'C.WL!'l~~t:UCI.::IKRIEKFLONYtPYLP'1'NEEL::KKEEHL.. :: ~
' n .ELLtllC711iKC .
WLYITRriYII':;I:EDt.YrJi~:N::KI.I*YC,1FKDPEVLAL<~I::LFENRR~WV..
LFAN
VI'IrYA.P::I*~:VKNUU4\KKYIIYtKr~::PVia.)IIi.E:RIVIi(6'T/lplfl'l'CLl'(WPKT.'uPLY
::
' ' ' KFJU.1 I
F'F:KIJiM/Ft'I::DRUAIJII.I.tJ:IIJKI:I/,NTikRli'ADV1IRY.qPVCO'l'VYY::.:Tt.YLYP'M
F
LLK
INI.t NA'fAFI':Kt:lV.hfla7P:TlN:C::VITII'YITIIR:iPL(:ALI:IL(a "
' PI, n'I ~F: a .'K!'r.'::F'lAKft il n'M.WI.
l:: aKL.LP::KR Y::1'I Nal.l W~ 1 vl l I Y.Ti' w ItIF:Piav)::F'IKt'KI:a'NNFH;i:ak:KU;NFPITEVH IVIY:I:FP."(. '.NldtlY iDLF
ILRTE
'fKIKE-fl7lJtIIIKALTAfE'fAYL.iDLGIJLaIRG'.t'.f'n t15~5 X' _ ~'t"397 ..KDIIGLDSIP~GAEILYDKCRN CT77H hypnchecvcal 0rocafn FLAPKRL:.:::DFLNIHYMAlIOLGtHSNtTMLC'YHKECPCDLV1'l0IVKVPOLODETOGFKNGt CFMHDALL
iZWIIOaL~L'~IR41111VHKEHCK~AINO~L~
~~
FCLLKFACEMLVLCKRLRK.',~fiALPLiLiIIIAVARtFLDNfSPaI4KALWNYLGtF~IALDLLI~
ICTOIRDCCkl4RI0Ehr'~O'INKL~OOME~LRL'~
Lx'CANDL,~>.~TIR4CEK'IFOMA,i.iKEP
KOAGC.~.IVSLKE3LASTflrSSSVIEKEIFE:RKKCNECGKALLCpRTELKNAIttPELl.
iIYERLLi~1'IKKDRWVPtF?1RVCSCCHri'LTPONENLVRKKDR1.IFCHCSRILYWOCSQ
CPn_05Li 575690 595530 VNApCJSTAKRRRRMAV
~T4-7 hypotnecscat prota>.n . .;~' .. . ._ .
~.NfT,PHN".TR~JAM.~.tFit4YiiIllCY.iftN.iFPLSWLIKRNDIRCJLAPPADLINLL.Iw v -R:; ~ ,:~s..s.m : :-.t.s. ..
'..~:1\C"FF.':.'. ..~\T:.-... " . :.':. ~ I:F'.'."'iDL
VFNL1EK~F.%(:'(IJ:LK:::i::KFTrHM'r.iiMl,i:DW:QGawiKUKvIV9FFF0AFJPKW1 .'.FIILAW .-"~.;;, .~.. ~~.~..-LLt.:D.V.:.;;::'::.
' C1.PP5 OCJ1EKILGMS<.WVFFSGItLiK3fICYARKLVATLOSLSERIIf.FFSP11DLIJ(Cr~L.VSPGDI
nV'GWYDLTKLPFVFALi.:.fiSC'.:WiCEHFLPNLWEEALaQFESSPEEVLKEAHONl LLOEYYI1LCOYRLGEEH'tEiFEKFREY'tCTLY00ARLVCLFSKSGETOELI~fVPHLKSRRAILVAITSIIPYSMr IALSOLWILP>;VAiLDPffIC.I
~~I~~~~~
CPn_0515 596450 597181 HLG~NSFSLEVFSJ1YCCCC'JCIVDPOFRtl'CIFTDCDLRRSLASYOGEVLiLBLEKVIfi' ubiE-Ubiquinone Meehyltransferasa ANPRCITEDSDIAIALOIJIFSSSPVAVL
EKNI'fKALKNSGNINEPSTNKPDCKKIFDSIASKYDR1'NtILSLCfINHFWNRSLIOIIf~S
GYSLLDLCAGl'GM/AKRYIlUW PQASVTLVDFSSH4.DIAKDHLPOGSCSFINSDINpLP
' 0527 609910 608726 CPn YSAHKLYL _ LENNSYPLaAMAYGLRNLSDPHKJsLOEISRVtI~tPSGKLCILELTPPKKTHP1sueH-Dihydrolipoalaidt Succiaylcranstvcsse RAWPWICKSVSKDPDAYSYLSICSIOQLP(tDt(Dt.I~LP.SRSCFYIApOC%tFLGAATIyft,RY!lItCFRFPKI
GLTSSOGSIVRfiLKNLGDNVARDCPLIEVSTmCIA'ISZ.PSPItA~tLVR
LEKQ
FCVN(93D6yA9COViGLIFi.CISEADDESTSCPPTBCCTKSPJVGSSSS&ttl'fPSPAVLSL
J10R8GIGLDF4.OKIAG1GKOGRVTRODLEAYISESOOVSIPEIF004VNRIPIISPLRRJ1I
CPn_0516 598909 597255 ~
ASSLSKSSDEVPitASLWWD1(T~1LISCZs~ORFI'CfNCVR..ITSFIVOCL~TLEO
f 11/7/98 No robust (wmoloQ present in Genabrnk/tJ03LFM.LI4GSLDGITIVIeIKSIIINCVAVMNKF~VVVPVIHNCODRGLVSIAKAIdIDtiSIGR
as o VVVRDDDSLIIIRK
IILnVLGRAIAKAYYVCMVARGLCDFPTLVPNERLPIGPPEVPOHTS
RISISFRVSWFVK LNKLDPS~~~'~"I~IIRYPEVAILGICIIOKR
WDliLSVKSDYEEJVGPAICIRSLEPO
A
' I4 MVYM'LTFDHRVLDCTYGSEPLTSLIO~tRLESVfMG
SIISGLDDILt(LCILORRPF
w110CKEFAKRNF
~RLKFPKSIGSKDAVIVDSF3~lVPVN
!' F
VSOISPAHCRLCSTLVOWAPILGSEEOLVWLEE
T1'D1TSGVSFJ1AAAE7lAVDSTPGTEE
' ITIPAJISC CPfL0528 611165 609921 ANPfQCIPMSETVESSPVAPCNTTD t PSPSLRYALWOFRfPYPEPPKEPEVMFTDEEICSLILE71TRARRMQ.DLYNCIILiIDItEL.'K
DEIOKIIVPDLPENWRtNWRwSERLYKFPFKTKKEGLEEI!'LMCELGFHIt.ARGZ.RATOSQ:
t?1cT-Glutamate Syspor Ll~OItJ~CIPIGLPSIG1RCGLVLEGIAItFKPICDIFLNLLSMWYPLVPCSIlV11GI1LRIS
AitIKVFNSLYAkOi.QSFNV~GttSCTIIKPLPTSKLDLFKSEFrSKPIOatILTEFLVASDEEIDMOO.GRICIKSt IGLYIG1TALAIVICLCFAt4IFSPCNGCDFA0AQ8~SAVTVIf IRT
LFKGLRVLEPGIEL7fYDHPt>DAGEIRSVLEGLVO11GRISCYtitNOPPGRFYLRGVG~AAY!'CSIIAQVFPSNPV
RSFAEQiILOIIIFAIFLGIALRLSGERC1RWPJtfIDD~N
lCQRFKSCVR'1'IO.VG&F11DES GCLVAPG
FFESSDEEGAFIIDNfPSKTAt4 GEIlt .
LIIMItItIMSFAPYGItit181tAhtISCWIGt~Vti~IGKFII1IYYLiICLl7IATLVI
p ELVt$i.ESLV11S
LPItiGRFTILV ca,ISrsxFLSS~IISUIVSrASSSATL
GTAIF0~4AAWI~If~~~'~'t'I'~ATFSAVCliMYP000lITtGSf1Jt81RL
CPr~0517 599637 598795 PIOCIAII~1GIDRLRDIVGTPIQtILGOAWATYVJ1SG~.SPYESI1C0E8VE1T
f 11/7/98 No robust hattolog present in Ganabank/1?18I.
as o 65 FIMSSLLSCGRIEPTRV1'CSLKTYLEDTSONOLSTRLVRASVIFLCALLIILVCVAtSSL
IPSIMALATSFTVMGLILFVMSLT.GtrifAIISYLTYSTVi'SYR~(1DIAFEIHKP~SVYYECPn..05Z9 CVR)811DLCRSSLGCGEIPIVRTLFSPFONIIGLNHAL.71AICIPLEftFJIFSPGPPFIEPt.VDwAYc_aH-ATPas_e ~PSTLFLFYRRVTIAISLEGILGCyOGSLLSIIVPAPLVAL111F
F
~LIRDrRPHVSSLCFVIKQ(iSSLRTKDCNI'ICEAFRSI)YOhHFAMVDCYRLZHSKLIIERFSWS1PYRARSTVI
KNGLIfNL7I IPSVMVREDYPSRPGF~YRt:GLLIUnt'sGKG7II.
KLTWDSIIVlIBA&YVCDEPtiIIAJDILP6SVWVNKaIIRISAARJUIEKIGILI.ItOGLOYR
KLHI~IfEIAWN00DPtGGRAFFPItGRLRDFPLRLKlYDAIIVN00GKEJIG'1'VVIaV8f17t CPn_0518 600806 59983 POIFVKPTIASVVWIIB~RIPKF~4RaRVCVFCCLGFPOGFiHrt.REIHILDKYLL
CT119 hypothetical Drocein I~SVIG.PRLSGEVSLLPIAKVCIItI.iVNOD
Flail'IfPVPONfLLLRILRI~~iiFSRSDDEIiDFYLDRVflGFILYIDL~fDOf'~Ci~RCIYOEL
EiztIIERYCLIPKLTFYEVKKZlICfFINEKI'YDIDTK>Q(FLEILOSFLEFIYDHEt71'LSLtiK4IE0IHKNRO
N
AELIDOFpOfYVERSRIRII LKIHLFDAK\ICIICITQ
ARQLLSNKUCIYYSNEALiJPRPIOtGRPPKOSAKVEI'>:1TISSDIYTKVPQAARRFLFLPECPeL0530 613323 611160 , ITSPSSITFSEKFD'IEEEFLANLRGSTRVEOpIHLTNLSERFASLKF.iSAKtGYDSGSTGspoU-r~
Nachylasa SVVL~1CKFLWARCCSLiIPWEFCSl4OCIGK10QPLVl(F31UIL.KRSRCWISStiPL1f19311REI0 DFFGDDDEKVVTKTKGSKiIGRKKSS IK~iPVAVI
t.DSTLtIQLdF
KALRTr,YLCOHVrCSTtILSItXiXEFLYf3.KmiSTICILYC
_ _ QKRWIMOmFI'IQPFYLIIOV~CPOFNCiIZLR
ADGaGVDDV LOIPIV06Y1tP
CPe~0519 601707 600901 MMtSSLGAVFSLPILSISRE6GKELFKOEDwLYISrTSPPALTMYFBKNYLGPIALVIGS
dapF-Diamit7opimalace Epimersse EKDGL?E~1FSEOlSEIALPMIGCSDSLM.ATSYAAVAYEWRQR1NN
OPT1G.RILVYWMAFYSPSI'ISKYFIYSGMIiRP'L.LGETLPEVEDVRFLCOECRVDGFLYL
KPSSCADAQLIIFNSDGSRPTMCf~iGLRCI1IAFQ.i1S010CKSDISVSTDSGLYSCYFIfSYtD
RVLVDKI'LADWRJVVHRLESAPDPLPK1LIMIfffLlfPNIIWILPEISIT.DLSILGPFLRYSJ11!
depasdant methYlcransttcane HOTFSPDCVNVFtPWICGHCOLRVRTYERGVFGETAACGiGAtaSALWSNSY~IKE.SIODS&ImDFRKEKC'Rr RK501I~R~K~RHSKTYFSLIRERLVMDYKLGDSGtICiJKLt~F
IH'tSit7GELMTVSONRGRVYLQGSSIfRDL
GPVI'LIRPSSYAVWPKSRPEI3FSQAAt4YVRDGERGAiiIO~IFKRLPLBNEVAPStNRCLLK
05I0 601233 fi01616 RTPFCNLCYFPEIBGENPALKOJIIEKfOCERWIi~t.FAYIGAGSIPMKOGARY111V0iAS0 CPf IYF
L
MVAtIAQW4V~I~tAPPEIIRIFYfVIEDVISFLKItEIRPNKKIfOVILL~PSYGRGPOG
elDP-CLP Protease KIDImLFPLLSt.CSKLLRDDiI.SYFLLTSHTFGNTPEFLRAIARRSVPlLVSPAiiSC'GESF
ERHYFlIADCEVHK1.RDIIEKELLFJvRRVFFSEPVTEKSASDAI1GG.WYLE1JIDPGICPIVF
YINSPGGSVLVIGFAVWDpIKMLTSPVI9WfGf.IILSMGSVLSICAAPGRRFATPNSRIMICGiGiIGJILP&GStV0 IpNYVEJ1TNOPRDIIEK7IIt>ROMWltTAl4FaCPrt0572 611716 614075 KDFCLLDGILPSfNDL ribC/risA-Riboflavin Synchase ESFCCKDSVVIMOGIffSGII0E1GlbIlCFFEAOCa4CLu'LCIKS'fPLFVTPLVTCDSVAVOG
CPn~0521 607807 601211 VGLTLTSCNFSKIFrDVIPCI'LACrTLGEIOtCSt7pVNLhAi.KMGDSIGGNLLSGItVlCT
QlyA-Sarina HYdroxymethyleranstaraseAEIFLIKPJJRYYFRGSKELSOYLFEKGFIAIDGISLTLVSVDSD'fF'SSICLI
PE7TQRT'fi.
NFEKFKKFAIVEIFTIIVtIAWSLLNKFLF3~ASGKKGOSLJ~STAYLAALWILLNAF
KSLLi GKKROGERVNIEIt)IISfICIQVDTVKRILASSCKD
PS ICERIIDELKSORSHLIGIIASENYSSLSVQIrIMGMd.TDKYCEGSPFKRFYSCCBNVD
AIf.WOCVETAKELFAAOCACVQPHSGADANLLAVMAILTHKVOCPAVSKfGYK'IYNELTE
CPtL0533 614918 615385 EEYTLLKAfl4SSCVCLCPSIIiSGGHLTHCNVRLNVMSKLIOtCFPYDVNPDTCFDYAEISCT106 hypothetical pcotein R:.AKEYKPKVLIAGYSSYSRRLNFAVLKQIAEDCGSVLWVDMANFACLVAGGVFVDEENPEYAPHOCPFCNHGELKVI
DSRNAPEJ1NAIKRRAECLKCSORFTTFE1YELTT.OVLICRDCR
LFYADIVTT:Tt(KTLRGPRCGLVLATREYES'CS.FM.ICPLfOAGCPLPNVIAAKTVALKFIvLYCNFOESKLIItC
IaLIASSHTRIGODOVHAIJ1SNVKSELLCKQNREtSTKEICELVNKYLK
S'JDFKKYAHQWFBJARRi.7IERFtSItGLRLLTOLTDNHMMVIDCGSLGISGKIAEDILSSVKADHIAYIRFACWRR
FKDYCEIJIEIfL.LSATPOF1EK
CIAVHANSLPSDAIGKWOCSGIRLLTPALTTL.CMCIDEMEIfADItVKVLRNIRLSCNVE
CFn_0574 615389 515784 =Pn_05~2 607835 601655 dksA-OnaK ~uppressor WFTRS(tWPta'DOEIEOFKKRLLFI41WILSHTLEGNAQEVKKPNEATGYSO(pADOGTD
':'C433 hYpotheci.-~: protein TFDRTISLEV'ITKEYELLRQINRJ1LEKINESSI.ICDVSGEEIPLARLIAIPYATMVKA
REPLSPEKTSL1FKVKNVNpRMIKKNQGKKKNYFQYIPLKVpKLROPSFYPKRLMTLYLC' ItIOKTARKYOAHYLPILTLFPYAKSTPONKRALQFLPOATHVILTSPSSTNLFLSRMTSLCN
OEOFEKCLLs LSYJITLKTK1'YLCIGEs'TKERLLSFLf'.OVKYWaTGEIAECIFPLT4aLPS5ARILYPHS
CPn ::LIRPVtREFLYNRFTFFSYPHYTYKPRKLKKNILSIC1KKIIFTSPS7VRAFAKIFPRFP_ lspA-Lipoprotein St4na1 Peptidase cYT'fWCO.RMTLOEFCKFSSOKOVSLLET1JGKSRTSFKRTPCWKLSSMJ1TRFRSILLVITLP'VLIDNV1'KLWLG
D'IKDLOILT(tPTLYTH~CRFS
052i ~047~0 d050S3 F.iIAPVFHF7CAARGLFSNYKYFLFLLRIFVILGL1J1YLFFKKKSIO'"I'CCI('ALVL(.CACA
'_tm ' ' _ ILISCGTLLLWKFYFPTKpFEKKR
tk robutc hollwlcxc pcesene in fFNVAD
CenebanK/ENBL ass .'i 11/7~aA hNVCDttFYCNIVDFt~FNYKf,MAFP
s~Nfv~::ATC~FDGTAF:sLFPFITRPRYNFKLALFVT1AIALVWIALtA".'TIAICLCIHPLC
~Fn O:s~. eI6700 X175'71 .:F'IFLTAICLYFISRYIC.HYARNVYIai.DWfDII.':YLODNR~HSFIF~DRbrqA-f Al.rl:ly Pacslsaaxe 'tn n'.4 0:15070 W 1v1'1s 'fR::I4EY1.R1'FFK1\fINRLLsLt.."VFDGFFWSY'/AFILII'ILGV:F:111I::RFFY)F'fIIPSQ
rmtnt::r IN411(tl,.ul Ns':::rnc F'~KLFF'I'f::\MICQERETKL1:VIIPLKVFFASHYXILCICIPl.ICIVTMta~:::Ct:ALPYRhiI
in Urnatkmk/F?1RL .u: ut 11/'7"tN ~
IN ' ' . .VIVAtIdJaYIVFI
'.y':'.VL::S'1.'FRDKCIADKYyFTtAKf:TL\ILA::Ir\LC:ALVAUtPLIKJ1FKTPW.
:RT!'. fIPM
'r'fE~fYKFNN::f:C~ N:(Fr:::t'JKY::F:V1'L:IKFRKLDRDfNf~
' ' ' . ILPFFMLLYt . .af~SLORIUKLCa :::CIVII:NI'VYI~\Ll.f'I'rAI.Y::VYfFLV'fIKtFrC::Y'/::::MJtJY.'JLG~sNFKPG:14\WVEK
NAL:L
'/riF::V(T0.~.L\I IIWNI.FKI.'YFM~LLFL'/FYAI
'/!.':'f:alF7M.:F"fHNIILN1'KFKt:\LyfDW:QPF'QFTFL".':LRVIF:1!MQ::TC:IfFNf'Vf:PfN
L'Iff.VKRFI1TLFIiLIt:IYF:.::AYYrp,iAL~FArYZYAITINOCISRMY:iaiIa~7Fiki1 :LIJIVLAr.3W5IGLENA:W'VFd(TLL:YF
D
'f ' TL
' ' IsarfA~ftlt.::'Cll;f::Tt.KI*::VWF/tt.'Kl'HEIr%PAK':Ff.FF::f'fFNRWKLPNEALOQTFNLP
:
~41~
I::IWAICMLI
:
..:AK
:
IJ::F
VI':YYr:AKFL'1X11'CAY.IYTLYr:(d.lL1'LF"'FI::()Nf (TF'F'YVn:Y'ITII::YFt If4VKFFt Itl.:::AF:YY::II.I'fI~L:IS\rtta'K::l'ELFNW6'YYItVALIJ1'fF?X.'LK.1\(E::ItM\tVALC
LF.
DFFNTtE .
' ALI.IN.'9/:7 i\1.t.1.t'1'Nl.4SF(LIKE'fIFPAFAA:LTET::L.iTE
' ' /It~liiLLVII.O
S
WIW:,rIVAFY:KIAL(.DAIiIfPAi.t 'r::'/'IF:VIIFit11.1Kfi:fF
7,; :I::F:F:
:H' ~t'.sY nlIH1'. '.Itlltr'.
;:Pn 051.: ~i ~luW 3 ~THI~I.: I,ynncnRtt':.t: n.,r..t:, rL:I-L2: P:bnsal!.x: Pr~cetn ROLNKILKWTHR:~
~:LVPG1KI
:I'atr~'IIVNTI~FC
1:
~
' .
faKORLTL~IER&fBKRIYNBP~YANiQ'0('~.I~~W~YQVBc~~DY'D1.'EW.CR.til.'~
.
SHI'LJiNAO~fAEYL56N~WA161KY61I1KI~fYiI~l~Ilil .;.: FVPD01'KASIl' :::::::LLF~
.
:.IFLLFNDN
LAptII4:NNL4:,WLKKRKNM'..:L:CD1GELLDEKKOR:.'WKKNLDGGIKhCAALVLIWKV
F f NN0 r.
EILI
'Fn 1539 .t8129 5:A511 051'.' :115Rn u32198 CPn ::T914 nyprYCn,c ir:.i: Prott:Yn _ 'F'IiKTKTLRDI'nIPRNNHKPNKTKCKRFRWLRyGbB famil/
.VLF,r'CFIATLL ..-..
~Ft:Kr -TKEI .. . -tK:AQHw .... .,.
~ -.
:
r-~
' ~
-' . _ , _ . ,, , t .. .. ...._...,s ;...........,.,,.:1Y:.::.Ya:.':!':.;CFi:fi . ~F
... ,....: r ; :::~ w"~;:::: w:.... .........,,..I
r :
4:
.t ~
.~.
....
...
.: _ ..-.........
'..:"
.
:
; '_ ;:
:
;
~
' ' . :.
, ..
. .~
, .:.
_ ,.
, . .
., m .
, ...;
..
... ._..
..
! ..... . . .!. ....... ........
..1.. ... ......v....-.. ..
t;,>f,~LKPNGY:.:HVnI:T:w.iitPKFV:KL;iALRt,~:.l~;Vr:M7L'i a":::~L': l:.iu~fivLi.'n Cpn_0539 518678 621545 FCCGOGVOCF~PT'JNCiCD
Pm0_I7-polymotphie malorans proeeln .~.fHLyCLR10~0(~OILWGFLFLSSFGQVSILRANDVLLPLSGINSGBDLELP1'S.RSSSPfKCPtc~0548 633231 a32191 'rJ'YSLRRDFIVCDFAGNSIHKPQAAFLi4LItGDLFFINSTPLAALTP104INLGAPG11GLFSeysJ-Sulfite ReducGass ' ' ' ' ' ;NVTFI~WtSLVLENNESWGGVLTTSGDLSFItBfI'SVLC~l4ISYGPGtCALLLOGRKSK:
E IKVGDALGVLPElIS
DSND
IS
KHYLpEKPKrWOVPLVLRELLSCSDSINDSDPIYRNVF
.
KEVSINVLOLLr"YSPTfLYNVKKTSEKVSAQKFIQGYVCL00fIPAKLtISFPPOKOPKI?L
ALFPRONRCTiLFLKNKAYF)pDESNPGIfCG7IVSSISPGSPITF'AONQEILFQENEGELGG' AIYNDOCJ1ITFPNNFQTTSPPSNKASFGCAVY.~aAYCNLYSOWf.~TLFTID~IAAAINCGAIHY
YDAIQEYRPQLPIELFACSVFPLLPAlYSI7ISSFDLMPKSIELLVKtRISYPGKYOKRFG
' ' ADYVIIIRDCIOGS'aVFEENSATAGCAIAVNJWCDINIWCPVREIFCJ&AI,GII~GiIIYYAATCfECKPLVIfICA
G
PCIAPYKAFLEERLFlBID
:SILRLHANOGDIEFCCMfVRSpFNSNINSTSNITFINAITIOGiIPREPSL4ANEDHAICFPGNNLLFPGERKEKVN
SRERDpKVYVOCL:.RI~DCVR
' 'fDPIISaTENYNSLYINHORLLF~1CGAVIFSCAALSPEHNXENKCtK'1'SIINOWRLCSGYLASLRKENRYWDVY
KAYEEGCPFF'JCGRINC~GZEVK11ALEEILGImI
.CAILAVRSPYQD~LLiItGPGSKLT?OCICQSD~ICIVZTM.CFNLO~IL.aSSDPAE
LSIFf . CRL0519 677662 633255 IRATEKASLEI:a~.VPRVYGHTESFYENNEYASKPY1TSZIIS11K1aNTAPSRPEKDIQNL
LLAESEYMCYGYQGSWEFSWgPNacKEKKTIIAS111'P7~CEPSLDPKR~SFIPTTLWSTFrsl0-510 Ribosaslal Procvin S.'.LNIASNI'ltdd4YLNNSEVIPLQNLCVP!OGWYpIN~IPKOSSNNLLViONAfit04VGARPOD90NOPWNDNS
LLAFLKKFK1CRLLRSKCCIIIOppKQK-RIAIJOCFDpGOLDRSTADIVE
iPFSTNlILSAALTOLPSSSSOQNVApKSNAQILIGIVSLNKSWQALSLiISSFSYTEDSQTAKRIGARWGPIPLPTIU
tEVYIYLRSPHVD1(KSREQFEIATFIKRLVDI:.DPTGIITIDiIL
'INKRVFPYKGTSAGSWHIiYGWSGSVGNSYAYPKCIRYLIQlI'PPVDLOYTKLVQNPPVEPGKllL7ILPAGVDIKI
IWI
'fDPRYPSSSFJSft4LSLPIGIALC~DtFICSRSSLFLQVSTSYIKDLRRVNPQSS11SLVI14N
Y'IWDIpGVPL:KAUiITLNSfIIIYIfIVCAIMGISSTORtr.SNLSANANAGLSLSFCPtL0550 ti35dB8 tusA-Blongacion Paetor C
tlJYG~'DRflf15N0EPDL9AIRNIGIFD1HIWIGKTrlTEHILPYAGRTHICIGEVH~GATFf CPn ' _ dIKINIIDTPQHVDFTI>:Y>DtSLRVLDGiIVAVIDAVS
20-polymorphie mtatnbsane Protein OWRl7IQCOEItG:TITSAATrVFwLL
' PmD
_ GVCPOSEIVWROADRYGVPRIAIVNKfE1R11CAD1fPMVESMCPxLCANAPPVNCPIOSCS
FIHLIYSSLIEFVNISDRFSSHXyK.PATAVFAAVLPALTAFCDPASVEI9TStnGSCDPT
SDAALTGF'Jtp55TETDLTTYTIVGDITFSTITNIWPVVTPD~AF1DSSSNSSKOGSSSSG7101''VONDLISQKAL
YFLDDTiGAIIWEEIfEISEDLKERCAFLRAMl.FZLiITIDEStIGM
~.
SLIRSSNL.NSDFDPTKDSVGDLYNLFPPSASNTLNPALLSSSSSOGSSSSSSSSSSGSJ1l9fVLEDPDSITEDEINp VIfRIOGVIEIiIIINPVLCGTAFKNKGVQpLLNVIVl0iLP8PLORG
SAWAADPIOOG71AFYSNTANfitL?F1TD5(.TIPGSLTLQNLKlIIGDGIU1IYSKGPLVITCLNIROINLKTDQEI
SLEPRROGPWII~1PIIIFIJ'DPYVGRITFIRIYSG'ILK100SAIIJIS?K
10ILTPfCNESQKSGGAAYTEGiILTTQAIVEAVT~1'SAGOGGAIYVKEATLPNAL.DSLDKKERISRLLEIWAFIER
TDRDEPTVGDIGACVGLKFSYfCDTICODt~pCIYLERI!lflOP
' KFEKNTSGOAGGCIYTFSTLTISNITKSZCPISNK)tSWAPAPEPTSPAPSSLINSiTIDI'IISCFOifJG.DIi.Rf RWI
Vlpl4iIILPK$KGDRBKLBDAI.SSLSLmPTPHNVSTHEE'1'CQ
STLQTRAASATPAYAPVAAV1'Pl'PISTQE'IAGNGGAIYAKOGISISTFImL.TFKSNSASREPKVEAMtGKPQVSY
KETI'lV9CNSCI'KYVKQSOCI~pYANVCLEIEPNEPGIt~IIWS
VDATLTVDSSTIGESGGAIFRADSIQIQpCPCZTLFSGNfANItSOGGIYIIVCQV1'LEDIAKIVOGYIPKEYIPAVI
IIGIEEGI.FtCCYiJYCYGLVDVKVSIVf'CSYHtVDSSF~111PKICGS
?JLKMlItiTCKGErw.AI'fTKKALTINNGAIL1TFSGZJTSTDNGG11IFASRiGITLSDLVEVANAVIGIiICRKA
KPVILEPIYDfVAVITPEDNLCiDVIGDI~IItRAGKII~0ES811014141MAEY
FSKNKIGtiYSAPITKA1~NTAPW5SS1TAA.SPAVPJWN1APVTNAApGGU.YSTECLTVPLS>QQ'GYTTSLASLTS
GMTSTI~PAFFAKVPOKIQEEIVIUI
S~vZTSILSFFIddECQIIQOGCAYYfKTPQCSDSNRLQFTSNIWIDEGGGLYCGi70lITLTNL
.
r'IfTLFOENSSEKI~GGLSLASGKSLTNTSLESPCLN7INl'AKFai00G11NVPE?tIVLTPI'Y636174 ti35d98 CPt1.0551 TPTPNEPAPVQQPVYGEALVT!GNIASKSCCGIYSIaIAAFSNLSSVTPOQNtSSH~EiGALLrs7-S7 Ribosooal Protein ?QKAADKTDCSF1YITNIINITNNI'A'1'~IAOGIfANFDRIDNLTVOSNDJ1R1000GYYLI1YNSRRHSAEIGtDZ
PGDPIYGSVILEXFINIM401GKKSVJIRIfIVYSAL~tPOKIC.ti.QY
E'DALZLDNITGSVSONIATESOGGIYAKDIOI4ALPGSFTITDMNETSLTPSII~.YGGVL>!T!!GEALHiAKPILE
VRSRRVGGiITYQVPIIEVJI3iRANCL.RIpWIIXNARBKPGIt~E
GIYSSGAVTLTNISCTFCITGNSVINTATSQDJ1DIQGGGIYATTSLStZtOCHI'PILd~B1VGLI1TELIDCTMtOG
ATZKKREDTtDtNAPaNKAPANYKW
SAATKKTSIZfOQIAGGAIFSAAVTIENNSQPI
IFLNNSAKSEATTJIIITAf~IDS00CAIA
ANbYI'LTNNPEITFKGHYAETGGAIGCZOLTNGSPPRRVSIADtxiSVIJQ4~1SALN11~CPIL0552 636698 636219 .
IYGCTZDISRTGJ1TPIGNSSKHOGSAICCST71LTLAPNSQLIFC~RM'L"1'1'ATfKASINrsl2-512 Ribososal Proetein NLCAAIYGMJETSDVtISLSAFHGSIFFKFM.G1'ATNKYCSIAGIiVKFTAIP~1SAGK11I5IQaOYVPSSSFJ~DC
PLPnGtALI.YZSNLWVItLKREEYFIPTINQLIRKRRK8SWH0IiPA
FYDJ1VNVSi'KCl'NAQELKLNEIUITSTG1'ILFStigLiiCd~lfIPOKVTPAlIfi4t.ILGK~LELQKCPQKRG
LS11VSFTQSPGTTITNGPGSVLSFRfSKFrIGCI11I1ZiVIIDFSEIVP'J'ImNA'NAPPTL%1.aGRVImLIaGV
RYHIVRCILDCAAV10iR1(OSRSRYGAKRPK
VSRTNRDSKDKIDITGTVTLLDPNGNLYQNSYIGEDPDI?LFNIDNSA~iIYI'ATNVTLQ
GNIGAKKGYLG'iWtd.DPNSSCSKIIL1(WfFOKYLRWPYIPROILtFYINSI9>GA~SLVTVCPeL-0553 KQGILGNI4ttaWtfEDPAFNNPWASAICSFLRKEVSRNSDSFTYHGAGYTMVDiUCPItQENo sobust t~moloq prwnt in GewbenklENBL
as of 11/7/98 FILGMFSpVPGHAESEYHLONYKNKGSGHSTQASLYAG14IFYFPAIRSRPILFQGVATYGCl6rRWLRFLIIFIt.CR
AYFPLRASffSPSWETSTCLT4'LGIPFIDIIL'1'lliIDFVAOCG
CYNptIDTTZYYPSIEEKNMAIiWDSIAWLFDLRFSVDLKEPOPHSTARLTFYTEAEYTRIALQIGTISSTNNAKIKEI
FLIYKEKPPE71SISTKRKEIWLSQSNLSDfGII~MU~!!YA
OEKFTf:.DYDPRSPSACSYGNLAIPTGFSVDGALAWREIILYNKV511J1YLPVILRF4iP1U1EGNIIFDCflVGPA
LKOPKDLRLVLRCPNQPDTLLYSpIFJIEtOCIETNTCLCNOGIfTIi.OCQL
TYEVLSTKEKGNVVNVLPTRNAARAEVSSQIYLGSYWfLYCTYTIDASl9TlLVpNIIFICtiIILYGDSIEKFLKETK
RIO~D4HTLVDLCDSOVVT1'FLGRFWSLI3iYVpYLFLSEDSAKILAG
RPVF
IPOL110J1TQLLSRTVPLLFIYTNDSIRIIEQGKESSFTYHpOLTEPILGILIGYZN~t EYCPNCAOSSLGET
Cpn_051 617137 628003 Solute binding protean t-yebL-SynechocyscisCPn_0554 637506 638111 Adhesin Hasologl NNRSSYQTAFVNHICVIVFIFLTLYSLKSYCNDVIDKPIiVLVSIAPYKFLVDDIAEE7rFVCT440 hypothetical protein YAIYTNHYDPtlTYIrS.PPQQIKELRQGOLSiFRIGFAFEKTCERNLTCOQVDf.SONVSLIQGVFSYLLLCIILVYY
RFlfIfEGKSRMASPTPGQLHLpQIfVESIQ1YDYSRSLANIATALLFfI
KPCCTiQIfITNYD'tffIWLSPKNLKVpVETlVTI'LSKKYPQHATLYQSNGEKrVALILSCLSLLPOVFLPFSCAYF
r . .r nQyf,IE
E:LTITSKAKQRHILVSNGAFCYFCRDYNPSQliTIEKSSHVEPSPKDVARVFRDZEOYKI
SVILLEYSCRRSSANL1DRFNMHTVNLDPYAFNVLVNL,KTIATTPSSLCPn_0555 638298 640241 cap-Tail-Specific Protease ~Pn 0542 628000 618737 NFVIRfICLVALCWLLSL.LPNVLPSSDLLREDCIKKhI~I(LIEYNVDAGEVSTDILSRSLS
ABC Transporter ATPaee SYIQSPDPHKSY LSNQEVAVFLQSPI~UtRLLIQJYKACNFAIYRNINQLINESILRAROW
FYTIRILAEGLAFRYC5ICGPNIIHDVSFSVYDGDFICIIGPNOGCKSTLTM.ILCLLTPTRNdVKNPKELVLFaSSYQ
ISKQPIpYISKSLDEYKORQRALt.LSYLSLYIt.IIOASSSRYEG
FCSLKTFPSHSAGKpTHSMIGWVPQNFSYDPCFPISVKDWLSGRLSOLSWHGKYKKKDFKEEOLAALCLRQIENHFNVY
CL;INDNGVAN~iDEFJIYOFHIRWKALANSLDANTAYPBK
EAVDHALDLIr'CL:aDHHHHCFAHLSGCQIQRVLWRALASYPEILILDEP1TNIDPDNQQRDEALAYIRIOLEKCNCG
IGVYLKEDIOGVVVREIIPGGPAiIN&CDIQiGDIIYRVDOImIE
ILSILKKINRTCTI(IIVTHDLHHTTNYFNKVFYMNKTLTSLADTSTLTDQFCCNPYKNDEHLSFRGVLDC:.RGGtIC
STYVLDINRGESDNTIALRREKILLE~IRVDVSYEPYCOGVIGK
F,~.CSPH
VTLNSFYEC~~VSSEVDLRRAIQCLKE1WLLCLVLDIRfN!'OGFLSCrAIKVSGGlInI~IC
VVWSRYADCTNKCYRTVSPKNFYDGPIJVILVSKSSASAAEIVAQTLCDYC1f11LVK~p t:Pn_0511 6JR7f0 e~?603 TYCKC1'IOHCTLT'CD.1SOOOCFKV7IAiICYYSPSfiKSTQC.QGVKSDILIPSLYAEDR(A3R
INeca1 Transport Proceanl FLENPLPADCCDNVLHDPLTDLOtOTRPWFQKYYLPNIQKQETLYIRQILpQL'l'IQIB~Itt.
K~IF?IIu.SLLRDSFPLLILLPTFLAAIw:A'.iVACCVFIC'I'YIWKRIVSISCSISHAILCCSENSNFQAFL.;v IKSSfKTDLSYCSNDIQLEf iINILKDMILL.QQCRK
LJLT W IQYKLHL.iFFPMYCAIVGAI FL11.CICKIHLKYpEREDSL
IANIWS'J~IAIG I I
FISRLPTFNCELINFLFGNIt.WVI'PSDLYSLCIFDLLVLGIWLCHTRPLALCFDERYTA~Pn~0555 e40921 LNHC.~VQLWYELLLVLTAITIVNLLYVIY..'TLLNLSMLVLPVAIJ1CRFSYKtn'RIItFISVLcrpA-ISkD.t wyafkane-Ractf prOCein ll1t11:.~.F':rC:ICIAYCLDFPVf:PTISLLHCLGYTASLCVKKRYNPSTPSPVSPEIHTNVENGNSSNLHFI:C
fCf(1AMPESVLNIVEEIM:CSVTACLQ/1IT.~.sl'CI1VNLLLCWAKT
N F iOP I RE.~sIfLFQ.~.RM'Q ITLLVIrILLWACLACMF
I FHSQLCANAYIiLI I PMIGLIK
~'tm o.44 eln5.tH .;_9525 LLVTSLCFDE,i:T.~.EKLNVFQKWAGSfLED0LD0.TLNN~NKIFr;rNKTEC2dI'.~.RA'1'1'pVL
ylurL-t:TH tricuJinu pruc.:tn ND(:RGTpVL'I'LV::KIMV
K.:;:VFY': x; t K >::FFt:LNKDKNV
t NFVI~' LTLELRAGKrX:Ia,'WAWRKfKYLPI!xPIfOGN
.:.:WY:::'lltf\TT::VP.:FG\'iRNIFF1.I;APfl:67::~vATtMRTf:R::CKDLIV3VP'l~l'LLRDr' .Pt~ IIS~~ ..d.'.H'!9 n.Atln4 :\F:ha:IlJIDI'IVIya:IlLLV::aa!:t:KrX:Kt:M'FFKT.'IRIMPTKATF'r:KPI:EIRGVELELKLryI
ICH~.!.ifei .y.:l.tn.:ktrh nMt I:\nua:r:FlNU:K::ft.l'N'CI
IIYfEVh't'..AYPFT'TLAP3LGLVLc:KURLYOKIMIIADIPEIt'M::Y.LIRItIITJI~t:P::YW
:r:FA:aX:IFJ1AVAF:LITKIVA::At:TKI'AfNItfPAKKVR
::th7:AIY-MKI:U:Id)F'LItIIII:Is'TLLI.i.FVIW::KREP1YSPEEDLLTLIItELIL:HOPDFEKLVHRNKrJfNF.~K
.aa:\FCnKEFYIt:EFf:Rt:VINF.AfjQE.:I:Yr:RLYSYNVNIA1'NVRV:Q:;
!.:I*LVAIJIF;IDIH.1.I'1'F.uI:L~'Lvcil'QNRF'p.':'f'PFVLI::ri:fY;t):VIa:LYRFFTr)R
WV'/Pb'YAY'1N:::C'ff Ilal 1U:KKV.'/fAIVI'lYp)LPr:EAEI1I::.~.UhETTI'1'::I1:KLVWF.IGtI.
r:N:DK<:Y ITLWI.'KILK FI a t'F"PMTVr:M'.
PFLR::Y'f Kl'.f:pPA IC IKytl:ffY.'N.'LHt:fM' n.4'. .. ulInN n au.:f a 'IKIEYVtfh':::iIANM.'11.7Wf'Vfil:'I::IIA:X:QHVI::FNLC.DNRfr:DKKVFTVF.FY:PQIrIN:
~
n iJl'fIJVA9"Il'1'.1;I:IIM.':'..WL-PI-I/IIF.f?HJVtII:Y.IIOW::YVCKF'V6'Y::f::V::NI'tlUf:/Lll f.'/ L.:f n nl.c:uwwt Im.w.:ttr 'I'I?r\HI:FHVYtnIIYYrxa::V:l<FY:RD::Y;:KKL::%KVt:N:rJYV::T:::If.VIa~N'fFYItIPAp NIK:OWIIH/TLl'::1.-IYLYr\IW
:I:It?:IIY'IIWHIYFY4t:W:FTLpFNLWKAi,N17ltF'1'YY~VAV
!n a<IAII'LI~ ALVI t :I \'4TINN'1'tllt'1'Y'P::E:~N't: t'r."T:t: AI.TPfIRnHb I::V\'ft7UL .I.AA'fllW'1L171'tlDt' f CV~!7!?7fV'!R
111.?NMH:AFl'lltM:
1.11 *Y::YEL<,rF I A:::::f"rN. PP
I: r xrrrn~ f~Al.lY.I l::fKE::VEF
:VTI Jtt: I Als xtAM:FA 1 vLNSNIfrKStrn~.AY~;Df.Fr.P'. .suz'rFL.LL:.r~::LFF4-::F
A:.:F:-A.:..wc-Y
~.:::a-r:r: ;w::nTENnr~f EYSBlIJIKAKNHYPLa"CF.SAf :3FLFLrIGiFG.'.IRW*.'.Ltt:FFDA:.Pi.::.:.~
JWIIIIWStF
RVIUCS'!'ICALOL~1.SIT.'YVLIRIIIII~ffVLI.~EPYC~~ill~~.
'~"I~.A
Nn 11.'.N w4 f t4w h13031 DIICYFf'GKAT:NKKIAf0I5PNKTVOCFYAGCGCATLISFTFFtfOSPTRFAS1'lfIIMI
uIM:A-'lk0.m'Ync:me-Rf:h LiPOProtetnLIPLCLALG:~iFFGOIIE:a,:FKRDANLKNSNKLKAVOCNLDTLDiLI.:.STPIAYLFL:.:
~
RIVDCCFEOPCAPSSCNPCEVIRKKERSC~TIACCSY
~
' FCCV'!
:
' :
C
. TOSKEFIG
.
.
.
.
.
.
.
KitIKKAVLIA.W
' ~N:~ SPpVKr.,CTSPOf:RCKO
'IPSC:,Np:f::1 r~ nr,.;.t : ~ !7qn f.41't?7 F'Pn ~S6q w5'1905 riSRlA1 :.'.\ ;'1. .. : . 1..' .. , s\.
. i... ; ,1 ' r :
~ -' I
, .
. ,,."...,.. .,..; Sr~,Iw.-4~4'KF:.
. . . I'.591M:..- , ~.,~:\:.v.;s~.Vr-r:
Si:l ,. ... .
....,. . . ,i':-r: :'.?IF::\11.::.::
rh :~:
\
:-:' ~
a:
''.
::
. EEPPFSfTFA'1'r','JPLE.iFF:7GHL:.TS:..' ,. :':GTEVANA/LStL~tiLPEVRAFNDDIQRRYAOL
..........,...........:.
.
.
.
.e:u:e:r~
.
:r,:;
:
FFLARSVFlIrCYNTNL CML111CGRDlGSKVFPNADLlC:
TLT3SPE1IMORRLKDLPtExr rCGSPOpLQAELVKRDRAD
CPn_0560 b15666 611098 AQRAHDPLVIPLCtCIVIDSSOLTIROVLEKILIt.LFRNEL
4ltX-Glutatnyl-(RNA Synthetase RNSRtQfX9CSI~ISKDKRiNMrFJJVRVRVAPSPIGDPtIW'1'AYMALFNLIFAKR1KG10fILCPtI..'0569 6519)98 659099 RItDTDRTRSRODYEaIIFSALRWCGIWDEGPDVCGPYGPY1~SERT7CIY0GYVf1'LLKPIsC-Glycerol-3-P
A..-Ylcransferase 'fDCAYICCFA?P0EIJ1El0tAVASTLC7fROGYdIRYRYLSPECVASREAk~QPYTIRLltVPLLFGI~IKTSSG1H
FSFfISKRANIFRICKFl7wVAFSLFYKLKVYCHDa1tI10GPAIIAV
SCE<1ZEOYSKGRWFPWADYDDOVLVKSDGtPTYNtANIIIDDIILJICITNVGPGEEWISSNNNSFLDPIALlOICVN
ECIHLARASLFfIIPWWICQ41CCFPVRQDI~tSAAFKIIISRi.FN
TPKNLLLYEAF~TIIEPPVFLIDIPLLii~POCTKLSKRIWPTSIFYYRDSGYVKEAtYNtLTLKRIOC.VIYPDGAPS
POCQLOPGi(VGICI~tAAKSRWIIPVYIRCTPEAFMINQKIPNVWK
HCY~EEVYSLERIIETNPRRIGKSCAVFDIOKLOYI~iKtiYII~NEGSP~Li.ICa.OTITCZtFGTPMf!'DDIION
PEIKNKkTYQIITNpI?IIKIAELKAWYtSDCI~rDVP
GWLWDEFFLKILPLCOSRIZ?GIILFINLTSFF!&GLLEYRVCELLPQAISPOGAILLY
. 659011 6607e9 SYVKYLEKTDOWI'KEf~S'LGSRWLAOAFNVNNKKAIIPLLYVAITGKXpCLPLFD6ItILCPn_05 :~KPRARAALVYALKLt~CGVPKKIJ1J11YDKFNDR~'CGT1DLsrQS-ArQinyl CRNA TransEerase ' TKLPSSKfIGt4JRGAlftTSPKLMSTLLSIGSVICSQAIAKAFPNLF~WAPEri'PSTKnIIG
NYQQIDAIOtWtVLIUtAPRAIAtAIY7IE:.POEPFSLIEIAGAGIrifFTFSPVFtI~.EH
CPn _ PKW1LKIGtpV80PKKIIIDFSSPNIAItDIONGNLRSTIICDSLAItITSYVGNWLRiJttt euo-CHLPS tuo Protein CH7ICtQttDCGYELEtREEIEDIKDSDtKWV5IT0AAXLWVfRQAIYVAIKOKKLKASKEIGONOTATC14.ITYLOE
NPCDYSDLEDLTSLYKKAYVCFINDEEtKKRS00MIVAI4AK0 TRWEIDIKDGEYKIOiRYSRXKSLYOGELVFONrKDCYSINQVAOIIGIPVOKVYYATRTPpIIIAIW1JCIClTSEKA
FOKIYDILDIWOQIGP$FYNPFLPEIIEDLOfIIfiLLTVS~A
GTIRGERKG7IAwVINVSEItR7fKNEYLSKOAAKKLKGAEPKFJ~APNfEPPTEIFPLSNKCVPNEAPSIPFNVOKSO
GGYNYATTDLAAIQtYRIEEDNAIXCIIIVC~GOSLiitnLGG
TAIAPGYLOPGItSNVGFGLVLDPOGKKLKTRSGEMI1G.RFi.LCfAIEKAretr ssItRPE
LTDEAIQERAPVIGINAItfYSOLSSNRTSDYVFSFCIflS.RFEGNtAMFLLYAYVRIOGIK
CPn _ RRIITISOLSL>a:PPEIOtPAEELLRLTLLRTPL1L6STIKELCPHtLTDYLYNLTNKFND
CHLPS 13 kDa Protein homoloq_1 NYKVINSI11IJ1RLD7fAAILDtOtPKPSIANFSSEOARTSNE1GWANPYLYRLLEIIWGYVKFIRDSNIOOSPYJII
tSRLFLCAI~1E0VLATGNHLLCLKTLOtL
FLLGLIFFIPLGLFWVL.QKICONFILIG~.
TIFRPICRDSNii.RtNIYAARLFSAStOWt VSSVRRVCLOYDEYPIDC'h.ELRLPNAKPDRWNLI~BDCLEYRTVI4GA~fItRIAECPtL0571 661179 ESQSIJILIFNYPGVMttSpGNITRIaNVKSYQACVRYLRDEPACP0AR0IVJ1YGY5L~ASffslu:A-ODP-N-Aeet:ylplueos~nine Transterase QAFaISKEIA0G5D5VRWFVV1CDRGARSICAVAKOFIGSIGVWL11NLTNMNINSEKASImTIDtVNVSFSDFOiIKC
ERRNQI11QVFCCGRLNCEVKV,9CAIDIMTKIi.YJ~LLROpKLTL
LNCPELFI7CGtmSOGNLIGOCLFKICEZ'CFAAPFLDPKNLEECSG1DCIPVAQ1CL.RNDttILRNVPDICDVSLTV
Et.CKSLGtIiIVSwOKETEVLEIY1'PEIQLTRVPPTPSNVtIRIPILLIG
ALiGiICPIa'.rVYVPNODW1IGFRTIi1tt11tGLKOIGIfDISSDSSGYYAKAPRGLItQNIfIN
LPYP81ICi1TEtd.ILIIAINAIiGRTVIKNVALFJ1EII~LVLFIrDXAGAOITTDNDIIlIDIfC
CPn 1'OGtCwSVOfITILIDKIEiIASPGIIAAWla~GGRVPYRNAKpELLIPFLRQ.RSIC001LVSE
_ 9DIEFtQERPLVGLWLLTOVNPGFL'IOWOQPPAVLLSOApGSSVINElVN~ILOYLIIG
reW-ssONA Dwnuelease OYKNLWDFSPKCPCGIKFNTNSCNASAAGLLW71NPKEDPAFILalIIIKFHLPPIYAOIFiAIOGiIECOLFHOCLS1 'KACRYAIGNFPNSAVIHGATPLWASNLVIPOLRIIGtAYVIIML
ISACFOTIOEIHKFLYSHLSSLYDPGLFLCIISKIIYFRLLLARDRIttNVItIYCDSOV0~f1'IAmODSIIENTHLL
DRGYTIKIVDKLRSLGrIICIQIP'DlIEpEELITSPKSLALRMiL
GVALLVEFLRDIDVHVSYFFLGAILRQHCITSTLIAIG.KLEJCITLLI'fVDOCITAWCNS
OITPQCIDVZITDIOMPTG1CIPHCV11TLNPKLRDtfIYPNRILTWCItlU9Q.ARGVti~tItbICPt7-0571 SRNLVPKSOCSLKICIi.DLVTLCfITDSRrIfLiGEtittVNVAYGIKEIARGiIRPGLDMt.CALCCTIS6 hypothetical 0rotein CVCKSEVTSTDTVLKIAPKLNSLCALOOPA10GVELL.LTOODCRVD11LL~TfINRERORIMAAPINO~ITO'f~Cl~
'~SLGEHSVT!'fGSCAAAprI'~11'V'iL.IAdIDpE
IGEVFODVOtII~tSNPEILtOMIVLSSTAWNARVIPIISARLattTYtiKWVIIAIQRGIAS~GSAVSPSACNSfSTL
PPETGSLGATJYpSApSAGLISLSGRTORaObEIfSfi0D8 IGKGSARTICSFPLLrGYLKKCSSLLLSYGGNDtAM.llilatiDIM>mtICKKFVHLVNfSLKSISRT8SNASSf'.E
l'SRA68SPDtrcDLDSLSGSERAEWEGPtDP'GGLPLSIIPNYDII'01f )CGDTLPtQ.EIDAYADFDAIOYDr ~n tEPtGIOf~EItPIFYSINROVRYPICVLP'~iNLASIIJ1PLIOIPAVOOR~O1'K~iHIVYVDE'J1R&SFIIIIRN
GOWSTAFSIXYBNitkTK14QT1C
KLYLSQKERNLEGYAFGLGRNADALKA.SWNYPLEIAYTPRLSOTSCSCVItiLLVRDIRISPADLDICIAKFCVCYET
INSOifI'GRVKPTI~ERSG711~IYtptIJG.SNI~lt1'AWYORIIA
SEPRPSD
KESS>iGYTPSAWR110A1fYCICPIWImVCGLXGIIXiKITPAPDFSFINLTP00GRNliblfl' CPQlWGATWPNVNIIIrtGGIKVDI~iIHt.CGITTMrTI'F.~DDD'fNITSI7tST81001NS
CPO, ISS1GEOSTIEED'!'IOtDDPGOGFDDNAIPCTNCPPPPf'P' 0561 651759 650115 PPNLSSSRLZTI~N~1I
_ t.~iVlYOtdXtAYDSNG~SISDLNQOLCQVtrIGtStNDVNPPIVILPttiTGD1'DPb00AtGG
seeDiseeP-Protein Export Proteins SeeD/SecF (fusion) SGAMWKVKRNFAIIICVPAIALYYVLPTCLYYAKPLDRKIDGNBAEHIIKSFTIWAppVVTEOOGHIIINIIORNTOSI
GOSLGATPTPOPTIJUCIVTSLPI(ANVSSSSVLPQPQVATII
R!(OVIPRVSAILSSIJILRGNI00HPAIPDIVSVR1KRGEDAEDFICNLVIIDEPNVPIKSATPOARTAST51TSIG' ICI'EStS'ITSTGTC'LICSVSTOSI~ICfPT'1'1'fRSCCrSATIITSS
RLNVYCYSREHDDNVIOVASSINISLVESDFSFVSYSSIQ~t~fll7ISSILORVYSACTiPKAS'IOTPQAPLPSCfR
HVATISLVRNAAGRSIVIQpGGRSQSPPIPPSCCC10t~11Gi1QtJlA
OKOCSCSYPSIWETAPKLOL:QYAIDiLSSGFEVFSSRLSAlCOOSFSSNQORtJIFLSRLSMbOVASIL.GQVVNQ~l SLSNDA71IDVEDOKLLKSVYLTLSpTIICIRSLOCPYIEGLRLDCSE$SL11SSIIYCPKE
RKIFLTLHSDLLAORTSISKEORLDFD.SRLAVEKptLSKNLTWVEDYlrc1C181pW1~CPn'DS77 665117 TQCKIILOGERLLOCIAENLTALTLHRP71AESCDLIPEN1PVFCAQPRESiAFr3CYIFSPYe6C fauilY
NTOCKHFSKGSVYILGKGLRSIVAKYOpCCGKB.OSFGONLYNCFSHTFJII~EVEdIACfISKWAttI'KHRKERADH
KKGKIFSRIIKELISAV1G.OCADPKSNARLIWVIpKAK
OpvLEIRHPLpQFLDVwGECFVICXLOCAFLEVKDIODRLIiTVNOItKNROSDLVRNNLQCiNIPNG~tIER(iZlvt A?SALOKNFE6VPYELYGftaGVCIIVFJIIffONIQiIlTASOIGIIAIN
YRHAKCSMDLQERLSAPIPYONLFLLNNKI.fA~fRKISI~HiILRLGIDFVOGROLLLSFKDIOIOGSLVEPCSVLYN
PARKGACTV11KSSIDEEYIFSYAIEAGAtDLCI'EDEEIJFLVICAP
HOCKOLTDKEDILKVSDES.CARLNKLGVSEILPRDGDYIHLSVPGSSTISSSEILGTSKSLL6SVICLKLISpGATCS
EDRLIYLPLRLVDCDEKDGtAMALIDWLEOIEDVDOVYtC~IN
tiSItIVVNERPSSYS716RYEVDAFLDYiJVlt1'SDApGKTBPttIN1'111SALFNEEVDVPPSVS
HEAITKLKSEGt.u'SPSGCETPSTDLD1TFSNIAIGKOALOKANPLVIVFRNYALDGASL
KDLRPEFAAGOGYS2.NFSV%DTSPKKttAEKLSPttStIfllvfSAYCOi7GISCfANGOYS1WCPn_0571.
aGWRNAWIDCYNVSSPILNVPLKNttASVSGKFTNREVSKt.ASDLKSCANSFVPEVLSEENo robust holnoloQ Dresenc in Genebank/ElOIL
as of 11/7198 TISSDLGKKpCI'OCIISACCCLAMLIVI19SVYYR1CGVIASCAVLWLLLIWAAirpYLDASAGGIRNPIVNVCIYLN
NFORYLSKYLYRVFRPPCRKKTFLSSHRVLARPSFPVDYCPG
PLTLSGL1GIVLANGNAWANVLVFERIAEEFLISOSLKKSVEKGY'fKJIFGAIFDSNL'1'1'KIYDLQETYEELiI1 14LFOGALRLOICWFCRKJ1TRKGKSVVLGLFHENflDLIRINRSI~RQ
'JLASALLPPLDTGPIKCFALTLILGIFSSNPTALETfIICFFFMLWl9rK'IOHTOLNNNMKFVEIPRPfNEYLVYHf HVNSVVPREYSLSCRSIFI~KItFKEYEORFPLYWU1VAWEfDINAYL
:IKHDFLRGCKKWAVSCSVFLL.CC:fALGFCA4MSYtGNDt'!(GGYAPfINPKEHCISDVALRCYXIfRVOCCYCRA
OMRCKVVHKLQEAGLSSRDFRIOTFCSSEKIKIYFSDKALSYTKADTSLSPKINDIInr:..r AVGLLSE1'GLDFSI'ETLNE1'ONFWSKVSSKL&KIWFYOATIGLLGALAIILLYVSLRFEWCPn_0575 oooSZ4 56598?
'fAFSAVCALIHDLLATCAVLFIAHPFLKKIOIDLpAIGALJfIYIOYSLNNfLIIPDRIRYhhY-Nnino Jroup Acetyl Transferase FDROANLFTPMFfVLVNOALOKTFSATVM"fATTLSVLIlILLFIGGSSVFNFAFItftIGILSIFGRVWRSFHTatIC
ONTGILGLEIRYTLPSDATYMLKWfIJDPKILACFPIOTEALIRCT
U.TISSLYIAPPLLLFMIRKENRSK .
VNPYNCPYRYHSSLTAV1'WuNVA4IfATLVWFYVKVSNNALISIIVGEEFRNKGIGTJ1LI.
NNLIHLAK1'RFKLEVLYLEVYLGNPALHLYORFGFVEVGRONRFYKDEICYt.AK'1'ITtEKD
CPn_0565 655741 ti51531 L
r."r94.r hypOthetiCal protein NKLFCFLIFC;FVNISAILFDSSFLLKIKRHSKRM.RSttKFPRISISDLIPfQMVIWw~GCPn_0576 X67513 5b6491 ~NVNYVtrNAOMLPKKILGGVLACFCLALLCCMFAAGVCOTIFPCiCt~IILGLVLLGFAY.prtB-PCPCida t'.hain Release Factor 2 Inacural UGA tranle-shift 1 t.QYSKn.
iliRPERPLFRETKVFEKPINWIL:CLSLLQSWKKIRPGCYYMPGCPOVEICDGSOMpCB.DKRLE.1LRTEISLWRSL
EIVfKtFOKK~DRtPfSIFLt0EMD0IALROCIEKSF.LSRKTFALDPSWSSLLStIOREE
rJ~YLGPKVI~k:SEDOA:iDRTHPK,iAIYVNISDJ1A(tEPQCRCYIDAYTKAFF'CVLDOIGDCPn 057ri.I
ne7S!:1 IMIVKKIrrtYVLTPILGVPDALPKELOENLKLGSOAAFLYSAEOVAKRNREEKODSIRIKPrtBIn.tcur.ll UCA f:.ameshitt:
F I FTDi'T:: fT::L'f F:ii'IOt: :".'rI'H.M'PI:iLSCFVGEOE,SYTFAMUEHt.DKRLF 14RTE
I:iLlIR:::-.'ftl 4S.h W hntr r:Sf.9911 CM IIS'I7 ~ ~..:Hrn, 1,l.Nl'.S
~.1.:: t.,mi 1V :.WIO 1'fM7.11 .axnplrx Pcrrn.in I:IrIYCfAl.Itlld'VDI::11'rNNAE:Y.FP::LORLPNHVAIINDCNRRWYKKIIREECGHTHT:E:a'N.'iV
KtIKN::AFiOIrVMi:."lGIl.VLV':Y.t:I~MrRTEIVYKVWF:1'IKKIItK:OrQKNKRNIL
alYYr:AY.Vt.fYItINAVtA)f.GIKVLTLYTP.:TENfT:f.PKEEIOEIFNIFYTOLDKOLPYLMfGANIaKVFY
L:'Df'IDMF~YdfYLL:;RIIIYK
r7lK U: LI!r' I::Irt.::Kl.rKl :IQTK
IMIV::RMTA:iF:iRLELVLAVNYrX:KDELVfiAFKKLIiVO~:r ' mh~
~157A '(agy,..H
Ct II.tIKY.I::::nlri::F .
_~t.l::::YI.Or:r:LTOPDLLIfeTr7:EfIRV::NFLf.WOtAYTELIITDTLW. .
I'IN-f1,rld.fl:ItHVYIrhIt::RIIIi:K , n y.mr1 ptln:a.htnylcttl.t;:.
TTtk:YIVLt:;I:aATLPIL\F'.:4tA:a~
II:fTMI.frPfAIfWRI.PKK11A1tUk:LHIAttI::WJI
"Ityr.r./ ..'.nN14 e.S7Nl~l VIIKRVPPXFWKV::K:II:NP::fIU.fVFr~:fN.lf:IrARt.EUKERLA'rFtlffL.filrItYFAII.
..,p:A 1In.::l.ll.ll i.l.ll . 1'yt r.YOIIYY:I::YI::RN'fK(:F:rft'IIf:F:Y::I!I'tulAI
t.lylyl r.ln:a,.r.,;,. IA
VhY~t:LF:::Sr::1'ItYOttR.T1<al:(YIMII.
LKLLKNCT'LTLi.NNfiHVIPNTWIVGIIGDLP.1R1.._dEOAFKNYDPSLPr'.LLrLSHNPDf'':.FLYriDiR
I~FACP'ILFFFi. .3:SFi.tli;L:-~Vltld:..;E:u?.'Ff".'IIRf".:F:KSYP~i ' :TRitwYf'r:DFyL"~'HSHGPOVTLWIPK/MKFFERLSGLC~IPYLARCIFJTXII7GKDLYV.rCIOIC!!
NHEIDAORKKRYEFIIL1GEFPKLTW'IYtt::iFrfILRAK.~.Rt:VVISLYAWFiCS
' NRC<LX;LKR IRFCSPPEICYLl'C3Y0 C,P~. I
DFRaMNKGSTLT'"rGKLRI~C~II .[dtr ' ' "
YfiNDLVCFSEVIL'SFHV9~E(7GTLTFS
CPn_057'r ri69110 569993 CPc: 0591 5901ed 6:tt030 yqDP)yteNiudar NuclaDtide Phosphocylase KEPJI:1PLLK'w\T:rNVPHIKS3LiLL.iOCOCTRFr'~SKlPKDYLPtIJCI'FL'fLHSLK-aLSSYatiE
family ' .'.POfAEV~FI~.DP.~.YOETPpEYPVSPAIPGERR0tI5VFSCLCOV.~YPN'/SIHDGARPFIY~:.IHRCTAIC
TVA'rtIIJNLfIILLKPRYFTtL:iREri UHi.D,?."DASNDLAIFPPFGYAV
..
.
ia'1 :r :'.~:If:'!'." y ..,(yeT:.:'nLYlir\lil:..
_ ._:!r~n::.:. A
I 4 :.':d.~...F,rrt,r,.:~ .. . -..~...~ya:
. :
..
i -:.....v ...'.:.i:::~:F'.1 .;,v: . w hTiKilC::: :.:'i i C.'.:.:'::~.....-. y. .~.
.:.'i~:i:.:. .. !. ... , .,. . .. .,. ,.....\...v;, ..
' cAR 1NV L L':
HSDILDEPDFFiiWIPCfQAIYRL'W.iL..:
I
CPI>_0580 669936 e70793 059Z 66113= dd1161 CPn ccuA-Pseudouridylate Synchase ! ."
ASSiI~IPLPRRSNDCFSPPKtKVALLIAYDCtAY~W000PN~SIQ1YIE8SLI0fITKTYip taailY
' RTPLIASvRTDALIfNA:lGOV7111FRAPOMPL!'JWANL.TKKALtIAILPKDIVlRDVALFDDNLYSKNFSIISFK
RFLpOIPVItICi.:..
IYLYpWLISPLIrCSCCRFFPSCSIiYAE0ALK8NGF
' FNARYWIAXEYRYSLSRLIIKPLPNORiIp'L'YTP1WP1'STLt1~100~6LIGTNDFABFANUIOfIrLSIKRIGKC
,r.PNHPGCIDHVPK
"ALQLYLEFYQEID~DSSHFSE
iICRDYNSTVRTIYTLDIVD~.SI ICRGNGFLYKNVKNLVC7It.LDNG10CRYPP~G.LD
ILDOIOJRRtxpSAApAYGLSLHHVCYggPYlIIFGCEpCgVSTSNECCPn_0593 fi8119i 581391 CT171 hypothetical protein CPn_0581 671533 670715 VLC'dtKCNAFKRKTRNL.t~QVLIL5VCL4l4.FLLLFYSAlFRImIYKLHLFSCPLIAKBSItIt PMsphoqlycolace Phosphacase VYGSISOASLODLISLPKDEItYMYGRPIKL41ALSSfAlASNHIDITPVL~fPLTY
EDLRNRSVKSFLROL10IYSI~GNSDEFDLCLRSCHYLEDYDVFFFDLDGLLVDTEPCFYRJ1tELKCSSVPWLLIrtI
IDLKDFFVILDYLRCNIfYPYTSIpGLFLLIKHYpE~IiVDEpCLYNF
FLpACAEFSLEV1MDFSTYYSHTnG'lEIFSKKFIFpYPWIQEYNAEIFAIUILpIYYKSLCSTPEFGYLRTLLVC7l0 90A5SVASLARNVIRCCSERFFNFCNEESRTSNISA1'O~KYL
r:NAGPAL'IPCVEAFIELVLSWKTFCVVTNSPRDATfITLATllYPIIiIKFLFNVTRWYARKSYI~CEESLAALLLi .VNDSGYVLttEFCDEDLEKVlRLNPQSpYSONP'1SRL~ISPRIIE
PKPYGDSYDYAYRTFAREGMIVIGFLDSVI(GLRALSKIPATLVCIN9171EI?FLDYPELKLAOISCQRVGPRVpEDO
DEEWVODGDSLWLIr110iFGIPHDKIIGKNGWiNRLFPQIV
GKE!'!'SYPSiDVLTEfi<517QKLL LKLPAKQS
CPn, CPe1,.0591 682517 681958 _ phtYl'-phalylalalfyl tRNA Synthetase CT165 hypothetical protein Beta KNPNALLKKlONRLV100iDKMfVLYLOAMiIiJQKRIIR10iPINI'YHSSNi'1'ETRRLPTYYKNTCNY1'CVIVK
SLVKTSLRLSSNRIPITI.LOTYPSEPLSTICEILP)1CDNIGIGEIITfit SNIVLIO.IILRIS'IVSLLTSCSFSKNSATCPIrfPERITSOKDCPVLLNPK'.FITISPPLYDWLYSFASVITAKIL
NTIFlIPN7IDKLAVATLTDaGIF)~BHIKCApNCEAGLIVAtuF'GI1KL
ISPNREVITAYSFYCRGpGNSIITPECVLYDCDGLIN8ITKLEFRYINPRLIB:VVRLLGQPDBP~AYTI
I~RALLELPGTPILiEDLiITVLC
OHPKVSIIGFCCPKHFHFLE71SGISLSDtJit.OCtA71TF71LDFPLPNE%I.LiiTIKKLYIDINtSLEISLTPNL
01~71SPLGLiIRBICtM'QANLVIPKtFSFENLFTfAt.p~DPDICFF
NSDPSLSNEIVTGTLTNPELRLTGOGSHTEITVCILDhZGOtBIEALSSAFSYWITCIS110PSPIKLOt$LOALKOKP
INJIIVDlTNYIH~LSLGQPLH71YDASNVAt.DS
LItVOt~'pESLTLIJKiElVLLPSGVWVRDONS
~PI>_0583 67239 672717 AYFLPEALRA9t7KLLPIPSESAYRFTRCIDP~WPALpJIdiIfYlLEIFPGTISPIY88 CTt66 hypothetical protein CEICRBLIfEVAtJtPKTLORILGRSF'SIEILSOKLOSiGFSTTFpCtSLLVKVPBYRItaIN
IVLSFFIGK'1'KV'1'pRFiIOJERTLLLLWKIQOGLFLAILDLTQTFSSLT3PELEKYLKOKKEEIDGVEEICRT85 1:MIE'IDNWSCYTPIYKLKR1TADPLAN71GL.QEFFTPDLLDP61YA
IFLSCIDRVDLOIREPWNAFSSELPpDIGFELEEIADIfIIRILOTDK11NYA0KKXEFGIYLTRI~KEtISLOG810t rtVLRSSLLPCLL.ItS7NITNt.NRQAPSVpAFEItS1WA10~8Q'!0 ERP
!1'OTG71ILLTEDCEBRSNLPKPSLSFYSLKDiiVAILLYNNNLSIDALTL6SS11ICEfllllf QOCVLRIIDCOSFATLDOVNPEL7UOtAQIKHPVPFAELNLDIi.CKM,KK1TK<.YKB'YAIYP
CPn_0584 677659 673798 SSTR~.TLTVPEDIPANLLI~IfLLHECSKSiLCSITIISIYQDKSLETRNIOiVSIJILV/p0 acoS/DCrB-Z-Caaponenc Sensor YERTL&NDDIEEEYCRLVALWLLLTDf!(<.TINS
IRINJITlOIRKKRNLVFTPIV?OSKNLtIPPAYFZLEIKARI1'OSYKDISAILTAIPDGILLL
SITONFLIGNSOARLILGIDFiJi.EIGMtSPI'Wi.pD'ICiJGFSI0GL6SLINPRTLIIiSLCPIL.0595 CKESKFKEYELFIRIfNL;SGYLFIpIRDRBDYlI0t.~4RTERYIDtIJId3GKtt1'A2LlltltTRCT176 hypothecleal psottin NPLSGIVGFASILIOtEISSPRHQPIIZ.SSIISCfRSLt~LVSSlILEYTRSOPIiiLKIIt~>z.QROYpIIBCOLL
FCVCYFANSCSAYASPRRODPSVIOQTFRNNYGIIV9001~KIKTBDG'tI
DFFSSLIPLLSVSFPNCKlYRE(i7lpPLfRSIDPDR!1.RVVWhR.VIOAAVE'ICNSFI1'LTLNTKVLKNGJ1TWE
YY9GGLLIIGtITLTFPIrI'L'ALDWOIYDpGfILVSRRTFFHGLPB~E
TSGDISViNPCTIPSEIlIaRLPTPFFTi'KREONDi4taF)IpKIIRV10GDI0LKT8DSAYLPNF~CIFVLTRNPOl ~BIDSD'i'I11CPYFIE'lTIIQCNVIEGSYTSPNCK7fSS8IN1~8DYR8 SFFIIIPELLAALPKF31AAS VF$SltlI PESZTHYpItCpPHGLItLTYLpOCIPNfIEE
1RIYGi~ODf.TTIVl10a3CKTSEIAYV10fr1IKEGLELRYNGOEIVAECVSNItl87FU8iE111fIY
CPi1_0i85 67518D 673865 AGDIOKNDiYYRORSVBGI~FJtiXi7UlG
siailaricy co Cps Iucl~,Z
ISLRRKILRPIBtPSlGDCS&M71TPADKSFT!'ppPSFVREIGSIiBiFVFSPLTLLEIEGD(ACPe1_059ti IARVpDO~IliItTIVRVSLIILiILLTIIGGCLLVCLLPAVINFICDCLIAtGAVIF11LALIada-ssthyltransttrase LC1.YDSpCLPEELPPVPEPppIQIEDGRNETREVLEC1'LLEVLLKDRDAKDPAVPWWDFAtMiIDCfLIPKIJ00I8 4S0ACSECLLIAKYPPLAVIVHTDNNLWIC1'NLSVAPV18CLE
CEKRIGlE.DRKLRREEEILYRST
VADRLEITtRASYflIFVIGPIWiKANpEIWdCSRYAGMEtIPPFSSHFAKDLIPSQYLEIiI4CVAtIPPCEpQTYAE
DGINTVPSE6GEKEISALADLISLpppTVpOt.RSRID~OKRCwtAi.~IIHOSpKaIORAIAKICI'D'1'IIFRTVG
rIaCKG4IPfLLFFPCHRVHf;SHGEI1NYVI~rPVINEILLK!'D~LSY
N~tP~ISORACEG'1'EI4DCAEAGOLEKDLRAOLKSIIOESiItI~G1'INOOEKAWRRQItI~KLER
LOED4RLTGIAFDEOSLFYREYXE&YLSDK4DND1fIL0EVN718KSGNCLESLYHDYEKQCPeL0597 681215 LEQKDMIWKAAAVNEEELGKpppB~fEpTpEIRRLSTTILEYODSLRGFJMJtDFQELoppC-OliOOpapcide Pecmease pQAYSRLpfEKDVKEIflLEESNIBIFAIM.FEKAQKENNAY1WDJ1DL.EGiWIP'CEIGB~DNQKHPSFYORFL571 YYKtd.LABLSWKFFISVJILlCIYAFLFASSKFLWTf4IICEIFFPLL
WVt.TDSASLSOKKIRELVEENpELLKAIaFKSNEISpLVADAVG&KEISKLREHIEEDKRYLFFPCYYTICPV~.FFN
ViJlIrl'FPFTILSFKLTRGWLRRWLLCiLCII80CNIFANAYBC
DGLRALDIMNAQAIKDCGAQRKCCDLESLtSPVREDKiIWP'>G.EffEt.ORLOEENApLRAINQDPALABNLKKMIA
FJfIfPINiSKIMSEt4IMLLPKC1'R15lp1ERRYNSTYLpiGILIG
EVERLEQEDFDG
KYRKKOGSVKKYOVAFEEf~QSPNPTLRIILiMOrDGICLKRLOQRVOKIpItPYEtIRpGJI
iINWITONYRPFWALTRIEHF3.NLIDYDiJWOQpEDLCIAYANVEKKAEPYKKBLLEIRpV
CPft_0586 675993 677193 LEDY11KLRSAISFIQDKRLWICKESEDLRILINPPFSSFiIWEDOWGGSRE?84KYVPIiWpL
atoC/ntrC-2-Component Regulator SRVTRItDLtaAt.VFCIRIALWACit3ITIALAIGINIGLVSGYFGGTVOItII~RFTEIirtE
KEKINPSRGENHAIKNlLWDDEPLLRDFLSELLTSQCFIPDTAENLRN71T.ONIRSItDYDTNPVLFiLHLVISlTGQ
KSLLtIJIYLLCCFSWICFSRYVRIEVLKpRDRGYVLAATMGY
LVlSDMSMPDGSCLDLIKIIKDSSPNTPVLWTAYCSIENJ1VEAT810GiiFNYLTKPPSSEStiYYINVHOILPNrII
VPViSLVFPAIOIANISCGGLTFLGLGEESSASWQitJOtOCVIGF
ALFAFISKJ1ECLI0JLVNENLFLHSdtTFDSHPLIAESKAd~OfDLt.AfAKRA~SSS11NIFINPAESAVLWPPAII
L'lldLLIAIALIGDCVRDALDPR1.QOS
GE90CCKEVLSFPINNNSPRANNPIfIKVNCAAIPETLLESELFCHEKl8IFTGATTKKAGR
FELAHKGTLLI~EITCVPUNLOAKLhRAIpEKEIEHt~OGTKTLSVDVRILATSNRKLKGCPn_0598 68971?
tODKSFRpDLYYRWVIPLHLPPLRDRpDDILPtrINYFLMtFCI~KtPLKTLSPKADELopp8-Oli9opeptide Peszaease LLNYPWPGNIRELSNVLERWILEH1'SLLTEDMIJ1WEEOCSVLKYILJfRL\'LlPLTLFAIVSINFVIWAAPCDVLE
EKSRDAIGEAGKSDKNRSY
KGPDRYLQFRENYGLTLPIFFNTRPKITHKKIOTALOELANANIII'1'PSAKNAAKSLVYWC
CPn_0587 677779 678111 DCAKFiMPALLFE~DASRDDK'IRHIIIADLFIRCGVLpGFVCPNLSPI4pMQNICEIAESN
yvyD_es conaecvad hypothetical proteinAFWROWEEDLDTKVEALKCYII~DNGCTEVFCYSSKDFYiKTFFLETRFARYNSRVLIILD
SYCELFILSTLLKHHVTLGDKNRPHRKfIVSSKSL71L!(pSAS'l'HVEITTK.1P'RLSNPLKDLFCTLRHDJ1HKT
VISEVIKRLRCSLVLSILPNIVCFVI.CQIFCNINALKRNRNIDHSLNFI
ILEKSDHLPPNETIRWLTSNKWfLCTEVHVVASHGKEILQTKVNNANPYTAVINAFKKIFLILFSIL>lfAfAVFNILD
NNIlRd'IPFTTIPHPYSCLRSPPEVPNEL.S'TIJCRIFDLVSH
RTNANKHSNI(RK~tTKfIDt.Ct.AAKEERIAIQEEOmRLSNEWLPVEGLD~AWDSLIfTLGYVGFLPFCAVSYGAt.
IM7SRLSRSIFLEVLSpDFICMNARGLRWFDILYKNVCNHAAVBIV
PASAKKICISKKKMSIRlQ.SGDGIRDLESAAFtiFLIfLNEOEHKIQCIYKKNOCNYVLIETSLASSLG1'LI.GCat .YVETLFIIIDCFQiffYOAIt.NRDIiNVIILFSVLVGSAL.iLIICYLLG
PSLKFGFCI ~ DICYVLLDPRVDLECRRI
CPn_OS9R 679033 67866 CPn_0599 691927 .89682 CTa64 flypothecieel protein oppA~IiqopepttJe Hindinq Lipoprotein TSKSIKSNAPIIWfI'ATHSLLNLPSSQDSA.iEDSTSDSpIFDPIRNRELVSTPEEKVRpRKRRES'tlfiMYKRCIf LCKILKCt'I:.:uLILLYWSBDLLERDI1L:IKl?IVRDtQ!EDLREtSRV
LI::FWHKLNYPKKLIIIEKELKTLFPLLHRKCfLIPKRRPDILIITPFTY1'GW(,MtTNNVKD(JVI:rpAIPAAt\
:VMLAPKL'/ItDEaFALLFI:OPSYPNLI:
LDPYKppTLPELIGTNFH
LCDPKPLLLIECKAIJ1VNQNALKpLLCYNIfCIt,,aTCtAMACKHSQVSALFNPKTO'LLDFYPfK:ILRTAIIVt:
K('ENL~PFNCFG'IVIIGF'IOLv:IP.~.IJI::PfIVr,KYEEP-~.PDLAVKIEENLV
(Y:LPE'ISQLLNYFL~.WL RDC:.'X:DKEFHIYLRFNVFSdRP
IuPKALFY.HV~1LDL11PDRfHIfM'AIIDIKFFYOAVIOiPW
ATNRAVALR:%'.YEI'IY::V;:VFJIGIJGLV'/PWKAfIT/TNEtT:KEERKVLY~.AFaNI'I:,IQPL
t'1'1 ~(ISHn r:79671 i79175 f'fiFV'ltjlFANf:EKIIEDEZIIt7TYPTN::I~IAr~NF711HY1J11NJ'f IV!%Y:AYYFH:MpDBKLVF
rTA'/4 hypntlutu:I1 Dmcein ::RNI'Lf"IDPUALII'KRFV1'FYE.'TI!:'.LFMDFY.T:KIDf::It.PIMDRONFY~FIBCifiAYN
::::Hr)Ir~J'h:WLR::RPta:KNIITLTPLFTPDCLFTFFAKpW'fWr:DYRt:'LVGI::LCKYTK~VAYI:AVR
FTV::ADIiAYTI'I':rfilr:F::LFP.c:fr)VIH:AItrMIIIIHtFJtfIh~r:L0t~t7YT
IJIIIN':.~.ftl.1'Kt:PIK:DCWAFfJIIKy'!"IALLffA:Ii:KNIQJILI.A:X1NKEId':IIKLF:iLFW
FI::tah'A:::::P::YNKv'IECWlIY:atW\I1.Ll:Ia7.WID1'tl:ff:IHItKVIfIiVIVPFRFRL1,'.
IJIHIfC:::.NI'EFFAJ1IF'VLKLLpYD:ILDLTPAC::LCKII::f.PY4:Y1tY~1:11KL.CKKIIpiIKYYV
Y.:~ITAII'ffAM1'A'1'At'KEL':IIx.'::Ll!:IJIMDL:YjAI'fA:Ytlh'M1WJ1%e'.II:IPPED
~L\C:If.KEEEUILt7AIINAKaF"ELLALAEFPfAfAEKIFYLt'D::WEEKK::ERN.~..~.F.DFPRAILaI:F
)t:ANfiM:'..WWv:F11t1EFAUY!IfH~I:;/h:'IDI.Kh:ffIRLYIIRFIIh:fIIIRFJ11'YA
'II IF.ILFI::K WItIW h'L1 ::I!IIC:iLLYKih1'KN Ih'VTTIIITI
IL f t hi4.'irl~.fYtNfrWII.ItKKhl7it'I_:1~.:
~'1'n 11~'UI r.H1111r: ti7'n;lri n:ltyt~1111 r.~; t~,.. .n1:1'.'./
' .-t'A N.. rr.tnr::l. Ir.wrl.rrl Im.:..r~r II Irylru lure i.vl Prnrrin rr. :..rn.r-rnk!I:MnL .u: ..1 11 ' I!'IH
KK:FJYSIICOAKRFONttLPNIIFDIx:LOF. "RPPNLK:iPY,~iLSDLLlfIEL
1LDKAK~fPAEI'LGIi.R.IEIIf:::JLLi.AFR'.":CKL.LS
..'JLKODRLAYGELIILL.~aKY00KT R IHPGFDCIYIAt~it:RIiv:RDFNL4;N
L f : PCEIIG I:.LRKAFAL.iEK
SIf Nf:YHKI tO
F3SLLKEETr.'.~.LNPAKOHLL'IK t LRDFtrfMDFILR.iLGL1JG111CETY11KALPKOVO
IPHSPCL FL9YFL3ADYSOIGJt11L9U~pR;8.~8~E~'EC1NF
A:70!(~ . A
~~1~
R~IR~T
K111NICTVYr70WLF~AKVItKPStCENCEC'..A'tFSRiAHA"E
':Pn_OSOt ~.?7073 ';727)5 HIGRFRIIOS~INEFPCRFAVN1'RIOT:SAAELIKLAIFLDISOAIKOQa0t5AfQ:.
CT493 hypochotical protein OIHDELLFE11PEEEILFMI:RLVREKMESAKf:..iJPIWN:LKtBGIEC
?(DEITPN'fPL4RODSLWtiR'IRVSWRADL.S11SSRYEIASAIAIL:LLY
' FPRINADDLIN
.
a c~ nr; l 'IOS~d2 ~;t.t:5lt O
AFCASAAVS I IFTANPi.AQIIF I DrCLJIIfiLL2I
PLY IvLLI IG I IVL:.YGIYLFPOORE
~D~. ~''~tt~ : ~il~.~ w:'ti :' ~.':~:~ :. .-...,. .. .
,. . ;..: :,.:: ,...._:,....
:c~~;...:.._ I..\_,;..-,Y,;,.
'VNh'f:
' -_ .
': vy;,ll:.::::... :.:r..~:. .
KTAPLIAVItHKDVI?u:KM'AK?tvN::.t:'Jt'KnIYLKDHYKvIV10NO1:PCuiVFEIDR
D.iGFIiKPIGFOENLEALCNKTStIOLLKYLLKGILfVC'GASLLIALEFSFPLYFFLFSGKTRFWILCRItGFPI'f ~IVNCi.CAS~YY'J:CAi1?KIYA?SSSLICSICVASGPFIlNK
LYSIQ
VIPAPCLACFFLTLFVCLVTRLYLLSCIIGDFFE~.ASEYLOGAVPPtOCRSOttIVEiQSHL.
OCLNRYGYESDLL:Ar.%KDIGPl4JPYTPWfSHDREEROATLDFLYGOFItDIYIO~LPii.TK
AAAI11'KISINLONOEYSLLSEIFKFLPKHDLIRKFSCFCFWILDYFGFRECLLOKAIiJLYIEKLVIIfIICIIRIF
SPEKAKOELYIITI\"GATKEQVL.COIVaYCKIC~IYAVICSOIa~~RfR
' F1LT71R VASAMSSPLVfCIIIKHDILPLSHDAAYIPPYIJ1L
KWpAIpVDLSAHVSLAOAYVALSGLYADPRKYPEFDANYWIPSGRYS7lEI0Gtt ' 'IL
RAIECTOIINEYAPCNAI~MJ10LAYSYHDLOhIPHEEIOEYEIVLKLKPftWl1115KiL
YlppOt4AKGIRIYLEIKKRDYKKSOKLIKFYf.IItYIfYCPeL.0611 707175 705793 CPn_060) 691136 695185 adc-ADPJATP Translot:ase PIYKSEFSKPItPLFtiJIFFItCFNYCLLKNOID
LAAYLC
VFIAHK1R3KtP1p58EYKPPSA
hweZFerroeheealase _ 71NFOGPRHAKDI4EFLISLLT~tDVICTF _ TPAYLL M
y ~
YS
I
. YYVHSB~S VPL.IffG.
WICIMtLIVt3IpCLVSLFLAKKYM I
t.PRVLNRHLFfFtA~(RVPKYLPpYOSLQtiWSPIYFO?ETL.71KTLSEILPAPVIPfIdtYLELLPOGLRGFIVMI
IIYWS
I
PYC!~SLIILNSL11DKL.Q
CLiiNOITTI'!FaGRFYALINTGLMSSICAGEISYWIIfIKOTFVAYSFACD~IitSVIIIJILT
PSTIIEKTLLALRTLHTRHYICIPLFPHP1'YSVTGSIVRFFT00MEIPISWIPOFCSDSKNLITCSGLIHIWI'YARI
HHLTIDTSIPPSAAW1EDC'dATANLKLOUIPKAKARHLPLJLL
FVSLITCHIRDFLOKLCILEKECCFLFSVf~LPVRYI50GDPYSKQCYESFS1lI11TlFKQIOSRYLi~GL7IIIYLS
YIfLYIRLFMIWKD0YS0IYSSiIVEFNCYNSA.?TLIGV1ISVL117L
' YLPL .
VLL1COCIRId~CALYTP1.YHLVSGLLFFGTIFAA1WDISIFGGYi.~ffPL.71L~W1' S~IFLCFOSKFGPGKwiSPSTAOLCQNIDTOKPNVIWPFCFISDNLITLYEIERD
LRSRGYRALRIPAIYSSPLWVSfLVDIVIfEN51'WAEELIRSGI0011GIROtllbNyLgRV"TKFfFFDQI'K84AP
IPLSPEDKNIIGKAAIL1GVVSRICKSOaLlYOGLiN
IFSSVAASiIWIJILVGLIIMIVWLAWAYIGKEYYSRAAOAVATLKOPKCPSSSIVREAO
CPn_0601 695981 695196 EIiY-Glutanline Bindia0 Procsm NSEil0I14VKIKFSW1IVNFLICLIJ1VGLIFFCCSRYKREVLVGRDffIWP'P1LOFGIY9 707631 TSaLNAPLNDLVSEINYitENLHINIVNODWVHLFFNLDORICIOGIIFTSVLPlLH~.~tYQ14 pQsA-Glycerol-3-P Phosphatadylcransterase FSppILL'IGpVLWAODSPYOSIEDLKGRLIGVYAFDSSVLWIOI~tIPDAViSLYOfLAKIIOtQFCNIZSLSRWLAL
Y!'CQEI~HIRLLRIVGAI4.SDIFLDCYi.iIARYIUfISRLGS
r_5...'TTSNCYWLiaPVTLYPJ1LIET11YKGRLKIISKPIlIAOCLRLiIILKOTRGDLLiLDPITDINIVPVCIT
OLYIIECSIStIWLFFICARDLFLIItV~CYGSLVIOf,T~11~Y0YGSL
ACLVK1'RRSGKYDAIKOAYRLP
lWCitIF'rVVOFIILLLYfAOCEIPW1~GLVPLVAIrcrFLYFLERIlIflYIO~LA
CPn.,0605 696777 696150 CPeI_0616 708701 710137 yhhF-Nethylue dna8-Aepllcative ON71 Hslieaie LRKLCSSRGOVRILrGKYKGKSt.K'fFSNPHIRPTSCLtnCFaFFSiCAEDIDGAAFLOLFATGVHYLMJU1NOLYCE
DFYYLEH
CIQ
' ~IIIGFE7lLSRGAJ1SWFVDISIAAIOLIHTNSALIGEOLPWIFRODAOSAIQRLIKO.
GVPLPSPPHSKESEHIVI.C
TLTNYESSLIJO)KS1 iIBFJIG'fA7IYLiEYVDZ
KRIDiOL'fVIGGPSYLITi IDVNi RGEtL
KRSFDLIYIIfPPYELCt~ICYV1'ttOKIVSGNILNPEGTLFLF3JASDEEIACEGLTLRRRR.
.
.
KIIFRVLODAFKOIRCP
IRS1CRILRRHISTAKEIEKAALEOPKNVJLEIILDEJVONSFFKISt75TSYSQYTLVAtSCi~
KLGK1'YLAEYIVP
LTTTfDKPYLVOIQEROELFL~OtLIpGDNIISFfTGIPTHFIDLDOLI11CFSP9NWILMR
PAIKtKl'ALA~IIAlI4UCFOHALPIGIFSLQlfVDQLIHRMICSRSIIFDSK1LISTOOLBDH
CPn_0606 69749? 696707 DFORIVSVIlIEt40EtlLLLIDOOPCLKVSDLRARUtRl9tESYDIOFLIIDYLOLi.Sri80'fI' CT188 hypothetical Drouin RATFSROTEISEISRM.%TL7IREItIIPIICLSOLSRKVEDMIWRPlIIiDLRESG8IR10D
tOiIYCLADLIL
!
SDLVM!'LLRREY7IDPNDKPGTAELIIAlOiNICSIGSVPLVFEKEGIAPRNYBJIF~IS
SSYSRItOLRFYLGSLO
EDIVLLPGDISWAIB~iLSEANKDFAFICDLPOtKYHIRGaRiOYWSSASTSItITAALPPSLY
YLNp~'71LLTPHL71WGVRLWDSPTICVKKJQJFLTPSTOEOSYTEQDEKIFLRELGRLKR
AFAALPXEVTEVIVKrNYPPISSDGTPGPISEFLGDGRVSLCLtGHIHKVORPIDGIGII, IAGIHYILVAADYVNFVPQEVN
CPII_0607 698910 697577 010C-Glucolrl-P lldwyltrantEstafe NRAIOtIIflIMPEASNFFSSHPYRDlLVCVIILCGCEGIfRLSPLTItCRCKPfVSFGGRItIa.
IDIPISIGISaGFSItIFVICQYLTYTL00HLFK1'YFYf90VL.ODOIHLLAPEAR0000I41Y
QGTADiIIAIDC.t.YF~DTEIEYFLILSGDOLY68mFASIVOTAIATHV~IVL.VAOPIPEKD
AYpIGVLDIDS~R.IDFY»PQCKLVLIIRFOLSSEDRRIIIKL?~oSGDFLC~~ICIYLFR
RDSLFSLLREEEGNDI~sKtiLI01lp10CROQVO'fLLYNGriIADIG'1'IESYYEIW IALTOKPH
ACIOIGLNC7fDDtxIHIYSKNHHLPGAIITDSNISSSLLCEGLItINCSHV8R5VIGIRSKIG
ERSWDOSIIIlGN7IIlYGSPStiPSLGIGKDCEI1DIAIIDF34CCICSiGVKt.~ILKGYIKYOS
PDKKLFVRONIIIVPOGTNIPDNYIF
0608 699690 699016 CPeL0618 71Z)00 713010 CPn _ lplA-Lipoace-Protein Lipase A
Oridine 5'-NOnophospifate SynthaseKNHPfCNCIFLDLPGIISILHOLOIEFJ1LLRVANONFCIINSGJ11(DSIVLCISAIA?10WH
itlmp Synchssel-truncated?
' PLYVOMtLV
ISRJIOADItIPIIRRYSOGGIVFIDSM'IJFVSWIt4JSSE71SA0P0ELL.AWrYGIYSPLLPN
VSPLYFVIDtGRRLWPll49YEDAKLRGQAV11ILYQICaIKFGIWIL7l5GEE3 ISSPEVI41VATLIWRLRPSFNSSLLGGVPYT111.'fL7vTSISLKYNIPNVLRRKfit~tiVOPTFSIRErmYVIGH
K1II0CNAQYIORHRWVHH'NfFLWOIDLDItiSYYLPIP000PTYRNOR
SDAIKVEGLFTPCQIrLVLIJO14VSSGKSIIETAVALEENGLWRFJILVFLORRItEiICOPLSNEEFLTTLRPWFPS
1LDDFLFRIKASGSLLFTWEEFLDftELEEILAOPHRK11TTVW
GPQCIKVSSVFTVPTLIKAGIAYCKLSSGOLTLANKISEILEIES
0609' 699672 699986 CPtL0619 713162 713013 CPn _ ndk-Nucleoside-I-P Kinese CT190 hypothetical Drotein RRYVYThtEOTLSIIKPDSVSKAHICEILSIFE05CLRIAAMKM0iL50TFJ1ECFYFVNRE
ONTKNSLIRFMILIRLFLGISLPKCFPLYLEPPLVLATFOCTOFVGTYSEATNPLYIDNLRPFFOELVDt7tVSOPWVL
VLEGANAVSRNREtI'1GATNPAEJ1ASGTLPAKFGGSIGVtMV
NLNYNYTOELLYKAVPCNYKSIYREIPLIIFPEVLIGSTPTOSTEHGSOZ'LFiJAAVEIAYFFSKIEVVNASKPLV
CPn_0610 '01150 7000:9 CPn rho-Transcription termination Factor_ RLFLrtFKGSIHKCERSSEILPRVKETKKHAYVSMOEKSCVGECAWASESEEAESVTVTKruvA-HOlliday Junction Nslicaee IAKLORNCIEELNIIJ1RCYCVNNIGSLTKSaWFEIVKAttSERPDELLICECVLEYLPDCDKMYDYIRGTLTWHTGaI
VIECOCIC'MLAITERWAIECIRALNpDFLVETIIVIFRCIE
eCFLRSP1'YNYLBSAEDIYVSPAOIRRFDLKKGtn'IIG1'LRSPKEKEKYFALGKVDKINCHL.LYCFHSREERECP
RILISFSCICPKLALAIWALPLKVLCSWRSEDIRALASVSCIG
:iTPdWfERVLFENLTPLYPNQRIVl484CKDHIr\ERVLDLTAPIGKGORGLIVAPPRSOKKKTAEKLtiVELKOKLP
DLLFLDSRVITSOTKITSSCLEEGIOALrLILGYSKIAAENIiAE
'!YILOSIAHAIAVNNPDIVLIVLLIDERPEEVTDNIROVRGEWASTFDEOPERHIOVAEAIKDLPEGSSLTDILPIAL
KKNFSCVNKD
KMRLVEHCNDNVLLLDSITRLARAYNCVOPNSCKILTGGVDASALHKPKRFFCJW
MVIF
. CPn_0621 71x707 7111.14 RNIECCGSLTILATlILIDTCSRHDEVIFEEFKC'ICNMELVLORRLSDRRTYPAIDLIKSG
LYNPSELERVYLFROAIJ1DL'1'fiDAIWLLLGRLKKTNSNAEFLLSLKEruvr:-Crossover Junction Endonuclaase TRKEEL
.
L:iRWSSFKDNKFKYF0E31VSELIIGVDPC'fIVACYALIAVEQRYOLRPYSYGAIRLSS
t7NPLPNRYKTLFEOtSCVLDDTOPNAIFVLE'K~FVNKNPOSfMKWDIRGIVLtJIAIIpRDI
173 7011:0 ' ' .
LIFE'tAPNVAKKAWC.KGtLi:iKROIpVMVSKILFNPE~/LNPSNEDIADAFALAICNTNV71 _t n_0611 yacE-predicted phosphatass/kinass R~aPtr<:CYR
V
F
RtfNRRDAKTSEREOGISYDFIRSYSCEYLNWICKLGN4Ll(LLKVSITCDLSSGKTE71CO
.AYWwADEISHSFLIPHTRIGRRVIDLLGSDVWOCAFDAQAIMKVFYNSVLLOC
JEII' ' . 1':1 LEAttJIPEVCRILEfQYHOSIODCNYPLFVAEVPLLYEIHYAKWFOSVLLVHANfDIRRECfm 01.72 '15761 ~.EDFOORuRFU~VEEKt.AQADVWENNGTKKELHOKIEEYFYALKCALv:T'.Ui hyp.~cnACi.:.O
pcotein RFHYKTCR3 "
. .PwKDADINP110O LCNIfSCV
NY:: JR t.t.:: i LKLHLF::I.H: aS..4a'lIY
YH: d.'::R:.'F1LIILId :fn II.lJ ~vl.l6RH ~JV_U_=
:.':FH:Y:Y.1:J::IIJY.E~fC\~a'II~~EHERIIIIJ,IYRF.~~L.::ALEEEIRRREEA10i00L.EKL.OQ
QPf ' ' ImIA-UNA ll.lyaa':.Iz;u I YFJIKIKOLE~LORYVS
fEI:E.RPtt.
wtlJllf!1lF:KhrJtRlk.'::~II~EIY.KELUJ::VSH
H.:IIff::LIL~fVERFRREfNBCKLFVLUA::.:FIFIL1'fFALPENKMIpGOATrJAVFGFitI~>LnIR:AI".
:(t:LEEfAI:::N\.\1'\I:IFIPLKK::LIDL.yEY.DIYIKTY11::FIAKLHEKL.OROICAO
' NKLIKEF.~.F1=fNI::VFDv:hNHK0::R0AIYADYK3Nk!JKKFEDIPWIALVKt:IC:SLICIr\HANP.~.Flf KLDHWI
'f::::h'/t.'::fEKLifF:VLYfDI\I:KY.YAI\t.tlJUfI:UJ'fWJl.i!Gf.IIKE:Kt YLE7!E::VFAUInIIA:;IAKKWEFtJYKVIIU'1'ADKOLWLVNDHWAWNFWAIY~~WC'aI::E~:LIJ:YI:IF_ :FI;tNV1::Ii;:K:aii::
V 1 Eh'n: l Pfr xi I fDYf.ALVt:D::.~.DN
I FGLKX:~PKKAAAIdJIOF'f:.~>ltEt%LLENLIN1VKGL' .:l!fNl-:EROt~fLKL::KH11L1.D::NIPIPI'FIESLTFPQFIPVDEEKLIIIFYI40t:FKTGVPI ~nl l II..I..~
'.'fhry.: s ::KrJfUATVt7VrJIINUA1::'.L'fNiLNLV~\:::DI1FAVAYTr:NIIW..~.LKLEGIlvLTOC:.(:VF~T' .n4 INlr,rtr.t i.:.n! L~.....,n ' t:EE7ta'KILIILKLWILHCDI:fFYufNLKRDCIIALLJ~W:I'JIREI.':YGLAL.AEHLTNFFRNI'D
FIII
IY!rftfl'rlF"MeUINI~IYt'I~:vF;YY.l:7/I:FIrKIIN::rJnfFt4V1:1W
:VI::IJ111r .
IC:Yf'Yi:Jt'l4:fflF!MI'.l'RIfAtltt.Y.A'/!:LINa:VKIIVY:I!FJvLIKUfY..'."f1'LINIOE
KPLrI
t::iYJ::IJ.'lNII:FTFTAIIRFAKEIa:N:x:LfLGt!LPFJJt'F.OYFY:EFVA'ILPIIKOAIL' XY:Y
. .h::RKISKKF::\ItFh::l Yr:F:Y.WKNKKYI..':1't!FIIIIKf.IAFYIYtA:YJKILtffVK
r:l: ttINYFH1111 t L::U 1 f?fl'LF:KVLF::HEH4::1KhY:I'F:f f:
l.:VfLOVEfIJI I I.I S\I.FETEWVLTEEIYOt.:
FEWEE::pFIIEIVEOKKF:LLPPPAKLI:rEYINC.r~l't.:I'7 Ribasrx4rl h stn JPWTSJIDWfrLOALVRESSDL HKKEKVK".sMA'3EP1 t.RKVKI;WVSAKNEKTWVNVERIF:iHP~YLKV'lR3.iKKYYA!(:' WALL::AGUAtHFPETEEEPT.3Jl.r'FE&i.SANFFPETSSATEEEELKVSEGDxvKIG4'i'll~ItI:KAIINV, ~CVt/Sf;.;
CPn_Ob24 7I8D19 717011 Ob33 725979 725743 CPn ~ap~-.IY~er~idehYdefP Dehyroqenase_ AMKWTNCFGRt:RLVLROIGIRNSSV~LAINDLVPGDJ1LTYLFKFOSTHGRPPEOVACr1.?-L2? RaDOSOmaI
Protein ASGKGIMIAAKKI:L:.TOLRCFaDDDL~w\YVHENKKALFALRAENL~.(~IJKWKVIMFSI11K
EAI7HLIW':KRKIGFI-iERNVONLPWKDLCVDLVIFaCTCiLFTKKEDAALhIQAGAKRYLISKNIARALTIKOEPYr%KYH~
' EGtifITUHA
nrvllr:rtl"f!~'.frPINHYTFtIPP:ICDI~IL~ItA.Sf.'rl'4t.'trIPIAKVLLIBIF'::".
"'Y:4l:N7a-=: ::i~\:::'::.:./.\':.':..::.Fi:LF':Y;.'."~CAFRVC:::... . , v ' . ... . , .
'r ...~L~:::.:.~'.~L'E:'.~.'..'.:~I':'::,.u_:.:FC:.:.:.:.-. .::._:,Z:.L.''.;.IY:.L:aYvtrY~ rllo-Lte RtDOSOMaI Protean IAtI~7DRFFKLVAWYONEZCYATRIVDLLEYVEKNSKI
IIINIPKATKFRKOGKGQFRGLSKGaTFIIDFGI:YANOTL.EP(~IVI'SROIE71CRVAIIIIYL
KAI!<SKVWIAIFPDKS':KKPAETAMCKCKCAPDMWVAYVRPGRILF1YANVSK~t.
CPn_0625 718188 718060 AAAAAIG1CIKTAPIKAVER
r117-L17 Ribosomal Protein AAHAOp4ITW~S
vtpNARKKPAVCRTSSIWRC71WJl4.KSLIIIYERILTfLPKAKEL727D92 726409 IIAFVERI(~1~' CPCL0611 LAARRIAICNiHVRYIKQ.TSKEARQAItGCDI'SVYNVDRLWNKL.FDILGrs3-S3 Ribosomal Protein KGRRIIICOt(QCPICFR'IGVTIUtWRSLWItGNKDEFGKFLIEDVAIAOFLAIOCPSCOCANCP
WPAILSGKZEY1'IG'IJIAPOLYIGKIOCIIF.YDLLKBLLAALiGKEI7IiLEIJIEI1WGJ41KL
CPn_0626 719670 718495 VJUaJIAAOIERAYSFRIW4KXANOSVlmAG71VCV1II0VSGRLi1G71CIARSdYf~AVPL
rpoA-RNA Polytsereee Alpha HTLAADII1YATACJ1E'!'!'YCIIGIKVWII~GiSSSITPt'84PAAPSAAA
WLGKEKCaISDNAIO~iLLYDKFELPEAV1QQ.WlxLPIDKHAAFIAEPLER
wLPAKK%AQS
CNGHTL)GNALPAALLIGLFJ1PAIIS!'AM'GVLHEYNAIEGVI~ILHLKGAL.LIIKY72711D 727096 PNQO&SLGAT'CQVLfUISISIDJISOt.AAANCQKM'LDaLi.ODCOFCAVNPDOVIF'M'OPCPn.-0642 r122-L22 Ribosomal Protein IOLh1)vLAIAFGRCYTPSEAIYLEDIaCVICEIVLOAAFSPVTLVNYFVCDTRVGODTDFDRAAHSIVKATI1AYIRV
OPRKARLAAGLIOtNLSYOEAEEpLGFSOLKAGACLKXViliSAYIW
LVLiVffDCAVTPKEIVLA!'S'IQILTKHPSIFFi~I~EKKIVFEFJ1ISIEKC4KDDILNKLISVTEVAVDAGPVYK
RSKSKSRGfiRSPILKICTSHLTVIY00fm . A~1IAREM
4:INEIELSVRSTNCLSN7INIlTIt;FININPEPRLLOFRNFGKKSLCEIKNKLKElDQ.EL.
G~.TOFCVCLONVICEKt9IWYAEKIM10r1'IOGCPCI,.0643 727725 727450 CPn_0627 720059 719640 rsl9-519 Ribosomal Protein EIRDICRSLRKGPFVDHNLLRKVRAIB'IIEEKKTpIIrIWSRASNITPOtIGIn'FM~IDI
rail-511 Ribosomal Protein ItLTVPVSEITNGICKIGEFSPTRIFKSNPVI~
AQAKIISVIIRICOLIC~tIPSOWICVKATFIB'TfIVSITDPACNVI9WASAGK
VLVIOJ
A
FLI
SR CPt7_0641 728594 727722 O
VCYSGSAKSSAP1U1TVAAOOAAKTJIIO~ISGLKFVE11CLWGTCAGRtSIIYRALI&71GLWSY
IRDETPVPtBiCCRPAKRARV rl2-L2 Aibt>aaaal Protein CPr~0628 720461 720063 FIREIN&QR(!'KIrV'i'POI'RDLYLPwiDEL'1TRGELRG?'RSKRSLRPMtKLBFTIOtSSOG
RiII~IISCRHROOGAIOOLYRWDF1W'BIDGITAKWrVEYDPMISAYIALL8Y8D~R
csl3-513 Ribosomal Protein YILAP>OGIOAGOVYVSGl7GSPFKPC7CGK1'LKSIPCGLSVfOIIENRPSSOGKf.VR8A0LM
IltY1'ILREAQRNPRIICIDIPAKK1G.KISLTYIYGIGSJ1RSDEIIIOQJILOPEIiRASELT' EEEVGRLNSLLOSIYIYOGI)LRRRVGSDIKALIAIHSYRGQAiIRLSLPVRCQRTKTNSRTiT.KItPSGEFRt4.ti DGCRATIG
OVIAIt$PGYV
RKGIDtKTVAGKKX TJ148~IPVDNP110GCE(ZAH14Ci1fIPICT
0629 721881 720487 CPn_0645 728933 728598 CPIt _ r137-L21 Ribosaeal Proteia sect-Transloease OM~IfOYIKRHYVTCKAKl4.EHLSA~1'Cflfr~fl~CS!'CILDPKTVFIV51~11?I~LIaOAL
KIRLL~'RPYKI'1'LROFFLITELRQKLFYTFALLTACAVGVfIPVPGINGELAVAYfICQLLC' ~' SCONLFOLilDIF5t~71FA0NTVIJIILiWPYIS11SIIVOLFLVIfIPALOAty'OItSSDOGKRf1~f0018VG
IOfAIV
EAIYVDKNVKVKSVNfTrA7KPOPAPMFAGRPt~ATSGI
RIGRLTALFTVALiaVIOSLLFAt~ALAINLTIPGIVLPTLLSSKLFGVPwIFIfI'1TVV1M
CPl1 TfGTLLL36(IGEpISpIOGIpJGISLIIAI,GILSSFpSVLCSIVIIIGiaCSODSSO~.IS-r11-tA Ribosanal Psotsin ILILALVPVFVLITTILIIECVRKIPVOYARRVIGRRbVPGGGSYLPl.KW1(~P~FyAiDIJNLLSK1DFSCNKIGEV
EVADSLPAD~OCLOLIKDYIVAIAN11010ti8AC'!fit ASSLLNFPATICOFIAS&SYl9tRIAALLAPGSLVYSICYVLLIIFF'lYlwi'ATOFHPEOIALL
' IASEI~fOQtAFIPCIROGKPTOIIYLEY'llAfIRYCL1LGALFLAAIAILPSLd.CCLLRVDStJVRGGGIVFGPII
SEYBNSTAKPPKOIOGTGIIAROGCLiISPO!
A
SJILIIFIJIDCNVOCRSILFIDNLO11V0~a LTAPkT
' Lt?OtRYDSVLiITOATIOCIW .
GWLDT p 4 Li101II0 INKLTPVD~~DR
ISLANLTAVIOCFVYCININCYDLASAIaIIVISpfAL.OELYERLVfiTIID
CWOJIP
SYFLOGTAIT<.IW
CPei_0630 722316 721885 CPt1 r115-L15 Ribosomal Prouin -RRFGYE0I1GVPLYR r13-L3 Ribosa~l Protein NIKLESLFDISERIWAKIQ.LGRGPSStaiGKTSC~IKGOGSRSGYKYLEYPSYCIC4L.PPLITCPFIFLA~FLFFLt ?1SISKILSRFVSLTf.OBEBIfSLIi310Kf11 RVPTRGFSHKRFDKCIfEEITTCRLAELF0E7GGITLOALKAKKAIAAOAVRVKVILIIGOL' XESOOYPSLOIGAEOIIAP
RSHISVIGKK>DQ4IHIFDKOCSLVACSYIRVEPNVYf0IR1 EKTIVNOCiAWiSGCVONLLGIT ~ZTK~
CI810GICGFOGtMKKFG1~GPGSHGSG!'1(RNAGBIGIBtSTPGRCPPGSKAPS1o83i1~M' CPn_0631 722812 722712 VIO'E.EVIKVtM.tKKVLLVKGAIPGAItGSIVIVKfISSRT
ray-SS Ribosomal Protein ~15GSKNSHKEOOLEEIfVLWNRCSRRFSFSALILVGDCKGAI~SYGPAKANEL
CPn TDAIAKOCEAAxtWtI9CIEALEOGSIPHEVLVHHOGAOt.LLKPAKPGIt3IYAGSRIRLI_ CTS29 hypothetical D~tein ' eHAGiKDIVAKSFGSNNPI4NQV1W1FKALTt~.SPRImLLARGAAINDFFFIGIPCXEVIOtATNJIIASAGBAASi0 0.LPVAXEPMVSSFJ1QKGIYCI00!lTliP'GNXL
AK!'00J1TKSL00(CFKLSKAVSDCWCSLEF011LTSAMIApOIa.KiTAEWAW~1V
CPn _ ItIGtIVPSfVNSIbRCY0YTA0AFEUSKTKERKTPCEYSR~.LTRODYLWvBAGCtA
r118-L18 Ribosomal Protein ' KCLISSWLVNLLOVFAPNVLLNLIKVREFVMCMaISWKLVKLRIf0Al0iRSRVMESSLCKItif11J1GVAGAVOGIA
L
G71TTYSATFGVLRPLG.INKLTAKPFLOKATVGIIFGTAVAGIItI
KSL40(RRAALRVRKVLKGSP'fKPRLSWKTNKHIYVOLIDDSIG%TLASVSI'LSKLtJICSOEOKLFKJWCESLYNE
RCALCJOOSOL9GDVILSAERALRKEtIVATLKAHVLTi.L>GGt.EI.
CLTKKNOEVAKVLGl'OIAEIGHIrt.OLDAWFDRGPPKYNCIVSMIADGAAEDGLOFWDG11KLIPLPITVACSAiII
SGaLTAASAGIGLYSIWOKTKSGK
CPn_0633 727760 723209 CPrr_0649 772672 731710 !mc-Nethionyl cRNA Poa'myicransferase rl6-Lb Ribosomal Protein IJOLKVVYFCTPtFrIITVL00LLHHKIOITAW1'RVDKPOKA8AOLIPSPVKTIALTIIGLP
SHSRKAREPILLPOGVtVSIGODKIIVKGP1CCSLTOKSVKEVEITLKDNSIFVHAAPNVV
ORPSCHOCLYWALISNMVpCVHLGFEKRLFI4ICVGFAASVQGAFLDLSIGVSHPTKIPIPLLOPSKASOPOFIEELRA
FNADVPIWAYGAILROIVLDIPRYGCYNLHAGLi.PXT~GM
' STLQVSVEKNTLISVKGLDKGLVOEF)1ASIAAKRPPEPYKGKGIRYeHEYVRA1UVGKAAKSGEU1W1L11SpGiIIV
LIK
PIOAGINEGATESCNIVIAL'~11Gf4TI'GONANITRVPICPOIT!
TLQOIESGOLOLVSODMf.ATIAPKLSKELf~VPWD1IPAKFJ1YANIAC11TPAPOAKILFS
tGKK
FSEKAPKRllCIRKdSLLAEAGRYGJ1PGTVVYI'DROELAIACSEGAICLHEYOV~KGSTN
CPn_0~74 724215 723787 SILiPIlJGYPI1KKLICIVf'CLNN
rsH-aP Ribosomal Protein E3SIKRKAIYMCKCSDSTAOLLTRIANAI~IAENLYVDVEHSKNREAIVKILKHKOFVAHYCPrr_0650 777517 7326b5 LVKEEt?iRKAANAVPLOYSDDRKPVIHQLicRVSKPSRAV'NSAAKIPYVFCM4CISVLSTSlpxA-ACyI-Carrier UOP-~lcNAc 0-ACylcransEerase rX:VtIECSLARSIDIiCGELLCLVW SRRN4ASIHPTAI IEPGAKIGKOW IEPYW
IKATVTLCI7NV WKSYAYIIXNIITIQOC!
TIWPSANICNIfPGOLICYOCEKTYVCIClTICEIAEFAI
ITSSTFECI'1YSIC~IiCLINPWA
HVAtINCI'ICIRiVVLSNNAQLACHVQVCDYAILOGIIVGVNOFVRIGAHAN~CALSGIMW
CPn _ PPY'I'IGSGNPYGI~.aGtNKVOLORROVPFATRLALIKAPKKIYRADGCFFESLEITLCEYC
rl5-LS Ribosomal Protein CERKANNSRLKKFYTEEIRKSLFEKFGYANKIpIPVLKKIVLSHCIrIEAAICD10JLF0AHLOIPMMFLEFCC3PSKR
CIERSIOKGJ1LEEESAWfEL~ILIES
ECLTNISCQKPLVTKARNSIACFKLRF.CpCIGAKVTLRCIRNYDIMDRFCNIVSPRIRDF
ttb5! 733975 737517 CPn R4F~NKCOCRCCYSVCLDDQQIFPEIIILDRVKRTOCLNIIWIJ'l'fApTDDlxTfLLEWCL_ C.aGZ.Nyrlstoyl\cyl Cacrier Oehy3racase kFKKJIp MJUPrIIIKLAELLCLLPtIRYPFLLVDKVLSYDIEAR5ITl4pKNV'fiNEPpFNfAIFPNAPI
<:Pn brit.: 775tOR 724750 Nf~f:VLLLEALAtsArh:VLiCLVLEIIDRNKRIALFI~IOKAF.FROAVRiCDVLTi.OMFSLt ' ~
rt2AL~1 Ribrartm,sl Prntr.in !'t:QI.VTFrIEL.iFALVDKFw t :~Kr7r:IfAWAGAR1 FY, t:KEIMKKVN I RVC~KVF I L.ACNOKtiKECKVG.LTEDKW
VEC:VNVR t KtJ I KR'~(FK ~Pn:N:S~ 7t.lNqy 7)799D
:YkI;:IFIvt'ItII:,NffRL?fAf:EPAKt.:DVKVTEGGREWORRPU:TSVLYRLVRCKKG
Ipxr: Ptysrryt r:lcN.W nre.metyla5e :In or: s7 ~_'u47 t 7'.'.5D'1s KRN::I
t'ft;O::L:;:l'1'NC.F.R'lltR'CLItREI/RYA:W:IHLl3K.~.STIJCLOPAQ'lNl~':I11FGR0.~.
t4 Hilrc:rrr..sl Irnr A:x:lffEtiVPAId.IWVY'lT':R:'I'fL;Ar:::AVIA7Yt31LNAALRSNNtDIILIIOr::7t:EEtPI
.W
r114.1 .
c:Or:.:?1VFYIJ.ICtsAt:(t:ELsE00Y.V:.IARLTPP/YYOHQOIFLAAFP
. I~L.KL:YTW1YPQ
tt:ttrtWt.:VLKVACAdIt:AKKVNr'FKVta7r:.~.RRRYA'l~/f:lriltlft.':a'Rf7VEI?C:::LKKC
:DV
VIIDOKC:NINr.T1!ll'r:WARt:IirOkr:FIKL".:iL::::CIr.TVYK::LVINEE;:FRVl:IAf~.'RTFA
L'fIIELCFIlIEKGLI~:LIMfAWFKOtt:II
IYAV1VR'Mtltlll'PNKOt.:TI.KFIri'H:a' . ::IrrYlIHFAPEf'VRItK I
LGLI.:OC.;:I.W:RPFVAIIVUIW::.r:NESTItAFCKKIi.EALhL
AthV t wtn ~u. rH ~: ': f'7 t 'f~bA'ro rtn 7r.5 s f s...t..'r 7 t4H4sr ':
.-urE-Apnlvpopmtem NAeatYleranst.-".rseANFCVSLFEIOCLtCMLVAC-....DKISIICxtR:PMNVLF~L:L:.FAItGNWF.'.RSNtKrMV
' ' ' ':EPVGRIFr'FVLiyIt:LLAFAOPOL~PVStLCMCGYG!'FSJYSLEPLKKPSLPLRTI.FVSYPL~T.DI' ttl4PAYF:ATFA..
IA.~.T
OCILLFVICFFLYOPQIM LAAaEL'((((eAAG
f 'I~~R
I~ f ' ' , ~
CFFTIIP'PIEv~INF~rWIIL.i00YICKLIYLVWLTLITILSYLFSCFSCLLYAIVROKRTAFLI
LP
:
WIRVIIGFFIALtJILR6f .. v... .."i. .~ , i WSLPCVWVAICiLItFYGIF~"!. fSFDYLdIPMTJ1SAYGROFGGFIGrtAG05FAVIAVI~IISF
YCLLLKKpNAKMLWVLTL:.LPYTFGAIHYCYLKHAF00DKRALRVAWQP)WPPIRPRIJfCPn..0666 71677n SPfvVWEpLLpLV::PIOOPIDLLIFPCVWPFGKNRpVYPYESCAHLLSSFAPLPIOGItAT'dnsE-ONA Pol :II Alpne ' ~.N3DCATAL~HFOCPVLI:LERWVKKENVLYWYNSANISHKGISUGYOKRILVPOGE0K
L
GFFL711IPGHGNSOYSYLCAHSSIKDFVAKGOEFGIPA~I:.vOHQILYGAWFYKEL~
.
~TpPttrrE_~IlAW:~PPDYKKEKRSRAAHHLILLCKNECC:YPHL:LLTS1JIFTlI:FYYF
'fY:KF~:":.I~P~t.FnY'/AfIX'KRLPr:RR.'7:'!'l':'VRGLPRIr:LT:.~lEGZ'FCYRLOSYK.
..
' ' ' n~ tl .,..; .f Y: I'll iw?tf"".?!Ar...,..~,.Vl:i:::ii~R:LKLO
:,:i,:..:::1:::'tlf .nr .. .'.:~8':Ilr.:
J ML . , . :OL'r'-'_ .a,~..;~IJiK
,.v.WF
. . :.:
:'It:CKe:.~'.'(.'.'~: : .. . , ~
~
' ' ' , .
....~r..Ag._.\7::i:::~::a'L,:YRf.irlKF:~
... ~i UPlET14\I .: : /i.cT.::.v :.:.1.
Ll't1i KT:.Yf: i ,~ :L . : 1. :.:SIT
a. a :\
~:l::f~:a.~'JU'IQAII
:
!t':vF'l:l~'FY.ewGi:.i:'.' ' KEIR CILIJr'VOSCLf VItIAKQIP: H I
PNPKRKVYRSREYYFKSPApNAELFKDIPEV
I$NILLYA
KRCDlTFDFSKKfIYPIYVPESLKTWSYTEEDRYOASAVFLK~IIEALPIOIrSSIVIaN
CPrL0651 777051 776507 IAIOfFPNRDPIDIVIfEPlmNL<4AI L I
PKCwlICDYLLIVWDI INNJ1KATL;IPIICPGRGBCiIG
vdlD/yciA-scyl-COA Thioescvraav SVLLFLLGITEIEPIRFDLFFERFINPERLSYPDIDIDIt~IA~GAERVIMfAILIItB3RWV
' KKIIDF45VtNlYYRNOEYPIKIGSVESTML10IKPV5FSCIDCNIYIfIFPDR7L!(Al'BnVIGfLalAt.SKVNNI
AKHIPDLNITL$KALCIpPDL1101.YDlD
AOIITFCITOtAKMAVKD9CR
' CLLIISLLORLALWACRNTE,SVCIrfAFVOJILrtFYAPAYItDENLICKAAVNRTWRTSLEVGMIIIPICI$KCSII
tIT'IpY9BtlVCS
AESAQVIDMALCLOGSIPNICViIAAGVIICGOpL
VIfVWAEI~tIYKOERRHITSAYF'l'FVAVNEDNOPIPVHOIVPE'1'PEDCRRYNFADARROARLVOIGJNDLLGLK
TLTSINTANSAIEKKIGpSGAMATLP3.ODATrFShLN0IRl11CI1~IC
SIaipELaIOILRPDLFEEIIANGALYRPGPIIDIIIPSFINRKIiGKEIIEYDHPLJQSILRI
TrGnwYOCOVMOIACALASxsLCECwLRRArnKKaFOOM~a~cxlcKRACOBIaIDPc W'IYIFDIMOtFAAYOFNKSHAAAYOLITYTTAYLKANIfPKIi~ILIALLTCDSDDI
CPn _ LIRI~QSlIGIPiLPPIIlNVSSNHFVATDEGIRFAMGAIKGL.R(H.IFSIVLERDIINDPYB
dnap-DNA Pol III Epsilon Chain KEIMSLLIfDTVITCLDCEH1CLWK>fDItIIEIMVRFTFDSVISSIEFLINPERWSAESSIRDFIORSDGKKVSKIIS
IESLIDACCFDCFDSNRDIid.ASVEPLYIJIIAKDIDI6AAffiV
ORVNHISNAMLRDOPKIAEVFPOIKAFFKDGDYIVCHSVGFDtpVfaOFlIERIGCfFLSKM'FITLCAMDRIO(EVPI
CLPKDIPTRSKKELL.~IFKELLGIYLTEHPI~'1YRDNGfRLSV
Y'tIIDTLRWCEYGDSPM'ISLESLJ1VHFNVPYOG~MRAHKZNEININIFKHLCKRFRTLEVLJ1GEFlNLPNGSWRT
VFIIDKVL'1'ICISSKAQf~FAVLRVSL10ID&YQ.PIIiPDNY6OQ
OLKQVLAKPIKMKYMPLGKHKGRCFSEIPIAYLOWASxIIDFDSDLLFSIRHEIKHRQKIiTOELLLLDRLIYAILVLD
xRSDSLRISCAWMNDLSIVNCtIIYtI:D0AF01lIKHQV0101SF
GFSpVNNPFMEL
TMSI'SGKETKAKGNKPNENCHTOALIIPVTLSLDLHB.LRIl5HLCILKKIVQKHPG~'1'LVL
VF'IIQONFRVASIISPDD11YFVCEDIEELAQELVTJ1DLPVRVITV
CPn_0656 737842 738018 No robust hanoloQ prasane in Gerstbsnk/EMBLCPn_0667 751097 750177 as of 11/7/98 THNFLLLPLSLFDILLTVEGFLCL':LYFASVORMPCEQKAVP(~1LYYYYIAAHSSLCLSVNo sobusc homoloa prestnt in Gent6snk/EitBI.
as of 11/7/98 ~gtKp NISi.LCICIOKRYFHKKLILYFAAPYASLfCGYFLGIDRVPCAOKIMRLMDNSSEVFSKSC
RlIRtKISGFSF1.01IFLRHVSPEOALALFPEYRDDKSIVELAFIPNTLtOiVRPSKEEPIIIOC
HII80DDiIWSLVt'IOOIVIJtI~IWrCSRaFRECtS.tJIAGK001DIVI0TLATt~'1TSRE
CPn _ SLApAt.At.IWIRAERVIK!%:OKIDCLIFASGNOIGTHFOQFQPIRtICTITWNNPWILpIIP
Y7aE tATPasa or Kinaael PMGRYRRVSNSSpETLLL~.'TELGpVLVPGAVLLLFCDYGAGXTEFVRGIVSGYLCDTIAERNAAVFPAOYSLORVRI
ILVIfIIIFGONFLIVRSSMVYVpVYKISLVSADNSVRVEYILBIVt EVA3PSFSII.ifiIYGt~R:PKRLCHYDLYRIDOKNOEYIFODAEEDDVLCIEWADRLPKPW~CGKSIpDL
In'INIYITHpI'N.IBREIIIEbt CPer_0668 751176 751162 CPr>_0658 739180 778155 CT547 hypocMCical Drottin CT578 hypocMCical Protein WRFVWSPRLIHIIFLLYVPLLLVLVSTOCMKPVSFEPFSGKLSIbRPEPONSAFiYISQ
KRVCi~ISGAVKQW.t.QFIGXQKKPELLATYLFYLDpALSLRPVVFVRDKIIFKTPEDAVGOEPLKIIfRIFRI(ALI
CFGIITHIIPPRDILRNOApYLIGVLYF'fQDIIPDt.iIDRAIASYtQL
RiL~CIwRETEIOISSEKPpVN~N1'KRIYICPF1GKVFACWVYANPODfIIYDwLSSCPDAiYSEFi.FOISIYAIAO
RPAOCKRKRICRLOCFPKIaIiADCpiILItIYDEILTAFPfI~.
PQNIODIQCCVRIKRFLVSEDPDVIKEYAVPPKEPIIK'fVFASAI1CKL!'HSLPPLLEDFIGAOAi.YSKAALLIVt OJ~.'1'F~ITxTI~ItILTLOFPLifILSSEAFVRLSEIYLQQAIOICPM7L
SSYLRPIITLEEVONOTKFOLESSFLSLLOWLV1D>KIAAFIESLA~I'APHYYISpWVDTQYLtIFJUQlJEP~xQHP
NNPLNEWSANVGiIMAFJIYARGLYATGRFYEKKKRAWI~tIY
YRTAITNIfpdTLLVA10G01IRLDRISKNTB
CPn_0659 77948? 779838 CPe1.0669 751110 752775 CrxA-Thloradoxin CTSIB hypothetical protein LOENNRDSNSIFREGKLHVICIISSENFDSFIASGLVLVDFFAE1VCGPCRIC.TPILHdaAIEYGSILPKICINMRLF
SI4TIYLFFSLUSSCCCIfSIWSPYNLSSLGKSti4IIFIA
ELPNVTIGKZCIIDOJSKPAE'1'YEVSSIP3LILFKDGNEVARVVCL.KDKEFLTNLINKHAPIKEDPHOOLCSALTY
ELSKRSFAISCR&SCACYTLKVELtIICIDI01I01r1'PAPI~ICDK
'FNNtFIVSNEGRLSLSAKWLIMID1~0EVLIDOCVARESVDlDFEPOGLTANANCF71GOQ
CPr!_0660 710737 739860 FDB18L1IKSARRILSIRLAr<1IA00VYYDLF
apo0-rRNA Mathylasa MRWIJICPDIPOMGttiCRTCVAt)OAE<.ILVRPLGFSLADItlYIfRAt?lD7fWDKLOLTWDCPn_0670 SIEGtXOVPEDOZFCLSTKGSASYTEFSLPSSGTYVFGSESKCLPItEILIOCYYItrtCLBIrsbW-sigma rs9ulaeory Laetor-hiscidina kinasa PMQQDIRSLNWTSVGIVLYEVVRQKTV)1LQKNPTVPRRLt~1RY171I'FFLCETV!'PAVLSB.tISMLDLIKIIJU
OKOSKCP08KLLJILEtJICEELLVN
IISYAYpGENSPCC'IAISCISH1IGOLtVYIKDHGPSFNPLAV5INI0EDLPLEORKLOGL
CPn_0661 711179 740717 GIFLAXSSYDEFLYARED1K~1IYNLiOl:1'IGpHS
miD-~P-type paptidyl-prolyl eis-Crens isanarasa tiSRCLKIKDRRRKMNRrtIJNLJLATVALALSVASCDVRSKDKDKDOGSLYEYKDlRtDINDICPei0671 ELSDNQKLSRTPGHLLARQLRKSCDSff'FDIAEVAKGLQAELVCXSAPLTETEYEEKWIEVC?550 hypothetical protein QKLVFEKKSKENLSt.AfKP'LNENSKNALWLNpPSKLQYKIIlfGIIGKJ1ISGKPSALLHYRITIN0RKY1'MSLDF
FEEFYHOSIIM~CI'SFPtCYLNIAEILSYPHCI'DANI'DFLC$OSD
KCSFINCQ'VFSSSL~'4~IEPILLPLCQ'l'IPGFAIGMpCIGIDOETRVLYIHPDWYC'1'AGOLNDFIIAEbKDIf LTLFNADFAIWLVPLLVOGOAVTRCYIAVSQGDGNYCPET01FGSOpYN
PPNSLLIFEINLIOASADEVJN1VPQECi'IQCEOSSLILFJ1LQLYLKDIKDI'ENALR&PRF'tNDN
CPt>'066Z 742938 7411'7 CPeL0672 757TZ3 755018 asps-Aaparcyl tRNA Synehacase dacPlpbpS)-D-Ala-D-Ala Caroxypepcidasa SKCtCYlIkYRTNRCNELTSNHIGENWta(iWVMRYRNiIGGWFIDLRDRPGITOIVCREDETLKSPMIKRPFFTYLCI
OPELtIORLDAVRSEWVLSVRGKVCPRLAGMENPNLATGNIEVEVJISFEyLSKSONLPFSIVIYPASM?KIATALFIL
KHYPrVLDTLIKVKODAIASITPOAKKOSGYRSPPIMLCIDCS
ADDHINYNEELRLEYRYL.Dl9tRCDIIEKLLCRNOVMtJLCRIiFMWIpGFTEIVTWLGILr~I'TLOWLREEFHALL
VCSANDMt~NLiINACCCSVEKFMDKLNFF
EEIOCTtn'N
PECARDYLVPSRIYPCKFYJ1LPQSPOLFKOLIZIVGCLDRYFQIATCFRDEDLRADROPEFf'NNPIIGLNtiPNNYI
7TRDLISIMRCALKEPPFRGVISTTSYKIGiITMJIGRPtNKL
AQIDIEMSFGOTODLLPIIEQLVATLFATQGIEIPLPIaIO'!l'YpEAKDSYCTWCPDLI1FDL.LPGSTYNYPPALD
GK'1CTTKTACKNLINAAEKNNRLLVTIATGYSCPVSDLYODVIAL.C
LKt.KOCRDYAKRSSFSIFLDQLaHOGI'IKCFCVPOCATMSRKOLOGYTEFVI(RYGAtGLVETVFNEPLLRXELVPP
SDCLOLEIANLCKLSCPLPECLYYDFYASEDREPLSVSPI7UiAD
WLKNQOGINASNIAKFMDEEVFHELPAYFDAKDODILLLIAAPESVANOSLDHLRRLIAKAFPIEOCDLLCHWVIYWEG
KKISSOPFYAPCRFERTIKPWKLYMKRVPTSYRTYNSITM
ERELYSIXipYNFVWtTDFPLFSLEOGKIVAEHHPfTAPLEEDIPLLE~DPIdVRSSSYOLL.tIiYFRIRKHRKYKNL
VI.NCSfEIASGSORIHNPDLOSOIETILKISPESIOEKfGFFIKAISFGTPPHtGIALGLD
RLVM/LTAAESIREVIAFPK1'OKASDIJ~BWAPSEINSSOWfELSIKVAFCPn_0677 755217 755167 CTSSZ hypothatieat protein r'.Pn_0667 711270 712901 3KS1'LGKAYHCFLKOVSLAWREE11W~IPHHWFILt1pF00FSGEQDRFCSFLFrITIROR
his::-Hiscadyi tRNA Synchatase VSFLVLpEKIATLK
Y.SNNFEARHIM11TLPKCVFDIFPYLADAKOLJdUITSWNSVEKAIHTVCMLYGFCEIRT
PIFFJLSEVFLNVCEESDWKKEVYSFLDRIOGRS!!1'LRPEGl'MWRSFLEFICASNRSONKCPn_0671 75669's 755577 Pt'lILPMFRYER00IVGRYROHHOFCVEAIfNRHPLRDAEVLaLi.WDFYSRVGi~161QI0LImu-RNA
Mettfylcranstatase NFLOCSETRFRYDKVLRAYLKCSNCELSALSQQRFS'fNVLRIt.DSKEPE00EIIR0APPIRGILYYI'NVPPRQNHA
YOLLKOLHTSAISEADRVSYYPKONR3LGSK~IOWLONIIFNIL
LGYV3DEDLKYFNEILOALRVLEIPYAINPRLVRGLDYYSDLVFFJ1'I'f'LFOEVSYALdGGRHRRLLETLILOSGf Otrl'PEALVAKVNOCVLENLDSYSALPWPVRYSISODWiFLVt~Y
r:RYDGLI':AFGGA:iLPACCFCVGLEMIOTLLAOKRIEPOFPHKLRLIPMEPDADpFCLEGEEpAEEIAKLWLTEAP
ITIRVtflDKI3I/KELOEKLEYPSSPCELPEALNFSKR11PLQST
W::QIILRRIiaPTEVDMSHKKVKCAL.KAASTEOVaPIr:LICERCLISG'OLVIKNMSLRKEFEAFRIK',FFEIOD
EHSORL'C1'IwLTDKOIVLDFCACAOGK3LIFA0KAKHWINDSRK711 FY1'KEEVEQRLLYEIONTFL L.p'f'AKHRLLRACARNFSIrILVL.RIfi$F.~.W
IVDAFC~FRRNPEHKWQFSKKLLLNY
YR WjKG ILIfI7ASAYVGPRCRLVY tlY:::L.LKEENEANVA'rMN.SIaaIKEVHRKTLPL4~VGKG
~a,W i.r..l ;.(1775 741557 OAFFT.~afIFL'Kt :>,. irdnl::r nan,Uwt Plarent in tanabank/ENBL .t:; ut 11/7/')8 I.WFJIfIAMKKLIALI,:tYLVPIKI;NTNKEHIIAHATVLYJU1RAKYNLFtYODVFPVFIEVtEP~.Fr_IIi.75 :S'h.sl n5r.7..d f::l~'1?:LVIIYEIbIV rTr.'l~ hypOthlittCal I'rr.t,lh '/PL:dIILOFUFS II:YYLHVf.EL:,tI!U:1't!
f t.\'. IdtKK).LL.fiAWl'Vllld'L1't'NYIiI'::V.'7 f' f I~r il.r.. n.lqry ')4'.'If,S
R(r/IHELF:M::At;;Y::l::::NIdJ,LfFLf'LII~t:Y.hJI'WdI:YHt.FFI'::FriIIKKAIVDKLL7A
~Jq~: Ia'xrr.:ldwa:ph,rr.e 'rr.rn::PnrtFK.~.LILFL:RRPVDKIVI'AAN1'.'/f.~.Yr:Y::Nh'::::WhUITIItrtSI::Iln~l'f f 4:fVlWIRLM
YMrNWI'KI'Y\'I'I'KIIIKRIEDIIEWKKKYK'IWIiIRIF't::Mf'fv:YIbYYFTRKGFTPAMPTLDA.:LVN
tIULTfLLFIaITAYL:~L::f.ftf.f.l~'f'Ira:KAylLKTL:'.f:K::'NLLRI.LIhLF.iL
f.\Idr?fIKn4W
:Itv::.TL1'F::Yr;I:KFV:rIMSOq.~.tIPRYf.TfAf!:Wtl'f:LTNIFFI:I~S::AEDPhTtIM..LL'.
D::f.ab'L'LiVt:Lt'aYl1'1:1'f~IY:KTA/r:l.WhF:PAf.A::1'I:U::KLALL:FL
::IYLYAIYIYI;:I.MWF'(XfiA.WI'h:AItLLTIIWfAK::EAr:75'aI::VW:.T::iINICG11LIPILTGF
AEVLRKVIVEKKLtN::K::IMn'1'FEEW:Iff'I::11!f/~Ml'AI.WDKNCtMf.IJlitYlId.tH'LhtDIr:
I IUY:XWIa:AMYVh:(Lifv:NGLVLINRLRDTFQ::II:f.PffAfJ!KMlYYNPHP.~.FWRhYt.tLilQtFf fEKYKRDPFIHAHfIEL'KSl4iE
' : I'I.:R I I :IU :L:."1'Rl: f (.F'M'V f:ITX~W1.WFLAAA::FF!'!
f VRMAVNDW :ALFL t E1'KIIYMVK
WO 00/27994 PC"TNS99/26923 75?:I39 758051 ..~LNLIV4IFRpVFF.~.NSRS4.
.JCNYLRL:.K:NFA::':.1'KER~..'KTt.::.~1:.'.'fCFASF:r l ~
ri FYTNTFPFLEEOYTPAVUr:VA.:RYtI',~.NNIvDL111'SHRLK:::E".':J1F':DE:F.TIYIIPFCC
,Pn_On tloalnlogou:: to CT695 ONELI0MK3PYI~3GFA'IRNt~Illl~fLTTEC~II>~~K
DRM'ISDPLEESAAEf7CD5DLEDRVSESATOVIETIADTGIPEATPSDG
' ' ' :, LQPI'TNRKG~IYRLG K 'I~CIST. E
EK SDPISRKLAAOHYPYSFC
I
, piSfi~OC
T1~pLM.iOLVDRVEYEARCSLLT114.ARIRKAVSOIW~IVKTKRNPKEO~IRSIGOIPCD.
LLMTRLPKETAEPPYIYAGITALASCR.iFFINVFLRLITLLRRONPEAPLDLCCI'OPISKSKOLYLKKOLPKR
PfAAVAPALILRSCCKWVATDAVOECLPLEVIEEACNYNJ1FSLEATTTVEEVSKRLSELL
o X71107 770147 Y.~.DKRIOCLANVRCITKIITSPYLCACOCVSWOM.KTYDLGRNY'EOVLACASOIDEFAD~Pn_069 7NiE:-r:liceet Hwnnrranzflroaa ft Y~.FNFALVNIfDtLYI;JM3DR.~.YfI/:DFtarteISEEHASEIrtJYDWtaILEVNLPILEEDYRr u r.rrv ;::1~-r 'EI Y:' .
~;:n'_1Y- i:e:Ifi':: ""i'.'~'' . E
.. ~.::ivY~r~s-:~ :. :.i.":. : ).
. Lr-=u.'.': :y, i=
HNANVtw"WEIKRRRCSLVKKIRVHOSt:LICLDDLEKLLNEGAt)FV3IPINSN41GC11pP
CPn_OD
LOOVAELVNRYOAYLAYDGAOCAPNLPIDVOLWOVDFYVFSSNKIYOP'TOIGYLYI~DL
No robust homolop Dresent in t:enel>Mk/ENBL
as of 11/7/98 RIAEGINPSGNRSPDDVWVpGAOCOSSSTOCfGiITNSEEGIWEM'1'STSQPQV1~4KAKQLLOOLPPVOC~DNVAIY
O~tPEYLPAPEIKFEACTPNIAGVLCLGAALDYI~GLSAKFIY
' WQIVRCFFLCKICSPDSSOCASGPAIIOSPSOPIIRITRPAPPPPI'IC071NJ1KRPATIIC~RIOGANPLOIGFLLD
IJECI71VIC:
DKEIALTTYtJiKELLEIPCVEIIGPSIEEPRCALICEtI
APOPPTAGSSSOSEOPTANSSEVAKLVSELKDAVNSIIAEB~CVL10NSOELOTKiII'QiC'1CHOCADPJWERWNVC
NVLRVSLCIYNDF~DIDOFILVLCDSLOKIRR
NRCPDYLWCYRVI11RAT.OpTYTLOS14.IELTSSTCPVPQAVTYAKDAVTO'lvRG11I1QiL
RVSDpGGWSOIDYTSDIARL 0690 772701 771176 CPn ENPKPCNDPONL~IpWISLGIOCPTLDPGESIONPLLT_ AGKN11'1'RDVNOIANESSRL ASC TransPOrter Nelnbraru Protein GSALDRVRENNPNENPRIWIALARCIGAJ1VHSNATSVRIANGSV4AGDlM.VSICfFSSIASOSPVOKAAEACYTQYS
KOPSSKL17LS5FS1iI0EtSGfPO
~,~~I~y~
RYNUITi'J1SELIKOIRSL71FBCILINGKYEPS4SOLPEWIVCCIDW1CSLSSF
NOCFOVN1WPLAfWAVCSEDRCWLYIPEID~'1'SDPIFVRNISFPTVSOHDVIFfTRIV
CPn ' _ ELFVCECADLTVIfNPCYSEtSDfLSWS
No robust homolop present in Centebank/ENBLVILCQRASAOIQISNDVDLENUCSSKTIVNGYt as of 11/7/95 ' ' KiINSVNPSG7JSKND4WITCANDOHPDVKLSCVISANL~SNRVTASCGROGLLARIKGVi SYIVCKKGNAESLVLVOSPRIL~IKJLSN
TIA'M1~11ICMffGIJLL.ESCOGPGWPDN
7CPFSRMSFFRSGAPRGSQQPSAPSACIVRSPLPOGDARAT<x3AGRNLIK10GY0PGlDM'011~IYSRQHIKSILY9 GNPLFF~CI'ISISSOGCLSDANQKHDTLLLSSLAAVSTIPRLEI
IPWPf7fIGAORSSGS1TLKP'TItPAPPPpKTOGTNAKRPATNCttGPAPOPPKTOCfEIAKM~6YKASNC~ATVOPL
DPOpIFYl9LSiKitl'EAFaQEKLIHGFL1~LVSDTFtaSST~.Et ATI~KCPAPOPPKCILKOPOOSGfSGKIOtVSWSDED
0679 763936 761735 CPn.0691 773167 7736C!
CPn _ CT691 hypothetical protein pyk-PhosphoQlyrerace Kinase "
CY!>nIfLTWDLSPEtICKVLVRVDFNVPMpDGKIL~IRIRS11NP'1'INYLLKKEW1VIWSfCaKILXiaCSVLV
RGLGSNLKIKfiLHASCE10VKILDOFNWIOPCTMNIIt;PNDA0K5 ' HLCRPKCpOFOEEYSLOPVVDVLECYLWtNVPLI1PDC11CEVAR0AYA0LSPGRVLLLfl:IL.DI
SSOEIALOEONLLSNLP~LRSMGWiCFONPPEIPTI~tKMFLRDAYNAIIRRN~10~
RFNIGEEtIPEKDPfPAAELSSYGDFYVNDAIGT
SIt~FNZ'LLSTVLI.TIfEYNII'1TDLFL~INIMOGFSGGERKRNEICONLVLEPEfIIVVLf~EP
r'f.
.EFLGRNLLTSPKRPFTAII~GAICISSKIGVIDSOLDVDALRLICRVLEKYRELIIPtSSLCIVTHIQPKLLi7LIRP
OWIG.LLDGIIVALIfiW
' It>cISLVEKS11LDLAREtVLKIAKSRNH'I'IVLPSDVKAAEM4SI~YSVISIDOOIPPIQ4GKRVAWR
SIlBIELfJIKSY0E1r1 FDIGPRTTEEFIRIINOSATVFWNGPVGVYVPPPDSGSIAIANAf~?SiPSAYMI00GD
AAAWALAGCSTKVSNYS1GCCASLEFLEQGFtpCIEVLSpSKSCPeL0692 771915 773161 11AC Transporter IOE!'G11GGKYEIOESVKVPLEEREDYPYC:IYI'PIESOGLTRCGSEE?IEEIAAL1~POP
ygol-Phosphate Pesmease IIDPRL011YRYwIODiJtEPANARI3tYGPIAY~IVYFSSPKOKKPLGRtiOIIDPIILDTFK
YSNLPLIIFVIi.CGFYTSWNIt.ANWANAVCPSVCSCVLTLRQAWIAAIFE!!'GALLirGKIGIPLDDOKRLLI~N~
iIfAV~.VFDSVSICTTFKE71LEKAG11IFCSLGGIIOmEFEILyKIt 7RVAGTIESSIVSVTNPNI715GDYMIf~IfAALW'fOVWC.OL71SFFOWWS1TNSIVGAVIYLGSWSHRfIfFFAAW
AAVFS00Sf11YVPKCVKCPIO)ISTYFRINNKF~YOQFOITLIVY
GFGLViGKGTIIYWNSVCIILISWILSPfEGCCVAYLIFSFIRRNIFYICiDPVWNVRYAEDOCIfASYLOCCTAPAYS
SNt,~rNMWELVANBHAVIRYSTVONYfYAGDKKTGI~IYNF
' PFLAALVZM1LG11MISGCVILKVSSTPWAVSCVLVCCLLSYIITFYIMrI'lCNCSYISOTK
VT1~LCAOYRSKISNSpVl110AAITWKYPSCILKGDESVCTcPYSIIJ1LTSGKIQAD1CI
PKRGSLTYRLKFJIO~iYCRKYLWERIFAYLOIIVACPNAFANOtiNDVANAIAPVAGVLRNLN9CKRTTSIVISIOGI
SSDESKNT!'RSLVSLCKIUIOfSSNYTOCDSIQ.IOKASGiII<TDP
pAYPASYTSYTLIRi~tA!'OGIGLVICL71IWGWRVICNCC1CITG.TPS1~!'SVGIIG&11LTKIWS'1STSSIEN
GTlSICLREDQLLYLRSRCLSPLI'AVSLVINGICRLIIEDLILVAO
IALASILCLPIS'1'1'INWGAVLCICLARGIMl'HIItIIKDIVGSWFITLPAGAI3SILFFEASKLLLIKLC9S11G
FALRALPN
CPeL0693 776393 773310 CPn TPR Repescs ID-Linked 0lt:NAC Traneterase 0681 765001 764358 hanolt~pl _ LRSTEItM.GEISNEGiUWIJIXElTrCSGI
C1'691 hypocMCieal protein AAL11YCY
NGIR~KSFTRSFRQVIIAXKAII~lp1'IJLALFGOSPFAPLOIIIQ.~IWSCVEriI<.PIfTALGIIALL1CRVSE~
WCSKGLASEPGDSYLRYCYGVJ1LDRONpYWIIEO~iIYVAWP
LRDORYECL.LQEAIQ.VSDKEYpA0CI10~i0laNtiLPA0LTIlPISRAGILEIISIODSIADTDOVECyIFSLGSV
YEGtLKRLQGLDCFDKILALDWlIIPOSLYNKAVILSEfWEAIiIRLL
' ' AF~DVAILLTIRRWTYPSIIfL.FFRFLtiOr?.iAFELI7t!LWEp'NOLLESSFOCRKADKAL0K
EVAVAIO~IPLYfrKAWhLLGFLLSRSKRWOKATEAYDtYWLRPDGSD011Yl~iCYL
!
Pr~r-SK;RVAKSENESDVLOREtHQIFFSI~FIIPEKEFYLWLOVIRRTJ1GISDS58adWRTRLALKAFOFALFTJtAEDA0 11lIFYVCLJIIILDLKOIOIGYEAINSALSDSL.C101ARnIa INNTLEEK
YLH1MQGETDIUITKELLFLpIfKDS'IPAPLIQKTWSDPSSNOFCRRIDTIS
CPc>_0682 761913 765955 CPrt..0691 779135 776330 dppD-ABC ATPase Dipepcide Transportpbp3-PBP3-transplyeolase/transpepcitiase TSKCWKNSLFPHIR1LPKRSCKRLNASNPILOIEDLSITLitK0R00YPIVOSLSFTINDGFSOESEAIOrINSNKRPI
Uf!'PIYESIAOkTNItLISCIVIAFAVIALRWYtJlWOIWKLE
TIC~tIGKTIJ1VNOWYDVSVAYCAIRDLPfRAWIIVDEiIOIKO
IPONPpJLSWPVfTIEQOFREIItLMLILTAEVAXEKIC.YALEEItGfNDPRLCWLYPNOLIPVRKNYINCLSELLSQ
F1JILDREAIF~AIHAKASVLGSVPYLVAANfIS6RTYLKLKMf.
1501?Q.pRICIAMALt.CSPl4LIADEPI'IALDVSVpYQILOLt.KTLOKICfOISLLIITICISXIJNPOLINGVV
RRNYPOESVASDILCYIICPISLOEYKRVTOEG80LRECYRAYE~
NGWAETADOVLVLYAGPNVECAPAVOMFMQPSNPY'1'RDLIaSRPSLQPOpLCSFNPIPCPKLPL9GLASIOOVMLLE
SVESNJ1YSWALVCKEKiVFJ4CWDSKL~ItICIOIPILV0N101i OPPHYTAFPSOCRYNPRCSKILNRCSAfAPEIYPVRGGNKV1~WLYDDFIOE~OAVPEAPatKIpL.TLSAEI4AYADA
LLLEYEKT6TFRS~IIKREKi.PPLPPW
IKIXiAI IJ1LDPNNGEILANASSPRYRDRJDFVNAKVAEDSKAVRSSIYRWIrWKIOIIAEIY
CPn_0683 765936 766919 DRKVPLIRERRNPLTCLCItEEILPLTFOCFLOFLFPENSVIKLOLKIWSFVO0AIt110NL
9ppF-AHC ATPase Dipepcide TransportVTRLLSLFPYEECfCPCSI1IFDAVFPNEEaHILIOEYISL0E0KWINECWDNKADItL
CVCCt~f!"1'NFPOPLIQATSLT>IYYKRSFWFpCKTIASRPVDDVSFSLYSRMVCLICESCKEJILOpVFN6LPANY
DXILY'TDILRLIVDPEItFSPVLPSEVNRLSLSEF1'6LOGRYWIR
SCKSfWLAtxLLPLTSGFLTFNG1'PIKLHSKIK;RtWLRSQVRLVFONPOASLNPRKTISAFSTILEDAFIEVHFKSW
RKSEFLOYLAAKROEEALRKORYP'TPYVOYLEKEKTRQAfKII
:.DSLGHSLLYHKLVPKEKVLATVREYLELVGLSEEYFYRYPHOLSGCOO~tVSIARALf&FCOEHLD'T!'U1YLFSR
TPYIfDGLEPY7fDILDLWINELONGAtIRALBWNEIIYLFLKiRVSN
VPOLIICDEIVSALDLSIOAQIIlMLAELQKKLSLTYLFISNDU1WRSFCTEVFINYKCLSENLPALFSTFREFNCLAR
PLLOKYPISIVRNKROTEODLAASFYPVYCYOYWIPEUIYG
OIVEKCNTICRIFSDPOHPYTRffLWAOLPCTPDORDS1(PI!'pEYEIKt~EESC51GCYPYNOMTLGSIFKLVSAYS
VLSORILWGHNECPANPLVIIDKNSP'CYRSSKPHVCFFKflO'!PI
RCPpKQEACKSEIIPDpG1HH'lYRCIH
PTFFROCSLPCNL~ffICRCFIDLVSALOISSNPYFSLLVGECLOOPEDWDAASLICPGEK
TGLGLPGEYAORVPHDLAYNRSCLYATAICOHTLWTPLOTAVMLASLVIA)OWYVPKLL
~Pn_0684 768056 767181 LCEWEGEHVSYLaSKKKRTIFNPDAWEYLKTCNRNVIWGOYGTAMiCSOFPPOLLBRI
spoJIparB-Chromosome Particioninp tCKT51'AESIMRVGLDREYG'MKNICDiWFMVCFSDODLSLPTIWIVYLRfGEIORGA
Protein ERSCDIVPEISKDTIIEVAIODIRVSPFOPRRVFSNEEWELIASIKIIVCLIHPWVItEPMAVKNIDMNEKI~~ORLSF
LRG
IGTGDRVLYYELIAGERRWRJIEIpLACATTIPVILKNVIAOC'l'JIAFATLIFlIIORVNWPI
ENAEAFKRLINVFCLTpOKVAYWCKKRe"LYANYLItLLALSKTIOESLLOGQITIGHAKVCPn.-Ofi95 7Rt1301 78178:
ILTLEDPILREKtlJEIIIOEHLAVRLAELI11KOLISEECSSIEL1CPTPLDNAESSKOHEEhonalagous to LOORLSDLCCYKVOIKTRCSKATVSFHWM~ODLOKLGWL.iSEK:I'LSESIa"SLEYSNKKLLKSaL4SMFIk:SVCS
LOALPWC.NPSDPSLLIDCt'IWECJ1AGDP'CDPCATW
CDAISLRAGFY'vC1'VFORILKVDAPKTFSMCAKPI~C.:AAANYTTAVDRPNPAYNKIILND71 ~:Ptl",0685 76800 768317 EWPTNAGFIAWIWDRFDVFGTLCASNf;IfIRCNSTAFNLVCLFt:SIKGTTVNANiLPNVSL
No robust haslolog Dresenc in Genebank/EMBLsNCWELYfDTsF3WSVr,Am;ALy~CATGCAEF0YA0SKPKVtEWVTCNVSQPfVNK
as of 11/7/99 FPOSOYLLIFPNRILDWAFEILWQCML?DQRKHIOMLNKHHSIEIFLSNNWEYKLFFPKCYKCVAFPLPTDAG11ATA1 CTKSATINYHEWpVCA3LSYRW.~.LVPYtCVpWSMTPD
KTLK AOEtTR
IAOPKLCTAVLNLT11WNPSLG3NATALSTI'D3FSDFMO
IVSt.'pINK/KSRKACGV
'IVr.ATLVDADKWSLTAFJ1RLINERAAIIVr',.CQFRF
.:rn_0586 758373 76817c Nr, roD,)sc homolog present in ~,enebank/ENBLCPn 0646 7RI707 7R~5'n) as or 11/7/98 :\KD.SMIPtX:RLFRVIOELFFFSa'LYVCEORRPRKL7P~WHWFPIEKPRFLLKCFKKELt.'f:'le hypnrhcwr:,l Iw"chm EIFYERhI N:Y:FYVPNI'LLTY :71FE I EV~):a.F~.p:
~ .'KI: P I KDL11::N:AI IE':I
lOrl'RIadNf'KNKLY IPEE
NtJtILY I I N(.AK'T'L.~~1WIIAt.111 I HKV IOfkIKTVLt1N TKK4AKt.'V
I RY,MIEM.EFPIAE
:'Pn r)b87 759501 '1W)3l1 RWU%71LTNM1'1'IRN::II!T1111CrF.KfH.::HfIVAYf:PKKEJ1ALLAY.ItIK~KLLNNLEITtRYEIK
'T4Nt IrYp~thutic,ll pruratl:
KAf~:LLVWOC::YEKIA'IAI;IKYI/:fIVI.ALVUTEIC:DIt'IIINIV11~'NUIr:I.K::fRLItN
ItKINYdtLNIIAYRF~I'PM.'fc:FN~KLVItNIWY.KfY~F.SAIAICIVL.\::FL::LKTV::N7YKVIYFNII
FJlKIIKI.:It:t'I::IVY.:a.EYIl4a:AII:::::UI~IdvarEarINE:ItI4.IJ1KKYIi:YJW
II::OAI!ffl:: t LLLTRMbYAV.'~1:FLPSK:
AL.i:iLEYAYNIIX:E'.a'EtKPYIU:FIJI::I:FY
IIIN
I:1LM:AYYN:LAYfQI:.VAWLDHPIqKLLKETSWJA00L'fDVALSK:fYOLImAN.~.:aCtYyrfi.r7 'r4 'a'.'1 nt t~.1'1 C
'il~rl::Fl.TLt.tNIELKELL1YJDV:.'y~DFMLKSSPLFHpFFRN'fl:OCEWTL:KRFY'*Kf:r::l-Klcxn7acv,m F'.n:rmt f::
WNr:YI JA::fiF::NHT1.K'TLI\trl':
Ir:f:l'Yr y, I:AL1JN t Y:Nf.E:IJ1WYLItKI~:LA::rItSKKPJ1R
'hyln.NN Ir.'117~ 171)1)7 ETYIfi:IIMKTINH.TALiIV'rPII~.'I'Id~/ANEIAVI'kIsEY::IILtlII)ILKYKVIII'Vli\I:%!Ad ."pqal trrl!Ir..r:.:.rl 4rntriu ::::rJUP::I.:'/DELHAYftYn'Vt:EJIII!I::VVAYI1KAH'fr'I"/r7f'/::IY:M:K'I'VAI:rNI:Y
:::::
SWIDIAFOIWAAOPOFL3xF-.~VPAFJtIAK! 4're68 nypocnec:~.at t. .m '~tOCxPOEJIEKIV'fCKLN': AFK'lVKRffCM IDP'JlCf PtILDI:DAEACv' TAD :"'=N:a: FG~:ELKKC:.'.?FALC'.::YAAPKD
. TTLVOCFKPNPNIOI~DOMIChLI9ft~.~~tt.E;~INN
fFOEAt:LLECPf IKNADLi IO~L:OOf.~.K':'.SI:a~VAIAOfAD~IIIDAVOViIDOiN>I;GCNIINIOIfOODARKCDI~
' "
' ':Pn 06os 79)44) 754201 t CL;.
.
ALLYPSDItDONRF~LANFL>r!'~YAVORA''ORAELFA:~
ri'L : SV33SIKTIlI
pyrN-tMP Ktnase EPNKNNAKOTRRVLFK:'~,GL.iKLS"~1RIDEMRISRLVSELRAVPNNDIEInILVIGQCN715207 ~~574~
CPn tLIiGIJtEGKELa7INRV.~.AttOMCMLATLINCMAVADUJIAE:.IPCL:.':~_ LSCPOIaDLI' 'f~e7 hYPOCne.:~.~t Gr~t~tn .
.
' PCK.~>tEILDOr:KIttr'~':A(t:PYL'ITDT".AALPJICELtNWLtYJITMNVOf:VYDKDPRL.
..
' :
' ...
...",: : . . .:1, '. :~ ;.
.
~
, ' : .... ~yr~;
. .
....,..:.i.FYi .
:rt.: ~:'F'C'.:=.i:r..'.':..:/~i.;.......:.~TIL::II:~fPn..iv.niv'i.~
' ~ .. .... .
: . . . ..
'..:i~?'.:':.:' :.::.~: .. . ...., ... ... :.
.._ ~:lll : '.': I I' OV IANCKIESTPdtifJS: LIaMLri' WAK.:AV:PtG:
CPn_0699 781179 794721 CPrL0710 794492 796210 rrf-Ribosome Relusin0 factor Cf66e hypochscaal Protein ODTEKKMAAALDffNKEVKSFR1GK1WPALVE'IVVVDVYCT~SDIASISVADRC>CKSMAT~TAfDfNIMLOCVCTYV
ItGV00YLTELZZTSTQGTVDtL:CMfNLOFRM
TMSIrt . RS
LRQLVISPYOGNNASAIAKCIIAAM.M4PEVP1GSIIRIKVPEPTADYROEMIKOLRRKCOILSpYIIESVSNILTAVN
TCIZT>4ARAVKCS
EFJ1KINVPNIRRE71NDKLKKDSALTEWVKON~IOELTDKPCKOLDELTKOKf~EIAS
CPn_0711 795791 796484 0700 795094 785609 .~
CP
.ZdFINYLyLCItYpSMfFM~IfItKEEKt4SOPL:.DLE00M00t1oR.10ELKJLSVODK
n_ ~
CT676 hypochee ical prou:n VHKLUtLLRF~SDKFSfCOOCSLL1GYVAtw~KVLCRIt4R10tI
PATICYTEII>K~LVIRSYVCATCPCPSHYYNNEHLSLSKGVCVLT' tJIVNSPTIIOCYFICQO .
LECfrI4CKTVWttSKODD Lf.GCFlQCYTNFKNOITSKLKSERWSSSTMEKCOGSLttIGR
OTLEREDYEOAAVIRDOINHLKT104~DPS CPn_0712 799)15 796781 At nylacs yelase) ~ co eds l .D _ E ogy APOEASNLFtpLLKLI FHA dolesin: haao I
~~
~ IP~
CPn_0701 795594 796672 EE~D
VYDFD
FISDEFD
~,pprIpIyyNGVAIOfT'fQLKNE~ILS
karG-ArQln:ne Kinass gOpLSpSNEpGKDLLPROTS6fIHISPKL.TKDOGSSDPTTSCDQFlJID~JIfIJISAKAE
MTLPFIflLLEfLVI(RKESPOANKVWPVTTFSLARMLSVSKFLPCLSI~Ofa.EIbOF
I
IAKVA>ULO~S~~PKEONiUtDSPKGEBRTNKPpNiIINEDNOASPRODPOPK
KPK IO~PI
p SAEPSWOJfARDCfPLIIB4KPVElxAFHfKATPDSPEKKDOPEDGS1~OGSKIFJ1TPLDSQ
ITSMfIAIIEGFGEFIVLPLKOTPLWOKIIPLLEtIFLLPYDLV~'iP~'.EtILW'fRSfDFIs~A
INfpDHLVLt~IDFOGNVEKTLDOLVOLDSYLHSKLSFAFSSEPOFUfTNPKNCCTGLIISK~~ETDSAADAFiDAtAS
DHTAEOtHtCfPItKV84lt%SA
JCFLHIPALLYSRTTNLIDEEVEIITSSLLLGVfGFPGNI~iRCSLGLTI'~i.L~SVLSPfHVODt.FRfDO'fIFPA
EIt~IAKl04I&VDL'PpPSRFLLIMaGAtIICII6FHLDBG%
fAITASKLSVAEVAAKKRLSEfl~WLIOdLILRSIGLLTHSCQLELKITLDALS~OIf'IIDKTSILg HDOGILIEDL~KNDVIYFX:RIt SNpIWCITVC
YIIGTDP!"fCDI
V
T
DLCLIKV'IENHPLhR4PLFW0IRRAHL71LOKQAt~SRDLOKm'ISHLRASVLKFLTI(GLHP_ _ _ _ _ !SF ' ~pV00LAILPGTdTASLtHI'K
OELM.AOVINOfPIYR9'ffN
~L~I~~'~~~~ ~ ~~Iii.SIOIt~ILO
CPt~0702 789700 786929 ISIOiSPEPGKFII1GYVKTEEOAACLVDYINIH
yscCJqspO-YOD C/Gen Secretion Procsin~F~~~~L '~~~~~~IPCVRLV104fAVLLPAtRCGIID
D
LNL.RYPNRYRM~fgRYOETSIMIWNCRILTRGDVTDCKM'SIOPMIIFLfJCD3LXYK
I.IQWPVKIVIINICRItILOGIKIUOt>utICILSGLFfLDLVLLCVSSORP2'EI'SANV~04L
'PAKpt RDEKLAACPKNSJU1SLSAXKSIftlOCT?PGSIPSKVFSKFD11T01~fFOKTSGSAFIIIYHII
TLXELEERKKPRPERRT'fADVKRSPRFLP'PDEVEPVPAASKDOLDSIQ~~
AVNAINLSIKKOLEED'fSTV'fEKD110PKT011TPHASKIQ4VASPSTSHPGIDfAATTVAVP
YSKISCfDIIYfDSNDLO CPe~071J 799!17 799132 .,.._..m........".,.rree.,.-r.,m~r~rtarrmtsILELIaFCT66J hypochscieal P>rocsin LDLItBEKAGPPNtft(SIPOGTKl'fTAAL>aNfSMLEICLIKNFATYMGITSTLELDiIDGAYV
LPISEVVIGVMQOFDIOFitIVLSAShGAL.PPSADTA1ILYL0l4~tTfrliLPCRL1COSAf.OLDS
EGNVVMVRRFSGt~EYRtIVL8Il~fSEISII.SDLCLGKO
MwA-Glucawyi cRt4A Rsdoccase mUaAt*BRERJvIOYLOSFEKNLFLA~ORPIG10~ATTPLL'f~tA
g.YIITSESPLTAGAULSf'~.TSaGIRPYRHRCLSCIIItLfOVTBCTDSLifOElEI000V
KMYIJIGSKZRCLffDWTLPQKJ1LKZI01ICYRSRICTPDI~V1'IESWOCILLSYDKiTIt 'i'I4LF4C3YSDIZiRILVIUIYLYpt~YHRITFCSROQYfAPYR?LfRLTt.SfI~PYDVTF1LC5 SESASQFSDL~l3LASIPKRIVFDINVPRTfLWKETPTGIVYLDTDfISHCWI01T/0Cf K9lCVIHWILLLTCAAKKOWCIYtINCSSHITORQISSPRIPSVISY
CPn,.070J 791205 789695 pkn5-S/T PrOCSlh Klnass RSRWHOLNP6TRItSTVIKVfSPSPSF CPt~0715 901436 80)162 RKIGFMOCROGIPLPEPOVIGCYMVXKILSKKL pyrH-ON71 GYrass &ubunic H
TSRSVYNFLKE710SLHOITHPNIVKFHRYGIIWOt7CLYIAMEIfIt7CISLREYILAOFISLPKFIIKISHMAAYTE
ASILSLASLDtiIRLAAGNYICRi.Q7CSOKEDOIYTLIKBWONCIDE
QAIDIIFDIAOALEHLHSRNILHKDIKPENILITPOGKIKLIDFGLADWtrfEIORAHPSVfIMfRIGKSLKISASOKQ
ISIQDOGPGIPL.~LIt7CVSKINTCAKY1'QDVFHFSVCIN~r C
IGTPYYNSPEQROGESHSPASDIYAt.Gi.LivYELILI~ISIGRVFLSLVPRISKILA%AL~~EIFSVRSVRKKKYFI
LtffFMRGVIAESXOGST1IDPDGTfYgPTPDPSIP~1'f OPSPN4RYSSTREFIODIHHYRFtSGDMOEDLRIKDHTVALYE0I4ZQR~POfNNDFLKDKI~~TSHNDLIIDLFDAEI
TtPPLYSPL<"fONEZM.T
FISCVLYHQCYPLYPNAYDTL.LOt7VtNGWGGYSPISt47tTIALSVVKSLVC00DLDRPLLFI!'SHLF%3~TfERY
fSfVNCOTGD~T~'TAFKEAIVKG1MEFFZiKTItSISNDIRI~IVCC
DRVCEINECLIRIOCIPIDEtGISILCLEISKENICLSWIJ1CCKTtfWIKROCRVWDFESIAIKLISPIfESOTKNKI
GNitOIRSSLTKDVKEAIVOALRKDKVAPIIKffiEKlR
FSPCIGKITSLOIRETKVAWEIGDFJ1VVCTLELEESVIISLiClLSIaEI4DRROKAIFCPIKNIOFIKODIJCSIfO
KKVIIYKIPKLADCIIFHYN~tSLYGEA55IfLTG'JCSASASIL7ISRti ESIHGCIOSRl,7HG5NSPSTi.ISLKRIR
PL'lOAVFSLRG1IPFTNfSLEI~.TIOtYICtDELFYi~TAIGI?ONEIOHLRYItKIfILiITDADV
I>GKIIRNLLITFPLKTLLPLVE~iLFILETPLFKVRNKT1TLYYYSEO
1'DK
CPCt_0704 792))0 791209 KDSSLEITRFKGIGEISPKEFAAFIGPEIRLTPVTITSLESISSIt.QtYNC~XOf fliN- Plapellar Motor Sw:ceh t)anar,n/YscOIlIDId.ITDf family RYfM11V1U1DSSAS1ILKSRNNFLSSLOKTEEpVMPCFPKEItOHKIREKFPLEDVOVSIK
FRGSITAVEATKEFGVHLLIOPMNOPWEVENLL.fLTSEFaEQEI?NAVFDDASI~SYFY
CPn 0716 80366 804902 EKDKLt.GFHYYFVAE71CKLFELOWVPgLSAKVOCDAIFTATSLOGSFOWDISLRLDGK9YrA-DNA Gyrass Submit A
C
NVRCRLLLPfDTFQSC0KFF5GLHDCSDLHNIDGtOQISLSIlE9CY50LT0EEWilOVVPFMRWSELFRTHfMNYASY
VILERAIPHILDCLKPVORRLLWFL.FI11DDOKMHKVIWIAC
SFIIa.DSCLYDPETEESGaLt.:'VOxHOfiroGRFLTPSSCEFKITSYPNLTHEDPPLPENPRTMALIiPHGDV1PI
YEaLWLINKGYLItri'OCNFCNPLTCDPNAAARYIfJvRLSPLARICfL
QASAAPLPCYSRLWEVARYSLAVSEFIKLNLCSILSIGNHPAYCVDIILDGAKVCRCEIFNTpLIAFHDSYDGREKCPD
ILPAKLPVLL4NGVDGIAVQftZ'KIFPHNfAELLKAOIAI
.~~I~ INIHCI(F7VFPDFPSwINDP8EYO0IfIGSITLRASIDI
- INDKTLWKOICPOS'C1'E1Z.IR
SItZIAAKRCTIKTDTIODfSTDVPNIEIKLPKCSMKE?1LPLLFEHTECOVILYSKPIVI
CPn_0705 79)176 792734 YENKWECSISEILKLIriTAt.OGYt.EKELLLL0E0LTt.OH'IHIfTLEYIFIKMKLYDBVRE
CT671 hypothetical procsin VLAINKKISAI~Uh'AVLH.1LEPWLHELATPVTKOOT'".~OLASLTIKKTLCFNEGCTKEL
' fSSKCMfRTES
tAIEKKOMIOKDL:RIKE\TVKYLKf:LLERHCHLGEP.K1'QITNFKYAKTSILIOOOTLI
FMELKKTAESLYSAKTf7HHZYYONSPEPRDSRDVKVFSLECKO'fRCEKT
RKFADEEKRVt~ELAEVGSKEEE0ES0EFCLAENAFAGMSLIDL~1AGSAEAWEYAPtA
VSSIDTQWIENIIt~~TVESMVISEINGEOLVELVLDASSSVPFrIfYGANLTLVOSGODLS
' CPn 0717 H0~~69 HOSJOti ' t CT556 hypochec a:a1 prccstn VKFSSFVDATONAE11ADLVTNNPSOLSSLVSALKGHOLTLKFStt6NLLVOLPKIEEVO
PLHNIA.r~TIRNREEKOORDONOKOKODIHfEODSYKIEEARLIRIKFIDTLTIWRMEPRHIYIRKPETPKAPDVEKP
I1IPEYMTMANTITFa:PVKTLOOL
RRALTEOROAEEDv'KFIYDNFIOSILISfFCLVHKt7ttDPJWKJ1SKRMRfVYKEQ
C~ 07Gb 79)689 79)180 CT570 hypothetical proca:n YJtVAK'JPLEPVLAIKKDRVDRAEKWi(EKRRLLEIEOEKLfIEKEAERDiIVIOJHYNOIII00C~ tt7tlt ptt5700 905626 ~
S.RDLLDELTfCDAYLOIKSYIKtIVAVQLSEEEE1NNKOKEWLA.VKEL.EKAEVNLAKRRol procetn CT5'7 hYfMfhect.
~.fTYFf.ALF\'MtINOEItFLC~IHCRWAPF
I N~PLYLTLIADHDTfY LrIKNLDKfPLP
RA'/M
KEEEKTRLHKEEiMKFtL.KEEARAEEKEODEI1COLLFOLROxKKRESCGS.
VEr~WFJCI~Jf.iTl.~..~.LLK I FL:.~.DLiSLItLL.IU.TKFE
I LT1JIDLYt.AON I
':Pn 07n7 'I'1501~ 7aJ7114 :'.Fn rfllr NP~477 HOewtn y:a:NYtttt N IFIJUnllatTyMt ATPJtel:ant: II:,rt.Irml I,linr ::ynrh.naI
'JNIIDULTTDFtfftNSOI~.DVNL'I'TVVCRfTE.YVCMLIKAVJPNVRV::6"ICLVKRN1~1EPLVKKVIKt::
FIKTYf::ffVC:Y.>?t:J:Rt.UK'fLTE'IltfYY,':RAYYV171IL:a:LVOINPQ
FGfYt 'IfEVW;t'fll::FAFt_:PG:EL::CI':;FSSEVIPTi:Lt'I.HIRACtK:Lt.:I:VLFY:GCEPIDVET.
C*S
Itffl'/A'fItIJX.'rat!\'.'IDtv'F:KF.EL1.ELLI'FJlfl't.l~Y'~IF.G:MIt.VINII1'RDMMIP
AWi Yr:fIJJtPJlt~rfln'IF'ItAFi'DCIAIRwKLR(?IL7T~:VRf:tCC:MLTV\I;.:ylt(CIFM:ACVV11A1 JJ11:1.:EItIJCKh:Ff'EF:1WHIv:IVIIftI.LYfIf:X:LIITAKTIt~tAK.YyftIELF~
IIF'IY:f1 .:LUY11AHNABF7Il)VMJIALA:ERtiREVREFIF1:OL:EfX:MYn::\'ItV:.'f::D(J:::iOIJtLN.
'L:KI'k::l'flIr191I::It11VN1fHKFMI"r::ll:YlJ1VTIIr't.WIJItIa:K4~.fV
::Y1 A'/n 'fYhI
YY
AIIYWrfntAl:'tFl.llr:KTwt?iMD:;YfI:FARALtitr:L.AM~Er1'\HV.:YTr:vf~aTLMtL.
' .
.
.
AL::I'hlt:lrftY~l.lt\'IIMKlllatft Ill:ltl'l/lft:lP:attl.~..':Yr:l.fiYtXNJIA'i::Vifr'1111'RTROF
(IIYfALOVLA 1I:IuMIt:'LLtYi:FtiNl:1'fIINKNLI.I:::ILYFi f.ER:Y:A:a*t:fl'fAFYTt'LVr4:f1h19'IEPVADL1't:.'.ILff:IIfVI::N,IIrWA'::1 YM:1 .a::f<L.LTAIVf1:19JRftIIv:KARfVLAiCYK.WI?1LIP.It:I:'fIId:::ItKfIttFAIDIiLOKINR.
.
( ~94 FI ~K(~L I tll:KTtI7FGIkV LHA I ttnn: sr.
FR ~'I~t, _ l.ai '!~
' ':Iw_t~l'W '/"n,ytft '/r5G14 In.m.,n u'1".'.', I,yl..tlvI n.~..l ,1 ..:.:.VAKPDI~:Y:: ~EGATIKAIR (i:FRTVPFLPY.:IL.7t::.::.c:., LR ITMKEF Irt'f : tKNL'JDRPEcJR . f HFFLRIIC:iLI:RnIIFPFRRF.:RL:Y
IKEIQCfttT:. ~..~: LF.~LLIE
~
' ' . ..
MINf:A:.LH::ENKP~I:LAPVWNII
.If(7411 A'/IJrVVLOY'/EEC:"fLM::.:.TM:LLPt TLL;f~VA::PNfIVRIIGLELMEEK W f PfYIMRHSDf~R
C'I!:~.tSfAG!AI~P~'.x>:WLT.'.iAitGII~IKFLwf'1 ~~LL
'.LTOIlt'..'...-.DI"IJ1RYVTIEIw~FLYL?tY.iLKIYOL'iT~~
APt~"i7GIL:5 PA
':Pn 07.',: 90771 999199 .
t oRCYOREDMERGf.KLMKFtfhTLTISVNL
LY'ACLL:.:rILPC:VRVf.YEHCLPpQSAVYlI:
KdsA-KDt7 :;vrsc netns~
VRVLRCY'SASIIPItALAPLV:SI:.FYAQROYAVPLF:~:',T.'A:rIN:V:.S:,VLCRWVLitDIfS
.1PYSDRIOWFFY.:.>~IDKANRSSL.v LEIIIGKLOS1:
:AG?C'JIEG' C:' KML:
MFPM
.
';YAT.iITAWJOL'lf:.ilYY~SKRLPM'SKLWE.iIRRSIKVIC'.'".'MIUICItITUiIJIILT
. ' / ' c ' . ' I
LRI:r\KVKETF':Vf::LT~VHTPODIIYAAAE1.~\IG','I?AFLCR~':~:.:.VA
:FR!ax:LT
W
. ApAIAF:..
. F.
. CIFI 1FLFT,FAKLLRVEDLINLaS!
'.',tE~"'.AtVNt.YY~F:..~.PNfIIt!:PTtIY'f!.:'f~.tlflY:;a.TF.P~'..~~r~,Wlf~;r:pMp SIpVL!YW
:T"'~YVIFLNFLTPf~ADt?:r.
~i~.
..
.
., .. ..:. . ...~.. I..-...: .I.. .:.,::
..:..~~::~:~ .... ......,;:''~:.\~:~yv . .-.....-..y.,t~- . _..
m. ........._. .r._..:': N V. :.:'.
. . ....:~ .'. F7 : . ~:. -. ~ n .~
a08971 :eneoenk:~BL as oC 11;7/98 0722 roWtst r7omotov present ln 909177 Nc CPn ' _ IPDNILKIRAKETSLSFLLIKPPSPPPLKf:DYLFDISpYTSSE
CT651 rlypochee7.ca1 protein VAIAISANIF/IRL,CK
"
YCLSM'KFLYCLFYSL;.LL~JIFGTNVAIIOVDOICDVSCIOaDIPOGPPFIJIIKI(VNVTLRLRSISIIS
LTICCSYFKLNKASLGS
SKQICSpEERF!'NCKIDKSCMELNPPOSSYSCKEYLTRISIRIIL?tINFtKQMpIRGNSGL
LNYODCSLHVYDCRFOVDPVP~YGSPDKEDSSSGGFGCTLYLSLPRN)2-CPn,0 nEoBhdonueluse tV
072) 908979 809703 NPl9IVL.PPPSIPLLCAHTSTACGL10JAIYECRDIWSTVpIPTANORpWDRRALJtEEVIE
CPn _ DPKA11LKETCLSYIIiSHIIGYLIHPC11PDPVILEKSRICIYQEiLOCITLr:STVNItIPGA
yhbC-ABC Tranaporte: ATPase ASNPILSVCNLVIOfYNKIIPVTNDVSFOINPGEIVGLLGPNGJIGKTfAFYLTVGLIRPDSGALKSSKEDClB~IKIV
SSFSOSAPLFDSSPPLWLLEZTAGQGTI,TGSNPEEiGYLVt~t.Ri KIIFKMIDVCKK!l~ItRARLGIGYLAOEPTIFKELTVODNLICILEIIYKARKpOSHLLNOIPIGVCVDfGHIPMGYO
TTSPOGWEDVLIJEID
TLWDLOL:.'S~~1.HKKACfLS~uGERRRLEIACVLUl4P5VLLLDEPPANVDPLVIQNVKYLHAPLDEGYIGKESFK
FW"DCRTRKIPKYLETPGCpENWOKEIGELI2IFS1QiR0S
IKZLAGRGIGiL:TDNNJUIE~IADRCYLIIDGKIFPEGSSSONISNPNVKOHYLGDSF
CPn_0777 927779 827101 rat-81 Ribosomal PrOteln .
CLKYMAAYCCPKNRVARRFCANIEGRSRNPWOCPNPPGp!lGMQRKKKSDYCLCLCiKOK
CPn _ LKACYGMINCItOLVKAFKEVIHKOGNVAqIIET.ZDtPECRLGBNYRI~ICPAK':IFAIIQpi.VA
No robust homoloQ present in Genebsnk/EMBL
as oC 11!7!98 RTSTRLOYRSGCII.SKILPFPFLWIQILLGFLCDCpCASWOCMVAIK~IDSVFMSRpEHKPtK:HILVNGRRVDRRSF
FLfIt~IDISLKEKSKRLOSVKDiu.ESKDESSLpSYISLDKTCPK
NIPYITKJ1TRRGLRNKTLAYLASLKDARQLAYD!'LKDPGSIaRWtALIAPKE71L.0Et4ILGELLVSPEODOIEAp LPLPINISVVCEPLSNRT
FFYGCSNIEDILEEIBIRPNRILLJGFSYCOKPKIw.pDGRfNDACRYDPSHPIrASCSTGT
MMRIlIARRYTZYIIpTFIDIAKHLHTLIOtRYPGYOILFAV'fACELSLKMFGDYAS11l8rLKCPf1~0771 82786) 821915 GVGIRL1GRICNfPKAFKLAERGVKpGVt'ILEEDCF~1LARTLTEYSSAPFPRDFCEINYeM
QNI'IIDtFSSNf~iF'LOCNYPODYVRVFI!!>DQfY7CALAYYYI'IRVDHPHEETALIiKIVLmL
OVSCRIYISEpGINGOPSCYEpHAELYMOart.KERPNPSKIKFKTNHIKOiTPPRT1YKYR
CPn ' _ KELiIILLGCt:VDLSKQAaIISPOEWIiEtG4FNRCLILDVRHHYEWRiCii!
CT552.: hypothetical protein ~TLPDIOlF
SCGWGMFFAPLLYESLRRGL?BiPTSNMpOQLARLEFINDOL'1~.'ELEHVNF3.tLSLCFPEREPPEYAEKLi40EC
DPCITWl4IYC'1'OCIRCELYSPVLLEIOGP'KIYYpL00DVIRY~OIf CLTTIlUIAEEVLSDDEPLLD
C'fCKWL.GKLFVFD~IPIDESDPDYAPIAECC!!<l~l'pSDAYYNCANIOQtfILPGIxDE
CItIQf~CCGEECSOSPRVRXFDSSRQJICPFRRANLCEISEN&ES~LI
CPn_0726 917381 810880 ' 0735 825680 825007 CPn ~: _ 620 hypothetical Drocsin 'Utidine Kinase IUridine ADIOrlIYSrSISTFYKKLSLVSSMHSFAORNRESLEHI11N1fEKTfA~tDTLKFtLTEVLDQItonoptlosphoki nasel IPyriddine RASERYRSAVEKLlIKYEVERATVAKSIPVAAItff7IPLSSTHASVO~Yt'ASTpMTGSGVCJ1Riboffuclsosad e KmasW
YYN71VK0KWAQDLIVELN'fVtff'1'INASVNSKNPANKDIRt>XLNZ'8LOALVAAG~.TEZNGEK!l2MlCJ9II
IGITOGSGACK'ITLTONIKEIFGEOVSVICODNYYKDRSIfl?PC10171N
YOTLYNFPEEIITAIQRACI'f?Gt;l9(TD!'iNOLAGKYG10ATLTCI'F1IDGRVEGPKDILTLIWWIPOAFONDL
LISDIKALIfCMEIVOAPVFOFVLCa'NRSKI'EIETIYPSKVILVOGILV
AVQCVLTPEOtI'IPAEIATELOAL71DN,~IPDEAGLORILDI1CEIG.itAVTNSSDLTIIFiDKPENpEL.RDt~T
RIFVDTDADERILRRMVRDV0E0CDSVDCItISRYL9lRIKPIBIBKpIEP
INFCOHITDLYSDpVAAIGSFD111LDIl4TYVNON00TllF8NLS5FYGSLTGTPJ1PIDLRSTRKYADITVIKafYR
pNVYINTLSOKIICIHLENALESOET1M11MSK
STWALNP'fATMDHVKAAILEEAKELDN8SF0L~1SSIKS11111'SIVNSSGSFSVTVNSS?L
QY1'IYSEKNGKVEINOILLNYGS1GPLPEITII1.A><CNAESTARSYFRPKALAAVESaiVO
NKIZtDt.OS0L00FTN9CTELFDGOLLSQASELRALPLPSAVASYL.IDRYMPlCE110YIHET
YKKLYYSNLCSSIGNSIIDAISOYVNGATYFNP'ASYVCOQPAVCAGGANAPPCSOESAOA
KLOQERKQaALYLOE'fRGALTVIEEORARVLKDDKIII~lE0RS1'ILDSLRNYEDNINSISG
STIQSDOQSFADMGONFOLDLQLOtI.TSMpQEWI'WATSL.QLI1J~QYLSLARSLTG
CPn_0727 81)559 816192 CT619 hypothetical protein KYYLFSMSTFSIQNRLRTISGESTRI TKLGOfYSCFDPRSVPJ1INLEELNSCIYALRilI1f NALOSENTNVMLIlIpNNfTFp1'TSWTCfIfIWSRPOJISSpRAPSSOTP?DIVSAAARALVL
VIDGGIJIEi.VASVTEIDLGUSTISTVROLJ~IASYLCL:TLTAEQEKWfSSSYVPSEIOJL
LEHVKpENAAEIOAKQEEI1WVLEJ1KGVSTEEIEAILKEYPDIYM~'PKCFIEEPLHTYCPef-.0777 A27ti69 8)0756 RAKVCAPIOEITIENAIOLLPTPPAITPDNVNEVNGIQPI'LSTILOAIDDAIKOAPALOCDOreeC-Cxodeouyribonuclsase V. Ganm EIITILOTLVPLVDK1TPTKAEPDLIYTATOLpIfI'ASLKLYLTDROIAEYRCKTTXVYQNKRSAKLPASGASKAKGR
AKKKLTDERIFAPSVRVLPbNRIOrAKRNLYKLSFITYRKCV1IP
SIONLSETIIRVVENNRSIa.L'fpLSMFOQMnICFV'IWISOMIAI1~IIAITNKYISAVLTtSMSAiNDFPLTGIVI
RfATKNCRASPSNSpIWLLiWLAEDLTSTIIpKPPTIDBiILVJWRT10H
EMYOGLLCLSYMYERLIIDDEKAIFDKSVNEyLPIHIV1AGGSWVNatIAKHAAYQELAEYSWIID~IOLVHVLSDHIF
MGSTIFTASDSIVKHLPLGSGCSOPNIPDYLTLPL.LINNILiEIS
CG1'AVTSOOOLKAYCpTRGNEFKATRNPFHNICDQMY0P71NE'NFCNCLTI'ANG11I0PDLKASKPENGREPLSPP
TYEZTItiCIJtAAFKOPNTFSORPt10r7frSNY0EL1'pILESNPS1YEE
GGFiREAIfrNVCTVEADYVSNAORILNEFNCMTAHVLOL0L0IAELOKKADDLDPGKASMFTfILTBJIt?pEEDCSL
HIFCYJWLPKHIJ1EFPINLS1'YPPVYFYCFSPCIIEYIGDIJSD
F'fENRIIFAVMWITSESLGDALISMII1JSOLPKOEJIFLICPLIECINPFB~ttJIANALNSt:LORAIDFPWNOLP
DSPI%NAWF71YVLSDRQALIJ1M.IWKSOSS~FFLDREIOYOQ?LPiK
:TNEFSTTSVYYSLSSYLVOSK'LGpNLFAGDYYETLLAAAREREYIYRDTARCKOAINLVHDSSLGVIONSILDLKP1 'SPODFSOTKCtICIYRALNIPREVOVPCKV'L'ELIJIRDVfPE
NGLLOKINSLPGATSAOKOEMLNATTYYpYSLSVTLNOLTVLESLLAGLKMfipf:'SNNKEIFILSSNIESYKVNLNA
IFNPHVPIYFTDEVDPRAEDLPIdKKIGLL&SIt.~i'QDDGNYIL
YDKSVFKIESFDDWIPTL1ALESFLTSGFPNISATGGI7GPLP11?VOSI>QOTYTSOCQ1G0OLLTHPOLOOPIDQNK
VPYLIKJfLSSEWGKISSKDRASGpQMKAL&DLILECYPPIIOBCx LNLIAlpMT1'I00EWfLVSTShIQVLNGIISOL.AGAIYSNRVSOVEVWKiTVPLIYFIQERINLYLSSSOHSYEDLF
~1VPSCLEKIFVLSPCII'SPI't't' ~
LRNSLPPTPl~$SCSLLFFTDFCLDFLLHFHKPSPLYDKpGPYICSLSSLSLIP1OCYIf!'I
CPn_0728 818187 816525 Lu'ANK1TSSDIFDLIJJRTT'fIIEELAFSSTEDEtTrFHPLQILVSTKttEWISYISSMQPN
HLPN 76kDa Homoloq tGT6221 LPSPfGHNIKE'ILDLPVE1'LPTOPYLSAFFKNKACLHTSOEYNYSLANJIlY8KKALLP8L
VFMVNPICPCPIDETERTPPADLSAQCLEASAANKSAEJIpRIAGAEAKPKSKTDSVEAWFIPTVKOyNLpOHCSLNEI
IKCIFSPLDLFLKTNYNLRISYPENLKK00KLPt~1'IDpIED
ILRSAVNAIJtSLAGChCL.Ia'SNSSSSTSRSADVDSTTATAPTPPPPTFDDYKTOAQTAYt1MECPVDKEHDLLF.' ISPHAEELFTYYREKTILLRNCLDKDPKtISPYTVrPS8&I166R
DTIfT,~.TCtrIDIpAALVSLVDAVTNIKD1'MTDEETAIMEWITKNaDAVKVCAOITELAPYNE..~YLFPPISLSF
~\:NPVOIHGTLHCVCNFl'.nLYLCSIDPRDSLKK'!'fRTIGSLPLTSS
KYI1SDNOALLDSLGKLTGFDLL.pAALt.OSVANIMI(AAELLKEf~ONPWPCKTPAIAOSLEOKOLLERYVAL.AVL
~'M>roHL:SDSALIKLTSFtrIKOMHPPPSOP~YLRKVLlIIYNLN
VI7t?TDJ1TA?CIEKDCNAIRDAYPAGONASGAVENAKSNNSISNID9AKMIATAKTOIAESSOPIPLLSPL.CWKTL
L'DEEKFHOAVL..iAISEF.AIfHPSLPIFWOPHNRNIECILiIVCAS
AOKKFPDSPILOEAEOMVIQAEKDLKNIKPAL7GSDVpNpGITVCCSKOpCSSICSIRVSMERLKILaLFRGPCE.\t' LLDDAENEfAS ILJds'GFR(.'N I HNPNTENPDSOAAQGELAAOARMI(AAGDDSAAAALJ1DA
QKALFJ\AtirK.IGOOOGILNAL.GQLAs'AAWSAGVPPAMSSTCSSVKOLYKT3KS1'GSDYCPn 0778 910719 97)B95 KTOISAGYDAYKSiNDAYC1URNDATRDVt68~IVSTPJ1LTRSVPRJ1RTEAt6CPEKTDpALArecB-Exodsoxvrlt,'onuclense V. Beta' KVt:.CNSRTCGDVYSOVSALOStM0II0SNP0ANNEEIROKLTGAVTKPPOFt,YPYVOt.SKFYLFCEYM~CPFNIF
DSNSSIOr'IIFF4EASJV~CCKTF'fIEpIVLRALI~:SL111Vls1AL
MU,STCKFTAKLESLFAECSRTAAEIKALSFTNSLFIOOVLVNLCSLYSCYI.OAITf127ASTNELKVRtKDNLAOTL
RELKAVLNrOpASL?"l"ILOINCNVKOIYNpVR1i11LA
TL.DOM.iLFTIHGFCNFS'LEpYPPffTRLLI1KNPALTNSOL'ILHNI'MYLKODLaIKNVL/QE
:an_u72w 81.1905 x19591 OFNLIJ1VRYNIT::KIf."::3LVDKLLA.TtTOPICCIF.':GRVfRLEOIw(JrMOQIYNSL.LdIP
i:NLPN 7tk0.7 fk~mnltm ICTI:':17 KpVFLDOL'~WI:.CFIIKOPF::ILGGLHfIFVDL.LYTSETIf.'LFSFFKIAETPNPKIIRWI
f'AWi.~.V:a'f.NIDTKDCMKKtWY0WCr11:WL.LALTL::rYAELtL:iPwKVKSH'I"!'f1'LDEVKYNff.' MFfIfLENlI:W1'ERTLISF'CfILGftIt'NTLL%DL.VEYL.IIQNYTMnR.iiPDESVPALiKL
D'f L::KRf:fYETItKt)DCVLR IACIAIRARWLYFRED1.: :.~.FJI4PVl'(? ALJIF.~ Ys?f.VL
l CtIP:a)KDI!'fNPLFV NR'llt.~.EFYL'fI DEf rflrDK(/;~1.~.I F:itILF
I81'KF7~fLICDPKQ9IY
I
IffnAEPNWf..~.::KNNWfAIMX:ENT',W:VDINRAFLC'fRFYKtIFt'fKTDFFMEff:R::uLCDf~IR::AD
LI'f1'1.TAK:;::F::EWIyL'rL'rteNR."~fYIJIEAIIIpIFGIIL::pFLEIPtIYLPILY
a.iE::EVVfV:.NF'Li:LJItYWTREL:KD1'1'YrJVIVII(T:PFV111JMTKK111'A4A7VFJ:ILNRLPKII
AtNI~)::::LTFENf'TII\t'tIIFFF'frfIKOrJAI.'rIIF.~.EALPWIf0DKt1'LW11VVLVGDSH
art'tlKt.'::WLVM'PhYt~t'f:TfT:KAA'fNAMK'lK'i.:'lY7trWt.l/r:YIC;~VtNtNYJi(KPLILY
nJAFELI::'fATtMI::F::KNK:aFIt:.TE'fllIl.TT:.t.LEAIWIPENYEKILIKLIi::.~.LFf;<..:L
:AFIJ411I
\KA'PKTPf.Ir:KI~Nt.AWI'll%'PIrI:LRCM:IM::A'I"/It1'!:1'VF:~1L:VPEIDV:I:17t'VI'fK
K(77FTfYF'v::iJC:'ft.:IIIY:LIrYfF"/4'IMP!(Y:IffLF::::MH7M..IF'QDIEKLCUY
7r:Rt;NLI.1:FWF'A~yIIMM'Pf'KFrINr:FTN'fKt:F::AL'!M'h:fTf.::l.::rWYi:A'I::KPANnK
c.l!Pl::::'ft'YN~~LWII.I:NF~.:Rh:rilh:::l:LAt::::'/::I:()LETLY,LTTfIL:::IGa.EYD
IVA:P.
Ir:::dFfP'7KFl7It:1 f::Ai' I fK::KKNK.~.::::ELlJit7IYVM-PP
AYYVI.YLh I:: LOPf::IJJR::::AL9'NYVKLP%'.La:BIIYD
LA I I IIJIUFJ IPDLP 91'::l.l'K
UIK :IIL'rt"f I Jil.lLLkTFAt.Y'lTPPK'f t YaY:::.TKPt.LDlIIK
~.'Im oW n t:t:w ::l'.r'n.r IR:U::It>Y:.KLt'1:'Ky\~LF'~:I:KT:fLfIIYII.h::a01'::IJIf/rEYtM:
'CINHPIKHTIILOP
nrviN Irrr.Ir.rl MIYNI7t.rIN:
1I.r..7rrVEPTILKU~:A"fFF::t'LTF::::(~fF.':1.:?ryLIiIYIFttET.':FLFLFaI)h:IJ~IIr:
VIDLPPEHE
.:n:P'Kmr:I~Jtr:IM::RKIdIFI'::I
r:KYYIIf7WK'P::FU:IVfN::DY::K::111.::IYIKrrF:YI.D'frl:l1'NKAVItKFWL?FKIIKICNEt.
\It::fFNII:Y:fI~w::hf'h:ll'Itf:I
VLt'1'Ytr:ADfIVMfW
:VIF(Yt;;::T'JIa4GFFAL::::EDIPNFNPKd:'J!.LCCDICpIIR-HTH Tr.(c: :on.tl Rl~uaco~r !H e':~:ean . RecelVer Daman ':~ n~fv al19I2 af79rii KITDFILRTNSYIIGFCTNh~DK;TS.p4~'P,.DL,5L8~
IrtSQI~~;'~/I F
tI
~PEEDI:
PDTF
' ' xfrnrrca~ E
a: Dr~cein SII
'T Ir;w t E
QKVLJIpGR
lCYCL
ESVAIPCEYG4LPE0lF6P
ryf OAVIRAFLRQNEt~tENSIPD~ffTfG011TFRVLNLVIESPEGS~~I:.TPSE71GILICKi.LINR
. ~
':KVLFKLIL"f:LRNKI!TK:,~..'t:IIALCII-iFRSLFQFIYOKIRS3FVSLNVKFFPKIKO
AP.~.:dlt.rWLEi.ENLI~Y.ER'!.V~LCEKLKLYEV3NN1'PPLFPEiLTPYFHKLVEGKWYRDaLRNKLGPYGS
KIWI14.'IfCYLFSDDCSIP
GHIGLRKNLLAEIKCKfKE:IARNVDVHRi 'fTlB.ISSSCwr/NV.KTIi~IKKN.iPVL,SCNVLIf::L'IDYVCEItOSRIRLITDVCMKPSWAtIRIANtiONT
ANPNEE
~.D I7.':W41IKI1.".LREL I PQ11EO
IaHAY TLEKDN': K I SOLOELD3LI~EGFNQALLRCIL. ..
.
~~
~
.. N,:;~~: . . ... :.::IcTr,:.l:: , .
..:aF-f;:...,.,. r-=:'.,';,as=. .;,,-~ . . . ~ y ....
.
.
:':;.::.::. ,., .;: ..::::a::.:::.:..:..::.trc.lzw;...., ,.
-..:: ...;,...."., MFRCILFG1FL:.T:Fsa.;~:::. Y YLft::HOf:i:GPKEKaRSVW
I EEEKsFTDfVLNIILPSO
0740 97b054 874861 HONLHILCFOCFLTrOK,70KFS01EKIFSK4Y0E7vODCpfLFKEEiLtSRLINSFFLKTD
Pn _ VlIETILCLLNORCPNSPYYHLFNALVCYKOKLYA>:lfTE0ir1Y4pEEKTRALAPLtI4ISIE
cyrB-Aranacae M Nnanocranstsrase SYNSFFNNIPTFSPOiIILGLCNVPfADKRPEIfVNLVICVYENPQKRY0GL5CIA1UQTVIOLLTDFLLDYISANSLI
EOKNFPOGRVILNRNINRLIJWECEWNAKTYDRIAILLSIIfYf EEEQ14XSYLPISGa'.QIFLDCIRCLVFCiJIVDPSAIVGFQSt~'l'G71IJII.GARLLSV11KGSLELV8SK511 DIYFOYYENVLfYLKKIYIGEpCPYAELLPLEELVSLINENVfILPI~C.Y
GKVYVPEpIWSHtIIRIFSOECLEVIRYPYYSKDQKpLLPEPGIAfLKEYE1C45VILLHGCPLIQLLtiIYlQKHYVN
PNSSLWOILYDRfSTNNFGJ1IRFCE71LVSFSGLEG.IKaQIIZTF
CHNPICVDFfE~IWKEIJ1ILHK8RF3.IPFFDI'AYQGFAHGIELORKPIEIFISDCNTVLVfEii.&NIfVGOIICI
EEAKOC'/A:.LHILDPS:SISEKL.rILSSD':LQNIVSCDO00lfrKLTINY
AASSSKNFALYOLRVCiFAVHSTF1'DELVICIHSFLEfltIRCEYSSPORWCVEIVSTILSNLDLWF~1IOSYDI~tC
OLVHNLVYGAKDLwKKGrh'DEKrI.N'..;.;L\:.RFTSYDIDClSWF
PYLKEE110SELNFIRESLGKNATRNW14RKVACNTFDFLLSOIIGFFAYPCfSDKQVLFLLFIKOAYKOALSSH11IA
RLLKLKFISEANIPSIVISFaEKANFL1D11CYLF11NlIDYOKC
REpHAVYI'IAGGRJBJLNCITEKMIONVVOSFIQAYELYLYSHWLTKVAPSPQSYRLAGLC
LMENKRYDEALEFLCNLSPNDSIh'DYIITOKAiaPICQK
NQSKDRAAS
CPn_0741 838387 976185 CPr>
greA-Transcription Elongation factor_ ItIFRLK'1GI::.fCYLEKIpb~IEEGOSANFLSLWECYCFNOWI4GRELVEILEKVKSSSL'recD-bcodeoxyribonuelease V, alpha-ASLFGRIVD'IWPLwETCIPEGKDKDRVLQLILDLCfSNSONFFDIATEYVNKKYSGEENF4WALlffEFAPFLEDLVN
QQVISPLDIAFASKNISSDFEESFVFLWSSAIiiRYORpfiSL
NEALRWCLRDGRDFQFSLSRFDPi~ODBIKGNlVFlIQOGWGVG)l9lCtfSFLQOKVLIEFEEPI~IRIRPSLGCIS!
'IDLYRGPIOJLPKtARDKLFVWSCRLYLRSLYTIRS1ILLDKLBLLC
GINSAKDISFE1'AFKSLTPLSGDHFLSRRFGOPDGFE7IfAKENPItINILLROLCPKTASATPNYfPPSIDSSILSE
EDNFIFMCITOCCFSIVSCfiPGIGKTFLAAQLILSLVKQpPK
KEIKDELVDLVIPEAOWNRwNOSAIITKIKKGTRIISPONPKEPYVLSDiII'sCStPIGOLERKLRIAIVSPTG101T
SNIRQItJOfYNIFDDNVIlK)'fVNHFLQEYAYRRYNSIDVLLVD~1 G:LSLNSAEKISLIYHFIRDLJiSI?.IG~tIEIRIISLVKALODLDN60(rTdCSLILORELLLSEVTPOLLYSLVpT
irpCYEKDK1Q.YTSSLIILGDTtIpLPPIGICVCNPLQDLIGYFNFXfFF
YL.GIKDASI~CEYITSLSEDDTSRLLEI~BIPIV71LQKSFLSLVRKYSSFWQQVFlI7ILLYTLKTSNRAKTCVVOO
LTOSVGRGOlISFSPLPSISSAIEVLIQJRFVKSLROSGttICVt.TP
TSPII9tDFVYKTIlQ4DPSSVEVLKKRLLDSAHQPIttIFPELFVwFFLKIGrBtEDCLFDPEDMRIICPNfVLM~rI
ltINORWiSDPDLRIPIMVTSRYETWGLFNCDZCLi.CLKTQNLIIfPO
KEVLRLfLESJILNfNYOVASI'PNKELGKKLHHYLVCORYLi(VAOllIOGIISLPPLKELLLLNEPIDSitALSQYV
nIYVI4SVNItSQCSEYDEVIVIIPKGSEVFCVSILYTAITMKIfRVSV
STKCPOFSSSDLNVLpSL7IEVWPTL10WKSNVEEFiiVLwSTSESFSRIOIAKT4SLVGKEwGDpEfLNKItKtISNf NVONJ1KEIEDJ1RSIGDLRENSFIKFALEKRARLpEEIRVLSEBINRARIL?IIDLVF'fINN
GVCCKVTLKGD~AGEVVEYTILGPwDADPDSCIISLOSKLILQr8QGK10.tiDWItQCKEY1CCPaL0753 IgRIQSIWEEl~1 No robust homolag prssenc in Gensbalak/ENBL
as of 11/Yf98 IM71TAHLf.RQALLNLRSfr1'PAIR7~1LFRQQSNSLI8~8iVLF7IGDIVCAIKNSTAISRIU1 CPIt_0712 938442 878888 LGSSHYANAALQKTDGFLCMOGVNI'J1VJ10ANLWI'.OLItJGS!(IfETDEE'DGCLRRC~J1D
CT635 hypothetical protein AE(x?lfOu.TITGINARIdSKi'IGTATFLNEi4tiWSLGrWJWItIQC>M'SCLNI.
TKNMVIVI4VSIISAQKIIDSIKGILTIYNIDFDPSIV~SSLSSDSDADYEYLITKTOEKIQEVATOCSLTESSISLYA
ILSTRPITISDpENPNKPSAEFAARSt(AIWiIIPIAwt.GOWOLV
LDKRApEIL:~SlSXIFAMiPDNFSPEEWL71LEKVRSSCDEYRKETENLINEITLCDAI4TLSLFLPAIT~VLIMAI1 GLISCVINFVIfDYJIKIG
DLHpTKESKRPlCptaSSTKKNKIfKNWIPL
CPrL0754 851781 851040 CPn_0743 838956 840761 rs20-520 Ribosomal Protein ~nqrA-Obiquin~fe Oxidorsductase.
OFILId.XVLVLSCDIIUIPKRPH1001VI0RRPSAEItRILTJ10KRELINNSFICBKVKTIVIOt Alpha-IFMKITVNRGLDLSLQGSPKESGFYNKIDPEFVSIDLRPFOPLSLKLKVO<XsDAVCSG71PFFJ1SLKLDD'1'QATL
SNi.OSVYSWDKAVKRCIfKZxtKAARIKSKATLXYN~MB
IAEYKIitPNrYITSNVSLW1'AIRRGt80t5LLDYIIKKTPGPTSTEYIYDIaTLRRSOLS
EIFKtNGLFALIKQRPFDIPAIP2'OfPROVFINL~PPTPSPOIHLALFSSRHEGFYVCPrL0755 851579 P1IVCVRJ1IANLFGLRPHIVFRDRLTLPTpELKTIAHLNTVSGPFPSGSPSININSVAPITCT618 hypoehscieal Drocsin NEKtWFTLSFQDVLTIGNLFLRGRILttEQVTALaGTALiLSSLRRYVITTXGASFSSLINYKDLPFIa.LLVRKWGN1 'CfKYWIYFLPWi'LLLPLV'CYPFLSISOKIYC1IFVFITIif~1 LNDISD27D'fLISGDPLiGRIG%KEPFLGFRDHSISVLHNPTKRELFSFIRICFNKPTFfFALMRC~IOLIITMVGLL
QTKIRKLTENNDGLRQIRESL1CEI~Q$SJIQIQIB
ETRPIII1TDIYDKVNPIBIIPVVPLIID1VIT10~ffDT.ALFIQ.OGLLVKT1C0OOKLETLLIJtRTL~IRCLIDI
pVpSLIOECGEKTCEVpliSBtIQ.ALT
NEt~GFLEVCCEDFALP?LIDPSKTE~Il.TIVKESLIEY111tESGILTPNQDLAY001lIl4DEYQA1'FSDORNIQ
.DKROIYIGKLENKVQDLMIEIRNLLQLCSDSAIUIIBQ
CSN7IYLGtiISLQLSSELKItIAFKAtZtIEAASSLTJ1SRYLHTDTSVHNYSLEICROLFDBLR
CPn_0744 811387 840389 EEI~.FVYAROSORAVFANALFKTKn:YCaEDFLKFGSOIVISGGKQwIICOt~lBll1E
ham8-POrphobilinogen Synchase CSGRLVIKTKSRGNLPFRYCLNALIDfCPLCYIM4vLYPLHKEVLOS
ENSSLTLSRRPARNR1(T1N1IRDLLaETHLSPKDLL1PFFVKIICYBJIKEEIPSLPGVFRWS
LT7LLLKEIERLCTYGLMVNLFPIIPYGSYSSNPIOdILCIISIHEIKNAFPNLCLCPI1~0756 855889 ISDIALDPY177K;HDGIFLNGEVWDESVRIFCNI11TLHA~GADIVApSONI~mGRIGYIrpoD-RN71 Polymerise 8igmw66 RSKLDOSGYSKTSIMSYSVRYASCLYSPFRDALSSNVI'SCDKKQYQt~IPIQM.EALLfSSISYLPLTKLSSKARNPL
VLFWRIQ.FIQlf(SISQJ1TEYSSEEESOKKLEELVALiIKEpGFI
LDEIDGADIIlIVXPIYGLYLOVIYRIRONTCLPLdriYOV5GEY11NILSAfQOGwLDKETLFTYEFINEILPNSFGC
PEpIDOVLIFLT at'IIDIOVI14QIDVERQKEKKKF~1KELEGL.ARRTE
NESLIAIKRAGADHI
ISYSAPFILET.LIiOGFEFCTPDDPVtINYLKE~iTVPt.LTREEEVEISKRIEIGOVQIERI
ISCKFRFDKIISEKEVFDK1'HFLKLLPKLITLLKEEDTYLCJLLLa'LKOpDLSKOBRJfiG.
CPn_0745 941903 841742 NDSLEKCAIRTQAYLRCFHCR1WVTEDFCEWFKAYDSFLHLEQpINDLKVR718RNKFA71 No robust h~colog present in Genebenk/Et~LAKLMAKRKLYKREVAACRTLEEFKKOVRMLpRWNDKSOEAKItEMVESNLRLVISIAKKY
as of 1117198 VDSCFDI%4RJ1.SSLOGS i":
YtTfIIYDPKHTLaYGFCNOVSVIfItFHLKPPIISOEKFL72tROLSFLDLIO~'BIGTJIKJ1VEKFEYRRCYICPS
FYIITWwIROAVTRJ1IAD0ARTIRIPV
HNIETINKVLPGAKKLtMETCICEPTPEEL1EELGLTPORVREIYKIaQIipISLOAIVCEG
CPn_0746 ' 841979 813567 SFSSFGDFLEDTAVESpAFrITGYSNLIIDKt4KEVLK'TLTORERFVLIHRFCLLDGKPkTLE
~'632 hypochecacai protein EVCSAENVTRERIRQIEAKaLRKIWHPIRSKQLRAFLDLLEEEKZGTSKVKSLKSK
FSGRCPFSFEVFMLGKEe'EF':CKQKOCLSHFVTNLTSDVFALKNLPEWI(GALFSKYSRS
VL,:LRALGLKEFLSNEEDCDVCDFr~YDFETOVQK.IADFYQRVLDNFGDDSVCECDGAtILACPn_0757 MENVSILAAKVLEDARICGSPLEKSTRYVYFDpKVROEYLYYRDPILMTSAFKDMfLCtCfolK-Dihydroneopceran AldOlase DFLFDTYSALIPOVRJ1YFEKLYPKDSKTPASAYdTSLRNfVLDCIRGLLPAATLTNLGFFPCIKNIALVIAIERYOLI
IaKFRIIWLFIGCSVEERHFlIOPVLISVfFSYNEVPSACLSDK
~TIGRFwQNLIHKLOGHNL1ELRRTGDESLTELlIKVIPSFVSRAEPHHHHHQAMtOYRMLLSDACCYLEVTSLIEEIN
yTKPYJ1LIENLANELFDSLVISFGDKASKIOLEVEKERpPVP
KEOLKGLAEpATFSEE~1SSSPSVOLVYGDPDGIYINJUVGFLFPYSNRSLTOL:DYCK%NPNLLNPIKFTISKELGPS
PVLSA
HEDLVQILES3VSARENRRNKSPRGLECVEFCFDI:.ADFCAYRDLORHRTLTOERQLLST
IiNCYNFPVELLDTPMEKSYREAMERAMETYNEri'0FPEEAOYINPMAYNIRwFFHVNARCPn_0758 555101 95645a ALOWICELRSQPQCHOM'RTIATGLVREWKFNPKtELFFKFVDYSOIDLGRt.NOENRKEtolPfdhpS-Dihydropcaroace Synifuse PIT
RANSEPRFVCLSLC3NIfiNRFKNLOIARTLIGEOAVLGLRSSVILETE,IItLPGSPPaiD
LPYFNSVLVCETfL:LRELLVTIKOTENWCRAEESPPwSPRTIDVDILLYCDFSPCCDN
~Pn_074'i R17o49 841057 TEITIPLSNLLSRPFLIAGIASLCPYRRFCfOCSPYHNFTFGEIrLIHLPSPPCMIRRSLS
:T6J1 hypothetical procain PDMLNL:WNVTNDCMSOCGMFLDPEKAVAOAEKLFTECMVLDFGApATHPKVIOpFLSV
RTCMCCKCAEVOILSSRSLSCMKILSGSLFYKKFCDptDiERLEPVLRLLKETWSNRKOYPIISLDTFYPEIILR141D
IYPiQWINWSOCS08NA
EVARDCEL.iLVNIaIS.i.~nLPS'DPItNILSF3VPIGEOLLSWC:EKOUWFSDVGLNANDI/iFD
:Pn_p74R R44 Pa6 944121 PGIGtGKGMQSLATLYEIAKFKRLGCPILIGHSRKSFL3LFf;IJIIDPKDROWEIVCLSIL
tripA-.ar~nyt Transcranafsrase t.OpOCVDYLRVHNVAAHQK,1LSVAACFJICAPt r:TLfIJfALCI"f RP~ L ESA I EKALECFCP
ICNP IRSP'/EYALQL~CKRLRfCLVCNNAQCL
I:LNiIDVMOS.1LAVEFVIIT~'TL:aDDLPCNONDDERPr:RP7IMKAFDFATALLNran X1759 aSriA i4 :156~n :YALIPA
AY!:IILRLNAICKLKEQGCDFREIDIAYNIICDITDKIIICCGGtIi.CCOYDDMFfrNRGOEHVfolA-DihydrofrrLacr R,xlr,.;carh _yIMIKK'1t:SLFELW.'I::(:GILFf.~,;Dl'OFAPTIT..'.F.~.NNFr:LLFOIKDDF3DLQKD.SQOILLV
KPVIIPGNFF-NfU:VGI~KIIW:VPr;I'!M.'DI'ttf;JIr:LEGKLf'WIiYIEDLOFF:ETIOK
:INlALL.fC;EIG\ALCLI.aR~~MJI-.LELLDRLSA.i.:LYt7S.~.EFETIIC.~.IGFFPIVNGP.KTWETLFPKYFI'LIRA'/'.NF.~.IIRKRI~:VH
r:EIWVT::LfEFLU.JSIL
:PTFLIf7G
r:EL'I::LFLENOiVPGFFI::It!YYElA':VfFFt tt:LLCTWTKTVI.TtL~f~W ITTt.-YYEMIIIR
'Ir _117A" I~.r.IN Ntr.0(Ir. VM'Klli::L
rllmll NUf-.;Ir:lIM: lrrruly~rntu~rYLil;r Vc:YM-lfA::::IF::1'F:Dff:fl~t:II::KAlIY'fWOIt.hLtItIJI4LF.NiiVF::c:llIt7TVF_'x.YfLKNr :(r:47.:n ... . .,,')r.r.f (EKIEfAEI'AYVi::'r:AS'1\'r:ft'.IG:::V'rEVRllt,1'!Li!:NVI'Ivl::R('W(:HCPELKNSYLG
t.'Pr.ll hYfN'rnr,r.m.RVrna..y HIrrKAAllli\'ilr:G::Vt::::1:VNG:At:VR(.'ANFRLLY:PtII'Nft:'r::UK.~.KKIO?r:P.RKLG
AFRFK:PKLCLEIPKP::r~IVTHRIT'I"IKTIYfYf'YUDLI:aLG:::Lf'KLNRf!::IWI'f::KIV~
~ r :Kr:VA IrWNVtI f Hiv;~'I( I n:4T:AWELEKV::YLELtKylilLA'f V I-/F:Kf Lf'IITR I Rh.'UV I ; I'lL'fY.K4a:IL I f~.: A. a lut:::llVlY:YYVLY
INUFLL.':VNTII:fAIIJiNFYlii.f:IN.%:f I I::Ir.:lft'ffLHRt;TNf;G:I~'WtJr:l'!'i'I:INYH:Kf ~'Iw- /'.n n:.tr..lr.: H.tn.'In"
D'.'lriRALKM'Y::NLf.UI:I.:AAA'/t.t'W:1:;fU:trPlIAIIEFJIFKITFIC:::fTrL.~NIM:'fLA
f Af:IYCDL'fr:Pt.LO::MAWETI'AI't".:
1~~
NU1KRE9TLWVHE:LLPK: rLCKLPAPYP4,:IKCfAECL::F11E.:.:FPAIE1KAVA
3DR.TAKCIP1'AR
APLDIFPLKHLFPR.fl10D53HSKt7~IVLOWIR(,'IifATEC7TPL,'.C:rJI
~Pn p7wl ss57.tH 858775 , RTVAKYMOLKILtAIIKPKItDtI,:I~i~t~tPROA
:TnlO mn.'.'rnrattc.)1 procmn n . ~ . ~~ ~I
.Ilrf:ldfELLDKOIEDOHHLKHEFYORWS~'.lfLEl(COIQAYAKDYYUtIKAFPCYLSALH972100 d'016J
ARCDDLOIRROILt39LH0EFJwCHPNHIDLWRpFALSLGVSEEEiJINHCPSOA~ATFCPn-0772 RRI.CDNPULA:::LGALYTfEIOIPOVCVEKIRGLKEYFGItSAIIGYAYl7IIHOGDIKNASuvr0-ONA
Haficasa NLGLLIfI'CISELtJfa:RKAtffAPI?1PVLVLV;.1~K'iAVt : fRILHLIPJpCIAPREIIrI
''eEKDILpT;.~.~RL~1IPDAVL.QG,~,QE~fLDfLLdiFLaSFINSfEPCSCKVf!'ITpfAARELtfEP:'.'N
pr:Jl..'-'~NEFDVPHVCTFHSL..'VFTLRRSINLWR~'INPTIYDOS
..., '::Jf~..'- .- -.;-~.~KK:.
. "
~
. ,.. ~ :I:...':.\;.:'I".~'; ~:'c..; , TIIIA'::".~
: a":.:-:K~y~
v .r .
, ;... c ...... ,.., ;:r.. . .;:
NVFASA:DPCOSLY::wnrJAMLHtlttlfFGYDYIMAtCVL:LEFNYf4iYCNItl'IAANAI,IIWNA
:. t:r "
~.iw'~IIETKRSIYl4'ILPDRKK1LF~1AVAYIEKO~GS~(~((S
~ELLYFSRFSEKWJY
. fIIKUtDICIPYfCINSQS
ALGIHGVPKGRVIEIFCPESSGKTL'LATHIVAWWIO~fiVAAYSRLOfEIRSVKGPGEKIRLFI&SfDREEaIDtYAA
EILOGiRVv ATHEISTIK'IGALSLDL ' . GGLSFYKRKEIOOILIFIJtIFISKSDIVAF0R11MLPKRGIGS
LDAF3GLDPSYASLICVNIDDLIIISOPDCGEDALSIAELLaRSCIIVWIViDSVAALVPKRTFCDA4LRRRIPYE.a 3ELECDIGCVMVGWARI91SOALRKLTATLSRSOTCAVFINDIRERIGVS!'GNPIT~fCG't'fIFJILlCYAIAOGL
PILKACOOALDTKDVKLSKKOOEGi.OEYU~I.FPOImHIYtfILSLR
' RAUCFYSSIRLDIRRIGSIIICSDNSDIGNRIKVKVAIDOCLIWPFRIAEFDILFNOGISSAfNLFlfFLDDIXXu DPIG4VVRI1GYLEiLKEDAOTFKDRKSNLEELYHKALESECONPK
' CILDLAYEYNIIEKKGS1JFNYQEKKLGOCRE!'IREELKRNRIa.FEEI00tIYDVIAANKIJOSGfOCLEFR'rSF
VCLEEDLLPHANSIGGTYENIEEEIIIILCYV
SOODIiQLT'!1 CITRAODLLYLTAAQVRSLWCTVRlBIKPSRFLKEIPKDYNIQVR
TPSVNANlTPOEVPApIYEA
0763 860520 859972 CPeL0773 872185 871195 CPn _ unQ-Uracil DN11 ~lycosylasl yyfA-FOrsyleecrahydrofolace GyeloliQUe NFPKfDPKIEKSALRKLPISIRRDLSEERID1EASS11VASFVRSFSKtSWL.SPYSFI~BtCItlIONATIDDLWS51 CECLPLC1~IREpLKEEWSKPYMpQLLIFLKOEYKEHTVYPEC~xVFS
ONOFaNRILIOKGTLALPKIDQ~Ii.YPVLIPSIDDLISYVHPImPFSKOTPISSDEITI(VALRSZPFDQVRWILGDD
PYPGRGQAi~LSFSVPECpRLPPSLINIFRELKTDGGIiIVHIi ' :.VPGU1FDQOGYRLCYGHGPYDRWLAQHPYPSIRTIGIC1~QKIDRLPOESHDIPL$QIJ1AA
GCLOS1;IANQGILLLNIIIi.TVRJtGEPFSHAfiKGWELF?DAIV'ffa.IDFItTHIItViJ~li RKKCELLFNSKHOHAVLSSPHPSPLAAItRGFFGGSHFSKINYLilIlCI1'IKKPHIM~i.P
0761 A61819 860521 CPn_0771 871183 873125 .
CPn _ Cf606.1 hypochecual protein CT618 hypothetical Drotein ' GYKShmIKKLFCLFLCSSLIANSPiYGICICDYEIQ.TLTCINIIDRNCLSEIICSKE1U.KXITOOLPSAECMPSVAN
LFJ1DFLAAFaLL
LFJ1P101DCIHSVCFQKTPRLTAKSWSME<Qi.
YTKVDFLiIpQPYOKVHRNV1011tRGDNVSCLTAYR1NOQIKOYLECLI~BrBfAYGRYRt~IIiIMADIREIACCLE
OSLATLVPSE
CNIKIpAEVIGGIADLHPSAESGWLFDOTTFA'sILFJIAIVYE14GLLOGSSVYYIfIN
01~5IRYSED$EEWLIIAtEEY(( 0775 871010 873111 ~CF CPq tiY
' Q -fYTSSGKLI yODY family GNIWICECPYHKGVPQGKFL
FGRLL%AEYLDPOTNEIYATINEGNGIOAIYCKYAVILTRATYRGEPYCKYfRFaISCI'0 IVQTYNLtAGAKI1GEEFFFYP1'fiKPI~LII~WII~ILNDIVKIWYPCXaTLESQi~.VlllKER11BIIYIASSHC
YKIRE1'RTFLNRLGDPDIFSLSDFPDY1CLPQEOGDSITANitL'II~IIi KSGLLTZYYPflCQINATELYDNDLLIKGEYFNPCDRHPYSKI~CCIAVFFSSAG1'ITICAAMQ.GCWVIADOL>Q.R
WJ1LNGLPGPLSANFACVGJIYDImItRKXf.LD3J(SSLGRLVDRS
KIPYQDCKPLLN
AYFEGCYVLVSPNGEIFK1'YOICDGYISNOF3fGSSGP~CYDPIFVKYDYKpTFAFi.B~
. NDVSHRAKAWKLi1P14.OSLFGOILLTRD
CPt>_,0765 862415 861801 CPtL0776 871180 875187 CT617 hypothetical protein , TfIYIKLLGRLIItfi'tISILILSFLSLtSILPVLAITSNHVKISORWSDWSOILTLKVIRCT605 hypothetical prosein DHELDVI%fOIARISImRNNLSIF.~~LI3ASCKDLRPISRFRDtI.l~OfiliSNSLL.71QSI~VWERFIFVL1DIP
YDCLIJ~'FOFLSf7fBOfIFYSP1ZLSCIFPYVCCA0Nt1i0LDRIFS~EY1R
tAALEKSNHOLVWNCE0tfi01DFAFVIit.E0AT0~'fEDIESLFSLFNPt7IPVAPLVFFLCWCIOf~CIALI&HBA
AINSI~ODJ1LSVFYSRK(tDCfVEILCTLF31CYYCiITPfiTVWIDPS
' IQ11'KplTPl.GNEVWLTHAEAISRWI YM~IpIVKA80tY~LI
RYRE.RSySLYCVKEVPffEVAINCDVFVYDVpDIGVRSYSP
' ' YAPN11LNWIPIB(G
i~11'PGELALFFIOII
VLDRPNPIGGRIVOCPLPNPI'fSCSLIIPYCYC
~B~fl'P'DLIGLItrhPTSPOhPDPOSPFFriNITGILCAL81fJ1SIG1f6YTLPPKVIGAP111 CPn _ OGONADCl~l4DCIPNLFLPFFYEPFPCKYttI~~'1CSCVLLVLODPKIFYWETOCZI91C
CT616 hypothetical protein AMIFKLPVYNICLTKAFldJI'IKIAILQKTCIO.~ftIPDGRtI'tSLP)QiYFA71PT1'FVLKALYPKQVEOTLKS
IERIPARIISSICHGPGGDEFLSISHKERYIVIIPWtLCKESRES
SLpCSDILVKSSSSSL101R10iILKVALTHLF~ISLiILPWESLIVOPOIGKPIDRaITPLTLFHOLRS~LLSEY~
WIAqQI'fLKKELSFLS0110IFPDfQ.SCRAADIFFL7100SPLK5LPAYLLIYf7GSEEYCCI
Fv10W01IAVlIRSFStaISTIdfSCmIHATWYIQETPPOlYi.PAINVAOISPM.~(ILEOKCP1L0777 875586 LSLPLWCOS!!1'YGVEDEDWEIYGDfIAAAW~J15RRPLTfPYDATSVSPAA910Rit~tSQroEC~2-Mat aback Drocein-60 SLLIGKYALJ~171TVWSIGSVLKLKSLSSSASNHFAF71GPEP1CVL.PRSLKAALKTVKAIC%TS
TKAVfPAICPROYNWIKIO~iAPIVLT1~RI
IOiSABNYPLLPTIPTSEp'1'LKFLUILGIiSSPSIRFSYF8YIf11'SYPSI~I1PSLPYSALVEAKEIIf.ODAI' 6SLDVKGKFaLLRWE01GDCS1TALWIDtILITpGL10GI1tADt.DPQEI
VIOCOOQP~IPQFLKICISSNPIa.QIiVSFSLED0R5f~L0!'ftSSKAGILLSVDNYQOLOROAIQ.OS
ATVISD71D(JODIIPS
SIm9GISRTROI~IfKSGYLSDYFVTRPLTImVVWEEALVLIL4IIBl.VSLfE~.IRYL6 LIB'111lPLVIIAEDFD~'1VLATLIIHKLRNCLPVCAVKAPGSRE4110VVL.BOLAIL'1G
CPrr ' ATLICQEBB~ICEIPVSLDVLCRVIWVMITtILTPTFLECGCDAEIIQAR~.CIJ1IARST
CTblS hypothetical Drotein NIfC.SYLi.RTAINVYSFLIL71YIFASWVPDCOSARWYQLVSXCVDPFIIiFFRRE1IPRIGFSESiCOELLGILAI
FIGSIPOVDITADlOTEpROIQFOLPSALMTKA71l8mCIVi'~OV
IDPSPFVGLLCLGILPFVILRVLRFIILiIIFNSPWLLQYLAF4RAANItIEVPANiSSCrtITPGFEPLL011VRTPL
KVLaQNCGRSSEEVIHTILSHC7PRF
OYIKiIn'DZ'FEDLVDAGICDPLIV1TSSLKCAVSVSCLLLTSSFFISSRTKT
CPn_0768 861114 865161 CPtL
yohI/nir3-predicted oxidoreduetase, YFSFSHAAPIFI10fILLRSSIVYAPLJ1GFSDYPYRCHSALYOPGI~1FCFJMCVECILYAPtsa/ahpC-Thio-specific Mcioxidanc fTSA1 Peroxidase ERTSICLLDYI~f7~PIGAQLCGSNPETSCFJ1AXILEGiGFDLIMti00CPlWCITKDCSGAPVApSORVPGYEPGCO
RFESSLVRfB~IXRVEEEVPNILSLVGKFJ1POPVAOIUMrGCICT
SGLLKTPEi.IGRILDKIINSVSIPV1'VKIRStZiOHEHItAtE~'VRIIRDAGASAVFVFICRYSLImYLCKYVVLF
f~fPKDFTYVCPTFLTIAPODAIL'aEFH'fRGAEYIGCSV~IJITIIOOWL
TRAQGYHGPSKOEYISRANAAACKEFPVfaiGDIFSPEAAOAIQ.TTGCOCVLVMGTIGJ1ATIO~IECITYPLLSD~1 CVISRSYHVLKPEEELSFRGVFLIDKDCIIRHLtMrDLP
PWICKOIDDYL'1"fGSYEKIPFIKRKAAFLENtQtLVEDYYOSCfIfFLSSfRKL.OGNYLISALGRSIEEa.RTLDA
LIFFEfNCLVCPANWIIEGLRANAPNEECLQ~P4TID
AKVRFLRSSLAKATSYpEYYOLVNDYEFJ1DDSSLEIF~tKG
CPn_0779 8'!8502 878095 =Pn_0769 867763 865121 . CT602 hypothetical protein _opA,DNA Topoisomerase I-Fused RFDLIPOIOCPNALFGEiEKGSYDTAYFCRSLVDLHNYLCDVSSPCI'IL71IKTLLSDYNV
to S1'II Domain SIOGPIfJIIRIJOfKSLIIVESPAKIKTWKLIGSEFVFASSICHIVDLPAKEFGIDVDHDFVYIRVREDGYCVDSYFF
GLHF'LNiQZ'rLKNIIAICLPCVGtIpHIIFJ1SRSLCOKWrSLLL
EPQYQVLPDKpEVINHIRKLAAKCEKVYLSPDPpREGGIAWHIANOLPDSPLIORVSFNFFDlIDLYDLLTFNOPF
AITIWA'JTEALfHPRTIDMALVNAOQMRLLDRIVCYKISPILSRKLODRSGISAGRVOS
'JaLXLWDREKAIDAFVPVEYWNLRVI?IQDPK'l'!K'I~IAHLYAVOGKkWaCEIPECKTENCPfL0780 DVLLINSEEIWtHYAELLEKSSY1'ITRVEAKAKRRFAPPPFITSTLpOGSRtIFRFSWpap0/ami8-N-ACecYlmuramoyl-L-Ala SR Ilmidase TN3IAQTLYECVDLDSEDS'hCLITYMRTDSVRVDPEALTTVREYIOCTFGKEYLPEKANIIHGNKIAVOSLRFMiAKL
SFFILLSLLFSGIDCSP.LtIAAGRSPSLOCYtaEIEDISAKUS
YTTKIOffpDJWEAIRPTDINLTPDKLKNKISDOQFKVYNLIWKRFVASOITPAIYD1'LAVHEVtIVHLSERLDEODS
KCOKWTAAKPEfIJvOKIRELESGOKAWKTLJ1VI9TSVKDtpi OITTCYEIDLRASCSLLKF1(GFLAWEEKODDF3'IDpEEDIiPLPPGHApWILIKEb11S0E0NWSKWEIOKDHRALW
OLRLVRRSLLJ1LVDS=SPGAYADFSDPVPD7IYIVRGGDSLS
:,FTI!PLPRFTEASLVKELEKSCICRPSTYATIMJKIOSREYITKt?JORLRP'l'ELGKII50KIAKKYKLSV1'EL
KKINKLDSDAIYAGpRLCWPNKQ
r LETNFPR INDIGFTAIIIEDELELIADNKKPWKLLLOEF1VL'fFLPWITAEKFJ1VI
PRI L
TNIECSKCHKCKLVKIWSKNSYFYGCSEYPECDYRTSEEELAFNKEDYAED'fPWDSPCPLCPn_0781 879851 x79199 '.,t:VMCVRtICRYGTFLGCEKYPECRCTISINKKGEEIEpEEPIPCPAIGCNCKIFKKRSApat-Pepcidoplycan-Associated Lipoprotein YNY.IFYSCSE1PECSVIGNSIDAVITKYSGTtXIPYKKKTPrIO(KSSAK1TKMRTPSKKQNCYRSRRKTVPLLG:FP
SATDIfFlIT!?IIHSLWY3.C'fLLALLALPACBLSPNYOWEDSCN
r,Y.AKSSVKKSSEKKTGPLFLPSPDLJ1KMICNEPVSRGPJ1TKKIwDYLKEHOLQAPTI4KKtTCHirtRRKKPSSF
CFVPLYTEEDfNPNITFGEYDSKEE!!QYKSSOVAAFRNITFATDSYT
LYFOM~tLAT I ICPNPIMFOL,iKHLSOHLTIfVSNDFSSASSI KCEBNIJ'I
LTNLVHYNKIWPKATLYIECHTDEW:.AAS'ItILALOARRANAI
W RLo~TISYCKEHFWSCNNEWW00lIRRTEFY
IHAR
.
;Fn 077p 868722 ar;9lll T.42 hypothetical protein l;Pr:07R2 PPL077 87977?
KFRTRtIVEKLEFVTCL.~.SPDDDLITFNKOGLL.k7PEEEKVAFLVRSN1WLD:CPETPASFmlb~f~'tty;:.tc clt.trida transporter rF..:IJiEUFDIFPEYVEVLY;:NECLDVWFrICC."111ILt1ttElffIOLRKHHRKASRWiL:HYSRDr:l)tr:
MLROI.ef'VVFFFSFA.:LWAEELeIWR~EItITLFIEV::c'.OTDTKDI'KfOKYL::.~.L
t:/trrVtFJIVIIAVRHKFtfEPVFE~r'VWYUTSRWf:yiRRFFr:PLFR~Pt:ESYLLLFFTILCLGITRIfY:Y.
DIAfa:Df'.lw'I'rAA::KC:aSSFLAISLRLNVPOL3'P/LWa;:KTPU'fLC::aTI:~II
::1.'rfIPA(:ILINLVLIfIYFIAARLCMAOSYL'fRAHKKIFYl'IfGVPPLWVLLRLTDKEIKNFAL::VDIrJY
fIIIIMPf1911'AL'llf:fPCL~.N:KIVFALSSLCYI~KLKUr7RW1'fM'b:KNI.AP
vt:l f PVLEIfIMKRKLF.NVHWKp IYU.~.YI:
fT(Y'.':I.:ITI'KWlt,1\:::NPI'YLWC'lY.'If~/PY
FV IFLGFLFJffEf:ICKVLPIJU ~IJIITP;
1'I<KYLLAI'VAttl-h;Nt l'1.l'Ivsl'I:a.T.'7.PMl:RFPRLLNEtI~,fIP::FNI'fl::IJI.VFf.~, Yr~ ~I'171 ~~ItsSll u.n11.1 NKW
:NtbLY111::1.I'I't:11.\I'IiI.LTKKYFN::;~FAW~PDr:YF.IAPt:::VIKtiVH~strf'IDI::
rt.'.tl IetIA fatlym.r.r::.
::i,lm.t'id::I:l:lilrJl:rf::ITNKF:a'::WAfIY:AIIf.VF::J4:NAE~iELYLIiLVTKKTNK
IAIs:Vra:KItF
II'Yr:H;:YPI.%U::::ALUNF1'1.tKrJKL:LKYLPC:LIiIIQ(/:WNW.~.PLTEL:iSYY/OEILONPP:lf :At'fa?PIKIYrI.
of 41 :;:Lt:EI:I1Y::(r -IFi"rN::TF.~,YLtIpTPt:Is~'~:L%1'P.LLPOI
EFJ1F:.'fAEERFIIWO i v:Itl.::Ut)a.nl.l'NIt:DFll~'FLELPLEKIIIKVWnTtrJtIL::PEfaII:PSWSYY1MKLLRNSSs'Or~
rr/HI IIHIItNII :IHllon sN~,A%::1'/ffY.'Yt'LlrlTk.'EFAPINKKFwL;:L::ELRtIILKKAfI:.~.IPWf:PAAACfIIKPMVScT
'.'sss I,yl,.rtu'r s,~.nl 1'snr..rtm 't'I~LIIU'If.F'I::.'7::.WYf111:.'iRt:1.t':iIKLtIY.FfFfIF%FJILPKEEUKNL:i00If~aAK
WLIKINNK%LI'YfAI'rN'IIL::ILLLVI~A::fLi'KKHViPKAF4EYLVTIUf'KPIYITI'::VWIA' AKTtRP.':V.1'~QPQKQAKCCPPOt?tVQKALGKPTPI.fNEPNEILaILFW.'tL.'vLL'~.ll;.
..1I':DYII:x:EKI..iYc:..:::..::::.:.'.1:.:..:FCfP:'fPKti :."EPPKPSPAPTVAKK'1'fATEKP Wf'/:Ot'w:""'_.,.....~.F..........lA_.
pp':LFIr1YJ16SD
L~IFN' R
"
PP.~.ITKKNTO4iK1'QLQTL.iEVAGAL.~sLtIVDKTERSETSLKNT~IP3TAQLTNHSCLKJ1T.
..QHIJIR
O
TF:
.::
~L:
a~.lK S
K'4'IFlxIFJ\WAWRIIIY\'EI1~:1~
D"' F
~
'3EDEIl:ELFRT11LALPSKriYVRiKLVL,iPNGEfOECSFIw~EVSAADKOLLTORIOALPPO.
~ , ' .
r .
.
V:iLOKISKDTA
ALPLEIfIQALOP~ItLt'~ :'T.EDIfKYPSCLF:EE:.:.KCFL:IFttpC
'lfl~1 L ETENSADCfLTIL:aF~
.N.:
KFLEKYKV.~,KNt.:FHIKL
CPn 0791 x92359 991972 ~ or.;lr)9 9.r4~lv U .7 ~.Pn A%bD-BtOpOlymer Transport PrOCein _ ORAD.~.Tr""'!'!N'~tl''QYY.tI4KYPrI'EETCEPVMLTPL:Dt'/FVTLJMFTVAVPLIKrbsn-slom\
rrrriac~ry f.rmilv nr~rwm-PF~C
cnospnacase IRSEi~t -~
: : : . .... o ...,:.:IY.1'i ~ : :.. _ .. ~:A...\i~.. . .... .. i~:.Ial n.
:.. ... ~ rr.
'::Ii1:" . .. . . . , ...,.....
. , .'S'~VYSI''b:A
: i~: :i-! I::W rJ:.:.:::
NTLTOIVPLNVDVL:LFSLVL:ILDA.i:Ft:fPNL':.L...'VE11L~KVFx'.:YNELiLIKVFPNGD
..~.yy :,::_:::i: ..
-::'IY:IS
KIWASSIPENLGtJf(NHKIDIPILYfPFLAAi.KOSP101pEVfSJIpIINVFpAKCpELOGI
CPn_0795 9970 LYTfFSAGLLJCCtl.It>IOOSYLTVKTAILSKYGV:LItASDPAiJILNTVYPDIIt'RIfIIIC~QV
exbB/col0-polysaccharide transporterFU4~PCPIDSELGPLT.SPLDIGFNFYSFKIKDTEIWGCIETNPSIDIAVLSYAIGIEES
ONLYFETLS4NKDt'YSMMFSNNPIIQAY'fFJIDFFGKSIFf''LLILSVrISIM.HOItiJII
OKNFLKAGItSLKOFLIKNRNAPLSLDIHPELSPFJ1DLYF'fIKRCCLELLDtOIROSAPDRGfAPL.WRMRM'FAYf PCILLGSLIAFIVARRLSLPIRIC.ATANIESRIOdOJCLYTDDiLG
' PILSSEDIQSLETLLCAINP1LYKALLH10ISFIPATTISLAPFLGLLGTVWGILVAII'ttISLPSYPNIE
FEIIGII~IfItNAIIIIEt1L11L.AKTNFP.IQfIlG\OF4l:.Hi:.EpAQpRLLPN1 SCSS~tSAINEGWTALCTTIICLFVAIPSLIAPNYt.ItANSSELI5EIE0'1'AYLLIaISIEWIAYIPAITVfxDPF
fHFVVCECSXARLFLIVADJISGKGVNACGYSLfLIQIIZ.RlfLSR
"
' ' I
MYYSCNPPACYLDPDCETS
SA
SSSi.Q0AI0L"1'SRLfYFPttKNSCMFVTL.~IYCYN~
WLINpGMALGFLPEVJWITSKLFNPKPCSLPYLYSDGITE~111t~P7~I0lffCCERI4AAI0G
CPn_0786 881137 995293 LTGKSAAOAVNRIJG.SHCI'FtICNStIpIiDDITLLILKVLES
dsbD/xprA-Thio:disultide tneerchaelQe Protein CPrL0791 197123 891001 IPG
' L No robust holuolo0 Dresenc in Genebank/ET~L
fOOVHIIPGAEGLSESSY as of 11'/98 NHGVILNKFRTYLOTALIAPFFSFPALSCSFSSIpAeEI
' ' pKVfEEEGTTFF
KSSKNRSFLLKKSOQiQV5LY0lfWWFISOLKKSLCYSTVAdL:FNIPSOESFADSLIDLNL
QTPRIGIK:TASKGSHI'lWlOIPGEIGSPLKISWOLPIE
EWtJICGDSCLPGNVDLKLTLPY~!(iPSLY
.
CY1;9SALIVAINMPEGYTPGQEVELRAOV
GLDPSVECLSGDGAfSVGYFTtUGSTPVEIfpPFKYDV5K1IT!TT'..SVCTANOSGYAYGIS
PtriIIAEFTKTLHJIQt~ftVLFIJDHSVQVAOGKCNEIILNISKItINIITNAWE1ISEKAt>IQ.FAY' AE'tSYSGCTCCAWRLKViQitSGV01Q4EKLHCILLLrIDIIIGRPVESLTINSSAVIbVIOCFSYDA
YDCI'I11IC1'CSLJG71G1CYNCAKiI$ADCTLTPLTGITC.3FStfCFaRAISKC
' AGLSQYITILIMAFIGLIILtI4IMPCVLPLVTLKVYGLIKSAGENRSSVIANGWFTLL1IVAVkWVN
SCpPKAVOffASGAT'fYCOLADISGGSRSSYAYAISDDGT::VCSNESTITR
' ' CCPYIGt.7~GVAFIt.KVLCtOJIGWGFOL01ATLIIVfFLFALSStGLFBdG'tDffANLG.IYIVCAANFATVTNC
NpESNAtMYKDNOIIfD
~' GLYISGt%
NVPTYLCfLDI
GKIQSSF20CSSNNKAVGAFtIJGILATLYtTPC'fCPFLGSVLGLVNSLSfIAOLLIFTAIG
L~L7LSpYLVFSVfPKMLS1ILPKPOGWFISTfKOLTCfIQ.LVTV1WLVWIFGSETS'iTSWVCP1L0795 LL.OGWL1.:LGAWILGRWGTWSPK1LORVCASLLTFAFLOGAISItSGt~SNYFABPOQTVNo robust Maaolop present in Genebank/GwBL
as of 11/7198 SVNEDSLWpPFSLEKLAOLRAQGR15VF1MFTAKWCLTCpITBCPVLYCOAVOIC~?LTfIGIVGTLOGANSSA1GVSS
DCSVIVCpAQTADKSVHAFpYYNGEtIKDLCTLGGTSSTA1LTVSPD
TLEAWfRKDPGITEEIJ1RLCAASVPSYVYYPGDN&APVVLPEKITOM.LEDWSRFVRGKVLORSOIADGSWFIAP14C
NTDFSSNNVLFpLil~il'YKTInENGRQWSIFNLONBdOR
ASDrt!"lTftIAi.Gt~GLYVMILONLPStI~AQYfGIAYKIRPKYRLGVfLDF81F8Sil CPrL0717 185604 186101 WlrIINVSHIIRWIGAFII~IpDSDAt.G55VKVSfGYCKOKATITRDpL.C~fFIJIL''SGaNf ' yabD/yctH-PHP supertamilY lurease/pyrimidlnaselCVNfL
tydrolue ECVA7l0I>I~RYCKSLGdMtWPFLGLOFVNITRKEYTENAVOPPVNYDPIDySI
TRROPVDIrIDJUItMLSDDAFEEDINSVLOMODSCVSLWiMTfEKETI4RSFAYJIEitFPGICSNI11LVDSGiVCI
'NI~ONFAANTDRFSGSIASIGNRVFENLDYCIIfRAFA~tIM
' KIRFCNVCGTPPQDVDpDIEF~YRtIPNAAANSIQfLAAIGbIIGLDYGFATE>03IARI~YLSSDLRYIILGF
YELPYLQSLNLILRVNOQPI4CV!!G!
QAYLiILSLECFS.PLWHCRGJvF4iDFFRM.DOYY1Q7DPRSRPGMJICfIG?L.EAOELISR
(;<JPISISvIVFFKNAODLRDLWELPLflILLI>:fDJIPILAPVPYAG~04EPA1Nl~TNA
CPt1_0796 VuqV~~~y~,~G NO robust haabloq present in Cenebenk/<i~L
as of 11/7/98 SELYSSYLOP~IVPNSIILPLPCLSRSETFKXVRS108(TlBM.TPIfIYRRDWYfAF
CI~0781 186521 887132 LLTAIPGSFJIfnT.VDIJ1GEPRHAA0A1GVSGOGKIVICNIfVPODPFAI1VOFQ7CItlONLQ
sdhC-Sucranace Dehydrogenase PLL1VRPQCSVYPNDITPOG'CVIVCZt~IIfAIGIICSVAVKWHJCKVSELpM.IDTLDdVJISA
SLVKSLRNSRIiEICPEVSHK1IGKYYSTFIFRCIHSLAGIAtTFFI~DiLF1?A4Jt.RSYFSVBAOORVIIIt>LGi ISVAVK4f~OVITOLPSLPDAlIUICVIICISSOCSIIV011RIDV
QGKCFVANVNGTNKIPGLKIIEVAGLVLPFLCHJ1IIGIVYLFOGKSNCIfSGDGSRPNLRYSWRNfAVQWICDQLSVI
GThOGTI'SVASAISTOCCVIVGGS817ADSOTRAYAYIQ~MSD
1U0~1YSYTS~IpRW'tAWILLFGIAFtIVVfILRFIRYPViIVDIHC'1TYYAVDIOPSRYDVIVIIG'1'IGTIACI
YSWtAVSSDGSVTVCVBTNSENRYMAFQYAOGONVDtI~TIGGPE5IlAQf~YSG
IIGFLTLNLPNI'~ISSItYSRHDLGGADAALLSFJINSYLLTPSADTAFLYWRt111LGSLFIDGkVIVGPAQNPSOW
ILAFLCPP~SPAPVNOGSTWI'SONPRCINDINAt'YB~.II~
ALLYTILVIAAAFt~FNGLWI'PCCR<AGVWSLRMGGVi.RIVCYL71NIWTFIK:VSAVWiLOOL4RLLION51UNES
VSSGAPSFTSYIOGAISROSPAVpFIDVpKGTILSYRBOSIIOMION
YSVA
COLLTCAPMDWKiASAPRCGfKVALNYGSOMLVERAALPYTEpOLG8SVL80lODQOOG
RYDFMGETVVLQPFIIQIOWtLSREGYS610i11AFPVSYDSVAY8AAT8lIIfiJUIVrJIfLf CPtL0789 187136 889316 P101SfAATINERDL1~ISNI PFASLAIIYYWRQ00LV
sdtlA-Suceinau DehydroQenase TL.h'TNle'IpQPLTCfLSLVSp88YNLSf 0t40ltJiRKVIVVC~CIJIGiSAAI~LANLGI
IVELVSL'1'JCVIfRSNSVCAOGCINAALtfLKPE
EDSPYVNAYDTIKGCDFL7IDOppVLt7~lCLAAPRIIKI4<,tItPOCpPFIPBpSGNLDNRRFGCPtLC797 GTLYHRIyFCGAS.I'~Oil'IYTLDEQVRRRpUIGRVIKR>SMtEFVRLVT~GRACGIIhNNo robust homoloQ presafc in Genebenk/Et~t.
as of 11/7 H1 NLFNNRLEILRGD11VIIATCGPGVIlRtS'fFISTFG1GAJINGRLFLOCKAYANPEFIOIHPVLIL1WINVGTKIG' LNNSKKIKVf.GHLTi.CTLFWCVLCAAALSNIGYASTS0E8lrORSI
TAiPGRDIQ.RLISESVRGEGGRVtafPGDSSKRIVIPDGSDtPCGETGAPWYfLCMYPAYVSIaiGSRIVGASGaGaC
SbTAVIWC5NL11W(.G'1'h0 CNLVSRWGAW1ILRVCEAGLCIDGAl4P~lYLDVTHLPERTRHKLEVVLDIYIGCFIGEDPNGGSSA~ISKDGEYW~IS
DTREfiY'1'WIFVfDCROIOfDLCILGJ1TYSVARDVl3t~II
TVRBtIFPAVHYSMfiGAWVDWPAADOPDRDSRFROFltNIPGCFNtxESDFOYHCxNRLCAVCVS11TIUIG)~lllt ~OVIGVIIWEKCKIKQGKLLPQCLWSPJ1NAISEt>CI11ITIT'~10EI818lItI
NSLtSCLFAGLVSGDE71SRFIEAFGASOATSSDFORAt.00EKEEN71RLLSASG1I~1IFVLVAVKNNKNAVYSLC' 1'LOGSVASAFaISANGINIVtiGiSTINNOETNAF181KDE11lfDfI7lL
NEEIAKINVRfIV'MCRDBfRDLQE'll~KLKEFRfRLfffVSVLDSSPfANKBfItFVRONGPftOCGPSYATGVSAO
GPAIVGPSAVK1GEIHAFYYAEGET1EDLTTLGCEFJIRVFDISE~ID
t:EL.AL7IITKCALLRLiEl7~SHYKPEFPERDDEIiWLILTTVAVYAPEEPEISYLPVDTRHVAIIGSIK1'DI~GJ
PTLRDYTKSSTCKI1:LTNIPDNIRLPI
CPn_0791 902810 907856 =t'n_0790 119279 990103 No robust homoloq present in Genebank/P~OL
as of 11/7/98 sdh8-Sueel.nace Dehydroqenase WFEIIFWRVPMtNTCCONYRSiCWFSWLFVLT'i'Q'fLFACHFIDICTSGLYSWAPGv :1SRIPLIISVYPYRKAFItItZ7LETFILKIYRGVPGKOYWESFELPLHPGENVISAUdEIESGDGAVWCYE~NAfKY
KRPVNILGEINNPWWEpCCLEEVCGSCSILVNGVPROACTALIOEYIDATOSREIViJIPKWVNGALVDLGIFSOGIIp SFAEGVSSDGKTIVCCLYSDDTE'fNFAVIMDETGFNVLPNLP
:.TKFPLIADLIVDASIMFI>1JLERIQGWVAADIt7CETFGPQYf0E00ELLYALSOCMTCCCEDRNSCAWDASEDGS
VIVGD71MCSEEIAKJ1VYWKDCDDIit.LSNIPGAKRSSAW1V8KDGS
.1'FJICPQIDl7KSDFICPa.1i50ARYFNTYPGDKRSRKRWRAfJIGItGGIEGCC0Al04L11RVFIVGEFISEDJ
CPKKLPLTESISAVGREISKFSLRSLPSJ1LPKKKXicYVDGRNIDLCI'LCGSASPAFGVSDDGKTIVCKFETELGEC
HAFIYLDD
CPn_0791 893101 890111 CPn_0799 905001 903910 CT590 hypothetical protein No robust htlmoloq Dresenc In Cenebank/t7i8L
as of 11/7/90 T_LRSSRKIWEDISDRNNYSCYSKGISHNYLLHPFISRLDIFVFDSLIrINQt7pNLLEEIFNREWIMIKOILRSMLSO
SSLWMVLFSLYSL~Y~VITDKPEDOFNSSSAVIMD181CK
SV."tCrITTVCFIKD7WSPTYAVRWNYWCfKELPfSSWVKKSKATG
iGPHRHNEM~REEIIt.L101LKAf.Kf11PKLILESIRTLFVPSYSIIQFILIRHTL~11LFIPQTILSISSDG.iII
AGIVt3iELSOSfAV'IWIQMllIYLLP""fwAVrSKAS'GISSDCSVIVOS11KDA11 TIHVRQAALTALFTYLRQIrIGSCFATAPAILIHOEYPERFLKDLNDLISSGKLSRIVNORSRTFAVKWTGHEAQVLWG
WAVKSVANSVSANGSIIVCSVODA~LLYAVKWEON1'I1'HL
'cIAVPINL :GC IGELf KPLRILDLYPDPLVKLSSSH:LIIKAFSAANLIETLGDSFJIpI00.1'LIX'..IS.I IAKAVSNNGKV
IVGRSETYYGEVHAf~'HKN
0'MSDUG'tLCGSYSAAKGVSAT
LLSHOYI1AOKIQNVHETLTaNDIINSTLLHYY0L0ES~IPPKECLFSKEQVAFSTQH.KVtV~iSTTJINGKLH1FKY
1.:CGRNIOIJ;EYSWKEACAtIAVSIDGEIISnGVOSE
PAELSEIQRVYNYLHAYEEAKSAFIHDTQNPLLKAWEYTLATLADASOPTISNltIRLAtG
WKwEDPHSLVSLVMIFVEEEVENIRILYOQCEpTYNEJINSpLLYIECRNANPLNNpDSpICPn 49(10 90ti550 LTNGMRfR0El3JKALYEWDSAQENAKKFLHLPEFLLSFYTKOIPLYFRS8YD11FI0EFAeno-Enol.aae NL'lANAPACFRILFftICRTHPFrtWSPIYSINEFIRFLSEFfTSTESELLCKHAVINLEKERKEIKINFEAVIADIC
AAEILOSRGYPTLIfiIK'rtT~TG.~'/:EARVPSf:ASTCKKEALCPR
~.~ALVNHITAHLHTDVFQEAL.L.TRILFJtYOLPVPPSIIliHLOpL;~011'PWVYVSCGTVDTLLTL'tIvPRYQ
CKGVLQ.1VKNVILEILFFLVKGCSV'lE9SLIL::(J~iNDSDIL:FNKITLGANAIL
:.:.G'IFE:CEPLTLTEKHPENPHEWAFYA0J1LKDLPTGIKSYLEECSHSt.LSSSPTHVFStV:a.ATNIAAAATL
RRPLYRYLOCCFACSLPCfMNLItrYxlilAOFIriLEFpEFfIIRPICA
I:AGSPLFRFJ(WDNDWY~'YTWLRDVWVKQNODPLQC/1'ILPpLSIYAFIENFCNKYALOHVSSIKEAVNHtv\011 FHTtKKLLHERCLw~~lr~Vf7G~Y:FAFt1(J1SNEFJ1LELLLLAIPIUIGff 'IIIDFHDFr:CDHSLTLPELYDKCSRFLSSLFTKDK'IVALIYTRRLLYIlNREVPYVSEOOLFt:KUI::LILDt:A
A:::'FYWKTCT'IIY:RIIYEL7GIAIL.~.NL':DRYFIU::IELI:WUDYDGW
iE'/LONV.~..~YLI:I:i::RLTYEKFRSLIEETIPKITI'LL:>:ML*HIYKi:LU1()SYOKIYTEEALLTE/Ia :EKI/Qt6'~:I'f~LFYfNI'ELILECLS1Y:IJW::VL:Y.1'N~It;rI.TFTWAtKLAQN
U"'ILRLTTAMAlIIINIr\YFaFLLFAD1:NWPSI'/FCFILtIfC'FfEtl>l.WKFNYACLOtiQPLOA<:Y'ITI
t'IIR:x:f1TI"tTlADirIVAFFU(CQIY.T:.:L::I'::F.f?V\NYtIHI~IEIEEELG.4FAI
:1I9ELFA'T::HiwrfLYANF IDYI:NPPPP(:YR::RLPKEfFfTU::LfJt~.:YI:D::1:f:
':In 117n: w~4:r55 N:rIIUH 'a'n_rxnl rUN'fU'1 nUf.7i7 Wt'.stn M/Iinhrr.u:.~l pc~cein avrH r:xm.1..n::' Alr' ::ufslnrl It ''rHHt.ItlIYtaRtHKIrfFTKRVLFFFFLVfPIPLLLILlAM:FF::P::AANANLWVLtITRAIIF'Mfft~:Ll 1\I1':llvta>L,t'FU\/APL::AiNRrYiM~QV'.:/a-nly:Y'rFTINNVAFMII.
7YIL:'.IF.FF:YY.L'fIIIKLFLDRIJINC(./1LKSYA'.'sP:iAEF'fAQAYNE?MA(.::NTDF~LCLLDPI
'fl.Vl.AIiNKTL.AAA'L\'~'h:F'IiF:YFfiRIAVEYFL':Y'fDY%yifillY
f AI!::IIr1'tI:K::IJ.INDf.
F'f/:::VRTYtIIV:Ut'FIRYLNOtIPE711tKNL.HMV(:KAFt.LTII~:KPL1.111'LILVELWAa~WDSILN
tI*L::ACH::II.t'Jifil~fl.1\::::V.~.t:I'ftal:::4~JYT:.'NALVI.fU':YI"tl'ItNII:FnW
LVK
'ITf:X:LL'/::1'YPll::1I.QKDLFOSt.IItTKCNLCLVNCY'a7VLFt:IIQU::E:::FVFSLDLPNLNIIY
yA::PItVN::APRtaata'llrlF'1A'If::HIl.I*LEFIlIUfL'f al:'C:PIh:M111*E::VP
inFqAR.:P::AI EIEKA:a.Ita:GFIJLITVS::A77.Yrr:::llW 1 t't:\ t HF>La\
INKKRYLJ:LVIrIY. I t'!\t:rYTt.::LVPVSDLI fe'r ItJFI:L1:F.NNAFfDDRt I EKOH
I 1 Flllrl'Mlfrl F7iIKE V:
J::ALKVPtIIICFFWI.AFLiMWWIF:iKINTKLNKPVit:LTF'~.'MIlldWRn:NIINVRFEWjPYPY'Kr:llv 'fY::HIIF'n:Alw:.\I1'n:1.td:fl'I'It.OFt.I.IIDE=:INJfLI~lIIAfIYHt:IIJ::RKQ::1.
ia9 yE'C:FRLf".:AFOIVRPLTYCFJWKYFRIrJtYVwAT.m.,tLDNA Mtsawetn Hey .'EVOESSCitt'Jr"ptIRPTGIPDP TRAP LQLL,DFL~. IHQ 2.11CE l: EV.
~ ::1.'If EL I Et'w':.DACADE:
' LCWtIf~ILTXAPM
. i:
NPE IRFAT: ;tJVDDLwEEIRLRL:iOKHEK .
I LV I :ITKRLAEOMAGFL iELEI PMYLNSG, I .
'JRfNq'~IC~'I~EF;tCI~
SI
EIETL3GOOGALi'IRONCr~9FRAEJai;l~
~~' ETAERTCtL:DLR.7aftOVLICVNLLRECLDLPE1ISLVAILOADItEGFLRSTSSLIOFCG, aAApHtlr:KVIFIADOKTRSLEElLRETERRROLCLDYNKEHNIwPKPTIKALFANPILOr " ~
A:,::.IOtEIQSSIECDOCVR'lYtflOW'IVxEPCAtIpLGrtYtVIISL
P~pR&t :ORFLiKEDLEEOIKKYEAL1~1QPJIAItEFRFNfJIAKYRDAM~CKEOLLYL0.?DRLGIRKLIEHRILST1WIGWS
~1ISECHHEIQIAK06~:P'OERVAYVMCONfMQOALTI
' ' 'SKDSESPKE ~
. .DAYAL:.LPLNRY/VF
. .LF: :KKLT
OKL1NGYRIVCYL:.~PSFtIRPTROCOKIFaIDRPIE.
F
VLK;,YLPSS>JCDFMMPOKI6ARICKEEL4t'.DCiXEAIVE'.'LrICPPf::'..:R'"HOEIEESO
~n~751 90R7n~
::VPLPMFRMLE'.'r0'I:EEESVEFOCNLFAY35EOV~::LEKCEYT.~.R~PKSCNDYIIYSSWR
~P
ORO
n_ . r'.":4\:'..,:. ..,.., ...,~...
: .
. . :: r:r ~ : .. . y..r ;,..r .
r-,,.. . .
' . _.~ ... ..
_. ,_. ... .;,It : .. E,r-J .:v:i. ~:.'.~: 'Y:i "/:'.... ; :
:TF
,. N~ICECLTCATF3KH0f1~'FDVSWLK:.i%d-%":KPkKIa'UirlitlRHLLLIlSGfMF~i ....
.
Y :, iAKEEVLt>uDMIIYEVLADW4iJGIDPIKSIZYLOSAIPEIYELHLLF.itfLLSINRVNGI
PSI~tDMARNASIEEGSLSYGLICYPILOBADILLAXAQFVP1ICKDfIIJdtIr~.TRDIAPNF
R CPIL0813 920813 9:193) ' DPN P
NRLYGOVFPEPEVL.OCELTSL'dGI000G10tSKSAtNAIYLSOSDATITkVRX11Y1 IRATTPGRVEGNPLFIYHOIFNPHKDIVEEFK)1RYROCCIXDIEVRARLAEELIHFLIPIPepP'Anr>.nopepc>.
dase ' KERRSEFLSKPLALQNVLCOGTHIOUIEVAKS~IEEVNL*ICFSHXWRSLLKEfL.iEOLAYFLHWJ1IJ1CILLIOGO
EYIIF
TLILyIKOtAitISNDRILNA4RALSEHNLDALLt FVYPMDKDLYSHIORVPLTFL'tODWADLSLYVOKQRYCKIGFDSASTVYIfKFAQ~LP
CLWtPLOCFTEItIRSIKSEEEIRRMOEAAAUGSA<iYOYVLTLLR~"ITCXEVVRpGRAIIi CPn ' _ iDRPLKXCCIV~IDIGIWG7fCSOKlInf171LG
CT581 hypochecacal Prace>,n AEAGAEGPSFPPIIAFCENSAFPHSIP
FMMKTKTLELEONVfLLL>''~JLIfRIFATPIGYITPREFQtiVVFNCANCQOEIANFFPEMTPH
I0~1L1VRVLRENHLDTYIINCICIIIRICR
LINGKLTQELAPOQKOAAHSLIAEFlOIPIRV71IIDINERGEFINFITSOMLTOOFRCIFLNHIHEYPCSPRGSQVIC
.f.~fl'ITVEPGVYFPGICGIRIEDTLCIt~l0~IF5LT11RPVISE
RLARVDCQEFLLMIOVDNTCHLIRNLLaRLLEAQtOdPNCEIOdLQEIQEEITSIJOVtiFDELL
CPn 0811 911996 923357 0804 911071 910310 CT911.1 hypothetical protein CPn _ FfLFFKLSYNtIFNLPLTMYOLLSICYSFVSFIALLWNLCYSPNYVTDLYRISLSAEESL
qp6D~CHLTR Plasnid Paralog EIFSSMGNLKTLLESRFKKNTPTIMEALARKRMEGDPSPLILVRLSNPfLSSKEKEOLRHLGGIRAFPOAESLLCCACA
LNFPDLEERLPDLRKELLFLGSNDRPDAOGCRFSIALiISSKE
LQNYNFREQIEEPDLTQLCT'..SAEVItOIHIiQSVLLHGERITINRDLLXSYREGAFSSWLLCYIAALKFRVYLIiV
'1'NSSItGPVYSFSP10GVP1'EWIECFSVSVDCRVE111fVRLOGLIaEL
LTYGtrRpTPYNFLVYYELtTLLPEPLKIlD'IEIDIPRQAVYTLASROGPOEIOCECIIRNYAGISKPRDCETLFLNP
PJ1NKLDCWEIACFRVOASFPVIIQXIRRIGVDKFLIJOIOGAEIfADXA
ERXSELLDAIRKEFPLVETDCRICTSPVKQAt.At4.TXGSQILTXC1'SLSSDEQIILEIG.IKTXERVDFVSSDEEt iIISRYLAVCtM.LWDCNC~IpTCGEFpCASSRAPLFEIfIaI00KVMIA
XyNyFpm.XV
DLWNIbO'1'ORQTISLVXGVPSPIEINEYIREIEFTCMRSWSKPIVLVOCrpRt.ILSPOpN
LRTAIOf3iiEICLSRAD0IQ0YV1GKV1CPLLVFERLEXDLRGFVLRGNI~'t7~RTLVC1'ISL
CPrI_0805 911816 911067 ~ PLItaCPtPAVASpEVSSN1'ItSAAANPGIL'L19ROG5 minD-ehranosame partitioning ATPasrCHLTR
Dlesmid protein GPSD
GYJIRR!!K1'IAVpISF>(CCTAKLSTTLffLGAAi.AOyfIQARVLLIDFDApANLTSGfGLDPDCCPfL0815 YDSLAVVLpCEKEIQEVIRPIOD'L'OLDLIPAD'l~f.ERIEVIiCttWADRYBHERIJfYVLGSgspD/OilQ-Gen. Secretion Protein D
VQDKYDYVIIDTPPSLCWLTESALIAAI7AlALICATPEFYSVKGLERL1GFIOCISARHPLMVPfPNS4LNLVAL&~G
.CCSS4YALTIAEIQIASLEHSGRGAODYEIiIASPNANOtEYSL
TILGtJALSFWNCRCIO~tISAFAELItitCTffGKTthtl'KIRRDTIVSEAAItt~VF'ATSPSAOLSKLYEFJ1RX
LRASG'1'~EALWICDLIRRIGEVRCYLREIEELWAAEIRIEi~.EDIfAL
RASCOYFNLTKEL.LILLRDI
WIQIPCC1'IYNLVTDYCTEDSIYLIPOEICAIXIA4'LSKTWPKESFEDCT.TQILSRfGIC
VRQVNSWIXELYl091K~CSVAGVFSSRKtE.EALPIrI'AYICFVLNSNVDAtlTN011VLDIF
1NPLTlfNDVIAGRVWIPGS7VGENGELLXIYNFVpSESIROEYRHIPLTEI~IISIL
=Pn _ NMFREDLZX1HSEESLGLRYVPLOYOGRSLFLSCTAALVOOUZI'IRELtEDI:MPIDK
LhrS-Thraorlyl cANA Synthecase NANNESPPti!$J1WN104IOV1r00RiYEVLEGTTMEWCOLfOQSf~FIGVLINERPRDISTVFWYItVKNSDPOrr' ~rre~DyfSGEtOtASVGAADGCG80LJ~L18I0IDfIYSEfARD
THI1JE1GDTLVFLTSEDPDGREIfLNTSAHLLAQAVLRL41PDAIPTIGPVIDNGTYY~'ANGSVKYGNFIADSkItG
TLIMVVEKEVLPRIpIC.LXKLWPKIOIVRIEYLLF1Jt10.IWt~IIB
LSISFSDFPLIEDTVIIQIVDEK1J1ISRF1'YCDI(QOAL.ApFPQNPFKTG.IRELPCiEEISGLNtI.RLCEEVCX
XGCSPSV~111 .LXT~
GILEFLFtOGSTGSSIVPGYDLAYQFLJ111CEWRI
AYSOCEFFDLCRGPHLPSTAlIVKAFKVLRTSAAYWRCDPSRESLVRIYCTSFPISKELRANASPSWI70ip1'PIIRI
AW~tSIAVSSDKDKApYNRApYGIMIIOIZWINVGE~tSY
HLEQIEEAKXpDIiRVi.GAII1.DLFSOQESSPGMPFFHPROMIVWOALIRYWKQLff1'A71GYXITLtTDTi'FL7 1'I'GXNHD~tPDVTRRNITN1IYRIAOCETVIIGC1RCIO011SD8107GI1lLC
EILTPQIlaIRpLNEYSGNWDNYXAtItY'1'LQIODmYAIXPIB~ICiGClI:.YYKTfILHSYXEPDIPGIGKZfGM
SSTSDSLTEIPVPITPKILENPVEQQmrrrsrre~~pp( Pt.AVAEVCNV1IR0 TPEOVIdlTillILOLVSTLYCfFASFJIAAWWIKXLEMFPAbCVSLSpV0t0EYDGC
GLE7MLELSTRP~TIGDDSLWEL71TMI1~IALVOSG'1'PFIVRPGEOAFYGPKIDIHVII
t7AI0R1WOCG1'IQLtMFLPERFELEYITApGTXSVPVlfLfPALFCSIERFLCILIF31FKCCPn_0816 RFPIIiLSPE01JRIITVAl7RtIIPRAKELEE7WKRLCLVVTLDDSSESVSK%IRNAONIpVNgspE-Gen.
Secretion Protein E
YMITLCDHEINENVLAVRTRONRVINDVSVZfiFINI'ILEE109SLSLTALLRGIOfellMSILSOELL.DILPY'tF
GIOIiICLLPIEEBSLLITIANATATSVI110DEVIG.LIX
KPVRFVLXEESCIt.ORL00LY8NRl~tI80!$.LTIDtICDCITISEEED4LkTl0SIPWR
CPI>r0807 913950 914879 LtifliILKFaIIJ~tASDINfEPCE~IRYRIOGVLHDRNSPPSNLRSALT1'RL.1tV61001 GT580 hypothetical protein DIAEMRLP~ODGRIXIHIt7GQEVOMRVSTVRIIYGERWLRIL01WNVILDIACLJ111103'L' TLQI~LtMSLFLVFLTAFIWSSSFALSIQ.VtBIASAPIFAZGARMtfIAGAILILaAwIPOGEILTKDTITAPECILL
V'IICPIGSGKT1TLY5VL.QEWOGPLTNIM1'IEDPP6YIO.IOIJIQI
FVGISKXIPLYIVIS.ALTv'FYLTNIFEFIGLOSLSSSKTCFIYCLSPIHSALFSYIOLKEAVKPKIGLTFARGUtHL
LJtQDPDIi?IVCBIRDOETAEIdIQAA4TGlR.WSTLJf11D11IS
KYft.ICKVLGLSLGLVSYICYLTFGGGGDDSpPWISapICLPELLIL~GMSLASFLW1'LLRQAIPRLLOMGILSYLL
SATLVGWAQRLVRTICPYCKVAYTPENDEKSFLiIBtL~I'~L
IEKOSfLSVTAINAYAMLIAGM<SIMHSAWEPWRPLPVQDISOFLYATLALWISNLICYROQCMICPRSCYIfGRQCIY
EFLRPNTLFRSkSrASCIRPYHILREfAEpIGFLPIL.EtIDI
YNLYAKLLRK1CSSTFLSFCNLVMPLYSCFYG<JILL~GEKCVSt.GLVt.AVAPMVAGCRLIYHAL.71VSGETTL11 EVLRVTIOLCD
EEFROGYri75 CPn_0817 927106 928187 CPn_0808 916398 911956 gapF-Gen. Secretion Proclin F
C'"579 hypothetical protein GGRMPRYRY1'YLDPKERRXAGYL.EaL.HIOEAREKLAQEtIIWi.DIREVALRRNSIKSTEL
IXKLPSWALKSLKRMPQSAEPSLAHIKPIIFKGaCIAtl1'SGVSGSSSODPTLAAQLAOSSIyFTKptrrr.r.sa:L
OKAGHAOSGHDI'KNVTKQCAQAEVMOGFEDLIQDASAQSTGKKFATSSTTKSSKGEItSEFDH!'YCSGV1N1GESYC
NLOGC1.~IITWLEERAOITKKMKiALSYP'CVLLVFSFAVIQ.FP
KSGKSKSSTSVASASETATApAVpGPKGLRONNYDSPSLPTPEAQTINCIVLKKGhCCtJ1Lt.GIfIPSLKETFENNE
VKCLTItIVIGVSDCLSAYRYLFLCPASALI15ACIIl9U0tIPWICK
LLCL'JNTtsIANJIAGESWKASFOSONOAIRSQVESAPIf,IGFJIIfDtOANIWASATFaQAIWS.
ILEKLLF11LPGTKKFWKVAVNRFCSVASAILXGOGTLIEGLDLGCDiIIPYDRLRTDIOtD
LISCIVNIVGFTVSVGAGIFSAAKGJ1TSALKSASFAKETCASAAOGAASKALTSJLSSSVOIVQAV1GCCSLSOSLrI
QRSWVPItLAIGMIALGEESGDLADYLCYVAHIYNmItpKTLASI
QTMASfAIt)1ATTMSSAGSrIITKAAANLTDOMAAAASKMJVSOGASKASGGLFGIYLI~KPNTSWCOPVILIFLGGL
IGVIMIJ1ILIPLT~IIQTL
wSEICVSRGMNWKTOCARVASFAfRJALSSSMOMSOLMHGLTMVEGISAGCfGIFJANNQ
RLAGQAFAQAEVLKQMSSVYCQQAGpAC:OLQEQAMQSFNTALQTLOIdIADSQ1'0'1TSAI-FCPn_0818 N predt,ecad OMP (lesdec 11b1 pepcidel CYTKM7GF~JVWSTRDSDFSWWPDRCpNV~IIDPt'HXOYPNIIKCVLRG909fROKRXO
CPn_0809 91'791 916307 SITLIIhafVVI1'LICIIOCALAFtMRCSIHKCKVFOSEQNCAKVYDIiJMEYATGCSB'L1I
CT578 hypoehetical protein EIIAHKETWEEAs~CKEGRKLt.KDAWGEDLIVQWDKCODLVIFSKRVOS~ROt dfMISISSSSCPONOKNINSOVLTSTPQCVPQQDKLSCNE'1'ICOIOOTROGKNTEfIESDAT
IACASCKDK'C'STTKTETAFL'pGVAAGKESSESQKACAD1GVSGAAATTASNTATKIAIIOCfn_0919 929117 TSIEEASK.iM~TLESLOSLsAApMKEVEAWVAALSCKSSGSAKLETPELPKPGVTPRSCT5e9 hypochecWal protein EVIEIGLAt.AKIIICTLGEATIISAISNYAST~ADQtNKLGLEKOAIKLDXEREEYOEMKA9LY:'ICLFLIWEKFHN
NIGKANFHLKIITTDFLTDLYIVTIRDPIAYPLTGIC
AAEpXSKOLEC2MDI'VNTVMIAVSVAITVISIVMIPTCCACLAGLA1GAAVGAAAAGCA
ACAMATTVATQITVQAWQAVKOAVITAVRQAITMIKAAVKSCLKAFI!(TLVKAlAKACPn_0930 729012 ISKG:3KVFAKCTCNIAIWFPKL,~>KViSSLTSICWVNr:VGWVMPAG:KGTMOIpLSENCT567 hypothec i.al protein t7QNVAQFUKEVGKLOAAADMISNFTOFWQpASKIdSKOTGESNEMTOKATKt.CApILKAYOEBLPCRCL'CGTFFRr iET~SIRTEMPMCNSIAMKKOKRCFVLMEt.tJISF'FLIALLLC1'LC
.1AISCJ1I AGAHKTNNF FWYRK IYTVOKQKER IYNF'lt EFSRAYKOLRTLP.'.TI$Li3'.iYEEPGBLFSLI
PORGVYRD
PKLAGAVR.1SLIIHCTKDURLEWtLCNIKI7QSYFETQRLL
:HVTHVVL.~>FOIWPDPEKLPE
s:Fn DAIO ?18193 17925 TtILTITREPKAYPFRTLTYOFAV.K
r.'T577 hyprXMICIC~I VCOteln t:EIWIKKtKKTKKA\b>KMFVKRVPEE:iOEMIIQQLEL\V~DLYKELFLAUTFJ15LTDKefYt_01121 nvH~l7 'llOni.:1 N(IItL:fIML::r;'t'LE.:LJILEELTQGLFF:.'.AQEDAI,IFAKEL::.7lNfK:LKNLTTIVNKQMVKrTSi .:, nytt~cn..t r,:.si Innr.nrn l:At: IYfNhkLA:NKfYM)I'F I
FTI.LI:L'f::I.V::(.%AFOAANAHKRCM:AOTf Ela<:F.NFYI:IKRSACA
F f EYrrF:K::RII4: A I LR I::KI*r:l!VTfY.p LAKVATKKKCsRYRLWVI'F::RFItIN.~.RYNLYA
CPr_tIHII ~s13.s).f v1920~
t.L:'.EffEI'\':'f7TA:lA'\IFIRLLRhA'l'JOTYxP/Ff~:.~.f.'IAIANALI::NKUELLERGAQLG
Is:rllli.v ~.n k.r:ynlu:.
Oror...rnI'1'\'IF.'fl:I'LI'I:f:IsAE(F'IKMtJ!r:::::N::4::LlItYG('IEEK:a.C:IK.'Kf .Nf.IFfIDfLLLEAVL
H
llsfAIFII):IIr.:M::KI~:fkNWI;WJKP:il::1'tIKKTR:SP.LAtL,nVWKK:\K.\DLLFY~1IIIIHPT
l4il'IN'lRl'1'::LLkII:IWHAVKhrJF7IAVII:IV70lJvALELFYTHTDFftI.El.Ht*M(rt.LL:iR'f F:FEfYY:.t):fill'F1:1::NI:LL~,.~w~LLA:L:D7LLEEI7TVAYTF'1::~~:KYNF.A'J:LFYJLIJ1A
I~LL1'.IlIKKMFDY1'f.:::Y:InYLF'LVIdIJMfAI::Pn:Ia'f'.:K::fKI.
yrt~Jlr/Y'MI ~ :1.::: ~:YI IULIILYNEJIAtI:FFLAF
DAQPONP I ffYY 1 Af'::I.LKL.pL\P nl'tr IW :.: ~3x~ml.w ml :'l EE :N
NFIDVI'NIsIm;fINl'F:FNfl.f;lRc:VIMKQ:aEKVWY:ETKKAITYKI'r4:K::YTT17JKY.::(:K
Yk n~I'.r.. Iry1tn11Nf r.~.nl In.rirr fYLII'JLI::rIYNiIt.elflMf*.~I'1YF7C:::KI:::::::QFD!:LYRKVKDLII::NI'KW:KWKKFL
~'Iw_nNl.' vlmHl '~~Ilyn_ ::HHN:F'.Atia'LVLL',:11ALYI::WN7:1.1'IN":~/I/:FIIVF.IRKMI:xIL~::Y::IMK:PIY
NAILf.'~:LIIkFVLNIPSFAV.';FIYLCVILaFI'.~.::ITMn'~CAEFJIKVNfTt :F
.(KDROiHPKTrtIc:::VEWAKTHGY:TGPKAIALPIYA
_.iTCSKDHCDIfHpDTSNKPS
tpLt,ADKFK00LLiLG:YD~sLEYALRYDIRt.LROJ1SFSFSAYL\TPrx.'LONGSLIYPNYC
OR2S '15191A 7lLSO/
Y.iP!'I(CIJfOVVCITI~RROAIIiYIC.r'LNERPIILCOEPGf'~iHi'~E.'t~RIL' r.'Pn ' ~ 1~SITtIpCFFt.EKKNDLPIQ~t.rVEPQDIFUfVIOa yscTJapaHYr,nT TranloCation T EOGIrtlFNYOVGDPST('EIRF'3lWl RYAIQVRFSN':'::INr.TIKELMCICLPELFSNLCSAYLDYIFONPPAYVWSVFLLiS:r'POAALKRLPNFFSSPI
rFfLKOLLIEVtIROSRGIK',LDLKPILVCIG6SRCId.IGVEL.YRmIC
CFAVAPFLCAICLFPSPIKIC:~L~"WLAIIFPKYL1DT'pll2tYM0l44Lf'fVLLVKfZtZIG:fSLIPI'PLOGL
CFLPRVLPPtatVPQFLTQYIIpHERILFPNPpTILPPESYELVIQSINRPH
VTCFVL1FPFYlU0SAC3F:INQQCIOGLEGAT~LISIEOTSPNGIL'lH'lF'ITIIFwLVCPASPWLOLELKTNIG3 5rPTCIAIw7CWCSKHTFLPfOACFLDLIfONLFQFLKOfL$TOKC
~HRTVT.iLLI4TLFVTPIHaFFPAf?MSLu~APIYIITNIKMCOLCLVM?'LOLSAPAAIJWLVIAEN'IYTJ1NITO
VFKLDAIrIPL.3VTCi'TIJWPL:DLOFFSQLKAACLPpIPOM.F$$OFIC
-"d.r?.:: ::!M\Lw''':. ...:.:.r Li.. .'-:L:.i?!WF:vI11111'...' .
FT!:L:.:'......::vWFI:~Uii.-''.:.i:dF~F.':I~IM:..:.a""tiv.\-\:.:.::':F',.'.
.':.iiF
~.'~i :.
:.
' ,:
....:::!: ~, ... .. ., ....:.: .
.
. .,.:.:'!~.t!':'.:'.~,..,. ..I~'..~,y-.i:A~...
:;;f::..:!iii.' .~ rl:...;..
VFDELNMAKNK.i:nOiHKLLCR:lJI4ihSK:n,iL'iv:
PtENNLi,EFKI;(j,Dllt,pNS(ypSpilLF
Cprt_O8Z1 972677 932779 KKL("tKRCSSEELFIIPSOCLLLKL?RPFI''rRRTJtKLVLPELPDKYESIIACtLSPDOE
yse5/IliO-YOpS/tli0 Translocation KLYIIATLOROISHIOKLE1'PEEPATNFLNIFALWHLKOIC~Ip7IVF!'1<DpDpYK~ESG
Protein IRTRAVLAFFATSFKSVLFYSYOSLLLILIVSAPPIILASIVCINYAIFOMTOIOEOTKwNAIVKLLKFSLNACYKVWF
SOYIHMIRI:1'LYLEEICIKYILSIOG7ISIlI~ILTF
FAFAVITLWItGTtIIIStXIiL.SNNILRFACOIFONFYKWK'ITDPNCQVFVCSLLaAGTGINLTA~IV4INYDRwI
MPAKENQALDRVIOtIGQIDJMIYR
LIT1DTLEERIHYLIEKKIRLLDKVIASODSNII3MCNREDLLTILSYKDDICISDSCtS
CPt>r.,0875 933618 977677 PVDAPVEDCIIGVLPPEDS
ysCR-Yop Transloeation R
ERIKVfTItARSIFRFSLCFFfLSVSCClADASLYC4SCPSRCOPTPPPSNSNPIliWOQPCPeL_0836 916960 VAASSVPSYNPPLNADOVLPRDNLSDGSFSDTYPDITTOAIILIFLAtSPt~'LYNLLTSYLbrn0-Amino Aeid t8raneMdS Transpost KIIiTLVLLANALGV00'IPPSQVLNGIJ1LILSIYVNFPI'LYAMYKDARKEIFJINIIPOSLIMKI~ASNSLSIWSI
GCSIPIIIItFCAGNIVFPLALGY1IYNAtIpwS7lYlGlIG.TA
:TAEGJ1L1YFVALtIKSKEPLRSFLIRNTPKA0I05FYKI50KTFPSCIRAHLTASOFViIVCVPLLGLVSMLFYSGI
IYOKFFFSIGAIPCMIFITAIILL:GPFGGIPRAIAYSNATLIS.
IPAFIMGDIKNJLFEIGVLIYLPFFYIDLVTANVLYAMOlI:IIL.SPLSISLPLKLLLIVMVDLSENKSAFIPSLPIF
SAICCVLIYIFSCKLSALIQWLGSVFFPIIG.VTLtilVZIRSIIIIP
'GWtLLLOaltISFK
THPMVpEFIPNAROAwIaGFIEG1~T1?~LLAAFFFCSIVLISLRQLVAEEID(Pf6IEIPL
' SfOCI8K1C41aSLiILrGFFLAAILLGM'YIRFVLSMRIIAGLLVNVSKptILGRISAlAIG
CPc~0826 931382 933611 PNSILAGVSVFIACLTTEIALVCZVADfLARVIISFKR14YAS11VICTLIPt'YLISIWFE
yseL-YOp Tranaloutlon L
TISNLLLPLIALS1IPALIVLACGHIAYKLWNFAYSPVLFYLTLSLTIVLK<.VN
HDNKRSGVFSSL1IFIDPORYYAIVIQBCFFSLIFKD~VSPNKKVLSPFJLPSAFLDAICpT~T
KTKADSFAYVAETEQKCAQIRQFaImpCFKECSESWS1IQIA!'LEECTIDrLRIRVREALVPCPrL0877 917777 LAIASVRKIIt~tELELt(PEfIVSIISQALKLT~ICNIIISVNPKDLPLVLKSRPELID'tIhch-fnodnucleasv III
VEYADSLILT11KPDV1'PGGCIIETEAGIINAOLOVGLD7iLEtU1F51'ILKA1CIPVDEPSETLTNKO!'ILRTWA
LFPNPKPSLEGNSSPFQLLIAILiSQiSTDKAVNBVTPQLPAKAp011 SSSTDSS&LSNDODKXE
OSILDLPP'G1C.YOLIAPCGLCERKSAYIYQLSOILVRDFNGEPPNONU.LTOLPG1~P1IT
ASVFTGIAYC1IPTFPVDTNILRL74QRWICISElIKSPSAAEKDLARYP!GNENTPIY
CPr>_0827 935773 934131 YAAQYCPALNNKIDNCPICSYLiIKJEiINSTRT
CT560 hypothetical protein CCLVTANfFCILDILMKNSKEDDLSRFLP10JLLVESPNPEEIPLxSLSf'RISWLPTINPBCPeL0838 919196 wITIAMKFFPPEI0G0LLAWwPEPLVOCILPLLEGISIAPHRt:APFCAFYLLLIC.SIOCIRthdF-Thiophafe/Puren tbcidation Protein ?CGITEEIFLPASSANAILYYTGPVICIALINCIGLYSIAKB.KftILDKWIERYIDiALSPISINIIfPNSFIQ.FNL
KLGILSESSFNP'SIFMLIQ(DttIMIATPpGECSIAWRISQpWII
TEKLFLTYCOSNPMOtLET'1N!'LSSW1TDALROFVNKOGLfPIGRJILTKENJLSFLwYFLVIADRIF9CSVASFAS
HTIHLCpVIFEEM.IDOALIl.LI9tSPRSF'iGK'iC!'F
RRLDVCRAYIVEQTLKTWYDHPYVDYFKSRLEOCMKVLVKACSOILDALIAI&ARPALPGEFSOMFIlIGKIOLVpAFJ
IIONLIVAENIDAFRIAQT!!P0 GNPSIDCIOEINTLIIF~i.IIFLEVWIDFPEEEOPDLLVp0EKI0Ni1L1lIVmFI88f0lDO
CPr>_08~8 936292 935267 RLAOGTSLILAGKPNVC%SSLIJ4ALWLaIMIVTHIPCTtROILEEOiditOCIUtIRLLDT
ysCJ-YOp Transloeation J
AGORT~IDCDGI&PALS11MEF~1DCILWVIDATOPL6DLPKILtZI~BILtJ11U1DLT
IKRriIWIMVRRSISFCLFFLKfLLCCTSCNSRSLIVHCLPGREANEIWLLVSKGIIJNIOKPPPFLOTSLPOFAISAI
tiGECL'lpVKQALIQSAMOKOEAGKTSRVFLVS87UOAlIiALVAR
LPQAAAATAGMTLGOIA4VDIAVPSJVpITEAIJLILNOAGLPPIOfGTSLLDLp'AirpCLVPSELCLI~IpQNLYLO
PPEIIJ1LELREUNSIGMLSCKIV'IESILGIfSI~'C1GK
OEKIRYOEGLSEOMASTIRIOIDCWD71SVOISFTTENJS~lLPLTASVYIKI~rVLDNPNS
IMVSKIKRLIASAVPGLVPENVSWS>RIAJLYSDITINGDNOLTLtIDYVSVNCIIWCRSCPeS'0139 9N~30 LTKFRLIFYVLILILPVISCGLLWViWKTHTLINtMOCl'10;FFNPTPYT1(N71LE71KKAEGpsdD-Phosphatidylavsine Decarboxylase AJN1DKEIOIEDiIvDSOGESIQiALTSDKDSSDIfD7LP0GSNEIE:11PLfIVBRt~.VQXPOYIDRITIDtRVIEP
IFYEKTMLFLYNSKLGKXt.SVPLSINPI18RIY
frWI.OKCbyl1'RRIQIRPFl84RYKISEKELTKPVADF'1'SFNDFITRJ(LKPWIPIV~KLVFI
CPeL0829 936729 937198 TPVDOAYLVYPNVSCPDKlIMfSKJLPSLPR3.LW~LTKLYANGSIV1MLIPfDKIIIIFN
No robust hwsolop Dresent in Genebenk/E!~LFPCDCLPQKTACV51G11LFSVIIPLAVKDNFILFCENKRTVTVLCC6pIrGKVLYLLVCR111V
as of 11/7/98 iCYICFVpTLAKSfYINIRDSRFYSWL.CFI
GSIVpTISPNOTYAKDDEKGFFAItOGSTVILLFLPNAIRFONDLLID'ISRIBPCfRCIJbQ
IYKT)YCE
FFLANAKWPLVPACYRRVRGImfYiSPLVDLVILFPWlr1'KD6RYSPCSMTII'CICRSIVESIlaRiDfIELI
CIPWSTLFGIGRFCAVWCVGFSCSTFDKIYNTIVAVLGILGLGILTFILRIIPSVLHt.
pVwPLFKCYS CPtI,",0810 950111 951541 CT700 hypothetical protein CPn_0830 937339 977959 ISaRNtJCILKTFIGIAKRDKSOILwNIMwLVIWAt.AASL71IALVA1~YYRlYYlIIItYAV
No robust homoloQ present in Genebank/ti'>8LOVIRHVRL3NELKLWALAEOQLLPILKIOtSYRROCLFItYlQIILRIDpRtE681JQ.LAlAI
C
as of 11/7/98 DSCSFLLPCTEYEAQTFPOVFSKVWYKYXSSRI:.LIALLYNITLVIGLIFINKKYLCOKKLG~PYFFLCIAYKAYRFG
AFIfECAOAFASVpQOGf'EEEDAAKYASALVIILG0L0ARC
GRVILKIY(~EEFFMTERFPSIGAGYLRVRNIWSVLFPFEDLtC.VCPSVPKDFPLSAFSLI6PWISPLSNOETFVrIO
ttIYITSKRYKDAI
n KYL17CLIYWSYLESIPVVGAFFPSIGRLFAMWCiEDFPGSIFSRIYNTIVCVLCILGLGISSYAKAGKLI"RIILLSN
PVYKLEALFNIGLCEOKLGRFGKALLIYOSSDGWBRCDAiiJIKY
IMFILRI IFfLLTLPFWLISCLKSSM
AAMAAMDORDYVLAEPCWCL.IILRCSI'FAKDYKCCIGYCFSLCRLRKYCpIIENVYCQ.ION
FPDCLTACKAIAWLCGVCYATLLDSEEGIXYAIDTAVELtkiSCETLELLSACEARCCHFDA
CPn_0831 938219 938174 AYEIOSFLSSPDTSLOEKORRSOILR1LRKKLPI1~HNIVEVDALLAA
No robust homolog present in Cenebenk/CM7fL
as of 11/7/98 NKRKN1TVLIAKSESEGAFFEATpNYPTIQpGYQLVRIREHNLSVRAHFDLSLSLDASVNPCPr>_0811 951719 M . secA-TraneloCase SecA
IKRHIG.CF(.IfRFFGSSOERILKKPOKLVDtIVNIYD~.TPLSDDfLRNKTAELKOItYpNG
CPn_0832 979750 .938827 ESLDSNLPEAYCW101VCRRLAGTPVEVSGYNORWpMtIPYDVOILGAIAl0t1(GFIT~Idt lipA-Lipoate Synthecase CEGKTT?AVtiPLYLNAL1GXPVHLVTVNDYLAORDCLWVGSVLRNLGLTTGVLV&G?LGE
VMItCRpTLNTDQPRVRKKLPERFPKwI)QRPLPOGSAFNATDATIKRSGIIPCVGEEALCPNKRKKIYOCDWYC'fAS
EFGFDYLRONSIATRLEEpVGRCYYFAIIDCVDSILIDfJIRI'PL
RACWSRKTATYLAIGIriICTRSCSFCNiGNSKTPPALDPTEPERIaLSAKEi.GLKNWITIISGPGEKNNPVYFELKF
JfVASWYLOKELCSRIALCARRGLDSFGpVDILPKOKKVLEC
MVARDDLEDCCAQCLVDIIOKLREELppJITTE4~.ISEFCRSLWLVSKGMPI1JRVLRRVREHPDLRANIDKWDVYYN
AEpNKCCSLERLSBLYII
\SDFOGNVSALHTLLOSCITIYMOiV
ETVARLSPWRHKATYARSMFYLEOAANYLPDLKIK.iCINVGLGF11DGEVKQTLODLASIVDEHNNDFELTDKGMOOW
VEYAGGSTEEFVlIIDMCNEYALIENDETLSPADKINKKIAI3r GVRIVTI~,.OYLRPSRKHt.QVKSYVIPETFDYYRRVGEAMGLFVYJ1GPFYR55FNADMILAEEDI'LL!FaPAIiC
LRpLLRAOLIldERLriIDYIVRDDOIVIIDENIGRpOPGRRFSECL11pJ1I
SVQIHIASA
EAKI'NVI'IRKCSO'L'LATVTLDNFFRLYEKIJ1GM'CCI'AITESREFKEIYNLYVLQNpIFKP
CLRIDNtR7EFYITfEREKYHAIVNEIATItK:KCNPILVGTG~VEVSEKLSRILRpNRIEHT
CPn_0933 !11171 979717 VLNAKtRfAQFJIEIIACACIfLGAVTVATll21J1GACTDIKLDtdEAVIVGGGiVIGTTR1108RR
lpdA-Lipoamide oehydrogenase IDROLRGACARLGDPGAANPFLSFEDRVIRLFASPRLN'ILIRNFRPP6DGYISDPMFNRL
RCVLFEILITVSEISA1'pEFDCWIGAGPSCYVrIIITAAOSKLATALIEEDQACGi'CLNRGIETADKRVODRNY'tI
RKFnLEIfDDVIWKOR0AI1N1PRNDVLtIll6$VFDLAKEIICHVSLM
CIPSKALIAGANVt/SHIKHAEOFCINVOGYTIDYPAMAKRKtnVVOCIRf?CLECLIRSNKVASL'MSDROFKLWL'L
PNLEEWITSSFPIAtNIEELROLKDTDSIAEKIAAELLOEFOVR
ITVLKCT,3LVSSTEV!(VIGOLiITIIKJWHIILLT'C3EPRPFPGVPFSSRILSSTCILELFDHMVE.LSKAGCEEL
OASAICRI1WRSVMVMHIDEOWRIHLVDMDLLRSlYGLRTIIDQK
EVLPKKWIICCGVICCEFASLFHTLGVEITVIEALGHILAVNNKEVSOTYTNKFTKQGIDPLLEFKHESFt.LFESLIR
DIRITIARHLFRLELTVEPNPRVNNVIPTVATSFt0a411NfIC
RILTKA3ISAIEES~OVRITVNOpVEEFDYVLtII:ROPNTJ15IGLOC1ALVIRDDRCVIPLELT'llrDSEDOD
PVDETlIATNVPNIYAIGDITGKWLIJUIVAStIQCS'IARKNISGHNEVMDYSAIPSVIFTHP
EIAMVG4:LOFAEQ(NiLPAKLTKFPFKAICKAVaG:115DGFMIVSHEITOOILGAWIGI:Pn_OS142 vR5015 PIIASSLt.EMTIrIIRNELTLt'CIYETVIIAHPTL_EV'~ALLATNHPLHFPFK3~.T702 nyptttnHtw,O
prucmn /frame-ahitt with OBI31 KYYTFFTI.~.A:IPW::NL ALICfI::EPEYIY:NQLLKTQ.iL(.TTtNDTLLNAPKDFPlISKIIDKN
''fr~!IHtA nAISAA '1IGOta ILFI:f/dQfrL.::ll%AQFLIN?IRRKFWIF'PINOOVW.~.EWLPFI
't'. :r. Ilyh.tr It.:t ir:a l Iltrtr.:
tn t:IS,ADFANETF110RTCWKt'.lX:::V::MIIVtr;!:FY('.'\FVrDPf'VA:XX:FS::r.'HIt:Pn 4YA
: $H ~ wSS~ fD 't.A'rtA
:FPECA.iK
N.FAFGLF'AV:.::EIAtIf:AVV:Iy?NP1'OFTNKOVIphW.~iR.0:1dPL1ALF47hLLAFAFLILr1'7n:
r.ylxtrhu i.:.U prntam trr.,mr...~.hitr wir.h O9A11 Lt':'ftr.7:LVL'IWIKNAAYIO',:I U: tIKtIKL: :
:'S~IrtFKVICJYItVYS>PI'(IPDIt11Jt1EI)(.(.DN.~.FJ1A::LDKYr~CIr;V'IVEFJJrOQG
V1VAYRCYAK:FL
GLLI'/L/:F':VFKlf1'tllVftCt .:1't. IIH :'. 'IAL4'IH 'tA~LIAv J
m,! 1 ::WI/::NI I.unilY heli.:ar:a 'Ftt 4RAA ~N~ wv.'P.I ..'hl , tlNtIIVLFIUrtIFRt~7AM011f.LJIHRKETWTiFY'Et::~?titll9Uit'u\I'EC:YWL~TLKWDIDm %ptw:r.TF.c:.:m:Tt'-ItualuI Irr..n.
Nla'F.lt:.':a:(IX:f7'L'LIIIXiS\YFAVYpAU:IJIfWILYFHII:aWIAVF::11FFLD::IPWA~YNHIF
TTHLKIALIGRIMV1:K::::l.l'DII1!.'YI::IAIVfI::I~%.TfRI*LYt:FIJIAPV:VFAVV
:1:HV'ITt.E:.:T'SIITLTIFR.'(.::I:EVFQOWLRTIIL\.~EfaT'JFTN!"PF'LK::AL'lR'fAKKFFF
LIfM'7:/L11N::ED'IFqKlli'IN~~AlaywlYFJviH/i.Lt:Jti)ITr'3tTEF.DN11.AKLLG'LILKPL
NF:FT:Af?S:flt:l?1::47ra'f~.:IIF::LOY1'jiLVFKAFIt.:FtTf.f:DIFIKf.FalIIT;:LFIJI/
::ltptLLVAtIYADI:H~EELt~IHETYK1~:11LI'P!I:."PAllf)KII(Uft.tI~RIKLVMILPEPREEEEE
i:
:LEEV:VDfIIF.E~EAALP.'aft'fPOf:LVITflC:F:.LNt.~IYROLTENNLPMf:'.
'EILWIRSFQNC:VNC'."f:A.~,I~:::,f".'!!tRLr""...:.::
r~f.'TLPESPCCAPICTLKIAL.IGRP
tM:K:'.'.:IfNr:LI.NEERCIIONCP4T!'RDftLOILY:NK>XtOYLFIDTAGLRKlIKSVKNSIEfY1'YOCCA
TiftaISYCT.TI'7AKCNYLDALNOEKSYWOAR~F:L
"DQ'fDOFATNI~v5 YIIiw:RTEKA1::RADIt:LL;ttOATOKL:w(EKRI::'LI:iKRKKPHIILINKtiDLLEIyRtKGTZYRGLDLFK
IfIKIRICI!>pIFL~I/RLRI
PI~, :illY~,~'L.'p 'w"I
EHY''.KOLRATOPYLfbAFJILCt.iJITTKRNLKKIF:IIIDtLHIIWSNKVPt'PtVFfKTL~ISAt fir'rtQIATFO'tpKH9CLP5LI:KYPIyI'NK711FIK
PLONf~S fOkTt.'PTOONV
LHRMIPOV ICrRRLR IYfA
ICKT"TPGOFLLFIASIGI~IIYS<iK7IAKYIltELIKEITTFOSADLYYSL.iIYLKCIItR.pAVAOP
ItIAKSLLTKHYEYYLKIfCLKSSFNLYCI LGKAVG1:.
PfDLEfICEKMCPtIN
NDLKTRANADITRCNIIIKAAIDKtILVEIKAOiIELSK.~aCTRtLI."_~.'_TNfKS(;SOIw.lANL
SCL(~'FL;iCLTLKAVNDFNATYFJ1F:AEIF~tPfNM~IItRCLATFLiFVtOIxC~CITPGC
''Fn nrl.t5 n5ql il ~5~a5r1 OOOLLOANE.S!'.QCOF3'?F.~rNQOftILfLESa~ANQQESIfGVSAAL~LLNpNVSKIJIRIIIKS
.. .,:,.., , . HaO,,~.~-, ::.li.~:'.:' . t,.r~,.. ,.:Ih:YrJA'IF'.":::":Pl7Wl.Mt:n., ..~., , -RPLEDLDIATNAs'PTIVSTtPPtriII::LCJAFGIIIJKpOCRLfEVATFRSOGtYKDCRHPiT7l~
nypotnecvca: Protean ORIIFSSI9tGDALRRDFIVttCNYYDPfEDKVFDfYCIRDILKl07IRAIGNPRLRFSEDKNINNPKIAUWSLPLTAJ
APVFEESYffPa'Va:~.11DYVOAT'lGSPIILTVLKDVIKGwIR.D
LRILRAIRFSSSLu'FCLDP'ITLRAILXLAPALVNSYSPERINOGJIKIQ.IOfOPI'GALSLLIGKiIFL.T~OCFI
N'ILTLU1IIQA.iIrIDpSSRFSRKKEtKIIItQFIILtIUIAfOMTklSG
LKLKVLZFIFPCLRDIPYS4LRTTZLFARKfHPTIIPPILFLLPLfnGVStWITVJVCRVpPI7IDPVADKItPLOSAF
AYVLLOKYIPAOttALYALCRELHLSGYApIILFSPLLISIIKS
LRISNKEt.KLIESNYEALPNfQNpSGNRVPWANFL~ISPfIIPLFLELfS11L4KDPSR00HFINSAPINYNIGSYIS
OTS<.TANFAYCY!?IILSRYt~IILVSpCRLDIAB'1VK111GItWIIiA
ISRVptLESRLEOFILRIKTSSPWSAPDLZ.1KGISPGRLtGt7f.LRRJItILSZENCLDKSVKJWVSL'tDROKKCI
ECIIASYTKSLOVINTOLTDVITI'FiI,ASITFVPGL~I7fDISYRIV
EKZLLLL.OLKGtsec O~.SI
IAL4NDL1M.VDGKVDITTAVNOGLLNFFT1YL?DOpNYCpt~tpTpptIlLtx.E
LIWpppWSLVSASL%LU7CNY7TVI9GF10~t CPeL0846 9597!7 95A11~
clpX-CLP Protease ATPase CPn_0851 97119 971106 RENHMtKXNLT:CSFCGRSLKIriItKLIJIGPSVIfICDYCIKLCSCILDKKPSSTISSAPVSEatopB-Ourer l4smbrarte Protein 8 TPSCPSDLRVLTPKEIKKNIDEYVICOERAJUCtIAVAVYNNYKRIPALLF01KOVSYGKSNCPTDINSKNLKNLRLAT
LSfSMFfCZVSSPAVYALGAtZJPMPVLPC1R4PE0'tWICA!'OL
VLLLGP'ICSGKTLZAKTLAKILOVPFTIAMITLTtAGYVCEDVO~tIVLRLf~AADYWACNSYOLFMI.IVGiLKtGt r' rlf'DYVfSLSANITNVPtIITSVTfSG.~L"t'fPfZTST'1'IfNb'DFD
RAEpGIIYIDEIDKICRTlJINVSITRDVSGmCVCOALLKIVifXiITANVPDKGGRKHPNpEIIOiSSISSSMATIAL
OtTSPAAIPLt~IAPfLKOYYRLPWiIYRDITfIIPGtfA
YIRVFTtENILFIVCCJ1PYNLDKIIAKAIGK'f1"aCFSD00ADL.1~KTRDHLLJtKVF.TtDLIE.SLYIDCLI~I
C15DYCIVAIGLSLOKVLiiKDNSFVCVSADYRNCSSPIMYIIVYNKJWPE.
AFGNIPEfVGRFNCIVNCEELSLDELVAILTtP'L'N71IVKOYNLLFAt&NVKLVFIOCFALYIYFD11TDCNLSYKt SISIISIGISTYIaIDYVLPYASYSIGNTSRKAPSOSPTELH~FlMFIC
AIAKKAIfQAKIGAPALQlILFtiLfttDII9PEIP5DYNGINI0E0TIAt2DtAPZIIRRTPFKIRKITNFDRVNFCF
LZTC:ZSMiFYYSV~RWCYORAINITSGLpF
FaIA
CPtL0855 971001 972991 CPtL0847 960019 959787 ppdA-Glycerol-3-P Dehydro0enase elPP-CLP Protease Subutfic GGBBIpNIGYLQ83IWCPCLASLIJ1NKGYPWANSRNPpLIKOLQtERRNPLAPNWISPN
KLFDEEfOHTL.VPYVVEDTGRGERIIFmIYSRLLImAIVNIGQEITtPLitHIYIApLLFLItISP1TDM0J1INNAE
NIV~11TSAGIRPVALpLKOZTDLSVPFVITSI~ItpMSiLIi$E
SEDPIQ(OICIFINSPCGYITAGLAIYDTIRFLGCDVNrYCIGOAASIkiiILLLSA01'l~lINLLViGDSV'PPYIG
IfL&GPSIJIKlYiI4GSPCSWtISAYOSOTLIIpINl~IISLPIANYP
NALPNSRMttIHOPSGGIICTSADICLOAAtZLTLIOUILANILSECTGQPVEKIIF~SDtDHTDIIOGAALG~LIONI
AIAGGIA~LRfCaO~tAKAGLVTRGLHtMiKLJUItI~CKPiTW
FFNGAEE71ZSYGLIIIKWfSAKETNKDfSST
GLiKiGDL.CVSCFSPSSPNf.ItFCNLLJ1QGLTFmAKAKIClNVI9GAYTJILSJ1YQVAIOWK
ILIIQITDGZYRVLYEM.Da.KtGIALLtARNtKCEFL
CPn_,0818 961556 960177 cig/muri-Tripper Factor-pepcidyl-ProlylCPeL0856 975110 977995 isoaerasa VOASSPAFPFKSNJOCGCLVPRSLSNEOfSVOLiFSPGCIVSAWKVSP~TKLFa071LIfA0X-1 Homolo0-WP-Glucose Pyrophosphorylase KIKKEITLPCPRKGXAPDINIASRYPfINRIQLGC.VTOQAYfUILSZYCDNRPLSPKAVRGSRLIwNVRLTVIffESV
YSPSAIqfVNSL7IDIfLKAINOEHILDINPSLSPKQppRLf00LTS
SNSITQFt)L0LG7UNEFSYCAFpAISDLPWSiLSLPOHE7N1.SEZSDSDIEKGLTHIOI~FVDZt~fAlOppOLLSS
PTAIIJmFNPITSF1ISSGtDPGtANAGTTLLKtKINAL11VLR~Q
ATKTPVERPSODGDFtSISLtNSKSNDtNIISSMIP:NKYFKLSIIiA~'LWKLINLCISG4RtxCDOPK~fPVSPIK1 0IPLfOLVALKVRAASKL7tCOPLPLAlIftBPiXfROTRSFF
TCHRWtTITSPEIQSFLRGDTLT!'fVN7IVINSIPEIDD6KARpIQAtSLDDLtUIKLRILSIISnFtG.DPNO~V~F
COPWPLL?LSGDLPLtf7l~'1'i.AIGPt~NCCZ11TLLYfif;NYAC
CLEKC~11~CCLOKRFSEAEOALAIIt.VDFCLPI'SLLLERISLITREKLLtiARLIOYCSDttNIOfRGItHVBVIP
I~PLiILPFWELOCFHANSt~B~tEVTIKAJILRpfIIILDIICILYKSNDS
LLIfRKSELIKEAEtD~ATKALICLLFLTFDCIFSDCU.TISRtGIAYI~SRLRfOpOPPKCKfSVILYSLIPONEAF1 1L1~DGKLK7fCL7WZGLYCL~FIRIWIYOpLPLYKVNKIWt DIS~1'LQELVNSARDRLTYSKAIEIM.RKASLL.ASTPSJ1QL.GI~'SLaI610~OBiKICCFIFDLfRYSDHCQfL
VYPROLCPAPLIOa.~NIISPDt111EQRLS
IXtAIpLFHKVlGKI(LSPNTTPLLFJIDFYYPSTSTSLNWFaIK
AFFCEPFfGB
CPn_0819 961752 965~A5 , mocl/snt-SwF/SNF family hsliease CPtL,0157 975108 975792 ADYIINSYSRCF~~LMWt~RDFSANILODCKKLFtOGJIVItfAICZL>iE~E'1'VCISAQVRCT716 hypoehecieal protein CLYCNIYECEIE111>RStBO'ISIDSNCpCSYHYDCONIVALLfYLlOIfFN~4VIlAYJIR>iRDI.IJiiJIpYIK
TARGISRI141DRL.G6LSLZLKVKIHKYLDTLIOJpKRLALTVSRNI0f1'pa0!
ETDNCINLLVItKiLIfETFYAAATKtEERKDRAtpKtIAOtdQ.GIYFlIFISRONIKNYDILLEYLIITLOSSLYXQ
OSLSLRFLEINNOOI.O~.I~tR
EKDBIILLiIVLTYSVNEDTfAPIINDPIEPOLVLRLPCRSKPFYISNIRTFLCGVLYOtPIVKIIACIKNNKYSKDOL
IOT
i~IGRRfFfTIQStNABt>AKI,II7LLZ ALOVILiIIDPtN
CLIIDNGf70SI~ttSFSGLPCCNL$EPIL~ISLTPCPn,pSSB 977115 975757 VD
ODi110PlroTNLLESL1APGZIHHFVYNWFSpOIKRIWLRSFSRtJIDLIIPEALIGSIROiAtlii-Flagellum-spscifie ATP Synchase LPV!~pIYIIEIANVHLLNSFVTLPYVDEYRJ1ICI7NSYLDGLLEAIfLIIPLYGSLRVPAASLiIIDISt'fRNQRR
TRPSTFCFDSIB~INLNKLKLtIINNWQPYRACCLLSKVSfrTILILYDGLSiICL
LOYOOVRAFISDLGILARNLVLERKNLL6VFSGPIYDCROGAPRVKSLKKIY6l7ffETIPCCLQ(ISSCImPNLLIEY
:v'FNNNTTL.taISLSPLHSVAL.CI'EVLPLRRPPSWLSDNii.G
ANQIIRITFNCPLNLSCpFIYDETIFB.SFRtxiSDRVLDJ1PCNPI.PKfItRKPt.LSLPPSPl0~0lpPIDpIFP' tCIK7IIDtIFLTL~RI
I
SAKKRFLLLPKAGOQSNGTRRGKVNSGKLPCILVLpL~tIAPWOIfNtIGfKVLDLX.VQGVISCPOSGKS&LLS71IA
LGSKSTINVIALIGAtCRtVREYIflOfSNALKp~tTZIIAAP
KCPLNSLTCISLDOFEJ1LPVNPSNSERLIEICKOIRGLIEFDfQDVPOCIOATLRSYpTCAtItTAPTKVZAGRAAIf I'IMYFRLOCNEVLFIHaSLSRWIAALpI.IIALARGLTLSNpYA
GVfBILERLRKFOiLNGILADDtI~IGKTLOAIZAVTDSKLEKGSCCSLIVCPI'SLVYNNI~EASVFHFNSE!'1'LM
GI~IJOOfGSITJ1LYAILYYPKNPDIFTDYLKSLLDfNFFLTSOGLALLI
fRKfNPEPRTLVIDGVPSORRXCLTAtaDRqVAITSYHt.LpKOVB.YItSFRFDYWLDtASPPIDILSSLSRSApALA
LPHNYIIAAERL.RSLLKVYNtALDIIHLCJ1Y1'PGDDEII.OKAV
HHIKNRTTRNAKSVIQiIOSDNRLILTCTPItPISLtEWSLFDFIJIPGLLSSYORfVGKYIKLLPSIKAPLAQPLSSY
CYLCBfI'LIfOLtALAOS
RTC,11YNGNKAONFNALXIONSPFILRPF9fEtM.KDLPPVSEILYHCHLTBSOKELYOSYA
ASAKOELSRLVKOEGFERINIIM.ATL?ALKQICCNP11IPAKOApEpGD.SA1CY0M.1~LL
CPt~0A59 SSWDSGHNIYVFSOYTKNLCII1CKDLESRGIpFVYLOCSTKNRLDLVNOFNEDPSLLVFCT718 hypothatieal Prot:ain LISLKAGCTCLNL4CAD1VIHYL7tiMNPAVENQATDRVNRZGOSRSVSSYKLVTIiJ'1'IEEVfLYtt'POSPGSLS
pSHLPNPHDPWDTtP'tSLPEDPNOKASCELNSLVNt.FRK<SINLLS
KILTLQNRKKSLVKKVINSDDlWSKLTWEEVLELLpIEVLK!lVppLKPDIrz'!m.-ZCEKFLYKXLENPOELALLLSTAIARHTTLRSLTPIKVFLN
PLIL.KTLTOWISTHELPNIKHAEFPPOTSCARSGFKIETPNCILRQLISELLONLLbvLT
CPn_0A50 96575 ?66790 . A
mreB-Rod Shape Protein-Sugar Kinase LCKKYwNCCRYDFNSPNRNLFKLKNFSNRLYNRALGRFOKVFNFfSCNVCIOt.CCANfLVCPn_0860 x'8679 YVRGRGIVLSEPSVVAVDAOTHAVLAVGHKJUtAM.GKTPRKINAVRPMtDCVIADFEIAEIliF-flagellar N-Ri:fg Protein CML.KALIKRVTPSRSVFRPRILIAVPSGITGVEKRJ1VEOSJ1LNAGAOEYILIEEPN71AAIRTLVFfONLAKKLTA
LCISiLCCLLIG31IVSCAILfGRSSNPSt.APTQVKTEKT9CNnK.K
..rvpLPVHEPAASNIIDICGC?EIAIISLCCIVESRSLItIACDEfDECIINYNRRTYNLNLTONLTtPKLIESLTKK
ECLEKDLTSFNPIASAINAIALSTEDOUNSPI1ILSVILTLRKe6 ICPRTAEEIKITIG.iAYPhiOpELENEVRGRDp4ACLPITKRINSVEIADCLAEPIQCIISLTPSL.LFSITDYLC8.
3L1LKRLNISLSt7NLQILYIPFw~ITVNSLPtN'ILIDIYtGKIFP
ECVRLTLEKCPPELSAOLVERGNVtJ~CALIKGLDKALSKNTGLSVtTAPHPLLAVCLCKEMPALAYNAKJIOCPTt.C
LTt~ItNYIIWLTKEtStKIVAHTKHYLYONYUDSYDtVIETL
TcKALEHLDOFKKRKCNLV
PFARt.QNItKSFPAKVLIC:HILVISLMI'/ALASFYLARHAYERVSPEPRKIKRCINISKL
LEIIOKESPLKIALLi.S1 L: PKlfAPaLLNRLPEOLIWCVGIfYKL
CPn 0951 ?66778 068195 PckA-Phospnoenolpyruvace Carboxykinas~CPt1_0961 ~-975Z 979925 REP.~IMVWSTNLKHECtJfSWIDtVAKLTTPKDIRLCDCSOrtEYDELLTthESTLTMIRLnitU-NitU-related p:'oceln NPEFNFNCFLVRSSADDYARVEQfTFICTSTfAEaGPTNNWRDPOFIOtRELHOLFRCCNOASYPFTWKPLJftLPLEF
NIFWSSLSAK'MKKFLTPHCACTFSEEDAFJ1KLNILYI9IIpGN
r;RTLlIVPFCMGPLDSPF.~.IVCVELTDSPYV~IC9~ItIlftRFGDDVLRSI.CI'~IfIFLKCLHRLNaKZfIFr iK.VDtIKNCt'LLDAKFQYF.HPYLIPWPJ11R:NLVCGKSYSLAYIOfILODI
::VGKPL.:PCEADII~IPCNPKSNRIVHFODDSSVMSP''SSCYGGNALi.CICKCVALRL71SYW1OKSLRVHJWCP
ALPED::I: LYitPVIDALDTAVEOCLEIPLEDCSLpf4711~uPNNL~CMN
K::pcA.IWEHNLtIGITNPECKKKYFSASFPSACCKTttLANt~IPKLPGWKLECtrppIAWIPYSOSDWEALTHEOK
t.YALR.tTLAEKT:PYtANCIfCEViYESLENFTVTLAYSQiC90CP
HH:RIYiPLYAVtIPtYCFFCVAFOTrERTNPNlIIrITCRSNSIFTNVALTADCDVYM80LTE3SLG3'lIlJSICOL
LRAY IfELpVKVDE3~L.NL:HP
OPPEPLTIKPWItPCiC;iPIAHPtI.SRFTAPLRCCPSLDPE4MSPGCVPLDAIIFGGRRS
tTltL'flFAt.:~lF9KKY1'Ii.~CllS'.':ITTAAIW7CL.aCLPHDPFNILPFI.'GYHMAYYfpNWL~Pft_0 9f2 ,~Jn24 >7'l7.~..
.:IIFtIR.~.LKLCNLFt:VHNFRNFBJpI:EFLWW:FCENLI'ILEWIfQRTDCLEDIAERTPICYyfttwttit5-rFt.m..l tc.~chin t.l?IfQYFtIIIX:I.HLDL.~rrVQELF.~VOAtY'1J(J1EVEIdt':EYLKLA:~Ot.'L'~~OITDELLRLK'i PtiTIFRLTf7CKT::r.'f:'NEKIQtIRKAFPIFWLtRIQVAtIPSERVKE:9'AIJI::OIPdLPpG
::1:1 Kh:Y. :.AIJf IhJiKTCE.~.I RUL>.t:LY(fi:t1 L FRF'/PtIFt'IFMI I VLAALVt?IL:WPtt:RNll t ILPAH
U)pLLItL~LCRHOt:tt:IT':'tJIIlV181F.':P
IVEF'.r~LtETL::PR::f.LF::It:AAHC:LT1:VIQP
~'irn nHS:: .wrt~7A ,n~a.:f!
lLPL4:LCKDRRtLI~II.CI;UItJVL\FLTI'EIIiIADIITF::.~.AAIJ,Xi~It:::It7CIFIRKGL
":'W l hytxmtx:r ruU Protoiu t7tVf:..'IiFPPFfPSA::I:
F:'.:\t'MNp'1'N:F.ERI::ALI'f.FTFI1T:.Nl~'KKLIUELO::VLr::I
~
iY.lattU'flYt.rt7rV1'14~i'::YINFTPNVITAf.::.~ltlflP::AfEt:a'.::1LFFOELODKIIOCiL
.AF::EVONRLPNIWAAfPDttAEa~:FiILllr/J:fYI'.'aJ:YERF6~F4WVi.tlNwtiI::PP
LKIIAII:LVVF:L::AF:ALN('A~1VOT::I::YLPTEE:i'.'.RCS:L~.N:LIDR'1'tlPt'h'1'ODfVKAI
LQll:H::AU1F.~.LTER::KUI.t::::KLAH.WIILLIKIIIa'ILLt::~:::
Nt,FIFFT::K t FVFI :LINfVFK::1't.~:lTPt'Ff:
IDP::NFE::A ( I LNY ITLIJJNLhPKFAACST
I'rnAU'InALIAt.t:UFVKRIFJ1LKMIMP1W::Itb'iIAFWUF:IFfPt'INMIQV4:lPVTDYW':Pry nH67 t~YISjOWl,y ~.ylxtt ~Y~3 I
VyINd.::INITAAytIK~WI.KNPf.':ILK(llLtlf\AJt'fftJA'MIYPADAEYNARMCNIOSLIspM
(hoePtKxFIYm't.y. N.H.t...
WO 00/27994 PCT/US99l26923 FHMALLILLPiIG0:rIMMEKNLf.iG4JVDLPL...~pOCHLLIOt-SROSfMCL:ptLP::.
XfFKCF':I'AIIKJL:.FF:..~.'tK.'.F~.::.It:LESAL1:.
.fSACAAIONLPIDCIlT3RVR f;'...1RL
iLRR
' 7FEOOEIf:w Ih'LRTFI'E:
PIAKA:
' ' ' .;IXtALLWnINNSKKIPYIVF1EDPIW(ENSRI'I:aAEEtTN~IPLY053AWFRNYGELO..
.
..
.
.
.
rTD'~.FOS
t Ft!P1 RHLCC3AKAf:
Llt ~
~' ' '~T
~
KNKKOTAEOF'iEERVKLYIItR,iIKTAPPOGESLYD'CKORTLPYFEKNILPOtANGIWVFtf~
' .
' .
r~t~YD.WTL
/%' ' ' OONATLCFJV':nPIF'.'T1ERNRLDFOC't'3R'~d~ftLVRCATC?l9L.i ~110~'PSD
EELYL iLELPCCKWVYQ~D~KIEKNPCF~
EAAiLVNS!"t't'IQCgItPLTIRGLP.iLVIGL..~VATFICo:
.~ANCNSLR.iLtNDLEKL 13P0~JRLRC,LYSTMLSLLVKS
LRSMREMWKOLLPOLTVLDFSEr~..SSCC:LDVFAEGIAVRiNtJJCAVSIN:.
ePn_ORFI 99!559 993371 y7bt:-Dradrcced Pteudouridine sYnituse Yf;IIIJVt'KVRIIIKFLA.irYllAiPRKGDEILFSGSIrtVNGRVAECPFVL'IDPEDKVOVt~1'SCPn 11875 ~ ~77h7 't't44I2 -~..
.;l:u.m..,.. y,y.;YYf~l~:l'F:: -'vHLiYF":~':6L.'.i....
..,.,,....w ' -v....
_ _ ...w=..-rr , vc r--.,nr, .. , .,.y.... ...,f.:_:A fi::r.-. . .f'~:F'IP:F:~!.i':!~th'!~'::
!:FVIY ? III:-.:.' :I . .f....F":.
SRRLFAP.KW':.iKL'fL.:VyANh'h:iAEK~:~t'.;,LEdCL.aYIaSAArw~.i::i4AL.i K RLsRR
SD
tWSEGKKNEIRLFADAAf:FPLLELNAIRIGSLVL.GGLRYCEYRELTWELGTYN%L_ ~ITOLSkI'FSOAit80 M
~
Y
,. ER11PELI
Ptt_0865 9811 II 987942 VOG
AAFASCL;.LDSCIY
CVIS6ffDf ~1'865 hypothetical protein SPNGYVIYVIJIGSIFIGISLGJIIfCOLYYSVKSVLfS'hIYLL.'iYYIILEKRNALU1LSOLVGECPn"-0876 EDApSpKEIDFLSOCDKtSWMFLIG~tSYEIIPTFK~LLSFAVOCFLESIETI1~RdaOA-D-Alanine/Glycine Pensease ' AILCIEtifyiASIOJGFDFEI11AYEFJIVFJfYLKLRQMPi~ti~ISKLFRFLOVPSIRFSSSIR'fCLITG.Y1IE
0I1~IKLSTSFCVFPNILLIGCFLlfIKLRGLOf IIOLIO.CFNiJ4.CbLLD
DSSSKANEVSSYGVAGILAf~7JlGIIGNIAGNAVAI~1C~PGALVWVWIaALi.CAIVpYJYG
SYLGSKYRKP~'CEf'IOGPIJUCLAt~RItKIIJYGFF11LPTIhtLAFCACNCVptISCIVP
CPn _ LCAGClPGKLLVGILLALWIPVI'7lGGt~BIRILRFSARVIPFIAGFYCI'SC~IILfONABA
birA-Biotin Synehatase "
hs~fKVIYYEIEEIPSTIILZtAKSYNIIIIrIDPYALTVIS1'KCOTAG't'GKFGKS~rIKSSKGDLLN1'ILPAIK
..SCIIVSILOANlKSI01 FCFFITDLfIIDVSRLFRLGTEJIWALCKDLGITEAICIIfWPNDVLVHGEKt.CGVLPC1'LPVPVVOCLVTLVPpVI
tMVIICSTT14.VLIVSGAY8SCA0GlLNVNSAFIO~tSLGSLGSVIVIL
ECLIGWtGIGLNI~I'1'KOALKDVCOPATSLOEILCNPIDLETtRELLIRNLLGVL4~ILANALPGY1'1'IL'itiF
ACAEKSIpYMIPGRRAM.WI.IfALYVLIIPLGCVIOIOtNIWILSD1'0 PDSLATKSNRGNL .
fS(RIVIL1JCI11LIALLKDVLSINRWALLttRECSVADPVRNLD1 0867 983105 981667 CPeL0877 995521 995982 CPn _ ybeL family roM-Rod Shape Protein RRRDIOLLSPAF11YGAPIPRfYTCOCJ4GISPPLTFVDVPC11AQSL1LIVEDPDVPKEIRS
KYFRYVNSWVFLW
CIRIPa9liICFCHL11A0fiNFFYNINNFNILEIYSLINSNIIMIYHOCLWINNIVYNLSTLITNLAEGAEIFAVOGI
JIT~IKPVYDDPCpPWCQtIRYFFTLFALDV
LTI14.LSWVISShmPTANLIrI'SSKGLL11JKSINOLRIIFAt~IiVVFFIGYFDYNLfIOtW
AWVLYPtTlIGILVGLFFYpSVpNVNRNYRIPFItOISVOPSE~IGI~VIVIlQ.67fIL~RKAVLPEC~1'RDQLYF~
EFNIIEpAEIJ~1GTYEKS
DITSK'iTAFLiICLWALPPFLILKEPDtIJTALVLCPVTLTIFYLSNVNSLLVIfFrTWAT
IGIICSLLIFSCIVSNOKVKPYALKVIKEYQYERLSPSNIOtQRASLISIGIGGIRGPGiICCPtL0878 996660 TGEFAGRGWLPYGYTDSVFSAhGttPCLt.GLLF1'LOLFYCLICIOCItTVAVATD~GIC.LSET Doolain P~isin AAGITVYLAIDM.INIS~'IOGLLpIl'LIfPLILISYOCSBVISTNASLLIILOSIYSHRFAKGCNStVS'fEPCSSI
NISL~D10MIDSOPYSLDR~BELLiIFRFLPSLV1'81WK11COOIClLC
Y
NItSnIRRLISPL7110iLGKLNKODLLCPPAPPVSVCWINANMGYGVFARDtGPYII'YIGEY
TGILPlIROAIf~fDCmIC!'RYPNPLF1'I.RYFI'IDSDKOta:M'RFINNSLDRiAIJIIGVFS
CPn_OB68 986733 981670 EGLFNVIIR'IYJIPIY11G0EICYHYOPLriRDIRKKREEFIPEF~
CPfL0879 997163 996615 yyaT-metal depefdenc hydsolase YRIIfiKVBNOGFFpW&GSKCaNSAYIG1'DSCKILIDLGVSKOWI'RELi.SINIDptDIOA
IPIrllItHSONISGIKSFVKAYNTPIYCHLCI'AMt.CHLLDSfIPEFKItBZOSS!'aODLE
VOTFNVPttDAVOPVAFIPHYRE~CFC1'Dt~4~'SWITRELYDCDYLLII~PB.VR
OSORPDVYKIDtVLSRGHISNOCCGQLLOKIITPIa.KIC.YWtL8T0d0'171=LAISIYSE
SI115ITSIAPEGWIOGITSPIYFSRLCVJlCIII
CPtL0B80 999861 997111 . ttsK-Cell Division Protein FtsK
PtfIR~O(SRRPRLYfLPtJUIItASLYLFFIVCFSCLSLWSFNRDOPC1'Q~RIIGIi~QiifBS
IIPWLAVILHDGS'MNGIIrALRLL1CS
FLLYlFCiIAAFFIpLYFWLBFLYFRRTPRPLFtYKJWIFISLPECSAILLSIR.iPll~l't.
PALi.D1'JtLpKFIL~1PPVSYVGGIPFYLFYCCpSFCLKiiLIGSVG1'ALIIOfVM.IiVL
CPt~0869 987179 986658 YLODGI11LLIOOtTFODOHIKAFCSFFpI'CFIOfGKKLINRANYLPKPSVPFV8101P11Ca'K
CT778 hypothstieal protein SOpSPRRVSETIILDCSISPLPOEEIPCSKKESTFLTPNpCKRFLTKIVtIpaUlx~lt OfRiRFFFPICtS~'11'SDCPQt~ILAKIKtQDPNOHFICSRTPEDHIIOfVRDfDtRVCKCEPNTTIALS$1'PlyV
R6S1a3KSRAALPIC.KSLJ1VPCIDLPQYHLLBIDiRtJlRpfiLOAtLtAIUI
T110CPF7fNW~NALS1G1IFIFFIATLFFLIpi'NRALQVKSLISLCVGWI'FYftGCLKARKALIL1~'FLTSIOID
ADL.Q1IC9CPTLAAPatLPNSCVKVpKIKSLn~IDIAIitL0A8iIRII
w7lYl~.SHRSM.EGD'IEIElNt'D00CIlLRILFlI'AGFfmPLLOnIVEYVCSDbTLLLDTAPIPGKAA1K1IEIP
fPFPOAVNFRDLLEDYQKTNRKI4IPLLILIKKANDpiGIAOW'lflP
MIRES.YIRKmLPFIPLI00CSRIL~LCGWIIFLpLVLCISYTLALViSALNVLVL.SFLHLIIJ(DT1CSGKSVCINf IVNSNINTTLPSEIKLVIIDp%KVELl'CYEOLPIIa.BPVITI
NAKIL70JDKISFJtVWVLCIFITSASI
ISBiaIKLLSREVYNALVWLVK~IBSRYEILRYLOLRNIOAFNSRTRNKTIFASYDItCIRt7?8/MIGI
IOEtSDLLLSSSODIETPIIRLAONAMVGIHLIIJITORpSREVITQLIKANPPlIII=FK
CPf>_0979 988A81 987118 VSNKVhISOIIIDEPG11H~1LIGN00lQ.VLLPSVPG1'IRAODAYICDEDINKVIOdCSIIPR
serS-Seryl tRNA Synihetase-2 't'OYVIpSFNAFDDSDSDNSGEKDPLPAQJ1KTLILQ1'GNAS1'1'FLARKLKICYARAASLID
TI'fNPI'QGFGGAVILPFSPISIJIRItIRKSCCSEKSSIYSIiFCTLLLtC~1E1'SlR.DIKIIRKOLtEARIIGP
S~CAKPAQILIONPLEG
TPEDCEIRLRIDCDPKISLEPVLSLDKEVROLKTDSt1T.0110RRLLSODIRKAK1'pCVDAT
NLIpEIII'LAADLEKIEOHLD010~IAQLNELLSNLPNYpJI~IPVBEDKAGtfOVIKSVDDLCPeL0881 PIFSFPPKHHLELNQELDILDfOAAAItTrC9L~rIPAYIOJRCVL.L6WALLTYNLQKpAANGFNo robust homolog present in Genebank/l3lBL
as of 11/7/98 OLiJI.PPLLVIOfEILPGSGpIPKFOGpYYRVEDGppIfLYLIPTAEIIVIatGFRSODILTEKENKKFAVIMPVPID
NSSRNLQhIfPFSLEDLEQNAEFSP1'HOSAESSSLOLSIrISSAISSIiV
LPLYYAAGTPCFRAfaCrWGAOEItGLYRVNOFNKVH9FAFITPNODDIAYEIDILSIVCE7iEQLSSLVL~ISDPSSL
RDVPIFSAIYESSTFrI'PVPTPLVGVGYINDSOSOYYCtORES
LTLIU.PYRLSLLSTGDNSFTASKTIDAE1MLPG0KAlYbI7SSISOCTDFOSAASCTRYKWiLSOLLGSRRVEWYNOG
NFfIFASLIiJLCPRRPRRDPSPISLALLEtyIFIVFFLBNPPGS
DSQCKLQF171fCLNDSGIJ1TPRLLVAILF~BLpQADCSWIPEVLRPYhCCLEILi.PKDOTP7JPIFFW
CPn 0871 988766 989899 CPn_0882 1006169 1007101 ribD-Riboflavin Deaminase No robust homoloq present in Genebank/t't~I.
as of 11/7/98 EYNE:DFSEQOLFtTIRRAIEIGEKGRITAPPNPiVVCCW1IQFNRIICDGFNAYJVGGPNAEEM'POVALLIOYFFCN
GAPYVREALRLTPHA~IIVWGICPSLYPENPRSLYYRVSr,DIGS
LAL~JASNPISGSDVYVSLEPCSHFCSCPPCANLLIaiKVSRVFVALVDPDPKVIIGpCIaRFDORGFVNSL1IETLPY
SSGSFGIEWISII'DPTPNFAIVNIFNRTAGINEVSRPNl0~1'E
Nt.ROAGIQVYVCiCESEAOASLOPYLYORTIWfPWI'ILKSAAS~FDGOVaDSpGK90NITCTSLIDIRDL3F~C6V~
irtDSLEOEFSLJiGIVCH71JCCVSIftVTSSPNIPYIIIpTLi.GGPE
PE~WIDteGKLRAESOAILVGSR71250DPNLTAROPO(stLYPKOPLRtJVLOSRGSVPPTST4AEAEFi~IPtFPNS
I'IDSLAEIlQBIWRISDAVSIIWIFPIVDTTYNGVWJNCIGPICI
KVFDK'tSPTLYVTTERCPFNYIKVLDSLDVPVLLTES1'PSGVDLHKV1'EYIJIpKKILpVLNCICSTFLtLTNPRS
RRORWRNLRIMVLCYRSLGSGIO~tLFDL51~NRNAMRiM'SCIYA
VEO~'fLHTSLLKERf'VNSLVLYSGPNILCDOKRPLVI'VIGtILLESAaPLTLKSSOILGNLYANV1'LttCWtVaI
00J1tt0YCFPSVRDAPYRYCLRNRYCLTpRNEDSi.Q1'IIDTR/pY!'lt1' 3LKW WELSPpVPEPIRN HLf00pNVAS I tldl.~'VFGLFFGF1IGLMlI'PCGLEIS
CPn_0872~ 989903 991216 CPt~0983 . 1008901 1007577 ribAiribB-GCP Cyclohydratase i DH8Pdntpp/ogre-Phenolhydrolase/NADH
Synchase ubiquinone oxidoreliuctase KERIFRVACLASESVNARESNIETREEVCSAHFVSLEMIEDLRAGKFVIWDEASREDELYELFIKSCIFII?IML:C:L
YFLCIASLLFCAIf3VIL.ACVILt~:RKLFIKVNPCKLKIND
CDLIIJ1GEKI'IYEIDfI'FLLOHTIGWCAALSQERLLSLDLPPNVKDNRCRFKTPtTVSIIDNEELTKTVSGy?TLL
Vf.LLziSCIPIPSPCCCKATCKOCKVRVVKNiIDBPLfTDR9TFSKR
MFYISYt'IGVSAADRTKWOLL10PKSKPEDFISPGNFFPLASSPCGVL1(RAGNTESTVDLOLttGWRt.sL'CCKVQ
tIDNSLEIEERYi11A3S1IlLTVLSNONVATPIKELWAVD~KPIP
Mff.ACLpPCITILAELVNEDYS!!'8tLP0ILEFARKNNIAVIPVTSIIANRNLSDRLVSKISFKPODYLOL'fVPa' YKTNS3DWKp7IMPE'fYSDWEHFHLFOpVtDNSbLPADSANKAYSLA
3APLPTIYGDPfLHVYESLL.E~IQHLALVKCNVADK.3N'JLVRVH.iEC~'fGDLLCSKACDC~YPAELPTtKFNIR
L1TPPFtNCICPN.~.EIPWf:IK3SWF.~.LKPGDKITVSGPY~INKD
.EOL,i3AMSYtAF.KCTCVLVYLACOGGRGIGt.GHKVPJ1'1AL00NGYLTUDIINlJINCFPVDDDRPLiFLIi.w k'r.::Ft,R:;HILDLLUIYHSKREIDLWYr:ARSLKtNIYOEYEfI4EAQPP
SREIGICAOILVDLKLTTLKLITHNPOKYFr:LOGF,LITERVPLPVRISF~NEpYLRTKNFFIYNL'JL::EFLCEOL
W~JpItODM'YTNFLh'MFNLC~I-:RLONPEDYLYY'/r~'PPWN
4fFN,HWLDLF't:CNNRVO .'.::ILKLLCD\'.;l:Ff ::a ILI?DFt;,:
:F'n ~)a7S n!IIINN v'11511 ~'.t91 pNN.I lit.1rsr.N Irlll'Nlrl') ra.ERibityllum.Wine ::ynth.\n.f 1'.T7ll hy(tttth.tit:.ll ftfl.tain f:aIhJTtr:ltB'IFFEYMARI.Kt:ItL:IAKNLpfAIlflC~ftFyAtIADALV::aQETFLKFl7G:iE~:fif:
ML::RIV9l:Fl.h'LL:::a.CLPAEEEALr~::Ktffi-/r)I'AVMIr\IAfI.I~YYFtI~WNtE~NRR
IJaXCIR'Jh:AFEftt'.TIKKW.:::a9IKFDAtVAf'.(T/LIrJCET(kIYW~tIVNtJVn1117It'..\L::K
AMKKKKNDtr\Kua?KV'1'AtA:fItTfVi7f.IPFJft'VtIlltA::r:Y.'/IVIJCIaII::EIt.Y('fI~NK
::
LEt'r:L1ITL::IV V1f':'.e\EtAWt,)R:xaK(:RHV:VS~fAIEMA'fLITy r'.fn OHNS InIOn~~.U I~NI'r~ 1:
~:Ir._tIN%A '11116A n91'74n yycArHNA M.tlrVlrl.lrca..r::..
'1"7':': hypntlN,r i.:.tl Prnr.irt A:al:l'M:7IT~Wt.'1'llht:Vma:::l'1J:al'I::Ix:LYYYE1:L1.INJl.h'AILVI~.:hhllAlLltt .'::I
f.f::WJf.KILTKpRINtEF\::MWIlLKIKVLVFPLALW~1'Y_tl::Ir:yAC:I~\:::WTN:b'fKVK::LRr:
RtIKMEF'::FFVfYFIa:K::lt:1't::::l'YIKK)afVTh'LLIIIIJ~rfM)tLKI:fNhiIMnfll fr:::F.'/W(HpKLRUYPh:I.L.W1.T&~O1:A1'LL'f.~.TDIOItIIY:EKLFNKKVPALDIAtfc:MIHLFEI
IIA'fFPI'KNK.:::1:.'f1:1'L'It'Iv::a ,r.llYM/It.T!':f.Tl'f:%hVNI'~1f'ltltS.Iltt:IlJ.~.::::I.
a~
NtA:aYWlEKVAARCLiCYYETKLLYI-dPaWY EFLAFCU..L:!5C~FYiEGL
:!ItRIF:LL::::..:K:AGA:K~::iE.'atQKA.iSCfIPGL
~50CN.iAaF"LRPR~FFOPOLTQ
AAKItETAVEFINPEC~:.:.:LY~-CACt':GINL:PYVKNVIGVEIIPOAVASApENI%APLCCAEW1YLYC'IVLRONPRDt'HWINRCRF'Jt.;:A;.fI
C;.1'.:.Y~::.NLICfVVrLE'JL.CE
'VEYILCOAKAFCKRHENCKAPDVILtGFPRCCIWSK'Jt.KIILRtG.iPKIVYISCFROLNaiR'l'PrINPIYEt' t'1/0'~lTiQt'0:1~."!~tJ:NANAIIfWGNItFR''~'S.
NNKEDI F~GK'.
~
~~
. Pf7lYG
NPKTOFQECADLta~OCYRIKtO~IDPIDQFPYSTHLENICLLfREIOPYCt.~00CPNh<xVVSHEV~GSFWDaIJ4I
J'etiANi'ftRINI~i.UC' Y6~lRTi ' ' ' LViAHTIICHGSPK .~~.NKAtIQSPLCV~I
N
WDVYEIDCYOFTNINfTFSSIKRGOERf'( 0996 10:1299 l01 X709 ETKQII~MLPEEItFfVPPAVIOJFFAHK:vEDRK110EQWLDEVRVWSK.7FPEIJtEEEYAL:S
: Pn _ HKLPIQ4LE3t::CSVE."tPDSIAGRAA;iNKLIOVLSIOHIPYLI~L,iSSDt~.WIANt~' nccA-Htscone-Like Developmental ' F:~tean ItTLFWILKDTAKNMKDi.Iw.StiHDLIKAEKtTIItAAAORVR1'DSIKLdtVAKLYRKESiKAililYDFSCRNTK
YUVPEfCtMTtNNf:LAYS
OVFRPFGLTFLVFSDYNRNAIRLAALiKLP
... . ...,.,~ ... r.!:y_ ,;... . w_F~'!C..._. . . -:'Z~lfaA.':'-;
. ~.,...... ":.~F'.''T:4;s\':F'Y~:'<'t :.t?f'-~.
.. . :..:R.~ALw~. . . . . ~ , .:i'.F':
!"!'": ' "\:. ":.1K:.Ef :,ppWRVS':;FH.iIELFtiIuL':'ruXv.::%.:.:::L.:1RV.;:'cr4ianliwwiXY
iv:.iEtitrlitlMD
:Pn_0897 1011692 1014157 RFOYSCA:DOVSfCCFTi .1~a,l:.~R::.oO
CNLTR possible phosphoprotetn NKKLYHPTLFLRPLIRLSLIFALSLTLISQ4FPQQKSFGHCCAONNSALISCKNCCCtACPn_0191 10268:3 10.5988 DPIERVLADRtTLTJINOwG'ty'JVLVREYLLKCIRKGDCDYCVItILOKIS.ALRLPKIIiIRItDemn-ANP
Nuclwsldase LpILwIIRtNPfpAPLRDWOQLFTIGGNLSI4DHLLFCLYIIrTt~ISCYENRKOtxIQiJIKtPRI4D10tA104LRR
KHYKCERVSKHTSESRIt10tk4.1:RY~SSVICOFCPYLLLTNF8YYI01' OGDYKKAIEWCF1.VMLiKOSCSPHPEIVOIEKTFt.OKTI3.At4IKYAQtAOESCD11LI.FAKtiICVPVFf7CSN
TPYCLSEiAYTEANDAWLRIARGIVSRTNEVDSVLLSNAI4NLPFAREXJ1IPELEVLIDCLRSFfYOVCDY~~IRCOG
TSDAYFPPEVPAt.7INFVVpKATTCVLCDItKAFiYHIGIT
NCfIYLESTLLYYAYFSLLBLYtIpNImFIISLERLLEKGOiIVLrVPENPYIPEYOFFLOAYFYtlrlFtIRIWEPNK
1CP1UIKLYlIKAOSA~IEGATLFAAGYRIINLPIrALti,ISDLPLIfI~Z
AKGKYLSJ1GIMLOIIDPAVKLCATFARAYLYIGCZAYVGNNY~fAEEYFLMY%SNCREKTKSfONP'IfN~Y'fF9KI
LTGOCVIEFG.EKVFtLKMJvSDNIOIDppIfRGLPNNEVCt71IX1t ESGIGLF'IJvYAVQKKXTACEDI4LYNPKFSIIYRHLLDSi.CSLSYPt~SNIOGSSJ1I0RVNRNASGSE?SDSDY
AVPEtSEIYSRCIYMIKYRNVTYTtIPIIEt.AYNOV11NLEKRNLLEICRD11QDPCYDKAL
AFHGALQSCASVPRSLIESStNDE7UtiTIRCYEALYFlrI4PDAIJIFIZ.POAFSEEQ4SWOTACPeL0995 LRLViPI'LVRPKGAPNNAKYWDHLVLRPHGDSLYFFCYDLOEYLIGKEDIILKNLSVFAELFefp-Elongation Factor P
PKSSLLSLVYYLOGYSESSAiJttIVCiJFVKALEEF'.EISNSGENNK1WAYIYYNVItLDt.iIDEIDCFNVRVSTS
EFRVGLRIEiDGOPY'..IIIJDPVKPGKv~7A!'NRIKVID4t'LIGRVfERT
TYISGCNFSOAVNILEEVK~WpIIASNPKLIIFLI(CEDLYLWELRWVECLi(YAYFOLHETYKSGESVFL'ADIVERS
IIRLLY'."DOEGATFft~tIFE01;11VA4EKLFNIRONLLEDTIYTL
' AHLSt0ILLENVEKNLISPRSYRDYYCESLQRTLGLCpRFLCVVLYNfr~VItAVEPPIPFiELSIAETAPCVRCDfAS
GRVLKPAVZ'N:CAKIIIVPIFIDEGELV
KVD'1'R1GSYESIIVSK
~PILOCAB 1015141 1014119 henG-procoporphyrlnogen OFCidase CPn_0A96 1027574 1027822 AERRFCVKRAIIIGJIGISC:.SX~IwLNKKFPOAEILVLDKFJ1YA0GFVltTESP~OGiSIDLCT753 hypochet:iul Drocein GPKGFLTRGDGEYTLKLIHELGLOt4SLIFSDRAAR~1RF11YYROKANKIST1'11'LtJIKCti.PBKYFI'FlVIU
d'DJ' lltItIKELSKGOLLIQfLREKSRVLDEIOJKRItANVAIq,VAIIPESIREIE
SLIKDFRAPCYTpDSSVODPLKANBSONITSYZLDPLIT11IRJ~ON&SILSTIDIfIFPS.iIKKCEKVLTPQLFQAI
AEKILE~V
REASSCSLLRSYLIQJRSPKKSKTDRYL71SLSPSIGI'LITTIOEK<.PATWKFSTSVTNIDC
SPKFaCVL'1'PSETPFADNVIYTCPLQpLPVLi.PHIICIENLSKAVLPWfG.SSISILrIRltAF1CPet_0A97 FSLPIOGYfiG.PADELPLLGIVwNS0IFP0ATPCItTVLSLLIEGKI4RESEfvNAPAIAAISEIphospholtydro lasel YLNINOICPDAP'AIFSSQDfI'IPOHAVCFLERKERILPNLPGtd.KIVCpNIACPCLiatCIASNPSLDSttI'VDO
ICdfSt4PRPM0EKPRlMIRIINISDVNFNVLPVNPVNCFNKRLI~LLKKV
FCLVNFQJITTICORFPKtIVR51a71DSVCITGDFSLTANDCEPLWtNIYLTLJ4KNSSVYL
LPOMIDVYTIUSLiIpQ1'FYTIIlpNDQLOpNKVSPNKI?DFMMLILLt7CSQJia41S11lLY
CPn_0899 1016941 1015462 VNLtIOISAIFfFLLBLSPEEN11IIANF(YPLLSSONPSNDLINNfNiptiVLXKrPKVRi.YL
hemN-Coproporphyritwpen III
OxidaseAAVYNCAD1'SPSYIUi9GSISLPTNSRPNVI~.YPEKYQVIrMILOa.LDIDAP
FIJtFNVNFNfLECii(pPAPRY':SYPTALaWEPSDAAPALt.At'ORIRFNPOPLSGY!'IfIPFLEIANEATNOCp KL
C0574CLYCGCSWLNRREDIYEaYINTLI0Et0a.Wt:TIGFRPOVSRINIOGGTPSRLSR
ELFTLLFDtiINKLFDLSHAEEIJIIbIIDPRSLRt~lIEKAD!'P~7VCFNRVSIGYOD'IOADVCPt1~0898 OEAVRARpSNEESLKAYEKFKELAFOSINIDLiYGLPKOTKLSPSItTI0DIL71N1fPORLANitochondrial NSP60 Uaperonrn Nomolog L!'SFASVPNIKPFpKAMtAStINPSIIEOtFAIYSOSRtG.L'!'KA4~0AIQ~IIFSLPNDPGTTKKRi.OSVKIIi ttiGVC~ISEOCKLStMiADKKLFSGIDXt.FOIVfOCSYOPKQiLSPTiFF
IJtFIORITLIRNfOCYSLPPEEDLIGIrCaTtI'S1'SFIRCIYt.OI~NtKTLEEYI0T1'VLRGTfATVK~' 'YAISOTELSFtSIfC4i.CVDFNU11NNKINKENBDCATTGLILtiDIILOiSIfAIILEK
KSKILTE<~ItIRtIWIIINKLIC'ZF'"It4l~EFFNLFL'IfEFD2'YFIFSRDRLI~E'IlGLIt04CISTHKI.I
ASLKLOGEKI-0SALpppSSiPIKDAi.KVRNIIFSSIJ~P'1'I11di1YWIfSWC
SPCSLKYtPiGFS.FVRVIATAFDHYFLNKVSK!(tCFSASIPEGLISITKERG~IL1'SI~VFOCFKIPJIGYASTYF
VSDTASRLTRIANPLILITDRKI~E
INSLLPti.OEIS~NDNLIIFCtDIDPtfVLJITLWNKIQGLLpV'IWl'IPpGiITNOt<.i1 CPtIr0890 10I7A29 1016519 EDIJ1LF'LO'1'ILICPCpI7ISItVt.iIPENV1't-0SCLSIEISESQ1'Ti.IOCLJtiILYLTLILTWf.
hsmE-uroposphyrinogen DecarboxylaseABEIRTCSCLC1'RIIIILIKSTNRt.QSSVAILPTDEDtVEPLYTLiII~tINf~ALI:RCfIVP
STIJA4WDSFtSJIFFDLLKSOTASHPPIWLLRQVGRYFtPPYOES.IOCSQSLKTFFNtifCAIVCVAL/YASLTIGT
PKDDADENSIAISLLOKACCAPLKLi~1TH11DL.DCD11VIAKLSSLC1T8 ATGt.CPSii.NVDA71ILF71DILSILDCFAV1'1f~'APGPRIOPSPEQPFTFTSDPOTIFSYLIlSISVFSREIED
LIAGOILDS<J1TTSTIIJIQALD1'AILVLSSKILIl.DpYCI!!L
LD11IRTLJt9Kt.PVPLIVFAASPPfi.ACIfLIDGG7tSIIDPSKTILSFLYVYPEKFDQLISTI
I
EGTAIYLK1'pFmJIOAAAVOLFESSSLALPSALFTRYV't'EPHRRLIA%tJtLOi(IPVSLtCRCPt1~0A99 CFELt4FYTL0AT0A0n'LNPDYNVDLttAIOKNiiCSLpr.FfLDPAIFLLPOEKLLNYVEAFLntttF-Nutasoyl-DAP Lipae VPLRTYPNPIFNSIitGILPETPLFliVpLWSYVQROLNNACxAONYfQtAFIt.L'EDNVSI~i.SOVSCPKCDKtCI
TGFAIDS00V0P~LFFALPON7lTD
GNOFLKf4AATAG11VAAWSIIDYpf~SFCLELIRVDCIK5AL0EACSNOCNf.IpO'ILVCIT
CPn 0891 11121079 1017819 CSVOK1T17CEFSKTTLSSIYKTNASPKSYNSOLTVPISLtXAOCDEDtIMIL~GVfiP04 mEd-Transcription-Repair Coupling Hpt#TRIVOPEIAVITNINOQNAWiFPp0I0EILKEKSYILOKSKLOLLPKDSPYYLDf.R
NFNIItIDFNPVNLDFSISKEFKEhTLPLLLFIdIHPCATJ1!'L.71A104E'NDCItASVIFIITIPARSCSPTAEK
fSFSFNDPL7IDPnfKAISCOSWZOTPE~4YCt.PIAFSYIIpAYTtdi.IAWIL
LDDLFEId,ATFLt7p11PVEFPSSEIDLSPKLVNIDAt7GIIRDNLLYStiJpHRAPITCYtTLKSY~IILBVPEECV
IRSLPELKLPPNRFENSMRNGNpIIINDAYNACPF~WIAALOALPLP800 ALLEKTRSPOATSOOHLDLAVCDYLpPEATTEt.CICSLGY50VlQ.TSEKCEFSCAGCIVDIGKIILILCHNI~LCRY
SESGW1LYAEKAA$RCDIIIPPICEKWIPV05VLKSYSCEVSFFS
FPLSSPEPFRIEPS.IGEKIISIRSYNPSDOLSTGKVSKISISPAYTET~ISGGItYSNSLLDYSAODVKDILKOVARI
fCDIfILLIfGSRALU.t;.SLf.ACF
FSTPPLYt.FDNLEILCDDFADISCfLSSLPDRFFSIGTLYDRISTSNQVYFSETPIPNVK
NLKINRVIIE1F'NRFMEASROAiPILYPE0Ii0NDEHPLLAFLONLOEYNPPI~KPtJa.ACPn_0900 1032208 =YSTKTKSLKEAAAL11ETVARGDVEIYEKTGNLTSSFALVNEAFIN1ISLSEFASTKVLRRmraY-NUrawoyl-Pencapapttde TransEerase OKORTHFSV7TEEVFVPIPGCnMiINNGtGKFG;IEKRPMiWIETOYLVLEYADKARLLVFFfILaASNIPLIPNPL10 SLFPSIJ1LT1?fl'1'LVLTVAhCVWNM4LItplC4YRDrINKt YVPStpAYLISRYVGTSDKAADi3iNINSSKWKRSRDLTEKSLriYAEKLi.QLFJIpRSITPYCEKi.BMLNKDKAEY
P1COGVLLFISLIASLLVWLPWCKFSrWFFIILLTCYAGL41VYD0 AFVYPPNCESVIKFAETFPYE!'IPI70LKTIDOIYNDlILSPKtIIDRLICGDACFCKTEVINRIKIKRKQGNGLIWt NKPNVpIAIMFTLIALPYIYGSTEPWCLKIPFN~FIfiLPE'WL
RMVKIIVCOCHRpVIVMVPTf::.aTONYE'l'FKERM71GLPIEIaVLSRFSOJ1KVOKLICEOCKVFCLCL1LVAII
CrSNAVNLTDC:L0GLiIAGINSFAAIGPIFVALRSS1'IPIAQDVAYV
'JASGQIDIIICTNKLINKSLEFKNPGLLIIDEECRFGVKVKDNt.ICERYPMIDCLTVSATPLAALVCACIGFL3IYN
GFPAQLFtIGDICSLLIGGLLOSCAVNLAAECILWICf.IffVAGG
TPRTtJtNSLSGAADLSVIAMPPLDRLPVSTFVNEHN~EfLTAALRIiFI.LR0GOJ1YVTHNRSVItAVISCRWIKKR
LFLCSPLNHHYEYqCLPETKIVMRfWIFSFVCACLCIMWtR
tESIYTLAETIRNLIPEAftIGVAHGpNGAEDLSNIF'IKPKNQKTDILVATALICt4GIDIP
NANTILtOHADKF1~WDLYONKCRVGRWNKKAYCYFLVPHLDRLSCPAAKRLMIl4K0EYCPn_0901 1033279 :::(~ICIAt.HDLEiRGACNItw.DpSCNICTIGFNLYCKLtJUtAVSALNKHTSPLLTNDDVmurD-NUramoylalanine-Glutamau Liqase KfEFPYNSRIPDTYIETC.~.MRIEFYQKICNAESSEELTAIOEENRDRFGPLPOEICWLFAFCFIRRSRYSuCiJtEI
dICpRILILCTCTTCKSVARFLYQOCHYLICAONSLBSLISVDML
LAEIRLFALQHGISSIKCTANALY~/OKCLSKSEOTKKTLPYALSPTPELLVIIEYIESIERHDRLIJlGA3EFPO4ID
LVTR3F~CIKP'fNPWVEpAVSLKIPWTDIOVALKTP6lnAYpSF
~FLtNAS
CtT~JCKTlTILFLTHLLKILCIPAIJ1M(.TIICLPtLDHltt7pP.VRWEISSP~JITOEE
N t PAISCSVFtJ4FSRNHLDY11RNLDJ(YFDAIILItIOKCLROD1ITFWVWEECSL.CIISYOIYS
CPn_0992 1027673 1 (13101 d EEI EEI LDKCDAWtP IYLNINtONYCM
YAIJINEVIIJVSPECFLItJ(tATFEKPANALIYi.G
alas-Alsnyl (:RNA Synthetase KKOGVHYINDSKATIYTAVEKALJIAVPKDVTVtLCGKDKCCDFPAt.AS'lft.SOTIIttIVIAN
EFFFNLSNIIRSNFLKFYANRHNTILPSSPVFPHNDPSILFTNAGNNOFKOIFI.NKEIIVSCECI~TIADALSEKI~P
LTLSKDLOEAVSIAQTIAQECCCVLL3FY-,CA'iFOQ!'QSFK611GA
YSMTTSOKCIRACGKNNDLDMtCHTSRHLTFFE?tt.CNFSFGDYF%AFJ11AFAWEYSLSV'IFKLLIRF11C~AVR
FNfNPEGTYATVHEKDDFJIFJ1LNEAYLPTDRIFRLTDNONFWSNANTCPCCIf~SELLFDR
~PSFrJ4ASSPLODTDCERFLEYWNLVFNEFNRT3ECSLLALPNKHVDNDN3LERLVSLIACPn_tt'W _ lA3Sfl7 10)5311 r:'fIIT/FEADYLRELlAKTEOtSCICV'!NPODS(:adFRViAt%IVRSIsFAIADCLLPGNfERnlpD-Nur,lmm4te:r. linv.vsln rep'tat t.tmlly ':YVLRKII.ItR.',VHYCRRLCFRNPFLAEIVP;iIrICnVICEAYPELIWt'aL.~.OfOK'/LTLEEESAVpOkt LV.:::EVtaaIRRONVITA'J'/VNAILLYALFVT~Kkl!:VY.DYD.F~n,FPHFASSKVTOA
F'FKTLDRt%:NLWt)VLII:i':':~.
$Ct::jEDAFKLKIfI'YQ1PIDEI:iLLAKDYD'ISVDNDi'F'N:',EEKVIEKCWAEVP::RPtAKETL~FtE.:K
PViVTTPP'JP1/V'.'.ETDE1/iTIIAVPPp IIKLEDF.AKER:;1!KNW(R;Qf~I'.~.G:i'INELIILT:'EFL(:YDH4~.t'.DTFIFJ1IILY.DNIVSF.LF
VRE'IYKEFX~APYA'('VWKK~:OFLERIAPANIpITVAKLt4r)IHGUI'I'IVLKIfDpYIKVt'TS
:~EKUF7:AIYLKV::PFYAEKtI~VGG.~(;gtt'f:.~.F~.'fFIVTIITP::PKAc:LLV111N:RISOCSLI~D
V:aIF:K'Cf'>.1'(''t'.WIPlIY1'tW~Er:D.~.PN'1'~ALRNtIIRLDDLLKNNDLLFYKARRLKPfID
' IYFM'/TAtWNRYRRKRU1NMITII:HLtJtAALEITLf:DIIIR(L4:;:W0tITKIRLDFT11P0VLRtP
~03'i~'r AI::PELL4: tLTLVNR::IItENEf'VfiIREILYwL'ifW:..~.EIKQFFY:DII7.~.DV'/P'~.Y;H~S
III:II!Y?I'IL\F_1'IYA)O:FF'RITKFJIi:%AFI:IkRIErWTr:EKAIGV'fVH(,~J.~.EVLEEtA'rLL
.Ot.'Fvsnwm IW '.::1'. Inlv..l1'/
Vfklli(V::Rt:hA'PI.DERKQt)UKRI~fELFSt::LyfKLDKLIIINt'llhl!y:IT!:L'/NHIrIEHEtt::
W n..l l I'mt::m.n Inrn..ln t'n.;W
NIIIIIIUV'IA~x'IJIIJI!11't:KLI::LSPI'TEKtt:KYII'L:-.RV!:f1111.19'(r:VIIAODLLYAVLTPm.':Kr~:Il:NKNFVI:a'Lfl:ll'::Ia:I.INVFVY:::AIVI~
P.~.LFT:TIIKALIINJVrYLIIl~:I/
:kYI7r:KtS':
~l~x:;:ADAL1'A'ftVIHETLbi(tYltA::I.LYNNF1VRUFLKI::INI.La:iJI.ALIr:'/F'ff~:IL:Ll :fttJ:AkkYII~:F7Yif:PIVI".IKINK
:TyLi '(LV I' I VALY 1'I: rI'::.~.L'lJlit LKMYLY.I:1'A I l.l' I I' I LL IA
L EP Gtr::'.,yl'/ l::A::l. t INF
IN
"1'n_'IH'm In.:IN-2 tn~'iNRr;
't'::VkI.ItYWl.l.l'LLr:VLINi:AIh'/fMlY'/nYIILNVYL11PELDIKr:IU9Y~f'ICIAKfAH::X:
I Y.t n 'fLmr:knfl nrl.t::. KL1I:I!r:11N1::LUKLTYLW:1~411.YfAA
i'fAt:F:F'r:Ff!?ILVLLLLYM.'h'PNY:YAtAIKA::
::LECM4AMttTLII'K011FKlltIW:~LLF.~.Y':. .FF:OfX:3CLIANFICCIffLLLKV
VbIFB~ful.lp autr~tt.lmv. c~. nolaa 'f DESK::.°~L X7tRFRRPNCP".niUSKGiFFS FHLKKfaTL:ld I I f: f PKIVGFf40:iF"-F.~.(.LKVAAKA I OCKK:.?JNL::~w"1JR'.":3EP'.":YFY
YV~~VNVHVKAl9ltlfiø9E1'.1<AIKOKV,'~~'I,I~NEi. L'I~Y~t/:DY~I aV.~l ':Pn_0904 1(175720 177396 RLEEtJ4ItOCPTYPDKLIa:: ~ . ....
muc.-PeotiJtwlYCan Transterase RYINICKIRKVALAVOtiu~CGHIVPALSVKEAFSRECIDVLLiGK.f'aL100lPSLOOGISYREI vPn 091e I046R1) :049094 P!xLPNWPIKIF1.'.RTL;LCCCYLKARKELKIFDPCLVICF~~..SYIISLPVLwIGL'aNKIP fabF-ACV:
Carrvsr Pr~cefn ~Yntnase :.FIJIEONL'JtY:(h?ra.F.~RYARf:L~F.":PVTYI!FRCPAEEVFf.PKR.iF.iIA,SPMIKRCT
LLHrIfRV'Ma'KKRV'l.'~:P'':1':SC:uNl.1'CTFYDNLLACVSGVRPTTSFPCEDYATRTJIw ...,.:... . .."... Il.;.:.lt~'.~ . ,~..;~La,.~... ...,.... ..,"..i.h..:;- .
".,'Tl.:
..t..,..,1.: ~nY:.~.':= . .Ji::::I'K..L...F.: . ... .-..y-v ~w::: .
" . ,Al.. .. ... . ~.~ u:;~;
. . ,... ....1. ,. :.,,~lYr': :~ . . ~Hr. . .'~ 'A."'..'C
../~~,~~CCw~i:'1:.:.nvlLLi:.:i ~::.'f.':?':: . nt't:.~.~, . ...
DVLEGG'MILEKELTEKLLVEICV1'FALDSHNREKORNSLAAYSQQRSTK':FHAFICECL
0~\AYGHLVS4Rn\OK i~w3Ti:AAVrNA:.:Leu:FtlFilidi.9ERNGAFuWIiRWIDPDROGfV
4;1~AGILVLEi'LE.iA_IRRDAPIFAF14LCSYVTCDAFNITAPRDOCF~ITACVU'ailllISA
CPn_0905 1037400 1079875 ..IPKERVNYVliAHCTo.
Pt,CrLSEIrt.AVKKAF'GSHVRNIJIITISrKSLIGtICLCAAOGYEA
murCiddlA-KUraauce-Ala Lipase i G-Ala-0-Alam Lipase WAIpAILTCKLHPTINLDNPIAEIEDFOWANKAQDWDIDVAKSNSPGF~O~~STTLFS
.____.»._.~-~-~~-~..........-nawvw~a.aMVIeVTTPCf.Ylllf'JRC/~.IiD RYVP
cPl~o917 loleosa lo1as39 hydsolase/phosphacase namolog lNDI I4EVCTLVFll0C1'ItYEYSFGVTPIKFFGTPDIO~tfUUCFICFiTRCKMiCFPIGRtaEa KEDPOEAACRELVEETCLSWNFFPKVLTEpYSFNNEEOVlVRKEVTYFLAS1IRGDIW1D
pKETt:pSOWLSLOECLRLLSFPELIIDLTV~ADKFINNYLESS
YLRNYIRIHDVCVSI~GAC~fix'cf:GiJlLlcfarnrfusa.saw.nvw.ww.ai.w~...~..,r...-., CVOCLFPVLtICP 0918 1019272 1019579 A CPn 7 _ YISPtFYDVSYFIINRpGLWRTGKDFPHLTEETOCDSPLSSEiASALDW-Inotpanlc PYroDt>osphatase FCEDGTICCFFEILCIIPYAGPSLSt.IATAt'mttLLTKRiASAVGVPVVPYQPLNLCFr(K~'t' PELCIOI1LI>:I'FSFPKIVKTAHIGSSIGIFLVRDKFEG'7EKISEAFLYDTOVFVEESRLG~tIIESLCCYIEITP
YDSVKFEGDNATCLLKV<EtPQItFS
ELLNSKKPLYYAHPWHSPTLT
RP
SREIEVSCICNSSSWYCMAGPNERCC11SCFIDYOEKYGFDGIDCAKT5FOI4LSQESLDCNfCPCLYCLLPQTY~AS~
YS~'tfICGDKDPLDVCVLTEIDiINHDNTLLOA
' VRELAEAVYPA>'K7G)cGSARIOFFLDE~IYWt.SEIMPIPCFfI'AASP!'i.QAfYHAL~'1'QEQVLDKIOHYFL
TYIUITPIWt.IKC
IGGLRTIDSCEADDKIIAVLEDDLVFAETEDISDCPCI
' IVWIFITDALHKFDICQQTIEQAFTKECDLVKR tALVN
SPAKIEIVGIYCKKEAOKVIOLANCDYLSYICD
0906 1010514 1079915 CPef.-0919 1019375 1050170 C?n _ ltlh-Ltuca.ne Dehydroganase CT767 hypothetical protein FKRYSIlIFICEIKIDOYERVILVfCSIfVRLfIAIIAIHOTAVCPALGGYRASLYSSICQACT
' NE
DAGRL71P011'ItKAIISN'fC~'1'ODCtISVITLP~APBLTED10;.RAFu~AVNAi.DGTYICAD
KWGSEVLELV1~SQLSREASAFRLDIDFFIINIYPFFRNF104IELCFFLSISOFNLDf EFVAYIVIQJLVTNPFJIVEIRSIEtBSIKLEIRVAAEDTGKIIGRRGNI'IHAIRTILRfGYaINDISIVAEE1'PYV
CGIADVSCDPSIYTANDCFLCIKTAKYIiICSSSL1~IAI
RVCSRLIQDfVpIDLVQPFixTWIADQOYICDND55NSTFqIifGESZITCCSCHCH1IDEDLCCICSVCRALLQSLFF
EGAEZ.YVADVLERIIVpDAARLYGATIVPTEETNALECDTfSPCA
NQEEpERWNSCffCSNHH
RCNVIRKDIiLADtI4CKATVGVAN4pLEDS511GK12ItERCILYGPDYLVNAOGLIMtAAAI
tr:RVYAPKEVi.LJIVEELPTYLS1CLYNOSK'~11C1IDLVALSDSFVEDItfi.ilYTS
CPn _ ~cutA Periplaflmic Divalent Cation 1051423 1050471 Tolerance Protein CutA IC-Type Cyeoehrome Biogenesis ProteinlCPn_0920 GTSTYLWEGKL cys0->sullice Synthesis/biphosphece t#osphacase FAFSKFLIIKSSIffAVLILTSFPSESARSLARHI.ZTERtJISCVHVFPKILCBiElOISELPNY~1TVCSWTEITTQ
LS3:YRSDIRLYPFirEKSDGSFITJIADIIO&OYIf CESEEHNIOIKSIDIRFSETC.J1IOEFSGYEyPEYLLFPIENCDPRYLNWLTILSYPEItP' RLLTSSVSRD~.ISTLVPPIlPTS
LFVLVDpII7GTACFIRNRA~AVATSLIYLYRPILSVMACPAYNOTFKLYSAA10GI10LSIY
0908 1011607 I0407a0 HSQNLDRRFYYA'fKQFCE11SLAALi~i00NNA?RKLSLGLPNTPSPRRVISQYKY11LY
CP
n_ AEGAVDFFIRYPFTDSPARAWDNVPGAFLVEtAGGRVTDALCAPLEYRKESLVL~INVI
:.T761 hypothetical protein ILaILFHIIIKNNEI1~1TRRFFKTLTPPCPQYSL:CY11SILIVISSLYCVPTFCWLFLPELSLAS~O!'IHE'tTf.
AAL~IGLtIWPTDKLIJ1L
LSKFNPSPIPNLfLVSSTLSKVPP'CAIJIFIiLRL511DAPTYLNEFSIKD1FSSI3IAI.GIFS
Q , SLVIEKSPOIAODITTFYTI.QTPIAYVCtOtaNTI~TILfI;SCFL~CpPYFPSIiIi.PQI.
PfKTLLKELAKESPKIIDLSLSDAYPCEIIVTTSSGSLLRLPIKTLDsnOlycerol-3-P Acyleransfvrase D
KLPI(ElOlt LIDIO
CEI1G.IKLWAATYfI;ZM'1'FLVCRLLKLRYRNpVEfND'MNINPKpI,~LFLIIIMVAIZVaI
.
RALDLYK1090CSPVIE$EKOYVYDLRFPNFLLLKAL
IL6YLfWSRFHVRPIBIYEYLFItSRWQNFLNSVRSIPTPQLVPGKiBILRSLGDIBIC>(iE
CPeL0909 1011592 1041966 ASRAWRGESLLLYPSGRLSRTGKFETVNOYSAYYLIJtRVf~CfIWLVRVSCti~f7lffR
rsbV-Sigma Faeeor Rapulaeor YKpN51'PKLGPAFIIFJ~ALLR1~TFFHPKRFVIt IISLI1TRTLLRLtl'OlLal~7lGDiIVIYIAC5LD11VSVPSVpLYLEOFIpKKNLKI11LWFNOCDONLPIEVPYA
N!1'DVSYISSAGIRLLLSNFKLVCSROG101CLCCVICESPTEVKAI71GLOQLILLCOSEQE
CPn_0922 1052266 1053927 au-ACylglyeerophoaphoechanolamine Acyltransterase QFJWRSSLRITRKLAR10100RNRCHNUaO1LRLRPCSTLLEAFLIL:SGIEOCI11GFDDIL
CPn _ GSLSYRELRNAZTAVAIKVSKFSFIHtVG1IK14P11SIGAFIAYFGILL7ILiKTPPIINWaOGL
miaA-tRNA Pyrophosphate Transteraae FLYf4.PFEFEFNTTSSPECDib'C..CPpKLFVKLFKRTIVLLSCP'PGSCKTDVSLAL11PNIDRELRACTKTVEVR
RVLTSQQFTKHLTEVOG
IGLYS
CEIVSVDSKQVYOGKDICTAINSLRARpEIPHNLIDTRNVOEPFHWDFYYEAIOAC~JIKCSVPWLLRIFCVSGVESDD
TAVILFTSGTEKLPKAVPLTNKMI~IFiJOFJICL1IFF0PNT0 LSRNKVPILVGGSGFYFHAFLSGPPKGPAADPGIREQLFaIAEENGVSALYEDLLLImPEDYIQ.AFLPPPHAYGFNSC
GLFPLIfIGVNVI1FASNPLNPKKLVEFIl7DIUfVfFfGSTIVIF
AQTI?KHDKHKTTACLEIIOLTCKXVSDtII~SIDIVPKASREYCGRAWILSPETEFLKNNIDYILICI'AKKQNSCLE
SLALWIGGDALKDTLYECTXKi.OPOIALYOGYGATLCSPVISIT
t7FIRCEAMLOEGLLEEVRGLLNOGIRENPSAFKAICYAEWIEFLDNGFJILEEYE6TKRKFVTICESPR%SEGVCNPI
a?mVLIISKETHIPVSSGECGLIVVAfI'fSVFSGYi~diHENOSFIt ~NSWNYTKXOKTNF1IRYSIFRELPTLGI3SDAIAOKTAKDYLLYSSLGGDQWYL1GOLGHIGPSCDLFLEGRLSRFVK
ICCEMVSLEALESILNFJIFTENONmA
CSLWCCIPGDKVRLCLFT'tLITTIHEVPTDILKSAETSSIYKISYVIIDVlSIPIIGICIIP
CPn_0911 1011079 1042985 DYVSLNALAVSLFG
Fe-S cluster OxldOreducCafe EVTYVLDAN C~ 0927 1057966 1055093 SLLLJ1IFNVNYFNNLCKAISFEEGLfLFVSSPIRLOEAADATRKERYPSNbioF_1-Oxononanoace Synchase_1 PNYTNICKIDCTFCAPYRKPKSPDAYLLSFDEVIlSLLORYVSSGVK1YLL.OGGVHPCIGI' DYLEELVRITVOEFPSINPNFFSaVEIEHACRVSCISIEpGLORLWDACQRTIf'r~GAEIVCKESFL7TSDVIDt~.' IT1DFLCFARSPI'IYCEVSKRFOIHCQOFPHEKLGIRGSRL1NGP
LSERVRKIISPKKFICPr~IfINLHKLAHI~1GFRTTATHtIPGIMCIPEDILIHLO'ILRDAQDSSVTDDLESKIASY
NGAPNAFIVNSGYNAMiCLCNNVSRSTDVL4WDCEVHKSWIIaLSA
SCPGFYSFIPWSYKPGNTALRRNVPQQASIETYYRILAIGRIFLDNFDHVMSWP'GECKSISGOHHTFHHNNLEHLESL
LOCYRISSKGRIFIFVSSVYSPRG'fWPLCOIIAISRICYNA
LGAKALHYGADDFC~uVILDESVHKAT4WSICSSEEEIt3'tIIRSEGFIPVERNI'FYOHISCHLIVDEAHAKCIFCC
OGKGLCFIALCYENFYAVLVFYGKALCfKGASLLTSSCVKYDLfIpFI
TVSSL
SPPLRYSTSLSPNTLTSICTAYDfLASDCEIARKOVFKLKEHFHDCFDSHAPOC11QPIFL
PHTCLEEAISVLETTCIHVCVYAFAKHPFLAVNLNAYNfVDEVNLLAOVKKPYLBKSSHR
CPn_0~12 1044120 10157u0 'MINHEFHLWRELCC'H
CT768 hypothetical protein tNINDNSONSFHTLETEOGSFLNDEWVEEVASTESTEISDATLCFAEKKVAFILNIWRE~Pn_0924 1057301 ALTCSSOGiDLRLFidDLRKQCLPLFNEIEDTAKRAOHWRCYIELTKECRHLKCWDEECSpriAPrimosanal Ptocsan H' FVVGQIDLAITCLEKOTLK!'QECTEDKIFKDREDNFLESpALDKHOAFYKONHTSLLWLSY.RFTAKTKSNGYIESa' TPRLYAEVIVCSNINNVLDYCVPENLEHITKC'fAVThLRODKK
' SFSSKIIDLRKELIHVCNRNRLKSKFFORLSNIwiFIQVFPKRKELIEKVSGTFAEDVOAFVfLKLILPAIS
'AiVIYQIKITtOCKKILPi4;Iw~DSEIVLPQDLLDLLFWISOYYFAPCGK
AKYFiCSDKETLKKTVFFLRKEIKNLAHAAKRLF'lS3HVFAETRLKLSKCWDOLKGKEKE3FNIOPKOHYRWLKVSKA
KTKEILAKLEVLHPSGGAVLKILLOHASPPGLSSIJ~3'A1IV
tROEOGRLRWSM4SKEVROKirIEVSSLLIECNDL31NRKDLECISKKINALDLTHDDVOSPIH3L&KIGILDIVWIAt 7LELGEDLLTFFPPAPKDLHPEOpSJIIDKIFSSLKTIOFN
I:LKKF140pLF0pLREK00AAEHSY0E01.AK0KC'VI(Y.EAARSLAERI1TFSICTCS<f~1ITTHLLF'uITv~S
GKTEIYLIiATSEALKOCK:n"CTLLVPEIALT/Q'fVSLFKARFGKDVGVWN
:iFaAEEWQTLKELL:KHSFLPPPEKISLDNpLNLrILC/CIVNFFEEGLLSSPDSRCKLVFMKLiOSG!SRTWROA:E
GSLRILIGPRSALf'CPKKNLCLIIVDCEHDPAYKGTE3PPCIfIIA
RCVLKORRERRQELKDKLEODKKLLCSSCLDFDR~Y3ALVEEDKRALEELDASILELKFDVAVKPCKIJvNA'I1S'L( :!:ATP.~.LE.a~Y1NALSI:KYV(SRLS3RAAAANPAIITSLININLC
W f00LL hEK~KTY.ILF~OPVLKK IAERLE'JrEOVL
IFFNRRCYHTtNSC?11CKHTUtCPNCOMILT
FHKY Vf/LLCIIIf.N.';:PKDLt'U.~.CPKCt.CTHT(l)YRC3GTEKIEKIIf~IFPpIRTI/LID
I:Pn Oll f 1U4570'1 tU4S74g "D'ITKFYf:;fIETLLRVFA'h:K\GVI.t~fQFILAKE.FNFSAVTU1YII1Jf:0.,~CLYIPDFIUIS
NII fl.tNlt;r tr111Nlhxa Vresenc EpVFOLt'h)VN:R::I:R::IILIt:EILIQ::FLPDIII?1:NSAI4PUGY'AF'L':QEIT4RELCEYP
in I:linrlvnk/ElIBL >t: at tl/71'IH
Hl.x'K'fYRIFJITD::dIIWRRNCf'fAFDLDt'.'fLLK~:If:::xt~FYC'R:LG.t:LF::IK'PLPE'1:IF
FIHLtt~IFMCY.ufKG'IWt:F\I(I1VIIHILKEULEL"tNPLt4t~/TPt'GHFKtKD'fPRYQFLI
-i ltl'FNFKFF!'f:I FIIPS I 1R Y :AW I FVNKKLIINAL.KIr\KU:1Y.VKF'FI
IINUIMR'FF
~'IW ~lnl1 111.IS'HI'1 lU4inf'Iw ':I'n lfaL'i 10'./w!'~ !'n'.:Wt NII ftHNl:iL INNMILNI hfla:l'flt TI'1'1 llyt111CIN!f I '.11 Inl..l 111 n:.IIHI\IhY/EHDL .!G Jl I l/7/'IHt.lll ' VFFWGLP::I'Y'/:al.Tftl.t::::Vf'CDDLYCVAINFI':'..'t.'If:::DFYAINLEKLEErIFADI'IT:
Ctf::fF.t:ll'ft \::I.tIJVTLIftrAI::A::Y.::a1'EKAY611PN
YIMLPMF11S(NIFIIB'rl.t:QLI.DH
VILE.':'.::IrIFIVtIIIAC~Ia:I:::.WIA:x.'YRU):vItIJTIYKKCLTi:DKKAVIL::Y(KKIFICIAt fyl'(TPPPITtIt.::~'t:K'rKl~.:LWKWVILIiCUL::9NAILKEKYPALYG:::W'Ai*IPC~I
AH::IITF::ImI
ILW.I'YL.HLa:EEKTWR('Ct:HLKKH:1YY'fYWNIVF'rvF:FIUIEEVI.FFNRIVKn:f 'JI.f~I'rY::rI.IIIAKTNtF'/tIIIPNFFI.AIAI'ltP/IRYKIP
' f::, ;.t:'ft".Kf.::IiI.KItItLWAILlIftLPFAYTI'Y::::
'ITDYIYJ::LTOtF'J:IFt.t'L
'11 11'1 l'~. 1t1.vi.lq l 11141,H
1"
CPn 097'1 lr1'. . 10711'7s .:~ p'I,p., LOr.ADDn 10'SNSS7 Cf790 MOoctlectcol Procem Tnroratnlrtn OioulIiJrf lenmtriee IIINRW1'IRL'.rf' .Ttla' "~'n.'l'PPSB'E~F~'~"
CLLLTLPCCAARRRASGCIL00tRPIAAANL NM;~LIPfK .~8,~~
i0HKP1IL01XAF~ D91HIi1~IKVR
FKD n ' . lSO
. VIOVIILKLAKTI~V~irG00NGIDahfrlRDiERI~GIY
IIHTOpTY;.TRF
VSIPERTEEIOCCIVSEISEYTCLHVAAVHVIIKCLTQPKatIDEEIEELV3YCOLPSPE
<.
OWP3YAEALENSKOtIHKPI~LFF'fC50WCMhICIRND00ItpSSEFKHFAGVNWIVEVDF
PgD~RIOPEEGROKNOELKAQYKYCOFPELVFIOAtZKOWt~FEP00CAAW5RVRSAL
KLA
OFLt.~SEG
' ~Pn neln W i0\n tD?t_aA
?. ;os~nllf tnsaa7n ' r..t:f:om:. alr:::
- qy .
_ ~ ..
,. ~..rr: .,.""....,_, .", ... .;.\..... .. 1f w.:?
h':\ur~:.=~~.' -:.at. r. :::INP."Ff ....~".
:Yt:..: :wsr:l:
:Nr:
::
' ' .
ERIPFLIU(K'IASIETIW.iNC:EALLLENNL:KCNNPKYNVLLKDDKTFFCIrIIsWISW
. ' .
:.: r:.
: ::: :::.::: ::.:-:a:: .a :
WVLKKTCOFFILPSSIISOSHSKTAVAIRl4I1'FLSNIIfOGLSLKEI
FLISLILFLPt.ALL CILY
IDRGDSSLNDt.AKJtTG
PKVdJIIRTKAITSSORCLItGPYVSAI~CfITLLEVISOWFPLRTCSDREtALRKR1 SGLh~
.
dRlIbCLAPCVCYLTPECYO~'LDKAILFLIOORIEEWKDLLKVIOKASDNLEIWIIAIfYY
SAAORVIITOYDDLWDSLAIKIPtIALPNRWILYSpGNRTILTLLTVItSCKLIGARNFBFFl~7lQ
YPGINSSKGSAKRENLVIISYQACVRYLRDECiGPKANOIIAPGYSLGTStIOMUt SNLLVTN O
ALDRMOCo~DLTSwIWKORGPRSLADVANDICItPIASAII1Q.VG~IHIDSYKPSiaLRCRTLSLIKOAI'U1KOQVE
XFHIONIDAI~LYRt ~ODLL88FILQYYVSQP1'IPKEILTPLPLEFPtLSYVWA6SPPRLRSP~GY~~tELLD
PETFIYNSNNDQELISOGLFERfNGVATPFLELPLYRTSCTRIPIPE11DLLHIlIPLSPHVLAYRNAKAYAATTLPSS
1'LPYODFONILRIISO'YPYRILGY~I7111lOGUlATGVYIVF(~IJ
VDRLMVISNYLDSENPKSOQPD
GFDP(fOYRTFSI08ERTl~I~.ALi.IBVLd.RRFNSLTTJILPOIIWDOCR'I11YNRTKRIIO
'l'tJ'ILTGIOVYI'IAKEKlIMSRCLIiIWCIfGITFPIO<CFSLPPTSMi.QFFOILRDWdtFA
CPtt_0928 1061075 1039881 ISKNRRRIIGRALFEOERIPCICEVRIIKRLt4RFKSMOpVIILSSQisEltaIPGLTIOtDIAV
CHLPS I7 kps P~eeln hoalolo~3 ~tRRD
RRKDFAFTLIl~SDILSGIFSNPNPV5YFS8TRAKOLSDFSKKHPILTICIYI'IIVICILGROt FKLLICLIIPPLGIYWLCOLVCSLALFPRSSMLYSVLIIfCFRKYRLWBIODYWIDfL.DP
SFIfDPAVSESKRITI00DHLTIInLAINFSTARPKItwLLISLGSCOFLEDMIGLRDBLFL~PeI..0911 SWKELAKLLGANILIYNYPGVIt551G%tliLtNLATAIBiLL:Alat.0~I0GPGANEIITYGrq-~ Nimcch Reoas!
YSt.00WQSAALOKNPFTNSETSWVAVKDRAPNSLPAAAtt~FFGPIGKLIAViJ1RW10~A
EKNSR&LPGPEILVYSADAFRPSEII~DTALLPEITLAWIIKRTPFMSKKFIGEVNfi.H
CPIf_0929 1062701 1061186 ~CMLPS 13 kDa Drocein homolop_1 EKllIiIPIHGSNAFVEDILNSNPSPOATYFSSTRAQKLNEFI~WPVLTRZ1~SVIIRIFRV
LIGLIILPLGIYWL.cOTLICTNSILPSIWLLKIFKItOPNTRTL1LTNYLHU4DY88tD'1HVA
SNARVPILQDNVLIt7fLEICLSOAPTNRWIG.ISfGSDCSLE~IAC%LIFDSfIORFAK:.IG
ANILVYNYPGVNS51GSSSLKDLASAIWICTRYL%DRmGPGAIOCIITYI~'SLGCL.ZDAE
ALRDOKIVAD87DT1tiIAVKDRCPLFISPEGF1CSCRRIGKLVMLPGWC1'RAVOeSODLPC
LEIFLYPIDSLRRSTVRpNKIi.APELTi.ANAIICrSPINONREFIEVRLSSDIDPIDSRTR
VAL71TPILKldS
CPn_0970 1062851 1067370 No robust honoloq Present in Oenebetlk/ElOILLCP1L0912 I07S955 107775 as of 11/7/98 NlOISELAPCSTGLOMVPNTOVHtiALDTRRVILTIAACLSLI11GIVLVGIaAAAILPSLFGdnaG/PSil1-ONA
Psiaase R'IAHYTEESLZ>cIWtSIDIIIDVLREHINLBRSGATYKACCPFIRCCi'PSFIVN
VIG(r?tILILFSSIALIYLYXK?REIIOpIALEPLPEMISKDOSIIDIVETRDYASLEKRAT.
YRCFGCGRNGDAIGFi~IOItLGYSFTEAILVLSIOCFwOLVI.OPK08GYTlP00NC
A
A
FAYTIfrNYYOCSMV!'IfREIPRFr~CSYLiIL.RKDlIaROALE'P
G
N
~IZTJStABTFFRYCLYNLPEARNAIAYLYMRGPSPD1'I~tFNLGIRiP~08GFi4ilwE
~PtL0931 1061078 1065718 DlRId060LM'AGFFG180IFLFARRIIFPVIIDAiOttTIGPSARIQC.ENDp00RYVM'PET
MIE
PIFIaSRILtof.NIBRRRIAi~IVILVOD0110CL0lIIDSGPNLZYAIIpQrAPTJA
lyeS-LYSy7 eRNA Synehecaee IDFRVLQ'1KSDIYTNILCEPM'APAEYLDNEDFLY~UGQGSE<GVVLYPYETPGVFS
IALLLOS(JDYL'1'li.It001$$YPKPGPRERALLVEGIROIwbaSPILVYEIG.I~Ir~SL
CEDI10t1'PASOEIGN8EAANSRS'1'PRVRFI~ILWRAIDKNiIFG0ILa0~fOTI0~1RIIRtPEDNVLBLAHPOP
'1'AEPpHIPIROKVPKINPNIVhEiDILRf7D.ICGfail'ICILY'1'J10 ElTSVNGLSEDItEITPIKFIEEKLDLCDIIGIDCYi.FFTNSGELTVLVLTVTIl.CESIi.6FYIVPEDIIINPI.I
AFMISYYEKIfRKNVPFDEACOVL8DS0ILpLLI'IOtRIXIfALD
aAHCiFLCVE1'PILO
I
f TIfIRfL0t0lADRRfdIEOCRPL$LHONIOD~EILEDYWLRRDRTIITLL0P68ELIP
LPiROfAGLSI7KEVRYRKRWLDLISSREVSDIFVIOtSYIIIC.IRNY
i'I~ALNSENFLRISLEIALIGfILVOGAPRIYELGRVFRNt~ItR~'!Q1 T
NIYOGiIFJIKPI
T CPeL0913 1077972 1078238 PCTMIGYMYIfsYIfEVHVFVfM.VFJILVR7IVl~ffSLVYSY18f11DPOBVDIKAIWIR
I
LI 1 hypoclucieal pmcein tfll8~r8IA1YAGIQVDVfIBDOKLEEILKKKT'lFpg'1'AIrATASR~.IAALfDELVSfRC1T94 N
APNNITDtIWEITPLCxTLR5GD,1AFVE<eFESTCLGKELCNAYSI'3i~PIROR1~LE00.
' PPIBfSPRFLLPFLSVILCaia.LSSPR5RJ1ISVTESIGtISAVKTLVC.BBIDIREfita'.1GY
fNMSIRDVLYFPIIl9iR GVGASSILItONQfOOWLCIESLLAQNlVM
TIOIL.LPDSECNPIDEEFLE/\LCOC?IPPAOGFGICVDRLVItIL
FDAGfIN
CPtf_0932 1067160 1065721 CPeL_0941 1078503 1078997 ban ~
V~KSS5D~11IMAF;NIyECLYFYNCAS rhQ>O~F9PNtFt'PVRLYTCGPTVYOIfAHIGNFATYVFED~
FEAAYIipA
VRAELOP$ ~
VFfGYSVTfIVIQIITDVEDRTIAG11SKRNIPLOEYTOPY'fFAFFEDt.CI'LHIARAI~PGILVFTSGIITPBFII
DLTNGSPSLSTPIAKCFfNWLCPOLISPLDI11110DPV
ILKR:T
.
ILYIGS!'LQ~PEVF~tVSGPRLCYILIDL00CAQC0AVLPLLTIGt DFYPNATtIYIPQNIOAITKLLEOGIAYICODASVYFSLNRFPHYGKLSICt.~.SSLROCSR.
ISADEYOKDiPSDIVLWNAYNPERDGVIYSiCSPPfRCGRPGP81LDC8INAMELIGDSLDIN
CPef ACCVtxtIFPNNENEIAOSERLSGKPFARYWIJtSPitLLIDGRG1SKSLONPiRr i FICptVAYIRiASNYRTOLN!'TECALL1CRNALRRLItDFVSRLEGVDLPGESPLPRTLDSn CTY95 hypocheeical Drorv SSOFIEAFSRALANDLNVSTGFASLFDFVNEINTLIDOGNFSKADSLYILLTLKKVD'HII.8IFKNRILPSYFCHHFD
DLRPHYINtIALSLLSLi~IIFPIFCEESRPGSEDCNSMfQLIIIC
GVLPLTfSVCIPETVNQLVaEAEEARKTKNWANAI7l'LRDEILAAGFLVEDSKSGPRVKPL50171'00CLY1~1(RI
EGKPLVTWILNSCDOCQACfIGLSETCEIYLSVLBGSI
FSELifNIWLVP9GVNPLIYPPI~PILAEIVKFKELFKDESFPfGGSI
IWCVTP1~PC
CPn_0973 1067532 1068578 . DIIEVSPVSLTV6EEETLPSEQTTEVESTSEtQSEDPAIA
predicted disulfide bond isoaerase ' K CPri0916 1082816 1079715 AEL
C t)ly0-GlYCYI cRHA Synchecase PVILI4NIKRCSLKOLKVLATLLLSLSLPTLEJIA~IRDSOSIVWHLOYOEAL.OKS
GJ
L
' t GECOKICKCYTLESFVSEHPLTLOSNIATILRFWSEOGCVIHpCYDLEVCACTFNPATFLR
JEVEYLKHRPQVCiIRO~
PLLVIFSCSOWNGPGMKIRKEVGt=SPEFIKRVOCKFtIC
L
KSKPKINELPCNILL3NEEREIYRiGSFCNETCSM.CDSLCNIVEBDSLLRRAFPPDfISALDPEPYKAAYVEPSRRPO
DDRYCVItPNRIAHYi~LOVILKPVPQIFLSLYTESGRAIGL
SLSELORYYRL11EELSHKEFLKtIALEIGVRSDDYFFL:aEKFRLLVEVCKl07SEECORIKKDLRDNOIRFINl7DN
CIPTICAWGLGWt1MI11GNEITOLTYFOAIGSKPLDTISGBIZYGI
RLLNKDPKNEK01'HFTVALIEF9ELAKltSPAOVAQDASOVIAPLESYISOFG000KD1~R.WERIJ1NYIAKKISIY
DV WID1'LTYGOITOASEKAWSEYNFDYANfAIIFKNPF~IACGL
R'J~tIAOFY LDSDQWtIMAi.0lf AEVAFEAAPNEVRSNISRSLEYIRNOSRTLIDIGLSVPAYDfYIKASNAPNILDARCTISV1'ERTRYIMIRpLTRLV1 1DSWEwRAS
0971 l Oo8918 1068526 lt~lYPLtw"LSSTSEPICETSfSWPMISSTEDLLLEICSEELPATPVPIGIOpLESLiIRO'Vt.
CPn _ TDtMIW(TJrLEVIGSPRRLALLVKNVAPEWOKAFEKKGPNLTSLFBPOCDVBPpCpOFF
:npA-Ribonuclease P Protein Canponenc 'IFVNPLTLPKpSRVLKRKOFLYITRSGFCCACSOATFf'/VPSRHPCTGRMGI'CVSK!(lCKASOGVDISRYQDL8R
HJ1ST.AIR7YNCSEYLFLLNPEIRLRTADIIlIpEt.PLLI0RT8IFPK
AkEANSFttttWItLYFRHVRNOLPNCQIWFPKCHKORFVFSKLLODFItJOIPOGGHRLGKYII~WONSOVEYARPIR
WLVALYGEHILPITtGTIIASRNSF~11RDLDPRKI$ISSP00YY
TKATTOCIx:TPItSEKC\TAPR
CTLROACVVVSOK6RRNIIEOGLRANSSOTISAIPLPRLIGTPLSENPFVSCOOP$Op PCALPK6LLIAENVNNOKYFP'fNETSSGJ1ISNFFIWCDNSPNOtfIIEC~tALTPRLTD
rPn :EFLFKODWIPLTtFIEKLKSVTYFEALGSLYDKVERLKANORVFS1TS5LAASEDC.DI
0735 1Dc9100 1068957 ' _ ~OKLSTIGT
r171-L7A Ribofloafal Protein AIOYCKADLVSAWNEFPELOGINCEYILKHANLPTASAVAVtIEHLRNI
' EtIIVKRTYOPSKRKRRNSVGFRTRNATRNGRKLLNRFFRfIGRNSLVDLGLDRLiIDH
LL~rLLDRLDNLLJUCFIIGLfCFTSSNDPYALRROSLEVLTLV
:ASRLPIDLAu' FPSTtEEKVWDKSKTINEtLEFIWGRLKTFMGSLEFRKDEIAAVLIDSATKNPtEILt)'fA
!f! Jr. l i)r: )310 1069170 EALOLLKEENTEKLAV iTfTNNRLKKI
L.i.~.LKL'oM1':~SP
r:lr: I EVLCDRESNFKI,WLDAP'PGF
' _ IidE
r:7t.-L:'. Rrtxfurnul Froceln PKET.~J1HAFLEYFL.:LApL::NDiODFLfRVIIIMIDDGAIRNLR15LLLTANDKFSIt~
'11J1KV.~.:;::VKAGP:aM:DKLVRRKGRLYVLNKKDPNPY.~.VAV
iRO~PARKK
u':: I IJ.:'t.lH7 IJr:'ti'rK ~Rr_tlnA7 II)K IA f f li)NAO'~'I
am _ pttsA r:lYVarrcl f 1' PM>trptf.ttyh/ltc.m:ltKC.ncr.
r::lA::lA ItrW rt.vml f'cocwi"
~/KRNAYY.::::V\kF
V:RRRLVE.WFKKR::DLRXIVKC:..:'!.~.EEEIII7JARt::LNKIIKROTSP:.:AN.LMJYCTF::RLFITt' IhffIL'ILYr:YWFra'PCVVI.I'Y'It.LAtd \L::EI.'fDJII0.A'VA
'f'fIJItW''LIa'a<I'1a:1'td<KIAI::RCt:Pft~MA:2'1t)Efia:JIKAaIRKF.~/JIt'~:KLLC11' HAIxIt1'ItC::CYL'PF1~~C'INNLC'LLL'/FlF( IRIX:VI::Tt.RTVt:AF
1'r:fMIMNAat:KL.KA t 11r :Y::F'YI.I
1.1:/M 1 1'11:.1 t :l..l: X/tY:LL:I
h1\::YN:: f ! AVY::IA::
wfn U.::, 111..111'.. ltlo'rrll., iIEIfFWMIKNh'I'RtNAKTKI\:hY.tDll'_:YU
..~~/i1R t.ylo.rlrI i.~.:l 1'r..r.im -IL:.uMr rr.'fl ir~rn i.l.' 1'rriPl.f!'JIiCI' rIN1:971'LIt'IT)1.1'1::ILLIvYVIIJ7f:C::AYIADKKYFtf/It~.IFF'M:Ah'F:FIr:LWLLLL':h r_If'nAk I111c'.As!t IW AIVA
/
I~.:ItktlAl.h:KfAlt.('YfMI::I)1.FDDfJCK::LY?IDEIP:::~:LWEI411YFF.1.'WFYflIKDRFN
V'IUIA r:lYt'rrNar ::Yldtl.n:.
' ' ' :I' I ::Fh:h:l.l'l'LLYr :K'fYf'h:l't'L I ::KI ~.
WNWKKI 7MKUW,tk VKL'IGC:IIJt~ItL.Kh'.A::K.:~.~ I::R
':F'71N. I'ArIAVEhTI' 1 VKVI S
:1 t :IX YA::L::K 1:1 J1 KVNIIV
h.'lLf.('l11' 1 F::FYYEFLCKWA::AL:Y::1'1t:1:1'l:t!
1'fId'KaJII:LF:.TC:IY::h7JlIVVkN::AYAAAAAII
JEADIAUIYIILNIIIOM:LLJII:LLKHPIJ~IPVH_CPn 0?~0 I' '. l:l.'.':'::
"INIIPCYIlGYC3L'OLLAASOIp '!:
, tT909 hYpocttetscal pw.r~sn .~;few'IIYOLPRDI'OT3VLJIKCALYCGDYI'I1'VrL:.iOEttNDY.iDIfALNDAILARNSVF:iL::Ltl3Y
LrNPraKAI.W'w~F L tt.
OISSDKFPLI E
3EP0VLFTKK6CIRAVLYFxL
' . EELWSPLEVGR'.fGA~i'v:.'oOWL
:NtaDEDVWNPKTOPALAVOYDA:iLL
I
' ~NEVLtNCFtALt.ODCLASSPNIRL DCFFCOCSL~:PERKNtLKfLEflRKKNCG~PFCYL
-Vt3RIYEERCPEFNKEIILMAHGISYAFLLICi VCSGVFDCRPLIROEw::E:
' ' .F .
~
LDFNDPWILTYAAAGHICfPSNILEAG~GLTCLIAMtYClVPLVRKTOGLADTVI
.PT
TFFDLNIIFNEFRAHL.iNAVTYYADEPDVWI1~LIESCHLIIJIiGLDAHAKNYVNi.YOSLLS
'IT
~-~
Cpn_09~t 109710.; IO
.t.n n.~.tv lUR5Aa7 lOnF181 rl t:-Li2 Ratna~ln.~: Pmresh ...
.
.
._. .
. .
, . -.KAt",r :: ;
.. . , , . ..
.. . . .. . ... ...:.;
' ' ' . :..,.... ... ;.s.,:.,slt ., ~;; 'A.:L.. 'dl:T'.....i..: .;:
:LalIt11H1:r:~-::n~!~:r~':vr:;~ :.,.
".': -:I1:,. .... .._ KKFI::NLFSvALSSIYFa'LSYEGR L IKALVI(DIOYO
ITPYOV INLDfCCLVEDRPIXtI'fI 1D9730t 1099:75 PIIKINAVDCICVIC~SLROVIRAVRVMCKPKDIVPFLELDNRSVGLSOTRKLSDIKIFCPI1~0962 . plsK-PA/PhosphollPid SyntMats Protein ~yA
IL3L1111C1r0ICIDIlCWHSPLWWaVLVDVLKSO$SffPFAl1'LFAiClIRI4~tOCAf ~I~
0950 106170 1017037 80LP00CFPKIfSAEtiIVANm8PIw1A1RKK38 CPn V
~
_ TLARAKIPLFPAVSRPALLVLyPTIEUGIU1VILDYCANISVKPiDNOFAIAOGAYROL
pch-PePtadyl CRNA Mydrolase LC
PSLCOlahAKLIVAICNPRNCYANTRIINNGFLLADRLVE~.Of;PPFKPLSKCNAiJfl'LVCSDSKIPTIGLtIfIG
SC~IKGTC7WRCtfIWLRE?liG6AFLQ(ItSGAVPDDiIADIWTt~F
SSGPLVFIRIT'FVNLSGIU1WLAKKYFNVALStIILVLA~SFGKiJIt.CFNOQ>IOGIITCItIFLRTAICVTCPLO
RILGDKL6ADIORRLDYTFYPDSVIIL~GLAKLVIKCIIGKA~fS
NtiLKSfTASIGSNEYWOLRFGVra'RPLCL7I:VELSNFVLCKF8ICZ~.OLGSIFVF~ISTLf'tLFIGILGSINta QARt.CKRILS~.I
EadCSItF
CPn_0963 109371 1103231 0951 107113 107157 pm~ZlPutatlw Oucet Msebrane Protein CPt>
_ TPLRFKVAHW1KKTVRSYRSSFSHSVNAfTSACIAFCiINSLNSSF.LeLGVTNI0FS~5 rs6-S6 Ribosaeai Protein .EFI1~ICKKENQLYEGAYVPSVTLSEGRRKALDKVISGITNYCGEIHKIHDQCRIOfLAYTIANVBCAQTSVLKGSDP
VNPSQKESEIfVLYIpVPL
I~AREGYYYFIYFSVSPOAI JISLPE
TlVPCIDQKLVNStxOTIINFSOP11QEPDTSNAVSPJTISSCEKDipKpLiTCDPCKCIGLK
EvSSDLPKSPETAVAAISEDLEISENISARDPLOGt.AFFYLO?fSSQSISEKDSSF17QIIr ct~o9si 1oe711;9 loe7n3 sasaANSCLCFarsrIAVKSOAAVrsaRDIVr~avKCLSFISCESt,ma:s~l~nrrH
rslA-S18 Ribosanrl Prouin ~T~'fG'u'~~~~T~~'~I~A~
CENTNKPVHl6LEttRRKRFNKIICPFVSAGWK1'IDYIfDVCfLfitFITEPGKVLPRRI1'L11SS
RFQLYLSQJ1IKRARNIGLLPFVaED
CPtl_0953 10~7717 10BA3~8 r19-L9 Ribosaml Protein 1TARFGYVRIIYLIPIDDUIVIAG714TLRLOAIa.KW
RLIOAAADItADSERIApALI~IVLEFQVRVDPt7~i~DIYC.RVTIHDIIAF~IAiDDIIFLVRIC~t FPHANY71IKNLGKKNIPL IQ.KCCVTATLLVEV'tS~IE'YYMa0GK0'tCENOEC
CPI~0954 108~359 10~8708 ychB-Prttdleted Kinase GRKVCY%DLNpYfSPAKLM.P'LKIWGRRPI7NF1IG.TTLYpAfDFf'sD1'tSLSLSS
NVNELLSPSNLIWKSLEIFRRETQINOPVSWNLHKSIPL09GLOOGSSNAATALYAt~iH
F01'HIPITTLQt.WAREfCSDVPFFFL0E0H
CPIt'0955 lOAAtilZ 10A9175 I Erawe-shitc Wich 09561 LK06IFAICITI110RNELtH'8TNVi~SCLGVV1~CQNICCFtDlKIIItLlGYJiLCLtYtOWE
RAFPliP7ISYlMIATLGSRNRIfRCSFFFSSCfALGHD~i~4.fSIKLQi'111K7DCYVLYLDIIQ
Y11GILAGPWLIKCiIfVYQit~T1'D
GfptEfTAY05LLPQDYSiGIHC~IACFYGCNDLdKSVFAIRTLL10~IKElaC.CRNWBPFCS11V
YOTLOIS7GSWT~.PIACt8IDYRIfIHJPRRFIfIIIVS117VPlVtAiYHIIE~PtIiCO
tlISGSGJ1TLFVCYLEELEODSIIVSS0IIL9LIKO~fpCIPVSRLYAEPNNYSLKOS'fYIOiBP
GKEVRTFQRTRIENVAIPFatALFJIAYSRDSRAEHiSVpLAYVFDV><IIIA7PVCLITL1~J1 LOCFpPpI AYSIIKCYCVDI IlIYDII~IIIIIF
CPtI_0956 1oA9515 1090909 CPn_0961 110~~11 1103301 CTA05 hypothaeical psouin No robust homolop present in Oenebank(CIOI. as of 11/7/9!
WWP$NILPPYSYSLKIGAAVLFPCSILNtFLTP11LY1u0SYODtKLVFPl7CWIDlYJIRt.
OSILCSIIKYIYLIIOiSKNMLBNPISLFSPAELIAICYIrt.IPKISPIYIRIIIiLIIl.1~11 SELTRILSRVSIVFFLWAVPL!'fWFLYTCGYRISL01YFNSPNYOTAVFI11VILILT~~P
CQTRL'fNVAOVGNPSSLI~SIIXIIJ~CSGGPLCWYfEIOILAFITTiVtlIIfi.IHC.
IVYFACLVLSSIAIQGKTSPKSWWWfLlIIAPPLLSCLL1TC1GANIIG11?LLJaWIYV!'8P
fVAOLRLFIIPLPPKKIVmt.SCPTTECIIiEVfOPFIF11L0ALLFC~LiIfF!<IV~fIIC
SRRFAYATNGt3.FSNISICCLTSYVSSRALTLIPPALI6iENSFFLSNFAMfAIVATLIST
KApI.PIIIFGHRi.VAISPOCSOFJII/tRIPtLKKVLISLaVLTPAi1K116nY1~i TIYYFIFRKEFKKFPDIPSD>mPSVCKVPWWIfCVNIIlVGSIILSPSTPLFlICAfdi.FY
OItIANl000lTFPILIIQ.LIGL1CKSSLPICfPSfKtKI~AALFIAS/!IA>WiT
iGPnXFTIEYQOPINLSKVCYVCLrYIIGLWFGOI.pDIWVt3It3100LSDFGYIfIYSIrfLS
R$~RLYSIANDf~LLIW90CFtDCREf~ISIODG~A6EYRFAA0p11DA1tY11G1ICaVi.
IFLDNALVNYLVtOJISVATDCYNYLWXK'NIUILiGLTLVSNIPNIVGYLILRSAFPSSTi RNCSMKIAWNVINT1IKPTf(OK'taCLVTENLQDTIG11LTLRQTNiI'114mtCW10LlML
HHONLFLCALGPSI IShfIVPSiLLIQ.1VPEFLYCFFR
PLIIKYLNS~.VNSVFKSHOIUIDPCTItALIREPALDILY11SLRLPpTSJIlEfM5Tti31 SEEFLKRIFOtLPAV
CP1~096e 110055 L10d719 pcnA_:-POIyA Polyeerase LLITtIINCENNILr<:RSiLELLKKKSNITLTFTIYSVSNHNfKLKDFSPNALSVIK?LRK
AGYIAYIVCC.~IRDLLLMl'PKDFDI :'l'SAKPECIKAIFIO~ILVCIIRFRf.ANIR~Opt CPn_095A 109803 1093793 ' IEVSTFRSCSTOEOVLITKDNLW>1'PEEtIVLRRDPTINGL!'YDPENCCIIDYTr7D11lKKJt pls8-.lycerol-3-P AcyltransEerase NRYLRTICDPFTRFKODFVRNLRLLKILSRSPFTVE1'OT'OEALIJICIt06LIK850ARVFE
LYRAIYHOFSRYLRYAFONpYLPEPLY0KFS11FliQNYIDAATKKAAADOAEVLCLOWhfVELIKHW.SCRAIO~PPO
LLIl7JfILLEILFPYNDKAPRWPALCCpTATYLKALDDICILKItE
TIDLYtIPFIFPPYHKKIRAPIDLFRLSIDFFSLVIDDIfNSALtIrLItRLKEIBEYIARCDAEYDRIWUMIFLPPLV
NPNVRYKHOKHPYLSLT~VF?(IKNFLCQFFAOSFTSCSKKtIf' IIWLUWHOTEGDPOT1IYYAiGK'CI1PCLJIENHIFVACDRViSDPWIPPS11CCOLL.CIYSILTI1LILQNDYRLT
PLIPIKKALPFNKKLLHHTRFLPJIL:LLCIRSLVYPKLOKVYVJWl KRNIATPPELREEKLLHNpKSMOILIITLLNEGCKPIYYAPAOGRtHIKNAEGRLYPSEFSPRHHOTLKCKKD3NSOK
ESLEVFRLLAKA.,~IIQITHFYPFALKTYDILPPPPKIEIIAICEORAIFFAPVFFNFGACLF
FDALC.~.KEELtFK:DKHAORTLRAEKVFSIVKNLYCELCPn-0967 IIOM171 L1U~8>r:
mrnA/pps-Plx~yhnt l uc-cwwt.>,.~..~
''Ur_U''.n 1117AJ7e lU.s179~
PTAYKFAFIC.k'R::EKIRRTCtDFRRHHp~h:VkYLlY:fGIYRCR/W!'EPlfIYE1'fVLLCK
.:.stE-IUi.tI Fshnrnc 1>rocein AVARVLRl7:Ra:KHNV11',:KUTRI:X:YHFFNALIAP,ItJ::M;ICEtVt(:PLITFGVAIITR
A(X:Wa.TRKVNENEILIi~IIE:iKEIRYAIILKNCf~LFLLTtERKKVROt.KCNLYRGRVTHTAYRADR::IHI:
i4:11NM'RI?ltal!TF::LEI:FKI::fi/Gl~P1E711V:IPJIDPtIPLPCOINVOK
LIlNfO::AFtNLDERfaN:FIHL~.DILF3~I;:KKFfONFOHDVDALPEEA.iFJIPt.L.iSEEAPIENKRVIOII
IK:N\'VLF(NK\TFI'Ke:NTIJII:I.IIIVLI/:NK:A::YIIVAI~.T/FERLI1AEVI~OCE
F!'1.1;LL::PVt.VWVKEILU::KVARLTSDII::Lfc:RYL'dLLPIIf.PIIRGV.~.RKIEDPIMRCDLP'l'C
tNINEIk\:ALFIth'L'KAVIL7KlI11t4:111.U:DWPITMVDF7Ni1L110s7111ILaICA
YyLIN::!'Fl~tXnIY:LIt.'R'PA:TCTA:,TFr\LINPJWDLLLTWII'I'ILEKFYSTEGPI:LLY~ET;:DLK
KR::ALMINItWI'fllrlN!'rtJIJI'(Lf7t:L:IIrVFT.':1'P'.YJRIIVIJMHLYJIEVTIl14E0 Inll.l!Yl'/I.1't:IIH!ILYKNLLIDDYATYQIG:KIMLKKY:PDA::IKIB'fYRDSIPNFERPNLE:il:IOA
IFLDYM"Aa:I:II::TIlY'/LRINIF-':I_:HI::W.TAtIVK::I'pl't.INIIAVRFJIIM.6T
YliILYATIeNKIWL:::~h:1'LFFnI(TEJ1l61TII7VNS~1R:STQL.C'~VEETLVptIILEMCEIAIPLIERT
IJtt~lltY:Al~:1':::1<fl.LInY:y71'FJJInI'MVPINIrKlltnlll'IrtVM.AWIMF3.I:
L:HFI*1M't:f.VILDFIDHK::RKNDRRVLERLKE11N!!YDAARCTLG:HSEF.LVtMRpR'Ir'"lRE
I
~~
NHF::! t4JftJ"tU: Mtt::a:NA I I vl O
KTPFwIIV/ t E T CRDl.YIfV IMIK6113HLCLWHPEIASYN(IC
Kyl7lLrillh?IINWKOLK~VCLQINT::D:iVill.lRIYOFFr'LITGE.:fOLCI91 ILIbN IIILnHNY
.IIIIC:.:14n:.b..MW t.,.;:t",r.,.:.
.. 1 ~IUlrt.yr.W :ittr.t::.
lI7 CPtL0957 1093~1? 10909E3 ide/ptr-Insulinase Eaeily/Protease III
KIYTRNCKNfWKLt.CPILICTSLSITSCEOQFkWPNOCPIQVSTPAAAOpICfEKI ICSN
GLPLLIISDPNLPTSGAALLVK'1~INADPEEYPGNAHFTEHCVFLGNGfYPEVSGFPCFL
WO 00/27994 PCTNS99l26923 Ie ~
VEOCLFIRKTVCRVOQ.SNL.F
RMDGIF!:YW'!J~(,YSfVLLGLAKLCYRGYD,iNW4J
D CPn !)'179 11:1:71 l' ~
a / ' 1 httA-DD :ieriM/~d~'ltlYWllai' p AWIOtGfIflIFKCLRRii.TAOCI ~ G':... !I.
-rERitITA:IVICItfIIiIATIICVP'IiINANPNVDEGR
ES
iP~~DIO.r~EIIVQLFSLYYOF.SQOLYFSFCQTIJiGLRCiSVACALtHKDNPIYfILCA80caOIIITKOLRS1~I
DI1VL~'LLRLPr~Af'SKKESRYStCP~00b"'.
VR .SC~fS~SkV~~fK
O .
PLIIGIL.7cEE'fFIASDSMFFKYTRHSpALASCiFAIVSOGKEPEVYIE.ILKKIHKDATPAWYIE.iFPKSOAVTH
PSpGRIK.PYCNPFOYFNCEFfNIIFFGLPSpRDIPOSKMVIt SEDASOK3GYCYYNLKEIYOOP6YLECLIOKIW~falIILSECLiDVPIKSFKiITf IT~.
.
~LVSPOCIIVTNNIIWED1GKIHVTLHOGOKYPATIIw.DPIITDIrIYIKIKS~.PY
VACf"'w~YHAGYLAKYIIF$LVSTPVIIICVASGFRYRRPYIGKCTLTrILf.iQSGCfAO?LI1 ALKiLRRRNIAYLLCICNVPGSAIAL.CVDNCLFGFaGVEIGVATTKAITSQLLLLVFIGLLSIr?ISOHLKVCDWAIA
IGNPFI:LOAT.:.ICYISJ1KCRNQLHIAD!'EDFIC:~AAINIGN
Kf.IWVIK7ALTHAE0t:3f~f:LQ.SLPptGpKLLANE;LHSWAOPYSYEDKfLFLCRRI~IYP:,rI:PLWLDrfjt !tGVHfAiV:;rStrlfICIf:FAiP.ftaIAltRtIDDLIRD!:pVIRCFt~Y':.
..I; . . ..I, ...,:" . Y' ::.F"~Cln!.'!:YM:..:iiLrLS'."!Y i.i:f~.' ,y,. ..
L ~tif. .. .y:. v. :: ~\'f.'i. :..'MFRIiif:.:
...F. ..I Y.Li: ~i:.n . ..d 1i .. hl h 1:.:.:.' -~
i""
' ' 1;.~~
...:Nf:tA::l.l:.':!:v::v:l:n .. . .... .. - .. . . . .
. ........n ...... "".;t?IA..,.. '.!
..\.47AE!~J' ~:Sf ~!'SYF:.:tII..
. :
at:::'."PI':!.%L':PI'a':' PRNLAKSVTVE TKGILL ZSVEPCSVnIASS:aIAPGOLLLAVNRQINS~
IECt.NRTLIt05Np1ENItilIfl9OCD
VfRFfALKPEE
CPn_0969 1111101 1111999 CPn cyrP_1-Tyrosine Transpore_1 _ VYVMSNKVLGCSLLIAGSI1IGAGVLAVPVLTAKGGFFPATFLYIVSWLPSMASGLCLLEV'si~ilsricy co Saecharamyees serevssiae hypocnecsul 53.9KD
MfstlitESKNWNa.SIIAFSILCNVGKISICLVYLfLFYSLLIAYFCfxIGHILCRVIMONPtocvin LGIrtiIRHLGPIGFAfI~CPIINIIOfKVIDYQ~1R!lMIGLTV111GIlCAtGFLItIOPSFLFVMJItIAKKNAKP
WLIFFSTKDKt3YCDIIf'NNCSGKPt4lt.DSKHFDINSANFLi~JNt vRS~ILTTf401FPVFFLAIGfpSIIPTLYYYlK7RKYGDVKKAILIG?LIPLVLYVLtiiWFISFPSISADSDHI40C
1I~ICAHFLVDHVNKfFDVPGWITPGHPPIIYASYKSCDPLSPfL
YiGIIVSLPILS0711CICCYTAVtALKOJWllIWAFYIAGiLIGlFIIt.VSBlVGVALGV!loFLMLY11IY090PA
OLSOCIi~I~!'ILREC~NLY
AOGt.KWNKXSNPFSIFFLTFIIPL11WAVCYPtIVLTCLKYIIGCFG71AVIICVFPTLIVWKPLNIIWLICGEBISG
SLiILPIWLGUtKEALW1DYLLIVDGuFL40tNPYVSIGitRGIVfM
CRYCKQNlIR00pLVFGGItFJIL!'I10TLLfVIHWSfYHELKI8LEG~IKD018f'.YttX'.IAYNMRALSiILSS
LHHPONSIAIEGFYDDLiIi.PSDSORPD
LPKSDTLREC~FRPOGYE7l?YSPEESALR%YEINGISCicrYTGPGFXTVIPYMTA
CPn_0970 1117153 1111618 Yt.9CRLVPNpDP~tAAHOVIIIIILROQVPBSLKFSYEILPGGSIMiRSWILpfVKVIt~I
cyrP_3-Tyrosine Transport_3 YSDLYNBiCLRLVMPATIPIGPLLGtAAOTSPfICGTSYLSDDIHAAaNFEIIDOLItIOCF
VYVMSNKVLOGSLLZAGS71IGi1CViJIVfHLTAKOGFFPATFLYIVSiiLFSMASCICLLIYLSICOLLDKLPKIKE
, M'Iwl9LiSKNPVII~.&IHIESII~It~ICISICLVYLTLIYSLLIAYIC~NILCRVP!>CQN
IGISWIRNLGPLGFAILIGpIIMACTKVIOYCNRFPMItGLTVAIGIFCALGFLKIOPSFLCPn_0981 1137019 VRSyWL.ITINiIFpVPtGFGFQSIIpThYYYImKKVCDVIOtAILIG?LIpLVLYVLWiWZinc Mecalloprocese tinsulansse Eanilyt VLG71VSLPIL~AICIfiL~fTAVFALKQJWR~IAfIffAGC.FGFFJ1WSSFNGVAI41!tm!'LVTLSMtAGDTYRN
FIIKSCJtDL.PEICSKLGFJIiNKPIGASIIOIIVNNDEIfIVIT~tICFIffC
ADCLKWlIKKSNPFSIFFLTFIIpWiAVCYPLIVLTCLKYlIOGlGA7IVI
IGVFpTLIVWK
CRYGKONHREKpLVP'C~71LPLMFLLIVIMWSIYNEI.
CPItr.0971 11f1697 1115~15 yecA-Transpore Penlease DGSNGLYDRDYIQDSRVOGTtASRVYfR!IMTAGLIVTSCVALGLYPSGLYRSLFSPIiYMMC
PATLGVSFPINSKIOTIS1!SAVCGLFLLY
ALYhGLAAVYGJ1F'fKSI7LTKISKIIfl~FALIGLLLVTLVFAWSIOVSNPLIYLLICYf~GL
VIFYCLTAiIGADAIRRISSTICta~BJfLSYKISt*ffJlLIaIYCNVIINfIiYLLQIFSSSaNR
D
CPIr_0973 1116)77 11154)0 ttsY-Cell Division Procein FCSY
RCIIIHSLLFPSYLVSFLti.OLTLLWIFKPPRIDG.OSLFK81YISLDLICDALSLFYfJIDF
GTELTEELCAALRRTKKADiLSTIRDLITVLWSLOGLPSOA:90SSQTRPIVSLI.L?fNG
PGGfliL7lAI LR~t!LTLDI~FPIVPAI
SCKTn'1 YKGtSiSVIQ VA
WI
Wn _ _ _ _ Q
t~01I CPItv0913 1111315 1139963 E0VRVFNDWpLSGLIF?RYDGSAIIGGI'LPOIAKRt3CIPi7fFIGYG
~iP
~.DLFLatKLFPLVDCI YipN taaily .
KIIS.ASVhBILPVSLiYCIi.ISGCVFFI.
sNfYS85LYlWOCRAPLEKIOK<.Ol~Ipti.O'!SL
CPIt_0973 1116116 1117537 NLiRIp&pi.Ii~DFSNRW.SSHKLIKDM1t66AQN1flrro'fSKSFQSILSPIQTri.Tl11031 'suoC-Succinyl-CoA Synchecase.
Beca'LiTF~fRtt~IORLKCOISOLLAV6KKLEIIC17M1hDIL10Ip0iI0LMifiT.
IPPYWVVSSCELCELLITKSGLDSAWKWVtWG
AWG.iYGDYDSpTT8A0GItFRADIIIRLP00RCLIID11KAPISD~iIIFSVMl01m0i.V0K
GRGI0K9GVIVJ11tS8XiILQIIVAKLht'InI0IPT5NQTADGFLPVEKVLISPWAIQRaYYVAVIK~IIKTLKSKS
YWGIPHOSPiYVILFLPCKSLFND11IRIJIp~MIQAiINVIfiSpLT
IImRJOIRCPVIJQSKAGQIDIT.EVJWSSPWILTLPLTSYGHIYSYpL110ATK1101WiGiLLALLKTIAYNiIIQC
IL4KQIQ6YSt.LGKCWRRIQW!'l'HPQKIOIIfILJp'tVOlYlal' Vt4pNOLIIOG~KCFYAD7VSLLCINPLVLTLCGCLLVLDSKITID1S111LY1WPNG6VLSSInYRVL.PTLRKFiGL
CfSSSI~ILEP?PIESL1TSFPNTCDIDTtIUIYf YDP8Q6lNRDVi.NfOIGL&YIALSC~ItIGCIVNGil0i.7WSTi.DILIOiKKitiIlAtiFLDNGGG
ASpKQIQEAVSLVLSDESVKVLFINIFGGtI~CSWASGLVAVNCCADOVIIP'fVIRLI%:TCPn..0981 NvEtGKiIVpQSGIPCQtYSSMF.OfiIIPRAVELSNpssA-Glycerol-Serine Phosphacidyltransferase KNPf.CY0QK1aJ10ID1171fx.DLIJ1I1GKRRVYtPNAfTAIChCCGLFI
IFKSVLRTSSBViL
CPn_0971 1117537 1118133 FHRt.OGLSLLLISJWIJ1~SOGAIARiIBLAFSAPCJIpFDSLSDAYfIGIAPPLIAIRfLD
'sucD-Suecinyl-CoA Synchetase.
AlphaGfYVG~IFFSSW.LI?SIIYSLCGVLRLVRYNLFSOKTVDVSKpYCFIGLPIPAAAASiVS
VCRFRRYMFNSLSKNfPIIInCITGKAGSFHTLpCL~YGTNFVCGVTPCKGGTiJit.DLPVULILiISDIFPOLPJYO
LRVGLLSFALLFIOGLMISPWKFP4InWFRINVSSFLLWTICL
YDSVLF~IKpJIT3GRATMIFVPPPYAAEJ1ILF~!'.1GIELIVCIT~IPVRDMLNARVMDAACLFFSGLVDHFVIVY
FFLVSWLYTLYGFPIFSIIYRKKS
NST50LIGPNCPCIIKPGECKIGIIIPGIffHLPGNIGWSRSGTGTYSAVWOLTQLItIGDS
ICVCIGGDP111C1'SFIDVLpALfJf~PYTELILMIGEIGGS71i6FJVAlIWIQANCTKPWAFCPIL0981 IaGYI'APIIGKRMGHAGAIISGNSCDAKSKfWLRItSCYTWESP1WIGK'NDAVLMItEL'nrtlA-Ribonucleoside Reduecase.
Large Chain' GKVMVIYEtKNYTIVKRNGNFVpFNODRIFOALEAAFRDTRSLETSSPLPKI7t.EtsIJIQI
CRt_0975 1119075 1119677 TNKWKEVLAKISDGQWZ'VERfODI.VESOLYISCLpDVARDYIVYR00RKJlER011SSSI
No ralttlsc homolo0 Prnenc in Genabank/D~LIAIIRRD(>GSAKFNPItKISAALE1WRJ1TLQINQMTPPATLSEINDLTLRIVCDVGSLHC
as of 11/7/98 3IEEQVALSIAIKfLKIItJILILFPLVLLJ1WVIRYQIJiANFHCSWPFPGPSVNpJIYKCSEEAIM.iEIQDIVCKO
EAKIEEM.DLLDLITLEWSSRCLRQI>M'FANRLEZELIOELRVSETE6LISLOGKRNLVRQKEDf:TTYLL1IKTDLE
KRF&WAGKRFPKZTDSOLL71DMJ1Fl8dLYSCIKEDtYI'fACLIIIA
:.LLTHFPNPPKRSRVESVGHEWFPVFDRLKREEEIICOGPITRSNEELWALLDHCTARGRAMIEREPDYAPIAAiLL?
SSLYEE3'LCCSSODPNLSEINKKHFKEYILNGEIYRLNpL
IHKTLWfSIFFKYLTQIELF
KDYDLOAi3EVLDLSRDOpFSYMCV~R.YDRYINIJtEGRRLETAQIPIi~OIVSMGLatJ~G
EQ10011tAITFYNLLB'IFRYTpATPtLFNSCMRHSpLSSCYLSTVKDDLSHIYJfVIIDNAL
CPn_0976 1130079 1131185 LSKWAOCfGNLfNfOVPATGAVIKG7TICKSQGVIPFIINAIIDTAIAVHpGGKRKC:AIICVYL
No robusc ttoll~loq pnsenc in Genebsnk/DIBLfrIWNLDYEDTLELRKNTGDERRRTHDINTASWIPDLPFKP.LEItKplPfLFSP00VPGWi as of 11/'!/98 II*1LVYCFDPS1IPTSPFJiR(lfAALpRWFFt.OCHW1RILTLECt511!MFOENMSISTVi!(IAYCLEFEKLYEE
YERKVFSCEfRLYKKVEAEtILWRKMLSMLYE1CHPWITPKDPBNIRiN
LKLISYLLIPIVLIALLIRCFLIISRPKCNWKCDSISOWIVPHDVQPFNDFOLFNNOERLNQDI~CSMLZCILtlICSF
a~ETAVCN4:SINLVENIPDIDKLDE&KLKCTISIAIRIL
IWKHRRWSCItrirt*NPVDYLRSQFPCFKEIPEAIRCENYVSDGQFS~SYLRAMLTONVIDWFYPTPEAKQIINLTHR
AVGICJMGFOtA/LYEWISIASOEAVEFSDG:SiIIAY
DIVCIfILSLDETYWfNVILKIRMICITFESFPCKEADPN1ISPRVTHHYFDESWKAGARNVYAft.A.iSLLAKBRCT
YASYSGSKWDNCILPLDTIEL4KETRCEHNVLVDTSSKIIdffPVII
LGOQiIIMRL.OEALfRTEKPGKOGECITKOFLKDYCKKHLEVNSCPDFIESLVDEKIREFDTIQKYriARNSOVMAIA
PTATISNtfCVTOSLEPM1!KHLf'JKSNLSGEFTIPNTYLIKKL
RCPSIWSAVCDVIDRKCOEHLLKAIINEANRRLFCl50JSSPTMI~NQVLFY'fIFSPPKLKEIGtJVDAEMLODLIfY
f'OGGLLEIERfFNNLKKLPLTAFEIEPEWtIE~fSRRpI01IDE1G
PPM$SVYF .
VSI*6:lLAEPDGKKLSNMYLTAWKKGLK11YYLR..~QAATa~~/EKSFIDINKRGIOPRi150i KSAST.i T WERKITPtrt.'~MEEGCESCO
CPn_0977 1131339 113310:
No~toousc ttomoloq prnenc in u~enebank/EMBLCPn_09s15 1115173 11)!:571 ss of 11/7/95 LYIlIQFANtLKSSFt*fEVYSFSPSVRTSFOIiRVNAALONWFFLCCRALKWSLDSCNSCQ'nfd8-Ritmnucleonide HrNiucca.~.N.
:amLl !:h.tin-aCELrNPIDT'ffKVLKItSYLLIPIVtIALLfRYLLJlSNFTAKV3QKPWLKTLQLGIDIKI3VNKY~'..RKKNNPR
LFN!:RRLRIL:iITEK7ktAKMEADiLI7:KLKRVFV::KKCLVNCNpV
::FtLPCiIiVllMO:fATLFKAIRLECKPVDViYHRLHS01IWFYIPAQKLPDDLRLTMdLWt~LYFIKYKWIIWPiI
YLNI7CAWAdLPTEVI'MAF.DIELWY.~.LEI::EDERRVILW4iFPfs PEKE'IRKTEYVRNMWiVMCIfLT~.aQtiKERIQQWv~DSRa3TrIJCABKVLQYRFIONPQSQ'fAE:II.V':NNI
VLAIPKIIITNfEAf~YLWWAFEFJIVIfMITFL'(It:F':4:LDEI:EVFNAYN
':.EFURLWF*IITTK'Cw~nEDKIYVQ~DLFOIUIFOCIrWPQFI;7VI0.'~.PTFSEELVHENSOKLOPAA!:IRA
KDOff.Mff(.TV(WI.MtIF.~.VV.~.::FLaa7~YIKNhN:%YIINI1:IFPY!X:PVIIIL.i LOr:tYPEDDEFEDKFt.NfL.LKAVtJIMt:FECC~S!A~YIFLICPIkILIWIPFLJOIpKFtIRpIWtTfta:EUY
VY11.RDI?IIILNI:IDLIN:IKEfIII'~IWITl:I4tEEIVALIEKAVi f.EIEYAYD!:LPR!a4:IJC::?tFIIWVPIIIAINthI.I:H
I<:LYf'1'/tC:HNi'YfWIL:P~11M14K
!.'hue U 17N t 133ti5 ! L l.'. )h'tRKNPFETRVTFY!Ifl4VII::W
;
w. tanu:a tNrrnrrlrxl uw~aene 1n tamrl.tnk/!?IOL .,:: c ll/7/'tN
KYFFlIEVY::1'IIf'AVR'P!:FVIIRVIIAALpAWFF4w:lIRU!W::Lp::!:N:X'.WAYOELY.iI"'M'nIY
y'rW. I1 u./l'.'. 1I of!..
I:IM.1!LI~:ILLVI'IVIIALLfRCLGi:NFRIDVEKERWf.KIREIl:IDIF~KLP:SW!>pY.ut114r'W'r..
l rNtM 11~rlWl.e:.
V:::;FIWFY.KnK::KItPRIDVDYIITLH:vIfWINFPtt'FQKII'KT:RF:iYWF"..QK!'fRKROYV1'LLFN
YtVOI~:PI''YIYIKt7!NIwInpn:Vl.%41'NIfII'I:IIVIIY:.T::71YJE:YF~NIM'::Ilk'YiX:
NlwI.IMIVU:YLT::ITXaStIWYL::K'P:~IQ::1T::LDFERVLQIV'.LTDIIDEWGEVORLLNEE:a:NC:LW
'N/KtAyKIHVII1.WIAVPx.let'IA'fNKIW::YNININ~fr~IILI!Ith7:fAY.'fFHJYWP
::ATY.:::X;OKLVf.I::I1VSDIIC(k~IItFKFLE'VIV;:FAFIF.ELVEEII::f:KLliLDFIGLEKIWllt YINI!LWNFPDfUII'KMNIINKIINId~rl<:1S'.l:I::It:aIL::AVNALATINaffYlJ*::II:A
'I'!.IS~RLRN'.:LLNAWIIIH : aY:VD I yfIIIJIPRMFTI"IY I KlfPnPi al::wl'f7 IKI!Vf:M:L t I1TF.A t~~f.~~ I PF::R::n rllrfY aJN l F'ITl:F I KKIk t 1, Pt70PRNLYLCKTPtt:WEPSpu LFTPLFLILYLLfI'1~VP::RCiANSPiSCS
CFn U~tN7 1117491 llJN115 PARNLLIICONKVfFAGVACIEk.ucEELLEIVDPLKNPNIti"SLJCRIPK4yLt.iGPIGlG
YtVe-Like Pr~iccee rRtlA metetyiase KTLIAKAV"''QG~t911~I1~3DPV~1~'"t~"~ll''EpAI~
LQ~CTFAIGPITtPAYRTLLT)INVNOVrHEIPKTL1NPCDTVIMTCCNCNDSLFLAALL.O RNRCACIOOGND
6LVENDOPOTRECVIIIhAIrINRP
CCGRL11VYOM1(G4aNALLLFE'INLSEOERSVIEtIKEDSNENILEKDVKLIHYM.GYLP
VlIR.PDIKGRFEII11VNAKAIKLDPTVDLIIAVARST
KCMtEITTLMTTEISLi.YJItJ~tIVRPDCLLWVCYPGNPEGEKETNSVE7IA0RLNPKEW
TAVTAVDVAGUtDKVLYGKDtRSLENDAEERKTIAYHESCHAWCLL11~NGDPVOKVfII
'YS.iFYYANRCRAPRLFLlORQC,i ,F'.aSIIDKC
PRCL6LCJ1TNFLPEKtIlail'AttKELYOpLAYLNCCRAAEEIFLCDIaS('rIpODISOATIG.
VR~~NVCGrr:NSPOLGNY:YDCR50CL;GYOCYNEKSYSEETA1C'IDTELRNLLON1YDRA
::::i:.l:::~:..K.":.".~ .. .... . ..::'KE:"t.'fv;y; "::i:'.. :. ..:'::r :!LP47Li.i.w:
.. ~y'.H. ;r.~. .;.,~j.tn.,. rrTtn.~ N...p,... :~I~t~r~I~f;F~...m .. ., .
KPFINLINLDOCILKNKfAAPNtIPPPPVRRSVWIJtRYSTFRIGCPANYFKJItHTIEEARE
VIRfLNSINYPFLIIGKCSNCLfDDRCfDCIYLYNAIYGKDPLEDMIKAYStx.SFA71LIG
KATAYNCIfSGLEFAAGIPCSVCGAIttIQAC'ft~ILSDISSVVPNVITINSEGG.CS~fSVEEL
~LSYRSSRFNRQpECIL.iIITPI0LS1~QVSADHSIISIt~IRLKfOPYI'QPSACCIIRNPEG
TSAGKLIDAACLJ~.11ZGWOISPLNANFIIN1G%J1TSDEVICOLIAIIOSTLKTQCIDLE
HEIRIIPYQPKINSPVSEK
CPn_0989 1179552 1179016 CTS77 hypothetical Deocein LRTSLaVNCVLLTIFNLLVNJtTLSPEKP'SGSPISISKEFPCOitllHtEIILQ'S.YAL.DNAPS
AEDSLVPLIJISQTAVSOKHVLVAIhIpTKSILEKSOELDLIIGNALIfIIKSPDSLDLV110rV
LRLTLFEHPYSPPINKAILIALAIRLVKXPSYSE71CPFIpAII3iDIfTDSSIJ4fNSLSI
Cpn_0990 1179A80 1140~10 intC-Initiation Factor 3 ~ QRIENISOVVK~DIIOVKLLSINfltGGLKLSHKATLE , SVAINFKINROIRJ1PKVRLIGSAGEpLCILAiKDALDLARE11CLDLVEVASNSEPPVCKI
I~IfOKYRYCLTl4CAmSKKAQNQVRIKEVKLKPNIDENDFSTKtl00ARTFVEKCiIKVKIT CPtt,_1000 CMFRGItEGYPENCFKwOKNSpGLEDIGPVGEPKLN3RSLICWAPCMfI'ttttKQFJtS rsl5-815 Ribosanlal Protein SAFAI1IILRRNPNSLDKCTI~EITKKFQLHEKD'fCSADVpIAILTENIAELItENLIOtSPK
Op~RLALLKLVGQRRKGLIYLNSTDTERYIOiLITRiJR.RK
CPtt_0991 1110391 1110611 CPL>
r175-L75 Ribosomal Ptrocein _ KORKNRKSLIPKK~K1NKSVSMFXLTTRPCKRttIQYthC-tYtosins deminase SKKSSGEKiINLSItOPLVD
KCpVODfKRIIC.V
YYLEIrGCEKLIt~KDIFPI~pOAPKEAAKAYDQDIVPtiIsCVIVImDICIIAIWlI4V~CJt DATAWIEILCIGSAAODL.0NNiILLDTVLYCTLEPCI1lC1tCltlt?LNtIPRIVW11APD11RLD
CPtl.-0997 1110612 1110996 AGGSWVNIP'fEENPFtPIIISC1CCVCSEFJIBIIIIJ00IffPVPJ(RRENSEK
r120-L20 Rlboeamal Proclln GIfhVNVRItT(iSyAgRRRRKRILKpAKGpyIGt>RKCHIROSR.,SVIOiANItFIIYtOiqtmRlOGDCPIL1002 FRSWIARU~IVASRINSLSYSRLIDK:LKCANISt~iRpG.SEiAINNPOGFAEIAtaQAptACTfIS
hypothetical protein LEATV
KSAERIMCIKIVtLLDOLYEDOCSRt.QKLCEElVPNLTPEDLIQPNDFPOL1!>u~WAPRfE
DGVLSGICEVPAAILAALSpEN
CPn_0993 1110975 1141070 -pheS-Phettylalattyl cRNl1 Synchecase.CPtt_1007 1151862 1151091 Alpha-KSttCSttSLGIRIS1~9t8EIEAVKQpFNSBGDOVNSSOALiIDLICVRYL~GIFRSFSEKCTll6 hypothetical protein LKpLTDKAKI~.SLINDFKTYVEOLLDEKSLVLi.ILSEQAEA!'SKEKIDSSLPGDSQP90GRTBNkTINPLIJ~GPD
ROIAGRASIrnVIFPDKtSINPPNLSKii.KKLPSVILYI'SCIAPIISY
HILKSILDDNVDIFVHtGFCVREAPNIESSJIiHIFI'LWITCDIIPAII~ItDI'FYil9ATTVLIINIDrIGIFGLL
EItJtL9NtGI0KNNIwQFLTYPLITJ1DSLSIi~SFEI1'pRLti.Rlnl RTtffSNWiIRELKKGOPlIKVVAF'Gi.CFAN~ILDFTLFYKJIIpHLIRKUGAFSVLVNISVroAi.IIGJ1VIJ10 FMAt.INSSQHFIOPESZD~Ii TA
ILSAFYNSFfORKTID.RFRNSYFPPVEPGILVD11SCECCGKCC71LCKH'1GMLEVAG7101I.LTVOIFLDPEKRI
TICPTPLSIfSItdiGFLFVLCFYCCILIPSCJ1PLLLIJ1811LAIVi~IIL
NPWLRNfZiVDPEIYSCYAVtiICIERLaNLtfYGVSDIRLFSENDLRFLQpFSI~CIpbPYTTSLRF
CPI!_0991 1112771 1111110 CPn",1004 1155115 1151A79 Ci177 hypothetical protein CTB4T hypotheciul Dlrouin L!>illIIRDGRI~IfitSRRMEpALENi.EIQ.KEISL71TSe1DSIlLINPARFI~RKO't~SSVNlD9CNLSIEEZ
t4SIQPVSNTTPKADKIfIPDSTKVISDSITINKQSAFYPCISNQ.RtifilTlY
EJ1LKNVaJYLLEISCVSXStitIDKAtJVSDPLIAGV~MSFL8A08~.YKSLLDEYSEV'tOKSIL71VL1xN1'IVC
OQRVKELItC.PLLKVPDI4KKDCSDDEYIQJpNCI0I1Y06~QIS
IG PECFLNNL
ANRQMIQQELSSAt70RrW11NOKSVNS1TIESNQILpJITSSIH.STLKELTIKJ1NLTWPID
NDJILVpIIYI(QIQfLNtNNDGDPLT1ITL.IJ~dd8E8VIDIIASSLVN1iG11PLItLFY~tALSN
LDIIaWKVtINAVNIILPfSRYEAZlIVIKSPKKNNisiYFNDFLLPLRWIItDI1a~11fIDSDCPn_1005 ERKqfKLt,ISALSLCIFGSKLVPEEASRYLYFNIQTKLENtINOKKPLSPGQYLTDAYEELCTAIA
hypochetiul Drotein tiRLISKYPNGPL!'KAN<RIVLtNLRRPYOPNILGILPSLCC1'LKiJGK$IDIIRSPSPV'1'QNRRPVRLfMWIID
PLSAKICPI4AAINVPGTPITt70PN1'ATADDIIAKPSKDSNPLNNY
SSILYFLGFL.NaIGNRSEVfLVLNIONRISRKERARSRVIEEALEOEBNAPYVNYVYOSVLVApONLSIIAQEGQANS
SAOTYfJJIJOFJ1LYQWSIPKNKIJtDItBSSYLptIp$
AFS!'PCPEELL.pNLESItIGDZE1TADPFSILpEtFHKPLG&SFPL?KELKEFVCS1LKEONOAIGASROAIONDIS
SLCNIU1QVISSNLNPt~NII0pSI4VC0ALI0TlSCIVSLIAN
KLTALKDIFFAKIUCILTAN~C.LLLHLLSYLIVPKLIERTNPNSIWVSKDGLDYVSVPII
AGPJ1FPSRGfWDGfSLKLLLTNVLSP?LVARDRLVFVSNIELLSKFVNCLKKNRQGFSS
LKSpFKDDIECK'IEPTCYLNELTEYStp00JI. CPtL1C06 1156197 1155990 CTS19 hypoehecical Drocein CPt>_0995 1115515 1114115 TKVNFPINSITTL.GTLPIVNfINSSRPPLEPildfPKIGI1VL!'SIYELIiAAIEIRDdM.
CT87A hypothetical protein TCSQOLNDNTNIpOQLNOLTNDIKYAIVSAGAKEDEITRVONQNpNYSAQRSNIO~LV'f RNLIwKRNtvLTRFNFALTSLLVLALIFYASINHStJiTLKCASTMSt7ASVKLSILYYL71QTRaNGQIIL5HA51NI
NIIQpQSSQDSSPIKT?'NSICS1'VtJQLNKPLC
LSLKAEPLIPOLVAVATTSTLFANQNIOtEIILIrQASGLSLKStJOIPLLLSCAVIIQRILYA
NFQWLHPICEKISITKEMiDRGTfDKEpGKIPALYLKD01YLLYSSIEPKTI.TLiaIVIwICPn_1007 1156689 KDPttTIYTNEKLAF1TLSLPIGLJVrt'OfFANDSENLELKEFFl7l9tEFPEICPNFYPi~tPFS07519.1 hypothetical Drotein KLFSJIGIJlO'1RLSEPPKItIPWNJ1T0LGLSTpVPpRILSLi.AQFYYVLISPLACNAAIILSALwYKSLNDEEKD
VSCNECNDYPEVFKDDVSAYVLVTCCQNSSECKIQVt?IflfDPAYIS
YIJ:LRFSRTPM'WYLIPLCI11NIFFVFLKACiVLASSSVLPTLPVNAFPLIVLPLLTNYLLTKARDSLDES
YAYAIQ.Q
CPn_t008 115d901 1158227 CPn_0996 1116592 1115519 CT81A50 hypothetical pcocein CT879 hypothetical Drotein VLNYSFII;MLKPNYVLSKRLYRWVNpLIKLCDLVI0JSR3F5VEWVPISALLLIf'CCIQCA
ANpLLWICVLIFRYLKTAAFCTLSLICISIISSWEIVAYIAKOVPYDTVLRLNAYDIPYLSWKVSLVPFLLLFSFLIfF
LILCPRCKCYALLtrGSfFVTLYVAKYV1IDETLYVSINGSGL
LPPILPCSCPVSAFSLFRKLSDNNfOffFLRASCASQSIIMPPVIJIVSCAICCLNFY1'CSE:VSPLLAPCLFLt%7V
Wt 1QEEEtiVKGKEpLRLSEDLDApRSAYEDLLLTKSQOCEFLOAR
L.ASICRYQTCKEIAIeI~M'SPAL.tJ.p'tLpKKENNRIFIAVDHCAKSKFDNVIVALKGIJtIEApCLDRELTf7C
QEt.LKA?.YCKOEYLTIDLKILADpKN~ILEDYAELNNKYIELVSK~DV
ISNVCIiRSIIPD1'IICDIVKAKtriNPISKLPDSLTESSSPSSpRPYIETLDELL1PKITSVFPWVAEPSVCCSOC$
ERVDVSRWVSAt-0EKEESLLRLPNEILVEKpIICSDYOIRCptiL.G
TLFIIGKSYLKTRTDYLPWKOLVKQSLIWSHLpETLNRVAIGFLCITLTYACNILGINKPRLLLDNFTALERRCEELVW
LWpKITpINELi~LVCK3EElfVSVEPSJINAEtSCVEEKDYK
FRKSIALYFIFPILDLILLIVCKNTKNLpLAIrLFVFppLVSMNPAARAYRFSRCYACLY:QWEpFLEK:,E'1L3LYR
KKLFAVDFKYLTLKKKEELTKpDISPDDISMICDLLERI
EILEEELiHLEELVSRSt.SL
l:pn_0997 111d699 111Tbd1 mesf-PP-Loop supertamily ATPase CPn_InOU 1159095 t1581Rd AYIttM.GSDLLRODKOLDLPFASLpVKKRYLLAGs"(iC3DSLFLFYLLKERCVSFTAVHIDmaptNfthioninN
.leitutpnptadaae Htlllt'o'CiIWEIIKELEELCAREGVPEVLYTLTAE00CDKDLENDARKKRYAFLYESYRpLD'fRLLHR'/I41KN
NDR'Nt.'C:ivRNWKOrHYPpPPNNSpFJILKOHYASQYNILLKTPCOKAK
.lcxl:FLAHHANDOAETVLKRLLE,~.JWLTNLKAM7IER.iNEDVLLLRPLLNIPKS.iLKEALf YNNC.YJITARILDELt.'KA:i~'KCYITNELDEL.~.pELNKI!'fDAIMPPIfYI:SPPPPKTICtS
OAR(:I::IWUp::NEDERYLRARNRKKLPPWLEEVPCP.ttITFPLLTLCEE.sAEL::EYLEKpIJtEVtt:IY:IP
NDtf'LKL1:DIMJIDV::I'.IVDf:YYCV_'CPMMIr:EVpEiKKKICOAAL.~L
..WPFF.~.M'191VD:~tt:ES.PCPDt.'LI()()Aff.CKWVNKKFFHNAC:LAV::RHFLCMV'IDHL::R~tIL
:aAILYfr3tfIt:EI.:F.IIt:rIRApT'lt:F~WDpF'h.IF:W:IEFIIENPYVPHYIUiRyMIP
a:ATtJilIRNK t V t t KItJVW t D 1 J11~?I t I-P I Ef'M 1 tllh 7KKta:1.17DPKN1AJEAR'fCDtK~G::A(~WEIII' I1 tTETt:YEiLTLIJ1D
':Itr ~19.')H Il17Ni1 IISOSNd ar:_Ir:l': 11'.tr:'!5 115NH.'7 tt::l1 A'f1'-rls'paui.nt =1110 Wtrtt.t.tGreLr'.:~ n~tn tn:r i..rl Ivrt.rrt LI:a:NKtTI::KUKKNKI'EPKKNFI'IVFFFLLFr:WPr:P/At'(INt'1.11t:KK.VtVn:FatIQLEI1'MLI
LIJII:a.l.l"tVI.FP::W ;:fl"/tVII.WCN1'::RKY'/lPVILRtI'1.4'AIf:ALILFVITfIR
I.VNLJILIVI'Efk:IIKIALNDNLV
>F(7LRFRLRIOTQEf'fiIJtYIIYI.I:LIt)t7t:HRLDLDLOETU.':FFQPLfiR:l.sit':rl t.'t:Fl.1.t'N::IKMI1J11'NPEPAYDUf.~.KTF.PIFFPIJ1F('VI'If:PA
N:a:M'fI:KEVTtJ:atldF:.At:li::PIPS.YAt:;YN:E-I:I.::VIa'HPLVV'1'f:PIICP'QLINL'!t'fAtJ::lNt:l:ta'l:.I:RIIt'f:VIttAYIAF.~.t.tTt .tl:::::FFIXeLtY:NFY:L1~LERLPISIAL
IC:1 .~t:P
YI1't.:R::1'F'ALRTW:::DLYELICKYt::I'VII:Ir::aTLKREIJCttLYpQVEVSLTO1JJA::VtNf4IJ
:.:I::1 AI'N I, a1' I.:
tTtJfFJIAV'f 1.W .'QVt:: f WR
LS:i.~.LW::EOfJERF:aJLtr::VRLYREPJe1KY11KLVPrIRDtl:
VI1VLEIttJa:Rl::(j111WYPNNOEt~::RSLEKODPEVftNIWFJItLIKEtWfAFKFNII::L::FKA'an_14 11 1 n.n um t I'.~rNt:!
111YY~~-1:1 DaLppELLC3RRCfRSEfYA uEKKLETKVpIKD4.'KCLF,'~QDQDSNGFOKKSPL<C'C
'TN'il Irytnchetieal proton CrSRKNRIAKAAOAVPVIPPP:n.
YF'a3YLLTKOGIG..ZDP.:.4YCCNKDSVfSI'ORELDA
4V~tY;n'VKORNL'ILLRESPFArI Y
:JL7QiLITtILAL ' C'IL
' ..... .V3l~iLGLSfX~LA~iI
. WOLRLETLKV~K:;ItRC1G10S11:~1~fIJ18k . ~IVI'~
FItRLKNYPHipf.irFLPC ~
xtJ<VIXfWGLEIM7CIAYLLIILVRrYLRLt:KEEptTP'fKfNMSPSYS
lt AMfALYt:IJ IL
a LIiW IKCLaDKFNDNLENICPLK>rCEGIIRKI
tTL LO
Pf. T.'.P fALPW P,:P3CI ..~sP I
L~,.'71IKGLOPAIESCNAALRCAILFSQAEIYKLKGKL1'K
lOLOIG.KSFOROOIIYE~R
s'OF.LL~SIESSFEALSRLIlIYIRtELDOVYLH:.LRG
~Pn_1012 1 L5Z3Z0 1160121 y:.:0-AOC c ranoporcer penlleass "FF'"PPP.i:IA.:.~.fI4SLPLLL t.'Pn_IOCI 1171270 L171b9H
YMKYKFIfYFVTVF
~
LLFIS~tCffSRMPPTF
~
. ' . '.. w - ' . ..
. ..
AIP. .
.LIT; ..
'' . ...~L\iL::::ri r "lF r v:
"'.,i ' ~ :
: /
~
' .. ,F
.. rT
. rc .. l .dlrrir. .c : err.
. ~t:t:.
_" ~~.'. ~-p.,; 1::: r,.',:.
;;y,RIfJI.\LP il::
.:. .~..;.......Ia.i?' :y.'-.: .!.'111 r ' ~ ' -' "" "
~;.';..
i r~
.
TfGKIfKIa~EGL.EK:r"fKtvIMAYLGKDYAK.iITVFkWLYFFNPfVSKFWP'aiir13IJ01a ......
..
:r.: ~.. .
.
LGILHLLSRPNYQ1ELAFAGLATLSILT 'L.~LF
~
' aAGFaALAGiiNASOSt:
ECYSOAWAYufTAVLRDXDPYPHYYAYICYTL.TNENLEAEKALDfAWVRiIt)HIIILYNR.
:IIFa GL.KIAIC
KSV111LKALSVLALIPINLIPWKDNSKSPPtaUD~R.TSL
SLK
N KEEILDIRKHK
7ITLLICKLIrSLrRYKRN
TLLL~WI'PNPNNIPLYAGVAKCYPKpMGLDLOtQIClIDSSSAVPNVLft0Y0llALYHAL.G
INKT SIKQIPIOIVCRLZDSSLODFLYRSGDPIYKFmLNDKVL.GFCiJ~t~RTILfRIi.ET
' 1022 1175709 1171:16 CPn IGHPVIOCrLBaICOLP .-WpNCVIIPSEVIaIVSBDLISt>NLtJsRtIDFLYCAFYNI1931IKL01CTt63 hypotheraeal Drotsin TGPQLIVrfKttGTKASEPEIVGinKAt.OESIIFSKDNP~rKLYAKLTKSIPItNLYOErSTFIIfALKLGIlIIfPV
PSAVPSANITLKEDSS1YST115fiILKTA1~EYLIISCfAL~SS
YI:pWEElfPLLIpSODPLSKDLVDKLLE'tIIKRYPELASEVAXrST.NDLYNPSLPEBD~fTtlu.ISLAtGOIILA
TQQELLLOSINVHOLLrLPPI:VVELBICWDLLVCLOIAtTITS
6PpCICTOSRSEQrLP0088S1I05ALSPRSLCPEISDSKpp(~AIQTPKD$AVPl01SGP8 CPI>_1013 1162209 1163621 PEl~pIIMSLSQASSSSQRSLPP0E8APaTLLtOGKASSriPLSOrSABItpItGLTISKS
lulK-ruearste Nydracass NELY1~POODROGRECHDRGDOE~KKKIOOIRGLCVGVAEEl0~.0IMLIrSD
RENSWNRGNIDNROEKDSLCIVEYPfDICLYGApTNRSRNFrSWGPBL?tPYEYIMLVNI
KKCAAQATIpDLCfLDSKNCDNIVAMDEILOGCFECHrPLKWIp'PCSGTQSNIIiVNCVIAONRPPAEETSK>CEl'I
FIUtKLPSPHSVISRFIPSKNPLSVCSSINGPIQTPKV~MIiVl1 NLAIRNItOGVLGSKDPLMPNDNVNK~SSNflVIPTA1HIAAYISLIC11LLLPALDtIIIItVI.KLJfARIf.ODAG
EANELYHRVKORTDDVDTLTVLISKIIRI~ILRfS6DICD9tALT.I~DtA
DAKVEBERHIHKIGRiHL?IDAVPKILGQEFSDYSSD4RIICLESIArSLiWLYBZiIIGATAKEICVTI
ITQIIEKIIBlQRHLQEISOCNOARSN
Vr.TCWVPEGPVafIIHYLRICETDLPrIPASNYrSALSCHDALVDAfIGSGTLAGLTKIVGKLLKELlOTIFIYHLRP
~
ATt7LSFf.CSGPPG'CLGI~S.ffPFNEPCBSINPGXVNP1'CCGT.O~CAQ~~IOTVII~. .
' CPt~1023 1176005 1176331 RCNpE~IMCpVIIyNFLpSVpyiS>57,01AFSEfFVKOLKVNKARi~fINNSLIG.Ho robust hoKbloq prestnc in 4swbank/E~I. as o! 11/7/95 LAPVLGYDKCSKAALKArHESISLKEACLALGYLSI~ETDRLYYP8t~5~Na.DFLLIFINKKWrLSIIrFATYCASIL
4AVTWAVPLSEAPCKIQVItPVVIiL.QPOEEQ
CPeL1011 1165156 1163732 GSVIYSFfIfPYDYGYYYPL"LYCYTRlI~OESRLCY'IRrEDCI'IIYLCD
yehH-Sullace Transporter AyAgTLCYGIVKVpW1r101rIpTtLYTSIKD3Y5FPL1'FKKI~pAGITIIfiZLIrPFAIAIJ1ICPt1_1021 CVGVSPIOGLLASIIGGLLASAHDCSNVLISCPSSAFISILYCLSAIDfGAItALlllrlLLitxerD-InceOrast/rscombinase ' CVILIAFGLTGL.CTP'IKYNPYPVtIrGLTTCL71IIITSSQIKDFLGtpGIINIPADrLPIdftILCDFSIJLSVDI
GICQQSIAA
IfIFPNISI:CSLKIAPLPILKLN8IJ1SHTNPSTQFIII
IAYWDIILWIWDSKSFAVGGLTLLINIYFRNYKPRYPGYNIAIYIA'1'fLVYIL3.EIDIPTIGYRQDISSFLTISAI
SSP~ISpNSVYIFABELYRRItLAITILIIRRLIALKVIrLrLKDOG
SRYCfLPTAIPLPKIPOLSITIfILOIJtPDALTI71VL9CLLTT.LSAWA17G9flGNNK)iSICLLPYPPIIEHPKI
NIOtLPSVLTPOE<tDI1LL71VPLONERIP11HIJ1FRDTAILHTLYiIGYR
QLVApGVANZCTSLfSGIPYfGSLSRTAASIXSGTTPIAGIVNSIFICFILLL.LAPLTVVSSIGDLRLGHYSDDCIRV
fGKDSKTItLVPLGSMRGIDAYLCPrRDOYOIDDSIN~IL
KIPLTCLAAVLILIAWWhl9fSEIMiFIHLITAPKKDIWLLTVFILTVIfI'IITAAVpYOIII.FLSTRGHKLLRSCV
WRRIInfYAKOVTSKlVSPHSLRNAFATHLLLSfICADL1WI0~
' AAFLtTIKOMSI7LSDVISfAKYFl~.4DFLSKAIYP~tfEIYEINGPFFIGIIILStLIDGi.NWDfPRNL . .
RIASTEYYTfNAADSLIIOf!
DIEKPPKIFIIIGKfRVPTII7J15 1025 11Tt266 1175579 CPn ELICVDNIpSNIKSALi.rACJILTtiLt=RKTSfRiQ.Y,.
ppi-dlucoss-6-P Isalasr~ss GillOatSSYRCI!lIDERKRFIDGOSTICILOELALNPLDLTAPOYLS1IERItKFSLLt~FTF
CPn ' ' ' _ A
CT857 hypothetical protein tpossibieIRJSM
IH proceinl SrATDtI~DAILMLISL1CERGLHESIBJINOOOQWNYItsttPS~PAtJn KNIMD~IFSrITSVRVRSKVDNIILEVitCJtLOi.CALrLFGYL1IVPEHIVRVNKSAI71LDSSIZGCACDIAVPSR
VGORLI~TLTKYR80FZTZVOICIOIDSdIJ3PKJ1LYRALRAYCP
ANOfLHWLVCTSNIPNADHttILVEEIAdISOVIFFLF&ANAIVCLIDAIDCGFSYIVItICR'10001V1trISNIDP
0~.A6YLDTIDU1KAT.WWSKSCTIIETAVNWrA0F1'AIO~rLSF
IQSRTLLL.WALiGLSrFLSAALCNLTSIIIIISISxRLVItARRRRLLT.G71ICVI11VNK7CKDHFIJ1V1CECSP
NOD'fQIttLMIQaiESIGORrSSTBNVOCWTGIAYfiI6YfL.OLLpO
QPNARC~1L.RGSALISIfsIRNFLJCIfPTGVIPYSS0T.IlFPJIIGQOfGI!ICS
ASAImOIAT
AM'PLGWT'l~fiItdlIIITSWGIIPALIVPSLVCVLVJIfIFCbprTLRKRGSTLLVmVL.
~IISFfOCLNOGTDIIPVeIICFIfDIS
ARVOrBTSPVIWDEPDTNC
LpSAPPKSWIIfIGLGSLLIVPVWKACLCLPPFNCi7ILLGLCLVtd.TSDWIHBY, NDKSIAGDG
HLRVPNILTKIDISSITFrICILL71VN11LSFANLLTL7FSIi5~CIFSRHVVAIIfICLLSSFt3Q1'fSSOKLl7W
fIA0AIALiICGSEN1NPNKNrDGNRPSSVLVSipIIIPYfTGdLBYY
VLONVPLVAA1llOfYTLPL17DTLWlQ.IAYAJIGTCCSILIIGSAA6VAFlx.LEKVDrISIYFINKIVFODL.C1V
GINSfD01"~1SI~KKAL~VI'>rfL'DGADASNrPfJIASLT.TLPI~IFR
KRISI~tIALASYlOGt?sYFVLESIIIFrI
CPtI'1026 1175961 1179177 CPtt_1016 1167027 1165595 1cW
CSfGFCKICI~IFIAVRSRDFIJ'IHCIL71AR70GfQVVKSThGUtVFYSLVS
Ct551 hypothetical prouin KREVE10DODKLG7IINrGLLFTSSVAGFSKIX.'flmllAYDOIlIffIOILISLII7fAPLPNKIi.L
lGi~.SpQ'IOpARLpI,YLECI~TINYCpKVLSNYVRgLNDYNAGLTrYRTCSAYIPYVLKCPn..102 LSEDGHVF1IVDVQTSOCDIYtGDEILEVDCI~IRGIESLRFGRCSATDYSAAVRSLTSA.NO robust honolo0 present 1n Genelank/t!>aL as o! 11/7/95 SAAfCDAVPSGLA!!<r.KLRRPSDLIRS'fPVRWRYTPatfIGOFSLVAPLIPDIKPQLPIbSCNNIOSVSSPPLSPf IIVrtrtDIVPSS~S~.IQPMVLKZSILIrIILVTILGIVLWLSiAIG
VLFRSCVNSDSSSSSLFSSYNVPYTWCELRVpNKpRfDS~RDiIGSRNGPLPI'FGPILiICOALPSWLTYSIiCIAIA
VGLIGLGILVTRLILSTIRKVDitICYDAAVICEEpYLSItIRQJS
DKGPYRSYIFKAIf~'aNPNRIGFLRISSYVNTDLECLCLDiIKDSIhIEL.PCEIIDNLCKSDIR6IRdWMV~0'WIL
SEE~SfIMJtDPEYLlK7!lIBRLIAELEIEfi011LVAQiILLKON
' TDALIIDpiIDIPGfiSVrYLYSLtSM.TDHPLL7lrKHRHIP1'pDEVSSALtR9pDLLEW1TORVLYPIt N118LSRL1FRJ1YKOKFPTGALCPYRIEDIJUCI1~QILFLICP6CIAMVKSLPGLZ
DEOAVAVLGI:'1llCliYCf104D1AV11SLQNFSQSVLSSWVSGDINL.SKPNPLLGFAQVRP1IPKCFOSLVHRFA
PRSRITQTPKYEYNSRN~1EDDKVMVCARLIIKEFl811ViGiICSY~Xi HOY1'KPGF!B.IDEDDfSCGI)L1PAIWfDNGMTLICKPIAGAOGFVIpVTFPMISDIKGLICEMVALKITLPLPGVY
DfLVOLrPNLLTACSWKDICI~fSYPIfLRPYL.SVDIICKRLI
SLTGSLAVWtDGEfIENL.cVAPHIDLG1TSRDLQTSRPTDYVGVKTIVLTSISCIAtOfSYQLrCEICLKLFTICSPL
DpAWRLISYYRNHIPAVLtIBTCLPPPE'IbGSVFVti.PRT~Elt EEtn'SPO~fPEVIRVSYPTTTSAS
LLW$QIEVLATRYLKD?FVRNS6WlGSFBI~IftSYNEHCKEISIxItIltrAmYCI'IIHSLEP
CPn_1017 1168997 1169975 CPt~1028 1150995 1151999 lyre-Necalloprouaae indhC-Nalats Dehyrogsnase VIINR!(LILCNPItGFC9GWMI0vVCVaLI~PIYVKIiEIVIWRNWNALMKG11IF
'JEELVDVPEGERVIYSIWGIPPSVRAGKARICLIDIt7JITCGLVTIfVtISAAKLYASKGYKIFFLKOVRNAfKLWR
VAVi'C6>CGpIAYNFLFALAHGIriIFGVDRGVDLRIYD11PC?CRJILS
tLICNKKNVEViCIVGEVPEHLTWEILYaWEALPFSSOTPLFYITQ'ITLSLDDVOEISSGVRH6LDDGAYPLLNRLRV
TTSWDAFDGIDAAFLIGAVPRCPGNERGOLLII'QIpOIFSL
ALt.KRYPSILTLPSSSICYATTNRpIUILRSVLSRVNWYWGDVNSSNSNRLREVALRRGGGMLXIMKRDAKLFWCNPV
HfNCWIAMKIIAPRLNRKNFfHIMLKLO0N~M8M.IWM
VPIIDLINNPEDLI1T'NIVNNSGDIAMfACASTPEtiWQJICIRKISSLIPGLpVl3iDLrAVEEVPLCCVSRWINON
HSAKOVPDtTQJIRISCKPMEVICINtONLFalILVNSVGMICSJ1VT
DWF~pLPKELRCS
GRCKSSAASASRAIJ1EMRSIFCPKSDEWFSSCVCSDHNPYGIPEIM.IFGrPCNIGPSC
DYETIPGLPWEPFINNKIpISLDEIAQEKASVSSL
CPn_L019 11698~5 117062 No rotwac honwloq present tn Gsntloank/EMBLCPn_lOZ9 1151987 118:511 as of 11/7/98 RMSYENYOKNSWLRS:.CLL\KFFSRLLYRVPF3FR0DIYLfSSLYLKYPRLfFYDLGKYNo robust troeoloq present in Genebenk/EMSL ae of 11~7f98 VYSLRHCPYAKLCRLIx3A.f:.LKECRJVYCETPWS4'L1KICQAFDITSCDILYDt.CCCLGKVRVFVTSTMLWGVS
NRpSfDBLSONJIrKIIIrNKORFCFIrCSLCCFGFVFALrLKLGSRLA
~.FWFSNWRCOVIGIDNDPHFIRFSSNMRKLSSGFALFDTEEPKNWLSOASYVYFYGSPEISLSTLGtIiAffGIrSVI
CASAIIVpFLtJIKCSOCETSKt.CCAIKN'lliiSSLJftSLLVS
SFSRRLWEILLKISEMAPC~IVTSISFPLDSFSRGXECFFTfNSCSVRfPWGKTIAYKNMPFrTANVAWTVAMLSSFLG
SLPYMrKLrHTVLIFIPYLSAT11LILLFLaTSfSGLFFCI
tRKCS
PVWQIOE.iIDYRNLLCFRf3JILRpfTIVVIALVDLAICfWLALDSPYIIIrHLVELADIH
T4ISfLApIIFVLIVPIALILTPAVSFFFNfSFSFYLAKpEECKALVK
~:Fn_101~ 117:116 117Dti)~
,:THi.n hypncnacLCal protein CPe~t07D 1151901 1192913 tHRPNtMTVrYOShTPPPOCEFDIFVDCNATEEAY/MEVQVALPACFa?YAL1LMTSELprsvllcteA D-nstino ecid dehyr,Jqeneae '.FGtL'f9.~.ECAL'LVALPPKEKPIQEEpFLVKNDIWF3f3LPNLKPfI(>>CQ'!SL?SHRNPFKVNFNRIAVI/
:.1G:YAC44V74MLLLH.ipCfATLDLFDPIPLGIfL'rA.4~1S3i.LIJfAITGK
LAOQ:7f::::N.,~1't:Kn\7fET't$SeFPFfSCK.\PECD$.~.'IDKTFTV~PKTQEr'rOGSAuOKAL.IfPP
t.ADCI:INATHALITFJ1..''K1LLNVPIVT:ar~ILRPAIDEDO.WLITERVEEFPKE1/
::()AOF71VR::Y::::rTIKEtI:.AKEKV:~bTtt.~.AE717KH'frlrK::DATL::PIISLY~TLMKEVPQFI
rWEKAfd:FIIPSNVTPPNLCALFIK.7:YrIdRLDLYItf:LAOAfNKU:71.'fYDELIEDL
.\L::::PX:~>UKttFFJIHDLRQcIDCYEC'fOECEE'tKILKTfMK'7YF..:L.(xJT::::tYl'/'fESITPI
ADIEEF'fDtItIVTIt:WA:aLPELKDNfVNKVYt:qLLEt.'.WPY.DI.AML;iF3TNANKY1NA
IPDPI'/FFAL::E::QI::%t~4:ItRVTNLDVLRtr'fEWYII1LK::RANOITffRLEEREIJIERENP~iKITh:
it.H\TFEIINUPEFTPDPAtAYOtINPP'/L::LFNILK(MQVIJh'1'.kilR
r&IL~
AIIEt.AA::I::RyAKY.INWU:1\'PITtGTL:AfAMI.r;EL::C:D::III:FVC'RI:d:PFKDATAKItI.rV
I::PLNRILt.W6'Iva:IL:::K4LLY1K:IT,IDIILAt3AVLRY.:.TAYIAKEFLtTI
'PFFY.t: (.:YVF'P::l.::pl::'1;.\A::KVIIF:L.:iE::
A't R A~':\E'fRK r.YFHMItVU6l.'TRT
t EEYKDNW
N::MIJ!IFt.I N I llJ'fIJIWAH::LY'; 'frr_ l4 t 1 I I N.S~i7 I l HID4$
.ir.:DAt'Irnrnr..ICnirtinr ArN.vfmtt.rt .'I'n_10u Illt....t ll.l~l~.' IKF71>lffl:R'rK::.~.KNt/.TIALNiIVV.~..~.IIrYY:IFa.fI~INIAAT14w4:AVlhbtlt.'It:Ft i .TN.1 hYlnlv.m,:nl Wwr.m NFFIMITFRII~:TLRrULJtl3tl'IMY::f.EC:ft:l"fI':FTIt~:/Wl.t.'~tLF':NiY:YAYfTNDA
n'r:N::fM:::1ll:~tA:.F.YLtJkHJ1'1'ft'I/APR:J;I:::::\'IYI::Y::ITVAtritllv'K::LPK
PF1'OKIHYF'YIl!Fl\T:NP4YALUY:::ILIYI1If71FIVr.Y':III(IA::TtIIVIuTtI'KIIf1.fffti L
fV::nJ'K::1?I'ffMIIIK'rFtIATPRERI1.RF'1:::.~.FF:xJUINr:~JAGlr::::IWNLF::91IN:IT
EA'PAFFFY.LAYYK'rDFI.IfSIIAYPKAOP::Lf:::l:::yl.Y,TfML'JTLWAYh:IFY:A\1'1S:7.YIAI
WP
:'.KAIIAnJIa:rMf*::1'h:KT::ItKAI.UKN1.::::KVF::A.'KIIFIYrIJIIyIILKLFn.TID.~.LY:
:Q~LI::W7tJA'r/4:FLiY'.I:fI'/IL.F::II.IF~:::t.l'uINJINJtItIP.'.TH:VIJ)II.V:KWr:
EVUTIV
I2~
':LITAVL::.'.'ItIL:.1IfII'IAEIPF;AAKN.TFPEIfRFFLBRGVLLRPL.~'LN:
fOEEDLRIIY3tIL,;dIL::;
. .EKSPSV3LYIT53VNOLJWIL f'~3 WF.~"N.WIfITiL.SIT~.VNVf.PAYLA:3AAFLFKt.,~,h.:., ...._ _.. ... .._ .....
CYPKKCISIKAPLANITCIL.L1IVY CPn 104 ;r 96i3II C~57)4I~. , (:IPFYIDAGKKKKNAKTFFAKKEI11GNC!'IGLL.At.TAI~
NALVLLAf ~
~
. th~
::tWLIYA~LKYLF ~B
b 'bioD-dethiobid FLFL'IYHtIKt NRSPlTYFRANFrfIpRI
IIVOIL7IGYGRTIV~,,AtLARALNAEYWKPIQAL~.CtSDSNIV
N8L4GAYCitPGYALJIKPLSPHKAAO Lt>NVStEESHICAPK~fSN:.I
ILTSCGFLSPCTS
Pn_ 1072 l 19515 7 11955b5 KRL4CDVFSSWSCSN1LVS0lIYLC.RINHICLTVPJWRSRNWItGNVVlK3YPEDEEHNLT
CTJ7) hypotnecical protein OEIKLPIIC'LAKEILEITKTIIS~'YAEarItEVWI'SNf10CI0t:VSfYfPSLNLM
ItIA'n'rrYr'':.AFIr~%a~F..~.DO~tPOqPf'RTFr:fD.:ALI4AKIFNPtITVPYTSVt.PKEL
:':I.....:':~ir:iL1':i.an:~.'ii.::v:..
:iW:Y'LI':.:i::.: ridln ~
. ! . : -' : ':~~'.
. .
..
...
r , .y , ~ i., OtoF_:d-Oecononanoaca iynthase_--~!n='.'fn:..'P'. :.-IIF::.:.L,T."
. ::.'!:r.: elq:(fr...
..r i:::::
Atf.Ftr)PENAEPAKVN
pNLOQQFLIE7tLARRKSKHTYRS1SLNSHLIDFT~IDYL.GFASSPELRKIYITKLHAIES
LGAl'G$Ri.LTOHSrit~lIEiI7LAAYlWFESCLIINiGY'fAM.CLiyJILJI?OODRILJ~L
YtIGfIYDCIRLBK710SFPlliNND~tLEItRLASSHLCRTIVNESVYStJIDiVAPLOAt CPn_1037 118 CT372 hypothetical psouin SLLtaYSAYLIVOGNAVCV1GDOCi0;LV5AG.~hODINL~ITVIITF~.KJIfpTIKiMIIIGS
NNKKKDYSCE!'LTTtTIVDSIAFLPSEENFCYIKTILFFRVRIKHYA!'FYCEPfIISFRFLL
'ISSYAETPKCrBCHYNAYKJ1RI0KKNPESIKlSAP8ETPNIBISLISPV1NIFSILKDrLINKRPFIYTTAOPPHAL
T11IELJIYEIOiQRAPNORENLiALIIINFREKA~IiG
LSGLCAL APN
u LOLJItDNI'TTPIOSIGVSCSadtAROAAL.0I0N9GYDVRPIVSP1Y1LQREELLRICLN
I5L0FSILPOWFYPNK71IGOT011L.EIPSWOIYlSP
T
' t N T104LIDViLGHTLEQIFiCNVSSL
t p CSHPMCOCISVSNLLTSVEKA
NGVDI1ZKIAAGTASSINDIfifR1L41NLilOLTFB
a ' ' f O
OTFPGDPLTLJ1IGOYSLYAIDGTLYDNDOYSG'FISYALlON11S7lT1fBlaSTaAYLOITPN
SEIKVOLGFpDSYNIDCTNFSIYNLTESKYNPYGY~PKPSCCDCQ7f8VLLYSTRttVPCPn,1044 t~i 'bioB-Dioein 9ynthase G
.
AKLBOIEETVSWSLEDZREIYHTPVFCLIHKANAILRSNFIJISEtC':CYx.ISIR1GOC11lD
EONSQVTarSLNAAOHIHEKLYLFCRINCATCTALPINRSYVLCLVSENPIJ~IWA
ICFATNKVNAKAISNVNKLRRYESVtiGEATICP'DPYISLTPDFaLYIHPJILLtPEltlfl'S0 CAYCJ1QSSRYIrCHV?PCMOfIVDWEMKRAVELICATRVCI:.71AWRNAKZ7DRYP0RVL
VYGLPANLSL
IMtSITDIaAEVCCUGfC.SEEOAKKLYW1CLYAYNlB1L06SPEFY)rl'IITTRSY6DRW
118773?
IZ.WVNK$CtSl'CCOCIVDICESEEDRIKTi.NVfaTItoNtPESVPVNLL.9TPIDCTPLODO
CPh_1034 L188599 PPISliiESILRTIATARWFPRSlRIRL.JN1GRAFLTVEOOTLCFLAGANSIfYCDKii.TVEN
Predicted OMP fCT77I1 (leader (181 pePCide1 KTSWOKYKKYLSYSIWOKI1IRYVlOCIWLFFTILFSCSSFYASCRYAIVRSINEYACOILNDIOB~CIIIQi.G4IPR
PSfGIERGNPCYJWNS
YDEC4fWLILOt.DCILLOCGEaLSHBItrKSKAIOGL.OKOCTP~F~IfiF3IWPFWIEIOEH
APL 104s 1199603 119A90i CPeI
rTWPiESAIFLLIEKIQKOCKTTTVYTERPKT111~.TLKOLHtIIINSLtDTAPOPO.-LY,ISY;ILFSGDYNKGPCLDLFLE1CL.PLPAItIIyIDNQKDrVL.RIf~t.COKYCIAyeoni:ernd hypothetical bacterial Pla4t membrane protlib ' . LLLVLSM.VLSSKLIPI'LTFNFIIPOCiLILYPLTFLI
ALLLRfIQIDiE GTLPNNI'SNRKTLVFSYLSSTI"1 FGITyKApELHPPIYFItcIIAQVQYNYSKIfLLSNHJIASDWNGIPCP10LARVNIFSAFIJINLf.7lSSIVOIIMF
fPVASPEfIp'1'J~.!'17LSPLIIFL
CPeL1075 1190081 1188570 ASLL.71FZVSOOLDTVI'YTF'F~TFNSSWLRS1E''&71iIS0IPDTP'IVO'1'GILY!'OIGIS
' aroE-Shikimce S-t>.hyro0enase IRKFLQIPSTKI11N1YpLI!)pP
FPO'1TJLIIStYSYtYttITFCVt.TrPL.FYL11VM
WQLPIJIVPIVfiLQIWRFSNIYYGVBV!!t CATVSOPSFCEAK00ILItSLLQ.VOIIELRLD 1016 1100675 1199590 ITA CPet LINELDDOELHTLtTT110NPILTFRONLt~ISTU.iIIWa.YSt.AIG.EIIOBDIDVSLPI_ LOTIRKSNPKIKLILSYIiTDIOVEDLD11IYNHCaTPMZY1CIVLSPaISSEIItNYIKIGR'TtypCOPhan NroxYlase ' LLPKPSTVi.CNf.'I'fICLPSRVLSPLISNAIBtYJIJIGISAPQVAPCQPKLEELLSYNIfSIC.SEFONSOSLQR
AYSTPYSYYRIIL.OKENKfJDOIILA
VHYCERTLDPKIfILRIALKL.IIpSLSL!
RLSHLSiff4FLZSKLGiJi7ITYIKFPVTICEW1'FPSAIRDLPF~L.r~RH~IS
VVS1'PFPNRNWYRLLSSRFdiIIMS
KSHIYGLIGDPV~I
.
YCPRFFLDYLE11TGLLS>uLDl~7lVIKFPELETHFSYYPVBCFYJ1PNQ'IfLSI34DRYFPI
VTNPLKTAIFDHVOxLDASI1QLCESINTLVFRNOKILGYN1'DD~.IfAIC.iJ4QKNISVfIIK
HIA11GAGCiAAKAIAATi.fIMOGAfdJiITNRTLSSAAALJITLCKfRfAYPLGSL1~S1FRTIDIASVICtI'LDK
IxIFSLTPDLIHM.iLifNPWLLtIPSPSBFFItQGItLFTRVItIIVOALPB10C0 'INCLPPEVTFPWRFPPIVlIDINfKPNPSPYLERAQKNGSLIINCYGP'IEDALt4PU.WRt0!'i45NLIAIVRC1~
TVESr'L'IE~IBDRIUYQAVL'ISSPpC~rIUFIZSiVRVLPI'G' FPDFLTPE~DSFRNYVIDiIMAKV
DOIIALPFNTSTPOETLFSIRHFDEt.VG.TSKL&MLODGLLESIPLYNOLIIYf.I3GFEVL.
CO
CPn_1036 1191190 1189954 cetLlo47 lioosr 1iD13u arw-Dallyrowinau synehase dew-DShydroaipieounat. R.a~case cYDescRSCIILrNwrfmsel'uTTPHVVtmISNrFQLaa.FSSISTAYPLVIrravs vaoTaicPlt.otLlxiac.YwlvtTFPPaEPfa~ls~te:rraIaYOLVLroNISncssIICICDrasta~nssfenn svlccscxreKVIVSALEOSSEYTIaxrsRSSALTLe~wIllL~olrrv GTVLiIfIGFLdIATYCRCLPLYLIP1TITANVDTSICGLD~IGt1'~GII~Ri~01'FYLPKM'LOIStIPLLTXEWA
HLLISPKPLIIGTi'CiIfGtOCKSAHDSLEELTNIVWVYfrINRiLGAY
NCP~QFLSTLPREEyfIIHGiAEAIKtiGFIA~1YLWEFLNSHSKILL.FSSSOILNiFIKAI~QIIHKIIWIB.L.90 L.C~IPOFDIRIRITIBIRYWfDSLSGTAQDLi.DTIOpVI~BV00 TRISL~IIK
RDSSKKTIEVOSSR1R~I1QGIMlTIBS~OnvRNTVfERHVtCRCILSIt:<RdJCI'LX
I
S VL~CLLKK!'fDl!
KAAIVA~PYDRSLRKILNtCftsIAHAIElLAKGrVMKiOAVSVGlIIPOLYSIGDTLBL
P
TPOLIDOLDtLLKRFNLPSTLKDLpSIVPEHLLQiSLYSPENIIYT1QYDKKNL~tEfJOf.
O
INIEHI.t'rRA7IPFNGTYCASPNNEILYDILNSEOLVIRIHC
CPeLlO1 1201518 1201601 CPtI aad-ASparcate D.llydropenau ,-LIDERKC~IAVLGVDGLVGQKFVUiiIKWYRDiIIVIAEYVASNSKYCOSYOGCIt~pGIG
aroc-Chorismate smehase ' LHFSRGSRRSFLEELLATSYSRStiYLVKV~ISFGSLFSf'L'II~rESI~PSIGWIDGCPASNiIiTYRlIIIMII
PIIPJQNNRDLPIAKIEE110SDIWSFLPSSAESNEAYCLSQDIfVVR
.~.LELNESDFVPAfOtARRPCRiPGI'SSRXA4DIVOILSGV1f10GKTTt'.'l'PLSLOILNIWDSSIPEVNSOHF
OLLGOPYPGEIITSPNCCVSGITLALAPLRKFSLONVNIVTLOSAS~GY
' PYENSERLYRPGHSQYTYEKKFGIVDPNGOGRSSMETAiCRVAAGVVAEKlWIONIITLiiLN
PCVPSLDLLANTVSHIVCSdEKIL.RBfVICtLCSSKOPLPCKLSV'i'lIIIRVwJIYOf!!
AYLSSLCSLTLPHYLKISPELIHKIHTSPFYSPLPNEKIQEILTSLtitJOSDSIGCVISFIVTFfKDVDLDEiLYSYO
EKNIfEPPNTYQLYDNPNSPOARKIQ.Sl07t>t111V11LOPITYO~
TSPIHDFLCEPLFGKVHALLASAiIISIPAA%GFEIOKGFASAOIDIGSOYTDPFVIO~tIRTIKIIiVLIHNLVRGII
AIZ'LtJISNSiYFIDYLKRENCLR
TLKSNNCOGtLGCITICVPIEGRIAFKPTSSIKRPCR'nIrKTILlL111YRTPQ1GRHDPCV
1049 1101s86 1203911 CPft AIRAVPWFaMINLVLADLVLY0RC5KL _ lyaC-ASpas:eokinase III
EOfNSKIVriCFOCI'SiJITAlNICLVCDIICKDKPSPVVVSAIIIGVTDLLV~'CSSSLJtER
CPn _ EtYLRLtIEGKNEBIVImRJIIPFwSIIi'fSRLLPYLQNLEISDLDFARILSL.CEDISASLV
aroL-Shikiwee Kinase II
WKLELRNVM'ltt.~LPTSGKSSLGKALAKFLNLPFYDLDDLIVSNYSSALYSSSAEIYKRA11CSTROWDLGFLF~1R
SVILT~SYRRASPNL~IKAIiWtWL6LiJQPSYIIOGFIGS~
AYGDOKFSECEARILETLPPEDALISLt;Of.'rLNYEASYRAIOTRGAW!'LSVELPLIYERLCETVLLGRCGSOYSA
TLIAELARATEVRIY1'WNGIY1?IDPKVISI7JIQRIPEL8FEIlIp t.EKRGLPERIJIEAMCTKPLSEILTERIDRMCLIADYIPPVD11VDI1SSKSS4E0ASODLITNi.ASFCA1NL.YPP
t2FPCMPAGtPtFVTSTFDFEI~TWVYAVDKSVBYEPRIKALB<.SD
LT,>t$
YOSFCSVDYTVLCCDGLEEILGILESHCIDPELNIAOMJVStOT'VL4DODI
ISOEi1QE11LVD
VLSLSSVTRLNHSVALTCFIICONLSSPKWSTITEKLRGTOGPVFClCQSSIIALSf CPrt_1079 1194011 1191665 ELAEGZIEELlS4DY11KpKAIVAT
aroA-Phosphoshikimaee Vinyltrenaterase TE7WICAC 1050 1:0)981 1201798 P CPn O _ VCP!lILTYKVSPSSVYGNAFIPSSKSHTLRA1LWASVAEGKSIIYNYLDSdaDA-Dihydrodipicolinate Synthase KpHDAStKKFPOILEIVCNPLAIFPKYTLIDACNSGTVLRfM'ALACVFSKBI~1TCSS0 LORRPNAPL:.OALRNFCASFHFSSDKSVLPFTNSGPLRSAYSDVODSDSOFASJ1LJ1VACSGCKTKSYSRNVGRINH
LLTATVTPPPPNCTIDFASLERLLSFODJ1VONDWLLCSICIdiL
LAf7GPC.iF':fIEPKERPWFDLSLWWLEKLHLPYSCSOTI'YSFPGSSHPQ~'SY~'~FSSLTKKEKOALICFJH:D
LOtJIVPLFVCTSGTLLLEVLDWIHFCNOLPLSCFtIfl'l'PLYIIfP
S.\AFIAAAALW''K.it4PIRLRNLDILDtOGDKIFFSLII~fL.GASIOYtSIEEILVFPSSFSKLCGOIWFEiIVL
NAAKNPAILYNIPSRMTPLYLD'IVKAtaHHPpFtGIKDSOGS1IBBF
tX:StOMDCCLDALPILTVt.CCFADSPSNLYNARSNCDKESDRILAITBEt.QKMCACIOPTOSYKSLAPHIOLYCCD
DVFW.'EMAACCANGLtsV4SNAYIPEEAREYVLNP00pDYRSWf HDCLLVNP!;fLYr:AVLDSHDDtiRIAMALTR1.1LYASCDSRIHNTACVRKTFPNPVQ1'WLETCRW11Y1Z'fNPI
CIK.1IL.AYKKAITHAOLRLPt ;IEDFDLENVSPAVBSNLAfrPKf.RTS
NEARIEECHONY:a'NWSTNKRKVFARBSPC VFSYS
:Pn_1040 1191876 llJ4p7) CPn_LO51 120495b L205270 Nn rabu:lc htxnoloo present in \:anebank/FJABLNo robust fwmoloq present rn t;anelank/EfIBL
as of 1!/7/78 as of 11/7/99 RP::OSLFLRTWGPSSSFREHTVG1APLLYPRRRSPDYLFSPTGCPMST'CMtHPIHTASRFFM'PKSIOOLHLtIITt wFPVLKEtVd::NYW11AQWINTLSFt.ENSCaICKISASBNPTEVKEEVLKHAAEEFRHCtIYLHLATFICRCLILFL
TTLFLs7fICILHFITLPWICKEDPRILRKNK
KTQt.ikI: E?SLPDYTSKHLLGCLLTKYYLJILLDt~iTCRVLBNEYSLSCQTLK'1'AAYILV
'PlALELRA::ELYFLYHDILKF.1QSNITVK:iIILEE0CHL0EHERELKDLPtiCEELLCYACrPn_1051 1~O51D2 lZOnl6n :PECEU'.L~:fYERLFx'WIFDPS~'TFTKF No rottusc homolal Praswnc in r:enebank/ENOL
.rc of ll/7/9R
FF IQKMKYNSREK IK::ALR Iv.:..~.YC
ITVFRNNF :L.~.CYONI F'l::La,'YVFfaIPNS
ICR<R
In.l1 II',ei3pt IIId72" , :'.Fr.'PFIN:KKTEVETXEVKCKQETPP::L.FI:HMNKVAE.:FPYRkMLE:a'~'.~~Q~~IL:NLCA
':IY, _ t:NFLD:7~NL::ftNF:.:KEllli::"fIf'fR::K::nY410t:::EPFR'ITACC.1'I
I,trM.NLa>"t:yl:a.rlnv.min..H.Nntno-I-t7x.non.\naae.rVa.R::KI.A~aYEL
Nu s tu.c t . Hn: t.: r . c:. tTITACtGCII:RLKDV::D::I IR'fRAT::::
t L:VIK:::MUTRt'L::CTYY IVt:K.W
Pt.lFFFRLTSD
Id'HRLt'II'LISIKy'r:::laYI~FLVHFt.OP::EF~tSRK'fu.~.IILF1JCR:~1V.TY.F9YLRYl'x.KV
RROLKKKFRLF.f't.'KD
YN::fa:f IAL.W::Htl~'\:.INflY~t:LviKPNIIKdJIIKLLCfTMk7i::f~lf1):i:.Cfl.'7lxLWfIP!'f ~::,\Lp.':rl'IFIViH:fi:.\YL'iAEC:TRYGt:\t",.4M1'.NLIK:IK:IIPYITKKLCBOA~LLBIIV'L
t._UI'.n IW!r.ItH IIH.%111 tl'WFTIII;~:V.1:LV::KI.APt.1.t'Lt7LEtlFFP~DfX::.T::IF.tANY.IAVQWYNQtKIAKSHFVtY.
rmt~t::r ttrww,lnrl yr.:v>r.r in :.at..l.mklf7lfsl. .v:: .,t tl/7/,N
:L::NA'tlY:lI'LY:MC:t.V:P::ITtVPFIIDLFLP:::.T(MPY'fr:Kf:RLAfA011KTVFSB3NIKK:Y:I
illlf'AKIIAINIILYLTt'l~:IdI:VN:LIV.'17x.'IdWY::I.:FA::11611.W::KPNt:L:X:EP
:~\I' I'lf:l 1.1.x. ~u M :NIIffNM:t.:kLAI'IJJD'fYY::1 V::LIN:LH'/f.'~f::I:Y:NIIINiN
:LKR f LKL1KIIYr Ntl: IADRI LT:ft 1 DMIIVAfW. I:.\VItRY.TIIFIIKBIPrT
xl'fCPLFA::Ef'fD I
InItlf 1::I4:f:Iyi:YI.I'LAf:M'fKBLIII?AFV:.(1l*NYAL.LIIr:IffFTt:NPflX:~aMLrLiL::.':KDY
APF.~.I.TARE:IJIIaE7tiJlDffFYf/::LVl.~tx~r::~yrwrrKTNIxPprRlu:.7lKIr:F' in:fl::1.aml.~:'h.Nlf:la'IN:F.FtMAl8:::1.tJ('ROfYiI:I"JIJU.ffIPAFd'IY:YF30YROII
LNEL:NK::AF
izi SO
ipt3-Triosv0noleDnat~ Grass III'nd 1307O1U 12091br; IsCRE~71RIKFReiICEJIKNTR:'.
iLa.RiliIQIKTLCL1ICE11~..n1.:;.:.~CEF:.:~'I:IA
:Pn _ 5P!'f..~LMINEVIM'AI'JR!'Bwltifl~~l~lLStilll'~?~w:.Pl4rt~"' Nn rnoucc hoslolop pressnc m umebmk/DtBLR1ERR
as of 11/7/98 ::RwIOiRFtNOVLLSPOLPPPPOHSVCSIS3P5KLRVIJIITFLYPCNttt.I~JILFLTiGIHCP~AP'IASRVKlY
ApAC4YPV11E4mlSLlYRiGKAII~nIR
SE
Pt:L:aMISPGIGIGISAIICGVIJtI".tCLLCLLVKRELPIYRPEEIPELYSLAPSECPJIiQAPLIAYEPVWAICI
CKVAEAiDVODIHNPCREWAERPSF~ITAEEISI::ICwi~KYDNAQR
WKTLApLPKEL00LD'tDZOEVFACLRKLKDSKYESRSFLNDAIfK6LRVPDt~lllLp'fLSEHOC.it7NDCLi.VC
~SLEGOSF!'EVAKNFNV
IFELROIYAOtCNDIJtFLILI;GRSLl4t'tAFSESLDGFINSKRLCYLP9CDVRG~i.KKSA
!ri!t'/IIPr~IJN:LrtHIIVAYAPDRN.iYIIIMEKAFAKALIiALEE3VYNSL'MSYRDKFLGSECpn_tOK1 i22071n 1~309a5 ' ' ~
.:al:i.:.::LLvIX.:::16'FY.LiI:nn~7.YIVFIirh':t1.h'I:EGT:F:::7NL4:Did~ST. .":.
. .,.,;: : :.."..,.,,:v.:
.
..
:
r :
..n:~ ~,., :..."r.,."i,. . ..-..v..yrp,:F.Y.Nt.:.c.'EF.~HAppLY, . , ,.
...\I;.". .
. h......tt...= ... ...rntaNrw:v:.;:'.,,....~,_,,...
.., -.;,y: t"::
........~~!r--...~,;.._.~y..,...
IVROKYOQEF~CRLCIfiiiALYPCVSVSIR~IKIOETRSNL6KAYFJIItatIfRCCVRE
~p,~~pl~,ptZLS ZLKS,t~ CPt1~1065 1221110 :::0928 TAEVtI~RCILS011ESRLtIVFIOVKiMPCRIECIEKT(JWA6LPLLPTKKAtEKACSOYNSNo robust homoloq Present m Getlebsnk;C~BL
as of 11/7J98 GC4.EKVKPYGKESLAYVTSKERLVSLD6aLRRAYTECQKRPOC~.ESEVRACREOLIWRIL:RNRRTSDPCTLfIfFS
IPEFSLPPDSCRL~IOItPKNEIILPSILtxKPIIOYLKZTSI
RCRIOEFClOGLOL ~Y Y~
y YIIEERlGIKEKIILYGTfIIVAT
OORVAAfFSIEVpEIPGPtFICPSLLDKARSLPTREOHTCPell086 1221132 1:211 No robust t>omoloQ Dreamt in GenebanklE!~L
as of 11;7198 SNSWCEIGItI'VLIYAFLFIFLILCYiLCCLILVOESKSIGLCSSFCVDSCDSVI~GIISTP
CPO
,-DILIGM'SiuCAYAFCZGCLL'SFSTNLI~KIILDAKEFLLPAAECSDTpASSISVGOES
No robust homolop presort in Gsnsbmk/00L
as of 11/7/9 CKYLYHIiSYPPPPDNSNGAf'FCLSKFRVWITFLVI.CrIILFLISG71LFLTIGI9CLSAAIS
FCLfJIGLSALGGVLWSCLtGi.LAtOtEVPCVRpEEZpixVBVApSEEPALQATOKTIaOLCP1~1067 1221675 PKEt~OL~tYIOEVYSCLGttLItDLRCEDt7GLLItORKFJG.OYIrDAMItDOf1'EIVG.OOIHdeE-Polypspcide OatosRylase OpELyIYLKCLIOEtOtDIGSTLFHSQVSLFKWFWIrG7fLPSGDURGERWISAR,6VIlORFIIIQVLWRDFPTEL~0 711IVQ1ItIRRLEYYCSPILRKKSSPIAEITDEIRNi.VSt>IICDZItEA
RRICDTRIfVAM'FDRN71YGVAXT11P
EYIGITKILIlDBNRGIA7W1P0YCiQiVSLFVNCVORFIEDCELIFSESPRVFINPVLSDPSETPII~ItGCL
E7CILRICYLEIRR SIPCLRCEVFRP~fIl'VTAl07Lt~KIITCNLDGE'fARI
INHLTpNLNGYLYIDLNEiPKD
pKKFKI4RLEIIIIOtJtYNI'NLZ;Xl~.VS
CPttr1056 1210182 121122 No robust hosolop presort in Gmvbank/D~LCPr>'1065 1223267 1222365 as of 11/7/9 CEDIKDNtSRVEEI~l4.RVIELPLLPIKC~Ai.EKAIYpYNSYKAKLTKVCPCFRESPAYIrnh1-Ribonuelvase NII
TSEERtASLOp'1'LERJ1YKEYpKRFQEPSRLFSI~PPPFVKLT?SAOtdILRDOLKEKtiFIF50P0It1YFQARSN
MCTLYPSCKLYIOG
IFVSWLFRKHVSCLVSTVNVP
IYSKVAKAFPS4.KGSEEPIEFFLEPEZLNiPTlIARVDpDLRPNLGVDESCKCOFFGPLCIAAVYASNABILK
KETLEIG1KJ1PREEtYWLILEERKSKFJ(RLI1NKIEA71QORVKt%.CPPPIKE1'dfpKRKKEKLYCtKVODSK41 IJ071'KIASIdRIIRSLCVCDItIILYPiKYNELYCKIOt~KM'LLi1W11HA
YSFFIALKS
TVI~.APKPAGwP'AISDOFJN1SEYTLLIL1I.~Ga"rDITI,IOKPRAEODVWAAASIL71 RDAFVOSIOKLEEOYOVOLP10GIIfiINVKJU1GREIAKpRGKELLAKISKT1IFKTFf%ICSG
CPn_1057 1211167 1213596 K
CT356 hypocheeical Dsocein IINFYFFNFANPEPLY1T0Q.ITnLSPYLLLYAiITPVNWYPWCAFJ1!'NIMIENKPVFISICPtI,-1069 GCKNSRNCpVlGpESYTNPE AIC.YGDt.At5ILAVSGdfQYt9A-HTN Trmseripcional Rpulaeor ETVSIiPLNVILTPOLVPFFSVNYLONEGKLGf.PSPPOZIDKLiFl6IE011EEREALVD2ANVIIQGtINKt<.LN~
rEIFRSSRESOSLSLIG7VG71TSIRYSCLFJIIEpOCLCKLISPVYA
KVLEIASFLEGCVRKEILDESSLIGtTVAALYODIDPtMDGVKAFPKRLPGLLLOFILRYSOGFIKKYJ1TYLGLDGDS
IL00IPYVMIIFXEFSDt0~91EfILLDLESIGG'RNSPERAINSItS
IGGGVYSYTI<7DIOd.IPAFa~RLIDIIiItJ1Ai31YIdIJRi7YGLIIZOCI1MIWLL'.6GFSIF
LFAWICIGKaYRGICKpILSYILSELYSP1VCAFYSSE011DRIZ11G00ER''Y911SVEEZS
.
NAiGtDAEIPCDYYDISRECFPNGRItILNIPVNREIEELS1DCY11RSItAIEDIVDRSRDI1225523 1221114 CPr>'1070 LKGIMOR~SmSKDtlLSLTtMifBSIIYTFAYAGRLLGEVEYIEICKI~GtPVIINSLYIQIHNo /robust homolop pswmt 1n Gmebmk/ENBL
a of 11/7/!1 YESOCGSFWLSFAEQJIpEWLBPRSEEOC RPfIJlIFPCtiJaCYYRETPPPNPOG~IPLO
ZSL
FYSVOGRDSTLLIKGSPLSOGiTIS~01LI~.LSLHLITDWDILTYJIt4IL0IA0ACPiiCREIICiCFL06tI9K~D
CACCL
Atrl7(KFS51GLLIAS(S7YPSR10iVKVLIAiGDQE~tSPVLKCLSGLFLPYLSLI>~sf10fl~1ETVODPDNPSA
OFLQOLIOOYGPZCVGNtF00GPlICI'OICIEOGEPLG~1~ESI~iOCKL
OEIfLCfVLPCYEE1CLIPKGDCTAITI7fVi.LYDpCKRFKDLELFRR7fLISLHRELLKAAOPIIL7lCESL
VSE9AL5FYPStaIIPtC
WIIQPEppPCPPTPTDELpLOCAVOGAPAPppIGWP
CPtl..1058 1217742 1211536 LSLESGYIt3PLG0ANI0IVOLIKKSLKRLVASDLATfIGPGICLSLT~pVIMNLICLL
CT355 hypoeMCieal procsin SKGYLPLDPLNPEO'M.DPAl100PNORILRKVLV'1T111GZ<llIwRqI00GtR0itPIPIDP
EVIeQ.YpTLPGIVLVS7CCIFiL.75K;GYAAEVPVTSSGY82tLLESKEpOPSCIJ1INDRILWODD6IERDGlVDO
GGPGIPCQCLRfSiRKLPTEKItPNAWL
FKVDBENVYtALOVINKLNLLFYNSYPHLIDSFPAR80YYT11l1iPV11LiSVIt~Ft3NAD
AIUIIOtIATDPTAVNGEIECtQCR~.SPLYANFENSPNDIFNVIDR?L?AOIIVIKSSNIISKCPfIL1071 vhS.KYfPGKIREYYWtLEWSRKVIwKYRVGTIKANrE5Li1S0I11DIMWtLNWI191DNo robust harolo0 prnmc in Gmtbmk/OIBL
as of 11/7/9 KDRLTALVISOGGOLYCSEEFSR>2ISELS05HKOEL~.IGYPKCt~CGLP7GWKSCYIG.YIIKC'11'IN~CPNILS
YtPRlCCNFfICEANI:ViTI'EGTTRQSASDISEE11L'wRSOGAfIPITTO/1'KI
LGDKTSCSIEPLDVNESKIKQNLFALEAE5IILKpYKDRLRIOIYGYDASNIAKIiSEGPPTlfVO0tV0PNTApGDCS
I'IISIIpF.~.VDSILSHRRZ'pCCtEYCYD81LA'i~C~ROGSP
LFSLf.
CRLICGTYKACCLDRLDNpIIAGLVItECEpTIIGPIAYAL11AK1fGLNLIIELVIKNtILStE
QI~AQtICSFaKI'OLYQINQSLSONFFLEGVNSIRERGLDDSLVOAVLffIl1?RSii~fT
CPtI,-1059 1211118 1215678 IESPlJ15G1'SSAWi9TRIPACYZ1lX11'SPLTfSRLSCGSROJIRIIPSSVCAiPOYVAKKYND
kysA-Dialachyladmosine TranstsraseNDiiWOLGIIIW'Il~it.KTGDPSAiGPFCLLIV10~ISFLLSASOSTSSZLKH1'GGEICYTC
VTRSSPAOLSRFLSEIONKP7GLSLSQNFLVDQNIVKKIVATSEVIPOWVL6IGPCFGRI.PNFRDIWLLt4.AIGYCP
AM'DLTSWDIIMIDDPIhII'IFYRLOYSYR!'OKTSASFIJOGf TET:LIAIIGApVIAIEKDPNFAPSLCELPIRLEIZDACILYPLDOLOEYKTLGKGRWJ1NLPPSLVROffSLDCPTPA
ESVPLNSSLEEEDE~DDEDCNIJ1YQ0RILEGSCNL.pTLFLGIK
YHITfPLLTKLFLE7IPDFiiRTIfTVNVQDEVARRZV110PfxRDYGSLTIFLOFFADIHYAFIMO
IfVSASCFYPKPOVOSAVTNNKVIIETLPLSDEEIPVFII'LTRTAt'OORRKVLAHfIJIGLYP
KEOVEpALKELGLLWYRPEVLSWDYLALFNKNOAGCPn_1072 1227921 12235 No robust homolop Dresmt in Gmvbank/E!~L
as of 11/7/98 CPn_ID60 1217691 1215727 KKDYILIIANWCCWKONLKIOKKRNCVSWITYCJ1IVCFFNSADAApKKIDCIPIOILYSFT
~cs/tkt-Transkecolase KYSSYIJG~ICDASTIFC11DVORGLt.OtIRYLCSPCWOETRRRQLFKSLCdOSYGNO1LCEET
YXRILYIHITKVIfI'SSSCPLLOLILSPADLItKLSISOLPCLAEEIRYRIISVLSCIOCNLLAIDIFNNKDCLC&EI
PZONE71ILJWSSALVLGISSFCITGIPATLHSLLRt~M.SFpKRS
SSMGIVELTIALNYVFSSPKDKFIFDI~pTYPHKLLTGRNNBDFDNIRNDNGLSGF'1'NIASESFLLKIOSAPSDASV
FYKGVLFRCE'1'AIVDALSpLFAOLDLSPIGCIIFL..'EDPIW
PTESDNDLFFSGNIVLTALSW.GIUIQITPLESATItVIPII~GDAAFSCGLTLEAIl~NISTDOAVCSACIGWCi9Q( FIGLVYYPJ10ESLPSYVNPYSTATELOEAOGLQVISDLYAOLTIiJAL
LSKFWILN01~4~RISISKNWGANSRIFSRNLIIHPA'fNIG.TKOVFJWLAKIPRYC06LNilISPKNN
RRISpCVKNLPCP'fPLFEGFGLAYVCPIDGHNVKIfLIPiLOSVRNLPFPILVIIVCI'tIOCK
~LDpwOMdPAKYtIGYRANFNKRfSIUtHLpAIKPKPSFPDIFGOTtGELCEVSSRLNWTPCPn_107) t=9011 1NSIGSRLECPKQKFPERFFDVGIAEGNAVTFSAGIAIfMR~IPVICSIYSTFIJiRALDNVFPredicted OMP
IC'I'37l HDUC7pDLPVtFAIDRACWYGDCRSNtIGIYDNSFLMIIPQNIICOPRSQWFa0LLY5SMRRYLIMVGAL(:LYRAAPL
F~1WIKITDJWJ1VLKFAREKTLVCFNIED'1WFPKONNCOS
LNYISSP3AIRYPNIPAPNCDPLTGDPNFLRSPCHAETLSt3CEDVLIIALCTLCFTAGSIKAWLYNRELDLKTTISEE
pAREpAfLEWNGISFLVDYELV:ANLRNJLTCLSLKRSWVLCI
H0tL1YCI3ATlh'DPIFIKPFONDLFSLLLIiSNSKVITIEEIiSIRCCLIS'EFNNFVATFNSORPVIILIKM'LRI
LRSFNIOFTSCPAICEDCwtSNPTKDTfFDpAMAtFJWILPVGSLK
FKVDIWFAIPDTFLSHCSKE11LTKSIGLDES9!lINRILTHFNFRSKKQ111GDVItVNCOPNDAALEYLLSGIaSPP
SOIIYV000AERLRSIGAF~_KKANIYFICNLtf'fPAKpRVf :.'YNPKLTAIpWSOIRKNLSDEYYESLLSYVKSK
CPn_IOGI 1217932 1217666 C"330 hypothetical Drocein ' ' FI:SIINEIHNKDPSLKKLFAi.ppSLFfL.NSLSDIVATYEAMFSLIYECLNKALRKDQt.CY' LIa'1ltdSK.fLLKSPSCDPIVQTFPINPNN ' RNA SECTION
;:Pn_lUi.2 12191135 1.1815) . . . . . . . . . . . . . . . .
. . . .
><aw1-fxaJoxYriOOnuclease VII
Ix:FPM::.~.PCOIIVA:iLTERIKTLLESNtL'OILVK~EL::NV~LOP:X:HL1FCIKDSOAFWcmlsHA I
\N.t'r, 1 ~ND7A
:AFFtIFK::KYYDf:KPKDf:OAV I I14:KLAVYAPRI:QYq I VN IALVYArExDLL4KFEETKR
Id.TAFxYFXrl7:KKPLPFAPQCICViT:a"n;AVIpDILRVLRilxam.~l...v::. f NN:\ n.n71.11 :RRARNYKILYIPVIIroCN ,4'L.4o ::AAIIEI::KAIEVtfUIfNLIDVLI T.1R(3CC;:IEDLWAFNEEILVKAItIA:'rIPiVSA11G71E
'rDYTG.'hvA::WfNP'rP::AAAEtVt'.KC:EEf~)VFFY:'ILRHLL::II:;ROLLTCKK0f3LLPW1~..:
rICNA lUrun.r.d Itlll_ll.:
I!I! vLDfIAEFYTTIIt~(~LOa IE LA10Kl3V(~CK
I IIE.,Kr,INYDN I::RWLtX:DLYwPMlCRLOS
LKKNL:rJAL:YIKAI::WVRr:IIQLKK::LT1PR0It~11.:OKL::ISi~LDTLt~RRLIIYOKE.:s:: rNNA
Ino:4l'. Iml'..:'!Ir !:YF11KIIT1LKl W IN1II.EWLIt::IIVQKf.ELLCRNL::MX.'EIIJt4>NVK
IA1:WYKETLATI L
h:NNYllI::/ARY::ALKEWJI::WPKNVLKRUyANLFDFtif:Pt::AHL::VO::I.QeWIVRi9L0..: tNNA
114~'.t: Inn..ur I r:Ell LTIrt'H f R Ir:KL IKI:
~'Ar_t4a; t~l'~'N1U 1~~U71:
WO 00/27994 PCTNS99/Zb923 ' tlldAi . . . . . . . . . ~ . . . . . . . . .
CMUI 1 6aqln Erxt Type Codon t 99657 89728 Thr CCl' 2 90o9N 91070 TrD t:~:A
w:c ~~~~~- ~M~r 2ri075 294117 Val TI1C
6. 296151 296111 Asp GTC
7 109818 109921 Pro T0G
8 167111 162211 ArQ CCr 9 671=36 67231A Lw GJI
677161 677337 7tp TtC
11 739103 739186 Leu G1G
11 781912 781991 Lys T.T
836119 836191 Ala OOC
16 813926 813999 Pro ODG
17 877400 877473 Acq 11CC
18 10~3605 1085676 Cln T'1C
19 1112031 1112118 Ser TCA
1175163 1175911 Iwu TJ10 21 1230028 1229912 Ser C'aA
22 113?162 1137389 Val G11C
23 1030603 1D30533 Cys OC11 21 1000072 999919 Mls GTa 961607 961536 Gly GCC
26 A07113 807311 Arp TCT
27 7es7eo 7es7oa Thr car se 716971 71se99 Leu T1N
29 70AN1 708351 Bar OLT
68D~59 680178 Leu 6710 31 671115 631373 Phe G7N1 32 626987 626901 Her OGiI
33 293177 293105 Thr 'rC1' 34 293399 293317 Tyr CrA
269112 269070 Ala TGC
36 269065 268992 Ile C11T
37 161389 161318 Asn GTl' 38 87522 87150 llet GT
Contig463 Length: 273254..
85i TCGTTTGAGT AGCAGTCTAC GTTTTTTTCT TGCCACGCTT TTCCCAAAGG
1051 TCCAGCCACC AAAGCTCCTA AAGCTAAAGA AGCTAGGATT GCA.T~GAGTGG
2501 TTTAGAAGTC CTrt'GCTAAAA GTTTTTGAGA AATTTAAGAA ATTCGCAATA
3751 ATCTTGAGTT 'CCGTAGGAAT TGCTGTGAAT CGGAATTCAT TACCTTCAGA
WO 00/27994 PCTNS99/2b923 9401 .CATAAGCACT CGGGAGATTT CTTGTAGGGC TTTATGTGGA TCCGAGAGGT
11851 AGAA.AAAGTG'TCATCATAAG AAATCCTTAG ATAGGATAGT TTCTTAATTT
12001 GGGTCACTGA ACTTAGGGGT ACCATA'hCAT CGTATTCATA GTCCAGGTTA
12$51 TGATGAAGAA GCATAGAAAT TCAGAGCGCG GTCATCTCGG AAATAACGCA
14051 CAGCTGCTAG GGATTGGGAA ATATTTTCTT GTACTTCTGG AGAAGA~ATT
19951 AAAGCCAATT'GTACTAAAGA AGCTCTGTTA TTGCAACTCC TTGATCAGAG
20101 GTCGTTATCA ATAATTTCTA CATCAG~TTC TTCGATATGG TCTTCTGAAG
20651 TCGAAGGAGA CTTCGATTTG~AGGATGGCCT CGAGGAGCCG GAGGGATATC
22751 CTGAAGGATA GTTTAAATTT GTAGAAACTT TG~'GTCAGGG TTTCATTTAT
26801 GTTGCTATGA CPtACATCGTG TTTGTTTTGA TGTGGATAGT GAGACGGATT
28051 TGCTTGCAGG'GTCGGGCAGT TTTTCAAAGG GGAAAGCTAT GATCTTGCTG
28201 TATTTCAGAG GGTTGTTTTG GATGAT'IrCAT GGATCTTAGA GGTTAAAGTC
G'J071 I.HHIH1Vt11t1 l.1Vt11V1MV MVlltat~t~.ean saV,.V~.rm.aal. t~rv.ramava>rr WO 00/27994 PCTlUS99/26923 33901 ATGATCTGTT TCCATGGCTT TTAAAAAAGG GAAACAAGGT TGGTCTTCAA ' WO 00/27994 PCTNS99/2b923 36151 GACAAATAGA'GAAAGTTGCC AACGTGCTTT GCGGTTAACG TTACAAGATC
36301 CCTCAAGCTC CTGTTCATCT TCTTAT~ATT CCTAAAAAAC CTATACCACG
40201 CTTTAATAGC CAATACAGAG CGCAAAGTAA ACAC.TGTAAA AGGTAAAATT
WO 00/Z7994 PC'f/US99/26923 44251 TCTTTTCTGA.CAACCCGATG ATGCTTGTAG TAGTGAATCA GAAGAAGCAA
4430'1 ACCAAGAGTA ATGATATGGA GCAGAACGTA TCCTATGGTA CGTAGAATTC
WO 00/27994 PCTNS99/2b923 4505'1 TGTCCGTTCT TTTGATAGCA CCTTGGAGGT GATTATGGAA GTTCGTTATG
45501 ATGATTGTCA TTTTCCCACA GGTCGGATTG TGGGCTG'~GG TCCTCGGGTT
45601 GTCTCGTTTT AGGCAAGACT TTAGAACC'FA GTCGAGAAGC GACTCCTCCA
4?201 GTACATGGCC CCATTACTTC TTTATGGGCT TTGGAGCCCG TGGGTAAGGG
49251 CAGATCGAGC .TAGACCGTAG AATGGTCCGT GAGCGTATCC ATAAGCTGTC
99751 AAGTATCCCT .ATGAAATTAC GTTTGCTCTC TCCTCTTCCT GTATTGATTT
50601 GTCTCGAGCT TTTATTAAAG TTCAAGATGG CTGTA_~.TTCT TTTTGCTCGT
51001 GATTTTTTAG ATTGTGTAGA G,AAGTTCCGT GCTTCTGATC CTCGCTATGC
52351 GGTCGGAGCG~ACGTCTCTCT GGATCTAAGA GACTTTCTTT AGGAGAAACT
wo oom~4 PCT/US99/26923 WO 00/27994 PG"T/US99126923 55551 GAGAGCCTTT CTGCTTACTC AAGAP.AAAAA AGATCTTTTT GTAGACACCT
WO OOI27994 PCT/US99/Zb923 63151 TTATGATTGC'AGCTTGGATT GCTCCTCCTG AAGATTTTGC CTTGTTGTTA
68?O1 GCATAAGAGC ACCTCCCCAG TTTCCTTGAT TGTTGGTGAA ATATACGTGG
70001 ATTTCTGTGA GC,CAGACTAG GGGAACTCGG TGGTGGTTCT TCCAAGAAGC
71251 CGCACCATCT~TCAACAGCAA GGACTCCTTG ACGTAGTTCC GAAGTGTTCC
71951 GAATTTCCCT CTATATTTAG. AGAGTTCCCA CTATAAATCC CTCCTCCTAA
78001 GGGAGAACCC CAGAAGATCC C.TTCGTAGAT ATCACACCCA CAGAAATTGT
79351 GTGTGATAAA'ATCGTGGCAC AGAAGAACTT CTTATTTACT TTAGACGCTG
79501 TATTCAGAGT ATGGTCGGGA TTTTGGG'ATC TCAGAGAACC AAGAAAAGCT
79601 TGCTCTTCTG ATGATGACGA AGATGCAACA'GCAACTTCGA CCGCTACAGG
80051 TACCGTTCGA CAGCATTTTG TTAAGGCGTT TGATTTCTCT CGTCCCTTT~' 82151 TGCCTAGGAA TATCTTAATT TCACAAACCC CAC~GAGTGCA CAAACTCCTT
86201 GTGATCTCAG AC;CTTCCTGA AGGGCACCCC GATATTCGGA ATTTGCAGTT
$6851 GGGAGTGAAT TCTACACGAG AATCGAGAGG AGAGCGAGTC TTTCAAGGAT
87451 TCTTAAATGG'TGTGAACCAT TGAGAAGAAC ATCCTATCGG TAGGGAAACA
88151 GGGATCTAAG ATATAGGTAG~AACGCACGAG AGTGTCTGAA TTCCATACAG
90251 GAAGTATGGC .TGGATGCTTG TACTAGCAAA GTACTTCTGG TTAGATTATT
WO 00/27994 PC'f/US99/26923 91151 ATGGTAAACT CCCAAGTTCC TTGATACCCA TAGGGAGATT GCTGATAGCC ' 93901 TTCTTAAAGG.GGATTCCCTT AATTATAGTG TAGACTTAGA GATTATGGCG
94801 A.AATCCTCAC GTCATCTAGA AAGATGTAAG AAGTTCCTTC TGGATAACAG
95951 TTGTGAGCGA TAAAACAATC TTTGTCTCTA GCAAAGAGAT GGCAGAACGC~
96901 GGGATCATAG AAGAAAATTG TATGATTT~'T AGCAGCCCGT AATTCCGTGA
WO 00/27994 PCTNS99/2b923 98351 CTGGAATAGA CTATACTCTG ACAGGAGATA TA.T~CTCTGCA AAACCTTGGG
98501 GTGCTGAAGG CGCAGCACTT TCTGTTACAA CTGA'1'AAAAA '1'C'1'c~'1'C:c;c.'1'A
WO 00/27994 PCT/US99/2b9Z3 9 9 B 51 TACAGATV'1"1' (:(:A(iC:(iCi'1"1'C: l:'1'Hl:Hli'1'Hht_.
HHI,:'1'l.l.'1'HL V l.rW.1 r~ ~ vvv i 101651 TCGTAATCTC ATAAAP~AAGC AAACAGAAGC AGGTCTTATC TTTTTTACTG
101$51 AGTTTGTCAA AACTTTTGAG AAGGGAAATG CAAAAGCAAA ACAAACGATT
WO 00/27994 PCT/US99/Zb923 105301 TCCTGTGGAC TfiGATGGCTC CTGTTCCTGT GGTAGCATTC GTGGTTTGTA
110301 CGATCTCACT GTTCGCATCT GCTAAGTTTA AGTTCAATG't' GTCGGTAGAA
111751 TTCATCTTTA~TAAAAAGTAT GTTTTTCTAA GATTCTCGGA GAATCTTAGA
11180'1 AAGAATAACG AGTTCCACAG TTTGCATTAT AGCTTCTTGA GGAGCTGCGC
115801 AGGAGTAATT~GCAGATACAT TCGTAATAGA AACATCGCTA GTGAGAGTGT
11?601 ATTAGGCCCA TGAATTTTCA TTCATAGGAT ATATTTCATA CTATTATAAG
121301 CTCCTTAGAG GTTGCGATCC TCAAAAAGAT CAGAGCTA'~'T TTTTATCAGG
121451 GAA,AAAAAAG ATAGTACAGG CATTTGCTTT ATAGGGAAGC GCCCTTTTAA
124601 AAGGGAAAGT TATGCACAAA'CCTTTTGTAT ATGATACAAT AGTTCAGCTT
125$51 ATCGATGGGA AATATTGTAT TTTAGGTGGT ACCAATTTTG AAGAGTTTAT
130401 AfiGAAGGAGA TCCCCATTCC CTGGAGGTTT TTGGGAATAT CAGAGTAGGC
130651 AAAGAAGCTC TAAGATTTGC GfiGAATGCCG CAATCACCAC GATGAAAATA
132001 ATGGATGGGG~CCCCAAAGGC CGAATCCTGA TATAGGGAAG ATCAAAGCTT
132151 CGGAGGGCTT TCTTGATATT TCTCAAF~AAA TTCAATGGGA TTCAGATTTT
13?501 ATTCATTATC CAACATACAT TATCCTTGAA CAAATTGAAA GATACGAGAG
140101 GAGAGAGAAA~GCTGTTGCGT TCTCCTTTGA ACCGTTTAGA TACGAATCGT
140801 CTTATCTACT CCATATTCTT~TGGCTATGGG ATATAATATT TTGGCAACAG
143751 GACTAGGAAT CGCATGTCAG AAGTACTGGG P.AAAAACATT TTAGCTGCTA
146751 TGAGGGATGT TCAGGTGGAG CTTTGGGCAT GGCTGTAGG'T GATTCTGTAG
148201 ATGTGCTGCT~GCGGAGAGAT TTTATGAAGT CTTGAATCAC CCCGATCTTC
149551 CTTCCCCAAA CATGGACAAT ATAGAGAT~T AGATGGTAGT CAGTACAACG
IIS
164401 AAAGCAAGTT~ATTCCAAGAG AGTTTCATTC CTGTTGTCGG AATCGTTCCG
WO 00/27994 PCTNS9912b923 165751 GGCATAAAGA AGAGACCTTG CTTTGTAAC~C GTAGCTCTTC TGTACGTAAC
DEMANDES OU BREVETS VOLUMINEUX
LA PRESENTS PARTIE DE CETTE DEMANDS OU CE BREVET
COMPREND PLUS D'UN TOME. - .
CECI EST LE TOME _ ~"DE c~
NOTE: Pout les tomes additionels, veuillez contacter le Bureau canadien -des brevets :;,:
JUMBO APPLICATIONS/PATENTS
THiS SECTION OF THE APPLlCATIONIPATENT CONTAINS MORE
THAN ONE VOLUME
THIS IS VOLUME ~ '-OF _ .
WOTE: For additiona'1 volumes please contact'the Canadian Patent Off~cE ~ -_ .. . . . . ~ , ..'. , . _ .~. ,. 'w
Claims ()
Hide Dependent
translated from
1. An isolated nucleic acid encoding a C. pneumoniae protein as set forth in Table 3.
2. The isolated nucleic acid of Claim 1, wherein said nucleic acid has a nucleotide sequence of an open reading frame in SEQ ID NO:1.
3. A probe comprising a hybridizing fragment of an isolated nucleic acid according to Claim 2.
5. An isolated nucleic acid that hybridizes under stringent conditions to the nucleic acid sequence of Claim 2.
6. An expression cassette comprising a transcriptional initiation region functional in an expression host, a nucleic acid having a sequence of the isolated nucleic acid according to Claim 1 under the transcriptional regulation of said transcriptional initiation region, and a transcriptional termination region functional in said expression host.
7. A cell comprising an expression cassette according to Claim 6 as part of an extrachromosomal element or integrated into the genome of a host cell as a result of introduction of said expression cassette into said host cell, and the cellular progeny of said host cell.
comprising:
8. A method for producing a C. pneumoniae protein, said method growing a cell according to Claim 7, whereby said C. pneumoniae protein is expressed; and isolating said C. pneumoniae protein free of other proteins.
9. A purified polypeptide composition comprising at least 50 weight % of the protein present as a C. pneumoniae protein comprising an amino acid sequence of claim 1.
10. A monoclonal antibody binding specifically to the polypeptide of Claim 9.