CA2350775A1 - Chlamydia pneumoniae genome sequence - Google Patents

Chlamydia pneumoniae genome sequence Download PDF

Info

Publication number
CA2350775A1
CA2350775A1 CA002350775A CA2350775A CA2350775A1 CA 2350775 A1 CA2350775 A1 CA 2350775A1 CA 002350775 A CA002350775 A CA 002350775A CA 2350775 A CA2350775 A CA 2350775A CA 2350775 A1 CA2350775 A1 CA 2350775A1
Authority
CA
Canada
Prior art keywords
protein
hypothetical
hypothetical protein
nucleic acid
trna
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
CA002350775A
Other languages
French (fr)
Inventor
Richard Stephens
Wayne Mitchell
Sue Kalman
Ronald Davis
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of California
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Publication of CA2350775A1 publication Critical patent/CA2350775A1/en
Abandoned legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/195Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
    • C07K14/295Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Chlamydiales (O)
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K39/00Medicinal preparations containing antigens or antibodies
    • A61K2039/505Medicinal preparations containing antigens or antibodies comprising antibodies
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K38/00Medicinal preparations containing peptides
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K39/00Medicinal preparations containing antigens or antibodies

Abstract

<i>C. pneumoniae</i> genome sequence and analysis of the encoded polypeptides and RNAs are provided. The <i>C. pneumoniae</i> gene nucleic acid compositions find use in identifying homologous or related proteins and the DNA sequences encoding such proteins; in producing compositions that modulate the expression or function of the protein; and in studying associated physiological pathways.
In addition, modulation of the gene activity <i>in vivo</i> is used for prophylactic and therapeutic purposes, such as identification of cell type based on expression, and the like.

Description

DEMANDES OU BREVETS VOLUMINEUX

COMPREND PLUS D'UN TOME.
CECI EST LE TOME _ ~'DE c1 NOTE. Pour les tomes additionels, veuillez contacter le Bureau canadien des brevets THIS SECTION OF THE APPUCATION/PATENT CONTAINS MORE
THAN ONE VOLUME
THIS IS VOLUME -O>=
- . -WOTE_ For additional volumes please contact'the Canadian Patent Offfice :~. ..

CHLAMYDIA PNEUMONIAE GENOME SEQUENCE
CROSS-REFERENCES TO RELATED APPLICATIONS
The present application is related to 60/128,606, filed April 8, 1999 and 60/108,279, filed November 12, 1998, which are incorporated herein by reference.
STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER
FEDERALLY SPONSORED RESEARCH AND DEVELOPMENT
FIELD OF THE INVENTION
This invention relates to nucleic acids and polypeptides from Chlamydia pneumoniae and to their use in the diagnosis, prevention and treatment of diseases associated with C. pneumoniae.
BACKGROUND OF THE INVENTION
Chlamydiaceae is a family of obligate intracellular parasite with a tropism for epithelial cells lining the mucus membranes. The bacteria have two morphologically distinct forms, "elementary body" and "reticulate body". The elementary body is the infectious form, and has a rigid cell wall, primarily of crass-linked outer membrane proteins. The reticulate body is the intracellular, metabolically active form.
A unique developmental cycle between these two forms characterizes Chlamydia growth.
C. pneumoniae is a human respiratory pathogen that causes acute respiratory disease, and approximately 10% of community-acquired pneumonia.
Antibody prevalence studies have shown that virtually everyone is infected with C.
pneumoniae at some time, and that reinfection is common. In addition to respiratory disease, studies have shown an association of this organism with coronary artery disease.
It has been demonstrated in atherosclerotic lesions of the aorta and coronary arteries by immunocytochemistry and by polymerase chain reaction (Kuo et al. (1993) J
Infect Dis 167(4):841-849).
Recent reports have further demonstrated the presence of C. pneumoniae in the walls of abdominal aortic aneurysms (Juvonen et al. (1997) J Vasc Sure 25(3):499-505). Abdominal aortic aneurysms are frequently associated with atherosclerosis, and inflammation may be an important factor in aneurysmal dilatation.

C. pneumoniae may play a role in maintaining an inflammation and triggering the development of aortic aneurysms.
Muhlestein et al. (1996) JACC 27:1555-61, reported a differential incidence of Chlamydia species within the coronary artery wall of patients with ~ atherosclerosis versus those with other forms of cardiovascular disease. The extremely high rate of possible infection in patients with symptomatic atherosclerotic disease compared to the very low rate in patients with normal coronary arteries or coronary artery disease from chronic transplant rejection provides evidence for a direct link between the atherosclerotic process and Chlamydia infection. Because a history of chlamydial infection is so prevalent in the population, the issue of causality remains.
On a physiologic and pathologic level, abnormal interactions among endothelial cells, platelets, macrophages and lymphocytes may lead to a cascade of events resulting in acute endothelial damage, thrombosis and repair, chronically leading to the development of atheroma in blood vessels.
C. pneumoniae is related to other Chlamydia species, but the level of sequence similarity is relatively low. Very little is known about the biology of this organism, although it appears to be an important human pathogen. Allelic diversity and structural relationships between specific genes of Chlamydial species is described in Kaltenboeck et al. (1993) J Bacteriol 175(2):487-502; Gaydos et al. (1992) Infect Immun 60{12):5319-5323; Everett et al. (1997) Int J Syst Bacteriol 47(2):461-473;
and Pudjiatmoko et al. (1997) Int J Syst Bacteriol 47(2):425-431.
A number of studies have been published describing methods for detection of C. pneumoniae, and for distinguishing between Chlamydial species. Such methods include PCR detection (Rasmussen et al. (1992) ~Vlol Cell Probes 6(5):389-394;
Holland et al. (1990) J Infect Dis 162(4):984-987); a simplified polymerase chain reaction-enzyme immunoassay (Wilson et al. (1996) J Appl Bacteriol 80(4):431-438); sequence determination and restriction endonuclease cleavage (Herrmann et al. (1996) J
lin Micro io134(8):1897-1902).
Antigenic and molecular analyses of different C. pneumoniae strains is described in 3antos et al. (1997) J Clin Microbiol 35(3):620-623. Some genes of C.
pneumoniae have been isolated and sequenced. These include the Gro E operon (Kikuta et al. { i 99I ) Infect Immun 59( 12):4665-4669); the major outer membrane protein Perez et al. ( 1991 ) Infect Immun 59(6):2195-2199; the DnaK protein homolog (Kornak et al.
(1991) Infect Immun 59(2):721-725); as well as a number of ribosomal and other genes.
SUMMARY OF THE IIWENTION
This invention provides the genomic sequence of Chlamydia pneumoniae.
The sequence information is useful for a variety of diagnostic and analytical methods.
The genomic sequence may be embodied in a variety of media, including computer readable forms, or as a nucleic acid comprising a selected fragment of the sequence.
Such fragments generally consist of an open reading frame, transcriptional or translational control elements, or fragments derived therefrom. Proteins encoded by the open reading frames are useful for diagnostic purposes, as well as for their enzymatic or structural activity.
DEFIhIITIONS
The term "amino acid" refers to naturally occurring and synthetic amino acids, as well as amino acid analogs and amino acid mimetics that function in a manner similar to the naturally occurring amino acids. Naturally occurring amino acids are those encoded by the genetic code, as well as those amino acids that are later modified, e.g., hydroxyproline, 'y-carboxyglutamate, and 0-phosphoserine. Amino acid analogs refers to compounds that have the same basic chemical structure as a naturally occurring amino acid, i.e., an a carbon that is bound to a hydrogen, a carboxyl group, an amino group, and an R group., e.g., homoserine, norleucine, methionine sulfoxide, methionine methyl sulfonium Such analogs have modified R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic chemical structure as a naturally occurring amino acid. Amino acid mimetics refers to chemical compounds that have a structure that is different from the general chemical structure of an amino acid, but that functions in a manner similar to a naturally occurring amino acid.
Amino acids may be referred to herein by either their commonly known three letter symbols or by the one-letter symbols recommended by the ILTPAC-ILJB
Biochemical Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly accepted single-letter codes.
"Amplification" primers are oligonucleotides comprising either natural or analoQUe nucleotides that can serve as the basis for the amplification of a select nucleic acid sequence. They include, e.g., polymerase chain reaction primers and Iigase chain reaction oligonucleotides.
"Antibody" refers to an immunoglobulin molecule able to bind to a specific epitope on an antigen. Antibodies can be a polyclonal mixture or monoclonal.
Antibodies can be intact immunoglobulins derived from natural sources or from recombinant sources and can be immunoreactive portions of intact immunoglobulins.
Antibodies may exist in a variety of forms including, for example, Fv, Fab, and F(ab)Z, as well as in single chains. Single-chain antibodies, in which genes for a heavy chain and a light chain are combined into a single coding sequence, may also be used.
An "antigen" is a molecule that is recognized and bound by an antibody, e.g., peptides, carbohydrates, organic molecules, or more complex molecules such as glycolipids and glycoproteins. The part of the antigen that is the target of antibody binding is an antigenic determinant and a small functional group that corresponds to a single antigenic determinant is called a hapten.
"Biological sample" refers to any sample obtained from a living or dead organism. Examples of biological samples include biological fluids and tissue specimens.
Such biological samples can be prepared for analysis of the presence of C.
pneumoniae nucleic acids, proteins, or antibodies specifically reactive with the proteins.
The term "C. pneumoniae gene" shall be intended to mean the open reading frame encoding specific C. pneumoniae polypeptides, as well as adjacent 5' and 3' non-coding nucleotide sequences involved in the regulation of expression, up to about 2 kb beyond the coding region, but possibly further in either direction. The gene may be introduced into an appropriate vector for extrachromosomal maintenance or for integration into a host genome.
"Conservatively modified variants" applies to both amino acid and nucleic acid sequences. With respect to particular nucleic acid sequences, conservatively modified variants refers to those nucleic acids which encode identical or essentially identical amino acid sequences, or where the nucleic acid does not encode an amino acid sequence, to essentially identical sequences. Specifically, degenerate codon substitutions may be achieved by generating sequences in which the third position of one or more selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues (Batter et al., Nucleic Acid Res. 19:5081 (1991); Ohtsuka et al., J. Biol.
Chem. 260:2605-2608 (1985); Rossolini et al., Mol. Cell. Probes 8:91-98 (1994)). Because of the degeneracy of the genetic code, a large number of functionally identical nucleic acids encode any given protein. For instance, the codons GCA, GCC, GCG and GCU all encode the amino acid alanine. Thus, at every position where an alanine is specified by a codon, the codon can be altered to any of the corresponding codons described without altering the encoded polypeptide. Such nucleic acid variations are "silent variations,"
which are one species of conservatively modified variations. Every nucleic acid sequence herein which encodes a polypeptide also describes every possible silent variation of the nucleic acid. One of skill will recognize that each codon in a nucleic acid (except AUG, which is ordinarily the «nly codon for methionine, and TGG, which is ordinarily the only codon for tryptophan) can be modified to yield a functionally identical molecule.
Accordingly, each silen: variation of a nucleic acid which encodes a polypeptide is implicit in each describ :d sequence.
As to amino acid sequences, one of skill will recognize that individual substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein sequence which alters, adds or deletes a single amino acid or a small percentage of amino acids in the encoded sequence is a "conservatively modified variant" where the alteration results in the substitution of an amino acid with a chemically similar amino acid.
Conservative substitution tables providing functionally similar amino acids are well known in the art. Such conservatively modified variants are in addition to and do not exclude polymorphic variants, interspecies homologs, and alleles of the invention.
The following groups each contain amino acids that are conservative substitutions for one another:
1 ) Alanine (A), Glycine (G);
2) Serine (S), Threonine (T);
3) Aspartic acid (D), Glutamic acid (E);
4) Asparagine (N), Glutamine (Q);
5) Cysteine (C), Methionine (M);
6) Arginine (R), Lysine (K), Histidine (H);
7) Isoleucine (I), Leucine (L), Valine (V); and 8) Phenylalanine (F), Tyrosine (Y), Tryptophan (W).
see, e.g., Creighton, Proteins (1984)).

The terms "identical" or percent "identity," in the context of two or more nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that are the same or have a specified percentage of amino acid residues or nucleotides that are the same, when compared and aligned for maximum correspondence over a comparison window, as measured using one of the following sequence comparison algorithms or by manual alignment and visual inspection. This definition also refers to the complement of a test sequence, which has a designated percent sequence or subsequence complementarity when the test sequence has a designated or substantial identity to a reference sequence. For example, a designated amino acid percent identity of 95% refers to sequences or subsequences that have at least about 95% amino acid identity when aligned for maximum correspondence over a comparison window as measured using one of the following sequence comparison algorithms or by manual alignment and visual inspection. Such sequences would then be said to have substantial identity, or to be substantially identical to each other. Preferably, sequences have at least about 70% identity, more preferably 80% identity, more preferably 90-95%
identity and above. Preferably, the percent identity exists over a region of the sequence that is at least about 25 amino acids in length, more preferably over a region that is 50-100 amino acids in length.
When percentage of sequence identity is used in reference to proteins or peptides, it is recognized that residue positions that are not identical often differ by conservative amino acid substitutions, where amino acids residues are substituted for other amino acid residues with similar chemical properties (e.g., charge or hydrophobicity) and therefore do not change the functional properties of the molecule.
Where sequences differ in conservative substitutions, the percent sequence identity may be adjusted upwards to correct for the conservative nature of the substitution. Means for making this adjustment are well known to those of skill in the art. Typically this involves scoring a conservative substitution as a partial rather than a full mismatch, thereby increasing the percentage sequence identity. Thus, for example, where an identical amino acid is given a score of l and a non-conservative substitution is given a score of zero, a conservative substitution is given a score between zero and 1. The scoring of conservative substitutions is calculated according to, e.g., the algorithm of Meyers &
Miller, Computer Applic. Biol. Sci. 4:11-17 {1988) e.g., as implemented in the program PCIGENE (Intelligenetics, Mountain View, California, USA)..

For sequence comparison, typically one sequence acts as a reference sequence, to which test sequences are compared. When using a sequence comparison algorithm, test and reference sequences are entered into a computer, subsequence coordinates are designated, if necessary, and sequence algorithm program parameters are designated. Default program parameters can be used, or alternative parameters can be designated. The sequence comparison algorithm then calculates the percent sequence identity for the test sequences) relative to the reference sequence, based on the designated or default program parameters.
A comparison window includes reference to a segment of any one of the number of contiguous positions selected from the group consisting of from 25 to 600, usually about 50 to about 200, more usually about 100 to about 150 in which a sequence may be compared to a reference sequence of the same number of contiguous positions after the two sequences are optimally aligned. Methods of alignment of sequences for comparison are well-known in the art. Optimal alignment of sequences for comparison can be conducted, e.g., by the local homology algorithm of Smith & Watetman, Adv.
Appl. Math. 2:482 ( 1981 ), by the homology alignment algorithm of Needleman &
Wunsch, J. Mol. Biol. 48:443 (1970), by the search for similarity method ofPearson &
Lipman, Proc. Nat'l. Acad. Sci. USA 85:2444 (1988), by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, WI), or by manual alignment and visual inspection (see, e.g., Ausubel et al., supra).
One example of a useful algorithm is PILEUP. PILEUP creates a multiple sequence alignment from a group of related sequences using progressive, patrwise alignments to show relationship and percent sequence identity. It also plots a tree or dendogram showing the clustering relationships used to create the alignment.
PILEUP
uses a simplification of the progressive alignment method of Feng & Doolittle, J. Mol.
Evol. 35:351-360 (1987). The method used is similar to the method described by Higgins & Sharp, CABIOS 5:151-153 (1989). The program can align up to 300 sequences;
each of a maximum length of 5,000 nucleotides or amino acids. The multiple alignment procedure begins with the pairwise alignment of the two most similar sequences, producing a cluster of two aligned sequences. This cluster is then aligned to the next most related sequence or cluster of aligned sequences. Two clusters of sequences are aligned by a simple extension of the pairwise alignment of two individual sequences. The final alignment is achieved by a series of progressive, pairwise alignments.
The program is run by designating specific sequences and their amino acid or nucleotide coordinates for regions of sequence comparison and by designating the program parameters.
Using PILEUP, a reference sequence is compared to other test sequences to determine the percent sequence identity relationship using the following parameters: default gap weight (3.00), default gap length weight (0.10), and weighted end gaps. PILEUP can be obtained from the GCG sequence analysis software package, e.g, version 7.0 (Devereaux et al., Nuc. Acids Res. 12:387-395 (1984).
Another example of algorithm that is suitable for determining percent sequence identity (i.e., substantial similarity or identity) is the BLAST
algorithm, which is described in Altschul et al., J. Mol. Biol. 215:403-410 (1990). Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.govn. This algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short words of length W in the query sequence, which either match or satisfy some positive-valued threshold score T
when aligned with a word of the same length in a database sequence. T is referred to as the neighborhood word score threshold (Altschul et al, supra). These initial neighborhood word hits act as seeds for initiating searches to find longer HSPs containing them. The word hits are then extended in both directions along each sequence for as far as the cumulative alignment score can be increased. Cumulative scores are calculated using, for nucleotide sequences, the parameters M (reward score for a pair of matching residues;
always > 0) and N (penalty score for mismatching residues, always < 0). For amino acid sequences, a scoring matrix is used to calculate the cumulative score.
Extension of the word hits in each direction are halted when: the cumulative alignment score falls off by the quantity X from its maximum achieved value; the cumulative score goes to zero or below, due to the accumulation of one or more negative-scoring residue alignments; or the end of either sequence is reached. The BLAST algorithm parameters W, T, and X
determine the sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) uses as defaults a wordlength (W) of 11, an expectation (E) of 10, M=5, N=4, and a comparison of both strands. For amino acid sequences, the BLASTP
program uses as default parameters a wordlength (W) of 3, an expectation (E) of 10, and the BLOSLTM62 scoring matrix (see Henikoff & Henikoff, Proc. Natl. Acad. Sci.
USA
89:10915 ( 1989)).

The BLAST algorithm also performs a statistical analysis of the similarity between two sequences (see, e.g., Karlin & Altschul, Proc. Nat'l. Acad. Sci.
USA
90:5873-5787 (1993)). One measure of similarity provided by the BLAST
algorithm is the smallest sum probability (P(N)), which provides an indication of the probability by which a match between two nucleotide or amino acid sequences would occur by chance.
For example, a nucleic acid is considered similar to a reference sequence if the smallest sum probability in a comparison of the test nucleic acid to the reference nucleic acid is less than about 0.1, more preferably less than about 0.01, and most preferably less than about 0.001.
L O An indication that two nucleic acid sequences or polypeptides are substantially identical is that the polypeptide encoded by the first nucleic acid is immunologically cross ,-eactive with the antibodies raised against the polypeptide encoded by the second nucleic acid, as described below. Thus, a polypeptide is typically substantially identical to a second polypeptide, for example, where the two peptides differ I S only by conservative suostitutions. Another indication that two nucleic acid sequences are substantially identical is that the two molecules or their complements hybridize to each other under stringent conditions, as described below.
Another indication that polynucleotide sequences are substantially identical is if two molecules hybridize to each other under stringent conditions. Stringent 20 conditions are sequence dependent and will be different in different circumstances.
Generally, stringent conditions are selected to be about 5°C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the temperature (under defined ionic strength and pH) at which 50% of the target sequence hybridizes to a perfectly matched probe. Typically stringent conditions for a 25 Southern blot protocol involve hybridizing in a buffer comprising Sx SSC, 1% SDS at 65°C or hybridizing in a buffer containing Sx SSC and 1% SDS at 42°C and washing at 65°C with a 0.2x SSC, 0.1% SDS wash.
A "label" is a composition detectable by spectroscopic, photochemical, biochemical, immunochemical, or chemical means. For example, useful labels include 30 3zP, Iluorescent dyes, electron-dense reagents, enzymes (e.g., as commonly used in an ELISA), biotin, dioxigenin, or haptens and proteins for which antisera or monoclonal antibodies are available.
The term "nucleic acid" refers to deoxyribonucleotides or ribonucleotides and polymers thereof in either single- or double-stranded form. The term encompasses nucleic acids containing known nucleotide analogs or modif ed backbone residues or linkages, which are synthetic, naturally occurring, and non-naturally occurring, which have similar binding properties as the reference nucleic acid, and which are metabolized in a manner similar to the reference nucleotides. Examples of such analogs include, without limitation, phosphorothioates, phosphoramidates, methyl phosphonates, chiral-methyl phosphonates, 2-O-methyl ribonucleotides, peptide-nucleic acids (PNAs).
Unless otherwise indicated, a particular nucleic acid sequence also implicitly encompasses conservatively modified variants thereof (e.g., degenerate codon substitutions) and complementary sequences, as well as the sequence explicitly indicated.
The term nucleic acid is used interchangeably with gene, cDNA, mRNA, oligonucleotide, and polynucleotide.
As used herein a "nucleic acid probe or oligonucleotide" is defined as a nucleic acid capable of binding to a target nucleic acid of complementary sequence through one or more types of chemical bonds, usually through complementary base pairing, usually through hydrogen bond formation. As used herein, a probe may include natural (i.e., A, G, C, or T) or modified bases (7-deazaguanosine, inosine, etc.). In addition, the bases in a probe may be joined by a linkage other than a phosphodiester bond, so long as it does not interfere with hybridization. Thus, for example, probes may be peptide nucleic acids in which the constituent bases are joined by peptide bonds rather than phosphodiester linkages. It will be understood by one of skill in the art that probes may bind target sequences lacking complete complementarity with the probe sequence depending upon the stringency of the hybridization conditions. The probes are preferably directly labeled as with isotopes, chromophores, lumiphores, chromogens, or indirectly labeled such as with biotin to which a streptavidin complex may later bind. By assaying for the presence or absence of the probe, one can detect the presence or absence of the select sequence or subsequence.
A labeled nucleic acid probe or oligonucleotide is one that is bound, either covalently, through a linker, or through ionic, van der Waals or hydrogen bonds to a label such that the presence of the probe may be detected by detecting the presence of the label bound to the probe.

"Pharmaceutically acceptable" means a material that is not biologically or otherwise undesirable, i.e., the material can be administered to an individual along with a Chlamydia antigen without causing any undesirable biological effects or interacting in a deleterious manner with any of the other components of the pharmaceutical composition.
The terms "polypeptide," "peptide" and "protein" are used interchangeably herein to refer to a polymer of amino acid residues. The terms apply to amino acid polymers in which one or more amino acid residue is an analog or mimetic of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers.
The phrase "specifically or selectively hybridizing to," refers to hybridization between a probe and a target sequence in which the probe binds substantially only to the target sequence, forming a hybridization complex, when the target is in a heterogeneous mixture of polynucleotides and other compounds.
Such hybridization is determinative of the presence of the target sequence.
Although the probe may bind other unrelated sequences, at least 90%, preferably 95% or more of the hybridization complexes formed are with the target sequence.
The term "recombinant" when used with reference to a cell, or nucleic acid, or vector, indicates that the cell, or nucleic acid, or vector, has been modified by the introduction of a heterologous nucleic acid or the alteration of a native nucleic acid, or that the cell is derived from a cell so modified. Thus, for example, recombinant cells express genes that are not found within the native (non-recombinant) form of the cell or express native genes that are otherwise abnormally expressed, under expressed or not expressed at all.
The phrase "specifically immunoreactive with", when referring to a protein or peptide, refers to a binding reaction between the protein and an antibody which is determinative of the presence of the protein in the presence of a heterogeneous population of proteins and other compounds. Thus, under designated immunoassay conditions, the specified antibodies bind to a particular protein and do not bind in a significant amount to other proteins present in the sample. Specific binding to an antibody under such conditions may require an antibody that is selected for its specificity for a particular protein. A variety of immunoassay formats may be used to select antibodies specifically immunoreactive with a particular protein and are described in detail below.

The phrase "substantially pure" or "isolated" when referring to a Chlamydia peptide or protein, means a chemical composition which is free of other subcellular components of the Chlamydia organism. Typically, a monomeric protein is substantially pure when at least about 85% or more of a sample exhibits a single polypeptide backbone. Minor variants or chemical modifications may typically share the same polypeptide sequence. Depending on the purification procedure, purities of 85%, and preferably over 95% pure are possible. Protein purity or homogeneity may be indicated by a number of means well known in the art, such as polyacrylamide gel electrophoresis of a protein sample, followed by visualizing a single polypeptide band on a polyacrylamide gel upon silver staining. For certain purposes high resolution will be needed and HPLC or a similar means for purification utilized.
DETAILED DESCRIPTION
The present invention provides the nucleotide sequence of the C.
pneumoniae genome SEQ ID NO: 1 or a representative fragment thereof, in a form which can be readily used, analyzed, and interpreted by a skilled artisan. As used herein, a "representative fragment" of the nucleotide sequence depicted in SEQ ID NO: 1 refers to any portion which is not presently represented within a publicly available database.
Preferred representative fragments of the present invention are open reading frames, expression modulating fragments, uptake modulating fragments, and fragments which can be used to diagnose the presence of G pneumoniae in sample. Using the information provided in the present application, together with routine cloning and sequencing methods, one of ordinary skill in the art will be able to clone and sequence all "representative fragments" of interest including open reading frames (ORFs) encoding a large variety of C. pneumoniae proteins. A non-limiting identification of such preferred representative fragments is provided in Tables 2 and 3.
Diasnostic use of C pneumoniae nucleic acids Hybridization-based assays Using the nucleic acids disclosed here, one of skill can design nucleic acid hybridization-based assays for the detection of C. pneumoniae. Any of a number of well known techniques for the specific detection of target nucleic acids can be used.
Exemplary hybridization-based assays include, but are not limited to, traditional "direct probe" methods such as Southern Blots, dot blots, in situ hybridization (e.g., FISH), PCR, and the like. The methods can be used in a wide variety of formats including, but not limited to substrate- (e.g. membrane or glass) bound methods or array-based approaches as described below. As noted above, this invention also embraces methods for detecting the presence of Chlamydia DNA or RNA in biological samples. These sequences can be used to detect Chlamydia in biological samples from patients suspected of being infected.
A variety of methods of specific DNA and RNA measurement using nucleic acid hybridization techniques are known to those of skill in the art (see Sambrook et al., supra).
In situ hybridization assays are well known (e.g., Angerer {1987) Meth.
Enrymol 152: 649). Generally, in situ hybridization comprises the following major steps:
(1) fixation of tissue or l;~iological structure to analyzed; (2) prehybridization treatment of the biological structure t ~ increase accessibility of target DNA, and to reduce nonspecific binding; (3) hybridizatic n of the mixture of nucleic acids to the nucleic acid in the biological structure or tissue; (4) post-hybridization washes to remove nucleic acid fragments not bound in the hybridization and (5) detection of the hybridized nucleic acid fragments. The reagent used in each of these steps and the conditions for use vary depending on the particular application.
In a typical in situ hybridization assay, cells are fixed to a solid support, typically a glass slide. If a nucleic acid is to be probed, the cells are typically denatured with heat or alkali. The cells are then contacted with a hybridization solution at a moderate temperature to permit annealing of labeled probes specific to the nucleic acid sequence encoding the protein. The targets (e.g., cells) are then typically washed at a predetermined stringency or at an increasing stringency until an appropriate signal to noise ratio is obtained.
The nucleic acids of this invention are particularly well suited to array-based hybridization formats. Arrays are a multiplicity of different "probe" or "target"
nucleic acids (or other compounds) attached to one or more surfaces (e.g., solid, membrane, or gel). In a preferred embodiment, the multiplicity of nucleic acids (or other moieties) is attached to a single contiguous surface or to a multiplicity of surfaces juxtaposed to each other.
In an array format a large number of different hybridization reactions can be run essentially "in parallel." This provides rapid, essentially simultaneous, evaluation of a number of hybridizations in a single "experiment". Methods of performing hybridization reactions in array based formats are well known to those of skill in the art (see, e.g., Pastinen (1997) Genome Res. 7: 606-614; Jackson (1996) Nature Biotechnology 14:1685; Chee (1995) Science 274: 610; WO 96/17958.
Arrays, particularly nucleic acid arrays can be produced according to a wide variety of methods well known to those of skill in the art. For example, in a simple embodiment, "low density" arrays can simply be produced by spotting (e.g. by hand using a pipette) different nucleic acids at different locations on a solid support (e.g. a glass surface, a membrane, etc.).
This simple spotting, approach has been automated to produce high density spotted arrays (see, e.g., U.S. Patent No: 5,807,522). This patent describes the use of an automated systems that taps a microcapillary against a surface to deposit a small volume of a biological sample. The process is repeated to generate high density arrays.
Arrays can also be produced using oligonucleotide synthesis technology. Thus, for example, U.S. Patent No. 5,143,854 and PCT patent publication Nos. WO 90/15070 and 92/10092 teach the use of light-directed combinatorial synthesis of high density oligonucleotide arrays.
Many methods for immobilizing nucleic acids on a variety of solid surfaces are known in the art. A wide variety of organic and inorganic polymers, as well as other materials, both natural and synthetic, can be employed as the material for the solid surface. Illustrative solid surfaces include, e.g., nitrocellulose, nylon, glass, quartz, diazotized membranes (paper or nylon), silicones, polyformaldehyde, cellulose, and cellulose acetate. In addition, plastics such as polyethylene, polypropylene, polystyrene, and the like can be used. Other materials which may be employed include paper, ceramics, metals, metalloids, semiconductive materials, cermets or the like.
In addition, substances that form gels can be used. Such materials include, e.g., proteins (e.g., gelatins), lipopolysaccharides, silicates, agarose and polyacrylamides. Where the solid surface is porous, various pore sizes may be employed depending upon the nature of the system.
In preparing the surface, a plurality of different materials may be employed, particularly as laminates, to obtain various properties. For example, proteins (e.g., bovine serum albumin) or mixtures of macromolecules (e.g., Denhardt's solution) can be employed to avoid non-specific binding, simplify covalent conjugation, enhance signal detection or the like. If covalent bonding between a compound and the surface is desired, the surface will usually be polyfunctional or be capable of being polyfunctionalized. Functional groups which may be present on the surface and used for linking can include carboxylic acids, aidehydes, amino groups, cyano groups, ethylenic groups, hydroxyl groups, mercapto groups and the like. The manner of linking a wide variety of compounds to various surfaces is well known and is amply illustrated in the literature.
For example, methods for immobilizing nucleic acids by introduction of various functional groups to the molecules is known (see, e.g., Bischoff (1987) Anal.
Biochem., 164: 336-344; Kremsky {1987) Nucl. Acids Res. 15: 2891-2910).
Modified nucleotides can be placed on the target using PCR primers containing the modified nucleotide, or by enzymatic end labeling with modified nucleotides. Use of glass or membrane supports (e.g., nitrocellulose, nylon, polypropylene) for the nucleic acid arrays of the invention is advantageous because of well developed technology employing manual and robotic methods of arraying targets at relatively high element densities. Such membranes are generally available and protocols and equipment for hybridization to membranes is well known.
Target elements of various sizes, ranging from 1 mm diameter down to 1 p,m can be used. Smaller target elements containing low amounts of concentrated, fixed probe DNA are used for high complexity comparative hybridizations since the total amount of sample available for binding to each target element will be limited.
Thus it is advantageous to have small array target elements that contain a small amount of concentrated probe DNA so that the signal that is obtained is highly localized and bright.
Such small array target elements are typically used in arrays with densities greater than 104/cmz. Relatively simple approaches capable of quantitative fluorescent imaging of 1 cmz areas have been described that permit acquisition of data from a large number of target elements in a single image (see, e.g., Wittrup (1994) Cytometry 16:206-213).
If fluorescently labeled nucleic acid samples are used, arrays on solid surface substrates with much lower fluorescence than membranes, such as glass, quartz, or small beads, can achieve much better sensitivity. Substrates such as glass or fused silica are advantageous in that they provide a very low fluorescence substrate, and a highly efficient hybridization environment. Covalent attachment of the target nucleic acids to glass or synthetic fused silica can be accomplished according to a number of known techniques (described above). Nucleic acids can be conveniently coupled to glass using commercially available reagents. For instance, materials for preparation of silanized glass with a number of functional groups are commercially available or can be prepared using standard techniques (see, e.g., Gait ( 1984) Oligonucleotide Synthesis: A
~ Practical Approach, IRL Press, Wash., D.C.). Quartz cover slips, which have at least 10-fold lower autofluorescence than glass, can also be silanized.
Alternatively, probes can also be immobilized on commercially available coated beads or other surfaces. For instance, biotin end-labeled nucleic acids can be bound to commercially available avidin-coated beads. Streptavidin or anti-digoxigenin antibody can also be attached to silanized glass slides by protein-mediated coupling using e.g., protein A following standard protocols (see, e.g., Smith (1992) Science 258: 1122-1126). Biotin or digoxigenin end-labeled nucleic acids can be prepared according to standard techniques. Hybridization to nucleic acids attached to beads is accomplished by suspending them in the hybridization mix, and then depositing them on the glass substrate for analysis after washing. Alternatively, paramagnetic particles, such as ferric oxide particles, with or without avidin coating, can be used.
A variety of other nucleic acid hybridization formats are known to those skilled in the art. For example, common formats include sandwich assays and competition or displacement assays. Hybridization techniques are generally described in Hames and Higgins (1985) Nucleic Acid Hybridization, A Practical Approach, IRL
Press;
Gall and Pardue (1969) Proc. Natl. Acad. Sci. USA 63: 378-383; and John et al.
(1969) Nature 223: 582-587.
Sandwich assays are commercially useful hybridization assays for detecting or isolating nucleic acid sequences. Such assays utilize a "capture"
nucleic acid covalently immobilized to a solid support and a labeled "signal" nucleic acid in solution.
The sample will provide the target nucleic acid. The "capture" nucleic acid and "signal"
nucleic acid probe hybridize with the target nucleic acid to form a "sandwich"
hybridization complex. To be most effective, the signal nucleic acid should not hybridize with the capture nucleic acid.
Detection of a hybridization complex may require the binding of a signal generating complex to a duplex of target and probe polynucleotides or nucleic acids.
Typically, such binding occurs through ligand and anti-ligand interactions as between a ligand-conjugated probe and an anti-ligand conjugated with a signal.

The sensitivity of the hybridization assays may be enhanced through use of a nucleic acid amplification system that multiplies the target nucleic acid being detected.
Examples of such systems include the polymerise chain reaction (PCR) system and the ligase chain reaction (LCR) system. Other methods recently described in the art are the nucleic acid sequence based amplification (NASBAO, Cangene, Mississauga, Ontario) and Q Beta Replicase systems.
Nucleic acid hybridization simply involves providing a denatured probe and target nucleic acid under conditions where the probe and its complementary target can form stable hybrid duplexes through complementary base pairing. The nucleic acids that do not form hybrid duplexes are then washed away leaving the hybridized nucleic acids to be detected, tyl:ically through detection of an attached detectable label. It is generally recognized that nucleic acids are denatured by increasing the temperature or decreasing the salt concentration of the buffer containing the nucleic acids, or in the addition of chemical agents, or the raising of the pH. Under low stringency conditions (e.g., low temperature and/or high salt and/or high target concentration) hybrid duplexes {e.g., DNA:DNA, RNA:RNA, or RNA:DNA) will form even where the annealed sequences are not perfectly complementary. Thus specificity of hybridization is reduced at lower stringency. Conversely, at higher stringency (e.g., higher temperature or lower salt) successful hybridization requires fewer mismatches.
One of skill in the art will appreciate that hybridization conditions may be selected to provide any degree of stringency. In a preferred embodiment, hybridization is performed at low stringency to ensure hybridization and then subsequent washes are performed at higher stringency to eliminate mismatched hybrid duplexes.
Successive washes may be performed at increasingly higher stringency (e.g., down to as low as 0.25 X SSPE-T at 37°C to 70°C) until a desired level of hybridization specificity is obtained.
Stringency can also be increased by addition of agents such as formamide.
Hybridization specificity may be evaluated by comparison of hybridization to the test probes with hybridization to the various controls that can be present.
In general, there is a tradeoff between hybridization specificity (stringency) and signal intensity. Thus, in a preferred embodiment, the wash is performed at the highest stringency that produces consistent results and that provides a signal intensity greater than approximately 10% of the background intensity. Thus, in a preferred embodiment, the hybridized array may be washed at successively higher stringency solutions and read between each wash. Analysis of the data sets thus produced will reveal a wash stringency above which the hybridization pattern is not appreciably altered and which provides adequate signal for the particular probes of interest.
Methods of optimizing hybridization conditions are well known to those of skill in the art (see, e.g., Tijssen (1993) Laboratory Techniques in Biochemistry and Molecular Biology, Vol. 24: Hybridization With Nucleic Acid Probes, Elsevier, N.Y.).
_LabelinQ and detection of nucleic acids.
In a preferred embodiment, the hybridized nucleic acids are detected by detecting one or more labels attached to the sample or probe nucleic acids.
The labels may be incorporated by any of a number of means well known to those of skill in the art.
Means of attaching labels to nucleic acids include, for example nick translation or end-labeling (e.g. with a labeled RNA) by kinasing of the nucleic acid and subsequent attachment (ligation) of a nucleic acid linker joining the sample nucleic acid to a label (e.g., a fluorophore). A wide variety of linkers for the attachment of labels to nucleic acids are also known. In addition, intercalating dyes and fluorescent nucleotides can also be used.
Detectable labels suitable for use in the present invention include any composition detectable by spectroscopic, photochemical, biochemical, immunochemical, electrical, optical or chemical means. Useful labels in the present invention include biotin for staining with labeled streptavidin conjugate, magnetic beads (e.g., Dynabeads~), fluorescent dyes (e.g., fluorescein, texas red, rhodamine, green fluorescent protein, and the like, see, e.g., Molecular Probes, Eugene, Oregon, USA), radiolabels (e.g., 3H, lzsh 355,''~C, or 32P), enzymes (e.g., horse radish peroxidase, alkaline phosphatase and others commonly used in an ELISA), and colorimetric labels such as colloidal gold (e.g., gold particles in the 40 -80 nm diameter size range scatter green light with high efficiency) or colored glass or plastic (e.g., polystyrene, polypropylene, latex, etc.) beads. Patents teaching the use of such labels include U.S. Patent Nos. 3,817,837; 3,850,752;
3,939,350;
3,996,345; 4,277,437; 4,275,149; and 4,366,241.
A fluorescent label is preferred because it provides a very strong signal with low background. It is also optically detectable at high resolution and sensitivity through a quick scanning procedure. The nucleic acid samples can all be labeled with a single label, e.g., a single fluorescent label. Alternatively, in another embodiment, different nucleic acid samples can be simultaneously hybridized where each nucleic acid sample has a different label. For instance, one target could have a green fluorescent label and a second target could have a red fluorescent label. The scanning step will distinguish cites of binding of the red label from those binding the green fluorescent label. Each nucleic acid sample (target nucleic acid) can be analyzed independently from one another.
Suitable chromogens which can be employed include those molecules and compounds which absorb light in a distinctive range of wavelengths so that a color can be observed or, alternatively, which emit light when irradiated with radiation of a particular '- wave length or wave length range, e.g., fluorescers.
Desirably, fluorescers should absorb light above about 300 nm, preferably about 350 nm, and more preferably above about 400 nm, usually emitting at wavelengths greater than about 10 nm higher than the wavelength of the light absorbed. It should be noted that the absorption and emission characteristics of the bound dye can differ from the unbound dye. Therefore, when referring to the various wavelength ranges and characteristics of the dyes, it is intended to indicate the dyes as employed and not the dye which is unconjugated and characterized in an arbitrary solvent.
Fluorescers are generally preferred because by irradiating a fluorescer with light, one can obtain a plurality of emissions. Thus, a single label can provide for a plurality of measurable events.
Detectable signal can also be provided by chemiluminescent and bioluminescent sources. Chemiluminescent sources include a compound which becomes electronically excited by a chemical reaction and can then emit light which serves as the detectable signal or donates energy to a fluorescent acceptor. Alternatively, luciferins can be used in conjunction with luciferase or lucigenins to provide bioluminescence.
Spin labels are provided by reporter molecules with an unpaired electron spin which can be detected by electron spin resonance (ESR) spectroscopy. Exemplary spin labels include organic free radicals, transitional metal complexes, particularly vanadium, copper, iron, and manganese, and the like. Exemplary spin labels include nitroxide free radicals.
The label may be added to the target (sample) nucleic acids) prior to, or after the hybridization. So called "direct labels" are detectable labels that are directly attached to or incorporated into the target (sample) nucleic acid prior to hybridization. In contrast, so called "indirect labels" are joined to the hybrid duplex after hybridization.
Often, the indirect label is attached to a binding moiety that has been attached to the target nucleic acid prior to the hybridization. Thus, for example. the target nucleic acid may be biotinylated before the hybridization. After hybridization, an avidin-conjugated fluorophore will bind the biotin bearing hybrid duplexes providing a label that is easily detected. For a detailed review of methods of labeling nucleic acids and detecting labeled hybridized nucleic acids see Laboratory Techniques in Biochemistry and Molecular Biology, Vol. 24: Hybridization With Nucleic Acid Probes, P. Tijssen, ed.
filsevier, N.Y., ( 1993)).
Fluorescent labels are easily added during an in vitro transcription reaction. Thus, for example, fluorescein labeled UTP and CTP can be incorporated into the RNA produced in an in vitro transcription.
The labels can be attached directly or through a linker moiety. In general, the site of label or linker-label attachment is not limited to any specific position. For example, a label may be attached to a nucleoside, nucleotide, or analogue thereof at any position that does not interfere with detection or hybridization as desired.
For example, certain Label-ON Reagents from Clontech (Palo Alto, CA) provide for labeling interspersed throughout the phosphate backbone of an oligonucleotide and for terminal labeling at the 3' and 5' ends. As shown for example herein, labels can be attached at positions on the ribose ring or the ribose can be modified and even eliminated as desired.
The base moieties of useful labeling reagents can include those that are naturally occurring or modified in a manner that does not interfere with the purpose to which they are put. Modified bases include but are not limited to 7-deaza A and G, 7-deaza-8-aza A
and G, and other heterocyclic moieties.
It will be recognized that fluorescent labels are not to be limited to single species organic molecules, but include inorganic molecules, mufti-molecular mixtures of organic and/or inorganic molecules, crystals, heteropolymers, and the like.
Thus, for example, CdSe-CdS core-shell nanocrystals enclosed in a silica shell can be easily derivatized for coupling to a biological molecule (Bruchez et al. (1998) Science, 281:
2013-2016). Similarly, highly fluorescent quantum dots (zinc sulfide-capped cadmium selenide) have been covalently coupled to biomolecules for use in ultrasensitive biological detection (Warren and Nie (1998) Science, 281: 2016-2018).
AmQ,ification-based assays.
In another embodiment, amplification-based assays can be used to detect nucleic acids. In such amplification-based assays, the nucleic acid sequences act as a template in an amplification reaction (e.g. Polymerase Chain Reaction (PCR).
Detailed protocols for quantitative PCR are provided in Innis et al. ( 1990) PCR
Protocols, A Guide to Methods and Applications, Academic Press, Inc. N.Y.).
Other suitable amplification methods include, but are not limited to ligase chain reaction (LCR) (see Wu and Wallace (1989) Genomics 4: 560, Landegren et al.
(1988) Science 241: 1077, and Barringer et al. (1990) Gene 89: 117, transcription amplification (Kwoh et al. (1989) Proc. Natl. Acad. Sci. USA 86: 1173), and self sustained sequence replication (Guatelli et al. ( 1990) Proc. Nat. Acad. Sci.
USA 87:
1874).
Detection;. of C. pneumoniae gene expression The nucl:;ic acids of the invention can also be used to G pneumoniae detect gene transcripts. Methods of detecting and/or quantifying gene transcripts using nucleic acid hybridization techniques are known to those of skill in the art (see Sambrook et al. supra). For example , a Northern transfer may be used for the detection of the desired mRNA directly. In brief, the mRNA is isolated from a given cell sample using, for example, an acid guanidinium-phenol-chloroform extraction method. The mRNA
is then electrophoresed to separate the mRNA species and the mRNA is transferred from the gel to a nitrocellulose membrane. As with the Southern blots, labeled probes are used to identify and/or quantify the target mRNA.
In another preferred embodiment, the gene transcript can be measured using amplification (e.g. PCR) based methods as described above for directly assessing copy number of the target sequences.
Expression of C. Dneumoniae proteins The nucleic acids disclosed here can be used for recombinant expression of the proteins. In these methods, the nucleic acids encoding the proteins of interest are introduced into suitable host cells, followed by induction of the cells to produce large amounts of the protein. The invention relies on routine techniques in the field of recombinant genetics, well known to those of ordinary skill in the art. A
basic text disclosing the general methods of use in this invention is Sambrook et al., Molecular Cloning, A Laboratory Manual (2nd ed. 1989).
Standard transfection methods are used to produce prokaryotic, mammalian, yeast or insect cell lines which express large quantities of the desired polypeptide, which is then purified using standard techniques (see, e.g., Colley et al., J.
Biol. Chem. 264:17619-17622, 1989; Guide to Protein PuriJZCation, supra).
The nucleotide sequences used to transfect the host cells can be modified to yield Chlamydia polypeptides with a variety of desired properties. For example, the polypeptides can vary from the naturally-occurring sequence at the primary structure level by amino acid, insertions, substitutions, deletions, and the like. These modifications can be used in a number of combinations to produce the final modified protein chain.
The amino acid sequence variants can be prepared with various objectives in mind, including facilitating purification and preparation of the recombinant polypeptide. The modified polypeptides are also useful for modifying plasma half life, improving therapeutic efficacy, and lessening the severity or occurrence of side effects during therapeutic use. The amino acid sequence variants are usually predetermined variants not found in nature but exhibit the same immunogenic activity as naturally occurring protein. In general, modifications of the sequences encoding the polypeptides may be readily accomplished by a variety of well-known techniques, such as site-directed mutagenesis (see Gillman & Smith, Gene 8:81-97 (1979); Roberts et al., Nature 328:731-734 (1987)). One of ordinary skill will appreciate that the effect of many mutations is difficult to predict. Thus, most modifications are evaluated by routine screening in a suitable assay for the desired characteristic. For instance, the effect of various modifications on the ability of the polypeptide to elicit a protective immune response can be easily determined using in vitro assays. For instance, the polypeptides can be tested for their ability to induce lymphoproliferation, T cell cytotoxicity, or cytokine production using standard techniques.
The particular procedure used to introduce the genetic material into the host cell for expression of the polypeptide is not particularly critical. Any of the well known procedures for introducing foreign nucleotide sequences into host cells may be used. These include the use of calcium phosphate transfection, spheroplasts, electroporation, liposomes, microinjection, plasmid vectors, viral vectors and any of the other well known methods for introducing cloned genomic DNA, cDNA, synthetic DNA
or other foreign genetic material into a host cell (see Sambrook et al., supra). It is only necessary that the particular procedure utilized be capable of successfully introducing at least one gene into the host cell which is capable of expressing the gene.

Any of a number of well known cells and cell lines can be used to express the polypeptides of the invention. For instance, prokaryotic cells such as E.
toll can be used. Eukaryotic cells include, yeast, Chinese hamster ovary (CHO) cells, COS
cells, and insect cells.
The particular vector used to transport the genetic information into the cell is also not particularly critical. Any of the conventional vectors used for expression of recombinant proteins in prokaryotic and eukaryotic cells may be used.
Expression - vectors for mammalian cells typically contain regulatory elements from eukaryotic viruses.
The expression vector typically contains a transcription unit or expression cassette that contains all the elements required for the expression of the polypeptide DNA
in the host cells. A typical expression cassette contains a promoter operably linked to the DNA sequence encoding a polypeptide and signals required for efficient polyadenylation of the transcript. The term "operably linked" as used herein refers to linkage of a promoter upstream from a DNA sequence such that the promoter mediates transcription of the DNA sequence. The promoter is preferably positioned about the same distance from the heterologous transcription start site as it is from the transcription start site in its natural setting. As is known in the art, however, some variation in this distance can be accommodated without loss of promoter function.
Following the growth of the recombinant cells and expression of the polypeptide, the culture medium is harvested for purification of the secreted protein. The media are typically clarified by centrifugation or filtration to remove cells and cell debris and the proteins are concentrated by adsorption to any suitable resin or by use of ammonium sulfate fractionation, polyethylene glycol precipitation, or by ultrafiltration.
Other routine means known in the art may be equally suitable. Further purification of the polypeptide can be accomplished by standard techniques, for example, affinity chromatography, ion exchange chromatography, sizing chromatography, HkS6 tagging and Ni-agarose chromatography (as described in Dobeli et al., Mol. and Biochem.
Parasit.
41:259-268 ( 1990)), or other protein purification techniques to obtain homogeneity. The purified proteins are then used to produce pharmaceutical compositions, as described below.
An alternative method of preparing recombinant polypeptides useful as vaccines involves the use of recombinant viruses (e.g., vaccinia). Vaccinia virus is grown in suitable cultured mammalian cells such as the HeLa S3 spinner cells, as described by Mackett et al., in DNA cloning Vol. IL~ A practical approach, pp. 191-211 (Glover, ed.).
Antibod~Production The proteins of the present invention can be used to produce antibodies specifically reactive with C pneumoniae antigens. If isolated proteins are used, they may be recombinantly produced or isolated from Chlamydia cultures. Synthetic peptides made using the protein sequences may also be used.
Methods of production of polyclonal antibodies are known to those of skill in the art. In brief, an immunogen, preferably a purified protein, is mixed with an adjuvant and animals are immunized. When appropriately high titers of antibody to the immunogen are obtained, blood is collected from the animal and antisera is prepared.
Further fractionation of the antisera to enrich for antibodies reactive to Chlamydia proteins can be done if desired (see Harlow & Lane, Antibodies: A Laboratory Manual ( 1988)).
Polyclonal antisera are used to identify and characterize Chlamydia in the tissues of patients using, for instance, in situ techniques and immunoperoxidase test procedures described in Anderson et al. JA VMA 198:241 ( 1991 ) and Barr et al. Vet.
Pathol. 28:110-116 (1991).
Monoclonal antibodies may be obtained by various techniques familiar to those skilled in the art. Briefly, spleen cells from an animal immunized with a desired antigen are immortalized, commonly by fizsion with a myeloma cell (see Kohler &
Milstein, Eur. J. Immunol. 6:511-519 (1976)). Alternative methods of immortalization include transformation with Epstein Barr Virus, oncogenes, or retroviruses, or other methods well known in the art. Colonies arising from single immortalized cells are screened for production of antibodies of the desired specificity and affinity for the antigen, and yield of the monoclonal antibodies produced by such cells may be enhanced by various techniques, including injection into the peritoneal cavity of a vertebrate host.
Monoclonal antibodies produced in such a manner are used, for instance, in ELISA diagnostic tests, immunoperoxidase tests, immunohistochemical tests, for the in vitro evaluation of spirochete invasion, to select candidate antigens for vaccine development, protein isolation, and for screening genomic and cDNA libraries to select appropriate gene sequences.

Immunodiagonostic detection of C. pneumoniae infections The present invention also provides methods for detecting the presence or absence of C. pneumoniae, or antibodies reactive with it, in a biological sample. For instance, antibodies specifically reactive with Chlamydia can be detected using either Chlamydia proteins or the isolates described here. The proteins and isolates can also be used to raise specific antibodies (either monoclonal or polyclonal) to detect the antigen in a sample. In addition, the nucleic acids disclosed and claimed here can be used to detect Chlamydia-specific sequences using standard hybridization techniques.
For a review of immunological and immunoassay procedures in general, see Basic and Clinical.rmmunology (Stites & Terr ed., 7th ed. 1991)). The immunoassays of the present invention can be perfonmed in any of several configurations, which are reviewed extensively in Enzyme Immunoassay (Maggio, ed., 1980); Tijssen, Laboratory Techniques in Biochem.stry and Molecular Biology ( 1985)). For instance, the proteins and antibodies disclose 1 here are conveniently used in ELISA, immunobiot analysis and agglutination assays.
In brief, immunoassays to measure anti-Chlamydia antibodies or antigens can be either competitive or noncompetitive binding assays. In competitive binding assays, the sample analyte (e.g., anti-Chlamydia antibodies) competes with a labeled analyte (e.g., anti-Chlamydia monoclonal antibody) for specific binding sites on a capture agent (e.g., isolated Chlamydia protein) bound to a solid surface. The concentration of labeled analyte bound to the capture agent is inversely proportional to the amount of free analyte present in the sample.
Noncompetitive assays are typically sandwich assays, in which the sample analyze is bound between two analyte-specific binding reagents. One of the binding agents is used as a capture agent and is bound to a solid surface. The second binding agent is labelled and is used to measure or detect the resultant complex by visual or W strument means.
A number of combinations of capture agent and labelled binding agent can be used. For instance, an isolated Chlamydia protein or culture can be used as the capture agent and labelled anti-human antibodies specific for the constant region of human antibodies can be used as the labelled binding agent. Goat, sheep and other non-l.uman antibodies specific for human immunoglobulin constant regions (e.g., y or p.) are well known in the art. Alternatively, the anti-human antibodies can be the capture agent and the antigen can be labelled.
Various components of the assay, including the antigen, anti-Chlamydia antibody, or anti-human antibody, may be bound to a solid surface. Many methods for immobilizing biomolecules to a variety of solid surfaces are known in the art.
For instance, the solid surface may be a membrane (e.g., nitrocellulose), a microtiter dish (e.g., PVC or polystyrene) or a bead. The desired component may be covalently bound or noncovalently attached through nonspecific bonding.
Alternatively, the immunoassay may be carried out in liquid phase and a variety of separation methods may be employed to separate the bound labeled component from the unbound labelled components. These methods are known to those of skill in the art and include immunoprecipitation, column chromatography, adsorption, addition of magnetizable particles coated with a binding agent and other similar procedures.
An immunoassay may also be carried out in liquid phase without a separation procedure. Various homogeneous immunoassay methods are now being applied to immunoassays for protein analytes. In these methods, the binding of the binding agent to the analyte causes a change in the signal emitted by the label, so that binding may be measured without separating the bound from the unbound labelled component.
Western blot (immunoblot) analysis can also be used to detect the presence of antibodies to Chlamydia in the sample. This technique is a reliable method for confirming the presence of antibodies against a particular protein in the sample. The technique generally comprises separating proteins by gel electrophoresis on the basis of molecular weight, transferring the separated proteins to a suitable solid support, (such as a nitrocellulose filter, a nylon filter, or derivatized nylon filter), and incubating the sample with the separated proteins. This causes specific target antibodies present in the sample to bind their respective proteins. Target antibodies are then detected using labeled anti-human antibodies.
The immunoassay formats described above employ labelled assay components. The label may be coupled directly or indirectly to the desired component of the assay according to methods well known in the art. A wide variety of labels may be used. The component may be labelled by any one of several methods.
Traditionally a radioactive label incorporating 3H,'ZSh ass, i4C, or 32P was used. Non-radioactive labels include ligands which bind to labelled antibodies, fluorophores, chemiluminescent agents, enzymes, and antibodies which can serve as specific binding pair members for a labelled ligand. The choice of label depends on sensitivity required, ease of conjugation with the compound, stability requirements, and available instrumentation.
$ Enzymes of interest as labels will primarily be hydrolases, particularly phosphatases, esterases and glycosidases, or oxidoreductases, particularly peroxidases.
Fluorescent compounds include fluorescein and its derivatives, rhodamine and its '- derivatives, dansyl, umbeliiferone, etc. Chemiluminescent compounds include luciferin, and 2,3-dihydrophthalazinediones, e.g., luminol. For a review of various labelling or signal producing systems which may be used, see U.S. Patent No. 4,391,904, which is incorporated herein by reference.
Non-radioactive labels are often attached by indirect means. Generally, a ligand molecule (e.g., biotin) is covalently bound to the molecule. The ligand then binds to an anti-ligand (e.g., streptavidin) molecule which is either inherently detectable or covalently bound to a signal system, such as a detectable enzyme, a fluorescent compound, or a chemiluminescent compound. A number of ligands and anti-ligands can be used. Where a Iigand has a natural anti-ligand, for example, biotin, thyroxine, and cortisol, it can be used in conjunction with the labelled, naturally occurring anti-ligands.
Alternatively, any haptenic or antigenic compound can be used in combination with an antibody.
Some assay formats do not require the use of labelled components. For instance, agglutination assays can be used to detect the presence of the target antibodies.
In this case, antigen-coated particles are agglutinated by samples comprising the target antibodies. In this format, none of the components need be labelled and the presence of the target antibody is detected by simple visual inspection.
Phazmaceutical Compositions The peptides or antibodies (typically monoclonal antibodies) of the present invention and pharmaceutical compositions thereof are useful for administration to mammals, particularly humans, to treat and/or prevent Chlamydia infections.
Suitable formulations are found in Remington's Pharmaceutical Sciences, Mack Publishing Company, Philadelphia, PA, 17th ed. (1985).

The immunogenic peptides or antibodies of the invention are administered prophylactically or to an individual already suffering from the disease. The peptide compositions are administered to a patient in an amount sufficient to elicit an effective immune response to Chlamydia. An effective immune response is one that inhibits infection. An amount adequate to accomplish this is defined as "therapeutically effective dose" or "immunogenically effective dose." Amounts effective for this use will depend on, e.g., the peptide composition, the manner of administration, the stage and severity of the disease being treated, the weight and general state of health of the patient, and the judgment of the prescribing physician, but generally range for the initial immunization (that is for therapeutic or prophylactic administration) from about 0.1 mg to about 1.0 mg per 70 kilogram patient, more commonly from about 0.5 mg to about 0.75 mg per 70 kg of body weight. Boosting dosages are typically from about 0.1 mg to about 0.5 mg of peptide using a boosting regimen over weeks to months depending upon the patient's response and condition. A suitable protocol would include injection at time 0, 4, 2, 6, 10 and 14 weeks, followed by further booster injections at 24 and 28 weeks.
For therapeutic use, administration should begin at the first sign of infection. This is followed by boosting doses until at least symptoms are substantially abated and for a period thereafter. In some circumstances, loading doses followed by boosting doses may be required. The resulting immune response helps to cure or at least partially arrest symptoms and/or complications. Vaccine compositions containing the peptides are administered prophylactically to a patient susceptible to or otherwise at risk of the infection.
The pharmaceutical compositions (containing either peptides or antibodies) are intended for parenteral or oral administration. Preferably, the pharmaceutical compositions are administered parenterally, e.g., subcutaneously, intradermally, or intramuscularly. Thus, the invention provides compositions for parenteral administration which comprise a solution of the immunogenic polypeptides dissolved or suspended in an acceptable carrier, preferably an aqueous carrier. A variety of aqueous carriers may be used, e.g., water, buffered water, 0.4% saline, 0.3% glycine, hyaluronic acid and the like. These compositions may be sterilized by conventional, well known sterilization techniques, or may be sterile filtered. The resulting aqueous solutions may be packaged for use as is, or lyophilized, the lyophilized preparation being combined with a sterile solution prior to administration. The compositions may contain pharmaceutically acceptable auxiliary substances as required to approximate physiological conditions, such as buffering agents, tonicity adjusting agents, wetting agents and the like, for example, sodium acetate, sodium lactate, sodium chloride, potassium chloride, calcium chloride, sorbitan monolaurate, triethanolamine oleate, etc.
The compositions may also comprise carriers to enhance the immune response. Useful carriers are well known in the art, and include, e.g., KLH, thyroglobulin, alburnins such as human serum albumin, tetanus toxoid, poiyamino acids such as poly(lysine:glutamic acid), influenza, hepatitis B virus core protein, hepatitis B
virus recombinant vaccine and the like.
For solid compositions, conventional nontoxic solid carriers may be used which include, for exarr.ple, pharmaceutical grades of mannitol, lactase, starch, magnesium stearate, soc!.ium saccharin, talcum, cellulose, glucose, sucrose, magnesium carbonate, and the like. For oral administration, a pharmaceutically acceptable nontoxic composition is formed Y y incorporating any of the normally employed excipients, such as 1 ~ those carriers previously listed, and generally 10-95% of active ingredient, that is, one or more peptides of the invention, and more preferably at a concentration of 25%-75%.
As noted above, the peptide compositions are intended to induce an immune response to Chlamydia. Thus, compositions and methods of administration suitable for maximizing the immune response are preferred. For instance, peptides may be introduced into a host, including humans, linked to a carrier or as a homopoiymer or heteropolymer of active peptide units from various Chlamydia proteins disclosed here.
Alternatively, a "cocktail" of polypeptides can be used. A mixture of more than one polypeptide has the advantage of increased immunological reaction and, where different peptides are used to make up the polymer, the additional ability to induce antibodies to a number of epitopes.
The compositions also include an adjuvant. As used here, number of adjuvants are well known to one skilled in the art. Suitable adjuvants include incomplete Freund's adjuvant, alum, aluminum phosphate, aluminum hydroxide, N-acetyl-rnuramyl-L-threonyl-D-isoglutamine (thr-MDP), N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine (CGP 11637, referred to as nor-MDP), N-acetylinuramyl-Lalanyl-D-isoglutaminyl-L-alanine-2-{1'-2'-dipalmitoyl-sn-g:ycero-3-hydroxyphosphoryloxy)-ethylamine (CGP 19835A, referred to as MTP-PE), and RIBI, which contains three components extracted from bacteria, monophosphoryl WO 00!17994 PCT/US99/26923 lipid A, trehalose dimycolate and cell wall skeleton (MPL+TDM+CWS) in a 2%
squalenelTween 80 emulsion. The effectiveness of an adjuvant may be determined by measuring the amount of antibodies directed against the immunogenic peptide.
The concentration of immunogenic peptides of the invention in the S pharmaceutical formulations can vary widely, i.e. from less than about 0.1 %, usually at or at least about 2% to as much as 20% to 50% or more by weight, and will be selected primarily by fluid volumes, viscosities, etc., in accordance with the particular mode of administration selected.
The peptides of the invention can also be expressed by attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use of vaccinia virus as a vector to express nucleotide sequences that encode the peptides of the invention. Upon introduction into a host, the recombinant vaccinia virus expresses the immunogenic peptide, and thereby elicits an immune response. Vaccinia vectors and methods useful in immunization protocols are described in, e.g., U.S. Patent No. 4,722,848.
Another vector is BCG (Bacille Calmette Guerin). BCG vectors are described in Stover et aI.
(Nature 351:456-460 (1991)). A wide variety of other vectors useful for therapeutic administration or immunization of the peptides of the invention, e.g., Salmonella typhi vectors and the like, will be apparent to those skilled in the art from the description herein.
The DNA encoding one or more of the peptides of the invention can also be administered to the patient. This approach is described, for instance, in Wolff et. al., Science 247: 1465-1468 (1990) as well as U.S. Patent Nos. 5,580,859 and 5,589,466.
In order to enhance serum half life, the peptides may also be encapsulated, introduced into the lumen of liposomes, prepared as a colloid, or other conventional techniques may be employed which provide an extended serum half life of the peptides.
A variety of methods are available for preparing liposomes, as described in, e.g., Szoka et al., Ann. Rev. Biophys. Bioeng. 9:467 (1980), U.S. Pat. Nos. 4, 235,871, 4,501,728 and 4,837,028.
EXAMPLES
The following examples are offered to illustrate, but no to limit the claimed invention.
Examvle 1:

This example describes comparison of the C. pneumoniae genome disclosed here and the, previously sequenced, C. trachomatis genome (Stephens, et al.
Science 282:754-759 (1998)).
The apparent low level of DNA homology between C. trachomaris and C.
pneumoniae (Campbell, et al., J. Clin. Microbiol. 25:1911-1916 {1987)) yet analogous cell structures and developmental cycles, predicts that comparative analysis of the two genomes will significantly enhance the understanding of both pathogens.
Identification of genes that are present in one species but not the other are of particular importance for the mutually exclusive biological, virulence and pathogenesis capabilities of each.
Identification of genes shared between the two species strongly supports the requirement for these capabilities in a biological system that has, over its long-term association with mammalian host cells, evolved to reduce the metabolic capacities while optimizing survival, growth and transmission of these unique pathogens.
The previously sequenced G trachomatis genome contains 1,042,519 I S nucleotides and 875 likely protein-coding genes. Similarity searching permitted the inferred functional assignment of sequences 636 {60%) genes disclosed here and (23%) are similar to hypothetical genes for other bacterial organisms including those for G trachomatis. The remaining 186 (17%) genes are not homologous to sequences deposited in GenBank.. Seventy C. trachomatis genes are not represented in the C.
pneumoniae genome. These are contained within blocks consisting of 2-17 genes and 19 single genes. Of the 70 G trachomatis genes without homologs in C. pneumoniae, 60 are classified as encoding hypothetical proteins. The remaining genes not represented in C
pneumoniae consist of the tryptophan operon (trpA,B,R), trpC, two predicted thiol protease genes, and 4 genes assigned to the phospholipase-D superfamily.
It is evident that there is a high level of functional conservation between C.
pneumoniae and C. trachomatis as orthologs to C. trachomatis genes were identified for 859 (80%) of the predicted coding sequences for G pneumoniae. The level of similarity for individual encoded proteins spans a wide spectrum (22-95% amino acid identity) with an average of 62% amino acid identity between orthologs from the two species.
The percent amino acid identity between orthologous chlamydial proteins is similar among functional groups with the highest for proteins associated with translation and the lowest for proteins whose function in chlamydiae is uncharacterized and not related to proteins encoded by other organisms. The gene order of the homologous set of genes in C.

WO 00/27994 PCT/US99/2b923 pneumoniae shows reorganization relative to the genome of C. trachomatis;
however, there is a high level of synteny for the gene organization of the two genomes.
We identified thirty-nine blocks of 2 or more genes whose gene organization is colinear with homologs to C. trachomatis, although some of these are inverted. The distribution of genome reorganization is not evenly distributed on the chromosome as the region between G pneumoniae coding sequences 0130-0300 contains substantially more reorganization than other areas of the genome. This region coincides with the predicted chromosome replication terminus.
We identified orthologs of enzymes characterized in other bacteria that account for the essential requirements for DNA replication, repair, transcription and translation including two predicted DNA helicases of the Swi2/Snf2 family found in C.
trachomatis. Similar to G trachomatis, alternative sigma subunits for RNA
polymerase, X28 ~d ~54~ were identified in addition to anti-a~ regulatory system factors RsbV, a RsbW-like single-domain histidine kinase, and a RsbU-like protein phosphatase.
These findings suggest that the fundamental mechanisms of transcriptional regulation are conserved among Chlamydia. The C. trachomatis proteins containing SET and SWIB
domains, and a SWiB domain fused to the C-terminus of the chlamydial topoisomerase I, not identified outside eukaryotes, are found in C. pneumoniae supporting their possible role in the chromatin condensation-decondensation characteristic of the biologically unique chlamydial developmental cycle.
The central metabolic pathways inferred from the G pneumoniae genome sequence are the same as those identified for C. trachomatis G pneumoniae has a glycolytic pathway and a linked tricarboxylic acid cycle, although likely functional, is incomplete as genes for citrate synthase, aconitase, and isocitrate dehydrogenase were not identified. C. pneumoniae has a complete glycogen synthesis and degradation system supporting a role for glycogen synthesis and utilization of glucose-derivatives in chlamydial metabolism. Genes encoding essential functions in aerobic respiration are present and electron flux may be supported by pyruvate, succinate, glycerol-3-phosphate, and NADH dehydrogenases, NADH-ubiquinone oxidoreductase and cytochrome oxidase.
C. pneumoniae also contains the V (vacuolar}-type ATPase operon and the two ATP
translocases found in C trachomatis.
The type-III secretion virulence system required for invasion by several pathogenic bacteria and found in the C. trachomatis genome in three chromosomal locationsis also present in the C. pneumoniae genome. Each of the components is conserved and their relative genomic contexts are conserved. Genes such as a predicted serine/threonine protein kinase and other genes physically linked to genes encoding structural components of the type-III secretion apparatus, but without identified homologs, are also highly similar between the two species suggesting the functional roles in modifying cellular biology are fundamentally conserved.
Chlamydia-encoded proteins that are not found in chlamydial organisms but localized to the intracellular chlamydial inclusion membrane are likely essential for the unique intracellular biology and perhaps differences in inclusion morphology observed between species of Chlamydia. Several such proteins, termed incA,B&C, have been characterized for a !:. psittaci strain (Rockey, et al. Mol. Microbiol.
15:617-626 (1995); Rockey et al. Inf:~ct. Immun. 62:106-112 (1994)). C. pneumoniae and C.
trachomatis encode orthc~logs to C. psittaci Inca and IncC and C. trachomatis also contains an ortholog to LicA. C. pneumoniae contains two genes that encode proteins with similarity to IncA (CPn0186 and CPn0585), although the level of homology is low suggesting analogous but possibily altered functions.
The tryptophan biosynthesis operon (trpA, trpB, trpR) and trpC identified in C. trachomatis is conspicuously missing in the C. pneumoniae genome. This represents the entire repertoire of genes associated with tryptophan biosynthesis identified in C. trachomatis. Seventeen genes adjacent to the C. trachomatis tryptophan operon also were not found in the G pneumoniae genome. This region is the single largest loss of a contiguous genomic segment and includes 4 HKD superfamily encoding genes that encompass a family of proteins related to endonuclease and phospholipase D.
These findings may be important for the ability of Chlamydia to persist in their hosts and cause disease by eliciting potent, focal and persistent inflammatory responses thought to be essential for pathogenesis.
The C. pneumoniae genome contains 187,711 additional nucleotides compared to the C. trachomatis genome, and the 214 coding sequences not found in C.
trachomatis account for most of the increased genome size. Eighty-eight of these genes are found in blocks of >10 genes {11-30 genes/block), 41 are single genes, and the remainder are partnered with at least one other gene. Based upon the observation that ~%U% of all the C. pneumoniae genes have an identifiable homolog in GenBank, exclusive of C. trachomatis, it would be expected that over 150 of the 214 genes should have a homolog in GenBank, many associated with a function. However, only 28 coding sequences have similarity to genes from other organisms. Thus the majority of the genes that are mutually exclusive of C. trachomatis (186 of 214), and the 60 of 70 G
trachomatis genes that lacked an identifiable homolog in C. pneumoniae, do not have detectable homologs to genes from other organisms. We predict that most of the unique genes are essential for specific attributes that define the differential biology, tropism and pathogenesis of C. trachomatis and C. pneumoniae. Moreover, this suggests that C.
pneumoniae has more unique biological (i.e., virulence) capacity than C.
trachomatis.
The ability of C. pneumoniae to be more invasive and survive in a broader range of host cell types than C. trachomatis is consistent with this hypothesis. Not all of the differences in biological capacity may be associated with mutually exclusive genes. One explanation for the significantly lower level of homology between protein sequences assigned as having G pneumoniae and C. trachomatis orthologs but no identifiable orthologs in other organisms is that this set of proteins is not only associated with biological requirements specific for Chlamydia but this polymorphism may account for differential biology between the two species. The determination of the genome sequence from a representative of the C. psittaci group will precisely delineate those genes that are mutually exclusive and specific for each species.
The major functionally identifiable addition to the C. pneumoniae genome is a large expansion of genes encoding a new family of chlamydial polymorphic membrane proteins (Pmp), alone representing 22% of the increased coding capacity.
While the C. trachomatis genome has 9 pmp genes, remarkably the C. pneumoniae genome contains 21 pmp genes. Most of these genes appear to be amplified in two regions of the genome with three stand-alone genes. Interestingly one of the stand-alone genes is most closely related to the C. trachomatis pmpD which is the only stand-alone pmp gene in the C. trachomatis genome and it is located with the same relative genomic context, suggesting an essential and conserved function for this paralog. Six Pmp-coding genes are presumably not functional as five contain predicted coding frame-shifts and one is truncated. The amplification of this gene family and the confidently predicted frame-shifts suggest a specific molecular mechanism to promote functional or antigenic diversity. The biological role of this protein family remains enigmatic, although at least one of the proteins in G psittaci related to this family is exposed on the chlamydial surface.

WO 00/27994 PCT/US99/2b923 While a function could not be assigned for most of the unique G
pneumoniae genes, several have significant similarity to genes from other organisms.
Functional assignments could be made for genes encoding GMP synthetase, IMP
dehydrogenase, (JMP synthase, uridine kinase, biotin svnthase pathway proteins, methylthioadenosine nucleosidase, a DNA glycosylase and aromatic amino acid hydroxylase. Thus a complete pathway was identified for biotin biosynthesis.
The additional purine and pyrimidine salvage pathway genes presumably reflect metabolic ' limitations in one of the cell types that G pneumoniae infects or differences in the ability of C. pneumoniae to transport precursor nucleosides or nucleotides.
The addition of aromatic amino acid hydroxylase in G pneumoniae is intriguing especially in light of the loss of tryptophan biosynthetic genes and the inability to synthesize other amino acids including phenylalanine. Aromatic amino acid hyroxlyases include three distinct enzymes that function to receptively oxidize phenylalanine to tyrosine, tyrosine to Dopa, and tryptophan to 5-hydroxytryptophan and serotonin. Although the chlamydial protein is similar to proteins of this family and incrementally more closely related to tryptophan hydroxyiase, its specific function could not be confidently predicted. We hypothesize that it may be involved in C.
pneumoniae virulence. Tryptophan hydroxylase has not been previously identified in bacteria and the origin of the chlamydial gene appears to be from eukaryotes. The functional role of an aromatic amino acid hydroxyiase for C. pneumoniae is linked to the unique intracellular biology of this organism and may represent a key contribution to C. pneumoniae persistence and pathogenesis.
It is understood that the examples and embodiments described herein are for illustrative purposes only and that various modifications or changes in light thereof will be suggested to persons skilled in the art and are to be included within the spirit and purview of this application and scope of the appended claims. All publications, patents, and patent applications cited herein are hereby incorporated by reference in their entirety for all purposes.
Table 1 provides functional assignments of C. pneumoniae nonprotein-encoding genomic sequences. Table 2 provides functional assignments of protein coding sequences. Table 3 provides the amino acid sequences of the proteins corresponding to the coding sequences.

type SEQ iD N0:1 SEQ tD N0:1 Gene start position end position Ori 841664 841396 (R) Putative Origin of Replica tmRNA 138493 138074 (R) tmRNA

pRNA 607342 607649 Ribonuclease P
RNA

rRNA 1000564 1002115 165 rRNA

rRNA 1002415 1005278 235 rRNA

rRNA 1005393 1005509 5S rRNA

tRNA 269070 269142 Ala tRNA_1 tRNA 164318 164389 Asn tRNA

tRNA 296224 296151 (R) Asp tRNA

tRNA 836191 836119 (R) Ala tRNA_2 tRNA 1030533 1030603 Cys tRNA

tRNA 784896 784822 (R> Glu tRNA

tRNA 781680 781610 (R) Gly tRNA'1 tRNA 961536 961607 Gly tRN~2 tRNA 999949 1000023 His tRNA

tRNA 268992 269065 Ile tRNA

tRNA 672236 672318 Leu tRNA 1 tRNA 680178 680257 Leu tRNA'2 tRNA 715889 715971 Leu tRNF~3 tRNA 739403 739486 Leu tRNPr_4 tRNA 1175863 1175944 Leu tRNA'5 tRNA 784994 784922 (R) Lys tRNA

tRNA 843926, 843999 Pro tRNA_2 tRNA 409922' 409848 (R> Pro tRNA_1 tRNA 631373 631445 Phe tRNA

tRNA 677337 677264 (R) Arg tRNA~,2 tRNA 807413 807341 (R) Arg tRNA_3 tRNA 877473 877400 (R) Arg tRNA_4 tRNA 462141 462214 Arg tRNA_1 tRNA 1085605 . 10.85676 Gln tRNA

tRNA 786780 786708 (R) Thr tRNA_3 tRNA 89728 89657 (R) Thr tRNA_I

tRNA 293477 293405 (R) Thr tRNA'2 tRNA 87522 87450 (R) Met tRNP~l tRNA 199301 199229 (R) Met tRNA_2 tRNA 199390 199317 (R) Met tRNA_3 tRNA 626904 626987 Ser tRNA_1 tRNA 708359 708440 Ser tRNA_2 tRNA 1112034 1142117 Ser tRNA_3 tRNA 1230028 1229945 (R) Ser tRNA_4 tRNA 91070 90999 (R) Trp tRNA

tRNA 293399 293317 (R) Tyr tRNA

tRNA 296147 296075 (R) Val tRNA_1 tRNA 1137389 1137462 Va1 tRNA_2 gacggatttgcactgccggtagaactccgcgaggtcgtccagcctcaggcagcagctgaa2520 ccaactcgcgaggggatcgagcccggggtgggcgaagaactccagcatgagatccccgcg2580 ctggagg ~ana ~rem "2 ctraeW:eee a~~n~'fee ~~ rerheeierfl erheiee iartateaGheaeel Clft0001111 4 R CT001 hypothetical protein CPa0003577 175 t QatC-Glu-CRNA Gla luaidotransterasa tC subunit)-CPn0007195 X770 T aat~-Glu eRNl1 Gln Aaridotransterae-1GT003!

ma)un.t:!ih :~:! t- ,(.~rN.cPe~ll=t ~:In rnrt:, vln Ammto~san.-.:~rrrsr t1 ::uWtniW -t~:T9n.n vPn~0U5ils7 edJ1 F pmp_1-7olymorphic ~uc.rt !(emDrin~ Procmn G Famsly CPn0005~=93 7111 R

CPn00077805 10196 F

CPn000810975 11615 F

CPn000911115 13119 t cPnooloa a 13x6 r s CPa00101379 13746 t frame-shift with 0010 CPn00I11519= 16114 !

CPn001316it4 11x12 !

CPn001311511 =1106 F ymp_1-Polynnrphie Outer Membrane Protein C
lamily CPa0014II392 219x3 r ymD_3-Polyanrphic outer Membrane Protein G
lastly CPn001537.135x174 t' pmp-"3-PNP_3 lirasre-shift with 0011) CPn001614416 26118 t' pmp_t-Polyteosphie Outer Membrane Protein G lastly CPn001726094 x7170 F' pop_t-PMP_< Iiras~-shift with 00161 CPn0018375x2 29007 t pmp_5-Polymorphic outer Membrane Procain G
Pamily cIPn001929007 30356 t pap_S-PMP_S Iirame-shift with 0011) CPa00x031617 30603 P. Predieted OHP (leader (14) pepCide= outer membrane)-ICT7511 CPn00x131410 3x707 R Predicted OHP (Leader 119) prptide)-1CT350) CPn00xx1191 34395 F maL-1CT319) CPa00x336607 3301 F y~ilc/alr-ABC TraasDOrter Protein ATPeee-ICT348) CPn00x137596 36661 F xerC-InteQrsse/reeombinaae-lCT3t7) CPa00x51860 37614 A elaC/atsa-Sulphohydrolase/Glycosuliataae-( Crlt6l CPa00x639625 3176= R GT3t5 hypoe3tetical protein-ICTltS) CPn00x7txx3t 39778 R lon-Lon ATP-dependent Protease-lGT3tl CPn00x813325 txSt3 R

CPa00x93755 43310 R

CPn003013191 415x9 f Qep_1-O-SialoQlycaDroeaia Eadopeptidase_1-ICT343) CPa003114711 44111 ! rs=1-SZ1 Ribosaetal Ptrouin-fCT3t=I

Cln003x4913 46098 T daaJ-Meat Shoek Proteia J-ICT7ti) CPn003346138 8171 P pdhArrB/odbAiodbH-lpyruvatel Oxoisovalerate DehYdroOsasse Alpba i seta Pusioa-ICT3t0) CPa003449457 41=10 R

CPis0035510x9 49569 R CT339 hypothetical protein CPts00365100? 51796 t CT338 hypothetical Droeeia CPn003751792 5x115 F ptsH-PTS Phosphocarrisr Protein Hpr-It-T337) CPn00385x119 63831 F DtsI-PTS PtP Phosphotransierase-ICT336) ePn003954x50 53163 R ybal-ICT335) CPn00405563 St318 R dnaX_1-DN11 Pol III Gamma and Tau_1-fCT3341 CPn004156996 5733 T

CPa004257103 5113 !

CPn00t35847 60372 !

CPn004460419 60771 !

CPn00t561069 6=790 t CPa004661790 61x63 t CPn004761155 63151 ~
T

CPn00486311? 85101. ! yqiP-Is conserved hypothetical IM proesin CPn001966x96 651)7 R

CPn0050B613 66199 R

CPn00s166173 67111 t CPn005261005 6730 R hemC-Porphobilinot:en Oeaaunass-ICTZ991 CPn005369744 67916 A sms-Sau Protein-ICTZ91) CPn0054700:3 69713 R rnc-Ribonucluse III-(CTx97) CPn0055701x9 70590 F CT296 hypothetical proteia ' Cpn00s670953 72746 t .n~rsA-PhosDhoernnomucasr ICTt95) CPn00577=971 73551 F sodM-SuDeroxide Dismucase lMnlIC'T'l94) CPn00587)839 7156= F aec0-AeCoA Casboxylase/Transierase eca-tCT293) ePa0059'1618 71050 F duc-dtlTP Nueleocidohydrolase-ICTZ92) CPn006075055 755x8 F pesN_t-PTS IIA Protein-IC'C:9lf ~Pn005175514 76:08 F DtaN_Z-PTS IIA Protein ttTN DNJ1-lin3inQ
Domain-lGTx90) ~Pn~05274)04 77490 F CT-~9 hypocnecieAl proceln ~:Pn~~b77811: 74':67 F

~Pn00b17N146 78S7b F

~:Pnn9551N9:4 40651 F C't'=99 hyDOther:i,:nl protein ~Pn0055409:5 d=655 F

CPn006782953 8053 F

CPn006884903 81331 R CT360 hypothetical pzocein CPn006985236 87086 F

CPn007087378 87208 R

CPn007188045 87599 R CT325 hypothetical protein CPn007289061 88057 R CT324 hypothetical protein CPn007389356 89574 F infA-Initiation Factar IF-1-ICT323) CPn007d89774 90955 F cufA-Elongation Factor Tu-1CT322) CPn007591102 91350 F secE-preprotein cranslocaseICT321) CPn007691358 91903 F nusG-Transcriptional Anciterminacion-(CT320) CPn007792013 92435 F zlll-L11 Ribosomal Protein-(CT3191 CPn007892465 93160 F zll-L1 Ribosomal Protein-(CT318) CPn007993179 93688 F r110-L10 Ribosomal Protein-(CT317) CPn008093735 91131 F r17-L7/L12 Ribosomal Protein-(CT316>

CPn008194261 98016 F rpoH-RNA Polymerase Heca-(CT315) CPn008298043 102221 F rpoC-RNA Polymeraae Heta~ -ICT314) CPn0083102332 103312 F tal-Transaldolase-(CT313) CPn0084103362 103751 F predicted ferredoxin-ICT312) CPn0085104506 103755 R CT311 hypothetical protein CPn0086104904 105527 F atpE-ATP Synchase Subuait E-(CT310) CPn0087105579 105376 F CT309 hypothetical protein CPn0088106373 108145 F atpA-ATP Syachase Subuait A-(CT308) CPn0089108153 10966 F atpH-ATP Synthase Subunit e-(CT307) ~Pn0090109454 110080 F atpD-ATP Synthase Subunit D-(CT306) CPn009110074 112053 F 1 atpI-ATP Synthase Subunit I-ICT3051 CPn0092112151 112573 F atpK-ATP Synthase Subunit K-(CT304) CPn0093112509 113015 F CT303 hypothetical protein CPn0094113152 115971 F valS-Valyl tRNA Synthetase-ICT302) CPn0095116037 118790 F pfai0-5/T Protein Kinsse-(CT301) CPn0096124314 118837 R uvrA-Excinuclease AeC Subunit A-(CT333) CPn0097124555 126006 F pyk-Pyruvate Kinase-(CT332) CPn0098127491 126091 R htrH-ACyltransferase-ICT010) CPn0099127593 127865 F

CPn0100129141 127882 R CT011 hypothetical protein CPn0101129932 129141 R ybbP family hypothetical protein-ICT012) CPn0102130123 131466 F cydA-Cytochrome Oxidase Subunit I-(CT013) CPn0103131480 132511 F cycle-Cytochrome Oxidase Subunit II-(CT014) CPnOlOd133875 132676 R~ CT017 hypothetical protein CPn0105134847 134029 R CT016 hypothetical protein CPn0106135091 136374 F phoH-ATPase-(CT015) CPn0107137162 136392 R CT058 hypothetical pzotein_1 CPn0108137857 137303 R CT018 CPn0109138655 141783 F ileS-Isoleucyl-tRNA Synthecase-1CT019) CPn01101373 141827 R lepe-Signal Peptidase I-ICT020) CPn011114686 143934 R CT021 hypothetical protein CPn0112144767 145093 F r131-L31 Ribosomal Protein-(CT022) CPn0113145335 146405 F pfrA-Peptide Chain Releasing Factor (RF-1)-(CT0231 CPnOlld146398 147261 F hemK-A/G specific methylase-(CT024) CPa0115147279 148622 F ffh-Signal Recognition Particle GTPase-(CT025) CPn0116148616 148972 F rsl6-516 Ribosomal Protein-(CT026) CPn0117148989 150071 F tzmD-tRNA (guanine N-1)-Methylttansferase-(CT027) CPn0118150102 150464 ~ s119-L19 Ribosomal Protsin-(CT'028) F

CPn0119150523 151164 F rnhe_1-Ribonuclease HII_1-ICT029) CPn0120151164 151778 F gmk-GMP Kinase-(CT030) CPn0121151778 152068 F CT031 hypothetical protein CPn0122152071 153723 F mete-Methionyl-tRNA Synehetase-(CT032) CPa0123155969 153774 R recD_1-Exodeoxyribonuclease V (Alpha Subunit)_1-(CT033) CPn0124156614 158068 F

CPn0125158096 158605 F

CPn0126158809 161085 F

CPn0127162143 161130 R ytfF-Cationic Amino Acid Transporter-(CT034) CPn0128162277 163053 F bpll-Hiotin Protein Lipase-(CT035) CPn0129163717 16306 R similarity to CT036 CPn013016425 163751 R

CPn0131164519 165580 F

CPn0132165587 166561 F

CPn013316733 16656 R CHLPS hypothetical protein-(CT109) CPn0134169098 167467 R groEL_1-HSP-60_1-tCT1101 CPn0135169448 16913 R groES-lOKDa Chaperonin-(CT1111 CPn0136171401 169569 R pepF-Oligopepcidnse-ICT112) CPn0137172254 171502 A ybgI-ACR family-tCT1081 CPn013817019 172700 R hem:..-Glucatrace-1-semialdehyde-2.1-aminomutase-CPn013917465617093 R ypq=-(~j10) CPaoltO175110171173 R yqdi-tCTSlI

CPnoltl175103175110 R splA-Ribose-5-P Isasrrase A-tCTZ131 CP1l01t2176091175116 R

CPn01t317T33s176114 R 'yxjC_Ds_1 ttypothecical Proceia CPn0114177963180560 F elpl-Clp Protease ATPass-tCT1I31 cPaoltsI8o777I1=369 F CTllt hypochetiul protein cPnoltsIaI1131e3o9s r cPetolt7Ia3tI5113171 F

CPn0ltB18316 183702 F plasl-S/T Protein Kinase-tCT115) CPaolt918371517700 F dalJ-DNA LiQase-tCTi46) CPno1501171311911 F CTlt7 hypothetical protein CPn01511911 19=635 R mhpJ~-tioaooxypeeuse-(CT1181 ClnolSl19!=6s19718 R CT119 hypotbetiul prot~ia ~Paol5319533 197113 F leul-Leueyl tRNA 8yaehttass-fCTt09!

ClnolSt197892199301 F pseA-1CD0 Traaslsrase-ICTI08f CPnolSS191691191118 R

CPnol561001171!1770 R

CPao157100713100=98 A

CPtl015820130 100191 R

CPnoi59=01772101167 R

CPno160303791303137 R pfkJ~i-Fructose-b-P Phospdocraasferase_1-lCTI07) CPao161101612303798 R psedietad aeylcraasferase lamily-tGTI06) CPn016220511810803 R

CPa0163308016=06391 !

CPnolbt208198106!98 !

CPnoi65306198207583 P

CPetoi66z07830207963 !

CP~WI67201306107977 R

CPnolBB20161 201417 R

CPno169109501101710 R

CPao170111016110015 R

Clnol7lIi=13621119 R '~faA-Clip 9ynthaas Clnol7l11317721=110 R QuaD/lapD-laosiae 5'-moaophosphass dehydro0anase IC00R-tesa~iaal savior.

only) Clno173113987213715 R

cPnol7tIlass7Iu7It F

CPnol75214198215175 F' Claol76213=86z16318 F CTi53 hypoehatieal protein CPno17721759 116608 R

CPn0178211052317789 R

CPn0179211103218056 R

CPuo180111851218356 R

CPao181219175111777 R

CPn0182110596219331 R aceC-Biocia Carboxylass-tCTiIt) CPn0183111195330695 R ace!-Diocia Carboxyl Carrier Protein-tCTlI3) CPa018t211775231331 R s!p_1-EloaQacion Factor P_I-tCTl3I) CPUo185113151231765 R spe/araD-Ribulose-P Epimsrus-tCT121) CPn0186111199111068 F stmilaricy to Cps IaeJL1-tCT1191 CPnQ117111118213015 F predicted metdylass-fCTi331 CPno188116111111100 F CTI3I hypoebecical prouia CPn0189116100211815 F CSI31 homolo0-(Possible Transmembraas Prouin) ~

Clao19013!!19131271 F

Cla019133199133131 R QlaQ-1180 Amigo Acid Trsasportes ATPase-1CT130) CPao192I3=631131981 R Qln!-A8C Amigo Acid Tsaasporeer Pesmsass-ICT1I9I

CPa0193233126231686 R arQR-ArQiaias Re~tssor CPa019t13311023111 F Qep_I-O-Sialo0lyeoprotsia trrdopsptidase_I-tC?197) CPaoi95231190135786 F oppA_1-Olfpopeptide BindiaQ Procai~l CPa0196336939137519 F app!'.I-Olipopepeids DindiaQ Protais~l-(CT1981 CPnol972375'!8331183 F oppJ~3-Olipopeptlds Diadtap Protsitt-3 Clnol98=79169It07tb F opplL,t-OliQOpsptide Dindta? Psocsl~l CPnoi99ItlOtz31983 F oppD_1-Olipopeptide Pesmsase_1-ICT1l9) CPn0I00111017147868 F opp~i-Olipopeptide Pesmease_1-(CTt00) Cln0I01111161zt371s F oppD-oli0opepclde Transport ATPass-tCTI01) CPnO=02zt1715111500 F oppF-Olipopepcids Trtasporc ATPass-tCTtOI

ClelO=03-'25008 I510z F

craolotztsel7ztlooz F

csoozosztu3 It13z7 F

Clet0I0611610927161 F CTI03 hypothetical psoteia CPt10I07zt7I08111617 F ybhI/sodiT!-OxoQluearacs/Halace Translocatos-tGT20t1 CPa0I08111953z50602 F pi)cJ~.Z-Fructose-b-P Ptsosphotraaatesass_1-tCTI051 CPe0I09251036:51172 F

CPn0210252384 251140 R

CPn0211252756 252463 R

CPn0212254066 252888 A

CPn0213254342 254190 R

CPn0I14255657 254146 R

CPn0215257015 255759 R

CPn0216257608 257174 R

CPn0217257896 258579 F ypdP-(CT140) CPn0218259058 258582 R

CPn0219259357 260472 F tgt-pueuine tANA Ribosyl Transferase-(CT193) CPn0220260696 261238 F

CPn0221261657 262064 F

CPn0222262504 262842 F wak similarity to Hacteriophage CHP1 (Orl4>

CPn0223262956 263333 F

CPn0224263435 263674 !
.

Cpn0225263873 264541 !

CPn0226264566 261967 F

CPn0227265116 265009 R dsb8-Disulfide bond Oxidoreductase-(CT176) CPn0228266110 265412 R dsbG-Disulfide Bond Chaperone-(CT177) CPn0229266328 267560 F CT178 hypothetical protein CPn0230268253 267576 R CT179 hypothetical protein CPn0231268957 268253 R tauH-AHC Transport ATPase (Nitrate/Fe)-(CT180) CPa0232270122 269232 R similarity to 5~-Methylthioadenosine / S-Adenosylhosaeysteine Nucleosidase CPa0233270424 270218 R

CPn0234271240 270548 R CT181 hypothetical protein CPa0235271416 272177 F kdaH-deoxyoetulonosic Acid Syathetase-(CT182) CPn0236272156 273766 F pyre-CTP Synthecase-(CT1831 CPn0237273762 274214 F yggF Family-(CT184) ' CPn0238274303 27$838 F zwf-Glucose-6-P Dehyrogenase-(CT185) CPn0239275899 276672 F devB-Glucose-6-P Dehyrogenase (DevH family)-(CT186) CPa02d0277861 276698 R

CPn0241279354 278203 R

CPn02d2279918 279487 R

CPa02d3280555 280133 R

CPn0244280918 281556 F adk-Adenylate Kinase-(CT128) CPn0215281645 282499 F ydh0-Polysaccharide tiydrolase-Invasin Repeat Family-(CT127) CPn02d6282952 282551 R~ rs9-S9 Ribososial Protein-(CT126) CPn0247283615 282969 R r113-L13 Ribosomal Protein-(CT125) CPa02d8284327 283650 R ycfV/ybbA-AHC Transporter ATPase-(CT152) CPn02d9285841 28333 R CT151 hypothetical protein CPn0250286057 285902 R r133-L33 Ribosomal Protein-(CT1501 CPn0251286060 287559 F eonserved hypothetical protein CPa0252288112 287576 R CT144 hypothetical protein (frame-shift with 0253?) CPn0I5328856 287950 R CT144 hypothetical protein_1 CPn0254289262 288159 R CT143 hypothetical protein'1 CPn0255290165 289329 R CT142 hypothetical protein_1 CPn0256291264 290398 A CTl4d hypothetical protein_2 CPn0257292127 291267 R CTld3 hypothetical proteln,",2 CPn0258292531 292133 R CT142 hypothetical protein (frame-shift with 02591) CPa0259292986 292441 R CTld2 hypothetical protei~2 CPn0260294045 29358 R sec~l-Protein Translocase Subunit_1-(CTidl) CPn0261294302 295033 F ydn0-PP-Loop Superfnmily ATPase-(CT217) CPn0262295091 295933 F surf-Surf-like Aeid Phosphatase-(CT218) CPn0263296249 297136 F yQfU hypothetical protein-ICT221) CPn0264297730 297155 A ubiD-Phenylacrylate Decarboxylase-(CT2201 CPn0265298620 297730 R ubiA-Benzoate Oetaphenyltransferase-(CT219) CPn0266299184 299876 F

CPn0267300122 300910 F

CPn0268300935 301318 F

CPn0269302150 301476 R Dipeptidase-(CT138) CPn0270303325 302468 R ywlC-SuAS Superfamily-related Protein-(CT137) CPn0271303634 301362 F Lysophoepholipase esterase-(CT1361 CPn0272305233 304340 R dnaX_2-DNA Pol III Gamma and Tau_2-(CT187) CPn0273305844 305227 R tdk-Thymidylace Kinase-(CT1881 CPn0274308353 305852 R gyrA_1-DNA Gyrase Subunit A_1-1CT189) CPn0275310786 308372 R gyr8_1-DNA Gyrase Subunit H_1-(CT190) CPn0276311137 310793 R CT191 hypothetical protein CPn0277311910 311104 A

CPn0278312875 312060 R conserved outer membrane lipoprotein protein CPn0279313537 312875 A Posaibls ABC Transporter Pe:mease Protein CPa0280314572 313550 A dppF-Oipeptide Transporter ATPaee-(CT689) CPn0281315057 316103 F dhnA-Predicted 1.6-Fructose Hiphosph..-i Aldolase Idehydrin family)-(CT215) CPn0282316126 317529 F xasA/gadC-Amino Acid Transporter-(CT216) CPn028331897 317532 R

CPn0284319045 318551 R

CPn0285320595 319051 R

CPn018632=059 320650 R mgtE-Mq Transporter ICHS Domain)-(CT194) CPn0287321221 322089 R ' CPn0288325716 321571 R CT195 hypothetical protein CPn0289325812 326996 F aaaT-Neutral Amino Acid lGlutamate) Traruporter-(CTZ30I

CPn0290327042 328523 F Na-dependent Transporter-ICT231) CPn0291321667 3=9191 F incH-Inclusion Membrane Protein H-ICT232) CPn0292329118 329836 F incC-Iaclusioa Membrane Protein C-ICT233) CPn0293329919 332723 F CT234 hypoehecieal proteia CPn0291333092 333502 F eAMP-Dependent Protein Kiaase Regulatory Subuait-fCT=35) CPn0295333863 333627 R aepP-ACyl Carrier Protein-ICT236) CPn0296331765 331022 R labG-Oxoacyl lCarrier Procaia) Reductase-ICT237) CPn0297335697 334771 a fabD-Malonyl Acyl Carrier Tr~sacyclase-fCT238) CPn0298336721 335717 1t fabN-Oxoacyl Carrier Protein Synthase ZZZ-fCT239) CPn0299336816 337115 ) reeR-Recombination Protein-fCTZ40) CPn0300337783 340152 I yaeT-Omp85 Analog-fCT141I

CPn0301340250 340762 I' fCmpH-Like outer Membrane Protein)-fCT242) CPn0302340787 311866 I' lpxD-UDP Glueosamine N-Aryltransferase-fCT243) CPn0303342958 341921 F' CT211 hypothetical protein CPn0304343133 344158 F pdhA/odpA-Pyruvate Dehydrogenase Alpha-!0235) CPn0305341154 345137 I pdhe/odp8-Pyruvace Dehydrogenase Beta-(022461 CPn0306345145 346431 1 pdhC-Dihydrolipoamide Aeetyltra~leraae-102247) CPn0307348986 346515 1: glgP-Glycogen Phosphorylase-!02248) CPn0308349231 349596 F' simflarity to CT249 CPn0309350974 349595 R dnaA_1-Replication Initiation Protein_1-!02250) CPn0310353433 351049 R 60IM-60kDa Inner Membrane Proteia-!02251) CPn0311354438 353575 R lgt-Prolipoprocein Diacylglycerol Transferase-!0225=I

CPn0312354524 354976 F CT101 hypothetical protein CPa0313354990 355355 F acpS-Aeyl-carrier Pzotein Synchase-102100) CPa0314356285 355353 R trxe-Thioredoxin Reduccase-1020991 CPa0315356977 358716 F rsl-51 Ribosomal Protein-102098) CPa0316358820 360121 F nusA-N Utilisation Protein A-(02097) CPn0317360081 362750 F~ infH-Initiation Faecor-2-!02096) CPn0318363767 363126 F rbfA-Ribosome Binding Factor A-102095) CPn0319363175 363879 F truth-tRNA Pseudouridine Synthase-!02091) CPn0320363860 364783 F ribF-FAD Syntluse-(CTD93) CPn0321365858 364767 R ychF-GTP Binding Protein-!02092) CPn0322366219 367328 F yscU-YopS Translocation Protein U -!02091) CPn0323367331 369460 F lcrD- Low Calcium Response D-(02090) CPn032d369492 3.70688F lcrE- Low Calcium Response E-(02089) CPn0325370708 371148 F sycE-Secretion Chaperone-(02088) CPn0326371148 372725 F malQ-Glueanotransferase-102087) CPn0327372915 373211 F r128-L38 Ribosomal Protein-!02086) CPn0328373241 371992 F GT085 hypothetical protein CPn0329375088 376146 F Phopholipase D SuDerf~lY (leader f33) peptide)-!02084) CPn0330376675 376202 R CT083 hypothetical protein CPa0331378437 376701 R CT082 hypothetical protein ~

CPn0332378655 378536 R CNLTR T2 Protein-!02081) CPa0333379090 378800 R ltue-102080) CPn0334379311 379823 F CT079 similarity CPa0335379817 380671 F folD-Methylene Tetrahydrofolate Dehydrogenase-(02078) CPn0336380650 381591 F yojL-1020771 C>?n0337382027 381575 R smp8- Small Protein 8-102076) CPn0338383278 383375 F dnaN-DNA Pol III (beta chain)-iGT075) CPn0339383420 384030 F reeF-ABC superfamily ATPase-!02074) CPn0340383802 '384156F (frame-shift with 0339) CPn0341384160 384195 F (frame-shift with 0340) CPn0342384622 385062 F predicted OMP ;leader 119) peptide)-tCT073) CPn0313:84999 385595 F (frame-shift with 0342?) CPn0341387420 385558 R yaeL-Metalloprocease-(02072) CPn0315388572 387136 R yaeM-IGT071) CPn0346389675 388704 R cro0/ycgD-Integral Membrane Protein-!02070) CPn0317391021 389678 R croC/ytgC-Integral Membrane Protein-102069) CPn0348391803 391027 R troe/ytqH-ABC transporter ATPase-(020681 CPn0349392770 391790 R t:oA/ycgA-Solute Protein Binding Family-(0':067) CPn035J393181 39368 F CT066 ty~ochecscal Drotein CPn0351397888 395132 F adt_1-ADP/ATP Transloease_1-!02065) CPn0352395574 396830 F

CPn0353396893 397135 F

CPn0354397167 398507 F

CPn0355399889 398591 R

CPn0356400459 400109 R

CPn0357401317 400469 R

CPn0358401751 401578 R

CPn0359402012 403817 F lepA-GTPase-ICT064) CPn0360405358 403922 R gnd-6-Phosphogluconace Dehydrogenase-tCT063) CPn0361406647 405382 R tyrS-tyrosyl tRNA Synthecase-ICT062) CPn0362407825 407055 R fliA/rpsD-Sigma-28/WhiG Family-(CT061) CPn0363409688 407943 R flhA-Flagellar Secretion Protein-(CT060) CPn0361409966 410238 F ferd-Ferredoxin IV-(CT059) CPn0365410528 411544 F

CPn0366411976 412440 F

CPn0367413102 413836 F

CPn0368413790 114107 F

CPn0369414351 415562 F CT058 hypothetical protein_2 CPn0370415800 416912 F CT058 hypothetical procein_3 CPn0371417147 417503 F

CPn0372417687 418001 F

CPn0373418380 420218 F gcpE-ICT057) CPn0374420218 420961 F CT056 hypothetical protein CPn0375421121 411615 F

CPn0376421854 422294 F

CPn0377423438 422347 R suc8_1-Dihydrolipoamide Succiayltransferase_1-ICT055) CPn0378426168 423445 R aucA-Oxoglutarate Dehydrogsnase-ICT054) CPn0379426322 426765 F CT053 hypothetical protein CPn03H0426758 427876 F hemN_1-Coproporphyrinoqen III Oxidase_1-ICT052) CPn0381429809 428037 R CT326 similarity CPn0382430719 470036 R yabC/yraL-SAM-Dependent Methytransferase-(CT048) CPn0383431693 430749 R CT047 hypothetical protein CPn0384432377 431862 R hcte-Histone-like Protein 2-(CT016) CPn0385434018 432522 R pepA-Leuryl Aminopeptidase A-fCTOdS) CPn0386434525 434046 R ssb-SS DNA Binding Protein-ICTOd4) CPn0387435196 431699 R CT043 hypothetical protein CPn0388435329 437320 F qlgX-Glycogen Hydrolase Idebranchiag)-ICTOd2) CPn0389438134 437319 R CTOdl hypothetical protein CPn0390439144 438134 R ruvH-HOlliday Junction Helicase-(CTOdO) CPn0391439692 439510 R

CPn0392439811 440383 F dcd-dCTP Deaminase-fCT039) CPn0393440379 440723 F CT038 hypothetical protein CPn0394440736 441968 F tlyC_1-CBS Domain protein (Hemolysin Homolog)_1-fCT256) CPn0395441964 443175 F CT257 hypothetical protein CPn0395444353 443241 R yhf0-NifS-related protein-ICT258) CPn0397445115 444381 R PP2C phosphatase family-tCT259) CPn0398445533 445700 F

CPn0399445879 446523 F CT253 hypothetical protein CPnOd00446536 447306 F CT254 hypothetical protein CPnOd01117881 417195 R CT255 hypothetical protein CPnOd02448994 447888 R mutt-Adenine Glycosylase-fCT1071 CPnOd03449015 419710 F yceC-predicted pseudouridine synthetase ~ family-(CT106) CPnOd04450887 419871 R

CPa0d05451739 450966 R CT105 hypothetical protein CPn0406451969 452865 F fabI-Enoyl-ACyl-Carrier Protein Reductase-fCT104) CPnOd07453742 452858 R HAD superfamily hydrolase/phosphatase(CT103) CPnOd08454105 454581 F CT102 hypothetical protein CPn0109154645 455127 F CT260 hypothetical protein CPn0410455123 455833 F dna~l-DNA Pol III Epsilon Chain_1-ICT261) CPnOdll455833 456609 F CT262 hypothetical protein CPn0412456590 457246 F CT263 hypothetical protein CPn0413459203 457227 R msbA-Transport ATP Binding Protein-ICT264) CPn0414460113 459172 R accA-ACCOA Carboxylase/Transferase Alpha-fCT265) CPn0115461498 160221 R CT266 hypothetical protein CPn0416461856 461557 R himD/ihfA-Integration Host Factor Alphn-tCT267) CPn0117463035 462244 R nmiA-N-Acetylmuramoyl Alanine Amidase-fCT268) CPn0118464401 462953 A murE-N-ACetylmuramoylalanylglutamyl DAP Lipase-tCT269) CPn0419466834 464876 R pbp3- transqlyeolase/transpeptidase-tCT270f CPnOd20467108 466824 R CT271 hypothetical protein CPn0121467998 467108 R yabC-PHP2H Family.methylcransferase-ICT'272) CPn0122db8242 46.8784F CT273 hypochatical protein CPn0423468791 469216 F CT271 hypothetical protein CPn0t2d169612470961 F dnaA_2-Replication Initiation Factor_c-ICT1751 CPn0425470980!71564 F CT276 hypothetical proteins CPn0426472111471536 R CT277 similarity CPn0427472207473715 F nqrZ-NJ1DH fUbiquinonel DehydroQenase-;CTZ781 CPnOt2847372247681 F nqr3-NJ1DH Itlbiquinonel Oxidoreductass, Gamma-tCT2791 CPn0129471681475319 F nqrl-NADH It7biquinonel Reduetase 1-fCT1801 CPn0130475326476093 F nqr5-N1~DH ttlbiquiaonel Reductase 5-ICT281) CPn0131476183176151 R

CPn0t32176816476514 R

CPn0133477273476929 R QesH-Glycine Clsavape System H Protein-ICT2821 CPn0134179462477276 R CT2B3 hypothetical Drotein Cln0t3548090247975 R Phospholipase D superfamily (uncleavable leader peptide)-(CT38t CPn0t36481618180902 R lpl~-LiDoau Protein LiQase-Like Protein-(CT2851 CPn0137481816184350 F clpC-ClpC Protease-ICT1861 CPn0138185116181334 R yebF-PP-loop superfamily aTPase-ICT287) CPn0139485553486077 F

CPn0ta0486105486710 F

CPn04t1486891187838 F CT007 hypothetical protein CPn0t42188013188528 F Ct006 hypothetical protein CPn043!88729189979 F CT005 hypochecieal protein CPn0114190187191507 F mnp_6-POlymorphic Outer liembrasse Protein G/I Family CPn0115194772197579 F pn~_7-Polymorphic outer ltembraae Protein C Family CPn0446197626500115 F pmD_8-Polymosphic Outer Hembrane Protein G Family CPn0147500568503351 F ps~ 9-Poiyaarphic Outer Membrane Protein G/i Family CPn01t8501810503698 R yxjC~s_2 Hypothetical Protein CPn01t9507131505330 R pmp_10-P!!P_10 tlrame-shift with 0151) CPn0150508112507180 R pmp_10-POlyaasphic Outer Membrane Proteia G Family CPn0t51508275511058 F ymp_11-Polyaasphic Outer !lembrane Protein C Family CPa0152511319512860 F pmp_12-POlymorphie Outer Hembrans Protein 11/I Famfly ltruncated) CPn0453513234516152 F pmp_13 -POlymorphic Outer Hembrane Protein C Family CPn015d516182519115 F pmp_14-POlymorphic Outer Membrane Protein H Family CPn0155520348519458 R

CPa0t56521532520337 A

CPn015751386552=120 R

CPn0458526310521136 R

CPn0t59517005526619 R

CPn0460527840526992 R

CPn0461528638527811 R

CPa0t6Z531052519037 R

CPn0463532357531191 R

CPn0t64531842532366 R

CPn0465533212532871 R

CPn0466533724536537 F pa~_15-Polymosphic outer Membrane Protein E Family CPn04b7536633539434 F pop_16-Poiymorphic Outer M~bsane Protein E Family CPn0168539632540132 F pmp_17-Polymorphic Outer Membrane Proteia E Family CPn0t69540399511160 F pmp_17-POlymorphic Outer Membrane Protein (Frame-shift with 01691 CPn0t705!1357512532 P pmp_17-Polymorphic Outer Membrane Proteia (Frame-shift with 01701 CPn0t715!2564515401 F pn~_18-Polymorphic outer Membrane Protela EIF lamily CPn0t72517905515581 R

CPn0473519593548070 R

CPn0171551573519807 R CT365 hypothetical protein CPnOt755538!4551685 ~ Q198-Gluean Hranchir~ Lnzyme-ICT8661 R

CPa0176551844553858 R CT865 hypothetical proteia CPn0t77556106551814 R yqsV_8s Hypothetical Protein CPn0478557615556210 R hilX-GTP 8indiaQ Protein-tCT3791 CPn0179558125557616 R phnP-Metal Dependent Nydrolase-ICT3801 CPa0t80559301558650 R CT383 hypothetical protein CPn0t81560946559339 R

CPa0482561737560961 R artJ-7lrQinina Periplasmic 8indinQ Protein-tCT3811 CPn0t8356183656961 F

CPn0484564970565824 F aroC-Deoxyhepconats Aldolsse-ICT3B21 CPn0t85566038566129 F CT382.1 hypothetical protein CPa0t86567781566105 R hypothetical proline permease CPn0487569740568112 R CT384 hypothetical protein CPa0t88570096569767 R hitA-HIT Family Nydrolase-ICT3851 CPnOt89570965570096 R CT386 hypothetical protein CPn0490571279573333 F CT387 hypothetical pzotein CPn0191571352577336 R CT389 hypothetical Drotein CPa019Z571652571804 F

CPn0193575004571855 R

CPnOtS1575364575146 R

CPn0495575607576793 F aspC-l~spartate Aminotran:ferase-ICT3901 CPn0196576793 57712 F CT391 hypothetical protein CPn0197571069 5771=0R CT388 hyposhscical protein CPn0198579035 5705 R

CPa0199580359 579=05R

CPn0500580559 581363F pros-Prolyl tRNA Synchetass-ICT393) CPn0501SA=57 563550F hreA-HTH Transcrzpcional Repressor-ICT39t1 CPa0502563550 SA1=01F qrpt-HSP-70 Colactor-ICT395) CPa05035613 55113 F dasK-HSP-70-tCT396) CPa050t56587 56151 F vacD-riboaueluse family-ICT397) CPn0505586519 SA9105F 3-aeehyladsnins DNA qlycosylass CPn0506569172 56940 E CT4=1 hypothetical protein CPn0507589961 590112F CT121.1 hypothetical protein CPa050A59012 590300F CTt=1.= hypoctsetical protein CPn0509590335 590108F IDredietsd Metallosazyme)-ICTt33) cPn0510590113 591973F ClyC_3-C8S Damaias tHamolysin homoloq)_2-ICTt33I

CPn051159111 59118 F rsbV_1-Siqsa Rspulatory Factor_1-ICZIII) Cpn051259=553 59113 F CT115 hypothetical pzoteiss CPn0513591517 593753F Fs-8 oxidorsduetass_1-ICT=6) CPn051d5957=9 596!=0F Ct117 hypothetical protein CPn0515595192 597111F obit-Ubiquiaone Mschyltraa:fsrase-ICT138) CPa0516598111 597255R

CPa0517599531 59795 R

CPa0518600103 59933 A CTt29 hypothetical protein CPa051960167 60090 R dap!-Diaminopioulace tpimerass-ICTt30) ~

CPa0520601=18 601616R elpP-CLP Protuss-lCTt31) CPn0511603797 60331 R qlyA-Ssrine Hydsoxymsehylcraasferass-tCTt37) CPa0522503987 601655F CTt33 hypothetical protein CPa0523604733 505052F

CPa051t605103 606179F

CPn0525505532 607=83F CT398 hypothetical protein CPn05Z6601696 607710R yrbH-GutO/lCpsd Tamily Suqar-P Isomerase-ICT399) Cln0527609l0a 607=6 R sucs_Z-Dihydrolit~oasids Succinyltrsnsferase_2-tCTt00) CPa0538611162 509931R qltT-tilutaaate Sympore-tCT101) CPUQ5I961==59 511165R yeah-ATPass-IGTtOt) CPn0530613=51 61160 R spotJ_1rRNA Hstlsylass_1-ICT103) CPn0531511069 613315R S1v!! dependent msthyltransfsrus-1CT101) CPa0532611674 61075 R ribC/risA-Riboflavin Syutbaas-ICT105) CPn0533611930 61335 F~ CTt05 hypothetical protein CPa053t515113 51578 F dksA-Dnalc Suppressor-lCTt07) CPn0535615793 616395F lspA-Lipoprotaia Sisal Peptidase-tCT108) CPn0535616315 617591F daQA_1-D-Ala/Gly Psrmsase_1-tCTt09) CPa0537617633 611169F CTtll.l hypothetical protein Cln0538618212 51511 F C?d1t hypothetical protein CPn0539616705 611515F pmp_19-polysoorphic outer membrane protein A family -ICT112) CPn0510521590 626862F pmp_20-polymorphic outer membrane protein a Tamily-ICT113) CPa05t1617170 6=003 F Solute binding protein I-ysbL-Synschoeyscis Adheein Haeoloq)-tGTtlS) CPa05t2526003 6=737 F JIaC Transporter ATPass-1CT116) CPa0513531735 619603F IMStal Traosporc Protein)-iCTtl7) CPn051a630529 629525A yhbL-GtP binding protein-tCTtlA) CPa05t5630tea 630533R r117-L=7 ribosomal protein-tCTtl9) CPn0516631=Z9 630911R rlll-LZl Ribosaul Protein-IGTt=0) CPn05t7631661 631188~ yqbs family-ICT131) F

CPn0518533=31 631191R eysJ-Sulfite RsductaseICT435) Cln0519633669 ' 53355R rsl0-SIO Ribosomal Protein-ICTt35) CPn0550635561 633560R lusA-tloaqation Factor G-tCTt371 CPn0551638166 635596R rs7-S7 Ribosomal Protein-tCTt381 Cltt0552635587 535=19R rsll-512 Ribosomal Protein-ICT1391 CPn0553537717 53812 R

CPa0551637651 636111F C?tt0 hypoehstieal protein CPn0555531=9B 50211 F tsp-Tail-SDseific Protusr ICTtl1) CPa05566t091~ 610325A cspA-lSkDa Cysteins-Rich Protein-ICTt1=) CPSf055761161 611191R omcD-60kDa Cysceins-Rieh Outer Membrane Complex Protein-tLTtl3) CPn0558613300 613031A omcA-9kDa-Cyscsine-Rich outer Membrane Complex Lipoprotein-ICTttt) CPn0559613712 53927 F CTt41.1 hypothetical prouin CPn0560515612 611098R qlGX-Clutamyl-cRNA Synchetass-ICTtlS) CPn056i6510 6571 R euo-CHLPS too Protein-ICTtt51 CPn056268036 615918R CHLPS t3 k0a prouin honwloq_1 CPn0563650056 611=97A recJ-ssDNA txonucleaas-tCT117) CPn0561651350 650115R seeDisseF-Protein Export Proteins SeeD/SeeF
Itusionl-ICTItB) CPn0565655530 65533 R CTIt9 hypothetical Drocein CPn056665511 656890F yaeS family-tCTt50) CPn0567655191 657817F cdsa-PhospMCidacs Cytidylytransferaes-lC:t51) CPn0568657817 658161 F cdsA-Priosphacidaee cytidylytransierast-lCTt52) Cln05696516 659099 F plat-Glycerol-3-P Aeylcranslesue-ICT153) CPa0570659107 660789 F arg8-Argsnyl tJtNA Transierase-ICT451) CPn0571662122 660719 R musA-tJDP-N-Aeetylglucosamine Transierase-ICT1551 CPn0572662352 661616 F CT156 hypothetical protein CPn0573665101 661191 R yebG lamily-ICT157) CPn057t665915 665391 R

CPa057566619 665182 R YhhY-Amino Group Acetyl Transisrast-(CT58) CPn0576667513 666191 R pri8-Peptide Chain Release Faecor 2 tnacural tTGA irawe-shift )-(CT155 Cla0576657598 667530 R pri8-Inatural UGA trams-shift 1 CPe105776b7195 561155 F SWIG tYH7t) coaoplex protein-ICT601 CPa0578668106 689365 F yaeI-phosphohydrolase-(CTt61) CPn0579bbl361 669993 F ygbP/yaeH-Sugar Nucleotide Phosphorylase-fCTt6Z) CPn0580669993 670793 F truA-Pseudouridylate Syntbase I-ICTt63) CPn0581b7113t 670715 R Phosphoglycolace Phosphatase-(CTt6t) CPa058Z671503 672177 F CT165 hypothetical Drotsln CPn0583671100 671717 F CT166 hypoehetieal protein CPn0584671707 673798 )' aco8/atr8-Z-Component 8ansor-ICTt67) GPa0585675817 673855 F: similarity co Cps laeA_Z

CPa0586676026 677183 F' atoC/ntrC-Z-Component Regulator-fCTtbB) CPn0587677ta1 671121 F yvyD~s conserved hypothetical protein CPa0588678081 6786=6 F' CTt69 hyposbetieal protein CPn0589671610 679795 F CT470 hypothetical proctin CPn0590680112 679516 F CTt71 hypothetical protein CPa0591680373 681010 F yagE family-tCTt7Z) CPn059Z681153 611161 F yidD family-(CTt73) CPn0593682176 681391 F. CTt7t hypothetical protein CPn059468=583 681958 F pheT-phenylalaayl tRNA Synthetase Beta-(CTt751 CPn0595611958 615926 F CT176 hypothetical protsin CPe10596615939 61bt57 F ada-mecbyltraasierase-(CTt77) CPn0597681215 685179 R oppC~-Oligopeptide Psrmeast_Z-(CTt78) Cla0598619697 611=19 R opp8_Z-Oligopepcide Ptsmease_Z-tCTt79) CPn0599691802 681882 R oppl~5-oligopeptide 8indiag Lipoprotein-,5-(CT110) Cln0100693117 691137 R

CP80i01693053 69=736 R CTt83 hypochetieal protein GPn0i02691105 693101 R CTtet hypotheeical protein CPa0603691305 695115 F hmZ-Fsrroehecalase-(CT415) Cla060t695115 615196 A~ iliY-Glucamiae 8lading Procsin-(CTttb) Cla0605691707 696150 R yhbd-Ilethylase -iCT187) CPa0606617111 691707 R CTtlB hypothetical protsin CPn0607698195 697573 A glpC-Olueose-1-P Adenyltransierase-1CT119) CPn0608691615 699016 R -pyre-tJrid3ne 5'-HOnophosplsate Syntbass It)a~ Sy~sthase)-truaeatad7 CPn0609699705 699916 F CTt90 hypochseical protein GPn06i0T01tZ0 700029 R rho-Traascripcioa Terraisucion Factor-fCTt91) CPa0611702025 701120 R yacE-predicisd phosphatase/kinase-(CTt9Z) CPn0612701631 701022 R polA-DNA Polys~srise I-ICTt93) CPa0613705656 701651 R soh8-Proteasr 1CT194) GPa0611707102 705713 R adt~-ADP/ATP Transloease_Z-FCT195) CPn0615701137 707634 R pgsA_1-Glycerol-3-P Phosphatidyltransisrase_1-fCTt96) Cla0516708791 710137 F dnaD-Replieatlw DNA Heliease-ICTtl7) CPn051771081 732316 F gidA-FADdependtac oxidoseduetase-ICTt98) CPn061B711306 713010 F lplA-Lipoace-Protein Lipase A-ICTtl9) Cln0619713114 713013 R ndk-Nucleoside-Z-P Ftinase-(CT300) CPa0620711139 717519 R ruvA-Holliday Junction Heliease-1CT301) CPa06Z1711617 711111 R ruvC-Grosswer Junction Endonuclease-fCT502) CPn05Z2715752 711793 R CT503 hypocheeieal protein CPn0633716993 7161b3 R CTSOt hypothetical protein CPn0631711015 717011 R gapA-Olyeeraldthyds-3-P DehyroQetfase-)C'i"305) CPn06ZS711115 711060 R r117-L17 Ribosomal Procsin-ICT506) CPn0iZ6711616 718495 R rpoA-RNA Polymerase Alpha-(CT507I

CPa06Z7720018 719610 R rsll-S11 Ribosomal Protein-ICT508>

CPit06Z8720128 720063 R rsl3-513 Ribosomal Protein-ICT509) CPa06Z9721157 720117 R seeY-Translocase-ICT510) ' Cla0630:22316 721815 R r115-L15 Ribosomal Protein-(GT511I

C1n0631722106 722312 R rs5-S5 Ribosasul Protein-ICTS1Z) CPn0632723195 721127 R r111-L18 Ribosomal Protein-fCT513) CPn0633723757 733209 A rib-L6 Ribosomal Protein-lCT511) CPn063172115 7=3717 R rs1-S8 Ribosomal Protein-ICT515) CPn0635721715 721206 R rl5-LS Ribosomal Protein-ICTS16) C?n0536725012 721750 A rlZt-L21 Ribosomal Protein-(Cl'S171 CPn0637725161 ~Z1099 R r111-L11 Ribosomal Protein-(C:5-8) CPn053B~Z57t7 725190 R rsl7-817 Ribosomal Procein(C519) CPn0539725958 725743 R r129-L29 Ribosomal Protein-fCT520) CDn0640725377 725961 R r116-L16 Ribosomal Protein-fCT521) CPn0541727077 725109 R rs3S3 Ribosomal Protein-fCT522) CPn0542727428 727096 R r122-L22 Ribosomal Protein-fCT523>

CPn0643727713 727450 R rsl4-519 Ribosomal Protein-fCT521I

CPn05t4728573 727722 R r12-L2 Ribosomal Protein-(CT525) CPn0545728930 728598 R r123-L23 Ribosomal Protein-fCT526>

CPn0546729621 728950 R r14-LI Ribosomal Protein-fCT527) CPn0647730331 729657 R r13-L3 Ribosomal Protein-fCT328) CPn0518731603 730605 R CT529 hypothetical protein CPn0649732572 731710 R fmc-Nechioryl eRNA Fornylcransferase-fCT530) CPn0650733501 731665 R lpx~1-Aey1-Carrier tIDP-GlcNAe -fCT531) CPn0651733975 733317 R fabt-!lyriseoyl-hcyl Carrier Dehydratase-fCT532) CPn0652731835 733990 R lpxC-Myriscoyl GlcNae Deacttylase-fCTS33) Cla0653736490 731868 R eucE-Apolipoprotein N-l~cetyleransferase-fCT534) CPn065t735957 735503 R vdlD/yciA-aeyl-CoA Thiossterase-ICTS35) CPn0655737847 737101 R dnaQ_2-DNA Pol III Lpsilon Chain_2-fCT536) CPn0656737872 738048 F

CPn0657738473 738051 R yjeE (I~TPase or Kinase)-fCT537) CPn065A739168 738455 R CT538 hypothetical proton CPn0559739533 739838 F trxh-Thioredoxin-ICTS39) CPn0660710327 739860 R spoD_2-rRNa Ntthylass_2-fCT540) CPn0661741100 740327 R mip-FKeP-type pepcidyl-prolyl cis-crane isomerase-;CTStl) CPn0662742923 741172 R asps-l~spartyl tRNA Synthetase-fCT5t2) CPn0563744190 742901 R hiss-Hiscidyl tRNR Synthetase-fCT5t3) CPn0664744757 744557 R

CPa0665745001 716365 F uhpC-Hexosphosphate Transport -fCT541) CPn0666746388 750107 F dnaE-DNA Pol III Jllpha-fCT515) CPn0567751058 750177 R predicted 0lIP (leadar f17)-fCT516) CPn0558751209 752162 F CT547 hypothetical protein CPn0559752179 752775 F CT548 hypothetical protein CPn0670732765 753196 F rsbN-sigma regulatory factor-hiscidine kiaase-fCT519) CPn0571753530 753205 R CT550 hypothetical protein CPn0672753741 755018 F dacF(pbp5)-D-hla-D-Ala Caroxypeptidase-fCT551) CPn0673755287 755163 F CT552 hypothetical protein CPn0574755568 755577 R fmu-RN1~ Hechyltransfezase-fCT553) CPn0675757919 756768 R CT69b hypothetical protein CPn0676759217 758051 R~ homologous to CT695 CPn0677750401 759256 R

CPn0678751320 760582 R

CPn0679762930 761725 R pqk-Phosphoglyesrate Kinase-fCT693) CPn0580764248 762971 R yqo4-Phosphate Permeast-ICT692) CPn0681764929 764258 R CT691 hypothetical protein CPn0582761984 765955 F dppD-A8C ATPaee Dipeptide Transport-fCT690) CPn0583765948 766919 F dppF-A8C ATPase Dipeptide Transport-ICT6891 CPn0684768038 767181 R spoJ/par8-Chromosome Partitioning Protein-fCT588) CPn0585768068 768217 F

CPn0686758361 768176 R

CPn0687758564 769214 F CT482 hypothetical protein CPn0688769382 770137 F CT481 hypoehacieal protein CPn0689771104 770187 R yfh0_1-NilS-related Jlminotransferast_1-ICT687) CPn0590772580 771136 R AeC Transporcsr tiembrane Protein-fCT685) ~

CPn0691773452 772685 R abcX-R8C Transporter llTPase-fCT685) CPn0592774912 773161 R J18C Transporter-fCT6Bt) CPn0593776256 775240 R TPR Repeats to-Linked G1CNJIC Tzansferase similarity!-fCT683) CPn0594779599 776330 R pbp2-P8P2-cransqlycolase/cranspepcidase-fCT582) CPn0695780216 781382 F ompA-Major Outer Nambrane Protein-fCT681) CPn0696781769 782599 F rs2-S2 Ribosomal Protein-ICT5801 CPn0697782602 783447 F csf-Elongation Factor TS-ICT679) CPn0698783458 784201 F pyres-UHP Kinase-fCTB79) CPn0599784182 784721 F rrf-Ribosome Releasing Factor-ICT677) CPn0700785097 785609 F CT676 hypothetical protein CPn0701785599 786672 F karG-Arqinine Kinase-fCT675) CPn0702789685 786929 R yscC/qapD-YOp C/Gen Secretion Protein D-fCT67d) CPn0703791190 789685 R pkn5-S/T Protein Kinase-fCT677) CPn0704792321 791209 R fllN- Flaqellar Motor Snitch Domain/YSeQ
family-fCT672) CPn0705793173 792334 R CT671 hypothetical protein CPn0706793683 793180 R CT670 hypothetical protein CPn0707795029 793704 R yscN-Yop N lFlaqellar-Type ATPase)-fGT569) CPn0708795705 795034 R CT668 hypochecicnl protein CPn0709796188 795742 R CT667 hypothetical protein CPn0710796461 796210 R CT666 hypott:ecical protein CPn0711796771 796186 R CT665 hypochecicai protein CPn0712799315 796781 R FICA domain: homology to adenylace eyelase)-fCT664) CPn0713799721 799332 R CTb63 hypothetical protein CPn0714801107 800091 R haM-Glutamyl tRNA Reductase-1CT662) CPn0715801657 803462 F gyre_2-triJA Gyrase Subunic 8_2-1CT661) CPn0716803469 801902 F gyrA_2-DNA Gyrase Subunit A_2-fGT6601 CPn0717805010 805306 F CT656 hypothetical protein CPn0718805309 805626 F CT657 hypothetical protein CPn0719805916 806890 F sth8-lPseudouridine Synthase)-1CT658) CPn0720807003 807236 F CT659 hypothetical protein CPa0721807683 808489 F kdsA-KDO Synthetase-1CT6551 CPa0722808489 808974 F CT654 hypothetical protein CPn0723808984 809703 F yhbG-AHC Transporter ATPase-(CT653) _CPn0724810527 809706 A

CPn0725810811 810387 R C?652.1 hypothetical procsin CPn0726813372 810880 R CT620 hypothetical protein CPn0727813577 816192 F CT619 hypothetical protein CPn0728818477 816525 R CIiLPN 76k0a HomoloQ_1 (CT6221 CPn0T29819857 818592 A CHLPN 76kOa somolog_2 tCT623) CPn0730821603 818963 R mviN-Integral !lembrane Protein-(CT624) CPn0731821587 821760 F

CPn0732822098 822976 F ato-Endonuclease IV-(CT625) CPn0733823727 823101 R rs4-S4 Ribosomal Protein-fCT626) CPa073d823914 824915 F yceA-ICT627) CPn0735825668 825003 R pyrH/udk-Uridine Kinase fUridine lionophosphokinase) (Pyrimidine Ribonucleoside Kfnasel.

CPn0736827686 825992 R ygeD-Lttlux Protein-(CT641) CPn0737827685 830756 F recC-Exodeoxyriboauclease v, Gamma-fCT640) CPn0738830746 833895 F race-Exodeoxyribonucluse V, Heta-(CT639) CPn0739834871 833861 R CT638 hypothetical protein CPn0740836018 031861 R tyr8-Aromatic 871 Aminotransterase-(CTb37) CPt10741838350 836185 R greA-Transcription Elongation Factor-(CT636) CPn0742838463 838888 F CT635 hypothetical protein CPn0743838962 840762 F aqzA-Vbiquinone Oxidoraduccase. Alpha-(CT631) CPa0714841384 840389 R heutB-POZphobilinogen Synchase-(CT633I

CPn0T45841903 841742 R

CPn0T46841975 843567 F CT632 hypothetical protsin CPn074783675 843740 F~ CT631 hypothetical protein CPn0747843725 843910 F CT671 hypothetical protein (frame-ahitt) CPn0748844987 844121 A ispA-Geraryl Transtransterase-(CT628) CPn0719845629 845006 R glsW-VDP-GlcNAC Pyrophosphorylase-ICT629) CPa0750846411 845707 R tctD/epxR-fiTH Transeriptional Regulatory Protein Receiver Doman-CPn0751846606 848434 F CT651 hypothetical protein CPn0752848601 850082 F reeD_2-fxodeoxyribonuelease V, Alpha_2-(CT6521 CPa0753851006 850161 R

CPn075d851336 851040 R rs20-S20 Aibososul Protein-(CT617) CPn0755851597 852799 F CT616 hypothetical protein CPn0756852961 854676 F rpoD-RNA POlymersss Sigma-66 -(CT615) CPn0757854733 855134 F tolX-Dihydroneopterin Aldoiase-(CT614) CPn0758855110 856459 F tolP/dhpS-Dihydropteroate Synthase-ICT613) CPn0759856488 856997 F tolls-Dihydrotolace Reduecase-(CT6121 -CPn0760856957 857694 F CT611 hypothetical protein CPn0761857704 858375 F CT610 hypothetical protein CPn0762859597 858539 R recA-ReG reeos~bination protein-(CT650) CPn0763860511 859972 R ygtA-FOrmyltetrahydrotolace Cycloligase-fCT649) CPn0764861807 860524 R CT648 hypochscical protein CPn0765862382 861801 R CT647 hypothetical protein CPn0766863782 862394 R CT646 hypothetical protein CPn0767863881 864177 F CT645 hypothetical protein CPn0768864159 865163 F yohI/nir3-predicted oxidoreduccase -(CT644) CPn0769867733 865121 R topA-DNA Topoisomerase I-Fused to SWI Domnin-fCT643) CPn0770868340 869131 F CT642 hypothetical protein CPnOT71870163 869144 R rpoN-RNA Polymerase Sigma-54-(CT609) CPn0772872385 870469 R uvrD-DNA Nelicase-fCT608) CPn0T73872188 873195 F ung-Vracil DNA Glyeosylase-fCT607) CPn0774873195 873425 F CT606.1 hypothetical protein CPn0775871031 873414 R yggV family-ICT606) CPn0776874246 875487 F CT605 hypothetical protein CPn0T77875601 877178 F groEL_2-heat shock protein-60 -fCT604) CPn0778877505 878092 F tsa/ahpC-Thio-specific Anuoxidanc (TSA) Peroxidase-(CT6031 CPn0779878481 878095 R CT602 hypothetical protein CPn07A0179205 871591 R papQ/amie-N-ACetylmuramoyl-L-111a Amidaae-CT601) CPn0781879773 179191 A pal-PeDtidoqlycan-Associated Lipoprotein-ICT6001 CPn0782181065 879773 R tolH-polysaccharide transporter-ICTS991 CPn07AJ881115 881100 R CT59A hypothetical protein CPn07B1812296 881892 R exbD-8iopolymer Transport Protein-ICT5971 CPh0785812991 881296 A exb8/tolQ-polysaccharide transporter-GT5961 CPa0786883185 815293 F dsbD/xprA-Thio:disulfide Interchange Protein-CT595) CPa07A7885619 116401 F yabD/ycl:!-PHP superlamily luruse/pyrimidinasel hydrolase-ICT5911 CPa07A8816542 887432 F sdhC-Succinace Dehydroqenase-fCT593!

CPa0789887139 889316 F sdhA-Succinate Dehydroqenase-ICT592f CPn0790889330 890103 F sdhe-Succinace Dehydrogenase-ICT5911 CPn0791893050 190111 R CT590 hypothetical proceia CPn0792894919 893108 R CT5A9 hypothetical protein CPn0793196123 894919 R rbsU-sigma regulaeory family protein-PP2C
phosphatase IRSbW

ancaqoniscl-ICT5881 CPa0791897171 898001 F

Cla0795891128 899195 F

CPn0796899301 901310 F

CPn0797901600 902694 !

CPa0791902116 903156 F

CPa079990916 903910 R

CPn0800906532 905249 R eno-ISSOlase-ICT587) CPa0801908697 906727 R uvrn-Fxiauclease AeC Subunit H-ICT5861 CPn0102909740 908709 R CrpS-Trypeophanyl CRNA Synthetaae-(CT5151 CPn0A03910303 909752 R CT58d hypothetical protein CPa010d911059 910310 R qp6D-CitLTR Plasmid Paraloq-ICT583) CPn0105911831 911067 R miaD-chromosome partitioning ATPase-CHLTR
plasmi.d protein GPSD-ICT5ti2) CPn0106913771 911867 R thrS-Threo~l tRNA Syachecaae-ICT5811 CPn0A07913971 91879 F CTSAO hypothetical proeein CPn010A916287 914956 R CT579 hypothetical protein CPn0A09917785 916307 R CT578 hypothetical protein CPn0110918111 917825 R GT577 hypothetical protein CPn0111918900 918308 R lesti_1-Low Ca Response Proeein H_1-ICT5761 CPa0812919123 910162 F mucL-DNA tdiamstch Repair-ICT5751 CPa.0A13920870 921934 F pepP-Aminopeptidase P-ICTS7df CPn011d922107 933357 F CT573 hypothetical protein CPn0815923361 9=5622 F gspD/pilQ-Gen. Secretion Protsfn D-ICT5721 CPa0A16925615 927102 F~ gspE-Gen. Secretion Protein E-fCT571) CPa0A17927115 928287 F gspF-Gea. Secretion Protein F-ICT570I

CPa081B928314 92868? F predicted OtiP (leader 1161 peptide)-CT5691 CPn0119928619 929132 F CT56A hypothetical protein CPa0820929120 929659 F CT567 hypothetical protein CPn0821929667 930668 F CT566 hypothetical protein CPn0122930756 931229 F CT565 hypothetical protein CPa0823932367 931501 R yscT/spaR-YopT Tranlocation T-ICT5641 CPa0121932662 932378 R yscS/IliQ-YOpS/IliQ Transloeation Protein-fCT563) CPn0A25933594 932677 R yscR-YOp Transloeation R-ICTSB2) CPn0826934310 933612 R yscL-YOp Ti:anslocation L-ICT5611 CPn0127935264 934434 R CT560 hypothetical protein CPa0828936771 935267 R yacJ-Yop Traaslocation J-ICTS59) CPn08299367da 937298 F

CPn0830937441 937959 F

Cln0831938267 938434 F

CPn0132939747 938827 R lipA-Lipoace Synchetase-ICT55A1 CPn0A33941129 939747 R lpdA-Lipoamide Dehydrogenase-ICT557) CPa0A31941553 942014 F CT556 hypoehecieal protein CPn0835915689 962015 R motl_1-SWI/SNF family helicase_1-ICTS551 CPa0A3696879 95722 R brnQ-Amino Acid (Branched) Transport-iCT55d1 CPn0837917771 917115 R nth-F.nodnueluse III-ICT697) CPa0838949106 97781 A thdF-Thiophene/Puran Oxidation Protein-1CT698) CPn0839949257 950159 F psdD-Phosphatidylserine Oecsrboxylase-ICT699) CPa0Ad0950222 951541 F CT700 hypochetlcal protein CPn08d1951771 95640 F secA_2-Translocase SecA_2-ICT701) CPa01d2954883 954710 R CT702 hypothetical prosain Ilrame-spilt with 0843) CPn0813955191 951991 R CT702 hypothetical protein CPn08dd956730 955270 R yphC-CTPase/CTP-binding protein-ICT703) CPn0A15951079 956150 R pene_1-Poly A Polymerase_1-fC:70d1 CPn0816959371 958112 R clp%-CLP Protease ATPase-ICT7051 CPn0817959995 959387 R clpP-CLP Protease subunit-ICT7061 CPa0811961502 960177 R tig/murI-Triqqar factor-pepcidyl-prolyl isomerase-ICT707) CPn0819961781 965285 F ~tl_2-SWI/SNF family heliease_2-ICT7011 CPnC85996529) 966390 F m:eB-Rod Shape Proceirt-sugar %inase-1GT7091 CPn0A51 966396 96A195 F pckA-Phosphoenolpyruvate Carboxykinese-ICT710) CPn0A5Z 968316 970613 F CT711 hypothetical protein CPn0853 970637 971A03 F CT712 hypothetical protein CPn0A54 972837 971806 R ompB-Outez Membrane Protein B-ICT713) CPn0855 973995 972994 R gpdA-Glycerol-3-P Dehydrogenase-fCT711) CPn0856 975377 973995 R Apx-1 Homolog-VDP-Glucose Pyrophoaphorylase-tCT715) CPn0857 975757 975392 R CT716 hypothetical protein CPn0858 977055 975757 R tliI-Flagellum-apeeitic ATP Synthase-(CT717) CPn0A59 977588 977055 R CT7I8 hypothetical protein CPn0A50 978630 977608 R tliF-Flagellar M-Ring Protein-ICT719) CPn0851 979722 97A925 R nitV-NitV-related protein-IC:"720) CPn0862 980873 979722 R yth0_2-Nits-relaeed protain_2-tCT721) CPn0A63 981514 980831 R pgmA-Phosphoglyeerate Mutase-ICT722) CPn0A5d 981670 982374 F yjbC-predicted pseudouridine synthase-1CT7231 CPn0A55 98241A 982912 F CT724 hypothetical protein CPn0866 9A3491 982916 R birA-Biotin Synthetase-ICT725) CPn0867 983t23 984667 F rodA-Rod Shape Protein-1CT726) CPn0868 986613 981670 P. zntA/cadA-Metal Transport P-type ATPase-ICT727) CPn0869 987401 986658 F. CT728 hypothetical protein CPn0870 988728 987!48 F. serS-Seryl cRNA Synthecase_2-ICT7291 CPn0871 988772 989899 F' ribD-Riboflavin Deaminase-ICT730) CPn0872 989963 991216 F' ribA4ribe-('TP Cyclohydratase i DHHP Synthase -ICT731) CPn0873 991233 991694 F ribF:-Ribicyllumazine Synthase-ICT732) CPn0871 993107 991719 F CT733 hypothetical protein CPn0A75 993372 994022 F CT734 hypothetical protein CPn0876 99!144 995517 F dagA_2-D-Alanine/Glycine Permease_2-ICT735) CPn0877 995533 995982 F ybcL family-ICT7361 CPn0878 996654 995992 F SET Domain protein-ICT737) CPn0A79 997439 996645 R yycJ-metal dependent hydrolase-ICT73A) CPn08B0 999A61 9971!! R ttsK-Cell Division Protein FtsK-fCT739) CPn08A1 1005667 1006209 F
CPn0A82 1006268 1007~04 F
CPn0A83 1008865 1007573 R dmpP/nqr6-Phenolhydrolase/NADH ubiquinone oxidoreduetase-(027!0) CPn0A8t 1009359 1009009 R CT7t1 hypothetical protein CPn0885 1010635 1009433 R ygcA-rRNA Methyltransterse-IGT742) CPn08Bb 1011276 1010908 R hetA-Histone-Like Developmental Protein-fCT7t3Y
CPn08A7 1011692 101!157 F CHLTR possible phosphoprotein-ICT7lt) CPa0A88 1015423 1011119 R- hemG-protoporphyrinogen Oxidase-ICT?15) CPn08B9 1016835 I015t62 R hemN_2-Coproporphyrinogen III Oxidase_2-ICT746) CPn0890 1017805 1016819 R hemE-Uroporphyrinogen Decarboxylase-ICT747) CPn0891 1021073 1017A19 R mtd-Transcription-Repair Coupling-ICT71A) CPn0892 1023661 1021016 R alas-Alanyl CRNA Synchecase-ICT719) CPn0893 1023894 1025A88 F cktH-Transkecolase-IGT750) CPn0894 1026766 10258AA R anus-AMP Nucleosidase-fCT751) CPn0A95 1026988 1027557 F efp_2->=longation Factor P_2-fCT752) CPn0896 1027595 1027822 F CT753 hypothetical protein CPn0A97 1028737 1027853 R (possible phosphohydrolasel-ICT75t) CPn0898 1030~60 1028904 R Mitochondrial HSP60 Chaperonin Homolog-ICT7551 CPn0899 1030875 1032215 F murF-MUramoyl-DAP Lipase-fCT756) CPn0900 1032235 1033281 F mraY-MUramoyl-Pentapeptfde Transterase-ICT757I
CPa0901 1033287 1031537 F murD-Muramoylalanine-Glutamate Lipase-ICT7581 CPa0902 1034513 1035211 ~ F nlpD-Muramidase finvasin repeat family>-It:T759) CPn0903 1035263 1036417 F ttsw-Cell Division Protein Ftsw-fCT760) CPn090d 1035326 I037396 F murG-Pepcidoglycan Transterase-ICT761) CPn0905 1037109 1039835 F murCiddlA-Huramace-Ala Lipase 4 D-AJ.a-D-Alum Ligass-fCT762) CPn0906 1040310 1039915 R CT763 hypothetical protein CPn0907 I0407B0 1010!45 R ~cutA Periplasmic Divalsnt Cation Tolerance Protein CutA IC-Type Cytoehrome Biogenesis Procainf CPn0908 1041589 1040780 R CT761 hypothetical protein CPn0909 10!1537 1041966 F rsbV_2-Sigma Factor Regulator_2-fCT765) CPn0910 1041979 1043004 F 'miaA-tRNA Pyrophosphate Transterase-ICT766) CPn0911 10!1043 1012985 R Fe-S cluster oxidoreduetase_2-ICT767) CPn0912 1014129 10<5760 F GT76B hypothetical protein CPn0913 :045760 1015945 F
CPn0914 1045999 1016397 F
CPn0915 1015461 1016817 F ybeH-iojap supertamily ortholog-tCT769) CPn0916 1016837 1018084 F tabF-Acyl Carrier Protein Synthase-ICT7701 CPn0917 10!8090 1018539 F hydzolaseiahosphacase homolog-tCT771I
CPn0918 1049223 1048579 R ppa-Inorganic Pyrophosphatase-tCT773) CPn0919 10!9378 1050430 F ldh-Leuciae Dehydrogenase-tCT777) CPn0920 1051405 1050431 R eys0-Sul:::e Synchesis/biphosphate phosphatase-ICT774) CPn0921 1051535 1052293 F snGlycezoi-3-P Acylczans:erase-fCT775) CPn092210523141053927F ass-ACylplycerophosphoechanolamine Acycransferass-ICT776) CPn092310539841055093F bioF_1-Oxononanoaca Synthase_1-ICT777) CPn092410572741055028R priA-Primosomal Protein N' -fGT7781 CPn092510579001057226R G?779 hypothetical protein CPn092610580601058557F Thioredoxin Disulfide Isomerase-ICT7801 CPa092710598091058670R CItLPS 43 kDa protein homoloQ_2 CPn092810610081059884R CHLPS 43 kDa protein homoloQ_3 CPn092910622921061186A CHLPS 43 kDa protein homoloy_4 CPn093010628571063330F

CPn093110641381065718F lysS-Lysyl tRNA Synthetase-(CT7811 CPn093210671421065721R cysS-Cysteinyl cRNA Synchetase-ICT7821 CPn093310675351068578F predicted disulfide bond isomerase-ICT783) CPn093410689421068526R rnpA-Ribonuclease P Protein Componeat-fCT78d) CPn093510690911068957R rl3d-L34 Ribosomal Proeein-ICT'785) CPn093610693361069470F r136-L36 Ribosomal Proesin-ICT786) CPn0937.10694961069798F raid-514 Ribosomal Protein-ICT787) CPn093810703221069849R CT788 hypothetical protein -(leader 160) peptide-periplasa~fe~

CPn093910707281071195F CT790 hypothetical protein CPn09d010730121071204R uvrC-Excinueluse ABC. Subunft C-fCT791) CPn09d110755011073018R stutS-DNA Mismatch Repair-ICT792) CPn09d210759851077754F dnaC/prf!!-DNA Primsse-(CT7941 CPn094310779781078238F CT794.1 hypothetical protein CPn094d10785121078997F

CPn094510790701079660F C'f795 hypothetical protein CPn09d610827861079745R QlyQ-Glycyl tRNA Synthetase-ICT796) CPn094710834421084059F pQsA_2-Glycerol-3-P-Phosphacydylcransfarase_2-ICT797) CPn09d810854741084047R Q1QA-Glycogen Synthase-(CT798) CPn09d910859291086483F etc-General Stress Protein-ICT799) CPn095010864881087027F pth-Pepcidyl CRNA ttydrolase-ICT8001 CPn095110871221087157F rs6-S6 Ribosomal Protein-ICT8011 CPn095210874781087723F rsl8-518 Ribosomal Protein-fCT802) CPn095310877421088218F r19-L9 Ribososial Protein-ICT8031 CPn095410882861088708P yehe-Predicted Kinase-ICTBOdI

CPn095510886121089175F Iframs-shift with 0951) CPn095610895601090909F CT805 hypothetical proeein CPn095710937881090963R ide/ptr-Insulinase family/Prouase ZII-fCT806) CPn095810947851093793R pls8-Glycerol-3-P Acylcransferase-ICT8071 CPn095910963431094799R~ cafE-Axial Filament Protein-ICT80B) CPn096010967641097102F CT809 hypothetical protein CPn096110971181097297F r132-L32 Ribosomal Procsin-ICT810) CPn096210973161098I75F plsX-FA/Phospholipid Synthesis Protein-ICT811) CPn096310983981103221F pnq~_21Polymorphie Outer Membrane Protein D
Family-(CT812) CPn096d11047581103301R

CPn096511067361104925R lpxe-Lipid A Disaccharide Synthase-(CT411) CPa096611080371106718R pcnH_2-PolyA Polymerase_2-ICT4101 CPn096711085121109885F mrsA/pgm-PhosphoQlueomutase-ICT815) CPn096811098951111721F QlmS-Glucosamine-Fructose-6-P Aminocransferase-ICT816) CPn096911118121112999F 0969-CyrP_1-Tyrosine Transport_1-ICt817) tyrP_1-Tyrosine Transport 1-ICT8I7) CPn097011134611114648! 0970-CyrP_2-Tyrosine Transport_2-ICt818) tyrP_2-Tyrosine Tzansport_2-Irreie) CPn097111147021115115F yeeA-Transport Permease-(CT819) ' CPn097211162991115430A ltsY-Cell Division Protein TtsY-ICT8201 CPn097311163701117527F sucC-Succinyl-CoA Synchacase. Beta-ICT821) CPa097411175411118432F sucD-Succiuyl-CoA Synthecase. Alpha-ICT822) CPn097511191041119637f CPn097611200821121185F .

CPn097711213711122402F

CPn097811226651123693F

CPn097911239801125413F htrA-DO Serine Protease-ICT8231 CPn098011269821125501A similarity to Saccharomyees sersvisiae hypothetical 52.9KD protein CPa098111270311129952F tint Metalloprotease linsulinase family)-ICT814) cPn0982113119a1129962R yipN family-IGT825) CPn098311320001131206R pssA-Glycerol-Serine Phosphacidyltransferase-ICT8261 CPn098411323791135510F nrdA-Ribonucleoside Reduccase. Large Chain-ICT827) CPa098511355341136571F nrd8-Ribonueleoside Reduetase. Small Chain-ICT828) CPn098611367241:37395F Y00H-Dredieted rRNA tdethylase-ICT8291 CPn098711375161138115F ytQB-like predicted rRNA methylase-ICT830) CPn09881138986113805 R murB-UDP-N-AeecylsnolpyruvoylQlucosamine Rsduccase-CPa098911391951139016R CT832 hypothetical protein CPn099011398831140440F iafC-Initiation Factor ~-(CT8331 CPn09911140421111061?F r135-L35 Ribosomal Protein-IC':8341 CPn099:11406341110996F r120-L20 Ribosomal Protein-ICT8351 CPn099311410141112030F pheS-Phenyialanyl tRNA Synthecaee. Alpha-ICTB361 CPn099d11423981141410F CT837 hypothetical protein CPn099511455121111415R CT838 hypotheticnl protein CPn099611165891145519R CT839 hypothetical protein CPn099711467081147664F mssJ-PP-loop superfamily ATPase-ICT8401 CPn099811478551150584F ftsH-ATP-dependent zinc protease-ICT8411 CPn099911538471150766R pnp-Polyribonueleocide Nucieotidyltrnnsferase-fCT8421 CPn100011531571152891R rsl5-S15 Ribosomal Protein-tCT8431 CPn100111534051153869F yfhC-cytosine deaminase-ICT8441 CPn10021153862115089 F CT845 hypothetical protein CPn100311517961154092R CT846 hypothetical protein CPa100d1155397115879 R CT8d7 hypothetical protein CEn100511559331155115R CT818 hypothetical protein CPa100611564721155990R CT819 hypothetical protein CPa100711566891156907F GT819.1 hypothetical protein CPn100811569281158223! CT850 hypothetical protein CPn100911590581158186R map-Hschionine Aminopeptidase-fCT8511 CPn101011596721159067R CT852 hypothetical protein CPn101111603061159902R CT853 hypothetical protein CPn101211621931160421R yzs8-AHC transporter permease-ICT8541 CPn10131162245. 1163624F fuaaC-Fumarats Hydraease-fCT8551 CPn101411654261163732R yehM-Sulfate Transporter-fCT8561 CPn101511656341166893F CT857 hypoctsecical protein !possible I1i proteia) CPn101611670421168898F CT858 hypothetical protein CPn101711690061169935T lytB-Metalloproteass-ICT8591 CPn101811698981170629F

CPn101911721281170638R CT860 hypocheciesl protein CPn102011736791172150R CT861 hypothetical protein CPnI02111742131173698R lcrH_2-Low Calcium Response_2-ICT8621 CPn102211756'731174216R CT863 hypothetical protein CPn102311760351176331F

CPn102411772361176334R xerD-InteQrase/ree~binase-fCT86d1 CPa102511773021178879F pgi-Glucose-6-P Isomsrase-ICT3781 CPa102611789971.179137F ltuA-ICT3771 CPn102711791751180755F

CPn102B11810161181999F s~dhC-palate Dehyropenase-ICT3761 CPn102911820081182844F

CPa103011838861182843R predicted D-amino acid dehyrogenaae-ICT3751 CPn103111855521184098R areD-Arginine/arnithine Antiporter-ICT374) CPn103211861501185566R CT373 hypothetical proesin CPn103311875001186187R CT372 hypothetical protein CPn103411885171187732R Predicted OItP_1 ICT371I (leader f18) peptide]

CPn103511900001188570R AroE-Shikisnace 5-DehyroQenase-(CT3701 CPn103611911351189984R AroB-Dehyroquinate Synthase-IGT3691 CPn103711921991191123R AroC-Chorissiats Synchase-ICT3681 CPa103811927261192199R aroL-Shikimats Xinase II-fCT3671 CPn10391193999119=665R aroA-Phosphoshikimats Vinyltransfsrase-ICT3661 CPn101011947411194073R

CPn104111959941194726R bioA-Adsnosylmtthionine-8-Amino-7-Oxononanoats Aminotrutsferase CPn1042X1965901195934R bioD-dechiobiotin synehecass CPa10431197717119657?R bioF_l-Oxononanoats Synchass_2 ~

CPn104411986911197699R bioH-Biotin Synthase CPn104511995901198901R conserved hypothetical bacterial membrane protein CPn104612006751199590R TSyptophan Hyroxylase CPn104712005521201343F dap8-DihydrodiDicolinace Reduetasa-ICT364f CPn104B12016061202604F asd-ASpartate DehydroQenase-ICT3631 CPn101912025951203914F lysC-ASpartokinass III-fCT3621 CPn105012039261104798F dapA-Dihydrodipieolinace Synthase-ICT3611 CPn105112049621205270F

CPn105212054171206169F

CPn10531=061531206701F

CPn105d12070341209466F

CPn105512096941210521F

CPn105612105271211228F

CPn105712111971213596F CT156 hypothetical protein CPn105812137481214836F CT355 hypothetical protein CPn105912148481215678F kpsA-Dimethyladenosine Transferase-fCT3541 CPn106011176581215727R dxs/tkt-Transketolase-ICT3311 CPn106112179201217666A CT330 hypothetical protein CPn106212198201218159R xseA-Exodoxyribonucluse VII-ICT3I91 CPn106712199511220712F cpiS-Triosephosphate Isomerase-(CT3281 CPn106112=07191=20895F

CPsa105512210951=20928R

CPa106611311351221!88F

CPn1067122173512=2292F def-Polypepcida Deformylase-ICT353) CPn106B12232581222365R rnh8_2-Ribonucleue HII_2-ICT008) CPn106912235131123941F yfp~-HTH Tranacripcional ReQulacor-fCT0091 CPn10701225511122114 R

CPn107112273241225885R

CPn107212279691228835f CPn107312290111229832F Predicted 0!!P_2 -ICT371) Table 2 (Supplemental Data) Functional Assignrxnts of C. pneumonine Coding Sequences. C. trncltomatis genes arc shown in parrntheses.
Amino Acid Blosynthcsis .Iromatic Familv 1039 (CT366)aroAPhosphoshikimate Vinyltransferase 1036 (CT369)aroBDehyroquinau Synthase 1037 (CT368)aroCChorismate Synthase 1 1035 (CT370)aroEShikimate i-Dehyrogenase ~

0486 (CT382)aroGDeoxyheptonate Aldolue 1038 (CT367)aroLShikimate Kinase II

0740 (CT637)tyrBAromatic AA Aminatransfense AsparrateFomily !lysine) 1 1048 (CT363)asd Asp:~ute Dehydrogenase 1050 (CT361dapADihy<bodipicolinate ) Synthasc 1047 (CT364)dapBDihydrodipicolinate Reductasc 0519 (CT430)dapFDian inopirnelate Epimerue 1049 (CT362)IysCAspa zokinase llI

2~ Serint Family 0433 (CT282)gcsfiGiyc ne Cleavage System H Protein 0521 (CT432)glyASerine tiydroxymethyltransfense Base &
Nuclmtidt Metabolism 0171 guaAGMP Synfhase 25 0172 guaBInosine 5'-Monophosphase Dehydrogenase 0608 Utidine S'-Monophosphate Synthase 0735 Uridine Kinase 0244 (CT128)adk Adenylate Kinase 0894 (CT751atnnAMP Nucieosidase ) 3~ 0568 (CT452)cmk CMP Kituue 0392 (CT039)dcd dCTP Deaminue 0059 (CT292)dut dUTP Nucleotidohydrolase OI20 (CT030)gmk GMP Kinase 0619 (CT500)ndk Nucleoside-2-P Kinase 3 0984 (CT827)nrdARibonucleoside Reductase.
5 Large Chain 0985 (CT828)nrdBRibonucieoside Reductase, Small Chain 0236 (CT183)pytGCTP Synthetase 0698 (CT678)pyresUMP Kittase 0271 (CT188)tdk Thymidylate Kinase 0659 (CT539)mtA Thioredoxin 0314 (CT099)trx8Thiorcdoxin Reductase I (CT844)yfhCCytosine Deaminasc OOI

45 Biotin. Lipoate dr Ubiquinone Biosynthesis of Cotacton 1041 bioAAdenosylmethionine-8-Amino-7-Oxononanoate Aminottatufetasc 1044 bioBBiotin Synthase 1042 bioDDethiobiotin Synthetase 0923 (CT777)bioF_IOxononanoate Synthase-1 1043 (C'C777)bioFOxononanoate Synthase-2 0866 (CT725)birABiotin Synthetase 0748 (CT628)ispAGmnyl Tnnsaatuferasc 0832 (CT558)IipALipoate Synthetase 0265 (CT219)ubiA Benzoau Ocnphenyhransiense 0264 (CT220)ubiD Phenylaerylate Decarboxylue OSIS (CT428)ubiE UbiquinoneMethyltransfense Folic Acid 0759 (CT612)folA DihydrofolateReducuse 0335 (CT078)folD Methylene Tcaahydrofolate Dehydrogenase 0758 (CT613)folP Dihydropteroate Synthuc 0757 (CT614)folX Dihydroneopmrin Aldolue 0763 (CT649)ygfA FortnyltetrahydrofolateCycloligase 1 Porphyrin ~

0714 (CT662)hertWGlutamyl tRNA Reducnse 0744 (CT633)hemB Porphobilinogen Synthue OOS2 (CT299)hemC Porphobilinogen Deaminue 0890 (CT747)hemE Uroporphyrinogen Decarboxylase I 0888 (CT74$)hemG protoporphyrinogen S Oxidise 0138 (CT210)hems.Glutamate-1-Semialdehyde-2.1-Aminomutue 0380 (CT052)hemN_ICoproporphyrinogen Ilt Oxidase_I

0889 (CT746)hemlVCoproporphynnogen 2 111 Oxidise 2 _ 0603 (CT485)hemZ Ferrochentue Riboflavin 0872 (CT731nbA&rib8 ) GTP
Cyclohydranse &
DHBP
Synthase 0532 (CT40S)ribC Riboflavin Synthase 0871 (CT730)ribD Riboflavin Deaminue 0877 (CT7J2)ribE Ribiryllumazine Synthue 25 0320 (CT09))ribF FAD Synthase Cell Envelope Forty Acid &
Phospho(ipid Merabolisrn 0161 (CT206) (predicted uyltnnsferase family) 0922 (CT776)au Acylg(yeerophosphoethanolamine Acyhnnsferase 0414 (CT265)accA AcCoA CarboxylasrrTransferrse Alpha 0183 (CT123)accB Biotin Carboxyl Carrier Protein 0182 (CT124)accC BiotinCarboxylase 0058 (CT29J)accD AeCoA Carboxylaseffranafense Ben 35 0295 (CT2Jb)acpP Acy1 Cartier Protein 0313 (CTI00)acpS Acyl-rartier Protein Synthue 0567 (CT451)cdsA Phosphatidate Cytidylytransferasc 0297 (CT238)fabD Malonyl Acyl Cartier Transeyclase 0916 (CT770)fabF Acyl Carrier Protein Synthasc 0296 (CT237)fabG Oxoacyl (Carrier Protein) Reductue .0298(CT239)fabH Oxoacyl Carrier Protein Synthue III

0406 (CTlOa)fabl Enoyl-Acyl-Cartier Protein Reducnsc 0651 (CT532)fabZ Myristoyl-Aeyl Carrier Dehydranse 0098 (CTOIO)hcB Acyltransferue 45 0271 (CTIJ6) LysophoapholipueEsterue 0615 (CT496)pgsA-1Glycerol-3-P Phosphatidyltratufense-I

0947 (CT797)pgsA Glycerol-J-P Phospharydyltransfensse_2 0958 (CT807)plsB Glycerol-3-P Acylcansferase 0569 (CT453)plsC Glycerol3-P Acylaansferau 50 0962 (CT811plsX FA/Phospholipid Synthesis ) Protein 0839 (CT699)psdD Phosphatidylserirte Deearboxylue 0983 (CT826)pssA Glycerol-Serine Phosphatidyltransfecue 0921 (CT775) sttGlyeerol-J-P Acyltraruferase 0654 (CTS35)yciA Acyl-CoA Thioestcrasc S 0877 (C1'736)ybcL CT1J6 Hypothetical Protein LPS

WO 00127994 PCT/US99/Zb923 0154 (CT208)gseAKDO Tnnsfense 0721 (CT655)kdsAKDO Synthetue 0235 (CT182)kds8Deoxyoctutotrosic Aeid Synthetue 0650 (CT531IpxAAcyl-Carrier UDPGIcIvAc ) O-Acyltnnsfensc 0965 (CT411IpxBLipid A Disucharide ) Synthase 0652 (CT533)IpxCMyristoyl GIcNac Deaeetyiau 0302 (CT243)lpxDUDP Glueosamine N-Acyltransferase Membrant Proteins.
Lipoproteins &
Porins 0310 (CT25160IM60kDa lacer Membrane ) Protein 0556 (CT442)crpAISkDa Cysnine-Rich Protein 0653 (CT534)cutEApolipoprotein N-ACetyltnttsferue 031 (CT252)Igt Prolipoprotein Diacylglyeerol I Tnnsfense 0558 (CT444)omcA9kDa-Cysteine-Rich Lipoprotein 0557 (CT443)omcB60kDa Cysteine-Rich OMP

0695 (CT681ompAMajor Outer Membrane ) Protein 0854 (CT713)ompBOuter Memebnne Protein 0781 (CT600)pat Pepddoglyean-Associated Lipoprotein 0300 (CT241yaeTOmp85 Hotnolog ) Peptidoglye:an 0417 (CT268)amiAN-Acetylmuramoyl Alanine Amidue 0780 (CT601amiBN-Acetylmunmoyl-L-Ala ) Amidue 0672 (CT55duF D-Ala-D-Ala Caroxypeptidase t ) 0968 (CT816)glmSGlueoumine-Fructose-6-P
Aminotnnsfense 0749 (CT629)glmUUDP-GIcNAc Pyrophosphorylue 0900 (CT757)mnY MunmoylPennpeptide Tnnxferue 0571 (CT455)murAUDP-N-Acetylg(ucosamine Tnmferue 0988 (CT831)murBUDPN-At:etylenolpyruvoylglucosamineReductue 0905 (CT762)murCdcddlA
Mutartutc-Ala Liguc &
D-AlaD-Alam Ligue 0901 (CT758)murDMunmoylalanine-Glunmate LiBase 0418 (CT269)murEN-Aeetylmunmoylalanyl8lunmyl DAP Ligue 0899 (CT756)murFMuramoylDAP Ligau 0904 (CT761murGPeptidoglyean Tnnsferue ) 0902 (CT759)nlpDMunmidue (invuin repeat family) 0694 (CT682)pbp2PBP2-Tnnsglycoluelfnnspeptidue 0419 (CT270)pbp3Tntesglycoluelfnnsprptidase 0421 (CT272)yabCPBP2B Family Methyltntufensc Cellular Prueeases Ctil Division 0959 (CT808)catEAxial Filament Protein 0880 (CT739)ftsKCell Division Protein FaK

0903 (CT760)fhW Cell Division Protein FtsW

0972 (CT820)ftsYCell Division Pronin FnY

0617 (CT498)gidAFAD-dependerttOxidorcducttue 0805 (CT582)minDChromosatx Partitioning ATPase 0850 (CT'109)mteBRod Shape: ProteinSugar Kinue 0867 (CT726)rodARad Shape Protein 0684 (CT688)parBChromosome Partitioning Protein Deroztijcatioa 5~ 0057 (CT294)sodMSupe:roxideDismunsefMn) 0778 (CT603)ahpCThio-spucifie Antioxidant (TSA) Peroxidase Signal Transduetioa 0148 (CT145) S!T Protein Kinue 0584 (CT467)uoS Two-Component Sensor 0294 (CT235) cAMP-Dependrnt Protein Kinase Regulatory Subunit 0712 (CT664) (FHA domain) 0478 (CT379)h!!XGTP Binding Protein 0703 (CT673) S!C Protein Kinase 0095 (CT301 S!f Protein Kinau ) 0397 (CT259) PP2C Phosphatax Family 0037 (CT337)puH PTS Phosphoeartier Protein Hpr 0038 (CT336)ptslPTS PEP Phosphotnnsferase 0060 (CT29prsN_1PTS IIA Protein_t f ) 0061 (CT290)ptsNPTS IIA Protein 2 r HTH DYA-Binding Dorttain 0262 (CT218)surfSurf-like Acid Phosphatase 0838 (CT698)thdFThiophenelFuran Oxidation Protein 0693 (CT683) TPR Repeats-CT683 Hypothetical Protein 0321 (CT092)ychFGTP Binding Protein 0544 (CT4 yhbZGTP binding protein t 8) 0844 (CT703)yphCGTPaseiGTP-binding protein Smedard Protein Secretion 01 (CT025)fIh Signal Recognition I Particle GTPax S

03b3 (CT060)tlhAFtagellar Secretion Protein 0858 (CT717)ffiIFlagellum-specific ATP Synthax 0704 (CT672)fl(NFlagellu Motor Switch DomainIYseQ family 0815 (CT572)gspDGen. Secrcdon Protein D

0816 (CT571gspEGen. Secretion Protein ) E

0817 (CT570)gspFGen. Secretion Protein F

0359 (CT064)IepAGTPase 0110 (CT020)lepBSignal Peptidue I

0535 (CT408)IspALipoprotein Signal Peptidax 0260 (CT141xeA_IProtein Translocax ) Subunit-1 0841 (CT701secA_2Transloerue SecA-2 ) 0564 (CT448)secD&secF
Protein Export Proteins SecDiSecF
(fusion) 0075 (CT321secEPrcprorcin Transloeax ) 3v 0629 (CT510)xcY Tnnslocase 0848 (CT707)rig Trigger Factor-Peptidyl-prolyl lsomersse Tronsporr-Related Proteins 0486 Hypothetical Praline Permease 0289 (CT230)aaaTNeutral Amino Acid (Glutamate) Tranaponer 3 0691 (CTb85)abcXABC Transporter 5 ATPax 1031 (CT374)arcDArginine/Omithine Antiporter 0482 (CT381artlArginine Periplasmic ) Binding Protein 0836 (CT554)bmQ Amino Acid (Benched) Transpon 0536 (CT409)dagA_ID-Ala/Gly Permcax I

0876 (CT735)dagAD-AlaninelGlycine 2 Permease 2 0682 (CTb90)dppDABC ATPase Dipeptide Transpon Ob83 (CT689)dppFABC ATPase Dipeptide Transport 0280 (CT689)dppFDipeptide Transporter ATPase 0785 (CT596)exbBMuromolecule Transporter 45 0784 (CT597)exbDBiopolymerTansporiProtein 0404 (CT486)OiY Glutatnine Binding Protein 0192 (CT129)glnPABC Amino Acid Trmsporter Permease 0191 (CT130)ginQABC Amino Acid Transporter ATPase 0528 (CT401)gltTGlutamateSymport 028b (CT194)mgtEMg'+Transportt:r(CHS
Domain) 0413 (CT264)msbATransport ATP Binding Protein 0290 (CT231) Na;-dependentTnnsporier 0195 (CT198)oppA_IOligopeptide Binding Protein_1 0196 (CT198)oppA_2Oligopeptide Binding Protein 2 _ 5 0197 (CT139)oppAOligopeptide Binding 3 Protein 3 0198 (CT175)oppAOligopeptide Binding 4 Protein .t 0599 ICT480oppA Oligopeptide Binding 5 Lipoproretn i ) 0199 (CTI99)opp8-1Oligopeptide Pemtesse-:

0598 (CT479)oppB Oligopeptide Permease_2 0200 (CT200)oppC_1Oligopeptide Permeue_1 0597 (CT478)oppC_2Oligopeptide Pemuase_'_ 0201 (CT201oppD Oligopeptide Tnnspon ) ?~TPase 0202 (CT202)oppF Oligopeptide Transport ATPue 0231 (CT180)tauB ABC Tnnspon ATPue f~itntaFe) 0782 (CT599)tolB Macromolecule Transporter 0969 (CT8I7)tyrP_ITyrosine Tnnsport_1 0970 (CTalB)tyrP_2Tyrosine Transport _ 0665 (CT544)uhpC He~osphosphate Transport 0282 (CT216)xuA Amine Acid Transporter 0207 (CT204)ybhl dicarboxylate Tnnslocator 1 0971 (CT819)yccA Tnnspon Permease S

0248 (CT152)ycCV ABC TnnsporterATPase lOt4 (CT856)ychM Sulfa a Tnnsponer 0736 (CT641ygeD fllu:. Protein ) 0680 (CT692)ygo4 Phosp gate Pennease 0723 (CT653)yhbG ABC Tnnsponer ATPue 0023 (CT348)yjjK ABC Transporter Protein ATPue 0127 (CTD34)ytfF Catioi.ie Amino Acid Transporter 0349 (CT067)ytgA Solute Protein Binding Family 0348 (CT068)ytgB ABC ransporter ATPue 0347 (CTD69)ytgC Integrsl Membrane Protein 0346 (CT070)ytg0 Integral Membrane Protein 1012 (CT854)yze8 AHC Tnnsponer Permease 0868 (CT727)znlA Metal Tnnspon P-type .4TPase 0279 Possible ABC Tnnsportcr Pertneue Protein 0543 (CT417) (Metal Tnnspon Protein) 0692 (CT684) ABC Transponer 0542 (CT416) ABC Transporter ATPase 0690 (CT686) ABC Transporter Membrane Protein 0541 (CT415) solute binding protein 3 7yve-msttetro~

0323 (CT090)IcrD Low Caleium Response D

0324 (CT089)IcrE Low Calcium Response E

D8 (CT576)IcrH_tLow Ca Response c Protein H-1 I

1021 (CT862)IcrH Low Calcium Response _ 0325 (CT088)sycE Seerction Chaperone 0702 (CT674)yscC Yop GGen Secretion Protein D

0828 (CT559)yscJ Yop Tnnslocation J

0826 (CT561yscL Yop Tnnsloeation ) L

0707 (CT669)yscN Yop N (Flagellar-Type ATPase) 45 0825 (CT562)yscR Yop Tnnslocadon R

0824 yscS YopS Tnnslocation (CT563) Protein 0823 yscT YopT Tnnloeation (CT564) T

0322 yscU Yop Translocation (CT091 Protein U
) 5o Central Intermediary Metabolism Glycogen Merobofism 0856 (CT715) UDP-Glueose Pyrophosphorylue 0948 (CT798)glgAGlycogen Synthase , 0475 (CT866)glgBGlucan Benching Enzyme JS 0607(CT489)glgCGlucoseI-P Adenyltransferase 0307 (C7-248)glgPGlycogen Phosphorylase 0388 (CT042)glBXGlycogen Hydrolase (debnnching) 0326 (CT087)malQGlucanoesnsfense 0851 (CT710)pckAPhosphoenolpyrovate Carboxykinase Phosphorous Qc Suljur 0548 (CT435)cystSulfite Reductase S 0920 (CT774)cysQSulfite SynthesivBiphosphace Phosphatau 0025 (CT346)actASulphohydrolue 0918 (CT77?)ppaInorganic Pyrophosphacase DNA Replication. Madlfication. Repair & Recombination 1 O DNA Mismareh Repair 0505 3-Methyladenine DNA Glycosylue 0812 (CTS75)mutt DNA Mismatch Repair 0941 (CT792)mutS DNA Mismatch Repair 0402 (CT107)mutt Adenine Glycosyiase S 0732 (CT625)nfo Endonuclease IV

0837 (CT697)nth Enodnucleue 111 DNA
Modification 0596 (CT477)ada Methylmnsferau Ol (CT024)hemK AJG-specific Methylue ZO 0891 (CT748)mfd Tnnscnprion-Repair Coupling 0620 (CT501ruvA Holliday Junction ) Helicue 0390 (CT040)rov8 Holliday Junction Helicue 0621 (CT502)rovC Crossover Junction Endonucleue 0053 (CT298)sms Strn Protein 2S 0771 (CT607)un8 Uncil DNA Glycosylue 1062 (CT329)xseA Exodoxyribonucleue VII

DNA
Recombination 0762 (CT650)recA RecA Recombination Protein 0738 (CT639)recB Exodeoxyribonuclease V. Beu 3O 0737 (CT640)recC Exodeoxyribonucleue V, Gamma 0123 (CT033)rccD_IExodeoxyrtbonuclease V (Alpha Subunit)_I

0752 (CT6S2)reeD Exodeoxyribonuclease 2 V. Alpha 2 _ 0339 (CT074)recF ABC Superfamdy ATPuc 0340 (CT074) (frame-shift with 0339) 3S 0563 (CT.t47)recJ ssDNA Exonucleue 0299 (CT240)rceR Recombination Protein DNA
Replication 0309 (CT2S0)dnaA_IReplication Initiation Protein_I

0424 (CT275)dnaA Replication Initiation 2 Faetor_2 4O 0616 (CT497)dnaB Replicative DNA
Helicue 0666 (CT545)dmE DNA Pol tI1 Alpha 0942 (CT794)druG DNA Primax 0338 (CT075)dnaN DNA Pol III (Beta) 0410 (CT261dnaQ_1DNA Pol III Epsilon ) Chain_1 4S 0655 (CT536)dnsQ DNA Pol III Epsilon 2 Chain_2 0040 (CT334)dnaX_1DNA Pol III Gamma and Tau_l 0272 (CTI87)dnaX DNA Pol III Gamma 2 and Tau_2 _ 0149 (CT146)dnU DNA Ligue 0274 (CT189)ByrA_IDNA Gyrue Subunit A_I

SO 0716 (CT660)gyrA DNA Gyrase Subunit _ 0275 (CTI90)gyrB_IDNA Gynse Subunit 8_I

0715 (CT661gyrB DNA Gynse Subunit ) 2 B_2 0416 (CT267)himD lntegntion Host Factor Alpha 0612 (CT493)polA DNA Polymerise I

S 0924 (CT778)priA Primosomal Protein S N

0386 (CT044)ssb SS DNA Binding Protein 0835 (CT555) SWUSNF family helieue_t 0849 (~7pg) SWUSNF family helicue _ 0769 (CT643)topADNA Topoisomense t-Fused to SWI
Domain 0024 (CT347)xerCIntegruvrecombinue 1024 (CT864)xerDIntegrudrccombinue Eukaryotic-Typt Chromatin Factors 0886 (CT743)hctAHiswne-Like Developmental Protein 0384 (CT046)hct8Histone-like Protein 0878 (CT737) SET Domain protein 0577 (CT460) SWIB (YM74) Complex Protein UVR
Exinutlease Repair System 0096 (CT33;)uvrAExcinueletux ABC
Subunit A

0801 (CT586)uvr8Exinucleue ABC Subunit B

p9a0 (CT791uvrCExcinucleue ABC.
) Subunit C

I 0772 (CT608)uvrDDNA Hclicue S

Energy Metabolism Aerobic 0855 (CT714)gpdA Glycerol-3-P Dehydrogenase 0743 (CT634)nqrA Ubiquinone Oxidorcductue.
Alpha 0427 (CT278)nqr2 NADH (Ubiquinone) Dehydrogenase 0428 (C'C279)nqr3 NADH (Ubiquinone) Oxidorcductase.
Gamma 0429 (CT280)nqr4 NADH (Ubiquinone) Reductue 0430 (CT281nqr5 NADH (Ubiquinone) Redueuse ) 5 25 0883 (CT740)nqr6 PhenolhydrolasdNADH
(Ubiquinone) Oxidoreductase .1 TP
Biogenesis and mttabolistn 0351 (CT065)adt_IADPIATP Traralaeast_1 0614 (CT495)adt ADP/ATP Tnnslocase_2 0088 (CT308)atpA ATP Synthue Subunit A

0089 (CTJ07)atpB ATP SYn~uc Subunit B

0090 (CT306)atpD ATP Synthue Submit D

0086 (CT3I0)atpE ATP Synthue Subunit E

0091 (CT305)atpl ATP Synthase Subunit 0092 (CT304)atpK ATP Synthue Subunit K

35 0860 (CT119)fliF FIageIlar M-Ring Protein Electron Transport Chain 0102 (CT013)cydA Cytochrorne Oxidue Subunit 0103 (CT014)cydB Cytochrome Oxidise Subunit 0364 (CT059) Fertedoxin L~~ 0084 (CT312) Predicted Ferrcdoxin Glyrnlysis & Gluconeogtnesis 0281 (CT215)dhnA Predicted 1.6-Fructose Biphosphate Aldolase OB00 (CT587)erro Enolue 0624 (CT505)gapA Glyceraldehyde-3-P Dehyrogenue 45 0056 (CT295)mrsA PhosDhornannomutue 0967 (CT8I5)pgm Phosphoglucomutue 0160 (C'T207)plkA_IFructose-6-P Phosphotransfense_I

0208 (CT205)ptkA Fructose-6-P Phosphoasnsferue_2 1025 (CT378)pgi Glucose-6-P Isomerase 0679 (CT693)pgk Phoaphoglyeerate Kituse 0863 (CT722)pgrrtAPhosphoglyetnte Mutant 0097 (CT332)pyk Pyruvate Kinase 1063 (CT328)tpiS Triosephosphate Isomertue Ptntose Phosphate Pothway 55 0239 (CTI86)devB Glucose-bP Dehyrogtnast (DevB family) 1060 (CT331)dxs Tnruketolue 0360 ICT063)gnd 6-Phosphogluconate Dehydrogenase 0185 ICTI21)rpe Ribulose-P Epimense 0141 (CT213)tpiARibose-5-P Isomerase A

0083 (Cf313)tal Transaldolue S 0893 IC1-750)UttBTnnsketolue 0238 (CT185)zwf Glucose-6-P Dehyrogenue Pyruvalt Dehydrogenase 0833 (CT557)IDdALipoamitle Dehydrogenue 0436 ICT285)IpIA_ILipoate Protein Ligue-Like Protein 0618 (CT499)IpIALipoam-Protein ? Ligase A

0033 (CT340)ptJhA&BOxoisovalente Dehydrogenase a/(i Fusion 0304 (CTZ45)pdhAPyruvate Dehydrogenase Alpha 0305 (CT246)pdhBPyruvate Dehydrogenue Beta 0306 (CT247)pdhCDihydrolipoamide Acetyltransferase S Cycle 0495 (CT390)aspCAspartam Aminotnnsferase 1013 (CT855)fumCFumarate Hydratue 1028 (CT376)mdhCMalate Dehyrogenase 0789 (CT592)sdhASuccinam Dehydrogenue 0790 (CT591)sdhBSuccinate Dehydrogrnase 0788 (CT593)sdhCSuccinate Dehydrogenue 0378 (CT054)sueAOaoglutarate Dehydrogenue 0377 (CTO55)sucB_1Dihytleolipoamide Succinylnansfetase_I

0527 (CT400)sucBDihydrolipwmide 2 Succinyltratuferue ?
-2 0973 (CT821sucCSuccinyl-CoA Synthetue.
S ) Ben 0974 (CT822)sucDSuecinyl-CoA Synthetase, Alpha Protein Folding, Assembly & Modification Chaperonu 30 0949 (CT799)ctc General Stress Protein 0534 (CT407)dksA DnaK Suppressor 0032 (CT34IdnaJ Heat Shock Protein ) J

0503 (CT396)dnaK Hsp-70 0134 (CT110)groEL_1Hsp-60_1 3 0777 (CT604)groELHsp-60 2 0898 (CT755)groELHsp-60 3 0135 (CTI groESlOKDa Chaperonin ) 0502 (CT395)grpE HSP-70 Cofutor 0661 (CT541mip FKBP-type Peptidyl-prolyl ) CisTrans lsotnerue Prattasts OI44 (CTI clpB CIp Proteue ATPue 13) 0437 (CTt86)clpC CIpC Proteue 0520 (CT431clpP CLP Protease ) 1 0847 (CT706)clpP CLP Protease Subunit 4S 0846 (CT705)dpX CLP Proteue ATPue 0269 (CC138) Dipeptit3ue 0998 (CT841)fliesATPdepmdent Zinc Proteue 0030 (CT343)gcp_1O-Sialoglytoprotein Endopeptidue_I

0194 (CT197)gcp_2OSialoglycoprotein Endopeptidast_2 S~ 0979 (CT823)htrA DO Senne Proteue 0957 (CT806)ide Insulinue family/Proteaae 0027 (CT344)ion Lon ATP-dependent Protect 1017 (CT859)IytB Metalloproceue 1009 (CT85t)trap MethionmeAminopeptitlase S 0185 (CT045)pepA Leucyl Aminopeptidau S A

OI36 (CT113)DepF Oligopeptidase 0813 (CT574)pepPAminopeptidase P

0613 fCT494)soh8Protease 0555 (CT441tsp Tail-Specific ) Protease 0344 (CT072)yaeL~tetal!optoteue 0981 (CT824) Zinc ~tetalloprotease (insu)intue family) Proteinomtrasts ls 0227 (CT176)dsb8Disulfide bond Oxidorcductue 0786 (CT595)dsbDThio:disulfide Interchan8e Protein 0228 fCT177)dsbGDisulfide Bond Chaperone 0933(CT783) Prcdieted Disulfide Bond lsotnerase 0926 (CT780) Thiorcdoxin Disulfide Isomerase Transcription RNA
Degradation 0999 (CT842)pnp Polyribonucleotide ~ueleotidylmnsfense 0054 (CT297)me Ribonuelease III

0119 (CT029)mhB_1Ribonuelesse HII_1 1068 (CT008)mhB Ribonucleax Hlf 0934 (CT784)mpA Ribonueleue P Protein Component 0504 (CT397)vac8Ribonucleue Family ~ Elongation Qe Termination Faetors 0741 (CT636)greATranscription Elongation Factor 0316 (CT097)nuSAN Utilization Protein A

0076 (CT320)nusGTnrueriptional Antitermination 0845 (CT704)pcnB-1Poly A Polymenx_I

I 0966 (CT410)pcnBPolyA Polymerise ?

S

0610 (CT491rho Transcription Termination Factor ) RNA
Merhyiases 0674 (CTSS3)fmu RNA Methyloansferue 1059 (CT3S4)kgsADimethy(adenosine Tnnsfense 2~ 0187 (CT133) PttdictedMethylue 0530 (CT403)spoU_1rRNA Methylue_1 0660 (CTS40)spoUrRNA Methylue_2 0117 (CT027)trmDtRNA (Gtunine N-I )-Methylttansfense 0885 (CT742)ygcArRNA Methyltransferse 25 0986 (CT829)yggHPredicted rRNA Methylue 0987 (CT830)ytg8Predicted rRNA Methylase RNA
Modification 0649 (CTS30)fmt Methionyl tRNA Formyhnnsferase 0910 (CT766)miaAtRNA Pyrophosphate Tnnsferise 30 07t9 (CT658)sthBPredicted Pxudouridine Synthue 0219 (CT193)tgt Queuine tRNA Ribosyl Tnnsfense OS80 (CT463)truAPseudouridylate Synthue I

0319 (CT094)tru8tRNA Pseudouridine Synthue 0401 (CTt06)yceCPredicted Pseudouridine Synthetue Family 3 0864 (CT723)yjbCPredicted Pseudouridine Synthue RNA
Po(ymerote Qc Trantcriprion Rtgulators OS86 (CT4b8)atoCTwo-Component Regulator 0362 (CTObIrpsDSigma-28/WhiG Family ) OS01 (CT394)hrcAHTH Tnnxriptional Repressor 40 0793 (CTS88)rbsUSigma Regulatory Family Protein-PP2C
Phosphanse (RsbW Anngonist) 062b (CTS07)tpoARNA Polymerise Alpha 0081 (CT31rpoBRNA Polymerise Ben S) 0082 (CT314)rpoCRNA Polymerise Ben' 075b (CT61rpoDRNA Polymenx Sigma-66 S) 45 0771 (CT609)rpoNRNA Polyrrrcnse Sigma-S4 OSI1 (CT424)rsbV_ISigtnaRegulatoryFaetor_I

0909 (CT76S)rsbVSigma Factor Regulator 2 0670 (CTS49)rsbWSigma Regulatory Factor-Histidine tCitux 0750 (CT630)tctDHTH Tranuriptional Regulatory Protein Receiver Doman 1069 (CT009)yfgAHTH Tnnscripoonal Regulator Amino Aeyl tRNA Synthesis 0892 (CT749) alas Alanyl tRNA Synthetue 55 0570 (CT454) argS Arginyl tRNA Tnnsfenx 0662 (CT542) asps Aspartyl tRNA Synthense Translation 0932 (CT782)cysSCysuinyl tfL'IA
Synthetue 0003 (CT003)gatAGlu tRNA Gln Amidotnmfertue (A subunit) 0004 (CT0o4)gatesGlu tRNA GIn Amidotnnsfmse (B Subunit) 0002 (CT002)gatCGlu tRNA Gln Amidotnnsfetase (C subunit) 0560 (CT445)gltXGlutamyl-tRNA Synthetue 0946 (CT796)glyQGlycyl tRNA Synthetax 0663 (CT543)hissHisadyl tRNA Syntherase 0109 (CT019)ileSIsoleucyl-tRNA
Synthetax 0153 (CT209)IeuSLeucyltRNA Synthetue 1 0931 (CT781IysSLysyl tRNA Synthetase ~ ) OI22 (CT032)tttetGMeth'ronyl-tRNA
Synthenx 0993 (CT836)pheSPhenylalanyl tRNA
Synthetase, Alpha 0594 (CT475)pheTPhenyla)anyl tRNA
Synthetax Beta 0500 (CT393)prosProlyl tRNA Synthetax I 0870 (CT729)xrS Seryl cRNA Syntherase 0806 (CT581)thrSThrconyltRNA Synthense 0802 (CT585)apS TryptophanyItRNA
Synthetase 0361 (CT062)tyrSTyrosyl tRNA Synthetase 0094 (CT302)vaiSValyl tRNA Synthetue Pepridc Chain Initiation.
Elongation &
Termination 1067 (CT333)def Polypeptide Dcformylase 0184 (CT122)eCp_IElongation Futor 0895 (CT752)efp Elongation Futor 0550 (CT437)CusAElongation Facror G

25 0073 (CT323)inCAInitiation Factor IF-I

0317 (CT096)inf8Initiation Factor-2 0990 (CT$33)infCInitiation Futon 01 (CT023)plrAPeptide Chain Releasing I3 Futon 1 0576 (CT459)prt8Peptide Chain Release Factor 2 3~ 0950 (CT800)pth Peptidyl tRNA Hydrolax 0318 (CT095)rbfARibosome Binding Futon A

0699 (CT677)rrf Ribosome Releasing Factor 0697 (CT679)tsC Elongation Factor TS

t>074(CT322)tufAElongation Factor Tu 35 Ribosomal Prortins 0078 (CT318)rll LI Ribosomal Protein 0644 (CT525)r12 L2 Ribosomal Protein 0647 (CT528)r13 L3 Ribosomal Protein 0646 (CT527)rl4 L4 Ribosomal Protein 0635 (CT516)r15 LS Ribosomal Prouin 0633 (CT514)rl6 L6 Ribosomal Protein 0080 (CT316)r17 L7/LI2 Ribosomal Prouin 0953 (CT803)rl9 L9 Ribosomal Protein 0079 (CT317)r110L10 Ribosomal Protein 45 0077 (CT319)rl L1 f Ribosomal I Prouin 0247 (CT125)r113Ll3 W'b~onui Prouin 0637 (GT518)r114L14 Ribosomal Prouin 0630 (CT511)r115LIS Ribosomal Prouin 0640 (CT521)r116L16 Ribosomal Prouin 0625 (CT506)r117Ll7 Ribosomal Protein 0632 (CT513)rll8Lt8 Ribosomal Prouin 01 (CT028)r119Ll9 Ribosomal Protein l8 0992 (CT835)r120L20 Ribowmal Protein 0546 (CT420)r121L21 Ribosomal Protein 55 0642 (CT5I3)r122L22 Ribosomal Prouin 0643 (CT526)r123L23 Ribosomal Protein 0636(CT517)r124L24 Ribosomal Prouin 0545(CT419)r127427 ribosomal protein 0327(CT086)r128L28 Ribosomal Prouin 0639(CT520)r129L29 Ribosomal Protein 0112(CT022)r131L31 Rbosomal Protein 0961(CT810)r132L32 Ribosomal Prouin 0250(CT150)r133L33 Ribosomal Prouin 0935(CT785)r134L34 Ribosomal Prouin 0991(CT834)r135L35 Ribosomal Prouin 1 0936(CT786)r136L36 Ribosomal ~ Prouin 0315(CT098)rst SI Ribosomal Protein 0696(CT680)rs2 S2 Ribosomal Prouin 0641(CT522)rs3 S3 Ribosomal Prouin 0733(CT626)rs4 S4 Ribosomal Prouin 15 0631(CT512)rs5 S5 Ribosomal Prouin 0951(CT801rs6 S6 Ribosomal ) Prouin 0551(CT438)rs7 S7 Ribosomal Prouin 0634(CT515)rs8 S8 Ribosomal Protein 0246(CT126)rs9 S9 Ribosomal Prouin 0549(CT436)rs10S10 Ribosomal Prouin 0627(CTSOB)rsl1511 Ribosortul Protein 0552(CT439)rsl2SI2 Ribosomal Prouin 0628(CT509)rs13SI3 Ribosomal Prouin 0937(CT787)rs14514 Ribosomal Prouin 25 1000(CT843)rsl5S15 Riboaomal Protein 0116(C'f026)rs16SI6 Ribosomal Protein 0638(CT519)rsl7517 Ribosomal Protein 0952(CT802)rs18SI8 Ribosomal Protein 0643(CT524)rsl9519 Ribosomal Prouin 0754(CT617)rs20S20 Ribosomal Prouin 0031(CT342)rs21521 Ribosomal Protein 35 Other Catc'orica Ch(cmydiaSpccific Proteins 0561 (CT446)Euo CHLPS Euo Prouin 0804 (CT583)Gp6D CHLTR Plasmid Paralog 0186 (CTt SimiLriey to IncA_t t9) 0291 (CT232)ineB Inelmion Membrane Protein B

0292 (CT233)incC Inclusion Membrane Protein C

1026 (CT377) LtuA Prouin 0333 (CTO80) LtuB Protein 0005 (CT871pmp_IPolymorphic Ouur ) Membrane Protein G Family 45 0013 (CT871pmp_2Polymorphie Ouur ) Membrane Prouin G Family 0014 (CT871pmp Polymorphic Ouur ) ~ Membrane Prouin G Family 0015 (CT871pmp_3PMP 3 (frame-shit!
) with 0014) 0016 (CT874)pmp Polymorphic Ouur 4 Membrane Prouin G Family OOi7 (CT871)pmp_4PMP 4(fttune-shiftwith0016) 0018 (CT874)pmp Polymorphic Outer 5 Membrane Protein G Family 0019 (CT87IPmp_5PMP 5 (frame-shift ) with 0018) 0444 (CT871pmp Polymorphie Ouur ) 6 Membrane Prouin G/I Family 0445 (CT871pmp_7Polymorphic Outer ) Membrane Protein G Family 0446 (CT871pmp Polymorphic Outer ) 8 Membrane Protein G Family 55 0447 (CT871pmp Polymorphic Ouur ) 9 Membrane Prouin G/I Family 0450 (CT871pmp_IPolymorphic Ouur ) O Membrane Protein G Family 0449 (CT871DmP_10PMP_l0 (Frame-shift ) with 0450) 0451 (CT87t ) pmp_I I Polymorphic Outer Membrane Protein G Family 0452 (CT874) Potymorphic Outer Membrane pmp_12 Protein (truncated) AEI Family 0453 (CT871) Polyrnorphie Outer Membrane pmp_I3 Protein G Family 0454 fCT872) Polymorphic Outer Membrane pmp_t4 Protein H Family 0466 (CT869)pmp_I5Polymorphic Outer Membrane Protein Family 0467 (CT869)pmp_16Polymorphic Outer Membrane Protein E Family 0468 fCT869)pmp_17Polymorphic Outer Membnnc Protein E Family 0469 (CT869)ptnp_17PMP_t7 (Fame-shift with 0468) 0470 fCT869)prnp_I7PMP_17 (Fame-shill with 0469) 0471 (CT870)pmp_18Polymorphic Outer Membrane Protein FrF Family 0579 fCT412)prrtp_19Polymorphic Membrane Protein A Family 0540 (CT413)pmp Polytrrorphic Membrane 30 Protein B Family 0967 (CT8t2)pmp_21Polymocphic Membrane Protein D Family 0562 CHLPS 47 kDa Protein Hotnolog_I

1 0927 CHLPS 47 kDa Protein S Homolog_2 0928 CHL?S -43 kDa Protein Homolog 3 0929 CHL.'S _43 kDa Protein Homolog 4 0728 (CT622) CHL.'N 76kDa Homolog_I
(CT622) 07.9 (CT623) CHLPN 76kDa Homolog_3 (CT623) 0137 (CTI09) CHLI'S Hypothetical Protein 0332 (CTO81 CHL"'R T2 Protein ) Mistellonmur Err-rymu~Conservtd Prote irtf 0193 argR Possi de Arginine Repressor 106 Arort atie Amino Aeid Hydroxyiase 25 0232 Similarity ro 5'-Methylthioadentnine Nucleosidase 0128 (CT035) Biotin Protein Ligue 0513 (CT426) Fe-S Oxidoreducuse_I

I (CT767) Fe-S Oxidorcductue 2 0373 (CT057)gepE GcpE Protein 30 0407 (CT103)' HAD Superfamily HydrolauJPhosphatue 0917 (CT771) HydrolasdPhosphatue Homolog 0488 (CT385)ycfF HIT Family Hydrolase 070! (CT675)karG Arginine Kinase 0526 (CT399)kpsF GutQ/KpsF Family Sugar-P
Isomense 35 0919 (CT773)Idh Leucine Dehydrogenase 0022 (CT349)maC Mafprotein 0997 (CT840)mes! PP-loop superfamily ATPase OISI (CT148)mhpA Monooxygrnase 0730 (CT624)mviN Integral Membrane Protein 0861 (CT720) NiN-Related Protein 0479 (CT380)phnP Metal Dependent Hydrolase 0106 (CT015)phoH ATPase 0729 (CT084) Phophotipue D Sttperfamily 0435 (CTI84) Phospholipase D Superfamily 45 0581 (CT464) Phosphoglycolate Phosphanse 0897 (CT754) Predicted Phosphohydrolue 0509 (CT422) Predicud Metalloen:yme 1030 (CT375) Pmdicted D-Amino Acid Dehyrogenase 0531 (CT404) SAM Deprndent Methyltramferue 50 0337 (CT076)smp8 Srnatl Protein B

0394 (CT256)t(yC_ICBS Domain Protein (Hemolysin Homolog)_t 0510 (CT423)ttyC_2CHS Domains (Hemolysin Homolog)_2 0382 (CT048)yabC SAM-Dependent Methyarnaferase 0787 (CT594)yabD PHP Superfamity (Urcase/Pyrimidinuc) Hydrolau 55 0611 (CT492)yacE Predicted PhoaphatuelKinue 0579 (CT462)yachtSugar Nucleotide: Phosphorytue OS78 (CT461)yael Phosphohydrolase _ 0145(CT071yaeM CT071 Hypothetical Ptotem ) 0566(CT450)yaeS YaeS family Hypothetical Protein 0591(CT472)yagE YagE family 0039(CT335)ybaB YbaH family Hypothetical Protein OI01(CTOl2)ybbP YbbP family Hypothetical Protein 0915(CT769)ybeB iojap Superfamily Ortholog 0137(CTf08)ybgl ACR family 0529(CT402)ycaH ATPau 0438(CT287)ycbF PP-loop Superfamily ATPase 1 0734(CT627)yceA YceA Hypothetical Protein ~

0954(CT804)ychH Predicted Kinase 0261(CT217)yda0 PPLoop Superfamily ATPase 0245(CT127)ydh0 Polysaccharide Hydrolue-tnvasin Repeat Family 0573(CT457)yebC YebC Family Hypothetical Protein IS 0689(CT687)yfh0_I Nif$-rclatedAminotransfenae_I

0862(CT721yfh0 2 Nits-related Aminomnsfetau-2 ) 0547(CT43t)ygbB YgbB Family Hypothetical Protein 0237(CT184)yggF YggF Ftunily Hypothetical Protein 0775(CT606)yggV YggV Family Hypothetical Promin 0396(CTZ58)yh10,3 NifS-related AminotnnsCense 0605(CT487)yhhf Predicted Methylase 0575(CT458)yhhY Amino Group Acetyl Tnnsfense 0592(CT473)yidD YidD Family 0982(CT825)yigN YigN Family Hypothetical Protein 25 0657(CT537)yjeE YjeE Hypothetical Protein 0768(CT644)yohl Yoht Predicted Oxidoteductue 0336(CT077)yajL YojL Hypothetical Protein 0217(CT140)ypdP YpdP Hypothetical Protein 0140(CT212)yqdE YqdE Hypothetical Protein 0263(CT221yqfiJ YqfU Hypothetical Protein ) 0139(CT211yqgE YqgE Hypothetical Protein ) 0270(CT137)ywlC SuAS Superfamilyrelated Protein 0879(CT738)yyc! Menl Dependent Hydrolase 35 Homologs to CHLTR Hypothetical Caling Genes 0001(CT001CTOOI Hypothetical Protein ) 0020(CT351CT351 Nypothetieal Protein ) 0021(CT350)CT350 Hypothetical Protein 0026(CT345)CT345 Hypothetical Protein 0035(CT339)CT339 Hypothetical Protein 0036(CT338)CT338 Hypothetical Protein 0055(CT296)CT296 Hypothetical Protein 0062(CT289)CT289 Hypothetical Proxin 0065(CTZ88)CT288 Hypothetical Protein 45 0068(CT360)CT360 Hypothetical Protein 0071(CT325)CT325 Hypothetical Protein 0072(CT324)CT324 Hypothetical Protein 0085(CT31CT711 Hypothetical Protein l ) 0087(CT309)CT309 Hypothetical Protein 0093(CT303)CT303 Hypoehstieal Protein 0100(CT011CT011 Hypothetiesl Protein ) 0104(CT017)CT017 Hypothetical Protein 0105(CT016)CT016 Hypothetical Protein 0107(CT058)CT058 Hypothetical Protein_I

55 otoetcrnlg)crolg similarity 011 (CT021CT021 Hypothetical Protein I ) 0121(CT031CT031 Hypothetical Protein ) 0129(CT036tCT036 Similarity 0145(CTt CT114 Hypothetical 14) Protein Ot50(CTI47)CT147 Hypothetical Protein 0152(CTt49)CT149 Hypothetical Protein 0176(CTI53)CT153 Hypothetical Protein 0188(CT132)CT132 Hypothetical Protein 0189(CT131CTl3l Hypothetical ) Protein 0206(CT203)CT203 Hypothetit:al Protein 0229(CT178)CT178 Hypothetical Protein 0230(CT179)CT179 Hypothetical Protein 0234(CT18ICT181 Hypothetical ) Protein 0249(CTI51CTlS t Hypothetical ) Protein - 0253(CT144)CT144 Hypothetical Protein_1 0254(CT143)CT143 HypoUtetical Protein-1 I S 0255(CT142)CT142 Hypothetical Protein_I

0256(CTtaa)CT144 Hypothetical Protein 2 0257(CT143)CT143 Hypothetical Protein 2 0259(CT142)CT142 Hypothetical Protein 2 0276(CT191CT191 Hypothetiesl ) Protein 0288(CT195)CT195 Hypothetical Protein 0293(CT234)CT234 Hypothetical Protein 0301(CT242)CT368 Hypothetical Protein 0303(CT244)CT244 Hypothetical Protein 0308(CT249)CT249 Similuity 25 0312(CT101)CT101 HypothetiealProtein 0328(CTO85)CT085 Hypothetical Proosin 0330(CT083)CT083 Hypothetical Protein 0331(CT082)CT082 Hypothetical Protein 0374(CT079)CT079 Similarity 0342(CT073)CTOT3 Hypothetical Protein 0343(CT073)(hams-ahiR
with 0342?) 0350(CT066)CT066 Hypothetical Protein 0369(CT058)CTO58 Hypothetical Protein 2 0370(CTO58)CT058 Hypothetical Protein 3 35 0374(CT056)CT056 Hypothetical Protein 0379(00053)CT053 Hypothetical Protein 0381(CT326)CT326 Similarity 0383(CT047)CT047 Hypothetical Protein 0387(CT043)CT043 Hypothetical Protein 0389(CT041CT04 t Hypotitetieal ) Protein 0393(CT038)01'038 Hypothetical Protein 0395(t.'C257)CT257 Hypothetical Protein 0399(CT253)CT253 Hypothetical Protein 0400(CT254)CT254 Hypothetical Protein 45 0401(CT255)CT255 Hypothetical Protein 0405(CT10S)CTI05 Hypothetical Protein 0408(CT102)CT102 Hypothetical Protein 0409(CT260)Cf260 Hypotheatal Protein 0411(CT262)CT262 Hypothetical Prooein 0412(CT263)CT'263 Hypothetical Protein 0415(t:T266)CT266 Hypothetiea!
Protein 0420(CT271CT271 Hypothetical ) Protein 0422(CT273)CT273 Hypothetical Protein 0423(CT274)CT274 Hypothetical Protein 55 0425(CT276)CT276 Hypothetical Proteins 0426(CT277)CT277 Similarity 0434(CT283)CT283 Hypothetical Protein 0441ICT007)CT007 Hypothetical Protein 0442(CT006)CT006 Hypothetical Protein 0443(CT003)CT003 Hypothetical Protein 0474(CT363)CT363 Hypothetical Protein 0476(CT863)CT863 Hypothetical Protein 0480(C7383)CT383 Hypothetical Protein 0485(CT382)CT382.1 Hypothetical Protein 0487(CT384)CT384 Hypothetical Protein 0489(CT386)CT386 Hypothetieat Protein 1 0490(CT387)CT387 Hypothetical ~ Proxin 0491(CT389)CT389 Hypothetical Protein 0496(CT791CT391 Hypothetical ) Protein 0497(CT388)CT388 Hypothetical Protein 0506(CT421CT421 Hypothetical ) Protein 1 0507(CT421CT421.1 Hypothetical S ) Protein 0508(CT421CT421.2 Hypothetical ) Protein Osl2(CT423)CT423 Hypothetical Protein 0314(CT427)CT427 Hypothetical Protein 0518(CT429)CT429 Hypothetical Protein 2~ Os22(CT433)CT433 Hypothetical Protein 0525(CT398)CT398 Hypothetical Protein 0533(CT406)CT406 Hypothetical Protein 0537(CT814)CT814.1 Hypothetical Protein 0538(CT814)CT814 Hypothetical Protein 25 oss4(CT440)CT440 Hypothetical Prouin OSS9(CT441)CT441.1 Hypothetical Protein 0363(G?449)CT449 Hypothetical Protein 0372(CT436)CT436 Hypothetical Protein 0382(CT463)CT463 HypotlKtieal Protein 30 0383(CT466)CT466 Hypothetical Protein 0388(CT469)CT469 ~iypothetieal Protein 0589(CT470)CT470 Hypothetical Protein 0390(CT471)CT471 Hypothetical ProOein 0393(CT474)CT474 Hypothetical Protein 35 0393(CT476)CT476 Hypothetical Protein 0601(CT483)CT483 Hypothetical Protein 0602(CT484)CT484 Hypothetical Protein 0606(CT488)CT488 Hypothetical Protein 0609(CT490)CT490 Hypothetical Protein 4U 0622(CT303)CT303 Hypothetical Protein 0623(CTS04)CT304 Hypothetical Protein 0648(CTS29)CTS29 Hypothetical Protein 0658(CTS38)CT338 Hypothetical Protein 0667(CT346)CT346 Hypothetical Protein 45 0668(CTS47)CT347 Hypothetical Protein 0669(CTS48)CT348 Hypothetical Protein 0671(CTS30)CT350 Hypothetical Protein 0673(CT332)CT332 Hypothetical Protein 0673(CT696)CT696 Hypothedeal Protein 0676(CT695)CT693 Similarity 0681(CT691CT691 Hypothetical ) Prooein 0687(CT482)CT482 Hypothetical Protein 0688(CT481CT481 Hypothetical ) Protein 0700(CT676)CT676 Hypothetical Protein 55 0703(CT671)CT671 Hypothetical Protein 0706(CT670)CT670 Hypothetical Protein 0708(CT668)CT668 Hypothetical Protein 0709 vCT667)CT6b7 Hypothetical Prouin 0710 ~CT666)CTb6b Hypothetical Protein 0711 lCTbbS)CT665 Hypothetical Protein 0713 (CTb63)CT663 Hypothetical Prouin 0717 (CT6Sb)CTbSb Hypothetical Prouin 0718 (CT6S7)CT637 Hypothetical Prouin 0720 (CT659)CT659 Hypothetical Prouin 0722 (CTbS4)CTbS4 Hypothetical Prouin 0725 (CTbS2)CT652.1 Hypothetical Prouin 1 0726 i CT620 Hypothetical ~ CT620)Prouin 0727 (CT619)CT619 Hypothetical Ptouin 0739 fCTb38)CT368 Hypothetical Prouin 0742 (CT63S)CT635 Hypothetical Prouin 0746 (CTb32)CT632 Hypothetical Prouin I 0747 (CTb31CT631 Hypothetical S ) Prouin 0751 (CTbSCT65I Hypotheti:at 1 Protein ) 0755 (CT616)CT616 Hypotheti:al Prouin 0760 (CTbII)CT611 Hypotheti:alProuin 07b1 (CT610)CT610 Hypotheti:al Prouin 0764 (CT648)CT648 Hypotheti:al Prouin 0765 (C1'647)CT647 Hypotheti:al Prouin 076b (CT646)CT64b Hypothetic al Prouin 07b7 (CT64S)CT64S Hypothed al Prouin 0770 (CT642)CT642 Hypotheti ;al Protein 25 0774 (CT606)CT60b.1 Hypothetical Prouin 077b (CT605)CT60S Hypothetical Protein 0779 (CT602)CT602 Hypothetical Protein 0783 (CTS98)CTS98 Hypothetical Protein 0791 (CTS90)CT590 Hypothetical Protein 0792 (CTS89)CT589 Hypothet'rcal Protein 0803 (CTS84)CTS84 Hypothetical Prouin 0807 (CTS80)CTS80 Hypothetical Protein 0808 (CTS79)CT579 Hypothetical Prouin 0809 (CTS78)CTS78 Hypothetical Protein 3 0810 (CTS77)CT577 Hypothetical > Protein 0814 (CT573)CTS73 Hypothetical Protein 0818 (CT569)CTS69 Hypothetical Prouin 0819 (CTS68)CTS68 Hypothetical Prouin 0820 (CTSb7)CTSb7 Hypothetical Protein 0821 (CTS66)CTSbb Hypothetical Protein OB22 (CTSbS)CTS65 Hypothetical Protein 0827 (CTS60)CTS60 Hypothetical Prouin 0834 (CTSSb)CTSSb Hypothetical Prouin 0840 (CT700)CT700 Hypothetical Protein 45 0842 (CT702)CT702 Hypothetical Protein 0843 (CT702)CT702 Hypothetical Prouin 0852 (CT711CT71 ! Hypothetical ) Protein 0851 (CT712)CT712 Hypothetical Prouin 0857 (CT716)CT7Ib Hypothetical Prouin OSS9 (CT718)CT718 Hypothetical Prouin 0865 (CT724)CT724 Hypothetical Prouin 0869 (CT728)CT728 Hypothetical Prouin 0874 (Ct'773)CT733 Hypothetical Protein 0875 (CT734)CT734 Hypothetical Protein 55 0884 (CT741)CT741 HypotheticalProuin 0887 (CT744)CHLTR Possible Phosphoprouin 0896 tCT753)CT751 Hypothetical Prouin 0906 (CT7631CT763 Hypothetical Protein 0908 (CT764)CT764 Hypothetical Protein 0912 (CT768)CT768 Hypothetical Protein 0925 (CT779)CT779 Hypothetical Prouin 0938 (CT788)CT78B Hypothetical Protein 0939 (CT790)CT790 Hypothetical Prouin 0943 (CT794)CT794.1 Hypothetical Prouin 0945 (C'f795)CT795 Hypothetical Prouin 0956 (CT805)CTSOS Hypothetical Prouin 1 0960 (CT809)CT809 Hypothetical Prouin ~

0989 (CT832)CT832 Hypothetical Protein 0994 (CT837)CT837 Hypothetical Prouin 0995 (CT838)CT838 Hypothetical Prouin 0996 (CT839)CT839 Hypothetical Prouin I 1002 (CTB45)CT845 Hypothetical Protein S

1003 (CT846)CT846 Hypothetical Protein 1004 (CT847)CT847 Hypothetical Prouin 1005 (CT848)CT848 Hypothetical Prouin 1006 (CT849)CT849 Hypothetical Prouin 1001 (CT849)CT849.1 Hypothetical Protein 1008 (CT850)CT850 Hypothetical Prouin 1010 (CT852)CT852 Hypothetical Prouin 1011 (CT853)CT853 Hypothetical Prouin 1015 (CT857)CT857 Hypothetical Prouin 25 1016 (CT858)CT858 Hypothetical Prouin IOl9 (CT860)CT860 Hypothetical Prooein 1020 (CT861CT861 Hypothetical Prouin ) 1022 {CT863)CT863 Hypothetical Prouin 1032 (CT373)CT373 Hypothetical Prouin 30 IOl3 (CT372)CT372 Hypothetical Prouin 1034 (ty CT371 Hypothetical Protein f37I
) 1057 (CT356)CT356 Hypothetical Prouin 1058 (CT355)CT355 Hypothetical Prouin 1061 (CT330)CT330 Hypothetical Prouin 35 1077 (CT371CT77I Hypothetical Prouin ) Coding Genes Vot in C. trachomaris 0486 Hypothetical Praline Permeau 0279 Possible ABC Transporter Petmease Prouin 0505 3-Methyladenine DNA Glycosylue 0193 argR Similarity to Arginine Reprcswr 1041 bioA Adenosylmethionine-8-Amitto-7-Oxononanoate Aminouatuferue 1044 bioB Biotin Synthase 1042 bioD Dethiobiotin synthetue 45 0585 Similarity to Cps tneA 2 0562 CHIPS 43 kDa Prouin Homolog_I

0927 CHLPS 43 kDa Prouin Homolog_2 0928 CHLPS 43 kDa Prouin Homolog_3 0929 CHLPS 43 kDa Prouin Hornolog 1045 Conxrved Hypothetical Metttbrana fhouin 0251 Conxrved Hypothetical Prouin 0278 Comerved Ouur Membrane Lipoprotein Protein 0907 CutA-like Periplumic Divalent Cation Tolerance Protein 0171 guaA GMP Synthase 55 0172 guaB lnosine 5'-Motwphosphue Dehydrogenase 0608 Uridine 5'-Monophosphate Synthase 0735 Uridine Kinase pgg0 Similar w Sacchnromyces ctrevisiat 52.9KDa Protein 0232 Similarity to 5'Wtethyhhioadanosine Nucleosidue 1046 Tryptophan Hydroxylase 0477 yqeV Conserved Hypothetical Bs Protein 0048 yqfF-Bs Conserved Hypothetical 1\A Protein 0587 yvyD_Bs Conxrved Hypothetical Protein 0143 yxjG Conxrved Hypothetical Bs_l Protein 0448 yxjG
Bs_2 Conserved Hypothetical Protein 0007 oral o4ss o97s ' 0008 0190 0456 1018 - 0010 0204 0458 t027 0042 ozl4 o46s loss olss o3s3 o74s Ols9 0357 0796 0162 o3s8 0797 ~1 r.
CKYFYLR..~YPPPP~rISIA:U. ~'K:.RVL1ITF:::Frlt:.Lf.l.uwl.F:.TL.:LFCiSMLS

Clalssdrdla t~~~nslt 0tnar Beaodw FCLG tCi.~.Ai.CCViJII9GLL :LLVKREI
.roulos P1YRPEEI Pf~V,,'LAPSEEPAiAAACK':
LACL

PKELOpLLTtDLOEVJ:BSI:R~tbSitYBltliilLNDAw(~IVFDEY'.~.~VV

CPn_OOOt )30 4 AOB~IDWFLINCGRSih!!!'AESLSLDLFNVSKRLCTLPSCDVAC~C'klGStlK!'tllJl~1 ~.'l'001 hyp~~hnr ieal Protein SLHCEIHKYAVAFORNSYAhAEKAFAKALuALEESVYRSL?QSYRDKFLESERAKIPNNG
TSLRRKANUGKIIRGLSSLIVLLCAW~GLICITHNKWILAKI~'OCVS'IPPR~RNLCKQSFf' ' KRWCDEIKI ' CCYIIi iIACVICLLS!'CPFC3KK'aRHSHCD5C3SLuCHSHHSOKtIIWLRDDAK.iGCAEKKILi AT HI

Q CPn OOtO.: 11768 15715 q75 ~Pn nn0? 570 :.rm.. ,.:"
~ r nt,r. y ... :,n.vi.i... ~..,., rr~... ...._ ; y'~.h,;vt~:: ~.~r..
'~rl.r': r:\c:x: :~:.r ~:,r--. :: ~\F~:,I:I:I::.r. -w::;. : w \:':" ;
' w ::
T
::
v ' ' .. .. FLKAWRKCAWtT'l'FEK1CF-iKKNWAVEEANARRLKYVROWYDfiEFQKnY:6RLEKWAL
.
.
n :.. _ .
:
.
,..I:.:I L;.F.c.::.:.:.::i .:,.i :. "~.
:,.

HHWNVEDLREDSVTSDttJREEFLRHVPE.iIJGGLVKVPAVIKYP6YSVSIRDiKIQETRSNLEKAYGIEENYRCCVR
OpEIfYiIKEEEKKEAEFK>T~BCIL

Spear ~cer ~pp~',I,pIFSp(ytya',HILKL.OICGTAEVC41CIL$OIIBSRLEIVfm'V

CPn_0003 889 2370 KDIPCRIEEiEKTIJWAG.PLLPTKKAFEKACSOYNSCADILEKVKPYCXCSIaYYISKE

qatA-Glu tRNA Gin NsidotransEerae RLVSLDEDLRRAYFfl~AFOCDSGLESEVRACRlCLRERIQEFL:pGLDL.VDfSLLCVS
~
IDK
KEOALEOAEf .
SRW~II'Z~DCVSCIfKKGPPGKKFYAOYYDEIYRVRVOSIBtIflIISERLK1~VOAC~01LK
KINYRYSALEWtAViIGSLTA'ICVfafFFNRIEEA30VCAPISLC
G

KRSRGEPLGKi.ACVPUCIKLNItM'CLKTCCASRVL>T1YOPPFDATVVERIKI<F~CIILYKEIRKNKEKRLVGTKI
VA'l'QQRIQCFQPSOIVESSNOIVSLIt KI1~EFAMGSTTLYS11FNF17BiP41DLSRVFOGSSCGSAAAVSARFCPVALCSD1GGSIA.
NRI~ftS

QPMFCCWGFKPSYCAVSRYCLVAFJ1SSLDDICPL.AN1'VmVALJB~VfSGADPKD11TSOKIIItFLF

REFFRDSFNSKLS?EVPIfVIGVPRTFLECLRDDIRCNFFSSL1IFDCl~rTHLVDVCLDIL
C' 0011 15877 Iddl1 ' CPf1 'KEVIBIKILIr .
' OatB-IPet1121 Glu tRM1 Gln AmidotransEerase 3HAVSIYYILASAFJ1ATM.IIRFDCVRYGYRSPQ11fI1'ISOLYDISROQ3f1t8 Subunit) ' LYL
P14YSIlfCAAPAIWVSP'IPPEfTItBYIPKDSKSRJ1LGITLLVL'GIIwV1fiG71IVtBGVIS
NYVLSAERQNVYYKKATAVRAKIVKAFRTAFEKCEILAMPVCSSPAFEIGEILDPIr!ELKELQ41'I
QDIYTVM4dLAYLPAIAVPSCFSKF7CLPLGL0IIG00C~DQQVCQVCYSFGEHAOIKOLFK

QIAITFJID
GLSALIVCCLCISTISL ..NVLfVIGLILLLRXRELTLEpIEA

SKRY)1K~S 09TDtSLEKIFiiSRYSDQCf WR711'OKILDLESSLSSITSEFRDLRQLFDEEKIELLBGI

RLLEFIAANLFKOCRDVYIi4GGHLADIRAYIIOPNtMNIWVIEKAKAWHEFIVLT'LlUIR

CPn_0004 233 f~fP
VacB-IPeelllt Glu CRNII Gln AnliriotransEeras 1B Subllnttl ' G
LICQIOCCSRASINSAVYADWESVIGLEVHVGL.NI'ASIa.!'SSAIlJAFGDLPNlZ4IS~C10011 16596 1831?
CPi!

LPGSLPVGNOSAVEKAVLFGCAVECLISLLSRFZMKS7fFYPDgpRHIOLrppplplIl~i' 11 QatB-IPstllZl Clu tRNA Cln Asidotransferass IB Subunitl RTKAIVQGEERYFELAQTHIEDDiiIGNLKHFGEFACVDYIiRAGVPLILIV5KPQ0CPl~GIRVFFLI(NICYCLW(~
4Y00'~A~:RLLYNSVOKSYADRLFSYflITKMMDTPLIPNBE

VAYATSLVS:.LDYIGISDQ~B1EEGSIRFDVNVSVRPIa:SPELRNKVEIKN6~1SFA1NWAK00CAfJ10tAE'LEG
OKILLDYGKSIFWLNENDEINIl4DPWSWCWIIfKTRICVIpEVDDS

LFUfWRQIDEYLNOPt4KDPIG.VIPMTYRWDPEIGIKIYIJ9tLKESAt~YKYFPEPtE.PTD
13~1f~r r rrreSK~~KL(SDLVDRLEDU1K19fFlWKQI~VCIR

LQLTESYIERIRIiTLPF.LPYDKYfOtYIOEYGLSmIASILISDIOQIATFFEV11CKDGC~1F.
~
VKDLKAKYCGTVDPKQL1TF~11QGtVLT.E71.SLETFLDSIESELVOCLEDQDIYfiIt~DVI~L

RSLSlIWIIIYEFGGRCKTLGV10:.PSSGiFPEGVACLVNAIt7pCVIIGKIAKEIA0U11ESPM'1'O~EEODI~~I
"~'WK~'~~IITIZ?BC
VDY10~KTKAiGFLVCOtIBCtT

GKNPEEILIC>XPELI.PNSDE7GELQKIIAEIfVLANPE.STDNKARLKILIfEDITSVLPEIDEIL'TCISLLEi.P
LL.TTRELLTKSYLICFKICSETLiafl'S

AGIUPPKRVNELLLLLDKG
VF~lII7NOEYEVOLONLCFR4CISQKTGKKQDOFAId.EDOVAL4KKRLKEL'1'~'iFCIQ

GFNFIOC~FItIIAAKDLYIRST

AKI~.C$LpI.DBKFi.LOICEIIGCCEIRQI(ltpQRNADRSRfITI'YQKLIIIAEG.ALEL.1UDCI

Outer Nse6sane~Protein AVDfi'S~P. OFPOEIfTPFVKVQAVttARODSFVBLCAISRDFSDSHL.YM.AIPIaIU.IGDtF

t'IdDYTEIdGItGSIECRPNARHmIHCCSKlRPAOpYYHWJWYS CSNLARQAGIYQA9GFRSLGAAA6 CPn_0006 7299 7111 LFrIIICF!'SfRCBSRSYNVDAGSKIKF

No robust homoloQ Dcesent in Gentbenk/EI~L
as of 11/7/98 CRL0011 11365 21922 KQLQEPLRSALLERLSEWLVLtGITSPETTRSTPEKpMpLPKDSRNItTLESLp11~3-POlymorphic Ouur Nalsbsane Protein IONOSIYPI't0C8SFPKFVFSTFAIFPLSNIATETYLDSSASFDCNKNf~IFBVIIESQEDA~D
CPn ' ' ' _ YDAG
No robust houolop Dresent in Cewbank/E'!'Z8LIYAGAAVHSS
as of 11/7/98 KCDLTP1~ISLLF(71 TTYLfKf~VTLENLPGTCL'AITIfSCFNtII
"

KSFRYNLSLIFSFLWIPLTDSTTSSLSI'SLLOECNPOSt9IKLRILAIVLIALSIILIAGIIGSLSLTIWSVCSSAKT
ICRIIIAVLS
WDK.STTFIC!'SSLSFI1LSPCSSITTCKGAVSCS

GWLLTVAIPGLSSVISSPACNGACALGCV!C.AIGIDVLLXXAEVPIVIrISV'lITPG'IIGSPOfU.FH

PRSGISISCADSTIRSLP1'YLLDmQiPQSNRKtRILAIVLIVFSIILIASC<fVLL'M1IP

GLSSVISSP7~tIGACALCCIIHLrILGItNLi.IfJIRIVPIVLASYIT?PCi'GSPRSGLSISGACPn,-0015 DSl'IRSLPTYPGDEGHPOSNRItLRILAIVLIVFSIILIASGWLLTVAIPGLSSIISSPApmD_3-PNP_7 IEramt-shift with 0011 ' FMG71CALGCI.rLAiGIOVLLKKREVPIWPAPIPEEWIDDIDEESIItLQQEAtAALARLSSKKOGAIplSDALTIT
LEFdQiVSLLPSKNFSTDNGC11ITAKTLSLTCTTMSALFSEIJ1 PEE?tSAFECYIKWFSNLI~cSLPYDCHGLEEtcTKtIQIRWRSSLKANVPEFLDIRRIFGNOGEVSFSDN'I'SSDSG
AiIIITEASVTISNNAKVSPIDNKVIGASa~SITCDNStxIIIGY
' ' EEEFF?LSaRKRLIDIATTLVERKILTEQLEW9iLRJtAESYLYQDSIPIOtIItINFEIWAPRGGA
l KTS'fOTIML1'CNOIG.LFSDBI'l'S1TAGGAIYVKKLCLASGCLTLFSRHSVI~G
' WKtTIILSKSICRPTIIFENHEtKIIAKSLLHIQ4AVLLEKIIIYRSLOKSYRDIGNSSAl00CISAKM'ALRSAACMI
YFYD
IAIEDSCELSLSADSCDLVFLGNTYfSTI'PCTNRSSIDLC'1 ' LHCNPFFSLEDNIaTINKtNAEl4<.ESLSSYRKVFLALSOENVVD'1'PSDPKIbrDISCIpCRfEAADSKMTSKLLQ
PVrLS
PITICSSTZYi'DVLKVNCI'PADSJ1LOY1'rNIIFTCICiSE

wILSEISRDEQNOKKAHLKNOESLYTQARDRL'1'DOSSKENOKELEIUWEYISSNERVKK~aTLSGKNGVTLQ'SOAP
'1~OADSRLCHDN'T1'LCPAt7lSTINNLVINISSIDCAKKAKIE

F=IERVQERIIGIOKLYPNILEREEEi'IGOETVTPfVQCZTASSDLTDILGRIEYSSREDTKJ1TSKNLTLSC?ITLL
DPTGTFYCrtIISLRNPQSYDIt.ELNASGT<?STAVTPDPIIbEK

:JQNQESCVKVLRSIffiVEflSirE\IKQEYGPKKKEPQOOMGSLERFFTEHIEELEVL4KDYSKFHYGYQG'1WGPI
VNCEGASTTATFNN'tKTGIfIPNPERIGSLVPNSLNNAPIDISSLNYtJI

HLSYFKKVtINKKEVQYAKFRLKVLESDLfCILAOTESAESLLTQEELPILATItGALCKAVE1'ANEGLQGDRAfWCA
GLSNFFHKDSTKTRRCFRHLSGGYVIGGNLIIECSDKIISAAFCQ
' ' FKGSLCCALASKJ1KPYFEEDPRFQDSLri'QLIGLTLRL.OEAKASLEEEIKRFSNLENDIAEI
LFGRWIDYFVAKtfQCTVYGn'LrfOHI4CTYISLPCKLRPCSLSYYPTEIPVLFSCM.BY

ERRLLKFSKQ1'FERAGL,GVLRELAVESTYDLRSLTN7ylECCPESEKVYFSNYLNYYNEEKH?ONDLKTKYITYPI' VKCSW~IOSFALEPxRAPICLDESALfEQYNPFIOfLpNYANDE
' RRAKTRLVMCORYRDFKM1ILEAl~FNEE71LLDEEL.~aIQAPSEL
riFKEOCTF~1REFCSSRLVNLdLFICIRFDKE80CQ0ATYNL?LGY7If~.VRSNIOCTlI

RISGCFCTNLARQAL\tRACNHFCFNSNFEAFSOFSFELRCSSRHYNVLKGAKYQF

!:Pn_0009 10780 11685 7 CPn 0016 21383 251B8 / 1~lymocphic Outar Nelntcant Pcottin No robusr homoloq present in Genebank/EHBLmp as oL 11/
S
~

A f VLLLTLGIPCLTdGISFCACLGF _ ':KYSYLLNYPPPPRRSLGVSCSKLRSLSITL.LVLu'sL7IFFLLMSVSADAADLTLCSRDSYNCDI'~a'fTEl1'P
K
' RSDFALKRCCHNRSSFSLLLIs3 ' 1KICTLDRLPKE .
PSEEPALEKAQKEPF
MTSDASG1TYILDGDVSI:'r'AI:KQTGLTT.~.CFSNTACNLTFIGNu'FSWFONLIBSTVA
CGG'JL'JISGi.LFLLVRREVPIVRSEEIPRCVSVI
(.DOLCI'YIQEVFACLERLKDPKYfDRCLLTEAKEKLRVFDWEKOMHSEFLDIQRVWEE

AYWEHCODFLENIAYEIFSSQELitDYYCAGYCCYLPSCt)ARADRLKRa'VKEVI~ItFNRV~NTMOCiTIfp3CFST
'..RNiJVIPRTTr:KKGAIKITOf:LVFE.sICtJt~LJDrAB&OJC

TWkWEASVMLOHSYCVARELFKIUVGVLEESVYKILFKSYRI)J1FYDCEKAKIQRDGRFKr.AINTKTGSL'LC.STR
fYAFG:NB3.i0Qa:AIYASCO.."VI~ENACILSFGNNSATTSOGiII
' ' At7~
~nABDNLVTSNNQNtPFDCCK.aTTtIX:AtUNKMIANPDPILTI:ii.NESWFWM
' 1'ST
~.AIYTKKLVLSS-GROCVLF:NNYAMIATPF.GC:AIALLDSCEICLSADtL:NI
IF~tfl <.'Pn 'P:GPA.art'RNAL~15NAKFWLPJiTR:IIY.'ItFYDPITS:X:ATfRIL.~,LttKAOIIL'w70tfiYE
DOU') i1e89 11117 _ ~:YtVF!X:EKL.vEEELKKFONLK.:TF'h~lv/KLNi7ALVLKDf~J717t:1Nr!'ft~iSKVYIID
tNr mMr.~r honnlorr prrrsa:nt in r:aneb.<nk/P11BL na of I1/7/nR

tri':;AIIAF/JRYRDINfxWEDLKQ'l'IFWVf:EIIOCTDLC111RN::CHWLDRYaDKFILREKEEKrY:ITFF.
n\:.AhT:V'Ml4c.l.AINID:.LDt:TtIYAf IKATAA:.KDVAL:x:1'fNI.VLYIOfY'IYYPJIH

Nf.RHELFIIATtIVRKA::f:IIAYAKAKMFEKER::Nt~RY.VKDVEK4iLSIG:U1EFRNpFSRRtlL:~wJVt'f t.ft:1~:A0rlrM!'~rLtltY1'I:If!'ITNIIYr:YQtiWltaJt.'IA:NIn'!k'KtNtICCIIW

AftERLkt3A,"l'LYPEV.SI/F:ERVLERQRTKKVNLFNL'fAD
L EIfKYfOK.'VREOEH'MIfEVFI7K

I ALYRt?%:LKVI::AEEII:a:LLQftLEGt:
Ll:'ISi::KKLTKAE~.'VFE?IKFM\TEKIltIKVLED~
' './'PNRLEIIa:EDAEFI~ItPRIEEIENTLIbAI/ELt'LL.FItKNI'FEKA.SL.vYN.'.t:KENf.AKVEPO.
;~II
':In ~ml'l ..",~
lu :Y.F~PTYR::::("RNLERLN~~fH.~rPn\YTtr.'QERL(t:F::GLE.~.KVRfCRDIILRFJJNKIIFEVOC:f ~uf~1'IMI' I Ilr.nrur-,:hit m.f.
nul~.l ~HFfNRI:W.WW:Af:Lt'IVARLDLVATVPYREPYt.~IYIItItKREKVR::~rMIAKTERYREIROtW!t'fNt:l K~:n:ItVWVf>r\'rAYTKNATI:IWI'K'1':YY.fiJPFHrJ:I'LVtfc:lYl::FVDVR:, Al:r~r;V11KR1'LI.AP:ITtIf*RF.Iri'WIdJ7pOWLl.RDERKNfwRRI.(t'NKIIAMJJhVKf:FIV::II
IIrR::r::::l::::L"1711.YA':i:INit'Idll:IIJKr:NVk::INII::.~.AtiYAll%%SFFTI~:~FFN

r'At'' LL.tt :Yl7KIt111.VAKtIrfIIV'/IJ:AM::YhI
ll a :1~:Y'CI JvK 11.:'r ~I::Ik:l.f'PVtTt1111FAYr:

.:Irr IrrllINbrl'rKY'h:I::INKr:a.A:HLAtt:IF;Y:AI1NVA:%:NR::4N1rlrrlFlN1.P11IYN1() rlnill I f 1'.'.1 11 f:'S

_ rltrt'Kt?%Tt7t:lt:a'V::l'.IA.t'NI
IY, tr.lr,::r lumn.lm pn'.rrnr \'/1~r:IYt'YN.Y::frY:TIIII::IAYI'I'INfRNIrHITITLM
in r:,-rrlr.rnk/EMDI, .r.: .r I1/7/'IH

fYAIOLGGRfCF PSIIITTVDStTlISDSPILG
:'VEKLLDE:EE,~,rEJps't'ERLLpSIG,FILPt.MDtPFF
fEVELRGSSR ' ' ' . l iCLVLTKKCNADILKVSFIpLNKIOVAJIRILA
.L.iRpALLVILU:NHHAFA~NPE. . PClIMP ILI ESCPYIfEVL.KY411t::::OK
.
v~~.,'W.~.tr.'~

LNPICCCSAOVLBSZiEIITAILT~~~
~~

=Pn_OOIN :751! 7'3003 BNALI
LLKLNPLFKBapIFGCH.~.DF'fEPGII1"G~~
ALTT'71 I>eP_5-POlymnrPhu outer Membrane LLKKELDISALOSSINOKIEATITKSOKEPFLKEOLKTIKKEiCLEJICOAJIIDTfKIBER
Protein ' GA33IVLtIMTTPIStPEDGFICDL'~INtFSPKSTCDJ1AG1'lYS

:.YNMKTSVSMLLALLCS SSAEY71ICANYLDWLTLTPWCIQSKCr107LKlGE
rTCF'!TIVGDLTFL~I~tfLKFLSVDAI:ANIAVAHVpCSKNLSLRKRtNPDYAIIiIIIODEIEK:.C'l'LE~.
IAIGG3I' KG~
IIC
CY
C
"RSTAKYLIIAKF
'l P
' ' . a L: ~EVLYIDP ..SKGL
CAYOLODIMWL?SNASVEOCCIITKCNSCLIO L
iPKSIIYRCKGSLVSL G
FTDFL P
iLVITE CI
:
SIv iVIi'llfOtIYGLDEIKORILE:.I,iVGK

i0 tPVIMIDLYOKIG~
KMVDat S
A

. .
. p . l !'fAN ~
:LTiElAltf~fLKFNENKIIVTSGCALDLGAASTI
7f 0 t' FRF3VC~fADFJIEIKGHRRTYICAMPG
~ ~
' _ . . ...lt:\. F
SA ~.. in!~.,. is..
~~ !
.IKtI.;AIFf:OIIt:RKKt . . .... ..
_ rv:.v ICL'.Y
. .. .. . . ::::T~f. ~r ::. '.:;~...: :~..v. WC
''.\:"'9:.'.:\"::1NE.1.:1~T.,....).KfIYLR
_ ' ':dt'.'t.': s ' F r "' ~

~
, ~

. ..
~ .
I:i.:
UCf'J,~WEKPKSKKLTFKI:::iKNWtY:w:KflF:i.iURFYESTP4GVA:IGLiiWfi ~L "f~TL
~ .".: ."IL\TTU
'J:i.:
::.1:.:N..~..:' :!
i~:..'~ 7~Y
:!tl ' TAKYIfCLAASOON
YIESVOVSSLKTDlOILTCOAGEVNKESSCIIWIIYLNSALHRYAPGYTFFPI~pNHINIPE
r:GDLVF~QVTN:APNATCKM1VIHLE.o AASA
LSCSIVFSGERLSI'AFJIIAt7~ILTSRINOPV'I'LVEGSLVL.KOGV'fLSTOG' RINEVS

L SLL.iLLLETPWFNLQ~ETTLTGRVLGVOGIR6KLIAA
J1NOK GJ1TPKDCPSACITfM

fa"OEPESTLL:.Otar~TSL
Ii~tILIFPF~ImRDYEELPA'ILIrCGLItINFYSHY00VLKV11FPKLK

CPr>_0019 19007 30756 CPn pmp_5-PMP_5 Itrtlmesht,Et with 0019)_ O NO robust hanoloA Present in Genebank/EI~OL
' as of 11/7/91 CitaLVNJIOGAtYit IO'IFLOFFHPIVFSOQSLSFLPYIGKSBGI
ASTEDIVITNLSINADTIYGKNPINIV11SA7lNItNITLIIEKCSNIVEHYLHICCDTSVITTGVSCATFL
CHWNIIpVIPCICfQPSQAl6.it'RI
Y
Y

G
8VON71LPISKSEXITKIISYILILPLIIaLfIKIVLRIILFFKYAGLILDVKKEDi.IaTL
O
DYSFVlCLSPGRGCt IITOtUISOKPL6VAPSAPH
RTCYLPNPEROGSLVPNSLHGSFVDORAIOEINVNSSOILCOEAOVWGi1GI11NPI3IRDICI

NEItCYRHSCVCYLVGVGTHAFSOATIN11AFC0LFSRDXO'YWSK1~CCSYSGVVFLEDTLTPOOENLSLPLPSPTTL
KIIINrILIfILVRSGIOfYNELIOECFSFTKITIMr00AP811~DIC

EFRSPQCfYTDSSSE7ICCNpW"I>xIDLSYSHtIldJDtlXTKYTTYPFJ10G&fANDVFGLEFFSYNSLLPNIYFHS
LVSVPNISCEERAIlIYItKEQQEENAVKLKTIpACSFVfASLIIi.PSh GAT'fYYYPNSTFLFDYYSPFLRI4CTYAHOEDFKiI'OOEVRtIFI'SG18.FFE~VPTCVKFEOTKDKKAGFGLL.T
FFPWKIYPL

RFSOCKRGSYEL: JIYVPOVIRKDPKBTATLASGJI'1WSTFI~R~SROCLOLAL~ICLIN
CPlf~0029 13839 13390 PGIE91FSF1GAIL3.ACSSPNYHINLCCKYRF No robust homoloQ present in Genebank/k9'18L
as of 11/7/9L

SNID~tpINHdTYCFNLFRYIRFFR71INIAM4DGLRFCYSYILLRPIdWSSLT.R1~00ELL

CPn_0020 32717 30603 KK10IKLRTZ'STIISSLISLR00LGKAE71TOSDILYGTSRFQYGNSFEIEDPRIPPlNilitQ
Predicted OMP !leader 111) peptide:
outer membrane!

KLWSFIPNt.RIIOCRCFLFLABFVI~fGSSADALTH0EAV10IKNSYLSHFKSVSGIVTIEDCVLOEIIIiSRSVNFL
KIKFYVYLNSERNKTKP

uIIHIaILATCAenIVnrEarlvccss.IavAtK~avMVrnR~LKTLVCDYLEYYEVrI)scLLTNc RFANYPWFLOGSNITLTPETIVIRKClISTSEGPKIfI>LCLSODYLEYSSSLLSIGKTfLCPtL0030 13A10 RVCRTPILFLPPFSiMPMEIPKpPTNFRGGfGGFLGSYLCIISYSPISA1WFSSfFlLDSF0cp-OSialoglycoprocein F.~Opepcidase FKtICVCMGFNLHC~KQVPQ11>fIBOCSYYAt~LJIItkIJIEAHbRYRLNGDFCFTItKHVNfSLKNCTtTIfSLFF
YIKNRANYFYKYVIIDISGYYPFL71CV1R~pQVLFJMSLPIR7PDI.GIVLE

GEYItLSDSWCfVHDIFPNNFML10'tl'CPTRVDCl51tR7lIYFECYLTSSVKVNSFONiWOa.PFLFKSIDd.6F0 0VAVALGPGNFSATAIGISFA0GL71NJIKNVPIdGYSSLITYLLSKDODt YLTLRpYPISIYl~Ti'f.VYLFNIVECG)fLNPIIFSDHIVGFNFSSLRLiWtPKLHKTVPLPIGALIILPLCKPGGY
LTLSSEIPEECLNEKRRGSIGf~LLSYLE71SDYCVAI~Y~NI8PNP0 '."LSS'17.GSSLIYYSDVPEISSRN50LSAKLOLDYRFLLHKSYIORAIIITEPFVTFITLTRLFASSFSDKTTVEE
VAPSVEpIARNVTSOFlISVEIIDItOLSPDYRSYSCIf PLIIQIEOHYIFSIODAFNSLNLLKAGTCfSVLSXIt4PAFPRIN)11Q.F11ZNIL9~TfESKPT

FPKTACELSLPFGKKtIfVSLD~tIWIQOiCWDlB4~tTRNEifIIA'IDNAK1LESTJDtSKYBLCPeL0031 tKCDRENFILINSRPIDQLLDSPLSDNRNLI rs21-521 Ribosomal Protein DTPN

YLEYpNILGT)CTFF2hIQLYGVYERRFaDSAFFFETJC.DKPIDIPPFCMPSVKVRVGEPVDRALAIL10CPCIDKEG
ILKAAKSHRtYOKPSVKKIWf8101ilAKYRBA

0021 34470 32707 CPtL0032 14A81 16098 ~Pn _ dnW-Meat Shook Protein J
Predicted OMP (leader 119) pepeidel CSRSPYPNIETL71AGVF3IRSMGLFNLTLFGLLLCSLPISLV1LKFPESOCHKILYISTOSTSLIGQIVItFVCSVSQ
IDYYSIIGISKTAS71EEIKItAYR7G.71VKYMPDKNPGDR1171iKA1'KE

QOALATYLPaLDJIYGDHDFFVi.AICIC>~YZJCOSZNS80PQ'fRKSTIIG7lGiJ4GSSFaiaVVSEJ1YLYLSDP
OKFDSYDRPGImCPF
' LSQ11!!!1'ADPLOpLLVIS7IVSGltGGKTSDDLLFKALJ1SPYPVTRLE7UlYRLillRi0if1'INIr FFDGLfOCLGE71F'0llRSDPAGARQGABKKVHTNLTFEEAAlIDVAIiLW80YKBCIft~

DHLNSFINKLPEEIf7CLSAAIFi.Rf.E2EESDJ1YIRDLL71J1I0I871IRSATALOIGIYODKROG71VNPGCIK
SCERCKCSGpNVOSRGFFSN7LS11CPECGG8GAII1'DPC8~11000A~II

FLPTLRNLLTSASPQOpEAILYAIGKL1~GQSYYNIKXQLQKPDVDIn'LAAAOALIAIGKRSVNVMIPAGVD
YVFIDVESNlVFp1RG00f.ILdPI

EEDiILPVIK100ALEERPRALYALRfrLPSEIGIPIALPIPLKT1Q'ISFaICLNNALhLL>tLGCCFVDA7Ii~lOO
CEIP1T.LKTDGSCRLTVPEOIOSGTTLKVANOGFPMIIiIOKl311fiDti.VItIS

17T1>XLLEYITERLVOPNYNETLALSFSI~RTLONWKRVNIIVPQDPOEAERLLBTlRGLEVCfP(~1LS880XELLR
TF11STLKAGiFPIO(RSPLDKIKCFFSDF1Y

EpILTFLFRLPKE11YLPCIYKLTr~QKTOLATT11ISFL8lffSHQIALDIi.FQJWIID.PGEP' IIRAyJIDUITyMItICppEtO(RSLHDyA>LIO~fLLF~T~IORpIip~fpyLAYQVTPESCPn_0033 16129 RTRlG.DILETL71TSK8SEDIRLLIQIXf80DAl01FPVLAGLLIKIVEpdhAiB/odbJliodb8-lpyruvacef Oxoisovalerace Dehydsopeaase Alpha i Beta Fusion CPn ERSIk:VtroFIOVISSIRDVLKLVwELR!'AEtIK,C.LLSAOSGSOGTIbLSCIlOIt>Q.i~fL)1G

_ KSLIPGKDiiSFPYYRDOCFPIGIGCDLSEIFASFWAZTPNIISSAAIIIPYIIYrBIC
,Nyt TILQVIStICCiNSNTRSFYSMSLPLViGSSSPRRKTILEKFRVPFTVIPBNPDESKVSYSC085V9CI10!'LOAAGR
7WIAYKfISSADEYVYVBGCOGJITSpGEFNmIIJO!VJItiKaLtLITII

GDPIAYTQEIrI~OKAYAVSELHSPCDCTILTGD1'IVSYDGAItTKPODKADAIQtC.KTLRI~WAISVPfEDOCGiI
DIJISLGRCIIOGLAVYEVDOGTIYTSLTETFSHJIVDOwIbNSVP

NQ1'HITItIiSIAVLNKCKLLiGSET50ISLTNIPDIIAIESYID1YCT'IMiCGAYDUCHGGLALILTDVVRLSSNS
NSDNQEKYRSt_~r rr e(~DPLILLEKE7IIMIfCiSPFIIEEIK11 ZLIDfNtGCVYNVpCLPIOTLKYLLEELHIDLWDYSItIIOEEVRXSCETAEALPFPSKGSTSHEVFSPYTGTLIDYEN
&ESCPKVl4tMI

SE11LVEEMTR054yIVfCEDVACDIIGf.IF4YtRNLTEKFGPpACFNSPLJ1GTITGTAIC

CPrL0023 36657 35011 MALDGTHKP1NEIOFADYIWPGINDLFSEASSIYYRSACiWEVPLVTA71P~T000PY

Y5)K/alr-A8C Transporter Protein HSOSIEGFLAHCPCIKVAYPSNA11D7dULLWl7IIiIDPNPWFLENKJ1T.Y0A1(IFS7IGlVF
ATPase ENRAKLLYSKOHFVt4.S7WSIVLDKIGKBLGTRILFDDVSWFNPGNCYGLTGPNGACICSSNOYVLPFGKAAIVtiPG
KDLTTVSGKRfPLVLSLEYApELASACISTMDLRilNFCDFA

TLLKIINQIIEPTACSISLPKKVGILRONIDBFNDf'M.DCVIFI~TfRIF1t71LQRRONLYTVLKSLEKTGRLLVIH
EASEFCGFGSELVATMSEpCYAYLDAPIRRLOCLNJIPVp)ISKVL

C,QEPTDAIGMELCEIEEIICEFaICYRA0S8AEELLTGIGIPNENFDK101A1fIPIDI4FRVtNEIILPHKESILOA
AKSLAEF

LLCOALFGNPEALLLDEPTrMLDLYSINWfGNFLImYEGTVIWSHDRHFWTITTHIAD

IDYDTIIIYPGNYDOMhI4KTASRDQEK71DIKSKEKKISOLKEtIIAKfC7lGSRASOVOSRCPn_0034 19196 LREIKKL0P0ELKKSNIORPYIRFPLSDKBSGKWL.SLEAITKOYGOHQVIHPFSLETYOCT315 hypothetical Protein CDKLGI1CNNGLGKTTIaOCLLACLVFAPSBGSIKLGHOATCSYFPONFISDVLADCGOE'fLFVNFLLP1'fCRGI4M
EISTPSLPOSSIVSOKTPPVPDPDSSPOHIPTIP3'pAPFKKP~Dt EWLRNRKTGINDOEIASVLGKNLFGGDDAfKOIpALSCCETAALLMAGMILENtOJVLILDETPSSIVNJIIAFAIG7t FLSCIL~GVP'AICLGCSLEITMPLFILTAVFIAFTLLYFINYLEK

EAIdIHLDLESVSALSWATNDYKGTAIFVSHDRCLIODCATKLLIFDKDItITFPOG?MYDYPKIPCPLPTPPPSPfLM
PTLTPIPAPAPGIPLPP'fLPINDRTKLTCNPDIIIYPB'f)IDP

TAGNKQLL
KACFSLLKQLFSLDPETRPEDRKYSNKLASTLLRSKEKSGFRFHCFKCIIPBNOKILNKKS

G11WISSHSSMDFSTTIrGMPAVITCi,QRSCwEKIKNNIPTPEIWLPIG~fSCPNDVEE

CPn_0024 37605 3b661 GAQLYTSHLIVINPPTLETLIKEKMRRAITLIIDFSNKEAFTNLVWYIACFDTCIGI~iLE

xerGIncegrase/recombinase SVOLEVFGLNNLSADOEEFTTWESCCHLAtd.ESVRILLASKEIYALSNVSVN8I8pVPLQ

REVMIAStYSFLDYLKMNtSASPNTLRNYCLDWGLKIFLEERCNLAPSSPLOLATEItRKTACMLFLN

vSELPF3LtTKEHVRHYIAKLtENGKAKRTIKRCLSSIKSFAHYCVIOKILLFIJPAETIH

CPRLPKELPSPMT'fAOVEVLMATPDtSKYHCLR0RCt14ELFYSSGLRISEIVAVNKODfD~'.Pn_0035 51115 LSTHLIRIAGKGKKEAIIPVTSNAtOWIQIYLNHPDRKRLEKOPOAtFt.NRfGRRISTRSCT33:( nyporhecic~l protein IDRSFOEILRASCfsCHITPHTIRHTIATHWLESCMDLKTIOALLGHSSLETTIIIY'fOVSARTTLEF~AGSSLKPLP
IITFPCATALYITHRAERKSEHOMWNRCQVFSSFFFRYPISSfiL.

VKLKKtytH0E7WPHA
IRLRASCECFOORHPIFLCGLYWLAGITSItCHPECSALILIPIGNPLPRNPKONLPLASA

WtISIlfLTPAPFLHDCPISGTFVTHHAOCpCCYYGEA(CTOTPCGKRAHNLSCpILSESR

CPn_0025 38610 37681 LELKKV'IELECTLNHTCOIVFKSNACYKEIPRSRFYIMKEKCRCiSCHFL1RIRPPSSEVC

slat:latsA-9ulptushydrolase/GlyeosulfacasePFAa'SLLLCTPLPONLRDLfROKGi.SHLFAISGWHFBLCATTtiML
CALLPLKIKKILSF

ILMSSRELLIIl.CSSOpPTRTRNpCAYt.FRWNGHGLLFOPGF.CTQROFIFANIAP1"IVNRIVLTSI.ACiFPMSL
:.1IWRSWISVTLLCFSWCF"..CSC~ItIRIGAOFtLCGIFF8PF8PTF

IFVSHFHr,DHCLGLCBMLHRWLDKVSHPIHCYYPASGK1(YFDRLIIYCtIYNETIOWENVLSFLATL(:ILLFFPKI
FSFLYTPW~pFLSPtyR.YPIR'ILANTLAISL3AOLFIVLPT110 PI3EECIVEDFC3FRIEAORLQFIpV01'LL19RLTEPDTIKFLPKELESRGTRCLIIODLIRYffLSLPLE7DLL'lI
iLIVPFTILPIIVFLIATITt.PCCCFTTFJ1LI0CFGSNPIi<JIFIPNILK

DpEISIrI:~TNYGi0V5YVRKGD:aIAtIADTLPCOMIDLAKN.iCMMLCE.3T'lLEOHRNL'rL$FAFVPPWl1LT
41::LILFFTGILRTINSPYIISISAT.iIRPTlTL

AE~IIFHMTAKQAATLJ1KRM't'QKLILTI1F.~'.ARY
WLDDFYKEA;AVFPMMVApEYRSYP

FfIfNPLCtIK nPn ou f4 SOr.i') 5179':

IT'I!H hylx~rh~r i.'.O prr)tsrn I:PIr o02r, sr.:17 )s7o2 AK::I~IG::GftKKMYKI'ONIR:'rfD'/RSFFFFDVLCIEOLFY.El1~~YIF.W:.AKtFRLPpIX(EL

rT(4'. irlpxll.:r.iral prot~tn Mr:L:rKRGRLtIFr:IDIWv:::VG:IEtIKE.:F:ICRFFGLLETIEVYI'IRLEKEP'fQLKIIFYVF

.:NF.1M::IILIt::List)::VT::YfIIKI'r~PIKt)AAFt:K::IIeI~Ir:NtAYLtII(.'yl.V'Pf/LW:
AMLItOraaia~".iRI'fIJ.~I'tWIIIRLPNIf7DRilYEY.FF.~.iNtY:Fr7KWEDf7:IF1'NP.::I
VfIr.~K

'MfII'::Wa19r:1.::::LALLVI.I.:':IF'NfCLINWft:M:K'rKKIAFKIM:C~:UITYaA::RKC~~.~nl 'LV'/MNKN~At:I:Nt.'Y:;11:IF1PYr:IERPFA'l0.~.FFFDPRtRRGLI::Iff/t.LNBE::LE

I! L::1'11111A Ih:l'KlIF I It'fULRK<SVNyN'rNKFK:xI IIt::I.FT I L:L1J
II::Y.::YYI'::f :FE:: FI I I :UE1111:3fRO::KN.~.::EL:IiLKNY1J1:3EGYlNE
I F i::Df:

.:::I'l:llINYAKu:Y.A)119'ATtKl::K'r:.'ftf~:::KKKK%TKII
:1lIRTf~.:: f )IKR.~.APKfMI/P.~.K

KitKINI.I,Y.K')VILII(aH.(:IIp:::x:NF:aU:.;::PPfNqpKAILf4IFy'Kpl'fGF::IyUll:'1 ~1'1(li 'Wll:

la:N 1t:: 1INa:l4e...rc: ~.:r Fr,Irin ~ Nlrr ~'In U()::'1 q.'.~':.'. tv~1'IH KLI~k's:l':LHAI
ILI.:::lal.IItT1'IIVIM.'IMNKfTRTfLE.iEKD1'QOUIFFt:~J191.'1'/KN
2~~T

Lu. L.r. ff'I' 1:(w:rrl.nr FW
t..nr.:AN:IINNf/uNIVlll.t'Ir:I:IY'fiJIIFT'IN:IITTNAN.~.INiLIJIUtAWJt7I3KIl:I1'I
R::VFJI
' CPGPCFDIItFIDtLKtANFE~ 'E?I~CYK:Ef~.GKRCIEKLTf:TPILEKYQRIDDRD

HAILC*h~DAFS'~'~:EL AK tLKOLRAOLLr'~LF3CR..tY~ICA IPV
VLLILL.WfIYCALKAL.: Pfl4:.KSP?IlffGYIA
Y~1~S"
' .
ILTL.iLWCRGTCIE'~'~A'~IMfl~tL3YP8L1!~TAFa.LPIJIi rIKLbCIIAIri~' I~~RE
VFiOi G
' ~:fhytOtH 'W1::' S)dJl A

L7C3WRIOVStJtRViA
OL1~1118WFLSINL

~"1 to hypochec ica: pr~rttrt AL.W10GIESPVYSLITAIa~WALLPVFFJ18F~GASI't~tFBLLTYLSPC~ALLKRLFKt7IPCI
SGl'r?IELMRIACT::Y:f.7IALGKVFFLGT~PLMIRELTLPpEEVEHEINRYYKAtl1~t~~ICADSLYCLVAAHY
NOIGKLINiGFFS~ICIL005t~8L8PL
HD'P Y~~t O
K3DIW1Lt70EVL(7~CLOEVSStLOANLEINKDPLLTFLVVFfttRlCDR10U1YVFSSVECAIMIFIRttIP04vF2 .Att01(GLPESFICVLEEHtIGT5VIR5AYY5NlIV~7PSTGSFDE4.

F:;
FttYSGNKPSSIIETT<'tINIADSFEAASRSLIQIASLPOLQRLID0II0GIti.00COFSCSPIT
LTIJR(TIP.~.'J'IDRVQOIHDh7IRVIGHLCCCNKSSLGE.iDONLIIFSEELTPS
H.KIEE
~

. :,~,hVlr/rr -, ~ .\~. ..p~..,..
. t fr~w AYIRI:FV.'.f.X:AA?:Irt'AtIISRAKSIPYLANISEEii410II1KRYNCKLVLII7GY
.
NAAN
~

r . . . ..
.
. : :~a::
.... ... , ;11...:..,1,,:: ;..;..,.,.I;:,~.:
.. ~~ ,..1,L; , -.
::.:.:.. :'1.f:l.l'f.iv::.Yi'.'I':.:Yfll!F'.
~RY1 :
' "1 :' "' . .
. .:U>_0041 n.
:I No robust homolo0 Pr~~c m Genebank/FltBI, . as of '1!7/91 .
iA
H~t'.~RWLLDIf;tlILEDUWALAKA.itnOCSIKVLIECVSOVSEIIkIIKKKWETIRTRFPItGH

KV;14CMIEFPSAVWNIEEiLPECDFLSIGTNDLVOYTLGISRISALPKHL1JVTLPPAViLKIl~ItVYLLVIIOEIf WL'lt4.HOPYYtutIL~Ni'IYIPGHTNKDSNKLEQK

RtiIHtNIAAANONQVPVSICCEAACOL.iLTPLFIGLCVGt3.SVANPVtNRLRNNIALLELVDFJtPFSLDCFSINF
LIFVSLVPIJ4:.LVRAYOIKKSLDRTIIIQIGYSPSTiCmiI~tEA

N:CLEITEALLQAKTCSEVEELLNRF44KITS
FVNCYCLICISIIl~LCILVPILlLW'LSLLLLGIL31LFSLnYFSIIDtItISIOtICI~ISN
.

CPt~0079 5II56 57967 AT

~T)79 hypochetiul protein KKKKEAICIMEOOFLf~tEASLLEItRY 'UGOA~GLVSWLH~LI~PT00050 66819 66199 resent In Genebank/DlBL as of 11/719A
lo h ISNGSGYA Q p ~ t ~~ ~F atno No robust VSWFPILCtFLAIOIYAKt~aiF~Fi~IVKANLGYLPSTNCKNALCRNSSIILTSSIKlIIGIL

GGCGILLPIP'LLLt~iItsISVLFQLL~G.!!'Rt.CCF71IRpSVSSDIIrINt<LLLLHNZLA

CPn_0040 55677 54318 dnaK-DNA Pol III Caeea and Tau IPYQASSRKYRPQTFREIt.CQSSWAVLIDJALVirNRAAttAYL!'SCIRCPn_0051 66797 67111 cL c in GenebankJENBL as of 11/7/98 p No robust homoloq 0resen AFnHStGYTCf CFAYLIARNIPRtIGMiETYINPGVLPSSNAQDVSRS'IIIYPSRSFiIOIPNtJ~BIFNRVPS
TGKT1'IaRILIKU.yCVHLSEOGEPCNOCFSCKEIASGSSLOVLtIDGASHRGIEDIRO

ItJTVLpTPVKAKFKIYIIDEVIOQ.TKEAFNALLKTLEEPPOFNKTF!'1i''TEINKIPCfIKSSEQiJ4XiNRIPL

KIWLORIPC1C1'ILEKISLMAGDDILIEASOIX.APIARAAOGSLRDAESLYDYVIS
I
"
RC

a a 0052 68008 6730d p CPn LFPKSLSPD'IVAQALCFASQGSLRTLON11ILORDY11TALGIYt'DFLtISGVAPVTFLtIDLr LFYRNLIyTHStTSKFS50YKTE0LLEIIDFLGESAtINtpN1'IFEQTFLETVIINIIRIY.-hvmC-POSphobilinopen Oeemlnase aRPVLSELISSI1ISRGFfGLRNIKEP1'LTQQVSAPQPOZ"~EOSPAAOCKIIKt4.SVrYSDPCLSDFCOQIRPLRI
ASRNSIiIJIICAQVNDCISLLRSInPKLI~ifQLSTi'CL'1' SVEVKSSASIKSAAVOTL:.QFAWEFSCItRQ
G~IPLHLVENSIfFFTOCVDALVH1~VCDLAIHSAKDLPE'1'PSLPVSIAITRCLNPAD

LLVYADNYVNEPLPLSPRtRSSSLRRSAVLICOLFP~00QILDIAGTIFfJtL00Lt0tDttYDA

00d1 SS8A8 57342 IVWIAJLSLRLJILttItAYSILPPPYNALOGSL1ITAKDNAGKwKpLtTPII~NSS
CPn _ No robust homoloQ present in Genebank/TC48L50 67986 as of 11/7/98 HSVCSISSRYKLRVLAITFLVLItr'VLLLISGALFLTLGIPGL?AGySF
P

CKYLYNMSYPP CFeL0057 697 Q
CCVLWSG1.L.~:.VfWEVBXVCPEIPAWPEtTPEDVPVTPFIUPAt~A
GLGIGLSAt . ~'~ Protein QKEpKTOKILDOLPGELDOLORYIOEAFACLGPLKDLKYtDGGTLO~~~KIRNATICIK'1'Ot~IDCGATAPt6iI~p CPGCt04i'INSLVEEYVPQARSCfSSRSS'1'SAW.S

DMIAEFVEi.G0ILC0ECRLLEFVINQTRYIGRIS.FKRt'~SLYKWEyI(l.YLPSGOVRCERSIa.H4ESRIFIW(A
GiDRIiL:OLWRGSLTLIIOGDPCICKSTLLIIOTAERIJLSWLYRVL
LrYREA

LK%SAAEWDRFMRT'tQ4IRItIN4TFDPNVYSVAKTATEICAFGitLETCVYESNRYV~SSTOrSLPAKRLitISSPL
IYLFPITNi.DMIKQOIATLEPDiLIIDSIQIIINPI' FIWVKeDQtIEIDDRIGNSQDISE

?CEYEKAIQ.LCDEEKSAtIAEOAFpDIKNRWBDNImit~PG~T~I~~IICttYlK9GEIAGPRVLaILVDlVLYFICN

RYEflIRITRARWYKVAENGLFNAIT0tV1(DSLRgOJEARV11FEKERSKIT40R~IKKJ~RSNANYRNIRSVRiRFG
PrNt<.LILSNHADGL1LEVSNPSGLFLQBJfTGP'i'iGSIIIIPIItii :.ROLItEGHDOF3.PRAGERLRELOALYPEIAVSYVFJ1RREYASDLEKAtfESIDKHYOSCVRSGALLIBIAALVSS
SPF11NPVRKT11GFDPNRFSLLLJ1VLEKRAQVKZ.F1?mVPLSI'~f.

~~Y
KIICP11110LGALLAVASSLYNRLLPNHSIVICEVGLGGEIRNVAtQ.FRRIK~IItJGFEG

AILPECQISSIPICCIRENFW.GGVKTIImAIRLLL

CPn_004Z 573d6 58112 No robust haaloloQ present in GeneWnk/EI4BL70089 69313 as of 11/7/98 EECEf00EAEFRENGTKIRSNEEYSEYLOQVI~IQLESCSKAL'1'!00"fFlli.CVItLtaKEEIECPti".0054 rttc-Ribotttieleattt III

SII4SDVVNRlE~ILCROIEDFILSRVEEIERIB.RNACLPLLPIKEiItTKAFWttt4&CK>ZIG.TLSFFPPIKIPN
SKFKDGAIiSNNPPIDITAIWILNF!!"IOPKLLEI11LTRPS~IO~iJ1 OSIJaOTIQRAYIIGSQKVSGLESEVRACREGLKDOVROFL~S'ZIaGIG
PYFKESPAYLTSSFRL ' V

. LLFP9IO~TtSTARASLYttAKAGCRYZ
TK VGIEDSERLEFIIrMViCLZVTDG1' C
DYLLIGKG6KIOSERGRLBAYANLFESILGAVYS.DGGLSPARKLTVPLLPPREEILPL18 pGVSLIKEEILt11TS1'FRTKFSYHSFRIJIVPCIOti.YLEYYODID~LERTRAAWNAFIStItYR

t~p~tlQplIJ<J~LVEAQAL~TEYWLYRZER1ISK>Ofitt7N110a.LpOF'IIQKOFRVLPVY05TAVT0AGCNVS
YQIQVLVNQ6ItYtG~I~Y158KKE710CI
....- nn., cem cn177 cPn..ooss 7oo9s 7os9o CT396 hypotMtical protein CIwICYLIRIRIOISALNLOHLRNFIWHaSILFE3JLLTIKDGFLLETKt.ONPIAKASRTID
TVIIMtF~TIFRSNPt:IYTWRKRRLOFFAAF1.VNRPKi.SLVRDLWV!'PCEEILEGEiOCTIL
PLLLSGDRAGSGIFFTGPYP&DLYELEIGCI'1'CLLLAFSSVCIPVI
CPtL..0056 70917 72746 sItCSA_Phosphoaanetosucaee EFLIQ.SiitRISLIIIEYEORIRSLYD11VTA~IICRWLStJDCIaQDNt'fIILWLD'tDPAOLE

pLfGA?LTFG1GCLRSIlIGIGTNRINLtTIRRTZ'OGLVOVLPANLPNPGOPNRWIIOCDT

Rt045IEFA08'fAlNiilt~lCICiVPLFOYPEPL1LVSF7YRYBRAIOCVNITJ18t87PPNYNC
YKVYNASGGpVLPPL00EIVAACSAVNEILSVPSIDtIPNINLIGKEYFaLYRDI'LIIOI4L

YPF)1NRISGRSLSISYSPLIECTCISLVPIIVLIIDWGFLSYNL.VOOD71IWGDFP'I'VOLPNP

CPn_0014 6078 60778 EDPEALTLCI0~4.ANDDDLFIATDPOADRVCVVCI.EOGOPYRFNC~MI~SLLADNILGJ1 f 11/7!98 No robust homolop present 1n Genabtulk/t?18LWSKTRHLGEN~(f.VILSLVTTEFtISAIAKtIYth~.INVG7tGFKYIGEKIFSWANS'1'NK
FVF
as o ' IAKSDCRVWIRiJiSAYKESGKVSSLETEACTYREYLREOWOFETOGVSLIKEELL!'LSS

TLKSKLSYDPLiANIPCFIKtYYCYYDDIOKARAOSRWLEKSERYRNAKRRE'OEIVKI~LFYCYFANKTES
GAEESYCCLYr;fItVF.DKDAIIASALIAFAAt.00Ki.OCKTLCDALLSLYC1 KEAIfPLIDIEEYRLT-0EERSNILEKRLIYMOfAVARpRVGEFESMEIPEWFSAKTDEQEIRKKLSNLEEISSM1FFSGKYOVEKPENYKGGIGF
NLiSItDSYALTLPK

.
TSIC.CYYFSOOGRVIIRPSCfEPKIKFYF~fSTHYPERVTDKEIOKQRFaFSFOH<.ODfI

CPn_0015 60961 62790 CT)45 hypothetical protein CKYTYHPPOLPPDHSVGATSWpPKLRILTITlLYLC'JLLLISGALFLTLGVPCLAAGL5FCPt~0057 7.91) CLGIGLSALOGVLW9CLLFFLIRRGVSKVRPEEIPV1'PSHEAGKIL~_QLPOELDOLDTSsodH-Superoxide Disnwcase lHn1 IDEWSCIGICLKDLKYEDpCLLTEVOLIQ.RVFDFVRKDtIV't'EFLELOOWAOEQOFLDYLILKRYWNSFVPYSLPE
LPYDYDALEPVISSEINILHHOKNNOIYINNtJrMLKRLOAAE

INQVOSISHKLFVPOVNIGAHLAEIGGYLPSGDVRVERLKRSAROWDRF?IRVTCDTRXV'PQpNtllEtIUPU'RfNC
OGHtNHSLFWETL11PLDQCGGOPPKHEIlSLIERF~JCI?ION
' AMAFDENJVCCVAKNAFDKAFCALEECVYKSLTESYREAFYEYEKAKILRNEDVE~R.OOKNGKLPLLL1IDIMfItAY
Y
fLKKLIEVAACVOGSCWAWIGFCPAKOELVLOATANpDPLEPL1 KSARAEORFR6VKWlWEDLKETVFiJVKENGCIDLE'JL?A40L11Pt)ItCPENLIPBIUIRNINfpYtINVRNTnLK
AFPCtItM~ICHItIJNFSEIISSK

NSHKLWFJ1THRFtKGAECTYSV.1RVAFEKDGSR)WQKY.F~EKTKEWLRCLKDLHDOECNRA
0058 77eZ7 74562 CPn RERLAELEALYPEVSVSWETERETKFKLCtAYCNLEERYGStMCDQEDYWKEE4?IKEAE_ ' IceOACCOA Carloxytasv/Transferase Beta tAEFEYTILSDAAN
IRWLVRLI''SYDKPKIKVpKIKADCFSCWLKCNtICNENINANEIGOHYNCCPKCSYNYRiT
FREKGTKVRSPEEWEYLDiLFIJGaEDCSKQLTIAE'~IVhGIIELEA

RLKVtGEDTEDILPRVEEIEINLRIAELPFLPIKOAPI'KAFLpYNSCKDRLAt(VEPYCQEAIERVKLLADKDSWRPL
YTDLKSQDPLEFIDI'DTYANRLEKARKNtI'ESELVIVCICTIC

SVDYKarFRV
IJIPVAGAVNDFNFNAGa'NGAWC89G.TP.LIEFJRCfRLPVI

.~Pn 004e e2775 6)261 tNKTSAAi.AKLNFJVGLPYISVLTNPT~VI'ASFMLGOIIIAEPKALICFAGPRVIIAtW
' ' ' N" robust hnlmlv4t presanc Cn CenebanY./F11BLOCKSKAPRDLSKR
.~s of 11!7/98 l Il~TLLDYFLApfY1 ERf'Q.;LNpOI4NVY0COKATGLF~~~EVSAYRDHLREOITEFEZ'OfiLDVIKEELLFVBSTLLKEIFLLTDOSE

iC$KL..':lDf LiAOIPCNKFYEYYDCIDKARVQ$RWLEY~aERYRKAKKOFQA4LICECLFIfE
4'7 7.1562 75050 ' t>pALY.KALYRt.LREKRFNKF.KLLtCNKIEM(7t,~RVtrEF:P.:Dw:Pn_0 'fut-,111TP Nu,:le,'r t,lohy,irolacr:

t.l ;7 0 )bS~J IKHIfI'A::l.'NIkII
IC'NAIIXIVFCELD:fX3ELPfITI'PI:AN:AOLRANIEEPIALt.i't~RA
1N4'1 'I

, LIPTr:IxJIEtPEr:YEWVRMt:a7:(.ALKINaTVItI:Pt:I'ID.r.DYRi:EiRVILINPC:D'1'FI
.
n ~
JEMBL ns; ~C ll/7/9H
:C tmn.,l"u nw :,!nr tn ~
tunr en w tm y IEPKNFIAOWL:.I'~1i'A1'FWKQE.:LhTARG..IY:Ft:lrl'r:ll;
.
.
.
, m~
:
I:tIF'ItIJ!VTISra~F'ItlVL.t''.J:ILTHYIIFQKIRFfI'LT'I'rJ:F/LNK~WtKhYEL.WFYYt::;C
l'EC

KVY.IIJ!'::::IIF:WI.
~.Pn_Ot,SO 751104 'l5 53H

.'tn 11114a .. ",an ,:SpOI GcaN-VI's I IA Pr.e.rin V'ItF' u:: ,:rncuem~~t hytxrttuti,:.tlFKLPEC/EVLVILEI'AKMI::YtxrFtOqLF::LF::LL:PRLVNFt.GKIICRDCIWUI.Tn t.VDA
IM t.r.rrtin ' ' HKFJkItI::YNttALIIKI.::IKJWVNYFt.YTFlI.'X:::F'IVAIFTFAWf.KVL1'1.'I'EIKN:EISRISl Dt:L139111IDc:
ACIIi.EGY.OAFFDAL~RRf7itlCTR%IC%M'."JAIMK:KLECC:aIFFIAIC:III

I:PAf?41)F::I:.W::NIKFYKHtNIt::E11Ft1KWIII:rL::f~:::l.I::KF7:tlADEKrOYYIPKKJ1A0 ALVRL'JFLIC:GFG4AQAEYLKL4'.TLTL:L.RftECRRqrJt.LUVNTtELIMNVFVt71 1~Lt..'ftIFVU::::I'iK,'LKDLt' IY FPLL.tIKBKKTLE l1 i I: I::NlIr:HV
LAS\:FC:IILK IFLIOE7J

u:flv IIW.I 75iu1 762DN EATYAtORKANKKPtCJtE
:r$'ttFlYTt:!rCK,~,L.W1LIEINRNNC'L'PATJINLGAS
pcsN~PT:: IIA Protein ~ NTII tsNl-BamJnq Oamain UtWUIOPItPYCF:xIPOCC:W
.:.:YL7t11t0'ta'CD:LARADCCLHTt.;'C'I'L9pIKKLPORt It::HOCtt.".:DVKNGLKLDEVA:iLLOVCQRVLOWLK~AIPSY5IlIlICfRF3RECI~ft.L t '~
IItpALHLOERt:EOKEALKDGiLKYSLYKAiNROLYt.CDVWNSKZ~J~4YASKYiAQKFQ
LDE..~VLFEHL:'rIRENLN.."fCLCFZIALPHAKDFLIHAYYDIWPNFt.AEPIEYG7It.OCKP CPn 0073 87153 9757.1 6'GILFFLFAL'ODK~HLNLVNKIVHIGHSLN71RSFFKNt~OL'L'AwKintA-IntCiatlon Factor IF-SNAKKEDTLVLDGKYEELLPGfIHfRV .LENChIPIrfAHLCCKHRHSNIRLi.IC~VT~15 .:Pn_Q057 76251 77690 A~~~R sf4iir~'~' ~:;.,...,:; ,:;-w~l.~,..,:.~:rt~°:.I:;~~.::::: °:--n::r:r:n:::::..:::..:~.r:~.'.. .,r .:... : :. ..,;.._ fWKKPQKt:;iERiIOAKKEPRARKB'fLVPSSRTL'alRat7KMItNSSRIINEISANST CutA-tlunSiecaun ~dctor 'tu PRSVKLRRHKRAEOKMKOGESAPSN<~TLKS~KL'PS1LOKTSIHEREKA?SRtVNfSCL
EDFEHSKET/QRNrIPNINIGTIGHVt#IGKT:'LTAAITAnLSCOGIJISFRDYSSItI~PLE
SSARKRYCTPSSMP;LFLETEIVMWERTKC1QDNEIHIPWQWfNPKLQNI'KZTKQ
MRC1TIN118HVEYEfPNKNYJWVOCPCNADYV14~8ti1G11J101mGilILWSJfI'DGRIIPOT
LASOASIOQSEGTEOSLREI.iICCASLPVLVPSNPEYSYORCKEC3.KCL'VAERIlOCtOIKS
KENILL71R'04GVPYZWE'LN~ISQEDJIF3.a'DLVpIELSELL66~'YI~CPIIIIGi7IL
VROALFJIRSLTKIfVARGGSVTSTLRYDPWU1EIKSRIfNCKVSPtCAREOIDfSSCKRpIIM
K7lLt~DANItIGIVRELHOAVDDNIP?PEREIDKPFLNPIEDVFSI$GwGTWTGRI01GI
NCKOOKTfPSEDASOEEGOTCJ1GLVRKTPKSQVJISIUONFYIWSKH'tNIDSYLTANOIfSC
VKVSDKVOLETIVZCVEtQRKELPEGRJ1GB~RrCLLLRGICKNtaril~lRNCOP
SSEE'fDWPCSSCVSKRRTIOrSISVCTHwt~lIl4lZVCALIIIIWITESdiTSDPTPPIPTP
NBVKPNTKTKSAVYVLOKC~rGRHKPFFSCYRPQFFFRTTLNfCV41'LPt~TfJI~IPCDN
V~V<SLICTVALEEGIWFAIREGCR'tIG7IGTISKiNA
CPn_0063 78109 78267 No robust homoloq presanc in Genebank/!?0L ss of 11/7/98 CPn_0075 91087 91350 pHYANCKrWCLCLYDFSRHRSPPCLPLTFTPPYSFTI~IFLGRCLSTSNIVLL suet-PreDSOCSin cranslocase gRgWpIptapNNRKAtSRKIGTVKKW1KFAGSFLDEIKKIEWVSKHDUOIYIKWLISIFG
CPn_0064 78310 78576 FGFAIYFVDLVLRXSITCLDCITiFLFG
No robust homolog presort in Gensbrnk/EI~L as ot~ 11/7198 LVM'KIOCSApYYRSRPAERAOTPPQPFLARDRRADFiiEAItPRFSJVC~VtJ.t~VU'~L'AL~ CPeL0076 LFLFVIdLPLAAGSYLLAF nueG-Transeriptional Antltesmlnati0a pplCg11K7~I1frMYWOVFTJIpEKKVKKALEDFKESSCIffDFIOEIILPIFJ~MlIVIOK:EH
KyyltJlyIWpGyLLVl40!<.TDESWLYVKSTAGIVEFLOOGVPVAISEDCVRSILTDI~1( S(fWCIfHOFfVGSRVKINDCVFVNFIOtVSEVFtIDKGRLSVMISIFGREl'RYDDLeFWCY
BEVAPGOESE
CPe1_0077 91956 92135 r111-L11 Rlbosowl Protein FP'ygypLFygygQCKVRFSHSVbMXIIKI4IPODKANPAPPIGPJ1LGIYACVNIIGICIc EFTtMLLPVVITVYADKTFTFiTKQPPVSSLIKKTLNLESDSKIPNWiKYCKL
'tpApNEAIAEDKIIXI~IVLf.ESAttANY00TARSlCIDVE
CPr1-0078 92157 93160 rll-L1 Ribosomal Protein SCRIlITKNGKRIRGILKHYDFSKSYSLREAIDILKQCPPVR!'DOTVWSIKLCI1IPIOtBD

GOtRGAVPLPNCiGKTLKILVFASGftKVKE7IViJYGADFMCSI~LVEKiKSQiLEFWAVA

CPn_0066 80916 82655 TPZIIHEVGKGC11VLGPRNLNPfPRTCrVI'tDVIIKAISELRKCKIEF1010R11GVOiWOG
No robust homolosi present in Genebank/Er~LKLSPESSDIKENIPJ1LSS71LIlUlfPPAAICGQYLVSFTI5S1'MGPGISIDfItQJNS
as of 11/7/98 CVYHANR'fQSRPPSPEISICELELOELM:SSNZ'LTISNI'PPPSCMTAEEVSLFILOGRR

NSEDEECP~EVYDVVCITNOGDPESIRDll6VRVN1(INGSCRTaHECILDAIBIiCDZ.PG
a1DYIYLt'8GN 0079 93170 93688 CPrI

EPVRFINNS4'YGLRSGFLCIRNRIPPRIxJVISDAIQA~FFIFA11~111-Yt'GOWITLLiSIRGA?AVlfl~ri' r110-L10 Ribosamel Protein CCLYLQVAGOILSIYSrtItILCVGIGSSYYIOCiIYAVtOiYR~08t0E~LIi.OEY~SAA~'FILLRYLRITMYSRE
IPNSUID11SIICFM.1001IF
LPYADSAEGLFLPSVttCPSYQWALACGEpCLIHtIf~pQVOFRPODSSStIALVVtVLDFNKpHKDBLVFU1DKN~LI
SLS
VSMIO

STWIRLIEWIDRGDSOAVLEfiiPGPSt~RDIJ1LTALYAT'tRISSLiID~L~L~~Vp ' FMLE~~~~DP
GALYEAYAKLPSLKG.RCOWGLFMPHSQWGINNSVLSGVIBCYDQKJIGIQI

RR
FV'1'IrAIVIIGYSIM1'LRYFILLLTNRPOCRRHFRVLRi.MLGL.OStGFLTVLLDttZM~
' V!1 VNRRPPLISVIFCTASFATGSFIYVDLTRNIfTSLRSRI.OL1WRRL11GRGLPLNAV!CPe10080 93720 FHPLIIGFINOLVIQVPAVVIRPN1TAVY~OTSOE~
LITFttO~d r . r17-L7/L12 Ribotamal Protein HLDSLRF~
I

FFFVPSVRIfHI.IDrRPIa VRtIfKVITLBLETLV~ILBNLTVLELSOLKKLLEFJtiIDVTASAPWAVMOOODV1PVM
GDVtAIGO,hIH!'Ii.QIILLVIN1 BPPEFAV'tLmVPJI~DCICVLKWRL~ICLIILJCFJUtE4lt'ODLPIfNKEKTSKSt111m~11I1C

CP1L0067 87910 81053 KI~IIGNIASFI(GL
No robwc haeolop Dresent in Genebsnk/ENBL
as of 11/7/98 ~YSYPDPPNAVEGRVNSSOALNpt7C0Nt~G8IGGLLRCRILSIWAVITFIJILIrV
t CP1L4081 91=19 98016 W

FW rpoe-~ ~lY~rase 8eca LIALTIJLSILTSYPYL7~I,GVFLLIVTIGCiIFAI~C.SEItIKRVPPfPIStd~EIIA
0VLLf'1mN
RSH

FREILBt~ItSRRTRl4JfCPERVSVIGCKEDIPDLPNLIEI01K$YItQFLOIQUJI~tI
KNIDNEKEKEDPEtIFGRTATDIPNRSALOQFNHSCNItIHEBPALTBTY
L

CPYTLPOYfSEEEVLIRSVVGSYLLiEIICVPKYSH<.iDEUNKLK81S6RCCLTIDXKTCLtEYFREIFPIKSYNGTV
LBYLSYtAfGYPKYSPEECIRRGITYSVTLKYRFRLTDQCIG
NDFFTPCHS
Ui R

p IKEEIVYNGTIPtJtfDKr.TFIING71ERVWSOVNR$PGINF6pEKNE100NILFSPRIIPY
~
ORIUSFLlI'OKDLATFFLAYTRVNOGNWPFRJIGAItWILItfYVRLR
TfiDttCDGFLE

CYYARLAFNDTQRLYHOLFNVEKLRSIYAPImKDPLCNPWAPIPIYDLLItpG>RiLEiIIFDINDLIYINIOWtKRRA
KILJ1ITFIRALGYSSDADIIEtCFFTfGttSt~SE

pQE~tEYPSRMQDQFWG
KDFALLVCRILADNIIDEIISSLVYri%71G1~'t'AI4.IDb1U711GI1LSVKiAYD11081a11I
I

104.AXaP'fD'nFJIALtDFYRRLR~EPATWNARSTIIDtLPFDPIOtYtit.CRYGRYKi,~K

CPn_0068 81909 LCFSIDDEALSQtIfLRKEOVIGALKYLIRLIWDDEK7~~CVDDI1111L~RRVR84CELICNQ

CT360 hypothetical Drotein CR9Q.iIRNEKIVRERHNLFDFSSD?LTPCKW5J110LSL.11&VLKDFFGRSDLSOfIDCTNPV
SF11IKKFFIYSLIFSCSFSAPLNGICNEDVSSpSAIIEDPEVLITOLNELILTPIEOGKEI

0AISDGOICSSEEIEESCGTSDSEGLSEKTOKESSNEYVLDFFDSNWRLEGISKHAELTNKRRtSAiGPGGLNRERJ1C
PEVROVtDISNYGRICPIETPECPNICLITSLSSPJ11C
IRNt3 YCTfVIIYJIGE
' ' ' . fEP
COSCQVAGIIDCFNREFDIRNRELELIDIRELEWrrnt.c~SIU7NMKONSRELAFQRADVEEECVIAOII.SASLDEY
ISV
fDEIEYItI
NEPGFIETPYRIVROGIV
AVPLLKT1'JIIIISIC
R

Q
AFEACfSTVTIrIONSPKpLVSTtrZCLIPFLEHDDANRAUIGStJIIp 1'GLflCMAKDSCAIWAEEDGWDPVDGIfKWVMKfDiPTIKATYNUfKFt.RSNSCtCIN

OQPLCAITKCOVI710GPATDRGEL.ALCKNVfNAFNPWYGYNP~JlIiI88KLIRCD
~

CPn_0069 85191 87086 .
AYTSIYIEEFELTARD1'KtGKEEITRDIPNVSDEVLrINLCEDGIIRIGAKVKPGDILVCK
No robust homoloq present in Genebank/EMHLKSDtle as of 11/7/98 F
RK
R

LNFLYVYLLIFNtGIHTTPPPSRSSSPPPYDWILpDLCMT>NtJSSRATPPPPEIIGCELPS
D
Ia ITPKSk'tEUIPEERU.RJIIiGEKAADVKDJ1SLTYPPGT~bWImVKV
LWEKAPMIINRRTACIWIImGLID
A

PYFSJISNiyVIERGAPSLPSPQpLLSLPEYSROPPP.r.YFDETJISITSRTSE~CfLYSTL
LVEFJ1VHLKt%~DKGYKHQVATLKTEYREKIIG
WU~IPNCDt'IGVLtCGL4SDYETAf.~IL6IN1rKT6VlJIIR~OIIIDLDItt OETTERIEGEIX

LCCPANSERDWEDNEVNCIYIAS1'SD'tOLEAVp(xMtIITtLAGEPVRVLYt:TGNLYAFAR, ' GVIAOPKWVASKRKtAVCD101AGRHGNK411VSKIVPRIIONPItt~NCISIVONIIIIPIL~VP

QDI
SRNMAGVLETHLDYAAK'fAGIYVIffPVFEDFPEQRIWDINIOpCLPBDDKSFLYDCIITG
ENTCNSRLEVSH7YRAKrYPYIDRFFSPN4MViCRRFLVFYpCIKiCAYVOAALDSSNN1 'IVLGLSPTVYIRCNIrNVQHYRVRDFWPSCLDSLMGItM'SVLPYCtSSDGIFYPSLFSNERFCNKWICYI1MLKLSH
LIADKIHARSICPYSLV1COPLCCKApIIODORFC0~R1AL

TFDMAIRYCERCLLVCSECMGNLPETCpOTSPLTSLEOGfIEtrALVUIPQpNPEALSLASRQEIL'fVIt5DON5CRT
RIYESIVIIrENLLRSCTPEBFNVLiKAlIxLCLDVR
EAYGVAWIL

I11HEERCCRLESNY?IPGRSSNPFM'fSNYVLVrtINfLIIQIYLHSPYYSFQSNDIVCLIFIS.

.~.MVETV~YLFLTVTDSTCCRRYLRVPRLVCTCLRNLALPiTLLCLLILSYPRSIrLCVPFPNWDA

tJVrtFIIG'fHf.'ITRWFFAWNLILIIWPFIICLRIIGIpLFVNRSI
IfSITtGARITDLTLASHR 0082 97992 102221 I CPrt YAIVFPSIVC~LLTAIJWANiNIt.ALDPYRLIESGDLRRPAPNODEM00~rPWDJIYS_ rpoC-RM Polymarase Bate' (:LVINTCtYMLILFANLiFINYSVRRYNRSRR
CSSYGRRRLKNDVLEKINFCENSRDtCVISKECLFDKLEICIASDITIRDKW&CGKIKKP
.

~Pn_Utl'la 87399 8720A
ETINYRTfICP~OCLFCEKIFCPTKDYIECCCCK'fKKIKHKCtIACDRCGVMLSKVIIRER
NAHICLAVPIVHtwFFKTTPSRtCNVLCN'tA.~.DLERVTYYECYVIIttM~GKT04T100~iJJ

th robust ntsnoloq present in Cenebank/EHBLDAQYREWEKM'.KtHIPVAKMOCPJ1IY0LLK.iEDLQSLL1C0LKERLAKTKSCOMP4ttJlKR
as of 11/7/99 ' LLISFRDTCLKR LKL
IOGFYSSSMIFCWNVLKNtPVVFPDLRPL'/PLOODRFATSDINDLYRRVtNMBIRLK
'fKyrLFNLKNONFFSNOSRTYEORFPKVSPHPESILP.tQSVGFSSOG1 riLYI AIGRtXTPEYIVRHOtRMLOEAVDALPDNDRH
:HPVN17AGNRPLKSLSOXJODKIKIRFRQ

t1U71 N8U~6 8759?
NLiCKRVDY!aCRSIIiVCPELKFNCC~.PKEMALELFEPPIIKRLKOOCiiVYTLRSAKIM
:Fn _ IWKiAPEVWONLEEIIKf;HPVI.WPAM'LF1RIJ:IpAPF.PVLt0t71fAIRINPLVCMFNAD
-f 125 1'y(xxhcc ical protein ' ' IK::LR::ILEPIf FL4IIARt:LKKDNKIIEELFPEPFUYDNLYLKfIIENS:iSRL1/1FOKKRNLf1 F
FGtDl7lIAVIIVPL.aVEA0LFJ1KVUIMf'I~IrFI.P:.'xKPVALP.'.7falfllSLYYIIIApP

'V::IH.YLYEVYQDGILFFFTYTKAtJt.~.fIA.~.LFTI:W.~uCE1'P.STIL1'CKPIFPEFJKY:KTKtFKDE
tEVUtAUJNrr:FIJ717VPt:LPRDL~Ic'.tlc:flllIlP.KIKVRIDr7pIIEIT
YF
NrVVtx . Pt7RVLFNR t VPKEta:F(XJY.~,l11 . ..rP t..EL t t//:YKKVr:1.E11TV1tPLfMfLhDUSP
FYJI<LTh ll_:('~:RIJ4(x:E::LYNRNKO tQATKA
f AVpYLXPP(,T ' ' JAIVKYt,YDDtlttTECERIt:KTI~IWflY:f00luD
ft~Y
At ~Nr:LIrOllrttf'DIK::11ILKDA

'In~ inn/ Ntl5l Na1157 ALriEt::KC~SKNNPLFt?rID:Y:AW7NK:aII.Y~JIt:AI.RIaJIANPNr:AIIF~.PIT.'OJFRE

T:~A hyrrmlirric.~l prntwin ~'LTVLE'l.~.taaK:ARKCLADTAt.KTAD:Y:YI:fIRLVDVApDVItTF.KIX\iTLNIItEI511I
' 'P::YIKEKYI.ILI'fCLLFYFFIfYRIt.TPL.::rf:LCI'L'DDWPOEI,FCDRL%3SRI~rf~:Dlt::f1.IJ
W:a:DVIH:;1/VAF.AIDI'.ltaKTIKLR3 iw:Yh::l'Y tXlt::%EEI.LI'LKpRIY':RTVAK!I
' . f:~
::IE:YIX:NA:Y:IXa'IV::Pr'I::ALVALTDLKLVPYNGtI.iFIihl1'fRLKNAVEKII,LFIpNI~IK'fLT
i:K.':r'Rr7Vl:AKI'Yv:GNLAN:hI.I~:hI~:P:AII~::
tllAtk:lt:EPlfittl.TNRTF111lx:IM

IIItIYALTLTIWLIT/17ILIIGV'/F~IPTATCLDKENKHRNVNSWNL:.Tr'EIITN:;tXSlLYI7ttKJlWIIY
~E~:rINLVInYY~JIIJIWr:DNf:RTWKTKKUl7fK.iIE
IILLfII ' AWALtI
VI' . KW
.
::LKVrPVC4:VKtl.VAIY.TPV:aJYrhI~:FYI:1.rltrtl'Itr:MtfnFIKYKIH.VrJ:I:.TB
.
talc:lUJId~FII:fRUILLA'rNIASI::ALLYAVP57A'/r:LViI:FSIfxIQI:,INfVYCARU.:D

WO 00/27994 PCTNS99i26923 NKHIS:LVELIWONRGfLtIMIAIYDOADL.iEL HIIrMEVI::LG:!' JATPSGAII'llEEf:QRVDPClD.LA

RLPPGA IKTYD ITOCLPRVAELVEMKPEDMDLAK
IDCWOttI(G IOKNKRTLWCDEFff CPn_00t.1 .:~ ~ r~trr~k~~'~"v CCVRELOKYLVNEt/OCVYR

~71EECNLIPLTKNLtVpRCOS'VIKCOOLTDGLVVPNEILET...
IIVIIpNLQKVRL1'DPfiDTI'LLPGEDVHKKLFYCFNRRTCEDGGKPAOAv.\1S-ValYl C1U~IA
fiYnllflC.1 ..
SROIIKFIiLRIt~f'fEDFPIUlNF(;L'fEPL'i'IFWEKNr,7lFKAEA.iaOKPPrSVIN
~fFt f.pCVDINDKHI Yl VPVLLGI'fKA:uLGTESFTrAASF'OD~~'TDAAf'CSt~t~t'' 'I~FK~I~I~ PPPNYI'CVLHNGHALVNTLODVGVPYKRN.'Y:FEt:.'a f Ir:TL'Itlk% IATQAWIRNLOASEG
' ?ETHKRIKOYLEKEODLVFDf VSETECVC :
KRR'IDYSREDFLKNINAWIIEKSEKV':Lat:LAC:G:
~~.JWdIKRIr'1'IIEPLANRAVKfIAFK

FC~ICYtYRCfILVNrDPVWIAt~DE1'EfEEI(LVA'ILYYIR~RMVC"~E.iIWATTRPE
t ~pn X091 102:96 103312 .
t .r,..... . .-cnY,..:~f.".~ ~~' .. . . ....,......'.v.~Vt!
..
. .,.
....
.y.
.
~

.....~... -. F
', :..r.w , .s :. ,.
'~,.. .. ' r'K PKSr '.Y~F'f. :.i ~:' .,.... ::1:
' . ;e:~.
1 ;~~"--rnr, ~
~
4'IEPYL:iKHWFVe'v.::.rm;nu:ia:F.:aL:K:FiK4YVKV'i'u,iw'47iNLRi.ii "

.
. Vs:V::YRsur .
CISROLNMGNAIWWYNIO~CDERYL.'.'C'OI:ErIPEEVACDPDSWYODPDVLD'I~IFSBGWP
.
..
: :1F .
'ELWEAWWGIAONGODLCTLSFLLOKTOVNFALElIKNIPCRISLELDARL1FNVEAM

VOMVFL50LFEAMOGOKKRLLVKIPfsIWECIRAVEFLEAKGI~ftLIFNLVOAIAAALTCLGNPDENSFDLK1IFYPT
ALLY'CCNDILFFWVt'PNVLI.CSSM&GEKPFSMIJIOi.IP
' OIMAASFIt l'OISYKRYHflEGEWSYISGKEKLAYOMCFrILPDL11VAKNCKLSKBKONVIDtLBIIIATYtt:
KAK11TLISPFIGRIYDwMIMYGDF7CYSIDADP41lA5VSNIYAYYKKFGIF1 TKEpVLALAGCDLLTISPK1.LDELKKSONPVKXZLDPAEAKKLOVCPIEL?ESFFRFIl~TIDAWL?I~~IDt'D~~~
F~~tIFGNISDirOGKDLLaGIDCD

EDANATKLAEGIRIFJIGCtOILETAITEFIXOIAAEGASLGfmFYILOGFNOLIHOLEEAYATYAFDKVATL1YEFFR
NDLCSTYICIIKPTLKiKQ
' ~~0~ I~t~t ITESLFLR ODI'hGAi'P~DCDAFI
PYTPODLRESFTLJIOItLVYTIRNIRGEMQLDPRLHL1UMICS

CPn_OOB1 MLPSRAC1IG
predicted fersedoxin 0'1'TCIOSCIPIIpALIGGLESIOLLDIIEPEKGLYSFCYVD1'IRwIFVPEfJILLKCmRGE

SEMKMO'lalfKSOLVFSCPCCCK(TIVCFSVFNLOVIL1'CNVCSSTY'1'FDSVIHNEIROFYAKEAVRLERAVEZi LCRL.LGDEStCOKAHPNLWAKOEALKMJitIELw7GILmG~ISFA

:.CKRIHDANSI'w"NATVSVSVEDN011DIPFOLLFSRFPVVIlILSLDOK1CIAIRFLFD11LN

TSII~sQESDLIS CPn_0095 115956 118790 0085 104512 103'166 pknD-S/T Protein Kinase CPr ACIVCLDRCOORSLERYDTVRIICIC~1CEVY'.aYDPtt~.SRKVALKKIRCP~NPLLKR

_ RFLREARIAADLINP'GWPVY'I'IYSEKDPVYYI2lPYIDGYTLtCrLLKSVNOKESLBKELA
c~311 hypothetical protein .
EKTSVCAFLSIFHKICCTIIYVNSRGIWRDLKPDFIILLCLFSCAVIL.D~.AaVACGCEC
FSMKPFILFILIVAOFPAFSAOPATOVSASHSKpAKARRTSRIRSSMTNASVSRYKTRA

AARKKIGKFflOIPSLSPVOWVRYSCKNYSICfP5L4FOCIDL:K1'QLPEKLDVLLICKGKGNDLLDIDVSIIEEVLS
SRM'IPGRIVC1'PDYMAPERLIGNPJ1SKSTDIYALGVVLYOhLTLS

LTP'fINIAOEITSKSSKEYIEEILAYNKJWEMTLESGIFTOIOSPSCCFTIIK?ESNFPYItR%!~%KIVLDOQRIPS
POEYAFYRCIPP
CRVfCL0A1'lYImiTAYIF'"STATLDDYALSFI'FLKWSSFOIRGGKGTSCDAILEKJ1WNRIQ.AVDPOCItYSSV
TC.1~DI
ESNLI~SPKIiI'L'ITALPPKKSSSWKtirE?ILLS'RIi.~,VSPASIiY$LAISNIESFSil1 LEAt.ONf?1K RLSYTLSKKCWEOFGILLP'ISENAIA7GDFYQGYCiFNW
IKERTLSVSLVIDiSLEZORCs 8 lOS5Z7 ODLFSDKLTFLIAL~00ISLSLIYOGTi~'ILIlOI~tYLPSRSGAIIAIiVRDI~DILEDZCI
SFPCRItOGYCARFRAGITVLCKAS
R
A

CPn_0086 10489 I
E
FESSGSLRVSCLAVPDAFLAmtLYDRALVLYR

acPE-ATP Synthese Subunic E
TDNN~~T11IJ1ZECPSIOi~GSfAIIPLEYLGKALVYORLOEYHEEI1CSLZi.A
NINANLNADG%LKOICDALALDTLKPAEDSAAALfJiNAICEOAKRTI0EA0Et~IRKIT'fTAHESFYIfRDRiJIWF
lLILVLEIAFOAITPCOEEKILVWLKDKSRATLFC
DNVVYRT

EEWItpKIKOGEVAT.S0AG1(PALE71LICOAVFNICIFAESLVEWLEEIV'i'1'DPEYST1G.IOAL.
~ IFRIX
LLDPNLCLIISSIQ~L.FLSYWSCYIPfR3iSLFHRAWDOSDVMLIEIFYVACDIJODAFL

'IOALFaOGVSGN1LTAYIfD0IV5PRAVNELIGKAV'1'1'IC.RIGfSWOGSFVOGYOLKVESSCIDIFKESLEflO
KATE6IVEFSFBWf:AIT.FAIOSITNKCDACIIIFVSNDOLBPILLV

wVt.D<SSSALLEIFTAYL.OKDP'Ra4IlOGS
YIFDLFANRALLE.ROGEAI!'QAL.DLIRSKVPF3IFYItDYLRNHEIRANWCRNFJIALSTIF

ENYT~OL>tDEOY~t'~IOCAF~1AI(OHFDVCRF~RIFPASLLARiIYNAIrGLP

CPn_0087 105510 106376 KDALSYOERRLLLRQKFLYFHCLGt~OtDERDLCQTHYNLLTEEFOL

CT309 hypothetical protein SHCKIFSIFXVWKIOYYFLSSFLPTOLPESVPLFSISDLDDLLYWLSENDLCNYGLLKCPrL0096 134347 AFFDFENFAFFwAGKPIPFSFGEV1'pENVFIl4SSQOi~ISOI~fFKD~IKS50DCT'396 hypocheeieal Protein RWNFSDLFREFLSYttOTNSSKFLODYfRFO00LRWLAGTRARVL~SY~~SCfFLSILRCTfMCSLPVYVSCIKVRNLK
IIVSIHPNSEEtVLLTGVSQSGKSSIAFDTLYA
TLNRALiILYOFHKLECFCSDSYF ' ' DPWLM~IOKDSPNYELPEEFSDIOCVL~YGLLPH GJSN
NCSTI
.nwrsvrcrt.srrlllTITfLPNPKVECItIGLSPTIAIK~7IIFSHYSNA

DL'NyI;,ARCATYMFAIRNSL.ASVCKGACIINHIC%AIlDr1 CPn_0088 106351 10A14S
CT288 hypocMtieal Drocein SYRIAiIOIDtIVSDGTAOGtIVIEAYCRiII.RVRFDCYVROGIYAYVMfDNTNLKAF:YICVAD
OEYINiDYFC>yIpCACRGALVT!'SGHLLF~IF~GPGLLOGIFDGLOFIRL111FU1EDSSfIARG
KNVNAISDNNLNMTPVASVGDI'LRRGDL.IGIY?~RITIIKINVPFSCFOEVTT~~TSE
RWPIKpAFIflGflCIPAHKIMDIit".LrtILDIbIPVL
KOGTFCTPGPFGaGIITVLOHHLSKYAAVDIVIICACGEPAGEVVCVLQEIPHLIDPH1GK
SLNHSI'CIICNTSSlIPVAARESSIYiaVTIAEYYROIr..LDILLLADSTSRWAQA1RCISG
RL6EIpCEGfPAYLRSAIAAFY~QAIT1'I~GSEGSLTICGAVSPA~?1FECPVTOST
L71Vi1CAFCGLSKARADARRYPSIDPLISWSKYL~KNGOTLEEKVSC~GAVI0fAA0tLEII
CSEICKPMCWGEEGVSMEOMEIYLRACLYDFC'1ii00NAPDPVDCYC~ItI.FSLIS
RIFOAKINFDSPDI1ARSFFLELpSKIKTLlICLIIFLSBEYttESKEVIVRLLEKTMV~'IA
CPn_0089 1D8111 109466 CT289 hypothetical protein LDL'WIOLON1IKWRKI7MOTIYTKITDIKGNLITVEAEG71RLGEL1TTTRSDCRSSYASVLAF

DLKIM'LQVPGCrTSGLSTGDHVTFIGRPMEYI'!'GSSLLCARLNCIGKPIDNEGECFGEPI

EIA?PTtIJPVCRIVPRSNVR'1'NIPNIDVFNCLVKSOKIPIFSSSGENNNJILLIOtIAAQTD

ADIWIO~LTF1IOYSFFVECSKKiRFADI~MFIIiKIIVDAPVECVLVPDMALACAEKF

AVEEKKNVLVLLTOMI'AFADALttEISITMDOIPANRGYFGSLYSDLALRYEKAVEIADGC

STTLITVTTMPSDDITHPVPOH1GYITDGOlYLRDEJRIDPFCSLSRLKOLVICKVTREDH

~DLANALIRLYADSRKATCAMAMG!'KLSNNDKKLL71FSELFETRWSLEVNIPLEEALDT

~IIfILAOSPCSCEVCIItAOLINKYWPKACLSK
FKKQAELLIAKGTI1'F'SDLWIDSHPIASSORSDISTYFUiAPSUIAeTwsL7Vwawu~a SSMFSIT~'KOGOCSDCOGLGYOt'1TDRAFYALEKAFCPTCSGFRIOPLAOEVLY~IOtFG

CPn_009t1 109439 110080 ELLNTPTETV11LRFPFIKKIOKPLKALLDIGLGYLPIGOKLSSLSVSEKTAt.KT'AYFLYO

acPD-ATP Synthase Subunic D
TPETPTLFLIDELFSSLDPIKKOHLPEKLRSLTNSGHSVIYIDNDVKLLKSADYLICIGP
P?LKLK1SALLOAEVQNaV%ZMAECDKDYV
FRLE

TYL
' ' IO~IS GSGKOGGKLLFSCSPKDIYASKDSLLKIfYICNEELDS
.Q
.
l KOKLAR
VLAKSMSYpVKL
OAYERIYAFAELFSIPIGTDCVEKSFEIOSIDNDPENTACVt~IPIVREVTLFPASYSLiG

TPtWLDTMLSASKELWKKVNAEVSKCRLKILEEELMVSIRVNLFEKXLIPEZTKILKK
~ 12459 126006 tAVFLSDRSITCriICQVItMAKKKIELRKARGDECV
CPn_009 PYk-Pyruvace Kinale DSMITRTKIICTIGPATNSPF?ILAKLLDACFl~fVAAWFSNCSHETNGOAICFLK6LRE0K

CPn_0091 110071 112053 RVPf.AINLDTKGPELRLCNIPOPISV~Cp%LRLVSSDItX7:aA0DGV&LYPKCIPPPVPC

acpl-ATP Synchase Subunit I
CADVLIDDGYINAVWSSEADSLCLEFMNSGLLK".allKSt.:allKS'VOVALPPNFEKDIADLK

'JRWIHKYLFtGRHKADFFSASRELGWEFISKKCFITTEOGHAFVECLKVPDHLEAEYSFCVEONNDWAaSFVRYGEDI
ETMNKC1.ADLCWPKMPILAKIENRLCVENFSKIAKLaOG

:.EALEFVKDESVSVEDIVSEVLTLNKEIKCLLETVKALRKEIVRVKPLGAFSSSEIAELSIMIARGOLCIELSWEYPN
IQKIOUIKV.iRETGHPCVTATQMLESIIIRNVLpIMCYSDIA

RKTGLiLRFf'YRTHKDNEDLEEDSPNVFYLSTAYNFDYYLVLGVVI1LPRDRYTEIEAPASNAIYOGSSAVhILSGET
ASGANPVAAVY.IMRSVTLETEINIL.iHDSft.KLD~NFnAI4VSPY

VNELOYDWdLOREIRNRSDRLCDLYAYRREVLAGLCNYONEORLH<1AKF:CCEDLFDGKVLSAICLAGIOTAERADAK
ALIVYTEw.~3PMFLJKYRPKFPLIAVTPSTSVYYRLAtiiiG

FAVAfiNLVDRIKELOSLCNRYQIYMERVPVDPDETIPTYLENKDVGMICEDLVOIYDTPVYPMLTOCSDRAVWRHQAC
IYGtEOGIL.:NYDRTLYLSR(:ACMEC'ttaILTLTLVNDILTG

AYSDKDPS'IWVFFAFVLFFSMIVNDwGIfCLLFWSSLLFSNKFRRKNKFSKNLSRMLKMT

AIII~f~ICWCTRTSFFGMSFSKTSVFREYSMTMVIJIi.KKAEYYLOfatPKAYKELINEYSEFPE!

PSLKAIRDPKAFLLATEICSACIESRYWYDKFIDNIWELALFICWHL.iLGMLRYLRY
17194 !huU

RYA:TGWILFMISAYLYVPIYL.CtVSLIHYLFHVPYEt.CCOICYYGMFCCIGWWLAMIrpn 0099 sc hrnrolnq pt'asenc m, r:..n.:rarlnk/t118L
v:: m tt/7J9R
r OA.~.WR~VEELI:VtOVP30VL5YLRIYALGLACA!?fCATFNQMGaItLPMLIGSIVILLGHNa ro m IK:KKFHOtKRTILFJvPLYYLV:7faIlLr:Rlrl'I'R::FI:fi:U:Kt':Ff:FhAFYII::DYRKTAL
t .~JtltIL3tMrJSVIIK:LRWFIEWYHYSFDCCGRPLRPLRKIVC3EDAfALCIHLtINNStV.
TNLALAFPEATFOERYKI ARO::L01IL: ITLLEI.LAIlXIVA':NtINILITIVT::.~.RIIpIOCFS

::FF.V t.~.tfEDt.EETFKNtAEKIY:L
t LF! :1 NoV hlHt.PFI:I ITKNY I',':
t AFAKA I KNORl::K

:1 ayrp r1 l .'.121 l t=57 3 NIFALIEVFIG:KtVI'PKN:;~IFII:Yr:KI:P:tth:f~AtlJl:":Y'1'YM.P't:::PAFT1'FS

.'r.f.N-ATf. ::ynrhasP ::rtbunit fALLr1'IY'1't:PtVIAVNV:Ut~AK~FFYf..::AK1.YANK::1.1.MYF:::VAtIJMIIlMttFLEKCTA
K
~.A
:HAVrA::AfDFI7Ir:KLtf?1 C
A
'txlAf:V\
' .
::UIRrrWIMIHKRNIIIRKI::NVIKKK'lP'Y.:1111.VI"/I~N::::IIF~.:YI~YAIJII:I.'P:x:ITWL
AL
:P
:
LVtGt.AMI(:::AI0.
l1.Y.f:AHt.Y::MIDM::W
I
"J:'t:fOAYA
:1'M~Y:K( r dLLV::
X
~
' t . r :NAII ILF.fLA?t~FPE'l..L It~I.ItNDpL
. I:.AI.I TN "/f ;\ 11~ lnl.Tf INI
: AJI II r YKIIFHY'1'rl'X AVY
MIKNGTLiPV(xIAU:L::
:A
lt:l ItlILIJII
Mf:::X3::I

b:::::a'lr:Kt:Vn\A ( : f Vf'.::F::LFAW::KHYLFY:iLDllt'OAPf.KN::IJt IFY::YIILY.ItKF:IrYNFKVY::Yr:LItl:lYh FALLL.L .

':fv 44rt 11:440 lliUlS
r:rn on'r'1 t:n'.~n :vrr'. I~.~.gLS

r-r:nt hyl.al,.:rtr:.r1 protein Flr. e.dm::r Iw~lnrJ l'trr.:.ml IIAI~V n . n:'rr'Lmk/h'HItI. .n:: ..I
I ll/'//'rN
:KLVFtiLTVI
RYYY
' .
YYwA;:YIf.KI::HFMKIIAf'F:IttMIlJ.::'tl'Tf~rl'9'M't'LLJLKVIfFJth::TfN(JIItMI~IK
.
:
:AI:FKf.ML.DI.NItYMf:::VMDRLGL.IILFIICLLLF1.
:YA::W.
~.RMRKCt.f'VT
vHDE'JYt:WVWISd::LP
:
TEYVr:t'EYaAAA
' :
:
L
:IEO
' .
'1'IAVFAY.::f'AD'IVAI'FALU::1:1.::17Jlil'VLI:.A::NIIIYrJ::IK111KF1Y.TKF' . F
:
O
:
:
It:f.UX:::F:1 :
IL
F
TV:iARl1 ' ' .7:NOE .
AVALC
t:lliN'PI~:FX:KVEKf:I'YL:VNQ::At:IAVYf:LY.f:LEYYEL~:r::f::\

LK'C!'FJGLIAEELMnINV: v!'::F:'."~!'.'KF4FKNI:.:KKV':::K!IKEN:KAL:iELPNN
:

.
ALOKLIOEEIIrVLTIDOPE:n..iJ:00t::'.:PIr'.'DP:Y:\R:::.4.F~J::.DCCLREPLIVE

~.Pn_OlDt1 l3'Jl9S LI788Z
~IARELVNKIN'llIRRNOG:~tQORtALRI,I~EAWIR.~,F.,LOY,I,X.k~,~~rbl 'T'J1l Isypornecac.il DracHm SDFOCQ'81DINQtRIOiLC:.~.'r:rtDO
I'vfILIJrI(75VTITRTLItIVP~
FAf' ' ' a JVULG ' vLFIRNIiPRK
a'fpKKTFILG~.LEINIKFLr SWIXIID
EWISAAMrIF
"

. CPn 0110 :4:'55 14192 JpOLAPSM.
:'JDLHPDpI'VL.c:LOK:.CfGNKKVSLTT1CllIO~II
VNPK
' KPICSPPIICIfEYID LepB-S(9na1 Pepc(daa t KHlILVSVOHEINIRKDIHSVDANDI!'VRLTOY'JTEOTLLTI:

YIiJpKVSGPKEYINALKEOGLELTFNWKLSFEELENNRIAOGSHOEIIFPTPKWIUIILSYPSIIfOtOHYSLJ4K:a PH:LR~I"fKLLK3KKLAHSPADKKCI"ELLEOLEGIFtNDOE
L

fPFFNrFlIDfIJDFOADFLRt.LFLKAEI:TPINLNf.PVFLFFPVtFI0tf8IPLEYSLDPVPP.
ry' ;e...., s zw..'.FAIS'\'\FI:rl~F'rt~S:.:'~Je'!~:~IRP'K
-...
\.

...,...'.r . ..~J._.;- --r-!.i.I: .
_ . . .. r -: .
' . . .. :.v.~.TKSFr::.tn ::
- :.-~

' : -.1 "
':Ir: :: ;::
' ::
., .
' , I ' '.I:F....F:i:T'.afl' ~..;~ 'NF\iLF:.~ 1..~...: W.:l,:i:i.:~I'P....
.\''1. ::ai r .~
:.:Ir ::':........ ;'. Nil. . ...t:.l~KHYiKRI:ML ~ '' "' .

TKTKETTKLYKKEW
GQKTIIDPKOFNOSYGAL:=iy:'ISIIYCOFFOHKFSMQDEPNKLKDPHLSPVSYA<>L!'~Ki NYAlNRILTEHQARTS fG.rplKVYt'EICIiTANLSYpKPLLRNY1~L5PAI0!!8t 0101 129996 117141 TLLPLRKEJILHLIRM4G.'""f' .FIVAOGCAYKYlIOPItIHfSGIAKAYAILLPKVII>4CYClf Cpn _ $KGGYOIGFGEIRYKLILi.:MPLTOLNDKOVIELFNCCINF$SIYNPVV1t11Pts011PLi~YA
ybbP tamilY hYPothecacal Protein ' f~TM~t'I' FTIICIiNL.YI1~SPVFItOtGP'fL.OKI~I'S.~Of%SSE'IOPYIAlYOKGLPPCt>EKT~VE
?S'I'IG!(fgOY'ITOCPSKTNPFDITYYTtPLLEIILIWVMNYt.LK't ' ' fOE
FINHFGIOVPKGtfVLVLv:rITfPNSADSREItGPVPNENL.GSPLCTfS~IPIGWC~.'DCVSA
FID
pFLFLf~IIJIDKLHLPIIRRII2iMllIIAAIWFTIFOPEIRLIILSRIRPtIGKICF

OFVOpLAA5IY0ISER0ICALWLENKDSFDEYLSFSSVKINAT!'SEELLETIFEPSSPLPCTLSGYLVSCIAIJ1TGL
SLICYVYYOKRRRLFPKKEEKNItKK

HDC7IVIWtG0IIJ1YARWLPWiDTfOLSRSttCfPNMAtJGA501IS0ALIITVSEDiCSV

SLSRDGLLTRGVKIDRFKAVLRSIISPKEHKIIIIPLFSWIYBLR
CPn_0111 114761 113934 Ct031 hypothetical Protein ~Pn_Di02 130099 131166 Ot-0NRYPTNPNDSSTYFER-L~OKYLiKK00KTLF..FLFLSfLFSTAFSC:LFASQ!'S$LRT

cydA-Cycochrome Oxidase Subunic IO~I~'S~~~'~P~IEI THFPCIAHKERP$LEOAS~IT
I

FYIOFNKFHDJ1LILSRIOFGLFITFHYLFVPGSNGLSh?Il.VINECLYLV?I~OTYKON7VIIIOLESPSOVFW$LS
SEGSOFFSLIffRTKSLEPVGKSTTVPAFLOIFDLPLSPAPANV

iWJCIFALTFVW'WfCIMOIFSFGSNNANFSEY1GNIF'CCLL35DGVFAFFLESGFLGII1CTIDQIENKPWSPKVSF
EGAPLTSISVNAWOGLWPKDRCPL.S>:fGI:J4Y!'fOPDISVFIL

:.LFGIUiKVSKKNHFFSTCMVAf GAF91SAF~1IICANSWN01'PSGYEMVIOiKOKLIPALTSFNVSIETPKGTSIVR1WDIGHCATSPYVYSLPDSK'It Q
' ' fICAVIVL
WG1VFSP1'fIDRFINAVI:CtWLSGVFLVISVSAYYLWIOtAIOIETAKOIaOtIG

:'L0I1tS71Wt'ARCVAKNOPAKLAAF'ECTFitTEEYTPIWAFGYVOMEKERVIGLPIPGU.S

CPe1 FLVRItIIIKTPVTGLDOTPRDEWPNVOAVFOLYHL3It4.WCVNVALTLISNSAYIfGk1Rw11i.~
VI 9atH-IPetllZl Glu cRNA Cln Amidotransferase t8 Subunitl KPPFLVILTFSVLLPEIC74ECGNCAAMGROPWV~CLLKT'KZ~V$P~SLDSDIGVVIHK>0~71PEYRQVLFtfDSST
CYIfFIICGSTYOSEKTVPEGKEYPV~GYVSVS$S

FSLVFIALL'ILFI'M,CKKIKHGPEEENDLTEFEVKSHPfPTGSKK1VDAEGRVDKFLKRYSINROPAOOPOPEEDAL
PAA10CIOLKWTKRKIt CPn_0103 131465 132511 CPti cydB-Cytochr~le Oxidase Subuaic .
II PtrA-Peptide Chain ReleasieW factor KAKEDROtRILLNSIG IRF-11 NACIf7IELSLTSLLPLAWYVTLLIIAVFAYSFGDGFDLCLCAVYLGP'MD'DIVAEYIiJRLAEVEIKISNPEIFSNS
KEYSALSKENSYL.t.EL10iJ1YDKILIIftkYi.
FSV$

PVWaZiEVWLVIIVGGLFAGFPACYATLLSIFYMPIWILVLLYIFt~SLEFRSKSADOR011LAIEKDPElNVl4.EEG
INENKVCLEItIliKILESLLVPPDPODDtiNI!>:LRAGT

'rJICIFWDIIFICSGTAISFFLGTIVIirILILCI.PLSPtYfSYASLSyIILFFIIPYAALCG11WAIU1ALFVGOL
~IRNYHLYAS$10(rldIfYT:YISASESDLXGYKEYYl4ISG1'f~IIKRLLAYFa "oAFAItiGSCFALI9fTSt>railiARIAOQFPYILSSFLVFM.FIJGASLISIPICtFOAFPfYPGGS
C'fHRVORVpETET00RVSTSAtTIAVLPCPSEECfELLINEKDLCII7TFR71SG71040fIVt0 ..:.ILLIALTSCCCVAAKTSVSKKRYGYAFIYSTIiiLLSLILSAATLTPPNTLLSTVDPOYVT06AVRiTNLPtGVW
fCODERSOfiKNKDKJIIOtILItJIRIRDADIQKRIDiFJISAIWEiIQV

3Y1'IYNSAVZ:TKTLl(S1:LIIVLTGLPFIITY1'CYIYRVFRGKTNPPSIYGSf~SEAIRTYNESONRVfONRICL
TLYNLDKVNOGDLDPITTANV$NAYNOLLIaGi CPS~O1D4 133884 132676 CPlL0114 146371 117261 CTD17 hypothetical protean hamK-AfG sPKitit methylase EIC5~0IStR.ti.At.CfAINSPAIYAJ1DSOSVSFPEQLPSSt7CEIKGMNRl4rtLAPNTVMPTTSYSlR2IKKAI
O1T'AYLDYYOVPLSOCEAT.YII~07LtE1ISSRA10.FI1LVOISlT

OGTIIREFSKGDLYAVIGFS%DYYVISAPPCITGYVFRStVL.ONWOGEQVNVRLEPSTSYRICRIJILIiGORCPTAY
i1JG71VSFIGLRLRVDSRVLIPRTLTELIJ1EYIIbYLLiNB
lu a APVLVRLSRG1'QIOPASOEPtIGKWLWLPSOCVFYVAKlilVANl06PIELYTOR~CI.
.
t EIOTFYDICCCSGCLGLrII>aCSCPINEWLSDVCPOAVAVANBtIJIKS1(OLWKILIG~S

AIAOLINSAWFAHIELEK5Il9EIDLE71IYXKINLVOSEEF!(aVPCIOGLIOKALEEIODAAPYTRPADAFIR:NPP
YLSFNEIINIDPEVRCYEPWKALVOG51CLCFYQAIApdfltlV!

YLSXSLPSONf$IAS$OCSTPIIVSSSIVTTSLLSRNIAKCrAIJffAPLTQCREIG.EYSLFS'fGVIRiLEICSSOG
ESIKNIFSKtIQIYCRLIIQI>L9GRDRIFFLfI~GRDWS>i~Y$

RIWASLtQOGI~HSE11LT0EAFYRAE0K10(OVL71GVLEVYPtIVV10~8(PGDYLLIWOENTIA' FLYCISINLDOW<GKRVTVECLPRPIRMFAFPAYYWGI1IFJ1SCPiL0115 117779 118632 CPiL0105 1318$3 134039 Cfh-Signal RecoOnicitm Particle OTPase IMNVKDFISRVImCIL
V
ll CT016 hypoehetical protein iW
ALJILit MINSLSOKLSSIFSPLVSSRRINEC'ITSESIRE
A0%11TJ1i11aJ1 YVPFRItFSNpNPl2LIYCKIO~ItiiQwPOTAKIRFTPKIAIOMCTNDOLICIPpFISIUtwGELIf001VSP000FI
RCLRIi~.VAFLSt7CREEFTIOKTPSIILf~CGL0II

' SOI71FIESOEGatKDOGTLALfILIOCKIISIPNLDOSIIDIAFOENLLYt~fSO~SAAVOOLKILVAQTKAEFYOSO
ENKPIMIWK71T.~YJI
DYVI10~1K1tAKKVI'WPCtILIOtP
iLD

Rt7DDKLGVGYII4~iVL00ITKGNDIOVLPKM.TSPLFS17TNPIFJ1ILON1'PCNKdlPDAP?NGN1NVILD?AGR
WIt7NELItdELTAI0KVS0ANERLrV!lHAIICQDVLIITV0111001 R$FDPO&IAiRIIGIGO'1'I
~fDGIIARAGAVFSIKHVfGKPIKFDCCCERIO~

tMlJOIADVIRVLSG4NIltLLPRPEPIICOICRVM4EEDTLAVSD~t.TFRIWDIN.
LT4IlIL>i V'fAAf'fYEI7YYKQNKAFMfII3PIJIKLL0IlPOI~AKP
GIOC
EDAEt QSCDKLYIVTNPWPSOOFSVYLCPPIGC'fCGEPNCEHIKJ1VLYT.
.
NIVK~tREYISEE
pITLCDVNOPItIOpIIS
~~ ~

CPn_0106 135073 136371 g ~

phoH-ATPaee EIIVRTOIOIKI2NIGCSVFIYDPEALFSFENTRIIIPFPVIEELF~1FGKFRDESAIOJASRACPn~0116 11!592 148971 eyAKTKVTpGyyLP5GS1LLRIEVApLSNDDRRGKLLTLELLXIIAIaIEPNVFrsl6-S16 Ribosomal Protein .
LSNIRLr~

.
EICJ11RR1C$VALKIRLROOGRRMiVhtALVLADVESPADCKYIELLGWYDPHSSINYOLKS
VTKSLGRRVRAFJILQIESRDYESKRFSFRSLYRGFRELQVSOEDIm~IFYlOdCYLI%.PLDV' VSSPNEYFFIISJIGENHFJ1LGRYYVSECKIIJILKAfmKSVWCIKPGNT00RCALDLLLRDDOREE
EAIFYWLERGAOLSSKAF~LVKOOAfGVYSALISKQEARKLVYR!(KRRIIYItORR$1 VKLtIfLIGOAGSGKTTL71L.AAAIfiIINFDKE1YHKVLVSRPTVPMORDIGFLPGLIIEDKIa!AAIIDiITK

HwIpPTYDNMEYLFSIt~IpNL-ItSSFJILOALNDAKKL6NGLTYIRCRSLPKAFII

' CPn fGNNFNTA _ LTPHEIK'fIISAAGKGTKIVLTGDPt'OtDSLYFDENSNCLTYWGKFttHLJILcrmD-tRiVJ1 I9isanane N-11-Nechylcraneferase TERSEIJVAAAATIL
'IiGMfIDILSLFPGYfOCPiw.'"ISIIGMIKORLLDVOLTNLRDFGLGKWIfQVlM7fPP$OCG

NLIt4AEPYTSAIRSVRIIFSISKYIYLSPOCA1.LTAEK&RELAAASHLILLOf~IYOCIDAIA
CPn _ IESEVDEEISIGDYVLTNGGTAALVLIDAVSRFIPGVLGNQESAERDSLENCLL~POYT

~f05B hypothetical procein _ KKSPPPVTPKEIPfQPKPPIPORPEVSPTPTDHIVPGSIEASPILCKKPSPDSlIVSPLSL

FHKMLLENWTPVEEPFPWPPAEKNOKIFAWALNOSKLIFVSTSCTIIAOPRLVTDSNSIIMITNRDHFKCDKISSNLEV
NKLKRA10IFYCKVFCLDAtISCENKFCLPItEOKTTIwLR6V0AE

VNAANRTNSRDCAC'INOVLSAAVSVDSWGLSORPLNPEROGTPLNOCECPAGMWPNAOGSKKNIVTLSLSLOCACEEC
Fs:YLLARWELFCGKLLiKQADIaIiAVWALAQDLOCNAWIFSWH

NHTGKQf;KPNYLA44LGPKAVDHNNKSpAAFDRC10JAYLNCFSLAQTIGV'IFLOIPLISSRIIK

.'.IYAPPf3dRKKPNSEENKVRMRWIHAVKCALVAAIpEICNEPf.M'DRRNLI'JLTDLKTPA

ITOPKKIL7HL CPn_0118 r119-Lt9 Ribosaalal Protein 010R 177857 137303 KKEH!'RNYIMMLLKELF~
EsQCRNDLPEFHVCCIIRLATKISEOCKERIIOtIFOCfVWIRR
~Pn _ ~ENSLNRVAYGECNEKSFLI~1SPRTVSIEIVKRGKVARARLYYLRCIfI~KAAKVK
Ct'018 KNLFNYIG1ILNSIFNEEVFIISHRHTPIGQTSTALRIfIPLVNPLFIRTNLOAIASYIPIFSEFVCPRSSKK

TFIGIKTLKGIS3LOYSNVLIrfCNFSSVCKTLPCPEIYEELP1NRKEANLEIfGIKALIY
1511si4 9 15~52D

LVL,~VIKIIKLIVRYLCPCCRPPEPREPONPLTPTPLDNGOOIDAIFS1'PTSPT6FKDPF.
CPn_Oll LDDLLOEDKKKAPNL rrihe-Aibofsuclease HII
IMNf.itSEIORPLS?ttAFEKELVSEDFSWAGIDEACIIGPLAGPWASACILPRCKV!'PG

~:~ 0109 138e46 (11783 VNDSKKLSPKQRAQVItDAIdIOOPEVCFI:IGVISVERIL>QVNILEATKGNIQJ1ISSLPTS

ilaS-Isoleucyl-cRNA Syncnecaer PDILLVDCLYLPHDIPCKKIIO~ONISASIAAASILAKEHRDDLNLOLHRLYPEYGFDRH
' RQKIffADEVf:Y113PAKKEEOVGKFWKDNOtFEK.iIANROCKTLYSFYOCPPFATGLPHYGHAKSP.iPIKONCAI
V
KGYriT.~.GNEAIRRY(:p: A

IILLA.:TIKDWr:RYATNOCYYVPRRFfWr DCHGVPVEYEVEK.iLSLTAPGAIED!'GIASFN 1'1125 111779 %

EECRKtVPRYVHFIdEYYINRLGAWVDFS:"ISrK171DJ1S6?IE.fVIIWVFOSLYN'~LVYl7C'fKn ill3D
CI

'NFF.'TAIJ;'fFLONFEJia1\NYKEVDDPCLWRNpLONLL:ASLLtMM'TPkffLP.."lBIAfAVqmk-J.llt' Kin.sur ~ LKLFTIwAPN:Vr:KTTLVRMLEQEPSSAP
' EI:LF:IIKNfJI:YL'fWKtU'~::PF.~.pOIN)ICCI' fEFPFTPf .
DS EFORI.I.DROALLfIIVFLG~f:pCYGT.iMLEIERIW
:I:TLY'l'JRtQIW.K:X:EOWtLSQcxI/OAWF::NPEEFVTLE;:F~KDLVr:RTFRECEVtt:KCYHlV::HF
~ 'CI::V'ITAY
A!

.FTEEIP .
Y.rtEE~\FNVtNL:FVF.E.:OLTCVV111MPAf~CEC:DFLVCKENH1/PLVCPVDA1N:..
' .
'.~EELERRLA
SRt::EFr~.ORKERLFJISL
IIAVAVtDIC~Iw\LPIF::RNP.~.V::IFIAPP
::II:Y

JAVEKTK .
y'ftfYJlIt:HADKEIIKFLKKECRIFYIfCNKIIAYPFt3rATUfCLIYKAVN~iJF.
.
' ' ~ N(HLIsNJ: B: IIfYNpElI fUHa :Rff:K4lLlI:AAISYIAAYRVLK:: t f t Al:f7lAtt I f.
I::RNAYWt.TP t p fYiK::AU:E ILWr'w~I filJlAAlK7F VN I INDDIIw I

ItRLEEL'D:l! ~ t'tU I IIHI IP I
OL'LN I VK0.:XPFIIR I PYVFDCWFD:x:ANPYAr~t iHYPFENOK

I:'fEEAFPADFtAfi:LOCt'rR(~IFYTGTVI::AIC.FORPAFIINAIVfK:ItLAFII:IIKNSKRLN1.'I~r_ Ulal i'i7n:n I'.11.'if NYI".:l'Y.'NLIIfYr:AIMt.RLYt.WC:WIfKAEDLitF.~uGKC:IFYNLKQILt.PLTIIJL:iFFNTYt.'l tlil hYt~rlurt u:.nl srnr..in ' f7lINlKKIN<F'ITII:KLNKI.t".':'.f'F::LVtYIAIKVAKIY
' fAKf:INIt::::NVAIl.'fL.VLLDRErt JTFIDDL .
fN
fTf:I:IV'JTA::ITIJf:Ai:H::l7rrN::lYKftl'::AY'IW::D'/N
Al:I.Y:FDfK:7JDIl:PAYTEtLtWIIL::Nf.Y::VVt:KVHF-':M:aJYIILNNAVEPFI'Y
' t .KVIApFVPFtAI:DIYOY.LKt.EKEPFS .
wYtl.W'NHItlWI:AEDTCMiI4VIF::TLYCVLTVFiJ

VIIIfhf'I! NFrir~K l LPIILEKRMIIDI
AI:IVCtt:II::LRKEIIKI*VAOPI.ANf"f~JJ:3KDAL:i 7$

ALLKNPOf:I~ I KGLKrjFL/: _ ~:.~
_ "f'n; l 1.:.: ' 5!0:7 1 : J7.'.
f mntr,Nt)fntmrYl-CRNA ::YntnrtC.tao....
::ALPYANGPIJIFrfItAGVfLPADYIARFRRLLGODVL'lI~~3DEFCIAit.'Pn 01?~ . ..1.i.61S84 tt nLsG56l , p~8.~
.Dr~rsMtn..lVi..l7MediWflitIBL arG.
IY~ .S
ac ttowo'IuIa ':K1MPOKVL:T .
TLNADRBCU:yOEYVONYHKWKOTFEKLGFALDF!'SRTTNPPHAELVODFYSOLKASGLNo rotm NIEFAPVPHTSYTADR:EDRNACRIpIKLSTLAITSLCYL.ISS~fCIN:O:;.:ISCIVGTY

tFl~KtISEDLYfQEORFtJIDRYVFLRCPRCGFDNARCDCCOSCGADYEAIDLIGPKSKISS
ALVACVfFLYFFYFSSEEPKGASSOEFRFLFIPAWSJ1LRSYEYISODA
FWfitIFSVL

'VELVKKETEILiYFLLDRNKDALLSFIOr'rLYLPDNVRKFWDYTOfVR.iRAITRDLSWGI.
A
INDVIKL9fNOt.~':ILiSLLDPEAPFLEPPYFNSLIVNNSNKEADRLSRGIFLI:.1GEI'ISiK

PVPDPPCKVF'fVWFDAPIJYLi~'NEWAAiOCNPDE~IIfRFNLEDGVLfWQFTrKONLPFHDCETKILPWLKDPNTT
PDGfVfKLLKDNFDLKDFKKRIA'IWIRKJ1YPEIRLPKKNCLDKS:

~NFP1INCLl:OYLD.fKK'JDALYV.'.EFYLLDCROESKSflGN'Nd~KFI-SSYSLDKLRWL. ... "~, fe ,FFPV?11T: a.~...rJf"r"r....P1P1P?~.'!.'!1~R:~F
. -~Y.:....t....

o , ... . E . , . ,.
W.\EY.!1115~~ .. D..:~. . . .
.., ~ 4h:rl!NI ..
. .
' M

. : . .
. 157349 166561 ..y y .!
" .i:RY~\I
' .
r,~~r . ~:'; I?K'': Al4J~:..' '/ALWFl4I::FK5LE?ICNLI7fhfYMGL.iIKEEtLDVINEE
' lPI IPEiA CPn~OIJ3 :.F~:AI:yCpKLLrli.i2 FNLKSPRLLF'TfYE C11LP5 ttYDOCl~lcal protein NRIJflOV1'VCRVSIRTSCIKIRN
if NSSAYNPKLLIOrLfLIrPCCIVGYfWNRitC5IVE0 0123 155775 15377.1 ICItMP4ISERFPYAACIEYADVRtSSISNLLTKOLEISfT.IIiICJINPTIFPYDSNC!1KT
CPn _ NWSLVW!~POK~'P~IDRAPVLIRRCLFLNfRLYGLRANNKDIPNLSVPSLFJNS
recD-EJCOdeoxyribonuelease v tAlpha Subunicl NSNEKICfiYLEOILVEt4KDSGDZTAYIKIPNKT'fPILIKCKLPOPLELGSPIOIYGVWSNNTSSAKE<'P!(LSFJ
II'PSLL'fGAL6ESLYNLNLPCDIIKPLS00ANKNFYSSYPQFODRW

WLDITP DINTPC1'P'fEEIICFIRCLPFH
.iPSM'KYFOIHSYDSPLLYEYRGVFNYL?SKLIKGIGPKI11BKIIEKFOE1(Ti.

"NLSCVSGISE:RCVSTCKOLCEOKILR1CTLL!'LDEYNIPINYQVRIFKIfYOEKSIEKIC
r . . _ _. -. . .
EDPfLLARENECIGFIfPADFIAIOfLGSfPRNSESRLCAGIONSLEELO~YPI1Q.LI

a/VAKLtlJQwFD'tPITLEEID'fpILP810KRKLWI0DI5~1'LHViFfRYWLAEIITIVSD

JIRZLPSSRRIRSIDGEKJ1TAWVEF?1LSIDLAEOORGIKACFSEKLT..I:GCPGTCKST

:'.'QAILKIFEQVrIfIII:LAAP1'GKAAKRNTEITOKHSVTIHAiS.05fOFKTKSFRKNNONP

tDCDLIIVDESGHNDTHGLHNFLKALPDY1?LVFICDINOLPSVCPGNILKDLITSNKMT

'JLRWKIPROVNDSCIVfNJINRVNt7GELPILYSETCRI~tDFLFFOLmDOtF~IiJI!IINLVT

KFVPOKYN1YPODIOVLAPNKKCTfGIYNWKALIUiALNPKKANLNCRfOSYAVGDKVNQ

IRNNYNKEVENCDICYVS:INFEDKAVWRNEG1QIVGYSFSELDDLVLAYATSVNKYOGS
' ESPCIIIPINTSHFNMLYANLLYTAITRGKKLVILVCI'KKAlAIATPfB'tRV0NAC1 =?r>_0124 156575 158068 Genebenk/E!>8L as of 11/7/98 i n No robust tJOmolog present 169N8 169143 IRSKORTVAITLLVLGILLIASGIIFWVAIPGLSSAVALGLGCGMI'AILTVLLTIGLVL

LIRSEKLALEOVEIKOAR'fR~LDOLSOYVFYTEIiVLDNt.KFW&YR~~FVR~OE',;:
EI CPtL0135 proES-LORDS Glapesonin TNLEODIEEIFL:~..RDIRNALDNEEPFNTNAKOCLAOVGFSLP'ODASIDEFIMJWLShIS00ATfLRIKPLCDRIL
VKREEEE71TAR~IILPDTAKKKODMEIILVfGlGKItTODCf ROHLDINDPRWSMITKICVIICIINRIIYVSTNYKQIK9JPDISDfGQLR~4.L~!1ITIEELLPFEVWQOIliiIDII
YACOEITIDDEEYVILOSSEINAVLK

VLYOSFOKGYNRAALLSFJ<TAIINTSSLLIBaEKDEDIDILNIRfIfCASRL.aIFRxPRTLFL
P

GLSEFFnVVIDFTDASG4DCSKLPAKEVPLI)GGKKKLNFKRTFADCQVGDWDRTTSLCPA-0136 171119 OEEDPLDRLImQVEOFATSVLKDODRYWKEIETSFaKFRSLPREOS0IDSIlIItDL

DDHLSVW11NOLSAAEDALIEV:'DVOBH~tRF~iL104I00GLELIEDAVKATLPRVDFIOELpepP-OliQOpepcidase !!i'f>~%TF~PKFICIiDTIflfIfANREEWKKDPDLCSSI~PSPItIPEF
KCVPSI

LEKEELPLVAARMSLENS , SPSIfYQII7NPESLLEIi.BKItFSVOIKLDDLYIYANLINDODITNP00ESDY0SIVYGYTI.

FSOEISWIOPALIALSEEKYAAI3SSSVC~PYRPnF~IIEALSPEfI~'A~KIL~FA

CPn_0125 158072 158605 ALNVSNKAFSSLBDAEIPPGIAKt~NCEatPLSNALASLYhOSPDO'AY
f 11/7/98 No r~usc tromolog present in Genebank/EtIaLYDYR?IfPANLfi' rif'INOAHLFEAKARNYPSCLFJ15LFONNIPrNIfINLINCll00lTSLIN
as o VDLVCICSLLPIr171YV6IL1~L
KISSCAEINSEYKPLFLI~fDSFDIJ1TORFQIIL.It~G.QEOAEIYNEYEI0C111RWNEIKEO~

KDPVIQtCIEDPPARGL417L><rT'f-''TrRDFttD%AKALTSl2IECPCIGtYIfSIN0E1nOR0.
tE RYPNLKIIIaIld.KtPHFYDVYAPISQ1TSIWYBYEE
D
ISlAIWVDRYLf'8IN1WSGIlYSSCCY0SJ1PYILIiiYltFfLYDVSVIJWIJION9O15rFSAP~I

O OPyNpFlpypypyAEIA81'FNDSd~IEAtSRSDO8KED1IIVI
ROERL.QKNAHtYRDCKOVLEAVQVEQKDNISSRWVDDSYtEEAfEEOKVDNRIOITKTLDI'IFATLFRQ'!!TA

AFCYEINSAAmGTPLTEEFGSATYGM4>fEPYOCVVTSDSL.SALEiOUIIPHFYYNFWY

CPtt~0126 158806 161085 pYATCIIAALSFAEKILTDEFGALELYLKFLKSGRSDFPI~IILKKSGLDMI'fSAPLatAF

/98 AFITK1CIDLLSSLLS~
No robust hoalolog prssenc in Gsnebank/EHBL
ae of 11/
~LLLpKIOPK
V
LLLL

L
. 0137 17=263 171502 LLVPSYYCNCLFFFSGAISSCCLLVSLGVGIGLS1LCCPt1 APDLLDLEDASERLRVKASASLJLSL.PKEI~LGRYIRSAANDIaTfIK'LDiPNKDORLVZTV

SRKLERLiIAA0N1MISELCEISEILEEEEIOILILAQESL6I,lIGKSLFSTFLDIIESFWLS~
il NLSEVRPYLAVNDPRLLEITEESWEVtfSHFINVfSAFIOtAQILPXNNHtSPl90~LEb~fOY
YbDI-11CR tas L4&NLETLLSSKIFODYOPNGL-0VGDPO'fPVKIfIAVAVTADLETIK011VJ1I16 ELLFTFIYKSLKRSYRELCCLSEIQBIIINDNPLFPWV000pKYANAIDIEFGEIARCLEEFVC31CNAD
ANVLIVNNOIINKfiItPYPi1'CNIMIRIOLLIEtCfIOLIAYNLPI.DIINPTIGNlBiRIfALDI.

EKTFFNLDEECAISYMOCWDFLNFSIONKXSRV~t01fIS3'ACIAL.K17RARTIfAKVLLEEtiIIWImLKPPGSSL
PYLGVOGSFSPIDIDSFIDLLSOYYQAPLKCSAROGP91l1fiSAfILISO

PTCGG1CIIE.OOIIQRAFEROSOEFYTLENTLTIfVRLCALGOCFSOGREATNVRpVRfTNSECAYREZSSAATSO'1 IDCFI1'G~iFDEPAWSTALESNINFtaf'CIFfATEKVCPKSLAfiILKEE

NANDLICESFEKIDKERVRYOKEORLYWETIL>RNEOELREEICESLRWNRRKCYRA01!DA

GRLKGLLROWKKNLADVEANLEDAThIDFENEVSKSEGCSVRARLEVLEEEWGlLSPKVADFPIS1TFIDTANPF

IEELCSYEERCILPIRENLERIIYLpYNKCSEILSKA1(FFPPEDEOLLVSEANLREVCAQL

KQVpGKCOERAQKFAIFEKHI0E0KSLIKLOVRSFDLAGVGFLKSELLSIACNLYINJ1WCPn_0138 -Glucaleate-1-seaialdehyde-2.1-awinoniucase-'Itemt KESIPVDVPCNOLYYSYYEDNF~IVVRNRLLt~IfERYt7NFKRSLNSIOFNDDVLLRDPWO.
TNSRLFLAIImOLLOI~WKLTKRNl;~ICSNOKF~'VTFEEACOVFPOGVNSPVRIICRSVC

PEGNETALKERELOEZTLSCKKLKVAQDNLSELESRLSRRVTPPIVSS71QCDI!'L~fIfCREFIDFCCDWCALIHGN
SHPKIVKAIOKTALKGTSYCLTSE

EEILFATNLLSSLKLKEHKIRPVSSCTFJ11M'AVRLAf~ITNRSI

CPt>_0127 162152 161130 LCGI57TECTIDNLTSLINtPSPNSLLISLPYNNSQILHHVNEJ~IGPOVIYCIIFEPICAN

ycfF-Cationic Amino Acid TransportertIDIVLPKAEFLDDIIELCKRFCSLSINDEYVfGFRVAFQGAQDIFNLSPDITlYCKILDC
ESFNPPSANOESRTRNVPLGIFtiGLVACLYWGTVPVIPNFLGSFGDLDIVLTRYTIFCIF' SLIACAIKNPSVIIDI'1'PLYIWRKSLLWTLLINPVYYFCITLGIRYVGSAITWIASLAPTIFOACMSGNFWNATGHA

CLPJW1LVONRSILDNI?IPF~'I
' AVLYNSNT'KOKELPYSLLFAISSVIITCVILTHLSAWLPTAASPLYSIILVTAVILSIStNFDEAIOJStrifEICfQ
TfYSEVPONG
LFYSPIEEEIRSOGFPVSLVt~GTIIFSLFFTESAP

LWVIYVIRNQSLLElDfPNLTPD1WSYLICISALITCLPNIIILDLCCITHVTNNLISHTPVYLSPSPLEANFTSSAHT
EENLTYAONIIIDSLIKIFDSSAORFF

PSLOECICIFTIIi.CGSLLCLVLFGRIfVOKSLENSOVSSSNECPn_0139 SPA' KNItLRDIMCIPYARLEKCSLLVASPDINpCVFARSIfIt.LCEHSI14CSFCLIWKTL.C

CPn_0129 16226: 163057 FEISDDLPtFEIfVSNHNLRPCNCCPLOANpt4dLLHSCSEIPEO'CLEICPSWL.~DLPPL

bpll-Bioctn Protein L)gase QEIASSESCPEINLCFGYSG1'lpAGpLEKEFLSNDWFLJIPfBJICL)YVPYSEPEDWALVLKO
EDRCRNLRNt7Vf.WCSECVSPYYLRHTIRFLKWSTODCAFDTIRVDCNFLIIaJPFWEET

TRLLVFPGGADRPYIfRVLHGLCTARTFOYVSECfR,IPLGIC1GAYFCS1WIYFYEPECAPLLCGKYA:aLa'-TVPONLLW

:GARDLCFFPGTAKCFAYRGNFSYVSPSGVRVSPOLFSDFCLGYANFNGCCFFEOSECYP

.~.JMIE.iRYDDLFGKPASIVSRIVSKGWVLSGPHIEfLPHYCRMVKENVOKTREFLORERC~ 0140 TTLDRYCOt4LVQRLRQPAFSKAfIC ~E
PRSNOOKIFCNSLEKELLETPLVLLNF.IKLVSFCNIACNILGTEEKKFAIYGHVSIICOJI

012J 163747 1ti3064 FOCAOTE.HSPORPFAHDLWFVFSCFDIOVLR'NLNDYKDNVFYTRLFLEOKDREFLYV
vfm _ VWDARPSDSTPLALTHKIPILCVICSVFDAWPYEE
similarity co CT036 DEQYILSHIIMDPRIFVrSEPLOKTYOKLQEKHVNNLGIASQVSLTDLONKTQYtTfIJLIE

TTMEITYYFPWIMPDILRSEWDPISNOLYLIFKKFFIHYHNLPSTALfRiJQTLLIDSLCPn 0141 NTG~SNPTARONELL1FLCVFEOLDYNEDEYTIEPRGYFNRFVYKNSOTAPOIOSFCLLHrpiA-Rtte!-S-P
Iicmerafe A
' HSSSAVEYDW W EKKCL1HEAATOVT:

W iR IOfESLAVHA

LTfIS IVLCSP ILYOL ITEFClTKIHADDFOCLII41~.rONa~fALAKOLJ1I PLW
PEKPSSLDLTVDCADEVDPOLRN
I IKCCOGA I FREKILLRA
o f CF?L~YASNN L RN1.

Pn 01)tl 164251 167751 ANRSII:.'JDESKLVFV4:RFRVFLEI3RF~IRa~AIIEEIRNLGYEGSiRWDI'COLFITDS
:' .
::NYIYLLF3PNSYPNPEKDLLKLIOIHr:/IEWSP/LONEtIW:SN.4pGLI::KKYSV
Nu rotxmt hrmwloa pr~senc in uenehfnk/f7tAL
au .~t IL/7/~8 .:::MVKf::::: I I I IENKKP.k't.LFESKF11 I'PKLSL1I L~LFLG IANI: I L I ALSf:LLI'NCLLl 175914 I 4% l:dl' Ir\L:LI::It'Jf::'it:ILf.P:TQt'.::K.~.VQKDEOKPK.:IFPKBfP.:LDPWLWM.KNKIQa:;s CPnIIl u: ur 11 ~nk/f:NflL
rrranc m :wnet mlwl t t FTLLLDff::INLKNtT.i'FN:FEfIiKKIFLKGPDFLIY::ALAN41KILE.
.
, v .,r.:
c No rn ' ' ' ~
~
' .H V. iIIDNA 101( I R: f1 i LKP LAEN
KN.~. L LIfPr.
:HFE t.

.
!:H::Y::Y::Y':'LLEKFIIFK I LvLL..

mCynltl I~.44.1L lu,'.':RO
ftlNfl':FYDLKiDYfKCh:KRFRFLY.~.I':PIfLIIYLWF:IF'IT

t7.. rAar:r Innw.l..t I,r..r,,au in .aenrhmk/FlAlll. .n:. ,r 11/')lnk.fn!)l4: 1.'::47 1'/r.14 ' ' ' . .
:WIIJfT:IV/AQ t IWf>,nttJCt:.il Pr.m.tn .::;:LYKh:Rh::l::1 \IhI.IPFF'l::AYVFP::If:FLt'LFHI7JAH::::::hVYNaB-' "Yxlti ' ' ' ' /FV1.TIALIMI \I::LVLFLLIRSV _ f::VMt.FLU:1A _ LI
tR'fftftf..LKRf'LIf:01FUl1TL~.FLRI'EIILYKTFE:LY.Of:::I::LWLNUIhxJIALUDLIKK
r7/YY
Y.IH:Kt:IC'VLL:.
AFIFCt:\
:I
llEDP:DpM
HALWP
:
' ' ~

~
' n r Mt:L::hI'PU:EhItRA'IwIIYLFHW:F'IY:H:uItFATEtNFFtf.EHANfIJLfIYLTDKf:Y..
YIKV JK
:
~
.:F:K.:\:\IJWIIhIAa LINKIS
rrL
fi.Ll ::l IC:::FNUkLw.

:Ah'fLDDWI:O
PI
"
' ' "

.:
IIIII'hVLIIFY.FYKALEUEFTT\YyTLPAINrFLYrhIIFMIhIfEVTRKFYf~'ELfEDIVA
JI~YY:::HIF:RV::1.I.V
':1.1.1::IWfYI":::VI~VF:ALLI
.:I
~'Ehat:K1 lLf :h'uV::UAMF:IWRY
:M
:

L
Y'fr:Vl' UI
r ' ' .
':YIIKVfIGI:fDKX'ItYl.~tl.ltft:TV't:l:llnFlr'/r:.'Wfr:IDEY.r:II~ULIyOYLLIMJLVIA
D
:
:
:
,:VHIf1411NI:F:I
LYh (NKI
v\HF:IIv:IVUfKIIEfN:
E
L
LIe:AId:l'C::ha.la'I'Itl.l:a'1'Yt:INVIHU
\::Ft?!I'KG1/1U::UF'LW 1'KNVI:Wt:KFI:iAI:I:K

OKYMDILGDAPVSLLYUAFA 1F:.:lHt~:.taAit?IL1AEEEAKRYVEEK(~CSPIT:' Pf'(>OL'PrttiJIVr:Rf~IYH.~.KFFA:x7.:lDFtAKPLF.WEYJ11t0ALFJ1LAAELDOIltNOL'TL:.EC
E:PLANLK:iiFSDLNGRLKVSVEKAiILEEEiO
:'DCfYLEFDNEK iGDF3PLTFI

::f)EK'CIP:II:L'r:SKTPTLENKOEYfARIfIpAAOYLPLERt~iLiPO~FA.iCEtGMLTEEGIOEOY~ImRED
Lt?lt~i"LO~AIDfwI/DKQki~LG[rhCiEE
:

EOWAKVAt:~ItEL iEEVtdK .

~pn ~1t44 17794?. 190560 CPn_0151 1?4179 :?2i25 :100-t:Ip pt~cu.)ae ATPase DAVSFrILEKAFELAK.iSK)fCl'lTi?1HLLL\:S.EMP=:.FYLVIADINCndtpA-lbnoo>NCfanase ~
CY~LPKYPiLVICI1NP'K:L:L1NMLLGHCLSVKVIDNRASPEDPSF'..DCRKLP
M/LGVNFMEKF
' . ..)n~...-.r-- ..,_\rhn... -..,rA.vf,e-..ryr-rt~..._r.,.pr~L..n'.~'Y:
. v REP'N'/t:EIfDPKPSFr:LCTLLRLIAKOEAKTLCD~IISCOH1.(.I.At ~
ftAVKDAL
A:Lf . . . ,.~r.:r. : 'N : :.\.._:rl:Y
. ,.P.:: h ",: . . .. .._.F'::':
N ?:.' d ' .. ..". .~ :.i"."'- \:r\f'.' ~ F
.::... . . : 'Ill i:w :,:rr-.. f ~'1 '~

.. . .
. .
. ..f. .
'::Irl:.N:.: .
._.,,... ~./:-:I:;,... ,...;v.: .
t111 '~
:.'i~~::..:.:'re.'sJ,ii:H:.:.i::KaFI.IFV
P1.
"n:4hE:it' NLDLROLVK:.'r:.~in KQLYVLOMCALIAGAKYRGEFEERLKSVLKInIE.:Gt>:EH11FIDEVHTLYGaCATOCAMDLCLPOGTNSISPKLKS
.KT'I'CfYNLVIa'DENFHIKT3HHAFPPEI~NVLFLGSLSNtLLLS

AANIiJfPAWtGTLt~ICATTLNEYOKYIEKDAALERRFOPIFVTEPSLtDJIVFILRGLRYI14DINtttINAAFHtr IWKLLP~'KK~KNLVITXDGCOf.TIILPYISPT'ItBtAAt~.PFS

EKYEIFNGYRITEGALNMVLLSYRYIPDRFLPDItAIDLIDGASLIRNOIGSLPLPIDERFYTPAI~tYYFLK0.'AAF
HtTCEEYYYPPHQAIJfYASSDIIAMSP00AEIHGPGPG101AI
GWD~.I EEYGLrWIEICNVKEPRIIld.YNJINP
LREELASIJti PDLXEAL

. .Q
KCAELAALIVKDFJIIKRflOSPSYOEEADAMOKSIDADARL~SFLLDPLKSSKNLLIFFKDI
E~DE

KpIgLEa~'ptf'gEgtaIERVADYNRVAELRYSLIPOLCECIKDDEASLNOAI:t'iRLLONSIJIIRPDRYIGYR'1 'Ifl'FKLNELISYLLRIFASEATg RLIAOWAt~t'GIPVOKNLEGF~1EIG.LILEESL6CAWGOPFAV811VSDSiRAARVQ1'tD

PQRPt:CVFLFLGP'1GVGKTELAKALADLLFNKIEAMVRFDNSCYNGDfBIS%LIGSSPGY

CPn 'rGYEEGGSLSEALRRAPYSVVLFDEIEKADKEYIHILLOVPDOCILTDG~.
..
CT119 hYDOthectcal Drotein FIM'SNICSPELADYCSKK~SCL'l7tFJIILSWSPVLKRYLSP6l1l4RIDEILPPVPLTitELIK~~VS~'~AICAS
1NIPVIIVPGFPOIPEDLYOIXTtI7~CPA~ICLAtIfII~D

DLVKIVOIQMtRIAORLKARRINLSWDOSVILFLSEOGYDSAFGARPLttRLI00KVV~DtDtt..IGVI2Ii.PN1'P
TPtOGP'V1WL.F'NGFRGTI~C..'iLAYRKIGRAFAAVGIA2LRYDM

K~1LLKGDIAPD'ISLELTMAKEVLVFKKVETPS AG~DSEf'V, AEEYPIErYLRDAQl'ILF1'VQEHPDIlIAYRL;sISGFSLGCHIAFR.iIKIYN

PRDIXIXALSVWAPIAOOCILLKELYR,tFSKHGECDIISIrCKI~GIGPPPIItIC8CD4I~.

CPn_0145 180717 182369 LIRIWfNTA~LPTKPYILHQOCIODTLVSRTpOTLFKNfAPL~tT!'ISYPM'OttI1t.11T

CT114 hypocnecical Dracein APDLOttILOClIVSNPOATL
tKTYDK1U15 NCAASFIWLNKSSNRNLRSPMFKSFIVRYMIYOGLVSFLLPIPtH.F.CAtil4V
WNLDPYKLESLCAYOYLSSKRIAFlFPQIOKD~PIPATM195130 197892 VISRDLIG.OEDCOKF CPtL0153 :LTLCKVDRGFSPEEISLIOt(LSYPGLSLASLRCS1'EIDPNTtHJ1R11LWSEFSCDI.A~1C4leu8-LaucYl tRNA BYnthetsse RADYYSNCLDILALRIHAERORYLDDSPCVPC1'SEFHKATItAINI'ILFYtFAYRYPSKKt~tYDPNLI1D00a0QF
iiKEHR5F0ItNEDEDINKYYVLDNFPYPSCAGLNVGMLIGY?ATD

EHFSDEFSFLSSVTDRKFGVCL.GVSSLYFSLSORLDLPLEAVTPPCNIYLRYOOGBVNIEIVARYKRAAGPSVLHPMO
WDSFOLPAtOYAIRTGTNPKVTTQ10JIANFKKOLBAI~PSYD

TTAOGRHLPTASYCDCL.DLE'LOVATPEEMIGLT!lINOaSFALOKKIfYKFAtGY~D~EOGREPATBDPGrYNWtOi (LFLPLYOOCLAYMA~IA~PtLCfVt.BNEEVE~.FSImG

YIr'u'DEEWELL.GtVOIt.GGKKKLGASLIGXSPPASORCSVAYDYLIIGAINI?TLALLPSYYPVAtIOQ.RONIL
KITAYA~.LECL0AL0YfPESIVKQL0f0'MIGICS~ALVTPl0.1'~S

PCSNIfEEIASYEEELKKANKSgMPCCDGOARLaSVAFNLGATAFJ1YJILt.EItL~IFDIPNDLLEAIT'I'ALDrL
IGVBPLVIAPENPDa.DSIVSEDOROEV'1'AYVOEBLAXSERDRI8SVA1'K

SLtILRt.CAILCDRN1:YZ7fALKYFIIAERLNEDOCFLKImI~tSFJILtYEVXKII8KVJ1POK'PfyPZCNYJ11 01PIi~.LPVNISDYVVi~YCI'L"VVf4CVPAND~REPA00''SLPINEVI

ANTLLLllESR
IppCI,NGL9G0E11107YVINYLEI4lSIGRAk'tMYRLR~R.FSRORYi~IP

IPIIRPEDG?IOfPLmDE<.PLLPP1'tIDDYRPf7CPCOGPLAKA00WVNIY06ElCRi'DCRI:

0i46 182595 183095 TY'1~ONA0'A~~~K~~~'YIGCUNAYLILLYtRII~Dt CPn _ VPYD~GLYSTPEPFA7CLIN~
W.W15SYAIPGKGYVSIEpNAEENGIWISI'CGEIVt~tO
No robust horaolog present in Cenebank/EM84 as of 11/7/98 IIVGISILSSgEWPOTVtIGLGFCCL55KSVVPFKKBLSDAPRVVCSILVLTLGIGALVCGdK~~POVI'IEEIfGiID
ALRItYAMPSGPLD104K1w5N8GVGOCRRPt181tYDLV

IAI'KWCVPGVIIlIGGICAIVLGAIgLALSLFWLWGt.PSNCCGSRRVLPGEGLLADRT.LDTSBEVODIFDRDGLVL
AIaLVFAITtHIE1048LN1'IPSSFMEFLNDFSALWYBaIALSIt GGF&RAAPSQCLPCDGSPRAS'i'PSCLEEWAEIOAV1'W1IDOMSDDt>DAAWPOIDESYLVAQIYtIVVO'VNCKLx AVRVLEPIAPHISEEt1114VILGNPPGI

_ CPtL0147 183213 183671 No robust hostoloQ Dresent >,n Genabank/EMBL0151 197174 199202 as of 11/7/98 CPtf HCGPMAVOSIKFrIVTSAATgVCCIrtJCSRLAIPAFITZEPRATSIARSVIAAIIAWAISLI',., t 9seA-KDO ?raaslerase GLGLVVLAGCCPiGMAAf:AI1?IZL~fALLAWAILITLRLtNIPKAEIPSPQB,REP~TSEPCPMNLRGVNItIFACT
YWVLVC1WIALPKLLYKlILVYGKYKKSU1VRPGLIOtPIN

SA1'PPLEGGSfAGEAGRGGGSPLTOLDLNSGAGSpGtIGPLVWftIGilgyt;CVRLLLPVLFJtFCEEF1~WRCLY1 'SCTELGYOVABWPIPI~'!V
SILPLaFSIIIABWAKLNPSLWF514CDt.'YtLNFIEFJIItRICAlTLYINGRI8ID88AltF

CPrt0118 183822 185702 APWtt~IYPSPVDGPLLODEYpKOAFLSLGIPF3iRT~tIICCYIfARpTALNL~l1' pknl-S/T Protein Kinase WAI7RLRLPTDSKL.VIIG5141R8DAGf0,ILPVWKLIKt7GVSVLWVPRINP.LTLtDVEiN01 tJItVSSltEBEIfDICAA1IGDYRILYRKGQSIFtSrnrr.acuRp'IAItAYLIRIil.PDtOS000TF~KtO~iiP
LOCE

! LNIPItCLWSRGJ1NFSYVWVVVDEICLLKQLYVAGDL.AF
SOP/fffJlF'NIriNVKLi11G11tfPGILSIENVSFSEGRCFLVTODCDIPILSL'1GYZ.1t8I?RKVPLITGPNI
T80SFZa0ALLL8GACLCLDEIEPIIIriYSPLLt~iQElfuJIYVGI~IOf!\tK

LTILEIVDIVSQiASLt.DYV11g~10EEWtd.DSVYIHILNGVPItVILPDIGFASLIXERAEIABPDRZIfRALItS
YIPLY10'1S

ILDGFISDEINRLSKII(FRVLLNTS~GAED1YA1'GAIlYYLi.FGPLpOGIFPNP81N

FS~IYOND!'LISSCLSCTftEERAKfiGFPLIRIUCTt&EEI4NVVTNCIEBtiLRCVPDPLE
VGEI7NSti00KESAENLEFVLYF~ICSIDEAl~1'AIESt888GVEP.80YSCPeL0155 199697 199488 SSONLPOAVLA /
RYVEAEKEEPKPOPILTEMVLISRGSVEGQADELPVNKVILNo robust tmmolop Dresent in Cewbank/EMBL
V as of 11/
r LALQSLLVREPV IfBDLtGYEDL
S
NSLSFCVPFLEKLKISLIPIEEMRNELFfIKTNNSSSNGFSNOEIOGIRTYI
NSP'FLDVHWZTIP.QFIAYLECCCSEOTHfYYNELIALRDSAIOARSGItLVIEPGYA1D1PW

CVTWYCASGYAEWIGKRLPTEAE~TEIAASGGYAALRYPCGEIEASAANFPTADII'~fMSYFLIAtINPN

YPPNPYGLYIA~1VY&ICOI7WYGYDFYEISAQEPESPOGPAOtittYRVL~~LKaD
CPtL0156 200147 199770 LRCAtpWRM4PGAVNSTYGPRCAtOIIN No robust homolop present in Genebank/EMBL
as of 11/7/9A

IG%QKLLARt~I~AP~TAPPP~PIAQOGVCIPSTICHLITIWYC

CPn_0119 185706 187700 FYIYRAATPQSIriIPDGCCFILLERLKELGAGFFYCDIJtESNTTGFTLFPGGSNKGVLIQIN

dnlJ-DNA Liqase L!'IADE
ERFIOtft?1SOJWYL71LGRLEDHDYSYYVLNRPRISDYEYDNXLRKLLEIERSNPEWRVL

WSPSTRLGDRPSGTFSVVSfIKEPIQ.SIANSYSKEELSEFFBRVFxSLGTSPAY1VELKID
CPfL0157 - 200753 200298 GIAVAIRYE~IVLVOALSRCNCKOCEDITSNIRTIRSLPLRLPEDAPEPIEYRGtVPPSYNo robust homolo0 Present in Genebank/EMBL
ICLL5P0EVAKRKLEISIYNLIAPGDNDgltYE as of 11/7/98 STFQIINEKOQQLEKTIFANPRNAACGTL L
atOfE

. .
NWRCLlS4GFPV~KPRLCSTPEEVISVLKTIETERASLPNEIDG)1VIKVDSLASDRVLGFSFYI'YKEAL'liIY0F5 PGJ1SPNWQAStatAQLNSYFCLGGETVTRIISLAPSGLI
IIA

ATCKNYRWALAYKYAPEEAETLLEDILVOVGR1'GVt.TPVI1KLTPVLLSGSLVSRASLYNEKAWSTAEKILKILSFI

' DEINAKDIRIGD'1YCVAKGGEYIP!(WItVCRCARPEG5E1IWNMPEPCPVGHSNVVRftDRtP
KttPIILYKEAAL'IYSPLFYSLP1G(YOLI9CVf VSVRCVNPECVACAIEKIRFFVGRGALNIDHhGVttVITKLFEIGLVNl'CADLFOL'n'EDL

CPn HQIPGIRERSARNLLESIEOAKNVDLDRFLVALGIPLIGIGVATVL7IGIfFETLDRVISAT_ No robust tlomolo0 Dresent in Genebank/EMBL
as of 11/7/98 FEELISLEGICEKVAHAIAEYFSDSTHLNEIAKMODLGVCISPYNKSGSTCFGAAtVITGPPNLTLSINLDLLLEDLOT
DSLPWPKL'1LSEDFDFAYYPfSKAIID'IYAKLtIaHPGCCP

TrDCMSRLDAETAIRNCOGKVC55VSKOTDYWMGNNPCSKLffKARKIGVSILDOEAITNCLtSKKItJIRYLLEOLFK
LETGtl'IFPTSTIDGCRESFLIEFSNE1'KKPTIMAFIYFYYYN

LIHLE
SNGPKLEKDPKOAGCEVHNRLLM.GLKfRPOAGAONDGRNCGPYGPICFLIVWEENYGSV

~Pn_0150 187759 192141 LKONGFLKON

~f117 hypothetical protein CIYYKFFYSYNCPYFISFFVLLGYNMASSSNNSTKODGIPSWVNPMIOWNRASOVGDOEACPn,-0159 01811 f 11/7/98 MSLTPEAp!fSR.S'WFSDRKHFLEhWgLEEMENNDLKKYSRYKTIILIATLVTVAIi'CIVNo robust hanoloq present )n Cunehsnk/EMBL
as o CCP!OCE1'ATRIF'aMPSGFSLATEK/OVSTAEKVIKILALIFFPIILIAIJ1IRYFfJOtK

PISNVFGIPMWVPCLILFi.JIGLSSAFLSHRWSKCKEIHLRYRAYOtYROOLLgOYPDLRFDRIOCFVLPCD'fPKEL
ELIW1NPOL'JENAAREVHPGFFALPTKYOSMYIO'tSKG

K3TLYKYSITIiVKPKKCFVGKLVENLRPDLNANKD00GAAADSRLDFAGYCVKHYOlDAL

L .V~.lfttSVIYpRLASLIMSVKNOttlIDNCSREPIDFAORSALWSC~DtGGEIOP~I
L CPn 0160 203794 20.127 D

DLSRDILAICCYCMNtlGVE7U(KAiDOYKKWYLNSSTFIAWNPOLPAIAOSYLLE00ANLpfkA-Fructose-ti-P Phosphotransiarase ~ALTTAHG1GOALEDLDSLLCYYDOLIESKCVGEKILASItIpKHLDt.AMOD' 'i KIF
DL

u RIPE
IA
TV6LLSWKSYPEI(NtLRYRPEILTLLETIRSKNtOE't'SSPPSPPPEWKNIPNit O
::cTiOENLKKWSNLYNVFSITiKEFTECKLEONEWSRIORLRGALEKSKCSILCNCItTNA

ElITK3EKKLADYLWIGDREPFLTGMHKAIATCKAIQGKVECSIISONPEIfOIMILPCSIVSLYTEOETSSKPLKICV
LL.iCCOAPr7.HMIVICL.FDALRVFNPKTRLfCFIKGPG~.TR
DL'VIYDYYMACItFDMLSSGREKIKTEEDKKNTtIJtVKOLKLOCLLIIOf~&ft .L'fKtR

Ef!LEL7ALRRE04Jf:AIWKNEDEVL.ALK"TMFaQWf;FItDLVCTiItGKYOEFKKNKi.SINL.
' TDTN1LAEYFIr\HNCKTSVICVPKTICI:OLKNCWIETSIf;FN7".iCRTYgeIICNL1KI1AL

tILFQVTPECLBLL (rTLPNIALt.~nELIATRKISLKOLSODLAI1CLVRRY
tfDFTK::Y;Nt.LNRLEVLHAErrT'DDLVLtIVDRMSEDLKKTIEEIIII.iAKKIHIIFtRLNCOQA::YTTLEf:
r:t ANt:lQcat'TIELL'LIVOEDNRLOEAt.~.~.f'"..VSQGLMLLIt.~.LLt7RDEKtNKNtEiSRKI4LVA.
.
KrCIQf!~fVLLPEGLIEHtFD?RKLILELN\'LI.HHiCD...~.IEK1L:K4iPETLKTFNLFPK

AY')AH::f1\ItNtL.:(r:L\PLIORNR/13L(Ntiti~faflLFtK7SIRNIHALDTETLVATSSNMDIANOLLIA
RD.~.IKxIVItV::KIATEEL.IJWMI'KKEfEKIKMIMEFIC:V::IIFFf:'IFJWAGFP
?tHIiLHHO
DV
~
' .
:iNFri'.N'l(aaLCIt::\I.FLVRGK'Ir:IMfTINN(JW::YTEW01~4\TPLYKIIIHLPIIRv't".CE
.
fNt.t.DVLfIOSKPAPAfMENPLH.P:ALPf6VODAVAE
t'::AMIITFD4INlY
:LLILSC17C
' \LFI
X'tIIL
r II
' .

.
'PfNtYTD.'1't'PK~PAVQ)If.t.OO::D.':rt:/I:~LVNFf'C:f'IlYIFt:KERLtUONPLTL(J!IDpT
..
L
v AUf.KIMt::QWK:iINKY
iIAKAIV<L:fVA
ILFrI
VI.YNfa LIfATY:LPwEFNNKDLNRWY.WDNLNLE
'I'LIFURI::K::KEFEYOVLETAO
~
f 6'~

. IIChPr~Ai.'f::hY:KR::I.
.
.
LL::

.a':f~IWAHNIV::ULE~:ff'TKf:K:a.iCDL'fKEFRRD.~.Yf1I11KRIKRRFKMCf.I:OFJIPWRPT

lI JI/tH~~\tYPAt:LIIRLI.I:IIWKOKEEI::IRCOAL'/'rEPMCLt:LEK;:KYDNP.KNIAAAMT
ILIASN iO Ut'm ' t)161 ,:fK

KK'Ir:KL~M)IUHI.t'KNNLTYVRIOHFFRTLIQEKLGt.tI%rVpEtIriIVKEAKELfIELAAIIYG.
.
S

Nf:X:N::~.K(tIIAKKOt'Kf7firUlIAC:KfiQLEL.LG\'iL:flCA~p:IA:NfK.~MOAwPRERLLLNPIfn f!rliCtwt .ncylCC.uc;tnee.r:m Lun, lyl IIR:::a':RKQENLLI'n:WLt:KFIt7JN'fMt::I:1'I.IlJNFTTFt:ldJtfl'LIIYNPfYtIVILtJIG

It:AKIPa;\t:HTlr\::RKt?1Wt't'LCI':YLTPFVRFs::fF::1'O:Y:YMJII.INREOt.FDIEORLt.IL
:IAtTt:SKR::HVRLIWELTRII:IMLNVIH.It:IK:Dt.D':F:lliUt'::LtNYK~tIINEtIEYT
CLV
:KV7TLMRDIJ1AVF
rP
' ' :
' .
Il::ldlItOL)ERLAIFt::::aLX:I'1.AllJP:.LFFNKtKAir\VWAITf:aa:lWlAKMrYNAPEYI
:
~/NKIIE~.LIV:,9 Yv:LI:A
ILY:~
:
,rlVrt:IV::IItINMV!)AALAA
tTY:VLt'1'RLN(c:l~'I).ItRVtI::VLIISIILRGCD.~.:;tar:IIDWKKLFEt.LNNNI:IWPNOPEC

'I'N::OK~ALTYACNTIIJPDFYTpFLIIIDIYKELHFLTDSNSPELLSEVKFaLK. '" ' NLPPtLYNOGE00LLVSINHRTL

FTFJ1FANCDKP ITILTYPOV1H1AFPFAE.iSALSDL'f'QNLKRELTSCE
CPtf~017!i .. w.,.l.ir927r r :~~~Sld , cr153 Iffooensr:i~ar p~onwoa "..~
..

:~ ~rf~: 2nse7o 2DIeD3 NDDOPI~SDDEFJ1SKDSAfSASFSyEfYKSSTRGKt~fl1''"11TASRTLYILRpOCdYDP

,b rrfousc nowoloq present in CenWank/ENHLRALKVDDEPIIYfiVEKRLDAKNPOSLNAFHKEVG111YVAs'VrYGCTCFpVLRIISYL4VCEL
as of 11/7/98 tI/YTLYN:OSPFRtNKLYSIS50VCl'PWIFOLNSKVDSYLFIGCNRIIfWSIVhpEPNLIEKEKISISVAAASSLLK
SKT~IATEK~SSYQSESSAGIVFt.O~'VL.POLOOIHtLDFKDN

tCKVfIJVRI3TIVKILKTLSPLIFPLLLIALALInFLHAKYANNLLVSKIt.ER11P0YVPLiLPNEPIPLAIr~SIT
CI:IIPELFPSEDJI0VGi0KKSALAxVILNYLLSNKPKE~SP
SE

irR:>r.L.h'A.SHIKLTTLVPV.';01fM4AlICSNPLEVFJULR'I'I'KPSFINPAKYROITISSH.
:...t.,..... ;..Y:1~~ '-''T"r'.!~Yf~"'~~.,......,ir..w....kn~
FYLRF
. _ .

'=:.y~mr'; t.vr:'.r:W '.".~'/ .,.;~~y,.
~ .
.. " ':.:,lm!~-:1:1.:~.'uta~ . ',-.-;r.,... " ..: r rcn:_w'. :.
' ~ :~I 'Ai ..;..;y~KDpc;,~:
' ;n::nrr.....
ri~

, :
:. i tm.Fr,. Lk:Nll~ll'1CDLF lR.ir~LEIR;.iUlNi:i~:iCV
:~;";;.~:..; ":a' Lii:'..dl.::I i i w:.: ~ : i r':;iU.:v :~:~.n!~'.alr F I IT'J
':t;,. .: . . .... L,yl.: 'rrr:x:rl.~.':.:W

PLDEDRC'uCFEILEOLOELCVRFPICPSOCPDNPNFOCFOCIRtYWEDSYDPNKPV

CPn CPtf_016) 205931 206191 _ No robust hateoloq present in Genebenk/fllBL
as of 11/7/98 No robust tfomoloq present in Genebank/EMHLDKRIaTI'KSIIFIFLISCESIOfOPNSLIFSSVCLt~GLCSLbSd~IOKP>WMIiHrI'STSEEF
F
as oC 11/7/98 ?EI(AIVYCIKCKOIIKC'SIItITP'1'PATPILTE('aEIFPGPVDSAIQ~fDLERLLTyIDfRPDNOLPMIPSAFR
TTQIFSEEfHiDPYWAKTDEESRIfINR6IN1~1LICIIfGSYIPI

IIRIYLR~IGOSLV'I'IYPKDGORLRSPEDLRVGDDLVOSYPNHLNAIELDCWIP~LIGASTIfGSLJ41PKSAALTL
KTYRPNPIWINCYERSFNIDTCKYLKEGSRRRT$NDGP10111RVL

TYIITFADFSTYILSLRSYOANSPSD011~lGIWPGSIDDPVOAVISFLKt#IGFALPSTLWM.IKSSGRRGHAICL~f I'EEDFYIJUIRRCGVYSLYWlVCSYPQI?IPFVIAYAIiIA0is11 DPLt.CrNlt CSKLVLPVKCYYSLVi~f~'iVSSSDSLirAFCDSF71~YGRSTFLANCl'SILCVIItSYKRVPP

0161 206141 206998 OP . .
CPn _ No robust tawoloq present in Genebsnk/EHBLCP(L0178 218052 217789 as oC 11/7/98 V

I .
LCFKCIY:KIIFSFLKDLNTRSTIESSDSLCSRSFSOKLSVpTt.IOiICESRiJOCITSLenc in Genebsnk/E1'oiL as of 11/7/91 LTLIVOCALIALAGGOVLSFPLGLII~GSVLVLFSSIYLVSCCKFFlLKaIIXCCSVICS>amoio0 Dres frobusx !
No KICLG
' ' AF _ DLFGEEBCRNOCNRSARNOLFJIILHETDGIILKRYtsOGAK_ _ !
ECQLFIIIIVGKTEPCNC
ESIMICI
~

'KLNIWFEKpPNICDIEKALENP ~~ G~

CPtL0165 206983 207582 CPr~0179 218550 218056 No robust homoloq present in Genebenk/EHHL
as of 1117/98 No robust honaloq presaft in Gerfebank/EI~LPKLWDI'NFETRIGTSVPKFNRRLPKSFHKSGRSSRPSKAL1IANFPN1'TipJIGRSCIIPG
as of 11/7/99 NVLLFNNhfVPKTIDifiIDPESEIDIRKWSCY>Q.IKECQPLFRSLISFLLCVIRCOLRI1.KKIfAILLaiVNDAKT
PNYSCItLSIGFPNEpDLEAQtBJpQAALVRKILICWPNNfLKGLIJ1K

RSKYOmARTVSDEDAPLFCLTRSYYQDGYLTPUWGPRDLINNYIRLRRRENPlOIFFSPLKKDRlQ2LSSLIFiKLSYA

I

KNPCYYARLAFNESVCYYRZ<.FDIERLTKMYVECDYSKEOEKNt4AILSlyK'1'Lt>DGImFLP
IS

LIEHKDTDLIGACFlDVFCT
CPIL,0180 218963 218355 ~ No Irobust haeolo0 Dresetft in GMebank/ENBL
as of 11/7/91 CPn_0166 207591 207962 TSLIHIILOCKYRPYFION'1~ASETYPSOILIU10REVRDiIYFNOADCNPARANOtLGIDtI
No robust tfamotoq present in Genebsnk/ElmI.'IWINL
as of 11/7/98 NCLROYM(SDSD1SESINRSIHLEASTPF!'IKLllnCESRLVItI?SLVISLtaLVGAGVTCLLDVYf~NYS~T~DI' ~R'RFTFVSSKNDIENNGLS?IPLONVLViAMVRR1 ' LAAdCIRNIEWRWCLDLRSOILIS1U.FZKOPOFOSLTEDFVNNS'1'IIOEGRVIpNtNL

D S
LWLF1/AGILPLLPVLILEIILITVLVLLFCLVLEPYLIFxPSKIKELPKVDELSVVETR
' STL S
OEKK
LISLIIZCItCiAVLESE

CPn_0167 208309 207977 CPn_o181 219175 218777 No robust tfomoloq present in Genebenk/O~L
as of 11/7/98 .lo robust homoloq present in Cenebenk/EHBLFYIHnSLNSHNLIXPSSLFJIAVpALDSYIYWOGDITDVL71A
as oC 11/7/98 ELFKIOCVYIfFFIDIFNKL
V

NLwSHFPRGFFNLPFCPTILWCPFIaISENYGLEAL71J1TVD5YF1%.GOSOIYFL.
SKODDD H
DDISREIYCVPRLYIRFWIVSISOSLSRIPWRLKRILLRYCfLRGKYVNPILIKRIJ1ILL

ITVELSaI(1%tKFKP~GSIIiCI'LYTEDPILPAIC'tSFSNCSDIOHRTPISPIHCLIRFSRLRNSNY

CPtfr0168 208715 IOAI17 No robust ffomoloQ preserve in CPn_01s2 220701 219331 Genebank/0~L as oC 11/7/98 SyINLRRREZIpENFlNpGIIpCYYARLtUTIE,gVRIYR1G.PM'AQJIONYGAGDYEOt~aedHlotin Carboocylase LKSILSFVQILDEKDGF11DFLATlIKDr1'FIGROG71DITCSRCZIHDNLIAMtDtIAVRIIMGfIDLCL3TVAVYS

sYLKISNU.A~ICEnGAa~fYHPCYCFLSENtwFASrcESC~.TFIaPSSSSr~IL~cIa ANSLi110CI1ICPVIFOS1~iIIEDGS~IAE1IIG!'PIVIKAVJ1000CROIRItI~FY
CPn ' ' _ IIGNYVIaGLIDCTIGRIUIpNL
No robuse homoloq present in Genebenk/~i.ItZfPRNLEICVI0D1 as of 11/7/98 RAFSA7IRAl:AGGF1~811~NV1fIEK!

SFNIEFTICENNIBe~NCSECSOPLVIdEtM'OPLRNLCESRLVIfII'SFYI~VGGLTLIEETPSPIL.NiI6IRVKV
OLVA

TJ1L8G71GILSfLPWLVL.GIVLVVLCAL.FLLFSYIU'CPINaGVI/Yl~ti'DSDIHQNFDRpRNZTELtrI'CID
LV1CC0INVAlGEDfLPWKa00IEPSGNIIOCRIN11EDPTpiFBPHIORLDFII
LPPAGPSIRVOGACYSCYAIPPYYDS!lIAIfVIAI0G10DlEGIAIisWALIItlNZ~1108'1' K IPFHOFHLDNPKFLFSIiYDINYIDNt.L7IQCNSPFhEP
TNDOVDPVSEDSIRTVISCYIQ.IKACKPEFRSLISELLRAIIQSGIGLLSRCSRYQEMKT

VStIIUSIPLFCPTIISYYRDGYLTPLRAGPRYIINRAI
CPef.0183 231207 220695 0170 311098 210025 aces-Hiocin Grboxyl Grrier Protein CPn _ RRtL~LI00IEKL11IANORHDHfRPAIKAL~.GdLERDTAECSiRpEPVIYDSRLFBGFS
No robust hanloloq Dresent in Genebenk/EMHL
as of 11/7/98 NVRIQetURGE!(YNTCTVIAPVLSMSYInLFKNLLKEDSVHKICNEIFALWRtlTrIACTOERPIPTDPKKD?IKEIT
rENSE't'STCTSSCDFISSPLVIfTFYGSPAPOSPSFVKPODIV

E71IIKNLPKADIHVHLPCTI'rPOLiIWII~GV1044PLKWSYNS~IrNtIRLLSPKNPNKOYSNISED'fIVCIVEJ
U~IKVIHtlYK7l~lSCRVLEVLITNGDPVOFCSKLFRIAI~J1S

FRNFimICKLfOPDLSVIQYttIIIQYDFNSPD1IVNATVOGHRPPPOCIDNEF~LLLIFNNY

LOpCLDprIYYTEVODNIRLANVLYPSLPEKHARl9cFY0ILYRASQTFSIDiGITLRFIlVCCPIL_0184 FNKTFAPOINIDEPAOGtrQWt.OEVDSTFpGLE11GI0SACSESAP011CPKRL71SGYRNkYe!pElonQacion Fattor P

DSGFGCEANAGEGIETRTIFSSAKVNPEGLIEITRVTFSSLKRKOPSSLPIRV'ICpLGOWKIKFt7CCEE1IINVLSS
OLSVCI~IFISrKDCLYKVTSVSKVJ1GPKGLRFIIIVAt.pAADSD

WIDWFKATOEVKGOFCfRTLEYLYLEDESYLfLDiGNYEKLFIPOEIMKI8JJf4FLIU

CPff _ ICDVIKIOfRTCEYIORV' ~OuaA-ONP Synchase IIKLOSJIRtIHLNTIFILDFCSOYZYVLAKOVRKLFVYCEVLP4MISVCCIJCERAPLGIIL

SCCPNSVYENKAPHLDPEIYKLOIPILJ1ICYGNOLtIARDFGG'IYSPG11GEFCYTPIHLY?CPfi_0185 CELFKHIYDCESLD?EIRNSHRI~fVTTIPEDFNVIASTSQCSISGIEIPrICORLYGLOFHPcpe/araD-Ribulose-P Epialerase EVSDSTPl'QJK:L6~.'FVOEICSAPTLWNPLYIODDLVSKIOD'IYIEVFDIYAOSLDVONI.AEVKKQESVWGPSI
NGADLTCLNEAKKLEOAGSDFIHIDIMOOtfFI/PNLTP!CPCIIM

AOLTIYSDVIESSRSGHASEVIKSHHNVGCLPKNLKLKLVEPLRYLPKDEVRILOFa(.CLINRSTDLFLEYNJWIYNP
FEFILSFVRSGADRIIVHFGSEDIKELLSIfIRICGGVpAGLA

SSYLLDRHPFPGPGLTIRVICEILPEYL71ILRMDLIFIEELRKAKLYDKISOAFALfLPFSPDTSTEFLPSPLPFCWV
VLNSVYPCIIGDSFLPN2'IEKIAFARHJIIICriGLKDBCLI

tKSVSVKODCRSYfuY'EIJ1LRAVESTDFNTGRWAYLPCDVLSSCSSRIINEIPEVSRWYDEVOGOIDDOSAPLCRDa GADILVTASYLFEADSIJWEDKILLLRCHrYWIC

ISDNPPATIEtrIE
CPn_0186 :_'3878 221069 0172 213237 312110 'sinilartcY co Cps IncA
CFn _ PIKDKILItSSPVNNI'PSAPNIPIPAP'ITPGIPT1'KPRSSFIEKVIIVAKYILFAIM'1'SO
-fmpD-Inosine 5'-monophosphese detwdroqenase ICOOH-terminal rsqton oniyl ALCrILOLSCALTPOICIALLVIFFVSNVLIGLILKDSLSOGEERRLRECVSRPT80pR

APIGAAICIOPLCISRAHHLVGGANVLVIDTAHANSKGVFt3'lIILELKSOFPOLSLWCNLTVITTTLETEVKDLKAA
KDOLTLEIGFRNENCMLKTTAEIILEEpVSKLSFQLGLiRI

LVTAEAJ1VSL1EICVDAVKVOICPGSICI'1'RIVSGVCYPOITAITNVAKALKNSAVTVTANOLIpANAG0A0EISS
ELKKLt~1DSKWEDINTSIpAiJfVL(GOEiAPQG017IVID1NQ

D,RIRYSCDWKALAAG10CVHLCSLLIGTDGPCDIVSIDEKLFKRYRCMDSLCIWKOCEOIOALOAEIIGIWNDSTAWK
SVFNLLVODQI1LTRWriELLE.iC'DLLS011Cb'ALRQEIE

.~.ADRYFVtOGOKKLVPOCVEGLVAYKCSVHDVLYOILCCIRSGHaYVCAAETLXDLKTNASKLAOHETSLOORIDAN
LAOEONLAEQVTALEKNKOEJIpKAESEFTACVRDRTlORRETPP

F'IRITESGPAESHIHNTYKVOPTItIY P'r'l'PWOCDE~EED~CI'PPVSQPSSPVDRATCDCO

017: 211041 211715 CPn_0197 331218 ..5015 ~.Pn _ Dradiccad methylasa nn rotf.:.~.c tfoaaloq pcesenc fn Genebenk/EMBL as of 11/7/98 TIFDLIYKIDSYKHQQCFMDFSVFPDRFVESTSPSFIEDIDAI(rLVSNCCNYCSRCLFLFVPLTYTRTLPMNSKFI~~
sRRKKN..iHKEET.~.WDr.'LAS.~>yHKfIIODKrHYYIIRETILPOLLP

I::LL SI I
Ir_F::Vlr;l'SCETASLVFCIL.~rLIVLVLLLSLTLOSKSSVLDICCCOt:FLERALPKECRYLI:IDL::.~.FL
IALAY.Y.NIL:VNSIIDPIfVADLS
IECRNRECCRRIS

KRLEFVEPTLFSHAVATL.iGNMEFPt:F.A
1 RNTATLLEPiMFF I VLMIPt'.PR I
PRASSN

:Nn Ot'lA 314215 3L1721 IIYDEtIKIUISRHILIItYL.~.FHIIIPIHAIIt't:OND.':P::TL.:FIIFPL::IWFKELI:.~.IK'.Fi.V
DDL

th. rMoar lu>tnolrul present in EF7dl'::.~.ifT:.'r.KRAKAfN4'RKEFPLFIIII:;rtKtK
~enab.mk/ENOL .i:: or 11/7/99 A'lT I PAOCRRS W
~
'X'mT::L
~
Y I F INF'/RK I V I Lahl Ihrrr L
NSP::PALNPEL::LI FFM'L'J

.. ~.'.~..fuo . W'fsOlu>t ~
.
.
.
'I'Lfi(ILLIF'IIILl:W1'II:.1'FTVIFFLNC:LNLL:."CC::IIG.~.::1'.LIIVGLLFLINCLYFH

::::LO(~:L'Jt:LL~Y.EL::OAEEREEEYIOEIEALR(7AFRAE:a'Tf:::P~IwL~'I'if2 tWP~cIa'.ru:.W pmcr:tn A'rIlrX:INFRKLFPP::KKK'I'a(JY.ORLkNM:LV~I11 tV::IYVLIJItINA::KFAI:VL::YY1ILI.

'Frsnl'/: ':11>I'n. .'.Li275 '15'VIIGVFFLRL::~~IILFTNLNWYtWLtIKF'1IiIY.Kf'I'/AIVFJ1A'IIIATr::~IIr:LVl.l2'w"F

th. r.ds~.-.r tuNnfIW prssnnr m WtY.SrN:ILMLI.::I.FJx:LNKIFPT'..'Wrf'I::LYILV::YI'/iYLV::I'MfYtIVtY::bIIYITO
.;v.nrtamk/r?IfsL .n:: nl IL/')/ny LLLACFYiFt.I.HItPIMEUI'Nr:VLOD~'1'fVLYALN:;FL(RI::WCIfRLGICp::PLEAFNAtA:E!HI'Ir ~'IAKLF::L;.11:1f1'AL1FI::RFVI~YLLI:It.AI.Fir:YA1L.F!YAIf,IK'r::AI.I::I'LII<:

t'rFl:Lr/ITC:FI'LEbVAfI'ILPr:YfIPKFYLSFIDRDUf:I/HYEVLDt.VFLK1YAACLIN:i::VWI'/FW
tAFF::I,~~11::IFM'::FTIt:At.VALI".:FLLt.I.lt'l'rNI'II.FYk:Ai.TFftI~NRIIf.'I' AQFIAK03I(VPIGEVSOC:.DVL E:tFtYYL."YD1IL'fODF:FJ. .'..~.AVDLK'tTYF:
t :L .,. ~ ' .
FLFtGDKILf.~.CYLOLIT.:"."!ILALTTPOi7~tECFn ~)t7N ".L.h,1145. ; .~.9t1,'!1n PVPNPSELTtKDIADKLLHREIfKKfNPOLGTTFtENSFONTfNOA

EKHCfLFPYNFK:YO opM-Q.i t'td~t "0 t~ldmt>s: ~'rf,e IWICfNLTL::EIARRIK ) n (,~~ ,;~
IEYYINKMIRUtPt'LK:L.PNLLPLLLTL3.:CSKGKCEFLGK~~fI1111SHDI~I.~RN

AY4ER~4~'Y~'~~f~~~~P~~rAY

~.Pn OlN7 DFEKSIKQLYfEEFSPSIH'.'~'VIKNSSAIHNAGK.~nLE:wICAtfDCL:v.VITLGOPfP
ols Transnlaalbrane Prxelnl P

ossl YFLTLIARPVf3PVHHTLPE..'YKKCfPPSTYTSNGPFVLKKHEItQHY:.::.CKNPHYYOHE
~:131 aalwtrxt-t .
NpIB(RRs'WLKIiGL~.LI:S.'aL'JIGfLIFL?QLISZ'ESRKnVFSLIHKE3uLiCa'eIEELK-., EPAS "
' '!
;
~e .
--rFT' ,F~~r-:
r -c ~
~

A ~
.GIdSLKIN .
EAKDEVF:>AEKFELDG~LLRLLIYKKPKGITL..
. , ~.- y ....",.. ;-M'ARKIKLTr _ .r .a ~lF~ .;
t .A
.
'j _ ~,t w.
...,,...t.r....,~-._., -, . :'. t' 'v. .
, . ..1:
, t,lr wYf . .
. .... .. ..
...~H......n~~r.~ ' ~ v:"_:1,:::..::.. ::1:'-:.~
n . ~..111.:~i : :'~!
'I '~\F_ tL::
,. 1 ' ' ..t. r.
'f'!'dfstl:~
\
:"
w ' ~

. ...
.\Ft li ~ ilnFr . htlfiirUv\
YHCfLI(KRROGDFFIAT~)IIAEYVSPVAfLSILCNPRDLTQWRNSDYEKTLEKI'YLpHA
r ! : I.
;'.:17: ":
T.FYIVECsSSFIELKPELASALCNGItPLS'fP
IONKLOGHIHK
N
' ' ' .
'IKH~.KRAFMIIEEETPIIFL'fHGKYIYAIHPKI0N1'FCSILSJITDCICiIDILS
ASGDG
O
NOCKT:rtfi ITSKOIHAtYSYAKIPLDITI6dKItIEIT9QAGLpEyAINPKDPM.At.OLI~E
VI?N

DIAYSSSIVIFCASPSHG1JGL.ISIONKKtILTKFRLJ?OIIOLp~I'RAIFPOPF
KF~
K~E

a CPn_0199 211019 :11983 PLOVAYYSLNIF~1"IKNAHLtJIMILONPLS.KISCSM.SG1~N~'FKfNl-011~pe0ctde Psnnease TD1?JLFFPKFSGKITJIRENtLLIEIAKIGSp~pIKP6ITSI

:IaptIPSYAFJJtF~KJI0I opp LItIGOFCSLPLSLVSNHLApFHLKKLTfSFIIfDGGKFVTKGNL4ALIENPDYPOLMJfRIKCLICLSLVF&YILO1R
ILF:ILi.SLIaIVLTLTFLVIOITIPGDPFNC~NLSEEVLOTLK
fV IG
it'I

LDCSSTSPSSKDLKIOGSGEIFSLPLDSITKTYaItOVRLSpYfGSSGDIiJ.
SRYGGDKpLYGCYT'QYLHSIAKLDfCNSLVYKDRKV7NLST.11PISAIiwiCSL
' ' IPDGLLS !It KLTLLSNFKSFaLIGEL%LVlIaFSNIfLSSOICD'tIAWLVSPERYASFFIOiAGGIAI~'IAALE~RRYILG7LSIt .OISIPAFIFA?LL7YVFAVKIPLLPIAL1GP

VNYNPKDON
TILPTLAIJ1YPPN~IIOLTIfSSVSAAt3IKDYVLLAYAK~.SpLKWIKHILPYAIfIII
'CSPItLLHRTANVALDISKISCPEETKGLSCLTLLA71GGLEGSLGTpLIFYDINSKET1 r l' a SY~'T~V~AIFlIIFCIPGIGKWFICSIKQRDYPVAIhiLSVFYGTLfNt~SIZ.
llOC SIIDFQIRYAtiGKF.%%R!I
FIINDfKCSLRAMiLDAKIEYDL.KCSCLApAGDSKT4AE~~SPESRtI
LKAQISSLAGPRINVSItOQAFRTGEGPVDT~S~
PMSPr' ' ' .n OL
aI Q
uu ANYIIHIPSSFIA
Rf?1LTAHLSIfLEDVHKAFL.OEFNpt.L~YSGYPVTLEIIrifONFYLp AEKSILi .
IPLIill 0200 241996 212968 tRPYSFEEFRIQSATLOte;KISIAtnGTMYALfOfLDITOQKOfVE~TpIfF5V01~SCPn IICKRLDALIDRRIRIJILIfGKTDTAHDRLF!!t'LGIDPLVIKKYFIiTSLKTIWffLIKIR..
fSStlrDSTPPPTYHPFPWDCSNFD oDPC-Olipopepcide Pe>:~ase LADKt LTT IILI4If'.J1LLLPWFYQ
VL:
IKSIIpN104 RSI' . .
CSISSPEVDWSSAYAAIALL4SYSLGHPFSS .
r C~OO~I?ISrILSSAPS
ILVSPCSRFPFCTDTLGRCIIFARTLAGLRLStd'IATIATLIDIILIIt'.LWATYAISOGKKI

SIEHK
DPLJ!!R'ITEILFSLPRIPIfILLLVIftdB~.f.PLIfJg!'1'I'KAIIPISRIIYCQFLLLIDiK

PFVGfALIAI~(A.4TFNILKZIIS1'LIFtIPNAIYTCAIISFIGLGIGPPOAS
CPn ' y .S1IG
170 :obusc homolog present in Genebank/tf~LLG'ILVK~INJ1IDYYtsILFFFPSLII4IAf.SISfNLIGEG1KTLCLE~
as of 11/7/99 STSTKKF)1VSKAIQKIIKINCITDPSIlIVETPNAEIGSILQEIKEI

Lf.GIKLNRK1WSFO
KOKLSKQAEDLGLLLILYCSQETLSM.fldINASLKLSIGSVZEt~SLKOLVEESIEtShGCPeI-'0101 212110 IVIOCLLIKfiNPEKSEAASgGIIVOTLL t ATPase pQDpLIGSVLIEISDXFLSSIGEILSLNLOIf' oppD-0119opeDCide Transpor ~SVA~tERCHIt7M~CYRVL~iGEpI~EOiIV
SKADLttDNYLLNIKDL?ITSTNPKRTLI~1LSLGLIt!lIAI~LVG~
e LG ASISSAPGLIDIW
~yS~IAS,, Ci~II7fAILGFLPF3JCLIK1GSILFEDIDITIG.SPKELIKIRGI~CIATIIL
RYVSSGLTIDKVEDKPITKFIRaGKLLYSGGTSt?~ESMP4GL4TSGI9fPlWK

SKt7YLE
TPSCItIGNOIIE?LROHHKtI~tKEEAYWOl110LLTDVCIPNPKYSPJpYPFS.SritilIlORV
SASKSNDGSFPFSALRHKFTFSt7TDCPGITS'1'1'LSGNOAGfY191SLSLKVLVPSIP9IEK
' VIAIAL71SOPKLILIIDEPITALOSNSOAQVLRILRNI00QICW1TILLV114a.9LVKtt~l1 PEVOLSLVYSYEONLPIDNIFIOfSOPRTIPL71LI~Tt4.xDKYDILEL.AAHC'1 SPNCSRFSLOIxOTNOfENS>MIfYtVNAAHSf OICIIKDG1G.IE'1'CI'VEFIfLSPKHPYTLXLIN7NSKIPIAtCtSSPILR~OtiiJI~CG

CPtL0191 231079 271314 gln0-AIiC Jlmt,no Aeid Transporter ATPase CP1L0202 213692 211500 CYDKREGVMI'IRVRNLJ1YSVNlGDIILDGVTFSLERQIITLF9GICSGSGK11IIP-OlipopaPCide ?ramporc ATPase OHHFPVFL o . pp LRALJIGLVDP1GGDTt~tIEGF~IpALVFGQPHLFSfNiIVLGNGTHpDIHIKGRSTEF~AtKAVPT9NEYAAWAlfI
'LLSIK~.SLTIRCKKILNHINLNLIKGSYLTIVGP>Kt~%Si.iLLT

FEf yHLr~IEE51AKNYPDOLSCCDKORVAIVRSLGIDKIITLLFDEPL'SJ1LDPFATASFAH%FTisCi'ITfI~PKIPR
JWNGVIWGDIDSSLNPC?1SIKCZISEPIliIIGTYfKA
ILDLT

LLZTLRDQELTVGL217~tIpFVHSCLORIYLIDpGTVAGVYIGtDGp4~YIHS.

yNyyDI,yNLpKbYWLKInKLSGGGKpRIAIAKALVSKPELLICOB>tL1LDTL
flt;LDLIQTIXKEYGtn'LLFITHIxISAAYYIAOTIAVlmOC81.V0~CitSTPKH
N

. p 'I~DLLDAIPIF6LISTOIZpStCYBLOVASK

CPn_0192 232617 271991 glnP-ABC )uaino Aeid Transporter pernaaae CP1L0303 211966 215802 CVSGIGIICGSIIGLLIGTV'ISLYFPSIG.TKLLIINSYVhomolog Dresene in Genebsnk/t?~L
RGCGYZT as o! 11/7/11 GVpttNWIARLtt E

. No sobusc .
IVPLPpIO'S~TFSPTt4KSFSLFLLEKLDSYFPFOClRI9ILVI?I'L11IALA
V
TVIRG?PLFIOILIIYFGLPEVLPIEPTPLV11GITALSisB'ISAAYLI~NIRGGTNSLSIGD

vIESIWVLCYKKYQIFVYTIYPQVFlf7ILPSLTNEPVSLIKESSILMVVGVpELTKVTiInIAW010CKVSTIEKIIK
ILSFILLPLVIIAFILRYfLHiDtFDKpPLCIPKVIt~I.iG

VSREIJiI~S4YLICAGLYfLMI'SFSCISAISBCRRSYDNSRFQAVEK71VAEISP11FFSIPRKYQLIAIDTPK17D
AP8ILFPIGIEIII.I~CI~I~a ' r NLTLIOtEI07TLGNPEEKaLFDSICSIEK00~1N8LESKKLLI'1'HILIDfWSGIIOWIf CPeL0193 233111 232696 FNp~'IGRGYFSEISTAKIHFHGI~YCPIRSSCPIIOttI

acpR-AS'Qinine Repressor KLtILIlPl00CKVTIDf.V.KEILRLEG7UITOEc'rtavr.f~,FATTOSSVSRWLRKIQAVIN0201 215691 AGOIGARYSLPSSTEKi'I'1'RHLVISIRtOtASLIVIRlYPGSASWIAALLDOGLImEILGT~
No sobuec hoaalog present in Genebank/~tBL
as o! 11/7/99 LaGODTIFVTPIDEGRLPLLNVSIAti4LDVFLDpAaAtNFfNNKYSImPFSSARBIWANPFIbTItHEGNIKIKClIC
IfQIFTRLKt<GItf88 YNSINFNPYFFDEDCIVYwtESOIKSAIADHGILGKCILTFYPNT

CPn,-019 273162 231211 qcp-O-Sialoglycoprocein Entiopeptidase EVPHTIKfi~M'FSNFFIQ.TLGLESSCDLTACAIVNEt>KQI1.ANIIASQDIHASYGGWPECPIL0205 216077 No robust tloeOloQ pree'uIC in GenebenkJF1'~L
as o! 11/7/98 U1SRAHLHIfpQttINItAL.OCANLLIEDImLIAVTGTPGLIGSLSVCVHfCKGIAIGAKKSICDSIKGYGSASIIFt WPpCIi.LKFFLVCEELCILTVATHRALLETPL7ILSFlXG.ATKYV

LIGV1MVEANLYAAYHAAQNWFPJ1LGLWSGAHfAAFfIFlIPI'SYIG.IGKTRI~AIGETYRAKDIIALM'11f10C
pTILh?SPLCS

FD~FIGLPYPAGPLIEKLiILEGSEDSYPFSPAKVt,IfYDFSFSGLKTAVLYAIIttORIS

SPRSfAPEISLEKORDIAASFOKAAC1TIAQKLPTITKEFSCRSILICGGVAINlYtRSA

IQTACNLPVYFPPAKT.CSDNAAMIAGIGGBtFQIQ7SSIPEIRICAttYGWESVSPFSL71SPCT207 hypothetical D~cein IVDAASPACYDSINSDAIGVSLtJ~ISHILF~UIYDt7GILPREJ1IB~11AIVKGNQITpYLL

CPn_0195 231172 :75785 HILNDAInRVPEIVNDGSYOGHLYANYLLAOFRESAALPLTIKLPAPE~TPHAIAGWL

oppA-Oligopepcide BmdinQ Protein TEDLPRILASYC1IDDSLIKELILTPXINPYVIWN1I9GLVTLVCJIGKIPRDKVIRYtAEL
'lSCNSYNRKISW
TCITTLLSLSVVLOCCKSSHSSTSRGELJ1INIRDEPRSLDPROVRLLNYRLEKOPSFAWONLIAu'ICTLYPGELFYP
ISKAFDGGLVDTSFISNEDVCNIINtrfTl1 ' fAED ESCIHTLCSSTELINDTLEEHEKWLEDfPIEP
LSEISLVKHIYEGLVOENNLSGNIEPALAEDYSLSSOGLTYTPKLKSAPNSNGDPL
' fLESPTSH
FTESIdIfGVATGLNSCIYAFAINPIfO~IVRKIQEGHLSII>EtFGVNSPNESTLW

FLNLIaLPVfFPVHKSORTLOSKSLPIASGAFYP1WIKOKQWIIfI3KNPHYYNDSOVEtX

CPn "ITIHPIPOANCAAKLFTaOGtct.NwpGPPwGERIPpETt.SNtASKGHLHSFOtIACTSNLTf_ ybnl/sodiTl-Oxoglucarace/Nelate Translxacor NTNKFPLNNNKLRFaLIsALDKFr\LVSTTFLGPAKTADHLLp'INTHSYPEHOKQGUWROVNKKKRFLSLLFLTAVL:
riTWFSPNPASINStJA4.lOLFAIFTfI'INGIIFQPVPNG11IAII

AYAKKLfKEALEEtAITAKDt.EHIliLTFPVSSSASSLLVQLIR&QIiKFSIGFAIpIVGKEGISTLLLTOTLTLEQG
L:.:fHNpIAWLVfLSFStJIIIGIIKIGi.GIRIAYPFVSAIGKBPL

FALLQADISSCNFSLATCCWFADFADPMAFLTIFAYPSGVPPYAINNKDFLEILQNIEQEGLaI.LVITDFFtrIPAIF
aTARAGGILYPW1'SLSOSPGSSAEIFCCODLICSFLIINAY

ODHpKItSELVSQASLYLETFHIIEPIYHDAFGFAMVKKLSNLGVSP'ICVVDFRYAKFNOSStItTSJWFLTANJ1G1 IFLV 1ALAGHVrIISLS4MWAKAAI
IPCLPSLFfJIpIILYKLYP

PKITrCEEALRSAKLRLKt~tDpLKKEEKTTLtIPFLLV'JWl'P~.LCISA'ITAALIGLS

CPn_0196 235906 237519 LLILTNILOw~COVTANTTANETPTWft:ALIeatASPIIK?LGFIPLVGOSAAALVSG49MC

appA-OW gopepclde Binding Protein ICFPLLFLIYfYSHYLFA.'NI'ANICAIfIPIFWVSISIIUTNPTPAALTLAFASHLFCCLT
KLKSYSKERSFNLRFFAVfISTLWLITS'GCSPSOSSKGIFwHHKEHpRSLDPGKTRLIApAPLYFGSHLVT1'~EWWI
!SGfALifVNIVIWWtCSLMiKA(J~'LI

DO'LUIRHLYEGLVfEHSQNCEIKPALAESYTISEDGTRYTfKTKNILWSNCDPLTAQDFVHYr~

:>SWKEILKDaSSVYLYAFLPIiQJARAIfDOTESPENIOVMLDKIpILEIOLETPCAHFL
IFFPVHETLRNYSTSFEE71PITCCAFRPVSLEICCLRLHLEKNPNYHNKSRVKLHCPr~-0208 :x9935 250(102 E

HFLTLP ttrase KIIVpFIStIAHI'AAILFKHKKLDWOOPPWCEPIPPEISASLHOOOOLfSLpGASI'IWti.fptkA-Fructose-n-P Fho:.photrane SVAVIL19IPLYYDLDTIL''3Y'opPLPKEPOEAA.SLtA'/PDT.SHSKPWPCVKTLFPO~!'1H

NIQKKW1NNAKLRKAL.iLAIDKOMLTKt~YYOCIJ1EPCDHILHPRLYPGTYPERKRONERILpYLKFVQtTEMMITf LKi'':VNF'.X7f:PAPtX.71NVI0liLFNSLKDFHPDSSLVGFYN11r7DG

Lti\OOLFEEALDELCM'fREDLEKETL'tFSTfSFSY.RICpIILREOWKKVLKFTTPIVGOE~.IDITEEFI::KFR
N:::X:FNI:IrTORKKIYI'PEAY.EAt.'LKTAEAGDLODLVIIODD
TtQIK
I

FFTIOrtIFt.G:NY::LTVN~%'fl'AI1FIDCN::YI11IFANPt3GISPYI1LOD::HFDfLLIKITOE.
.
I::aITATAILAEYFaIiARf~T::I'/rVPYTIDr:UL011TFLDLTPGFDTATKFYS3IISNISR

IIKKHLf!tK)LItFALDYLClIf.HtLEpL.'l1I'NLRI\ClIKNTtWFNLFVRR't~DFRFIEKLtlIII:X:KAt IYtIFtKtatt:K: :\:at tALD:ALVI'tN'rllAL Ir:EEIAP.IWLM.KTI
IHKIC:iVIA

I>itAIWEKYY.NILtfF,.a:SStfETItIt.fTfIF.~..L::E'lt~ItI::RL:a~7pRLLKSFPAPI

rays n1.7 W751'1 :'aHR=
IEUfINDRONY.:N1:YSI:~R::."/f.YLI.IIII:I::NIII!)r)YFMNPFNAI::HfIGYt.A'K

..Nlu\ "It.p,lId 1.1. Ititul)n.t 1'I,ftl'Y~:!::IITI<:.4:ILt'I:\il.'.'Y:ff:."PtI:::IJU.'fYftYYIrI.RAIfNVKMFTVK(~A
~M.Q
1'r.,t.ein :KIIKO::LIIPtHDDPV
' Y:
' ' ' ' fta.
1KIYK'ILVDIf:::Pl\F'RI:.'s:.~lYIwAt.HIr.:YPF'll:ltlrtF:rl1'fTlU:l)NFP~LTLLWHN
:
::::
LfLLFI::L

IX:KVIKVItKNF::RWI
rllYlthr.:LDI.KF
:EDFt:::Y'fFFTK0..AL
V
' ' : Fl,IrrPINJ:t:cetrmn'r IT
rIIRk:~NOLRLAfA:at nF::l'Ir~AKILIMUI::IdVI.LFCL:LTHE.
.TP::aNAfPIIILD::PNPDFPKLIJIFp NF' TII'h:F:DIRfIAWI:YAVENSPIIISIfOt:I
:
f .
. ~:IW!l.'.IIV :'l04'l ..'.l'.'.'l W:
D
.
Ah'A ( fKPFNfKLF:?a'1"rLVEYf P1:11NiFILKKNpIffYDYIK'V::IN.i IKLLi t PDIYTAIH

Id.NIK:YVIMVt~I5n1~1:fIWRt.11K0::y1'ItYYTYfNI)t:AfyIIL:I.NMCSpHIJ)DLQNRIIRLAm:v :mt m .:.,n.t.mY/h?Htl. .ts:
.,t Ili'!"IN
~.t tu>Ha,l.m tmtrtt tY

'ri'fI.KP:aII:F'.At.cl:ryH'Af.'I'L:ah:Al'~,pMOYKltrll(TL'ff174:1ILVLTYP::OtLRCO
RfA~
KLWNF .
.
tE:::IItIIHI.KNE1'Y:'.1':"1'!~...yYtR.'a.IIMI:KIdd:Y1'.:h'I:YI!'1'INIAI'l'1'N':
LALAYEGNI

IaLrFY.HIRM4:IId.(t.l:!LfYlI1.l'VNKRKVQDYAfAT)TtNAYYfY:ANLI::fED

rxv rodt.r cuxmUxr G. i. t.~. ';~n~ottu.
EF18L Js ~: 1::~I98-IIL:.7t~IK::.Y'IL3FPRSLLR'.T.'.LWYRF:TNLIf:RY :..~.DDCPTEATKNtr't iK:..:F1.'RDN4Er:.TNPISEIVSET35SI1IDSYGRSL

IF
lA..$
FtI~::IIf.CAR4l:~L:.TDIOdLDCC0C14iWlrLLRLirt, LG
"/ .h[u~st~hllE
~
~

~:Fns021v! ".215 251147 .
noraolog Prea~tnc tn rJenebank/ENHL~
as Uf !1/7(99 ., .
H ..
NLVICFCK

o rodt3c E3674 IEfIEREIFKTIREKEHATISITrLVELE71L.1tREFJWLKDQICPTSDOETTSLYQCLDH
'I
KL

r r.pn_O::S 2b7402 _ O No robes: nomaloQ Presrnc :n GmtbanK/EMBL
:.EFYLLGL::rDKFLKATEDED'JLFESOKALD~AP8L1L.LTKARDYIrGi.GDI~wIIYOTIEFLiras of L1~7/99 ' .NDCVEIAKAKL
YTF10~IPKKNKKMKFNSIIFLENTKHYPOIFRECEYRDRNGUtEASL%JL:.STZTIZRSIL
A'fGSKYNRRAFCiT:iEIHF;.KTAIRDLNAYYLLDPRWPLCKIEEFVWI
' LPS . '~!...:
QEEYOKD -' ~!'FtDtETKEGt7E.:LLREEHANEKCSIODLORKL.."D::IELHDVSLF':FSKrf ' ..
' _ .
.: .
... . .. :.a:~.. . ... ..... ... .
: F:r ~ . . ..
L~~

.'."'r..KI!:~',:.'.~.r'...:_...:
it:; '.MI'r:l:?:.:.. .. =y-.rr: U..'. Loldic :oiaii :Vn _ No coDusc rJamolog present in Gaubmk/t>,8L
as of 11:7(98 CPn_0211 252765 252167 NSRIKFLtXtIDrAINSO'l'1'I'POPNL:'DAEPIASRAQCKSIAYIISLIWQIL.LLGL::I
No robust homolop presort in Gmebmk/DlBLSE
as of 11/7/98 ECVFISYPDISNVQASSiOSALLNKTSDOIOOKRCPKOSTFVTLAVSLYIIGSLFLLAGVALISIPIPGLAAOVALCLG
IVSLILGI1L1NIG1LCLL:.RCKOVPOKPDCLPSESSKOP
K

AGGVGLL'JLF1KSLL
CSTPI'ALPWOAGEFLEKVOVSATPILLPKNKDEELSAKVIOCEGAFItASSTKOAYLCB?E
SLVFCVLGIYLCLLLi LTVP

.
LIDivRKOEESRREiIRKKIVAEEAUURXRI~OOMAaOpErILRKNKELYJ1KRK
I
SHGVL

=Pn_0213 254081 252888 No roausc hrxeolog presort in Genebank/EMHLCPC~032b 261515 264967 as of 11/7/98 ELSYWWSIYSETLSFSELTSC%NSLJPFGPIETASIRINNVPNVtIIVCLI:LCTLFVCNo robust hdsolop presene in Gmebank/ENBL
as of 1117/98 ' LGNVPLGVFSTYLIGNSSMTILLLLISIGIltLLKFKERYCL.EPKELFiYEOGFDK>IG.PSElifK
AIfNRRRNPYYANfLEFIOGTOSLCPLfKYCFVRFIHYIIGOLEIEDASIIDiIDfLEPPB
' 'II~DQTADLARELDLEOKKD'ZLIRD!'SARLIM~SKTEKKOILKIGVPRN(SBIOERLCAVLCIIIGLiIVALILIt I
RTLLAAIPILGSVIGLGRS.FSIWSIREPODSOEYKSIfWtTI

AOEONSILEOCKFJ1LLFRRKS110EIFKKLYDRK)1AFWRSYREDLWCYSEINVSKXALSNLLATFIMANPCLKRYAT
FLFYS

YICDVFEC'I'APFFFIIIEaIYAMCRTJU04L11HYVINCIfEDNRYNEEIWiAKOLSVSELLCCCT

CPn EIyTDLFiETtiLfTSDSEDVLEEYOIFICIRV1TIWALWAIYNDEWSItKPIDTL.-~ d d id OMCYVIff><:'LELEIJ1QLYYDZ.:~.F, Ox MAVEDCZE"Fg< orv uccase dsbB-Disulfide bon ..
KEPPNIFVSCKLIJ(EIf!lINFIRSYALYFAWAISCAG?LISIFYSYIIIiVEPCZilYYOR

ICLFPL'iYILCZSaYREDSSIKLYILPQAVLGIGISIYpvFLpEIPGMOI~IC~CST
CPn _ KIFLFSYVfIPMASWAlGAZVCLLVLTKKYRC
No robust homolog presort in Gatvbenk/EM8L
as of 11/7(98 ILWFSRVIFSYfNQIGIPRLELILPLWKXENDPFCFLFSRVECtF'IIWIK

CPeL0228 266242 26512 0211 255768 25146 dabG-DlsulEide Bond Chaperone CPn ' _ ZNSSL.RCPL111DCILVLCTANPFIY
No robust honalog Dresmc in Gmebank/EtlRLVKDBADTtM.%gKFSCSILKItENAFEFYVFGSIKOL
as of 11/7/98 PLGLIIEDYERPTYCIIPPAPHPQRVDSKGCIAStIVS?<N1JVALEILGIFFLSGSLAPLVtfCFGFLI1lK10fIIL
PPKANIPTNA1WFP?ICNPYAPINTTVfEEPSC571CJ1EF1TNFPLL

TSCCVLIaAALPILCIC'dYL:.JWALIVFLCNKHkI'RODLOIfYDODLDSLVTHIOCEIPNDIKIQIYIDIGEZSFT
LIPVCFIRGSKP11A0ALdICIYIBiDPRQADIDAYIICIfFNRTLTYPI~E

SELRVTFEKLONLFQFHTImFSDLSOELOC1CFINCNERWLTLFDEVIIfFLIVRDIIfLETRCSHWI1TPEYLTXI~I
ECLILINSGRSVNPKGL.EQCIASCQYNDDTKKNNL7fGSOVLOGOLLIT

RNPTI'fGEQVKGI05NIFDLIiEEKSSLYLELYRLtAfDIAVLLt~IFFLLPPGIL1CVDYOLIEPTAWCDYLIEDPT
FHEIEAAIONIROLOJ1YDGDN~

AIKGLFIRLTSRLDIG.DVKAQERIOIFINF715REP'IfLVEKAFDIVDRATKIO:J~RAKKESP

ARLINGRTESLLt?OLIaIEtAL.ID~K)GLDPF1ILSIiFET.FSPYOQLLZLltYLNSIVLHfIYEFCPnL0229 LISCTVTSCLTLEECCRMRAASIIGLNALLVRlG4FR~IKSAYFEKLTEZEKELRSLODCT178 hypothetical P~tein 'lIKSLELE;LIHKIKDIVTLET
NS1D1!'SFLRIEOENFSFK~OfSIILSi'IYNI'ANLTKSTFTFILLLLLRiDIDipCLRt11D8*T

LEMYRHFRYRFLLGltILPAl7.cLLLRCSPNTLNY'1'pVDVIFSDRLCSCLLIFL1IABLT

KRSLLWLGIIPLGIWVCLF11CVAGASP'I'TFANDTLIGF71ILAWCISPTIlP6AZ.6SICPTLP
CPn _ ECPSYNPSA~RRAAYLFLSLLGWL.FARYLTASSLGITSSOSSNFLLLYSSIttEVYSLLV
No robust hotnolo0 Dresmt in Genebrnk/ENHL
as of 11/7/98 LTSSIOCOVNSSAIARDCFPSPSPQPSSTLGVFtPPKYKSLILSVSLfVLGVLLL.CVCFELLI.VLSIaGSERRWHTR
PKIVIIITAIUTGIIIILTLLPIIGHpLRYOCWICIGLTIEPAEJIW
' VNAIFSFSVL'IYGIGCAGVFIGSLLLILGLIFPVSYNRKL8EJ1TRSLLf<.Q4KTLLEYQPWJ1DFGSEYYKIfZLS
IEER'1'Vi.PWKAY>oQiIP~TS
FAYD!<LRATLRYISOFI&DKRALTNAS!

LRKEWEVOWSNFLLDEWEDTKEWAOHKSOFATFECDLLLFGREVCKYI"lIWILELDGRFPINOLWILVA'LVF'V1NN
SSNCLP1'TPRNFWICCIifIIVLFIW11BSLRNLRY1WLI

DVJ1LLTELIDOIWCPLEFLRIfKCDRiOCEIOEQ.RKZ'.lffBMiKSGLXT.ACELTXFKSALImVFSAAILFSPVL
PNIPVESPNFLPTIV1'GLILIILSIGKRRRTIOIKI.

KIEpfxYRDXRKVIIQ.EVFPOGYRRELL.EVLKTRLSVOCEIOLFEEW511!'LEICJISLNA
'CVFSEEELOEAL~tAKAELLDIOVRKSWEDLSCEP'1'LIpYHIiJtL.YE170CRIVtOFLTOCPtL0230 TFSSEpEKVLEEYFU.KARIRKTLiNKLDOVR71NVAFVAS1TDLLSFSESLt%~16VFEDCT179 hypothetical protein p RPIOTALIYMSSOPLYITSSSLSRYWLTGEEKVACYKItAPNHIWN011PAIIL71ML1JIPC

IFCPVLCSILiGAPLEGASILYDVILPWLLPSILVFYLLVLPWIYAYSNfDOpVLJItJIER

CPn_0216 257623 25717 Z1'08I~KEIYDHCEKEKRTPNKKALSLYIESOVLVPEYSKR1SSNTIGKTL1CIIPID~SP

No sobust hoarolog Dresmc in Genebank/Dt9LLSL~DELIOKALiR7IKENZYIB~JDRtKRDERFJUtRGxNIVSK'1NPLW8LiiG't as of 11/7/98 NKJ1RTNNPVTFDRIQVDFIPFDTSLRINSYIVAOGLLIt.CWLSIISYICLDIGLVGLSA

GAAl1'LGLGCLIFALFLFSFSLILLL9QEKRVPDVLSLYLEKEVPOYE'LPLYKEDLEBERCPI~0271 268996 DMSAiSERLGTTEEKLRIAOCFRYSDSVfIf cauB-A8C Transport ATPSSe INierate/Fel POAFVSIOD10GFSMLOAHRLCYSCDt~VILJIDASFOASPCTIT:ILGSSGVGkITLFRLL

CPIt_02I7 257881 258579 l1G!'LPLOEGLLWHGSPWR1~VAYNpOK>JtLLPWRTALKNNTLS'fEIGINTSNE~l7IL~iE

yip RLEEIIMIFDLCQLLDRYPDELSCOORORIALAAQCLSLKPILLLDEPPSSLWLLIC~L

PKCGKLKGFLSVNELIFCFOTFSVWIGV!'FASRCKAWL1GWLSLLSSIHNVFVWCpIHLYQOIVAWtKENICTVLLVT
HDFHDVSCLCDVLYVIKNKTLTPVPLDPSMRPLii4f3LCFIK

WCFEVTSADVYVICLLTCLNYARFJIytKNDINDVIIQ.CSWVISIAFLVLTOLNLFLIPSPNDLIDWLYT

DSSOEHFL1LFSSTPRTWASLVTLIFVOIVDIKLFTFLpRVFSKKYFA!!RS'LISLLFSO

LIDTZIFSFIL3IYGLVSNLCDVHIFAtIt.VKGTVITLATPTL.TVTKAVL~tRSSCPn_0232 270171 siailaricy to 5'-Neehylchioadmosine/S-Adenosylhaaoeyseeine CPn_0218 259061 258582 Nueleosidsse No robust hdtlolog presort in GmeDank/f?1BLKKP'I1BtRFLFLILSSLPLVAFSADNFTILEEKOSPLSRVSIIFALPGYtPVSFDCNCPIP
as of 11/7/98 IFLSKIIVFFESYDFANV115SWPKSLRALVOGRYFVDSELKtTPYRINDFKXTPINHRLYWFSHSKIITLECORIYYS
GDSFGKYFWS11LWPNKVSSAWACtMILKNRVDLZLIIGSCY

RSLPIISTIGCIIRLIEAliSGPIHPRDKNIfYRFEVLQAVIEILCLCYL:LVFDITCCFLASRSODSRfCSVLVSKCY
INYDAONRPFFERFEIPDIKKSVFATSEVHREAILRGCEEFIS

FLVAIILSLLLYCNSTFTCVONLSPTERFII.EGTGEAVNFLATNKOEIEELLKTHCYLKS1TKTEtTI'IJIEGLVAT
GESFANSPNYFLSLOKLYPEIiIGIDSV

SGAYSQVCYEYSIfCLGVNILLPHPLESASNEOWKHLQSE115KIYNDTLLKSVLKtICSS

IF'ti_0219 259319 260172 H

cgc-Oueu:,ne cRNA Ribosyl TransEerase CSSL1LKFHLIHOSKIISOARVGOIETSHGVIDTR1F/PIfATFtGALKGVIDNSDIPLLFCNCPt>_0273 TYHLLLNPGPEAVAItt.~LHOFMCROAPIZTDSGGFOIFSIJIYCSVlrEEIKSCGKNRCMSNo robust homolog present in Genebank/F71BL
as of 11(7(98 SLVKITDECAWFKSYRDCRKLFtSPELSVOAONOt.GADIIIPLDELLPFHTDOEYFLTSCEKARt?tFiGIIVLLFLL
RISRRSYVOEtGIFFHLETPDLKIVt.CAPYSTFLWIIIWSLKN

3RTYVWEKRSLEYHRKDPRHOSMYCVIHCCLOPEQRRIGVRFVEDEPFDGSAfOGSLGRNKGQS

Lpf?ISGWI(I7~SFLSKERPVNLLGICOLPSIYANVN~FGZDSFDSSYFT%AARHGLILSK

aCPIKICOQKYSODSSTLDPSCSCLTCLSCISRAYLPJtLFWREPNM:WASIHNLHHNpCCn_0234 :71216 QVNKEIREAILKDEI
CTIH1 hyPOenecical protein FIML03CKXALLSIWSILAFHPIPCW)VEJ1KSGFLGKVKGWPSKKEIOEEARTLPVKDS

CPn_U220 260660 261236 LSWKRYDYfSs's'GFSVEFPGEPDF15GOIVEVPOSEITIRYDTYVTCLHPONIVYWSVWE

rln robust homolog presene in CeneNnk/EMBL'IPEKVDISRPEWLOEGFSCl410ALPESOVLFNOMOIQGHNALEFWIVCEDVYfRGHt.I
as of 11/7/98 F'fSFGKKKCIFYMSKESIRSYSEISTP1'PZFRETPSKLCVAYKI4LRSPAKOCILRNRVSSVNHTLYQVFtAVYKNK
NPQALpKfYEAFSOSFKITKIREPRTIPSSVIfKKVSL

LKCALLR:iIPFYCf.FLCAICRIHSAWSNCDAPC1'fRVINYLVCCLELLGLG'VVVIaCKVLA

'fALKFLFSKASSKIKOFIKWREKARNLANtDN~S;KEFCSVDLTSCFTRCFRLRNRWEE~Pn_0235 271195 t:A,iENp'NREIIV kda8-deoxyoecutonosic Acld ..".ynchecase VFVfIYLU4KPE;IECf.C I(;y(,PARWN.~..:RYPGKPWCIFIGK.iLIORTYENASOSSLLDItI

t:f n_U221 2ti 1621 262051 WAT~OHI
IOHItTDF~AVMT.iPTr:.:fICTERTCEVARK'IFPKAEI
IVNIOCDEPCWS

Flrr toc.ru:c lwnlolog present E1IVDALVOKL.tL:SPEAELVTT~/ALTTGfEEILTEKKVKCVFD.iECRALYF.iRL:PIPFILX
in anrt'.tnk/EHOL a~ oc lt/7f~9 Tn111RYK'fEZJiOMVNRYK~3AEFF,ADNYYDDfILI'RMr:fKRNLRC:L11'b'ENEVCLFEE?MLKATMrfLHI
t:VYAFKREALFR'IIpH~:."TPL.:DAEDLEOLRFLEIfCA:KINVCIVDAKSPSV

Gl::>1I1N::11'IM:.:LIGtI'.HLII~IIW.':fODFKDSKIIIFIITALGLLLTL.:IC,IIVLLLKITDYFl:
4IAKVFX~YI'h:I::WIYF

t'f ILLILFTrt'.LU.'YPMY.~.MYSDFIIPI

r'r,r nz :o e7a t Iw : mn,;

nlr~ U:I2:.:1474 ~0_:14~ . IHr% 'Z'F ynrhut.r:u wwk ::iortl.ir ity trr OdcihriOpn.al::Il'rfYIMOfKi IFL'TI'l:W.~.::L;KCLTI.A::I,ALLLERORLFNJ1MLKLDrYLJJVDi'(Y114!P
'llfI 1'tt11 .tF:KFI.KIWEKIJ2flJJJIFEt.'IY)PEC'fRNRYN4Ift:.Y:RF'.Tt'f011AKVW:'.YRrVHEASLYEF' F:IK:EfYVfDCtNEfDLDLC'JI'IIIRF.:::AAL.:hIL::.AT7VJIYARVIKREREf:DYLC.,~NO

Fat'FLTI:f1'oflKIlLrr)YI:::LVKIJIWLFLKF.LRKMI::PHKIRYFfiI:A1'f.TK(.ORPHYHLVII~I
IfTFIEIIt.WILDMKCII.~.rTNI.I'fF:Ir7~fI:DIE.:LFFLFJvIRUFRY011,~.EDCWIH

1 J:: MT'NF'f WMVEVK::K r'fVI C: /rJfL1 : i'; I 1 ftM L IJ:R::fYFL'TOEVI(~K
L::LPr:NVFNR

AVFN1/ InVKHT IYE74Tf J1LA(/PJf LAtlf': :F:Kt.YLATVPEtI4DOWKVf.YN(N.uVIX.PKVK
I

'far ~I'.1t .'.r.1rSU ~~, r f n:W:K'IVOHRI1AYK::fFEALT11M4Vf/:IIMI:I
v IC IDAEGF1ILTNEII.fjt:LMt:LVPt7ItFG

IVY LEVMKFEFSVALKYLIPGRGI
AIV'r'LF'rI..LV'.'4tL.ii'.'F:::VIHGLC.~NIEJL
POVDL'YDPE4DYLLPE:
EKIA:
~:' YR(.~IF~:YIArbVtf~CRCOGiI'YfrICII~VLWEn.
.~,NLOOANaLFJ4DPNTPItP ,n LKPCDKAHKAlNCS5LI0ERHRHRYE'ItIPDYIOSLED30LHSPITILPSD1'YY~~rtYOT~NSSL:~T1Y:T
' PLKOCDI~O~'u'~~w'Y ' fPCL F
'MFX'AOf'LYAT'SCFMRLt.A TNFL'fYPSKLSYE~YGPtfD~T~
IIr;LAtyt:fr:I'FOCLCEItEV~DNPirltt.VOFHPEIYSKLISPHPLFIAFIEAALVYSKDA,y~ySip~Kltp YTYIrII~,FYNpGLSPLGfOIZYFIDPDIJIRSiR50 .iHV ~D~,yE
Ta~DYOYFOP

7 ~
4 ~EGId ~I~i~~~.

421 PLVKAImLO~ETfMAFPGONLPNSVHPOAIYFIGLG'i.
.F~_023' 2'3741 2 'fiFAIITLKNtA

Y'fAf FnmtiY .
'.rK,vYtrI~KRI''.LAYAAEPLLLTLP:~NIEJ1GKNLKLiA)tAIJFKACGWIG
..,L~t,IC,IW;Kph .:.\t.' ~ :Pa, W'!~1!'.-r......-PM
' ' -.
.:rAE a7rJ~
.. ' . . .. .. ... , ..::.iii.:.F
.... .;,~., 1.. :.::.:: .. . .
.
.:,...., -, ::
~
~

. luu: -:. .. . yOtlJ c1 . ~~:r..,'..:.._-;..:,f::
,~..a.FYY

ri37-L33 Ribosas~ai Procsin L!'K6M
' 274110 275839 L~RL~~DRKLRRIN
_pn_0239 KDSSNRS10IREIIKLKSSESSONriTfIOtKRK

wf-Giueoss-6-P Dahyra9snse NFLLFVIFASAGtID!EIIOfMVVOETI~I1~ISPRTCPPCILVIFGAT0151 286036 287559 ' CPt ~CMIOKLRDFNPR 4-L'IKECRLSDDFVC'.IGFARRFJCSIt~OENK~VI0FSP5F3.DIKVconseswd hYPothetieai Dsotsin SPDICLPYMSPFIOCTVIiRLICY15FOKE8A?LPT::REPRiTKSLGStNSVIS101KINF
RLFYHRSEFDNLi'ICYTSLKDSLDLDK1IAL~POYFSRII~.NID
F

IEp ISI,fiCSRNLtI~6Vt4iGILLKIVGY11'~STNEIItDADYL.ILN1CAFUtSARDGKTIYI~NL
OQ ~SNINYLiGSGfIVI18ILSAIESRiS~ItIStIKSIfI
KHKLFYKNDODGKPIiSRVITEKP~IDLDSAKOL00CTNflVIHIDfIYLiGK>:NO
KSf:NLRDINONL1110LLC

NIL'I'fRFANt'IFESCWI'tSOYIDHV0I5LSETICICSR~IPFEVTI~~
PYITFW1DEIRKEKIKILQRISPISEGSSIVf10QY0~'VO~~~~~N~A~'~' IIPSIKIDtLRSKPL0p1L1~ERTLYf~BV

:.LTNE KEITLIAODLGDYG~LSTDRE'~OLEE~NQ''~MLYLYP~It~
KDSRVETYVALKTVINNPRWLCVPFYLRACKRLAKKSTDISIIFIOISPYIILtAA~SI~GFPGE

PLtIJGLL::AIOPOEGVALKFNGKVPCTNNIVRWI~tRYDSYIOrT'rPERYERLLCDCSNPKLLPYVDIPLOtII11 D1lILK0!'BIIfIT'SRCQIIGFLEKLMKVPQVYIRSSVIV
EEiiDODSSPSFPNYPJIGSSGPKE~DALIERDQRSWiIDF'TGEOWIDNLGTFLYSO>FN?fPAt\ELPDOIPEKVKP
SRLILILSOIOKRNV
VNASWKLFTPVL TOBEIOEL

' .
IICDRTLITGGDE
pKlOIDKLIGOLIEAVICtIYtIPE114fid.TJIRFYGpAPEVDPCI
IVNEAIa.VSHIGCI~FIE

RPL I~~WB~~

~Pn_0239 275863 276672 . ' .
se-6-P Dehyro0enase IDsvB familyt Gi 01ST 288112 281576 CPtl ueo -devB- CT111 hypocMCieai Ptacein lframle-shift KaISt'toiIGITNATLINFND'PNKLLLTKOPSLFIDI~SKDNIASANOATKDLwith 0257?t SGGKTPLETYKDIVINKDKLTDPSKIFL1WODERIJ1PITSStSNYGOANSILR~I~tIPDEATSTVCAWTZrDnOSlB
7DARSCSFRRACRFriRYWLGGVIRIPNNKFt~tdl'STDSIVINSAI

aIFR!!!:1'EIPDGAtOtYOELIFSBtIPDASFDNI1'G~GLCED.~.YIO8S0''O~~IPRLFRTSIlXIKtIGDNI
DNCfGGELLLVAYW10NPLFPDIR
SLFSNI'SATEBtl~LW N

FNSVPtIhlTF7Df1'LTFPMOOGKNVWYVOGEMOCPILKSVFFSEt.'RECKZ.YPI1~V~DIEiaIIS'fCSGTSYY
RARPIIGNLCSTIYA~~'~'~SFRVpSPBwtIATLPFV

ASPLlWI ISPESYDIMiONISSTYlOmIL

0210 277861 276698 CPtL0253 288171 287950 CPn Cf111 hypothetical Drocsin Ifratae-shift with Ot53?I
F

_ FC"f3CRT!'ISSSIPTCOIfITISIPTFVRFNIESINLTDEQKKTALTTGONIATEtIIWi~GN
No robust homoioq present: in Gsnabenk/F1~L
as of 11/7/98 LVYPNVFSPSSESWKaNSWRSN~~VSPSESTEY!!>fSETM00RVPDIESLfDVDADODLICONtttE~N~P~SGRVNL
SNSPFSYQOSIGMtRQDYI4'tIt~fl RP'fD!!8'ffGFItAAQM.GNLFNSFGILI4lCFSQCKSCOTPGC>El'SATVLCJ1TLLF1<WALIEQPOQYVPY~O
a'TN~RAALSIi~t~SGDI414GE.5lIYLGTSSIKI~I~VO

t,GpTI~,AL,VYCAY1NY'1'LCKtIIYSIliKAItAKVLRHP110ERIFNRARGVATIRSSiEGVK'~
~
I

CPIL-.0251 289368 28859.
itLYKSANIGSLWSLI11SL71LIALTAGIVLVLFFVAPGRAPVITAAM10CCA7100GAI
tDLFLTDCISH
t fItATX CT143 hypothetical Protein SLtGWIAIVNKALDIOLTN~~AVSERLLHDPSNFOATLSVIaNVRi~BJLETRDLKVLLPfTTSPCEFIVIfONIISAt xS
RPSOHYOGSSDYQHRRGINtONFI'GSHFOGOOGFAGSH

YGNLFSNEEVAOLVOGGAPGGGS IPHKTLt3tIlmONLFIDO
' SAAI>ALTFSYYRI(TOCORANLYTYYPGN

fIYC ~T~p~ntSKTDVSOTP4CNNTSDPO
:..AGYPTAPfNPSAPPPFPPPAYD
CYYVAPNL'ttZTHVAATTiKSV&RNRTPDFBAYADIEPWKLfCOVCIYf7Y11'I<u.TRYIBCQ

IATLTINFVSQiIOITLLC1'SD'~GYSSDRTSVAVTAIFSVTILVSSPIYDrPWI
CPt>

_ It~JtSLS~I~PFPSNlV6VD
No robucc hosalop Dresenc in Gsnebank/E!!8L
as of 11/7/98 :FLVKFMSA!lISLSSSHGSTASEfI'pVRDVLVSL~EfYIDREfEILPTKVFLRR01Z.SS

TAIIDDIJIDVVETBIGBHIIFOVYSNTSLR4IYORFFEKIFOICCCFLLLVTDBNIfl'DPOGA

CPel L;TCIIF1111Vh!'1'VCAIVFCPTi.CTLCYSAY1CTY0LTKKISSLSRIIi?Zi~FTNSVOKSDPFI...
AAAAS05TIKACKStFROSTGTFFVt.GLIITISLAALIVGLVFALtTLDPGAPACT12 hypothsciul Procsin V
KNNINIiFBCYFtILDSTVDCDtSaANLKTFtI~AOGISS1'CIFSIIOQItiTPKDO
i A TLLKVIN
HRSG
V8A1GLTSGTI~~ONFTEEOISIDFKfBIRLSNCALPK6DCDPVPANYVRBPY!!CS
I
VIffAANIOCCAAGG'fGILLSVIGFLtabvYSWKSODGVHIQOn'ALLRCIVSNI'IION~Y

LPITPCiI7UfVLTOSIRRYDOFFSDDEYRDIESEVPLNRQZTPPPSYE1'LFHEECSOaSSNKPT.IGDTtuNSO>eS
~T~ETTI'~NVNSTfRTIGWKQSTRIL'NC~I'AZ~T.RA

VIPRCSPPAYS1'IDSSNSPFPSSSPPPYYA I~ELypK7lNptBJNGfIOGRIYINIiDLOCVGC
1STIYSOGCYATICrLCrtTYRASVD

VAPNPNDPNRSDNYNAGI~~IGNYSFSLLYYP~C

CPt~02i2 279975 279487 No robust ttowolt>Q Dreesnc in 0256 291=82 290398 Gsnsbartk/Hlst as of 11/7/98 CP

KSLKYCSLYOFSOKPTVILN71CSIFF1(MSt]f'D,n.
YZmEPLSKKTACLWD?f4.YPVIAWCA CT111 hY9othsclcal Procsin 'NSWLLILKVLFLLLSFPFtQ.CSASSALPCERVSLGSHFKCLYGCCLPYLLitCItIVPVFCGGRIJISERA'PKTKI
SIprIVRFNIOSTNLTF~OKKT'1'FrVCCKSt l'fQ'lIVVR~LrCT

.GTAIt~FI
ISHRTSEDARLSSAIVII~APILOL71015GLIKPDachTCOSt~r:aKD:rITRErsTNSEIVeDCRLN~.sNSPLma ~ISacODTTDraaESSaKP

OEYVPIGYYKRTOIEIIR~ORARN890YVOOGSVPSCSYVPwNKFDOTS'fQICISCfEIYTDP

CPet_0243 280609 280133 E1D~TK<.VE'EVNNKVPKLFET~I~~TLLRANEY00000RINYfDLRN
No robust homolog prn~t in GensGank/EL~LBRGSSYYE'fRPI4YVCVTYYAQ~CYETFOEaRAGGCLRVSFPSwNIVIILPYVL
as of 11/7/98 iNYNIfLVFLLKFVKGRIINACSIGYItLCNANEPDRF5111SINALV11DILLYPFNAVIGWTT

FAVLltWK<.LFL71TKFLVNfCIAACKSRPLPSCKENFOCLFGPK~(PGPSDWLGCLVLIP
CPtL,0257 292136 291267 IIGTLIYSTIITYOSDZi~RLRYFIISPAYInICSTAIINWCT143 hypothetical Drocein _ GVVIBtRRM.OKTGPHASTPSINttAfNtG ~0 ~ ~~T~'N

CPn_0244 280906 281556 ADTTTSPCEFIVODCB.SAESSOFKATTLSKCLBTTSEDOODAVPKPIfN&DPQSPR011LT
' adk-Adettyiate K>,nase GAPLVTKCSVFIIMGPPGSGKC'EDSOYLANRIGLPHISTGDLLRAIIRELTPNBLKAKAYPYI
YNY1IRMLiCOAtNL~SSSQPL'NGKPIETVC~IPNPE'fYRISASAKIYDAVIII!
IIIRESr#ISGLDNPNbYWI3tIGI34KTLTGU~DTRCY~RtRTSIAV

LDKGAFItPSDFI1WEILKEKLDSOACSKGCIIDGFPRTLDQAHLLDSI~i~VNSIVYI'VIFLOFE~GIYOVrIO
TGTFTLTEIVATPPHDIfPNLFLE1TIGIDIKSMSTCVIWFPFOANFJILVD

ISFDEILKRVCSRfLGPSCSRIYM'SOGHTECPDCNVPLIRRSDDTPEIIKERLTKYOE

R'fAPVItIYYDSLGKLCRVSSENKEDLVFEDILKCIYIt CT112 hypothetical protein /frame-shift with 0259?t =Pn_0215 281627 282199 CFSFCRLCSKFEKITLOCKCAIOLLAAGTYILTPTICKRN~WERiL~3GSIRLFBt:KYTGD

ydh0-Polysaccharide Hydrolase-InvasinQMIGGSTV1STI~TAVYRDHSDIDPDPNNPSDKYEB'MFLfYRNCOHSAVIG
Repeat Family TCOKEIMCNIfL.iFSPSADFFSKOCAIETOVLfGERVI:JKGSTCYAYSOLFHNELLWKPYPNYSITLLYFAG~fV

r:115FR5TLVPCTPEFHIHPNVSWSVDAFLDPWOIPLPFGTLLtNNSQNNIFPKDIIJ4it !4'ff IWGSGTPOCDPRHLRRLNYNFFAELLIKDADf3 .IttFPYVWOGRS1MESLEKPCVOCS CPn 0259 293031 292141 CFINILYOAOCtNVPRNAADOYADCHwISSPENLPSCCLIFLYPK6EKRISHVMLKODSSCT142 hypothetieel Drocein /frame-shift vith 0259?I

TLIHASCCGKKVEYFILEpOGKFLDS1YLFFRNEbRORAFfGIPRKRKAFLI
YFYFKRKTYtNFIEM1'I'INNODMIECYFKLDSTVDCDLLASNIOTFOttOAKCISSTETF

3tI00NATFKEIfVSATCLTSASTYKLNATGPAPBSITIDNKNNRtSNWILPKNPCDPVPAN
~P
GTME'DDSSRYLPI1GDCSNYTLYOSSKAGDVPRPVDWOONSKKL

. YVRSPOYFFCAKPIE
n_ HLGLiT4PYNPLtAEPTS
rs9-S9 Ribosomal Protein VvAKSTIQESVATCRRKOAVSSVRLRPCSCKIDVNGYSFEDYFPLEIOATTILSPLKKIT

EDOSQYDLIIRV.iGCCIQGOVIATRLGLARALLKENEENRODLKSCCFLTADPRKKERKK.

CPn YGHKKARKSFOFSKR _ secA-Procetn Trensloease &ubunic AYLDFSKRSCVEEDNVSKKINRE7tK~CPCOSNKKYKOCCLKKEEOTARY7Tl~%PKPSAEV

tp =0247 283130 293969 LSASEOGEIIGONC'MtLtORISOSLTSEOKMVGKFNOITKItKEIMSKKALJIKAQAKE6KL

r11:-L1) Rihooal.~l Protein VTEKLQOfINFEIWICENWPPEIFS1'ATLNOCrNFVUEDFIPTOEDFRISENSOKPPVEE
D::YIINKILRKOTK7TiVK.iSETTKSWYVSIDMGKTtl'ALS:uEVAKILRGKHKVTYTPHVA

McaXNIVINAEKVRLTt'AKKGOKIYRYY7~CYifWF.EIPPE1d94ARKPNYTIENAIKCMMD

I ~HfIetL:KKUIJt::LR I VKGDS IETFE;eKP
ILLDI CFn 0:.'t:l 1'1A27" 3~StW t ~'fw n4>t :x4151 ~dO:Sf! ytl.~-FF-Gx.f. .utlltrt.lmilY A'rPner.
~.FIRIPFIVFN::'1'i.LIIdPMStN:KRLE:LVRKALYTIITtILANIINKIWAIJ#X:KDSLTL
Y

;'~tV/Vt4N1 .\tu' 'Tt.mcpwt.r ATF.~r:.-.
L::K?IOOQN LtJtL.KAI.
.TRC.ftWLDI.ItAVNt~Y:KY~tr.AEVNKPYLTFIt:DUt.CIPFRTIP~WAPETP

te::1Iat.I':ItVIt'/A'rP.:It::FR.~.PA<:KK::RKtL\t:tdtlIFY::RIJ1M::LLIEAKNEr:YPt:
.~.UARRRLLFOAAKER~A::AIAEY:IIIIRDULtIOTALLJit.I.IIKAKFNitLI'VIOINIIF
A
DLKNODL
:f'~ LRFFGY
' ' ' . ?dl'fUiFLIETPEEWIRKFAKtah:FARVTt'tt , IM/::(.Ii::YAKU::IJtt.lB~VFPL7WIWIA
M.LtILtF:fLDVP:~ K
iY:A::t:M:K1 Y
I::I I
:'tJl::If:l'IN::1.::IJINa'I
.nIttKAIV:F'VIy/tll~tLl.l:DVrJLIfNVltfti\LLARY.tIIwYti::PVYTRALELLGLVNLEDKV

1't,l.:::Y.I:r::l?Yn.NtVAIAItALINBI'AtLIrIDRI':aatILEET::EUttINLLLEUA::ALCGIL.
:Q
LAfQEtK::::

!V'1711/KItI\::1':::HI77lI::N:KLFI'IIN::nanVl2i.~ ='t.U'.'. :'n':w t ~

~IH!1 ~1' : NN'it1 :''1111 inllB::4rB~ t ikne AI:1.1 ItN7f:~tl.It.ll:,.
I.IFNINKEItK\t'tL'IIJ~MKALKI IL'1'NUU:I'FAKtxt;;t:t.V.':ALLFNIIt:UtYIMfVAE7U::

'I'1'.I tnYt.n 1.t m.vl tt..t.ein GiLRTLFE3V3PDLVISrINCC FTEPOAV\:LE:.RL'IiLT'::.
.iQK6YF.ELLNKiAYYKCVw"L'ECw:YC:IRNG4C:.
I "
:SP" ' ' ' ~

.. t ~iDOYVKRFIPVKYFKEQRRtIDHCIF:
X fNE:.IfiIT.
1CAS LKNNIIVARRT".tEFDAGPIROrEOII
PYAYFQPVKGWAV
..N(JV
:K.74\L..
LtSOPFP

tlNt':KN\WV:x.TICMKQALYt7raW'NALSOtNMISFFQQDKAPEZUtALVIYP iN t P YW4'IOIWOT.Pt~u .FObD(CACFLKAY'd311K'Illd(Lb ':LTt:IlJINFPT::Pf7CSSwlOL7INLVPPvIDEFFYCEPQYLGSVNKNtIYYVCKISG1I1LICAt~
?~~
PCEELAAlWIK11FI7114GFta~'LA SLt7~'~PIIK

EELACNLENfII:,IIGPIF.ipFt;aPICLM'LCEFOKTQINPFHL4LLSSEL1TKIFHIVSDEOfVMLf'MtGMAVR
FPHfxVRPhCRTARGYRGVSGIQfECOKVVSCQIVCD~SV

LIVCOpCft:KRSLVWfRETFWOGVGVASILINERNCNVIJG11IPYZDHOSILWISSQCQA

:9517 297136 IRIIIIQDVRIMCR..~l~7r:'IRLJHLKEGCALVSFtCKL~SNCJDCeILSCSEEF7CSC:YSLR

Y7i17 hytbchetlr;.tt Protein ~> .
aPRKLRVRPP::LAKYAFRGFRNSHCPRPTKFSFPLYFSKtLSWFIIGGFiJIAC~uV0- ~Jr,01'~4 ~
' .1 , \L. .
.. v:::7.:7iYA:. . ':L~!':F~~:~...-:- ! '_' .. .. ..
.'~I'.'-'FI
.
' . ~.r .. .. .
."(f.Y~:.._;,L~.:
::P1NY::.'.~Cr;;'E:f::fNDPKEKNf;.:~AtTi:.c'v:wn'fRlfRi~tSIYIi:l7lto'C,isiitlLti 6llYLtNiDEANir.lv.i :H:::
.u.Yil;

:IINKKKCYT11GOLILE~INFFIfAiiGIVY%NWHTAFYSFLTYCIATKYNDNVIInCLECCRIDVRILCCCCIVIV>
xIGRwIPIEVNERLSAKOCRlYSALCVVLTVUIAODKTOImSYKV

KSVTI:TSSPRKLGHIIlIEfLGIGLTYIHAIIiCYSCEPRNLLWWEItLCLSOWtEIVNRSOGWICVGV~CVNaLSEI
a.VA'lYP'I(DKKCYONLISKGIPITfPiQYV$VSDROOTiIVFYP

EOPSAFIAIFIfLIIEVINCRRT
DPKI8TC1'P~ISILIOIRLRGJIFLNRGZTZVFEDDAOVfSFDKVTIPYCOGIOSIyfYIH

OFB~L.!'SEPIYICCfRVRDODEIEFW1I4WNSGYBELVYSYJMtIIPI'Rp00TNL'1'GPS

CPn_OZ6< 297770 297155 TALTILVIFT1'YZKAtOHWO'a'tKL7ILTG~IRi~i.TAVISVKVPNPQI~O!'IOQI~NBDVS

ubiD-Phertylaerylau Deeasbo7cYlaseSVAQQV1AC&ILTIFFCZHPQIARNIVDIIVFVAAQARB3IAItKARB.TLRK8ALDSARLIGK
WCZSCASGYIL71VKLIKELVNAKHQVCVIISPSGRKiLYYELGCOSFDALPStEN
M
R

K
LIDCLEKOPEItCQ4IfIV~,DSAGCSJIIIpORORRFQAILPZRGKILNVCKJIRI4KIlONQE
Y ~C
LEYIHTNSIQAIFSSLASCSCPVEIiTIZIPCSllITVAAISIGL710NLLRRVADVALKaR

LMSIHL>DiLLXLSK5Gl1TIIPPNPNWYFIIPO~~L~~~IIOTIZAAIGG:ZGADIfIt4.SKZ.RYMIIINl01101 fDCSNIRTLLLTP/YIUIftALI:

PLILVPREIP
VYZAQPPLYINSKIOtDfRYILSEKDfaSYLT2lLGTNESSILFKSTCRCJICPaLCiTINV

PSDLTKOwSNPE
ILDVCSFIIITLEKKAIPPSE!'LElIYItO'..Iu"YPLYYL71P7f1~f00GRYLYSDtAtCCAL110 EC1?IKP1CIIELYIfVAYFVDI~LOLKKYCLDISSYLIPOKNEIVILRf~SP5CNY8CYTLE

=Pn_0265 298672 297730 EVINYLXNLGRKGZEIDRYKGLGL~Ii100LWD'1'l9NPOQRTLZMVSIJfaA~TaDNZPTIQ.

ubiA-Benzoate OctaphettyltransferaseNGIlYPPRREFIFSHAL4IRIFOILDI
:!IIIVRLYYFI1JLVNTlCYSIFSILFLSAS1Yf11LSINEZSpNLSFICEGFKISVFG71IAFV

FARTTGIWFIQCiDAFTDKIQffRTSKRVLPANLVSLNFAWVLSLICSFLFLFLCKZLRIF

.'4IVYPYMKRVTFFGIRJGIIGLVY1Y11ILlNlCAFAFSCLSIIRLCFLULiIGG9 aLGIASLa CPh~0276 311110 310 . CT191 hypothetical Drotem SVCNVIAaNDIIYAIEDTEFDREDGLRSVPAHYGEK101VEIAKVNLWVSYtJIYIFSGTVGI7NP'LKRKKRDGSQVO
NKRTASPIIDWWYLFOfYLQEI.QKZNiIANPN011IDAWNOVf'ItDKY

SLDKEFYFfAIIPLWILKWRMYSNYSKKDOEGLSKPFLANIAIALSFLVSNTLTWSLSKGMSpAIGFRDHILLVKVYNS
SLYALLI~'1'PONDLINSLYQVASNVpIREIQFLI~r R

C?n_0266 299181 299876 CPIL0277 312003 311104 No robust hosolop Dres~t in GensWnk/ElmL
as of 11/7/98 No robust halsoloQ Drsssnt in Genebehk/ENHLIKHLPPLIFYGYILNZIHVRAtAtGITSVQQPSTN!'OAAIPIL
as of 11/7/98 NISIFYPKYFIEGKCVL

IMALDEINNOF87PSppI115STSOTSKINODR1(TFACTVTLLWATL!(ILSDIVLLtTIGS.
NIVIOCSRZSSTYAEDIEEVAQEIILFJfSTNSKSSTSVM.WJWRVRCIfVEILCOCIVILAL

IGLSVPLSCILCTFAVTVCAVLFZ1CLTILVRKSLGIEQ10~DI3ifT.KIKTPTPPARPLIf VVVVV
F~
t GEC<

SKFSVrCSTTSIVLGNALLIG71WSVFFL'iGYLpLCLCACLVCLG'1'ALTVAGLiIRNSPRS.
C
I
YLCP
G
EITALSILpVIZKLIItCLIDVLCVCLFGLGVCWAIIG71IA

WDOCCSGSADSQSNIVGICEPKAAQDOKWY)0lAIMIG>mGZPTAIIILTPEKPIIVKTi.ISPDKPYPfVVYV

=PtL0267 300122 300910 CPtL0278 712881 312060 No robust homolog Dresent in Genshsnk/EMBLcaausrved oueer eualbrane llpopsoesin as of 11/7/98 SINSWd(TN71LLNQPEPAVCLNAWDPKYINQDRKTFACTVtLLVZATLMILT1GVIVLLR08FBfKICLSLLVCLIi~
fLSSCF(KCWIpMCIRIVJ1SPTPNAELLESL01<aAItDIGIKLKIL

VS:aGTSVITLGTJ1LFIICLVKL210f5L7~WI0YpKYFOE<fVICOKYEPFSPI7CCYRIPNRLLLDKpVDANYPWI
011FLDDECE<tIfDGIOCELWIA1MILCPOAZYSKKNS

.
SLaILKSQKKL.TZAIPVOIITILIQRALNLLF~CGLIVCKCPAMitIffAKDVC~KCiRSZNI
-PKNONVttKLTSG.PSPLDIESPSPEASTPVSIQ.RIACSGYAiVILIVTLLIGAWS~IFFC

ptaLllIGFACLGT71LFVGGLirGLRTNSLIAQGINYLYLTYYLSSALEERNEITIImQLEVSJ1PLLVGSLPDVDAA
VIPGNF'AIMNLSPKKDSLCLEDLSVSK7fIi~ILWIRSCWGS
:~GYL

, P101IKt.QKLFQSPSVQHlFDTKYFK~TIILTIEI~FIC
RNEINTYLTEfl(~tQ01dtL1DILLE

0268 30091 701318 CPn_0279 313516 312875 CPn _ Pwsibls A8C Transporter Peceease No robust homolop Dr~~t in Csnsbank/F~LProtein es of 11/7/98 xawOltSLNSQCOSSSTS"110EWNKStVPFK'R~1PTPPLSPIPSLDEFIL7IYEPtI2PKSDPEKKD~SDLIQIL.L
KETVNI'LYIIVSTAFF1SCAIGCNLGLGLf'C1'SPItBLNPIDfSLYATIS

NAQIIFtPPCI'STPNVFNCIDDtlIPLLGpPNEOFE1JIFBtPGTSCSNPTSLPAPtE~'EtNSNZLSE'LTAIPFAI
LZVILFPITRIdIVGTS1.GP'1'ASIVPLTZGAIPFWTIWD311RNiAL

QECZaGSCN~LIG
NYLC$J1VALCIPKRNILtGILLPESYPQLIFSLKSLWNLISCETL7IGlVOOOOI~pIi.I.

QYCYYRPZ<3iSVTtSVLVITLVLIESVRILfIDIWGRRVLKIfROIL

CPn_0269 302168 301176 CPIL

DipeDtidase -, VATRCVIffII7F0LCIM.LSHPNFGRImPAVRCSPEQLLSOCVRpQVCAIFVPHSRGEPNCDItdppl'CiDSDtids Transporter ATPase pFtSLifSLPNQYPDIGLLSYEEEElIGSSS010CSLSLIRSIEN1l5J1LCODTAPf.C'ILI31KLIKCGWLVSEpH
SPIISVQt7VSKKLGDILILISIfVSPSVYF'CEVFGIVCH9I'~GK1TLLRC

IHLTKOGPIJ1YLGIVWIOGDNRpGOCfF.APIDtLBNDGKVLLDINYELCVPIDLSHCSCKL.ALDPLDNPTSGSISV
AGTDNSLPTQKFSR11NFSKI(VAYISONYGLFSSKl'VFCFIIAYILItI

EDIGDYT11DKLPF7LiIVIII.f' NSTIPRSVGDHRANLVDAHAKiZVRRIOCVIGLNGVRSYtICDSHHSCISKSEYCEOVYI7I'I14FLNLYNRNDJ1YP
GNL.SGGOKQKVAZAR71IVCQPLYVf~GD<I

IGDLEKNVLtIAENiGILSSIVLGSOFPYANFaENIFFFtF.CSSAffJWPVW~OLIHRIFSItGT511LDPKSTENII
ERLLQI11QERGITLVLVSHEID'WfOCZCSHVLVl4fpCAVCELGTIEE

KAESILSSRAOSFLKQ11IVEQVNPKITDVKf.
LFIi4SFiISITNEL!'HEDZNIAJII3SCYFAEDREEYf.RilIPSKELAIQCI
ISKVIQ'fla.VS

INIL~FIINLPAKSPFFGFLI IVLOCEYD~tKKAKELLIE1.G17VIKTPIf CPtL0270 303313 302168 CPn yvlC-SuAS SupatEamily-related Protein_ SIFGVIVPDKIUQITFSLPEVMSAINQGKNALPTD1'VYGFVLSLY11SE71EERLYALKDR-dhnA-Predicted i.6-fructose BiphosDhate Aldolase Idlhydsin EPSIGFALYVNSIEDIENISGYPLSPTAKKLAOLFPGAITLWKfiPNPRFPKLTLiLFAIVEamily7' DNS~ft7REIVINtCGTLZGTSANLSEFPSJ1LTAQEIFADFADHDLCIFDCPCSHGLESTVIIAISLRRHTWtrIIHD
ILGNDt>E2tLL5YQCKNITtmKLTLPSNDFYDKVFCLSDRFB1RVLRS

SDPLYIYREGLISRSVIENI71G'fEIUCIFHRTSHAFSKHIKIYTSIIO~OEQLVSFLSGSLDFLOTNFSI~ftLANS
GYLSILPVDpCIEHSAGASFAINPIYTDPF11IVKWIESOCSAVAST

KCWCENPKPIOIFYTRLREALKKKTPSIVPIYDINt'SDYP>r3.fPFLSPYYIEYCTLSLLSRKYAHKIfFNLKLNHN
ELISYPPKYHpIFFT0VE7N1YSNCAVAVCATWIGS

ETSNEEIVAVSNAFAKAPSLCL71TVLWCYLRNPAFVAFICIfDYfffAADLTGOADHLWTLG

CPn_0271 303628 301362 ADIVKQKLPTCQOCFKAIFtFGICIDIItVYSELSSNHPZDLCRYOVINSYCGKVCLZNSGCP

LysophosDtsolipasa esterase SGKNDFTfAARTAVINKRACQIGLILGRKAFORPLSFGIGLLNLVQDIYLDPNITIA

KLIifDYSFFRRKICNIPJ1IECPCNPQDPIIILCF~YGSL110NLTFFPSICSFSKLRP'lWI

FPNGILPLENDFRGSRACFPLNVLLLpELSRLYAF1GVI07LQEKYDELFDVDLETPKFALECPnr0282 3160A1 ELILNf.HRPYNEIZICGFSOGAIIrITHLVLTSQNPYAGALIFAGARLFNQt2rlEOGLKQCAxasA/gadC-Mino Acid Transporesr QVPP'LQSHCYEDEILPYHLG1W
LNDLLLTKCI4CpFVSf'H~HEIPSWFQIO~VTVPNWIILILQSLNFSKKVETMSHSKP1'KPLCTFT~IiLSLJIW
I:at.RNLPLTAK11DLSTLPPYCL

DPARG
AVICPFIIPYALISAELASFKPpCIYIWlIRDALCKWWGFFAIWNOWFiUlflyIYPAV4AFIA

STIVYKINPELAHNKIYIATVIIJIDFWILTFFNFLGITSs~ALfSSINLIC~LIPCVIW

CPn_0272 305272 301340 SLA4lWIFSGNPIAISLa~FtLLPNFSMISSL
DNVNPRK

dnaK-I711A Pol tII Gases and Tau NYPKAVFICAIA'tLT:L'JLv'SLSIAIVIPKEEISLVSGL'(K'EiTLf'IDKYNL9WIfEGIW

FNRQSI7AT'IATyNMHLEEENQGWFrILLRKVYHQEVPPAILLHGPTLPVLQDKAEOLASEIVMTIAGSIGEit~AWM
FAGTKGLFISTONDCLPRLFKIMt3KNVP'INIJa.PQGIWTIPTL

LLS.iSPCSEHKVSQKIHPDIYQFFPEGKGRLHSjDLPRCIKKQIYISPFPJvNYKIYIIHELFLCLDSADLVYWILT.
1LSVOHYW1NYICLFLAGPILRIKEPRApRLYSVPCKFIGICEF1 ADRNTIJWtSAFLKVFEEPPKHAVIILTTAKVpRLpKTIISRSLSIFIERCEKILCSKETSIU'..ILSCAFALWVaFL
PPRELAQISFX'SKICY7TFLLLAFSLNCLIPFU'IYFTNKRLSK

FSYLPRYAQCEIPVTEVSQIIKESSEl'DKQVLRDKVQRFNEVLLELYRDRY'fW9tGLIUSKS

.\L.NYPEMIKEILQLPLLPLOKVLLIVESACRSIaBJSSSAASVLEWVAIQLVSi7pYKEKEL

vsVSP~COCL.SF7 cPn_0293 318581 317532 Nn cobuac homolaa Dresenc in Genebank/ENBL
es of 11(7/?R

'.Pn_U27f 305853 305227 c:RRL:fFODLIKNAV1KIISFRKSPPNPVItLLIKFAKKGLFlJSSIAPLYEVLLEILF31PG

rdk-Thylsiriyluca Kinase EEILEVLFSLDPNwWtBNLDPKKHSTtGIEIS~aETAETIESCSIGLISINLLLSGLCLRS

.'.aVFt'/IDOCEG:xK:SLAKALGDOLVAQDRIfVLLTREPrY:CLICERLRDLILEPPHLE.~>NDRrQAVKIIQO
Fti'QFS.'EEVQNFVEQRNILTPFWIHLFECDEVALLWQW:LRLOLIV

L~F.CCELFLFIG~RIIpHIQEVIIPALROCYIVICERFHD?fI'IIQCIAEGLGALIFIIADLCPNALYPEPDC:;CW
<,kaNSEItAKt711E00QEDFlIKTKFA(:Y.Ef:LKKLVLPAL~ITSIPQLL

.:Y.VVI:PTFFLPNFVLLLDIPADIGi.QRKHRQKVFDKFEKKPLYNNRIRFf..Ft.iLASADPRARRFf!Q!'w\E
IW~L\IMtKKNKQNPFIFLEALLE::EEF::t.'.'X:KYWIW~1I'IIIL.WOKLWIA

at'(LVLGAPE:U1::L IDK1INLNt'OIw.LCTI Y U:YF71V L ICps :r I ETF~'RRIWIJdPEAFQM
IQOr:f L4:FLFfKNLLD

:an:R;7A 7UN1FR 305952 t.'Pn t72i,4 IInU'i1 IIHSSI

IyrA fAIA ::yr..::.. ::utfunlC N., rr.t.:::a (HNNILW I'r'::unr n :n ::.,nvGW k/F:YGL .u: ..1 ll/'1/rR

:::I'11*tTIKDEIIVFKNLEEFJIKE:u'YLRY:.M.iVII:PALPDIRIX:I.Kf::QRRVLYANKQLfLIMFIIf A('\WfV114?ft1'NNf::::'(r:IL:LK::::LitfIT'I:.IWIUUA'fLdI::VLYFnt:II::

:I~:IV:AI!IIRXti\YIY:DT.'.:I;DYHF1K:E..'VIYPTLVPNAQF1WANRYPLVDCxXINFCSIDIiDV:."
PI'/f(:MLIfL::Vu'::1'la't:lYl~F"(:yJ':::IFKTFVF::IT.':I::VFI'::I1EIU.N(d.L:REF~
::

I'fnANR'ffEAkL'rIC:nNYLMEDLDKDTVDIVPNYDETKHEPVVFPSKFPNLLf:NGSSf:IAV::AfOELI.KNF
PAf71'fItRPItHI.fI::fIFLOFJJ.ftfiIRf:FEED~H'f::Y.lt.

'r:NA'ft7I1141NU:KLIEA'fLLLWNPQASVDEIWVNNIPDFt~IY:CIf~.0 Lpl ICCCEGIRSAYTT

:Rr:KIK'/PAkWIVRFNEDK11R&:IIiTF?1PYNVNK::PLIEQIANLVNRKTIJICI3I>1IROEW rsniH'.
'..m.ul slm'.I

::lrYff:IfNLEIKKr:Fw::EIIINRLYKPTDVQV'fPY:AtMWLINtNLPRTN32HRMI:iAWINm rmtmrr lu~nul.vl fa..,:~m in ym;t.mk/Falf'1. .n:: .n Il/'I/:N

ItIIHYEVIPPf!TRYF.tI7KAETRAHVLECYLKAL:xf.DALVXTfPE~J.TJKEH/ULERIIESFCK4LFT7LFF' P'fJUJK6fT1::111?LIYIvtY,::F::I::f~ITIV:LIAI::V1.1.1.la:VVF'ALVt:'IiVI.

MPIGLL'JW~AASVCS741AIVStJICLYKOGKPVeAttPOependwnr Prac~ W n~xy Reaul.yrL'T\' SNEEKIDPT%DLEIKOPESLKPV .:nIL%1.~.1:
tRNFIIQJLIDMFLLKK'.'IITVSLDNDL:.L'::ADKt~::FKI'C~tVF::::.GIriFSfYI:

tNt~QSLPKERKTI3lItAIfIPSIVWDntPYVIQSrFYHGNKVYSKPIAEpIpSLEIfEITVECYITI3KCKL4PLNR
.IIR~iDCFf'!EKP~iYJn4l::w\~flplW~li~Yy~firf~t EEYA

TLtYDFPRALEE.~SaKS".rGSLLRCVI3EIKNLFLPRFL.iRKVKYSL:ACLRRLGS1ViFLELYAItt~IIIFIIE
W .....: , s SSA.LILLLTKPEPIJ~tM'OOLIJWLNSLKTEKIUILTPIMDKLVISINFNFY41~ISLiE.

tEKIVAYDPM.LTDELIWiLFJICittIVOFLLS!'OSSO10REFRALFP~OELPSAKDUSrf 777866 ?37627 YVPAINSSEYNIfDPKDI-SVLIIl4,"LSERLU'CEKIPSPSSNttPTSSVASHYImFSLL!'fFF'CPIt~0295 'yl ~'~rmer Protein ecDP-At SNppSVILONPFLLtELWENPKGOTFGKCLLEKANPNSNWAAL.FKPNLNCNtISCIJ1NK.
ANSLEDDVtAI~JEvL.."'VOPKEVNFJISSFIEDLNAD~LCLTE:.IM:LEEKFAFEISE~J1 KELICITAEHW. PFKETTpJIIASCKILDLLLONLPDFy.. r7..-..,. . ...y., rrT.~.
. ..,.

... .~,~,~~ .~: t~ , . :". .~; 3a~ :
O mi:'W W':--mgtE-Ng Tcansporcer tCBS Lk7lCiihl CPn_U
AFTCLSIDZHSH av CT296 hypocMCrea: Drocein SCRFSKGKINVGEpNIUtECKLOiAFSSGItJ~SRTSNt~DELSFKLFXKIPIIGIfICNDITLVCNRYIVTCGSROIG
Il..IVKLFLFIJfiADVEINCWEtR00AVIL5I.
DLSICIVIEYNPI~J1YAVSCLPSESRAILYIU~ILSCITAKVAFIINTDSASRWAIFRRLSD

SEVCALIE(xiPPDPJ1VWVLDDZPDRAYRRILELIDSIOtALICZR~.010K'rRNI'kRLNTNETOLOGCVSFJ1RV
CVSNNOCVKDCVOKFLDIUWKIDILVNNACITRpPLLIaIIdC~QSV
V
V
"
I
' FFAf(XETrVKDV$11CIRSNPGIDLTRLVFVLDFKCELOIW'IDRSLIINPPO(SL10QIHASI
AKIwSA
.7pl~lYAAARAGIIAFTRStJ
SSVIRlOIIKARSCSIIN
ISTeR.TSLYYK
' ' ' NpIOOLVLPDIATRECWDLYERYKIJ1J1LPWDCaIFLIGAM'Y~Z~ZJ1DE1'IARSVINONLKAt3~ILKSIPIirR
AGTPm V71RVALfLiIfOL
tt7tfl ItEIfAAPNIRVNCLAPCFIE

tiACITfDUCYQTClIWOR!'LhRAPWLLYILFA~ChZSASYNAYFOKISPAtl.ALIIFFIFLSSYIft110TL.WD~
:LTY

INGItBCItN4'VpCSTILVRBNATCCLSFGRPRETIFK~1SIGLLICVSII&IIICGLWYIJIr FLCiZiIISGOGIOLGVTIIAI'CVIG71SL?ATl'LCYLSPFFFAKI.L1IDPALA90PIYI71t1fabD-Halortyl Acyl Grrier Transeyclase IMSltIIFFLIACCINFLPFN
SHSIt~Nt?OUtRYAELFP00GS0YVCHUpDLYNEYPEVRELFDPANN<<RIRLCFSLTSZI~E

GPEaiJ~IETVHSOLAIYLNSNAWKVL50RSSI0PSLVSGLSLOLYTIIi.VIIS~IISVLDG
CPn _ LELVR>~OLl9~tEACNpSPGAHAALLGLPSEVIEENITSLGOCIWIAHYNAPItOLWAGI
No iooust homolog Dtesenc in Gatebank/E!!BL
as of 11/7/98 RACIIIRSPLPFISSKFAtidQCLODEFSCPEDWDFLFSEIELLA90DEPSt~YiJILSRSAEkYDOAIELFRMACKRA
VRt.IIVSGATMTPLNQY1100GLAPDIYJIt.QIIU7SSLPWSI1V

LLMIlI'tIMiPKVIfKRVIFYGVSYCLIWESMSIFIDVLTYIDFLFEKLGISASDRi.SLCSARVGKSLVNTE~IECL
APC~SPTtWYQSCYHIESEVDEFLELCPGKVLAGINRSIUISItP

TCINFILYSpIGD9ffLSEWDNFRLIE0LL1U01P0L1UaRi~fOIFRZGAIWEEVSLVASITSIGTF710IEKFLSEV

ASVYpAVCRSFIELYtIIWLEISDLAL'GIOtCtJIt.ALDLSPttIAItIHADYAIOGLVYIGTRQG

KSLLIERGIIENFSIfAZFiSFSRDGTrTL11Y0NYRYliYALA'9VKLFDLTY10CEIIFOQANIIfabN-OxoacYl Carrier Protein Synchase IIZ

YQTVpAFPNLSCttMVWGELLIRSGWWSNMCYIEVOLEKLASI4KKTNDPIA4SCLI'ATYTSFFLYIMtfSVNIO'pI
KMIWATOSYLPEKVLSN71DLF1Q4VDTSDEWIVTRZCZK>CRRIA
f'KDSRHRLISANRTFPGtISJILVHAI~IVQLCSJILYIttEDSHPASAI
tU'M

.
GPQEYTSZJ1GAIAAIEINiAGLSEDOIDCIIFSTAAPDYIFPSSGAirIQIINLGItOVPT
GIAILCLYL ' SCFQSCLESiDLDA~~WU.FDAYFSwCIIUUtSNtLd.RIfAVDVASRLCSiJtPPJIILFWSD' ' RGLALKCLAFJ1TIDGAYKEIFLSLSLLJnORANDLSGRLEILELWGOSHYLLAQ-0OSLPGDOQARCV
WIIFI
C11G!
FDOOAAC11CYLYCLSVARAY~S~fONLLIAADIUSSFVDK1 IUESRPUSLEINRLSLGADUIdUELiSLPJYa'sSRCPA~SKLTipSCIUI1IAMEGRtYFRHA

HYDEAYTLLTKVDLTLSSSRVKLILAAVLLG1(GRLL.pIri'DFAEFJWEZLCFLY6YYLEDEVRRI~TAARHSIALA
GIOEEDIDWFVPtIQANERIIOALaKRFEIDESRV!'KSVNKY17N!'A

TSLGCPEAYYTIGKFYAVIImNNIUWG ASSVGZALOFl.VHTESIHI
DDYLLLVAFt~IGULSWG11WL1(QV
<N

HVIRSAQYGVRITEAiIWWDPYLJ1NLREIHAFRL1NFN010GRL1iiGNKTm90 0288 325785 724571 CPn_0299 716726 777115 CPn _ recR-Recaabination Psoteih CT288 hypoducieal Drotsin tt100<Z.VYYSESLY$MM.UPRPECIUiICIHITNTRYPDYLSiILIFFLR>Q.PGIGFKTAAC.A
ISITIREFLFFCFECRAKFYNVIMSCFNLTSTHFSLRPISPKASFPIODaiOSYlRSALRK

HRSOTLSVSYCKVNKYDANLFVRLTVIALAVVGVLILFSItd.ASIQGTLVZTSWPLVTAAFELISWDSEOLKILLi'I
APHIIVASEpSttCPLCFTLKESKEADCHFCItEptlfipSLCIVASP

ILIPIZLLTOGMfILfRhGEXVDVISGVCZPPFSRAGWVPISSSIftLDCFDEKIiIfSACSYKDVFFLQISKVFKGRY

LDISTL.SAOUSCZJU1VYQCPPLLFR11FPCFGIPCANPFVALLPNIYNLZRFLWPPYIIFGDATALIIJtOEL~!'S
VNISRL1IGLPIGLSFDYVDS4TLARAf9GRH8Y

RNIYEHFFCIUD:P>~DRFIYItDVARtcIGRSLJIAFLtIAPFYJI6aC'IIQJIFYSLLDPLiICRV

tiIGSVERDtitIOl~iVZLARS1ISLAtIF~WSLFRFEOGOGR10GIGQHAFYLHLCCpPOSVfLFD

KGEIVSGAttPSIOLPERRCLDTSCRYPHZSVIPDS~iD&AIUIFIV

CP(>_0289 725797 726996 C-1'2A9 Hypothetical Drot:ein NFtdltl'BfKpRSHYKKNNLLLLLSILVGLGLGSVOSPNIVYSAECIANl'FLKFI~Li.SIPL
VFCA1GSTITSIOFtFNflNTLGIUtILYYTLLTTVI11J1SIGLLLFFLLRPONI'1'~ALAT'1' TKCNPLCYLtNLSDTLPtNIFRPFipGNVZSAACL.IwVt.LCSASLFI.Q~~FtfIS
rFSIFUa.~ocLxLLFIAfa,crsYItFxFS.IIDOSNtTIaAaxtscvloun.AOCFIYLP
ILLKINKVSPLKVJ11UNSPALVTAFFSKSSA11TLPLTMELAtODLKIN%NLSRFSFPLCS
VINtIIGCMFILITVLFVATSNI?IIISPIJISI4WIFIATLIIAItRIAGVPlIGCYFLTLSLL
T5181VPLSILGLILPFYTVIIMZlTSLHVWSDCCWSLAN

trtTAEUVPVSERFFt.CCETIVRCIfKSFIICPKYSATFPQOGLSSLLISEEIpYILIBpPl1 CPn _ ISAFYiLDSGFVCLOEYItISLKDLRSSAGFCLRFDVL~OJMfPVHtGFGWPIItPT~ILI~K
Na-dapeldenc Transporter RSALTHNKKHASFSSRLCFIFSNIGIAVGAGtiIWRFPRVM~JOGCAFLILWICFLFLWSIDNSORltFALOQip IPLIIIEISIGKLTKKAPIGJVLIXTAGiUCFAWAGCFZTLVTTCILAYYSTIVCtIICLSYTY

YAVSGKIHIL~DFAXLWTSHYOSSIPLWAHLTSLGLAYLVIRKGIVtIGIEK(ZIKILIPAFCPlL0J01 710167 FLCTIaLLLRAYTLPCAVOGIKOLFSCLnCSCISNYKVWIEJILTp~111WDTf'JV~GLLLYYAfOapIhLike Oucsr Haebrane Proteihl GFASKK1'L1VSNCALTAIGNNLVSLINGIZIFSTCASLDItGI'rpLODDAGI1SSIGITPIIKtX.SKEIF11VFRI
IGFWYPFSIPIU.VQVIt9(%LLFS2FLLVLGS'fSAAHANI.CYVt~It~lC

YLPELFTRLPtxIYLTTLFSSIFFL11FSMJ1ALSSNISHLFLLSpTLAEFGIKPYISEfLALEESDLCKKETEELEAH
KCOFV1UIAEEZZEELTSIYNKt.pDEDYMESLSDSAStG.RIUCF

TI:AFVLGIPSALSLTFFSNpDIVwCVJILIVNGL.IFIYJU1LVYCFPKLKIfEVINAAPGDLEDLSCEYHJ1YOSOY
YOSIt~SNVIfRIQKLIQEVKIAAESVRSK8KLF31IWEGVGAIAP

AWIGFDYIIXYLLPIEGILLL.CiWYFYDCLFPFJ~>GQwWtIPISLYSLCSLVLQWSLCLIILCl'DKTTEZIAII1J
ESFItItON

wxFNKOLYLAFSRYNIiEIL
CPn_0J02 710766 311866 CPn lpxD-UDP Clueosamsne N-Aeylcransterase _ SKFI~FSNSFJtPVYTLKOLAELLQ~00NIETPISGVEDIS0A0PHNI11FGDNEKYSSF
ine8-Inclusion Hembtane Protein B

EKHMSAPIPTPQELSDOITCLNVipYCQYSELARENKCDIECLKTLTAALTADAGIOPSADLKNTKAGAIILSRSQAII
QHAHLKIOJFI.ITNFSPSLTFOKCZELFIEPVT90FPUIHPTAV

EIYSLQ'fJIAALILSASEKPCSCPSGSTECSVTVQSPC%FKKVIJ1WLT:IALIAIAVLIAIHPl7IRIElUVV'!'I
EPYWISpNJItIIGSOTYIGAGSVIGAHSVIGANCLINPKWIRERVL

CIIAACGGFPLLLSaLNLYTICACVSLPII11S'1'SVALICLLTFV1WSLIKPVITVRTfRtGNItWVpPUAYIGSCC
FGYITNAlCNNKPLKHLGYVIVCDDVEIGAM'1'IDRCRFI4PlV

LN>~1'KIDNOtfOVAHNVEICKHSIZVAOAGIACSTKICEHVI
IC1C01'UITCHISIADNVI

CPn_0292 329201 729836 MIA4TCVTKSITSPCIYCWPARPYpE,TIIRLIAKIRNLPKTEERLSKLtIOQVItDLSTPSL

ind-Inclusion Membrane Protean AEIPSEI
C

VKNfl07SDFM1'SPIPPQSSCDASFtJIEOPQQLPSTSESQLVTOLLTMMKHTGALSTVLQ

fJORDRLPTASIILOVCCAP't'OCACJ1PFOPGPADDHHNPIPPPWPApIETEITTIRSELOCPn_0307 )12982 tMRSTLCQSTKGAR10VLWTAILMTISLLAIIIIILAVLGFIGVL.PQVALt.NOCETNLICT303 tWpochaticel Drotein wANVSCSIICFIALIC'tt.CLILTNIUrI'PLPASREOKCLHHNDVSRKINRtITOFYVDSIDCVIKNFDHKPSEDIt sRDtiEELEEKLLTITKRIY

pSApEFQNRItTDSKNYYLKKTOWLPFKNEELEOTKELFANLTStIDIfKIAOLFFYSPOCSS

CPn DWVEFTEVICNLNOSICLGGVLIxCCLFE00CEHVVTVNKKLDLPLLLICTtVVNSLRYYL

_ TYRNISLLNCO~HSELOKELCDVLKQHCVAFTLIFKEIVDIDLLNYVKLIOGLKRSGNIO
CT271 hypochecicel protein VWSNpNVLRLLFNLHHGEEKRAFLFFLIGLVWCICCYCI'LSLAECLFIEKLCSAELPKIYARIYONDVP1'LPSVSSS
PIALRYSLAM'IRCLAt.NVOFSSLKFISPSIL~fENTAKALN

LCGSLILCVLSSLILYNLFKKHISATJ1LF'LIPVSLSILCNFYLtLSSIFAIOPPRSPLFF:,f',CECFIFSNLDEF
Nt~IIKIVtIpLLR'IICKLSPEIWKNIMKILNIKRRVRSLYI

YRiVIWSLTILSYTSFWGf'VDpFPNLODGKRHFCIFNAIIFLCDnIICSv'IIASLVN7IGI

OCILILFTAALVLTFPIVFriSKSLKSISDDHDLFIVK:HPPPLS%ALKLCFYDKYTfYL~Pn 0)04 1.11091 )AI15%1 G:F'tFLtpLLAIATEFNYLKIFEIOFASKEEFELVAHICKCSLWISIGtQICFJ1LFAYSRIpdtlA/t~pA-PYruvar r Wthydrory!n.m., AlDh.t VKRLGYtJNIILFAFLwFLSLFLFWTFK'l'l'LSIAVIJItNVREGV'I'YALDDNNLOLLIYCVPDQKPLPKRLF'Y
%KVMD.~.SAPYNIA::yaEK.~TVFRtLDLYCPA:x'.IKFLKONVGIREFEA

NKIRIIrJIRIWESFIEf'IC:NLVWGLICFL:iSpQWFCLtISLtATILVVLVR~fYAKAILRCEEAYLECLVCI7FY
1L:YAWEAVATMIAN'I'~:LDPWVF.~.;YRt~lfAtltLWIPLDCIM

KN4:ApALpLTRSNC~C~WIK:.K1VKQKRQVELFLLAfILKHPSERHt?TFAFQHLWWSRSVRLII:KE't''l'.At s:RtY:::NHtkic:PtIFHN:FtaVtIYJIPLAA(:AAFT(KY(~KNRV3LCFIC

LP::LLAIN9JKL.iLFN%LKTIF?NIf3SLWAKDFLTLELLKRWTSIFPHFAIIuAIHLYFAEIY7AVA:~:vF'IIt TWFV::L11QL1IatLtIFlJNdd::W?.~,LNRAVAKQIyIAE:.'~I:.~...1'DIRAV

IIDLIJIITIIIAEOLYDT~rt:DRLLMILTVRRUEAYf:PYRDLADKRLKELLNSC/~PEDIVNC'fVN:F'fO.F'N
:'Id.:FHhAYRYM/UfF-':IVt.Vtx.'Ia:::NFRC:It::l::Dl'tll.YN::%RYlIUCLfKIf LTLI.Y.LEKNPONFPiLLDFLNTKNEDtLIV'I't'.KAUIT:SVRANIIKt'YCf'ELLKRLROCSHNUI'tVI.AY
1MLIRLFIfI.1'EF.t:FqFIIRQIY:KTAV1.FJ1F::NAKL::::D1'::YITf.EFI:VYA

f>F.A:X)'f l.I.KT I:: I AL01::F'VKOLf~'I':NI.KNT::R%YAF:AM
A:t:LOKEVaFAFLOVLTDE

:rIiNRChILMML~KI!lNWLLKKIIAYKiVKC%A:;KALFY~YIk:IIYIQKK'ftTINL.:LW~:Pnyn'.
LL11A2 iA~.I :'I

tlflJ7:.T!'YfAEVNFIII::LItat.GSMEHSC1/LIRAL'1";:KNDKIKk~ALFSLEKtF:DSHLF3LtIhli/
rlNIt.lYmv.W OuhylnrpHay::., pr..

L!1F~/tKJt~iII:Y:.EKYYFKC(.1/IPLTLKELWlWI~7:P::::W%LTAQI?WCEEU'YCDFDFNKt"'...Mh YIIK'PIk:INE\LRF\ILRE7A::NGI'NV"IfI:EF:Jt:D'IIY:A1'KVT'Yt:LLIIKYXiPKRV

U::VFRI'fWQKHEDYR'PEE::L'fLl:a'L.1ItUAPI:a:MF'::i:k:tt:AAl.:a:l.l~t'I
IF:YM::YrtIY::F1A11Nl t::IIAAKNIItfflYt:KP::VPI

V F'MTffI:MA1?Y:a:(%I::Ik'VF_':L
lMt t tc A. t 1 1 AI':all'YMYr:I.LK::A
1 HNNNINLPl.EN

ilyn ~l'L'~4 1 f 11177 s 1702 .
F:I.t.YHLYt:EVI'1'EI?YLVPIy:KAIIftYfjFJ:NIU:fI
1'I"/::IMV::I'PKFUIy'::LAKKRWf:LaIEI

"'VRffC:iRCIVIEEC:NYFn oEiIALITEINF05LDAPPLJIVCK'CPI'CKRVIL:KIVKLLP1T'N
(EEGiDC:LIHI:IGISI~tVIDdIVDP.~.IYJNKC:OIVEAIV
LDLRTIKf'LUI~fIL
i . IKNLTNYC:AFVELLPGI6G
.n L1N
V HVNAE
(IEEKYPICL
tlpwOF
ICKDOCKISLGLKOTER
t-:

MP _ ~KETPIrPY:IKtLEOATLPNVNRILCfIEK _ _ _ _ _ KL

P'II ?~C181r~11AV~L~I
VI".1GV1ITK TATQATL.~GiLIIVSALSL1K

CPn_03ne KKVSLSVKEYf.IDNAYDODSKTE4DFK0~CPKERKKKCK
ptltlC-DiMdcol ~poare>,de Aeetyltcansferase .~.KFVI9LLKNPKLSPCNEV ,~T.VKAOIKK.iN00VSlGOVIVEISTDKAiLEIfI'ANEDf~VIR
A ~Pn 0316 359794 3e0121 EILRHECEKtVt4RPIAVL:.'TEANEPFNLEELLPKTEPSN<.F31SPKCSsLLVSPATTPOin A
P

;ATFTAVTFKPEPPL;::PLVFKlIIICT1'!I(ILaPLAROLAKEKNiLIVSSIOCSCPOatIVKroea A nusA-N Ueilixaevon r ~
_...
...
.r-~
r .
M

' ' ~
~
' , "
...,.;..,;-.r,r-i . I..- ;.:\.\Ff!.Iv'.Y.\
...,It,,-;~,m.v../'.' Y, .Y
a , : : :
HF .y.w:
I21 _ , ~ ~
~ v .
~ :\LF:fAA!
YT.
?.
.
.L
:
,v\
.

..
. ". N..F:YF..., ...1~"f;F;.
.... ;~Y.\REYI;i-W':l:r:YMDVPF'.':a:Nf'f7R:\>:i . !:C'4":f':
. w _:
r.,-r.
.,.:... ..~
.
rt, a.;
w ra:..L,..,t :r.r:::r~.f.
;~:-.. ' v:rl ~
r ~ri r L
AAfWttvNYwR141EitLVI'CLtsHtIxVNt:1.w.'YYKHF'NLwiNLILDLGK'lFrli:.PTRFiF
' , , ..
..~:.
.
.
.
.I.
:z:,w :
7lI:iAEIKSW.KaP7N:iLCDTEYK(M:F~JSNLCMTCIT
.rAIPDGIITPtIRCAURKNtl . GAEVIuRSNAEFVKOLFIaEVPELEECSVLIVAIA
EFTAIVNPPCAAILAVCSVTF.OIILVLOCLITICSICNLTL5VD11RVIDCYPAANFNKItLQKTEKHKIGDKIYALL
YE1NESENL.
' RffAGIIRTKiJWRSSD%ITDPVCAFVC~CSRVKNI
IRELNDEKIDIVNIfSPVSI
LLL~IL

KILLAPAVLIlX
LYPILIOKIAILLDDKVIAItNN001DYATYICIO)UiINARLISHILDYLLCVpRNitYtIIL

CPtL0307 31199A 316515 LEIOAi.CLAEFDSPNLOCPLEi4Df3IS1(LVICNGEHACY~i'IARVLLASJINphASVICISL

41QP-Glycogen PhosprorYlase ELAYKILLQVSKYCESXVDLICPLIED

NGCIVBflFBSFDKNKVSVDSH~AILDRLYLSWQSPLSA6PRDIFTAVIUCIVI~tLIUIG

wLKTQNGYytO'Ipv><RVYYLSML.,PttGRSLKSNLti~GILt#.yRKJILlf1'LNYDFONLVA(ECPr1_0317 St7AGLC~CiGRLAACYLDSNATU1VPAYCYCIRYDYGIFD~tIHrCYp~IPI>EWGAYGinfl-Initiation Faecor-2 ' ' NPWLICRGEYLYPVRFYCRVINY?DSRCKOVADLVD'1'OtViJIIIAYDIPIPCYGNQ1YNSL1 lOJLKLKIKNACLTK7iAGLDKLKQKLApAGiSFaKSSSLIIPS
SLLIASLSKSANMCIfVIQ.

RLW(MpSPRCFEF5YFN1K.TIYICAILDL11LILNISRVLYPNDSITLGOLLRLKOtYFiNSAKDfSVKVAi.i~ATS
TPTASAE0A5PLSTSRRIRAK)IRSSFSSSEEESSAIITPVDfSLPAP

ATTUDIIRRYTKTHICLONLADKVWOLNO'1'NPALGLAFJIItILVDRL&LPWDKAWFJIrI'VSIJ1DPEPLLEVVD
EVCDLSPEVIIPVAtVLpEQPVLPETPPCEKELLP1IP11ItPALIAIVV

VIFNYIFRITILPEALERWP4DLFSKLLPRNLLIIYLINSRirG.LKVCSAYPKIFD0101RSLSNIItSKfCPaGIOf INIR.LAKTPK11PAKC0NVAGSK$lxPVAS~CPGKPC'1'SLUGWIIitL

.IVEOGYipKRINIfANLAWGSAKVIGVSSFHSCLIKLriT.FKtfYEFIPGffIINTNGVZ'PRKQFNPANItSPASG

RWIALCIIPRLSKLIXETICDRYII.SLIRSFA~SCFRL11ROLGIrKLttIOC~LTSRIRVYILPKKtiYDGSIORPI
HIKISLPITVItDLAALl9CLKASEVIOKLFIIK?fi'1WNDILO

YNEYCEIVDPNSLTDCHIKRINEYKROLM1ILRVIYVYNOLKLNPNOtHVp'ZVIFBGKASETAVpFICLSFCCTIDID
YSEpDIfLCLSNDTVRDEIpSTDPSKLVIRSPIVAfIOINdI

lGAI'i'QIDICAFCCSTPVGDITILDTPCIIFJ1F811lIMAOAM

GrtEASGIGt~IIKFAIlrGALTIC:'I~DANILNALNIGKPi~tf'IFCLLmOIVOLRREIfCPOTDIWLWIIGDnGI
KmlLFaIENAKAADIAIWAINKCDKpNFNSETIYRQLItLIfi.PC
' ICDKNPKIROVLDLLEQCFFNSNDKDLFKpIVNRLLNFIiDPFFVIJ1DLESYIlUIlILNVNK.LSFLL~Q.ALW1EV
LLLKADPSARARaLVILSfiJO~LOPVA
AHCCS'CVTVNTSJIIITC1L

LPKEPDSWfKISIYNfAfiKsFFSSDRAIQDYARDIWNVPTKSCSGmIITVLIONGSLKLCGLVFIiDGYGKVKTNHNE
IIrIaJICLAOPSIPVLI1GGSDIPAI~DPFF

W104BKTARDI IIJUtSAGOQRFAiAQIGDiPNF0.RM.ONKkTLKLLIKADVOCSI6AWtS

ISKIA30tYDVEILTNSVCEISESDIRLAAASKJIVLIGFH1ICI~lIALPLIItiiaVAV6L
CPn _ FTVIYHAIDAILLIMfSLLDPIAEBItDDGSJ1EIKLIPRSSQVCSiYCCIV'1'DIfANNK
No robust holeolog Present in Genebenk/H~I.
a at 11/7/98 FFlbHffe'(ATVAQTPQTTOPOPSVSHKATHRYCSWVFPICPILVSrr_r.-.rVRVLPNKLILN!(GTLSSLKRVKEDVKEVR10GLLLS7ILLEGYppACIGWI4CY8VIYNPQ
rer.~,LVIA

sGVrrLSICxGTVLAIQIVLaGTaLVLAFNHIROFKOARTALLNSIOOtuAPAAATVOKCKKL

LEr7RrssK
CPtL0318 36270 363176 0309 350977 319595 rbfA-Ribosome ,iA4iaQ Faecor A
CPn _ VIISYNVIRa.SIItOOrIYId.IfYQFI'EiRAIKRVNJILLpEAI111NILImVKIIpKISNNITRt CT309 hypocMtical parocein FNRAWEEFLLLpEKEIGTNTYOKWLRSLKVi.CFDACNLYI,FaQI~FQITIiFELIIIRIDIVKRV$L$ImLNSARVY
VSV11PNENTICEEALBaLINSAGFZJWRASKNWLKYIPGJIFYLDD

SGLVtiTl1ICPIAVNVTSVDKAAPFYAEIO(xlppLKTAYITIIfYCiSVNPQlI'FSNFLVTPLIJIFSPOOYI)~L
IJIOIO

DLPFRVLQ6F17CSPDLt~K~YrFNPIYLtGF~BGIITI~SAISVLRP80CKILY1BSDI.

FTEIG.VSAIRSfiElJC7~RSFYRNiOALFILDIEV!'SCKSATCtArIIRFNSIJIS>rlr~.IVCP1L0319 VSSSYAPVDLVAVtDRLISRFLNCVAIPINPLVp)Z~.RSFUtNQVCRLSIRIC~l'ALOFLcxul-tRNA
PseOdouridine Synthase IYAfS
LLYF.DWRTIdJLDPLEAtIGtVALTPLKItRTIllr~fINI'IKtxllflaLAV~.KIxILLVDKPpCRTSFSLIRAL
TKLIGVIOtIalilOnDP

NVAQY7fCVS0E5ILCR5pSRLYVLPRCVANYFCRQKLBLSYVIIICDVFBRDIISTVISSIRFATBVILVItt.IGRK

LIDpKIE<ISNDIHMAIODISKNLNSUDLSLLFFPSLLItIILSAACYFOCLIQQLPPNFSAKKIrQCKIILYEYARKG

PV11SCSKCrYIRSIAHPZC'1'IGGCCAYLEpLRItLRSGRFSIDLCIL7CNLLCIIPLII'DIfPY

CPt~0310 353173 351019 uL

60IN-60kDa Inner namosane Prxein AKISL CPII
TLA

Q _ O rib!'-FAD Synthase YFOLLSLIFRVY014~IKIlTLi.FVSLIGIAF1ICC0IFFGYDIEFRSCKNLAC
A AVAVCDILLFLLtMGEAAQSVIfSSGLSNSFVONKOC

FDNINLI1LYRCOGSSFNP1'N'fCKVFLpTNIICCLPVLtNEFRHN1C6PLVFLCLYAGCRISN7TPISIFLPTY~IP
NLIAYSLTSSPSVDSV1VCFFDCCHLt'JiSNGLSILTSYBCSiCIIIT

KDSTIFGT11LVP9iRSGSDYIPIGLYDSRLE1C.VSLDLPITMVIT~00DSAKSSDTANHFDENPOTVLSZ11<17~I
NtIQERLOLIATFPII7Wi.CYLTfT%MANpSAE6lLTLLIOINL

YVLlNDYNpINSL85CSILCINLPP11S'1'NMfSIVNEIGFDROLILSflISPEaIit'FGLSSKKCIOtLIIGYDSC
ICK00pSM'LALDTIGKPLGIIYILIPPYAlIDNIW88LAIRp!(iAG

LPL7COQAIDISIGCYYPLLRRGLLSDSKKLLPLEYIIALTNV~RELATRIALRYRVLSYTPNLLCIWAFLCIIPYAIS
CKIT~SGIQGSLGFATINLPREFSLIPL~VYAC6IAYCITI'Cp HSIpLESLDRSVCKVY1C.PLNPLLICpYVFEI'AITLTKE'1'EDVNVtSGVPLVLINSNA811PGVlIILCTAP?FG
RESLYAE11LIIFSPAENLYCKEtISIIPRKFLREEKKFQSKCILIMIIIL

TIKYRVI>nO~GGSLOKVKt.PIfVItEPLAIRROVYPOWILNSNGYFGIILTPLSLIASCYCSDILDApDNPAKGSFN
Y~TA

LYISGSTAPTRLSAISPKNOLYPVSKYPCYESLLPLPKI9A61'NRFLVYACPLAFPTLKVL

I7KTITCEKCP~IPLYLDSISFPGVFAFITAPFAJ1LLFIINKIFIQ.VI~1CISIILLTVFLCPr1..03Z1 36f900 361767 KLLLYPLNAWSIRSIItPNpILSPYIQpICCKYIOrEPKRAONEIMGLYKTNKVNPITOCLPYahr'wGTP
Binding Protein LLIQLPFLIAIffttt~fSSFLLAGILRFIP'G<'tIDNLTAPI1VLFSWpI'SINFICNLFNLLPILYSK1QNIIFIF
RCLNSNTLOGIVGLPNVCKSCLFNAL'SCAQVASCNYPFCTIDPINOIVP

IGIVlffL001NTSLNKKGPVTOQpXCQOVrCFiIIIJIILtTANFYNFPSGLNIYNLSSNILGVIL7ERLEALJ1KIS
NSQKIIYADbCFVDIAGLVKCASL>CACIGNRFLSNIACTIIAIANVVR

W'OG4IITN1(ILDSKHLKNEWIIdnCKHR
CFIL7PDVTHVSGKVNPVF~IEVINLELIFSDFSSAKNIHSKLFJtLAKGKAL~C~LLPIlD

TIIANLEKGLPLATLELTPLOIVALKPYPFLTNKPNFYIANVDLSSLPOI~KIYVMVRL

CPn_0311 351153 353575 vAAIt~ISKWpICVRIELLIVSLPIELRLEFLHStGLEKSGLHRLVIWIYDTt~OLISYiT

CT711 hypothetical protein TGPCLSRAWNVACSSAWEAAGEIHTDIQKCFIRAEVITFFit'IIECpGRAAAREtaKLHI

OFlMIHAVIYWDRSKiVWSFEPWSLNLTWYGVFFTVCIFLJICISMYL71LSYYCLtIDHLSE:CRDYTVCDGLIfIC.
FLNN

FSKSpLRVALFtiFFIYSZLFIVPG7UtLAYVIFYGWSPYi.QNPLLTIOIWfiOGLSSt~OVL

GFLWAAIFSWIYKKKISKLTFLFLT~CGSVFGIAAFFIRLCNFIiNOEIVCTP'fSLPNGCPn_0312 366231 wFSDPMpGVQGVPVIiPVOLYECISYtNVSCILYFLSYKRYIRLCKCYV'ISIACISVAFI' YscU-YOpS
Translocacion Protein U

RFFAEYVIfSHQGIM.AEDCLLTIGQILSIPLFLlG1111LL.IICSLKARRHRSHIs'NI~i4SMGEICfEIUITPKR
LRDARJ<IOCOVAKSODFPSAVTFNSMF'YAFSLSTFPPKIIIGC

FLVSM1.SQAPTRHDPVTTLFYWO~CtJ4<.ILTASLpLIGAVAWCVIVCFLIVCPTFfTN

CPr>_0312 351518 351976 FKPDIKKFNPIF?llltpKFKIKTLIELIKSILKIFGAALILYITIJfiINSLIILTa01IS1I

CT101 hypothetical protein ITACIPKEIFYKAVTSICIFFLIVAILDLVYCRHNFAKEL1WEKPEVKQEFKDIIOtIPLI

CTNARNIKYFLIIFPGILWISACNOILLLKATAIALDPLSSFFTYCLLSMVS1rGLILSLIGIRKCRRRQIJ1G~EIAY

'CLLSKTIRKGL:LSSEFFSpKITWIAYIKOTFISRRFLIININIAFSLVLRRYLSNPOALILDEAEKYGIPIMRNVPL
AHOLLDECKELKFIPESTYFJ1IGEILLYITSWIICNPNNKM' FVIRATVG1(ALIKTAIAYFSKLQNAIXENpEGtiNOPDHI.

CPn_0113 354957 355355 CPn_0323 36731? 369160 acpS-ACyI-carrier Protein SynthaselcrD- Lov Calcium Response D

wKILKEISANSNEIIHIGTDIIEISRIREAIATt~NRLWRIFTF~1ECKYCLEKTDPIPS'SFIMNKLWFVSRTtGf~T
TAWMINKSSDLIWt.wl9~CtMIIIIIPLPPPLVDIJ1ITINL

FACRFACKEAVAKALC'IGICSWAWKDIEVFINStIGPEVLLPSHVYAKICISKVILSISHSISVPLWVALYIPSALOL
SVFPSLLLITTNFRLCINISSSROILLKAYACNVIpAPCDP

CKEYATATAIALA WOGtiYWCFI IFLI ITI
IQFIWTKCAERVAEVAARFRLDIWpGKQNAIDADtJIA~IID

ATCARDKMGLCKESELYCANDPAIIKFTKCDVIACIVISLLNI4CCLTIGV711iKlmIJIO

~:Pn_O31A 156185 355353 AAHVYTLL.SICOGLVSGtPSLLIALTACiIV'1'l'RVSSDKtdINLCKEISTOLVKLPMLIi.It erxl-Thiors~xin Reduetase CAATUiVCPFKGFPLWSPSitJILIFVALCILLLTItKSJIAGKK00GSCJ~tiTNCAAGODM

MINSRLIIIC.SfiP~Y1'MIYASRALLHpLLFECFF:l~I'WOLMI'l1'VENF'PGFPECI'IIfC'.DNPDDYSLT
LPVILEICKDL.iKLI011KTK'.,~CSFVDONIPKNROALYCDICIRYICI

IGPKt)ltifMKEQAVRFC'I'KTLApOtiSVDFSVRPFILKSKEETYSCDACIIATGASJIKttLHVR'I'~>PSLEC
YDYMLLWE1IPYVRCKIPPHHVLTNEVEON4SRYNLPPI'l'YKNAACLPS

BI('CN:rK)EFHOKCVTACAVCI1CA.;pIFKNKDLYVItY7CD.iJILEEALYLTRYC:.~.M/YWIi\WV.~.EDA
KAILEKAAIKYWI'FLE'IIILNL.:YFFIIK.~.SC~EFI!',,IpEVRSNIEFNLRSFPOL

RRDKLRA:IKAMEAPApFAdEKITFLWNGEIVIIISC:OIVR.:VDIKNI/~YfCEITTREAACVFVKE1II'RLIFLQ
KLTEIFKRLVDE9ISIKDt.RTILE::I-.EWJWTEKL1NLLTC(VR3SLKL

r'AiCgIKI?fl'DPIdxDLTLDE;CYLVTEKC;TSIIT~'VFr:VPAACOV~WKYYR0111'1':.ACSCCYI::FI!
F.','QI:Q:~1I3V1'LLDI'EIEENIRC;AIKrT::N::S'llJlfpPD!:VFILILKrhRNI'ITP1' f MLOARRFI.I: PAtTiqFPVLLTA IDVRRWRKLtETEFFDIAV
t:: CpEIL.PEIR IOPLGRiQIF

'Iwy f I'. t5t:977 15H71r: n't'ti rl Sa4 t~tlbfl f'llh:ff!

r::l :a ItiLrANtt.li Prutuin ~"r1:11 hypocMrriurl protein MI*VAF.Y15J~:::KKIIiXJIEC.'LTEDVAEFKDLL1'TNIPIT.~.::EEE:IWEIJFC:ALLII4TWYWNIRRI
eIAA.~r(XTl'GCIL'A.'l'Or:/tILMVhAIJ~AKAOME'NA.~.GEI'(EFNNIOp':.O~.T

nIHKUF'VWIIw:LK::pt,;VIFM::EPtDS.~.EI:LVLuAEJIWLOpAEDEF.:KYIL.:REKATRNI'AMTRTKK
KEEKF~~fLE::RKY.~FJ1C:KAF:YY:a_:fEEYt'f11'DLADKYAa(Ii!;EIv~

,7ItS7Ylh:! I
LAIN'F:h72:IYKt.'(IITAKVKCI:LIVDG7FIRAFLPt::.UIf7FIKKItta.YlaAIt:UI)/I:PEDILAL1 /VEYIKGIAlrr::
IKNLLDYVGKVC: fJ Id7YLVrT'I'PF:7Ir:KLKFr\LiGARNI'rIT

r:F'KILKtrNFRRNtW::RNELLEAPRL~.KKAELIFlrL~.It:LYRKvri'VKNITDFCiVFLDLDFktFr:M'AI
tJIKNILFA:~EYAiYrIlN:a".:ra.f::LYLEVTn7I1P1ITI:ppLL:TILOpRYTYC30 r:IfX:UJII'fI>lrl5dKRIItIIP::PlIVEINOELFNIIL::/CAIRKC:RV\L.:(..I:v'KFJIrIhYIEOt EKMAIV~::FItIKQIATELKItG:Fr/F::Ny,7Wtrrl-rf.rnrrAYL'r::YnyFF.;RVFILLD..rLK

AtYIIr~'fCa)IlIFVKVAE.TIHKIIt)DKFPTJL:KV.CPn 013' % I Ja1575 dNLIC00V0~'f:VWLPFSuR snpB- Snwll ProcW n t QTSSRLF:~ADKROOt!)IWIANALDAVNTNNCDYPKASDFPKPYPw3IEEIFPQIOL'*RLLILUL.RRKtICFLLYW
Fi3PIllclidi KEt ~A

CPrt03:5 J7tJb9A :71119 PlI~'1.'~Y~hI
EAGLVLTCCEIKBLRdIIaQB;IGDJIYILl~tI~EC8iI3180 WRYFLRKLExKI110KQf:LIPIGNF4SRGYVKVRL.xCRGKKAYDKRRTIIERFJIEREV

CT3:5 hypochectcal ororwn MAIIfRRNN
KRLAIpNQYECLLE.iLAPLLNiT'.J1PDYJJNSCLIRFSDTfIVPWIEI9l'~NSGOLAVSTL:.

?LP04VFRER: FKAALO"VNC.:FQSS IK.' ILGYCEYTOQLYLSDIL~fYIl4GEKLFEYL 033P 393373 387375 ' CPn ' ~

fVA _ :NLPDLNVLfa% 11U ...~ty.ry..-. - vil~
LRT
KLFSLIAKIWMFw . ~ ;~.tl' . ... . . rrINKF:-:.:aNr:;. ;r ~.:~v: ...:., ., ' ,, r.~~T :..,-.,.:.,;L...;h; ,.::.vvrv-.ct.r:~::
Z:Y
' ' ' f.E:::EANLtt~.:.:.v.:a;l4>.;i eHLL:iIIEIImfPll:.
LX:.i.~:
AK1.YEK::ai,;IP::KRFtL::/

real0-Glucanocranaarase PDION71LRFSLPAEOLKTi4f~RTSFAVSREESRYYL'1CYLLIIAMiVATII~TJX'nKRIaK
7iLLRRVNVIJfY'tIfHSP5J1NAWNLICfSPKfKiIYLPLFSIHTKNSC~uI~vEFLDLIP' PSClL

r RLLBG
LI~OIfOCFSVIOLLPLND1'CtL?fSPYNSISSVAWPLPLSLSSLRtID?IPNAIDQ.QIDAMLDKSFSGEYIIPIIL
1VEEIIKHCSDEGBr1IIFLL)pOKIAVOCt>MLLIl ' CSTPSVSYIbVKDIKYIAFLREYY01ICCKSSLO(ZISNFSEFLESERYYILYPYCfFRNF$SH$VRFSTLpC$L?LTJ
BiCIRV
OII~ PVIST6SNVKLDL)IREF1.ITLLKOVALII
CPfPDFS

. _ AIKMIUiCEPIHNWPKSLTDOENFPDLZKKEHDEVGiFSYL~OFLC1f00LCEYlIAYAO(RIH~~F~EIAFNPfFFLD
ILKHSKDCLVSIGISDSYNiGIITDSI196WI

VLLi(~LPILISKDSCDVWYfROYFSSSRSVCAPPDLYNS1DLPIYNPSOLJ1I~DYhP~LNt~

LYNBlF3tLRYAONFYSV1IRLOHI IGFFW.YIIitDS>iCRCRFIPONPKDIfIKOOTt;ILS
t~.G 193105 384034 ASSIQ.PICEDLCIIPOWKTTLTNLOIC~'L'RIPRWLRNNG4DSAFIPLXDrNPLSV1S2.SCPn_0339 StIFS7ISIlRINLFI~Y LT339 hypocMcical prauin KTLT'L'ITOIDILIC
PKEAK
fAKFIJtLPt . _ _ O T
Q
THDSDTFAQwWGNS
LAICPDLVSKNLORERINfPCI'ISKKNNSYRVIIPSLEEWIRKKfNGIfIF)IILTGL

~~~~P ~

OlZ7 371937 373311 KaRIl.I5G71PADRALELNLL~OCDNNYTI"~LSYYNRAt~ANU'I'KSKQTStVASml4S
CPn _ ~P~~~~'r' ' r138-L38 Ribosomal Ptocein RIHRKNNSRKGPLTZiKRPRRCYSYT1.RGIAXX100GIGLKVZGKTKRRFFP4llL'HUtLWST
CPf~0310 383843 381156 E~iAFLKLKiSASALRHIDKLGLE%YLERAKSIO~tF1lrarltehitt wieh 0339) PLYPLLIVLSSRSSACICCSLKKOAM1JAGLWDEOLVKHGTYLSIQRfLCSOKLSDLS~L.
CPn _ wSNIGJCEOLALKFKSSLIItNSDISCtAVAEEFHKOLSISLPRDLE
cT085 hypocheeical protein LIfYRCIFNSPLRRNISLFRSQKpLIOVFAPVSPNLEU1EIHRRVILDpCPJILLF10JVIGS

SFPVL,tNLPO'fRNRV00LFSpAPD6ILIJ1RV11NLISSTPKLSSZiIXSRDLLIQtIgSi.CLKItCPI).0311 ARFpRIpFVSNSSVM.ImiLPLLTSWP(FLTLPLVYTGRPTLTTPNI.~IYRWRFNOtfraas-shift with 0310) S1fFL9CNPFLTLSAIAPLPWVSLLLfIITFL
CSZ'SK3PRRtDPLLTNNONPV50FSSG.OKHSLL71ILRL7IflCLYLKpSHNVSPLVCLODI

NI?>DLtlFOI01030(3'OILYC71Ep1~l4LWHAGI~iCRWOLLDPAPTLCOTLITSTHNNGFi.PKTSLYLSIFN7 OG7110.LYKKTHDHPHPLLYDAEFILVCFSPAOKRRPbiGPFODNIGYIISLONDFPCIIIQDC

IYNRImAIYPATVVG1IP'YOIaFYICtiKt.OLYLSPIFPLV1IPGVRRL1ISYDESGFRALTM
CPfL0313 381619 385067 VVK)CRYfdItESLTTALRILGDGpLSLTKFiJIV'tDOEVPGDRTSYVLETILOiLOPORDLIIDtbit:ced ONP
(leader 119) pvptidel fSETIINDTLDYTCPSW10GSKCIFNOIGKAIRLK.Pt~YpOCKIHCVODIAPfC~CLVL$

TSLEDRCIXSLLHNPDIJt541PLIILiIUiLRETIOSEK01WR1'!'1'RCAPAt~LII~JtSNFIBBfKFLLTILFL
)lVri~IPLFSETSVIQTLPSGIOGLICITSKOKEbVVCVIUFLRSYTSLKP

ATNRPNYHFPFVTDAtI9CPSYPKEYIYDPSTKOIfVSEANHAYFPNItEfFYIIARVLEKLtIYWEI4MYlatRKE'I
LEKHAFJILNRLLK1CI11ELKPGVPINfYI7tSI0f.IfIVR
VAWIPDCPEFJ110C>l7D.tStALLRTODW

CPn_0339 375085 376146 Phtapholipasa D Supertanily (leadesCPIf_0313 184999 385595 133) peptide) KNNKROK~CICVIISTLILVGIFAMPRGDTFICffLKSEt)IIIYSNpCIIEOIOtKILCDItratleshltt with 0313?1 AIF11ADEEIFLRIYNLSEPRIQOSLTROAO~RfVTIYY01D'KIPOILXOAbN9TLYE0PLPRRBQKRKAILI471PP
N71CSTLJI7tRYRCVKfVOFVft3GKIGRQLLTYCPTK~NGKLPS

PAGRKIl0I0KALSIOID0371WLGSANYTNLSLRLtadJLIL08lSSG.CDLIITNlSDWSISLDVLIiS~181IFLP
FRLPIfCi00KVCTIETKLD'fPNKAYVINTSN1YII'i~KSLYL

KDQ1GKYPYLPODIIIIIAIOAVLflCI0T11QKTI0VAlDbILTNSbIIQALRO~QIeGIINDI>B~'LICt~ILTPI
IGIVPEMLt~'tIMEDKQKNSRLJIPYPNODIYVINCfG8RPIlNLYGP

IIDRSIIbID:.TFKOLR0I14INImFVSIMAPClLt00CFAYIDNKTLIJIGSZtiiISKGRTSLNPIDMSLi7pKNS
INPEK~R

DESLIIL121LTKOONOKLPID:Ii10~l0iSDIPrV~>I~LLII>''~fSi.PVEGOGA

CPt>'0330 376930 376303 CT083 hypochstical Drouin FISII~Id.SiIOLFSVLPSRIpDLHVYRfKF'SLIQirOFlnTtf100EIWVLiICIKEF~LRA
RIfhPVAKRRIICIYLRIFRVLSRFDVIOtIIWDPYGALSJ~pSIA~6R1711SPLVF3CISE~I
ATNGIRLi(LLAIGDRDQ
00Bl~EIQRWKRSI)M'K1DDOCISLCM~OiIIMMYJ1WVIARNICGVLBTASTLFYOKD
t7J1 CPlL0331 37153 376701 CT083 hypocMCit:ai protein IOAIIN)IVSGf70GVpPSSDPCl011tPALOCD(?AECPSPLKLSIfSETKQASSMIWESLVR
' SGbICNYATESOINKAKYRKAODRSSTSPKSKLKf.TFSK)01ASVOGfl16GF05RASRVSAPiC
LVPPTFLLLIFPIPLTFODLFR!

KIIASOSGAGTSLLP'l'CIDAIALKKQRIISPDIOCFFLD118GNCCbSSDISOLSLGLKBSA

PSCiUtSLSLSSSESSSVJISFGSFOKAICPlISEDIVNAwfVARLOCtE?IVSSLLDPHVLT55CPI>.,0345 LVRRANATOIIEGMIDLSDLCQLEVSTAK1'SPRAVEGKVKVSSSDSPGNPTGIPNSNTLECT715 hypothetical 0rocsin RAEKF?.EKOFSRDpLSEDQN14.ARAH71GLLT~AlIP0EVL5NSV1ISCP51~IFPPPKFSC1'LLKVaCIJWLAVI
~f'wRTOSICRQTL6IVRRYPSEFKIISN71SYGNNLRLffQOLICFAPLM

DKSKHKSPCIF~CSTNttTNFSPLREGTVK511VKSLPHPESMIRfPKDSIVSREEAVYNECVYNF~ICQRPPH110FF
LCQDGL71QLCIND'IYTTWAASSOIPJILPAILA&000CK

PEAWIIFSTAFKNPINSSONfLPIAVESVFPRESCI1~'rAllf'aSDAVS55YHFLAORGVSLLALALJ1NKEILVCA
GELYSKTAKPNOIKVLPIDSEHNALYOCLE1GR'l'It~IIDa.ILTJ190G

APLPI41TDDYKEKLEANIICPGGPPDPLIY9YRNIfAVEPPIVLRSPOPFSGSSRISVOGKPPLLNKSLCELSCVTKO
WWRPIN8~14CSXIIrVI>SSTLVNI(CLIIEJIYN4lCGC~NLIL)1 ' EAASVIiDOGCGGNSGGFSGOpRRCSSCOKASROty00CKKLSTDIF
VIHPOSLIHfBNCELDCSVISIIBIPPDMLFPIOYJILTIIPERfASPRDOIDFSKIOpTL.P~

PVDCeIfPSIRLAppVLEKOCSSCSFfNRANEVLVRAFLCEISWCDILIIIILTTU18CNK

CPn_0332 378676 )79536 VYACHSLF~ILE51DGEAR7(frlOEI

CHLTR T2 Protean : ' ' 0316 389690 388701 CPn ILLHLLAVLCPPISFFTOGVSPCVFfCFLDf _ YLDSRIRVIPU1RQRC 070-eroDlytQD-Intpral lhalbtane Protein CPn KKOSW7sLRPSPYYCVSFfOFfSVfFSRLfSOSLPTCSLYIDDIQIIVfIJIISC9CAFlIG

_ TFLVLRKNAhYANAVSH11'LFCLVCVCLP'CNQLTTLSIGTi.TL.71ANATANLiGFLIY1IR
itu8 VDFFVFVFFMCKPKKSRTDRALAOEIOKKSTEVLKKP11RIKAKNRRKFLIAKEOKTLKHRNTfXVSEESSTALVFSLL
FSISLYLLVPMT10'1JWIGTELVIGNADSLTKEDIFPVTIVIL

ApEIIDDL'IR~LLDSOKKLITDKVLIFNYENCFVFTDICDNFSKYSIRLANAVITIFAFRSWCSSFDSVFASSLCIPI
RLVDYLIIFOLSACLVGAFKAVGVt?UtaF

LI IPSLIIUfVIAKSIRSWAWSLVFSICI'AF4APASSMILSAYDLCLSfSCISWPLTN

CPn_D3)4 379309 379837 N'IIWKFISYFRGYFSKNfEKISERSSOY

CTD79 imilaricy TMSVHITPRKCFItCILs7IFTLPTLFpKAHLILFSPYIVIGFYCFSKDKCLVt.At,CCGVLCPn_0347 )?t078 .~.DIJ1LGSRCVFLLLYPLTALITH)fANLIFSKESKAALVIVNNIFYCVPLLLTIPICCALFC069-traC/yt9C-Inepral ll~abtane Protein HEVRWrIDVLHIPLKCSFLDNLIFTSVIYILPCALNSCLHKMISFFRRLVCYTf?ItIPGLSRK1'IWIVLINLSC11F
SDTLFLaSFLIVTLICN1TALWG1'ILLLSIOOPLLS

ESLSNAS'IPCLLV0AIJIlrY'lvFStQAr~IFWIVLFOCAASVfaYOI
IVFttIKYCKLNKD8J1 379p08 )80671 LCfVLWFFAIGVILASYt'KESSPTLYNRINAYLYQOAATLCfLFJITW1IVICASLIAL
~.Pn 0.35 _ wWWlfR0hM1'fDKDfAVTC'LKTVLYFJ1LSLIPISLVIVSCVRSVCIVLISAMVAPSL
tolD-Nechylene Tecrahydrotolace Oehydrovenaae EICNLLP':LPMEKILJRLKEEISOSPTSPCL.AVVLICNDPASEVYVCMKVKKATEICILGAROISDRL.a"l'ILI4 SAFFwI~.ALC.~>YISVAFTCRJ1IICOQAVPVTLPiGPLWICAG

.~.KJWKLPCDSTLSrVLKLtERLN00PSINCILVOLPLPKHLDSEYILQAISPOKDVDCLHLLAGLCLLFSPKr'.~W
VIRfI'RRKNFSFSKDOEHLLKVFWHISHNRLFNISVRDFVCSYKY

PVN4CKLLtA:NFOGLLFiTPACIIELLNYYEIPLRGRNMIVCRSNIVCKPLMLJaIQKHQE'IfCPKPfPRWRV'OIL
EH11'w"YSIKKEODYYRLTKKCRSEALRLYMHRLWE.9YLVNSLDF

I~ytTX.'"'ffVWt:OSFlILtEILKT.~DLIIAAtGAPLFIKE7?NAPHAVIVDUr7TrRVPADN:~YE..~VNELJ
IEEfEHVLTEELD11TLTEIWDP~!DPIIPr3IIPNKKKEV

AKr;pTLII:DVDFNNVI.TKt.'.IAITfVfCtIVf:PM'VIWLN.3MWRf.'YONFS

CPn_0349 )x1915 l9lD'!.7 "fKS u:sr. irlt)S~a tw1591 Db9-troBlyeWB-ALK: erannpnrewr A'fPase:

.!m 11, tll~ILIIJVYDETtW
:VHNI.SIN'IEtIAAVLY11I::F.~.IGI!r:CLTAIII:PNI)M:KI:TLI*A."LI:

vt'IXCMI:apl::::;:WftHW::IIIa:RY1111XJYNaNLPKFFLYLG:LCIt:::I::X1KTTTIOGEpMfILIY
P.~.Sr.TrIFFNOKPKK1ROPIA'MPQRA.:ILYIAFPNTVLDIJUl4rXVYCYKI71~RL~.S

;"lNIVII.T::I::AKi:1'vl::I,:rf~IpBt:FHKID::LYNNWNPYS6L::rfNRAPADVPITL::VELINnP.
RFJ1FIIILfRVGL.E:3VACRrJIr.'OL;:tXaXyJPAFLAHAU10KADLYU4DELF::ALONAS

:aat.lY1'Ifll'LYKL::17GRFhITtICPLKTWf.IJILK::OTLPPKDVWF.ONYK0W~1~IQHLEFO::FT'TS
V1:'IWELRDOCKTW't'VNHDL::IIVRpC.FFdIWLWKRLIr:Ia:I'TDfX:LttLDfLfQT

trrKTt.IYKN1~IIVrJfDL.I:WKriY\VIX:WEIt:NTFr:PNNYVI:WIX:EIKT::CH11P:~RPWRYra:EIE
LLF70TLKL.~.IK'*VF;:C~:

Ir~:FlV:fILlilIr111AtA'Y:7~IIIIWIYAYf7r:KIYT1IILClfR'R:KII.ELaSYPI0:1V.3WN'T'-, ;.,I
It:r'A'lAhIllA'fVIXfFIC:KIFJIICyWAEEI111IL't'YTt)fl7A.~.'.ir y ':f'n O14'l y ' y I-rr.A! rQA-::slur.. Prnr.
vtr Ismv)tro7 t'.vnily n.

n WILKHA::RFIAWIDIGYIFKVMttWTFrFVACC:TkALItOPEGIPVTr'::...'Wr:~
~KI~fAIDAW:i::'C:..::.:::.A~..ARF~.:::4MEIRDCA
iIANSRPCILS'FTIRNIHDC
LFF71PNDP'VFIDDVFHALYJ1~aKIISItAGGt~ILL::FA.'iKCftFR'.LOhGEIA

'JERIM:NHIr\TAVLIKGSGDPFIAY~IVIttIDKDKIAGSAVIFCNGLGLOtCLaLRKtILEM'lARIKPGTPL
LFMI10CCIIOSAFirniltH AAilPf~faLFO~fIIAIiI~i.R
R LP

I'FI;:VKLV.ERLIARCAFVPLEEOCICDPHIY81DL.~stYlKfaVIEITLIILIEKFPCSiSAEFKAIPCLAAAIT
FIf~:7PR'IABLiNOGGRO~IfCAFCdFERitDR

a;EEL12Fa1SILD~IAKOCL~IIPOiUIYLV~FffiAFSYPfRRYLATPEEVASCAWPSRC

ISPECL.iPEAOI ~VRDINAWDYINENOVSWFPED'I'I1~DALKKIVSSLKKSNLVRLAOK

KPLYaDNVDON'lF.:TFKHNVf:LITELC~IALECOR106b50 40578.
Cpn_0361 SYt .P~ Synchee~se Yr tYrS

~tm o75n 797167 J~7b84 ; ~
T
- ,. .~ ~ .....,... . , ......: .P \ r-:1~\..LT':IIW:-~
, ... I.-: ...... , . :!~r':F'.-~ ~ ',K:'.: ' ny:.ll,~,.
..,....~ .,.,y..14F111i'!': -';'tw: .-:?..:.:..:.':':".',::.
y.. '...
.... YILb\4:
:w. " ' KlU!Y ..
~
, , ., .
, 'CL:JWIAOWWEI.:~...i:w:'w;iKIIthaw:wML':K~:.i::HVH.i::E;;IrYiEF::YL:LOiYD
, DFIRRICGIG
. AYG:
... . :YPLLiNJ
. GIGCiG%TiSG
! ' ...!,;;
.;.,.... ;, :Y:.:':YA . ..tlCG....
ILFSETPRTINPKPW1PR(iSKKRRDFINFTtITDICRYLEIJIROVt70KDLi.
' F:I" O
LLKNLKTF .
ARFSPKKPLTSLKRELIRSIRNGIVSVELWNAYVFaVRAVSSPNLEV1'SPFVDG~'tiTSCI
IO
FYHLFKNYCTILr.Ci..S
PKIARTL2LLSNEEIODI~tRVpTDPVAVKWA

'MiLDSOLTSPFELYOY;.LRLPDDTT

ODILS11IIK:DIGLEtaLSS~TRS~tPt~SGS%~Flt~.!'AG~ASLDKSLVIGIWIiLD

CPn_0351 791861 395133 4FLVIGLCKSKGIIRRLIEOKCVYINNVPiANEHbI7CEE0DIGYCdIYVLLrIQCKIOt%t.VL

adc-ADP/ATP Translouse YW
ROTImTL
TVL

.
KIKVfQRVI~BTfKTEDCPFGKLRSFWPIHTHCLKKVLPNFIJIFFCITIt~iY7 StIIt.SKGALFYAVG'i'PFLIFFALFPT
IYAm IVCApGSGAEAIPFIKFWLWPCAI If?G

. 055 . CPt4.,036Z 107113 10 AAIYVLA~S~t~~'F
W

fF CliA/rpsD-SiOea-1B/WhIG FaailY
VIYPLROVLHPfEFIIDRLQAILPPGLIGLVAILANSINIITE
KRFYALFGIGANISLLASGRAIVNJ1SKLRUYSDCV~WGISLItIi.WVIIi' ANEITK
H

I
LDKJOIIVIITOQ'I'ONIIE1R4NFYWCIOEIEYRDSLIEFYLPLVIISWNRLi&ONI
EA VN
IV9GLVUtASYWWINIfNVLTDPRIYNPEIHOKG100GA1CPIOB4lAmSFLYf.ARSPYiLLLAK7WKL5G
' LVIAYGICINLIEIrIyIKSOLKIQYPN4JDYSEFNCNFSII~IfGWSVLIt4.FVOf:NViRKO
t LI1GAIIDDLRKOOIiVPRS
DLYASGVWLVRAVERYNP6R$RAFEDYAV!
SD~i10101I
VSWtMPS
VMRPAL

. .
FGWL'iGALVTpVNVGLTCIVFF11LVLFRNQASCLVANFG1TPIJG.AWYGIIIGJIW''1ISTI
O
ASLROSLGKE~WI~~~
EFS~IOELEKERKVMALYYYEELYLI~IGKVtaVS

KYALFDSTIIB'IAYIPLDDEOKV1IGKMIOWAARt~CKSQGALI00GLLVICGSICAK1'PYEERIPDE>1A~
SI
FR

LAVILLFIIAIWLVSATKLN%LfLAOSALKEOEVApEDSAPASSt , , .
ESRVSOINSKALLKLPAAL

0357 395178 396130 CPn_0363 109700 107913 ' CPn _ flhA-FlaQellar Secretion Protein No robust homolog present in Genebartk/~I.KIRlfT1118S1t as of 11/7/98 !

WVGIFFINSHFfNSYAFFNOtfVIITVRIfSCf.Tl9CCSPLTLVPNiTLID4DCECHRSCSLKIGLCISFAfSLLT
GVIYSGKKDGVROIIPVPLSILVLIFLPLPOILLD!
~Ri~W~TRWIVSSGTASSLIVSLGSFFSLGSWAATFACL4LF
CLlPPFFLYLC' RTTARLILGLVLALVSJ1LSFVFW1PISYAICGTLAtaAIVTLIITLWALLA1ISKVLPI5J11 1VNFLJIVSKGSDtIAEIIRSRFFLEALPAKQNU.DSDLVSGW1SYR11V11%OICiALI~DF

PNELQKIIYNRYPKEVFYFVIC"HSLTVNELKIFINCWKSCfDLP~LJ~AEAPT3iDILKFSAI~CVFRFVIIGDAiIS
CILLLVNWSVTCLYYTSGYIILOpNMPIVfGpIIt.VSQVPALL
IDIGYTT
' QKVYGCi.GPWI
TSCA71ATLISKI~LSLI31YLFEYYItQLRQHFRVVSLLIFSLCCiPSlPI~PIVLLiISL
SIDLTLFPEFEEILLONCPLYWGSNFIDKTESVAGEIGLMCl IFHSYTRPLLTLISESpYKFLYSKASIC~IQWDSPSVIOCl'CLEIF'~lpEIStIFRI~IIOGIS

OFLFLFFSHGI'MEQAQNIOLINPDNWIC4LC0FDKAGGI~TFOCFIM'L79'~'DPVSL.wLRYRJ~EPIIS~SCIFR
11FSYVOGACPKEOESQFYQVYRAASIEVF~VRLM'tS
tAEVV
KYLCfSERVICIAVED
RNIJI
A
' SNYEPTVNFKIwKE4KVLLEKVKESPMiPASALVOKICttN~tIOIDNLLO~OFVRM'SSOPFJ1VLPFL
Hd II
O
LRIL~tPWLRVFT70NVYLD~1 IVPACISLSSLVVLSRLLVRERVSLRLFPxILiJIVAVYONSGDSLEILitBI(IIRKSfaYWI

art'SSLPOYAFHAOTYKLEKKIESSLPIRSSL
GRSIJtOG100TLEVITIDPNVP~LINSSYSRSNPVlIpt3iVIRRVDSLLBRSVFKDFRAIV'I

CPn_0353 396893 397135 SCLTRF~9flG4.DPHr~CViS~LP~IPISFLGIVSDEVLVP

No robust haooloq present in G~tabsttk/H~L
as of 11/7/98 109951 110378 LRFRNIKKSLIPIKRIAYSOSGKEOKOARPtfIGtSITSSLVILtS.IAIFNH~iFSSII0fC~1CPI1.,0361 FFKa4FIWI0~tTSINRIFVKFT1 Eer1-Ferredotdn IV
KaISNAKLVITSDDL00EFF3.EDNSEiAEPCES?ICI
PFACTF.CVCCTCVIIIIL~RAif.S

CPn",0351 397062 398507 ~W'~~F

No robust hoslolop Dresent in Genebsetk/C~I.Cp~0765 410198 411511 as of 11/7/98 l015T
~TYYF~

. No robuac hotsoioQ Prvsestc in Gafebank/D~L
YKTISIKILKIXTFLLIGF4LNLRYNTOIDEPRRG45NITSPVIOlC4as of 1117/91 'T-'r'-~-rr.IALIGVILGII~ITPNISSItE
'"IHIVISAILLCGALIAFLCIIAAPVSYI!~

.
FKQtQVNSL~TISPISLTVOHPLVCrKIGLRCSNF~CI05RILLITAiIAVWtIOlLI.
pVPPQELVNRIPAIIYPKPVSDFVSGKPN<.I~LISFIDLLNOLNSLYGSS1NYNVSE~.O

OKIDTFEGIARLi04EVRTASLKRLESMSSRPLFPSLPXIIQINPPFPWLGBFISJIGS%VIGLI1111PVIYFL'i'r ISFIAWLSNFILYIIRATTiZICPRACGI~ttilalaV~SS

VEyNRVID(IOGSLF~DLSDYIKP~.PTYWLIPL~'RPTNSSIWLHTLViaRVLTRDVFISIAINRSKPI~LPAPSALL
TDNPYEIWIDIIWSLFSLVSLLPO~dLI
fiKTLLIF1TSGNAFISSYVDTi'PSPKSLLN6JIIOfTRVEINI'tLIAWIO~LY
SAB~

ONLICYAAI14G684iiWISDLNMKOQLFAKYNAAY0SY10i1.60PSLOmEFYNLLLCIFIOf.
WOPDPI%iRVFLPOIPtTPFJIIYOYYYALYV'I'YIOTAIITINt'pIIOIPLYSLJtOQ.YfRC.

RYSWK~ISLIKTVP)1DWEJL.CCLTLDIti'GRPODI~FASLIGTLYTOCLiNKFS%AFLSSPPp9RN00SIJ1NITA
VKY11AELHP6YPLTIACVERSLAQt.POESI~L.B

LTLLSLOQFKTIRROSTNIAIffLBJLA't'ID4STtRSLPPITVNPLKRSVFSOPE~Si'L

IG
CPeLD366 4119'76 113140 CPt~03S5 399955 398591 No robust haooloq Present in Genebank/FJ~L
as o! 11/7/91 ' No robust holeoloq prnenc in Cenebank/EI~LGNOKLIELKGKOOAESSPRTi?SVILEVti.VmC
as of 1117/98 NGYLPVSATDULtISPAAPLINSAt~T1 ' ' IRDPYLHIIY?AFNRSISKELA!lSKfIVPHJIf.P1041tCECNSTFPLSSRTIVRIAIASLFCYKY
ILLVAVIILF'CFLMVPFMIOfI
CLIVLSLL71IRFALOF?LCfGNPIUIIAVLJIVSCI

ICAtaAIGCLiIPPVSYIVCSVLtIFIAFYILSLVIIJILIFTiCIGG.PP'1'PRIIPDRITNVIDVKTVODIfASTI
IISHGQTPTL~'IFSGIVYAE&QAGL

e'rIYGLSISAFVREOQVTLAEFttOFSTALf.CNISPEEKIXOLPSELRSKViSFGISRLAGD

CPn t.FIG4NGIPIF~LLSQTCPLYWLOKFISAGDPOVCRDLCVPREI:YCYYWt.GPt.GIfSTAID1T_ EIfwDTDEVKAIYERIY7TYTAROTLxi'F31GG1.TNo robust howolop present in OetfebanklFiBL
LTKEwLLLxNKAL as of 11/7/98 L

IFCKETHNI
SFPWRYfKfKiTSIPOVHFi~IDSHLSVDERLISFSPVLTKXEVIAKIIKLTALILiII~IIA
DG
O
KETISKELLLLSLHGYSFDOLQLITOLPRD71WDWLCFVDNSTAYM.OICALVGALSSGNL

LDESSIDP'WNLCLYVIODLKFrIVpAFSASDLPIGCtLGKFWtFDSSVSIGtLSSVLROGLVGTAWAGVLtiIPL4t7 tI11TGAALtaAWLSCLLLRRREPSKPTEELLGPOKNVPKDIAJIQ
' HRIALEt~NARARVYOVNPYTCMINRKTSIlP'ImEGDIJ1II
WPSVPi.0Y0KLLRN6Wl'LVDtfLSEINISWCLOOPNORYYVWEtpGAPITLVXI

PRLICtSCRVNiVNAaNSNIOSCCACIWU1ISMTNPTCWNDII'RTSGGKII~1I!GIOGLSVGDC

CPn."0356 400165 100109 No robust homoloq present in Genebank/Et~L
as of 11/7/98 113766 414107 KOVQLFOYtQIESQJDWt.CDFDSOCDGFQLSRLVGLLHS5WJ1LYPJ1KE0FYLPEVSLLISlECPt~0368 No robust homoioq pressnt in Genebank/F~tBL
as of 11/7/98 FyIDpLIsSKpCIWGyAKDLCNVFEKHIpRFRQYLGSLDLI,10RFHJTFLNYpKyNLpRETLAKDIfLWVN71A0HPC
SIETCRINDTNPGFJUiFLAOLLCPKYDCLXANPEKLSN1II%KA

YLNCFDF~IIId~tOA.IJVOVPLISSSIYSPCGKLELEPVNQ'fKPNSSAYKLYHIRT
CPn _ uo robust homoloq present in Genebenk/D~HL
as of 1117/98 YSSF81CASMVNIOPVYRNfQVNYSOATOFSVCOPALSLIIVS17VMVIJIIVIILVCSOSLLCPn_0369 111115 ~IELGTALVLVSLtLFASAMFNIYIMROEPKELLIPKKINELIOENYPSIWDFIRDGEVCR'058 hypochecieai Drocein_3 3LYEIHHLISIWKTNVFDKAPVYI.OEKLLOFGIEKFKDVHPSKLPNPEEILL4HCPIJIWNIKCDSNPLPSYT~'1SL
YRTPAKHSYPIRLPLNRTDRIEKILKIVtLTLALaCJILGFSIA

LCRLYIPMVSOVTPfiCYCYYWCCPLCLYENAPSLFERRSLLLLKKISlGEFALLEDGLKKAGILJWPIFSAVLYTTL1 L11VSLYSLLKKPKLYEILPOIEPESEOSSLSPSPQIP~OD

MWSSSELVOTRONLFTRYYADKEEVDEAELNADYEOFDSLLHLIFSHKLSLPLOIDPLPDPES:1EVSL1DLTTPPEEL
TAITVTPGYCALLEONYIa.LPSLAAVDPSFT
' TMtAL.S
TETPOOPCFLWKLKDSKLIFISTSCOIAVPRIKTpCRVMIVNJUWI71I8RmOG

LATSLOG1INASRL?RAHSRSGSOLOPGECRSAKWFatSDHTSNDN11PGKJ1NFL14~hGPEA
CPn _ AKCtiJDPKOAFE'YSKKAPHM.FOFJIEIICVDVIOLPLIGCNLFAPSRf.WLCKTRAWIE
Flo robust homolog Oresenc in Genebank/E11BL
as of 11/7/98 EEVLSV.SMKLIPTQDSIERE'fDSKROKKIFTIYICSSKVL74GHFFSHLDKHNKIHSTGVAIKLALITSLODFv'nI
E00NlEEDKIIILTOKDOPPIIPPRFOLTTP
~

401991 403117 CPn 0770 415755 416913 ~Prt _ CTO58 hypothetical procein_3 ~e~~.pasa ITLpYILItEYKIFlITRHFSIIAHIDHGKSTIADRLL.a~TSTVEERFytREOLLDStiDGEREKRIFFKLFVFYLKS
FMSTfEPNLTNVNLTNLL3SESMPMIJ15NKLKGLDLVAPILIiGI

R,ITIIVWPYITffYLYECEVYOWLIOTPCHVDFSYEYSRSLSACECALLIVDMQGVOAAVSSG't'MIIIGIPLLFIL
T.1WVLAF.iILLYFLLREPKSPISYMIQPTPTTKDTDLPW

t35tJ1NV'1L\LERDLEIIPVWKtDLPMDPVRIA00IEDYICLDTTNtIACSAK1COCIPPPLALTPVPTEAILEEPP
LFSPRTHOTLLOEIB~JDIttPDLOANTOMPFIMDNO'fOYAYML

atLKAIIDLVPPPKAPAETELKALVFDSHYDPYVCINVYVRTISCELIfI(ODRITFMAAKGKNSNLTLISTIGFIEKP
R1'A"COCTVNLVNAATPF81AMJVK(TSLALAIfAT3VPGIiDISKK

::SFEJII:LCAFLPKATFtECSLRFCQVCFFIANLKKtIKDVKICDTV'fKTKNPAKTPLECFSPOPLRSKOPLC4:E
C:RSAAI~tE'NtlxT1'NAJ:KI1CLPDFLCCLIGPIUSDYNYNPNDAFTF

KEINPV'~FACIYPIDS3DFDTLKDALGRLQWDSALTIEpESSNSLGFCFRCGFLCLWLCROAYLNCWFJIKRRKT'M' CLPLL:.~.fIFW:.~.FKDEETfSLRLOWIOCNKIALIDAIptF

F:IIFERIIREFDI~IiATAt'.~.VIYKWLKNCKVLDIGNPSCIfPOPAIIftIVEEPWVHVNIGf.FJIENONOPWV
T:=TTLViHPLITP

ITfOEl4;N IlMLt'.LDKRC tt.'VKTE34LOpHRLVL1 YELP WEIVSDFHDKLICSVIItGYCS

Yfi'IRf/iG'lI!KC~IIKLCVLtNEEPIDAF3CLVHRDYAESRGR~ICEKLVDVIP00LPKIPCPn 0771 ~lhl4t 417:4:

tJMItIYKVIAREfIRII::KNVTAttCYOf:DfTRKRY.LWEKOKItt3KKRNKEFCKVSIPM'ANo ntGUSC
tuvn..l.xl Vt'.eahnt m W rn:rmnk/F1~IDL .t:: mt tl!'I/'tN

!: ip-Jtyl,p K'MPV3:APLPT~IIRPS:x:NU:WF.C'fC:KAI.YAY11~~DY~'fY.TTKLLVKTLVAILVtEVII:

f IMPFIPrI'PPI::.I IIt;:LILTTIV
VI.Lt:IfNL.If.VIIKTtLTfAF7~~aTKRKI:1'SII3i wttt o:!o au.tt.I 4ut'mz >

'Ts:.u tr/tt.tt.rt.:tt innr.in 'JAfJiftltr:1.f:WVFK:KNLVINNIDtICFSV.'WNRTFEKTRGFLKEYf'NtIRELVI:F~SLEoPt~ Ut7.
'tl'oa 41wn.1 Id~'IFC:LF:HI~ItKIItUIIVN:KfVDp::IIIALLPFLEhifiJIIffIN::YFKDSERFCKELOEKFlo rrtt~u.~.t hom.Uxt 4'tarertu.
W t:.yaa.mk/F:NItI. .t:: nl Il/~//'tN

:It.PIJ:P:I:;t%:EI~I:ARIN:P::IMFt7f:NPRAWfLVAFCFOaIMKVIY:RPrC::'YAT7YX:ACNYRACI
IRHIt:HHII~':PWh:C:.::A::F'/FXrPYI::YFI.F:kI:Y::X:HUIKIAFM:.TALLLW
' !!'I'/YA'/tIN:IF.'ft:AtULU:P.A'h:ILROYLKL:iA'PAVATILKLWM'LELESYLIRLA.~sEVL1'CSV
tHII'FT/Ir:f4.F'LI:::.ILL.AIHt.I::MYKITtit'NVl'1I:N
'CF'V7DIVAIAMIF\

".Pn_0383 1. ~)07is ~'pn 017r IlRlSri 120218 CT017 hypochectsal procetn ' .
V
VODITPLTLPMOIWt.?3iD0w"uWYAEIWBAIA:.dt'eaa4ED01CQ1.
' i ' OGIJIPATLM~ICtPALIOB(t~'hG~CZNY1KPPLA'91(O~iRYfKIfH~P~1 ' ' B

EC1RQ.SMLPSAL,.'LSGFGEiiPADItOICRIIRt.:.LpMERIp:I:rGSC.."LA:ILFLMI~ST
ITLTTDIDST
EpIY
LITPAINSSRRKTNTVRIGNLYIGSDNSIK
OSITS
N
IPEIFN!
ALAFJfNCDIVRVTVOCIKEJ10ACEKIKERLIAIGLNIPLYIIDINPPPOAAl4,VADPADKV

AINPCNYIOKHNMPICGTKIYTEASYAO.r~LLRLEEKFAPLVflCCKRLCKAIOtICVN~SSLSSLPOILSEPOIILL
CSJf~KTSLONSDIKELYVKKEKIaLHKPRDSLIJtRDPV~1100IJIF

ERIMOKYCCtCIIVA.iAIE'IIAVCIOG.NYRDWFSMKSSNPKZl4V1'AYROW(OLOAACLLEDGEDPLG'.ITFLR
l'OCLYCLIISIEEGSKEMIHP1!F'ntYGKERLHOALN~aLIYM:L.:

wLYPLHLCJTEAtA4CV0taIK5AVGICTLLAEGLCD1'IRCSL'iGCPITEIPYCDSGt.RNTIO~BNpOPIVAVta' LVIRMVNL

..... . . :";ICrr/,a--:..r.-': ..~:
Ff.r;.~ : 'r:l' __~",.~..
.;.' _ .
' ,~
~

. , _ .:, .'I w "
....
,.
;
,,.y.,,~l...~;,:~:~ItFr:.Flnlln_YO::,v~I:,FEr~r~:~rr JHt;APtvHFHA50PF1H1'S'il0FFt3cOGNOt:KPTKI:JFSROFDNNEEN:I:.Ia:EFGALL:.ncCU-Htrconw-i:Ke 1rUt1il11 ~

OCiGEJIWLDLPHLPLOWLXIAFCTLOtIANRLVKTEYISCItlICCRTLFDLEEVlTRIRVITCLItIGIKMIGAOKK
OSGIOITA.iMVRKPAKK'JMKR'1S'KKATVItKTAVKKPAVItRTII

KRTpNLPGLKIAIMGCZVNGPGPHAOADFGPVCSK1'Cf4ZDLYVKNTCVKIJtIPIff~AEEEAKKTVAKK1TAXRTV
RKTVJUOIPAYKKVMKRVVKK'.SfAKKT'fAKRAVRK1YA10tpVARK

LIRLLOEf~VWKDPELrTIC.TV
TTVJ110GSPKMMCaLICNKNNKNTS~KRVCSSTATRKNGSKSRVR'1'AtII~IRNtX.I>0!!f SR

CPrt_0374 120109 130961 CT056 hypothetical protein CPrL0385 431011 43252=

VDSlICLSFNTHPL~NYWL'fI~FDGLPIRHCVFSKOKDAEGTYPAAICtPEIASALOSPKYCDpepA-LeucYl Aeinopepcidaee A

LNORNGTSYRMPTSPrYQPIIOGIL'TOSPLLSIJIIRNSDCOAAIFYDREIOD1IANVHSGFLVIIOGt~VH.FItAQ
i~RNRV1U1DJ1IVLPI~IiHPKDA10JAA5FFJ1EFEP5YLPAL~OG

wRGGtGNIYAVTVGTMKIfLFHI'KPQDLtYAIGPSIGPDYAIYPDYATLFPRSFLPF4III~tPKK7GCIELLYSSPM
KWtIVLLCI1GKNEELTSDVVP01'YATLTRYLIIKAKCSTVNIIGPT

PIHfDLRAIARKOLTNLGZSIfDRIFISDLLTYTENDAPFSSRYLI1NNPDPNLtGQH5I0J10JISELItL511E6FL
VCLSSGILSGNYDYPRYMNDHNLETPLSKVTVIGIYP101ApAIfRlIE

WI'AVLLLPRD
MII~iYYLTRDLVNAHADEITPKKLIVRVAI14L~CKEFPSIDTKYI,~DAZAttEIOGLi.Li1 VS10MC1fDPNFIWRYOGRPK$KZ~'VLIGIK'S..
'fFDS~LDLIIPGKStQ.'IlOfJC~IilOC7lT

Cpn_0375 421111 411615 VLGILSALiIYLCLPIHVICIZPATENAZOCILSYlOIGtNYVCtIS"LSVEICSTOi~LIL

Ho robust homolop presort in Genebsak/EMBLAD11ZTYALKYCKPTRIZDF11TLTC11MWSLGEEYAGFPSNNDYLAEDLLE7LSAlTteEPLt4 as of 11/7/98 RLSMKLGASTNHKVHEPVKPKKApLAEIEAtBCTOJITEClLRSKSL71WZARJ1VLYILPMRLPLV)tIfYDKTLN5D
IA0t0QiLCSMUGJ1ITA1LT1.QRFLEESSVAfiAllf,0laC1'AYNZIC
' :lILAJYCITFVTFL71L.CFPLIQAYSIACIITLVGIaICLVLLILSLLPKEDe~1~rr.e~gEDRIIPKYASGFGVR
SILYYLENSLSK

LLPLTIIVIEOOPZTPKPEIPYSYLTKLU.L.TSLfLTLRRSSSORIfIN

CPn_0386 131543 131016 CPn_0376 121680 411191 sab-85 DNA 8lndinq Protein No robust hamoloq present in Genebank/EMNL.KSIE:YL18Q'CNFAGYLCAD ETVNCKCNIt~Y
as of 11/7198 FKV1I1'AKIIPNLTEIROIGARWSLPLLSPLTSMaIOGtXxIIfSAPLIIQL00LiGEEOMfJIMK~M.PIfLbIGSG
YZYAGOISVEBYMSKDGSPOSSLVISYDSLWSPPGRNtsI7SRSPSLED

TKMNSRKIfAGpNAIFNSPTPCVSSTLVWPI'PWGYYDKWODILLR1I9PNSSSL9EKDSKNNp00G7fESVSVGPtI:
PJILMBJ1IKDKaNYACYGQEppYVCEDVPt EFLI04LFVDLLELJGfTSVIfINAEEAFTPLDNTGKPHPKRONVYLPC>Qi.GiILtIPJIAVOAN

vSaDTpFTLFL,TpDECNPPttDltlOiC CPI>r03A7 435229 431699 CT013 hypothetical proceln CI~0377 123111 122317 M~Id.OGDSLMSRONALtZILIQO'AKIffr.RLPDVAFDOMJIICILFVDGEPSLNLTYElQi6D

sue8-Dihydrolipoamida SueciflyltransferaseRLYYYAPLLDGLPON1'OWGJILY6KLL1~SMLCCpINCGGVCV11TKEOLILMNCVLI
lOLY

iM'1TEVRIPNIATSISEVN7LSLLYi'EGALIOB~IpGLLEIESDKVNOLZYAPVSCRI~WEAETIaiJWAOLPZETW
KWNTVCADZCJIGREpSVD'fIIPQMPOGOLiQAPPP1GIM

VSEGDUVPYOL1IVGKIEPAGEGEEtaDSOSKETIEAEIICFPOSGVRQSPPt~C1'tZPLR

DQ!IDOCSpGLSaGORGETRERt!!'SIRKTISRALLSALiiESlltIZ.TTFtIEYYN1'PLFtILAEECPfL038B
131313 4)7320 KOEEPLSRYGVKIGFMSPtYKIIVLEALKAYPAVN11YIDCEEIVYRIIYYDI5I11VGI0MGLqlqX-GlYCOqen Nydsolase Idebranehinq) VVPVIRDCDIG.SrGEIOpKLADLALRARECLIJ1IAELEGf~FTITIK7L11YGSLLSTPI~NSZTIGLVSSYPSVPL
PLGASKISPNRYRP1ILYAliOATEVIL71L1T~18EVIEVPLYPDIMR

pPOUGZLCi9fKZIGtPVVLONEIVIAOtIatYVALSYDNRLIDGKTAVGFLVKVICi7GGENPAiGAIWNIEIEGISD
p88YaFRVIIOPR>O~MpYSFIfLYLRDPYAKFIINSPOS!'GSRKRDOD

SLLDL
YAIrYLItEEPPPt~OpPLtQ.PI~BMIIY1'lBIVRSP1'OSSSSIMIAp~QrPIGIIEKIONL

NKLGINAVCLLPIPt~6171NPf1~KPPYi.CtiYWOYAPIi~IPPSPCRRY11YASDPGPSR

CPn_0378 126195 123115 EPRTLVKTLt~ECILVILDWPNIIiCLOGTICSLPItZDrPSYYILMOGNITNlfS00CET1' suU-Oxoqlucarace DahydroqHaae LIfIIMAP1TOWILDILRIMII>IJQOiVOGPRPtIt.ASVESRGPSCSPLQPAPVLOIIfI~LL

IVPICFNYFIlIDSSEFVCOVIISSIk~WIESMYQRFMNNETLDPSWKYPPI4CYpLGQMSPSEASTKIIA6PWWGGLI
lQ9CYPff1'LSPRfiSEWIiGPYRONV101!'LI~OQNI,IG?FA51lI8Gs ASTKISQLEl'IAFIZ.QCQKSOPLCTIYRYYGYLQSOISTLAP1TDSRPIOEKI71KIDLDDpQDIYPIIOStrITIS
IMfVSCNDGPTLCD'I1RYNIIKiWEANCEONRDGT011NYltYNIOT~IC1' VPS71GLLP1IAQVStfltELIEALKI(CYCCSLTLETLTCTPLLOEFVtIiLIOIKPAEpLI~PGILEYR>OtOLPNP
PLTLMVSOGIPMI0SG0EYAN'1'AF~BEiRMALDSNNfYfIi~/pL

LRSYIIDLCItA?PFEEFLOIIQ'I'GpKRFSLGCGETLVPtC.EI6N11YGSJILGISNYVGD01HTJUIPItJOIPL
CDLZAPRIGtYKTLFNRGFLStRCEISSiVWFMtPM'lliRpGNPLAPICIItiPK

AGRLNVLTNVLCEPrRYVPMePmDP
ANV:vArNVG~oDOLILTLPMSm~fLProIVASSOOCrvPONVATPSw:LOeFn~l'tsus NASIG.ESVOPIVEGV11AAIQI~tAGKEQ&SIJ1ILVHGDAAF90pCVHfCI'LOLSRVPGYMl~.lrl' STEGTLHIWtNYIGPTAVPRESRSTPYtTDZAIO~.GIPVPRVNSCDWACItJIIEYJ1L0 VRFJtf'SGDVSIDLCCYRKYiWBtE8t7DPSV'1'APLLYDOIXRKIfSIREL!'AQYLL~IOFIIDICPIIr0389 SEETL71SIEKEZOESLNREFOvLR;i'DPEPPPK1IECHHCDRLN4GELILNOCDNSt.ONt:TCT011 hypothet:ieal protein LFNNSSRICGfPONFlIPHPKI
11SLLIDGYMLRLTVPNPKRPYOKZ750RQta71'ICLRPPKXTCKELIEPRRRTVKLLKlNLIGLFISNSIBGF

~rODSIAGTFSORNLVFISt7NICD1'IfSPLYHi.SAEpGSVGIYNSPLSEYAIL6!'EYGYAOSEVRVSDTPVKpDT
WEPKIRVLL6N!'.bl'1'ALIPrIKGPYRIYGONVLL,DTJ1I000RL11VN

QALKTLVLWE1IOPGDPANGIVQIIFDQYISSGISDIVGLLPtICYECOGPPJISSSR7lLYmCiZRWGEFYPCIrQCL
KZEPVD02ASLPPNGI0Y0CSLY1MRKDtIICIMVSti6YPIED

IIxlYLQLAANWNFOWLPSTPVOYFRILRENAKRDLSLPLVIPTPKLIiRYPQCVSSIEEYLK&VLSI1IYLEELDREJ
1LSACIILRTALYEKLLiIRNPONFWIMtAEEITiYJIGIIGYtICQ

FTEPCGFRAILEN1DPNYDASILVLCSGILIYYDYAJ''lE.p~tRKDPSCLRIESLYPLALEFYGVEEAIDWCARLWD
SPOGLIIOApMS~pSNVDRIJ1IEGFNARQILEKFYKDVOPVIII

DLVSLIDKYSNLKNFIrt4tpEESI0e1G11Y0YtIPMAL.(lOILPEKLLYICRPRSSSTASCSAKE.S~IIEELDGE
IR

LSRQ!S.V1CMETLFSLR

CPn_0390 139171 438134 CPn_0379 416168 126765 ruv8-NOlliday Junction Nellcase C'I053 hypothetical procetn RKSZI~EGSYMINOVAVL~DKIIFDVSLRPIOGLEI~'IfCQHHLKERLDLPLCMLpRGRVPC

KNKKMLC?CSRIODGNPWMKSeer Yvr seEW)n,~,LVp((,KEISRIIOEEIRILEHHCLFtGPPCLGKTSW1IVAY1'VCKrLVLASGPQLIKPSOLrGLLTSL
Q~ONfPIDEIN

KIYEEKERLOLLKF11GEIEYVfPRRSPAX1'VYPDGPSMSDIEFIrEPTLTEIDZDPCETVRMGKVAEEYLYSAMEDP
KVDITIDSGPGARSVRVDLAPFTLVGATTRSOM.SEpLRARPA

ELF3.T~ECREDCAVEVDYSNEDDEDPFSDRNRWRRGGIIOPDANEttFSARLSYYSOpOLKEILVPSSHLLGIE71DS
SALLEI111fRSRGTPRLAWILLRWVRDPAQI

REGNCINCDVAEKAIat2LII~NCWEIDI1ILLTTZI0YY0GGPHGIKTLSVAVCEDIKT

=Pn_0380 416671 127876 LEDVYEPFLILKCFIKK1'PRGRINVTpIJIYDIiLKRHAIDR.tSLC6CQ

hanN-COproporphyrtnoqen IZI Oxidase KSTIPTICftIKTLSAIAiIIGDIIWSLIPtfLM~CMPL71LYIHIPt'CT1IXCRYCSFYTIPyIICPn_0391 SESVSLYCNAVIQCLRKLAPIQETHFIETVPt~O~TPSLVSPLDLKRILKEL1PNAREINo robust hamoLOq Dresenc in Genebank/EJ~L
as of 11/7/98 TLE71NPFM.TVSYLRQLQE1'pINRISVC1IQTPDDSILOLtGRTNSSSMITALOECpNHGKDOLYKOEKPIPKATIL
SRNLEVtO.DtIPKCKRQTLFLGRTSGRSALY5Y5RRILVLIJIAT

FSNLSIDLIYCLPfpSLEIFLSDWOALTLPITHISLYNLTIDPHTStY%HRKILVPTIANRCP

OEEILJ1F~ISLLJ1ENLLLSOGFORYELASYJ1KPDYPAKHNLYYWfDRPF'LGLGNSASOYLN

CEASKNYSHISHYLRJ1VRKNLPTQtTSEILPKKERIKEALALRLRLLWIDIJVEP'PSl'LTCPn_0392 139914 .~.MLTODVKLpM.FSYfbpGLAWRQGRLPHDTIAEEIMCYSFdcd-dCTP Deaelnase MSIKEDIWIREMAItIrIDMIHPFVNGQVNVNEETGEKLI3YCL.~aSYCYOLRLSREFKVF'M

"Pn 0781 12N836 418037 VYNSWDPKCP'fEDIFISITDONCIVPPNSFALARSVEYFRIPRNVLTMCIGKSTYARCG

aT326 similarity I I VNVI'PPEPEWEONV?IELSN7TPLPAK
IY11HOGIApVLEFFS51TCCV51fA0RKGK7f0 aLPNKFAAWfAPTESRSSPPTLLEETEPLSPNPIPADIOIPRITISPPSLDVSIYASSAKOQGItYPCV

EDI~VFIACCPRSSSSASVASOWELVCLCCGDEDPEPPDSEVRTLYVNGSWOTNQPJ1V0 ELLYIaEVRCFJ1VRLL'INOCSCNSPWPISPCRTLPTLDHPLC(~ALLTVWCpPPSAPEI~NCPn_0173 110129 AEFLVIFYCDIUPYI00ALTQSRHSPRLWVCISPTVPIOf;DFRVfINYRVSGDPPSSLOC<~f03R
hypttheCtc.U protein FGTPAFIICTtLPYS9CLECVFLPSIRCPSFIWAVRPr;EOCLVAF1RCE0VEDRtJCLSppAEKFLTLRNCORXFTII
dct:LpR.,~Y;,LSL'/FPARFtJIOTEKESIKSNM:SPYLVSNVSVRKKN

ASGLPfI:ERDt.AWTDL'TDPSsNSRLVEWWOCSt'SSOMEINPYPORePOVAtSALYAISwCPRLLEEVNIIfSWWV
IF.~,ILII,T',FV'IDRAIOELRTEELHLpSKVSSIC00IVSAQEKOR

~'J::.LSVEWILr4IVHE(a.DWIC'l3LIIl4HTTFAVRYFFLLFTNYt~SRERFRTARIYAQOLOLHLOIfWQD.~
.MI61ALLORII:LtPY..YKY.t.CVSPKpQ.~,f~OID

:r.YI.P.: f LVLVPUCr~JVLRK1,WMPpEILRAIF
tSA::TISGS f VFVt:GTRtM:fa:LRNRVp ..F'w/WVICrt:Ll'Vff:fVRASYROR1K:FIICFLOTVH(y:LYLPV.~.IMILt~IAIOVPRILVI'Pn ilf'~4 4A0717 A41u.q vlaItfCAV'IDUINK.~,aEENW:I::CDVWVI~TWFIta:APVLFVNLWFFVKSVLRH3RRRRRtly:~:R:' ltwnut pfotetnlHr:mvlY::m Iw>arprrll KI:'TMI VftIt.NPPI tt'.FTII'..Jf:FI
:L::y IALF::1.1f:'.f.t::IIYKR::K::KK(><H1VATLLIJIPH

..'I,yt~t 170752 4lnnlo HLLITLLF(.'DIC:WLAfONCPAILFr:LM:l.Mr11"n :l.l'LAITf.tIa:EII.(HCAVALpfNfQ

y.rm:Jyt.U.-::nMfwL.mHnt MrrrttycranatnraaelA::~1IAPLI1...'VTKIFNpLWWtaV~':fsrvw~tltt::Kr.JfOItVIVELKFII
W:x:KOIf:V
' I'VTL
VtIpEE?nLLYt7YL,~aLvDCS:IMERtIC~Ftr~4ll.l"II)IIYPff.ENLY1.LF::K(Hk::NVPIf:Hf7N
ILI.fTITft:fAAVETLI':CVICELVIIRLpf:LIVE.iDI&7CAAFLSLNKIPEVIIKPPWI

I::Y.IIAFL1KANDFYLEfIVKHCEN4~,LI:DAfiLM:IJ1DPCA::LVARAItAIIUiPVMF.SCPIA~NWI:Lt_ 't'AN:a.LlJI0KPl4.~.::DfiLi.lLLYYIyYMI1~.'PI::AKMAI.I'NIIMND6TIJ*ti ".:fTI.AItAL;:(:LI'::~E:F'PF4:YLPSY.:PKERVK:iIKKMT.3KEV:uTS'Vt'fET.~.IRtAIYTFEt >!'Y(i::Itl~.LtTUEDLFEIVN:EIVff,NII1K11.'l'Pf:a:Al,VtIA:7~:pL1:LftRF::F:IYDINL

:LIJ7fl.f'::'IAI:Ia'VA::I)L:.uP:iELVLTAQVYI$yp'~EDLC..~VK7vtTKVPI'IFLFHIPNffNNH
IATGkIrJt.tEyIf:fIPTfrl4Yl :'Wt7flIJ.l'~VIJtAAIYINIftIIVYIIIKI.YIn l, "Pn_If'I'~ 111955 141175 NYrJ7AMEKLLtTDfVT:"' nLDKK:'IERLYA:.FpAW
1(LFFLT.:RYYK1'AAP:.FSD
' .

CT257 hypotnecical 0roceln yEr.ATALFSIESCiIPri .DNY
FOAPYLLCC~ICAfVW::-:::.nLLYSKSLP.:DL:.:::;.x.
' ' CNC?fMSALFI~ttCVNIICIVLOCPYSllfOIACISFNRVRLOYYLTKDF(KKARYINFLZRR:
R:iLKDp,YAEP i7G IAI
YRFSPfPIAODLIfCYVOPRSFPNAIfERELLFE
~LI:FLTIIG1L~ 1&MA
RWiIfDPRx ' l PYRL!'~7JIIGYMIALAVCSESSRNCIfRAIGITPDYAPFTOIFtWIFAf3.LPLTISRIfIr ~falI
f OKELEAOCALTSVJI
SAPE00NH,1DFLAPPJIDIfHCIwAWEALIfItYY00LMSL
DLIEACDFKIVM:

PEKLALWCI1PILY'fSHYIFYPLIOLICSLT.F~.I:IYLIJ'IIRKEKfltSTLSRDEFOK71LCTHr SCDDIWDL

HEED'!11'IATNIF.iw~11TC1100VCQPLEQVTIQ.PSSANVKDFCRTIfO'JfDINFIPVYNK

ARIWVLCIAHPKDFVNKALDEPLINNWSPwFITAKSKLIRILKEFRDNR55VAWWASCPn_0108 ~EPLCIL:iLNAIFKILFNITHIIWIJIPK?ISVIERTFFGFISRIImLOKLLDIOFPOYPVECT10-hYpotheci~al pratetn ~-~
. -"
' "

".v~n ..~." \i.-t .'."... ........... , ...... ... -:h:..":Y1Il,:-: up: ~.vet ~,r, t, , .:.:lr-~
.:a':IJ!!'i F, ..... , .
.: _ .
:
:

' _ ~
. .. ... . .. . ~
.
.711~'v.
~':VWAY!w..

r_Pn_0396 111J1y 447741 EFPPDTUINHi.Wt;tulxv::.:.:.:a':.:~t':..wY:,::~::W(.VI7x.L

yhf0-Nits-related protein LVLEASIRIIfWCVL CPn~0109 151615 1551:7 ' PPERCLLEFLOk?FLIEC:YJ1NPSSVNOLGKILSROCT3(i0 hypotheercal protein YSMIYLCtMRIfl WPE~AC
V

SY YINI
SFpCRVLYTSCATESLNLAIASLPKDSHVITSCSE11PAILEPLKIiSSLSAN
C
' ' VLTIEOZERAVTPKTSAIILC,IVNSE'FGAKADIAAIANFAOEROLOFIVDATANVa%RILT
p~f.PLFFVIASa fiJONNLTKfLKSSDEEPFLERFS
iEfLALICY
O
N'Curt VLPSfrYf!liUlfSCMCf'H71L~IGALLVSFGVKLIiPOLWCOGQOCGLA7IGTFi~tLYI7IASLLLPY~JtESNK
ASTARLLHLLNRDIDIPGFf?IDEEOCLIfYRLVLPCLIRittIICIIi.RIYI

YIFKYLDLHOERISOEILTIIRNGPEKAIIWIIPOVNZIfCADOPAIIMiVSAIJ1FPPLl~EVOn'IffL.VCDSFSH
AICLIS9fA11~ILDCLRA0AL0E00EKRNE

LOIALDIECIJICCYGSACSSCATAPFKSLVSNC1IDEELTLATtRFSPSHLLt.OEWt~IAV

CIIEKVVCRL1045 CPn_0410 155087 155833 dnaQ-OIIA Pol III Epsilaf Chain CPtL0397 145124 441)81 tIVRLFKSWKKMfIISSQI1~VLIFYDTCrCC:OIERDRIIEIMYNSVTDESPLTYV11PEI

PPZC Dhosphacase family PIPDGSKItICIITDAVLSAPKFPGYDCFRKfCGEDSILVAlOS4DCFDFPLLGK1ICRRN

EHPVDfDYFCLSDIGRVRAANEDFWOVNWSQWAIAOGVCCRfl;CDIA50E11VTSLNELSLEPLTNRTIDS4KWAOKY
RPDLPKNNLOYL.RpVYCFAEt4pAHMLDDt.'VIUOIVFTSLI

IDEOQSKLNGYGDDpYKETL10CILLEYNCWYEEK%4EEHf.Oh~'1'fLSFIOFRI~tAWLCDLPPQOVLDLLOOSYN
PKVFxNPFCKYKCOPLVDIPKSYFENLEF~CJ1LOKPEIdIDZKJI

FHVCOSRIYRIROGELRRLTEDHSLf140LKNRYGLPKOSDKVYSYRHILTNVIGSAiYVNAIALLMOPT

PDIANLPCEKEDLYCLCSDGLTMIVPDZDIRDIIafOPATLEER~I71LISLaNI'RIiG'Od~fA

TWLVRIO CPt~0411 155794 156609 CTZ67 hypothetical protein CPn_0398 115518 15700 RHQSRYSSITSTDNILTAAFSPCPNDIFLFRSFLfmPOFRPLLNOVTIADILTI1F1'LU.O

No robust homolog pnstmc in Casebank/ENaI.RRLSLNKFISMLFPLVSDYYNLN09CM'LCYNSCPIVLSLDPECSLDrtaTPCtOfi'1'JWA
as of 11/7/98 IEELPFtQIENSSILFAEWFOCWPiFSVISAPWFLPCxTLIPKEKVTIIVPSOWStSLSOLCKLYYPKJUCLIPMPY~f ILSIIILOCINOCGALINEERFSYDLOLTLRADIGI

p FPLPL~CWIAKYVPM711vDJ1LTAALRKSLZCSLKDPITaGAKAVEYSKNIQAIIVIfUt!'I

CfYIIM~I>!'OLSIITDKKJILIHQ.WlI7INyC(:pY1' CPtt _ CPet_0112 156515 457216 Cf253 hypothetical Drocein YKLGIViIiGKSLNCFSIDLItSKNFPIU1RIFCKISNLA'NIMtKNLVLLASLGLLSpTLSSCT363 hypotheCiul protein rTHLC71SGS7MpKLYTSS~S ZSKATYAS
EPZS'lIIKPfNYLKfGKKi.YICSCRI~HIVNfPKKZLCNADY%ISPLI~TpIN~t EKVFLIKfMASP~FYAPIANRLPETIfEOFLPAEPIVATB.LEOK'IGKF~IGYDSVI'LYSYAC1'DYHLDLYIVIIV
IICSTAVWaLpSYCpAYTDYDWINPGF11CRCSPEIII~OCIf ASVRVRVIDIRHNKZALIYQEIIF7CSOPLTTLVNDYfCtYfIJFISKNFDSTPIGtJbiSRLFRTIDCIANLTfD!'P
PVLSFaPPYIFDALPDSLPKSSLV'tSPVLYftYalfOfIFKII~YA

IASOA71ENNIPCSFLKITSDYTYPGDCPFSRLEEVS01(LTQT~.YE<i.PIGJIfhIJIIPItKLL

LPCP ' CPtL0400 416537 417306 C1'251 hypothetical protein SKS~ISKPILLt.SIGVMtaSKNFFIWPAPSC1(TPL1QRQVLFGG11LLVFSSLVALSVSSO
TABLLS1111CISLAFAFLFYLLFLPKDZTRAILFSCERWXTSWR7IfGSJIIRIMIIIIPV
'1\7LICINIISKFL?LVLPTOEZH1'QEYl'pEVpNSLPI'~NYISNILNtL:VLTPF'CFiIIFFR
GILQl'FIJOJKNTAZMYtCSSIIFSFIHZENSLCSWVFVWLFVFSGSACFLYEI~fIIL
SPIALIGLFNLTSLZ.FLCIK
CPtIr0101 147881' 447195 GT~SS hypothecif:al protein NRDHAlSIQ.I4'IVRANVVECRCPWSiQOSLVSNVEHILCECOEFHEJ1VG.OGKTVOEVaSE
AO~IGTLVLILCFLLEAI7CVi.ivSED17J1HEAFlEfILRRAAPYIFAEDYKPVSIEERDRIJfEL
AIGOtBI~ES'f mutt-lldenine Glyeosylase NPIDtFCNTKI11FSEKAIOJFI?VEAL<CKWFEIOVIfASLPWRDNhfPYSVWVSEYFILpOTRJLEVCP(L-.011! 160103 159172 VIDYFI~t~RFPTIESL71MKEF~IfIKLwF7CUYYSRARHLL7lRMIl~EFIIDKIPDDaeU-AeCOA
CarboJtylase/TransEerase Alpha aISLRQI1~VCPYIIIHAILAFAFKRMMVDCNVLAVLSRIFLZ>:fSIDt.ESl'R'IyRIBRILCLRIVCIID'IILF
IRGENIWELLPN4CQVVEYEKJ1IAEFKEIQIIDLNSLL658LIOID.~I

AQAt.LpNKSPEVIAEALIEIGACI
FVLPVRNAAKKVRLWfiJCEKIYSDLTPWltAII0IC1WPSRPRTVNYIpGlICCEFVELCGOR1'FRDOpAWCDF

IFLNRLVAIVLYt7GSLWEIDtRPKB~41AGLYEFPYZEVEPEDCLQDIDDF'fI~IBLSLESVKZOOORI~.IGOBKG
CDTJ15RWRNlCNLCP~FRKALRLCKLAEKf~CLPVYILVDTPG

'PLEFLGFR.KRHAf'CIMKVHLCPIIFKATSLPOFGEWLLSDZDHU1FSSCNKICIKDJ1LAYPGLTAEERCOfWAIA
iDILFCLSRL11TPVIIWICEGCSGCAtGNAVG0SYA14.lNEYY

LIYtGOVRSRESIGV
SVIBPGGGASIWKDP100'1SF~1ASM..%MICENLKOFCIIDTVIKEPIOCAHHDPALVYSN

VRIFZIQEWLRLKDLIIItELLEKRYEKFRSIGLYEtTSESGPEJ1 CPrL010) 119009 419710 yeeC-predicted pseudouridine syhchecaseCPt>_0115 161522 460221 family NFNpL&NOKRMi.OYFME4F'SWLiLTpVSRLSSFLRSOLPNISKOEILtISIRONRCRVNCFCT266 hypocheclcal protein IERFP.SYKVOPCDRVSLSLIPST100pPSILWEDDYSIIYEIIPPHLTTEQNAHNTRF!'CVFtSOIGFL.PCLTLIF
YIIIVWCNAFLIKLCVINCLOSRLQHCIEVSONSNfOSOVKOFIYAC

RLWOGTSCCLWGKSKOMTELF~LFKQRKINKOYIAFVFCNPKKKFGTVKSYTAPVYAAODKTLROSVLKZFRYNPLLKI
HDIARAVYLLMALEEGEDLGLSFLtiVOpYPSCI1VELFSG

C~vAVIFCAAGPSOGEPtKSAYIfWDCI,IVILLSE4f5TfDLIOJSLPRSSAL55lIL.TPGCFPWIfCLPYPAEHAE
FGLLLLQIAEFYEESOAYVSIOISHFQpAL!'DNOGSVFPSW90E

NSRLLKEKTTLSOSFLFOLCIIOIHPE'ISLEDPALCFWlpRTRSSSANM9CGpS8IG11Y

CPn_0104 150967 449871 SSC09GVIAYCPCSCDISDCYYFCCCCIAKEtyCpKSHpITEISFLTSTCKPHPMPDC15 No robust homolo9 Dresehe in Cenebenk/EMBLYLROSYVHLPIRCKITISDKOYRVHMLAFrITSAMfPSIFCKCNNCQWDDPRLASCSLD
as of 11/7/98 ELEALCOKYCKAVLLIALSELCID'MSLLSCNALEGFPPIAEVNAACDRCSMDFCEILKSSY10CPCNDINILGENDAI
NIVSISPYMEIF7lLpCKEKFWNADFLINIPYK6OGVlILIFEK

QSMDWADMSCVDCLIADPFWSTAIASGIAKSSLQETEP'ECESKVN1.~SSWCEQGAQVCKVTSEXCRFFTKIW

SPFNLERICMSFPSLKVFSLK10JGCENMGIOLariSCWJLWSIFFVATNGCS1'PIWTTKE

NIJIALVIILVLSHYOCYFVPA1'CDPORCNIIIOJPEI1NAILAAGNCNRVDLERKRCCESSSSCPn_0116 Iu1871 161557 RYLELWtCFENSLTKTSLISDAFaAfpERDKCLLONSTSLI~sfrACWWRPPVPTPSGVThim0/lhfA-Ihce<trotion Hosc Faecor Alpha AfiPOPOPOPVVfSOPSGLGaRERSPVSSRCRFPt'VLPLSVISPRSHPCAVERRDLEDEEEFJ1LSNNATMTKKKLtS
TISODHKIHPt81VR1VI0NFLDK.YTDALVKCDRLEIRDfGVf.QV

EVI~ VERKPKVCRNPKNMVPIH
IPARRAVKFTPGKPNKRLIETPlIKHS

CPn_0105 151814 450960 CPn_0417 463017 4ti~~1 CT105 trypochetl.cal protein amiA-N-Acatylmuramoyl Alantne Amidasa Ntf,TfSHSRVLLI(KFSKEF'fIRTYRSLCFTDYLCu~LTNPLCKFPSPONPOWTIApSSITREKCJIKLTKYLNTKO
LRSNISRLFVRY.iLFNSKOLSFFAL~VIGSNPIFAOTPNPPpRVR

PpAVS~uuAWGFLO'fOGAASSTATTTTASCASAi.rL5p0pVOALLTNLLNYGOP$VOQPSTR:uEVIFIDFCFKxK0 0CTAS%ELHYEEKSLTI:
IJ1LTNQ.>ILKPMfiYKPpLTR330VYV0 ACl"'aCA :.~.S.SA~IQQpLLOLI
LDKTTCSCCSSVSSEOLOOLLSLVSOItT?SOCCSOCfOII:KRVAL~,NRCQc:OVF IS
IHCNHSSNAAJIIrCTEVYFYN:I~ICSPTRNRMSEVLGK(iILAA

aCpMSVLL.NLLSATCSAAANPI.CfAAsLAQIIYMVTSPCAK1ITSEfCYNYCCE1'CpGNMEKNCtLKSRCLKTJWF
WIRDTSMPAVLVETf:FISNSFERAAWDARYRMHVAKCIAEf:

~'C(:P1'~CPDCOCCCCCFCRFFCCVWttNCCCLCEC:.OEPAIPLVHNFt-:r;PFOKPKONtAK fRKPOIOild .

<'f9~ 010,, A519b0 4529~s '. Fn-IIIIu .In4111I 16.51 t.rbl&uJyl-ACYI-t:arr(er 1rorein murk rf M:,'rylnnm.umfylnl.~nYL)lutamyl R.ylu~cose DAF Liau:r tX:FfILKIDL'fr:KVANAc:ICD0f7CYl:WCIfAKLLr\L;V7J1TIIVCI54VFIYKIFSO.S~iELCKMIJIJ( ht..IJK:V~/(KIYt:KVRFLEVRNLTROSRCV3VCDiFTNIKrIJPYDt:NIA:AVIL~1.ANC

F'NP':RKf::Nf~fLLEIAKIYCNtM.:FD::FFDVPECIAENKRYKf:ITt:FTi?E'IAfQVKKDFAf\tA::::I
.YNI'Ff::l7VpIfTf'Nf.F.ELFJ,ELSAKYYEYP.~.3FLHTfr:IfCTNiF7~I~fCLI

:IItDLLVIL:LAN::1'Rf::K::Lt.ET.~.RKCYIriAL;.L:::T::N::LL::I1F.:::l!?IRtX:iTI::L
TKALI.U:Y!':KI':a:1.laa'(IvJII4;FJf7VtFfX:ITrtTPALV~FYLATM/It4NRIM'PMGW:.iI

YlJI::MNAVI'tTIW
1X.I::::AKAALE.:DTKTfr\WEh:RFW,tRVNTf:'.w:F4l::RAC7KAICFf':IJI:a:I'.VA'i'ITJI'I
rpAVI:ITIITIJJ111.DFlKTFT'NMItAYLF::LV4f':iteMVIffrpSPYA

F:HFIVDY'ItJFSIAFII'FrVINAI~tV(:AVMFLAiFLI::AITf:ETI.WDIiUANVFK:I(:PF?1FPK::y't F':AKAI'Vf'Plv:ll'_'..\.1V'IPATDfuL:'.:::a.TKYTL'llr:p(jKIA':::::::FIr:KYNVYNL

!>:: LAA I:. I'VIfA;:1 Ja ~I,LF:IH.1.1:K
Ir:IJ.'r,Ff'It :RLDPVLFIfiN:F'll IU'IAIrfFItAIJaiVl.'117L

I IF:1 J ~19~Y 7:It4I W F: ; s Y
.I ntl,k::Y.kYLMA~ WERYCFA'/'lf::ONII::H
FhF:L l'/HP tt:Or:

~'Im 11111'! .1.l~li'f .IV2H5H
I"/::NFI'/F'I6I11<KyAI'I'YAI::111::UFLIVI.IAC:KCHIiAYrJlFKIIrjP/AF'INnYJI~AFYLA

IIAU ::"Wrt.lmfly hV'It'nl.ru./Idw.:ph.ft.u:~::/V

.:KHGPS I::KARTf.NtR.GL

CPn U4L'1 466997 464A~6 r ~
1 ~en ~ Aii~
IIa Z81 ~7~
~9La ~
~w~
~
e t SYRK~IVtGIffALYIILi.VLRYYKIOICEDtMtWAAE.
3VPYPN eswnc FPIIINIwtItPOKKV l"r-QL e c f p .
/
fo No cbuse ho .ILGOiIEFCVROPFRRGTFFAIA'IVRKCDKDLQOPFAVDITKFNtGADPLAIpECHRI~IIKtf4ll'LFKYVPRSR
~IPDTLTFLKRYS.TJLLHSEN:LSYRIPAKYInIIiw?SIJ1VAPAt?

r>.ILpPIDGG"'Y'..~t~LKL~fKSIIYCKLIPLLDVSVIIDRLSLWWKGYATKNRLPZN71LFFLFSCE'L'.iJL
RLCAL'fIGIAL1ICVL:.TIVVYCIA::KIA".'A::KKPPSISRIEIV

ITOYQRSYPF.'KL1JDOVLItTLREIKDCKTGKAFP'fGQttiAYFIB(IL6GOVCERKLLJISPL
4749\7 47 ;514 NRt.D4TIRVIKLPKOt:.'.OiYLTiNPVIt7t'IAI~ELtiRGVL.FaKA~00GRLILINSCICEIG1 epn_Oil_ . .. -..
.., !':a~:: ... ..,rr::,_.::~7f!'i.'7::I"::'1 ~.-=i a:.::YYf .':," :.:.\:IEEw:;.Y..'.:, . ..:: m, ~ . . .. . , . .
~

W .C . , . ..... _ ..i.~...:, ~y:W-.
..Ir : :: ,'-GS~f:.vR:: ~~.
.I'tvrr:rt'i: :'' ;:Ii':::::.%:.::1t:..:~a174\t.:
.. . G' :: ., 1.
' ' ' VAWYQQKLLAL:IPCRlCTCIfLpSFJISGLVPSPNRl7IIt~.SLESrSLS'TPYSLANGYNIU1:a:dvWh ::.,:ML
i F 1.;N:aLr.H I wi'AHla .;LFLIi:aAPLvi..: Lit trlA:?.:

GIpMVOAYAILANCCIIAVRPTLVKKIVSASCEEYHLPTKIDfIRL.FSEEITAEWRAHRFI' TLpOCSGPRASP%HNSSACKl'Ci'fC101IHf'KaCPtA_0433 4773.7 476929 YDKRRHIASFIGfTPVESSP~NFPPLVIG. ' ygIODpEYCLRAOCIKNYNOCRCAAPIFSRyADRTGLyLOII.ppKKLpNCp=p~,AAt~DlLystem N Protein OesN-Glycine Cleavage a YEEAtJRSPKpOGTR
RTFRILYGTLIR'lCSitKV1811YSDYHVWILPVNERWRt~:LTEKL4pKNLCAILNVDL1SVG

SL.CK>'~.EVLVILESSKSAIEVLSPVSCEV:DINLDLVDNPQKINEAPEGEtfIiLilWRt.DQ

CPci_0120 167120 166124 ~P~~~

CT271 hypOChatical protein KSFPtdJNSRFLRLCCCLCFCGSLFYFIfINKONSLTKLRLEIPCLSVRLROLEQOIIISLRFCPeL0134 179471 LIDKIEAP1EiAALPEYQYLEYPSEESISLLSYELPCT213 hypothetical Drocein RPMfRIYOpDLPCRLCRDPAWFFSLLSFTLRFYCLGRGWTLLSFIYtOpKKFICIVIAW

CPn_0121 46A007 167108 CIfSGICVWCRFSRKCSAE'~TSRRI~fPT:ASGIWYVEKDFNAIBUtFPIItIGYPfI'~iPRA

yabC-P8P2B Family mechylcransferaseWNfINIIGLL?DYFLZTRVGOG.FLKVYNPGFJCFSKEKAYpPYRitPOIIpPISS6iVNIt SS

EILNSERAHIPVLVEECi.Ai.FAQRPPt7fFRJ7VTLCAOGNAYAFLF~iYPSLTtYDGSDRDLAPpLLEILKVlOOI
ENPISKFTiFLARAKLFLLERRFPHYVLROfIZ.IYRRQMF11LPPDiAL

QAIJ1IAFJfRLtTFQDRVSFSNIISFEDLANOPLPItLYDGYtaDIGNSStpLDI'LSRGPSFQSRQ~LRLFCY01'I
OOWICOJIYLSAAVSLLIRFIDEpKKVLPRPSKOFaRDDIYdtaKNA

GEKEELDHRI~O'fOELSASIriR.NSLKEffFi.GRIFRE1IGEEPpWKSAAKAVHIFRRIfOCILYT1IISK?MEPS
IJGFCEISTfSYFQFLEISESEFFt'ItYRDILLCKMLti.IQOGVEFDtOPL

SIODVI(F~ILLGVFPIiYRFNRKINPLTLIFOALRVYVNGtDROLKSLLTSAISWi.APOGRL1TFFVGGKDSIQVEF
PRLPKE1ISFKTKQELKAFCVYLIQ,VSLpKSDBt~VPNEILPIRTI

yIISFCSSEDRPVKWPFKEAEASGIGKVITIDCVIOPTYQEVRRNPRSRSAKZ.RCEEK1LS0KAK6PRLVCRRFSIDY
KRVIILODW1TVPNVEVLNYppNSENFQEILOp!'PDVCfCOSYK

p!'pICJCPALRDKISLITRKEILMRPERIL4SL.QpVPK09pEVLLSAGIDdSALPCISOCQ

CPn_0122 161233 1617A1 OLIIKYLLANiYLDLYSODACrYY'CIIVNSSFCKEEVLPYREVWtDIJISOLLTSN01Q.VD

CT273 hypothetical protein 1~RTRY~~~~'~~FSWSL~LKTIER~.

GLANVEIFNYSTSIYEQH715Ht4RIVSt~'RICEIO!!~'.ISIRDVAIDSApILtIOIPKPSALTSPORDRIFSIIt VCOYSSVINSPNDGPCYYOCLSttLLYDRPASV~CL.FIaKSOLDEiLiGS

L:.pTNpKSNWACFSPPNNFYKQRFSTPYLAPSLGSPDOpDEDIDCISSFLIiVLTRG1~'SYYIaRFIEQCWR

RSOITPFLSYKDKEtEF~EDPEi~DPRVOQGKVLLGLDL.fvKSTNVMIDYVISRIFO

gypG CPtL0135 110908 479175 Plfospttolipase D superfamily (uncleavable leader peptitlel CPn_0423 1617A8 169216 GYtI~'RLRFRLMLGIFFILLVPNSVSJNt'1'IVIISIIItOtVCVLVYDNSVp~IiOpILDCIDII

CT271 hypothetical Protein ANPYVCLCPClIIGGRTLKllNDIft.EANI'QfLVPBICSYIIIQPTFTpiIEttLLKAL1~RN

CHLDNEWKAIL~WGDOELEELRISG1ISP'LitQCHYSKAILPFEALVILDPLSIYDNpTLGCPNRPPYV!'ICCPPST

LYLQIGENSOALiWt.DQ7ILRNQCONLPTLISJIITKALPCLCRIEE71TAIATYLSSCPIPAINPRLIVSCVRRpLi i!'RDODIIG.RSTAfGt.OLRECID!<GOPAIWD~YYA101AiPItAG

ANDAFJILIJISYSKATiDD~tIUILVR
ACPPLTLZ'3aAEETVfPGFDIOtEDLVLVDSSKIRIVLCGPItDICpPNPVYOEYLKLICGiIRS

SVIC.iIlOIYFIPKDELi1071LVDVSlI9d~rVIiLSLITNCCIIELSPAITOPYAtIQIItDnPALL

CPJ>_0421 469528 170961 YGI~IfP4WlUd11C61t41CPY~tVSIYEFAIWC1'QIJIKI~NIIDatIPYIGSYtII~miIF

dnaA-Replication Initiation IaeeorCYL4IWIESPIttfAAItAIDfVR'iKDIGLSIPVSNGDIFbwYFHSVNNTLCNL~.TrMPA

SRCHEIFSPSLIK111VDCIWLSFIMCE901LTCNFLNYVKTRCSKTAFHrWISP

IOVLElTQEKIRLEVPH1 ALCf'WAGDDCPSAPVCPIlr0131 I1i33 110902 ASI IEGPSNOpVK871J1VGLAGKPGRSYNPL 1p1J1-Lipoace Proeein LSQUe-Like Protein FIIiOGVCIGKI7fL(JIJIV~WiYVRt:NNNKNLRIIK:IITilIFIN~VYNLItSKSVDKNI0~1FYRSpYVC1llI
KVRIVDfQKSSAASNIWtDRDi.LESLQOGELILHLYt04D1PCSLTYQWl411mt Lt)i.LLVDDI0FL01110NFEEEFCN1'FCfLTt~.~OIVITSIrifPPSOIiC.SiGtIIARI~IIGFLL.S4YAOI
.GLDA11VRP1COGIVPIDOODYAFSVtIISATHPSYSSSVLA~tylflVIISPVAKV

LVAHVCIPDL1C1'RVAILOHKAEpIGLLIPN~IAFIfIADItIYCNVRQL~GAIl'iIG.TIIYCRt.LEIVFRIGQf ZaPI;~iSSSRDSGIiPCNIUITSKYDVLFGDIDCIGr3AAQRKVOpGPIJpGS

FCKSLTE1911RETLKELFRSPTIC~ISVEI'ILKS1fA111FONlCLNDt.KGNSRSKDLVWtQLFL90SSSETYORF
LKP6YLEIIn0I0IHAFFPLCLEA71DEVL,pFJIRQOVKiA/II~IC

IAMYLiINTLITDSLVAIG71APGItTNSTVLY71CKTILt0~l4DlcfLKRpVM.CKNNIVCC~.t.

CPn_0125 170965 171561 CPU,-0437 41110 11350 CT271 hypothetical Dreceins elpC-ClpC Protease FRGCPtffRRTCIIGPFEDVOTLYEEETSSPSSYSPYSRSERPETPPSLFdJPKASE7IRpLNlfplinKPTNRAKQVI
KWDt~IQRLNNNYLGTFJtILLCLLK1.00GVAVNV1~IL4I0PDT

HNLTJ'.R-SSLPpWSSTPRTESLLPLEEPETTLGEDVTFKCEWIIILRI3.RIDCI'FflGILVSK1IROEVICRLIGYGpEIQVYG
OPAL1CRVKlCSFESANBEASLLEIOJYVCTOILLLGILNI~D

GKIIIGPKGSMUDIOLOEAIIEGW6CNITVSCI(VELRGGAIIKGDIOANTLCVDDGVRSVALOVL~ILIfI~RLVRKC
ILKELETFNLOLPPSSSSSSSSSRSNPSSSK6Pi~liStGS

ILGYIrIIACI'1'DItSERZGtDL
DKIIGC.SAIJUIYGYDL?EiNRISKLDPVICRSSEVERLILILCRRRKNNPVLIQ6AGYCK

TAIVdriL710KIILti1VP011LRKlWLITLDL71f1tIAGTKYRCQFEERIKAV10E11RK1A.11I

CPn_0126 172111 171536 LLFIDELNTIVC'.iIWAOGAIMSNILKPAL71RCEI0CIGATTIDEYRbIIEKOMLtiRRF

CT277 similarity QKIVVJtPPSVDLTIBILRGLKKKYECNNNVFITEEALKAAATLSIlQYV11G1lFLPOKAIDL

NVLFSLLFPKLCYGCOAPGAYFCSNCLEKLLVEDREGRCLNCFRYLCSSETRLCSOCSPSLDIfaGARVRVNlIDQPTD
IJOa.EAEIENTKtJIICEQAIGTOEYPXA71GLR~EKKiRERLQ

SQLQAP'SLYLPSOTALSVYARACEQCRPALOFFSKSIAFtG.ASt.DI:TPSCIAYITSTISRSHK~?llKl:<EIIQ
VPVt>EGVAOVVSGOTCZPS71RLTEABSFJG,hKLfp'ILRpKYIOpII

KIWEVAKLEKLLRIPLWPWLPKKRQIEKLPKGEGICFL511YpL~KWMQTIVGGSASPLtlilVT8ICRAIItRSRIGI
KDPNIIPICSFLFtI3PL~\/GKSL~LIIQQIIIIEHF0GA~11LI0~

VSISLFLSQNDQ
SEYI~IfFIUITKM~aSPPGYV~tL~HLTEOVRRAPYCWLFDEIE1WIPDIJ4r0IL

OQGRLTDSFGRKVDTRHAI II4li'SNLGADLIRKSGEIGFGLKSHH9YKVIOEIfIAtAIOtK

CPn_0127 472157 173715 .
HLKPEPINRLD6SVIFRPLEKBSLSEIIHLEINKLDSRW041fpNAtJtIP06VISFLVT10C

nqr2-NAI7H (Ubl quinonel Dehydros:alase NSP~4DMPLRRVIEOYLEDPL
'~YR~'OFJIRKLRATLVFNRVAFEREEEOpEiUIL

aVCYVFERVEASTFLSITHLKKFINSLWKLCpQ0KY0Rf'TPIVDAIDCFCYEPIETPSKPPSMiLPS

PFIRDSVOVKRWIM(.WIALFPATPVAIWNBGLpSIVYSSCNWIJIEOFLtII~3FGSYLS

tvYKEIHIVPILWEGLKIFIPLLTISYVVOLTCI:YLf'AVVRCNKIAGOLLV'l'GILYPLTLCPn_0139 PPTIPYWNAAIGIrIFGIWSKELFCGTGMNIWpALSGRAFLFFTFPAIOt~DVWVCSNPyebF-PF-loop supetEamily ATPasa GVIKI>,SLt41001SSTCKVLIDGFSQSTCLOTLNSTPPSVKRLHVDAIAAI~021tIPHVPirODNLTLPNPP_OVR
EINOOlYIVANSOCVDSSWAYLPKKFTNY1CVIGLFIODafEEDSDOCLC

'JIHSQPSIIrII'ETHPGWVLDNLTLTOLOTFVTAPVAI':OGLGLLPTQFDSAYAITDVIYCIGSSTKDY1CLNERV
CLpLDIPYY1VSFAK6YRERV!'ARFLKEYSLCYTPNPDIi.CNRCIKFD

KFSACNLFwK:NIIGSLGETSTFACLLGAIFLIV1GIASWRTNAAPCICaFLTGWLFKFISLIAKKV:ELCGDYLATGH
YCRLIfI'ELOE'IQLLRGCDPQKDOSIFLSCTPKSALfONLFPL

'ILIVCQNG/1WAPARFFIPAYROLFLGGLrIt~GLV!'NATDPVSSPTItIILCKWIYCFFICFHTCE191KR'F1RJ
IIAApAALPTAEKKOSTGICFIGKRPFKEFLEKFLPNKIGtNIDND~I'KEIV

IVIRLINPAYPBGVMLAILL.GNVFAPLIDYFAVRKYRKtiGV~HOGAN'f'fTICORRCLDLGGSdIPCYVKiINfIE
ENSIYIVRCEOHPpLYLRELTARtIli WFTPPKTCNCSAKVRYiISPDEACTIDYSSCDEVIfVRFSOPVKAVTPOOTIAFY0C0DCL

~Pn_0129 173719 .171481 ~~.lIILVpNIPSEC

-nqr3-NAat (Ubiquinonel OI(idoraduecase.
Cenma-NMSicC;SXHiIIRINQTWYIVSFILCLSLFAGVLLSTI(Yl7t.SPIOEQAATFDRNKpNLLACPn_04:>
IA5523 I8ti077 ARIL.7FKGRFOIOEKKEWVPATFDKKTQLLEYATKKVSEVSYPELELYAERF~IRPLLTDJ1tM rotma hamolog present in Genebank/NBL
as of 11!7/98 QGIfVFSFEEKNWPIEpFEKYQESppCCOSPLP!'YYILEN'I'SRTEHNSCADV11KDLStIrQIiSSNttf.'ILFV
SSTLNCVFPSSLPEESADLFITNKEIVAiGEXCNVFLTHSIPlOIL~UIIT

ALIFPI~t~GLYJGPIHGYLGVKNOCD'IIJLfTAWYGO,'ETPCLCANITNPEWDEQFYGKKILLVIVALA:IAIICL
GCYSCSILLIAVCIVLTLLTLLCipALVGFIKFLROLP00t.IftTf FLpDS.~CT'INFATTDIGLWKCSVRTTLCp$PKrILSAIDCISGATLTCNCVTFJ1YVOSLQFIREK.IRPEs"SLQL
YTNAVRKTTQDTLKLYEELCDL.iOKEFKLQSTLYQKRFEL$INOfC

Av.YROLLINF~tIt.THEKKTCE ' K'fNON

::Pn 942'1 4711:611 .1'571 CFn_0119 INfi9Ht 491:74D

mlr4NADH IllhiquilKmel Rullwtase tM rot'::c llaaalol tr.tctnf in J rayusAink/EMLL nr. of II:7/7R

KRNPFMfwKK::YK::YFFDI'LW.~.WJpILIr\ILCIC;.ALAVT'tTWPAIThK7IA'J.~.IV'~':C::WTIM:
IKMAT::VAP;:fNPF_:::PL:IIATEVIlILf?iAIITQMiPIPMW1ETPR5KLS1'IIN

::FFV::I.LRKFTPtr.'VRtIITQLIIt:LFYIVIDQFLKAFppDL::KTt.VFIK:LtITM'.tVN'fl.c.'FA:
:::.:.LTIt7GTt.'..1~7YY:YTf~IWIIty:Ir:II:fIVLTt.ILALLLAiM.KNKQTIl'KL

~~Ita:LARI1VTPIPAFt.DC:FIISCLt:'riaiVLLVtt:JIFELFY:FI'ftJ4c:FRtIFQFVYA::t:fIUEi :a~:.I:SIC:x:FV(IRYI:U4F.~.t'tY.aVIILtELT1'r~EKTRIUIEtEAKK&:IONLEL

11f'fl:IVil(.::It4Vl ~P!:AFFL4:INIWLI'NIRD:.IfYPYRYfTFI:/i::YIJ1~~K0PKRK:::'.('t:.~.FMP:'.IKIIt::Y.N
F'JILFf%:

.'I~m_nA 111 .I-/512! .t~ln.1)m r:lal 114 4: I>:na7n .IH'/H SN

IHIf: t1At111 (lll.i.llllll<IIIQI T1111'1 :.%tYlfINef 1.:.11 t'fOCbln IkIIL:C.1!:1, 5 t'MwIt:ArIW(lNFr:ILWAAFtONILLWFtt7tiewYt.M:::TRV::fAN:taat::VALVLTVT1!WktU.::4F
KW.t'111Mt'lu:llVl_Tf11'IVyldUy:IIlt:liArJ:HD1'1hIF::A~llnyfl.KVllO

:::(IIWt'VIL\t't'PaKAI:IW(::P:L.A:iVNLt:FLELIiFIWtMFI'Qtt.IiLLLEKV::RNLYAKt'KKL
:fYrl'tI:YRVYt7ITFL('.TLII'rt:II:Y:Lt.Y::Tr:Ytt:AI~IVWKt:::Ltl:a''1'pptOLt:

t::ll:It't.lLtAVIVt:AIItY:VLF(:ITR::YPFIf'M4IF::Il:/VtXV,l~ll~Il.AfYif.ATIKEKLA
1'WATPVL'.'~FYNYVLI_:Lt:AYTL::LKtIWI.4ra If:7a.VIl1KtItF74t:Yt:LYU:YL:x:KYOAT

'tltlYttly;Nra::Fl't'ha.f.WAFN::L'tt7(DI::Kf.':AKIVItAit.t.-fEl/VtllM'N('LKC:~t:Itt::A:F":/TNETt:L110RKTYlII:NSV::YKA'flirt:rtdY:lYl~/tIF::
IGYR:.T::VI:Ntax.AY

:ltKf:F'frXI::hHDISLADDN r:FYAISPNLI~T..~it.,.~ttE
t:R.~YNAD:..':7ltF~F
ft' JVY
' .
.
l:hFICAt NL'PRFRKKLtIYIIHLt~::PI;IFF:f NNNKT:iIITFKT.':AFFTYI::AVIdIF
0llg -~ ~[' 9MA7t X a5(~136M1 p .
Pn _ _ :
~
y7t7Cy9>t.: 'lt0teet Pf01lM w.,l...__ '('~_OId: 4N77h1 1RH5ZN FLOPSRREIHEWK
r...GS.SLRII~LSPlOOPEOCHFDVVC:.FLIIPEuLTMRSMI~R

;.TUUe nvo~chemc.ll Pcol9ln IVYCONRWEDAAIP1JLIKKp'I'IJICLIlf'fDGElIIKYSwDlDIMiCIFICVDIIRA~PE
ARKLKNCAKSYPRTAL1IEVLVSSVt&AL 'IPSPSOltHlJttlAPHLKNTRKFY
tIf:KQr'~L'fLtI.IFPERLL ItKTlEKCNAKAKC

. .
NILFOTt7MClKt tCVYLKDKISVSKHPFIENlEF
LHILTIA'J:ICLVFSLVFI LDLRAPSWf:JDSHdILOEI:
HLASYAIMIft W
SC

. .
O CRL
KvtLtFCA..~TfMLTLPLAALFtIAIKTK PTNOELIDDIVFYfPOVICGLYAAGCRNLOLDDCA
'1T::/'fLFriMKNLFPPYEPPP:iRPHTPPPl710E'NPLISESYFD.._ ...,y_...
.".,.. fi y ,__.....
n , _.-.....
.
.

PPPWFtSG:Li.M. , .
.
, ;r .
~
....
..a.~l.;t..
. ' . :.. ; F w\"" ....-.::',::.......;
;;.:.:.:a:.' .' . . . ,. ...,.
. :.-a_;
... ..;i~:

1~'; i .,; . I;. . WF~ECIN1RK:E'at:''~'r:AFlihE:AKtI',Y,;
cai P=oter,n . .
c ':OOSCnacJ

.
. 507231 505330 VDsIISOPPINPLCOPOVPM~PSrOpsIVKRLKTSSiCLFKRFITIPDKYPKJOtYVYDT

GIIALAAIAILSILLTA,iGNSWt:IALAPAi.AI&ALCVTLLISDILDSPKAKKICIJIITACPIL.0449 LO tlrana-shift vitA 01511 palp LWPIIW1IAAGLIAGAFVJ1SSG114LVlANPMPVMOLL1VCLYtNSLNKLTLDYfRREN_ _ EI\Y'IGFR~ODISFSNNIVQLTI'X~O~IS1LAAGECSLSAEACDITlI~i7IIVAZTPQ

LLRMEKKTOETAEPILVTPSAI>DAItKIAVEKKKDLSASARt~~(TJL9DAQD~AAILL~IL~TTKIWSIDICSI'AI
!:11~.AAISCHSIPIYDPITJ1NI' NPEHRRSFGSLSRIKTKPSD71ASTRPJ15ISPPfI~DIIDPYNlI~LRS$SFVLKI1GVTLDTKGFIQTAGSSVIIDi IO
TAGfa T
TLK
PVIL
' AQGSIFYSSR .
GSGASSAFTPIMPASSRSPNlSiCfVIJtPEPVYPKGGKEPSIPRVSSSSRRSPR~KOS
JADfIL
O

IVFSC6ItLSEDE71K
TTLK7f$TCEYPLT'-'r'SIPVDSLCiflGKKVVIAASAASKNV~PIGLLDIIOGN71YAI~L

OOOONODEEO1CQQSKKK~KSNOSLKTPPPOGKSTANLSPSNPFSOCYOERCKRKHR10ACCKTODlSIVOLSALGCAT
TTDVPAVPNATPTltYCIIOOTWQfIwVODI'JLS't'PXTKTAtt.A

9llMCYLPNlLItOGPLVPNSIiiCSF8DI0AI0GYItRSALTLCSIJRGfwAAGVANIf~

CPn_0144 90365 191507 IOfGZKRKYAMK$GClxI0GAi101'CSCM.ISFAPCQLPCSdfDFLVAIWITd?'1tJ10RlYIO

pmp_6-POlymorphlc Outer Membrane HITLCSGFI~KLPGSWSHKPLVLCGOWYSNVSNDLxfKYTAYPiVKClN
Proce>,n PNL4LPI0Y
SLPWLLTSSALVFSLHPLMMtiTDLSSSi#.tYENDSSGSAAFTAKETSDJ1SY
R
D
F
K
I
OIt fDOSNL

KAFPORHFO(Y S
CTTYTLTSDVSITNVSAITPADKa~CP'TM'CCALSFVGADNSLVIpI'IALTHOG71J1IIM'N.
I
SE
C
E
S
O
M!<GAS11H8YPE1LI~~~IXLM'T
CItRAG6N
RNDPKCTT
ILVISCA
d~ILiW
OtIt LSlSGlSSLLIDSAPATCTSGGKGAICVTNTDGGTATF'fDNASVII~KNCSDfDGAAV.
r T
i KFEKISDCZIDISYDL?LSYVPDLI
NVDLOGKF
!
' TA ~OFJlIYRGSSRIY
SAYSIDLAK'1'1TMLLDCfITSTIO'1GCAL.CSTAN'1'NOCNSCi'V'flSSNfATDKGGGZYSKO
YAPSPNFP.VIL

EKDSTLttilt~lfGVV1'lKSNTAKTCCAWSSDONLALTGNTQVLF0~1KT1GSAA9AN'LPECC
SOAll1 5071A0 GGAIGCYLATATDICfCLAISQNOE?ISF1SNITTANGGItIYAT10C1'LOCMTLTIDGf~IITCPeLOSO
TGP 10PolymorFhic Outes Meabrane Procaia ' PAD

1~NTNLL.FSGNKA _ ACCGCAIYTETEDFSLKGSTCNCISTNTIUtTC~LYSI~SSLSGIMtSQISWLVLSSTLACFTSCSTYF1111TAENIG
PSDSIDDSTNIrrTYTPIC'f!'iTGIDY

SNS&ANOEGtGGAILAFIDSGSVSOK1GLSIAtJ~tOEVSLTSNAATV80GAIYATKCTLTGONLCDSMLTISCI'SDT
IESLSlAGI4r~tSLSPLliIKSfAEGA7ILSVlTOXHi.
' TL1GDITi fN .
NGSLTFDCiPI'AGTSGOAIYTETEDtTLTGSTGTS?ISTl~T1'111ITOOU.YSIt~INSGS~SLIGlSSLTFWtPS
SVI1TPSGKGAYIOCGCOLTFDNNC1'ILFICODIfCEEp10071ISTIC~Ii.

LLFSGNIfATGPSNSSANpF.CCCGAZLSFLESASVSTKKCLWIEQIAJIfSL6GNTATVSOGSL10181IGSISIaiO
tSSATGKXi0G11ICATGTVDIT'tIKTAPrLlSllIIJ1EA710GAIt1$lClt ' ' ' 71G1tIxT
Nl'SLVISDtNfATAGNGG71LSGDiIDVTISGNOSV'1'lSGNQAVANOGJ1IY7110Q.T
IITlSTNS CTI7C
J11~
AIYATKCALHGNfIT.TFDGM'AETACG71IYTETED!'rLTCS't'Oi NGNfSPTKNIGLVFSGNSATATATTfTDOIIL~ISESDIATKSLTLT1~SLSF>

INM'J1KRSGGGIYAPKCVISGSESINIDGNTAf:SCCAIY51GCSITIWGWSP1!$1SCCLRSOOOOVSPFL?I

KGGtIlYIADSGELSLEiIIDGDITFSCNRJITt~TSI'PNSI1QGAGi~ttITKLAAAPGNTIYt' YDPITNIJIPASGGTIEELVINWVKAIVPPPQP~'.PIAsITPVVWJIpANPNIGTIVISgCPJL0451 11FMDJ1C1'TLt:T 10 IFralne-shift: with 01511 pm0L

GKt.PSOW15IPAN1T1'ILNOKINLaGf~NVLK~ITLpVIfSFIGOP031_ BiEGSPYDNPGL .
lt1'ORV!<IKILDSCIVItNi.IYLICIYIDANSSIJOiKSITM1C1'SIPWV4VSSVIJIlBCIR.Q

TTlt~TfDOSII8.101LSVFK.DALDGKRMIT1AVNSTSOGLKISGDLKIf?SL71NEELLSPDDSF7i~tIDSGTIT
PKTS)1TTYSL1'GOVFFYEPGITal'PLSDRC!'KdtTDN
LVAWO
' LVPKVGIIGGKVT
LTFLCiiGNSLTFGFIDAClt1A0J1MS'1'TAN101LTfSClSLLS!'DSSISTIYrT00t3'1'LSS
KAtdin.PFLDLSSTSGTVNLDDFNPIPSSNAAPDYGYOGSWLS'1'IAOOtIIJI'1TJIOA
WITLVPNSLWNAYVNI1t5i00EIATANSDAPSNPDIWIGOIGNtIlIi00KQN
AtGYTPKPEt . Si KtTtAGFRLISRGYIVGGSM:TPOEYTIAVAFSOLFCILSKDYWSDIKSOVYAGSIC710SSAOGVNLlliIRKLVII~
ISTADOGAIKC71SPLLTCTSGDA4T5l 7AICM'KA1DSP
i C

YVIPLHSSLRRHVLSKVLPELPGEIPLVLt(GQVSYGRNNtOd~fl'tIQ.A!>Nl'QGKStRfDBHSJ111RTP

RTJIt~tI;DYVRF1.SNIA&TSOGiIIDOf7GTSILSNNKFLYFEti ELIISNNKTL1FASNVAC1'SOOAIN7FKKLJILSSOGF?EFLANNVSSATII~WISIIfASC

FAVEVGCSLPVDLNYRYLTSYSPYVKIaWSVNOKGEOEVAADPRIIDASIQ.VNVSIPhaELSLSJ1E1'GfIITFVRI
fCLTI'ICS1'D'1'P1QN71INIGSWCBCITCLII~IITIlIYDTTI'SE

LTFKHESAKPPSALLLT'w~YAVDAYR~iPHCLTSLTNGTSWS'fFATNLSRQAlIAEASGHGTSSOVLKIM'~SAL'J
I(~tPY00TILF9GLTLTItOQJIVA~i.K88l1pPViL100KIi.LO
' RYSF
ImV't'LESTSPS01JIGSLtGItDSCrlLSTTAGSTI'ITNf.GINVDSI~L.ICQPV8LTA1~A8N
LKLLIiCLOCIASGSCELRSSSRSYNANCG'1 KVIVSGKLNLIDI~tIYESI$tPBImOLlSI3*ITVOIIDVD'lIiVDISSLIPVPII~IIISE

CPn,-015 YG!'ODO'~L~T~ATA15r1'ILTGIVPSPERKSALVC?11'LwCVITDIRSLOOLV

pmp_7-polyarorphie Outer Ma4brane EIG711~tNOGIWfCS$!ItNILNKTGDH4P1GCFAIiTSOCYVIGGSiUftPKDI7GlTPAICN
Protein tIBESAIEKIPREIPGiILD
FNlLVSKIICLOMKSSVSWLFPSSIPLFSSLSIVAAEVTLDSSNNSYDGSNCiT!'lvlr~n't DMAG?TYSLLS11V5lQNAGALG1PL71&GCFLEAGGDLTPOaIQHAL%FAFINA05871GTW
' LIRRD~CFIALO'~AT~'l~SNTLOPONYLRLG
VOVSlSHSI~CIItYTSLPES' ix.SI110fiIGL~.PIYLSNPNPLIRITIl011I1tC

ONISSD
MnVSOftSFIESSSDGRGlSIGRLLHISIWG71KPVQGDIGDS7fI'YD~OlIVaWYRI$1 VASTSAAf>a(ItLLFNDFSRLSIISCPSLLLSPfGOCALKSVGNL&L1GNSOIIF1tJD
NGGVIHTIOJlLLSC1S01ASFSRNOILF1G1~GVVYATv~TITIINSPGIVSPSONLiIKGS

GGALYSTINCSI?DNlQVIFDCNSAWIN~QAOGGaICC:TfDKTtIfLT~ILSITN~APOSTItTLVNSPDSTiKIROG
tdSROAlLLRGSNNYVYIISNCGJGHYAI~
' LT7~CilISCLKVS1SAGGPTLFOSNISGSSAGOCicGG7IINIASAGEIaLSATSCDITIrHI'ttKI.R!
VG

NQVITiCSTSTRNAINI IDTAKVTSI1W1TGOSIYFYDPITNPG'17N1S'1'OrWLrtLADANS

' CPt>

rlKDLTDS _ EIEYCGAIVpSCEIGS~1IAANVTSTIROPAVL.7IWGpLYt~tDt:YNDmP 12-POlymorJhic Oueer Mrnbrane ID'1'l9DS Protein Itruneatadl TFaADtDIISLSGTIU

.
f'NEEI'KrILPNlLTCSALlLU.PAMQVVYLNESDQYNG11INNKSGEPRITCYP~'ISYI
RILNDGGTTLSAKEIINLSLNGLAVNLSSLOCTNKAALK
FYF1JIOJLKSAS?YPLLELTTAGANCrITLGALSTLTL0EPE1'HYG7t0DNNOLS>ttANIITSS

KIGSINWfNTGYIPSPERKSNLPIiJSt.WGNFIDIRSINOLI>~l'ICSSCEPPERELWLSGIAfLDDVRISNVIWD0 8DJIGVF I$~ICIyIZT.T
' AFTSJ1PLLP00DGaIYStGSVMICJSEIYI'FCGNYSSIiSGSi171IY1'PYLfaSKJ1 LSNf3Yi GKNNGDTYG .
NFtfYRDSMPTAHGFRH15GG1fALCITATTPAEDOLTIAFCOLPARDRNH11SCCIRYL~FRONVISOG7fCCA1ST1 0iLTLITAGPSClC~NtAYNDIItiSl10CJ1IAI
' 6RPSVM

SYLHTDFAIMICCYYTDtJ .
ASLYFNtfIEGLlDIANFL4JGK11TRAPWVLSEISOlIPLSlDAIC!APOGSISISVKSGDLI!'tKiNl'ASOI7t~
f1'ItIZISIHIQ~"IIQlIDJ4RJIVfE6DVYlYDPISH

3IIKGsSNRNDJ1FCJIDi.GASLPIVtSVPYLLItEVEPEYXWYIYAtWODIYFJISEIifIKITDLVINAPECKETY
tJG'rISFSCLCLDDtIEYCAENLTSTILQOV'1'L7100TLSLSD

KSELINVEIPIGVTICRDSKSI(CfYDLTLMYILDAYRtINPKCOTSLIASDANWMAYGTNGVTIALNSFItOGSSTLT
MSPGT1'LLCSCDIIRVpNLttILIED'I'DN!'VPVRIRAmKWILV

tJvRpGFSVRAUitflpVNPIiMEIFGOIAFEVRS6SRNYNfNLOSKFCFSLCKLKVAFFJIYwSVYDFPOIKEAFfIP
LLELLGPSFDSLLIGEI'fl.tRl~V1'1'EIIDAVR

CPn_0116 497602 500415 GFWSISWtEEYPPSLDKaRRITPI'IOn'HFI'TwNPEITSTP

pmp_8-Polymorphie Outer Membrane Protein CPn_0153 513156 516152 LIEPIOtLSHKIPLHKLLISSTLVTPILLSIATYGADASLSPTDSFDGAGCSTF'fPKSTAD

ANCTNYVLSGNVYINDAGKCTALTG:CFTE'i'1'CDLTFfGbGYSFSIIffVOAGSNAGAAASpnp_13 -POlymorphic Oucer Membrane Protein ' YASGKSTLSSACAIiJLTt%IGTILPSOtJVSN6ANNNGfLAPCFASTAl7YEVIMPSENF
TTADI(ALTFTGISNLSFIMPGT NCVLLYLFFYSLSLI:RIIWFHLYVOht(TSIRKFLISi .
DDSSGKIFPYTfLSDPRGTLGI1SGDLYlANLONAISRTSSSCISNRAGAWILG100GVF
CGAlYSSAAASISGN1'GOLVIMINKCi~.'tr'iGGAL
SISGNI'SSITFTSNSAIOC1 ~
A1TTK:'L

.
SFWIRSSAOGMIsSVITONPELCPLSFSGPSOMIlDNCESL?SDTSIIir~JVIPHASAIY
. ' .
.
CFFJ1SSSITa'ISSLFFSCN1'ATDaAGIttxAIYCEKTCETPTLTISCNKSLTPADiSSVI'0 CGAICAHGLDLSAACPTLFSNNRCCNTAAGKCGJ1IAIADSCSLSLSANOGDITFL.GKtLTGSM
ATTPMLFINNDSILFOYNRSdOFCMIRGTSITI>~ffKKSLLINfirIGSI~ICGAL1 ' STSAP'CSTRNAIYLGSSAKITNLRAAQGpSIYFYDPtASNIIGASOVLTINOPDSNSPLDf INLINNSAPVIlSTt411'GIYOGAIYLTGGSMLTSCNLSCYLFVNNSSItS06AIYANONV
' ' ' KOPLAWSCTLALKCNVELDNNGF?OTEGSTLL PPATPPP1GVSLTIS
'fSGTI'/PSCt7tLSADEAtWIDNFISIt dIf FSNNSOLTlOFIlfnlaPONSLPAPTPPPTPPAVTPLLCYCf ' . JtCKGCJ1IAIPESCELSLS71NQG
MOP~fKLKAOTEA1SLTKLWDLSALEGNKSVSIETAGANK1'ITLTSPLVFODSSCNFYECENSVTPLENIAS,OGALY
GKKISIDSNKSTIFII(~fl SHTINpAFfOPLWF?MTAASDIYIDALLTSPVpTPEPNYCYQGHWEATWADTSTAKSCDILFNKHISITSC'.'i""IW
SIHFCKDAKFA?IiGAtpCYTLYFYDPITSDDLSAASAMTW
"
' iONtSIY00RGLtAASGTANPF L1LM~ATLJiIMN
TNTYflITCYNPNPERRACWPDSLWASFTDIRTL00IFlft VNPKA.~sADGAYSC:.'_VFSGETLTATEMTPANATSTII~OKLEL~L
' . fND7ILT
NKt)IISGTh'QAFRNKSItCYIVGGSAEDFSENIFSVAFCOLFGRDKDLFIVENTSNNYLASLFTODEKSWINDA.."
.TLATTNDAtPII'DGAITLNKLVINLDSLDCTKAAVVNIIpS
' 'fLOHAAFtJ'GLPMPSFCSLTLN.ILKDIPLLLNAOL.~>Y~fTKItDND'fRYTSYPEIIOGSWfNNNCYOpSIYCI
fO
I~GTGGLVNNSOIt:7HHGMlNADWpVPILELKATSNTVZTfDISLGI
NP
~

SLALYLPKEAPFF~.'1'FPFLKFOAVYSRQONFKESC:AEattAFDOCDLVNCSIA5AADCED
~
CTWEFTII7lTffftl':GtJNKKTGYLPHPERLAPLIPNSLWJ1NVIDLRAY60 CALELCt' ' . ILINTYTRITPDAALSIGPCQLITIfSKDYL
. CKOLSI1GITNF!!!?.NtfCCDMSYANMOCG
.
PVCIRLEKI~EDEKNNFEISLAY:ODVYRKNPRSRT3LMV:.f:ASWCSLCKNLAROAF'WS

SGEA>YELRG=AHIYNVDCGLRY3F
VGHrHSNVYFAR~.:NITKSLFC.iSRFFSCCfSRVTYSRSNEKVKTSIRKLPKDRCSWSN
~>IHiVEL ' IC
iHLTL

. PECRIFGHGHLLNVA
.
~NL~ELECNLPI'.LSSRILNLKOIIPFVIU1EVAYATWiGIOt:M
.
.

rn VPUrVRFGKN3HNR?DFYTITVA'fAPDVYRNNPDCDTfLPINGJITiiIfSICNNLTRSTfi.V
r 447 SU05.11 503351 ~ OA.iSHTa~VNDVLE:FGHCGCDIAttTSROYTLDIG3KLRF
, tyap_r-Ftlymorpnu outer Membrane Procein F'JKPP IAL'fMKSS W IWFL IESSCaLPLSLNFSAFMWEt:III3I'fNSFSG(~TY'f PPAOT CFn 0154 510179 51911'.
T

TtJAIJ:ff'fNLTf:DV:itTNAi7St'TALTASf:FKETTtaIL:F~NUYOFLLOtJIDAGANCTFl4 W7lymc:.hlr ~utsr MeaICrane ~ PtOteln ' m sNIM~CAW P_ YFf~NF P
NT.WIYt.L::F::f:F:.'YL:'Ltv"l'fNATlC'K:AIIL."ff'.Af:SIOCtIY~.Lt:uA:CAFAETRLIX't IP/PPITIY)GEEILLT::DFVI'.:xJFLf'JL;F
' CMiL:FK::~::F't:L.l .".:t:::.:a.tIPNLTFAKNKATUKA:dL'f.~.ff:taTttAITt.N::A.:f:tSNTAAMICCIIIYTEAS.
'LNFIfJRAIT:.fX3At .':.~.Fltf::al4:L:":f:::LSLTFTCCOAfrfN.~.tr/ALLSMETLTFKNF::::INFT(xJO.~.7ytL
:
:
CNta C
' :
T

' A'f :
A
' C~/LTL
~
~

.
r:CLf'f'fYDIVt'9::IC.~LIFTtNAVAY3PA.TIfTATPAITII/Ttr:A:.At.~~ITIk:I.TVEtJI
.
.
:
.
.
FIMC
V
:
, , AIY
:~:T.
t Y.
::Ft.
.:alY:,I:
a':ALJ.x:DITFEX',ttIWXCAS
l'KNN::AIDTAAPLJ:f'.AIAtAL
'.~::L
::::IXa"rt '/t'fitJL'/L

.
:a)::IYFF(:NIJVJF.::AI::::.FTAWKPiNNTATFI.iF3HtIFTSSGf;I:VIYW
. :::::I.LlIJHC
.
;I:TYALt:::tY:AICIPnI'FELKtd'IOI:IG'PP::YNt:TPNN1G
. :L
':::nJl1TlIJ::INI~.TIfNAKIIILR~::;)r'dlI'tYFYOFI'i'f.~.ITML::DAIJJIlY;PDL.ALTIP
Att ':i VTf' ;
' nf:f:'fF:w:F*L:a'J1F414~\ONLF"PlrxltLTI.NYJAt:LY.:F:VT't.VAK:F~O::fr:.~.TLLr%f.

y .
v:
:
:
IIFTI1N:
LLD::NfMRH(X:AI~:AY'fLNIV:Ix7PtEF::RNRAHKtX:AIFt~:C
'NIVraI~\:~:
AI'fAETr :41rN71-rLFTAIta'fINNLVINV~:'.LKI.-fKNA'fl*.1'frJA::r/~PPI.::::L::LVDP:~Y7NVYED.
'/::WMII~'JY::LL'rI:rAIN~I'ANlllt'IDLrIAUPLF:KtIFIIIW:IrJ:NWAL::WOECffATK.~.KM.

::Vr:f~PAK(rr:.Tt.T::.rI::FY_a'l\FTf~IMLNfYi~:IRNAIT/EMIIIEIV.'II~SA(JtXi:RL'fF
Y
"

'fL'tWrwl;tNl'FlIHPf!r7rt.VANTI.Wt:::FVIriR::I(/~L%ATY'/Rn:I~RTR(aYX:Ef:L:iNFFf ELLLPAtrITfILItTVKtA:7.E
DfITII.~.LifiT.~.1sKK3lTfN.VJ~:A:77::WFT.~.Y~:L.S
' ' TYItIK.:F'ItIII::f4:YVVr:ATPrI
I::DtJLI'1:1AF'r:fjLh:Y.UI1UIIFINKNRA.iAYAA.iLf:AFMVDFTV;KIr\FOM
IIKIx :.ILYRD
l*ITIAIAWfrJII:F.A'~f:::ywLTII::xYTPII:LATP1 ' ' .. fK
(JIUA'rl:::a~::I.1.uYl.n::a:::E:~~PVLI'nAVt::yl'f.':YNTMKTY1"f~LtPKI:E:::I~fYNO
t:CIt:YIfl:E
f11 F'V:;A:~/IW:fYfJW::.r:AL1'LUt:111MPIN.YfB/t.W~I~/AIPIAVFKn:ATV
' ' ' ' . fLVR:1 tnrl:lL:aUr:LF'HAYFI'FIKVF:A::YtilrrL::FKF:Ptri-CL.VK::FD.rI:DLINV~rVPt nl It.LPE
F:ll~:::l TW::RtLI.IFAPUX:FFT:F.:P..AtffLYAVNN::h IA'ft..ll'lt:Y~f:KW:a ' . :7:k .
HYr:(a'/::N::LWI::F':.aY~AF::UIV)INLLIIAlI~:4itTAICAWAWF7rfM!~~:11FI:F
In:t'fF'I:II'::IeNP:VA::Ylu1'fVIS'V:\INYftKPll~l>t~I'ff.t.t.Itlttl":WKTit.'rtJL
.:R(MLiIfsAA

Yrt:YL'AAL::NtfITDHTTII:L::f'V'OLICKTNAHF ....~LI.3FF~;0FPLYIiOKSEA ::Pn (IIA4 S v ',i-i~~~
..I:1YKAAW:'f::KNHLNT'I'ILTIPDKAPK.Spt~WNNNSYYVLLiAEHPPLMrCLLTRPLAOA No roDusc nomoloa Dr:aemt :w:'.rw~dmK EMII;. n: ..
WDL;X:FI3At7~'LOWO~KfTIT'IDLQRSPSRGK.YNVSLPLCC35pWITPF'K1GPSTLTI
HRL~RFTE~ILtRI.IS1M9L~~MILR ~l~f~CKINN1~~
KLAYKPD I YRVNPfIN f',ff VI/:pJDEST3 LiGANLRRHCLFVOINOWDLTECIOAFIi'IYTF
UI:KNCPTNIIItV.'.'".' :LK.: v F
IRCGPF3EDAVPE.iEPFDL.i:'lVIfCDRSI:PGPTKKRS::,i~",C."yE;,PESIYPOSEP'.LM
RPRMLS
'I'n_0455 520 )63 517158 Na mrntsc rtnm"loV Dresant tn GlIteWnk/ENBL5~=all as of 11/7/98 5 -Z~a =~~ 015 .
'ar n .
..;_n...,., ...I,..;;..r;.-.Lc--v:'~:. .
--E.:;~rL:r.~.' .
. ._.. .
~ .
...
' "
;
' ~ , .. r. . :.;r: .1;:
.:_. . :::u':':~ ;..., ;~.~ ~-:. P,.6EAACA': .~. _ ::u:t' .;LF::F~:... ::.l~r:~nl~ .R.rl.ylJ..:.
:.?NFAL :
:
~
' ' ' ' ~"'KWAFUDEHLPWVi:iHIAYAEEIREKOEQ'IFIIXI'.ILTEEOIVA:.i.CMYSTE1WWF.
.''\ :li ::m .,;;i:.ilk.:kVEDILKRQR'...iLEi Ul,~t :.~.i:iv'v:.'!':';:.i.'wi.:.,ii.:i:i:l'.fi~:il.

. RDCCKVHCDLPSAPFF
aLaAVIKOSVNRFRNPDLFAYERCALfJISVTDALVSYVSNLDIfIPYTSSOGIVI~SSIV

K.., CPn_0466 pmp,15-POlymorphic Outtr Mlnrorane Proelin TSIOtFPCf'CIGl.PfTPIrt.ANEGL4LPL.E!'YITLSPEYO~AAPpVCF!'IO~pDtJIIV~IW
;:Pn _ DFILDYKYYRSNGCALTCI~.LISF21ICNVFFEIOIVCPNSGGAiYAApNCTI$XtrpWllAF
No roousc t>omolo0 Drtttnt in CentWnk/E7tBL"' as of 11/7/98 ' IPCTFES%RKFIaffHCLIK:WFSWRHNFVOAFNFSRPLYSRITHPAt.CVIKAIPIVGHLV30CCFV1u1LJ1iIiKt w,ALYTETtQiIl~1 TTNLVSDNPTATJYCSLiGGALFAINCSI?NNG

HGVDNLISHCF6RGVSNPGFPSDIJ1PILKVEKIAGRDtiISRiA'IDLKSLRKTIEtIEDLDKKGPIIIIfQNMLi4S
DSLGGS.LYSO~iSil'IIFGYSG71IOT:SNSIfSTQZLT=SSN

VEICOYpENPYA0M715SEYLKLDK,NiIVSEL.ciKAFSRVRNRITRSYSYAPTPOLDSIaIVGKKLIEIStc)SAFA
NNYGSNFHPGOOCLTTTtTTILMiRtGVLFN~BpSQSI~GtIINBKSI
' tLLVSPEEOENLVRLANEYIOLYPKSKTTLYLLIDFDRaWVGDISSO~OLRSLGLHSELATRGGAL:liLS7YCSGNCS
FILSAONGDIITl4~IVfASIDiAWIPYRN
LIKIIBiWYFLNN
' ' 'IiCCL.SyLEPpGADGEDTKHFDt.IfVOCYGKDSYLREGKILOQAiiGTSLG1YPWDlJPMNTLPAIH$TPNlOJI4 IGARPCYRVLFYDPIFitELPSSFPILFNFE
fGtnG
NLFSGAMR~IIPt ' SRYRSRLSLPINTEIfDICfELYKEISRTHNOLHTIJCMCLGAODSGLLLDRORLW1PL$QGSDI?MFPSYLRNISELR
OGVLaVEDGI1GLACY1CFFOROC.
LliirpGaYITTA4TIP11SST

i.FSAtVDIIKNISKKEL.REVSINFANDTSVECGCAFYFPTZITaSTITI2iHIAIDLPSI'SFQAOAPKIWIYPCK7 GSTYTEDSNPTITISCILTLRNS
HCHSYLADLTHELKI:

.
IBIEDPYDSLDLSNSLEKVPLLYIVDNAAQKINSSQLDLSTS~I9GOlYGYOCIWSfYWVE?

TTITNPTSLL.GWfKHKLLYANWSPLGYRPNPCRRGEFITNAIiASAY:ALiIGLJiSL6SW
=Pn _ DEEKOIiAASLOGIGiI'YHOK~OGFKCFRSIOnGItSA'I'fGTSSOSPNPSG:FAQFISKA
No roousc t>omaloQ present in Gentbsnk/Ct9L
as of 11/7/98 VFLPSRVMASCLSAwFSIVREHFYRAEDFSL?FC11RITEFVLGVIKGIPVVGHIIVGIEwKEHZ~NSTSSNHYFSOMC
IIZICLFKEWIRLSVSLAYMFfSENTIFIMYOCLLE>a~CSF

LVSRYLESPVfXPTFVSDWSLLKTEKVACRDIiIARWETL1CR0RVAVAPIftIF~KVIKiICIHNlft'L71GALSCV
FLPQPIIOfSLQIYPFITA)JIIRf~ILAAFQESOWtAREFSLiRtPL1'0V5 PVHPFOGIO~EVLTLYPEVODATf.GWFSKIRNRVRQAYL.OAFRPItLOKIYIICN~IPLP9CIRASW10~RIHRVPL
VWLTEISYRS1'i.YRODPELHSKLLI50C11~'fQIITWnINIL&

FEVDDFLNLARL.QJtTORLYPDATISLYLTASGGRNAMD)00RICi.SDCELNPKIACLDFIKVlaPl1'IOVFPKVT
L.SLDYS11DISS5TLSHYLNVASRIOtF

tIOCOWKQ11TCDCWHVYNGHDpCfLNOIOEELEILSGECfPNIHVCOKPLSOSLWDTSPP

SSLEMKCDKEItALCYSELtKEOLYSRLVYVGRSSVLSLCIGDSR~ILID)PIDIVNAPLS.CP(~0167 536528 ~HYCHSYL11DLENPGL4K1'IL7U1FI14PKELSS1'ILOPISLNLIWSKTYLRQH!'CFFER

MSRSDR1JVVVWCDSWWCfDWKI3:PSFOHFINLLDGRCYSNFNIFAFRSN6MM.ARIL

NFSSOEKAP'fElIFCEDSVSOGDIRCLHL715f:GMLCOXECYAVWY'fSCCANF1?~tVLTL

ERFSNLWNRKHGLWI(AEVRKpK0EA71LDODESEIYVCNpL?AOpNfACS

CPtI_0468 539608 50132 CPI~0459 527062 526619 prnp_17FOlYlno~hic Ouctr Mtmbrant Proclin No robust hdsolog pnaanc in Gtneberlk/D!lBLLIYKLLDNKLMIFYDKLYFHIMnMFMttPICLSILSTALCCSLSONEVPItL71SC01IS1110 as of 11/7/9A

STKIQMHPGLRNWRTSTNKLREE7CSVSFRtYFMYlICDKIVApICifLFTLDAVIKQAIhIRSAFNTSPSPRiaN1'P
EFLVSSFRP9NLL14GFOHDITODITITGNSI118VIDYlMIYfD00 SOEKU1LFYVESt3ALGREIKVSLEEYIOSMVICILGSOATKKSFKPSVDFTPLEOALQERCILiICKNLt'IS~Rt~I
LSFWJSSfISBGCALYSVRC~'IISi~NYSFISW1J1SLJ1Tl1'L'SO

SSDDI)EDATATSTA:..71TASPTIfIOtI~E
FOGivIHAL~DSYITNNL.CECQFLON11SKNRODAIYVGVSLSITDNLirPIVIKKNO'I'L.tDSS

FOOGIFCRAVNIERNYpNI0IN0NASGOGVVYFLP

CHy0460 527810 526992 No robust ttomolo0 present in fllntbar1k/003LCPIL0169 510399 541160 as of 11/7/98 VIQNLLNFALEETPSISVOYQCOEKLSPCDNSPEIGK>OCRWNKLESFSTYCSLFNSV107Hpmp_17-Polymorphic Ouilr Mlmbran Protein IFrm!-shift with YKIl~IGIONSLSGWLLDPYRVCAPLSSPYSCPSYLLDL~1KELARSLLSTFLDPIOiLTS60169) TFRSVSIHFGEISSFCORWSEELSRVLHDEKEItttVAVIfiElD7IKLL.EECGSPEALSLLCEDLCFA1'~iIFSAL
GVIISSNKEIIEISNNSASSINTASGKLYPOIxCDfCTSLVIt?ItiPIOG

RESGYSYIl'IILSVSPELIISIfV0ER0ILRRDLOGRSF'IIIMITDLPLGSEDIRSIAL71&~tILIFMIIfTAIIL
SGCAIHTRSFIFQEBiGPfAFINNSATSGGALINLSOIOSTPON!'ILBADY

LVSSSLDAADACASGCIfVLVYENPNASWJ10ELF3JFYKQVEAARCDILFNNfIfITSSSPOPCYRNALY11J1PGIN
LKLGAROGYKILFYDPIdIDOIZ'fDPIVFN

YEPtHiLCTYLFSCINVDSffATNPLNFLSKFSNSSRLERGVLAIEDRAAISCRTL.SQlGDI

,0161 528617 527811 LRLGNAALIRTKCPCSSINFNAIAINLPSILOSFJ1SAPKtWIYPI'LTDSTYSBD1'SSTIT
CPn _ LSGPLTFLNDFJJE7JPYDSLDLSEPRRDIPPPLPPRCDCKIOdNYPESHCRSHELR
No roousc hanolov prestnc in Genlbank/F318L
as of 11/7/98 ISIVACPSISSWt'NVRpHFVNAFDFTHPVCSRITNFALGIIKJ1IPVLCHIVlxIEWLIS

wIPRNTVRHCRIFTSt7VSSAIKVEOTRGHNCLAPLEAYL59LRVPISOEDLCKVfIGRTPEDCPn_0170 51357 ?FJDITP1'EIVOLLPDEEL57VDFJIIAGVRSRLTYAYRSVEKPMIODLALVCFCLRD.SADpmp_17-POlymorphic Outer Meiebrant Protein IFrelne-shift vlch LINtVRIJUJGVONHYPHTKVItLYIJIKNLAOVWDCEISEEEKCOLRALGLDPKIESISL'fS04701 ACLPSVPEVATVDFMITCYCKDQEVpDP
ISLHLERZSPLLYLLDVTAKKIDTSNLIVEN9JLDEHYCYOCIWSPYABIET'1'1'1'fSSTVP

F.pTTrlNHROLWDWfPVGYRPNPERHGEFIANTLWOSAYNALIGIRILPp~iLK0i0L'G

~Pn 0462 531121 529037 SGOCLGLLINOHNR)~GRKGFRNtII'IGYMTTSAKTAARHSFSLCFApMFBK'lRER08PST

tlo roousc homoloq present in Gentbank/EMBLTSSHNYFAGLRFDSLLFROFISTCLSLCYSYCDtIHMt.CNYTEILKGSSKAFPNNHTLVAS
as of 11/7/98 LIFYLFLNLYLACVRFHFCCZIFDPNACYISIWISTVICQNFtRAFDFTRPG'rSRITNFALCt.DCTFLPARITRTLE
L.OPFISAIALRCS0ASF0ErC0EtLRKFI1PKHPLTDISSPICFRSE

VIYJ.IPItGCVSIICVSWLVSTCSARRFCKPAFTSDVASIVKIEKTRL'YNPLAWVE0YLR0WKTSHNIPNLWCfEIS
YVPTLYRKNPt3~IFTTLLISNCTWRQATPVSYNS11AAKIKNT50 LRVRLPECDLCKIHCKVSRDYVCDRTPOENLNM/PHOYLGEL.GRAFYC1RNRVTKAYORVLFSRVTLSLDYSAOVSSS
TV~YLKJ1FSNC1'F

TPLEYPCLTLVGFDILDPEDpVNFVRLANGIpTQYPOTQIKLYLISIOKIYAi0COC1'I50 EKEQpLRSLCLDAKIKCVSAPAt.LLpKYLpSENLPSCDLLINYYCKOpSVROVDSIKSLLCPn_0171 5J2561 NL.iSEHIPAtSVTYRPDDPFYSYYFFPGSpOGTAPDORiPWSlOEHLQI'Y1TT.SNPRCDRpmp_11)-POlymorphr,c L'ucer Membrane Protein 'IAVHLGMEDFASC'JFLDPLRVSAPLSCEYSCPSYLLOLKSEELRCFLLSAFIDPNNSGOCTVONNRS4iKSSFFVtG
ALI4:KTTILLNATP~DYFDNpANOLTTLFPLIDTLTNfIPIfS

NPRPMSINFCNSPLCORWSEFLSRVLNDfTEIfHVAVNCMJPOLIKKSFPSHSLSLLHtELNRATLPCVRDCCNODIVL
DH~YJSIESWf'CNFSpOCGA4SCKS41ITNTKNOILFLNSFAI

EEJ;ISYIJ1IVSV~.,pERTCVKERRILSSDPSGRSFTVILTDLPEGSSDIRNL.OLASDRILKRACANYVNCNFDIS
ENHCSIIFSCNLSFPNASNFADTC'~GAVLCSKNVTISKNOCfAY

.".~AIJ7AAL1ACASECKILEYEDPEpEWA00YASFYRNIDRAGDLOROCIPCEPLCVSASTFINNKAKSSCCAIC'A
A)INIKDNTCPCLFFNNAACCTACCIILFAHACRI~HSOPIYFIN

RVJLEKD)VFNLNAVIOI'AMWKFKKRDLPAVESQJILCtIOMARALEGYICSSLLVDCTiOPNOSGI~.AIRVtIC'F
it'.ILTKIrCC:.JIFNEfJFAMEADISANNSSCCAtYCISCSIKDNPCIA

~~/u:NVNV.~.FATLDEAVCrIACDSAQOAPSEENM'DDAFDNNTAARDOC)AID'T0.~.LTIOD~':PWPTNNO?I
110CAIMLRODf'.ACTLPAOOCDIIFY

NNRHFIf flfFe:N11V~17JCTRM::LTN~.A.~.fXat.~.ATFYDP
L LQRYTIONR IOKFNPNPOILC

't'n X14..) x)24911 Slll?1 TILFS:TYIFDT:TVRDDFI;aIFRN111.LYNl:fL.ALEGtAES~IKWKFDOFpC1'L.RLA33M

le> cnlnra. hair'luit t,l:tsenc VP:TI't7l:F~':::::::a'.:::VINIh'NIAINLf.~.(U:NRVAPKLWIRPfI:,:::APY:80lNPIINL
in t:Pnadank/ENDL ai; ut L1/7/98 .::a~YEi!TP.til.l.l:fPNCRTPRVNI:;RK:IPIDETCtIAFV~MMK(X:VCTODAKELYTFL~.R:f:ffta.
t.pDE7JLDfYD'fADLrk;f(AEVPL.LYI.LDVTAK11INTONFYPPIfILM'1V11Yf(YQC

tlt7flnl~'.LWF::f.l:EE4:FLFDEKMLCAFf::EDH'l~tI~YLVDLVDt711LKDLLt.SIIFLDPOVw::fY
WIF'PITT:'.M'.':F.M~MIJIR'JLt::frll'Vft:YYVNf011cf.DlAL::AFYIaiPHNLP

rll::rva:f.IJCV::Itrin:U;:F::PLtjOKDFL:?IVLRDE'f~:KNWWFKI:VLGLPATOVCKLVEE:1'1'L
PYtYtt't>t::]LAI'fl:7:F.f"fet.F"/INJN::NNNAY.r:FIMFJrIY:Y.~.IJ'IT::HTA:aIII:IPf IM

Irc:YD'C:'/LNIF:X:IY:Lw:::hQf.LPRKELECT~:R7FRVhAL'ILt:DTOMRSWf.JIi6RINF:XJLF::N
I.YF::'.IL'.i'N::VA::If~~fIAUJttIIIIWI.()IxIY:.'C.:A.sI.AY::Y.''.NIIIIIKICII:Y:
XrK

'J.~.I!Pk'LLVfd\YAAh~:Kt.LKIDHTNWRP:I'F::RFIADFADAVDV::M:fNSREFKLI'fpAElpCIVfNC
:KC'.Y::'1'fG:.IrIL:Y:::4:IJ,WR::P.ILIIFffFIOAIAVRaNVfAf(7f::X:GeARKF:1111 I1.l:Ja:f.hLt~.:Y.TIWFX:ft.lF!.'DRVTVTRIIFILtILGAAIK0AVH1'fIKtIP::LIDKOCFJ1LDKI
'LYNL'MII'I~:IS'::1WF_':Kfl:f.f'1"IYrtIIKIrIYVtVf.Y~JJFIPEIIIV::LE_::Xl::arld:l .T1 LY'I'~J.'!.l:aV::'il.l:lV'1'N::IIExCT:'.KCfFtpl(EIIAff:.~.PLKCALFIGSDEDVPL'l'sE
DP.'if.AfHAIAfK,:uH~'It'IFIKL::Vfl.liY:J:::V::::::'(TfIlY1J1A4PfPKl' I~IAIP.~.I~IJ:U::

: a'i, o1'I L '..1 ' ' 19 '.1''',n C)~_0458 526314 521236 No robust homoloo oresenc in GentDSnk/t?BL as of 11/7/98 Nn r.arcn trlwoltxt pr~sant In r:enetCrl /fl48L es o: SI;W 99 FYFflA:xilr7::::n.Ln. CLPPKtR.;PSPKIiELCSHCISLPPpENCEEGASCSSHIHS

)3.~.FLPEDU~:OSS~aSAAS.~sPCfP;iRVRSCYCPALKSF~.~.AE$'f"..OARE3'RGAPVRLCpn 01A l .. L....S~' 1. ~. 5'~(L,1 " W . , pMSentN
No robust noACl~o4 s~f, f'wtneb inltL ~
fC;:~~5~;

Y~BNPSOCVPGTSSGPEPORLP$LPSVKKO .
.iKTITAOERROVDSSSAAATPJ1RVAEDAo ..
:iCTJr'RLVOTVRDRtVLPSGAPPTDSEpLSLYELNLRLS5LR0EI~iDIOSNDQLTPECKAE, .
t JCLRIECILMAT~VP4'CSJ:r:GEANSSNERFTERT::RMYYMLVL..'.A~.L:FIAIIIV:

:.TVfIQOLIOITEFOCCYMEATOSSVSLrIfrIRFKCVITSDEINSL.C3Irt.TDPELOCLlSOFPQVCWAWCrFAL
:,C:.:.:.:.LAtVPAV~.,GLVLI;KTLEPSREATPPEIVAVKE
.~.. CCL

':D.~.t4NLL0ETADDLFJ1ALSTft'RLSFSLDDNPTPIDNNPTLISOEEPIYEEIG~MOPORLGNEYWRSELIS:.
F:.R~~LH~:.:S~SIIDR.~.LC:CC::wZFI:;.KLEP:..r:....i.w.KKDMi TRFNWSTRLWNpIRE\L1I3:,;...~tIL.iILGSILHRLRIARHAAACAVGRCC'"CRGEELTSSINI:LHLVROWN
L:~.~IrPE'JTAHAEEL:.LFLiEE?YY3FCTLK:.:RYCMLC~A?SPI?I

'::!1 .~' . . .. '~!!,~'.'f:rl.l.... ~~...:. ... . .. . .-. . ,. ......
...M:W :::'eJl..tty:!':!.ITY!'.-:'~, . . : ~!ir"'~nr c...,.. .
. .:
~
' il': 4' .
ct ' .....~.. :.".111:x . . ...
.tr LV!.~::Hrrr:.-:,:c :~.s.-:,.
F. . 1..:::a:r :::fPw.'rnr-L.......
:;~. .. ':'.~I::de!:-:GDY6YpIT;iA.FP:iKDKNi'MCPRLATPALYDL6'wRFI:S.xSSR:iFSSLRVR9S:iPNRRG'lE:irlN~'NL
.iiN:.h'i:v~:.:':r:u~KH;,e';.i;'i.:.::~i:~inNy:Ni:y:LaKSE:LfaIE.iuFi,ia VPLPPVPSPAMSEECSIY$I7MSGA,SGACESDYECMSRSPSPRGDLDEPIYnWI'PEDNPFTLIEYPLSYL:GWA~..
I.'CVi?;fEi3LEC0ADY'IS:.'.QCLCS14I$OFASRi.t7SGQKt,'IatPR

ORNIDRILOERSGCJ1SASPVEPIYDEIPWItICRPPATLPRPEM'LTNVSLRV$PCFGPNDVLSEOMVMLVHGLMt7C
V3FOCLKALHIfLTAVPORMWL~uAi,PLfESfPVFNRIBfFfT.G

MALLSfSVSAVNVEAESIVPP'tEPCOGESEYLEPLOGLVATTKILGp100WPPGG9NAfSLCD

CPn_073 519602 518070 CPn_0482 561764 560961 No robust: hoteolog present in Oenebsnk/00LattJ-Asnsne Peripiasntse Bsnding as of 11/7/9B Protesn ~$IMAV~OCSRSPSPIPpNRRH56DGKVSPKDM&fJiTVS$$DSSLASOGPTIEUKANtrIYWICI'MIKOIGRfFRAF
IfIMPISLTSCESKICRNRIWit.'t'"N.
ATYPPFfYVOIIOC

OI~GTWCIPLPSVKEPGDSOTSORSGVLQRIWIO(11KEYVGFDID4.AKAISEKLCKOLEVRLFAFDALIIiIWOWRI
DAILAQISITpS110tICe'..~...r rfYfKiCI'pOARPCVSSPRLPSIiVOH

GORLpCLOCfRDRIOKRSENPEADLGKH><RSYSDGDLDRVOtiDSNEDBTEDSRSEOCEPSPYYCOEVOEI14WSKRS
LE':PJLPLTOYSSVAV
.p"~r~.FQEHYLLSOpCICVRSFL>BTi.LSt SKSSSPLSGVItCAVSKVHCJ1LGDIKDKFQRSASE~L'1'I'OOEDS11GDTVKaIR$EGEASIME11RYGKSPVAVL
EPSVGRtIVLKDfpNLVATRLELPPECWt7I~CGL,I11AKDRPECIG?:

SKSSSFLSGVRC71TSTV'pGJlL;.011KEKV$AFGEOMGAIRSAPGNIRTRIQRSSSDWLSOOAITDLKSEGVIQSL
TKkftFILSEVAYE

"NNKAAKIiLRKJILfNLEINApEQVSPEVJ1SRVOSLLARNE0LTN0EPP1YEDLITFVESN

VOSDSVEYASIVPOOGSOAPAET1(FJ1PETGCVLGSAJ1QGJ1WKATJIDfWSIfQAVASffRCPeL,0183 aIJISRLS$ARRESAVDDLASESNTQWFVEpEGVSNPSAAPSLSFAEEIARRAAOiSNRNANo mbusc hanolog present in GeAebank/EFIBL
as of 11:7'98 OSLEKLF~d1V't'DPVIQOCLGLrIitSFAPECOIILIKICRAIfaIIFPIPPPNCPPNNID'NFYHLTfDTIGDPLL
LRILRTIGYVLTJfIIT.GL

C~ 0171 551600 519807 CT365 hypothetical protein LKIIISISFHSTSPISNpPRYLSLSNATEKTSLLtINSRSLSPVPNSLVPSNPEDTCLRKS

IFTHSVTLFAGLWLLVAVSVWV71LTVLAPOVPOAILLGIAISCVOIOGFSIl0f5LVYN

TTL7LSFSIIYIYTKFFRSEKVAKG61Q.TEAETIKE71KKLHYISLSIATIGVCLAViGILIJ1 IAG112i.CG1IPATTAI IL1PPLISICLT1'VLQTILHSSIGKWRAPLLTOEIOIOLFVD'!SL

KDIRLEKZ.PPSEVEESEI'SOSVIEVPDSECIAE1'RIS7IEffIDTRLSLTTRQKYIFALATL

r r~~eIAAfIVrCFOGL7YMpVLLVASVG$AVAS\I1T.PIIVSSCfSYVAY0LItARiI4ISKL

RWKE7UfWOCRVROFLILxGVIASt$IEFNOIIWK'IYYIOCQIOKTDAAIREEVRNFLOmGLVN

SALVCGILL.CVC:R3IIQ3.ALVPAFApIVPGILALCCSTLCIAGSILT~BtTCCtVtiWLYDELVK

LYERRRIiRRELLYGpESKIGtSIATDLWEALA7LSt~HLIDLDGfVDFIDVDVDIDGMEKNOIOFLRJ1TFPNYQLIT
PJIILLDCEIESTPRNGyBIVfLTRI.NVCS1CGSP8$PT)1tS

DOf$KSFLIFGFtl-BIYPKLL4KKTPLJ1ARLDAfOREASHRFTOVKDIa.LLSLKYGFPL11T

CPn_0175 553850 551685 ATINOIfSRAROQLICNLL1DTIV'PAS17GFCRSG1ROSLIGYLHSLBSNELODILOmVlm071 glgB-Gluean Branching En~rme EANDVAAK1TVPLOPFAVCLINSDRD1YSEEFtIENPVJN4iCFt11CISPERDRRIFLIRPP

PSHVDKLIHPWDGDLLVSCRQKDPfIKLLtcILASEDSSDHIVITRPCAHIYAIrrrrnHMIYOCLLpRHPRTCOI~IS
KPDSSNP

aVAYRSCLFfLSVPKGICHCDYRVYlIQNCLL,7IHDpYIIPPPLWGEIDSFLFHR01'hIYRIYB

RNG(IIPNEYOCISGVLFVLWAPHAORVSWGDFIVIWI~LVNPLRKISDOGIIdCLFVpGLGCPn_0184 561931 BGIRYKWEIVT09raiVIV!<TpPYGKSFDPPPQt:1'ARVADSISYSWSOHRT~RRS70pS19Garo0-Deaxyhepconate Aldolase PVTIYMR.CSWOW00GRPLSYS
ITOIPLT4&SlJfrYpVTRSILKTOOLKSLVLHIVLILTF'1'YPLPRTLKOHPDI:'VFfrVpISPMS1G8ID8PILI
AGPC

G1I1D1Pf$RYGTLQEfpYFVDYLHKiNIGIILGWVPCHfWOiIFALASFDOEPLYEYTGHS.TLISYEH'IYSSALTV
IIFaGAQVP'RGSIRKPRTSPFSPOfR4CKECVIiiliK8J108IHOLPII

QALNpNWJI'FTFDYSRHEVTNfLIGSALFWLDKhBCIDCLRVOAVAS6Q.YRDYGREDCWITEVLOVADVEITAf7iV
DILRIGrIiO~IIHM'PI~d,OEVSKSHAPIIL~tSPMTLFJiWLt'.AIIC

PNIYGGKFNLESIEFLKNLNSVIHKBfSGVLTFA&fiSTAPPGVTImVDpOGLCPDIfKiBILYItaS8PSCPCNILCE
RGIRTfFJI51'RY"fLDLNfSIALLKEI$BLPVIVDPBNAilOItRiLV

Ci1ll~fFHllfIBLDpmRKYHOKDLTFSLWYAFOTSFILPLSHDEV111KTKGSLVfBC.PCDTLPL71511GLSVGA
DOiJIIIYHANPEKAt.GDAKpOIT?EELHLFAIDDIFCP$E8R71HAIB

WtRPAOIDtVLLSYQICLPGKKLLF?GGEFCQYGIWSPOItPLOWELI2tdBlYIOCfLRNCVSIt LN71LYIN0PYtidIpCRSQECFHWVDFHDI~7E1VIAYYRTAOSNRSSAiJ.CVIiHfSAS'1'FPCPnL0185 565993 5662=9 $WLRCF7GhWCELLLNfDDESFGGSGKCNRAwVCQDOGVAWCLDIB.PPLATVIYLVTCi381.1 hypothetical Drocein OPIORTPiRVfLWRFHIKOACKFYLLpCLLCALYWLLKYCRKLi.IOGTLIH$IITL,Ypui, $SLIDLLYOLICQLPAP1NE

CPn,-0476 551877 553858 CT865 hypochscical procesn CPeI-0186 56'799 566105 GRGRRADWCDCNIDIIIOHFRPYTMVPGpKLpIPGSLLYAOVFPTLWRLFSSKHEILNF7pThypotheeieal Droline paraease IAVOGPLIOtPAVFQDLHRGGI)1VT$EttYKYYLLPSGDClOSI100KLPSA710AGPLLSL1'ViAOIIRSLLKGNI
FHLGCGVLYE?U~1!'SLFLFPLIAIOGICLYVCRRGSKKVEORfSIffLRGR

HKHADWQNVRCRRDLKEILPLWFRFMM71PKCSYRDLETTaICSLVKTAfIORVLHRE1TESLKIFPLI~f'1'FIATO
IOOGVLLGAAEEAFCYCYGGILYPLGVALGLIfi.011CP010lWEG

IAPALLSIJILRGfSGCFLPRSYDEEFpGILPODCDPEGGVPFELLSYSPGMIODIFLRHpSLTTYVSIF111IfYGSK
KLRKIAFLLSAGSLFFILVAQVIALDRLf55fPFCKYV1'VJIAiI

COLVEILPALPPEfPCGRLIHVALPNILTLSIVWrKIITIRpVELHAEYSGEVFGKFCSSLVLASYtSTGGfRCWRTDV
I01GFLLIAVLVCGVSIhILSVPKSLSVLDPfOSLPC71KLSN

CSARLREWSERALSGSKRLSLGETLEIKAI:TTYLWDCFHKWIINPhLFNLVE0CMV0RCVAdSSpKRLAWMVGAGLVL
LLfNFIPLfLCStGAKACLIG

GCPLIDTIAYFCNPSLAAVNMAICVAILSTADSIl'TtAV50LIAEEIfpTWIPYYRYLVL

CPn_0477 556112 551844 GLJ1VMPLVAIGfTNIVDVLILSYSLSYCCLSVP~CfYLLAPKGRRVSGAAAfiJIGVLYCA

~yQeV_Bs Hypochecscal protein tGYGwVOIVSIJ~ELLAWVCSLVAFSFVGFIEITWIg4KVKi0T

RYMIVAEVKCTFKLVCLGCRVNpYEVpAYRDOL?ILCYQEVLDSEIPADLCIINfCAVIA

SAES$GRHAVROt.CRQt~IPTAHIVVTCCI.GESDKEPF)15LDROCTLV~11C~CSRLIEKIFSCPn_087 YD2TFPEFKIHSFEGKSRAfIKV00GCNSFCSYCIIPYt.I~ItSVSRPAEKILAEIAOWDCT381 hypothetical protein OGYR>;1IVIAGINVGDYCOGERSLASLIEOVDRIPGIERIRISSIDPDDITEDU1MITS5RR1CGISLTYSSFRWASF
RCYSLIFFCfCGSLFCSESLtY0LLI0DfAKVSEECIGLLES

R11'CCpSSHLVLOSGSN$ILKR~TIRKYSRGDFLDCVEI(FRASDPRYAPTI'DVIVOfPGESDKEYSLLOA1ILVLR
AL.apNSSFDI1Y1FRSFKKCOISYPELAHDROVLEEFCIWLR~IQ~IP

ODfEDTLRIIEDVGFIKVtiSFPFSJ1RRRT1UYTFDNQIPNOVIYERKICILAEVAKRV00KSVTVRAVSVIJ1ICLV
TDFRLVPLLLOSCNDDSAIVRSLAt.QVAVNYGSESLKKALVQ.AR

F1BIKRLGE1TEVLVEKV1GQVATGHSPYFENVSFPWCTVAINtLVSVRLDRVEEIxLIGNDDSINVRITAY0W1LLOI
EELLPFLRERAENIILYDSVERREAWKACLELS$0FL6TCV

Eri AKODIDOALFTCEVLANGMLPETTEIFTELLSVEHPEVpESLt.T.SALI1WSHOLQIpIKEFL

SKVRHVNCTSPFAINRFOA 11LLHLHCDPLGRDSLVDCLRSpOpLVCEAABMLC$IGIN

CPn_0179 557640 556210 GVpLNfEHLESL.iSRKMNIL.,'ILLLVSREDIERA~CDVIARYLSNPE?ICWAIEYFGWDiIQ

hfl%-CTP Binding Protein wNLACDTFPLYSDMINREIuKKLIRLLJ(VARYSQNtAVTATfLSGppA0Q4SfF9fiffllE

WHOGPLt)TIDTPCDpOSOSPY,7,tSLCARFDLPRKEGDP$pALAVASYQNKTDSQV'JEEHLDECDVKT3EDLVTDA
CF.IAKt cns: scuOKKDOASLORVSOLYNDSRWODKLAILESV11F

ELISLADSCGISVLETRSWILKTpSASTYINVCKLEEtEEILKEfPSICTLIIDEEI?PSSENLD11VPFLLDCCHHEA
P:LRSAAAGALFSIFK

OORNLtKRG,CLWLaRTELILEZFSSRALTAGNIQVOLApARYLLPRLKRLHK:HLSRQK

SCCCSCGFVKGEGEKOIELDRRHVRERIHKL.SApLKAVIKORAERRKVKSRRGIPTFALICPn_0499 57,1147 ~YTNSGK.a~TLLNLLTAADTYVEDKLFATLDPKTRKC11LPGGRNVLLTCNGFIRKLPHTLttitA-HIT Famlty Hydcatase VMFKSTLEMFIitpVLLHWDASIIpI.ALEHVpI'1'IfDLFQEL.KIEKPRIITVLNKVpRLPRKLPTCFAVNVTRSR
DHMR'FKOIIDGLIDCEI~IFENENFIAIKDRFPOAPVNLLIIPKIt rx;SIPMNt.RLLSPLPVLISAXTCEGIDNLLSLMTEIIOEKSLtM'LNFPYTEYCNFTELCPIPRF7DIPCDFJ4IIl IAFJr,:KTVOELAAEFt:fADGYRWtNNGJI~QpAVFNLHIHT.LGC

DACiWASSRYOEDFLWEAYLPKEIQKKFRpFISIYFPEDCGUDEGRGPVLFSSFGDRPIGALA

~.'Fn_0477 559431 55761ti CPn D49v 5'.IJl7 i')tlllr..

I>ttnP-Mtaal Drlperldenc Ilydrolaas~Z'79') hylxtthrrt t.:.rl prJthln AIC~IVHDtOSESICKLVFLCTC:NPE7CCPVPFCSCR\Y'ONT<;IHRLRSSVLIO'IQtIKTLVIRIVFAIFIdYF.
~.LV'fltEtAJt.FftI'/~aM)fFR::I':"IIDCCFIIN)EVTAf.'N.LItft%.VDI:?llfT

nN.pDFR1'pM,VAGV~ELDrNFLTHPHYDNICIaDLLRN'h'IV1'OR~LPLVf..:A..~TYRFLIR::RDPWL.~.
Kt:F.51\'(~n;lYaEIIKRFDIIIInY::YDt:::W::::NxIILIIYLKRtt:YNOCEE
' NKAKfYLFATPMIFa YIIF'IMffLVINYDE~hFI:RFF:'.Y.Er:F'r.'::F::D(fY.fYNPHEERETFCaJN)!X',ALtIPTIDF
~:LPAVLEFTTI.NEDCl70EEFU:IPYTW~YYQK~CfMr:FRFY7NL
' A
IJ:HLHYYFt)YDHth'IUaVHF:WETFx'Mt:LYF'LPItJIWNENFFFtJX:RKIIT'MF11P'.fP..~f lL'rDL(:::IDAKIF.':YLDNVKTLIL.:AI:F::ECrtFFIf:lili.'..~.IiL'IVFFJUUF)WHACIKN'D

Lf tTtlt::IK:'LFaEHD(tllrtIVTFAY0L71E1IL4ffL.
:~Wtl.Hr:IriNLINtIMIUY1':r~'~FJMN:LfE:KF:L::KV'X:IM.AVFf.'ItKnI.FId:IIWM1RF~.C

Vf4\L.kLTI.LUN.r: l 1 ~.'f~s114rr'1 .5'(375 Sr.H.:SU

~.'I'SHt ttyl>ttrMrri.:.)l t:Ptt 114'ltl :.')., hrr)LRIn '.'/ t 1 f ,1 ::Uh~tl:W:l:~:IIII;RFY::KKt9ppt:NtJ:::Lrt:Yl\:XaIIEEYKNRYF1'IOLCAfWiPYWn.-1'W'1 Ir/Imr(nn r,~.y 1'r.m.:m 1'V i WDVrJ:Arrn:Il.lyVL IH:KGHKF(rIMYNLW INIINlAA::I'1 t :b Lf::I
:L.T'VFH:P IT:a.W ALEpK:M:AIVL&~AMYEICILYYlG:f11 I'IIY:LYL IF71 tfAYFtl:f111.1!
: ~.~' I(,tVNLK:i 'JVHHFY)tr..:IV.:WVF!.f.i~.iPMLIVf:VMI/ttAPLIVa.:AYJVIiR'.tltl:Vl7AILCLFAIL::LW
U:VF'IvVIJ111t.F:LNYA12KF7.FIJl'/Lt'M:9rl.ltA'l'/WLELLI:IY:::FVv:KLYMItIWNI.
' HA VIC:I~.YtNHMF-Mrl'InIC(r::'.t'I:.hFr:Y.Yi.F:IIF'l'I'IJ:TItIGHLWFLI'If.lv71'U"II:F.'fI'Ir:
/In:1:In'lHhi'ILJIIt:IIEYfTOf.'Flt'.ROIONIii(WY.:/ITEYrA'tY'\t~:VFITKLPNGSRR

FLPIIt::K::I:PRrIILKIRKFLPL'IGNVTORPPVPE.W n'70- '' ~ IKTRPLNIRTVFAR71VODLL ~rDC-N:iP-7O ~:ot.le.~c i'QIILpIITMIOILEP11'OFD'17DI'IEFY;..~.'3EPICR:PLEFFTLtPYKEH3FFfYR~Q~E.DVIICDTPP
tI0EI9V ~ .a)~' EKNDKti 9 81,1.

'LGipOyFRVFE.iIPE0E00AANFLaK.SELLEt.~r~ItKPRI3P3DERNARLIOKNOKEROELhOYAL>C~1'".I
JZDR'WPI' AT'CIIn E

IEDQPCFP!'LKANECOHITSO.f'ttLF.TRYFPSA.iLKCNPLSNYSRYYIQNfYFOIPSPT9GItEYSSIGOKtNP
ft.JIEAVQTEFfSEVPFl..'fLEEFAKCYKICERPIRVAKVKVJ1KJ1P
' K~

t v EFFSIRIOR.iFLLDLYFA(:I~OLEvKRLLOYIKRNNKOVCNFVP101QAEDPAGSY!TPKBHIIE

:LI~aSCLI:rCDYQEFLRELLT~.IORL.iODFt'IPEFPPOTPLAIL'fCOCSGAN~JItIRVAT
r r er .
r.iILSCCNL:::'....~'1'CNAYVEaF~SYAIPOLLERQAD~LA~I~~58425 53n31J
tK:SEiIIMNCL!'CLSSAXAGIA CPcy0501 .' ~:.Klr;KMWPVFLIGPVDYWItSKITALYN3NHAVLTI..

'.::If.". : ..cl,...TFir:,~I.. ..ay~....% ..
1~
y~~~NF
FF
~ IP

':Pn_J4'J1 571595 ~.'lJJJ4 CITA
~YI~
~V?EAVI:VPAY
PN~OPASTKDAGRIAGLDVKR
1CAOILlDWKFI

~1389 hypocnee>.cal 0soceJn AAt.AYCIOKVGD~IAVFDLOr'GG1'FDLSILEIt~GYFEVLSTNCOTf.hCGIDIDEIIIIKW
NSCYSWt'~tLFSFLVLFVCCI110CIPLCPOCKYETKSYIJtSDOLPRLKDAAEKAKILL9C1ISSTEINpPPTM011 DCPKHLRLT
AI

ILSSLY7YF11QITAF :p ~RTOWIEIDiRAYLFSLPVDSSLSEAITNIVRDLNIE~EGIDLSKOtlI
' YKELICK
DVLLVG
V
I
A

IGYVOSLL~01FL p DNGNNYENDCYL KCIO
IPIfCGKDCHiLP~TfTLFSPLfADP CNSNIIPA
I
LTRJIpFEK4AASLIJ<R?KSPCIKALSDAKIS

PFICAYEICERPYCECITRSSAERPLLPKEKTCOEPN10CVNP0EWAIGAAIOOOVIGCI:YKDVLLLZNIPLSIGIET
IG~LVINIITIP
~~rILLRLfDVSRF1NDCDPGI00GVF>i~t!'~.DttiVfIDID

ROVINSAGIRFNEKtIVQiIIVOATIlr7 '110KK0IFSTAAONOPAVfIWLOGERPNAKDNKLIGRFDLTDIPPAPR~IPQI
HPNFPRPNLSD~VDLF

PESdIVIiSDFPVAGWSG11Id0t8FRFRW1~.S5H~EFILTANCIFHVSAI~IIASGKEOKIRIEJ1SSCI4EDEIQR
NVRDJ1EINKEEOKRRRF3ISIWlN6A
RLYGCCCYIVSRIILTFPERPFYCEWGAE4RPrIIL.RNOE~AQPIFAIfPETLVKEIEERIENVRHALRDDAPTLlII
KEVTmLSIOt~II

iSFRYTPOI DSNIFRAEMIKDYKEO
WEOQIIFGLDOSYILCNWAIVOEfGRKIMVLEYNQOFSK~OFIRtPCNYYGFRLTYGFce~lo~sAtAAASSAANAIOC
GPNINTEDLKIwsrsTKPPSNHCSSr~HItGCVCIIDeI

CPn_0192 571617 571801 ~ .
t homoioQ present >.n Genebank/FlIBL
as of 11/1/98 586118 588511 l N

xls CPn_0501 o ro vac8-ribonueluse fraily LFSLIFPICEERNSQQTYItHLNVESACFLLFSPLKINWSSPYCFPPPYRROLKL

ATOPTS>"1'IGF:NOCPKL'~9LL~LPKRKPGRR1'YGKSLILIFIPCTLFVNARIOGFOFV

=Pn_0197 57514? 5718s5 SPONPEEYPFDIFVPAR~.RGALOGD1NIVSVLPYPRDCOKLRCTISEYWtCKTI'LVGT
nt in Genebank/F~Olt. as of 11/1/98EDRSPALON
' LSTPPWVDKP
l C
t ~

op prese O
No robust homo Iv SKTEGSHSKTSKGFUCRFVGWIRTf'IGRGSKKRSPSSFSP1'HPYIRGRTYTRSPR09aVE~
IL
ITSLVSFTBALAYTSIISCSOSLIPVBLLPGRTYI
LEFIQITf~IAKAOf0AI0AC1fNL11ELFPPEVIEFaSLFSOKHITOVIJISRKDI~DLLCF':

RKpEOAETSFIETP10GIL10CQ~DPKGKIiVIRJK1MI
HLDKEAAKRCNSIYFf~It IDSgPARDF00AISLTYOIRirNYII .L'NfIJIDVSIiYVTPNB

_ _ 5~~~ ~WI~~I

CPn_0191 575370 515116 ~GOtr3fIPLSKItJtF7at'II~KK1'SDIREERCCIRFVLPSVZfLSL~PVALIItIbTFS
~c in Galebenk/ENHL as of 11/7/98 IDiKi!'DIT!'1'PfOE
l L

No robust homo A
o0 Pr~
HKi.II6FNLKAIIEWAYNI8N0GVSLPFItSHEPPNOENLLaFOE
CSASVGVTS~1~LA

YINIRVNPYGSYRC~RNPSPEDGKKDVPLSCNSRLNRPOOIAR~PDYOnS.fTLTS71GIIPLDOVLHSQFVitSNK1' ASYSTF1~80GN1IGL%IJ7YYTNlTSPIIULYID

SLEKRVlOCISLANIrIC LIVIHtLWNPL3IDQT1Q.EI
IVRAGSTKERVSAKAF1i5F1~il0LTRFIMVLCWP~

NAYIITANHBGLStIIVTEFCN<tGFIAAATi.PICtYSLKKriALPESIPDKIatIGJISIRSrtID

CPn_095 575507 576793 SVNLLTQKIVWSIA1~ICPl~IK>ITPSK1IKDTKK

aspC-ASparcace AlninocransEerase KPOYIISEISIOG.iIGFRK
EMt~tIWPRFSIi . CPtL0505 588471 589106 , ttRLK>Q1QIWAIOKAGAFLRCLPSESRPYL l ETIPEISVIDLSIGDrtOPLCRST?OAIKEFCVSOZILQp~''t'~

YENRISPEEIFISDOAKPDIFRLfSFICSEKfiGL~ODPVYPAYRDIJ1NITGIRDIIPIJICase .
)-alechYladenine ONA 9lYcosY
KRIiRK'KEPIOCCPRNVLO>I~rLSEwTTLl100LLOfiKLITTHOGLITSC1LIVLT6J1YR

RKETdFIPELPt~IQOSLDILCIA:YPNNP3LZVL'1TGOI4ALVNYANOtKTCVLIFDJ171YSAFR
IPEAKYCAIEINSFSKSLGFTGNItL7lG8tVIPKLLTIDIdiEPNINDNKRCPOOIU1CHAYN1IRK7CMIMlKLI(O
OBAYLYRCYGMNNLLNVV1'GPEDIPNAVLIIU1ILP
ISKOIISQI'LT
PAL

VSDPSLPKSIFE Y
OLFPTPPAISLYLTIIAOKWISL><1'MFfV110CDHAPYL'DOGK>ILNIORROtIRDKpIHLLTNGPOKVCOALOIS
LENNRORLNI
FATTPNGASLLNQCAOYYGZ KVLS
: ' . LLSPI9SG
A?11RIGIDY1HOEYRDIiPNRI
WVELPEGISDELAFDFFLttQYNIAVTPGHGPGSCCQGFVRrSALTOPpNIALaCDRTL'1'A

SLKITNVIJv cPtt'OSOS st9ass ss9tlo CPeI"0196 576751 577811 CT131 hypothetical Dsocein CPNEISPIPRRICKSFILNM.>a.YSKETN7WFLISCRRfNKRYFITiit.VILLPLiIf?IAI

CT391 hypoeMSieal Drocein VlIIIItIFLTOPIITZaStFF>DCFSFY17UIMLIJO)VLOfILLP'GLrPIITVLtGFLTIItIII
SCMfILRIUSOYLFFt'SLICSFIYVATCGSOFOSVSSPICIAIFLSFPNVNIIPPPNNiYOCfGWA
D ' ' ' PPLYRITKRN IFCSKECSFKQV

PLLEDCSKSCIE1'LKDf84LPEIWLJiiIf~SIVKARKTARSLFtfOACtWAIVTLGTI11TK1 FKSIiiIYDIIILHRfPIIKTVYKM00VN
GDAPNCC'TO~PLVTVFIPTCPNPTSGFLTLFRKSDIV!'LCNIII6DJ11KYII80DV

VIISNTETORPVIY71l1VPDILESLTLPKNtI4tIYfs'VI81LLDIt7I71ICFAI01WATN710fIVYLPSSPLPD
EiJfOD00S

KPSBPFPSDLOKEIVKKLMSGIEYIEISITSSTFlCfItIR0AI010tPS11IFIPLSPLStOCLSTPNAC

ECfAFLOEILIfIdCIPISTDDfSLISDGKCI11CSVD1fRK8GKQT71KTV~LYN~~BL
0s07 SH198 590122 CPet RKII71QRLSPTi'1'PNEDIIKYLGIKLiDCTDINOpLSE'KSAVS-Cf431.1 hYDOChecical Drocein S'fPYPQFPLSCEIKI~'NIELFlIfRNSKOARRRAKSPKKRKPRYAIVHPAPAPItIVYaJCf CPeL0197 578107 5178/0 NALSTSDSIFIPKIG

CT388 hypothetical protein iPQRWL~SWILCVKVTPKAKtNICIVGFOOQALKVRVTEPPT~OGXANDJ1VISLLtUCAGS590133 590300 l,pKRINTLIAGETSRRKKFLLPNRWDIIFSLNIDV~CPt4..0508 Cl'131.Z hypothetical protein SRINSRNRSYGKSVIICVTKPIIVLIDiFERVEVLR)aGRWNOSTA%KVICLPRTPILK

CPn_0198 579D62 578085 No robust hanoloq present in Genebank/EMBL590808 as of 11/7/98 YCRLRRAPFIBaRRKARWVVALFAffrALISVOGCPWSQAKSRCSI~fYIPWNRt'I'EVCCLCPn_0509 590299 PEAENVEDLIESSSAWVLTPEERFSGELVSICOVKDEIIJ1FYNDLSLLIttICAVPSYSATYIPr~iceed Netalloenzymel NKFVFLYGNFIRV1'QEKIKItIVSNEOTCIPIHLVSVEKLVLTLLF71LKVTlIIEIFIYILE

DCAWPGGPLPALRORLDFLVRENORCVRFKICIVFL.CGERGRYOSIEEDfJiFFDSRYNPFDMtaELHDKVFADPSLT
D'lITLPIDAPCDPAYPtM.CEAFISP0AA4RFLCJISPNDm PC~6IESGNRVTPSSEEEIAKFVWMONLLPMWRDSTSG11RY1'PLLAKPEF~1RWANRKIYEEISRYLVHSILIWLGY
DDTSSEEKRIaIRVKIaIDIIGNLRKlOIALLTA

VI'LLLFRSYQEAFPGRVLFVSSOPFIGLDACRVGQFFKGESYDIJIGPG1AOCVLKYNNAP

RICLIITL7~'ILICETt~CLNISEGCFG
CPn_0510 590801 591971 CPn_0499 580104 579705 clyC-C85 Daalains INenolysin homolopl OLNNLHILWfFCILLFLrItGLTOPSCHGSSKFLIrfItJpRFFKDKGREYPPFPSAPTILA

TLLCILYGALfiTKLYTLLPPK'tJWKDLLIWPLYSLSALIAYOFLPPNISTINPK~iI'IAHL
No robust homoloo present in Genebank/ClaL
as of 11/

LaVYLLIFYF~.FX;STMSSVNOSSGTPNPEEV'fSPESTEFl'IKNWSSDEiI0ATN11VALPIVRFLASVFOLCLFP
LOLLFYRRRPNQpVRSSI'SFOSOLSEALSAF01'R.IVRIVNIPKVDIF

".'OLSLPDGVGTSSEETASNPRVDEIVAEVSSSMVADQISSLVfltVGELLODLt(C710SLFOEALVLVSEEGYSRV
PVYKKNLDNITrILLVKDW.LLYTSSHDLSOPtBSVA
ALPEEITL

TSFOSEL1CJCLPAWKSSTRRL>:T'AGRGONADIARLELERSDIfAVLGNANOFfIGKAHLIL.
SKLTDVNHKLOCLSREDLSIrIFONNDRVLEHt.GSLGI.aVpilECiK'ISLSCERGIPRLVLTAKPPPYAPEIKKAS
SLLOEFROKNRHLAIIVNEYCFTECIATMEDIIEEIICBIAD~W
~aINN
V

DSNLVOIKKVNLPTVEELRTL.OCITESSSDPRVEEStSCCERLLNELRRIbIANIVOFISSFHKVOAVPW
ENTPYKKICSSNIVOGPINISDAEEYFftt*IDNENSYDTIiGGFI

~YONIVEYfJBIIVRRINLLPCLGCLPFIGJPDASOEDORSS~ERSTRRERLSRRSDLSEECNFDIEIITCTERHVCKt JIITPRKRKt?lIS

FlItVMECESINPESPHCDCRNpPSttCDKODSDSEEETEL
CPn_o511 5:3111 592488 ,am_0500 59064? 58236: csbV-9iqma Ratulatoly Facto:
NSDfOKEEHCSTTIFHLNGKLDGISSFLnIOENL~>OSLaAGSKNIILaCAHLDYNSSJYDIR

pcoS-Prolyl cRNA Synchetose Vt.IQ..'YtIpVCOHSGKiVLTTVPKTIEOTLYVTCFLSYFKIFNTVDEAIOTLMfD00 ~PNSNKTSOLF'tKTSKNANKSArIVLSNELLEKdGYLPKVSKGVY'CY'CPLWRWSK?!8'LI

IREEttrAICGOELLLPLLNNAELW~GRWEAFTSEGLLYTLKDRECKSI~LdPrNEbVI

2 5x3538 CSFVAQWLSSKROLPLNLYOIA?KFRDEIRPRFGLIRa~RCLL1IED9YTPSDSPEO~'~OY.
CFn 0 EKLRSAYSKIFDRLCLAWIVTADCCKIGKGKSEEFOVLCStGEDTIGVSOSYCANIEAACT125 hypothetical Droeein GIIPTOWLAPAT

::;iPF011AYDREFLFVEEVATPGITTIEAUWPFSIPLNKILKTLWKLSYSNCEKFIAISLPLTNRRSVCYVNP9IAR
rY:OISTWKFLYSLATPLPAGTKCKFDLAGS

V
~
' xIRGDROVNLVKVAiKWADDIaLrISDEEIERVLGTEKGFICPLNCPIDPFA0E1TSPM'Q

ILTf .sEIIEATAIPVKDHPVPOFEFTLPYCLOVGG
DLSOTRNVIYAENPIrJ
WVIOrK

.
VLrDACWSAOt.FAORRKPFYLYIDP9CE~.NYOEPOVFBI~IR~IVLKKIEIPTPS
St-X:ALINAKDK11WIJVNWORDLLPPQYGDFLWEEGD'1'C'PF1IPGHPYRLYOGIEVAHIFN

:.:TP.'rt'tY.:FEVFIFQDEH~TCQc.IrNCTYCIGVGRT(.AACVEOt.A~RGtVWPKAWPFStRFDf7YRFE0E
FGNL'fLIF:t'EETRtEL~IEItLRE:><.NWpLFIPETGPYILPNLYfNI;PCI
FI::APIKCFADSAFtItJetf:LLHGESERVDSED1ICICIRtYFRDORAL
' ' ' '."tAFtIJ:DTV;.QEIrIE"LYHEW:OCYEFLLODROERLCFKLKD:DLIGIPYF.LILGKSYIRIOLKNL.S

OET
NFYA::.~..~.FEtIQENL::1'DIWKLIN01Y.~.LFNEEDPFTTL'~'I:) JY3GEPNLmVRNIGNIKE

~e::Y:IFEIE:R:.GCKYTV.iPEnlFt"IWC.~tRlLJ1'PKSII::KMKf.YKH I FI 1KLTK:'fVfNIf7N
I:: I i'::FTA.9KEfYiFDFC'IFIfPEFERVVEIYNAN

':R:~1:~1 s~7n50 ~:$::FTfMIIMPFPtCX:KI':x01'Rc.9'VIFl:LKKIILRIY:FVIVSt',LppRCIYKDYPDSPQW
' fn USUI
'l::It:f:fAIItaIKYTItF_'ab't'\Lt'ANllv:'IA'I'It:PPIVL:iFtIITOAPIIf:SELSTOSKPOU1 V
: ' U Rupla~.anr mrctiPtuer ~:l~IfPl1 Tr I

. KAPP
. NRIII:Y:11'/M.TALL~1'VKt flttY:I:VLIPfFFI'D!:IBR.DYEYGONVPLt:VTGKOPN,t rc ' 'a:::F'PIHL:-.~T(VLVC:W~JMIt::KVSKR(Y:KILfSILFATTELYLh"IIOIrI~P'7SKTt.K
ILLTF

. lYNfiIN
t.ay:::h4a'1'fIHNYF'AliI.PJlI:u:F1*KNIIT:XX:RI(?DWLRIIWIfItyI:FI:PFJ1EI.:APtVFY
YLItVi'c~AIILWW::.:I

NI.K I:YJId ::E::I<N l t KISt hKA'f EL4:EILD4PTFF.~.::ARPFNO::VTN ~'tny'.1 f ~.t'.1'. '.v'.7'::
f Q I1'G'/MIORAYt' ' ~1:7LYNEV rvlr.r..Amn.n::.
:L.:CEFP~It'Pfff'C.WL!'l~~t:UCI.::IKRIEKFLONYtPYLP'1'NEEL::KKEEHL.. :: ~
' n .ELLtllC711iKC .
WLYITRriYII':;I:EDt.YrJi~:N::KI.I*YC,1FKDPEVLAL<~I::LFENRR~WV..
LFAN
VI'IrYA.P::I*~:VKNUU4\KKYIIYtKr~::PVia.)IIi.E:RIVIi(6'T/lplfl'l'CLl'(WPKT.'uPLY
::

' ' ' KFJU.1 I
F'F:KIJiM/Ft'I::DRUAIJII.I.tJ:IIJKI:I/,NTikRli'ADV1IRY.qPVCO'l'VYY::.:Tt.YLYP'M
F
LLK

INI.t NA'fAFI':Kt:lV.hfla7P:TlN:C::VITII'YITIIR:iPL(:ALI:IL(a "
' PI, n'I ~F: a .'K!'r.'::F'lAKft il n'M.WI.
l:: aKL.LP::KR Y::1'I Nal.l W~ 1 vl l I Y.Ti' w ItIF:Piav)::F'IKt'KI:a'NNFH;i:ak:KU;NFPITEVH IVIY:I:FP."(. '.NldtlY iDLF
ILRTE

'fKIKE-fl7lJtIIIKALTAfE'fAYL.iDLGIJLaIRG'.t'.f'n t15~5 X' _ ~'t"397 ..KDIIGLDSIP~GAEILYDKCRN CT77H hypnchecvcal 0rocafn FLAPKRL:.:::DFLNIHYMAlIOLGtHSNtTMLC'YHKECPCDLV1'l0IVKVPOLODETOGFKNGt CFMHDALL
iZWIIOaL~L'~IR41111VHKEHCK~AINO~L~
~~

FCLLKFACEMLVLCKRLRK.',~fiALPLiLiIIIAVARtFLDNfSPaI4KALWNYLGtF~IALDLLI~
ICTOIRDCCkl4RI0Ehr'~O'INKL~OOME~LRL'~

Lx'CANDL,~>.~TIR4CEK'IFOMA,i.iKEP
KOAGC.~.IVSLKE3LASTflrSSSVIEKEIFE:RKKCNECGKALLCpRTELKNAIttPELl.

iIYERLLi~1'IKKDRWVPtF?1RVCSCCHri'LTPONENLVRKKDR1.IFCHCSRILYWOCSQ

CPn_05Li 575690 595530 VNApCJSTAKRRRRMAV

~T4-7 hypotnecscat prota>.n . .;~' .. . ._ .
~.NfT,PHN".TR~JAM.~.tFit4YiiIllCY.iftN.iFPLSWLIKRNDIRCJLAPPADLINLL.Iw v -R:; ~ ,:~s..s.m : :-.t.s. ..
'..~:1\C"FF.':.'. ..~\T:.-... " . :.':. ~ I:F'.'."'iDL
VFNL1EK~F.%(:'(IJ:LK:::i::KFTrHM'r.iiMl,i:DW:QGawiKUKvIV9FFF0AFJPKW1 .'.FIILAW .-"~.;;, .~.. ~~.~..-LLt.:D.V.:.;;::'::.
' C1.PP5 OCJ1EKILGMS<.WVFFSGItLiK3fICYARKLVATLOSLSERIIf.FFSP11DLIJ(Cr~L.VSPGDI
nV'GWYDLTKLPFVFALi.:.fiSC'.:WiCEHFLPNLWEEALaQFESSPEEVLKEAHONl LLOEYYI1LCOYRLGEEH'tEiFEKFREY'tCTLY00ARLVCLFSKSGETOELI~fVPHLKSRRAILVAITSIIPYSMr IALSOLWILP>;VAiLDPffIC.I

~~I~~~~~

CPn_0515 596450 597181 HLG~NSFSLEVFSJ1YCCCC'JCIVDPOFRtl'CIFTDCDLRRSLASYOGEVLiLBLEKVIfi' ubiE-Ubiquinone Meehyltransferasa ANPRCITEDSDIAIALOIJIFSSSPVAVL
EKNI'fKALKNSGNINEPSTNKPDCKKIFDSIASKYDR1'NtILSLCfINHFWNRSLIOIIf~S

GYSLLDLCAGl'GM/AKRYIlUW PQASVTLVDFSSH4.DIAKDHLPOGSCSFINSDINpLP
' 0527 609910 608726 CPn YSAHKLYL _ LENNSYPLaAMAYGLRNLSDPHKJsLOEISRVtI~tPSGKLCILELTPPKKTHP1sueH-Dihydrolipoalaidt Succiaylcranstvcsse RAWPWICKSVSKDPDAYSYLSICSIOQLP(tDt(Dt.I~LP.SRSCFYIApOC%tFLGAATIyft,RY!lItCFRFPKI
GLTSSOGSIVRfiLKNLGDNVARDCPLIEVSTmCIA'ISZ.PSPItA~tLVR

LEKQ
FCVN(93D6yA9COViGLIFi.CISEADDESTSCPPTBCCTKSPJVGSSSS&ttl'fPSPAVLSL

J10R8GIGLDF4.OKIAG1GKOGRVTRODLEAYISESOOVSIPEIF004VNRIPIISPLRRJ1I

CPn_0516 598909 597255 ~
ASSLSKSSDEVPitASLWWD1(T~1LISCZs~ORFI'CfNCVR..ITSFIVOCL~TLEO
f 11/7/98 No robust (wmoloQ present in Genabrnk/tJ03LFM.LI4GSLDGITIVIeIKSIIINCVAVMNKF~VVVPVIHNCODRGLVSIAKAIdIDtiSIGR
as o VVVRDDDSLIIIRK
IILnVLGRAIAKAYYVCMVARGLCDFPTLVPNERLPIGPPEVPOHTS

RISISFRVSWFVK LNKLDPS~~~'~"I~IIRYPEVAILGICIIOKR
WDliLSVKSDYEEJVGPAICIRSLEPO
A
' I4 MVYM'LTFDHRVLDCTYGSEPLTSLIO~tRLESVfMG
SIISGLDDILt(LCILORRPF
w110CKEFAKRNF
~RLKFPKSIGSKDAVIVDSF3~lVPVN
!' F
VSOISPAHCRLCSTLVOWAPILGSEEOLVWLEE
T1'D1TSGVSFJ1AAAE7lAVDSTPGTEE
' ITIPAJISC CPfL0528 611165 609921 ANPfQCIPMSETVESSPVAPCNTTD t PSPSLRYALWOFRfPYPEPPKEPEVMFTDEEICSLILE71TRARRMQ.DLYNCIILiIDItEL.'K

DEIOKIIVPDLPENWRtNWRwSERLYKFPFKTKKEGLEEI!'LMCELGFHIt.ARGZ.RATOSQ:
t?1cT-Glutamate Syspor Ll~OItJ~CIPIGLPSIG1RCGLVLEGIAItFKPICDIFLNLLSMWYPLVPCSIlV11GI1LRIS

AitIKVFNSLYAkOi.QSFNV~GttSCTIIKPLPTSKLDLFKSEFrSKPIOatILTEFLVASDEEIDMOO.GRICIKSt IGLYIG1TALAIVICLCFAt4IFSPCNGCDFA0AQ8~SAVTVIf IRT

LFKGLRVLEPGIEL7fYDHPt>DAGEIRSVLEGLVO11GRISCYtitNOPPGRFYLRGVG~AAY!'CSIIAQVFPSNPV
RSFAEQiILOIIIFAIFLGIALRLSGERC1RWPJtfIDD~N
lCQRFKSCVR'1'IO.VG&F11DES GCLVAPG
FFESSDEEGAFIIDNfPSKTAt4 GEIlt .
LIIMItItIMSFAPYGItit181tAhtISCWIGt~Vti~IGKFII1IYYLiICLl7IATLVI
p ELVt$i.ESLV11S

LPItiGRFTILV ca,ISrsxFLSS~IISUIVSrASSSATL

GTAIF0~4AAWI~If~~~'~'t'I'~ATFSAVCliMYP000lITtGSf1Jt81RL

CPr~0517 599637 598795 PIOCIAII~1GIDRLRDIVGTPIQtILGOAWATYVJ1SG~.SPYESI1C0E8VE1T
f 11/7/98 No robust hattolog present in Ganabank/1?18I.
as o 65 FIMSSLLSCGRIEPTRV1'CSLKTYLEDTSONOLSTRLVRASVIFLCALLIILVCVAtSSL

IPSIMALATSFTVMGLILFVMSLT.GtrifAIISYLTYSTVi'SYR~(1DIAFEIHKP~SVYYECPn..05Z9 CVR)811DLCRSSLGCGEIPIVRTLFSPFONIIGLNHAL.71AICIPLEftFJIFSPGPPFIEPt.VDwAYc_aH-ATPas_e ~PSTLFLFYRRVTIAISLEGILGCyOGSLLSIIVPAPLVAL111F
F

~LIRDrRPHVSSLCFVIKQ(iSSLRTKDCNI'ICEAFRSI)YOhHFAMVDCYRLZHSKLIIERFSWS1PYRARSTVI

KNGLIfNL7I IPSVMVREDYPSRPGF~YRt:GLLIUnt'sGKG7II.
KLTWDSIIVlIBA&YVCDEPtiIIAJDILP6SVWVNKaIIRISAARJUIEKIGILI.ItOGLOYR

KLHI~IfEIAWN00DPtGGRAFFPItGRLRDFPLRLKlYDAIIVN00GKEJIG'1'VVIaV8f17t CPn_0518 600806 59983 POIFVKPTIASVVWIIB~RIPKF~4RaRVCVFCCLGFPOGFiHrt.REIHILDKYLL

CT119 hypothetical Drocein I~SVIG.PRLSGEVSLLPIAKVCIItI.iVNOD
Flail'IfPVPONfLLLRILRI~~iiFSRSDDEIiDFYLDRVflGFILYIDL~fDOf'~Ci~RCIYOEL

EiztIIERYCLIPKLTFYEVKKZlICfFINEKI'YDIDTK>Q(FLEILOSFLEFIYDHEt71'LSLtiK4IE0IHKNRO
N

AELIDOFpOfYVERSRIRII LKIHLFDAK\ICIICITQ
ARQLLSNKUCIYYSNEALiJPRPIOtGRPPKOSAKVEI'>:1TISSDIYTKVPQAARRFLFLPECPeL0530 613323 611160 , ITSPSSITFSEKFD'IEEEFLANLRGSTRVEOpIHLTNLSERFASLKF.iSAKtGYDSGSTGspoU-r~
Nachylasa SVVL~1CKFLWARCCSLiIPWEFCSl4OCIGK10QPLVl(F31UIL.KRSRCWISStiPL1f19311REI0 DFFGDDDEKVVTKTKGSKiIGRKKSS IK~iPVAVI
t.DSTLtIQLdF
KALRTr,YLCOHVrCSTtILSItXiXEFLYf3.KmiSTICILYC

_ _ QKRWIMOmFI'IQPFYLIIOV~CPOFNCiIZLR
ADGaGVDDV LOIPIV06Y1tP

CPe~0519 601707 600901 MMtSSLGAVFSLPILSISRE6GKELFKOEDwLYISrTSPPALTMYFBKNYLGPIALVIGS

dapF-Diamit7opimalace Epimersse EKDGL?E~1FSEOlSEIALPMIGCSDSLM.ATSYAAVAYEWRQR1NN
OPT1G.RILVYWMAFYSPSI'ISKYFIYSGMIiRP'L.LGETLPEVEDVRFLCOECRVDGFLYL

KPSSCADAQLIIFNSDGSRPTMCf~iGLRCI1IAFQ.i1S010CKSDISVSTDSGLYSCYFIfSYtD

RVLVDKI'LADWRJVVHRLESAPDPLPK1LIMIfffLlfPNIIWILPEISIT.DLSILGPFLRYSJ11!
depasdant methYlcransttcane HOTFSPDCVNVFtPWICGHCOLRVRTYERGVFGETAACGiGAtaSALWSNSY~IKE.SIODS&ImDFRKEKC'Rr RK501I~R~K~RHSKTYFSLIRERLVMDYKLGDSGtICiJKLt~F

IH'tSit7GELMTVSONRGRVYLQGSSIfRDL
GPVI'LIRPSSYAVWPKSRPEI3FSQAAt4YVRDGERGAiiIO~IFKRLPLBNEVAPStNRCLLK

05I0 601233 fi01616 RTPFCNLCYFPEIBGENPALKOJIIEKfOCERWIi~t.FAYIGAGSIPMKOGARY111V0iAS0 CPf IYF

L
MVAtIAQW4V~I~tAPPEIIRIFYfVIEDVISFLKItEIRPNKKIfOVILL~PSYGRGPOG

elDP-CLP Protease KIDImLFPLLSt.CSKLLRDDiI.SYFLLTSHTFGNTPEFLRAIARRSVPlLVSPAiiSC'GESF
ERHYFlIADCEVHK1.RDIIEKELLFJvRRVFFSEPVTEKSASDAI1GG.WYLE1JIDPGICPIVF

YINSPGGSVLVIGFAVWDpIKMLTSPVI9WfGf.IILSMGSVLSICAAPGRRFATPNSRIMICGiGiIGJILP&GStV0 IpNYVEJ1TNOPRDIIEK7IIt>ROMWltTAl4FaCPrt0572 611716 614075 KDFCLLDGILPSfNDL ribC/risA-Riboflavin Synchase ESFCCKDSVVIMOGIffSGII0E1GlbIlCFFEAOCa4CLu'LCIKS'fPLFVTPLVTCDSVAVOG

CPn~0521 607807 601211 VGLTLTSCNFSKIFrDVIPCI'LACrTLGEIOtCSt7pVNLhAi.KMGDSIGGNLLSGItVlCT

QlyA-Sarina HYdroxymethyleranstaraseAEIFLIKPJJRYYFRGSKELSOYLFEKGFIAIDGISLTLVSVDSD'fF'SSICLI
PE7TQRT'fi.
NFEKFKKFAIVEIFTIIVtIAWSLLNKFLF3~ASGKKGOSLJ~STAYLAALWILLNAF

KSLLi GKKROGERVNIEIt)IISfICIQVDTVKRILASSCKD
PS ICERIIDELKSORSHLIGIIASENYSSLSVQIrIMGMd.TDKYCEGSPFKRFYSCCBNVD

AIf.WOCVETAKELFAAOCACVQPHSGADANLLAVMAILTHKVOCPAVSKfGYK'IYNELTE
CPtL0533 614918 615385 EEYTLLKAfl4SSCVCLCPSIIiSGGHLTHCNVRLNVMSKLIOtCFPYDVNPDTCFDYAEISCT106 hypothetical pcotein R:.AKEYKPKVLIAGYSSYSRRLNFAVLKQIAEDCGSVLWVDMANFACLVAGGVFVDEENPEYAPHOCPFCNHGELKVI
DSRNAPEJ1NAIKRRAECLKCSORFTTFE1YELTT.OVLICRDCR

LFYADIVTT:Tt(KTLRGPRCGLVLATREYES'CS.FM.ICPLfOAGCPLPNVIAAKTVALKFIvLYCNFOESKLIItC
IaLIASSHTRIGODOVHAIJ1SNVKSELLCKQNREtSTKEICELVNKYLK

S'JDFKKYAHQWFBJARRi.7IERFtSItGLRLLTOLTDNHMMVIDCGSLGISGKIAEDILSSVKADHIAYIRFACWRR
FKDYCEIJIEIfL.LSATPOF1EK

CIAVHANSLPSDAIGKWOCSGIRLLTPALTTL.CMCIDEMEIfADItVKVLRNIRLSCNVE

CFn_0574 615389 515784 =Pn_05~2 607835 601655 dksA-OnaK ~uppressor WFTRS(tWPta'DOEIEOFKKRLLFI41WILSHTLEGNAQEVKKPNEATGYSO(pADOGTD

':'C433 hYpotheci.-~: protein TFDRTISLEV'ITKEYELLRQINRJ1LEKINESSI.ICDVSGEEIPLARLIAIPYATMVKA
REPLSPEKTSL1FKVKNVNpRMIKKNQGKKKNYFQYIPLKVpKLROPSFYPKRLMTLYLC' ItIOKTARKYOAHYLPILTLFPYAKSTPONKRALQFLPOATHVILTSPSSTNLFLSRMTSLCN
OEOFEKCLLs LSYJITLKTK1'YLCIGEs'TKERLLSFLf'.OVKYWaTGEIAECIFPLT4aLPS5ARILYPHS

CPn ::LIRPVtREFLYNRFTFFSYPHYTYKPRKLKKNILSIC1KKIIFTSPS7VRAFAKIFPRFP_ lspA-Lipoprotein St4na1 Peptidase cYT'fWCO.RMTLOEFCKFSSOKOVSLLET1JGKSRTSFKRTPCWKLSSMJ1TRFRSILLVITLP'VLIDNV1'KLWLG
D'IKDLOILT(tPTLYTH~CRFS

052i ~047~0 d050S3 F.iIAPVFHF7CAARGLFSNYKYFLFLLRIFVILGL1J1YLFFKKKSIO'"I'CCI('ALVL(.CACA
'_tm ' ' _ ILISCGTLLLWKFYFPTKpFEKKR
tk robutc hollwlcxc pcesene in fFNVAD
CenebanK/ENBL ass .'i 11/7~aA hNVCDttFYCNIVDFt~FNYKf,MAFP

s~Nfv~::ATC~FDGTAF:sLFPFITRPRYNFKLALFVT1AIALVWIALtA".'TIAICLCIHPLC
~Fn O:s~. eI6700 X175'71 .:F'IFLTAICLYFISRYIC.HYARNVYIai.DWfDII.':YLODNR~HSFIF~DRbrqA-f Al.rl:ly Pacslsaaxe 'tn n'.4 0:15070 W 1v1'1s 'fR::I4EY1.R1'FFK1\fINRLLsLt.."VFDGFFWSY'/AFILII'ILGV:F:111I::RFFY)F'fIIPSQ

rmtnt::r IN411(tl,.ul Ns':::rnc F'~KLFF'I'f::\MICQERETKL1:VIIPLKVFFASHYXILCICIPl.ICIVTMta~:::Ct:ALPYRhiI
in Urnatkmk/F?1RL .u: ut 11/'7"tN ~
IN ' ' . .VIVAtIdJaYIVFI
'.y':'.VL::S'1.'FRDKCIADKYyFTtAKf:TL\ILA::Ir\LC:ALVAUtPLIKJ1FKTPW.
:RT!'. fIPM
'r'fE~fYKFNN::f:C~ N:(Fr:::t'JKY::F:V1'L:IKFRKLDRDfNf~
' ' ' . ILPFFMLLYt . .af~SLORIUKLCa :::CIVII:NI'VYI~\Ll.f'I'rAI.Y::VYfFLV'fIKtFrC::Y'/::::MJtJY.'JLG~sNFKPG:14\WVEK
NAL:L
'/riF::V(T0.~.L\I IIWNI.FKI.'YFM~LLFL'/FYAI

'/!.':'f:alF7M.:F"fHNIILN1'KFKt:\LyfDW:QPF'QFTFL".':LRVIF:1!MQ::TC:IfFNf'Vf:PfN
L'Iff.VKRFI1TLFIiLIt:IYF:.::AYYrp,iAL~FArYZYAITINOCISRMY:iaiIa~7Fiki1 :LIJIVLAr.3W5IGLENA:W'VFd(TLL:YF
D
'f ' TL
' ' IsarfA~ftlt.::'Cll;f::Tt.KI*::VWF/tt.'Kl'HEIr%PAK':Ff.FF::f'fFNRWKLPNEALOQTFNLP
:
~41~
I::IWAICMLI
:
..:AK
:
IJ::F
VI':YYr:AKFL'1X11'CAY.IYTLYr:(d.lL1'LF"'FI::()Nf (TF'F'YVn:Y'ITII::YFt If4VKFFt Itl.:::AF:YY::II.I'fI~L:IS\rtta'K::l'ELFNW6'YYItVALIJ1'fF?X.'LK.1\(E::ItM\tVALC
LF.
DFFNTtE .
' ALI.IN.'9/:7 i\1.t.1.t'1'Nl.4SF(LIKE'fIFPAFAA:LTET::L.iTE
' ' /It~liiLLVII.O
S
WIW:,rIVAFY:KIAL(.DAIiIfPAi.t 'r::'/'IF:VIIFit11.1Kfi:fF

7,; :I::F:F:
:H' ~t'.sY nlIH1'. '.Itlltr'.

;:Pn 051.: ~i ~luW 3 ~THI~I.: I,ynncnRtt':.t: n.,r..t:, rL:I-L2: P:bnsal!.x: Pr~cetn ROLNKILKWTHR:~
~:LVPG1KI
:I'atr~'IIVNTI~FC
1:
~
' .
faKORLTL~IER&fBKRIYNBP~YANiQ'0('~.I~~W~YQVBc~~DY'D1.'EW.CR.til.'~
.
SHI'LJiNAO~fAEYL56N~WA161KY61I1KI~fYiI~l~Ilil .;.: FVPD01'KASIl' :::::::LLF~
.
:.IFLLFNDN
LAptII4:NNL4:,WLKKRKNM'..:L:CD1GELLDEKKOR:.'WKKNLDGGIKhCAALVLIWKV

F f NN0 r.
EILI

'Fn 1539 .t8129 5:A511 051'.' :115Rn u32198 CPn ::T914 nyprYCn,c ir:.i: Prott:Yn _ 'F'IiKTKTLRDI'nIPRNNHKPNKTKCKRFRWLRyGbB famil/
.VLF,r'CFIATLL ..-..
~Ft:Kr -TKEI .. . -tK:AQHw .... .,.
~ -.
:
r-~

' ~
-' . _ , _ . ,, , t .. .. ...._...,s ;...........,.,,.:1Y:.::.Ya:.':!':.;CFi:fi . ~F
... ,....: r ; :::~ w"~;:::: w:.... .........,,..I
r :
4:
.t ~
.~.
....
...
.: _ ..-.........
'..:"
.
:
; '_ ;:
:
;
~
' ' . :.
, ..
. .~
, .:.
_ ,.
, . .
., m .
, ...;
..
... ._..
..
! ..... . . .!. ....... ........
..1.. ... ......v....-.. ..
t;,>f,~LKPNGY:.:HVnI:T:w.iitPKFV:KL;iALRt,~:.l~;Vr:M7L'i a":::~L': l:.iu~fivLi.'n Cpn_0539 518678 621545 FCCGOGVOCF~PT'JNCiCD

Pm0_I7-polymotphie malorans proeeln .~.fHLyCLR10~0(~OILWGFLFLSSFGQVSILRANDVLLPLSGINSGBDLELP1'S.RSSSPfKCPtc~0548 633231 a32191 'rJ'YSLRRDFIVCDFAGNSIHKPQAAFLi4LItGDLFFINSTPLAALTP104INLGAPG11GLFSeysJ-Sulfite ReducGass ' ' ' ' ' ;NVTFI~WtSLVLENNESWGGVLTTSGDLSFItBfI'SVLC~l4ISYGPGtCALLLOGRKSK:
E IKVGDALGVLPElIS
DSND

IS
KHYLpEKPKrWOVPLVLRELLSCSDSINDSDPIYRNVF

.
KEVSINVLOLLr"YSPTfLYNVKKTSEKVSAQKFIQGYVCL00fIPAKLtISFPPOKOPKI?L
ALFPRONRCTiLFLKNKAYF)pDESNPGIfCG7IVSSISPGSPITF'AONQEILFQENEGELGG' AIYNDOCJ1ITFPNNFQTTSPPSNKASFGCAVY.~aAYCNLYSOWf.~TLFTID~IAAAINCGAIHY
YDAIQEYRPQLPIELFACSVFPLLPAlYSI7ISSFDLMPKSIELLVKtRISYPGKYOKRFG
' ' ADYVIIIRDCIOGS'aVFEENSATAGCAIAVNJWCDINIWCPVREIFCJ&AI,GII~GiIIYYAATCfECKPLVIfICA
G
PCIAPYKAFLEERLFlBID

:SILRLHANOGDIEFCCMfVRSpFNSNINSTSNITFINAITIOGiIPREPSL4ANEDHAICFPGNNLLFPGERKEKVN
SRERDpKVYVOCL:.RI~DCVR
' 'fDPIISaTENYNSLYINHORLLF~1CGAVIFSCAALSPEHNXENKCtK'1'SIINOWRLCSGYLASLRKENRYWDVY
KAYEEGCPFF'JCGRINC~GZEVK11ALEEILGImI

.CAILAVRSPYQD~LLiItGPGSKLT?OCICQSD~ICIVZTM.CFNLO~IL.aSSDPAE
LSIFf . CRL0519 677662 633255 IRATEKASLEI:a~.VPRVYGHTESFYENNEYASKPY1TSZIIS11K1aNTAPSRPEKDIQNL

LLAESEYMCYGYQGSWEFSWgPNacKEKKTIIAS111'P7~CEPSLDPKR~SFIPTTLWSTFrsl0-510 Ribosaslal Procvin S.'.LNIASNI'ltdd4YLNNSEVIPLQNLCVP!OGWYpIN~IPKOSSNNLLViONAfit04VGARPOD90NOPWNDNS
LLAFLKKFK1CRLLRSKCCIIIOppKQK-RIAIJOCFDpGOLDRSTADIVE

iPFSTNlILSAALTOLPSSSSOQNVApKSNAQILIGIVSLNKSWQALSLiISSFSYTEDSQTAKRIGARWGPIPLPTIU
tEVYIYLRSPHVD1(KSREQFEIATFIKRLVDI:.DPTGIITIDiIL

'INKRVFPYKGTSAGSWHIiYGWSGSVGNSYAYPKCIRYLIQlI'PPVDLOYTKLVQNPPVEPGKllL7ILPAGVDIKI
IWI

'fDPRYPSSSFJSft4LSLPIGIALC~DtFICSRSSLFLQVSTSYIKDLRRVNPQSS11SLVI14N

Y'IWDIpGVPL:KAUiITLNSfIIIYIfIVCAIMGISSTORtr.SNLSANANAGLSLSFCPtL0550 ti35dB8 tusA-Blongacion Paetor C

tlJYG~'DRflf15N0EPDL9AIRNIGIFD1HIWIGKTrlTEHILPYAGRTHICIGEVH~GATFf CPn ' _ dIKINIIDTPQHVDFTI>:Y>DtSLRVLDGiIVAVIDAVS
20-polymorphie mtatnbsane Protein OWRl7IQCOEItG:TITSAATrVFwLL
' PmD

_ GVCPOSEIVWROADRYGVPRIAIVNKfE1R11CAD1fPMVESMCPxLCANAPPVNCPIOSCS
FIHLIYSSLIEFVNISDRFSSHXyK.PATAVFAAVLPALTAFCDPASVEI9TStnGSCDPT

SDAALTGF'Jtp55TETDLTTYTIVGDITFSTITNIWPVVTPD~AF1DSSSNSSKOGSSSSG7101''VONDLISQKAL
YFLDDTiGAIIWEEIfEISEDLKERCAFLRAMl.FZLiITIDEStIGM

~.
SLIRSSNL.NSDFDPTKDSVGDLYNLFPPSASNTLNPALLSSSSSOGSSSSSSSSSSGSJ1l9fVLEDPDSITEDEINp VIfRIOGVIEIiIIINPVLCGTAFKNKGVQpLLNVIVl0iLP8PLORG

SAWAADPIOOG71AFYSNTANfitL?F1TD5(.TIPGSLTLQNLKlIIGDGIU1IYSKGPLVITCLNIROINLKTDQEI
SLEPRROGPWII~1PIIIFIJ'DPYVGRITFIRIYSG'ILK100SAIIJIS?K

10ILTPfCNESQKSGGAAYTEGiILTTQAIVEAVT~1'SAGOGGAIYVKEATLPNAL.DSLDKKERISRLLEIWAFIER
TDRDEPTVGDIGACVGLKFSYfCDTICODt~pCIYLERI!lflOP
' KFEKNTSGOAGGCIYTFSTLTISNITKSZCPISNK)tSWAPAPEPTSPAPSSLINSiTIDI'IISCFOifJG.DIi.Rf RWI
Vlpl4iIILPK$KGDRBKLBDAI.SSLSLmPTPHNVSTHEE'1'CQ

STLQTRAASATPAYAPVAAV1'Pl'PISTQE'IAGNGGAIYAKOGISISTFImL.TFKSNSASREPKVEAMtGKPQVSY
KETI'lV9CNSCI'KYVKQSOCI~pYANVCLEIEPNEPGIt~IIWS

VDATLTVDSSTIGESGGAIFRADSIQIQpCPCZTLFSGNfANItSOGGIYIIVCQV1'LEDIAKIVOGYIPKEYIPAVI
IIGIEEGI.FtCCYiJYCYGLVDVKVSIVf'CSYHtVDSSF~111PKICGS

?JLKMlItiTCKGErw.AI'fTKKALTINNGAIL1TFSGZJTSTDNGG11IFASRiGITLSDLVEVANAVIGIiICRKA
KPVILEPIYDfVAVITPEDNLCiDVIGDI~IItRAGKII~0ES811014141MAEY

FSKNKIGtiYSAPITKA1~NTAPW5SS1TAA.SPAVPJWN1APVTNAApGGU.YSTECLTVPLS>QQ'GYTTSLASLTS
GMTSTI~PAFFAKVPOKIQEEIVIUI

S~vZTSILSFFIddECQIIQOGCAYYfKTPQCSDSNRLQFTSNIWIDEGGGLYCGi70lITLTNL
.

r'IfTLFOENSSEKI~GGLSLASGKSLTNTSLESPCLN7INl'AKFai00G11NVPE?tIVLTPI'Y636174 ti35d98 CPt1.0551 TPTPNEPAPVQQPVYGEALVT!GNIASKSCCGIYSIaIAAFSNLSSVTPOQNtSSH~EiGALLrs7-S7 Ribosooal Protein ?QKAADKTDCSF1YITNIINITNNI'A'1'~IAOGIfANFDRIDNLTVOSNDJ1R1000GYYLI1YNSRRHSAEIGtDZ
PGDPIYGSVILEXFINIM401GKKSVJIRIfIVYSAL~tPOKIC.ti.QY

E'DALZLDNITGSVSONIATESOGGIYAKDIOI4ALPGSFTITDMNETSLTPSII~.YGGVL>!T!!GEALHiAKPILE
VRSRRVGGiITYQVPIIEVJI3iRANCL.RIpWIIXNARBKPGIt~E

GIYSSGAVTLTNISCTFCITGNSVINTATSQDJ1DIQGGGIYATTSLStZtOCHI'PILd~B1VGLI1TELIDCTMtOG
ATZKKREDTtDtNAPaNKAPANYKW

SAATKKTSIZfOQIAGGAIFSAAVTIENNSQPI
IFLNNSAKSEATTJIIITAf~IDS00CAIA

ANbYI'LTNNPEITFKGHYAETGGAIGCZOLTNGSPPRRVSIADtxiSVIJQ4~1SALN11~CPIL0552 636698 636219 .

IYGCTZDISRTGJ1TPIGNSSKHOGSAICCST71LTLAPNSQLIFC~RM'L"1'1'ATfKASINrsl2-512 Ribososal Proetein NLCAAIYGMJETSDVtISLSAFHGSIFFKFM.G1'ATNKYCSIAGIiVKFTAIP~1SAGK11I5IQaOYVPSSSFJ~DC
PLPnGtALI.YZSNLWVItLKREEYFIPTINQLIRKRRK8SWH0IiPA

FYDJ1VNVSi'KCl'NAQELKLNEIUITSTG1'ILFStigLiiCd~lfIPOKVTPAlIfi4t.ILGK~LELQKCPQKRG

LS11VSFTQSPGTTITNGPGSVLSFRfSKFrIGCI11I1ZiVIIDFSEIVP'J'ImNA'NAPPTL%1.aGRVImLIaGV
RYHIVRCILDCAAV10iR1(OSRSRYGAKRPK

VSRTNRDSKDKIDITGTVTLLDPNGNLYQNSYIGEDPDI?LFNIDNSA~iIYI'ATNVTLQ

GNIGAKKGYLG'iWtd.DPNSSCSKIIL1(WfFOKYLRWPYIPROILtFYINSI9>GA~SLVTVCPeL-0553 KQGILGNI4ttaWtfEDPAFNNPWASAICSFLRKEVSRNSDSFTYHGAGYTMVDiUCPItQENo sobust t~moloq prwnt in GewbenklENBL
as of 11/7/98 FILGMFSpVPGHAESEYHLONYKNKGSGHSTQASLYAG14IFYFPAIRSRPILFQGVATYGCl6rRWLRFLIIFIt.CR
AYFPLRASffSPSWETSTCLT4'LGIPFIDIIL'1'lliIDFVAOCG

CYNptIDTTZYYPSIEEKNMAIiWDSIAWLFDLRFSVDLKEPOPHSTARLTFYTEAEYTRIALQIGTISSTNNAKIKEI
FLIYKEKPPE71SISTKRKEIWLSQSNLSDfGII~MU~!!YA

OEKFTf:.DYDPRSPSACSYGNLAIPTGFSVDGALAWREIILYNKV511J1YLPVILRF4iP1U1EGNIIFDCflVGPA
LKOPKDLRLVLRCPNQPDTLLYSpIFJIEtOCIETNTCLCNOGIfTIi.OCQL

TYEVLSTKEKGNVVNVLPTRNAARAEVSSQIYLGSYWfLYCTYTIDASl9TlLVpNIIFICtiIILYGDSIEKFLKETK
RIO~D4HTLVDLCDSOVVT1'FLGRFWSLI3iYVpYLFLSEDSAKILAG

RPVF
IPOL110J1TQLLSRTVPLLFIYTNDSIRIIEQGKESSFTYHpOLTEPILGILIGYZN~t EYCPNCAOSSLGET

Cpn_051 617137 628003 Solute binding protean t-yebL-SynechocyscisCPn_0554 637506 638111 Adhesin Hasologl NNRSSYQTAFVNHICVIVFIFLTLYSLKSYCNDVIDKPIiVLVSIAPYKFLVDDIAEE7rFVCT440 hypothetical protein YAIYTNHYDPtlTYIrS.PPQQIKELRQGOLSiFRIGFAFEKTCERNLTCOQVDf.SONVSLIQGVFSYLLLCIILVYY
RFlfIfEGKSRMASPTPGQLHLpQIfVESIQ1YDYSRSLANIATALLFfI

KPCCTiQIfITNYD'tffIWLSPKNLKVpVETlVTI'LSKKYPQHATLYQSNGEKrVALILSCLSLLPOVFLPFSCAYF

r . .r nQyf,IE

E:LTITSKAKQRHILVSNGAFCYFCRDYNPSQliTIEKSSHVEPSPKDVARVFRDZEOYKI

SVILLEYSCRRSSANL1DRFNMHTVNLDPYAFNVLVNL,KTIATTPSSLCPn_0555 638298 640241 cap-Tail-Specific Protease ~Pn 0542 628000 618737 NFVIRfICLVALCWLLSL.LPNVLPSSDLLREDCIKKhI~I(LIEYNVDAGEVSTDILSRSLS

ABC Transporter ATPaee SYIQSPDPHKSY LSNQEVAVFLQSPI~UtRLLIQJYKACNFAIYRNINQLINESILRAROW

FYTIRILAEGLAFRYC5ICGPNIIHDVSFSVYDGDFICIIGPNOGCKSTLTM.ILCLLTPTRNdVKNPKELVLFaSSYQ
ISKQPIpYISKSLDEYKORQRALt.LSYLSLYIt.IIOASSSRYEG

FCSLKTFPSHSAGKpTHSMIGWVPQNFSYDPCFPISVKDWLSGRLSOLSWHGKYKKKDFKEEOLAALCLRQIENHFNVY
CL;INDNGVAN~iDEFJIYOFHIRWKALANSLDANTAYPBK

EAVDHALDLIr'CL:aDHHHHCFAHLSGCQIQRVLWRALASYPEILILDEP1TNIDPDNQQRDEALAYIRIOLEKCNCG
IGVYLKEDIOGVVVREIIPGGPAiIN&CDIQiGDIIYRVDOImIE

ILSILKKINRTCTI(IIVTHDLHHTTNYFNKVFYMNKTLTSLADTSTLTDQFCCNPYKNDEHLSFRGVLDC:.RGGtIC
STYVLDINRGESDNTIALRREKILLE~IRVDVSYEPYCOGVIGK

F,~.CSPH
VTLNSFYEC~~VSSEVDLRRAIQCLKE1WLLCLVLDIRfN!'OGFLSCrAIKVSGGlInI~IC

VVWSRYADCTNKCYRTVSPKNFYDGPIJVILVSKSSASAAEIVAQTLCDYC1f11LVK~p t:Pn_0511 6JR7f0 e~?603 TYCKC1'IOHCTLT'CD.1SOOOCFKV7IAiICYYSPSfiKSTQC.QGVKSDILIPSLYAEDR(A3R

INeca1 Transport Proceanl FLENPLPADCCDNVLHDPLTDLOtOTRPWFQKYYLPNIQKQETLYIRQILpQL'l'IQIB~Itt.

K~IF?IIu.SLLRDSFPLLILLPTFLAAIw:A'.iVACCVFIC'I'YIWKRIVSISCSISHAILCCSENSNFQAFL.;v IKSSfKTDLSYCSNDIQLEf iINILKDMILL.QQCRK

LJLT W IQYKLHL.iFFPMYCAIVGAI FL11.CICKIHLKYpEREDSL
IANIWS'J~IAIG I I

FISRLPTFNCELINFLFGNIt.WVI'PSDLYSLCIFDLLVLGIWLCHTRPLALCFDERYTA~Pn~0555 e40921 LNHC.~VQLWYELLLVLTAITIVNLLYVIY..'TLLNLSMLVLPVAIJ1CRFSYKtn'RIItFISVLcrpA-ISkD.t wyafkane-Ractf prOCein ll1t11:.~.F':rC:ICIAYCLDFPVf:PTISLLHCLGYTASLCVKKRYNPSTPSPVSPEIHTNVENGNSSNLHFI:C
fCf(1AMPESVLNIVEEIM:CSVTACLQ/1IT.~.sl'CI1VNLLLCWAKT

N F iOP I RE.~sIfLFQ.~.RM'Q ITLLVIrILLWACLACMF
I FHSQLCANAYIiLI I PMIGLIK

~'tm o.44 eln5.tH .;_9525 LLVTSLCFDE,i:T.~.EKLNVFQKWAGSfLED0LD0.TLNN~NKIFr;rNKTEC2dI'.~.RA'1'1'pVL

ylurL-t:TH tricuJinu pruc.:tn ND(:RGTpVL'I'LV::KIMV

K.:;:VFY': x; t K >::FFt:LNKDKNV
t NFVI~' LTLELRAGKrX:Ia,'WAWRKfKYLPI!xPIfOGN

.:.:WY:::'lltf\TT::VP.:FG\'iRNIFF1.I;APfl:67::~vATtMRTf:R::CKDLIV3VP'l~l'LLRDr' .Pt~ IIS~~ ..d.'.H'!9 n.Atln4 :\F:ha:IlJIDI'IVIya:IlLLV::aa!:t:KrX:Kt:M'FFKT.'IRIMPTKATF'r:KPI:EIRGVELELKLryI
ICH~.!.ifei .y.:l.tn.:ktrh nMt I:\nua:r:FlNU:K::ft.l'N'CI
IIYfEVh't'..AYPFT'TLAP3LGLVLc:KURLYOKIMIIADIPEIt'M::Y.LIRItIITJI~t:P::YW
:r:FA:aX:IFJ1AVAF:LITKIVA::At:TKI'AfNItfPAKKVR

::th7:AIY-MKI:U:Id)F'LItIIII:Is'TLLI.i.FVIW::KREP1YSPEEDLLTLIItELIL:HOPDFEKLVHRNKrJfNF.~K
.aa:\FCnKEFYIt:EFf:Rt:VINF.AfjQE.:I:Yr:RLYSYNVNIA1'NVRV:Q:;

!.:I*LVAIJIF;IDIH.1.I'1'F.uI:L~'Lvcil'QNRF'p.':'f'PFVLI::ri:fY;t):VIa:LYRFFTr)R
WV'/Pb'YAY'1N:::C'ff Ilal 1U:KKV.'/fAIVI'lYp)LPr:EAEI1I::.~.UhETTI'1'::I1:KLVWF.IGtI.

r:N:DK<:Y ITLWI.'KILK FI a t'F"PMTVr:M'.
PFLR::Y'f Kl'.f:pPA IC IKytl:ffY.'N.'LHt:fM' n.4'. .. ulInN n au.:f a 'IKIEYVtfh':::iIANM.'11.7Wf'Vfil:'I::IIA:X:QHVI::FNLC.DNRfr:DKKVFTVF.FY:PQIrIN:

~

n iJl'fIJVA9"Il'1'.1;I:IIM.':'..WL-PI-I/IIF.f?HJVtII:Y.IIOW::YVCKF'V6'Y::f::V::NI'tlUf:/Lll f.'/ L.:f n nl.c:uwwt Im.w.:ttr 'I'I?r\HI:FHVYtnIIYYrxa::V:l<FY:RD::Y;:KKL::%KVt:N:rJYV::T:::If.VIa~N'fFYItIPAp NIK:OWIIH/TLl'::1.-IYLYr\IW
:I:It?:IIY'IIWHIYFY4t:W:FTLpFNLWKAi,N17ltF'1'YY~VAV

!n a<IAII'LI~ ALVI t :I \'4TINN'1'tllt'1'Y'P::E:~N't: t'r."T:t: AI.TPfIRnHb I::V\'ft7UL .I.AA'fllW'1L171'tlDt' f CV~!7!?7fV'!R
111.?NMH:AFl'lltM:

1.11 *Y::YEL<,rF I A:::::f"rN. PP
I: r xrrrn~ f~Al.lY.I l::fKE::VEF
:VTI Jtt: I Als xtAM:FA 1 vLNSNIfrKStrn~.AY~;Df.Fr.P'. .suz'rFL.LL:.r~::LFF4-::F
A:.:F:-A.:..wc-Y

~.:::a-r:r: ;w::nTENnr~f EYSBlIJIKAKNHYPLa"CF.SAf :3FLFLrIGiFG.'.IRW*.'.Ltt:FFDA:.Pi.::.:.~
JWIIIIWStF

RVIUCS'!'ICALOL~1.SIT.'YVLIRIIIII~ffVLI.~EPYC~~ill~~.
'~"I~.A

Nn 11.'.N w4 f t4w h13031 DIICYFf'GKAT:NKKIAf0I5PNKTVOCFYAGCGCATLISFTFFtfOSPTRFAS1'lfIIMI

uIM:A-'lk0.m'Ync:me-Rf:h LiPOProtetnLIPLCLALG:~iFFGOIIE:a,:FKRDANLKNSNKLKAVOCNLDTLDiLI.:.STPIAYLFL:.:
~
RIVDCCFEOPCAPSSCNPCEVIRKKERSC~TIACCSY
~
' FCCV'!
:
' :
C

. TOSKEFIG
.
.
.
.
.
.
.
KitIKKAVLIA.W
' ~N:~ SPpVKr.,CTSPOf:RCKO
'IPSC:,Np:f::1 r~ nr,.;.t : ~ !7qn f.41't?7 F'Pn ~S6q w5'1905 riSRlA1 :.'.\ ;'1. .. : . 1..' .. , s\.
. i... ; ,1 ' r :
~ -' I
, .
. ,,."...,.. .,..; Sr~,Iw.-4~4'KF:.
. . . I'.591M:..- , ~.,~:\:.v.;s~.Vr-r:
Si:l ,. ... .
....,. . . ,i':-r: :'.?IF::\11.::.::
rh :~:
\
:-:' ~
a:

''.
::

. EEPPFSfTFA'1'r','JPLE.iFF:7GHL:.TS:..' ,. :':GTEVANA/LStL~tiLPEVRAFNDDIQRRYAOL
..........,...........:.
.
.
.
.e:u:e:r~
.
:r,:;
:

FFLARSVFlIrCYNTNL CML111CGRDlGSKVFPNADLlC:
TLT3SPE1IMORRLKDLPtExr rCGSPOpLQAELVKRDRAD

CPn_0560 b15666 611098 AQRAHDPLVIPLCtCIVIDSSOLTIROVLEKILIt.LFRNEL

4ltX-Glutatnyl-(RNA Synthetase RNSRtQfX9CSI~ISKDKRiNMrFJJVRVRVAPSPIGDPtIW'1'AYMALFNLIFAKR1KG10fILCPtI..'0569 6519)98 659099 RItDTDRTRSRODYEaIIFSALRWCGIWDEGPDVCGPYGPY1~SERT7CIY0GYVf1'LLKPIsC-Glycerol-3-P
A..-Ylcransferase 'fDCAYICCFA?P0EIJ1El0tAVASTLC7fROGYdIRYRYLSPECVASREAk~QPYTIRLltVPLLFGI~IKTSSG1H
FSFfISKRANIFRICKFl7wVAFSLFYKLKVYCHDa1tI10GPAIIAV

SCE<1ZEOYSKGRWFPWADYDDOVLVKSDGtPTYNtANIIIDDIILJICITNVGPGEEWISSNNNSFLDPIALlOICVN
ECIHLARASLFfIIPWWICQ41CCFPVRQDI~tSAAFKIIISRi.FN

TPKNLLLYEAF~TIIEPPVFLIDIPLLii~POCTKLSKRIWPTSIFYYRDSGYVKEAtYNtLTLKRIOC.VIYPDGAPS
POCQLOPGi(VGICI~tAAKSRWIIPVYIRCTPEAFMINQKIPNVWK

HCY~EEVYSLERIIETNPRRIGKSCAVFDIOKLOYI~iKtiYII~NEGSP~Li.ICa.OTITCZtFGTPMf!'DDIION
PEIKNKkTYQIITNpI?IIKIAELKAWYtSDCI~rDVP

GWLWDEFFLKILPLCOSRIZ?GIILFINLTSFF!&GLLEYRVCELLPQAISPOGAILLY
. 659011 6607e9 SYVKYLEKTDOWI'KEf~S'LGSRWLAOAFNVNNKKAIIPLLYVAITGKXpCLPLFD6ItILCPn_05 :~KPRARAALVYALKLt~CGVPKKIJ1J11YDKFNDR~'CGT1DLsrQS-ArQinyl CRNA TransEerase ' TKLPSSKfIGt4JRGAlftTSPKLMSTLLSIGSVICSQAIAKAFPNLF~WAPEri'PSTKnIIG

NYQQIDAIOtWtVLIUtAPRAIAtAIY7IE:.POEPFSLIEIAGAGIrifFTFSPVFtI~.EH
CPn _ PKW1LKIGtpV80PKKIIIDFSSPNIAItDIONGNLRSTIICDSLAItITSYVGNWLRiJttt euo-CHLPS tuo Protein CH7ICtQttDCGYELEtREEIEDIKDSDtKWV5IT0AAXLWVfRQAIYVAIKOKKLKASKEIGONOTATC14.ITYLOE
NPCDYSDLEDLTSLYKKAYVCFINDEEtKKRS00MIVAI4AK0 TRWEIDIKDGEYKIOiRYSRXKSLYOGELVFONrKDCYSINQVAOIIGIPVOKVYYATRTPpIIIAIW1JCIClTSEKA
FOKIYDILDIWOQIGP$FYNPFLPEIIEDLOfIIfiLLTVS~A

GTIRGERKG7IAwVINVSEItR7fKNEYLSKOAAKKLKGAEPKFJ~APNfEPPTEIFPLSNKCVPNEAPSIPFNVOKSO
GGYNYATTDLAAIQtYRIEEDNAIXCIIIVC~GOSLiitnLGG

TAIAPGYLOPGItSNVGFGLVLDPOGKKLKTRSGEMI1G.RFi.LCfAIEKAretr ssItRPE

LTDEAIQERAPVIGINAItfYSOLSSNRTSDYVFSFCIflS.RFEGNtAMFLLYAYVRIOGIK
CPn _ RRIITISOLSL>a:PPEIOtPAEELLRLTLLRTPL1L6STIKELCPHtLTDYLYNLTNKFND
CHLPS 13 kDa Protein homoloq_1 NYKVINSI11IJ1RLD7fAAILDtOtPKPSIANFSSEOARTSNE1GWANPYLYRLLEIIWGYVKFIRDSNIOOSPYJII
tSRLFLCAI~1E0VLATGNHLLCLKTLOtL

FLLGLIFFIPLGLFWVL.QKICONFILIG~.
TIFRPICRDSNii.RtNIYAARLFSAStOWt VSSVRRVCLOYDEYPIDC'h.ELRLPNAKPDRWNLI~BDCLEYRTVI4GA~fItRIAECPtL0571 661179 ESQSIJILIFNYPGVMttSpGNITRIaNVKSYQACVRYLRDEPACP0AR0IVJ1YGY5L~ASffslu:A-ODP-N-Aeet:ylplueos~nine Transterase QAFaISKEIA0G5D5VRWFVV1CDRGARSICAVAKOFIGSIGVWL11NLTNMNINSEKASImTIDtVNVSFSDFOiIKC
ERRNQI11QVFCCGRLNCEVKV,9CAIDIMTKIi.YJ~LLROpKLTL

LNCPELFI7CGtmSOGNLIGOCLFKICEZ'CFAAPFLDPKNLEECSG1DCIPVAQ1CL.RNDttILRNVPDICDVSLTV
Et.CKSLGtIiIVSwOKETEVLEIY1'PEIQLTRVPPTPSNVtIRIPILLIG

ALiGiICPIa'.rVYVPNODW1IGFRTIi1tt11tGLKOIGIfDISSDSSGYYAKAPRGLItQNIfIN

LPYP81ICi1TEtd.ILIIAINAIiGRTVIKNVALFJ1EII~LVLFIrDXAGAOITTDNDIIlIDIfC

CPn 1'OGtCwSVOfITILIDKIEiIASPGIIAAWla~GGRVPYRNAKpELLIPFLRQ.RSIC001LVSE

_ 9DIEFtQERPLVGLWLLTOVNPGFL'IOWOQPPAVLLSOApGSSVINElVN~ILOYLIIG
reW-ssONA Dwnuelease OYKNLWDFSPKCPCGIKFNTNSCNASAAGLLW71NPKEDPAFILalIIIKFHLPPIYAOIFiAIOGiIECOLFHOCLS1 'KACRYAIGNFPNSAVIHGATPLWASNLVIPOLRIIGtAYVIIML

ISACFOTIOEIHKFLYSHLSSLYDPGLFLCIISKIIYFRLLLARDRIttNVItIYCDSOV0~f1'IAmODSIIENTHLL
DRGYTIKIVDKLRSLGrIICIQIP'DlIEpEELITSPKSLALRMiL

GVALLVEFLRDIDVHVSYFFLGAILRQHCITSTLIAIG.KLEJCITLLI'fVDOCITAWCNS

OITPQCIDVZITDIOMPTG1CIPHCV11TLNPKLRDtfIYPNRILTWCItlU9Q.ARGVti~tItbICPt7-0571 SRNLVPKSOCSLKICIi.DLVTLCfITDSRrIfLiGEtittVNVAYGIKEIARGiIRPGLDMt.CALCCTIS6 hypothetical 0rotein CVCKSEVTSTDTVLKIAPKLNSLCALOOPA10GVELL.LTOODCRVD11LL~TfINRERORIMAAPINO~ITO'f~Cl~
'~SLGEHSVT!'fGSCAAAprI'~11'V'iL.IAdIDpE

IGEVFODVOtII~tSNPEILtOMIVLSSTAWNARVIPIISARLattTYtiKWVIIAIQRGIAS~GSAVSPSACNSfSTL
PPETGSLGATJYpSApSAGLISLSGRTORaObEIfSfi0D8 IGKGSARTICSFPLLrGYLKKCSSLLLSYGGNDtAM.llilatiDIM>mtICKKFVHLVNfSLKSISRT8SNASSf'.E
l'SRA68SPDtrcDLDSLSGSERAEWEGPtDP'GGLPLSIIPNYDII'01f )CGDTLPtQ.EIDAYADFDAIOYDr ~n tEPtGIOf~EItPIFYSINROVRYPICVLP'~iNLASIIJ1PLIOIPAVOOR~O1'K~iHIVYVDE'J1R&SFIIIIRN
GOWSTAFSIXYBNitkTK14QT1C

KLYLSQKERNLEGYAFGLGRNADALKA.SWNYPLEIAYTPRLSOTSCSCVItiLLVRDIRISPADLDICIAKFCVCYET
INSOifI'GRVKPTI~ERSG711~IYtptIJG.SNI~lt1'AWYORIIA

SEPRPSD
KESS>iGYTPSAWR110A1fYCICPIWImVCGLXGIIXiKITPAPDFSFINLTP00GRNliblfl' CPQlWGATWPNVNIIIrtGGIKVDI~iIHt.CGITTMrTI'F.~DDD'fNITSI7tST81001NS

CPO, ISS1GEOSTIEED'!'IOtDDPGOGFDDNAIPCTNCPPPPf'P' 0561 651759 650115 PPNLSSSRLZTI~N~1I

_ t.~iVlYOtdXtAYDSNG~SISDLNQOLCQVtrIGtStNDVNPPIVILPttiTGD1'DPb00AtGG
seeDiseeP-Protein Export Proteins SeeD/SecF (fusion) SGAMWKVKRNFAIIICVPAIALYYVLPTCLYYAKPLDRKIDGNBAEHIIKSFTIWAppVVTEOOGHIIINIIORNTOSI
GOSLGATPTPOPTIJUCIVTSLPI(ANVSSSSVLPQPQVATII

R!(OVIPRVSAILSSIJILRGNI00HPAIPDIVSVR1KRGEDAEDFICNLVIIDEPNVPIKSATPOARTAST51TSIG' ICI'EStS'ITSTGTC'LICSVSTOSI~ICfPT'1'1'fRSCCrSATIITSS

RLNVYCYSREHDDNVIOVASSINISLVESDFSFVSYSSIQ~t~fll7ISSILORVYSACTiPKAS'IOTPQAPLPSCfR
HVATISLVRNAAGRSIVIQpGGRSQSPPIPPSCCC10t~11Gi1QtJlA

OKOCSCSYPSIWETAPKLOL:QYAIDiLSSGFEVFSSRLSAlCOOSFSSNQORtJIFLSRLSMbOVASIL.GQVVNQ~l SLSNDA71IDVEDOKLLKSVYLTLSpTIICIRSLOCPYIEGLRLDCSE$SL11SSIIYCPKE

RKIFLTLHSDLLAORTSISKEORLDFD.SRLAVEKptLSKNLTWVEDYlrc1C181pW1~CPn'DS77 665117 TQCKIILOGERLLOCIAENLTALTLHRP71AESCDLIPEN1PVFCAQPRESiAFr3CYIFSPYe6C fauilY

NTOCKHFSKGSVYILGKGLRSIVAKYOpCCGKB.OSFGONLYNCFSHTFJII~EVEdIACfISKWAttI'KHRKERADH
KKGKIFSRIIKELISAV1G.OCADPKSNARLIWVIpKAK

OpvLEIRHPLpQFLDVwGECFVICXLOCAFLEVKDIODRLIiTVNOItKNROSDLVRNNLQCiNIPNG~tIER(iZlvt A?SALOKNFE6VPYELYGftaGVCIIVFJIIffONIQiIlTASOIGIIAIN

YRHAKCSMDLQERLSAPIPYONLFLLNNKI.fA~fRKISI~HiILRLGIDFVOGROLLLSFKDIOIOGSLVEPCSVLYN
PARKGACTV11KSSIDEEYIFSYAIEAGAtDLCI'EDEEIJFLVICAP

HOCKOLTDKEDILKVSDES.CARLNKLGVSEILPRDGDYIHLSVPGSSTISSSEILGTSKSLL6SVICLKLISpGATCS
EDRLIYLPLRLVDCDEKDGtAMALIDWLEOIEDVDOVYtC~IN

tiSItIVVNERPSSYS716RYEVDAFLDYiJVlt1'SDApGKTBPttIN1'111SALFNEEVDVPPSVS

HEAITKLKSEGt.u'SPSGCETPSTDLD1TFSNIAIGKOALOKANPLVIVFRNYALDGASL

KDLRPEFAAGOGYS2.NFSV%DTSPKKttAEKLSPttStIfllvfSAYCOi7GISCfANGOYS1WCPn_0571.

aGWRNAWIDCYNVSSPILNVPLKNttASVSGKFTNREVSKt.ASDLKSCANSFVPEVLSEENo robust holnoloQ Dresenc in Genebank/ElOIL
as of 11/7198 TISSDLGKKpCI'OCIISACCCLAMLIVI19SVYYR1CGVIASCAVLWLLLIWAAirpYLDASAGGIRNPIVNVCIYLN
NFORYLSKYLYRVFRPPCRKKTFLSSHRVLARPSFPVDYCPG

PLTLSGL1GIVLANGNAWANVLVFERIAEEFLISOSLKKSVEKGY'fKJIFGAIFDSNL'1'1'KIYDLQETYEELiI1 14LFOGALRLOICWFCRKJ1TRKGKSVVLGLFHENflDLIRINRSI~RQ

'JLASALLPPLDTGPIKCFALTLILGIFSSNPTALETfIICFFFMLWl9rK'IOHTOLNNNMKFVEIPRPfNEYLVYHf HVNSVVPREYSLSCRSIFI~KItFKEYEORFPLYWU1VAWEfDINAYL

:IKHDFLRGCKKWAVSCSVFLL.CC:fALGFCA4MSYtGNDt'!(GGYAPfINPKEHCISDVALRCYXIfRVOCCYCRA

OMRCKVVHKLQEAGLSSRDFRIOTFCSSEKIKIYFSDKALSYTKADTSLSPKINDIInr:..r AVGLLSE1'GLDFSI'ETLNE1'ONFWSKVSSKL&KIWFYOATIGLLGALAIILLYVSLRFEWCPn_0575 oooSZ4 56598?

'fAFSAVCALIHDLLATCAVLFIAHPFLKKIOIDLpAIGALJfIYIOYSLNNfLIIPDRIRYhhY-Nnino Jroup Acetyl Transferase FDROANLFTPMFfVLVNOALOKTFSATVM"fATTLSVLIlILLFIGGSSVFNFAFItftIGILSIFGRVWRSFHTatIC
ONTGILGLEIRYTLPSDATYMLKWfIJDPKILACFPIOTEALIRCT

U.TISSLYIAPPLLLFMIRKENRSK .
VNPYNCPYRYHSSLTAV1'WuNVA4IfATLVWFYVKVSNNALISIIVGEEFRNKGIGTJ1LI.

NNLIHLAK1'RFKLEVLYLEVYLGNPALHLYORFGFVEVGRONRFYKDEICYt.AK'1'ITtEKD

CPn_0565 655741 ti51531 L

r."r94.r hypOthetiCal protein NKLFCFLIFC;FVNISAILFDSSFLLKIKRHSKRM.RSttKFPRISISDLIPfQMVIWw~GCPn_0576 X67513 5b6491 ~NVNYVtrNAOMLPKKILGGVLACFCLALLCCMFAAGVCOTIFPCiCt~IILGLVLLGFAY.prtB-PCPCida t'.hain Release Factor 2 Inacural UGA tranle-shift 1 t.QYSKn.
iliRPERPLFRETKVFEKPINWIL:CLSLLQSWKKIRPGCYYMPGCPOVEICDGSOMpCB.DKRLE.1LRTEISLWRSL

EIVfKtFOKK~DRtPfSIFLt0EMD0IALROCIEKSF.LSRKTFALDPSWSSLLStIOREE

rJ~YLGPKVI~k:SEDOA:iDRTHPK,iAIYVNISDJ1A(tEPQCRCYIDAYTKAFF'CVLDOIGDCPn 057ri.I
ne7S!:1 IMIVKKIrrtYVLTPILGVPDALPKELOENLKLGSOAAFLYSAEOVAKRNREEKODSIRIKPrtBIn.tcur.ll UCA f:.ameshitt:

F I FTDi'T:: fT::L'f F:ii'IOt: :".'rI'H.M'PI:iLSCFVGEOE,SYTFAMUEHt.DKRLF 14RTE
I:iLlIR:::-.'ftl 4S.h W hntr r:Sf.9911 CM IIS'I7 ~ ~..:Hrn, 1,l.Nl'.S

~.1.:: t.,mi 1V :.WIO 1'fM7.11 .axnplrx Pcrrn.in I:IrIYCfAl.Itlld'VDI::11'rNNAE:Y.FP::LORLPNHVAIINDCNRRWYKKIIREECGHTHT:E:a'N.'iV
KtIKN::AFiOIrVMi:."lGIl.VLV':Y.t:I~MrRTEIVYKVWF:1'IKKIItK:OrQKNKRNIL

alYYr:AY.Vt.fYItINAVtA)f.GIKVLTLYTP.:TENfT:f.PKEEIOEIFNIFYTOLDKOLPYLMfGANIaKVFY
L:'Df'IDMF~YdfYLL:;RIIIYK

r7lK U: LI!r' I::Irt.::Kl.rKl :IQTK
IMIV::RMTA:iF:iRLELVLAVNYrX:KDELVfiAFKKLIiVO~:r ' mh~
~157A '(agy,..H
Ct II.tIKY.I::::nlri::F .
_~t.l::::YI.Or:r:LTOPDLLIfeTr7:EfIRV::NFLf.WOtAYTELIITDTLW. .
I'IN-f1,rld.fl:ItHVYIrhIt::RIIIi:K , n y.mr1 ptln:a.htnylcttl.t;:.

TTtk:YIVLt:;I:aATLPIL\F'.:4tA:a~
II:fTMI.frPfAIfWRI.PKK11A1tUk:LHIAttI::WJI

"Ityr.r./ ..'.nN14 e.S7Nl~l VIIKRVPPXFWKV::K:II:NP::fIU.fVFr~:fN.lf:IrARt.EUKERLA'rFtlffL.filrItYFAII.

..,p:A 1In.::l.ll.ll i.l.ll . 1'yt r.YOIIYY:I::YI::RN'fK(:F:rft'IIf:F:Y::I!I'tulAI
t.lylyl r.ln:a,.r.,;,. IA
VhY~t:LF:::Sr::1'ItYOttR.T1<al:(YIMII.

LKLLKNCT'LTLi.NNfiHVIPNTWIVGIIGDLP.1R1.._dEOAFKNYDPSLPr'.LLrLSHNPDf'':.FLYriDiR
I~FACP'ILFFFi. .3:SFi.tli;L:-~Vltld:..;E:u?.'Ff".'IIRf".:F:KSYP~i ' :TRitwYf'r:DFyL"~'HSHGPOVTLWIPK/MKFFERLSGLC~IPYLARCIFJTXII7GKDLYV.rCIOIC!!
NHEIDAORKKRYEFIIL1GEFPKLTW'IYtt::iFrfILRAK.~.Rt:VVISLYAWFiCS
' NRC<LX;LKR IRFCSPPEICYLl'C3Y0 C,P~. I
DFRaMNKGSTLT'"rGKLRI~C~II .[dtr ' ' "

YfiNDLVCFSEVIL'SFHV9~E(7GTLTFS

CPn_057'r ri69110 569993 CPc: 0591 5901ed 6:tt030 yqDP)yteNiudar NuclaDtide Phosphocylase KEPJI:1PLLK'w\T:rNVPHIKS3LiLL.iOCOCTRFr'~SKlPKDYLPtIJCI'FL'fLHSLK-aLSSYatiE
family ' .'.POfAEV~FI~.DP.~.YOETPpEYPVSPAIPGERR0tI5VFSCLCOV.~YPN'/SIHDGARPFIY~:.IHRCTAIC
TVA'rtIIJNLfIILLKPRYFTtL:iREri UHi.D,?."DASNDLAIFPPFGYAV
..
.

ia'1 :r :'.~:If:'!'." y ..,(yeT:.:'nLYlir\lil:..
_ ._:!r~n::.:. A
I 4 :.':d.~...F,rrt,r,.:~ .. . -..~...~ya:
. :
..
i -:.....v ...'.:.i:::~:F'.1 .;,v: . w hTiKilC::: :.:'i i C.'.:.:'::~.....-. y. .~.
.:.'i~:i:.:. .. !. ... , .,. . .. .,. ,.....\...v;, ..
' cAR 1NV L L':
HSDILDEPDFFiiWIPCfQAIYRL'W.iL..:
I

CPI>_0580 669936 e70793 059Z 66113= dd1161 CPn ccuA-Pseudouridylate Synchase ! ."
ASSiI~IPLPRRSNDCFSPPKtKVALLIAYDCtAY~W000PN~SIQ1YIE8SLI0fITKTYip taailY
' RTPLIASvRTDALIfNA:lGOV7111FRAPOMPL!'JWANL.TKKALtIAILPKDIVlRDVALFDDNLYSKNFSIISFK
RFLpOIPVItICi.:..
IYLYpWLISPLIrCSCCRFFPSCSIiYAE0ALK8NGF
' FNARYWIAXEYRYSLSRLIIKPLPNORiIp'L'YTP1WP1'STLt1~100~6LIGTNDFABFANUIOfIrLSIKRIGKC
,r.PNHPGCIDHVPK
"ALQLYLEFYQEID~DSSHFSE

iICRDYNSTVRTIYTLDIVD~.SI ICRGNGFLYKNVKNLVC7It.LDNG10CRYPP~G.LD

ILDOIOJRRtxpSAApAYGLSLHHVCYggPYlIIFGCEpCgVSTSNECCPn_0593 fi8119i 581391 CT171 hypothetical protein CPn_0581 671533 670715 VLC'dtKCNAFKRKTRNL.t~QVLIL5VCL4l4.FLLLFYSAlFRImIYKLHLFSCPLIAKBSItIt PMsphoqlycolace Phosphacase VYGSISOASLODLISLPKDEItYMYGRPIKL41ALSSfAlASNHIDITPVL~fPLTY

EDLRNRSVKSFLROL10IYSI~GNSDEFDLCLRSCHYLEDYDVFFFDLDGLLVDTEPCFYRJ1tELKCSSVPWLLIrtI
IDLKDFFVILDYLRCNIfYPYTSIpGLFLLIKHYpE~IiVDEpCLYNF

FLpACAEFSLEV1MDFSTYYSHTnG'lEIFSKKFIFpYPWIQEYNAEIFAIUILpIYYKSLCSTPEFGYLRTLLVC7l0 90A5SVASLARNVIRCCSERFFNFCNEESRTSNISA1'O~KYL

r:NAGPAL'IPCVEAFIELVLSWKTFCVVTNSPRDATfITLATllYPIIiIKFLFNVTRWYARKSYI~CEESLAALLLi .VNDSGYVLttEFCDEDLEKVlRLNPQSpYSONP'1SRL~ISPRIIE

PKPYGDSYDYAYRTFAREGMIVIGFLDSVI(GLRALSKIPATLVCIN9171EI?FLDYPELKLAOISCQRVGPRVpEDO
DEEWVODGDSLWLIr110iFGIPHDKIIGKNGWiNRLFPQIV

GKE!'!'SYPSiDVLTEfi<517QKLL LKLPAKQS

CPn, CPe1,.0591 682517 681958 _ phtYl'-phalylalalfyl tRNA Synthetase CT165 hypothetical protein Beta KNPNALLKKlONRLV100iDKMfVLYLOAMiIiJQKRIIR10iPINI'YHSSNi'1'ETRRLPTYYKNTCNY1'CVIVK
SLVKTSLRLSSNRIPITI.LOTYPSEPLSTICEILP)1CDNIGIGEIITfit SNIVLIO.IILRIS'IVSLLTSCSFSKNSATCPIrfPERITSOKDCPVLLNPK'.FITISPPLYDWLYSFASVITAKIL
NTIFlIPN7IDKLAVATLTDaGIF)~BHIKCApNCEAGLIVAtuF'GI1KL

ISPNREVITAYSFYCRGpGNSIITPECVLYDCDGLIN8ITKLEFRYINPRLIB:VVRLLGQPDBP~AYTI
I~RALLELPGTPILiEDLiITVLC

OHPKVSIIGFCCPKHFHFLE71SGISLSDtJit.OCtA71TF71LDFPLPNE%I.LiiTIKKLYIDINtSLEISLTPNL
01~71SPLGLiIRBICtM'QANLVIPKtFSFENLFTfAt.p~DPDICFF

NSDPSLSNEIVTGTLTNPELRLTGOGSHTEITVCILDhZGOtBIEALSSAFSYWITCIS110PSPIKLOt$LOALKOKP
INJIIVDlTNYIH~LSLGQPLH71YDASNVAt.DS

LItVOt~'pESLTLIJKiElVLLPSGVWVRDONS

~PI>_0583 67239 672717 AYFLPEALRA9t7KLLPIPSESAYRFTRCIDP~WPALpJIdiIfYlLEIFPGTISPIY88 CTt66 hypothetical protein CEICRBLIfEVAtJtPKTLORILGRSF'SIEILSOKLOSiGFSTTFpCtSLLVKVPBYRItaIN

IVLSFFIGK'1'KV'1'pRFiIOJERTLLLLWKIQOGLFLAILDLTQTFSSLT3PELEKYLKOKKEEIDGVEEICRT85 1:MIE'IDNWSCYTPIYKLKR1TADPLAN71GL.QEFFTPDLLDP61YA

IFLSCIDRVDLOIREPWNAFSSELPpDIGFELEEIADIfIIRILOTDK11NYA0KKXEFGIYLTRI~KEtISLOG810t rtVLRSSLLPCLL.ItS7NITNt.NRQAPSVpAFEItS1WA10~8Q'!0 ERP
!1'OTG71ILLTEDCEBRSNLPKPSLSFYSLKDiiVAILLYNNNLSIDALTL6SS11ICEfllllf QOCVLRIIDCOSFATLDOVNPEL7UOtAQIKHPVPFAELNLDIi.CKM,KK1TK<.YKB'YAIYP

CPn_0584 677659 673798 SSTR~.TLTVPEDIPANLLI~IfLLHECSKSiLCSITIISIYQDKSLETRNIOiVSIJILV/p0 acoS/DCrB-Z-Caaponenc Sensor YERTL&NDDIEEEYCRLVALWLLLTDf!(<.TINS

IRINJITlOIRKKRNLVFTPIV?OSKNLtIPPAYFZLEIKARI1'OSYKDISAILTAIPDGILLL

SITONFLIGNSOARLILGIDFiJi.EIGMtSPI'Wi.pD'ICiJGFSI0GL6SLINPRTLIIiSLCPIL.0595 CKESKFKEYELFIRIfNL;SGYLFIpIRDRBDYlI0t.~4RTERYIDtIJId3GKtt1'A2LlltltTRCT176 hypothecleal psottin NPLSGIVGFASILIOtEISSPRHQPIIZ.SSIISCfRSLt~LVSSlILEYTRSOPIiiLKIIt~>z.QROYpIIBCOLL
FCVCYFANSCSAYASPRRODPSVIOQTFRNNYGIIV9001~KIKTBDG'tI

DFFSSLIPLLSVSFPNCKlYRE(i7lpPLfRSIDPDR!1.RVVWhR.VIOAAVE'ICNSFI1'LTLNTKVLKNGJ1TWE
YY9GGLLIIGtITLTFPIrI'L'ALDWOIYDpGfILVSRRTFFHGLPB~E

TSGDISViNPCTIPSEIlIaRLPTPFFTi'KREONDi4taF)IpKIIRV10GDI0LKT8DSAYLPNF~CIFVLTRNPOl ~BIDSD'i'I11CPYFIE'lTIIQCNVIEGSYTSPNCK7fSS8IN1~8DYR8 SFFIIIPELLAALPKF31AAS VF$SltlI PESZTHYpItCpPHGLItLTYLpOCIPNfIEE

1RIYGi~ODf.TTIVl10a3CKTSEIAYV10fr1IKEGLELRYNGOEIVAECVSNItl87FU8iE111fIY

CPi1_0i85 67518D 673865 AGDIOKNDiYYRORSVBGI~FJtiXi7UlG

siailaricy co Cps Iucl~,Z

ISLRRKILRPIBtPSlGDCS&M71TPADKSFT!'ppPSFVREIGSIiBiFVFSPLTLLEIEGD(ACPe1_059ti IARVpDO~IliItTIVRVSLIILiILLTIIGGCLLVCLLPAVINFICDCLIAtGAVIF11LALIada-ssthyltransttrase LC1.YDSpCLPEELPPVPEPppIQIEDGRNETREVLEC1'LLEVLLKDRDAKDPAVPWWDFAtMiIDCfLIPKIJ00I8 4S0ACSECLLIAKYPPLAVIVHTDNNLWIC1'NLSVAPV18CLE

CEKRIGlE.DRKLRREEEILYRST
VADRLEITtRASYflIFVIGPIWiKANpEIWdCSRYAGMEtIPPFSSHFAKDLIPSQYLEIiI4CVAtIPPCEpQTYAE

DGINTVPSE6GEKEISALADLISLpppTVpOt.RSRID~OKRCwtAi.~IIHOSpKaIORAIAKICI'D'1'IIFRTVG
rIaCKG4IPfLLFFPCHRVHf;SHGEI1NYVI~rPVINEILLK!'D~LSY

N~tP~ISORACEG'1'EI4DCAEAGOLEKDLRAOLKSIIOESiItI~G1'INOOEKAWRRQItI~KLER

LOED4RLTGIAFDEOSLFYREYXE&YLSDK4DND1fIL0EVN718KSGNCLESLYHDYEKQCPeL0597 681215 LEQKDMIWKAAAVNEEELGKpppB~fEpTpEIRRLSTTILEYODSLRGFJMJtDFQELoppC-OliOOpapcide Pecmease pQAYSRLpfEKDVKEIflLEESNIBIFAIM.FEKAQKENNAY1WDJ1DL.EGiWIP'CEIGB~DNQKHPSFYORFL571 YYKtd.LABLSWKFFISVJILlCIYAFLFASSKFLWTf4IICEIFFPLL

WVt.TDSASLSOKKIRELVEENpELLKAIaFKSNEISpLVADAVG&KEISKLREHIEEDKRYLFFPCYYTICPV~.FFN
ViJlIrl'FPFTILSFKLTRGWLRRWLLCiLCII80CNIFANAYBC

DGLRALDIMNAQAIKDCGAQRKCCDLESLtSPVREDKiIWP'>G.EffEt.ORLOEENApLRAINQDPALABNLKKMIA
FJfIfPINiSKIMSEt4IMLLPKC1'R15lp1ERRYNSTYLpiGILIG

EVERLEQEDFDG
KYRKKOGSVKKYOVAFEEf~QSPNPTLRIILiMOrDGICLKRLOQRVOKIpItPYEtIRpGJI

iINWITONYRPFWALTRIEHF3.NLIDYDiJWOQpEDLCIAYANVEKKAEPYKKBLLEIRpV

CPft_0586 675993 677193 LEDY11KLRSAISFIQDKRLWICKESEDLRILINPPFSSFiIWEDOWGGSRE?84KYVPIiWpL

atoC/ntrC-2-Component Regulator SRVTRItDLtaAt.VFCIRIALWACit3ITIALAIGINIGLVSGYFGGTVOItII~RFTEIirtE

KEKINPSRGENHAIKNlLWDDEPLLRDFLSELLTSQCFIPDTAENLRN71T.ONIRSItDYDTNPVLFiLHLVISlTGQ
KSLLtIJIYLLCCFSWICFSRYVRIEVLKpRDRGYVLAATMGY

LVlSDMSMPDGSCLDLIKIIKDSSPNTPVLWTAYCSIENJ1VEAT810GiiFNYLTKPPSSEStiYYINVHOILPNrII
VPViSLVFPAIOIANISCGGLTFLGLGEESSASWQitJOtOCVIGF

ALFAFISKJ1ECLI0JLVNENLFLHSdtTFDSHPLIAESKAd~OfDLt.AfAKRA~SSS11NIFINPAESAVLWPPAII
L'lldLLIAIALIGDCVRDALDPR1.QOS

GE90CCKEVLSFPINNNSPRANNPIfIKVNCAAIPETLLESELFCHEKl8IFTGATTKKAGR

FELAHKGTLLI~EITCVPUNLOAKLhRAIpEKEIEHt~OGTKTLSVDVRILATSNRKLKGCPn_0598 68971?

tODKSFRpDLYYRWVIPLHLPPLRDRpDDILPtrINYFLMtFCI~KtPLKTLSPKADELopp8-Oli9opeptide Peszaease LLNYPWPGNIRELSNVLERWILEH1'SLLTEDMIJ1WEEOCSVLKYILJfRL\'LlPLTLFAIVSINFVIWAAPCDVLE
EKSRDAIGEAGKSDKNRSY

KGPDRYLQFRENYGLTLPIFFNTRPKITHKKIOTALOELANANIII'1'PSAKNAAKSLVYWC

CPn_0587 677779 678111 DCAKFiMPALLFE~DASRDDK'IRHIIIADLFIRCGVLpGFVCPNLSPI4pMQNICEIAESN

yvyD_es conaecvad hypothetical proteinAFWROWEEDLDTKVEALKCYII~DNGCTEVFCYSSKDFYiKTFFLETRFARYNSRVLIILD

SYCELFILSTLLKHHVTLGDKNRPHRKfIVSSKSL71L!(pSAS'l'HVEITTK.1P'RLSNPLKDLFCTLRHDJ1HKT
VISEVIKRLRCSLVLSILPNIVCFVI.CQIFCNINALKRNRNIDHSLNFI

ILEKSDHLPPNETIRWLTSNKWfLCTEVHVVASHGKEILQTKVNNANPYTAVINAFKKIFLILFSIL>lfAfAVFNILD
NNIlRd'IPFTTIPHPYSCLRSPPEVPNEL.S'TIJCRIFDLVSH

RTNANKHSNI(RK~tTKfIDt.Ct.AAKEERIAIQEEOmRLSNEWLPVEGLD~AWDSLIfTLGYVGFLPFCAVSYGAt.
IM7SRLSRSIFLEVLSpDFICMNARGLRWFDILYKNVCNHAAVBIV

PASAKKICISKKKMSIRlQ.SGDGIRDLESAAFtiFLIfLNEOEHKIQCIYKKNOCNYVLIETSLASSLG1'LI.GCat .YVETLFIIIDCFQiffYOAIt.NRDIiNVIILFSVLVGSAL.iLIICYLLG

PSLKFGFCI ~ DICYVLLDPRVDLECRRI

CPn_OS9R 679033 67866 CPn_0599 691927 .89682 CTa64 flypothecieel protein oppA~IiqopepttJe Hindinq Lipoprotein TSKSIKSNAPIIWfI'ATHSLLNLPSSQDSA.iEDSTSDSpIFDPIRNRELVSTPEEKVRpRKRRES'tlfiMYKRCIf LCKILKCt'I:.:uLILLYWSBDLLERDI1L:IKl?IVRDtQ!EDLREtSRV

LI::FWHKLNYPKKLIIIEKELKTLFPLLHRKCfLIPKRRPDILIITPFTY1'GW(,MtTNNVKD(JVI:rpAIPAAt\
:VMLAPKL'/ItDEaFALLFI:OPSYPNLI:
LDPYKppTLPELIGTNFH

LCDPKPLLLIECKAIJ1VNQNALKpLLCYNIfCIt,,aTCtAMACKHSQVSALFNPKTO'LLDFYPfK:ILRTAIIVt:
K('ENL~PFNCFG'IVIIGF'IOLv:IP.~.IJI::PfIVr,KYEEP-~.PDLAVKIEENLV

(Y:LPE'ISQLLNYFL~.WL RDC:.'X:DKEFHIYLRFNVFSdRP
IuPKALFY.HV~1LDL11PDRfHIfM'AIIDIKFFYOAVIOiPW

ATNRAVALR:%'.YEI'IY::V;:VFJIGIJGLV'/PWKAfIT/TNEtT:KEERKVLY~.AFaNI'I:,IQPL

t'1'1 ~(ISHn r:79671 i79175 f'fiFV'ltjlFANf:EKIIEDEZIIt7TYPTN::I~IAr~NF711HY1J11NJ'f IV!%Y:AYYFH:MpDBKLVF

rTA'/4 hypntlutu:I1 Dmcein ::RNI'Lf"IDPUALII'KRFV1'FYE.'TI!:'.LFMDFY.T:KIDf::It.PIMDRONFY~FIBCifiAYN

::::Hr)Ir~J'h:WLR::RPta:KNIITLTPLFTPDCLFTFFAKpW'fWr:DYRt:'LVGI::LCKYTK~VAYI:AVR
FTV::ADIiAYTI'I':rfilr:F::LFP.c:fr)VIH:AItrMIIIIHtFJtfIh~r:L0t~t7YT

IJIIIN':.~.ftl.1'Kt:PIK:DCWAFfJIIKy'!"IALLffA:Ii:KNIQJILI.A:X1NKEId':IIKLF:iLFW
FI::tah'A:::::P::YNKv'IECWlIY:atW\I1.Ll:Ia7.WID1'tl:ff:IHItKVIfIiVIVPFRFRL1,'.

IJIHIfC:::.NI'EFFAJ1IF'VLKLLpYD:ILDLTPAC::LCKII::f.PY4:Y1tY~1:11KL.CKKIIpiIKYYV
Y.:~ITAII'ffAM1'A'1'At'KEL':IIx.'::Ll!:IJIMDL:YjAI'fA:Ytlh'M1WJ1%e'.II:IPPED

~L\C:If.KEEEUILt7AIINAKaF"ELLALAEFPfAfAEKIFYLt'D::WEEKK::ERN.~..~.F.DFPRAILaI:F
)t:ANfiM:'..WWv:F11t1EFAUY!IfH~I:;/h:'IDI.Kh:ffIRLYIIRFIIh:fIIIRFJ11'YA

'II IF.ILFI::K WItIW h'L1 ::I!IIC:iLLYKih1'KN Ih'VTTIIITI
IL f t hi4.'irl~.fYtNfrWII.ItKKhl7it'I_:1~.:

~'1'n 11~'UI r.H1111r: ti7'n;lri n:ltyt~1111 r.~; t~,.. .n1:1'.'./
' .-t'A N.. rr.tnr::l. Ir.wrl.rrl Im.:..r~r II Irylru lure i.vl Prnrrin rr. :..rn.r-rnk!I:MnL .u: ..1 11 ' I!'IH

KK:FJYSIICOAKRFONttLPNIIFDIx:LOF. "RPPNLK:iPY,~iLSDLLlfIEL
1LDKAK~fPAEI'LGIi.R.IEIIf:::JLLi.AFR'.":CKL.LS
..'JLKODRLAYGELIILL.~aKY00KT R IHPGFDCIYIAt~it:RIiv:RDFNL4;N
L f : PCEIIG I:.LRKAFAL.iEK
SIf Nf:YHKI tO
F3SLLKEETr.'.~.LNPAKOHLL'IK t LRDFtrfMDFILR.iLGL1JG111CETY11KALPKOVO
IPHSPCL FL9YFL3ADYSOIGJt11L9U~pR;8.~8~E~'EC1NF
A:70!(~ . A
~~1~

R~IR~T
K111NICTVYr70WLF~AKVItKPStCENCEC'..A'tFSRiAHA"E

':Pn_OSOt ~.?7073 ';727)5 HIGRFRIIOS~INEFPCRFAVN1'RIOT:SAAELIKLAIFLDISOAIKOQa0t5AfQ:.

CT493 hypochotical protein OIHDELLFE11PEEEILFMI:RLVREKMESAKf:..iJPIWN:LKtBGIEC
?(DEITPN'fPL4RODSLWtiR'IRVSWRADL.S11SSRYEIASAIAIL:LLY
' FPRINADDLIN

.
a c~ nr; l 'IOS~d2 ~;t.t:5lt O
AFCASAAVS I IFTANPi.AQIIF I DrCLJIIfiLL2I
PLY IvLLI IG I IVL:.YGIYLFPOORE

~D~. ~''~tt~ : ~il~.~ w:'ti :' ~.':~:~ :. .-...,. .. .
,. . ;..: :,.:: ,...._:,....
:c~~;...:.._ I..\_,;..-,Y,;,.
'VNh'f:
' -_ .
': vy;,ll:.::::... :.:r..~:. .
KTAPLIAVItHKDVI?u:KM'AK?tvN::.t:'Jt'KnIYLKDHYKvIV10NO1:PCuiVFEIDR

D.iGFIiKPIGFOENLEALCNKTStIOLLKYLLKGILfVC'GASLLIALEFSFPLYFFLFSGKTRFWILCRItGFPI'f ~IVNCi.CAS~YY'J:CAi1?KIYA?SSSLICSICVASGPFIlNK
LYSIQ

VIPAPCLACFFLTLFVCLVTRLYLLSCIIGDFFE~.ASEYLOGAVPPtOCRSOttIVEiQSHL.
OCLNRYGYESDLL:Ar.%KDIGPl4JPYTPWfSHDREEROATLDFLYGOFItDIYIO~LPii.TK

AAAI11'KISINLONOEYSLLSEIFKFLPKHDLIRKFSCFCFWILDYFGFRECLLOKAIiJLYIEKLVIIfIICIIRIF
SPEKAKOELYIITI\"GATKEQVL.COIVaYCKIC~IYAVICSOIa~~RfR
' F1LT71R VASAMSSPLVfCIIIKHDILPLSHDAAYIPPYIJ1L
KWpAIpVDLSAHVSLAOAYVALSGLYADPRKYPEFDANYWIPSGRYS7lEI0Gtt ' 'IL
RAIECTOIINEYAPCNAI~MJ10LAYSYHDLOhIPHEEIOEYEIVLKLKPftWl1115KiL

YlppOt4AKGIRIYLEIKKRDYKKSOKLIKFYf.IItYIfYCPeL.0611 707175 705793 CPn_060) 691136 695185 adc-ADPJATP Translot:ase PIYKSEFSKPItPLFtiJIFFItCFNYCLLKNOID
LAAYLC
VFIAHK1R3KtP1p58EYKPPSA

hweZFerroeheealase _ 71NFOGPRHAKDI4EFLISLLT~tDVICTF _ TPAYLL M
y ~
YS
I

. YYVHSB~S VPL.IffG.
WICIMtLIVt3IpCLVSLFLAKKYM I
t.PRVLNRHLFfFtA~(RVPKYLPpYOSLQtiWSPIYFO?ETL.71KTLSEILPAPVIPfIdtYLELLPOGLRGFIVMI
IIYWS
I
PYC!~SLIILNSL11DKL.Q
CLiiNOITTI'!FaGRFYALINTGLMSSICAGEISYWIIfIKOTFVAYSFACD~IitSVIIIJILT

PSTIIEKTLLALRTLHTRHYICIPLFPHP1'YSVTGSIVRFFT00MEIPISWIPOFCSDSKNLITCSGLIHIWI'YARI
HHLTIDTSIPPSAAW1EDC'dATANLKLOUIPKAKARHLPLJLL

FVSLITCHIRDFLOKLCILEKECCFLFSVf~LPVRYI50GDPYSKQCYESFS1lI11TlFKQIOSRYLi~GL7IIIYLS
YIfLYIRLFMIWKD0YS0IYSSiIVEFNCYNSA.?TLIGV1ISVL117L
' YLPL .
VLL1COCIRId~CALYTP1.YHLVSGLLFFGTIFAA1WDISIFGGYi.~ffPL.71L~W1' S~IFLCFOSKFGPGKwiSPSTAOLCQNIDTOKPNVIWPFCFISDNLITLYEIERD

LRSRGYRALRIPAIYSSPLWVSfLVDIVIfEN51'WAEELIRSGI0011GIROtllbNyLgRV"TKFfFFDQI'K84AP
IPLSPEDKNIIGKAAIL1GVVSRICKSOaLlYOGLiN

IFSSVAASiIWIJILVGLIIMIVWLAWAYIGKEYYSRAAOAVATLKOPKCPSSSIVREAO

CPn_0601 695981 695196 EIiY-Glutanline Bindia0 Procsm NSEil0I14VKIKFSW1IVNFLICLIJ1VGLIFFCCSRYKREVLVGRDffIWP'P1LOFGIY9 707631 TSaLNAPLNDLVSEINYitENLHINIVNODWVHLFFNLDORICIOGIIFTSVLPlLH~.~tYQ14 pQsA-Glycerol-3-P Phosphatadylcransterase FSppILL'IGpVLWAODSPYOSIEDLKGRLIGVYAFDSSVLWIOI~tIPDAViSLYOfLAKIIOtQFCNIZSLSRWLAL
Y!'CQEI~HIRLLRIVGAI4.SDIFLDCYi.iIARYIUfISRLGS

r_5...'TTSNCYWLiaPVTLYPJ1LIET11YKGRLKIISKPIlIAOCLRLiIILKOTRGDLLiLDPITDINIVPVCIT
OLYIIECSIStIWLFFICARDLFLIItV~CYGSLVIOf,T~11~Y0YGSL

ACLVK1'RRSGKYDAIKOAYRLP
lWCitIF'rVVOFIILLLYfAOCEIPW1~GLVPLVAIrcrFLYFLERIlIflYIO~LA

CPn.,0605 696777 696150 CPeI_0616 708701 710137 yhhF-Nethylue dna8-Aepllcative ON71 Hslieaie LRKLCSSRGOVRILrGKYKGKSt.K'fFSNPHIRPTSCLtnCFaFFSiCAEDIDGAAFLOLFATGVHYLMJU1NOLYCE
DFYYLEH
CIQ
' ~IIIGFE7lLSRGAJ1SWFVDISIAAIOLIHTNSALIGEOLPWIFRODAOSAIQRLIKO.
GVPLPSPPHSKESEHIVI.C
TLTNYESSLIJO)KS1 iIBFJIG'fA7IYLiEYVDZ
KRIDiOL'fVIGGPSYLITi IDVNi RGEtL

KRSFDLIYIIfPPYELCt~ICYV1'ttOKIVSGNILNPEGTLFLF3JASDEEIACEGLTLRRRR.
.
.
KIIFRVLODAFKOIRCP
IRS1CRILRRHISTAKEIEKAALEOPKNVJLEIILDEJVONSFFKISt75TSYSQYTLVAtSCi~

KLGK1'YLAEYIVP
LTTTfDKPYLVOIQEROELFL~OtLIpGDNIISFfTGIPTHFIDLDOLI11CFSP9NWILMR

PAIKtKl'ALA~IIAlI4UCFOHALPIGIFSLQlfVDQLIHRMICSRSIIFDSK1LISTOOLBDH

CPn_0606 69749? 696707 DFORIVSVIlIEt40EtlLLLIDOOPCLKVSDLRARUtRl9tESYDIOFLIIDYLOLi.Sri80'fI' CT188 hypothetical Drouin RATFSROTEISEISRM.%TL7IREItIIPIICLSOLSRKVEDMIWRPlIIiDLRESG8IR10D

tOiIYCLADLIL

!
SDLVM!'LLRREY7IDPNDKPGTAELIIAlOiNICSIGSVPLVFEKEGIAPRNYBJIF~IS
SSYSRItOLRFYLGSLO
EDIVLLPGDISWAIB~iLSEANKDFAFICDLPOtKYHIRGaRiOYWSSASTSItITAALPPSLY

YLNp~'71LLTPHL71WGVRLWDSPTICVKKJQJFLTPSTOEOSYTEQDEKIFLRELGRLKR

AFAALPXEVTEVIVKrNYPPISSDGTPGPISEFLGDGRVSLCLtGHIHKVORPIDGIGII, IAGIHYILVAADYVNFVPQEVN

CPII_0607 698910 697577 010C-Glucolrl-P lldwyltrantEstafe NRAIOtIIflIMPEASNFFSSHPYRDlLVCVIILCGCEGIfRLSPLTItCRCKPfVSFGGRItIa.
IDIPISIGISaGFSItIFVICQYLTYTL00HLFK1'YFYf90VL.ODOIHLLAPEAR0000I41Y
QGTADiIIAIDC.t.YF~DTEIEYFLILSGDOLY68mFASIVOTAIATHV~IVL.VAOPIPEKD
AYpIGVLDIDS~R.IDFY»PQCKLVLIIRFOLSSEDRRIIIKL?~oSGDFLC~~ICIYLFR
RDSLFSLLREEEGNDI~sKtiLI01lp10CROQVO'fLLYNGriIADIG'1'IESYYEIW IALTOKPH
ACIOIGLNC7fDDtxIHIYSKNHHLPGAIITDSNISSSLLCEGLItINCSHV8R5VIGIRSKIG
ERSWDOSIIIlGN7IIlYGSPStiPSLGIGKDCEI1DIAIIDF34CCICSiGVKt.~ILKGYIKYOS
PDKKLFVRONIIIVPOGTNIPDNYIF
0608 699690 699016 CPeL0618 71Z)00 713010 CPn _ lplA-Lipoace-Protein Lipase A
Oridine 5'-NOnophospifate SynthaseKNHPfCNCIFLDLPGIISILHOLOIEFJ1LLRVANONFCIINSGJ11(DSIVLCISAIA?10WH
itlmp Synchssel-truncated?
' PLYVOMtLV
ISRJIOADItIPIIRRYSOGGIVFIDSM'IJFVSWIt4JSSE71SA0P0ELL.AWrYGIYSPLLPN
VSPLYFVIDtGRRLWPll49YEDAKLRGQAV11ILYQICaIKFGIWIL7l5GEE3 ISSPEVI41VATLIWRLRPSFNSSLLGGVPYT111.'fL7vTSISLKYNIPNVLRRKfit~tiVOPTFSIRErmYVIGH
K1II0CNAQYIORHRWVHH'NfFLWOIDLDItiSYYLPIP000PTYRNOR

SDAIKVEGLFTPCQIrLVLIJO14VSSGKSIIETAVALEENGLWRFJILVFLORRItEiICOPLSNEEFLTTLRPWFPS
1LDDFLFRIKASGSLLFTWEEFLDftELEEILAOPHRK11TTVW

GPQCIKVSSVFTVPTLIKAGIAYCKLSSGOLTLANKISEILEIES

0609' 699672 699986 CPtL0619 713162 713013 CPn _ ndk-Nucleoside-I-P Kinese CT190 hypothetical Drotein RRYVYThtEOTLSIIKPDSVSKAHICEILSIFE05CLRIAAMKM0iL50TFJ1ECFYFVNRE

ONTKNSLIRFMILIRLFLGISLPKCFPLYLEPPLVLATFOCTOFVGTYSEATNPLYIDNLRPFFOELVDt7tVSOPWVL
VLEGANAVSRNREtI'1GATNPAEJ1ASGTLPAKFGGSIGVtMV

NLNYNYTOELLYKAVPCNYKSIYREIPLIIFPEVLIGSTPTOSTEHGSOZ'LFiJAAVEIAYFFSKIEVVNASKPLV

CPn_0610 '01150 7000:9 CPn rho-Transcription termination Factor_ RLFLrtFKGSIHKCERSSEILPRVKETKKHAYVSMOEKSCVGECAWASESEEAESVTVTKruvA-HOlliday Junction Nslicaee IAKLORNCIEELNIIJ1RCYCVNNIGSLTKSaWFEIVKAttSERPDELLICECVLEYLPDCDKMYDYIRGTLTWHTGaI
VIECOCIC'MLAITERWAIECIRALNpDFLVETIIVIFRCIE

eCFLRSP1'YNYLBSAEDIYVSPAOIRRFDLKKGtn'IIG1'LRSPKEKEKYFALGKVDKINCHL.LYCFHSREERECP
RILISFSCICPKLALAIWALPLKVLCSWRSEDIRALASVSCIG

:iTPdWfERVLFENLTPLYPNQRIVl484CKDHIr\ERVLDLTAPIGKGORGLIVAPPRSOKKKTAEKLtiVELKOKLP
DLLFLDSRVITSOTKITSSCLEEGIOALrLILGYSKIAAENIiAE

'!YILOSIAHAIAVNNPDIVLIVLLIDERPEEVTDNIROVRGEWASTFDEOPERHIOVAEAIKDLPEGSSLTDILPIAL
KKNFSCVNKD

KMRLVEHCNDNVLLLDSITRLARAYNCVOPNSCKILTGGVDASALHKPKRFFCJW
MVIF

. CPn_0621 71x707 7111.14 RNIECCGSLTILATlILIDTCSRHDEVIFEEFKC'ICNMELVLORRLSDRRTYPAIDLIKSG

LYNPSELERVYLFROAIJ1DL'1'fiDAIWLLLGRLKKTNSNAEFLLSLKEruvr:-Crossover Junction Endonuclaase TRKEEL

.
L:iRWSSFKDNKFKYF0E31VSELIIGVDPC'fIVACYALIAVEQRYOLRPYSYGAIRLSS

t7NPLPNRYKTLFEOtSCVLDDTOPNAIFVLE'K~FVNKNPOSfMKWDIRGIVLtJIAIIpRDI
173 7011:0 ' ' .
LIFE'tAPNVAKKAWC.KGtLi:iKROIpVMVSKILFNPE~/LNPSNEDIADAFALAICNTNV71 _t n_0611 yacE-predicted phosphatass/kinass R~aPtr<:CYR
V

F
RtfNRRDAKTSEREOGISYDFIRSYSCEYLNWICKLGN4Ll(LLKVSITCDLSSGKTE71CO

.AYWwADEISHSFLIPHTRIGRRVIDLLGSDVWOCAFDAQAIMKVFYNSVLLOC
JEII' ' . 1':1 LEAttJIPEVCRILEfQYHOSIODCNYPLFVAEVPLLYEIHYAKWFOSVLLVHANfDIRRECfm 01.72 '15761 ~.EDFOORuRFU~VEEKt.AQADVWENNGTKKELHOKIEEYFYALKCALv:T'.Ui hyp.~cnACi.:.O
pcotein RFHYKTCR3 "

. .PwKDADINP110O LCNIfSCV
NY:: JR t.t.:: i LKLHLF::I.H: aS..4a'lIY
YH: d.'::R:.'F1LIILId :fn II.lJ ~vl.l6RH ~JV_U_=
:.':FH:Y:Y.1:J::IIJY.E~fC\~a'II~~EHERIIIIJ,IYRF.~~L.::ALEEEIRRREEA10i00L.EKL.OQ
QPf ' ' ImIA-UNA ll.lyaa':.Iz;u I YFJIKIKOLE~LORYVS
fEI:E.RPtt.
wtlJllf!1lF:KhrJtRlk.'::~II~EIY.KELUJ::VSH

H.:IIff::LIL~fVERFRREfNBCKLFVLUA::.:FIFIL1'fFALPENKMIpGOATrJAVFGFitI~>LnIR:AI".
:(t:LEEfAI:::N\.\1'\I:IFIPLKK::LIDL.yEY.DIYIKTY11::FIAKLHEKL.OROICAO
' NKLIKEF.~.F1=fNI::VFDv:hNHK0::R0AIYADYK3Nk!JKKFEDIPWIALVKt:IC:SLICIr\HANP.~.Flf KLDHWI
'f::::h'/t.'::fEKLifF:VLYfDI\I:KY.YAI\t.tlJUfI:UJ'fWJl.i!Gf.IIKE:Kt YLE7!E::VFAUInIIA:;IAKKWEFtJYKVIIU'1'ADKOLWLVNDHWAWNFWAIY~~WC'aI::E~:LIJ:YI:IF_ :FI;tNV1::Ii;:K:aii::

V 1 Eh'n: l Pfr xi I fDYf.ALVt:D::.~.DN
I FGLKX:~PKKAAAIdJIOF'f:.~>ltEt%LLENLIN1VKGL' .:l!fNl-:EROt~fLKL::KH11L1.D::NIPIPI'FIESLTFPQFIPVDEEKLIIIFYI40t:FKTGVPI ~nl l II..I..~
'.'fhry.: s ::KrJfUATVt7VrJIINUA1::'.L'fNiLNLV~\:::DI1FAVAYTr:NIIW..~.LKLEGIlvLTOC:.(:VF~T' .n4 INlr,rtr.t i.:.n! L~.....,n ' t:EE7ta'KILIILKLWILHCDI:fFYufNLKRDCIIALLJ~W:I'JIREI.':YGLAL.AEHLTNFFRNI'D
FIII
IY!rftfl'rlF"MeUINI~IYt'I~:vF;YY.l:7/I:FIrKIIN::rJnfFt4V1:1W
:VI::IJ111r .
IC:Yf'Yi:Jt'l4:fflF!MI'.l'RIfAtltt.Y.A'/!:LINa:VKIIVY:I!FJvLIKUfY..'."f1'LINIOE
KPLrI
t::iYJ::IJ.'lNII:FTFTAIIRFAKEIa:N:x:LfLGt!LPFJJt'F.OYFY:EFVA'ILPIIKOAIL' XY:Y

. .h::RKISKKF::\ItFh::l Yr:F:Y.WKNKKYI..':1't!FIIIIKf.IAFYIYtA:YJKILtffVK
r:l: ttINYFH1111 t L::U 1 f?fl'LF:KVLF::HEH4::1KhY:I'F:f f:
l.:VfLOVEfIJI I I.I S\I.FETEWVLTEEIYOt.:

FEWEE::pFIIEIVEOKKF:LLPPPAKLI:rEYINC.r~l't.:I'7 Ribasrx4rl h stn JPWTSJIDWfrLOALVRESSDL HKKEKVK".sMA'3EP1 t.RKVKI;WVSAKNEKTWVNVERIF:iHP~YLKV'lR3.iKKYYA!(:' WALL::AGUAtHFPETEEEPT.3Jl.r'FE&i.SANFFPETSSATEEEELKVSEGDxvKIG4'i'll~ItI:KAIINV, ~CVt/Sf;.;

CPn_Ob24 7I8D19 717011 Ob33 725979 725743 CPn ~ap~-.IY~er~idehYdefP Dehyroqenase_ AMKWTNCFGRt:RLVLROIGIRNSSV~LAINDLVPGDJ1LTYLFKFOSTHGRPPEOVACr1.?-L2? RaDOSOmaI
Protein ASGKGIMIAAKKI:L:.TOLRCFaDDDL~w\YVHENKKALFALRAENL~.(~IJKWKVIMFSI11K

EAI7HLIW':KRKIGFI-iERNVONLPWKDLCVDLVIFaCTCiLFTKKEDAALhIQAGAKRYLISKNIARALTIKOEPYr%KYH~
' EGtifITUHA
nrvllr:rtl"f!~'.frPINHYTFtIPP:ICDI~IL~ItA.Sf.'rl'4t.'trIPIAKVLLIBIF'::".

"'Y:4l:N7a-=: ::i~\:::'::.:./.\':.':..::.Fi:LF':Y;.'."~CAFRVC:::... . , v ' . ... . , .
'r ...~L~:::.:.~'.~L'E:'.~.'..'.:~I':'::,.u_:.:FC:.:.:.:.-. .::._:,Z:.L.''.;.IY:.L:aYvtrY~ rllo-Lte RtDOSOMaI Protean IAtI~7DRFFKLVAWYONEZCYATRIVDLLEYVEKNSKI
IIINIPKATKFRKOGKGQFRGLSKGaTFIIDFGI:YANOTL.EP(~IVI'SROIE71CRVAIIIIYL

KAI!<SKVWIAIFPDKS':KKPAETAMCKCKCAPDMWVAYVRPGRILF1YANVSK~t.

CPn_0625 718188 718060 AAAAAIG1CIKTAPIKAVER

r117-L17 Ribosomal Protein AAHAOp4ITW~S

vtpNARKKPAVCRTSSIWRC71WJl4.KSLIIIYERILTfLPKAKEL727D92 726409 IIAFVERI(~1~' CPCL0611 LAARRIAICNiHVRYIKQ.TSKEARQAItGCDI'SVYNVDRLWNKL.FDILGrs3-S3 Ribosomal Protein KGRRIIICOt(QCPICFR'IGVTIUtWRSLWItGNKDEFGKFLIEDVAIAOFLAIOCPSCOCANCP

WPAILSGKZEY1'IG'IJIAPOLYIGKIOCIIF.YDLLKBLLAALiGKEI7IiLEIJIEI1WGJ41KL

CPn_0626 719670 718495 VJUaJIAAOIERAYSFRIW4KXANOSVlmAG71VCV1II0VSGRLi1G71CIARSdYf~AVPL

rpoA-RNA Polytsereee Alpha HTLAADII1YATACJ1E'!'!'YCIIGIKVWII~GiSSSITPt'84PAAPSAAA
WLGKEKCaISDNAIO~iLLYDKFELPEAV1QQ.WlxLPIDKHAAFIAEPLER

wLPAKK%AQS
CNGHTL)GNALPAALLIGLFJ1PAIIS!'AM'GVLHEYNAIEGVI~ILHLKGAL.LIIKY72711D 727096 PNQO&SLGAT'CQVLfUISISIDJISOt.AAANCQKM'LDaLi.ODCOFCAVNPDOVIF'M'OPCPn.-0642 r122-L22 Ribosomal Protein IOLh1)vLAIAFGRCYTPSEAIYLEDIaCVICEIVLOAAFSPVTLVNYFVCDTRVGODTDFDRAAHSIVKATI1AYIRV
OPRKARLAAGLIOtNLSYOEAEEpLGFSOLKAGACLKXViliSAYIW

LVLiVffDCAVTPKEIVLA!'S'IQILTKHPSIFFi~I~EKKIVFEFJ1ISIEKC4KDDILNKLISVTEVAVDAGPVYK
RSKSKSRGfiRSPILKICTSHLTVIY00fm . A~1IAREM

4:INEIELSVRSTNCLSN7INIlTIt;FININPEPRLLOFRNFGKKSLCEIKNKLKElDQ.EL.

G~.TOFCVCLONVICEKt9IWYAEKIM10r1'IOGCPCI,.0643 727725 727450 CPn_0627 720059 719640 rsl9-519 Ribosomal Protein EIRDICRSLRKGPFVDHNLLRKVRAIB'IIEEKKTpIIrIWSRASNITPOtIGIn'FM~IDI

rail-511 Ribosomal Protein ItLTVPVSEITNGICKIGEFSPTRIFKSNPVI~
AQAKIISVIIRICOLIC~tIPSOWICVKATFIB'TfIVSITDPACNVI9WASAGK
VLVIOJ
A

FLI
SR CPt7_0641 728594 727722 O
VCYSGSAKSSAP1U1TVAAOOAAKTJIIO~ISGLKFVE11CLWGTCAGRtSIIYRALI&71GLWSY

IRDETPVPtBiCCRPAKRARV rl2-L2 Aibt>aaaal Protein CPr~0628 720461 720063 FIREIN&QR(!'KIrV'i'POI'RDLYLPwiDEL'1TRGELRG?'RSKRSLRPMtKLBFTIOtSSOG
RiII~IISCRHROOGAIOOLYRWDF1W'BIDGITAKWrVEYDPMISAYIALL8Y8D~R

csl3-513 Ribosomal Protein YILAP>OGIOAGOVYVSGl7GSPFKPC7CGK1'LKSIPCGLSVfOIIENRPSSOGKf.VR8A0LM
IltY1'ILREAQRNPRIICIDIPAKK1G.KISLTYIYGIGSJ1RSDEIIIOQJILOPEIiRASELT' EEEVGRLNSLLOSIYIYOGI)LRRRVGSDIKALIAIHSYRGQAiIRLSLPVRCQRTKTNSRTiT.KItPSGEFRt4.ti DGCRATIG
OVIAIt$PGYV

RKGIDtKTVAGKKX TJ148~IPVDNP110GCE(ZAH14Ci1fIPICT

0629 721881 720487 CPn_0645 728933 728598 CPIt _ r137-L21 Ribosaeal Proteia sect-Transloease OM~IfOYIKRHYVTCKAKl4.EHLSA~1'Cflfr~fl~CS!'CILDPKTVFIV51~11?I~LIaOAL
KIRLL~'RPYKI'1'LROFFLITELRQKLFYTFALLTACAVGVfIPVPGINGELAVAYfICQLLC' ~' SCONLFOLilDIF5t~71FA0NTVIJIILiWPYIS11SIIVOLFLVIfIPALOAty'OItSSDOGKRf1~f0018VG
IOfAIV
EAIYVDKNVKVKSVNfTrA7KPOPAPMFAGRPt~ATSGI

RIGRLTALFTVALiaVIOSLLFAt~ALAINLTIPGIVLPTLLSSKLFGVPwIFIfI'1TVV1M

CPl1 TfGTLLL36(IGEpISpIOGIpJGISLIIAI,GILSSFpSVLCSIVIIIGiaCSODSSO~.IS-r11-tA Ribosanal Psotsin ILILALVPVFVLITTILIIECVRKIPVOYARRVIGRRbVPGGGSYLPl.KW1(~P~FyAiDIJNLLSK1DFSCNKIGEV
EVADSLPAD~OCLOLIKDYIVAIAN11010ti8AC'!fit ASSLLNFPATICOFIAS&SYl9tRIAALLAPGSLVYSICYVLLIIFF'lYlwi'ATOFHPEOIALL
' IASEI~fOQtAFIPCIROGKPTOIIYLEY'llAfIRYCL1LGALFLAAIAILPSLd.CCLLRVDStJVRGGGIVFGPII

SEYBNSTAKPPKOIOGTGIIAROGCLiISPO!
A
SJILIIFIJIDCNVOCRSILFIDNLO11V0~a LTAPkT
' Lt?OtRYDSVLiITOATIOCIW .
GWLDT p 4 Li101II0 INKLTPVD~~DR

ISLANLTAVIOCFVYCININCYDLASAIaIIVISpfAL.OELYERLVfiTIID
CWOJIP
SYFLOGTAIT<.IW

CPei_0630 722316 721885 CPt1 r115-L15 Ribosomal Prouin -RRFGYE0I1GVPLYR r13-L3 Ribosa~l Protein NIKLESLFDISERIWAKIQ.LGRGPSStaiGKTSC~IKGOGSRSGYKYLEYPSYCIC4L.PPLITCPFIFLA~FLFFLt ?1SISKILSRFVSLTf.OBEBIfSLIi310Kf11 RVPTRGFSHKRFDKCIfEEITTCRLAELF0E7GGITLOALKAKKAIAAOAVRVKVILIIGOL' XESOOYPSLOIGAEOIIAP
RSHISVIGKK>DQ4IHIFDKOCSLVACSYIRVEPNVYf0IR1 EKTIVNOCiAWiSGCVONLLGIT ~ZTK~

CI810GICGFOGtMKKFG1~GPGSHGSG!'1(RNAGBIGIBtSTPGRCPPGSKAPS1o83i1~M' CPn_0631 722812 722712 VIO'E.EVIKVtM.tKKVLLVKGAIPGAItGSIVIVKfISSRT

ray-SS Ribosomal Protein ~15GSKNSHKEOOLEEIfVLWNRCSRRFSFSALILVGDCKGAI~SYGPAKANEL

CPn TDAIAKOCEAAxtWtI9CIEALEOGSIPHEVLVHHOGAOt.LLKPAKPGIt3IYAGSRIRLI_ CTS29 hypothetical D~tein ' eHAGiKDIVAKSFGSNNPI4NQV1W1FKALTt~.SPRImLLARGAAINDFFFIGIPCXEVIOtATNJIIASAGBAASi0 0.LPVAXEPMVSSFJ1QKGIYCI00!lTliP'GNXL

AK!'00J1TKSL00(CFKLSKAVSDCWCSLEF011LTSAMIApOIa.KiTAEWAW~1V
CPn _ ItIGtIVPSfVNSIbRCY0YTA0AFEUSKTKERKTPCEYSR~.LTRODYLWvBAGCtA
r118-L18 Ribosomal Protein ' KCLISSWLVNLLOVFAPNVLLNLIKVREFVMCMaISWKLVKLRIf0Al0iRSRVMESSLCKItif11J1GVAGAVOGIA
L
G71TTYSATFGVLRPLG.INKLTAKPFLOKATVGIIFGTAVAGIItI

KSL40(RRAALRVRKVLKGSP'fKPRLSWKTNKHIYVOLIDDSIG%TLASVSI'LSKLtJICSOEOKLFKJWCESLYNE
RCALCJOOSOL9GDVILSAERALRKEtIVATLKAHVLTi.L>GGt.EI.

CLTKKNOEVAKVLGl'OIAEIGHIrt.OLDAWFDRGPPKYNCIVSMIADGAAEDGLOFWDG11KLIPLPITVACSAiII
SGaLTAASAGIGLYSIWOKTKSGK

CPn_0633 727760 723209 CPrr_0649 772672 731710 !mc-Nethionyl cRNA Poa'myicransferase rl6-Lb Ribosomal Protein IJOLKVVYFCTPtFrIITVL00LLHHKIOITAW1'RVDKPOKA8AOLIPSPVKTIALTIIGLP
SHSRKAREPILLPOGVtVSIGODKIIVKGP1CCSLTOKSVKEVEITLKDNSIFVHAAPNVV

ORPSCHOCLYWALISNMVpCVHLGFEKRLFI4ICVGFAASVQGAFLDLSIGVSHPTKIPIPLLOPSKASOPOFIEELRA
FNADVPIWAYGAILROIVLDIPRYGCYNLHAGLi.PXT~GM
' STLQVSVEKNTLISVKGLDKGLVOEF)1ASIAAKRPPEPYKGKGIRYeHEYVRA1UVGKAAKSGEU1W1L11SpGiIIV
LIK
PIOAGINEGATESCNIVIAL'~11Gf4TI'GONANITRVPICPOIT!
TLQOIESGOLOLVSODMf.ATIAPKLSKELf~VPWD1IPAKFJ1YANIAC11TPAPOAKILFS

tGKK
FSEKAPKRllCIRKdSLLAEAGRYGJ1PGTVVYI'DROELAIACSEGAICLHEYOV~KGSTN

CPn_0~74 724215 723787 SILiPIlJGYPI1KKLICIVf'CLNN

rsH-aP Ribosomal Protein E3SIKRKAIYMCKCSDSTAOLLTRIANAI~IAENLYVDVEHSKNREAIVKILKHKOFVAHYCPrr_0650 777517 7326b5 LVKEEt?iRKAANAVPLOYSDDRKPVIHQLicRVSKPSRAV'NSAAKIPYVFCM4CISVLSTSlpxA-ACyI-Carrier UOP-~lcNAc 0-ACylcransEerase rX:VtIECSLARSIDIiCGELLCLVW SRRN4ASIHPTAI IEPGAKIGKOW IEPYW
IKATVTLCI7NV WKSYAYIIXNIITIQOC!

TIWPSANICNIfPGOLICYOCEKTYVCIClTICEIAEFAI
ITSSTFECI'1YSIC~IiCLINPWA

HVAtINCI'ICIRiVVLSNNAQLACHVQVCDYAILOGIIVGVNOFVRIGAHAN~CALSGIMW
CPn _ PPY'I'IGSGNPYGI~.aGtNKVOLORROVPFATRLALIKAPKKIYRADGCFFESLEITLCEYC
rl5-LS Ribosomal Protein CERKANNSRLKKFYTEEIRKSLFEKFGYANKIpIPVLKKIVLSHCIrIEAAICD10JLF0AHLOIPMMFLEFCC3PSKR
CIERSIOKGJ1LEEESAWfEL~ILIES

ECLTNISCQKPLVTKARNSIACFKLRF.CpCIGAKVTLRCIRNYDIMDRFCNIVSPRIRDF
ttb5! 733975 737517 CPn R4F~NKCOCRCCYSVCLDDQQIFPEIIILDRVKRTOCLNIIWIJ'l'fApTDDlxTfLLEWCL_ C.aGZ.Nyrlstoyl\cyl Cacrier Oehy3racase kFKKJIp MJUPrIIIKLAELLCLLPtIRYPFLLVDKVLSYDIEAR5ITl4pKNV'fiNEPpFNfAIFPNAPI

<:Pn brit.: 775tOR 724750 Nf~f:VLLLEALAtsArh:VLiCLVLEIIDRNKRIALFI~IOKAF.FROAVRiCDVLTi.OMFSLt ' ~

rt2AL~1 Ribrartm,sl Prntr.in !'t:QI.VTFrIEL.iFALVDKFw t :~Kr7r:IfAWAGAR1 FY, t:KEIMKKVN I RVC~KVF I L.ACNOKtiKECKVG.LTEDKW
VEC:VNVR t KtJ I KR'~(FK ~Pn:N:S~ 7t.lNqy 7)799D

:YkI;:IFIvt'ItII:,NffRL?fAf:EPAKt.:DVKVTEGGREWORRPU:TSVLYRLVRCKKG

Ipxr: Ptysrryt r:lcN.W nre.metyla5e :In or: s7 ~_'u47 t 7'.'.5D'1s KRN::I
t'ft;O::L:;:l'1'NC.F.R'lltR'CLItREI/RYA:W:IHLl3K.~.STIJCLOPAQ'lNl~':I11FGR0.~.

t4 Hilrc:rrr..sl Irnr A:x:lffEtiVPAId.IWVY'lT':R:'I'fL;Ar:::AVIA7Yt31LNAALRSNNtDIILIIOr::7t:EEtPI
.W
r114.1 .
c:Or:.:?1VFYIJ.ICtsAt:(t:ELsE00Y.V:.IARLTPP/YYOHQOIFLAAFP
. I~L.KL:YTW1YPQ
tt:ttrtWt.:VLKVACAdIt:AKKVNr'FKVta7r:.~.RRRYA'l~/f:lriltlft.':a'Rf7VEI?C:::LKKC
:DV

VIIDOKC:NINr.T1!ll'r:WARt:IirOkr:FIKL".:iL::::CIr.TVYK::LVINEE;:FRVl:IAf~.'RTFA
L'fIIELCFIlIEKGLI~:LIMfAWFKOtt:II
IYAV1VR'Mtltlll'PNKOt.:TI.KFIri'H:a' . ::IrrYlIHFAPEf'VRItK I
LGLI.:OC.;:I.W:RPFVAIIVUIW::.r:NESTItAFCKKIi.EALhL
AthV t wtn ~u. rH ~: ': f'7 t 'f~bA'ro rtn 7r.5 s f s...t..'r 7 t4H4sr ':

.-urE-Apnlvpopmtem NAeatYleranst.-".rseANFCVSLFEIOCLtCMLVAC-....DKISIICxtR:PMNVLF~L:L:.FAItGNWF.'.RSNtKrMV
' ' ' ':EPVGRIFr'FVLiyIt:LLAFAOPOL~PVStLCMCGYG!'FSJYSLEPLKKPSLPLRTI.FVSYPL~T.DI' ttl4PAYF:ATFA..
IA.~.T
OCILLFVICFFLYOPQIM LAAaEL'((((eAAG
f 'I~~R
I~ f ' ' , ~

CFFTIIP'PIEv~INF~rWIIL.i00YICKLIYLVWLTLITILSYLFSCFSCLLYAIVROKRTAFLI
LP
:

WIRVIIGFFIALtJILR6f .. v... .."i. .~ , i WSLPCVWVAICiLItFYGIF~"!. fSFDYLdIPMTJ1SAYGROFGGFIGrtAG05FAVIAVI~IISF

YCLLLKKpNAKMLWVLTL:.LPYTFGAIHYCYLKHAF00DKRALRVAWQP)WPPIRPRIJfCPn..0666 71677n SPfvVWEpLLpLV::PIOOPIDLLIFPCVWPFGKNRpVYPYESCAHLLSSFAPLPIOGItAT'dnsE-ONA Pol :II Alpne ' ~.N3DCATAL~HFOCPVLI:LERWVKKENVLYWYNSANISHKGISUGYOKRILVPOGE0K
L
GFFL711IPGHGNSOYSYLCAHSSIKDFVAKGOEFGIPA~I:.vOHQILYGAWFYKEL~

.
~TpPttrrE_~IlAW:~PPDYKKEKRSRAAHHLILLCKNECC:YPHL:LLTS1JIFTlI:FYYF
'fY:KF~:":.I~P~t.FnY'/AfIX'KRLPr:RR.'7:'!'l':'VRGLPRIr:LT:.~lEGZ'FCYRLOSYK.
..
' ' ' n~ tl .,..; .f Y: I'll iw?tf"".?!Ar...,..~,.Vl:i:::ii~R:LKLO
:,:i,:..:::1:::'tlf .nr .. .'.:~8':Ilr.:
J ML . , . :OL'r'-'_ .a,~..;~IJiK
,.v.WF
. . :.:
:'It:CKe:.~'.'(.'.'~: : .. . , ~
~
' ' ' , .

....~r..Ag._.\7::i:::~::a'L,:YRf.irlKF:~
... ~i UPlET14\I .: : /i.cT.::.v :.:.1.
Ll't1i KT:.Yf: i ,~ :L . : 1. :.:SIT
a. a :\
~:l::f~:a.~'JU'IQAII
:
!t':vF'l:l~'FY.ewGi:.i:'.' ' KEIR CILIJr'VOSCLf VItIAKQIP: H I
PNPKRKVYRSREYYFKSPApNAELFKDIPEV
I$NILLYA

KRCDlTFDFSKKfIYPIYVPESLKTWSYTEEDRYOASAVFLK~IIEALPIOIrSSIVIaN

CPrL0651 777051 776507 IAIOfFPNRDPIDIVIfEPlmNL<4AI L I
PKCwlICDYLLIVWDI INNJ1KATL;IPIICPGRGBCiIG

vdlD/yciA-scyl-COA Thioescvraav SVLLFLLGITEIEPIRFDLFFERFINPERLSYPDIDIDIt~IA~GAERVIMfAILIItB3RWV
' KKIIDF45VtNlYYRNOEYPIKIGSVESTML10IKPV5FSCIDCNIYIfIFPDR7L!(Al'BnVIGfLalAt.SKVNNI
AKHIPDLNITL$KALCIpPDL1101.YDlD
AOIITFCITOtAKMAVKD9CR
' CLLIISLLORLALWACRNTE,SVCIrfAFVOJILrtFYAPAYItDENLICKAAVNRTWRTSLEVGMIIIPICI$KCSII
tIT'IpY9BtlVCS
AESAQVIDMALCLOGSIPNICViIAAGVIICGOpL

VIfVWAEI~tIYKOERRHITSAYF'l'FVAVNEDNOPIPVHOIVPE'1'PEDCRRYNFADARROARLVOIGJNDLLGLK
TLTSINTANSAIEKKIGpSGAMATLP3.ODATrFShLN0IRl11CI1~IC

SIaipELaIOILRPDLFEEIIANGALYRPGPIIDIIIPSFINRKIiGKEIIEYDHPLJQSILRI

TrGnwYOCOVMOIACALASxsLCECwLRRArnKKaFOOM~a~cxlcKRACOBIaIDPc W'IYIFDIMOtFAAYOFNKSHAAAYOLITYTTAYLKANIfPKIi~ILIALLTCDSDDI
CPn _ LIRI~QSlIGIPiLPPIIlNVSSNHFVATDEGIRFAMGAIKGL.R(H.IFSIVLERDIINDPYB
dnap-DNA Pol III Epsilon Chain KEIMSLLIfDTVITCLDCEH1CLWK>fDItIIEIMVRFTFDSVISSIEFLINPERWSAESSIRDFIORSDGKKVSKIIS
IESLIDACCFDCFDSNRDIid.ASVEPLYIJIIAKDIDI6AAffiV

ORVNHISNAMLRDOPKIAEVFPOIKAFFKDGDYIVCHSVGFDtpVfaOFlIERIGCfFLSKM'FITLCAMDRIO(EVPI
CLPKDIPTRSKKELL.~IFKELLGIYLTEHPI~'1YRDNGfRLSV

Y'tIIDTLRWCEYGDSPM'ISLESLJ1VHFNVPYOG~MRAHKZNEININIFKHLCKRFRTLEVLJ1GEFlNLPNGSWRT
VFIIDKVL'1'ICISSKAQf~FAVLRVSL10ID&YQ.PIIiPDNY6OQ

OLKQVLAKPIKMKYMPLGKHKGRCFSEIPIAYLOWASxIIDFDSDLLFSIRHEIKHRQKIiTOELLLLDRLIYAILVLD
xRSDSLRISCAWMNDLSIVNCtIIYtI:D0AF01lIKHQV0101SF

GFSpVNNPFMEL
TMSI'SGKETKAKGNKPNENCHTOALIIPVTLSLDLHB.LRIl5HLCILKKIVQKHPG~'1'LVL

VF'IIQONFRVASIISPDD11YFVCEDIEELAQELVTJ1DLPVRVITV

CPn_0656 737842 738018 No robust hanoloQ prasane in Gerstbsnk/EMBLCPn_0667 751097 750177 as of 11/7/98 THNFLLLPLSLFDILLTVEGFLCL':LYFASVORMPCEQKAVP(~1LYYYYIAAHSSLCLSVNo sobusc homoloa prestnt in Gent6snk/EitBI.
as of 11/7/98 ~gtKp NISi.LCICIOKRYFHKKLILYFAAPYASLfCGYFLGIDRVPCAOKIMRLMDNSSEVFSKSC

RlIRtKISGFSF1.01IFLRHVSPEOALALFPEYRDDKSIVELAFIPNTLtOiVRPSKEEPIIIOC

HII80DDiIWSLVt'IOOIVIJtI~IWrCSRaFRECtS.tJIAGK001DIVI0TLATt~'1TSRE
CPn _ SLApAt.At.IWIRAERVIK!%:OKIDCLIFASGNOIGTHFOQFQPIRtICTITWNNPWILpIIP
Y7aE tATPasa or Kinaael PMGRYRRVSNSSpETLLL~.'TELGpVLVPGAVLLLFCDYGAGXTEFVRGIVSGYLCDTIAERNAAVFPAOYSLORVRI
ILVIfIIIFGONFLIVRSSMVYVpVYKISLVSADNSVRVEYILBIVt EVA3PSFSII.ifiIYGt~R:PKRLCHYDLYRIDOKNOEYIFODAEEDDVLCIEWADRLPKPW~CGKSIpDL

In'INIYITHpI'N.IBREIIIEbt CPer_0668 751176 751162 CPr>_0658 739180 778155 CT547 hypocMCical Drottin CT578 hypocMCical Protein WRFVWSPRLIHIIFLLYVPLLLVLVSTOCMKPVSFEPFSGKLSIbRPEPONSAFiYISQ

KRVCi~ISGAVKQW.t.QFIGXQKKPELLATYLFYLDpALSLRPVVFVRDKIIFKTPEDAVGOEPLKIIfRIFRI(ALI
CFGIITHIIPPRDILRNOApYLIGVLYF'fQDIIPDt.iIDRAIASYtQL

RiL~CIwRETEIOISSEKPpVN~N1'KRIYICPF1GKVFACWVYANPODfIIYDwLSSCPDAiYSEFi.FOISIYAIAO
RPAOCKRKRICRLOCFPKIaIiADCpiILItIYDEILTAFPfI~.

PQNIODIQCCVRIKRFLVSEDPDVIKEYAVPPKEPIIK'fVFASAI1CKL!'HSLPPLLEDFIGAOAi.YSKAALLIVt OJ~.'1'F~ITxTI~ItILTLOFPLifILSSEAFVRLSEIYLQQAIOICPM7L

SSYLRPIITLEEVONOTKFOLESSFLSLLOWLV1D>KIAAFIESLA~I'APHYYISpWVDTQYLtIFJUQlJEP~xQHP
NNPLNEWSANVGiIMAFJIYARGLYATGRFYEKKKRAWI~tIY

YRTAITNIfpdTLLVA10G01IRLDRISKNTB

CPn_0659 77948? 779838 CPe1.0669 751110 752775 CrxA-Thloradoxin CTSIB hypothetical protein LOENNRDSNSIFREGKLHVICIISSENFDSFIASGLVLVDFFAE1VCGPCRIC.TPILHdaAIEYGSILPKICINMRLF
SI4TIYLFFSLUSSCCCIfSIWSPYNLSSLGKSti4IIFIA

ELPNVTIGKZCIIDOJSKPAE'1'YEVSSIP3LILFKDGNEVARVVCL.KDKEFLTNLINKHAPIKEDPHOOLCSALTY
ELSKRSFAISCR&SCACYTLKVELtIICIDI01I01r1'PAPI~ICDK

'FNNtFIVSNEGRLSLSAKWLIMID1~0EVLIDOCVARESVDlDFEPOGLTANANCF71GOQ

CPr!_0660 710737 739860 FDB18L1IKSARRILSIRLAr<1IA00VYYDLF

apo0-rRNA Mathylasa MRWIJICPDIPOMGttiCRTCVAt)OAE<.ILVRPLGFSLADItlYIfRAt?lD7fWDKLOLTWDCPn_0670 SIEGtXOVPEDOZFCLSTKGSASYTEFSLPSSGTYVFGSESKCLPItEILIOCYYItrtCLBIrsbW-sigma rs9ulaeory Laetor-hiscidina kinasa PMQQDIRSLNWTSVGIVLYEVVRQKTV)1LQKNPTVPRRLt~1RY171I'FFLCETV!'PAVLSB.tISMLDLIKIIJU
OKOSKCP08KLLJILEtJICEELLVN

IISYAYpGENSPCC'IAISCISH1IGOLtVYIKDHGPSFNPLAV5INI0EDLPLEORKLOGL

CPn_0661 711179 740717 GIFLAXSSYDEFLYARED1K~1IYNLiOl:1'IGpHS

miD-~P-type paptidyl-prolyl eis-Crens isanarasa tiSRCLKIKDRRRKMNRrtIJNLJLATVALALSVASCDVRSKDKDKDOGSLYEYKDlRtDINDICPei0671 ELSDNQKLSRTPGHLLARQLRKSCDSff'FDIAEVAKGLQAELVCXSAPLTETEYEEKWIEVC?550 hypothetical protein QKLVFEKKSKENLSt.AfKP'LNENSKNALWLNpPSKLQYKIIlfGIIGKJ1ISGKPSALLHYRITIN0RKY1'MSLDF
FEEFYHOSIIM~CI'SFPtCYLNIAEILSYPHCI'DANI'DFLC$OSD

KCSFINCQ'VFSSSL~'4~IEPILLPLCQ'l'IPGFAIGMpCIGIDOETRVLYIHPDWYC'1'AGOLNDFIIAEbKDIf LTLFNADFAIWLVPLLVOGOAVTRCYIAVSQGDGNYCPET01FGSOpYN

PPNSLLIFEINLIOASADEVJN1VPQECi'IQCEOSSLILFJ1LQLYLKDIKDI'ENALR&PRF'tNDN

CPt>'066Z 742938 7411'7 CPeL0672 757TZ3 755018 asps-Aaparcyl tRNA Synehacase dacPlpbpS)-D-Ala-D-Ala Caroxypepcidasa SKCtCYlIkYRTNRCNELTSNHIGENWta(iWVMRYRNiIGGWFIDLRDRPGITOIVCREDETLKSPMIKRPFFTYLCI

OPELtIORLDAVRSEWVLSVRGKVCPRLAGMENPNLATGNIEVEVJISFEyLSKSONLPFSIVIYPASM?KIATALFIL
KHYPrVLDTLIKVKODAIASITPOAKKOSGYRSPPIMLCIDCS

ADDHINYNEELRLEYRYL.Dl9tRCDIIEKLLCRNOVMtJLCRIiFMWIpGFTEIVTWLGILr~I'TLOWLREEFHALL
VCSANDMt~NLiINACCCSVEKFMDKLNFF
EEIOCTtn'N

PECARDYLVPSRIYPCKFYJ1LPQSPOLFKOLIZIVGCLDRYFQIATCFRDEDLRADROPEFf'NNPIIGLNtiPNNYI
7TRDLISIMRCALKEPPFRGVISTTSYKIGiITMJIGRPtNKL

AQIDIEMSFGOTODLLPIIEQLVATLFATQGIEIPLPIaIO'!l'YpEAKDSYCTWCPDLI1FDL.LPGSTYNYPPALD
GK'1CTTKTACKNLINAAEKNNRLLVTIATGYSCPVSDLYODVIAL.C

LKt.KOCRDYAKRSSFSIFLDQLaHOGI'IKCFCVPOCATMSRKOLOGYTEFVI(RYGAtGLVETVFNEPLLRXELVPP
SDCLOLEIANLCKLSCPLPECLYYDFYASEDREPLSVSPI7UiAD

WLKNQOGINASNIAKFMDEEVFHELPAYFDAKDODILLLIAAPESVANOSLDHLRRLIAKAFPIEOCDLLCHWVIYWEG
KKISSOPFYAPCRFERTIKPWKLYMKRVPTSYRTYNSITM

ERELYSIXipYNFVWtTDFPLFSLEOGKIVAEHHPfTAPLEEDIPLLE~DPIdVRSSSYOLL.tIiYFRIRKHRKYKNL

VI.NCSfEIASGSORIHNPDLOSOIETILKISPESIOEKfGFFIKAISFGTPPHtGIALGLD

RLVM/LTAAESIREVIAFPK1'OKASDIJ~BWAPSEINSSOWfELSIKVAFCPn_0677 755217 755167 CTSSZ hypothatieat protein r'.Pn_0667 711270 712901 3KS1'LGKAYHCFLKOVSLAWREE11W~IPHHWFILt1pF00FSGEQDRFCSFLFrITIROR

his::-Hiscadyi tRNA Synchatase VSFLVLpEKIATLK

Y.SNNFEARHIM11TLPKCVFDIFPYLADAKOLJdUITSWNSVEKAIHTVCMLYGFCEIRT

PIFFJLSEVFLNVCEESDWKKEVYSFLDRIOGRS!!1'LRPEGl'MWRSFLEFICASNRSONKCPn_0671 75669's 755577 Pt'lILPMFRYER00IVGRYROHHOFCVEAIfNRHPLRDAEVLaLi.WDFYSRVGi~161QI0LImu-RNA
Mettfylcranstatase NFLOCSETRFRYDKVLRAYLKCSNCELSALSQQRFS'fNVLRIt.DSKEPE00EIIR0APPIRGILYYI'NVPPRQNHA
YOLLKOLHTSAISEADRVSYYPKONR3LGSK~IOWLONIIFNIL

LGYV3DEDLKYFNEILOALRVLEIPYAINPRLVRGLDYYSDLVFFJ1'I'f'LFOEVSYALdGGRHRRLLETLILOSGf Otrl'PEALVAKVNOCVLENLDSYSALPWPVRYSISODWiFLVt~Y

r:RYDGLI':AFGGA:iLPACCFCVGLEMIOTLLAOKRIEPOFPHKLRLIPMEPDADpFCLEGEEpAEEIAKLWLTEAP
ITIRVtflDKI3I/KELOEKLEYPSSPCELPEALNFSKR11PLQST

W::QIILRRIiaPTEVDMSHKKVKCAL.KAASTEOVaPIr:LICERCLISG'OLVIKNMSLRKEFEAFRIK',FFEIOD
EHSORL'C1'IwLTDKOIVLDFCACAOGK3LIFA0KAKHWINDSRK711 FY1'KEEVEQRLLYEIONTFL L.p'f'AKHRLLRACARNFSIrILVL.RIfi$F.~.W
IVDAFC~FRRNPEHKWQFSKKLLLNY

YR WjKG ILIfI7ASAYVGPRCRLVY tlY:::L.LKEENEANVA'rMN.SIaaIKEVHRKTLPL4~VGKG

~a,W i.r..l ;.(1775 741557 OAFFT.~afIFL'Kt :>,. irdnl::r nan,Uwt Plarent in tanabank/ENBL .t:; ut 11/7/')8 I.WFJIfIAMKKLIALI,:tYLVPIKI;NTNKEHIIAHATVLYJU1RAKYNLFtYODVFPVFIEVtEP~.Fr_IIi.75 :S'h.sl n5r.7..d f::l~'1?:LVIIYEIbIV rTr.'l~ hypOthlittCal I'rr.t,lh '/PL:dIILOFUFS II:YYLHVf.EL:,tI!U:1't!
f t.\'. IdtKK).LL.fiAWl'Vllld'L1't'NYIiI'::V.'7 f' f I~r il.r.. n.lqry ')4'.'If,S
R(r/IHELF:M::At;;Y::l::::NIdJ,LfFLf'LII~t:Y.hJI'WdI:YHt.FFI'::FriIIKKAIVDKLL7A

~Jq~: Ia'xrr.:ldwa:ph,rr.e 'rr.rn::PnrtFK.~.LILFL:RRPVDKIVI'AAN1'.'/f.~.Yr:Y::Nh'::::WhUITIItrtSI::Iln~l'f f 4:fVlWIRLM

YMrNWI'KI'Y\'I'I'KIIIKRIEDIIEWKKKYK'IWIiIRIF't::Mf'fv:YIbYYFTRKGFTPAMPTLDA.:LVN
tIULTfLLFIaITAYL:~L::f.ftf.f.l~'f'Ira:KAylLKTL:'.f:K::'NLLRI.LIhLF.iL

f.\Idr?fIKn4W
:Itv::.TL1'F::Yr;I:KFV:rIMSOq.~.tIPRYf.TfAf!:Wtl'f:LTNIFFI:I~S::AEDPhTtIM..LL'.
D::f.ab'L'LiVt:Lt'aYl1'1:1'f~IY:KTA/r:l.WhF:PAf.A::1'I:U::KLALL:FL

::IYLYAIYIYI;:I.MWF'(XfiA.WI'h:AItLLTIIWfAK::EAr:75'aI::VW:.T::iINICG11LIPILTGF
AEVLRKVIVEKKLtN::K::IMn'1'FEEW:Iff'I::11!f/~Ml'AI.WDKNCtMf.IJlitYlId.tH'LhtDIr:

I IUY:XWIa:AMYVh:(Lifv:NGLVLINRLRDTFQ::II:f.PffAfJ!KMlYYNPHP.~.FWRhYt.tLilQtFf fEKYKRDPFIHAHfIEL'KSl4iE

' : I'I.:R I I :IU :L:."1'Rl: f (.F'M'V f:ITX~W1.WFLAAA::FF!'!
f VRMAVNDW :ALFL t E1'KIIYMVK

WO 00/27994 PC"TNS99/26923 75?:I39 758051 ..~LNLIV4IFRpVFF.~.NSRS4.
.JCNYLRL:.K:NFA::':.1'KER~..'KTt.::.~1:.'.'fCFASF:r l ~

ri FYTNTFPFLEEOYTPAVUr:VA.:RYtI',~.NNIvDL111'SHRLK:::E".':J1F':DE:F.TIYIIPFCC
,Pn_On tloalnlogou:: to CT695 ONELI0MK3PYI~3GFA'IRNt~Illl~fLTTEC~II>~~K
DRM'ISDPLEESAAEf7CD5DLEDRVSESATOVIETIADTGIPEATPSDG
' ' ' :, LQPI'TNRKG~IYRLG K 'I~CIST. E
EK SDPISRKLAAOHYPYSFC
I
, piSfi~OC

T1~pLM.iOLVDRVEYEARCSLLT114.ARIRKAVSOIW~IVKTKRNPKEO~IRSIGOIPCD.

LLMTRLPKETAEPPYIYAGITALASCR.iFFINVFLRLITLLRRONPEAPLDLCCI'OPISKSKOLYLKKOLPKR

PfAAVAPALILRSCCKWVATDAVOECLPLEVIEEACNYNJ1FSLEATTTVEEVSKRLSELL
o X71107 770147 Y.~.DKRIOCLANVRCITKIITSPYLCACOCVSWOM.KTYDLGRNY'EOVLACASOIDEFAD~Pn_069 7NiE:-r:liceet Hwnnrranzflroaa ft Y~.FNFALVNIfDtLYI;JM3DR.~.YfI/:DFtarteISEEHASEIrtJYDWtaILEVNLPILEEDYRr u r.rrv ;::1~-r 'EI Y:' .
~;:n'_1Y- i:e:Ifi':: ""i'.'~'' . E
.. ~.::ivY~r~s-:~ :. :.i.":. : ).
. Lr-=u.'.': :y, i=

HNANVtw"WEIKRRRCSLVKKIRVHOSt:LICLDDLEKLLNEGAt)FV3IPINSN41GC11pP

CPn_OD
LOOVAELVNRYOAYLAYDGAOCAPNLPIDVOLWOVDFYVFSSNKIYOP'TOIGYLYI~DL
No robust homolop Dresent in t:enel>Mk/ENBL
as of 11/7/98 RIAEGINPSGNRSPDDVWVpGAOCOSSSTOCfGiITNSEEGIWEM'1'STSQPQV1~4KAKQLLOOLPPVOC~DNVAIY
O~tPEYLPAPEIKFEACTPNIAGVLCLGAALDYI~GLSAKFIY
' WQIVRCFFLCKICSPDSSOCASGPAIIOSPSOPIIRITRPAPPPPI'IC071NJ1KRPATIIC~RIOGANPLOIGFLLD
IJECI71VIC:
DKEIALTTYtJiKELLEIPCVEIIGPSIEEPRCALICEtI

APOPPTAGSSSOSEOPTANSSEVAKLVSELKDAVNSIIAEB~CVL10NSOELOTKiII'QiC'1CHOCADPJWERWNVC
NVLRVSLCIYNDF~DIDOFILVLCDSLOKIRR

NRCPDYLWCYRVI11RAT.OpTYTLOS14.IELTSSTCPVPQAVTYAKDAVTO'lvRG11I1QiL
RVSDpGGWSOIDYTSDIARL 0690 772701 771176 CPn ENPKPCNDPONL~IpWISLGIOCPTLDPGESIONPLLT_ AGKN11'1'RDVNOIANESSRL ASC TransPOrter Nelnbraru Protein GSALDRVRENNPNENPRIWIALARCIGAJ1VHSNATSVRIANGSV4AGDlM.VSICfFSSIASOSPVOKAAEACYTQYS
KOPSSKL17LS5FS1iI0EtSGfPO

~,~~I~y~
RYNUITi'J1SELIKOIRSL71FBCILINGKYEPS4SOLPEWIVCCIDW1CSLSSF

NOCFOVN1WPLAfWAVCSEDRCWLYIPEID~'1'SDPIFVRNISFPTVSOHDVIFfTRIV
CPn ' _ ELFVCECADLTVIfNPCYSEtSDfLSWS
No robust homolop present in Centebank/ENBLVILCQRASAOIQISNDVDLENUCSSKTIVNGYt as of 11/7/95 ' ' KiINSVNPSG7JSKND4WITCANDOHPDVKLSCVISANL~SNRVTASCGROGLLARIKGVi SYIVCKKGNAESLVLVOSPRIL~IKJLSN
TIA'M1~11ICMffGIJLL.ESCOGPGWPDN

7CPFSRMSFFRSGAPRGSQQPSAPSACIVRSPLPOGDARAT<x3AGRNLIK10GY0PGlDM'011~IYSRQHIKSILY9 GNPLFF~CI'ISISSOGCLSDANQKHDTLLLSSLAAVSTIPRLEI

IPWPf7fIGAORSSGS1TLKP'TItPAPPPpKTOGTNAKRPATNCttGPAPOPPKTOCfEIAKM~6YKASNC~ATVOPL
DPOpIFYl9LSiKitl'EAFaQEKLIHGFL1~LVSDTFtaSST~.Et ATI~KCPAPOPPKCILKOPOOSGfSGKIOtVSWSDED

0679 763936 761735 CPn.0691 773167 7736C!
CPn _ CT691 hypothetical protein pyk-PhosphoQlyrerace Kinase "

CY!>nIfLTWDLSPEtICKVLVRVDFNVPMpDGKIL~IRIRS11NP'1'INYLLKKEW1VIWSfCaKILXiaCSVLV
RGLGSNLKIKfiLHASCE10VKILDOFNWIOPCTMNIIt;PNDA0K5 ' HLCRPKCpOFOEEYSLOPVVDVLECYLWtNVPLI1PDC11CEVAR0AYA0LSPGRVLLLfl:IL.DI
SSOEIALOEONLLSNLP~LRSMGWiCFONPPEIPTI~tKMFLRDAYNAIIRRN~10~

RFNIGEEtIPEKDPfPAAELSSYGDFYVNDAIGT
SIt~FNZ'LLSTVLI.TIfEYNII'1TDLFL~INIMOGFSGGERKRNEICONLVLEPEfIIVVLf~EP

r'f.
.EFLGRNLLTSPKRPFTAII~GAICISSKIGVIDSOLDVDALRLICRVLEKYRELIIPtSSLCIVTHIQPKLLi7LIRP
OWIG.LLDGIIVALIfiW
' It>cISLVEKS11LDLAREtVLKIAKSRNH'I'IVLPSDVKAAEM4SI~YSVISIDOOIPPIQ4GKRVAWR
SIlBIELfJIKSY0E1r1 FDIGPRTTEEFIRIINOSATVFWNGPVGVYVPPPDSGSIAIANAf~?SiPSAYMI00GD

AAAWALAGCSTKVSNYS1GCCASLEFLEQGFtpCIEVLSpSKSCPeL0692 771915 773161 11AC Transporter IOE!'G11GGKYEIOESVKVPLEEREDYPYC:IYI'PIESOGLTRCGSEE?IEEIAAL1~POP

ygol-Phosphate Pesmease IIDPRL011YRYwIODiJtEPANARI3tYGPIAY~IVYFSSPKOKKPLGRtiOIIDPIILDTFK

YSNLPLIIFVIi.CGFYTSWNIt.ANWANAVCPSVCSCVLTLRQAWIAAIFE!!'GALLirGKIGIPLDDOKRLLI~N~
iIfAV~.VFDSVSICTTFKE71LEKAG11IFCSLGGIIOmEFEILyKIt 7RVAGTIESSIVSVTNPNI715GDYMIf~IfAALW'fOVWC.OL71SFFOWWS1TNSIVGAVIYLGSWSHRfIfFFAAW
AAVFS00Sf11YVPKCVKCPIO)ISTYFRINNKF~YOQFOITLIVY

GFGLViGKGTIIYWNSVCIILISWILSPfEGCCVAYLIFSFIRRNIFYICiDPVWNVRYAEDOCIfASYLOCCTAPAYS
SNt,~rNMWELVANBHAVIRYSTVONYfYAGDKKTGI~IYNF
' PFLAALVZM1LG11MISGCVILKVSSTPWAVSCVLVCCLLSYIITFYIMrI'lCNCSYISOTK
VT1~LCAOYRSKISNSpVl110AAITWKYPSCILKGDESVCTcPYSIIJ1LTSGKIQAD1CI

PKRGSLTYRLKFJIO~iYCRKYLWERIFAYLOIIVACPNAFANOtiNDVANAIAPVAGVLRNLN9CKRTTSIVISIOGI
SSDESKNT!'RSLVSLCKIUIOfSSNYTOCDSIQ.IOKASGiII<TDP

pAYPASYTSYTLIRi~tA!'OGIGLVICL71IWGWRVICNCC1CITG.TPS1~!'SVGIIG&11LTKIWS'1STSSIEN
GTlSICLREDQLLYLRSRCLSPLI'AVSLVINGICRLIIEDLILVAO

IALASILCLPIS'1'1'INWGAVLCICLARGIMl'HIItIIKDIVGSWFITLPAGAI3SILFFEASKLLLIKLC9S11G

FALRALPN
CPeL0693 776393 773310 CPn TPR Repescs ID-Linked 0lt:NAC Traneterase 0681 765001 764358 hanolt~pl _ LRSTEItM.GEISNEGiUWIJIXElTrCSGI
C1'691 hypocMCieal protein AAL11YCY

NGIR~KSFTRSFRQVIIAXKAII~lp1'IJLALFGOSPFAPLOIIIQ.~IWSCVEriI<.PIfTALGIIALL1CRVSE~
WCSKGLASEPGDSYLRYCYGVJ1LDRONpYWIIEO~iIYVAWP

LRDORYECL.LQEAIQ.VSDKEYpA0CI10~i0laNtiLPA0LTIlPISRAGILEIISIODSIADTDOVECyIFSLGSV
YEGtLKRLQGLDCFDKILALDWlIIPOSLYNKAVILSEfWEAIiIRLL
' ' AF~DVAILLTIRRWTYPSIIfL.FFRFLtiOr?.iAFELI7t!LWEp'NOLLESSFOCRKADKAL0K
EVAVAIO~IPLYfrKAWhLLGFLLSRSKRWOKATEAYDtYWLRPDGSD011Yl~iCYL
!

Pr~r-SK;RVAKSENESDVLOREtHQIFFSI~FIIPEKEFYLWLOVIRRTJ1GISDS58adWRTRLALKAFOFALFTJtAEDA0 11lIFYVCLJIIILDLKOIOIGYEAINSALSDSL.C101ARnIa INNTLEEK
YLH1MQGETDIUITKELLFLpIfKDS'IPAPLIQKTWSDPSSNOFCRRIDTIS

CPc>_0682 761913 765955 CPrt..0691 779135 776330 dppD-ABC ATPase Dipepcide Transportpbp3-PBP3-transplyeolase/transpepcitiase TSKCWKNSLFPHIR1LPKRSCKRLNASNPILOIEDLSITLitK0R00YPIVOSLSFTINDGFSOESEAIOrINSNKRPI
Uf!'PIYESIAOkTNItLISCIVIAFAVIALRWYtJlWOIWKLE

TIC~tIGKTIJ1VNOWYDVSVAYCAIRDLPfRAWIIVDEiIOIKO

IPONPpJLSWPVfTIEQOFREIItLMLILTAEVAXEKIC.YALEEItGfNDPRLCWLYPNOLIPVRKNYINCLSELLSQ
F1JILDREAIF~AIHAKASVLGSVPYLVAANfIS6RTYLKLKMf.

1501?Q.pRICIAMALt.CSPl4LIADEPI'IALDVSVpYQILOLt.KTLOKICfOISLLIITICISXIJNPOLINGVV
RRNYPOESVASDILCYIICPISLOEYKRVTOEG80LRECYRAYE~

NGWAETADOVLVLYAGPNVECAPAVOMFMQPSNPY'1'RDLIaSRPSLQPOpLCSFNPIPCPKLPL9GLASIOOVMLLE
SVESNJ1YSWALVCKEKiVFJ4CWDSKL~ItICIOIPILV0N101i OPPHYTAFPSOCRYNPRCSKILNRCSAfAPEIYPVRGGNKV1~WLYDDFIOE~OAVPEAPatKIpL.TLSAEI4AYADA
LLLEYEKT6TFRS~IIKREKi.PPLPPW

IKIXiAI IJ1LDPNNGEILANASSPRYRDRJDFVNAKVAEDSKAVRSSIYRWIrWKIOIIAEIY

CPn_0683 765936 766919 DRKVPLIRERRNPLTCLCItEEILPLTFOCFLOFLFPENSVIKLOLKIWSFVO0AIt110NL

9ppF-AHC ATPase Dipepcide TransportVTRLLSLFPYEECfCPCSI1IFDAVFPNEEaHILIOEYISL0E0KWINECWDNKADItL

CVCCt~f!"1'NFPOPLIQATSLT>IYYKRSFWFpCKTIASRPVDDVSFSLYSRMVCLICESCKEJILOpVFN6LPANY
DXILY'TDILRLIVDPEItFSPVLPSEVNRLSLSEF1'6LOGRYWIR

SCKSfWLAtxLLPLTSGFLTFNG1'PIKLHSKIK;RtWLRSQVRLVFONPOASLNPRKTISAFSTILEDAFIEVHFKSW
RKSEFLOYLAAKROEEALRKORYP'TPYVOYLEKEKTRQAfKII

:.DSLGHSLLYHKLVPKEKVLATVREYLELVGLSEEYFYRYPHOLSGCOO~tVSIARALf&FCOEHLD'T!'U1YLFSR
TPYIfDGLEPY7fDILDLWINELONGAtIRALBWNEIIYLFLKiRVSN

VPOLIICDEIVSALDLSIOAQIIlMLAELQKKLSLTYLFISNDU1WRSFCTEVFINYKCLSENLPALFSTFREFNCLAR
PLLOKYPISIVRNKROTEODLAASFYPVYCYOYWIPEUIYG

OIVEKCNTICRIFSDPOHPYTRffLWAOLPCTPDORDS1(PI!'pEYEIKt~EESC51GCYPYNOMTLGSIFKLVSAYS
VLSORILWGHNECPANPLVIIDKNSP'CYRSSKPHVCFFKflO'!PI

RCPpKQEACKSEIIPDpG1HH'lYRCIH
PTFFROCSLPCNL~ffICRCFIDLVSALOISSNPYFSLLVGECLOOPEDWDAASLICPGEK

TGLGLPGEYAORVPHDLAYNRSCLYATAICOHTLWTPLOTAVMLASLVIA)OWYVPKLL

~Pn_0684 768056 767181 LCEWEGEHVSYLaSKKKRTIFNPDAWEYLKTCNRNVIWGOYGTAMiCSOFPPOLLBRI

spoJIparB-Chromosome Particioninp tCKT51'AESIMRVGLDREYG'MKNICDiWFMVCFSDODLSLPTIWIVYLRfGEIORGA
Protein ERSCDIVPEISKDTIIEVAIODIRVSPFOPRRVFSNEEWELIASIKIIVCLIHPWVItEPMAVKNIDMNEKI~~ORLSF
LRG

IGTGDRVLYYELIAGERRWRJIEIpLACATTIPVILKNVIAOC'l'JIAFATLIFlIIORVNWPI

ENAEAFKRLINVFCLTpOKVAYWCKKRe"LYANYLItLLALSKTIOESLLOGQITIGHAKVCPn.-Ofi95 7Rt1301 78178:

ILTLEDPILREKtlJEIIIOEHLAVRLAELI11KOLISEECSSIEL1CPTPLDNAESSKOHEEhonalagous to LOORLSDLCCYKVOIKTRCSKATVSFHWM~ODLOKLGWL.iSEK:I'LSESIa"SLEYSNKKLLKSaL4SMFIk:SVCS
LOALPWC.NPSDPSLLIDCt'IWECJ1AGDP'CDPCATW

CDAISLRAGFY'vC1'VFORILKVDAPKTFSMCAKPI~C.:AAANYTTAVDRPNPAYNKIILND71 ~:Ptl",0685 76800 768317 EWPTNAGFIAWIWDRFDVFGTLCASNf;IfIRCNSTAFNLVCLFt:SIKGTTVNANiLPNVSL

No robust haslolog Dresenc in Genebank/EMBLsNCWELYfDTsF3WSVr,Am;ALy~CATGCAEF0YA0SKPKVtEWVTCNVSQPfVNK
as of 11/7/99 FPOSOYLLIFPNRILDWAFEILWQCML?DQRKHIOMLNKHHSIEIFLSNNWEYKLFFPKCYKCVAFPLPTDAG11ATA1 CTKSATINYHEWpVCA3LSYRW.~.LVPYtCVpWSMTPD

KTLK AOEtTR
IAOPKLCTAVLNLT11WNPSLG3NATALSTI'D3FSDFMO
IVSt.'pINK/KSRKACGV

'IVr.ATLVDADKWSLTAFJ1RLINERAAIIVr',.CQFRF

.:rn_0586 758373 76817c Nr, roD,)sc homolog present in ~,enebank/ENBLCPn 0646 7RI707 7R~5'n) as or 11/7/98 :\KD.SMIPtX:RLFRVIOELFFFSa'LYVCEORRPRKL7P~WHWFPIEKPRFLLKCFKKELt.'f:'le hypnrhcwr:,l Iw"chm EIFYERhI N:Y:FYVPNI'LLTY :71FE I EV~):a.F~.p:
~ .'KI: P I KDL11::N:AI IE':I
lOrl'RIadNf'KNKLY IPEE

NtJtILY I I N(.AK'T'L.~~1WIIAt.111 I HKV IOfkIKTVLt1N TKK4AKt.'V
I RY,MIEM.EFPIAE

:'Pn r)b87 759501 '1W)3l1 RWU%71LTNM1'1'IRN::II!T1111CrF.KfH.::HfIVAYf:PKKEJ1ALLAY.ItIK~KLLNNLEITtRYEIK

'T4Nt IrYp~thutic,ll pruratl:
KAf~:LLVWOC::YEKIA'IAI;IKYI/:fIVI.ALVUTEIC:DIt'IIINIV11~'NUIr:I.K::fRLItN

ItKINYdtLNIIAYRF~I'PM.'fc:FN~KLVItNIWY.KfY~F.SAIAICIVL.\::FL::LKTV::N7YKVIYFNII
FJlKIIKI.:It:t'I::IVY.:a.EYIl4a:AII:::::UI~IdvarEarINE:ItI4.IJ1KKYIi:YJW

II::OAI!ffl:: t LLLTRMbYAV.'~1:FLPSK:
AL.i:iLEYAYNIIX:E'.a'EtKPYIU:FIJI::I:FY
IIIN

I:1LM:AYYN:LAYfQI:.VAWLDHPIqKLLKETSWJA00L'fDVALSK:fYOLImAN.~.:aCtYyrfi.r7 'r4 'a'.'1 nt t~.1'1 C

'il~rl::Fl.TLt.tNIELKELL1YJDV:.'y~DFMLKSSPLFHpFFRN'fl:OCEWTL:KRFY'*Kf:r::l-Klcxn7acv,m F'.n:rmt f::

WNr:YI JA::fiF::NHT1.K'TLI\trl':
Ir:f:l'Yr y, I:AL1JN t Y:Nf.E:IJ1WYLItKI~:LA::rItSKKPJ1R

'hyln.NN Ir.'117~ 171)1)7 ETYIfi:IIMKTINH.TALiIV'rPII~.'I'Id~/ANEIAVI'kIsEY::IILtlII)ILKYKVIII'Vli\I:%!Ad ."pqal trrl!Ir..r:.:.rl 4rntriu ::::rJUP::I.:'/DELHAYftYn'Vt:EJIII!I::VVAYI1KAH'fr'I"/r7f'/::IY:M:K'I'VAI:rNI:Y
:::::

SWIDIAFOIWAAOPOFL3xF-.~VPAFJtIAK! 4're68 nypocnec:~.at t. .m '~tOCxPOEJIEKIV'fCKLN': AFK'lVKRffCM IDP'JlCf PtILDI:DAEACv' TAD :"'=N:a: FG~:ELKKC:.'.?FALC'.::YAAPKD

. TTLVOCFKPNPNIOI~DOMIChLI9ft~.~~tt.E;~INN
fFOEAt:LLECPf IKNADLi IO~L:OOf.~.K':'.SI:a~VAIAOfAD~IIIDAVOViIDOiN>I;GCNIINIOIfOODARKCDI~

' "
' ':Pn 06os 79)44) 754201 t CL;.
.
ALLYPSDItDONRF~LANFL>r!'~YAVORA''ORAELFA:~
ri'L : SV33SIKTIlI

pyrN-tMP Ktnase EPNKNNAKOTRRVLFK:'~,GL.iKLS"~1RIDEMRISRLVSELRAVPNNDIEInILVIGQCN715207 ~~574~

CPn tLIiGIJtEGKELa7INRV.~.AttOMCMLATLINCMAVADUJIAE:.IPCL:.':~_ LSCPOIaDLI' 'f~e7 hYPOCne.:~.~t Gr~t~tn .
.
' PCK.~>tEILDOr:KIttr'~':A(t:PYL'ITDT".AALPJICELtNWLtYJITMNVOf:VYDKDPRL.
..
' :
' ...
...",: : . . .:1, '. :~ ;.
.
~
, ' : .... ~yr~;
. .
....,..:.i.FYi .
:rt.: ~:'F'C'.:=.i:r..'.':..:/~i.;.......:.~TIL::II:~fPn..iv.niv'i.~
' ~ .. .... .
: . . . ..
'..:i~?'.:':.:' :.::.~: .. . ...., ... ... :.

.._ ~:lll : '.': I I' OV IANCKIESTPdtifJS: LIaMLri' WAK.:AV:PtG:

CPn_0699 781179 794721 CPrL0710 794492 796210 rrf-Ribosome Relusin0 factor Cf66e hypochscaal Protein ODTEKKMAAALDffNKEVKSFR1GK1WPALVE'IVVVDVYCT~SDIASISVADRC>CKSMAT~TAfDfNIMLOCVCTYV
ItGV00YLTELZZTSTQGTVDtL:CMfNLOFRM
TMSIrt . RS
LRQLVISPYOGNNASAIAKCIIAAM.M4PEVP1GSIIRIKVPEPTADYROEMIKOLRRKCOILSpYIIESVSNILTAVN
TCIZT>4ARAVKCS

EFJ1KINVPNIRRE71NDKLKKDSALTEWVKON~IOELTDKPCKOLDELTKOKf~EIAS

CPn_0711 795791 796484 0700 795094 785609 .~
CP
.ZdFINYLyLCItYpSMfFM~IfItKEEKt4SOPL:.DLE00M00t1oR.10ELKJLSVODK

n_ ~
CT676 hypochee ical prou:n VHKLUtLLRF~SDKFSfCOOCSLL1GYVAtw~KVLCRIt4R10tI
PATICYTEII>K~LVIRSYVCATCPCPSHYYNNEHLSLSKGVCVLT' tJIVNSPTIIOCYFICQO .
LECfrI4CKTVWttSKODD Lf.GCFlQCYTNFKNOITSKLKSERWSSSTMEKCOGSLttIGR
OTLEREDYEOAAVIRDOINHLKT104~DPS CPn_0712 799)15 796781 At nylacs yelase) ~ co eds l .D _ E ogy APOEASNLFtpLLKLI FHA dolesin: haao I
~~
~ IP~

CPn_0701 795594 796672 EE~D
VYDFD
FISDEFD
~,pprIpIyyNGVAIOfT'fQLKNE~ILS

karG-ArQln:ne Kinass gOpLSpSNEpGKDLLPROTS6fIHISPKL.TKDOGSSDPTTSCDQFlJID~JIfIJISAKAE
MTLPFIflLLEfLVI(RKESPOANKVWPVTTFSLARMLSVSKFLPCLSI~Ofa.EIbOF
I
IAKVA>ULO~S~~PKEONiUtDSPKGEBRTNKPpNiIINEDNOASPRODPOPK

KPK IO~PI
p SAEPSWOJfARDCfPLIIB4KPVElxAFHfKATPDSPEKKDOPEDGS1~OGSKIFJ1TPLDSQ
ITSMfIAIIEGFGEFIVLPLKOTPLWOKIIPLLEtIFLLPYDLV~'iP~'.EtILW'fRSfDFIs~A

INfpDHLVLt~IDFOGNVEKTLDOLVOLDSYLHSKLSFAFSSEPOFUfTNPKNCCTGLIISK~~ETDSAADAFiDAtAS
DHTAEOtHtCfPItKV84lt%SA

JCFLHIPALLYSRTTNLIDEEVEIITSSLLLGVfGFPGNI~iRCSLGLTI'~i.L~SVLSPfHVODt.FRfDO'fIFPA
EIt~IAKl04I&VDL'PpPSRFLLIMaGAtIICII6FHLDBG%

fAITASKLSVAEVAAKKRLSEfl~WLIOdLILRSIGLLTHSCQLELKITLDALS~OIf'IIDKTSILg HDOGILIEDL~KNDVIYFX:RIt SNpIWCITVC
YIIGTDP!"fCDI
V
T

DLCLIKV'IENHPLhR4PLFW0IRRAHL71LOKQAt~SRDLOKm'ISHLRASVLKFLTI(GLHP_ _ _ _ _ !SF ' ~pV00LAILPGTdTASLtHI'K
OELM.AOVINOfPIYR9'ffN

~L~I~~'~~~~ ~ ~~Iii.SIOIt~ILO

CPt~0702 789700 786929 ISIOiSPEPGKFII1GYVKTEEOAACLVDYINIH

yscCJqspO-YOD C/Gen Secretion Procsin~F~~~~L '~~~~~~IPCVRLV104fAVLLPAtRCGIID
D
LNL.RYPNRYRM~fgRYOETSIMIWNCRILTRGDVTDCKM'SIOPMIIFLfJCD3LXYK
I.IQWPVKIVIINICRItILOGIKIUOt>utICILSGLFfLDLVLLCVSSORP2'EI'SANV~04L
'PAKpt RDEKLAACPKNSJU1SLSAXKSIftlOCT?PGSIPSKVFSKFD11T01~fFOKTSGSAFIIIYHII

TLXELEERKKPRPERRT'fADVKRSPRFLP'PDEVEPVPAASKDOLDSIQ~~

AVNAINLSIKKOLEED'fSTV'fEKD110PKT011TPHASKIQ4VASPSTSHPGIDfAATTVAVP
YSKISCfDIIYfDSNDLO CPe~071J 799!17 799132 .,.._..m........".,.rree.,.-r.,m~r~rtarrmtsILELIaFCT66J hypochscieal P>rocsin LDLItBEKAGPPNtft(SIPOGTKl'fTAAL>aNfSMLEICLIKNFATYMGITSTLELDiIDGAYV

LPISEVVIGVMQOFDIOFitIVLSAShGAL.PPSADTA1ILYL0l4~tTfrliLPCRL1COSAf.OLDS

EGNVVMVRRFSGt~EYRtIVL8Il~fSEISII.SDLCLGKO

MwA-Glucawyi cRt4A Rsdoccase mUaAt*BRERJvIOYLOSFEKNLFLA~ORPIG10~ATTPLL'f~tA

g.YIITSESPLTAGAULSf'~.TSaGIRPYRHRCLSCIIItLfOVTBCTDSLifOElEI000V

KMYIJIGSKZRCLffDWTLPQKJ1LKZI01ICYRSRICTPDI~V1'IESWOCILLSYDKiTIt 'i'I4LF4C3YSDIZiRILVIUIYLYpt~YHRITFCSROQYfAPYR?LfRLTt.SfI~PYDVTF1LC5 SESASQFSDL~l3LASIPKRIVFDINVPRTfLWKETPTGIVYLDTDfISHCWI01T/0Cf K9lCVIHWILLLTCAAKKOWCIYtINCSSHITORQISSPRIPSVISY

CPn,.070J 791205 789695 pkn5-S/T PrOCSlh Klnass RSRWHOLNP6TRItSTVIKVfSPSPSF CPt~0715 901436 80)162 RKIGFMOCROGIPLPEPOVIGCYMVXKILSKKL pyrH-ON71 GYrass &ubunic H
TSRSVYNFLKE710SLHOITHPNIVKFHRYGIIWOt7CLYIAMEIfIt7CISLREYILAOFISLPKFIIKISHMAAYTE
ASILSLASLDtiIRLAAGNYICRi.Q7CSOKEDOIYTLIKBWONCIDE

QAIDIIFDIAOALEHLHSRNILHKDIKPENILITPOGKIKLIDFGLADWtrfEIORAHPSVfIMfRIGKSLKISASOKQ
ISIQDOGPGIPL.~LIt7CVSKINTCAKY1'QDVFHFSVCIN~r C

IGTPYYNSPEQROGESHSPASDIYAt.Gi.LivYELILI~ISIGRVFLSLVPRISKILA%AL~~EIFSVRSVRKKKYFI
LtffFMRGVIAESXOGST1IDPDGTfYgPTPDPSIP~1'f OPSPN4RYSSTREFIODIHHYRFtSGDMOEDLRIKDHTVALYE0I4ZQR~POfNNDFLKDKI~~TSHNDLIIDLFDAEI
TtPPLYSPL<"fONEZM.T

FISCVLYHQCYPLYPNAYDTL.LOt7VtNGWGGYSPISt47tTIALSVVKSLVC00DLDRPLLFI!'SHLF%3~TfERY
fSfVNCOTGD~T~'TAFKEAIVKG1MEFFZiKTItSISNDIRI~IVCC

DRVCEINECLIRIOCIPIDEtGISILCLEISKENICLSWIJ1CCKTtfWIKROCRVWDFESIAIKLISPIfESOTKNKI
GNitOIRSSLTKDVKEAIVOALRKDKVAPIIKffiEKlR

FSPCIGKITSLOIRETKVAWEIGDFJ1VVCTLELEESVIISLiClLSIaEI4DRROKAIFCPIKNIOFIKODIJCSIfO
KKVIIYKIPKLADCIIFHYN~tSLYGEA55IfLTG'JCSASASIL7ISRti ESIHGCIOSRl,7HG5NSPSTi.ISLKRIR
PL'lOAVFSLRG1IPFTNfSLEI~.TIOtYICtDELFYi~TAIGI?ONEIOHLRYItKIfILiITDADV

I>GKIIRNLLITFPLKTLLPLVE~iLFILETPLFKVRNKT1TLYYYSEO
1'DK

CPCt_0704 792))0 791209 KDSSLEITRFKGIGEISPKEFAAFIGPEIRLTPVTITSLESISSIt.QtYNC~XOf fliN- Plapellar Motor Sw:ceh t)anar,n/YscOIlIDId.ITDf family RYfM11V1U1DSSAS1ILKSRNNFLSSLOKTEEpVMPCFPKEItOHKIREKFPLEDVOVSIK

FRGSITAVEATKEFGVHLLIOPMNOPWEVENLL.fLTSEFaEQEI?NAVFDDASI~SYFY
CPn 0716 80366 804902 EKDKLt.GFHYYFVAE71CKLFELOWVPgLSAKVOCDAIFTATSLOGSFOWDISLRLDGK9YrA-DNA Gyrass Submit A
C

NVRCRLLLPfDTFQSC0KFF5GLHDCSDLHNIDGtOQISLSIlE9CY50LT0EEWilOVVPFMRWSELFRTHfMNYASY
VILERAIPHILDCLKPVORRLLWFL.FI11DDOKMHKVIWIAC

SFIIa.DSCLYDPETEESGaLt.:'VOxHOfiroGRFLTPSSCEFKITSYPNLTHEDPPLPENPRTMALIiPHGDV1PI
YEaLWLINKGYLItri'OCNFCNPLTCDPNAAARYIfJvRLSPLARICfL

QASAAPLPCYSRLWEVARYSLAVSEFIKLNLCSILSIGNHPAYCVDIILDGAKVCRCEIFNTpLIAFHDSYDGREKCPD
ILPAKLPVLL4NGVDGIAVQftZ'KIFPHNfAELLKAOIAI

.~~I~ INIHCI(F7VFPDFPSwINDP8EYO0IfIGSITLRASIDI
- INDKTLWKOICPOS'C1'E1Z.IR

SItZIAAKRCTIKTDTIODfSTDVPNIEIKLPKCSMKE?1LPLLFEHTECOVILYSKPIVI

CPn_0705 79)176 792734 YENKWECSISEILKLIriTAt.OGYt.EKELLLL0E0LTt.OH'IHIfTLEYIFIKMKLYDBVRE

CT671 hypothetical procsin VLAINKKISAI~Uh'AVLH.1LEPWLHELATPVTKOOT'".~OLASLTIKKTLCFNEGCTKEL
' fSSKCMfRTES
tAIEKKOMIOKDL:RIKE\TVKYLKf:LLERHCHLGEP.K1'QITNFKYAKTSILIOOOTLI
FMELKKTAESLYSAKTf7HHZYYONSPEPRDSRDVKVFSLECKO'fRCEKT

RKFADEEKRVt~ELAEVGSKEEE0ES0EFCLAENAFAGMSLIDL~1AGSAEAWEYAPtA

VSSIDTQWIENIIt~~TVESMVISEINGEOLVELVLDASSSVPFrIfYGANLTLVOSGODLS
' CPn 0717 H0~~69 HOSJOti ' t CT556 hypochec a:a1 prccstn VKFSSFVDATONAE11ADLVTNNPSOLSSLVSALKGHOLTLKFStt6NLLVOLPKIEEVO

PLHNIA.r~TIRNREEKOORDONOKOKODIHfEODSYKIEEARLIRIKFIDTLTIWRMEPRHIYIRKPETPKAPDVEKP
I1IPEYMTMANTITFa:PVKTLOOL

RRALTEOROAEEDv'KFIYDNFIOSILISfFCLVHKt7ttDPJWKJ1SKRMRfVYKEQ

C~ 07Gb 79)689 79)180 CT570 hypothetical proca:n YJtVAK'JPLEPVLAIKKDRVDRAEKWi(EKRRLLEIEOEKLfIEKEAERDiIVIOJHYNOIII00C~ tt7tlt ptt5700 905626 ~

S.RDLLDELTfCDAYLOIKSYIKtIVAVQLSEEEE1NNKOKEWLA.VKEL.EKAEVNLAKRRol procetn CT5'7 hYfMfhect.
~.fTYFf.ALF\'MtINOEItFLC~IHCRWAPF
I N~PLYLTLIADHDTfY LrIKNLDKfPLP
RA'/M

KEEEKTRLHKEEiMKFtL.KEEARAEEKEODEI1COLLFOLROxKKRESCGS.
VEr~WFJCI~Jf.iTl.~..~.LLK I FL:.~.DLiSLItLL.IU.TKFE
I LT1JIDLYt.AON I

':Pn 07n7 'I'1501~ 7aJ7114 :'.Fn rfllr NP~477 HOewtn y:a:NYtttt N IFIJUnllatTyMt ATPJtel:ant: II:,rt.Irml I,linr ::ynrh.naI
'JNIIDULTTDFtfftNSOI~.DVNL'I'TVVCRfTE.YVCMLIKAVJPNVRV::6"ICLVKRN1~1EPLVKKVIKt::
FIKTYf::ffVC:Y.>?t:J:Rt.UK'fLTE'IltfYY,':RAYYV171IL:a:LVOINPQ
FGfYt 'IfEVW;t'fll::FAFt_:PG:EL::CI':;FSSEVIPTi:Lt'I.HIRACtK:Lt.:I:VLFY:GCEPIDVET.
C*S
Itffl'/A'fItIJX.'rat!\'.'IDtv'F:KF.EL1.ELLI'FJlfl't.l~Y'~IF.G:MIt.VINII1'RDMMIP
AWi Yr:fIJJtPJlt~rfln'IF'ItAFi'DCIAIRwKLR(?IL7T~:VRf:tCC:MLTV\I;.:ylt(CIFM:ACVV11A1 JJ11:1.:EItIJCKh:Ff'EF:1WHIv:IVIIftI.LYfIf:X:LIITAKTIt~tAK.YyftIELF~
IIF'IY:f1 .:LUY11AHNABF7Il)VMJIALA:ERtiREVREFIF1:OL:EfX:MYn::\'ItV:.'f::D(J:::iOIJtLN.
'L:KI'k::l'flIr191I::It11VN1fHKFMI"r::ll:YlJ1VTIIr't.WIJItIa:K4~.fV
::Y1 A'/n 'fYhI
YY

AIIYWrfntAl:'tFl.llr:KTwt?iMD:;YfI:FARALtitr:L.AM~Er1'\HV.:YTr:vf~aTLMtL.
' .
.
.
AL::I'hlt:lrftY~l.lt\'IIMKlllatft Ill:ltl'l/lft:lP:attl.~..':Yr:l.fiYtXNJIA'i::Vifr'1111'RTROF

(IIYfALOVLA 1I:IuMIt:'LLtYi:FtiNl:1'fIINKNLI.I:::ILYFi f.ER:Y:A:a*t:fl'fAFYTt'LVr4:f1h19'IEPVADL1't:.'.ILff:IIfVI::N,IIrWA'::1 YM:1 .a::f<L.LTAIVf1:19JRftIIv:KARfVLAiCYK.WI?1LIP.It:I:'fIId:::ItKfIttFAIDIiLOKINR.

.
( ~94 FI ~K(~L I tll:KTtI7FGIkV LHA I ttnn: sr.
FR ~'I~t, _ l.ai '!~
' ':Iw_t~l'W '/"n,ytft '/r5G14 In.m.,n u'1".'.', I,yl..tlvI n.~..l ,1 ..:.:.VAKPDI~:Y:: ~EGATIKAIR (i:FRTVPFLPY.:IL.7t::.::.c:., LR ITMKEF Irt'f : tKNL'JDRPEcJR . f HFFLRIIC:iLI:RnIIFPFRRF.:RL:Y
IKEIQCfttT:. ~..~: LF.~LLIE
~
' ' . ..
MINf:A:.LH::ENKP~I:LAPVWNII
.If(7411 A'/IJrVVLOY'/EEC:"fLM::.:.TM:LLPt TLL;f~VA::PNfIVRIIGLELMEEK W f PfYIMRHSDf~R
C'I!:~.tSfAG!AI~P~'.x>:WLT.'.iAitGII~IKFLwf'1 ~~LL

'.LTOIlt'..'...-.DI"IJ1RYVTIEIw~FLYL?tY.iLKIYOL'iT~~
APt~"i7GIL:5 PA

':Pn 07.',: 90771 999199 .
t oRCYOREDMERGf.KLMKFtfhTLTISVNL
LY'ACLL:.:rILPC:VRVf.YEHCLPpQSAVYlI:

KdsA-KDt7 :;vrsc netns~
VRVLRCY'SASIIPItALAPLV:SI:.FYAQROYAVPLF:~:',T.'A:rIN:V:.S:,VLCRWVLitDIfS
.1PYSDRIOWFFY.:.>~IDKANRSSL.v LEIIIGKLOS1:
:AG?C'JIEG' C:' KML:
MFPM

.
';YAT.iITAWJOL'lf:.ilYY~SKRLPM'SKLWE.iIRRSIKVIC'.'".'MIUICItITUiIJIILT
. ' / ' c ' . ' I
LRI:r\KVKETF':Vf::LT~VHTPODIIYAAAE1.~\IG','I?AFLCR~':~:.:.VA

:FR!ax:LT
W

. ApAIAF:..
. F.
. CIFI 1FLFT,FAKLLRVEDLINLaS!
'.',tE~"'.AtVNt.YY~F:..~.PNfIIt!:PTtIY'f!.:'f~.tlflY:;a.TF.P~'..~~r~,Wlf~;r:pMp SIpVL!YW
:T"'~YVIFLNFLTPf~ADt?:r.
~i~.
..

.

., .. ..:. . ...~.. I..-...: .I.. .:.,::
..:..~~::~:~ .... ......,;:''~:.\~:~yv . .-.....-..y.,t~- . _..

m. ........._. .r._..:': N V. :.:'.
. . ....:~ .'. F7 : . ~:. -. ~ n .~

a08971 :eneoenk:~BL as oC 11;7/98 0722 roWtst r7omotov present ln 909177 Nc CPn ' _ IPDNILKIRAKETSLSFLLIKPPSPPPLKf:DYLFDISpYTSSE
CT651 rlypochee7.ca1 protein VAIAISANIF/IRL,CK
"

YCLSM'KFLYCLFYSL;.LL~JIFGTNVAIIOVDOICDVSCIOaDIPOGPPFIJIIKI(VNVTLRLRSISIIS

LTICCSYFKLNKASLGS

SKQICSpEERF!'NCKIDKSCMELNPPOSSYSCKEYLTRISIRIIL?tINFtKQMpIRGNSGL

LNYODCSLHVYDCRFOVDPVP~YGSPDKEDSSSGGFGCTLYLSLPRN)2-CPn,0 nEoBhdonueluse tV

072) 908979 809703 NPl9IVL.PPPSIPLLCAHTSTACGL10JAIYECRDIWSTVpIPTANORpWDRRALJtEEVIE
CPn _ DPKA11LKETCLSYIIiSHIIGYLIHPC11PDPVILEKSRICIYQEiLOCITLr:STVNItIPGA
yhbC-ABC Tranaporte: ATPase ASNPILSVCNLVIOfYNKIIPVTNDVSFOINPGEIVGLLGPNGJIGKTfAFYLTVGLIRPDSGALKSSKEDClB~IKIV
SSFSOSAPLFDSSPPLWLLEZTAGQGTI,TGSNPEEiGYLVt~t.Ri KIIFKMIDVCKK!l~ItRARLGIGYLAOEPTIFKELTVODNLICILEIIYKARKpOSHLLNOIPIGVCVDfGHIPMGYO
TTSPOGWEDVLIJEID

TLWDLOL:.'S~~1.HKKACfLS~uGERRRLEIACVLUl4P5VLLLDEPPANVDPLVIQNVKYLHAPLDEGYIGKESFK
FW"DCRTRKIPKYLETPGCpENWOKEIGELI2IFS1QiR0S

IKZLAGRGIGiL:TDNNJUIE~IADRCYLIIDGKIFPEGSSSONISNPNVKOHYLGDSF

CPn_0777 927779 827101 rat-81 Ribosomal PrOteln .

CLKYMAAYCCPKNRVARRFCANIEGRSRNPWOCPNPPGp!lGMQRKKKSDYCLCLCiKOK
CPn _ LKACYGMINCItOLVKAFKEVIHKOGNVAqIIET.ZDtPECRLGBNYRI~ICPAK':IFAIIQpi.VA
No robust homoloQ present in Genebsnk/EMBL
as oC 11!7!98 RTSTRLOYRSGCII.SKILPFPFLWIQILLGFLCDCpCASWOCMVAIK~IDSVFMSRpEHKPtK:HILVNGRRVDRRSF
FLfIt~IDISLKEKSKRLOSVKDiu.ESKDESSLpSYISLDKTCPK

NIPYITKJ1TRRGLRNKTLAYLASLKDARQLAYD!'LKDPGSIaRWtALIAPKE71L.0Et4ILGELLVSPEODOIEAp LPLPINISVVCEPLSNRT

FFYGCSNIEDILEEIBIRPNRILLJGFSYCOKPKIw.pDGRfNDACRYDPSHPIrASCSTGT

MMRIlIARRYTZYIIpTFIDIAKHLHTLIOtRYPGYOILFAV'fACELSLKMFGDYAS11l8rLKCPf1~0771 82786) 821915 GVGIRL1GRICNfPKAFKLAERGVKpGVt'ILEEDCF~1LARTLTEYSSAPFPRDFCEINYeM

QNI'IIDtFSSNf~iF'LOCNYPODYVRVFI!!>DQfY7CALAYYYI'IRVDHPHEETALIiKIVLmL

OVSCRIYISEpGINGOPSCYEpHAELYMOart.KERPNPSKIKFKTNHIKOiTPPRT1YKYR
CPn ' _ KELiIILLGCt:VDLSKQAaIISPOEWIiEtG4FNRCLILDVRHHYEWRiCii!
CT552.: hypothetical protein ~TLPDIOlF

SCGWGMFFAPLLYESLRRGL?BiPTSNMpOQLARLEFINDOL'1~.'ELEHVNF3.tLSLCFPEREPPEYAEKLi40EC
DPCITWl4IYC'1'OCIRCELYSPVLLEIOGP'KIYYpL00DVIRY~OIf CLTTIlUIAEEVLSDDEPLLD
C'fCKWL.GKLFVFD~IPIDESDPDYAPIAECC!!<l~l'pSDAYYNCANIOQtfILPGIxDE

CItIQf~CCGEECSOSPRVRXFDSSRQJICPFRRANLCEISEN&ES~LI

CPn_0726 917381 810880 ' 0735 825680 825007 CPn ~: _ 620 hypothetical Drocsin 'Utidine Kinase IUridine ADIOrlIYSrSISTFYKKLSLVSSMHSFAORNRESLEHI11N1fEKTfA~tDTLKFtLTEVLDQItonoptlosphoki nasel IPyriddine RASERYRSAVEKLlIKYEVERATVAKSIPVAAItff7IPLSSTHASVO~Yt'ASTpMTGSGVCJ1Riboffuclsosad e KmasW

YYN71VK0KWAQDLIVELN'fVtff'1'INASVNSKNPANKDIRt>XLNZ'8LOALVAAG~.TEZNGEK!l2MlCJ9II
IGITOGSGACK'ITLTONIKEIFGEOVSVICODNYYKDRSIfl?PC10171N

YOTLYNFPEEIITAIQRACI'f?Gt;l9(TD!'iNOLAGKYG10ATLTCI'F1IDGRVEGPKDILTLIWWIPOAFONDL
LISDIKALIfCMEIVOAPVFOFVLCa'NRSKI'EIETIYPSKVILVOGILV

AVQCVLTPEOtI'IPAEIATELOAL71DN,~IPDEAGLORILDI1CEIG.itAVTNSSDLTIIFiDKPENpEL.RDt~T
RIFVDTDADERILRRMVRDV0E0CDSVDCItISRYL9lRIKPIBIBKpIEP

INFCOHITDLYSDpVAAIGSFD111LDIl4TYVNON00TllF8NLS5FYGSLTGTPJ1PIDLRSTRKYADITVIKafYR
pNVYINTLSOKIICIHLENALESOET1M11MSK

STWALNP'fATMDHVKAAILEEAKELDN8SF0L~1SSIKS11111'SIVNSSGSFSVTVNSS?L

QY1'IYSEKNGKVEINOILLNYGS1GPLPEITII1.A><CNAESTARSYFRPKALAAVESaiVO

NKIZtDt.OS0L00FTN9CTELFDGOLLSQASELRALPLPSAVASYL.IDRYMPlCE110YIHET

YKKLYYSNLCSSIGNSIIDAISOYVNGATYFNP'ASYVCOQPAVCAGGANAPPCSOESAOA
KLOQERKQaALYLOE'fRGALTVIEEORARVLKDDKIII~lE0RS1'ILDSLRNYEDNINSISG

STIQSDOQSFADMGONFOLDLQLOtI.TSMpQEWI'WATSL.QLI1J~QYLSLARSLTG
CPn_0727 81)559 816192 CT619 hypothetical protein KYYLFSMSTFSIQNRLRTISGESTRI TKLGOfYSCFDPRSVPJ1INLEELNSCIYALRilI1f NALOSENTNVMLIlIpNNfTFp1'TSWTCfIfIWSRPOJISSpRAPSSOTP?DIVSAAARALVL

VIDGGIJIEi.VASVTEIDLGUSTISTVROLJ~IASYLCL:TLTAEQEKWfSSSYVPSEIOJL

LEHVKpENAAEIOAKQEEI1WVLEJ1KGVSTEEIEAILKEYPDIYM~'PKCFIEEPLHTYCPef-.0777 A27ti69 8)0756 RAKVCAPIOEITIENAIOLLPTPPAITPDNVNEVNGIQPI'LSTILOAIDDAIKOAPALOCDOreeC-Cxodeouyribonuclsase V. Ganm EIITILOTLVPLVDK1TPTKAEPDLIYTATOLpIfI'ASLKLYLTDROIAEYRCKTTXVYQNKRSAKLPASGASKAKGR
AKKKLTDERIFAPSVRVLPbNRIOrAKRNLYKLSFITYRKCV1IP

SIONLSETIIRVVENNRSIa.L'fpLSMFOQMnICFV'IWISOMIAI1~IIAITNKYISAVLTtSMSAiNDFPLTGIVI
RfATKNCRASPSNSpIWLLiWLAEDLTSTIIpKPPTIDBiILVJWRT10H

EMYOGLLCLSYMYERLIIDDEKAIFDKSVNEyLPIHIV1AGGSWVNatIAKHAAYQELAEYSWIID~IOLVHVLSDHIF
MGSTIFTASDSIVKHLPLGSGCSOPNIPDYLTLPL.LINNILiEIS

CG1'AVTSOOOLKAYCpTRGNEFKATRNPFHNICDQMY0P71NE'NFCNCLTI'ANG11I0PDLKASKPENGREPLSPP
TYEZTItiCIJtAAFKOPNTFSORPt10r7frSNY0EL1'pILESNPS1YEE

GGFiREAIfrNVCTVEADYVSNAORILNEFNCMTAHVLOL0L0IAELOKKADDLDPGKASMFTfILTBJIt?pEEDCSL
HIFCYJWLPKHIJ1EFPINLS1'YPPVYFYCFSPCIIEYIGDIJSD

F'fENRIIFAVMWITSESLGDALISMII1JSOLPKOEJIFLICPLIECINPFB~ttJIANALNSt:LORAIDFPWNOLP
DSPI%NAWF71YVLSDRQALIJ1M.IWKSOSS~FFLDREIOYOQ?LPiK

:TNEFSTTSVYYSLSSYLVOSK'LGpNLFAGDYYETLLAAAREREYIYRDTARCKOAINLVHDSSLGVIONSILDLKP1 'SPODFSOTKCtICIYRALNIPREVOVPCKV'L'ELIJIRDVfPE

NGLLOKINSLPGATSAOKOEMLNATTYYpYSLSVTLNOLTVLESLLAGLKMfipf:'SNNKEIFILSSNIESYKVNLNA
IFNPHVPIYFTDEVDPRAEDLPIdKKIGLL&SIt.~i'QDDGNYIL

YDKSVFKIESFDDWIPTL1ALESFLTSGFPNISATGGI7GPLP11?VOSI>QOTYTSOCQ1G0OLLTHPOLOOPIDQNK
VPYLIKJfLSSEWGKISSKDRASGpQMKAL&DLILECYPPIIOBCx LNLIAlpMT1'I00EWfLVSTShIQVLNGIISOL.AGAIYSNRVSOVEVWKiTVPLIYFIQERINLYLSSSOHSYEDLF
~1VPSCLEKIFVLSPCII'SPI't't' ~

LRNSLPPTPl~$SCSLLFFTDFCLDFLLHFHKPSPLYDKpGPYICSLSSLSLIP1OCYIf!'I

CPn_0728 818187 816525 Lu'ANK1TSSDIFDLIJJRTT'fIIEELAFSSTEDEtTrFHPLQILVSTKttEWISYISSMQPN

HLPN 76kDa Homoloq tGT6221 LPSPfGHNIKE'ILDLPVE1'LPTOPYLSAFFKNKACLHTSOEYNYSLANJIlY8KKALLP8L

VFMVNPICPCPIDETERTPPADLSAQCLEASAANKSAEJIpRIAGAEAKPKSKTDSVEAWFIPTVKOyNLpOHCSLNEI
IKCIFSPLDLFLKTNYNLRISYPENLKK00KLPt~1'IDpIED

ILRSAVNAIJtSLAGChCL.Ia'SNSSSSTSRSADVDSTTATAPTPPPPTFDDYKTOAQTAYt1MECPVDKEHDLLF.' ISPHAEELFTYYREKTILLRNCLDKDPKtISPYTVrPS8&I166R

DTIfT,~.TCtrIDIpAALVSLVDAVTNIKD1'MTDEETAIMEWITKNaDAVKVCAOITELAPYNE..~YLFPPISLSF
~\:NPVOIHGTLHCVCNFl'.nLYLCSIDPRDSLKK'!'fRTIGSLPLTSS

KYI1SDNOALLDSLGKLTGFDLL.pAALt.OSVANIMI(AAELLKEf~ONPWPCKTPAIAOSLEOKOLLERYVAL.AVL
~'M>roHL:SDSALIKLTSFtrIKOMHPPPSOP~YLRKVLlIIYNLN

VI7t?TDJ1TA?CIEKDCNAIRDAYPAGONASGAVENAKSNNSISNID9AKMIATAKTOIAESSOPIPLLSPL.CWKTL
L'DEEKFHOAVL..iAISEF.AIfHPSLPIFWOPHNRNIECILiIVCAS

AOKKFPDSPILOEAEOMVIQAEKDLKNIKPAL7GSDVpNpGITVCCSKOpCSSICSIRVSMERLKILaLFRGPCE.\t' LLDDAENEfAS ILJds'GFR(.'N I HNPNTENPDSOAAQGELAAOARMI(AAGDDSAAAALJ1DA

QKALFJ\AtirK.IGOOOGILNAL.GQLAs'AAWSAGVPPAMSSTCSSVKOLYKT3KS1'GSDYCPn 0778 910719 97)B95 KTOISAGYDAYKSiNDAYC1URNDATRDVt68~IVSTPJ1LTRSVPRJ1RTEAt6CPEKTDpALArecB-Exodsoxvrlt,'onuclense V. Beta' KVt:.CNSRTCGDVYSOVSALOStM0II0SNP0ANNEEIROKLTGAVTKPPOFt,YPYVOt.SKFYLFCEYM~CPFNIF
DSNSSIOr'IIFF4EASJV~CCKTF'fIEpIVLRALI~:SL111Vls1AL

MU,STCKFTAKLESLFAECSRTAAEIKALSFTNSLFIOOVLVNLCSLYSCYI.OAITf127ASTNELKVRtKDNLAOTL
RELKAVLNrOpASL?"l"ILOINCNVKOIYNpVR1i11LA

TL.DOM.iLFTIHGFCNFS'LEpYPPffTRLLI1KNPALTNSOL'ILHNI'MYLKODLaIKNVL/QE

:an_u72w 81.1905 x19591 OFNLIJ1VRYNIT::KIf."::3LVDKLLA.TtTOPICCIF.':GRVfRLEOIw(JrMOQIYNSL.LdIP

i:NLPN 7tk0.7 fk~mnltm ICTI:':17 KpVFLDOL'~WI:.CFIIKOPF::ILGGLHfIFVDL.LYTSETIf.'LFSFFKIAETPNPKIIRWI

f'AWi.~.V:a'f.NIDTKDCMKKtWY0WCr11:WL.LALTL::rYAELtL:iPwKVKSH'I"!'f1'LDEVKYNff.' MFfIfLENlI:W1'ERTLISF'CfILGftIt'NTLL%DL.VEYL.IIQNYTMnR.iiPDESVPALiKL

D'f L::KRf:fYETItKt)DCVLR IACIAIRARWLYFRED1.: :.~.FJI4PVl'(? ALJIF.~ Ys?f.VL
l CtIP:a)KDI!'fNPLFV NR'llt.~.EFYL'fI DEf rflrDK(/;~1.~.I F:itILF
I81'KF7~fLICDPKQ9IY
I

IffnAEPNWf..~.::KNNWfAIMX:ENT',W:VDINRAFLC'fRFYKtIFt'fKTDFFMEff:R::uLCDf~IR::AD
LI'f1'1.TAK:;::F::EWIyL'rL'rteNR."~fYIJIEAIIIpIFGIIL::pFLEIPtIYLPILY

a.iE::EVVfV:.NF'Li:LJItYWTREL:KD1'1'YrJVIVII(T:PFV111JMTKK111'A4A7VFJ:ILNRLPKII
AtNI~)::::LTFENf'TII\t'tIIFFF'frfIKOrJAI.'rIIF.~.EALPWIf0DKt1'LW11VVLVGDSH

art'tlKt.'::WLVM'PhYt~t'f:TfT:KAA'fNAMK'lK'i.:'lY7trWt.l/r:YIC;~VtNtNYJi(KPLILY
nJAFELI::'fATtMI::F::KNK:aFIt:.TE'fllIl.TT:.t.LEAIWIPENYEKILIKLIi::.~.LFf;<..:L

:AFIJ411I
\KA'PKTPf.Ir:KI~Nt.AWI'll%'PIrI:LRCM:IM::A'I"/It1'!:1'VF:~1L:VPEIDV:I:17t'VI'fK
K(77FTfYF'v::iJC:'ft.:IIIY:LIrYfF"/4'IMP!(Y:IffLF::::MH7M..IF'QDIEKLCUY

7r:Rt;NLI.1:FWF'A~yIIMM'Pf'KFrINr:FTN'fKt:F::AL'!M'h:fTf.::l.::rWYi:A'I::KPANnK
c.l!Pl::::'ft'YN~~LWII.I:NF~.:Rh:rilh:::l:LAt::::'/::I:()LETLY,LTTfIL:::IGa.EYD
IVA:P.

Ir:::dFfP'7KFl7It:1 f::Ai' I fK::KKNK.~.::::ELlJit7IYVM-PP
AYYVI.YLh I:: LOPf::IJJR::::AL9'NYVKLP%'.La:BIIYD

LA I I IIJIUFJ IPDLP 91'::l.l'K
UIK :IIL'rt"f I Jil.lLLkTFAt.Y'lTPPK'f t YaY:::.TKPt.LDlIIK

~.'Im oW n t:t:w ::l'.r'n.r IR:U::It>Y:.KLt'1:'Ky\~LF'~:I:KT:fLfIIYII.h::a01'::IJIf/rEYtM:
'CINHPIKHTIILOP

nrviN Irrr.Ir.rl MIYNI7t.rIN:
1I.r..7rrVEPTILKU~:A"fFF::t'LTF::::(~fF.':1.:?ryLIiIYIFttET.':FLFLFaI)h:IJ~IIr:
VIDLPPEHE

.:n:P'Kmr:I~Jtr:IM::RKIdIFI'::I
r:KYYIIf7WK'P::FU:IVfN::DY::K::111.::IYIKrrF:YI.D'frl:l1'NKAVItKFWL?FKIIKICNEt.

\It::fFNII:Y:fI~w::hf'h:ll'Itf:I
VLt'1'Ytr:ADfIVMfW

:VIF(Yt;;::T'JIa4GFFAL::::EDIPNFNPKd:'J!.LCCDICpIIR-HTH Tr.(c: :on.tl Rl~uaco~r !H e':~:ean . RecelVer Daman ':~ n~fv al19I2 af79rii KITDFILRTNSYIIGFCTNh~DK;TS.p4~'P,.DL,5L8~
IrtSQI~~;'~/I F
tI
~PEEDI:
PDTF
' ' xfrnrrca~ E
a: Dr~cein SII
'T Ir;w t E
QKVLJIpGR
lCYCL
ESVAIPCEYG4LPE0lF6P

ryf OAVIRAFLRQNEt~tENSIPD~ffTfG011TFRVLNLVIESPEGS~~I:.TPSE71GILICKi.LINR
. ~
':KVLFKLIL"f:LRNKI!TK:,~..'t:IIALCII-iFRSLFQFIYOKIRS3FVSLNVKFFPKIKO

AP.~.:dlt.rWLEi.ENLI~Y.ER'!.V~LCEKLKLYEV3NN1'PPLFPEiLTPYFHKLVEGKWYRDaLRNKLGPYGS
KIWI14.'IfCYLFSDDCSIP
GHIGLRKNLLAEIKCKfKE:IARNVDVHRi 'fTlB.ISSSCwr/NV.KTIi~IKKN.iPVL,SCNVLIf::L'IDYVCEItOSRIRLITDVCMKPSWAtIRIANtiONT
ANPNEE

~.D I7.':W41IKI1.".LREL I PQ11EO
IaHAY TLEKDN': K I SOLOELD3LI~EGFNQALLRCIL. ..
.
~~
~

.. N,:;~~: . . ... :.::IcTr,:.l:: , .
..:aF-f;:...,.,. r-=:'.,';,as=. .;,,-~ . . . ~ y ....
.
.

:':;.::.::. ,., .;: ..::::a::.:::.:..:..::.trc.lzw;...., ,.
-..:: ...;,...."., MFRCILFG1FL:.T:Fsa.;~:::. Y YLft::HOf:i:GPKEKaRSVW
I EEEKsFTDfVLNIILPSO

0740 97b054 874861 HONLHILCFOCFLTrOK,70KFS01EKIFSK4Y0E7vODCpfLFKEEiLtSRLINSFFLKTD
Pn _ VlIETILCLLNORCPNSPYYHLFNALVCYKOKLYA>:lfTE0ir1Y4pEEKTRALAPLtI4ISIE
cyrB-Aranacae M Nnanocranstsrase SYNSFFNNIPTFSPOiIILGLCNVPfADKRPEIfVNLVICVYENPQKRY0GL5CIA1UQTVIOLLTDFLLDYISANSLI
EOKNFPOGRVILNRNINRLIJWECEWNAKTYDRIAILLSIIfYf EEEQ14XSYLPISGa'.QIFLDCIRCLVFCiJIVDPSAIVGFQSt~'l'G71IJII.GARLLSV11KGSLELV8SK511 DIYFOYYENVLfYLKKIYIGEpCPYAELLPLEELVSLINENVfILPI~C.Y

GKVYVPEpIWSHtIIRIFSOECLEVIRYPYYSKDQKpLLPEPGIAfLKEYE1C45VILLHGCPLIQLLtiIYlQKHYVN
PNSSLWOILYDRfSTNNFGJ1IRFCE71LVSFSGLEG.IKaQIIZTF

CHNPICVDFfE~IWKEIJ1ILHK8RF3.IPFFDI'AYQGFAHGIELORKPIEIFISDCNTVLVfEii.&NIfVGOIICI
EEAKOC'/A:.LHILDPS:SISEKL.rILSSD':LQNIVSCDO00lfrKLTINY

AASSSKNFALYOLRVCiFAVHSTF1'DELVICIHSFLEfltIRCEYSSPORWCVEIVSTILSNLDLWF~1IOSYDI~tC
OLVHNLVYGAKDLwKKGrh'DEKrI.N'..;.;L\:.RFTSYDIDClSWF

PYLKEE110SELNFIRESLGKNATRNW14RKVACNTFDFLLSOIIGFFAYPCfSDKQVLFLLFIKOAYKOALSSH11IA
RLLKLKFISEANIPSIVISFaEKANFL1D11CYLF11NlIDYOKC

REpHAVYI'IAGGRJBJLNCITEKMIONVVOSFIQAYELYLYSHWLTKVAPSPQSYRLAGLC
LMENKRYDEALEFLCNLSPNDSIh'DYIITOKAiaPICQK

NQSKDRAAS

CPn_0741 838387 976185 CPr>

greA-Transcription Elongation factor_ ItIFRLK'1GI::.fCYLEKIpb~IEEGOSANFLSLWECYCFNOWI4GRELVEILEKVKSSSL'recD-bcodeoxyribonuelease V, alpha-ASLFGRIVD'IWPLwETCIPEGKDKDRVLQLILDLCfSNSONFFDIATEYVNKKYSGEENF4WALlffEFAPFLEDLVN
QQVISPLDIAFASKNISSDFEESFVFLWSSAIiiRYORpfiSL

NEALRWCLRDGRDFQFSLSRFDPi~ODBIKGNlVFlIQOGWGVG)l9lCtfSFLQOKVLIEFEEPI~IRIRPSLGCIS!
'IDLYRGPIOJLPKtARDKLFVWSCRLYLRSLYTIRS1ILLDKLBLLC

GINSAKDISFE1'AFKSLTPLSGDHFLSRRFGOPDGFE7IfAKENPItINILLROLCPKTASATPNYfPPSIDSSILSE
EDNFIFMCITOCCFSIVSCfiPGIGKTFLAAQLILSLVKQpPK

KEIKDELVDLVIPEAOWNRwNOSAIITKIKKGTRIISPONPKEPYVLSDiII'sCStPIGOLERKLRIAIVSPTG101T
SNIRQItJOfYNIFDDNVIlK)'fVNHFLQEYAYRRYNSIDVLLVD~1 G:LSLNSAEKISLIYHFIRDLJiSI?.IG~tIEIRIISLVKALODLDN60(rTdCSLILORELLLSEVTPOLLYSLVpT
irpCYEKDK1Q.YTSSLIILGDTtIpLPPIGICVCNPLQDLIGYFNFXfFF

YL.GIKDASI~CEYITSLSEDDTSRLLEI~BIPIV71LQKSFLSLVRKYSSFWQQVFlI7ILLYTLKTSNRAKTCVVOO
LTOSVGRGOlISFSPLPSISSAIEVLIQJRFVKSLROSGttICVt.TP

TSPII9tDFVYKTIlQ4DPSSVEVLKKRLLDSAHQPIttIFPELFVwFFLKIGrBtEDCLFDPEDMRIICPNfVLM~rI
ltINORWiSDPDLRIPIMVTSRYETWGLFNCDZCLi.CLKTQNLIIfPO

KEVLRLfLESJILNfNYOVASI'PNKELGKKLHHYLVCORYLi(VAOllIOGIISLPPLKELLLLNEPIDSitALSQYV
nIYVI4SVNItSQCSEYDEVIVIIPKGSEVFCVSILYTAITMKIfRVSV

STKCPOFSSSDLNVLpSL7IEVWPTL10WKSNVEEFiiVLwSTSESFSRIOIAKT4SLVGKEwGDpEfLNKItKtISNf NVONJ1KEIEDJ1RSIGDLRENSFIKFALEKRARLpEEIRVLSEBINRARIL?IIDLVF'fINN

GVCCKVTLKGD~AGEVVEYTILGPwDADPDSCIISLOSKLILQr8QGK10.tiDWItQCKEY1CCPaL0753 IgRIQSIWEEl~1 No robust homolag prssenc in Gensbalak/ENBL
as of 11/Yf98 IM71TAHLf.RQALLNLRSfr1'PAIR7~1LFRQQSNSLI8~8iVLF7IGDIVCAIKNSTAISRIU1 CPIt_0712 938442 878888 LGSSHYANAALQKTDGFLCMOGVNI'J1VJ10ANLWI'.OLItJGS!(IfETDEE'DGCLRRC~J1D

CT635 hypothetical protein AE(x?lfOu.TITGINARIdSKi'IGTATFLNEi4tiWSLGrWJWItIQC>M'SCLNI.

TKNMVIVI4VSIISAQKIIDSIKGILTIYNIDFDPSIV~SSLSSDSDADYEYLITKTOEKIQEVATOCSLTESSISLYA
ILSTRPITISDpENPNKPSAEFAARSt(AIWiIIPIAwt.GOWOLV

LDKRApEIL:~SlSXIFAMiPDNFSPEEWL71LEKVRSSCDEYRKETENLINEITLCDAI4TLSLFLPAIT~VLIMAI1 GLISCVINFVIfDYJIKIG

DLHpTKESKRPlCptaSSTKKNKIfKNWIPL

CPrL0754 851781 851040 CPn_0743 838956 840761 rs20-520 Ribosomal Protein ~nqrA-Obiquin~fe Oxidorsductase.
OFILId.XVLVLSCDIIUIPKRPH1001VI0RRPSAEItRILTJ10KRELINNSFICBKVKTIVIOt Alpha-IFMKITVNRGLDLSLQGSPKESGFYNKIDPEFVSIDLRPFOPLSLKLKVO<XsDAVCSG71PFFJ1SLKLDD'1'QATL
SNi.OSVYSWDKAVKRCIfKZxtKAARIKSKATLXYN~MB

IAEYKIitPNrYITSNVSLW1'AIRRGt80t5LLDYIIKKTPGPTSTEYIYDIaTLRRSOLS

EIFKtNGLFALIKQRPFDIPAIP2'OfPROVFINL~PPTPSPOIHLALFSSRHEGFYVCPrL0755 851579 P1IVCVRJ1IANLFGLRPHIVFRDRLTLPTpELKTIAHLNTVSGPFPSGSPSININSVAPITCT618 hypoehscieal Drocsin NEKtWFTLSFQDVLTIGNLFLRGRILttEQVTALaGTALiLSSLRRYVITTXGASFSSLINYKDLPFIa.LLVRKWGN1 'CfKYWIYFLPWi'LLLPLV'CYPFLSISOKIYC1IFVFITIif~1 LNDISD27D'fLISGDPLiGRIG%KEPFLGFRDHSISVLHNPTKRELFSFIRICFNKPTFfFALMRC~IOLIITMVGLL
QTKIRKLTENNDGLRQIRESL1CEI~Q$SJIQIQIB

ETRPIII1TDIYDKVNPIBIIPVVPLIID1VIT10~ffDT.ALFIQ.OGLLVKT1C0OOKLETLLIJtRTL~IRCLIDI
pVpSLIOECGEKTCEVpliSBtIQ.ALT

NEt~GFLEVCCEDFALP?LIDPSKTE~Il.TIVKESLIEY111tESGILTPNQDLAY001lIl4DEYQA1'FSDORNIQ
.DKROIYIGKLENKVQDLMIEIRNLLQLCSDSAIUIIBQ

CSN7IYLGtiISLQLSSELKItIAFKAtZtIEAASSLTJ1SRYLHTDTSVHNYSLEICROLFDBLR

CPn_0744 811387 840389 EEI~.FVYAROSORAVFANALFKTKn:YCaEDFLKFGSOIVISGGKQwIICOt~lBll1E

ham8-POrphobilinogen Synchase CSGRLVIKTKSRGNLPFRYCLNALIDfCPLCYIM4vLYPLHKEVLOS

ENSSLTLSRRPARNR1(T1N1IRDLLaETHLSPKDLL1PFFVKIICYBJIKEEIPSLPGVFRWS

LT7LLLKEIERLCTYGLMVNLFPIIPYGSYSSNPIOdILCIISIHEIKNAFPNLCLCPI1~0756 855889 ISDIALDPY177K;HDGIFLNGEVWDESVRIFCNI11TLHA~GADIVApSONI~mGRIGYIrpoD-RN71 Polymerise 8igmw66 RSKLDOSGYSKTSIMSYSVRYASCLYSPFRDALSSNVI'SCDKKQYQt~IPIQM.EALLfSSISYLPLTKLSSKARNPL
VLFWRIQ.FIQlf(SISQJ1TEYSSEEESOKKLEELVALiIKEpGFI

LDEIDGADIIlIVXPIYGLYLOVIYRIRONTCLPLdriYOV5GEY11NILSAfQOGwLDKETLFTYEFINEILPNSFGC
PEpIDOVLIFLT at'IIDIOVI14QIDVERQKEKKKF~1KELEGL.ARRTE

NESLIAIKRAGADHI
ISYSAPFILET.LIiOGFEFCTPDDPVtINYLKE~iTVPt.LTREEEVEISKRIEIGOVQIERI

ISCKFRFDKIISEKEVFDK1'HFLKLLPKLITLLKEEDTYLCJLLLa'LKOpDLSKOBRJfiG.

CPn_0745 941903 841742 NDSLEKCAIRTQAYLRCFHCR1WVTEDFCEWFKAYDSFLHLEQpINDLKVR718RNKFA71 No robust h~colog present in Genebenk/Et~LAKLMAKRKLYKREVAACRTLEEFKKOVRMLpRWNDKSOEAKItEMVESNLRLVISIAKKY
as of 1117198 VDSCFDI%4RJ1.SSLOGS i":
YtTfIIYDPKHTLaYGFCNOVSVIfItFHLKPPIISOEKFL72tROLSFLDLIO~'BIGTJIKJ1VEKFEYRRCYICPS
FYIITWwIROAVTRJ1IAD0ARTIRIPV

HNIETINKVLPGAKKLtMETCICEPTPEEL1EELGLTPORVREIYKIaQIipISLOAIVCEG

CPn_0746 ' 841979 813567 SFSSFGDFLEDTAVESpAFrITGYSNLIIDKt4KEVLK'TLTORERFVLIHRFCLLDGKPkTLE

~'632 hypochecacai protein EVCSAENVTRERIRQIEAKaLRKIWHPIRSKQLRAFLDLLEEEKZGTSKVKSLKSK

FSGRCPFSFEVFMLGKEe'EF':CKQKOCLSHFVTNLTSDVFALKNLPEWI(GALFSKYSRS

VL,:LRALGLKEFLSNEEDCDVCDFr~YDFETOVQK.IADFYQRVLDNFGDDSVCECDGAtILACPn_0757 MENVSILAAKVLEDARICGSPLEKSTRYVYFDpKVROEYLYYRDPILMTSAFKDMfLCtCfolK-Dihydroneopceran AldOlase DFLFDTYSALIPOVRJ1YFEKLYPKDSKTPASAYdTSLRNfVLDCIRGLLPAATLTNLGFFPCIKNIALVIAIERYOLI
IaKFRIIWLFIGCSVEERHFlIOPVLISVfFSYNEVPSACLSDK

~TIGRFwQNLIHKLOGHNL1ELRRTGDESLTELlIKVIPSFVSRAEPHHHHHQAMtOYRMLLSDACCYLEVTSLIEEIN
yTKPYJ1LIENLANELFDSLVISFGDKASKIOLEVEKERpPVP

KEOLKGLAEpATFSEE~1SSSPSVOLVYGDPDGIYINJUVGFLFPYSNRSLTOL:DYCK%NPNLLNPIKFTISKELGPS
PVLSA

HEDLVQILES3VSARENRRNKSPRGLECVEFCFDI:.ADFCAYRDLORHRTLTOERQLLST

IiNCYNFPVELLDTPMEKSYREAMERAMETYNEri'0FPEEAOYINPMAYNIRwFFHVNARCPn_0758 555101 95645a ALOWICELRSQPQCHOM'RTIATGLVREWKFNPKtELFFKFVDYSOIDLGRt.NOENRKEtolPfdhpS-Dihydropcaroace Synifuse PIT
RANSEPRFVCLSLC3NIfiNRFKNLOIARTLIGEOAVLGLRSSVILETE,IItLPGSPPaiD

LPYFNSVLVCETfL:LRELLVTIKOTENWCRAEESPPwSPRTIDVDILLYCDFSPCCDN

~Pn_074'i R17o49 841057 TEITIPLSNLLSRPFLIAGIASLCPYRRFCfOCSPYHNFTFGEIrLIHLPSPPCMIRRSLS

:T6J1 hypothetical procain PDMLNL:WNVTNDCMSOCGMFLDPEKAVAOAEKLFTECMVLDFGApATHPKVIOpFLSV

RTCMCCKCAEVOILSSRSLSCMKILSGSLFYKKFCDptDiERLEPVLRLLKETWSNRKOYPIISLDTFYPEIILR141D
IYPiQWINWSOCS08NA

EVARDCEL.iLVNIaIS.i.~nLPS'DPItNILSF3VPIGEOLLSWC:EKOUWFSDVGLNANDI/iFD

:Pn_p74R R44 Pa6 944121 PGIGtGKGMQSLATLYEIAKFKRLGCPILIGHSRKSFL3LFf;IJIIDPKDROWEIVCLSIL

tripA-.ar~nyt Transcranafsrase t.OpOCVDYLRVHNVAAHQK,1LSVAACFJICAPt r:TLfIJfALCI"f RP~ L ESA I EKALECFCP
ICNP IRSP'/EYALQL~CKRLRfCLVCNNAQCL

I:LNiIDVMOS.1LAVEFVIIT~'TL:aDDLPCNONDDERPr:RP7IMKAFDFATALLNran X1759 aSriA i4 :156~n :YALIPA

AY!:IILRLNAICKLKEQGCDFREIDIAYNIICDITDKIIICCGGtIi.CCOYDDMFfrNRGOEHVfolA-DihydrofrrLacr R,xlr,.;carh _yIMIKK'1t:SLFELW.'I::(:GILFf.~,;Dl'OFAPTIT..'.F.~.NNFr:LLFOIKDDF3DLQKD.SQOILLV
KPVIIPGNFF-NfU:VGI~KIIW:VPr;I'!M.'DI'ttf;JIr:LEGKLf'WIiYIEDLOFF:ETIOK

:INlALL.fC;EIG\ALCLI.aR~~MJI-.LELLDRLSA.i.:LYt7S.~.EFETIIC.~.IGFFPIVNGP.KTWETLFPKYFI'LIRA'/'.NF.~.IIRKRI~:VH
r:EIWVT::LfEFLU.JSIL
:PTFLIf7G

r:EL'I::LFLENOiVPGFFI::It!YYElA':VfFFt tt:LLCTWTKTVI.TtL~f~W ITTt.-YYEMIIIR

'Ir _117A" I~.r.IN Ntr.0(Ir. VM'Klli::L

rllmll NUf-.;Ir:lIM: lrrruly~rntu~rYLil;r Vc:YM-lfA::::IF::1'F:Dff:fl~t:II::KAlIY'fWOIt.hLtItIJI4LF.NiiVF::c:llIt7TVF_'x.YfLKNr :(r:47.:n ... . .,,')r.r.f (EKIEfAEI'AYVi::'r:AS'1\'r:ft'.IG:::V'rEVRllt,1'!Li!:NVI'Ivl::R('W(:HCPELKNSYLG
t.'Pr.ll hYfN'rnr,r.m.RVrna..y HIrrKAAllli\'ilr:G::Vt::::1:VNG:At:VR(.'ANFRLLY:PtII'Nft:'r::UK.~.KKIO?r:P.RKLG
AFRFK:PKLCLEIPKP::r~IVTHRIT'I"IKTIYfYf'YUDLI:aLG:::Lf'KLNRf!::IWI'f::KIV~

~ r :Kr:VA IrWNVtI f Hiv;~'I( I n:4T:AWELEKV::YLELtKylilLA'f V I-/F:Kf Lf'IITR I Rh.'UV I ; I'lL'fY.K4a:IL I f~.: A. a lut:::llVlY:YYVLY

INUFLL.':VNTII:fAIIJiNFYlii.f:IN.%:f I I::Ir.:lft'ffLHRt;TNf;G:I~'WtJr:l'!'i'I:INYH:Kf ~'Iw- /'.n n:.tr..lr.: H.tn.'In"
D'.'lriRALKM'Y::NLf.UI:I.:AAA'/t.t'W:1:;fU:trPlIAIIEFJIFKITFIC:::fTrL.~NIM:'fLA

f Af:IYCDL'fr:Pt.LO::MAWETI'AI't".:

1~~

NU1KRE9TLWVHE:LLPK: rLCKLPAPYP4,:IKCfAECL::F11E.:.:FPAIE1KAVA

3DR.TAKCIP1'AR
APLDIFPLKHLFPR.fl10D53HSKt7~IVLOWIR(,'IifATEC7TPL,'.C:rJI

~Pn p7wl ss57.tH 858775 , RTVAKYMOLKILtAIIKPKItDtI,:I~i~t~tPROA

:TnlO mn.'.'rnrattc.)1 procmn n . ~ . ~~ ~I

.Ilrf:ldfELLDKOIEDOHHLKHEFYORWS~'.lfLEl(COIQAYAKDYYUtIKAFPCYLSALH972100 d'016J

ARCDDLOIRROILt39LH0EFJwCHPNHIDLWRpFALSLGVSEEEiJINHCPSOA~ATFCPn-0772 RRI.CDNPULA:::LGALYTfEIOIPOVCVEKIRGLKEYFGItSAIIGYAYl7IIHOGDIKNASuvr0-ONA
Haficasa NLGLLIfI'CISELtJfa:RKAtffAPI?1PVLVLV;.1~K'iAVt : fRILHLIPJpCIAPREIIrI

''eEKDILpT;.~.~RL~1IPDAVL.QG,~,QE~fLDfLLdiFLaSFINSfEPCSCKVf!'ITpfAARELtfEP:'.'N
pr:Jl..'-'~NEFDVPHVCTFHSL..'VFTLRRSINLWR~'INPTIYDOS

..., '::Jf~..'- .- -.;-~.~KK:.
. "
~

. ,.. ~ :I:...':.\;.:'I".~'; ~:'c..; , TIIIA'::".~
: a":.:-:K~y~
v .r .
, ;... c ...... ,.., ;:r.. . .;:
NVFASA:DPCOSLY::wnrJAMLHtlttlfFGYDYIMAtCVL:LEFNYf4iYCNItl'IAANAI,IIWNA
:. t:r "
~.iw'~IIETKRSIYl4'ILPDRKK1LF~1AVAYIEKO~GS~(~((S
~ELLYFSRFSEKWJY

. fIIKUtDICIPYfCINSQS
ALGIHGVPKGRVIEIFCPESSGKTL'LATHIVAWWIO~fiVAAYSRLOfEIRSVKGPGEKIRLFI&SfDREEaIDtYAA
EILOGiRVv ATHEISTIK'IGALSLDL ' . GGLSFYKRKEIOOILIFIJtIFISKSDIVAF0R11MLPKRGIGS
LDAF3GLDPSYASLICVNIDDLIIISOPDCGEDALSIAELLaRSCIIVWIViDSVAALVPKRTFCDA4LRRRIPYE.a 3ELECDIGCVMVGWARI91SOALRKLTATLSRSOTCAVFINDIRERIGVS!'GNPIT~fCG't'fIFJILlCYAIAOGL
PILKACOOALDTKDVKLSKKOOEGi.OEYU~I.FPOImHIYtfILSLR
' RAUCFYSSIRLDIRRIGSIIICSDNSDIGNRIKVKVAIDOCLIWPFRIAEFDILFNOGISSAfNLFlfFLDDIXXu DPIG4VVRI1GYLEiLKEDAOTFKDRKSNLEELYHKALESECONPK
' CILDLAYEYNIIEKKGS1JFNYQEKKLGOCRE!'IREELKRNRIa.FEEI00tIYDVIAANKIJOSGfOCLEFR'rSF
VCLEEDLLPHANSIGGTYENIEEEIIIILCYV
SOODIiQLT'!1 CITRAODLLYLTAAQVRSLWCTVRlBIKPSRFLKEIPKDYNIQVR

TPSVNANlTPOEVPApIYEA

0763 860520 859972 CPeL0773 872185 871195 CPn _ unQ-Uracil DN11 ~lycosylasl yyfA-FOrsyleecrahydrofolace GyeloliQUe NFPKfDPKIEKSALRKLPISIRRDLSEERID1EASS11VASFVRSFSKtSWL.SPYSFI~BtCItlIONATIDDLWS51 CECLPLC1~IREpLKEEWSKPYMpQLLIFLKOEYKEHTVYPEC~xVFS

ONOFaNRILIOKGTLALPKIDQ~Ii.YPVLIPSIDDLISYVHPImPFSKOTPISSDEITI(VALRSZPFDQVRWILGDD
PYPGRGQAi~LSFSVPECpRLPPSLINIFRELKTDGGIiIVHIi ' :.VPGU1FDQOGYRLCYGHGPYDRWLAQHPYPSIRTIGIC1~QKIDRLPOESHDIPL$QIJ1AA
GCLOS1;IANQGILLLNIIIi.TVRJtGEPFSHAfiKGWELF?DAIV'ffa.IDFItTHIItViJ~li RKKCELLFNSKHOHAVLSSPHPSPLAAItRGFFGGSHFSKINYLilIlCI1'IKKPHIM~i.P

0761 A61819 860521 CPn_0771 871183 873125 .
CPn _ Cf606.1 hypochecual protein CT618 hypothetical Drotein ' GYKShmIKKLFCLFLCSSLIANSPiYGICICDYEIQ.TLTCINIIDRNCLSEIICSKE1U.KXITOOLPSAECMPSVAN
LFJ1DFLAAFaLL
LFJ1P101DCIHSVCFQKTPRLTAKSWSME<Qi.

YTKVDFLiIpQPYOKVHRNV1011tRGDNVSCLTAYR1NOQIKOYLECLI~BrBfAYGRYRt~IIiIMADIREIACCLE
OSLATLVPSE

CNIKIpAEVIGGIADLHPSAESGWLFDOTTFA'sILFJIAIVYE14GLLOGSSVYYIfIN
01~5IRYSED$EEWLIIAtEEY(( 0775 871010 873111 ~CF CPq tiY
' Q -fYTSSGKLI yODY family GNIWICECPYHKGVPQGKFL

FGRLL%AEYLDPOTNEIYATINEGNGIOAIYCKYAVILTRATYRGEPYCKYfRFaISCI'0 IVQTYNLtAGAKI1GEEFFFYP1'fiKPI~LII~WII~ILNDIVKIWYPCXaTLESQi~.VlllKER11BIIYIASSHC
YKIRE1'RTFLNRLGDPDIFSLSDFPDY1CLPQEOGDSITANitL'II~IIi KSGLLTZYYPflCQINATELYDNDLLIKGEYFNPCDRHPYSKI~CCIAVFFSSAG1'ITICAAMQ.GCWVIADOL>Q.R
WJ1LNGLPGPLSANFACVGJIYDImItRKXf.LD3J(SSLGRLVDRS

KIPYQDCKPLLN
AYFEGCYVLVSPNGEIFK1'YOICDGYISNOF3fGSSGP~CYDPIFVKYDYKpTFAFi.B~

. NDVSHRAKAWKLi1P14.OSLFGOILLTRD

CPt>_,0765 862415 861801 CPtL0776 871180 875187 CT617 hypothetical protein , TfIYIKLLGRLIItfi'tISILILSFLSLtSILPVLAITSNHVKISORWSDWSOILTLKVIRCT605 hypothetical prosein DHELDVI%fOIARISImRNNLSIF.~~LI3ASCKDLRPISRFRDtI.l~OfiliSNSLL.71QSI~VWERFIFVL1DIP
YDCLIJ~'FOFLSf7fBOfIFYSP1ZLSCIFPYVCCA0Nt1i0LDRIFS~EY1R

tAALEKSNHOLVWNCE0tfi01DFAFVIit.E0AT0~'fEDIESLFSLFNPt7IPVAPLVFFLCWCIOf~CIALI&HBA
AINSI~ODJ1LSVFYSRK(tDCfVEILCTLF31CYYCiITPfiTVWIDPS
' IQ11'KplTPl.GNEVWLTHAEAISRWI YM~IpIVKA80tY~LI
RYRE.RSySLYCVKEVPffEVAINCDVFVYDVpDIGVRSYSP
' ' YAPN11LNWIPIB(G
i~11'PGELALFFIOII
VLDRPNPIGGRIVOCPLPNPI'fSCSLIIPYCYC

~B~fl'P'DLIGLItrhPTSPOhPDPOSPFFriNITGILCAL81fJ1SIG1f6YTLPPKVIGAP111 CPn _ OGONADCl~l4DCIPNLFLPFFYEPFPCKYttI~~'1CSCVLLVLODPKIFYWETOCZI91C
CT616 hypothetical protein AMIFKLPVYNICLTKAFldJI'IKIAILQKTCIO.~ftIPDGRtI'tSLP)QiYFA71PT1'FVLKALYPKQVEOTLKS
IERIPARIISSICHGPGGDEFLSISHKERYIVIIPWtLCKESRES

SLpCSDILVKSSSSSL101R10iILKVALTHLF~ISLiILPWESLIVOPOIGKPIDRaITPLTLFHOLRS~LLSEY~

WIAqQI'fLKKELSFLS0110IFPDfQ.SCRAADIFFL7100SPLK5LPAYLLIYf7GSEEYCCI

Fv10W01IAVlIRSFStaISTIdfSCmIHATWYIQETPPOlYi.PAINVAOISPM.~(ILEOKCP1L0777 875586 LSLPLWCOS!!1'YGVEDEDWEIYGDfIAAAW~J15RRPLTfPYDATSVSPAA910Rit~tSQroEC~2-Mat aback Drocein-60 SLLIGKYALJ~171TVWSIGSVLKLKSLSSSASNHFAF71GPEP1CVL.PRSLKAALKTVKAIC%TS
TKAVfPAICPROYNWIKIO~iAPIVLT1~RI

IOiSABNYPLLPTIPTSEp'1'LKFLUILGIiSSPSIRFSYF8YIf11'SYPSI~I1PSLPYSALVEAKEIIf.ODAI' 6SLDVKGKFaLLRWE01GDCS1TALWIDtILITpGL10GI1tADt.DPQEI

VIOCOOQP~IPQFLKICISSNPIa.QIiVSFSLED0R5f~L0!'ftSSKAGILLSVDNYQOLOROAIQ.OS
ATVISD71D(JODIIPS

SIm9GISRTROI~IfKSGYLSDYFVTRPLTImVVWEEALVLIL4IIBl.VSLfE~.IRYL6 LIB'111lPLVIIAEDFD~'1VLATLIIHKLRNCLPVCAVKAPGSRE4110VVL.BOLAIL'1G
CPrr ' ATLICQEBB~ICEIPVSLDVLCRVIWVMITtILTPTFLECGCDAEIIQAR~.CIJ1IARST
CTblS hypothetical Drotein NIfC.SYLi.RTAINVYSFLIL71YIFASWVPDCOSARWYQLVSXCVDPFIIiFFRRE1IPRIGFSESiCOELLGILAI
FIGSIPOVDITADlOTEpROIQFOLPSALMTKA71l8mCIVi'~OV

IDPSPFVGLLCLGILPFVILRVLRFIILiIIFNSPWLLQYLAF4RAANItIEVPANiSSCrtITPGFEPLL011VRTPL
KVLaQNCGRSSEEVIHTILSHC7PRF

OYIKiIn'DZ'FEDLVDAGICDPLIV1TSSLKCAVSVSCLLLTSSFFISSRTKT

CPn_0768 861114 865161 CPtL

yohI/nir3-predicted oxidoreduetase, YFSFSHAAPIFI10fILLRSSIVYAPLJ1GFSDYPYRCHSALYOPGI~1FCFJMCVECILYAPtsa/ahpC-Thio-specific Mcioxidanc fTSA1 Peroxidase ERTSICLLDYI~f7~PIGAQLCGSNPETSCFJ1AXILEGiGFDLIMti00CPlWCITKDCSGAPVApSORVPGYEPGCO
RFESSLVRfB~IXRVEEEVPNILSLVGKFJ1POPVAOIUMrGCICT

SGLLKTPEi.IGRILDKIINSVSIPV1'VKIRStZiOHEHItAtE~'VRIIRDAGASAVFVFICRYSLImYLCKYVVLF
f~fPKDFTYVCPTFLTIAPODAIL'aEFH'fRGAEYIGCSV~IJITIIOOWL

TRAQGYHGPSKOEYISRANAAACKEFPVfaiGDIFSPEAAOAIQ.TTGCOCVLVMGTIGJ1ATIO~IECITYPLLSD~1 CVISRSYHVLKPEEELSFRGVFLIDKDCIIRHLtMrDLP

PWICKOIDDYL'1"fGSYEKIPFIKRKAAFLENtQtLVEDYYOSCfIfFLSSfRKL.OGNYLISALGRSIEEa.RTLDA
LIFFEfNCLVCPANWIIEGLRANAPNEECLQ~P4TID

AKVRFLRSSLAKATSYpEYYOLVNDYEFJ1DDSSLEIF~tKG
CPn_0779 8'!8502 878095 =Pn_0769 867763 865121 . CT602 hypothetical protein _opA,DNA Topoisomerase I-Fused RFDLIPOIOCPNALFGEiEKGSYDTAYFCRSLVDLHNYLCDVSSPCI'IL71IKTLLSDYNV
to S1'II Domain SIOGPIfJIIRIJOfKSLIIVESPAKIKTWKLIGSEFVFASSICHIVDLPAKEFGIDVDHDFVYIRVREDGYCVDSYFF
GLHF'LNiQZ'rLKNIIAICLPCVGtIpHIIFJ1SRSLCOKWrSLLL

EPQYQVLPDKpEVINHIRKLAAKCEKVYLSPDPpREGGIAWHIANOLPDSPLIORVSFNFFDlIDLYDLLTFNOPF

AITIWA'JTEALfHPRTIDMALVNAOQMRLLDRIVCYKISPILSRKLODRSGISAGRVOS

'JaLXLWDREKAIDAFVPVEYWNLRVI?IQDPK'l'!K'I~IAHLYAVOGKkWaCEIPECKTENCPfL0780 DVLLINSEEIWtHYAELLEKSSY1'ITRVEAKAKRRFAPPPFITSTLpOGSRtIFRFSWpap0/ami8-N-ACecYlmuramoyl-L-Ala SR Ilmidase TN3IAQTLYECVDLDSEDS'hCLITYMRTDSVRVDPEALTTVREYIOCTFGKEYLPEKANIIHGNKIAVOSLRFMiAKL
SFFILLSLLFSGIDCSP.LtIAAGRSPSLOCYtaEIEDISAKUS

YTTKIOffpDJWEAIRPTDINLTPDKLKNKISDOQFKVYNLIWKRFVASOITPAIYD1'LAVHEVtIVHLSERLDEODS
KCOKWTAAKPEfIJvOKIRELESGOKAWKTLJ1VI9TSVKDtpi OITTCYEIDLRASCSLLKF1(GFLAWEEKODDF3'IDpEEDIiPLPPGHApWILIKEb11S0E0NWSKWEIOKDHRALW
OLRLVRRSLLJ1LVDS=SPGAYADFSDPVPD7IYIVRGGDSLS

:,FTI!PLPRFTEASLVKELEKSCICRPSTYATIMJKIOSREYITKt?JORLRP'l'ELGKII50KIAKKYKLSV1'EL
KKINKLDSDAIYAGpRLCWPNKQ

r LETNFPR INDIGFTAIIIEDELELIADNKKPWKLLLOEF1VL'fFLPWITAEKFJ1VI
PRI L

TNIECSKCHKCKLVKIWSKNSYFYGCSEYPECDYRTSEEELAFNKEDYAED'fPWDSPCPLCPn_0781 879851 x79199 '.,t:VMCVRtICRYGTFLGCEKYPECRCTISINKKGEEIEpEEPIPCPAIGCNCKIFKKRSApat-Pepcidoplycan-Associated Lipoprotein YNY.IFYSCSE1PECSVIGNSIDAVITKYSGTtXIPYKKKTPrIO(KSSAK1TKMRTPSKKQNCYRSRRKTVPLLG:FP
SATDIfFlIT!?IIHSLWY3.C'fLLALLALPACBLSPNYOWEDSCN

r,Y.AKSSVKKSSEKKTGPLFLPSPDLJ1KMICNEPVSRGPJ1TKKIwDYLKEHOLQAPTI4KKtTCHirtRRKKPSSF
CFVPLYTEEDfNPNITFGEYDSKEE!!QYKSSOVAAFRNITFATDSYT

LYFOM~tLAT I ICPNPIMFOL,iKHLSOHLTIfVSNDFSSASSI KCEBNIJ'I
LTNLVHYNKIWPKATLYIECHTDEW:.AAS'ItILALOARRANAI

W RLo~TISYCKEHFWSCNNEWW00lIRRTEFY
IHAR
.

;Fn 077p 868722 ar;9lll T.42 hypothetical protein l;Pr:07R2 PPL077 87977?

KFRTRtIVEKLEFVTCL.~.SPDDDLITFNKOGLL.k7PEEEKVAFLVRSN1WLD:CPETPASFmlb~f~'tty;:.tc clt.trida transporter rF..:IJiEUFDIFPEYVEVLY;:NECLDVWFrICC."111ILt1ttElffIOLRKHHRKASRWiL:HYSRDr:l)tr:
MLROI.ef'VVFFFSFA.:LWAEELeIWR~EItITLFIEV::c'.OTDTKDI'KfOKYL::.~.L

t:/trrVtFJIVIIAVRHKFtfEPVFE~r'VWYUTSRWf:yiRRFFr:PLFR~Pt:ESYLLLFFTILCLGITRIfY:Y.
DIAfa:Df'.lw'I'rAA::KC:aSSFLAISLRLNVPOL3'P/LWa;:KTPU'fLC::aTI:~II

::1.'rfIPA(:ILINLVLIfIYFIAARLCMAOSYL'fRAHKKIFYl'IfGVPPLWVLLRLTDKEIKNFAL::VDIrJY
fIIIIMPf1911'AL'llf:fPCL~.N:KIVFALSSLCYI~KLKUr7RW1'fM'b:KNI.AP

vt:l f PVLEIfIMKRKLF.NVHWKp IYU.~.YI:
fT(Y'.':I.:ITI'KWlt,1\:::NPI'YLWC'lY.'If~/PY
FV IFLGFLFJffEf:ICKVLPIJU ~IJIITP;

1'I<KYLLAI'VAttl-h;Nt l'1.l'Ivsl'I:a.T.'7.PMl:RFPRLLNEtI~,fIP::FNI'fl::IJI.VFf.~, Yr~ ~I'171 ~~ItsSll u.n11.1 NKW
:NtbLY111::1.I'I't:11.\I'IiI.LTKKYFN::;~FAW~PDr:YF.IAPt:::VIKtiVH~strf'IDI::

rt.'.tl IetIA fatlym.r.r::.
::i,lm.t'id::I:l:lilrJl:rf::ITNKF:a'::WAfIY:AIIf.VF::J4:NAE~iELYLIiLVTKKTNK
IAIs:Vra:KItF

II'Yr:H;:YPI.%U::::ALUNF1'1.tKrJKL:LKYLPC:LIiIIQ(/:WNW.~.PLTEL:iSYY/OEILONPP:lf :At'fa?PIKIYrI.

of 41 :;:Lt:EI:I1Y::(r -IFi"rN::TF.~,YLtIpTPt:Is~'~:L%1'P.LLPOI
EFJ1F:.'fAEERFIIWO i v:Itl.::Ut)a.nl.l'NIt:DFll~'FLELPLEKIIIKVWnTtrJtIL::PEfaII:PSWSYY1MKLLRNSSs'Or~
rr/HI IIHIItNII :IHllon sN~,A%::1'/ffY.'Yt'LlrlTk.'EFAPINKKFwL;:L::ELRtIILKKAfI:.~.IPWf:PAAACfIIKPMVScT
'.'sss I,yl,.rtu'r s,~.nl 1'snr..rtm 't'I~LIIU'If.F'I::.'7::.WYf111:.'iRt:1.t':iIKLtIY.FfFfIF%FJILPKEEUKNL:i00If~aAK
WLIKINNK%LI'YfAI'rN'IIL::ILLLVI~A::fLi'KKHViPKAF4EYLVTIUf'KPIYITI'::VWIA' AKTtRP.':V.1'~QPQKQAKCCPPOt?tVQKALGKPTPI.fNEPNEILaILFW.'tL.'vLL'~.ll;.
..1I':DYII:x:EKI..iYc:..:::..::::.:.'.1:.:..:FCfP:'fPKti :."EPPKPSPAPTVAKK'1'fATEKP Wf'/:Ot'w:""'_.,.....~.F..........lA_.
pp':LFIr1YJ16SD
L~IFN' R
"

PP.~.ITKKNTO4iK1'QLQTL.iEVAGAL.~sLtIVDKTERSETSLKNT~IP3TAQLTNHSCLKJ1T.
..QHIJIR
O
TF:
.::
~L:
a~.lK S
K'4'IFlxIFJ\WAWRIIIY\'EI1~:1~
D"' F
~

'3EDEIl:ELFRT11LALPSKriYVRiKLVL,iPNGEfOECSFIw~EVSAADKOLLTORIOALPPO.
~ , ' .
r .
.
V:iLOKISKDTA
ALPLEIfIQALOP~ItLt'~ :'T.EDIfKYPSCLF:EE:.:.KCFL:IFttpC
'lfl~1 L ETENSADCfLTIL:aF~
.N.:
KFLEKYKV.~,KNt.:FHIKL

CPn 0791 x92359 991972 ~ or.;lr)9 9.r4~lv U .7 ~.Pn A%bD-BtOpOlymer Transport PrOCein _ ORAD.~.Tr""'!'!N'~tl''QYY.tI4KYPrI'EETCEPVMLTPL:Dt'/FVTLJMFTVAVPLIKrbsn-slom\
rrrriac~ry f.rmilv nr~rwm-PF~C
cnospnacase IRSEi~t -~

: : : . .... o ...,:.:IY.1'i ~ : :.. _ .. ~:A...\i~.. . .... .. i~:.Ial n.
:.. ... ~ rr.
'::Ii1:" . .. . . . , ...,.....
. , .'S'~VYSI''b:A

: i~: :i-! I::W rJ:.:.:::
NTLTOIVPLNVDVL:LFSLVL:ILDA.i:Ft:fPNL':.L...'VE11L~KVFx'.:YNELiLIKVFPNGD
..~.yy :,::_:::i: ..
-::'IY:IS

KIWASSIPENLGtJf(NHKIDIPILYfPFLAAi.KOSP101pEVfSJIpIINVFpAKCpELOGI

CPn_0795 9970 LYTfFSAGLLJCCtl.It>IOOSYLTVKTAILSKYGV:LItASDPAiJILNTVYPDIIt'RIfIIIC~QV

exbB/col0-polysaccharide transporterFU4~PCPIDSELGPLT.SPLDIGFNFYSFKIKDTEIWGCIETNPSIDIAVLSYAIGIEES
ONLYFETLS4NKDt'YSMMFSNNPIIQAY'fFJIDFFGKSIFf''LLILSVrISIM.HOItiJII

OKNFLKAGItSLKOFLIKNRNAPLSLDIHPELSPFJ1DLYF'fIKRCCLELLDtOIROSAPDRGfAPL.WRMRM'FAYf PCILLGSLIAFIVARRLSLPIRIC.ATANIESRIOdOJCLYTDDiLG
' PILSSEDIQSLETLLCAINP1LYKALLH10ISFIPATTISLAPFLGLLGTVWGILVAII'ttISLPSYPNIE
FEIIGII~IfItNAIIIIEt1L11L.AKTNFP.IQfIlG\OF4l:.Hi:.EpAQpRLLPN1 SCSS~tSAINEGWTALCTTIICLFVAIPSLIAPNYt.ItANSSELI5EIE0'1'AYLLIaISIEWIAYIPAITVfxDPF
fHFVVCECSXARLFLIVADJISGKGVNACGYSLfLIQIIZ.RlfLSR
"
' ' I
MYYSCNPPACYLDPDCETS
SA
SSSi.Q0AI0L"1'SRLfYFPttKNSCMFVTL.~IYCYN~

WLINpGMALGFLPEVJWITSKLFNPKPCSLPYLYSDGITE~111t~P7~I0lffCCERI4AAI0G

CPn_0786 881137 995293 LTGKSAAOAVNRIJG.SHCI'FtICNStIpIiDDITLLILKVLES

dsbD/xprA-Thio:disultide tneerchaelQe Protein CPrL0791 197123 891001 IPG
' L No robust holuolo0 Dresenc in Genebank/ET~L
fOOVHIIPGAEGLSESSY as of 11'/98 NHGVILNKFRTYLOTALIAPFFSFPALSCSFSSIpAeEI
' ' pKVfEEEGTTFF
KSSKNRSFLLKKSOQiQV5LY0lfWWFISOLKKSLCYSTVAdL:FNIPSOESFADSLIDLNL

QTPRIGIK:TASKGSHI'lWlOIPGEIGSPLKISWOLPIE
EWtJICGDSCLPGNVDLKLTLPY~!(iPSLY
.

CY1;9SALIVAINMPEGYTPGQEVELRAOV
GLDPSVECLSGDGAfSVGYFTtUGSTPVEIfpPFKYDV5K1IT!TT'..SVCTANOSGYAYGIS
PtriIIAEFTKTLHJIQt~ftVLFIJDHSVQVAOGKCNEIILNISKItINIITNAWE1ISEKAt>IQ.FAY' AE'tSYSGCTCCAWRLKViQitSGV01Q4EKLHCILLLrIDIIIGRPVESLTINSSAVIbVIOCFSYDA
YDCI'I11IC1'CSLJG71G1CYNCAKiI$ADCTLTPLTGITC.3FStfCFaRAISKC
' AGLSQYITILIMAFIGLIILtI4IMPCVLPLVTLKVYGLIKSAGENRSSVIANGWFTLL1IVAVkWVN
SCpPKAVOffASGAT'fYCOLADISGGSRSSYAYAISDDGT::VCSNESTITR
' ' CCPYIGt.7~GVAFIt.KVLCtOJIGWGFOL01ATLIIVfFLFALSStGLFBdG'tDffANLG.IYIVCAANFATVTNC
NpESNAtMYKDNOIIfD
~' GLYISGt%
NVPTYLCfLDI

GKIQSSF20CSSNNKAVGAFtIJGILATLYtTPC'fCPFLGSVLGLVNSLSfIAOLLIFTAIG

L~L7LSpYLVFSVfPKMLS1ILPKPOGWFISTfKOLTCfIQ.LVTV1WLVWIFGSETS'iTSWVCP1L0795 LL.OGWL1.:LGAWILGRWGTWSPK1LORVCASLLTFAFLOGAISItSGt~SNYFABPOQTVNo robust Maaolop present in Genebank/GwBL
as of 11/7198 SVNEDSLWpPFSLEKLAOLRAQGR15VF1MFTAKWCLTCpITBCPVLYCOAVOIC~?LTfIGIVGTLOGANSSA1GVSS
DCSVIVCpAQTADKSVHAFpYYNGEtIKDLCTLGGTSSTA1LTVSPD

TLEAWfRKDPGITEEIJ1RLCAASVPSYVYYPGDN&APVVLPEKITOM.LEDWSRFVRGKVLORSOIADGSWFIAP14C
NTDFSSNNVLFpLil~il'YKTInENGRQWSIFNLONBdOR

ASDrt!"lTftIAi.Gt~GLYVMILONLPStI~AQYfGIAYKIRPKYRLGVfLDF81F8Sil CPrL0717 185604 186101 WlrIINVSHIIRWIGAFII~IpDSDAt.G55VKVSfGYCKOKATITRDpL.C~fFIJIL''SGaNf ' yabD/yctH-PHP supertamilY lurease/pyrimidlnaselCVNfL
tydrolue ECVA7l0I>I~RYCKSLGdMtWPFLGLOFVNITRKEYTENAVOPPVNYDPIDySI

TRROPVDIrIDJUItMLSDDAFEEDINSVLOMODSCVSLWiMTfEKETI4RSFAYJIEitFPGICSNI11LVDSGiVCI
'NI~ONFAANTDRFSGSIASIGNRVFENLDYCIIfRAFA~tIM
' KIRFCNVCGTPPQDVDpDIEF~YRtIPNAAANSIQfLAAIGbIIGLDYGFATE>03IARI~YLSSDLRYIILGF
YELPYLQSLNLILRVNOQPI4CV!!G!

QAYLiILSLECFS.PLWHCRGJvF4iDFFRM.DOYY1Q7DPRSRPGMJICfIG?L.EAOELISR

(;<JPISISvIVFFKNAODLRDLWELPLflILLI>:fDJIPILAPVPYAG~04EPA1Nl~TNA
CPt1_0796 VuqV~~~y~,~G NO robust haabloq present in Cenebenk/<i~L
as of 11/7/98 SELYSSYLOP~IVPNSIILPLPCLSRSETFKXVRS108(TlBM.TPIfIYRRDWYfAF

CI~0781 186521 887132 LLTAIPGSFJIfnT.VDIJ1GEPRHAA0A1GVSGOGKIVICNIfVPODPFAI1VOFQ7CItlONLQ

sdhC-Sucranace Dehydrogenase PLL1VRPQCSVYPNDITPOG'CVIVCZt~IIfAIGIICSVAVKWHJCKVSELpM.IDTLDdVJISA

SLVKSLRNSRIiEICPEVSHK1IGKYYSTFIFRCIHSLAGIAtTFFI~DiLF1?A4Jt.RSYFSVBAOORVIIIt>LGi ISVAVK4f~OVITOLPSLPDAlIUICVIICISSOCSIIV011RIDV

QGKCFVANVNGTNKIPGLKIIEVAGLVLPFLCHJ1IIGIVYLFOGKSNCIfSGDGSRPNLRYSWRNfAVQWICDQLSVI
GThOGTI'SVASAISTOCCVIVGGS817ADSOTRAYAYIQ~MSD

1U0~1YSYTS~IpRW'tAWILLFGIAFtIVVfILRFIRYPViIVDIHC'1TYYAVDIOPSRYDVIVIIG'1'IGTIACI
YSWtAVSSDGSVTVCVBTNSENRYMAFQYAOGONVDtI~TIGGPE5IlAQf~YSG

IIGFLTLNLPNI'~ISSItYSRHDLGGADAALLSFJINSYLLTPSADTAFLYWRt111LGSLFIDGkVIVGPAQNPSOW
ILAFLCPP~SPAPVNOGSTWI'SONPRCINDINAt'YB~.II~

ALLYTILVIAAAFt~FNGLWI'PCCR<AGVWSLRMGGVi.RIVCYL71NIWTFIK:VSAVWiLOOL4RLLION51UNES
VSSGAPSFTSYIOGAISROSPAVpFIDVpKGTILSYRBOSIIOMION

YSVA
COLLTCAPMDWKiASAPRCGfKVALNYGSOMLVERAALPYTEpOLG8SVL80lODQOOG

RYDFMGETVVLQPFIIQIOWtLSREGYS610i11AFPVSYDSVAY8AAT8lIIfiJUIVrJIfLf CPtL0789 187136 889316 P101SfAATINERDL1~ISNI PFASLAIIYYWRQ00LV

sdtlA-Suceinau DehydroQenase TL.h'TNle'IpQPLTCfLSLVSp88YNLSf 0t40ltJiRKVIVVC~CIJIGiSAAI~LANLGI
IVELVSL'1'JCVIfRSNSVCAOGCINAALtfLKPE

EDSPYVNAYDTIKGCDFL7IDOppVLt7~lCLAAPRIIKI4<,tItPOCpPFIPBpSGNLDNRRFGCPtLC797 GTLYHRIyFCGAS.I'~Oil'IYTLDEQVRRRpUIGRVIKR>SMtEFVRLVT~GRACGIIhNNo robust homoloQ presafc in Genebenk/Et~t.
as of 11/7 H1 NLFNNRLEILRGD11VIIATCGPGVIlRtS'fFISTFG1GAJINGRLFLOCKAYANPEFIOIHPVLIL1WINVGTKIG' LNNSKKIKVf.GHLTi.CTLFWCVLCAAALSNIGYASTS0E8lrORSI

TAiPGRDIQ.RLISESVRGEGGRVtafPGDSSKRIVIPDGSDtPCGETGAPWYfLCMYPAYVSIaiGSRIVGASGaGaC
SbTAVIWC5NL11W(.G'1'h0 CNLVSRWGAW1ILRVCEAGLCIDGAl4P~lYLDVTHLPERTRHKLEVVLDIYIGCFIGEDPNGGSSA~ISKDGEYW~IS
DTREfiY'1'WIFVfDCROIOfDLCILGJ1TYSVARDVl3t~II

TVRBtIFPAVHYSMfiGAWVDWPAADOPDRDSRFROFltNIPGCFNtxESDFOYHCxNRLCAVCVS11TIUIG)~lllt ~OVIGVIIWEKCKIKQGKLLPQCLWSPJ1NAISEt>CI11ITIT'~10EI818lItI

NSLtSCLFAGLVSGDE71SRFIEAFGASOATSSDFORAt.00EKEEN71RLLSASG1I~1IFVLVAVKNNKNAVYSLC' 1'LOGSVASAFaISANGINIVtiGiSTINNOETNAF181KDE11lfDfI7lL

NEEIAKINVRfIV'MCRDBfRDLQE'll~KLKEFRfRLfffVSVLDSSPfANKBfItFVRONGPftOCGPSYATGVSAO
GPAIVGPSAVK1GEIHAFYYAEGET1EDLTTLGCEFJIRVFDISE~ID

t:EL.AL7IITKCALLRLiEl7~SHYKPEFPERDDEIiWLILTTVAVYAPEEPEISYLPVDTRHVAIIGSIK1'DI~GJ

PTLRDYTKSSTCKI1:LTNIPDNIRLPI
CPn_0791 902810 907856 =t'n_0790 119279 990103 No robust homoloq present in Genebank/P~OL
as of 11/7/98 sdh8-Sueel.nace Dehydroqenase WFEIIFWRVPMtNTCCONYRSiCWFSWLFVLT'i'Q'fLFACHFIDICTSGLYSWAPGv :1SRIPLIISVYPYRKAFItItZ7LETFILKIYRGVPGKOYWESFELPLHPGENVISAUdEIESGDGAVWCYE~NAfKY

KRPVNILGEINNPWWEpCCLEEVCGSCSILVNGVPROACTALIOEYIDATOSREIViJIPKWVNGALVDLGIFSOGIIp SFAEGVSSDGKTIVCCLYSDDTE'fNFAVIMDETGFNVLPNLP

:.TKFPLIADLIVDASIMFI>1JLERIQGWVAADIt7CETFGPQYf0E00ELLYALSOCMTCCCEDRNSCAWDASEDGS
VIVGD71MCSEEIAKJ1VYWKDCDDIit.LSNIPGAKRSSAW1V8KDGS

.1'FJICPQIDl7KSDFICPa.1i50ARYFNTYPGDKRSRKRWRAfJIGItGGIEGCC0Al04L11RVFIVGEFISEDJ

CPKKLPLTESISAVGREISKFSLRSLPSJ1LPKKKXicYVDGRNIDLCI'LCGSASPAFGVSDDGKTIVCKFETELGEC
HAFIYLDD

CPn_0791 893101 890111 CPn_0799 905001 903910 CT590 hypothetical protein No robust htlmoloq Dresenc In Cenebank/t7i8L
as of 11/7/90 T_LRSSRKIWEDISDRNNYSCYSKGISHNYLLHPFISRLDIFVFDSLIrINQt7pNLLEEIFNREWIMIKOILRSMLSO
SSLWMVLFSLYSL~Y~VITDKPEDOFNSSSAVIMD181CK

SV."tCrITTVCFIKD7WSPTYAVRWNYWCfKELPfSSWVKKSKATG

iGPHRHNEM~REEIIt.L101LKAf.Kf11PKLILESIRTLFVPSYSIIQFILIRHTL~11LFIPQTILSISSDG.iII
AGIVt3iELSOSfAV'IWIQMllIYLLP""fwAVrSKAS'GISSDCSVIVOS11KDA11 TIHVRQAALTALFTYLRQIrIGSCFATAPAILIHOEYPERFLKDLNDLISSGKLSRIVNORSRTFAVKWTGHEAQVLWG
WAVKSVANSVSANGSIIVCSVODA~LLYAVKWEON1'I1'HL

'cIAVPINL :GC IGELf KPLRILDLYPDPLVKLSSSH:LIIKAFSAANLIETLGDSFJIpI00.1'LIX'..IS.I IAKAVSNNGKV
IVGRSETYYGEVHAf~'HKN
0'MSDUG'tLCGSYSAAKGVSAT

LLSHOYI1AOKIQNVHETLTaNDIINSTLLHYY0L0ES~IPPKECLFSKEQVAFSTQH.KVtV~iSTTJINGKLH1FKY
1.:CGRNIOIJ;EYSWKEACAtIAVSIDGEIISnGVOSE

PAELSEIQRVYNYLHAYEEAKSAFIHDTQNPLLKAWEYTLATLADASOPTISNltIRLAtG

WKwEDPHSLVSLVMIFVEEEVENIRILYOQCEpTYNEJINSpLLYIECRNANPLNNpDSpICPn 49(10 90ti550 LTNGMRfR0El3JKALYEWDSAQENAKKFLHLPEFLLSFYTKOIPLYFRS8YD11FI0EFAeno-Enol.aae NL'lANAPACFRILFftICRTHPFrtWSPIYSINEFIRFLSEFfTSTESELLCKHAVINLEKERKEIKINFEAVIADIC
AAEILOSRGYPTLIfiIK'rtT~TG.~'/:EARVPSf:ASTCKKEALCPR

~.~ALVNHITAHLHTDVFQEAL.L.TRILFJtYOLPVPPSIIliHLOpL;~011'PWVYVSCGTVDTLLTL'tIvPRYQ
CKGVLQ.1VKNVILEILFFLVKGCSV'lE9SLIL::(J~iNDSDIL:FNKITLGANAIL

:.:.G'IFE:CEPLTLTEKHPENPHEWAFYA0J1LKDLPTGIKSYLEECSHSt.LSSSPTHVFStV:a.ATNIAAAATL
RRPLYRYLOCCFACSLPCfMNLItrYxlilAOFIriLEFpEFfIIRPICA

I:AGSPLFRFJ(WDNDWY~'YTWLRDVWVKQNODPLQC/1'ILPpLSIYAFIENFCNKYALOHVSSIKEAVNHtv\011 FHTtKKLLHERCLw~~lr~Vf7G~Y:FAFt1(J1SNEFJ1LELLLLAIPIUIGff 'IIIDFHDFr:CDHSLTLPELYDKCSRFLSSLFTKDK'IVALIYTRRLLYIlNREVPYVSEOOLFt:KUI::LILDt:A
A:::'FYWKTCT'IIY:RIIYEL7GIAIL.~.NL':DRYFIU::IELI:WUDYDGW

iE'/LONV.~..~YLI:I:i::RLTYEKFRSLIEETIPKITI'LL:>:ML*HIYKi:LU1()SYOKIYTEEALLTE/Ia :EKI/Qt6'~:I'f~LFYfNI'ELILECLS1Y:IJW::VL:Y.1'N~It;rI.TFTWAtKLAQN

U"'ILRLTTAMAlIIINIr\YFaFLLFAD1:NWPSI'/FCFILtIfC'FfEtl>l.WKFNYACLOtiQPLOA<:Y'ITI
t'IIR:x:f1TI"tTlADirIVAFFU(CQIY.T:.:L::I'::F.f?V\NYtIHI~IEIEEELG.4FAI

:1I9ELFA'T::HiwrfLYANF IDYI:NPPPP(:YR::RLPKEfFfTU::LfJt~.:YI:D::1:f:

':In 117n: w~4:r55 N:rIIUH 'a'n_rxnl rUN'fU'1 nUf.7i7 Wt'.stn M/Iinhrr.u:.~l pc~cein avrH r:xm.1..n::' Alr' ::ufslnrl It ''rHHt.ItlIYtaRtHKIrfFTKRVLFFFFLVfPIPLLLILlAM:FF::P::AANANLWVLtITRAIIF'Mfft~:Ll 1\I1':llvta>L,t'FU\/APL::AiNRrYiM~QV'.:/a-nly:Y'rFTINNVAFMII.

7YIL:'.IF.FF:YY.L'fIIIKLFLDRIJINC(./1LKSYA'.'sP:iAEF'fAQAYNE?MA(.::NTDF~LCLLDPI
'fl.Vl.AIiNKTL.AAA'L\'~'h:F'IiF:YFfiRIAVEYFL':Y'fDY%yifillY
f AI!::IIr1'tI:K::IJ.INDf.

F'f/:::VRTYtIIV:Ut'FIRYLNOtIPE711tKNL.HMV(:KAFt.LTII~:KPL1.111'LILVELWAa~WDSILN
tI*L::ACH::II.t'Jifil~fl.1\::::V.~.t:I'ftal:::4~JYT:.'NALVI.fU':YI"tl'ItNII:FnW
LVK

'ITf:X:LL'/::1'YPll::1I.QKDLFOSt.IItTKCNLCLVNCY'a7VLFt:IIQU::E:::FVFSLDLPNLNIIY
yA::PItVN::APRtaata'llrlF'1A'If::HIl.I*LEFIlIUfL'f al:'C:PIh:M111*E::VP

inFqAR.:P::AI EIEKA:a.Ita:GFIJLITVS::A77.Yrr:::llW 1 t't:\ t HF>La\
INKKRYLJ:LVIrIY. I t'!\t:rYTt.::LVPVSDLI fe'r ItJFI:L1:F.NNAFfDDRt I EKOH
I 1 Flllrl'Mlfrl F7iIKE V:

J::ALKVPtIIICFFWI.AFLiMWWIF:iKINTKLNKPVit:LTF'~.'MIlldWRn:NIINVRFEWjPYPY'Kr:llv 'fY::HIIF'n:Alw:.\I1'n:1.td:fl'I'It.OFt.I.IIDE=:INJfLI~lIIAfIYHt:IIJ::RKQ::1.

ia9 yE'C:FRLf".:AFOIVRPLTYCFJWKYFRIrJtYVwAT.m.,tLDNA Mtsawetn Hey .'EVOESSCitt'Jr"ptIRPTGIPDP TRAP LQLL,DFL~. IHQ 2.11CE l: EV.
~ ::1.'If EL I Et'w':.DACADE:
' LCWtIf~ILTXAPM
. i:

NPE IRFAT: ;tJVDDLwEEIRLRL:iOKHEK .
I LV I :ITKRLAEOMAGFL iELEI PMYLNSG, I .
'JRfNq'~IC~'I~EF;tCI~
SI
EIETL3GOOGALi'IRONCr~9FRAEJai;l~
~~' ETAERTCtL:DLR.7aftOVLICVNLLRECLDLPE1ISLVAILOADItEGFLRSTSSLIOFCG, aAApHtlr:KVIFIADOKTRSLEElLRETERRROLCLDYNKEHNIwPKPTIKALFANPILOr " ~
A:,::.IOtEIQSSIECDOCVR'lYtflOW'IVxEPCAtIpLGrtYtVIISL
P~pR&t :ORFLiKEDLEEOIKKYEAL1~1QPJIAItEFRFNfJIAKYRDAM~CKEOLLYL0.?DRLGIRKLIEHRILST1WIGWS
~1ISECHHEIQIAK06~:P'OERVAYVMCONfMQOALTI
' ' 'SKDSESPKE ~

. .DAYAL:.LPLNRY/VF
. .LF: :KKLT
OKL1NGYRIVCYL:.~PSFtIRPTROCOKIFaIDRPIE.

F
VLK;,YLPSS>JCDFMMPOKI6ARICKEEL4t'.DCiXEAIVE'.'LrICPPf::'..:R'"HOEIEESO

~n~751 90R7n~
::VPLPMFRMLE'.'r0'I:EEESVEFOCNLFAY35EOV~::LEKCEYT.~.R~PKSCNDYIIYSSWR
~P
ORO

n_ . r'.":4\:'..,:. ..,.., ...,~...
: .

. . :: r:r ~ : .. . y..r ;,..r .
r-,,.. . .
' . _.~ ... ..
_. ,_. ... .;,It : .. E,r-J .:v:i. ~:.'.~: 'Y:i "/:'.... ; :
:TF

,. N~ICECLTCATF3KH0f1~'FDVSWLK:.i%d-%":KPkKIa'UirlitlRHLLLIlSGfMF~i ....
.
Y :, iAKEEVLt>uDMIIYEVLADW4iJGIDPIKSIZYLOSAIPEIYELHLLF.itfLLSINRVNGI

PSI~tDMARNASIEEGSLSYGLICYPILOBADILLAXAQFVP1ICKDfIIJdtIr~.TRDIAPNF
R CPIL0813 920813 9:193) ' DPN P
NRLYGOVFPEPEVL.OCELTSL'dGI000G10tSKSAtNAIYLSOSDATITkVRX11Y1 IRATTPGRVEGNPLFIYHOIFNPHKDIVEEFK)1RYROCCIXDIEVRARLAEELIHFLIPIPepP'Anr>.nopepc>.
dase ' KERRSEFLSKPLALQNVLCOGTHIOUIEVAKS~IEEVNL*ICFSHXWRSLLKEfL.iEOLAYFLHWJ1IJ1CILLIOGO
EYIIF
TLILyIKOtAitISNDRILNA4RALSEHNLDALLt FVYPMDKDLYSHIORVPLTFL'tODWADLSLYVOKQRYCKIGFDSASTVYIfKFAQ~LP

CLWtPLOCFTEItIRSIKSEEEIRRMOEAAAUGSA<iYOYVLTLLR~"ITCXEVVRpGRAIIi CPn ' _ iDRPLKXCCIV~IDIGIWG7fCSOKlInf171LG
CT581 hypochecacal Prace>,n AEAGAEGPSFPPIIAFCENSAFPHSIP

FMMKTKTLELEONVfLLL>''~JLIfRIFATPIGYITPREFQtiVVFNCANCQOEIANFFPEMTPH
I0~1L1VRVLRENHLDTYIINCICIIIRICR

LINGKLTQELAPOQKOAAHSLIAEFlOIPIRV71IIDINERGEFINFITSOMLTOOFRCIFLNHIHEYPCSPRGSQVIC
.f.~fl'ITVEPGVYFPGICGIRIEDTLCIt~l0~IF5LT11RPVISE

RLARVDCQEFLLMIOVDNTCHLIRNLLaRLLEAQtOdPNCEIOdLQEIQEEITSIJOVtiFDELL

CPn 0811 911996 923357 0804 911071 910310 CT911.1 hypothetical protein CPn _ FfLFFKLSYNtIFNLPLTMYOLLSICYSFVSFIALLWNLCYSPNYVTDLYRISLSAEESL
qp6D~CHLTR Plasnid Paralog EIFSSMGNLKTLLESRFKKNTPTIMEALARKRMEGDPSPLILVRLSNPfLSSKEKEOLRHLGGIRAFPOAESLLCCACA
LNFPDLEERLPDLRKELLFLGSNDRPDAOGCRFSIALiISSKE

LQNYNFREQIEEPDLTQLCT'..SAEVItOIHIiQSVLLHGERITINRDLLXSYREGAFSSWLLCYIAALKFRVYLIiV
'1'NSSItGPVYSFSP10GVP1'EWIECFSVSVDCRVE111fVRLOGLIaEL

LTYGtrRpTPYNFLVYYELtTLLPEPLKIlD'IEIDIPRQAVYTLASROGPOEIOCECIIRNYAGISKPRDCETLFLNP
PJ1NKLDCWEIACFRVOASFPVIIQXIRRIGVDKFLIJOIOGAEIfADXA

ERXSELLDAIRKEFPLVETDCRICTSPVKQAt.At4.TXGSQILTXC1'SLSSDEQIILEIG.IKTXERVDFVSSDEEt iIISRYLAVCtM.LWDCNC~IpTCGEFpCASSRAPLFEIfIaI00KVMIA

XyNyFpm.XV
DLWNIbO'1'ORQTISLVXGVPSPIEINEYIREIEFTCMRSWSKPIVLVOCrpRt.ILSPOpN

LRTAIOf3iiEICLSRAD0IQ0YV1GKV1CPLLVFERLEXDLRGFVLRGNI~'t7~RTLVC1'ISL

CPrI_0805 911816 911067 ~ PLItaCPtPAVASpEVSSN1'ItSAAANPGIL'L19ROG5 minD-ehranosame partitioning ATPasrCHLTR
Dlesmid protein GPSD

GYJIRR!!K1'IAVpISF>(CCTAKLSTTLffLGAAi.AOyfIQARVLLIDFDApANLTSGfGLDPDCCPfL0815 YDSLAVVLpCEKEIQEVIRPIOD'L'OLDLIPAD'l~f.ERIEVIiCttWADRYBHERIJfYVLGSgspD/OilQ-Gen. Secretion Protein D

VQDKYDYVIIDTPPSLCWLTESALIAAI7AlALICATPEFYSVKGLERL1GFIOCISARHPLMVPfPNS4LNLVAL&~G
.CCSS4YALTIAEIQIASLEHSGRGAODYEIiIASPNANOtEYSL

TILGtJALSFWNCRCIO~tISAFAELItitCTffGKTthtl'KIRRDTIVSEAAItt~VF'ATSPSAOLSKLYEFJ1RX
LRASG'1'~EALWICDLIRRIGEVRCYLREIEELWAAEIRIEi~.EDIfAL

RASCOYFNLTKEL.LILLRDI
WIQIPCC1'IYNLVTDYCTEDSIYLIPOEICAIXIA4'LSKTWPKESFEDCT.TQILSRfGIC

VRQVNSWIXELYl091K~CSVAGVFSSRKtE.EALPIrI'AYICFVLNSNVDAtlTN011VLDIF

1NPLTlfNDVIAGRVWIPGS7VGENGELLXIYNFVpSESIROEYRHIPLTEI~IISIL
=Pn _ NMFREDLZX1HSEESLGLRYVPLOYOGRSLFLSCTAALVOOUZI'IRELtEDI:MPIDK
LhrS-Thraorlyl cANA Synthecase NANNESPPti!$J1WN104IOV1r00RiYEVLEGTTMEWCOLfOQSf~FIGVLINERPRDISTVFWYItVKNSDPOrr' ~rre~DyfSGEtOtASVGAADGCG80LJ~L18I0IDfIYSEfARD

THI1JE1GDTLVFLTSEDPDGREIfLNTSAHLLAQAVLRL41PDAIPTIGPVIDNGTYY~'ANGSVKYGNFIADSkItG
TLIMVVEKEVLPRIpIC.LXKLWPKIOIVRIEYLLF1Jt10.IWt~IIB

LSISFSDFPLIEDTVIIQIVDEK1J1ISRF1'YCDI(QOAL.ApFPQNPFKTG.IRELPCiEEISGLNtI.RLCEEVCX
XGCSPSV~111 .LXT~
GILEFLFtOGSTGSSIVPGYDLAYQFLJ111CEWRI

AYSOCEFFDLCRGPHLPSTAlIVKAFKVLRTSAAYWRCDPSRESLVRIYCTSFPISKELRANASPSWI70ip1'PIIRI
AW~tSIAVSSDKDKApYNRApYGIMIIOIZWINVGE~tSY

HLEQIEEAKXpDIiRVi.GAII1.DLFSOQESSPGMPFFHPROMIVWOALIRYWKQLff1'A71GYXITLtTDTi'FL7 1'I'GXNHD~tPDVTRRNITN1IYRIAOCETVIIGC1RCIO011SD8107GI1lLC

EILTPQIlaIRpLNEYSGNWDNYXAtItY'1'LQIODmYAIXPIB~ICiGClI:.YYKTfILHSYXEPDIPGIGKZfGM
SSTSDSLTEIPVPITPKILENPVEQQmrrrsrre~~pp( Pt.AVAEVCNV1IR0 TPEOVIdlTillILOLVSTLYCfFASFJIAAWWIKXLEMFPAbCVSLSpV0t0EYDGC

GLE7MLELSTRP~TIGDDSLWEL71TMI1~IALVOSG'1'PFIVRPGEOAFYGPKIDIHVII

t7AI0R1WOCG1'IQLtMFLPERFELEYITApGTXSVPVlfLfPALFCSIERFLCILIF31FKCCPn_0816 RFPIIiLSPE01JRIITVAl7RtIIPRAKELEE7WKRLCLVVTLDDSSESVSK%IRNAONIpVNgspE-Gen.
Secretion Protein E

YMITLCDHEINENVLAVRTRONRVINDVSVZfiFINI'ILEE109SLSLTALLRGIOfellMSILSOELL.DILPY'tF
GIOIiICLLPIEEBSLLITIANATATSVI110DEVIG.LIX

KPVRFVLXEESCIt.ORL00LY8NRl~tI80!$.LTIDtICDCITISEEED4LkTl0SIPWR

CPI>r0807 913950 914879 LtifliILKFaIIJ~tASDINfEPCE~IRYRIOGVLHDRNSPPSNLRSALT1'RL.1tV61001 GT580 hypothetical protein DIAEMRLP~ODGRIXIHIt7GQEVOMRVSTVRIIYGERWLRIL01WNVILDIACLJ111103'L' TLQI~LtMSLFLVFLTAFIWSSSFALSIQ.VtBIASAPIFAZGARMtfIAGAILILaAwIPOGEILTKDTITAPECILL
V'IICPIGSGKT1TLY5VL.QEWOGPLTNIM1'IEDPP6YIO.IOIJIQI

FVGISKXIPLYIVIS.ALTv'FYLTNIFEFIGLOSLSSSKTCFIYCLSPIHSALFSYIOLKEAVKPKIGLTFARGUtHL
LJtQDPDIi?IVCBIRDOETAEIdIQAA4TGlR.WSTLJf11D11IS

KYft.ICKVLGLSLGLVSYICYLTFGGGGDDSpPWISapICLPELLIL~GMSLASFLW1'LLRQAIPRLLOMGILSYLL
SATLVGWAQRLVRTICPYCKVAYTPENDEKSFLiIBtL~I'~L

IEKOSfLSVTAINAYAMLIAGM<SIMHSAWEPWRPLPVQDISOFLYATLALWISNLICYROQCMICPRSCYIfGRQCIY
EFLRPNTLFRSkSrASCIRPYHILREfAEpIGFLPIL.EtIDI

YNLYAKLLRK1CSSTFLSFCNLVMPLYSCFYG<JILL~GEKCVSt.GLVt.AVAPMVAGCRLIYHAL.71VSGETTL11 EVLRVTIOLCD

EEFROGYri75 CPn_0817 927106 928187 CPn_0808 916398 911956 gapF-Gen. Secretion Proclin F

C'"579 hypothetical protein GGRMPRYRY1'YLDPKERRXAGYL.EaL.HIOEAREKLAQEtIIWi.DIREVALRRNSIKSTEL

IXKLPSWALKSLKRMPQSAEPSLAHIKPIIFKGaCIAtl1'SGVSGSSSODPTLAAQLAOSSIyFTKptrrr.r.sa:L

OKAGHAOSGHDI'KNVTKQCAQAEVMOGFEDLIQDASAQSTGKKFATSSTTKSSKGEItSEFDH!'YCSGV1N1GESYC
NLOGC1.~IITWLEERAOITKKMKiALSYP'CVLLVFSFAVIQ.FP

KSGKSKSSTSVASASETATApAVpGPKGLRONNYDSPSLPTPEAQTINCIVLKKGhCCtJ1Lt.GIfIPSLKETFENNE
VKCLTItIVIGVSDCLSAYRYLFLCPASALI15ACIIl9U0tIPWICK

LLCL'JNTtsIANJIAGESWKASFOSONOAIRSQVESAPIf,IGFJIIfDtOANIWASATFaQAIWS.
ILEKLLF11LPGTKKFWKVAVNRFCSVASAILXGOGTLIEGLDLGCDiIIPYDRLRTDIOtD

LISCIVNIVGFTVSVGAGIFSAAKGJ1TSALKSASFAKETCASAAOGAASKALTSJLSSSVOIVQAV1GCCSLSOSLrI
QRSWVPItLAIGMIALGEESGDLADYLCYVAHIYNmItpKTLASI

QTMASfAIt)1ATTMSSAGSrIITKAAANLTDOMAAAASKMJVSOGASKASGGLFGIYLI~KPNTSWCOPVILIFLGGL
IGVIMIJ1ILIPLT~IIQTL

wSEICVSRGMNWKTOCARVASFAfRJALSSSMOMSOLMHGLTMVEGISAGCfGIFJANNQ

RLAGQAFAQAEVLKQMSSVYCQQAGpAC:OLQEQAMQSFNTALQTLOIdIADSQ1'0'1TSAI-FCPn_0818 N predt,ecad OMP (lesdec 11b1 pepcidel CYTKM7GF~JVWSTRDSDFSWWPDRCpNV~IIDPt'HXOYPNIIKCVLRG909fROKRXO

CPn_0809 91'791 916307 SITLIIhafVVI1'LICIIOCALAFtMRCSIHKCKVFOSEQNCAKVYDIiJMEYATGCSB'L1I

CT578 hypoehetical protein EIIAHKETWEEAs~CKEGRKLt.KDAWGEDLIVQWDKCODLVIFSKRVOS~ROt dfMISISSSSCPONOKNINSOVLTSTPQCVPQQDKLSCNE'1'ICOIOOTROGKNTEfIESDAT

IACASCKDK'C'STTKTETAFL'pGVAAGKESSESQKACAD1GVSGAAATTASNTATKIAIIOCfn_0919 929117 TSIEEASK.iM~TLESLOSLsAApMKEVEAWVAALSCKSSGSAKLETPELPKPGVTPRSCT5e9 hypochecWal protein EVIEIGLAt.AKIIICTLGEATIISAISNYAST~ADQtNKLGLEKOAIKLDXEREEYOEMKA9LY:'ICLFLIWEKFHN
NIGKANFHLKIITTDFLTDLYIVTIRDPIAYPLTGIC

AAEpXSKOLEC2MDI'VNTVMIAVSVAITVISIVMIPTCCACLAGLA1GAAVGAAAAGCA

ACAMATTVATQITVQAWQAVKOAVITAVRQAITMIKAAVKSCLKAFI!(TLVKAlAKACPn_0930 729012 ISKG:3KVFAKCTCNIAIWFPKL,~>KViSSLTSICWVNr:VGWVMPAG:KGTMOIpLSENCT567 hypothec i.al protein t7QNVAQFUKEVGKLOAAADMISNFTOFWQpASKIdSKOTGESNEMTOKATKt.CApILKAYOEBLPCRCL'CGTFFRr iET~SIRTEMPMCNSIAMKKOKRCFVLMEt.tJISF'FLIALLLC1'LC

.1AISCJ1I AGAHKTNNF FWYRK IYTVOKQKER IYNF'lt EFSRAYKOLRTLP.'.TI$Li3'.iYEEPGBLFSLI
PORGVYRD

PKLAGAVR.1SLIIHCTKDURLEWtLCNIKI7QSYFETQRLL
:HVTHVVL.~>FOIWPDPEKLPE

s:Fn DAIO ?18193 17925 TtILTITREPKAYPFRTLTYOFAV.K

r.'T577 hyprXMICIC~I VCOteln t:EIWIKKtKKTKKA\b>KMFVKRVPEE:iOEMIIQQLEL\V~DLYKELFLAUTFJ15LTDKefYt_01121 nvH~l7 'llOni.:1 N(IItL:fIML::r;'t'LE.:LJILEELTQGLFF:.'.AQEDAI,IFAKEL::.7lNfK:LKNLTTIVNKQMVKrTSi .:, nytt~cn..t r,:.si Innr.nrn l:At: IYfNhkLA:NKfYM)I'F I
FTI.LI:L'f::I.V::(.%AFOAANAHKRCM:AOTf Ela<:F.NFYI:IKRSACA

F f EYrrF:K::RII4: A I LR I::KI*r:l!VTfY.p LAKVATKKKCsRYRLWVI'F::RFItIN.~.RYNLYA

CPr_tIHII ~s13.s).f v1920~
t.L:'.EffEI'\':'f7TA:lA'\IFIRLLRhA'l'JOTYxP/Ff~:.~.f.'IAIANALI::NKUELLERGAQLG

Is:rllli.v ~.n k.r:ynlu:.
Oror...rnI'1'\'IF.'fl:I'LI'I:f:IsAE(F'IKMtJ!r:::::N::4::LlItYG('IEEK:a.C:IK.'Kf .Nf.IFfIDfLLLEAVL
H

llsfAIFII):IIr.:M::KI~:fkNWI;WJKP:il::1'tIKKTR:SP.LAtL,nVWKK:\K.\DLLFY~1IIIIHPT
l4il'IN'lRl'1'::LLkII:IWHAVKhrJF7IAVII:IV70lJvALELFYTHTDFftI.El.Ht*M(rt.LL:iR'f F:FEfYY:.t):fill'F1:1::NI:LL~,.~w~LLA:L:D7LLEEI7TVAYTF'1::~~:KYNF.A'J:LFYJLIJ1A
I~LL1'.IlIKKMFDY1'f.:::Y:InYLF'LVIdIJMfAI::Pn:Ia'f'.:K::fKI.

yrt~Jlr/Y'MI ~ :1.::: ~:YI IULIILYNEJIAtI:FFLAF
DAQPONP I ffYY 1 Af'::I.LKL.pL\P nl'tr IW :.: ~3x~ml.w ml :'l EE :N
NFIDVI'NIsIm;fINl'F:FNfl.f;lRc:VIMKQ:aEKVWY:ETKKAITYKI'r4:K::YTT17JKY.::(:K

Yk n~I'.r.. Iry1tn11Nf r.~.nl In.rirr fYLII'JLI::rIYNiIt.elflMf*.~I'1YF7C:::KI:::::::QFD!:LYRKVKDLII::NI'KW:KWKKFL

~'Iw_nNl.' vlmHl '~~Ilyn_ ::HHN:F'.Atia'LVLL',:11ALYI::WN7:1.1'IN":~/I/:FIIVF.IRKMI:xIL~::Y::IMK:PIY

NAILf.'~:LIIkFVLNIPSFAV.';FIYLCVILaFI'.~.::ITMn'~CAEFJIKVNfTt :F
.(KDROiHPKTrtIc:::VEWAKTHGY:TGPKAIALPIYA
_.iTCSKDHCDIfHpDTSNKPS

tpLt,ADKFK00LLiLG:YD~sLEYALRYDIRt.LROJ1SFSFSAYL\TPrx.'LONGSLIYPNYC

OR2S '15191A 7lLSO/
Y.iP!'I(CIJfOVVCITI~RROAIIiYIC.r'LNERPIILCOEPGf'~iHi'~E.'t~RIL' r.'Pn ' ~ 1~SITtIpCFFt.EKKNDLPIQ~t.rVEPQDIFUfVIOa yscTJapaHYr,nT TranloCation T EOGIrtlFNYOVGDPST('EIRF'3lWl RYAIQVRFSN':'::INr.TIKELMCICLPELFSNLCSAYLDYIFONPPAYVWSVFLLiS:r'POAALKRLPNFFSSPI
rFfLKOLLIEVtIROSRGIK',LDLKPILVCIG6SRCId.IGVEL.YRmIC

CFAVAPFLCAICLFPSPIKIC:~L~"WLAIIFPKYL1DT'pll2tYM0l44Lf'fVLLVKfZtZIG:fSLIPI'PLOGL
CFLPRVLPPtatVPQFLTQYIIpHERILFPNPpTILPPESYELVIQSINRPH

VTCFVL1FPFYlU0SAC3F:INQQCIOGLEGAT~LISIEOTSPNGIL'lH'lF'ITIIFwLVCPASPWLOLELKTNIG3 5rPTCIAIw7CWCSKHTFLPfOACFLDLIfONLFQFLKOfL$TOKC

~HRTVT.iLLI4TLFVTPIHaFFPAf?MSLu~APIYIITNIKMCOLCLVM?'LOLSAPAAIJWLVIAEN'IYTJ1NITO
VFKLDAIrIPL.3VTCi'TIJWPL:DLOFFSQLKAACLPpIPOM.F$$OFIC

-"d.r?.:: ::!M\Lw''':. ...:.:.r Li.. .'-:L:.i?!WF:vI11111'...' .
FT!:L:.:'......::vWFI:~Uii.-''.:.i:dF~F.':I~IM:..:.a""tiv.\-\:.:.::':F',.'.
.':.iiF
~.'~i :.
:.
' ,:

....:::!: ~, ... .. ., ....:.: .
.
. .,.:.:'!~.t!':'.:'.~,..,. ..I~'..~,y-.i:A~...
:;;f::..:!iii.' .~ rl:...;..

VFDELNMAKNK.i:nOiHKLLCR:lJI4ihSK:n,iL'iv:
PtENNLi,EFKI;(j,Dllt,pNS(ypSpilLF

Cprt_O8Z1 972677 932779 KKL("tKRCSSEELFIIPSOCLLLKL?RPFI''rRRTJtKLVLPELPDKYESIIACtLSPDOE

yse5/IliO-YOpS/tli0 Translocation KLYIIATLOROISHIOKLE1'PEEPATNFLNIFALWHLKOIC~Ip7IVF!'1<DpDpYK~ESG
Protein IRTRAVLAFFATSFKSVLFYSYOSLLLILIVSAPPIILASIVCINYAIFOMTOIOEOTKwNAIVKLLKFSLNACYKVWF
SOYIHMIRI:1'LYLEEICIKYILSIOG7ISIlI~ILTF

FAFAVITLWItGTtIIIStXIiL.SNNILRFACOIFONFYKWK'ITDPNCQVFVCSLLaAGTGINLTA~IV4INYDRwI
MPAKENQALDRVIOtIGQIDJMIYR

LIT1DTLEERIHYLIEKKIRLLDKVIASODSNII3MCNREDLLTILSYKDDICISDSCtS

CPt>r.,0875 933618 977677 PVDAPVEDCIIGVLPPEDS

ysCR-Yop Transloeation R

ERIKVfTItARSIFRFSLCFFfLSVSCClADASLYC4SCPSRCOPTPPPSNSNPIliWOQPCPeL_0836 916960 VAASSVPSYNPPLNADOVLPRDNLSDGSFSDTYPDITTOAIILIFLAtSPt~'LYNLLTSYLbrn0-Amino Aeid t8raneMdS Transpost KIIiTLVLLANALGV00'IPPSQVLNGIJ1LILSIYVNFPI'LYAMYKDARKEIFJINIIPOSLIMKI~ASNSLSIWSI
GCSIPIIIItFCAGNIVFPLALGY1IYNAtIpwS7lYlGlIG.TA

:TAEGJ1L1YFVALtIKSKEPLRSFLIRNTPKA0I05FYKI50KTFPSCIRAHLTASOFViIVCVPLLGLVSMLFYSGI
IYOKFFFSIGAIPCMIFITAIILL:GPFGGIPRAIAYSNATLIS.

IPAFIMGDIKNJLFEIGVLIYLPFFYIDLVTANVLYAMOlI:IIL.SPLSISLPLKLLLIVMVDLSENKSAFIPSLPIF
SAICCVLIYIFSCKLSALIQWLGSVFFPIIG.VTLtilVZIRSIIIIP

'GWtLLLOaltISFK
THPMVpEFIPNAROAwIaGFIEG1~T1?~LLAAFFFCSIVLISLRQLVAEEID(Pf6IEIPL
' SfOCI8K1C41aSLiILrGFFLAAILLGM'YIRFVLSMRIIAGLLVNVSKptILGRISAlAIG

CPc~0826 931382 933611 PNSILAGVSVFIACLTTEIALVCZVADfLARVIISFKR14YAS11VICTLIPt'YLISIWFE

yseL-YOp Tranaloutlon L
TISNLLLPLIALS1IPALIVLACGHIAYKLWNFAYSPVLFYLTLSLTIVLK<.VN

HDNKRSGVFSSL1IFIDPORYYAIVIQBCFFSLIFKD~VSPNKKVLSPFJLPSAFLDAICpT~T

KTKADSFAYVAETEQKCAQIRQFaImpCFKECSESWS1IQIA!'LEECTIDrLRIRVREALVPCPrL0877 917777 LAIASVRKIIt~tELELt(PEfIVSIISQALKLT~ICNIIISVNPKDLPLVLKSRPELID'tIhch-fnodnucleasv III

VEYADSLILT11KPDV1'PGGCIIETEAGIINAOLOVGLD7iLEtU1F51'ILKA1CIPVDEPSETLTNKO!'ILRTWA
LFPNPKPSLEGNSSPFQLLIAILiSQiSTDKAVNBVTPQLPAKAp011 SSSTDSS&LSNDODKXE
OSILDLPP'G1C.YOLIAPCGLCERKSAYIYQLSOILVRDFNGEPPNONU.LTOLPG1~P1IT

ASVFTGIAYC1IPTFPVDTNILRL74QRWICISElIKSPSAAEKDLARYP!GNENTPIY

CPr>_0827 935773 934131 YAAQYCPALNNKIDNCPICSYLiIKJEiINSTRT

CT560 hypothetical protein CCLVTANfFCILDILMKNSKEDDLSRFLP10JLLVESPNPEEIPLxSLSf'RISWLPTINPBCPeL0838 919196 wITIAMKFFPPEI0G0LLAWwPEPLVOCILPLLEGISIAPHRt:APFCAFYLLLIC.SIOCIRthdF-Thiophafe/Puren tbcidation Protein ?CGITEEIFLPASSANAILYYTGPVICIALINCIGLYSIAKB.KftILDKWIERYIDiALSPISINIIfPNSFIQ.FNL
KLGILSESSFNP'SIFMLIQ(DttIMIATPpGECSIAWRISQpWII

TEKLFLTYCOSNPMOtLET'1N!'LSSW1TDALROFVNKOGLfPIGRJILTKENJLSFLwYFLVIADRIF9CSVASFAS
HTIHLCpVIFEEM.IDOALIl.LI9tSPRSF'iGK'iC!'F

RRLDVCRAYIVEQTLKTWYDHPYVDYFKSRLEOCMKVLVKACSOILDALIAI&ARPALPGEFSOMFIlIGKIOLVpAFJ
IIONLIVAENIDAFRIAQT!!P0 GNPSIDCIOEINTLIIF~i.IIFLEVWIDFPEEEOPDLLVp0EKI0Ni1L1lIVmFI88f0lDO

CPr>_08~8 936292 935267 RLAOGTSLILAGKPNVC%SSLIJ4ALWLaIMIVTHIPCTtROILEEOiditOCIUtIRLLDT

ysCJ-YOp Transloeation J
AGORT~IDCDGI&PALS11MEF~1DCILWVIDATOPL6DLPKILtZI~BILtJ11U1DLT

IKRriIWIMVRRSISFCLFFLKfLLCCTSCNSRSLIVHCLPGREANEIWLLVSKGIIJNIOKPPPFLOTSLPOFAISAI
tiGECL'lpVKQALIQSAMOKOEAGKTSRVFLVS87UOAlIiALVAR

LPQAAAATAGMTLGOIA4VDIAVPSJVpITEAIJLILNOAGLPPIOfGTSLLDLp'AirpCLVPSELCLI~IpQNLYLO
PPEIIJ1LELREUNSIGMLSCKIV'IESILGIfSI~'C1GK

OEKIRYOEGLSEOMASTIRIOIDCWD71SVOISFTTENJS~lLPLTASVYIKI~rVLDNPNS

IMVSKIKRLIASAVPGLVPENVSWS>RIAJLYSDITINGDNOLTLtIDYVSVNCIIWCRSCPeS'0139 9N~30 LTKFRLIFYVLILILPVISCGLLWViWKTHTLINtMOCl'10;FFNPTPYT1(N71LE71KKAEGpsdD-Phosphatidylavsine Decarboxylase AJN1DKEIOIEDiIvDSOGESIQiALTSDKDSSDIfD7LP0GSNEIE:11PLfIVBRt~.VQXPOYIDRITIDtRVIEP
IFYEKTMLFLYNSKLGKXt.SVPLSINPI18RIY

frWI.OKCbyl1'RRIQIRPFl84RYKISEKELTKPVADF'1'SFNDFITRJ(LKPWIPIV~KLVFI

CPeL0829 936729 937198 TPVDOAYLVYPNVSCPDKlIMfSKJLPSLPR3.LW~LTKLYANGSIV1MLIPfDKIIIIFN

No robust hwsolop Dresent in Genebenk/E!~LFPCDCLPQKTACV51G11LFSVIIPLAVKDNFILFCENKRTVTVLCC6pIrGKVLYLLVCR111V
as of 11/7/98 iCYICFVpTLAKSfYINIRDSRFYSWL.CFI
GSIVpTISPNOTYAKDDEKGFFAItOGSTVILLFLPNAIRFONDLLID'ISRIBPCfRCIJbQ
IYKT)YCE

FFLANAKWPLVPACYRRVRGImfYiSPLVDLVILFPWlr1'KD6RYSPCSMTII'CICRSIVESIlaRiDfIELI

CIPWSTLFGIGRFCAVWCVGFSCSTFDKIYNTIVAVLGILGLGILTFILRIIPSVLHt.

pVwPLFKCYS CPtI,",0810 950111 951541 CT700 hypothetical protein CPn_0830 937339 977959 ISaRNtJCILKTFIGIAKRDKSOILwNIMwLVIWAt.AASL71IALVA1~YYRlYYlIIItYAV

No robust homoloQ present in Genebank/ti'>8LOVIRHVRL3NELKLWALAEOQLLPILKIOtSYRROCLFItYlQIILRIDpRtE681JQ.LAlAI
C
as of 11/7/98 DSCSFLLPCTEYEAQTFPOVFSKVWYKYXSSRI:.LIALLYNITLVIGLIFINKKYLCOKKLG~PYFFLCIAYKAYRFG
AFIfECAOAFASVpQOGf'EEEDAAKYASALVIILG0L0ARC

GRVILKIY(~EEFFMTERFPSIGAGYLRVRNIWSVLFPFEDLtC.VCPSVPKDFPLSAFSLI6PWISPLSNOETFVrIO
ttIYITSKRYKDAI
n KYL17CLIYWSYLESIPVVGAFFPSIGRLFAMWCiEDFPGSIFSRIYNTIVCVLCILGLGISSYAKAGKLI"RIILLSN
PVYKLEALFNIGLCEOKLGRFGKALLIYOSSDGWBRCDAiiJIKY

IMFILRI IFfLLTLPFWLISCLKSSM
AAMAAMDORDYVLAEPCWCL.IILRCSI'FAKDYKCCIGYCFSLCRLRKYCpIIENVYCQ.ION

FPDCLTACKAIAWLCGVCYATLLDSEEGIXYAIDTAVELtkiSCETLELLSACEARCCHFDA

CPn_0831 938219 938174 AYEIOSFLSSPDTSLOEKORRSOILR1LRKKLPI1~HNIVEVDALLAA

No robust homolog present in Cenebenk/CM7fL
as of 11/7/98 NKRKN1TVLIAKSESEGAFFEATpNYPTIQpGYQLVRIREHNLSVRAHFDLSLSLDASVNPCPr>_0811 951719 M . secA-TraneloCase SecA

IKRHIG.CF(.IfRFFGSSOERILKKPOKLVDtIVNIYD~.TPLSDDfLRNKTAELKOItYpNG

CPn_0832 979750 .938827 ESLDSNLPEAYCW101VCRRLAGTPVEVSGYNORWpMtIPYDVOILGAIAl0t1(GFIT~Idt lipA-Lipoate Synthecase CEGKTT?AVtiPLYLNAL1GXPVHLVTVNDYLAORDCLWVGSVLRNLGLTTGVLV&G?LGE

VMItCRpTLNTDQPRVRKKLPERFPKwI)QRPLPOGSAFNATDATIKRSGIIPCVGEEALCPNKRKKIYOCDWYC'fAS
EFGFDYLRONSIATRLEEpVGRCYYFAIIDCVDSILIDfJIRI'PL

RACWSRKTATYLAIGIriICTRSCSFCNiGNSKTPPALDPTEPERIaLSAKEi.GLKNWITIISGPGEKNNPVYFELKF
JfVASWYLOKELCSRIALCARRGLDSFGpVDILPKOKKVLEC

MVARDDLEDCCAQCLVDIIOKLREELppJITTE4~.ISEFCRSLWLVSKGMPI1JRVLRRVREHPDLRANIDKWDVYYN
AEpNKCCSLERLSBLYII
\SDFOGNVSALHTLLOSCITIYMOiV

ETVARLSPWRHKATYARSMFYLEOAANYLPDLKIK.iCINVGLGF11DGEVKQTLODLASIVDEHNNDFELTDKGMOOW
VEYAGGSTEEFVlIIDMCNEYALIENDETLSPADKINKKIAI3r GVRIVTI~,.OYLRPSRKHt.QVKSYVIPETFDYYRRVGEAMGLFVYJ1GPFYR55FNADMILAEEDI'LL!FaPAIiC
LRpLLRAOLIldERLriIDYIVRDDOIVIIDENIGRpOPGRRFSECL11pJ1I

SVQIHIASA
EAKI'NVI'IRKCSO'L'LATVTLDNFFRLYEKIJ1GM'CCI'AITESREFKEIYNLYVLQNpIFKP

CLRIDNtR7EFYITfEREKYHAIVNEIATItK:KCNPILVGTG~VEVSEKLSRILRpNRIEHT

CPn_0933 !11171 979717 VLNAKtRfAQFJIEIIACACIfLGAVTVATll21J1GACTDIKLDtdEAVIVGGGiVIGTTR1108RR

lpdA-Lipoamide oehydrogenase IDROLRGACARLGDPGAANPFLSFEDRVIRLFASPRLN'ILIRNFRPP6DGYISDPMFNRL

RCVLFEILITVSEISA1'pEFDCWIGAGPSCYVrIIITAAOSKLATALIEEDQACGi'CLNRGIETADKRVODRNY'tI
RKFnLEIfDDVIWKOR0AI1N1PRNDVLtIll6$VFDLAKEIICHVSLM

CIPSKALIAGANVt/SHIKHAEOFCINVOGYTIDYPAMAKRKtnVVOCIRf?CLECLIRSNKVASL'MSDROFKLWL'L
PNLEEWITSSFPIAtNIEELROLKDTDSIAEKIAAELLOEFOVR

ITVLKCT,3LVSSTEV!(VIGOLiITIIKJWHIILLT'C3EPRPFPGVPFSSRILSSTCILELFDHMVE.LSKAGCEEL
OASAICRI1WRSVMVMHIDEOWRIHLVDMDLLRSlYGLRTIIDQK

EVLPKKWIICCGVICCEFASLFHTLGVEITVIEALGHILAVNNKEVSOTYTNKFTKQGIDPLLEFKHESFt.LFESLIR
DIRITIARHLFRLELTVEPNPRVNNVIPTVATSFt0a411NfIC

RILTKA3ISAIEES~OVRITVNOpVEEFDYVLtII:ROPNTJ15IGLOC1ALVIRDDRCVIPLELT'llrDSEDOD

PVDETlIATNVPNIYAIGDITGKWLIJUIVAStIQCS'IARKNISGHNEVMDYSAIPSVIFTHP

EIAMVG4:LOFAEQ(NiLPAKLTKFPFKAICKAVaG:115DGFMIVSHEITOOILGAWIGI:Pn_OS142 vR5015 PIIASSLt.EMTIrIIRNELTLt'CIYETVIIAHPTL_EV'~ALLATNHPLHFPFK3~.T702 nyptttnHtw,O
prucmn /frame-ahitt with OBI31 KYYTFFTI.~.A:IPW::NL ALICfI::EPEYIY:NQLLKTQ.iL(.TTtNDTLLNAPKDFPlISKIIDKN

''fr~!IHtA nAISAA '1IGOta ILFI:f/dQfrL.::ll%AQFLIN?IRRKFWIF'PINOOVW.~.EWLPFI

't'. :r. Ilyh.tr It.:t ir:a l Iltrtr.:
tn t:IS,ADFANETF110RTCWKt'.lX:::V::MIIVtr;!:FY('.'\FVrDPf'VA:XX:FS::r.'HIt:Pn 4YA
: $H ~ wSS~ fD 't.A'rtA
:FPECA.iK

N.FAFGLF'AV:.::EIAtIf:AVV:Iy?NP1'OFTNKOVIphW.~iR.0:1dPL1ALF47hLLAFAFLILr1'7n:
r.ylxtrhu i.:.U prntam trr.,mr...~.hitr wir.h O9A11 Lt':'ftr.7:LVL'IWIKNAAYIO',:I U: tIKtIKL: :
:'S~IrtFKVICJYItVYS>PI'(IPDIt11Jt1EI)(.(.DN.~.FJ1A::LDKYr~CIr;V'IVEFJJrOQG
V1VAYRCYAK:FL

GLLI'/L/:F':VFKlf1'tllVftCt .:1't. IIH :'. 'IAL4'IH 'tA~LIAv J
m,! 1 ::WI/::NI I.unilY heli.:ar:a 'Ftt 4RAA ~N~ wv.'P.I ..'hl , tlNtIIVLFIUrtIFRt~7AM011f.LJIHRKETWTiFY'Et::~?titll9Uit'u\I'EC:YWL~TLKWDIDm %ptw:r.TF.c:.:m:Tt'-ItualuI Irr..n.

Nla'F.lt:.':a:(IX:f7'L'LIIIXiS\YFAVYpAU:IJIfWILYFHII:aWIAVF::11FFLD::IPWA~YNHIF
TTHLKIALIGRIMV1:K::::l.l'DII1!.'YI::IAIVfI::I~%.TfRI*LYt:FIJIAPV:VFAVV

:1:HV'ITt.E:.:T'SIITLTIFR.'(.::I:EVFQOWLRTIIL\.~EfaT'JFTN!"PF'LK::AL'lR'fAKKFFF
LIfM'7:/L11N::ED'IFqKlli'IN~~AlaywlYFJviH/i.Lt:Jti)ITr'3tTEF.DN11.AKLLG'LILKPL

NF:FT:Af?S:flt:l?1::47ra'f~.:IIF::LOY1'jiLVFKAFIt.:FtTf.f:DIFIKf.FalIIT;:LFIJI/
::ltptLLVAtIYADI:H~EELt~IHETYK1~:11LI'P!I:."PAllf)KII(Uft.tI~RIKLVMILPEPREEEEE

i:

:LEEV:VDfIIF.E~EAALP.'aft'fPOf:LVITflC:F:.LNt.~IYROLTENNLPMf:'.
'EILWIRSFQNC:VNC'."f:A.~,I~:::,f".'!!tRLr""...:.::
r~f.'TLPESPCCAPICTLKIAL.IGRP

tM:K:'.'.:IfNr:LI.NEERCIIONCP4T!'RDftLOILY:NK>XtOYLFIDTAGLRKlIKSVKNSIEfY1'YOCCA
TiftaISYCT.TI'7AKCNYLDALNOEKSYWOAR~F:L
"DQ'fDOFATNI~v5 YIIiw:RTEKA1::RADIt:LL;ttOATOKL:w(EKRI::'LI:iKRKKPHIILINKtiDLLEIyRtKGTZYRGLDLFK
IfIKIRICI!>pIFL~I/RLRI
PI~, :illY~,~'L.'p 'w"I

EHY''.KOLRATOPYLfbAFJILCt.iJITTKRNLKKIF:IIIDtLHIIWSNKVPt'PtVFfKTL~ISAt fir'rtQIATFO'tpKH9CLP5LI:KYPIyI'NK711FIK
PLONf~S fOkTt.'PTOONV

LHRMIPOV ICrRRLR IYfA
ICKT"TPGOFLLFIASIGI~IIYS<iK7IAKYIltELIKEITTFOSADLYYSL.iIYLKCIItR.pAVAOP
ItIAKSLLTKHYEYYLKIfCLKSSFNLYCI LGKAVG1:.

PfDLEfICEKMCPtIN
NDLKTRANADITRCNIIIKAAIDKtILVEIKAOiIELSK.~aCTRtLI."_~.'_TNfKS(;SOIw.lANL

SCL(~'FL;iCLTLKAVNDFNATYFJ1F:AEIF~tPfNM~IItRCLATFLiFVtOIxC~CITPGC

''Fn nrl.t5 n5ql il ~5~a5r1 OOOLLOANE.S!'.QCOF3'?F.~rNQOftILfLESa~ANQQESIfGVSAAL~LLNpNVSKIJIRIIIKS

.. .,:,.., , . HaO,,~.~-, ::.li.~:'.:' . t,.r~,.. ,.:Ih:YrJA'IF'.":::":Pl7Wl.Mt:n., ..~., , -RPLEDLDIATNAs'PTIVSTtPPtriII::LCJAFGIIIJKpOCRLfEVATFRSOGtYKDCRHPiT7l~
nypotnecvca: Protean ORIIFSSI9tGDALRRDFIVttCNYYDPfEDKVFDfYCIRDILKl07IRAIGNPRLRFSEDKNINNPKIAUWSLPLTAJ
APVFEESYffPa'Va:~.11DYVOAT'lGSPIILTVLKDVIKGwIR.D

LRILRAIRFSSSLu'FCLDP'ITLRAILXLAPALVNSYSPERINOGJIKIQ.IOfOPI'GALSLLIGKiIFL.T~OCFI
N'ILTLU1IIQA.iIrIDpSSRFSRKKEtKIIItQFIILtIUIAfOMTklSG

LKLKVLZFIFPCLRDIPYS4LRTTZLFARKfHPTIIPPILFLLPLfnGVStWITVJVCRVpPI7IDPVADKItPLOSAF
AYVLLOKYIPAOttALYALCRELHLSGYApIILFSPLLISIIKS

LRISNKEt.KLIESNYEALPNfQNpSGNRVPWANFL~ISPfIIPLFLELfS11L4KDPSR00HFINSAPINYNIGSYIS
OTS<.TANFAYCY!?IILSRYt~IILVSpCRLDIAB'1VK111GItWIIiA

ISRVptLESRLEOFILRIKTSSPWSAPDLZ.1KGISPGRLtGt7f.LRRJItILSZENCLDKSVKJWVSL'tDROKKCI
ECIIASYTKSLOVINTOLTDVITI'FiI,ASITFVPGL~I7fDISYRIV

EKZLLLL.OLKGtsec O~.SI
IAL4NDL1M.VDGKVDITTAVNOGLLNFFT1YL?DOpNYCpt~tpTpptIlLtx.E

LIWpppWSLVSASL%LU7CNY7TVI9GF10~t CPeL0846 9597!7 95A11~

clpX-CLP Protease ATPase CPn_0851 97119 971106 RENHMtKXNLT:CSFCGRSLKIriItKLIJIGPSVIfICDYCIKLCSCILDKKPSSTISSAPVSEatopB-Ourer l4smbrarte Protein 8 TPSCPSDLRVLTPKEIKKNIDEYVICOERAJUCtIAVAVYNNYKRIPALLF01KOVSYGKSNCPTDINSKNLKNLRLAT
LSfSMFfCZVSSPAVYALGAtZJPMPVLPC1R4PE0'tWICA!'OL

VLLLGP'ICSGKTLZAKTLAKILOVPFTIAMITLTtAGYVCEDVO~tIVLRLf~AADYWACNSYOLFMI.IVGiLKtGt r' rlf'DYVfSLSANITNVPtIITSVTfSG.~L"t'fPfZTST'1'IfNb'DFD

RAEpGIIYIDEIDKICRTlJINVSITRDVSGmCVCOALLKIVifXiITANVPDKGGRKHPNpEIIOiSSISSSMATIAL
OtTSPAAIPLt~IAPfLKOYYRLPWiIYRDITfIIPGtfA

YIRVFTtENILFIVCCJ1PYNLDKIIAKAIGK'f1"aCFSD00ADL.1~KTRDHLLJtKVF.TtDLIE.SLYIDCLI~I
C15DYCIVAIGLSLOKVLiiKDNSFVCVSADYRNCSSPIMYIIVYNKJWPE.

AFGNIPEfVGRFNCIVNCEELSLDELVAILTtP'L'N71IVKOYNLLFAt&NVKLVFIOCFALYIYFD11TDCNLSYKt SISIISIGISTYIaIDYVLPYASYSIGNTSRKAPSOSPTELH~FlMFIC

AIAKKAIfQAKIGAPALQlILFtiLfttDII9PEIP5DYNGINI0E0TIAt2DtAPZIIRRTPFKIRKITNFDRVNFCF
LZTC:ZSMiFYYSV~RWCYORAINITSGLpF

FaIA
CPtL0855 971001 972991 CPtL0847 960019 959787 ppdA-Glycerol-3-P Dehydro0enase elPP-CLP Protease Subutfic GGBBIpNIGYLQ83IWCPCLASLIJ1NKGYPWANSRNPpLIKOLQtERRNPLAPNWISPN

KLFDEEfOHTL.VPYVVEDTGRGERIIFmIYSRLLImAIVNIGQEITtPLitHIYIApLLFLItISP1TDM0J1INNAE
NIV~11TSAGIRPVALpLKOZTDLSVPFVITSI~ItpMSiLIi$E

SEDPIQ(OICIFINSPCGYITAGLAIYDTIRFLGCDVNrYCIGOAASIkiiILLLSA01'l~lINLLViGDSV'PPYIG
IfL&GPSIJIKlYiI4GSPCSWtISAYOSOTLIIpINl~IISLPIANYP

NALPNSRMttIHOPSGGIICTSADICLOAAtZLTLIOUILANILSECTGQPVEKIIF~SDtDHTDIIOGAALG~LIONI
AIAGGIA~LRfCaO~tAKAGLVTRGLHtMiKLJUItI~CKPiTW

FFNGAEE71ZSYGLIIIKWfSAKETNKDfSST
GLiKiGDL.CVSCFSPSSPNf.ItFCNLLJ1QGLTFmAKAKIClNVI9GAYTJILSJ1YQVAIOWK

ILIIQITDGZYRVLYEM.Da.KtGIALLtARNtKCEFL

CPn_,0818 961556 960177 cig/muri-Tripper Factor-pepcidyl-ProlylCPeL0856 975110 977995 isoaerasa VOASSPAFPFKSNJOCGCLVPRSLSNEOfSVOLiFSPGCIVSAWKVSP~TKLFa071LIfA0X-1 Homolo0-WP-Glucose Pyrophosphorylase KIKKEITLPCPRKGXAPDINIASRYPfINRIQLGC.VTOQAYfUILSZYCDNRPLSPKAVRGSRLIwNVRLTVIffESV
YSPSAIqfVNSL7IDIfLKAINOEHILDINPSLSPKQppRLf00LTS

SNSITQFt)L0LG7UNEFSYCAFpAISDLPWSiLSLPOHE7N1.SEZSDSDIEKGLTHIOI~FVDZt~fAlOppOLLSS
PTAIIJmFNPITSF1ISSGtDPGtANAGTTLLKtKINAL11VLR~Q

ATKTPVERPSODGDFtSISLtNSKSNDtNIISSMIP:NKYFKLSIIiA~'LWKLINLCISG4RtxCDOPK~fPVSPIK1 0IPLfOLVALKVRAASKL7tCOPLPLAlIftBPiXfROTRSFF

TCHRWtTITSPEIQSFLRGDTLT!'fVN7IVINSIPEIDD6KARpIQAtSLDDLtUIKLRILSIISnFtG.DPNO~V~F
COPWPLL?LSGDLPLtf7l~'1'i.AIGPt~NCCZ11TLLYfif;NYAC

CLEKC~11~CCLOKRFSEAEOALAIIt.VDFCLPI'SLLLERISLITREKLLtiARLIOYCSDttNIOfRGItHVBVIP
I~PLiILPFWELOCFHANSt~B~tEVTIKAJILRpfIIILDIICILYKSNDS

LLIfRKSELIKEAEtD~ATKALICLLFLTFDCIFSDCU.TISRtGIAYI~SRLRfOpOPPKCKfSVILYSLIPONEAF1 1L1~DGKLK7fCL7WZGLYCL~FIRIWIYOpLPLYKVNKIWt DIS~1'LQELVNSARDRLTYSKAIEIM.RKASLL.ASTPSJ1QL.GI~'SLaI610~OBiKICCFIFDLfRYSDHCQfL
VYPROLCPAPLIOa.~NIISPDt111EQRLS
IXtAIpLFHKVlGKI(LSPNTTPLLFJIDFYYPSTSTSLNWFaIK

AFFCEPFfGB
CPn_0819 961752 965~A5 , mocl/snt-SwF/SNF family hsliease CPtL,0157 975108 975792 ADYIINSYSRCF~~LMWt~RDFSANILODCKKLFtOGJIVItfAICZL>iE~E'1'VCISAQVRCT716 hypoehecieal protein CLYCNIYECEIE111>RStBO'ISIDSNCpCSYHYDCONIVALLfYLlOIfFN~4VIlAYJIR>iRDI.IJiiJIpYIK
TARGISRI141DRL.G6LSLZLKVKIHKYLDTLIOJpKRLALTVSRNI0f1'pa0!

ETDNCINLLVItKiLIfETFYAAATKtEERKDRAtpKtIAOtdQ.GIYFlIFISRONIKNYDILLEYLIITLOSSLYXQ
OSLSLRFLEINNOOI.O~.I~tR

EKDBIILLiIVLTYSVNEDTfAPIINDPIEPOLVLRLPCRSKPFYISNIRTFLCGVLYOtPIVKIIACIKNNKYSKDOL
IOT

i~IGRRfFfTIQStNABt>AKI,II7LLZ ALOVILiIIDPtN

CLIIDNGf70SI~ttSFSGLPCCNL$EPIL~ISLTPCPn,pSSB 977115 975757 VD

ODi110PlroTNLLESL1APGZIHHFVYNWFSpOIKRIWLRSFSRtJIDLIIPEALIGSIROiAtlii-Flagellum-spscifie ATP Synchase LPV!~pIYIIEIANVHLLNSFVTLPYVDEYRJ1ICI7NSYLDGLLEAIfLIIPLYGSLRVPAASLiIIDISt'fRNQRR
TRPSTFCFDSIB~INLNKLKLtIINNWQPYRACCLLSKVSfrTILILYDGLSiICL

LOYOOVRAFISDLGILARNLVLERKNLL6VFSGPIYDCROGAPRVKSLKKIY6l7ffETIPCCLQ(ISSCImPNLLIEY
:v'FNNNTTL.taISLSPLHSVAL.CI'EVLPLRRPPSWLSDNii.G

ANQIIRITFNCPLNLSCpFIYDETIFB.SFRtxiSDRVLDJ1PCNPI.PKfItRKPt.LSLPPSPl0~0lpPIDpIFP' tCIK7IIDtIFLTL~RI
I

SAKKRFLLLPKAGOQSNGTRRGKVNSGKLPCILVLpL~tIAPWOIfNtIGfKVLDLX.VQGVISCPOSGKS&LLS71IA
LGSKSTINVIALIGAtCRtVREYIflOfSNALKp~tTZIIAAP

KCPLNSLTCISLDOFEJ1LPVNPSNSERLIEICKOIRGLIEFDfQDVPOCIOATLRSYpTCAtItTAPTKVZAGRAAIf I'IMYFRLOCNEVLFIHaSLSRWIAALpI.IIALARGLTLSNpYA

GVfBILERLRKFOiLNGILADDtI~IGKTLOAIZAVTDSKLEKGSCCSLIVCPI'SLVYNNI~EASVFHFNSE!'1'LM
GI~IJOOfGSITJ1LYAILYYPKNPDIFTDYLKSLLDfNFFLTSOGLALLI

fRKfNPEPRTLVIDGVPSORRXCLTAtaDRqVAITSYHt.LpKOVB.YItSFRFDYWLDtASPPIDILSSLSRSApALA
LPHNYIIAAERL.RSLLKVYNtALDIIHLCJ1Y1'PGDDEII.OKAV

HHIKNRTTRNAKSVIQiIOSDNRLILTCTPItPISLtEWSLFDFIJIPGLLSSYORfVGKYIKLLPSIKAPLAQPLSSY
CYLCBfI'LIfOLtALAOS

RTC,11YNGNKAONFNALXIONSPFILRPF9fEtM.KDLPPVSEILYHCHLTBSOKELYOSYA

ASAKOELSRLVKOEGFERINIIM.ATL?ALKQICCNP11IPAKOApEpGD.SA1CY0M.1~LL

CPt~0A59 SSWDSGHNIYVFSOYTKNLCII1CKDLESRGIpFVYLOCSTKNRLDLVNOFNEDPSLLVFCT718 hypothatieal Prot:ain LISLKAGCTCLNL4CAD1VIHYL7tiMNPAVENQATDRVNRZGOSRSVSSYKLVTIiJ'1'IEEVfLYtt'POSPGSLS
pSHLPNPHDPWDTtP'tSLPEDPNOKASCELNSLVNt.FRK<SINLLS

KILTLQNRKKSLVKKVINSDDlWSKLTWEEVLELLpIEVLK!lVppLKPDIrz'!m.-ZCEKFLYKXLENPOELALLLSTAIARHTTLRSLTPIKVFLN

PLIL.KTLTOWISTHELPNIKHAEFPPOTSCARSGFKIETPNCILRQLISELLONLLbvLT

CPn_0A50 96575 ?66790 . A

mreB-Rod Shape Protein-Sugar Kinase LCKKYwNCCRYDFNSPNRNLFKLKNFSNRLYNRALGRFOKVFNFfSCNVCIOt.CCANfLVCPn_0860 x'8679 YVRGRGIVLSEPSVVAVDAOTHAVLAVGHKJUtAM.GKTPRKINAVRPMtDCVIADFEIAEIliF-flagellar N-Ri:fg Protein CML.KALIKRVTPSRSVFRPRILIAVPSGITGVEKRJ1VEOSJ1LNAGAOEYILIEEPN71AAIRTLVFfONLAKKLTA
LCISiLCCLLIG31IVSCAILfGRSSNPSt.APTQVKTEKT9CNnK.K

..rvpLPVHEPAASNIIDICGC?EIAIISLCCIVESRSLItIACDEfDECIINYNRRTYNLNLTONLTtPKLIESLTKK
ECLEKDLTSFNPIASAINAIALSTEDOUNSPI1ILSVILTLRKe6 ICPRTAEEIKITIG.iAYPhiOpELENEVRGRDp4ACLPITKRINSVEIADCLAEPIQCIISLTPSL.LFSITDYLC8.
3L1LKRLNISLSt7NLQILYIPFw~ITVNSLPtN'ILIDIYtGKIFP

ECVRLTLEKCPPELSAOLVERGNVtJ~CALIKGLDKALSKNTGLSVtTAPHPLLAVCLCKEMPALAYNAKJIOCPTt.C
LTt~ItNYIIWLTKEtStKIVAHTKHYLYONYUDSYDtVIETL

TcKALEHLDOFKKRKCNLV
PFARt.QNItKSFPAKVLIC:HILVISLMI'/ALASFYLARHAYERVSPEPRKIKRCINISKL

LEIIOKESPLKIALLi.S1 L: PKlfAPaLLNRLPEOLIWCVGIfYKL

CPn 0951 ?66778 068195 PckA-Phospnoenolpyruvace Carboxykinas~CPt1_0961 ~-975Z 979925 REP.~IMVWSTNLKHECtJfSWIDtVAKLTTPKDIRLCDCSOrtEYDELLTthESTLTMIRLnitU-NitU-related p:'oceln NPEFNFNCFLVRSSADDYARVEQfTFICTSTfAEaGPTNNWRDPOFIOtRELHOLFRCCNOASYPFTWKPLJftLPLEF
NIFWSSLSAK'MKKFLTPHCACTFSEEDAFJ1KLNILYI9IIpGN

r;RTLlIVPFCMGPLDSPF.~.IVCVELTDSPYV~IC9~ItIlftRFGDDVLRSI.CI'~IfIFLKCLHRLNaKZfIFr iK.VDtIKNCt'LLDAKFQYF.HPYLIPWPJ11R:NLVCGKSYSLAYIOfILODI

::VGKPL.:PCEADII~IPCNPKSNRIVHFODDSSVMSP''SSCYGGNALi.CICKCVALRL71SYW1OKSLRVHJWCP
ALPED::I: LYitPVIDALDTAVEOCLEIPLEDCSLpf4711~uPNNL~CMN

K::pcA.IWEHNLtIGITNPECKKKYFSASFPSACCKTttLANt~IPKLPGWKLECtrppIAWIPYSOSDWEALTHEOK
t.YALR.tTLAEKT:PYtANCIfCEViYESLENFTVTLAYSQiC90CP

HH:RIYiPLYAVtIPtYCFFCVAFOTrERTNPNlIIrITCRSNSIFTNVALTADCDVYM80LTE3SLG3'lIlJSICOL
LRAY IfELpVKVDE3~L.NL:HP

OPPEPLTIKPWItPCiC;iPIAHPtI.SRFTAPLRCCPSLDPE4MSPGCVPLDAIIFGGRRS

tTltL'flFAt.:~lF9KKY1'Ii.~CllS'.':ITTAAIW7CL.aCLPHDPFNILPFI.'GYHMAYYfpNWL~Pft_0 9f2 ,~Jn24 >7'l7.~..

.:IIFtIR.~.LKLCNLFt:VHNFRNFBJpI:EFLWW:FCENLI'ILEWIfQRTDCLEDIAERTPICYyfttwttit5-rFt.m..l tc.~chin t.l?IfQYFtIIIX:I.HLDL.~rrVQELF.~VOAtY'1J(J1EVEIdt':EYLKLA:~Ot.'L'~~OITDELLRLK'i PtiTIFRLTf7CKT::r.'f:'NEKIQtIRKAFPIFWLtRIQVAtIPSERVKE:9'AIJI::OIPdLPpG

::1:1 Kh:Y. :.AIJf IhJiKTCE.~.I RUL>.t:LY(fi:t1 L FRF'/PtIFt'IFMI I VLAALVt?IL:WPtt:RNll t ILPAH

U)pLLItL~LCRHOt:tt:IT':'tJIIlV181F.':P
IVEF'.r~LtETL::PR::f.LF::It:AAHC:LT1:VIQP

~'irn nHS:: .wrt~7A ,n~a.:f!
lLPL4:LCKDRRtLI~II.CI;UItJVL\FLTI'EIIiIADIITF::.~.AAIJ,Xi~It:::It7CIFIRKGL

":'W l hytxmtx:r ruU Protoiu t7tVf:..'IiFPPFfPSA::I:
F:'.:\t'MNp'1'N:F.ERI::ALI'f.FTFI1T:.Nl~'KKLIUELO::VLr::I
~

iY.lattU'flYt.rt7rV1'14~i'::YINFTPNVITAf.::.~ltlflP::AfEt:a'.::1LFFOELODKIIOCiL
.AF::EVONRLPNIWAAfPDttAEa~:FiILllr/J:fYI'.'aJ:YERF6~F4WVi.tlNwtiI::PP

LKIIAII:LVVF:L::AF:ALN('A~1VOT::I::YLPTEE:i'.'.RCS:L~.N:LIDR'1'tlPt'h'1'ODfVKAI
LQll:H::AU1F.~.LTER::KUI.t::::KLAH.WIILLIKIIIa'ILLt::~:::

Nt,FIFFT::K t FVFI :LINfVFK::1't.~:lTPt'Ff:
IDP::NFE::A ( I LNY ITLIJJNLhPKFAACST

I'rnAU'InALIAt.t:UFVKRIFJ1LKMIMP1W::Itb'iIAFWUF:IFfPt'INMIQV4:lPVTDYW':Pry nH67 t~YISjOWl,y ~.ylxtt ~Y~3 I

VyINd.::INITAAytIK~WI.KNPf.':ILK(llLtlf\AJt'fftJA'MIYPADAEYNARMCNIOSLIspM
(hoePtKxFIYm't.y. N.H.t...

WO 00/27994 PCT/US99l26923 FHMALLILLPiIG0:rIMMEKNLf.iG4JVDLPL...~pOCHLLIOt-SROSfMCL:ptLP::.
XfFKCF':I'AIIKJL:.FF:..~.'tK.'.F~.::.It:LESAL1:.
.fSACAAIONLPIDCIlT3RVR f;'...1RL
iLRR
' 7FEOOEIf:w Ih'LRTFI'E:
PIAKA:
' ' ' .;IXtALLWnINNSKKIPYIVF1EDPIW(ENSRI'I:aAEEtTN~IPLY053AWFRNYGELO..
.
..
.
.
.
rTD'~.FOS
t Ft!P1 RHLCC3AKAf:
Llt ~
~' ' '~T
~

KNKKOTAEOF'iEERVKLYIItR,iIKTAPPOGESLYD'CKORTLPYFEKNILPOtANGIWVFtf~
' .
' .
r~t~YD.WTL
/%' ' ' OONATLCFJV':nPIF'.'T1ERNRLDFOC't'3R'~d~ftLVRCATC?l9L.i ~110~'PSD

EELYL iLELPCCKWVYQ~D~KIEKNPCF~
EAAiLVNS!"t't'IQCgItPLTIRGLP.iLVIGL..~VATFICo:
.~ANCNSLR.iLtNDLEKL 13P0~JRLRC,LYSTMLSLLVKS

LRSMREMWKOLLPOLTVLDFSEr~..SSCC:LDVFAEGIAVRiNtJJCAVSIN:.

ePn_ORFI 99!559 993371 y7bt:-Dradrcced Pteudouridine sYnituse Yf;IIIJVt'KVRIIIKFLA.irYllAiPRKGDEILFSGSIrtVNGRVAECPFVL'IDPEDKVOVt~1'SCPn 11875 ~ ~77h7 't't44I2 -~..

.;l:u.m..,.. y,y.;YYf~l~:l'F:: -'vHLiYF":~':6L.'.i....
..,.,,....w ' -v....

_ _ ...w=..-rr , vc r--.,nr, .. , .,.y.... ...,f.:_:A fi::r.-. . .f'~:F'IP:F:~!.i':!~th'!~'::
!:FVIY ? III:-.:.' :I . .f....F":.
SRRLFAP.KW':.iKL'fL.:VyANh'h:iAEK~:~t'.;,LEdCL.aYIaSAArw~.i::i4AL.i K RLsRR
SD

tWSEGKKNEIRLFADAAf:FPLLELNAIRIGSLVL.GGLRYCEYRELTWELGTYN%L_ ~ITOLSkI'FSOAit80 M
~
Y

,. ER11PELI
Ptt_0865 9811 II 987942 VOG
AAFASCL;.LDSCIY
CVIS6ffDf ~1'865 hypothetical protein SPNGYVIYVIJIGSIFIGISLGJIIfCOLYYSVKSVLfS'hIYLL.'iYYIILEKRNALU1LSOLVGECPn"-0876 EDApSpKEIDFLSOCDKtSWMFLIG~tSYEIIPTFK~LLSFAVOCFLESIETI1~RdaOA-D-Alanine/Glycine Pensease ' AILCIEtifyiASIOJGFDFEI11AYEFJIVFJfYLKLRQMPi~ti~ISKLFRFLOVPSIRFSSSIR'fCLITG.Y1IE
0I1~IKLSTSFCVFPNILLIGCFLlfIKLRGLOf IIOLIO.CFNiJ4.CbLLD

DSSSKANEVSSYGVAGILAf~7JlGIIGNIAGNAVAI~1C~PGALVWVWIaALi.CAIVpYJYG

SYLGSKYRKP~'CEf'IOGPIJUCLAt~RItKIIJYGFF11LPTIhtLAFCACNCVptISCIVP
CPn _ LCAGClPGKLLVGILLALWIPVI'7lGGt~BIRILRFSARVIPFIAGFYCI'SC~IILfONABA
birA-Biotin Synehatase "

hs~fKVIYYEIEEIPSTIILZtAKSYNIIIIrIDPYALTVIS1'KCOTAG't'GKFGKS~rIKSSKGDLLN1'ILPAIK

..SCIIVSILOANlKSI01 FCFFITDLfIIDVSRLFRLGTEJIWALCKDLGITEAICIIfWPNDVLVHGEKt.CGVLPC1'LPVPVVOCLVTLVPpVI
tMVIICSTT14.VLIVSGAY8SCA0GlLNVNSAFIO~tSLGSLGSVIVIL

ECLIGWtGIGLNI~I'1'KOALKDVCOPATSLOEILCNPIDLETtRELLIRNLLGVL4~ILANALPGY1'1'IL'itiF
ACAEKSIpYMIPGRRAM.WI.IfALYVLIIPLGCVIOIOtNIWILSD1'0 PDSLATKSNRGNL .
fS(RIVIL1JCI11LIALLKDVLSINRWALLttRECSVADPVRNLD1 0867 983105 981667 CPeL0877 995521 995982 CPn _ ybeL family roM-Rod Shape Protein RRRDIOLLSPAF11YGAPIPRfYTCOCJ4GISPPLTFVDVPC11AQSL1LIVEDPDVPKEIRS
KYFRYVNSWVFLW

CIRIPa9liICFCHL11A0fiNFFYNINNFNILEIYSLINSNIIMIYHOCLWINNIVYNLSTLITNLAEGAEIFAVOGI
JIT~IKPVYDDPCpPWCQtIRYFFTLFALDV
LTI14.LSWVISShmPTANLIrI'SSKGLL11JKSINOLRIIFAt~IiVVFFIGYFDYNLfIOtW

AWVLYPtTlIGILVGLFFYpSVpNVNRNYRIPFItOISVOPSE~IGI~VIVIlQ.67fIL~RKAVLPEC~1'RDQLYF~
EFNIIEpAEIJ~1GTYEKS

DITSK'iTAFLiICLWALPPFLILKEPDtIJTALVLCPVTLTIFYLSNVNSLLVIfFrTWAT

IGIICSLLIFSCIVSNOKVKPYALKVIKEYQYERLSPSNIOtQRASLISIGIGGIRGPGiICCPtL0878 996660 TGEFAGRGWLPYGYTDSVFSAhGttPCLt.GLLF1'LOLFYCLICIOCItTVAVATD~GIC.LSET Doolain P~isin AAGITVYLAIDM.INIS~'IOGLLpIl'LIfPLILISYOCSBVISTNASLLIILOSIYSHRFAKGCNStVS'fEPCSSI
NISL~D10MIDSOPYSLDR~BELLiIFRFLPSLV1'81WK11COOIClLC

Y
NItSnIRRLISPL7110iLGKLNKODLLCPPAPPVSVCWINANMGYGVFARDtGPYII'YIGEY

TGILPlIROAIf~fDCmIC!'RYPNPLF1'I.RYFI'IDSDKOta:M'RFINNSLDRiAIJIIGVFS

CPn_OB68 986733 981670 EGLFNVIIR'IYJIPIY11G0EICYHYOPLriRDIRKKREEFIPEF~

CPfL0879 997163 996615 yyaT-metal depefdenc hydsolase YRIIfiKVBNOGFFpW&GSKCaNSAYIG1'DSCKILIDLGVSKOWI'RELi.SINIDptDIOA
IPIrllItHSONISGIKSFVKAYNTPIYCHLCI'AMt.CHLLDSfIPEFKItBZOSS!'aODLE
VOTFNVPttDAVOPVAFIPHYRE~CFC1'Dt~4~'SWITRELYDCDYLLII~PB.VR
OSORPDVYKIDtVLSRGHISNOCCGQLLOKIITPIa.KIC.YWtL8T0d0'171=LAISIYSE
SI115ITSIAPEGWIOGITSPIYFSRLCVJlCIII
CPtL0B80 999861 997111 . ttsK-Cell Division Protein FtsK

PtfIR~O(SRRPRLYfLPtJUIItASLYLFFIVCFSCLSLWSFNRDOPC1'Q~RIIGIi~QiifBS

IIPWLAVILHDGS'MNGIIrALRLL1CS
FLLYlFCiIAAFFIpLYFWLBFLYFRRTPRPLFtYKJWIFISLPECSAILLSIR.iPll~l't.

PALi.D1'JtLpKFIL~1PPVSYVGGIPFYLFYCCpSFCLKiiLIGSVG1'ALIIOfVM.IiVL

CPt~0869 987179 986658 YLODGI11LLIOOtTFODOHIKAFCSFFpI'CFIOfGKKLINRANYLPKPSVPFV8101P11Ca'K

CT778 hypothstieal protein SOpSPRRVSETIILDCSISPLPOEEIPCSKKESTFLTPNpCKRFLTKIVtIpaUlx~lt OfRiRFFFPICtS~'11'SDCPQt~ILAKIKtQDPNOHFICSRTPEDHIIOfVRDfDtRVCKCEPNTTIALS$1'PlyV
R6S1a3KSRAALPIC.KSLJ1VPCIDLPQYHLLBIDiRtJlRpfiLOAtLtAIUI

T110CPF7fNW~NALS1G1IFIFFIATLFFLIpi'NRALQVKSLISLCVGWI'FYftGCLKARKALIL1~'FLTSIOID
ADL.Q1IC9CPTLAAPatLPNSCVKVpKIKSLn~IDIAIitL0A8iIRII

w7lYl~.SHRSM.EGD'IEIElNt'D00CIlLRILFlI'AGFfmPLLOnIVEYVCSDbTLLLDTAPIPGKAA1K1IEIP
fPFPOAVNFRDLLEDYQKTNRKI4IPLLILIKKANDpiGIAOW'lflP

MIRES.YIRKmLPFIPLI00CSRIL~LCGWIIFLpLVLCISYTLALViSALNVLVL.SFLHLIIJ(DT1CSGKSVCINf IVNSNINTTLPSEIKLVIIDp%KVELl'CYEOLPIIa.BPVITI

NAKIL70JDKISFJtVWVLCIFITSASI
ISBiaIKLLSREVYNALVWLVK~IBSRYEILRYLOLRNIOAFNSRTRNKTIFASYDItCIRt7?8/MIGI

IOEtSDLLLSSSODIETPIIRLAONAMVGIHLIIJITORpSREVITQLIKANPPlIII=FK

CPf>_0979 988A81 987118 VSNKVhISOIIIDEPG11H~1LIGN00lQ.VLLPSVPG1'IRAODAYICDEDINKVIOdCSIIPR

serS-Seryl tRNA Synihetase-2 't'OYVIpSFNAFDDSDSDNSGEKDPLPAQJ1KTLILQ1'GNAS1'1'FLARKLKICYARAASLID

TI'fNPI'QGFGGAVILPFSPISIJIRItIRKSCCSEKSSIYSIiFCTLLLtC~1E1'SlR.DIKIIRKOLtEARIIGP
S~CAKPAQILIONPLEG

TPEDCEIRLRIDCDPKISLEPVLSLDKEVROLKTDSt1T.0110RRLLSODIRKAK1'pCVDAT

NLIpEIII'LAADLEKIEOHLD010~IAQLNELLSNLPNYpJI~IPVBEDKAGtfOVIKSVDDLCPeL0881 PIFSFPPKHHLELNQELDILDfOAAAItTrC9L~rIPAYIOJRCVL.L6WALLTYNLQKpAANGFNo robust homolog present in Genebank/l3lBL
as of 11/7/98 OLiJI.PPLLVIOfEILPGSGpIPKFOGpYYRVEDGppIfLYLIPTAEIIVIatGFRSODILTEKENKKFAVIMPVPID
NSSRNLQhIfPFSLEDLEQNAEFSP1'HOSAESSSLOLSIrISSAISSIiV

LPLYYAAGTPCFRAfaCrWGAOEItGLYRVNOFNKVH9FAFITPNODDIAYEIDILSIVCE7iEQLSSLVL~ISDPSSL
RDVPIFSAIYESSTFrI'PVPTPLVGVGYINDSOSOYYCtORES

LTLIU.PYRLSLLSTGDNSFTASKTIDAE1MLPG0KAlYbI7SSISOCTDFOSAASCTRYKWiLSOLLGSRRVEWYNOG
NFfIFASLIiJLCPRRPRRDPSPISLALLEtyIFIVFFLBNPPGS

DSQCKLQF171fCLNDSGIJ1TPRLLVAILF~BLpQADCSWIPEVLRPYhCCLEILi.PKDOTP7JPIFFW

CPn 0871 988766 989899 CPn_0882 1006169 1007101 ribD-Riboflavin Deaminase No robust homoloq present in Genebank/t't~I.
as of 11/7/98 EYNE:DFSEQOLFtTIRRAIEIGEKGRITAPPNPiVVCCW1IQFNRIICDGFNAYJVGGPNAEEM'POVALLIOYFFCN
GAPYVREALRLTPHA~IIVWGICPSLYPENPRSLYYRVSr,DIGS

LAL~JASNPISGSDVYVSLEPCSHFCSCPPCANLLIaiKVSRVFVALVDPDPKVIIGpCIaRFDORGFVNSL1IETLPY
SSGSFGIEWISII'DPTPNFAIVNIFNRTAGINEVSRPNl0~1'E

Nt.ROAGIQVYVCiCESEAOASLOPYLYORTIWfPWI'ILKSAAS~FDGOVaDSpGK90NITCTSLIDIRDL3F~C6V~
irtDSLEOEFSLJiGIVCH71JCCVSIftVTSSPNIPYIIIpTLi.GGPE

PE~WIDteGKLRAESOAILVGSR71250DPNLTAROPO(stLYPKOPLRtJVLOSRGSVPPTST4AEAEFi~IPtFPNS
I'IDSLAEIlQBIWRISDAVSIIWIFPIVDTTYNGVWJNCIGPICI

KVFDK'tSPTLYVTTERCPFNYIKVLDSLDVPVLLTES1'PSGVDLHKV1'EYIJIpKKILpVLNCICSTFLtLTNPRS
RRORWRNLRIMVLCYRSLGSGIO~tLFDL51~NRNAMRiM'SCIYA

VEO~'fLHTSLLKERf'VNSLVLYSGPNILCDOKRPLVI'VIGtILLESAaPLTLKSSOILGNLYANV1'LttCWtVaI
00J1tt0YCFPSVRDAPYRYCLRNRYCLTpRNEDSi.Q1'IIDTR/pY!'lt1' 3LKW WELSPpVPEPIRN HLf00pNVAS I tldl.~'VFGLFFGF1IGLMlI'PCGLEIS

CPn_0872~ 989903 991216 CPt~0983 . 1008901 1007577 ribAiribB-GCP Cyclohydratase i DH8Pdntpp/ogre-Phenolhydrolase/NADH
Synchase ubiquinone oxidoreliuctase KERIFRVACLASESVNARESNIETREEVCSAHFVSLEMIEDLRAGKFVIWDEASREDELYELFIKSCIFII?IML:C:L
YFLCIASLLFCAIf3VIL.ACVILt~:RKLFIKVNPCKLKIND

CDLIIJ1GEKI'IYEIDfI'FLLOHTIGWCAALSQERLLSLDLPPNVKDNRCRFKTPtTVSIIDNEELTKTVSGy?TLL
Vf.LLziSCIPIPSPCCCKATCKOCKVRVVKNiIDBPLfTDR9TFSKR

MFYISYt'IGVSAADRTKWOLL10PKSKPEDFISPGNFFPLASSPCGVL1(RAGNTESTVDLOLttGWRt.sL'CCKVQ
tIDNSLEIEERYi11A3S1IlLTVLSNONVATPIKELWAVD~KPIP

Mff.ACLpPCITILAELVNEDYS!!'8tLP0ILEFARKNNIAVIPVTSIIANRNLSDRLVSKISFKPODYLOL'fVPa' YKTNS3DWKp7IMPE'fYSDWEHFHLFOpVtDNSbLPADSANKAYSLA

3APLPTIYGDPfLHVYESLL.E~IQHLALVKCNVADK.3N'JLVRVH.iEC~'fGDLLCSKACDC~YPAELPTtKFNIR
L1TPPFtNCICPN.~.EIPWf:IK3SWF.~.LKPGDKITVSGPY~INKD

.EOL,i3AMSYtAF.KCTCVLVYLACOGGRGIGt.GHKVPJ1'1AL00NGYLTUDIINlJINCFPVDDDRPLiFLIi.w k'r.::Ft,R:;HILDLLUIYHSKREIDLWYr:ARSLKtNIYOEYEfI4EAQPP

SREIGICAOILVDLKLTTLKLITHNPOKYFr:LOGF,LITERVPLPVRISF~NEpYLRTKNFFIYNL'JL::EFLCEOL
W~JpItODM'YTNFLh'MFNLC~I-:RLONPEDYLYY'/r~'PPWN

4fFN,HWLDLF't:CNNRVO .'.::ILKLLCD\'.;l:Ff ::a ILI?DFt;,:

:F'n ~)a7S n!IIINN v'11511 ~'.t91 pNN.I lit.1rsr.N Irlll'Nlrl') ra.ERibityllum.Wine ::ynth.\n.f 1'.T7ll hy(tttth.tit:.ll ftfl.tain f:aIhJTtr:ltB'IFFEYMARI.Kt:ItL:IAKNLpfAIlflC~ftFyAtIADALV::aQETFLKFl7G:iE~:fif:
ML::RIV9l:Fl.h'LL:::a.CLPAEEEALr~::Ktffi-/r)I'AVMIr\IAfI.I~YYFtI~WNtE~NRR

IJaXCIR'Jh:AFEftt'.TIKKW.:::a9IKFDAtVAf'.(T/LIrJCET(kIYW~tIVNtJVn1117It'..\L::K
AMKKKKNDtr\Kua?KV'1'AtA:fItTfVi7f.IPFJft'VtIlltA::r:Y.'/IVIJCIaII::EIt.Y('fI~NK
::

LEt'r:L1ITL::IV V1f':'.e\EtAWt,)R:xaK(:RHV:VS~fAIEMA'fLITy r'.fn OHNS InIOn~~.U I~NI'r~ 1:

~:Ir._tIN%A '11116A n91'74n yycArHNA M.tlrVlrl.lrca..r::..

'1"7':': hypntlN,r i.:.tl Prnr.irt A:al:l'M:7IT~Wt.'1'llht:Vma:::l'1J:al'I::Ix:LYYYE1:L1.INJl.h'AILVI~.:hhllAlLltt .'::I

f.f::WJf.KILTKpRINtEF\::MWIlLKIKVLVFPLALW~1'Y_tl::Ir:yAC:I~\:::WTN:b'fKVK::LRr:
RtIKMEF'::FFVfYFIa:K::lt:1't::::l'YIKK)afVTh'LLIIIIJ~rfM)tLKI:fNhiIMnfll fr:::F.'/W(HpKLRUYPh:I.L.W1.T&~O1:A1'LL'f.~.TDIOItIIY:EKLFNKKVPALDIAtfc:MIHLFEI
IIA'fFPI'KNK.:::1:.'f1:1'L'It'Iv::a ,r.llYM/It.T!':f.Tl'f:%hVNI'~1f'ltltS.Iltt:IlJ.~.::::I.

a~

NtA:aYWlEKVAARCLiCYYETKLLYI-dPaWY EFLAFCU..L:!5C~FYiEGL
:!ItRIF:LL::::..:K:AGA:K~::iE.'atQKA.iSCfIPGL
~50CN.iAaF"LRPR~FFOPOLTQ

AAKItETAVEFINPEC~:.:.:LY~-CACt':GINL:PYVKNVIGVEIIPOAVASApENI%APLCCAEW1YLYC'IVLRONPRDt'HWINRCRF'Jt.;:A;.fI
C;.1'.:.Y~::.NLICfVVrLE'JL.CE

'VEYILCOAKAFCKRHENCKAPDVILtGFPRCCIWSK'Jt.KIILRtG.iPKIVYISCFROLNaiR'l'PrINPIYEt' t'1/0'~lTiQt'0:1~."!~tJ:NANAIIfWGNItFR''~'S.
NNKEDI F~GK'.
~
~~

. Pf7lYG
NPKTOFQECADLta~OCYRIKtO~IDPIDQFPYSTHLENICLLfREIOPYCt.~00CPNh<xVVSHEV~GSFWDaIJ4I
J'etiANi'ftRINI~i.UC' Y6~lRTi ' ' ' LViAHTIICHGSPK .~~.NKAtIQSPLCV~I
N
WDVYEIDCYOFTNINfTFSSIKRGOERf'( 0996 10:1299 l01 X709 ETKQII~MLPEEItFfVPPAVIOJFFAHK:vEDRK110EQWLDEVRVWSK.7FPEIJtEEEYAL:S
: Pn _ HKLPIQ4LE3t::CSVE."tPDSIAGRAA;iNKLIOVLSIOHIPYLI~L,iSSDt~.WIANt~' nccA-Htscone-Like Developmental ' F:~tean ItTLFWILKDTAKNMKDi.Iw.StiHDLIKAEKtTIItAAAORVR1'DSIKLdtVAKLYRKESiKAililYDFSCRNTK
YUVPEfCtMTtNNf:LAYS
OVFRPFGLTFLVFSDYNRNAIRLAALiKLP

... . ...,.,~ ... r.!:y_ ,;... . w_F~'!C..._. . . -:'Z~lfaA.':'-;
. ~.,...... ":.~F'.''T:4;s\':F'Y~:'<'t :.t?f'-~.

.. . :..:R.~ALw~. . . . . ~ , .:i'.F':
!"!'": ' "\:. ":.1K:.Ef :,ppWRVS':;FH.iIELFtiIuL':'ruXv.::%.:.:::L.:1RV.;:'cr4ianliwwiXY
iv:.iEtitrlitlMD

:Pn_0897 1011692 1014157 RFOYSCA:DOVSfCCFTi .1~a,l:.~R::.oO

CNLTR possible phosphoprotetn NKKLYHPTLFLRPLIRLSLIFALSLTLISQ4FPQQKSFGHCCAONNSALISCKNCCCtACPn_0191 10268:3 10.5988 DPIERVLADRtTLTJINOwG'ty'JVLVREYLLKCIRKGDCDYCVItILOKIS.ALRLPKIIiIRItDemn-ANP
Nuclwsldase LpILwIIRtNPfpAPLRDWOQLFTIGGNLSI4DHLLFCLYIIrTt~ISCYENRKOtxIQiJIKtPRI4D10tA104LRR
KHYKCERVSKHTSESRIt10tk4.1:RY~SSVICOFCPYLLLTNF8YYI01' OGDYKKAIEWCF1.VMLiKOSCSPHPEIVOIEKTFt.OKTI3.At4IKYAQtAOESCD11LI.FAKtiICVPVFf7CSN

TPYCLSEiAYTEANDAWLRIARGIVSRTNEVDSVLLSNAI4NLPFAREXJ1IPELEVLIDCLRSFfYOVCDY~~IRCOG
TSDAYFPPEVPAt.7INFVVpKATTCVLCDItKAFiYHIGIT

NCfIYLESTLLYYAYFSLLBLYtIpNImFIISLERLLEKGOiIVLrVPENPYIPEYOFFLOAYFYtlrlFtIRIWEPNK
1CP1UIKLYlIKAOSA~IEGATLFAAGYRIINLPIrALti,ISDLPLIfI~Z

AKGKYLSJ1GIMLOIIDPAVKLCATFARAYLYIGCZAYVGNNY~fAEEYFLMY%SNCREKTKSfONP'IfN~Y'fF9KI
LTGOCVIEFG.EKVFtLKMJvSDNIOIDppIfRGLPNNEVCt71IX1t ESGIGLF'IJvYAVQKKXTACEDI4LYNPKFSIIYRHLLDSi.CSLSYPt~SNIOGSSJ1I0RVNRNASGSE?SDSDY

AVPEtSEIYSRCIYMIKYRNVTYTtIPIIEt.AYNOV11NLEKRNLLEICRD11QDPCYDKAL

AFHGALQSCASVPRSLIESStNDE7UtiTIRCYEALYFlrI4PDAIJIFIZ.POAFSEEQ4SWOTACPeL0995 LRLViPI'LVRPKGAPNNAKYWDHLVLRPHGDSLYFFCYDLOEYLIGKEDIILKNLSVFAELFefp-Elongation Factor P

PKSSLLSLVYYLOGYSESSAiJttIVCiJFVKALEEF'.EISNSGENNK1WAYIYYNVItLDt.iIDEIDCFNVRVSTS
EFRVGLRIEiDGOPY'..IIIJDPVKPGKv~7A!'NRIKVID4t'LIGRVfERT

TYISGCNFSOAVNILEEVK~WpIIASNPKLIIFLI(CEDLYLWELRWVECLi(YAYFOLHETYKSGESVFL'ADIVERS
IIRLLY'."DOEGATFft~tIFE01;11VA4EKLFNIRONLLEDTIYTL
' AHLSt0ILLENVEKNLISPRSYRDYYCESLQRTLGLCpRFLCVVLYNfr~VItAVEPPIPFiELSIAETAPCVRCDfAS
GRVLKPAVZ'N:CAKIIIVPIFIDEGELV

KVD'1'R1GSYESIIVSK

~PILOCAB 1015141 1014119 henG-procoporphyrlnogen OFCidase CPn_0A96 1027574 1027822 AERRFCVKRAIIIGJIGISC:.SX~IwLNKKFPOAEILVLDKFJ1YA0GFVltTESP~OGiSIDLCT753 hypochet:iul Drocein GPKGFLTRGDGEYTLKLIHELGLOt4SLIFSDRAAR~1RF11YYROKANKIST1'11'LtJIKCti.PBKYFI'FlVIU
d'DJ' lltItIKELSKGOLLIQfLREKSRVLDEIOJKRItANVAIq,VAIIPESIREIE

SLIKDFRAPCYTpDSSVODPLKANBSONITSYZLDPLIT11IRJ~ON&SILSTIDIfIFPS.iIKKCEKVLTPQLFQAI
AEKILE~V

REASSCSLLRSYLIQJRSPKKSKTDRYL71SLSPSIGI'LITTIOEK<.PATWKFSTSVTNIDC

SPKFaCVL'1'PSETPFADNVIYTCPLQpLPVLi.PHIICIENLSKAVLPWfG.SSISILrIRltAF1CPet_0A97 FSLPIOGYfiG.PADELPLLGIVwNS0IFP0ATPCItTVLSLLIEGKI4RESEfvNAPAIAAISEIphospholtydro lasel YLNINOICPDAP'AIFSSQDfI'IPOHAVCFLERKERILPNLPGtd.KIVCpNIACPCLiatCIASNPSLDSttI'VDO
ICdfSt4PRPM0EKPRlMIRIINISDVNFNVLPVNPVNCFNKRLI~LLKKV

FCLVNFQJITTICORFPKtIVR51a71DSVCITGDFSLTANDCEPLWtNIYLTLJ4KNSSVYL

LPOMIDVYTIUSLiIpQ1'FYTIIlpNDQLOpNKVSPNKI?DFMMLILLt7CSQJia41S11lLY

CPn_0899 1016941 1015462 VNLtIOISAIFfFLLBLSPEEN11IIANF(YPLLSSONPSNDLINNfNiptiVLXKrPKVRi.YL

hemN-Coproporphyritwpen III
OxidaseAAVYNCAD1'SPSYIUi9GSISLPTNSRPNVI~.YPEKYQVIrMILOa.LDIDAP

FIJtFNVNFNfLECii(pPAPRY':SYPTALaWEPSDAAPALt.At'ORIRFNPOPLSGY!'IfIPFLEIANEATNOCp KL

C0574CLYCGCSWLNRREDIYEaYINTLI0Et0a.Wt:TIGFRPOVSRINIOGGTPSRLSR

ELFTLLFDtiINKLFDLSHAEEIJIIbIIDPRSLRt~lIEKAD!'P~7VCFNRVSIGYOD'IOADVCPt1~0898 OEAVRARpSNEESLKAYEKFKELAFOSINIDLiYGLPKOTKLSPSItTI0DIL71N1fPORLANitochondrial NSP60 Uaperonrn Nomolog L!'SFASVPNIKPFpKAMtAStINPSIIEOtFAIYSOSRtG.L'!'KA4~0AIQ~IIFSLPNDPGTTKKRi.OSVKIIi ttiGVC~ISEOCKLStMiADKKLFSGIDXt.FOIVfOCSYOPKQiLSPTiFF

IJtFIORITLIRNfOCYSLPPEEDLIGIrCaTtI'S1'SFIRCIYt.OI~NtKTLEEYI0T1'VLRGTfATVK~' 'YAISOTELSFtSIfC4i.CVDFNU11NNKINKENBDCATTGLILtiDIILOiSIfAIILEK

KSKILTE<~ItIRtIWIIINKLIC'ZF'"It4l~EFFNLFL'IfEFD2'YFIFSRDRLI~E'IlGLIt04CISTHKI.I
ASLKLOGEKI-0SALpppSSiPIKDAi.KVRNIIFSSIJ~P'1'I11di1YWIfSWC

SPCSLKYtPiGFS.FVRVIATAFDHYFLNKVSK!(tCFSASIPEGLISITKERG~IL1'SI~VFOCFKIPJIGYASTYF
VSDTASRLTRIANPLILITDRKI~E

INSLLPti.OEIS~NDNLIIFCtDIDPtfVLJITLWNKIQGLLpV'IWl'IPpGiITNOt<.i1 CPtIr0890 10I7A29 1016519 EDIJ1LF'LO'1'ILICPCpI7ISItVt.iIPENV1't-0SCLSIEISESQ1'Ti.IOCLJtiILYLTLILTWf.

hsmE-uroposphyrinogen DecarboxylaseABEIRTCSCLC1'RIIIILIKSTNRt.QSSVAILPTDEDtVEPLYTLiII~tINf~ALI:RCfIVP

STIJA4WDSFtSJIFFDLLKSOTASHPPIWLLRQVGRYFtPPYOES.IOCSQSLKTFFNtifCAIVCVAL/YASLTIGT
PKDDADENSIAISLLOKACCAPLKLi~1TH11DL.DCD11VIAKLSSLC1T8 ATGt.CPSii.NVDA71ILF71DILSILDCFAV1'1f~'APGPRIOPSPEQPFTFTSDPOTIFSYLIlSISVFSREIED
LIAGOILDS<J1TTSTIIJIQALD1'AILVLSSKILIl.DpYCI!!L

LD11IRTLJt9Kt.PVPLIVFAASPPfi.ACIfLIDGG7tSIIDPSKTILSFLYVYPEKFDQLISTI
I

EGTAIYLK1'pFmJIOAAAVOLFESSSLALPSALFTRYV't'EPHRRLIA%tJtLOi(IPVSLtCRCPt1~0A99 CFELt4FYTL0AT0A0n'LNPDYNVDLttAIOKNiiCSLpr.FfLDPAIFLLPOEKLLNYVEAFLntttF-Nutasoyl-DAP Lipae VPLRTYPNPIFNSIitGILPETPLFliVpLWSYVQROLNNACxAONYfQtAFIt.L'EDNVSI~i.SOVSCPKCDKtCI
TGFAIDS00V0P~LFFALPON7lTD

GNOFLKf4AATAG11VAAWSIIDYpf~SFCLELIRVDCIK5AL0EACSNOCNf.IpO'ILVCIT

CPn 0891 11121079 1017819 CSVOK1T17CEFSKTTLSSIYKTNASPKSYNSOLTVPISLtXAOCDEDtIMIL~GVfiP04 mEd-Transcription-Repair Coupling Hpt#TRIVOPEIAVITNINOQNAWiFPp0I0EILKEKSYILOKSKLOLLPKDSPYYLDf.R

NFNIItIDFNPVNLDFSISKEFKEhTLPLLLFIdIHPCATJ1!'L.71A104E'NDCItASVIFIITIPARSCSPTAEK
fSFSFNDPL7IDPnfKAISCOSWZOTPE~4YCt.PIAFSYIIpAYTtdi.IAWIL

LDDLFEId,ATFLt7p11PVEFPSSEIDLSPKLVNIDAt7GIIRDNLLYStiJpHRAPITCYtTLKSY~IILBVPEECV
IRSLPELKLPPNRFENSMRNGNpIIINDAYNACPF~WIAALOALPLP800 ALLEKTRSPOATSOOHLDLAVCDYLpPEATTEt.CICSLGY50VlQ.TSEKCEFSCAGCIVDIGKIILILCHNI~LCRY
SESGW1LYAEKAA$RCDIIIPPICEKWIPV05VLKSYSCEVSFFS

FPLSSPEPFRIEPS.IGEKIISIRSYNPSDOLSTGKVSKISISPAYTET~ISGGItYSNSLLDYSAODVKDILKOVARI
fCDIfILLIfGSRALU.t;.SLf.ACF

FSTPPLYt.FDNLEILCDDFADISCfLSSLPDRFFSIGTLYDRISTSNQVYFSETPIPNVK

NLKINRVIIE1F'NRFMEASROAiPILYPE0Ii0NDEHPLLAFLONLOEYNPPI~KPtJa.ACPn_0900 1032208 =YSTKTKSLKEAAAL11ETVARGDVEIYEKTGNLTSSFALVNEAFIN1ISLSEFASTKVLRRmraY-NUrawoyl-Pencapapttde TransEerase OKORTHFSV7TEEVFVPIPGCnMiINNGtGKFG;IEKRPMiWIETOYLVLEYADKARLLVFFfILaASNIPLIPNPL10 SLFPSIJ1LT1?fl'1'LVLTVAhCVWNM4LItplC4YRDrINKt YVPStpAYLISRYVGTSDKAADi3iNINSSKWKRSRDLTEKSLriYAEKLi.QLFJIpRSITPYCEKi.BMLNKDKAEY
P1COGVLLFISLIASLLVWLPWCKFSrWFFIILLTCYAGL41VYD0 AFVYPPNCESVIKFAETFPYE!'IPI70LKTIDOIYNDlILSPKtIIDRLICGDACFCKTEVINRIKIKRKQGNGLIWt NKPNVpIAIMFTLIALPYIYGSTEPWCLKIPFN~FIfiLPE'WL

RMVKIIVCOCHRpVIVMVPTf::.aTONYE'l'FKERM71GLPIEIaVLSRFSOJ1KVOKLICEOCKVFCLCL1LVAII
CrSNAVNLTDC:L0GLiIAGINSFAAIGPIFVALRSS1'IPIAQDVAYV

'JASGQIDIIICTNKLINKSLEFKNPGLLIIDEECRFGVKVKDNt.ICERYPMIDCLTVSATPLAALVCACIGFL3IYN
GFPAQLFtIGDICSLLIGGLLOSCAVNLAAECILWICf.IffVAGG

TPRTtJtNSLSGAADLSVIAMPPLDRLPVSTFVNEHN~EfLTAALRIiFI.LR0GOJ1YVTHNRSVItAVISCRWIKKR
LFLCSPLNHHYEYqCLPETKIVMRfWIFSFVCACLCIMWtR

tESIYTLAETIRNLIPEAftIGVAHGpNGAEDLSNIF'IKPKNQKTDILVATALICt4GIDIP

NANTILtOHADKF1~WDLYONKCRVGRWNKKAYCYFLVPHLDRLSCPAAKRLMIl4K0EYCPn_0901 1033279 :::(~ICIAt.HDLEiRGACNItw.DpSCNICTIGFNLYCKLtJUtAVSALNKHTSPLLTNDDVmurD-NUramoylalanine-Glutamau Liqase KfEFPYNSRIPDTYIETC.~.MRIEFYQKICNAESSEELTAIOEENRDRFGPLPOEICWLFAFCFIRRSRYSuCiJtEI
dICpRILILCTCTTCKSVARFLYQOCHYLICAONSLBSLISVDML

LAEIRLFALQHGISSIKCTANALY~/OKCLSKSEOTKKTLPYALSPTPELLVIIEYIESIERHDRLIJlGA3EFPO4ID
LVTR3F~CIKP'fNPWVEpAVSLKIPWTDIOVALKTP6lnAYpSF

~FLtNAS
CtT~JCKTlTILFLTHLLKILCIPAIJ1M(.TIICLPtLDHltt7pP.VRWEISSP~JITOEE

N t PAISCSVFtJ4FSRNHLDY11RNLDJ(YFDAIILItIOKCLROD1ITFWVWEECSL.CIISYOIYS

CPn_0992 1027673 1 (13101 d EEI EEI LDKCDAWtP IYLNINtONYCM
YAIJINEVIIJVSPECFLItJ(tATFEKPANALIYi.G

alas-Alsnyl (:RNA Synthetase KKOGVHYINDSKATIYTAVEKALJIAVPKDVTVtLCGKDKCCDFPAt.AS'lft.SOTIIttIVIAN

EFFFNLSNIIRSNFLKFYANRHNTILPSSPVFPHNDPSILFTNAGNNOFKOIFI.NKEIIVSCECI~TIADALSEKI~P
LTLSKDLOEAVSIAQTIAQECCCVLL3FY-,CA'iFOQ!'QSFK611GA

YSMTTSOKCIRACGKNNDLDMtCHTSRHLTFFE?tt.CNFSFGDYF%AFJ11AFAWEYSLSV'IFKLLIRF11C~AVR

FNfNPEGTYATVHEKDDFJIFJ1LNEAYLPTDRIFRLTDNONFWSNANTCPCCIf~SELLFDR

~PSFrJ4ASSPLODTDCERFLEYWNLVFNEFNRT3ECSLLALPNKHVDNDN3LERLVSLIACPn_tt'W _ lA3Sfl7 10)5311 r:'fIIT/FEADYLRELlAKTEOtSCICV'!NPODS(:adFRViAt%IVRSIsFAIADCLLPGNfERnlpD-Nur,lmm4te:r. linv.vsln rep'tat t.tmlly ':YVLRKII.ItR.',VHYCRRLCFRNPFLAEIVP;iIrICnVICEAYPELIWt'aL.~.OfOK'/LTLEEESAVpOkt LV.:::EVtaaIRRONVITA'J'/VNAILLYALFVT~Kkl!:VY.DYD.F~n,FPHFASSKVTOA

F'FKTLDRt%:NLWt)VLII:i':':~.
$Ct::jEDAFKLKIfI'YQ1PIDEI:iLLAKDYD'ISVDNDi'F'N:',EEKVIEKCWAEVP::RPtAKETL~FtE.:K
PViVTTPP'JP1/V'.'.ETDE1/iTIIAVPPp IIKLEDF.AKER:;1!KNW(R;Qf~I'.~.G:i'INELIILT:'EFL(:YDH4~.t'.DTFIFJ1IILY.DNIVSF.LF
VRE'IYKEFX~APYA'('VWKK~:OFLERIAPANIpITVAKLt4r)IHGUI'I'IVLKIfDpYIKVt'TS

:~EKUF7:AIYLKV::PFYAEKtI~VGG.~(;gtt'f:.~.F~.'fFIVTIITP::PKAc:LLV111N:RISOCSLI~D
V:aIF:K'Cf'>.1'(''t'.WIPlIY1'tW~Er:D.~.PN'1'~ALRNtIIRLDDLLKNNDLLFYKARRLKPfID
' IYFM'/TAtWNRYRRKRU1NMITII:HLtJtAALEITLf:DIIIR(L4:;:W0tITKIRLDFT11P0VLRtP
~03'i~'r AI::PELL4: tLTLVNR::IItENEf'VfiIREILYwL'ifW:..~.EIKQFFY:DII7.~.DV'/P'~.Y;H~S

III:II!Y?I'IL\F_1'IYA)O:FF'RITKFJIi:%AFI:IkRIErWTr:EKAIGV'fVH(,~J.~.EVLEEtA'rLL
.Ot.'Fvsnwm IW '.::1'. Inlv..l1'/

Vfklli(V::Rt:hA'PI.DERKQt)UKRI~fELFSt::LyfKLDKLIIINt'llhl!y:IT!:L'/NHIrIEHEtt::
W n..l l I'mt::m.n Inrn..ln t'n.;W

NIIIIIIUV'IA~x'IJIIJI!11't:KLI::LSPI'TEKtt:KYII'L:-.RV!:f1111.19'(r:VIIAODLLYAVLTPm.':Kr~:Il:NKNFVI:a'Lfl:ll'::Ia:I.INVFVY:::AIVI~
P.~.LFT:TIIKALIINJVrYLIIl~:I/

:kYI7r:KtS':
~l~x:;:ADAL1'A'ftVIHETLbi(tYltA::I.LYNNF1VRUFLKI::INI.La:iJI.ALIr:'/F'ff~:IL:Ll :fttJ:AkkYII~:F7Yif:PIVI".IKINK
:TyLi '(LV I' I VALY 1'I: rI'::.~.L'lJlit LKMYLY.I:1'A I l.l' I I' I LL IA
L EP Gtr::'.,yl'/ l::A::l. t INF
IN

"1'n_'IH'm In.:IN-2 tn~'iNRr;
't'::VkI.ItYWl.l.l'LLr:VLINi:AIh'/fMlY'/nYIILNVYL11PELDIKr:IU9Y~f'ICIAKfAH::X:

I Y.t n 'fLmr:knfl nrl.t::. KL1I:I!r:11N1::LUKLTYLW:1~411.YfAA
i'fAt:F:F'r:Ff!?ILVLLLLYM.'h'PNY:YAtAIKA::

::LECM4AMttTLII'K011FKlltIW:~LLF.~.Y':. .FF:OfX:3CLIANFICCIffLLLKV
VbIFB~ful.lp autr~tt.lmv. c~. nolaa 'f DESK::.°~L X7tRFRRPNCP".niUSKGiFFS FHLKKfaTL:ld I I f: f PKIVGFf40:iF"-F.~.(.LKVAAKA I OCKK:.?JNL::~w"1JR'.":3EP'.":YFY
YV~~VNVHVKAl9ltlfiø9E1'.1<AIKOKV,'~~'I,I~NEi. L'I~Y~t/:DY~I aV.~l ':Pn_0904 1(175720 177396 RLEEtJ4ItOCPTYPDKLIa:: ~ . ....
muc.-PeotiJtwlYCan Transterase RYINICKIRKVALAVOtiu~CGHIVPALSVKEAFSRECIDVLLiGK.f'aL100lPSLOOGISYREI vPn 091e I046R1) :049094 P!xLPNWPIKIF1.'.RTL;LCCCYLKARKELKIFDPCLVICF~~..SYIISLPVLwIGL'aNKIP fabF-ACV:
Carrvsr Pr~cefn ~Yntnase :.FIJIEONL'JtY:(h?ra.F.~RYARf:L~F.":PVTYI!FRCPAEEVFf.PKR.iF.iIA,SPMIKRCT
LLHrIfRV'Ma'KKRV'l.'~:P'':1':SC:uNl.1'CTFYDNLLACVSGVRPTTSFPCEDYATRTJIw ...,.:... . .."... Il.;.:.lt~'.~ . ,~..;~La,.~... ...,.... ..,"..i.h..:;- .
".,'Tl.:
..t..,..,1.: ~nY:.~.':= . .Ji::::I'K..L...F.: . ... .-..y-v ~w::: .
" . ,Al.. .. ... . ~.~ u:;~;
. . ,... ....1. ,. :.,,~lYr': :~ . . ~Hr. . .'~ 'A."'..'C
../~~,~~CCw~i:'1:.:.nvlLLi:.:i ~::.'f.':?':: . nt't:.~.~, . ...
DVLEGG'MILEKELTEKLLVEICV1'FALDSHNREKORNSLAAYSQQRSTK':FHAFICECL
0~\AYGHLVS4Rn\OK i~w3Ti:AAVrNA:.:Leu:FtlFilidi.9ERNGAFuWIiRWIDPDROGfV
4;1~AGILVLEi'LE.iA_IRRDAPIFAF14LCSYVTCDAFNITAPRDOCF~ITACVU'ailllISA
CPn_0905 1037400 1079875 ..IPKERVNYVliAHCTo.
Pt,CrLSEIrt.AVKKAF'GSHVRNIJIITISrKSLIGtICLCAAOGYEA
murCiddlA-KUraauce-Ala Lipase i G-Ala-0-Alam Lipase WAIpAILTCKLHPTINLDNPIAEIEDFOWANKAQDWDIDVAKSNSPGF~O~~STTLFS
.____.»._.~-~-~~-~..........-nawvw~a.aMVIeVTTPCf.Ylllf'JRC/~.IiD RYVP
cPl~o917 loleosa lo1as39 hydsolase/phosphacase namolog lNDI I4EVCTLVFll0C1'ItYEYSFGVTPIKFFGTPDIO~tfUUCFICFiTRCKMiCFPIGRtaEa KEDPOEAACRELVEETCLSWNFFPKVLTEpYSFNNEEOVlVRKEVTYFLAS1IRGDIW1D
pKETt:pSOWLSLOECLRLLSFPELIIDLTV~ADKFINNYLESS
YLRNYIRIHDVCVSI~GAC~fix'cf:GiJlLlcfarnrfusa.saw.nvw.ww.ai.w~...~..,r...-., CVOCLFPVLtICP 0918 1019272 1019579 A CPn 7 _ YISPtFYDVSYFIINRpGLWRTGKDFPHLTEETOCDSPLSSEiASALDW-Inotpanlc PYroDt>osphatase FCEDGTICCFFEILCIIPYAGPSLSt.IATAt'mttLLTKRiASAVGVPVVPYQPLNLCFr(K~'t' PELCIOI1LI>:I'FSFPKIVKTAHIGSSIGIFLVRDKFEG'7EKISEAFLYDTOVFVEESRLG~tIIESLCCYIEITP
YDSVKFEGDNATCLLKV<EtPQItFS
ELLNSKKPLYYAHPWHSPTLT
RP

SREIEVSCICNSSSWYCMAGPNERCC11SCFIDYOEKYGFDGIDCAKT5FOI4LSQESLDCNfCPCLYCLLPQTY~AS~
YS~'tfICGDKDPLDVCVLTEIDiINHDNTLLOA
' VRELAEAVYPA>'K7G)cGSARIOFFLDE~IYWt.SEIMPIPCFfI'AASP!'i.QAfYHAL~'1'QEQVLDKIOHYFL
TYIUITPIWt.IKC
IGGLRTIDSCEADDKIIAVLEDDLVFAETEDISDCPCI
' IVWIFITDALHKFDICQQTIEQAFTKECDLVKR tALVN
SPAKIEIVGIYCKKEAOKVIOLANCDYLSYICD

0906 1010514 1079915 CPef.-0919 1019375 1050170 C?n _ ltlh-Ltuca.ne Dehydroganase CT767 hypothetical protein FKRYSIlIFICEIKIDOYERVILVfCSIfVRLfIAIIAIHOTAVCPALGGYRASLYSSICQACT
' NE
DAGRL71P011'ItKAIISN'fC~'1'ODCtISVITLP~APBLTED10;.RAFu~AVNAi.DGTYICAD
KWGSEVLELV1~SQLSREASAFRLDIDFFIINIYPFFRNF104IELCFFLSISOFNLDf EFVAYIVIQJLVTNPFJIVEIRSIEtBSIKLEIRVAAEDTGKIIGRRGNI'IHAIRTILRfGYaINDISIVAEE1'PYV
CGIADVSCDPSIYTANDCFLCIKTAKYIiICSSSL1~IAI

RVCSRLIQDfVpIDLVQPFixTWIADQOYICDND55NSTFqIifGESZITCCSCHCH1IDEDLCCICSVCRALLQSLFF
EGAEZ.YVADVLERIIVpDAARLYGATIVPTEETNALECDTfSPCA

NQEEpERWNSCffCSNHH
RCNVIRKDIiLADtI4CKATVGVAN4pLEDS511GK12ItERCILYGPDYLVNAOGLIMtAAAI

tr:RVYAPKEVi.LJIVEELPTYLS1CLYNOSK'~11C1IDLVALSDSFVEDItfi.ilYTS
CPn _ ~cutA Periplaflmic Divalent Cation 1051423 1050471 Tolerance Protein CutA IC-Type Cyeoehrome Biogenesis ProteinlCPn_0920 GTSTYLWEGKL cys0->sullice Synthesis/biphosphece t#osphacase FAFSKFLIIKSSIffAVLILTSFPSESARSLARHI.ZTERtJISCVHVFPKILCBiElOISELPNY~1TVCSWTEITTQ
LS3:YRSDIRLYPFirEKSDGSFITJIADIIO&OYIf CESEEHNIOIKSIDIRFSETC.J1IOEFSGYEyPEYLLFPIENCDPRYLNWLTILSYPEItP' RLLTSSVSRD~.ISTLVPPIlPTS

LFVLVDpII7GTACFIRNRA~AVATSLIYLYRPILSVMACPAYNOTFKLYSAA10GI10LSIY

0908 1011607 I0407a0 HSQNLDRRFYYA'fKQFCE11SLAALi~i00NNA?RKLSLGLPNTPSPRRVISQYKY11LY
CP

n_ AEGAVDFFIRYPFTDSPARAWDNVPGAFLVEtAGGRVTDALCAPLEYRKESLVL~INVI
:.T761 hypothetical protein ILaILFHIIIKNNEI1~1TRRFFKTLTPPCPQYSL:CY11SILIVISSLYCVPTFCWLFLPELSLAS~O!'IHE'tTf.
AAL~IGLtIWPTDKLIJ1L

LSKFNPSPIPNLfLVSSTLSKVPP'CAIJIFIiLRL511DAPTYLNEFSIKD1FSSI3IAI.GIFS

Q , SLVIEKSPOIAODITTFYTI.QTPIAYVCtOtaNTI~TILfI;SCFL~CpPYFPSIiIi.PQI.
PfKTLLKELAKESPKIIDLSLSDAYPCEIIVTTSSGSLLRLPIKTLDsnOlycerol-3-P Acyleransfvrase D
KLPI(ElOlt LIDIO
CEI1G.IKLWAATYfI;ZM'1'FLVCRLLKLRYRNpVEfND'MNINPKpI,~LFLIIIMVAIZVaI
.
RALDLYK1090CSPVIE$EKOYVYDLRFPNFLLLKAL

IL6YLfWSRFHVRPIBIYEYLFItSRWQNFLNSVRSIPTPQLVPGKiBILRSLGDIBIC>(iE

CPeL0909 1011592 1041966 ASRAWRGESLLLYPSGRLSRTGKFETVNOYSAYYLIJtRVf~CfIWLVRVSCti~f7lffR

rsbV-Sigma Faeeor Rapulaeor YKpN51'PKLGPAFIIFJ~ALLR1~TFFHPKRFVIt IISLI1TRTLLRLtl'OlLal~7lGDiIVIYIAC5LD11VSVPSVpLYLEOFIpKKNLKI11LWFNOCDONLPIEVPYA

N!1'DVSYISSAGIRLLLSNFKLVCSROG101CLCCVICESPTEVKAI71GLOQLILLCOSEQE
CPn_0922 1052266 1053927 au-ACylglyeerophoaphoechanolamine Acyltransterase QFJWRSSLRITRKLAR10100RNRCHNUaO1LRLRPCSTLLEAFLIL:SGIEOCI11GFDDIL
CPn _ GSLSYRELRNAZTAVAIKVSKFSFIHtVG1IK14P11SIGAFIAYFGILL7ILiKTPPIINWaOGL
miaA-tRNA Pyrophosphate Transteraae FLYf4.PFEFEFNTTSSPECDib'C..CPpKLFVKLFKRTIVLLSCP'PGSCKTDVSLAL11PNIDRELRACTKTVEVR
RVLTSQQFTKHLTEVOG
IGLYS

CEIVSVDSKQVYOGKDICTAINSLRARpEIPHNLIDTRNVOEPFHWDFYYEAIOAC~JIKCSVPWLLRIFCVSGVESDD
TAVILFTSGTEKLPKAVPLTNKMI~IFiJOFJICL1IFF0PNT0 LSRNKVPILVGGSGFYFHAFLSGPPKGPAADPGIREQLFaIAEENGVSALYEDLLLImPEDYIQ.AFLPPPHAYGFNSC
GLFPLIfIGVNVI1FASNPLNPKKLVEFIl7DIUfVfFfGSTIVIF

AQTI?KHDKHKTTACLEIIOLTCKXVSDtII~SIDIVPKASREYCGRAWILSPETEFLKNNIDYILICI'AKKQNSCLE
SLALWIGGDALKDTLYECTXKi.OPOIALYOGYGATLCSPVISIT

t7FIRCEAMLOEGLLEEVRGLLNOGIRENPSAFKAICYAEWIEFLDNGFJILEEYE6TKRKFVTICESPR%SEGVCNPI
a?mVLIISKETHIPVSSGECGLIVVAfI'fSVFSGYi~diHENOSFIt ~NSWNYTKXOKTNF1IRYSIFRELPTLGI3SDAIAOKTAKDYLLYSSLGGDQWYL1GOLGHIGPSCDLFLEGRLSRFVK
ICCEMVSLEALESILNFJIFTENONmA

CSLWCCIPGDKVRLCLFT'tLITTIHEVPTDILKSAETSSIYKISYVIIDVlSIPIIGICIIP

CPn_0911 1011079 1042985 DYVSLNALAVSLFG

Fe-S cluster OxldOreducCafe EVTYVLDAN C~ 0927 1057966 1055093 SLLLJ1IFNVNYFNNLCKAISFEEGLfLFVSSPIRLOEAADATRKERYPSNbioF_1-Oxononanoace Synchase_1 PNYTNICKIDCTFCAPYRKPKSPDAYLLSFDEVIlSLLORYVSSGVK1YLL.OGGVHPCIGI' DYLEELVRITVOEFPSINPNFFSaVEIEHACRVSCISIEpGLORLWDACQRTIf'r~GAEIVCKESFL7TSDVIDt~.' IT1DFLCFARSPI'IYCEVSKRFOIHCQOFPHEKLGIRGSRL1NGP

LSERVRKIISPKKFICPr~IfINLHKLAHI~1GFRTTATHtIPGIMCIPEDILIHLO'ILRDAQDSSVTDDLESKIASY
NGAPNAFIVNSGYNAMiCLCNNVSRSTDVL4WDCEVHKSWIIaLSA

SCPGFYSFIPWSYKPGNTALRRNVPQQASIETYYRILAIGRIFLDNFDHVMSWP'GECKSISGOHHTFHHNNLEHLESL
LOCYRISSKGRIFIFVSSVYSPRG'fWPLCOIIAISRICYNA

LGAKALHYGADDFC~uVILDESVHKAT4WSICSSEEEIt3'tIIRSEGFIPVERNI'FYOHISCHLIVDEAHAKCIFCC
OGKGLCFIALCYENFYAVLVFYGKALCfKGASLLTSSCVKYDLfIpFI

TVSSL
SPPLRYSTSLSPNTLTSICTAYDfLASDCEIARKOVFKLKEHFHDCFDSHAPOC11QPIFL

PHTCLEEAISVLETTCIHVCVYAFAKHPFLAVNLNAYNfVDEVNLLAOVKKPYLBKSSHR

CPn_0~12 1044120 10157u0 'MINHEFHLWRELCC'H

CT768 hypothetical protein tNINDNSONSFHTLETEOGSFLNDEWVEEVASTESTEISDATLCFAEKKVAFILNIWRE~Pn_0924 1057301 ALTCSSOGiDLRLFidDLRKQCLPLFNEIEDTAKRAOHWRCYIELTKECRHLKCWDEECSpriAPrimosanal Ptocsan H' FVVGQIDLAITCLEKOTLK!'QECTEDKIFKDREDNFLESpALDKHOAFYKONHTSLLWLSY.RFTAKTKSNGYIESa' TPRLYAEVIVCSNINNVLDYCVPENLEHITKC'fAVThLRODKK
' SFSSKIIDLRKELIHVCNRNRLKSKFFORLSNIwiFIQVFPKRKELIEKVSGTFAEDVOAFVfLKLILPAIS
'AiVIYQIKITtOCKKILPi4;Iw~DSEIVLPQDLLDLLFWISOYYFAPCGK

AKYFiCSDKETLKKTVFFLRKEIKNLAHAAKRLF'lS3HVFAETRLKLSKCWDOLKGKEKE3FNIOPKOHYRWLKVSKA
KTKEILAKLEVLHPSGGAVLKILLOHASPPGLSSIJ~3'A1IV

tROEOGRLRWSM4SKEVROKirIEVSSLLIECNDL31NRKDLECISKKINALDLTHDDVOSPIH3L&KIGILDIVWIAt 7LELGEDLLTFFPPAPKDLHPEOpSJIIDKIFSSLKTIOFN

I:LKKF140pLF0pLREK00AAEHSY0E01.AK0KC'VI(Y.EAARSLAERI1TFSICTCS<f~1ITTHLLF'uITv~S
GKTEIYLIiATSEALKOCK:n"CTLLVPEIALT/Q'fVSLFKARFGKDVGVWN

:iFaAEEWQTLKELL:KHSFLPPPEKISLDNpLNLrILC/CIVNFFEEGLLSSPDSRCKLVFMKLiOSG!SRTWROA:E
GSLRILIGPRSALf'CPKKNLCLIIVDCEHDPAYKGTE3PPCIfIIA

RCVLKORRERRQELKDKLEODKKLLCSSCLDFDR~Y3ALVEEDKRALEELDASILELKFDVAVKPCKIJvNA'I1S'L( :!:ATP.~.LE.a~Y1NALSI:KYV(SRLS3RAAAANPAIITSLININLC

W f00LL hEK~KTY.ILF~OPVLKK IAERLE'JrEOVL
IFFNRRCYHTtNSC?11CKHTUtCPNCOMILT

FHKY Vf/LLCIIIf.N.';:PKDLt'U.~.CPKCt.CTHT(l)YRC3GTEKIEKIIf~IFPpIRTI/LID

I:Pn Oll f 1U4570'1 tU4S74g "D'ITKFYf:;fIETLLRVFA'h:K\GVI.t~fQFILAKE.FNFSAVTU1YII1Jf:0.,~CLYIPDFIUIS

NII fl.tNlt;r tr111Nlhxa Vresenc EpVFOLt'h)VN:R::I:R::IILIt:EILIQ::FLPDIII?1:NSAI4PUGY'AF'L':QEIT4RELCEYP
in I:linrlvnk/ElIBL >t: at tl/71'IH

Hl.x'K'fYRIFJITD::dIIWRRNCf'fAFDLDt'.'fLLK~:If:::xt~FYC'R:LG.t:LF::IK'PLPE'1:IF
FIHLtt~IFMCY.ufKG'IWt:F\I(I1VIIHILKEULEL"tNPLt4t~/TPt'GHFKtKD'fPRYQFLI

-i ltl'FNFKFF!'f:I FIIPS I 1R Y :AW I FVNKKLIINAL.KIr\KU:1Y.VKF'FI
IINUIMR'FF

~'IW ~lnl1 111.IS'HI'1 lU4inf'Iw ':I'n lfaL'i 10'./w!'~ !'n'.:Wt NII ftHNl:iL INNMILNI hfla:l'flt TI'1'1 llyt111CIN!f I '.11 Inl..l 111 n:.IIHI\IhY/EHDL .!G Jl I l/7/'IHt.lll ' VFFWGLP::I'Y'/:al.Tftl.t::::Vf'CDDLYCVAINFI':'..'t.'If:::DFYAINLEKLEErIFADI'IT:
Ctf::fF.t:ll'ft \::I.tIJVTLIftrAI::A::Y.::a1'EKAY611PN
YIMLPMF11S(NIFIIB'rl.t:QLI.DH

VILE.':'.::IrIFIVtIIIAC~Ia:I:::.WIA:x.'YRU):vItIJTIYKKCLTi:DKKAVIL::Y(KKIFICIAt fyl'(TPPPITtIt.::~'t:K'rKl~.:LWKWVILIiCUL::9NAILKEKYPALYG:::W'Ai*IPC~I

AH::IITF::ImI
ILW.I'YL.HLa:EEKTWR('Ct:HLKKH:1YY'fYWNIVF'rvF:FIUIEEVI.FFNRIVKn:f 'JI.f~I'rY::rI.IIIAKTNtF'/tIIIPNFFI.AIAI'ltP/IRYKIP
' f::, ;.t:'ft".Kf.::IiI.KItItLWAILlIftLPFAYTI'Y::::
'ITDYIYJ::LTOtF'J:IFt.t'L

'11 11'1 l'~. 1t1.vi.lq l 11141,H
1"

CPn 097'1 lr1'. . 10711'7s .:~ p'I,p., LOr.ADDn 10'SNSS7 Cf790 MOoctlectcol Procem Tnroratnlrtn OioulIiJrf lenmtriee IIINRW1'IRL'.rf' .Ttla' "~'n.'l'PPSB'E~F~'~"
CLLLTLPCCAARRRASGCIL00tRPIAAANL NM;~LIPfK .~8,~~
i0HKP1IL01XAF~ D91HIi1~IKVR
FKD n ' . lSO
. VIOVIILKLAKTI~V~irG00NGIDahfrlRDiERI~GIY
IIHTOpTY;.TRF
VSIPERTEEIOCCIVSEISEYTCLHVAAVHVIIKCLTQPKatIDEEIEELV3YCOLPSPE
<.
OWP3YAEALENSKOtIHKPI~LFF'fC50WCMhICIRND00ItpSSEFKHFAGVNWIVEVDF
PgD~RIOPEEGROKNOELKAQYKYCOFPELVFIOAtZKOWt~FEP00CAAW5RVRSAL

KLA
OFLt.~SEG

' ~Pn neln W i0\n tD?t_aA

?. ;os~nllf tnsaa7n ' r..t:f:om:. alr:::
- qy .

_ ~ ..
,. ~..rr: .,.""....,_, .", ... .;.\..... .. 1f w.:?
h':\ur~:.=~~.' -:.at. r. :::INP."Ff ....~".
:Yt:..: :wsr:l:
:Nr:
::
' ' .
ERIPFLIU(K'IASIETIW.iNC:EALLLENNL:KCNNPKYNVLLKDDKTFFCIrIIsWISW
. ' .
:.: r:.
: ::: :::.::: ::.:-:a:: .a :
WVLKKTCOFFILPSSIISOSHSKTAVAIRl4I1'FLSNIIfOGLSLKEI

FLISLILFLPt.ALL CILY
IDRGDSSLNDt.AKJtTG
PKVdJIIRTKAITSSORCLItGPYVSAI~CfITLLEVISOWFPLRTCSDREtALRKR1 SGLh~

.
dRlIbCLAPCVCYLTPECYO~'LDKAILFLIOORIEEWKDLLKVIOKASDNLEIWIIAIfYY
SAAORVIITOYDDLWDSLAIKIPtIALPNRWILYSpGNRTILTLLTVItSCKLIGARNFBFFl~7lQ
YPGINSSKGSAKRENLVIISYQACVRYLRDECiGPKANOIIAPGYSLGTStIOMUt SNLLVTN O
ALDRMOCo~DLTSwIWKORGPRSLADVANDICItPIASAII1Q.VG~IHIDSYKPSiaLRCRTLSLIKOAI'U1KOQVE
XFHIONIDAI~LYRt ~ODLL88FILQYYVSQP1'IPKEILTPLPLEFPtLSYVWA6SPPRLRSP~GY~~tELLD

PETFIYNSNNDQELISOGLFERfNGVATPFLELPLYRTSCTRIPIPE11DLLHIlIPLSPHVLAYRNAKAYAATTLPSS
1'LPYODFONILRIISO'YPYRILGY~I7111lOGUlATGVYIVF(~IJ

VDRLMVISNYLDSENPKSOQPD
GFDP(fOYRTFSI08ERTl~I~.ALi.IBVLd.RRFNSLTTJILPOIIWDOCR'I11YNRTKRIIO

'l'tJ'ILTGIOVYI'IAKEKlIMSRCLIiIWCIfGITFPIO<CFSLPPTSMi.QFFOILRDWdtFA

CPtt_0928 1061075 1039881 ISKNRRRIIGRALFEOERIPCICEVRIIKRLt4RFKSMOpVIILSSQisEltaIPGLTIOtDIAV

CHLPS I7 kps P~eeln hoalolo~3 ~tRRD

RRKDFAFTLIl~SDILSGIFSNPNPV5YFS8TRAKOLSDFSKKHPILTICIYI'IIVICILGROt FKLLICLIIPPLGIYWLCOLVCSLALFPRSSMLYSVLIIfCFRKYRLWBIODYWIDfL.DP
SFIfDPAVSESKRITI00DHLTIInLAINFSTARPKItwLLISLGSCOFLEDMIGLRDBLFL~PeI..0911 SWKELAKLLGANILIYNYPGVIt551G%tliLtNLATAIBiLL:Alat.0~I0GPGANEIITYGrq-~ Nimcch Reoas!

YSt.00WQSAALOKNPFTNSETSWVAVKDRAPNSLPAAAtt~FFGPIGKLIAViJ1RW10~A

EKNSR&LPGPEILVYSADAFRPSEII~DTALLPEITLAWIIKRTPFMSKKFIGEVNfi.H

CPIf_0929 1062701 1061186 ~CMLPS 13 kDa Drocein homolop_1 EKllIiIPIHGSNAFVEDILNSNPSPOATYFSSTRAQKLNEFI~WPVLTRZ1~SVIIRIFRV
LIGLIILPLGIYWL.cOTLICTNSILPSIWLLKIFKItOPNTRTL1LTNYLHU4DY88tD'1HVA
SNARVPILQDNVLIt7fLEICLSOAPTNRWIG.ISfGSDCSLE~IAC%LIFDSfIORFAK:.IG
ANILVYNYPGVNS51GSSSLKDLASAIWICTRYL%DRmGPGAIOCIITYI~'SLGCL.ZDAE
ALRDOKIVAD87DT1tiIAVKDRCPLFISPEGF1CSCRRIGKLVMLPGWC1'RAVOeSODLPC
LEIFLYPIDSLRRSTVRpNKIi.APELTi.ANAIICrSPINONREFIEVRLSSDIDPIDSRTR
VAL71TPILKldS
CPn_0970 1062851 1067370 No robust honoloq Present in Oenebetlk/ElOILLCP1L0912 I07S955 107775 as of 11/7/98 NlOISELAPCSTGLOMVPNTOVHtiALDTRRVILTIAACLSLI11GIVLVGIaAAAILPSLFGdnaG/PSil1-ONA
Psiaase R'IAHYTEESLZ>cIWtSIDIIIDVLREHINLBRSGATYKACCPFIRCCi'PSFIVN

VIG(r?tILILFSSIALIYLYXK?REIIOpIALEPLPEMISKDOSIIDIVETRDYASLEKRAT.
YRCFGCGRNGDAIGFi~IOItLGYSFTEAILVLSIOCFwOLVI.OPK08GYTlP00NC
A
A

FAYTIfrNYYOCSMV!'IfREIPRFr~CSYLiIL.RKDlIaROALE'P
G
N
~IZTJStABTFFRYCLYNLPEARNAIAYLYMRGPSPD1'I~tFNLGIRiP~08GFi4ilwE

~PtL0931 1061078 1065718 DlRId060LM'AGFFG180IFLFARRIIFPVIIDAiOttTIGPSARIQC.ENDp00RYVM'PET
MIE

PIFIaSRILtof.NIBRRRIAi~IVILVOD0110CL0lIIDSGPNLZYAIIpQrAPTJA

lyeS-LYSy7 eRNA Synehecaee IDFRVLQ'1KSDIYTNILCEPM'APAEYLDNEDFLY~UGQGSE<GVVLYPYETPGVFS
IALLLOS(JDYL'1'li.It001$$YPKPGPRERALLVEGIROIwbaSPILVYEIG.I~Ir~SL

CEDI10t1'PASOEIGN8EAANSRS'1'PRVRFI~ILWRAIDKNiIFG0ILa0~fOTI0~1RIIRtPEDNVLBLAHPOP
'1'AEPpHIPIROKVPKINPNIVhEiDILRf7D.ICGfail'ICILY'1'J10 ElTSVNGLSEDItEITPIKFIEEKLDLCDIIGIDCYi.FFTNSGELTVLVLTVTIl.CESIi.6FYIVPEDIIINPI.I
AFMISYYEKIfRKNVPFDEACOVL8DS0ILpLLI'IOtRIXIfALD
aAHCiFLCVE1'PILO
I

f TIfIRfL0t0lADRRfdIEOCRPL$LHONIOD~EILEDYWLRRDRTIITLL0P68ELIP
LPiROfAGLSI7KEVRYRKRWLDLISSREVSDIFVIOtSYIIIC.IRNY
i'I~ALNSENFLRISLEIALIGfILVOGAPRIYELGRVFRNt~ItR~'!Q1 T

NIYOGiIFJIKPI
T CPeL0913 1077972 1078238 PCTMIGYMYIfsYIfEVHVFVfM.VFJILVR7IVl~ffSLVYSY18f11DPOBVDIKAIWIR
I

LI 1 hypoclucieal pmcein tfll8~r8IA1YAGIQVDVfIBDOKLEEILKKKT'lFpg'1'AIrATASR~.IAALfDELVSfRC1T94 N

APNNITDtIWEITPLCxTLR5GD,1AFVE<eFESTCLGKELCNAYSI'3i~PIROR1~LE00.
' PPIBfSPRFLLPFLSVILCaia.LSSPR5RJ1ISVTESIGtISAVKTLVC.BBIDIREfita'.1GY

fNMSIRDVLYFPIIl9iR GVGASSILItONQfOOWLCIESLLAQNlVM
TIOIL.LPDSECNPIDEEFLE/\LCOC?IPPAOGFGICVDRLVItIL
FDAGfIN

CPtf_0932 1067160 1065721 CPeL_0941 1078503 1078997 ban ~

V~KSS5D~11IMAF;NIyECLYFYNCAS rhQ>O~F9PNtFt'PVRLYTCGPTVYOIfAHIGNFATYVFED~
FEAAYIipA
VRAELOP$ ~

VFfGYSVTfIVIQIITDVEDRTIAG11SKRNIPLOEYTOPY'fFAFFEDt.CI'LHIARAI~PGILVFTSGIITPBFII
DLTNGSPSLSTPIAKCFfNWLCPOLISPLDI11110DPV
ILKR:T

.
ILYIGS!'LQ~PEVF~tVSGPRLCYILIDL00CAQC0AVLPLLTIGt DFYPNATtIYIPQNIOAITKLLEOGIAYICODASVYFSLNRFPHYGKLSICt.~.SSLROCSR.

ISADEYOKDiPSDIVLWNAYNPERDGVIYSiCSPPfRCGRPGP81LDC8INAMELIGDSLDIN

CPef ACCVtxtIFPNNENEIAOSERLSGKPFARYWIJtSPitLLIDGRG1SKSLONPiRr i FICptVAYIRiASNYRTOLN!'TECALL1CRNALRRLItDFVSRLEGVDLPGESPLPRTLDSn CTY95 hypocheeical Drorv SSOFIEAFSRALANDLNVSTGFASLFDFVNEINTLIDOGNFSKADSLYILLTLKKVD'HII.8IFKNRILPSYFCHHFD
DLRPHYINtIALSLLSLi~IIFPIFCEESRPGSEDCNSMfQLIIIC

GVLPLTfSVCIPETVNQLVaEAEEARKTKNWANAI7l'LRDEILAAGFLVEDSKSGPRVKPL50171'00CLY1~1(RI
EGKPLVTWILNSCDOCQACfIGLSETCEIYLSVLBGSI

FSELifNIWLVP9GVNPLIYPPI~PILAEIVKFKELFKDESFPfGGSI
IWCVTP1~PC

CPn_0973 1067532 1068578 . DIIEVSPVSLTV6EEETLPSEQTTEVESTSEtQSEDPAIA

predicted disulfide bond isoaerase ' K CPri0916 1082816 1079715 AEL

C t)ly0-GlYCYI cRHA Synchecase PVILI4NIKRCSLKOLKVLATLLLSLSLPTLEJIA~IRDSOSIVWHLOYOEAL.OKS
GJ
L
' t GECOKICKCYTLESFVSEHPLTLOSNIATILRFWSEOGCVIHpCYDLEVCACTFNPATFLR
JEVEYLKHRPQVCiIRO~
PLLVIFSCSOWNGPGMKIRKEVGt=SPEFIKRVOCKFtIC
L

KSKPKINELPCNILL3NEEREIYRiGSFCNETCSM.CDSLCNIVEBDSLLRRAFPPDfISALDPEPYKAAYVEPSRRPO
DDRYCVItPNRIAHYi~LOVILKPVPQIFLSLYTESGRAIGL

SLSELORYYRL11EELSHKEFLKtIALEIGVRSDDYFFL:aEKFRLLVEVCKl07SEECORIKKDLRDNOIRFINl7DN
CIPTICAWGLGWt1MI11GNEITOLTYFOAIGSKPLDTISGBIZYGI

RLLNKDPKNEK01'HFTVALIEF9ELAKltSPAOVAQDASOVIAPLESYISOFG000KD1~R.WERIJ1NYIAKKISIY
DV WID1'LTYGOITOASEKAWSEYNFDYANfAIIFKNPF~IACGL

R'J~tIAOFY LDSDQWtIMAi.0lf AEVAFEAAPNEVRSNISRSLEYIRNOSRTLIDIGLSVPAYDfYIKASNAPNILDARCTISV1'ERTRYIMIRpLTRLV1 1DSWEwRAS

0971 l Oo8918 1068526 lt~lYPLtw"LSSTSEPICETSfSWPMISSTEDLLLEICSEELPATPVPIGIOpLESLiIRO'Vt.
CPn _ TDtMIW(TJrLEVIGSPRRLALLVKNVAPEWOKAFEKKGPNLTSLFBPOCDVBPpCpOFF
:npA-Ribonuclease P Protein Canponenc 'IFVNPLTLPKpSRVLKRKOFLYITRSGFCCACSOATFf'/VPSRHPCTGRMGI'CVSK!(lCKASOGVDISRYQDL8R
HJ1ST.AIR7YNCSEYLFLLNPEIRLRTADIIlIpEt.PLLI0RT8IFPK

AkEANSFttttWItLYFRHVRNOLPNCQIWFPKCHKORFVFSKLLODFItJOIPOGGHRLGKYII~WONSOVEYARPIR
WLVALYGEHILPITtGTIIASRNSF~11RDLDPRKI$ISSP00YY

TKATTOCIx:TPItSEKC\TAPR
CTLROACVVVSOK6RRNIIEOGLRANSSOTISAIPLPRLIGTPLSENPFVSCOOP$Op PCALPK6LLIAENVNNOKYFP'fNETSSGJ1ISNFFIWCDNSPNOtfIIEC~tALTPRLTD

rPn :EFLFKODWIPLTtFIEKLKSVTYFEALGSLYDKVERLKANORVFS1TS5LAASEDC.DI
0735 1Dc9100 1068957 ' _ ~OKLSTIGT
r171-L7A Ribofloafal Protein AIOYCKADLVSAWNEFPELOGINCEYILKHANLPTASAVAVtIEHLRNI
' EtIIVKRTYOPSKRKRRNSVGFRTRNATRNGRKLLNRFFRfIGRNSLVDLGLDRLiIDH
LL~rLLDRLDNLLJUCFIIGLfCFTSSNDPYALRROSLEVLTLV
:ASRLPIDLAu' FPSTtEEKVWDKSKTINEtLEFIWGRLKTFMGSLEFRKDEIAAVLIDSATKNPtEILt)'fA

!f! Jr. l i)r: )310 1069170 EALOLLKEENTEKLAV iTfTNNRLKKI
L.i.~.LKL'oM1':~SP
r:lr: I EVLCDRESNFKI,WLDAP'PGF
' _ IidE
r:7t.-L:'. Rrtxfurnul Froceln PKET.~J1HAFLEYFL.:LApL::NDiODFLfRVIIIMIDDGAIRNLR15LLLTANDKFSIt~

'11J1KV.~.:;::VKAGP:aM:DKLVRRKGRLYVLNKKDPNPY.~.VAV
iRO~PARKK

u':: I IJ.:'t.lH7 IJr:'ti'rK ~Rr_tlnA7 II)K IA f f li)NAO'~'I
am _ pttsA r:lYVarrcl f 1' PM>trptf.ttyh/ltc.m:ltKC.ncr.
r::lA::lA ItrW rt.vml f'cocwi"

~/KRNAYY.::::V\kF
V:RRRLVE.WFKKR::DLRXIVKC:..:'!.~.EEEIII7JARt::LNKIIKROTSP:.:AN.LMJYCTF::RLFITt' IhffIL'ILYr:YWFra'PCVVI.I'Y'It.LAtd \L::EI.'fDJII0.A'VA

'f'fIJItW''LIa'a<I'1a:1'td<KIAI::RCt:Pft~MA:2'1t)Efia:JIKAaIRKF.~/JIt'~:KLLC11' HAIxIt1'ItC::CYL'PF1~~C'INNLC'LLL'/FlF( IRIX:VI::Tt.RTVt:AF

1'r:fMIMNAat:KL.KA t 11r :Y::F'YI.I
1.1:/M 1 1'11:.1 t :l..l: X/tY:LL:I
h1\::YN:: f ! AVY::IA::

wfn U.::, 111..111'.. ltlo'rrll., iIEIfFWMIKNh'I'RtNAKTKI\:hY.tDll'_:YU

..~~/i1R t.ylo.rlrI i.~.:l 1'r..r.im -IL:.uMr rr.'fl ir~rn i.l.' 1'rriPl.f!'JIiCI' rIN1:971'LIt'IT)1.1'1::ILLIvYVIIJ7f:C::AYIADKKYFtf/It~.IFF'M:Ah'F:FIr:LWLLLL':h r_If'nAk I111c'.As!t IW AIVA
/

I~.:ItktlAl.h:KfAlt.('YfMI::I)1.FDDfJCK::LY?IDEIP:::~:LWEI411YFF.1.'WFYflIKDRFN
V'IUIA r:lYt'rrNar ::Yldtl.n:.
' ' ' :I' I ::Fh:h:l.l'l'LLYr :K'fYf'h:l't'L I ::KI ~.
WNWKKI 7MKUW,tk VKL'IGC:IIJt~ItL.Kh'.A::K.:~.~ I::R
':F'71N. I'ArIAVEhTI' 1 VKVI S
:1 t :IX YA::L::K 1:1 J1 KVNIIV
h.'lLf.('l11' 1 F::FYYEFLCKWA::AL:Y::1'1t:1:1'l:t!
1'fId'KaJII:LF:.TC:IY::h7JlIVVkN::AYAAAAAII

JEADIAUIYIILNIIIOM:LLJII:LLKHPIJ~IPVH_CPn 0?~0 I' '. l:l.'.':'::
"INIIPCYIlGYC3L'OLLAASOIp '!:

, tT909 hYpocttetscal pw.r~sn .~;few'IIYOLPRDI'OT3VLJIKCALYCGDYI'I1'VrL:.iOEttNDY.iDIfALNDAILARNSVF:iL::Ltl3Y
LrNPraKAI.W'w~F L tt.
OISSDKFPLI E
3EP0VLFTKK6CIRAVLYFxL
' . EELWSPLEVGR'.fGA~i'v:.'oOWL
:NtaDEDVWNPKTOPALAVOYDA:iLL
I
' ~NEVLtNCFtALt.ODCLASSPNIRL DCFFCOCSL~:PERKNtLKfLEflRKKNCG~PFCYL
-Vt3RIYEERCPEFNKEIILMAHGISYAFLLICi VCSGVFDCRPLIROEw::E:
' ' .F .
~
LDFNDPWILTYAAAGHICfPSNILEAG~GLTCLIAMtYClVPLVRKTOGLADTVI
.PT

TFFDLNIIFNEFRAHL.iNAVTYYADEPDVWI1~LIESCHLIIJIiGLDAHAKNYVNi.YOSLLS
'IT
~-~
Cpn_09~t 109710.; IO

.t.n n.~.tv lUR5Aa7 lOnF181 rl t:-Li2 Ratna~ln.~: Pmresh ...
.
.
._. .

. .
, . -.KAt",r :: ;
.. . , , . ..
.. . . .. . ... ...:.;
' ' ' . :..,.... ... ;.s.,:.,slt ., ~;; 'A.:L.. 'dl:T'.....i..: .;:
:LalIt11H1:r:~-::n~!~:r~':vr:;~ :.,.
".': -:I1:,. .... .._ KKFI::NLFSvALSSIYFa'LSYEGR L IKALVI(DIOYO
ITPYOV INLDfCCLVEDRPIXtI'fI 1D9730t 1099:75 PIIKINAVDCICVIC~SLROVIRAVRVMCKPKDIVPFLELDNRSVGLSOTRKLSDIKIFCPI1~0962 . plsK-PA/PhosphollPid SyntMats Protein ~yA
IL3L1111C1r0ICIDIlCWHSPLWWaVLVDVLKSO$SffPFAl1'LFAiClIRI4~tOCAf ~I~

0950 106170 1017037 80LP00CFPKIfSAEtiIVANm8PIw1A1RKK38 CPn V
~

_ TLARAKIPLFPAVSRPALLVLyPTIEUGIU1VILDYCANISVKPiDNOFAIAOGAYROL
pch-PePtadyl CRNA Mydrolase LC

PSLCOlahAKLIVAICNPRNCYANTRIINNGFLLADRLVE~.Of;PPFKPLSKCNAiJfl'LVCSDSKIPTIGLtIfIG
SC~IKGTC7WRCtfIWLRE?liG6AFLQ(ItSGAVPDDiIADIWTt~F

SSGPLVFIRIT'FVNLSGIU1WLAKKYFNVALStIILVLA~SFGKiJIt.CFNOQ>IOGIITCItIFLRTAICVTCPLO
RILGDKL6ADIORRLDYTFYPDSVIIL~GLAKLVIKCIIGKA~fS

NtiLKSfTASIGSNEYWOLRFGVra'RPLCL7I:VELSNFVLCKF8ICZ~.OLGSIFVF~ISTLf'tLFIGILGSINta QARt.CKRILS~.I

EadCSItF
CPn_0963 109371 1103231 0951 107113 107157 pm~ZlPutatlw Oucet Msebrane Protein CPt>

_ TPLRFKVAHW1KKTVRSYRSSFSHSVNAfTSACIAFCiINSLNSSF.LeLGVTNI0FS~5 rs6-S6 Ribosaeai Protein .EFI1~ICKKENQLYEGAYVPSVTLSEGRRKALDKVISGITNYCGEIHKIHDQCRIOfLAYTIANVBCAQTSVLKGSDP
VNPSQKESEIfVLYIpVPL

I~AREGYYYFIYFSVSPOAI JISLPE
TlVPCIDQKLVNStxOTIINFSOP11QEPDTSNAVSPJTISSCEKDipKpLiTCDPCKCIGLK

EvSSDLPKSPETAVAAISEDLEISENISARDPLOGt.AFFYLO?fSSQSISEKDSSF17QIIr ct~o9si 1oe711;9 loe7n3 sasaANSCLCFarsrIAVKSOAAVrsaRDIVr~avKCLSFISCESt,ma:s~l~nrrH

rslA-S18 Ribosanrl Prouin ~T~'fG'u'~~~~T~~'~I~A~

CENTNKPVHl6LEttRRKRFNKIICPFVSAGWK1'IDYIfDVCfLfitFITEPGKVLPRRI1'L11SS

RFQLYLSQJ1IKRARNIGLLPFVaED

CPtl_0953 10~7717 10BA3~8 r19-L9 Ribosaml Protein 1TARFGYVRIIYLIPIDDUIVIAG714TLRLOAIa.KW
RLIOAAADItADSERIApALI~IVLEFQVRVDPt7~i~DIYC.RVTIHDIIAF~IAiDDIIFLVRIC~t FPHANY71IKNLGKKNIPL IQ.KCCVTATLLVEV'tS~IE'YYMa0GK0'tCENOEC
CPI~0954 108~359 10~8708 ychB-Prttdleted Kinase GRKVCY%DLNpYfSPAKLM.P'LKIWGRRPI7NF1IG.TTLYpAfDFf'sD1'tSLSLSS
NVNELLSPSNLIWKSLEIFRRETQINOPVSWNLHKSIPL09GLOOGSSNAATALYAt~iH
F01'HIPITTLQt.WAREfCSDVPFFFL0E0H
CPIt'0955 lOAAtilZ 10A9175 I Erawe-shitc Wich 09561 LK06IFAICITI110RNELtH'8TNVi~SCLGVV1~CQNICCFtDlKIIItLlGYJiLCLtYtOWE
RAFPliP7ISYlMIATLGSRNRIfRCSFFFSSCfALGHD~i~4.fSIKLQi'111K7DCYVLYLDIIQ
Y11GILAGPWLIKCiIfVYQit~T1'D
GfptEfTAY05LLPQDYSiGIHC~IACFYGCNDLdKSVFAIRTLL10~IKElaC.CRNWBPFCS11V
YOTLOIS7GSWT~.PIACt8IDYRIfIHJPRRFIfIIIVS117VPlVtAiYHIIE~PtIiCO
tlISGSGJ1TLFVCYLEELEODSIIVSS0IIL9LIKO~fpCIPVSRLYAEPNNYSLKOS'fYIOiBP
GKEVRTFQRTRIENVAIPFatALFJIAYSRDSRAEHiSVpLAYVFDV><IIIA7PVCLITL1~J1 LOCFpPpI AYSIIKCYCVDI IlIYDII~IIIIIF
CPtI_0956 1oA9515 1090909 CPn_0961 110~~11 1103301 CTA05 hypothaeical psouin No robust homolop present in Oenebank(CIOI. as of 11/7/9!
WWP$NILPPYSYSLKIGAAVLFPCSILNtFLTP11LY1u0SYODtKLVFPl7CWIDlYJIRt.
OSILCSIIKYIYLIIOiSKNMLBNPISLFSPAELIAICYIrt.IPKISPIYIRIIIiLIIl.1~11 SELTRILSRVSIVFFLWAVPL!'fWFLYTCGYRISL01YFNSPNYOTAVFI11VILILT~~P
CQTRL'fNVAOVGNPSSLI~SIIXIIJ~CSGGPLCWYfEIOILAFITTiVtlIIfi.IHC.
IVYFACLVLSSIAIQGKTSPKSWWWfLlIIAPPLLSCLL1TC1GANIIG11?LLJaWIYV!'8P
fVAOLRLFIIPLPPKKIVmt.SCPTTECIIiEVfOPFIF11L0ALLFC~LiIfF!<IV~fIIC
SRRFAYATNGt3.FSNISICCLTSYVSSRALTLIPPALI6iENSFFLSNFAMfAIVATLIST
KApI.PIIIFGHRi.VAISPOCSOFJII/tRIPtLKKVLISLaVLTPAi1K116nY1~i TIYYFIFRKEFKKFPDIPSD>mPSVCKVPWWIfCVNIIlVGSIILSPSTPLFlICAfdi.FY
OItIANl000lTFPILIIQ.LIGL1CKSSLPICfPSfKtKI~AALFIAS/!IA>WiT
iGPnXFTIEYQOPINLSKVCYVCLrYIIGLWFGOI.pDIWVt3It3100LSDFGYIfIYSIrfLS
R$~RLYSIANDf~LLIW90CFtDCREf~ISIODG~A6EYRFAA0p11DA1tY11G1ICaVi.
IFLDNALVNYLVtOJISVATDCYNYLWXK'NIUILiGLTLVSNIPNIVGYLILRSAFPSSTi RNCSMKIAWNVINT1IKPTf(OK'taCLVTENLQDTIG11LTLRQTNiI'114mtCW10LlML
HHONLFLCALGPSI IShfIVPSiLLIQ.1VPEFLYCFFR
PLIIKYLNS~.VNSVFKSHOIUIDPCTItALIREPALDILY11SLRLPpTSJIlEfM5Tti31 SEEFLKRIFOtLPAV
CP1~096e 110055 L10d719 pcnA_:-POIyA Polyeerase LLITtIINCENNILr<:RSiLELLKKKSNITLTFTIYSVSNHNfKLKDFSPNALSVIK?LRK

AGYIAYIVCC.~IRDLLLMl'PKDFDI :'l'SAKPECIKAIFIO~ILVCIIRFRf.ANIR~Opt CPn_095A 109803 1093793 ' IEVSTFRSCSTOEOVLITKDNLW>1'PEEtIVLRRDPTINGL!'YDPENCCIIDYTr7D11lKKJt pls8-.lycerol-3-P AcyltransEerase NRYLRTICDPFTRFKODFVRNLRLLKILSRSPFTVE1'OT'OEALIJICIt06LIK850ARVFE

LYRAIYHOFSRYLRYAFONpYLPEPLY0KFS11FliQNYIDAATKKAAADOAEVLCLOWhfVELIKHW.SCRAIO~PPO
LLIl7JfILLEILFPYNDKAPRWPALCCpTATYLKALDDICILKItE

TIDLYtIPFIFPPYHKKIRAPIDLFRLSIDFFSLVIDDIfNSALtIrLItRLKEIBEYIARCDAEYDRIWUMIFLPPLV
NPNVRYKHOKHPYLSLT~VF?(IKNFLCQFFAOSFTSCSKKtIf' IIWLUWHOTEGDPOT1IYYAiGK'CI1PCLJIENHIFVACDRViSDPWIPPS11CCOLL.CIYSILTI1LILQNDYRLT
PLIPIKKALPFNKKLLHHTRFLPJIL:LLCIRSLVYPKLOKVYVJWl KRNIATPPELREEKLLHNpKSMOILIITLLNEGCKPIYYAPAOGRtHIKNAEGRLYPSEFSPRHHOTLKCKKD3NSOK

ESLEVFRLLAKA.,~IIQITHFYPFALKTYDILPPPPKIEIIAICEORAIFFAPVFFNFGACLF

FDALC.~.KEELtFK:DKHAORTLRAEKVFSIVKNLYCELCPn-0967 IIOM171 L1U~8>r:

mrnA/pps-Plx~yhnt l uc-cwwt.>,.~..~

''Ur_U''.n 1117AJ7e lU.s179~
PTAYKFAFIC.k'R::EKIRRTCtDFRRHHp~h:VkYLlY:fGIYRCR/W!'EPlfIYE1'fVLLCK

.:.stE-IUi.tI Fshnrnc 1>rocein AVARVLRl7:Ra:KHNV11',:KUTRI:X:YHFFNALIAP,ItJ::M;ICEtVt(:PLITFGVAIITR

A(X:Wa.TRKVNENEILIi~IIE:iKEIRYAIILKNCf~LFLLTtERKKVROt.KCNLYRGRVTHTAYRADR::IHI:
i4:11NM'RI?ltal!TF::LEI:FKI::fi/Gl~P1E711V:IPJIDPtIPLPCOINVOK

LIlNfO::AFtNLDERfaN:FIHL~.DILF3~I;:KKFfONFOHDVDALPEEA.iFJIPt.L.iSEEAPIENKRVIOII
IK:N\'VLF(NK\TFI'Ke:NTIJII:I.IIIVLI/:NK:A::YIIVAI~.T/FERLI1AEVI~OCE

F!'1.1;LL::PVt.VWVKEILU::KVARLTSDII::Lfc:RYL'dLLPIIf.PIIRGV.~.RKIEDPIMRCDLP'l'C
tNINEIk\:ALFIth'L'KAVIL7KlI11t4:111.U:DWPITMVDF7Ni1L110s7111ILaICA

YyLIN::!'Fl~tXnIY:LIt.'R'PA:TCTA:,TFr\LINPJWDLLLTWII'I'ILEKFYSTEGPI:LLY~ET;:DLK
KR::ALMINItWI'fllrlN!'rtJIJI'(Lf7t:L:IIrVFT.':1'P'.YJRIIVIJMHLYJIEVTIl14E0 Inll.l!Yl'/I.1't:IIH!ILYKNLLIDDYATYQIG:KIMLKKY:PDA::IKIB'fYRDSIPNFERPNLE:il:IOA
IFLDYM"Aa:I:II::TIlY'/LRINIF-':I_:HI::W.TAtIVK::I'pl't.INIIAVRFJIIM.6T

YliILYATIeNKIWL:::~h:1'LFFnI(TEJ1l61TII7VNS~1R:STQL.C'~VEETLVptIILEMCEIAIPLIERT
IJtt~lltY:Al~:1':::1<fl.LInY:y71'FJJInI'MVPINIrKlltnlll'IrtVM.AWIMF3.I:

L:HFI*1M't:f.VILDFIDHK::RKNDRRVLERLKE11N!!YDAARCTLG:HSEF.LVtMRpR'Ir'"lRE
I
~~

NHF::! t4JftJ"tU: Mtt::a:NA I I vl O
KTPFwIIV/ t E T CRDl.YIfV IMIK6113HLCLWHPEIASYN(IC

Kyl7lLrillh?IINWKOLK~VCLQINT::D:iVill.lRIYOFFr'LITGE.:fOLCI91 ILIbN IIILnHNY

.IIIIC:.:14n:.b..MW t.,.;:t",r.,.:.
.. 1 ~IUlrt.yr.W :ittr.t::.

lI7 CPtL0957 1093~1? 10909E3 ide/ptr-Insulinase Eaeily/Protease III
KIYTRNCKNfWKLt.CPILICTSLSITSCEOQFkWPNOCPIQVSTPAAAOpICfEKI ICSN
GLPLLIISDPNLPTSGAALLVK'1~INADPEEYPGNAHFTEHCVFLGNGfYPEVSGFPCFL

WO 00/27994 PCTNS99l26923 Ie ~

VEOCLFIRKTVCRVOQ.SNL.F
RMDGIF!:YW'!J~(,YSfVLLGLAKLCYRGYD,iNW4J
D CPn !)'179 11:1:71 l' ~
a / ' 1 httA-DD :ieriM/~d~'ltlYWllai' p AWIOtGfIflIFKCLRRii.TAOCI ~ G':... !I.
-rERitITA:IVICItfIIiIATIICVP'IiINANPNVDEGR
ES

iP~~DIO.r~EIIVQLFSLYYOF.SQOLYFSFCQTIJiGLRCiSVACALtHKDNPIYfILCA80caOIIITKOLRS1~I
DI1VL~'LLRLPr~Af'SKKESRYStCP~00b"'.
VR .SC~fS~SkV~~fK

O .
PLIIGIL.7cEE'fFIASDSMFFKYTRHSpALASCiFAIVSOGKEPEVYIE.ILKKIHKDATPAWYIE.iFPKSOAVTH
PSpGRIK.PYCNPFOYFNCEFfNIIFFGLPSpRDIPOSKMVIt SEDASOK3GYCYYNLKEIYOOP6YLECLIOKIW~falIILSECLiDVPIKSFKiITf IT~.

.
~LVSPOCIIVTNNIIWED1GKIHVTLHOGOKYPATIIw.DPIITDIrIYIKIKS~.PY
VACf"'w~YHAGYLAKYIIF$LVSTPVIIICVASGFRYRRPYIGKCTLTrILf.iQSGCfAO?LI1 ALKiLRRRNIAYLLCICNVPGSAIAL.CVDNCLFGFaGVEIGVATTKAITSQLLLLVFIGLLSIr?ISOHLKVCDWAIA
IGNPFI:LOAT.:.ICYISJ1KCRNQLHIAD!'EDFIC:~AAINIGN

Kf.IWVIK7ALTHAE0t:3f~f:LQ.SLPptGpKLLANE;LHSWAOPYSYEDKfLFLCRRI~IYP:,rI:PLWLDrfjt !tGVHfAiV:;rStrlfICIf:FAiP.ftaIAltRtIDDLIRD!:pVIRCFt~Y':.

..I; . . ..I, ...,:" . Y' ::.F"~Cln!.'!:YM:..:iiLrLS'."!Y i.i:f~.' ,y,. ..
L ~tif. .. .y:. v. :: ~\'f.'i. :..'MFRIiif:.:
...F. ..I Y.Li: ~i:.n . ..d 1i .. hl h 1:.:.:.' -~
i""
' ' 1;.~~
...:Nf:tA::l.l:.':!:v::v:l:n .. . .... .. - .. . . . .
. ........n ...... "".;t?IA..,.. '.!
..\.47AE!~J' ~:Sf ~!'SYF:.:tII..
. :
at:::'."PI':!.%L':PI'a':' PRNLAKSVTVE TKGILL ZSVEPCSVnIASS:aIAPGOLLLAVNRQINS~
IECt.NRTLIt05Np1ENItilIfl9OCD

VfRFfALKPEE

CPn_0969 1111101 1111999 CPn cyrP_1-Tyrosine Transpore_1 _ VYVMSNKVLGCSLLIAGSI1IGAGVLAVPVLTAKGGFFPATFLYIVSWLPSMASGLCLLEV'si~ilsricy co Saecharamyees serevssiae hypocnecsul 53.9KD

MfstlitESKNWNa.SIIAFSILCNVGKISICLVYLfLFYSLLIAYFCfxIGHILCRVIMONPtocvin LGIrtiIRHLGPIGFAfI~CPIINIIOfKVIDYQ~1R!lMIGLTV111GIlCAtGFLItIOPSFLFVMJItIAKKNAKP
WLIFFSTKDKt3YCDIIf'NNCSGKPt4lt.DSKHFDINSANFLi~JNt vRS~ILTTf401FPVFFLAIGfpSIIPTLYYYlK7RKYGDVKKAILIG?LIPLVLYVLtiiWFISFPSISADSDHI40C
1I~ICAHFLVDHVNKfFDVPGWITPGHPPIIYASYKSCDPLSPfL

YiGIIVSLPILS0711CICCYTAVtALKOJWllIWAFYIAGiLIGlFIIt.VSBlVGVALGV!loFLMLY11IY090PA
OLSOCIi~I~!'ILREC~NLY

AOGt.KWNKXSNPFSIFFLTFIIPL11WAVCYPtIVLTCLKYIIGCFG71AVIICVFPTLIVWKPLNIIWLICGEBISG
SLiILPIWLGUtKEALW1DYLLIVDGuFL40tNPYVSIGitRGIVfM

CRYCKQNlIR00pLVFGGItFJIL!'I10TLLfVIHWSfYHELKI8LEG~IKD018f'.YttX'.IAYNMRALSiILSS
LHHPONSIAIEGFYDDLiIi.PSDSORPD

LPKSDTLREC~FRPOGYE7l?YSPEESALR%YEINGISCicrYTGPGFXTVIPYMTA

CPn_0970 1117153 1111618 Yt.9CRLVPNpDP~tAAHOVIIIIILROQVPBSLKFSYEILPGGSIMiRSWILpfVKVIt~I

cyrP_3-Tyrosine Transport_3 YSDLYNBiCLRLVMPATIPIGPLLGtAAOTSPfICGTSYLSDDIHAAaNFEIIDOLItIOCF

VYVMSNKVLOGSLLZAGS71IGi1CViJIVfHLTAKOGFFPATFLYIVSiiLFSMASCICLLIYLSICOLLDKLPKIKE
, M'Iwl9LiSKNPVII~.&IHIESII~It~ICISICLVYLTLIYSLLIAYIC~NILCRVP!>CQN

IGISWIRNLGPLGFAILIGpIIMACTKVIOYCNRFPMItGLTVAIGIFCALGFLKIOPSFLCPn_0981 1137019 VRSyWL.ITINiIFpVPtGFGFQSIIpThYYYImKKVCDVIOtAILIG?LIpLVLYVLWiWZinc Mecalloprocese tinsulansse Eanilyt VLG71VSLPIL~AICIfiL~fTAVFALKQJWR~IAfIffAGC.FGFFJ1WSSFNGVAI41!tm!'LVTLSMtAGDTYRN
FIIKSCJtDL.PEICSKLGFJIiNKPIGASIIOIIVNNDEIfIVIT~tICFIffC

ADCLKWlIKKSNPFSIFFLTFIIpWiAVCYPLIVLTCLKYlIOGlGA7IVI
IGVFpTLIVWK

CRYGKONHREKpLVP'C~71LPLMFLLIVIMWSIYNEI.

CPItr.0971 11f1697 1115~15 yecA-Transpore Penlease DGSNGLYDRDYIQDSRVOGTtASRVYfR!IMTAGLIVTSCVALGLYPSGLYRSLFSPIiYMMC
PATLGVSFPINSKIOTIS1!SAVCGLFLLY
ALYhGLAAVYGJ1F'fKSI7LTKISKIIfl~FALIGLLLVTLVFAWSIOVSNPLIYLLICYf~GL
VIFYCLTAiIGADAIRRISSTICta~BJfLSYKISt*ffJlLIaIYCNVIINfIiYLLQIFSSSaNR
D
CPIr_0973 1116)77 11154)0 ttsY-Cell Division Procein FCSY

RCIIIHSLLFPSYLVSFLti.OLTLLWIFKPPRIDG.OSLFK81YISLDLICDALSLFYfJIDF

GTELTEELCAALRRTKKADiLSTIRDLITVLWSLOGLPSOA:90SSQTRPIVSLI.L?fNG

PGGfliL7lAI LR~t!LTLDI~FPIVPAI
SCKTn'1 YKGtSiSVIQ VA
WI
Wn _ _ _ _ Q

t~01I CPItv0913 1111315 1139963 E0VRVFNDWpLSGLIF?RYDGSAIIGGI'LPOIAKRt3CIPi7fFIGYG
~iP

~.DLFLatKLFPLVDCI YipN taaily .

KIIS.ASVhBILPVSLiYCIi.ISGCVFFI.
sNfYS85LYlWOCRAPLEKIOK<.Ol~Ipti.O'!SL

CPIt_0973 1116116 1117537 NLiRIp&pi.Ii~DFSNRW.SSHKLIKDM1t66AQN1flrro'fSKSFQSILSPIQTri.Tl11031 'suoC-Succinyl-CoA Synchecase.
Beca'LiTF~fRtt~IORLKCOISOLLAV6KKLEIIC17M1hDIL10Ip0iI0LMifiT.

IPPYWVVSSCELCELLITKSGLDSAWKWVtWG
AWG.iYGDYDSpTT8A0GItFRADIIIRLP00RCLIID11KAPISD~iIIFSVMl01m0i.V0K

GRGI0K9GVIVJ11tS8XiILQIIVAKLht'InI0IPT5NQTADGFLPVEKVLISPWAIQRaYYVAVIK~IIKTLKSKS
YWGIPHOSPiYVILFLPCKSLFND11IRIJIp~MIQAiINVIfiSpLT

IImRJOIRCPVIJQSKAGQIDIT.EVJWSSPWILTLPLTSYGHIYSYpL110ATK1101WiGiLLALLKTIAYNiIIQC
IL4KQIQ6YSt.LGKCWRRIQW!'l'HPQKIOIIfILJp'tVOlYlal' Vt4pNOLIIOG~KCFYAD7VSLLCINPLVLTLCGCLLVLDSKITID1S111LY1WPNG6VLSSInYRVL.PTLRKFiGL
CfSSSI~ILEP?PIESL1TSFPNTCDIDTtIUIYf YDP8Q6lNRDVi.NfOIGL&YIALSC~ItIGCIVNGil0i.7WSTi.DILIOiKKitiIlAtiFLDNGGG

ASpKQIQEAVSLVLSDESVKVLFINIFGGtI~CSWASGLVAVNCCADOVIIP'fVIRLI%:TCPn..0981 NvEtGKiIVpQSGIPCQtYSSMF.OfiIIPRAVELSNpssA-Glycerol-Serine Phosphacidyltransferase KNPf.CY0QK1aJ10ID1171fx.DLIJ1I1GKRRVYtPNAfTAIChCCGLFI
IFKSVLRTSSBViL

CPn_0971 1117537 1118133 FHRt.OGLSLLLISJWIJ1~SOGAIARiIBLAFSAPCJIpFDSLSDAYfIGIAPPLIAIRfLD

'sucD-Suecinyl-CoA Synchetase.
AlphaGfYVG~IFFSSW.LI?SIIYSLCGVLRLVRYNLFSOKTVDVSKpYCFIGLPIPAAAASiVS

VCRFRRYMFNSLSKNfPIIInCITGKAGSFHTLpCL~YGTNFVCGVTPCKGGTiJit.DLPVULILiISDIFPOLPJYO
LRVGLLSFALLFIOGLMISPWKFP4InWFRINVSSFLLWTICL

YDSVLF~IKpJIT3GRATMIFVPPPYAAEJ1ILF~!'.1GIELIVCIT~IPVRDMLNARVMDAACLFFSGLVDHFVIVY
FFLVSWLYTLYGFPIFSIIYRKKS

NST50LIGPNCPCIIKPGECKIGIIIPGIffHLPGNIGWSRSGTGTYSAVWOLTQLItIGDS

ICVCIGGDP111C1'SFIDVLpALfJf~PYTELILMIGEIGGS71i6FJVAlIWIQANCTKPWAFCPIL0981 IaGYI'APIIGKRMGHAGAIISGNSCDAKSKfWLRItSCYTWESP1WIGK'NDAVLMItEL'nrtlA-Ribonucleoside Reduecase.
Large Chain' GKVMVIYEtKNYTIVKRNGNFVpFNODRIFOALEAAFRDTRSLETSSPLPKI7t.EtsIJIQI

CRt_0975 1119075 1119677 TNKWKEVLAKISDGQWZ'VERfODI.VESOLYISCLpDVARDYIVYR00RKJlER011SSSI

No ralttlsc homolo0 Prnenc in Genabank/D~LIAIIRRD(>GSAKFNPItKISAALE1WRJ1TLQINQMTPPATLSEINDLTLRIVCDVGSLHC
as of 11/7/98 3IEEQVALSIAIKfLKIItJILILFPLVLLJ1WVIRYQIJiANFHCSWPFPGPSVNpJIYKCSEEAIM.iEIQDIVCKO

EAKIEEM.DLLDLITLEWSSRCLRQI>M'FANRLEZELIOELRVSETE6LISLOGKRNLVRQKEDf:TTYLL1IKTDLE
KRF&WAGKRFPKZTDSOLL71DMJ1Fl8dLYSCIKEDtYI'fACLIIIA

:.LLTHFPNPPKRSRVESVGHEWFPVFDRLKREEEIICOGPITRSNEELWALLDHCTARGRAMIEREPDYAPIAAiLL?
SSLYEE3'LCCSSODPNLSEINKKHFKEYILNGEIYRLNpL

IHKTLWfSIFFKYLTQIELF
KDYDLOAi3EVLDLSRDOpFSYMCV~R.YDRYINIJtEGRRLETAQIPIi~OIVSMGLatJ~G

EQ10011tAITFYNLLB'IFRYTpATPtLFNSCMRHSpLSSCYLSTVKDDLSHIYJfVIIDNAL

CPn_0976 1130079 1131185 LSKWAOCfGNLfNfOVPATGAVIKG7TICKSQGVIPFIINAIIDTAIAVHpGGKRKC:AIICVYL

No robusc ttoll~loq pnsenc in Genebsnk/DIBLfrIWNLDYEDTLELRKNTGDERRRTHDINTASWIPDLPFKP.LEItKplPfLFSP00VPGWi as of 11/'!/98 II*1LVYCFDPS1IPTSPFJiR(lfAALpRWFFt.OCHW1RILTLECt511!MFOENMSISTVi!(IAYCLEFEKLYEE
YERKVFSCEfRLYKKVEAEtILWRKMLSMLYE1CHPWITPKDPBNIRiN

LKLISYLLIPIVLIALLIRCFLIISRPKCNWKCDSISOWIVPHDVQPFNDFOLFNNOERLNQDI~CSMLZCILtlICSF
a~ETAVCN4:SINLVENIPDIDKLDE&KLKCTISIAIRIL

IWKHRRWSCItrirt*NPVDYLRSQFPCFKEIPEAIRCENYVSDGQFS~SYLRAMLTONVIDWFYPTPEAKQIINLTHR
AVGICJMGFOtA/LYEWISIASOEAVEFSDG:SiIIAY

DIVCIfILSLDETYWfNVILKIRMICITFESFPCKEADPN1ISPRVTHHYFDESWKAGARNVYAft.A.iSLLAKBRCT
YASYSGSKWDNCILPLDTIEL4KETRCEHNVLVDTSSKIIdffPVII

LGOQiIIMRL.OEALfRTEKPGKOGECITKOFLKDYCKKHLEVNSCPDFIESLVDEKIREFDTIQKYriARNSOVMAIA
PTATISNtfCVTOSLEPM1!KHLf'JKSNLSGEFTIPNTYLIKKL

RCPSIWSAVCDVIDRKCOEHLLKAIINEANRRLFCl50JSSPTMI~NQVLFY'fIFSPPKLKEIGtJVDAEMLODLIfY
f'OGGLLEIERfFNNLKKLPLTAFEIEPEWtIE~fSRRpI01IDE1G

PPM$SVYF .
VSI*6:lLAEPDGKKLSNMYLTAWKKGLK11YYLR..~QAATa~~/EKSFIDINKRGIOPRi150i KSAST.i T WERKITPtrt.'~MEEGCESCO

CPn_0977 1131339 113310:

No~toousc ttomoloq prnenc in u~enebank/EMBLCPn_09s15 1115173 11)!:571 ss of 11/7/95 LYIlIQFANtLKSSFt*fEVYSFSPSVRTSFOIiRVNAALONWFFLCCRALKWSLDSCNSCQ'nfd8-Ritmnucleonide HrNiucca.~.N.
:amLl !:h.tin-aCELrNPIDT'ffKVLKItSYLLIPIVtIALLfRYLLJlSNFTAKV3QKPWLKTLQLGIDIKI3VNKY~'..RKKNNPR
LFN!:RRLRIL:iITEK7ktAKMEADiLI7:KLKRVFV::KKCLVNCNpV

::FtLPCiIiVllMO:fATLFKAIRLECKPVDViYHRLHS01IWFYIPAQKLPDDLRLTMdLWt~LYFIKYKWIIWPiI
YLNI7CAWAdLPTEVI'MAF.DIELWY.~.LEI::EDERRVILW4iFPfs PEKE'IRKTEYVRNMWiVMCIfLT~.aQtiKERIQQWv~DSRa3TrIJCABKVLQYRFIONPQSQ'fAE:II.V':NNI
VLAIPKIIITNfEAf~YLWWAFEFJIVIfMITFL'(It:F':4:LDEI:EVFNAYN

':.EFURLWF*IITTK'Cw~nEDKIYVQ~DLFOIUIFOCIrWPQFI;7VI0.'~.PTFSEELVHENSOKLOPAA!:IRA
KDOff.Mff(.TV(WI.MtIF.~.VV.~.::FLaa7~YIKNhN:%YIINI1:IFPY!X:PVIIIL.i LOr:tYPEDDEFEDKFt.NfL.LKAVtJIMt:FECC~S!A~YIFLICPIkILIWIPFLJOIpKFtIRpIWtTfta:EUY
VY11.RDI?IIILNI:IDLIN:IKEfIII'~IWITl:I4tEEIVALIEKAVi f.EIEYAYD!:LPR!a4:IJC::?tFIIWVPIIIAINthI.I:H
I<:LYf'1'/tC:HNi'YfWIL:P~11M14K

!.'hue U 17N t 133ti5 ! L l.'. )h'tRKNPFETRVTFY!Ifl4VII::W
;

w. tanu:a tNrrnrrlrxl uw~aene 1n tamrl.tnk/!?IOL .,:: c ll/7/'tN

KYFFlIEVY::1'IIf'AVR'P!:FVIIRVIIAALpAWFF4w:lIRU!W::Lp::!:N:X'.WAYOELY.iI"'M'nIY
y'rW. I1 u./l'.'. 1I of!..

I:IM.1!LI~:ILLVI'IVIIALLfRCLGi:NFRIDVEKERWf.KIREIl:IDIF~KLP:SW!>pY.ut114r'W'r..
l rNtM 11~rlWl.e:.

V:::;FIWFY.KnK::KItPRIDVDYIITLH:vIfWINFPtt'FQKII'KT:RF:iYWF"..QK!'fRKROYV1'LLFN
YtVOI~:PI''YIYIKt7!NIwInpn:Vl.%41'NIfII'I:IIVIIY:.T::71YJE:YF~NIM'::Ilk'YiX:

NlwI.IMIVU:YLT::ITXaStIWYL::K'P:~IQ::1T::LDFERVLQIV'.LTDIIDEWGEVORLLNEE:a:NC:LW
'N/KtAyKIHVII1.WIAVPx.let'IA'fNKIW::YNININ~fr~IILI!Ith7:fAY.'fFHJYWP

::ATY.:::X;OKLVf.I::I1VSDIIC(k~IItFKFLE'VIV;:FAFIF.ELVEEII::f:KLliLDFIGLEKIWllt YINI!LWNFPDfUII'KMNIINKIINId~rl<:1S'.l:I::It:aIL::AVNALATINaffYlJ*::II:A

'I'!.IS~RLRN'.:LLNAWIIIH : aY:VD I yfIIIJIPRMFTI"IY I KlfPnPi al::wl'f7 IKI!Vf:M:L t I1TF.A t~~f.~~ I PF::R::n rllrfY aJN l F'ITl:F I KKIk t 1, Pt70PRNLYLCKTPtt:WEPSpu LFTPLFLILYLLfI'1~VP::RCiANSPiSCS
CFn U~tN7 1117491 llJN115 PARNLLIICONKVfFAGVACIEk.ucEELLEIVDPLKNPNIti"SLJCRIPK4yLt.iGPIGlG
YtVe-Like Pr~iccee rRtlA metetyiase KTLIAKAV"''QG~t911~I1~3DPV~1~'"t~"~ll''EpAI~
LQ~CTFAIGPITtPAYRTLLT)INVNOVrHEIPKTL1NPCDTVIMTCCNCNDSLFLAALL.O RNRCACIOOGND
6LVENDOPOTRECVIIIhAIrINRP
CCGRL11VYOM1(G4aNALLLFE'INLSEOERSVIEtIKEDSNENILEKDVKLIHYM.GYLP
VlIR.PDIKGRFEII11VNAKAIKLDPTVDLIIAVARST
KCMtEITTLMTTEISLi.YJItJ~tIVRPDCLLWVCYPGNPEGEKETNSVE7IA0RLNPKEW
TAVTAVDVAGUtDKVLYGKDtRSLENDAEERKTIAYHESCHAWCLL11~NGDPVOKVfII
'YS.iFYYANRCRAPRLFLlORQC,i ,F'.aSIIDKC
PRCL6LCJ1TNFLPEKtIlail'AttKELYOpLAYLNCCRAAEEIFLCDIaS('rIpODISOATIG.
VR~~NVCGrr:NSPOLGNY:YDCR50CL;GYOCYNEKSYSEETA1C'IDTELRNLLON1YDRA
::::i:.l:::~:..K.":.".~ .. .... . ..::'KE:"t.'fv;y; "::i:'.. :. ..:'::r :!LP47Li.i.w:
.. ~y'.H. ;r.~. .;.,~j.tn.,. rrTtn.~ N...p,... :~I~t~r~I~f;F~...m .. ., .
KPFINLINLDOCILKNKfAAPNtIPPPPVRRSVWIJtRYSTFRIGCPANYFKJItHTIEEARE
VIRfLNSINYPFLIIGKCSNCLfDDRCfDCIYLYNAIYGKDPLEDMIKAYStx.SFA71LIG
KATAYNCIfSGLEFAAGIPCSVCGAIttIQAC'ft~ILSDISSVVPNVITINSEGG.CS~fSVEEL
~LSYRSSRFNRQpECIL.iIITPI0LS1~QVSADHSIISIt~IRLKfOPYI'QPSACCIIRNPEG
TSAGKLIDAACLJ~.11ZGWOISPLNANFIIN1G%J1TSDEVICOLIAIIOSTLKTQCIDLE
HEIRIIPYQPKINSPVSEK
CPn_0989 1179552 1179016 CTS77 hypothetical Deocein LRTSLaVNCVLLTIFNLLVNJtTLSPEKP'SGSPISISKEFPCOitllHtEIILQ'S.YAL.DNAPS
AEDSLVPLIJISQTAVSOKHVLVAIhIpTKSILEKSOELDLIIGNALIfIIKSPDSLDLV110rV
LRLTLFEHPYSPPINKAILIALAIRLVKXPSYSE71CPFIpAII3iDIfTDSSIJ4fNSLSI
Cpn_0990 1179A80 1140~10 intC-Initiation Factor 3 ~ QRIENISOVVK~DIIOVKLLSINfltGGLKLSHKATLE , SVAINFKINROIRJ1PKVRLIGSAGEpLCILAiKDALDLARE11CLDLVEVASNSEPPVCKI
I~IfOKYRYCLTl4CAmSKKAQNQVRIKEVKLKPNIDENDFSTKtl00ARTFVEKCiIKVKIT CPtt,_1000 CMFRGItEGYPENCFKwOKNSpGLEDIGPVGEPKLN3RSLICWAPCMfI'ttttKQFJtS rsl5-815 Ribosanlal Protein SAFAI1IILRRNPNSLDKCTI~EITKKFQLHEKD'fCSADVpIAILTENIAELItENLIOtSPK
Op~RLALLKLVGQRRKGLIYLNSTDTERYIOiLITRiJR.RK
CPtt_0991 1110391 1110611 CPL>

r175-L75 Ribosomal Ptrocein _ KORKNRKSLIPKK~K1NKSVSMFXLTTRPCKRttIQYthC-tYtosins deminase SKKSSGEKiINLSItOPLVD

KCpVODfKRIIC.V
YYLEIrGCEKLIt~KDIFPI~pOAPKEAAKAYDQDIVPtiIsCVIVImDICIIAIWlI4V~CJt DATAWIEILCIGSAAODL.0NNiILLDTVLYCTLEPCI1lC1tCltlt?LNtIPRIVW11APD11RLD

CPtl.-0997 1110612 1110996 AGGSWVNIP'fEENPFtPIIISC1CCVCSEFJIBIIIIJ00IffPVPJ(RRENSEK

r120-L20 Rlboeamal Proclln GIfhVNVRItT(iSyAgRRRRKRILKpAKGpyIGt>RKCHIROSR.,SVIOiANItFIIYtOiqtmRlOGDCPIL1002 FRSWIARU~IVASRINSLSYSRLIDK:LKCANISt~iRpG.SEiAINNPOGFAEIAtaQAptACTfIS
hypothetical protein LEATV
KSAERIMCIKIVtLLDOLYEDOCSRt.QKLCEElVPNLTPEDLIQPNDFPOL1!>u~WAPRfE

DGVLSGICEVPAAILAALSpEN

CPn_0993 1110975 1141070 -pheS-Phettylalattyl cRNl1 Synchecase.CPtt_1007 1151862 1151091 Alpha-KSttCSttSLGIRIS1~9t8EIEAVKQpFNSBGDOVNSSOALiIDLICVRYL~GIFRSFSEKCTll6 hypothetical protein LKpLTDKAKI~.SLINDFKTYVEOLLDEKSLVLi.ILSEQAEA!'SKEKIDSSLPGDSQP90GRTBNkTINPLIJ~GPD
ROIAGRASIrnVIFPDKtSINPPNLSKii.KKLPSVILYI'SCIAPIISY

HILKSILDDNVDIFVHtGFCVREAPNIESSJIiHIFI'LWITCDIIPAII~ItDI'FYil9ATTVLIINIDrIGIFGLL
EItJtL9NtGI0KNNIwQFLTYPLITJ1DSLSIi~SFEI1'pRLti.Rlnl RTtffSNWiIRELKKGOPlIKVVAF'Gi.CFAN~ILDFTLFYKJIIpHLIRKUGAFSVLVNISVroAi.IIGJ1VIJ10 FMAt.INSSQHFIOPESZD~Ii TA

ILSAFYNSFfORKTID.RFRNSYFPPVEPGILVD11SCECCGKCC71LCKH'1GMLEVAG7101I.LTVOIFLDPEKRI
TICPTPLSIfSItdiGFLFVLCFYCCILIPSCJ1PLLLIJ1811LAIVi~IIL

NPWLRNfZiVDPEIYSCYAVtiICIERLaNLtfYGVSDIRLFSENDLRFLQpFSI~CIpbPYTTSLRF

CPI!_0991 1112771 1111110 CPn",1004 1155115 1151A79 Ci177 hypothetical protein CTB4T hypotheciul Dlrouin L!>illIIRDGRI~IfitSRRMEpALENi.EIQ.KEISL71TSe1DSIlLINPARFI~RKO't~SSVNlD9CNLSIEEZ
t4SIQPVSNTTPKADKIfIPDSTKVISDSITINKQSAFYPCISNQ.RtifilTlY

EJ1LKNVaJYLLEISCVSXStitIDKAtJVSDPLIAGV~MSFL8A08~.YKSLLDEYSEV'tOKSIL71VL1xN1'IVC
OQRVKELItC.PLLKVPDI4KKDCSDDEYIQJpNCI0I1Y06~QIS

IG PECFLNNL
ANRQMIQQELSSAt70RrW11NOKSVNS1TIESNQILpJITSSIH.STLKELTIKJ1NLTWPID

NDJILVpIIYI(QIQfLNtNNDGDPLT1ITL.IJ~dd8E8VIDIIASSLVN1iG11PLItLFY~tALSN

LDIIaWKVtINAVNIILPfSRYEAZlIVIKSPKKNNisiYFNDFLLPLRWIItDI1a~11fIDSDCPn_1005 ERKqfKLt,ISALSLCIFGSKLVPEEASRYLYFNIQTKLENtINOKKPLSPGQYLTDAYEELCTAIA
hypochetiul Drotein tiRLISKYPNGPL!'KAN<RIVLtNLRRPYOPNILGILPSLCC1'LKiJGK$IDIIRSPSPV'1'QNRRPVRLfMWIID
PLSAKICPI4AAINVPGTPITt70PN1'ATADDIIAKPSKDSNPLNNY

SSILYFLGFL.NaIGNRSEVfLVLNIONRISRKERARSRVIEEALEOEBNAPYVNYVYOSVLVApONLSIIAQEGQANS
SAOTYfJJIJOFJ1LYQWSIPKNKIJtDItBSSYLptIp$

AFS!'PCPEELL.pNLESItIGDZE1TADPFSILpEtFHKPLG&SFPL?KELKEFVCS1LKEONOAIGASROAIONDIS
SLCNIU1QVISSNLNPt~NII0pSI4VC0ALI0TlSCIVSLIAN

KLTALKDIFFAKIUCILTAN~C.LLLHLLSYLIVPKLIERTNPNSIWVSKDGLDYVSVPII

AGPJ1FPSRGfWDGfSLKLLLTNVLSP?LVARDRLVFVSNIELLSKFVNCLKKNRQGFSS

LKSpFKDDIECK'IEPTCYLNELTEYStp00JI. CPtL1C06 1156197 1155990 CTS19 hypoehecical Drocein CPt>_0995 1115515 1114115 TKVNFPINSITTL.GTLPIVNfINSSRPPLEPildfPKIGI1VL!'SIYELIiAAIEIRDdM.

CT87A hypothetical protein TCSQOLNDNTNIpOQLNOLTNDIKYAIVSAGAKEDEITRVONQNpNYSAQRSNIO~LV'f RNLIwKRNtvLTRFNFALTSLLVLALIFYASINHStJiTLKCASTMSt7ASVKLSILYYL71QTRaNGQIIL5HA51NI
NIIQpQSSQDSSPIKT?'NSICS1'VtJQLNKPLC

LSLKAEPLIPOLVAVATTSTLFANQNIOtEIILIrQASGLSLKStJOIPLLLSCAVIIQRILYA

NFQWLHPICEKISITKEMiDRGTfDKEpGKIPALYLKD01YLLYSSIEPKTI.TLiaIVIwICPn_1007 1156689 KDPttTIYTNEKLAF1TLSLPIGLJVrt'OfFANDSENLELKEFFl7l9tEFPEICPNFYPi~tPFS07519.1 hypothetical Drotein KLFSJIGIJlO'1RLSEPPKItIPWNJ1T0LGLSTpVPpRILSLi.AQFYYVLISPLACNAAIILSALwYKSLNDEEKD
VSCNECNDYPEVFKDDVSAYVLVTCCQNSSECKIQVt?IflfDPAYIS

YIJ:LRFSRTPM'WYLIPLCI11NIFFVFLKACiVLASSSVLPTLPVNAFPLIVLPLLTNYLLTKARDSLDES

YAYAIQ.Q

CPn_t008 115d901 1158227 CPn_0996 1116592 1115519 CT81A50 hypothetical pcocein CT879 hypothetical Drotein VLNYSFII;MLKPNYVLSKRLYRWVNpLIKLCDLVI0JSR3F5VEWVPISALLLIf'CCIQCA

ANpLLWICVLIFRYLKTAAFCTLSLICISIISSWEIVAYIAKOVPYDTVLRLNAYDIPYLSWKVSLVPFLLLFSFLIfF
LILCPRCKCYALLtrGSfFVTLYVAKYV1IDETLYVSINGSGL

LPPILPCSCPVSAFSLFRKLSDNNfOffFLRASCASQSIIMPPVIJIVSCAICCLNFY1'CSE:VSPLLAPCLFLt%7V
Wt 1QEEEtiVKGKEpLRLSEDLDApRSAYEDLLLTKSQOCEFLOAR

L.ASICRYQTCKEIAIeI~M'SPAL.tJ.p'tLpKKENNRIFIAVDHCAKSKFDNVIVALKGIJtIEApCLDRELTf7C
QEt.LKA?.YCKOEYLTIDLKILADpKN~ILEDYAELNNKYIELVSK~DV

ISNVCIiRSIIPD1'IICDIVKAKtriNPISKLPDSLTESSSPSSpRPYIETLDELL1PKITSVFPWVAEPSVCCSOC$
ERVDVSRWVSAt-0EKEESLLRLPNEILVEKpIICSDYOIRCptiL.G

TLFIIGKSYLKTRTDYLPWKOLVKQSLIWSHLpETLNRVAIGFLCITLTYACNILGINKPRLLLDNFTALERRCEELVW
LWpKITpINELi~LVCK3EElfVSVEPSJINAEtSCVEEKDYK

FRKSIALYFIFPILDLILLIVCKNTKNLpLAIrLFVFppLVSMNPAARAYRFSRCYACLY:QWEpFLEK:,E'1L3LYR
KKLFAVDFKYLTLKKKEELTKpDISPDDISMICDLLERI

EILEEELiHLEELVSRSt.SL

l:pn_0997 111d699 111Tbd1 mesf-PP-Loop supertamily ATPase CPn_InOU 1159095 t1581Rd AYIttM.GSDLLRODKOLDLPFASLpVKKRYLLAGs"(iC3DSLFLFYLLKERCVSFTAVHIDmaptNfthioninN
.leitutpnptadaae Htlllt'o'CiIWEIIKELEELCAREGVPEVLYTLTAE00CDKDLENDARKKRYAFLYESYRpLD'fRLLHR'/I41KN
NDR'Nt.'C:ivRNWKOrHYPpPPNNSpFJILKOHYASQYNILLKTPCOKAK

.lcxl:FLAHHANDOAETVLKRLLE,~.JWLTNLKAM7IER.iNEDVLLLRPLLNIPKS.iLKEALf YNNC.YJITARILDELt.'KA:i~'KCYITNELDEL.~.pELNKI!'fDAIMPPIfYI:SPPPPKTICtS

OAR(:I::IWUp::NEDERYLRARNRKKLPPWLEEVPCP.ttITFPLLTLCEE.sAEL::EYLEKpIJtEVtt:IY:IP
NDtf'LKL1:DIMJIDV::I'.IVDf:YYCV_'CPMMIr:EVpEiKKKICOAAL.~L

..WPFF.~.M'191VD:~tt:ES.PCPDt.'LI()()Aff.CKWVNKKFFHNAC:LAV::RHFLCMV'IDHL::R~tIL
:aAILYfr3tfIt:EI.:F.IIt:rIRApT'lt:F~WDpF'h.IF:W:IEFIIENPYVPHYIUiRyMIP

a:ATtJilIRNK t V t t KItJVW t D 1 J11~?I t I-P I Ef'M 1 tllh 7KKta:1.17DPKN1AJEAR'fCDtK~G::A(~WEIII' I1 tTETt:YEiLTLIJ1D

':Itr ~19.')H Il17Ni1 IISOSNd ar:_Ir:l': 11'.tr:'!5 115NH.'7 tt::l1 A'f1'-rls'paui.nt =1110 Wtrtt.t.tGreLr'.:~ n~tn tn:r i..rl Ivrt.rrt LI:a:NKtTI::KUKKNKI'EPKKNFI'IVFFFLLFr:WPr:P/At'(INt'1.11t:KK.VtVn:FatIQLEI1'MLI
LIJII:a.l.l"tVI.FP::W ;:fl"/tVII.WCN1'::RKY'/lPVILRtI'1.4'AIf:ALILFVITfIR

I.VNLJILIVI'Efk:IIKIALNDNLV
>F(7LRFRLRIOTQEf'fiIJtYIIYI.I:LIt)t7t:HRLDLDLOETU.':FFQPLfiR:l.sit':rl t.'t:Fl.1.t'N::IKMI1J11'NPEPAYDUf.~.KTF.PIFFPIJ1F('VI'If:PA

N:a:M'fI:KEVTtJ:atldF:.At:li::PIPS.YAt:;YN:E-I:I.::VIa'HPLVV'1'f:PIICP'QLINL'!t'fAtJ::lNt:l:ta'l:.I:RIIt'f:VIttAYIAF.~.t.tTt .tl:::::FFIXeLtY:NFY:L1~LERLPISIAL

IC:1 .~t:P
YI1't.:R::1'F'ALRTW:::DLYELICKYt::I'VII:Ir::aTLKREIJCttLYpQVEVSLTO1JJA::VtNf4IJ
:.:I::1 AI'N I, a1' I.:

tTtJfFJIAV'f 1.W .'QVt:: f WR
LS:i.~.LW::EOfJERF:aJLtr::VRLYREPJe1KY11KLVPrIRDtl:

VI1VLEIttJa:Rl::(j111WYPNNOEt~::RSLEKODPEVftNIWFJItLIKEtWfAFKFNII::L::FKA'an_14 11 1 n.n um t I'.~rNt:!

111YY~~-1:1 DaLppELLC3RRCfRSEfYA uEKKLETKVpIKD4.'KCLF,'~QDQDSNGFOKKSPL<C'C

'TN'il Irytnchetieal proton CrSRKNRIAKAAOAVPVIPPP:n.
YF'a3YLLTKOGIG..ZDP.:.4YCCNKDSVfSI'ORELDA
4V~tY;n'VKORNL'ILLRESPFArI Y
:JL7QiLITtILAL ' C'IL
' ..... .V3l~iLGLSfX~LA~iI
. WOLRLETLKV~K:;ItRC1G10S11:~1~fIJ18k . ~IVI'~
FItRLKNYPHipf.irFLPC ~
xtJ<VIXfWGLEIM7CIAYLLIILVRrYLRLt:KEEptTP'fKfNMSPSYS

lt AMfALYt:IJ IL
a LIiW IKCLaDKFNDNLENICPLK>rCEGIIRKI
tTL LO

Pf. T.'.P fALPW P,:P3CI ..~sP I
L~,.'71IKGLOPAIESCNAALRCAILFSQAEIYKLKGKL1'K
lOLOIG.KSFOROOIIYE~R

s'OF.LL~SIESSFEALSRLIlIYIRtELDOVYLH:.LRG

~Pn_1012 1 L5Z3Z0 1160121 y:.:0-AOC c ranoporcer penlleass "FF'"PPP.i:IA.:.~.fI4SLPLLL t.'Pn_IOCI 1171270 L171b9H
YMKYKFIfYFVTVF
~
LLFIS~tCffSRMPPTF
~

. ' . '.. w - ' . ..
. ..
AIP. .
.LIT; ..
'' . ...~L\iL::::ri r "lF r v:
"'.,i ' ~ :
: /
~
' .. ,F
.. rT
. rc .. l .dlrrir. .c : err.
. ~t:t:.
_" ~~.'. ~-p.,; 1::: r,.',:.
;;y,RIfJI.\LP il::
.:. .~..;.......Ia.i?' :y.'-.: .!.'111 r ' ~ ' -' "" "
~;.';..
i r~

.
TfGKIfKIa~EGL.EK:r"fKtvIMAYLGKDYAK.iITVFkWLYFFNPfVSKFWP'aiir13IJ01a ......
..
:r.: ~.. .
.
LGILHLLSRPNYQ1ELAFAGLATLSILT 'L.~LF
~
' aAGFaALAGiiNASOSt:
ECYSOAWAYufTAVLRDXDPYPHYYAYICYTL.TNENLEAEKALDfAWVRiIt)HIIILYNR.
:IIFa GL.KIAIC
KSV111LKALSVLALIPINLIPWKDNSKSPPtaUD~R.TSL
SLK

N KEEILDIRKHK
7ITLLICKLIrSLrRYKRN
TLLL~WI'PNPNNIPLYAGVAKCYPKpMGLDLOtQIClIDSSSAVPNVLft0Y0llALYHAL.G

INKT SIKQIPIOIVCRLZDSSLODFLYRSGDPIYKFmLNDKVL.GFCiJ~t~RTILfRIi.ET

' 1022 1175709 1171:16 CPn IGHPVIOCrLBaICOLP .-WpNCVIIPSEVIaIVSBDLISt>NLtJsRtIDFLYCAFYNI1931IKL01CTt63 hypotheraeal Drotsin TGPQLIVrfKttGTKASEPEIVGinKAt.OESIIFSKDNP~rKLYAKLTKSIPItNLYOErSTFIIfALKLGIlIIfPV
PSAVPSANITLKEDSS1YST115fiILKTA1~EYLIISCfAL~SS

YI:pWEElfPLLIpSODPLSKDLVDKLLE'tIIKRYPELASEVAXrST.NDLYNPSLPEBD~fTtlu.ISLAtGOIILA
TQQELLLOSINVHOLLrLPPI:VVELBICWDLLVCLOIAtTITS

6PpCICTOSRSEQrLP0088S1I05ALSPRSLCPEISDSKpp(~AIQTPKD$AVPl01SGP8 CPI>_1013 1162209 1163621 PEl~pIIMSLSQASSSSQRSLPP0E8APaTLLtOGKASSriPLSOrSABItpItGLTISKS

lulK-ruearste Nydracass NELY1~POODROGRECHDRGDOE~KKKIOOIRGLCVGVAEEl0~.0IMLIrSD
RENSWNRGNIDNROEKDSLCIVEYPfDICLYGApTNRSRNFrSWGPBL?tPYEYIMLVNI

KKCAAQATIpDLCfLDSKNCDNIVAMDEILOGCFECHrPLKWIp'PCSGTQSNIIiVNCVIAONRPPAEETSK>CEl'I
FIUtKLPSPHSVISRFIPSKNPLSVCSSINGPIQTPKV~MIiVl1 NLAIRNItOGVLGSKDPLMPNDNVNK~SSNflVIPTA1HIAAYISLIC11LLLPALDtIIIItVI.KLJfARIf.ODAG
EANELYHRVKORTDDVDTLTVLISKIIRI~ILRfS6DICD9tALT.I~DtA

DAKVEBERHIHKIGRiHL?IDAVPKILGQEFSDYSSD4RIICLESIArSLiWLYBZiIIGATAKEICVTI
ITQIIEKIIBlQRHLQEISOCNOARSN

Vr.TCWVPEGPVafIIHYLRICETDLPrIPASNYrSALSCHDALVDAfIGSGTLAGLTKIVGKLLKELlOTIFIYHLRP

~

ATt7LSFf.CSGPPG'CLGI~S.ffPFNEPCBSINPGXVNP1'CCGT.O~CAQ~~IOTVII~. .

' CPt~1023 1176005 1176331 RCNpE~IMCpVIIyNFLpSVpyiS>57,01AFSEfFVKOLKVNKARi~fINNSLIG.Ho robust hoKbloq prestnc in 4swbank/E~I. as o! 11/7/95 LAPVLGYDKCSKAALKArHESISLKEACLALGYLSI~ETDRLYYP8t~5~Na.DFLLIFINKKWrLSIIrFATYCASIL
4AVTWAVPLSEAPCKIQVItPVVIiL.QPOEEQ

CPeL1011 1165156 1163732 GSVIYSFfIfPYDYGYYYPL"LYCYTRlI~OESRLCY'IRrEDCI'IIYLCD

yehH-Sullace Transporter AyAgTLCYGIVKVpW1r101rIpTtLYTSIKD3Y5FPL1'FKKI~pAGITIIfiZLIrPFAIAIJ1ICPt1_1021 CVGVSPIOGLLASIIGGLLASAHDCSNVLISCPSSAFISILYCLSAIDfGAItALlllrlLLitxerD-InceOrast/rscombinase ' CVILIAFGLTGL.CTP'IKYNPYPVtIrGLTTCL71IIITSSQIKDFLGtpGIINIPADrLPIdftILCDFSIJLSVDI
GICQQSIAA
IfIFPNISI:CSLKIAPLPILKLN8IJ1SHTNPSTQFIII

IAYWDIILWIWDSKSFAVGGLTLLINIYFRNYKPRYPGYNIAIYIA'1'fLVYIL3.EIDIPTIGYRQDISSFLTISAI
SSP~ISpNSVYIFABELYRRItLAITILIIRRLIALKVIrLrLKDOG

SRYCfLPTAIPLPKIPOLSITIfILOIJtPDALTI71VL9CLLTT.LSAWA17G9flGNNK)iSICLLPYPPIIEHPKI
NIOtLPSVLTPOE<tDI1LL71VPLONERIP11HIJ1FRDTAILHTLYiIGYR

QLVApGVANZCTSLfSGIPYfGSLSRTAASIXSGTTPIAGIVNSIFICFILLL.LAPLTVVSSIGDLRLGHYSDDCIRV
fGKDSKTItLVPLGSMRGIDAYLCPrRDOYOIDDSIN~IL

KIPLTCLAAVLILIAWWhl9fSEIMiFIHLITAPKKDIWLLTVFILTVIfI'IITAAVpYOIII.FLSTRGHKLLRSCV
WRRIInfYAKOVTSKlVSPHSLRNAFATHLLLSfICADL1WI0~
' AAFLtTIKOMSI7LSDVISfAKYFl~.4DFLSKAIYP~tfEIYEINGPFFIGIIILStLIDGi.NWDfPRNL . .

RIASTEYYTfNAADSLIIOf!

DIEKPPKIFIIIGKfRVPTII7J15 1025 11Tt266 1175579 CPn ELICVDNIpSNIKSALi.rACJILTtiLt=RKTSfRiQ.Y,.
ppi-dlucoss-6-P Isalasr~ss GillOatSSYRCI!lIDERKRFIDGOSTICILOELALNPLDLTAPOYLS1IERItKFSLLt~FTF
CPn ' ' ' _ A
CT857 hypothetical protein tpossibieIRJSM
IH proceinl SrATDtI~DAILMLISL1CERGLHESIBJINOOOQWNYItsttPS~PAtJn KNIMD~IFSrITSVRVRSKVDNIILEVitCJtLOi.CALrLFGYL1IVPEHIVRVNKSAI71LDSSIZGCACDIAVPSR
VGORLI~TLTKYR80FZTZVOICIOIDSdIJ3PKJ1LYRALRAYCP

ANOfLHWLVCTSNIPNADHttILVEEIAdISOVIFFLF&ANAIVCLIDAIDCGFSYIVItICR'10001V1trISNIDP
0~.A6YLDTIDU1KAT.WWSKSCTIIETAVNWrA0F1'AIO~rLSF

IQSRTLLL.WALiGLSrFLSAALCNLTSIIIIISISxRLVItARRRRLLT.G71ICVI11VNK7CKDHFIJ1V1CECSP
NOD'fQIttLMIQaiESIGORrSSTBNVOCWTGIAYfiI6YfL.OLLpO
QPNARC~1L.RGSALISIfsIRNFLJCIfPTGVIPYSS0T.IlFPJIIGQOfGI!ICS
ASAImOIAT

AM'PLGWT'l~fiItdlIIITSWGIIPALIVPSLVCVLVJIfIFCbprTLRKRGSTLLVmVL.

~IISFfOCLNOGTDIIPVeIICFIfDIS
ARVOrBTSPVIWDEPDTNC

LpSAPPKSWIIfIGLGSLLIVPVWKACLCLPPFNCi7ILLGLCLVtd.TSDWIHBY, NDKSIAGDG

HLRVPNILTKIDISSITFrICILL71VN11LSFANLLTL7FSIi5~CIFSRHVVAIIfICLLSSFt3Q1'fSSOKLl7W
fIA0AIALiICGSEN1NPNKNrDGNRPSSVLVSipIIIPYfTGdLBYY

VLONVPLVAA1llOfYTLPL17DTLWlQ.IAYAJIGTCCSILIIGSAA6VAFlx.LEKVDrISIYFINKIVFODL.C1V
GINSfD01"~1SI~KKAL~VI'>rfL'DGADASNrPfJIASLT.TLPI~IFR

KRISI~tIALASYlOGt?sYFVLESIIIFrI
CPtI'1026 1175961 1179177 CPtt_1016 1167027 1165595 1cW
CSfGFCKICI~IFIAVRSRDFIJ'IHCIL71AR70GfQVVKSThGUtVFYSLVS

Ct551 hypothetical prouin KREVE10DODKLG7IINrGLLFTSSVAGFSKIX.'flmllAYDOIlIffIOILISLII7fAPLPNKIi.L

lGi~.SpQ'IOpARLpI,YLECI~TINYCpKVLSNYVRgLNDYNAGLTrYRTCSAYIPYVLKCPn..102 LSEDGHVF1IVDVQTSOCDIYtGDEILEVDCI~IRGIESLRFGRCSATDYSAAVRSLTSA.NO robust honolo0 present 1n Genelank/t!>aL as o! 11/7/95 SAAfCDAVPSGLA!!<r.KLRRPSDLIRS'fPVRWRYTPatfIGOFSLVAPLIPDIKPQLPIbSCNNIOSVSSPPLSPf IIVrtrtDIVPSS~S~.IQPMVLKZSILIrIILVTILGIVLWLSiAIG

VLFRSCVNSDSSSSSLFSSYNVPYTWCELRVpNKpRfDS~RDiIGSRNGPLPI'FGPILiICOALPSWLTYSIiCIAIA
VGLIGLGILVTRLILSTIRKVDitICYDAAVICEEpYLSItIRQJS

DKGPYRSYIFKAIf~'aNPNRIGFLRISSYVNTDLECLCLDiIKDSIhIEL.PCEIIDNLCKSDIR6IRdWMV~0'WIL
SEE~SfIMJtDPEYLlK7!lIBRLIAELEIEfi011LVAQiILLKON
' TDALIIDpiIDIPGfiSVrYLYSLtSM.TDHPLL7lrKHRHIP1'pDEVSSALtR9pDLLEW1TORVLYPIt N118LSRL1FRJ1YKOKFPTGALCPYRIEDIJUCI1~QILFLICP6CIAMVKSLPGLZ

DEOAVAVLGI:'1llCliYCf104D1AV11SLQNFSQSVLSSWVSGDINL.SKPNPLLGFAQVRP1IPKCFOSLVHRFA
PRSRITQTPKYEYNSRN~1EDDKVMVCARLIIKEFl811ViGiICSY~Xi HOY1'KPGF!B.IDEDDfSCGI)L1PAIWfDNGMTLICKPIAGAOGFVIpVTFPMISDIKGLICEMVALKITLPLPGVY
DfLVOLrPNLLTACSWKDICI~fSYPIfLRPYL.SVDIICKRLI

SLTGSLAVWtDGEfIENL.cVAPHIDLG1TSRDLQTSRPTDYVGVKTIVLTSISCIAtOfSYQLrCEICLKLFTICSPL
DpAWRLISYYRNHIPAVLtIBTCLPPPE'IbGSVFVti.PRT~Elt EEtn'SPO~fPEVIRVSYPTTTSAS
LLW$QIEVLATRYLKD?FVRNS6WlGSFBI~IftSYNEHCKEISIxItIltrAmYCI'IIHSLEP

CPn_1017 1168997 1169975 CPt~1028 1150995 1151999 lyre-Necalloprouaae indhC-Nalats Dehyrogsnase VIINR!(LILCNPItGFC9GWMI0vVCVaLI~PIYVKIiEIVIWRNWNALMKG11IF

'JEELVDVPEGERVIYSIWGIPPSVRAGKARICLIDIt7JITCGLVTIfVtISAAKLYASKGYKIFFLKOVRNAfKLWR
VAVi'C6>CGpIAYNFLFALAHGIriIFGVDRGVDLRIYD11PC?CRJILS

tLICNKKNVEViCIVGEVPEHLTWEILYaWEALPFSSOTPLFYITQ'ITLSLDDVOEISSGVRH6LDDGAYPLLNRLRV
TTSWDAFDGIDAAFLIGAVPRCPGNERGOLLII'QIpOIFSL

ALt.KRYPSILTLPSSSICYATTNRpIUILRSVLSRVNWYWGDVNSSNSNRLREVALRRGGGMLXIMKRDAKLFWCNPV
HfNCWIAMKIIAPRLNRKNFfHIMLKLO0N~M8M.IWM

VPIIDLINNPEDLI1T'NIVNNSGDIAMfACASTPEtiWQJICIRKISSLIPGLpVl3iDLrAVEEVPLCCVSRWINON
HSAKOVPDtTQJIRISCKPMEVICINtONLFalILVNSVGMICSJ1VT

DWF~pLPKELRCS
GRCKSSAASASRAIJ1EMRSIFCPKSDEWFSSCVCSDHNPYGIPEIM.IFGrPCNIGPSC

DYETIPGLPWEPFINNKIpISLDEIAQEKASVSSL

CPn_L019 11698~5 117062 No rotwac honwloq present tn Gsntloank/EMBLCPn_lOZ9 1151987 118:511 as of 11/7/98 RMSYENYOKNSWLRS:.CLL\KFFSRLLYRVPF3FR0DIYLfSSLYLKYPRLfFYDLGKYNo robust troeoloq present in Genebenk/EMSL ae of 11~7f98 VYSLRHCPYAKLCRLIx3A.f:.LKECRJVYCETPWS4'L1KICQAFDITSCDILYDt.CCCLGKVRVFVTSTMLWGVS
NRpSfDBLSONJIrKIIIrNKORFCFIrCSLCCFGFVFALrLKLGSRLA

~.FWFSNWRCOVIGIDNDPHFIRFSSNMRKLSSGFALFDTEEPKNWLSOASYVYFYGSPEISLSTLGtIiAffGIrSVI
CASAIIVpFLtJIKCSOCETSKt.CCAIKN'lliiSSLJftSLLVS

SFSRRLWEILLKISEMAPC~IVTSISFPLDSFSRGXECFFTfNSCSVRfPWGKTIAYKNMPFrTANVAWTVAMLSSFLG
SLPYMrKLrHTVLIFIPYLSAT11LILLFLaTSfSGLFFCI

tRKCS
PVWQIOE.iIDYRNLLCFRf3JILRpfTIVVIALVDLAICfWLALDSPYIIIrHLVELADIH

T4ISfLApIIFVLIVPIALILTPAVSFFFNfSFSFYLAKpEECKALVK

~:Fn_101~ 117:116 117Dti)~

,:THi.n hypncnacLCal protein CPe~t07D 1151901 1192913 tHRPNtMTVrYOShTPPPOCEFDIFVDCNATEEAY/MEVQVALPACFa?YAL1LMTSELprsvllcteA D-nstino ecid dehyr,Jqeneae '.FGtL'f9.~.ECAL'LVALPPKEKPIQEEpFLVKNDIWF3f3LPNLKPfI(>>CQ'!SL?SHRNPFKVNFNRIAVI/
:.1G:YAC44V74MLLLH.ipCfATLDLFDPIPLGIfL'rA.4~1S3i.LIJfAITGK

LAOQ:7f::::N.,~1't:Kn\7fET't$SeFPFfSCK.\PECD$.~.'IDKTFTV~PKTQEr'rOGSAuOKAL.IfPP
t.ADCI:INATHALITFJ1..''K1LLNVPIVT:ar~ILRPAIDEDO.WLITERVEEFPKE1/

::()AOF71VR::Y::::rTIKEtI:.AKEKV:~bTtt.~.AE717KH'frlrK::DATL::PIISLY~TLMKEVPQFI
rWEKAfd:FIIPSNVTPPNLCALFIK.7:YrIdRLDLYItf:LAOAfNKU:71.'fYDELIEDL

.\L::::PX:~>UKttFFJIHDLRQcIDCYEC'fOECEE'tKILKTfMK'7YF..:L.(xJT::::tYl'/'fESITPI
ADIEEF'fDtItIVTIt:WA:aLPELKDNfVNKVYt:qLLEt.'.WPY.DI.AML;iF3TNANKY1NA

IPDPI'/FFAL::E::QI::%t~4:ItRVTNLDVLRtr'fEWYII1LK::RANOITffRLEEREIJIERENP~iKITh:
it.H\TFEIINUPEFTPDPAtAYOtINPP'/L::LFNILK(MQVIJh'1'.kilR
r&IL~

AIIEt.AA::I::RyAKY.INWU:1\'PITtGTL:AfAMI.r;EL::C:D::III:FVC'RI:d:PFKDATAKItI.rV
I::PLNRILt.W6'Iva:IL:::K4LLY1K:IT,IDIILAt3AVLRY.:.TAYIAKEFLtTI

'PFFY.t: (.:YVF'P::l.::pl::'1;.\A::KVIIF:L.:iE::
A't R A~':\E'fRK r.YFHMItVU6l.'TRT
t EEYKDNW

N::MIJ!IFt.I N I llJ'fIJIWAH::LY'; 'frr_ l4 t 1 I I N.S~i7 I l HID4$

.ir.:DAt'Irnrnr..ICnirtinr ArN.vfmtt.rt .'I'n_10u Illt....t ll.l~l~.' IKF71>lffl:R'rK::.~.KNt/.TIALNiIVV.~..~.IIrYY:IFa.fI~INIAAT14w4:AVlhbtlt.'It:Ft i .TN.1 hYlnlv.m,:nl Wwr.m NFFIMITFRII~:TLRrULJtl3tl'IMY::f.EC:ft:l"fI':FTIt~:/Wl.t.'~tLF':NiY:YAYfTNDA

n'r:N::fM:::1ll:~tA:.F.YLtJkHJ1'1'ft'I/APR:J;I:::::\'IYI::Y::ITVAtritllv'K::LPK
PF1'OKIHYF'YIl!Fl\T:NP4YALUY:::ILIYI1If71FIVr.Y':III(IA::TtIIVIuTtI'KIIf1.fffti L

fV::nJ'K::1?I'ffMIIIK'rFtIATPRERI1.RF'1:::.~.FF:xJUINr:~JAGlr::::IWNLF::91IN:IT
EA'PAFFFY.LAYYK'rDFI.IfSIIAYPKAOP::Lf:::l:::yl.Y,TfML'JTLWAYh:IFY:A\1'1S:7.YIAI
WP

:'.KAIIAnJIa:rMf*::1'h:KT::ItKAI.UKN1.::::KVF::A.'KIIFIYrIJIIyIILKLFn.TID.~.LY:
:Q~LI::W7tJA'r/4:FLiY'.I:fI'/IL.F::II.IF~:::t.l'uINJINJtItIP.'.TH:VIJ)II.V:KWr:
EVUTIV

I2~

':LITAVL::.'.'ItIL:.1IfII'IAEIPF;AAKN.TFPEIfRFFLBRGVLLRPL.~'LN:
fOEEDLRIIY3tIL,;dIL::;
. .EKSPSV3LYIT53VNOLJWIL f'~3 WF.~"N.WIfITiL.SIT~.VNVf.PAYLA:3AAFLFKt.,~,h.:., ...._ _.. ... .._ .....
CYPKKCISIKAPLANITCIL.L1IVY CPn 104 ;r 96i3II C~57)4I~. , (:IPFYIDAGKKKKNAKTFFAKKEI11GNC!'IGLL.At.TAI~
NALVLLAf ~
~

. th~
::tWLIYA~LKYLF ~B
b 'bioD-dethiobid FLFL'IYHtIKt NRSPlTYFRANFrfIpRI
IIVOIL7IGYGRTIV~,,AtLARALNAEYWKPIQAL~.CtSDSNIV

N8L4GAYCitPGYALJIKPLSPHKAAO Lt>NVStEESHICAPK~fSN:.I
ILTSCGFLSPCTS

Pn_ 1072 l 19515 7 11955b5 KRL4CDVFSSWSCSN1LVS0lIYLC.RINHICLTVPJWRSRNWItGNVVlK3YPEDEEHNLT

CTJ7) hypotnecical protein OEIKLPIIC'LAKEILEITKTIIS~'YAEarItEVWI'SNf10CI0t:VSfYfPSLNLM
ItIA'n'rrYr'':.AFIr~%a~F..~.DO~tPOqPf'RTFr:fD.:ALI4AKIFNPtITVPYTSVt.PKEL

:':I.....:':~ir:iL1':i.an:~.'ii.::v:..
:iW:Y'LI':.:i::.: ridln ~
. ! . : -' : ':~~'.
. .
..
...

r , .y , ~ i., OtoF_:d-Oecononanoaca iynthase_--~!n='.'fn:..'P'. :.-IIF::.:.L,T."
. ::.'!:r.: elq:(fr...
..r i:::::

Atf.Ftr)PENAEPAKVN
pNLOQQFLIE7tLARRKSKHTYRS1SLNSHLIDFT~IDYL.GFASSPELRKIYITKLHAIES

LGAl'G$Ri.LTOHSrit~lIEiI7LAAYlWFESCLIINiGY'fAM.CLiyJILJI?OODRILJ~L

YtIGfIYDCIRLBK710SFPlliNND~tLEItRLASSHLCRTIVNESVYStJIDiVAPLOAt CPn_1037 118 CT372 hypothetical psouin SLLtaYSAYLIVOGNAVCV1GDOCi0;LV5AG.~hODINL~ITVIITF~.KJIfpTIKiMIIIGS
NNKKKDYSCE!'LTTtTIVDSIAFLPSEENFCYIKTILFFRVRIKHYA!'FYCEPfIISFRFLL

'ISSYAETPKCrBCHYNAYKJ1RI0KKNPESIKlSAP8ETPNIBISLISPV1NIFSILKDrLINKRPFIYTTAOPPHAL
T11IELJIYEIOiQRAPNORENLiALIIINFREKA~IiG
LSGLCAL APN

u LOLJItDNI'TTPIOSIGVSCSadtAROAAL.0I0N9GYDVRPIVSP1Y1LQREELLRICLN
I5L0FSILPOWFYPNK71IGOT011L.EIPSWOIYlSP
T
' t N T104LIDViLGHTLEQIFiCNVSSL
t p CSHPMCOCISVSNLLTSVEKA
NGVDI1ZKIAAGTASSINDIfifR1L41NLilOLTFB
a ' ' f O
OTFPGDPLTLJ1IGOYSLYAIDGTLYDNDOYSG'FISYALlON11S7lT1fBlaSTaAYLOITPN

SEIKVOLGFpDSYNIDCTNFSIYNLTESKYNPYGY~PKPSCCDCQ7f8VLLYSTRttVPCPn,1044 t~i 'bioB-Dioein 9ynthase G

.
AKLBOIEETVSWSLEDZREIYHTPVFCLIHKANAILRSNFIJISEtC':CYx.ISIR1GOC11lD
EONSQVTarSLNAAOHIHEKLYLFCRINCATCTALPINRSYVLCLVSENPIJ~IWA
ICFATNKVNAKAISNVNKLRRYESVtiGEATICP'DPYISLTPDFaLYIHPJILLtPEltlfl'S0 CAYCJ1QSSRYIrCHV?PCMOfIVDWEMKRAVELICATRVCI:.71AWRNAKZ7DRYP0RVL

VYGLPANLSL
IMtSITDIaAEVCCUGfC.SEEOAKKLYW1CLYAYNlB1L06SPEFY)rl'IITTRSY6DRW

118773?
IZ.WVNK$CtSl'CCOCIVDICESEEDRIKTi.NVfaTItoNtPESVPVNLL.9TPIDCTPLODO

CPh_1034 L188599 PPISliiESILRTIATARWFPRSlRIRL.JN1GRAFLTVEOOTLCFLAGANSIfYCDKii.TVEN
Predicted OMP fCT77I1 (leader (181 pePCide1 KTSWOKYKKYLSYSIWOKI1IRYVlOCIWLFFTILFSCSSFYASCRYAIVRSINEYACOILNDIOB~CIIIQi.G4IPR
PSfGIERGNPCYJWNS

YDEC4fWLILOt.DCILLOCGEaLSHBItrKSKAIOGL.OKOCTP~F~IfiF3IWPFWIEIOEH
APL 104s 1199603 119A90i CPeI

rTWPiESAIFLLIEKIQKOCKTTTVYTERPKT111~.TLKOLHtIIINSLtDTAPOPO.-LY,ISY;ILFSGDYNKGPCLDLFLE1CL.PLPAItIIyIDNQKDrVL.RIf~t.COKYCIAyeoni:ernd hypothetical bacterial Pla4t membrane protlib ' . LLLVLSM.VLSSKLIPI'LTFNFIIPOCiLILYPLTFLI
ALLLRfIQIDiE GTLPNNI'SNRKTLVFSYLSSTI"1 FGITyKApELHPPIYFItcIIAQVQYNYSKIfLLSNHJIASDWNGIPCP10LARVNIFSAFIJINLf.7lSSIVOIIMF
fPVASPEfIp'1'J~.!'17LSPLIIFL

CPeL1075 1190081 1188570 ASLL.71FZVSOOLDTVI'YTF'F~TFNSSWLRS1E''&71iIS0IPDTP'IVO'1'GILY!'OIGIS
' aroE-Shikimce S-t>.hyro0enase IRKFLQIPSTKI11N1YpLI!)pP
FPO'1TJLIIStYSYtYttITFCVt.TrPL.FYL11VM

WQLPIJIVPIVfiLQIWRFSNIYYGVBV!!t CATVSOPSFCEAK00ILItSLLQ.VOIIELRLD 1016 1100675 1199590 ITA CPet LINELDDOELHTLtTT110NPILTFRONLt~ISTU.iIIWa.YSt.AIG.EIIOBDIDVSLPI_ LOTIRKSNPKIKLILSYIiTDIOVEDLD11IYNHCaTPMZY1CIVLSPaISSEIItNYIKIGR'TtypCOPhan NroxYlase ' LLPKPSTVi.CNf.'I'fICLPSRVLSPLISNAIBtYJIJIGISAPQVAPCQPKLEELLSYNIfSIC.SEFONSOSLQR
AYSTPYSYYRIIL.OKENKfJDOIILA
VHYCERTLDPKIfILRIALKL.IIpSLSL!

RLSHLSiff4FLZSKLGiJi7ITYIKFPVTICEW1'FPSAIRDLPF~L.r~RH~IS
VVS1'PFPNRNWYRLLSSRFdiIIMS
KSHIYGLIGDPV~I

.
YCPRFFLDYLE11TGLLS>uLDl~7lVIKFPELETHFSYYPVBCFYJ1PNQ'IfLSI34DRYFPI
VTNPLKTAIFDHVOxLDASI1QLCESINTLVFRNOKILGYN1'DD~.IfAIC.iJ4QKNISVfIIK

HIA11GAGCiAAKAIAATi.fIMOGAfdJiITNRTLSSAAALJITLCKfRfAYPLGSL1~S1FRTIDIASVICtI'LDK
IxIFSLTPDLIHM.iLifNPWLLtIPSPSBFFItQGItLFTRVItIIVOALPB10C0 'INCLPPEVTFPWRFPPIVlIDINfKPNPSPYLERAQKNGSLIINCYGP'IEDALt4PU.WRt0!'i45NLIAIVRC1~
TVESr'L'IE~IBDRIUYQAVL'ISSPpC~rIUFIZSiVRVLPI'G' FPDFLTPE~DSFRNYVIDiIMAKV
DOIIALPFNTSTPOETLFSIRHFDEt.VG.TSKL&MLODGLLESIPLYNOLIIYf.I3GFEVL.

CO

CPn_1036 1191190 1189954 cetLlo47 lioosr 1iD13u arw-Dallyrowinau synehase dew-DShydroaipieounat. R.a~case cYDescRSCIILrNwrfmsel'uTTPHVVtmISNrFQLaa.FSSISTAYPLVIrravs vaoTaicPlt.otLlxiac.YwlvtTFPPaEPfa~ls~te:rraIaYOLVLroNISncssIICICDrasta~nssfenn svlccscxreKVIVSALEOSSEYTIaxrsRSSALTLe~wIllL~olrrv GTVLiIfIGFLdIATYCRCLPLYLIP1TITANVDTSICGLD~IGt1'~GII~Ri~01'FYLPKM'LOIStIPLLTXEWA
HLLISPKPLIIGTi'CiIfGtOCKSAHDSLEELTNIVWVYfrINRiLGAY

NCP~QFLSTLPREEyfIIHGiAEAIKtiGFIA~1YLWEFLNSHSKILL.FSSSOILNiFIKAI~QIIHKIIWIB.L.90 L.C~IPOFDIRIRITIBIRYWfDSLSGTAQDLi.DTIOpVI~BV00 TRISL~IIK
RDSSKKTIEVOSSR1R~I1QGIMlTIBS~OnvRNTVfERHVtCRCILSIt:<RdJCI'LX
I

S VL~CLLKK!'fDl!
KAAIVA~PYDRSLRKILNtCftsIAHAIElLAKGrVMKiOAVSVGlIIPOLYSIGDTLBL
P

TPOLIDOLDtLLKRFNLPSTLKDLpSIVPEHLLQiSLYSPENIIYT1QYDKKNL~tEfJOf.
O

INIEHI.t'rRA7IPFNGTYCASPNNEILYDILNSEOLVIRIHC
CPeLlO1 1201518 1201601 CPtI aad-ASparcate D.llydropenau ,-LIDERKC~IAVLGVDGLVGQKFVUiiIKWYRDiIIVIAEYVASNSKYCOSYOGCIt~pGIG
aroc-Chorismate smehase ' LHFSRGSRRSFLEELLATSYSRStiYLVKV~ISFGSLFSf'L'II~rESI~PSIGWIDGCPASNiIiTYRlIIIMII
PIIPJQNNRDLPIAKIEE110SDIWSFLPSSAESNEAYCLSQDIfVVR

.~.LELNESDFVPAfOtARRPCRiPGI'SSRXA4DIVOILSGV1f10GKTTt'.'l'PLSLOILNIWDSSIPEVNSOHF
OLLGOPYPGEIITSPNCCVSGITLALAPLRKFSLONVNIVTLOSAS~GY
' PYENSERLYRPGHSQYTYEKKFGIVDPNGOGRSSMETAiCRVAAGVVAEKlWIONIITLiiLN
PCVPSLDLLANTVSHIVCSdEKIL.RBfVICtLCSSKOPLPCKLSV'i'lIIIRVwJIYOf!!

AYLSSLCSLTLPHYLKISPELIHKIHTSPFYSPLPNEKIQEILTSLtitJOSDSIGCVISFIVTFfKDVDLDEiLYSYO
EKNIfEPPNTYQLYDNPNSPOARKIQ.Sl07t>t111V11LOPITYO~

TSPIHDFLCEPLFGKVHALLASAiIISIPAA%GFEIOKGFASAOIDIGSOYTDPFVIO~tIRTIKIIiVLIHNLVRGII
AIZ'LtJISNSiYFIDYLKRENCLR

TLKSNNCOGtLGCITICVPIEGRIAFKPTSSIKRPCR'nIrKTILlL111YRTPQ1GRHDPCV
1049 1101s86 1203911 CPft AIRAVPWFaMINLVLADLVLY0RC5KL _ lyaC-ASpas:eokinase III

EOfNSKIVriCFOCI'SiJITAlNICLVCDIICKDKPSPVVVSAIIIGVTDLLV~'CSSSLJtER
CPn _ EtYLRLtIEGKNEBIVImRJIIPFwSIIi'fSRLLPYLQNLEISDLDFARILSL.CEDISASLV
aroL-Shikiwee Kinase II

WKLELRNVM'ltt.~LPTSGKSSLGKALAKFLNLPFYDLDDLIVSNYSSALYSSSAEIYKRA11CSTROWDLGFLF~1R
SVILT~SYRRASPNL~IKAIiWtWL6LiJQPSYIIOGFIGS~

AYGDOKFSECEARILETLPPEDALISLt;Of.'rLNYEASYRAIOTRGAW!'LSVELPLIYERLCETVLLGRCGSOYSA
TLIAELARATEVRIY1'WNGIY1?IDPKVISI7JIQRIPEL8FEIlIp t.EKRGLPERIJIEAMCTKPLSEILTERIDRMCLIADYIPPVD11VDI1SSKSS4E0ASODLITNi.ASFCA1NL.YPP
t2FPCMPAGtPtFVTSTFDFEI~TWVYAVDKSVBYEPRIKALB<.SD

LT,>t$
YOSFCSVDYTVLCCDGLEEILGILESHCIDPELNIAOMJVStOT'VL4DODI
ISOEi1QE11LVD

VLSLSSVTRLNHSVALTCFIICONLSSPKWSTITEKLRGTOGPVFClCQSSIIALSf CPrt_1079 1194011 1191665 ELAEGZIEELlS4DY11KpKAIVAT

aroA-Phosphoshikimaee Vinyltrenaterase TE7WICAC 1050 1:0)981 1201798 P CPn O _ VCP!lILTYKVSPSSVYGNAFIPSSKSHTLRA1LWASVAEGKSIIYNYLDSdaDA-Dihydrodipicolinate Synthase KpHDAStKKFPOILEIVCNPLAIFPKYTLIDACNSGTVLRfM'ALACVFSKBI~1TCSS0 LORRPNAPL:.OALRNFCASFHFSSDKSVLPFTNSGPLRSAYSDVODSDSOFASJ1LJ1VACSGCKTKSYSRNVGRINH
LLTATVTPPPPNCTIDFASLERLLSFODJ1VONDWLLCSICIdiL

LAf7GPC.iF':fIEPKERPWFDLSLWWLEKLHLPYSCSOTI'YSFPGSSHPQ~'SY~'~FSSLTKKEKOALICFJH:D
LOtJIVPLFVCTSGTLLLEVLDWIHFCNOLPLSCFtIfl'l'PLYIIfP

S.\AFIAAAALW''K.it4PIRLRNLDILDtOGDKIFFSLII~fL.GASIOYtSIEEILVFPSSFSKLCGOIWFEiIVL
NAAKNPAILYNIPSRMTPLYLD'IVKAtaHHPpFtGIKDSOGS1IBBF

tX:StOMDCCLDALPILTVt.CCFADSPSNLYNARSNCDKESDRILAITBEt.QKMCACIOPTOSYKSLAPHIOLYCCD
DVFW.'EMAACCANGLtsV4SNAYIPEEAREYVLNP00pDYRSWf HDCLLVNP!;fLYr:AVLDSHDDtiRIAMALTR1.1LYASCDSRIHNTACVRKTFPNPVQ1'WLETCRW11Y1Z'fNPI
CIK.1IL.AYKKAITHAOLRLPt ;IEDFDLENVSPAVBSNLAfrPKf.RTS

NEARIEECHONY:a'NWSTNKRKVFARBSPC VFSYS

:Pn_1040 1191876 llJ4p7) CPn_LO51 120495b L205270 Nn rabu:lc htxnoloo present in \:anebank/FJABLNo robust fwmoloq present rn t;anelank/EfIBL
as of 1!/7/78 as of 11/7/99 RP::OSLFLRTWGPSSSFREHTVG1APLLYPRRRSPDYLFSPTGCPMST'CMtHPIHTASRFFM'PKSIOOLHLtIITt wFPVLKEtVd::NYW11AQWINTLSFt.ENSCaICKISASBNPTEVKEEVLKHAAEEFRHCtIYLHLATFICRCLILFL
TTLFLs7fICILHFITLPWICKEDPRILRKNK

KTQt.ikI: E?SLPDYTSKHLLGCLLTKYYLJILLDt~iTCRVLBNEYSLSCQTLK'1'AAYILV

'PlALELRA::ELYFLYHDILKF.1QSNITVK:iIILEE0CHL0EHERELKDLPtiCEELLCYACrPn_1051 1~O51D2 lZOnl6n :PECEU'.L~:fYERLFx'WIFDPS~'TFTKF No rottusc homolal Praswnc in r:enebank/ENOL
.rc of ll/7/9R

FF IQKMKYNSREK IK::ALR Iv.:..~.YC
ITVFRNNF :L.~.CYONI F'l::La,'YVFfaIPNS
ICR<R

In.l1 II',ei3pt IIId72" , :'.Fr.'PFIN:KKTEVETXEVKCKQETPP::L.FI:HMNKVAE.:FPYRkMLE:a'~'.~~Q~~IL:NLCA
':IY, _ t:NFLD:7~NL::ftNF:.:KEllli::"fIf'fR::K::nY410t:::EPFR'ITACC.1'I
I,trM.NLa>"t:yl:a.rlnv.min..H.Nntno-I-t7x.non.\naae.rVa.R::KI.A~aYEL

Nu s tu.c t . Hn: t.: r . c:. tTITACtGCII:RLKDV::D::I IR'fRAT::::
t L:VIK:::MUTRt'L::CTYY IVt:K.W
Pt.lFFFRLTSD

Id'HRLt'II'LISIKy'r:::laYI~FLVHFt.OP::EF~tSRK'fu.~.IILF1JCR:~1V.TY.F9YLRYl'x.KV
RROLKKKFRLF.f't.'KD

YN::fa:f IAL.W::Htl~'\:.INflY~t:LviKPNIIKdJIIKLLCfTMk7i::f~lf1):i:.Cfl.'7lxLWfIP!'f ~::,\Lp.':rl'IFIViH:fi:.\YL'iAEC:TRYGt:\t",.4M1'.NLIK:IK:IIPYITKKLCBOA~LLBIIV'L
t._UI'.n IW!r.ItH IIH.%111 tl'WFTIII;~:V.1:LV::KI.APt.1.t'Lt7LEtlFFP~DfX::.T::IF.tANY.IAVQWYNQtKIAKSHFVtY.
rmt~t::r ttrww,lnrl yr.:v>r.r in :.at..l.mklf7lfsl. .v:: .,t tl/7/,N

:L::NA'tlY:lI'LY:MC:t.V:P::ITtVPFIIDLFLP:::.T(MPY'fr:Kf:RLAfA011KTVFSB3NIKK:Y:I
illlf'AKIIAINIILYLTt'l~:IdI:VN:LIV.'17x.'IdWY::I.:FA::11611.W::KPNt:L:X:EP

:~\I' I'lf:l 1.1.x. ~u M :NIIffNM:t.:kLAI'IJJD'fYY::1 V::LIN:LH'/f.'~f::I:Y:NIIINiN
:LKR f LKL1KIIYr Ntl: IADRI LT:ft 1 DMIIVAfW. I:.\VItRY.TIIFIIKBIPrT
xl'fCPLFA::Ef'fD I

InItlf 1::I4:f:Iyi:YI.I'LAf:M'fKBLIII?AFV:.(1l*NYAL.LIIr:IffFTt:NPflX:~aMLrLiL::.':KDY
APF.~.I.TARE:IJIIaE7tiJlDffFYf/::LVl.~tx~r::~yrwrrKTNIxPprRlu:.7lKIr:F' in:fl::1.aml.~:'h.Nlf:la'IN:F.FtMAl8:::1.tJ('ROfYiI:I"JIJU.ffIPAFd'IY:YF30YROII
LNEL:NK::AF

izi SO

ipt3-Triosv0noleDnat~ Grass III'nd 1307O1U 12091br; IsCRE~71RIKFReiICEJIKNTR:'.
iLa.RiliIQIKTLCL1ICE11~..n1.:;.:.~CEF:.:~'I:IA
:Pn _ 5P!'f..~LMINEVIM'AI'JR!'Bwltifl~~l~lLStilll'~?~w:.Pl4rt~"' Nn rnoucc hoslolop pressnc m umebmk/DtBLR1ERR
as of 11/7/98 ::RwIOiRFtNOVLLSPOLPPPPOHSVCSIS3P5KLRVIJIITFLYPCNttt.I~JILFLTiGIHCP~AP'IASRVKlY
ApAC4YPV11E4mlSLlYRiGKAII~nIR
SE

Pt:L:aMISPGIGIGISAIICGVIJtI".tCLLCLLVKRELPIYRPEEIPELYSLAPSECPJIiQAPLIAYEPVWAICI
CKVAEAiDVODIHNPCREWAERPSF~ITAEEISI::ICwi~KYDNAQR

WKTLApLPKEL00LD'tDZOEVFACLRKLKDSKYESRSFLNDAIfK6LRVPDt~lllLp'fLSEHOC.it7NDCLi.VC
~SLEGOSF!'EVAKNFNV

IFELROIYAOtCNDIJtFLILI;GRSLl4t'tAFSESLDGFINSKRLCYLP9CDVRG~i.KKSA

!ri!t'/IIPr~IJN:LrtHIIVAYAPDRN.iYIIIMEKAFAKALIiALEE3VYNSL'MSYRDKFLGSECpn_tOK1 i22071n 1~309a5 ' ' ~

.:al:i.:.::LLvIX.:::16'FY.LiI:nn~7.YIVFIirh':t1.h'I:EGT:F:::7NL4:Did~ST. .":.
. .,.,;: : :.."..,.,,:v.:
.
..
:
r :

..n:~ ~,., :..."r.,."i,. . ..-..v..yrp,:F.Y.Nt.:.c.'EF.~HAppLY, . , ,.
...\I;.". .
. h......tt...= ... ...rntaNrw:v:.;:'.,,....~,_,,...
.., -.;,y: t"::
........~~!r--...~,;.._.~y..,...

IVROKYOQEF~CRLCIfiiiALYPCVSVSIR~IKIOETRSNL6KAYFJIItatIfRCCVRE

~p,~~pl~,ptZLS ZLKS,t~ CPt1~1065 1221110 :::0928 TAEVtI~RCILS011ESRLtIVFIOVKiMPCRIECIEKT(JWA6LPLLPTKKAtEKACSOYNSNo robust homoloq Present m Getlebsnk;C~BL
as of 11/7J98 GC4.EKVKPYGKESLAYVTSKERLVSLD6aLRRAYTECQKRPOC~.ESEVRACREOLIWRIL:RNRRTSDPCTLfIfFS
IPEFSLPPDSCRL~IOItPKNEIILPSILtxKPIIOYLKZTSI

RCRIOEFClOGLOL ~Y Y~

y YIIEERlGIKEKIILYGTfIIVAT

OORVAAfFSIEVpEIPGPtFICPSLLDKARSLPTREOHTCPell086 1221132 1:211 No robust t>omoloQ Dreamt in GenebanklE!~L
as of 11;7198 SNSWCEIGItI'VLIYAFLFIFLILCYiLCCLILVOESKSIGLCSSFCVDSCDSVI~GIISTP
CPO

,-DILIGM'SiuCAYAFCZGCLL'SFSTNLI~KIILDAKEFLLPAAECSDTpASSISVGOES
No robust homolop presort in Gsnsbmk/00L
as of 11/7/9 CKYLYHIiSYPPPPDNSNGAf'FCLSKFRVWITFLVI.CrIILFLISG71LFLTIGI9CLSAAIS

FCLfJIGLSALGGVLWSCLtGi.LAtOtEVPCVRpEEZpixVBVApSEEPALQATOKTIaOLCP1~1067 1221675 PKEt~OL~tYIOEVYSCLGttLItDLRCEDt7GLLItORKFJG.OYIrDAMItDOf1'EIVG.OOIHdeE-Polypspcide OatosRylase OpELyIYLKCLIOEtOtDIGSTLFHSQVSLFKWFWIrG7fLPSGDURGERWISAR,6VIlORFIIIQVLWRDFPTEL~0 711IVQ1ItIRRLEYYCSPILRKKSSPIAEITDEIRNi.VSt>IICDZItEA

RRICDTRIfVAM'FDRN71YGVAXT11P
EYIGITKILIlDBNRGIA7W1P0YCiQiVSLFVNCVORFIEDCELIFSESPRVFINPVLSDPSETPII~ItGCL

E7CILRICYLEIRR SIPCLRCEVFRP~fIl'VTAl07Lt~KIITCNLDGE'fARI
INHLTpNLNGYLYIDLNEiPKD

pKKFKI4RLEIIIIOtJtYNI'NLZ;Xl~.VS

CPttr1056 1210182 121122 No robust hosolop presort in Gmvbank/D~LCPr>'1065 1223267 1222365 as of 11/7/9 CEDIKDNtSRVEEI~l4.RVIELPLLPIKC~Ai.EKAIYpYNSYKAKLTKVCPCFRESPAYIrnh1-Ribonuelvase NII

TSEERtASLOp'1'LERJ1YKEYpKRFQEPSRLFSI~PPPFVKLT?SAOtdILRDOLKEKtiFIF50P0It1YFQARSN
MCTLYPSCKLYIOG

IFVSWLFRKHVSCLVSTVNVP
IYSKVAKAFPS4.KGSEEPIEFFLEPEZLNiPTlIARVDpDLRPNLGVDESCKCOFFGPLCIAAVYASNABILK

KETLEIG1KJ1PREEtYWLILEERKSKFJ(RLI1NKIEA71QORVKt%.CPPPIKE1'dfpKRKKEKLYCtKVODSK41 IJ071'KIASIdRIIRSLCVCDItIILYPiKYNELYCKIOt~KM'LLi1W11HA

YSFFIALKS
TVI~.APKPAGwP'AISDOFJN1SEYTLLIL1I.~Ga"rDITI,IOKPRAEODVWAAASIL71 RDAFVOSIOKLEEOYOVOLP10GIIfiINVKJU1GREIAKpRGKELLAKISKT1IFKTFf%ICSG

CPn_1057 1211167 1213596 K

CT356 hypocheeical Dsocein IINFYFFNFANPEPLY1T0Q.ITnLSPYLLLYAiITPVNWYPWCAFJ1!'NIMIENKPVFISICPtI,-1069 GCKNSRNCpVlGpESYTNPE AIC.YGDt.At5ILAVSGdfQYt9A-HTN Trmseripcional Rpulaeor ETVSIiPLNVILTPOLVPFFSVNYLONEGKLGf.PSPPOZIDKLiFl6IE011EEREALVD2ANVIIQGtINKt<.LN~
rEIFRSSRESOSLSLIG7VG71TSIRYSCLFJIIEpOCLCKLISPVYA

KVLEIASFLEGCVRKEILDESSLIGtTVAALYODIDPtMDGVKAFPKRLPGLLLOFILRYSOGFIKKYJ1TYLGLDGDS
IL00IPYVMIIFXEFSDt0~91EfILLDLESIGG'RNSPERAINSItS

IGGGVYSYTI<7DIOd.IPAFa~RLIDIIiItJ1Ai31YIdIJRi7YGLIIZOCI1MIWLL'.6GFSIF

LFAWICIGKaYRGICKpILSYILSELYSP1VCAFYSSE011DRIZ11G00ER''Y911SVEEZS
.

NAiGtDAEIPCDYYDISRECFPNGRItILNIPVNREIEELS1DCY11RSItAIEDIVDRSRDI1225523 1221114 CPr>'1070 LKGIMOR~SmSKDtlLSLTtMifBSIIYTFAYAGRLLGEVEYIEICKI~GtPVIINSLYIQIHNo /robust homolop pswmt 1n Gmebmk/ENBL
a of 11/7/!1 YESOCGSFWLSFAEQJIpEWLBPRSEEOC RPfIJlIFPCtiJaCYYRETPPPNPOG~IPLO
ZSL

FYSVOGRDSTLLIKGSPLSOGiTIS~01LI~.LSLHLITDWDILTYJIt4IL0IA0ACPiiCREIICiCFL06tI9K~D
CACCL

Atrl7(KFS51GLLIAS(S7YPSR10iVKVLIAiGDQE~tSPVLKCLSGLFLPYLSLI>~sf10fl~1ETVODPDNPSA
OFLQOLIOOYGPZCVGNtF00GPlICI'OICIEOGEPLG~1~ESI~iOCKL

OEIfLCfVLPCYEE1CLIPKGDCTAITI7fVi.LYDpCKRFKDLELFRR7fLISLHRELLKAAOPIIL7lCESL
VSE9AL5FYPStaIIPtC

WIIQPEppPCPPTPTDELpLOCAVOGAPAPppIGWP

CPtl..1058 1217742 1211536 LSLESGYIt3PLG0ANI0IVOLIKKSLKRLVASDLATfIGPGICLSLT~pVIMNLICLL

CT355 hypoeMCieal procsin SKGYLPLDPLNPEO'M.DPAl100PNORILRKVLV'1T111GZ<llIwRqI00GtR0itPIPIDP

EVIeQ.YpTLPGIVLVS7CCIFiL.75K;GYAAEVPVTSSGY82tLLESKEpOPSCIJ1INDRILWODD6IERDGlVDO
GGPGIPCQCLRfSiRKLPTEKItPNAWL

FKVDBENVYtALOVINKLNLLFYNSYPHLIDSFPAR80YYT11l1iPV11LiSVIt~Ft3NAD

AIUIIOtIATDPTAVNGEIECtQCR~.SPLYANFENSPNDIFNVIDR?L?AOIIVIKSSNIISKCPfIL1071 vhS.KYfPGKIREYYWtLEWSRKVIwKYRVGTIKANrE5Li1S0I11DIMWtLNWI191DNo robust harolo0 prnmc in Gmtbmk/OIBL
as of 11/7/9 KDRLTALVISOGGOLYCSEEFSR>2ISELS05HKOEL~.IGYPKCt~CGLP7GWKSCYIG.YIIKC'11'IN~CPNILS
YtPRlCCNFfICEANI:ViTI'EGTTRQSASDISEE11L'wRSOGAfIPITTO/1'KI

LGDKTSCSIEPLDVNESKIKQNLFALEAE5IILKpYKDRLRIOIYGYDASNIAKIiSEGPPTlfVO0tV0PNTApGDCS
I'IISIIpF.~.VDSILSHRRZ'pCCtEYCYD81LA'i~C~ROGSP

LFSLf.
CRLICGTYKACCLDRLDNpIIAGLVItECEpTIIGPIAYAL11AK1fGLNLIIELVIKNtILStE

QI~AQtICSFaKI'OLYQINQSLSONFFLEGVNSIRERGLDDSLVOAVLffIl1?RSii~fT

CPtI,-1059 1211118 1215678 IESPlJ15G1'SSAWi9TRIPACYZ1lX11'SPLTfSRLSCGSROJIRIIPSSVCAiPOYVAKKYND

kysA-Dialachyladmosine TranstsraseNDiiWOLGIIIW'Il~it.KTGDPSAiGPFCLLIV10~ISFLLSASOSTSSZLKH1'GGEICYTC

VTRSSPAOLSRFLSEIONKP7GLSLSQNFLVDQNIVKKIVATSEVIPOWVL6IGPCFGRI.PNFRDIWLLt4.AIGYCP
AM'DLTSWDIIMIDDPIhII'IFYRLOYSYR!'OKTSASFIJOGf TET:LIAIIGApVIAIEKDPNFAPSLCELPIRLEIZDACILYPLDOLOEYKTLGKGRWJ1NLPPSLVROffSLDCPTPA
ESVPLNSSLEEEDE~DDEDCNIJ1YQ0RILEGSCNL.pTLFLGIK

YHITfPLLTKLFLE7IPDFiiRTIfTVNVQDEVARRZV110PfxRDYGSLTIFLOFFADIHYAFIMO

IfVSASCFYPKPOVOSAVTNNKVIIETLPLSDEEIPVFII'LTRTAt'OORRKVLAHfIJIGLYP

KEOVEpALKELGLLWYRPEVLSWDYLALFNKNOAGCPn_1072 1227921 12235 No robust homolop Dresmt in Gmvbank/E!~L
as of 11/7/98 CPn_ID60 1217691 1215727 KKDYILIIANWCCWKONLKIOKKRNCVSWITYCJ1IVCFFNSADAApKKIDCIPIOILYSFT

~cs/tkt-Transkecolase KYSSYIJG~ICDASTIFC11DVORGLt.OtIRYLCSPCWOETRRRQLFKSLCdOSYGNO1LCEET

YXRILYIHITKVIfI'SSSCPLLOLILSPADLItKLSISOLPCLAEEIRYRIISVLSCIOCNLLAIDIFNNKDCLC&EI
PZONE71ILJWSSALVLGISSFCITGIPATLHSLLRt~M.SFpKRS

SSMGIVELTIALNYVFSSPKDKFIFDI~pTYPHKLLTGRNNBDFDNIRNDNGLSGF'1'NIASESFLLKIOSAPSDASV
FYKGVLFRCE'1'AIVDALSpLFAOLDLSPIGCIIFL..'EDPIW

PTESDNDLFFSGNIVLTALSW.GIUIQITPLESATItVIPII~GDAAFSCGLTLEAIl~NISTDOAVCSACIGWCi9Q( FIGLVYYPJ10ESLPSYVNPYSTATELOEAOGLQVISDLYAOLTIiJAL

LSKFWILN01~4~RISISKNWGANSRIFSRNLIIHPA'fNIG.TKOVFJWLAKIPRYC06LNilISPKNN

RRISpCVKNLPCP'fPLFEGFGLAYVCPIDGHNVKIfLIPiLOSVRNLPFPILVIIVCI'tIOCK

~LDpwOMdPAKYtIGYRANFNKRfSIUtHLpAIKPKPSFPDIFGOTtGELCEVSSRLNWTPCPn_107) t=9011 1NSIGSRLECPKQKFPERFFDVGIAEGNAVTFSAGIAIfMR~IPVICSIYSTFIJiRALDNVFPredicted OMP
IC'I'37l HDUC7pDLPVtFAIDRACWYGDCRSNtIGIYDNSFLMIIPQNIICOPRSQWFa0LLY5SMRRYLIMVGAL(:LYRAAPL
F~1WIKITDJWJ1VLKFAREKTLVCFNIED'1WFPKONNCOS

LNYISSP3AIRYPNIPAPNCDPLTGDPNFLRSPCHAETLSt3CEDVLIIALCTLCFTAGSIKAWLYNRELDLKTTISEE
pAREpAfLEWNGISFLVDYELV:ANLRNJLTCLSLKRSWVLCI

H0tL1YCI3ATlh'DPIFIKPFONDLFSLLLIiSNSKVITIEEIiSIRCCLIS'EFNNFVATFNSORPVIILIKM'LRI
LRSFNIOFTSCPAICEDCwtSNPTKDTfFDpAMAtFJWILPVGSLK

FKVDIWFAIPDTFLSHCSKE11LTKSIGLDES9!lINRILTHFNFRSKKQ111GDVItVNCOPNDAALEYLLSGIaSPP
SOIIYV000AERLRSIGAF~_KKANIYFICNLtf'fPAKpRVf :.'YNPKLTAIpWSOIRKNLSDEYYESLLSYVKSK

CPn_IOGI 1217932 1217666 C"330 hypothetical Drocein ' ' FI:SIINEIHNKDPSLKKLFAi.ppSLFfL.NSLSDIVATYEAMFSLIYECLNKALRKDQt.CY' LIa'1ltdSK.fLLKSPSCDPIVQTFPINPNN ' RNA SECTION

;:Pn_lUi.2 12191135 1.1815) . . . . . . . . . . . . . . . .
. . . .

><aw1-fxaJoxYriOOnuclease VII

Ix:FPM::.~.PCOIIVA:iLTERIKTLLESNtL'OILVK~EL::NV~LOP:X:HL1FCIKDSOAFWcmlsHA I
\N.t'r, 1 ~ND7A

:AFFtIFK::KYYDf:KPKDf:OAV I I14:KLAVYAPRI:QYq I VN IALVYArExDLL4KFEETKR

Id.TAFxYFXrl7:KKPLPFAPQCICViT:a"n;AVIpDILRVLRilxam.~l...v::. f NN:\ n.n71.11 :RRARNYKILYIPVIIroCN ,4'L.4o ::AAIIEI::KAIEVtfUIfNLIDVLI T.1R(3CC;:IEDLWAFNEEILVKAItIA:'rIPiVSA11G71E

'rDYTG.'hvA::WfNP'rP::AAAEtVt'.KC:EEf~)VFFY:'ILRHLL::II:;ROLLTCKK0f3LLPW1~..:
rICNA lUrun.r.d Itlll_ll.:

I!I! vLDfIAEFYTTIIt~(~LOa IE LA10Kl3V(~CK
I IIE.,Kr,INYDN I::RWLtX:DLYwPMlCRLOS

LKKNL:rJAL:YIKAI::WVRr:IIQLKK::LT1PR0It~11.:OKL::ISi~LDTLt~RRLIIYOKE.:s:: rNNA
Ino:4l'. Iml'..:'!Ir !:YF11KIIT1LKl W IN1II.EWLIt::IIVQKf.ELLCRNL::MX.'EIIJt4>NVK
IA1:WYKETLATI L

h:NNYllI::/ARY::ALKEWJI::WPKNVLKRUyANLFDFtif:Pt::AHL::VO::I.QeWIVRi9L0..: tNNA
114~'.t: Inn..ur I r:Ell LTIrt'H f R Ir:KL IKI:
~'Ar_t4a; t~l'~'N1U 1~~U71:

WO 00/27994 PCTNS99/Zb923 ' tlldAi . . . . . . . . . ~ . . . . . . . . .
CMUI 1 6aqln Erxt Type Codon t 99657 89728 Thr CCl' 2 90o9N 91070 TrD t:~:A
w:c ~~~~~- ~M~r 2ri075 294117 Val TI1C
6. 296151 296111 Asp GTC
7 109818 109921 Pro T0G
8 167111 162211 ArQ CCr 9 671=36 67231A Lw GJI
677161 677337 7tp TtC
11 739103 739186 Leu G1G
12 781610 781110 Gly TCC
13 781~7? 781196 Glu T'1C
11 781912 781991 Lys T.T
836119 836191 Ala OOC
16 813926 813999 Pro ODG
17 877400 877473 Acq 11CC
18 10~3605 1085676 Cln T'1C
19 1112031 1112118 Ser TCA
1175163 1175911 Iwu TJ10 21 1230028 1229912 Ser C'aA
22 113?162 1137389 Val G11C
23 1030603 1D30533 Cys OC11 21 1000072 999919 Mls GTa 961607 961536 Gly GCC
26 A07113 807311 Arp TCT
27 7es7eo 7es7oa Thr car se 716971 71se99 Leu T1N
29 70AN1 708351 Bar OLT
68D~59 680178 Leu 6710 31 671115 631373 Phe G7N1 32 626987 626901 Her OGiI
33 293177 293105 Thr 'rC1' 34 293399 293317 Tyr CrA
269112 269070 Ala TGC
36 269065 268992 Ile C11T
37 161389 161318 Asn GTl' 38 87522 87150 llet GT

Contig463 Length: 273254..

85i TCGTTTGAGT AGCAGTCTAC GTTTTTTTCT TGCCACGCTT TTCCCAAAGG

1051 TCCAGCCACC AAAGCTCCTA AAGCTAAAGA AGCTAGGATT GCA.T~GAGTGG

2501 TTTAGAAGTC CTrt'GCTAAAA GTTTTTGAGA AATTTAAGAA ATTCGCAATA

3751 ATCTTGAGTT 'CCGTAGGAAT TGCTGTGAAT CGGAATTCAT TACCTTCAGA

WO 00/27994 PCTNS99/2b923 9401 .CATAAGCACT CGGGAGATTT CTTGTAGGGC TTTATGTGGA TCCGAGAGGT

11851 AGAA.AAAGTG'TCATCATAAG AAATCCTTAG ATAGGATAGT TTCTTAATTT

12001 GGGTCACTGA ACTTAGGGGT ACCATA'hCAT CGTATTCATA GTCCAGGTTA

12$51 TGATGAAGAA GCATAGAAAT TCAGAGCGCG GTCATCTCGG AAATAACGCA

14051 CAGCTGCTAG GGATTGGGAA ATATTTTCTT GTACTTCTGG AGAAGA~ATT
19301 ATTGCCTGAA AGTCCGCTTT AGCGTTGGTG ATGTGGCCA.A TAAATTCGAG

19951 AAAGCCAATT'GTACTAAAGA AGCTCTGTTA TTGCAACTCC TTGATCAGAG

20101 GTCGTTATCA ATAATTTCTA CATCAG~TTC TTCGATATGG TCTTCTGAAG

20651 TCGAAGGAGA CTTCGATTTG~AGGATGGCCT CGAGGAGCCG GAGGGATATC

22751 CTGAAGGATA GTTTAAATTT GTAGAAACTT TG~'GTCAGGG TTTCATTTAT

26801 GTTGCTATGA CPtACATCGTG TTTGTTTTGA TGTGGATAGT GAGACGGATT

28051 TGCTTGCAGG'GTCGGGCAGT TTTTCAAAGG GGAAAGCTAT GATCTTGCTG

28201 TATTTCAGAG GGTTGTTTTG GATGAT'IrCAT GGATCTTAGA GGTTAAAGTC

G'J071 I.HHIH1Vt11t1 l.1Vt11V1MV MVlltat~t~.ean saV,.V~.rm.aal. t~rv.ramava>rr WO 00/27994 PCTlUS99/26923 33901 ATGATCTGTT TCCATGGCTT TTAAAAAAGG GAAACAAGGT TGGTCTTCAA ' WO 00/27994 PCTNS99/2b923 36151 GACAAATAGA'GAAAGTTGCC AACGTGCTTT GCGGTTAACG TTACAAGATC

36301 CCTCAAGCTC CTGTTCATCT TCTTAT~ATT CCTAAAAAAC CTATACCACG

40201 CTTTAATAGC CAATACAGAG CGCAAAGTAA ACAC.TGTAAA AGGTAAAATT

WO 00/Z7994 PC'f/US99/26923 44251 TCTTTTCTGA.CAACCCGATG ATGCTTGTAG TAGTGAATCA GAAGAAGCAA
4430'1 ACCAAGAGTA ATGATATGGA GCAGAACGTA TCCTATGGTA CGTAGAATTC

WO 00/27994 PCTNS99/2b923 4505'1 TGTCCGTTCT TTTGATAGCA CCTTGGAGGT GATTATGGAA GTTCGTTATG

45501 ATGATTGTCA TTTTCCCACA GGTCGGATTG TGGGCTG'~GG TCCTCGGGTT

45601 GTCTCGTTTT AGGCAAGACT TTAGAACC'FA GTCGAGAAGC GACTCCTCCA

4?201 GTACATGGCC CCATTACTTC TTTATGGGCT TTGGAGCCCG TGGGTAAGGG

49251 CAGATCGAGC .TAGACCGTAG AATGGTCCGT GAGCGTATCC ATAAGCTGTC

99751 AAGTATCCCT .ATGAAATTAC GTTTGCTCTC TCCTCTTCCT GTATTGATTT

50601 GTCTCGAGCT TTTATTAAAG TTCAAGATGG CTGTA_~.TTCT TTTTGCTCGT

51001 GATTTTTTAG ATTGTGTAGA G,AAGTTCCGT GCTTCTGATC CTCGCTATGC

52351 GGTCGGAGCG~ACGTCTCTCT GGATCTAAGA GACTTTCTTT AGGAGAAACT

wo oom~4 PCT/US99/26923 WO 00/27994 PG"T/US99126923 55551 GAGAGCCTTT CTGCTTACTC AAGAP.AAAAA AGATCTTTTT GTAGACACCT

WO OOI27994 PCT/US99/Zb923 63151 TTATGATTGC'AGCTTGGATT GCTCCTCCTG AAGATTTTGC CTTGTTGTTA

68?O1 GCATAAGAGC ACCTCCCCAG TTTCCTTGAT TGTTGGTGAA ATATACGTGG

70001 ATTTCTGTGA GC,CAGACTAG GGGAACTCGG TGGTGGTTCT TCCAAGAAGC

71251 CGCACCATCT~TCAACAGCAA GGACTCCTTG ACGTAGTTCC GAAGTGTTCC

71951 GAATTTCCCT CTATATTTAG. AGAGTTCCCA CTATAAATCC CTCCTCCTAA

78001 GGGAGAACCC CAGAAGATCC C.TTCGTAGAT ATCACACCCA CAGAAATTGT

79351 GTGTGATAAA'ATCGTGGCAC AGAAGAACTT CTTATTTACT TTAGACGCTG

79501 TATTCAGAGT ATGGTCGGGA TTTTGGG'ATC TCAGAGAACC AAGAAAAGCT

79601 TGCTCTTCTG ATGATGACGA AGATGCAACA'GCAACTTCGA CCGCTACAGG

80051 TACCGTTCGA CAGCATTTTG TTAAGGCGTT TGATTTCTCT CGTCCCTTT~' 82151 TGCCTAGGAA TATCTTAATT TCACAAACCC CAC~GAGTGCA CAAACTCCTT

86201 GTGATCTCAG AC;CTTCCTGA AGGGCACCCC GATATTCGGA ATTTGCAGTT

$6851 GGGAGTGAAT TCTACACGAG AATCGAGAGG AGAGCGAGTC TTTCAAGGAT

87451 TCTTAAATGG'TGTGAACCAT TGAGAAGAAC ATCCTATCGG TAGGGAAACA

88151 GGGATCTAAG ATATAGGTAG~AACGCACGAG AGTGTCTGAA TTCCATACAG

90251 GAAGTATGGC .TGGATGCTTG TACTAGCAAA GTACTTCTGG TTAGATTATT

WO 00/27994 PC'f/US99/26923 91151 ATGGTAAACT CCCAAGTTCC TTGATACCCA TAGGGAGATT GCTGATAGCC ' 93901 TTCTTAAAGG.GGATTCCCTT AATTATAGTG TAGACTTAGA GATTATGGCG

94801 A.AATCCTCAC GTCATCTAGA AAGATGTAAG AAGTTCCTTC TGGATAACAG

95951 TTGTGAGCGA TAAAACAATC TTTGTCTCTA GCAAAGAGAT GGCAGAACGC~

96901 GGGATCATAG AAGAAAATTG TATGATTT~'T AGCAGCCCGT AATTCCGTGA

WO 00/27994 PCTNS99/2b923 98351 CTGGAATAGA CTATACTCTG ACAGGAGATA TA.T~CTCTGCA AAACCTTGGG

98501 GTGCTGAAGG CGCAGCACTT TCTGTTACAA CTGA'1'AAAAA '1'C'1'c~'1'C:c;c.'1'A

WO 00/27994 PCT/US99/2b9Z3 9 9 B 51 TACAGATV'1"1' (:(:A(iC:(iCi'1"1'C: l:'1'Hl:Hli'1'Hht_.
HHI,:'1'l.l.'1'HL V l.rW.1 r~ ~ vvv i 101651 TCGTAATCTC ATAAAP~AAGC AAACAGAAGC AGGTCTTATC TTTTTTACTG

101$51 AGTTTGTCAA AACTTTTGAG AAGGGAAATG CAAAAGCAAA ACAAACGATT

WO 00/27994 PCT/US99/Zb923 105301 TCCTGTGGAC TfiGATGGCTC CTGTTCCTGT GGTAGCATTC GTGGTTTGTA

110301 CGATCTCACT GTTCGCATCT GCTAAGTTTA AGTTCAATG't' GTCGGTAGAA

111751 TTCATCTTTA~TAAAAAGTAT GTTTTTCTAA GATTCTCGGA GAATCTTAGA
11180'1 AAGAATAACG AGTTCCACAG TTTGCATTAT AGCTTCTTGA GGAGCTGCGC

115801 AGGAGTAATT~GCAGATACAT TCGTAATAGA AACATCGCTA GTGAGAGTGT

11?601 ATTAGGCCCA TGAATTTTCA TTCATAGGAT ATATTTCATA CTATTATAAG

121301 CTCCTTAGAG GTTGCGATCC TCAAAAAGAT CAGAGCTA'~'T TTTTATCAGG

121451 GAA,AAAAAAG ATAGTACAGG CATTTGCTTT ATAGGGAAGC GCCCTTTTAA

124601 AAGGGAAAGT TATGCACAAA'CCTTTTGTAT ATGATACAAT AGTTCAGCTT

125$51 ATCGATGGGA AATATTGTAT TTTAGGTGGT ACCAATTTTG AAGAGTTTAT

130401 AfiGAAGGAGA TCCCCATTCC CTGGAGGTTT TTGGGAATAT CAGAGTAGGC

130651 AAAGAAGCTC TAAGATTTGC GfiGAATGCCG CAATCACCAC GATGAAAATA

132001 ATGGATGGGG~CCCCAAAGGC CGAATCCTGA TATAGGGAAG ATCAAAGCTT

132151 CGGAGGGCTT TCTTGATATT TCTCAAF~AAA TTCAATGGGA TTCAGATTTT

13?501 ATTCATTATC CAACATACAT TATCCTTGAA CAAATTGAAA GATACGAGAG

140101 GAGAGAGAAA~GCTGTTGCGT TCTCCTTTGA ACCGTTTAGA TACGAATCGT

140801 CTTATCTACT CCATATTCTT~TGGCTATGGG ATATAATATT TTGGCAACAG

143751 GACTAGGAAT CGCATGTCAG AAGTACTGGG P.AAAAACATT TTAGCTGCTA

146751 TGAGGGATGT TCAGGTGGAG CTTTGGGCAT GGCTGTAGG'T GATTCTGTAG

148201 ATGTGCTGCT~GCGGAGAGAT TTTATGAAGT CTTGAATCAC CCCGATCTTC

149551 CTTCCCCAAA CATGGACAAT ATAGAGAT~T AGATGGTAGT CAGTACAACG

IIS
15$851 TTTGAAAAAA CAGAATATAT GCTATCAAAG AAGGGTAAGT TGGGGGCCTT

164401 AAAGCAAGTT~ATTCCAAGAG AGTTTCATTC CTGTTGTCGG AATCGTTCCG

WO 00/27994 PCTNS9912b923 165751 GGCATAAAGA AGAGACCTTG CTTTGTAAC~C GTAGCTCTTC TGTACGTAAC
16?051 TCTATATAGT ATCTCGTGTA CTATGCCGAG TATAACCGAT CGGCGTTATC

DEMANDES OU BREVETS VOLUMINEUX
LA PRESENTS PARTIE DE CETTE DEMANDS OU CE BREVET
COMPREND PLUS D'UN TOME. - .
CECI EST LE TOME _ ~"DE c~
NOTE: Pout les tomes additionels, veuillez contacter le Bureau canadien -des brevets :;,:
JUMBO APPLICATIONS/PATENTS
THiS SECTION OF THE APPLlCATIONIPATENT CONTAINS MORE
THAN ONE VOLUME
THIS IS VOLUME ~ '-OF _ .
WOTE: For additiona'1 volumes please contact'the Canadian Patent Off~cE ~ -_ .. . . . . ~ , ..'. , . _ .~. ,. 'w

Claims

What is Claimed is:
1. An isolated nucleic acid encoding a C. pneumoniae protein as set forth in Table 3.
2. The isolated nucleic acid of Claim 1, wherein said nucleic acid has a nucleotide sequence of an open reading frame in SEQ ID NO:1.
3. A probe comprising a hybridizing fragment of an isolated nucleic acid according to Claim 2.
5. An isolated nucleic acid that hybridizes under stringent conditions to the nucleic acid sequence of Claim 2.
6. An expression cassette comprising a transcriptional initiation region functional in an expression host, a nucleic acid having a sequence of the isolated nucleic acid according to Claim 1 under the transcriptional regulation of said transcriptional initiation region, and a transcriptional termination region functional in said expression host.
7. A cell comprising an expression cassette according to Claim 6 as part of an extrachromosomal element or integrated into the genome of a host cell as a result of introduction of said expression cassette into said host cell, and the cellular progeny of said host cell.
comprising:
8. A method for producing a C. pneumoniae protein, said method growing a cell according to Claim 7, whereby said C. pneumoniae protein is expressed; and isolating said C. pneumoniae protein free of other proteins.

9. A purified polypeptide composition comprising at least 50 weight % of the protein present as a C. pneumoniae protein comprising an amino acid sequence of claim 1.
10. A monoclonal antibody binding specifically to the polypeptide of Claim 9.
CA002350775A 1998-11-12 1999-11-12 Chlamydia pneumoniae genome sequence Abandoned CA2350775A1 (en)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US10827998P 1998-11-12 1998-11-12
US12860699P 1999-04-08 1999-04-08
US60/108,279 1999-04-08
US60/128,606 1999-04-08
PCT/US1999/026923 WO2000027994A2 (en) 1998-11-12 1999-11-12 Chlamydia pneumoniae genome sequence

Publications (1)

Publication Number Publication Date
CA2350775A1 true CA2350775A1 (en) 2000-05-18

Family

ID=26805735

Family Applications (1)

Application Number Title Priority Date Filing Date
CA002350775A Abandoned CA2350775A1 (en) 1998-11-12 1999-11-12 Chlamydia pneumoniae genome sequence

Country Status (5)

Country Link
EP (1) EP1133572A4 (en)
JP (1) JP2002529069A (en)
AU (1) AU1722300A (en)
CA (1) CA2350775A1 (en)
WO (1) WO2000027994A2 (en)

Families Citing this family (75)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
ES2283012T3 (en) 1996-01-04 2007-10-16 Novartis Vaccines And Diagnostics, Inc. HELICOBACTER PYLORI BACTERIOFERRITINE.
EP1105489A1 (en) 1998-08-20 2001-06-13 Aventis Pasteur Limited Nucleic acid molecules encoding pomp91a protein of chlamydia
US6686339B1 (en) 1998-08-20 2004-02-03 Aventis Pasteur Limited Nucleic acid molecules encoding inclusion membrane protein C of Chlamydia
AU5366099A (en) 1998-08-20 2000-03-14 Connaught Laboratories Limited Nucleic acid molecules encoding inclusion membrane protein of (chlamydia)
US6649370B1 (en) 1998-10-28 2003-11-18 Aventis Pasteur Limited Chlamydia antigens and corresponding DNA fragments and uses thereof
US6607730B1 (en) 1998-11-02 2003-08-19 Aventis Pasteur Limited/Aventis Pasteur Limitee Chlamydia antigens and corresponding DNA fragments and uses thereof
CA2353107A1 (en) 1998-12-01 2000-06-08 Andrew D. Murdin Chlamydia antigens and corresponding dna fragments and uses thereof
US20020061848A1 (en) 2000-07-20 2002-05-23 Ajay Bhatia Compounds and methods for treatment and diagnosis of chlamydial infection
BR9916020A (en) 1998-12-08 2002-01-22 Corixa Corp Compounds and methods for the treatment and diagnosis of chlamydia infection
GB9828000D0 (en) 1998-12-18 1999-02-10 Chiron Spa Antigens
WO2000039158A1 (en) 1998-12-23 2000-07-06 Aventis Pasteur Limited Chlamydia antigens and corresponding dna fragments and uses thereof
US7297341B1 (en) 1998-12-23 2007-11-20 Sanofi Pasteur Limited Chlamydia antigens and corresponding DNA fragments and uses thereof
US6808713B1 (en) 1998-12-28 2004-10-26 Aventis Pasteur Limited Chlamydia antigens and corresponding DNA fragments and uses thereof
EP1140998B1 (en) 1998-12-28 2008-01-23 Aventis Pasteur Limited Chlamydia antigens and corresponding dna fragments and uses thereof
GB9902555D0 (en) 1999-02-05 1999-03-24 Neutec Pharma Plc Medicament
WO2000058335A1 (en) * 1999-03-26 2000-10-05 Human Genome Sciences, Inc. 47 human secreted proteins
NZ514140A (en) 1999-03-12 2001-09-28 Aventis Pasteur Chlamydia antigens and corresponding DNA fragments and uses thereof
EP1165828A4 (en) * 1999-03-26 2002-09-25 Human Genome Sciences Inc 50 human secreted proteins
ATE375391T1 (en) 1999-05-03 2007-10-15 Sanofi Pasteur Ltd CHLAMYDIA ANTIGENS AND CORRESPONDING DNA FRAGMENTS AND THEIR USES
JP4667694B2 (en) 1999-09-20 2011-04-13 サノフィ、パストゥール、リミテッド Chlamydia antigen and corresponding DNA fragments and uses thereof
US6632663B1 (en) 1999-09-22 2003-10-14 Aventis Pasteur Limited DNA immunization against chlamydia infection
JP4864264B2 (en) 1999-12-22 2012-02-01 サノフィ、パストゥール、リミテッド Chlamydia antigen and corresponding DNA fragments and uses thereof
CA2407114A1 (en) 2000-04-21 2001-11-01 Corixa Corporation Compounds and methods for treatment and diagnosis of chlamydial infection
CA2408199A1 (en) 2000-05-08 2001-11-15 Aventis Pasteur Limited Chlamydia antigens and corresponding dna fragments and uses thereof
EP1297005B1 (en) * 2000-07-03 2009-08-26 Novartis Vaccines and Diagnostics S.r.l. Immunisation against chlamydia pneumoniae
US7537772B1 (en) 2000-10-02 2009-05-26 Emergent Product Development Gaithersburg Inc. Chlamydia protein, gene sequence and the uses thereof
US7731980B2 (en) 2000-10-02 2010-06-08 Emergent Product Development Gaithersburg Inc. Chlamydia PMP proteins, gene sequences and uses thereof
NZ594877A (en) 2000-10-27 2012-07-27 Novartis Vaccines & Diagnostic Nucleic acids and proteins from streptococcus groups A & B
US20030059896A1 (en) * 2000-12-21 2003-03-27 Shire Biochem Inc. Novel chlamydia antigens and corresponding DNA fragments
GB0107658D0 (en) 2001-03-27 2001-05-16 Chiron Spa Streptococcus pneumoniae
GB0107661D0 (en) 2001-03-27 2001-05-16 Chiron Spa Staphylococcus aureus
GB0115176D0 (en) 2001-06-20 2001-08-15 Chiron Spa Capular polysaccharide solubilisation and combination vaccines
GB0118249D0 (en) 2001-07-26 2001-09-19 Chiron Spa Histidine vaccines
GB0121591D0 (en) 2001-09-06 2001-10-24 Chiron Spa Hybrid and tandem expression of neisserial proteins
ES2312649T3 (en) 2001-12-12 2009-03-01 Novartis Vaccines And Diagnostics S.R.L. IMMUNIZATION AGAINST CHLAMYDIA TRACHOMATIS.
GB0203403D0 (en) 2002-02-13 2002-04-03 Chiron Spa Chlamydia cytotoxic-T cell epitopes
CA2476626A1 (en) 2002-02-20 2003-08-28 Chiron Corporation Microparticles with adsorbed polypeptide-containing molecules
GB0220194D0 (en) 2002-08-30 2002-10-09 Chiron Spa Improved vesicles
EP1556477B1 (en) 2002-11-01 2017-08-09 GlaxoSmithKline Biologicals s.a. Drying process
WO2004046177A2 (en) 2002-11-15 2004-06-03 Chiron Srl Unexpected surface proteins in neisseria meningitidis
GB0227346D0 (en) 2002-11-22 2002-12-31 Chiron Spa 741
WO2004087153A2 (en) 2003-03-28 2004-10-14 Chiron Corporation Use of organic compounds for immunopotentiation
GB0308198D0 (en) 2003-04-09 2003-05-14 Chiron Srl ADP-ribosylating bacterial toxin
ES2596553T3 (en) 2003-06-02 2017-01-10 Glaxosmithkline Biologicals Sa Immunogenic compositions based on microparticles comprising adsorbed toxoid and an antigen containing a polysaccharide
EP1765313A2 (en) 2004-06-24 2007-03-28 Novartis Vaccines and Diagnostics, Inc. Compounds for immunopotentiation
CA2571710A1 (en) 2004-06-24 2006-11-02 Nicholas Valiante Small molecule immunopotentiators and assays for their detection
WO2006078318A2 (en) 2004-07-29 2006-07-27 Novartis Vaccines And Diagnostics Inc. Immunogenic compositions for gram positive bacteria such as streptococcus agalactiae
GB0424092D0 (en) 2004-10-29 2004-12-01 Chiron Srl Immunogenic bacterial vesicles with outer membrane proteins
GB0502095D0 (en) 2005-02-01 2005-03-09 Chiron Srl Conjugation of streptococcal capsular saccharides
HUE027400T2 (en) 2005-02-18 2016-10-28 Glaxosmithkline Biologicals Sa Proteins and nucleic acids from meningitis/sepsis-associated escherichia coli
NZ580974A (en) 2005-02-18 2011-05-27 Novartis Vaccines & Diagnostic Immunogens from uropathogenic escherichia coli
US20110223197A1 (en) 2005-10-18 2011-09-15 Novartis Vaccines And Diagnostics Inc. Mucosal and Systemic Immunization with Alphavirus Replicon Particles
US7527801B2 (en) 2005-11-22 2009-05-05 Novartis Vaccines And Diagnostics, Inc. Norovirus and Sapovirus antigens
ES2536426T3 (en) 2006-03-23 2015-05-25 Novartis Ag Imidazoquinoxaline compounds as immunomodulators
US20100166788A1 (en) 2006-08-16 2010-07-01 Novartis Vaccines And Diagnostics Immunogens from uropathogenic escherichia coli
GB0700562D0 (en) 2007-01-11 2007-02-21 Novartis Vaccines & Diagnostic Modified Saccharides
KR101621837B1 (en) 2007-09-12 2016-05-17 노파르티스 아게 Gas57 mutant antigens and gas57 antibodies
DK2235046T3 (en) 2007-12-21 2012-10-29 Novartis Ag Mutant forms of streptolysin-O
ES2586308T3 (en) 2008-10-27 2016-10-13 Glaxosmithkline Biologicals Sa Purification procedure of a group A Streptococcus carbohydrate
US8585505B2 (en) 2008-12-15 2013-11-19 Tetris Online, Inc. Inter-game interactive hybrid asynchronous computer game infrastructure
WO2010078027A1 (en) * 2008-12-17 2010-07-08 Genocea Biosciences, Inc. Chlamydia antigens and uses thereof
WO2010078556A1 (en) 2009-01-05 2010-07-08 Epitogenesis Inc. Adjuvant compositions and methods of use
AU2010204139A1 (en) 2009-01-12 2011-08-11 Novartis Ag Cna_B domain antigens in vaccines against gram positive bacteria
RU2011140508A (en) 2009-03-06 2013-04-20 Новартис Аг Chlamydia antigens
BR112012009014B8 (en) 2009-09-30 2022-10-04 Novartis Ag PROCESS FOR PREPARING S. AUREUS CAPSULAR POLYSACCHARIDE CONJUGATE TYPE 5 OR TYPE 8 AND CRM197 TRANSPORT MOLECULE, CONJUGATE AND IMMUNOGENIC COMPOSITION
RU2579900C2 (en) 2009-10-30 2016-04-10 Новартис Аг Purification of staphylococcus aureus type 5 and type 8 capsular saccharides
GB201101665D0 (en) 2011-01-31 2011-03-16 Novartis Ag Immunogenic compositions
CA2849391A1 (en) * 2010-10-20 2012-04-26 Genocea Biosciences, Inc. Chlamydia antigens and uses thereof
US20130315959A1 (en) 2010-12-24 2013-11-28 Novartis Ag Compounds
CN107837394A (en) 2011-06-24 2018-03-27 埃皮托吉尼西斯有限公司 Pharmaceutical composition as the combination of the carrier comprising selection of antigen specific immune conditioning agent, vitamin, tannin and flavonoids
EP2755683B1 (en) 2011-09-14 2019-04-03 GlaxoSmithKline Biologicals SA Methods for making saccharide-protein glycoconjugates
EP2776069A1 (en) 2011-11-07 2014-09-17 Novartis AG Carrier molecule comprising a spr0096 and a spr2021 antigen
KR20150021933A (en) 2012-05-22 2015-03-03 노파르티스 아게 Meningococcus serogroup x conjugate
US11612664B2 (en) 2016-04-05 2023-03-28 Gsk Vaccines S.R.L. Immunogenic compositions
CN108514870B (en) * 2018-04-27 2020-02-28 湖南大学 Hydrotalcite-poly (m-phenylenediamine) composite material and preparation method and application thereof

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5374718A (en) * 1992-08-26 1994-12-20 Gen-Probe Incorporated Nucleic acid probes to chlamydia pneumoniae
JPH08294400A (en) * 1995-04-28 1996-11-12 Hitachi Chem Co Ltd Probe and primer for detecting and measuring chlamydia pneumoniae gene, detection and measurement of chlamydia pneumoniae gene using the same probe or primer and reagent for detecting and measuring chlamydia pneumoniae gene containing the same probe or primer
JPH10210978A (en) * 1997-01-31 1998-08-11 Hitachi Chem Co Ltd Recombinant vector and transformant containing the same, and recombinant vacurovirus and its production, and production of chlamydia pneumoniae antigen polypeptide

Also Published As

Publication number Publication date
AU1722300A (en) 2000-05-29
WO2000027994A2 (en) 2000-05-18
EP1133572A2 (en) 2001-09-19
EP1133572A4 (en) 2005-06-15
WO2000027994A3 (en) 2000-11-23
JP2002529069A (en) 2002-09-10

Similar Documents

Publication Publication Date Title
CA2350775A1 (en) Chlamydia pneumoniae genome sequence
US10981978B2 (en) Compositions and methods for the therapy and diagnosis of Inflammatory Bowel Disease
US6822071B1 (en) Polypeptides from Chlamydia pneumoniae and their use in the diagnosis, prevention and treatment of disease
AU2022241521A1 (en) Shared neoantigens
US7410640B2 (en) GBS toxin receptor antibodies
KR101595134B1 (en) Therapeutic agent for pruritus
US20020193329A1 (en) Compositions and methods for the therapy and diagnosis of Her-2/neu-associated malignancies
SA99191283B1 (en) CHLAMYDIA PROTEIN and the chain of religion and its uses
JP2014527398A (en) Compositions and methods for cancer therapy and diagnosis
EP2329838A2 (en) Mutated netrin 4, fragments thereof and uses thereof as drugs
US20040047880A1 (en) Component for vaccine
Ko et al. A novel modified RANKL variant can prevent osteoporosis by acting as a vaccine and an inhibitor
JP2010099072A (en) Gbs toxin receptor
WO2018176732A1 (en) Polypeptide specifically binding to cd56 molecule and use thereof
JP2004537966A (en) Compositions and methods for treatment and diagnosis of ovarian cancer
JP2001286284A (en) Agent for genetically diagnosing and/or treating tumor using tumor-specific antigen and new application of proton pump-inhibitor as antitumor agent
CA3162994A1 (en) Salmonella-based dna vaccines in combination with an antibiotic
JP2008289482A (en) Composition and method for therapy and diagnosis of ovarian cancer

Legal Events

Date Code Title Description
FZDE Discontinued