AU770915B2 - Enterococcus faecalis polynucleotides and polypeptides - Google Patents
Enterococcus faecalis polynucleotides and polypeptides Download PDFInfo
- Publication number
- AU770915B2 AU770915B2 AU55888/01A AU5588801A AU770915B2 AU 770915 B2 AU770915 B2 AU 770915B2 AU 55888/01 A AU55888/01 A AU 55888/01A AU 5588801 A AU5588801 A AU 5588801A AU 770915 B2 AU770915 B2 AU 770915B2
- Authority
- AU
- Australia
- Prior art keywords
- faecalis
- sequence
- polynucleotide
- polypeptides
- isolated polynucleotide
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Landscapes
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
- Peptides Or Proteins (AREA)
Description
P/00/011 28/5/91 Regulation 3.2
AUSTRALIA
Patents Act 1990
ORIGINAL
COMPLETE SPECIFICATION STANDARD PATENT Name of Applicant: Actual Inventors: Address for service is: Human Genome Sciences, Inc.
Charles A KUNSCH Gil H CHOI Camella BAILEY Alex HROMOCKYJ WRAY ASSOCIATES 239 Adelaide Terrace Perth, WA 6000 Attorney code: WR Invention Title: "Enterococcus faecalis Polynucleotides and Polypeptides" This application is a divisional application by virtue of Section 39 of Australian Patent Application 72793/98 filed on 4 May 1998.
The following statement is a full description of this invention, including the best method of performing it known to me:- 1/2 Enterococcus faecalis polynucleotides and polypeptides Field of the Invention The present invention relates to novel Enterococcusfaecalis genes faecalis) nucleic acids and polypeptides. Also provided are vectors, host cells and recombinant methods for producing the same. Further provided are diagnostic methods for detecting Enterococcusfaecalis using probes, primers, and antibodies to the E. faecalis nucleic acids and polypeptides of the present invention. The invention further relates to screening methods for identifying agonists and antagonists of E. faecalis lo polypeptide activity and to vaccines using E. faecalis nucleic acids and polypeptides.
Background of the Invention Enterococci have been recognized as being pathogenic for humans since the turn of the century when they were first described by Thiercelin in 1988 as microscopic organisms. The genus Enterococcus includes the species Enterococcus faecalis or E. faecalis which is the most common pathogen in the group, accounting for 90 percent of all enterococcal infections. See Lewis et al. (1990) Eur J. Clin Microbiol Infect Dis.9: 111-117.
The incidence of enterococcal infections has increased in recent years and enterococci are now the second most frequently reported nosocomial pathogens.
Enterococcal infection is of particular concern because of its resistance to antibiotics.
Recent attention has focused on enterococci not only because of their increasing role in nosocomial infections, but also because of their remarkable and increasing resistance to antimicrobial agents. These factors are mutually reinforcing since resistance allows enterococci to survive in an environment in which antimicrobial agents are heavily used; the hospital setting provides the antibiotics which eliminate or suppress susceptible bacteria, thereby providing a selective advantage for resistant organisms, and the hospital also provides the potential for dissemination of resistant enterococci via the usual routes of hand and environmental contamination.
Antimicrobial resistance can be divided into two general types, inherent or intrinsic property and that which is acquired. The genes for intrinsic resistance, like other species characteristics, appear to reside on the chromosome. Acquired resistance results from either a mutation in the existing DNA or acquisition of new DNA. The various inherent traits expressed by enterococci include resistance to semisynthetic penicillinase-resistant penicillins, cephalosporins, low levels of aminoglycosides, and low levels of clindamycin. Examples of acquired resistance include resistance to chloramphenicol, erythromycin, high levels of clindamycin, tetracycline, high levels of aminoglycosides, penicillin by means of penicillinase, fluoroquinolones, and vancomycin. Resistance to high levels of penicillin without penicillinase and resistance to fluoroquinolones are not known to be plasmid or transposon mediated and presumably are due to mutation(s).
Although the main reservoir for enterococci in humans is the gastrointestinal tract, the bacteria can also reside in the gallbladder, urethra and vagina.
15 E. faecalis has emerged as an important pathogen in endocarditis, bacteremia, urinary tract infections (UTIs), intraabdominal infections, soft tissue infections, and neonatal sepsis. See Lewis et al. (1990) supra.. In the 1970s and 1980s enterococci became finnly established as major nosocomial pathogens. They are now the fourth leading cause of hospital-acquired infection and the third leading cause of bacteremia in the United States. Fatality ratios for enterococcal bactermia range from 12% to 68%, with death due to enterococcal sepsis in 4 to 50% of these cases. See T.G. Emori (1993) Clin. Microbiol. Rev. 6:428-442.
The ability of enterococci to colonize the gastrointestinal tract, plus the many intrinsic and acquired resistance traits, means that these organisms, which usually seem to have relatively low intrinsic virulence, are given an excellent opportunity to become secondary invaders. Since nosocomial isolates of enterococci have displayed resistance to essentially every useful antimicrobial agent, it will likely become increasingly difficult to successfully treat and control enterococcal infections.
Particularly when the various resistance genes come together in a single strain, an event almost certain to occur at some time in the future.
The etiology of diseases mediated or exacerbated by Enterococcus faecalis, involves the programmed expression ofE. faecalis genes, and that characterizing these genes and their patterns of expression would dramatically add to our understanding of the organism and its host interactions. Knowledge of the E.faecalis gene and genomic organization would improve our understanding of disease etiology and lead to improved and new ways of preventing, treating and diagnosing diseases. Thus, there is a need to characterize the genome of E. faecalis and for polynucleotides of this organism.
Summary of the Invention The present invention provides for isolated E. faecalis polynucleotides and polypeptides shown in Table 1 and SEQ ID NO: 1 through SEQ ID NO:496 15 (polynucleotide sequences having odd SEQ ID NOs and polypeptide sequences having even SEQ ID NOs). One aspect of the invention provides isolated nucleic acid molecules comprising polynucleotides having a nucleotide sequence selected from the group consisting of: a nucleotide sequence shown in Table 1; a nucleotide :e sequence encoding any of the amino acid sequences of the polypeptides shown in Table 1; and a nucleotide sequence complementary to any of the nucleotide sequences in or The invention further provides for fragments of the nucleic S. acid molecules of(a), above.
Further embodiments of the invention include isolated nucleic acid molecules that comprise a polynucleotide having a nucleotide sequence at least 90% identical, and more preferably at least 95%, 96%, 97%, 98% or 99% identical, to any of the nucleotide sequences in or above, or a polynucleotide which hybridizes under stringent hybridization conditions to a polynucleotide in or above.
Additional nucleic acid embodiments of the invention relate to isolated nucleic acid molecules comprising polynucleotides which encode the amino acid sequences of epitope-bearing portions of a E. faecalis polypeptide having an amino acid sequence in above.
The present invention also relates to recombinant vectors, which include the isolated nucleic acid molecules of the present invention, and to host cells containing the recombinant vectors, as well as to methods of making such vectors and host cells.
The present invention further relates to the use of these vectors in the production of E. faecalis polypeptides or peptides by recombinant techniques.
The invention further provides isolated E. faecalis polypeptides having an amino acid sequence selected from the group consisting of an amino acid sequence of any of the polypeptides described in Table 1 or fragments thereof.
The polypeptides of the present invention also include polypeptides having an amino acid sequence with at least 70% similarity, and more preferably at least 85%, 90%, 95%, 96%, 97%, 98%, or 99% similarity to those described in Table 1, as well as polypeptides having an amino acid sequence at least 70% identical, more preferably at least 75% identical, and still more preferably 80%, 85%, 90%, 96%, 97%, 98%, or 99% identical to those above; as well as isolated nucleic acid molecules encoding such polypeptides.
The present invention further provides a single or multi-component vaccine comprising one or more of the E. faecalis polynucleotides or polypeptides described in Table 1, or fragments thereof, together with a pharmaceutically acceptable diluent, carrier, or excipient, wherein the E. faecalis polypeptide(s) are present in an amount effective to elicit an immune response to members of the Enterococcus genus, or at least E. faecalis in an animal. The E. faecalis polypeptides of the present invention may further be combined with one or more immunogens of one or more other Enterococcal or non-Enterococcal organisms to produce a multi-component vaccine intended to elicit an immunological response against members of the Enterococcus genus and, optionally, one or more non-Enterococcal organisms.
The vaccines of the present invention can be administered in a DNA form, e.g., "naked" DNA, wherein the DNA encodes one or more Enterococcal polypeptides and, optionally, one or more polypeptides of a non-Enterococcal organism. The DNA encoding one or more polypeptides may be constructed such that these polypeptides are expressed as fusion proteins.
The vaccines of the present invention may also be administered as a component of a genetically engineered organism or host cell. Thus, a genetically engineered organism or host cell which expresses one or more E. faecalis polypeptides may be administered to an animal. For example, such a genetically engineered organism or host cell may contain one or more E. faecalis polypeptides of the present invention intracellularly, on its cell surface, or in its periplasmic space. Further, such a genetically engineered organism or host cell may secrete one or more E. faecalis polypeptides. The vaccines of the present invention may also be co-administered to an animal with an immune system modulator CD86 and GM-CSF).
o The invention also provides a method of inducing an immunological response in an animal to one or more members of the Enterococcus genus, preferably one or 15 more isolates of the E.faecalis species, comprising administering to the animal a vaccine as described above.
The invention further provides a method of inducing a protective immune response in an animal, sufficient to prevent, attenuate, or control an infection by members of the Enterococcus genus, preferably at least E. faecalis species, comprising administering to the animal a composition comprising one or more of the polynucleotides or polypeptides described in Table 1, or fragments thereof. Further, these polypeptides, or fragments thereof, may be conjugated to another immunogen and/or administered in admixture with an adjuvant.
The invention further relates to antibodies elicited in an animal by the administration of one or more E. faecalis polypeptides of the present invention and to methods for producing such antibodies and fragments thereof. The invention further relates to recombinant antibodies and fragments thereof and to methods for producing such antibodies and fragments thereof.
The invention also provides diagnostic methods for detecting the expression of the polynucleotides of Table 1 by members of the Enterococcus genus in an animal.
One such method involves assaying for the expression of a polynucleotide encoding E. faecalis polypeptides in a sample from an animal. This expression may be assayed either directly by assaying polypeptide levels using antibodies elicited in response to amino acid sequences described in Table 1) or indirectly by assaying for antibodies having specificity for amino acid sequences described in Table The expression of polynucleotides can also be assayed by detecting the nucleic acids of Table 1. An example of such a method involves the use of the polymerase chain reaction (PCR) to amplify and detect Enterococcus nucleic acid sequences.
The present invention also relates to nucleic acid probes having all or part of a nucleotide sequence described in Table 1 (odd SEQ ID NOs) which are capable of hybridizing under stringent conditions to Enterococcus nucleic acids. The invention further relates to a method of detecting one or more Enterococcus nucleic acids in a biological sample obtained from an animal, said one or more nucleic acids encoding 15 Enterococcus polypeptides, comprising: contacting the sample with one or more of the above-described nucleic acid probes, under conditions such that hybridization occurs, and detecting hybridization of said one or more probes to the Enterococcus nucleic acid present in the biological sample.
Other uses of the polypeptides of the present invention include: inter alia. to 20 detect E. faecalls in immunoassays, as epitope tags, as molecular weight markers on SDS-PAGE gels, as molecular weight markers for molecular sieve gel filtration columns, to generate antibodies that specificaly bind E. faecalis polypeotides of the present invention for the detection E. faecalis in immunoassays, to generate an immune response against E. faecalis and other Enterococcus species, and as vaccines against E. faecalis, other Enterococcus species and other bacteria genuses.
Isolated nucleic acid molecules of the present invention, particularly DNA molecules, are useful as probes for gene mapping and for identifying E. faecalis in a biological samples, for instance, by Southern and Northern blot analysis.
Polynucleotides of the present invention are also useful in detecting E. faecalis by PCR using primers for a particular E. faecalis polynucleotide. Isolated polynucleotides of the present invention are also useful in making the polypeptides of the present invention.
Detailed Description The present invention relates to recombinant E. faecalis nucleic acids and fragments thereof. The present invention further relates to recombinant E. faecalis polypeptides and fragments thereof. The invention also relates to methods for using these polypeptides to produce immunological responses and to confer immunological In protection to disease caused by members of the genus Enterococcus, at least isolates of the E. faecalis genus. The invention further relates to nucleic acid sequences which encode antigenic E. faecalis polypeptides and to methods for detecting E. faecalis nucleic acids and polypeptides in biological samples. The invention also relates to antibodies specific for the polypeptides and peptides of the present invention and 15 methods for detecting such antibodies produced in a host animal.
S* Definitions The following definitions are provided to clarify the subject matter which the inventors consider to be the present invention.
20 As used herein, the phrase "pathogenic agent" means an agent which causes a disease state or affliction in an animal. Included within this definition, for examples, are bacteria, protozoans, fungi, viruses and metazoan parasites which either produce a disease state or render an animal infected with such an organism susceptible to a disease state a secondary infection). Further included are species and strains of the genus Enterococcus which produce disease states in animals.
As used herein, the term "organism" means any living biological system, including viruses, regardless of whether it is a pathogenic agent.
As used herein, the term "Enterococcus" means any species or strain of bacteria which is members of the genus Enterococcus. Such species and strains are known to those of skill in the art, and include those that are pathogenic and those that are not.
As used herein, the phrase "one or more E. faecalis polypeptides of the present invention" means polypeptides comprising the amino acid sequence of one or more of the E. faecalis polypeptides described in Table 1 (even SEQ ID NOs). These polypeptides may be expressed as fusion proteins wherein the E. faecalis polypeptides of the present invention are linked to additional amino acid sequences which may be of Enterococcal or non-Enterococcal origin. This phrase further includes polypeptide comprising fragments of the E. faecalis polypeptides of the I present invention. Additional definitions are provided throughout the specification.
Explanation of Table I Table 1, below, provides information describing genes which encode polypeptides of E.faecalis. The table lists the gene identifier which consists of the 15 letters EF, which denote E.faecalis, followed imunediately by a three digit numeric code, which arbitrarily number the E.faecalis genes of the present invention. A number from 1 through 4 follows the three digit number. A number 1 represents the full length open reading frame of the gene specified by the preceeding three digit number. A number 2 represents the full leng-tpolypeptide encoded by the gene specified the preceeding three digit number. A number 3 represents a polynucleotide fragment, of the gene represented by the preceeding three digit number, used to produce an antigenic polypeptide. A number 4 represents an antigenic polypeptide fragment, of the gene represented by the preceeding three digit number, used to stimulate an immune response or as a vaccine. The nucleotide and amino acid sequences of each gene and fragment are also shown in the Sequence Listing under the SEQ ID NO listed in Table 1.
Explanation of Table 2 Table 2 lists accession numbers for the closest matching sequences between the polypeptides of the present invention and those available through GenBank and Derwent databases. These reference numbers are the database entry numbers commonly used by those of skill in the art, who will be familar with their denominations. The descriptions of the numenclature for GenBank are available from the National Center for Biotechnology Information. Column I lists the gene or ORF of the present invention. Column 2 lists the accession number of a "match" gene sequence in GenBank or Derwent databases. Column 3 lists the description of the "match" gene sequence. Columns 4 and 5 are the high score and smallest sum probability, respectively, calculated by BLAST. Polypeptides of the present invention that do not share significant identity/similarity with any polypeptide sequences of GenBank and Derwent are not represented in Table 2. Polypeptides of the present invention that share significant identity/similarity with more than one of the polypeptides of GenBank and Derwent are represented more than once.
15 Explanation of Table 3.
The E. faecalis polypeptides of the present invention may include one or more conservative amino acid substitutions from natural mutations or human manipulation as indicated in Table 3. Changes are preferably of a minor nature, such as conservative amino acid substitutions that do not significantly affect the folding or activity of the 20 protein. Residues from the following groups, as indicated in Table 3, may be substituted for one another: Aromatic, Hydrophobic, Polar, Basic, Acidic, and Small, Explanation of Table 4 Table 4 lists residues comprising antigenic epitopes of antigenic epitopebearing fragments present in each of the full length E. faecalis polypeptides described in Table 1 as predicted by the inventors using the algorithm of Jameson and Wolf, (1988) Comp. Appl. Biosci. 4:181-186. The Jameson-Wolf antigenic analysis was performed using the computer program PROTEAN (Version 3.11 for the Power Macintosh, DNASTAR, Inc., 1228 South Park Street Madison, WI). E. faecalis polypeptide shown in Table 1 may one or more antigenic epitopes comprising residues described in Table 4. It will be appreciated that depending on the analytical criteria used to predict antigenic determinants, the exact address of the determinant may vary slightly. The residues and locations shown described in Table 4 correspond to the amino acid sequences for each full length gene sequence shown in Table I and in the Sequence Listing. Polypeptides of the present invention that do not have antigenic epitopes recognized by the Jameson-Wolf algorithm are not represented in Table 2.
Selection of Nucleic Acid Sequences Encoding Antigenic E. faecalis Polypeptides Sequenced E. faecalis genomic DNA was obtained from the E. faecalis strain V586. The E. faecalis strain V586 was deposited 2 May 1997 at the ATCC, 10801 e" University Blvd. Manassas, VA 20110-2209, and given accession number 55969.
Some ORFs contained in the subset of fragments of the E. faecalis genome 15 disclosed herein were derived through the use of a number of screening criteria detailed below. The ORFs are bounded at the amino terminus by a methionine or valine residue and usually at the carboxy terminus by a stop codon.
Most of the selected sequences consist of complete ORFs. The polypeptides that do not comprise a complete ORF can be determined by determining whether the 20 corresponding polynucleotide sequence comprises a stop codon after the codon for the last amino acid residue in the polypeptide sequence. It is not always preferred to a express a complete ORF in a heterologous system. It may be challenging to express and purify a highly hydrophobic protein by common laboratory methods. Some of the polypeptide vaccine candidates described herein have been modified slightly to simplify the production of recombinant protein. For example, nucleotide sequences which encode highly hydrophobic domains, such as those found at the amino terminal signal sequence, have been excluded from some constructs used for expression of the polypeptides. Furthermore, any highly hydrophobic amino acid sequences occurring at the carboxy terminus have also been excluded from the recombinant expression constructs. Thus, in one embodiment, a polypeptide which represents a truncated or modified ORF may be used as an antigen.
While numerous methods are known in the art for selecting potentially immunogenic polypeptides, many of the ORFs disclosed herein were selected on the basis of screening Enterococcusfaecalis ORFs for several aspects of potential irmnunogenicity. One set of selection criteria are as follows: 1. Type I signal sequence: An amino terminal type I signal sequence generally directs a nascent protein across the plasma and outer membranes to the exterior of the bacterial cell. Experimental evidence obtained from studies with Escherichia coli suggests that the typical type I signal sequence consists of the following biochemical and physical attributes (Izard, J. W. and Kendall, D. A. Mol. Microbiol. 13:765-773 (1994)). The length of the type I signal sequence is approximately 15 to 25 primarily hydrophobic amino acid residues with a net positive charge in the extreme amino terminus. In addition, the central region of the signal sequence. adopts an alpha-helical conformation in a hydrophobic environment. Finally, the region surrounding the actual site of cleavage is ideally six residues long, with small side-chain amino acids in i the -1 and -3 positions.
2. Type IV signal sequence: The type IV signal sequence is an example of the several types of functional signal sequences which exist in addition to the type I signal 20 sequence detailed above. Although functionally related, the type IV signal sequence possesses a unique set of biochemical and physical attributes (Strom, M. S. and Lory, J. Bacteriol. 174:7345-7351 (1992)). These are typically six to eight amino acids with a net basic charge followed by an additional sixteen to thirty primarily hydrophobic residues. The cleavage site of a type IV signal sequence is typically after the initial six to eight amino acids at the extreme amino terminus. In addition, type IV signal sequences generally contain a phenylalanine residue at the +1 site relative to the cleavage site.
3. Lipoprotein: Studies of the cleavage sites of twenty-six bacterial lipoprotein precursors has allowed the definition of a consensus amino acid sequence for lipoprotein cleavage. Nearly three-fourths of the bacterial lipoprotein precursors examined contained the sequence at positions -3 to relative to the point of cleavage (Hayashi, S. and Wu, H. Bioenerg. Biomembr. 22:451-471 (1990)).
4. LPXTG motif It has been experimentally determined that most anchored proteins found on the surface of gram-positive bacteria possess a highly conserved carboxy terminal sequence. More than fifty such proteins from organisms such as S.
pyogenes, S. mutans, E. faecalis, S. pneumoniae, and others, have been identified based on their extracellular location and carboxy terminal amino acid sequence (Fischetti, V.
ASMNews 62:405-410 (1996)). The conserved region consists of six charged amino acids at the extreme carboxy terminus coupled to 15-20 hydrophobic amino acids presumed to function as a transmembrane domain. Immediately adjacent to the transmembrane domain is a six amino acid sequence conserved in nearly all proteins examined. The amino acid sequence of this region is L-P-X-T-G-X, where X is any amino acid.
An algorithm for selecting antigenic and immunogenic Enterococcusfaecalis "mo: polypeptides including the foregoing criteria was developed. The algorithm is similar to that described in U.S. patent application 08/781,986, filed January 3, 1997, which is fully incorporated by reference herein. Use of the algorithm by the inventors to select immunologically useful Enterococcusfaecalis polypeptides resulted in the selection of a number of the disclosed ORFs. Polypeptides comprising the polypeptides identified in this group may be produced by techniques standard in the art and as further described herein.
Nucleic Acid Molecules Sequenced E.faecalis genomic DNA was obtained from the E. faecalis strainV586. As discussed elsewhere hererin, polynucleotides of the present invention readily may be obtained by routine application of well known and standard procedures for cloning and sequencing DNA. Detailed methods for obtaining libraries and for sequencing are provided below, for instance. A wide variety of Enterococcusfaecalis strains that can be used to prepare E.faecalis genomic DNA for cloning and for obtaining polynucleotides and polypeptides of the present invention. A wide variety of Enterococcusfaecalis strains are available to the public from recognized depository institutions, such as the American Type Culture Collection (ATCC). It is recognized that minor variation is the nucleic acid and amino acid sequence may be expected from Efaecalis strain to strain. The present invention provides for genes, including both polynucleotides and polypeptides, of the of the present invention from all the Enterococcusfaecalis strains.
Unless otherwise indicated, all nucleotide sequences determined by sequencing a DNA molecule herein were determined using an automated DNA sequencer (such as the Model 373 from Applied Biosystems, Inc., Foster City, CA), and all amino acid sequences of polypeptides encoded by DNA molecules determined herein were predicted by translation of a DNA sequence determined as above. Therefore, as is 15 known in the art for any DNA sequence determined by this automated approach, any nucleotide sequence determined herein may contain some errors. Nucleotide sequences detennined by automation are typically at least about 90% identical, more typically at least about 95% to at least about 99.9% identical to the actual nucleotide sequence of the sequenced DNA molecule. The actual sequence can be more precisely determined by other approaches including manual DNA sequencing methods well known in the art. As is also known in the art, a single insertion or deletion in a determined nucleotide sequence compared to the actual sequence will cause a frame shift in translation of the nucleotide sequence such that the predicted amino acid sequence encoded by a determined nucleotide sequence will be completely different from the amino acid sequence actually encoded by the sequenced DNA molecule, beginning at the point of such an insertion or deletion. In case of conflict between Table 1 and either the nucleic acid sequence of the clones listed in Table 1 or the amino acid sequence of the protein expressed by the clones listed in Table 1, the clones listed in Table 1 are controlling. By "nucleotide sequence" of a nucleic acid molecule or polynucleotide is intended to mean either a DNA or RNA sequence.Using the information provided herein, such as the nucleotide sequence in Table 1, a nucleic acid molecule of the present invention encoding a E. faecalis polypeptide may be obtained using standard cloning and screening procedures, such as those for cloning DNAs using genomic DNA as starting material. See, Sambrook et al. MOLECULAR CLONING: A LABORATORY MANUAL (Cold Spring Harbor, N.Y. 2nd ed.
1989); Ausubel et al., CURRENT PROTOCALS IN MOLECULAR BIOLOGY (John Wiley and Sons, N.Y. 1989). Illust ative of the invention, the nucleic acid molecule described in Table 1 was discovered in a DNA library derived from a E.
faecalis genomic DNA.
Nucleic acid molecules of the present invention may be in the form of RNA, such as mRNA, or in the form of DNA, including, for instance, DNA and genomic DNA obtained by cloning or produced synthetically. The DNA may be double-stranded or single-stranded. Single-stranded DNA or RNA may be the coding 15 strand, also known as the sense strand, or it may be the non-coding strand, also referred to as the anti-sense strand.
By "isolated" nucleic acid molecule(s) is intended a nucleic acid molecule, DNA or RNA, which has been removed from its native environment. This includes segments of DNA comprising the E. faecalis polynucleotides of the present invention isolated from the native chromosome. These fragments include both isolated fragments consisting only of E. faecalis DNA and fragments comprising heterologous sequences such as vector sequences or other foreign DNA. For example, recombinant DNA molecules contained in a vector are considered isolated for the purposes of the present invention. Further examples of isolated DNA molecules include recombinant DNA molecules maintained in heterologous host cells or purified (partially or substantially) DNA molecules in solution. Isolated RNA molecules include in vivo or in vitro RNA transcripts of the DNA molecules of the present invention. Isolated nucleic acid molecules according to the present invention further include such molecules produced synthetically.
In addition, isolated nucleic acid molecules of the invention include DNA molecules which comprise a sequence substantially different from those described above but which, due to the degeneracy of the genetic code, still encode a E. faecalis polypeptides and peptides of the present invention polypeptides of Table 1).
That is, all possible DNA sequences that encode the E. faecalis polypeptides of the present invention. This includes the genetic code and species-specific codon preferences known in the art. Thus, it would be routine for one skilled in the art to generate the degenerate variants described above, for instance, to optimize codon expression for a particular host change codons in the bacteria mRNA to those preferred by a mammalian or other bacterial host such as E. coli).
The invention further provides isolated nucleic acid molecules having the nucleotide sequence shown in Table 1 or a nucleic acid molecule having a sequence complementary to one of the above sequences. Such isolated molecules, particularly DNA molecules, are useful as probes for gene mapping and for identifying E.faecalis in a biological sample, for instance, by PCR, Southern blot, Northern blot, or other form of hybridization analysis.
The present invention is further directed to nucleic acid molecules encoding portions or fragments of the nucleotide sequences described herein. Fragments include S* portions of the nucleotide sequences of Table 1, or the E. faecalis nucleotide sequences contained in the plasimd clones listed in Table 1, at least 10 contiguous nucleotides in length selected from any two integers, one of which representing a nucleotide position and a second of which representing a 3' nucleotide position, where the first nucleotide for each nucleotide sequence in Table 1 is position 1. That is, every combination of a 5' and 3' nucleotide position that a fragment at least contiguous nucleotides in length could occupy is included in the invention. At least means a fragment may be 10 contiguous nucleotide bases in length or any integer between 10 and the length of an entire nucleotide sequence of Table 1 minus 1.
Therefore, included in the invention are contiguous fragments specified by any 5' and 3' nucleotide base positions of a nucleotide sequences of Table 1 wherein the contiguous fragment is any integer between 10 and the length of an entire nucleotide sequence minus 1.
Further, the invention includes polynucleotides comprising fragments specified by size, in nucleotides, rather than by nucleotide positions. The invention includes any fragment size, in contiguous nucleotides, selected from integers between 10 and the length of an entire nucleotide sequence minus 1. Preferred sizes of contiguous nucleotide fragments include 20 nucleotides, 30 nucleotides, 40 nucleotides, nucleotides. Other preferred sizes of contiguous nucleotide fragments, which may be useful as diagnostic probes and primers, include fragments 50-300 nucleotides in length which include, as discussed above, fragment sizes representing each integer between 50-300. Larger fragments are also useful according to the present invention corresponding to most, if not all, of the nucleotide sequences shown in Table 1 or of the E. faecalis nucleotide sequences of the plasimd clones listed in Table 1. The preferred sizes are, of course, meant to exemplify not limit the present invention as all size fragments, representing any integer between 10 and the length of an entire Snucleotide sequence minus 1, are included in the invention. Additional preferred nucleic acid fragments of the present invention include nucleic acid molecules encoding epitope-bearing portions of E. faecalis polypeptides identified in Table 4.
•The present invention also provides for the exclusion of any fragment, specified by 5' and 3' base positions or by size in nucleotide bases as described above for any nucleotide sequence of Table 1 or the plasimd clones listed in Table 1. Any number of fragments of nucleotide sequences in Table 1 or the plasimd clones listed in Table 1, specified by 5' and 3' base positions or by size in nucleotides, as described above, may be excluded from the present invention.
In another aspect, the invention provides an isolated nucleic acid molecule comprising a polynucleotide which hybridizes under stringent hybridization conditions to a portion of a polynucleotide in a nucleic acid molecules of the invention described above, for instance, nucleotide sequences of Table 1 or the E. faecalis sequences of the plasimd clones listed in Table 1. By "stringent hybridization conditions" is intended overnight incubation at 42 0 C in a solution comprising: formamide, 5x SSC (150 mM NaCI, 15 mM trisodium citrate), 50 mM sodium phosphate (pH 5x Denhardt's solution, 10% dextran sulfate, and 20 p.g/ml denatured, sheared salmon sperm DNA, followed by washing the filters in 0. x SSC at about By a polynucleotide which hybridizes to a "portion" of a polynucleotide is intended a polynucleotide (either DNA or RNA) hybridizing to at least about nucleotides bases, and more preferably ai least about 20 nucleotides bases, still more preferably at least about 30 nucleotides bases, and even more preferably about 30-70 50) nucleotides bases of the reference polynucleotide. These are useful as diagnostic probes and primers as discussed above. By a portion of a polynucleotide of "at least 20 nucleotides bases in length," for example, is intended 20 or more contiguous nucleotides bases nucleotides from the nucleotide sequence of the reference "polynucleotide the nucleotide sequence as shown in Table Portions of a polynucleotide which hybridizes to a nucleotide sequence in Table 1, which can be used as probes and primers, may also be precisely specified by 5' and 3' base positions or by size in nucleotide bases as described above or precisely excluded in the same manner.
The nucleic acid molecules of the present invention include those encoding the full length E.faecalis polypeptides of Table 1 and portions of the E. faecalis polypeptides of Table 1. Also included in the present invention are nucleic acids encoding the above full length sequences and further comprise additional sequences, such as those encoding an added secretory leader sequence, such as a pre-, or pro- or prepro- protein sequence. Further included in the present invention are nucleic acids encoding the above full length sequences and portions thereof and further comprise additional heterologous amino acid sequences encoded by nucleic acid sequences from a different source.
Also included in the present invention are nucleic acids encoding the above protein sequences together with additional, non-coding sequences, including for example, but not limited to non-coding 5' and 3' sequences. These sequences include transcribed, non-translated sequences that may play a role in transcription, and mRNA processing, for example, ribosome binding and stability ofmRNA. Also included in the present invention are additional coding sequences which provide additional functionalities.
Thus, a nucleotide sequence encoding a polypeptide may be fused to a marker sequence, such as a sequence encoding a peptide which facilitates purification of the fused polypeptide. In certain preferred 6imbodiments of this aspect of the invention, the marker amino acid sequence is a hexa-histidine peptidd, such as the tag provided in a pQE vector (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 91311), among others, many of which are commercially available. For instance, hexa-histidine provides for convenient purification of the fusion protein. See Gentz et al. (1989) Proc. Natl. Acad. Sci. 86:821-24. The "HA" tag is another peptide useful for purification which corresponds to an epitope derived from the influenza hemagglutinin protein. See Wilson et al. (1984) Cell 37:767. As discussed below, other such fusion proteins include the E. faecalis polypeptides of the present invention fused to Fc at the N- or C-terminus.
Variant and Mutant Polynucleotides The present invention further relates to variants of the nucleic acid molecules which encode portions, analogs or derivatives of a E. faecalis polypeptides of Table 1 S[and variant polypeptides thereof including portions, analogs, and derivatives of the E.
faecalis polypeptides. Variants may occur naturally, such as a natural allelic variant.
By an "allelic variant" is intended one of several alternate forms of a gene occupying a given locus on a chromosome of an organism. See, B. Lewin, Genes IV (1990).
Non-naturally occurring variants may be produced using art-known mutagenesis techniques.
Such nucleic acid variants include those produced by nucleotide substitutions, deletions, or additions. The substitutions, deletions, or additions may involve one or more nucleotides. The variants may be altered in coding regions, non-coding regions, or both. Alterations in the coding regions may produce conservative or non-conservative amino acid substitutions, deletions or additions. Especially preferred among these are silent substitutions, additions and deletions, which do not alter the properties and activities of a E. faecalis protein of the present invention or portions thereof. Also especially preferred in this regard are conservative substitutions.
Such polypeptide variants include those produced by amino acid substitutions, deletions or additions. The substitutions, deletions, or additions may involve one or more residues. Alterations may produce conservative or non-conservative amino acid substitutions, deletions, or additions. Especially preferred among these are silent substitutions, additions and deletions, which do not alter the properties and activities of a E. faecalis protein of the present invention or "portions thereof. Also especially preferred in this regard are conservative substitutions.
SThe present invention also relates to recombinant vectors, which include the S. isolated nucleic acid molecules of the present invention, and to host cells containing the recombinant vectors, as well as to methods of making such vectors and host cells S"and for using them for production of E. faecalis polypeptides or peptides by recombinant techniques.
The present application is directed to nucleic acid molecules at least 95%, 96%, 97%, 98% or 99% identical to a nucleic acid sequence shown in Table 1.
The above nucleic acid sequences are included irrespective of whether they encode a polypeptide having E. faecalis activity. This is because even where a particular nucleic acid molecule does not encode a polypeptide having E. faecalis activity, one of skill in the art would still know how to use the nucleic acid molecule, for instance, as a hybridization probe. Uses of the nucleic acid molecules of the present invention that do not encode a polypeptide having E. faecalis activity include, inter alia, isolating an E. faecalis gene or allelic variants thereof from a DNA library, and detecting E. faecalis mRNA expression samples, environmental samples, suspected of containing E.
faecalis by Northern Blot analysis.
Preferred, are nucleic acid molecules having sequences at least 90%, 95%, 96%, 97%, 98% or 99% identical to the nucleic acid sequence shown in Table 1, which do, in fact, encode a polypeptide having E. faecalis protein activity By "a polypeptide having E. faecalis activity" is intended polypeptides exhibiting activity similar, but not necessarily identical, to an activity of the E. faecalis protein of the invention, as measured in a particular biological assay suitable for measuring activity of the specified protein.
Due to the degeneracy of the genetic code, one of ordinary skill in the art will immediately recognize that a large number of the nucleic acid molecules having a sequence at least 90%, 95%, 96%, 97%, 98%, or 99% identical to the nucleic acid sequences shown in Table I will encode a polypeptide having E. faecalis protein activity. In fact, since degenerate variants of these nucleotide sequences all encode the same polypeptide, this will be clear to the skilled artisan even without performing the above described comparison assay. It will be further recognized in the art that, for such nucleic acid molecules that are not degenerate variants, a reasonable number will also encode a polypeptide having E. faecalis protein activity. This is because the skilled artisan is fully aware of amino acid substitutions that are either less likely or not likely to significantly effect protein function replacing one aliphatic amino acid with a second aliphatic amino acid), as further described below.
The biological activity or function of the polypeptides of the present invention are expected to be similar or identical to polypeptides from other bacteria that share a high degree of structural identity/similarity. Tables 2 lists accession numbers and descriptions for the closest matching sequences of polypeptides available through Genbank and Derwent databases. It is therefore expected that the biological activity or function of the polypeptides of the present invention will be similar or identical to those polypeptides from other bacterial genuses, species, or strains listed in Table 2.
By a polynucleotide having a nucleotide sequence at least, for example, "identical" to a reference nucleotide sequence of the present invention, it is intended that the nucleotide sequence of the polynucleotide is identical to the reference sequence except that the polynucleotide sequence may include up to five point s mutations per each 100 nucleotides of the reference nucleotide sequence encoding the E. faecalis polypeptide. In other words, to obtain a polynucleotide having a nucleotide sequence at least 95% identical to a reference nucleotide sequence, up to of the nucleotides in the reference sequence may be deleted, inserted, or substituted with another nucleotide. The query sequenceanay be an entire sequence n1 shown in Table 1, the ORF (open reading frame), or any fragment specified as described herein.
As a practical matter, whether any particular nucleic acid molecule or polypeptide is at least 90%, 95%, 96%, 97%, 98% or 99% identical to a nucleotide S sequence of the presence invention can be determined conventionally using known computer programs. A preferred method for determining the best overall match between a query sequence (a sequence of the present invention) and a subject sequence, also referred to as a global sequence alignment, can be determined using the FASTDB computer program based on the algorithm of Brutlag et al. See Brutlag et al. (1990) Comp. App. Biosci. 6:237-245. In a sequence alignment the query and subject sequences are both DNA sequences. An RNA sequence can be compared by first converting U's to T's. The result of said global sequence alignment is in percent identity. Preferred parameters used in a FASTDB alignment of DNA sequences to calculate percent identity are: Matrix=Unitary, k-tuple=4, Mismatch Penalty 1, Joining Penalty=30, Randomization Group Length=0, Cutoff Score=l, Gap *4 Penalty=5, Gap Size Penalty 0.05, Window Size=500 or the lenght of the subject nucleotide sequence, whichever is shorter.
If the subject sequence is shorter than the query sequence because of 5' or 3' deletions, not because of internal deletions, a manual correction must be made to the results. This is because the FASTDB program does not account for 5' and 3' truncations of the subject sequence when calculating percent identity. For subject sequences truncated at the 5' or 3' ends, relative to the query sequence, the percent identity is corrected by calculating the number of bases of the query sequence that are and 3' of the subject sequence, which are not matched/aligned, as a percent of the total bases of the query sequence. Whether a nucleotide is matched/aligned is determined by results of the FASTDB sequence alignment. This percentage is then subtracted from the percent identity, calculated by the above FASTDB program using the specified parameters, to arrive at a final percent identity score. This corrected score is what is used for the purposes of the present invention. Only nucleotides outside the 5' and 3' nucleotides of the subject sequence, as displayed by the FASTDB alignment, which are not matched/aligned with the query sequence, are calculated for the purposes of manually adjusting the percent identity score.
For example, a 90 nucleotide subject sequence is aligned to a 100 nucleotide query sequence to determine percent identity. The deletions occur at the 5' end of the 15 subject sequence and therefore, the FASTDB alignment does not show a matched/alignment of the first 10 nucleotides at 5' end. The 10 unpaired nucleotides represent 10% of the sequence (number of nucleotides at the 5' and 3' ends not matched/total number of nucleotides in the query sequence) so 10% is subtracted from the percent identity score calculated by the FASTDB program. If the remaining 20 nucleotides were perfectly matched the final percent identity would be 90%. In another example, a 90 nucleotide subject sequence is compared with a 100 nucleotide query sequence. This time the deletions are internal deletions so that there are no nucleotides on the 5' or 3' of the subject sequence which are not matched/aligned with the query. In this case the percent identity calculated by FASTDB is not manually corrected. Once again, only nucleotides 5' and 3' of the subject sequence which are not matched/aligned with the query sequence are manually corrected for. No other manual corrections are to made for the purposes of the present invention.
Vectors and Host Cell The present invention also relates to vectors which include the isolated DNA molecules of the present invention, host cells comprising the recombinant vectors, and the production of E. faecalis polypeptides and peptides of the present invention expressed by the host cells.
Recombinant constructs may be introduced into host cells using well known techniques such as infection, transduction, transfection, transvection, electroporation and transformation. The vector may be, for example, a phage, plasmid, viral or retroviral vector. Retroviral vectors may be replication competent or replication defective. In the latter case, viral propagation generally Will occur only in complementing host cells.
The polynucleotides may be joined to a vector containing a selectable marker for propagation in a host. Generally, a plasmid vector is introduced in a precipitate, such as a calcium phosphate precipitate, or in a complex with a charged lipid. If the vector is a virus, it may be packaged in vitro using an appropriate packaging cell line 15 and then transduced into host cells.
Preferred are vectors comprising cis-acting control regions to the S. polynucleotide of interest. Appropriate trans-acting factors may be supplied by the host, supplied by a complementing vector or supplied by the vector itself upon introduction into the host.
In certain preferred embodiments in this regard, the vectors provide for specific expression, which may be inducible and/or cell type-specific. Particularly preferred among such vectors are those inducible by environmental factors that are easy to manipulate, such as temperature and nutrient additives.
Expression vectors useful in the present invention include chromosomal-, episomal- and virus-derived vectors, vectors derived from bacterial plasmids, bacteriophage, yeast episomes, yeast chromosomal elements, viruses such as baculoviruses, papova viruses, vaccinia viruses, adenoviruses, fowl pox viruses, pseudorabies viruses and retroviruses, and vectors derived from combinations thereof, such as cosmids and phagemids.
The DNA insert should be operatively linked to an appropriate promoter, such as the phage lambda PL promoter, the E. coli lac, trp and lac promoters, the early and late promoters and promoters of retroviral LTRs, to name a few.
Other suitable promoters will be known to the skilled artisan. The expression constructs will further contain sites for transcription initiation, termination and, in the transcribed region, a ribosome binding site for translation. The coding portion of the mature transcripts expressed by the constructs will preferably include a translation initiating site at the beginning and a termination codon (UAA, UGA or UAG) appropriately positioned at the end of the polypeptide to.be translated.
As indicated, the expression vectors will preferably include at least one selectable marker. Such markers include dihydrofolate reductase or neomycin resistance for eukaryotic cell culture and tetracycline, kanamycin, or ampicillin resistance genes for culturing in E. coli and other bacteria. Representative examples of appropriate hosts include, but are not limited to, bacterial cells, such as E. coli, Streptomyces and Salmonella typhimurium cells; fungal cells, such as yeast cells; insect cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS and Bowes melanoma cells; and plant cells. Appropriate culture mediums and conditions for the above-described host cells are known in the art.
Among vectors preferred for use in bacteria include pQE70, pQE60 and pQE9, pQE10 available from Qiagen; pBS vectors, Phagescript vectors, Bluescript vectors, pNH8A, pNH 16a, pNH18A, pNH46A available from Stratagene; pET series of vectors available from Novagen; and ptrc99a, pKK223-3, pKK233-3, pDR540, available from Pharmacia. Among preferred eukaryotic vectors are pWLNEO, pSV2CAT, pOG44, pXTI and pSG available from Stratagene; and pSVK3, pBPV, pMSG and pSVL available from Pharmacia. Other suitable vectors will be readily apparent to the skilled artisan.
Among known bacterial promoters suitable for use in the present invention include the E. coli lad and lacZ promoters, the T3, T5 and T7 promoters, the gpt promoter, the lambda PR and PL promoters and the trp promoter. Suitable eukaryotic promoters include the CMV immediate early promoter, the HSV thymidine kinase promoter, the early and late SV40 promoters, the promoters of retroviral LTRs, such as those of the Rous sarcoma virus (RSV), and metallothionein promoters, such as the mouse metallothioiein-1 promoter.
Introduction of the construct into the host cell can be effected by calcium phosphate transfection, DEAE-dextran mediated transfection, cationic lipid-mediated transfection, electroporation, transduction, infection or other methods. Such methods are described in many standard laboratory manuals (for example, Davis, et al., Basic Methods In Molecular Biology (1986)).
Transcription of DNA encoding the polypeptides of the present invention by higher eukaryotes may be increased by inserting an enhancer sequence into the vector.
Enhancers are cis-acting elements of DNA, usually about from 10 to 300 nucleotides that act to increase transcriptional activity of a promoter in a given host cell-type.
Examples of enhancers include the SV40 enhancer, which is located on the late side of 15 the replication origin at nucleotides 100 to 270, the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the replication origin, and adenovirus enhancers.
For secretion of the translated polypeptide into the lumen of the endoplasmic reticulum, into the periplasmic space or into the extracellular environment, 20 appropriate secretion signals may be incorporated into the expressed polypeptide, for example, the amino acid sequence KDEL. The signals may be endogenous to the polypeptide or they may be heterologous signals.
The polypeptide may be expressed in a modified form, such as a fusion protein, and may include not only secretion signals, but also additional heterologous functional regions. For instance, a region of additional amino acids, particularly charged amino acids, may be added to the N-terminus of the polypeptide to improve stability and persistence in the host cell, during purification, or during subsequent handling and storage. Also, peptide moieties may be added to the polypeptide to facilitate purification. Such regions may be removed prior to final preparation of the polypeptide. The addition of peptide moieties to polypeptides to engender secretion or excretion, to improve stability and to facilitate purification, among others, are familiar and routine techniques in the art. A preferred fusion protein comprises a heterologous region from immunoglobulin that is useful to solubilize proteins. For example, EP-A-O 464 533 (Canadian counterpart 2045869) discloses fusion proteins comprising various portions of constant region of immunoglobulin molecules together with another human protein or part thereof. In many cases, the Fc part in a fusion protein is thoroughly advantageous for use in therapy and diagnosis and thus results, for example, in improved pharmacokinetic properties (EP-A 0232 262). On the other hand, for some uses it would be desirable to be able to delete the Fc part after the fusion protein has been expressed, detected and purified in the advantageous manner described. This is the case when Fc portion proves to be a hindrance to use in therapy and diagnosis, for example when the fusion protein is to be used as antigen for immunizations. In drug discovery, for example, human proteins, such as, hlL5-receptor has been fused with Fc portions for the purpose of high-throughput screening assays to identify antagonists of hIL-5. See Bennett, D. et al. (1995) J.
Molec. Recogn. 8:52-58 and Johanson, K. et al. (1995) J. Biol. Chem. 270 (16):9459-9471.
The E. faecalis polypeptides can be recovered and purified from recombinant 20 cell cultures by well-known methods including ammonium sulfate or ethanol precipitation, acid extraction, anion or cation exchange chromatography, phosphocellulose chromatography, hydrophobic interaction chromatography, affinity chromatography, hydroxylapatite chromatography, lectin chromatography and high performance liquid chromatography ("HPLC") is employed for purification.
Polypeptides of the present invention include naturally purified products, products of chemical synthetic procedures, and products produced by recombinant techniques from a prokaryotic or eukaryotic host, including, for example, bacterial, yeast, higher plant, insect and mammalian cells.
Polypeptides and Fragments The invention further provides an isolated E. faecalis polypeptide having an amino acid sequence in Table 1, or a peptide or polypeptide comprising a portion of the above polypeptides.
Variant and Mutant Polypeptides To improve or alter the characteristics of E. faecalis polypeptides of the present invention, protein engineering may be employed. Recombinant DNA technology known to those skilled in the art can be used to create novel mutant proteins or muteins including single or multiple amino acid substitutions, deletions, additions, or fusion proteins. Such modified polypeptides can show, enhanced activity or increased stability. In addition, they may be purified in higher yields and show better solubility than the corresponding natural polypeptide, at least under certain purification and storage conditions.
N-Terminal and C-Terminal Deletion Mutants
S
It is known in the art that one or more amino acids may be deleted from the N-terminus or C-terminus without substantial loss of biological function. For instance, Ron et al. J. Biol. Chem., 268:2984-2988 (1993), reported modified KGF 20 proteins that had heparin binding activity even if 3, 8, or 27 N-terminal amino acid residues were missing. Accordingly, the present invention provides polypeptides **having one or more residues deleted from the amino terminus of the amino acid sequence of the E. faecalis polypeptides shown in Table 1, and polynucleotides encoding such polypeptides.
25 Similarly, many examples of biologically functional C-terminal deletion muteins are known. For instance, Interferon gamma shows up to ten times higher activities by deleting 8-10 amino acid residues from the carboxy terminus of the protein See, Dobeli, et al. (1988) J. Biotechnology 7:199-216. Accordingly, the present invention provides polypeptides having one or more residues from the carboxy terminus of the amino acid sequence of the E.faecalis polypeptides shown in Table 1. The invention also provides polypeptides having one or more amino acids deleted from both the amino and the carboxyl termini as described below.
The present invention is further directed to polynucleotide encoding portions or fragments of the amino acid sequences described herein as well as to portions or fragments of the isolated amino acid sequences described herein. Fragments include portions of the amino acid sequences of Table 1, are at least 5 contiguous amino acid in length, are selected from any two integers, one of which representing a N-terminal position. The initiation codon of the polypeptides of thd present inventions position 1. Every combination of a N-terminal and C-terminal position that a fragment at least contiguous amino acid residues in length could occupy, on any given amino acid sequence of Table 1 is included in the invention. At least means a fragment may be contiguous amino acid residues in length or any integer between 5 and the number of residues in a full length amino acid sequence minus 1. Therefore, included in the 15 invention are contiguous fragments specified by any N-terminal and C-terminal positions of amino acid sequence set forth in Table 1 wherein the contiguous fragment S. is any integer between 5 and the number of residues in a full length sequence minus 1.
Further, the invention includes polypeptides comprising fragments specified by size, in amino acid residues, rather than by N-terminal and C-terminal positions.
The invention includes any fragment size, in contiguous amino acid residues, selected from integers between 5 and the number of residues in a full length sequence minus 1.
Preferred sizes of contiguous polypeptide fragments include about 5 amino acid residues, about 10 amino acid residues, about 20 amino acid residues, about 30 amino acid residues, about 40 amino acid residues, about 50 amino acid residues, about 100
S.
amino acid residues, about 200 amino acid residues, about 300 amino acid residues, and about 400 amino acid residues. The preferred sizes are, of course, meant to exemplify, not limit, the present invention as all size fragments representing any integer between 5 and the number of residues in a full length sequence minus 1 are included in the invention. The present invention also provides for the exclusion of any fragments specified by N-terminal and C-terminal positions or by size in amino acid residues as described above. Any number of fragments specified by N-terminal and C-terminal positions or by size in amino acid residues as described above may be excluded.
The above fragments need not be active since they would be useful, for example, in immunoassays, in epitope mapping, epitope tagging, to generate antibodies to a particular portion of the protein, as vaccines, and as molecular weight markers.
Other Mutants In addition to N- and C-terminal deletion forms of the protein discussed above, it also will be recognized by one of ordinary skill in the art that some amino acid sequences of the E. faecalis polypeptide can be varied without significant effect of the Sb'. structure or function of the protein. If such differences in sequence are contemplated, it should be remembered that there will be critical areas on the protein which determine activity.
Thus, the invention further includes variations of the E. faecalis polypeptides which show substantial E.faecalis polypeptide activity or which include regions of E.
•faecalis protein such as the protein portions discussed below. Such mutants include deletions, insertions, inversions, repeats, and type substitutions selected according to general rules known in the art so as to have little effect on activity. For example, guidance concerning how to make phenotypically silent amino acid substitutions is provided. There are two main approaches for studying the tolerance of an amino acid sequence to change. See, Bowie, J. U. et al. (1990), Science 247:1306-1310. The first method relies on the process of evolution, in which mutations are either accepted or rejected by natural selection. The second approach uses genetic engineering to introduce amino acid changes at specific positions of a cloned gene and selections or screens to identify sequences that maintain functionality.
These studies have revealed that proteins are surprisingly tolerant of amino acid substitutions. The studies indicate which amino acid changes are likely to be permissive at a certain position of the protein. For example, most buried amino acid residues require nonpolar side chains, whereas few features of surface side chains are generally conserved. Other such phenotypically silent substitutions are described by Bowie et al. (supra) and the references cited therein. Typically seen as conservative substitutions are the replacements, one for another, among the aliphatic amino acids Ala, Val, Leu and lie; interchange of the hydroxyl residues Ser and Thr, exchange of the acidic residues Asp and Glu, substitu-tion between the amide residues Asn and Gin, exchange of the basic residues Lys and Arg and replacements among the aromatic residues Phe, Tyr.
Thus, the fragment, derivative, analog, or homolog of the polypeptide of Table 1, or that encoded by the plaimds listed in Table 1, may be: one in which one or more of the amino acid residues are substituted with a conserved or non-conserved 0 amino acid residue (preferably a conserved amino acid residue) and such substituted 15 amino acid residue may or may not be one encoded by the genetic code: or (ii) one in which one or more of the amino acid residues includes a substituent group: or (iii) one 0* in which the E. faecalis polypeptide is fused with another compound, such as a compound to increase the half-life of the polypeptide (for example, polyethylene glycol): or (iv) one in which the additional amino acids are fused to the above form of the polypeptide, such as an IgG Fc fusion region peptide or leader or secretory sequence or a sequence which is employed for purification of the above form of the polypeptide or a proprotein sequence. Such fragments, derivatives and analogs are deemed to be within the scope of those skilled in the art from the teachings herein.
Thus, the E. faecalis polypeptides of the present invention may include one or more amino acid substitutions, deletions, or additions, either from natural mutations or human manipulation. As indicated, changes are preferably of a minor nature, such as conservative amino acid substitutions that do not significantly affect the folding or activity of the protein (see Table 3).
Amino acids in the E. faecalis proteins of the present invention that are essential for function can be identified by methods known in the art, such as sitedirected mutagenesis or alanine-scanning mutagenesis. See, Cunningham et al.
(1989) Science 244:1081-1085. The latter procedure introduces single alanine mutations at every residue in the molecule. The resulting mutant molecules are then tested for biological activity using assays appropriate for measuring the function of the particular protein.
Of special interest are substitutions of charged amino acids with other charged or neutral amino acids which may produce proteins with highly desirable improved characteristics, such as less aggregation. Aggregation may not only reduce activity but also be problematic when preparing pharmaceutical formulations, because aggregates can be immunogenic. See, Pinckard et al., (1967) Clin. Exp. Immunol. 2:331-340; Robbins, et al., (1987) Diabetes 36:838-845; Cleland, et al., (1993) Crit. Rev.
Therapeutic Drug Carrier Systems 10:307-377.
The polypeptides of the present invention are preferably provided in an 15 isolated form, and preferably are substantially purified. A recombinantly produced version of the E. faecalis polypeptide can be substantially purified by the one-step method described by Smith et al. (1988) Gene 67:31-40. Polypeptides of the invention also can be purified from natural or recombinant sources using antibodies 6*oooS directed against the polypeptides of the invention in methods which are well known in the art of protein purification.
The invention further provides for isolated E. faecalis polypeptides comprising an amino acid sequence selected from the group consisting of: the amino acid sequence of a full-length E. faecalis polypeptide having the complete amino acid sequence shown in Table 1; the amino acid sequence of a full-length E.
faecalis polypeptide having the complete amino acid sequence shown in Table I excepting the N-terminal methionine; the complete amino acid sequence encoded by the plaimds listed in Table 1; and the complete amino acid sequence excepting the N-terminal methionine encoded by the plaimds listed in Table 1. The polypeptides of the present invention also include polypeptides having an amino acid sequence at least 80% identical, more preferably at least 90% identical, and still more preferably 95%, 96%, 97%, 98% or 99% identical to those described in and above.
Further polypeptides of the present invention include polypeptides which have at least 90% similarity, more preferably at least 95% similarity, and still more preferably at least 96%, 97%, 98% or 99% similarity to those described above.
A further embodiment of the invention relates to a polypeptide which comprises the amino acid sequence of a E. faecalis polypeptide having an amino acid sequence which contains at least one conservative amino acid substitution, but not more than 50 conservative amino acid substitutions, not more than 40 conservative amino acid substitutions, not more than 30 conservative amino acid substitutions, and not more than 20 conservative amino acid substitutions. Also provided are polypeptides which comprise the amino acid sequence of a E. faecalis polypeptide, having at least one, but not more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 conservative amino 15 acid substitutions.
By a polypeptide having an amino acid sequence at least, for example, "identical" to a query amino acid sequence of the present invention, it is intended that the amino acid sequence of the subject polypeptide is identical to the query sequence except that the subject polypeptide sequence may include up to five amino acid alterations per each 100 amino acids of the query amino acid sequence. In other words, to obtain a polypeptide having an amino acid sequence at least 95% identical to a query amino acid sequence, up to 5% of the amino acid residues in the subject sequence may be inserted, deleted, (indels) or substituted with another amino acid.
;These alterations of the reference sequence may occur at the amino or carboxy terminal positions of the reference amino acid sequence or anywhere between those terminal positions, interspersed either individually among residues in the reference sequence or in one or more contiguous groups within the reference sequence.
As a practical matter, whether any particular polypeptide is at least 96%, 97%, 98% or 99% identical to, for instance, the amino acid sequences shown in Table 1 or to the amino acid sequence encoded by the plaimds listed in Table I can be determined conventionally using known computer programs. A preferred method for determining the best overall match between a query sequence (a sequence of the present invention) and a subject sequence, also referred to as a global sequence alignment, can be determined using the FASTDB computer program based on the algorithm of Brutlag et al., (1990) Comp. App. Biosci. 6:237-245. In a sequence alignment the query and subject sequences are both amino acid sequences. The result of said global sequence alignment is in percent identity. Preferred parameters used in a FASTDB amino acid alignment are: Matrix=PAM 0, k-tuple=2, Mismatch to Penalty=1, Joining Penalty=20, Randomization Group Length=0, Cutoff Score=l, Window Size=sequence length, Gap Penalty=5, Gap Size Penalty=0.05, Window Size=500 or the length of the subject amino acid sequence, whichever is shorter.
If the subject sequence is shorter than the query sequence due to N- or Cterminal deletions, not because of internal deletions, the results, in percent identity, I* must be manually corrected. This is because the FASTDB program does not account for N- and C-terminal truncations of the subject sequence when calculating global percent identity. For subject sequences truncated at the N- and C-termini, relative to the query sequence, the percent identity is corrected by calculating the number of •residues of the query sequence that are N- and C-terminal of the subject sequence, which are not matched/aligned with a corresponding subject residue, as a percent of the total bases of the query sequence. Whether a residue is matched/aligned is determined by results of the FASTDB sequence alignment. This percentage is then subtracted from the percent identity, calculated by the above FASTDB program using the specified parameters, to arrive at a final percent identity score. This final percent identity score is what is used for the purposes of the present invention. Only residues to the N- and C-termini of the subject sequence, which are not matched/aligned with the query sequence, are considered for the purposes of manually adjusting the percent identity score. That is, only query amino acid residues outside the farthest N- and C-terminal residues of the subject sequence.
For example, a 90 amino acid residue subject sequence is aligned with a 100 residue query sequence to determine percent identity. The deletion occurs at the Nterminus of the subject sequence and therefore, the FASTDB alignment does not match/align with the first 10 residues at the N-terminus. The 10 unpaired residues represent 10% of the sequence (number of residues at the N- and C- termini not matched/total number of residues in the query sequence) so 10% is subtracted from the percent identity score calculated by the FASTDB program. If the remaining residues were perfectly matched the final percent identity would be 90%. In another example, a 90 residue subject sequence is compared with a 100 residue query sequence. This time the deletions are internal so there are no residues at the N- or Ctermini of the subject sequence which are not matched/aligned with the query. In this case the percent identity calculated by FASTDB is not manually corrected. Once again, only residue positions outside the N- and C-terminal ends of the subject sequence, as displayed in the FASTDB alignment, which are not matched/aligned 15 with the query sequence are manually corrected. No other manual corrections are to made for the purposes of the present invention.
The above polypeptide sequences are included irrespective of whether they have their normal biological activity. This is because even where a particular polypeptide molecule does not have biological activity, one of skill in the art would 20 still know how to use the polypeptide, for instance, as a vaccine or to generate antibodies. Other uses of the polypeptides of the present invention that do not have E. faecalis activity include, inter alia, as epitope tags, in epitope mapping, and as molecular weight markers on SDS-PAGE gels or on molecular sieve gel filtration columns using methods known to those of skill in the art.
As described below, the polypeptides of the present invention can also be used to raise polyclonal and monoclonal antibodies, which are useful in assays for detecting E. faecalis protein expression or as agonists and antagonists capable of enhancing or inhibiting E. faecalis protein function. Further, such polypeptides can be used in the yeast two-hybrid system to "capture" E. faecalis protein binding proteins which are also candidate agonists and antagonists according to the present invention.
See, Fields et al. (1989) Nature 340:245-246.
Epilope-Bearing Portions In another aspect, the invention provides peptides and polypeptides comprising epitope-bearing portions of the E. faecalis polypeptides of the present invention. These epitopes are immunogenic or antigenic epitopes of the polypeptides of the present invention. An "immunogenic epitope" is defined as a part of a protein that elicits an antibody response when the whole protein or polypeptide is the irmmunogen. These immunogenic epitopes are believed to be confined to a few loci on the molecule. On the other hand, a region of a protein molecule to which an antibody can bind is defined as an "antigenic determinant" or "antigenic epitope." The number of immunogenic epitopes of a protein generally is less than the number of antigenic epitopes. See, Geysen, et al. (1983) Proc. Natl. Acad. Sci. USA 81:3998- 4002.
is' 5 Predicted antigenic epitopes are shown in Table 4, below. It is pointed out that Table 4 only lists amino acid residues comprising epitopes predicted to have the highest degree of antigenicity. The polypeptides not listed in Table 4 and portions of polypeptides not listed in Table 4 are not considered non-antigenic. This is because they may still be antigenic in vivo but merely not recognized as such by the particular algorithm used. Thus, Table 4 lists the amino acid residues comprising preferred antigenic epitopes but not a complete list. Amino acid residues comprising other anigenic epitopes may be determined by algorithms similar to the Jameson-Wolf analysis or by in vivo testing for an antigenic response using the methods described herein or those known in the art.
As to the selection of peptides or polypeptides bearing an antigenic epitope that contain a region of a protein molecule to which an antibody can bind), it is well known in that art that relatively short synthetic peptides that mimic part of a protein sequence are routinely capable of eliciting an antiserum that reacts with the partially mimicked protein. See, Sutcliffe, et al., (1983) Science 219:660-666.
Peptides capable of eliciting protein-reactive sera are frequently represented in the primary sequence of a protein, can be characterized by a set of simple chemical rules, and are confined neither to immunodominant regions of intact proteins immunogenic epitopes) nor to the amino or carboxyl terminals. Peptides that are extremely hydrophobic and those of six or fewer residues generally are ineffective at inducing antibodies that bind to the mimicked protein; longer, peptides, especially those containing proline residues, usually are effective. See, Sutcliffe, et al., supra, p.
661. For instance, 18 of 20 peptides designed according to these guidelines, containing 8-39 residues covering 75% of the sequence of the influenza virus hemagglutinin HA1 polypeptide chain, induced antibodies that reacted with the HAl protein or intact virus; and 12/12 peptides from the MuLV polymerase and 18/18 from the rabies glycoprotein induced antibodies that precipitated the respective proteins.
Antigenic epitope-bearing peptides and polypeptides of the invention are therefore useful to raise antibodies, including monoclonal antibodies, that bind 15 specifically to a polypeptide of the invention. Thus, a high proportion of hybridomas obtained by fusion of spleen cells from donors immunized with an antigen epitope-bearing peptide generally secrete antibody reactive with the native protein.
See Sutcliffe, et al., supra, p. 663. The antibodies raised by antigenic epitope-bearing peptides or polypeptides are useful to detect the mimicked protein, and antibodies to 20 different peptides may be used for tracking the fate of various regions of a protein precursor which undergoes post-translational processing. The peptides and oo• anti-peptide antibodies may be used in a variety of qualitative or quantitative assays for the mimicked protein, for instance in competition assays since it has been shown that even short peptides about 9 amino acids) can bind and displace the larger 25 peptides in immunoprecipitation assays. See, Wilson, et al., (1984) Cell 37:767-778. The anti-peptide antibodies of the invention also are useful for purification of the mimicked protein, for instance, by adsorption chromatography using methods known in the art.
Antigenic epitope-bearing peptides and polypeptides of the invention designed according to the above guidelines preferably contain a sequence of at least seven, more preferably at least nine and most preferably between about 10 to about amino acids any integer between 7 and 50) contained within the amino acid sequence of a polypeptide of the invention. However, peptides or polypeptides comprising a larger portion of an amino acid sequence of a polypeptide of the invention, containing about 50 to about 100 amino acids, or any length up to and including the entire amino acid sequence of a polypeptide of the invention, also are considered epitope-bearing peptides or polypeptides of the invention and also are useful for inducing antibodies that react with the mimicked protein. Preferably, the amino acid sequence of the epitope-bearing peptide is selected to provide substantial solubility in aqueous solvents the sequence includes relatively hydrophilic residues and highly hydrophobic sequences are preferably avoided); and sequences containing proline residues are particularly preferred.
Non-limiting examples of antigenic polypeptides or peptides that can be used to generate an enterococcal-specific immune response or antibodies include portions of the amino acid sequences identified in Table 1. More specifically, Table 4 discloses a list of non-limiting residues that are involved in the antigenicity of the epitope-bearing fragments of the present invention. Therefore, the present inventions provides for isolatd and purified antigenic epitope-bearing fragements of the polypeptides of the present invention comprising a peptide sequences of Table 4. The antigenic epitopebearing fragments comprising a peptide sequence of Table 4 preferably contain a sequence of at least seven, more preferably at least nine and most preferably between about 10 to about 50 amino acids any integer between 7 and 50) of a polypeptide of the present invention. That is, included in the present invention are antigenic polypeptides between the integers of 7 and 50 amino acid in length comprising one or more of the sequences of Table 4. Therefore, in most cases, the polypeptides of Table 4 make up only a portion of the antigenic polypeptide. All combinations of sequences between the integers of 7 and 50 amino acid in length comprising one or more of the sequences of Table 4 are included. The antigenic epitope-bearing fragements may be specified by either the number of contiguous amino acid residues or by specific N-terminal and C-terminal positions as described above for the polypeptide fragements of the present invention, wherein the initiation codon is residue 1. Any number of the described antigenic epitope-bearing fragements of the present invention may also be excluded from the present invention in the same manner.
The epitope-bearing peptides and polypeptides of the invention may be produced by any conventional means for making peptides or polypeptides including recombinant means using nucleic acid molecules of the invention. For instance, an epitope-bearing amino acid sequence of the present invention may be fused to a larger polypeptide which acts as a carrier during recombinant production and purification, as well as during immunization to produce anti-peptide antibodies. Epitope-bearing peptides also may be synthesized using known methods of chemical synthesis. For "instance, Houghten has described a simple method for synthesis of large numbers of 15 peptides, such as 10-20 mg of 248 different 13 residue peptides representing single amino acid variants of a segment of the HA1 polypeptide which were prepared and characterized (by ELISA-type binding studies) in less than four weeks (Houghten, R.
A. Proc. Natl. Acad. Sci. USA 82:5131-5135 (1985)). This "Simultaneous Multiple Peptide Synthesis (SMPS)" process is further described in U.S. Patent No. 4,631,211 to.Houghten and coworkers (1986). In this procedure the individual resins for the solid-phase synthesis of various peptides are contained in separate solvent-permeable packets, enabling the optimal use of the many identical repetitive steps involved in solid-phase methods. A completely manual procedure allows 500-1000 or more syntheses to be conducted simultaneously (Houghten et al. (1985) Proc. Natl. Acad.
Sci. 82:5131-5135 at 5134.
Epitope-bearing peptides and polypeptides of the invention are used to induce antibodies according to methods well known in the art. See, Sutcliffe, et al., supra;; Wilson, et al., supra;; and Bittle, et al. (1985) J. Gen. Virol. 66:2347-2354.
Generally, animals may be immunized with free peptide; however, anti-peptide antibody titer may be boosted by coupling of the peptide to a macromolecular carrier, such as keyhole limpet hemacyanin (KLH) or tetanus toxoid. For instance, peptides containing cysteine may be coupled to carrier using a linker such as m-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS), while other peptides may be coupled to carrier using a more general linking agent such as glutaraldehyde.
Animals such as rabbits, rats and mice are immunized with either free or carrier-coupled peptides, for instance, by intraperitoneal and/or intradermal injection of emulsions containing about 100 p.g peptide or carrier protein and Freund's adjuvant.
Several booster injections may be needed, for instance, at intervals of about two weeks, to provide a useful titer of anti-peptide antibody which can be detected, for example, by ELISA assay using free peptide adsorbed to a solid surface. The titer of anti-peptide antibodies in serum from an immunized animal may be increased by selection of anti-peptide antibodies, for instance, by adsorption to the peptide on a solid support and elution of the selected antibodies according to methods well known in the art.
Immunogenic epitope-bearing peptides of the invention, those parts of a protein that elicit an antibody response when the whole protein is the immunogen, are identified according to methods known in the art. For instance, Geysen, el al., supra, I discloses a procedure for rapid concurrent synthesis on solid supports of hundreds of peptides of sufficient purity to react in an ELISA. Interaction of synthesized peptides with antibodies is then easily detected without removing them from the support. In this manner a peptide bearing an immunogenic epitope of a desired protein may be identified routinely by one of ordinary skill in the art. For instance, the immunologically important epitope in the coat protein of foot-and-mouth disease virus was located by Geysen el al. supra with a resolution of seven amino acids by synthesis of an overlapping set of all 208 possible hexapeptides covering the entire 213 amino acid sequence of the protein. Then, a complete replacement set of peptides in which all 20 amino acids were substituted in turn at every position within the epitope were synthesized, and the particular amino acids conferring specificity for the reaction with antibody were determined. Thus, peptide analogs of the epitope-bearing peptides of the invention can be made routinely by this method. U.S. Patent No.
4,708,781 to Geysen (1987) further describes this method of identifying a peptide bearing an imnnunogenic epitope of a desired protein.
Further still, U.S. Patent No. 5,194,392, to Geysen (1990), describes a general method of detecting or determining the sequence of monomers (amino acids or other compounds) which is a topological equivalent of the epitope a "mimotope") which is complementary to a particular paratope (antigen binding site) of an antibody of interest. More generally, U.S. Patent No. 4,433,092, also to Geysen (1989), describes a method of detecting or determining a sequence of monomers which is a topographical equivalent of a ligand which is complementary to the ligand binding site of a particular receptor of interest. Similarly, U.S. Patent No. 5,480,971 to Houghten, R. A. et al. (1996) discloses linear C 1
-C
7 -alkyl peralkylated oligopeptides and sets and libraries of such peptides, as well as methods for using such oligopeptide sets and 15 libraries for determining the sequence of a peralkylated oligopeptide that preferentially binds to an acceptor molecule of interest. Thus, non-peptide analogs of the epitope-bearing peptides of the invention also can be made routinely by these methods. The entire disclosure of each document cited in this section on "Polypeptides and Fragments" is hereby incorporated herein by reference.
As one of skill in the art will appreciate, the polypeptides of the present invention and the epitope-bearing fragments thereof described above can be combined with parts of the constant domain of immunoglobulins (IgG), resulting in chimeric polypeptides. These fusion proteins facilitate purification and show an increased half-life in vivo. This has been shown, for chimeric proteins consisting of the first two domains of the human CD4-polypeptide and various domains of the constant regions of the heavy or light chains of mammalian immunoglobulins. (EPA 0,394,827; Traunecker et al. (1988) Nature 331:84-86. Fusion proteins that have a disulfide-linked dimeric structure due to the IgG part can also be more efficient in binding and neutralizing other molecules than a monomeric E. faecalis polypeptide or fragment thereof alone. See Fountoulakis et al. (1995) J. Biochem. 270:3958-3964.
Nucleic acids encoding the above epitopes of E. faecalis polypeptides can also be recombined with a gene of interest as an epitope tag to aid in detection and purification of the expressed polypeptide.
Antibodies E. faecalis protein-specific antibodies for use in the present invention can be raised against the intact E. faecalis protein or an antigenic polypeptide fragment thereof, which may be presented together with a carrier protein, such as an albumin, to an animal system (such as rabbit or mouse) or, if it is long enough (at least about amino acids), without a carrier.
As used herein, the term "antibody" (Ab) or "monoclonal antibody" (Mab) is meant to include intact molecules, single chain whole antibodies, and antibody fragments. Antibody fragments of the present invention include Fab and F(ab')2 and other fragments including single-chain Fvs (scFv) and disulfide-linked Fvs (sdFv).
"Also included in the present invention are chimeric and humanized monoclonal antibodies and polyclonal antibodies specific for the polypeptides of the present invention. The antibodies of the present invention may be prepared by any of a variety of methods. For example, cells expressing a polypeptide of the present invention or an antigenic fragment thereof can be administered to an animal in order to induce the production of sera containing polyclonal antibodies. For example, a preparation of E. faecalis polypeptide or fragment thereof is prepared and purified to render it substantially free of natural contaminants. Such a preparation is then introduced into an animal in order to produce polyclonal antisera of greater specific activity.
In a preferred method, the antibodies of the present invention are monoclonal antibodies or binding fragments thereof. Such monoclonal antibodies can be prepared using hybridoma technology. See, Harlow et al., ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory Press, 2nd ed. 1988); Hammerling, et al., in: MONOCLONAL ANTIBODIES AND T-CELL HYBRIDOMAS 563-681 (Elsevier, 1981). Fab and F(ab')2 fragments may be produced by proteolytic cleavage, using enzymes such as papain (to produce Fab fragments) or pepsin (to produce F(ab')2 fragments). Alternatively, E. faecalis polypeptide-binding fragments, chimeric, and humanized antibodies can be produced through the application of recombinant DNA technology or through synthetic chemistry using methods known in the art.
Alternatively, additional antibodies capable of binding to the polypeptide antigen of the present invention may be produced in a two-step procedure through the use of anti-idiotypic antibodies. Such a method makes use of the fact that antibodies are themselves antigens, and that, therefore, it is possible to obtain an antibody which binds to a second antibody. In accordance with this method, E. faecalis polypeptide-specific antibodies are used to immunize an animal, preferably a mouse.
The splenocytes of such an animal are then used to produce hybridoma cells, and the 15 hybridoma cells are screened to identify clones which produce an antibody whose ability to bind to the E. faecalis polypeptide-specific antibody can be blocked by the E. faecalis polypeptide antigen. Such antibodies comprise anti-idiotypic antibodies to the E.faecalis polypeptide-specific antibody and can be used to immunize an animal *to induce formation of further E. faecalis polypeptide-specific antibodies.
20 Antibodies and fragements thereof of the present invention may be described by the portion of a polypeptide of the present invention recognized or specifically bound by the antibody. Antibody binding fragements of a polypeptide of the present invention may be described or specified in the same manner as for polypeptide fragements discussed above., i.e, by N-terminal and C-terminal positions or by size in contiguous amino acid residues. Any number of antibody binding fragments, of a polypeptide of the present invention, specified by N-terminal and C-terminal positions or by size in amino acid residues, as described above, may also be excluded from the present invention. Therefore, the present invention includes antibodies the specifically bind a particuarlly discribed fragement of a polypeptide of the present invention and allows for the exclusion of the same.
Antibodies and fragements thereof of the present invention may also be described or specified in terms of their cross-reactivity. Antibodies and fragements that do not bind polypeptides of any other species of Enterococcus other than E.
faecalis are included in the present invention. Likewise, antibodies and fragements that bind only species of Enterococcus, i.e. antibodies and fragements that do not bind bacteria from any genus other than Enterococcus, are included in the present invention.
Diagnostic Assays The present invention further relates to methods for assaying staphylococcal infection in an animal by detecting the expression of genes encoding staphylococcal polypeptides of the present invention. The methods comprise analyzing tissue or body fluid from the animal for Enterococcus-specific antibodies, nucleic acids, or proteins. Analysis of nucleic acid specific to Enterococcus is assayed by PCR or hybridization techniques using nucleic acid sequences of the present invention as either hybridization probes or primers. See, Sambrook et al. Molecular cloning: A Laboratory Manual (Cold Spring Harbor Laboratory Press, 2nd ed., 1989, page 54 *:reference); Eremeeva et al. (1994) J. Clin. Microbiol. 32:803-810 (describing differentiation among spotted fever group Rickettsiae species by analysis of restriction fragment length polymorphism of PCR-amplified DNA) and Chen et al. 1994 J. Clin.
Microbiol. 32:589-595 (detecting B. burgdorferi nucleic acids via PCR).
Where diagnosis of a disease state related to infection with Enterococcus has already been made, the present invention is useful for monitoring progression or regression of the disease state whereby patients exhibiting enhanced Enterococcus gene expression will experience a worse clinical outcome relative to patients expressing these gene(s) at a lower level.
By "biological sample" is intended any biological sample obtained from an animal, cell line, tissue culture, or other source which contains Enterococcus polypeptide, mRNA, or DNA. Biological samples include body fluids (such as saliva, blood, plasma, urine, mucus, synovial fluid, etc.) tissues (such as muscle, skin, and cartilage) and any other biological source suspected of containing Enterococcus polypeptides or nucleic acids. Methods for obtaining biological samples such as tissue are well known in the art.
The present invention is useful for detecting diseases related to Enterococcus infections in animals. Preferred animals include monkeys, apes, cats, dogs, birds, cows, pigs, mice, horses, rabbits and humans. Particularly preferred are humans.
Total RNA can be isolated from a biological sample using any suitable technique such as the single-step guanidinium-thiocyanate-phenol-chloroform method described in Chomczynski et al. (1987) Anal. Biochem. 162:156-159. mRNA encoding Enterococcus polypeptides having sufficient homology to the nucleic acid sequences identified in Table 1 to allow for hybridization between complementary sequences are then assayed using any appropriate method. These include Northern blot analysis, S 1 nuclease mapping, the polymerase chain reaction (PCR), reverse transcription in combination with the polymerase chain reaction (RT-PCR), and reverse transcription in combination with the ligase chain reaction (RT-LCR).
Northern blot analysis can be performed as described in Harada et al. (1990) Cell 63:303-312. Briefly, total RNA is prepared from a biological sample as described above. For the Northern blot, the RNA is denatured in an appropriate buffer (such as glyoxal/dimethyl sulfoxide/sodium phosphate buffer), subjected to agarose gel electrophoresis, and transferred onto a nitrocellulose filter. After the RNAs have been linked to the filter by a UV linker, the filter is prehybridized in a solution containing formamide, SSC, Denhardt's solution, denatured salmon sperm, SDS, and sodium 25 phosphate buffer. A E. faecalis polynucleotide sequence shown in Table 1 labeled according to any appropriate method (such as the 2 P-multiprimed DNA labeling system (Amersham)) is used as probe. After hybridization overnight, the filter is washed and exposed to x-ray film. DNA for use as probe according to the present invention is described in the sections above and will preferably at least 15 nucleotides in length.
S1 mapping can be performed as described in Fujita et al. (1987) Cell 49:357-367. To prepare probe DNA for use in SI mapping, the sense strand of an above-described E.faecalis DNA sequence of the present invention is used as a template to synthesize labeled antisense DNA. The antisense DNA can then be digested using an appropriate restriction endonuclease to generate further DNA probes of a desired length. Such antisense probes are useful for visualizing protected bands corresponding to the target mRNA mRNA encoding Enterococcus polypeptides).
Levels of mRNA encoding Enterococcus polypeptides are assayed, for e.g., using the RT-PCR method described in Makino ct al. (1990) Technique 2:295-301.
By this method, the radioactivities of the "amplicons" in the polyacrylamide gel bands are linearly related to the initial concentration of the target mRNA. Briefly, this method involves adding total RNA isolated from a biological sample in a reaction 15 mixture containing a RT primer and appropriate buffer. After incubating for primer annealing, the mixture can be supplemented with a RT buffer, dNTPs, DTT, RNase inhibitor and reverse transcriptase. After incubation to achieve reverse transcription of the RNA, the RT products are then subject to PCR using labeled primers.
Alternatively, rather than labeling the primers, a labeled dNTP can be included in the 2 PCR reaction mixture. PCR amplification can be performed in a DNA thermal cycler according to conventional techniques. After a suitable number of rounds to achieve amplification, the PCR reaction mixture is electrophoresed on a polyacrylamide gel.
After drying the gel, the radioactivity of the appropriate bands (corresponding to the mRNA encoding the Enterococcus polypeptides of the present invention) are 25 quantified using an imaging analyzer. RT and PCR reaction ingredients and conditions, reagent and gel concentrations, and labeling methods are well known in the art. Variations on the RT-PCR method will be apparent to the skilled artisan. Other PCR methods that can detect the nucleic acid of the present invention can be found in PCR PRIMER: A LABORATORY MANUAL Dieffenbach et al. eds., Cold Spring Harbor Lab Press, 1995).
The polynucleotides of the present invention, including both DNA and RNA, may be used to detect polynucleotides of the present invention or Enterococcal species including E. faecalis using bio chip technology. The present invention includes both high density chip arrays (>1000 oligonucleotides per cm 2 and low density chip arrays (<1000 oligonucleotides per cm 2 Bio chips comprising arrays of polynucleotides of the present invention may be used to detect Enterococcal species, including E. faecalis, in biological and environmental samples and to diagnose an animal, including humans, with an E. faecalis or other Enterococcal infection. The bio chips of the present invention may comprise polynucleotide sequences of other pathogens including bacteria, viral, parasitic, and fungal polynucleotide sequences, in addition to the polynucleotide sequences of the present invention, for use in rapid diffenertial pathogenic detection and diagnosis. The bio chips can also be used to monitor an E. faecalis or other Enterococcal infections and to monitor the genetic 15 changes (deletions, insertions, mismatches, etc.) in response to drug therapy in the "clinic and drug development in the laboratory. The bio chip technology comprising arrays of polynucleotides of the present invention may also be used to simultaneously monitor the expression of a multiplicity of genes, including those of the present invention. The polynucleotides used to comprise a selected array may be specified in the same manner as for the fragements, i.e, by their 5' and 3' positions or length in contigious base pairs and include from. Methods and particular uses of the polynucleotides of the present invention to detect Enterococcal species, including E.
faecalis, using bio chip technology include those known in the art and those of: U.S.
Patent Nos. 5510270, 5545531, 5445934, 5677195, 5532128, 5556752, 5527681, 25 5451683, 5424186, 5607646, 5658732 and World Patent Nos. WO/9710365, WO/9511995, WO/9743447, WO/9535505, each incorporated herein in their entireties.
Biosensors using the polynucleotides of the present invention may also be used to detect, diagnose, and monitor E. faecalis or other Enterococcal species and infections thereof. Biosensors using the polynucleotides of the present invention may also be used to detect particular polynucleotides of the present invention. Biosensors using the polynucleotides of the present invention may also be used to monitor the genetic changes (deletions, insertions, mismatches, etc.) in response to drug therapy in the clinic and drug development in the laboratory. Methods and particular uses of the polynucleotides of the present invention to detect Enterococcal species, including E.
faecalis, using biosenors include those known in the art and those of: U.S. Patent Nos 5721102, 5658732, 5631170, and WorldPatentNos. W097/35011, WO/9720203, each incorporated herein in their entireties.
Thus, the present invention includes both bio chips and biosensors comprising polynucleotides of the present invention and methods of their use.
Assaying Enterococcus polypeptide levels in a biological sample can occur using any art-known method, such as antibody-based techniques. For example, Enterococcus polypeptide expression in tissues can be studied with classical 15 immunohistological methods. In these, the specific recognition is provided by the primary antibody (polyclonal or monoclonal) butthe secondary detection system can utilize fluorescent, enzyme, or other conjugated secondary antibodies. As a result, an immunohistological staining of tissue section for pathological examination is obtained.
Tissues can also be extracted, with urea and neutral detergent, for the liberation of
S
Enterococcus polypeptides for Western-blot or dot/slot assay. See, Jalkanen, M.
etal. (1985) J. Cell. Biol. 101:976-985; Jalkanen, M. et al. (1987) J. Cell. Biol.
105:3087-3096. In this technique, which is based on the use of cationic solid phases, quantitation of a Enterococcus polypeptide can be accomplished using an isolated Enterococcus polypeptide as a standard. This technique can also be applied to body 25 fluids.
Other antibody-based methods useful for detecting Enterococcus polypeptide gene expression include immunoassays, such as the ELISA and the radioimmunoassay (RIA). For example, a Enterococcus polypeptide-specific monoclonal antibodies can be used both as an immunoabsorbent and as an enzyme-labeled probe to detect and quantify a Enterococcus polypeptide. The amount of a Enierococcus polypeptide present in the sample can be calculated by reference to the amount present in a standard preparation using a linear regression computer algorithm. Such an ELISA is described in lacobelli et al. (1988) Breast Cancer Research and Treatment 11:19-30. In another ELISA assay, two distinct specific monoclonal antibodies can be used to detect Enterococcus polypeptides in a body fluid. In this assay, one of the antibodies is used as the immunoabsorbent and the other as the enzyme-labeled probe.
The above techniques may be conducted essentially as a "one-step" or "two-step" assay. The "one-step" assay involves contacting the Enterococcus to polypeptide with irmnobilized antibody and, without washing, contacting the mixture with the labeled antibody. The "two-step" assay involves washing before contacting the mixture with the labeled antibody. Other conventional methods may also be employed as suitable. It is usually desirable to immobilize one component of the assay system on a support, thereby allowing other components of the system to be 15 brought into contact with the component and readily removed from the sample.
Variations of the above and other immunological methods included in the present invention can also be found in Harlow et al., ANTIBODIES: A LABORATORY *so MANUAL, (Cold Spring Harbor Laboratory Press, 2nd .ed. 1988).
Suitable enzyme labels include, for example, those from the oxidase group, 0 20 which catalyze the production of hydrogen peroxide by reacting with substrate.
Glucose oxidase is particularly preferred as it has good stability and its substrate (glucose) is readily available. Activity of an oxidase label may be assayed by measuring the concentration of hydrogen peroxide formed by the enzyme-labeled antibody/substrate reaction. Besides enzymes, other suitable labels include 25 radioisotopes, such as iodine (125, 1211), carbon 4 sulphur (35S), tritium 3
H),
indium 1 2 1n), and technetium and fluorescent labels, such as fluorescein and rhodamine, and biotin.
Further suitable labels for the Enterococcus polypeptide-specific antibodies of the present invention are provided below. Examples of suitable enzyme labels include malate dehydrogenase, Enterococcal nuclease, delta-5-steroid isomerase, yeast-alcohol dehydrogenase, alpha-glycerol phosphate dehydrogenase, triose phosphate isomerase, peroxidase, alkaline phosphatase, asparaginase, glucose oxidase, beta-galactosidase, ribonuclease, urease, catalase, glucose-6-phosphate dehydrogenase, glucoamylase, and acetylcholine esterase.
Examples of suitable radioisotopic labels include 3 H, 1251, 1311 32 P, 35
S,
14C, 51 Cr, sTTo, 5 sCo, 59 Fe, 75 Se, 5 2 Eu, 90Y, 67 Cu, 217 Ci, 21'At, 212 Pb, 4 7Sc, '1Pd, etc.
In is a preferred isotope where in vivo imaging is used since its avoids the problem of dehalogenation of the 1251 or 31 -labeled monoclonal antibody by the liver. In addition, this radionucleotide has a more favorable gamma emission energy for imaging.
See, Perkins et al. (1985) Eur. J. Nucl. Med. 10:296-301; Carasquillo et al.
(1987) J. Nucl. Med. 28:281-287. For example, "In coupled to monoclonal antibodies with -(P-isothiocyanatobenzyl)-DPTA has shown little uptake in non-tumors tissues, particularly the liver, and therefore enhances specificity of tumor 15 localization. See, Esteban et al. (1987) J. Nucl. Med. 28:861-870.
Examples of suitable non-radioactive isotopic labels include 57 Gd, "Mn, i 62 Dy, 52 Tr, and 5 6 Fe.
Examples of suitable fluorescent labels include an 5 2 Eu label, a fluorescein label, an isothiocyanate label, a rhodamine label, a phycoerythrin label, a phycocyanin label, an allophycocyanin label, an o-phthaldehyde label, and a fluorescamine label.
Examples of suitable toxin labels include, Pseudomonas toxin, diphtheria toxin, ricin, and cholera toxin.
Examples of chemiluminescent labels include a luminal label, an isoluminal label, an aromatic acridinium ester label, an imidazole label, an acridinium salt label, an oxalate ester label, a luciferin label, a luciferase label, and an aequorin label.
Examples of nuclear magnetic resonance contrasting agents include heavy metal nuclei such as Gd, Mn, and iron.
Typical techniques for binding the above-described labels to antibodies are provided by Kennedy et al. (1976) Clin. Chim. Acta 70:1-31, and Schurs et al. (1977) Clin. Chim. Acta 81:1-40. Coupling techniques mentioned in the latter are the glutaraldehyde method, the periodate method, the dimaleimide method, the m-maleimidobenzyl-N-hydroxy-succinimide ester method, all of which methods are incorporated by reference herein.
In a related aspect, the invention includes a diagnostic kit for use in screening serum containing antibodies specific against E. faecalis infection. Such a kit may include an isolated E. faecalis antigen comprising an epitope which is specifically immunoreactive with at least one anti-E.faecalis antibody. Such a kit also includes means for detecting the binding of said antibody to the antigen. In specific embodiments, the kit may include a recombinantly produced or chemically synthesized peptide or polypeptide antigen. The peptide or polypeptide antigen may be attached to a solid support.
In a more specific embodiment, the detecting means of the above-described kit includes a solid support to which said peptide or polypeptide antigen is attached.
Such a kit may also include a non-attached reporter-labeled anti-human antibody. In this embodiment, binding of the antibody to the E. faecalis antigen can be detected by binding of the reporter labeled antibody to the anti-E. faecalis polypeptide antibody.
In a related aspect, the invention includes a method of detecting E. faecalis infection in a subject. This detection method includes reacting a body fluid, preferably 20 serum, from the subject with an isolated E. faecalis antigen, and examining the antigen for the presence of bound antibody. In a specific embodiment, the method includes a polypeptide antigen attached to a solid support, and serum is reacted with the support. Subsequently, the support is reacted with a reporter-labeled anti-human antibody. The support is then examined for the presence of reporter-labeled antibody.
The solid surface reagent employed in the above assays and kits is prepared by known techniques for attaching protein material to solid support material, such as polymeric beads, dip sticks, 96-well plates or filter material. These attachment methods generally include non-specific adsorption of the protein to the support or covalent attachment of the protein typically through a free amine group, to a chemically reactive group on the solid support, such as an activated carboxyl, hydroxyl, or aldehyde group. Alternatively, streptavidin coated plates can be used in conjunction with biotinylated antigen(s).
The polypeptides and antibodies of the present invention, including fragments thereof, may be used to detect Enterococcal species including E. faecalis using bio chip and biosensor technology. Bio chip and biosensors of the present invention may comprise the polypeptides of the present invention to detect antibodies, which specifically recognize Enterococcal species, including E. faecalis. Bio chip and biosensors of the present invention may also comprise antibodies which specifically recognize the polypeptides of the present invention to detect Enterococcal species, including E. faecalis or specific polypeptides of the present invention. Bio chips or biosensors comprising polypeptides or antibodies of the present invention may be "used todetect Enterococcal species, including E. faecalis, in biological and S. 15 environmental samples and to diagnose an animal, including humans, with an E.
faecalis or other Enterococcal infection. Thus, the present invention includes both bio "chips and biosensors comprising polypeptides or antibodies of the present invention oo and methods of their use.
The bio chips of the present invention may further comprise polypeptide sequences of other pathogens including bacteria, viral, parasitic, and fungal polypeptide sequences, in addition to the polypeptide sequences of the present invention, for use in rapid diffenertial pathogenic detection and diagnosis. The bio chips of the present invention may further comprise antibodies or fragements thereof specific for other pathogens including bacteria, viral, parasitic, and fungal polypeptide sequences, in addition to the antibodies or fragements thereof of the present invention, for use in rapid diffenertial pathogenic detection and diagnosis. The bio chips and biosensors of the present invention may also be used to monitor an E. faecalis or other Enterococcal infection and to monitor the genetic changes (amio acid deletions, insertions, substitutions, etc.) in response to drug therapy in the clinic and drug development in the laboratory. The bio chip and biosensors comprising polypeptides or antibodies of the present invention may also be used to simultaneously monitor the expression of a multiplicity of polypeptides, including those of the present invention.
The polypeptides used to comprise a bio chip or biosensor of the present invention may be specified in the same manner as for the fragements, i.e, by their N-tenninal and C-terminal positions or length in contigious amino acid residue. Methods and particular uses of the polypeptides and antibodies of the present invention to detect Enterococcal species, including E. faecalis, or specific polypeptides using bio chip and biosensor technology include those known in the art, those of the U.S. Patent Nos.
and World Patent Nos. listed above for bio chips and biosensors using polynucleotides of the present invention, and those of: U.S. Patent Nos. 5658732, 5135852, 5567301, 5677196, 5690894 and World Patent Nos. W09729366, 1 W09612957, each incorporated herein in their entireties.
15 Treatment: Agonists and Antagonists Assays and Molecules.
The invention also provides a method of screening compounds to identify those which enhance or block the biological activity of the E. faecalis polypeptides of the present invention. The present invention further provides where the compounds kill or slow the growth of E.faecalis. The ability of E.faecalis antagonists, including E. faecalis ligands, to prophylactically or therapeutically block antibiotic resistance may be easily tested by the skilled artisan. See, Straden et al. (1997) J Bacteriol.
179(1):9-16.
An agonist is a compound which increases the natural biological function or which functions in a manner similar to the polypeptides of the present invention, while antagonists decrease or eliminate such functions. Potential antagonists include small organic molecules, peptides, polypeptides, and antibodies that bind to a polypeptide of the invention and thereby inhibit or extinguish its activity.
The antagonists may be employed for instance to inhibit peptidoglycan cross bridge formation. Antibodies against E. faecalis may be employed to bind to and inhibit E. faecalis activity to treat antibiotic resistance. Any of the above antagonists may be employed in a composition with a pharmaceutically acceptable carrier.
Vaccines The present invention also provides vaccines comprising one or more polypeptides of the present invention. Heterogeneity in the composition of a vaccine may be provided by combining E. faecalis polypeptides of the present invention.
Multi-component vaccines of this type are desirable becautse they are likely to be more effective in eliciting protective immune responses against multiple species and strains of the Enterococcus genus than single polypeptide vaccines.
Multi-component vaccines are known in the art to elicit antibody production to numerous immunogenic components. See, Decker et al. (1996) J. Infect. Dis.
174:S270-275. In addition, a hepatitis B, diphtheria, tetanus, pertussis tetravalent 15 vaccine has recently been demonstrated to elicit protective levels of antibodies in human infants against all four pathogenic agents. See, Aristegui, J. et al. (1997) Vaccine 15:7-9.
The present invention in addition to single-component vaccines includes multi-component vaccines. These vaccines comprise more than one polypeptide, o. 20 immunogen or antigen. Thus, a multi-component vaccine would be a vaccine comprising more than one of the E. faecalis polypeptides of the present invention.
Further within the scope of the invention are whole cell and whole viral vaccines. Such vaccines may be produced recombinantly and involve the expression of one or more of the E. faecalis polypeptides described in Table 1. For example, the E. faecalis polypeptides of the present invention may be either secreted or localized intracellular, on the cell surface, or in the periplasmic space. Further, when a recombinant virus is used, the E. faecalis polypeptides of the present invention may, for example, be localized in the viral envelope, on the surface of the capsid, or internally within the capsid. Whole cells vaccines which employ cells expressing heterologous proteins are known in the art. See, Robinson, K. et al. (1997) Nature Biotech. 15:653-657; Sirard, J. et al. (1997) Infect. Immun. 65:2029-2033; Chabalgoity, J. et al. (1997) Infect. Immun. 65:2402-2412 These cells may be administered live or may be killed prior to administration. Chabalgoity, J. et al., supra, for example, report the successful use in mice of a live attenuated Sahnonella vaccine strain which expresses a portion of a platyhelminth fatty acid-binding protein as a fusion protein on its cells surface.
A multi-component vaccine can also be prepared using techniques known in the art by combining one or more E. faecalis polypeptides of the present invention, or fragments thereof, with additional non-Enterococcal components diphtheria toxin or tetanus toxin, and/or other compounds known to elicit an immune response).
Such vaccines are useful for eliciting protective immune responses to both members of the Enterococcus genus and non-Enterococcal pathogenic agents.
The vaccines of the present invention also include DNA vaccines. DNA vaccines are currently being developed for a number of infectious diseases. See, et al., Boyer, et al. (1997) Nat. Med. 3:526-532; reviewed in Spier, R. (1996) Vaccine 14:1285-1288. Such DNA vaccines contain a nucleotide sequence encoding one or more E. faecalis polypeptides of the present invention oriented in a manner that allows for expression of the subject polypeptide. For example, the direct administration of plasmid DNA encoding B. burgdorgeri OspA has been shown to elicit protective immunity in mice against borrelial challenge. See, Luke et al. (1997) J.
Infect. Dis. 175:91-97.
"The present invention also relates to the administration of a vaccine which is co-administered with a molecule capable of modulating immune responses. Kim et al.
(1997) Nature Biotech. 15:641-646, for example, report the enhancement of immune responses produced by DNA immunizations when DNA sequences encoding molecules which stimulate the immune response are co-administered. In a similar fashion, the vaccines of the present invention may be co-administered with either nucleic acids encoding immune modulators or the immune modulators themselves.
These immune modulators include granulocyte macrophage colony stimulating factor (GM-CSF) and CD86.
The vaccines of the present invention may be used to confer resistance to Enterococcal infection by either passive or active immunization. When the vaccines of the present invention are used to confer resistance to Enterococcal infection through active immunization, a vaccine of the present invention is administered to an animal to elicit a protective immune response which either prevents or attenuates a Enterococcal infection. When the vaccines of the present invention are used to confer resistance to Enterococcal infection through passive immunization, the vaccine is provided to a host animal human, dog, or mouse), and the antisera elicited by this antisera is recovered and directly provided to a recipient suspected of having an infection caused S. by a member of the Enterococcus genus.
The ability to label antibodies, or fragments of antibodies, with toxin molecules provides an additional method for treating Enterococcal infections when passive immunization is conducted. In this embodiment, antibodies, or fragments of antibodies, capable of recognizing the E. faecalispolypeptides disclosed herein, or fragments thereof, as well as other Enterococcus proteins, are labeled with toxin molecules prior to their administration to the patient. When such toxin derivatized antibodies bind to Enterococcus cells, toxin moieties will be localized to these cells and 20 will cause their death.
The present invention thus concerns and provides a means for preventing or attenuating a Enterococcal infection resulting from organisms which have antigens that are recognized and bound by antisera produced in response to the polypeptides of the present invention. As used herein, a vaccine is said to prevent or attenuate a disease if its administration to an animal results either in the total or partial attenuation suppression) of a symptom or condition of the disease, or in the total or partial immunity of the animal to the disease.
The administration of the vaccine (or the antisera which it elicits) may be for either a "prophylactic" or "therapeutic" purpose. When provided prophylactically, the compound(s) are provided in advance of any symptoms of Enterococcal infection.
The prophylactic administration of the compound(s) serves to prevent or attenuate any subsequent infection. When provided therapeutically, the compound(s) is provided upon or after the detection of symptoms which indicate that an animal may be infected with a member of the Enterococcus genus. The therapeutic administration of the compound(s) serves to attenuate any actual infection. Thus, the E. faecalis polypeptides, and fragments thereof, of the present invention may be provided either prior to the onset of infection (so as to prevent or attenuate an anticipated infection) or after the initiation of an actual infection.
The polypeptides of the invention, whether encoding a portion of a native protein or a functional derivative thereof, may be administered in pure form or may be coupled to a macromolecular carrier. Example of such carriers are proteins and carbohydrates. Suitable proteins which may act as macromolecular carrier for enhancing the immunogenicity of the polypeptides of the present invention include keyhole limpet hemacyanin (KLH) tetanus toxoid, pertussis toxin, bovine serum albumin, and ovalbumin. Methods for coupling the polypeptides of the present invention to such macromolecular carriers are disclosed in Harlow et al., ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory Press, 2nd ed. 1988).
20 A composition is said to be "pharmacologically or physiologically acceptable" if its administration can be tolerated by a recipient animal and is otherwise suitable for administration to that animal. Such an agent is said to be administered in a "therapeutically effective amount" if the amount administered is physiologically significant. An agent is physiologically significant if its presence results in a detectable change in the physiology of a recipient patient.
While in all instances the vaccine of the present invention is administered as a pharmacologically acceptable compound, one skilled in the art would recognize that the composition of a pharmacologically acceptable compound varies with the animal to which it is administered. For example, a vaccine intended for human use will generally not be co-administered with Freund's adjuvant. Further, the level of purity of the E. faecalis polypeptides of the present invention will normally be higher when administered to a human than when administered to a non-human animal.
As would be understood by one of ordinary skill in the art, when the vaccine of the present invention is provided to an animal, it may be in a composition which may contain salts, buffers, adjuvants, or other substances which are desirable for improving the efficacy of the composition. Adjuvants are substances that can be used to specifically augment a specific immune response. These substances generally perform two functions: they protect the antigen(s) frdm being rapidly catabolized after administration and they nonspecifically stimulate immune responses.
Normally, the adjuvant and the composition are mixed prior to presentation to the immune system, or presented separately, but into the same site of the animal being immunized. Adjuvants can be loosely divided into several groups based upon their composition. These groups include oil adjuvants (for example, Freund's complete and 15 incomplete), mineral salts (for example, AIK(S0 4 2 AINa(SO4) 2
AINH
4 (SO4), silica, kaolin, and carbon), polynucleotides (for example, poly IC and poly AU acids), and certain natural substances (for example, wax D from Mycobacterium tuberculosis, as well as substances found in Corynebacterium parvum, or Bordetella pertussis, and members of the genus Brucella. Other substances useful as adjuvants are the saponins 20 such as, for example, Quil A. (Superfos A/S, Denmark). Preferred adjuvants for use in the present invention include aluminum salts, such as AIK(SO 4 2 AINa(SO 4 2 and
AINH
4 (S0 4 Examples of materials suitable for use in vaccine compositions are provided in REMINGTON'S PHARMACEUTICAL SCIENCES 1324-1341 (A.
Osol, ed, Mack Publishing Co, Easton, PA, (1980) (incorporated herein by reference).
The therapeutic compositions of the present invention can be administered parenterally by injection, rapid infusion, nasopharyngeal absorption (intranasopharangeally), dermoabsorption, or orally. The compositions may alternatively be administered intramuscularly, or intravenously. Compositions for parenteral administration include sterile aqueous or non-aqueous solutions, suspensions, and emulsions. Examples of non-aqueous solvents are propylene glycol, polyethylene glycol, vegetable oils such as olive oil, and injectable organic esters such as ethyl oleate. Carriers or occlusive dressings can be used to increase skin permeability and enhance antigen absorption. Liquid dosage forms for oral administration may generally comprise a liposome solution containing the liquid dosage form. Suitable forms for suspending liposomes include emulsions, suspensions, solutions, syrups, and elixirs containing inert diluents commonly used in the art, such as purified water. Besides the inert diluents, such compositions can also include adjuvants, wetting agents, emulsifying and suspending agents, or sweetening, I o flavoring, or perfuming agents.
Therapeutic compositions of the present invention can also be administered in encapsulated form. For example, intranasal immunization using vaccines encapsulated in biodegradable microsphere composed of poly(DL-lactide-co-glycolide). See, Shahin, R. et al. (1995) Infect. Immun. 63:1195-1200. Similarly, orally administered encapsulated Salmonella typhimurium antigens can also be used. Allaoui-Attarki, K.
et al. (1997) Infect. Immun. 65:853-857. Encapsulated vaccines of the present invention can be administered by a variety of routes including those involving contacting the vaccine with mucous membranes intranasally, intracolonicly, Sintraduodenally).
S 20 Many different techniques exist for the timing of the immunizations when a multiple administration regimen is utilized. It is possible to use the compositions of the invention more than once to increase the levels and diversities of expression of the immunoglobulin repertoire expressed by the immunized animal. Typically, if multiple immunizations are given, they will be given one to two months apart.
According to the present invention, an "effective amount" of a therapeutic composition is one which is sufficient to achieve a desired biological effect. Generally, the dosage needed to provide an effective amount of the composition will vary depending upon such factors as the animal's or human's age, condition, sex, and extent of disease, if any, and other variables which can be adjusted by one of ordinary skill in the art.
The antigenic preparations of the invention can be administered by either single or multiple dosages of an effective amount. Effective amounts of the compositions of the invention can vary from 0.01-1,000 ig/ml per dose, more preferably 0.1-500 p.g/ml per dose, and most preferably 10-300 gg/ml per dose.
Examples Example 1: Isolation of a Selected DNA Clone From the Deposited Sample ofE.
faecalis Three approaches can be used to isolate a E.faecalis clone comprising a polynucleotide of the present invention from any E. faecalis genomic DNA library.
The E.faecalis strain V586 has been deposited as a convienent source for obtaining a E. faecalis strain although a wide varity of strains E. faecalis strains can be used which are known in the art.
E. faecalis genomic DNA is prepared using the following method. A overnight bacterial culture grown in a rich medium Trypticase Soy Broth, Brain Heart Infusion broth or Super broth), pelleted, washed two times with TES Tris-pH 8.0, 25mM EDTA, 50mM NaCI), and resuspended in 5ml high salt TES NaCI). Lysostaphin is added to final concentration of approx 50ug/ml and the 20 mixture is rotated slowly 1 hour at 37C to make protoplast cells. The solution is then placed in incubator (or place in a shaking water bath) and warmed to 55C. Five hundred micro liter of 20% sarcosyl in TES (final concentration is then added to lyse the cells. Next, guanidine HCI is added to a final concentration of 7M (3.69g in ml). The mixture is swirled slowly at 55C for 60-90 min (solution should clear).
A CsCI gradient is then set up in SW41 ultra clear tubes using 2.0ml 5.7M CsCI and overlaying with 2.85M CsCI. The gradient is carefully overlayed with the DNAcontaining GuHCI solution. The gradient is spun at 30,000 rpm, 20C for 24 hr and the lower DNA band is collected. The volume is increased to 5 ml with TE buffer.
The DNA is then treated with protease K (10 ug/ml) overnight at 37 C, and precipitated with ethanol. The precipitated DNA is resuspended in a desired buffer.
In the first method, a plasmid is directly isolated by screening a plasmid E.
faecalis genomic DNA library using a polynucleotide probe corresponding to a polynucleotide of the present invention. Particularly, a specific polynucleotide with 30-40 nucleotides is synthesized using an Applied Biosystems DNA synthesizer according to the sequence reported. The oligonucleotide is labeled, for instance, with 32P-y-ATP using T4 polynucleotide kinase and purified according to routine methods.
(See, Maniatis et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring, NY (1982).) The library is transformed into a suitable host, as indicated above (such as XL-I Blue (Stratagene)) using techniques known to those of skill in the art. See, Sambrook et al. MOLECULAR CLONING: A LABORATORY MANUAL (Cold Spring Harbor, N.Y. 2nd ed. 1989); Ausubel et al., CURRENT PROTOCALS IN MOLECULAR BIOLOGY (John Wiley and Sons, N.Y. 1989). The transformants are plated on 1.5% agar plates (containing the 15 appropriate selection agent, ampicillin) to a density of about 150 transformants (colonies) per plate. These plates are screened using Nylon membranes according to routine methods for bacterial colony screening. See, Sambrook et al.
MOLECULAR CLONING: A LABORATORY MANUAL (Cold Spring Harbor, N.Y. 2nd ed. 1989); Ausubel et al., CURRENT PROTOCALS IN MOLECULAR 20 BIOLOGY (John Wiley and Sons, N.Y. 1989) or other techniques known to those of skill in the art.
Alternatively, two primers of 15-25 nucleotides derived from the 5' and 3' ends of a polynucleotide of Table 1 are synthesized and used to amplify the desired DNA by PCR using a E. faecalis genomic DNA prep as a template. PCR is carried out under routine conditions, for instance, in 25 tl of reaction mixture with 0.5 ug of the above DNA template. A convenient reaction mixture is 1.5-5 mM MgCI 2 0.01% gelatin, 20 pM each of dATP, dCTP, dGTP, dTTP, 25 pmol of each primer and 0.25 Unit of Taq polymerase. Thirty five cycles of PCR (denaturation at 94C for 1 min; annealing at 55 0 C for 1 min; elongation at 72 0 C for 1 min) are performed with a Perkin-Elmer Cetus automated thermal cycler. The amplified product is analyzed by agarose gel electrophoresis and the DNA. band with expected molecular weight is excised and purified. The PCR product is verified to be the selected sequence by subcloning and sequencing the DNA product.
Finally, overlapping oligos of the DNA sequences of Table 1 can be chemically synthesized and used to generate a nucleotide sequence of desired length using PCR methods known in the art.
Example Expression and Purification Enterococcal polypeptides in E. coli The bacterial expression vector pQE60 was used for bacterial expression of some of the polypeptide fragements used in the soft tissue and systemic infection models discussed below. (Q1AGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 91311). pQE60 encodes ampicillin antibiotic resistance ("Ampr") and contains a bacterial origin of replication an IPTG inducible promoter, a ribosome binding 15 site six codons encoding histidine residues that allow affinity purification using nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin (QIAGEN, Inc., supra) and suitable single restriction enzyme cleavage sites. These elements are arranged such that an inserted DNA fragment encoding a polypeptide expresses that polypeptide with the six His residues a "6 X His tag") covalently linked to the carboxyl terminus of that polypeptide.
The DNA sequence encoding the desired portion of a E. faecalis protein of the present invention was amplified from E. faecalis genomic DNA using PCR oligonucleotide primers which anneal to the 5' and 3' sequences coding for the portions of the E. faecalis polynucleotide shown in Table 1. Additional nucleotides containing restriction sites to facilitate cloning in the pQE60 vector are added to the and 3' sequences, respectively.
For cloning the mature protein, the 5' primer has a sequence containing an appropriate restriction site followed by nucleotides of the amino terminal coding sequence of the desired E. faecalis polynucleotide sequence in Table 1. One of ordinary skill in the art would appreciate that the point in the protein coding sequence where the 5' and 3' primers begin may be varied to amplify a DNA segment encoding any desired portion of the complete protein shorter or longer than the mature form.
The 3' primer has a sequence containing an appropriate restriction site followed by nucleotides complementary to the 3' end of the polypeptide coding sequence of Table 1, excluding a stop codon, with the coding sequence aligned with the restriction site so as to maintain its reading frame with that of the six His codons in the pQE60 vector.
The amplified E. faecalis DNA fragment and the vector pQE60 were digested with restriction enzymes which recognize the sites in the primers and the digested DNAs were then ligated together. The E. faecalis DNA was inserted into the restricted pQE60 vector in a manner which places the E.faecalis protein coding region downstream from the IPTG-inducible promoter and in-frame with an initiating AUG and the six histidine codons.
SThe ligation mixture was transformed into competent E. coli cells using 15 standard procedures such as those described by Sambrook et al., supra.. E. coli strain M 15/rep4, containing multiple copies of the plasmid pREP4, which expresses the lac repressor and confers kanamycin resistance was used in carrying out the illustrative example described herein. This strain, which was only one of many that are suitable for expressing a E.faecalis polypeptide, is available commercially 20 (QIAGEN, Inc., supra). Transformants were identified by their ability to grow on LB agar plates in the presence of ampicillin and kanamycin. Plasmid DNA was isolated from resistant colonies and the identity of the cloned DNA confirmed by restriction analysis, PCR and DNA sequencing.
Clones containing the desired constructs were grown overnight in liquid culture in LB media supplemented with both ampicillin (100 p.g/ml) and kanamycin (25 p.g/ml). The O/N culture was used to inoculate a large culture, at a dilution of approximately 1:25 to 1:250. The cells were grown to an optical density at 600 nm ("OD600") of between 0.4 and 0.6. Isopropyl-p-D-thiogalactopyranoside ("IPTG") was then added to a final concentration of 1 mM to induce transcription from the lac repressor sensitive promoter, by inactivating the lad repressor. Cells subsequently were incubated further for 3 to 4 hours. Cells then were harvested by centrifugation.
The cells were then stirred for 3-4 hours at 4 0 C in 6M guanidine-HCl, pH 8.
The cell debris was removed by centrifugation, and the supernatant containing the E.
faecalis polypeptide was loaded onto a nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin column (QIAGEN, Inc., supra). Proteins with a 6 x His tag bind to the Ni-NTA resin with high affinity were purified in a simple one-step procedure (for details see: The QlAexpressionist, 1995, QIAGEN, Inc., supra). Briefly the supernatant was loaded onto the column in 6 M guanidine-HC1, pH 8, the column was first washed with 10 volumes of 6 M guanidine-HCl, pH 8, then washed with volumes of 6 M guanidine-HCI pH 6, and finally the E. faecalis polypeptide was eluted with 6 M guanidine-HCI, pH The purified protein was then renatured by dialyzing it against 15s phosphate-buffered saline (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM NaCI. Alternatively, the protein could be successfully refolded while immobilized on the Ni-NTA column. The recommended conditions are as follows: renature using a linear 6M-1M urea gradient in 500 mM NaCI, 20% glycerol, 20 mM Tris/HC1 pH 7.4, containing protease inhibitors. The renaturation should be performed over a period of 20 1.5 hours or more. After renaturation the proteins can be eluted by the addition of 250 mM immidazole. Immidazole was removed by a final dialyzing step against PBS or 50 mM sodium acetate pH 6 buffer plus 200 mM NaCI. The purified protein was stored at 40 C or frozen at -80° C.
Some of the polypeptide of the present invention were prepared using a nondenaturing protein purification method. For these polypeptides, the cell pellet from each liter of culture was resuspended in 25 mls of Lysis Buffer A at 4 0 C (Lysis Buffer A 50 mM Na-phosphate, 300 mM NaCI, 10 mM 2-mercaptoethanol, Glycerol, pH 7.5 with 1 tablet of Complete EDTA-free protease inhibitor cocktail (Boehringer Mannheim #1873580) per 50 ml of buffer). Absorbance at 550 nm was approximately 10-20 O.D./ml. The suspension was then put through three freeze/thaw cycles from -70 0 C (using a ethanol-dry ice bath) up to room temperature.
The cells were lysed via sonication in short 10 sec bursts over 3 minutes at approximately 80W while kept on ice. The sonicated sample was then centrifuged at 15,000 RPM for 30 minutes at 4 0 C. The supernatant was passed through a column containing 1.0 ml of CL-4B resin to pre-clear the sample of any proteins that may bind to agarose non-specifically, and the flow-through fraction was collected.
The pre-cleared flow-through was applied to a nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin column (Quiagen, Inc., supra). Proteins with a 6 X His tag to bind to the Ni-NTA resin with high affinity and can be purified in a simple one-step procedure. Briefly, the supernatant was loaded onto the column in Lysis Buffer A at 4°C, the column was first washed with 10 volumes of Lysis Buffer A until the A280 of the eluate returns to the baseline. Then, the column was washed with 5 volumes of 40 mM Imidazole (92% Lysis Buffer A 8% Buffer B) (Buffer B 50 mM Na- 15 Phosphate, 300 mM NaCI, 10% Glycerol, 10 mM 2-mercaptoethanol, 500 mM Imidazole, pH of the final buffer should be The protein was eluted off of the column with a series of increasing Imidazole solutions made by adjusting the ratios of Lysis Buffer A to Buffer B. Three different concentrations were used: 3 volumes of mM Imidazole, 3 volumes of 150 mM Imidazole, 5 volumes of 500 mM 20 Imidazole. The fractions containing the purified protein were analyzed using 8 or 14% SDS-PAGE depending on the protein size. The purified protein was then dialyzed 2X against phosphate-buffered saline (PBS) in order to place it into an easily workable buffer. The purified protein was stored at 40 C or frozen at The following alternative method may be used to purify E. faecalis expressed in E coli when it is present in the form of inclusion bodies. Unless otherwise specified, all of the following steps are conducted at 4-10°C.
Upon completion of the production phase of the E. coli fermentation, the cell culture is cooled to 4-10°C and the cells are harvested by continuous centrifugation at 15,000 rpm (Heraeus Sepatech). On the basis of the expected yield of protein per unit weight of cell paste and the amount of purified protein required, an appropriate amount of cell paste, by weight, is suspended in a buffer solution containing 100 mM Tris, 50 mM EDTA, pH 7.4. The cells are dispersed to a homogeneous suspension using a high shear mixer.
The cells are then lysed by passing the solution through a microfluidizer (Microfuidics, Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate is then mixed with NaCI solution to a final concentration of 0.5 M NaCI, followed by centrifugation at 7000 x g for 15 min. The resultant pellet is washed again using NaCI, 100 mM Tris, 50 mM EDTA, pH 7.4.
to The resulting washed inclusion bodies are solubilized with 1.5 M guanidine hydrochloride (GuHCI) for 2-4 hours. After 7000 x g centrifugation for 15 min., the pellet is discarded and the E.faecalis polypeptide-containing supernatant is incubated at 4 0 C overnight to allow further GuHCI extraction.
Following high speed centrifugation (30,000 x g) to remove insoluble particles, 15 the GuHCl solubilized protein is refolded by quickly mixing the GuHCI extract with volumes of buffer containing 50 mM sodium, pH 4.5, 150 mM NaCI, 2 mM EDTA by vigorous stirring. The refolded diluted protein solution is kept at 4 C without mixing for 12 hours prior to further purification steps.
To clarify the refolded E. faecalis polypeptide solution, a previously prepared tangential filtration unit equipped with 0.16 lm membrane filter with appropriate surface area Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is S: employed. The filtered sample is loaded onto a cation exchange resin Poros HS- Perseptive Biosystems). The column is washed with 40 mM sodium acetate, pH and eluted with 250 mM, 500 mM, 1000 mM, and 1500 mM NaCI in the same buffer, in a stepwise manner. The absorbance at 280 mm of the effluent is continuously monitored. Fractions are collected and further analyzed by SDS-PAGE.
Fractions containing the E.faecalis polypeptide are then pooled and mixed with 4 volumes of water. The diluted sample is then loaded onto a previously prepared set of tandem columns of strong anion (Poros HQ-50, Perseptive Biosystems) and weak anion (Poros CM-20, Perseptive Biosystems) exchange resins.
The columns are equilibrated with 40 mM sodium acetate, pH 6.0. Both columns are washed with 40 mM sodium acetate, pH 6.0, 200 mM NaC1. The CM-20 column is then eluted using a 10 column volume linear gradient ranging from 0.2 M NaCI, mM sodium acetate, pH 6.0 to 1.0 M NaCI, 50 mM sodium acetate, pH Fractions are collected under constant A 28 monitoring of the effluent. Fractions containing the E. faecalis polypeptide (determined, for instance, by 16% SDS-PAGE) are then pooled.
The resultant E. faecalis polypeptide exhibits greater than 95% purity after the above refolding and purification steps. No major contaminant bands are observed from Commassie blue stained 16% SDS-PAGE gel when 5 Ig of purified protein is loaded. The purified protein is also tested for endotoxin/LPS contamination, and typically the LPS content is less than 0.1 ng/ml according to LAL assays.
S 15 Example Alernative Expression and Purification Enterococcalpolypeptides in E.
coli Tthe vector pQE10 was alternatively used to clone and express some of the polypeptides of the present invention for use in the soft tissue and systemic infection models discussed below. The difference being such that an inserted DNA fragment 20 encoding a polypeptide expresses that polypeptide with the six His residues a "6 X His tag") covalently linked to the amino terminus of that polypeptide. The bacterial s* expression vector pQE10 (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 91311) was used in this example. The components of the pQEO1 plasmid are arranged such that the inserted DNA sequence encoding a polypeptide of the present invention expresses the polypeptide with the six His residues a "6 X His tag")) covalently linked to the amino terminus.
The DNA sequences encoding the desired portions of a polypeptide of Table 1 were amplified using PCR oligonucleotide primers from genomic E. faecalis DNA.
The PCR primers anneal to the nucleotide sequences encoding the desired amino acid sequence of a polypeptide of the present invention. Additional nucleotides containing restriction sites to facilitate cloning in the pQE 10 vector were added to the 5' and 3' primer sequences, respectively.
For cloning a polypeptide of the present invention, the 5' and 3' primers were selected to amplify their respective nucleotide coding sequences. One of ordinary skill in the art would appreciate that the point in the protein coding sequence where the and 3' primers begins may be varied to amplify a DNA segment encoding any desired portion of a polypeptide of the present invention. The 5' primer was designed so the coding sequence of the 6 X His tag is aligned with the restriction site so as to maintain its reading frame with that of E. faecalis polypeptide. The 3' was designed to include an stop codon. The amplified DNA fragment was then cloned, and the protein expressed, as described above for the pQE60 plasmid.
The DNA sequences encoding the amino acid sequences of Table 1 may also be cloned and expressed as fusion proteins by a protocol similar to that described *15 directly above, wherein the pET-32b(+) vector (Novagen, 601 Science Drive, Madison, WI 53711) is preferentially used in place ofpQElO.
.The above methods are not limited to the polypeptide fragements actually produced. The above method, like the methods below, can be used to produce either full length polypeptides or desired fragements therof.
Example Alternative Expression and Purification of Enterococcal polypeptides in E. coli The bacterial expression vector pQE60 is used for bacterial expression in this example (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 91311). However, in this example, the polypeptide coding sequence is inserted such that translation of the six His codons is prevented and, therefore, the polypeptide is produced with no 6 X His tag.
The DNA sequence encoding the desired portion of the E. faecalis amino acid sequence is amplified from an E. faecalis genomic DNA prep the deposited DNA clones using PCR oligonucleotide primers which anneal to the 5' and 3' nucleotide sequences corresponding to the desired portion of the E. faecalis polypeptides.
Additional nucleotides containing restriction sites to facilitate cloning in the vector are added to the 5' and 3' primer sequences.
For cloning a E. faecalis polypeptides of the present invention, 5' and 3' primers are selected to amplify their respective nucleotide coding sequences. One of ordinary skill in the art would appreciate that the point in the protein coding sequence where the 5' and 3' primers begin may be varied to amplify a DNA segment encoding any desired portion of a polypeptide of the present invention. The 3' and 5' primers contain appropriate restriction sites followed by nucleotides complementary to the and 3' ends of the coding sequence respectively. The 3' primer is additionally designed to include an in-frame stop codon.
The amplified E. faecalis DNA fragments and the vector pQE60 are digested with restriction enzymes recognizing the sites in the primers and the digested DNAs 15 are then ligated together. Insertion of the E.faecalis DNA into the restricted vector places the E. faecalis protein coding region including its associated stop codon downstream from the IPTG-inducible promoter and in-frame with an initiating AUG.
The associated stop codon prevents translation of the six histidine codons downstream of the insertion point.
The ligation mixture is transformed into competent E. coli cells using standard procedures such as those described by Sambrook et al. E. coli strain M15/rep4, containing multiple copies of the plasmid pREP4, which expresses the lac repressor *o and confers kanamycin resistance is used in carrying out the illustrative example described herein. This strain, which is only one of many that are suitable for expressing E. faecalis polypeptide, is available commercially (QIAGEN, Inc., supra).
Transformants are identified by their ability to grow on LB plates in the presence of ampicillin and kanamycin. Plasmid DNA is isolated from resistant colonies and the identity of the cloned DNA confirmed by restriction analysis, PCR and DNA sequencing.
Clones containing the desired constructs are grown overnight in liquid culture in LB media supplemented with both ampicillin (100 p.g/ml) and kanamycin The O/N culture is used to inoculate a large culture, at a dilution of approximately 1:25 to 1:250. The cells are grown to an optical density at 600 nm ("OD600") of between 0.4 and 0.6. isopropyl-b-D-thiogalactopyranoside ("IPTG") is then added to a final concentration of 1 mM to induce transcription from the lac repressor sensitive promoter, by inactivating the lad repressor. Cells subsequently are incubated further for 3 to 4 hours. Cells then are harvested by centrifugation.
To purify the E. faecalis polypeptide, the cells are then stirred for 3-4 hours at 4 0 C in 6M guanidine-HC1, pH 8. The cell debris is removed by centrifugation, and the supernatant containing the E. faecalis polypeptide is dialyzed against 50 mM Naacetate buffer pH 6, supplemented with 200 mM NaCI. Alternatively, the protein can be successfully refolded by dialyzing it against 500 mM NaCI, 20% glycerol, mM Tris/HCl pH 7.4, containing protease inhibitors. After renaturation the protein 15 can be purified by ion exchange, hydrophobic interaction and size exclusion chromatography. Alternatively, an affinity chromatography step such as an antibody column can be used to obtain pure E. faecalis polypeptide. The purified protein is stored at 40 C or frozen at -80° C.
The following alternative method may be used to purify E. faecalis S 20 polypeptides expressed in E coli when it is present in the form of inclusion bodies.
Unless otherwise specified, all of the following steps are conducted at 4-10°C.
S. Upon completion of the production phase of the E. coli fermentation, the cell culture is cooled to 4-10 0 C and the cells are harvested by continuous centrifugation at 15,000 rpm (Heraeus Sepatech). On the basis of the expected yield of protein per unit weight of cell paste and the amount of purified protein required, an appropriate amount of cell paste, by weight, is suspended in a buffer solution containing 100 mM Tris, 50 mM EDTA, pH 7.4. The cells are dispersed to a homogeneous suspension using a high shear mixer.
The cells ware then lysed by passing the solution through a microfluidizer (Microfuidics, Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate is then mixed with NaCI solution to a final concentration of 0.5 M NaCI, followed by centrifugation at 7000 x g for 15 min. The resultant pellet is washed again using NaCI, 100 mMl Tris, 50 mM EDTA, pH 7.4.
The resulting washed inclusion bodies are solubilized with 1.5 M guanidine hydrochloride (GuHCI) for 2-4 hours. After 7000 x g centrifugation for 15 min., the pellet is discarded and the E.faecalis polypeptide-containing supernatant is incubated at 4 0 C overnight to allow further GuHCI extraction.
Following high speed centrifugation (30,000 x g) to remove insoluble particles, to the GuHCI solubilized protein is refolded by quickly mixing the GuHCI extract with volumes of buffer containing 50 mM sodium, pH 4.5, 150 mM NaCI, 2 mM EDTA by vigorous stirring. The refolded diluted protein solution is kept at 4°C without mixing for 12 hours prior to further purification steps.
To clarify the refolded E. faecalis polypeptide solution, a previously prepared S^ 15 tangential filtration unit equipped with 0.16 prm membrane filter with appropriate surface area Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is employed. The filtered sample is loaded onto a cation exchange resin Poros HS- Perseptive Biosystems). The column is washed with 40 mM sodium acetate, pH 6.0 and eluted with 250 mM, 500 mM, 1000 mM, and 1500 mM NaCI in the same buffer, in a stepwise manner. The absorbance at 280 mm of the effluent is continuously monitored. Fractions are collected and further analyzed by SDS-PAGE.
Fractions containing the E.faecalis polypeptide are then pooled and mixed with 4 volumes of water. The diluted sample is then loaded onto a previously prepared set of tandem columns of strong anion (Poros HQ-50, Perseptive Biosystems) and weak anion (Poros CM-20, Perseptive Biosystems) exchange resins.
The columns are equilibrated with 40 mM sodium acetate, pH 6.0. Both columns are washed with 40 mM sodium acetate, pH 6.0, 200 mM NaCI. The CM-20 column is then eluted using a 10 column volume linear gradient ranging from 0.2 M NaCI, mM sodium acetate, pH 6.0 to 1.0 M NaCI, 50 mM sodium acetate, pH Fractions are collected under constant Azso monitoring of the effluent. Fractions containing the E. faecalis polypeptide (determined, for instance, by 16% SDS-PAGE) are then pooled.
The resultant E. faecalis polypeptide exhibits greater than 95% purity after the above refolding and purification steps. No major contaminant bands are observed from Commassie blue stained 16% SDS-PAGE gel when 5 p.g of purified protein is loaded. The purified protein is also tested for endotoxin/LPS contamination, and typically the LPS content is less than 0.1 ng/ml according to LAL assays.
o1 Example Cloning and Expression of E. faecalis in Other Bacteria E. faecalis polypeptides can also be produced in: E.faecalis using the methods of S. Skinner et al., (1988) Mol. Microbiol. 2:289-297 or J. 1. Moreno (1996) Protein Expr. Purif. 8(3):332-340; Lactobacillus using the methods of C. Rush et al., 1997 Appl. Microbiol. Biotechnol. 47(5):537-542; or in Bacillus subtilis using the methods 15 Chang et al., U.S. Patent No. 4,952,508.
Example 3: Cloning and Expression in COS Cells A E. faecalis expression plasmid is made by cloning a portion of the DNA encoding a E. faecalis polypeptide into the expression vector pDNAI/Amp or pDNAIIl (which can be obtained from Invitrogen, Inc.). The expression vector pDNAI/amp contains: an E. coli origin of replication effective for propagation in E. coli and other prokaryotic cells; an ampicillin resistance gene for selection of plasmid-containing prokaryotic cells; an SV40 origin of replication for propagation in eukaryotic cells; a CMV promoter, a polylinker, an SV40 intron; several codons encoding a hemagglutinin fragment an "HA" tag to facilitate purification) followed by a termination codon and polyadenylation signal arranged so that a DNA can be conveniently placed under expression control of the CMV promoter and operably linked to the SV40 intron and the polyadenylation signal by means of restriction sites in the polylinker. The HA tag corresponds to an epitope derived from the influenza hemagglutinin protein described by Wilson et al. 1984 Cell 37:767.
The fusion of the HA tag to the target protein allows easy detection and recovery of the rccombinant protein with an antibody that recognizes the HA epitope. pDNAIll contains, in addition, the selectable neomycin marker.
A DNA fragment encoding a E. faecalis polypeptide is cloned into the polylinker region of the vector so that recombinant protein expression is directed by the CMV promoter. The plasmid construction strategy is as follows. The DNA from a E. faecalis genomic DNA prep is amplified using primers that contain convenient restriction sites, much as described above for construction of vectors for expression of E. faecalis in E. coli. The 5' primer contains a Kozak sequence, an AUG start codon, and nucleotides of the 5' coding region of the E. faecalis polypeptide. The 3' primer, contains nucleotides complementary to the 3' coding sequence of the E. faecalis DNA, a stop codon, and a convenient restriction site.
The PCR amplified DNA fragment and the vector, pDNAI/Amp, are digested with appropriate restriction enzymes and then ligated. The ligation mixture is transformed into an appropriate E. coli strain such as SURET (Stratagene Cloning Systems, La Jolla, CA 92037), and the transformed culture is plated on ampicillin media plates which then are incubated to allow growth of ampicillin resistant colonies.
Plasmid DNA is isolated from resistant colonies and examined by restriction analysis or other means for the presence of the fragment encoding the E. faecalis polypeptide For expression of a recombinant E. faecalis polypeptide, COS cells are transfected with an expression vector, as described above, using DEAE-dextran, as described, for instance, by Sambrook et al. (supra). Cells are incubated under conditions for expression of E. faecalis by the vector.
Expression of the E. faecalis-HA fusion protein is detected by radiolabeling and immunoprecipitation, using methods described in, for example Harlow et al., supra.. To this end, two days after transfection, the cells are labeled by incubation in media containing "S-cysteine for 8 hours. The cells and the media are collected, and the cells are washed and the lysed with detergent-containing RIPA buffer: 150 mM NaCI, 1% NP-40, 0.1% SDS, 1% NP-40, 0.5% DOC, 50 mM TRIS, pH 7.5, as described by Wilson et al. (supra Proteins are precipitated from the cell lysate and from the culture media using an HA-specific monoclonal antibody. The precipitated proteins then are analyzed by SDS-PAGE and autoradiography. An expression product of the expected size is seen in the cell lysate, which is not seen in negative controls.
Example 4. Cloning and Expression in CHO Cells The vector pC4 is used for the expression of E. faecalis polypeptide in this example. Plasmid pC 4 is a derivative of the plasmid pSV2-dhfr (ATCC Accession No. 37146). The plasmid contains the mouse DHFR gene under control of the early promoter. Chinese hamster ovary cells or other cells lacking dihydrofolate activity that are transfected with these plasmids can be selected by growing the cells in a selective medium (alpha minus MEM, Life Technologies) supplemented with the 15 chemotherapeutic agent methotrexate. The amplification of the DHFR genes in cells resistant to methotrexate (MTX) has been well documented. See, Alt et al., 1978, J. Biol. Chem. 253:1357-1370; Hamlin et al., 1990, Biochem. et Biophys. Acta, 1097:107-143; Page et al., 1991, Biotechnology 9:64-68. Cells grown in increasing concentrations of MTX develop resistance to the drug by overproducing the target enzyme, DHFR, as a result of amplification of the DHFR gene. If a second gene is linked to the DHFR gene, it is usually co-amplified and over-expressed. It is known in the art that this approach may be used to develop cell lines carrying more than S1,000 copies of the amplified gene(s). Subsequently, when the methotrexate is withdrawn, cell lines are obtained which contain the amplified gene integrated into one or more chromosome(s) of the host cell.
Plasmid pC4 contains the strong promoter of the long terminal repeat (LTR) of the Rouse Sarcoma Virus, for expressing a polypeptide of interest, Cullen, et al.
(1985) Mol. Cell. Biol. 5:438-447; plus a fragment isolated from the enhancer of the immediate early gene of human cytomegalovirus (CMV), Boshart, et al., 1985, Cell 41:521-530. Downstream of the promoter are the following single restriction enzyme cleavage sites that allow the integration of the genes: Barn HI, Xba 1, and Asp 718.
Behind these cloning sites the plasmid contains the 3' intron and polyadenylation site of the rat preproinsulin gene. Other high efficiency promoters can also be used for the expression, the human 1-actin promoter, the SV40 early or late promoters or the long terminal repeats from other retroviruses, HIV and HTLVI. Clontech's Tet- Off and Tet-On gene expression systems and similar systems can be used to express the E. faecalis polypeptide in a regulated way in mammalian cells (Gossen et al., 1992, Proc. Natl. Acad. Sci. USA 89:5547-5551. For the polyadenylation of the mRNA other signals, from the human growth hormone or globin genes can be used as well. Stable cell lines carrying a gene of interest integrated into the chromosomes can also be selected upon co-transfection with a selectable marker such as gpt, G418 or hygromycin. It is advantageous to use more than one selectable marker in the beginning, G418 plus methotrexate.
15 The plasmid pC4 is digested with the restriction enzymes and then dephosphorylated using calf intestinal phosphates by procedures known in the art.
The vector is then isolated from a 1% agarose gel. The DNA sequence encoding the E.
faecalis polypeptide is amplified using PCR oligonucleotide primers corresponding to the 5' and 3' sequences of the desired portion of the gene. A 5' primer containing a 20 restriction site, a Kozak sequence, an AUG start codon, and nucleotides of the coding region of the E.faecalis polypeptide is synthesized and used. A 3' primer, containing a restriction site, stop codon, and nucleotides complementary to the 3' coding sequence of the E. faecalis polypeptides is synthesized and used. The amplified fragment is digested with the restriction endonucleases and then purified again on a 1% agarose gel. The isolated fragment and the dephosphorylated vector are then ligated with T4 DNA ligase. E. coli HB 101 or XL-1 Blue cells are then transformed and bacteria are identified that contain the fragment inserted into plasmid pC4 using, for instance, restriction enzyme analysis.
Chinese hamster ovary cells lacking an active DHFR gene are used for transfection. Five p.g of the expression plasmid pC4 is cotransfected with 0.5 j.g of the plasmid pSVneo using a lipid-mediated transfection agent such as Lipofectin T or LipofectAMINE.
T
r (LifeTechnologies Gaithersburg, MD). The plasmid pSV2-neo contains a dominant selectable marker, the neo gene from Tn5 encoding an enzyme that confers resistance to a group of antibiotics including G418. The cells are seeded in alpha minus MEM supplemented with 1 mg/ml G41 8. After 2 days, the cells are trypsinized and seeded in hybridoma cloning plates (Greiner, Germany) in alpha minus MEM supplemented with 10, 25, or 50 ng/ml of methotrexate plus 1 mg/ml G418. After about 10-14 days single clones are trypsinized and then seeded in 6-well petri dishes or 10 ml flasks using different concentrations of methotrexate (50 nM, 100 nM, 200 nM, 400 nM, 800 nM). Clones growing at the highest concentrations of methotrexate are then transferred to new 6-well plates containing even higher concentrations of methotrexate (1 pM, 2 .tM, 5 p.M, 10 mM, 20 mM). The same procedure is repeated until clones are obtained which grow at a concentration of 15 100-200 ptM. Expression of the desired gene product is analyzed, for instance, by SDS-PAGE and Western blot or by reversed phase HPLC analysis.
Example 5. Quantitative Murine Soft Tissue Infection Model for E. faecalis Compositions of the present invention, including polypeptides and peptides, are assayed for their ability to function as vaccines or to enhance/stimulate an immune response to a bacterial species E. faecalis) using the following quantitative murine soft tissue infection model. Mice NIH Swiss female mice, approximately 7 weeks old) are first treated with a biologically protective effective amount, or immune enhancing/stimulating effective amount of a composition of the present invention using methods known in the art, such as those discussed above. See,e.g., Harlow et al., ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory Press, 2nd ed. 1988). An example of an appropriate starting dose is per animal.
The desired bacterial species used to challenge the mice, such as E. faecalis, is grown as an overnight culture. The culture is diluted to a concentration of 5 X 108 cfu/ml, in an appropriate media, mixed well, serially diluted, and titered. The desired doses are further diliuted 1:2 with sterilized Cytodex 3 microcarrier beads preswollen in sterile PBS (3g/100ml). Mice are anesthetize briefly until docile, but still mobile and injected with 0.2 ml of the Cytodex 3 bead/bacterial mixture into each animal subcutaneously in the inguinal region. After four days, counting the day of injection as day one, mice are sacrificed and the contents ofthe abscess is excised and placed in a 15 ml conical tube containing 1.0ml of sterile PBS. The contents of the abscess is then enzymatically treated and plated as follows.
The abscess is first disrupted by vortexing with sterilized glass beads placed in the tubes. 3.0mls of prepared enzyme mixture (1.0ml Collagenase D (4.0 mg/ml), 1.0ml Trypsin (6.0 mg/ml) and 8.0 mis PBS) is then added to each tube followed by a min. incubation at 37C. The solution is then centrifuged and the supernatant drawn off. 0.5 ml dH20 is then added and the tubes are vortexed and then incubated for 10 min. at room temperature. 0.5 ml media is then added and samples are serially i diluted and plated onto agar plates, and grown overnight at 37C. Plates with distinct and separate colonies are then counted, compared to positive and negative control samples, and quantified. The method can be used to identify composition and 20 determine appropriate and effective doses for humans and other animals by comparing the effective doses of compositions of the present invention with compositions known in the art to be effective in both mice and humans. Doses for the effective treatment of humans and other animals, using compositions of the present invention, are extrapolated using the data from the above experiments of mice. It is appreciated that further studies in humans and other animals may be needed to determine the most effective doses using methods of clinical practice known in the art.
Example 6: Murine Systemic Neutropenic Model for E. faecalis Infection Compositions of the present invention, including polypeptides and peptides, are assayed for their ability to function as vaccines or to enhance/stimulate an immune response to a bacterial species E.faecalis) using the following qualitative murine systemic neutropenic model. Mice N1H Swiss female mice, approximately 7 weeks old) are first treated with a biologically protective effective amount, or immune enhancing/stimulating effective amount of a composition of the present invention using methods known in the art, such as those discussed above. See,e.g., Harlow et al., ANTIBODIES: A LABORATORY IMANUAL, (Cold Spring Harbor Laboratory Press, 2nd ed. 1988). An example of an appropriate starting dose is 20ug per animal.
in Mice are then injected with 250 300 mg/kg cyclophosphamide intraperitonially.
Counting the day of C.P. injection as day one, the mice are left untreated for 5 days to begin recovery of PMNL'S.
The desired bacterial species used to challenge the mice, such as E. faecalis, is grown as an overnight culture. The culture is diluted to a concentration of 5 X 108 S 5 cfu/ml, in an appropriate media, mixed well, serially diluted, and titered. The desired doses are further diliuted 1:2 in 4% Brewer's yeast in media.
Mice are injected with the bacteria/brewer's yeast challenge intraperitonially. The Brewer's yeast solution alone is used as a control. The mice are then monitered twice daily for the first week following challenge, and once a day for the next week to ascertain morbidity and mortality. Mice remaining at the end of the experiment are sacrificed. The method can be used to identify compositions and determine appropriate and effective doses for humans and other animals by comparing the effective doses of compositions of the present invention with compositions known in the art to be effective in both mice and humans. Doses for the effective treatment of humans and other animals, using compositions of the present invention, are extrapolated using the data from the above experiments of mice. It is appreciated that further studies in humans and other animals may be needed to determine the most effective doses using methods of clinical practice known in the art.
The disclosure of all publications (including patents, patent applications, journal articles, laboratory manuals, books, or other documents) cited herein are hereby incorporated by reference in their entireties.
The present invention is not to be limited in scope by the specific embodiments described herein, which are intended as single illustrations of individual aspects of the invention. Functionally equivalent methods and components are within the scope of the invention, in addition to those shown and described herein and will become apparant to those skilled in the art from the foregoing description and accompanying drawings. Such modifications are intended to fall within the scope of the appended claims.
Throughout the specification, unless the context requires otherwise, the word "comprise" or variations such as "comprises" or "comprising", will be understood to imply the inclusion of a stated integer or group of integers but not the exclusion of any other integer or group of integers.
e 79 TABLE 1. Nucleotide and Amiho Acid Seqeuences of E. faecalis Genes.
EFOO1-1 (SEQ ID NQ:l)
TGAAAGAATA
AAAAAAATGA
TTGGGGGCVIT
AAGGAAGATA
ACAAAATTAA
CAATCTGCTT
TTACCAACAA
TTAGTGGACT
ATTCGTGAAG
AATAAATCGA
GTACCGAAAA
AAAGAAGTCG
AACAAAGGCG
GTGGACTATT
TATTIATCTG
GCTGGTT'rTG CCTrGAAAAAA
GAACAACGGA
TACTGGGCAC
TACAAAAATT
TTCGCTATCC
GAAAGTATTT
CAATTTGAAC
TTGCCAGAAC
AG 1TCAAAAC
GTGGTAACGG
CCACAATCAC
CGAAAGACT
ACCCTGATTT
TTACCCA.AC
TAAA.ACCATA
TATTGTTAGA
CAGAAATGTT
CATTAGAGGA
TTGGTGCTGG
TTGATTTTAA
ACCGTGATGG
GCCCATTTGC
TTCAAAAAGA
TCAACTTACA
CAGCGGCATT
AACAAACAGG
CTAAGACAAC
CAGTAGAAGA
TTGC-TTCATC
AAGCATGGAA
GTGGCGAGCA
TCTAGCAACA
TAATGGGGCC
TTrCTGGCAT
CATGAAAGAA
ACAAGCCAAA
GTACCCAGGC
TATGGATGAT
CGGCGCCAAA
ATTCTATAAT
ATTAAA.AGAA
rTIGACTCG
TAAAGACTTA
TATCGAAGCA
AAACAAAAAG
TGCTGAAGCT
ACAAGGAACA
TGAATTCATG
TTATATGCCA
CAAAGTACCT
AAATGCTGAT
AAATAAAGAC
CCAATAA
AATTOTI'TTA
ACAGTGTTAG
AAAGAATCAA
GCAATGAATG
AATCCAAAAA
ATCAATTCGA
TGGTTATGGA
GACACAATCG
ATCGACGGCA
G6TGATTGT
GCTTCTAAAA
TTAAATAACT
GATTAACAA
GGTTACTTCC
GTAGCAATGT
GGTGGCTATG
GATATTTATA
AA.ATTCTTAG
ATTFAGAAT
GCACAACTTG
TCAGCCTATA
ACGAGAAA-AT
TAAATTTTTT
CAACCGCAGC
ACGATATTGT
GGGTTCAAGA
TTAAAGTGGA
CTTTAACTTC
ATGCTGCACA
GCTGGAAAGA
AACAATACGG
TGAAAGAATA
CAATTTACrGA
ATTACGPAAT
GCAAAGATTC
GCACAGCT-GG
TI'GTCGGT2AG
AATACGGTGT
TGITCGATAG
CTACTCCTGA
CTGTTTTACA
AAAACGCAGT
ATGAAATGCG
TATTGAAAGA
TAAGGGAGAG
TATTCGCA
GAAAGAAGTG
AGAAGCGTTA
ATTACAAAAT
ACCAAAAGAT
AGATGAAATG
TrGCAGAGCCA
CATTCCATTT
TGGTGTTGAA
AAAATCCAAC
TGGAATGAA.A
ACAAGAAGTC
TTCAGATAAA
TATTGCTGGT
TGCACCACGT
TGCTACGCCA
TTCACAATTG
CAGTGATGAG
AAAAGATTTA
GACAATTATG
TGCAACATCA
EF001-2 (SEQ ID NO:2) MFXTLATT VLATAAIFALj GACGNGNGAK ESNDIVKEVK, a.
EDTTITFWHA
PTITQAYPGW
KSTEMLFYNA
KGVDFNKDLD
GFVQKDAE-AG
WAQQTG;YMP I S IFASSNKDT M4NGVQEEALT LWNAAQDE4L
DLLKEYGVEV
LTSKDSQEVV
GYEYGVAPRP
LESVLHSDEY
RKLLKDATSQ
KLTKDFI<EN
VDLKPYMDDD
PKTLEELKEA
DYYRDGIEAG
EKINLQQGTD
KNSKTTKVPA
FEQAWNQ
PK IKVELQNQ
TIGWKCDAEPI
SKTIYEKSNfl
YFRTAGSOKY
IYMFDSATPE
QLEN'AVKDLF
SAYPDLQAKI
REVLLDGAKI
EVVGAGFDSL
LSGPFANXKV
QRTAAFEFMfl
AIPVEENADS
NSTLTSPKDL
DGKQYGIPFN
NNY"YAIGMKN
AMFVGS IAGA
FLATPOSQLY
AYNEMRTIME
EF001-3 (SEQ ID NO:3) TT GTGGTAACGG TAATGGGGCC AAAGAATCAA ACGATATTGT GAAAG-AGTG
AAGGAAGATA
ACAAAMTTAA
CAATCTGCTT
TTACCAACAA
TTAGTGGACT
ATTCGTGAAG
AATAAATCGA
GTACCGAAAA
AAAGAAGTCG
AACAAAGGCG
GTGGACTATT
TArI-rATCTG
CGACAATCAC
CGAAAGACTT
ACCCTGATTT
TTACGCAAGC
TAAAACCATA
TATTGTTAGA
CAGAAATGTT
CATTAGAGGA
TTGGTGCTGG
TTGATTTTAA
ACCGTGATGG
CCCCATTTrGC
TTTCTGGCAT
CATGAAAGAA
ACMA.GCCAAA
GTACCCAGGC
TATGGATGAT
CGGCGCCAAA
ATTCTATAAT
ATTAAAAGAA
TrTTGACTCG
TAAAGACTTA
TATCGAAGCA
AAACAAAAAG
GCAATGAAG
AATCCAAAAA
ATCAATTCGA
TGGTITATIGGA
GACACAATCG
ATCGACGGCA
GCTGATTTGT
GCTTCTAAAA
TTAAATAACT
GATTTAZACAA
GGTTACITCC
GTAGCAATGT
GGGTTCA.AGA
TTAAAGTGGA
CTTAACTTC
ATGCTGCACA
GCTGGAAAGA
AACAATACGG
TGAAAGAATA
cAATTTACGA
ATTACCCAAT
GCAAAGATTC
GCACAGCTGG
TTGTCGGTAG
AGAAGCGTT1A
ATTACAAAAT
ACCAAAAGAT
AGATGAAATG
TGC-AGAGCCA
CATTCCATT
TGGTGTTGAA
AAAATCCAAC
TGGAATGAAA
ACAAGAAGTC
TTICAGATAAA
TATTGCTGGT
TABLE 1. Nucleofide and Amino Acid Seqeuences of E.f[aecalis Genes.
GCTGGTTTG
CCTGAAAAAA
GAACAACGGA
TACTCGGCAC
TACAAAAATT
TTCGCTATCC
GAAAGTATTT
CAATTTGAAC
TTCAAAA.AGA
TCAACTTACA
CAGCGGCATT
AACAAACAGG
CTAAGACAAC
CAGTAGAAGA
TTGCTTCATC
AAGCATGGA.A
TGCTGAAGCT
ACAAGGAACA
TGAATTICATG
TTATATGCCA
CAAAGTACCT
AAATGCTGAT
AAATAAAGAC
CCAA
GGTGGCTATC
GATATTATA
AAATTCTTAG
ATT~rAGA-AT
GCACAACI'
TCAGCCTATA
ACCAGAAAAT
AATACGGTrGT
TGOTTCGATAG
CTACTCCTGA
CTG'T=TACA
AAAACGCAGT
ATGAAAT'GCG
TATTGAAAGA
TGCACCACCT
TGCTACGCCA
TrCACAl-rG
CACTGATGAG
AAAAGATrA
GACAATI'ATG
TGCAACATCA
EF001-4 (SEQ ID NO:4) CGNGNGA< ESNDIVKEVK EDTTITFWHA MNGVQEE-ALT PTITQAYPGW LWNAAQDEML KSTZEMLFYNA DLLKEYGVEV KGVDFNKDLD LTSKDSQEVV GFVQKDAEAG GYEYGVAPRP WAQQTGYMP I LESVLHSDEY SIFASSNXDT RKLLKDATSQ FF002-1 (SEQ ID KLTKDFrMEN
VDLKPYMDDD
PKTLEELKEA
DYYREXMEAG
EKINLQQGTD
KNSKTI'KVPA
FEQAWNQ
PKIKVFELQNQ
TIGWKDAEPI
SKTIYEKSNK
YFRTAGSDKY
IYMFDSATPE
QLENAVYKDLF
SAYPDLQAKI
REVLLDGAKI
EVVGAGFDSL
LSGPFANKKV
QRTAAFEFMK
Al PVEENADS
NSTLTSPKDL
DGKQYGIPFN
NNYYA IGbM(N AMFVGS IAGA
FLATPDSQLY
AYNEMRTIME
S S
S
5 p
TAAATAGCGG
TTAGCAGTGG
GAAAAGAGTG
CCAGAATTTG
CCGGTGGACA
GATACGACGG
AATCAATTGG
AGTTACGAGA
TGGGTATTGT
TTAACTI'GGG
TATGGTG CCT
AATGCCAATT
AGAATGCAAA
TATCAATCAC
GGGACTTI'AT
ATACCACAAC
AATAAAAACA
GAAGGTGCAA
GATAAAATCT
AGGTAGTACA
CGGCACTAAC
AAGATGGCAA
AGAAATTA T
TTGCTTCAGA
ATATTTTAAC
T'GGATTTAAC
TGTATGAAAT
ATTACAATAA
ATGAATATGA
ATCAACATAC
TGATTGAACC
AAGATCAATC
AATI!TGAAAA
TAACAAACAT
AAGAAAAGG
GTAAAAAACA
AACTT'ITAGC
AC'ITTGCAAG
AATGAAATI"I
TTTAACAGCA
AACAAAATTA
CAGAGCTTTT
TGATTATGAC
CATGAA-AAAC
CGATCACG'11
CGATGGTAAA
AAAAATGTTT
AGCGTTAGCG
TTGGCGCTCA
AAAATACAAT
ACAAATGGAT
TCAAAAGCG
TGATGATGGC
CAAAGCAACT
AAAAGCTGCT
AGAAGTAGGG
AAAAGGAATG
TGGAAAA-AAG
TGTGGTGGTT
ACAGTAACTA
GAAGCGGAAA
ACAAAAGTAA
TTACIFICAT
AAAGATTTAG
ACCTATGCTC
GATGAAGCCG
AAAAAATTAT
ACCGT1'CAAG
TATATGGAAA
TTTGGAACAG
GCGATGATGT
AAAACAAATG
ACCTTI'1'GCT
CAAAAATTCT
GTGGTTCCTT
CCTI'CAGACG
GCTTAACAGC
CAAGTGAAAA
CT-TGGAATTA
ATCCTGATAT
CAACGATGCT
'ATTCTAATTA
ATATCGAACC
AGCCTTACCG
GAATTGCCTA
CTAAACCAGA
CGATI'GCTGC
CTrATTATGA
CAAAATCAAC
ACATGGGTAG
TCGAATGGGG
CACCGACAAC
TAGACTTTG-C
C'FrATAAAAC
AGTCTCACAA
GGCAGCGCTG
GAAACCAACT
TGACACGACC
CACTATTGAA
'PrCATCAGGA
CGCGCTACGC
TGCCAAAGCA
TACAGATTC
TCCCCATAAC
AGAACAAGTA
TGCTCAAAAC
TCGCGCATTG
AAAAGTAACG
CTGGTACATG
GATTGCCGAA
TTTTGCAATT
TICAGGTAAA
AGATGAAATT
AAAGCC'ITTA
ACCCAGATAC AATTAATTTA G EF002-2 (SEQ ID NO:6) MXFW KXGLTAAALL AVAAVTLTAC GGSSEKKATE KSEDGKTXLT VMITNYDIfP EFEKLFRAFE AENPDITIEP VDIASDDYDT KVTI'MLSSGD TTDI LTMKJNL LSYSNYALRN QLVDLTDHVI( DLDIEPAKAS YEMYEIDGKT YAQPYRTDFW VLYYNKKMFD EAGIAYPDNL TWDEYE-ALAX KLSKPEEQVY GAYQHWRST VQAIAAAQNN ANLIEPRYNY METYYDRALR MQKDQSQMDF GTAKSTKVTY QSQFENSKAA MMYMGSWYMG TLLTNIDDGK TNVEWGIAEI PQQEKGKATT FGSPTSFAIN KNSKKQKAAQ KFLDFASGKE GAKLLAEVGV VPSYKTDEID 81 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.f[aecalis Genes.
KIYFARKGMP SDESHKKPLT QIQLI FQO02-3 (SEQ ID NO:7) A TGTGGTGGTT CAAGTGAAAA GAAACCAAC GAAAAGAGTC- A.AGATG.GCAA AACAAAA LTA
CCAGAATI'G
CCGGTGGACA
GATACGACGG
AATCAATTGG
AGTTACGAGA
TGGCCTATTGT
TTAACTTGGG
TATGGTG-CCT
AATGCCAArI'
AGAATGCAAA
TATCAATCAC
GGGACTTTAT
ATACCACAAC
AATAAAAACA
CAAGGTGCAA
GATAAAATCT
ACCCAGATAC
AGAAATTATT
TTGCTTCAGA
ATA TITTAAC
TGGATTTAAC
TGTATGAAAT
ATTACAATA
ATGAATATGA
ATCAACATAC
TGATTGAACC
AAGATCAATC
AATTTGAAAA
TAACAAACAT
AAGAAAAAGG
GTAAAAAACA
AACTITTAC
ACTTI'GCAAG
AATTAATT
CAGAGCTTTT
TGATTATGAC
CATGAAAAAC
CGATCACGTT
CGATGGTAAA
AAAAATGTTT
AGCGTTAGCG
TTGGCGCTCA
AAAATACAAT
ACAAATGGAT
TTCAAAAGCG
TGATGATGC
CAAAGCAACT
AAAAGCTGCT
AGAAGTAGGG
AAAAGGAATG
T
ACAGTAACTA
GAAGCGGAAA
ACAAALAGTAA
TTACTTTCAT
AALAGATTAG
ACCTATGCTC
GATGAAGCCG
AAAAAATTAT
ACCGTTCAAG
TATATGGAAA
TTTGGAACAG
GCGATGATGT
AA.AACAAATG
ACCTTTGGcT
CAAAAATTCT
GTGGTTCCTT
CCTTCAGACG
cTI'GGAATTA
ATCCTGATAT
CAACGATGCT
ATTCTAATTA
ATATCGAACC
AGC CTTACCG
GAATTGCCTA
CTA-A.ACCAGA
CGATT GCTGC CTTAkTTATGA
CAAAATCAAC
ACATGGGTAG
TCGAATGGGG
CACCGACAAG
TAGACTTTGC
C'TATAAAAC
IAGTCTCACAP
TGACACGACC
CACTATTGAA
TTCATCAGGA
CGCGCTACGC
TGCCAAAGCA
TACAGATTTC
TCCCGATAAC
AGAACAAGTA
TGCTCAAAAC
TCGCGCATTG
AAAAGTAACG
CTGGTACATG
GATTGCCGAA
TTTGCAATT
TTCAGGTAAA
AGATGAAATT
AAAGCCTTTA
9* 9* S S EF002-4 (SEQ ID N0:8) C GGSSEKKATE KSEDGKTKLT VTTWNYDTTP EFEKLFRAFE AENPDITIEP VDIASDDYDT KVTTMLSSGD TTDILTMKNL QLVDLTDHVK OLDIEPAKAS YEMYEIDGKT YAQPYRTDFW VLYYNKK<MFD TWDEYEALAK KLSKPEEQVY GAYQHTWRST VQAIAAAQNN ANL IEPYYNY MQKDQSQMDF GTAKSTKVTY QSQFENSKAA MJ4YMGSWYMG TLLTN~IDDGK PQQEKGKATT FGSPTSFAIN KNSKKQKAAQ KFLDFASGKE GA.KLLAEVGV KIYFARKGMP SDESHKKPLT QIQLI
LSYSNYALRN
EAG lAY PDNL
METYYDRALR
TNVEWGIAEI
VPSYKTDEID
EF003-1 (SEQ ID NO:9) 5*55 *5SS *5 S
S.
a.
TAGGAGGACA
AIITAGCTG
GTTGCCGTC
AAAAAAGGGT
GTGCAACATG
AACAAAGAGA
TTCTATTCAA
CCTAGCGATC
AAATTAAAAG
AACATCACTr
ATCGCTATGG
GCGATCTTGT
AAAGGCGAAA
G'ITGCTGAAT
AAAGAATGAA
CCTGTGGGGG
AATTGGAATC
ACAAAATTAA
ACGAAGCGGA
AAAAAGCTGA
AAGAATACCA
CAACCAATGA
AAGGTGTCGG
TTGAAAGCAT
TGTTCTGCTA
TAGAAGATAA
AAGATAGCGA
ACATCAAGAA
GAAGTTTTAT
AAATAAACAA
TTCAAAAGAT
CATTATGGAA
TGCTAATTT
TTAGTGGCT
AGATGCGAA-A
AGGTCGTGCT
CTrTAACGGC
TGATTTACTG
CCCAGCCTAC
AGAAGCAAGT
AAAAATCAAG
TTAGCNACAT
GCAGACCAGA
ATCTTGGAGA
GTGAGCGACA
GCGCAACATC
GTGCAACCGA
GATTIACCTG
TTAGCAA'TT
ACGGTGGCAG
AATTAGCTA
TTAGAACCTG
AAACATTACG
GTTTTAAAAG
TCGCTGTIAT
AAGAAGACAA
TTGCCA-AGAA
ATcGTTGCCTA
AACCCTTCAT
TTTATTATT
AAAATGCCAA
TAAATGCAAA
ATGTCGTGGA
AAGCCTATGA
CTGGTTTAAC
CATTGCAAGT
AAGCGATGAC
TTCCTGCGTT
TGCAACAGT
GGAGATTACC
AGAAGCTGAG
CAACGATGCC
GGAAATGTTT
TGCTGGTGGT
AGTGrGGGATT
CGGCGTGATT
AAATCCTAAA
TGAAAAAGAC
AACGAAAGAT
TGTGACACGC
AACAAAAGAA
TTAA
AAATTCTAA.A GGCGCCAATA EF003-2 (SEQ ID NO:lO) 82 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes.
M"flKFYL ATFAVIATVI LAACGCNXQA DQKEDKEITV AVQLESSKDIl LEIAKXEAEK KGYKINIMEV SDNVAYNDAV QHDEADANFA QHQPFMEMFN KEKKADEJVAV QPIYYFAGGF YSKEYQDAKD LPENAKVGIP SOPTNEGRAL AILNANGVIK LKEGVGFNGT VADVVENPKN ITFESIDLLN L.AKAYDERDI AMVFCYPAYL EPAGLF1'KDA ILLEDKEASK HYALQVVTRK GEKDSEKIKV LKEAIM4I'EV AEYIKKNSKG ANIPAF EF003-3 (SEQ ID NO:11) CTGTGGGGG AAATAAACAA GCAGACCAGA AAGAAGACAA GGAGATTACC
GTTGCCGTGC
AAAAAAGGGT
GTGCAACATG
AACAAAGAGA
TTCTATTCAA
CCTAGCGATC
AAA ITAAAAG
AACATCACTT
ATCGCTATGG
GCGATCTTGT
AAAGGCGAAA
GTTGCTGAAT
AATTGCAATC
ACAAAAT'AA
ACGAAGCGGA
AAAAAGCTGA
AAGAATACCA
CAACCAATGA
AAGGTGTCGG
TTGAA.AGCAT
TGTTCTGCTA
TAGAAGATAA
AAGATACCGA
ACATCAAGAA
TI'CAAAAGAT
CATTATGGAA
TGCTAATTTT
TTTAGTGGCT
AGATGCGAAA
AGGTCGTGCT
CTI'TAACGGC
TGATTTACTC
CCCAGCCTAC
AGAAGCAAGT
AAAAATCAAG
AAATTCTAA.A
ATCTGGAGA
GTGAGCGACA
6CGCAACATC
GTGCAACCGA
GAT=ACCTG
TTAGCAATT
ACGGTGGCAG
AATTTAGCTA
TTAGAACCTG
AAACATTACG
GTTTAAAAG
GGCGCCAATA
TTGCCAACAA
ATrGTTGCCTA AACCC7TTCAT
TTATTATT
AAAAT13CCAA
TAA.ATGCAAA
ATGTCGTGGA
AAGCCTATGA
CTGGTTI'AAC
CATTGCA.AGT
AAGCGATGAC
TTCCTGCGTT
AGAAGCTGAG
CAACGATCC
GGAAATGTT
TGCTGGTGGT
AGTGGGC-ATT
CGCGCTGATT
AAATCCTAAA
TGAAAAAGAC
AACCAAAGAT
TGTGACACGC
AACAkAAAGAA
T
S
*5 5* S S S S
S
b 5S55
S
S. S. S
S.
EF003-4 (SEQ ID NO:12) CGGNKQA DQKEDKEITV AVQLESSKDI LEIAKKEAEXK KGYKINIMEV SDNVAYNDAV QHDEADANFA QHQPFMEMFN YSKEYQDAXD LPENAKVGIP SDPTNEGRAL *AILNANGVIX ITFESIDLLN LAKAYDEKOI AMVFCYPAYL EPAGLTTKDA GEKDSEKIKV LKEAMTTKEV AEYIKXNSKG ANIPAF KEKXADLVAV QP IYYFAGGF LKEGVGFNGT VADWVENPKN ILLEDKEASK HYALQVVTRK EF004-1 (SEQ ID NO:13)
TAAATCGAAA
ATTTGCAG
GAAACTGCTG
ACAGAAAACA
TTAGTAAAAA
GTCAFI'ACGG
GCGATAGAT
AATGTTGAAG
GAAGGATGAT
CTCTTCTCTC
CCCAGTCAAG
TAACCCAAC
TAGATCGITAA
TTAAGGTAAT
CCAACTCAGG
AAGTAGCCAA
AGAAATGAAA
TGCCTGTAC
CACTA'ITGAA
AGTTAAACAG
AAATAATGT'
TAATCATGAA
TACAGAGGCA
AACATTAGAA
CAGAACCATA
AAAATGATTA
AACGCAAAAA
GCTI'CAGACA
TTAGA.AGAAA
AAAGATGACA
GCAAAAAAAA
CAAA.AGACTG
AATGATAACG
CCAAAATCAA
AATTTGCAGG
ATAATACACA
GTAACGAAAA
AATTAACTC
CATCAGATAA
ATATGCAACA
CCATATACGG
ATGT'rAI=C
CTAAAAATGA
CATTGCTCTT
APLAGAAAG' CC
CGACCCTAAT
TGACGAGAAA
CCCTCACGCT
AATGCAGACT
AATTCAATTA
MTCATCACA
AAATAWI'ATT
CCTTACACGA ATGGGAACCA CCGTTAGTAA AATAA EF004-2 (SEQ ID NO:14) MKK MIKFAGIALI FAALLSACSN AKNNTQKKAE TAAQSSTIEA SDSNENEPNT ENITQAVKQL EE-KFNSDEXL VKIDVKNNVK DDTSDNP14AV ITVKVINDEA KKNMEEMQTA IDSNSGTEAQ KTAIYGIQLN VEEVAKTLEN DNDVISFITP YTNGNDRTIA KSTKNEN~IIP
LVK
EF004-3 (SEQ ID 83 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Gencs.
CTGTAGC AACCCAAAAA ATAATACACA AAAGAAAGCC GAAACTGCTG CCCAGTCAAG ACAGAAAACA TAACCCAAGC TTAGTAAAA-A TAGATGTTAA GTCATI'ACGG TTAAGGTAAT GCGATAGA'TT CCAACTCAGG AATCTTGAAG AAGTAGCCAA CCTTACACGA ATGGGAACGA CCGTITAGTAA AA
CACTATTGA.A
AcTAA-ACAC
AAATAATGTT
TAATGATGAA
TACAOAGGCA
AACATTAGAA
CAGAACCATA
GCTTCAGACA GTAACGA-AAA TTAGAAGAAA AATTTAACTC AAAGATGACA CATCAGATAA GCAAAAAAAA ATATGGAAGA CAAAAGACTG CCATATACGG AATGATAACG ATG-TTATTTC GCAAAATCAA cTAAkAAATGA
CGAGCCTAAT
TGACGAGAAA
CCCTCACGCT
AATGCAGACT
AATTCAAITA
TTTCATCACA
AAATATTATT
EF004-4 (SEQ ID NO:16) CSN AKNNTQKKAE TAAQSSTIEA SDSNENEPNT ENITQAVKQL EEKFNSDEI(L VKIDV.KNNVK DDTSDNPHAV ITVXVINDEA KKNME-.EMQTA IDSNSGTEAQ KTAIYGIQLN VEEVAKTLEN DNDVISFITP YTNGNDRTIA KSTKNENIIP
LVK
EF005-1 (SEQ ID NO:17) 00 9* 9 .09 6
TAAAAAATGA
GTTGGTTGt'G
CTAACTCTTT
GGATGGTCAG
GGACTGAAAG
ACTGCTCAAC
CCAGATTTAC
CAGGATATTG
GAATCCT ITG
TGGCCTGCAG
GTTGCCACAG
AGTATTTCAG
AAGAATGCTA
AATGATGTCA
AAAAACGATT
GTAAAAATCA
ACATI'GTGCG
ATGCGGTCCT
ATGTTGCCTT
TTATFI'AGA
GTGAATTTAA
CTGATGATCA
CCAATAGTGT
AAGACTATGC
AATCAGCCAA
CGTTG LTAGC
GTGTCACAAC
GCTACTTAGA
CACGATTGTG
GCAAGCAACG
CCACGGAAAA
A.ACACCAGAA
TCAAAATGCA
TCAAAATAAA
'TTrGGTAGC
AGGTGTTTCC
AGCTAAACTG
TACAATTACT
TTCTGCGAAT
AACTTTMT
AATTCATI'AC
AGCAGGCGAA
GGGATGCTTT
ACAAALAGAAA
ACCATGTTAA
GGTGAAA-AAG
TATAGTACTO
GCAGGCAAAG
TATGAAGGGG
TTAGAA~GAAT
GATCAACAGC
AAACGTTTGA
GGCAATG=T
GATGATTA
AAAA.ATGGCG
AAAGAATCAA
TTCTGGCCAT
AAGAGACAAA
ATACGACGGA
TTGTGACAGC
ATAGTGGCCG
ACCTTGAAGT
ATTTAAATAA
'TATGAAAAA
GCGAGGAAAG
AAAA.AGGCTr
TAGTGGTCTC
AAGTCCCAGA
AATATACTTT
AATA.A
TTTAGTAATG
ACCTGAAGAA
CCGCGTACAA
AACTGGGAT
CGCCTTGCAA
CGTGCGTGAC
GACALATGTGG
CATGACTCCT
CPLAGAATAAC
AGATAA.AATT
TCATGGCTTG
AGGCGGTTTG
GGATAPAGTC
see* EFOO5-2 (SEQ ID N'O:18) MKKRLTIVG MLFLAILVMV CGCKNQQATT KEKETKPEEL TLYIVRHGKT
MLNTTDRVQG
WSDAVLTPEG EKVVTATGIG LXDVAFQNAY sSDSGRALQT AQLILDQNKA GKDLEVVRDP DLREFNFGSY EGDLNKTMWQ DIADDQGVSL EEPMKENMTPE SFANSVAKLD QQREESKNW PAEDYATITK RLKKGLDKIV ATESANSGNG NVLVVSHGLS I SALLATLFD DFKVPEGGLK NASVTTIHYK NGE-YTLDKVN DVSYLE.AGEK ESK EFOO5-3 (SEQ ID NO:19) TTGTG GTAAAAATCA GCAAGCAACG ACAAAAGAAA AAGAGACAAA ACCTGAAGAA CTAACTCTIT ACATTGTGCG CCACCCAAAA ACCATGTAA ATACCACGGA CCGCGTACAA GGATGGTCAG ATGCGGTCCT AACACCAGAA GGTGAAAAAG TTGTGACAGC AACTGGGATT GGACTGAAAG ATGTTGCCTT TCAAAATGCA TATAGTAGTG ATAGTGGCCG CGCCTTGCAA ACTCCTCAAC TTATITTTAGA TCAAAATAAA GCAGG-CAAAG ACCTTCAAGT CGTGCGTGAC CCAGA TTAC GTGAATTTAA TTTTGGTAGC TATGAAGGGG ATTTAAATAA GACAATGTG CAGGATATTG CTGATGATCA AGGTGTTTCC TTAGA1AGAAT TTATGAAAAA CATGACTCCT TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes.
GAATCCTTTG
TGGCCT'GCAG
GTTGCCACAG
AGTATTTCAG
AAGAATGCTA
AATGAIrTCA
CCAATAGTGT
AAGACTATGC
AATCAGCCAA
CGTTGTTAGC
GTGTCACAAC
GCTACTTAGA
AGCTAAACTG
TACAATTACT
'ITCTGGGAAT
AACTTTATT
AATTCATTAC
AGCAGGCGAA
GATCAACAGC GCGAGGAAAC AAACGTTTGA AAAAAGGCTT GGCAATGTTT
TAGTGC-TCTC
GATGATTTTA
AAGTCCCAGA
AAAAATGGCG AATATACTTT AAAGAATCAA AA
CAACAATAAC
AGATAAAATT
TCATGGCTTG
AGGCGGTTTG
GGATAAAGTC
EF005-4 (SEQ ID CGKNQQATT KEKETKPEEL TLY IVRHGKT MLNTTDRVQG WSOAVLTPEG EKV'JTATGIG LKDVAFQNAY SSDSCRALQT AQLILDQNKA
GKDLEVVRDP
DLREFNFGSY EGDLNKTMWQ DIADDQGVSL EFMKNMTPE SFANSVAKLD
QQREESKNNW
PAEDYATITK RLKKGLDKIV ATESANSGNG NVLVVSHGLS ISALLATLFD
DFKVPEGGLK
NASVTTIHYK NGEYTLDVN DVSYLEAGEI< ESK0 EF006-1 (SEQ ID NO:21) ev 0 00.*0 S00 0000 0e
TAAACGATAA
GCAGTAGCTG
AAAG ETGGAG
GAAAAAGAAG
GCGTTGGAAA
GCGGTTAA-AG
GGGCTT1'ACT
GTTAGCTCTT
ATCACGCTGA
ACTAAAAAGT
AATGAAGAAG
CCGAAAAAAG
G ETCGTAAAG
AAAGAAGTCC
ATGGAGGGAA
TCTTAG'FITT
CTTCACCAGT
GCGTAAAATT
GTGGCGATAT
AAAATGATTA
CGAAAAAATA
CCGTTTCAGA
AAGAAGGGGT
TGAAATTCA.A
GGGCTGCGGT
ATGCGATTGC
AAGACGAAAA
AAGATTGGAT
TAAGATGAAA
AGGGGCTTGC
TCCACATGCA
AGAAGTGACG
CGATGCCAAC
TrGACTTTGTG
CAAATCGTTA
TTGGCCACGC
AGACCGGACA
TCATGAAAGT
TTTAATI'AAC
CTTAGAAAAA
CAACGAAAAT
TACGAAAAAA
AAACGTACAT
GGCAATAAAA
GAGATTTTAG
ACTTATACAG
TATTI'CCAAC
AATGCAGG;TG
CAAGAAATTC
GTATTkACTA
ACTGCTAC'T
GATCCAGCAA
TCAALACTTTG
GAAAGTTCAC
GTAAAAAAAT
TGGAACGGCG
TATGC-TCAGT
AGAGTGATGA
AACATGTAAA
ATTACGTGCT
ATGTGCCGTT
CTGATGGTTC
TCITAGAAGA
TCGATGATAT
TCATGACCAC
CCGTGGATCA
CTTATGCCAA
TAGTCAAAGT
CTATTGT'CC
AAT1'ACTGTA
CTLCGGTCTTG
ACCTTTATTA
ACCTAACAAG
cTTTAATGAA
AGAACCAGTT
AACGATTTAC
TGCTGGTTTA
TGATAAAAAT
TCTITATGAC
AGGATTAAAT
TATTATTGCG
GTTrACGTAGC
AGTCAATGAA
TAA
EF006-2 (SEQ ID NO:22) MKK RTI.WSVITVA VAVLVLGACG NKKSIDDSVLK VGASPVPH-AE ILEHV'KPLLE KEGVKLEVTT YTDYVLPNKA LESGDIDANY FQHVPFFNEA VKENDYDFVN
AGAIHLEPVG
LYSKKYKSLQ EIPDGSTIYV SSSVSDWPRV LTILEDAGLI TLKEGVDRTT
ATFDDIDKNT
KKLKvFNHESD PAIMTLYDN~ EEGAAVLINS NFAVDQGLNP KKDAIALEKE SSPYANI IAV RKEDENNMENV KKLVKVLRSK EVQDWITKKW NGAIVPVNE EF006-3 (SEQ ID NO:23) TTGC GGCAATAAAA AGAGTGATGA CTCGGTCTTG
AA.AGTTGGAG
GAAAAAGAAG
GCGTTGGAAA
GCGGT'AAAG
GGGCITTTACT
GTTAGCTCTT
ATCACGCTGA
ACTAAAAAGT
AATGAAGAAG
C LTCACCAGT
GCGTAAA.ATT
GTGGCGATAT
AAAATGAT'A
CGAAAAAATA
CCGTTTCAGA
AAGAAGCGT
TGAAATTCAA
GGGCTGCGGT
TCCACATGCA
AGAAGTGACG
CGATGCCA.AC
TGACTTTGTG
CAAATCGTTA
TTGGCCACGC
AGACCGGACA
TCATGAAAGT
TTTAATTAAC
GAGA'ITTTAG
AC~TTATAcAG
TATI'TCCAAC
AATGCXGGTb CAAGAAATrC
GTATTAACTA
ACTGCTACTT
GATCCAGCAA
TCAA.ACTTTG
AACATGTAAA
ATTACGTGCT
ATGTGCCGTT
CGATTCAT'T
CTcATGGTTC- TC?1'AGAAGA
TCGA'GATAT
TCATGACCAC
CCGTGGATCA
ACCTTTATTA
ACCTAACAAG
CTTTAATGAA
AGAAcCAGT
ALACGATTTAC
TGCTGGTTTA
TGATAAAAAT
TfC'ITTATrGAC
AGGATTAAAT
TABLE 1. Nucleotide and Amino Acid Seqeuences of E.JAecalis Genes.
CCGAAAAAAG ATGCGATTGC CTTAGAAAAA GAAAGT'--CAC CTTATGCCAA TATTATIGCG GTCGTAAAG AAGACGAAAA CAACGAAAAT GTAAAAAAAT TAGTCAAAGT GTTACGTAC AAAGAAGTCC AAGATTGGAT TACGAAAAAA TGGAACGGCG CTATTG'=C AGTCAATGAA EF006-4 (SEQ ID NO:24) CG NKKSODSVLK VGASP JPHAE ILEHVKPLLE KEGVKLEVTT YTDYVLPNKA LESGD IDANY FQHVPFFNEA LYSKKYKSLQ EIPDGSTIYV SSSVSDWPRV LTILEDAGLI KKLKFNHESD PAIMTTLYDN EEGAAVL INS NFAVDQGLNP RKEDENNENV KKLVKVLRSK EVQDWITKKW Di6AI-VPVNE:
VKENOYDFVN
TLKEGVDRTT
KKDAIALEKE
AGAI HLEPVG
ATFDDIDKNT
SSPYANI IAV EF008-1 (SEQ ID
TAAACCGTGA
ACACTTTTAG
AAAITAGCAA
GACAAAATTG
TTACCAGAAG
TTAGAAACAG
AATAAAGATT
CAAGAACAAA
GAAAACATL'C
AACGCGAAA.A
GCTGATATTC
TCCAAAGCTT
ACACCTGAAC
TTGTTGAAA
CCAA TI!ACG
TACTACAGCA
GAAAGAAATG
CAGGGTTAAC
TTGTGACAAC
AGCTGCATAG
ACATTGCGAA
GCGGAAATGG
AC=1TCTAC
CAGAAGATCC
GTGACGTGTT
ATTATACCCA
CTGATGATAA
ATGATIAAA
AAATGACCAC
CCAGTGTCGA
ATACACTTTT
TGATCAACTG
GAGGAATCAA
GTTAGCTGCT
GAACTCGATC
TATTGTGCCA
AGCTTCTGAA
CTGGTITAAC
AAGCAAAAAT
ACATGCTTGG
AGTAGAAAAA
AAAACTTAGC
AAAATTA ETA
TGCCGCTTAT
GATTATTGAT
TAAACGTAGT
CACAGACTCT
GAA ITTAACA CGAATGAAAA AATTTAGT= TGCGGGAATC AAGCCGCTGA CTATCTGATT TAGTGAAAAA ATTGGGACAG ACCCTCACGA
GCGGACATTI'
AAA'IT1AATGA
GTTACGCCAC
TTAGACATTG
GATCCAAAAA
AAACTACATG
GTTACAAGTG
ATTI'GGGAAA
ACCATTAAGA
ATGGAACGGG
TATTCTI'PA
AAACCGCCAA
AATATTTAAC
AAAATGGCAT
ATAAAGAT1T
AGGAAGCCAA
AAGGTGCCTT
'rTAACACAGA
AATCAAAAGC
TCTCAAAAGA
ATT'TTTTTA
AAAGAAAGAA
TGTTGGGCAA
ATATGAACCG
CGGCTTGAAC
AAAAGTTGAG
AAGTGCCGGT
TAAATATGTA
CTATACAGAA
AGCTAAAT
TAAATATTTC
AAGTCAAGGN
ACCTGTGTTA
AGTGAAACGA
CTTGCCAA.AG AACGAACAGA AGGCGATACG AAAATCCATG ATGGCTIAAT GAGTAAATAA EF008-2 (SEQ ID NO:26) t-(KFSLFFLT
KIELHSIVPI
KDYFSTSKNV
AKYTEKLSK
PEQMTTI IDT YSbZ2NWNLTK
LLAGLTLAAC
GTDPHEYEPL
TPQYLTSAGQ
LHEEAKAKFA
IKKSKAPVLF
IHDGLMSI(
GNQAAEKKEK
PEDIAKASEA
EQTEDPHAWL
DIPDDKKLLV
VETS VDKRSM LAIV'ITNS IL
DILFFNGLNL
DIENGIKY'VE
TSEGAFI(YFS
ERVSKEVKRP
SDLVKNVGQD
ETGGNG1WFNI<
NIRDVLVE-KD
KAYDLNAAYI
IYDTLFTDSL
1 M(TAKKVEN
PKN~KFYTEN
WEINTESQGT
AI(EGTEGD'IY
EF008-3 (SEQ ID NO:27) T TCGGGAATC AAGCCGCTGA AAAGAAAGAA
AAATTAGCAA
GACAAAATTG
TTACCAGAAG
TTAGA-AACAG
AATAAAGATT
CAAGAACAAA
GAAAACATTaC
AACGCGAAAA
GCTGATATC
TTGTGACAAC
AGCTGCATAG
ACATI'GCGAA
GCGGAAATGG
ACTTTTCTAC
CAGAAGATCC
GTGACGTG7T
ATTATACCGA
CTGATGATAA
GAACTCGATC
TATTGTGCCA
AGCTTCTGAA
CTGGTTTAAC
AAGCAAAAAT
ACATGCTTGG
AGTAGAAAAA
AAAACTTAGC
AAAA ITATTA CTATCTGATT TAGTGAAAAA rGGCA ATT'GGGACAG ACCCTCACGA ATATCAACCC GCGGACATTT TATTCTrrAA CGGCTTGAAC AAATTAATGA AAACGGCCAA AAAAGTTGAG GTTACGCCAC AATATTTAAC AAGTCCCGGT TTAGACATTG AAAATGGCAT TAAATATGTA GATCCAAAAkA ATAAACA'PIT CTATACAGAA AAACTACATG AGGAAGCCAA AGCTAAATTT GTTACAAGTG AAGGTGCCTT TAAATATI'TC 86 TABLE 1. Nucleotide and Am~ino Acid Seqeuences of E.faecalis Genes.
TCCAAAGCTT ATGATT1'AAA TGCCGCTTAT ATTTGGGAAA TTAACACAGA AAGTCAAGGN ACACCTGAAC AAATGACCAC GATTATTGAT ACCATTAAGA AATCAAAAGC ACCTGTGTTA
TTTGTTGAAA
CCAATTTACG
TACTACAGCA
CCAGTGTCGA
ATACACTTT
TGATGAACTC
TAAACGTAGT
CACAGACTCT
CAAT'TAACA
ATGGAACGGG
CTTGCCAAAG
AAAATCCATG
TCTCAAAAGA
AAGGAACAGA
ATGGCTTAAT
AGTGAAACCA
AGGCGATACG
GAGTAAA
EFOO8-4 (SEQ ID NO:28) C GNQAAEKKE( LAIVTTNSIL SOLVKNVGQD KIELHSTVPI GTDPHEYEPL PEDIAKASEA DILFFNGLNL KDYFSTSXNV TPQYLTSACQ EQTEDPHAWL DIENGIKYVE AKNYTEKLSK LHEEAKAKFA DIPDDKKLLV TSEGAFKYFS PEQM'FTI IDT IKKSKAPVLF VETSVDKRSM ERVSKEVKRP YSMM4NWNLTK IHIJGLMSK
ETGGNGWFNK
N IRDVLVERD KAYDLNAAY I
TYDTLFTD)SL
LMKTAKKVEN
PKNKDFYTEN
WEINTESQGT
AKEGTEGDTY
EF009-1 (SEQ ID NO:29)
TGACAA.ATGA
GCAGGTTGTG
GTAGGAACAA
GATLACAAT
GAAATTGT
GGAACGAAAT
AAATTGAAAG
ACGAATGGCG
GCGAAACAGC
ACTGAATTAG
AATAGCGGCA
CCTCTAAACG
GAGAATAAAC
ATTGCAGAAA
AAA.AATTTAG
CATCGGGGTC
AAAATGATrGA
TGGTAGAATT
TAAATGCCTT
TAGTATCAAT
ATATCACGAA
GGCGGGCGTT
AACTACCGAC
ATGCTACGCA
TGGCT'GTCGA
AAAAAGCGAA
T'ITATCAAAA
CATCAAAAGG
TAAAT1'AAT'T
TGTGAAGGAT
ATGGGAATCG
TACAGACTAT
TCAGCATCAA
TGGCAATACA
AATTrAAAGAC
AATTTTATTA
TGTCAGTGAT
AACAGCGCGC
TGCTGGGTAT
ACCTI'ATGTG
AGTTGTAGAA
CGCCAATGTT
GGACTTATTG
ACTAAGACAG
GTCAAAGACC
ACGCAACCAA
ATCTITI'TAG
GTCAATGCAC
GGCGGAGAAA
CAAACTGCAG
ATIACTGAAA
GCTTTACAAG
ACACCAGATA
AACATTGTCG
GAATATCAAC
CCAGCCTGGG
GGGTATTAGC
AAACCGTI'AA
GTI'TGAAAAA
ACGC.AGCATT
ACA.ATTACAA
CATITGGGAAT
TTGCTAITCC
GACTGATAAA
ATAAACGCCA
ATGTCGATGC
AAGATGCTAT
TGGCCCGAGA
AAGA-AGAAAC
AAACA G
TTTTACGATT
ACTAGGGGTT
GAAAAATATT
AGCAGAAAAA
TAAAGAGCAT
TTACGCTAAT
TAATGACCCA
AGTAGATCCT
ATTGAAAATA
LTCAGTGAT
TTCTTAGAA
AGAAGATCAA
GAAAAAGGTC
TAAAAAATAA
EF009-2 (SEQ ID MKKFSKLIG L IGVLAFTIA GCASGSVKDT KTETVKLGVV GTK(NDENESV KDRLKKKNID LQLVEFTDYT QPNAALAEXE IDLNAFQH-QI FLDNTYNKEHG TKLVSIGNTV NAPLGIYANK LKDITKIKDG GEIAIPNDPT NGGRAL ILLQ TAGLIKVDPA KQQLPTVSDI TENKRQLK IT ELDATQTARA LQDVDASVIN SGMAVDAGYT PDKDAIFLEP vNEK.AKPYvN ivvAREEDQE NKLYQKWVEE YQQEETKKVI AETSKGANVP AWETFGKK EF009-3 (SEQ ID NO:31) TTGTG CATCGGGGTC IGTGAAGGAT ACTAAGACAG AAACCGTTAA ACTAGGGGT
GTAGGAACAA
GAT'ITACAAT
GAAATGA'T
GCGAACGAAAT
AAATTGAAAG
ACGAATGGCG
GCGAAACAGC
ACTGAATTAG
AATAGCGGCA
CCTGTAAACG
AAAATGATGA
TGGTAGAATT
TAAATGCCTT
TAGTATCAAT
ATATCACGAA
GCGGGCGTr
AACTACCGAC
ATGCTACGCA
TGGCTGTCGA
AAA.AAGCGAA
ATGGGAATCG
TACAGACTAT
TCAGCATCAA
TGGCAATACA
AATTAAAGAC
AAITTTATTA
TGTCAGTGAT
AACAGCGCGC
TGCTGGGTAT
ACCTTATGTG
GTCAAAGACC GTTTGAAAAA GAAAAATATT
ACGCAACCAA
ATC TITTTAG
GTCAATGCAC
GGCGGAGAAA
CAAACTGCAG
ATTACTGAAA
GCTTTACA-AG
ACACCAGATA
AACAI1TGTCG
ACGCAGCATT
ACAATTACAA
CATTGGGAAT
TTGCTATTCC
GACTGATAAA
ATAAACGCCA
ATGTCGATGC
AAGATGCTAT
TGGCCCGAGA
AGCAGAAAAA
TAAAGAGCAT
TTAcGCTAAT
TAATGACCCA
AGTAGATCCT
ATI'GAAAATA
TTCAGTGATT
TTTCTTAGA-A
AGAAGATCAA
87 TABLE 1. Nucleotide and Amino Acid Seqeuences or E. faecalis Genes.
GAGAATAAAC TTTATCAAAA AGTTGTAGAA GAATATCAAC A.AGAAGAALAC GAAAALAGGTC ATTGCAGAAA CATCAAAAGG CGCCAATGTT CCAGCCTGGG AAACATTTGG TAAAAAA EF009-4 (SEQ ID NO:32) CASGSVKflT KTETVKLCVV GTKNDEWESV KORLKKKNID LQLVEFTDYT QPNAALAEXE IDLNAFQHQI FLONYNKEHG TKLVS7GNTV NAPLGIYANK LKDITKIXDG GEIAIPNDPT NGGRALILLQ TAGLIKVDPA KQQLP--1SDI TENKRQLKIT ELDATQTARA LQDVDASVIN SGMAVDAGYT PDKDAIFLE-P VNEKAKPYVN IVVAREEDQE NKLYQKVVEE YQQEETKKVI AETSKGANVP AWETFGKK EF010-1 (SEQ ID NO:33)
S.
S
S
T'GAA.AGAATA
CGTAGCTGGC
GCAGCGTCAA
CCAGATAATT
GCAAAATTAA
GTTGCTTCTG
AAAGGCGCCT
CAATTGCCAG
ATTTTAGGTA
TACAATTTAC
GAATTCCNTA
ACAGGTAAT
ACTGGCTCAC
GTCTTGCATC
CCATATGACT
TATGGTGATA
GAACCATTAA
AAAAACAAAG
AACGGACTTG
AAA7TTGA
TTAATCATIT
ATCGAAGAAG
ACTGAAATCA
ACTGTTGATC
GATAAAGTTC
AAATTGTACAC
GCACAGCTT
AAGGTGATGA
ATGACCAATT
AAATGGAATT
GTGAAAGCTA
ATGCTGATTT
ATAACTATAT
ACTCTTACGG
ATATTAGTAA
AAAANGANCC
ATGACTTCCC
cAAAAATTAT
AATGCTATAA
TAAATACCAA
,.CAATCTTAAC
AAACAACAGC
AAAAATCTGT
TTTATGGTGA
AAGATTACAC
GGCCAGAAGA
cAAAAGATI'C
CTAACGTI'QC
CAGAAGAAAC
AAA.AAGAAAT
;GAGGAAATA
kLGCTGTAGGA rAGTACACCA
A.ATCCATAAT
TGTTGGTT'GG
I'GATATTTCA
A.ACTGATTTA
TAAAGGAAAT
TCAACAAGTT
AGTCGATGGT
AAATATTGCT
TATTGGTAAC
TAACCAATAT
AGATGGCTTG
TACTTGGTTT
ACAAGCTGCT
TCAAGCGCAA
TGAATI'GTTA
AGAAGGCAAA
.ACCAACAACT
ATCTGTCACT
ACCAATTCTI
TACAGTTATG-
ACTTCCAAA-P
GCAAACACA-P
AGGAATGAPA
CTA-AGCGCCT
ACGTIATTAA
GCGAATAAAA
GGCGATTGGG
TTAGCACAAA
GCACCTAAAT
ACCA'IWAATC
TTAACTT ITA
AGTTATGAAA
GCTFrTCTA CAATATC CAT
GCCGACAAAG
ATCCAACAG
ATGCGTCAAG
GGCAAACCAC
ATCCCTAACT
GGT ETATTAA
CAATATGAAA
CATTTGAGTG
GAAGAAATG
GGTTTTACT'I
AACCGTTACCG
TTAATGGATC
TTAGACGAAM
AAATGGCAAA
CGTAAATC
TGTATCGTGT
TTATCGAGAA
ACCAAAAAAT
ATTATGCAAC
ATGCCAAAGA
GAAAACTGTA
ATAAAGAATA
GTGCAACGGA
TCGGCCAAAC
TTGCAGTAAA
ACATGXFTAA
ATGCTGCTAC
AAACACAAGG
TTGTTTCTCG
ATGTTGTTGC
ACAGCAATCC
AAGTTGGCGA
CTTGGAACAC
TTAAAGAACC
TTGTAAATGP
CAGCAAr.CTI]
ACCTAAAAAC
ATATCCAATC
AAGGATTAGC
TTCAAAAGAT
TGGGGACAAA
AAAAAT'rGGG
CTCAACAATC
GAATGCACA-A
AGCCTATGAT-
TGCCI'CCCA
TGTCGATAAA
AGI'CTAAAA
ATTCTTTGCA
AACAACTGAT
TPLACTTAAAA
AAGTACAACA
ACCTATGGAT
TCCACTAACA
AA.ACACGTCT
AGAATTGTTA
TGATCGTGTG
AGGAAACAAC
TGATAAGAC
LTAAAGTGAAA
AAATACAGGA
AGCTGGCTGG
TCAAAAATA
EF010-2 (SEQ ID NO:34) MAXRISR SWRTALAVGL SACGKSSKDA ASKGDDSTPT LLMYRVGDKP
ONYDQLIDNA
GAYADLTDLA
NLDISKVDGS
GSPKI INQYA
GDTILTQAAG
GLVYGEEGKQ
EEAKDSPILC
KVQKEMQTQL
NKI IEKKIGA
PKYAI(EAYDQ
YESATEVLKE
IDKDMINNLKV
KPLVSRPLTE
YEKVGDDRVK
FTFVNDKVKT
DEYIQSQK
KLKMEFVGWG
LPDNYIKGNT
FXKXXPNIAA
LHQWYKDGLI
PEJXTTAQAQM
LLIKDYTPTTH
EITNVATVMN
DWDQKMSTIV
INGKLYAFPI
FAIGQTFFAT
PTDAATSITP
ANYVVANTSK
LSAWNTGNNL
RYAASLJNTGT
ASGESYDI SL
LGNSYGQQVL
GNYDFPIGNQ
YDLNTNTWFM
NKIEKSVELLG
I IWPEESVTE
VDPEETLPKL
AQNYATNAQK
'FFNKEyVDKY ypFAVYCTTDT
RQETQGPMDY
LLNSNPELLN
EMrVKERD(S I
MDLKTAGWD
EFO1O-3 (SEQ ID TABLE 1. Nucleotide and Amino Acid Scqeuences of E. faecalis Genes.
CT CCTAAATC TTCAAA.AGAT
GCAGCGTCAA
CCACATAATT
GCAAAATTAA
GTTGCTTCTC
AAAGCCGCCT
CAATTGCCAC
ATTITTAGGTA
TACAATTTAG
GAATTCCNTA
ACAGGTAATT
ACTGGCTCAC
GTC TTCATC
CCATATGACT
TATGGTGATA
GAACCATTAA
AAAAACAAAG
AACGGACTTG
AAATTGTTGA
TTAATCA=T
ATCGAAGAAG
ACTGAAATCA
ACTGTTGATC
AAGGTGATGA
ATGACCAAT'T
AAATGGAATT
GTGAAAGCTA
ATGCTGA=T
ATAACTATAT
ACTCTTACGG
ATATTAGTAA
AAAANGANCC
ATGACTTCCC
CAAAAATI'AT
AATGGTATAA
TAAATACCAA
CAATCTTAAC
AAACAACACC
AAAAATCTGT
TTTATGGTGA
AAGATTACAC
CGCCAGAAGA
CAAAAGA'ITC
CTAACGTTGC
CAGAACAAAC
TAGTACACCA
AATCGATAAT
TTGTTG
TGATAI'TCA
AACTGATTA
TAAAGGAAAT
TCAACAAGTT
AGTCGATGGT
AAATATTGCT
TATTGGTAAC
TAACCAATAT
AGATGGCTTG
TACTTGGTT
ACAAGCTGCT
TCAAGCGCAA
TGA.ATTGTTA
AGAAGGCAAA
ACCAACAACT
ATCTCGTCACT
ACCAATTCTT
TACAGTTATG
ACTTCCAAAA
GCAAACACAA
ACGTTATTAA
GCGAATAAAA
GGCGATTGr.G
TTAGCACAAA
GCACCTAAAT
ACCATTAATG
TTAACTA
AGTTATGAAA
GCTTTTGCTA
CAATATCCAT
GCCGACIAAAG
ATTCCAACAG
ATGCGTCAAG
GGCAAACCAC
ATCGCTAACT
GGTTTATTAA
CAATATGAAA
CATTTGAGTG
GAAGAAATGG
GGT'ITITACTT'
AACCGTTACG
TTAATCGATG
TTAGACGAAT
TGTATCGTGT
TrATCGAGAA
ACCAAAAAAT
ATTrATGCAAC
ATGCCAAAGA
GAAAACTGTA
ATAA.AGAATA
GTGCAACGGA
TCGCCCAAAC
TTGCAGTAAA
ACATGATTAA
ATGCTt''CTAC
AAACACAAGG
TTGTTI'CTCG
A'TGTTGTTGC
ACAGCAATCC
AAGTTGGCGA
CTTGGAACAC
TrAAAGAACG 1-rGTAAATGA
CAGCAAGCTT
AC CTAAAAAC
ATATCCA-ATC
TGGGGACAALA
AAAAATTCG
GTCAACAATC
GAATGCACAA
AGCCTATGAT
TGCGTTCCCA
TGTCGATAAA
AGTTCTAAAA
ATTCTTTCCA
AACAACTGAT
TAACTTAAAA
AAGTACAACA
ACCTATGGAT
TCCACTAACA
AAACACGTCT
AGAATTGTI'A
TGATCGTGTG
AGGAAACAAC
TCATAAGAC
TAAAGTGAAA
AAATACAGGA
AGCTGGCTG
TCAAAAA
p GATAAAGTTC AAAAAGAAAT EF010-4 (SEQ ID NO:36) CGKSSKDA ASKGDDSTPT LLMYRVGDKP DNYDQL IDNA
GAYADLTDLA
NLDI SKVDCS
GSPKIINQYA
GDTILTQAAG
GLVYGEEGKQ
EEAKOSPILG
KVQKEMQTQL
NKI IEKKIGA
PKYAKEAYDQ
YESATEVLKE
DKOMINNLKV
KPLVSRPLTE
YEKVGDDRVK
FTFVNDKVKT
DEYIQSQK
KLKMEFVGWG
LPDNYIKGNT
FXKXXPNIAA
LHQWYKDGL.I
PLKTTAQAQM
LLKDYTPTTH
EITNVATVMN
DWDQKMSTIV
INGKLYAFP I
FAIGQTFFAT
PTDAATSTTP
ANYVVANTSI(
LSAWNTGNNL
RYAASLNTGT
ASGESYDISL
LGNSYGQQVL
GNYDFP IGNQ
YDLNTNTWFM
NKEKSVELLG
I IWPEESVTE VDPEEThPKL
AQNYATNAQK
TFNKEYVDKY
YPFAVKTTDT
RQETQCPMDY
LLNSNPELLN
EMVKERDKS I
MDDLKTAGWD
EF011-1 (SEQ ID NO:37)
TAACGIIT
ATGGGACTGT
ACAGAAACAA
CCCAAAAATC
CTAGGT-GTCG
AAATACCAAA
CAACTAAAAC
AAAGCCATTG
AAACAAAATA
ATAACTCGCT
AATGCGCTTG
TTCGGTTTAA
GAGAAAAG
TAATGTTAAG
CAGCTAAAAC
CTAAGAAACT
GTGACCGCGT
AAGTTGAATC
CAGACTTAAT
CGCCAACCAT
TCGAAACGTT
TAGAAAAAGA
TTTGTTAGT
TTCATGATAC
AATGAAAAAG
TGCTTGTCAA
GGAAGTCACA
CCTTCTTTT
GGTAGGTGCG
AGCAGGCGGC
TATrATTTCT
TTACTTAGCT
AGGCACTATT
AATI'GCTGAC
TAACCAAGGA
ATTTGGCTTC
AAATT LTTAG
ACAAATAAA
GTCAAAGACA
GATAATCG T
CCAACTAAAA
ATTAAAGAAC
GGTCGTCAAC
GTACATGCCA
TTTGATAAAG
GTGAAAAAAC
CAAC'rTTCCG
AAAGCAGCAG
CAATGATGGC
AAACAGCAGA
CCAkATGGTCA
CCTTGGATAC
ATATCCCTGC
CAGATIrAGA AAGATrATCA AAAA'rCC7'rC-
AAGAGGTAGC
AAGCAGAAGC
CTTACGGA.AA
ACGATAAGAT
AGTrTCAATG
TTCTGCA-ACA
ATTAACCGT
AATGGATGCA
GTAFFGAAA
AAAAATCAAT
AGAACAATTA
GCCATCAACG
TAAAGAAAAA.
TAGCGCGAAT
ACGCTCTCGT
TGAAGCTTCC
89 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.f[aecalis Gencs.
ACTCATGGC
GTGGTAGATC
GAATTGATTC
GTTTGGTATC
AAAGGATTAG
AAAGTG'FITC TTACGAATAT GTFI'TAGAAA AAAATCCTGG GATTCTCTPI' GCACCAAAGC AATTGGTCGC GACGATTCAA AAGATAACGT CGCTGCPAAAC AAAAAACCGA TGCTGGTAA-A AATGATAAAG TCAITATGCT TCAACCAGAT TAAGCGGTGG TGGATTAGAA TCAATGCATT TGATGATAGA AGATGTTAAA
AGTAA
EF011.-2 (SEQ ID NO:38) bMflKK FLAMMAVSM GLLMLSACQT NKKTADSATT ETTAI(TEVTV KDTNGQLTVP KNPXKVVVFD NGSLDTMDAL GVGDRVVGAP TKNIPAYLKK YQKVE-SAGGI KEPOLEKINQ LKPDL II ISG RQQDYQEQLK AIAPT IYLAV DAKNPWASTK QNI ETLGTIF DKEEVAKEKI TGLEKEIADV KKQAEASANN ALVVLVNEGQ L SAYGKGSRF GLIHE)TFG;F AADDKIEAST HGQSVSYEYV LEKNPGILFV VDRTKAIGGD DSKDNVAANE LIQKTDAGKN DKVIMLQPDV WYLSGGGLES MHLMIEDVKK GLE EF011-3 (SEQ ID NO:39) TTGTCAA ACAAATAAAA AAACAGCAGA TTCTGCAACA
ACAGAAACAA
CCCAAAAATC
CTAGGTGTCG
AAATACCAAA
CAACTAAAAC
AAAGCCATTG
AAACAAAATA
ATAACTGGCT
AATGCGCTTG
ITCGGTTTAA
ACTCATGGGC
GTGCTAGATC
GAATTGATTC
GTTTGGTATC
AAAGGATTAG
CAGCTAAAAC
CTAAGAAAGT
GTGACCGCGT
AAG'ITGAATC
CAGACTTAAT
CGCCAACCAT
TCGAAACCTT
TAGAAAAAGA
TTGTGTTAGT
TTATGATAC
AAAGTGITTC
GCACCAAAGC
AAAAAACCGA
TAAGCGGTGG
AG
GGAAGTCACA
CGTTG'TI'T
GGTAGGTGCG
AGCAGGCGGC
TATTAT'TCT
TTACTTAGCT
AGGCACTATT
AAITGCTGAC
TAACGAAGGA
ATTCGCTTC
TTACGAATAT
AATTGGTGGC
TGCTGGTAAA
TGGATTAGA-A
GTCA-AAGACA
GATAATGGTT
CCAACTAAAA
ATTAAAGAAC
GGTCGTCAAC
GTAGATGCCA
TTGATAAAG
GTGAAAAAAC
CAACTTTCCG
AAAGCAGCAG
GTTTTAGAAA
GACGATTCAA
AATGATAAAG
TCAATGCATT
CCAATCGTCA ATTAACCGTT CCTTGGATAC AATGGATGCA ATATCCCTGC GTATTTGAAA CAGATTTAGA AAAAATCAAT ALAGATTATCA AGAACAATTA AAAATCC'ITG GGCATCAACG AAGAGCTAGC TAAAGAAAAA AAGCAGAAGC TAGCGCGAAT CTTACGGAAA AGGCTCTCGT AC-GATAAGAT TGAAGCTTCC AAAATCCTGG GATTCTCTTT AAGATAACGT CGCTGCAAAC TCATTATGCT TCAACCAGAT TGATGATAGA AGATGTTAAA EF011-4 (SEQ ID CQT NKKTADSAT1' ETITAKTEVTV KDTNGQLTVP KN4PKKVVVFD NGSLDTMDAL GVGDRVVGAP TKNIPAYLKK LKPDLIIISG RQQDYQEQLK AIAPTIYLAV DAKNPWASTK TGLEKEIADV KKQAEASANN ALVVLVNEGQ LSAYGKGSRF HGQSVSYEYV LEKNPGILFV VDRTKAIGGD DSKDN'JAANE WYLSGGGLES MHLMIEDVKK GLE YQKVESAGG I
QNIE:TLGTIF
GLIHDTFGFK
LIQKTDAGKN
KEPDLEKINQ
DKEEVAKEK I
AADDKIEAST
DKVIMLQDV
EF012-1 (SEQ ID NO:41)
TGAGGGGGCA
CTITTAGCCG
GCAGCTGAAC
CAAACAACAG
TTTGATGATG
ACAACATGAA
CATGTIGGCGG
AAAAAATCAG
ATAAAAATAC
ATAGTGCCAC
ATTGGGGAAA
AACCAAAGAA
TATTAGTTCA
CTIACAATG
GGTGCCAGCT
AAAGTAGTAG
GCGGCAGAGA
CCTGCACCAA
GCACAACAFI'
CTAGCTAAAG
GTTTIGATTGC
AAGTAGAITC
TCTCAACAT
TATTTGAAGG
ATGTCAAGAT
AACAGGGT
GGGAAATA.
GGATACAACA
CCTTTATCGG
TAGTGACGAT
TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
GGGCGCAAGT
ACGGCCCAAG
AATGCCTATT
GTCGATGAAT
GCCCAACCTT
T= GTCGAAG
CCATTTACGC
GAATACTATG
GATAATACTO
CAATATGTC
TACTTCTTAG
GCGATTGGCC
AAACCCCTTA
TCCGAGCTT
ACGAAAGCCC
GATCAAGGAA
TTAGAAATTA
AATTATGAGT
AACTTATATG
CAATT-GTAG
TACAAAGAAG
AGTGCCTCAA
GATTAT'ITCC
ACCACTITAC
ATTTTGTTTA
TACTAGACAG
TAGGGATTTC
CCTTC'ITAGC
CGCAAGGCAA
TAGCCAATTG
ATGCGGATCA
GGATTAAC'T
AACAATATCA
ATI'TCAACAA
AAGCAATTGA
ACGGATTGAT
ACAGTGGCGA
AAGCGGATGT
AACGAATITGC
CCAT'ITCATC
TGTCTCTTTC
CAGGAGAATC
AAGAGGCACG
CGGAAGACAT
ATTATCTAAT
AC'ITGCGCAA
CTT'GGGAG
TTCTTGGAAA
TGTTAAAAAT
AGCCCCGAAT
AGTCGTTTCG
AGATTACGCC
GGATGCGACT
AGTGAAACTG
ATATCAAGTG
AGATGATCCA
AAAAGAAGGA
TAAAGAAGCC
TCCAAGTAAA
ATAI'rTGAAA
CGGTAAAAAA
TGAATATGTT
GCAACCAAGT
AGGATGGATT
AAGTTACAAT
AACGATTAAT
CTTGTTGAAC
CAATCCTAAA
TGCCTATI'TA
GGGATTAAGT
AAACTGGTGA
AGTTTTGAAA
GACAAAGAAT
ATTGCTT
TTGATAGTG
TCAGATACTT
GAAGAAGTTG
AATGAACTAG
GGCTATGTCA
ACGCCATrAG TTAiACACAAA
CTT'TATGCGA
AATGACGTCA
GTGAAA?=
CAAAGTCAGT
AATAATGTGA
GCCGGCAGTA
TACGGCAATT
GCCAATAATC
CAAGATGCTG
TTGAAAGGCA
ACAGAATCA
GGAGCAACG
CACCAGCCAC
TACGCAACGG
TCATTGTTGA
TAGCGOCACA
AACATTTACT
GGACATTGA
CGGTTAGCAC
ACTTAGTTCG
GTCATCCAGA
CGAATGTITCA
GTGTCTTAAA
ATCCAGAAAC
AAAAAG;tTCA
CATTGCTGGC
TGCAAGAAAA
ACCAATCGCG
GTGAATTAGA
ATCATAATIC
CAGAGAAACA
C CCAA'-TACC
TTAGTTATCA
'CGAGCCAATC
GATTGGACCG
TGAAAAGTCA
A'ITIAAAACAG
AAATCAAAAA
TTATAGCGGG
AAAAAATCCA
AATCAAAGAA
CATTAACGGA
TGTGGCCAkAC
TTTACGAAAA
CGATGGGTCA
GGATGAAGAT
AGCTGAATGG
GGCAGACACA
TCTGCCAGGT
ACGTGAAAAA
CTCTTACTTT
CAAATACGAC
GTTr~TGCAGAA
GCTCTATCAA
CTTGTATGGG
9* EF012-2 (SEQ ID NO:42) MKLGKK VVGL.IATGFL LAACGGTKEA AEKrVDSGNLA AEQK IS ISSP AP: -STLDTTQ
TTDKNTFTKA
AQDFVYSWKK
QPSFLAVVSI
YYDADQVKLE
FLDFNKKEGT
RAYSGEYLKN
EITISSQPSN
LVEEARTINA
YFHLRNAYLT
QHLFEGLYRF
LVTPATIGPN
AWLAPQNQKF
EVAVSTIKED
PLANV!4LRKA
DVKKAQAEWT
NVNQSRREKN
NNPEKQFAEY
DDDSATVPAL
AYLLDSVKNS
VEAQGKDYAL
NTG INLYQVN
IGQAIDKEAL
KAQADVGKKV
YELSLSGWIA
KEAEDILLNQ
AI(DVKI SDDG.
FEI RNGEXSV
DSEHLLYSGP
ELDLVRINGQ
TQSVLJNDCSK
KLSLLAAD)TD
GSSELDSYFN
DAAQVPLYQS
RKYHFTLR.EG
DELGISAPND
FTLANWDATS
YVQQYQDDPG
PLNGLIPSKL
QGKRIAZYVQ
LYAGESSYNY
ASNYLIINPKL
IKWSNGEPIT
KEFIVELKQA
DTWTLKKNPE
YVSHPDVANY
YANPETDEDF
SQL.QENLPGL
GNYHNAKYIDQ
KG ISYHLYGD EF012-3 (SEQ ID NO:43) ATGTGGCGG AACCAAAGAA GCGGCAGAGA AAGTAGATTC GGGAAATA
GCAGCTCAAC
CAAACAACAG
=rGATGATG
GGGCGCAAGT
ACGGCCCAAG
AATGCCTATT
GTCGATGAAT
GCCCAACCT'
rrl'TCGAAG
CCATTTACCC
GAATACTATG
GATAATACTG
CAATATGTTC
AAAAAATCAG
ATAAAAATAC
ATAGTGCCAC
ACCACTTTAC
ATTTTGTTTA
TACTAGACAG
TAGGGATTTC
CCTTICTTAGC
CGCAAGGCAA
TAGCCAAT'G
ATGCGGATCA
GGATTAACTT
AACAATATCA
TATTACTTCA
CTI'TACA-ATG
GGTGCCAGCT
CTTGCGGGAG
TTCTTGGAAA
TGTTAAAAAkT
ACCCCGAAT
AGTCGTTTCG
AGATTACCCC
GGATGCGACT
AGTGAAACTG
ATATCAAGTG
AGATGATC CA
CCTGCACCAA
GCACAACATT
CTAGCTAAAG
GGGATTAAGT
AAACTGGTGA
AGTTTTGAAA
GACAAAGAAT
ATTGCTqIGGT rrGGATAGTG
TCAGATACTT
GAAGAAG7MI~
AATGA.ACTAG
GGCTATGTCA
TCTCAACATT
TATI=AAGG
ATGTCAAGAT
GGAGCAACGG
CACCACCGAC
TACGCAACGG
TCA TGTTGA
TAGCGCCACA
AACATTTACT
GGACA'GAA
CGGTTAGCAC
ACTrAGTTCG
GTCATCCAGA
GGATACAACA
CCT1'TATCGG
TAGTGACGAT
CGAGCCAATC
GATTGdACCG
TGAAAAGTCA
ATTAAAACAG
AALATCAAAAA
TTATAGCGGG
AAAAAATCCA
AATCAAAGAA
CATTAACGGA
TGTGGCCAAC
91 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
TACTTCTTAG
GCGATTGGCC
AAACCCCTTA
TTCCGAOCTT
ACGAAAGCCC
GATCAAGGAA
TTAGAAATTA
AATTATGAGT
AACTTATATG
CAATTGGTAG
TACAAAGAAG
AGTGCCTCAA
ATTCAACAA
AAGCAATTGA
ACGGATTGAT
ACAGTGCGA
AAGCGGATGT
AACGAATTGC
CCATITCATC
TGTCTCTTC
CAGGAGAATC
AACAGGCACG
CGGAAGACAT
AkTTATCTAAT
AAAAGAAGGA
TAAAGAACCC
TCCA.AGTAAA
ATATTGAAA
CGGTAAAAAA
TGAATATGTT
GCAACCAAGT
AGGATGGATT
A.AGTTACAAT
ALACGATTAAT
CTTGTTGAAC
CAATCCTAAA
TGCCTATTTA
ACGCCATTAG
TTAACACAAA
CTTTrATGCC-A
AATGACGTCA
GTGAAACT'fT
CAAAGTCAGT
AATAATGIGA
GCCGGCAGTA
TACGGCAATI
GCCAATAATC
CAAOATGCTG
TT GAAAGGCA
ACAGAA
CGAATrG'ITCA
GTGTCTTAAA
ATCCAGAAAC
AAAA.AGCTCA
CA TGCTGGC
TGCA.AGAAAA
ACCAATCGCG
GTGAATTAGA
ATCATATGC
CAGAGAAACA
CC CAAGTACC
TTAGTTATCA
=PIACGAAAA
CGATCGGTCA
GGATGAAGAT
AGCTGAATGG
OCCAGACACA
TCTCCCAGGT
ACGTGAAAAA
CTCTTACTTI'
CAA.ATACGAC
GTTGCAGAA
GCTGTATCAA
CTTGTATGGG
GATTATTTCC ACTTGCGCAA EF012-4 (SEQ ID NO:44) CGGTKEA AEKVDSGNLA AEQKISISSP APISTLDTTQ
ITDKNTFTMA
AQDFVYSWKK
QPSFLAVVSI
YYDADQVKLE
FLDFN'KYKEGT
RAYSGEYLKN
EITISSQPSN
L'JEEARTINA
QHLFEGLYRF
LVTPATIGPN
AWLAPQNQKF
EVAVSTIKED
PLANVHLRXA
OVKKAQAEWT
NVNQSRREKN
NNPEKQFAEY
DDDSATVPAL
AYLLDSVKNS
VEAQGKDYAL
N~TG INLYQVN
IGQAIDKEAL
KAQADVGKKV
YELSLSGWIA
KE-AEDILLNQ
AKDVKISDDG
FEIRNGEKSV
DSEHLLYSrCP
ELDLVRINGQ
TQSVLNDGSK
KLSLLA-ADTD
GSSELOSYFN
DAAQVPLYQS
RKYH-FTLREG
DELGISAPND
FTLANDATS
Y-VQQYQDDPG
PLNGLIPSKL
QGKR IAEYVQ
LYAGESSYNY
ASNYLINPKL
IKWSNGEP IT KEF IVELKQA
DTWTL.KKNPE
YVSHPDVANY
YANPETDEDF
SQLQENLPGL
GNYHNAKYDQ
KGISYHLYGD
YFHLRNAYLT E EF013-1 (SEQ ID
TAACGAAAAA
AGTCTAGCAG
GAAACAAAGA
TCTGTCACGG
CCTTCTGCAG
ATTACAATTA
CCTAATGCAA
TATGCGGATG
ITAATCGCAA
GGACAAGAAG
TCTGGTAAAA
ATTTTAGCAA
GGCACAAAAA
GA I=ACCAA
TGAAAAAAAT
GATGTGGAAA
AAGCCACGCA
GTCAAAATTC
CAACTGATGA
CCAATGTTGT
CCATTACAGC
CCAATGGCAA
CCGTTGATCA
CAGCGCTTTC
CAGCACCGAA
GTTTTACTGC
ATCGCTTAGA
ATTAA
TGCTTTGTTC
CAAAAAAACA
GAAATCCTCT
TAATGTGACA
AACGAATGAA
TTTCAATCCT
AACGGTAGTC
TTTTACAGTA
ACGGAATAGT
TTTTAGCAAT
TGCAACTATT
GGATGCCCAA
TGTTACGI A
AGTATGTTAA
GCAAGCACAA
AGCCAACAAG
GGGGAAAATC
GTCCCTCAAA
GAAAGAAATG
GGTGATGCTT
ATTAGTCCCA
GCACCTGTCC
ATTACGATTG
TTAGTGTCAC
GGGAATTTCA
AATGGAGAAA
CGTTCAGTGT
ATGAT'rCTAA
AAATGAAAAG
CGTCAGAAAA
ACCAAGCACC
AA.ATTAATGG
CTGCACAAGC
GAGCGGGAGC
AAA'FIGATAT
ATCCGAAACA
GTGCAGATGA
CAGCCAGTAA
TAGGGACACC
ATTGTCTTTA
GCCAAAGCAA
TAGTCATTCG
TGCCACGCAG
TGATACAAAC
TACTACATTA
AGGTG ITTTT
GACTACTCAA
TCCAAGTTCA
ACGGACAATT
TGCGCGGGTG
LIAGTITCC C
TTACTTGTFI'
EF013-2 (SEQ ID NO:46) NKKIALFS MLTFSVLSLS LAGCGNKKTA STNDSKPKQE TKKATQKSSS QQEMKSSHSS VTGQNSNVTG ENPSENATQP SAGTDETNEV PQNQAPDTNI TITNWVFNPE RNEINGTTLP NATITATVVG DASAQAGVFY ADANGNFTVI SPPRAGATTQL IAT JDQRNSA PVQIDIPSSG QEAALSFSNI TIDPKQGTIS CKTAPNATIL VSRADDARVI LASFTADAQG NFTASNLVPG TKNRLDVTLN GEIGTPYLFO LPN 92 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
EF013-3 (SEQ ID NO:47) ATGTGGAAA CAAAAAAACA GCAAGCACAA ATGATTCTAA GCCAAAGCA.A
GAAACAAAGA
TCTGTCACGG
CCTTCTGCAG
ATT ACAATTA
CCTAATGCAA
TATGCGGATG
TTAATCGCAA
GGACAAGAAG
TCTGGTAAAA
ATTTTAGCAA
GGCACAAAAA
AAGCCACGCA
GTCAAAATTC
GAACTGATGA
CCAATGTT1GT
CCA'ITACAC
CCAATGGCA.A
CCGTTGATCA
CAGCGCTITC
CAGCACCGAA
GTTTACTGC
ATCGCTTAGA
GAAATC-CTCT
TAATGTGACA
AACGAATGAA
TTTCAATCCT
AACGGTAGTC
TTTTACAGTA
ACGGAATAGT
TTTTAGCAAT
TGCAACTATT
GGATGCCCAA
TGTI'ACGI'A
AGCCAACAAG
GGGGAAAATC
GTCCCTCAAA
GAAAGAAATG
GGTGATGCTr
ATTAGTCCCA
GCACCTGTCC
A-TTACG;ATG
TTAGTGTCAC
GGGAATTTCA
AATGGAGAAA
AAATGAAAAG
CCTCACAAAA
ACCAAGCACC
AAATTAATGG
CTGC-ACAAGC
GAGCGGGAGC
AAATTGATAT
ATCCGAAACA
GTGCAGATGA
CAGC CAGTAA TAGG-dACACC
TAGTCATTICG
TGCCACGCAG
TGATACAAAC
TACTACATTA
AGGTGT'TTTT
GACTACTCAA
TCCAAGTTCA
AGGGACAATT
TGCGCGGGTG
TTTAGTTCCC
TTACTTGTTT
S.,
S
GATTTACCAA AT EF013-4 (SEQ ID NO:48) CGNKKTA STNDSKPKQE TKKATQKSSS QQEMKSSHSS VTGQNSNVTG ENrPSENATQP SAGTDETNEV PQNQAPDTNI TITNVVFNPE RNEINGTTLP NATITATVVG DASAQAGVFY ADANGNFTVI SPRAGATTQL IATVDQRNSA PVQIDIPSSG QEAALSFSNI TIDPKQGTIS GKTAPNATIL VSRADDARVI LASFTADAQG NFTASNLVPG TKNRLDVTLN GEIGTPYLFO LPN EF014-1 (SEQ ID NO:49)
TGATGGTCCA
AGCTTAA'rrA
GTAGCGGGTA
TATAATGAAG
ATTGAACGGA
AAGACGAAGA
AAAAATTACC
TCTGTTAACA
TTTTTATCT
GCGTTTGATC
CATTTAGATG
ATGCGTGGAT
CAATCAATCA
GTGGATTCCA
AAACAACAGC
CTATACCCAG
AAGCCAGATG
CAAAGTGATT
GGCAATACGT
ACGGAAAATG
rGACTTTTTAA
GTTTAGTCAT
GCTATTTAAA
CGCAAAATAA
AATTAGGCAC
AAATAACCTA
AAGGGATGCA
CAGTTGAGAA
TTATTAAGTT
GTGTCACCA.A
GTACGAA.AGC
TCCGACAACA
TGAAAATAAT
ATGAATTGAC
TTTCTTTTGA
ATAGTATTGA
AACGAGATCA
ATACCGTTCA
ATATTGGTGT
GCTTTATAAA
GAGAGAGGAA
CATTTTGGTT
GAAAACAATT
AGATACTCAA
AACTAGGACT
TTTAAGTTTG
GCGAATTGAA
ATTATTGAAT
AATTGATGCG
AGACGGGCCA
'TrTATCTTAT
AGAAATTATT
GGACATTATT
TCATTTAGTC
CTGGCGCACT
AAATGTCCGT
AGACGGCTAT
AGATGAAGCA
TCCTGGTAAT
ATAA
GTACAGCCAA
TTTGTCACAG
GATAAAGGCT
TCCTTTTTGA
GATGCTATGA
CCACGGGATA
CCCCCTATA
ATTCCAATCA
GTTGGCGGCA
GGATCCATTC
GCCCGTGAAA
CAAGCAGTTG
GATTCGTTAA
AAAGAAGGTT
TTAGTAATG
CATCAATTAC
GTCTTCCATA
GCTGAGGAAA
ACACAGACCG
TCAGTAGGAA
TCGGCTCAGC
ATGTTCCCAT
'ITATGGGGCT
TGGTGATTAC
GTTTTGTTCA
CCTACGATGG
ATCATI'ACGT
TAGA2'GTCAA ArTC=ATGC
GACATAGCGA
AAGACAAGTT
ATGGAAACAT
TGACTTGGAC
AAGGGCGCAG
GTGTGTCTTT
CGAACGGTGA
ACGAAAT1GAC
GCCCGTTGCC
GCGAA.AAATC
ATACTTTGCT
AAAAAATGAT
AGACAATACA
CGTGAATAAC
A.ATTGATGCG
ACCAACACCT
TGTGTT1'AAC
TGTCAAGCAG
AGGGAAACAG
TAACGATATT
GAAATCTGGT
TCAAACTGAT
cAATTATGAT
TATGGTTGAA
AAATTTAGAA
ATTTITTATAT
TTCCATCAAC
ATCAGTTAA
S
a.
EF014-2 (SEQ ID NO:SO) MSRKRRIS LISLVIILVF VTVGSAYFAV AGSYLKKTID KGYVPIKNDY NEAQNKDSQS FLIMGLDNTI ERKLGTTRTD AMMVITVNNK TKKITYLSLP NYQGMQRIEA AYTYDGPTAS VNTVEKLLNI PINHyvvFNF LSFIKLIDAV FDGVTKDGPG SIHFDACKQH LDGTKALSYA RERHSDNDIM RGFRQQEIIQ SIMXIMDIID SLNGNIQTDV DSNELTHLVK EGLTWTNYDK QQLSFDWRTF RDSFVQ IDAK
GGIDVNVKQA
AVE-DKLKSGQ
SNEGRsMVEL 93 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
YPOSIENVR{ QLRVSLNLEK PDERDQDCYV FHTNGEFLYQ S:YTVQDZEAA EENEb4TSING NTYIGVPGNT QTGPLPSVKT
ENGFIK
EF014-3 (SEQ 10 NO:51)
TGCT
GTAGCGGGTA
TATAATCAAG
ATTGAACGGA
AAGACGAAGA
AAAAATTACC
TCTGTTAACA
TTTTTATCTT
GCGTTTGATC
CATTTAGATC
ATGCGTGGAT
CAATCAATCA
GTGGATTCCA
AAACAACAGC
CTATACCCAG
AAGCCAGATG
CAAAGTGATT
GGCAATACGT
ACGGAALAATG
GcTATTTAAA
CGCAAAATAA
AA'ITAGGCAC
AAATAACCTA
AAGGGATGCA
CAGTTGAGAA
TTATTAAGTT
GTGTCACCAA
GTACGAA-A-C
TCCGACAACA
TGAAA.ATAAT
ATGAATTGAC
TTTCTTTTGA
ATAGTATTGA
AACGAGATCA
ATACCGTTCA
ATATTU'GTGT
GAAAACAATT
AGATAGTCAA
AACTAGGACT
TTTAAGT'G
GCGAATTGAA
ATTATTGAAT
AATTGATGCG
AGACGGGCCA
TTTATCTTAT
AGAALATTATT
GGACATTATT
TCAFI'TAGTC
CTGGCGCACT
AAATGTCCGT
AGACGGCTAT
AGATGAAGCA
TCCTGGTAAT
GATAAAGGCT ATG3TTCCCAT AAAAAATGAT TCGTT TGA TTATCGGCCT
AGACAATACA
GATCCTATGA TGGTGATTAC
CGTGAATA-AC
CCACGGGATA GTTTTGTTCA
AATTGATG.CG
GCCGCCTATA CCTACCATGG
ACCAACAGCT
A TTCCAATCA ATCATTACGT TGTGTTTA-AC GTTGGCGGCA TAGATGTCAA
TGTCAAGCAG
CCATCCA'ITC ATTF1MATGC AGGGAAACAG GCCCGTCGAAA GACATAGCGA TAACGATATI' CAAGCAGTTG AAGACAAGTT
GAAATCTGGT
GAT*CGTTAA ATGGAAACAT
TCAAACTGAT
AAAGA.AGGTT TGACTTGGAC
CAATTATGAT
TTTAGTAAI'G AAGGGCGCAG
TATGG'FTGAA
CATCAATTAC GTGTGTCTTT AAA'FrTAGAA GTCTTCCATA CGAACGGTGA
AITITTATAT
GCTGAGGAAA ACGAAATGAC
TTCCATCAAC
ACACAGACCG GCCCGTTGCC
ATCAGTTAAA
*4~ GCTTTATAAA A EF014-4 (SEQ ID NO:S2)
I
AV AGSYLKKTID
KGYVPIKNDY
NEAQNKDSQS FLIMGLDNTI
ERKLGTTRTD
NYQGMQR lEA AYTYDGPTAS VNTVEKLLNI FDGVTKDGPG SIHFOAGKQH
LDGT-KALSYA
SIMCIMDIID SLNGNIQTDV
DSNELTHLVK
YPDSIENVRH QLRVSLNLEK
PDERDQDGYV
NTYIGVPGNT QTGPLPSVKT ENGFIK
AMMVITVNNK
P INI4YVVFNF
RERHSDNDIM
EGLTWTNYDK
FHTNGEFLYQ
TKKITYLSLP
LSFIKLIDAV
RGFRQQEI ,IQ
QQLSFDWRTF
SDYTVQDEAA
RDSFVQ IDAI( CnIDvNVKQA
AVEDKLKSGQ
SNEGRSMVEL
LENEMTS ING EF015-1 (SEQ ID NO:53)
TAATTAAAAA
TTAATAGCTA
AAAAGTTTTG
ATAAATTTAG
AATACTATAG
ATTGCGGTGA
AGATTCATTTI
TGTGTAAAAA
TCTCT'ITITC
ACGACACTAA
ATGATTCAA
AAG LTAAAAA
AAACAGGATG
TAGAAAITTGT
GGGTCTGATG
TCCATAT1T ATATrGCTGTG
AGAATTTATC
TGGGCACGTT
GATATCAGAA
TCAACAATAT
AAAAAAGGAG
ATTTTTTTTC
GTCAAGATAG
AAAACATATT
CGTGTAAAAA
CCAGGGCNAA
TcTAAGGATT
ACATAATAGT
AcA.ATAATCC
ATGGGAAAGA
ATCcATCAAA
AAGATAATAG
CTAGTATCTG
ATTATATTTA
TATTATCTTT
ATI'TAACTCC
GATTGAGCGT
AGGGCAATAT
TCCACATCAA
TATTCCTCAC
CTAA
EF015-2 (SEQ ID NO:54) MX KGDIIVIIFL IAISFSPYFI FFHNNPFNSK SFDDTKYA'JV
KIDGKEIERI
NLDDSKEFIK TYYPSKGQYN TIEVKNGHVR VKKDNSPDQI AVKTGWISEp
GXTSICIPHR
FILEIVQQYS
KDYYIY
EF015-3 (SEQ ID 94 TABLE 1. Nucieotide and Amino Acid Seqeuenccs of E. faecalis Genes.
CAATAATCC ATTTAACTCC AAAAGTI'TG ACGACACTAA ATAAATTTAG ATGATTCAAA AATACTATAG AAGTTAAAAA ATTGCGGTGA AAACACGATC AGATTCATT TAGAAATTGT ATATGCTGTG GTCAAGATAG AGAATTTATC AAAACATATT TGGGCACGTT CGTGTAAAAA GATATCAGAA CCAGGGCNAA TCAACAATAT TCTAAGGATT ATGrGGAAAGA
ATCCATCAAA
AAGATAATAG
CTAGTATCTG
ATTATATTTA
GATTGAGCGT
AGGGCAATAT
TCCAGATCAA
TATTCCTCAC
EF015-4 (SEQ ID NO:56) NNPFNSI( SFDDTKYAVV KIDGKEIERI NLDDSKEFIK TYYPSKGQYN TIEVKNGHVR VKKDNSPD)Ql AVXKTGWvISEP GXTSICIPHR FILEIVQQYS KDYYIY EF01.6-1 (SEQ ID NO:57) 4.
TGACGGTTGC
TTGCTGGTTA
GAAGATCAAT
TTTGTCCCCA
GCCAAACCGG
ATGAAAGAAA
ACGAGCGAGC
CTTGTTTCTT
GGGGTTCAAA
AAATTTGTTA
TTAAAATCTG
TCCCACGAAG
TTTGCTGTGG
GAAkACGTTAC
GTTACAAATA
CCCCGTCCAA
TCTCTTGTAG
GGACACGGAT
TGGGTTTTCA
TTTTTAAACT
CAGAATTACA
GGGCCGAAAA
TAAAAGAAAA
ACGGCTCTTC
AAGACCAAAC
GTCGAATTGA
ATA-A EAAA
GCGTCCGCAA
GAAAAGATGG
ACACAAXAAAT
TAGAAAGGAG
TTTACTCCTA
TAACGAAGAA
AGATAAATCA
TTATGGCATT
AAATCAAACC
AGTTCAAFFC
AAACATTCCA
TGGCTATGAT
ACCTATTA
CGGACTCCTA
AAACTATACT
ATCAGACAAT
CACCCTTAGT
AAACTAA
TTTATCATGA
'TTGCAGGTT
AAACGGATTA
GGCAAAATTG
TCCGTI'ACT
ATTGATCTTA
ACACAACCTT
ACAGCGAGCG
GGCTTCGAAA
TATGACGGCT
ATCGATCGCG
ATTTCTCATG
CAATTAGTCC
AAAATTTCTC
AAAAGAAATA
GTGGTAAAAG
TTATTGGCTT
TCGGCTTTGA
TCCA.ACCGAT
TTTGGAACGG
ACATCACGAA
ACATGCAAGG
GTCAGCCTGA
TTAATGAAGC
TTTACCCCAA
TAGGCTATGA
AAA.AAATCAA
AAAAATGGTT
TTCTTTAGCC
AAA.AAGCAAC
AGATGACTCC
TGTCGACTTA
TG-ATI'CCTCT
CTACACTAAA
CGACCAAGTA
CAAAATTTA
CGTTTTGAAA
TTTCTTAGAT
CTACTATCTT
CAATGAAGAT
TACTGCCT
TGGAGAGGAC
EF016-2 (SEQ ID NO:58)
MMKKKYSLAL
VPMGFQDKSG
SERAEKVQFT
FVKDQTP ILY
AVGVRKSDNQ
LVI CCSLLLF
KIVGFDVDLA
QPYMTNDQVL
DGFNEAFLDL
LVQKINTAFE
AGCCKRKSNE
KAVFKLYGIS
VSLKEKNIAT
KSGRIOGLLI
TLRKDGTLSK
DQWTRINEE(
\JDFQPIDWSM
ASDMQGKILG
ORVYANYYLS
IS QKWFCEDV RuIGILDDSF
KETELQNQTI
VQNGSSGYDG
I-EDNLKNYTI
TNNTKIN
DLIWNGYTKT
FESQPDVLKK
SHVGYDNEDF
EF016-3 (SEQ ID NO:59)
AAGCAAC
GAAGATCAAT
TTGTGCCCA
GCCAAAGCGG
ATGAAAGAAA
ACGAGCGAGC
CTTGTTTCTr
CGGTTCAAA
AAJTGTTA
GGACACGGAT
TGGGTTTTCA
TT=rAAACT
CAGAATTACA
GGCCCAAAA
TAAAAGAAAA
ACGGCTC ITC
AAGACCAAAC
TAACGAAGAA
AGATAAATCA
~TTATGGCATT
AAATCAAACC
AGTTCAAT1'C
AAACATTGCA
TGCCTATGAT
ACCTATI'TA
CGGACTCCTA
AA.ACGGATTA
GCcAAAA'1rG
TCCGTTGACT
ATTGATCTI'A
ACACAACCTT
ACAGCGAGCG
GGCTCrGAAA
TATGACGGCT
ATCGATCGCG
TTATTGGCTT
TCG.GCTTTGA
TCCAACCGAT
TTTGGAACGG
ACATCACGAA
ACATGCAAGG
GTCAGCCTGA
TTAATrCAAGC
=IIACGCCAA
AGATGACTCC
TGTCGACT'A
TGATrTGGTCT
CTACACTAAA
CCGACCAAr-TA
CAAAATITTA
CGTFI'TAAA
F1TCTTAGAT
CTACTATCT
TTAAAATCTG GTCGAATIGA TABLE 1. Nucleotide and Amino Acid Sequences of E. fecalis Genes.
TCCCACGAAG
r=TCTGTGG
GAAACGTTAC
GTTACAAATA
ATAATTTAAA AAACTATACT ATTTCTCATG TAGGCTATGA CAATGAAGAT GCGTCCGCAA ATCAGACAAT CAATTAGTCC AAAAAATCAA TACTGCCTTT GAAAAGATGG CAC-CCTAGT AAAATTTCTC AAAAATGCTT TCGAGAGGAC ACACAAAAAT AAAC EF016-4 (SEQ ID SNE DQWTRINEEK RIIIGLDDSF VPMGFQDKSG KIVGFOVDLA KAVFKLYCIS SERAEKVQFT QPYMTNDQVL VSLKE-KNIAT FVKDQTPILY DGFNEAFLDL KSGRIDGLLI AVGVRKSONQ LVQKINTAFE TLRKDGTLJSI VDFQPIDWSM KETELQNQTI DLIWNGYTKT ASDMQGKILG VQNGSSGYDG FESQPDVLKK rJRVYANYYLS I{EDNLKNYTI SHVGYDNEDF ISQKWFCEIDV TNNTK IN EFC17-1 (SEQ ID NO:61)
TGAGGTGTI'T
CTACTTCTCT
TTACGGGTCG
ACCGCAGCAG
GA.AGCGAAAC
ACTTTTACGA
TTGAATACT
TTGAAGGdT
GGGGTAACAG
TTTCAACAAT
ACGGGCAAAA
GAAGGTTGGG
CAAGCGAATG
AAAAATCTTT
CAAGA-ACAAG
TTAAATACGC
TTGAATTC-TG
TTCGTGCCAA
GATTTAAATC
TTAGGAATTG
ATCAGTGACT
TCACCAGITC
GTTGGTGGCT
AAAAATTCCA
GCAAACGTAA
CAATrGGTTG GTGrGCCGATT
GTCTCTATCG
TTATGAAAAG GGCAACAAAG CGGGCTCTMC A.AGTGTTGGG GGATTGATTC GGAATTATCA A.TGTAATGAG CCAAGTAGGG CAGCATTGGq AACTGAAAAA TTCGAAAAGA TGCAAAATGG CTTGGAAGCG CACAGTGGAC TAAAAAATA TCGTGCTATT CCATTGATGA CCATACCI'G TATrGGCGGT ACCAGCTTT ACTATGGTAC ATCAGCTGAG ATGGCACGAA TAATACTTGG TI TCGCTAGA TAAGGTGGAT TCGAAGGGAA AGAATTAGAT GcAATGCAGc TTTGAAAATT AAAAAGATCT TITGGCAAAT AGCGTTTAGC TAA.AAATGTT CAGGTTTCAC TAATCAAGAA CTAGTGAACC AGAAAAAGCG AAAAAGCGGA GCTAACGATT ATGTTCAAGC AGCTTI'AGCA CTTTTAATAA TCGT~rAGA.A GGACGCCAGT ATATGCTGAT ATAATTTTGG TAAATGGTC'I CTTATGCAAA TAAATATGA" CGGAAGAAGC CCCCCTAGI CTGTCCAAAA TTTAGTCTANI GCGACA.AGTA A
CAAAGGCTGT
AAAGAAACCA
ACGGCAGACG
GAGGGACTTT
GTACACCCCT
AGTAACGG-CG
CCAAAAACAG
GTITGACGGTA
GAAGTAGAGC
TATCCTIAA
TCAACACTTT
TCCTATGTGA
GTCCAAGTAG
GTTGTAAA-AA
CGTGAAATTC
AAGAATGCAC
TTAAATGATG
ACGCAAAAAG
AAAGAGTAT
TTAAGTTCGG
GATAATI'TAG
AAAAGTCGCA
CCAATCGATI
*AATAAGAC CI GAACGTrGGA
*CCTCTTTATC
GGTCCATTAC
CTTTGGCAC
AAAAGCAAGA
TGTCGTTGC
TCTCCITI'A
CCAATGATGG
AGCCAATCAC
CTTCCCCGCA
GCAAATCTAA
TAAGCTATCC
ATGAAGCATT
ACAATGGCGC
AGAATAAkAA rTAAAGAAGT
TTCTGGAGA
CTGGA-ACGTA
GTCGAGCAAT
GCTCAAAAAA
ATTTTGCAGA
GGCAALACCGC
ATACAGAAAA
AAAAT7rAAC
GCGGAGA'-TT
TCTI'AAACTT
TTGATCAGTT
AAACATTACA
AATTA-ACAGA
GTTCAGGCTA
AATCATGGTT
ACAACAGGTA
AATGGATAAT
CGAAAAAGGA
TTTA-AGCTAT
AGCAAATGAT
AGCGTATTAC
AGAAGAGTTA
TATGAGTTAT
TGTCGAAAAA
CTTCACATTA
ETATTGGGAT
CAATACTGGG
AATTGTTGCA
'TATATCCAA
AGCATTATCA
AGCACTTGGC
GGAATTAGGA
TAAAAAAGAA
TGCTAAAAA
AGTCAATGTT
CGACATTGTG
ACTGCAATCA
GCTTCAAGAA
AAAAGCGGAT
AGCACGCTTA
*TI'ACAA.ATCA
EF017-2 (SEQ ID NO:62) MKRATKQ RLSLAAIMVL LLSGCGSVGK ETKKQEQQVL RVGIDSELST ADVSLAMDNT AADVMSQVGE GLFSFDEKGE AKP ALATEKV QPSNDGLSYT FTIRKDAKWS NGEPITANDF EYSWKRTVDP KTASPQAYYF EGLKNYRAIV DGSKS1(EELG VTAIDDHTLE VELSYPMSYF QQLLAVPAFY PLNEAFVEKT GKNYGTSAES TLYNGAFTLE GWDGTNNTNS YVKNKNYWDQ ANVSLDKVDV Q1JVKEVNTGK NLFEGKELDV VXISGEIVAQ EQGNAALXIR EIPGTYYIQL NTQKDLLANK NARRAIALSL NSERLAKNVL NDGSKYALGF VPTGFTNQET QKDFAEELGO TAB3LE 1. Nucleotide and Amino Acid Seqeuenices of E. faecalis Genes.
LNPSEPEKAK ELWQTAXK<EL GIEXAELTIL SSDTENAXKI SEY-VQGALAD NLENLTVNVS PVPFNNRLEK SRSGDFDIVV GGWTPVYADP IDPLNLLQSI( NSNNFG.KWSN KTFDQLLQEA NVTYANKYEE RWKTLQKADQ LVAEEAPLVP LYQLTEARLV ADSVQNLVYG PLGSGYYKSV
SIGIDK
EF017-3 (SEQ ID NO:63) CTGTGG AAGTGTTGGG AAAGAAACCA AAAAGCAAGA ACAACAGGTA
C
TTACGGGTCG
ACCGCAGCAG
GAACGAAAC
ACTTTTACGA
TTTGAATACT
TTTGAAGGGT
GGGGTAACAG
TTTCA-ACAAT
ACGGGCAAAA
GAAGGTTGGG
CAAGCGAATG
AAAAATCTTT
CAAGAACAAG
TTAAATACGC
TTGAATTCTG
TTCGTGCCAA
GATTTAAATC
TTAGGAATTG
ATCAGTGAGT
TCACCAGT'C
GTTGGTGGCT
AAAAATTCCA
GCAAACGTAA
CAATTGGTTG
GTGGCCGATT
GTCTCTATCG
GGATTGATTC
ATGTAATGAG
CAGCATTCGC
TTCGAAAAGA
CTTGGAAGCG
TAAAAAATTA
CCATTGATGA
TATTGGCGGT
ACTATGGTAC
ATGGCACGAA
TTTCGCTAGA
TCGAAGGGAA
GCAATGCAGC
AAAAAGATCT
AGCGTTTAGC
CAGGTTTCAC
CTAGTGAACC
AAAAAGCGGA
ATGTI'CAAGG
CTTTITAATAA
GGACGCCAGT
ATAATTTTGG
CTTATG CAA
CGGAAGAAGC
CTGTCCAAAA
GCGACAAG
GGAATIATCA
CCALAGTAGGG
AACTGAAAAA
TGCAAAATGG
CACAGTGGAC
TCGTGCTATT
CCATACCTTG
ACCAGCTITTT
ATCAGCTGAG
TAATACTTGG
TAAGGTGGAT
AGAATTAGAT
'ITTGAAAATT
TTTGGCAAAT
TAAALAATGTT
TAATCAAGAA
AGAAAAAGCG
GCTAACGATT
AGCTTTAGCA
TCGTTI'AGAA
ATATGCTGAT
TAAATGGTCT
TAAATAT'GAA
CCCCCTAGTT
TAGTCTAT
ACGGCAGACG TGTCGTTr-GC GAGGGACTTT TCTCCT=IGA GTACAGCCCT CCAATGATGG AGTAACGGCC AGCCAATCAC CCAAAAACAG CTTCCCCGCA G ITGACGGTA GCAAXTCTAA GAAGTAGAGC TAAGCTATCC TATCCTTTAA ATGA-AGCATT
TCAACACTTT
TCCTATGTGA
GTCCAAGTAG
GTTGTAAAAA
CGTGAAATTC
AAGAATGCAC
TTAAATGATG
ACGCAAA.AAG
ACAATGGCGC
AGAATAAAAA
TTAAAGAACT
TTTCTGGAGA
CTGCGAACGTA
GTCGACCAAT
GCTCAAAAA
ATTTTGCAGA
AATGGATAAT
CGAAAAAGGA
TTTAAGCTAT
AGCAAATGAT
AGCGTATTAC
AGAAGAGTTA
TATGAGTTAT
TGTCGAAAAA
C'rTCACATTA
ITATI'GGGAT
CAATACTGGG
AATTGTTGCA
TTATATCCAA
AGCATTATCA
AGCACTTGGC
GGAATTAGGA
AAAGAGTTAT GGCAAACGGC TAAAAAAGAA
TTAAGTTCGG
GATAATTTAG
AAAAGTCGCA
CCAATCGATT
AATAAGACCT
GAACGTTGGA
CCTCTIATC
GGTCCATTAG
ATACAGAAALA
AAATTTAAC
GCGGAGAI=
TC-TIAAACTT
ITGATCAGTT
AA.ACATTACA
AATTAACAGA
GTTCAGGCTA
TGCTAAAAAA
AGTCAATGTT
CGACATTGTG
ACTGCAATCA
GCTTCAAGAA
AAAAGCGG3AT
AGCACGCTTA
TITACAAATCA
EF017-4 (SEQ ID NO:64).
C C CCSVGK ETKKQEQQVL RVGIDSELST ADVSLAMNT
AADVMSQVGE
EYSWKRTVDP
QQLLAVPAFY
ANVSLDKVDV
NTQKDLLANK
LNPSEPEKAK
PVPFNNRLEK
NVTYANKYEE
SIGDK
GLFSFDEKGE
KTASPQAYYF
PLNEAFJEKT
QVVKEVNTGK
NARRAIALSL
ELWQTAKKEL
SRSGDFIJIVV
RWKTLQKADQ
AKPALATEKV
EGLKNYRAIV
GKNYGTSAES
NLFEGKELDV
NSERLAKNVL
GIEKAELTIL
GGWTPVYADP
LVAEE-APLVP
QPSNDGLSYT
DGSKSKEIELG
TLYNGAFTLE
VKISGEIVAQ
NDGSKKALGF
SSDTENAKKI
IDFLNLLQSK
LYQLTEARLV
FTIRKDAKWS
VTAIDDHTLE
GWDGTNNWS
EQGNAALKIR
VPTCFTQET
SEYVQGALAD
NSNNFGKWSN
ADS VQNLVYG
NGEPITANDF
VELSYPMSYF
yVKNYWDQ
EIPGTYYIQL
QKDFAEELGD
NLENLTVNVS
KTFDQLLQE.A
PLGSGYYKSV
EF018-1 (SEQ ID TGTCATTACA ACGATACCAA TTTTAATCAT TTATCCATTA CTACAAAAAC ACTTTATCGG CGGTATGATG GCCGGTGCAG TAAAAGAATA AAGAAAGTAG GGAACAATAT OAAAAAAGTT TABLE 1. Nucleotide and Amino Acid Seqeuenccs of E. faecalis Genes.
TTAGCGCTT
AAGAAAGCTA
GCTTGGAACT
TGGACGCGC
GTGGATATTG
GATATGCCTG
TCTTGGGCCT
GTTAACCAAG
AATTACTCTA
GTTATTCGTG
TTTGAAAAAG
ACCACAG-TGG
CCTTTAGAGG
TGGTTAAAAA
GATGATCGGG
GCTGGAACCA
TATATAGCCA
GGCAT'ICAG
ACTCAACTCT
GAAGGAGTTA
AAATTAGAAC
TACTTTAACA
ATGCAACAAT
GATGCAGGAA
GTTAT'ICAA
AAAGCATTCT
GCGGAAAACA
TATTCCTGCC
GCTCAGATGT
TAGATAAGAA
TCC'TT=
AATTTA=IC
ATATCGTGAC
ATTCT'IAAA
ATACC ETTAA
ATACAAAAGC
AAGATCTCTA
TCATCCAACA
GCGATGGTGC
ATAAAAATGG
CATTTAATGA
CAACGTTTGA
GTGGTCAAGG
TTGATGCACC
GTITGGTTAAG
TCACATATTT
CTTATCC-GTA
AA.ACAGACA-A
ACGACCGTGT
CCGGCAAACG
CCCGGAAGC
TGATTCGTGC
TAAAATCAAA
GAGACAAACT
AACGGCGGTC
CTCAATAAG
GAAAGAACCG
AAAACACGTG
CGGCGATGAT
AI'AACTGAA
CGAT-TTACCT
ATGGTATGCC
GGATTATGAA
TAATGCATTA
GATTAAAGAA
AGGACCATTT
TAAATACTAT
TGTTTACCGA
TGAAAAGTG
TGGGAACTTC
AAGTAGCACT
TAATTACATT
AATTGATGAA
CAATCATCAA
TGATGCCTAC
CAATAAACTA
AAAATTAGTC
TCGTGCGAAT
GAAAGATGAT
TAAATGGGAT
TAAGTAA
GTTAGTTTAG
GATCGGTATG
ACCAAGAMfA
ACCACTCCC
TCAAkAATTAA
AAAACTGGAC
AAA.AAATATG
TTAGAGGATG
AGTGGAAATA
GGCAAGCCAG
AAATATCCTC
TTAGACAAAT
GATCGAAATT
GCAGGCAATA
AAACAAGGAA
ACAGAATTTA
TCTGGCCGAA
ACGAAAGATG
CCGGGACAAA
GGAAAAATTG
AACAAAAA.AT
AAAGTACCAA
CCACATTTCG
GAAGCGATTG
AAAGCCI=G
GCAATTGAAA
CGGCCTGTAG
A.ATTAGATGA
AATGGTATAT
AGATTAAAAA
ATGCCATGAT
AAGCAGCALr
ACCCCTA'TT
GAAAAACATA
TCCCAGTAAA
ACGTTI'CAAC
ACATGACCCC
TACAAGACTI'
TAGATAAAGA
TTAGtG'ATGA
ATTATGCAAC
TGAAAAAATC
AACCAACATI'
CGAA-AGATCC
TTTTAACAAA
ATTATTTACC
ATCCCATTAG
TGGAAAGTGC
TAATTGAAAA
A.AACCAAACT
ACAAATCT
AAATAAAATC
CGGTGGGGAA
AAAGACGCCT
TAACTCAGAT
AGACTTAAAT
TTCAACTGGA
GAAAGCAGAT
AATGAAAGTT
TrGGTTACCCT
TGATAAT
ACCAGAAAAT
AATCCGCTTC
CTTAGGTGTT
ATATTTAGAA
TACCTTCACA
CATGCTCGTT
TGGCACACGT
AAATCAAACC
AGCAAAAGTC
ATATGGCGTTr
AGAAGTGAAA
TCGTFI'CCTA
TTTAACGCA-A
TATTAATCCA
AAATACAACC
AGAAGACTAC
TGAGAAAATG
CO. 00** :*o EF018-2 (SEQ ID NO:66) MKKV LCGLLVATAV VSLAACSGGE
KKA.SSDVSIK
VDIEFISGDD
VNQDTFKWYA
FEKVMQQ IKE
WLKTFNDVYR
YIAIDGPSST
EGVTYAYNDQ
MQEWGKCKLV
KATFLKSNKWD
DRYELDEKTP
SKLNAMISSG
LEDGKTYGYP
KYPEMTPMGF
ACNISDDSFT
SGRKPTLNQT
GKIDYLPEVK
PHFVIENINP
AIEKIKSEKM
AWKLDKKKEP
DMPDIVTLTE
NYSNTKADYE
TTVGDGAGPF
DDCATFDEKV
GISGWLSNYI
KLEQTDNDAY
DAGTPEARAN
AENROKLK
TKIKWY INSD
KTGQAALKAD
SGNI PVNDNF
LDKLQDFLGV
KQGNYATMLV
TKDAYXDPA.KV
NKKYGISRFL
EAIETKLNTT
WTALPFGKDV
SWAYSLNDLA
VIREDVYNAL
PLEDKNGKYY
AGTSGQGGNF
TQLFTYLI DE
YFNNDRVNKL
VISMIRAKDD
TTAQ IKKDLN
KKYDPYLMKV
GKPOVSTPEN
DRNLDKEYLE
TEFMXIKSCTR
PGQILTKYGV
KVPMESALTQ
KAFDKSLEDY
EF018-3 (SEQ ID NO:67) CTGTAG CGGTGGGGAA AAGAAACCTA GCTCAGATGT GC I=GGAAGT TAGATAAGAA TGGACGGCGC TGCCTTTCGG GTGGATATTG AA ATTTC GATATCGCCTG ATATCGTGAC TCTTGGGCCT ATTCTT='AAA GTTAACCAAG ATACGTI'TAA AATTACTCTA ATACAAAAGC
CTCAATTAAG
GAAAGAACCG
AAAAGACGTG
CGGCGATGAT
A ITAACTGAA
CGATTTAGCT
ATGGTATGCC
CCATTATGAA
TAATGCATTA
GATITAAAGAA
GATCGGTATI
ACCAAGATTA
ACCACTGCGC
TCAAAATTAA
AAAACTGGAC
AAAAAATATG
TI'AGAGGATG
AGTCCAAATA
GGCAAGCCAG
AAATATCCTG
AAI'AGATGA
AATGGTATAT
AGATTAAAAA
ATIGCCATGAT
AAGCAGCATT
ACCCCTATTT
GAAAAACATA
TCCCAGTAAA
ACC PFCAAC
AGATGACCCC
AAAGACGCCT
TAACTCAGAT
AGACTTAAAT
TTCAAGTGGA
GAAAGCAGAT
AATrCAAAGTT TGG ITACcCT
TGATAATTT'
ACCAGAAAAT
AATGGGCTTC
GTTATTCGTG
TI'TGAAAAAG
AAGATGTCTA
TCATGCAACA
98 TABLE 1. Nucleotide and Amino Acid Seqeuenccs of E. faecalis Genes.
ACCACAGTG
C CTTTAGAGG
TGGTTAAAAA
GATCATGGGG
CMGAACCA
TATATAGCCA
GGCATTTCAG
ACTCAACTGT
GAAGGAGTTA
AAATTAGAAC
TACTI'AACA
ATGCAAGAAT
GATGCAGGAA
GTATTTCAA
AAAGCATTCT
GCGGAAAACA
rGCGATGGTC
ATAAAAATG'G
CATTAATGA
CAACGTITTGA
GTGGTCAAGC
11GATGGACC
GTTGGTTAAG
TCACATATTT
CTTATGCGTA
AAACAGACAA
ACGACCGTGT
GGGGCAAAGG
CGCCGGAAGC
TGATTCGTGC
TAAAATCAAA
GAGACAAACT
AGGACCATTT
TAAATACTAT
TGTTTACCGA
TrGAAAAAGTG
TGGGAACTTC
AAGTAGCACT
TAATTACATT
AA'ITGATGAA
CAATGATCAA
TGATGCCTAC
CAATAAACTA
AAAATTAGTC
TCGTGCGAAT
GAAAGATGAT
TAAATGGGAT
TAAG
TTAGACAA-A-
GATCGAAAT'-
GCAGGCAATA
AAACAAGGAA
ACAGAATTA
TCTGGCCGAA
ACGAAAGATG
CCGCGACAAA
GGAAAAATTG
TACAAGACTT
TAGATAAAGA
TTAGTMATGA
ATTATCCAAC
TGAAAAAATC
AACCAACAT
CGAA-AGATCC
TT'rAACAAA
ATTA=TACC
CTTAGGTGTT
ATA'I=ACAA
TAGCTTCACA
CATGCTCGTT
TGGCACACGT
AAATCAAACC
AGCAAAACTC
ATATGGCGTT
AGAAGTGAAA
TCG TrCCTA
TTTAACGCAA
TATTAATCCA
AAATACAACC
AGAAGACTAC
TGAGAAAATG
AACAAAAAAT ATGGCATT-AG AAAGTACCAA TGGAAACT-GC CCACATTT~CG TAATTAAAA GAAGCGAT7G AAACCAAACT AAACCCTTTG ACAAATCTTT GCAATTGAAA AAATAAAATC EF018-4 (SEQ ID NO:68)
C
C*
C.
*CSGGE
KKASSDVSI(
VDIEFISGOD
VNQDTFKWYA
FEKVMQQIKE
WLKTFNDVYR
YIAIDGPSST
EGVTYAYNDQ
MQEWGKGKLV
KAFLKSNKWD
DRYELDEKTP
SKLNAMI SSG
LEDGKTYGYP
KYPEMTPMGF
AGNISDDSFT
SGRKPTLNQT
GKIDYLPEVK
PHFVIENINP
AIEKIKSEQ4
AWKLDKKKEP
OMPD IVTLTE
NYSNTKADYE
TTJGDGAGPF
DUGATFDEKV
GISGWLSNYI
KLEQTDNDAY
DAGTPEARAN
AENRDKLX
TK IKWY INSO
KTGQAALKAD
SGNIPVNDNF
LDKLQDFLGV
KQGNYATMLV
TKDAKDPAXV
NKKYCISRFL
EAIETKLN--,
WTALPFGKDV
SWAYSLNDLA
VIREDVYNAL
PLEDKNGKYY
AGTSGQGGNF
TQLFTYLIDE
YFNNDRVNKL
VISMIRAODD
TTAQIKKDLN
KKYDPYLMKV
GKPDVSTPEN
DRNLDKEYLE
TEFMKKSGTR
PGQILTKYGV
KVPMESALTQ
KAFDKSLEDY
EF019-1. (SEQ ID NO:69)
TAAAGGAGTT
CTTGGTTCAT
AAAACACATG
CGGATTATrG
CAATGGACAG
CCCACTATT
TTAATCAGTT
CCAACTTATG
GCCACTGTTT
ACCAAAGGCG
TGGGTA.ACCA
TATCAGGACT
GCGGATTGGA
CTTGTAAACA
GCTGTGAAAA
CCTArI'GCGA ACACAATGAA ACTT=AAAA
TCTTACTCGC
AAGTAACAGA
CGAGTTA=T
F1GGACAAGG
CCTATGACTT
CATCTGCTCT
TAGTCAAAAA
TAGATAAAAA
TCCAAGAATA
ACAACCAAGT
TAGGCCTCCA
ATCAAGTTI'C
GCGATGAATC
ATAACCAAGT
ATACTCAAAT
AGCTTGTGGT
TACCTTAGGC
AGAAGATTAT
CAGCATTCAA
GCCATATGAA
AGTTGAAGGC
CGGCGAAAAT
AGAACAAGCG
TCTTCGCAAA
CTTTATGGTT
AGTI'CCAAAA
T'ITAGAAAAA
AGCACCTCTT
TCATACCTAT
TGTTGAAGAT
AAGACGGTCC TAATTGGTAC AATACGAATA AAGA.AGCCAA AATAAAGTAA CCGTCCCCGC CTAGTTGCAT TAG GAGAAAA
GATTATTIAG
GCGGTTCTAA
GGTAAATACA
GTCACCTCGC
AAAAAAGTGT
AAAGATGCTG
AGCGATAATC
TTAGTGGAAG
TTAGCTGAGC
TTCCA-AGAAG
CGAAAGAATT
AATT~TGAACC
AAGAATACAG
GTATCAAT
TAGAAGATTA
GCAAATCTGC
GCTCAAGCGG
AAATTCTAA
TTGACGCAGA
CAATTTGGAA
AACCCTTCTT
CAACGCTGAC
GAAACCCAAA
ACCAGTGGCA
GAAAGATGTC
TGACTTATTA
TAAAATTGCG
GGAAGATATT
TGATACCTI'A
GGCAGTCTTA
AACCGTGCTC
AAACGCTACT
CCACATTTTC
GAACTTACCT
GATAAAAAAA GTAGTTCGGTT ATACAACGGA GTAAAAAAAG CGCTCT-TAAA 'ITAA EF019-2 ((SEQ ID MKLLKK TLIGTITLLL CSFLIJAACGN TNKEANNADK THFJTDTLGN KV'IVPAKPKR IIASYLEDYL VALGEKPVAQ WTVGQGSIQD YLAKELKDVP TISYDLPYEA VLKFEPDLLL 99 TABLE 1. Nucleotide and Amino Acid Seqeucnccs of Efaecalis Genes.
ISSSALVEGG KYKEYSKIAP TYVVKNqGENV TWRDQLEDIA TVLDIKKEQAK KVLEDYDTLT KGVQEYLGKK DAGKSAAVLW VTNNQVFMVS DNRSSGT\JLY QDLGLQVPKL 'JEEISKNATA DWNQVSLEKL AELDADHIFL VNSDESAPLF QE-AIWKNLPA VXNNQVHTYD KKSSWLYNGP IANTQIVEDV KKALLN EF019-3 (SEQ ID NO:71) TTGTGGT AATACGAATA AAGAAGCCAA CAACGCTGAC
AAAACACATG
CGGATTATI'C
CAATGGACAG
CCCACTAT
TTAATCAGTT
CCAACTTATG
GCCACTGTTT
ACCAAAGGCG
TGGGTAACCA
TATCAGGACT
CCCAT'rGGA
CTTGTAAACA
GCTGTGAAAA
CCTA'ITGCGA
AAGTAACAGA
CGAGTTATTT
TTGGACAAGG
CCTATGACTT
CATCTGCTCT
TAGTCAAAAA
TAGATAAAAA
TCCAAGAATA
ACAACCAAGT
TAGGCCTCCA
ATCAAGTTTC
GCGATGAATC
ATAACCAAGT
ATACTCAAAT
TACCTTAGGC
AGAAGATTAT
CAGCATTCAA
GCCATATGA.A
AGTTIGAkAGGC
CGGCGAAAAT
AGAACAAGCG
TCTTGGCAAA
CTTTATGGTT
AGTTCCAAAA
TTTAGAAAAA
AGCACCTCTT
TCATACCTAT
TGTTGAAGAT
AATAAAGTAA
CTAGTTGCAT
G-ATTATTTAG
GCGGTTCTAA
GGTAAATACA
GTCACCTGGC
AAAAAAGTGT
AAAGATGCTG
AGCGATAATC
TTAGTIGGAAG
TTAGCTGAGC
TTCCAAGAAG
GATAAAAAAA
GTAAAAAA.AG
CCGTCCCCGC
TAGGAGAAAA
CGAAAGAATT
AATTTGAACC
AAGAATACAG
TAGAAGATTA
GCAAATCTGC
GCTCAAGCGG
AAATTTCTALA
TTGACGCAGA
CAAT=~GAA
GTACGTTGGT'
GAAACCCAAA
ACCAGTGGCA
CAAAGATGTC
TGACTTATTA
TAAAATTGCG
GGA-AGATATT
TGATACCTTA
GGCAGTCTTA
AACCGTGCTC
AAACGCTACT
CCACATTTTC
GAACTTACCT
ATACAACGGA
S
S
CGCTCETAAA T EF019-4 (SEQ ID NO:72) CGN TNKEANNADK THEVTDTLGN KV'IVPAKPKR IIASYLEDYL VALGEKPVAQ WTVGQGSIQD YLAKELKDVP TISYDLPYEA ISSSALVEGG KYKEYSKIAP TYVVKNGENV TWRDQLEDIA TVLDKKEQAK KGVQEYLGKK DAGKSAAVLW VTNNQVFMVS DNRSSGTVLY QDLGLQVPKL DWNQVSLEKL AELDADHIFL VNSDESAPLF QEAIWKNLPA VKNNQVHTYD IANTQIVEOV KKALLN
VLKFEPDLLL
KVLEDYDTLT
VEEISKNATA
KK S SWLYNG P EF020-1 (SEQ ID NO:73) 5* 5 .5 *5
TGAGGAGATG
ACATTAACTG
ATAATGAAAG
TATAAAGATA
ACTAGCGCAT
AGTAAAGTAA
GGGAAATrrA
AGAAAATGAA
CATGTAATGG
ATTCTTCATA
AAGACACTAA
TGGAGTAT
CCTGrATGA
AATAA
AAAGGTAGTT
TrCTAAATTA
TGGTGATGAA
TCGTTATTTG
TTATTATTAT
TATGAAAGCT
TCAAT=TGT
GATAAAACAG
TATTCAGA.AG
GCTGATGTT'
GATGAAGATA
ACTGGAAACT
TGATGGTTGT
GTGAAGA.A T
ATIGGT=P'AG
GGGTTCCTCT
AGCGATTAGA
ATIGAAGTAGT
TGCAGTC'ITC
TAAAA.ATTCT
TTTTTAATA
TAAAGATGAA
TAGTACTAAA
GTATAAATCA
EF020-2 (SEQ ID NO:74) MXKVVS ILLMVVAVFT LTACNGSKLD KTGEEFKNSI MKDSSYGDEY SEDCFSFLIY KDKDTNRYLA DVWVPVKDET SALEYFYYYD EDKRLDSTKS KVTFDDMKAS GNYEVVYKSG
KFK
EF020-3 (SEQ ID ATGTAATGG TTCTAAATA GATAAAACAG GTGAAGAATT TAAAAATTCT TABLE 1. Nucleotide and Amino Acid Scqeuences of Efaeca/is Genes.
ATAATGAAAG ATTC'FrCATA TGGTGATGAA TATTCAGAAC TATA.AAGATA AAGACACTAA TCGTTA'FFG GCTGATGC=1 ACTAGCGCAT TGGAGTATI TTATTATrAT GATGAAGATA AGTAAAGTAA CCTTTGATGA TATGAAAGCT AGTGGAAACT GGGAAA=TA AA ATGGTTrAG
GGGTTCCTGT
AGCGATTAGA
ATGAAGTAGT
'FFFITAATA
TAAAGATGAA
TAGTACTAAA
GTATAAATCA
EF020-4 (SEQ ID NO:76) CNGSKLO KTGEEFKNSI I MK(DSSYGDEY SEIDGFSFLIY KDKDTNRYLA DVWVPVKDET SALEYFYYYD EDKRLDSTI(S KVTFDDK(AS GNYEWVYKSG
KFK
EF021-1 (SEQ ID NO:77) a.
a 4* a a a a a a a
TAGTTGTTTA
TTATTCGT
GGCAAAGGCA
ATCATTACAG
TTGCAACT
CAATCGAA E,
AACACAATCT
AACCCTGATA
TCTGCAACAT
ACAAAAACGA
CAAGCTGGTT
GTTGATACCA
GCAATGTACC
CTCTTCCAAG
GGCGTTGACC
AACTTCACGT
CGTGCGTTAG
GGCGTTGACT
AAAGATAAAG
AATACATTAA
TTAGTTTGAT
AAACCGCTGA
ATACAGGCGG
GGGGTAAAGA
ATGCAGCTGA
TTGGTATTGG
CAAAC'TI'GT
TTAGAGATAA
ACAA.AGTCGG
TTGAAAAAGG
AATATGCGGC
AAAACGGCGT
AAGCAAAAGA
GCGATCAAGA
TAACTTCAAC
AAGACAAATI
TAACAGACGG
TAATCTCAGG
ACTATTTTA
TGCATTAGOT
AAGCGGCGGT
CGTGGATGAC
ACATGATTTA
CTATACAACC
CTACTTGCTA
TTTAATCGAT
TGA.AGCAGCT
TTTTGTTGGT
TGTGGCTGAT
TTCATTGrCT
TGATATCATC
CTTGAATGAA
TGCTGATGGC
GCITAAAGGT
CCCTGGTGGC
CTATTTAAAC
TGACGTAAAA
GGAGGCTTTA CAGAAATGA-A TTATCAGTTT CACTI'GAGC GGCAAAGGGG ATGCAGCGCA AAGTCGTTCA ACCAATCTTC CCAGAAGGTT CAAAAGC.GTA AATATTGACC AAGCGGTATC AAAGATGCAA TTTCTTCTGC GATCAAATCG ATGGCAAAA.A TACT1'AGCCG GTGTTGCTGC GGTGAAGA-AG GGGTCGTAAT
AAAAGCAAAA
ATGTGGTGGT
TAGTGCTGTA
TTGGGAAGGA
TGCATATATT
AAGTAAATTC
AGCAGATGCC
GAATGTCGTT
TGCAAATGAA
TGACCGTTTC
AGAAATTACT
TTTAGCTGCT
TGGACAAGGG
'PTGGGTAATC
CAAAGAAGAC
TATTGCCAAC
AAAAGATGGT
TAAAACAGCA
GCTGCGA-AAG
GATCCTCCCA
TTCCATGCTT
TCAGGTTCTG
AAGTACAAAA
GTCGG3CACAG
GAACATTTAG
GACAAAACAA
GTCCCAGAAA
AATTIAGGTA-A
AAGGGAA.AGC
CTGGT'GCGAC
GCGACAAAGT
CA.AAAGACGG
CGGTTAAGA
=I'GGATT
AAGAAG^C-GT
a AACCACAATA A EF021-2 (SEQ ID NO:78) MKKAXL FGFSLIALGL SVSLAACGGG KGKTAESGGG KGDAAHSAVI ITDTGGVDDK SFNQSSWEGL QAWGKEHDLP EGSKGYAY IQ SNDAADYTTN TIFGIGYLLK DAISSAADAN PDTNFVLIDD QIDGKKNVVS ATFRDNE-AAY KTNKVGFVGG EEGVVIDRFQ AGFEKGVADA AKELGKEITV DTKYAASFAD MYQNGVDIIF HASGATG-QGV FQE.AKDLNES GSGDKVWVIG VDRDQDADGI( FTLTSTLKGv GTAVQDIANR ALEDKFPGGE HLVYGLKDGG VDLTDGYLND DKVISGDVyKV PEKPE
IDQAVSSKFN
LAG VAAANET
PAKGKALAAA
YKTKDGKEDN
KTKEAVKTAK
EF021-3 (SEQ ID NO:79)
ATGTGGTGGT
GGCAAAGGCA AAACCGCTGA AAGCGGCGGT GGCAAAGGGG ATGCAGCGCA ATCATTACAG ATACACGGCGG CGTGGATGAC AAGTCGarrCA ACCAATCITC TTGCAAGCTT GCGGTAAAGA ACATGATTTA CCAGAAGGCTI CAAAAGCGTA CAATCGAATG ATGCAGCTGA CTATACAACC AATATTGACC AAGCGGTAI'C
TAGTGCTGTA
TTIGGGAAGGA
TCCATATATT
AAGTAAALTC
101 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. fiacalis Genes.
AACACAATCT
AACCCTGATA
TCTGCAACAT
ACAAAAACGA
CAAGCTGGTT
GTTGATACGA
GCAATGTACC
GTCTTCCAAG
GGCGTI'GACC
AACTTCACGT
CGTGCGTTAG
GGCGITGACT
AAAGATAAAG
TTGGTATTGG
CAAACTTTGT
TTAGAGATAA
ACAAAGTCGG
TTGAAAAAGG
AATATGCGGC
AAAACGGCGT
AAGCAAAAGA
GCGATCAAGA
TAACTTCAAC
AAGACAAATT
TAACAGACGG
TAATCTCAGG
CTACTTGCTA
TTTAATCGAT
TGAAGCAGCT
TTTFIGTTGGT
TGTGGCTGAT
TI'CATTTGCT
TGATATCATC
CTTGAATGAA
TGCTGATGGC
GCTTAAAGGT
CCCMTGGC
CTA=TAAAC
TGACGTAAAA
AAAGATGCAA
GATCAAATCG
TACTTAGCCG
GGTGA.AGAAG
GCTGCGAAAG
GATCCTGCCA
TTCCATGCTT
TCAGGTTCTG
AAGTACAAAA
GTCGGCACAG
GAACATTrAG 6ACAAAACA.A
GTCCCAGAAA
TTTC'ITCTG~C
ATGGCAAAAA
GTCTTGCTC
GGCGTCGTAAT
AkATTAGGTAA A.AGGGAAkAGC
CTGGTGCGAC
GCGACAAAGT
CAAAAGACGG
CGGTTCAAGA
TTTATGGATT
AAGAAGCTGT
AACCAGAA
AG CAGAT C C
GAATGTCGTT
TGCAAATGAA
TGACCGTTTC
AGAAATI'ACT
TTTAGCTGCT
TGGACAAGGG
TTGGGTAATC
CAAAGAAGAC
TATTGCCAAC
AAAAGATGGT
TAAAACAGCA
EF021-4 (SEQ ID 0 0 0 0* 00 @0 0 0 0 0@ 0* 0 0 *00 00 0 0 000 0 000000 0 00*0 0 .000 0 000 0 0000 0000 0 0* 0 00 CGGG KGKTAESGGG KGDAAHSAVI ITDTGGVDDK SFNQSSWEGL QAWGKEHDLP TIFGIGYLL( DAISSAADAN PDTNFVLIDD KTNKVGFVGG EEGVVIDRFQ AGFE-KGVADA MYQNGVDIIF HASGATOQGV FQEAKDLNES FTLTSTLKGV GTAVQDIANR ALEDKFPGGE DKVISGDVKV PEKPE EGSKGYAYI Q
QIDGKKNVVS
AKELGK(EII'V
GSGDKVWVIG
HLVYGLKDGG
SNIDAADYTTN
ATFRDNEAAY
DTKYAASFAD
VDRDQDADGK
VDLTDGYLND
IDQAVSSKFN
LAG VAAANET
PAKGKALAAA
YKTKDGKEDN
KTKEAVKTA(
EF7022-1 (SEQ ID NO:81)
TAAGAGCATA
ACAATGGT
AAAAAGAAAC
ACGCTCAACA
GAAGGGTTAT
CCGATGA ITT
AGTAACGATG
CCTAAAAACG
ATCTCAGCGG
TTA.AAGGrIGA
TTITTTCCCC
GATAAAGTCG
TGGCAACTAG
AATTATACAG
GATGTGGCTA
TCGTATCCAA
ACGCCGCrrG
CTAGTCAATA
TTTGTGGCGA
TATAACAAAG
GTTAACG LTG
CAAGGCTCGC
GAAGCTGCAT
CCAGACTATC
TATCAGAACC
CCAGAAAAA.A
AAAAAATGAA
GTATTTTATT
AGAAAAATAC
CCTCTGTATT
ATAGT IAGA
CAGAAGATGG
ATCCTGTCAC
GCTTTGT'ITA
GGAAATTAGC
CGCTCAAAGA
AAAATCNAAA
TCTATAATGG
CAAAA.AATAA
TTATCAAAGA
CACTAAGTGG
CAGCGACAAT
CAAACGAAAA
ATATTA'ITGC
ATCCCACAAC
AAAAAGCGCA
AArTrGATGGT
TACAAGAATT
TGAACTI'TGG
AAGACCCTAT
CTGTCTATGA
GATGGGCGAC
GAGTTATAGG
GGTAGGATTT
CAAAGAAGCC
ATTGGATTTTI
TGAACA.AGAC
AAAAACCTAC
AGCACATGAT
TAGCTTCCTC
ACCCAATGAA
GCCAAAACCG
AGTAGTCGAA
TCCGTTCGTG
TCGCTATTGG
AACATCTACC
TGAACTGGCG
GAACTATTTG
CCTGCGTAAA
AGATGGTTCT
GGGTCTCGAT
AAGTTATTGG
AACAGATGAT
GTTTCCTGGT
GCGAGAAAGT
TTCTACCCTG
CAAATI'A?'A
ACTGATTGCA
AGAAAGAAGA
TTAGCTGGGT
GTTCAACTGA
CCAGATGCTA
CAATTGGTAC
ACGATTTCTT
TTTGAATATG
ATCGTTGAA-A
CTAGGTGTCA
TACTTTACGT
CAATTTGGTG
GTAAAAGAT
GATCACCAGA
GCATTGAATC
CAACAGAATA
CGCTTAAATC
GCATTGGCTT
AAAGCGCTAC
T'FTCGTCAAG
AAAAAAGCAC
GGTTCTTACA
TTGACAATAG3
GACTATGATT
ATGACTTrAT
GATGAAGCAG
GCTGAAAAAG
TGAAAAAGTA
GTACCAATAA
TGTCACCCTC
TTGTCCAAAC
CAGCCGTAGC
TGAGAAAAGA
CTTGGAAAAA
CAATrCAAAA
CAGCTGTGGA
CC=TTTAGC
CGGACTATGG
GGCAGCAAAC
ACGTGCGCTC
TTTTIAAGA
AAAATAATAC
AAAAACGGNA
TAGGAATAGA
ATGGTGCGAT
AAGCAGGTAA
AAGCAGAATT
AAAAAATTGG
AGCTAACCGC
TATTCITrAAT
ACAAGGGCAA
CCACAACCTA
AAGTGATTGA
'rTTAAAAATC
AAATGAAAAT
GGAATTAACA
TGCAGCGTTT
AAAAGCATG
AGCGGTTTG
AATGATTGAT
TGGTGCAGAA
TGATTATACA
TTTTCCGACA
AACTGCTAGT
AAAGATGGAC
AGACATTATC
TGGACAATTA
GTTGTATCAT
AGGGCAAGCN
TAAAGAAAAT
TACGGAAGGC
TTTAATGGTT
AGGAGAAAAG
TGAAAGT ITG
A'ITGCCGACT
TTACTGGACA
TGATCGCAAT
TGCCTTAGAG
AACGACTGCT
102 TABLE 1. Nucleotidc and Amino Acid Seqeuences of E. .aecalis Genes.
GGCATCATTC CACTTAGCCA AAATGALACAA ACAGTCCTGC AAAATGATAA AGTCAALAGGC FIGSAATTTTC ATACCT TCrG CGCTCCATTA ACGTTAAAAA ATGTTTATAA GGAAAAATAA EF022-2 (SEQ ID NO:82) MKKYLK IT MVC ILLVGFL AGCTNXNENK KKQKNTKEAV QLMSPSEL-T
LNTSVLLDFP
NODPVTAHDF
KVTLKEPKPY
QL.AKNNRYWD
YPTATMNYLR
VANPTTGLDF
GSLQELFPGL
QN'PVYDKLLD
NFHTFGAPLT
DAIVQTAAPE
EYAWKXMIDP
FTSLLAFPTF
HQNVRSDIIN
LNQKRXGQAT
RQEAGNLMVY
TIELTALPTE
EAATTYALEP
LKNVYKEK
GLYSLDEQDQ
KNGFVYSFL I
FPQNXKVVEQ
YTVIKETSTA
PLANENLRKA
NKEKAQSYWK
AALNFGRESD
EKRWATL IAA
LVPAVAKALP
\TETIQNGAEI
FGADYGTASO
LNLFEDGQLD
LALD IDKENL
KAQAELGEKV
YDLFLIYWTP
EKEV'IETTAG
MISEDGKflIT
SAGKLAPNEL
XVVY.NGPFVV
VATLSGELAQ
VNNI IADGSK
NVE-LMVTDDG
DYQDP ISTLM
MIPLSQNEQT
I SLRKE-AVWS
GVTAVIDDYTL
KDWQQTKMDW
QNKNNTLYHS
ALHGAITEGF
SYKKIGESLQ
TLYKGNDRNY
VLQNDKVKGL
00 0 @0 0 4ss Vs 6 0 06000 00000 0*00 00 0 EF022-3 (SEQ ID NO:83) GT GTACCAATAA AAATGAAAAT
AAAAAGAAAC
ACGCTCAACA
GAAGGGTTA.T
CCGATGATTT
AGTAACGATG
CCTAAAAACG
ATCTCACCGG
TTAAAGrGTGA
TTT'ITCCCGC
GATAAAGTCG
TGGCAACTAG
AA ETATACAG
GATIGTCCCTA
TCGTATCCA-A
ACGCCGCTTG
CTAGTCAATA
TTTGTGGCGA
TATAACAAAG
GTTAACCTTG
CAAGGCTCGC
GAAGCTGCAT
CCAGACTATC
TATCAGAACC
CCAGAAAAAA
GGCATGATTC
TTCA.ATTTTC
AGAAAAATAC
CCTCTGTATT
ATAGTI'TAGA
CAGAAGATGG
ATCCTGTCAC
GCTTTGTTTA
GGAAATTAC
CGCTCAAAGA
AAAATCNAAA
TCTATAATGG
CAAAAAATAA
TTATCAAAGA
CACTAAGTGG
CAGCGACAAT
CAAACGAAAA
ATAT'ATT'CC
ATCCCACAAC
AAAAAGCGCA
AATTCOATGGT
TACAAGAATT
TGAACTT1'GG
AAGACCCTAT
CTGTCTATGA
GATGGGCGAC
CACTTAGCCA
ATACCTTTGG
CAAAGAAGCC
ATTGGNFI
5 Fr
TGAACAAGAC
AAAAACCTAC
AGCACATGAT
TAGC'ITCCTC
AC CCAATGAA
GCCAAAACCG
AGTAGTCGAA
TCCGTTCGTG
TCGCTATTGG
AACATCTACC
TGAACTGGCG
GAACTATI'T
CCTGCGTAAA
AGATGCTTCT
GGGTCTCGAT
AAGTTATTGG
AACAGATGAT
GTTTCCTGGT
GCGAGAAAGT
'TCTACCCTG
CAAATTATTA
ACTGATTGCA
AAATGAACAA
CGCTCCATTA
GTTCAACTGA
CCAGATGCTA
CAATTGTAC
AC-ATrC T
TTTGAATATG
ATCGI'GAAA
CTAGGTGTdA
TACTTTACGT
CAATTTGGTG
GTAAAAGATT
GATCACCAGA
GCATTGAATC
CAACAGAATA
CGCTTAAATC
GCATTGGCTT
AAAGCGCTAC
TTTCGTCAAC
AAAAAAGCAC
GGTTCTACA
TTGACAATAG
GACTATGATT
ATGACTTTAT
GATGAAGCAG
GCTGAAAAAG
ACAGTCCTGC
ACGTTAAAAA
TGTCACCCTC
TTGTCCAAAC
CACCCTAC
TGAGAAAAGA
CT-GGAAAAA
CAATTCAAAA
CAGCTGTG.GA
CCTTGTTAGC
CGGACTATGG
GGCAGCAA.AC
ACGTGCGCTC
TIITGAAGA
AAAATAATAC
AAAAACGGNA
TAGGAATAGA
ATGGTGCGAT
AAGCAGGTAA
AAGCAGAAT
AAAAAATTGG
AGCTAACCC
TATTCTTAAT
ACAAGGC-CAA
CCACAACCTA
AAGTGAT'rGA
AAAATGATAA
ATG=TATAA
GGAATAACA
TGCAGCGTI'
AAAAGCATTG
AGCGGTTTGG
AATGATTGAT
TGGTGCAGAA
TGATTATACA
TTTCCGACA
AACTGCTAGT
AAAGATGGAC
AGACATTATC
TGGACAATrA
GTI'GTATCAT
AGGGCAAGCN
TAAAGAAAAT
TACGGAAGGC
TTTAATGGTT
AGGAGAAAAG
TGAAAGTTTG
ATTGCCGACT
TTACTGGACA
TGATCGCAAT
TCCTTAGAG
AACGACTGCT
AGTCAAAGGC
GGAAAAA
EF022-4 (SEQ ID NO:84) CTNKNENX KKQKNTKEAV QLMSPSELTT LNTSVLLDFP DAIVQTAAFE CLYSLDEQDQ LVPAVAKALP M7ISEDGKTYT ISLRKEAVWS NDDPVTAHDF EYAWKKI'.IDP KNGFVYSFLI VETIQNGAEI SAGKLAPNEL GVTAVDDYTL 103 TABLE 1. Nucleoride and Amino Acid Seqeuences of E. faecalis Genes.
KVTLKEPKPY
QLA-KNNRYWD
YPTATMNYLR
VANPTTGLDF
GSLQELFPGL
QNPVYDKLLD
NFHTFGAPLT
FTSLLAFPTF
HQNVRSDIIN
LNQKRXGQAT
RQEAGNLMWY
TIELTALPTE
EAATTYALEP
LKNVYKEK
FPQNXKVVEQ
YTVIKETSTA
PLANENLRKA
NKEKAQSYWiJ(
AALNFGRESD
EKRWATL IAA
FGADYGTASD
LNLFEDGQLD
LALGIDRENL
KAQAELGEKV
YDLFL IYWTP
EKEVIETTAG
KVVYNGPFWV
VATLSGELAQ
VNNI IADGSK
.WELMVTDDG
DYQDPISTLM
MIPLSQNEQT
KDWQQTKMIDW
QNKNNTLYIs
ALHGAITEGF
SYXKIGESLQ
TLYXCNDRNY
VLQNIDKVKGL
EF023-1 (SEQ ID
C
TAAAATGGAG
GCTTTAACGG
CAAA.AAATTG
GATACGACCT
GAAGATGAAT
GTTTATACAT
AACGACTTTG
GCTTTATTTT
AATACTCTTG
ACCCCTTATT
ATCAAAGAAC
TTTAAATTGA
ACGTATTGGG
CCAACGACAG
GGTGAATITA
ACATAC'TIT
AACAATATTC
TTGCAAGATC
GATGcGAACAG
GCGAAAGCAA
TTAGAATTTT
CAATTGGAAG
CGTGTTGATC
TATCGTGATC
ACGTTCAAGA
GATCAAGAGG
ACGGTTTTAG
GATCTCTATT
GGATCGGTAT
CTTGTCAGGC
CAATTAGTTC
CGACTTTAGT
TAGAGTTGGG
TTAAGATTAG
TTTATGCATG
TTGATGTCAT
GGOTTAAGGC
TGAA.ATCATT
AAGGGGATAA
AAGAATGGG'A
AMTCAAAA
CGGTGAA=~
TCCTGGTTA
TAAAAATCAA
GTAAAGCGTI'
AATCAACGGC
ATTTCACAAA
AAGAATTCTG
TAACAGATGA
AAAATCTAGA
GTGATCAAAC
CA?1'AACAGT
GTGATACGTA
CTCCTTTAAA
CACCAATCTA
GGCATTCATT
GAAGAAkATTA AAAATGTTAG GGGAACGGGA A ACTCGGCTG TGAAGCGGCT ATTTCGACAA CATGAATCAA CTTTATGAAG GGTCGCTGCC GAAGAACCAG AGAAGATGCC AAATGGTCGA GCAACAAGTT CCTTCCCCTA TAAAAATGCT AAGGAAATTG GCTAGATGAT AAAACGTTAG ACTTTLCG=T CCTC.TTTTGT ATATGCTACT GATGCAGAAC TAATGCCTCT TCTGATGACT AGTTAAATTA ACAGAAGCGA GrI-GACTCG AATGAATrGG TGTTGATAAT CCAGCCT=TC CAGCGTTCGT GATGGAAAAG GGCACAAGCT TTTGATAAAG TACAGATCALA GTAATTCCGC ACTAGCTGCT AAGAAAAATA GGAAAAAGGG AAAAAAGAAA TACAGACAGC GCCAAAAAAG TGGATTAGAA GTGAATGI'TA GAGAGACTAT GATITAGAAT TATGCGCATC TTTACTTCGG TGATCAATTA ATTCAAGAAA TGACTTTGCT CAAGCACAAG CAATCGAAGC AT= CTGTAT TGGACCCACG TACAGT= AA
GATGCGTCGG
ATAGTAACAA
TGGAACCACA
G.AGTd.'ATG.T
CGATTTCTGA
ATGATGATCC
AATCAGGATC
CTTTAGAACG
AAATAAC=T
'TCCACAAAA
ATTI'ATTTA
GGACCTACGA
AAGTTTCACT
ATGTAGTGA
TTTCA.ATTCC
AAAATCCGGC
AAAGTTTI'T
CGGGACAAAC
ACTACTTAAC
7MTGGCTGGA
CTGCTGAGTT
CTCAAGTTCC
TATCTGGTTG
ATAGTACCFI'
CTAGAACAAC
ATATI'TTGGT
TAGCTAATCA
AATGGGCTTA
G'ITGCTTTTA
AGCAGCGGAA
CACAGCGGGG
TTIAGGTAA.A
AGATGAAACC
AGTAACAGCA
GATTCATCAA
CGCAGATGTG
AGAACGGCCC
TGAAAAATAT
TAATGGTCCT
AAAAAATGAT
AATI'AAGAGC
TAAGCTAAGT
TCAATTCGTC
=I'AGCGAAC
AAAAGAAGTC
GATTGCGCCA
CTACGATACA
TAAAATCAAA
TT1'CCAATITT
TTTACTATT
GGGAACCGAT
GGGCGGCGTA
ACATGCGGCT
GAATCAGGAA
AAAAATCAAG
TGTI'AACTAA
EF'023-2 (SEQ ID 00:86) MKKLK MLGCVGLLLA LTACQAGTr.N SADSNKAAEQ KIAISSEAAI STNEPHTAGD TTSTLVM4NQV
DFVYAWQQVA
PYLKSLLSFP
YWDAEI{VKLT
YFLKMNSVRD
GTDFTKLAAX
LEENLDGLEV
FKSDTYDQLI
LYWHSF'GPTY
YEGLYVLGKE
SPKSGS IHQA
VLFPQNEKYI
EAKVSVIKSP
GKENPALANN
XNNYLTYDTA
NVTQVPFTIR
QETRTTHAAD
SLKWAYVN
DELELGVAAE
LFFDVIKNAI(
KEQGDKYATD
TTAVNLFDSN
NIRKALAQAF
KAKEFWEKGK
VDRDQTRDYD
QEARLNDFAQ
EPAISEDETV
EIALEGADVN
AEHLIYNGPF
ELDVVNI(LSG
DKESFVKEVL
KE IGLDKIKL
LELSGWGTDY
AQDILVNQET
YTFKI REDAX
TLGVKALDDK
KLE:WDNASS
EFI PGYVDNP
QDQSTATDQV
EFLTDDTDSA
ROPLTVMRIF
VLAPIYNRSI
WSNDDPVTAN
TLEITLERPT
DDWTYEKNDT
AFLS IPQFVT
IPPGQTIAPD
KKAAEFFQFQ
TSDSTLGGVT
SVLANQKIKD
104 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
EF023-3 (SEQ ID NO:87)
GGGAACOGGA
CAAAAAATTG
GATACGACCT
GAAGATGAAT
GTTTATACAT
AACCACTTTG
GCTTTATTTT
AATACTCTTG
ACCCCTTATT
ATCAAAGAAC
TTTAAATTGA
ACCTATTGGG
CCAACGACAG
GGTGAATTTA
ACATACTTTT
AACAATATTC
TTGCAAGATC
CATGGAACAG
GCGAAAGCAA
TTAGATT'
CAATTGGAAG
CGTGTTGATC
TATCGTGATC
ACGTTCAACA
GATCAAGAGG
ACGGTTTTAG
GATCTGTATTI
AACTCGGCTG
CAATTAGTTC
CG.ACTTTAGT
TAGAGTTGGG
TAACATAG
TTTATGCAT-G
TTGATGTCAT
GGGTTAAGGC
TGAAATCATT
A.AGGGGATAA
AAGAATGGGA
ATCCTGAAAA
CGGTGAATT
TTCCTGGTTA
TAAAAATGAA
GTAAAGCGTT
AATCAACGGC
ATTTCACAAA
AAGAATTCTG
TAACAGATGA
AAAATCTAGA
GTGATCAAAC
CATTAACAGT
GTGATACGTA
CTCGTTTAAA
CACCAATCTA
GGCATTCATT
ATAGTAACAA
TCAAGCGGCT
CATGAATCAA
GGTCGCTGCC
AGAAGATGCC
GCAACAAGTT
TAAAAATGCT
GCTAGATGAT
ACTTTCGTT
ATATGCTACT
TAATGCCTCT
AGTTAAATTA
GTTTGACTCG
TGTTGATAAT
CAGCGTTCGT
GGCACAAGCT
TACAGATCAA
ACTAGCTGCT
GGAAAAAGG
TACAGACAGC
TGGATTAGAA
GAGAGACTAT
TATGCGCATC
TGATCAATTA
TGACTTITGCT
CAATCGAAGC
TGGACCCACG
AGCAGCGGAA
ATTTCGACAA
GTTTATGAAG
GAAGAACCAG
AAATGGTCGA
GC ITCCCCTA
AAGGA-AATTG
AAAACGTTAG
CCTGTTTTGT
6GLTGCAGAAC
TCTGATGACT
ACAGAAGCGA
AATGA.ATTGG
CCAGCC=rC
GATGGAAAAG
TTI'GATAAAG
GTAATTCCC
AAGAAAAATA
AAAAAAGAAA
GCCAAAAAAG
GTGAATGTTA
GA'ITTAGAAT
TTTACTTCGG
ATTCAAGAAA
CAAGCACAAG
ATTI'CTGTAT
TACAGTTTAA
TGGAACCACA
GACTCTATGT
CGATTTCTGA
ATCATGATCC
AATCAGGATC
CTTTAGAACG
AAATAACTTT
TITCCACAAAA
AT'ITATTTA
GGACCTACGA
AAG'ITTCAGT
ATCGTAGTGAA
TTTCAATTCC
AAAATCCGGC
AAAGTT'TGT
CGGGACAAAC
ACTACTTAAC
TTGGGCTGGA
C--GCTGAGTT
CTCAAGT1'CC
TATCTGGTTG
ATAGTACCT
CTAGAACAAC
ATATGGT
TAGCTAATCA
AATGGGCTTA
CACAGCGGGG
TTTAGGTAAA
AGATGAAACC
AGTAACACCA
GATTCATCAA
CGCAGATG-TG
AGAACGGCCC
TGAAAAATAT
TAATGGTCCT
AAAAAATGAT
AATTAAGAC
TAAGCTAAGT
TCA.ATTCGTC
TTTAGCGAAC
AAAACA.AGTC
GATTGCGCCA
CTACGATACA
TAAA.ATCAAA
TTTCCAATTT
TTTTACTATT
GGGAACCGAT
GGGCGGCGTA
ACATG-CGGCT
GAATCAGGAA
AAAAATCAAG
T13TTAAC 0* e.
0* EF023-4 (SEQ ID NO:88) GTGN SADSNKAAEQ KIAISSEAAI STMEPHTAGD
TTSTLVMNQV
DFVYAWQQVA
PYLKSLLSFP
YWDAEKVKLT
YFLKNNSVRD
GTDFTKLAAK
LEENLDGLEV
FKSDTYDQLI
YEGLIYVLGKE
SPKSGSIHQA
VLFPQNEKYI
EAKVSVIKSP
GKENPALANN
KNNYLTYDTA
NVTQVPFTIR
QETRTTHAAD
DELELGVAAE
LFFDVIKNAK
KEQGDKYATO
TTAVNLFDSN
NIRKALAQAF
KAKEFWEKGK
VDRDQTRDYJ
QEARLNDFAQ
EPAISEDETV
EIALEGADVN
A.EHLIYNGPF
ELDVVNKLSG
DKESFVKEVL
KEIGLDKIKL
LELSGWGTDY
AQDILVNQET
YTFKIREDAI(
TLGVKALDDK
KLKEWDNASS
EFIPGYVDNP
QDQSTATDQV
EFLTDDTDSA
RDPLTVMRIF
VLAPIYNRSI
WSNDDPVTAN
TLEITLERPT
EDDWTYEKNDT
AFLS IPQFVT
IPPGQTIAPD
KKAAEFFQFQ
TSDSTLGGVT
SVLANQKIKD
LYWHSFGPTY SLKWAYVN EF024-1 (SEQ ID NO:89)
TAATGGCCGT
AACAAGGATC
GTCGCCTTCT
GGTGGTAAAT
T'CAGCTA
AAAAAAATAA
AANGNTANCA
ACAGAATAA
TTCGTCTACT
ATAAAAAAGG
TA ITGTI'GTC
GGAAAGTGGA
ATGACTCAGA
CCTI'TGACNT
AGATTACAGG
AATAAAGAGG
AGAAGTGAGC
AGGTTGTGGA
AGAAACACGT
GGATAGTGTT
TACTAGCAGN
GGAAATTGC
ATGAAGCTAC
ATGAAAAAAG
ACAGATATGA
GCAACTTACA
AGTGGGACAT
AACTCTTTCA
GAAAAACAAA
TCAAATGGCG
TACTACCI'I'
AAAAGATATT
CTTnI'TTTGA
ACACTTATGA
TTATGGAAAA
GAACACTTAT
TTGGCAATGG
TATTGCC'ITA
GACTGCCGAT
TGACGGTAAA
TGAAAAAAAT
AGTNGANTNC
AAAACAAAAA
105 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
E-F024-2 (SEQ ID M KKVLPFIALV GLLLLSGCGT DMKILTADG GKWKVEETRA TYTFFDDGKF SANDSEDSVS GTYTYDEKNK KITFIDXTSXN SFIMEKVXXX XXKITGEIGE KQRTLIKQKT E EF024-3 (SEQ ID NO:91) ATT GACTGCCGAT GGTGGTAAAT GGAAAGTGGA AGAAACACGT TTTrCAGCTA ATGACTCAGA GGATAGTCTT AAAAAAATAA CCI=TGACNT TACTAGCAGN AANGNTANCA AGAPI'ACAGG GGAAATTGCC
ACAGAA
GCA.ACTTACA
AGTGGGACAT
AACTCTTTCA
GAAAAACAAA
CTTI'ITTGA
ACACtTATrGA
TTATGGAAAA
GAACACTTAT
TGACGGTAAA
TGAAAAAAAT
AGTNGANTNC
AAAACAAAAA
EF024-4 (SEQ ID NO:92)
LTADG
GKWKVEETRA TYTFFDDGKF SANDS EDSVS XXKITGEIdE KQRTLIKQKT E GTYTYDEKNK KITFDXTSXN SFIMEKVXXX EF025-1 (SEQ ID NO:93)
TCAATGAAAC
GCGCCAGATC
AAGCAAGITA
AATGACGAAC
GTTACAGGGG
GAAAAAAAGA
TA'ITATAAAA
GAACCTGAAA
TATGATACAC
AGCAACACGC
AAACAAATC
ACTTCTGCGC
CCATCGAAAG
CTTCAAGTGA
CATGAACAGG
ATTCACCAGG
ATATTAAAGG
AAGAGCCAAC
CCGTCACCAA
TGATTGCTAA
CCACACAAAC
AAAAAATCTT
ATGAAGGTGT
CGCAAAGGAT
CCTA
4 FrCCG
GTACAGACGA
GTGACGAAAA
GTGAAGGATT
AAACGTATAT
TAACAGAAGA
AAAAAATCAC
ATACAATAAA
AATGTTGGTT
GAAACAAACA
TCAAACGACT
TCAATTGACT
GACATTTGGA
TTGGTCCAAT
ATTTACTGC
TCTGAATGTT
TGTCAACAAA
TACGTI'AGTC
TCGTGTTACA
AATGCCTTTA
CGGTTACGCA
GCAGAAAATA
AGAAACAGCC
CAAACCAACA
TTTATCGGAT
ACAAGTGGTC
TCTGCGGTG
TTTGATTCTC
ACAACCCCAC
CAACCGCCTT
GGAAATTACG
GAGTTTACAG
CCC CTGTCGG ACCGTT GTTA
GGTAAT'TT
GCAGCAGAGA
GAAGATTTAG
AAACATGTCA
TGYCGCCTT
ATTCTTTTAT
TATTTATTTI
CGCAAGAGAC
AAAAACA.AGC
ATGAATACAC
CAGCAAAATA
TGGGATTAAT
GCATTGTAGA
AGTTTGCTAG
ATTATCCTGA
ATCCrTATTAC ATACG4GTACG
TGGACACT=
GCAATGGCCT
GCTATGATGA
TTATCGTCAA
TCATTI'TGTA
GTTGTTGT
AAAGCAAGTG
CCCGACTAAA
GTACCAAGTG
TACACCGGAA
GACGGGTAAC
GATTATTACG
TGATCCTTAT
ATTTCAAGCA
TTATGTAGAA
CGG'ITCATCA
GCTAAAAGAG
AATCGCTCGA
ATACTTTTCA
TCGAAATATT
G
EF025-2 (SEQ ID NO:94) MKH IKGMLVF
DELIANQLTF
YKNECVFTGG
NTRTDDTLVT
SKETYICYkE
HQDTINKPTI
IGLJFILVGCA
DSHEYTYEVV
NYGIVEI ITE VVNG ITYVTEK
DLGNGLIARL
LLFIL
PDQEPTKQTT
TGATQTTFGT
PETQRILNVE
QMRDENRVTG
QVITEEQKIK
SGPQETKQVI(
TPPAKYTPEE
FI'EFASD)PYY
NFYTVRGSST
HVSYDEYFSD
QVIVTNQTTS
KKKKMFWSNQ
DTRYSGVNK(R
SAREGLMPLA
EQEK ITE:TAC
AVEKQAPTKN
PPLGLMTGNY
LSDYPEFQAS
A.EMDTWLKEP
GLFIVNRNI I EF025-3 (SEQ ID 106 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
AAC GAAACAAACA ACAAGTGGTC CGCAAGAGAC AAAGCAAGTG
AAGCAAGTTA
AATGACGAAC
G ETACAGGGG
GAAAAAAAGA
TATTATAAAA
GAACCTGAAA
TATGATACAC
AGCAACACGC
AAACAAATGC
ACTTCTGCGC
CCATCGAAAG
CTTCAAGTGA
GATGAACAGG
ATTCACCAGG
CCGTCACCAA
TGATTGCTAA
CCACACAALAC
AAAAAATGTT
ATGAACCTCT
CGCAAAGGAT
GCTAT1'CGGG
GTACAGACGA
GTGACGAAAA
GTGAAGGATT
AAACGTATAT
TAACAGAACA
AAAAAATCAC
ATACAATAAA
TCAAACGACT
TCAMA1'GACT
GACATTTGGA
F1'GGTCCAAT
ATTACTGGC
TCTCAATGTT
TGTCAACAAA
TACGT1'AGTC
TCGTGTTACA
AATGCCTTTA
CGG'ITACGCA
CCAGAAAATA
AGAAACAGCC
CAAACCAACA
TCTGCGGTGG
TTTGATTCTC
ACAACCCCAC
CAACCGCCTT
GGAAATTACG
GAGTTTACAG
CGCCTGTCGG
ACCGTTGTTA
GCTAATTTT
GC6AGCAGAGA
GAAGATTTAG
AAACATGTCA
TGCGCCTT
ATTCTTTTAT
AAAAACALAC
ATGAATACAC
CAGCAAAATA
TGGGATTA.AT
GCATTTAGA
AGTTTCGCTAC
ATTATCCTGA
ATGGTATTAC
ATACCGTACG
TGGACACTI'G
GCAATGGCCT
GCTATdATGA
ITATCGTCAA
TCAT=TG
GCCCACTAAA
GTACGAAGTG
TACACCGGAA
GACGCGTAAC
GATTATTACG
TGATCCTTAT
AITTCAAGCA
TTATGTAGAA
CGGTTCATCA
GCTAAAAGAG
AATCGCTCGA
ATACTTTTCA
TCGAAATATT
EF025-4 (SEQ ID NO:96) TKQTIT SGPQETKQVK QVTVTNQ'FrS AXTEKQAPTI(N DEL IANQLTF DSHEYTYEVV TGATQTI'FGT TPPAKYTPEE YKNEGVFTGG NYGIVEIITE PETQRILNVE FTEFASDPYY NTRTDDTLVT VVNGITYVEK QMRDENRVTG NFYTVRGSST SKETYIGYAE DLGNCLIARL QVITEEQKIK HVSYDEYFSD HQDTINKPTI LLFIL KKKKMFWS NQ
DTRYSGVNKR
SAREGLMPLA
EQEK ITETAC
PPLGLMTGNY
LSDYPEFQAS
AEMDTWLKEP
GLFIVNRNI I EF026-1 (SEQ ID NO:97) TGAGTGTATrG
TTTTTTGGAG
GCTGCTCTTG
AGCAAAGAAA
CCTGAAGAGC
AATGTCACTI'
TCCAACCATG
GCTGAATATG
CACGGGACTG
AAAGTTAATG
AAAGATGGTG
ATTACTCATT
GGATITTCAT
TGrI'GCTTAG
CAGCTAATTC
TCGAAATGC
TTGATAAAGA
TCTATCGTAA
AAATCACGGT
TCAAAGGTGA
GCAATATTAC
CCACTG'?rAC
TCCCTTTGAA
GAAAATGTCT
TGCTTGTTCA
AAGTACAGAA
GTTAAGTGAT
GGTAACAGTT
ATTACCACTTl
TCCTAAGCTA
TATTGAGGTG
TTTTGATAAA
TGGTGAAGTC
TCAGTrATGA
AALAGTACTCA
TCTGATAAAA
GTAGTCTCTG
AAAGGAAATI'
GCTGGTACTr
TATTCCCAAG
ATCGTTTC~l'
AAAGCAAATG
CAAGAATACA
AC CGTAGC CA
TAAAGGAAGA
CCACTGTTTT
AAACAGATAG
GTGCTTCAAT
GGATTCTC-GC
TCCATGATAA
ATGATAATAA
CTGA.AAATTT
GCTTTACTTT
AAGATrCTGC
ATAA'FIAA
AATAAATA-AA
GACGGCAACT
TAGTTCTAGT
TAGTGCCAAG
AGC'IACTGAC
GGGGAAAGAT
AAAAGTAACT
CAACATCGTT
AAATGGTACC
TGACI'AGAA
*f EF026-2 (SEQ ID NO:98) MXMSK VLTIVLTATA ALVLLSACSS DKKTDSSSSS KETANSSTEV VSGASISAKP EELEMALSDK GNWIVATON VTFDYEVTVA GTFHDKGKDS NDVYRXLALY SQDDNKKVTA EYEITVPKLI VSSENPNIVH GTVKGDIEVK ANGFTLNGTK VNGNITFDKQ EYKDSADLEK DGATVTGEVT VANN EF026-3 (SEQ ID NO:99) AACAGATAG TAGTTCTAGT AGCAA.AGAA.A CAGCTAA'TC AAGTACAGAA GTAGTCTCTG GTGCTrCAAT TAGTGCCAAG CCTGAAGAGC TCGAAATCCC GTTAAGTrAT AAAGGAAAI' GGATTGTCGC AGCTACTGAC AATGTCACTT TTGATAAAGA GGTAACAGTT GCTGGTACT1' TCCATGATAA GGGGAAAGAT 107 TABLE 1. Nucleofide and Anfino Acid Seqeuences of Efaecalis Genes.
TCCAACCATG3
GCTGAATATG
CACGGGACTG
A.AAGTTAATG
AAAGATGGTG
TCTATCGTAA
AAATCACGGT
TCAAAGGTGA
GCAATATTAC
CCACTGTTAC
ATTAGCACT
TCCTAAGCTA
TA'FIGAGGTG
TITTGATAAA
TGGTGALACTC
TATTCCCAAG
ATCCTTTC IT
AAAGCAAATIG
CAAGAATACA
ACCCTAGCCA
ATGAT.AATAA
CTGAAAATTT
GCTTTACTTI
AAGATTCTGC
ATAAT
AAAAGTAACT
CAACATCGTT
AAATGGTACC
TGACTTAGAA
EF026-4 (SEQ ID NO:100)
TDSSSSS
KETANSSTEV
NOVYRKLALY
VNGNITFDKQ
VSGASISAKP EELEMArJSDK SQDDNI<KVTA EYEITVPKLI EYKDSADLEK OGATVTG-EVT GNWIVAATDN VTFDKEVTVA GTFHDKGKDS VSSENFNIVH GTKGDIEVX ANGFTLNGTI( VANN EF027-1 (SEQ ID NO:101)
TTTGGTATGA
GCATGTGGAA
GTCGCATCTG
TCCGATATGA
GAGCTAGCTA
AAGTTACGAG
GCGTTTAGAA
TTTAAAAAT±G
GCAATCGATG
GTCTTGGTTG
GCCTATGGGA
GATGGCAAT
GTAAAATTGA
TTTGATAATG
GAGTCAAAAC
CGCCGTGAAA
GAAACTTTTG
GCTAAT
TGCCATATAA
AACAGAAAAA
GTGGCGGTI'C
GTGGTGAACT
TTGGTCAAGT
TGGCGAAAC
AAGCAAAATG
ACGTGGTCGA
GGCGTGCGGT
ACCAGACACT
GGACACCTT
C'FrCTCCAGA
CCGAAACTTG
ATGAAATTGA
GCGACTTAGA
AAGCGCATr
TTACCGGCAA
CAAAAGAAAT
CAAAAATCCA
TATTAAAGAA
GTGGTTAATC
GAAAACGACC
CTCGACATTA
AGTTGAAGGC
AGAGCCACAA
GACAAACGGG
TCCAGCATAC
GCGGGAAGGA
AGAACTAACA
TATGCCTAAA
TAATTTI
GAAA'ITGAAG
TGTrCAAGTA
TTACACTGT
TGTACCTA.AA
CGAACATGTI'
TTTAGGAGAT
GATACAGGTG
GCCCAAGCTA
GGACTTIGTTG
TCAAACGA.AC
GACAGCGCTC
TTGTATCGAC
GTTACTGAAG
GATCCAG=A
GGTTCAAGTA
CAAGCCACGA
TTGGAAAATC
AATGAAGCCT
GGCAATGGGC
AAGAATGATC
GTGAAAGAAA
TACAGATA
GCCATGGTG
CGAAAAGC 1T
GGCTCGACAG
AAGATCCG
ACTGGAACAA
CACTGGGCTT
CAGCTACACA
ArATACAGA AAGATAkAAA
ACGGGTTAGT
AAGCAGGGGA
GCAGTAATCA
TGGA-AGCATT
CAATTCCTTA
TTGCCAAAGA
CGTIGTLAAT
ATTATTGGGA
TTGrGCACAGG
CTI'ATCCACT
G ITTAAAG
TTIACAAGC
C'FTrAAATGG
CAAAGAAAAT
TT
GGTrTTAGCA
GAA.AATTAAC
TGTCTATAGT
CGGAGATCCT
CTATACATTC
TTTTGTAGTT
AATGGATATT
TGGTGTCAAA
TTTAGCCCAA
AAAAGGTACT
TTCAGC FrGG
TAAAGAACAC
AGCCAATCTT
TCAGTATAAA
CCCCAATCAT
GATTGACAAA
NTTTGTACCA
GGTGATTTAT
EF027-2 (SEQ ID NO:102) MKQKKWLI GLVALGLVLA ACGSGGSKTT SNEPATQKIN VASGGELSTL DSAHYTDVYS SDMIGQVVEG LYRQDKNGDP E LAMAKAEPQ VSEDGLVYTF KLRE.AK;4TNG DPVKAGDFVV AFRNVVDPAY GSSSSNQMDI FKNGRAVREC QATMEEFGVK AIDDQTLELT LENPI PYLAQ VLVGTPFMPK NEAFAKEKGT AYGTSADNFV GNGPFVISGW DGNSETWXrLK KNDHYWDKEH VKLNEIDVQV VKEIGTCANL FDNGDLDYIV LADTYALQYK ESKQA}{FVPK AMVGYLSPNH RREITGNEHV RKAFLQAIDK ETFAKEILGD GSTALNGFVP ANFAKIQIQV KISAXKM1VIY CHIILKKPKL TGTI EF027-3 (SEQ ID NO:103) AACGACC TCAAACGAAC CAGCTACACA GAAAATTAAC GTCGCATCTG GTGGTGAACT CTCGACA ETA GACAGCCCTC TCCGATATGA TTGGTCAAGT AGTrGAAGGC TTGTATCGAC GAGCTAGCTA TGCCGAAAGC AGAGCCACAA GTTAGTGAAG AAGTTACGAG AkAGCAAAATG GACAAACGGG GATCCAGTTA GCGTTTAGAA ACGTGGTCGA TCCAGCATAC GGTTCAAGTA
AITATACAGA
AAGATAAAAA
ACGGGTTAGT
AAGCAGGGGA
GCAGTAATCA
TGTCTATACT
CGGAGATCCT
CTATACAITC
'TTTGTAGTT
AATGGATAT
108 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes.
?FTAAAAATG
GCAATCGATG
GTCTTGGTTG
GCCTATGGGA
GATGGCAATT
GTAAAATTGA
TTGATA.ATG
GAGTCAAAAC
CGCCGTGAA.A
GAAAC=TTG
GCTAATTr'rG
TGCCATATAA
GGCGTGCGGT
ACCAGACACT
GGACACCTI'
CTTCTGCAGA
CCGAAACTTG
ATGAAATTGA
GCGACTTAGA
AAGCGCATTT
TTACCGGCAA
CAAAAGAAAT
CAAAAATCCA
TATTAAAGAA
GCGGGAAGGA
AGAACTAACA
TATGCCTAA
TAATTTTGTT
GAAATTGAAG
TGTTCAAGTA
TTACACTGTTI
TGTACCTAAA
CGAACATGTT
TTTAGGAGAT
GATACAGGTG
CCCCAAGCTA
CAAGCCACGA
TTGGAAAATC
AATGAAGCCT
GGCAATG-GGC
AAGAATGATC
GTGAAAGAA.A
TTAGCAGATA
GCCATGGTGG
CGAAAAGCTT
GGCTCGACAG
AAGA ETTCCG
TGGAAGAAT
CAATTCCTTA
TTGCCAAAGA
CGTTTGTAAT
ATTATTGGGA
TTGCCACAGG
CTTATGCACT
GTTATI'AAG
~T=ACAAGC
CTTTAAATGG
CAAAGAAAAT
TGGTG-TCAAA
TTTIAGCCCAA
AAAAGGTACT
TTCAGGTTGG
TAAAGAACAC
AGCCAATCTT
TCAGTATAAA
CCCCAATCAT
GATTGACAAA
NTTTGTACCA
GGTGA ITIAT EF027-4 (SEQ ID NO:104) TT SNEPATQKIN VASGGELSTL DSAHYTDVYS SDMIGQVVEG LYRQDKNGDP EIJAMAKAEPQ VSEDGLVYTF AFRNVVDPAY GSSSSNQMDI FKNGRAVREG QATNEEFGVK VLVGTPFMPK NEAFAXEKGT AYGTSADNFV GNGPFVISGW VKLNEIDVQV VKEIGTGANL FDNGDLDYTV LADTYALQYK RREITGNEHV RKAFLQAIDK ETFAXE-ILGD GSTALNGFVP
CHIILKKPKL
KLREAKWTNG
Al DDQTLELT
EDGNSETWKLY
ESKQAHFVPI(
ANFA.KIQIQV
DPVKAGDFVV
LENP IPYLAQ
KNDHYWEDKEH
AM'VGYLSPNH
KISAKKMVIY
a.
C.
EF028-1 (SEQ ID NO:1OS) S S *4
TAACAGAAGC
AAGACTTG'rr
AGAGCTIGC
TCTGAACAGA
GAAAAAGCAT
ACAACGGGCT
TTrGATACCT ACCGA ITCAG
ATTGCACTCG"
GGGAAATCAA
GGCGCACATA
CAAATCGACG
CGGAAAGATC
AAAAAGTCGT
CTACCTAAAA
GCTCTTCAAC
ATTGATTGGG
GAAGCGGCGT
CAACTGCAGA
AATACAACAA
ATAGTCAATG
TAGGGGTTAC
AAAGCGGCGA
CAGTAAAAAA
ATCGCTATT
ATTTGGTCGG
CTTCCGCAGC
ATAATGACAA
CGGGTCTTGT
ATGTICACG
GACA.ACACAA
GTGATTTAGT
TAAATGAGAA
TGATTGACCG
GGTTAGATAA
CCGGGCATAG
ITGAAA.AGGC
TCATTCAACA
CTTAACACTT
TATGGGTAGA
CTTATTAACA
AAAACAAACA
TGTTATTTT
CAAAGCCAAT
ACAGCAAGCC
GACACATG
GTCCAAAACA
AGCAACATCT
CAAAAATATG
AGTCGATGTG
CAAAGAATTT
CCAAGACGAC
AACGGAAGAA
AAATGAAAAA
CAATGATATT
CATCGATTTT
GGGGGCTTGT
TGTTTACTTG
TATGAAr.GAG
TTCACAACAT
GAGGTTGCTG
ATGATTGGAG
CACTCAGACA
ACTTATCCAG
GCTGCCGGAG
TTATTTATCA
GAAACAAGGA
TAGCGGGTTG
A.AGCGAAGGC
ATGGCATGGG
AGCGTrGTTCC
AAGATGAAGA
TGAAAACCTA
GAAACAGTGC TCGAACGTGC GAAAAAAGTG GAAATAACAC ATGCAACCCC TGCTGCATAT
GAAATCAACT
AATGAAGAAA
TACAAATTTA
AACTGAATCT
GAATCCGTAT
CCAA.ACAGCT
AGAGAATGTC
TAATAATGCT
GCAGAAATCG
TTAC'ITGGCG
TCCCAAGCGG
AAAATTTAG
GTCCCTTCAT
GG ITTC'TTT
GTTGGCGCGA
GCCAAAAAAG
CTIAG
CCGATGACTA TTrTGATGAT GCGGCTCCGA ATTATITGCCC GTTATGGTCA TGTCACAGAC GCTTGTTTGC ACCAGGCGGG TAGCTOATAT GACAGAACG TALATGGTTGA AGGTAGTCAA TGAGCGAAAT GCAAGACTTC ATGGTGAACA TTG.GTGGTTA EF028-2 (SEQ ID NO:106) MKKR ALLGVTLLTF TTLAGCTNLS EQKSGEKQTE VAEAKATESE KASVKNVIFM DTYLVGQQAT YPEDEEENVr DSASAATAMA KSTGLVATSE ITH-ATPAAYG AHNVSRKNMA KDRDLVKEFS QAGYGHVTDK KSLNENQDDK LQRLDKNEKG FFLMVEGSQI DWAGHSNDIV IGDGMGNPYT TGYRYFKANH AGVKTYNNAI ALDNDKSXTE EIADDYFDD)Q IDGQMXVDVL ILGLFAPGGL PKMIDRTEEV GAMSEMQDFE AAFEKAIDFA
SDKRVPQTAF
TVLERAKXVG
LGGGSEILFAR
PSLADMTEAA
KXDGEHWLQ
109 TABLE 1. Nucleotide and Amino Acid Seqeuences: of E. faecalis Genes.
LQI IQQGACL EF028-3 (SEQ ID NO:107) ACAGA AAAGCGGCGA AAAACAAACA GAGGTTGCTG AAGCGAAGGC AACTGAATCT
GAAAAACCAT
ACA.ACGGGCT
TTTGATACCT
ACCGATTCAG
ATTGCACTCG
GGGAAATCAA
GGCGCACATA
CAA.ATCGACG
CGGAAAGATC
AAAAAGTCGT
CTACCTAAAA
GCTCTTCAAC
ATTGATTGGG
GAAGCGGCGT
CAACTGCAGA
CAGTAAAAA
ATCGCTA=F
ATTTGGTCGG
CTITCCGCAGC
ATAATCACAA
CGGGTCTITGT
ATG TCACG
GACAACACAA
GTGATTTAGT
TAA.ATGAGAA
TGAT~rGACCG
GGTTAGATAA
CCGGGCATAG
ITGAAAAGC
TCATTCAACA
TGTTATT=r
CAAAGCCAAT
ACAGCAAGCC
GACAGCGATG
GTCCAAAACA
AGCAACATCT
CAAAAATATG
AGTCGATGTO
CAAAGAAT'T
CCAAGACGAC
AACOGAAGAA
AAATGAAAAA
CAATGATATT
CATCGATTT
GGGGGCTTGT
ATGATTGGAG
CACTCAGACA
ACTTATCCAG
GCTCCCGGAG
GAAACAGTGC
GAAATAACAC
GCAGAAATCG
TTACTTGGCG
TiCCCAAGCGG
AAAATTTTAG
GTCCCTTCAT
GGTTTCTTTT
G ITGGCGCGA
GCCAAAAAAG
CTr
ATGGCATGG
AGCGTGTTCC
AAGATGAAGA
TGAAAACCTA
TCGAACGTC
ATGCA-ACCCC
CCGATGACTA
GCGGCTCCGA
GTTATGGTCA
GCTTGTTrGC
TAGCTGATAT
TAATGG~rGA
TGAGCGAAAT
ATGGTGAACA
GAATCCGTAT
CCAAACAGCT
AGAGAATGTC
TAATAATGCT
GAAAAAAGTG
TGCTGCATAT
TTTTGATGAT
ATI'ATT1TGCC
TGTCACAGAC
ACCAGGCGGG
GACAGAAGCG
AGGTAGTCAA
GCAAGACTTC
TTG-GTGGTTA
EF028-4 (SEQ ID NO:108) QKSGEXQTE VAEAKATESE KASVKNVIFM IGDGMGNPYT TGYRYFKANH SDKRVPQTAF DTYLVGQQAT YPEDEEENVT DSASAATANA AGVKTYNNAI ALDNDKSKTE TVLERAKKVG KSTGLVATSE ITHATPAAYG AHNVSRKNMA EIADDYFDDQ IDGQHKVDVL LGGGSELFAR KDRDLVKEFS QAGYGHVTDK KSLNENQDDI( ILGLFAPGGL PKMI DRTEEV PSLADMTEAA LQRLDKNEKG FFLMVEGSQI DWAGHSNDIV GAMSEMQDFE AAFEKAIDFA KXDGEHWWLQ LQI IQQGACL EF029-1 (SEQ ID NO1:109)
C
TGAAGGAGGG
GCCACTII'TT
GAAGTTACTG
GTI'AGGCACG
CGG'rrTTTAT
GGATTGGAAG
GAATATGATA
GATGGAACAT
AAAGAGCCTA
GAGTCACTCG
AGAAAATGAA
TATTATCAGG
AAAATGTTAC
TGGATGGCGA
TAATTGACAC
CTAGCAAACG
AGGGCGATAA
TACTACAAAA
CAACTAAGTA
GA.ATCTGGAG
AAAGTTAATC
ATGCGCAAGT
TCAAACTAT
TACCACACTA
ACCCGAGACT
CACAAAAGAG
AACAGATCGT
AACGCTTGTA
TTTGGCAGAA
CATACCAGGT
GCTAAAAACT
CTTGAACAAA
TCAAACGATC
TTAAAAATTG-
GTGAAACCGA
CTTTTGTCTA
TACGGACGAG
AGTGAAGGAT
CTAGAGCAAG
TATCTGACAC
GGCTGCTGCT
AAGCACAGGA
AACGTATACC
ACCGAAAAGA
AAACAAA.AGT
CTGCTTCAGA
C= IGrGCTA
TAGCTCGTGT
CCCAAGAACA
AMCGGGGO'TT
TACAGCAGTA
TAGTGTAAAA
AGCTGATTTT
ACAAAAAGTT
TCAGCCGTTC
AA'FrACGTTT
CATATTCGTA
TGCCTATGTA
GGCTAAAALAT
TAGTAAATAA
EF029-2 (SEQ ID NO:110) MKKLIG KKWLLLTAVA TFLLSGCASL EQKAQDSVKE VTENVTQTIS NDQRIPADFV RHVDGDTTVL KIDGKEQKVR FLLIDTPETIV KPKTKVQPFG LEASKRTKEL LSTASEITFE YDKGDKTDRY GRALGYIFVD GTLLQKTLVS EGLARVAYVK EPTTKYLAEL EQAQEQAKNE SLO IWSIPGY VTQRGFSK EF029-3 (SEQ ID NO:111) AAATGTTAC TCAAACTATT TCAAACGATC AACGTATACC AGCTGATTTT GTTAGGCACG TGGATGGCGA TACCACACTA TAAAAATTG ACGGAAAAGA ACAAAAAGTT 110 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
CGGTT'FIAT
GGATTGGAAG
CAATATCATA
GATGGAACAT
AAAGAGCCTA
GAGTCACTCG
TAATTGACAC
CTAGCAAACG
AGCCCGATAA
TACTACAAA-A
CAACTAA GTA
GAATCTGGAG
ACCCGAGACT GTGAAACCCA AAACAAA-AGT
CACAAAAGAG
AACAGATCGT
AACGCTTGTA
TTTGGCAGAA
CATACCAGCT
CTTTTGTCTA
TACGGACGAG
AGTGAACCAT
CTAGAGCAAG
TATGTGACAC
CTGCTTCAGA
CGTTGGGCTA
TAGCTCGTGT
CCCAAGA-ACA
AACGGGGGITT
TCAGCCGTTC
AATTACG=~
CATATTCGTA
TGCCTATGTA
GGCTAAAAAT
TAGTAAA
EFC29-4.(SEQ ID NO:112) NVTQTIS NDQRIPADFV RHVDGDTTVL KIDGKEQKVR FLLjIDTPETV K-PKTKVQPFG LEASKRTKEL, LSTASEITFE YDKGDKTDRY GRALGYIFVD GTLLQKTLVS EGLARVAYVK EPTTKYLAEL EQAQEQAXNE SLGIWSIPGY VTQRGFSI( EF030-1 (SEQ ID NO:113)
TGATTGACAC
GTTACTACGG
ACTGAGAAAA
TCAATGGATT
GAAGGGTTAT
CCAGAAArT
TCCA.ATGGTG
CCCAAAAACC
ATTACAA.AAG
TTGGAGGI'A
TTTTTCCCAC
GAAATGATTG
TGGGATITG
CATTTTGAAG
GATGTGGCTG
GCAATCGAAC
TCCtATTITT
TTAGTCGATA
TTTGTATATA
ACAGACCCCA
GTAGCCATTG
ATAGGGGGAA
TCTTGGTAGG
CCAGTGTCTC
CTAT ITTTAC
ATCAATTGGA
CTGAGGATGG
ATGCTGTAAC
AAGCCAATTA
AGGAAAAAGC
C ITTAGAAAA
AAAACGAAGC
TATCTAATGG
TGCGTAATCC
TTCTTAAAGA
TCTTAACAGG
GGTCAAAAGT
CAAATGAA
ATATTTTAC
ACCCAGAAAC
AAAAAGCCAA
AACTTCTTTC
TAGTATGAAA
TTGTGGCACA
GAAAA.AAGTT
ACAAGATGAA
TGAAA.AAGAT
CAAACGATAT
AGCCAATCAT
CTTTTTCTTG
ACCAGAGGAA
GCCTGTACCA
ATTCGTGAAA
TCCGTTTTA
CTACTATTAC
AACCAATACC
AGATTTGCT
CTATTCCTTA
TGTCCGCAAA
AGATGGCTCA
GAATGAAGAT
AGAGTATTTA
AAGAGATGGT
AAGTTAAAA.A
ACAGCAGANA
TTAAATTTAA
GCCAGTATTA
CAACTAATAC
ACCATTAAAC
TI'CGT'TTTG
TTAGAAGGAA
TTGGGTGTCA
TATTTTACGT
GAAAAAGGAC
ATGAAAAATT
GATAAAGAAA
G ITATAAT
AAACAAAATC
CGT'IAAACC
GCT'IAGCTT
AAAGAAATTT
TITCGTCAAG
GATAAACCAA
GATAGTGACC
TGATGGGGAT TATGTTATTT CAAAA.ATACA CGAGAAAGCA TGGAGAACTC GGAAATCGGT ACGCACAGT'C CAATGTCTTT CTGCTGCTGC TAAAGAGATG TAAGAGAAGA TGGCAAGTGG C'FrGCGTAA ATTAGCGAAT CGATI'CTGAA CGGAACAGCT AAGCGCTTGA TGATTATACT CGFI'ATTGGC ATTTTCTCCA AAGCCTATGG CACTTCTAGT GGGATCAGTC AGCGATGTCG AAGTAAAATC AGAAACGATT TGTACGAATIC AGGTGAATTA GAGACAACCC AGACTATGAA
AAAAAAGAAA
ATCCTTTGGA
ATGGGTACAT
AACCAGGCGC
AAGCAGAGCT
GA
CGAAAA.ACCA
TAAAAAAAGT
TCCAGAAAAA
TCTTGTCAAA
AAACGGAGAT
C EFC30-2 (SEQ ID NO:114) MKX LKMMGIMLFV STVLVGCG ET AXTXIDEKAT EKTSVSKKVL NIMENSEIGS MDSIFTQDEA SINAQSNVFE CLYQLDEXDQ LIPAAAKEMP EISEDGKRYT IKLREDGKWS NGDAVTANDF VFAWRKLANP KNQANYFFLL ECTILNGTAI TKEEKAPEEL GVXALDDYTL EVTLEKPVPY FTSLLAFSPF FPQNEAFVKE KGQAYGTSSE MIVSNGPFLM KNWDQSAMSW DFVRNPYYYO KEKVKSETIH FEVLKETNTV YNLYESGELD VAVLTGDFAI( QNRDNPDYEA IERSKVYSLR LNQKRNEKPS IFANENVRKA LAYALDXXSL VDNILADGSI( EIYGYIPEKF VYNPETNEDF RQE.AGALVKT DAXKAKEYLD KAKAELNGDV AIELLSRDGD SDR EF030-3 (SEQ ID NO:115)
GAGAAAGCA
ACTGAGAAAA CCAGTGTCTC GAAAAAAGTT 'TAAATTTAA TGCAGAACTC GGAAATCGGT TCAATGGATT CTATTTTTAC ACAAGATGAA GCCAGTATTA ACGCACACTC CAATGTCTT'r GAAGGGT1'AT ATCAATTGGA TGAAA.AAGAT CAACTAATAC CT-GCTGCT'GC TAAAGAGATG TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes.
CCAGAAATIT
TCCAATGGTG
CCCAAAAACC
ATTACAAAAG
TTGGACGTTA
TTTTCCCAC
GAAATCATTC
TGGGATTTTG
CATTTTGAAG
GATGTCGCTG
GCAATCGAAC
TCCATT1'TG
TTAGTCGATA
TTTGTATATA
ACAGACGCCA
GTAGCCATTG
CTGAGGATGG
ATGCTGTAAC
AAGCCAATTA
AGGAAAAAC
CTTTAGAAA
AAAACGAAGC
TATCTAATGG
TGCGTAATCC
TTCTTAAAGA
TCTTAACAGG
GGTCAAAAGT
CAAATGAGAA
ATATI'ITAGC
ACCCAGAA-AC
AAAAAGCCAA
AACTTCTTTC
CAAACCATAT
AGCCAATGAT
CTTTI'CTTG
ACCAGAGGAA
GCCTGTACCA
AITCGTGAAA
TCCGT'TTITA
CTACTATTAC
AACCAATACC
AGATTITTGCT
CTATTCC'ITA
TGTCCGCAAA
AGATGOGCTCA
GAAMAACAT
AGAGTATTTA
AAGAGATGGT
ACCATTAAAC
TTCGT1rTIG
TTAGAAGGAA
TTCGCTO;TCA
TAT'TTTACGT
GAAAAAGGAC
ATGAAAAATT
GATAAAGAA.A
GTTTATAATT
AAACAAAATC
CGTTTAA.ACC
TA.AGAGAAGA
CTTGGCGTAA
CGATTCTGAA
AAGCGCTTGA
CGTTAITGGC
AAGCCTATGG
GGGATCAGTC
AAGTAAAATC
TGTACGAATC
GAGACAACCC
A-AAAAACAAA
TGGCAAGTG
ATTAGCGAAT
CGGAACAGCT
TGATTATACT
ATTTTCTCCA
CACTTCTAGT
AGCGATGTCG
AGAAACGATT
AGGTGAATTA
AGACTATGAA
CGAAAAACCA
TAAAAAAAGT
TCCAGAAAAA
TCTTGTCAAA
AAACGGAGAT
GCTTTAGCTT ATGCTTTGA AAGAAAT'F ATGGGTACAT TTCGTCAAG AAGCAGCGC GATAAACCALA AAGCAGAGCT EF030-4 (SEQ ID NO:116) EKAT EKTSVSKKVL NLMENSEIGS MDSIFTQDE-A SINAQSNVFE GLYQLDEXDQ NGDAVTANDF VFAWRKLANP KNQANYFFLL EVTLEKPVPY FTSLLAFSPF FPQNE.AFVKE DFVRN~PYYYD KE-KVKSETIH FEVLKETNTV IERSKVYSLR LNQKRNEKPS IFANENVRKA VYNPETNE-;DF RQEAGALVKT DAKKAKEYLD LIPAAAKENP EISEDGKRYT EGTILNGTAI TKEEKAPEEL KGQAYGTSSE MIVSNGPFLM YNLYESGELD VAVLTGDFAK LAYALDKKSL VDNILADGSK KAXAELNGDV AIELLSRDG
IKLREDGKWS
GVKALDDYTJ
KNWDQSAMSW
QNRDNPDYEA
EIYGYIPEKF
EF031-1 (SEQ ID NO:117)
TGAGAAATTA
AAACGCGTAA
AGCCAAGCAA
TTCGGATTAA
GAAGCGAAAG
AATCCCAATG
AAAGATGGGT
CCGGGAGCAA
GGG.ATTGTTT
TCAGCTGATT
ATGTTrATACG
GCTTTTGAAG
GACTAGCN'A
GTTCATATTA
GTTATTTTAG
'ITTTAGGGAC
CTACGAAAAG
GTGAAGATAT
TTACC7I'AGA
CGGGAATTGA
TATFI'GAAAA
AAGAGGTTi'
ACAACAAAGA
TGAAAGGTAA
TTGCTAGTGA
CGATGAAAGA
ATATGTTCCA
TTCAAGGCGC
AAAAATAA.AA
ATTAGTCGCT
CGAGAGCAAA
TGTCAAAAAA
AGTAGGCAAT
TGTCATTGAA
AATTACAGAA
TGAAAGTGGT
AAAATTAGGC
AATTTCTGTT
ACATGCTGGT
ATTAAAACCA
ATCTGGTGAA
ACAGAAAACG
ACCAT=GG
GCAACGTTAT
GGTGGAAGTA
GACATTATCG
AGCGCAGACC
TTAGCACAAG
AAAGAACTAC
GCTGGCGTAC
AAAGAAATI'A
CCAGACG TC
CAAGATATTA
AACGTTGTTA
ATTGAACCAG
TGA
AGGAAGA=T
TAATGACTGC
ATGC'IIAGT
CTCCATTI'CA
GCTTACGAA
CAAATGCAGC
CTAAT'ITAAC
CAATCGCTGT
AAAACTGGGA
CCACCACGGC
cAAAAGATAA AAACGTAT1'C cTGTcGGMC
AAAAATGAAA
TTGTGGAA-AC
CG'TTCAACT
AAAAGAGAAT
ATTAAAAAAT
ACAAGGTGGA
TCAGTTAACG
AAACAGTATC
TGACTTATGG
AGGTCCTITA
CGGGAAGGCC
AAAATCGTCA
TCATTTTGCG
S S EFOO31-2 (SEQ ID NO:118) MYK RVILGTLVAA TLLMTACGNS E-ATTKSESKG GSRALVVSTF GLSEDIVKKD IIAPFEXENE AXVTLEVGNS ADRPTKLKNN PNAGIDVIEL AQANAAQGGK DGLFEKITEK EVPNLSQLTP GAKEVFESGA GVPIAVNSIG IVYNKMGLK EIKNWDDLWS ADLKGKISVP DVATTAGPLM LYVASEHAGQ DITKDNGKAA FEAMKELI(PN VVKTYSKSSD LANMFQSGEI EAAVVADFAV DIIQGAQKT EF031-3 (SEQ ID NO:119) 112 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
AA CTACGAAAAG CGAGAGCAAA GGTGGAAGTA ATGCTFAGT CGTTTCAACT
TTCGGATTAA
GAAGCGAAAG
AATCCCAATG
AAACATGGGT
CCGGGAGCAA
GGGATTGTTT
TCAGCTGATT
ATG ITATACG
GCTTTTGAAG
GACTTAGCNA
GTTGATATTA
GTGAAGATAT
TTACCTTAGA
CGGGAATTGA
TATTTGAA-AA
AAGACCFTT
ACAACAAAGA
TGAA.AGGTAA
TTGCTAGTGA
CGATGAAAGA
ATATGTTCCA
TTCAACGCOC
TGTCAkAAAAA
AGTAGGCAAT
TGTCATI'GAA
AATTACAGAA
TCAAAGTGGT
AAAATTAGGC
AATTTCTGTI'
ACATGCTGGT
ATTAAAACCA
ATCTGGTGAA
ACAGAAAA
GACATTATCG
AGCGCAGACC
TTAGCACAAG
AAAGA.AGTAC
GCTGCCGTAC
AAAGAAATTA
CCAGACGTTG
CAACATATA
AACG=rTTA
ATTGAAGCAG
CTCCAT?1'GA
GCTTI'ACGAA
CAAATGCAGC
CTAA=~AAG
CAATCGCTGT
AAAACTGGGA
CCACGACGGC
CAAAAGATAA
AAACGTATTC
CTGTGGTTGC
AAPLAGAGAAT
ATTAAAAAAT
ACAAGGTGGA
TCAGTTAACG
AAACAGTATC
TGACTTATGG
AGGTCCTTA
CGGGAAGGCC
AAAATCGTCA
TGATTTTGCG
EF031-4 (SEQ ID NO:120) TTKSESXC GSNALATVSTF GLSEDIVKKD I IAPFEKENE AKVTLEVCNS ADRFTXLKNN PNAGI DVIEL DCLFEKITEK EVPNLSQLTP GAKEVFESGA GVPIAVNSIG IVYNKEKLGK ADLKGKISVP DVATTAGPLM LYVASEHAGQ DITKDNCKAA FEAMKELKPN LANMFQSGEI EAAVVADFAV DIIQGAQK
AQANAAQGGK
EIKNWDDLWS
VVKTYSKSSD
EF032-1 (SEQ ID NO:121)
TGAATAAATT
GTTTGTGGTA
AAGTCAACCA
TCAGGGGAAT
CAACTAGATG
AACCATGACT
CTCACATTCG
CAAGAATATA
TCTGATGT T
GAACTAAGTA
AATACTAATG
GATACTTAG
TCAATTATCG
ATTTAGGAGG
TTTCACTACT
GTCAATCTAG
ATTCAGTTGG
ATAAATCGAG
TATACGGAGT
AAACTCCCGA
TGAAAAATCC
CTAAAAGTAG
CTGAAGCG;A
ACATTIACTAA
CTAAGCATCA
GCCTCATCCC
AATTATGATG
TACTGCTTGT
CAGCACAGTT
AAAAGATATT
CATAGTTCT
GGGAAACAAG
CAAAGATTT
ACTATCNAGT
TAGCCAAGAT
GTCTGATGTA
GCTAGCAGAT
ATITAATGAT
AACCATGGAC
AAAAAATI'AA TTAGT=TAGG NCGGGAAATA ATGATAATAA AAACAACCGA ATTCAAAAGA GATCCTGGAG ATTACTATGC ATTACCGTCA AATCAGGCGG AAAAA.AGTAT CTCTTAAAAA GTTGTTAGAT TTTTAAATGA ACTGAAACTA GCAAACANAA AATAAACAAT CTGATGTATC GCTACTAATA CTTTACCGAG CAGCCAACCT TAGAACAACA ATGTATCCTI' ATAAAGGAAG GCAAAAAGAT GGTAA
ATTGGTTTIGT
AGATACTCAA
CTTTGTTGCG
TGTATTAACT
AGAAAATAGT
GGGAGATACT
AAA.AGATTTT
AACACTAAAC
TGAAAAAAAA
CGAAGATAAA
AACCGTCTTA
CAAAATGCAT
C- EF032-2 (SEQ ID NO:122) MK KLISLGLVCV CGISLLTACX GNNDNKDTE( STSQSSSTVK QPNSKDFVAS GEYSVGKDID PGDYYAVLTQ LDDKSSIVLJI TVKSGGENSN HDLYGVGNKY, KVSLKKGDTL TFETADKDFV VRFLNEKDFQ EYMKNPVSST ETSKXKTVNS DVSKS SSQDN KQSDVSEKKE VSTEAKSDVA TNTLPSEDKN TNIJITKLADE PTLEQQTVLD TLAKEQFNDM YPYKGSKMHS IIGVIPTMDA KRW EF032-3 (SEQ ID N'O:123) TA ATGATAATAA AGATACTGAA AAGTCAACCA GTCAATCTAG CAGCACAGTT AALACAACCGA ATTCAAAAGA CTTT'CTTGCG TCAGGGGAAT ATTCAG IrGG AAAAGATATT GATCCTGGAG ATTACTATGC TGTATTAACT 1.13 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecais Genes.
CAACTAGATG
AACCATGACT
CTCACATTCG
CAAGAATATA
TCTGATGTT
GAACTAAGTA
AATACTAATG
GATACTTTAG
TCAATTATCG
ATAAATCGAG
TATACGGAGT
A.AACTGCCGA
TGAAAAATCC
CTAAAAGTAC
CTGAAGCGAA
ACATTACTAA
CTAAGCATCA
GCGTCATCCC
CATAGTTCTT
GGGAAACAAG
CAAAGA=TT
AGTATCNAGT
TAGCCAAGAT
GTCTGATGTA
GCTAGCAGAT
ATTTAATGAT
AACCATGGAC
ATTACCGTCA
AAAAAAGTAT
GTTGTAGAT
ACTGAAACTA
AATAAACAAT
GCTACTAATA
GAGCCAACCT
ATGTATCCTF
GCAAAAAGAT
ALATCAGGCGG
CTCTTAAAAA
TLTAAATGA
CCAAACANAA
CTGATGTATC
C:TTTACCGAG
TAGAACAACA
ATAAAGGAAG
GG
AGAAAATAGT
GGGAGATACT
AAAAGATTTT
AACAGTAAAC
TCAAAAAAAA
CGAAGATAAA
AACCGTCTTA
CAAAATCCAT
EF032-4 (SEQ ID NO:124) NDNKDTEK STSQSSSTVK QPNSKDFVAS GEYSVGKDID PGDYYAVLTQ LDDKSSIVLI TFETADKDFV VRFLNEKDFQ EYfrflNPVSST 'JSTEAKSDVA TNTLPSEDKN TNDITKLADE IIGVIPTMDA KRW TVKSGGENSN H-DLYGVGNXK KVSLKKCDTL ETSKXXTVNS DVSKSSSQDN KQSDVSEKKE PTLEQQTVLD TLAKNIQFNDM YPYKGSKMHS EF033-1 (SEQ ID NO:125)
TGACTGCT'T
CAAAGGAGGT
AAAATGAAAA
GCAGGATGTG
TTACCAACCA
ATr'TCCCTAG
ATCGCAGTTG
GATATGATG
AATGACATCA
ACAGTCGTTA
ATCGCTGATA
GAAATCGACG
TTTTCTATTG
TCXITTCAGA
AATTTACTTT
GTAAACAGGA
AAGACCGTAG
TGCCATCAAC
ATACTCAAAG
CTGTCGATGC
ATTTAGCTAG
ATATCCCCAC
GCITATCTGA
AGTAG
GAGAAAAAAG
AA.ATTTTCCC
AACAATGATG
AAAGAAAGCA
CGGCAAAGAA
AACAGAAGTG
TAGTACAATG
CGAAAAATTG
CTCAGAAAGT
TAGTACAAGC
ACATGAAAAA
TGGTTTTTTT
CAAAATA-AAA
ACTTIAGG'
ACTACCTCIT
AI'ACTTTAC
ATTGAAGACT
ATGACTGATT
ATTGCCTTGA
GT'TGCAAGC
ATCAAAGCAA
GGACAAAAGT
GTATTGI=
TAGACGAATCG
TAG.TAGCAAC
CTGAAAAAAC
CCAAAGAAGC
TAGGTAAAAC
TAAAAAAATT
A-ACCACAAAT
AAGTGGAAGA
TCAAAGAAGA
TAATCAAAAC
GACGTTGAGA
CGAGGATGAA
ACTTGGCTTA
AGAAGTAACG
AACCAAAAT
CGACCAATTA
ACCACAAATG
TGTTTATGTG
TGCTGGAAT
CGTCCAATTC
AATGGATCAA
EF033-2 (SEQ ID NO:126) MKKFTLTMMT LGLVATLGLA GCGKQEKXAT TSSEKTEVTL AVDTQSSTMMh TDLKKLPQMD VVNIPTSTSI KAIKEDVQFI
PTKDRSGKEI
MMfAVDAEKLI
ADSLSEH-EKG
TLPKEATKII
ALKPQIVYVN
QKLIKTMDQE
SLVPSTTEVI EDLGKTDQLI DINLASSESV WKQVEDAGIT
IDE
EF033-3 (SEQ ID NO:127) CTCTT CTGAAAAAC AGAAGTAACG TTACCAACCA AAGACCGTAG ATTTCCCTAG TGCCATCAAC ATCGCAG ETG ATACTCAAAG GATATGATGG CTGTCGATGC A.ATGACATCA ATTTAGCTAG ACAGTCGITA ATATCCCCAC ATCCCTGATA GCTIATCTGA GAAATCGACG AGTAG
CGGCAAAGAA
AACAGAAGTG
TAGTACAATG
CGAAAAATTG
CTCAGAAAGT
TAGTACAAC
ACATGAAAAA
ATTACITTAC
A~GAAGACT
ATGACTGATD
ATTrGCCTTGA
GTTTGGAAGC
ATCAAAGCAA
GGACAA;LAGT
CCAAAGAAGC
TAGGTAAAAC
TAAAAAAATI'
AACCACAAAT
AAGTGGAAGA
TCAAAGAAGA
TAATCAAAAC
AACCAA.AATr
CGACCAATA
ACCACAAATG
TGTTI'ATGTG
TGCTGGAATT
CGTCCAATTC
AATCGATCA-A
114 TABLE 1. Nucleotide arnd Amino Acid Seqeuences of E. faecalis Genes.
EF033-4 (SEQ ID NO:128) SSEKTEVTL PTKDRSGKEI TLPKEATKII SLVPSTTEVI EDLGKTDQLI AVDTQSSM TDLKKLPQMD MAVDAEKLI ALKPQIVYVN DINLASSE-Sv WKQVEDAGIT VVNIPTSTSI KAIKEDVQFI ADSLSEHEKG QKLIKTMDQE IDE EF034-1 (SEQ ID NO:129)
TAGGAGGGAG
TTGGTAGGTT
CCTT~TAATAC
TTCAACA.AGG
GACACAAAAA
AkATAAAGAGA
AAAAAAGTTG
TAATCATGAA
GTAGTAATAA
TCACCACGAT
ATAAAACCAT
AAACAAGTAG
GCTATGAAAT
ATGA.AGGTAA
AAAAATCGGG
CAAAAAAGAA
TATTGAAAAA
GACGCTTGAA
AACAGAAAAA
TATAGGTCA-A
ACGTATATCT
TATfl'TAG1r1
AACGGCAATC
GAAGAAGACC
AAAGAATAT
AAGGTATATA
'ITGGACAAAA
GATGCAGA-AG
GTATTA'ITTT
'-TTTGAATGC
TAACGAAAGG
TAGTTAATCC
A-AAATATTAA
AAACGAAAAA
GTAATGTGTA
TTCATGT
CAGTTCGI-r
TTCAATTTITL
CAATAATGAA
AATACAAGAA
AATAGAGTTT
TGGTGAT=~
(ft ft GGTGGTAAAT AG EF034-2 (SEQ ID NO:130) MKKIGY FSCIIFFMFL VGCSNNKKEN GNLLNASSFP LILTTIIEKE EDLTKGSIFF NKDKTMTLEK EYLVNPNNED TKKTSRTEKK VYKNIKIQEN KESYEIIGQL DKKTKKIEFK KVDEGKRISD AEGNVYGDFG GK EF034-3 (SEQ ID NO:131) AGAA AACGGCAATC TTTTGAATGC CAGTTCGTTT CCTTTAATAC TCACCACGAT TATI'GAAAAA GAAGAAGACC TTCAACA.AGG ATAAAAC CAT GACGCTI'GAA AAAGAATATT GACACAAAAA AAACAAGTAG AACAGAAAAA AAGGTATATA AATAAAGAGA GCTATGAAAT TATAGGTCAA TTGGACAAAA AAAAAAGTITG ATGAAGGTAA ACGTATATCT GATGCAGAAG GGTGGTAAAT AG
TAACGAAAGG
TAGTTAATCC
AAAATATTAA
AAACGAAAAA
GTAATGTGTA
ITCAATFI'TT
CAATAATGAA
AATACAAGAA
AATAGAGTTT~
TGGTGATITT
ft EF034-4 (SEQ ID NO:132) KEN GNLLNASSFP LILTTIIEKE EDLTKGSIFF NKDKTMTLEK EYLVNPNNED TKKTSRTEKK VYKNIKIQEN KESYEIIGQL DKKTKKIEFK KVDECKRISD AEGNVYGDFG GK EF035-1 (SEQ ID NO1:133)
TAAACGAGAG
TTATTCACAA
ACAATTGATC
GAAAATTCCA
AAAATCGATA
GTGAGTITAT GAAAACAAAA ATCGGAAAAA CAGTTATCIT GTCAGCAT GTTTCCTTIP ACTGAGTGGT TGTACCTCGG CTGGCGAAGA GATGGAAAAA GACAGAAAGA AAAAGTCGAT AAAACGGTCG ATA.AGCAGAA ACATAAAAAT TGGAAAGTTA CGACGAAAAA GTTIGACCGTT CTTTAGATAG TCAAGAAGAC CTACTGAGTA A EF035-2 (SEQ ID 1.1:134) MK'TKI GKTVILSAFL FTSFLLLSGC TSAGEEM~EKT IDRQKEI(VDK TVDKQKHXNE NSMESYDEXV DRSLDSQEDK IOTTE 115 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecczlis Genes.
EF035-3 (SEQ ID NO:135)
GATGGAAAAA
ACAATTGATC GACAGAAAGA AAAAGTCGAT AAA.ACGTCG ATAAGCAGAA ACATAAAAAT GAAAATTCCA TCGAAAGTTA CGACGAAAAA CTTGACCGTT CTTTACATAG TCAAGAAGAC AAAATCGATA CTACTGAG EF035-4 (SEQ ID NO:136) MEKT IDRQKEKVDK TVDKQKHKNE NSMESYDEKV DRSLDSQEDK IDTTE EF036-1 (SEQ ID NO:137)
S
S
S
5555
TAATTTTCAA
AATGAATCGA
ACAGGATGTG
TTACAACCAT
ATTAATGTCC
GACATTGGTA
ATTGATCATA
AA.AGATATCT
GAACTTGGCG
CGTGCGACTT
GATTCCAGCG
GCATTT1TCAT
GATGAAAATG
AAAAATCCAA
GAACGTGTGA
TGGCAAGGAA
GTCCTACATA
TGAAAAAAAG
CAAAATGGAT
TAGTAGAGAC
AAGGTGGCGG
ATTCTGA~T
AAGTTGCTGT
CAATGGAAAA
GGAAAGACCA
TTGAAAAGTG
GCATGGTTCG
ATGTTACTGA
TAATGAACAA
GTGATTTAAC
TTGGTCAATT
ATGTCATTAA
TAATCGTAAA
ATTGCTATTA
TGATCGTGGT
AGCGAGTGAG
AAGCGGAACA
ATTIGCAGAA
CGTTGGGATT
TTI'AAAGAAA
AAAAATTGTT
GGTCTTGGGA
TTCCATTOTT
TGAAGTAGCT
TAAATGGATT
CAAAGAGTTI'
AGGGTATA T
ATAA
ATAGAATGGA TTGAAA'ITAA TTTATTG7 TGGCAAGTAT GAATCCATCA CAGCGGTAGG GAATATCAAA GCCAAAATCC GGTCTGAGTC AACTCCAATC GAGAAAAAGG GCATCAAAGC ACACCAkATCG
ATCTTITTAG
ATTTTGAATA
GATAAAACAG
TCTGATACAC
ACGTTAAGTA
A1"rGTCTT
TI'AGACTTTA
CCTGTICGA
TTAACAAAAA
GTGAAGTAAC
GAGCGGCCGG
CCATTCGTGC
CAGGAGCGAT
TTGATGGTGT
ATGAACACAT
TGTTCTCAGA
AAATGGAAAT
rI'GGAGGAAT
ACTTACTI'TG
CTCATCAGCT
GGGAAGATTT
TGGCGCGGTA
GGAAGACTTA
TGTCGGTGTC
AAACTGGAAA
TAGTGGTACG
GCAAGAACAA
TACTTATACC
TCAGCCAACA
GTACACTCGT
TGATATCCAA
TGAACGGGAT
EF-36-2 (SEQ ID NO:138) ~'*MKKRLLLF IGLASILTLT GCAXWIDRGE SITAVGSSAL QPEJVETASEE YQSQNPGRFI NVQGGGSGTG LSQVQSGAVD DHKVAVVGIT P IVNKNVGVK DI SMENLXKI FLGEVTNWXE ATFEKWVLCD KTAIRAQEQD SSGMVRSIVS DTPGAISYTA ENVMNNKWII WSYEHMYTRK NPSDLTKEFL DFMLSDDIQE
QGNVIK
IGNSDLFAEE
LGGKDQKIVI
FSYVTDEVAT
RVIGQLGYIP
KKGIKAEDLI
LNRAAGSGTR
LS IDGVQPTD
VSKMEIERDW
EF036-3 (SEQ ID NO:139) GAT TGATCGTGGT GAATCCATCA CAGCGGTAGG CTCATCAGCT
TTACA-ACCAT
ATTAATGTCC
GACAITGGTA
ATTGATCATA
AAAGATATCT
GAACTTGGCG
CGTGCGACTT
GATTCCAGCG
TACTAGAGAC
AAGGTGGCGG
ATITCTGATTT
AAGTTGCTGT
CAATGGAAAA
GGAAAGACCA
TTGAAAAGTG
GCATGGTTCG
AGCGAGTGAG
AAGCGGAACA
ATTTGCAGAA
CGTTGGGATT
TTTAAAGAAA
AAAAATTGTT
GGTCTTGGGA
TTCCATTGTT
GAATATCAAA GCCAAAATCC GGGAAGATTT GGTCTGAGTC AAGTCCAATC TGGCGCGGTA GAGAAAAAGG GCATCAAAGC GGAAGACTTA ACACCAATCG TTAACAAAAA TGTCGGTGTC ATCT ITrTAG GTGAAGTAAC AAACTGGAA-A ATI'TGAATA GAGCGGCCGG TAGTGGTACG GATAAAACAG CCATTCGTGC GCAAGAACAA TCTGATACAC CAGGAGCGAT TAGTTATACC 116 TABLE 1. Nucleotide and Amnino Acid Seqeuences of E. faccalis Genes.
GCAT=rCAT
GATGAAAATG
AAAAATCCAA
GAACGTGTGA
TGGCAAGGAA
ATCTTACTGA TGAAGTAGCT ACGTTAAGTA TAATGAACA-A TAAATCGATT AITTGGTCTr GTGATTI'AAC CAAAGAGTTT TTAGACTTTA TTGGTCAATT AGGGTATATT CCTGTTTCGA ATGTCATTAA A TTGATMGTGT TCACCCAACA ATGAACACAT GTACACTCGT TGTTGTCAGA TGATATCCAA AAATGGAA.AT TGAACGGGAT EF036-4 (SEQ ID NO:140) IDRGE SITAVGSSAL QPLVETASEE YQSQNPGRFI NVQGGGSGTG DHKVAVVGIT PIVNKNVGVK DISMENLXKI ATFEKWVLGD KTAIRAQEQD SSGMVRSIVS ENVMNNKWI I WSYEHMYTRK NPSDLTKEFL
QGNVIK
LSQVQSGAVD
FLGEVTNWKE
DTPCAI SYTA
D)FMLSDIDIQE
IGNSOLFAEE
LGGKD)QKIVI
FSYVTDEVAT
RVIGQLCYIP
KKGIKAEDLI
LNRA.AGSGTR
LS IDGVQPTD
VSKIMEIEROW
EF037-1 (SEQ ID NO:141)
TGAGTGTATG
TTTTTTGGAG
GCTGCTCTTG
ACCAAAGAAA
CCTGAAGAGC,
AATCTCACTT
TCCA-ACGATG
GCTGAATATG
CACGGGACTG
AAAGI'AATG
AAAGATGOTG
ATTACTCATI'
GGATTT"TCAT
TGTTCCTI'AG
CAGCTAATTC
TCGAAATGGC
'1WGATAAAGA
TCTATCGTAA
AAATCACGGT
TCAAAGGTGA
GCAATATTAC
CCACTGTTAC
TCCCTTTGAA
GAAAATGTCT
TGCTTGTTCA
AAGTACAGAA
GTTAAGTGAT
GGTAACAGTT
ATTACCACTT
TCCTAAGCTA
TATTGAGGTG
TTTTGATAAA
TGGTGAAGTC
TCAGTTATGA
AAAGTACTCA
TCTGATAAAA
GTAGTCTCTG
AAAGGAAATI
GCTGGTACI'
TATTCCCAAG
ATCGITTCTT
AAAGCAAATG.
CAAGAATACA
ACCGTAGCCA
TAAAGGA.AGA
CCACTGTTTT
AAACAGATAG
GTGCTTCAAT
GGATTGTCGC
TCCATGATAA
ATGATAATA.A
CTGAAAATT
GCTTWTACTTT
AAGATTCTGC
ATAA
AATAAATAAA
GACGGCAACT
TAGTTCTAGT
TAGTGCCAAG
AGCTACTGAC
GGGGAAAGAT
AAAAGTAACT
CAACATCGTT
AAATGGTACC
TGACTTAGAA
EF037-2 (SEQ ID NO:142) MKMSK VLTTVLTATA ALVLLSACSS DKKTDSSSSS KETANSSTEV VSGASISAKP EELEMALSDK GNWIVAATDN VTFDKEVIVA GTFHDKGKDS NDVYRKLALY SQDDNKKVTA EYEITVPKL I VSSENFNIVH GTVKGDIEVK ANGFTLNGTK VNGNITFDKQ EYKDSADLEK DGATVTGEVT VANN EF037-3 (SEQ ID NO:143) AACAGATAG TAGTTCTAGT
AGCAAAGAAA
CCTGAAGAGC
AATGTCACTT
TCCAACGATG
GCTGAATATG
CACGGGACTG
AAAGTI'AATG
AAAGATGGTG
CAGCTAATTC
TCGAAATGGC
TTGATAAAGA
TCTATCGTAA
AAATCACGGT
TCAAAGGTGA
GCAATATTAC
CCACTGTTAC
A.AGTACAGAA
GTTAAGTGAT
GGTAACAGTT
ATTAGCACTT
TCCTAAGCTA
TATI'GAGGTG
TTrTGATAAA
TGGTGAAGTC
GTAGTCTCTG GTGCTTCAAT AAAGGAAATT GGATTGTCGC GCTGGTACT'T TCCATGATAA TATTCCCAAG ATGATAATAA ATCGI'TCTT CTGAAAAUTTT AAAGCAAATG GCTTTACTTT CAAGAATACA AAGA~rCTGC ACCGTAGCCA A
TAGTGCCAAG
AGCTACTGAC
GGGGAAAGAT
AAAAGTAACT
CAACATCGTT
AAATGGTACC
TGACTIAGAA
EF037-4 (SEQ ID NO:144)
TDSSSSS
KETANSSTEV VSCASISAXP EELEI4ALSDK NDVYRKLALY SQDDNKKVTA EYEIT'JPKLI GNWIVAATDN VTFDKEVTVA GTFHDKGKDS VSSE-NFNIVH GTVKGDIEVK ANGFTLNGTI( 117 TABLE 1. Nucleotide and Amino Acid Sequences of Efaecalis Genes.
VNGNI TFDKQ EYKDSADLEK DGATVTGEVT VANN EF038-1 (SEQ ID NC:145)
TAATGGCCAT
AACAAGGATC
GTCGGCTrGT
GGTIYGTAAAT
GAAAC=~TT
AATAATAAAA
GAATATAAAG
TNGAATAAGA
TTCCTCTACT AATAAAGAGG ATAAAAAAGG AGAAGTGAGC TATTGTTGTC AGGTTGTGGA GAACTAGA AAATAAAAGT CGAGGTATAA TTCAAAA.ATT- AACTCACTTT GGATATAAAA ACGGTAAATT AAAAGGTGAA OGTGTCTTTG A
ATGA.AGCTAC
ATGAAAAAAG
ACAGATATCA
CCAACTACTA
AGTGATAGTG
AATAAAGAAC
ATi'GGGCG
TCAAATGGCG
TACTACCTTT
AAAAGATATT
CTTACACTTT
GA.ACGTACTC
A.ATI'AATAAT
AGAAGGACTC
TTGGCA.ATG
TATTGCCTTA
GACTGCCGAT
'FFITCATGAT
TTACGATdAA GGkAAATGTT
TGATAAAAAA
EF038-2 (SEQ ID NO:146) M KLLKWRWQWN KDHKXGEVSM KKVLPFIALV GLLLLSGCGT DWKl(ILTACG GKWELENXSP TTTYTFFDDE TFSRYNSKIS DSGTYSYDEN NKKLTLDIKN KEQLIMENVE YKDGKLKGEI GGEKDSDKKX NKRCL EF038-3 (SEQ ID NO:14'7)
S
at..
S
TTGTGGA ACAGATATGA AAAAGATAT GACTGCCGAT GGTGGTAAAT GGGAACTAGA AA.ATAAAAGT CCAACTACTA GAAAC rITI I CGAGGTATA.A T'rCAAAAATT AGTGATAGTG3 AATAATAAAA AACTCACTTT GGATATAAAA AATAAAGAAC GAATATAAAG ACGGTAAA'T AAAAGGTGAA ATTGGAGGCG TNGAATAAGA CGTGTCTTTG A
CTTACACTTT
GA.ACGrACTC
AATTAATA.AT
AGAAGGACTC
TTTTGATGAT
TTACGATGAA
GGAAAATGTT
TGATAAAAAA
EF038-4 (SEQ ID NO:148) CGT DMKKILTADG GKWELENKSP TTTYTFFDDE TFSRYNSKIS DSGTYSYDEN NKKLTLOIKN KEQLIM.ENVE YKDGKLKGEI GGEKDSDKXX NKRCL EF039-1. (SEQ ID NO:149)
TAAATATATC
GCGCTTACCT
TCTGCAACAG
GCAGCTAAAG
CAAGGCACAA
A=GGTTAG
GGTGAAACCC
TTAATTTCGG
AAATTTACCT
GAACATATCC
ACAGAAACTG
TGGAATGGCA
TITATrAGTT
ACGGGTAAAA
GCTCACGTT
GATAAACGTT
AAAAAGAAAA
TATTAACCTT
ATAAATCAAG
AGCAATCAAA
AAGTTTACGA
CAAAATATCA
GTrGGCGATGA
ATACACAAAA
ATAAGCGAAT
C'rrATTCTGA
GCAAGATTGT
CGAAAGIT'IT
TAGC.GAAATT
CACGTCGAGA
CAATTGGTGA
TACGTATAC
AAGGGGATTA
TAGTACGTTG
TGCAGCTAGC
AGGACAAGAA
CAAAAATNAT
TGGTGAAACA
AGGCACA ETC
CTATCAAGCG
GGGTAAAGAT
CGAGAAATTA
TACCAATGAA
AGATGAAGAC
TGATAATAAA
TTITGGFI'AC
CAATAAATAT
ACGAATGGCT
CCAACCATCA AAAAGAAAA.A AGTITTAGT TTGA'ITGCAG GCTGTGCTGG CGGAGCCAAC
TCAAGCACTG
TTAACAGAAA
AATAATTTAA
GGTTITI'ATG
TTTGTGACAG
GTGGTCGATT
AAAGACGGGA
ACCT ITACGA
CCTGGGGATG
GGTAACGATG
CAGTCTCTAG
TTTTATCCAG
CAGCAGAAAA
AA EvTTITCGA
ACCATGCCGA
TAACGGAAGT
AAGATGTAGA
ACGrGCCGTAA
ACATT=AGG
TTACTGAAGC
TTCAGCAGAA
TACTGATTGG
TGcTAATTI'T
CAAAGAAACA
AAAGCGTATC
GACGAAAGAT
AGTCTTTGTA
AGATAGA-A
GCCCACATTA
AAATAAAATG
TGATTTAGAA
CAAAA'TCCGT
AGAATTAAAT
AATTAAAGTC
ACAAGTAAAT ATGAATICTT TTCCAAGTAA ?TGATAATAA GGAGCTGCAT TAGA.ATTA-AC AP.AGACA.ACA ATCGCAAAGA 118 TABLE 1. Nucleotide and Amino Acid Seqcuences of E.faecalis Genes.
TTTGTAGAAC ATGAACCATA TGAAGGAGAC TTTACGCCAG ACTTCACGTT CTAA EF039-2 (SEQ ID NO:150) MXKKKVFSA LTLLTFSTLL IACCAGGANS ATDKSSAASS STAVSSSAEA AXKEQSKGQEL TEILSSTDWQ CTKWYDKNXN NLTAENANFI GLAKYDCETG ETRGDEGTFF VTDDGEXRIL ISDTQNYQAV VDLTEVTKDK FTYKRMGKOK HIPYSDEKLT FTNGRKDLET ETGKIVTNEP GDIDILGATLW NGTKVLDEDG ISLA.KFDWKT SKYEFFDLET GKTRGDFGYF QVIDNNKIRA HVSIGDNXYG KRFTYTRMGK DUNGKEIKVF VEHEPYEGDF TPDFTF
FYEFFDKETG
DGKDVEVFVE
NDVTEANKMF
AALELTELND
EF039-3 (SEQ ID NO:151) TGCAACAG ATAAATCAAG TGCAGCTAGC TCAAGCACTG CAGTCTC TAG TTCAGCAGA.A i* S
CCAGCTAAAG
CAAGGCACAA
ATTGGTTTAG
GGTG.AAACCC
TTAATTCGG
AAATTTACCT
GAACATATCC
ACAGAAACTG
TGGAkATGGCA
TTTATTAGTT
ACGGGTAAAA
GCTCACGFI
GATAAACGTT
TTTGTAGAAC
AGCAATCAAA
AAGTTTACr-A
CAAAATATGA
GTGGCGATGA
ATACACAAA-A
ATAAGCGAAT
CTTATTCTGA
CCAAGATTGT
CGAAAG=TT
TAGCGAALATT
CACGTGGAGA
CAATI'GGTGA
TTACGTATAC
ATGAACCATA
AGGACAAGAA
CAAAAATNAT
TGGTGOAAACA
AGGCACATTC
CTATCAAGCG
GGGTAAAGAT
CCAGAAATTA
TACCAATGAA
AGATGAAGAC
TGATAATAAA
T=IGGTTAC
CAATAAATAT
ACGAATGGGT
TGAAGGAGAC
TTA.ACAGAAA
AATAATTTAA
GGTTTTTATG AATTT=TCGA TTTGTGACAG ACGATGGCGA GTGGTCGATT TAACGGAAGT AALAGACGGGA AAGATGTAGA ACCTTACGA ACGGCCGTA CCTGGGGATG ACAITIAGG GGTAACGATG TrACTGAACC ACAAGTAAAT ATGAAITCTT TTCCAAGTA.A TTGATAATAA GGAGCTGCAT TAGAATrAAC AAAGACAACA ATGGCAAAGA TTTACCCCAG ACTTCACCTr
TTTATCCAG
CAGCAGAAAA
TACTGATTGG
TGCTAATTTT
CAAAGAAACA
AAAGCGTATC
GACGAAAGAT
AGTCT'ITGTA
AGATTTAGAA
GGCCACATTA
AAATAAAATG
TGATTTAGAA
CAAAATCCGT
AGAA'ITAAAT
AATTAAAGTC
CTAA
EF039-4 (SEQ ID NO:152) ATDKSSAASS STAVSSSAE-A AKEQSKGQEL TEILSSTDWQ GTKVYDKNXN ETRGDEGTFF VTDDGEKRIL ISDTQNYQAV HIPYSDEKLT FTNGRKDLET ETGKIVTNEP ISLAKFDNKT SKYEFFDLET GKTRGDFGYF KRFTYTRM4GK DNNGKEIKVF VEHEPYEGDF
NLTAENANFI
VOLTEVTKDK
GOD ILGATLW
QVIDNNKIRA
TPDFTF
GLAKYDGETG
FTYKRGKDK
NGTKVLDEDG
HVS IGDNI(YG
FYEFFDKETG
DGKDVEVFVE
NDVTEANKMF
AALELTELND
EF040-1 (SEQ ID NO:153) S S
TAGAITAGAA
ATGAACAAA.A
TGTCAAGGAG
AGTAGTGCAA
GAGGAAGTGA
CAACGAGATA
GTCGACACAG
GCAAAAGCGT
CCCTATCTTT
GCGACCCCAG
GAAAAAGGAC
TTCCAAAATG
CAACTTTACC
GACAATTCTG
CCACTGGAGA
AAATTTTAAT
GAGAAACACC
AAACGAAAG
AAGAACCGAT
CGAAGA.AGGT
AAGGCAACAC
TGATTGATCG
ATGCGGGAGC
ATTTAA.ATAA
ATATTGACAA
ACTT'TGGCTA
AAGATATTGT
GTCGAACAGC
AAAATCTCAT
GGGGCTATTA
TTCCGCAGCG
CACCAGTACA
GAAGACCTAT
CAAGGAAATT
ACCGCTCAAT
GGGTGCCGAT
GCAAGGACGT
GCATAACCGT
TGTGAACCTC
TACAGCAT
AAAATTGTTA
AATGGACTAT
ATTTCTCTCC
AGTGTCGTGA
TCAAAAAATA
ACCCG'ITCGG
GAAGTGGGTIG
TI'ACAAGATA
ATCGCTGTTC
ATTAATCTC
ACGGAGATT
AGGA.AAGGAA
CCATTCCATT
GTCAA.ACGGT
TAGCTCAAAC
CGCTTITAGA
CTACTTATCA
ACAATAATGA
AAAACAGCAT
TAGCGTATAT
GTTGAGCACA
ACTTGCTGCG
GACTACTCAA
AACATCAAAA
AGCAGCCA.AT
AGTGGATGAA
CATTIGAGATI'
TAGTGATAGT
GTTAAAACAT
TCCGGCAGCT
AGACATAGAT
TGAAGGGAAC
ATCCATTAAA
GGAAATTAGT
TACGGTGGCA ATGCG ITAAT 'TCGTTAGAAG ATOGACCAGA ATTGAGGCAG TGGGGTTACG ATGGAAPLATG GTGCGGATCA GCCAAT'CAAA AAGGTTATAC 119 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
AAAATTTTAG CACAGTACAA CTAA EF040-2 (SEQ ID NO:154) M NKKILMGLLS VVTIPLLAAC QGGETPSAAS I<NSQTVTTQS SAKTESTS IT RSVAQTTSKE EVKEPMKlTYE VGALLEAANQ RDTKKVKEIL QDTTYQVDEV OTEGNTPLNI AVHNNDIEIA KALIORGADI NLQNSISDSP YLYAGAQGRT EILAYMLKHA TPDLNKHNRY GGNALIPAAE KGHIDNVKLL LEDGREDIDF QNDFGYTALI EAVGLREGNQ LYQDIVKLLM ENGADQSIKD NSGRTAMDYA NQKGYTEISK ILAQYN EF040-3 (SEQ ID NO:155) AGCG TCAAAAAATA GTCAAACGGT GACTACTCAA
AGTAGTGCAA
CAGGAAGTGA
CAACGAGATA
GTCGACACAG
GCAAAAGCGT
CCCTATCTTT
GCGACCCCAG
GAAAAAGGAC
TTCCAAAATG
CAACTTETACC
GACAAMTCTC
AAACTGAAAG
AAGAACCGAT
CGAAGAAGGT
AAGGCAACAC
TGATTGATCG
ATGCGGGAC
AITTAA.ATAA
ATATTGACAA
ACTTTGDCTA
AAGATATT1GT
GTCGAACAGC
CACCAGTACA
GAAGACCTAT
CAAGGAAATT
ACCGCTCAAT
G-GTGCCGAT
GCAAGGACGT
GCATAACCGT
TCTGAAGCTC
TACAGCATTG
AAAATTGTTA
AATGGACTAT
ACCCGTTCGG
GAAGTGG07r
TTACAAGATA
ATCGCTGTTC
ATTAATCTGC
ACGGAGATTT
TACGGTGGCA
TTGTTAGA-AG
ATTGAGGCAG
ATGGAAAATG
GCCAATCAAA
TAGC~tAAAC
CGCTTTTAGA
CTACTrATCA ACAATAkATGA
AAAACAGCAT
TAGCGTATAT
ATGCGTTAAT
ATGGACGAGA
TrCGGGTr-ACG
GTC-CGGATCA
AACGG=ATAC
AACATCAAAA
AGCAGCCAAT
AGT~rCATGAA
CATTGAGATT
TAGTGATAGT
GTTAAAACAT
TCCGGCAGCT
AGACATAGAT
TGAAGGGAAC
ATCCATTAA
GGAAA'AGT
0 0* 0 0 00 00 0 0 *0 *0 *0 0 000 0 00 0 0 000 0 AAAAT =TIAC CACACTACAA C EF040-4 (SEQ ID NO:156) AS KNSQTVTTQS SA.KTESTSTT RSVAQTTSKE EVKEPMX<TYE DTEGNTPLNI AVHNNDIEIA KALIDRGADI TPDLNKHNRY GGNALIPAAE KGHIDNVKLL LYQDIVKLLM ENGADQSIKD NSGRTAM~DYA
VGALLEAANQ
NLQNSISDSP
LEDGREDI DF NQKGYTEI SI RDTKKVKEI L
YLYAGAQGRT
QNDFGYTAL I
IL.AQYN
QDTTYQVDE=V
EILAYMLKHA
EAVGLREGNQ
0* 0000 0 0 000* 0000 0 0000 00 0 00 00 EF041-1 (SEQ ID NO:157)
TAATTATTAA
ATGAAA'ITGA
CCGGCT~rGTG
AGTGGCGAAC
CTAGCAACAG
TAGACAA.AG
GATGGACTAA
GTGACTGCTA
GAATATGCTT
GATAAATCAG
AAAGCAACAC
GACATTGTGG
GGTCCATTCG
AA.AAACGATC
GTGAAAGAAT
CTTTCTGGTG
GCATCAACAC
NTITCTGATIT
AAA.AGTCAT
GAGGCGGCGG
AAGTTTI'ACG
NCAGAAT'AG
ATAACAAAGT
CATACAAAAT
ATGACTATGT
ATCTGTATGC
AAI'IAGGAAT
CATACTITGA
AAAAATATGG
TCTTAGACGG
AATATTGGGA
CACCAACCGC
AATTAGCCCA
AATATATGGA
TTCAGAAAAT
AACATTCGGT
AACGTCAGAT
TGTCACAGAA
T'FrTATTGCA
CCAACCTGCA
TAAATTAAAT
TTACGGATGG
CTCTGTAAAA
TAAAGCAGTC
TTAC~rATTA
TAAAAAI'AT
CTTTGATGGT
TAAAGATACT
GTTGAACTTG
ACAAATGGCC
ACTAAATCAA
ACAGA=TCA
GTGATI'ACAT
AGCTCAAGCG
CAACAAGAAA
TTAAATAATrG
GGTGCAGCGG
AAAGATGCAA
CAACGAACAG
AATGGTGATrG
AGTGATACAG
GCTTTCCCAT
GCATCAAACA
CCTGGTACAG
GTGAAACTGG
TTCCAAGATG
TTAIT=TAGG
TATTTACT
CGTCTGGTGG
TGCCAACAC
TATATGAAGG
AAAAAGCAGA
AATGGTCAGA
TTGATCCAGC
CCA TGCTAA
AATTAGAAAT
CAITCTTrCC
GCGAAAGTGC
ATACAAAATG
ACTCAGTAGA
GACAAACAGA
AGGCAACACT
AAcAAcTI'TA
CGGTAAGGCA
TGATTTATCA
AATTTATCGT
AGTTTcTGAA
CGGTAAACCA
CACAGCTTCT
AGGGGAAAAA
CACTTTAGAA
GCAACGTCAA
TGTCTACAAT
GTCATTCAAG
TGTGAATGTC
CGATGTCGTT
AATGACCCAG CTI'TTG'ITAG TCAAAAAGAA CGTGATGAAM AATCACCATT TAG-AAATGCG 120 TABLE 1. Nucleotide and Amino Acid Scqeuences of E. faecalis Genes.
AACTTACGTA AAGCAATC TTACTCA-ATC GACCGTAAAC CGTTAGTTGA ATCAATCCTT
AGGGGATGG
EF041-2 (SEQ ID NO:158) M KLKKSLTFGV ITLFSVTI'LA ACGGGGTSDS SSASrCGGKAS GEQVLRVTEQ QEMPTADLSL ATXRISFIAL NNVYEGIYRL DKDNKIVQPAG GLTYKIKLNK DAXWSDGKPV TANDYVYGWQ RTVDPATASE YAYLYASVKN KSELCIKAVS OTELEITLEK ATPYFDYLLA FPSFFPQRQD IVE-KYG-kNA PFVLDGFDGP GTDTKWSFKK NDQYWDKDT KLOSVDVNVV KESPTALNLF SGELAQQMAN DPAFVSQKEA STQYMELNQR DEKSPFRNAN' LRKAISYS ID AAEKAE VS ED
GDAIAKGEKD
SNSESAVYNG
QDGQTDOVVL
RKALVESILR
EF041-3 (SEQ ID NO:159) TTGTG GAGGCGGCGG AACGTCAGAT AGCTCAAGCG CGTCTGGTGG CGGTAAGGCA 0 4 0* 000 *0 00 0 0. 0
AGTGGCGAAC
CTAGCAACAG
TTAGACAAAG
GATGGACTAA
GTGACTGCTA
GAATATGCTI'
GATAAATCAG
AAACCAACAC
GACAFI'GTGG
GGTCCATTCG
AAAAACGATC
GTG;AAAGAAT
CTTTCTGGTG
GCATCAACAC
AACTTACGTA
AGGGGATGG
AAGTTTTrACG
NCAGAATTAG
ATAACAAAGT
CATACAAAAT
ATCACT.ATGT
ATCTGTATGC
AATTAGGAAT
CATACTTTGA
AAAAATATGG
TCTTAGACGG
AATATTGGGA
CACCAACCGC
AATTAGCCCA
AATATATGGA
AAGCAATTTC
TGTCACAGAA
TTTTATTGCA
CCAACCTGCA
TAAATTAAAT
TTACGGATGG
CTCTiGTAAA.A
TAAAGCAGTC
TTACTTATTA
TAAAAA 1TAT
CTTTGATGGT
TAAAGATACT
GTTGAACTTIG
ACAAATGGCC
ACTAAATCAA
TTACTCAATC
CAACAAGAAA
TTAAATAATG
GGTGCAGCGG
AAAGATGCAA
CAACGAACAG
AATGGTGATG
AGTGATACAG
GCT='C CCAT
GCATCAA-ACA
CCGGTACAG
GTGAAACTGG
TTCCAAGATG
AATGACCCAG
CGTGATGAAA
GACCGTAA-AG
TGCCAACAGC
TATATGAACG
AAAAAGCAGA
AATGGTCAGA
TTWGATCCAGC
CCA=GCTAA
AATTAGAAAT
CATTC'TrCCC
G-CGAAAGTGC
ATACAAAATG
ACTCACTAGA
GACAAACAGA
CT'T7MITTAG AATCACCA 1T
CGTTAGTTGA
TGAI'TATCA
AATATCGT
AGTTTCTGAA
CGGTAAACCA
GACAGCTTCT
AGGGGA.AAAA
CACTTTAGAA
GCAACGTCAA
TGTCTACAAT
GTCATTCAAG
TGTGAATGTC
CGATGTCGTT
TCAAA.AAGAA
TAGAAATGCG
ATCAATCC T EF041-4 (SEQ ID NO:160) CGCGTSDS SSASGGGKAS GEQVLRVTEQ QEIMPTADLSL ATXRISFIAL GLTYKIKLNI( DAKWSEDGKPV TANDYVYGWQ KSELGIKAVS DTELEITLEK ATPYFDYLLA PFVLDGFDGP GTD)TKWSFKK NDQYWDKDIV SGELAQQMAII DPAFVSQKEA STQYMELNQR
GW
EF044-1 (SEQ ID NO:161) NNVYEG IYRL
RTVDPATASE
FPSFFPQRQD
KLDSVDVNVV
DEKSPFRNAN
DKDNI<VQPAG
YAYLYASVKN
IVEKYGKNYA
KES PTALNLF
LRKAISYSID
AAEKAEVSED
GDAIAKGEKD
SNSESAVYNG
QDGQTDDVVL
RKALVES ILR
TAAGATAAAA
ATTI'TAGTTA
GAAAATTCTA
ATAGAAACAA
ACrCGTCAA
GCAACACCTT
GGAATCAATG
rrCCGAACTA
TI'AGTTATAG
TrTTTTTAAC
CCAATCATAC
GCTCTAATAA
AGTCGAAAAA
ATGCTCGTCGA
TGCCAACTI'C
AATTGTrTTGG
CGTCTATAGG
AGG=TTAGT
AACATCAATA
TAAACTAAAA
TGAAGTATCT
TCTrAGTAGC
AATI'ACGTTA
GGCTGAAA.AT
AGGAATAGTA
TCTCAAAAAG
AAAAGCAGTA
GAAACTTCAG
ACAAATGTCG
TTAAACAATC
GAGAACTTAA
GGTCAAGTGA
TGAAkAAAATT CGAATGAkACC
CTAATCATTA
AAAGTGCCAG
AAGAAGCAAA
CACTCGTATT
ATTCAACACC
AAGAAGCCAT
AGTTTGTGTT
TAAA.AAACAA
CAGTTCTAC
CACCACTCAA
TTCTrAGAA
TAATITTAA-A
AACTGCTACC
TAATAAATAT
121 TABLE 1. Nucleotide and Ami'no Acid Seqeuences of E. faecalis Genes.
GAGCTATCTA TAAATACAAT TCCTACAAAA GAGATTAGAA TATTrCAGC GGCCGATAAC AGTATTCGCA CCGTTAAAGT AAATACAGAA TTAATTTAG GAACTAATAT TTCTTCAAAC GATGAACAAA ATAGATCGGG CACTTTATAC TTA'ITCAACA ATAAAAATGG TI'CGATATCT TTAATCACTC CTAACTACGC TGGCAATGTT ACGGATG-ATC AAAAAGACGT TATGCTAGA.A GTAATTCAAT AA EF044-2 (SEQ ID NO:162) MrKKLVCVI L\JIFLTCCSS QKANEPKXQE NSTNHTTSIK SSTNHYSSSI ETSSNNKLKE TSESASTTQT SSKSKNEVST NVEEANSLEA TPYA'JDLSSL NNPLVFNFKG MNVPTSITLE NLNSTPTATF RTXLFGAENG QVKE-AINKYE LSINTIPTKE IRIFSAADNS IRTVKVNTEL ILGTNISSND EQNRSGTLYL FNN~kNGSISL ITPNYAGNVT DDQKDVMLEV
IQ
EF044-3 (SEQ ID NO:3.63) ITGTAGT TCTCAAAAAG CGAATGAACC TAAAAAACAA
GA.AAATI'CTA
ATAGA.AACAA
ACTTCGTCAA
GCAACACCTT
GGAATGAATG
TTCCGAACTA
GAGCTATCTA
AGTATTCGCA
GATGAACAAA
TTAATCACTC
GTAATTCAA
CCAATCATAC
GCTCTAATAA
AGTCGAAAAA
ATGCTGTCGA
TGCCAAC'ITC
AATTGTTTGG
TAAATACAAT
CCGTTAAAGT
ATAGATCGGG
CTAACTACGC
AACATCAATA
TAAACTAAAA
TGAAGTATCT
TCTTAGTAGC
AATTACG ITA
GGCTGAAAAT
TCCTACAAAA
AAATACAGAA
CACTTATAC
TGGCAATGTT
AAAAGCAGTA
GAAACTTCAG
ACAAATGTCG
TTAAACAATC
GAGAACTALA
GGTCAAGTGA
GAGATTAGAA
TTAATTTTAG
TTATTCAACA
ACGGATGAT.C
CTAATCATTA
AAAGTGCCAG
AAGAAGCAAA
CACTCGTAT
ATTCAACACC
AAGAAGCCAT
TATTTTCAGC
GA.ACTAATAT
ATAAAAATGG
AAAAACACGT
CAGTTCTAGC
CACCACTCAA
TTCTTTAGAA
TAATrTITAAA
AACTGCTACC
TAATAAATAT
GGCCGATAAC
TTCTTCAAAC
TTCGATATCT
TATGCTAGAA
EF044-4 (SEQ ID NO:164) CSS QKANEPKKQE NSTNHTTSIK SSTNHYSSSI ETSSNNKLKE TSESASTTQT SSKSKNEVST NVEEANS LEA MNVPTS ITLE NLNSTPTATF RTKLFGAENG QVKEAINXYE IRTVKVNTEL ILGTNISSND EQNRSGTLYL FNNKNGSISL
IQ
TPYAVDLSSL
LSINTIPTKE
ITPNYAGNVT
NNPLVFNFKG
IRIFSAADNS
DDQKDVMLEV
4 EFO4S-1 (SEQ ID NO:165)
TAGCCAAAAA
TTAGCTTCTG
AATAAACCAA
AAAGCAATTG
C ITTTCCAGC
CACCCATTAT
AAATTAGATC
GACGGTAAAG
GACTATACAG
CATGATGGTA
ATCACTTATA
GTTTTACCAA
GTTCGTAAAA
TCTGI'rGAAT
ATAGGGAGG
TGTTAGTATT
CGGAAACAGA
ATGGCGGTAC
AAGAATTTTA
TTAACAATGA
AAGATCCCAA
ATGTGACAGC
GGATTCGITA
AATCGCCAAC
AAGAAGTTCA
AACATrCCTT
ACCCTCTGAC
ACCTACCAAA
AAAAGAGATG
CGGGTTAGCT
AGACAT7I'CA
ATI'AGATGTC
TCAAGACAAC
TGCAGACT
TACAGCAACC
CGATGACGTG
TGATGATAAC
cAITTCTGGC
CCCAGGAATG
TGAAGGAAT
TATTGGACCA
TGAGCATTAC
AACAAGAAAC
GCATGTGGTG
AAAATGCCAA
GCTGTACTTA
TATGATG-CAC
AAGATTGTCG
ATTAATTAC
ATITI'CTC'T
T~rACGA-ATA
ATAGAAAMAG
CAACAATTAG
GCTGTTAAAG
TACTACATGA
TACGGTGGTA
GGATTI'TAGG
GccGGcAATAA
TCGCTGTI'A)
TGGATACACA
AATACATGCT
ATGGGGGTCC
GTGACAATTT
ATGAAGTCAT
TTGTTGGCAT
TCAATCATAA
GTGGCGGTGT
ACATGGAATC
GTAATATTGT
AACCTAAA-T
TGCAATCACG
AGGCGGGGGC
AAATGATAAA
ATTCCAAGGA
TCCAACGGTA
TGCGGATCTG
GAAATGGTCT
TGGTCATA;A
GGAAGACTAC
AGAAGTTAAA
TrGGGGCTCA
AAGCGATGCA
GACAGGTGAA
AGATAAATTA
122 TABLE 1. Nucleotide and Amino Acid Seqeueiices of E.fcaecalis Genes.
GTGTTCAAAT
ATTCCATTAT
ATCTTAGGAC
AAAGAAACAA
CAAGCCATGG
CGAACAGGGG
AAAGGCTATA
GACGTAGACG
GCTTCAATGT
AAAGAAATTG
TATGATAAAT
GGTTCAGATC
TTTGAGTCAG
GAAGAAAAAC
GTA.ATCCCAA
ACTTGGGCAG
CTGTTCCTTC
CAATGCCAAC
GTCCCGAACA
ATACAGTGAA
CCTATGCAAT
CAACAACGTT
CGCTTGA=I
GCGATGGCAT
CAGGCGGCGA
GCTTAAACGT
TGAAAAATGA
CTTCACCAAC
AAGAAAATAC
GTAAAAAAGC
CCCTTTACAG
TTGATACGAA
TGCGAGCATT
AGATACGTAT
AGCCTACACG
ATACAATCCA
TGACAATCAT
AATCCCACCA
AGACAAAGCG
TCGCGAAGAC
PLACTGCACA.A
AACGTATACA
TGACCCAGAA
CGGCTrATAT
TAAATTACTT
CTTCTACGAT
AAATGAAGTC
AGATAATCCA
GTAGAAGCGA TGAAAGCGAA CCAACATACA AAGATACTGA TATATTGG;CT TTAAAATCG AAAGCTAAAA TGGCAGATAA GCAGTCGGCC AAAAATTCTA GTCTTCAAGA
GCTTGCATGA
AAAAAATTAT TAGACGATGC AAAGAAGGCA AACCACTAGA CCACTTGCTO.
ATTACTATT
ACAGGACGCT TAATTG-AT=t GTAGATATCT
ATCAAGGCGC
_GjtCCAACT CACCCTIAA GATGCGATTG ATTCA.AAGC TGGCAAGAGT ATGCCAkTTGA rI'CCCTGTCA
ACGACCGTGT
TGGG~cAACG-G TGGGTfGTCAC
ACAATACGAT
AGGGTATCAA
TACGTTGAC
AAGCTTACGT
CAACGGCTTA
TAGCGAAGCG
TGGTTATAAA
AATCAAGTTT
CCAACAATGG
CCAAGCATTC
GTGGGGCACA
CTATACACGT
ATCATTAT
TGAAGCGT
AGTTGACTTT
AGCAGACTCA
4
CGGAAATAA
EF045-2 (SEQ ID NO:166) MN KKRILGAITL ASVLVFGLAA CGGGNKGGGN KATETEDISK MPIAVKND AIDfGTLDVA
LDEDANTATI
DGKSPTISGI
RKNPVTIGPY
ALSMPTDTYP
AMGYAIDNDA
VDGDGIREDK
DKLKNEDDPEV
E:KRKKAFYDW
VVMDTQFQGL
KLRONLKWSD
EKVNDKEVK I
YMSNIVTGES
TYKDTEGYQI
VGQKFYNGLR
EGKPLEIKFA
DIYQGAWGTG
QEYAIDEAFV
FQQEFYQDNY
GKDVTADDVI
TYKEVHPGMQ
'JEYLPNE}{YY
LGRPEQAYTY
TGATTLIPPV
SMSGGETAQP
SDPSPTGLYG
IPTLYRNEVL
DAQYMLPTVQ
FSYEVIGHKD
QLGGGVWGSV
GGKPKLDKLV
IGFKMGTFOK
FKSLHDSEAY%
LADYYVQQWKI
PNSAFNYTRF
PVNDRVVDFT
YTG IRYDONF
LPKHAFEGIA
FKSVPSASIV
ETNTVKYNPK
GYTLDLDKAK
EIGLNV'TYTT
ESEENTKLLD
WAVDTKDNPW
XK
IVDGGPADLK
TNIVGMEDYH
VKDMESSDAV
EAMKAKQYDI
AKMADKSLRQ
KLLDDAGYKD
GRLIDFQAFY
AIDSKASFDE
ATVGVTADSR
K
EF045-3 (SEQ ID NO:167) ATGTGGTG GCGGCAATAA ACGCGGGGC
AATAAAGCAA
AAAGCAATTG
CT'ITTCCAGC
CAGCCATTAT
AAATTAGATG
GACGGTAALAG
GACTATACAG
CATOATOCTA
ATCACTTATA
GTTTACCAA
GTTCGTAAAA
TCTGTTGAAT
GTGTTCAAAT
ATTOCATTAT
ATC ITAGCAC
AAAGAAACAA
CAAGCCATGG
CGAACAGGGG
AAAGGCTATA
CGGAAACAGA
ATGGCGGTAC
AAGAATTITA
TTAACAATGA
AAGATGCCAA
ATGTGACAC
GGATTCGTTA
AATCGCCAAC
AAGAAGTTCA
AACATGCCTT
ACCCTGTGAC
ACCTACCAAA
CTGTTrCr
CAATGCCAAC
GTCCCGAACA
ATACAGTGAA
GCTATGCAAT
CAACAACGTI-
CGC=TATTI
A.GACA=TCA
ATTAGATGTC
TCAAGACAAC
TGCAGACTTT
TACAGCAACC
CGATCACGTG
TGATGATAAC
CATTTCTGGC
CCCAGGAATG
TGAAGGAATT
TATTGGACCA
TGAGCATTAC
TGCGAGCATT
AGATACGTAT
AGCCTACACG
ATACAATCCA
TGACAATGAT
AATCCCACCA
AGACAAAGCG
AAAATGCCAA
GCTGTAGTTA
TATGATGCAC
AAGATTGTCG
ATTAAATAC
ATTTTCTCTT
TTTACGAATA
ATAGAAAAAG
CAACAATITAG
GCTGTTAAAG
TACTACATGA
TACGGTGGTA
GTAGAAGCGA
CCAACATACA
TATATTGGCT
AAAGCTAAAA
GCAGTCGGCC
GTCTTCAAGA
AAAAAATTAT
TCGCTG'ITAA
TGGATACACA
AATACATIGCT
ATGGGGGTCC
GTGACAATTT
ATGAAGTCAT
T'rGTTGG.CAT
TCAATGATAA
GTGGCGGTGT
ACATGGAATC
GTAATATrGT
AACCTAAAT'
TGAAAGCGAA
AAGATACTGA
TTAAAATGGG
TGGCACGATIA
AAAAATTCTA
GCTTGCATGA
TAGACGATGC
AAATGATAAA
ATTCCAAGGA
TCCAACGGTA
TGCGGATCTG
GAAATGGTCT
TGGTCATAAA
GGAAGACTAC
AGAAGTTAAA
7TrGGGGCTCA
AAGCGATGCA
GACAGGTGAA
AGATAAATTA
ACAATACGAT
AGGGTATCAA
TACGTTTGAC
AAGCTTACGT
cAAcGcTTA
TAGCGAAGCG
TGGTTATAAA
123 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes.
GACGTAGACG
GCTTCAATGT
AAAGAAAITG
TATGATAAAT
GGTTCAGATC
Tm'TGAGTCAG
GAAGAAAAAC
GTAATCCCAA
ACTTGGGCAG
CGGAAA
GCGATGGCAT
CAGGCGGCGA
GCTTWACGT
TGAAA.AATGA
CTTCACCAAC
AAGAAAATAC
GTAAAAAAGC
CGCTTACAG
TTGATACGAA
TCGCGAAGAC
AACTGCACAA
AACGTATACA
TGACCCAGAA
CGGCTTATAT
TAAATTACTT
CTTCTACGAT
AAATGAAGTC
AGATAATCCA
AAAGAAGGCA AACCACTAGA CCACT1'GCTG ATTACTATGT ACAGGACGCT TAANTTGATrT GTAGATATCT ATCA-AGGCG-C GGTCCAA.ACT CAGCCTTPA GATGCGATTG ATTCAAAAGC TGGCAAGAGT
ATGCCATTGA
TTCCCTCTCA ACGACCGTCT TGGGCAACGG
TGGGTGTCAC
AATCAAGTTT
CCAACAATGG
CCAAGCATTC
GTGGGCACA
CTATACACGT
ATCATTTGAT
TGAAGCGTT
AGTTrGAC=F
AGCAGACTCA
EFO4S-4 (SEQ ID NO:168)
CGGGNKGGGN
Al DGGTLDVA
LDEDANTATI
DGKSPTISGI
RKNPVTIGPY
ALSMPTDTYP
AMGYAIDNDA
VDGDGIREDI(
DKLA(NODPEV
EKRKKAFYDW
KATETEDISK
VVMDTQFQGL
KLRDNLKWSD
EKVNDKEVKI
YMSNIVTGES
TYKDTEGYQI
VGQKFYNGLR
EGKPLEIKFA
D IYQGAWGTG
QEYAIDEAFV
MP IAVKNDKK
FQQEFYQDNY
GKDVTADDVI
rYKEVHPGMQ
VEYLPNEHYY
LORPEQAYTY
TCATTLIPPV
SMSGGETAQP
SDPSPTGLYG
IPTLYRNEVL
DAQYMLPTVQ
FSYEV IORKO
QL.GGGVWGSV
GGKPKLDKLV
IC FKMGTF OK
FKSLHDSEA-K
LADYYVQQWK
PNSAFNYTRF
PVNDRVVDFT
PLFNNDADFK IVOGGPADLK YTr I RYDD)NF TNIVGMEDYH LPKHAFEGIA
VKDMESSDAV
FKSVPSASIV EAi'ThAKQYDI ETNTVKYNPK
AKMADKSLRQ
GY'TLDLDKAK
KLLDDAGYKD
EIGLNVTYTT
GRLIDFQAFY
ESEENTKLLD AIDSKASFDE WAVDTKONPW ATSJCVTADSR EF046-1. (SEQ ID NO:169) 0 0
TACCAGGATA
TGTAGTA.ATA
CAGCAAACTA
ACATCATCTA
GATGATGTTT
ACGGATAAAG
AATTATATTA
TAATGAAAAA
ATACTGGGGG
CCCAGTCTTC
TAACAATTGA
CAAAAACTAG
AACTAAAGGA
AGCAAAAA
AAAACTTA T
AAAAAATAGC
TAAAAAAGAT
AACAACCGAG
ACGACAATTG
ATATATATCA
GTACTATIGT TAGCCTTA GACGCTTCAT CTACTGAAGT AGTACTAATC CGGACACAAC AATTTAAAGA ATAGAGAATT TATGAACA.AG GAATTAACAG GAGGCTAAAG AACAAAAGAA
TTTAACGGCA
ATCAACTAAG
ACCAACTTCT
GAATCCAACA
TTCAACAATT
AGATGTCATT
EF046-2 (SEQ ID NO:170) MKKKLIV LLLALFLTAC SNNTGGKNSD ASSTEVSTKQ QgQSSKYDS
SNFDTTPTST
SSITIETTEN LKNRELNPTD DVSKTRRQLY EQGINSSTIT DKELKEYISE AXEQKKDVIN
YIKQK
EF046-3 (SEQ ID NO:171)
A
TGTACTAATA ATACTGGGGG CACCAAACTA CCCAGTCTI'C ACATCATCTA TAAcAATTGA GATGATGTI'T CAAAAACTAG ACGGATAAAG AACTAAAGGA AATTATATTA AGCAAAAA
AA.AAAATAGC
TAAAAAAGAT
AACAACCGAG
ACGACAATTG
ATATATATCA
GACGCTTCAT CTACTGAAGT AGTAGTAATC
CGGACACAAC
AATTTAAACA ATAGAGATT TATGAACAAG GAATTAACAG GAGGCTAAAG AACA.AAAGAA
ATCAACTAAG
ACcAAcTTcT
GAATCCAACA
TTCAACAATT
AGATGTCATT
124 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
EF046-4 (SEQ ID NO:172) C SNNTGGKNSD ASSTEVSTXQ QTTQSSKKDS SNPDTTPTST SS ITIETTEN LKNRELNPTD DVSKTRRQLY EQG INS STIT DKELKEY ISE
YIKQK
AKEQKKDVIN
EF047-1. (SEQ ID NO:173)
TAGGGAAAC
FI'AACCCTTG
TCCAGTGCTG
AGTAGCAAAG
TCGAAAGCAA
TCACCAGCGA
AACATrGTTTC TTAkACTGCAG
GAAAAAGAAG
T'IGAGAAAA
GACAATGGCG
GCCGTTCTA
AATATCAATG
GAAACATTAC
GACTATAATC
TTTGACCCA.A
AAGGAGGAAT
CTTTAGCAGC
CGGAAACTAC
CAACAACAAA
CAGCTI'CTTC
AAAATGCGAA
CTGGTCAAG
CGACAACTTC
CGATTCCAGT
AAACATATG
GTCAACCAGT
GTI'ACITATC
GTCAATCGCC
CAGCACCGAA
GAAACTCAGT
TTCAAGCAGT
TCTTATGAAA
ATGCGGCGGC
CACAGTCGAG
ATCTAGTGAT
TACGAAGGAA
TCCAGATGAC
CTTACCGCAG
ACAAGCGGAT
GAATGATGCA
CTCTGATGCC
AGATITAGGT
TTGGCAAGAA
TGATGAT'ITA
TACCGTTGGT
AGTTTGGCAA
GAAGATGGCA
AAGATAGGGC TTP GGAAAAAGTA CAC AGTACAAAkAG CAI GCGAAACCGT CAC GCGGCAAATA ATC CAAGCCAACC AAC GCAATrTTAA CGI CAAAACAATI TC( CGTGTCAATC AG~ G.AAGCAAAAA AT( TACA.ATATTA CT( GGCAATTGGA GT GCGAAAAATG Tr( CAAATCACAC TG~ GCCGGTACAG TCC ACATCAATGT AA
;AAAATAC
'CTGCTAC
;GA-ACA.AC
;GcTCAGC
TGCTITAA
.GTCAAAC
:GTGT=~
ETAACGCC
~CACTrCAA ;GG 'ATAA rTAGTCGT
:;TCAACAT
A.ACGTGGC
GT7TTACTC
GGATAGTCGT
AAALAGAATCA
AACAGCTGAT
AGAGAAGCAA
CCAGCTAGCA
GAATAACTT
ATATTATGCA
AATTAG'ITCT
CCAAATCAT
ACAAGGGGCG
ACGGGCCTCA
TTTGGAACAA
AGGAACCACT
TGTCCATCAT
TCTAG TGCITTCTT EF047-2 (SEQ ID NO:174) MKK IGLISSAFLL TLALAACGGG KSTF.NTDSRS SAAESTTVES TKASATKESS SKATTKSSDA KPSGTIADS KATASSTKEA ANNGSAEKQS PAKNANPDDQ ANQVLNQLAN MFPGQGLPQA ILTSQTNNFL TAATTSQADQ NNFRVLYYAE KEAIPVNDAR VNQLTPISSF EKKTYGSDAE AKNAVNQIID NGGQPVDLGY NITGYKQGAA GSSYLSWQEG NWSLVVRASN INGESPDDLA KNVVNILEQE TLPAPNT-VGQ ITLNVAGTTD YNRNSVVWQA GTVVYSVHHF DPIQAVKMAT SM EF047-3 (SEQ ID NO:175)
ATGCGGCGGC
TCCAGTGCTG
AGTAGCAAAG
TCGAAACCAA
TCACCAGCGA
AACATGTITC
TTAACTGCAG
GAAAAAGAAG
TI'TGAGAAAA
GACAATGGCG
GCAGGTTCTA
AATATCAATG
GAAACATTAC
GACTATAATC
'rITGACCCAA
GGAAAAAGTA
CGGAAAGTAC
CAACAACAAA
CAGCTI'CTTC
AAAATGCGAA
CTGGTCAAGG
CGACAACTTC
CGAT1'CCAGT
AAACATATGG
GTCAACCAGT
GTTACTTATC
GTGA.ATCGCC
CAGCACCGAA
GAAACTCAGT
TTCAAGCAGT
CAGAAAATAC
CACAGTCGAG
ATCTAGTGAT
TACGAAGGAA
TCCAGATGAC
CTTACCGCAG
ACAAGCGCAT
GAATfGATGCA
CTCTGATGCC
AGATTTAGGT
TTGGCAAGAA
TGATGATTA
TACCGTTGCT
GGATAGTCGT
AGTACAAAAG
GCGAAACCGT
GCGGCAAATA
CAAGCCAACC
GCAATTTTAA
CAAAACAATT
CGTGTCAkATC
CATCTGCTAC
CAGGAACAAC
ATGCTCACC
AAGTGCI'TA.A
CGAGTCAAAC
TCCGTGTTTT
AGTTAACCCC
AAAAGAATCA
AACAGCTGAT
AGAGAACCAA
CCAGCTAGCA
GAATAACTTT
ATATTATGCA
AATTAGTTCT
CCAAATCATT
ACAAGGGGCG
ACOGGCCTCA
TTTGGAACAA
AGGAACCACT
TGTCCATCAT
GAAGCAAAAA ATGCAG GAA TACAATATTA CTGGGTATAA GGCAATTGGA GTTTAGTCGT GCGAAAAATG TTGTCAACAT CAAATCACAC TGAACGTGGC AGTTICGCAA GCCGGTACAG TCGT'TrACTC GAAGATGGCA ACATCAATGT AA EF047-4 (SEQ ID NO:176) 125 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
CGO KSTENTDSRS SAAESTTVES TKASATKESS SKATTKSSOA KPSGTTI'ADS KATASSTY<EA ANNGSAEI(QS MFPGQGLPQA ILTSQTNNFL TAATTSQADQ NNFRVLYYAE EK:-TYGSDAE AXNAVNQIIED NGGQPVDLGY NITGYKQGAA INGESPDDLA KNVVNIELEQE TLPAPNTVGQ ITLNVAGTTD DPIQAVKMAT SM
PAKNANPDDQ
KEA IPVNDAR
GSSYLSWQEG
YNRNSVVWQA
ANQSJLNQLAN
VNQLTPISSF
NWSLVVRASN
G;TVVYSVHHF
EF048-1 (SEQ I0 NC:177)
TAAGGAGAAA
GGATTACAGT
TTGCGCTATG
GAATTAGTAG
TTAGGAGGAG
GTCAGTGGAT
ATTTIGATA
GTCTATGATT
CGAAGTTT
ATTCGGGTCA
GTACCTATGG
GAGAATAATG
TATGATGAGC
TTGACAGCGA
AAAACGGTCT
GTTGTiGTTCA
TCATTAAAA
AGTTCATGAA
TGCTAAGTGG
CGTATGCTAG
AGGAAA.AAAC
AAACAGAACT
CAGCATTAGA
ATGAAAAACA
CTACAAAAAA
ATATGAGCAA
TGCAAAGTGA
GTAGTTCGGA
AGTTCGTI'1T
ATACTCGAGT
AACAAGAACA
TTAAAGAAC
ATCAAGTAGA
ATAGCTCAGA
AAAAAGAAAG
T'TGCGGCAAA
TAATAGCCAA
AGATGGTAAA
AATTAATTA
APAG=T7CT TrTTTTAAA
ATTAGGATTT
AGGGCCTGTT
AACCGCCATC
AGTATATACT
ATATACAGCT
GCCAGATATT
AGCGATTGAA
GGTTGAAGAA
CAGTGAACCA
ACATGGCGAA
G~tTTATTTA
ACAGAAGCTT
CCAGTTATCG
GTTCAAATTG
ACACAAACAG
AAAGATTATIT
GTAATGGATA
GTTGGTTTAA
ACATCTCCAG
AAAATGGTAG
TCTCTACA.AT
GGTCATGGTG
GTGATTATGA
GAAGCAGCAA
GAAAAGAAAA
TTCCAAAAAC
CTGTATCAGG
CAGCAGTTAT
CGGCA AATGA ATITC TATGAA
AATA=TCC
GTGCAAT-TGA
CTGTAT-ITtC
ATCA.AGCGCT
CTATTATGA
ATGA=FTGAA
AACT=TAGG
CTAATCTAAT
GTGTrGGCTAA
ACGACGGGAAC
AAGAATCGAC
AAGCACAAC
TTGTTCAACC
CTATTCGCCA
GGTATTGGCCA
TACGGTAGTC
GAAATTCGGT
AGATGGTCAA
TTTTGCA.AAG
CATTCCGTAT
AATGCAACCA
CTCTGGTCAA
AGGTAAAAAA
GGGTTCGCCA
CAACGCTGCA
GTATTATTCT
AAAAGAACGT
CGCTTTTGA
AGAATATGGC
GTTGCATGAA
GTTGGCGGAC
EF048-2 (SEQ ID NO:178) MKKRKV LFTAVMVLAG LQLLSGCGKT E-ASANDTVVL RYAYASNSQP VIDSMKKFGE LVEEKTDGKV Q IEYFPDGQL GGETELIELT QTGAIDFAXV SGSALESFSK DYSVFAI PY I FDNEKHFFXV MDNQAL.MQPV YDSTKXLGFV GLTYYDSGQR SFYMSKGPVT SPDDLKGKKI RVMQSETAIK MVELLGGSPV PMGSSEVYTS LQSNLINGAE NNEFVLYTAG HGGVAKYYSY DEHTRVPDIV IMNEGTKERL TAKQEQAIEE AAKESTAFEI( T\TFKEAVEEE KKKAQAEYGV VFNQVDSEPF QKLVQPLHES FKNSSEHGEL YQAIRQLAD EF048-3 (SEQ ID NO:179)
TTGCGGCAAA
TTGCGCTATG
GAATI'AGTAG
TI'AGGAGGAG
GTCAGTGGAT
ATTITTTGATA
GTGTATGATI'
CGAAGTTTT
ATCGGGTCA
GTACCTATGG
GAGAATAATG
TATGATGAGC
ETGACAGCGA
ACAGA.AGCTT
CGTATGCTAG
AGGAAAAAAC
AAACAGAACT
CAGCATTAGA
ATGAAAAACA
CTACAAAAAA
ATATGAGCAA
TGCAAAGTGA
GTAGTrCGGA
AGTTCGTTTT
ATACTCGAGT
AACAAGAACA
CGGCAAATGA
TAATAGCCAA
AGATGGTAAA
AATTGAATTA
AAGTTTT'TCT
TTTrTTrTAAA
ATTAGGATTT
AGGGCCTGTT
AACCGCCATC
AGTATATACT
ATATACAGCT
GCCAGATATT
AGCGATTGAA
TACGGTAGTC
CCAGTTATCG ATTCTATIGAA GTTCAAATTG AATAT'rTrCC ACACAAACAG GTGCAATIGA AAAGAWIATT CTGTA7MI'C GTAATGGATA ATCAAGCGCT GTTGGTIrTAA
ACATCTCCAG
AAAATGGTAG
TCTCTACAAT
GGTCATGG TG
GTGATTATGA
CTTA'ITATGA
ATGATTI'GAA
AACTF1=AGG
CTAATCTAAT
GTGTG.GCTAA
ACGAGGGAAC
GAAA'FTCGGT
AGATGGTCAA
TTTTGCAAAG
CATTCCGTAT
AATGCAACCA
CTCTGGTCAA
AGGTAAAAAA
GGGTTCGCCA
CAACGCTGCA
GTATrATTCT
AAAAGAACGT
GAAGCAGCAA AAGAATCr.AC CGCITTIGAA 126 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. fuecalis Genes.
AAAACGGTCT TrAAAGAAGC GGTTGAAGAA GAAAAGAAAA AAGCACAAGC AGAATATGGC G=TGT TCA ATCA-AGTAGA CAGTGAACCA TTCCAAAAAC TTGTTCAACC GTTGCATGAA TCA'PrCAAAA ATAGCTCAGA ACATGGCGAA CTGTATCAGG CTAT-TCCCCA GTTGGCGGAC
TAA
EF048-4 (SEQ ID NO:180) CGKT EASAN1TVVL RYAYASNSQP VIDSMICKGFGE LVEEKTDGKV QIEYFPDGQL GGETELIELT QTGAIDFAXV FONEKHFFKV MDNQALMQPV YDSTKKLGFV GLTYYDSGQR RVMQSETAIK MVELLGGSPV PMGSSEVYTS LQSNLINGAE DEHTRVPDIV IMNhEGTKERL TAKQEQAIEE AAXESTAFEK VFNQVDSEPF QKLVQPLHES FKNSSEHGEL YQAIRQLAD
SGSALESFSK
SFY-MSKGPVT
NNEFVLYTAG
TVFKEAVEEZ
DYSVFAIPYI
S?DDLKGKKI
HGGVAKYYSY
KKKAQAEYGV
EF049-1 (SEQ ID NO:181)
TGAGACTCT
AAAACAGGAG
GCCGTTrTA
TCAACAATTA
GCG'I'AGCC
GACAAAGANA
AAGTTCTCTG
AAACA.AAGAG
TTAAAA.ACAG
TCAGAAGATG
ATTGCTAAAG
GAI'CACAAG
GGCGA.AGTGT
GTAGTGAAAA
ATCAAGAAA.A
AGTGACGAAT
GCAGCCTACA
AGTGATTCTA
AGCAAAACAA
TCTTTT'CAA
TGCATAAGAG
GTTTAGCAGC
CTGTTGATGA
AAATCCGTTAT
TTCAAAAAAA
ATGCATTAAA
CAGCCTATGA
C'ITGGGCAAG
ATGCCAAAGC
AAAAATCAAC
CAACA.ACTGT
CAGAACCAAT
TGACCAAAAA
TTGCTGAAGA
TAAAAGCGGC
TrGCAAACTGA
AA.ACAAGCGA
CAGAATCTTC
AATGAGGTAT
AATGAAGAAA
GTGTTCAAGC
TTTTTATAAC
TTATAAAGTC
C=TGACGAA
ACAAGCTrGGT
TGCAGGTCTA
=rCCATCCA
TGTCAAGAAA
ACATACTGCT
TCCTGCCGAA
TGCTGCAACA
CAAAGCAAAA
AACAAAATTA
CAATGTGAALA
ATCTTCAAGC
TACCAAAACA
ICTAAATAA
GGTATAGTTA
AAACTAATCT
GGTTCAAAAG
CAAATTAAAG
TTTGAAGAAA
GCCAAAGAAC
TTAACTGAAA
AAACCCCACT
GAAGTAGAAG
GAAATCACTG
ACGAAAAAAG
GTI'AAAGAAG
AATATGCAAA
GCCAATGACA
GCCGATCAAA
ATTAAMGATG
GCTTCTTCAG
AGCGACTCTG
TAACAGANAT
TAGCTGCAC
ATATCGCAAC
AACAAAGCAC
AATATGCCGA
AAGTAGAAC
AAACATTCAA
TAAAAATTAC
CACAAATTAT
ACGGCGGCGA
ATGGCCTAA
CTGCCTTCAA
CCTACCAAAC
TGAAACC ITA CATTrTTTC
ATGCCT'CAA
AGAAAAAAGA
AAAAAGCAAC
AAAACTANAA
GGGCGCAATG
AATGAAAGGT
TAGCCAACAA
CAAAGTAACT
ACAAGGCGGA
GAAACAGTTA
AGATGAAGAC
CCAAGTTGCT
TTTCACAAAA
AATTAAATTT
ATTAAAAGAT
AACCTACTAT
TGAAAAAGAG
GAAAGTCATT
GAACGC=TA
ATCAAAATCA
AGATI'CTTCA
EF049-2 (SEQ ID NO:182) MKKK LILAAAGAMA VFSLAACSSG SKDIATMKGS TITVDDFYNQ IIEQSTSQQA FSQMVIYKVF EEKYGDKVTD FSDALKQAGL TEKTFKKQLI( QRAAYDAGLK AHLKITDEDL EDDAXAVKKE ITDGGDFTKI AI(EKSTDTAT KKDGGKIKFD EVSEPIAATN MQTYQITYYV VKMTKNKAXG NDMKPYEKEI DELKAANVKI KDDAFKNALA GYMQTESSSA SSEKKESYSS
KTTESSSK
KXIQKNFDE-A
KTAWASFHPE
SQATIPAEV
KKIAEET KLA
DSKTSDTKTS
KEQVEAQGGK
VEAQI IQVAS KEAkAFKLKDG DQTFVSKV IS
DSEKATDSSS
EF049-3 (SEQ ID NO:183)
GTGTTCAAGC
TCAACAATTA
GCGT~T AGCC
GACAAAGANA
AAGTTCTCTG
GGI'CAAAAG
CTGTTGATGA
AAATGGTTAT
TI'CAAAAAAA
ATGCATTAAA
ATATCGCAAC
TTTTTATAAC
TrATAA.AGTC
CITTTGACGAA
ACAACCTGGT
AATGAAAGGT
CAAATTAAAG
TTTGAAGAAA
GCCAAAGAAC
TTAACTG'AA
AACAA.AGCAC TAGCCAACAA AATATGCGA CAAAGTA.ACT AAGTAGAAGC ACAAGGCGGA AAACATTCAA GAAACAGTTA 127 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Gencs.
AA.ACAAAGAG
TTAAAAACAG
TCAGAAGATrG
ATTCTAA-AG
GATI'CACAAG
GGCGAAGTGT
CTAGTGAAAA
ATCAAGAAAA
AGTGACGAAT
GCAGGCTACA
AGTCA 1TCTA
AGCAAAACAA
CAGCCTATGA
CTTGGGCAAG
ATGCCAALAGC
AAAAATCAAC
CAACAACTGT
CAGAACCAAT
TGACGAAAAA
rGCTCAAGA
TAAAAGCGGC
TGCAAACTGA
AAACAAGCGA
CAGAATCTTC
TGCACGTCTA AAAGCCCACT T'FCCATCCA GAATAGAAG TGTCAAGAAA GAAATCACTG AGATACTGCT ACGAAAAAAG TCCTGCCGAA CrAAAGAAG TGCTGCAACA AATATGCAAA CAAAGCAAAA GGCAATGACA AACAAAATI'A GCCGATCAAA CAATGTGAAA AITAAAGATG ATCITCAAGC GCTTCTTCAG TACCAAAACA AGCGACTCTG
TTCTAAATAA
TAAAAATTAC
CACAAATTAT
ACG(GCGGCGA
AT-GGCGGTAA
CTCCTCA-A
CCTACCAAAC
TGAAACCTTA
CATTTr.TTTC
ATGCCITCA-A
AGAAAAAAGA
A.AAAAGCAAC
AGATGAAGAC
CCAAGTTGCT
TI'CACAAAA
AATTAAATTT
ATTAAAAGAT
AACCTACTAT
TGAAAAAGAG
GAAAGTCATT
GAACGCTTTA
ATCAAAATCA
AGATTCTTCA
EF049-4 (SEQ ID NO:2.84) a CSSG SKDIATHKGS TITVDDFYNQ IKEQSTSQQA FSDALKQAGL TEKTFKKQLK EDDAKAVKKE ITDGGDFTKI EVSEPIAATN MQTYQTTYYV DELA(AANVKI KDDAFKNALA
KTTESSSK
FSQMVIYK'JF
QRAAYDAGLK
AKEKSTDTAT
VKMTKWKAKG
GYMQTESSSA
EEKYGDKVTD
AHLKITDEDL
KKDGGK IKFD
NDMKPYEKEI
SSEKKESKSS
KX IQKNFDEA
KTAWASFHPE
SQATTVPAEV
KKIA=ETKLA
DSKTS=TTS
KEQVEAQGGK
VEAQI IQVAS
KEAAFKLKDG
D)QTFVSKVIS
DSEKATDSSS
EF050-1 (SEQ ID NO:185)
TAGGGTCTGG
AAAGGATGNA
CTTGTTCTAT
GCCACAAAAG
GGTTACGCAG
GCCACAACTC
GGAGCI'TAA
AACAATGAAT
TATCAAGTGA
TATI'TAGGAA
GGAGCACCTT
GAAAACGGCC
GTTCCAGAAA
GATGGAACGG
ACTCAAGTCA
AATCCTGAAG
AAAAGCAGTC
AAAAAATGAA
TATTAAGTGC
AAGCAACTGT
GAACTAAAAA
AAGAAI'AGT
GTGGCAAAGC
GGAATCAAAC
ACAAAGCCAA
CCTATGGGCA
TCAATCAT'T
CCACAGAAAA
AACTGCCCAT
TGCCGTTATC
CTAGCCAGAT
TAGATCAAT'
AACTGACT'1C
CATGCCCAAA
TTGCCAAATT
TCACTTAAAC
TTCGTTTGGC
GCTACTCGTT
GACGAATCCC
AGAATGGATA
TATTGTCGGG
AGATACATCG
TATTGATACG
AAGTAGCCGC
TI'TATTAATT
TAGTGCCTTA
TATTAAAGGA
GCTAATCGAA
ITTTTCCAAGC
AATATCNT
GGGGCAACTA
CGCACAACAA
TCGTTACTC
AAACCTGATG
AGTGTTCAAG
AAAAACACAT
CACTCTATGG
TACCTAAAA
AGTCAACAGC
TATTGGATT
GGTGGTCAAT
GCAGTCAACG
GAAAAGCAC
TTICTATGGC
CCTTTFIAG
ATTT'ITCIT
CGAAGGATGA
CACCAACGCT
ATCGC'ITGGA
GGACCGTGGT
TTCTAT1rGA
TACTCTATT
GTGGTlGITAG
TTGAAAAATT
AAACCATCGA
ATCAAGAGAT
TAAGTCCAAC
CCTTGCTAAG
AACATAGTCA
CGAGTAAAAA
TTCATCCCAG
GCTAATGGGT
CAACCAAGCC
TTI=ICAT
GAAACAAGGT
TAAAGAGCGA
AGATAATAAA
ACAAAAAAAT
TGGITACGT
CGTCAGCATT
AACCGAACTA
GATTAATGTT
AGAT1'TAAGT
ACAGCAGGA
ATTACATGA.A
ATAG
a a EF050-2 (SEQ ID NO:18 6 MNMPKN IXYFSLLMGL VLLLSACQIG A'ITKDDNQAA TKEATVELNR MrTTTLFFHG YAGTXNSFGS LLHRLEKQGA ALSGKATNPS VQVLFEDNKN NEWNQTEWIK NTLLYLQK1NY LGTYGQDTSL PKIEKFVSIG APFNDFIDTS QQQTIETELE PEKLPILLIG GQLSPTDLSD GTVPLSSALA VNA.LLRQRGT PEVDQLLIEF LWPSKK
TTQELVLLIVK
QvNKANIVGH
NGPTEKSSRY
QVTSQI IKGE
PDGTVVKERG
SMGGVSGLRY
LDYQEMINVV
NAQHSQLHEN
EFOSO-3 (SEQ ID NO:18 7 128 TABLE 1. Nucleotide and Anlino Acid Seqeuences of E.f[aecalis Genes.
TTGCCAA.ATT
GCCACAAA.AG
GGTTACGCAG
GCCACAACTC
GGAGCTTTAA
AACAATGAAT
TATCAAGTGA
TATI'TAGGAA
CGAGCACCTT
GAAAACGGCC
GTTCCAGAAA
GATGGAACGG
ACTCAAGTCA
AATCCTGAAG
GGGGCAACTA
AAGCAACTGT
GAACTAAAAA
AAGAATTAGT
GTGGCAAAGC
GGAATCAAkAC
ACAAAGCCAA
CCTATGGGCA
TCAATGATTT
CCACAGAAAA
AACTGCCCAT
TGCCGTI'ATC
CTACCCAGAT
TAGATCAATT
CGAAGGATGA
TGAGTTAAAC
TTCGTTTGGC
GCTACTCGTT
GACGAATCCC
AGAATGGATA
TATTrGTCGGG
AGATACATCG
TATTGATACG
AAGTAGCCGC
TTATTAATT
TAGTGCCTTA
TATTAAAGGA
GCTAATCGAA
CAACCAAGCC
CGCACAACAA
TCGTTACTGC
AAACCTGATG
AGTGTTCAAG
AAAAACACAT
CACTCTATGG
TTACCTAAAA
AGTCAACAGC
TATTTGGA1T
GGTGGTCAAT
GCAGTCAACG
GAAAATGCAC
=TCTATGC
CACCAACGCT ?rTTTTTTCAT ATCGCTTGGA GAAACAAGGT CGACCGTGGT TAAAGAGCGA TTCTATTTGA AGATAATAAA TACTCTAFI'T ACAAAAAAAT GTGGTGTTAG TGGTTTACGT TTGAAAAATT CGTCAGCATT AAACCATCGA AACGGAACTA ATCAAGAGAT GATTIAATGTT TA-AGTCCAAC AGATTTAAGT CCTTGCTAAG ACAGCGAGGA AACATAGTCA ATTACATGAA CGAGTAAAAA ATAG EF050-4 (SEQ ID NO:188) CQIG A ETKDDNQAA TKE-ATVELNR TTTPTLFFHG ALSGKATNPS VQVLFEDNKN LGTYGQDTSL PKIEKFVSIG PEKLPILLIG GQLSPTDLSD PEVDQLLIEF LWPSKK
YAGTKNSFGS
NEWNQTEWIK
APFNDFIDTS
GTVPLSSALA
LLHRLEKQGA
NTLLYLQKNY
QQQTIETELE
VNALLRQRGT
TTQ ELVLLNK QVNKAN7VGH NG P TKS SRY QVTSQI IKGE
PDGTVVKERG
SMGVSGLRY
LDYQEMINVV
NAQHSQLHEN
EF051-1 (SEQ ID NO:189)
TA-AAAGAAAA
TTCAGTTTAG
ACAAAAAAAG
AAAATTATTC
AGTrTTGGAA
GCI'CGACAA
OTTCCTTTAT
CATGATT=~
CCAGCTGGTC
AAACACGCAA
GCAGAAGCTG
GCGGCCATGC
GCCTCTAAGA
AGAAAATATT
GAGGCGTTCA
TTGCCCTAAT
AAGAAATAAC
CAGCCTTTGA
AATTACAGAT
AACAAATGAA
TGGAAAACCA
CTGATITAA
AATATGCCGA
GCITrGGCAC
CCTTAGTTA
CTGAAGCT
AATGTCTAAA
TGCTGCATGT
CC=~CGGCA
AAAAGAGCAT
GCAA.ATTGA.A
TGCA'ITGGTT
GCTCGTTCTT
AAAAGCCCAA
AGAAGGCTTA
GAATGTAACA
TCCACAGAT
TI'TGAAAAAG
CAAAAAAAGG CTGTGrrCCT GCTTAGTTTA ACAAATCAGC CGCAAAAAGA AACAGTTTCA GCAGCTAGCT TAGAATCAGT CATGGAGAAG CCAGATATTC AGGTAACTGG AACCTATGAT AAAGCCCTAA AAGCCGATGT ATMW1TCTCA GCAGAAAAAC TAATTAATAA AAAAAGTGTC ATTGTGCCTA ACCAAGATCA AGCAAAGTGG ATOATAGCAA TTGGTGATCC TGCAAGTGTT AAAGCTI'TAG GCGCTTGGTC TTATGTAGAA GAAGTCC ITG AATGGGTAGC TAATGCAAGT GCAGCAACCA ATTCAAAAGT AGCGATTGTT CCAATTATCT ATCCAGTTGG TAAAGTTGCC TT'IMAAAT TTTTACAGAG TCAACAATGC TTAACAAAGT AG AACAAAAATC AGCAGATGCT TTGANAATAT TGGCTTTA-AG EF051-2 (SEQ ID NO:190) MSKQ KKAVFLLSLF SLVALIAACT NQPQKETVST KKEEITLAAA ASLESVMEKK IIPAFEKEHP DIQVTGTYDS SGKLQMQIEK GLKADVFFSA STKQMNALVA EKLINKKSVV PLLENQLVLI VPNQDQAKWH DFSDLKYAQM IAIGDPASVP AGQYAEEGLK ALGAWSYVEK HAS FGTNVTE VLEWVAJA-SA E-AGLVYATDA ATNSKVA IVA AMP EAVLKKP I IYPVGKVAA SKKQKSADAF LNFLQSQQCR KYFXNIGFKL TK EF051-3 (SEQ ID NO:191) ATMT ACAA.ATCAGC CGCAAAAAGA AACAGTTTCA ACAAAAAAAG AAGAAATAAC CCTTGCGGCA GCAGCTAGCT TAGAATCAGT CATGGAGAAG TABLE 1. Nucleofide and Amino Acid Seqeuences of E. faecalis Genes.
A.AAATTATTC
AGTTCTGGAA
GCTTCGACAA
GTTCCTTTAT
CATGATTTTT
CCAGCTGGTC
AAACACGCAA
CCAGAAGCTG
GCGCCATC
GCCTCTAAGA
AGAAAATA IT CAGCCT 1TGA
AATTACAGAT
AACAAATGAA
TGGAAAACCA
CTGATTTAAA
AATATGCCGA
GCTTTGGCAC
GCTTAG=IA
CTGAACCTGT
AACAAAAATC
TTGANAATAT
AAAAGAGCAT
GCAAAT'TGAA
TGCATTGGTT
GCTCGTTCTT
A-AAAGCCCAA
AGAAGGCTTA
GAATGTAACA
TGCGACAGAT
TTTGAAAAAG
AGCAGATGCT
TGGCTTTAAG
CCACATATTC
AAAGGCCTAA
GCAGAAAAAC
ATTGTGCCTA
ATGATAGCAA
AAA~cCTTAG
GAAGTCC=T
GCAGCAACCA
CCALATTATCT
TTTTTAAAr
TTAACAAAGT
AGGTAACTGG
AAGCCGATGT
TAATTAATAA
ACCAACATCA
TTGGTGATCC
GCGCTTGGTC
AATGGGTAGC
ATTCAAAAGT
ATCCAGTTGG
TTTTACAGAG
AG
AACCTATGAT
ATTTTTCTCA
AAAAAGTGTC
AGCAAAGTG
TGCAAGTGTT
TTATrGTAGAA
TAATGCAAGT
AGCGATTGTT
TAAACTTICC
TCA.ACAATGC
EF051-4 (SEQ ID NO:192) CT NQPQKETVST K.KEEITL.AAA ASLESVMEKK IIPAFEKEHP DIQVTGTYDS SGKLQMQIEK GLKADVFFSA STKQMNALVA EKLINKKSVV PLLENQLVL I VPNQDQAI(WH DFSDLKKAQM IA IGDPASVP AGQYAE-EGLK ALGAWSYVEK HASFGTNVTE VLE-WVANASA EAGLVYATDA ATNSKVA IVA AMPEAVLKKP I IYPVGXVAA SKKQKSADAF LNFLQSQQCR KYFXNIGFK(L TK EF052-1 (SEQ ID NO:2.93)
TAAAGTAGGA
CCCACAATGG
TTACTTAGTT
NGAAATCTTC
GCAGAAAATA
TCAAAAAAAA
GTTCGTAGCA
CTAAGCTGGT
GAAGCGCAAG
GTACCATGGG
CGTCGTGGAG
NGTTTITATT
GGGAGACCAC
ATCCAGT'GT
GAACGCAAAT
T1'GGCATATT
CGAAAAAAGT
TAGCATTATC
AGAAGTGGAA
ATTGTTGGTT
AGAAGTCGGA
GAATGTA'G
AAAAAGATA
G'TTTTTAATA
GAATCAATCG
TTTG.ACAGTA
AAACGAAGGG
CTA~rAMT, ATCGGGTTrA CCC CAAACAA
CCTAAAACTG
AGTAGT'I=
GCAGCGTATC
GCACTTGCTG
GAAGTGACAT
ATATTCCTCA
CAAAAACTTC
CCATTCAATC
GTGACAATCG
GGCTGTWCT
AAGTAGTGAT
GATTCATTGC
AATGAGANGA
AACAACTTAT
AGACATACCA
GCTATCAATC
AATAACTTGG
A 1TTAGACAA TTATGTAGAA AAGGAGAATA A EF052-2 (SEQ ID NO:194)
MRXX
NLXFLLLLVL LIYIPQTTYA ENRETTEVGI GFTKTSDIPS KKNPVVNVLP QTT-IQSLSIV RSRTQ IKRLP KTGDNRITWL SWFGILFLIS SFWLFLFRQL CRKGE EFQS2-3 (SEQ ID NO:195) AGAAA.ATA GGGAGACCAC AGAAGTCGGA ATCCGGGMTA CAAAAACT'C ACACATACCA TCAAAAAAAA ATCCAGTTGT GAATGTATTG CCGCAAACAA CCATTCAATC GCTATCAATC GTCGTAGCA GAACGcAAAT AAAAGAT EFOS2-4 (SEQ ID NO:196) ENRETTEVG I GFTKTSDI PS KI(NP'JVNVLP QI'IQSLS IV
RSRTQIKR
EF053-1 (SEQ ID NO:197) TAGTCATGGC ACCATA-ACAA GGAGGAGAGA AGTGACATGA AAAAATACCT TTTGCTTAGT 130 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
TGT' 7ITAG
GAAAATAGTIG
ACAGCGACTA
TCATCCGTCC
ACCACCAATC
TTTATTAAAA
ATTAG
GTCTTTCAG
GTGTTGTCTC
GTAATTCTTC
T1'TCTTCTGG
AAGCACCATT
GGAGAAGAGA
CTTCTGTCAT
CTTTTATGGA
CACAACCACC
CGTATATGGA
TATTTATITG
AGATGAAAAA
TCAGACACTG
ACGTATGAAT
GAACCCACCA
TCCCGACAAG
GGAATCAGCC
AACAGTATTA
CGTTT-GAGA
ATCCCACAGA
AGCCACCA
GAAGATTACC
TTATCACTAT
GCAGTAGTAG
AGCAGC TAT
AGAGTCGACA
CGGAGGCGCT
AGCGACAGGT
AGGCATATTA
GGATTGTAGG
EF053-2 (SEQ ID NO:198) MKKYLLLSC FLJGLFSFCHS DTAFGEAAYE NSGVVSFYGT YEYPTEESTT ATSNSSTTTE PTXPADGGAS SVLSSGVYGS RQG3RLPATGT TNQAPFIYLG IKRRREDEKN SISSSRDCRI
ISLITIGILF
EF053-3 (SEQ ID NO:199) a a.
a a a a a a a a.
TTTGGAGA AGCAGCTTAT GAAAATAGTG GTGTrGTCTC CTTTTATGGA ACAGCGACTA CTAATTCTTC CACAACGACC TCATCCGTCC TTTCTTCTGG CGTATATGGA ACGTATGAAT ATCCCACAGA AGAGTCGACA GAACCCACCA AGCCAGCTGA CGGAGGCGCT TCGCGACAAG GAAGA EF053-4 (SEQ ID NO:200) FGEAAYE NSGWVSFYGT YEYPTEESTT ATSNS STTE PTKPADCGAS SVLSSGVYGS RQGR EFOS4-1 (SEQ ID NO:201)
TAAATAAAA
AGTGCAGTAC
ACAGATCCAG
ACTGAGCCGG
AAGCCAACCG
ATTGATCCTG
CCAACCGAGC
GAACCAACTC
GAACACCCCA
AATCAATCGG
ACACATGTAC
GTAGATAAAG
TATAAGGTTT
TTGCCACATA
TTAA!1'CAG
ATTA=IGGA
TAGTATTCGG
CAACTCCACC
TGGATCCTGC
AGCCTACAAC
GAACCCCGT
CTACAACACC
AACCAGTGAT
CTAAACCAAT
CAGGAACACA
CAAGTGAAAC
CTGTTCCGTI'
TACCTAGCGG
CAGGAGAGAA
GATTCTTTT
GGAAATTACA
TGGCGGAAGT
AAI'ACCGAA
AGAACCGCCA
ACCGAGTGAG
TGAACCCACT
AAGCGAACCA
TCCAGAAAAA
AGACGTAGTC
ACCAAGTAT
TACTCCAATT
AACCAAAACA
AAACGTTGAG
ATTCACACTC
CTTTAAAAAG
ATGAAAAAAA
ATAACAG CAT
CCAACTGATT
GTAATACCAA
CCAGAAAAGC
GAACCAAGCG
GAAAAACCTG
CCAGCAGAAC
GTTACACCTA
CCTATTGAAA
ACAACAGAAG
CCAGAAGGAT
GTAAALAGCAA
CTT17CTCTG
AATAAG.AMA
TTATTIATC
TCGCTGACGA
CTAGTGAACC
CTGATCCAAC
CAACAGAACC
AGCCAACAGA
TTACTCCAGA
CAGAAACACC
GTCGAGAAAT
CAAGCAACTT
CTGGGGAAGA
TA.AAACCAAT
GTGATGGAAA
TATTGGGAAG
AAGCTTAA
AAGC~TT
TTTAGGACCA
TACCAATCCT
AGAACCAAC
AACAACGCCA
ACCTAGTCAA
ACAACCGAAA
AAAAACTCCT
TGATAAAACG
AGCGGAGGTA
AATTGTAGCA
TAGCAGCTCG
AATGA.AAGTA
CTTCTTTGTA
EF054-2 (SEQ ID NO:202) M KKIILSSLFS AVLVFGGCSI TAFADDLGPT DPATPPITEP TDSSEPTNPT EPVDPAEPPV IPTDPTEPSK PTEPTTPSEP EKPTEPTTPI DPGTPVEPTE PSEPTEPSQP TEPTTPSEPE: KPVTPEQPKE PTQPVIPEKP AEPETPKTPE QPTKPIDVVV TPSGEIDKTN QSAGTQPSIP IETSNLAEVT HVPSETTPIT TEACEEIVAV DKGVPLTKTP EGLKPISSSY KVLPSGNVEV KASDGKM<VL PHTGEKFTLL FSVLGSFFVL ISGFFFFKKN KKKA EF054-3 (SEQ ID NO:203) 131 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
ACAGATCCAG
ACTCAGCCGG
AAGCCAACCC
AT IGATCCTG
CCAACCGAGC
GAACCAACTC
GAACAGCCCA
AATCAATCGG
ACACATGTAC
GTAGATAAAG
TATAAGG=T
CAACTCCACC
TGGATCCTGC
AGCCTACA.AC
GAACGCCGGT
CTACAACACC
AACCAGTGAT
CTAAACCAAT
CAGGAACACA
CAAGTGAAAC
GTGTTCCCTT
TACCTAGCGG
AATI'ACCCAA
AGAACCCCCA
ACCGAGTGAG
TGAACCCACT
AAGCGAACCA
TCCAGAAAAA
AGACGTAGTC
ACCAAGTATT
TACTCCAAT
AACCAAAACA
AAACCTI'GAG
CCAACTCATT
GTAATACCAA
CCAGAAAAGC
GAACCAAGCC
GAAAAACCTG
CCAGCAGAAC
G'ITACACCTA
CCTATTGAAA
ACAACAGA-AG
CCkAAAGGAT
GTAA-AAGCAA
C TAGTGAACC
CTGATCCA.AC
CAACAGAACC
AGCCA.ACACA
'rTACTCCAGA
CAGAA.ACACC
G=AGAAAT
CAACCAACTT
CTGGGGAAGA
TAAAACCAAT
GTCATGGAAA
TACGAATCCT
AGAACCA-AGC
AACAACGCCA
ACCTAGTCAA
ACAACCGAAA
AAAAACTCCT
TGATAAAkACG
AGCGGAGOTA
AATTGTAC A
TAGCAGCTCG
AATGAAACTA
9 9 9 999 9 9 9 EF054-4 (SEQ ID NO:204) DDLdGPT DPATPPITEP TDSSEPTNPT EPVDPAEPPV IPTDPTEPSK PTEPITPSEP EKPTEP'-PI DPGTPVEPTE PSEPTEPSQP TEPTTPSEPE KPVTPEQPKE PTQPVIPEKP AEPETPKTPE QPTKPWDVVV TPSGEIDKTN QSAGTQPSIP IETSNL.AEVT HVPSETTPIT TEAGEEIVAV DKGVPLTKTP EGLKPISSSY KVLPSGNVEV KASOGKMKV EF055-1 (SEQ ID NQ:205) 9 *99*
TAACAAAAGG
GGAGGTIT
AGTTTTTTA
TACCAAAATC
GCTGATCCCA
CAGACCACCA
AGCATI'CTCN
CATGAAGCAA
TTGTTITCTC
CAATG-AAAAA
TAAATGTTGA
CGAGAACACC
AGGAACCAC
CAACTGGCTC
GN LTCGCCTT
ACTAA
TTTCTTGTGT
AAAGCGTTAT
AGCGTCTGAT
CGCTCCTAAA
TGGTCCTCCG
AACGCTCCCT
AATCGG'TrG
AAAAGGGCAA
TTAATGATTG
GGTGGTTCTA
GATCCCCCAC
CAAGGAGATC
CGTACAGGGA
GCGGGAATCG
GAAAGGCTAG
TGTGTCTACT
GTTCGGTGGG
CGAAAACAGA
AACGAAGTGG
GCAAGAGTCA
TACATAGAAA
CGAGTTIAAAA
ATCTTCTCCT
GATTGAATT
T'GCGCCAGCT
TCGTTCGACA
GGCAALATTTG
GAAGGGACGA
EF055-2 (SEQ ID NO:206) MKKKRYL MIVCLLSSPS FFINVEASDG GSSSVGIEFY QNPRTPAPKD PPPKTDAPAA DPKEPAGPPQ GDQRSGGSTQ TTTGSTLPR TGSKSQANLS ILXFALIGLA GIVHRKXGRH EAN EFOSS-3 (SEQ ID NO:207)
AGCGTCTGAT
TACCAAAATC
GCTGATCCCA
CACACCACCA
GGTG.GTTCTA GTTCGGTGGG GATrGAArTT CGAGAACACC CGCTCCTAAA GATCCCCCAC CGAAAACAGA TGCGCCAGCT AGGAACCAGC TOGTCCTCCG CAAGGAGATC AACGAAGTGG TGGTTCGACA CAACTGGCTC AACG EF055-4 (SEQ ID NO:208) SDC GSSSVGIEFY QNPRTPAPKD PPPKTDAPAA DPKEPAGPPQ GDQRSGGSTQ TT'TTGST 132 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecaliks Genes.
EF056-1 (SEQ ID NO:209)
TAAATGAAAA
ATAAATGTTG
CCGGCAACAC
AAGGAACCAG
ACAGCTGGCT
GGTCTTGTCT
AACTAA
AAAAGCGTTA
AkAGCATCTGA
CCGCTCCTAA
CTG.GTCCTCT
CGCAGCTCCC
TGATTGGTCT
TrAATAATT GCGTGTTITAC GGGTGGTTCT AGTTCGCTGG AGATGCCCCA CCGAAAACAG GCAAGGAGAT CAACGAAGTG TCGTACAGGA AGCAAGAGTC TGTCGGAATC3 GTCCAGAGAA
TATTTCCCC
GAATI'AATT
ATGAGCCAGC
GTGGTTCGAC
AGGCAAACCT
AGAAGGGACC
TAGTTTTTI'T
TTACCAAAAT
TGCGGATCCC
ACAGACCACC
CAGCATCT
ACATGP.AGCA
EF056-2 (SEQ ID NO:210) MKKKRYLIIA CLLFSPSFFI NVEASEGGSS SVGIEFYQNP ATPAPKDAPP KTDEPAADPK EPAGPLQGDQ RSGGSTQTTT ACSQLPRTG-S KSQANL.SILO LVLIGEYGMV QRKXGRHEAN EF056-3 (SEQ ID NO:211) 0 0* 0 0 000000 0 0 ATCTGA GGGTGGTTCT AGTTCGGTGG GAATTGAMT rTACCAAAAT CCGGCAACAC CCGCTCCTAA AGATGCCCCA CCGAAAACAG ATGACCCAGC AAGGAACCAG CTGGTCCTCT GCAAGGAGAT CAACGAACTG GTGGTTCGAC ACAGCTGGCT CGCAG
TGCGGATCCC
ACAGACCACC
EF056-4 (SEQ ID NO:212) SEGGSS SVGIEFYQNP ATPAPKDAPP KTDEPAADPK EPAGPLQGDQ RSGGSTQTI'T AGSQ EF057-1 (SEQ ID NO:213)
TAATGTTTAT
ATCATAAAAA
AGCGNTATGG
TACGTTTATG
CAGGGAAGAA
GAAAATCCGC
AATAAAATGA
TrGGCTGCGCC
GGTTAGTTT
CTGAAGAAAA
AGAATGAAAA
ACAATTTAGC
TGTA~rCCTT
AACGA--AGAA
AGTCAATGTT GAA.AATGGGG GGTATGTTTA GGGCTA=t~A TAATTATGAA TCAAATGGTC AGAGTCAAAT GACGTAGCGT TGCTTCMGA CAAGCAGTT GATAGGAGTT AGTITCTTGG CGAGTTATT TAA
AAGGAGGA;AT
TCATTC-GGTT
AAGCGAGCT'
ATACCCAACA
TACCTAAAAC
GGATAGTCAT
TCAGATCAAA
GCNAACAAAA
CTATG;GTACC
ATCACAAGAA
AGGCGAGTCT
TTA=TAAT
0 0* 00 EF057-2 (SEQ ID NO:214) MKI IKRFSLVCLG LLIIGLXTKS XMAEENTNYES NGQASvFYGTY VYENEKESND VAYTQQSEEQ GRNNLAASGQ AVLPKTGESE NPLYSLIGVS LLGIVIYLIN
KMKREKEFI
EF057-3 (SEQ ID NO:215) AAA TAATTATGAA TCAAATGCTrC AAGCGAGCTT CTATGGTACC TACGTTTATG AGAATGAAAA AGAGTCAAAT CACGTAGCGT ATACCCAACA ATCAGAAGCM CAGGAAGAA ACAATTTAGC TGCTTCTGGA CAACCAGTTT EF057-4 (SEQ ID NO:216) EENNYES NCQASFYGTY VYENEI(ESND VAYTQQSEEQ GRNWLAASGQ AV 133 TABLE 1. Nucleotide and Amino Acid Seqeuences ofrE.f[aecalis Genes.
EF058-1 (SEQ ID N'O:217) A.
A
TGAAGAACGT
TTGGCTATTT
TGGTACACCG
ACAACTGCAT
ATGACGGATT
AA.ATATCAAG
GAGCAACGAG
CCTGGGAAAC
CCTAAAAAkAC
GTAGTT-GCTG
GATGGTTCCT
GTGGTAGCCA
TTAAATGGCG
ATCCAAGGAG
=TATTACTG
ACGGGAGAAT
GCTCCAAATA
AACAATCAAA
ACACCAAGCT
GTAAATATTC
AATTTAGTTG
GCTTATGCGT
CAAGCAAATG
GGCACACTAA
TTTAAAAATG
GAAGTTGTGA
GCCcTGGCGc
AAAATCGATG
=IiACAACAA
TI'AGAACAAA
GTCAATGA.AC
AAACACAAAG
GGCGCAGTCT
TAA
TCTATTFIGC-T
TATCAAAAAC
TTAGTACCTT
TTGCAGAAGA
TACCAGATCC
GACTGGCAGA
CGGCAGGCGC
CTGTTGCTCA
AAAATrGTAA
CTACGAATAT
ATAAATATGG
ATGATGGTAG
CAGAATTITGT
TCAA.AGATGG
GGAAAAGTTA
TAACAGTTAA
ATGCAGAA1T
CACCTGTTGA
TAGATGGTAA
CATTGGGGAT
ATAAACATGA
TATATGATGG
GCTTCACTGT
AATTCGTTTA
AGGCGAATGT
CAGGTGGGAA
GAGCTTCCTT
AAACAACGAA
CGGCTGATGG
CTGTAGCTCC
AATCATATG
GTACCTTACC
TGCTACTrAT
TCACGATTGC
AA.AAAAGAGG
GTTACTAATT
AAATGGGGAG
GC~rATICA-A
TGTGACGTT'
AAGCGTTGAT
AGGAACCACC
AGATGCAGTG
GGTGGTGGCG
AACAGAAGAA
TTTACATGTG
TATTTCTAAA
ATTATATACA
TGAAATTGGC
AAATCTTGAG
AATTGAAAAT
AAAAACAGTC
AGATGTGGCA
TGCAGACAAA
TGCAGCCTTA
GGATACAGTG
CGCCGTTAAT
CTTTATGCAT
TGATAACGGT
ACGTTTCATT
TGTCGTCCGT
AGCAGCAACT
ATI'AGTTGAT
TGATGATTAT
CACAACAGAA
TTCAACAGGT
TGCAGrGAGTC kGGCCTGCTA
AGAGAGAAAAA
ITGCCACTrT
AGCGCACAGC
AATAGCGGCA
PGTATTTATA
GCAGCTAAAC
6-TGCAAATG-
TATACCATTA
TTCCCAGTT
TTAGCGGTTG
AAAAAAGTAG
AGCGAAGGCT
TGGACAACGG
GAAA.ATCATT
GTTGGTTCGT
CAAACAAAAA
AAAAATCATA
ATTGGCGAAA
GAAGGCGACG
ACITTI'GATA
A=TCTCCTG
CCAGCGTATA
TTAAATGAAA
CATACCGACG
AAACTCGATG
GATCAAAACA
TGGGTGAAAA
ATCACAGGGC
GTCTTGTrAA
AACCTAGTTT
GGCAAAGGAA
TACTTGCTA
kTCATTGGGA rGA.AGcAArI'
TCACAAGTGT
TCGT'GATTCA
AAGAAATGAG
ACGTGACGAA
AAGCTGTCCA
GGAATGTCAC
AAGAAGAACC
ACGAAATGAT
TTCATA'ITTA
GAACTGCTGA
CAC CAGGCAC
ATAAAGAACA
TCACAGAAC
ATATITAGA
CACCATTTAC
CCTCTAAAGT
AAALTTAAATA
CTAATAAATA
ACGTCAC ITC
AAAAITATCA
TrCCTACGCT
AAGCAGATCC
ACCAAACACC
GCGATGTGCAC
GCGACACAC
CAAAAGCTGA
TTAAATACGG
CAAATCGGAT
CACCAGAAAA
TCTACGTTA
GACGTAGAAA
TGGTAGTCAT
A4.AAAAAACTT
PTTAGGGACA
CAAAAAGAAA
CGAGTTTGAT
CGAATTTTAC
AAGTTTAACT
TGTTCAGTTA
AAAAGAGGGT
CAAGCAAACA
TCCTAAAAAT
AAATG.AGGA
AGTAAAATAT
AGCAAAACGC
AGAGAATGGA
AGAAGTAAAA
AA'FrGAAGCA
TGATAAAACA
TCAAATTTCT
CGTCAAATTC
TGGAGAGTAT
AGTGACTGAA
AACACCAGGC
TACCAAAGGC
ACCAACTGTT
AGCGACACAA
AAATTATTTG
AGCAACTACT
TACCTATTAT
TGAATTTGTG
AGTACCAAAC
CTTAGGAAGT
AGAAAATGCT
EF058-2 (SEQ ID NO:218) MKQLKKVW YTVSTLLLIL PLFTSVLGTT
TAFAEENGES
QRAAGASVDA
VAATNMVVAF
NGAEFVISKS
GELTVKNLEV
PSLDGKDVAI
YALYDGDTVI
KNEANVDNGH
IDETTKAATW
NEQSYGTTEN
AQLVI HKKKM
AKQAVQSLTP
PVYEMIKQTI)
EGSPGTVKYI
GSY ILEEVKA GEK IKYQISV
APENYQVTEQ
TDDQTPPTVE
VKTKAE-ATTF
LVSPEKVPNK
TOLPDPLIQN
GKPVAQGTTD
GSYKYGTEEL
QGVKDGLYTW
PNNAELIENQ
NIPLGIADKE
ANGFTVAVNP
WVTGGKRFIK
TTTADGLVDI
HKGTLPSTGG
SGKEMSEFDK
ANGNVTVQLP
AVVI{IYPICNV
TTOKEQAKRF
TKTPFTIEAN
GDANKYVKFN
AYIPTLTPGG
VDG-DVTATQA
TGLKYGTYYL
KGIYVYLGSG
YQGLADVTFS
KKQNGKDAVY
VAND)GSLHVK
ITGKSYEIGE
NQTPVEKIVK
LVOKHDAALT
TLKEVYFMHL
LAGASFVVRD
EETVAPDDYV
AVLLLIAGVY
IYNVTNEFYE
TIKEEPKEGV
KVGTAENEGL
NDFTEAENGT
NDTSKvDKTT
FDNVTSGEYA
NEKADPTKGF
QNSDTANYLK
LLTNRIEFVV
FARREKENA
EF058-3 (SEQ ID NO:219) TABLE 1. Nucleotide and Amino Acid Seqeuences or E. foecalis Genes.
ACAAGA AAATGGGGAG AGCGCACAGC TCCTGATTCA CAAAAAGAAA
ATGACGGAT
AAATATCAAG
CAGCAACGAG
CCTGGGAA.AC
CCTAAAAAAC
GTAGTTGCTG
GATGGTTCCT
GTGGTAGCCA
TTAAATGGCG
ATCCAAGGAG
TTTATTACTG
ACGGGAGAAT
GCTCCAAATA
AACAATCAAA
ACACCAAGCT
GTAAATATTC
AATTTAGTTG
GCTTATGCGT
CAAGCAAATC
GGCACACTAA
TTT AAAAATG
GAAGTTIGTGA
GCCTTGGCGG
AAAATCGATG
'TTTACAACAA
TTAGAAGAAA
GTCAATGAAC
AAACACAAAG
TACCAGATCC
GACTGGCAGA
CGGCAGGCGC
CTGTTGCTCA
A.AAATGGTAA
CTACGAATAT
ATAAATATGG
ATGATGGTAG
CAGAATTTGT
TCAAAGATGG
GGAAAAGTTA
TAACAGTTAA
ATGCAGAATT
CACCTGTI'GA
TAGATGGTAA
CATTGGGGAT
ATAAACATGA
TATATGATGG
GCTTCACTGT
AATTCG'TTA
AGGCGAATGT
CAGGTGCGAA
GAGCTTCCTT
AAACAACGAA
CGGCTGATGG
CTGTAGCTCC
AATCATATGG
GTACCTTACC
GCTTATTCAA
TGTGACGTI
AAGCGTTrCAT
AGGAACCACC
AGATGCAGTG
GCGTGGCG
AACAGAAGAA
TTTACATGTG
TATTTCTAAA
AT1'ATATACA
TGAAATI'GGC
AAATCTTFGAG
AATTGAAAAT
AAAAACAGTC
AGATGTGGCA
TGCAGACAAA
TGCAGCCTTA
GGATACAGTG
CCCCGTTAAT
CTTTATGCAT
TGATAACGGT
ACGTTCATT
TGTCGTCCGT
AGCAGCAACT
ATI'AGTTGAT
TGATICATIAT
CACAACAGAA
AATAGCGGGA
AGTATTTATA
GCAGCTAAAC
GATGCAAATG
TATACCA1TA
TTCCCAGTTT
TTAGCGGTTG
AAAAA.AGTAG
ACGCGAAGGCT
TGCACAACGG
GAAAATGATT
GTTGGTTCGT
CAAACAAAAA
AAAAATGATA
ATI'GGCGAAA
GAAGGCGACG
AVITrTGATA AT1'GCTCCTG
CCAGCGTATA
TTAAATGAAA
CATACCGACG
AAAGTCGATG
GATCAAAACA
TGGGTGAAAA
ATCACAGGGC
GTCTTG =AA AACCTAG7I'T
AAGAAATGAG
ACGTGACGAA
AAGCTG-TCCA
GGA.ATCTCAC
AAGAAGAACC
ACGAAATGAT
TTCATATTA
GAACTGOCTGA
CACCAGGCAC
ATAAAGAACA
TCACAGAAGC
ATA7TIAGA
CACCA=IAC
CCTCTAA.AGT
AAA'FrAAATA
CTAATAALTA
ACGMtACT1'C
AAAATTATCA
TTCCTACGCT
AACCAGATCC
ACCAAACACC
GCGATGTGAC
GCGACACAGC
CAAAAGCTGA
TTAAATACGG
CAAATCGGAT
CACCAGAAA-A
CGAGTTTCAT
CGAATTI'TAC
AAGTTTAACT
TGTTCAGITA
AAAAGAGGGT
CAAGCAAACA
TCCTAAAAAT
AAATGAAGGA
AGTAAAATAT
AGCAAAACC
AGAGA.ATGGA
AGAAGTAAAA
AATTGAAGCA
TGATAAAACA
TCAAATTTCT
CGTCAAATTC
TGGAGACTAT
AGTGACTGAA
AACACCAGGC
TACGAAAGGC
ACCAACTGTT
AGCGACACAA
AA.ATTATTTG
AGCAACTACT
TACCTATTAT
TGAATTTGTG
AGTACCAAAC
a a.
a a
S
a EF058-4 (SEQ ID NO:220) EENGES AQLVIHKKKM TDLPDPLIQN SGKEMSEFDI( YQGLADVTFS IYNVTNEFYE a *a.a a. a a.
QRAAGASVDA
VAATNMVVAF
NGAEFVISKS
GELTVKNLEV
PSLDGKDVAI
YALYDGDTVI
KNEANVDNGH
IDETTKAATW
NEQSYGTTEN
AKQAVQSLTP
PVYEMIKQTD
EGSPGTVKY I
GSYILEEVKA
GEKIKYQISV
APENYQVTEQ
TDDQTPPTVE
VKTKA-AT=F
LVSPEKVPNI(
GKPVAQGTTIJ
GSYKYGTEEL
QCWKDGLY I PNNAEL IENQ
NIPLGIADKE
ANGFTVAVNP
VVTGGKRFIK
TTTADGLVDI
HKGT
ANGNVTVQLP
AVVHIYPKNV
TTDKEQAKRF
TKTP FT lEAN
GDANKYVKFN
AY IPTLTPGG
VDGDVTATQA
TGLKYGTYYL
KKQ1'NGKDAVY VANflGSLHVK
ITGKSYEIGE
NQTPVEKTVK
LVDI<HDAALT
TLKFVYFMHL
LAGASFVVRD
EETVAPDDYV
TIKEEPKEGV
KVGTAENEGL
NDFTEAENGT
NDTSKVDKTT
FDNVTSGEYA
NEKADPTKGF
QNSDTANYLK
LLTNRIEFVV
EF059-1 (SEQ ID NO:221)
TAGATTGGAA
TTAGCAGGGG
ACAACAGGGA
GAGCCAGAGC
ACCCAACCTA
CCAACAGAGC
GTACCAGAGC
GAATGAAAAT
GAAGCAGTT
GTGTTTTACC
AACCAACAGA
GTGAGCCTTC
CA.ACAACGCC
AACCAACAGA
GAAAAA.AATG
TTCTGCTTAT
AGATGAACCG
GCCAAGTACA
AAAACCGACG
AAGTAAGCCA
GCCAAGTGTA
ATrATTATTG
GCGCAAGAAT
AATGTACCAA
CCAGAGCAAC
GATCCTTCGT
GAGCAACCA
CCAGAAAAAC
CCTTATTCAG TACAAGCCTT CAGAAGGAAA TCTTGGTGAA CTGACCCA.AT AACGCCAAGT CATCCCAACC GTCAACACCA TACCAGACGA ACCGAGCGTA CAGAGCCAJ4C AACGCCAAGT CAGTAGAACC AAATAAACCA 135 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faca/is Genes.
ACCGAGCCAG
GAGCAACCAA
CAACAGCCAA
GTGACGACAC
CCTTTAACAC
AGTGGCAATG
GAAAAAATGG
TTAA=TATA
AAAAGCCTGT
CAGATGTGGT
CTGTTCCAAT
CCATTGAAAC
AAACGCCTGA
TACAAGTGAA
GCATAA'ITGG
AAAAACCTAA
GCCAGTTGTT
GGTAAAGCCA
TGAAACGAAT
AGCAAGCGGA
TCGATTAAAA
AAGTGCTGAC
GTCAATCGCT
AGTCTAG
CCrrGAAkAAAC
AATCGAGAA.A
AACCT=CAG
GAAGCAAMI
CCGATTAAkA.
GGAAAAATGA
GGTGTATGTT
CAGTTGTACC
TTGCAACAGG
AkAGTAACACA
TCGCAGTGGA
CTGAATATAA
AAGTAkCTTCC TGACTrGTTTT
ACAACAACCA
AGAATCTACA
TGTCCCAACT
TAAGGGCGT
AGTATTACCA
TTIACACTGGT
ATCAGGAATC
EF059-2 (SEQ ID NO:222) MKKMI IIALFSTSLL AGGSSVSAYA QESEGNLGET- TGSVLJPDEPN VPTDPITPSE PEQPTEPSTP EQPSEPSTPT EPSEPSKPTD PSLPDEPSVP TEP'TPSKPE QPTEPTTPSV PEQPTEPSVP EKPVEPNKPT EPE-KPVPVVP EKPVVPQQPE QPTDVVPN GEIATGESTQ QPTVPIETNN LSEVTHVPTV TTPIETASGE AIVAVDKCVP LTQTADGLKP IKSEYKVLPS GNVQVKSADG KMKUVLPYTGE KMGIIGSIAG VCLIVLSGThL IYKKRKV EF059-3 (SEQ ID NO:223) 0 0 a.
a 5* S S
S.
S S 0 50 a a 00~ a *SaSSS
S
*000 a AGAAGGAAA TCTTGGTGAA
ACAACAGGGA
GAGCCAGAGC
ACCGAACCTA
CCAACAGAGC
GTACCAGAGC
ACCGAGCCAG
GAGCAACCAA
CAACAGCCAA
GTGACGACAC
CCT'ITAACAC
AGTGGCAATG
GTC TTACC
AACCAACAGA
GTGAGCCTTC
CAACAACGCC
AACCAACAGA
AAAAGCCTGT
CAGATGTGGT
CTGTTCCAAT
CGATTGAAAC
AAACGGCTGA
TACAAGTGAA
AGATGAACCG
GCCAAGTACA
AAAACCGACG
AAGTAAGCCA
GCCAAGTGTA
GCCAGTTGTT
GGTAA.AGCCA
TGAAACGAAT
AGCAAGCGGA
TGGATTAAAA
AAGTGCTGAC
AATGTACCAA
CCAGAGCAAC
GATCCTTCGT
GAGCAACCAA
CCAGAAAAAC
CCTGAAAAAC
AATGGAGAAA
AACCTTI'CAG
GAAGCAATTG
CCGATTAAPA
GGAAAAATGA
CTGACCCAAT
CATCGGAACC
TACCAGACGA
CAGAGCCAAC
CAGTAGAACC
CAGTTGTACC
TTGCAACAGG
AAGTAACACA
TCGCAGTGGA
GTGAATATAA
"rQTAC
AACGCCAAGT
GTCAACACCA
ACCGAGCGTA
AACCCCAAGT
AAATAAACCA
ACAACAACCA
AGAATCTACA
TGTCCCAACT
TAAGGGCGTT
AGTATTACCA
EF059-4 (SEQ ID NO:224) 500 a *0 0 S a a.
EGNLGET TGSVLPDEPN VPTDPITPSE PEQPTIEPSTP EQPSEPSTPT EPSEPSKPTD PEQPTEPSVP EKPVEPNKPT EPEKPVPVVP QPTVPIETNN LSEVTHVPTV TPIETASGE GNVQVKSADG KMKV
PSLPDEPSVP
EKPVVPQQPE
AIVAVDKGVP
TEPTTPSKPE QPTEPTTPSV QPTDVVVKPN GEIATGESTQ LTQTADGLKP IKSEYKVLPS EF060-1 (SEQ ID NO:225)
TGAAAA~ATAG
ACGTTGGGAA
GCAGAAGAAG
AAAACGGAAG
TCACAACAAC
CAGACGTCAT
GCCTGTAC
AA.ATAA
ACAAGGAGCA
TACITIAAC
TAGGGCAAAC
GTGTCA'ITAA
AAGACAAAGT
TACCAAGGAC
TCGTAGTACT
CGCGATCATG
CGTTGGCATT
GAATATCGGT
GCCAATAGAG
GAGCAGAAAA
AGGTGAACGA
AACAAGTTTC
ACAATGAAAA
GCGAGTCTAA
GTAACG=C
CAACCAGTCA
ACCACTGCTA
AATAGCACGT
TATTATTTGA
GTAAAGGGTC
TTGT'TCTI'C
ATGGAGGAAA
CTGATAAAGA
AA-ACCAATCC
GGCTTTACAG
ATA-AAAAAG
ACTI'CTGGTG
TGAGAGTTTT
AGAGCCACTA
TAAAAAAACG
GACTAATGCA
CCTTGGTATT
GAAAAAGGAA
EF060-2 (SEQ ID NO:226) MMT HKSKGSLLVT LGILLTVGIA SLIVSSESFA EEVGQTNIGV TFYGGKEPLI( 136 TABLE 1. Nucleotide and Amino Acid Seqeuences; of E. foecalis Genes.
TEGVIKPIEQ PVTDKDKKTS QQQDKVSRKT TAXTNPTNAQ TSLPRTG;ERN STWLYSLCIA CLLVVLTSFY YLNKKRXKEK EF060-3 (SEQ ID NO:227) AGAAGAAG TAGGCCAAAC GAATATCGGT CTAACGTTCT ATGGAGGAA.A AGAGCCACTA AAA.ACGGAAG GTGTCATTAA GCCAATAGAG CAACCAGTCA CTGATAAAGA TAAAAAAACG TCACAACAAC AACACAAMGT GAGCAGAAAA ACCACTOCTA AAACGAATCC GACTAATGCA
CAGACGTCAT
EF060-4 (SEQ ID NO:228) EEVGQTNIGV TFYGGXEPLK TEGVIKPIEQ PVTDKDKKTS QQQDKVSRKT TAKTNPTNAQ TS EF061-1. (SEQ ID NO:229)
S.
S. S
S.
OS
5 0
S.
S.
5 5 5* 0* 5 S 0 @0 5 5 eSS 0
TAATGGAACG
ATAATGATGA
AGTGA-AArI'T
GAAGTACCAA
CCACCTGTAG
CCGACAACAC
GAGCCAAGTA
GAAAAAACTG
CCAACCAAC
GGTACACAAC
CCTAGTGTAA
GGTGTTCCAC
TTGCCTAGCG
ACCGACAGAA
AAAAAAMTCT
CTGCTTTGC
CAGAACCAAG
ACCCTGTAGA
CAACAGAACC
AACCAGTAGA
TGACACCAAC
CAATCGACGT
AGCCAACAGT
CAACACCTAT
TTACACAAAC
GAAATGTAGA
GAAGATT=IG
TTTTGCTAGT
ACA.AGAAATT
TACACCAGAA
GCCACCTATT
TACAACTCCT
ACCTGAAAAA
TAAACCAACA
TGTTGTAACG
CCCTA ETGAA
TACAACTACA
AGCAGAAGG
AGTAAAAGGT
AACTTACAAA
TTATTTAGTG
ATCCCTGATG
AAGCCAACAG
ACACCAACGG
ACAGAGCCAA
CCAGTTACAC
GAATCTGAAA
CCAACACGG
ACAAGTAAT
GACGGAGAAA
TTAAAACCTA
AAGG;ACGGTA
GCCGTAGCGG
TTAAAATTAA
CCACACTACT
ATACTACGAC
ATCCAACACC
AGCCAACAGA
GAACCAGA
CAAGCAAACC
AACCAGTACA
AATTAAATCA
TGGCAGAAAT
ACATTGTAGC
TTCAATCNAG
AAATGAAGGT
TATCTTGTCT
AATGGAGGAA
ATTGGGGGA
ACCGCCCATT
GCCAATTGAG
ACCGACAGAG
ACAACCAACG
AGCAGAACCC
ACCAGCAGAA
CGCTGGAAAT
CACGCACGTG
TGTAGAAAAA
TTACAAACTA
TTTACCATAC
GTAG
ACAGGTGAAG AAATGAATAT CTIT'IATCT EF061-2 (SEQ ID NO:230)
S
00 0
OS
OS
MMXJ I LFASL
VPTEPSTPEK
PSKPVEPEKP
TQQPTVP lET
PSGNVEVKGK
FSATLLFCGS
PTDPTPPIEP
VTPSKPAEPE
SNLAEITHVP
DGKMKVLPYT
EISAFAQEI I PVDPVEPP IT
KTVTPTKPTE
SVTTPI'rrrD
GEEMNIFLSA
PDDTTTPPIE
PTEPTEPTEP
SERPVQPAEP
GENIVAVEYXG
VAVSCL
TTPTEPTTPT
SKP IDVVVTP
VPLTQTAEGL
EPSEPEQPTE
TGELNHAGNG
KPIQSSYKVL
EF061-3 (SEQ ID NO:231) GAAA=I CTGCTI'TGC ACAAGAAAIT ATCCCTGATG ATACTACGAC ACCGCCCATT
GAAGTACCAA
CCACCTGTAG
CCGACAACAC
GAGCCAAGTA
GAAAAAACTG
CCAAGCAAGC
GOTACACAAC
CCTAGTGTAA
GGTG'rTCCAC
TTGCCTAGCG
CAGAACCAAG
ACCCTGTAGA
CAACAGAACC
AACCAGTAGA
TCACACCAAC
CAATCGACGT
AGCCAACAGT
CAACACCTAT
TTACACAAAC
GAAATGTAGA
TACACCAGAA
GCCACCTATT
TACAACTCCT
ACCTGAAAAA
TAAACCAACA
TGTTGTAACG
CCCTATTGAA
TACAACTACA
AGCAGAAGG
AGTAAAAGGT
AAGCCAACAG
ACACCAACCC
ACAGAGCCAA
CCAGTTACAC
GAATCTGAAA
CCAACAGGGG
ACAAGTAATT
GACGGAGAAA
TTAAAACCTA
AAGGACGGTA
ATCCAACACC GCCAATTGAG AGCCAACAGA ACCGACAGAG GTG-AACCAGA ACAACCAACG CAAGCAAACC AGCAGAACCC AACCAGTACA ACCAGCAGAA AATAAATCA CGCTGGAAAT TGGCAGAAAT CACGCACGTG ACATTGTAGC TGTAGAAAAA ITCAATCINAG TTACAAAGTA AAAT'GAAGGT TT 137 TABLE 1. Nucleotide and Amino Acid Seqeuerices of E. faccalis Gencs.
EF061-4 (SEQ ID NO:2]2) QEII PDDTTTPPIE VPTEPSTPE( PTDPTPP IEP PSKPVEPEKP VTPSKPAEPE TQQPTJP ET SNLAEITHVP PSGNVEVYKGK DGKMKV PVDPVEPPIT PTEPTEPTEP TTPT-EPTTPT EPSEPEQPTE KTVTPTKPTE SEXPVQPAEP SIXPIDVVVTP TGELNHAGNG SVTTPITTD GENIVAVEKG VLTQTAZGL KPIQSSYKVL EF062-1 (SEQ ID NO:2]3) o
TGATTCTTGA
CACTGGGTAA
GATAATGTAC
AACCCCGACC
GTACAAAAAG
GGTACTGAAC
GGAGCTGAAA
CTAGGGCAGC
CCTA-AAGAAT
;AAXTACCG
GCTGCAAAAG
CAAGCAGCTG
GCAA.AAA6ACA
AAGAACGAAA
GTCGTG.ACGA
GATTTTAATA
ATGGNGGGA
TCTAATGGTG
GGACTAAACG
CAATCCTCAC
ACNGCTTTTA
AAGT-TC L=G
GCGCTGTCTT
GATTTTGGGG
GATGGAAAAT
AATAGTGATT
GCNAATGGNC
GTATCTAGTC
ATTCAATT
AAAGCCAATG
NTCAkATGATT
ACAGAACTTG
GATGCAGAAA
CTNAACGANA
GTGACTGCTT
AACGCNGACG
GTACTGAAAA
GAAACGGTAA
AAAGCAGATA
CAAACAAAAA
ACNGAAGAAT
TGGCACGCTA
GATAT1'TCTC
AATCAAGCAT
AGCAACAAkAT
CTGTCCCTAT
AAGCCGCGGA
TGCAGTCAGA
ACACTAC LTC
AAAGTTCAGC
AATCAGCACA
CAACAGAAGT
TTGA-AACACC
TTwGTAAAA
AAAAAGAAGT
AATTAAAAGC
AAGCNGAAAA
ACAGCTATGT
AAGACACTAA
AAGTAAATGC
AAGNTACTIGG
GGTATGCGGT
CTAGrTTT
CAAGCCAAAG
TTGGTACAAA
ATCGTCAGG
CTTTAAATTC
CNAACAATGC
TTIACTCACC
GGC-ACGCTGT
GTATTI'CCTT
CGCAATGGTT
ATGGGAATCC
TCGTTCCTGT
TAAATGTGAA
CCAA.AGTAGA
A.AGTGACGAT
AAGACTNAA.A
CITATGANCT
GNTCNGTTGT
ATGTAGAAGG
CAAATACAGT
AAAACGGTAC
TTTATTATGA
GGGGCATCAA
rrACNAANTA
CCTACATTCT
TATTGGCNGC
TCTTTTTCTA
ATTAGATACG
AAAGGAAACA
TCAACCGACC
TACCCCAAAT
AGAACAACCA
TCCACCAGCT
AGACGTTGAT
ACCAGCAGAA
AGACCAACTA
CAAAAATGAA
AGAGCGNTAN
CAATGAAGCG
AATTTCGTCG
AGGGGATTCA
CAACTI'CCAG
TC TITAGAA
AGGACGTAAA
TGGAACGTTA
CAGAGTCAAT
TAAAGAAGTA
AAGTI'AACG
GTTCAAATAC
GGAAGATATI'
AGGTCACAAG
TTCTTTTGGT
TGCCTTTAGN
AAAAGAACCA
NCTTGTGCCN
NCGTGGCGAT
TCCPIAAAACA
TGATTTATCC
AGCTGTTGCT
CAATTTAGAT
1 TAGCAATC
CGATTTTGAA
GATTAACCAT
AGTTGGCACT
AGTGAAATCT
TGATGTCTTG
TGACCTTAAA
TTI'AGAAAAC
NTTAAATGAA
AGTC.TGTTAG
CAACCAGAA.A
CCTAAAACGG
AAAGTAGAAG
GATACCACAA
GTAGTAAGCC
GAAAATGAAG
AAAGCAGTrG
GACTTAGGCA
CAAAAAGAAC
AAAATTGCCA
GANAAAGAAG
ATTAGTAAAA
ATTAAAGGCG
AAAGATATCT
AATTCCTTTG
AAAAATAAAC
ATTACAAAAG
AATGCAGTAT
GGTAAGGATG
CTACCAGATA
AATAAAGGTG
ATTAATGGNT
CACTATGGCA
AATGCCTACT
ATGACAACAA
ACTAACTTAA
GAAAAAGCAA
AATAAAGAAG
TCTTTACAAT
GTAACNAAAC
AAAGTGAAAG
GCAGC14ATTA
CAAAACACCC
GGGTATAAAT
AATACAGCTG
GTGCCAGGTA
GTTTCTCTAC
TCCOAACGTC
GACACGACCC
GTAGGGGANA
AAAGACAATA
GGAAGCAATA
GAAAGCAAAA AAACAATATA AGACATATAA
GAGCCGTAGG
CAACGACGGT
CAGTMDTCTGA
AAG-TAGCGCC
ACGCGCAACA
CTGAAACAAC
TGAATAA.ATC
ATGAAGTA.AA
ACG=IC'C
AAGCGAAAAA
A6AGAAAATGC
TCGCNGAATA
ACCTAGTGTT
GAAAATTTAT
TTACAAAATT
TAAAAGAGGC
CAGTIACAGT
CAGAATTTGT
TTTCAAACGA
TTAAAACACG
AAGATACTCC
GCCATGCGGA
CNTATGTGAA
CAGGACCTTC
TTGGTTCAGG
AAGGAAAAAG
ATGCCCAATC
CGATIGAATI'
TCACTGATGG
ACA'IGTGAC
AAGGGATICG
IIATCAAGC
ATTCAGGAAN
TCACAGCA6AT
ATTTACTTGT
TTCAGCTGAC
GTAATCCTTC
ATGATAAAGA
CAGCNAACTA
ATGATCG=
AAAC=AAA
AAGAC'ITGAC
AAGTAGGCAA
AGCTAAGA6AT
P.TTAGCTACT
TCAACCCAAT
AGAAGCAACA
PGAAAATA.AA
ACCAACAGTA
CAATGAACCT
A.ACGTCCATT
AAAAGATCCA
TAAAGATITA
GATTGCCCAA
AGAAATTGCG
CAACAAGCAT
CGATCAATCT
CAAAGCAACT
ACGGAAAGAT
AAATCTTGGG
GACCTATACA
TTATGA.ACTA
TCCGATTATC
CTTAACGAT
ATTTGCGTAT
ATTTGTTTCT
AAAACAAGCG
TGGATTGAAA
TGTAGGTCTA
TAATGTGCCT
AGTGAAGCCT
CAATNGATAC
NCAGAAAAAT
AGGGGATACG
AGATACNT 7 AGACGCAkAGT
AGCTAAAGAC
GATGAAAACC
CTTGCCGTTT
AAANGATGGN
cAAAGATGTA
TATTCCGTTA
TGGcGGAATN CACAGrGNAA6A AGcAGGAACA
GTT'TACNATG
ACAAGc [TG 138 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
TCTGTGTATC
AAACCAACCA
CGTGGTGATG
GCCTTI'GATA
AANGTGACAC
AACCAGTTCA
CCACAAGCCT
GTCAAAGCCG
CGAATTAAAA
GTTATTAAAG
TTCTTCTATG
TGGTCGATTA
TTTGCCAATT
TCGA.AACTAT
rI'TrTNGATG
ATTGGTGTAG
AATGAGAAGA
CCAGAAAAAA
'ITAGTGGTAG
GTCTTATTAA
AAACGTAGAA
TGGAAGTCGA
AAGAC1TTGT
AAGCCGTTCA
TTCT'PrCTTA
CAGTCGATCT
CAATCAA.AGA
CGATCTfCNTG TTAT'rCTAGC
ATGT'ITCTGG
CCAATACNGT
TNCGTGACAA
AATTTACAAG
GCGATAAACT
CTAA'I=T
TCACGATGAC
CGATGAATCT
AACGAATTGC
TTAAAACNAA
CAGTGATTrGT
AAAAGGCAAG
CGGTAGCTGG
AAGAAACAAA
ACGGATNAAA
NCGTITCTAAT
TAACAAGAAA
TGAAATGACN
TGCGACACGC
CTIV4CTTCGT
GGACCATGCC
GNATCTGG
NGATGTTTAT
TGTCAACCAT
ACAAAGTCA
TACTGACAT
AGACGTCAAA
TTAGCAGAC
CTTTGAACAA
AAAAGAAAAC
GGCAGGAGAC
TACGGTAGTG
ACCACCAACA
TGTNGTGCCA
TAGTTTAGCC
ATAA
ACAGGTGACG
ACN4GTGGTCA
GGGGAAGANA
T5GGACTTAA
GTTCTTTCT
GTCAALAGATT
AAAGGCACGG
CAAGAATTGC
AATTCAGCGG
ATCCAAAAG
AATGGNGCCA
6CCTGCAGAAT
CATGACAAAT
GGAACCAAAG
=CGTAGTGA
AAAAACGTTG
GT ETACAACA
ACNCATACGC
CCAAAA-ACAC
GAATTGCCGC
GCAATCCTTG
TAGAAAACAC
CCCATACNCC
?I'AANCATGG
AAGGGTACGA
TCGATGATTA
CTAAAGGGGN
TGACNATNTC
GTGTAACNCT
AACAAAATAC
TGAANCCTAA
CAATCAAATT
ACCCTCGNCT
'FTAGTGGCCA
TIGAATAAAGG
AA.ATCACGGC
CACACTCATG
CAATCGAAGA
CAGAAkAAACC
CGCAAGCACC
AAACAGGCGA
GCTTAGCAGG
GCAAACAGAA
TGATGATCCA
AAAAGTNGCT
TAAAGACTTT
CCATGAAACG
AGACATTACC
TGCCAAAGAC
CCCTACAAAA
ATTTGGNCAA
AAAAGACGTG
AGGGGAGAAN
TGTGGAAGAA
ATGGTCTGTG
GGACGACATT
CAGTCAAGCC
GAAAGCGTTC
ATCTTTCAAC
ACAAACNCCA
AGTAGAGCCA
AAAACAAAAT
CTTAGGCTTT
C
EF062-2 (SEQ ID NO:234) MKAKK QYKTYKAKNH WVTVPILFLS VLGAVGLATD NVQAAELDTQ PETTTVQPNN
PDLQSEKETP
AEKSAQEQPV
ITVVEKPAED
KNKAEKERXX
FNKVNAGDSK
LNASYLGRKI
FFDASGKEVL
GKFYSPEDID
SSAQWFAFXT
NDLNVXRGDS
NXKDXKAVAA
VKNVEGDFEN
TKIYYEVKSS
ISJAYLLENK
YNKELVRSNT
FDTVDLATGV
QAFILAXGGQ
IKVGDKQSQN
ANSNFVLADC
GVERIAAGDV
VVEKASVVPE
KTAVSEEATV
VSPETTNEPL
LGNVSSKDLA
KEVAEYNKHK
DIFTKLRKDM
TKAEF'VYELQ
PDKDS PFAYA
YGTGPSGLKN
NLNAQSVKP I
LQYIVTCDTT
AINSGXAKDV
TAVQLTXDGE
ERPANYGGXT
DNKDLTFTMN
VVTHTPDDPK
SFFDDYDETX
ELRVTLPTKV
GATIKLGEXF
TKVNKGDDI S
YNTIEESFNN
LPQTGEKQNV
QKDTTSQPTK
GQPTE.VAPAE
AKEKEVDQLQ
NENSYVNE-Al
GGKXTGNFQN
SSPSQSGTLN
LSSLNSSLTN
SDWDAVGHKN
FNYGNPKEPE
ELAKVDPKTV
TASYXLNLDQ
TVTN'IVINI{V
EEWGMNDVLD
QALLAALNEG
PTKAVHNKKG
VTP IKDLLRV
KADVSGDVYN
FYEFTSSDI P
KLFTMTFEQG
EKIKTNTVVT
LLTVAGSLAA
VEEVAPENKC
NEVNKSTSIP
KEQAKIKIAQQ
SKNLVFDQSV
SFVXE-ANLGS
AVFSN.DP IIT KGG14AEFVSD
AYFGSGVGLA
KATIEFNXYK
TKQGIRDTFD
NTVTAMMKflTN
PGSNPSKDVK
TTHDRFTGKW
SNKVGKQAWS
EXIXHGKVAR
YDSKGXDITN
SAEQNTFGQR
AEYAGVVEEW
VVKITASQAF
HTPEKPQTPP
MLGLAGLGFK
TEQSSATPND
KEFETPDVDK
AAELKAKNEK
VTKDTKIS SI
NGGYAVLLEK
AF IGTNRVNG
FGANNAFKYI
NGRI SFSFGM
ANVVPVLVPN
AEKVTIDLSK
ADGSVVLAMG
ADKNGTVGSV
HAITXYDLKV
VYLEVERXKT
GDVLSYE2MTW QFTI SWDDA(
IKTN'IVVNHI
S ISDKLDVKH XDAI-I1ThENK
EKI'VIVPPTP
RRKETK
TTNAQQPTVG
AVDEVKKDPN
IAKENAEIAA
KGGKF IKATD
NKPVTVTYTG
KDVKTRLTIK
NGSYVKKQAD
TrKGKSNVPV
KEVTDGQKNX
VKVYQADASL
YKYLLVLPFV
SLHDKDIPLQ
GXKTLKAGTO
GDVE'JTQTEN
DLKGYDKDFA
GTVTXSAKDP
PKVXPKKDVV
DKFSGQWSVF
NVAHSWKAFI
KTPQAP\TEPL
EF062-3 (SEQ ID NO:235) TGAITCTTCA AGCAACAAAT GAAAGCAAAA AAACAATATA AGACATATAA AGCTAAGAAT CACTGGGTAA CTGTCCCTAT TCT=FTCTA AGTGTGTTAG GAGCCGTAGG ATTAGCTACT 139 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
S.
S S
S
*SS.
.555
S
S*.
*SS.
5S5* S S *5
GATAATGTAC
AACCCCGACC
GTACAAAAAG
GGTACTGAAC
GGAGCTGAAA
CTAGGGCACC
CCTAAAGAAT
AACATTACCG
GCTGCAAAAG
CAAGCAGCTG
GCAAAAAACA
AAGAACGAAA
GTCGTGACGA
GAT'I=AATA
ATCGGNGGA
TCTAkATGGTG
GGACTAAACG
CAATCCTCAC
ACNGCTT'ITA
AAGTTCT=G
GCGCTGTCTT
GA ITTTCGGG
GATGGAAAAT
AATAGTGATT
GCNAATGGNC
GTATCTAGTG
ATTTTCAATT
AAAGCCAATG
NTCAATCT
ACAGAACTTG
GATGCAGAAA
CTNAACCANA
GTGACTGCTT
AACGCNCACG
GTAGTGAAAA
GAAACGGTAA
AAAGCAGATA
CAAACAAAAA
ACNGAAGAAT
TGGCACGCTA
GATATTTCTG
AATCAAGCAT
TCTGTGTATC
AACTACAACA
AAACCAACCA
CGTGGTGATG
GCCTTTGATA
AANGiTCACAC
AACCAGTTCA
CCACA ACCCT
CTCAAAGCCC
CGAATTAAAA
GTTATTAAAC
TTC LTCTATG
AAGCCGCGGA
TGCAGTCAGA
ACACTACTTC
AAAGTTCAGC
AATCAGCACA
CAACAGAACT
TGAAACACC
'ITGTTGAAAA
AAAAAGAAGT
AATI'AAAAGC
AAGCNGAAAA
ACAGCTATGT
AAGACACTAA
AAGTAAATGC
AAGNTACTC
GGTATrGCCCT CTAGTTJWI7T
CAAGCCAAAO
TTGGTACAAA
ATGCGTCAGG
CTI'TAAATTC
CNAACAATGC
TI'ACTCACC
GGGACGCTGT
GTATTTCCTT
CGCAATGGTT
ATGGGAATCC
TCGTTCCTGT
TAAATGTGAA
CCAAAGTAGA
AAGTGACGAT
AAGACTNAAA
CTTATGANCT
GNTCNGTTGT
ATGTAGAAGG
CAAATACAGT
AAAACGGTAC
TTTATTATGA
GGGGCATGAA
TTACNAANTA
CCTACATI
TATF'GGCNGC
TGGAAGTCGP
AAGAGCTTGCI
AAGCCGTTCP
TTC=ITCTTP
CAGTCGATCO
CAATCAAAG;
CGATCTCNTC
TTATTCTAGC
ATGTTI'CTC
CCAATACNGI
TNGCTGACA)
AATTTACAAC
ATTAGATACG C AAAGGAAACA C
TCAACCGACC
TACCCCAAAT C
AGAACA.ACCAC
TGCACCAGCT
AGACGTTGAT
ACCAGCAGAA
AGACCAACTA
CAAAAATGAA
AGAGCGNTAN
CAATGAAGCG
AATTTCGTCG
AGGGGATTCA
CAACTTCCAG
TCTTTTAGAA
AGGACGTAAA
TGGAACG I A
CAGAGTCAAT
TAAAGAAGTA
AAGTTTAACG
GTTCAAATAC
GGAAGATAIT
AGGTCACAAG
TTCTTTTGGT
TGCCTTTAGN
AAAAGAACCA
NCTTGTGCCN
NCGTGGCGAT
TCCAAAAACA
TGATTTATCC
AGCTGTTGCT
CAATTTAGAT
TI'AGCAATG
CGAT'LTTGA-A
GATTAAC CAT
AGTTGGCAGT
*AGTCAAATCT
*TGATGTCTTG
TGACCTTAAA
TITAGAAA-AC
NTTAAATGAA
ACGGATNAAA
NCGTTCTAAT
LTAACAAGAAA
TGAAATGACN
7TGCGACAGrOC
CTTACTTCGT
GGACGATG-CC
GtIATGGTGGG NGAMT TAT
STGTCAACCAT
kACAAAGTCAA
;TAGCACATT
:AACCAGAAA C :CTAAAACGG C
LZAAGTAGAAG
;ATACCACAA Ja 'TACTAAGCC C .AAAATGAAC 9J k.AAGCAG~rG 3ACTTAGGCA I
AAAAAGAAC
\AAATTGCCA
aA1AAAGAAG
CTTAGTAAAA
ATTAAA-GGCGC
AAAGATATCT'
AATTCC TrG
AAAAATAAAC
ATTACAAAAG
P.ATGCAGTAT
GGTAAGGATG
CTACCAGATA
AATAAAGGTG
ATAATGGNT
GACTATGGCA
AATGCCTACT
ATGACAACAA
ACTAACTTAA
GAAAAAGCAA
AATAAAGAAG
TCTTI'ACAAT
GTAACNA.AAC
AAArGTGAA.AG GCAGCNATrA
CAAAACACCG
GGGTATAAAT
AATACAGCr
GTGCCAGGTA
GTTTCTCTAC
TCCGAACGTC
GACACGACCC
GTAGGGGANA
AAACACAATA
GGAAGCA.ATA
ACAGGTGACG
ACNGTGGTGA
GGGGAAGANA
TGGGACTTAA
GTTTCTTTCT
GTCAAACAT1'
AAAGGCACGG
CAAGAATTGC
AA EWCAGCGG
AITCCAAAAG
AATGGNGCCA
CCTGCAGAAT
:AACGACGGT TI :AGTATCTGA A
LAGTAGCG--C
~CGCGCAACA A :TGAAACAAC C rGAATAAATC
MTAAGTAAA
~CGTTTCT1'C AGCAMAAA C kAGAAAATGC CCGCNGAATA kCCTAGTGTT
;AAAATTTAT
E'TACAAAATT
EAAAAGAGGC
AGTGACAGT
:AGAA--TTGT
UTTCAAACGA
TTAAAACACG
kAGATAGTCC
GCCATGCGGA
CNTATGTGAA
:AGGACCTTC
TTGGIWCAGG
AAGGAAAAAG
ATGCGCAATC
CGATGAATT
TCACTGATGG
ACATTGT--AC
AAGGGATTCG
TTTATCAAGC
ATTCAGGAAN
TCACAGC'AAT
ATTTACT
TTCAGCTGAC
GTAATCCTTC
ATGATAAAGA
CAGCNAACTA
ATGATCGTTT
AAACGTTAA-A
AAGACTrGAC
AACTAGGCAA
TAGAAAACAC
CGCATACNCC
TTAANCATGG
AAGGGTACGA
TCGATGATTA
CTAAAGCGGN
TGACNATNTC
GTrGTAACNCT
AACAAAATAC
TGAANCCTAA
CAATCAAATT
ACGCTGGT
CAACCCAAT
IGAAGCAACA
LGAAAATAAA
~CCAACAGTA
:AATGAACCT
LACGTCCAT
LAAAGATCCA
.AAAGATTTA
ATTGCCCAA
IGAA.ATTGCG
:AACAAGCAT
GATCAATCT
:AAAGCAACT
%CGGAAkAGAT
%AATCTTGGG
'ACCTATACA
TTATGAACTA
rCCGATTATC
CTTAACGATT
AT'IGCGTAT
ATTGTTTCT
PAAACAAGCG
TGGATTGAAA
TGTAGGTCTA
TAATGTGCCT
AGTGAAGCCT
CAATNGATAC
NCAGAAAAAT'
AGGGGATACG
AGATACN'TT
AGACGCAAGT
AGCTAAAGAC
GATGAAAACC
CTTGCCGTT'
AAANGATGGN~
CAAAGATGTA
TATTCCG~rA
TGGCGGAATN
CACAGGNAAA
AGCAGGAACA
GTTTACNATG
ACAAGCTTGG
GCAAACAGAA
TGATGATCCA
AAAAGTNGCT
TAAAGACTTT
CGATGAAACG
AGACATTACG
TOccAAAGAC
CCCTACAAAA
ATITGGNCAA
MAAAGACGTG
AGGGGAGAAN
TGTGGAAGAA
140 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes.
TG;GTCGATTA
TI'TGCCAATT
TCGAAACTAT
TT=FNGATG
ATTGGTGTAG
AATGAGAAGA
CCAGAAAAA
TTAGTGGTAG
GCGATAAACT AGACGTCAAA CTAATrTGT 'FTTAGCAGAC TCACGATGAC CTTTGAACAA CGATGAATCT AAAAGAAAAC AACGAATTGC GGCAGGAGAC TTAAA-ACNAA TACGCTACTG CAGTGATTGT ACCACCAACA AAAAGGCAAG Tr,
CATGACAA.AT
GGAACCAAAG
GGGGTAGTGA
AAAACGTTG
GTIACAACA
ACNCATACGC
CCAAAA.ACAC
TI'AGTC-CCCA
TGAATAAAGG
AAATCACGC
CACAVI'CATG
CAATCGAAGA
CAGA.AAAACC
CGCAACCACC
ATGGTCTG
GGACGACATT
CAGTCAAGCC
GAAAGCGTTC
ATCTTTCAAC
ACAAACNCCA
AGTAGAGCCA
EF062-4 (SEQ ID NO:236) AELDTQ PET ITVQPNN PDLQSEKETP KTAVSEEATV AEKSAQEQPV VSPETTNEPL ITVVEKPAED LGNVSSKDLA KNKAEXERXX KEVAEYNXHK FNKVNAGDSK DIFTKLRKDM LNASYLGRK I TKAEFVYELQ FFDASGKEVL PDKDSPFAYA GKFYSPEDID YGTCSGLKN SSAQWFAFXT NLNAQSVKPI NDLNVXRGDS LQYIVTGDTT NXKDXKAVAA AINSGXAXDV VK<NVEGDFEN TAVQLTXDGE TKIYYEVKSS ERPANYGGXT I SAY ILLENK DNKDLTFTMN YNKELVRSNT VVTHTPDDFK FDTVDLATGV SFFDDYDETX QAPILAXGGQ ELRVTLPTKV IKVGDKQSQN GATIKLGEXF ANSNFVLADG TKVNKGDDIS CVERIAAGDV YNTIEESFNN
VVEKASV
QKDTTSQPTK
GQPTEVAPAE
AKEKEVDQLQ
NENSYVNEAI
GGKXTGNFQN
SSPSQSGTLN
LSSLNSSLTN
SDWDAVGHKN
FNYGNPKEPE
ELAKVDPKIV
TASYXLNLDQ
'IVTNTVINHV
EEWGMNDVLIJ
QALLAALNEG
PTKAVHNYKKG
VTPIKDLLRV
KADVSGOVYN
FYEFTSSDIP
KLFTMTFEQG
EKIKTNTVVT
VEEVAPENXG
NEVNKSTSIP
KEQAKK IAQQ
SKNLVFDQSV
SFVKEANLGS
AVFSNDPI IT
KGGHAEFVSD
AYFGSGVGLA
KAT IEFNXYK
TKQGIRDTFD
NTVTAMKKTN
PGSNPSKDVK
TTHDRFTGKW
SNKVGKQAW8
EXIXHCKVAR
KDSKGXDITN
SAEQNTFGQR
AEYAGVVEEW
VVKITASQAF
HTPEKPQTPP
TEQSSATPND
KEFETPDVDK
AAELKAKNEK
VTKDTICSSI
NGGYAVLLEK
AF IGTNRVNG FGANNAFKY I NGR ISFSFGM
ANVVPVLVPN
AEKVTIDLSK
ADGSWLAM4G
ADKNGTVGSV
HAIl(YD 4
KV
VYLEVERXYKT
GE)VLSYEMW
QFTISVIDDAI(
IKTNFJVNH I S ISDKLDVKH
XDAMLKENK
EKTVIVPPTP
TTNAQQPTVG
AVDEVKKDPN
IA.KENAEIAA
KGGKF IKATD
NKPVTVTYTG
KDVKTRLTIK
NGSYVKKQAD
TTKGKSNVPV
KEVTDGQKNX
VKVYQADASL
YKYLLVLPFV
SLHDKDIPLQ
GXKTLKAGTD
GDVENTQTEN
DLKGYDKDFA
GTVTXSAKDP
PKVXPKKDVV
DKFSGQWSVF
NVAHSWKAF I
KTPQAPVEPL
EF063-1 (SEQ ID NO:237)
TGATTCTTGA
CACTGGGTAA
GATAATGTAC
AACCCCGACC
GTACAAAAAG
GGTACTGAAC
GGAGCTGAAA
CTAGGGCAGC
CCTAAAGAAT
AACATTACCG
GCTGCAAAAG
CAAGCAGC'rG
GCAAAAAACA
AAGAACGAAA
GTCGTGACGA
GATTTTAATA
AGCAACAAAT
CTGTCCCTAT
AAGCCGCGGA
TGCAGTCAGA
ACACTACTTC
AAACTTCAOC
AATCAGCACA
CAACAGAAGT
TTGAAACACC
TTGTTGAAAA
AAAAAGAAGT
AATTAAAAGC
AAGCNCAAAA
ACAGCTATGT
AAGACACTAA
AAGTAAATGC
GAAAGCAAAA
TCTTTTTCTA
A ITAGATACG
AAAGGAAACA
TCAACCGACC
TACCCCAAAT
AGAACAACCA
TGCACCAGCT
AGACGTTGAT
ACCAGCAGAA
AGACCAACTA
CAAAAATGAA
AGACCGNTAN
CAATGAAGCG
AAITTCGTCG
AGGGGATTCA
AAACAATATA
AGTGTGTTAG
CAACCAGAAA
CCTAAAACGG
AAAGTAGAAG
GATACCACAA
CTACTAAGCCC
GAAAATGAAC
AAAGCAGTIG
GACTTAGGCA
CAAAAAGAAC
AAAATTGCCA
CANAAAGAAG
ATTACTAAAA
ATTMAAGGCG
AAAGATATCT
AGACATATAA
GAGCCGTAGG
CAACGACGGT
CAGTATCTGA
AAGTAGCCC
ACGCGCAACA
CTGAAACAAC
TGAATAAATC
ATGAAGTAAA
ACGI"I'CTTC
AAGCGAAA
AAGAAAATGC
TCGCNiGAATA
ACCTAGTGTT
GAAAATTTAT
TACAAAATT
AGCTAAGAAT
ATIAGCTACT
TCAACCcAAT
AGAAGCAACA
AGAAAATAAA
ACCAACAGTA
CAATCAACCT
AACGTCCATT
AAAAGATCCA
TAAAGATTTA
GATTGCCCAA
AGAAATTGCG
CAACAAGCAT
CGATCAATCT
CAAAGCAACT
ACGGAAAGAT
141 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
a.
ATGGGNGGGA
TCTAATGGT'G
GGACTAAACG
CAATCCTCAC
ACNGCI'rTTA AAGTTrCTTTG
GCGCTGTCTT
GATTTTGGGC
GATGGAAAAT
AATAGTCATT
GCNAAT13GNC
GTATCTAGTG
ATTTTCAATT
AA.AGCCAATG
NTCAATGATT
ACAGAAC LTG
GATGCAGAAA
CTNAACGANA
GTCACTGCCT
AACGCNGACG
GTAGTGAAAA
GAAACGGTAA
AAAcCAGATA
CAAACAAAAA
ACNGAAGAAT
TGGCACGCTA
GATATTTCTG
AATCAAGCAT
TCTCTGTATC
AACTACAACA
AA.ACCAACCA
CGTGGTCATG
GCCTT~FGATA
AANGTGACAC
AACCAGTTCA
CCACAAGCCT
GTCAAAGCCG
CCAATTAAAA
GTTATTAAAG
TTCTTCTATG
TCGTCCA'ITA
TTTGCCAATTI
TCGAAACTAT
T TTTT-NGATC A ITGGTrGTAG
AATGAGAAGA
CCAGAAAAAA
TTACTGGTAG
GTCTTATTAA
AAACGTAGAA
AAGNTACTGG
GGTATGCGGT
CTAGTTATT
CAAGCCAAAG
TI'GGTACAAA
ATGCGTCAGG
CTTTAAATTC
CNAACAATGC
FI'TACTCACC
GCCACGCTGT
GTATTTCCTT
CGCAATGGTT
ATGGGAATCC
TCGITTCCTGT
TAAATGTGAA
CCAAAGTAGA
AAGTIGACGAT
AAGACTNAAA
CTTATGANCT
GNTCNGTITGT
ATGTAGAAGG
CAAATACAGT
AAAACGGTAC
TTATTATGA
GGGGCATGAA
TTACNAANTA
CCTACATTCT
TATTGGCNGC
TGGAAGTCGA
AAGAGCTTGT
AAGCCGTTCA
TTCTI'CTTA
CAGTCGATCT
CAATCAAAGA
CGATCTCNTG
TTATT-:TAGC
ATGTTCTGG
CCAATACNGT
TNGGTGACAA
AATTTACAAG
GCGATAAACT
CTAAT=TGT
TCACGATGAC
CGATGAATCT
AACGAATTGC
TTAAAACNAA
CAGTGATTGT
AAAAGGCAAG
CGGTAGCTGG
AAGAAACAAA
CAACTTCCAG
TCTTTTAGAA
AGGACGTAALA
TGGAACG1'TA
CAGAGTCAAT
TAAAGAAGTA
AAGTTTAACG
GTTCAAATAC
GGAAGATATT
AGGTCACAAG
TTCTTITTGT
TGCC=TAGN
AAAAGAACCA
NCTI'GTGCCN
NCGTGGCGAT
TCCAA.AAACA
TGATI'TATCC
AGCTGTTGCT
CAA ITTAGAT
=~AGCA.ATG
CGATTTTGAA
GATTAACCAT
ACGFrGGCAGT
AGTGAAATCT
TGATGTCTTG
TG3ACCTTAAA
TTTAGAAAAC
NTTAAATGAA
ACGGATNAAA
NCCTTCTA.AT
TAACAAGAAA
TGAAATGACN
TCGACAGGC
C ITACTI'CGT
GGACGATGCC
GNATGGTCGC
NGATGTII'AT
TGTCAACCAT
ACAA.AGTCAA
TAGTGACATT
AGACGTCAAA
TTTAGCAGAC
CTTrGAACAA
AAAAGAAAAC
GGCAGGAGAC
TACGGTAG
ACCACCAACA
TGTNGTGCCA
TAGTTTAGCC
ATAA
AATTCCTTG
AAAAATAAAC
ATTACA-AAAG
AATQCAGTAT
GGTAAGG-ATG
CTACCAGATA
AATAGGTG
ATAATGCNT
GACTATGGCA
AATGCCTACT
ATGACAACAA
AKCTAACTTAA
CAAAAAGCAA
AATAAAGAAC
TCTTTACAAT
GTAACNAAAC
A.AAGTGAAAC
GCAGCNATTA
CAAAACACCG
GGGTATAAAT
AATACAGCTC
GTGCCAGGTA
CTTTCTCTAC
TCCGAACGTC
GACACGACCC
GTAGGGGANA
AAAGACAATA
GGAAGCAATA
ACAGGTGACC.
ACNGTGGTGA
GGGGAAGA'A
TGGGACTTAA
GTTTCTTCT
GTCAAAGA'T
AAAGGCACGG
CAAGAATTC
AATTCAGCCG
ATITCCAAAG
AATGGNGCCA
CCTGCAGAAT
CATGACAAAT
GGA.ACCAAAG
GGGGTAGTGA
AAAAACG1rG
GTTTACAACA
ACNCATACGC
CCAAAAACAC
GAATTGCCGC
CCAATCc'r
TAAA-AGAGGC
CAGTGACAGT
CAGAA=rGT
TTTCAAACGA
TTAAAACACG
AAGATAGTCC
GCCAMCGGA
CNTATCTGAA
CAGGACCTTC TGGATTGAAA TTGGTTCAGG TGTAGGTCTA A.ACCAAAAAG TAATGTCCCT ATGCGCAATC AGTGALAGCCT CGATTGAATT CAATNGATAC TCACTGATGG NCAGAAAAAT ACATPMTGAC ACGGGATACC AAGGGATTCG ACATACNTTT TTTATCAACC AGACGCAAGT ATTCAGGAAN ACCTAAAGAC TCACAGCALAT GATGAAAACC AT'LTACTTGT C'IrGCCGTTT TTCAGCTGAC AAANGATGGN GTAATCCTTC CAAAGATGTA ATGATAAAGA TATCCGTTA CAGCNAACTA TGGCGGAATN ATGATCGTTT CACAGGNAA-A AAACGTTAAA AGCAGGAACA A.AGACTTGAC GTTTACNATG AAGTAGGCAA ACAAGCTTG TAGAAAACAC GCAAACAGAA CGCATACNCC TGATGATCCA TTAANCATGG AAAAGTNGCT AAGGGTACGA TAAAGACTTT TCGATGATTA CGATGAAACG CTAA-AGCGGN AGACATTACG TGACNATNTC TGCCAAAGAC GTGTAACNCT CCCTACAA;AA AACAAAATAC AITI'GGNCAA TGAANCCTAA AAAACACGTG CAATCAAATT AGGGGAGAA-N ACGCTGGNCT TC-TCGAAGAA TTAGTGGCCA ATGGTCTGTG TGAATAAAGG GGACGACATT AAATCACGG;C CAGTCAAGCC CACACTCATG GAAAGCGTTC CAATCGAAGA ATCTITCAAC CAGA.AAAACC ACAAACNCCA CGCA.AGCACC ACTAGAGCCA AAACAGGCGA AAAACAAAAT GCTTAGCAGG; CTTAGGcTITT
AAATCTTGGG
GACCTATACA
TI'ATGAACTA
TCCGATTATC
CTTAACGATTr
ATTTGCGTAT
ATTTGTTTCT
AAAACAACCC
EF063-2 (SEQ ID NO:238) 142 TABLE 1. Nucleotide and Anino Acid Seqeuences of E. faeca/is Genes.
MKAKK QYKTYKAKNH WVTVPILFLS VLGAVGLATD NVQAAELDTQ PETTTVQPNN
PDLQSEKETP
AEKSAQEQPV
ITVVEKPAED
KNKAEKERXX
FNKVNAGOSK
LNASYLGRKI
FFDASGKEVL
GKFYSPEDID
SSAQWFAFXT
NDLNVXRCDS
NXKDXKAVAA
VKNVEGOFEN
TX IYYEVKSS
ISAYILLENK
YNXELVRSNT
FDTVDLATCV
QAFILAXGGQ
IKVGDKQSQN
ANSNFVLADG
GVER IAAGDV
VVEKASVVPE
KTAVSEEATV
VSPETTNEPL
LGNVSSKDLA
KEVAEYNKHK
DIFTKLRKDM
TKAEFVYELQ
PDKDSPFAYA
YGTGPSGLKN
NLNAQSVKP I LQY IVTGDTT
AINSGXAKDV
TAVQLTXDGE
ERPANYGGXT
DNKDLTFTMN
VVTH-TPDOPK
SFFDDYDETX
ELRVTLPTKV
GATIKLGEXF
TKVNKGDOI S
YNTIEESFN
LPQTEKQNV
QKDTTSQPTK
GQPTEVAPAE
AK:EKEVDQLQ
NENSY-VNEAI
GGKXTGNFQN
SSPSQSCTLN
LSSLNSSLTN
SDWDAVGHKN
FNYGNPKEPE
ELAKVDPKTV
TASYXLNLDQ
TVTNTVINH-V
EEWGMNDVLD
QALLAALNEG
PTKAVHNKKG
VTPIKOLLRV
KADVSGDVYN
FYEFTSSDIP
KLFTMTFEQG
EK IKTNTVVT
LLTVAGSLAA
VEEVAPENYG
NEVNKSTSIP
KEQAKKIAQQ
SKNLVFEDQSV
SFVIKEANLGS
AVFSNDPI IT
KGGRAEFVSD
AYFGSGVGLA
'ATIEFNXYK
TKQGIRDTFO
fTVTAMTN
PGSNPSKDVK
TTHDRFTGKW
SNKVGKQAWS
EXIXHGKVAR
KDSKGXDITN
SAEQNTF'GQR
AEYAGVVEEW
VVKITASQAF
TE-QSSATPND
KEFETPDVDK
AAELKAKNEK
VTKIDTKISSI
NGGYAVLLEK
AFIGTNRVNG
FGANNAFKYI
NGRISFSFGM
ANVVPVLVPN
AEKVTIDLSK
ADGSVVLAMG
ADKNGTVGSV
HAITXYIJLKV
VYLEVERXKT
GDVLSYEMTW
QFTI SWDDA( IKTN'IVVNH I
SISDKLDVKH
XDAMNLKENK
TTNAQQPTVG
AVDEVKKDPN
IAKENAE IAA
KGGKFIKATD
NKPV7VTYTG
XDVKTRLTIK
NGSYVKKQAD
TTKGKSNVPV
KEVTDGQ-KNX
VKVYQADASL
YKYLLVLPFV
SLHDKDIPLQ
GXKTLKAGTD
GDVENTQTEN
DLKGYDKDFA
GTVTXSAKDP
PKVXPKKEDVV
DKFSGQWSVF
NVAHSWKAFI
KTPQAPVEPL HTPEKPQTPP EKTVIVPPTP MLGLAGLG:FK RRKETK EF063-3 (SEQ ID N'O:239) GGA ATTAGATACG CAACCAGAAA CAACGACGGT TCAACCCAAT
AACCCCGACC
GTACAAAAAG
GGTACTGAAC
GGAGCTGAAA
CTACGCAGC
CCTAAAGAAT
AACATTACCG
GCTG.CAAAAC
CAAGCAGCTG
GCAAAAAACA
AAGAACGAAA
GTCGTGACGA
GAI=TAATA
ATGGGNGGGA
TCTAATGGTG
GGACTAAACG
CAATCCTCAC
ACNGCTTTTA
AAGTTCTTTG
GCGCTGTCTT
GATTTGGGG
GATGGAAAAT
AATAGTGATT
GCNAATCGNC
GTATCTAGTG
ATTTTCAATT
TGCAGTCAGA
ACACTACTTC
AAAGTTCAGC
AATCAGCACA
CAACAGAAGT
TTGAAACACC
TGTTGAAAA
AAAAAGAAGT
AATTAAAAGC
AAGCNGAAAA
ACAGCTATGT
AAGACACTAA
AAGTAAATGC
AAGNTACTGG
GGTATGCGGT
CTAGTTATTTI
CAAGCCAAAG
TTGGTACAAA
ATGCGT'CAGG
CTTTAAATI'C
CNAACAATGC
TTTACTCACC
GGGACGCTGT
CTATTTCCTT
CGCAATGGTT
ATGGGAATCC
AAAGGAAACA
TCAACCGACC
TACCCCAAAT
AGAACAACCA
TGCACCAGCT
AGACG ITGAT
ACCACCAGAA
AGACCAACTA
CAAAAATGAA
AGAGCGNTAN
CAATGAAGCG
AATTTCGTCG
AGGGGATI'CA
CAACTTCCAG
TCTTTTAGAA
AGGACGTAAA
TGGAACGTTA
CAGAGTCAAT
TAAAGAAGTA
AAGTTTAACG
GTTCAAATAC
GGAAGATATT
AGGTCACAAG
CCTAAAACGG
AAAGTAGAAG
GATACCACAA
GTAGTAAGCC
GAAAATGAAG
AAAGCAGTTG
GACTI'AGGCA
CAAAAAGAAC
AAAATTGCCA
GANAAAGAAG
ATTAGTAAAA
ATTAAAGGCG
AAAGATATCT
AATTCCTTTG
AAAAATAAAC
ATI'ACAAAAG
AATGCAGTAT
GGTAAGGATG
CTACCAGATA
AATAAAGGTG
ATTAATGGNT
GACTATGGCA
AATGCC TACT
CAGTATCTGA
AAGTAGCGCC
ACGCGCAACA
CTGAAACAAC
TGAATAAATC
ATGAAGTAAA
ACGTTI'CTTC
AAGCGAAAAA
AAGAAAATGC
TCGCNGAATA
ACCTAGTGT
GAAAA TIAT TTACAAAA T
TAAAAGACGC
CAGTCACAGT
CAGAA'TTGT
TTCAAACGA
TI'AAAACACG
AAGATAGTCC
GCCATGCGGA
CNTAT13TGAA
CAGGACCTTLC
TI'GGTCAGG
AAGGAAAA.AG
ATrCCGCAATC
CGATTGAATT
AGAAGCAACA
AGAAAATAAA
ACCAACAGTA
CAATGAACCT
AACGTCCT
AAAAGATCCA
TAAAGATTTA
GA'ITGCCCAA
AGAAATTC
CAACAAGCAT
CGATCAATCT
CAAAGCAACT
ACGGAAAGAT
AAATCTTGGG
GACCTATACA
TTATGAACTA
TCCGATTATC
CTTAACGATT
ATTTGCGTAT
ATTTGTI'TCT
AAAACAACCG
TGGAITGAAA
TGTAGGTCTA
TAATGTGCCT
AGTGAAGCCT
CAATNGATAC
TTCrTrGGT ATCACAACAA MCCTITAGN ACTAACTTAA AAAAGAACCA GAAAAAGCAA 143 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
AAAGCCAATG TCGTTCCTGT NCT'TGTGCCII AATAAAGAAG NTCAATGATr TAA.ATGTGAA NCG TGGCGAT TCTT'FACAAT ACAGAACTTG CCAAAGTAGA TCCAAAAACA GTAACNAAAC GATCCAGAAA AAGTGACGAT TGATTTATCC AAACTG
TCACTGATGG
ACATTTGAC
AAGG;GATTCG
NCAGAAkAAAT
AGGGGATACG
AGATACNFT
EF063-4 (SEQ ID NO:240) ELDTQ PETVQPNN PDLQSEKETP KTAVSEEATV AEKSAQEQPV VSPETTNEPL ITVVEXPAED LGNVSSKDLA KNKAEKERXX KEVAEYNKHK FNXVNAGDSK DIFTKLRKDM LNASYLGRXI TKAEFVYELQ FFDASGKEVL PDKDSPFAYA CKFYS PEDID YGTGPSGLKN SSAQWFAFXT NLNAQSVKP I NDLNVXRGDS LQYIVTGDTT
QKDTTSQPT(
GQPTEVAPAE
AKEKEVDQLQ
NENSYVNEAI
GGKXTCN'FQN
SSPSQSGTLN
LSSLNSSLTN
SDWDAVGHKN
FNYGNPKEPE
ELAKVDPKTV
VEEVAPEN(G
NEVNKSTSIP
KEO2AKK IAQQ
SKNLVFDQSV
SFVKEANLGS
AVFSNDPIIT
KGGHAEFVSD
AYFGSGVGLA
KATIEFNXYK
TKQGIRDTFD
TEQS SATPND
KEFETPOVDK
AAELKAKNEK
VTKDTKISS I
NGGYAVLLEK
AF IGTNRVNG
FGANNAFKYI
NGRI SFSFGM
ANVVPVLVPN
AEKVTIDLSK
TTNAQQPTVG
AVDEVKKDPN
IAKENAETIAA
KGGKF IKATD
NI(PVTVTYTG
KDVKTRLTIK
NGSYVKKQAD
TTKGKSNVPV
KEVTDGQKNX
EF064-1 (SEQ ID NO:241)
C..
C.
TCATTCTTGA
CACTGGGTAA
GATAATGTAC
AACCCCGACC
GTACAAAAAG
GGTACTGAAC
GGAGCTGAAA
CTAGGGCAC
CCTAAAGAAT
AACA'ITACCG
GCTGCAAAAG
CAAGCAGCTG
GCAAAAAACA
AAGAACGAAA
GTCGTGACGA
GA'I=TAATA
ATGGGNCGGA
TCTAATGTG
GGACTAAACG
CAATCCTCAC-
ACNGC=TTA
AAGTI'CTTTG
GCGCTGTCTT
GATTTTGGGG
GATGGAAAAT
AATAGTGATT
GCNAATGGNC
GTATCTAGTG
ATICAT
AAACCCAATG
NTCAATGATT
ACAGAACTI'G
ACCAACAAAT
CTGTCCCTAT
AAGCCGCGGA
TGCAGT-CAGA
ACACTACTTC
AAAGTTCAGC
AATCAGCACA
CAACAGAACT
TTGAAACACC
T
TGTTGAAAA
AAA.AAGAIAGT
AATTAAAAGC
AAGCNGAAAA
ACACCTATGT
AAGACACTAA
AAGTAAATGC
AAGNTACTGG
GGTATCCGGT
CTAGTTATT
CAAGCCAALAG
T1'CGTACAAA
ATGCGTCAGG
CTI'TAAATTC
CNAACAATC
TTTACTCACC
GGGACGCTGT
GTATTTCCT
CGCAATGGTT
ATGGGAATCC
TCGTTCCTCT
TAAATGTGAA
CCAAAGTAGA
GAAAGCAAAA
TCT'ITTTCTA
ATTAGATACG
AAAGGAAACA
TCAACCGACC
TACCCCAAAT
AGAACAACCA
TGCACCAGCT
AGACGTTGAT
ACCAGCAGAA
AGACCAACTA
CAAAA.ATGAA
AGAGCGNTAN
CAATGAAGCG
AATTCGTCG
AGGGGATTCA
CAACTTCCAG
TCTTTTAGAA
AGGACGTAAA
TGGAACGTTA
CAGAGTCAAT
TAAAGAAGTA
AAGFI 'AACG
GTTCAAATAC
GGAAGATT
AGGTCACAAG
TICTTTGT
TGCC IT'AGN
AAAAGAACCA
NCTTGTCCCN
NCGTGGCGAT
TCCAAAAACA
AAACAATATA
AGTCTGTTAG
CAACCAGAAA
CCTAAAACGG
AAACTAGAAG
GATACCACAA
GTAGTAAGCC
GAAAATGA.AG
AAAGCAGTTG
GACI'AGGCA
CAAAAAGAAC
AAAATTGCCA
GANAAAGAAG
ATTIACTAAAA
ATTIAAAGGCG
AAAGATATCT
AATTCCTTTG
AAAAATAAAC
ATTACAAAAG
AATGCAGTAT
GrGTAAGGATG
CTACCAGATA
AATAAAGGTG
ATTAATGGNT
GACTATGGCA
AATGCCTACT
ATGACAACAA
ACTAACTTAA
GAAAAAGCAA
AATAAAGAAG
TC=IACAAT
GTAACNAAAC
AGACATATAA
GACCCCTAGG
CAACGACGGT
CAGTATCTGA
AAr-TAGCGCC
ACGCGCAACA
CTGAAACAAC
TGAATAAATC
ATGAAGTAAA
ACGT=ITTC
AAGCGAAAAA
AAGAAAATGC
TCGCNGAATA
ACCTAGTGTT
GAAAATI'TAT
TTACAAAATT
TAAAAGAGGC
CAGTCACAGT
CAGAATTTGT
TTTCAAACGA
TTAAAACACG
AAGATAGTCC
GCCATGCGGA
CNTATGTGAA
CAGGACCT'C
TTGGITCAGG
AAGGAAAAAG
ATGCGCAATC
CGATLTGAATrI
TCACTGATGG
ACATTGTGAC
AAGGGATTCG
AGCTAAGAAT
ATTAGCTACT
TCAACCCAAT
AGAAGCAACA
AGAAAATA.AA
ACCAACAGTA
CAATGAACCT
AACGTCCAT'
AAAAGATCCA
TAAkAGATFE'A
GATTGCCCAA
AGAAATTGCG
CAACAAGCAT
CGATCAATCT
CAAAGCAACT
ACGGAAAGAT
AAATCTTGGG
GACCTATACA
TTATGAACTA
TCCGATTATC
CTAACGT
ATI'GCGTAT
ATTT GTTI'CT
AAAACAAC
TGGATI'GAAA
TGTAGGTICTA
TAATGTGCCT
AGTGAAGCCT
CAATNGATAC
NCAGAAAAAT
AGGGGATACG
AGATACNTIT
TABLE 1. Nucleotide and Amnino Acid Seqeucnces of E.faecalis Genes.
GATGCAGAAA AAGTGACGAT CTNAACGANA AAGACTNAAA GTGACTGCTT CTTATGANCT AACGCNGACG GNTCNGTTGT GTAGTGAAAA ATCTAGAAGC GAAACGGTAA CAAATACAGT AAAGCAGATA AAAACGGTAC CAAACAAAAA TTTATTATGA ACNGAAGAAT GGGGCATGAA TGGCACGCTA 'ITACNAANTA GATATTTCTG CCTACATTCT AATCAAGCAT TATTGGCNGC TCTGTGTATC TGGAAGTCGA AACTACAACA AAGAGCTTCT AAACCAACCA AAGCCjTTCA CGTGGTGATG TTCTTTCTTA GCCT=G-AT-A CAGTCGATCT AANGCACAC CAATCAAAGA AACCAGTTCA CGATCTCNTG CCACAAGCCT TTATTCTAGC GTCAAAGCCG ATG =TCTGG CGAATTAAAA CCAATACNGT GTATAAAG TNGGTGACAA TTCTTCTATG AATI'TACAAG TGGTCGATTA GCGATAAACT ITrGCCAATT CTAA'T7T TCCAAACTAT TCACGATGAC TITTNGATG CGATGAATCT ATTGGTGTAG AACGAATTGC
AATGACAAGA*TTAAAACNAA
CCAGAAAAAA CAGTGATTGT
TGATTTATCC
AGCTGTTGCT
CAATTTAGAT
TrTTAGCAATG CGA'TTrCAA
GATI'AACCAT
AGTTGGCAGT
AGTGAAATCT
TGATGTCTTG
TGACCTTAAA
TITTAGAAAAC
NTTAAATGAA
ACGGATNAAA
NCGTTCTAAT
TAACAAGAAA
TGAAATGACN
TGCGACAGGC
CTTACTTCGT
GGACGATGCC
GNATGGTGGG
NGATCTTTAT
TGTCAAC CAT
ACAAAGTCAA
TAGTGACATT
AGACGTCAAA
T'ITAGCAGAC
CT'rrGAACA-A
AAAAGAAAAC
GGCAGGAGAC
TACGGTAGTG
ACCACCAACA
AAAGTGAAAG
GCACCNATTA
CAAAACACCG
GGGTATAAAT
AATACAGCTlr
GTGCCAGGTA
GTTTCTCTAC
TCCGAACGTC
GACACGACCC
GTAGGGGA1NA
AAAGACAATA
GGA-AGCAAT-A
ACAGGTGACG
ACNGTGGTGA
GGGGAAGANA
TGGGACTTAA
G'TTCTTTCT
GTCAAAGATI'
AAAGGCACGG
CAAGAATTC
AATTCAGCGG
ATTCCAAAAG
AATGGNGCCA
CCTGCAGAAT
CATGACAAAT
GGAACCAAAG
GGTAGTGA
AAAAACGTTG
GTTTACAACA
ACNCATACGC
CCAAAAACAC
GAA ITGCCGC
GCAATGCTTG
T=I1ATCAAGC
ATTCAGCAAN
TCACAGCAAT
ATTTACTTGT
F1'CAGCTGAC
GTAATCCTTC
ATGATAAAGA
CAGCNAACTA
ATGATCGTT
AAACGTTAAA
AAGAC=IAC
AAGTACCCAA
TAGAAAACAC
CGCA#ICNCC
TTA.ANCATGG
AAGGGTACGA
TCGATGATTA
CTAAAGGGN
TGACNATNTC
GTGTAACNCT
AACA-AAATAC
TGAANCCTAA
CAATCAAATT
ACCCTGGNGT
TTAGTGGCCA
TG.AATAAAGG
AAATCACGC
CACACTCATG
CAATCGAAGA
CAGAAAAACC
CGCA.AGCACC
AAACAGGCGA
GCTTAGCAGG
AGACGCAAGT
ACCTAAAGAC
GATGAAAACC
CTTGCCGTTT
AAANGATGGN
CAAAGATGTA
TATTCCGTTA
TGGCGGAATN
CACAG4GNA.AA
AGCAGGAACA
G'TTACNATG
ACAAGCTTGG
GCAAACAGAA
TGATGATCCA
AAAAGTNGCT
TAAAGACTT
CGATGAAACG
ACACA'ITACG
TGCCAAAGAC
CCCTACAAAA
ATTTGGNCAA
AAAAGACGTG
AGGGGAGAAN
TGTGGAAGAA
ATGGTCTGTG
GGACGACATT
CACTCAACCC
GAAAGCGTTC
ATCT PTCAAC
ACAAACNCCA
AGTAGAGCCA
AA.AACAAAAT
CTAGGCrTTT 4 4
C.
4 4 T1'AGTGGTAG
GTCTTATTAA
AAACGTAGAA
AAAAGGCAAG TGTNGTGCCA CGGTAGCTGG TAGTTTACC AAGAAACAAA ATAA EF064-2 (SEQ ID NO:242) MXAKK QYKTYKAKNH WVTVPILFLS VLGAVGLATD NVQAAELDTQ PE-TTVQPNN 4 C. 4 .4
S.
PDLQSEKETP
AEKSAQEQPV
ITVVEKPAED
KNKAEKERXX
FN](VNAGOSK
LNASYLGRKI
FFDASGKEVL
GKFYSPEDID
SSAQWFAFXT
NDLNVXRGDS
NXKDXKAVAA
VKNVEGDFEN
TKIYYEVKss I SAY ILLENK
YNKELVRSNT
KTAVSEEATV
VSPETITNEPL
LGNVSSKDLA
KEVAEYNKH(
DIFTKLRKDM
TKAEFVYELQ
PDKDSPFAYA
YGTGPSGLKN
NLNAQSVKP I
LQYIVTGDTT
AINSGXAKDV
TAVQLTXDGE
QKDTTSQPTI(
GQPTEVAPAE
AXEXKEVDQLQ
NENSY-VNEAI
GGKXTGNFQN
SSPSQSGTLN
LSSLNSSLTN
SDWDAVGHKN
FNYGNPKEPE
ELAKVDPKTV
TASYXLNLDQ
TVTNTVINHV
VEEVAPEN(C
NEVNKSTSIP
KEQAYKKIAQQ
SXNLVFD)QSV
SFVKEANr~LGS AVFSNDP IIT
KGGHAEFVSD
AYFGSGVGLA
KATIEFNXYK
TKQGIRIDTFD
NTVTAMMdXTN
PGSN'PSKDVK
TTHDRFTGI(W
SNXVGKQAWS
EX IXHGKVAR TEQSSATPND ITNAQQPTVG KEFETPDVJDK AVDEVKKDPN AAELKCAKNEK IAKENAEIAA VTXDTKISS I KGGKFIKATD NGGYAVLLEK NKPVTVTYTG AF IGTNRVNG KDVKTRLTIK FGANNAFKYI NGSYVKKQAD NGRI SFSFGM TTKGKSNVPV ANVVPVLVPN KEVTDGQKNX AEKV'T-DLSK VKVYQADASL ADCSVVLAMC YKYLLVLPFV ADKNGTVGSV SLI.DKD IPLQ HAITXYDLKV GXKTLKAGTD VYLEVERXKT GDVE.NTQTEN GDVLSYEMTW~ DLKGYDKDFA ERPANYGGXT EE'WCMNDVLD DNKDLTFTMN QALLAALNEC VVTHTPDDPK PTKAVHNKXG 145 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalisv Genes.
FDTIVDLA G-V QAF IL.XGGQ
I(VGDKQSQN
ANSNFVLADG
GVER IAAGDV
VVEKASVVPE
SFFDDYDETX
ELRVTLPTKV
GATIKLGEXF
TKVNKGDDIS
YNTIEESFNN
LPQTGEI{QNV
VTPIKDLLRV
KADVSGDVYN
FYEF'TSSDIP
KLFTMTFEQG
EKIKTNTVVT
LLTVAGSLA-A
KDSKGXD ITN
SAEQNTFGQR
AEYAGVVEEW
VVI(ITASQAF
HiTPEXPQTPP
MLGLAGLGFK
QFTISWDOAX
I KTNTVVNH4 I
SISDKLDVKH
XDAMNLKENK
EKTVIVPPTP
RRKETK
GTVTXSAKDP
PKVXPKXDVV
DKFSGQWSVF
NVAHSW1(AF I
KTPQAPVEPL
EFO64-3. (SEQ ID NO:243) AGTGACGAT TGATATCC AAAGTGAAAG IT=ATCAAGC AGACGCAAGT a a.
a p a a *aa.
CTNAACGANA
GTGACTGCTT
AACGCNGACG
GTAGTGAAAA
GAAACGGTAA
AAAGCAGATA
CAAACAAAAA
ACNGAAGAAT
TGGCACGCTA
CATATTTCTG
AATCAAGCAT
TCTGTGTATC
AACTACAACA
AAACCAACCA
CGTGGTGATG
GCCTTTGATA
AANGTCACAC
AACCAGTTCA
CCACAAGCCT
GTCAAAGCCG
CGAATTAAAA
GTTATI'AAAG
TTCTTCTATG
TGGTCGATA
TTTGCCAATT
TCGAAACTAT
TITI'NGATG
ATTGGTGTAG
AATGAGAAGA
CCAGAAAAAA
?TAGTGGTAG
GTCTTAI'AA
AAACGTAGAA
AAGACTNAAA AGCTGTTGCT CTTATGANCT CAATTAGAT GNTCNGTI'GT TFTtArcAATG ATGTAGAAGG CGATTTTGAA CAAATACAGT GATTAACCAT AAAACGGTAC AGTIGGCAGT =TTATTATGA AGTGAAATCT GGGGCATGAA TGATCTCTTG TTACNAANTA TGACCTTAAA CCTACATTCT TTTAGAAAAC TATTGGCNGC NTTAAATGAA TGGAAGTCGA ACGGATNAAA AAGAGCTTGT NCGTTCTAAT AAGCCGTTCA TAACAAGAAA TTCTTI'CTTA TGAAATGACN CAGTCGATCT TGCGACAGGC CAATCAAAGA CTTACTTCGT CGATCTCNTG GGACGATGCC TTATTCTAGC GNATGGTGGG ATCTTTCTGG NGATGTTTAT CCAATACNGT TGTCAACCAT TNGGTGACAA ACAAAGTCAA AATTTACAAG TAGTGACATT GCGATAAACT ACACGTCAAA CTAATTI'GT TT1AGCAGAC TCACGATGAC CI'TGAACAA CGATGAATCT AAAAGA.AAAC AACGAATTGC GGCAGGAGAC TTAAAACNAA TACGGTAGTG CAGTGA ITGT ACCACCAACA AAAAGGCAAG TGTNGTGCCA CGGTAGCTGG TAGTTTAGCC AAGAAACAAA ATAA GCAGCNATrA
CAAAACACCC
GGGTATAAAT
AATACAGCTG
GTGCCAGGTA
GITTTCTCTAC
TCCGAACGTC
GACACGACCC
GTAGGGGANA
AAAGACA.ATA
GGAACCA.ATA
ACAGGTGACG
ACNGTGGTGA
GGGGAAGANA
TGGGACTTAA
G=TTTCT
GTCAAAGATT
AAAGGCACGO
CAAGAA ITGC
AATTCAGCGG
ATTCCAAAC
AATGGNGCCA
CCTGCAGAAT
CATGACAAAT
GGAACCAAAG
GGGGTAGTGA
AA.AAACGMr
GTTTACAACA
ACN'CATACGC
CCAAAAACAC
GAATTGCCGC
GCAATGCTTG
ATTCAGGAAN
TCACAGCAAT
ATTAC'ITGT
TCAGICTGAC
GTAATCCTTC
ATGATAAAGA
CAGCNAACTA
ATGATCGTIT
AAACG'ITAAA
AAGACTTGAC
AAGTAGGCAA
TAGAAAACAC
CGCATACNCC
TTAAIWCATGG
AAGGGTACGA
TCGATGMAT
CTAAAGGGGN
TGACNATNTC
GTGTAACNCT
AACAAAATAC
TCAANCCTAA
CAATCAAATT
ACGCTGGNGT
TTAGTGGCCA
TGA-ATAAAGG
AAATCACGGC
CACACTCATO
CAATCGAAGA
CAGAAAAACC
CGCA.AGCACC
AAACAGGCGA
G;C IrArCAGG
AGCTAAAGAC
GATGAAAACC
CTTGCCGTTT
AAANGATGGN
CAAAGATGTA
TATTCCGTTA
TGGCGGAATN
CACAGGNAA
AGCAGGAACA
GTTTACNATG
ACAAGCTTGG
CCAAACAGAA
TGATGATCCA
AAAACTNGCT
TAAAGACTTT
CGATGAAACG
AGACATTACG
TGCCAAAGAC
CCCTACAAAA
ATTGGNCAA
AAAAGACGTG
AGGGGAGAAN
TGTGGAAGAA
ATCGTCTGTG
GGACGACATT
CAGTCAAGCC
GAAAGCCTTC
ATCTTTCAAC
ACAAACNCCA
AGTAGAGCCA
AAAACAAAAT
CTTAGGCTT
EF064-4 (SEQ ID 110:244) VTIDLSK VKVYQADASL
NXKDXYKAVAA
VKNVEGDFEN
TKIYYEVKS S
ISAYILLENK
YNKELVRSNT
FDTVD)LATCV
QAFILAXGGQ
AINSGXAKDV
TAVQLITXDGE
ERPANYCCXT
DNKDLTFTMN
VVTHTPDDPK
SFFDDYDETX
ELRVTLPTKV
TASYXLNLDQ
TVTN'IVINHV
EEWGMNDVLD
QALLAALNEG
PTKAVHNKKG
VTP IKDLLRV
KADVSGDVYN
NTVTAMMYKTN
PGSNPSKDVK
TI'NDRFTGKW
SNKVGKQAWS
EXIXHGKVAR
KDSICCXD ITN
SAEQNTFGQR
AD)GSVVLAMG
ADKNGTVGSV
HA ITXYDLK V
V'YLEVERXKT
GDVLSYEMTW
QFTI SWDDA(
IKTNTIVVNHI
YKYLLVLPFV
SLHDKDIPLQ
GXKTLKAGTD
GDVENTQTEN
DLJKGYDKOFA
GTVTXSAKDP
PKVXPKKDVV
146 TABLE 1. Nucleotide and Amino Acid Seqeuences of Efaecalis Genes.
IKVGDXQSQN GATIKLGEXF FYEFTSSDIP AEYAGVV-EEW ANSNFVLADG TKVNKGDDIS KL.FTMTFEQG VVKITASQAF GVERIAACDV YNTIEESFNN EKIKTNTVVT HTPEKPQTPP
VVEKASV
SISDKLDVKH
XDAMNLKENK
EKTV IVPPTP
DKFSGQWSVF
NVAHSWKAF I
KTPQAPVEPL
EF065-1 (SEQ ID NO:245)
S
S
TAGCGAAAGA
ATGGTGA'TTO
GAAGCGGTTA
AGTCTGGCTG
GAAACGGAAC
TTTCCACATG
GTGATGCTGG
ATTAATTCAC
AGCCAATTTG
GGATTGAAAC
ACGTATTTCC
AAGACCAATA
TCAGTGGAAT
GAAATTACTA
AGITCAGTGA
GAGTTGCAAC
TTTACAACCC
GCTI'CATTAA
GGAAATGATG
TACGTAGGAA
ACCCTTGTAA
ACGATTCCTA
ACAAAAGATA
TGGCATGTCA
GATGACATTA
GATGTTACAG
AAACAAGCAG
AAAATTAAAA
CCCAACCAAG
ACCGTAACAC
TTAGATITAA
GAAACAAGCA
ACTGATGTAA
CAACAAAAkTA
AGTGGTCATA
GAATTAGCAC
AACGAAGC
GAAGATCCAA
GACCAAGAAT
CAAGCCAGCA
GANGAAAATG
ACTT'ITACTA
ATGACTATTA
GAACAAGGCG
CATTCCAACA
AAATAGGGAG
TCGCTGGAAC
AAGCAGGAGA
ATI'GTAAACG
CAGTCCAT'N'
TAAGACAAGC
CTTCATATCG
CTGATI'ATGA
TCTCGGTTT
TCGCTTTAGA
TATTAGTGAC
CCAATGATTC
ATAGTAATGA
ACCAAGGCTA
ATTCATACTT
AAGCGGTCTAG
AATTAAAACA
CGATTGCCAA
TGCCTGTTCA
ACATCACGAT
GTAGTGGGAC
AAAATGCAA
TCGAAAATCA
AAkACAGCCTT
ATAAAGTGCT
CTAACGGCAC
ACAGCTATGA
CTGACGCAAC
CCGAC ITAAA
CACCGCCAGT
CCAACCGCGA
CTI'GGACCCA
AAGTCACAGA
ACAAACTAAC
CGTACACAAT
CTATATTGA
ACGTGTTGCA
CGAITACAAA
TTAAATGGAA
TGGTGGATGA
GCAAAGATGT
TGAACAAAAA
CCACTAAAAT
GGATTCCCAA
AGCCAACCGT
GATr-AAAATG
AGTTGTGGGA
TACAGAAGGA
GATATTGGAA
AGTAGTTCTI'
GATTGATGAA
CGGCGGAAAA
TATGAATGTG
TGGAGACGTT
TACGTACAAT
AGATGGGGTC
AATCAATGAA
CTACCAAGGT
TGAAATGATT
TGATAAATAT
CACACCAGAA
AATTGTCAAA
TCAATTTGAT
AATTAACGGA
TCACTACGAA
AATGALATCAA
TGCGCATGCG
AGAACACTTA
TGGCAACGAA
AGATATCATT
AGTAACACAA
CTATI'TAAGT
GGACGAAGAA
CTTTGGCAAT
TGATCCAAAT
TCAAGAATT
AGCCAGCATC
TGAAAATGGT
'FITrGAA.AT
GACCATTACT
ACAAGGTGGC
TTCCAACAAA
AGATATCGAA
CGTCAAAACA
CATTAATAAA
TACAGATAAT
AGATGACAGC
TAAAACTGAC
CCAAGCCGAC
AACACCGCCT
TTTAAGAAAG
A.ATT-CAGTC
ATGACCAATA
GGACAAG CTA
GAAGATGCTA
GTGGTTCAAG
CAATTTATGT
CGCGTCAATA
CGGACGTATG
CAAACACACG
GCTAATACAC
TATCCAGATC
GCAGCAGCAG
AATGCGTATr
AAAACAGAAC
GAT=TATTA
GATCGTCTGG
ATTCAATCTG
CAAACCATT
GTCAAAGAAA
GGAACAATIG
TGTGACGTCA
GATTTAACCA
ACCAGTACTT
GATGTGAAAG
GAAAATAACA
GGTCATACGTr TTAGCGCCTr
GAAGGTGACG
ATTGCTAAAG
AAATGOAACG
GTAGATGACA
AAAGATGTrA
AACAANCAAG
ACTAAA.ATCA
ATTCCCAACC
CCAACCGTAA
GG.CCAA=AC
GCTTrCGGTA
GTGTTAGACA
GGCATAr-TAA
TACTCTTACT
GCAACGGATG
TTAAACTTTG
GCACCA.ACC
CAAC3AAATT
CCACATTG.GC
CGGTGAAAGT
TTICCCAGT
GTGGTAGTTT
GCTTATCTGA
I'TrCCTGATGG
CGCAA'ITGAC
GTGGTACCCC
GAGATIAAC
GTTTAGATGG
CAAGA--ATCC
AAGTTTTAGC
GGGAAAGTGT
TGGGTCCTTI'
CAAGCCAATC
CGCAATCGAC
CGACCCCTAC
CAGCAACTAG
ATACAGCGAT
CTAAGGAATT
CGCCAGA.AGA
ATCGTGAAGA
GGACCCAAGC
TCACCGACGA
AAGTAACTTT
ATACAATGAC
ACATTGAACA
TGTTACATTC
ACGTAGAAGG
TCAAAACAC
TTAATAAAGT
CAGCTAACGG
CNGACAGCTA
AAGCTAGCGC
AAGCCGACTT
CACCACCT'GC
ATrTAGA~T
ACGAAACAAG
TCACAGACGT
CACAAGAAAA
TAGCTGGTCA
AAGAATTAGC
GCAACGALAGG
CAGA-AGACCC
ATTATCGACA
TTTAGCTGAA
GAAAGACGAC
TCAAGCGGGT
TTCAGATAAT
TCAACACCC
AAAGACAAAA
TTATGATAAA
AACCCCCCCA
GAATCGAAAA
TTACTTGCAT
TCTTCAAGTC
GTTAAACCAA
TGAATCTTTA
TGTAAAACAA
TAITGATGAT
ACCAGCAACA
GGACGATGCT
TACAGAAGGT
TGATGCAGCA
TCCAGAAGCG
TCCAACGATT
TAGTTTCGAT
CAGCATGGTG
AAATGGTAAA
TGAAATGAAC
TATCACCACT
AGGCGGGATT
CAACAAACCA
ACAAGAACAT
TTCGGTAAC
GTI'AGACATC
CAAAGTAACA
TGACTAITA
AACGGACGAA
GAACTTTGGC
ACCAACGCCA
AACCAACCGT
CACATGGACC
GAAAG'ITNCT
TAACAAAGTA
TACATACACA
GCcTTATATT
TGACGTGTTG
AAAAAAACCT
S
S
147 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
GAACCTAA.AC
ACGAACCCAG
ACAACAGTAA
TTTGGCATAA
AACCCCTAAA ACCGAAAAAA CCGTTGACGC CTACAA.ATCA TCAAGCACCA TCAATTTGG AAAATCAGCA AGTAAAGGAA TTCA=rACC AATGACTAAT ATCCACTFIA CATGATCGCA GGTTTAATTG TCCTI'ATAGT GGCTATTAGC CA.AAAAATAA AAAAAGAAAA AATTAG EF065-2 (SEQ ID NO:246) MF KKATKLLSTM VIVAGTVVGN FSPTLALAEE AVKAGDTEGM TN'IVKVKDDS
LADCKRILEG
MLASYRGGKQ
LKLALDTYNQ
VEYSUDYQGA
LQQGSSTPED
NDVPVQINGQ
IPKNDNAHAC
DINKVLDI ID
IKTDATDEEL
DLTNRDQEFK
ENNKVTFEMN
EGDVLHSDTKP
ASMVDDINKV
TITTKIKTDA
PKQPLKPKKP
GITKNKKRKN
QATFPVQAGE
FMFPDGKTK I
THGDLTNRKT
AAEVLALNQE
FITSQS IDDF
TISATSTEGY
DVTPEDPTIT
VYKVTDENGKD
APYIEQGGIP
WNVKTAFGNE
XQADSYDYLS
TVTPPAPTPE
LDITDVKVXX
TDEELAPYI E
LTPTNHQAPT
TEPVDLVVVE
NSADYDMNVR
YFLLVTDGVA
ITNQGYENiIN TTQLKQ IVKD
VGNITIHYEV
KD IENQEHLD
VTANGTVTQE
NQADLNFGNE
TSTWTQASMV
G14TYTMTITT
DPTITKDIEG
ENGKDVTDNG
QGGI FNQADL
NPVNFGKSAS
DASCSFSDNF
VNTQLTYDXS
NTRLDGYLEK
kYWESVESLS
RLAQSTPATA
KENTAIIDAAT
LT.NREDSFDW
NN~KVTFE-M
GDVLHSNXPT
DDINKVLDIT
K IKASATDEE
QEHLOLTNRD
IVTQENNRVTr
NFGNEGDVLH
KG IHLPMTNT
PHVRQAIDEV
QFVSGFCDVR
TNTNDSINEY
SVNSYFDKYK
SLTIANQFDI
LVSSGTMNQG
HVKTAFGNET
QADSYDYLSG
VTPPPVDPNI
DVKVTDENGK
LAPYIEQGG I
QEFKWNVKTA
FTtNKKDDSY
SNKPTVTPPA
TVNP LYM lAG
VQGLSDQDRV
TYGGTPTAPG
PDPRHPLQVS
TEVGPFVKQE
QSATATDDAG
TIAKEFPEAT
STTQASMVD
HTYTMTITTK
AKDVEGQEHL
DVTANGKVTQ
PNQADLNFGN
FGNETSTWTQ
SYLAGHTYTM
PTPEDPKKE
L IVLIVAI SF
S..
S.
S
S.
EF065-3 (SEQ ID NO:247) GGITA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAG-T GAAAGACGAC
AGTCTGGCTG
GAAACGGAAC
TTTCCACATG
GTGATGCTGG
ATTAATTCAG
AGCCAATTTC
GGATTGAAAC
ACGTATTTCC
AAGACCAATA
TCAGTGGAAT
GAAATTACTA
AGTTCAGTG-A
GAGTT1GCAAC
TTTACAACCC
GCTTCATTAA
GGAAATGATG
TACGTAGGAA
ACCCI'GTAA
ACGATTCCTA
ACAAAAGATA
TGGCATGTCA
GATGACAFI'A
GATGTTACAG
AAACAAGCAG
AAAA71AAAA
ATTGTAAACG
CAGTCGATTT
TAAGACAAGC
CTTCATATCG
CTGATTATGA
TCTCTGGITT
TCGCTTTAGA
TATTAGTGAC
CCAATGAITC
ATAGTAATGA
ACCAAGGCTA
ATI'CATACTI
AAGGGTCTAG
AATTAAAACA
CGATTGCCAA
TGCCTGTTCA
ACATCACGAT
GTAGTGGGAC
AAAATGACAA
TCGAAAATCA
AAACAGCCT
ATAAAGTGCT
CTAACGGCAC
ACAGCTATGA
CTGACGCAAC
GATATTGGAA
AGTAGTTGTT
GATI'ATGAA
CGGCGGAAAA
TATGAATGTG
TGGAGACGTT
TACGTACAAT
AGATGGGGTC
AATCAATGAA
CTACCAAGGT
TGAAATCATT
WATAAATAT
CACACCAGAA
AATTGTCAAA
TCAATTTGAT
AATTAACGGA
TCACTACGAA
AATGAATCAA
TGCGCATGCG
AGAACACTTA
TGGCAACGAA
AGATATCATT
AGTAACACAA
CTATTTAAGT
GGACGAAGAA
GGACAAGCTA CTTTCCCAGT TCAAGCGGGT GAAGATGCTA GTGGTAG=I TTCAGATAAT GTGGTTCAAG GCTTATCTGA TCAAGACCGC CAAT'TATGT TTCCTGATGG AAAGACAAAA CGCGTCAATA CGCAATI'GAC TTATGATAAA CGGACGTATG GTGGTACGCC AACCGCCCCA CAAACACACG GAGATI'TAAC GAATCGAAAA GCTAATACAC GTITAGATGG TTACITGCAT TATCCAGATC CAAGACATCC TCTTCAAGTC GCAGCAGCAG A.AGTTI'AGC GTTAAACCAA AATGCGTATT GGGAAAGTGT TGAATCTITTA AAAACAGAAG TGGGTC TGTAAAACAA GAI=IATTA CAAGCCAATC TATTGATGAT GATCGTCTGG CGCAATCGAC ACCAGCAACA A'FrCAATCTG CGACCGCTAC GGACGATGCT CAAACCAT'T CAGCAACTAG TACAGAAGGT G-TCAAAGAAA ATACAGCGAT TGATG-CAGCA GGAACAATTG CTAAGGAT TCCAGAAGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT GAT~rAACCA ATCGTCAAGA TAGTTTCGAT ACCAGTACTI GGACCCAkAGC CAGCATGGTG GATGTGAAAG TCACCGACGA AAATGGTAAA GAAAATAACA AAGTAACTTT TGAAATGAAC GGTCATACGT ATACAATGAC TATCACCACT TTAGCGCC'IT ACATGAACA AGGCGGGAT!' TABLE I1. Nucleotide and Amino Acid Seqeuences of E. faccalis Genes.
CCCAACCAAG
ACCGTAACAC
TTAGATTTAA
GAAkACAAGCA
ACTGATGTAA
CAAGAAAATA
ACTGGTCATA
GAATTAGCAC
AACGAAGGTG
GAAGATCCAA
GACCAAGAAT
CAAGCCAGCA
CANGAAAATG
AC I=TACTA
ATGACTATTA
GAACAAGGCG
CATTCCAACA
GAACCTAAAC
ACGAACCCAG
C--GACTTAAA
CACCGCCAGT
CCAACCGCGA
CTTGCACCCA
AAGTCACAGA
ACAAAGTAAC
CGTACACAAT
CTTATATTGA
ACGTTGCA
CGATTACAAA
TIAAATGGAA
TGGT'GGATGA
GCAAAGATGT
TGAACAAAAA
CCACTAAAAT
GOATrCCCAA
AGCCAACCGT
AACCGCTAAA
TCAATTTGG
C=~GGCAAT
TGATCCAAAT
TCAAGAATTT
AGCCACCATG
TGAAAATGGT
T'ITTGAAATG
CACCATTACT
ACAAGGTGGC
TTCCAACAAA
AGATATCGAA
CGTCAAAACA
CA'ITAATAAA
TACAGATAAT
AGATGACAGC
TAAAACTGAC
CCAAGCCGAC
AACACCGCCT
ACCGAAAAAA
AAAATCAGCA
GAAGGTGACG
AT*TGCTA-AAG
AAkATGGAACO
GTAGATGACA
AAAGAT'GTTA
AACAANCAAG
ACTAAALATCA
ATTCCCA.ACC
CCAACCGTAA
GGCCAAGAAC
GCTTTCGGTA
GT(.-TTAGACA
GGCATAGTAA
TACTCTTACT
GCAACGGATG
TTAAACTTI'TC
GCACCA.ACGC
CCGTTGACGC
AGTAAAGGAA
Tr.TTACATTC
ACGTAGAAGG
TCAAAACAGC
'rrAATAAAGT
CAGCTAACGG
CNGACAGCTA
AAGCTAGCGC
ALAGCCGACTT
CACCACCTGC
ATTTAGATTT
ACGAAACAAG
TCACAGACGT
CACAAGAAAA
TAGCTGTCA
AAGAATTAC
GCAACGAAGG
CAGAAGACCC
CTACAAATCA
TT
CAACAAkACCA
ACAAGAACAT
TTTCGGTAAC
GTTAGACATC
CAAAGTAACA
TGACTATTTA
AACGGACGAA
GAACTTTGGC
ACCAACGCCA
AAICCAACCCT
CACATGGACC
GAAAGTTNCT
TAACAAAGTA
TACATACACA
GCCTTATATT
TGACGTGTTG
AAAAAAACCT
TCAAGCACCA
EF065-4 (SEQ ID NO:248) a a a
AVKAGIJTEGM
£LADCKRILE G
MLASYRGGKQ
LKLALDTYNQ
VEYSNDYQGA
LQQGS STED
NDVPVQINGQ
IPKNONAHAC
D INXVLIDI I D
IKTDATDEEL
DLTNRDQEFK
ENNKVTFEM
EGDVLHSNKP
ASMVDDINKV
TITTKIKTDA
PKQPLKPKKP
TNTVKVKDDS
QATFPVQAGE
FMFPDGKTKI
THGDLTNRKT
AAEVLALNQE
FITSQS IDDF TI SATSTEGY
DVTPEDPTIT
VKVTDENGKD
APYIEQGGIP
WNVKTAFGNE
XQADSYDYLS
TVTPPAPTPE
LDITDVKVXX
TDEELAPYIE
LTPTNHQAPT
TEPVDLVVVE
NSADYDMNVR
YFLLVTDGVA
ITNQGYEMIN
'ITQLKQIVKD
VGNITIHYEV
KDIENQEHLD
VTANGTVTQE
NQADLJNFGNE
TSTWTQASMV
GHTYTMTITT
DPTITKDIEG
ENGKDVTDNG
QGG IPNQADL
NPVNF'GKSAS
DASGSFSIDNF
VNTQLTYDKS
NTRLDGYLHK
AYWESVESLS
RLAQSTPATA
KENTAIDAAT
LTNREDSFDW
NNKVTFEIM
GDVLHSN'KPT
DDINKVLDIT
K IKASATDEE
QEHLDLTNRD
IVTQEiNNKVT
NFGNEGDVLH
KGIH
PHVRQAIDEV
QFVSGFCGDVR
TNTNDSINEY
SVNSYFOKYK
SLTIANQFDI
LVSSGTMNQG
HVKTAFONET
QADSYDYLSG
VTPPPVrEPNI
DVKVTDENGK
LAPY IEQGGI
QEFKWNVKTA
FTMNKKDDSY
SNKPTVTPPA
VQGLSDQDRV
TYGGTPTAPG
POPRHPLQVS
TEVGPFVKQE
QSATATDDAG
TIAKEFP EAT
STWTQASMVD
HTYTMTITTK
AKDVEGQEHL
DVTANGKVTQ
PNQADLNFGN
FGNETSTWTQ
SYLAGHTYTM
PTPEDPKKPE
EF066-1 (SEQ ID NO:249)
TAGCGAAACA
ATGGTGATTG
GAAGCGGTTA
AGTCTGCCTG
GAAACGGAAC
TTCCACATG
GTGATGCTGO
ATTAAT-TCAG
AGCCAATTTG
GCATI'GAAAC
ACGTATTTcC
AAGACCAATA
TCAGTGGAAT
AAATAGGGAG
'TCGCTGGAAC
AAGCAGGAGA
ATTGTAAACC
CAGTCGATTT
TAAGACAAGC
CTTCATATCG
CTGATTATGA
TCTCTGG I=
TCGCTTTAGA
TATTAGTGAC
CCAATGATI'C
ATAGTAATGA
GATTAAAATG
AC ITGTGGGA
TACAGAAGGA
GATATTGGAA
AGTAGTTGTT
GATI'GATGAA
CGGCGGAAAA
TATGAATGTG
TGGAGACGTT
TACGTACAAT
AGATGGGTC
AATCAATCAA
CTACCAAGGT
TTI'AAGA.AAG
AATI-TCACTC
ATGACCAATA
GGACAAGCTA
GAAGATGCTA
GTGG'ITCAAG
CAATTTATGT
CGCGTCA.ATA
CGGACGTATG
CAAACACACG
GCTAATACAC
TATCCAGATC
GCAGCAGCAG
CAACGAAATT
CCACATICC
CGGTGAAAGT
C'I=CCCAGT
GTGCTATI'
GCTTATICTGA
TTCCTGATGG
CGCAATTGAC
GTGGTACCC
GAGATTTAAC
G=TAATGG
CAAGACATCC
AAGTTITAGC
ATTATCGACA
TTTAGCTGAA
GAAAGACG-AC
TCAAGCGGGT
TTCAGATAAT
TCAAGACCGC
AAAGACAAAA
TTATGATAAA
AACCGCCCCA
GAATCGAAAA
TTACTTGCAT
TCTTCAAGTC
GTTAAACCAA
149 TABLE 1. Nucleotide and Amnino Acid Seqeuences of E.faecalis Genes.
GAAAITACTA
AGTTCAGrA
GAGTTGCAAC
TIrAC.AACCC
GCTTCATTAA
GGAAATGATG
TACGTAGGAA
ACCCTTGTAA
ACGATTCCTA
ACAAAAGATA
TGGCATGTCA
GATGACA ITA CATGT'rACAG
AAACAAGCAG
AAAATTAAAA
CCCAACCAAG
ACC.GTAACAC
TTAGATTTAA
GAAACAAGCA
ACTGATGTAA
CAAGAAAATA
AGTGGTCATA
GAATTAGCA.C
AACGAAGGTG
GAAGATCCAA
CACCAAGAAT
CAAGCCACCA
GANGAAAATG
AC=TTACTA
ATGACTATTA
GAACAAGGCG
CATTCCAACA
GAACCTAAAC
ACGAACCCAG
ACAACAGTAA
TT~rCCCATAA
ACCAAGGCTA
ATTCATACT
AAGGGTCTAG
AATTAAAACA
CGATTGCCAA
TCCCTCTTCA
ACATCACCAT
GTAGTGCGAC
AAAATGACAA
TCGAAAATCA
AAACAGCCTT
ATAAAGTGCT
CTAACGGCAC
ACAGCTATGA
CTGACGCAAC
CCGACTTAAA
CACCGCCAGT
CCAACCGCGA
CTTGGACCCA
AAGTCACAGA
ACAAkAGTAAC
CGTACACAAT
CTTATATTGA
ACGTGTTGCA
CGATTACAAA
TTAAATGGAA
TGGTGGATGA
GCAAAGATGT
TGAACAAAAA
CCACTAAAAT
GGATTCCCAA
AGCCAABCCGT
AACCGCTAAA
TCAArTCG
ATCCACTTTA
CAAAAAATAA
TCAAATGCATT
TGATAAATAT
CACACCAGAA
AATTCTCAAA
TCAATTTGAT
AATTAACGGA
TCACTACGAA
AATGAATCAA
TGCGCATGCG
AGAACACTTA
TGGCAACCAA
AGATATCATI'
AGTAACACAA
CTATTTAACT
GGACGAkAGAA
CTITTGCAAT
TGATCCAAAT
TCAAGAkATT
AGCCAGCATG
TGAAAATCGT
TTTTGAAATC
GACCATTACT
ACAAGGTGGC
TTCCAACAAA
AGATATCGAA
CGTCAAAACA
CATTrAATAAA
TACAGATAAT
AGATGACAGC
TAAAACTCAC
CCAAGCCGAC
AACACCGCCT
ACCCAAAAAA
AAAATCAGCA
CATCATCGCA
AAAAAGAAAA
AATGCGTATr GGGAAAGTG3T TGAATCTFTA AAAACAGAAG TGGGTCCTTT TGTAAAACAA GATTTTATIA CAAGCCAATC TATTGATGAT GATCGTCTCG CGCAATCGAC ACCAGCAACA ATTCAATCTG CGACCCCTAC CGACGATC-CT CAAACCATTT CAGCAACTAG TACAGAAGGT GTCAAAGAAA ATACAGCGAT TGATGCAGCA GGAACAATTC CTAAGCAATT TCCAGAAGCC TGTGACGTGA CCCCAGAAGA TCCAACGATT GATTTAACCA ATCGTGAAGA TAGTTTCGAT ACCAGTACTT GGACCCAAGC CAGCATCGTG GA'±GTGAAAG TC:ACCC;ACGA AAATGGTAA.A GAAkAATAACA AAGTAACTTT TGAAATGAAC GGTCATACCT ATACAATGAC TATCACCACT TTAGCGCCTT ACATTGAACA AGGCGGGATT GAAGGTGACG TGTTACATI'C CAACAAACCA ATTGCTAAAG ACGTAGAAGG ACAAGAACAT AAATGGAACrG TCAAAACAGC TTTCGGTAAC GTAGATGACA TTAATAAAGT GTTAGACATC AAAGATTA CAGCTAACGG CAAAGTAACA AACAANCAAG CNGACAGCTA TCACTATTA ACTAAAATCA AAGCTAGCGC kACGGACGAA ATTCCCAACC AAGCCGACTT GAACTTTGGC CCAACCGTAA CACCACCTGC ACCAACGCCA GGCCAAGAAC ATTTAGATTT AACCAACCGT GCT CGGTA ACGAAACAAG CACATGGACC GTCTTAGACA TCACAGACGT GAAAGTTNCT GGCATAGTAA CACAAGAAAA TAACAAAGTA TACTCTTACT TAGCTGG'TCA TACATACACA GCAACGGATr. AAGA.ATTAGC CCCTTATATT TTAAACTTTG GCAA-CCAAGG TGACGTGTTG GCACCAACGC CAGAAGACCC AAAAAAACCT CCGTTGACGC CTACAAATCA TCAAGCACCA AGTAAAGGAA TrCA=LACC AATGACTAAT GG=TAA=F TCCTTATAGT GGCTATTAGC
AATTAG
o 00.
EF066-2 (SEQ ID NO:250) MF KKATKLLSTM VIVACTVVGN FSPTLALAEE AVKAGDTEGM TNIVKVKDDS 0 0000 00 0 ~0 00
LADCKRIELEG
MLASYRGGKQ
LKLALDTYNQ
VEYSNDYQGA
LQQGSSTPED
NDVPVQ INGQ
IPKNDNAHAC
DINXVLDI ID
IKTDATDEEL
DLTNRDQEFK
ENNKVTFEM
EGDVLHSNKP
ASMVDD INK V TITTK IKTDA
QATFPVQAGE
FMFPDGKTKI
THGDLTNRKT
AAEVLAIJNQE
FITSQS IDDF
TISATSTEGY
DVTPEDPTIT
VKVTDENGKD
APYIEQGGIP
WNVKTAFGNE
XQADSYDYLS
TVTPPAPTPE
LDITDVKVXX
TDEELAPYIE
TEPVDLVVVE
NSADYDMNVR
YFLLVTDCVA
ITNQGYEMIN
TI'QLKQIVYKD
VGNITIHYEV
KDIENQEHLD
VTANGTVTQE
NQADLNFGNE
TSTWTQASMV
GHTYTMT ITT
DPTITKDIEG
ENGKDVTDNG
QGGI PNQADL
DASGSFSDMF
VNTQLTYDKS
NTRLDGYLHK
AYWESVESLS
RLAQSTPATA
KENTAI DAAT
LTURE-DSFDW
NKVTFEMNK
GDVLI4SNKPT
DDINKVLDIT
KIKASATDEE
QEHLDIJTNRD
IVTQENNKVT
NFGNEGDVLH
PHVRQAIDEV
QFVSGFGDVR
TNTNDSINEY
SVNSYFDKYK
SLTIANQFDI
LVSSGTMQG
HVKTAFGNET
QADSYDYLSG
VTPP PVDPNI
DVKVT-DENGK
LAPY IEQGG I
QEFKW.NVKTA
FTMNKKODSY
SNKP'IVTPPA
VQGLSDQDRV
TYGGTPTAPG
PDPRHPLQVS
TEVGPFVKQE
QSATATDDAG
TIAKEFPEAT
STWTQASMVD
HTYTMTITI
AKDVEGQEH-L
DVTANCKVTQ
PNQADLNFGN
FGNETSITQ
SYLACHTYTM
PTPEDpKKPE 150 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
PKQPLKPKKP LTPTNIHQAPT NPVNFGKSAS KGIHLPMTNT TVNPLYMIAG LIVL7VAISF C ITKNKKRKN EF066-3 (SEQ ID NO:251) GGTTA AAGCAGGAGA TACAGA-AGGA ATGACCAATA CGGTGAAAGT GAAAGACGAC
AGTCTGGCTG
GAAACGGAAC
TTTCCACATG
CTGATGCTGG
ATTAATTCAG
AGCCAArI'TG
GGATTGAAAC
ACGTATTTCC
AAGACCAATA
TCAGTGGAAT
GAAATTACTA
AGTTCAGTGA
GAGTTGCAAC
TTTACAACCC
GCTI'CATTAA
GGAAATGATG
TACGTAGGAA
ACCCTICTAA
ACGATTCCTA
ACAAA.AGATA
TGGCATGTCA
GATGACATTA
ATTGTAAACG
CACTCGATTT
TAAGACAAGC
CTTCATATCG
CTCATTATGA
TCTCTGGTTT
TCGCTTTAGA
TATTAGTGAC
CCAATGATTC
ATAGTAATGA
ACCAAGGCTA
ATI'CATACTT
AAGGGTCTAG
AATTAAAACA
CGATTGCCAA
TGCCTGTTCA
ACATCACGAT
GTAGTCGGAC
AAAATGACAA
TCGAAAATCA
AAACAGCCTT
ATAAAGTCCT
GATA'IGAA
AGTAGTTCTT
GATTGATGAA
CGGCGGAAAA
TATCAATCTG
TGGAGACGTT
TACGTACAAT
AGATCGGGTC
AATCAATGAA
CTACCAAGGT
TGAAATGA'T
TGATAAATAT
CACACCAGAA
AATTGTCAAA
TCAATTTGAT
AATTAACGGA
TCACTACGAA
AATGAATCAA
TCCATGCG
AGAACACTI'A
TGGCAACGAA
AGATATCATT
GGACAAGCTA
GAACATGCTA
GTGGTTCAAG
CAATTTATLGT
CGCGTCA-ATA
CGGACGTATG
CAAACACACG
GCTAATACAC
TATCCAGATC-
GCAGCAGCAG
AATGCGTAIT
AAAACAGAAG
GATTITATTA
GATCG;TCTGbG
ATTCAATCTC
CAAACCATT
GTCAALAGAAA
GGAACAATTG
TGTGACGTGA
GATTTAACCA
ACCAGTAC'T
GATGTGAAAG
CTTTCCCAGT
GTCGTAG'FIT
GCTTATCTGA
'PTCCTGATGG
CGCAATTGAC
GTGGTACGCC
GAGATTAA
GTI'TAGATGG
CAAGACATCC
AAGT=IAGC
GGGAAAGTGT
TCCGTCC=T
CAAGCCAATC
CGCA.ATCGAC
CGACCGCTAC
CAGCAACTAG
ATACAGCGAT
CTAAGGAATT
CGCCAGAAGA
ATCGTGAAGA
GGACCCAAGC
TCA
TCAAGCGGGT
TTCACATAAT
TCAAGACCGC
AA.AGACAAAA
TTATGATAAA
AACCGCCCCA
GAATCGAAAA
TTACTTGCAT
TCTTCAAGTC
GTTAAACCAA
TGAATCTTTA
TGTAAAACAA
TATTGATGAT
ACCAGCAACA
GGACGATGCT
TACAGAAGGT
TGATGCAGCA
TCCAGA-AGCG
TCCAACGATT
TAGT'=CGAT
CAGCATGGTG
S.
SS
S
S
S
55 S S S 55 55 5
S
EF066-4 (SEQ ID.NO:.252) AVKAGD'rEGM TNTVKVKDDS
LADCKRILEG
MLASYRGGKQ
LKLALDTYNQ
VEYSNDYQGA
LQQGSSTPED
NDVPVQ INGQ
IPKNDNAHAC
QATFPVQAGE
FMFPDGKTK I
THGDLTNRKT
AAEVLALNQE
FITSQSIDDF
TI SATSTEGY
DVTPEDPTIT
TEPVDLVVVE
NSADYDMNVR
YFLLVTDGVA
ITNQGYEMIN
TTQLKQ IVKD
VGNITIHYEV
KDIENQEHLD
DASGSFSDNF
VNTQLTYDKS
NTRLDGYLHK
AYWESVESLS
RLAQSTPATA
KENTAIDAAT
LTNREDSFDW
PHVRQAIDEV
QFVSGFGDVR
TNTNDS INEY
SVNSYFDKYK
SLTIANQFDI
LVSSGTMNQG
1-VXTAFGNET
VQGLSDQDRV
TYGGTPTAPG
PDPRHPLQVS
TEVGPFVKQE
QSATATDDAG
TIAKEFPEAT
STWTQASMVD
DINKVLDIID VKVT EF067-1 (SEQ ID NO:253)
S
S. S
S.
*5
TAGCGAAAGA
ATGGTGATTG
GAAGCGGTTA
AGTCTGGCTrC
GAAACGGAAC
T TCCACATC.
GTGATGCTGG
ATTAATTCAG
AGCCAATTTG
GGATTGAAAC
ACGTATI'TCC
AAATAGGGAC
TCGCTGGAAC
AAGCAGGAGA
ATI'GTAAACG
CAGTCGATTT
TAAGACAAGC
CTTCATATCG
CTGAITATGA
T CTCTGGT
TCGCTTTAGA
TATTAGTGAC
GA'ITAAAATG
AGTI'GTGGGA
TACAGAAGGA
GATATTCGAA
AGTAGTTGTT
GATTGATGAA
CGGCGGAAAA
TATGAATIGTG
TGGAGACG T
TACGTACAAT
AGATGGGGTC
T ETAAGAAAG
AATTTCAGTC
ATGACCAATA
GGACAAGCTA
GAAGATGCTA
GTGG TCAAG CAA IITATCT
CGCGTCA-ATA
CGGACGTATG
CAAACACACG
GCTAATACAC
CAACGAAAT
CCACAMIGC
CGGTGAAAGT
CTTTCCCAGT
GTGGTAGTTr
GCTTATCTGA
'IWCCTCATGG
CGCAA'ITAC
GTGGTACGCC
CGATTTAAC
GTTTAGATGC
ATTATCGACA
TTTAGcTGAA
GAAAGACGAC
TCAAGCGGGT
ITrcAGATAAT
TC-AAGACCGC
AAAGACAAAA
TTATGATAALA
AACCGC CC CA
GAATCGAAAA
ITACTTGCAT
151 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.f[aecalis Genes.
AAGACCAATA
TCAGTGGAAT
GAAA'ITACTA
AGTITCAGTGA
GAGTTGCAAC
TTTACAACCC
GCTTCATTAA
GGAAATGATG
TACGTAGGA-A
ACCCTTGTAA
ACGATTCCTA
ACAAAAGATA
TGGCATGTCA
GATGACATI'A
GATG TTACAG
AAACAAGCAG
AAAkITTAAAA
CCCAACCA.AG
ACCGTAACAC
TTAGATTAA
GC.AACAAGCA
ACTGATGTAA
CAAGAAAATA
AGTGGTCATA
GAATTAGCAC
AACGAAGGTG
GAAGATCCAA
GACCAAGAAT
CAAGCCAGCA
GANGAAAATG
ACTTTACTA
ATGACTATrA
GAACAAGGCG
CATTCCAACA
GAACCTAAAC
ACGAACCCAG
ACAACAGTAA
rI'TGGCATAA
CCAATGATTC
ATAGTAATGA
ACCAAGGCTA
ATrCATACT
AAGGGTCTAG
AATTAAAACA
CGATTGCCAA
TGCC-TGTTCA
ACATCACGAT
GTAGTGGGAC
AAAATGACAA
TCGAAAATCA
PAACAGCCTT
ATAAAGTGCT
CTAACCGCAC
ACAGCTATGA
CTGACGCAAC
CCGACTTAAA
CACCGCCAGT
CCAACCGCGA
CTI'GGACCCA
AAGTCACAGA
ACAAAGTA.AC
CGTACACAAT
CTTATATTGA
ACGTGrI'GCA
CGATTACAAA
TTAAATGGAA
TGGTGGATGA
GCAAAGATGT
TGAACAAAAA
CCACTAAA.AT
GGATTCCCAA
AGCCAACCCT
AACCGCTAAA
TCAATrTGG
ATCCACTTTA
CAAAAA.ATAA
AATCAATGAA
CTACCAAGGT
TGAAATGATT
TGATAAATAT
CACACCAGA-A
AATTGTCAAA
TCAATrAT
AATTAACGGA
TCACTACGAA
AATGAATCAA
TGCGCATIGCG
AGAACACTTA
TIGGCAACGAA
AGATATCATT
AGTAACACAA
CTATITAAGT
GGACGAAGAA
CTTTGGCAAT
TGATCCAAAT
TCAAGAATTT
AGCCAGCATG
TGAAAATGGT
TTTTGAAATG
GACCATTACT
ACAAGGTGGC
TTCCAACAAA
AGATATCGAA
CGTCAA.ACA
CATTAATAAA
TACAGATAAT
AGATGACAGC
TAAAACTGAC
CCAAGCCCAC
AACACCGCCT
ACCGAAAAAA
AAAATCAGCA
CATGATCGCA
AAAAAGAAAA
TATCCAGATC
GCAGCAGCAG
AATGCGTAI'
AAAACAGAAG
GATTTATI'A
GATCGTCTGG
ATTCAATCTG
CAAACCAFTr
GTCAAAGAAA
GGAACA.ATG
TGTGACGTGA
GATTTAACCA
ACCAGTACI'
GATGTG-AAAG
GAAAATAACA
GGTCATACGT
TTAGCGCC!IT
GAAGGTGACG
ATTGCTAAAG
AAATGGAACG
GTAGATCACA
AAAGATGTrA
AACAANCAAG
ACTAAAATCA
ATTCCCAACC
CCAACCGTAA
GGCCAAGAAC
GCTTTCGCTA
GTGTTAGACA
GGCATAGTAA
TACTCTTACT
GCAACGGATG
TT-AAACT'Fr
GCACCAACGC
CCGTTGACC
AGTAAAGGAA
GGTAATTG
AATTAG
CAAGACATCC
AAG'=''AGC
GGGAAAGTGT
TGGGTCCTTT
CAAGCCAATC
CCCAATCGAC
CGACCGCTAC
CAGCAACTAG
ATACAGCGAT
CTAAGGAATT'
CGCCAGAAGA
ATCGTGAAGA
GGACCCAAGC
TCACC&ACGA
AAGTAACTTT
ATACAATGAC
ACATTGAACA
TGTTACATTC
ACGTAGAAGG
TCAAAACAGC
TTAATAA.AGT
CAGCTAACGG
CNGACAGCTA
ALAGCTAGCGC
ACCGrACTT
CACCACCTGC
ATTTAGATT
ACGAAACAAG
TCACAGACGT
CACAAGAAAA
TAGCTGGTCA
AAGAAT-LAGC
GCAACGAAGG
CAGAAGACCC
CTACAAATCA
TTCA~rACC
TCCTTATAGT
TCTTCAAGTC
'TTAAACCAA
TCAATCTTTA
TGTAAAACAA
TATTGATGAT
ACCAGCAACA
GGACGATGCT
TACAGAAGGT
TGATGCAGCA
ICCAGAAGCG
TCCAACGATT
TAGTTTCGAT
CAGCATGGTG
AAATGGTAAA
TGAAATGAAC
TATCACCACT
AGGCGGGATT
CAACAAACCA
ACAAGAACAT
TTTCGGTAAC
GTTAGACATC
CAAAGTAACA
TGACTATTTA
AACGGACGAA
GAACTI'TGGC
ACCAACGCCA
AACCAACCGT
CACATGGACC
GAAAGTTNCT
TAACAAAGTA
TACATACACA
GCCTTATATT
TGACGTGTTG
AAAAAAACCT
TCAAGCACCA
AATGACTAA-t
GGCTATTAGC
.0* S S
S.
S.
S S
S.
S.
S S
S
S.
S *5*S
S
S
*5 5O
S.
EF067-2 (SEQ ID NO:254) HF KKATKLLSTM VIVAGTVVGN FSPTLALAEE AVKACDTEGM TNTVKVKDDS LADCKRI LEG
MLASYRGGKQ
LKLALDTYNQ
VEYSNDYQGA
LQQGSSTPED
NDVPVQINGQ
IPKNDNAHAC
DINKVLDIID
IKTDATDEEL
DLTNRDQEFK
E~NKVTFEM
EGDVLHSNKP
QATFPVQAGE
FMFPDGKTKI
THGDLTNRKT
AAEVLALNQE
FITSQSIDDF
TISATSTEGY
DVTPEDPTIT
VKVTDENGKD
APYIEQGGIP
WNVKTA.FGNE
XQADSYDYLS
TVTPPAPTPE
TEPVDLVVVE
NSADYDMNVR
YFLLVTDGVA
ITNQGYEMIN
TTQLKQIVKD
VGNITIHYEV
KDIENQEHLD
VTANGTVTQE
NQADLNFGNE
TSTWTQASMV
GHTYTMTITT
DPTITKDIEG
DASGSFSDNF
VNTQLTYDKS
NTRLDGYLHK
AYWESVESLS
RLAQSTPATA
KENTAIDIAAT
LTNREDSFOW
NNKVTFEI4NK
GDVLHSNKPT
DDINKVLDIT
KIKASATDEE
QEHLDLTI'RD
PHVRQAIDEV
QFVSGFGDVR
TNTNDS INEY
SVNSYFDKYK
SLTIANQPDI
LVSSGTMNQG
HVKTAFGNET
QADSYDYLSG
VTPPPVDPNI
DVKVTDENGK
LAPYIEQGGI
QEFKVINVKTA
VQGLSDQDRV
TYGGTPTAPG
PDPRHPLQVS
TrEVGPFVK<QE
QSATATDDAG
TIAKEFPEAT
STW.'TQASMVD
HTYTMTITTK
AKDVEGQEHL
DVTANGKVTQ
PNQADLNFGN
FGN"ETSTWTQ
152 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMXDDSY TITTXIKTDA TDEELAPYIE QGGIPNQADL NFGNEGDVLH SNKPTVT-PPA PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIHLPMTNT TVNPLYbIIAG G ITKNKKRKN
SYLAGHTYTM
PTPEDPKKPE
LIVLIVAISF
EF067-3 (SEQ ID NO:255) GCT AGATATCATT GATGTGAAAG TCACCGACGA AAATGGTAAA
GATO.TTACAG
AAACAAGCAG
AAAATTAAAA
CCCAACCAAG
ACCGTAACAC
TTAGATTTAA
GAAACAAGCA
ACTGATGTAA
CAAGAAAATA
AGTGGTCATA
GAATTAGCAC
AACGAAGGTG
GAAGATCCAA
GACCAAGAAT
CAAGCCAGCA
GANGAAAAT G
ACTTTTACTA
ATCACTATTA
GAACAAGGCG
CATTCCAACA
GAACCTAAAC
ACGA.ACCCAG
CTAACGGCAC
ACAGCTATGA
CTGACGCA.AC
CCGACTTAAA
CACCGCCAGT
CCAACCGCGA
CTTGGACCCA
AAGTCACAGA
ACAA.AGTAAC
CGTACACAAT
CTTATATTGA
ACGTGTTGCA
CGATTACAAA
TTAAATGGAA
TGGTGGATGA
GCAAAGATGT
TGAACAAAAA
CCACTAAAAT
GGA'ITCCCAA
AGCCAACCGT
AACCGCTAAA
TCAATTTTGG
AGTAACACAA
CTATTTAAGT
GGACGAAGAA
CTTTGGCAAT
TGATCCAAAT
TCAAGAATTT
AGCCAGCATG
TGAAAATGGT
TITTGAAATG
GACCATTACT
ACAAGGTGGC
TTCCA.ACAAA
AGATATCGAA
CGTCAAAACA
CA ITAATAAA
TACAGATAAT
AGATGACAGC
TAAAACTGAC
CCAAGCCGAC
AACACCGCCT
ACCGAAAAAA
AAAATCAGCA
GAAAATAACA
GGTCATACGT
7TAGCGCCTT
GAAGGTGACG
ATI'GCTAAAG
AAATOGAACG
GTAGATGACA
AAAGATGTTA
AACAANCAAG
ACTAAAATCA
ATTCCCAACC
CCAACCGTAA
GGCCAAGA-AC
GCTTICGGTA
6TGTTAGACA
GGCATAGTAA
TACTCTTACT
GCAACGGATG
TTAAACTTG
GCACCAACC
CCGTTIGACGC
AGTAAAGGAA
AAGTAACTTT
ATACAATGAC
ACATTGAkACA
TGTTACAFI'C
ACGTAGAAGG
TCAAAACALGC
FI'AATA.AAGT
CAGCTAACGG
CNGACAGCTA
AAGCTAGCGC
AAGCCGACTI'
CACCACCTrGC
ATTTAGATT
ACGAAACAAG
TCACAGACGT
CACAAGAAAA
TAGCTGGTCA
AAGAATTAGC
GCAACCAAGG
CAGA-AGACCC
CTACAAATCA
TT
TGAA.ATGAAC
TATCACCACT
AGGCGGGATT
CAACAAACCA
ACAAGA.ACAT
TTWTCGGTAAC
GTTAGACATC
CAAAGTAACA
TGACTATIA
AACGGACGAA
GAACT'ITGGC
ACCAACGCCA
AACCAACCGT
CACATGGACC
GAAAGTTNCT
TAACAAAGTA
TACATACACA
GCCTI'ATATT
TGACGTGFI'G
AAAAAAACCT
TCAAGCACCA
0@ 0@ 0 0* 00 0 0 *0 00 00 0 00 So 00 0 000 S 00 3 0 000 S 0 @00000 0 0000 0 0000 00 0000 0000 0 0000 000@ 0 0000 00 0 00 00 EF067-4 (SEQ ID NO:256) VLDIID VKVTDENGKD VTANGTVTQE NNXVTFEMUK QADSYDYLSG HTYTMTITTK
IKTDATDEEL
DLTNRDQEFK
ENNKVTFEN
EGDVLHSNKP
ASMVDDINKV
TITTKIKTDA
PKQPLKPKKP
APYIEQGGIP
WNVKTAFCNE
XQADSYDYLS
TVTPPAPTPE
LDITDVKVXX
TDEEL.APY IE
LTPTNHQAPT
NQADLNFGNE
TSTWTQASMV
GHTYTMTITT
DPTITKDIEG
ENGKDVTDNG
QGGIPNQADL
NPVNFGKSAS
GDVLHSNKPT
DDINKVLDIT
K IKASATDEE
QEHLDLTNRD
IVTQENNKVT
NFGNEGDVLH
KGIH
VTPPPVDPNI
DVzKVTDENGK LAPY IEQGG I
QEFKWNVKTA
FTNXKDDSY
SNKPTVTPPA
AKDVEGQEHL
DVTANGKVTQ
PNQADLNFGN
FGNETSTWTQ
SYLAAGHTYTM
PTPEDPKKPE
EF068-1 (SEQ ID NO:257)
TAGGGGAAGC
ATGAAA.AAGA
CGCAACATGC
GTAAGTATAG
CTATTATCAA
CCGCCAAACC
TCCGCTGTGT
AATGTAGCTG
TAATGATC T AAArITGTTGA
TTAATTTAGC
CTGTTACCTC
ATGTTACGTC
AAAATCAACC
TI'AGTGGACA
CAGCAGGCAG
GGTATTTATC
GGAFI'TTAAT
AATATCAACT
TGGCACAATC
AAATAATGAC
AGTTAATTTC
AAAACAAGCG
CGCAGCAATC
G=TTTTA
CGGAAAAGTC
GFIATTAT
AGTrtCATCAG
AGTOGGCACTT
ACGG=rCTG
GTGTTAGTGG
AATACCAATG
AAGAAAAGAG
AGCATAAAAA
TI'ACCTCAT
CAGCGGTC'T
CAACGAGTAA
GTGGCGCTFI'
'TCCTCCTGA
TCACGATTCGA
GGACGATCAG
ATGGACAAAA
AGCAATCCCT
GGATATCGAA
TCGTTGGACA
AGCAGATGCT
GTTAAGAGGA
TC'FTTCAAAA
153 TABLE 1. Nucleotide and Amino Acid Seqeucnces of E.[aecalis Genes.
GTTACTTTTT
ATTACCAGTG
CGTCAATTGG
GAAACGTTAG
GTTTTAGCCC
GAGGCAAAAG
CTTCCTGTCA
GGTGGTTCAG
TACCAACTA
GGAACAGTCG
TCCAACATTT
GTAACAGGTA
G=TGAAATCC
GCGTTTACAG
GCGAAAAACG
CCAACCGTAA
GTTAAAGGAA
ATAGGCACAG
GCAGGTGCCA
CCAACAACGT
GTGACAGGTA
GTTGAGATCC
GCATTTACAG
GCGAAAAACG
AATACGCCCG
GAGGTCAAAG
GTCCTTGGTA
ACAGCAACAG
CAACCAGCAA
GGCAACTCTG
GTTCGTGATC
ACTGTAACGC
GATAACGCGG
ACAGCACCAA
ACCGCTCATC
GGGACTGCCG
AATGAAACAT
TTCCAAACAC
ACAACACAAG
GCAACAGACG
ACGITAGC TT
ACTGGACTTG
ATTGG-TGACA
AATACCACCA
GATCAAGGAA
ACAGTTGTTG
GAAGATGCA
TATCAAGTAA
TTAGTTATTG
GGCACAGCAA
AGTAATCCGA
GTTGACAAAA
GGTACAACAG
ACTGATGGCA
TGACTGCCGT
GGGCGTTAGG
A.ATTAGTI'AA
CAGCTGACGG
AA.AATGTTTC
GTACCAGTAT
AAGGCACGGT
GCGTAAATGA
CCGTTTCAAC
TTCAAACAGA
ATTTTGCTGC
ATTCAACAGC
GAAATGCAGG
TTACCGTTCC
CCAGCGGNAC
CCGCACCAAC
CTCCCATGC
GTACCGCTGA
ATGAAACGTT
TCCAA.ACACC
ATTCAACAGC
GAAATGCAGG
TTACCATTCC
CCAGCGGTAC
TGGCGACGCC
GGACTCGCTGA
CT'GCAACAAC
CAAATCAAAC
CGCCGACAAC
GTTCGGGITA
CATCTGGGGC
TACCAACGGG
GAAATrGAAAG
CTGTAACAGG
CGAATGCTAC
ATGGGACTGC
TGACAGCGTI'
CAGCAGATGA
GATATCAAGT
GAACAGTTT
CAGGAAAAGC
AGAGTCA.ACC
TTACTGGAGA
TTGAAGTACG
ACTTTACTGT
GAAAAGACGG
CCGTAGCAGC
CCGGCACGGC
CTACGGGAAC
CAGCTAACGA
CTGCTTTCAA
TCACTGGTAG
TTGAGGTGCG
AATACACAGI
TTTGAATGCA
GAATTTAACT
TAACATTGAA
CTCATACATT
AAACATCTTA
CCCAAGTAAT
AAACGTGGCT
GTTAGTGGAT
ACCTCAAAAT
TCTTTAGAC
ACGCACTACT
AGGTI'ACGAA
AGGCACCGTA
CGCAGGTGAA
AGAAAGNACG
AATCACAGGA
CAATGCCACG
TGGGACAGGA
AACCGCCGTA
AGCGGATGAA
AGGITACGAA
AGGTGC'CGTG
CGCAGGTG;A
AGAAAGTACG
AATTGTTGAG
AGTIGGCACC
TGGAACTGAC
GCTGAGCGTT
ACCAGCTGAT
TCAAATTACA
AGTCATTGGT
AACGACCAAT
TCAACCGACT
AGTAACAGGT
CATCGAAATT
TI'CCTTTGCT
AGCCAAAGAT
AGTAGTGGCA
GACAGGTACC
AGGCACCGCA
AACAGCTAAA
AACTACAGCT
TTCAACAACT
GAACCCAGAT
GGACCTTCCA
TGACGGCAAT
ACCAACTGTG
AGAGCCAAAT
GAC.TGATGGT
AGCCTTAAC7
.AACACCTGCI
CACGACAAAC
TGACGCCGAI
GACT'IAGAG
GCCAATGAIT
GGTGTTGATA
AACTTAGGTG
AGTGCACCCA
CAAGATTTGA
CTTGTCGCCG
GTTCAGGTG
GCTTCTTTAC
TTATCCAATA
G'rrAA ITAT
AGTGAAGTAA
GTTAAAGGAA
ATAGGCACAG
GCAGGCGCCA
CCAACAACGT
GTGACAGGTA
GTTGAAATCC
GCGTTTACAG
GCGAAAAACG
GCAACCGTAA
GTTAAAGGAA
ATAGGTACAG
CCAGGTCA
CCAACAACCT
ACTGTAACAG
ACCATTGAGG
GAAATATA
GTAGCGAAAA
GTCACTGCAC
GGAACAGCAG
ACAGGTACCT
CCTGGGGATA
GAAGTCCTTG
AATTCAGTI'
CGTGATGCAG
GTGAACCTTC
CCTGCTGGCA
CCGCCAAGTG
GCTGAACTTG
ACAACTGGAC
CAAACAGTGA
ATGACACCCG
GGTTATGAAA
GGAACAATTA
GCGGGAGCCG
GAAAGTCAAC
ACGACTGTrA
GTCACCATTG
GCTGCGCAT
GCCAT~rGCGA
GATCCAGAT'G
GCCTATCAAG
GGCACAGTCC
CCAGGGAAGG
TAACCAATGT
=IATTGAC
CTGCTTCATT
TTAGTGATCG
ATCGCGGCAGT
CAGCTATAAA
CTTTGCCT
TAGGCACAAC
ATTTAGATGC
TAGCAACAGC
CCGCACCA.AC
CTGCCGATGC
GTACCGCTGA
ATGAP ACGTT
TCCAAACNCC
ATTCAACGGC
GAAATGCAGG
TTACCC'ITCC
CCAGCGGCAC
CCGCACCAAC
CTGCCGATGC
GTACTGCTGA
ATGAAACGTT
TCCAAACGCC
GTAGTACAAC
TTCGCGATGC
CACTrGACTT
ACGCTAGTCG
CAACAG'ITA
ACCCTAACAC
CTGATGCGAA
CGTTAACAGT
TTCCTGCTGA
CTGG77ATCA
ATGGGAACGT
CAGCTGGGAC
ATACAAGTAC
TCGACAAAGT
GCACCACCAT
CGACTGGCCA
ATGTAG ITGC
CTGATGTI'AC
TCACTGGGAC
TTGGTACAAC
CTAATCCTGG
CAACGGAAGT
CAGGAACAAC
AGATTCACAA
TTACAATCAC
AAGATGCTGC
CACCAGTCGC
TAGTAGGAGC
TTGGCATGGC
CCTCAGCTAA
GATTACTCAA
GGAAGTGAAT
TACAGCTCCG
TTAGGGTTA
TCAAGCmGT
TGCAGCCTTG
ATTAGCGGTT
CACGGTTACT
TCGTTTTGTA
AGACGGTGTA
AATCACAGGA
CAATGCCACG
TGGGACAGGA
AACCGCCGTA
AGCGGATGAA
AGGTTACGAA
AGGCACCCTA
CCCAGGTGAA
AGAAAGTACG
AATCACAGGA
CAATGCCACG
TGGGACAGGG
AACCGCCGTA
AGCGGATCCT
AAAAGGCTAT
AGCTGGCACG
AGATTCAGGA
CACGGAAAGT
TAACATCACA
AACAATCGAA
TGGTGATTTT1
GATTGGAAAG
TGCCACGGTT
GGTGACAGGC
GATTGCAACA
GGCAAATGCG
ACCGACAACC
TACTGGGAAT
TGAAGTI'CGT
ATATACTGTG
TAAAAATGAT
CACACCAACA
GGCGGACCCT
GACAACGGAT
TGATACATTA
GACGGTCCCT
TGCCACTGGG
TGAAGCAGGT
TCT1'CCGACG
TGGGA.AAGAA
GACACCTACT
AGCAGAAGTT
AACTACTGGA
CGAAACAATA
154 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. *fecalis Genes.
ACTOTCGTAG
GTCGACTTAG
ATCACTGGAA
ATTGCTGCTA
GTGACACCAG
CCAGCTACAG
CAAGGAAACA
CAATT ITACA
TTCTCCGTTC
ACAGATACAC
GGAGA.ACCAG
GCCGGCTACC
CTATTAAGAA
TTAACAGCGG
GGACATTACA
GGCAATGGTA
GCAACTGGCG
GGT=AGGCA
AGTGGCTACC
GGCTACTTAC
CC TGTCG
CGAAAAATOC
CCACACCAAC
CGGCGAGCC
CAACTGCTAA
GAGAAACGAT
CCGTTATTCC
AAGCCAATGG
ATAGCAGTGA
ATATTGCAGC
AAGGAAATGT
AGATI'AAAAT
TCATCAAAGG
GTCTIGATTGC
GACAAGCGAC
GTACGGCTAC
ACACTGGCGG
GGAATAATGG
CAACAGGTTC
ACCCTAAACT
CTAGCACAGG
CACTCCTTC
AACAGGAAAA
CATTGATTCT
AAAAACCACT
CGAAACCGGC
TACGATTATT
AGCGGATGT
CTATACAGTC
ACAATTATI'C
AGGGTI'AGCA
GAGTCCTAAA
TGCGGCACCA
AkACAGCTGAA
TGTAGGTGCC
TGCTCAACAA
AACCTTCATG
AAATAACGGC
GAATGTCA
TGGCTTAGGT
AAGTACCATC
TGAAAAAGAG
GAGCATGGGA
GAAACTCAGC
ATI'ACCGGAA
ATGATGTCC
CAATATACGG
AGCAAAGATG
GTTTTAGCGG
ACTGGAACT
GCAAGTGGCA
ACAGAAAAAG
ACCACATTTA
ACTCGTTTCTT
CECAAACCCGAA
ACCGATGCTG
AGTTTACTTG
ACGCCAGCCG
AATACAGGCG
AACACAGOTT
TCACTAGGCA
AGTI'ATGGCA
TCTTCAGCCG
ATCATCAAAC
CAGCTACAGC
ATTCTAGTAA
OGACGCAGA
TGACTCTACC
CCAGGTAA
CGCCAACTAT
CTGATCCAAA
ATACAACTAC
AAACGTTAAC
TGACGCCAGC
CAGTITTAGG
TCNTCA.AAT
AAGGCALACTT
CGACAGCTAC
ACCCAACGAA
GCAATACAGG
CAA.ATCCAAA
ATGGCCTCGG
CTGGAAATCA
TGACAACA.AG
GCA-AACGTAA
AACTACACCA
AGGTTACGA.A
CGGAACCATC
AGCTGGCGTA
TGAAAGTCAA
TACGAAGGTT
TGTCACGGT
TrCGACCTACC
CGCACTAACC
CGATATTACG
AACGTCTAAA
TAGTAACCGA
CGCTATCCAA
CGAMGCGCA
TCC I-GAGGA
AAACAATGC
TGGAGGTTICT
TACAAATGGT
CGGGAA.AACA
TTTTGGC
AA.ACTAG
S
S
S
S
5.55 4,
S
S. S 4 5
S.
EF068-2 (SEQ ID NO:258) M KKKIVEDFNR KSQHKKWTKR KMLNLAISSG LLFTSLAIPV S IAVTSGTIS
AVFSGQKQAV
TSGALGNLTG
LAQNWVSNILQ
GSGVNELVDA
NIYFAAGTTS
FTVTVPAGE-A
KGTADANATV
TTFQTPADEA
FTVTI PAGEA
VKGTAEVGTT
PATATTPADV
VTLPTGTTNP
ADPNATIEIR
QTPADEVVAP
LASGKATAKQ
TTIEVRNPDC
DA'IVAAPPTT
TATANEALTA
TTIVEVRDADG
DLATPTIDSI
TPGETITI IS
PYNSSEQLLA
EPEIKIAAPT
TAGQATAQQS
TGGNNGNGSN
YLPSTGEKES
ASAAVLDIEL
LVVPPELRGN
VDIDLTEVNR
DLNAAVQALE
SLLG TI'VTL
EVTAPTITGV
GANETLTAVA
EIRNAGGTVI
TV1TAPTITGV
GANETLTAVA
IEVRDAACT'J
TAPIVDNITG
GDTL'IVIGKD
DADGNVIATG
PSVDKVTGNT
TVNVVAI{NDT
TI IGTI1'TDD
TIVTC'ITATGY
IAKDAAGKES
TVLGMATTGT
TGNSSKCYEI
KDGAGNESQP
SGNITTGrGTF
VSSVLGTSKA
LLATATDGAG
TGSNPNGGSG
SAVTTSLFGA
LSNVTSNNDS
VAAAGSAAIN
QLELVNNIEN
AKGTS IPSNL
PTTVSTPQNL
TGNSTAGYEV
KNASGTEXTP
GTGTADGTG-A
TGNSTAGYEV
KNASGTESTP
LGTAI'CTDG
NSGSGYEITG
NAGNESQPTE
TADGTGSFAV
TQGYQVTGTA
GLESQPTTAM
QCNFTVELPA
QVTGTAEPNV
NPTAFKTPAD
DGKYTVTLEP
TGTAEPKTTI
ATAVIPADVV
SVHIAAGLAT
GYLIKGTAEP
HYSTATI'FMT
LGTTGSGLGS
FVALLASMGI
GTSTSNRWTA
TNVTIDLSKV
LGAASFTAPE
VAAAINAALL
SNNLDARFVC
KGTADANATV
TTFQTPADEA
?IVVPACEA
KGTADANATV
TTFQTPADPN
KYTVTLD)SGT
TADPNTTIEV
VLVPADATVT
NLPACTANAN
ELG~rIEVRA
TPADVTTPTI
GAANPGDTLT
TIEIHNEAGL
PDAPVATPT'J
GKASANETIT
DVRDADGTI I
LAAPTITKVE
EKETLTALTT
NRIIQISNRL
PADPTNPCG
LGNGLGTNGs
IKRKPXN
ANQNQPVNFT
TFLTAVLNAA
TtAADGSYIS
PVKGTIVNVAV
TVVQTDLLDV
EIRNAGGIVI
'IVTAPT ITO V
GANETLTAVA
EIRNAGGAVI
TPVATP IVET
ATANQTLSWV
RDPSGAVIGT
APTVTGVTGN
ETLTALAKDP
TDGTVLGTAT
GDITGDSTTG
VVGKDGDGNE
VIATIGTTDGA
DKI TOSTTNG
VVAYCNATGKE
AATTANETGQ
GNKANGYTVT
DTQGNVSPKT
LRSVIAVGAT
NGNTGNNGN
GYHPKLSTIS
VSGGALADAS
NDLTNVITQI
API SDGLGLV
SGALPLLAVG
NLLATADGVS
GTGTADGTGA
TCNSTAGYEV
KNASGTESTP
GTGTADGTGA
VTGSTTKGYE
AKNASGTESQ
GTSDANGDFT
SVAGYQVTGT
AGNTSTPTTF
TGPTGQYTVT
YEITGTADPN
SQPTEVTVPE
GA-FTITLPTG
YQVVGAAEVG
SQPATATTPV
YTVTLPACVV
GTADPNVIVQ
TFMTPADITG
DAEGNFAIQL
TGGNTGNNGA
YGTGNHGKTIG
155 TABLE 1. Nucleotide and Amino Acid Seqeuences of Efiaecalis Genes.
EF068-3 (SEQ ID NO:259) CTC TGGCACAATC AGTGCATCAG CAGCGGTCTTI GGATATCGAA
CTAITATCAA
GCCGCAA-ACC
TCCGCTGTGT
AATGTACCTG
GTTACTTTI'
ATTACCAGTG
CGTCAATCGG
GAAACGTTAG
GTTTAGCCC
GAG-GCAAAAG
CTTCCTCTCA
GGTGGTI'CAG
TTACCAACTA
GGAACAGTCG
TCCAACATTT
GTAACAGGTA
G'ITGAAATCC
GCGTTTACAG
GCGAAAAACG
GCAACCGTAA
GTTA.AAGGAA
ATAGGCACAG
CCAGCGCCA
CCAACAACGT
GTGACAGGTA
GTTGAGATCC
GCATTTACAG
GCGAAAAACG
ATC ITACGTC
AAAATCAACC
TTAGTGGACA
CACCAGGCAG
TGACTGCCGT
GGGCGTTAGG
AATTIAGTITAA
CAGCTGACGG
AAAATGTTTC
GTACCAGTAT
AAGGCACGGT
GCGTAAATGA
CCGTTTCAAC
?I'CAAACAGA
ATTTTrGCTGC
ATTCAACAC
GAAATGCAGG
TTACCGTTCC
CCAGCGGNAC
CCGCACCA.AC
CTGCCGATC
GTACCGCTGA
ATGAAACGTT
TCCAAACACC
A'TCAACAGC
GAAATGCAGG
'ITACCA'ITCC
CCAGCGGTAC
AAATAATGAC
AGTTAATTTC
AAAACAAGCG
CCCAGCAATC
TTTGAATG-CA
GAATTTAACT
TAACATTGAA
CTCATACATT
AAACATCTTA
CCCAAGTAAT
A.AACGTGGCT
GTTAGTGGAT
ACCTCAAAAT
TCTTITAGAC
AGGCACTACT
AGGT'rACGAA
AGGCACCCTA
CGCAGGTGAA
AGAAAGNACG
AATCACAGGA
CAATGCCACG
TGGGACAGGA
AACCGCCGTA
AGCGGATGAA
AGGTTACGAA
AGGTGCCGTG
CGCAGGTGAA
AGAAAGTACG
AGTCGCACN'
ACGG'ITCTC
GTGTTAGTGG
AATACCAATG
GCCAATGAT'
GGTGTTGATA
AACP'rAGGTG
AGTGCACCGA
CAAGATTTGA
C ITGTCGCCG G TTCAGGTG
GCTTCTTTAC
TTATCCAATA
GTTAATTTAT
AGTGAAGTAA
GTTAAAGGAA
ATAGGCACAG
GCAGGCGCCA
CCAkACAACGT
GTGACACGTA
GTTGAAATCC
GCGTTTACAG
GC-GAAAAACG
GCAACCGTAA
GTTAAAGGAA
ATAGGTACAG
GCACCTGCGA
CCAACAACGT
CAACGACGTAA
GTGGCGC~rT
TTCCTCCTGA
TCACGATTGA
TAACCAATGT
TTGATTGAC
CTGC=CAT
TTAGTrCATGG
ATGCGG'CAGT
CAGCTATAAA
CTT~TGCCT~TT
TAGGCACAAC
ATTTAGATGC
TAGCAACAC
CCGCACCAAC
CTGCCGATGC
GTACCGCTGA
ATGAAACGTT
TCCAAACNCC
ATTCA-ACGGC
GAAATGCAGG
TTACCGTTCC
CCAGCGGCAC
CCCCACCAAC
CT'GCCGATGC
GTACTGCTGA
ATGAAACGTT
TCCAAACGCC
TCGTTGGACA
AGCAGATGCT
GTTAAGAGGA
TC=TCAAAA
GATTACTCAA
GGAAGTGAAT
TACAGCTCCG
=ITAGGGTTA
TCAAGCTTTG
TGCAGCCTTG
ATTAGCGGTT
CACGGTTACT
TCGTTTTGTA
AGACGGTGTA
AATCACAGGA
CAATGCCACG
TGGGACAGGA
AACCGCCGTA
AGCGGATGAA
AGGTTIACGAA
AGGCACCGTA
CGCAGGTGAA
AGAAAGTACG
AATCACAGGA
CAATGCCACG
TGGGACAGGG
AACCGCCGTA
a a a a a a a a EF068-4 (SEQ ID NO:260) TSGTIS ASAAVLDIEL LSN JTSNNDS GTSTSNRWTA ANQNQPVNFT VSGCALADAS a a a,
AVFSGQKQAV
TSGALGNLTG
LAQNVSN~ILQ
GSGVNELVDA
NIYFAAGTTS
F'IVTVPAGEA
LVVPPELRGN
VDIDILTEVNR
DLNAAVQALE
SLLGTTTVTL
EVTAPTITGV
GANETLTAVA
VAAAGSAAIN
QLELNMIEN
AKOTS IPSNL
P'IIVSTPQNL
TGNSTAGYEV
KNASGTEXTP
TNVTIDLSKV
LGAASFTAPE
VA.AAINAALL
SNNLDARFVG
KGTADANATIV
TTFQTP
TFLTAVLNAA
TLAADGSYI S
PVKGTVNVAV
TVVQTDLLDV
EIRNAGGIVI
NDLTNVITQ I AP ISDOLGLV
SGALPLLAVG
NLLATADGVS
GTGTADGTGA
EF069-1 (SEQ ID NO:261)
TAGGCGAAGC
ATGAAAAAGA
CGCAAGATGC
GTAAGTATAG
CTATI'ATCAA
GCCGCAAACC
TCCGCTGTT
AATGTAGCTG
GTTACTT
TAATCATCTT
AAATTG'IIGA
TAATTAGC
CTG'ITACCTC
ATGTTACGTC
AAAATCAACC
TTAGTCGACA
CAGCAGGCAG
TGACTGCCGT
GGTAFIATC
GGATFFI'AAT
AATATCAAGT
TGGCACAATC
AAATAATGAC
AGTTAATTTC
AAAACA6AGCG
CGCAGCAATC
TTTGAATGCA
GTTrATTFrA
CGGAAAAGTC
GGTTTATI'AT
AGTGCATCAG
AGTGGCACT
ACGGTTTCTC
GTCTTAGTGG
AATACCAATG
GCCAATGATT
AAGAAAAGAG
AGCATAAAA-A
TTACGTCATT
CAGCGGTC T
CAACGAGTAA
GTGGCGCTTI
TI'CCTCCTGA
TCACGArM'A
TAACCAATGT
GGACGATCAG
ATGGACAAAA
AGCAATCCCT
GGATATCGAA
TCGTTGGACA
AGCAGATGCT
GTTAAGAGGA
TCTTTICAAAA
GATTACTCAA
156 TABLE 1. Nucleotide and Am~ino Acid Seqeuences orE. fcaecalis Genes.
ATTACCAGTG
CGTCA.ATTGG
GAAACGTTAG
GTTTTAGCCC
GAGGCAAAAG
CTTCCTGTCA
GGTGGTTCAG
TTACCAACTA
GGAACAGTCG
TCCAACATTT
GTAACAGGTA
G I1TGAAATCC
GCGTTTACAG
GCGAAAAACG
GCAACCGTAA
GTTAAAGGAA
ATAGGCACAG
GCAGGTGCCA
CCAACAACGT
GTGACAGGTA
GTITGAGATCC
GCA ITTACAG
GCGAAAAACG
AATACC CC CC
GAGGTCAAAG
GTCCTTGGTA
ACAGCAACAG
CAACCAGCAA
GGCAACTCTG
GTCGTGATC
ACTGTAACC
GATA.ACGCGG
ACAGCACCAA
ACCGCTGATC
GGGACTGCCG
AATGAAACAT
TTCCAAACAC
ACA.ACACAAG
GCAACAGACG
ACGTTAGCTT
ACTGGACTTG
ATTGGTGACA
A.ATACCACCA
GATCAAGGAA
ACAGTTGTTG
GAAGATGCAA
TATCAAGTAA
TAGITATTG
GGCACAGCAA
AGTAATCCG.A
GTTGACAAAP
GGTACAACAC
ACTGATGCP
ACTGTCGTAC
GGGCGTTAGG
AATTACTTAA
CAGCTGACGG
AAAATGTTTC
GTACCAGTAT
AAGGCACGGT
GCGTAAATGA
CCCTTTCAAC
TTCAAACAGA
ATTTGCTGC
ATTCAACAC
GAAATGCAGC
TTACCGTTCC
CCAGCGGNAC
CCGCACCAAC
CTGCCGATGC
GTACCGCTGA
ATCAAACGTT
TCCAAACACC
ATTCAACAGC
GAAATCCAG
TTACCATTCC
CCAGCGGTAC
TCGCGACGCC
GGAC~rCTGA
CTGCAACAAC
CAAATCAAAC
CGGCGACAAC
G'ITCGGGTTA
CATCTGGGGC
TACCAACGCG
GAAATGAAAC
CTGTAACAGC
CCAATGCTAC
ATGGGACTGC
TGACAGCGT'I
CACCAGATC;
GATATCAAG]
GAACAG=IT
CAGGAAAAGC
AGAGTCAAC(
TTACTGGAG)
TTGAAGTACC
ACTTTACTG'.
GAAAAGACG(
*CCGTAGCAG(
*CCGGCACGG(
CTACGGGAA(
CAGCTAACGA
*CTGCTTTCAX
*TCACTGGTA(
TTGAGGTGCI
AATACACAG'
CGAAAAATG
GAATTTA.ACT
TAACATIMAA
CTCATACATT
AAACATCTTA
CCCA.ACTAAT
AAACGTGGCT
GTTAGTGGAT
ACCTCAAAAT
TCTTTTAGAC
AGGCACTACT
AGGTTACGAA
AGCACCGTA
CGCAGGTGAA
AGAAAGNACG
AATCACAGGA
CAATGCCACG
TGGGACAGGA
AACCCCGTA
AGCGGATGAA
AGGTTACGAA
AGGTGCCGTC
CGCAGGTGAA
AGAAAGTACC
AATTCTTrCAG
AGTTGGCACC
TGGAACTGAC
GCTGAGCGTT
ACCAGCTGAT
TGAAATTACA
AGTCATTGCT
AACGACCAAT
TCAACCGACT
AGTAACAGGI
CATCGAAATI
TTCCTTTGC!I
AGCCAAAGA I AGTAGTCGCb
GACAGGTACC
ACGCACCGC)
AACACCTA-A
AACTACAGC']
k. TTCAACAACI
GAACCCAGAJ
r GGACCTTCCI
TGACGGCAA'
ACCAACTGT(
AGAGCCAAAN
GACT'GATGG'
k. AGCCTAAC'.
IN AACACCTGC' 3CACGACAAA( 3 TGACGCCGA' T' GACTTTAGA
CAACAGGAAA
GdTGTTGATA
AACTTAGCTG
AGTGCACCGA
CAAGATTTGA
CTTGTCGCCG
GTTTCAGGTG
GCTTC-TTAC
TTATCCAATA
GTTAATTTAT
AGTGA-AGTAA
9_TTAAAGGAA
ATAGGCACAG
GCAGGCGCCA
CCAACAACGT
GTGACAGGTA
GTTGAAATCC
GCGTTTACAG
GCGAAAAACG
GCAACCGTAA
GTrAAAGGAA
ATACCTACAG
GCAGGTGCGA
CCAACAACGT
ACTGTAACAG
ACCATTGAGG
GGAAAATATA
GTAGCGAAAA
GTCACTGC C:
GGAACAGCAG
ACAGGTACCT
CCTGGGGATA
GAAGTCCTTG
AATTCAG'TC
CGTGATGCAG
GTGAACCC
CCTGCTGCCP
CCGCCAAGCTC
GCTGAAC'TrC
ACAACTGCAC
cAAACAGTC) 7 ATIACACCCC 7GGTTATGAAJ
CCGAACAATTI
k GCGACCC r GAAAGTCAAC
'ACGACTGTTJ
P GTCACCA'IT r' GCTGGCGCA' r GCcATTGCGJ r GATCC.AGAT( :GGcTATCAA( r GGCACAGTC,
:CCAGGGAAGI
A, GAAAGTCAG TTGATrTGAC CTC 1CATT
TTAGTGATGG
ATGCGGCAGT
CAGCTATAA.A
CTITGCC FIT
TAGGCACAAC
ATTTACATCC
TAGCAACAGC
CCGCACCAAC
CTGCCGATGC
GTACCGCTGA
ATGAAACGTT
TCCAAACNCC
A'ITCAACGGC
GAA.ATGCAGG
TTACCGTTCC
CCAGCGGCAC
CCGCACCAAC
CTGCCGATCC
GTACTGCTGA
ATGAAACGTT
TCCAAACGCC
GTAGTACAAC
TTCGCGATGC
CAGTIGACTTT
ACGCTAGTGG
CAACAGTTGA
AC CCTA.ACAC
CTGATGCGAA
CGTTAACAGT
TTCCTGCTGA
CTGG'!FrATCA ATrCCCAACGT
CAGCTGGGAC
ATACAAGTAC
TCGACAAAGI
GCACCACCAU
CGACTGGCCP
ATGTAGTTGC
;CTC;ATCTTAC
k TCACTGGGAC k TTGGTACAAC
CTAATCCTC
:CAACGGAAG'J
CAGGAACAAC
2AGATTCACAJ r TTACAATCA( 4, AAGATGCTGC
CACCAGTCC(
S TAGTAGGAG( CT'rGGCATGG( C CCTCAGCTA C CAGCTACAC(
GGAAGTGAAT
TACAGCTCCG
TTTAGGGTTA
TCAAGC'rrTG TGCAGCC ITG
ATTAGCGGTT
CACGGTTACT
TCGTTTTGTA
AGACGGTGTA
AATCACAGGA
CAATGCCACG
TGGGACAGGA
AACCGCCGTA
ACCGGATGAA
AGGTTACGAA
AGGCACCGTA
CGCAGGTGAA
AGAAAGTACG
AATCACAGGA
CAATGCCACG
TCCGACAGGG
AACCGCCGTA
AGCGGATCCT
AAAAGGCTAT
AGCTGGCACG
AGATTCAGGA
CACGGAAAGT
TAACATCACA
AACAATCGAA
TGGTGATTTT
GATTCGAAAG
TGCCACGGTT
GGTGACACCC
GATTCGCAACA
GGCAA.ATGCG
ACCGACAACC
TACTGCAAT
*TGAAGTTCGT
ATATACTGTG
TAAAAATGAT
CACACCAACA
GGCGGACCCT
GACAACGGAT
TGATACATTA
GACGGTCCCT
TCCCACTGGG
TGAAGCAGGT
TCTTCCGACG
TGGGAAAGA6A
GACACCTACT
AGCAGAAGIT
AACTACTGGA
k. CGAAACAATA
-AACTACACCA
a a.
157 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecali-v Genes.
GTCGACTTAG
ATCACTGGAA
ATTGCTGCTA
GTGACACCAG
CCAGCTACAG
GAAGGAAACA
CAATTTTACA
TTCTCCCTTC
ACAGATACAC
GGAGAACCAG
GCCGGCTACC
CTATTAAGAA
TTA.ACAGCGG
GGACATTACA
GGCAATGGTA
GCAACTGGCG
GGTTTAGGCA
AGTGGCTACC
GGCTACTTAC
GCCTTTGTCG
CCACACCAAC
CGGCGGAGCC
CAACTGCTAA
GAGAAACGAT
CCGTTArrCC
AACCCAATCG
ATAGCAGTGA
ATATTGCAGC
AAGGAAATGT
AGATrAAAAT
TCATCAAAGG
GTGMTTGC
GACPLAGCGAC
GTACGGCTAC
ACACTOCCOG
GGAATAATGG
CAACAGGTTC
ACCCTAAACT
CTAGCACACG
CACTCCTTGC
CATTGATTCT
AAAAACCACT
CGAAAC CCCC
TACGATTATT
AGCGGATGTT
CTATACAGTC
ACAATTATTG
AGGGTTAGCA
GAGTCCTAAA
TGCGGCACCA
AACAGCTGAA
TGTAGGTGCC
TGCTCAACAA
AACCTTCATG
APLATAACGGC
GAATGGTTCA
TGGCTTAGGT
AAGTACCATC
TGAAA.AAGAG
GAGCATGGGA
ATTACCGGAA
ATTGATGTCC
CAATATACC
AGCAAAGATG
GT'rTTAGCGG
ACTGGAACTC
GCAAGTGGCA
ACAGAAA-AAG;
ACCACATTTA
ACTG'ITCTT
CCAAACCGAA
ACCGAtCCTG
AGTTTACTTG
ACGCCAGCCG
AATACAGGCG
AACACAGGTI'
TCACTAGGCA
AGTTATGGCA
TCTTCAG CCC
ATCATCAAAC
ATI'CTA*TAA
GTGACGCAGA
TGACTCTACC
GCGCAGGTAA
CGCC-AACTAT
CTGATCCAA-A
ATACA-ACTAC
AAACGTTAAC
TGACGCCAGC
CAGTTTAGG
TCATTCAAAT
AAGGCAACTT
C:GACAGCTAC
ACCCAACGAA
GCAATACAGG
CAAATCCAAA
ATGGCCTCGG
CTGGAAATCA
TGACA.ACAAG
GCAAACG"AA
AGGTTACGAA
CGGAACCATC
AGCTGGCGTA
TGAAAGTCAA
TACGAAGGTT
TGTCACGGTT
TGGAGGTACC
CGCACTAACC
CGATATTACO
AACGTCTAAA
TAGTAACCGA
CGCTATCCAA
CGATGGCGCA
TCCTGGAGGA
AAACAATGGC
TGGAGGTTCT
TACAA.ATGGT
CGGGAAAACA
TTTGTTTC
AAACTAG
EF069-2 (SEQ ID NO:262) M KKKIVEDFNR KSQHKKWTKR KMLNLAISSG LLFTSLAIPV *0 .q ~0 *0 S IAVTSGTIS
AVFSGQKQAV
TSGALGNLTG
LAQNVSNI LQ
GSGVNELVDA
NIYFAACTTS
F'IVTVPAGEA
KGTADANATV
TTFQTPADEA
FTVTI PAGEA
XTKGTAEVGTT
PATATTPADV
VTLPTGTTNP
ADPNATIEIR
QTPADEVVAP
LASGKATAKQ
TT-IEVRNPIDC
DATVAAPTVT
TATANEALTA
T'IVEVRDADG
DLATPTIDS I
TPGETITIIS
FYNSSEQLLA
EPEIKIAAPT
TAGQATAQQS
TGGNNGNGSN
YLPSTGEKES
ASAAVLDIEL
LVVPPEJRGN
VDI DLTEVNR
DLNAAVQALE
SLLGTTTVTL
EVTAPTITGV
GANETiLTAVA
EIRNAGGIVI
TVTAPTITGV
GANETLTAVA
IEVRDAAGIV
TAPTVDNITOG
GDTLTVIGKD
DADGNVIATO
PSVDKVTGNT
TVNVVAKNDr TI IGT'flTDD
TVTGTTATGY
IAKDAAGKE-S
TIVLGMATTGT
TGNSSKGYEI
KDGAGNESQP
SGNTTTGGTF
VSSVLGTSKA
LLATATDGAG
TGSNPNCGSG
SAVTISLFGA
LSNVTSNNDS
VAAAGSAAIN
QLELVNNIEN
AKGTS IPSNL
PTTVSTPQNL
TGNSTAGYEV
KNASGTEXTP
GTGTADGTGA
TGNSTAGYEV
KNASGTESTP
LGTATTGTDC
NSGSGYEITG
NAGNESQPTE
TADGTGSFAV
TQGYQVTGTA
GLESQPT'AM
QGNFTVDLPA
QVTGTAEPNTV
NPTAFKTPAD
DGKYTVTLEP
TGTAEPKTTI
ATAVI PADVV SVH IAAGLAT
GYLIKGTAEP
HYSTATTFMT
LGTTGSGLGS
FVAT-LASMG I
GTSTSNRWTA
TNVTIDLSKV
LGAASFTAPE,
VAAAINAALL'
SNNLDARFVG
KGT ADANATV
TTQTPADE-A
FTVTVPAGEA
KGTADANATV
TTFQTPADPN
KYTVTLDSGT
TADPNTTIEV
VLVPADATVT
NLPAGTANAN
ELGTTIEVRA
TPADVTTPTI
GAANPCDTLT
TIE IHNEAGL
POAPVATPTV
GKASANETIT
DVRDADGTI I
LAAPTITKVE
EKETLTALTT
NRIIQISNRL
PADPTNPGGG
LGNGLGTNGS
IKRKRKN
ANQNQPVNFT
TFLTAVLNAA
TLAADGSYIS
PV-XGTVNVAV
'IVVQTDLLDV
EIRNAGGIVI
TVTAPTITGV
GANETLTAVA
E IRNAGGAVI TPVATP IVET
ATANQTI.SVV
RDPSGAVIGT
APTVTGVTGN
ETLTALAKIJP
TIDGTVLGTAT
GDITGDSTTG
VVGKIDC DNE
VIATGTTDGA
DKITGSTTNG
VVAKNATGKE
AATTA.NETGQ
GNKA-NGYTVT
DTQGNVSPKT
LR.SVIAVGAT
NGNTGGNNGN
GYHPKLSTIS
VSGGALADAS
NDLTNVITQ I
APISDGLGLV
SCALFLLAVG
NLLATADGVS
GTGTADGTGA
TCNSTAGYEV
KNASGTESTP
GTGTADGTGA
VTGSTTKGYE
AKNASGTESQ
GTSDANGDFT
SVAGYQVTGT
AGNTSTPTTF
TGPTGQYTVT
YEITGTADPN
SQPTEVTVPE
GAFTITLPTG
YQVVGAAEVG
SQPATATTPV
YTVTLPAGVV
GTADPNVTVQ
TFMTPADITG
DAEGNFAIQL
TGGNTGNNGA
YGTGNHGKTG
EF069-3 (SEQ ID NO:263) 158 TABLE 1. Nucleotide and Amino Acid Seqeuenccs of E. faecalis Genes.
AGGTGAA GCAGGTGCGA ATGAAACGTI' AACCGCCGTA
GCGAAAAACG
AATACCCCCG
GAGGTCAAAG
GTCCI'GGTA
ACAGCAACAG
CAACCAGCAA
GGCAACTCTG
GTTCGTGATC
ACTGTAACC
GATAACGCCG
ACAGCACCAA
ACCGCTGATC
GGGACTCCC
AATGAAACAT
TTrCCAAACAC
ACAACACAAG
GCAACAGACG
ACGTTAGCTT
ACTGGACTTG
ATTGGTGACA
AATACCACCA
GATCA.AGGAA
ACAGTTC'Frc
GAAGATGCAA
CCAGCGGTAC
TGGCGACGCC
GGACTCCTGA
CTGCAACAAC
CAAATCAAAC
CGGCGACAAC
GTTCGGGTTA
CATCTGGGGC
TACCAACGGG
GAAATGAAAG
CTGTA-ACAGd
CGAATGCTAC
ATGGGACTC
TGACAGCGTT
CAGCAGATGA
GATATCAAGT
GAACAGT
CAGGAAAAGC
AGAGTCAACC
TTACTGGACA,
TTGAAGTACG
ACTTTACTGT
GAAAAGACG
CCGTAGCAGC
AGAAAGTACG
AA ITTGAG
AGTTGGCACC
TGGAACTGAC
GCTGAGCGTT
ACCACCTGAT
TGAAAITACA
AGTCAT'GGT
AACGACCAAT
TCAACCGACT
AGTAACAGGT
CATCGAAATT
TTCCTTTCCT
AGCCAAAGAT
AGTAGTGGCA
CACAGGTACC
AGGCACCGCA
AACAGCTAAA
AACTACAGCT
TTCAACAACT
GAACCCAGAT
GGACCTTCCA
TGACCGCAAT
ACCAACTGTG
CCA.ACAACGT
ACTGTAACAG
ACCATTGAGG
GGAA.AATATA
GTAGCGAAAA
GTCACTGCAC
GGAACAGCAG
ACAGGTACCT
CCTGGGGATA
CAAGTCCTTG
AATTCAGTT'G
CGATGCAG
GTGAACCTTC
CCTGCTGGCA
CCGCCAAGTG
GCTGAACTTG
ACAACTGGAC
CAAACAGTGA
ATGACACCCG
GTATGAAA
GGAACAATTA
GCGGGAGCCG
GAAACTCA.AC
ACGACTGTTA
TCCAAACOCC
GTAGTACAAC
TTCGCGATGC
CAGTrGACTTT
ACGCTAGTGG
CAACAG'ITGA
ACCCTAACAC
CTGATGCGA.A
CGTTAACAGT
TTCCTGCTGA
CTGGTATCA
ATGGGAACGT
CAGCTGOAC
ATACAAGTAC
TCGACAAAGT
GCACCACCAT
CGACTGGCCA
ATGTAGTTGC
CTGATG'ITAC
TCACT=CAC
TTGGTACAAC
CTAATCCTGG
CAACGGAAGT
CAGGAA
AGCGGATCCT
AAAAGGCTAT
AGCTGGCACG
AGATTCAGGA
CACGGAAAGT
TAACATCACA
AACAATCGAA
TGGTGATTTTI
GATTGGAAAG
TGCCACGGTT
GGTGACAGGC
GATTGCAACA
GGCAAATGCG
ACCGACAACC
TACTGGGAAT
TGAAGTTCGT
ATATACTGTG
TAAAATGAT
CACACCAACA
GGCGGACCCT
GACAACGGAT
TGATACATTA
GACGGTCCCT
S
S.
EF069-4 (SEQ ID NO:264) AGEA GANETLTAVA KNASGTEXTP TTFQTPADE-A TVTAPTITGV TGNSTAGYEV
KGTADANATV
TTFQTPADEA
FTVTIPAGEA
VKGTAEVGTT
PATAT1'PADV
VTLPTGTITNP
ADPNATIEIR
QTPADEVVAP
LASGKATAKQ
TTI EVRNPDG
EIRNAGGTVI
TVTAPTITGV
GANETLTAVA
I EVRDAAGTV
TAPTVDNITG
GDTLTVICKD
DADGNVIATG
PSVDKVTGNT
TVNVVAKNDT
TI IGTTTTID
GTGTADGTGA
TGNSTAGYEV
KNASCTESTP
LGTATI'GTOG
NSGSGYEITG
NAGNESQPTE
TADGTGSFAV
TQGYQVTGTA
GLESQPTTAM
QGNFTVDLPA
FTVTVPAGEA
KGTADANATV
TTFQTPADPN
KYTVTLDSGT
TADPNTT 1EV
VLVPADATVT
NLPAGTA.NAN
ELGTTIEVRA
TPADVTTPTI
GAANPGDTLT
GANETLTAVA KNASGTESTP EIRNAGGAVI GTGTADGTGA TPVATPIVET VTGSTTKGYE ATANQTLSVV AXNASCTESQ RDPSGAVIGT GTSDANGDFT APTVTGVTGN SVAGYQVTGT ETLTALAKDP AGNTSTPTTF TDGTVLGTAT TGPTGQYTVT CDITGOSTTG YEITGTADPN VVGKDGrDGNE SQPTEVTVPE DATVAAPTVT TVTGT EF070-1 (SEQ ID NO:265)
TACGGGAAC
ATGAAAAAGA
CGCAAGATGC
GTAAGTATAG
CTATTATCAA
GCCGCAAACC
TCCGCTGTGT
AATGTAGCTC
GTTACTr
ATTACCAGG
TAATGATCTT
AAATTGTTGA
TAATTTAGC
CTGTTACCTC
ATGTTACCTC
AAAATCAACC
TTAGTGGACA
CAGCAGGCAG
TGACTGCCGT
GGCGCGTTAGG
GGTATTTATC
GGATITAAT
AATATCAAGT
TGGCACAATC
AAATAATGAC
AGTTA.ATTTC
AAAACAAGCG
CGCAGCAATC
TTTGAATGCA
GAA'TI'AACT
GTTrTATrA
CGGAAAAGTC
GGTTTATTAT
AGTGCATCAG
AGTGCCACTT
ACGTTCTG
GTGTTAGTGG
AATACCAATG
GCCAATGATT
GGTGTTGATA
AAGAAAAGAG
AGCATAAAAA
TTACGTCATT
CAGCGGTCTT
CAACGAGTAA
GTGGCGCTTT
TTCCTCCTGA
TCACGATTGA
TAACCAATGT
TTGATMTAC
GGACGATCAG
ATGGACAAAA
AGCAATCCCT
GGATATCGAA
TCGTTGGACA
AGCAGATGCT
GTTAAGAGGA
TCTTTCAAAA
GATTACTCAA
GGAAGTGAAT
159 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes.
CGTCAATTCG
GAAACGTTAG
GTTTTAGCCC
GAGCCAAAAG
CT-TCCTGTCA
GGTGGTTCAG
TTACCAACTA
GGAACAGTCG
TCCAACATTT
GTAACAGGTA
GTTGAAATCC
GCGTTTACAG
GCGAAAAACG
GCAACCGTAA
GTTAAAGGAA
ATAGGCACAG
GCAGGTGCCA
CCAACAACGT
GTGACAGGTA
GTTGAGATCC
GCATTACAG
GCGAAAAACG
AATACGCCCG
GAGCTCAAAG
GTCCTTGGTA
ACAGCAACAG
CAACCAGCAA
GGCAACTCTG
GTTCGTGATC
ACTGTAACGC
GATAACGCGG
ACAGCACCAA
ACCGCTGATC
GGGACTGCCG
AATGAAACAT
TT'CCAAACAC
ACAACACAAG
GCAACAGACG
ACGTTAGCTT
ACTGGACTTG
ATTGGTGACA
AATACCACCA
GATCAAGGAA
ACAGTTGTTG
GAAGATGCAA
TATCAAGTAA
TTAGTTATTG
GGCACAGCAA
AGTAATCCGA
GTTGACAAA
GGTACAACAG
ACTGATGGCA
ACTGTCGTAG
GTCGACTTAG
AATTAGTTAA
CAGCTGACCG
AAAATGTTTC
GTACCAGTAT
AAGGCACGGT
GCGTAAATGA
CCG=TCAAC
TTCAAACAGA
ATGCTC
ATCAACAGC
GAAATGCAG
TTACCGTTCC
CCAGCGGNAC
CCGCACCAAC
CTGCCGATGC
GTACCGCTGA
ATCAA.ACGTT
TCCAAACACC
A ETCAACAGC
GAAATG;CAG
TTACCATTCC
CCAGCGGTAC
TCGCGACGCC
GGACTGCTr-A
CTGCAACAAC
CAAATCAAAC
CGGCGACAAC
GTTCGGGTrA
CATCTGGGGC
TACCAACCCG
GAAATGAAAG
CTGTAACACG
CGA.ATCGCTAC
A~rGGACTGG
TGACAGCGTT
CAGCAGATGA
GATATCAACT
GAACAGTT
CACCAAAAC
AGAGTCAACC
TTACTGGAGA
TMAAGTACG
ACTTTACTGT
GAA.AAGACGG
CCGTAGCAGC
CCGGCACGGC
CTACGGGAAC
CAGCTAACGA
CTCTTCAA
TCACTGGTAG
TTGAGGTCCG
AATACACACT
CGAAAAAT~c
CCACACCA.AC
TAACA'ITGA
CTCATACATT
AAACATCTTA
CCCAAGTAAT
AAACGTCGCT
GTTAGTGGAT
ACCTCAAAAT
TC LTTTACAC
AGGCACTACT
AGGTTACGAA
AGCCACCGTA
CGCAGGTGAA
AGAAAGNACG
AATCACAGGA
CAATGCCACG
TGGGACAGGA
AACCGCCCTA
AGCGGATGAA
AGGTTACGAA
ACGTGCCGTC
CGCAGGTGAA
AGAAAGTACG
AATTGTTCAG
AGI'CGCACC
TGGAACTGAC
GCTGAGCGT
ACCACCTGAT
TGAAATI'ACA
AGTCAITGGT
AACGACCAAT
TCAACCGACT
AGTAACAGGT
CATCGAAAT
TTCC TTGCT
AGCCAAAGAT
ACTAGTGGCA
GACAGGTACC
AGGCACCGCA
AACAGCTAAA
AAC'TACAGCT
TTCAACAACT
GAACCCAGAT
GGACCTTCCA
TGACGGCAAT
ACCAACTGTG
AGAGCCAAAT
GACTGATGGT
AGCCTTAACT
AACACCTGCT
CACGACAAAC
TGACGCCGAT
CACTTTAGAG
AACAGGAAAA
CATTGATTCT
AACTT2AGGTG CTGCTTCATT TACAGCTCCG AGTGCACCGA TTAGTGATGG =TAGGGTTA CAAGATTTGA ATCCGCAGT TCAAGCTITG CTTGTCGCCG CAGCTATAAA TGCAGCCTTG GT'TTCAGGTG CTTTGCCTTT ATTAGCGGTT GCTTCTI'TAC TAGGCACAAC CACGG'ITACT TTATCCAATA ATTTAC-ATGC TCGTTTTGTA GTTAATTTAT TAGCAACAGC AGACGGTGTA AGTGAAGTAA CCGCACCAAC AATCACAGGA GTTAAAGGAA CTGCCGATGC CAATGCCACG ATAGGCACAG GTACCGCTGA TGGGACAGCA G&AGGCGCCA AT;AAACGTT AACCGCCCGTA CCAACAACGT TCCAAACNCC AGCGGATGAA GTGACAGGTA ATTCAACGGC AGGTTAC'GAA GTTCAALATCC GAAATCCAGG AGGCACCGTA GCGTTTACAG TTACCGTTCC CGCAGGTGAA GCGAAAAACG CCACCCAC AGAAAGTACG GCAACCGTAA CCGCACCAAC AATCACAGGA GTTAAAGGAA CMCCGATGC CAATOCCACG ATAGTACAG GTACTGCTGA TGGGACAGGG GCACGTGCGA ATGAAACGTT AACCGCCGTA CCAACAACGT TCCAAACGCC AGCGGATCCT ACTGTAACAG CTAGTACAAC AAAAGGC TAT ACCATTGAGG TTCGCGATGC AGCTGGCACC GGAAAATATA CAGTGACTTT AGATTCAGGA GTAGCGAAAA ACGCTAGTGG CACGGAAAGT GTCACTGCAC CAACAG'ITGA TAACATCACA GGAACAGCAG'ACCCTAACAC AACAATCGAA ACAGGTACCT CTGATGCGAA TGGTGATTT CCTGGGGATA CGTTAACAGT GATTGGAAAG GAAGTCCTTG TTCCTGCTGA 1GCCACGG T AATTCAGTTG CTGGTTATCA GGTGACAGGC CGTGATCCAG ATGGGAACGT GATTGCAACA GTGAACC=~ CACTCCGAC CGCAAATGCG CCTGCTGGCA ATACAAGTAC ACCGACAACC CCCCAACTC TCGACAAAGT TACTGGGAAT CCTr.AAC7-Ir GCACCACCAT TGAACTTCGT ACAACTGGAC CGACTGGCCA ATATACTGTG CAAACAGTGA ATGTAGTTGC TAAAALATGAT ATGACAC CCG CTGATTAC CACACCAACA GGTTATGAAA TCACTGGGAC GGCGGACCCT GGAACAAITA TTGGTACAAC GACAACGGAT GCGCGAGCCC CTAATCCTG.G TGATACATA GAAAGTCAAC CAACGGAAGT GACGGTCCCT ACGACTGTI'A CAGGAACAAC TGCCACTGGG GTCACCATT Ac3ATTCACAA TGAAGCAGGT GCTGGCGCAT TrACAATCAC TCTTCCGACG GCCATflGCGA AAGA-TGCTGC TGGGAAAGAA GATCCAGATG CACCAGTCGC GACACCTACT GGCTATCAAG TAGTAGGAGC AGCACAAGTT GGCACAGTCC TTGGCATGGC AACTACTGGA CCAGGAAGG CCTCAGCTAA CGAAACAATA GAAAGTCAGC CACCTACAGC AACTACACCA ATTACCCGAA ATTCTALGTAA AGGTTACGAA 160 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes.
*ATCACTIGGAA
ATTGCTGCTA
GTGACACCAG
CCAGCTACAG
GAAGGAAACA
CAATTTrrACA
TTCTCCGTTC
ACAGATACAC
GGAGAACCAG
GCCGGCTACC
CTATTAAGAA
TTAACAGCGG
GGACATTACA
GGCAATGGTA
GCAACT'GGCG
GTTAG4GCA
AGTGGCTACC
GGCTACTTAC
GCCTTTGTCG
CGGCGGAGCC
CAACTGCTAA
GAGAAACGAT
CCGTTATTCC
AAGCCAATGG
ATAGCAGTGA
ATATTGCAGC
AAGGAAATGT
AGATTAAAAT
TCATCAAAGG
GTGTGA'ITGC
GACAAGCGAC
GTACGGCTAC
ACACTGGCGG
GGAATAATGG
CAACAGGTIC
ACCCTAAACT
CTAGCACAGG
CACTCCTTGC
AAAAACCACT
CGAAACCGGC
TACGATI'ATT
AGCGGATGTT
CTATACAGTC
ACAA'ITATTG
AGGCTIAGCA
GAGTCCTAAA
TGCGGCACCA
AACAGCTGAA
TGTAGGTGCC
TGCTCA.ACAA
AACCTT'CATG
AAATAACGGC
GAATGGTTCA
TGGCTTAGGT
AAGTACCATC
TGAAAAAGAG
GAGCATGGGA
ATTGATGTCC
CAATATACGG
AGCAAAGATG
GTTTTAGCGG
ACTGGAACTG
GCAAGTGGCA
ACAGAAAAAG
ACCACATTTA
ACTGTTTCT'
CCAAACCGAA
ACCGATGCTC
AGT=TACTTG
ACGCCAGCCG
AATACAGGCG
AACACAGGTT
TCACTAGGCA
ACTtATGGCA
TCTTCAGCCG
ATCATCAAAC
GTGACGCAGA
TGACTCTACC
GCGCAGGTAA
CCCCAACTAT
CTCATCCAAA
ATACALACTAC
AAACGTTAAC
TGACGCCAGC
CAGI=TAGG
TCATI'CAAAT
ALAGGCAACTI'
CGACAGCTAC
ACCCAACGAA
GCAATACACG
CAAATCCAAA
ATGGCCTCGG
CTGGAAATCA
TGACAACAAC
GCAAACGTAA
CCAACCATC
AGCTGGCGTA
TGAAAGTCAA
TACGAAGGTT~
TGTCACGITr
TGGAGGTACC
CGCACTAACC
CGATATTACG
AACGTCTAAA
TAGTAACCGA
CGCTATCCAA
CGATGGCGCA
TCCTGGAGGA
AAACAATGGC
TGGAGGTTCT
TACAAATGGT
CGGGAAAACA
TTTGT PTGGC
AAACTAG
EFO?O-2 (SEQ ID NO:266) M KKKIVEDFNR KSQHKKWTKR KMLNLAISSG'LLFTSLAIPV
SIAVTSGTIS-
AVPSGQKQAV
TSGALGNLTG
LAQNVSNILQ
GSGVNELVDA
N IYFAAGTITS
FTVITVPAGEA
KGTADANATV
TTFQTPADEA
FTVTI PAGEA VKGTALVGTr
PATATTPADV
VTLPTGTTNP
ADPNATIEIR
QTPADEVVAP
LASGKATAKQ
TTIEVRNPDG
DATVAAPTVT
TATANEALTA'
TTVEVRDADG
DLATPTIDSI
TPGETITI IS
FYNSSEQLLA
EPEIKIAAPT
TAGQATAQQS
TGGNNGNGSN
YLPSTGEKES
ASAAVLDIEL
LVVPPELRGN
VDIDLTEVNR
DLNAAVQALE
SLLGTTTVTL
EVTAPTITGV
GANETLTAVA
EIRNAGGTVI
TVTAPTITGV
GANETLTAVA
IEVRDA.AGTV
TAPTVDNITG
GDTLTVIGKD
DADGNVIATG
PSVDKVTIGNT
TVNVVAKNDT
TI IGTT'ITDD TVTGTTATGy
IAKDAAGKES
IVLGMATI'GT
TGNSSKGYE I
KDGAGNE-SQP
sGN'TTCCGTF
VSSVLGTSKA
LLATATDGAG
TGSNPNGGSG
SAVTTSLFGA
LSNVTSNNDS
VAAAGSAAIN
QLELVNNIEN
AKGTSIPSNL
PTTVSTPQNL
TGNSTACYEV
KNASGTEXTP
GTGTADGTGA
TGNSTAGYEV
KNASGTESTP
LGTATTGTDG
N~SGSGYE ITG
NAGNESQPTE
TADGTGSFAV
TQGYQVTGTA
CLESQPTTAM
QGNFTVDLPA
QVTGTAEPNV
INPTAFKTPAD
DGRYTVTLEP
TGTAEPKTTI
ATAVI PADVV
SVHIAAGLAT
GYLIKGTAEP
HYSTAITFMr
LGTTGSGLGS
FVALLASMGI
GTSTSNRWTA
TNVTIDLSKV
LGAASFTAPE
VAAAINAALL
SNNIJDARFVG
KGTADANATV
TTFQTPADEA
FTVTVPAGEA
KGTADANATV
TTFQTPADPN
KYTVTLDSGT
TAD)PNTTIEV
VLVPADATVT
NLPAGTANAIJ
ELGTTIEVRA
TPADVTTPTI
GAANPGDTLT
TIEIHNE-AGL
PDAPVATPTV
GKASANETIT
DVRDADGTI I
LAAPTITKVE
EKETLTALTT
NRIIQISNRL
PADPTNPGGG
LGNGLGTNGS
IKRKRKN
ANQNQPVNF'
TFLTAVLNAA
TLAADGSYIS
PVKG'IVNVAV
'TVVQTDLLDV
El RNAGGIVI
TVTAPTITGV
GANETLTAVA
EIRNAGGAVI
TPVATP IVET
ATANQTLSVV
RDPSGAVIGT
APTVTCVTGN
ETLTALAXDP
TDGTVLGTAT
GDITGDSTTG
VVGKD)GDCUE
VIATG=rGA DKITGSTrNG
VVAKNATGKE
AATTANETGQ
GNKANGY'IVT
DTQGNVSPKT
LRSVIAVCAT
NGNTGGNNGN
GYHPKLSTIS
VSGGALADAS
NDLTNVITQI
APISDGLGLV
SCALP LLAVG
NLLATADGVS
GTGTADGTGA
TGNSTAGYEV
KNASGTESTP
GTGTADGTGA
VTGSTTKGYE
AKNASGTESQ
GTSDANGDFT
SVAGYQVTGT
AGNTSTPITF
TGPTGQYTVT
YEITGTADPN
SQPTEVTVPE
GAFTITLPTG
YQVVGAAEVG
SQPATATTPV
YTVTTLPAGVV
GTADPNVTVQ
TFMTPADITC
DAEGNFAIQL
TGGNTGNNGA
YGTGNI{GKTG
EF070-3 (SEQ ID NO:267) CGG TCACGGCAAT GAAAGTCAAC CAACGGAAGT GACGGTCCCT 161 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes.
GAAGATGCAA
TATCAAGTAA
TTAC ETATTIG
GGCACAGCAA
AGTAATCCGA
GTTGACAA.AA
GGTACAACAG
ACTGATGGCA
ACTGTCGTAG
GTCGACTTAG
ATCACTGGAA
ATTGCTGCTA
GTGACACCAG
CCAGCTACAG
GAAGGAAACA
CAATTTTACA
TTCTCCGTTC
ACAGATACAC
GGAGAACCAG
GCCGGCTACC
CTATTAAGAA
TTAACAGCGG
GGACATTACA
GGCAATGGTA
CCAACTCGCG
GGTTTAGGCA
AGTGGCTACC
GGCTACT
CCGTAGCAGC
CCGGCACGGC
CTACGGGAAC
CAGCTAACGA
CTGC'I=CAA
TCACTGGTAG
TTGAGGTGCG
AATACACAGT
CGAAAAATGC
CCACACCAAC
CGGCGGAGCC
CAACTGCTAA
GAGAAACGAT
CCGTTATTC
AAGCCAATGG
ATACCAGTCA
ATATTGCAC
AAGGAAATGT
AGA ITAAAAT
TCATCAAAGG
GTGTGATC
GACAAGCGAC
GTACGGCTAC
ACACTGGCGG
GGAATAATGG
CAACAGCT'C
ACCCTAAACT
ACCAACTGTG
AGAGCCAAAT
GACTGATGGT
AGCCTTAACT
AACACCTGCT
CACGACAAAC
TGACGCCCAT
GACTTAGAG
AACAGGAAAA
CATTGATTCT
AAAAACCACT
CGAAACCGGC
TACGATTATT
AGCGGATGTT
CTATACAGTC
ACAATTATTG
AGGGTTAGCA
GAGTCCTAAA
TGCGCCACCA
A.ACAGCTGAA
TGTAGGTGCC
TGCTCAkACAA
AACCTTCATG
AAATAACGGC
GAATGGTTCA
TGGCTTAGGT
AAGTACCATC
ACGACTGTTA
GTCACCATTG.
GCTGGCGCAT
GCCATTGCGA.
GATCCAGATG
GGCTATCAAG
GGCACAGTCC
CCAGGGAAGG
GAAAGTCAGC
ATTACCGGAA
ATTGATGTCC
CAAkTATACGG
ACCAAACATG
GTTTTAGCGG
ACTGGAACTG
GCAAGTGGCA
ACAGAAAAAG
ACCACATTTA
ACTGTTTICTT
CCAAACCGAA
ACCGATGCTG
AGT~AC=G
ACGCCAGCCG
AATACAGGCG
AACACAGGTr
TCACTAGGCA
AGTTrATGGCA
CAGGA.ACAAC
AGATTCACAA
TTACAATCAC
AAGATG-CTrGC
CACCAGTCGC
TAGTAGGAGC
TTGGCATGCGC
CCTCAGCTAA
CAGCTACAGC
ATTCTACTAA
GTGACGCAGA
TGACTCTACC
GCGCAGGTAA
CGCCAACTAT
CTGATCCAA
ATACAACTAC
AAACGTTAAC
TGACGCCAGC
CAGTTTTAGG
TCATTCAAAT
AAGGCAACTT
CGACAGCTAC
ACCCAACGAA
CCAATACAGG
CAAATCCAAA
ATGGCCTCGG
CTGGAAATCA
TGCCACTGGG
TGAAGCAGGT
TCTTCCGACG
TGGAAAGAA
GACACCTACT
AGCAGAAGT'
AACTACTGGA
CCAAACAATA
AACTACACCA
AGGTTACGAA
CGGAACCATC
AGCTGGCGTA
TGAAAGTCAA
TACGAAGGTT
TGT--ACGGTT
TGGAGGTACC
CGCACTAACC
CGATATTACG
AACGTCTAAA
TAGTAACCGA
CGCTATCCAA
CGATGGCGCA
TCCTGGAGGA
AAACAATGGC
TGGAGGTTCT
TACAAATGGT
CGGCAAAACA
EF70-4 (SEQ ID NO:268) DGOGNE SQPTEVTVPE DATVAAP'IVT 'IVTGTTATGY TATANEALTA IAKDAAGKES TTVEVRDADG TVLGMATTGT DLATPTIDSI TGNSSKGYEI TPGETITIIS KDGAGNESQP FYNSSEQLLA SGNT'rGGTF EPEIKIAAPT VSSVLGTSKA TAGQATAQQS LLATATDGAG TGGNNGNGSN TGSNPNGGSG
QVTGTAEPNV
NPTAFKTPAD
DGKYTVTLEP
TGTAEPKTTI
ATAVI PADVV SVH IAACLAT
GYLIKGTAEP
HYSTATTFMT
LGTI'GSGLCS
TIEIHNEAGL
PDAPVATPIV
GKASANETIT
DVRDADGTI I
LAAPTITKVE
EXETLTALTT
NRIIQISNRL
PADPTNPGGG
LGNGLOTNGS
VIATGTTDGA
DKITGSITNG
VVAXNATGKE
AATTANETGQ
GNKANGYIVT
DTQGNVSPKT
LRSVIAVGAT
NGNTGGNNIGN
GYHPKLSTIS
GAFTITLPTG
YQVVGAAEVG
SQ PATATTPV YTrVTLPAGVV
GTADPNVTVQ
TFMTPADI TO DAErGNFAIQL
TGGNTGNNGA
YGTGNNGKTG
EF071-1 (SEQ ID NO:269)
TAAGTAGAAG
GTCCCGCCAT
ATGGCTTTTT
GTGATTGGTT
CCAATCAACT
TCTGGAAACG
CGAAATTGGT
TGGTCGGGAC
TTATCTGCAG
TAAGAAkAGGA
TAAGTI'TAAC
TTACTTATTT
AGCGGAACCT
CAAATGCTTA
AAACGTAGAA
G'IITAAGCCG
GCATGCTATG
GA'FrCCGATG
TCCCGGCTCT
AGGACCACAC
TGTCTCATAT
CTTTCGCTGA
TGGMAGGGAA
TTTAAAAAAT
ACGGCTTNCG
GCAAGCAATG
AG=TAGACA
TTT'GTCGGAT
TI'GCCGAAGA
CTTATITGA
TA.ATGATITCA
CTTACACCAT
AATTAAITT
ATGAAGTGGC
CTGGI'GGACG
AATI'ACTTCT
cTTrCCTrTC
ACTTGCTTTA
CGAAGCGGAT
TTTACATGAA
CTATATGAAA
AGTGAAACAA
162 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.fiaecalis Genes.
TTAGCTCCTG
GCGCAAATCG
TATGTTAA'Fr
GGAACAGT
ACAGATCCTT
ACAACGCGCG
CTGGCCAAAT
AATTGGCTCG
TGGCCCGTGA
ATGGCATAGT
ATGGTTAT-T
TTTlCNGNAAC
TCAATATGGC
AACGAATAAT
TTTGGCTCAG
CACTCATGAT
AGCGCGTGGG
AGGTGAGACT
GCAGGTTCTT
GCGGCGACAT
AACATTGGTG
TGGATTACAA
GGATTAGTAA
GGTCATTATT
TAGCTAATCA
TTAAAAAAGA
CTGATTTTTC
AAAATCIGTG
AGCGCATMG
CAGCCAGGTA
AAAAGCCTAT
T'ATGCTGCC
TCTGGACGAT
GGGAGATCAT
CACNAGATTI'
EF071-2 (SEQ ID NO:270) MF KKLMIQLALV IGLSLTI PMT AXAYTIEADP INFTYFPGSA SNEL IVLHES GNERNLGPHS LDNEVAY~'l( NWSNAYVSYF VGSGGRVKQL APAGQ IQYGA GSLANQKAYA Q IELARTNNA ATFK KDYAAY VNLJARDLAQN IGADFSLDDG TGYGIVTHDW ITKNWWGDHT DPYGYIJARGG LVKRIGTRFT TGVSXTGETG HYSAR EF071-3 (SEQ ID NO:271) G TTTAAAAAAT TAATGATTCA ACTTGCCT A GTGATTGGT'r
CCAATCAACT
TCTGGAAACG
CGAAATTGGT
TTAGCTCCTG
GCGCAAATCG
TATGTTAATT
GGAACAGGTT
ACAGATCCTT
ACAACGGrCCG
TAAGTTTAAC
TTACTTATTT
AGCGGAACCT
CAAATGCTTA
CTGGCCAAAT
AATTGGCTCG
TGGCCCGTGA
ATGGCATAGT
ATGGTTATTI'
TTITCNGNAAC
GATTCCGATG
TCCCGGCTCT
AGGACCACAC
TGTCTCATAT
TCAATATGGC
AACGAATAAT
TTTGGCTCAG
CACTCATGAT
AGCGCGTGGG
AGGTGAGACT
ACGGCITNCG
GCAAGCAATG
AGTTTAGACA
TTTGTCGGAT
GCAGGTTT
GCGGCGACAT
AACATTGGTG
TGGATT1ACAA
GGATTAGTAA
GGTCA ITAT
CTTACACCAT
AATTAATTGT
ATGAAGTGGC
CTGGTGGACG
TAGCTAATCA
TTAAAAAAGA
CTCGATTTTC
AAAATTGGTG
AGCGCATTGG
CAG-CCAGGT
CGAAGCGGAT
T''IACATGAA
CTATATGAAA
AGTSAAACAA
AAAAGCCTAT
TTATGCTGCC
TCTGGACGAT
GGGAGATCAT
CACNAGATFI
EF071-4 (SEQ ID NO:272) F KKLMIQLALV IGLSLTIPMT AXAYTIE-ADP INFTYFPGSA SNELIVLHES GNERNLGPHS LDNEVAYMKR NWSNAYVSYF VGSGGRVKQL APACQIQYGA GSLANQKAYA QIELARTNNA ATFKXDYAAY VNLARDLAQN IGADFSLDDG TGYGIVTHDW ITKNWGDHT DPYGYLARGG LVKRIGTRFT TGVSXTGETG HYSAR EF072-1 (SEQ ID NO:273)
TAATCAATGA
T=~CACAGC
TITATTGTTCC
CTGCTTCAAA
CCGTTTTATC
GAAACCGGTG
GAAGATGGAG
TATTTATTTG
GTGAT'I~TGC
AAAAATGAAG
CAAGGCTA
ATTTTIAGGAT
CCAGAATCAA
ACGCAAAAC
AAAACGCACG
AAGCATTAGC
CTGATGGTCA
ATTATCGGGG
AGCTTCGTTC
CAACAAATAG
TGGTTTCTT
TTGAAGCGGA
CTGTTCAAGA
AAAATGCCTA
ATCAAGGAGA
ATCAGGAATI'
TTGAGGTAAA
ATGGAFI'TAC
FI'GGTTAAGT
AGAGGCATCG
ATTACCAGAA
CTTAAATGAC
TGAAGGAAAA
AAAACCGATC
TrCA'ITAGCT
AGCACCAGAA
TCCACAAGGG
TGACTTACCA
GCACATTAAC
CCGTTGTCA
AGTGGCTGGA
GCTI'GATTT
ATT-TGCGTCA
CAAGCAAGCG
CAGCAGCAAA
GTCACTTATC
ACGGTCCAAG
GCAGAAGATA
AGCAAAGATT
GTGGTAAAGG
CAATCGTTAA
CCACTTGAAA
TATCAGTTAA
GATAAGGCGG
AAAACAGTTA
TCAATTAAAG
TGCTACTCGC
TTCAAGTTAC
ACACAGGGGA
AAGTCTATGA
AGGCACAGCG
AAACACAGAC
CGCAGCAACG
AAAA.AGCTAG
CGCATATTCA
AAACGGTACT
CGACTCAGAT
ATACAACGTT
CTACAGGTA
AC ETACAAAA
TCTTTTCGGG
GTTGCACAAA
AGAGGGAACG'
TGTGACGGAT
TCAATTAGCA
AATAAATGGA
AGATAPLAGCC
CAACCTAGTA
LTTATATCCA
CGATAAGCAA
TCCAGCGAAT
GACAC ITTTA
CACACTGACG
CTTTGCAAAT
163 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes.
CAAACAATGA
ATTAACAACG
CGTACAGGCG
GAGGCTGTTT
TATCGTTGGC
I=TCAGTTA
AGG PTATCT
GAACGGACAA
CTGTGTCGTA
A.AGGACAATT
GCAAGTCTTT
TIATCGTCA-A
AAA.AACAAAA
AAGGNNTTAA
CTGAATCAAA
CGAACAGCAC
TCAAATGCGT
AGTCACGGAC
TGTCAAAGTT
AAATCAAGCG
AGCATTAGCT
AAGATGGCCA
CAGAAATTCC
CGTTACATGT
TTAGAAAAGA
AAACATACCT
GATAGTGAAA
GGGGAATACC
AAAAAATTCA
GTACTTCTTG
r 117 1ACGGTG
AATCAATAA
CCGC-TCAACC
TGACTAAAAG
ATGCGAAAAT
TCAATGAAAC
CGTCTAATCA
GAAGA.ATCT
GGAAAAAATT
TGACACTGCG
AGCCACAC-TT
CACCTTGCCA
AGCAAAC'SGG
AGCCOCTCAA
CTGCACCAAA
CTTATGCAAC
EF072-2 (SEQ ID NO:274) MKNARWLSI CVMLLALFGF SQQALAEASQ ASVQVTILHKL LFPCGQLPEQ -QQNTGEEGTL
LQNYRGLNDV
DGVVS FSLAS
NEENAYDLPP
Es -EVKVAG(
NNEGQLVTDK
RWQKEKALA(
TYQVYDVTDP
KDSQQRDKAY
LEKTVLDKQQ
TVTTGYTLTT
HTLTKRA'IVR
KFTSNQAGEF
FYQLRSEGKT
LFVE-AEAPEV
GFNQGEH INY
QKHGFTLDFS
TGGKSF'VKVD
SVKGXKRWPV
VQEAQRQLAE TGATNRk?IA VKEKASNLVV ILPVQDPQGQ QLTTQIPANI LGYQEFRLSD IKDLQNFANQ TMTVSYQMRE.
SENAKITLPE AVFIVKNQAG LLGRNLCTKR LSSESNRNSF EDKTQT INGE
SLTHIHLYPK
KADTTLTLLP
ERTAEPDTAI
EYLNETANGY
YGGKKFLCNE
RTTNSTVTCN Q EF072-3 (SEQ ID NO:275)
ATTACCAGAA
CTGCTTCAAA
CCGTTTI'ATC
GAAACCGGTG
GAAGATGGAG
TATTTATrrG
OTGATTTTGC
AAAAATCAAG
CAAGGCTTTA
ATTAGGAT
CCAGAATCAA
ACGCAAAAGC
CAAACAATGA
ATTAACAACG
CGTACAGGCG
GAGGCTGTTT
TATCGTTGGC
TTTTCAGTTA
AGGTTATCTT
GAACGGACAA
CAGCAGCAAA
ATI'ATCGGGG
AGCTITCGTTC
CAACAAATAG
TGGTTTCTFI
TTGAAGCGGA
CTGTTCAAGA
AAAATv.CCTA
ATCAAGGAGA
ATCAGGAATI'
TTGAGGTAAA
ATGGATTAC
CTGTGTCGTA
AAGGACAAT'
GCAAGTCT
TI'ATCGTCAA
AAAAAGAAAA
AAGGNNTTAA
CTGAATCAAA
CGAACAGCAC
ACACAGGGGA
CTTAAATGAC
TGAAGGAAAA
AAA.ACCGATC
TTCATTAGCT
AGCACCAGAA
TCCACAAGGG
TGACTTACCA
GCACATTAAC
CCGTTTGTCA
AGTGGCTIGGA
GCTTGATTT
TCAAATGCGT
AGTCACGGAC
TGTCAAAGT'
AAATCAAGCG
AGCATTAGCT
AAGATGGCCA
CAGAAAT'CC
CGTACATGT
AGAGGAACG
GTCACTTATC AAGTCTATGA ACGGTCCAAG AGGCACAGCG GCAGAAGATA AAACACAGAC AGCAAAGATT CGCAGCAACG GTGGTAAAGG AAAA.AGCTAG CAATCGTTAA CGCATATTCA CCACTTGAAA AA.ACGGTACT TATCAGTAA CGACTCAGAT GATAAGGCGG ATACAACGTT AAAACAGTTA CTACAGCCTA TCAATTA.AAG ACTTACAAAA TTAGAAAAGA CCGCTGAACC AAACATACCT TGACTAAAAG GATAGTGAAA ATGCGAAAAT GGGGAATACC TCAATGAAAC AAAAAATTCA CGTCTAATCA GTACITCTTG GAAGAAATCT TITTACGGTG GGAAAAAATT
A
TGTGACGGAT
TCAATTAGCA
AATAA.ATGGA
AGATAAACCC
CAACCTAGTA
T'rATATCCA
CGATAAGCAA
TCCAGCGAAT
GACACTTTTA
CACACTGACG
C71TGCAAAT
TGACACTGCG
AGCCACAGTI
CACCTTGCCA
AGCAAACGGG
AGCCGGTGAA
CTGCACCAAA
C'IATGCAAC
EF072-4 (SEQ ID NO:276) QLPEQ QQNTGEEGTL LQNYRGLNDV TYQVYDVTDP DCVVSFSLAS KDSQQRDYAY NEENAYDLPP LEKTVLDKQQ ESIEVKVAGX TV'ITGYTLTT NNEGQLVTDK HTLTKRANVR RWQKEKALAK KFTSNQACEF
RTTNSTVTC
FYQLRSEGKT
LFVEAkEAPEV GFNQGEH1INY
QKHGFTLDFS
TGGKSFVKVD
SVKGXKRWPV
VQEAQRQLAE
VKEKASNLVV
QLTTQIPANI
IKDLQNFANQ
SENAKITLPE
LLGRNLCTKR
TGATNRKP IA
ILPVQDPQGQ
LGYQEFRLSD
TMIVSYQMRL
AVFItVINQAG
LSSZSNRNSF
EDKTQT IN~GE
SLTHIHLYPK
KADTTLTLLP
EKTAEPDTAI
EYLNETANGY
YGGKKFLCNE
164 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes.
EF073-1 (SEQ ID NO:277)
TAAATGAACA
ATTGCTGTCG
TTCGTTCTCT
ACGAAAGATA
TTCAGAAGTA
CGTTCAACTA
CTGTTTAAAG
ACACATTTAC
CTAAAAAACA
TCAAAATI'AG
GTCGAAATGT
CAAGTAAACG
AATTAAATAC
CTAGAGAATA
ATCCAAAAAA
AAGAAGCCCA
CAATGAGCAC
AATTACGGC
AGTTAGTGAA
CAAATATCGT
AACA.AACGTA
TTAAAAATGA
CGATCGCTAA
AAGACCAGCA
AAAATTACTG
TGGCTTCTTC
GA.AAAATGCC
TTATCGTGAG
AGCCAAAAAA
GATTCACTTA
AGAACCTAAA
TGACTTAACA
TGAA.AAATTA
TTATGAGGAA
AAGCAGCTIG
ATAA
ATTGGCTATA
GCTTTTGTGA
GCCGACAAAA
TTGGGGTTAT
CAAATCATAC
CGTAATGATA
AAGTTACACT
AGTAAACATr
CA.AGAAACC
ATCGTTTCCG
TCGCAAAAAG
TTCTTTTAGG
TTCTGGTAGG
GCCATCAAAT
CTCCACAAGA
AATTGCAAGA
CTACGAAGGT
TAGCCAATCA
TAGAAATCGA
CACAAATCAT
ATGACTTAGA
CTGCAA CTGA
AGCCTTAATC
CTTTTTAGTA
GCCTTAC ETA AATTGATrT
AAACATGAAT
TTCTAAAGCT
CT'ITCTCTAT
ACAACACGAA
TGACCAATTG
CGATTTAGAT
GGAATCACCT
EF073-2 (SEQ ID NO:278)
S.
S
S
S
S
S.
S
S.
MKLNTKLLI GYILLGALI I KDKEAHYREL GLSPQEIDFF FKELVKEPKK LHLANHFLYT KLVKNDYEEI VSDDLDDLDV
AVAREYGFFA
RSTMSTAKKQ
HLPNIVDLTS
E-MS IAKSSLS
FVILVGFLVF
I IQLQENNNR
KHLEIEQHEV
QKAATEESPQ
VLYRKK<KNA.A
STKLRAIDLR
KN\KQTYEKLE
VNEIDQQ
DKSDQMPYLT
NDTTKVSKAL
ESAQI IDQLS EF073-3 (SEQ ID NO:279) CT ATCGAAAAAA GAAAAATGCC GCCGACAAAA GCGATCAA.AT GCCTTACTTA
ACGAAAGATA
TTCAGAAGTA
CGTTCAACTA
CTGTTTAAAG
ACACATTTAC
GTAAAAAACA
TCA.AAATTAG
CTCGAAATGT
AAGAAGCCCA
CAATCAGCAC
AA ITACGGGC
AG'ITAGTGAA
CAAATATCGT
AACAAACGTA
ITAAAAATGA
CGATCGCTAA
TTATCGTGAG
AGCCAAAAAA
GATI'GAC 1TA
AGAACCTAAA
TGACTTAACA
TGAAAAATTA
TTATGAGGAA
AAGCAGCTTG
TTGGGGTTAT
CAAATCATAC
CGTAATrGATA
AAGTTACACT
AGTAAACATT
GAAGAAAGCG
ATCGTTTCCG
TCGCAAAAAG
CTCCACAAGA
AATI'CCAAGA
CTACGAAGGT
TAGCCAATCA
TAGAAATCGA
CACAAATCAT
ATGACTTAGA
CTGCAACTGA
AA'FGTTT
AAACATGAAT
TTCTAAAGCT
CTTTCTCTAT
ACAACACGAA
TGACCAATTG
CGATTTAGAT
CGAATCACCT
CAAGTAAACG AAGACCAGCA AT EP073-4 (SEQ ID NO:280) YRKKKNAA DKSDQMPYLT KDKEA}{YREL GLSPQEIDFF FKELVKEPKK LHL.ANHFLYT KLVKNDYEEI VSDDLDDLDV
RSTMSTAKKQ
HLPNIVDLTS
EMS IAKSSLS I IQLQENMNR
KHLEIEQHEV
QKAATEESPQ
STKLRAIDLR
KNKQTYEKLE
VN~EDQQ
NDTTKVSKAL
ESAQI IDQLS EF0'74-1 (SEQ ID NO:281)
TAAAGGAGTT
ACCGTTGCAG
ATGGTAGATA
GGACGCGATG
GCCTACAATCG
TTCAAGCCTT
CAAGGTCGCG
GGCGATGAAC
GT'TAGACA
CTCAAAAAAT
TrcTTGCTG
TCTCTGGCAA
G'ITACAAACA
TCGTACCGGT
ATAACCAAAC
CAGT'ITTA T
AAGCCTTTGC
TCGACTTAGA
GAAGCTAAAA
GTTAACGCCT
AAAAGTGTTG
AGGAACATCA
TTCCTTCATG
CGACACTGCC
GGCACTTGGT
GAATGAAATC
GCA.ATTGGCG
AAAATAATTC
ACTCAACT
G=GCATAT
GCATCACTAA
AAAAGCGATC
TI'CCGACAAG
GGAGCAGATG
ATTCCTCAAG
ATTACTGCTG
cTGCTTrTCC
CTGCAGAMC
GGCATAACTG
ACCTTCAGA
GCACGACACG
AACTCGCACA
CACATATTCA
TGGAAMCATA
GCGACAACCA
CCTTCTTTCA
TGCGGATACG
GGCCTCAAAA
AGTAAATCAA
GA'ITCCTACG
ATTAAATAGT
ATTAGTCAAA
CGGCI'TGAT
AACCGTCATC
165 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes.
CCTGCTACGT
ACGATGGCAC
TCCCTAAATrC
GTCTGGGTTG
TTCCTCTATN
TGAAAATAGT CAAAGACCAT TATCGAGCAC AAGGAAAAAA TTTCATCATT CAGAATTCCC T'rA'rTAAAA CCTGGTGCCG CTTATGAAAC ATACATTACT GTTATTATGA TTACATTCC CCACAATTAT ATAACCAAGG CCGCCACGGT ATGAAGTTAT GACTFrGGGTT GCTCAAAGCA ACGATGCTCT AAAATACGAG
ATATI'
EF074-2 (SEQ. ID NO:282) MXLK( IIPAFPLLST VAVGLWLTPT QASADAADTM VDISGKKVLV GYWHNWASKG RDGYKQGTSA SLNLSZ-VNQA YNWVPVSFM< SOGTTRIPTF KPYNQTDTAF RQEVAQLNSQ GRAVLLALGG ADAHIQLVKG DEQAFANEII RQV? TYPFDG LOIDLEQL.AI TAGDNQTVIP ATLKIVKDHY RAQGKNFIIT MAPEFPYLKP GAAYETYITS LNGYYDYIAP QLYNQGGDGV WVDEVMTWVA QSNDALKYEF LYXI EF074-3 (SEQ ID NO:283) TGC TGCGCATACG ATGGTAGATA TCTCTGGCAA AAAAGTGTTG GGACGCGATG GTTACAAACA AGGAACATCA
GCCTACAATG
TTCAAGCCTT
CAAGGTCGCG
GGCGATGAAC
GGTTTAGACA
CCTGCTACGT
ACGATGGCAC
TCCCTAAATG
GTCTGGGTTG
TTCCTCT
TCGTACCGGT
ATAACCAAAC
CAGITI'TATT
AAGCCTTTGC
TCGACTI'AGA
TGAAAATAGT
CAGAATTCCC
GTTATTATGA
ATGAAGTTAT
TTCCTTCATG
GGACACTGCC
GCCACTTGGT
GAATGAAATC
GCAATTGGCG
CAAAGACCAT
TTATAAAA
TTACATTGCC
GACTTGGGTT
GTTGGATATT
GCATCACTAA
AAAAGCGATG
TTCCG3ACAAG
CQACCAGATC
ATTCGTCAAG
ATTIACTGCTG3
TATCGCCAC
CC TGGTG CCG
CCACAATTAT
GCTCAAAGCA
GGCATAACTG
ACCTTTCAGA
GCACGACACG
AAGTCGCACA
CACATATTCA
TGGAAACATA
GCGACAACC A
AAGGAAAAAA
CTTATGAAAC
ATAACCAAGG
ACGATGCTCT
GGCCTCAAAA
AGTAAATCAA
GATTCCTACG
A'ITAAATAGT
ATTAGTCAAA
CGGCTTTGAT
AACCGTCATC
TTTCATCATT
ATACATTACT
CGGCGACGGT
AAAATACGAG
EF074-4 (SEQ ID NO:284) AADTM VDISGKKVLV GYWHNWASKG RDGYKQGTSA SLNLSEVNQA YNVVPVSFMK SDGTTRIPrF KPYNQTDTAF GRAVLLALGG ADAHIQLVKG DEQAF'ANEII RQVETYGFDC LDIDLEQLAI ATLKIVKDHY RAQGKNFIIT MAPEFPYLKP CAAYETYITS LNGYYDYIAP WVDEVMWVA QSNDALKYEF LY
RQEVAQLINSQ
TAG DNQTVI P
QLYNQGGDGV
EF075-1 (SEQ ID NO:285) TAACCTATAA GAAAAAAATC GGGAAGAAAA ITT~lGC CAT GGAATGGGAG CAAAACTTTA CTAGAACGAT CTAAAAAAAC TTATI'AGGGA 'ITGATACAGG ATTGTTGCAA CAOTTAATCC ACCTATGTTG ATATrCCAGG GGTGGCGCAT CTTrAGCAAT TATGTTTCAA ITAATATGGC GTGAACAATA ATCTGACTTT TTGGATGGTG AACAAGCACT TACGGCCGCC AAGAACGTCA CTTAACAC- TAAGCAACTA GATTTAAGTT TTGATGACAT
ACAACCTGTG
TATCNTTGGA
TTGGGATGTT
TCAGGTCAAT
CGATGATGGG
TCGTGACAAG
TCAAGGAAAA
GGACACAGTT
TGGTTTAAAA
TTCTCAAGAC
CTCCTATTCA
AAGAAAAGTG
TCAAGAAATT
GAAAAAAATT
ATAAATTATT
ATTATCTTGG
TCTAAATC-AA
TTAAACAATA
CGTGTCGACC
CAAACAACCT
CAAGATAAAT
GAAAACTATT
GAATTAGTCA
GGATATGATT
AGAATGCGTT
ArrGAAGGCA
TTAACAGCTG
CC TAGATT
GGAGGNAAAA
NTCTATTTCT
TGGATAAAAC
AGGAGCCTTT
AAGGTCGTTC
TAGTCAGTCT
TGAATCACGC
TAAACATACC
ACGCGGTTGG
TTACGATTGG
ACGA.AGACCC
TCGTCCAAAA
TTTCTGATAA
ATCGCAGTGC
TATGTCAPAA
TCAGTTGTT
CTATGAAACA
TrcTGITI:TA
GGATACAACA
TGCTCGCGAT
CTATGCTTTT
TATTAATCAT
CGGAATCGAA
TAAAATCA
TAATGGTGAC
AGTCTTAAGT
TATGAAGACA
CTTTCGTAAA
166 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.juecalis Genes.
GTGAAACAAG ACCAACTTCA AGGTACTGGT ?TTATGCAAG ATGGTGTTC CT-ATCAACGT GTGGATGAAC AGAAI'AAC TCGTGTCCAA CAAGAG LTGA AAA.ATCAATT GAATACAAAA
TAA
EF075-2 (SEQ ID NO:286) MSKG KKIFAIIXGI ILXLFLAVVG MGAKLYWDVS KSMDKTYETV ERSKKSQVNL NNKEPFSVLL LG IDTGCDDGR VEQGRSDTI'I VATVNPRDKQ YVD IPGQGKQ DKLN-IAYAPG GAS tAMDTVE NYLNIP INHY VS INMAGLKE NNNLTFSQDG YDFTIGKISL DGEQALSYSR MRYEDPNGDY GRQERQRKVI NSVSNYQEIL TiAVSDNMKTD LSFDDMXKIA LDYRSAFGKV KQDQLQGTGF DEQELTRVQQ ELKNQLNTK EF075-3 (SEQ ID NO:287) ACTTTA TTGGGATGTT TCTAAATCAA TGGATAAAAC CTATGAAACA
TTLVSLARDT
LVNAVGG 1EV
EGIVQKVLSL
MQDGVSYQRV
A.
S S *5 9*
S.
S a..
S
a a.
GTAGAACGAT
T"TATAGGGA
ATTGTI'GCAA
ACCTATGTTG
GGTGGCGCAT
TATGTTTAA
GTGA.ACAATA
TTGGATGGTG
TACGGCCGCC
CTTAACAGCG
GATTI'AAGTI'
GTGAAACAAG
GTGGATGAAC
CTAAAAAAAG
ITGATACAGG
CAGTTAALTCC
ATATTCCAGG
CrI'AGCAAT
TTAATATGGC
ATCTGACTT
AACAAGCACT
AAGAACGTCA
TAAGCAACTA
TTGATGACAT
ACCAACTTCA
AAGAATTAAC
TCACGTCAAT
CGATGATGGG
TCGTGACAAG
TCAAGGAAAA
GGACACAGTT
TGGTI'TAAAA
TTCTCAAGAC
CTCCTATTCA
AAGAAAAGTG
TCAAGAAAT
GAAAAAAATT
AGG.TACTGGT
TCGTGTCCAA
TTAA.ACAATA
CGTGTCGAGC
CAAACAACCT
CAAGATAAAT
GAAAACTATT
GAATTAGTCA
GGATATGATT
AGAATGCGTr
ATTGAAGGCA
TTALACAGCTG
GccTTAGArr
TTTATGCAAG
CAAGAGFI'GA
AGGAGCCTTT
AAGGTCGTTC
TAGTC:AGTCT
TGAATCACGC
TAAACATACC
ACGCGGTTGG
TTACGAMG
ACGA-AGACCC
TCGTCCAAALA
=IICTGATAA
ATCCCAGTGC
ATGGTG=IC
AAAkATCAATT
TTCTGTTTTA
GGATACAACA
TGCTCGCGAT
CTATGCTTTT'
TATTAATCAT
CGGAATCGAA
TAAAATTTCA
TAATGGTGAC
AGTCTTAAGT
TATGAAGACA
CTI'GGTAAA
CTATCAACGT
GAATACAAAA
EFO075-4 (SEQ ID NO:288) KLY'.WDVS KSMDKTYE'IV ERSKKSQVNL NNKEPFSVLL LGIDTGDDGR YVDIPGQGKQ DKLNHAYAFG GASLAMDTVE NWNLTFSQDG YDFTIGKISL DGEQALSYSR NSVSNYQEIL TAVSDNMKTD LSFDDMKKIA DEQ)ELTRVQQ EL.KNQLNTK
VEQGRSDTTI
NYLNIPINHY
MRY'EDPNGDY
LDYRSAFGKV
VATVNPRDKQ
VS INMAGLKE
GRQERQRKVI
KQDQLQGTGF
TTLVSLARDT
LVNAVGGIEV
EGIVQKVLSL
MQDGVSYQRV
EF076-1 (SEQ ID NO:289) TAGAAAATAA CAGAGGAGCT AGCATI'GCTG CAGTITGCAAG AAGGTATCTC ATGTTTCCAA GGAAACCAAA AATTATTATC TTAAATGTTG TGGATCGTGT GTTAAAGACA ATACAGATTC AAGTTAAAAA AGTGGCCTAG
TAA
GAAGGAAATG
TGTCTCTCTI'
TCGTTATAAA
GAITGTCGAT
GAAACATGGC
TTTAAAAGAA
GCCATCT TI
AAAGCATCAA
GCAGTCATCC
GTTAAA;AGT
CATTTATCCG
GGTCAAAAT
CGCT~rTTCA
T'ITATAAAA
CAA1AAATTGG
CTTCTGAAAA
?rGTAGACGA
ATGATGAATT
TAGCTGAATA
CATTTATTGA
ATAATTCTTT
TATCGGTTTA
AATTATTAAG
TAAATTTGAT
AGATI'CTGTT
TGGCGAAAAA
AGATGCAATG
TTCAACA
TABLE 1. Nucleotide and Amino Acid Seqeuences: of E. faecalis Genes.
EF076-2 (SEQ ID NO:290) MK( ASTKIGIGLS IAAVASVSVA VIASEKIIKK VSHVSNRYXV KKFVDDKFOG NQKLLSTVDD LSDDELDSVL NVVD)RVKDGG; SKLAEYGEKV KDNTDSLKER FFTFIEDA,fl LXKWPRPSFF YKNNSFVST EF076-3 (SEQ ID NO:291) CATCG CTTCTGAAAA AAITATTAAG AAGGTATJCTC ATG ITCCAA TCGTTATAAA GTTAAAAAGT GGAAACCAAA AATTATTATC GATTGTCGAT-GAETTATCCG T'TGTAGACGA TAAATTTGAT ATGATGAATT AGATTCTGTT
'ITAAATC'ITG
GTTAAAGACA
AAGTTAAAAA
TGGATCGTGT
ATACAGATTC
AGTGGCCTAG
GAAAGATGGC
TTAAAAGAA
GCCATCTTT
GCI'CAAAAT
CGCTFTTCA
TTTTATAAAA
TAGCTCA-ATA
CAITTATTGA
ATAATTCTT
TGGCGAAAAA
AGATGCAATG
EF076-4 (SEQ ID NO:292) VIASEKIIKK VSHVSNRYKV KKFVDDKFDG NQKLLSIVDD LSODELDSVL NVVDRVKDGG LKKWPRPSFF YKNNS SKLAEYGEKV KDNTDSLKER FFTFIEDAMfl 9 EF077-1 (SEQ ID NO:293)
TAAT-GTAAAG
ACAATTATAA
GCGTATGGCA
ATTICAAACCT
GCAACCTTAG
GGTGATTCAT
GATAACTCGC
GAGGAAATCA
GGCTTGGTAA
CCAATTGAAA
TTGAAAATG
GTGAAAGAAT
CCTTTTACAC
ACACGTTTTG
ATTGCTTTAG
ACGATGGTCG
ACGCAAGGAC
GAATTAGTGG
ATTGTTGCTT
GTTTCTGGTG
AAT'TrCTGA
TCACGTAATG
AAAGAGACTA
GATCAAGAAT
GAATGITTAC
GTCATCATGG
ATTCCTATGG
AAAGATGACT
GCCAAACAAT
ACCGGGATCA
ATCT"TATCTG
.4*
S*
*9
TGAATGATGG
CAGGAGTTTT
TTATTTTAAT
TACGTGAAGG
CTGTGGGAGA
TAGAAGACTA
CACAAAAAGC
ATGITGGCGA
AAACCGGCAC
AAAATCCTGG
TTCCTGAAAA
CTCCGGCGCG
TAGTTGCCTA
CGGAACTCTI'
TGGCAGGGAT
AAAAATTAGC
AACTTTCTGT
GAI'CCCAGC
ATGCCAGAAA
CTGGCGTCAA
CACAAGAGTC
GCACATATTT
TGGAAAAATTI
CCGTTGCAGA
CACAAGATAA
TAGGAGATGG
CCTCATGG
TAAGTAAAGT
C GTATTAAT
TTCCGGCGCT
CTTTGCGTGC
GAGAGAAAAA
GGCATTATTA
AACAGGTTCT
AAAATATGGT
ATACTCCGCC
TGCCGCTGGA
TCATCGCTTG
TGAATTAGTA
ATCAACAGTC
GGATGAATTA
AACTGTAGCA
TCCAGCTCAT
CCTAATTGCA
AGTTGTTGCT
GGGTCGTTCA
TrCTGCAAAA
TGATCAAGTC
AAGCCTCGAA
GCAAGATGTC
GGCATFI'GTG
TCAAGAAACT
AGGCCGAA'T
ACACCAASTA
AACCATTIGCT
ATTAACTATT
TGTAAATGAT
AGCTACTGCG
CAGCCAACC
CGGAATT'TrT
AATCGCGGCT
TCGTCGAATT
GAGATGAAGC
TTITGAATrTA
GTAATGGCGT
GTCGATATTT
AC ETTGAT GA
AAAGCTAACC
AATGGCGAAA
GTAAAACCAG
GAGA-ATCTT
ATGTCGGGTT
GACAGTCAAT
TTTGTACGTI
GGTGTTGCTT
TCGCCGTGTC
AGTCGTCATG
ACGATTGCGT
CAACCAATCA
CAAGA.ATCAA
CCATTAAAA
GATGGTGCTG
GAAAAAATTG
AC TI=ACAG
CATCTTCAAC
GCAGAAGTAG
CTAA.AAGAAT
GCACCTTCGC
GCTAGTGAAA
GTCC4AATTG ATGTAACAAA AITGGGGATT 'TTTrACATCA CCCGAAITCG TAATGATGTT CTGGGAAATG TAGCGAT1'AC CGCTATCG7T 'TTrTAATTAT GTTGACTGGT AAGAGCTGA-A GTCATTATT1G ATTTAGAAGA TGTTTCTGTI' GGGAACTAGT TCCAGTTGAT CATTAACAGG AGAATCAAAA CCGTGAATGG TGACGGCTCT ATCAAACAAT TGTGAACTTA TAGCAGATCG CTATGCGGTA GGTTGTTTC AAAAAGTCCC C PTAATTCT ATCTGCCCCA GGGTCGTTAT TAAATCGGGA ITATAAA.AC AGGCACGATT ATGCTGGAAT AACTGCTGCT CTCATATI=TT AGCTAGATCA ATATACAGA TCTACCGGAA AGATACGGGT AGGTAAAAAG ATAAkAACGAC TATTCATATI' ACACTGTACG CCCAGAAGCA GAAT=IIAAT GCTGACGGGG GAATTACCGA AGTACATGGG TGCCTAAAGA AAATCATCCA TTGCTGCTGC AGACGTAGGT CTGCTGACGT TGTTAT'ITTA CCCAACATAC CATGAAAATT ATCTdcGTTT TAcTAA 1,,Ir ATGCTACAAG AAGTCGTGGA
GGCCAGTAA
IITGCTAGT
CACTGTGTCA
168 TABLE 1. Nucleotidc and Amino Acid Seqeuences of E. faecalis Genes.
EF077-2 (SEQ ID NO:294) MKHVTKLG IT
QTLREGKYGV
NSPQKAHRLN
IEKNPGDELM
FTLVAYLIAG
MVEKLASAKT
VAYARKQDVP
RNGTYLGR IT
CLPQDKLTIL
ODLSKVSQAV
LSALRARRIG
I ITGVLALLF
DILAITAIVA
GENLEDVSVE
SGSVNGDGSL
VAWFVSKSPT
IAFDKTCTIT
LKN ITDLAEV FTDTVR PEAK KELPKENH PV
EIAQDTMKIA
Q
EFILHQPNWA YGIILITGSV TLAVGEYWAS LMIL IMLTGG EINVGDELWV KPCELVPVDG KMVAEKTVAD SQYQTIVNLV RFAEVLVVAS PCPLILSAPI QGQLSVDQVQ PINAG ITAAE SGAGVKAFVD GAEIRVGKKN
ETMEKLHQLHLQRILMLTGD
IMVGDGVNDA PSLAAADVGI KQSVLIGIFI CVLLMLIAST MALMFtE4I
DSLEDYAAGI(
LVKTCTSTVD
KESAARPAHF
ALVAGMGRSS
LVGLAASVEQ
FVTQESQETE
QESVAETIAA
AMGAHGATAA
GI IPALIGAM
ANQELKSLLD
ESSLTGESKP
VRLADRYAVP
RHGVVIKSGT
2ESSH ILARS I
KIDKTTIHIS
EVGITEVHGE
SETADVVILK
LQEVJVDTVS I EF077-3 (SEQ ID NO:295) TCA GCCGAATTGG 0*
S
SS
0# 0 0
SO
50 5 50 00 OS S 050 0 5* 5 0 000
GCGTATGGCA
ATTCAAACCT
GCAACCTTAG
GGTGATTCAT
GATAACTCGC
GAGGAAATCA
GGCTTGGTAA
CCAATTGAAA
TTGAAAATGG
GTGAAAGAAT
CCTTTTACAC
ACACGTTTTG
ATTGCTTTAG
ACGATGGTCG
ACGCAAGGAC
GAATTAGTGG
ATTGTTOCTI'
GTTTCTGGTG
AATITI'GTGA
TCACGTAATG
AAAGAGACTA
GATCA.AGAAT
GAATCTITAC
GTCATCATG
ATTGCTATGG
AAAGATGACT
GCCAAACAAT
ACCGGGATCA
ATCTTATCTC
TTATTTTAAT
TACGTGAAGG
CTGTGGGAGA
TAGAAGACTA
CACAAAAAGC
ATGTTGGCGA
AALACCGGGAC
AAkAATCCTGG
TTGCTGAAAA
CTGCGGCCCG
TAGTTGCCTA
CGGAAGTCTT
TGGCAGGGAT
AAAAATTAC
AACTTTCTGT
GATTGGCAGC
ATGCCAGAALA
CTGGCGTGAA
CACAAGAGTC
GCACATATTT
TGGAAAAATT
CCGTTGCAGA
CACAAGATAA
TAGGAGATGG
GTGCTCATGG
TAAGTAAAGT
CTGTXFI'AAT
TTCCGGCGCT
CTTTGCGTGC
AACAGGTTCT GTAATGGCGT AAAATATGGT GTCGATATTT ATACTGGGCC AGTTTGATGA TGCCGCTGGA AAAGCTAACC TCATCGCTTG AATGGCGAAA TGAA'rTAGTA GTAAAACCAG ATCAACACTC GATGAATCTT GGATGAATTA ATGCTCGGGTT AACTGTAGCA GACAGTCAAT TCCAGCTCAT TTrGTACG TT CCTAATTGCA GGTGTTGCTT AGTTGTTGCT TCGCCGTGTC GGGTCGITCA AGTCGTCATG TTCTGCAAAA ACGATTGCGT TGATCAAGTC CAACCAATCA AAGCGTGGAA CAAGAATCAA GCAAGATGTC CCATTAAAAA GGCATI'=GTG GATGGTGCTG TCAAGAAACT GAAAAAATTG AGGCCGAATT ACTTTTACAG ACACCAATTA CATCTTCAAC AACGATTGCT GCAGAAGTAG ATTAACTATT CTAAAAGAAT TGTAAATGAT GCAC-CTI'CC AGCTACTGCG GCTAGTGAAA CAGCCAAGCG GTCGAAATTG CGGAA'TTI' ATCTGCGTTT AATCGGGGCT ATGCTACAAG TCGTCGAA ET GCCC 0000 5000 4 *405
S
40 0
SO
*0
TA.ATGATGTT
TAGCGATTAC
TTTTAATTAT
AAGAGCTGAA
A=TAGAAGA
GGGAACTAGT
CATTAACAGG
CCGTGAATGr-
ATCAAACAAT
TAGCAGATCG
GGTTCTTTC
CTTAATTCT
GGGTCGTTAT
TTGATAAAAC
ATG.CTGGAAT
GTCATATTI'
ATATTACAGA
AGATACGGGT
ATAAAACGAC
ACACTGTACG
GANIT=TAAT
GAATTACCGA
TGCCTAAAGA
TTGCTGCTGC
CTGCTGACGT
CCCAAGATAC
TACTAATGTT
AAGTCGTGGA
CTGGGAAATG
CGCTATCGTT
GTTGACTGGT
GTCATTATTG
TGTTTCTGTT
TCCAGTTGAT
AGAATCAAAA
TGACGGCTCT
TGTGAACTI'A
CTATGCGGTA
AAAAAGTCCG
ATCTGCCCCA
TAAATCGGGA
AGGCACGATT
AACTGCTGCT
AGCTAGATCA
TCTAGCGGAA
AGGTAAAAAG
TAITCATAT
CCCACAAGCA
GCTGACGGGG
AGTACATGGG
AAATCATCCA
AGACGTAGC-T
TGITATTTTA
CATGAAAATT
AATTGCTAGT
CACTGTGTCA
EF0~77-4 (SEQ ID NO:296) QPNWA YGIILITGSV MALMMFWEMI QTLREGKYGV DILAITAIVA TLAVGEYWAS NSPQKAHRLN GENLEDVSVE EINVGDELVV IEKNPGDELM SGSVNGDGSL KMVAEKTVAD LMILIMLTGG DSLEDYAAGK ANQELKSLLD KPGELVPVDG LVKTGTSTVD ESSLTGESKP SQYQTIVNLV KESAARPAHF VRLADRYAVP TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
FTLVAYLIAG
MVEKLASAKT
VAYARKQDVP
RNGTYLGRIT
CLPQDKLTIL
DDLSKVSQAV
LSALRARRIG
VAWFVSKSPT
IAFDKTGTIT
LKNITDLAEV
FTDTVRPEAK
KELPKENH PV EIAQDTMK IA
RFAEVLVVAS
QGQLSVDQVQ
SGAGVKAFVD
ETME(LHQLH
IMVGDGVNDA
KQSVLIGIFI
PCPLILSAPI
PINAG ITAAE GAE IRVGn(N
LQRILMLTGD
PSLAAAIOVCI
CVLLMLIAST
ALVACIAGRSS
LVGLAASVEQ
FVTQESQETE
QESVAETIAA
AMGAEGATAA
GIIPALIGAM
RHGVVIKSGT
ESSHILARSI
K IDKTT I HIS EVG ITEVHGE SETADVV ILK LQEVVDTVS I EF079-1 (SEQ ID NQ:297)
TAATICTAG
CCCAGGCTCT
ATCATTGATG
T'IT'IAGCG
AAAGCAAGCC
AACCAAGAAT
AAAACAACGA
ATTCCAAAAA
AAAGGAAGCT
GTCATTTCAG
AAAAAAGGCG
CAAATAAAAA
CTCGTCACTT
GGACATCGTA
CAACAAAATT
TTCATTATCT
CATCACCGAA
CATGCTTTAT
GTTTTATGAT
ATGCATTAAA
AAGAAAACAC
TAGCGAAAAA
AAAAACCAGA
TAAATGTCCG
CCTTGTTAGA
GCCATCGTGG
ATrGAATTTTA
CCGTTGAACC
TATTAACTTG
TCCCATATCA
TACTATTATG
GGTACAAGCG
GAAATTTTTA
TTIAAGGAG
TC=IFTACTG
TAACTATCTG
CAAAGAAATG
AGGCAGCAAT
CAA.ATCCTAT
TTTACCAATT
AGGAACCTCC
TCTCCCTCAA
TATCGAAGTIC
AACTGATACA
CACACCGTAT
ACCAGAAAAA
GACATTACT
ACGGAAAAAG
GAAAAACAAA
GAAGCAATGA
ATTATTGGAA
GATCAACAAA
GCTGAACTTC
CCTGGATTAG
TTTGAAAGTC
TTTGATAAAA
TATCCTACAG
GCCAAATTAT
AATGGGAAGA
AAAGATTTAC
ATGATA.AACA
GCAGCAGCGG
TAATTCGCCT
ACGACCAGAA
CAGCCT-GGGC CAATCACTGT AGTCAAAAAA GAAACGTCGT TAGGTGCATT TGCGTATCCT TI'ATCGCTCA TATCAAGCA A.AGAAAAAAT CGAAAAGAAA ATCCTTTTC TGAAACGICAA ATACGArI'GG TGTTTTAACC CGAATGCATT GCTATTGGAA GTGGTACGAA TACACATGCG TTACAGATT1' GCCAGAATTA CGCTTGCTTA TCAAGTAGAT ACATTGAGTC TGGCCAAGAT GTCATCGGTI' ATI'AG7TCGA GGATGAAAAA AGTGGCACAA GTGCGTTAAT TATTAGCGGC AACCAAAGTA G
SO
S
S@
0* S S *5
SS
S S
SO
06 @0 S 050 0 *5 5 0
S
S
0 EF079-2 (SEQ ID NO:298) MXSKKKRRI IDGFMIKLLI ICIGAFAYPF VSDALNNYLD QQIIAHYQAX ASQENTKEMA TTKKPDKSYF ESHTIGVLTI PKINVRLPIF ISGHRGLPQA KLFTDLPELK KGDEFYIEVN VTLLTCTPYM INSHRLLVRG HRIPYQPEKA IIWYKRRKKT TRKPK
ELQEKMEKKN
DKTI'JALLLEK
GKTLAYQVDQ
AAGMKKVAQQ
QELAKKGSNP
GSSLLECTSY
IK'IVEPTDTK
QNLLLWTLLL
GLDPFSETQK
PTGGTNTHAV
DLHIESGQDL
IACALIISCF
EF079-3 (SEQ ID N'O:299) see* 0000 0@ 0
TCCT
=TGTTAGCG
AAAGCAAGCC
AACCAAGAAT
AAAACAACGA
ATTCCAAAAA
AA.AGGAAGCT
GTCATTTCAG
AAAAAAGGCG
CAAATAAAAA
CTCGTCACTT
GGACATCGTA
CA.ACAAAATT
'ITCATTATCT
ATGCATI'AAA
AAGAAAACAC
TAGCGAAAAA
AAAAACCAGA
TAAATGTCCG
CCTTGTrAGA
CCCATCGTGG
ATGAATTTTA
CCGTTGAACC
TATTAACTTG
TCCCATATCA
TACTATTATG
GGTAC.AAGCG
TAACTATCTG
CAAAGAAATG
AGGCAGCAAT
CAAATCCTAT
TTTACCAATT
AGGAACCTCC
TCTCCCTCAA
TATCGAAGTC
AACTGATACA
CACACCGTAT
ACCAGAAAAA
GACATTACTT
ACGGAAAAAG
GATCAACAAA
GCTGAACTrC
CCTGGATTAG
TTTGAAAGTC
'TTTGATAAAA
TATCCTACAG
GCCAAATTAT
AATGGGAAGA
AAAGA'ITAC
ATGATA.AACA
GCAGCAGCGG
TTAATTGCCT
ACGACCAGAA
TATCGCTCA
AAGAAAAAAT
ATCCTTTT-TC
ATACGATTOG
CGAATGCAFI'
GTGGTACGAA
ITACAGATTT
CGCTIGCTTA
ACAT'GAGTC
GTCATCGGT1'
GGATGAAAAA
GTGCGTTAAT
AACCAA
TTATCAAGCA
GGAAAAGAAA
TGAAACGCAA
TGTTTTAACC
GCTATTGGAA
TACACATGCG
GCCAGAA ITA
TCA.AGTAGAT
TGGCCAAGAT
ATTAGTTCGA
AGTGGCACAA
TA'ITAGCGGC
EF079-4 (SEQ ID NO:300) 170 TABLE 1. Nucleotide and Amnino Acid Sequenices of E. faecalis Genes.
PF
VSDALNNYLD
TTKKPEDKSYF
I SGHRGLPQA
VTLLTCTPYM
I IWYKRRKKT QQIIAHYQAI( ASQENTKEMA ELQEIOIEYKK QELAKKGSNP
GLDPFSETQX
ESHTIGVLTI ?KINVRLPIF DKTNA[JLLEK GSSLLEGTSY PTGGTNTMAV KLFTDLPELK KGDEFYIEVN GKTLAYQVD)Q IKTVEPTDTK DLHIESGQDL INSHRLLVRG :4RIPYQPEKA AAGfrU(KAQQ QNLLLWTLLL
IACALIISGF
TRKP
EF080-1 (SEQ ID NO:301)
TACTTACACT
TTTATTAGGA
CTTACCTTTG
CAATTACTGA
AAAACAAAAG
GAATATGCAG
CTCATTGATA
ATGACAGATG
AATTATTTCG
GTTCCTGGG
ATTACCCCGC
TTAGGCATTA
GAAAAAACAA
ACGCGTCCTA
ACCCACTCAA
CGTT'TAGGGC
GGTTATTTCC
GCC7rGCCCT
CACCTGATCA
CCTCTGTCTT
ATCATTATCT
TGGACTTACG
CACGAATIGA
CGGCTGCTCA
GCCACTATCG
TGGAAATGGT
ATAT'ITCTAA
CTTTAAAC'T
TTCCTAAAAA
CTGGCGGCGG
TAGCAACGT1T AGGCAT1TTTC
TATGAAAAAA
ACCCCTI'TCG
AATCAACCAA
TATTGTAACC
TTTAAATAAA
GAAAATCTAC
TGATACCTTA
AAccTTTGTT
TGTGGACAGC
AATTGCTTFI
ATATCAATrA
AACCTCCCGC
CAATGGCGGC
CACATTCGGT
CGACTITTAC
GCGGCTGAAA
cTAAAGCAAG
ACAAATAATA
GTTGGCAACG
ATCTCTACTT
GATAAAATAT
CAGGAAACTC
GAAACAGGTA
GCTGCTGCGC
AAATTTCAA
ACAGATCAGT
AGTGGCGGAA
GGCGGCGGTC
GCTCGACTICT
CTAMTI=r
ATTCAATTGA
AdATACAACC
ATACCTATGG
ACCAAAATGC
CTGGAAkACAT
GGGATAATAT
AAGCATTTGT
AAATCACTCG
TGATACTCAG
GTTATCAATA
TAACCAACTC
TGGGCGGTGG
GAAGTTTTTA
TAGCACTCTT
CCTAATACTT
TGATGGCGCA
TrTTAGAAGAA
CGATGALACAA
GATTCTTTTT
GATTGATTAT
GAGTCAAGGA
TAATAAAGGG
TI'ATAAAGTC
TITCGTCTTC
TCCCTTTAGG
TTTCATCACT
TGGTAGCACC
EFOSO-2 (SEQ ID NO:302) MK(KR LLPIFFLILL TFGL.ALPVSA
AENSIDDG-AQ
LLTPDQINQL KQEIQPLEEI( TKASVFIVTT
NNN'TYGDEQE
IDMDLRKIYI STSGNMIDYN TDARIDDTLD
KIWDNMSQGN
PGGHYRVDSE TGKITRYKVI TPLEMVIAFA AALILSLVFL KTTLNLTSRT DQLTNSFI IT RRIPKNNGGS
GGMGGGGST
YADHYLLNKV
YFAAAQTFVQ
GINI SKYQLK
HSTGGGTFG
GKDQNAILFL
ETQAFVNKGV
FSSYQYPFRE
CCRSF
EF080-3 (SEQ ID NO:303) GGCTGAAA ATTCAAT GA TGATGGCGCA
CAATI'ACTGA
AAAACAAAAG
CAATATGCAG
CTCATTGATA
ATGACAGATG
AATTATITCG
GTI'CCTGGGG
ATTACCCCGC
TrTAGGCATTA
GAAAAAACAA
ACGCGTCGTA
ACCCACTCAA
CACCTGATCA
CCTCTGTCT
ATCATTATCT
TGGACTTACG
CACGAATTGA
CGGCTGCTCA
GGCACTATCG
TGcAAATCGT
ATATTTCTAA
CTTTAAACTT
'FrCCTAAAAA
CTGGCGGCGG
AATCAACCAA
TATTGTAACC
TTAAATAAA
GAAAATCTAC
TGATACCTTA
AACCTTIGTT
TGTGGACAGC
AATTGCTTTT
ATATCAATTA
AACCTCCCGC
CAATGCGCC
CAr-A ICGGT
CTAAAGCA-AG
ACAAATAATA
GTTGGCAAGG
ATCTCTACT
GATAAAATAT
CAGGAALACTC
CAAACAGGTA
GCTGCTGCGC
AAATT7'rCAA
ACAGATCAGT
AGTGC.CGGAA
GGCGGCGGTC
AGATACAACC
ATACCTATGG
ACCAAAATGC
CTGGAAACAT
GGGATAATAT
AAGCATTrT
AAATCACTCG
TGATACTCAG
GTrATCAATA
TAACCAACTC
TGGGCGG;TGG
GAAGT
TTTAGA.AGAA
CCATGAACAA
GATCTTTTT
GATTGATTAT
GAGTCAAGGA
TAATAAAGGG
TTATAAAGTC
TTTGGTCTTC
TCCCT'ITAGG-
TTTcATcACT
TGGTAGCACC
EF080-4 (SEQ ID NO:304)
AENSIDDGAQ
LLTPDQINQL KQEIQPLEEX TKASVFIV T NNNTYGDEQE YADHYLLNKV GKD)QNAILFL 171 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. [aecalis Genes.
IDMDLRKIYI
PGGHYRVDSE
KF1'LNLTSRT STSGNMI DYM
TGKITRYKVI
DQLTNSFITT
TDARIDTLD
TPLEMVIAFA
RRI PKNNGGS K IWDNMSQGN
AALILSLVFL
GGMGGGGS ET YFAAAQrTFVQ
GINISKYQLX
H STGGGTFGG
ETQAFVNKGV
FSSYQYPFRE
GGRS
EF081-1 (SEQ ID NO:305)
TGAATGGAAC
TCTGCCTTG
GCCAAACAAG
AA.ACAGGAAT
TATGATACTA
GGAGTCAAAG
TATAAATTAT
ATCGCCAAAA
ATGTCAGGAA
GATCGTAATG
GGCAAACTCG
GCGTTTCTGC
TTTGTACCAA
AAAGATACAG
GAAGCAATCG
'ITCTAATCGC
AATT1AGCCGA
TTGATAAGTT
AATCTGTTGT
CTAGTGATGT
CGATGAGCAC
AAGGCGATAC
ATGATAAAAT
GTAGTGGGCT
GTTCTGCC
TTGATGAAAT
TCACAGTCGC
AGTCACGFI'A
TAATAAAAAA
TGCTGCCGGT
AGcGAAGAAA
ACCGTCCGTT
TGAAAAATAC
TCAAGTCAAA
CCTTTAGGC
CTACCAAATC
TTCGATI'CAA
AGCAATTAAC
AGA-AAAAACA
CAATCAAAAG
TTCTGAACCA
TATCCGCTG
TCTTCAAAAA AACCACTrAT TCI'GGTGTT GGCGGGTATT ATGCTTATAG TCAATGGCAA ACAGCTACTA CATTTTTAAA CGTATTGTCA GTTCAAGAAG CTAGC~rAAA GAAAAAT'-GC C-AAGCAATI AI'CAGGGAT TCAAGCAGAA AAGGCGAAAG ACAATCAATA CACATTTACC GAAATGAAAG ATTTGTCTTA TCA.ATCAAGT GCTTGGAAGC CATCTTTAAT
TTTTCCAGAT
GTAGATAATG CCA.AACCTGG AGAAATTGTC AA.AGTGTTTC ACGAAGTGGG CGTAGTGCCT GCCAATATCA AAGCTTTTAG
TGATAAATTC
TTAAGCCAAG GATGCGTCCA
AGCAGACTCC
GTGACAGAAT TACCAACAGG
GGCTGCGACA
GGGGAAGCAN TGCGCKATTA A EF081-2 (SEQ ID NO:306)
MERSNRNKKS
QEFDKLPSVV
KLSMSTPLGE
RNGSGLAINK
VPITVASEPV
SKKPLILGVS
QE-ASLKKNGY
MKDLSYQSS I
V'FDEVGVV'PG
TELPTGAATK
AL.VLIAAAGG GYYAYSQWQA KQEL.AFAKKT DTKSVVEKYQ AIYSGIQAEG VKASDVQVKK
ATTFLNVLSK
AKDNQYTFTY
DNAKRGEIVD
SQGWVQADSF
AKKGDTYQ IA
KLGSGAEKTA
DTESRYYPLG
WKPSLIFPDM
NIKAFSDKFG
EAXRN
VSVDEINQKL
a.
a.
a a a a a a a a a a EF081-3 (SEQ ID NO:307) T GGCGGGTATT ATGCTTATAG TCAATGGCAA
GCCAAACAAG
AAACAGGAAT
TATGATACTA
GGAGTCAAAG
TATAAATI'AT
ATCGCCAAAA
ATGTCAGGAA
GATCGTAATG
GGCAAACTCG
GGCGTTTCTG
TTTGTACCAA
AAAGATACAG
AATTAGCCGA
TTCGATAAGTT
AATCTGITGT
CTAGTGATGT
CGATGAGCAC
AAGGCGATAC
ATGATAAAAT
GTAGTGGGCT
GTTCTGGCGC
TTGATGAAAT
TCACAGTCGC
AGTCACGTTA
ACCAAGAAA
ACCGTCCGTT
TGAAAAATAC
TCAAGTCAAA
GCCTTTAGGC
CTACCAAATC
TTCGATI'CAA
AGCAATTAAC
AGAAAAAACA
CAATCAAAAG
TTCTGAACCA
TTATCCGCTG
ACAGCTACTA
GTCAAGAAG
CAAGCAATT
AAGGCGAAAG
GAAATGAAAG
GCTTGGAAC
GTAGATAATG
AAAGTGTTTG
GCCAATATCA
TrAACCCAAG
GTGACAGAAT
GGGG
CAT1TITAAA CGTATTGTCA CTAGCT1'AAA GAAAAATGC ATTCAGGGAT TCAAGCAGAA ACAATCAATA
CACATTTACC
A'TTG.TCTTA
TCAATCAAGT
CATCTMTAAT
TTTTCCAGAT
CCAAACGTGG AGAAATTGTC ACGAAGTGGG
CGTAGTGCCT
AAGCTTTTAG TGATAAATTC GATGGGTCCA
AGCAGACTCC
TACCAACAGG
GGCTGCGACA
EFO81-4 (SEQ ID NO:308) G GYYAYSQWQA KQELAEAXK<T
ATTFLNVLSK
QEFDKILPSVV QE-ASLKKNGY DTKSVVEXYQ AIYSGIQAEG vKAsDVQVKK
AKDNQYTFTY
KLSMSTPLGE MKDLSYQSSI AIKKGDTYQIA
WKPSL
1 IFPDM SGNDKISIQV DNAXRGEIVD RNGSGLAINK VFDEVGVVPG KLGSGAEKTA NIKAFSDKFG VSVDE-INQKL
SQGWVQADSF
VPITVASEPV TELPTGAATK
DTESRYYPLG
172 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
EF082-1 (SEQ ID NO:309)
TAAAAAATGA
CTTTTAAATA
AITTACG=TG
CCTGTrGACAC
GACTATGTTT
TCGGCACAAC
GTTCAAGTAA
GCGCAATTTG
TTAGCAGCAA
ACCTTGGATC
ATCCGCACTT
ACTGTTCCTG
ATT'ITGGATG
AAAAGATCGT
GTTCAAAAGT
CTCTTGATAA
CTA.ATCCTGC
CAAATATCCA
TGGATCAAGT
CTGACA.AACG
CTACAAGTGA
CACCCAA'ITC
CAACTCGTC
GGACATT1AGC
CGACAACGAA
ATACACCACT
GCGCATTCA
TGA.AGCAGCT
TACTGTCACG
TGATCCTCAT
TTTTGGATCA
GCAAAATAGT
TCGTCTAAAT
TTCA-ACACCC
AACACAGTTA
CGCCAC'ITCT
T'rFTGGTAGC
AAAAGTTGCA
'FrAA
AGCAT=~GT
CAAGTCGCTT
CCACCTGTCA
CAACCTGGTA
AAACAAATTC:
ACTCCCGATT
CTTGGCTGGA
GCTGTTTGG
CI'CTTTGG
CCTGTGGCGA
GGANCGACCG
GCTAAACAAT
TCGTTGCTAC
CTATTCAATC
ACCCGACGAA
CAGCCGGACC
AAGCCGGAAC
TAATTAGCCT
AATTATCAGT
ATAATGCATC
CGCCATTAAC
CTGCCCCTCT
CTGCTCAAGG
AtAAALACAAC
GCCTCTTATG
CAACGCTGAT
CCCTTCTCAC
CCTTAGTA'-
AGCGATCTAT
GCCAAACTAT
TAAACAGAGT
CTTGACCTTT
GGTCCCAGTA
TTCAACAGGA
CATTCAATTA
GCTTACTTG
EF082-2 (SEQ ID NO:310) HI(KIVRISS ILFVATPLML LNSSKVEAAQ VASIQS4ADI TFALIDNTVTP PVNPTNPSQP VTPNPADPHQ PGTAGPLSID YVSNIHFGSK QIQAGTAIYS AQLD)QVQNST GDLISVPNYV QVTDKRGL 'NL GWKLSVKQSA QFATSDSTPA VLDNASLTFL AATPNSTQLL SLAPLTVPVT LDPTGAATSP VATAALSTGM GTWTL.AFGSG XTAAQGIQLT VPATTKKVAA KQYKTTLTWI
LDDTPL
EF082-3 (SEQ ID NO:311) AGCT CAAGTCGCTT CTATTCAATC CAACGCTGAT e.
t
ATTACGTTIG
CCTGTGACAC
GACTATGTTT
TCGGCACAAC
GTTCAAGTAA
GCGCAATTTG
TTAGCAGCAA
ACCTTCCATC
ATGGGCACTT
ACTGTTCCTG
ATTTGGATG
CTC'ITGATAA
CTAATCCTGC
CAAATATCCA
TGGATCAAGT
CTGA. AACG
CTACA.AGTGA
CACCCAATTC
CAACTGGTGC
GGACATTAGC
CGACAACGAA
ATACACCACT
TACTGTCACG
TGATCCTCAT
=ITGGATCA
GCAAAATAGT
TGGTCTAAAT
F1'CAACACCC
AACACACTTA
CGCCACTTCT
T'ITGGTAGC
AAAAGTT1GCA
CCACCTGTCA
CAACCTGGTA
AAACAAATrC ACTGGCGA T
CTTGGCTGGA
GCTGTTTTGG
CTTTICTTT'GG
CCMTGGCGA
GGANCGACCG
GCTAAACAAT
ACCCGACGA-A
CAGCCGGACC
AAGCCGGAAC
TAATTACCT
AATTIATCAGT
ATAATGCATC
CGCCATTA-AC
CTGCCGCTCT
CTGCTCAAGG
ATAAAACAAC
CCCTTCTCAG
CCTTAGTAT'
AGCGATCTAT
CCCAAACTAT
TAAACAGAGT
CTTGACCTTT
GTCCCAGTA
TTCAACAGGA
CATTCAATITA
GCTTACTTGG
EFOB82-4 (SEQ ID NO:312) AQ VASIQSNADI TFALONTVTP PVNPTNPSQP VTPNPADPHQ PGTAGPLSID YVSNIHFGSI( QIQAGTAIYS AQLDQVQNST GDLISVPNYV QVTDKRGLNL CWKLSVXQSA QFATSDSTPA VLDNTASLTFL AATPNSTQLL SLAPLTVPVT LDPTGAATS P VATAALSTGM GTWTLAFGSG XTAAQGIQLT VPATTKKVAA KQYKTTLTW I
LDDTP
EF083-1 (SEQ ID NO:313)
TAATTTAAA-A
GCTCTGTTTT
GCGATTTATC
CAGAITGCAG
GCAAGTGCTA
GACA.AGGAGA
TTATGCCTAC
AAGGTGAACA
GCTACAATC
1 FGCCCAAGG
AATAAAAATG-
AGCTATG=T
AGGTCGCTI'
TAGCCGTATT
TAAACGTGCG
AAAAAGAAAA
GCCGCAAAAG
GGCTATGCAC
TATGAACAAT
CATACCTATA
TTTTAGCACC
GAGACCAAGG
ATGATAAAT
ACACATATA.A
T=TGGTATGA
AGCTTGTC
TTCCGATIGG
CGCTATTGCC
AACGCAAGTG
CAcTTGGGGA TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes
AACATGGACA
AATTCCATCG
GGAGGATATG
ATGCGCAGAA
CTAAATCATG
GCGTATCCTA
GGTATTGGTA
GAT'ITAACAG
CCAGCAACAG
GATACCGTCA
TGGGTAAAAG
GGTATCTTGT
CCTGATAAGC
GCTTATCAAT
AATCTTATTT
ACGGTTAAAT
GCTTTAGCTG
AATTAI'AA
ITTGCGAAAAC
TTGcATTAGA
TAAGTTCAGA
TCAAACAGGC
TAAACTATCA
TCGATGGTGT
ITTTGGCAAT
GAATTACGGA
ATGCAGGCGA
AAGTGAAATT
GAAACAGCTA
CATGGATTAG
AACCAGAAGC
ATGGAACAGA
ATCCTGGTCA
ACGGCGATAA
CATTAAACGG
AACAATGGAT
TACTTTM'TGC
TrTTGAACAT GGAGCGTTrGO TGCCGAAAAA
GCAGCAAATA
TCGCTATACT ccAATG.TATT ACAAATCATC
AAACAGTTTC
GTCACCATAT
CCATTGTATG
CACATCCGCT TATA MGCAG TAGTGG'ITAT
ACAGATACCA
ACGAAATTGAA AAAATACCTA TAATCTAGAT
GCTTGGCAA
CAAAGTGCAAGAACTAACT
CAAAGGTGAT
ATTI'AATTAT
GACTCATGTG
GTACAATACG
CTATCAAACG
TTGGCGGCAT
AGTTTTGAAA
GTCAATGGAT
TTTATCTAGT
ATTGCAGCAA
ATTAGcAAAT
CCTAACTI'A
CACGTATTCA
CTAGTGTTCC
CAGAGACAAT
ACAGCTATAA
CTAACTC 11T
CTTATTTCCC
GTGGTTTAGA
ATAAACCAGA
ATTCTGATGT
CTGGGGAAGC
GAAGCAGAGT
TGCCAGACGC
GAGAPLACATT
TAAATGCATT
CGGCAACAAG
AACTTGGCAC
TTTATCCAGC
A-ACGCCTAAA
AGATGGATAT
TTTGTACGGT
GCCATTTACA
ATGGATTGCT
A.AGCATGGAT
TGGTAACGTA
AACGGATACG
TAAAGTTGGC
TATTCCGCAA
ATTGCTTGAA
A.ACAGTCGTC
ATCAAGTAT
GGCTAATCCA
TAATGTCTAC
TACTTATCAA
TCAAACATTG
EF083-2 (SEQ ID NO:314) MK KKILAGALVA LFFMPTAMFA AKGDQGVDWA IYQGEQGRFG
YAHDKFAIAQ
IGGYNASGIY EQYTYKTQVA SAIAQGKRAH TYIWYDTWGN N4DIAKTTMDY
FLPRIQTPKN
SIVALDFEHG ALASVPDGYG GYVSSDAIEKA ANTETILYGM RRIKQAGYTP
MYYSYKPFTL
NHVNYQQIIK EFPNSLWIAA YPIDGVSPYP LYAYFPSMDG IGIWQFTSAY
IAGGLDGNVD
LTGITDSGYT DTNKPETDTP ATDAGEEIEK IPNSDVKVGD TVKVKFNVDA
WATOEAIPQW
VKGNSYKVQE VTGSRVLLEG ILSWISKGDI ELLPDATVVP D)KQPE-ATHVV
QYGETLSSIA
YQYGTDYQTL AALNGLANPN LIYPGQVLKV NGSATSNVYT VKYCDNLSSI
AAKLG'LTYQA
LAALNGLANP NLIYPGQTLN
Y
4*
SSSSSS
S
S
555.
5555
S
5555 *555
S.
EF083-3 (SEQ ID NO:31S) AAAAG GAGACCAAGG TGTGGATTGG
GCGATTTATC
CAGATTGGAG
GCALAGTGCTA
AACATGGACA
AATTCCATCG
GGAGGATATG
-ATGCGCAGAA
CTAAATCATG
GCGTATCCTA
GGTATTGGTA
GAITTAACAG
CCAGCAACAG
GATACCGTCA
TGGGTAAAAG
GGTATCT'TGT
CCTGATAAGC
GcTTATCAAT AATCTTATm ACGG ETAAA'I
GCTTTAGCTC
AAT
AAGGTGAACA
GCTACAATG-C
TTGCCCAAGG
TTGCGAAAAC
TTGCA'ITAGA
TAAGTTCAGA
TCAAACAGGC
TAAACTATCA
TCGATGGTC.T
TTTGGcAATT
GAATTACGGA
ATGCAGGCCA
AAGTGAA6ATT
GAAACAGCTA
CATGGATTAG
AACCAGAAGC
ATG.GAACAGA
ATCCTGTCA
AcGGcGATAA
CATTAAACGC
AGGTCGCTTT
TAGCGGTATT
TAAACGTGCG
AAcAATGGAT
TTTTGAACAT
TGCCGAAAAA
TGGCTATACT
ACAAATCATC
GTCACCATAT
CACATCCGCT
TAGTGGTTAT
AGAAATTGAA
TAATGTAGAT
CAAAGTGCAA
CAAAGGTGAT
GACTCATGTG
CTATCAAACG
AGT'ITTGAAP
TTTATCTAG1
ATAGCAAAI
GGCTATGCAC
TATGAACAAT
CATACCTATA
TACTTTTTGC
GGAGCCC LrGG GcAG~cAAATA
CCAATGTATT
AAAGAGTTTC
CCA'TGTATG
TATATTGCAG
ACAGATACCA
AAAATACCTA
GCTGGGCAA
GAAGTAACTG
ATTGAATTAT
GTACAATACG
TTGGCGGCAI
GTCAATGGA'I
ATTGCAGCAP
CCTAACTTGP
ATGATAALAT C
ACACATATAA
TTI'GGTATGA
C
CACGTATTCA2 cTAGTGTCC
CAGAGACAAT
ACAGCTATAA
CTAACTCTTT
CTTATTTCC
C
GTGGTTTAGA
ATAAACCAGA
ATTCTGATGT
CTGGGGAAGC
GAAGCAGAGT
TGCCAGACGC
GAGAAACATT
TAAATGGATT
CGGCAACAAG
AACTTGGCAC
TT1'ATCCAGG
:GCTATTGCC
AcGCAAGTC :ACTTGGcA k.AcGCCTAAA kGATGGATAT rTTGTACGGT GccATTTACA
ATGGATTGCT
AAGCATGGAT
TGGTAACGTA
AACGGATACG
TAAAGTTGGC
TATTCcGCAA ATTGCTT1GAA
AACAGTCGTC
ATCAAGTAT
GGCTAATCCA
TAATGTCTAC
TACTTATCAA
TCAAACATTG
TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
EF083-4 (SEQ ID NO:316) KGDQGVDWA IYQGEQGRFG
YAHDKFAIAQ
ICGYNASGIY EQYTYKTQVA
SAIAQGKRAH
SIVALDFEHG ALASVPDGYG
GYVSSDAEKA
NHVNYQQIIK EFPNSLWIAA
YPIDGVSPYP
LTGITDSGYT DTNXPETDTP
ATDAGEEIEX
VKGNSYKVQE VTGSRVLLEG
ILSWISKGDI
YQYGTDYQTL AALNGLANPN
LIYPGQVLKV
LAALNGLANP*NLIYPGQTLN
TYIJDTWGN
ANTETILYGM
LYAYFPSMECG
I PNSDVKVGD
ELLPDATVVP
NGSATsNVYT MD IAKTTIMDY
RRIXQAGYTP
I GIWQFrSAY
TVKVKFNVDA
DKQPEATHVV
VKYGE)NLSSI
FLPRIQTPKN
MYYSYXPFTL
IAGGL.DGNVD
WATGE-AIPQW
QYGETLSS IA
AAJ(LGTTYQA
EF084-1 (SEQ ID NO:31 7 TAGTCAAACG TTTATTTITTT CCT1'AAATCC CTATTGAATT GGAGGAGAAC TATGA6AGAA6A AGCGGTATGA TGGCCACCAT
TGCGGCCGCC
AAAAATCGCC GTGTTGGGAA
AAAATTATA
AATAATCGGC CCGCAGAAGA
AATCATTTCA
AGCGCATTI' CACAAT'I'IGA
TAACTATGAT
CACTTAAAAG AAGAAGATCA
CGGACGCATG
GTTGATGCGC TATTTA6ACCG
CATI'AACGA
GTCACAAAAT TACTACGAAA
AGACGATCAA
ATTTATCCAC CGTGTGTTGT
ATTAACAACT
ACTGGTGATG -GCTATAAACT
AGCCAAAAAA
ACCGAATCAC CTATTATTTC
TGAAGAACCT
TCTTTACA6AG ATGTTAATTT
AACTGTTTTG
CAAATGGATA TGCTGTTTAC ACATT' ITGC .AGTTTTAITA ACCAAGAATT
AACTCGCAAC
**GTCTTTCCGA CAAAATCTI'T
TGAAGAAGTG
*CTTTCCTTTG TGGAACTACT
GAAAGACTTT
GAAAAATC'IT TTGTCACAGG
CGGTGGAT'I
*GAGAGCAAAT TAGTCAATGG
-TATTTTTI
*ACTGCAGGCT ACAATGTTAC AGCTGCATT1 GCAGAAATTG CAGAATACAC
CTATTTACCI
EF084-2 (SEQ ID NO:318)
AGAAAAAATC
TTTGATGTA6A
GAAGCAGGCG
ATGACTCGTG
TTTATTCCTG
ATCATGAACT
TTCCCTGTI'A
TTAGGAGTCA
ATAATTGGCG
GGCGGCCGCA
ATGGGGCATA
'rTrATCCTGG AAccAAAAAG
ATTTCAGGAC
GGTAATCAAC
CCTGCCAAAC
CAGTTCACTC-
TCCCTCAAAC
GCTGOGTGAAC
GTCACTGGAC
SAT1'GAAGAAC CCGTAA6TTAT G
TI'ATTGTCGG
CTCAAGTATT
GCGGCCGCTG
GGAATGGAAA
Tr'IrrGAATC
CAGATAAATC
CTGT'T1TAC
TTGAAACCGA
CTTATCCTTC
CCATCAGCCC
ATAAAACGTT
GAAAACCTI'
CTGCCGCGCT
CTGTCACGGT
*AACTAACAGA
*TTACGAAAAC
AAGTGACCCC
-TTTAGATAT
-ATGTIMCTGG
TCTAA
;GTACACTAC
EGCTGGGACG
kTTGATTGAA
L'AATGTAACC
kTTr-TTTATAC
CA.ATGGTATT
GAAGTCAATTr
AAAAACACAG
ACTGGAAAAA
CACAGGAGCA
GCTCTACCCT
GCAAGGTCTC
AGTTAATCAT
CCGCTGTT-CT
AGCCTTGGAT
AAAGCAACGN
ATTGCCTTTG
TAAAACAATG
TAATGGCTAT
CTCCCATGCC
4*t*
C
MKKl'F DVIIVGAGTS GmmATIAAAE
AGAQVLLIEK
NRRVGKKLLM TGGGRCNVTN NRPAEEIISF
IPGNIGYFLYS
LKEEDHGRMF PVTDKSKSIV DALFNRINEL
GVTVFTKTQV
YAPCVVLTTG GRTYPSTCAT GDGYKLAKKM
GHTISPEJYPT
LQDVNLTVLN QKGKPLVNHQ MDMLFTHFGI SGPAALRCS S FPTKSFEEVP AKQLTEKQRL SFVELLKDFQ
FTVTKTLPLE
SXLV-NGLFFA GELLDINGYT GGYNVTAAEV
TGHVAGSHAA
AFSQFONYDI
TKLLRI(DDQI
ESPIISEEPF
FINQELTRNG
KSFVTGGGIS
EIAEYTYLPI
MNFFESNGIH
IGVETELEKI
ILDKTLQGLS
NQPvTVALDv
LKEVTPKTME
EEV
EF084-3 (SEQ ID N~O:319) a C. C GAAGCAGGCG CTCAACTATI' ATTGAF1'GAA AAAAATCGCC GTGTTGGGAA AAAATTATTA
ATGACTGCTG
AATAATCGGC CCGCAGAAGA AATCATTI'CA TrATTCCTG AGCGCATTTT CACAATTGA TAACTATGAT
ATCATGAACT
CACTTAAAAG AAGAAGATCA CGGACGCATG TCCCTrGTA GTTGATGCGC TATTTAACCG CATTAACGAA
TTAGGAGTCA
GCGGCCGCTG
GGAATGGAAA
TTTTTGAATC
CAGATAAATC
CTGTTT=AC
TAATGTAACC
ATTTTTATAC
CAATGGTATT
GAAGTCAATT
AAAAACACAG
175 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
GTCACAAAAT
ATTTATGCAC
ACTGGTGATG
ACCGAATCAC
TCTTTACAAG
CAAATGGATA
AGTTTATTA
GTGTTTCCGA
C=TCCTTTG
GAAAAATCTT
GAGAGCAAAT
ACTGGAGGCT
GCAGAAATTG
TACTACGAAA
CGTGTGTTGT
GCTATAAACT
CTATI'ATTTC
ATOTTAATT
TGCTGTTTAC
ACCAAGAATT
cAAAATCTT
TGGAACTACT
TTGTCACAGG
TAGTCAATGG
ACAATGTTAC
CAGAATACAC
AGACGATCAA ATAAT1CGCG ATTAACAACT
GGCGGCCGCA
AGCCAAAAAA ATGGGGCATA TGAAGAACCT TTI'ATCCTGG AACTGTTTTC AACCAAAAAG ACX1Tr-TGGC ATTCAGGAC AACTCGCAAC GGTAATCAAC TGAAGAAGTG
CCTGCCAAAC
GAAAGACTTT CAGTTCACTG CGGTGGGATT TCCCTCAAAG TTATTTTT-. GCTGGTGAAC AGCTGCATTT
GTCACTGGAC
CTATTTACCA ATTGAAGAAG
TTGAAACCGA
CTTATCCTTC
CCATCACCCC
ATAAA.ACGTT
GAAAACCTFI'
CTGCCGCGCT
CTCTCACGGT
A.ACTAACAGA
TTACGAAAAC
AAGTGAC C
FAGATAT
ATGTTGCTGG
TC
ACTGGAAAAA
CACAGGAGCA
GCTCTACCCT
GCALAGGTCTC
AGT1'AATCAT
CCGCTGTTCT
AGCCTTGGAT
AAAGCAACGN
ATTGCCTTTG
TAAAACAATC
TAATGGCTAT
CTCCCATGCC
EF084-4 (SEQ ID NO:320) E AGAQVLLIEX NRRVGKKLLM TGGGRCNVTN LKEEDHGRMF PVTDKSKS IV YAPCVVLTTG GRTYPSTGAT LQDVNLTVLN QKGKPLVNHQ FPTKSFEEVP AKQLTEKQRL SKLVNGLFFA GELLDINGYT NRPAEEI 1SF DALFNR INEL
GDGYKLAKIK
MOMLFTHFC I
SFVELLKDFQ
GGYNVTAAFV
I PGNGKFLYS
GVTVFTKTQV
GHTI SPLYPT
SGPAALRCSS
FIVTKTLPLE
TGHVAGSHAA
AFSQFDNYDI
TYKLLRKDDQI
ESPIISEEPF
F IN'QELTRNG
KSFVTGGGIS
E:IAEYTYLPI
MNFFESNG IH IGVETELEK I
ILDKTLQGLS
NQPVTVALDV
LKEVTPKTME
EE-V
EF085-1 (SEQ ID NO:321) a a a a
TAACCCATGA
CAGTCCACAC
CCGCGGTGTA
AAGGTT N'
TTTGCACAC
TATCAGCCAT
GGGCTGTT'C
GTACTCTATC
GTGGTACATA
CTTTATCAAG
CTTTTATOCG
AGTGCGAATA
CTTGGCGCAC
AGGCTGAG
GCGGCAGCCT
ATGCTTGAAA
ATGATGGACA
GGGAGAATCG
CCCGATTTC
GGACAAATTG
AATGCGGTTC
GAAAAATACC
AAGCTG ITCA
CGGTTTCGGGG
GGAAAACACC
AATATTTAA
AATCATTTTG
TCATATCAAA
GGTTACCGAT
CCTATACGAT
AATrTGTGTC
TGGTGGAACT
ACTACAATAA
CCACACCGAA
GAGATGATAA
GGCTGACAAT
CGTATATCTT
AAATGGCAAA
TGGCTCACGA
ATGAAATCGC
CTCATGAGTT
ATATCGGTGA
GCCAGGGCAA
TACCCATAGC
AAACCTTGGC
TCCTGTCCGP
AGAACACGCC
GTCTT'CCG]
TCCCATTCTI
CTTGCCATCC
TCAGATGGC(
TCCCGCATAT
AATCTGCGGG
TG-GAGGAATC
CATTGTCCTC
TTATTTCAGA
GATTCAGAAT
CCAATCCTTT
TGCCGATACA
TATI CGAT
TCGGGGAATT
TGCGCGGCAA
CCTGAAAGAA
CATGCATI'CC
AAGGGAACAT
AAAAACGCCC
*CTACAAAGAC
AACCATTTCC
*CGAACCGCTG
AGAGGCAA.AC
TCCGAAGCTC
CCAGGGAGGI
VrrGAACATC
TCGCATTGAI
TACAAAAAAC
T'1rTGTTCTC
GGGGATATGA
CGAACTGCCG
AMTATAATGG
TTACTCCTTG
GCGATGGAAG
AGCGATAGGC
GAGTITATA
TCAAATAGTG
G'IGCTCAAA
GTTATC.ATTf-
ATGACAACC
GTACCGCCGC
ATGTATATCA
GAGTTG-AGG
ATCGCGGCTG
CATTCTAAGT
GAAATACTGG
GA.CATAGGGC
AACCAGCGGI
ATCCAAAAGG
*GAG4GTACGGP
GGCGTTCAC;
CAGGCGCGAI
GCTGGATGCC
GCTGGATTTI
CTITGACGGT C
GAAAATATCA
AAAGAAAAGGC;
TCGrGTGTAAC
CACAGCAAAC
=TATATGCA
T'TGAAGATAA
TTAGGCCCGA
GCAAGGCAGG
CCATAATCGT
CGATCAAkAGC
CGCTGGAGCG
GGCTGAAACA
AAACACAGCG
TAAG-CGTTCT
ATCTGCGCGA
AGC 1TGTCAG
GCACGGT'GC
TCGTCACAGA
CGCTATCCAA
TATGGAGTGA
TTGATGATAC
SGCAGCAAAAA
ATGAGCCTCC
k CCGCCCACAT
,ATGGCAGCA
ECAAAACCAT
;ATTTrCATT
;GCAACACTG
AGTAAAATCC
AGAGGTGCA
AGAGGGAAGC
CTrTCTI'AT
TGTGGGATTG
TGTATTCAGC
CTTAGCGGAC
AAAGGATCAG
AACCATCGCA
ATATTTCTT
GTI'GGAGGGA
ATGCATCAAA
CCTGAACGAT
CGAGCTGCTA
TATIrCCAGCC
TGTCATAITG
GcCTGGGGCT
TGCACTTTCA
GTGGGCGMAG
AATATGCGCT
CAACACTATA
176 TABLE 1. Nucleotide and Amino Acid Seqeuences of E-faecalis Genes.
EF085-2 (SEQ ID NO:322)
HERKGIFIK
VFSYTI IVLL
L.FHYNNQSFE
YQGLTIRG IV GALAHDMI4SM
LENIGDYKDH
DFQTLAEANN
KYRLsvLNMCG
LLVGVTATLF
FYI EDKEGSV
MIAIMVVFSL
YIRLKETIAR
SKYLREC 1KM
QRFVTDIPAG
.VHIDDTALSK
AQQFVSYFRA
LYATPNADTS
LCAY IFARQM
LEDEIAREHE
MDRQGITISE
QIVLSDPKLI
LFIPFYRIDQ
MEAQQTVMSY
QPLVELIQNS
NSVRPDFLYV
VH-RODNISIV
7rTPIKALAIls
ANXMANLKEV
LEETQRYFFA
AASHELKTPI
ILELVSLNOG R IVP IAEPLD QKALSNVILN
AVQNTPQGGE
ARSSKK(WAKR
FGACHRTKNA
DRLDMQEVAG
AQSKAGVGLL
PPPLERKDEL
AAVSVLLEGM
IGRTVAELLP
VRIWSEPGAE
GcI{EPP ICAG KHLRWRFVLA GFTAHINTIN
I
EF08S-3 (SEQ ID NO:323) GC AATTTGTGTC 'FrATTTCAGA
GCGATGG
TATCAGCCAT TGGTGGA.ACT
GAFI'CAGALAT
AG CACAGCAA AC AGTAAAATCC
GGGCTGI'TC
GTACTCTATG
GTGGTACATA
CTTATCAAG
CT TTATGCG
AGTGCGAATA
CTTGGCGCAC'
AGGCTGGAGG
GCGGCAGCCT
ATGCTTGAAA
ATGATGGACA
GGGAGAATCG
CCCGATTI'C
GGACAAATTG
AATGCGGTITC
GAAAAATACC
AAGCTGTTCA
CGGTTTGGGG
GGAAA-ACACC
AATATTT
ACTACAATAA
CCACACCGAA
GAGATGATAA
GGCTGACAAT
CGTATATCT
AAATGGCAAA
TGGCTCACGA
ATGAAATCGC
CTCATGAGTI
ATATCGGTGA
GGCAGGGCAA
TACCCATAGC
AAACCTTrGCC
TCCTGTCCGA
AGA.ACACGCC
GTCTTTCCG!I
TCCCATTCTP
cTTGCCATCC
TCAGATGGCC
CCAATCCTTTr
TGCCGATACA
TATTTCGATT
TCGGGGAATT
TGCGCGGCAA
CCTGAALAGAA
CATGCAFI'CC
AAGGGAACAT
AAAAACGCCC
CTACAAAGAC
AACCATTICC
CGAACCGCTG
AGAGGCAAAC
TCCGAAGCTC
CCAGGGAGG
I
TTTGAACATC
TCGCATI'GA]
TACAAAAAA(
TTTTGTTCT(
GAGTTTTATA
TCAAATAGTC;
GTTGCTCAAA
GTTATGAT'IG
ATCACAACGC
GTACCGCCGC
ATGTATATCA
GAGTTGGAGG
ATCGCGGCTG
CATI'CTAAGT
GAAATACTGG
CACATAGGGC
AACCAGCGGT
ATCCAAAAGG
GAGGTACGGP
GGCGTTCACP
SCAGGCGCGAM
GCTCGATGCC
;GCTGGATr
TTGAAGAA
TTAGGCCCGA
GCAAGGCAGG
CGATA.ATGGT
CGATCAAAGC
CGCTGGAGCG
GGCTGAAAGA
AAACACAGCG
TAAGCGTTCT
ATCTGCGCGA
AGCTTGTC-AG
GCACGGT*TGC
TCGTCACAGA
CGCTATCCA-A
TATGGAGTGA
TTGATUCATAC
GCAGCAAA
ATGAGCCTCC
k CCGCCCACAI
AGAGGTGGCA
AGAGGGAAGC
CTITCTTTAT
TGTGGGATTG
TGTATTCAGC
CTTAGCGGAC
AAAGrGATAG
AACCATCGCA
ATATTTCTTT
GTTGGAGGGA
ATGCATCAAA
CCTGAACGAT
CGAGjCTGCTA
*TATTCCAGCC
*TGTCATATTG
GCCTGGGGCT
TGCACTTTCA
GTGGGCGAAG
pLATATGCGCT rCAACACTATA
S
S S EF085-4 (SEQ ID NO:324)
S
QFVSYFRA MEAQQWVKSY QPLVELIQNS
DRLDMQEVAG
LFHYNNQSFE FYIEDKEGSV LYATPNADTS
NSVRPDFLYV
YQGL.TIRGIV MIAIMVVFSL LCAYIFARQM
TTPIKALADS
GALARDMHSM YIRLKETIAR LEDEIAREHE
LEETQRYFFA
LENIGDYKDH SKYLRECIKM MDRQGKTISE
ILELVSLNDG
DFQTLAEANN QRFVTDIPAG QIVLSDPKLI
QKALSNVILN
KYRLSVLNMG VHIDDTALSK LFIPFYRIDQ
ARSSKKWAKR
KHiLRWRFVLA GFTAHINTIN
I
VHRIDDNIS IV
ANKMANLKEV
AAsHELKTP I RIVP IAEPLD
AVQNTQGGE
FGACHRTKNA
AQSKArGVGLL
PPPLERKDEL
AAVSVLLEGM
IGRTVAELLP
VRIWSEPGAE
GCHEPPICAG
EF086-1 (SEQ ID NO:325) TAAcTGGTGG
CATGATGACC
cTGTATGCAA
ACAGATATTT
GATTGGCAAA 'TCGTTCCGC
GCAGCGCTAA
TATI'GAATAC AGATGCAGAA
AAA'ITAAATA
AAGATCCAAA CATACA.ATGG
CCAATTTATC
CAATCACCGT TTTAGGTACT GGACTTF'rGT CAGATAcATT GATI1'TA 1TA AATTTACTGC TCcGcTGATG GTGCAACAGG
AGCTAACTTA
TAGAAGATAA TCAACGCCTA 1 177 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
GTACAAGTAC
TATCCTGATG
GAGTTGCTAA
AATGACCCTA
GTAAATGGAA
AkATCCTTTTA
GCAAAATTTG
CAACAAAGTG
GACTTGAAAA
AATGTATATG
AGTATGTATT
TGGCATACAG
GGATACTGGG
ITGGCAAATG
AATGACAGG
G 1TGCTAAAA
ACTGGTACGA
GAAGTGAAGC
ANTCCATTGA
ATAGAAGAAC
TATACAAATA
TACGAATArT
AAAGGCTATA
GTCATGATGA
ATGTCGGTTA
TTACAAAATA
GCGGACCCAG
GGCAGCTCGC
AAATTAATTC
GTCTACAGCG
GCAGAAGTAG
CCAACTAAAG
CAAGAAAAAG
CAA.ACTGTCG
CTACGTI'CGG
AAAAATGGGG
AAACAAGTTA
ACAGGAGAAA
TGTC T=~AG
AAGAAGCTGT
CTTCCMGAT
AAGGGTTTGG
ACA'ITAGTAA
AAATGCCATC
CTACAGAGTT
CACCAGAAAA
GGTCATACTA
ATGTAGTGAA
GTTCGATGGA
CACAACGTT
CAGACCOCAT
CAACGATCGA
GTGCTTATAC
TTGCCTCTAT
AATCTTGGTT
CAGATGCTTC
TTAACCAAGG
ATAACATTGG
GCTCTGGTCG
CATTTGCTAA
TAACAGTrGGT
CTGTTCTAGA
ATACATGGAA
=TCAGAAAA
ATGCATCCGT
AAATTn-CTGT
GTTCAATICAT
AGGAACAAAA
AAGCATTGAA
ACCAAGCAGA
AAGTAGATAA
ACTACACCGC
CAGATCAAAC
CAGTGAACG
GGAATAATGG
AGCCATCAAG
AGAAATCGAT
TTI=CGTAA
TCCGTCCGTT 1'AAAAAGTG TCAACATCGGT TATTTTCCGT ACGAATTCAG ACTATMIAC TTAI'TAAT GTTGTGGATA GATGGTTTCT GGTAGAACTA TGAATCGGGT AAAGAAACAA TTTAAGAAAT GACAT ITATA TCATTTCTTT AAAAAACCA-A TAGTGCGTCA CCTGCCCAAG TCGAGTCCTA CAGAAAAATA
CGGAAACTAT-GAATT=~GA
GCTTTATITA ThCPLATICAAG TCCATATCGA TTACCAGGAA AGGGAAACGC AGTCCCCAGT AGGAATGTTT TTAGATAAAA CTTATTAGAT GGTCAAATCA GATTGAAACA ATCCTCGATA TTCAGACAAA GATAAMTCTT CTATGTI'TT CCTAATTCNA CTACGGAGAT ATTAACGAAT AATTAGTAAA AATTATGGCA TGGGAAAACG AATGAACAAA AAATACAGCA AACTTACAAG TAATGACCAA GAAA CAG AATI'GATAAC GGTGTTTATC ITCTATTGAA TTTGATAAGG TGACCAAAAT ATATCACT' TGTTAAA.ACA ACTCCTMAAG AGAACACCAA GAAAAAGACT ACAAGCACAA ACTGTGGCAG AACAGAGT1A CGTTCGGCAG AACCAACTTG TrGAA.AATCA AAGCAGTTGG AAAGTCTACA AACAGCAACG CAAGCAGAAG ATTAACATTG AAAAATAGTG ACACTI'AAAT ACTAGTACAG CCAAGGTGGT TTCAGAAAAG CGCGCTTGTG ATIATTGGTC AAGTAAATCG AAGAAGTAA rTTrccTCTGG
ACAACGGCAG
AAGGTTCCGA
AAGGTTACTT
TTTCCAGAGC
TAGCTAA=r
CATCTATCCA
GAGATT=A
Cr-ACACCAAT
ACGAATATGC
ATACGGAAA.A
ACTI'TCTCA
CGACAGTTGA
CATGGGTAGG
GTAATCAAGG
TTAATTTGGG
ATCGGATGAT
GG;ATTAGTTT
TGAATACGCT
ACTTTGTTAA
AGACTGTTGA
TCGCAGCTCT
CCATTGAAGC
-ACTGTATGC
GCITAACTCT
GCATI'CTGA
TAAATAGTGC
TAACGAAAGA
ACACCGCAAG
ATCAA-ACAAC
TGAAGCAATT
TCAAAGAAAA
GTGAAGCATT
TAGACCAAGC
GGGAAAATAA
GAGMITATCA
CTACCCAATT
TCTAGTTAT
rGATGGCT-TA
TTACGGGAAT
CTGGGAGATG
ACAATTGATG
CCCAGAAACG
AACCTTAATT
AACGTGGCTT
AGCGTTAATT
GCAATCTTTA
GGTGGGGATC
TAAAAAAGC
GTTTGATGAA
CAC-AAGAGAA
TGGCTCAAAT
AATGAACTTA
AAGTGGCATT
TCATCCACAG
AAGCGCAGCG
TGATGTTCAA
TGATAAAACC
AAATGGTACT
TCTAAAAAC
AGGTAATTAT
CTATGATCCA
TGCGAATCCT
AGTAG.TCGCA
GGGGTTAAAT
AGCGTT AGAA
CAGCTGGAAA
AGCAACGCA.A
GGTAAAAGTG
CGAGAAACAC
GAAGCAAGCG
AGAAGCAAAA
AAAGGAGCAA
AACTGGTACG
T'ITACCGAGC
CGCCAGTGGG
S.
S.
S
555 S S
S
a.
S
S
S S a.
EF086-2 (SEQ ID NO:326)
IVGLANWFRA
DIS IT'JLTG
LLKGFGRIQT
PFTI'EFESGK
LKNVVNSASP
HTADGMLYLY
GQVAS IGMFL
VKLNQGSDKD
TNTFkAKISKN
MMNTWNNDQE
DPEISVDQNI
YSEALKQAQT
ALTDTLI LU-
LLLEDNQRLV
ILQGSDWEMN
ETIANLTLIA
AQATPMQSLN
NQDFAQFDEG
DKSNEGMNLV
NSWISLSAAX
YGKTVENGTY
IAGLYAYDPM
ITLNSAGLNG
VADQTTATQA
ODLLNTDAE(
QVQE-AVPSVL
DPNISNLFNV
KFAPENLRND
VYGSMDRVLQ
YWATIDPYRL
AKI(SWFLLDG
PLNNICYVFP
EYLTVVGKTN
SVISEKIDNG
SSRSI IVKTT
EVDQAETE-LR
LNKFTAPLML
KSVSSGDGLY
VDKGYLQLMV
IYTS IQTWLQ KNNEYAVG IS
PGITVDTREL
QI INLGSGIT
NSMNTLDVQI
EEIAALSKNK
VYRLTLANPL
PEVTKEA-LEK
SAVKQLVKVP
YAI(DPNIQWP
PDCGSLSIQHGY
NGKHPSMVSG
QSGSYYHFFK
MYSQRVGNYE
ANGAYTGKRS
GTDAS IETI
EERSGRYGDI
GYT'JLENTAN.
QNNASVSIEF
L IQEQKE{QE TK'tIMKTNLL I YRATG-ANLT
FPYNGSYGNE
RS ISRAPETN KPRDFEAkLID
FGNTENKKGW
PQSWVGGSNN
LDNRMI HPQE
NEYFVNDKTY
LQAIE-AGNY-V
DKGILEVVAA
KDYTASSWKV
KI IKENEKHQ TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
EKDYTASSWX, VYSE-ALKQAQ TVADQTTATQ AEVDQAEAKL RSAVKRLTLK
NSGENKKEQK
NGGNNGHLNT STGVDQTGTK QVKPSSQGGF RKASQFLPST G;EKKSIALVI
IGLLVIASGC
LLVFRKSKSK
K
EF086-3 (SEQ 10 NO:32 7 ACCAGAAALA TTAAGAA.AT GACATTTATA CATCTATCCA
AACGTGGCITT
cAACAAAGTG GGTCATACTA TCAT-rTT'r AAAAAACCAA
GAGA'TTGA
GACTTGAAAA ATCTAGTGAA TAGTGCGTCA CCTG-CCCAAG
CGACACCAAT
AATGTATATG GTTCGATGGA TCGAGTCCTA CAGAAAAATA
ACGAATATGC
AGTATGTATT CACAACGTGT CGGAAACTAT- GAA'11 GGGA ATACGGAAAA TGGCATACAG CAGACGGCAT GCTTTATTTA TACAATCAAG
ACTITGCTCA
GGATACTGGG CAACGATCGA TCCATATCGA TTACCAGGAA
CGACAGTTGA
TTGGCAAATG GTGCTTATAC AGGGAAACGC AGTCCCCAGT CATGrGGTAcr
AAT
GCAATCTTTA
GGTGGGGATc
TAAAAAAGGC
GTTTGATGAA
CACAAGAGAA
TGGCTcAA-AT EF086-4 (SEQ ID NO:32 8 PENLRND IYTSIQTWLQ QSGSYYHFFK
KPRDFEALID
LKNVVNSASP AQATPMQSLN VYGSMDRVLQ
KNNEYAVGIS
HTADGMLYLY NQDFAQFDEG YWATIDPYRL
PGTIVDTREL
MYSQRVGNYE FGNTEN1CKGW AINGAYTGKRS
PQSWVGGSNN
EF087-1 (SEQ ID NO:32 9 4
TAACTGGTGG
CATGATGACC
CTGTATCCAA
ACAGATATTT
GTACAAGTAC
TATCCTGATG
GAGTTGCTAA
AATGACCCTA
GTAAATGGAA
AATCCTTTTA
GcAAAATTXG
CAACAAAGTC
GACTTGAAAP
AATGTATATC
AGTATGTA19
TGGCATACAC
GGATACTGGC
TTGGCAAATC
AALTGGACAG(
GTTGCTA;Lk
ACTGGTACG.
GAAGTGAAG
ANTCCATTG
ATAGAAGAA
TATACAAAT
TACGAATA'I
AAAGGCTA7
GTCATGATC
ATGTCGGT]
TTACAWAN
GATTGCCAAA TJ TATTGAATAC
AC
AAGATCCAAA
C~
CAATCACCGTT
AAGAAGCTGT Ti GTTCCTTGAT
T
AAGGGTTTGG
A
ACATTAGT-
T
AAATGccATC
G
cTACAGAGTT
TI
CACCAGAAAA GGTCATACTA
I~
ATGTAGTGAA GTTCGATGGA
CACAACGTGTC
CAGACGGCAT
CAACGATCGA
GTGCTTATAC
STTGCCTCTAT
N. AALTCTTGGT1' A, CAGATGCTTC c: TrAAcCAAGG A ATAAcA'ITGG C GCTCTGGTCG 'A CATTI'GCTAA yr TAA.CAGTGGT ,A CTGTICTAGA ;A ATACATGGAA .A TITCAGAAAA C'A ATGCATCCGT EGGTTCCGC GC 3ATGCAGA6A bLTACAATGG
CC
rTAGGTACT
GC
:CCGTCCGT
CAACATGGT
T
cGAATTCAG A TTATTTAAT
G
,ATGGTTTCT
G
GAATCGGGT
TTALAGAAAT
G
~CATTTCTT 7AGTGCGTCA
C
M'GAGTCCTA
C
:GGAAACTATC
.CTTTATI'TA
rCCATATCGA
AGGGAAACGC
AGGAATGT
CTTATTAGAT
GAT1'GAAACA
TCAGACAAA
CTATGTTTTT
CTACGGAGAT
AATTAGTAAA
TGGGAAAACG
AAATACAGCA
TAATGACCAA
AATTGATAAC
TTCTATTGAA
AGCGrCTAA CAGATACATT
G)
A'ITAAATA AATTTACTGC
TC
:AATTTATC GTGCA.ACAGG
AC
;ACTTICT TAGAAGATAA
T
7AAAAAGTG TTrCCTCTGG Ti =TPTCCGT ACAACGGCAG
T
:TATTI'TAC AAGGTTCCGA
C
rTGTGGATA AAGGTT.ACTT
A
GTAGAAGTA TTTCCAGAGC
G
MLGAAAcAA
TAGCTAAITT
ACATTTATA
CATCTATCCA
AAAAACCAA GAGA'rTGA
P~
CTGCCCAAG CGACACCAAT
C
AGAAAAATA ACGAATATGC
C
AATTTGGCGA
ATACGGAAAA
'ACAATCAAG
ACTTTGCTCA
.'TACCAGGAA
CCACAGTTGA
GTCCCCAGT
CATGGGTAGG
n1AGATAAAA
GTAATGAAGG
-GTCAAATCA
TTAATTGGG
MLTCCTCGATA
ATCGGATGAT
GATAATTCTTI'
GGATTAGTTT
:CTAArTTCNA
TGAATACGCT
AT'rAACGAAT
ACTTTGTTAA
AATTATGGCA
AGACTGTTGA
AATGAAGAAA
T-CGCAGCTCT
AACTTACAAG;
CCATTGAAGC
G;AAATTGCAG GACTGTATGc G;GTGT'1rATC GCTrAACTCT TTT1GATAAGG GCATrCTTGA TTFTAT'rA
.CGCTGATG
;VcTAACTTA
.AACGCCTA
.,ATGGC
ITA
TACGGGAAT
TGGGAGATG
CAATTGATG
,CcAGAAACG
.ACCTTAAT
LACGTGGcTT
~GCGTTAAT
.CAATCTTTA
,GTGGGGATC
I.AAAAAAGGC
;TTTGATGAA
CACAAGAGAA
TGGCTCAAAT
AATGAAcTI'A
AAGTGGCATT
TCATCCACAG
AAGCGCAGCG
TGATGTCAA
TGATAAAACC
AAATGGTACT
TCTAAAAAC
AGGTAATTAT
GTATGATCCA
TGCGAATCCT
AGTAGTCGCA
TABLE 1. Nucleotide and Amino Acid Seqeueflces of E. facalis Genes.
GCGGACCcAG
GGCAGCTCC
AAATTAATTC
GTCTACAGCG
GCAGAAGTAG
CCAACTAAAG
cA.AGAAAAAG
CAAACTGTGG
CTACGTTCGG
AAAAATGGGG
AAAcAAGTTA
ACAGGAGAALA
TGTCTTTTAG
AAATTvTCTGT
GTTCAATCAT
AGGAACAAAA
AAGCATTGAA
ACCAAGCAGA
AAGTAGATAA
ACTACACCGC
CAGATCAAALC
cAGTGAAGCG
GGAATAATGG
AGCCATCAAG
AGAAATCGAT
TTTTTCGTAA
TGACCAAAAT
ATTATCACTT
TGTTAAAACA
AGAACACCAA
ACAAGCACAA
AAcAGAGTTA AACCAACT'rG
AAGCAGTTGG
AAAGCAACG
ATTAACATTG
ACACTTAAAT
CCAAGGTGGT
CGCGCTTGTG
AAGTAAATCG
CAAAAAGACT
ACTGTGGCAG
CGTTCGGCAG
TI2GAAAATCA
AAAGTCTACA
CAAGCAGAAG
AAAAATAGTG
ACTAGTACAG
TTCAGAAAAG
ATTAT'rGGTC
ACAAGTAA
TAAATAGTGC GGGGTTA.AAT TAACG-AAAGA
AGCGTTAGAA
ACACCGCAAG
CAGCTGGAAA
ATCAAkACAAC
AGCAACGCAA
TGAAGCAATT
GGTAAAAGTG
TCAA-AGAAAA
CGAGAAACAC
GTGAAcCATT
GAAGCAAGCG
TAGACCAAGC
AGAAGCAAAA
GCGGAAAATAA
AALAGGAGCAA
GAGTTGATCA
A.ACTGGTACG
CTAGCCAATT
TTTACCGAGC
TTCTAGTTAT
CGCCAGTGGG
EF0872-(SEQ ID NO:330)
LVGLANWFRA
DIS ITVLGTG
LLKGFGRIQT
PFTTEFESGK
LKNWVNSASP
HTADGMLYLY
GQVAS IGMFL VKL NQGSDKD TNTFAKI SKN
MM~NTWNNDQE
DPEISVDQNI
YSEALKQAQT
EKDYTASSWK
NGGNNGHLNT
ALTDTLILLHI
LLLEDNQRLV
ILQGSDWEMN
ETIANLTLIA
AQATPMQSLN
NQDFAQFDEG
DKSNEGMNLV
NSWISLSAAX
yGKTVENGTY
IAGLYAYDPM
ITLNSAGLNG
VADQTTATQA
vysrEALKQAQ
STGVDQTGTK
DDLLNTDAEKI
QvQEAVPSVL
DPNISNLFNV
KFAPENLRND
VYGSMDRVLQ
YWATIDPYRL
AKKSWFLLDG
PLNNIGYVFP
EYLTVVGKTN
SVISEKIDNG
SSRSI IVKTT
EVDQAETELR
TVADQTTATQ
QVKPSSQGGF
.NI(F1APLMLJ
(SVSSGDGLY
IDKGYLQLV
IYTSIQTWLQ
KNNEYAVGIS
PGTTVDTREL
QI INLGSGIT NSMTLDVQ
I
EE-IAALSY.NK
VYRLTLANPL
pEVTKEALEK
SAVYKQLVKVP
AEVDQAEAKL
RK(ASQFLPST
YAKIDPNIQWP I PDGSLIQ{GY
I
NGKM-PSMVSG
QSGSYYHFFK
i4YSQRVGNY-E ANGAY-TGKRs GTTD ASIETI
EERSGRYGDI
GYTVLEI\TAN'
QNNASVS IEF
LIQEQKEHQE
TKEVIDKTNLL
RsAVKRLTLK GEKXS IALVI :yRATGANLT
~PYNGSYGNE
ISI SRAPETN KPRDFEAL ID
E'GNT-ENKKGW
PQSWVGGSNN
LDNFRMIHPQE
NEYFVNDKTY
IQAIEAGNYV
DKILEVVAA
KDYTASSWKV
KI1IKENEKHQ
NSGENKKEQK(
IGLLVIASGC
4
S
S 9 a
S
a a *9 LLVFRKSKSK K EF087-3 (SEQ ID NO:331) A ATCGGATGAT
TCATCCACV
GAAGTGAAGC
TTAACCAAGG
ANTCCATrGA
ATAACATTGG
ATAGAAGAAC-
GCTCTGGTCG
TATACAAATA CAIrCTAA TACGAATATI
TAACAGTGGT
AAAGGCTATA
CTGTTCTAGA
GTCATGATGA
ATACATGGAA
ATGTCGGT'TA TT1'CAGAAAA TTACAAAATA
ATGCATCC
TrTCAGACAAA GATAATTCTTI
CTATGTI'TT
CTACGGAGAT
AATI'AGTAAA
TGGGAAAACG
AAATACAGCA
TAATGACCA
AATTGATAAC
ATAACGAAT
AATTATGGCA
AATGAAGAA
AACTTACAAG
GAAATTGCAG
GGTGTTTATC
GGATTAGTTTI
TGAATACGCT
ACTTTGTTAA
AGACTsTTGA TC-CAG CT1CT CCA 1TGAAGC
GACTGTATGC
GCrAACTCT
AAGCGCAGCG
TGATGTTCAA
TGATAAAACC
AAATGGTACT
TT~CTAAAAAC
AGGTAAT1TAT
GTATGATCCA
TGCGAATCCT
EF087-4 (SEQ ID NO:3 3 2 NRM4IHPQE VKLNQGSDKD NSWISLSAAX pLNN'IGYVFP NSb4NTLDVQI EERSGRYGDI
NEYFVNDKTYI
TNT1FAKISKN YGKTVENGTY EYLT'JVGKTN EEIAALSKNX GyTVL.ENTAN
LQAIEAGNYV
MMN~TWNNDQE IAGLYAYDPM SVISEKIDNG VYRLTL.ANPL
QNNAS
EFO88-1 (SEQ ID NO:3 3 3 180 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faccalis Genes.
TAACTGGTGG
CATGATCACC
CTGTATGCAA
ACAGATMTT
GTACAACTAC
TATCCTGATG
GAGTTGCTAA
AATGACCCTA
GTAAATGGAA
AATCCTTTI'A
CCAAAATTTG
CAACAAAGTG
GACTTGAAAA
AATGTATATG
AGTATGTATT
TGGCATACAG
GGATACTGCG
TTGGCAAATG
AATGGACAGG
GTTCCTAAAA
ACTGGTACGA
GAAGTGAAGC
ANTCCATTGA
ATAGAAGAAC
TATACAAATA
TACGAATATT
AAAGGCTATA
GTCATGATGA
ATGTCGGrA
TTACAAAATA
GCGGACCCAG
GGCAGCTCGC
AAATTAATTC
GTCTACAGCG
GCAGAAGTAG
CCAACTAAAG
C.AAGAAAAAG
CAAACTGTGG
CTACG~rCGO
AAAAATCGG
AAACAAGITA
ACAGGAGAAA
TGTC=TAG
CATTGGCAA.A
TATTGAATAC
AAGATCCAAA
CAATCACCGT
AAGAAGCTGT
GtTCCTTGAT AAGGG ITTGC
ACATTAGTAA
AAATGCCATC
CTACAGAGT'
CACCAGAAAA
GGTCATACTA
ATGTAGTGAA
GTTCGATOGA
CACAACGTGT
CAGACGGCAT
CAACGATCGA
GTGCTTATAC
TTGCCTCTAT
AATCTTGGTT
CAGATGCTTC
TTAACCAAGG
ATAACATTG
GCTCTGGTCG
CATTTGCTAA
TAACAGTGGT
CTGTTCTAGA
ATACATGGAA
TTCAGAAAA
ATGCATCCGCI
AAATTrTCGI
GTTCAATCAI
AGGAACAAA-a
AAGCATTGAP
ACCAAGCAG;
AAGTAGATA-Z
ACTACACCGC
CAGATCAAAC
CAGTGAAGCC
GG.AATAATGC
AGCCATCAAC
AGAAATCGA'
TTTTCGT;J
TTGGTTCCGC
AGATGCAGAA
CATACAATGG
TTTAGGTACT
TCCGTCCCGTT
TCAACATGGT
ACGAATTCAG
11TATTTAAT
GATGGTTCT
TGAATCGGGT
ITTAAGAAAT
TCATTTCTTT
TAGTGCGTCA
TCGACTCCTA
CGGAA-ACTAT
G-CTTATTTA
TCCATATCGA
AGGGAAACGC
AGGAATGT'T
CTTATTAGAT
GATTGAAACA
TTCAGACAAA
CTATGTTTTT
CTACGGAGAT
AATTAGTAAA
TGGGAAAACG
AAATACAGCA
TAATGACCAA
AATTGATAAC
TTCTATI'GAA
TGACCAAAAT
TGTTAAAACA
ACAACACCAA
ACAAGCACAA
AACAGAGT'A
AACCAACTTC
AAGCAGTTGC
AACAGCAACC
ATrAACATTC
ACACTTAAAI
CCAAGGTGGI
r' CGCGCTIlGTC k AAGTAA-ATCC
GCAGCGCTAA
AAATTAAATA
CCAATTTATC
GCACTTTTGT
TTAAAAALGTG
TATTTTCCGT
ACTATTTTAC
GTTGTGGATA
GGTAGAAGTA
AAACAAACAA
GACATTTATA
AiAAACCAA
CCTGCCCAAG
CAGAAAAATA
GAATTTGGGA
TACAATCAAG
TTACCAGGAA
AGTCCCCAGT
TTAGATAAAA
GGTCAAATCA
ATCCTCGATA
GATAAI'TCT
CCTAATTCNA
ATTAACGAAT
AAT'IATGGCA
AATGAAGAAA
AACTrACAAG
GAAATTGCAG
GGTGTTTATC
T=IATAAGG
ATTATCACTT
ACTCCTGAAG
*GAAAAAGACT
*ACTGTGGCAG
CGI'GGCAO
TTGAAAATCA
1AAAGTCTACA
CAACCAGAAC
AAAAATAGTC
ACTAGTACAG
TCAGAAAAC
ATTATTGGTC
AAGAAGTAA
CAGATACATT
AATTTACTGC
GTGCAACAGG
TAGAAGATAA
=rCCTrCTGG
ACAACGGCAG
AAGGTTCCGA
AAGG7TACTT
TTTCCAGAGC
TAGCTAATTT
CATCTATCCA
GAGATTTTGA
CGACACCAAT
ACGAA'VATGC
ATACGGAAAA
ACTTTGCTCA
CGACAGTI'GA
CATGGGTAGG
GTAATGAAGG
TTAA7TTGGG
ATCGGATGAT
GGATTAGTTT
TGAATACGCT
ACTTT GTTA.A
AGACTGTTGA
TCGCAGCTCT
CCATTGAAC
GACTGTATGC
GCITAACTCT
GCATTCTI'GA
TAAATAGTGC
TAACGAAAGA
*ACACCCCAAG
ATCAAACA-AC
TGAAGCAATI
*TCAAAGAAAA
GTGAAGCATI
TAGACCAAGC
GCAAAATAP
GAGTTlGATCP CTAGCCAAT7
*TTCTAGTTAI
CATTTTATTA
TCCGCTGATG
ACCTAACTTA
TCAACCCCTA
TGATGGCTTA
'FrACGGGAAT
CTGGGAGATG
ACAATTGATG
GCCAGAAACG
AACCTTAATT
AACGTGGCT
AGCCTTAATT
GCAAT'CTIA
GGTGGGGATC
TAAAAAAGGC
GTTTGATGAA
CACAAGAGAA
TGGCTCAAAT
AATGAACTTA
AAGTGGCATT
TCATCCACAG
AAGCGCAGCG
TGATGTTCAA
TGATAA.AACC
AAATGGTACT
TTCTAAAAAC
AGGTAATTAT
GTATGATCCA
TGCGAATCCT
AGTAGTCGCA
GGGGTTAAAT
AGCGTTAGAA
CAGCTGGAAA
AGCAACGCAA
GGTAAAAGTG
CGAGAAACAC
GAAGCAAGCG
,AGAAGcAAAA
,AAAGGAGCAA
,AACTGGTACG
TITACCGAGC
CGCcAGTGGG EF088-2 (SEQ ID NO:334)
LVGLANWFRA
DISITVLGTG
LLKGFGRIQT
PFTTEFESGK
LKNVVNSASP
HTADGMLYLY
GQVASIGM2FL
VKLNQGSDKD
ALTDTL.ILLH
LLIJEDNQRLV
ILQGSDWEMN
ETIANLTLIA
AQATPMQSJN
NQDFAQFDEG
DKSNEMN~LV
NSWISLSikX
DDLLNTDAK
QVQEAVPSVL
DPNISNLFNV
KFAPENLRND
VYGSMDRVLQ
YWATIDPYRL
AKXSWFLLDG
PLNNIGYVFP
LV1FnAPLML
KSVSSGDGLY
VDKCYLQLMV
IYTSIQ IWLQ KNNEYAVG IS
PCTI'VDTR:EL
Q IINLGSGIT
NSMNTLDVQI
YAXKDPNIQWP
PDGSLIQHGY
NGKMPSMVSG
QSGSYYHFFK
MYSQRVGNYE
ANGAYTGKRS
GITDAS IETI
EERSGRYGDI
iyRATGANLT
FPYNGSYCOE
RSISRAPETN
KPRDFEALID
FGNTENKKGW
PQSWVCGSNN
LDNRMIHPQE
NE-YFVNDI<Ty 181 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes.
TNTFAKISKN
MMTWNNDQE
DPEI SVDQNI
YSEALKQAQT
EKDYTASSW(
NGGNNGHLNT
LLVFRKSKSK
YGKTVENGTY
IAGEJYAYDPM
ITLNSAGLNG
VADQTTATQA
VYSEALKQAQ
STGVDQTG"X,
EYLTVVWGKTN
SVI SEK lONG SSRS IIVKTT
EVDQAETELR
TVADQTTATQ
QXTKPSSQGGF
EEIAkALSKNK
VYRLTLANPL
PEVTKEALEI(
SAVKQLVKVP
AEVDQAEAKL
RKASQFLPST
GYTVLENTAN
QNNASVSIEF
LIQEQKEHQE
TXEVDKTNLL
RSAVKRLTLK
GEKKS IAIVI LQAI EAGNYV
DKGILEVVAA
KDYTASSWKV
KI IKENEKNQ
NSGENKKEQK
IGLLV IASGC EF088-3 (SEQ ID NO:335) A ACTCCTGAAG TAACGAAAGA AGCGTTAGAA AAATTAATI'C AGGAACAAAA AGAACACCAA dAAAAAGACT GTCTACAGCG AAGCATTGAA ACAAGCACAA ACTOTOCCAG GCAGAAGTAG ACCAACCAGA AACAGAGTTA CGTTCGGCAG CCAACTAAAG AAGTAGATAA AACCAACTTG TTGAAAATCA CAAGAAAAAG ACTACACCGC AAGCAGTTGG AAACTCTACA ACAC CGCAAG
ATCAAACAAC
TGAAGCA.ATT
TCAA.AGAAAA
GTGAAGCAF1'
CAGCTGGAA-A
AGCAACGCAA
GGTAAACTG
CGAGAAACAC
GAAGCAAGCG
CAAACTGTGG
CTACG'ITCGC
AAAAATGGGG
AAACAAGTTA
CAGATCAAAC
CAGTGAAGCG
GGAATAATGG
AGCCATCAAG
AACAGCAACG
ATTAACATTG
ACACTTAAAT
CCAAGGTGGT
CAAGCAGAAG
AAAAATAGTG
ACTACTACAG
TTCAGAAAAG
TAGAC-A-AGC AGAAGCAAAA GGGAAkAATAA
GAGTTGATCA
CTAGCCAAITT
AAAGGAGCAA
AACTGGTACG
TTTACCGAGC
ACAGGAGAAA AGAAA EF088-4 (SEQ ID NO:336) T PEVTKEALEK LIQEQKEHQE KDYTASSWKV YSEALKQAQT VADQTTATQA EVDQAETELR SAVKQLVKVP EKDYTASSW< VYSEALKQAQ IVADQTTATQ AEVDQAEAKL NGGNNGHLNT STGVDQTGTK QVYPSSQGGF RKASQFLPST
TKEVDKTNLL
RSAVKR:LTLK
GEKK
KI IKEN'EKHQ
NSGENCKEQK
9 9 .99.*
C
EF089-1 (SEQ ID NO:337)
TGACAGATAC
TATAGGTCAA
AATGACAGAC
TGTATGTTAT
ACACCAACAA
GCGCCTGGTG
ACCA'ITGAAA
CAAAACGGGA
CCGAAAGAAA
CCTAAAGATT
GAAACAACGA
GTTGTGGCTA
GGGGTTAAAC
CAAGCGGCCT
CTI'TACCAAT
ATTrCTTTAA
GGTGTAAAAG
CTGTACAAAT
AATGAAAAAG
ATCATTCTAG
GAACAACAAT
ACCTGCTAAC
AAATTTTTTG
ATAGGAGAAT
ETGGCTGGAT
ITCCCGAAAA
CCAAACAAAC
ATACGGTGA.A
TCAAACCTGA
TCATCTTGCC
CT'=TTATGG
CTTCTGCGGA
ETATTCTrCA
CAGGCCAAGT
ATTI'AAACCA
CCGATACTGA
AAGGGGAACG
ATGAAAAGGG
GGGAATrTAC
ACCTAACCAT
CGCTG.CTCTT
CTGAGCAATA
ACAGGAAACT
GCTTATCTT
GAATATGAAC
TGGCGTGGAG
TCAAGTG-GAT
CGTAGAAATT
CTCAGCGACA
CAAAACCT1'A
GAAGCATTCC
CGTGATGGCT
TCAATCAAAA
GCAAAATGAG
CAACGCGCGA
ATTACATITA
GGATATGCAA
ATTAACGCCA
CACCTATCAA
AAAAGAAT=
TAAAGGAACC
ATTGATITC
A
AAGAACGACA
CGGTCTTTG
AGATGGAAAG
GCGCACGCTT
AAATCAAAAA
CAGITACGCA
ACAAATTTAA
CG T=IAACT
CAAAAGACCT
GGCGGTATAA
GGGTI'AGCTA
ACAAAGGTTC
AACGTCATCA
ATCAACACTG
GTGGCGCCAA
GGAAAATATG
GTCAAAGGCG
ACTATTTCTG
AATTGGTGGT
TTCTTGTATC
CCATACACGC
GTCCTTATAA
TATATGCAAC
CTGAA=AA
CCTACT=TA
AT GATACAGA
ATGGCGTAGT
TAAAAGA'ITA
TACCTTTAAC
CACTCAAAGA
TTAATAATGA
AACCAGATT
ATGT=CT'Fr
TTTCAAAAGG
ACTCTAAC'rr
TCT'TGAAATC
CCAATGGTGA
GGGACGTGC
TGTATCT'ACT
GTAAAAAGAA
AAGATCGGGA
TACAACAAAG
GGTAATCGCT
TITTGCGGTC
CTTAA.AAATG-
TGAAGACATT
AGAATATGGC
TGTGGAAGCA
CATTACGATG
GAAAAAGAAA
ATACTCCTAT
AAAATTACTG
ACAAAACCCA
AGGCGAAACG
TAGTTACCCA
AACGGCCTAT
AGAACGGTAC
TAAAGAATTA
GATTGCATTA
AAAAGAGGAA
EF089-2 (SEQ ID NIO:338) 182 TABLE 1. Nucleotide and Amino Acid Scqeucnces of E. faecalis Genes.
MNR WKWVATVIAC MLFGWIGVEA HASEFNFAVT IEN'IVNSATT NLNGVVEYGQ KDSFDGVMAG GITLKEKKKE VKPGQVNARN VINVSLQNPQ SLXGERLTPG KYVLKSTAYG EKDVTIKGTN WWLYLL IAL I
PTIPENQVDK
NG IKPCKTLR
TTTSADQSKG
AAYLNQLHLI
VKDEKGTYQV
ILALLLLIFF
SKTYFDLKMA
FNLKDYVEAP
LAINNEYSYV
NTVSKGGETL
KGANGEERYL
LYRKKKKEEE
PGA(Q'VEI Q
XEIILPKHSQ
VAI ILQQNET
YQSDTEDMQV
YXWEFTK-FT
QQSEQ
LRNDTDED IT
KTLPLTITMP
KVQPDLKLLG
AFNSNFSYP I
ISCDVAKELN
EF089-3 (SEQ ID NO:339) T CTGAA TTAA TTTTGCGGTC
ACACCAACAA
GCGCCTGGTG
ACCATTAAA
CAAAACGGGA
CCGAAAGAAA
CCTAAAGATT
GAAACAACGA
GTTGTGGCTA
GGGG'ETAAAC
CAAGCGGCCT
CTTTACCAAT
ATTTCTTTAA
TCCCGAAAA
CCAAACAAAC
ATACGGTG;A
TCAAACCTGA
TCATCTTGCC
CTTTTGATGG
CTTCTGCGGA
TTATTCTTCA
CAGGCCAAGT
ATTIAAACCA
CCGATACTGA
A.AGGGGAACG
TCAAGTGGAT
CGTAGAAATT
CTCAGCGACA
CAAAACCTTA
GAAGCATTCC
CGTGATGGCT
TCAATCAAAA
GCAAAATCAG
CAACGCGCGA
ATTACATTTA
GGATATGCAA
AT
AAATCAAAAA
CAGTTACGCA
ACAAATTTAA
CGTTITAACT
CAAAAGACCT
GGCGGTATAA
GGGTTAGCTA
ACAAAGGTTC
AACGTCATCA
ATCAACACTG
GTGGCOCCAA
CCTACTTTGA
ATGATACAGA
ATGGCGTAGT
TAAAAGATTA
TACCTTTXAC
CACTCAAAGA
TTAATAATGA
AACCACATT
ATGT=C=T
TTTCAAAAGG
ACTCTAACTT
CTTAAAA.ATG
TGAAGACATT
AGAATATGGC
TGTGGAAGCA
CATTACGATG
GAAAAAGAAA
ATACTCCTAT
AAAATTACTG
ACAAAACCCA
AGGCGAAACG
TAGTTACCCA
EF089-4 (SEQ ID NO:340) S Sc. SEFNFAVT PTIPENQVDK SKTYFDLKMA PCAXQ'IVEIQ LANDTIDEDIT IENTVWSATT NLNGVVEYGQ NGIKPDKTLR FNLKDYVEAP KEIILPKHSQ KDSFDGVMAG GITLKEKKKE TTTSADQSKG L AINNEYSYV VAIILQQNET VKPGQVMARN VINVSLQNPQ AAYLNQLHLI NTVSKGGETL YQSDTEDMQV
SL.KGER
KTLPLTITMP
KVQPDLKLLG
APNSNFSYP I EF090-1 (SEQ ID NO:341)
SS
a
*S.S
S
aS S S a.
TAGTCTCTAA
GAAATGCAAA
ATTATTACTT
AGTCACTITT
GCTGCAGAAG
ACAAAAGAAG
GACCA TTAC
GAAGCCAAAT
CGTCGAGGCA
CAGCGCTTAA
AATGCCAAAG
ACATTGACAA
AAAGTAGCCT
AAAGAAAAAC
CCAGTTAT
GGCTGGTCGA
GAACAAACGA
CCCAATAATT
AAAAAAATAG
GAAATAXACC
TGCGTAATAC
CFTGGTTGG
TNCCAACCAC
AAAAA'ITAC
AAAAAAT TCA
ACAGTAGCAA
TAGCAACTTT
ATGGCAC'IFT
ACCAGCAGAT
ATTTTTATAA
CCCTCAATAA
TTGATAAAAC
TAACTACTTG
TTACAGATGT
TTGATGGGGC
ATGCAATCAC
ACATTGAAAT
TCACCACAGA
TAAAATTATT
ACGTCGTCAA
ACTAGCTCTG
AAAAGTAAkAT
GCAAGTTAAT
ACTTCCTAAA
GGTGAAGCTA
GCGTTTTCCA
TGAAATTGT
TATTGCGGAT
AGCCCCTGAA
CAAGTTAAAT
ACAAATTCAA
GGTGACACAA
TCACGGCACG
CAAAACGCAA
TGCTCCGTTG
TGATTIAAAA
TGTCATTACT
GATATAAAGG
AAATCAGGAA
GTTGCTG-GGG
GGAGTTTCTG
CAAACCGAAG
AAATACCAAT
CCGTTAA-ACG
GAGGGGAMAC
CCCGAAGAAC
GTGAAGCGG
ATTACA-AAAG
AAAACAATA
AACGTGCTGA
TTAGAAACAA
ACACGTCG ET
GAACTACTAG
GTTGGTGATA
GATCAAAAAA
GGCAGATATA
ATCAACAAAT
PIAAATAATAA
GCAGTTATGT
TAGGCTGGT
AAGITMGGT
TGGATCAAAA
AGGCATTCAA
CAACCAAAAA
AAGGCACAGT
GAAAAGGCAA
AGGATCAAAC
CAGTTGATAT
ATCATGATGG
CATATGGTTC
TTAAAAACAA
TAAACGCGCT
CCAAAGAAAA
TGTA TGTTT
ACAAAGGAAC
GAAAAAAGAA
AAAGAAAGTA
TTATTI TCAA
AAATGTAAAT
TCAAACGGGG
AT ITTTAAAA
AAAAGAACTA
TGcGAGTATC
AGTGGACACA
CTATCAATAT
GTrAA.AGGCA
TAATGGTGAA
CACAATCAAC
TCCTAATCAA
CGGAAG'ITAT
GAATAGCCAA
TAGTAAAATT
CATTGATGGC
CGCAACAGTA
183 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
CCAGGATTCC
GATGGITCTC
GTGTCACAC
GAAGCCTTTA
TCAAAAATCT
GATGATGCAC
ATACAAI-lP
GATACAGTGT
AAATCGGGAT
AAACCCATGC
TTGATGTATC
CAGGTGAATT
ATATCGGACA
GCCAGTAAAA
TCATGACTCC
TGGTACTAAT
CTATGACGGA
TGATAAACCT
ACCCAI'GTGA
TATTGGATGC
GACCATAAAT
GGCTGTATCA
ATGCCGGTXA
GTAGAI'TACG
ATTTAG7LAGG
CGTTATTAAG
TGGATAAGTA
ATACGCCAGG
TTATTTATGG
GCGAAGAAGT
TCAAATGCTT
TCAAGGGGGC
TGGCGATAAA
AACAGA.AGTT
ACATATCTAT
ATAA
EF090-2 (SEQ ID NO:342) MRNTRRQK SGKNNXKKVI ITSLVGLALV AGGSYVYFQS
HFXPTTKVNG
HLHSSKVKLP
RLNQQ IIADV
VAFDKTQIQN
WS IDGAKTQE
KIVTTDVITG
VSVGWLNVNA
LNEAFKKELE
EACKGNYQYN
VLNODGTINK
LLVNALNSQE-
RYNKGTATVP
AEEKLAQVNQ
A.KLATLSFPE
AKDFYKAPEI
EKLTTWVTQL
QTNAITAPLV
GFH-TILYRTT
AFXTOAGSNG
TEEVVVQTGT
GKPSKNAS IR
TKEDQTLXAT
ETTYGSANQP
GDTKENSKIA
DVNLEGQMLD
CI NTPGTEVS KEEK IQLPKK
RGNGTFEIVP
LTTLN~kLNK
VLFTDVHGTT
NNYIEIDLKD
GSRYSVPVKY
KIFDVSYDGM
YQLDQKFLKD
EEQGTVVDTQ
TITVDINGEK
RRFKNNGSYG
QKMYCFIDGK
WMPLLSQGGV
PVIIYGHIYD
VTQ IGIHDSD HKLDKYGDKE DAPGEFDKPV DYGEEV EF090-3 (SEQ ID NO:343) CAC AAAAGTAAAT GGAGTTTCTG TACGCTGGTT AAATGTAAAT 9 9 9 9 *9*e
GCTGCAGAAG
ACAAAAGAAG
GACCATTTAC
GAAGCCAAAT
CGTCGAGGCA
CAGCGCTTAA
AATGCCAAAG
ACATTGACAA
AAAGTAGCCT
AAAGAAAA.AC
CCAGTTTTAT
GGCTGGTCGA
GAACAAACCA
GCCAATAATT
AAAAAAATAG
CCAGGATTCC
GATGGTTCTC
GTTGTCACAC
GAAGCCTTTA
TCAAAAATCT
GATGATGCAC
AAAAATTAGC
AAAAAATTCA
ACAGTAGCAA
TAGCAACTTT
ATGGCACTTT
ACCAGCAGAT
A'TFI'TATAA
CGCTCAATAA
TTGATAAAAC
TAACTACTTG
TTACAGATGT
TTGATGGGGC
ATGCAATCAC
ACATTGAAAT
TCACCACAGA
ATACAATTTT
GATACAGTGT
AAATCGGGAT
AAACCGATGC
TTGATGTATC
CAGGTGAATT
GCAAGTTAAT
ACTTCCTAAA
GGTGAAGCTA
GAG'ITTTCCA
TGAAATTGTT
TATI'GCGGAT
ACCCCCTGAA
CAAGTTAAAT
ACAAATTCAA
GGTGACACAA
TCACGGCACG
CAAAACGCAA
TGCTCCGTTG
TGATTTAAAA
TGTCATTACT
ATATCGGACA
GCCAGTAAAA
TCATGACTCC
TGGTAGTAAT
CTATGACGGA
TGATAAACCT
CA.AACCGAAG
AAATACCAAT
CCGTTAA.ACG
GAGGGGAAAC
CCCGAAGAAC
GTTGAAGCGG
ATTACAAAAG
AAAACAATTA
AACGTGCTGA
TTAGAAACAA
ACACGTCGTT
GAACTACTAG
GTTGGTGATA
GATCAAAAAA
GGCAGATATA
ACCGATGTGA
TATTGGATGC
GACCATAAAT
GGCTGTATCA
ATGCCGGTAA
GTAGATTACG
AAGTTGTGGT TCA TGGATCAAAA ATT AGGCATICAA AAA CAAGCAAAAA MC AAGGCACAGT AGT GAAAAGGCAA CTA AGGATCAAAC GTT CAGTTGATAT TAA ATGATGATGG CAC CATATGGTTC TGC TTAAAAACAA CGG TAAACGCGCT GAA CCAAACAAAA TAG TGTATTGTTT CM ACAA.AGGAAC CGC ATTTAGAAGG TC~ CGTI'A1 9 AAG TC TGGATAAGTA TGC ATACGCCAGG AAC TTA=FATGG ACI GCGAAGAAGT AT
AACGGGG
TTTAAAA
GAACTA
GAGTATC
OGACACA
TCAATAT
AAAGGCA
.GTGAA
AATCAAC
TAATCAA
AAGTTAT
*TAGCCAA
~TAAAATT
,TGATCGC
:AACAGTA
LAATGCTT
LAGGGGGC
;CGATAAA
AGAAGTI'
CTATCTAT
*993 9 *9 S
S
EF090-4 (SEQ ID NO:344) TKVbIG VSVGWLNVNA AEEKLAQVNQ TEEVVVQTGT KEEKIQLPKI< YQLDQKFLKD HLHSSKVKLP LNEAFKKELE AKLATLSFPE GKPSKNASIR RGNGTFEIVP EEQGTVVDTQ RLNQQIIADV EAGKGNYQYN AKDFYKAPEI TKEDQTLKAT LTTLNNXLNX TITVDINGEK VAFDKTQIQN VLNDDGTINK ELTTWVTQL ETTYGSANQP VLFTDVHGTr RRFKNNGSYG WSIDGAKTQE LLVNALTNSQE QTNAITAPLV GDTKENSKIA NNYIEIDLKD QKMYCFIDGK KIVTTDVITG RYNKGTATVP GFHTILYRTT DVNLEGQM~LD GSRYSVPVKY WMPLLSQGGV VTQIGIHDSD HKLDKYGDKE AFKTDAGSNG CINTPGTEVS KIFDVSYDGM PVIIYGHIYD 1 184 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
DAPGEFDKPV DYGEEV EF091-1 (SEQ ID NO:345)
TAATTGGNGG
GGAACAGCAG
rCATGATTTAT
GCTGTTCAAA
GATCAAGCCT
GCACAAGGC
GATCAA'IT1A
GCCGAAGATG
GAAACTG'TT
GCTGCAGAAG
GGGTAA
AGATTTTTAT
CAGCCGTTGC
CAAATCAAAC
AAGGAACAGA
CTGATTGGC
TTTCTGGCGA
AAAAAGCAGC
CAGCAGAAGA
CTGCTCCTT
CAAAAGAAGA
GGCTAAAAAA
CGCATTAITA
AGATGATTTA
ATTAACAGAA
AGGT'TCTGTC
CATGCTTGAT
TrGACGATGCT
TATCTATATT
CGATGAAGCA
TGTTAAAT
GGCGGATTTT
CTI'GCACCAA
AAAAACAAAG
ATCGCAAAAC
AAAGAAAAAA
AACTTTkAAA
CAAGATCACG
&AECGT-TAAAG
ANAGAAACCA
GCAGCGAAAG
TCTTAGGNGC
AATCAGGTAA
CCCAAGATTA
AAAAAGCCGG
CAAAAGATTC
AACA-AACAGG
CAGAAGATTT
ATTCTGCGGC
CCA-AAGATGT
ACGTAIAAAAA
AGTAATTGGT
AGAATTACGT
CACAGATTAT
CGTTTTATCA
ATTGGATAAA
TGATTTATCT
AGGTGAA-AT
AGCGGCCAAA
TCCTGAAAAA
AGAATTTAAA
EF091-2 (SEQ ID NO:346) MAKKC GFFLGAVIGC TAAAVAALLL APKSGKELRD OLSNQTDDU( NKAQDYTDYA VQKGTELTEI AKQKAGVLSD QASDLAGSVK EKTKOSLDKA QGVSGDMLDN FKKQTGDLSD QFKK.AADDAQ DHAEDLGE IA EDAABD IY ID VK.DSAAAAKE TVSAGVDEA.X ETTKDVPEKA AE-AKE-DVKDA AKDVKKEFKG EF091-3 (SEQ ID NO:347) AT CAAATCAAAC AGATGATTTA AAAAACAAAG CGCAAGATTA CACAGATTAT 90 0 0 0@ 00 0 9 0* 00 0 0 00 00 9 000 0 00 0 0 .00 0 GCTG TTCAAA
GATCAAGCCT
GCACAAGGTG
GATCAAT'rA
GCCGAAGATG
GAAACTGTITT
GCTGCAGAAG
AAGGAACAGA
CTGAI=TGC
TTTCTGGCGA
AAAAAGCAGC
CAGCAGAAGA
CTG-CTGGTGT
CAAAAGAAGA
ATTAACAGAA
AGGTTCTGTC
CATGCTTGAT
TGACGATGCT
TATCTATATT
CCATGAAGCA
TGTI'AAAGAT
ATCGCAAAAC
AAAGAAAAAA
AACTTTAAA
CAAGATCACG
GACGTTAAAG
ANAGA.AACCA
GCAGCGAAAG
AAAAAGCCGG CGTTTTATCA CAAAAGATTC ATTGGATAAA AACAAACAGG TGATTTATCT CAGAAGATTT AGGTGAAATT ATTCTCGGC AGCGGCCAAA CCAAAGATGT TCCTGAAAAA ACCTAAAAAA ACAATTTAAA
GGGTAA
EF091-4 (SEQ ID NO:348) SNQTDDLK NKAQDYTDYA VQKGTELTEI AKQKAGVLSD QASDLAGSVK E-KTKDSLDKA QCVSGDMLDN FKKQTGDLSD QFKKAADDAQ DHAEDLGEIA EDAAEDIYID VKDSAAAAKE TVSAGVDEAX ETI'DVPEKA AEAXEDVKDA AXDVKKEFKG EF092-1 (SEQ ID NO:349) 0* 0
TAAGGGGATG
AGCACGAGTT
GACATCTITG
GTTTTACTT
CCAAAAGGGA
AAAGTTTTAG
AAGAAAAAAT
TAT1'AGTAAC
CAGTATCGGC
TAGGTCCACA
TTCCTTTAGA
ATCAAGCAAT
GGCAAAAAAA
AAAAATGCAA
TTCTGAACCA
AGTTCGTTTC
TGTAATTAAC
CTCATTAATG
ACAATTATGT
AAAGCAGCAG
GATACAAACT
ATGAAAGGGC
ATGGCACA'T
GGATAA
TAG=1GTTC
AAGATCGTGG
TGGAAAATAA
AATTGAACA
ATGGCATGAT
CGCAGGAATG
CATGGAAGCA
AGAGGTGAAT
AAAATTACAA
GAATGGCGAA
0000 0 0000 00 0 00 00 EF092-2 (SEQ ID NO:350) MAKKT IMLVCSACMS TSLLVTKMQK AAEDRGMEAD IFAVSASEAD TNLENKEVNV 185 TABLE I. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
LLLGPQVRFN KGQFEQKLQP KG IPLDVINM ADYGKMNGEX VLCQAISLMG EF092-3 (SEQ 10 NO:351) AG AAGATCGTGG CATGCAAGCA GACATCTT-TG CAGTATCGGC 'TCAAGCA GT=TACTTT TAGGTCCACA AGTTCGTI'TC CCAAAAGGGA TTCCTI'TAGA TGTAATI'AAC AAAGTTTTAG ATCAAGCAAT CTCAITAATG
GATACAAACT
ATGAAAGGGC
ATGGCAGATT
GGAT
TGGAAAATAA
ATTAACA
ATGGCATGAT
AGAG'CTGAAT
AAAATTACAA
GAATGGCGAA
EF092-4 (SEQ ID NO:352) EDRGMEAD IFAVSASE-AD TNLENKEVNV LLLGPQVRFM KGQFEQKLQP KGIPLDVINM ADYGD2NGEK VLCQAISLMG EF093-1 (SEQ ID NO:353)
TAGTTFITT
TTCACGGCCA
CGTCAGTTAA
TTTCA'IIAT
CTCGTATACG
GTTTTXIATAA
AAAGCGGAAA
CCGATAAAGG
TGGCCATTGT
CACAAAAATT
TTCTTGCACA
CGCAAGTGAC
ACAAATCTGG3
CAATACTGGA
CAGAATTTTA
TTATTTGATG
TACGC:AAGCT
TOTTAAACAA
CCTCGATTAT
TCGAAA.ATAC
ATAG
ATGAGGCAAA
AGTTTTCTCG
ACCCAGGAAT
A.ATAGACGAA
ACATACAAAA
CAATATCAAG
AATA7CAGG
CCCTTCAGTT
ACTATGCAGG
AGTTAAAAAC
ATGAACAATT~
AGAGAGTTTC
AAACTTATTG
ACTAGAAGA.A
GAAAAGTATC
AGAAGAAAGG
AAGAATAACT
TCATCAAAAA
EF093-2 (SEQ ID NO:354) M RQKYSGNLLF TAMA.IVYLMS FLALQLLEER QLTQKFTQAT QEYYAGKS IF HLFLADVKQN RRKLKTEERL VYAQVTLDYT YKNEQLR ITV LLNKSGRKYQ YQERVSHQKK
AETILE
EF093-3 (SEQ ID 00 9. 6 @0 00 0 0 @0 00 S S 00 00 0S 0 0 @0 @0 0
S
CCTTrCAGrr ACTAGAAGAA CGTCAGTIAA CAcAAAAATT TACGCAAGCT TrTCAITPTAT TTCTIGCAGA TTAAACAA CTCGTATACG CGCAAGTGAC CCTCGATTAT GTTTATT AA ACAAATCTGG TCGAAAATAC AAAGCGGAAA CAATACTGG
ACCCAGGAAT
ALATAGACGAA
ACATACAAAA
CAATATCAAG
ACTATGCAGG
AGTTAAAAAC
ATIGAACAATT
AGAr-AG ITTC
GAAAAGTATC
AGAAGAAAGG
AAGAATAACT
TCATCAAAAA
EF093-4 (SEQ ID NO:356) 000 0 ::96.
0.*.0 LQLLEER QLTQKFTQAT QEYYAGKSIF HLFLA-DVKQN RRKLKTEERL VYAQVTLDYT
AETI
YKNEQLRITV LLNKSGRKYQ YQERVSHQKK EF094-1 (SEQ ID NO:357)
TAAACATTTG
ATTAATTATA
TTAAGTCAAA
CACCCTCAAG
TTI'AGCGATG
GCAAAACAAA
AGACA'ITCAG
TGAAAAAAAC
CAATTGGCGG
AGGTAACAAT
GCAGCACGCT
CAGTATTTTG
AGGTGAATGT CTCTTTTTrA AACATTTAAA AAITGGTCGT AACGATTGGT CCTACGATTG TCATTATGAC GTAAGTAAAC CTCAGAACCT ACTACGTCAT TATTGAACCA GGTGTTAGTA
TTACTCAAAA
TATTTr.CGAC
CTTTTGCCCGA
TGTATGAAGT
TATATGCAGA
TTCCAACAGA
ACGAAAGGGG
TTTGGCTCTA
TGAAATTACT
TGACGGAACT
ATACAATGGT
AGTCACGCAC
186 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
GGTTATCAGA
TGGGAAAAGG
GAAGTGAACG
AAATCTATTG
CATAATACCA
TTAAATTI'AT
ATTGGGAATC
AAAAAATCAG
ATGGCTGGTG
GGTTCTTTAA
TTCCATTTAG
A=ICTATTT
CCAGATCCTT
A=TCCATGA
GTAGAAACAC
ATTCGTAAAG
CGTGCGGAA-A
GAAACTAAAT
TATGCCAATC
ACTGGGGAAA
GCTGAGTITA
TGGAGTGAAG
ACTTTGGCTT
FI'CTGGCAAG
TCCATCAAAA
AAAGAACAAG
ACTGCCGAAA
GAAATCACAG
GGCTATGGTA
CCAGAAGGAT
GACCACTATG
ATTAAGATGG
AACCGTTTGA
TGGAAACACG
AGATATAACT
GAAGCCACAA
ACAACCGCAA
ACCGCTGAAC
GTAGCCTTTG
CAAACCr-TCA
GATGGTTCC
ACCCATGATG
CCAGATGGTA
AAkAGAATTTA
ACTAAGTTTA
AATGAAGATT
AGTACGCCAA
ACAGTGAAGA
TTTATCTTCA
AAAACCCTTT
CTGGAACAGA
GTTATA.AACT
AAGGAAAAAT
CTGTAAAAAC
CTGAGTTTGA
AATTAGTGCT
CTCGTACTGG
CGCTTGATAA
AGATCAAAAA
ATTMI'GGGAA
TGGATGGAAT
ATATGATTGA
CTTCGAAAAA
CTACTGATCT
ACAGCCCAGC
CACCA.AAAGA
CTAGTAATGG
AAACCGTGC
CCACTTI'GAC
AAGGAGCTGA
CTTTTAAAAC
TAGATGAAAA
AA.ACCAAAGC
AAGTTGATAA
TTATTCGCTT
CTGGATTTAA
GTCCTGAAGA
AGTTTGAAAA
TTCAAAAGAT
CGAAGACTGA
TGACCGT1'CC
TGCTTTATGA
GAAATAAAAA
TGCATGAAAT
AAGCTG3CCCA
CGTTGGCGAA
AAG~TGA
AAGCAGGCAA
ATTGTACGAT
A.AACTTTTAC
TACTGGATGG
CGAACAAAGA
CCAAAACCGT
CTF'TTACGGA
TGAAAGAAAA
AACAACCCGA
CATTCCCGCA
TTTTCGAC
GCCATCAATG
TATTGATACA
CCATTCCATA
TAATAAGGCA
AAT1'TTAGGT
TAAAGTCGTC
TACTCCAAAC
AACTCCAGTC
GCCCAATACC
AATCGATAAA
AGCT
T
ITACCT
TCCCCATGGT
TACCACACCC
TATGCGACAA
TTGGAATOAC
TGGTGAAATT
GCTTGCTAAT
TTT'CGTGAAT
TCTTGTTACC
AAAAGAAGAC
ATATACTCTC
AGAATTAGTG
GAACCAAGT
ACCTGAAGGA
TAACGAAAAA
TGGCTI'TGAT
CGACTTATCT
TAAAGCGACC
TCTTCCTTAT
'ACACCACTA
CTATGTCTT
TTACGAGAAA
TTTGCCCGAG
ATTGAATACC
CAAAGAAGAC
AGAAAAAGAC
CAAAGAGAAA
TAAAACCATC
TGAGCCAGtA
TGAACGCCAT
TCATGGTGAC
CTCAAAAGAA
AATTrGGA;A
ACTTGCGCAA
AATCAATTAC
ATCTCAAACC
AACACCAGCT
AACTGGGGAG
TGCTGGGTAT
TCTGATAAAG
AATATGGTTG
AAAAGATTAG
ATTGAGGAGT
CAATCGACAA
CAAAATACGG
TCTAAt'TCCA
GCTTATAAAA
TACGCTATTA
GAATCAGGTG
TCAAAAGATG
kCAAAGGTAA
ATGGCTGCCA
A.AAGGTCAAA
AATTATTCTC
GTCCA-AGAAA
GCITTTGGAAC
ACCITCAAAC
AGTAACGTA.A
AAAGATACCG
TTTACTGCA
AAGGGAACGA
GCCGTTAAAC
TATACTTG
AATGCCGTA
TTCTTTAAAT
TTTAAAGTGT
ACAr.CTTCTA GGGGATTATTr
GAAATCCGTT
ACCATTACCG
CTAACTAACA
AAAGAAGATA
CTTGATrTrTTA
TGGTATGTCG
GAAAAAGCCA
ACTGGAACTT
CTC'FrGTTCA
GCGAAGGAM
GTTTCCATCC
GTGATGGATA
GC'TrCGAAA
TCTGGCAAAA
AAAGTAGATA
GAAAALAGATO
TTAIACACCAA
GTTCCAAGTA
AAAAATTCCA
TATTTCTGGA
CGAAACTAGT
CACAAAAGAT
GTGC-TGCTTC
ATCAAAAAAA
CTTTAATAGA
CGAATATAGA
AATCAGGA.AC
AAGCAGGACT
AAAMrAATrCT
ATAFTGTACC
TGACAACAGA
CTATTACTGA
CCATTAAAGC I
TCTTT'AGA
TAGCTGGAAA
TAACAACQGA
TGGGAACCTA
CAACAAAAGT
AAGGCCAAAA
GTAATGAGAG
AAGATGGTCA
AAGCTTCTGA
ACCTAGCAAT
ATGAAACGAA
TTACTCGAGA
TTC:TGGATC
.CGCCATTGGA
ACGAOCAAT
TACTTAAGA
CTACATTTAA
AAGA.AGGACA
ACGAGITISC
GT=GACTTIC
CCGAGCTAGT
TAGCTCAAGC
AACCGGTGGT
GGAAAATTCT
ATTATCTGTA
CTAGCTTGAA
AAACAAAAGC
TGT=rATGA
CA-ATTCTGTA
TTGAGCATGA
CCGGAAAGTA
GAAACGTGAA
AAGA.AGTGCC
ATTCTCAAGA
ACGT1' ACT
ATCGCCGCAA
ATCGGTTCTT
GATTTGGGAA
AGTTGATATA
ACCAAG=TT
TAAAAATGAA
F1'ACCGTrCTA
ATTAACATTG
TCAAACTGTG
GGAAACTAAG
P.GAAACGGTT
TAAAGATGGG
AAAATCGGTG
GGGCGAGACC
GAAGACTGGG
TACATTTGCC
TGAAAAAGGT
TACGTGACA
CGAGTTAAA
CCAAGAAATT
TCAAGGGAAA
AGCTGITAAA
TGAAACAGTG
TAACGAGTAT
GTATCCTGTA
T'CTTACGCCA
G'-CTGATGGC
AGGGACCA.AN
AGCG'rTTGAT
AATAGAGGCT
GGAAAACAAA
AAAACAACCA
TG LTAGTCTC
TCTTCGACT
TGATAAATTG
CATTGATGTG
GATTGCCCAA
GCATAAATTA
TGAAAACAAG
CAATCAAGCA
CCACCTAGAA
TGTGTCGGTT
TCTTTACTA
AGTGAATGAT
TCCAGAAGGA
TGGAAAACAC
AACCATACCG
ATCTAGTCCC
G ITAGTTGGC
TTAA
EF094-2 (SEQ ID NO:358) MKKTTKN WSLFATLALL SQTICTIGP TIAFADEITH PQEVTI HYDV SKLYEVDCTF SDGSTLSERT TSLYAEYNCA KQTVFCIEPG VSIPTEVTHG 187 TABLE 1. Nuclcotide and Amino Acid Seqeuences; of E. faecalis Genes.
YQKNPLPSMS
SIEGKINKAI
GNQLVLTPNS
SLKIKKIDKE
DPYMIDTTPM
RKDSPAGEIV
ANQTVALVTS
SEAFKTELVK
IKKVDNNEKN
ITGAEDKATT
DYAKSEYVFT
KDGM(KLNTL
TATLANKEKT
TVNCTIERHV
OGTNKE IWKS
EDLKEKSQTL
ILIFSTAGYY
DKAKLVSVLW
EEYQKKPSFH
NSKSGTLTLK
SGDIVPETVF
AATIKAGETI
QEITTDEKGR
NVXGQNQEIT
GTKASDETVT
AVITRDVTAK
ACNEQLGFDG
ITEEGQKQP I
DFTELVDKLR
GTWKILHKLT
S IQTKAHLED
GKIEHEVNDK
TPKEVPTIPS
FWNRRN
EKAGTDIDTN
NTTVKTILGQ
KSAGTGTPVA
HLDFGKALPS
SMTSKNMRQK
AE:TPFLANA
GETTLTKEDK
LALDEKNQVA
EQVIRFGFDF
YGKFENtJPYG KM4VTVPYEKL YNLHE IKEDW AEQVLDKS IV
GSQTFTHGDV
EFTKTTLAEI<
TPKQPT-TPAV
MVAQKMIWEE
STTLI DKDEL
YKKAGLQTVM
KDV'ITDKDGI
GO ILLEKTGV
LELCTYYVTE
DTGNESQGKA
VKHLAI NEYF
FKFAG-SADGT
DYLLEE I EAP
TNNEFSVSLN
YVIVAQAIDVE
LFNWVYENKV
MDMFDD VS VT
VDTGKYPEGT
PSNSQESSPT
V7NGYF.LHS IK
NELSEFDKVVQ
AGALDKPNTY
SILDGIPHGT
ETGTDLWND)N
TKS SNGFVNT
EFKGAEYTLF
WQETXAPEGY
AETGFNDLSF
EGFQKITPLE
RLMLYDLPEK
ATKAAQEKDE
AFEAGNEPVA
HDVLDGSKEA
KFTFTE INYE
VKTFPQTGEK
RLGGASVDIK
NTANIDYRVI
AIKINVETKG
KVTITEKSVP
YSLAGNTIFA I
FKPTKVELKY
TAKDGQAVKW
TLDETKYPVS
KVSPLEGTXE
IRSTFKENKD
EDSLTSLATW
KAKPVVIAET
KDASLNNQAQ
FETI LYALLP
KDGNVNGKHN
NSNVLLLVGF
EF094-3 (SEQ ID NO:359) CGA TG-AAATTACT CACCCTCAAG AGGTAACAAT TTTAGCGATG GCAGCACGCT GCAAAACAAA CAGTATFITG GGTTATCAGA AAAACCCTTT TGGGAAAAGG CTGGAACAGA GAAGTGAACG GTTATAAACT AAATCTATTIG AAGGAAAAAT CATAATACCA CTGTAAAAAC TTAAATTTAT CTGAGTITGA ATTGGGAATC AATTAGTGCT AAAAAATCAG CTGGTACTGG ATGGCTGGTG CGCTTGATAA GGTTCTI'TAA AGATCAAAAA TTCCATTTAG ATI1'GGGAA ATTTCTATI'T TGGATGGAAT CCAGATCCTI' ATATGATTGA ATTTCCATGA CTTCGAAAAA GTAGAAACAG GTACTGATCT AT1'CGTAAAG ACAGCCCAGC CGTGCGGAAA CACCAAAACA GAAACTAAAT CTAGTAATGG TATG;CCAATC AAACCGTGGC ACTGGGGAAA CCACTTTGAC GCTGAGTTTA AAGGAGCTGA TGGAGTGAAG CT'ITTAAAAC
TCAITATGAC
CTCAGAACGT
TATTGAACCA
GCCATCA-ATG
TA'FrGATACA
CCATTCCATA
TAATAAGGCA
AMTTAGGT
TAAAGTCGTC
TACTCCAAAC
AACTCCAGTC
GCCCAATACC
AATCGATAAA
AGCTTTACCT
TCCCCATGGT
TACCACACCC
TATIGCGACAA
TTGGAATGAC
TGGTGAAAT
GCTTGCTAAT
T=rGTGAAT TCTrG'rrACC
~AAAAGAAGAC
ATATACTCTC
AGA.ATTAGTG
GTAAGTAAAC
ACTACGTCAT
GGTGTTAGTA
TCTGATAAAG
AATATGGTI'G
AAAAGATTAG
ATTGAGGAGT
CAATCGACAA
CAAAkATACGG
TCTAATTCCA
GCTTATAAAA
TACGCTATTA
GAATCAGGTG
TCAAAAGATG
ACAAAGGTAA
ATGGCTGCCA
AAAGGTCAA.A
AATTATTCTC
GTCCAAGAAA
GCT'ITGGAAC
ACCTTCAAAC
AGTAACGTAA
AAAGATACCG
TTTACTGCAA
AAGGGAACGA
TGTATCAAGT
TATATGCAGA
TTCCAACAGA
CGAAACTAGT
CACAAAAGAT
GTGGTGC'rC
ATCAAAAAAA
CITTAATAGA
CGAATATAGA
AATCAGGAAC
AAGCAGGACT
AAATTAATGT
ATATTGTACC
TGACAACAGA
CTAT'TACTGA
TGACGGAACT
ATACAATGGT
AGTGACGCAC
ATCGGTTCTT
GATTTGGGAA
AGTTGATATA
ACCAAGTTTT
TAAAAATGAA
TTACCGTGTA
ATTAACATTG
TCAAACTGTG
GGAAACTAAG
AGAAACGGTT
TAAAGATGGG
AAAATCGGTG
CCATTAAAGC GGGCGAGACC TTCTITAGA GAAGACTGG TAGCTGGAAA TACATrGCC TAACAACGGA TGAAAAAGGT TGGGAACCTA TTACGTGACA CAACAAAAGT CGAGITAAAA AAGC-GCAAAA CCAAGAAATT GTA.ATCGAGAG TCAAGGGAAA AAGATGGTCA AGCTGTI'AA-A AAGCTTCTGA TGAAACAG EF094-4 (SEQ ID NO:360)
DEITH
PQEVTIHYDV
YQKNPLPSMS
SIEGKINKAI
GNQLVLTPNS
SKLYEVDGTF
DKAKLVSVLW
EEYQKKPSFH
NSKSGTLTLK
SDGSTLSERT
EKAGTDIDTN
NTTIVKTILGQ
KSAGTGTPVA
TSLYAE:YNCA
MVAQKMI WEE STTLI DKNEL
YKKAGLQTVM
KQ'IVFCIEPG VSIPTEVTHG VNGYKLHS 1K RLGGASVDIK NLSEFDKVVQ NTAN IDYRVI AGALDKPNIY AIKINVETKG 188 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faccalis Genes.
SLKIXKKIDKE
DPYMIDIrPM
RKDSPAGEIV
ANQ IVALVTS
SEAFKTELVK
SGD IVPETVF
AATIKAGETI
QEITTDEKGR
NVKGQNQEIT
GTKASDET
HLDFGKALPS
SMTSKNMRQK
AETPKELkNA GET1'LTKEDK
KDVTTDKDGI
GQILLEI<TGV
LELGTYYVTE
DTGNESQGKA
S ILDGT PHGT ETGTDLWtDN
TKSSNGFVNT
EFKGAEYTLF
KVTITEKSVP
YSLAGNTFAI
FKPTKVELKY
TAKIDGQAVKW
EF095-1 (SEQ ID NO:361) a a a
TAAGAATTGT
GAArI'CATAG
GTTTI'TGCAG
ACAGCAGAAA
ACACTGGCAT
ATTAGAGAAA
GGGCAGCCAG
CCAGATGGCA
AAT'1AAAAG
GTGCTTGCAG
TACAATAAGA
GGAAACCAAT
TATAGTCAGA
TTGTACAACC
Trr1CAGGGC
ACTCAAGGGA
GAGCGTGACC
TCAAAAACGT
AAAAGCGTAA
TATAAGGAGA
CCACCATCTC
TTAGCTGGAA
GGTTGGTATC
TCCCCTGTTT
GCTGAAATGC
CAGATTATGC
AGTGGTTGGT
CCAAACAAAA
GAAGTGCCTA
CATCAAGTGT
GATACTGTCA
ATCAGCACAC
GGTTTAAAC
AAAAAATCAC
GATCAACTAT
CAGTACAATC
GTTGCCAACC
TATCAATTTG
AAACATCAAA
TGGATTGTTC
TAACGGGCAT
AAACATTACC
AAGCCGAAAG
TGTCAAAAAG
GAATTGAGAC
TAAACGCCAA
TAAATGTGTG
AAGTGGTAAT
CGAGTAATCA
AAGGGGAAAT
ATCCAACAAC
AAACAGGAGT
AAGTGAAGGT
CGGT=ATTA
AACCAATCCC
CTTACACCTT
ATCAATTTCA
CGCCCAGTTA
TACCTCAAAA
ATTTTATTCA
AAAAACTGCC
AAGATAAAAC
TTAATGAAAT
AAATAGAAGG
7ACAAATGT
CACCAGGTCT
TTGTTCCTAT
TTGGTCAGAC
TACAAGCGC
GAATCCAACC
CAACTTGA
AGGCACCACA
AACCCAATTG
CATCAATGAC
AACCAACGCA
GTGTAGCTAG
ATTTCTTT
CTTATCAAGC
TTTAGAAAGA
CTGCCATATA
AAGTACAAAA
TCAACAACCA
TGAGTTAATC
GCCTAACCTA
TGAGATCCTT
GGAAGGTGAA
TCCAAGTGAG
AACA=''TT
TCATCCCAAT
AATTTCGCAA
AACGTTTAAC
TGATTCATCG
TCATGTTACC
TCCACCACCG
TAAACAGAAA
AGGATGGTAT
TGATATTACC
'AAAITATACA
GGATCACCAA
ACAACAATAC
NAAACAAGAG
GAACGCTA IT
ACTAGTCAAA
GGGAGAAGTA
AGCTCGGCCA
TACTGATGAA
AGCAACTATT
TGTTGAAATG
TAAAAATCAA
T'TI'GGCAAA
TTATTATGAA
GGCACTAACT
AAAG TATTG
AACTAAAGTT
CCATAFI'GTT
TrGATCAA.ATC
AATGGTGACT
AGGGACAATA
TTAGTATTCC
CMGTAAGAG
CACACAAAGG
GATAATGAGG
ACTTATCGTr
CTACAGTATC
AGTCAACCAG
AA.AGTAGCCG
TTACCAAGAT
TATCCGCTGC
TTTGAATTGG
ATTAGCGAGA
AATCAATCTG
AATCGCAAAG
GGGTTTAGAC
CATCTTTTGC
AAAGGGAAAA
TATGACGACA
TTTGAGGAIM
CAACCAATAA
AGCGTTAACG
AGCTTAGAAA
ACAGCAGL=
GTCATGCCAA
CCGTTAAAAA
ATCCAAGTCA
AATTIGGCGAG
ATGATGACAA
A.ATGAANT
GAAATTGTGG
GTCGCCAM
AATCTCACG
GCAGAAC TAT
TTAGGAACAA
GCTCTTGGCA
GCCAATGGTC
AAATTAGAAA
TGGAAT'TTAG
TGAAGCGAAC
:CATAcTAAT.
AAGGAACCAA
ATAAACTACA
CTAATGTTAC
ATGGA!TI'AT
ATAGTTGGCA
TGACAGCATC
TCTATTCC-GA
ATTATACTTC
CA.ACTATTTC
AAAAAATGTC
GTCAAAAACT
GGCTMAA
TGACAGAACA
AAGGAAAGCA
CAAGTAGCTA
CGAA.ACCTGA
ATGATGATTT
TCAATGGTGT
CTACGGATGG
GTAAAACI'A
AAACGAAGCG
ATAALGGAAAT
GTGGTI'ATAT
AAATAAACT
CGATTCGTGT
TTGOCATTAC
CAATTrGCTAC
TTTCMCTGT
CACCAGATGA
CTAGCAACAC
AA.AATCCATA
CCCCCTTTGA
CCAATGMTC
AAACAACCGC
AGTIrMACGA
TTCCAGCAAA
TGACAGGCCC
TAAATGGAAA
ACAGACAACT
TCATTCATI'A
TGATGPLAGAA
AAGTCAAACG.
TAATGAAGAG
AGGCAATTCC
TACAGCCT
CATGTCAACG
TTTAAGCTTA
CGACGCATCA
TGCACAACAA
AATCGTTCCT
TTACTTTAAA
=ITTGGAT
AACACTTATT
TGAAATTGAC
AAATTIAGAA
AACTGTTGTC
TGA.AATTGCA
CTTTCGCTAT
TTTATATCAA
ACCCATAALAC
AACTGCAAAA
ACAAATTTG
AAAGCCAGCA
TGGATCTGAA
TTTAAATACG
AGGTGAACCA
TCACGCAGCT
GGAAGGTI=
GCAGCAACAT
'PITACGTTTG
AGGAAGAGTG
AGGTTTTATT
TATTcAATTA AAGTGATmTT
TCAAGGTAGA
ATAA
EF095-2 (SEQ ID N'O:362) MKRSKWKE LIVTGICHIL VFPILIQTTV FAETLPSTKQ VRE-GTNHSLT AEKAESEQPQ TKDKLHDEET LALSKSELID NEANVTSQTI RERIETPNLT YRYGFUIEEG QPVNANE ILL QYHSWQGNS P DCINVWEGES QPVTAsTVAN LKEVVIPSEK VAVYSDMSTV 189 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
LAASNQTFFL
SQKTGVTFNI
QGKPIPPPPG
S VT PSY DITY
AGKKLPQQYS
EMQ lEG LVKV NKIVP ITfEN
TVRIQPKNQE
KSQPNWALTA
ANGVASHIVA
PRYYTSLSLY
SESQKLIVPL
FRQGKQTLIE
DDNDDLTVVY
VNGKTYLYQG
MPSGYIQIWQ
WRVGITLNTE-
IVAPDEEGF I
ELSPFEGRVD
NGQFCESDVY
NKKGEIDPNY
YNQVKVDS SN
RDPYTFKQKD
KEI PQKNYTF
WYQDKTKQES
IMLTNVGEVP
VPIGQTATIM
STPTFDFGKV
QLSSMTKLLL
QFDFSFDQIK
PLPTISDASG
QSGLLNYPKF
LLPSSYEIEDS
EDVNGVE lAP LEKTKRP INS LKK INLKPAS
MTTIATGEPD
AISSNTQQHG
GTTNVSGFIQ
L I PANQGRK
NQYPTTISQF
SGPVVYYHVTN
KTYQFQGWYK
PSDF IQCHQQ
PVFN&ENAIT
GWSPGLARP I
QVLQAMVEI~
LKQAADYYEN
YNQPTETKVA
DQTYQAMVTW
ELEKMSAQQY
RKVTEHFVDT
GKTKPENLEK
P I'TDGFRYL AVYKE ITAKA
QVTIRVGSEP
GNFSAVHAAD
GQENPYLRLK
LGKTTAIQLV
NLVTGP
EF095-3 (SEQ ID NO:363) AAGTACAAA.A CAAGTAAGAG 4* a.
ACAGCAGAAA
ACACTGGCAT
ATTAGAGAAA
GGGCAGCCAG
CCAGATGGCA
AA'ITTAAAAG
GTGCTTGCAG
TACAATAAGA
GGAAACCAAT
TATAGTCAGA
TTGTACAACC
TTTTCAGGGC
ACTCAAGGGA
GAGCGTGACC
TCAAAAACGT
AAAAGCGTAA
TATAAGGAGA
CCACCATCTG
TI'AGCTGGAA
GGTTC-GTATC
TCCCCTGTTT
GCTGAAATGC
CAGATTATGC
AGTGGTIGGT
CCAAACAAAA
GAAGTGCCTA
GATCAAGTGT
GATACTGTCA
ATCAGCACAC
GGTITAAAGC
AAAAAATCAC
GATCAACTAT
CAGTACAATC
GTTGCCAACG
TATCAATTTG
AAAGATCAAA
AAGCCGAAAG
TGTCAAAAAG
GAATTGAGAC
TAAACGCCAA
TAAATGTGTG
AAGTGGTAAT
CGAGTAATCA
AAGGGGAAAT
ATCCAACAAC
AAACAGGAGT
AAGTGAAGGT
CGGTTATTA
AACCAATCCC
CTI'ACACCTT
ATCAATTCA
CGCCCAGTTA
TACCTCAAAA
AT=-k~TTCA
AAAAACTGCC
AAGATAAAAC
TTAATGAAAT
AAATAGAAGG
TTACAAATGT
CACCAGGTCT
TTGTTCCTAT
TI'GGTCAGAC
TACAAGCGGC
GAATCCAACC
CAACTTTTGA
AGGCAGCAGA
AACCCAATTG
CATCAATGAC
AACCAACGGA
GTGTAGCTAG
ATTTTTCT'rr
CTTATCAAGC
AAGGAACCAA
TGAACAAC-CA
TGAGTTAATC
GCCTAACCTA
TGAGATCCTT
GGAAGGTGAA
TCCAAGTGAG
AACATITTT
TGATCCCAAT
AATTCGCAA
AACGTITAAC
TGATTCATCG-
TCATGTrACC
TCCACCACCG
TAAACAGAAA
AGGATGGTAT
TGATATTACC
AAATTATACA
GGATCACCAA
ACAACAATAC
NAAACAAGAG
GAACGCTATT
ACTAGTCAA
GOCAGAAGTA
AGCTCGGCCA
TACTGATGAA
AGCAACTATT
TGTTGAAATG
TAAAAATCAA
TTTTGGCAAA
TTATTATGAA
GGCACTAACT
AAAGTI'ATTG
AACTAAACTI
CCATATTGTIl
TGATCAAATC
AATGGTGAC'I
TCATTCATTA
CAGACAAAGG
GATAATGAGG
ACTTATCGTI
CTACAGTATC
AGTCA.ACCAG
AAAGTAGCCG
TTACCAAGAT
TATCCGCTGC
TTTGAAIG
ATTAGCGAGA
AATCAATCTG
AATCGCAAAG
GGGTTTAGAC
GATCTTMTC
AAAGGGAAAA
TATGACGACA
TTTGAGGATr,
CAACCAATAA
AGCGrtTAACG
AGCTTAGA;A
ACAGCACTOT
GTCATGCCAA
CCGTrAAAAA
ATCCAAGTCA
AATTGGCGAG
ATGATGACAA
AATGGAAATT
GAAATTGTGG
GTCGCCAT
AATGrGTCAGG
GCAGAACTAT
TTAGGAACA.A
GCTCTTGCCA
GCCAATGGTC
AAATTAGAAA
TGGAATTTAG
ATAAACTACA TGATGAAGAA CTAATOTTAC AAGTCAAACG ATGGArTTAT TAATGAAGAG ATAGTTGGCA AGGCAATTCC TGACAGCATC TACAGTGGCT TCTATTCCGA CATGTCAACG ATTATACTTC TTTAAGCTTA CAACTATTTC CGACGCATCA AAAAAATGTC TGCACAACAA GTCAAAAACT AATCGTTCCT GGCTATTGAA TTACTTTAAA TGACAGAACA TI'TTGTGGAT AAGGAAAGCA AACACTTATT CAAGTAGCTA TGAAATTGAC CGAAACCTGA AAATTTAGAA ATGATGATTT AACTGTTGTC TCAALTGGTGT TGAAATTG-CA CTACGGATGG CTTTCGCTAT GTAAAACTTA FTTATATCAA AAACGAAGCG ACCCATAAAC ATAAGGAAAT AACTGCAAAA GTGGTTATAT ACAAATTTGG AAATAAACTT AAAGCCAGCA CGATTCGTGT TGGATCTGAA TTGGC:ATTAC TTrAAATACG CAATTGCTAC AGGTGAACCA T'ITCTOCTGT TCACGCAGCT CACCAGATGA GGAAGGT=I CTAGCAACAC CCAGCAACAT AAAATCCATA TrrACGTTTG CCCCCTTTGA AGGAAGAGTG CCAATGTTTC AGGTTTATT AAACAACCGC TATrCAATTA AGTFTGACGA AAGTGATGT TTCCAGCAAA TCAAGGTAGA TGACAGGCCC A a EF095-4 (SEQ ID NO:364) STKQ VREGTNHSLT 190 TABLE 1. Nucleotide and Amino Acid Seqeuences ofrE. faecalis Genes.
AEKAESEQPQ
QPVNANEILL
LAASNQTFFL
SQKTGVTEFNI
QGKPIPPPPG
SVTPSYDITY
AGKKLPQQYS
EMQIrEGLVKV
NKIVPITDEN
TVRIQPKNQE
KSQPNWALTA
ANGVASH IVA
TKDKLHDEET
QYHSWQGNSP
PRYYTSLSLY
SESQKLIVPL
FRQGKQTLIE
DDNDDLTVVY
VNGKTYLYQG
MPSGYIQIWQ
WRVG ITLNTE
IVAPDEEGFI
ELSPFEGRVD
NGQFIJESDVY
LALSKSELID
DC INVWECES
NKKGEIDPNY
YNQVKVDSSN
RDPYTFKQKD
KEI PQKNYTF
WYQDKTI<QES
IMLTNVGEVP
VP IGQTATI2
STPTFDFGKV
QLSSMTKLLL
QFDFSFDQIK
NEANVTSQTI
QPVTASTVAN
PLPTISDASG
QSGLEJNYFKF
LLPSSYEIDS
EDVNGVEIAP
LEXTKRP INS
LKKINLKPAS
MTTIATGEPD
Al SSNTQQHG
GTTNVSGFIQ
fEI PANQCRK
REPRIETPNLT
LREVVIPSEK
NQYPTTI SQF
SGPVYYHVTN
KTYQFQGWYK
PSDFIQDHQQ
PVFNENAIT
GWS PGLARP I
QVLQAAVEMN
LKQAADYYEN
YNQPTETKVA
DQTYQAMVTW
YRYGFINEEG
VA'\VYSDMSTV
ELEK14SAQQY
RKVTEHFVDT
GKTKPENLEK
P ITTDGFRYL
AVYKEITAKA
QVTIRVGSEP
GNFSAVHAAD
GQENPYLRLK
LGKTTAIQLV
NLVTGP
EF096-1 (SEQ ID NO:365)
TGAGGTGGCC
ATTGTCATCT
GATACCTTGA
GAAAAAGAAA
ACAAAAATTC
CAAAAAATGA
GAAAACATTG
CCGTACTTCT
ATCCAACATT
GGCCCTCG-CG
AACGACCAAC
GCAATTGTTG
ACFI'CTCTG
AAGTTAAAAT
TGTTI'TTGG
CCATPI'ACA-A
CAGGCTATAA
AGCAAGGTGG
TGAAAGCGAA
ATGCACGCTT
GGGGCACGTI'
GGATOATTI'
AAGTG PIAGG
AATTACGTCA
CCGAT1GAAAT
GTGAAGCTGC
GAAAAAATTA
CGTGCGCCAA
TTGGGGGGAC
AGTCAA'ITAC
CACAGCCTAT
GATGCTTTTA
1TTAGATCAA
GGGGATTAT
ATCGCGCCCG
ATTATCTT'
GGCTACCGAT
CAAAATGTAC
TGAAATGCTA
CTGGTTTGAT
TATGAACTT
CACACCAAAT
TTATCCAGAT
CTTAGGAAT
CAGTCACTTT
TTGAGAAAG
TATATAGATC
GAAACCTTTG
GATATTGCCA
CCACTTGATC
TCCTTTGATC
TATAATGATA
GAATTAAAAA
AACAGTTTAG
AAGTI'AAACC
ATGGCTAATG
GAAAACAATG
AACATTGT6A
ATGTI'ACGAC
AAAGAAGCTA
GATGAAACTA
TATAACGATC
TTAT7GGAAT TATCGCTATT CAAGTGGCAT GG^CAGGAGCA CCGCCTTGAT TAAGAAATTT AFTCTAATGA AGCTATGTAT TTCCTTCTGA ATATATGATT ACAGCAAATT AAAAGGC FTA CCAAAAATAA GTTTTCCGTT AAI'TATTGA CGGCCGTCAG ATAATGTCAT GCTGATTGAT GCTATTCGTT AA.ACAGTAAA GATTAACGAkA CAATGTCAAA AAGAAAGTGC AGTTGCTGTA AACATCTACA TTATGTGAT TGCCTAAGAC AGCCAAAAAT CAGAAAATGC GGCACAAAAT AAA.AACTATT ACCAAAAGA-A TCAAACATTT AGAAGTTTAC TGTTCTTGGA GT=TAAGATG 4 CCCAGTGAAG GCTCTAATCT AAAGAGGGTG CCTATGCATT GCAGAATATA TTGGTTATTC GTTGCCGAAG ATAAACAATT CAAGACTTAG GTCAAGAATA TATCGGAAAT AA EF096-2 (SEQ ID NO:366) MKKLQ SLFIGIIAII VILFFGVRQL EKASGMAGAD TLTIYNWGCY IDPALIKXFE KETYKVNYE TFDSNEAMYT KIQQGGTAYD IAIPSEYMIQ KMMKAKMLLP LDHSKLKGLE NIDARFLDQS FDPKNI(FSVP YFWGTLGIIY NDKFIDGRQI QHWDDLWRPE LKNNVMLIDG AREVLGLSLN SLGYSLNSKN DQQLRQATDK LNRLTNNVKA IVAD)EIKMYM ANEESAVAVT FSGEAAEMLE NNEHELMYVIP SEGSNLWFDN IVMPKTAKNK 'EGAYAFMNFM LRPENAAQNA EYIGYSTPNK E.AKXLLPKEV AEDKQF'YPDD ETIKHLEVYQ DLGQEYLGIY NDLFLEFKMY
RK
EF096-3 (SEQ ID NO:367) AAGTGGCAT GGCAGGAGCA GATACCTTGA CCATTTACAA GAAAAAGAAA CAGGCTATAA ACAAAAATTC AGCAAGGTGG CAAAAAATGA TGAAAGCGAA GAAAACATI'G ATGCACGCTT CCGTACTTCT GGGGCACGTT
TTGGGGGGAC
AGTCAAITAC
CACAGCCTAT
GATGCT'TA
TTTAGATCAA
GGGGATTAT'r
TATATAGATC
GAAACCTTTG
GATATTGCCA
CCACTTGATC
TCCTTTGATC
TATAATGATA
CGGCCrI'GAT A~rCTAATGA TTCCTnTCTGA
ACAGCAAAT
CCAAAAATAA
AATTITATTGA
TAAGAAATTT
AGCTATGTAT
ATATATGATT
AAAAGGCTTA
GTTTTCCGTT
CGCCTCAG
191 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.f[aecalis Genes.
ATCCAACATT
GGCGCTCGCG
AACGACCAAC
GCA.ATTGTTG
ACTTTCTCTG
CCCAGTGAAG
AAAGAGGGTG
GCAGAATATA
GTTGCCGAAG
CAAGACTTAG
GGGATGA=T
AAGTGTTAGG
AATTACGTCA
CCGATGAAAT
GTGAAGCTC
GCTCTAATCT
CCTATGCATT
TTGGTTATTC
ATAA.ACAATT
GTCA.AGAATA
ATGGCGCCCG
ATTATCTTTG
GGCTACCGAT
CAAAATGTAC
TCAAATGCTA
CTGGT FGAT TATGAAC Tr
CACACCAAAT
TTATCCAGAT
CTI'AGGAATT
GAATTAAAAA
AACACTTTAG
AAGTTAAACC
ATGGCTAATG
GAAAACAATG
AACATTGTGA
ATGTTACCAC
AAACAACCTA
GATGA-AACTA
TATAACGATC
ATAATGTCAT
GCTATTCGTT
GArI'AACGAA
AAGAAAGTG.C
AACATCTACA
TGCCTAAGAC
CAGAAAATC
AAAAACTATT
TCAAACATT
GCTGATTGAT
AAACAGTAAA
CAATGTCAAA
AGTTGCTGTA
TTATGTGATT
AGCCAAAAAT
GCCACAAAAT
ACCAAAAGAA
AGAAGTTTAC
TGTTCTTGGA GTTTAAGATG
TATCGGAAA
EF096-4 (SEQ ID NO:3 68) SGMAGAD TLTIYNWGDY IDPALIKKFE KETGYKVNYE TFDSNEAMYT KIQQGGTAYD NIDARFLDQS FDPKNXFSVP YFWGTLGIIY AREVLGLSLN SLGYSLNSKN DQQLRQATDK FSGEAAEMLE NNEHLHYVIP SEGSNLWFDN EYIGYSTPNX EAKKLLPKEV AEDKQFYPD
RK
IAIPS EYMIQ NDKF IDGRQI
LNRLTNNV*XA
IVMPKTAKNK
ETIKHLEVYQ
KDOUK-kXMLLP
QHWDDLWRPE
IVA-DEIKMYM
EGAYAFIMFM
D)LGQEYLGIY
LDH SKLKGLE LKNNVMLI DG
ANEESAVAVT
LRPENAAQNA
NOLFLEFQ4Y EF097-1 (SEQ ID NO:369) *99 9 4*
TAGAAGTATT
ATGCATTCGC
GGTCATCGTT
TT1GTCTTTGG
ACGATAATGA
GAGGAACAAC
GATGTTCCAC
AAAATTGAAC
ITT7TrAGCAG
GCTGTTGAAA
CTTTTACCAT
ATTAACCATO
ATTTTATTTC
CTGTTTGGGC
GGGGGCATTC
GTAATTGCTG
GCTCCAGCTT
CTGGCGGTTI'
TTATTAAAAC
CAAGTGGAAA
GGCTCAAGTG
ATCCCTGTGA
ATTCAAGCAG
GTTCAAAATT
TCTTCTCAAG
ATACAGAAGC
GAATTATTGC
CTGGAAACAG
GCCTATCATT
AAAGAGTATC
CTAATTATCT
TCTTTTTTAA
TGAGTGGCAT
TGGCTGGCTA
TTCGAGTTGT
GTGGCGGCGT
AGTTGTTTGG
AAATTCTCTT
CAATTGTCGGG
GCGCTAGTTT
TGGTTCACGT
GCTTATTAAC
TATTGGAAAC
CTGTAGGACA
ATGSAAAFIA
GAGCAATGAG
CGCCAGGTI'C
TAGCGGAAT
GTCAACGAGG
CAGTCACACC
CCATGGGGGC
CTTACCAGTC
AATTGAAACA
TTTAGAA.AT
AGCAATCTTC
FrGTTTTTT
GGCAACAAGC
TCqTrTTTTAC
TAGATCTAAC
AAGAATGGCT
ACATAGAGAG
ACATAAGTTT
GATTATGCCA
TACGACTGGG
TTTACCGATT
TGTTGCTGCT
TGCTATCTTT
ACCGAAAGTT
AGGACTGCTG
TTGCCTGTAT
=~CTTAGAG
GCCTCTAGGT
AAACCCTGGA
ACGAAAAACA
TITT-CCGTTT
TGGTACGCTIT
ATTGGTTGCG
TIGTTAGC
AATTGAACCA
TAACTATCAG
TAGTTTGCTA
CG TTCATCAG
GITACCACAA
TAAATCCTAT
ACTTGG'-rCA
ATATGCCGAG
GGCGAAACAA
CAAGGAGACA
GCAACAAAAT
GGAAGGAGGA
CGAGGGACAA
GTGAAAGTAA
AATTTGAGTA
AATCTACGC
CTAArI'GGTT
ATTGCGACAG
ATTGGCCCTT
AAAGAAGGCT
TGCTGTTTTG
CAATTTTCTT
CCCTTAAAAG;
TTAGAAGGTG
CCAGGCGTGG
GCAGGAGGTG
GTTTTGATCG
GTTTTCAAA
ATTTTAGCCA
TTTCTGTGCT
GTTTCAATrGA CAAATTTrAT
AGCCGACAAT
ATGAAGTGGC
AAGTACGTCC
TACCCGCAAr-
GAGTCTACTG
AATGTrCGAG
GGAGTCGCGA
ACCTAC:GTAG
TTATACGTAG
GCTGATAGAT
GGAATATGAA
CTCCCTATTT
I'TTATGC
TAGCTC=IC
TTACAGGCGG
TGGGCGTGAT
TAGCAGGATA
ACGAGATCCT
GTATTCTGGT
CTTCG'ITAAT
TGTTATTTTT
CTAGTCAAAC
GCGTTrGGT
CCACCATGAT
ACCCGCGCTT
TATTTAATGT
ATGCCCCGAC
CTTTTGCAAT
TAAAGATGAA
TTGTTGCTCA
TAAAAGCTGT
AGCCTAAGAC
CAGAAAAGGA
TTTTAGCCAA
AAACGAACTC
GATCGCAAAC
TTGAAGTATC
TGACTCGTGA
TTACTA'TTT
GTT'I=AA
CCAAAAAGAA
ACGTCG TFI
GTGGAGCTTA
TGAAGTCGAA
AAAAATGTTC
TGTTTICCACA
TAC'I'ITCGCC
GACTAAAAAC
TGTAGCTCCG
TGAAGCCAAT
TAATAAT'GCG
AGGTCAGTCC
TGCTI'TCTG
TCAACTCATT
ATTTTAGCA
GGGTCTAAGT
TGATGCGAGG
AGCAAGCTTG
GGAGGAAGAC
TCGCACGAATG
GAACTTGGAG
ATTAGTGGTC
TATGGTGAGT
ACTGACTGCT
GACAAAACAA
AATGGGAATG
TAAACAGCCA
ACTGGCGCAA
TTGAATA.AG
192 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
EF097-2 (SEQ ID NO:370) MLTKNF LAGIVGGLLC CFGILVVAPA
VESASFWLYQ
LFLLETNPGP
IAGGMSGTLV
LKRQRGIEPV
PVTYQSVH-QM
SQEQSSLGSE
ETVFFTKETT
FSSWL IEANL
GVGVLVAFLL
FQIFNVCLSA
SMIKMflEEDQ
KWQPKTLWVI
STETNSTKQ I
YVVTRELAQA
LPLVHVFLEP
FGP VGQRKTA
PASPGSLVAI
VETVTPNYQQ
QAELKQLAQK
QKLVPLYAEN
YHLDLTQQNL
LKVLFFNNAI
GGATMIQLIG
LANAPTDARL
ILFVCDAGMG
YVPEKDMVSV
VRGSQTMOME
YVVTSFLNKK
NHGLLTPLGL
01 HE IYFFV
AVFSGIFVSF
SSAMGASILLS
QNFLEIKSYY
LLRQQAAKQG
EYQEWLEGGA
EGASQTGQS I
LMDPRLFLAV
LCSFAIASLL
RQLKAVNLEM
PQVLAKLTAS
VA IEVSKEPL
DRCF
EF097-3 (SEQ ID NO:37.) ACGAGG AATTGAACCA GTTTCAATGA TAAAGATGAA GGAGGAAGAC CAAGTrGGAAA
GGCTCAAGTG
ATGCCTGTGA
ATTCAAGCAG
GTTCAAAATT
TCTTCTCAAG
ATACAGAAGC
GAATTATTC
CTGGAAACAG
GCCTATCATT
AAAGAGTATC
CAGTCACACC
CCATGGGGGC
CTTACCAGTC
AATTGAAACA
TT'FrAGAAAT
AGCAATCTTC
TTGTTTTTTT
GCCAACAAC
TCTTTTTTAC
TAGATCTAAC
AAGAATOOCT
TAACTATCAG
TAGTTTGCTA
CGI'CATCAG
GTTAGCACAA
TAAATCCTAT
ACTI'GGTTCA
ATATGCCGAG
GGCGAAACAA
CAAGGAGACA
GCAACAAAAT
GGAAGGAGGA
CAAATTTTAT
AGCCGACAAT
ATGAAGT'GGC
AAGTACGTCC
TACCCGCAAG
GAGTCTACTG
AATGTTCGAG
GGAOTCGCGA
ACCTACGTAG
TTATACGTAG
GCTGATAGAT
TTTTIGTGA
TAAAAGCTT
AGCCTAAGAC
CAGAAAAGGA
TTTAGCCAA
AAACGAACTC
GATCGCAAAC
TTGAACTATC
TCACTCGTGA
TTACTAGTT'
GTTT
TGCAGGAATG
GAACTTGGAG
ATTAGTGGTC
TATGGTGAGT
ACTO.ACTGCT
GACAAAACAA
AATGGGAATG
TAAAOAOCCA
ACTOOCGCAA
TTTGAATAAG
EF097-4 (SEQ ID NO:372) RGIEPV SMIKMKlEEDQ VETVTPNYQQ ILFVCDAGMIG SSAMGASLLS RQLKAVNLEM
PVTYQSVHQM
SQEQSSLGSE
ETVFFTKETT
KWQPKTLVVI
STETNSTKQ I
YVVTRELAQA
QAELKQLAQK
QKLVFLYAEN
YHLDLTQQNL
YVPEKDMVSV
VRGSQTMGME
YVVTSFLNKK
QNFLE IKSYY
LLRQQAAIQG
EYQEWLEGGA
PQVLAKLTAS
VAIEVSKEPL
DRCF
EF098-1 (SEQ ID NO:373)
TAAATGAAAA
CTTGTACCAA
CAATTTCCAC
AAACCAATrA
CCGTTATCCA
AATATGACCT
GCCCAAACT
GTAAAACAAA
ACTTTAAGCA
GCTACGATTG
GAAGGTGCGG
ATITrCATTAG
ACTTGGACTT
AGACAAAAGT
TGAGTGCATI'
CAAATACGAA
CACCAGIGA
TTGACTATGC
ACTATGCAGA
TTGTTCAAGT
ATGGTCAA T
ACGGACGCGT
AATTAA.ACCC
GTACGTACTT
AAGTACCTGG
TGACAGATAC
AATGACATTG
AGCAGTCGAC
CCCAACGAAT
TCCAACTGAT
ATCTAGCTTA
AACACAAAAA
CTCAGATAAT
CAAAACTGAA
GGTTTCAGCT
AACTG.GGGCT
AATGAGCTGG
TTCAACCACA
ACCTGCTAAC
ATGGCAACCA
GGTGGTGAAT
CCAGTTGATC
CCGACAGGGC
TCTTI'TGGGG
TACAAAGATA
CGTGGGACTG
GCCAACCAAG
TCACAATCTG
GAATCAGTGG
GGCGATAGTG
AAATATGCGA
ACACGAAACT
CAACTI'TAGG
ACCAAACAAA
CGACGAATCC
CTAAGCCAGG
AACAA.ACGAT
ACGCTGGTGC
AGACAGGTTG
AACTAACAGC
CAAAGCCAAC
TCATGGCTGC
TAGATACCGC
AAAAATACAC
AA
CGCACTGGCA
CGGAGCGATT
AGACCCAGAT
GACAGCAGGT
TACCTCAAAA
CGACCAAGAA
GACGCTAAA-A
GGCCAAAGTA
GACAGCGCCA
TGGCGATAAA
TAAAACAAGT
GACAAC=TT
EF098-2 (SEQ ID NO:374) MKKTKVMTLM ATTTLGALAL VPMSALAVDG PITPVDPTDP TGPKPGTAGP LSIDYASSLS GEYQTNGAIQ FAPNTNPTNP VDPTNPDPDK FGEQTITSKN MTY-YAETQKY KDNAGADQEG 193 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
PNFVQVSDNR GTETGWTLKV KQNGQFKTEA NQELTAAXVT LSNGRVVSAS QSAKPTTAPA TIELNPTGAE SWVMAAGDKE GAGTYLMSWG DSVDTAKTSI SLEVPGSTTK YA.KKYTTTFT WTLTDTPANT GN EF098-3 (SEQ ID NO:375) AGTCGAC- GGTGGTGAAT ACCAAACAAA CGGAGCGATT
CAATTTGCAC
AAACCAATTA
CCGTTATCCA
AATATGACCT
GGCCCAAACT
GTAAAACAAA
ACTTTAAGCA
GCTACGATG
GAAGGTGCGG
ATTTCATTAG
ACTTGGACTT
CAAATACGAA
CACCAGTTGA
TTGACTATGC
ACTATGCAGA
TTOTTCAAGT
ATGGTCAATT
ACGGACGCGT
AATAAACCC
CTACGTACTIT
AAGTACCTGG
TGACAGATAC
CCCAACGAAT
TCCAACTGAT
ATCTAGCTTA
AACACAAAAA
CTCAGATAAT
CAAAACTCAA
GGTTTCAGCT
AACTGGGC-CT
AATGAGCTGG
TTCAACCACA
ACCTGCTAAC
CCAGTTGATC
CCCACAGGC
TCTTTT MGGG
TACAAAGATA
dGTGGCACTG
GCCAACCAAG
TCACA-ATCTG
GAAT-CAGTGG
GGCGATAGTG
AAATATGCGA
ACAGGAAACT
CGACGAATCC
CTAAGCCAGG
AACAAACGAT
ACGCTGGTGC
AGACAGGTTG
AACTAACAC
CAAAGCCAAC
TCATGGCTGC
TAGATACCGC
AAAAATACAC
AGACCCAGAT
GACAGCAGGT
TACCTCAAAA
CGACCAAGAA
GACGCTAAAA
GGCCAAAGTA
GACAGCGCCA
TGGCGATAAA
TAAAACAAGT
GACAACT'IT
EF098-4 (SEQ ID NO:376) VDC GEYQTNGAIQ FAPNTNPTNP VDPTNPDPDK PITPVDPTOP TGPKPGTAGP LSIDYASSLS FGEQTITSKN PNFVQVSDNR GTETGWTLKV KQNGQFKTEA NQELTAAI(VT TIELNPTCAE SVVMAAGDKE GAG EYLMSWG DSVDTAK<TSI WTLTDTPANT GN
MTYYAETQKY
LSNGRVVSAS
SLEVPGSTTI(
KDNAGADQEG
QSAKPTIAPA
YAKKYTTTFT
4 4 EF099-1 (SEQ ID NO:377) TGATGTTGTA GAGGGCTGAT ATGAAGAAAT TAGD-CAAGGT
TTATTTTTAG
ACACCACAGG
TTGCAGTTTG
AGAATTCAAA
TATGGAATTG
AAAACTCAAA
GATGGTTCTG
GCGGTAGATA
GTAAAAAGAA
GGAGGTTCAG
ACTATTrAATG
CTACAAATGA
AGTTCAkI'TG
GTCAITAATIT
GCAAGTGTTG
=TTGCTA
AGGAAATAG
CTGTATTTTC
AAAAAGTAGC
CTTCCGCTTG
CTGATTI'ATC
GGTTAGGACA
AAAAGGAATG
ATAGTGACT
TTTGAAGCT
AGGCTAGTGC
CCAATGTTGG
GTGGTCAATG
TGGGTACGGG
GTTGGACAGT
TTGCTCAAGG
AAGGTAAAAA
AGTAT='CG
GAAATGTTTA
TTAATTGTT~
TTCTAGTGAA
ATTAGAAGTT
GATTGGCAAT
GTTTAATTCA
ATGGGAITCA
GAAATCAGTA
ACTTAAAAGA
GTGGGAACGA
TAATAATTGG
TGGAGGAAAA
TTATGGC LTA GCATATG TI
CATAAAGAAT
TGGTGTGGCT
CAAGTTTACT
GACTTGGGGA
TCAGTCTTCT
AGTTGTTTTA
AGCGGAGATT
TCTAACTACG
ATGGAACAT-G
GCGATAGCTT
GGACGAAGAG
GCTTTACAAA
ATGTCTAAAT
GCTGGAACAA
TATAAACGAC
ATTGATGCCT
TCTGCTFri-T
GCGAGTQAAA
CCAAATTATT
ACTAGTATIT
ACTTATGAGC
TTAGATWC
TTTTATTGAA
TTTTTATTCT
CTTCCCAGTT
TGACGTCACA
AAAGTGGATL'
TTAATCCTTC
T'AATTTATT
TGGA=7TGC
CAAAAGATCT
AAGATGATCC
TTI'CTACAGG
TGGAAAAAGT
TTG~TGAA-AA
TTGGTAATGA
CAGATATTAA
ATGGGCATAC
AAAACGCTGA
CACAMMTAC
AGGAGAGATC
TCCTTTTFIA
TCAGCCCGCT
TGGCGGAACG
AAATCCTGCT
GTTAGCCT
AAATTITGCA
GTGGAATAAG
GAATACAC I'1
CGCAGAACAA
TTCCATGGGC
GATGGGGCAA
ACAAGAGGT
T1'ATCCTTGG
AGCAGGAGAT
TCGTGTAGTG
ACAAGGTCAA
CAGCATAGTA
a EF099-2 (SEQ ID 1NO:378) frflCLS VFFLLKGEIM KKLGKVLIVS CFIFILPFLL FLGVFSSSES GDSSQFQPAT PQEKVALEVS NYVTSHGGTL QFASAWIGNN EHESGLNIPAR IQSDLSFNSA IAFNPSLGGY GIGLGQWDSG RR\TNLLNFAX SQKKEWKSVA LQMDFAWNXD GSDSDLLKRM SKSKDVNTLA 194 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
VDILKLWERA GTKDDPAEQV KRKASANNZWY KRLSTGSMG INGGQCYGLS AFFVEKQGGL QMMGTGHMFA SEIGNDYPWS I NFGQGGVAT S IYGHTGVWA SVEGKNKFTT YEQNAEQGO I
K
EF099-3 (SEQ ID NO:379) TAGTGAA. AGCGGAGATT CTTCCCAGTT TCAGCCCGCT GSANVGGGK I CALEKVMGQT SIGWTVIKNP NYSDIKACDV VAKYFRTWGL CFPHVTS IVR
ACACCACAGG
TTG-CAGT='
AGAATTCAA.A
TATGGAA=~
AAAAGTCAAA
GATGGTTCTG
GCGGTAGATA
GTAAAAAGAA
GGAGGTTCAG
ACTATTAATG
CTACAAATGA
AGTTCAATTG
GTCATTAATT
GCAAGTGTTG
ATTGTTGCTA
AGGAAAT
AAAAAGTAGC ATTAGAAGT
CITCCGCTTG
GTGATTTATC
GGTTAGGACA
AAAAGGAATG
ATAGTGACTT
ITTGAAGCT
AGGCTAGTGC
CCAATGTTGG
GTGGTCAATG
TGGGTACGGG
CTTGGACAGT
TTGGTCAAGG
AAGGTAAAAA
ACTATTTICG
GA'ITGGCAAT
G'1TTAATTCA
ATGGGATTCA
GAAATCAGTA
ACTTAAAAGA
GTGGGAACGA
TAATAATTGO
TGGAGGAAAA
TATGGCTT1A
GCATATGT=
CATAAAGAAT
TGGTGGCT
CAAGTTTACT
GACTTGGGGA
TCT.ACTACG
ATGGAACATG
GCCATACCTT
GG.;CGAAGAG
GCTTTACA;A
ATrTCTAAAT
GCTGGAACAA
TATAAACGAC
ATTGATGCCT
TCTGCTIT
GCGAGTGAAA
CCAAA'ITATT
ACTAGTAT
ACTTATGAGC
TTAGA'I IC
TGACGTCACA
AA.AGTGGATT
TTAATCCTTC
TTAKI'TTATT
TGGATTTTGC
CAAAAGATGT
AAGATGATCC
TTTCTACAGG
TGGAAAAAGT
T'rGTTGAAAA
ITGGTAATGA
CAGATATTAA
ATGGGCATAC
AAAACGCTGA
CACATGTGAC
TGGCGGAACG
AAATCCTGCT
GTTAGGCGGT
AA.ATTTI'GCA
CTGGAATAAG
GAATACACTI'
CGCAGAACAA
'PrCCATGGGC
GATGGGGCAA
ACAAGGAGGT
TTATCCTTGG
AGCAGGAGAT
TGGTGTAGTrC
ACAAGGTCAA
CAGCATAGTA
EF099-4 (SEQ ID NO:380) SES GDSSQFQPAT PQEKVALEVS NYVTSHGGTL GIGLGQWDSG RRVNLLNFAK VDI LKLWERA GTKDDPAEQV INGGQCYGLS AFFVEKQGGL INFGQGGVAT S IYGHTOWVA
QFASAWIGNM
SQKKEWKSVA
KRKASAhNY
QMMGTGHMFA
SVEGKNKFTT
EHESGLNPAR
LQMDFAWNKD
KRLSTGSMGG
SEIGNDYPWS
YEQNAEQGQI
IQSDLSFNSA
GSDS:DLLKRM
GSANVGGGKI
SIGWqIVIKNP
VAKYF'RTIWGL
IAFNPSLGGY
SKSKDVNTL.A
DALEKVMGQT
NYSDIKAGDV
DFPHVTS IVR EF100-l (SEQ ID NO:381) Pt..
a
TAN'ITATGGC
ACATI'ATTAG
CCTAACTTAG
ATrGTAGAAT
GAATTAGTCA
CAATGA
AATATGGAAG
AAATGTTGAT
CGAAACATAA
CACAAATCGA
ACGAAGGCTA
GAGTTTATA
TGTCTTATTG
AGAAACAGTT
GCTCTACACA
CATTACTAAA
ATGAAAAAGA
ATTA7TCCG
GATAAAAAAG
CTAGAAAAAA
GAGCAGTTAC;
AACAAAAATA
TArI'ATT
GCAAT'-A-AGC
ATAAGACGCCC
ATAALATATAC
CGCAGGGTT]?
ACTTTTTGTC
AATCGTAAAA
TTCCTTAAAT
AGCAGAAAAG
EFlOO-2 (SEQ ID NO:382) MKKKQKYAGF TLLEMLIVLL I ISVLILLFV PNLAXHKE'IV DKXGNEAIVX IVESQIELYT LEKNKTPSLN ELVNEGYITK EQLDKYTAEK Q EF100-3 (SEQ ID NO:383) TAA AGAAACAGT GATAAAAAAG GCAATGAAGC AATCGTAAAA ATTGTAGAAT CACAAATCGA GCTCTACACA CTAGAAAAAA ATAALGACCCC TTCCTTAAAT GAATTAGTCA ACGAAGGCTA CATTACTAA.A GAGCAGTTAG ATAAATATAC AGCAGAAAAG 195 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
CAAT
EF100-4 (SEQ ID NO:384) KETV DKKGNEAIVK IVESQIELYT LEKNXTPSLN ELVNEGYITK EQLDKYTAEK Q EF100-1 (SEQ ID NO:385)
TANTTATGC
ACA'FIATTAG
CCTAACTTAG
ATTGTAGAAT
GAATTAGTCA
CAATGA
AATATGGAAG
AAATGTTGAT
CGAAACATAA
CACAAATCGA
ACGAAGGCTA
GAGFFI'TATA
TGTCTTATTG
AGAAACAGTT
GCTCTACACA
CATTACTAAA
ATCAAAAAGA
ATI'ATICCG
GATA.AAAAAG
CTAGAAAAAA
GAGCAGTTAG
AACAAAAATA
TATTGATTTT
GCAATGAAGC
ATAAGACGCC
ATAAAtATAC CGCAGGG TTT
ACTTTTTCTC
AATCGTAAAA
TTCCTTAAAT
AGCAGAAAAG
EFI00-2 (SEQ ID NO:386) MXKKQKYAGF TLLEMLIVLL IISVLILLFV PNLAKHKETV DKKGNEAIVK I'JESQIELYT LEKNKTPSLN ELVNEGYITK EQLDKY'IAEK Q EF100-3 (SEQ ID NO:387) TAA AGAAACAGTT GATAAAAAAG GCAATGAAGC AATCGTAAAA ATTGTAGAAT CACAAATCGA GCTCTACACA CTAGAAAAAA ATAAGACGCC TTCCT~rAAAT GAATTAGTCA ACGAACGCTA CATTACTAAA CAGCAGTTAG ATAAATATAC AGCAGAAAAG
CAAT
EFIO0-4 (SEQ ID NO:388) 4.
KETV DKXGINEAIVK IVESQIELYT LEKNKTPSLN ELVNEGYITK EQLDKYTAEK Q EF101-1 (SEQ ID NO:389) 44
TGAGGAGATG
GTAATTGCGG
GCTGTCAAC
CAACAAATTT
CAAAAAATAA
ACAGAATTAT
AGCTTACCCA
GATAAA ETTA
CAATATCAAA
GCGGATTTAG
ATTCCAGAAG
ITAGCTGGAA
CTAACTTCTG
CCTCCTGAAG
AG'ITATAGCG
CAAGCAACCA
AAAGAGGGTG
GTCCAAGTCG
GACCCAGTGG
AAACGAAGAA
TrGGGGGCAT
AAGCGCCTAA
ATA'ITAACGG
CAAAGGATCC
TTACTTATGA
AATTAGAAAC
ATAAAACTAA
CAGAAGTCGA
GArGCGAAGCA
TAAAAGATGC
AAGTGAATGA
TTI'CCAACAA
GCAACAGCGA
TCAAAATTGC
TrGATTTAGG
AACAGG-CCTA
GGCAAGAAAA
TTATTCTTC
AATGAAGAAG
CGTAACTGTG
AGATGACTGG
TGTCATCCAA
AGAGATI'AAG
AGATGAGGCG
GAAGCGGGCG
AGAAGAAGAC
TGCAGTAGAT
ATATMT=CC
CAATTICACCG
AAAGGACTTG
TGTGGITG T(
TGCCGCGAGT
GTTGGCCAAT
CGATTTAGGG
CGTTTTAGTG
TGGCGACAAA
AAAAAAACCA
A-AAACGATAA
AATGCG'TAA
GGAATTGACT
CCGGAACAAA
GTGAAAAACG
GTCACAAAAG
AATATCTATA
CGCACGATCT
GAAGAGATTA
ACAAAGGCTA
AlITI'TACGGT
ACTAAAATTA
GATGGCTCAA
GGCAATCCAG
TTAGACAAAG
GCGATTGAGT
AATGA7=TG
ATGGCGATTG
GTAAAAGTCG
TTATATTCGG GGCAGTTGCG ATAAAIAATGC ACAACAAGTA ATI'TTGACGT TCCCGACTTG TGGAAGCCTT TGCGCGTGAT GCGATGTCGT AGATGCAGC AAATTGAGGC ACAACAAAAT ATAAGTGGAA TCGGGCCATT CTGGTGATGA TTTAAATG-AA CCTTCACCAA TGAAACCTTA ATTTCAAAGG TCGTGTATCA TAACTTCAGA AGATCTTTAT GTGTTGGGCA AAAAGCTAAA TTTCTrACAT CGATGATAAT AGGGCGGCAC AACGATGTCT TCAMAAATGG CTACCATATG TACCGAAAAA AGCGATTCAA GAACCATCAT TCGTCGTGAT AATClrGCTT AGAATCACCC GTGATATI'GT TGAATCAGAT 196 TABLE 1. Nucleotidc and Amino Acid Seqeuences of Efaecalis Genes.
GCAGCGATTG CI'CTGATGA ATCACCAACC AACGAATCAA TGACAGATGC GTCGAAATAG EF101-2 (SEQ ID NQ:390) MKKK TIIILGAVAV IAVGGIVTVN ALNKNAQQVA VKQAPKDJWG IDYFI)VPDLQ
QIYINGVIQP
LAKLETKRAN
DLGAKQYIST
TSVSNNVVVD
ATIDLGDLGA
RVVISSKKPV
EQMEAFARDQ
IYN'KWNRAID
KANFI<GRVS I
GSISYIDDNP
IELPKKAIQK
KVGDIVESDA
KITKDPEIXV
KFNKTKEEDR
PEVKDANSI
PEGNSDAASG
EGEQAYVLVN
AIASDESATN
KN'GDWVDAGT
TMSCDDLNJEQ
LRLTSEDLYL
NPEGGTTMSS
DFGTIIRRDV
ESMTDASK
ELFTYEDEAV
YQTEVDAVDE
AGKVNE-KDLT
YSVKIALANL
QVGQENGDKM
TK:-IEAQQNS
EITFTNETLA
KISVGQKAKL
DKVKNGYHMQ
AIESGLESAD
EF101-3 (SEQ ID N0K391) TAAAAATGC ACAACAAGTA GCTGTCAAGC AAGCGCCTAA CAACAAATTT ATATTAACCG CAAAAAATAA CAAAGGATCC ACAGAATTAT TTACTTATGA AGCTI'AGCCA AATTAGAAAC GATAAATrTA ATAAAACTAA CA.ATATCAA.A CAGAAGTCGA GC'CGAT ITAG GAGCGAAGCA ATTCCAGAAG TAAAAGATGC TTAGCTGGAA AAGTGAATGA CTAACTTCTG TITCCAACAA **CCTCCTGAAG GCAACAGCGA AGTTATAGCG TCAAAATTGC CA.AGCAACCA TTGATTTAGG *.AAAGAGGGTG AACAGGCCTA *GTCCAAGTCG GGCAAGAAAA GACCGAGTGG TTATrTCTTC GCAGCGATTG CTTCTGATGA
AGATGACTGG
TGTCATCCAA
AGAGATTAAG
AGATGAGGCG
GAAGCGGGCG
AGAAGAAGAC
TGCAGTAGAT
ATATATTTCC
CAATTICACCG
AAAGGACTTG
TGTGGT1'GTG
TGCCGCGAGT
GTTGGCCAAT
CGATTAGGG
CGTTTTAGTG
TaCC;ACAAA GGAA7DGACT
CCGGAACAAA
GTGAAAAACG
CTCACAAAAG
AATATCTATA
CGCACGATGT
GAAGAGATTA
ACAAAGGCTA
ATTTACGGT
ACTAAAATTA
GATGGCTCAA
GGCAATCCAG
TTAGACAAAG
GCGA'ITGAGT
AATGATTTTG
ATGGCGATTG
ATTTTGACGT
TGGAAGCCTT
GCGATGTCGT
AAA=TAGGC
ATAAGTCGAA
CTGGTGATGA
CCTTCACCAA
A=TCAAAGG
TAACTTCAGA
GTGTTGGGCA
TT-rCTTACAT
AGGGCGGCAC
TCAAAAATGG
TACCGAAAAA
GAACCATCAT
AATCTGGCTT
GTGATATTGT
TGACAGATC
TCCCGACTTG
TGCCCCTGAT
AGATGCAGGC
ACAACAAAAT
TCCCCATT
TTTAAATGAA
TGAAACCTTA
TCGTGTATCA
ACATCTTTAT
AAAAGCTAAA
CGATGATAAT
AACGATGTCT
CTACCATATG
AGC-GATTCAA
TCGTCGTGAT
ACAATCAGCC
TGAATCAGAT
GTCGAAAT
AAAAAAACCA GTAAAAGTCG ATCAGCAACC AACCAATCAA EF101-4 (SEQ ID N~O:392) KNAQQVA VKQAPKDDWG IDYFDVPDLQ
S
a..
C
a
QIYINGVIQP
LAKLETKRAN
DLGAKQYIST
TSVSNNVVVD
ATI DLGDLGA
RVVISSKKPV
EQMEAFARDQ
IYNKWNRAID
KANFKGRVS I
GSISYIDDNP
IELPKKAIQK
KVGDIVESDA
KITKDPEIKV
KFNKTKEEDR
PEVKDANSP I
PEGNSDAASG
EGEQAYVLVN
AIASDESATN
EF102-1 (SEQ ID NO:393)
TAAACATITG
ATTAAT'ATA
TTAAGTCAAA
CACCCTCA-AG
TTTAGCGATG
GCAAAACAAA
GGTITATCAGA
TGGGAAAAGG
GAAGTGAACG
AGACATTCAG
TGAAAAAAAC
CAA ['rGG
AGGTAACAAT
GCAGCACGCT
CAGTATTTTG
AAAACCCTT
CTGGAACAGA
GTTATAAACT
AGGTGAATGT
AACA=IAAA
AACGATTGGT
TCATTATGAC
CTCAGAACGT
TATTGAACCA
GCCATCAATG
TATIGATACA
CCATTCCATA
KNGDVVDAGT
TMSCDDLNEQ
LRLTSEDLYL
NPEGGTTMSS
DFGTIIRRDV
ESMTDASK
CTCTTrTTA AAT'rGGTCGT
CCTACGATTG
CTAAGTAAAC
ACTACGTCAT
GGTGTTAGTA
TCTGATAAAC
AATATGGTTM
AAAAGATTAG
ELFTYEDE-AV
YQTEVDAVDE
AGKVNEKDLT
YSVKIALANL
QVGQENGDKN
TTACTCAAAA
TAITI'CGAC
CTITTGCCGA
TGTATGAAGT
TATATGCAGA
TI'CCAACAGA
CGAAACTAGT
CACAAAAGAT
GTGGTGC
ACGAAAGGGG
TTTGGCTCTA
TGAAATTACT
TGACGGAACT
ATACAATGGT
AGTGACGCAC
ATCGGTTCTT
GATTTGGGAA
AGTTGATATA
TKEIEAQQNS
EITFTNETLA
KISVGQKAKL
DKVKNGYHMQ
AIESGLESAD
TABLE 1. Nucieotide and Amino Acid Seqeuences of E. faecalis Genes.
AAATCTATTG
CATAATACCA
?I'AAAmTAT ATTr.GGAAT'C
AAAAAATCAG
ATGGCTGGTG
GGTTCTTrAA
TTCCATTTAG
ATTTCTATTT
CCAGATCCT
ATTTCCATGA
GTAGA.AACAG
ATTCGTAAAG
CGTGCGGAAA
GAAACTAAAT
TATCCCAATC
ACTGGGGAAA
GCTGAGTTTA
TGGAGTGAAG
ACTTTGGCTT
TTCTGGCAAG
TCCATCAAAA
AAAGAACA-AG
ACTGCCGAAA
GAAATCACAG
GGCTATCGTA
CCAGAAGGAT
GACGACTATG
ATTAAGATGG
AACCGTTTGA
TGGAAAGACG
AGATATAACT
GAAGCCACAA
ACAACCGCAA
ACCGCTGAAC
GTAGCCTTTG
CAAACCGTCA
GATGGTTCGC
ACCCATGATG
CCAGATGGTA
AAAGAATTTA
ACTAAGTTTA
AATGAAGATT
AGTACGCCAA
ACAGTGAAGA
WI'ATCTI'GA
AAGGAAAAAT
CTGTAAAAAC
CTGACT=GA
AATTAGTCCT
CTGGTACTGG
CGCTTGATAA
AGATCAAAAA
ATT=GGGAA
TGGATGGAAT
ATATCATTGA
CTTCGAAAA-A
GTACTGATCT
ACAGCCCAC
CACCAAAAGA
CTAGTAATGG
AAACCGTGGC
CCAC'ITGAC
AAGGAGCTGA
C'ITTTAAAAC
TAGATGAAAA
AAACCAAAGC
AAGTTGATAA
TTATTCGCTT
CTGGATTTAA
GTGCTGAAGA
AG'T~TGAAAA
TTCAAAAGAT
CGAAGAGTGA
TGACCOTTCC
TGCTTTATGA
GAAATAAAAA
TGCATGAAAT
AAGCTGCCCA
CGTTGGCGAA
AAGTTTTGGA
AAGCAGGCAA
ATTGTACGAT
AAACTTFAC
TACTGGATGC
CGAACAAAGA
CCAAAACCGT
CTTTTACGGA
TGAAAGAAAA
A.ACAACCGGA
CATTCCCGCA
TTTTTCGAC
V'AATAAGGCA I
.ATTITAGGT
rAAAGTCGTC
EACTCCAAAC
AACTCCACTC
GCCCAATACC
.ATCGATAAA
AGCTTTACCT
TCCCCATGGT
TACCACACCC
TATGCGACAA
I'TGGAATGAC
TGGTGAAATT
GCTTGCTAAT
I=CGTGAAT
TCI'GTTACC
AAAAGAAGAC
ATATACTCTC
AGAATTAGTG
GAACCAACTT
ACCTGAAGGA
TAACGAAAAA
TGGCCITIAT
CGACTTATCT
TAAAGCGACC
TCTTCC ITAT
TACACCACTA
GTATGTCTTT
TTACCAGAAA
'TTTGCCCGAG
ATTGAATACC
CAAAGAACAC
AGAAAAAGAC
CAAAGAGAAA
TAAAAGCATC
TGAGCCAGTA
TGAACGCCAT
TCATGGTCAC
CTCAAAACAA
AAT=TGAAA
AC ITGCGGAA
AATCAATTAC
ATCTCAAACC
AACACCAGCT
AACTGGGGAG
TGCTGGGTAT
.TTGAGGAGT2
AATCGACAA
~AAAATACGG
ECTAATTCCA2 ;CTTATAAAA2
PACGCTATTA
;AATCACGTG
TCAAAAGATG
A.CAAAGGTAA
ATCGCTGaCCA
AIAAGGTCAAA
~ATATrCTC
GTCCAAGAAA
GCTTTGGAAC
ACCTTCAAAC
AGTA.ACGTAA
AAAGATACCG
TTACTGCAA
AAGGGAACGA
GCCGTTAAAC
TATACTMG
AATGCCGTAA
TTCTTTAAAT
TTAAAGTG-T
ACAGCTTGTA
GGGATTATT
GAAATCCGTT
ACCATTACC 6
CTAACTAACA
AAAGAAGATA
CTTGA=rTA
TGGTATGTCG
GAAAAAGCCA
ACTGGAACTT
GTCTTGTTCA
GCGAAGGATG
GTTTCCATCC
GTGATGGATA
GCTTTCGAAA
TCTGGCAAAA
AAAGTAGATA
GAAAAAGATG
TTA.ACACCAA
GTTCCAAGTA
.AAAAATCCA
TATTTCTGGA
TCAAAA.AAA I
TTTAATAGA
:GAATATAGA
kATCAGGAAC2 kAGCAGGACT kAATTAATGTC kTATTGTACC rGACAACAGA
CTATTACTGA
CCATTAAAGC
TTCTTTTAGA
I'AGCTGGAAA
'A.ACAACGGA
TGGGA CCTA
CAACAAAAGT
AAGGGCAAAA
GTAATGAGAG
AAGATGGTCA
AAGCTTCTGA
ACCTAGCAAT
ATGAA.ACGAA
TTACTCGAGA
TTGCTGGATC
CGCCATTGGA
ACGAGCAATT
TACTTGAAGA
CTACATTTAA
AAGAAGGACA
ACGAGTFII'C
GTTTGACTI'C
CCGAGCTAGT
TAGCTCAAGC
AACCOGTGGT
GGAAAATTCT
ATTATGTGTA
CTAGCTTGAA
AAACAAAAC
TG'FIGATGA
CAATTCTGTA
TTCACCATGA
CC:GGAAAGTA
GAAACGTGAA
AAGAAGTGCC
ATTCTCAAGA
ACGTTCTACT
ATCGCCGCAA
CCAAGTTTT
CAAAAATGAA
ETACCGTGTA
kTTAA.CATTG
LCAAACTGTG
GGAAACTAAG
AGAAACGGTT
TA.AAGATGGG
~AAATCGGTG
GGGCGAGACC
GAAGACTGGG
TACAT?1GCC
TGAAAAAGGT
TTACGTGACA
CGAGTTAAAA
CCAAGAAATTI
TCAAGGGAAA
AGCTG'TAAA
TGAAACAGTG
TAACGAGTAT
GTATCCTGTA
TGTTACGGCA
GGCTGATGGC
AGGGACCAAN
AGGTTTTGAT
AATAGAGGCT
GGAAAACAAA
AAAACAACCA
TGTTAGTCTG
TCTTGCGACT
TGATAALATTG
CATTGATGTG
GATTGCCGAA
CCATAAATA
TGAA.AACAAG
CAATCAAGCA
CCACCTAGAA
TGTGTCGGT'
TGCTTTACTA
AGTGAATCAT
TCCAGAAGGA
TGGAAAACAC
AACCATACCG
ATCTAGTCCC
GTTAG~TTGGC
TTAA
EF102-2 (SEQ ID NO:394) MKKTTFKN WSLFATLALL SQTIGGTIGP TIAFADEITH PQEVTIHfYDV SKLYEVDGTF SDGSTLSERT TSLYAEYNGA YQKNPLPSMS DKAKLVSVLW EKACTDIDTN MVAQKI4IWEE SIEGKINKAI EEYQKKPSFH NTTVKTILGQ STTLIDKNEL GNQLVLTPNS NSKSGTLTLK KSAGTGTPVA YKKAGLQTVM
KQTVFCIEPG
VNGYKLHSIX
NLSEFDKVVQ
AGALOKPNTY
VS IPTEVTHG
RLGGASVDIK
NTAN IDYRV I
AIKINVETKG
TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faeca fis Genes.
SLKIKXI DKE
DPYMIDTTPM
RKDSPAGEIV
ANQTVALVTS
SEAFKTELVI(
IKKVD)NNEKN
ITGAEDKAT
DYAKSEYVFT
KDGNKKLNTL
TATLANKEKT
TVNCTIERHV
DGTNKEIWKS
EOLKEKSQTL
IL IFSTAGYY SGD IVPETVF
AATIKAGETI
QEITTDEKGR
NVKGQNQEIT
GTKASDETVT
AVITRDVTAK
ACNEQLGFDG
ITEEGQKQPI
DFTELVDKLR
GTWKILRKLT
SIQTKAHLED
GKIEHEVNDK
TPKEVPTI PS
FWNRRN
HLDFGKAL.PS
SMTSKNMRQK
AETPKELANA
GETTLTKEDK
LALDEKNQVA
EQVIRFGFDF
YGKFENLPYG
KMVTVPYEKL
YNLHEIKEDW
AEQVLDKS IV
GSQTFTHGDV
EFTKTVLAEK
TPKQPETPAV
KDVTTDKDG I
GQILLEKTGV
LE-LGTYVVTE
DTGNESQGKA
VK}(LAINEYF
FKFAGSADGT
DYLLEEIEAP
TNNEFSVSLN
YVVAQAIDVE
LFN.YVYENXV
MOMFDD VS VT
VDTGKYPEGT
PSNSQESSPT
SILDGIPHGT
ETGTDLWNDN
TKSSNGFVNT
EFKGAEYTLF
WQETKAPEGY
AETGFNDLSF
EGFQKITPLE
RLMLYOLPEK
ATKAAQEKDE
AFEAGNEPVA
HD-VLJJGSKEA
KFTFTE INYE
VKTFPQTEY(
KVTITEXSVP
YSLAGNTFAI
FKPTKVELKY
TAXKDGQAVKW
TLDETKYPVS
KVS PLEGTXE
IRSTFKENXD
EDSLTSLATW
KAKPVVIAET
KDASLNNQAQ
FETILYALLP
KDGNVNGKHN
NSNVLLLVGF
EF102-3 (SEQ ID NO:395) TT TAGATGAAAA GAACCAAGIT GCCGTTAAAC ACCTAGCAAT TAACGAGTAT a.
a.
a..
a a.
a a a a a a a a
TTCTGGCAAG
TCCATCAAAA
AAAGAACAAG
ACTGCCGAAA
GAAATCACAG
GGCTATGGTAL
CCAGAAGGAT
GACGACTATG
ATTAAGATGG
AACCGTTTGA
TGGAAAGACG
AGATATAACT
GAAGCCACAA
ACAACCGCA.A
ACCGCTGAAC
GTAGCCTTTG
CAAACCGTCA
CATGGTTCC
ACCCATGATG
CCAGATGGTA
AAAGAAFI'TA
ACTAAGTTTA
AATGAAGATT
AGTACGCCAA
ACAGTGAAGA
AAACCAAAC
AAGTTGATAA
TTATTCGCTT
CTGGA'ITTAA
GTGCTGAAGA
AGTTTGAAAA
TTCAAAAGAT
CGAAGAGTGA
TGACCGTTCC
TGCTTTATGA
GAAATAAAAA
TGCATGAAAT
AAGCTGCCCA
CGTTGGCGAA
AAGTTTTGGA
AAGCAGGCAA
AITGTACGAT
AAACTTTTAC
TACTGGATGG
CGAACAAAGA
CCAAAACCCT
C T=ACGGA
TGAAAGAAAA
AACAACCGGA
ACCTGAAGGA
TAACGAAAAA
TGGCTTTGAT
CGACTTATCT
TAAAGCGACC
TCTTCCTTAT
TACACCACTA
GTATGTCTIT
TTACGAGAAA
TTTGCCCGAG
ATTGAATACC
CAAAGAAGAC
AGAAAAAGAC
CAAAGCAAA
TAAAAGCATC
TGAGCCAGTA
TGAACGCCAT
TCATCGTGAC
CTCAAAAGAA
AA'ITTGGAAA
ACTTGCGGAA
AATCAATTAC
ATCTCAAACC
AACACCACCT
TATAC=TTGG
AATGCCGTAA
TTCTTTAAAT
TITTAAAGTGT
ACAGCTMTA
GGGGATTATJT
GAAATCCGTI'
ACCATTACCG
CTAACTAACA
AAAGAAGATA
CTTGA=ITTA
TGGTATGTCG
GAAAAAGCCA
ACTGGAACII'
GTCTTGTTCA
GCGAAGGATG
GTTTCCATCC
GTGATGGATA
GCTTTCGAAA
TCTGGCAAAA
AAAGTAGATA
GAAAAAGATG
TTAACACCAA
GTTCCAAGTA
ATGA-AACGAA
TTACTCGAC;A
TTCMTGATC
CGCCATTG'-A
ACGACCAATT
TACTM'AAGA
CTACATTTAA
AAGAAGGACA
ACGAGTTIC
GTTI'ACT'C
CCGAGCTAGT
TAGCTCAAGC
AACCGGTGGT
.GGAAAATTCT
ATTATCTGTA
CTAGCTTGAA
AAACAAAAGC
TGTTTGATGA
CAATTCTGTA
TTGAGCATCA
CCGGAAAGTA
GAAACGMAA
AAGAAGTGCC
ATI'CTCAAGA
GTATCCTGTA
TGTTACGGCA
GGCTGATGGC
AGGGACCAAN
AGGTTTTGAT
AATAGAGGCT
GGAAAACAAA
AAAACAACCA
TGTTAGTCTG
TCTTGCGACT
TCATAAATTG
CATTGATGTG
GATTGCCGAA
GCATAAATTA
TGAAAACAAG
CAATCAAGCA
CCACCTAGAA
TGTGTCGGTT
TGCTTTACTA
AGTGAATGAT
TCCAGAAGGA
TGGAAAACAC
AACCATACCG
ATCTAGTCCC
EF102-4 (SEQ ID NO:396) LDEKNQVA VXHLAINEYF WQETKAPEGY TLDETKYPVS
IKKVDNNEKN
ITGAEDKATT
DYAKSEYVFT
KDGNKKLNTL
TATLANKEKT
TVNCTIERHV
DGTNKEIWKS
AVITRDVTAK
ACNEQLGFDG
ITEEGQKQP I
DFTELVDKLR
GTWKILHKLT
SIQTKAHLED
GKIEHEVNDK
EQVIRFGFDF
YGKFENLPYG
KMVTVPYEKL
YNLHEIKEDW
AEQVLDKS IV
GSQTFTHGDV
EFTKTVLAEK
FKFAGSADGT
DYLLEEIEAP
TNNEFSVSLN
YVVAQAIDVE
LFNYVYENKV
MDMFDDVSVT
VDTGKYPEGT
AETGFNDLSF
EGFQKITPLE
RLMLYDLPEK
ATKAAQEKDE
AFE.AGNEPVA
HDVLDGSKEA
KFTTE INY
KVSPLEGTXE
IRSTFKEN!(D
EDSLTSLATW
KAKPVVIAET
KDASLNNQAQ
FETILYALLP
KDGNVNGKHN
199 TABLE 1. Nucleotide and Amino Acid Seqeucnces of E.faecalis Genes.
EDLKEKSQTL TPKEVPTIPS TPKQPETPAV PSNSQES8PT VX EF103-1 (SEQ ID NO:397)
TAAGATAGGT
ATTACTTTAG
CATCATGCAC
CAAACAAGAG
GTTTATGATG
AAATATGATG
GGTGACGAAG
ACACAAAATT
AAGCGTI'TAG
TATCATGGGA
AAAATI'GTGA
AAAGTCGTAG
GCGAAATTG
CGCGGCGACT
ATTGGTACGA
ACGTATACTC
GAACCTI'ACC
TTATCAAAGA
TAACATTAAG
CCAATCCAAA
GCGACAAGGC
CTGCTGGTAA
GTGAAACCGG
GAACATTT'I
ATCAAGCCGT
GGAAAGATAA
AAA-AATTAGC
CAAATAAATC
ATAAAAACGG
ATCCAAACAC
TTGGGTACTT
ATCGTTACGG
GAATGGGTAA
AAGGCACATA
AAAGGAGCCA
TACGTI'ACTT
TACTGCTACA
GACTAAGAT
TGAT'ITAACG
ITTTTACGAG
TGTGACAGCT
AGTGGATTTA
ACTGGGGAAT
1TMACAAAT AGGG3-ATAAA
TAATGATITG
AAGTAAATAT
CCAAGTGGTG
CGCGGCGCTA
AGATAATCCT
TCATCCAGCC
TCGCTTTATCA A.AAAGAAAG-T ATAGCAGGTT ATGCAAGTCC GCAAAT1rrAG GCAAACATCA -TATCTGGCA CGGACTGGCA GCAGAAAATG CTAATTTTAT TITTTCGACA AAAATACTGG CATGGCACAA AACGAATTTT ACCGAAGTGA CTAAAGACNA GATGTTGAAG TTrACGTGGA GGACGTG;AAG CATTAACCAA ATrTTAGGAA CAACCTTGTG ACAGCGGCCA ATCAAAATT GAATTTTCA ATTTACAAAC GACAATAACA AGATTZCGGC GAATTAACGG AACTAAACAA GGTAATGATA 'ITCAAGTGTT TTTACTT-TCT AA
ATTAAGTTCG
AGCATrrGCA
AAACAATGGC
AGGAACCCGT
TGGTTTAGCA
GGAAACCCGT
AATTTCGCGG
ATTACTTAC
ACACATCCCT
TCAAACTGGC
GAATGGCACA
CATTAGT=A
AGGTGAAACC
CCATGTATCT
TGATCGA'IT
CGTGGAACAT
EF103-2 (SEQ ID NO:398) a.
a a a MKKKVLSSI TLVTLSTLLI AGYASPAFAD HAANPNSATA NLGKHQNNGQ TRCDKATKIL SGTDWQGTRV YDAAGNDLTA ENANFIGLAK YD)GETGFYEF DEGTFFVTGD GTKRIL.ISRT QNYQAVVDLT EVSKDXFTYK RLGKDKLGND HGKKLAFTNG REALTNQTGK IVTNKSGDK I LGTTLWNGT.K VVDKNG;NDVT KFDPNTSKYE FFNLQTGETR GDFGYFQWVD NNKIRAHVSI GTNRYGAALE YTRMGKDNAG NDIQVFVEHE PYQGTYHPAF TF
FDKNTGETRG
VEVYVE14I PY AANQNFI SLA
LTELNNDRFT
EF103-3 (SEQ ID NO:399) TCATGCAG CCAATCCAA-A TAGTGCTACA GCAAT~AG GCAAACATCA AAACAATGGC
CAAACAAGAG
GTTTATGATG
AAATATGATG
GGTGACGAAG
ACACAAAATT
AAGCGTTTAG
TATCATGGGA
AAAATTGTGA
AAAGTCGTAG
GCGA.AATITG
CGCGGCGACT
A'ITGGTACGA
ACGTATACTC
GAACCTTACC
GCGACAAGGC
CTCCTGGTAA
GTGAAACCGG
GAACATTTTT
ATCAAGCCGT
GGAAAGATAA
AAAAAT1'AGC
CAAATAAATC
ATAAAAACGG
ATCCAAACAC
TTGGG'ACTT
ATCGTTACGG
GAATGGGTAA
AAGGCACATA
GACTAACATT
TGA=TAACG
T'I=rACGAG
TGTGACAGGT
AGTGGATTTA
ACTGGGGAAT
TTI'TACAAAT
AGGGATAAA
TAATGATGTG
AAGTAAATAt
CCAAGTGGTG
CGCGGCGCTA
AGATAATGCT
TCATCCAGCC
TTATCTGGCA
GCAGAAAATG
TITCGACA
GATGGCACAA
ACCGAAGTGA
GATGTrGAAG
GGACGTGA.AG
A'ITTTAGGAA
ACAGCCGCCA
GAATTI'TCA
GACAATAACA
GAATTAACGG
GGTAATGATA
T
CGGACTGGCA
CTAA'TITAT
AAAATACTGG
AACGAATT
GTAAAGACNA
TTTACGTGCA
CA ITAACCAA CAACCTrGTG
ATCAAAAT=
ATIACAAAC
AC;ATTCGGGC
AACTAAACAA
AGGAACCCGT
TGGTI'TAGCA
GGAILACCCGT
AATTTCGCGG
ATITACTTAC
ACACATCCCT
TCA.AACTGGC
GAATGGCACA
CATAGTTITA
AGGTGAAACC
CCATGTATCT
TGATCGAT
TTCAAGTGTT CGTGGAACAT EF103-4 (SEQ ID NO:400)
HAANPNSATA
TRGDKATKIL
DECTFFVTGD
NLGKHQNNGQ
SGTDWQGTRV YDAAGNDLTA ENANFIGLA( YDGETGFYEF FDKNTGETRG GTKRILISRT QNYQAVVDLT EVSKDXFTYK RLGXDKLGND VEVYVEHIPY 200 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
HGKKLAFTNG RE-ALTWQTGK IVTNKSGDKI LGTTLWNGTK VVDKNGNDVT AANQNFISLA KFDPNTSKYE FFNLQTGZTR GDFGYFQVVD NNXIRAHVSI GTNRYGAALE LTELNNDRFT YTRMGKDNAG NDIQVFVEHE PYQGTYMPA EF104-1 (SEQ ID NO:401) S S 'S 55 S 5
S
.55SS
S
S
S
*5 5
S.
S.
TGAAAGGGGA
GCACAAAATT
GAGACCACTG
T'ITAGCAACAA
ACACAGGCGA
GAGGACAAGG
GATGAACTAC
GAGGTTCACC
AACACAGC'IT
TCAACTGTCA
AAAACATTAA
AAAAGTTATC
AAAAAAACCT
ACCAGAACGA
AATCAAACAT
AACTTGGAAC
ACGCAAACCT
ATTGCTGTAA
ACAATTTATT
ACAAAAATTG
ACTGCTAAGA
TATATCAGTT
ACAAAAGTGG
AAAI'TACGG
AAAACTGAAG
ATTTCTGATT
GTAATTGCTC
GTTGTTAATC
TCATCAAGAA
ATTAAAGTAA
TACAAGTI'AG
ACTACGCCAA
TTGCCAAAAG
TTWAACTCCAG
CAAAGTAGTA
GTCCGTACAA
GATCAGGTAG
ATGACGCCAA
ACGTTTGATT
TGGATCGACG
AATGAAGGCT
CAAGCGGCGA
GATAATAAAA
AATGCTCAAA
TTAGAGAAGA
ATTACGATI'G
ATCGACTCTG
ATCACATTGA
TTAGTATGALA
TCGGGTTGC
ATACAGCAAA
CGACTACTTC
GCACAACAC
ACACCTCACT
TGACAGTAT
GCGAATTAAA
ATGCACCTGG
GCGGAGAAGT
GTATTACCTT
TAACAACAGA
ACTCG TIGA
CCGGGTTAGA
TAGAATT
CTCAAGTr=r
TGTTAACACC
CTGTTCCAAA
TAGAGAGTGC
GGTCAATTTC
CGAGTCAAAC
TTCAAAGCAA
GTCAACAAAT
CGTATGGTCC
GTGGCAAGCT
TAACAATGGA
CTAATAAAGC
CTITrGAATGC
CAACTGTCTC
AGCATCCTAA
GA.ACGGATTA
TAACCAACGA
ATAAAAGTAT
TTGATACGAC
AAAATCAATT
AAATTCCAGC
ATTCAATTTA
ACAGCCCTCG
TTGGAAAAAC
TGCCAACTCT
CTGCTTCGGT
ATCCAACATT
CACATCGTGT
TCGATTTGAA
CTACAAACGG
AATACAATAC
AA.ACATTGAA
AATTCTCAGA
GAAAAAAACT
CGTAAATGCC
AAAAGAGGCA
AGAAATGAAT
TTCCAGTGAT
TAATGAAAAA
GAGTCTTGCG
TACAACACCG
ATCAGTTI'AT
GACAGGCATT
TAATCAACGA
GGCGGAACCA
=FATATGAA
TGGCGAAATT
AACAACAGAG
TATGCTGTAA
GAGTTATCGA
CAACCAACTG
GCTGCTACAC
GCf CTGC CAG
CCGATTGGTG
GTAACCGCTA
ACAGTTCAAT
GGCGCAACTT
GTTTCAGATA
TTAATCAAAA
CAAGTGGAAC
TTT-TATAAT
ACGCCAGCGC
CA.ACGACAGA ACCACAAACA ACTCAAC!ACC ATCTTTACCT
CAACAACTGA
CATCTGAACA
ATGTTCAAGC
GAACAGAATA
CGTTCCAATT
TACCAGAACA
GGGCAGTCGA
CTTCC TTTAA
TTGAAACTCC
CAATTCAATA
TAGACCGGAC
CTGTCTTTIGG
ATGGTCAAAT
GCGATAATC
ATTCCTTATC
ATCGCAAACC
ACAAkACAACG
GCCAATTACA
CAGCCAAACA
TGCTGTTGGA
'r=AGGTTAT
TGCGGCGACC
AGTAGAACTA
AGGAAAAAAT
TAACGAACGA
GTTAACTlCGC
AAAACAAGAT
=TACCAGAA
ACTCGGGCGG
CATTAATCGG
CAGFTACGAT GTCGACATTA TGGCAAAGAT TATACATTAA CATGAATCAA CAAAAACCCT TTCGGACTAT AACTACTTAT AT7CGCAGCA GTATCCAACA TTTGAAAAGT ACGACAGGAA AAGTAAACTA ATTGCTGATC
AGGGAAATAT
CGTATTAGAG
TTTATATGAA
CACTTAACG
TTTTGACAAG
CATTCAATTA
TGAAACTGCT
AGTTATGGGA
TTATCTTTICA
TACAGTAACG
AATCCAAATTr
CCCAGTCGAT
AGTAACTACT
TATGTAACAA
AGTACAAACG
AATGTAAAAT
GCCACAAA.AC
AAGGACATTA
GTATCCGATC
TGGGGTA-AT'
AGCAAAGAGA
T'TACGAGCTA
CCAACCTCAG
CCAM'I'GGTT
ACGATACCGA
AATAGTAAGC
TTTTCI'TTG TGATGTTGAG TATACTTCTC
CTAAACAAAC
GTGAAATGCG
TTATGGCAC
GTCAAGAAAT
TGGAAGACTA
ATAGCTATTT
ATCTATCATT
AATATATrGA
ATGATCAAAA
AACCGATTCA
CAAAAGAAAT
ATGGTrCAGT
LTAATTATGT
TAACAATGAG
GTGGTTCTGA
ATTTTGA
ACATI'ATGA
GTCAATACTT
ACCAAAATAC
AGTATAAAAA
AAGAACCACA
TAGACATTTT
CAGTAACAAC
TAACACCAAA
TCCCAGAAGA
ACTATACAT
AAATCTATAC
AAAAAGTCAC
TTTATTTAGC
AACCGATTTT
TAGTATGTCC
GTTAACAGAA
TAAGAATCCT
TTPTGATATT
AAGAATAAAT
AAGTACACCT
ACCAATTAGT
TG-GTGCCTAT
AAATTTAGAA
TTATTTTTAT
TA'ITAAGTTC
GCCAGATAGT
TGCTGAAGGT
ACGAACACE I
CTCACTAAGC
TGTrrCAAAC
TGATAALACCA
CAATACTT1AC
CGCCAATGGC
ATCGAATAAT
GAGTGCAACA
AAAAAATATT
AGGCACAACC
TGCTTATTCA
CACAGAAAAC
AGAAACAACA
CACTGCGCCA
AACTGCCACA
CCTTGTCAAT GCACGAAATG TGGCGCCGAT GTTCTTTTTG
TCCACAATAC
ATATCCAACG
CAACAAACGT
TTATATAACA
TI'CTGTTCA.A
AAAAAATGTA
GAAAAATC-CA
TTCTATTACC
TGCGAAAGTC
GGTCTCTGCA
CCAGATGTCT
AGGTGATGCG
TGGGACC.GCG
ATTACTTTTG
TACATTATIG
GGGACAGCGA
A.ATCAAGCGT
ACAAAA.ACGA
ACGATTGAAT
GTGAAAGGCG
ATI
5
ITAAAG
AACGCTGGCC
GCTACCAAGA
GAAGGTATI'
201 TABLE 1. Nucleotide and Amino Acid Seqeuences; of E. fiacalis Genes.
?I'CTACACGC
ATTGATAATG
=TGAT-GCCA
TCCGTGGATG
CTAAAAATTC
ATTTATGTCG
GGTCAAGACG
GGCG'FTTATC
AAACCCGACC
AAACCAGAAG
AAAAFI'GATG
AAAAATGGCA
GTTAAAGATA
GCAACAGACA
GATACTAGCA
GCTAATGTGA
GTGGGCGATA
GATGTCCCGT
TATGAAATTG
GACAGTCAGT
GATAACTTCG
GTTCAGGGAA
ACAAAAGAAG
ACAACGATT
AAAACAGGTC
ATAACGTAGA
TCACGCATAC
TCATGACAGG
AAGAGTATTT
CATTAACGAA
GCGATTCATG
TTCCCTI'CGA
CAATTATTTA
AATCTAAG PP
ATAATTTCGT
TTCAGGGAAC
NAAAAGAAGC
CAACGATTI'A
AAACAGGCCA
AAGCAGGCGT
CTGTCAAACC
AATGGGAACC
TTGAAAAAAT
TCTATAAAAA
TAGAGGTCAA
TTTCCGCGAC
CAGTr.AATGT
CGAAAGCAAT
ATGTCGGCGA
AAGATGTCCC
GGATGAAAAC
AGCAACCGAA
TGATI'ATACT
GACACGPAAA
AACAATTGAT
GAAACCAGAA
AAAAATCACT
CAGTGACGAA
AGAGGTCAAA
TTCAGCGACA
AGTGAATGTT
GAAAGCAATC
TGTTGGTGAT
AGACGTTCCG
TATCCAATC
CGACCAATCT
AGAAGATAAT
TGACGTTCAG
TCGCACAAAA
AGATACAACA
AGACAAAACA
TGATAAAATA
CGTI'CATGTC
TTCNTGGANA
ATTCGAAAAA
CAAGCAATTG
=TACAACAG
'TrGCGAGTAA
GCCATTAAGC
CACAGTCGTT
GAGAACTTTG
GTTTCAGGTC
GGTAAAGAAG
GATACAACGA
GACAAAACAC
GATAAAATAG
Ga7-CATIGTCC
TCGTGGAAAC
TTGAAAAAA
GTI'TACAGTT
AAGTTAGAGG
TTCGTTTCAG
GGAACAGTGA
CAAGCGA.AAC
CGAAGGTTTC
ATGAAAAAGG
CGAATGT.ACC
TSCGTCAXAGG
TACAAGTCAA
MIrCAGCAAC
AACTTGATAA
AAACAGCCTA
TI'ATGTTGG
GTCAACAC-GT
GCGAF1'ATGA
GTGATGACAG
CAGAACATAA
TCACT-CT'FC
ACGAAGGTAA
TTAAkAGATAC
CAACAGACAA
ATG=GATAA
CAATCGTTCA
TTTTGAACTA
TCAATACTCC
GCAGGA.ATAT
AGACAACCAA
AGATTCAACG
AGATAAAACA
CANCAAAGCA
TGTGACCGTC
TGATTCGTGG
NCCGTTTGAA
AATTGTCTAT
TCAG FTAGAG TTITCG ITTCA
AGGTCAAGTT
AGAAGAAACA
AACGATTTAT
AACAGGTCAA
AATAGGCGAT
TGTCCGTGAT
GGAAGCAGAA
AAAA.ATTGAT
TAAAAATGGC
AGTCAAGGAT
AGCNACAGAT
ATATGTCG GTGATAAATC GGTCAAGACG TTCCGT=GA GGCGATTATG AA.ATTGTCTA CGTGATGATA GTCGI'TACA CCAGAAGNGA ACTTTGTTTC
ATCACTGT
9* 9 9 99 9.
9 9 9.
99 9 S 99 9.
9 999 9 9a *9 9
S
9 *99~
C
99*9 EF104-2 (SEQ ID NO:402) MKKKTF SFVMLSILLA QNFGFAVNAY AVTTTEAQTE TI'DTAKXEAE- LSNSTPSLPL
ATTTTSEMNQ
ELLDSMSLAP
TVSGEVTG IG
KTYSFDLYEQ
LEPQVFSYDV
IYLESASDYN
ISFQSKGXYY
TEGGKLTLTA
VNPLNAETAW
KLGTDYTVTP
TP VOTTVTTN QVDS IYPQYN
IDVPTLYITG
NKTHRVKNPT
T IEYNT'JSAN
YTHNVEDENQ
VDEEYLTGKA
QDVPFEKITV
PEDNFVSATD
KDTTIYVGDS
NVWVKPDQSK
E IVYKNGTKE QG IVNVDKIG
TGQDVPFEKI
PTATTESQTT
IGGTEYSQTE
ATWAVDAATK
VEPIQYNERT
DINGQILPET
YLYSQQYPTT
VTIYGTLTET
TKDSYLRINI
GNYDQNGAYS
TSDGSVIKFT
SKRGSERTLQ
DRGQYFDKPM
TAKEPQSNNN
IELTPKGTTN
AGQIYTETTI
AIA.KVSFELI
IKLVKGDNQL
SGQVDNXKAG
KTGQDVPFEK
WKPEDNFVSA
LEVKDTI'IYV
AKAIVHVRDD
DYE IVYKNGT
TV
EASTTASSDA
VHRELNT'PV
TLSITFNQRV
RTTGLOGEIF
QTLLTPGKDY
KICSISLKST
KVGQQIVLES
SDLTMDFDKK
SRTTVSVMGS
TPITNEIQIP
SSKNQFLVNA
TPNSPGYPTI
ECSASVSVQN
AQIDLNS ITV
DSETLNQMSA
DNV'THTATEF
KIPLTKTIDH
VYPI IYSDEG
IDVQGTVNVD
TDKTGQDVPF
GDKWEPEDNF
SQLMVDTI
KEAXKAIVHVR
ATPSEQQTTE OKDTSLNEKA TATFQFAVGN TGYAPGSVYT SDTSFKV-L( SYLTI'EAEPL YNLDRTLTGN QTLELLTI'ET TLSDNSLGRI AVTVPNMNQQ TGTKQTTDFT AXTSQTSKVI TNGQE IKNPK
DINLSLSTPV
I<EKPIQNLEI
IGFNYVPD)SL
RNDSFDSLSV
TFDENTNSYT
EALDILSATQ
KGVPEDAYSL
SKXKVITAP I
?TDEKGQYSF
SRLQVKDSTI
KEETAYVTVK
RIGDYEIVYK
EKITVSGQVD
VSATDKTGQD
YVGDKWE-AED
DDSRLQVKDT
FTAYGPLYEN
IGNKAIQLV
KVKHPNYLSL
PKDKSIPVDT
RTXIPAGADV
FDFGKTNKRY
AANPTLKNVr
EKTTNCAXVI
TLKFSEGDAE
DAIMTGDYTL
YVGDSWKPEE
PDQSKLEVKD
NGXKEAKAIV
TSKAGVYPIV
VPFEKIDVQG
NFVSATDKTG
TIYVGDSWXP
LPDVQAPITD
VQLPEHLGYS
IKIETPGKNK
PGAVFGKQDN
KAYSLSINRT
ADREMRSMSY
VKLEDYFDIK
SDQYIEPISV
RATKE IYFYY
IPITMSAEGL
LFD IYD)VSND I IEYKNANGW
KTTVTTKNID
FKDYTLTENI
GIVYLATATF
RVTNVPQEYS
NFVSATDKTG
TTIYVGDSWI<
HVRDDSQLE:V
YSYEGKEETA
TVNVDKIGDY
QDVPFEKIDV
EXNFVSATDK
9999
S
S. 9 99 a 59 202 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faccalis Genes.
EF104-3 (SEQ ID NO:403) TGTA.A CAACGACAGA AGCACAAACA 9.
9 9 99 .9 9 9 9 9 99 99 9. 9 9.
99 9 999 9 9 9@9999 9 .99.
9 9999 9.*9 9 9
GAGACCACTG
TTAGCAACAA
ACAGAGGCGA4
GAGGACAAGG
GATGAACTAC
GAGG?1'CACC
AACACAGGTT
TCAACTGTCA
AAAACATTAA
AAAAGTTATC
AAAAAAACCT
ACCAGAACGA
AATCAAACAT
AACTTGGAAC
ACGCAAACCT
ATTGCTGTAA
ACAA'ITTATT
ACAAAAATG
ACTGCTAAGA
TATATCAGTT
ACAAAAGTGG
AAATTTACGG
AAA.ACTGAAG
ATTTCTGATT
GTAAITGGTC
GTT-GTTAATC
TCATCAAGAA
ATTAAAGTAA
TACAAGTTAG
ACTACGCCAA
2TTGCCAAAAG
TTAACTCCAG
CAAACTAGTA
GTCCGTACAA
GATCAGGTAG
ATGACGCCAA
ACGTITGATT
TGGATCGACG
AATGAAGGCT
CAAGCGGCGA
GATAATAAAA
AATrGCTCAAA
TTAGAGAAGA
ATTACGATTG
ATCGACTCTG
ATCACATTGA
TTCTACACGC
ATTGATAATG
T'ITGATGCCA
TCCGTG.GATG
CTAAAAATTC
PLTACAGCAAA CGACTACTTC A~ GCACAACAGC T ACACCTCACT T TTGACAGI'AT C GCGAATTAAA2 ATGCACCTGG IZ GCGGAGAAGT C GTATTACGTT2 TA.ACAACAGA C
ACTCGTTTGA
CGGGGTTAGA
TAGAAT 'ATT
CTCAAGTTTT
TGTTAACACC
CT-GTTCCAAA
TAGAGAGTGC
GGTCAATI'TC
CGAGTCAAAC
TTCAAAGCAA
GTCAACAAAT
CGTATGGTCC
GTGGCAAGCT
TAACAATGGA
CTAATAA.AGC
CTTTGAATGC
CAACTGTCTC
AGCATCCTAA
GAACGGATI'A
TAACCAACGA
ATAAAAGTAT
TTGATACCAC
AAAATCAXI-r
AAATTCCAGC
ATTCAATTIA
ACAGCCCTGG
TTrGGAAAAAC
TGCCAACTCT
CTGCTTCGGT
ATCCAACATT
CACATCGTGT
TCGATTTGAA
CTACAAACG
AATACAATAC
AAACATTGAA
AATTCTCAGA
ATAACGTAGA
TCACGCATAC
TCATGACAGG
AAGAGTATT
CATTAACGAA
LAAAGAGGCA G CGAAATGAAT C ~TCCAGTGAT G ~AATGAAAAA G AGTCTTGCG C ACAACACC GC ~TCAGTTAT A~
;ACAGGCATT
'AATCAACGAC
~GCGGAACCA I E1'TATATGAA C rGGCGAAATT I kACAACAGAG I~ :AGTTACGAT C rGGCAAAGAT
CATGAATCAA
TTCGGACTAT
m'GAA.AAGT
A.AGTAAAGTA
AIGGGAAATAT
CGTATTAGAG
=TATATGAA
CACTTTAACG
TITTGACAAG
CATTCAATTA
TGAAACTGCT
AG2ITATGGGA
TTATC=TCA
TACAGTAACG
AATCCAAATT
CCCAGTCGAT
AGTAACTACT
CCTTIGTCAAT
TGGCGCCGAT
TCCACAATAC
ATATCCAACG
CAACAAACGT
TrATATAACA
TTCTGTTCAA
AAAAAATGTA
GAAAAATCCA
TTCTATTACC
TGCGAAAGTC
GGTCTCTGCA
CCAGATGTCT
AGGTCATGCG
GGATGAAAAC
AGCAACCGAA
TOATTATACT
GACAGGAAAA
AACAATTGAT
AGTTATCGA A AACCAACTC C CTIGCTACAC C CCCTGCCAG A :CGATTGGTG G ;TAACCGCTA C LCAG T--CAATI ;CGCAACTT G ;TTTCAGATA C ~TPATCAAAA 'I AAGTGGAAC C 'TTTATA.ATT 'I ~CGCCAGGCG C
TCGACATTA
CATACATTAA C AAAAAGCCT .ACTACTTAT I k.CGACAGGALA C kTTGCTGATCC IATGTAACAA2
%.GTACAAACG
AATGTAAAAT2
GCCACAAAAG
AAGGACATTA
GTATCCGAtC
TGGGGTAATT
A~GCAAAGAGA
TTACGAGCTA
CCAACGTCAG
CCAATTGGTT
ACGATACCGA
AATAGTAAGC
GCACGAAATG.
GETTTTTG
TGGGACCGCG
ATTACT1'TG
TACATTATTG
GGGACAGCGA
AATGAAGCGT
ACAAAAACGA
ACGATTGAAT
GTGAAAGGCG
ATTIW-TAAAG
AACGCTGGCC
GCTAGCAAGA
GAAGGTATTG
CAACCA.ATTG
TTTACAACAG
TTGCGAGTAA
GCCA2LTAAGC CACAGTCG2T
.CTCA.ACACC
AACAACTCA
ATCTGAACA
.TGTTCAAGCC
AACAGAATAC
:CTTCCAATT2 'ACCAGAACA2 ;GGCAGTCGA2
:TTCCTTTAA
~TGAAACTCC
:AA'IrCtAATA
'AGACCGAC
TGTCTTTGG
~TGGTCAAAT
;CGATAATTC
.TTCCTTATC
~TTCGCAGCA4
TAA.ACA.AAC
;TGAAATGCG
=rATGGCAC
;TCAAGAAAT
CGGAAGACTA
kTAGCTA II
%TCTATCATT
N.ATATATI'CA
A.TGATCAAAA
.AACCGA7TCA
CAAAAGAAAT
ATGGTTCAGT
TTAATTATIGT
TAACAATGAG
GTGGTTCTGA
ATrCTTTTGA ACA ETTATGA
GTCAATACTT
ACGAAAATAC
AGTATAAAAA
AAGAACCACA
TAGACATI=
CAGTAACAAC
TAACACCAAA
TGCCAGAAGA
ACTATACATT
AAATCTATAC
AAAAAGTCAC
TTEAT'FTAGC
CGAAGGTTTC
ATGAAAAACG
CGAkATGTACC
TGGTCAAAGG
TACAAGTCA-A
4TCI=ACCT
~TCGCAAACC
~CAAACAACG
;CCAATTACA
AGCCAAACA
C'CCTGTTGGA
[TTAGGTTAT
rGCGGCGACC kGTAGAACTA
AGGAAAAAAT
rAACGAACGA GTTAACTG-cC
AAAACAAGAT
TTTACCAGAA
ACTCGGGCGG
GATTAATCGG
GTATCCAACA
AACCGATT'Fr
TAGTATGTCC
GTTAACAGAA
TAAGAATCCT
TTTTGATATT
AAGAATAAAT
AAGTACACCT
ACCAA2LTAGT
TGGTGCCTAT
AAAT ITAGAA
TTATTTITAT
TATTAAG LTC
GCCAGATAGT
TGCTGAAGGT
ACGAACACTT
CTCACTAAGC
TGTTrCAAAC
TGATAAACCA
CAATAGTTAC
CGCCAATGGC
ATCGAATAAT
GAGTGCAACA
AAAAAATA2TT
ACC-CACAACC
TGCTTATTCA
GACAGAAAAC
ACAAACAACA
CACTGCGCCA
AAcTGCCACA TWrTGAACTA
TCAATACTCC
GCAGGAATAT
AGACAACCAA
AGATTCAACG
203 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.fJaecalis Genes.
ATTTATGTCG
GGTCAAGACG
GOCGIATC
AAACCCCACC
AAACCAGAAG
AAAATTGATG
AAAAATGGCA
GTTAAAGATA
GCAACAGACA
GATACTAGCA
GCTAATGTGA
GTGGGCGATA
GATGTCCCGT
TATGAAATTG
GACAGTCAGT
GATAACTTlCG
GTTCAGGGAA
ACAAAAGAAG
ACAACGATT
AAAACAGGTC
GCGATTCATG
TTCCCTTCGA
CAATTATTTA
A.ATCTAAGTT
ATAATTTCGT
TTCAGGGAAC
NAAAAGAAGC
CAACGATIA
AAACAGGCCA
AAGCAGGCGT
CTGTCAAACC
AATGGGAACC
TTGAAAAAAT
TCTATAAAAA
TAGAGGTCAA
TTTCCGCGAC
CAGTGAATrGT
CGAAAGCAAT
ATGTCGGCGA
AAGATGTCCC
CAAACCAGAA
AAAAATCACT
CAGTGACGAA
AGAGGTCAAA
TTCAGCGACA
AGTGAATGTT
GAAACCAATC
TGTTGGTGAT
AGACGTTCCG
TTATCCAATC
CGACCAATCT
AGAAGATAAT
TGACGI'CAG
TGGCACAAAA
AGATACAACA
AGACAAAACA
TGATAAAATA
CGTTCATGTC
TTCNTGGANA
ATTC
GAGAACT=G
GTTCAGGTC
GGTAAAGAAG
GATACAACGA
GACAAAACAG
GATAAAPLTAG
GTTCATGTCC
TCGTGCAAAC
TTTGAAAAAA
GTTACAGTT
AAGTTACAGG
TTCCTTTCAG
GGAACAGTGA
GAAGCGAAAG
AATATGG
GGTCAAGACG
GGCGATI'ATG
CGTGAT--ATA
CCAGAAGNGA
TTTCAGCAAC
AAGTTGATAA
AAACAGCCTA
TI'TATGTTGG
GTCAAGACGT
GCGA LPATGA
GTGATGACAG
CACAAGATAA
TCACTGTTC
ACGAAGGTAA
TTAA.AGATAC
CAACAGACAA
ATGTTGATAA
CAATCdTTCA GTGATWATG3
TTCCGTTTGA
AAATTGTCTA
GTCGTTTACA
AC'TTGTTTC
AGATAAAACA
CANCAAAGCA
TGTGACCGTC
TGAITCGTGG
NICCGI'GAA
AA ITGTCTAT
TCAGTTAGAG
TTTCGTTTCA
AGGTCAAGTT
AGAAGAAACA
AACGATTTAT
AACAGGTCAA
AATAGGCGAT
TGTCCGTGAT
GGAAGCAGAA
AAAAATTGAT
TAAAAATGGC
AGTCAAGGAT
AGCNACAGAT
EF104-4 (SEQ ID NO:404) VTTTE-AQTE 'ITDTACKEAE LSNSTPSLPL 0@ 00 S 0S
S.
00 6 00 @0 5 5 @0
S.
@0 S 0@0 0
S.
0@ S 000 0 6 000.00
S
05., 5 @600
S.
0050 0 0050 0000 0000 0 0500 OS 0 00 50
ATTTTSEMNQ
ELLOSMSLAP
TVSGEVTGIG
KTYSFDLYEQ
LEPQVFSYDV
IYLESASDYN
ISFQSKGKYY
TEGGKLTLTA
VNPLNAETAW
KLGTDYIVTP
TPVDTTrVITN QVDS IYPQYW
IDVPTLYITG
NKTHRVKNPT
TIEYNTVSAN
YTHNVEDENQ
VDEEYLTGKA
QDVPFEKITV
PEDNFVSATD
KDTTIYVGDS
NVTVKPDQSK
EIVYKNGTKE
QGTVNVDK(IG
TGQDVPF
PTATTESQT
IGGTEYSQTE
ATWAVDAATK
VEPIQYNERT
DINGQILPET
YLYSQQYPTT
VTIYGTLTET
TKDSYLRINI
GNYDQNGAYS
TSDGSVIKFT
SKRGSERTLQ
DRGQYFOKPM
TAYEPQSNNN
IELTPKGTTN
AGQ IYTETTI
AIAKVSFELI
IKLVKGDNQL
SGQVDNXKAG
KTGQDVPFEK
WKPEDNFVSA
LEVKDTTIYV
AKAIVHVRDD
DYEIVYKNGT
EASTTASSDA
VHRELNTTPV
TLS ITFNQRV RTTGLDGEI F
QTLLTPGXDY
KIGS ISLKST
KVGQQIVLES
SOLTMDFIDKK
SRTTVSVMGS
TPITNEIQIP
SSKNQFLVNA
TPNSPGYPTI
EGSASVSVQN
AQ IDLNSITV
DSETLNQMSA
DNVTHTATEF
KIPLTKTIDH
VYPIIYSDEG
IDVQGTVNVD
TDKTGQDVPF
GDKWEPEDN"F
SQLEVKDTTI
KEAKAIVHVR
ATPSEQQTTE
TATFQFAVGN
SDTSFKVELK
YNLDRTLTGf4
TLSDNSLGRI
TGTKQTDFT
TNGQEIKNPK
DINLStSTPV
KEKPIQNLEI
IGFNYVPDSL
RNDSFDSLSV
TFDENTNSYT
EALDILSATQ
KGVPEDAYSL
SKKKVITAP I
TTDE-KGQYSF
SRLQVKDSTI
KEETAYVTVK
KIGDYEIVYI(
EKITVSGQVD
VSATDKTGQD
YVGDKWEAED
DDSRLQVKDT
DKDTSLNEKA
TGYAPGSVYT
.SYLTTEAEPL
QTLELLTTET
AVTVPNMQQ
AKTSQTSKVI
FTAYGPLYEN
IGPNKAIQLV
KVKHPNYLSL
PKDKS IPVDT RTKI PAGADV
FDFGKTNKRY
AANPTLKNVT
EKTTNGAKVI
TLKFSEGDAE
DAIMTGDYTL
YVGDSWKPEE
PDQSKILEVKD
NGXKEAKAIV
TSKAGVYPIV
VPFEKIDVQG
NEVSATDKTG
TIYVGDSWXP
LPDVQAP lTD
VQLJPEHLGYS
IKIETPGX(NK
PGAVFGKQDN
KAYSLSINRT
ADREM'RSMSY
VKLEDYFDIK
SDQYIEPISV
RATKEIYFYY
IPITMSAEGL
LFDIYDVSND
I IEYXNANGW K'FIVITKNI U
FKDYTLTENI
GIVYLATATF
RVTNVPQEYS
NFVSATDKTG
TrIYVGDSWK
HVRDDSQLEV
YSYEGKEETA
TNNVDKIGDY
QDVPFEKIDV
EXNFVSATDK
EF105-1 (SEQ ID NO:405) TAAATGAAAA AALACAGTCGT CTACTCCTTG TTAIMCGAA CAATGTTGCT TGGCGCCAcT GTTCCTGCC AAGCGGCGAC GGTCG TTII GATAGCGAAC AGTCGATTGT TTTTACCCCA ACCACAGATO GGACGGATCC AGTAAATCCA GAAAATCCCG ATCCAGAAAA ACCAGTTCGA 204 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faccalis Genes.
CCAGTCGATC
GATTATGCCT
TTTGCCAGAG
AATTATGTAC
CAA.AATGGTC
GCCTTTACTG
GCAAACA LTC
ACCGGCGCCG
CAACAAGGAC
AAAACCCCTA
CCAGTAAATA
CAACGAATCC
CAAGTTTGGA
CGCAAACCTA
AAGTAAGTGA
AATTTCGTAA
AGCCCAGTGT
AATTAGATCC
GAACGTGGAT
AGCAAGTAAA
AGGATGCAGT
ATCGAGGGAA
TGATGGACCT
TTTTGGGAGT
TAGAAATCCA
'ITTIACGGGGA
TGCAGAAACA
TCGCTCAAAT
TGCGGGCGCA
CACGCTGTCG
TGCCACAATC
ACAATATAAA
ATAA
AATCCAGGTA
AATGAGATAT
GATGr.TTCAG
ACCAATGCTG
TTACACA.ZAG
GCGACGGACG
GAAACTG=I
GGGCAAGCAG
ACACGGGCA-A
ACAACATTGA
CCCCTGGTCC
CGAATAAGGA
CAAGTGAATT
GCTGGGTTTT
AA ITAACAGG
TATTGCCGCC
TCATGCAAGC
AAAAAGTC.AC
TCTCACTAAC
CTTGGCTACT
ACTI'TCCATC
TCAAACGTAT
GGCAACTGCT
AAAAGTGAAA
CGCCACCOTC
AACTGCTACC
CCCAGAAAAG
CCAA.AAAAAT
TG ITCCTGGG rICAGATGTA EF105-2 (SEQ ID NO:406)
MYKKVVYSLL
VDPTNPDGPN
YVQVSDLRGT
NI QLDAAGAE
TPKDAVQYKT
FGTMLLGATV
PGTPGPLS ID NAG WVLKVKQ
TVVMQAPEKT
TLTWLLSDVP
PAEAATVVFD
YASSLDFGSN
NGQFRNAETL
GAGTWITLWG
VNNGGF
SEQSIVFTPS
El SNKDQTYF
HKELTGATVA
QAEKVTEKNQ
TOGTDPVNPE
ARAZQTYRNPD
FTEPSVRSNA
QGQQVNATIT
NPDPEKPVRP
GSASELATAN
TDVLPPTATA
RAISLTVPCK
o 0 0* 0 000 0 0*0* *000 EF105-3 (SEQ ID NO:407) GCCGAC GGTCGT'ITT GATAGCGAAC AGTCGATTGT TTTTACCCCA
ACCACAGATG
CCAGTCGATC
GATTATGCCT
TTGCCAGAG
AATTATGTAC
CAAAATGGTC
GCCTTTACTG
GCAAACATTC
ACCGGCGCCG
CAACAAGGAC
AAAACCCCTA
GGACGGATCC AGTAAATCCA CAACGAATCC TGATGGACCT CAAGTTTGGA TTTGGGAGT CC-CAAACCTA TAGAAATCCA AAGTAAGTGA TTACGGGGA AA =TCGTAA TGCAGAAACA AGCCCAGTGT TCGCTCAAAT AATI'AGATGC TGCCGGCGCA GAACGTGGAT CACGCTGTGG AGCAAGTAAA TGCCACAATC AGGATGCAGT AC GAAAATC CCG
AATCCAGGTA
AATGAGATAT
GATGGTTCAG
ACCAATGCTrG
TTACACAAAG
GCGACGGACG
GAAACTGTTG
GGGCAAGCAG
ACACG.GGCAA
ATCCAGAAAA
CCCCTGGTCC
CGAATAAGGA
CAAGTGAATI'
GCTGGGTTTT
A.ATTAACAGG
TATTGCCGCC
TCATGCAAGC
A.AAAAGTGAC
TCTCACTAAC
ACCAGTTCGA
ACTTTCCATC
TCAAACGTAT
GGCAACTGCT
AAAAGTGAAA
CGCCACC-GTC
AACTGCTACC
CCCAGAAA-AG
CGAAAAAAAT
TGTTCCTGGG
EP105-4 (SEQ ID NO:408) no.
0000 0 00 ATVVFD SEQSIVFTPS TDGTDPVNPE NPDPEKPVRP VDPTNPDGPN PGTPGPLSID YASSLDFGSN EISNKDQTYF YVQVSDLRGT NAGWVLKVKQ NGQFRNAETL HKELTGATVA NIQLDAAGAE TVVMQAPEKT GAGTWITLWG QAEI<VTEKNQ
TPKDAV
ARAQTYRNPD GSASELATAN FTEPSVRSNA TDVLPPTATA QGQQVNATIT RAI SLTVPGK EF106-1 (SEQ ID NO:409)
TAGTCGTTTA
TTAGTTGGC
AATCCCAATA
GAAGGAFI'AT
TTAACAGATG
TATGAT=T
ACCGCTGGTA
GGTGTATACC
GATAAAAACG
TGAAGAAAAA
GAGCAGGAGG
ATTTAGGGGA
CAGAAAAAGA
AAAATCAAAA
TTGATAAAGA
TTACAGATCT
CAC1'TAGAAC
GAAATGATAT
AATCGTTGGT
GGCTTTrGACG
TTACCMCAG
TT1GGGCTGGA
CCTATTACAT
GACTIGGAGAA
TTCCCGTCTT
TTTATACCAA
TGAAGTTI'C
ACAATTACGT
GCAGAAGCAT
TATTTACGTT
ACACGCGTT
GCAATCAAAT
TCAACAGGAG
GTAA'TATT
GATACTTTI'A
GTAGAAAACA
TGTTCCCTTT
ACGTICCTCA
CAG~TTGGTAT
ATGATCGAAA
TTG-ATGCAAC
ATGAAGGAAC
CTGAAACCAA
CGTATAGACA
A.AGCAACCTC
AAGTGCGTNTA
AAGCGTAGAC
TAGACAAGAT
TGGGAATGAC
CACTAGTTTC
CTTCT1'ATG
AAATTATCAA
GATGGGGAAA
AGGACCAGTT
1 205 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
TATGGTCGTC
CGCCGTGCCA
ATTGGTAAAA
GATAAAGATG
GACCAAGATA
TATGGCTACT
AATCGCTATG
CGAATGGGTA
CAAGGAACTT
CGCAGCCATA
TGACAGAACA
CT'TCCTI'TGA
GCAATGACGT
GCAGCAAATA
TTAAAGTAGG
GCGCTGTCTT
AAGATAACGA
TTAATCCTGA
CCCCAATAAT
AACAGGCCAG
TGGGACACCG
AACTTCGGCC
Ty-AATT'IIC
AAATCAAAAT
AGAGTTAACA
AGGAAACGAT
ATTTAC=TI'
CGTCCCAGAA
ATTGATGTA-A
CAACTTCTTT
AACCAAAACT
AATTTACAAA
AAATirCCGTc
GAAI'GAATG
ATCCAAGTCT
TAA
CACTAGAAT
ATCGACAAGG
GGAATGGCAC
TTATCAGC IT
CTGGTGAAAC
CCCATGTTTC
ATAATCGTTI'
AtGTGGAACA
CACGAATGGA
GGATGAAATT
AAAAGTAGTG
AGCGAAATT
TCGTIGGCGAC
CATTGGAACC
TACGTACACA
TGAACCATAC
EF106-2 (SEQ ID NO:410) MKXIVGT ITLLALSALL VGGAGGALTA EAYVPQSVDN PNNLGDLPEY LRSVGIRQDE GLSEKOWAGT RVYDRNGNDL TDENQNLLHA IKFDATTSFY EFFDKETGES TGDEGTFFMT AGITDVSRLV IISETKNYQG VYPLRTLYQD TFTYRQMGKD KNGNDIEVFV ENKATSGPVY GRPQPYPNNR PRTLEFTNGR RAMTEQTGQI DVNRQGDEI I GKTSFDGTPQ LLWNGTKVVD KDGNDVTSAN QNFISLAXFD QDSSKYEFFN LQTGETRGDY GYFKVGNQNK FRAHVSIGTN RYGAVLELTE LNDNRFTYTR MGKONEGNDI QVYVEHEPYQ GTFNPEFTF EF106-3 (SEQ ID NO:411)
C
C C
C.
C C CC, C
CC
C C
C
C
C
AT ACGTTCCTCA AAGCGTAGAC
AATCCCAATA
GAAGGATTAT
F1'AACAGATG
TATGAATTTT
ACCGCTGCTA
GGTGTATACC
GATAAAAACG
TATGGTCGTC
CGCCGTGCCA
ATTCGTAAAA
GATAAAGATG
GACCAACATA
TATGGCTACT
AATCGCTATG
CGAATGGGTA
CAAGGAACTT
ATTTAGGGGA
CAGAAAAAGA
AAAATCAAAA
TGATAAAGA
TTACAGATGT
CACTTAGAAC
GAAATGATAT
CGCAGCCATA
TGACAGAACA
C EFCTTTGA
GCAATGACGT
GCACCAAATA
TTAAAGTAGG
GCGCTGTCT
AAGATAACGA
TTTACCTGAG
TTGGGCTGGA
CCTATTACAT
GACTGGAGAA
TTCCCGTCTT
TTTATACCAA
TGAAG=TTC
CCCCAATAAT
AACAGGCCAG
TGGGACACCG
AACTTCGGCC
TGAATITTTC
AAATCAAAAT
AGAGTTAACA
AGGAAACGAT
TATTTACGTT
ACACGCGTT
GCAATCAAAT
TCAACAGGAG
GTAATTATTT
GATACTTTTA
GTAGAAAACA
CGTCCCAGAA
ATTGATGTAA
CAACTI'CTrT
AACCAAAACT
AATTTACAAA
AAATTCCGTG
GAATTGAATG
ATCCAAGTCT
CAGTTGGTAT
ATGATCGAAA
TTC'ATGCAAC
ATG-AAGGAAC
CTGAAACCAA
CGTATAGACA
AAGCAACCTC
CACTAGAATT
ATCGACAAGG
GGAATGGCAC
TTATCAGCTT
CTGGTGAAAC
CCCATGTFI'C
ATAATCG'TI
ATGTGGAACA
TAGACAAGAT
TGGGAATGAC
CACTAGTITTC
CTTCTTTATG
AAATTATCAA
CATGGGGAAA
AGGACCAGTT
CACGAATGGA
GGATGAAATT
AAALAGTAGTG
AGCGAAAT
TCGTGGCGAC
CAITTGAACC
TACGTACACA
TGAACCATAC
*CC.
C. C
C
EF106-4 (SEQ ID NO:412) YVPQSVDN PNNLGDLPEY LRSVGIRQDE GLSEXDWACT RVYDRNGNDL TDENQNLLHA AGITDVSRLV IISETKNYQG VIYPLRTLYQD GRPQPYPNNR PRTLEFTNGR RANTEQTGQI KDGNDVTSAN QNFISLAKFD QDSSKYEFFN RYGAVLELTE LNDNRFTYTR MGKDNEGNDI
IKFDATTSFY
TFTYRQMGKD
DVNRQGDEI I
LQTGETRGDY
QVYVEHEPYQ
EFFDKETGES
KNGNDIEVFV
GKTSFDGTPQ
GYFKVGNQNK
GT
TGDEGTFFMT
ENKATSGPVY
LLWNGTKVVD
FRAHVS IGTN EF107-1 (SEQ ID NO:413)
TAAAAAACGG
ATANATANAA
CTTTATAGAG
AATTGGAAAA
CACTCAATAT
AAATGCTAGT
ACTATAGATT
GATGGCTAGT
GTCAAAATT
TATCAGTATC
GAATTTTTAC
TGTTGGGTTA
GAAATTTCAA
GATAATAACA
ATAGAAAGAA
AGTTTTCTT
GCTGTCGTGTT CTTGGTAAA GGATACTGAT TAAC;AAAGGA GGAGCAAGAT GAAGCdAGTA TGTTCATGGA TTCAGTG 206 TABLE 1. Nucleotide and Amino Acid Seqeuences of Efaecalis Genes.
0**t
GGTGTGACTG
CAGTCGAGTG
GCTAGTGAAG
GGTTATTCFT
TTGTTTGATC
GAGAAATTAC
ATTCCAACGT
GGCGTTTCTG
AGTACGTTTT
GGGGGCAAAG
GAAAAAATGC
ACGGCACCAG
ACTAGTTCGC
CGTGTCACGG
ACCCAGGATA
TTGCCTGAAC
GGGAATGCGA
GATGACAATG
CCAGCGAGAG
AAGTTTAAGC
TTTCCCACAA
CAAAAAGTC'I
AGTGTTGAAA
GGGGATAAAA
AACA ITGTT,
CGCTATACTC
AGTATCTATI
TATTATTAC!
ATCACGCCAC
GTCTACACTC
TTCCAAGGCI)
AGTTTTACA(
CCCACCGCG(
GATTACTGG(
AAGCCAACGC
ACGGGTCAA
GTGTCCATCO
GGAACCGCC(
AACTrrGGC,
ACGAGTCCT,
ATTICAGGA
ACTCGCAAC
CTATCGCAA
ACGGCCGCT
GGCAAGACC
CAACAGT'C
GAACTCCCA
TGTTAGCGGA
ACGAAGCGAG
CGAAAACAGT
CTGTGGAAGG
CCGACCCAAT
AACGAACCCC
CACCAGVITI
AAGTGTCGTT
ATACAGGAGA
TGACGAACTA
CCAATAAGCT
GAGGCGTTAC
ACCAAAGACG
AAAAA TrGA
AGAAAACCAT
GTTACACAGG
AACCTAGCAC
ACGATPGCA
AAGCTTTGTT
TAAGTGCGAC
TTGATGGAAT
ACACACGCCC
TTCCTAAGTA
LCCAAGTATCC
GTAGTTTAGI
CCCGCCGTTC
CAGGAACTGC
7TAGAAAATCC
CAACTGGCTI
TCGCAAAAGC
EGGTTAAAGC
CAACTTTTA;
3AACTAACGT 3AGGCGCTAC'.
3 CAACTTGGG( k ACACCAAAG( k~ CGTTGGAMC 3TTACAGGAA.
N GTTTAACTGI S ACGGCGACGI A GTAAGCAAC, C CTTATTTAC
CAAAATCAG
CTGCCAGCT
A GCACCGTGA A CAGGCAGTG .G CCAACCAAG
AACGATTACT
CCAGACGACG
ACCCCACAG
GcGcG;AGATT
TACGATGGCG
CC=ITAGCCC
TAAATATGAT
TACTATTCCT
CACTACGCAA
TGTGGGCGCC
GGGGCGACGG AGCAAGGAGT CAAACAkACCG AAGAGTCACA GAAACGGCAA GAATTOCTTC CCCTTTTTCT TTGTGGAGGA GTCAATCTTT
CCACGTTTTC
ACCACTGTGA ATGGCGGAAA ACAAATAACC AGAATCCAAG
AAGGAGTATC
TATCCAGTGC
AATGCGGAGA
GACGT-'TTGGT C-TAAG GTATGCGCTG ACCTATTTTT
AGGAACAGCG
GAATCCCGCA
TGTAACAGGG
CAGTGATGGG
GTTGGAAACC
TGTGGTCTAT
GGTCCTGTGT
GGCGGGGCGA
GAGGATTTTA
AAGACGTAT1'
ACCAAAACGC
GAAGAAGCAG
CGGCTATGTT GATGAGCAAC CATGGGTG-AA
AGTGACGGAG
CGATATGCCA C
AGACGATGGG
TTACCAGACG1
AGTACCAAAT
GGGAANCNCT
TTACTGGANG
T GGGGGCAAC
GCGGGTCACT
TACACAAGGA.
'FTGCCGAAG.
cAAAACCAAG
'GATGAGGAC
AACAGGTGCC
GAAGA.ACACA
GGCTGGCATC
TTTTCCTGTC
GCCTTTACCA
TCCTGGTCAA
cAAAGATACG cTrTATTAGT
ATATGGTTTG
CCTGAATACT
CACAGACAGC
TACCGATTAC
TITAACCGCC
CGTCTATCAG
TATGGCTGGC
AGCAACATCT
GGCAACGGTC
TCGAGCGATT
AGACGGGACG
GTTTTATGAA
GTTACTGTCT
TAATATI'TAT
;CAAGTCAAT
~CAATCGTAA
kTTTCGAT=I
'AAGTGCGCC
3CGTATAACT rGGGGCCCCA rATAATTTAT
GAACATTTTG
AATCAGCTAG
ATCTACCAAG
CCAGCAACAT
GACATCACCC
GTI'GACATAA
GGCGAAGCGC
GGCGCACCCA
ACCAAAGAAC
GCTGGCGGTC
GTTI'TAACCC
GTCCGTATT;
ACCCCGACX
A.AGAAGGCCC
AGCCAAGCC
TTGCCAACk
AACCAACCAX
GACAATACC(
TTGGACTTC
CAACAATAC
AAAGCCTGGA CATTCGACCA CAACGGTTTT TGCGAACGTT CGGAATTAGA GTTAACCAAT CGTT.TAAATA TACGGTAGCT ATGGAGATGT CCGCGGTCCA ATTATrIATT AACAAAGCGG TTCCTGCGCC
AGAAGGTTAT
CTTITACCCA AGAAGGCACC TATTTAAAGG TTGGTACAAA CTAGTTIATGC GGTGACCTAT TGATGAAAAC CTATACGTTG GAAACTTGAT TAATCCCGCC CCACAGGGGA
AATGACGACT
TAAAGAAATT AGCCATCCCG C'ITATG-GCCC GCAAGAAGTG CACCAACTAC
TGCGTATACA
GTGGCATCGA
AAACCCCGAC
TGACCCAAAA
AAGTGCCACA
CGAAGACACT TTACTCAATG CGACCCCTGA TGGCACCATT TAGACGAAAG TGGCGCAAAA TGGTGGACAG TGAAAACTAT CTGGTGAAALA AACCTATATC TAAAGACGAC AACGACCCCA CTGTGTACCA AGA.AGCGATT TC:GAAAATGG CGCCACAATG CGTTALACCAC CATTAAAATC ACACCATATT
TGTACAAGGA
AATGGACGAC
CGGTGCAGGA
AATTAAAAAT
GAACTTATTA
CTGATGTTGA
AGTAACGGG-C
AAGAC'ITAGA TCAAGAAATT TTGATTTTGG
TAAACTAGCA
3CAGATTACTA CGGCAATGGC A ATTGGAGTTT AACGGCCCAG SCGACCCGCTT
GTTGCTAGGA
SCAGAAACCAG GACACCACTI' G CAACAGCGGT CGTCGCAAAC A CGTTTGCTAA CATCAAACTA C AAGCCGCCGT
CACGTGGAAT
TTAGTGACTG GCCCCTAA EF107-2 (SEQ ID NO:414)
MKRVN
WKRWLVVGLS CSLFM.DSVVG VTVLAETITG ATEQGvATSQ SSDE.ASQTTQ 'FrEESQATVA SEAKTVPPQE TARIASRAIG YSSVEGREIP FFFVEEDGTL FDPDRITMAV NLSTFSFYEE TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
KLQRTPLJEPT
TFYTGDTTQY
APGGVTYALT
QDKKTIVTGE
DNZDDLHVVYE
PTIDGIDMPA
DKTKYPVPNE
IYSGTAGGNY
YTVAI(ALPK I
TAELTLTGAV
GQNTKAFPVT
FGSLTAKDT'J
RNPYLRLNTS
KTSTVTLTAD
VTGP
TVNGGKLLS I
PVPTVFANVG
YFYGDVGGPT
DPI'FTQEGTL
EAVMXTYTLP
SQLKKLAIPQ
VRRGIENPDN
NLSTPDGTIY
YQAGEKTYIF
DIIENGATMD
KEQWTTGAGV
RIKDLDQEIT
QANWSLTAQL
NTATAVVANQ
PTSPAFKYDT
GKVTbNYVGAN
SSHQRRGTAG
PERYTGSDGK
AREALFGYVD
KVYTRPDDGT
IVSSLVGXXA
YYLENRRVTE
QGWFKGKTKP
YWEALLKNTG
S ITLDQPLPA.
SPDGDGFIST
SQPKSATDSL
QFTGSDVYQL
NNQNPSNIYG
AETELELTNE
PVYYYLTKRR
TYLFKGWY'KG
EQGNLINPAK
IVTYGPQEVS
YNLTQKSATR
HFVDESGAKI
ATLKTTIIP S
EAPLTTIKIK
,GGQLKMNLLG
PFDFGKLAI
PTTTRLLLGT
OFTFANIKLE
VSEVSFTIPX
K4PNKLTFGP
VTEKFENPAG
NAKPSTLET'
FKLSATMGES
VEI PKYYQTI
YTARRSYWXW
TPPTrFTQGN
FTPTFNDEDD
pTAT 'AAG IG
TAVTGNPGQV
SGSKQQYGLK
,AAAAS FTDYN
VPANQGMAGQ
EYQSLDIRPS
KKTFKYTVAT
GAl PAP EGYT
KTPSYAVTYD
DGATGEMTTF
S ISPTTAYTG
GPTKTLYSMS
QLVVDSENYV
MTAVYQEAI P
APNTIFVQGT
LTADVEVTGN
KAADYYGNGT
QPTETRTPLG
QYQAAVTWNL
EF107-3 (SEQ ID NO:415) GG AGCAAGGAGT AGcAACATCT
CAGTCGAGTG
GCTAGTGAAG
GGTTATTCTT
TTGTTTGATC
GACAAATI'AC
ATCCAACGT
GGCGTTTCTG
AGTACGTTTT
GGGGGCAAAG
GAAAAAATGC
ACGGCACCAG
ACTAGTTCGC
CGTGTCACGG
ACGCAGGATA
ITGCCTGAAC
GGGAATGCGA
GATGACAATG
CCAGCGAGAG
AAGTTTAAGC
T'TCCCACAA
CAAAAAGTCT
AGTGTTGAAA
GGGGATAAAA
AACATTGTTA
CGCTATACTG
AGTATCTATTI
TATTATTAC7I
ATCACGCCAC
GTCTACACTC
TTCCAAGGCI
AGTTI'TACAC
CCCACCGCGC
GATTACTGGC
AAGCCAACGC
ACGGGTCAAJl
ACGAAGCGAG
CGAAAACAGT
CTGTGGAAGG
CCGACCGAAT
AACGAACCCC
CACCAGCTTT
AAGTGTCGTT
ATACAGGAGA
TGACGAACTA
CCAATAAGCT
GAGGCGTTAC
ACCAA-AGACG
AAAAATTTGA
AGAAAACCAT
GTTACACAGG
AACCTAGCAC
ACGATTTGCA
AAGCTTTG T
TAAGTGCGAC
TTGATGGAAT
ACACACCCCC
TrCCTAAGTA
CCAAGTATCC
GTAGTTTAGI
CCCGCCGTTC
*CAGGAACTGC
*TAGAA.AATCC
CAACTGGCTJ
TCGCAAAAGC
GGTTTIAAAGC
CAACTTTTAJ
AACTAACGT'
AGGCGCTACr
CAACTTGGG(
kACACCAAAG(
CCAGACGACG
ACCGCCACAG
GCGCGAGATT
TACGATGGCG
CCTTGAGCCC
TAAATATGAT
TACTATTCCT
CACTACGCAA
TGTGGGCGCC
GACGTTTGGT
GTATGCGCTG
AGGAACAGCG
GAATCCCCCA
TGTAACAGGG
CAGTGATGGG
GTTGGAAACC
TGTGGTCTAT
CGGCTATGT
CATGGGTGAA
CGATATGCCA
AGACGATGGG
TACCAGACG
AGTACCAAAM
GGGAANCNC I
TTACTGGANC
TGGGGGCAAC
GCGGGTCAC1 7TACACAAGG; TrTGCCG5AAC
CAAAACCAA(
k. TGATGAGGA( r AACAGGTGC( r' GAAGAACAC
GGCTGGCATI
TT'cCMTI CAAACAACCG AAGAGTCACA GGCAACGGTC GAAACGGCAA GAATTGCTIC TCGAGCGATT
CCCTTTCT
GTCAATCTTT
ACCACTGTGA
AC AAATAAC C
AAGGAGTATC
TATCCAGTGC
AATGCGGAGA
CCTAAAAAGA
ACCTATTTTT
GGTCCTGTGT
GGCGGGGCGA
GAGGA1TT1A
AAGACGTATT
ACCAAAACGC
GAAGAAGCAG
GATGAGCAAG
AGTGACGGAG
GCAAGTCAAT
ACAATCGTAA
ATTTCGATTT
GAAGTGCGCC
GCGTATANACT
CCACGT=TC
ATGGCGGAAA(
AGAATCCAAG
AAAGCCTGGA
CAACGGIII
CGGAATTAGA
CG=rAALATA
ATGGAGATGT
ATTATTATT
TTCCTGCGCC
CT=rACCCA
TA'IIAAAGG
CTAGTTATGC
TGATGAAAAC
GAAACTTIGAT
CCACAGGGGA
TAAAGAAA]T
CTTATGGCCC
CACCAACTAC
GTGGCATCGA
TGACCCAAAA
;TTTTATGAA
;TTACTGTCT
.AATATTTAT
:ATTCGACCA
n'CGAACGT
TIAACCAAT
rACGGTAGCT
:GGCGGTCCA
kACAAAGCGG kGAAGGT1'AT kGAAGGCACC rTGGTACAAA
:;GTGACCTAT
CTATACGTTG
TALATCCCGCC
AATGACGACT
AGCCATCCCG
GCAAGAAGTG-
TGcGTATACA
AAACCCCGAC
AAGTGccACA
TTACTCAATG
TGGCACCATT
TGGCGCAAAA
TGAAAACTAT
AACCTATATC
AACGACCCCA
AGAAGCGATT
cG~cCACAATG
CAITAAALATC
TGTACAAGGA
CGGTGCAGGA
TGGGGCCCCA CGAAGACACT TATAA=rAT CGACCCCTGA
GAACATITTG
AATCAGCTAG
ATCTACCAAG
CCAGCAACAT
GACATGACCG
GTTGACATAA
GGCGAAGCGC
GGCGCACCCA
ACCAAAGAAC
TAGACGAAAG
TGGTGGACAG
CTGGTGAAA-A
TAAAGACGAC
CTGTGTACCA
TCGAAAATG
CGTTAACCAC
ACACGATAT]T
AATGGACGAC
208 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
GTGTCCATCA
GGAACCGCCG
AACTITMGCA
ACGAGTCCTG
ATTCAGGAA
ACTCGCAACZ
CTATCGCAAC
ACGGCCGCTG
GGCAAGACCA
CAACAGTTCA
GA.AGTGCCAG
CGTTGGATCA
TTACAGGAAA
GT=AACTGC
ACGGCGACGG
GTAAGCAACA
CTTATTTACG
CAAAATCAGC
CTGCCAGC T
GCACCGTGAC
CAGGCAGTGA
CCAACCAAGG
GCCTTTACCA
TCCTGGTCAA
CAAAGATACG
CTTTATTAGT
ATATGGTTI'G
CCTGAATACT
CACAGACAGC
TACCGATTAC
TTAACCGCC
CGTCTATCAG
TATGGCTGGC
GCTGGCGGTC
GTTAACCG
GTCCGTA ETA
ACCCCGACAT
AAGAAGGCCG
AGCCAAGCCA
TTGCCAACAA
AACCAACCAA
GACAATACCG
TTGGACTTCA
.CAACAATACC
AATTAAAAAT
CTGATGTTGA
AAGACTTAGA
TTGAT'ITT"IGG
CAGATTACTA
ATTGGAGT
CGACCCCCTT
CAGAAACCAG
CAACAGCGGT
ccT'ITGCTAA
AAGCCGCCGT
GAACTTATTA
AGTAACGGGC
TCAAGAAATT
TAAACTAGCA
CGGCAATGGC
AACGGCCCAG
GTT-GCTAGGA
GACACCACTT
GGTCGCAAAC
CATCAAACTA
CACGTGGAAT
TTAGTGACTG GCCCCT EF107-4 (SEQ ID NO:416) a EQG'IATSQ SSDEASQTTQ
TTEESQATVA
SEAKTVPPQE TARIASRAIG YSSVEGREIP KLQRTPLEPT TVNGGKLLSI PTSPAFKYDT TFYTGDTTQY PVP IVFANVG GKVTNYVGAN APGGVTYALT YFYGDVGGPT SSHQRRGTAG QDKKTIVTGE DFTFTQEGTL
PERYTGSDGK
DNODLHVVYE EAVMKTYTLP AREALFGYVD PTIDGIDMPA SQLKKLAIPQ KVYTRPDOGT DKTKYPVPNE VRRGIENPDN
IVSSLVGXXA
IYSGTAGGNY NLSTPDGTIY
YYLENRRVTE
YTVAKALPKI YQAGEKTYIF
QGWFKGKTKP
TAELTLTGAV DIIENGATMD
YWEALLKNTG
C-QNTKAFPVT KEQWTTGAGV SITLDQPLPA FGSLTAXDT'J RIICDLDQEIT
SPDGDGFIST
RNPYLRLNTS QANWSLTAQL SQPKSATDSL KTSTVTLTAD NTATAVVANQ
QFTGSDVYQL
VTGP
FFM~EDGTL
NNQNPSNIYG
AETE.LELTNE
PVYYYLTKRR
TYLFKGWYKG
EQGC1LINPAK
IVTI'GPQEVS
YNLTQKSATR
HFVDESGAKI
ATLKTTTTPS
EAPLTTIKI.K
GGQLKM1NLLG
PTFDFGKLAI
PT±TTRLLLGT
DFTFANIKLE
FDPDRITMAV
VSvEVSFTIPK
KMPNKLTFGP
VTTEKFENPAG
NAKPSTLETT
FXLSATMGES
VEIPKYfYQTI
YTARRSYWXW
TPPTGFTQGN
FTPTFNDEDD
PTATWAAGIG
TAVTGNPGQV
SGSKQQYGLK
AAAASFTDYN
VPANQGMAGQ
NLSTFSFYEE
EYQSLDIRPS
KKTFKYTVAT
GA IPAP EGYT
KTPSYAVTYD
DGATGEMTTF
SI SPTTAYTG
GPTKTLYSMS
QLVVDSENYV
MTAVYQEAIP
APNTI FVQGT
L.TADVEVTGN
KAADYYGNGT
QPTETRTPLG
QYQAAvTWNL EF1O8-1 (SEQ ID NO:417)
TAATCGG=T
CAACGATTAG
ACGGCAGTTG
GAAGAGACGC
CAAGTAACAA
CAAGCAGAAG
AAGGCAACTT
CAATTAATAA
ACAGATGCTT
AACTTAAAGA
GAAGGAACTC
GCGAGTCTCT
CTAGCAGGTA
ACCAGTAAAG
CGTGTAAAGT
ATAGGTTATA
GAAAACTTCG
AAACAAACGG
ACCTACAAAG
GGCGGGAATC
CAACCATTGG
CGGAAACCGT
AAGCAAGTAG
CTGACACGAG
TGGAACAAGC
ATTCATTTTC
ATCCAGATAT
ATAATCGAAA
ATGTAATALAT
AAGAGGTGTA
ACAATACAGA
CTAGACCGCT
GAGACAATGT
TTACTAATT
AGACTGTGAG
TAGATACAAG
TCATTAACAG
TTGGCACGAA
:;TACATAGAA AGAAGGGACG. ACATGAAGCA
AACTAAGTGG
CTTGTGTAGT
GACGATTGAA
CGTGAAGGAA
TCAGGAAGAA
AGAAACACCA
TGCAGAAACT
TATACCAATT
GTGGACTAGT
GCcAGcAcT
CACAAACTTT
AGGTAAAATTI
ATCTrITGACT
TTATACACCA
'FrrGTATCGT
TAGCCCAATC
TGGCGCCAAA
TGATCCTTAC
ATCTTACCGA
rcrTAGTAA
AGTAGTCCGA
cAAACAACGA
GCAACGAAAG
ATCATTCCTA
TATCAGTTTG
ACGTATAGCT
ATGGTTCAAG
TCTGTAGTTA
TCAATTCGCA
GATTCTACTT
CAAAGTAGTG
CGGGAAACAT
ATAAATGACT
TATrACCATC
ATCACGCCAC
ACG~rCCAAC
TTCAAAGGC'I
CCGCCGAAAG
AAGCCAGTAC
A.AGCGGAGAA
AACCAA.AAAA
CATTTGTGAA
ATGCCAAAGG
GGAGTGCTTC
TOCCACCACO
TACCAAAATA
ATCCGTTACC
TAATTAGTGC
TTTTTrGGAGG
TTGATGTGA
TGACCAACCG
CAAGTAATTT
AAAGTGGTTT
GGTACAAAGG
TAGTGCCAAG
GGAAAATAGT
A.GAAGAACCG
AATCAATATG
TCAATCAGGT
ATCATGGAAG
A.ACCGTAGGA
ACCCTCATAT
TTATCCATCA
TGCTAT1'GCA
ATTGGCGCTG
AGATCCTGCA
AGGTAATAAC
CCGTGTCACC
CACCCAAGGG
TTTACCCGAG
GAAA.ACCAAA
209 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
et t 4
ACCGAGCCTT
GATTTGACGG
TTTGGCTTTG
TATAATCTTA
ACAGATACAG
GTCAATACGT
ATCAATATTA
AATATAGATC
CTGTCGGGCC
ACGTTATII'G
CCTGITTACT
GCTAAAATCA
GCCTACACTT
TACAAGTTCA
GCGCCAAGTT
GAAACAGTTA
GCTTTCACAC
TTA.AGAACCG
AGTATTAATA
CAACCAATCA
CAACTAAAA'I
TATAGATATC
GAGGTTGACC
TTTTCTAATC
GTTAACTATC
GTCTACAAAC
ATCACTGCAC
AAGTACACAC
TTCCAAGGG'
ACGTTCAATC
CCAACAGCT)
ATCTGGACA
AAAGGGCCCA
GGAGAAACG,
CCAAATGCCI
AA.ACCAAAC
GTGGATAAC
GGCTTCATC
CAACACAGC
CGGATTAAG
GCGACAGAC
TI'ACCAAIl
AGCTTAACP.
AATGTTTA'I
GGTGTTAAP
TGGCCACCAC
TGGTCTATGA
TGG.ATGAAGC
CITTAAATGA
CGACACTGAA
TTTATGGCGC
ccAAATCAGA
AAGTAGAAAT
AAACGTrGG
GGACAGAGTC
ATTATTAGA
CACCGCCAAC
TCA.AACAAGC
AAGGTTGGTA
ATCAAGTGAC
CGACAGTGTA
CGGCGTTAAC
ATTTATATGP
ATGGTAGTAM
GTGCTACCA
ATGTTGACAC
TGTACACGAI
TAGTTCAGZ
CAAATAATAC
rTTTCGTA.W.
TCACTCGCAj CAACAGGCr CGGCAkAAG C' CGTATAAAGC
CGACCTTIG.
'GTGTCACAT
D, CAACGATCA k. ATTGGTCAG k. CAAAATCAA TTcCTATCG A. CTGTTTTGA T TCGTGAGAA A GTGTGCCAP T TCAAACAAC A AAACGCAAC A GCTTGCCTI 'r ACAATCAA-C ,G CCAACAACJ IC AGTTGGAC' LG GGCAACAA'
TAAAACACCT
GGAGI'TTCA
GACGAACAAA
AAATAATAAA
AAACTTGTCC
GAGTGACATT
TGGCAAAACC
GTCACACATG
CTTTAATGCT
TGGCATCGAT
AAACCGCAAA
AGGTTTCACC
AGGCACCTTA
CAAAGGCAAG
CTACGATGAC
*TCCATCAGTC
*TTTTAGTGG'I
CGTGACCTC.P
GCCATTGTCC
CAGATTACAC
CATTCAATT;
CAATAGCTCI
kATCTC'ITAA(
;AC=TTTA(
r' cT'rrTTTT' k. ACAAGTCAC( r CACCCAAGG' :T'IT1ACCAGC( S GAAAACCAA, P. TGGCA-ATGA T AACTCGACC C GAATACTAG C TGGTCTGAC .T CCCAGTAAA .0 CAAAAAAGT A AGCAGAAGIl LT TCGTCCAAA.
,C CTTCGACTI ;C CGCGGA T; .C CAATTGGAC C AGCGACCCC :C AACCGAGT' kC AGCAACGA( Mr CACCTTICAj I'A CAAGGCCG(
AGCTATAAAG
GGGTACGAGC
TTAATTGCCC
AAAACAGTAA
GTGCC1XGTCA
ACGTTTACAT
GACCCAGCTT
CCTGTGACCA
TTAGCCGATC
GACCCAGTCA
GT'CACCGAGA
CAG GTAAAA
CCAGACACTT
TCCATACTCA
AATGATGATT
GATATGAACT
AAGTACTATG
AAAAATAATC-
CAAGAIrA'I
TTTAATGTT
GACACAGCTC
CTGGTTITCC
TTGC IAAT J
ACGCATTTA(
P' AACGCCAAA(
:GAAAACTTC(
r AAccAAGTA(
ACGTATACT,
SCCAAGTACG'
C GATATGACCI A AAAGAAGTG.
c AAAGCACCC G ATCCCGACC TCACGTATGA T TGCCTGCTTC
G
CCGACCAAGT
C
TGAGCAGTAA C ACTATTTTGA
A
TGCCCAAACG
C
TTCCTCTTCC
'I
CTTATAACA-A
C
AACCTGAATT
IJ
A'ITATI'ATAC
IZ
ACTTCGTAGA
AAACGGTGAT
ACACAACAGG
ACACATTGAC
TGAATGTGGT
*TTGTGAATGA
CGCAA.AGTAC
GTAATGGGCA
*TGAAAAAATA
ATAAATTAC
*AAAGTAGCAA
ACCCAAATGT
7 TTGATTCAGA
GATATAGTGG
-CTGCGGATAA
TGGATCTCAA
CAATGAACAG
k CAGGTGGCA.A r TGAACAAAAC
GCCATGTATAA
A TI'ATACGAA T TACAAAATCT T TTATGGA.AGT
'ACAATCAT
~CCAATCAA
AGATGAAG
rTAACGGG
CAATATCGC
TACAAATCA
~AAATCTAT
TTGAAACAG
rATACGAAA
ATGAGTGGC
PCCAACGGC
ACAAGCGAC
GGTAAGACC
LACTACCAAA
TATGAAGA-A
LAAAGGCGGG
,AGTGCGTAC
~TATACGGTA
~AATAATGGA
:ATCGACCAA
rcTGAAATcc kGCACCAGCA rCGGCACCTAT
CACACCAGGA
GTCAAAACTT
CGTCCCAAA
TAACACCTTC
AGTCTATACG
P.ACAACTCCA
GGAAGAAATA
TACCAATGTA
CACCTTGAA.A
GACACCAGAA
GGTTCCTITA
CGCAACAGGG
AGATACTACA
AACGACCGAA
AACTAAGCAA
TCCGTATCTG
ACcAAAATCA
TGT-CTCTAGC
CAGTGCCATT
CACAGGTAGT
AGCCA.ATCAA
ACGTCCTTAA
AGTACACTT GGACAGAGGG
GTAGTATTTG
GATCAAGAAG
GGCCAAGTGG
TACGGTAACG
TTAACAGCGC
TTATTA'rTAG
AAAAATACGG
ATTATTGCCA
AATGTCAAAC
GTTACATGGA
GT'GGTATI'AA
TAGTCACACC
GCGTTGCAGG
GCACACGGAA
AACTGTCACA
GGGCGGCGCC
TCCGTACCAC
ACAAGCAATI'
TTGAAGTGCC
ACCTAGTTAC
EF108-2 (SEQ ID NO:418) MKQTKWQ RLATIGLCSS LVINAFSGVT AVAETVTIES
SPTAESSAKE
ETQASSVKEE TrKASTENSQ VITDSQEE.A TKEAEKEEPQ
AEVEQAETPI
ATYSFSAETY QFGFVNESGQ LINPDI IP IT YSYAI(CSWKT
DGYNRKWTSM
LKNVIMPATS VVMPPGPSYE GTQEVYTNFS IRIPKYYASA
SLYNREGKID
ACTRPLSLTQ SSVISALALT SKGDNVYTPR ETFFGGDPAG
VKFTNFLYRI
GYKTVSSPIY YHLTNRRVTE NFVDTSCAKI TPPSNFTQCI(
QTVINSDPYT
IPKPKKINMK(
VQGSASTVGN
STYPLPAIAL
NDFDVKGNNI
FQQSGFLPET
210 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
YKVGTKSYRF
GFVDEATNKL
NTFYGASDIT
SGQTFGFNAL
KITPPTGFTQ
PSYQVTYDDN
RTD)LYDVTSK
LKYVDS IQLD
SNANNRLFYT
TAPTGFTQGN
FNATFDGNDD
SP NWSAGLT I
PNTVI.AAEVV
HSICQA.ADYY
TNYNQPTELK
VKGQQYKAAV
KGWYKGKTKT
IAPOQVQMXY
FTLPKRY(S I
ADQPEFYTKT
GI{KTVITSDA
DDLNVVYEEE
NNGNGQYTVS
TAQSSNLKSY
HLGYSGTPGV
QVPMNSNTFK
MTAMYKEEIP
PTFMLVTPEG
VrGG IKDSTV
GNGTRNPYLR
NTVGTTSAIS
TWNLVTGP
EPLATTKTPS
NLTLNENNKK
NIITKSDGKTD
LFGTESGIDD
YTFKQAGTLP
TVTTVYPSVD
INNGSMPLSQ
RYVYTNNSSL
NYLLVMFLFN
YTAAKALPAT
TAS VTLTRPK E'FKS IPVNS
DNFVRIRPND
IKKTQPNWSL
LTANNTATS I YKVTIYCDNDD LTvvYEEFSG YELPASTNQF TVMSSNLTGT.DTATLK<NLSV
PVNYFEQYRV
PAFPLPXIYN
PVNYYTMSGP
DTYTTGGKTY
MN4FVNEKGGA
ELLKKYNNGQ
VFDPNVAPAE
AKPADKSKLV
YTTGGKVYTF
EVIDTNTNVI
TLW tEVPLP
QEVVTPTTEG
TAQLSQPKSA
IANKQFT S N I DQVM-4SHMP
VYYYLENRKV
KFKGWYKGKS
FTPALTFSGK
PISATNRLQF
VDLSSESLNL
YKVTRXKQVTE
QGWYKGKTKP
W=rITNTSI( NAVP IGKQCVS
FISVPTFDFG
TIDSLP.TAThL
VYQLDFT-.M
TEN~FVDTNGA
ILNTL LTTKA
YYAQSTSAYL
NVOKLAIDQQ
LNFOSOGTYF
NFVDVNGAK I
STLNKTTTPT
APLQNLTLKK
VAFTTRATGK
QVGVAGTKQQ
LLGAAPVSSF
VKLEVPANQG
EF108-3 (SEQ ID NO:419)
V.
V
CGT GACGATTGAA AGTAGTCCGA CCGCCGAA TAGTGCC
GAAGAGACGC
CAAGTA-ACAA
CAAGCAGAAG
AAGGCAACTT
CAATTAhATAA
ACAGATGGTT
AACTTAAAGA
GAAGGAACTC
GCGAGTCTCT
CTAGCAGGTA
ACCAGTAAAG
GGTGTAAAGT
ATAGGTTATA
GAAAACTI'CG
AAACAAACGG
ACCTACAAAG
ACCGAGCCTT
GATTTGACG
TT1'GGCTTTG
TATAATCTA
ACAGATACAG
GTCAATACGT
ATCAATATTA
AATATAGATC
CTGTCGGGCC
ACGTTATTTG
CCTGTTrACI
GCTAAAATC.A
GCCTACACTI
TACAAGTTCP
GCGCCAAG79
GAAACAGTT;
GCTTTCACAC
AAGCAAGTAG
CTGACACGAG
TGGAACAAGC
ATTCATT'-TC
ATCCAGATAT
ATAATCGAAA
ATGTAATAAT
AAGAGGTGTA
ACAATAGAGA
CTAGACCGCT
GAGACAATGT
TACTAATT
AGACTGTGAG
TAGATACAAG
TCATTAACAG
TTGGCACGAA
TGGCCACCAC
TGGTCTATGA
TGGATGAAGC
CTI'TAAATGA
CGACACTGAA
T[TATGGCGC
CCA.AATCAGAP
AAGTAGAA.Al
AAACGTTTGC
GGACAGAGTC
ATTATTTAGI
CACCGCCAAC
TCAAACAAGC
AAGGTTGGT2
ATCAAGTGA(
CGACAGTGTj
CGGCC'ITAA(
CGTGAAGGAA
TCAGGAAGAA
AGAAACACCA
TGCAGAAACT
TATACCAAT'
GTGGACTAGT
GCCAGCAACT
CACAAACTTT
AGGTAAAATT
ATCTTTGACT
'ITATACACCA
TTTGTATCGT
TAGCCCAATC
TrGGCGCCAAA
TGATCCTTAC
ATCTTACCGA
TAAAACACCI
GGAGTI'TTCP
GACGAACAA.P
*AAATAATAAJ
*AAACTMTCC
GAGTGACAT]
TGGCAAAAC(
GTCACACATC
cITTAATC
TGGCATCGA'
AAACCGCA-k
AGGT'ITCAC(
AGGCACCTT,
k. CAAAGGCAAI
-CTACGATGA,
k. TCCATCAGT'
:TTTTAGTGG
GAAACAACGA
GCAACGAAAG
ATCA'ITCCTA
TATCAGTTTG
ACGTATAGCT
ATGGITCAAG
TCTGTAGTTA
TCAATTCGCA
GATTCTACTT
CA AAGTAGTG
CGGGAAACAT
ATAAATGACT
TATTACCATC
ATCACGCCAC
ACGTTCCAAC
*TTCAAAGGCT
*AGCTATAAAG
GGGTACGAGC
L TAArrrGCCC
LAAAACAGTAA
GTGCCTGTCP
ACGTTTACA'I
GACCCAGCT']
CCTGTGACC;
r TTAGCCGrATC r' GACCCAGTCI
GTCACCGAGI
:CAAGGTAAPJ
k~ CCAGACACT'J 3TCCATACTCj
AATGATGAI
C GATATGAAC' T AAGTACTAT A AAAAATAATI
:AAG
AAGCCAGTAC G AAGCGGAGAA A
AACCAAAAAAA
GATTTGTGAA
ATGCCAAAGG A~
GGAGTGCTTC
TGCCACCAGG TACCAAAATA 9) ATCCGTTACC 9
TA-ATTAGTGC
TTTTGGAGG
TI'ATGTGAA
TGACCAACCG
CAAGTAATTT
AAAGTGGTTI!
GGTACAAAGG
TCACGTATGA
TGCCTGCTTC
CCGACCAAGT
TGAGCAGTAA
LACTATTTTGA
TGCCCAAACG
TTCCTCTTCC
CTTATAACAA
AACCTGAATT
k ATTATI'ATAC k. ACITCGTAGA k AAACGGTGAT r' ACACAACAG
ACACAI'GAC
r TGA.ATGTGGT r TTGTGAATGA 3 CGCAAAGTAC 3 GTAATGGGCA
,GAAAATAGT
.GA.AGAACCG
,ATCAATATG
,GAATCAGGT
LTCATGGAAG
ACCGTAGGA
~CCGTCATAT
'TATGCATCA
EGCTATTGCA
=TIGGCGCTG
k.GATCCTGCA
%.GGTAATAAC
CCGTGTCACC
CACCCAAGGG
rTTACCCGAG
GAAAACCAAA
TGACAATGAT
GACCAATCAA
GCAGATGA-AG
CTTAACGGGG
ACAATATCGC
GTACAAATCA
TAAAATCTAT
GT1'GAAACAG
TTATACGAAA
AATGAGTGGC
CACCAACGGC
TACAAGCGAC
CGGTAAGACC
AACTACCAAA
GTATGAAGAA
AAAAGGCGGG
GAGTGCGTAC
ATATACGGTA
'AAGAACCG ATI'TATATGA CGTGACCTC TABLE 1. Nucleotidz and Amino Acid Seqeuences of E. faecalis Genes.
AGTATTAATA
CAACCAATCA
CAACTAAAAT
TATAGATATG
GAGGTTGACC
TTTTCTAATG
GTITAACTATC
GTCTACAAAG
ATCACTGCAC
AAGTACACAG
TTCCAAGGGT
ACGTTCAATG
CCAACAGCTA
ATCTGGACAA
AAAGGGCCCA
GGAGAAACGA
CCAAATGCCG
AAACCA.AACA
GTGGATAACT
GGCTTCATCA
CAACACAGCT
CGGATTAAGA
***GCGACAGACA
'rTTACCAATT
:AGCTI'AACAG
AATGTI'TATC
GGTGTTAAAG.
kTGGTAGTAT ,TGCtACCAA ATG TMACAG
TGTACACGAA
rTAGTTCAGA cAAATAATAG
TTCTCGTAAT
TCACTCGCAA
CAACAGGCTT
CGGCAAAAC
GGTATAAAGG
CGACCTTTGA
GTGTCACATT
CAACGATCAC
ATTGGTCAGC
cAAAATCAAT
TTCCTATCGG
CTGrT=GAA
TCGTGAGAAT
GTGTGCCAAC
TGAAACAAGC
AAACGCAACC
GCTTGCCTAC
ACAATCAACC
CCAACAACAC
AGTTGGACTI
GGCAACAAT;
GCCATTGTCC CA.AGAATTAT CAGATTACAG
TTTAATGTTG
CATTCAATTA GACACAGCTC CAATAGCTCA CTGG=FTCG ATCTCTTAAC TTGCTTAATT ACTTTTTTAC ACGCKPI'TAG GTTCTTTTI' AACGCCAAAC ACAAGTCACC GAAAACTTCG CACCCAAGGT AACCAAGTAC TTACCAGCG ACGTATACTA
GAAAACCAAG-CCAAGTACGT
TGGCAATGAC GAtATGACCG AACTC--ACCA AAAGAACTGA GAATACTAGC AAAGCACCCT TGGTCTGACG ATCCCGACCT CCCAGTAAAT AGTACACTTT CAAAAAAGTT TCAGTTCT AGCAGAAGTT
GTAGTATTTG
TCOTCCAAAT GATCAAGAAG C'rTCGACTTC GGCCAAGTG CGCGGATTAC
TACGGTAACG
CAATTGGAGC TTAACAGCGC ACGACCCGC TTATTATTAC AACCGAGTTG
AAAAATACGC
AGCAACGAGT ATTATTGCCI CACCTTCAAT AATGTCAAAc CAAGGCCGCA GTTACATGGI TGAAA AAATA TAATAP.TGGA ATAAATTAGC C AAAGTAGCAA I1 ACCCAAATGT TTGATTCAGA I
GATATAGTGG
CTGC--GATAA
TGGATGTCAA
CAATGAACAG
CAGGTGGCAA
TGAACAAAAC
CCATGTATAA
TTGA1IACGAA
TACAAAATCT
TI'ATGGAAGT
GGACAGAGGG
TCACAACTCG
GTGGTATTA-A
TAGTCACACC
GCGTTGC-AGG
GCACACGGAA
AACTGTCACA
GGGCGGCGCC
TCGGTACCAC
ACAAGCAATT
TTGPLAGTGCC
SACCTAGTTAC
:ATCGACCAA
~CTGAAATCC
LGCACCAGCA
LGGCACCTAT
ACACCAGGA
'TCAAAACTT
:GGTGCCAAA
rA.ACACCTTC
AGTCTATACG
AACAACTCCA
GGAAGAAATA
TAC.CAATGTA
CACCTTGAAA
GACACCAGAA
GGTTCCTTTA
CGCiLACAGGG
AGATAGTACA
AACGACCGAA
AACTAAGCAA
TCCGTATCTG
ACCAAAATCA
TGTCTCTAGC
GAGTGCCATT
CACAGGTAGT
AGccAATCAA
AG
EF1OS-4 (SEQ 1D NO1:420) VTIES SPTAESSAKE a
ETQASSVKEE
ATYSFSAETY
LKNVIMPATS
AGTRPLSLTQ
GYKTVSSP IY
YKVGTKSYRF
GFVDJEATNKL
NTFYGASDIT
SGQTFGFNAL
K ITPPTGFTQ
PSYQVTYDDN
RTDLYDVTSK
LKYVDS IQLD
SNANNRLFYT
TAPTGFTQGN
FNATFDGNDD
GPNWSAGLTI
PN'IVLKAEVV
HSLKQAADYY
TNYNQPTELK
TTKASTENSQ
QFGFVNESGQ
VVMPPGPSYE
SSVISALALT
YHLTNRRVTE
KGWYKGKTKT
IAPDQVQMXY
FTLPKRYKS I
ADQPEFYTKT
GK'KTVITSDA
DDLNVVY'EEE
NNGNGQYTVS
TAQSSNLKSY
HLGYSGTPGV
QVPMNSNTFK
MTAMYKEE IP
PTFMEVTPEO
V'FGG IKDSTV
GNGTRNPYLR
NTVCTTSAIS
V'ITDTSQEEA
LINPDI IPIT
GTQEVYTNFS
SKGDNvYTPR
N~FVDTSGA.KI
EPLATTKTPS
NLTLNENNKK
NITKSDGKTD
LFGTESGIDD
YTFKQAGTLP
TVTTVYPSVD
INNGSMPLSQ
RYVYTNINSSL
NYLLvMFLFN
YTAAYKALPAT
TASVTLTRPK
ETTKSIPVNS
DNFVRIRPND
IKKTQPNWSL
LTANNTATSI
TKEAEKEEPQ
YSYAXGSWKT
IRIPKYYASA
ETFFGGDPAG
TPPSUFTQGK
YKVTYDDNDD
TV14SSNLTGT
PAFPLPKIYN
PVNYYTMSGP
DTY'ITGGKTY
MNFVNEKGGA
ELLKKYNNGQ
VFDP.NVAPAE
AKPADKSKLV
YTTGGKVYTF
EVIDTNTNVI
TLWTEGVPLP
QEWVTPTTEG
TAQLSQPKSA
IANKQFTGSN
AEVEQAETPI
DGYt4RKWTSM
SLYNREGKID
VKFTNFLYRI
Q IVINSDPYT
LTVVYEEFSG
DTATLI<NLSV
IDQVEMSHMP
VYYYLENRKV
KFKGWoYKGKS
FTPALTFSGK
PISATNRLQF
VDLSSESLNL
YKVTRKQVTE
QGWYKGKTKP
WTTTITNTSK
NAVP IGKKVS
FISVPTFDFC
TDSLPTATRL
VYQLDFTFNN
IPKPKKINmK
VQGSASTVGN
STYPLPAIAL
NDFDXTKGNNI
FQQSGFLPET
YELPASTNQF
PVNYFEQYRV
VTTYNKLKQL
TENFVDTNGA
ILNTLTI'rKA
YYAQSTSAYL
NVDKLAID:QQ
LNFDSDGTYF
NFVDVNGAKI
STLNI(TTTPT
APLQNLTLKK
VAFTTP-ATGK
QVGVAGTKQQ
LLGAAPVSSF
VKLEVPANQG
VKGQQYKAAV TWWLVT EF109-1 (SEQ ID NO:421) 212 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
AGGAGTAAAT
GGAITTCTCG
GAAACGGTGG
GAAATGAACC
ACCTCCGATT
NATAGTAGCG
TAG
TAATGAAAAA
TTACCCCTAT
CTTCAGAAAC
CAAGCATCAT
CCACCACTGA
ACGTACTGAA
AAGTGTTATA
TTCTGCTTAC
ATCTCTA-ACG
kAATTCTCAA
AGTTTCTACA
ACTACTITTGG
ACTAGTTCTA
GCTITGGAAC
GAGCGACAAA
GAGGAAACAG
TCAGAAGTAA
NAACATCACN
TGT -AGCGGT
GCTCTAAGGG
TGAGTAGCGG
AAACAACGTC
CAACTGTI'AA
AAGTAATGAG
TTTGTGTCG
AACTACTGAA
TGTCACTGAA
CACTTCCTCA
TGATACAGAA
GACACACCTA
EF1O9-2 (SEQ ID NO:422) MKUKSVI TSSMLAVLLS GFLVTPISAY ALERSKGTTE ETVASETSLT ERQMSSGVTE EMNPS IINSQ EETETTSTSS TSDSTTEVST SEVTTVNDTE XSSDVLKLLW XHHXVMRTHL EF109-3 (SEQ ID NO:423) GGAAC GCTCTAAGGG AACTACTGAA GAAACGGTGG CTTCAGAAAC ATCTCTAACG GAGCGACAAA TGAGTAGCGG TGTCACTGAA GAAATGAACC CAAGCATCAT AAATTCTCAA GAGGAAACAG AA.ACAACGTC CACTTCCTCA ACCTCCGATT CCACCACTGA AGTTTCTACA TCAG EF1O9--4 (SEQ ID NO:424) ERSKGTTE ETVASETSLT ERQMSSGVTE EMNPSIINSQ EETETSTSS TSDSTTEVST S EF110-1 (SEQ ID N1O:425) S S
S
S
TAAATAAAA
AGTGCTGGTT
AATGCAGAAG
GACCCTGAGG
GGAAGAATCA
GTTGGAACCA
GTATTAAATC
ACACCAT'ITG
GTAGTGACTG
ACGCCAT'ITG
CCAGGT'GAGA
ACAGACTTAG
TCACCAATCT
CAAACAGGAA
GTGAATGAAG
TrGGATAAGGA Tr ICTTTCT
AGTATATCGT
ACAGAAGACA
TTTCCCCTGC
ATACAATIGT
CGAATGCCAA
GCAAAITCAA
TCGGCAAACA
TTTTGAAAAA
AA-AACCACAC
AAAATCCT
ATAATGATCA
ATCATGGTCA
AAGAAAATAA
GTGGCATAAT
GATTTTAGTA
TCCTGCCGAA
AGAAGTGGCA
CAGTAAACCA
CACCAATAAT
AGATGATGCT
AGTGATTGAT
AAACGATCGT
GTrTGAATCT
ACAATGGTCT
ACTATTAT
GGTI'GAAGTA
AAGACTAAAT
ACGTTTATCC
CTTATGAAAA
ACTI'TGATCG
AdTCATTCAC
GATACAACCG
GGCTATATTT
CATGTGGCTG
TGGTTTTATC
GTAGCTTTTT
CCAGATGGCC
TCAGATACCC
CATGAAAATG
GATATCGATA
GTTGGTGTTC
GAAGTGAATT
GCTGTGCCAG
AG'I'1CTZCAT
CCGGTITTAG
GACAAAAA.AG
AAGCGCCTTT
CT'ITAGGAAC
AAAGTTTTAA
CAGGTCGAGA
CCCCGAATGC
CAGAGTTGGG
ATGTCACAAT
ATFTTTTTAC
CAACCGGCGG
ATTCCAATGG
ATAACTTTAT
CAGCGTAA
ACGAAAAATT
CITGTCTGCA
ATCGTTACTG
TGCGTCAATC
AGGC ITTGTT
GAATGCCAAA
TGGCAGTGCG
GGATATTGCG
AGAAATTTTA
ATCAGGCTAT
ATCTAACTTT
TCAATC~iGT
CGGCATTAAG
TGTTAATCGA
EFI1O-2 (SEQ ID NO:426)
MKKFSIRKIS
PEDRRQEVAD
LNPNAKDDAW
PFVLKKFESS
P IYNDQVEVV AGFLFL ILVT
TTEAPFASIG
FYPGRDGSAT
DTHVTISGYP
GVIISNGGIKQ
L IAGFSLSAN
RIISPASKPG
PFGKFKVIDV
GEKNHTQWSH
TGNHGQRLNE
AEEYIVPAES
YISLGTGFVV
AFSPNADIAV
ENDLFTSNFT
vNYNFIVNRV
HSRQKRSLLD
GTNTIVTNNH
VTVGKQNDRP
DLENPLLFYD
NEEENKRLSA
VAESFKNAKI
DGPELGEILjT
IDTTGGQSGS
VPAA
EF110-3 (SEQ ID NO:427) AG AGTATATCGT TCCTGCCGAA AGTCATI'CAC GACAAAAAAG ATCGTTACTG 213 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes.
GACCCTGAGG
GGAAGAATCA
GTTGGAACCA
GTATTAAATC
ACACCATTTG
GTAGTGACTG
ACGCCATTTG
CCAGGTGAGA
ACAGACTTAG
TCACCAATCT
CAA.ACAGGAA
GTGAATGAAG
ACAGAAGACA
TTTC CCC TG C
ATACAATTGT
CGAATGCCAA
GCAAATTCAA
TCGGCAAACA
TTTTGAAAAA
AAAACCACAC
AAAATCCATI'
ATAATGATCA
ATCATGGTCA
AAGAAAATAA
AGAAGTGGCA
CAGTAAACCA
CACCAATAAT
AGATGATGCT
AGTGATTGAT
AAACGATCGT
GTTTGAATCT
ACAATGGTCT
ACTA'ITI'AT
GGTTGAAGTA
AAGACTAAAT
ACGTTTATCC
GATACAACCG
GGCTATATIT
CATCTGGCTG
TGGTTTTATC
GTAGCTTTTT
CCAGATGGCC
TCAGATACCC
cATGAAAATG
GATATCGATA
GTTGGTGT'rC GAAGTGAA 1T GC(TtCCAG
AAGCSCCTTT
CTTTrAGGAAC
AAAGTTTTAA
CAGGTCGAGA
CCCCGAATGC
CAGAGTTG-GG
ATGTCACAAT
ATTTGTTTAC
CAACCGGCGG
ATTCCAATGG
ATAACTTTAT
CAGCGT
TGCGTCAATC
AGGCTTTGTT
GAATGCCAA-A
TGGCAGTGCG
GGATATTGCG
AGAAATTTTA
ATCAGGCTAT
ATCTAACTT'T
TCAATCTGGT
CGGCATTAAG
TGTTAATCGA
EF110-4 (SEQ ID NO:428) EY IVPAES HSRQKRSLLD PEDRRQEVAD TTEAPFAS IG LNPNAKDDAW FYPGRDGSAT PFVLKKFESS DTHVTISGYP PIYNDQVEVV GVHSNGGIKQ RI ISPASKPG
PFGKFKVIDV
GEKNHTQWSH
TGNHGQRL.NE
YISLGTGFVV
AFSPNADIAV
ENDLFTSNFT
VNYNFIV']FV
GTNTIVTNNH
VTVGKQNDRP
DLENPLLFYD
NEEEN'KRLSA
VAESFKNAKV
DGPELGEILT
IDTTGGQSGS
VPAA
EF111-1 (SEQ ID NO:429) C.
C.
TGATCAATAC
AAAAAGGGAT
CTATCATTTT
TGGTTAGCGT
AGTGTTGAAA
ACCGAACGGA
GCATGGCATG
GGGCCATTGA
ACGGTCAAAA
CTGTCTTATT
'PTAGCTI'AA
GGTGAAGAC
GAGGGAACGG
GTGAAT=ITG
ACGATITCATC
GCTGAACCAA
ACTT'ITACAA
GCGGCGCGCT
GATCAAAAGA
ACGA TAATA
TCCATGTCTT
GCAACCCCTG
CAAATTCAAA
TTITACAATC
AAACCACCAI
TTITrAAAAC
GACCACAAT;
AAAGACGAC;
CTGTTATTCl
AAGGTGTGGC
TAG
ACTTCGATAC
ACACTCCATT
ATAGGAGGGG
TAGGAATCAC
GAACAAAAGT
CGAATGGCGT
GCTACTATTT
TTATTGCGGA.
ATAAAAAAAC
ATCCTGGGCG
TTTTTGTCAG
CCTTGTCACT
AAACCTTAGA
GTGAAGTCAG
ATGCGGATAA
TTGAATAA
AAGAAGAAGA
ATTTCAAAGA
AAACAGCAGA
CCAATTGGCG
ATAAATGGT'I
ATT ITAATCC
AAGATGATAC:
AAGACAGTGC
TCCCTGPATG
AAATGTATCC
AAAATGGGAI
AGGATCAAAJ
kAGCAGCCGCC
'CAAAGGCGA(
GGTCGCTTTT TTTCTAGAGA TGGCATAGTC CTrGCTGATA TTTTATGAAG GGTTTATCAA CGTAGTCAGC
TGTTIGCCT
TGATGAXFI'T GCAAATGTTT ATACGATACC AAI'ATTTTA2 ACCTCAAAAA AGCAATAAAG2 AGAATATCCA
GTAAACTTGG
GGGAGAA-ACC TATGATTTAA CCTAGAGCAL ACCTATGAAT CAATCGAACG GCGC ETATCC TGCAGCAAGC TGGACAGGTG TATTGGCACT CGTTI'AACTG AGAAACGTGG AATTATITG AGT'I CAACA AAAATTGATA GCCTAAACAA ACGTACAACA GGCAAAGGAA CAACAACAAG GAACAAGCAA AGAVGGCAAG ATTCCTCAA TATCAAAATG AAGTGCGGCA GGTGCCTTIA *TATTGGTATG TGGGCTTGGG
*TGAGTTAGCT.AAAAATAATA
CGTACGTCCA CAAGATGCAG GCGTGGTGGT GAAGGTGGCA GGCAGTT1'GG CATAITTTATC CAA.ACTTGTG GC LTATCATA AGCAGAATAT GGAAGCATGG CATTAAAGAT AAAAATGGCC TGGGAAAGTG GCATGGATAA GTGGAGTTA AAGTTTITA ~.GTTGAATC T ~TAAATCAGT G
LAAAGAAACGG
rAAGCAGGGA ['AGATGTGAG %TAATT'rITc T :LGCTACTGGG TI :GGCAAGTTT GCCAAAGCAA
C
rAGACGAI'T
AAACGACACT
CGGTCTIGAC
CTAAAGACAA
CTACGAAAGA
ATCGGAATTA
CCTATACGAC
CACCCGAATA
GATATCTAGA
CCCTACTCAA
AGCATGACGG
ATTCGTGGAA
TGCCGGCCTT
GAGCGATCAT
ACTGGAATGA
AAGAAACCAA
ATTGGTGGTA
TCAGTGATGC
AccTAAAGTG
CCCTACACGC
AAACAAAAAT
TTCAATAAT
TATAAAGCG
GTGTCTACT
GTGCAAGCA
IGCATCACCA
GATTTAGGT
GGTTTTGCG
LACAAATTA
:CCCATGC-AC
LACGAITCAT
rGAAAACACT
:AAAATTCAA
L'GACATTCA
CACAAAATAT
E'ACAGCAACC
AGAAAGCTAT
rACCAAAAAT
TAAAACGTTT
ATCGATTGAA
GATTGTTCCG
AGCGGATGTA
GTTTGATI'AT
TCATGCTGTC
ACGAALATTCT
AGATAAGGAA
TACCAACAGA
TCACTGGCAA
GATGATGATG
TTTGACAAAG
AAAGGAAAAG
214 TABLE 1. Nucleotide and Amino Acid Se.qeuences of E.f[aecalis Genes.
EF111-2 (SEQ ID NO:430) MK(C LSKKKRVSTW
LALGI'IVVSC
WEGYYLPEKS
SYYPGRLEQT
GTETLDIGTR
EP IELKXIQT
QKXTAEFPEY
TADFNPELAK
PP LAAWAVWH DDKDQ IIKDI(
FALSREVQAS
NKELLGGFAG
YELDOLTIEL
LTAKONDIQV
VNTY-TTF-SYT
QNALVKS lET
NNMRALFD)YQ
IYQETKDKEF
NGHLKWMMML
VERTKVDEFA
PLIIAEEYPV
ALIFVSNRTA
NFGEVRETWN
FTKEEEAKEQ
INTNWRSA-AG
IQKDDTVRPQ
LKEM~YPKLVA
LLKQPRGKVA
NVLDVSAS PT
NLAASLNKLT
LIQTTLENTG
YFATKDTKYT
QQAPEYTKNA
AFKHDGIVPS
DAGAI IDAVF YHI4IWYTNRD WI TLHGLTKK 7RTNGVYDTN
VKNKKTGETY
EEPLSLGASW
IHHADKVSTK
ARYFKENKQR
MSYKWFIGMW
YNQDSARGG-E
HNKNG IAEYG
VWAKATLELK
YFNNFSDLGA
DLSQSNRMDL
TGAVFDKIQE
IDNRNYTATA
WQGYLDKTFD
AWDSWI(ADVA
CGNWNERNSK
SMVSDAHWQK
FLKTKIKEK
EF111-3 (SEQ ID NO:431) a.
a a a a a a a.
E-ATGAATTIT
ACCGAACGGA
GCATGGCATG
GGGCCATTGA
ACGGTCAAAA
CTGTCTTATr
TTAGC'ITAA
GGTGAAGAGC
GAGGGAACGG
GTGAATTITG
ACGATTCATC
GCTGAACCAA
AC7TTACAA
GCGGCGCGCT
GATCAA~AAGA
ACGATTAATA
TCCATGTCTT
GCAACGGCTG
CAAATTCAAA
TTTI'ACAATC
AAACCACCA7 ?rTTITAAAAC
CACCACAATP
AAAGACGAC;
CTGTTATTG;
AAGGTGTGGC
GCAAATGTT
CGAATGGCGT
GCTACTATTT
'TATTGCGGA
ATAAAAAAAC
ATCCTGGGCG
TTrTTGTCAG
CCTTGTCACT
AAACC'ITAGA
GTGAAGTCAG
ATGCGGATAA
TTGAA ITGAA
AAGAAGAAGA
ATTCAAAGA
AAACAGCAGA
CCAATTGGCG
ATAA.ATGGTT
ATTTTAATCC
AAGATGATAC
AAGACAGTGC
TGGCTGCATG-
1AAATGTATCC
,AAAATGGGAI
LAGGATCAAA!
AGCAGCCGCC
CAAAGGCGAC
TAGATGTGAG
ATACGATACC
ACCTGAAAAA
AGAATATCCA
GGGAGAAACC
CCTAGAGCAA
cAATCGAACG
TGGAGCAAGC
TA'FTGGCACT
AGAAACCTGG
AGTTTCAACA
GCCTAAACAA
TGCATCACCA
AATTATTTTA
AGCAATAAAG
GTAAACTTGG
TATGATTTAA
ACCTATGAAT
GCGCTTATCC
TGGACAGGTG
CG~TAACTG
A.MTATTTTG
AAAATTGATA
ACGTACAACA
C
C
GGCAAAGGAA CAACAACAAG GAAcAAGCAA AGATGGCAAG ATTICCTGA-A TATCAAAATG AAGTGCGGCA GGTGCCTT1'A TATTGGTATG
TGGGCTTGGG
TGAGTTAGCT AAAAATA6ATA CGTACGTCCA
CALAGATGCAG
GCGTGGTGGT
GAAGGTGG-CA
GGCAGTTTGG
CATATTTATC
CAAACTTGTG GCTTATCATA AGCAGAATAT GGAAGCATGG CATTAAAGAT
AAAAATGGCC
TGGGAAAGTG GCATGGATAA GTTGGAGTTA
AAGTT
~TAATTT'ITC TI ~GCTACTGGG GCcAAGTTT ;CCAAAGCAA C DAGACGATTI k.AACCACACT
CGGTCITTGA
CTAAAGACAA
:TACGAAAGA
ATCGGA.ATI'A
CCTATACGAC
CACCCGAATA
GATATCTAGA
CGCTAGTCAA
AGCATGACGG
ATTCGTGGAA
TGCGGGCCTT
GAGCGATCAT
ACTGGAATGA
AAGAAACCAA
ATTGGTGGTA
TCAGTGATGC
ACCTAALAGTG
CGCTACACGG
GATTTAGGT
rGGTTI'TGCG LAACAAAT'rA
:CGCATGGAC
ACGATTCAT
CGAAA.ACACT
:AAA6ATTCAA 1'GACATTCAA
CACAAAATAT
TACAGCAACC
1GAAAGCTAT
TACCAAAAAT
TAAAACGTTT
ATCGATTGAA
GATTGTTCCG
AGCGGATGTA
GTTTIGATTAT
TGATGcTGTC
ACGAAATTCT
AGATAAGGAA
TACCAACAGA
TCACTGGCA.A
GATGATGATG
TTTGACAAAG
EF111-4 (SEQ ID NO:432) DEFA NVLDVSASPT ERTNGVYIDTN
YFNNFSDLGA
WHGYYLPEKS
SYYPGRLEQT
GTETLDIGTR
EP IELKPKQT
QKKTAEFPEY
TAOFNPELAK
PPLAAWAVWH
DDKDQI IKDK
NKELLGGFAG
YE-LDDLTIHL
LTAKDNDIQV
YNTYTTESYT
QNALVKSIET
NNMRALFDYQ
IYQETKDKEF
NGHLKWMMML
PLIIAEEYPV
ALIFVSNRTA
1NFGEVRETWN
FTKEEEA(EQ
INTNWRSAAG
IQKDDTVRPQ
LKEI4YPKLVA
LLKQPRGKVA
NLAASLNKLT
LIQTTJENTG
YFATKDTKYT
QQAPEYTKNA
AFKHDGIVPS
DAGAI IDAVF
YHNWWYTNRD
WITLHGLTKK
VKNKXTG-ETY
EEPLSLGASW
IHHADKVSTI(
ARYFKZENXQR
MSYKWFIGMW
YNQDSARGGE
HNKNGIAEYG
VWAKATLELK
DLSQSNRMDL
TGAVFDKIQE
IDNRNYTATA
WQGYLDKTFD
AWDSWKADVA
GGNWNERNSK
St4VSDAN
WQK
215 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
EF117-1 (SEQ ID NO:433)
TAATTCGA'
GCTTTAACAG
AAGGAAACGN
GAATATATfl'
AAAGGTGATC
AAGAAAAAAT
AAAGGGACAT
GAGAAGGTG
GTTGTAAATC
CACAGTCTTA
CTTTGAAAGG
GTTT'TATTTT
TGAAGATTGG
TAATTGAAAG
TTTAGTGAAA
CGGTGAAAAA
NTCAGAAACA
GGAGATTCGTT
GAAAAGTGGT
TGACGAAGTG
TGAGGAGAAT
AGATTTTCAT
GAATTTGATG
GAATTACAAA
GAGAGTGACA
TCTAGTAAAT
ACAGTTI'ACG
CATGATTCAG
'I =ITTTACT
AAGAATCTCT
ATGGTGACGT
GTCr-TTCCAG ATcAAGTTTN
GAGAATATTA
CCACGAATTA
AATTTTACTT
TCAAAATCTA
TCGTTTAAAT
TTTAATAAAA
TAATGAGCAA
CGGCTTTFITG
EF117-2 (SEQ ID NO0:434) VKR FSFFLLILLA LTGCKSGEKE FDEESLQNLK ETXQSXSETE
LQNGDVRLNE
YISLKGEIVE SDSRSSLIKK GDR-FILKSGS SKYQVXNEQK KKLKIGDEVT
VYGEYY'GFLK
GTLIESEENH
DSATN
EF117-3 (SEQ ID NO:435) TG AAGAATCTCT
TCAAAATCTA
AAGGAAACGN CACAGTCTTA
NTCAGAAACA
GAATATATTT CTTTGAAAGG
GGAGATTGTT
AAAGGTGATC GTTTTATT
GAAAAGTGC-T
AAGAAAAAAT TGAAGATI'GG
TGACGAAGTG
AAACCGACAT TAATTGAA.AG
TGAGGAGAAT
GAATTACAAA
GAGAGTGACA
TCTAGTAAAT
ACAGTTTACG
CATGAT'rCAG
ATGGTGACGT
GTCGTTCCAG
ATCAAG7MT
GAGAATATTA
CCACGAA
TCGTTTAAAT
TTTAATAAAA
TAATGAGCAA
cGGcTrTTG EF117-4 (SEQ ID NO:43 6 EESLQNLK ETXQSXSE-TE LQNGDVRLNE YISLKGEIVE SDSRSSLIKK
GDRFILKSGS
SKYQVXNEQK KKLKIGDEVT VYGEYYGFLK GTLI ESEENH DSATN EF118-1 (SEQ ID NO:437)
TGAGGGGGAA
GCTTTCTTAC
AATCANAACG
TTTTCACAAG
AAAGACCCAT
CTTAGAGATrG
AGATAA
AAAGTGTGTT
TGACAGGTTG
CACAAGAATC
AAGATGGTGC
CGCTTTCTGG
AXATTAAAAG
AAAAAGAAAA
TGGCAAAAGT
TGGrCGTTI'GN
ACAAACGAAT
GGAAGATTCA
ccATGGGAAT
GTGGGGAITG
GCGAAAGATG
GATTTCTCTA
CCTATGAMrG
AGTAGATGCC
GGATGTACCG
TCGCAGGCGT TTCTGTTCA AGTTCATTCA
AGGAATCGGC
TGTCAATTAG
TGACATGAA-A
GGATGCTCAT
CACGCAAATC
AAAAAAGAAA AAGCATTCAA ATI'TCATTGG
TTGGATCGTT
EF118-2 (SEQ ID NO:438) VLKRXV GIVAGVFCSA LLLTGCGKSA KDEF IQGIGN XNAQESGVXD FSNSI SDM41F SQEDGAQTNP MIGMLITQIK DASLSGEDSS RCQKRKSIQL RDEIKSDGNG
CTDFIGWIVR
EF118-3 (SEQ ID NO:43 9 GAAAGATG AGTTCATTCA
AGGAATCGC
AATCANAACG CACAAGAATC TGGGGTTTGN4 GATTCTCTA TGTCAATTAG
TGACATGAAA
TTTTCACAAG AACATCCTGC ACAAACGAAT CCTATGATG CGATGCTCAT
CACGCAA-TC
AAAGACGCAT CGC=rCTGG GGAAGATTCA AGTAGATCC AAAAPLAGAAA
AAGCATTCAA
CT'TAGAGATG AAATIAAAkAG CGATGGGAAT GGATGTACCG ATTTCATTGG T GGATCGTT
AGAT
216 TABLE 1. Nucleotidc and Amino Acid Seqeuences of E. faecalis Genes.
EF118-4 (SEQ ID NO:440) KDEFIQGIGN XNAQESGVXD FSMSISD1M(F SQEDGAQTNP MIGMLITQIK DASLSGEDSS RCQKRKSIQL RDEIKSDGNG CTDFIGWIVR EF119-1 (SEQ ID NO:441)
TAAAGAATAC
AGTGCAACGG
GGCGGAGTGT
GTATCGACAG
TTTACGAGTG
GTAA.AACTTC
AATGAAAAGC
AAAACTGGAG
CATGAATATG
TTGGCTA-AGG
TATTAG
CGAGTAAAAT
AAAGGAGCTC
TGCTTGTTAG
AAGAAACGAC
AAGCAACTAA
TAGAAGCTTA
TAAACCCCCT
TAGCGTTAGT
cTTTACTTT
TGAAGAACAA
TTTCGGA;AGG
GGAAATCGTrG
TGGCTGTCAA
AGTGATI'GAA
AAAACAGACA
TGGAAATGCG
CATGACTGAA
TTCCGTAGGA
GGATGTGAA
TAAAATTTCT
CTMTTTCA
AATAAGCTAC
AGCCATAAGG
ACAGTAGCAA
GAAACAACGA
TATGCGAACT
AAATGTATCA
AAGGTTACAA
CAAAATGGAA
GAAATGACCT
AAAATTGTAT
CTTTACTTAT
AAGAAAACAA
GGGAACAATC:
AATTAGAAGA
TTACA.AGTAT
AAAAAAATGG
CGAT'-TATAA
CGCAGACACG
ATAATTCAGT
ATGCAAAAGA
ITTATTGTTA
GTCTAGTAAA
GAAGGAATCG
ACCAGATCAT
TAATGATC-GC
AATTGATGTT
AAATGATCA.A
AGTGTTACTT
TAAGCAAGAG
*q.
S S
S
S.
S.
S
.55S S. S S S
S.
EF119-2 (SEQ !D NQ:442) 'ZN KLPLLILLLG GVLLVSGCQS HKEENKSSKV STEETTVIET VAREQSKESF TSEATKXQTE TTKLEEPDHV KLLEAYGNAY ANFTSINDRN EKLKPLMTEK CIKK(NGIDVK TGVALVSVGK VTTI YKNDQH EYALLLDCEQ NGTQTRVLLL AXVKNNKISE MTYNSVKQEY EF119-3 (SEQ ID NO:443) AGAAAACAA GTCTAGTAAA GTATCGACAG AAGAAACGAC TTTACGAGTG AAGCAAkCTAA GTAAAACTTC TAGAAGCTTA AATGAAAAGC TAAAGCCCCT AAAACTGGAG TAGCGTTAGT CATGAATATG CTTTACTTTT TTGGCTAAGG TGAAGAACAA
TAT
AGTGATTGAA
AAAACAGACA
TGGAAATGCG
CATGACTGAA
TTCCGTAGGA
GGATI'GTGAA
TAAAATTTCT
ACAGTAGCAA
GAAACAACGA
TATGCGAACT
AAATGTATCA
AAGGTTACAA
CAAAATGGAA
GAAATGACCT
GGGPLACAATC
AATI'AGAAGA
TTACAAGTAT
AAAAAAATGG
CGATTTATAA
CGCAGACACG
ATAA LTCAGT
GAAGGAATCG
ACCAGATCAT
TAATGATCGC
AATTGATGT
AAATGATCAA
AGTGTTACTT
TAAGCAAGAG
EF119-4 (SEQ ID NO:444) ENKSSKv STEETrVIET VAREQSKESF TSE-ATKXQTE rITKLEEPDF4V KLLEAYGNAY ANFTS INDRN E-KLKPLMTEK CIKKNGIDVK TGVALVSVGK VTTIYKN~DQH EYALLJLDCEQ NGTQTRVLLL AXVKNKI SE MTYNSVKQEY EF120-1 (SEQ ID NO:44 5
TGAATAGGCG
ACTGCTTGTC
GTI'GAAGAGA
AGCAGTGTCG
AGTCAGCTAA
TTAGTTGGAA
TGAAAAAGGG
GGGAACCAAA
CGAATGAAAA
AGTCTAACGA
CTGTAGCTGA
TATGGAAAAA
AATGTTAGCG
AGNAAAGAAA
AACGAGTGAA
ATCAGTGAA-A
TTTAGATACT
TGGTAAAGGA
TT~TTGTCG
GTAACCGC T
ACAATTGATA
AATGAAGAC
ACAGCGATTA
CAGAGTTTGA
TrCCTAGCGGT
CAACGGAGGC
AGACAAACGA
CGACAGCTGA
ATGCTGGCGA
TCATTCATCC
TTTATCATTA
ATCCTCTAAA
ACAAGCGAGC
TGGAAAcAAT
TTITACTACT
TGATGGTAGT
TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
ACAAATACCG GAGGAATGAT TACGAAGGAT TCACCTACTG ATGAGTCGCG
ACCAATTACA
AGCTTAAGTA TTAGCTGGGG GCCTACTGGT GCTGCCCTAT TATTATATAA
AATTGGTGTT
EF120-2 (SEQ ID NO:446) VKKGMLAF FVVLAVLSLT ACREPKXKKV TASTEASSKV EETNEKTSET
IDKTNEQASS
SVE-SNESVKN EEPTADGNNS QLTVADLDTT AINAGDFTTL VGIWKNGKGE SLI IHPOGST NTGGMITKDS PTDESRPITS LSIRWGPTGA
ALLLYKIGV
EF120-3 (SEQ ID NO:447) AAGAAA GTAACCGCTT CAACGGAGGC
ATCCTC'AA
GTTGAAGAGA CGAATGAAAA AACCAGTGA-A
ACAATTGATA
AGCAGTGTCG AGTCTAACGA ATCAGTGAAA
AATGAAGAGC
ACTCAGCTAA CTGTAGCTGA TrAGATACT
ACACGATTA
TAGTTGGAA TATGGAAAAA TGGTAAAGGA
GAGAGTTTCA
ACAA.ATACCG GAGGAATGAT TACGAAGGAT
TCACCTACTG
AGCTTAAGTA TTAGGTCCGG GCCTACTGGT
GCTCGCTAT
AGACAAACGA
CGACAGCTGA
ATGCtGGA
TC:ATTCATCC
ATGAGTCGCG
TATTATATAA
ACAAGCGAGC
TGGAAACAAT
TTTTACTACT
TGATGGTAGT
ACCAATTACA
AATTGGTGTT
a.
a a a a a a EF120-4 (SEQ ID NO:448) KKV TASTEASSKV EETNEKTSET
IDKTNEQASS
SVESNESVKN EEPTADGNNS QLTVADLDTT AINAGDFTTL VGIWKNGKGE
SLIIHPDGST
NTGGMITKDS PTDESRPITS LSIRWGPTGA
ALLLIYKIGV
EF121-1 (SEQ ID NO:449)
TGAAACACAA
CATT=TTAA
GAAACAACGA
CAAGAACCAG
ACGAGTGACA
GAAACGTCAA
ACGTCAGATG
GTTGGTTTGT
ACCGTTTAA
AATAAACC
AAGTATGATG
AAAATTCAAC
GGTC ITCGTT
CCAGATITAA
CCTCGTGTTA
GAGTTGAAAG
AATAGTGATC
TATATTCTGG
ACTGTACCGG
GIrGCTAAAA ACGAATG ETC cGAGCGTTT'P
ATTAAAGGAP
GTTCAAAAAC
AAATTACCTC
ACcT1'AGTGI
GGAGGAAATT
TGGCTGTL'GC
GTCAACAAAG
TCATACACA
GTGTCAAGCA
TCGCTGAAALA
TTCATGGTCA
cccAAGTAAG
TTGATAATGG
CTTTAGTGAA
cAATGGTITT
AAGAAGCCAC
TrGTTGAAGG
AAGTTGGGAT
CTTCGCTTAA
CTAACGATCA
CGGCTGCCAG-
GTCATGACC.P
TAGGGGGACC
AT'GCCGATA-2
CAGCAGATGI
TTCAGGAGGI
LTTCCCGAAG(
AAGTAACGG(
CGGGGAAGX
k GTGTTCCCA'
TC;TGAAAAAG
GTTGATAGCG
'T~cAGAAGCA
GGAAACAACA
GTCACAAGAA
AGAAGAA.ACG
ATTATGGAAT
TACAGTCGTI'
CGACAATA T
TGAAAAGACC
GGGAAATCAT
TTTTCCAATC
GACTACCACG
TATCGGCTTA
TTTTTTACCT
GGCTGACAT'I
TGCCGACCAA
CCTI'TCTTTi
GAAAGATACC
GTGGGAAGTC
SAGCAGTTAAC
k GATCGGCAC) 'ACAATTACA2 3CGCACAAT~ r' 'TCCTATGC( r TAACGGTGA.
TTGAGCTTTA
CCAAGTGTTA
GTAACAAGTA
GACATCAAAC
ACCACAGCAC
AGCACGCCGC
TrGGTCTTATG
AACCAAGTCC
CAAGGCACTA
CATCCAATGA
GAGTTTAATT
TTGTCTGCGA
AAGGAACTTC
ACAATTCCG.C
TI'GAAAGA.AG
ATtGTTGC CI
*GTAATTGAA-P
*ACCAAGCAAC
GGGACAGAAC
CAAGAAGGTI
GCAGCGACAI
k GCAACAGCTC k CCAACAGCGj
AGTGCGGCA(
ACGA TTTD PL AACTTACTG, AAAAAGTGAA G
CTAGTACGGCA
CCACCGATTCA
AACAAGCACC
CAACAGAGAC
G
AAAAAATAAC
AAGATCATAA
GGGCACAAAAC
TTTTAACAGA
TCACCGCCATC
TTGGTTI'ACC
ATACCTACAA'
ATTITAATCA
ACATTCCTTT
AAGCAGAAAA
CCATrTCATGC
ATGTCGCGGG
GAGCAGCGCC
TTGTCAAAAT
SCAGCAACGAT
~AAGAATACCA
XITrTTTTACC k. TGATTTCTTT CGCTGmAA
ATATCTACAA
A, AGTATTTAGA
.TATGCAGTA
ACTAGAAAA
LAATCAGGCT
~ACGAATA
ATTITAGGT
LGAACTACCA
:CCAGCAGGC
EGACTTGTAT
;AATGTGATG
;TTAATCAM.A
I'AAGGAAGAT
ALGATGrGGCAG GTiGGGATGGC
AGCAGTTACT
GGGACAAcAA
GATTGATGCG
GAATGGAAAA
TGATCTTTCA
TGTACCAACA
TGAAAAAACG
AAAACAAGAA
AATTAATAAC
ATACGACAGT
ATACCCGAAT
AAAACAAGGG
218 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. foecalis Genes.
set**
GCGTACTATA
GTATATAACT
GGTGAACGAA
ACGA ETGCTA
GAACCTATTA
AAAAAAGGAA
AATTT1'GATG
ATTCCCACTT
CGTA.ATGCGG
GGCCGACTAG
CAAGAAAACC
AATTTCTCCA
GCGGTGGGAA
CTGAATTITC
GATCCGTATA
CCAGAAACAG
A IICCAGAAG
TTCGTGGTTA
ACGCTAGCAG
CATTCGCATA
CGTAATTATT
ACAACAAAAT
GCCATCGTTG
TATATTCCAT
GGGAATTTGA
GATTTTGCAG-
GTTGGGGATC
GGCGCCCAAC
AATGGCACA.J
CTACATGTT
GCTAGTGACC
GAAA ITAAAJ
ATGGCTACC)
CGAGAAGAA(
TTGCGGTJTG'
GGCGAACGJAj
ACGCCAAAA(
AATAAACAA.
TATGGCGGC'
CTGTTCGTD
TAG
ACCAAXACACA
ATGACATGAT
TTGTAGATGC
TGAATAATTA
AAAATTCTGA
CTCTTGATCC
AAAAATGGCG
CTC:CTGATGG
GCTTTGATITT
AAGCAGGGAA
CAACCTrGAT
AAGGCGCGGA
ATCACGAG71
CGATTTI'ATC
CAATCGTAGP
CA.ACGAAAAC
TAGAAGCAG~I
CTGGGCATT9)
AAACCCTTAC
CAGCCGTCGI
TAAATAATG'.
TGA'TTTCAGC
ATGAAGCAC(
*TCACATTGG
TITCGTGATG(
TAACTAATGC
TCATTGCTG'
TTAAAGAAA'
LTTTACTAG.
CAAGCTCCA
AAGGCAATC
kATCGGCAAA k ATGATTTCT 3 GATTTCAC r* ATCGTGCAG k. TTGTTTCTP
:CAGATCCAP
PL. ACCAAGCGC r TrTTAccT; 3 GACTTTCT9
GCCAGATGAT
TrTCTGGAGTG
GAA.AATTGAC
'TCGTTACGGC
TCCACAAACC
AGAACAAGAA
TGCCAAAGCA
ACGTACACCA
AGATAATGCA
AGGCGAATTA
GGTGGATGCA
TATGGCCAA-A
TGATTTTGGT
TAGTAATACG
AAAATCCGGG
ACACCCGAAA
GATTAAGGAA
AGGCGTAGAT
TCAAACATAI
AAGTGGCAAA
P TGGGATCGTC
-AGCAGAGCT'C
3 TACGAATTTI k. TGGACA.ACG;
-GATTATCTC;
3 TGGCGGCATr r GTTACCTTT' T GTTGAAATC A TGATGCTGG( T TCGTATCCA( A AACAGGACA C ACAAAAGTT' 'T AGCTGCTGG' T ACATTCTGTI C AACGACGAT ,T TTCGGAAGA hA ACCAGACCC ,G AGCAAGACA LA AACGGGTAC ,c TTCTGGCTG TGACCATTA G
GACTACAAGAT
GGCCAACCGC T GGTTTAGCTA G 1TACGAGGAA T
ATCGAACGAAA
ATCGAATI'AGT
AACGCCGCCG C
TATACCATTA
GGTATGGCGC G .CGGGATGTTT CCAATGAATC ITAGAGATTG C TATIACAAAG AAAAAGTTTG C
AACGTAGACA
ATTAAAGAGA
GAAACGACGC
CCTGAGTTAC
CGTTATGGCA
ACAGCACCAG
CTAGAATTGC
AACGCTGAAA
GAAAATGTGC
TATGGCCAAG
rCGCGCTCATA r' GGCAATAGCA 'TCTGTTCG LT
CAACCAAA.AC
TATGATTCCA
k. ACGATTGTCG r GAACCATrGG r GGTGATGG T
:T'TAATTGAAT
T GATTTAGCAC A GCITACAAAG G AAACCAACAC G AGCAATCCAT A GAAACAGAAA G TATATTTATA
TTTTAATCCA.
TGACATTTCk GGATCCT'-C C.
CCAAGGGAT T
GAT-TGTI'GAT
ETGGTCAAT
GAATGACGG C TATTACGAA A
CGCACACAAAT
TCTAAAAAC C 'CCAAGGATT A
LAGTTGCTA
ACTACGGTA T ~TGGC.AGTGG P
CATTGTAGG
GGTGACATT 9J kGTACGCGGA :GCATATCTG C
ATATCACTGT
AAGTGATCTA
A.GAGTGAACC
CAGAAAACCC
ATGAAAAAGT
GCACACGAGA
ACGCGTTTAG
TTAAACAACG
TTGCGCAAAT
CGATTCCACA
TTGGCGCAAA
CAAAACCAGG
GTAGTCGCGT
ATGAGAAGAA
ACGATATGCT
ACTTGAAAAG
AATATAAAGA
AGTTAATCGG
CAGAAACACC
CCGTAACAGA
CGCTTGCATT
AACGACGTAA
kACA I7CGT kAACCAGTG
~AACAATAT
CAAGTAGGG
TATATTAAG
A'I ICGGACA
ACTCTI'CAA
CAAGATGTC
rGACG ETCAT
TTA-AAGAC
,CCAATCTCC
,rGATGCCATG
~AAAGACCAA
LCGGGTT TTT
.GTGACGACC
~AAAGACCCG
'ATNCAAGCT
CGTGGTGAT
3ATTGATGGA rGCTCAAACA k.ACTAAGAAA
GGCACTTAAA
AATTGTCGAT
GACCAACTTA
CCAACCTGCT
GccAATTAAA
TCAAGTAACC
AAAAGATGAG
TGGTGGTTTC
TACTCGCTTG
ATTAGGAATA
ACAATACCG
AGGTGGTGAA
TGCAACCAGC
ACCATTCCCA
TGGAGGAGAG
AGTAGCAACC
GAAGAAAAAG
ATATGGTTTA
CAAAGCTAGT
0sS
S
0* S 9.
S.
EF121-2 (SEQ ID NO:450) VKKL SFKKVKWGMH FLMAVALIAP SVTSTAYAVE TTSQQSSE-AV TSTTDS
EPVITQETTD
SDVHGQLWNW
KAPLVNEKTH
LRFVEGTTTK
LKANDQADI I
VPVGGPKDTG
AFIQEEIGTA
LPAGKI SYAT
IKQEAPNQAT
SYEDDKELPV
PMITAMNVMK
ELDFNQDGQP
VAS IHAGQQN
TEVVKIDLSV
TADFLPKQEI
IFDIYKYPNT
SDSVKQSQET
GLSQVSTVVN
YDAMVLGNHE
DLKVGIIGLT
SDPAASADQV
AKNADKWEVQ
I(GIPEAQLQP
LVSVPIbIGEN ER IVDAXI DG TAPTETTNriE
QVRAQNPAGT
FNFGLPLIKK
IPHIPLWDGP
IENVAGIDAY
EGTATIVPTT
TAMISLINNV
LLKYLEKQGA
QPLDPAKEYT
VLIDNGDNIQ
IQQEATFP IL
RVTSLNFLPL
ILGHDHLSFT
NVPADEAVKA
QKEVTGAQLS
YYNQTQPDDL
IAMNNYRYGG
SRKQ
TPQKITILGT
GTILTDDLYN
SANTYNKEDG
KEEAEKAVTE
KQGAAPN1GY(T
ATKEYHF.KTR
AAALFKYDSK
TISFNPNIRV
LASQGIQVGE
YNYDMISGVD YKIDISKPVG 219 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
PIKNSDPETL
PTSPDGRTPN
ENPTLMVDAG
NFP ILSSNTY
PEVEAVIKEI
SHTAVESGKR
IVDE-ARTNFN
FAVTNGGG IR
GTILLDDAGQ
IKNRQTQKFE
RLYRAATTID
KQNQAGARQS
RGMIVDYIKK
AAAITKQDVR
DVFQGLPISN
YKDGSGRVFD
KEKYADXQAF
YGKVIYAQTG
AENEKVIVDY
ADIKQGPIKV
PKLGANGGFL
PLDEKKQYRM
LAQYKEPFPG
NP SVTZKKKY
KGTLOPEQEI
NAG FDLDNAY
FSKGADMAKA
PYTIVEKSGK
VVTGHLGVDE
NYLNNVG IVT IPFTLDGQR-7
GOVIAVLPFG
HVSSSIRIHY
ATNDFLAAGG
ERIVSISEE-A
GGFLPKTGTE
ERNWSI IGTN
TIMHTNDV{G
MNEVG.YDAMIA
KFAIVGVTI'P
'1 9 1PH IWRGDT
APESEPTKKT
NVRTRETNLG
NSIAQIQVTG
DSTKPGTR{LA
DGYDMLGGER
YKYELIGGGET
TETLALYGLL
FDEKWRAKAI
RLEAGKGE:LG
VGNHEFDFGL
E-TATKTHP"!
LAETLSQTYP
TKL ISAAELL
NLIGDAIMSY
AQVKEMFEMS
SDEGNETGQT
EEGISLDSVL
PKPDPKPDPK
FVGLS SSGWY
ELVNDGTLQI
MARLKTFKDQ
EIALGYXDQL
VEKVTFKDP I
ELDITVIDGH
ELPENPAVKA
GQDAFSQPAO
vRS IPQKDEN
IVGSRVLGIE
IEYLKSATSL
PTPETPVATN
IYKRRNKAS
EF121-3 (SEQ ID NO:451) ACAAAG TTCAGA.AGCA GTAACAAGTA CCACCGATTC AAGI *9 0 90
SS
S S
SS
SS
S 0 5@ 5* 0 5
S
S.
0 5 5
CAAGAACCAG
ACGAGTGACA
GAAACGTCAA
ACGTCAGATG
GTTGGTTTGT
ACCGTTTTAA
AATAAAG.CGC
AAGTATGATO
AAAATTCAAC
GGTCTTCGI'I
CCAGATTTAA
CCTCGTGTTA
GAGTI'GAAAG
AATAGTGATC
TATA'ITCTGG
ACTGTACCGG
GTTGCTAAAA
ACGAATGTTC
CGAGCG=IA
ATTAAAGGAA
GTTCAAAAAG
AAATTACCr
ACCTTAGTGA
GCGTACTATA
GTATATAACT
GGTGAACGAA
ACCAT'rGCTA dAACCTATTA
AAAAAAGGAA
AATTTTGATCG
ATTCCGACTI
TCATTACACA
GTGTCAAGCA
TCGCTGAAAA
'ITCATGGTCA
CCCAAGTAAG
TITGATAATGG
CT'1AGTGAA
CAATGGTTTT
AAGAAGCCAC
'FrGTTGAAGG
AAGTTGGGAT
CTTCGCTTAA
CTAACGATCA
CGGCTGCCAG
GTCATGACCA
TAGGGGGACC
ATGCCGATAA
CAGCAGATGA
TTCAGGAGGA
'rrCCCGAAGC
AAGTAACGGG
CGGGGAAGAI
GTGTTCCCAI
ACCAAACAC;
ATGACATGA1
TTGTAGATGC
TGAATAATTI
AAAATTCTGI
CTCTTGATC(
AAAAATGGCC
CTCCTGATG(
GGAAACAACA
GTCACAAGAA
AGAAGAPLACG
ATTATGGAAT
TACAGTCGTT
CGACAATATT
TGAAAAGACC
GGGAAATCAT
TITTCCAATC
GACTACCACG
TATCGGCTTA
TTTTTTACCT
GGCTGACATT
TGCCGACCAA
CCTTTCTT7I
GAALAGATACG
GTGGGAAGTG-
AGCAGTTAAC
GATCGGCACP
ACAATTACAP
iCCCACAATT;
TTCCTATGCC
TAACGGTGA.U
GCCAGATGK.
TTCTG.GAGTC
GAAAATTGA(
TCGTTCGG(
k. TCCAGAAAC( AGAACAAGXi 3TGCCAAAGC 3ACGTACACC,
GACATCAAAC
ACCACAGCAC
AGCACCCC
TGGTCTTATG
AACCAAGTCC
CAAGGCACTA
CATCCAATGA
GAGTTTAATT
TTGTCTGCGA
AAGGAACTI'G
ACAATTCCGC
TTGAAAGAAG
ATIGTTGCCT
GTAATTGAAA
ACCAAGCAAG
GGGACAGAAG
CAAGAAGGTA
GCAGCGACAA
GCAACAGCTC
CCAACAGCGA
AGTGCGGCAG
ACGATTFIG
kAAC'ITACTGA.
C TTGACCATTP 3GACTACAAGP
GGCCAACCGC
2 GGTTAGCV.
TI'ACGAGGA;
N ATCGAACGA) N. ATCGAATTAC A, AACGCCG
'GAAAA
AAGAAGCACC
CAACAGAGAC
AAAAAATAAC
AAGATGATAA
GGGCACAAAA
T=T1AACAGA
TCACCGCCAT
TMG=ACC
ATACCTACAA
AITTTAATCA
ACATI'CCTTI'
AAGCAGAAAA
CGATI'CATGC
ATGTCGCGGG
GAGCAGCGCC
TTGTCAAAAT
CAGCA.ACGAT
AAGAATACCA
AITITTTTACC
TGATTCTTT
*CGCTGTTTAA
ATATCTACAA
AGTAI'TTAGA
GTTTTAATCC
ITACATLTC
TGGATCCTGC
GCCAAGG-GA]
TGATTGTTGrz k ATTGGTCA-A'
TGAATCGACG(
AAATCAGGCT
GACGAATITA
AArI'TTAGGT
AGA.ACTACCA
CCCAGCAGGC
TCACTTGTAT
GAATGTGATG
GTTAATCAAA
TAAGGAAGAT
AGATGGGCAG
GTGGGATGGC
AGCAGTTACT
GGGACAACAA
GATTGATGCG
GAATGGAAAA
TGATCTTTCA
TGTACCAACA
TGAAAAA.ACG
AAAACAAGAA
AATTAATAAC
ATACGACAGT
ATACCCGAAT
AAAACAAGCG
AAACAITCGT
AAAACCAGTG
CAAAGAATAT
7TCAAGTAGG
TTATATTAAG
r TATTGGGACA 3CACTC'FrCAA
S.
OS@S
S
9
OSSO
OSSO
S
0000 @0 5 00 0S EF121-4 (SEQ ID NO:4S2) QSSEAV TSTTOSSRKQ EPVITQETTD IKQE-APNQAT SDSVKQSQET TAPTETTNLE TSIAEKEETS TPQKITILGT SDVHGQLWNW SYEDDKELPV GLSQVSTVVN QVRAQNPAGT VLIDNGDNIQ GTILTDDLYN 220 TABLE 1. Nucleotide and Amino Acid Seqeuences; of E.faecalis Genes.
KAPLVNEKTH
LRFVECTTTK
LKANEDQADI
I
VPVGGPKDTG
AFIQEEIGTA
LPAGKISYAT
YNYDMI SGVD
PIKNSDPETL
PTS POGRTPN
PMITAMV'
ELDFNQDGQP
VAS IHAGQQN
TEVVKIDLSV
TADFLPKQEI
I FD IYKYPNT
YKIDISI(PVG
RGMIVDYIKK
YDAMVLGNHE
OLKVGIIGLT
SDPAASADQV
AKNADKWEVQ
KGIPEAQLQP
LVSVP INGEN
ERIVDAKIDG
KGTLDPEQEI
FNFGLPLIKE
IPHIPLWDGP
I ENVAGIDAY
EGTATIVPTT
TAMI SLINNV
LLKYLEKQGA
QPLDPAKEYT
ERNWS IIGTN
IQQEATFIL
RVTISLNFL-PL
ILGHDHLSFT
NVPADE-AVKA
QKEVTGAQLS
YYNQTQPDDL
IAMNNYRYGG
FDEKWRAKAI
SANTYNKEDG
KEEAEKAVTE
KQGAAPNGKT
ATKEYHEKTR
AAALFKYDS(
TISFNPNIRV
LASQGIQVGE
ELVNDGTLQ
I
EF122-1 (SEQ ID NO:453)
TGAAACACAA
CATTTTTTAA
GAAACAACGA
CAAGAACCAG
ACGAGTGACA
GAAACGTCAA
ACGTCAGATG
CTTGGTTTCT
ACCGTTFTAA
AATAAAGCGC
AAGTATGATG
AAAATTCAAC
GGTCTTCGTT
CCAGATTTAA
CCTCGTGTTA
GAGTT1GAAAG
AATAGTGATC
TATATTCTG
ACTGTACCGG
GTTGCTAAAA
ACGAATGTTC
CGAGCG'ITTA
ATTAAAGGAP.
GTTCAAAAAC
AAA TACCTC
ACC~TTAGTG;
GCGTACTATI
GTATATAAC9
GGTGAACGAI
ACGATTGCTJ
GAACCTATTj
AAAAAAGGAJ
AATTTTGATI
AI'CCGACTI
CGTAATGCG,
GGCCGACTA
CAAGAAAAC
AATTI'CTCC
GCGGTGGGA
CTGAATTTI
GGAGGAAATT
TGGCTGTTGC
GTCAACAAAG
TCATTACACA
GTrTCAAGCA
TCGCTGAAAA
TTCATGGTCA
CCCAAGTAAG
TTGATAATGG
CTI'TAGTGAA
CAATGGTTTT
AAGAAGCCAC
TTG7'rGAAGrG
AAGTTGGGAT
CTTICGCTTAA
CTAACCATCA
CGGCTGCCAG
GTCATGACCA
TAGGGGGACC
ATGCCGATAP
CAGCAGATGP
ITCAGGAGG;
'FrCCCGAAGC
AAGTAACGGC
CGGGGAAGA'.
GTGTTCCCK
SACCAAACACi
LATGACATGA!
k. TTGTAGATGi
TGAATAATI'
AAA-ATTCT.
k CTCTTGATC S AAAA.ATGGC T CTCCTGATG G GCTTTGA7I G AAGCAGGGA C CAACCTrGA A AAGGCGCGC A ATCACGAGI 'C CGATT'ITA)
TGTGAAAAAG
GTTGATAGCG
TTCAGA.AGCA
GGAAACAACA
GTCACAAGAA
AGAAGAA.ACG
A'ITATGGAAT
TACAGTCGT'
CGACAATATT
TGAAAAGACC
GGGAA.ATCAT
TTTI'CCAATC
GACTACCACG
TATCGGCTTA
TTTTTACCI
GGCTGACATI
TGCCGACCAP
CCTTTCTI'
GAAAGATACC
GTGGGAAGT(C
AGCAGTTAAC
LGATCCGCACJ
ACAATTACAj
CGCACAATT
C' TTCCTATGCI r' TAACGGTGIk k GCCAGATGA r TTCTGGAGT
-GAAAATTGA
h TCGITACGG A TCCAGAAAC C AGAACAAGP G TGCCAAAGC G ACGTACACC T AGATAATGC A AGGCGAATI 6T GGTGGATGC ;A TATGGCCPJ .r TGATTTTG .C TAGTAATA( ;A AAAATCCGI
TGAGCTTTA
CCAAGTGTTA
GTAACAAGTA
GACATCAAAC
ACCACAGCAC
AGCACGCCC
TGGTCTTATIG
AACCAAGTCC
CAAGGCACTA
CATCCAATGA
GAGT"TTAATT
TTGTCTGCGA
AAGGAACTTG-
ACAATTCCGC
TTGAA-AGAAC
ATTGTTGCC']
LGTAATGAAN
ACCAAGCAAC
GGGACAGAA(
CAAGAAGGT
GCAGCGACAX
k GCAACAGCT k. CCAACAGCG.
k~ AGTGCGGCA
ACGATTTTT
A AACTIACTG T FI'GACCATT G GACTACAAG C GGCCAACCG C GGTlTAGC'I C TTACGAGGP LA ATCGAACG; :A ATCGAATI :A AACGCCGCC :A TATACCAT.
LA GGTATG-GCC :A GGGGATG7 k.A GCAATGAA! 'T TTAGAGAT :G TATTACAA AAA.AAGTGAA G' CTAGTACGGC
A)
CCACCGATTC
AJ
AAGAAGCACC CAACAGAGAC
G~
AAAAA.ATAAC
X~
AAGATGATAA
A(
GGGCACAAAA
C
TITTAACAGA T1 TCACCGCCAT
G.
TTGGTI'TACC
G
ATACCTACAA
T
ATTTTAATCAA
ACATTCCTTT
G
AAGCAGAAAA
7CCATTCATGC
G
kATGTCGCGGG
G
'GAGCAGCGCC
G
TTGTcAAAAT k CAGCAACGAT k. AAGAATACCA 2AT'FITIACC A. TGATTTCTTT 0 CGCTGTITAA G ATATCTACAA A AGTATTI'AGA A GT=11AATCC A TTACATTC ;C TGGATCCTGC
GCCAAGGGAT
LA TGATTGTTGA A ATTGGTCAAT kG TG.AATGACGG 'G CTAITACGAA C~A TCCACACAAA ;C GTCTAAAAAC rT TCCAAGGATT ro AAGTI'GGTTA TG CACTAGGTI'A AG ATGGCAGTGG
.GGGGCATG
'ATGCAGTA
~TAGAAAA
kATCAGGCT kCGAATTTA
~TT
T
ITAGGT
GAACTACCA
CCAGCAGGC
2ACTTGTAT
AATGTGATG
TTAATCAAA
AACGAAGAT
GATGGGCAG
TGGGATGGC
,GCAGTTACT
GGACAACAA
ATTGATGCG
;AATGGAAAA
7GATCTTTCA
E'GTACCAACA
rGAAAAAACG kAAACAAGAA k.ATTAATAAC kTACGACAGT
ATACCCGAAT
P.AAACAAGGG
AAACATTCGT
AAAACCAGTG
CAALAGAATAT
TCAAGTAGGG
TrATATTM-G
TATTGGGACA
CACTCTTCAA
ACAAGATGTC
TGACGTTCAT
CTTTAAAGAC
ACCAATCTCC
TGATGCCATG
TAAAGACCAA
ACGGGTTTITT
AAAAAGTTTG CCA'FI'TAGG TGTGACGACC GATCCGTATA
CAATCGTAC
221 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.f[aecalis Genes.
CCAGAAACAG
ATTCCAGAAG
TTCGTGGTTA
ACGCTAGCAG
CATTCGCATA
GGTAATTATT
ACAACAAAAT
GCCATCGTI'G
TATATTCCAT
GGGAATTGA
GATTTTGCAG
G PIGGGGATG
GGCGCCCAAG
AATGGCACAA
CTACATGTIT
GCTAGTGACG
GAAATTAAAA
ATGGCTACCA
CGAGAAGAAG
TTGCGGTTGT
GGCGAACGAA
ACGCCAAAAC
AATAAACAAA
TATGGCGGCI
CTGTTCGT'G
CAACGAAAAC
TAGAAGCAGT
CTGGGCA'TIT
AAACCC LTAG.
CAGCCGTCGA
TAAATAATGT
TGATTTCAGC
ATCAAGCACG
TCACATTGGA
TTGGTGATGC
"TAACTAATGG
TCATTGCTrGT
TTAAAGAAAT
TTTTIACTAGA
CAAGCTCCAT
AAGGCAATGA
ATCGCCAAAC
A EGATTITCTT GGAqTTCACT
ATCGTGCAGC
TTGTTTCTA'I
CAGATCCAAP
ACCAAGCGGG.
T=TACCTAM
GACTTTCT'C
ACACCCGAA-A
GATTAAGGAA
AGGCGTAGAT
TCAAACATAT
AAGTGGCAAA
TGGGATCGTC
AGCAGAGCTG
TACGAATTTT
TGGACAACGA
CATTATGTCA
TGGCGGCATT
GTTACCTTTT
GTTTGAAATG
TGATGCTGGC
TCGTATCCAC
AACAGGACAA
ACAAAAGTTI'
AGCTGCTGGT
AGATTCTGTC
*AACGACGATT
T~CGGAAGAA
ACCAGACCCC
AGCAAGACAC.
AACGGGTACP
'TCTGGCTGC
AACGTAGAGA
ATTAAAGAGA
GAAACGACGC
CCTGAGTTAG
CGTTATGCCA
ACAGCACCAG
CTAGA.ATTGC
AACGCTCAAA
GAAAATGT4GC
TATGGCCAAG
CGCGCTGATA
GcAATAGCA
TCTGTTCGTT
CAACCAAAAC
TATGATTCCA
ACGAI 'GTCCG GAAC CATTGC GGTGATGGT I T-rAATTAAWJ
*GATTTAGCAC
*GCTTACAA.AC
AAACCAACAC
AGCAATCCA'
GAAACAGAA2 TATATTTATj
AGGTGACA-T
AGTACGCCGGA
CGCATATCT-G
ATATCACTGT
AAGTGATCTA
AGAGTGAACC
CAGAAAACCC
ATGAAAAAGT
GCACACGAGA
ACGCGTTTAG
1'TAAACAAGG
T'TGCGCAAAT
CGATTCCACA
'FrGGCbCAAA
CAA.AACCAGG
GTAGTCGCGT
ATGAGAAGAA
ACCATATGCT
ACTTGAAAAG
AATATAA.AGA
AGTTAATCGG
CAGAA.ACACC
r CCGTAACAGA k. CGCTTGCATI AACGACGTAAi
TAAAGACCCG
TATNCAAGCT
GCGTGGTGAT
GATTGATGGA
TGCTCAAACA
AACTAAGAAA
GGCAGTTAAA
AATTGTCGAT
GACCAACTTA
CCAACCTGCT
GCCAATTAAA
TCAACTAACC
AAAACATGAG
TGGTGGTTTC
TACTCGCTTG
ATTAGGAATA
AcAATACCGG
AGGTGGTGAA
TGCAACCAC
ACCATTCCCA
TGGAGGAGAG
AGTAGCAACC
GAAGAAAAAG
ATATGGTTTA
CAAAGCTAGT
a
C
4~*
TAG
ZF122-2 (SEQ ID NO:454) VKKL SFKKVKWGMH FLMAVAL lAP SVTSTAYAVE TTSQQSSEAV
TSTTDSSRKQ
EPV ITQETTD
SDVHGQLWNW
KAPLVNEKTH
LRFVEGTTTK
LKANDQADI I
VPVGGPKDTG
AFIQEEIGTA
LPAGKISYAT
YNYD)MISGVD
P IKNSDPETL PTS PDGRTPN
ENPTLMVDAG
NFPILSSN'TY
PEVEAVIKEI
SHTAVESGKR
IVDEARTNFN
FAVTNGGG IR
GTILLDOAGC
IKNRQTQKFE!
RLYRAATTIE
KQNQAGARQS
IKQEAPNQAT
SYEDDKELPV
PMITAM4NVMK
ELDFNQDGQP
VAS IHAGQQN
TEVVKIDLSV
TADFLPKQEI
IFDIYKYPNT
YKIDISKPVG
RGMIVDYIKK
AAAITKQDVR
DVFQGLPISN
YKDGSGRVFD
KEKYADXQAF
YGKVIYAQTG
AENE-KVIVIDY
ADIKQGPIKV
PKLGANGGFIL
PLDEKYQYPYJ
LAQYKEPFPC
NPsvTEKKIXN
SDSVKQSQET
GLSQVSTVVN
YDAMVLGNHE
DLKVGI IGLT
SDPAASADQV
AKNADKWEVQ
KGIPEAQLQP
LVSVPINGEN
ERIVDAKIDG
KGTLDPEQE I
NAGFDLDNAY
FSKGADMAKA
PYTIVEKSGK
VV'TGHLGVDE
NYLNNVCIVT
IPFTLDGQRE
GDVIAVLPFG
HVSSSIRIH'Y
IATNDFLAAGG
ERIVSISEEA
GGFLPKTGTE
QVRAQNPAGT
FNFGLPLIKK
IPHIPLWDGP
IENVAGIDAY
EGTATIVPTT
TAMI SLINNV
LLKYLEKQGA
QPLDPAKEYT
ERNWSI IGTN
TIM}{TNDVHG
MNEVGYI)AMA
KFAIVGVTTP
TTPH IWRGDT
APESEPTKKT
NVRTRETNLG
NS IAQIQVTG
DSTYPGTRLA
DGYDNLGGEP
YKELIGGGEl
TETLALYGLI
VLIDNGDNIQ
IQQE-ATFPIL
RVTSLNFLPL
ILGHDHLSFT
NVPADE.AVKA
QKEVTGAQLS
YYNQTQPDDL
IAMNNYRYGG
FDEKwRAKAI
RLEAGKGELG
VGNHEFDFGL
ETATKTHPKN
L.AETLSQTYP
TKLI SAAELL
NLIGDAIMSY
AQvKEMFEMKS
SDEGNETGQT
EEGISLDSVL
PKPDPKPDPK
FVGLSSSGWY
TPQKITILGT
GTILTDDLYN
sANTYNKEDG
KEEAEXAVTE
KQGAAPNGKT
ATKEYHEKTR
AAALFKYUSK
TISFNPNIRV
LASQGIQVGE
EL VNDGTLQ I
MARLKTFKDQ
EIALGYKDQL-
VEKVTFKDP I
ELDITVIDGH
ELPENPAVXKA
GQDAFSQPAD
vRs IPQKDEN IVGSRVLG
IE
IEYLKSATSL
PTPETPVATN
IYYRRNKAS
EF122-3 (SEQ ID NO:45 5 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
TG AAAAATGGCG TGCCAAAGCA ATCGAATTAG TGAATGACGG
CACTCTTCAA
ATTCCGACTT
CGTAATGCGG
GGCCGACTAG
CAAGAAAACC
AATTTCTCCA
GCGGTGGGAA
CTGAATTTTC
GATCCGTATA
CCAGAAACAG
ATTCCAGAAG
TTCGTGGTTA
ACGCTAGCAG
CATTCGCATA
GGTAATTAT
ACAACAAAAT
GCCATCGTTG
TATATTCCAT
GGGAkATTTGA
GATTTTGCAG
GTTGGGGATC
GGCGCCCAAG
AATGGCACAA
CTACATGTTT
GCTAGTGACG
GAAATTAAAA
ATCGCTACCA
CGAGAAGAAC
TTGCGGTTGE
GGCGAACGA6P
ACGCCAAAAC
AATAAACAAP
CTCCTGATGG A GCTTTGATTTI A AAGCAGGGAA
A
CAACCTTGAT G
AAGGCCCGGA
ATCACGAGTT I1 CGA=TTTATC
CAATCGTACA
CAACGAAAAC
TAGA6AGCAGT CTGGGCA1TT
AAACCCTTAG
CAGCCGTCGA
TAAATAATGT
TGATTTCAGC
ATGAAGCACG
TCACATTGGA
ITGGTGATGC
TAACTAATGG
TCATTGCTGT
TTAAAGAAAT
TITTACTAGA
CAAGCTCCAT
AAGGCAATGA
ATCGGCAAAC
ATGATT"TCTT
GGArI'TCACT
ATCGTGCAC
TTGTTTCTAT
CAGATCCAAA
LACCAAGCGGG
.CGTACACCA
GATAATGCA ~GGCGAATTA G
;GTGGATGCAC
.ATGGCCAAA C ~GATTTTGGT
'GTAATACG
".ATCCGGG
kCACCCGAA6A 3ATTA6AGGAA kGGCGTAGAT rCAAACATAT k~AGTGGCAAA
TGGGATCGTC
AGCAGAGCTG
TACGAATTTT
TGGACAACGA
GATTATCTCA
TGGCGGCATT
GTTACCTTTT
GTTTGAAATG
TGATGCTGGC
TCGTATCCAC
A6ACAGGACAA
ACAAAAGTTT
AGCTGCTGGT
AGATTCTGTC
AACGACGATT
TTCGGAAGAA
ACCAGACCCG
AGCAAGACAG
LACGCCGCCG
.ATACCATTA
;GTATGGCGC
;GGGATGTTT
;CAATGA6ATG
.TAGAGATTG
C'ATTACAAAG
kAAAAGTTTG
),,ACGTAGAGA
%TTAAAGAGA
GAAACGACGC
CCTGAGTTAG
:GTTATGGCA
ACAGCACCAG
CTAGAATTGC
AACGCTGAAA
GAAAATGTGC
TATGGCCAAG
CGCGCTGATA
GGCAATAGCA
TCTGTTCGTI
CAACCAAAAC
TATGATTCCP
ACGATTGTCC
GA.ACCATTGC
GGTGATGG J
TTAATTGA.A
GATTTAGCA(
GCTTACAA6A(
AAACCAACA(
AGCAATCCA'
CTATTACGALA ACAAGATGTC TGCACACAAA TGACGTTICAT GTCTAAAAAC CTTTAAAGAC .TCCAAGGAT ACCAATCTCC AAGTTGGTTA TGATGCCATG CACTAGGTTA TAAAGACCAA ATGGCAGTGG ACCGGTTFI' CCATTGTAGG TGTGACGACC AGGTGACATT TAAAGACCCG AGTACGCGGA TATNCAAGCT CGCATATCTG GCGTGGTGAT ATATC ACTGT GATTGATGGA AAGTGATCTA TGCTCAAACA AGAGTGAACC AACTAAGAAA CAGAAAACCC GGCAGTTAA-A ATGAAAAACT
AATTGTCGAT
GCACACGAGA GACCAACTI'A ACGCGTTTAG CCAACCTGCT TTAAACAAGG GCCAATTAA6A TTGCGCAAAT TCALAGTAACC CGATTCCACA AAAAGATGAG rI'GGCGCAA.A TGGTGGTTI'C CAAAACCAGG TACTCGCTTG GTAGTCGCGT ATTAGGAATA ATGAGAAGA6A ACAATACCGG ACGATATCCT AGGTGGTGAA P' ACTTGAAAAG TGCAACCAGC AATATAAAGA ACCATTCCCA AGTTAATCGG TGGAGGAGAG CCAGAAACACC AGTAGCAACC T CCGTAACAGA CAAGAAAAAG TATGGCGGCT TT EF122-4 (SEQ ID NO:456) 4* EKWRAXAI ELVNDGTLQI PTSPDGRTPN AAAITCQDVR ENPTLMVDAG DVFQGLPISN NFPIhSSNTY YKDGSGRVFD PEVEAVIKEI KEKYADXQAF SHTAVESGKR YGKVIYAQTG IVDE-ARTNFN AENEKVIVDY FAVTNCGIR ADIXQGPIKV GTILLDDAGQ PKLCANGGFL IKNRQTQKFE PLDEKKQYRM RLYRAATTID LAQYKEPFPG KQNQAGARQS NPSVTEKKKY
NAGFDLDNAY
FSKGADMAKA
PYTIVEKSGI{
VVTGHLGVDE
NYLNNVG IVT IPFTrJDGQRE
GDVIAVLPFG
HVSSSIRIHY
ATNDFLAAGG
ERIVSISEEA
GGF
TIMHTNDVHG
MNEVGYDAMA
KFAIVGVTTP
TTPH IWRGDT
APESEPTKKT
NVRTRETNLG
NS IAQIQVTG
DSTKPGTRLA
[)GYDMLGGER
YKEL IGGGET
RLEAGKGELG
VGNHEFDFGL
ETATKTHPKN
LAETLSQTYP
TKLISAAELL
NLIGDAIMSY
AQVKEMFEMS
SDEGNETGQT
EEGISLDSVL
PKPDPKPDPK
MARLJKTFKDQ
EIALGYKDQL
VEKVTFKDPI
ELJDITVIDGH
ELPFNPAVKA
GQDAFSQPAD
VRSIPQKDEN
IVGSRVLGIE
IEYLKSATSL
PTPETPVATN
EF123-1 (SEQ ID NO:457) TAAAATAAAA AATTGGTACG AAGTGAACGT TCTC'I7CTAT GTGTCGTTAG TAGAGGAAGG ATGAAAGAAA TGAGAAAGAA TGGTCCAATG GTAAACCGTT GGCTCTACGG GrI'GATGTGT 223 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
TTGTTACTTG
GATGGCCACT
CTI'CAAGGAA
TCAATCAAAG
CAGAAGAAAC
ACGTTGAACC
TTGACGTTAA
GCATCAGCGA
TCCGAAGAAA
ACTGTCAAAC
ACA.ACGATTA
ACACCGCCGT
GATGTGCGAG
AAACCTAATA
TACACAATTA
AGTGACATTC
CCAGGAGATT
ATTGTCCCAG
CCTAGTGCGA
ACTGTGACGG
GTGATGAATC
ACCGTTGATA
GAGTACCAAA
AAAA.ATCACC
GTTACCGCCP
CAAGAAT'TC
GCAGTCATTI
TATTCAGTGI
AAAGATTACI
GTGACTGGCC(
GTTGCCGTGJ
GCCAGTCAA(
GGTTGGACG!
ACGTACGAA(
ACCACTGGI)
GATGGTGAA
TCCACATA
TTGGACCAT
CATCAGAA
AGCGGTGTT
AATAATCGT
GCTGGGAGC
AAACCAACG
ACTTGGCG'I
GTTGATGAP
CAAGGTTC'I
AAAAAAGG I
AATGGCGCC
GTGCTAGAJ
ACGCCAGAJ
GATAATGAJ
TATATGGAJ
AACCAAGT(
GTGGTCGT'
TTCTAAATTA TG TAACGTTAGG AG AAGCACAACC AG CTGCACATTG GG AGATTCAAAT TG CTACAGCTAC
AG
AGACTGGTAC
TG
ATGAAGGTTC A-P CTGTCGCCAG CP CGCGCGTAGC
AC
TCGATAATTT TC ATAAAGAAGA
TC
A.ACAAATGAA AC AACCAIGTTC AC GTGAAGATGG T~ ACGGGGAC ET 7 GGGTGATTGA Tj ATACCGAACA
A
FITAC ETGGAC G~ AAACATGGCC A.
TTGATGGAAC
A
AAAATGGCAA
T
CGACGATGA
C
CGAcGTTAACA
CATATGGCAAA
CTTGGGAALATT
CAGACACAAT
kCATITGATGA
C
.AAGTGGTAAT C ;CAGTCAAGAT T1 ~ATAATCGTGT
C
AA.AATATTATJ
r TAGCTGTGAA 9J -CAGTTCCTGG
C
3 CTCAGTTGACC A CAGGCTI'TAAC A CFI'ATACTAC T ATCGAAATAC G ATAGTAAACC T ACAATGCCGT T TAGTCGACGC T TGAAAGTCTA ;C AACCGTTGAC rG 'TGATTTCC LA AAG~rATCGA
CACGTGATGT
GCGAATACCA
'C AATCGGTITI'- M' CCGAGTCATI EA AATCTGTAT kA CAGGACAACA k.T ATCGTACTTT IT cAATTACTGG rG ACATTGATCA GCACACCA CTCATGGCTT AAGTGAAG CAAACCAGCCC TAACACAA GAGGTTGTAG CAGCGCCC AATAATACGC AATTGAAT cAGCAAGCGT AAGATGTG ACGTFITCT ~ATCCGACA GAATCA.ACGG ~CACAAGAA GCATCTACAA ~CGACAAAA GCGATAGAAA ;GACCAACA GATATCAGTG ;AAGATCCG _ATTTAITTAA TGACCATT CATTGAACT CAGGCGAT TAC~rCGAGT -GTGATTTA GTTGATIGCAG DLCGGTTCGT 1'TTACCTI'TA FCTTTAGAT ACTCAT=GA ~TTCCTACA CAAGAAGATT :AA.ATTGAT AAACAAGGCC GTAGATATC A.ATCAAGCGA ACAGGGAAT ACCTTTA-AGI TAAAGAA GTGC-GTCGCC GTGACGAT
AAAGGTGACP
GAGGCGGTT ATTCCAGATC AGTGATAAT AATCC-AAATC ATGTTAGAC AAGCGCAATI 'AACTACAAC TATGGTGAAC GGGGATAAT TTAACG=I :AAAGGAAAT
GAAGTCGTTC
AACGGAGAC GGTTCCIn ~GATI'ATAALA ACCAAAGTT GATGTTGGC ACTGGTCAGI *AAAAACACT GGTGCAGTfl rCAAAATAAT TA TTTGATG TTAACTATG GTACCCAAT ;TTAGGCAAG
GATTTCATG
'GTAAG=~T
ATAGGGGCG
CTTTTTCGAT GTTACCGAG; CGCTGCCATT GATTGGACG GITAAACCT
TTACCTGCI
CACCAALAGA.A ATCACTTGC CTTTrrrGACG GATCCAAT] TGAAGGCAAT
ACAAAGCC.
GGATATCACA ATGGAAGMJ TA-ATGATAGT
CGTACGTA'
AGGTTCGGCT AG ETATGA( GACAGGAAAA GT'rrCTA7 CAAAGATGAT CCAGATCA' AGACGATGTG GTTATTAC' GGTGATTTAC GGTACCAA TTTAGAAGAA GGAAAAGX AAAA.ATTGTC- GTTAAA-AT AGTGACVTCT TCAGCGGC AAATGGTICA GAAGTCGT CAGTGGCGGG CATGCCAC TGGCGGAAGA GG AGCAAGAAAT
GA
TGCATTATAG
TG
GCAAGA ITCA AG TACCATAC
GI
ATGGACAACA
GC
CAATCACGAG
TTI
ACTCCTCTGT
TC
GTAAAACAAC TG ATTATTTTAC AC ATCCTGATGG A;~ TTAACTGGTC GI TI'CAATTACC
T
AAGCdAATGT
C
ATGAGCGA.AT
C~
ATGATTCAGA
TD
T'GCCGCCTGT
A'
ATTTTGATCGA
TIGAAAGATCA
A
CCGTGAAAGT C AACTTAGTCC
A
CCAACAA-AGC G GCGGCGGCGA
T
GGTTAGATCC T kTAGATTACGA
C
A.AACCATTCC
AACCAGAFI'C
3 GAGCAGAACT T1 3 CAATTGACTT ATGGAATTCT
C
ZA'TCAGAAGA 3 ATTATCAAAA G AAAATGCCGT T CGmTOGGTTGT G TAGAAATAAC T ATGCCAAAAC T TAGACGCTAA G ATGAAGCAGG
TTGAITI'AAA
;A CGATTGCCGT .T TAACCAATCA iG ATGGTTCGGT kC CAAGCGAGA.A C'G TGATTGAATT 'A ATACCGCATC IrC AACATGGTGG rG TGTACTGGCA rG ATACACCCTC -G TAACAGAAGA TT ACACACTGGA GG CCCATATI'GA GG GGAGTACAGA TC ATGGGGATGA AG GGACTAAAGG 11AACAGC
CCTTAGCG
CCAATGTG
TGGATGAC
'AGTCTA
:AACGAGCG
CGCCAGCC
:CTCG'ITCG
;AATCGACG
;GTGATGAA
CACCAGCA
MT1CCAGAA 3GCAAT'ITG
LATGGAACC
kCGTCTGAA
~GGCGGGGC
GTGATTCCA
PCGCCCAAT
ACAAATCCA
TATGAGITA
GATGAATAT
TATCGTCTT
GTGCCTTT
'GAACCAACT
'GAAGCCAAT
.AAAGACCAA
=TACA L'TA
'GTGGAAGGA
.TTACATGAT
GAAGGCGAT
rGATGCCACA rTCAACGAT
;ATTACGGAT
CAAAGATACA
rCGTAATGCA
AAGTGATGCC
CAATCCTCCG
AAAcAATCAT
TGCGCAAAAA
TAATTI'AAGT
AACCTArI'TG
TGAA.AAAGTG
AAACCAAAAT
TAAGACGTCT
TTATACAAAC
CGAPATCAGTG
TGTA.ATGATC
ACCAAACCAA
CGGAAcTATT
AGTTACCACC
AGCACCTTAT
CACGGTATCC
CAATGGCGAT
CAAAATTCAG
224 TABLE 1. Nucleotide and Amino Acid Seqeuenccs of E. faecalis Genes.
a a a a
CTGAAGPAAA
GACCAAGCTA
TTTGGTGGGT
ACAGTTTCGG
GAAGGAGCAC
GATGAGAAGG
CCGTTTACTC
TTAGAGGTGG
GGCTATCTTT
ATT~CGTGATG
GACCAAGCAG
GCAGTTCGAG
CCAGGAAAAT
CCAAGTGCTT
GCTAACTTTG
FI'ATTAAGTG
rrGACGACAA
TTTGTAGAAA
ATTGCTGAGA
TACAA.AGGGG
G MIT TGA.AT
GATGGCAAAA
GCACCAAAAT
TTCGTGGTG
AA.AGGACGCG
TTTAA.ATTGT
AAAGATGGTt
GATGCAACGG
TCAAATGATA
GGACGTAAAG
GCCGATGAGC
AAAGTTTCTG
GGGCATTACG
CATCCAT'TTG
ATCAATTATC
GGTGCGGTGI
ATGTCTGAC;
GTGGAGACAC
ATTCCCAAAC
TACCAAGGTC
GAATI'TAAAC
AACCAAGCG'
AAAGCACCGC
GCGAAAGAC)
TCCAAAAAA2
CTTGG'ITA(
TTGGTCGG7
CAGCCATGGA
AAACACAAGT
TGCCACAAGG
ACGAAT1'AGC
AACCAACCAT
GTAAAAAGTT
ATTGGGAAGA
ATAGTTTAAA
TAGACACGAC
TTCATGTCAA
GCAATCCATT
AACACTTAGT
ATCAATTTGT
TCACGATTG.C
ITAACTATCA
GTGCCACA LT
ATAATCAAGG
CCAAAGCGCC
AAAATGCTGG
CTTTCCAAMA
TATATGATCA
TTATCTTTAC
TACCAGATGC
ATTTCAAAGC
CCGTC TITAM
ATCGAATCG;
CATTGGCTA)
ATGGCTATAJ
AACAACCAC'
TCAACGAGC2
AGAATCAGC(
AAATAACAA(
'ITTTAGTGGi
AMMTACCG(
AAGGAACTG,
TTAAGGTCA:
AGCAAGGCA
AAGCGCCAA
ACAACCAAG
CTGCCAAGC
TGACAGATG
U TAG'N'CAAG
ATGGTTACC
k AACCTGAAC k. CAA.AACCAA
:CGAAAACCA
r TGGCAAGTI
TGAGACGACT
CCTACGTGAA
GCAATACATT
TAAAGGCCGA
TATTAAAAAC
AGTCAATGCT
AG ITCCCCTT
ACCAGGGCTT
CCCCAAACCA
AATGCTTAAT
AGCAGGTGCT
TCGGATGCA
GGAAACCAAA
AGCAAGCGAT
AGGCACGGCT
TA.AGTGC'T
GGAAATTGTT
AACAGGCTAT
TAALACCAGCG
CGTGAAAACG
CAATAAACAA
AGACTTGGCG
CGCAGATTAT
TGATCCGGAG
GAAAATTGAT
AAACGGGGAA
GGAGGATTT-D
r CGTCAATAA-P r AGATGAGTTP k. AGGTZ-AAACC
CCAAGGTTCV
GGATAAGAC9] k. AACGAA-AGCI
CCAATTAGGI
TCAATTAAC(
r TGATGAAAC A. AGTCATTGC( C TAGCTATCT G CAAACCAGC( T GGTGAA.AAT' C AGAGACAGG, T GAACCACTT.
A ACTGTCTAA T CGTGAATGC A TCAGCCAAC ,A CACACAAGT 'G GCTCTTCTA ATTT1'AGCAG
GGTACAGTAG
TTGGTGGAGA
GTCATTACTA
GATGTCAATA
CGCTTTAAAT
GCOCCGGATC
TATCAGITCA
TTCATCGTGA
TACCAAGGT
GAATT~TTCAG
AAC'GGAAAAG
GCGCCAGCAG
CGGGGCAAAC
AAATTAATCA
GATGCGAAGG
GCAGACCACT
TTATTrAAATA
GTCGTGGTI'G
AATAGCGCAG
TCATTAGGGA
CCAGGTACC'I
A'ITAT1'TATC ATTTTC CAMq
GCCAATGCGP
AAAATCTTTC
GGTGCTGGT;
GCGCCCA'IT
ATGCCACCGG
CAAAAGCACC
TTC-ATGAAGA
AAGTATrT=
TAGAGCATGC
GAACCAACGC
CAGAAATCGA
CACAAAATAC
CTGCTGAACT
TCCTTGACAC
TCACAGTGAC
GGTACC TTT CAGCAA CACT
AAAAAGATGT
GAGAAACGAT
TAGCC CC AGO
CCACGCCAGT
CTAGTGACAA
ACCAACCATT
TTACAGCAAC
ATTATTACAA
*CTGAATTAGT
*TAGGGGCCIT
ACCCACTTCC
AAAGAGAAGT
LGCTATGAATTI
:CAAATTGG
;GTTATCACA
%GAAGGCTAT
AACTTCAGCC
AGAAAAAATC
CGTAACCACG
GAATGGCCAG
PGCACCGACA
GAGCGGACALA
AATTAAAAAA
CACAGGACAA
GGATTTAGCC
AAACACTGAA
TATAGCAACG
GAATGGACAC
TCAAACAGGC
AAAATATCGC
CCCATTTGAA
CTTTGTGAGT
AGCAGGTGCT
GAGTGGCAAA
AGAAATCAAA
AAAAGTAGAA
CGCCAATTTC
AGGAACGATT
AACTGCTGA
AGATGAACTG
CAACCCATTT ATTGTAGT GAAGAAGAAT
GAGTTTGTA.A
TACGGGTG
CCGATAACAT
GGCGAAATT
CCAACAGGCT
AAAGAGCAGC
AAAGAAAACG
GGGCAAACCG
AAAAACITAG
CTTAATG"-AAA
ACTGTGGTAC
GATCAGCAAA
CAAACTGTCG
CAACCAGGAA
CAAGCTGTCG
GGCACGTITG
ACGAAkACAAG
AATTACT'I'C
AAAAAGAGCA
A ITATCAAGC
CAGTTTTTGC
TCTTGA.ATCG
ACGCTAAAGG
ATCTGTTAGA
CAATTGCTr
AAACAGGTGA
TAGATOGACA
CACCGGGAAC
CGCCAAGCGC
TTAAAGCACC
AGAATGCCTT
CTCGTTCATI'
AATATACCT
CATTCACTAT
TTAACGAGAA
CAGCTAGAGA
FrGTCT'ITAT AGAAkATAA
AGAAGTAATG
AATTTACAAT
TGCAGGAGAA
GCTAAATGAA
CACAACGCTA
AGGCGATCTT
AGCATTGGCA
AACCAATCTG
GTATCGTITT
AAGCTT1'ACG =rTAT'rAAT
AGCAGGTGCT
ACGTTCTGAC
TGTGGAAACA
TGCGGCAACA
ACAACCTGTA
GACAGGTTGG
CGGCCTCATG
a EF123-2 (SEQ ID NO:458) MRXNGPMV NRWLYGLMCL LLVEJNYGTPL MALAEEVNSD GQLTLGEVKQ TSQQEM4TLAL QGKAQPVTQE VVVHYSANVS IKAAHWAAPN NTRKIQVDDQ KKQIQIELNQ QALADTLVLT LNPTATEDVT FSYGQQQRAL TLKTGTDPTE STAITSSPAA SANEGSTEEA STNSSVPRSS EETJASTTKA IESKTTESTT VKPRVAGPTD ISDYFTGDET 225 TABLE 1. Nucleotide and Amino Aci d Seqeuences of E. faecalis Genes.
TIIDNFEDPI
PNKPGSGDLV
GDWVIDI PTQ
VTETWPTGNT
YQTTIDE-AVI
EFTWEINYNY
DYKVVINGDG
SQQNI IKNTG
TGAQLTLGKD
DHYRNTAAID
NRLVDAFLTD
WRVDFPNDSR
KGGEYHKDDP
PDKSVILEEG
QVS ITGNGSE
QAKTQVLREG
GAQPTI IKND
LVDSLKPGLY
QAGNPt.AGAE
SAFTIAASDF
TTNNQGEIVA
KGAFQIVKT'
PKLPDGADYI
KLYRIENGE?
NDKQPLDELI
VS=EITTDKT<
NYQGTAQLTn
ETQAPTSYLI
FKVTDAETG
KDKPELVNA(
YLNPDGTPAT
DAEGNVYTY
EDLPPVVIPI
FKSVKVYELV
PD)GGGDVPFK
GEQTI PKDQA SFAIDFLHDv
AVDYQNSTIG
FMVEITRNAD
WTDEAGNNHH
P1 LTNQTYLA TYVIEF1KTSV
DHVYWHVMIN
KDYTLEVTTO
VVHGDDNGDV
TVDATGVITF
VNKVFLEKMO
QFTEIEAPTG
FSVELDTTGQA
GKPATVIATP
EHLAPGKYRE
I SADQPLAGA% IYPELvKVE]
IFEREVTAEV
FVNYQAEVMC
EIYAKGLNEC
ENETGEALA(
NETPSASFT'
2TVARSLRSDI
STFVNEKQPV
ppyKEDVTIH
TISEDC-TVRF
VPDTEQQIDK
MNLDGTIKEV
NHATTSDN
VITDTMGIDNL
TGAVKIDYKT
WTLAVNQiqDY GETGFKVSF I
SEDSKPFKPL
GSLKVYEGNT
DERVIEGSAS
GAQSVLDDVV
NETGQQKIVV
VVDIDHSGG',
GGLPQGQYIL
EKGKKLVNAR
YLLD'TPKRF
*VREHLVSDM~
NFVNYQGTA.?
*VETKAPTCYI
FELYDHNKQ~l
RGDFKGDPE:
DGSLAMEDL(
RKVNEQGQTI
3 HYVLVETKA:
"AVFKVIDETI
r AKDNQGKPA: .4 QGLVQVNHL S KKTKPNQPT WNFNWS IPED
TFNERITSES
QGHFDRTPNP
GRELSPDEYT
PNGLDAEATV
TFEPDSLHLY
KVDG IVEGDV
LMENAVITOT
GAYA.KTSDAF
PAFDLNAQKS
Y1,PDGSVEKVK YID~rASYTNQ
ITDTPSPNQV
KI4AH IF.APYY
AT-.TKGKIQL
VETKAPEGYT
FKLEHAV~r
IVTQNTSGQI
I GKVTVTDLAE LIKKDVNGH1 LNTTPvPFE] 3 LGITATSGKI
EFQLGAFANFI
3AGsYELDELI L, AGAVFAIYN) P TGYLLDTTLI
SQTVDGQTNLI
T VVLKAPFINT Q PGKYTFVET T KQAARETGW VREQ1MflAGDY F DIIIGDFSLDT H SAITWTVDIN
Q
VDKNGNVTIK
G
TATYCMLDK
R
sVTFDDKGNEV AVNNRVIDVGT G YEPVPGLTMV P HITYTTFFDV
TI
GVYNAVTKEI PTQPLTDITM
I
GSSRDVTG(V
LDPESLVIYG
MEYRSLVTSS
KXTAMDETTI
*VSDELAKGRV
FTHWEEVPLA
*RDVHVKMLNY
GKYQFVETKA
LSGATFKVLO
-AEKNACKPAV
G KIIFRDLAP C GRAVFKY\IDA
ATDCYIVNKQ
k. DEQNQPQGSP A PFDVTAQLGK A SDXQGKVIAK C AAKLVKID K APDGYQLSKQ L GLPKTNTQVN
EFQLPGNLK
LNDSDGRGP
AMKT)(QTNPT
OTNKAYRLE
NIDYDEANQ
VGAELVEGK
*QHSEDDGTA
NSLVVKDTT
'ELDAINNPAL,
7tTIAVNLSN
~EPSEKNQNT
IQHGGESVK
CNVTEDGTIT
%AG STDTVSN
.AGAHFQIWD
ITI DEETSAE
PEDRTNANGQL
QGSAELIKKD
PAGYLLNTEP
AKGETIQTGL
VVASDNFrVSY
GTYYYKEIKA
NANPLPGTIF
P IYFVTKKINS
ITFLNRAGEK
EQPIALGDLI
NLAPG-YRF
QQKNALAGAE
AVAFTIAATA
YFFVF IGLML VGLASWLFYYK
KSKK
EF123-3 (SEQ ID NO:459) *6 GGAAGA GGTTALACAC GATGGCCACT TAACGTTAGG
AGAAGTGAAG
CTTCAAGGAA AAGCACAACC
AGTAACACAA
TCAATCAAAG CTGCACATTG
GGCAGCGCCC
CAGAAGAAAC AGATTCAAAT
TGAATTGAAT
ACGTTGAACC CTACAGCTAC
-ACAAGATCTG
IrTGACGTI'AA AGACTGGTAC
TG.ATCCGACA
GCATCAGCGA ATGAAGGTTC
AACAGAAGAA
TCCCAAGAAA CTGTCGCCAG
CACCACAAAA
ACTGTCAAAC CGCCCGTAGC
AGGACCAACA
ACAACGATTA TCGATAAT'IT
TGAAGATCCG
ACACCGCCGT ATAAAGAAGA
TCGTGACCATT
GATGTGCGAG AACAAATCAA
AGCAGGCGAT
AAACCTAATA AACCAGGTTC
AGGTGATTTA
TACACAATA GTGAAGATCG
TACGGTTCGT
AGTGACATTC ACGGGGACTT
TTCTTTACAT
CCAGGAGATT GGGTGAI'GA
TATTCCTACA
ATTCTCCCAG ATACC';AACA
ACAAATTCMI
CCTACTCCGA TrACTTGGAC
CGTAGATATC
ACTGTGACGG AAACATGGCC
AACAGGGAAM
CAAACCAGCC
GAGr'ITGTAG
AATAATACGC
CAGCAAGCGT
ACGTT'IMTT
GAATCAACG
GCATCTACAA
GCGATAGAAA
GATATCAGTG
ATTTATTTAA
CATTGGAACT
TAcTTCGAGT G'1'GATGCAG ITI'Acc IA AcTcATA
*CAAGAAGATT
*AAACAAGGCC
*AATCAAGCG.A
*ACCTTTAAGIl
AGCAAGAAAT
TCCATTATAC
GcAAGATTCA
TAGCAGATAC
ATGCACAACA
cAATCACGAG
ACTCCTCTGT
GTAAAACAAC
ATTATTMAC
ATCCTGATGG
TrAACTGGTC TrCAATTACC
AAGGCAATGT
ATGAGCGAAT
ATGATTCAGA
TGCCGCCTGT
A~ITTGATCG
TGAAAGATCA
'CCGTGAA.AGT
;ACCTTAGCG
['CC CAATGTG krCTGGATGAC
GTTAGTCTTA
GCAACGAGCG
TTCGCCAGCC
TCCTCGTTCG
TGAATcGACG
AGGTGATGAA
AACACcAGCA
GATTCCAGAA
TGGCAATTTG
CTATGGAACC
CACGTCTGAA
TCCGCGGGGC
ArGCTTCCA
AACGCCCAAT
AACAAATCCA
CTATGPAGTTA
226 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
GTGATGAATC
ACCGTTGATA
GAGTACCAAA
AAAAATCACG
GTTACCGCCA
CAAGAATTCA
GCAGTCAITA
TATTCAGTGA
AAAGATTACA
GTGACTGGCG
GTTGCCGTGA
GCCAOTCAAC
GGTTGGACGT
ACGTACCAAC
ACCACTGGTG
GATGGTGAAA
TTCCACATAA
TTGGACCATT
TTrGATGGAAC
AAAATGGCAA
CGACGA'ITGA
CGACGTTAAC
CATATGGCAA
C'rrGGGAAAT
CAGACACAAT
CATTTGATGA
AAGTGGTAAT
CAGTCAAGAT
ATAATCGTGT
AAAATATTAT
TAGCTGTGAA
CAGTTCCTGG
CTCAGTTGAC
CAGGCTTTAA
CTTATACTAC
ATCGAAATAC
AATTAAAGAA
TGTGACGATT
CGAGGCGGTT
AAGTGATAAT
AATGTTAGAC
TAACTACAAC
GGGGGATPAT
CAAAGGAAAT
CAACGGAGAC
TGATTATAAA
GGATGTTGGC
TAAAAACACT
TCAAAATAAT
CTTAACTATG
GTTAGGCAAG
GGTLAGTT
CTT=TCGAT
CGCTGCCATT
GTGGGTCGCG
AAAGGTGACA
ATTCCAGATG
AATCCAAATG
AAGCGCAATA
TATGGTGAAC
TTAACGTTrG GAAGTCG1TG
GGTTCCTIG
ACCAAAGTIG
ACTGGTCAGC
GC6TGcAG TG TA ITTGATCC
GTACCCAATT
GATTTCATGG
ATAGGGGCGT
GTTACCGCT
GATTGG
AACTTAGTCC
CCAACAAAGC
GCGGCGGCGA
GGTTAGATGC
TAGATTACGA
AAACCATTCC
AACCAGATTC
GAGCAGAACT
CAATTGACTr
ATGG-AATTGT
ATTCAGA-AGA
ATTATCAAAA
A.AAATGCCGT
cGT~&FGT
TAGAAATAAC
ATGCCAAAAC
TAGACGCTAA
AGATGAATAT
GTATCGTCT
T'GTGCCTTTT
TGAAGCAACT
CGAAGCCAAT
AAAAGACCAA
TrI'ACATTTA TrTGGAAGGA
TTTACATGAT
CGAAGGCGAT
TGATGGCACA
TTCAACGATT
GATTACGGAT
cAA.AGATACA
TCGTAATGCA
AAGTGATGCC
CAATCCTGCG
EF123-4 (SEQ ID NO:46O) .s a.
4~ a
EEVNSD
GQLTLGEVKQ TSQQEMTLAL KKQ IQ IELNQ QALADTLVLT SANEGSTEEA STNSSVPRSS TI IDNFEDP I YLNPDGTPAT PNKPGSGDLV DAEGNVYGTY GDWVIDIPTQ EDLPPVVIPI VTET WPTGNT FKSVKVYELV YQTTIDEAVI POGGGDVPFK EFTWE INY'NY GEQTIPKDQA DYKVVINGDG SFAIDFLHDV SQQNI IKNTG AVDYQNSTIG TGAQLTLGKD FMVEITRNAD DHYRNTAAID W
QGKAQPVTQE
LNPTATEDVT
EETVASTTKA
PPYKEDVTIH
TISEDGTVRF
VPDTEQQIDK
MLDGTIKEV
NHATLTSDNN
VITDTMGDNL
Tr.AVKI DYKT
WTLAVNQNNY
GETGFKVSF I
VVVHYSANVS
FSYGQQQRAL
I ESKTTESTT WNFNWS IPED
TFNERITSES
QGHFORTPNPi
GRELSPDEYT
PNGLDAEATV
TFEPOSLHLY
KVDGIVEGDV
LMENAVITDT
GAYAKTSDAF
IKAAHWAAPN
TLKTGTOPTE
VKPRVAGPTD
VREQMKAGDY
DIHGDFSLDT
SAITWTVDIN
VDKNGNVTIK
TATYGKMLDK
SVTFDDKGNE
AVNNRVDVGT
YEPVPCLTMV
H ITY'ITFFOV
NTRKIQVDDQ
STAITSSPAA
ISDYFTGDET
IEFQLPGNLK
HLNDSDGRGP
QAW(DQTNPT
GDTNKAYRLE
RNIDYDEANQ
VVGAELVEGK
GQHSEDDGTA
PNSLVVKDTT
TtLOANNPAL EF124-1 (SEQ ID NO:461) a. a a.
TAAAATAAAA
ATGAAAGAAA
TTGTTACTTG
GATGGCCAGT
CTTCAAGGAA
TCAATCAAAG
CAGAAGAAAC
ACCTTGAACC
TGACGTTAA
GCATCAGCGA
TCCGAAGAAA
ACTGTCAAAC
ACAACGATTA
ACACCGCCGT
AATTGGTACG
TGAGAAAGAA
TTCTAAkATTA
TAACGTTAGG
AAGCACAACC
CTGCACATTG
AGATTCAAAT
CTACAGCTAC
AGACTOCTAC
ATGAAGGTTC
CTGTCGCCAG
CGCGCGTAGC
TCGATAATTT
ATAAAGAAGA
AAGTGAACGT
TGGTCCAATG
TGGCACACCA
AGAAGTGAAG
AGTAACACA
GGCAGCGCCC
TGAATTGAAT
AGAAGATGTG
TGATCCGACA
AACAGAAGAA
CACGACAAAA
AGGACCAACA
TGAAGATCCG
TGTGACCAT
AGCAGGCGAT
TCTCTTCTAT
GTAAACCGTT
CTCATGGCTT
CAAACCAGCC
GAGGTTGTAG
AATAATACGC
CAGCAAGCGT
ACG=CTT
GAATCAACGG
GCATCTACAA
GCGATAGAAA
GATATCAGTG
ATTTATITrAA CA=rGAACT
TACTTCGAGT
GTGTCGTTAG
GGCTCTACGG
TGGCGGAAGA
AGCAAGAAAT
TGCATTATAG
GCAAGA7TCA
TAGCAGATAC
ATGGACAACA
CAATCACGAG
AcTCCTCTGT
GTAAA.ACAAC
ATTATTITAC
ATccTGATGG TrAACTGGTC
TTCAATTACC
TAGAGGAAGG
GTrGATGTGT
GGTTAACAGC
GACCTI'AGCG
TGCCAATGTG
AGTGGATGAC
GTTAGTCTTA
GCAACGAGCG
TrCGCCAGcC
TCCTCGTTCG
TGAATCGACG
AGGTGATGAA
AACACCAGCA
GATTCCAGAA
TrGGCAATTTG GATGTGCGAG AACAAATGAA 227 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
AAACCTAATA
TACACAATTA
AGTGACATTC
CCAGGAGATT
ATTGTCC'CAG
CCTAGTGCGA
ACTGTGACGG
GTGATGAATC
ACCGTTGATA
GAGTACCAAA
AAAAATCACG
GTTACCGCCA
CAAGAATTCA
GCAGTCATTA
TATTCAGTGA
AAAGATTACA
GTGACTGGCG
GTTGCCGTGA
GCCAGTCAAC
GGTTGGACGT
ACGTACGAAC
ACCACTGGTG
GATGGTGAAA
TTCCACATAA
TTGGACCATT
CATTCAGAAC
AGCGGTGTTT
AATAATCGTI
GCTGGGAGC'I
AAACCAACGC
ACTTGGCGTC
GTTGATGAA.
CAAGGTTCT']
AAAA.AAGGTC
AATGGCGCCC
GTGCTAGATC
ACGCCAGATI
GATAATGAAJ
TATATGGAAW
AACCAAGTG'
GTGGTCG?1'(
CTGAAGAAAJL
GACCAAGCTM
TTTGGTGGG'
ACAGTTTCGI
GAAGGAGCA,
GATGAGAAGI
CCGTTTACT
TTAGAGGTC'
GGCTATCT
ATTCGTGAT
GACCAAGCA
GCAGTTCGA
CCAGGAAA.A
AACCACCTTC
GTGAAGATGG
ACGGGGACTT
GGGTGA'I GA
ATACCGAACA
TTACTTGGAC
AAACATGGCC
TTGATGGAAC
AAAATGGCAA
CGACGATTGA
CGACGjTTAAC
CATATGGCAA
CTTGGGAAAT
CAGACACAAT
CATTI'GATGA
AAGTGGTAAT
CAGTCAAGAT
ATAATCGTGT
AAAATA ITAT
TACCTGTGA.A
CAGTTCCTGG
CTCAGTTGAC
CAGGCTTTAA
CTTATACTAC
ATCGAAATAC
ATAGTAAACC
ACAATGCCGT
TAGTCGACGC
TGAAACTCTA
AACCG'ITGAC
TTGATTTTCC
LAAGTTATCGP
CACGTAI
GCGAATACC;
AATCCGTTT1
CCGAGTCAT']
k. AATCTG.TTAJ k. CAGGACAAC) C' ATCGTAGLT' r' CAATTACTG( SACATTIGATCj k. CACCCATGGQ k. AAACACAAG' r' TGCCACAAG( 72 ACGAATTAGI C AACCAACCA' 3 GTAAAAAGT C ATI'CGGAAG.
G ATAGTTTAA T TAGACACGA G TTCATGTCA G GCAATCCAT G AACACTTAG ,T ATCAATTC
AGGTGATI'TA
TACGGTTCGT
TTCTTTAGAT
TATTCCTACA
ACAAAI'TAT
GGTAGATATC
AACAGGGAAT
AATTAAAGAA
TGTGACGATT
CGAGGCGG'T
AAGTGATAAT
AATGTTAGAC
TAACTACAAC
GGGGGATAAT
CAAAGGAAAT
CAACGGAGAC
TGATTATAA-A
GGATGMGGC
TAAAAACAC'I
TCAAAATAAI
CTTAACTATC
GTTAGGCAAG
GGTAAGTTTrI
C'FITITCGAI
CGCTGCCATI
GTTTAAACC9
*CACCAAAGPJ
CTT'TTGACC
TGAAGGCAA'
GGATATCACJ
TAATGATAG
AGGTTCGGC'
GACAGGAAAi
CAAAGATGA'
7AGACGATGTC GGTGA=TAl 7TTTAGAAGA kAAAAATTGT r AGTGACT'C'
AAATGGTTC
k CAGTGGCGG k. TGAGACGAC r CCTACGTGA
GCAATACAT
C TAAAGGCCG r TATTAAAAA r AGTCAATGC A AGI'CCCC]l A ACCAGGCC'I C CCCCAAACC A AATGCTI'AJ T AGCAGCTGC T TTCGGATGC T GCAAACCAJ
GTTGATGCAG
TTTACCITA
ACTCAT7IGA
CAAGAACA'T
AAACAAGCC
AATCAAGCGA
ACCT'FrAAGT
GTGGGTCGCG
AAAGGTGACA
ATTCCAGATG
AATCCAAATG
AA3CGCAATA
TATGGTGAAC
TTA.ACGTTTG
GAAGTCG'TTG
GGTTCCTTTG
ACCAAAGTTG
ACTGGTCAGC
GGTGCAGTTG
TATTTGATG
GTACCCAATrf
GATTTCATGG
ATAGGGCCGrl
GTTACCGAGE
IGATTGGACGC
I TACCTGCTq kATCACTTGGP
'GATCCAATT.
1' ACAAAGCCAC
ATCGAAGAAC
r CGTACGTATC r AGTTATGACI P, GITTCTA'IT r CCAGATCATC
"GTTATTACTX
CGGTACCAAC(
A GGAAAAGAT' C GTTAAAATG T TCAGCGGCG A GAAGTCCTT, G CATGCCACA T ATTTTAGCA A GGTACAGTA T TTGGTGGAG A GTCA 1TACT ,C GATGTCAAT :T CGCmTAAA .r GCGCCGGAI .T TATCAGTI'C ,A TCATCGTC CT TACCAAGGrI "T GATTrrC; "A AACGGAAAM k.A GCGCCAGCI AAGGCAATrGT
ATGAGCGAAT
ATGATTCAGA
TGCCGCCTCGT
ATTTTrCATCG
TGAAAGATCA
CCGTGAAAGT
AACTTAGTCC
CCAACAAAGC
GCGGCGGCGA
GGTTAGATGCC
TAGATI'ACGA
AAACCATTCC
AACCAGAITC
GAGCAGAACT
CAATTGACTT
ATrGGAATTGT ATrCAGAAGA
ATTATCAAAA
AAAATGCCCT
CGTTGGTTCT
TAGA.AATAAC
ATGCCAAAAC
'TAGACGCTAA
ATGAAGCAGG
TTGA~rTAAA
CGA=ECGGT
TAACCA.ATCA
ATGGTTCGGrI
CAAGCGAGAA
TGATTGAATri kATACCGCATC
AACATCGTGC
3TGTACTGGC; 3 ATACACCCTC
STAACAGAAGI
r ACACACTGGI 2CCCATATTGI 2GGAGTACAG) C ATGGGGATG] G GGACTAAAG( G GCGCCCATrL G ATGCCACCG( A CAAAAGCAC, A TTGATGA-AG.
'A AAGTATTT .T TAGAGCATG 'C GAACCAACG A CAGAAATCG ;A CACAAAATA 'T CTGCTGAAC iG TCCTI'GACA 4G TCACAGTGA kG GGTACCIT
CTATGGAACC
CACGTCTGAA
TGGGCGGGGC
AGTGATTCCA
A.ACGCCCAAT
A.ACAAATCCA
CTATGAGTTA
AGATGAATAT
GTATCCGrCTT
TGTGCCTTTT
TGAACCAACT
CGAAGCCAAT
AAAAGACCAA
TTTACATTTA
TGTGGAAGGA
TTTACATGAT
CGAGCCAT
TGATGGCACA
TTCA.ACGATT
GATTACGGAT
CAAAGATACA
TCGTAATGCA
AAGTGATGCC
CAATCCTGCG
AAACAATCAT
.TGCGCAAAAA
*TAATTTAACT
AACCTATTTG
TGAAAAAGTG
*AAACCAAAAT
*TAAGACGTCT
TTATACAAAC
CGAATCAGTG
TGTAATGATC
ACCAAACCAA
CGGAACTATT
k AGTTACCACC k. AGCACCTTAT k CACGGTATCC k. CAATGGCGAT
CAAAATTCAG
r ccAAATTTGG 2 GT1ATCACA CAGAAG~cTAT h. AACTTCAGCC r AGAAAAAATG C CGTAACCACG C GAATGGCCAG A AGCACCGACA C GAGCGGACA T AATTAAAAAA .C CACAGGACAA .C GGA'ITTAGCC T AAACACTGAA
S
228 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
CCAAGTGCCFI
GCTAACTTTG
TTATTAAGTG
TTCACGACA-A
ITTGTAGAAA
ATTGCTGAGA
TACAAAGGGG
GTTTTTGAAT
GATGGCAAAA
GCACCAAAAT
ATTCGTGGTG
AAAGGACGCG
TTTAAATI'CT
kAAGATGGTT
GATGCAACGG
TCAA.ATGATA
GGACGTAAAG
GCCGATGAGC
AAAGI'TCTG,
GGGCATTACG
CATCCATTTG
ATCAATTATC
GGTGCGGTGI
ATGTCTGACP
GTGGAGACAC
ATTGCCAAAG-
TACCAAGGTC
AACCAAGG
AAAGCACCGC
GCGAAAGACI
TCCAAAAAA2 CTGGT7TA( TTGGTCGGI7 TCACGATT GC
TTAACTATCA
GTGCGACATT
ATAATCAAGG
CCAA.AGCGCC
AAAATGCTGG
CT'TTCCAALAT
TATATGATCA
TTATCTTTAG
TACCAGATGG
ATT'rCAAAG
CCGTCTTTAA
ATCGAATCGA
CATTGGCTAT
ATGGCTATAT
AACAACCACT
TCAACGAGCA
AGAATCAGCC
AAATAACAAC
TTTTAGTGGA
ATGTAACCGC
AAGGAACTGC
TTAAGGTCAT
AGCAAGGCAA
AAGCGCCAAC
ACAACCAAGC
CTGCCA.AGCI
TGACAGATGC
TAGTTCAAC9
ATGGTI'ACCI
SAACCTGAACJ
k. CAAAACCA-ki 'CGA.AAACCA2 r' TGGCAAG=(
AGCAACAT
AGGCACGGCT
TAAAGTZCTT
CCAAATT1'CTT
AACAGGCTAT
TAAACCAGCG
CCTGAAAACG
CAATAAACAA
AGACTTGGCG-
CCCAGATTAT
TGATCCGGAG
GAAAATTGAT
AAACGGGGAA
GGAGGATTTA
CGTCAATAAA
AGATGAGTTA
AGGTCAAACC
CCAAGGT1'CA
GGATAAGACT
AACGAAAGC.A
CCAATTAGG-A
TCAATTAACC
*TGATGAAAC.P
*AGTCATTGCCG
TAGCTATCTJ
CAAACCAGCC
GGTGAAAAT'J
AGAGACAGGC
GAP.CCACTTI
ACTGTCTA-AC
7 CGTGAATGCC k TCAGCCAAC k. CACACAAGT(
GCTCTTCTA!
CGGGGCAAAC
AAATTAATCA
GATGCGAAGG
GCAGAGCACT
TTATTAAATA
GTCGTGGTTG
AATAGCGCAG
TCATTIAGGGA
CCAGGTACCT
ATTATTTATC
A'ITT'CCAAT
G6CCAATGCGA
AAAATCTTTG
GGTGCTGGTA
CAACCCATTT
GAGTTTGTAA
TTAGCGGGTG
CCGATAACA7 GGCGAAA TT7
CCAACAGGC'I
AAAGAGCAGC
AAAGAAAACC
GGGCAAACCC
AAAAACTTAC
CTTAATGAAN
ACTGTGGTA(
7GATCAGCAAJ
CAAACTGTC(
k. CAACCAGGAj
CAAGCTGTC(
3 GGCACGTT( k. ACGAAACAA(
AATTACTTC'
r AAA.AAGAGC.
CAGCAACAGT
AAAA-AGATGT
GAGAAACGAT
TAGCCCCAGG
CCACGCCAGT
CTAGTGACAA
ACCAACCATT
TTACAGCAAC
ATTATTACAA
CTGAATTAGT
TAGGGGCCTT
ACCCACTTCC
AAAGAGAAGT
GCTAT AATT
ATTTTGTAGT
A'IWATCAAGC
CAG'TTrTGC
TCT-A-ATCG
ACGCTAAAGG
ATCTSTTAGA
CAA'FI'CTT
IAAACAGGTGP
TAGATGGAC;
CACCCGGGAAC
CGCCAAGCGC
TTAAAGCAC(
k. AGAATGCCT' 3CTCGTTCAT'.
k. AATATACCT'
SCATTCACTAK
T TAACGAGA, 3 CAGCTAGAG, r TTGTCTTA' A. AGAALATAA
TATAGCAACG
GAATGGACAC
TCAAACAGGC
AAAATATCGC
CCCATTTGAA
CTTGTGAGT
AGCAGGTGCT
GAGTGGCAAA
AGAAATCAAA
AAA.AGTAGAA
CCC AAT=~C
AGGAACGATT
AACTGCTGAA
AGATCAACTG
GAAGAAGAAT
AGAAGTAATG
AATTTACAAT
TGCAGCAGAA
CTAAATGAA
*CACAACGCTA
*AGCGATCTT
AGCATTGGCA
LAACCAATCTG
GTATCGTT
AAGCTTACG
TTTTATTA-AT
p' AGCAGGTGCT
ACCTICTGAC
r' TGTGGAAACA r TCGCAACA N. ACAACCTGTA P. GACAGGTI'GG r CGGCCTCATG a.
EF124-2 (SEQ ID NO:462) MRKNGPMV NRWLYGLMCL LLVLNYGTPL MALAEEVNSD
GQLTLGEVKQ
KKQIQIELNQ
SANEGSTEE-A
TI IDNFEDPI
PNKPGSGDLV
GDWVIDIPTQ
VTETWPTGNT
YQTTIDEAVI
EFTWEINYNY
DYKlVVINGDG
SQQNIIKNTG
TCAQLTLGKD
DHYRNTAAID
NRLVDAFLTD
WRVDFPNDSR
KGGEYHKDDP
TSQQEMTLAL
QALADTLVLT
STNSSVPRSS
YLNPDGTPAT
DAEGNVYGTY
EDLPPVVIP I
FKSVKVYELV
PDGGGDVPFK
GEQTIPKDQA
SFAI DFLHDV
AVDYQNSTIG
FM'JEITRNAD
WTDE-ACNNHH
P ILTNQTYLA
TYVIEFKTSV
DHVYWHVMIN
QCKAQPVTQE,
LNPTATEDVT
EETVASTTKA
PPYKEDVTIH
TISEDGTVRF
VPDTEQQI OK
MNLDGTIKEV
NHATIJTSDNN
VITDTMGDNL
TGAVKIDYKT
WTLAVNQNtNY GETGFKVSF I
SEDSXPFKPL
GSLKVYEGNT
DEKVIEGSAS
1GAQSVLDDVV
VVVHYSANVS
FSYGQQQRAL
IESKTTESTT
WNFNWSI PED
TFNERITSES
QGHFDRTPNP
GRELSPDEYT
PNGLDAP-ATV
TFEPDSLHLY
KVDGIVEGDV
LMENAVITDT
GAYAKTSDAF
PAFDLNAQKS
KPDGSVEKVK
YDNTIASYTNQ
ITDTPSPNQV
IKAAHWAAPN
TLKTGTDPTE
VKPRVAGPTD)
VREQMKAGDY
TJIHGDFSLDT
SAITWIVD IN
VDKNGNV'IK
TATYGKMLDK
SVTFDDKGNE
AVNNRVD)VGT
YEPVPGLTMV
HITYTTFFDV
GVYN4AVTKE I
PTQPLTDITM
GSSRDVTGYKV
LDPESLVIYG
NTRKIQVDDQ
STAITSSPAA
ISDYFTGDET
FEFQLPGNLK
HLNDSDGRGP
QAmKD)QTNPT
GDTNKAYRLE
RNI DYDEANQ
VVGAELVEGK
GQHSEDDGTA
PNSLVVKDTT
TELDANNPAL
TWTIAVNLSN
EEPSEKNQNT
S IQHGGESVK
TNVTEDGTIT
229 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
PDKSVILEEG
QVS ITGNGSE
QAKTQVLREG
GAQPTI IKND
EVDSLKPGLY
QAGNPLAGAE
SAFTIAASOR
TTNNQGEIVA
KGAFQIVKTN
PKLPDGADYI
KLYRIENGEK
NDKQPLDELE
VSE ITTDKTG
NYQGTAQLTK
ETQAPTSYLL
FKVTDAETGQ
KDKPELVNAG
KDYTLE:VT'D
VVHGDDNGDV
TVDATGVITF
VNKVFLEKMD
QFTEIEAPTG-
FSVLDTTGQA
GKPATVIATA
EHLAPGKYRF
SADQPLAGAV
IYPELVKVEI
I FEREVTAEK
FVNYQALVMG
EIYAKGLNEG
EN=TEALAG
NETPSASFTI
TVARSLRSDN
TFVNEKQPVS
NETGQQKIVV WKI4MIAPYY VVDIDHSGGH AT4-TKGKIQL GGLPQGQYIL VETKAPEGYT EKG~aLVNAR
FKLEHAVTTP
YLLIDTI'PKRF IVTQNTSr-Q I VRE-{LVSDAN GKVTVTDLAP NFVNYQGTAK LIKKDVNGHL VETKAPTGYL LNTTPVPFE I FELYDHNKQS LG ITATSGKD RGDFKGDPEI FQLGAFANFI< DGSLAMEDLG
AGSYELDELD
RKXVNEQGQTL AG3AVPAIYNA HYWLVETKAP
TGYLLD-.TLH
AVFKV7DET
QTVIDGQT.NLM
AYKDNQGKPAT VVLKAPFINY QGLVQVNH-LQ PGKYTFVETK KI(TKPNQPTT KQAARETGWL MEYRSLNTSS AAGSTDTVSN
KXTAMDETTI
VSDELA1(GRV FTH WEEVP LA RDvWvKMILNY
GKYQFVETKA
LSGATFKVLD
AEKNAGKPAV
GKI IFRDLAP
GRAVFKKIDA
ATDGY IVNKQ
IJEQNQPQGSP
P FDVTAQLGK SDKQdKVIAI(
QGAAICLVKID
APDGYQLSKQ
ITIDEETSAE
PDRTNANGQL
QGSAEL IKKIJ
PAGYLLNTEP
A.KGE.TIQTGL
VVASDNFVSY
GTYYYKE IKA NAN PL PGT IF P IYFVVKXNS
ITFLNRAG:EK
EQP IALGDLI
NLAPGTYRF-V
QQKNALAGAE
AVAFTIAATA
GLPKTNTQVN YFFVFIGLML VGLASWLFYK KSK( EF124-3 (SEQ ID NO:463) TGCCTTCCACATAACTTATACTACCTTT=GACG GATCCAA=I TAACCAATCA AACCTATTTG GCTGGGAGCT TGAAAGTCTA TGAAGGCAAT ACAAAGCCAG ATrGGTCGGT TGAAAAAGTG AAACCAACGC AACCGTTGAC GGATATCACA ATGGAAGAAC CAAGCGAGAA
AAACCAAAAT
ACTTGGCGTG TTGATTTTCC TAATGATAGT CGTACGTATG TGATI'GAATT TAAGACGTCT GTTGATGAAA AAGTTATCGA AGGTTCGG-CT AGTTATGACA ATACCGCATC TTATACAAAC CAAGGTTCTT CACGTGATGT GACAGGAAAA GTTTCTATTC AACATGGTGG CGAATCAGTG
AAAAAAGGTG
AATGGCGCCC
GTGCTAGATC
ACGCCAGATA
GATAATGAAA
TATATGGAAT
AACCAAGTGT
CTrGGTCGTTG
CTGAAGAAAA
GACCAAGCTA
TTTGGTGGGT
ACAGTTTCGG
GAAGGAGCAC
GATGAGAAGG
CCGTTTACTC
TTAGAGGTG
GGCTATCT
ATTCGTGATG
GACCAAGCAG
GCAGTTCGAG
CCAGGAAAAT
CCAAGTGCTT
GCTAACTTTG
TTATTAAGTG
TTGACGACAA
GCGAATACCA CA A~ATCGGTTI AG.
CCGAGTCA IT GG 1ATCTG ITAT TT CAGGACAACA A ATCGTAGTr AC CAA ITACTGG A ACATTGATCA CA CAGCCATCCA TO AAACACAAGT
CC
TGCCACAAGG GC ACCAATTAGC T AACCAACCAT T CTAAAAAGTT Ac ATTCGCAAGA AC ATAGTTTAAA
AC
TAGACACGAC C( TTCATCTCA.A A2 GCAATCCA IT AC AACACTTAGT
F
ATCAATITCGT G( TCACGATTCC AX TTAACTATCA A( CTGCCACATT T ATAATCAACC G k.AGATGAT
ACGATGTG
I'GATTTAC
TAGAAGAA
AAA'ITGTC
TGACTTCT
ATGGTTCA
.GTGGCGG
,ACACGACT
TACGTGAA
.AATACATT
,AAGCCCGA
LrrAAAAAC
,TCAATGCT
TTrCCCC'Fr
.CAGCGC'T
CCAAACGA
~TGCTTAAT
;CAGGTGCT
rCGGATGCA
.,AAACCAAA
~CAAGCCAT
,GCACGGCT
k.AAGTrGCTT CCAGATCATG I GTTA NTACTG CGTACCAACG 9 GGAAAAGATT I
GTTAAAATGGC
TCAGCCCCGGC
GAAGTCGTTC
CATGCCACAG
ATTT'TAGCAG
GCTACAGTAG
TTGGTGGAA
GTCATTACTA
GATCTCAATA
CGCTrAA.AT
GCGCCGGATC
TATCAGTTCA
TTCAT'CGTGA
TACCAAGGTT
GAArF1TTCAG AAcrGGAAAAG
CCGCCAGCAG
CCCGGCAAAC
AAATTAATCA
GATGcGAAGG ~TACACCCTC
EAACACAAGAC
~CACACTGGA
:CCATATTGA
GCAGTACAGA
kTGCCATCA
GGACTAAAGG
GCGCCCA'TTT
WICCCACCGC
CAAAAGCACC
11'GATGAAGA
AAGTATTTT
TAGAGCATGC
GAACCAACGC
CACAAATCOA
CACAAAATAC
CTGCTGAACT
TCC1TCACAC
TCACACTGAC
GGTACCTII'
CAGCAACAGT
AAAAACATC-T
GAGAAACGAT
.CCAAACCAA
:GGAACTATT
kGTTACCACC
%.GCACCTTAT
CACGCTATCC
CAATGCAT
cAAAATTCAG :cAAAT'FrGG
GG'ITATCACA
AGAAGGCTAT
AACTTCAGCC
ACAA.AAAATG
CGTAACCACC
GAATGGCCAG
AGGACCOACA
GAGCGGACAA
AATTAAAAAA
CACAGCACAA
GCAT'ITAGCC
AAACACTGAA
TATACCAACG
GAATGCACAC
TCAAACAGGC
4**b EF124-4 (SEQ ID NO:464) 230 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
AF HITYTTFFDV TELDANNPAL
DHYRNTAAID
NRE.VDAFLTD
WRVDFPNDSR
KGGEYHKDOP
PDKSVILEEG
QVS ITGNGSE
QAXTQVLREG
GAQPTI IKND
EVDSLKPGLY
QAGNPLAGAE
SAFTIAASDR
TTNNQG
WTDEAGNNI4 P ILTNQTYLA
TYVIEFKTSV
DHVYWHVMIN
KDYTLEVTTD
VVHGDDNGDV
TVDAT GVITF
VNKVFLEKMD
QFTEIEAPTG
FSVLDTTGQA
GKPATVIATA
SEDSKPFKPL
GSLKVYECNT
DEKVI EGSAS
GAQSVLDDVV
NETGQQKIVV
VVDIDHSGGH
GGLPQGQY IL
EKGKKLVNAR
YLLDTTPKRF
VREHLVSDAN
N~FVNYQCTAK
PAFDLNAQKS
KPDGSVEKVK
YDNTASYTNQ
ITDTPSPNQV
KMAHI EAPYY
ATGTKGKIQL
VETKAPEGYT
FKLEHAV'ITP
IVTQNTSGQ I
-CKV-TVTDLAP
LIKKDVNGHL
GVYNAVTKE I
PTQPLTDITM
GSSRDVTGKV
LDPESLVIYG
MEYRSLVTSS
KKTAMDEITI
VSDELAKGRV
FTHWEEVPLA
RDVHVKMLNY
GKYQFVETKA
ISGATFKVLD
TWTIAVNLSN
EEPSEKNQNT
SI QHGGESVK
TNVTEDGTIT
AAGSTDTVSN
LAGAHFQ IWD
ITIDEETSAE
PDRTNANGQL
QGSAELIKKD
PAGYLLNTEP
AKGETIQTGL
EF125-1 (SEQ ID NO:465)
TAAAATAAAA
ATGAAAGAAA
T'rGTTAC PTG
GATGGCCAGT
CTTCAAGGAA
TCAATCAAAG
CAGAAGAAAC
ACGTTGAACC
ITGACGTTAA
GCATCAGCGA
TCCGAAGAAA
ACTGTCAAAC
ACAACGATTA
ACACCGCCGT
GATGTGCGAG
AAACCTAATA
TACACAA ITA
AGTGACATTC
CCAGGAGATT
ATTCTCCCAG
CCTAGTGCGA
ACTGTGACGG
GTGATCAATC
ACCGTTGATA
GAGTACCAAA
AAAAATCACG
GTTACCGCCA
CAAGAATTCA
GCAGTCATTA
TATTCAGTGA
AAAGA ITACA
GTGACTGGCG
GTTCCCTGA
GCCAGTCAAC
GGTTGGACG'I
ACGTACGAAC
AAI ICGTACC
TGAGAAAGAA
TTCTAAAT-TA
TAACGTTAGG
AAGCACAACC
CTGCACAITG
AGATTCAAAT
CTACAGCTAC
AGACTGGTAC
ATGAAGGTTC
CTGTCGCCAG
CGCGCGTAGC
TCGATAATTT
ATAAAGAAGA
AACAAATGAA
AACCAGGTTC
GTGAAGATGG
ACGGGGACTT
GGGTGATTGA
ATACCGAACA
TTACTTGGAC
AAACATGGCC
TIGATGGAAC
AAAATGGCAA
CGACGATTGA
CGACGTTAAC
CATATGGCAA
CTTGGGAAAT
CAGACACAAT
CATTTGATGA
*AAGTGGTAA7 CAGTCAAGA7
*ATAATCGTG'I
*AAAATATTAI
TAGCTGTGAP
CAGTTCCTGC
AAGTGAACGT
TGGTCCAATG
TGGCACACCA
AGAAGTGAAG
AGTAACACAA
GGCAGCGCCC
TGAATTGAAT
AGAAGATCTG
TGATCCGACA
AACAGAAGAA
CACGACAAAA
AGGACCAACA
TGAAGATCCG
TGTGACCAT
AGCAGCAT
AGGTGA I TA
TACGGTTCGT
TTCTTAGAT
TATTCCTACA
ACAAATGAT
GGTAGATATC
AACAGGGAAT
AATTAAAGAA
TCTGACGATT
CGAGGCGGTT
AAGTGATAAT
AATCTTAGAC
*TAACTACAAC
GGGGGATAAI
CAA.ACGAAA I
*CAACGGAGAC
*TGATTATAAP
GGATTGGC
*TAAAAACACIl
LTCAAAATAAI
;CTTA.ACTATC
TCTCTTCTAT
GTAAACCGTT
CTCATGGCTT
CAAACCAGCC
GAGGTTGTAG
AATAATACGC
CAGCAAGCGT
ACTrCTT
GAATCAACCG
GCATCTACA.A
GCGATAGAAA
GATATCACTG
ATTTAmTAA
CATTGGAACT
TACTTCGAGT
GTI'GATGCAG
?TTTACCTTTA
ACTCATTTGA
CAAGAAGAT
AAACAAGGCC
AATCAAGCGA
ACCTI'TA.AGT
GTGGGTCGCG
AAAGGTGACA
AT.TCCAGATG
AATCCAAATG
AAGCGCAATA
TATGGTGAAC
TTAACGI"MT
GAAGTCGTT'G
GGTTCCTTTG-
ACCAAAGTTCG
ACTGGTCAGC
GGTGCAG=I2 TATTTGATGc:
GTGTCGTTAGJ
GCCTCTACGG C TGGCGGAAGA C
AGCAAGAAAT
TGCATTATAG
GCAAGATTCA
TAGCAGATAC
AT'GGACAACA
CAATCACGAG
ACTCCTCTGT
GTAAAACAAC
ATTATTITAC
ATCCTGATGG
TTAACTGGTC
TTCAATTACC
AAGGCAATGT
ATGAGCGAAT
ATGATTCAGA
TGCCCCTGT
A'=TATCG
TGAAAGATCA
CCGTGAAAGT
AACITAGTCC
CCAACAAAGC
GCGGCGGCGA
GGTTAGATGC
TAGATTACGA
AAACCAITCC
AACCAGATTC
GAGCAGAACT
CAATTGAcTT
ATGGAATT
A'rrCAGAAGA
ATTATCAAAA
AAALATGCCGT
~'AGAGGAAGG
'ITGATGTGT
GTTAACAGC
;ACCTTAGCG
L'GCCAATGTG
kGTGCATGAC
;-TTAGTCTTA
GCAACGAGCG
TTCGCCAGCC
TCCTCGTTCG
TGAATCGACG
AiGGTGATGMA
PIACACCAGCA
GATTCCAGAA
TGGCAATTTG
CTATGGAACC
CACGTCTGAA
TGGGCGGGGC
AGTGATTCCA
AACGCCCAAT
AACAAATCCA
CTATGAGTTA
AGATGAATAT
GTATCGTC 1T TGTGCCTTrT
TGAAGCAACT
CGAAGCCAAT
AAAAGACCAA
TTTAcATI'TA
TGTGGAAGGA
TrTACATGAT
CGA-AGGCGAT
TGATGGcACA TrCAACGATr
GATTACGGAT
GTACCCAATT CGTTGGTTGT CAAAGATACA 231 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
*5 ACCACTGCTG CT GATGGTGAAA C TTCCACATAA Cl TTGGACCATT Al CATTCACAAC Al AGCCGTCTTT A( AATAATCGTrTI GCTCCCACCT TC
AAACCAACCA
ACTTGGCGTG T GITGATGAAA A CAAGGTTCTT
C.
AAAAAAGGTG Gi AATGGCGCCC A G'IGCTAGATC
C
ACGCCAGATAA
GATAATGAAA C TATATGAAT A AACCAAGTGT C GTGGTCGTTG A CTGAAGAAAA C
CACCAACCTA
TT1'GGTGGGT1 ACAGTTTCGG A
CAAGGAGCAC
GATGAGAAGC C CCGTTTACTC
TTAGAGGTGG
GGCTATC=I1
ATTCGTGATG'
CACCAAGCAG
GCAGTTCGAG
CCAGGAAAAT
CCAAGTCCT'
GCTAACTTC
TTATTAAGTG
TTGACGACAA.
TTI'GTAGAAA
ATTCCTGAGA
TACAAAGGGG
G TTTCAAT
GATGGCAAAA
CCACCAAAAT
ATTrCGTG4GTG
AA.AGGACGCG
TTrAAATTGT
AAAGATGCTT
GATGCAACG
TCAAATGATA
GGACGTAAAG
GCCGATGAGC
AAAG TTCTG
GGGCATI'ACG
CATCCATT'G
~CAGTTGAC CT CGCTAA GG 'TATACTAC CT ~CGAAATAC CG ~AGTAAACC CG :AATGCCGT CA kGTCGACGC CT .AAAGTCTA TC ~CCGTTCAC CC rGATTTTCC T kGTTATCGA AC A.CGTG.ATrCT GI CAATACCA Cl ATCGGTTITT AC CGAGTCATT CG
ATCTCTTAT
AGGACAACA A TCCTAGTTT- AN
AATTACTGC
.CATTGATCA C.
ACCCATGGA T AACACAACT C 'CCCACAAGG G CGAATTACC
T
LACCAACCAT
;TAAAAAGTTA
OTTGGGAAGA A
TAGTTTAAA
PAGACACGAC C
['TCATGTCAA
3CAATCCATT
%.ACACTTAGT
kTCAATTTGT
C
rCACGArrC
ITAACTATCA
GTGCGACATI'
ALTAATCAAGG
CCAAACCC
AAA.ATCCTG
CTTTCCAAAT
TATATGATCA
TTATCmTAG
TACCAGATG
ATTTCAAAG
CCGTCTTrA.A
ATCGAATCCA
CATTrGGCTAT
ATGGCTATAT
AACAACCACT
TCAACGAGCA
AGAATCAGCC
AAATAACAAC
TI-rTAGTCGA
ATGTAACCGC
TAGGCAAG GA TAAGTI'IT AT 'TTTTCCAT CT CTGCCATT GA -MAAACCT T CCAAAGAA ATl -rr'=ACG GA ~AAGGCAAT AC ~ATATCACA Al LATGATAGT CC ;GTTCCCCT AC ~CAGGAAAA Gl kkAAGATGAT CC ;ACGATGTG G~ MTATTTAC G( I'rAGAAGAA CG kAAATTGTC G~ ~TGACTTCT Ti AATGCTTCA CG IGTGCCCG C
GAGACGACTA
CTACGTGAA C
CAATACATTT
AA.AGGCCGA C 'ATI'AAAAAC
G
,GTCAATGCT
C
,CTTCCCCTT G
,CCAGCCCTT''
CCCAAACCA
LATGCCTAAT
=CAGGTGCT C n'CGGATGCA GAAACCA.AA C kGCAAGCGATC
%GGCACGGCT
rAAAGTccTT 3GAAATTGTT
AACACCCTAT
TAAACCAGCG
CTGAAAACC
CAATAAACAA
AGACTTIGCC
CGCACATI'AT
TCATCCCGAG
GAAAATI'GAT
AAACGGGGAA
CGAGGATTI'A
CGTCAATAAA
AGATGAGITA
AGTCAA.ACC
CCAAGCITCA
GGATAAGACT
AACGAAAGCA
CCAATTAGCA
TTTCATC.G T-AL ACGGCGT AT( 'TACCCACT TAI .TTCGACCG AT ,ACCTGc ET TT CACT1'GGA CC ~TCCAATTT TA .AAAGCCAG ATl 'GGA-AGAAC CA ,TACGTATG TG ~TTATGACA Al rTC.TAIC AA .AGATCATG TC rTATTACTG Al 3TACCAACC T .,AAAACA ET AC TTAAAATCG CC CAGCGGCCG CC AACTCCTTrC A' ATGCCACAC G( 'ITTTAGCAG G CTACACTAC A TGGTCGACA C.
TCATTACTA T ,ATGTCAATA A
.CCT'TAMATT
,CGCCCGATC C .'ATCAGTTCA C TCATCGTGA C .ACCAAGGTT
C
.AATTTTAG TI ACGGAAAAG Ti .CCCCACCAC C .,GGGGCAAAC C
"TAIATCA
GATCAAGGC
.CAGACCACT
TTATTAAATA
.TCGTGGTTG
A.ATAGCGCAG
TCATTAGGGA
CCACGTACCT
ATTATTTATC
ATTTTCCAAT
CCAATGCCA
AAAATCTTTG
GGTGCTGCTA
CAACCCATTT
GAGTTICTAA
TTAGCGGGTG
CCGATAACAT
GCGAAAT
cCA.ACACCCT
AAAGACCAGC
~AAATAAC TC( ~CCAAAAC AA' ~ACCCTAA CA.
CAACCAGC AA GATTTAAA TC ATTCCGGT TA ACCAATCA AA GCTICCGT TC ACCACAA AA ATTCAATT TA ,'ACCCCATC TT LCATGGTCG CG TACTGGCA TC 'ACACCCTC AC kACAbAAGA CC :ACACTCCA AC :CATATTCA AC ;AGTACACA CI I'CCGGATCA C~ GACTAAC Cj CCCCATTT C( TCCCACCGG CG AAAACCACC A' TGATGAAGA A AGTATTTTT A ACACCATCC C AACCAACGC C
AGAAATCCAA
ACAAAATAC C TcTGAACTA 'CCTTCACAC C ICACACTOAC
C
;CTACC'TT
ACAACACT I LAAAAGATOT C ;ACAAACGAT 9
[AGCCCCAGC
:CACGCCACTC
CTAGTCACAAC
ACCAACCATT
TIACAGCAAC
ATI'ATI TCAA
CTGAATTAGT
TAGGGCCCTI
ACCCAC'ITCC
AAAGACAAGT
GCTATGAA1T AT7'GTACT
ATI'ATCAAC
CAGTTTTTC
TCTTGAATCG
ACCCTAAAG
ATCTGTTACA
CAATGMC=T
GTAATGCA
GTGATGCC
ATCCTC
ACAATCAT
CCAAAAA
ATTTA-ACT
.CCTATTTC
AAAAAGTG
ACCAAAAT
,ACACCTCT
'ATACAAAC
;AATCAGTC
TAATCATC
:CAAACCAA
;CAACTATT
;TTACCACC
;CACCTTAT
kCGGTATCC kIATCGCAT
AAAATTCAG
cAAATTTGG
GTTATCACA
~AACCCTAT
ACTTCAGCC
GAAAAAATG
CTAACCACG
AATGGCCAG
.GCACCGACA
AGCGGACAA
ATTAAAAAA
ACAGGACAA
GATTTACC
LAACACTCAA
~ATAGCAACC
;AATGCACAC
'cAAACAGC
L.ATATCGC
:CCATTTGAA
:TTTCACT
AGCAGCTGCT
GAGTGCCAAA
ACAAATCAAA
AAAAGTAGAA
CGCCAAT'ITC
AGAACGATT
AACTCCTGA-A
AGATCAACTG
GAAGAAGAAT
AGAACTAATG
AATTTACAAT
TGCAGCAGAA
GCTAAATGAA
CACAACGCTA
AGCATcTT
S
232 TABLE 1. Nucleotidc and Amino Acid Seqeuences of E. faecalis Genes.
ATCAATTATC
GGTGCGGTGT
ATGTCTGACA
GTOGAGACAC
ATTGCCAAAG
TACCAAGGTG
GAA ITTAAAG
AACCAAGGGT
AAAGCACCGG
GCGAAAGACA
TCCAAAAAAA
CTTGGLTIAC
TTGGTCGGTT
KAGGAACTGC
TTAAGGTCAT
AGCAAGGCAA
AAGCGCCAAC
ACAACCAAGG
CTGCCAAGCT
TGACAGATGC
TAGTTCAAGT
ATGGTTACCA
AACCTGAACT
CAAAACCAAA
CGA.AAACCAA
TGGCAAGTTG
TCAATTAACC
TGATGAAACA
AGTCATTGCG
TAGCTATCTT
CAAACCAGCC
GGTGA.AAAT
AGAGACAGGG
GAACCACTTA
ACTGTCTAAG
CGTGA.ATGCG
TCACCCAACA
CACACAAGTC
GCTCTTCTAT
AAAGAAAACG
GGGCAAACCC
AAAAACTTAG
CTTAATGAAA
ACTGTGGTAC
GATCAGCAAA
CAAACTGTCG
CAACCAGGAA
CAAGCTGTCG
GGCACGTrL'G
ACGAAACA.AG
jATTACTTCT
AAAAAGAGCA
AAACAGGTGA
TAGATGGACA
CACCGGGAAC
CGCCAAGCGC
TAAAGCACC
AGAATGCCT
CTCGTTCATT
AATATACCTT
CATTCACTAT
TTAACGAGAA
CAGCTAGAGA
TTGTCTTTAT
AGAAkATAA
AGCATTGGCA
AACCAATCTC
GTATCGTTT
AAGCTTTACG
TTTTAITrAAT
AGCAGGTGCT
ACGTTCTGAC
TGTGGAAACA
TGCGGCAACA
ACAACCTGTA
GACAGGTTGG
CGGCCTCATG
EF125-2 (SEQ ID NO:466) MRKNGPMV NRWLYGLMCL LLVLNYGTPL MA.LAEEVNSD
GQLTLGEVKQ
KKQ IQ IELNQ
SANEGSTEEA
TI IDNFEDPI
PNKPGSGDLV
GDWVIDIPTQ
VTETWPTGN T
YQTTIDEAVI
EFTWEINYNY
DYKVVINGDG
SQQNI IXNTG
TGAQLTLGKD
DHYRNTAAID
NRLVDAFLTD
WRVDFPNDSR
KCGEYHKDDP
PDKSVILEEG
QVS ITGNGSE
QAKTQVEJREG
GAQPTI IXND
EVDSLKPGLY
QAGNPLAGAE
SAFTIAASDR
'rrNNQGEIVA
KGAFQIVKTN
PKLPDGADYI
KLYRI ENGER
NDKQPLDELE
VSEITTDKTC
NYQGTAQLTI(
ETQAPTSYLL
FKVTDAETC
KDKPELVNAC
VGLASWLFY?
TSQQEMTLAL
QALADTLVLT
STNSSVPRSS
YLNPDGTPAT
DAEGNVYGTY
EDLPPVVIPI
FKSVKVVYELV
PDGGGDVPFK
GEQTIPKDQA
SFAIDFLHDV
AVDYQNSTIG
FMVEITRNAD
WTDEAGNNHH
PILTNQTYLA
TYVIEFKTSV
DHVYWHVMIN
KDYTLEVTTD
VVHGDDNGDV
TWDATGVITP
VNKVFLEa<M
QFTEIEAPTC
FSVLDTTGQA
GKPATVIATP.
EHLAPGKYRF
SADQPLAGA%
IYPELVKVE]
IFEREVTAET
FVNYQAEVMC
EIYAXGLNEC
ENETrOEALAC
NETPSASFT:
TVARSLRSDI
TFVNEKQPV!
KSKK
QGKAQPVTQE
LNPTATEDVT
EETVASTTKA
PPYKEDVTIH
TI SEDGTVRF VPDTEQQ IDK MNhLDGTIKEV
NHATLTSDNN
VITDTMGDNL
TCAVKIDYKT
WTLAVNQN.Ny GETGFKVSF I
SEDSKPFKPL
GSLKVYEGNT
DEKVIEGSAS
GAQSVLDDVV
NETGQQKIVV
VVDIDHSGGH
GGLPQGQYIL
EKGK'KLVNAR
YLLDTTPKRF
VREHLVSDAN
NFVNYQGTAK<
VETKAPTGYL
FELYDHNKQS
RGDFKGDPEI
DGSLAMEDLC
;RKVNEQGQT.
;HYVLVETKAI
;AVFKVIDETC
E AKDNQGKPAJ qI QGLVQVNHL< 5 KKTKPNQPT'J
VVVHYSANVS
FSYGQQQRAL
IESKTTES~r WNFNWS I ED
TFNERITSES
QGFFDRTPNP
GRELSPDEYT
PNGLDAE-AT
TFEPDSLHLY
KVDGIVEGDV
LMENAVITC)T
GAYAKTSDAF
PAFDLNAQKS
KPDGSVEKVK
YDN4TASYTNQ
ITDTPSPNQV
KMAH IEAPYY
ATGTKGKIQL
vETKAPECYT
FKLEHAV'ITP
IVTQNTSGQ I
GKVTVTDLAP
*LIKKDVNGHL
LNTTPVPFEI
LGITATSGKE
*FQLGAFANPB
*AGSYELDELE
AGAVFAIYNI
TGYLLDTI'I
QTVDGQTNUl
SVVLKAPFIY.
SPGXYTFVET]
r' KQAARETCGWI
IYKAAHWAAPN
TLKTGTOPTE
VKPRVAGPTD
VREQMKAGDY
DIHGDFSLDT
SAITWT\TDIN
VDKiNGNVTIK
TATYGKMLDK
SVTFDDKGNE
AVNNRVDVGT
YE? VPGLTMV
HITYTTFFDV
GVYNAVTKEI
PTQ PLTD ITM
GSSRDVTGKV
LDPESLVIYG
MEYRSLVTSS
KKTAI4DETTI
VSDELAI(GRV
FTHWEEVPLA
RDVHVKMLNY
GKYQFVETKA
LSGATFKVLD
AEKNAGKPAV
GKIIFRDLAP
GRAVFKYIDA
ATDGYIVNKC
SDEQNQPQGSE
I PFDVTAQLG? d1 SDKQGKVIA? C QGAAKLVKIE k APDGYQLSK(
GLPKTNTQVW
NTRKIQVDDQ
STAITSSPAA
ISDYFTGDET
FEFQLPGNLK
HLNDSDGRGP
QAMKDQTNPT
GDTNKAYRLE
RNIDYDEANQ
VVGAELVEGK
GQHSEDDCTA
PNSLVVKTJTT
TELDANNPAL
TWTIAVNLSN
EEPSEKNQNT
SIQHGGESVK
TNVTEUGTIT
AAGSTDTVSN
LAGAHFQ IWD
ITIDEETSAE
PDRTNANCQL
QGSAELIKKD
PAGYLLNTEP
AKGETIOTGL
WVASDNFVSY
GTYYYKEIKA
NANPLPGTIF
PIYFVVKKNS
ITFLNRAGEK
EQPIALGDLI
NLAPGrfYRFV
QQKNALAGAE
AVAFTIAATA
4 YFFVFICLML
S
a EF125-3 (SEQ ID NO:46 7 233 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
TAATT- TACTATCA AGGCACG'GCT AAATTAATCA AAAAAGATGT GAATGGACAC
TTATTAACTG
TTGACGACAA
TTTGTAGAAA
ATTGCTGAGA
TACAAAGGGG
OTTTTTGMAT
GATGGCAAAA
GCACCAAAAT
ATTCGTGGTG
AAAGC-ACGCG
TTTAAATTGT
AAAGATGGTT
GATGCAACGG
TCAAATGATA
GGACGTAAAG
GCCGATGAGC
AAAGTTTCTG
GGGCATTACG
CATCCATTITG
ATtAATTATC
GGTGCGGTGT
ATGTCTGACA
GTGGAGACAC
ATTGCCAAAC
TACCAAGGTG-
GAATTTAAAC
AACCAAGGGI
AAAGCACCGC
GCGAAAGACI
TCCAAAAAAJ
CTTGGT
GTGCGACA IT
ATAATCAAGG
CCAAAGCGCC
AAAATGCTGG
CTTTCCAAAT
TATATGATCA
TATCTTTAG
TACCAGATGG
ATTTcAAAGG
CCCTCTTTAPA
ATCGAATCGA
CATTGGCTAT
ATGGCTATAT
AACAACCACT
TCA-ACGAGCA
AGAATCAGCC
AAATAACAAC
T'TTrAGTGGP
ATGTAACCGC
AAGGAACTGC
TTAAGGTC'A
AGCAAGGCMJ
AAGCGCCA-AC
ACAACCAAGC
CTGCCAAGCr.
TGACAGATG(
7TAGTTCAAG'
;ATGGTTACC,
k AACCTGAAC k CAAAACCAA
TAAAGTC'I
GGAAATTGTT
AAcAGGCTAT
TAAACCAGCG
CGTGAAAACG
CAATAAACAA
AGACTTGGCG
CGCAGATAT
TGATCCGGAG
GAAA?1'GAT
APLACGGGGAA
GGAGGATTTA
CGTCAATAAA
AGATGAGTTA
AGGTCAAACC
CCAAGGTTCA
GGATAAGACT
AACGAAAGCA
CCAATTAGGA
TCAATTAACC
TGATGAAAC1P
SAGTCATTGCC
TAGCTATCT'I
3CAAACCAGCC r~ GGTGAAAATJ
:AGAGACAGGC
r GAACCACTTJ k. ACTGTCTAA( r CGTGAATGC(
GATGCGAAGG
GCAGAGCACT
TTATTAAATA
GTCGTGGTTG
AATAGCGCAG
TCATTAGGGA
CCAGGTACCT
ATTAI=ATC
ATT='CCAAT
GCCAATGCGA
KZAAATCrG
GGTGCTG;GTA
CAACCCATTT
GAGTTTGTA-A
TTAGCGGGTG
CCGATAACkI
GGCGAAATTI
CCAACAGGC2l
AAAGAGCAGC
AAAGAAAACC
GGGCAAACCC
*AAAAACTTAC
CTTAATGAA)
ACTGTGGTA(
7GATCAGC"A
CAAACTGTC(
k. CAACCAGGA.
3 CAAGCTGTC(
GGCACGTT
TAGCCCCAGG
CCACGCCAGT
CTAGTGACAA
AccAACcATT rTACAGCAAC
ATTATTACAA
CTGAATTAGT
TAGGGGCCTT
ACCCACTTCC
AAAGAGAACT
GCTATGAATT
AkTTTTGTAGT
ATTATCA.AC
CAGTTTTTGC
TCTTGA-ATCG
ACGCTAAACG
ATCTGTTAGA
CAAT1GCTT'I
AAACAGGTGA
3TAGATGGACP 3CA CCGGGA6AC k. CGCCAAGCGC
TTAAAGCACC
~AGAATGCCTI
3CTCGTTCAT'
PAATATACCT
3 CATTCACTA! 3TTAACGAGA 3CAGCTAGAG,
TCAAACAGGC
AAAATATCGC
CCCATTTGAA
CTTTGAGT
AGCAGGTGCT
GAGTGGCAAA
AGAAATCAAA
AAAAGTAGAA
CGCCAATTTC
AGGAACGATT
A.ACTGCTCAA
AGATGAACTG
GAAGAAGAAT
AGAAGTAATG,
AATTTACAAT
TGCAGGAGAA
GCTAAATGAA
CACAACGCTA
AGGCGATCTT
AGCATTGGCA
LAACCAATCTG
GTATCGT'TI'
AAGCTTTACG
TTT'rATTAAT
SAGCAGGTGCT
C ACGTTCTGAC C TGTGGAALACA r TGCGGCAACA k. ACAACCTGTA k. GACAGGTTGG
S
S
5.5 TCAGCCAACA ACGAAACAAI EF125-4 (SEQ ID NO:468) 5*S*
NFVNYQGTAK
TTNNQGEIVA
KGAFOIVKTN
PKLPDGADYI
KLYRIENGEX
NDKQPLDELE
VSEITTDKTG
NYQGTAQLTK
ETQAPTSYLL
FKVTDAETGQ
KIJKPELVNAG
LII(KDVNGHL
EHLAPGKYRF
SADQPLAGAV
IYPELVKVEI
IFEREVTAEK
FVNYQAEVMG
EIYAKGLNEG
ENETGEALAG
NETPSASFTI
TVARSLRSDN
TFVNEKQPVS
LSGATFKVLD
VETKAPTGYL
FELYDHNKQS
RGDFKGDPEI
DGSLAMEIDLG
R.KVNEQGQTL
HYVLVETKAP
AVPKVIDETG
AKDNQGKPAT
QGLVQVNHLQ
KKTKPNQPTT
AKGETIQTGL
LN'TTPVPFE I
LGITATSGKD
FQLGAFANFI(
AGSYELDELD
AGAVFAIYNA
TGYLLDTTLH
QTVDGQTNLM1
WLKAPFIN'Y
PGKYTFVET(
AEKNAGKPAV
GKI IFRDLA? GRAVFKKI DA
ATDGYIVNKQ
D)EQNQPQGSP
PFDVI'AQLGK
SDKQGKVIAK
QCAAKLVKID
APDGYQL.SKQ
VVASDNFVSY
GTYYYKEIKA
NANPLPGTIF
P IYFVVKKNS
ITFLNRAGEK
EQPIALGDLI
NL.APGTYRFV
QQKNALAGAE
AVAFT IAATA
KQA.ARETG-WLG
EF126-1 (SEQ.ID.NO:469)
TAGCGAAAGA
ATGGTGATTG
GAAGCGGTTA
AGTCTGGCTG
GAAACGGAAC
AAATAGGGAG
TCGCTGGAAC
AAGCAGGAGA
ATTGTAAACG
CAGTCGATTT
GATTAAAATG
AGTTGTGGGA
TACAGAAGGA
GATATTGGAA
AGTAGTI'GTT
TTrAAGAAAG
AATTTCAGTC
ATGACCAATA
GGACAAGCTA
GAAGATGCTA
CAACGAAATT
CCACATTGGC
CGGTGAAAGT
CTTTCCCAGT
GTGGTAGTTT
ATTATCGACA
TTTAGCTGAA
GAPLAGACGAC
TCAAGCGGGT
TTCAGATAAT
234 TABLE 1. N~ucleotide and Amino Acid Seqeuenccs of E. faecalis Genes.
TTTCCACATG
GTGATGCTGG
ATTAATTCAG
AGCCAATTTG
GGATTGA.AAC
ACGTATTTCC
AAGACCAATA
TCAGTGGAAT
GAAkATTACTA
AGTTCAGTGA
GAGTTGCAAC
TTTACAACCC
GCTTCATI'AA
GGAAATGATG
TACGTAGGAA
ACCCITGTAA
ACGATTCCTA
ACAAAAGATA
TGGCATGTCP
GATGACATTP
CATOITACAC
AAACAAGCAC
AAAA'ITAAJ
cccA.AccAAC
ACCGTAACA(
TTAGATTTAX
GAAACAAGCC
ACTGATGTAk
CAAGAAAAT,
AGTGGTCAT,
GAATTAGCA'
AACGAAGGT
GAAGATCCA
CACCAAGAA
CAAGCCAGC
GANGAAAAT
ACTTTAC'I
ATGACTATI
CAACAAGGC
CAT'ICCAAC
GAACCTAAP
ACGAACCCI
ACAACACTI
TTGGCATI
TAAGACAAGC
CTTCATATCG
CTGATTATGA
TCTCTGGTTT
TCGCTTTAGA
TATTAGTGAC
CCAATGATTC
ATAGTAATGA
AC C AGCTA
ATTCATACTT
AAGGGTCTAG
ALATTAAAACA
CGATTGCCAA
TGCCTGTTCA
ACATCACGAT
GTAGTGGGAC
AAAATGACAA
TCGAAAATCA
AAACAGCCTT
ATAAAGTOCT
CTAACGGCAC
ACAGCTATGA
k CTGACGCAAC
;CCGACTTAA-P
:CACCGCCAGI
k CCAACCGCG;
CTTGGACCCI
k AAGTCACAGI k. ACAAAGTAAC AL CGTACACAA' C CTTATATTGQ G ACGTGTTGC A CGATTACAA" T TTAAATGGA.
A TCGTGGATG, C GCAAAGATG' 'A TCAACAAAA 'A CCACTAAAA !G GGATTCCCA A AGCCAACCC iC AACCGCTAA LG TCAATTTTC L.A ATCCACTTI .A CAAAAAAT.P
GATTGATGA.A
CGGCGGAAAA
TATGAATGTG
TGGAGACGTT
TACGTACAAT
AGATGGGGTC
AATCAATGAA
cTACCAAGGT
TGAAATGATT
TGATAAATAT
CACACCAGAA
AATTGTCAAA
TCAATTTGAT
AATTAACGGA
TCACTACGAA
AATGAATCA-A
TGCGCATGCC
AGAACAC'TP
TGGCA.ACGAJ;
AGATATCAT'I
AGTAACACAJ
CTATTTAAGJ
GGACGAAGPI
CTTTGCCAA'
TGATC-CAAA'
TCAAGAAT'FI
AGCCAGCAT(
TGAAAATGG'
TTTTCAAAT,
r' GACCATIAC
ACAAGGTG
k. TTCCAACAA ,k AGATATCGA P. cGTCAAAAC A. CA'ITAATAA T TACAGATAA A AGATGACAG T TAAAACTGA A CCA-AGCCGA T A.ACACCGC A ACCGAAAJ G AAAATCAGC 'A CATCATCGC LA AAAAAGAAJ
GTGTCAAG
CA.ATTITATGT
CGCGTCAATA
CGGACGTATG
cAAACACACG
GCTAATACAC
TATCCAGATC
GCAGCAGCAG
AATCTATT
AAAACAGAAG
GATITATTA
r-ATCGTCTGG
*ATTCAATCTG
*CAAAccA'TTT
*CTCAAAGAAA
GGAACAATTG
TGTGACGTGA
LGATTTA.ACCA
LACCAGTACTT
7GATGTGA.AAG C AAAATAACA
SGGTCATACG'I
k TTAGCGCCT'I C' GAAGGTGACC r' ATTGCTAAAC r AAATG.GAACC .3 GTAGATGACI r AAAGATGTT)
AACAANCAA(
T ACTAAAAMC C ATTCCCAACI GC'ITATCTGA T 'TCCTGATGG A CGCAATTGAC Ir GTGGTACGCC P GAGATTTAAC C GTTTAGATGG 'I CAAGACATCC I AAG'ITTTAGC C GGGAAAGTGT 9
TGGGT-CCTT'
CAAGCCAATC
CGCAATCCACJ
CGACCGCTAC
CAGCALACTAG
ATACACAT
CTAAGGA-AT
CGCCAGAAGA
ATCGTGAAGA
GGACCCAAC
TCACCGACGA
,AAGTAACTT
'ATACAATGAC
ACATTGAACA
TGTTACArTC
ACGTAGAACG
TCAAAACAGC
k TTAATAAAGT k. CAGCTA.ACCG
CNGACAGCTA
k. AAGCTAGCGC
'AAGCCGACTT
:AAGACCGC
k.AGACAAAA
TATGATAAA
A.CCGCCCCA
A.ATCGAAAA
TACTTGCAT
CTTCAAGTC
rTAAACCAA r.AATCTTI'A
GTAAAACAA
ATTGATGAT
CCAGCAACA
;GACGATGCT
ACAGAAGGT
CATGCAGCA
CCAGAACCC
CCAACGA TT
AGTTTCGAT
AGCATGGTC
kAATGGTAAA rGAAATGAAC rATCACCACT
A.GGCGGGATT
:AACAAACCA
ACAAGAACAT
TTTCGGTAAC
GTTAGAcATC
CAAAGTAACA
TGACTATTTA
AACGGACGAA
GAAC'ITTGGC
ACCAACGCCA
AACCAACCGT
CACATGGACC
GAAAGTTNCT
TAAcAAAGTA
TACATACACA
cCCTTATATT
TGACGTGTTG
AAAAAAACCT
TCAAGCACCA
AATGACTAAT
GGCTATTAGC
CCAACCGTAA CACCACCTGC GGCCAAGAAC
ATTTAGATTT
GCTrCGGTA
GTGTTAGACA
GGCATAGTAA
TACTCTTACT
GCAACGGATG
TTAAACTTTG
GCACCAACGC
CCGTTGACC
AGTAAAGGAA
GG'rTTAATTG AA ITAC ACGA;AcAAG
TCACAGACGT
CACAAGAAAA
TAGCTGGTCA
AAGAA'ITAC
GCAACGAAGG
CAGAAGACCC
CTACAAATCA
TTCATTTACC
TCCTTATAGT
EF126-2 (SEQ ID NO:470) MF KXATKLLSTM vIVAGTVVGN~ FSPTLALAEE AVKAGDTEGM TNTVKVKDDS LADCKRILEG QATFPVQAGE TEPVIJVVVE DASGSFSDNF PHVRQAIDEV
VQGLSD)QDRV
MLASYRCGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR
TYGGTPTAPG
LXLJALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDS1INEY PDPRHPLQVS VEYSNDYQGA AAEVLALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE LQQGSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG UDVPVQINGQ TISATSTEGY VGNITIHYEV KENTAIrJAAT LVgSGTMNQG TIAXEFPEAT 235 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
I PKNDNAHAC
DINKVLDIID
IKTDATDEEL
DLTNRDQEFK
ENNKVTFEMN
EGDVLHSNKP
ASMVDDINKV
TITTKIKTDA
PKQPLKPK(P
GITKNKKRKN
DVTPEDPTIT
VKVTDENGKD
APYIEQGGIP
WNVKTAFGNE
XQADSYDYLS
TVTPPAPTPE
LDITDVKVXX
TDEELAPYIE
LTPTNHQAPT
KDIENQEHLD
VTANGTVTQE
NQADLNFGNE
TSTWTQASMV
GHTYTMTITT
DPTITKDIEG
ENGKDVTDNG
QGGI PNQADL
NPVNFGKSAS
LTNREDSFDW
NNVTFMK
GDVLHSNKPT
DDINXVLDIT
KIKASATDEE
QEHLDLTNP.D
IVTQENNVT
NFGNEGDVLH
KGIHLPM'TNT
HVKTA.FGNET
QADSYDYLSG
VTPPP'JOPNI
DVKVTIDENCK
LAPYIEQGGI
QEFKWNVKTA
FTMNKXDDSY
SNKPTVTPPA
TVNPLYMIAG
STWTQASMVD
HTYTMTITTK
AKDVEGQEHL
DVTANGKVTQ
PNQADLNFGN
FGNETSTWTQ
SYLAGHTYTM
PTPEDPKKPE
LIVL IVA 1SF EF126-3 (SEQ ID NO:471) 9*
S
59 9.
9 9 @9 59 5S
S.
S S 5 5* 5 0
S
9 .9 9559 5 9* 9 59 @5
TGAA
GAAGCGGTTA
AGTCTGGCTG
CAAACGGAAC
TTTCCACATG
GTGATGCTGG
ATTAATTCAG
AGCCAATT1'G
GGATTGAAAC
ACGTATTC
AAGACCAATA
TCAGTGGAAT
GAAATTACTA
AGTI'CAGTGA
GAGTTGCAAC
TTTACAACCC
GCTTCATTAA
GGAAATGATG
TACGTAGGAA
AAGCAGGAGA
ATTGTAAACG
CAGTCGATTT
TAAGACAAC
CTTCATATCG
CTGATTATGA
TCTCTGC [T TCGC IAGA
TATTAGTGAC
CCAATGATTC
ATAGTAATGA
ACCAAGGCTA
ATTCATACTT
AAGGGTCTAG
AATTAAAACA
CCATTGCCAA
TGCCTGTTCA
ACATCACGAT
TACAGAAGGA
GATATTGGAA
AGTAGTITGTT
GA ETGATGAA
CGGCGGAA.A
TATGAATGTG
TGGAGACCFI'
TACGTACAAT
AGATGGGGTC
AATCAATGAA
CTACCAAGGT
TGAAATGATT
TGATAAATAT
CACACCAGAA
AATTGTCAAA
TCAATTTGAT
AATTAACGGA
TCACTACGAA
ATGACCAATA
GGACAAGCTA
GAAGAT-uCTA
GTGGTTCAAG
CAATTATGT
CGCGTCAATA
CGGACGTATC
CAAACACACG
GCTAATACAC
TATCCAGATC
GCAGCAGCAG
AATGCGTATT
AAAACAGAAC
GATTI'ATTA
GATCGTCTGG
A ETCATCTG
CAAACCATTT
GTCAAAGAAA
CCC TGAAAGT
CTTTCCCAGT
GTGGTAGTTT
CCTTATCTGA
TTCCTGATGG
CGCAA ITGAC
GTCGTACGCCC
GAGATTTAAC
GTTTAGATGG
CAAGACATCC
ALAG=IITAGC
GGGAAAGTGT
TGGGTCCTTI
CAAGCCAATC
CGCA.ATCGAC
CGACCGCTAC
CAGCAACTAG
ATACAGCGAT
GAAAGACGAC
TCAAGCGGGT
FI'CAGATAAT
TCAAGACCC
AAAGACAAAA
TTATGATAAA
AACCGCCCCA
GAATCGAAAA
TI'ACTTGCAT
TCTTCAAGTC
GTTAAACCAA
TGAATCTTA
TCTAA.AACAA
TATTGATGAT
ACCAGCAACA
GGACGATGCT
TACAGAAGGT
TGAT
EF126-4 (SEQ ID NO:472) EE AVKACDTEGM TNTVKVKDDS LADCKRILEC QATFPVQAGE TEPVDLVVVE MLASYRGGKQ FMFPDGKTKI NSADYDMNVR LKLALDTYNQ THGDLTNRKT YFLLVTDGVA VEYSNDYQGA AAEVLALNQE ITNQGYEMIN LQQGSSTPED FITSQSIDDF TTQLKQIVKD NDVPVQINGQ TISATSTEGY VGNITIHYEV
DASGSFSDNF
VNTQLTYDKS
NTRLDGYLHK
AYWTESVESLS
RLAQSTPATA
I{ENTAID
PHVRQAIDEV
QFVSGFGDVR
TNTNDS INEY
SVNSYFDKYX
SLTIANQFDI
VQGLSDQDRV
TYGGTPTAPG
PDPRHPLQVS
TEVGPFVKQE
QSATATDDAG
EF127-1 (SEQ ID NO:473)
TAGCGAAAGA
ATGGTGATG
CAAGCGGITA
AGTCTGGCTG
GAAACGGAAC
T'ITCCACATG
GTGATCCTGG
AAATAGGGAG
TCGCT'GGAAC
AAGCAGGAGA
ATTGTAAACG
CAGTCGATTT
TAAGA'.AAGC
C?1TCATATCG
GATTAAAATG
AGTTGTGGGA
TACAGAAGGA
GATATr'GGAA
AGTAGI'GTT
GATGATGAA
CGCGAAAA
TTTAAGAAAG
AATITMAGTC
ATGACCAATA
CGACAAGCTA
GAAGATr.CTA GTGGTrCAAG
CAAMATGT
CGCGTCAATA
CAACGAAATT
CCACATTGC
CGGTGAAAGT
CTTTCCCAGT
GTGGTAGTTT
GCTTATCTGA
TrCCTGATGG
CGCAATTGAC
ATTATCGACA
'TTTAGCTCA
GAAAGACGAC
TCAAGCGGGT
TrcAGATAAT
TCA.AGACCGC
AAAGACAAAA
TTATGATAAA
ATTAATTCAG CTGATTATGA TATGAATGTC 236 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
2 kGCCAATTTG TCTCTGG'TT I 3GATTGAAAC TCGCPTTAGA k.CGTATTITCC TATTAGTGAC A~ %.AGACCAATA CCAATCA'PTC rcAGTGGAAT ATAGTAATGA C 3AAATTACTA ACCAAGGCTA TI !LGT-TCAGTCA A'ITCATACTT I 3 AGTTGCAAC AAGGGTCTAG C TTTACAACCC AATTAAAACA GCTTCATTAA CGATGCCCAA IJ GGAAATGATG TGCCTGTTCA2 TACGTAGGAA ACATCACGAT ACCCTTGTAA GTAGTG3GGAC ACGATTCCTA AAAATCACAA ACAAAAGATA TCGAAAATCA TGGCATGTCA AAACAGCCTI' CATGACATTA ATAAAGTGCT GATG'FrACAG CTAACGGCAC AAACAAGCAG ACAGCTATGA AAAATI'AAAA CTGACGCAAC CCCAACCAAG CCGACTTAA ACCGTAACAC CACCGCCAGT TAGATTTAA CCAACCGCGA GAAACAAGCA CTI'GGACCCA ACTGATGTAA AAGTCACAGA CAAGAAAATA ACAAAGTAAC AGTGGTCATA CGTACACAAT GAATTAGCAC CTI'ATATTGA AACGAAGGTG ACGTGTTGCA GAAGATCCAA CGA'ITACAAA GACCAAGAAT TTAAATGCAA CAAGCCACCA TGGTGGATGA GANCAAAATG GCAAAGATG-T ACTTTTACTA TGAACAAAAA ATGACTA?1TA CCACTAAAAT CAACAAGGCG GGATTCCCAA CATTCCAACA AGCCAACCGT GAACCTAAAC AACCGCTAAA ACGAACCCAG TCAATTTTGG ACAACAGTAA ATCCACTTTA T=rGGCATAA CAAAAAATAA
YGGAGACGTT
'ACGTACAAT
4
GATGGGGTC
LATCAATGAA
TACCAAGGT
~GAAATGATT
~GATAAATAT
:ACACCAGAA
=.TTTCAAA
~CAATTTGAT
ATTAACGGA
rCACTACGAA kATGAATCAA
EGCGCATGCG
AGAACACTTA
rGGCAACGAA
AGATATCATT
PGTAACACAA
CTATTTAAGT
GGACGAAGAA
CTTTGGCAAT
TGATCCAAAT
TCAAGAATI'
AGCCAGCATG
TGA.AAATGGT
TITTGAAATG
GACCATI'AC7
ACAAGGTGGC
rCCAACAAA
AGATATCGAP
CGTCAAAACP
CATITAA.TAA;
TACAGATAAI
AGATGACAGC
TAAAACTGAC
CCAAGCCGAC
AACACCGCC'
ACCGAAAAA2
AAAATCAGCJ
CATGATCGCJ
AAAAAGAAA)
CGGACGTATG
CP.AACACACG
GCTAATACAC
TATCCAGATC
GCAGCAGCAG
AATGCGTATT
AAAACAGAAG'
GA LT=A'TA
GATCGTCTGG
ATTCAATCTG
CAAACCATT
GTCAAAGAAA
GGAACA
TGTCACGTGA
GA=TAACCA
ACCAGTACTT
CATr.TGAAAG
GAAAATAACA
GGTCATACGT
rrAGCGCCTT
GAAGCTGACC
ATTGCTAAAG
AA.ATGGAACG
GTACATGACA
AAAGATGTrA
AACAANCAAG
ACTAAAATCA
ATTCCCAACt
*CCAACCGTA-A
GGCCAACAAC
LGCTTTCGGTP
GTGTTAGACP
GGcATATAP
TACTCTTACI
GCAACGGATC
TTAAACTM
r' GCACCAACGC k CCGTTGACG( kAGTAAAGGA2 GGTrrAA=r k. AATI'AG
GTGTACGCC
GAGATTTAAC C GTTTAGATGG I CAAGACATCC IJ
AAGTTTTAGCC
GGGAAAGTGT
-GGGTCCTT
CAAGCCAATC
CGCAATCGAC
CGACCGCTAC
CAGCAACTAG
ATACAGCGAT
CTAAGGAAT
CGCCA6AAGA
ATCGTGAAGA
GGACCCAAGC
TCACCGACGA
AAGTAACT
ATACAATGAC
ACATTGAACA
TGTTACAI1TC
ACGTAGAAGG
TCAAAACAC
TTAATAAAGT
CAGCTAACGG
CNGACAGCTA
AAGCTAGCGC
AAGCCGACTT
CACCACCTGC
ATTTAGAT
ACGAAACAAG
TCACAGACGT
CACAAGAAAA
TAGCTCGTCA
AAGAATTAGC
GCAACGAAGG
CAGAAGACCC
CTACAAATCA
k. TTCATTTACC
'TCCTATAGT
LACCGCCCCA
;AATCGAAAA
~TACTTGCAT
LCFI'CAAGTC
;TTAAACCAA
EGAATCTTA
rGTAAAACAA rATTGATC-AT kCCAGCAACA jGACGATGCT
TACAGAAGGT
TGATGCAGCA
TCCAGAAGCG
TCCAACGATT
TAGTTTCGAT
CAGCATGGTG
A.AATGGTAAA
TGAAATGAAC
TATCACCACT
AGGCGGGATT
CAACAAACCA
ACAAGAACAT
TTTCGGTAAC
GTTAGACATC
CAAAGTAACA
TGACTATTTA
AACCGACGAA
GAACTTTG'GC
ACCAACGCCA
AACCAACCGT
CACATGGACC
GAAAGTI'NCT
TAACAAAGTA
TACATACACA
GCCTTATATT
TGACGTGTTG
AAAAAAACCT
TCAAGCACCA
AATGACTAAT
GGCTA'ITAGC
0* 9O S
S@
0e S S
SO
0@ 5 S
SS
@0 4~ S
S
@5 S S
S
0 *5@S 5
SOS.
OS
5555
S
@505
S
5.55 0 @555 55 0
S.
SO
EF127-2 (SEQ ;D NO:474) MF KKATKLLSTM VIVAGTVVGN FSPTLALAEE AVKAGDTEGM TNTVKVKDDS
LADCKRILEC
MLASYRGGKQ
LKLALDTYNQ
VEYSNDYQGA
LQQGSSTPED
NDVPVQ INGQ
IPKNDNAHAC
DINKVLDI ID
IKTDATDEEL
QATFPVQAGE
FMFPDGKTKI
THGDLTN~RKT
AAEVLALNQE
FITSQSIDDF
TISATSTEGY
DVTPEDPTIT
VKVTDENGKD
APYIEQGGIP
TEPVDLVVVE
NSADYDDIVR
YFLLVTDGVA
ITNQGYEMIN
TTQLKQIVKD
VGNITIHYEV
KDIENQEHLD
VTAN'GTVTQE
NQADLNFGNE
DASCSFSDNF
VNTQLTYDKS
NTRLDGYLHK
AYWESVESLS
RLAQSTPATA
KEN'rAIDAAT
LTNREDSFDW
NNKVTFEMNK
PHVRQAIDEV
QFVSGFGDVR
TNTNDS INEY
SVNSYFDKYK
SLTIANQFDI
LVSSGTMNQG
HVKTAFGNET
QADSYDYLSG
VQGLSDQDRV
TYGGTPTAPG
PDPRHPLQVS
TEVGPFVKQE
QSATATDDAG
TIAKEFPEAT
STWTQASMVD
HTYTMTITTK
GDVLHSNKPT VTPPPVDPNI AXDVEGQEHL 237 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
DLTh?.DQEFK
ENNKVTFEN
EGDVUISNKP
ASMVDDINKV
TITTKIKTDA
PKQPLKPKKP
G ITKNKKRKN
WNVKTAFGNE
XQADSYDYLS
TVTPPAPTPE
LDITDVKVXX
TDEELAPYI E
LTPTNHQAPT
TSTWTQASMV
GHTYTMT ITT
DPTITKDIEG
ENGKDVTDNG
QGG:?NQACL
NPVNIPGKSAS
DDINKVLDIT
K IKASATDEE
QEHLDLTNRD
IVTQENN'KVT
NFGNEGDVLH
KGIHLPMT-NT
DVKVTDENGK
LAPYIEQGGI
QEFK7WVKTA
FTMKKDDSY
SNKPTVTPPA
TVNPLYM TAG
DVTANGKVTQ
PNQADLNFGN
FGNETSTWTQ
SYLAGHTYTM
PTPEDPKKPE
LIVLIVAISF
EF127-3 (SEQ ID NO:475) GAATCAA GGAACA.ATTG CTAAGG;AATT TCCAGAAGCG
ACGATTCCTA
ACAAAAGATA
TOGCATGTCA
GATGACATTA
GATGTTACAG
AAACAAGCAG
A-A-ATTAAAA
CCCAACCAAG
ACCGTAACAC
TTrAGATTTA6A
GAAACAAGCA
AAAATGACAA
TCGAAAATCA
AAACAGCCTT
ATAAAGTGCT
CTAACGGCAC
ACAGCTATGA
CTIGACGCAAC
CCGACTTAAA
CACCGCCAGT
CCAACCGCGA
CTrGGACCCA
TGCGCATGCG
AGAACACTTA
TGGCAACGAA
AGATATCATT
AGTAACACAA
CTATTTIAAGT
GGACGAAGAA
CTTTGGCAAT
TGATCCAAAT
TCAAGkAAI'
AGCCAGCATG
TGTGACGTGA
GATTTAACCA
ACCAGTAC'T
GATIGTGAAAG
GAAAATAACA
GGTCATACGT
TTAGCGCCTT
GAAGGTGACG
ATTGCTAAAG
AAATGGAACG
GTAGATGACA
CGCCAGAAGA
ATCGTGtAAGA GGACCCA6AGC
TCACCGACGA
AAGTAACTTT
ATACAATGAC
ACATTGAACA
TGTTACATTC
ACGTAGAAGG
TCAAAACAGC
TTAAT
TCCAACGATT
TAGTTTCGAT
CAGCATGGTG
AAATGGTAAA
TGAAATGAAC
TATCACCACT
AGGCGGGATT
C AAC AAAC CA
ACA.AGAACAT
TTTCGGTAAC
EF127-4 (SEQ ID NO:476) MQG TIAKEFPEAT IPKNDNAHAC
DVTPEDP'TIT
DINKVLDI ID VKVTDENGKD IKTDATDEEL APYIEQGGI P DLTNRDQEFK WNVKTAFGNE
KOIENQEHLD
VTANGTVTQE
NQADLNFGNE
TSTTQASMV
LTNREDSFDW
NNKVTFEMNK
GDVLHSNKPT
DDIN
HVKTAFGNET STWTQASMVD QADSYDYLSG HTYTMTITTK VTPPPVDPNI
AKDVEGQEHL
EF128-1 (SEQ ID NO:477) TAGCGAAAGA AAATAGGGAG GATTAAAATG ATGGTGATI'G TCGCTGGAAC AGTTGTGGGA GAAGCGGTTA AAGCACGAGA TACAGAAGA AGTCTGGCTG ATTGTAAACG GATATTGGAA GAAACGGAAC CAGTCGATTT
AGTAGTT.
TTTCCACATG TAAGACAAGC GAT=KTGAA GTGATGCTGG CTTCATATCG CGGCGGAAAA ATTA.ATTCAG CTGATTATGA TATGAATGTG AGCCAATTTG TCTCTGGTTT TGGAGACGTT GGATTGAAAC TCGCTTTAGA
TACGTACAAT
ACGTATTTCC TATTAGTGAC AGATGGGGTC AAGACCA.ATA CCAATGATTC AATCAATGAA TCAGTGGAAT ATAGTAATGA CTACCAACGT GAAATTACTA ACCAAGGCTA TGAAATGATE AGTTCAGTGA ATTCATAC~TI TGATAAATA'I GAGTTrGCAAC AACGGTCTAG CACACCAGA.P TTI'ACAACCC AAITTAAAACA AATTGTCAAP GCTTCATTAA CGATTGCCAA
TCAATTTGAI
GGAAATrGATG TGCCTGT1'CA
AATTAACCGG;
TTTAAGAAAG
AATL'TCAGTC
ATGACCAATA
GGACAAGCTA
GAAGATGCTA
GTGG'ITCAAG
CAATI'ATGT
CGCGTCAATA
CGGACGTATG
CAAACACACG
GCTAATACAC
TATCCAGATC
CAGCAGCAG
AATGCGTA'Fr
*AAAACAGAAG
GA T1'=ATTA
LGATCGTCTGG
ATrCAATCTG
CAAACCATTI
CAACGAAATT
CCACATI'GGC
CGGTGAAAGT
CTTTCCCAGT
GTGGTAGTTT
GCT'TATCTGA
TTCCATGG
CGCA.ATTGAC
GTGGTACGCC
GAGATTTAAC
G=TAGATGG
CAAGACATCC
AAGTITTAGC
GGGAAAGT
TGGGTCCT'T
CAAGCCAATC
CGCAATCGAC
CGACCGCTAC
CAGCAACTAG
ATTATCGACA
TTTAGCTGAA
GAAAGACGAC
TCAAGCGGGT
TTCAGATAAT
TCAAGACCGC
AA.AGACAAAA
ETATGATAAA
AACCGCCCCA
GAATCGAAAA
TTACTTGCAT
TcTTCAAGTC
GTTAAACCAA
TCAATcTTTA
TTAAAACAA
TAT GATGAT
ACCACCAACA
GGACGATGCT
TACAGAAGGT
238 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faeculis Genes.
TACGTAGGAA
ACCCTTGTAA
ACGATTCCTA
ACAAAAGATA
TGGCATGTCA
GATGACAI1TA
GATGTTACAG
AAACAAGCAG
AA.AATTAAAA
CCCAACCAAG
ACCGTAACAC
T1'AGAT1'AA
GAAACAAGCA
ACTGATGTAA
CAAGAAAATA
AGT-GG-TCATA
GAATTAGCAC
AACGAAGGTC
GAAGATCCAP
GACCA.AGAAI
CAAGCCAGC;
GANGAAAATC
ACTTTI'ACTI
ATGACTAT'IU
GAACAAGGC(
CATTCCAACi
GAACCTAAAI
ACGA.ACCCAI
ACAACAGTA
TTTGG-CATA
ACATCACGAT
GTACTGGGAC
AAAATGACAA
TCGAAAATCA
AAAcAGcCTT
ATAAAGTGCT
CTAACGGCAC
ACAGCTATGA
CTGACGCAAC
CCGACTTAAA
CACCGCCAGT
cCAACCGCGA
CTTGGACCCA
AAGTCACAGA
ACAAACTAAC
CGTACACAAT
CTTATATTGA
ACGTGTI'GCP
CGATTACAAJ
TTA.AATGGA-Z
LTGGTGGATG;
GCAA-AGATM'
STGAACAAAAJ
~CCACTAAAA'
3 GGAITCCCAj k. AGCCAACCG'
AACCGCTAA
2TCAAITI'TGI NATCCAC TT.
A CAAAAAATA
TCACTACGAA
AATGAATCAA
TGCGCATGCG
AGAACACTTA
TGGCAACGAA
AGATATCATT
AGTAACACA-A
CTA'FIAAGT
GGACGAAGAA
CTTTGGCAAT
TGATCCAAAT
TCAACAATTT
AGCCAGCATG
TGAAAATGGI
TTTTGAAATC
GACCATTACI
ACAAGGTGGC
TTCCAACAAI
LAGATATCGAJ
CGTCAA.AACI
CATTAATA.AJ
STACAGATAAX
k AGATGACAG r TAAAACTGAI k CCAAGCCGA' r AACACCGCC' k. ACCGAAAAA 2AAAATCAGC A CATGATCC A AAAAAGAAA GTCAAkAGAAA
GGAACAA'FTG
TGTGACGTGA
GATrI'AACCA ACcAGTACTT
GATGTGAAAG
GAAAATAACA
GGTCATACGT
TTACCGCCTT
GAAGGTGACG
ATTGcTAAAG
KAATCA.ACG
GTAGATGACA
AAAGATGTTA
AACAANCAAG
ACTAAAATCA
ATTCCCAACC
CCAACCGTA-A
kGGCCAAGAAC k. GCTTTCGGTP
~GTGT'TAGAC;
r GGCATAGTA;
TACTCTTACI
GCAACGGAIX
TTAAACTTT
T GCACCAACG( A CCGTTGACGI A AGTAAAGGA A GGTTAATT A AATTAG ATACAGCGAT T CTAkAGGAAT
T
CGCCAGAAGA
TI
ATCCGTGAAGA GGACCCAAGC C TCACCGACGA AAGTAACTTT I ATACAATGAC 9J
ACATTGAACA
TGTTACATTIC
ACGTAGAAGG
TCA-AAACAGC
TTAATAAAGT
cAGCTAACGG
CNGACAGCTA
AAGCTAGCGC
AAGCCGACTT
CACCACCTGC
ATTTAGAT
ACGA.AACAAG
TCACAGACGT
CACAAGAAAA
TAGCTGGTCA
AA!GAATTAGC
SGCAACGAAGG
cAGAAGACCc 2CTACAAATCA N, TTCATTTACC G TCCTTATAGT
GATGCAGCA
CCAGAAGCG
'ccAACG-ATT
'AGTTTCGAT
AGCATGGTG
LAATGGTAAA
~GAAATGA.AC
~ATCACCACT
kGGCGGGATT
AACAAACCA
kCAAGAACAT
TTCGGTAAC
GTTAGACATC
:AAAGTA-ACA
TGACTAITI'A
AACGGACG-AA
GAACTTT GGC
ACCAACGCCA
AACCA-ACCGr
CACATGGACC
GAAAGTI'NCT
TAACAAAGTA
TACATACACA
GCCTTATATT
TGACGTGTTG
AAAAAAACCT
TCAAGCACCA
AATGACTAAT
GGCTATTAGC
EF128-2 (SEQ ID NO:478) Mr vvhqvPCT'M UT\JAGTVVCN FSPTLALAEE AVKAGDTEGM
TNTVKVKDI
LADCKRILEG
MLASYRGGKQ
LKLALDTYNQ
VEYSNDYQGA
LQQGSSTPED
NDVPVQINGQ
IPKNDNA}HAC
TINKVLDIID
IKTDATDEEL
OLTNPDQEFK
ENNKVTFEMN
EGDVLHSNKP
ASMVDDINKV
TITITKIKTDA
PKQPLKPKKP
GITKNKKRKN
QATFPVQAGE
FMFPDGKTKI
TI4GDLTNRKT
AAEVL.ALNQE
FITSQSIDDF
TISATSTEGY
DVTP-7DPTIT
VKVTDENGKD
APYIEQGGIP
WN'JKTAFGNE
XQADSYDYLS
TVTP PAPTPE
LDITDVYKVXX
TDEELAPYI E
LTPTNHQAPT
TEPVDLVVWE
NSADYDMNVR
YFLLVTDGVA
ITNQGYEMIN
TTQLKQIVKD
VGNITIHYEV
KDIENQEHLO
vTANGTVTQE
NQADLNFGNE
TSTWTQASMV
GHTYTMTITT
DPTITKDIEG
ENGKDVTDNG
QGGIPNQADL
NPVNFGKSAS
DASGSFSDNF
V~NTQLTYDKS
NTRLDGYJHK
AYWESVESLS
RLAQSTPATA
KENTAIDAAT
LTNREDSFDW
NNKVTFEMNK
GDVT-4-SNKPT
DDINKVLDIT
KIKASATDEE
QEHL.DLTNRD
IVTQENNKVT
NFGNEGDVLH
KGIHLPMTNT
QFVSGFGDVR
TNTNDS INEY svNsYFDXK
SLTIANQFDI
LVSsGTMNQG
HVKTAFGNET
QADSYDYLSG
VTPPPVDPNI
DVKVTDENGK
LAPYIEQGGI
QEFKWNVKTA
FTMM<KDDSY
SNKPTVTPPA
TVNPLYMIAG
VQGLSDQDRV
TYGGTPTAPG
PDPRHPLQVS
TEVGPFVKQE
QSATATDDAG
TIAKEFPEAT
STWTQASMVD
HTYTMTITTK
AKDVEGQEHL
DVTANGKVTQ
PNQADLNFGN
FGNETSTWjTQ
SYLJAGHTYTM
PTPEDPKKPE
LIVLIVAISF
EF128-3 (SEQ ID NO:479) 239 TABLE 1. Nucleotide arnd Arnino Acid Seqeuences of E. faecalis Genes.
AGA TGAAAATGGT AAAGATGTI'A CAGCTAACGG
CAAAGTAACA
CAAGAAAATA ACAAAGTAAC TITT-GA-AATG AACAANCAAG
CNGACAGCTA
AGTGGTCATA CGTACACAAT GACCATTACT ACTAAAATCA
AAGCTAGCGC
GAATTAGCAC C1TTATATTGA ACAAGGTGGC ATTCCCAACC
AAGCCGACTT
AACGAAGGTG ACGTG-TGCA TTCCAACAAA CCAACCGTA.A
CACCACCTGC
GAAGATrcCAA cGATTACAAA AGATATCGAA GCCAAGA-AC
ATTAGATTT'
GACCAAGAAT TI'AAATGGAA CGTCAAAACA GCTTTCGGTA
ACGAAACAAG
CAAGCCAGCA TGGTGCATGA CMTTAATAAA GTGTTAGACA
TCACAGACGT
GANGAAAATG GCAAAGATGT TACAGATAAT GGCATAGTAA
CACAAGAAAA
ACTTTTACTA TGA.ACAAAAA AGATGACAGC TACTCTTACT
TAGCT.GGTCA
ATGACTAT'rA CCACTAAAAT TAAAACTGAC GCAACGGATG
AAGAATTAGC
GAACAAGGCG GGATTCCCAA CCAAG.CCGAC -iTAAAcTTTG
GCAACGAAGG
CAITCCAACA AGCCAACCGT AACACCGCCT GCACCAACGC
CAGAAGACCC
GAACCTAAAC AACCGCTAAA ACCGAAAAAA CCGTI'GACGC
CTACAAATCA
ACGA.ACCCAG TCAA'TrrGG AAAATCAGCA AGTAAAGGAA
TTCAT
TGACTATTTA
AACGGACCAA
GAACTTTGGC
ACCAACGCCA
AACCAACCGT
CACATGGACC
GAAAGTTNCT
TAACAAACTA
TACATACACA
GCCTTATATT
TGACGTGTTG
AAAAAAACCT
TCAAGCACCA
EF128-4 (SEQ ID NO:480) DENGK DVTANGKVTQ ENMKVTFEMN
XQADSYDYLS
EGDVLHSNKP
TVTPPAPTPE
ASMVDDINKV
LDITDVKVXX
TITTKIKTDA
TDEELAPYIE
PKQPLKP~kP
LTPTNHQAPT
GHTYTMT ITT
DPTITKDIEG
ENGKEDvTDNG QGG IPNQADL
NPVNFGKSAS
KIKASATDEE
QEHLDLTNRD
I'JTQENNKVT
NFGNEGDVLH
KGI H LAPY IEQGGI
QEFKWNVKTA
FTMNKKDDSY
sNI(PTvTPPA
PNQADLNFGN
FGNETSTWTQ
SyLAGHTYTM
PTPEDPKKPE
*0S EF129-1 (SEQ ID NQ:481)
TGACAAGTGA
ATTGCTACCC
CAAGAAATT
GAAGCAGAAG
CTAAAAGCAA
AAACGTAACC
ACAATGCTAG
GCTGTTTCAA
CAAGCCCTTG
GAAGCTGAAT
ATGAAAGCTT
AAACAAAAAG
GCTGAAAAAG
CCAGTTGCCIl
GAATCAAGCP
GAAAATACTIC
GGAAATAATC
GCGCCTTCTC
CGTCCAGTAC
GAAGCAGGT(
TGGGCGCCA(
TCAGGAAGCC
TACTCAGGT
AGAAACGTCT
cAAGCATCGC
CATCAFI'AAA
TATC'ITCAGT
AATCAGAACA
AAGCAATCAA
ATGCAGTTTTi~i
CAATCGTAAG
TTGATAAAAA
TAGAAACAAA
CATTAGCATT
CAGCTGCTGA
CCAA.ACAAGC
CTTCATCAAC
CGCAACAAAC
GCTCTTCTTC
GTGGCCAAAC
CTGATCCAAC
TATGGGATIGC
;GCATTCCAAA
3GTAACTCAG'I
GTCACCGTGP
k. GCACAATCGI
ATGCATCA
TITGGCGGAC
AGCAAAACAA.
ATTTGATGAA
ATTACAACAA
AAATCAAGCA
AGATGCGGAC
TGCCAACAAC
AGCTGAAAAC
ACGTCAAGAT
AGAACAATCA
AGCAGAGCAA
TGCTGCAAAA
AACAGAAGCA
AACTGAAACA
ATCAGAACAA
TGGTGGTGGP
AATCAATGC;
AGGTTGC;
*TGATCACTGC
*AATCATGGCC
LTTGGGAAAT'
'AGGACACTC
GTATTACTAT
AATGTTGATA
GGGGATTTAG
AGCATGGCTT
GAAATI'ACAA
CGTGATGTTC
TCAGTTGCAG
GACTTAATGC
GAGAAAAAAG
TTACTITCTA
TCAGCTGAAA
GCACGCTTAG
CCAGCTAAAG
*CAAGCACCAG
*ACTACACCAA
CCAGTACAAC
ACAGTTACAC
TTGAACGTTC
GCTTCTGCA?
TCTCGTGGA(
STGGTACAATC
C AACCCAGGTj k GCCTAA GTTCATTAAC C
AAAAAATTGA
C ETCACAAGT
TACGTGAACA
ACTI'GAATCA
AAGT'rAATGG
ATGCAATCAG
AACAACAAAA
TGAAACAACT
A-ACAATCTGA
GTTCTAAAGC
CTGCTGAACA
CTGAAGTGAA
CAAGCTCAAG
GTACAGATAA
CTACAACACC
CAACACCAGA
TACGTCAATC
CTGCTCGTGC
ATGAAGTTAT
AAACAAACAT
T 'TACGCGTGT
;CTATCAGCA
kGAAAAAAAT k.TCTTcTTTA kAAGCAAACA
ACGTATTGAA
ACAAAGCACA
:cGTGTTCAA
AGAAGACAAA
TGAAGCAACA
ATTAAACGTA
TGGCTTAGAA
AAAAGCTGCA
AGCAGAAGCA
CTCAGCAACT
TAGTGcAACA
AAGCGATAAT
ACCAAcACCA
ATTAGGT'ITA
ACCACAAGTT
CGCAATTATG-
GGTAACAGCT
CGGTTGGT
EF129-2 (SEQ ID NO:482) 240 TABLE 1. Nucleotidc and Amino Acid Seqeuences of E. faecalis Genes.
VYX-KPLFASV LLCSLTLSAI ATPSIALAEN VDKKIEEKNQ EISSLKAXQG DLASQVSSLE
AEVSSVFDES
MLDAVLDADS
AELETKRQDL
EKAKQAAAKP
NTGSSSSEQP
PVVWDAGLAA
GSGHROWEIN
MALREQKQTL
VADAISRVQA
LSKQSELNVM
AKAEVXAEAP
VQPTTPSDNG
SATARAAQVE
PG ITRVGFGY
KAKSEQLQQE
VSTIVSANND
KASLALEQSS
VASSSI TEAQ
NNGGQTGGGT
AGGIPNDHWS
SGSTIVGHSA
ITNLNQRIEK
LMQQQKEDKQ
AESSYXAGLEK
APASSS SATE
VTPTPEPTPA
RGDEVIAIMW
AWVDKKAENE
QKAAAEAEQA
SSTQQTTETI'
P SAD PT INAL
APGNSVIMAW
KKVKQLEATE
RLAAEQKAAA
TPSTONSATE
NVLRQSLGLR
YNETNMVTAS
EF129-3 (SEQ ID NO:483) GGAC AATGTTGATA AAAAAATTGA AGAAAAAAT--
CAAGAAATTT
GAACCAGAAG
CTAAAAGCAA
AAACGTAACG
ACAATGCTAG
GCTGTTTCAA
CAAGCCGTTG
CAAGCTGAAT
ATGAAAGCTI'
AAACAAAAAG
GCTCAAAA.AG
CCAGTTGCCT
GAATCAAGCA
GAAAATACTG
GGAAATAATC
GCGCCTI'CTG
CGTCCAGTAG
CAAGCAGGTG
TGGGCGCCAG
TCAGGAAGCG
CATCATTAAA
TATCTTCAGT
AATCAGAACA
AAGCAATCAA
ATGCAGTI=
CAATCGTAAG
TTGATAAAAA
TAGAAACAAA
CATTAGCATT
CAGCTGCTGA
CCAAACAAGC
CTTCATCAAC
CGCAACAAAC
GCTCTTCTTC
GTGGCCAAAC
CTGATCCAAC
TATGGGATGC
GCATTCCAAA
GTAACTCAGT
GTCACCGTGA
a
C
a a a a.
AGCAAAACAA
ATTT'GATGAA
ATTACAACAA
AAATCAAGCA
AGATGCGGAC
TGCCAACAAC
AGCT'AAAAC
ACGTCAAGAT
AGAACAATCA
AGCAGAGCAA
TGCTGCAAAA
AACAGAAGCA
AACTGAAACA
ATCAGAACAA
TGGTGGTGGA
AATcAATGCA
AGG=TGGCA
TGATCACTGG
AATCATGGCC
TTGGGAAATIl
AGGACACTC.P
GGGGATTI'AG
AGCATGGCTT
GAAATTACAA
CGTGATG~rC
TCAGTTGCAG
GACI-rAAT'GC
GAGAAA.AAAG
TTACTTTCTA
TCAGCTGAAA
GCACGCTTAG
CCAGCTAAAG
CAAGCACCAG
ACTACACCAA
CCAGTACAAC
ACAGTTACAC
'rAACG~rC GCTrCTGCAA
TCTCGTGGAG
TrGGTACAATG
AACCCAGGTA
GCC
CTTCACAAGT
TACGTGAACA
ACTTGAATCA
AAGTTAATGG
ATGCAATCAG
AACALACAAAA
TGAAACAACT
AACA.ATCTGA
GTTCTAAAGC
CTGCTGAACA
CTGAAGTGA-A
CAAGCTCAAG
GTACAGATAA
CTACAACACC
CAACACCAGA
TACGTCAATC
CTGCTCGTGC
ATGAAGTTAT
AA.ACAAACAT
TTACGCGTGT
ATCTTCTTTA
AAAGCAAACA
ACGTATTGAA
ACAAAGCACA
CCGTG'ITCAA
AGAAGACAAA
TGAAGCA6ACA
ATTAAACGTA
TGGC'ITAGAA
AAA.AGCTGCA
AGCAGAAGCA
CTCAGCAACT
TAGTGCAACA
AAGCGATAAT
ACCAACACCA
ATTAGGTTTA
AGCACAAGT'
CGcAAITATG
GGTAACAGCT
CGGTT1TT1GGT TACTCAGGTA
GCACAATCGT
EF129-4 (SEQ ID NO:484) DN VDKKIEEKNQ EISSLKAKQG DLASQVSSLE
AEVSSVFDES
MLDAVLDADS
AELETKRQDL
EYXkKQAAAKP
NTGSSSSEQP
PVVWDAGLAA
GSGHRDWEIN
MALREQKQTL
VADAISRVQA
LSKQSELNVM
AKAEVKAEAP
'JQPTTPSDNG
SATARAAQVE-
PGITRVGFGY
KAXSEQLQQE
VSTIVSANND
KASLALEQSS
VASSSTTEAQ
NNGGQTGGGT
AGGIPNDHWS
SGSTIVGHSA
ITNLNQRIEK
LMQQQKEDKQ
A.ESSKAGLEJ(
APASSSSATE
V'TPTPEPTA
RGDEVIAIMW
RNE-AIXNQAR
AVVDKAENE
QKAAAEAEQA
SSTQQTTETT
PSADPTINAL
APGNSVIMAW
DVQVNGQSTT
KKVKQLEATE
RLAAEQKAAA
TPSTDNSATE
NVLRQSLGLR
YNETNMVTAS
EF130-1 (SEQ ID NO:485)
TGATACATTA
TI'AATCAACA
CAGATACGTA
CCAGCTGATG
TCCTTGAGCA
GGTGCCATTG
GTCGCTTTAT
AAAGGAGGGA
GTTTATTAGT
GTTGGGTGGT
TGAAGAAAAA
CAGAAGCGGT
CGATACCMAG
TAACTGGTGC
AAATATGCGC
TTTACTATTT
TCAACAAAAT
TATGGCTCGT
GATGAAACCC
TGTCGAAATT
CGGGACCATG
CCAAAAGAGA
ATCATTGGCT
ACCCCCTCGT
GAAACAACGT
CAT'rAAA
AATI'TGCCCA
AAAGAAGATC
AAAAAAGAGG
TAGCcTTAAT
ACGCCGTTAG
TTGAC TI'GA
ACAAAAACT
TTTTTAAAGG
AAGTCATGCG
AAAAAATTGG
TTTTAAcAAT
CAAGTTGAAA
TTcAGTTGAG
ACCTGTGATIT
ATTGTCCAAT
GAAAAACAAT
241 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
TATGCCITrGG
ACCAAAAAAG
ACCT'GTAG
AATATGATTA
GGAACATTIAG
CAATTGGAGC
CTAGTCATCG
ACGAACTCAT
AAAAAATCGA
CCTTAAMTAC
CAGCAACGAC
AAAAAACTT
AACGGA-AGAT
TTATATCACT
ACCAACCCCT
CTGTGGCGAT
GCCTATLAAA
AGCCGATTGG
GGCGTT'TTCCT
GATTTATCTA
G'ITGAGTTAA
TTACAAGCAA
GACGCCAACG
GTGGCTTAA
TATTTCACC
CTGTTTATAC
TGATGACGT
CGACGCGAAT
ACGATATGTT
TTTAGAAAGA
ATACAAAATA
TCCTGGTCAA
TGCTGTTCAA
GAAGGC=TC
EF130-2 (SEQ ID NO:486)
YIKRRENMRP
ADVKKNMARE
ALLTGAGTMK
SVEKIEPTRV
LEQXTLADWV
KE-KKRGKNWL
TTFDFDSVES
EDQVMGKNNY
ELI DDVPGQN INSL.LVLLFI IGLALIFNNQ LSTEAvbMUAQFENKNLPVIG ALASHRTEDG VSLFSPLERT MITLITCGDL QATTRIAVQG I RSWVVQQNS AIAI PSVEIN
KKIDELIYITD
TLAATP IKD
RSYAVSKLKP
LPIFKGLSNV
LSTVYTYK IT
ANDDMLKAFQ
EF130-3 (SEQ ID NO:487)
C
CGITAG CAAGTTGAAA CCAGCTGATG TCAAGAAAAA TCCI'TGAGCA CAGAAGCGGT GGTGCCATTG CGATACCAAG GTCGCTTTAT TAACTGGTC TATGCCTTGG CTAGTCATCG ACCAAAAAAG ACGAACTCAT ACTTCTCTAG AAAAAATCCA A.ATATGATTA CCITAATTAC GGAACATTAG CAGCAACGAC CAATTGGAGC AAAAAACTTT
TATGGCTCGT
GATGAAAGCC
TGTCGAAATT
CGGGACCATG
AACGGAAGAT
TI'ATATCACT
ACCAACCCGT
CTGTGGCGAT
GCCTATTAAA
AGCCGATTGG
GAAACAACGT
CAATTTGAA.A
AATITIGCCCA
AAAGAAGATC
GCCCTTTCCT
GATI'TATCTA
GTTGAGTTAA
TTACAAGCAA
GACGCCAACG
GTGGCT
TTGACT-TTGA
ACAAAAACTT
TTTTTAAAGG
AAGTCATGGG
TAT'ITTCACC
CT--TTTATAC
TGATGACGT
CGACGCGAAT
ACC-ATATGTI'
TTCAGTTGAG
ACCTGTGATT
AFI'GTCCAAT
GAAAAACAAT
'TI'AGAAAGA
ATACAAAATA
TCCTGGTCAA
TGCTGTTCA-A
GAAGGCTI'TC
EF130-4 (SEQ ID NO:488)
VSKLKP
ADVKKNMARE TTFDFDSVE.S LSTE-AVM(AQ FENKNLPVIG AIAIPSVEIN
LPIFKGLSNV
ALLTGAGTMK EDQVMGKNNY ALASHRTEDG VSLFSPLIERT KKDELIYITD
LSTVYTYKIT
SVEKIEPTRV ELIDDVPGQN MITLITCGDL QATTRIAVQG TLAATTPIKD ANDDMLKAFQ LEQKTL.ADWV A EF131-1 (SEQ ID NO:489)
TAGGCGCAGG
TGGCTITTTA
TTCT'TI'CA
CGCCGAACTG
CTI'GATCAAA
CAGGTTTTAG
CCAAACACAA
AA.AGCAATTG
ATTCAAGGTG
TTATCAGCAA
AATACCATTC
GCAGGAATGT
CACAAAACGG
AATTCAACAA
TAACGTAT
TAGTATGITI'
CTAGAGAT'C
ATAA'ITATGC
AA-ATI'CAAGA
TAAATAAAGG
GGTTTCAGAT
AAGAAGGTAA
CTGAGGATAT
TGCCTAATAA
AAGTCAATAA
TAGAGAAAAT
CTIGGTTTAAA
GTrATAAATG
GCGTAA.ACGA
GGCTGGTG
ACAAGTTAGT
GAA'ITTAACC
AAcAAATTAT ATATGGcT'TT
TGGCTCAATT
ACT'TACATA
TACGATTAGC
TATCGTTACC
AGGAAAATAC
GTATCAACGT
nAArTTTGGC
CATGCAAAGA
AT1'GGTCGTA
CAAGAATCGA
AAAGAAATAG
ATflGGI'CGG
GCCAATTTTG
CAAAA.ATCTT
GATACAAAAC
GATATGTTGA
GATGAAGAAA
AATTATTCCC
ACCTATCA.AG
'TCTATGAAA
AAAGACAiG
GTCTTATTT
AAGTGGTCrI'
TTGCACCAGA
CTTTGATCAT
AAAAGCAACA
TTACCACA-AC
TCGCTACGI'
ATATGACAAG
TTATTCAATT
CAGTAAATTT
AITTA'TI'AA
CCTTATTGGA
ACCA.AGTGCT
AGGAGTGAAT
AATAAAAACG
GGAAGAAGAT
TAGTGGCGAA
TAAAr-ATGAT
AGCCAACACG
CTTGATCTTA
TTATCCGCAA
TGGT'IrAAAG
TGTTAAACAA
TGTCCTTA
TAATCTTTAT
ACAGCCCAAT
CTCAATTCCT
GACAGAAGAT AATTCATATA 242 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes.
GCAGCTAG?1'
TATTGGTACT
TTATGGACGT
TTACG'ITTAC
AAGACAGGGG
GGTTCGTTGT
TTGCCCATGA
TACATcAATT
CTTCTCAGCA
ACGGCGTTGA
TCATATTGCT
TCCATGATGT
A'FTTGGGACT
AACGAGTCGA
AAGCTCTTAT
AGCGGGTCA.A
AACTAACTGT
AACCAATTTA
GGTAATGTGG
cATTTAGTTT
CATMGCGGCA
CAAGCCCTGG
GTGAATCCAG
ACTGTTAA-AT
ATATGACGAC
CCACCGCACT
TCTATGTTCA
T=rATTTTC
CGAAATACAA
TTTAA
AGGTGATG
TTTGCAAAAA
TGATAATTAT
AAAAGATATG
AGAATTAATT
EF131-2 (SEQ 10 NO:490) MRKRH AKRRHGGVNW LFIVCLLVVI GGSGYLIKTF FFTRDSQVSQ ESKVVLEEOR RSDNYANLTI( EIVAPDSGEL DQKIQETNYI GSALIIKDDQ VLVNKGYGFA
NFEKQQANTP
NTRFQIGSIQ KSFTTTLILK AIEEGKLTLD TKLATFYPQI QGAEDIT7SD MLNMTSGLKL SANPNNIVTD EEIIQFVKQN TIQVNKGKYN YSPVNFVLLA GMLIEK14YQRT
YQELFNNLYH
KTAGLKNFGF YETLLEQPNN STSYKWTEDN SYNQVLSIPA ASFA4EFGTG,
NVDMTTGDLY
WYLHQLTSGH LVSTALLQKL WTSSQQSSYH GGIVVHDNYL RLHGVEAGQQ
ALVLFSKDMK
TGVILLTNCV NPAKYKELIG SLFHDVTNLT VKF EF131-3 (SEQ ID NO:.491) a.
a a TTT AATAAAAACG TTCTTTI'TCA CTA( CGCCGAAGTG ATAj CTTrGATCAAA AAA' CAGGTTTTAG TA CCAAACACAA GGT AAAGCAATTG AAG A TTCAAGGTG CTG TTATCAGCAA TGC AATACCATTC AAG GCAGGAATGT TAG CACAAAACGG CTG AATTCAACAA GTT GCAGCTAGTT TTG TATTOCTACT TAC TTATGGACGT CTI TTACGTT'AC ACG AAGACAGGGG TCA GGTTCGTTGT TCC
~AGATTC
TTATCC
TTCAAGA
FTAAAGG
ITCAGAT
AGGTAA
AGGATAT
CTAATAA
TCAATAA
AGAAAAT
GTTTAAA
'ATAAATG
C CCATGA
ACAAGTTAGT
GAATTTAACG
AACAAATTAT
ATATGGCT
TGGCTCAATT
ACTITACA'ITA
TACGATTAGC
TATCGTTACC
AGGAAA.ATAC
GTATCAACGT
GAATTI'TGGC
GACAGAAGAT
ATTTGGGACT
CAAGAATCGA
AAAGAAATAG
ATI'GGTTCGG
GCCAATTTTG
CAAAAATCT
GATACAAAAC
GATATGTTGA
GATCAAGAA
AATrATTCCC
ACCTATCAAG
TTCTATGAAA
AATL'CATATA
GGTAATGTG
CAT
T
ITAGTIT
CATGGCGGCA
CAAGCCCTGG
GTGAATCCAG
ACTGITAAAT
AAGTGGTCT~r =rCACCAGA C ITGATCAT
AAAAGCAACA
TTACCACAAC
TCGCTACGTI'
ATATGACAAG
TTATTCAATTl
CAGTAAATTT
AATATTrAA
CCTTATTGGA
ACCAAGTGCT
ATATGACGAC
CCACCGCACT
TCTATGTTCA
T=IATmTC
CGAAATACAA
TT
GGAAGAAGAT
TAGTGGCGAA
TAAAGATGAT
AGCCAACACG
CTTGATCTTA
TTATCCGCALA
TGGTTTAAAG
TGTTAAACAA
TGTCCT'FTTA
TAATCTTTAT
ACAGCCCAAT
CTCAATTCCT
AGGTGATITG
TTTIGCAAAAA
TGATAATTAT
AAAAGATATG
AGAATTAAT'
:ATCAATT AACGAGTGGA CTCAGCA AAGCTCTTAT ~GCGTTGA AGCGGGTCAA LTATTGCT AACTAACTGT :ATGATGT AACCAATTTA EF131-4 (SEQ ID NO:492) LIKTF FFTRDSQVSQ ESKVVLEEDR RSDNYANLTK EIVAPDSGEL DQKIQETNYI NTRFQIGSIQ KSFTTTLILK AIEEGKLTLD SAMPNNIVTD EEIIQFVKQN TIQVNKGKYN KTAGLKNFGF YETLLEQPNN STSYKWTEDN WYLHQLTSGH LVSTALLQKL WTSSQQSSYH TGVILJTNCV NPAKYKELIG SLFHDVTNLT
GSALIIKDDQ
T(LATFYPQI
YSPVNFVLLA
SYNQVLS IPA
GGIYVHDNYL
VKF
VLVNKGYGFA
QGAEDITISD
GMLE<YYQRT
ASFAHEFGTG
RLHGVEAGQQ
NFEKQQANTP
MLNMTSGLKL
YQELFNNLYH
NvDmTTGDLY
ALVLFSKDMK
EF132-1 (SEQ ID NO:493) TA~~iTACCCAACAATTTAAAGAGGGTGTTAGGAAGA-GAT
GTGAGCGGAGTATCCTTAATG~-AGTACATGAAAAAAAAT
243 TABLE 1. N'ucleotide and Amino Acid Seqeuences of E. faecalis Genes.
GCTGAGAATAAGATGTCATTGTTACGTTATAATTGAAAAA,
ATTGArACACAGTATCGTACCTATTAGATCCCCACGATAGACC tTTGCCTGAAGATGTTCAAAAAACT mATCCGAACAGAGGAAACAAAGACTATTTTGCAGCGTATGGCATAcTATATACTTAGAGccTCAGAcT
GAA-=AGAACCAGTGTAATAAAAGTTATAGTAATTGAATG
TTGGAA-GTCrAATAAATCAAAAATTGTATTTGAATGATTT GAAAAGTAACAATGCTATCGAGTAAATGTGTCATAGA CrtAAA EF132-2 (SEQ ID NO:494) MMRKWVVVGSLMLIALF I FGACSTNSKDKDTVASNEKLIKVVTNS I LAD ITEN IAKDK I DLHS IVP IGKD PH EYEP LPEDVQKTSKADL IFYNGVNLXTGrNAWFTYKLVKXANKEEN(DYFAASDG
IDVIYLEGQSEKGKEDPHAWLNLENGII
YAXN IEKWLAEKDPDNKKFYKENLDKYI EKLDSLDKEAKSKFAS IPNDKKMIVTSEGCFKYFSKAYNVPSAY IWE INT EEEGTPDQ IK}{LVEKLRTTKVPSLFVES SVDDRPMKTVSKTI P IYSTIFTDS IAEKGQDGDSYYAMMKflWNLDK
IAE
GLSK.
EF132-3 (SEQ ID NO:495) ATATCAACCtTrCT
GTTCAAACMAACGTTATTTAACGGTATG~A
TGAGATC*GTTCAATATA*nTCGAAAAGAACAAATTTTCGAGG
TGCTGTTATATTGGGCGGGGAGGAAGAACCAGCTGTAATAAA
CGTTATAGTAATTCAATGTGG~&-AACTAATAAATTTAGA-
TCAAkGAATAALTGATTTGCAGACAACAATGTCATCATAA A*AAGTGTCATG.GAGTtATTTCCAACTTAGGCTCGTAATGG **AATTGCTGAAGGCCTrTCGAA-A EF132-4 (SEQ ID NO:496) CSTNSKDKDTVASNEKLKVVVTNS ILADITENIAKDKIDLHS IVP IGKDPHEYEPLPEDVQKTSKADL
IFYNGVNLXT
G-GNAWFTKLVKXANKEENKDYFAASDGI DVIYLEGQSEKGKEDPHAWLNLENGI IYAXNI EKWLAEKDPDNKKFYKEN *LDKY IEKLDSLDKEAKSKFAS I PNDKKMIVTSEGCFKYFS(AYiVP SAY IWE INTEEEGTPDQ IKHLVEKLRTTKVPS LFVESSVDDRPMKTVSKDTNI PIYSTIFTDSIAEKGQDGSYYAMKWNLDKIAErCLSK 9 9* S *9 S Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in Genlank and Derwent databases.
Query GenBank GenBank Ge ne Dcscription BLAST BLAST Access. No. Score P-Value EF002-2 gi12338759 (AF0 18073) periplasmic sorbi tol -binding protein; SmoE 113 3.6013-18 Rhodobacter EF003-2 gil 1552773 hypothetical [Escherichia coli] >gnlIDId 10 12634 hypothetical 278 1 .20E-53 EF003-2 gi12196996 lipoprotein homolog [Treponema palliduni] >giJ2 108234 29K 309 3.30E-44 EF003-2 gil46649 lipoprotein-28 precursor [Escherichia coli] >gi12905 10 263 9.20E-40 EF003-2 gi1148838 28 3kDa membrane protein [Haemophilus influenzae] 197 2. 10E-39 EF003-2 giJ1573614 28 kDa membrane protein (hIpA) [Haemopliilus influenzae] 197 7.80E-39 EF003-2 gi123 14748 (A.E000654) outer membrane protein [Helicobacter pylori] 263 4.60E'--37 EF003-2 gi13495 30 _lipoprotein [Pasteurella haemnolytica] >gill 50508 lipoprotein 189 4.i1OE-29 EF003-2 I lPID el 18435 similar to hypothetical proteins [Bacillus subtilis] 158 2.70E-26 EF003-2 giJ349532 lipoprotein [Pasteurella haemolytical >pirl1N075311N0753 outer 200 1.2013-25 EF003-2 gi11336657 lipoprotein [Bacillus subtilis] 182 2.70E-25 EF003-2 gnljPlDje233873 hypothetical protein [Bacillus subtilis] >gnljPlDjel 182900 186 1.3013-23 EF003-2 giJ294071 lipoprotein 3 [Pasteurella haemnolytica] 199 '6.60E-23 EF003-2 giJ34953 I lipoprotein [Pasteurella haemolytica] >pirIJN0752IJN0752 outer 198 1.30E-20 EF003-2 gil294070 __lipoprotein 2 [Pasteurella haemnolytica] 198 1.80E-20 EF005-2 giJ537235 Kenn Rudd identifies as gpmB [Escherichia coli] >gill 790856 127 6.20E- 12 EF006-2 gi1155277 3 hypothetical [Escherichia coli] >gnlPD~d 10 12634 hypothetical 255 1.40E-60 _29.4 1 0 OOe 0* 0* 0** 000 0 09 0 0 0 0*0 0 0 0 0S 0 0 0 00 0 0 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF006-2 giJ349532 lipoprotein [Pasteurella haemolytica] >pirjJN0753iINO753 outer 221 6.40E-49 EF006-2 gil23 14748 (AE000654) outer membrane protein [Ilelicobacter pylori] 283 2.70E-48 EF006-2 g11219699 6 lipoprotein homolog [Treponema pallidumi] >giJ2 108234 29K 267 4.4013-47 EF006-2 gn IPIDIel 18435 similar to hypothetical proteins [Bacillus subtilis] 359 1.80E-44 EF006-2 giJ3 4 95 3 1 lipoprotein [Pasteurella. haemolytical >pirj1N0752jfN0752 outer 218 3.8013-41 EF006-2 g11294071_ lipoprotein 3 [Pasteurella haemolytica] 220 2.30E-38 EF006-21 gil 146 6 49 lipoprotein-28 precursor [Escherichia coli] >gi12905 1 0 193 2.60E-38 EF006-2 giJ294070 lipoprotein 2 [Pasteurella haemolytical 218 1 .2013-36 EF006-2 gil 148838 28 kA membrane protein [Haemophilus influenzae] 112 8.50E-34 EF006-2 gil 1573 6 1 4 28 kDa membrane protein (hipA) [Haemophilus influenzae] I112 1.50E-"3 EF006-2 gil3495 30 lipoprotein [Pasteurella haemolytical >gill150508 lipoprotein 114 4.30E,-29 EF006-2 gi12940 69 lipoprotein 1 [Pasteurella haemolytical 114 1.3k0-27 EF006-2 gi1133 665 7 lipoprotein [Bacillus subtilis] 202 2.10E-26 EF006-2 gri1IPlDIe23387 hypothetical protein [Bacillus subtilis] >gnljPIDjeI 182900 200 6.5013-25 EF008-2 gif49301 7 endocarditis specific antigen [Enterococcus faecalis] 1590 2.7013-211I EF008-2 giJ3932 6 9 adhesion protein [Streptococcus pneumoniae] 986 1.8013-129 EF008-2 gil 1538 3 4 adhesin specific for salivary pellicle of dental surfaces 973 1.0013-127 EFOO8-2. gil 1575030 surface adhesin A precursor [Streptococcus pneumoniael 934 .2.90E-126 EF008-2 gi1153 826 adhesin B [Streptococcus sanguis] >pir1A435831A43583 adhesin 916 3.90E- 126 EF008-2 gill 1184932 jScbA [Streptococcus crista] 915 3.40E- 125 EF008-2 gill 1117994 Isurface antigen A variant precursor [Streptococcus neurnoniae] 917 15.60E- 124
S
S
S
0 S S S
S
S S S S S S S S S S *5 55 S Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF008-2 giJ3 10633 adhesin [Streptococcus gordonii] 891 6.OOE- 122 EF008-2 gn1lPIDe255529 lipoprotein [Staphylococcus epidermidis] 476 1.20E-99 EF008-2 giJ 1573330 adhesin B precursor (fim.A) [Haemophilus influenzae] 380 1.60E-68 EF008-2 giJ1245 464 YfeA [Yersinia pestis] >gi11 2 4546 4 YfeA [Yersinia pestis] 355 l,20E-64 EF008-2 giJ755O 7 5 periplasmic-binding protein [Synechocystis sp.] 321 1 .70E-62 18652 Mn EFOO8-2, g1133591 2 EwIA [Erysipelotluix rhusiopathiael 232 4.40E-42 EF008-2 guliPIDlel 18595 similar to ABC transporter (membrane protein) [Bacillus 204 4.10E-38 EF008-2 gi117779 3 3 TroA [Treponema pallidum] 181 2.4013-35 EFOO9-2 giJ34953 I lipoprotein [Pasteurella haemolytica] >pirlJN0752jJN0752 outer 391 4.OOE-64 EF009-2 giIl5527 73 hypothetical [Escherichia coli] >gnljPIDjd 10 12634 hypothetical 359 1.90E-63 29.4 EF009-2 gi129 4070 lipoprotein 2 [Pasteurella haenolytica] 391 6.40E-63 EF009-2 gi3 4953 2 lipoprotein [Pasteurella haemolytica] >pirIlu'.u 75.IJINU 0- o uter a Ou EF009-2 gi1148 838 28 3kDa membrane protein (Haemnophilus influenzae] 286 5__60E_60 EF009-2 gij 1573614 28 kDa membrane protein (hIpA) [Haenoph ilus influenzae] 286 .7.60E-60 EF0Q9-2 gi129 4 069 lipoprotein 1 [Pasteurelia haenolytical 122 4.70E-59 EF009-2 giJ146 649 lipoprotein-2 8 precursor [Eschericbia coli] >g12905 10 326 12.20E-58 EF009-2 giJ349 530 lipoprotein [Pasteurella haemol)tica] >gillSOSO 8 lipoprotein 239 _I7.80E-57 EF009-2 giJ29 4 0 7 1 lipoprotein 3 [Pasteurella haemolytica] 344 4.90E-56 EF0O9-2j gi123 14748 I(AE000654) outer membrane protein [Helicobacter pylori] 319 4.20E-53 I r onma nalidiiml >i12108234 29K 312 2.601341 EF00U9-2 1,4 1 VOYYD protein LLIIJ A
__I
A a a a a a a a a a a a a. a a a a a a a *a a a a. .a a Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF009-2 gil 1336657 lipoprotein [Bacillus subtilis] 234 4.0013-32 EF009-2 gn1lPIDle233873 hypothetical protein [Bacillus subtilis] >gnlIPlDI 182900 242 1.40E3-31 EF009-2 gnliPlDjel 18435 similar to hypothetical proteins [Bacillus subtilis] 102 6.80E-22 EFOl 11-2 gnIPlD~dl096 ferric anguibactin-binding protein precusor FatB of V. 579 3.1013-98 F0 11-2 gnlPIDjdIO096 ferric anguibactin-binding protein precusor Fatl3 of V. 579 3.1013-98 EFO 11-2 gnljPlDjel85374 ceuE gene product [Campylobacter colil 284 1.30E3-89 EFOI 1-2 gnljPIDIl537 4 ceuE gene product [Campylobacter ccli] 284 1.30E3-89 EFOl 11-2 gi1 150756 40 kDa protein [Plasmid piMI] >pirlA29928lA29928 222 2.80E-52 membrane-associated EFOl 1-2 gil 15075 6 40 kDa protein [Plasmid pJMl] >pirlA299281A29928 222 2.80E-42 membrane-associated E3F012-2 giJ3O9662 pheromone binding protein [Plasmid pCF 10] 266 8.70E-1 116 >pirjB53309IB53309_____ EF012-2 giJ3882 69 traC [Plasinid pAD I) >pirlA5331I0lA533 10 pheromone cADi1 252 1.101-109 ~~binding EFOI12-2 gnljPlDjdlOl 18 TRAC [Enterococcus faecalis] 281 3.60E3-103 EF0 12-2 gn]IP]Dld 10065 TraC [Enterococcus faecalis] 277 2.3013-102 F0 12-2 giJ3 12940 threonine kinase [Streptococcus equisimilis] 227 1.90E3-67 >pirIS28 153IS281 53 EF012-21 giJ48808 jdciA-E [Bacillus subtilis] 228 11.70E-46 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EFO 12-2 pirISI 665 1 IS166 dciA.E protein Bacillus subtilis 228 1.OOE-45 EFOl12-2 gnlIDel 18149 (AJ00257 1) DppE [Bacillus subtilis] >gnilIPIDIel 183316 228 3.80E-45 EF012-2 giJ40005 OppA gene product [Bacillus subtilis] 281 3.90E-44 EFO 12-2 giJ1 4 360 3 sporulation protein [Bacillus subtilis] >gn1 PIDe e1183 163 281 7.70E-44 EFOl12-2 gnllPlDldl0156 Periplasmic oligopeptide-binding protein precursor. 152 2.20E-43 3 EF012-2 giI157 46 79 oligopeptide binding protein (oppA) [Haemophilus influenzae] 178 2.20E-42 EFOl12-2 giJ47802 Opp A (AA 1-542) [Salmonella typhimuriumi >giJ478O8 128 1.OOE-37 EFOl12-2 giJ88 2 55O ORFJf535 tEscherichia coli] >gi1 1789397 (AE000384) [535; 228 5.3013-36 This 535 aa EF014-2 pirID7007OlD7O transcriptional regulator homolog ywtF Bacillus subtilis 101 1.40E-J7 0_ EFO 14 2 gnjPlDe 116988 capsular polysaccharide synthesis protein (Streptococcus 121 9.50E-.27 EFO 14-2 gi1280 4 769 (AF030373) putative regulatory protein [Streptococcus 121 9.50E-27 EFO 14-2 gnllPIlDe28 9126 1unknown [Streptococcus pneumoniae] 121 l.OO13-24 EFOI14-2 giJ2267239 ORFI [Staphylococcus epidermidis] 234 .1.50E-24 EFOI14-2 gi148527 5 _putative regulatory protein [Streptococcus pneunoiniael 121 3.90E-24 EF014-2 giJ280 4 7 3 5 (AF030367) putative regulatory protein [Streptococcus 121 3.90E-24 _____pneumoniaej EF014-21 gi1280 4 747 (AF030369) putative regulatory protein [Streptococcus 1.21 3.90E-24 pneumnoniae 9000 Oee 9 SO 0~ 9 0 e e S.
CO C e 0 Se 0 060 0* 0 S Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EFOl14-2 git 1762327 putative transcriptional regulator [Bacillus subtilis] 185 2.80E-22 EF014-2 -gi114315 6 membrane bound protein [Bacillus subtilis) >gn1IP1Dle1 184471 116 l.lO13-21 EF014-2 gnIlPIDIdlO 189 membrane bound protein LytR [Synechocystis sp.] 113 6.20E-20 EFOI14-2 gi1127 6 874 jEpsA [Streptococcus thermophilus] 103 4.OOE- 17 SICAAI r 1'2 C EF016-2 EF016-2 EF016-2 t~r~t~I~t I C ;lmf% w4d API' trans orter (11inding protein I gn Piiii~ IfJU II U a& "ALL -SS ar inine binding protein [Ecei~ oi gl674at 121 .0E3 giJ4U934 I u- I gn]IP1DId10 152 Argiinebiningperplamicprotein 2 precursor [Escherichi a 14.ISUL-.3 1 I I I EFOI16-2 gi1687 652 j EFOl16-2 giJ26504I0 FlIN [Escherichia coli] >gnlPlD~dt 10 16464 FINY protein 10 5.70E-31 (AEOO 1090) glutamine ABC transporter, periplasmic glutamine- 122 3.30E-29 104nding~2 EFQ 16-2 EF016-2 EF016-2 gi1l 64 90 35 ai1157 4634 g1415 6 9
I
high-affinity periplasmic glutamine binding protein [Salmonell I I 2.50E-27 EF016-2 jgnhiPIDtd 1015: GlnH precursor (AA -22 to 226) [Escherichia coli] 106 4.70E-27 >gnljPIDIdlO152SO____ 2 Arginine-binding periplasmic protein I precursor [Escherichia 109 .3.70E-26 artl [Escherichia coli] giJ7 6979 I arti [Escherichia coli] 127 2.30E-25 9 homologous to Gin-binding periplasmic proteins [Bacillus 117 8.50E-24 EFOI16-2 EFO 16-2 7 gil76979 1 gn.l1PIDId1 008 EFOI16-21 giI1541 25 'J protein [Salmonella typhimurium] >gi4771 8 reading frame 1,18 2.10E-23 0 0 0 0 *0 .i ~0* *0 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
hisJ EF016-2 gnIIPlDjdl0l68 HISTIDINE-BINDING PERIPLASMIC PROTEIN 117 4.50E-23 8 PRECURSOR
(HBP).
EFO 16-2. g!1 1166636 histidine-binding perilasmic protein HisJ [Escherichia coli] 117 6.60E-23 EF017-2 giJ38 8269 traC [Plasmid pADI]) >pirjA533101A533l10 pheromone cADI 421 4.50E-1281 ~binding EFO17-2 gn1jPIDjIl 8 TRAC [Enterococcufs faecalis] 417 5.1013-124 EF017-2 gnljPIDjd1006S TraC [Enterococcus faecalis] 414 4.40E-123 EFO 17-2 giJ309 662 pheromone binding protein [Plasmid pCF 10] 415 2.40E-1 19 _____________>pirjB53309IB53309.k EFO17-2 giI40005 OppA gene product [Bacillus subtilis] 294 6.2OEA82 EFO 17-2 gil 14360 3 sporulation protein [Bacillus subtilis] >gnlIP1Djel 183163 290 2.80E-79 EF017-2 giJ 3 l 2 94 0 threonine kinase [Streptococcus equisimilis] 241 2.4013-71 ~>pirIS281I53IS28 153____ EF017-2 giJ488OS dciAE [Bacillus subtilis] 270 1.I1OE-61 EFOI17-2 gnIIPIDel181 49 (MJ00257 1) DppE [Bacillus subtilis] >gn]IP1Dle1 183316 270 .150E-61 EFO 17-2 oirIS 1665l1IS 16 6 dciAE protein- Bacillus subtilis 270 3.10E-60 EFO 17-2 giJ3049 5 j periplasmic oligopeptide binding protein [Escherichia coli] 171 2.60E-57 EF017- !~41l oligopeptide binding protein precursor [Escherichia colil 171 8.7013-56 j 1)9 J In JI..J IEF017-2 Opp A (AA 1-542) [Salmonella typfimunumJ >9g'if ,a'J6 precursor
C
a a Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EFOI17-2 giJ88255O ORFJf535 [Escherichia coli] >gil 178 93 97 (AE000384) f535; 135 5.50E-52 This 535 EFOl17-2 gil 1574679 oligopeptide binding protein (oppA) [Haeinophilus influenzae] 168 2.90E-43 EFO 19-21 giJ43845 8 liely N-terminal signal sequence; mature protein probably 104 .2.30E-17 EF021-2 gnlIDje3I1492 unknown [Bacillus subtilis] >gnlPD~e1 184232 similar to ABC 317 2.5013-103 EF021-2 bbs1173803 CD4+ T cell- stimulating antigen [Listeria mnonocyto genes, 476 2.80E-81 E0- 167, EFO21-2 gil 5 8 1 8 09 tmbC gene product [Treponema pallidumi >pirIA435951A43595 152 3.20E-71 membrane____ EF021-2 giJ26882 80 (A.E001143) basic membrane protein C (brnpC) [Borrelia 10 1 5.50E-27 EF021-2 gnlIDIel 17283 membrane protein A [Borrelia garinli]j 142 6.50E3-h EF021-2 gntljPIDlel 17283 membrane rotein A [Borrelia burgdorfen]l 141 9.20E-22 EF021-2 gnljItDjeI 17283 membrane protein A [Borrelia burgdor-feriJ >giIS 16592 141 9.20E-22 membrane EF021-2 gnljIlDje 17283 bnipA(239,ORF 1) [Borrelia burgdorferiii 141 1 .70E-21I EF021-2 giJ508421 antigen P39 [Borrelia burgdorferi] >gil 2 68 82 8 I (AEO0I 143) 141 1 .7012-21 basic EF021-2 gi11753 22 5 jBmpA protein [Borrelia burgdorferil 141 2.70E-20 EF021-2 gn]IP1De 11728 2 membrane protein A [Borrelia afzeiilj 141 8.60E-20 EF021-2 gnljPIDjIl728 3 Imembrane protein A [Borrelia afzelii] 141 8.6013-20 A 0 141 ~Af EF021-2 EF021-2 ,gn1jPIDje 17283 gnIjPIDje 17282 membrane protein A[Borrelia afzeliiJ bmpA(p39,OR.F 1) [Borrelia burgdorferi] J 141 1.50E-19 I 1.50E-19
I
I
I II
I
I
I
I I
I
Table 2. Closest matching sequences between tile polypeptides of the present invention and sequences in GenBank and Derwent, databases.
EF022-2 giJ1 2 9 40 threonine kinase [Streptococcus equisimilis] 324 5.90E-66 1531S28153_________ EF022-2 giJ3096 6 2 pheromone binding protein [Plasmid pCF 10) 307 5.60E-60 B53 3091B53309_ EF022-2 gnljPIDjdl0l 118 TRLAC [Enterococcus faecalis] 301 4.80E-59 EF022-2 gnIPIDeIel18149 (AJ 0571) DppE [Bacillus subtilis] >gntIPIDIe 1l8 33 1 6 170 5.LOE-59 EF022-2 gi14880 8 dciAE [Bacillus subtilis] 170 5.20E-59 EF022-2 gnlP1D~d1 0065 TraC [Enterococcus faecalis] 299 2.80E-58 EF022-2 pirI1665115l66 dciAE protein Bacillus subtilis 170 1.60E-57 EF022-2 giJ38 8269 traC [Plasmid pADI >pirIA533 I OA533l10 pheromone cADI1 290 2.70E753 ____binding EF022-2 giJ4 000 5 OppA gene product [Bacillus subtilis] 154 7.30E-48 EF022-2 gi11 43 6 0 3 sporulation protein [Bacillus subtilis] lfPIDjeI 183163 154 3. 1OE-47 EF022-2 gi2688227 (AEO 1139) oligopeptide ABC transporter, periplasmic 215 l.OOE-36 EF022-2 giJ228 1458 (AF000366) oligopeptide permease homolog All [Borrelia 215 l.OOE-36 EF022-2 gif 304925 Periplasmic oligopeptide binding protein Escherichia coli] 131 1 I.30E-35 EF022-2 gi114701 4 oligopeptide binding protein- precursor [Escherichia coli] 131 1 .80E-34 EF022-2 giJ47 8 02 Opp A (AAI-542) [Salmonella typhimurium] >giJ4780 8 138 4.90E-34 precursor __231 EF023-21 gi3 0 9 6 2 lpheromone binding protein [Plasmid pCF 10] 3 .7E6 e
S
Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in Gen~ank and Derwent databases.
-I>pifB533091B53309
III
EF023-2 EF023-2 EF023-2 EF023-2 EF023-2
FEF
0 2 3 2 EF023-2 EF023-2 EF023-2 EF023-2 giJ3 88269 gnl11PIDIdI01 18 gnI IPIDId 10065 giJ4 8808 gnjPlD le1 18149 pirS 16651 IS 166 giJ 40 005 gil 143603 giJ47802 giJ268 82 7 giJ2 28 l 4 5 8 traC [Plasinid pADI] >pir1A533lOlA53310 pheromone cADI binding TRAC [Enterococcus faecalis) TraC [Enterococcus faecalis] dciAE [Bacillus subtilis] (AJ002571) DppE [Bacillus subtilis] >gntPDlel 183316 dciAE protein Bacillus subtilis, OppA gene product [Bacillus subtilis] sporulation protein [Bacillus subtilis] >gnIPIDIe 1183163 Opp A (AAI-542) [Salmonella typhimuriumi >gil47 8 08 precursor (AEQO 1139) oligopeptide ABC transporter, periplasmic (AF000366) oligopeptide permease homolog All [Borrelia burgdorferil OREfJ535 lEscherichia colil] >gi117 8 9 397 (AEOOO3 84) f535; This 535 aa periplasmic oligopeptide binding protein [Escherichia coli] oligopeptide binding protein precursor [Escherichia coli] (AF005097) unknown [Lactococcus lactis] pheromone binding protein [Plasnmid pCF 226 1 OOE-58 226 4.40E-58 157 157 137 133 135 187 187 1 .20E-57 3.80E3-56 2.30E-53 6.90E- 3 2.0013-41 9.4013-41 I 1.90E3-40 155 11l'3013-38 E F 0 2 3 -2-rj i8 8 2-55 EF023-2 EF023-2 EF026-2 EF027-2 giJ3 04925 gi1 147014_ gil 235 24 82 giJ3O9 662 130 130 141 198 9.0013-37 3;70E3-34 1. 1OE-23 6.2013-71 6 669 6* 6 6* 3 6** 6 6
I
6 66 6 666 6 6 6 1 66 6* 6 *6 61 6 I It *6 66 #6 6~ 6 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
T F 11 202 EF027-2 gnljPIDldlO065 ITraC [Enterococcus faecalis] 202 1 .50E-68 I i i EF027-2 EF027-2 EF027-2 EF027-2 EF027-2 EF027-2 EF027-2 EF02 7-2 EF027-2 EF027-2 gn.1IPIDjdlOl 118 r,
A
TRAC [Enterococcus faecalis] traC [Plasmid pAD I] >pirIA533lIOIA533l10 pheromone cAD I binding 213__ j8.30E-68
I
gillDle 18149 (AJ00257 1) DpE[ailssbiis giII~l131 r 222 AlO-4 giJ48 808 pirIS 16651 IS 1 66 giI4 000 5 gi1143 6 0 3 gif3l12940 gi12688 227 gil2 2 8 1458 dciAE [Bacillus subtilis] dciAE protein Bacillus subtilis OppA gene product [Bacllus sutilis] 1222 1 LlOE-39 TJ 25 sporulation protein [Bacillus subtilisi >gnlPlDIel 183163 threonine kinase [Streptococcus equisimilis] >PirIS28 153 1S28 153 (AFO0l 139)oligopeptide ABC transporter, periplasmic (AF000366) oligopeptide permease homolog All [Borrelia ~burgdorferil (AF000948) OppAJV [Borrelia burgdorferi] >gi12 68 989
I
(AE0007 92) ~oligopeptide binding protein (oppA) [-aemophilus influenzae] B. subtilis alkaline phosphatase iIIA; P 19405 secretory 247 233 131 131 4IE-39 5.8OE-3 9 8.90E-,33 2.40E-24 2 .40E-24 117 1,OE2 EF027-2 1 gij2 2 8 1468 EF027-2 EF028-2 IEF028-2 gil 1574679 gnlfIPlDId 10204 7 pirIBl3 90961B39 130 996 3.50E-20 3.6013-131I alkaline phosphatase (EC 3.1.3. 1) 111 prcursor Bacillus 9 982 12.90E-129
I..
I a *1 II I I I I C I I
I
I. I I Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF028-2 EF029-2 EF028-2 EF028-2 EF028-2 EF028-21 EF028-2 EF028-2 EF02 8-2 EF028-2 EF028-2 EF028-2 EF028-2 0 gi147 0383 gi114 3324 gill 47243 gi11 4 7237 gi11 47239 giJ14724 I gi11.27 7 127 gi1147 229 g-181 8 851 gil 147245 gi1147 23 1 giJ1 4 7 235 gill 0 1 60 10 alkaline phosphatase A [Bacillus subtilis >gIIlel 182942 AkPase I [Bacillus lichenifon-nis] >pirlA44828lA4482 8 alkaline alkaline phosphatase precursor (EC 3.1.3. 1) [Escherichia coliLalkaline phosphatase precursor (EC 3.1.3. 1) Escherichia colil alkaline phosphatase precursor (EC 3.1.3. 1) [Escherichia colil alkaline phosphatase precursor (EC 3.1.3. 1) [Escherichia colil phoA gene product [Cloning vector pFW..phoAl >gil 12 77 130 phoA gene alkaline phosphatase precursor (EC 3.1.3. 1) [Escherichiacoli] alkaline phosphatase [synthetic construct] alkaline phosphatase (phoA) (EC 3.1.3. 1) [Escherichia fergusonii) alkaline phosphatase precursor (EC 3.1.3. 1) [Escherichia, coli] alkaline phosphatase precursor (EC 3.1.3.1)1[Escherichia coli] alkaline phosphatase with N-terminal PeIB-leader and Cterminal YncB [Bacillus subtilisi >gnillPlDlel 183421 similar to niicrococcal similar to hypothetical proteins [Bacillus subtilis] nuclease [Staphylococcus aureus] >pirIAOO790INCSAF icrococcal thernionuclease [Stahlococcus hicus] 803 184 183 178 178 178 174 178 174 177 174 174 174 4.8013-119 3 .OOE-54 8.30E-54 4.40E-53 -4.40E-53 4 .40E-53 4.90E-53 8.40E-53 IOE-52 1.20E-5,2 1 .6013-52 1 .60E-52 1 .60E-52 257 3;50E-55 EF029-2 giIl7501 2 6 EF029-2 EF029-2 gnljPIDje 1183 60 giJ67 3492 263 320 7.80E-53 2.20E-39 EF029-2 i gi5 32653 15 5 ]19.I1OE-39 9* 9.
4** 9 9 9 9 9 9 9.
9 9 99. 9 *9 9. 9 9 9 9ta* 9. 9 9* 9* 9 9 9 9 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in Genl~ank and Derwent databases.
I EF029-21 EFO30-2 EF030-2 EF03-2 EFO30-2 EFO30-2 rEF030.2
I
gIJ47 146 thermonuclease [Staphylococcus intermediusi >iflS26079IS260 7 9 gi48808 dciAE [Bacillus subtilisl mnlPI~ell 8 l 4 9 AJ002571) Dp2E fBacillus Subtilis] >gn1PlDje1183316 S,66 6 -51 S166 dciAE protein Bacillus subtilis giJ30 9662 pheromone binding protein [Plasmid pCF 101 ir B533091 353309 gnlIPIDjdl~l 18 TRAC [EnterococcUS faecalis) gn.1PIDldlOO 6 TraC [Enterococcus faecalis] 149 149 149 227 1.l1OE-66 1 .50E-66 5.90E-66 1 7.4013-52 2 37 F7.40E-52 233 9.70E-5Sl 22 9 3 .OO0E_48 giJ38 8269 EF030-2 gi(3 12940 EF030-2 giJ4 7802 traC [Pla smid pADI lI >pir1A533101A 53 3 l 0 pheromone cADI binding threonine kinase [Streptococcus equisimilis] >pirIS2S 153IS28153 Opp A (AAl-542) [Salmonella typhimuriufli >gil 4 780 8 277 3.OOE-45 125 8g.50E-34 125-3 recursor EFO3O-2reur 2682 AQl19 hoetide ABC trasporter, Renplasinic EF030-2 gi1222 8 1458 (AF000366) oligopeptepreaehmogM[orei b u r d o r fe ri t l s EF030-2 i 40005 0 A ene Rroduct [Bacilluis subtis EF030-2 143603 s orulation protein B acillus subtilisl ltPIDj IeI 183163 E F030-2 giJ228l 4 68 (AF000948) OppAIV [Borr elia burgdorferil >gi12 68 989 l 00OO07 92) S211 211 148 144 210 4.80E-31I 1 .20E-30 4.80E-30 2.IE-2
S
S S S S S S S *5 S. S S S S S S S S S S S. S.
S
Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Der-went d .atabases.
EF030-2 EF033-2I EF033- EF033-2 EF03 6-2 EF036-2 giJ157 4 679 gnljPJDjI 18439 pirjS54437JS5 44 g1619 623 gnlPIDIdl 0102 2_ gi126 22858 EF036-2 jgiJ26 22859 EF036-2 1 gi12 688 115 oligopeptide bndig potein (oppA) [H-aemophilus influenzae] similar to iron-binding protein [Bacillus subtilis] hemnin binding protein Yersinia enterocolitica hemin binding protein [Yersinia enterocolitica]_ ORF 108 IBacillus subtilis) >gnljP1DeII857 6 6 alternate gene (AE000929) phosphate-binding protein PstS (Methanobacteflum (AE000929) phosphate-binding protein PstS homolog [Methanobacteflum.
(AEOO 132) phosphate ABC transporter, periplasmic tphosphate-binding (AF005097) unknown [Lactococcus lactis] hypothetical protein [Esherichia coli] >gil78651 1 (AEOOO 139) 2-5A-de endent RNase [Mus musculus] >;pirtB45771lB3 4 5 7 7 l G9a [Homo sapiens] >pirS30385IS 3 0 3 8 5 G9a protein human erythroid aniyrin [Mus musculus) >pirS3777 S3 77 7 1 ankyrin, ankyrin [Mus musculus] >pir114950211 49 502 ankyrin mouse alt. anrn (variant 2.2) [Homo sapiens] ankyrin [Homo sapiens] ankyrin [Homo sapiens] ankyrin 1, erythrocyte splice formn 2 human 148 164 108 108 544 183 1.40E-45 158 12.40E-41 11-7 1IOE-12 6 OOE-29I 2.60E- 14 1.40E-1 I 2.OOE-1 1 1 .20E-96 EF037-2 EF040-2 2 EF040-2 EF040-2 EF040-2 EF040-2 EF040-2 EFO4O-2.
EF040-2 giJ23 52482 giJ 1657516 giJ293 2 65 61~ 2 8786 5 gil 31 1817 giJ19 1940 gil7 477 1 0 giV]178646 giI18 45 26 5 pirlA3 5049IA3 5 141 208 105 143 119 119 120 120 10 120 1.l08-23 1,.90E-29 i.OOE- 17 8 .30E=-14 -4.80E- 13 4-.90E- 13 1.50E-12 1.80E-12 I1.80E-12 I.8E-1 L 0 a 4 9* 4 q 4 4* 49 S 9 9.
9 9~ S 9 a 9 5 9
S
Table 2. Closest matching sequences between the polypeptides of the present ivnonadsquences in GenBank and Derwent databases.
a a 9 9* p.
9..
9 9* 9 a 9* a a a a a a a a.
a a S a S S. *a Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
1EF041-2 giJ147 0 1 4 oligopeptide binding protein precursor [Escherichia coli] 160 1 .50E-28 EF041-2 giJ22 5328 6 (AF005657) plasminogen binding protein [Borrelia burgdorferiL 134 5.0011-27 tEF045-2 giJ3O 8 85 4 oligopeptide binding protein [Lactococcus lactisl 43 3.21312 I L I EF045-1 EF045-2 EF045-2 EF048-2 EF04 8-2 EF048-2 EF048-2 EF048-2 EF048-2 EF04 8-2 EF049-2 gi)49 5 1 8 1 giJ 6 7 79 45 giJ293 0 14 gil 1574060 dbjllABOO 1498 2_ gi146 6 7l 7 gil 46 00 6 gil 7 3 102 _gil2 182530 gi15 7 2999 gi1149 58 1 oligopeptide binding protein [Lactococcus lactis] 426 9.70E- 124 AppA [Bacillus subtilis] >gnlPIDIel 183158 oligopeptide ABC 154 2.30E-31 peptide-binding protein [Lactococcus lactis] 159 2.40E- 14 >pirj B47098B47098____ hypothetical [Haemophilus influenzae] >pir116416411 64 164 250 2.30E-41 (ABOG 1488) SIMILAR TO C4-DICARBOXYLATE- 208 3.60E-34 BINDING PERIPLASMIC______ No definition line found [Escherichia coli] >gill 79 000 4 199 1.3013-30 penipiasm-ic C4-dicarboxylate binding-protein [Rhodobacter capsulatus] hypothetical [1amophilus influenzae] >piilH64 14311-64143 (AE000085) Y4mM [Rhizobium s NGR234] hypothetical [Haemophilus influenzael >pirIE64 141 I E64 141 maturation protein [Lactobacillus paracasei] >pirIA448581A44858 ORF (AA 1 to 299) [Lactococcus lactis cremoris) >pirlS0 8 0 83 1 S08083 ~maturation protein [Lactococcus lactis] >gi62 3 055 proteinase 162 11.405-25 2_4 4 114 116 241 239 3.80 5.60E- 18 5:9013- 2.40E-55 1 .O13-54 I6.20E-54 EF049-2 giJ414 32 0 7 ~EF049-2 giJ4321 98 9 90 0* 0 099 9. 0 4 9 9*.
:0 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GerLBank and Derwent databases.
EF049-2 giJ4728 35 ORFI [Lactococcus lactis cremoris] 241 11.5013-53 EF049-2 giJ39782 33kDa lipoprotein [Bacillus subtilis] >gnlIPlDje325 181 33kDa 128 8.90E-40 EF051-2 gnlPIDjdl10l114 molybdate-binding periplasmic protein [Synechocystis sp.] 173 3.2013-50 2 EF051 2 IgnlIPID~eI 18602 alternate gene name; yvsD; similar-to molybdate-binding 314 5.90E-50 1-2 gil 1574546 EF051-2 gi150 44 9 8 EF051-2 01g1 48 9 39 EF054-2 EF054-2 EF054-2 _901505 56 gn1IPlDle23657 I giJ457 38 lsg locus hypothetical lHaemopflhlus inhiuenzacj >pirlA641I751A64 175 periplasmic, inolybdate-binding protein [Escherichia coli] >-gill 147 817 ORF 8 [Haemnophilus influenzae] >pirIS27S 83IS27583 hypothetical surface protein [Plasmid pCF 101 >pirIA4l8261A4]826 probable cell wall anchoring signal [Enterococcus faecalis] ORFC [Enterococcus faecalis] >pirIJHO2O4IJHO2O4 hypothetical 30.5K orf iota [Str~eptococcus pyogenes] >pirIS681I25IS4509 1 hypothetical sporozoite surface protein [Plasmodium yoeli) >pirIA45559IA45S 59 PbTRAP fPlasmodium bergheil zinc finger protein [Mus inusculus] >gnlfP1Dj225 6 88 zinc IgG and IgE immnunoreactive antigen recognized by sera from patients 1490 515 372 1.80E-192.
1 .60E-58 148 1.40E-30 150 I8.IOE-28 EF054-21 giJ496S 2
O
362 1I.30E-43 286 14.-30E3-33 EF054-2 EF04- EF054-2 EF054-2 giJI6 06 9 3 7 gi1181 35 23 gn1jPlDje225 6 87 giJ2290 394 305 246 242 1.30E3-32 3.60E3-26 1.40E3-25 4 4..
9** 4 9.
4* 9 9 4 9 44 4 9 *4 9. 9 4 9 9 9 9 99 99 9* 4 9 9 9994 99 9 9 Table 2. Closest mnatching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF054-2 EF054-2 EF054-2 EF054-2 EF054-2 gi 9 3 92 gi46523 pirnS 153301jFCS 0 [1620100 *i636869 Pro- and Glu-rich, PENPEV (l0x); similar to Streptococcus
B
NF-M c-terminuS GCallus gallus) NF-M protein [Gallus gallus] >pirjS 1 57621S 15762 neurotilament triplet TATA-box like sequence (Usi 11) [Human herpesvinis 1) >giJ29193 18 cell wall anchoring signal [EnterocoCCUS faecalis] sufaerotein [Plasmid pCFI10 >pir A4 1826 IA4 1826 oal ORFC [EnterococCus faecalis] >pirlJ H0204 1JH0204 232 228 210 222 222 2 .80E-23 1 .OE3-22 3.I1OE-21 6.90E-2 1 8.SOFE-21 1T94-4. IOE- 19 EF054-2 1 gi17 578 67 EF059-2 EF059-2 gnlIPlD~e23657
I
6150556 I giJ4573 8 418 1366 5 .6013-95 3 .70B-87 9,30E-50 EF059-2 EF059-2 EF059-2 EF059-2 9-2 hypothetical 30.5K gi149 6520 orf iota [Streptococcus pyogenes] >pirIS6 8 125IS4509
I
yotetical gill1 6 06 93 sporozoite surface protein [Plasmnodiumn yoeliil >pirIA4S559IA4555 9 g1813523 PbTRAP IPlasmodium berghei gi122 9 0394 lgG and IgE inunnoreactive antigen recognized by sera from pati ents gi12 290 392 IgG and IgE immunoreactive antigen recognized by sera from watients 367 5.90E-44 1 344 1-.IE-38 2-95 251 251 =2.50OE-32 3 .OOE-29 3E 0- 29 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF059-2 gil 1620100 Pro- and Glu-rich, PENPEV (IlOX); similar to Streptococcus B 253 6.40E-27 EF059-2 gif 46521 Fc receptor [Streptococcus agalactiae] >pirlA602341A60234 IgA 197 2.70E-26 Fc EF059-2 gi4 6523 B3 antigen [Streptococcus agalactiael 232 9.30E-26 EF059-2 pirISl533OIECS IgA Fc receptor precursor Streptococcus agalactiae 232 19.30E-26 0 EF059-2 gn~lPIDDle225687 zinc finger protein [Mus musculus] >gnIjPlDje225 6 88 zinc 234 1 .4E-22 EF059-2 gil42 5356 zona pellucida protein [Pseudopleuronectes aniericanus] 229 1 OOE-21 EF059-2 gi(4577 6 9 Collagen [Bomrbyx mori] >pirlS42886IS4288 6 collagen 209 7.60E- 19 silkwormn EF061-2 gnlPIDe236571 cell wall anchoring signal [Enterococcus aecalis] 925 _18.1E-1 18 EF061-2 IL05 surface protein [Plasmdd pCFIO] >pirlA41826lA41826 probablel 350 11.50E-107 I-)n 1I I AAEZ'.CQ EF061-2 giJ496520 EF061-2 giJ4573 8 EF061-2 gil 18135 23 EF061-2 gil 160 693 EF061-2 giJ 636 86 EF061-2 giJ6 36 89 EFO6I-2 giJ2290 392 orf iota [Streptococcus pyogenes] >pirIS61SbIji43u0i I hypothetical ORFO [Enterococcus faecalis] >pirlJHO204[1H020 4 hypothetical 30.5K PbTRAP [Plasmodium berghei] sporozoite surface protein [Plasmo dium yoelii] >pirIA455591A455 5 9 INF-M c-terminus [Gallus gallus] INF-M protein [Gallus gallusi >pirIS I57621Sl15 76 2 6.40E-50 322 263 1 OOE-26 241 -9:.O0E-25 232 2. 1OE-22 232 2.60E-22 176 12.4013-21 neurofilament, triplet lIgG and IgE imunoreactive antigen recognized by sera from S S S *5 S Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF061-2 gil 16201 00 Pro- and Glu-rich, PENPEV (lOx); similar to Streptococcus B 165 2.70E-20 EF061-2 gnljPID~e22568 7 zinc finger protein [Mus musculus] >gnljPIDje22568 8 zinc 197 7.80E- 19 EF061-2 gil 160 355 interspersed repeat antigen [Plasmodium falciparum] 199 8.20E- 18 EF061-2 giJ410750 interspersed repeat antigen [Plasmodium falciparum] 199 8.90E-18 EF061-2 giJ22903 8 8 IgG and IgE iminunoreactive antigen recognized by sera from 182 1.40E- 17 ________patients EF061-2 giJ2290 394 lgG and IgE inmunoreactive antigen recognized by sera from 180 2.80E- 17 patients EF062-2 gi470 49 asal gene product (AA 1-1296) [Enterococcus faecalis] 3716 0 EF062-21 giJ4332 4 aggregation substance (ASPI1) [Enterococcus faecalis] 4003 0 EF062-21 giI2 109266 aggregation substance [Enterococcus faeciuin] 5523 0 A~ 141 41Q A EF062-2 EF062-2 EF062-2 gil 150555 gil 100973 giJ47 248 aggregation substance [Pl'asmid pt-t Iu0j >~pr~IoI 61u.
150K mating SsvB precursor [Streptococcus gordonij iI 110 j9.90E-39 107 I1.70E-38
I
PAc protein precursor (AA -38 to 1527) [Streptococcus T- r EF062-2 jgni IPID d 10 150 surface protein antigen precursor [Streptococcus sobrinus) 1 1 5--OOtE-3 6 6.5013-36 1 .2013-35 EF062 21 giJ47 267 cell surface antigen 1111 [Streptococcus mutans] >p~iY15068391S06839
I
EF062-2 bbsj 148453 SpaA~endocarditis im-munodominant antigen [Streptococcus sobrinus, S S 9*
S
S..
S S S
S
S
S.
S
S S
S..
S S S S S 55*** S S S S S S S S S S S. S* S S Table 2. Closest matching sequences between the polypeptides of the presenit invention and sequences in GenBank and Der-went databases.
EF062-2 giJ47 620 antigen 1 /11 [Streptococcus sobrinus] >pir1A603381A60338 surface 2.90E-35 -1109 12. 1OE-34 EF062-2 EF062-2 EF062-2 EF063-2 EF0632 EF063-2 EF063-2 EF063-2 EF063-2 pirlA3S I86lA35j gI I 009-)r giI097 giJ47 049 gi324 giJ2 109266 -gi1150555 gi1l100973 giJ 4 7 24 8 salivary agglutinin receptor precursor Streptococcus I i. 110 3.80E-32 SspA [Streptococcus gordonii] SspA [Streptococcus gordonij asalI gene product (AA 1- 1296) [Enterococcus faecalis) aggregation substance (ASPI) [Enterococcus faecalis] aggregation substance Enterococcus faeciun] aggregation substance [Plasmid pCF 101 >pirIH41662lH4l66 2 150K mating SspB precursor [Streptococcus gordonii] PAc protein precursor (AA -38 to 1527) [Streptococcus mutansi surface protein antigen precursor [Streptococcus sobrinus] cell surface antigen 1111 [Streptococcus mutans] >pirlS068391SO 68 39 SpaAendocarditis inumunodomiflant antigen [Streptococcus sobrinus, antigen I A~l [Streptococcus sobrinus] >pirlA60338lA60 3 38 surface salivary agglutinin receptor precursor Streptococcus 110 3716 4003 5523 6338 3.80E-2 2.0E2 0 0 0 0 eI EF063-2 gnillPIDjdI10 EF063-2 gi147 267 EF063-2 bbsl 148453 EF063-2 giJ47 620 132 5.OOE-36 107 I6.50E-36 132 132 1 .20E-35 2.90E-35 EF063-2 1861A35 I
I
4 4** 0** .4 *4* 6 V 4 V V
V
V V 44 4
V
4*4 *4e V A 4 *4 VA *4 V* V 0 V *VV*4 *g* *6 C V Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF063-21 gil 1100971 EF063-2 gil 110097 5 EF064-2 giJ470 49 EF064-21j giJ433 24 EF064-2 EF064-2 EF064-2 EF064-2 EF064-2 EF064-2 EF064-2 EF064-2 EF064-2 EF064-2 EF064-2 EF068gi121092 66 gil 150555 gill 100973 giJ4 7248 gnllPlDl 1050 7 gi1j47 267 bbs1 148453 giJ47 62 0 pirIA35 186lA35 gill 100971 gil 100975 gi179 03 98 SspA [Streptococcus gordonii] 110 3.80E-32 SspA [Streptococcus gordoniij 110 2.30E-21 asalI gene product (AA 1-1296) [Enterococcus faecalis] 3716 0 aggregation substance (ASP I) [EInterococcus faecalis] 4003 0 aggregation substance [Enterococcus faeciuml 5523 0 aggregation substance [Plasmid pCF 10] >pirlH41662lH41662 6338 0 150K mating____ SspB precursor fStreptococcus gordonii] 110 9.90E-39 PAc protein precursor (AA -38 to 1527) [Streptococcus 107 1-70E-38 mutans] surface protein antigen precursor [Streptococcus sobrinus] 132 5.OOE-36 cell surface antigen 1/11 [Streptococcus rnutans] 107 6.50Eb-36 >pirlS06 8 39 1S 0 68 39 SpaAendocarditis immnunodominant antigen [Streptococcus 132 1.20E-35 sobrinus, antigen 1 /11 [Streptococcus sobrinus] >pirlA60338lA6033 8 132 2.90E-35 surface salivary agglutinin receptor precursor Streptococcus 109 2. 1OE-34 SspA [Streptococcus gordoniil 110 3.80E-32 sp Streptococcus gordonii] 110 2.3013-21 T06D8. I [Caenorhabditis elegans] 137 85E1 6 C C e.g S C 0e 0O4 S 0 S S *0 C 9
C
S *g C C C C C
C..
CCC egO C C C g.e gee *0 e* g Ceeggee *4 CC C C C Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF068-2 gnIjPlDjdIO208 membrane glycoprotein [Equine herpesvirus 1] 210 5.80E3-16 4 EF068:2 gi(2286 2 04 (AFOI 1339) unknown -[Acinetobacter calcoaceticusl 121 -8.40E3-16 EF068-2 giJ330 86 2 membrane glycoprotein [Equine herpesvirus 1I 208 1.IOE-15 EF068-2 gil 1 7 072 47 partial CDS [Caenorhabditis elegans] 131 3.70E-15 EF068-2 gnlPIDIdIO208 membrane glycoprotein [Equine herpesvirus 1] 203 6.20E- 4 EF068-2 giJ21339 2 antifreeze glycoprotein [Notothenia coniiceps] 102 4.601E-13 84201A3 8420 EF068-2 gnljPIDje12 54 64 (AL022022) PORS-family protein [Mycobacterium 145 1.0E- 12 EF068-2 EF069-2 EF069-2 EF069-2 EF069-2 giJ95 1460 gil790 39 8 gnlPIDId 10208 4 g228 6 20 4 giJ33 08 62 FIM-C.1I gene product L enopus Iaevisj -'pu 1Ij I -ii).i mucin
T
?S2~UL.-I
I
I -rA~r'~Q I rr s,-l~Ik p1po~4n.d Ut-1 Ic membrane glycoprotein [Equine herpesvirus I] 210 156.80E.-10
I
(AFO 11339) unknown [Acinetobacter calcoaceticus] membrane glycoprotein [Equine herpesvirus 1] 208 131 t 3.70E-15 EF069-21 EF069-2 gil 1707247 gnllPlDldlO2O8 4 giJ2 13392 partal CDS Jaenorabditi eleals membrane glycoprotein [Equine herpesvirus I1 anti freeze glycoprotein [Notothenia coriiceps] [6.20E-15 I t 102 I
S
I 4.60E-13 102 4.60E- 13 6 0 *00 000 0 000 00 0 0e S 0 0 0 0 0 0 00 S. @0 0 @0 S 000 *00 @00 0 0 0 0 000000 0@ OS S 0 0 @000005 0000000 0@ @0 0 0 0 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
I
EF069-2 gidlPlDlel2546 4 EF069-2 giJ95l 46
O
EF070-2 giJ 7 9 0398 EF070-2 gnlP1D~d10208 EFO7O-2 giJ22 862 04 EF070-2 giJ3309 62 EF07O-2i gi1170 7247 EF070-2 gnljPIDjdl 0208 4 EF070-2 giI2 13392 EF070-2 gnllIle 125464 EF070-2 gil95 1460 EFO7 1-2 gnIlPlDje30 64 28 EF071-21 giJ8537 5 l EF073-21 gi143830 >pirIA3 8420 jA38420 (AL022022) PGRS-family protein [Mycobacteriunm 145 1 .50E-12 tuberculosis] FIM-C.1I gene product [Xenopus lacvis] >pirIA451551A45155 109 2.70E- 12 mucin____ T06D8. 1 [Caenorhabditis elegans] 137 8.50E- 17 membrane glycoprotein [Equine herpesvirus 1] 210 5.80E- 16 (AFOI 1339) unknown [Acinetobacter calcoaceticus] 121 8.40E- 16 membrane glycoprotein [Equine herpesvirus 1] 208 1.1OE- >pirIH368021VGBEXI 1 partial CDS [Caenorhabditis elegans) 131 3.70E, membrane glycoprotein [Equine herpesvirus 1] 203 6.20E- antifreeze glycoprotein [Notothenia coriiceps] 102 4.60E-13 >pirIA3 8420 1A3 8420 (AL022022) PGRS-family protein [Mycobacteriumn 145 1.50E-12 tuberculosis] FIM-C. I gene product [Xenopus laevis] >pir1A451551A45 155 109 2.70E- 12 mucin unnamed protein product [Bacteriophage nlt] >gill 353566 127 2.00E-37 Lysin N-acetylmuramoyl-L-aaninfe aidase Bacteriophage AL5 11] 273 2.6E3 xpaC [Bacillus subtilis) >gnIPID~d1O0505 3 hydrolysis of 173 7.IE1
A
S S S S S S S S.
S
Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank EF074-2 giJ125 6698 chitinase [Serratia marcescens] >gill 25 6698 chitinase [Serratia 618 2.60E-104 EF074-2 giJ17 63 98 5 chitinase A [Vibrio harveyij 526 2.80E-84 EF075-2 giJ143 156 membrane bound protein (Bacillus subtilis] >gnIIPlDjel 184471 593 1.70E-91 EF075-2 pirlD7007OlD7O transcriptional regulator homnolog ywtF Bacillus subtilis 118 1 .90E-59 0 EF075-2 giJ1762 3 27 putative transcriptional regulator [Bacillus subtilis] 148 9.6013-53 EF07 5-2 giJ 1276874 EpsA [Streptococcus thermophilus] 239 2.20E-33 EFO75-2,gnljPlDje289l2 6 unknown [Streptococcus pneumoniae] 150 1.20E-27 EF075-21 gil 48 5275 putative regulatory protein [Streptococcus pnieumoniae] 150 2.50E-27 I CA I ICM2 and Derwent databases.
EF075-2 EF075-2 EF075-2 EF075-2 EF075-2 EF075-2 gil280 4 735 (AF030367) putative regulatory protein [Streptococcus gil2 804 7 47 gnljlDlel 16988 giJ280 4 7 69 gill 147744 gi1790 43 giJ22 672 39 gnlP1DIdl01 8 9 gnlIPlDldlOI 13 pneurnoniae] (AF030369) putative regulatory protein [Streptococcus pneumoiael
I
2.50 127.
14I ~Ucapsular polysaccharide synthesis protein [Streptococcus (AF030373) putative regulatory protein [Streptococcus pneumoniae] FPSR [Enterococcus hirae] PSR [Enterococcus faeciumjk pir4S5l77IS54177 PSRroteinLZ 148 _L.30Ez I 109 102 5.30E-27 2. 1OE-23 4.40E-19 85E19 109 ORF I [Staphylococcus epidermidis] membrane bound protein LytR [Synechocystis sp.] cadmium-transporting ATPase [Synechocystis sp.] 121
I
2.30E-1 13 2.30E- 113 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
*t.
4
C
*4 4 S 4 4* 4 *4 4* 4* 4* Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF078-2 gnlI1PIDIdlOl 19 homologous to sp:PHOR BACSU [Bacillus subtilis] 219 4.20E-44 EF078-2 gi11 5 7 5578 histidine protein kinase [Thermotoga maritirna) 191__ Ic K7 An AAULI EF078-2 EF078-2 EF078-2 EF078-2 EF078-2 EF078-2 EF078-2 EF078-2 EF078-2 EF078-2 EF078-2 EF079-2 EF081-2 EF081-2 EF081-2 EF081-2 ri' 5zW)QQCI histidine kinase rLactococcus lactis cremorisl I 152 1.I1OE-39 giJ21829 92 gn1(P1 Did101 113 4 gil 14 929 6 giJ58 1188 gn1IPID~dl 0108 7 gnhIPlDje266592 12 182996 gn1P1D~d 10 113 gi129 4893 giJ288 420 giJ209 8 71 9 gi4 6 7 806 giJ790 42 9 ,gnljPlDje208 36 5 g1179 043 3 histidine kinase [Lactococcus lactis cremoris] sensory transduction histidine kinase [Synechocystis sp.] phosphate regulatory protein phoR (gtg start codon) [Klebsiella !hoR gene product (AA 1-431) [Escherichia coli] >gi11657 5 96 sensory transduction histidine kinase [Synechocystis sp.] unknow [Mycobacterium tuberculosis histidine kinase [Lactococcus lactis cremoris) 259 3 .90E-38 226 138 232 206 7 .60E-33 1 .60E-32 3 .70E-32 1. 1 E 31 1.309-31 256 1.30E-31 sensory transduction histidine kinase [Synechocystis sp.] hohate regulatory protein phoR (gtg start codon) [Shigella drug sensory protein A [Synechocystis PCC68031I >gnlIPID d 1017420 putative fimbrial -associated protein [Actinomyces naeslundii) penicillin-binding protein [Enterococcus faecalis) low affinity penicillin-binding protein 5 (PBP5) [Enterococcus penic illin-binding protein 5 Enterococcus faecium] low affinity penicillin-binding protein 5 (PBP5S) Enterococcus 225 106 1 .60E-3 1 2.50E-31I 183 8.60E-26 1356 2.lIOE-178 607 1.OOE-78 604 I.IOE-78 604 2.70E-78
S
S S
S
*5 S *5* a S S S S S S S S S S. Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF081-2 giJ790 437 low affinity penicillin-binding protein 5 (PBP5) [Enterococcus 602 5. 1OE-78 EF081-2 -gil79 043 1 low affinity penicillin-binding protein 5 (PBP5)_[Enterococcus 591 2.60E-77 EF081-2 giJ433 42 D-alanyl-D-alanine carboxypeptidase [Enterococcus hirae] 587 9.30E-77 EF081-2 giI4900 0 ID-alanyl-D-alanifle carboxypeptidase [Enterococcus hirael 572 5.20E-74 EF081-2 gnlIPIDldl0079 penicillin-binding protein 2 [Bacillus subtilis] 149 7.40E-2 4 EF08 1-2 gnl(PID1e315088 MecA 1 [Staphylococcus sciun]l Ill 4.40E-19 EF081-2 gnlPIDle28 66 51 MecA protein [Staphylococcus sciuri] 106 2.90E-18 EF081-2 gnljPIDje3l6581 MecA protein [Staphylococcus sciuri] 11I 2.90E- 18 EF081-2 gnljPIDje3 1660 MecA2 protein [Staphylococcus sciuri] 101 3.70E-14 EF081-2 griIPDe31661 3 MecA prtein [Staphylococcus sciuri] >gi 4 6 613 mecA gene 101 3.70E-,'14 EF083-2 giJ49 6 83 lysin [Bacteriophage Tuc2009] 436 6.205'-176 EF083-2 gi153079 8 LysB [Bacteriophage phi-LC3] 421 3.OOE- 175 EF083-2 gi1 16 6 1 8 3 muramidase [Bacteriophage CP-71 186 1.20E-21 EF083-2 gi16 88 muramidase [Bacteriophage CP-91 >pirJQ0438IMUBPC9 188 5.00E-21 EF083-2 gi162 3084 muramidase; muramridase [Bacteriophage LL-H] 193 8.40E-20 EF832 il 6 t 7 muramnidase Bacteriophage CP-1117 -3.40E-19 EF8--: 1. 6 JA,(61I.At5r 175 -dlFI EF083-2 gn1jPIDje22 1272 EF083-2 pirIJQ04371MU
BP_
EF083-2 gil 4 105 02 EF083-2 g793850 lysozyuie Baceniophage r- iiIr~ j >pUA.pv UU61 N-acetylmuramoyl-L-alaline amidase (EC 3.5. 1.28) phage LysA [Bacteriophage inv4j >pirIS38477IS384 77 lytic enzyme IlysA Ilysin [Lactobacillus bacteriophage phi adh] >gnllPlDe 217 3 1 4 a 9.5 50E- 19 187 8.90E- 17~ 117 5.60E- S S CS S S. S S S S S S S S S *5 S 5 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in Genflank and Derwent databases.
EF084-2 gi1229 33 12 (AF008220) YtfP [Bacillus subtilis] >gnljPIDje 185,879 similar 438 1.70E-140 to EF084-2 gi123672 34 (AE000425) hypothetical 43.8 kD protein in rhsB-pit intergenic 167 2.2013-51 EF084-2 gil9 12464 No definition line found fEscherichia coli] 167 6.0013-51 EF084-2 gnljPIDjd10l 12 hypothetical protein [Synechocystis sp.] >pirlS76 6 78IS 76 67 8 151 6.J1OE-42 7 EF084-2 gil 1573954 hypothetical [Haemophilus influenzael >pirlG64 161 IG64 161 142 2.90E-40 EF085-2 gil 1209527 protein histidine kinase [Enterococcus faecalis] 2023 18.0012-279 EF085-2 giJ4670 5 7 phoR; B2168_C3_247 [Mycobacterium leprae] 226 8.80E-23 ___________>pirj5729051S72905 EF085-2 gnlPIDIel 19229 SenX3 [Mycobacterium, bovis BCG] 222 3. 10E322 EF085-2 gnljP1Dje255l52 unknown [Mycobacterium tuberculosis] >gnlPIDle3215 4 6 222 3.I1OE-22 SenlX3____ EF085-2 gill 77 848 5 PcoS homnolog [Escherichia coli] >gil 1786783 (AE0001 62) M480; 1l1 3.8013-16 This__ EF085-2 gi1149296 phosphate regulatory protein phoR (gtg start codon) [Klebsiella 110 1.40E-14 EF085-2 gil5811 88 phoR gene product (AA 143 1) [Escherichia colij >gill 6 5 7 5 96 103 5.30E-14 EF085-2 6-11143331 alkaline phosphatase regulatory protein [Bacillus subtilis] 118 4.9013-13 EF085-2. gil537239 alternate gene name phoM; CG Site No. 395 [Escherichia colij 126 9.50E-13 EF085-2 giJ147 2 5 1 phoM [Escherichia coli] >gil809670 phoM protein (I is 3rd 126 9.5013-13 base in EF085-2 giJ2 182992 histidine kinase [Lactococcus lactis cremoris] 109 O.9E-12 1111' 949 4 4 4 4 9 4 4 4 4. 4* 9* 9 *4 9. 4 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
I
EF086-21 giJ4377 06 lalternative truncated translation product from E.coli 3.00E-54 ~1 jIStreptococcus I EF086-2 jgi143770 5 jhyaluronidase [Streptococcus pneumoniael 221 41.60E5 EF086-2 Igi1595 847 Ihyaluronate lyase [Streptococcus agalactiae] 203 3.30E-44 I 1371A55 1371 1.t n AAV Ae) EF086&2 EF086-22 FEF087-2 EF087-2 EF087-2 EF087-2 EF087-2 EF088-2 EF088-2 EF088-2 gi1705 406 gi(5620 86 gi14377 06 gil437705 giJ59 58 47 gi1705 406 giJ56 2 086 gij43 7706 giJ437705 giJS958 4 7 hyaluronate lyase [Staphylococcus aureus] hyaluronidase [Propoibacterium acnes) alternative truncated translation product frm E.coli 198 6.OOE-27 3.OOE-54 [StreptocoCCUS hyaluronidase [Streptococcus pneurnoniae] 221 1 .60E-53 hyaluronate lyase [Streptococcus agalactiael 203 3.30E- 44 >pirjA55137jA55 137____ hyaluronate lyase [Staphylococcus aureus] 191 3.40E-42 hyaluronidase [Propionibacterium acnes] 198 6.OOE-27 alternative truncated translation productfr E.coli 221 3.OOE-54 [S treptococcus hyaluronidase [Streptococcus pneumnoniae] hyaluronate lyase [Streptococcus agalactiael 221 203 1.60E-53 1.30E-44 EF088-21 giJ705 4 0 6 hyaluronate lyase Staphylococcus aureus] 1 91 3.40E-42 EF088-21 gij562 086 hyaluronidase [Prop ion ibacteriurn acnes] 198 6.OOE-27 EF091-2 gi5 6 l iia opatwtrsrs roteins; ORF2 [Bacillus subtilis] 198 5.50E-21 EF091- 1 i235 3333 j(AF 16513) Ce-LEA [Caenorhabditis elegans] 189 2.40E-17 9 9 9 S p a 9~ *9 St 9 a. 9 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF091-2 gnt1PIDje353216 seed maturation protein homolog [Arabidopsis thaliana] 146 3.6013-11I EF091-2 gill 161171 late embryogenesis abundant protein [Picea glauca] 132 5.70E-l II EF091-2 pirlS049091SO 49 embryonic protein DC8 (clone 8/10) carrot 127 6.5OE-l II EF092-2 gJ1268989 8 (AE000792) PTS system, cellobiose-specific ti13 component 145 4.OOE-27 ~~(celA) EF092-2 gn1lPlDjdl0204 B. subtilis, cellobiose phosphotransferase system, celA; 116 1.40E-26 8 EF096-2 gJ11 4 7 329 transport protein [Escherichia coli] >gnljPlDjd 10 15409 532 2.I1OE-91 EF096-2 gil 1573475 spermidine/putrescine-binding periplasmic protein precursor 527 1.I1OE-79 (potD) EF096-2 gi1157 480 3 spermidine/putrescine-bindiflg periplasmic protein precursor 468 1.6013-75 EF096- il 10142681 1Lpp38 [Pasteurella haernolytica) 446 4.40E,72 EF096-2 gnIjPlD~d10152 Putrescine transport protein PotF [Escherichia coli] 216 1.5013-54 6 EF096-2 gi1l 4 73 34 periplasmic putrescine binding protein [Escherichia coli] 216 2.I1OE-53 EF096-2 giJ2688 565 (AE001 165) spermidine/putrescine ABC transporter, 240 2.OOE-48 EF096-2 gi1881 733 PotD [Salmnonella typhimuriumn] 253 2.7012-28 EF096-2 gnIjPIDldlOl92 spermidine/putrescine-binding periplasmic protein 243 4.20E-26 6 EF096-2 gnlPlDle]5254 3 potE gene product [Clostridium perfringcns] 204 3.30E-21 EF097-2 gi1622 99 1 mannitol -transport protein [Bacillus stearothermophilus] 547 4.90E-93 EF097-2 gi14203 4 mannitol permease [Escherichia coli] >gi4 66737 mannitol- 535 5.50E-85 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF097-2 giJ633 6 5O enzyme 11(mannitol) [Staphylococcus camosusl 516 2.I1OE-82 ~~>pirIS68 1931522385 EF097-2 giJ8 8246 2 protein-N(pi)-phosphohistidifle-ugar phosphotransferase 509 3.OOE-76 [Escherichia EF097-2 giI3 12763 protein-N(pi)-phoSphohisti dinle-sugar phosphotransferase 357 7.50E-70 EF097-2 gnIlPIDIdlO09 6 homologue of mannitol transport protein of B. 492 3. 1 OEB-62 6 EF097-2 gnlIPIDjd1079 mannitol-specific phophotransferaSe enzyme 11 [Bacillus 484 5.20E-61 2 1 EF097-2 gi11 6 7 3 855 (AEOOOO2O) Mycoplasma pneumoniae, PTS system mannitol- 232 3.50E-\59 specific EF097-2 gn]IPIDIdlOO 6 5 phosphotransferase enzymell, niannitol-specific [Mycoplasma 158 8.20E- 18 1 EF097-2 pirIS77757IS777 phosphotransferase system enzyme 11 (EC 2.7.1.69), 103 2.OOE- 13 EF 100-2 giJ20 5 8546 ComYC [IStreptococcus gordonii] 193 7.30E-27 EF 100-2 gi1205 85 46 ComYC Streptococcus gordonil 193 -7.30E-27 EFI100-2 giJ142708 comnG3 gene product [Bacillus subtilis] >gnljP1Dje1 185739 150 2.90E-22 comGC EF 100-2 gil42708 comG3 gene product [Bacillus subtilis] >gnljPlDje1 185739 150 2.90E-22 comGC___ EFI100-2 gi114 843 7' secretory component [Erwinia chrysanthemui] 134 4.40E- I>pirE 702 IIE4702 D ectic____
S
*5* a S. S
S
S S S S a S. 5 5 a a a. a a S S *5 S a Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank EFI100-2 giJ148 43 7 secretory component [Erwinia chrysanthemi] 134 4.40E- >pirIE4702 1 JE47021I EFJOO-2 giJ6062 62 ORE-o 145 [Escherichia coli] >gil69370 6 HopG [Escherichia 136 9.1013-13 EF 100-2 gi16062 62 ORF-o 145 [Escherichiacoli] >gil6937 0 6 HopG [Eschericbia 136 9.IOE-.13 coli] EFl100-2 giJ388 28 ExeG gene product [Aeromonas hydrophila] 132 3.50E-12 >pirlS229 101149905 protein EF100-2 giJ388 28 ExeG gene product [Aeromonas hydrophila] 132 3.50E-12 >pr S229 101149905_protein____ EFIOO-2 2gnlIlDjel 17259 etpG [Escherichia coli] 131 5.1013-12.
and Derwent databases.
EF 100-2 EFIOO-2 gil 42189
IL
Wietpu13 I 9.90Ec-ac12 ~1 outG gene product [Erwinia carotovora] >pirIS328 6 l1S 32 86 1 outG EFIOO-2t giJ4218 9 outG gene product [Erwinia carotovora] >pirJS3 2 8 6
IIS
32 8 6 1 outG putative [Vibrio cholerae] 130 9.90E- 12 128 9l.90E- 12 EF 10-2~ EF 100-2 EF 101-2 EFIOI-2 EFI11-2 EF 110-2 giJ609 6 8 gi160962 8 gn1lP1DId1 0257 3 IgglIPIDle32 1943 gnilPIDle1 18502 12 6013- 11 putative [ibrio cholerae] bacG [Enterococcus faecalis] hypothetical protein [Enterococcus faecalis] >gn1PlDje32 1943 similar to hypothetical proteins from B. subtilis [Bacillus Staphylococcal serine proteinase homologue [Enterococcus fahecalis] 106 31.60E-17 105 113 3.60E- 17 1.80E-16 1462 2.30E-195 1462 2.3013- 195 S 4 4 4*4 4 9 *4*4 9* S 9~ p
S
S p 9 4 9 *4 44* S. 54 4 p *4 S S S Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank EFi 110-2 gnlIDIdlO0lO glutamic acid specific protease prepropeptide [Staphylococcus 106 3.7013-14 8 EF 10-2 gil46687_ preproenzyme (AA -68 to 268) [Staphylococcus, aureus] 106 6.70E-14 EFI 11-2 giJ6060lS ORF o783 [Escherichia coli] >gi11789 46 2 (AE000390) 477 8. 1OE-80 _________hypothetical_88.3 1_ 1 1 An I -7z] and Derwent databases.
EF 121-2 EF 121-2 EF 12 1-2 EF 121-2 EF 121-2 EF 121-2 EF 121-2 EF 121-2 gi12 626 826 giJ4845 3 giJ75 7 84 2 g11177 3 162 giJ4795O gi17 4 7 9 1 3 giJ627 72 J..3VLL n.K 1 tI* 1ki' 4L'Slfll. eI I i ilrt M, VW a" m f
I
(AE000532) 2'3cci-uloie2-popoiseae(pB 413i Z.OUt- 5'-nucleotidase [Vibrio parahaemolyticus) >gnljPIDjd 100 12 18 UDP-sugar hydrolase [Ecricha olii....
7 39 1,.60FE-44 UDPsu r hdroaseprecursor [Escherip~acolil >gill786~Z. 239
I
precursor polypeptide (AA -25 to 525) [Salmonella typhimurium] 2',3 '-cyclic-nucleotide 2'-phosphodiesterase [Yersiia, 5'-nucleotidase [Discopyge ominata] >pir15 195641Sl1 9 5 64 5'nucleotidase 2' 3' cyclic-nucleotide 2'-pbosphodiesterase (cpdB) 2',T-cyclic-nucleotide 2'-phosphodiesterase [Escherichia colil 1 .60E-44 2I IOE-41 I I 4 .70E-3 6 4.70E-36 EF121-21 gi15 7357 3 137 5.80E-35 114 8.90E-34 110 E;I OE-31 EF12 1-: EF 121-: EF 121-: 2 giJ537 054 bbsj 1359 15 5l..nucleotidaseglycosylphosphatidylinositol-anclhored protein 128 7-70E-29 {EC 5'-nucleotidase [Boophilus microplus] 104 1.60E-28 21 gil 17-3 74,43 1 1.315 0. 1 ULL-/-O EF 12 EF 12 1-I v1Ii1J21 5)flU1ttuLIUorvetyie- 1-1) -I I- p I ceuaeDrcso I1* 2 9Q( 4 9 7 83 jecto-5'-nucleotidase [Mus musculus] >pirli C200 I IJC200 1 1-50 1. 1 VrI-1- 9 a 9 a a 9* .9 *9 9 a 9* a a *9 a 9 9 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EFl121-2 gi1 23897 5'-nucleotidase [H-omosapiens] >pirIS I 10321S1 1032 133 1.60E-27 ___________nucleotidase EF122-2 giJ262 68 26 YfkN [Bacillus subtilis] >gnIPIDiell182774 similar to 143 1.30E-96 EF122-2. g1231 3 187 (AE000532) 2',3'-cyclic-nucleotide 2'-phosphodiesterase (cp B) 413 2.60E-82 EF122-2 gi14 8453 5'-nucleotidase [Vibrio parahaemolyticus] >gnlPDIdlO01218 279 8.50E-47 EF 122-2 giJ757 8 4 2 UDP-sugar hydrolase [Escherichia, colij 239 1.60E-44 EF122-2 giII77 3 1 6 2 UDP-sugar hydrolase precursor_[Escherichia coli] >gil 17 86 687 239 1.60E-44 EF 122-2 giJ4795O precursor polypeptide (AA -25 to 525) [Salmonella 229 2. 1OE-41 typhimurium] EF 122-2 giJ7479 13 2',3'-cyclic-nucleotide- 2'-phosphodiesterase [Yersinia, 115 4.70E-36 EF 122-2 giJ6 2 7 7 2 5'-nucleotidase [Discopyge ommata] >pir1l95 6 4 I1 9 5 64 137 5.80E-35 nucleotidase EF 122-2 giJIS 73 5 73 2',3'-cyclic-nucleo tide 2'-phosphodiesterase (cpdB) 114 8.90E-34 Haemophilus EF 122-2 giJ53705 4 2',3'-cyclic-nucleotide 2'-phosphodiesterase [Escherichia coli] 110 1. 1 OE-31I EF 122-2 bbsl 135915 5'-nucl eotidase= glyco sylphosphati dylinosi to Iancho red protein 128 7.70E-29 EC EF 122-_2 gi117 3 7 443 5'-nucleotidase [Boophilus microplus] 104 -1.*60E-28 EF 122-2 EF 122-2 EF122-2 gi120 255 1 giJ3497 83 giJ23 897 5'-nucleotidase precursor jz- J) tnatu norve -ua ecto-5'nuceotidase[M~us uculus >irJC200JC 2 001 1361
I
.LIOE-27 1 .6013-27 5'-nucleotidase [Homo sapiens] >pirIS I 10321S1 11032 5'nucleotidase (EC I ~,iU EF12 "-9 1I A~i$J~ P54 protein I7Enterococcus faecum >irlS55421S05542 b J U YAUn- /Y I a
I
9 .9 9*9 9
S
9 S
S
S 9. 9**
S
*5 5 9 9 S *9 55 S S 5*9 S* S S Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank ___________hypothetical EF 129-2 guS 12521 usp 45 gene product [Lactococcus lactis] >pirlJNOO97IJN0097 374 1.3013-42 secreted EF 129-2 gi114952 5 secreted protein [Lactococcus lactis] 371 3.60E-42 EF129-2 gnhI]PlDle3l3O22 hypothetical protein [Bacillus subtilis] >gn1P1De 118616 8 317 2.30E-33 EF130-2 giJ 4 8833 9 aipha-amylase [unidentified cloning vector] 621 6.70E-81 EF130-2 gi148833 6 ORF [unidentified cloning vector] 242 8.OOE-27 EF130-2 bbsl 112518 aipha-arnylase {N-tenninal region Artificial sequence, Peptide 237 4.80E-26 EF130-2 gn1,IPIDle28914 4 YwPE [Bacillus subtilis] >gnlIlDjel 1184540 ywpE [Bacillus 129 5.404-l 1 EF131-2 gnljPIDje 11 8 5 2 8 penicillin-binding protein [Bacillus subtilis] 277 7.40E-43 EF131-2 giJ488 33 0 alpha-amylase [unidentified cloninvetr 280 1.30E-731 EF 13 1-2 gi15092 49 No definition line found [Lactobacillus plantarum] 274 11 1.3 EFI131-2 gn]IPID~dl0249 (AB009635) Fmt [Staphylococcus aureus] 170 5.60E-20 EF 131-2 guS 15050 DD-peptidase precursor [Streptomyces'lividans] 131 2.30E- 14 ____________>piTIS48220IS48220 EF13 1-2 giII53448 serine DD-p ptidase [Streptomyces lividans] 131 1.20E- 12 EF132-2 giIl 53 826 adhesin B [Streptococcus sanguis] >pirlA435831A4 35 83 adhesin 1257 2730E- 166
B
EF 132-2, gil 1184932 ScbA [Streptococcus crista] 1248 3.70E- 165 I 14! J.jur,*10z and Derwent databases.
EF132-2 EF 132-2 ~EF 132-2 giJ3 10633 gil393269 gijl.5750 3 0 oAhjoin r~tr~t~-nccii~ Qnrdoniil 1 /14 3. 1LF_ iO.U st. T Von I 1204 .163 adhesion protein [Streptococcus pnunme 1204 3.40Et-t I 1220 2AUL-1~l surface adhesin A prcro Srpoccu nuoie 122
I
5 5 S S S S S S S S S 55 S SS S S S *5 55 5 S Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF13272 EF13 2-2 EF 132-2 EF 132.2 EF 132-2 EF 132-2 EF 132-2 EF 132-2 EF132-2 EF 132-2 Query EF003-2 EF003-2 EF006-2 EF006-2 EF008-2 EF008-2 FOO8-2 EF008-2 EF009-2 gill153 83 4 gill 117994 gi14930l 7 gnl1PlDle2555 2 9 gi112 45464 gil 1573330 g117550 7 5 gnljPlDel 118595 gi1177 79 33 gil790 546 adhesin specific for salivary pellicle of dental surfaces 1203 4.80E-159 surface antigen A variant precursor [Streptococcus pneumoniael 1191 2.OOE-1 57 endocarditis specific antigen [Enterococcus faecalisl 931 3.70E-122 lipoprotein [Staphylococcus epidermiidis] 453 3 .20E-92 YfeA [Yersinia pestis] >gil 2 454 64 YfeA [Yersinia pestis] 364 3.60E-64 adhesin B precursor (fim.A) [Haemophilus influenzae] 349 3.50E- 63 per-iplasmic-bindiing protein [Synechocystis sp 326 6. 80E-62 >gnllPIDld 10 1 6 5 2 Mn [similar to ABC transporter (membrane protein) [Bacillus 174 3.10E-32 ~TroA [Treponema pallidum) 171 3.40E-32 Trm I [Treponema palliduml 171 5.10E-32 Derwent Gene Description
IBLAST
Derwven t Access. No.
W20909 W20166 W20909 W20 166 R37495 W26367 R79722 W22134 W20909 pylori outer membrane protein 14gel 0705orf5.
Helicobacter pyloriouter membrane protein, 1 6225006.aa.
H. pylori outer membrane protein 14gelO0705orfS.
Helicobacter pyloni outer membrane protein, 1 6225006.aa.
Pneuniococcal fimbrial protein A.
Staphylococcus aureus saliva binding protein.
ROM precursor
TROMPI.
Treponema pallidum rare outer membrane protein (TROMP-I).
H. pylori outer membrane protein 14ge1705orf5.
268 241 283 266 967 467 181
BLAST
P-Value 4.20E-39 3 .OOE-27 1 .20E-48 1IAOE-30 1 .20E-1 27 7.50E-100 8.00E-36 8.OOE-36 1 .40E-53 9 9 S 0 0 4 5 S S 0t S S Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in Genflank and Derwent databases.
EF009-2 W20 166 Helicobacter pylori outer membrane protein, 16225006.aa. 278 2.5012-32 EFO 12-2 R48035 Hyaluronic acid synthase of Streptococcus equisimilis. 227 3.20E-69 EF014-2 W 14070 S.thermophilus exopolysaccharide biosynthesis protein EpsR. 103 5.90E- 19 EF014-2 W22 169 S.thermophilus exopolysaccharide synthesis operon epsA gene 103 7.30E-18 EF016-2 W 15799 Adherence factor 104R of Lactobacillus fennentum. 157 9.60E-22 EF016-2 W 15793 Adherence factor consensus sequence. 103 1.00 E- I I EF017-2 R48035 H-yaluronic acid synthase of Streptococcus equisimilis. 241 8.90E-71 EF021-2 R31013 P39-alpha. 141 1.60E-19 EF021-2 R33280 P39-beta. 134 7.OOE- 14 EF022-2 R48035 Hyaluronic acid synthase, of Streptococcus eguisimilis. 324 2.2013-65 EF023-2 R48035 Hyaluronic acid synthase of Streptococcus equisimilis. 155 9.90E 33 EF023-2 R70152 Streptococcus; pneunioniae strain SPRU98 PIpA. 125 5.90E- 17 EF027-2 R48035 H-yaluronic acid synthase of Streptococcus equisimilis. 233 2.20E-34 EF028-2 W17830 Thermophilic alkaline phosphatase. 202 7.70E-59 EF028-2 WI 11568 E.coli alkaline phosphatase mutant D I53H/Q329A. 182 7.90E-56 EF028-2 W 11570 Ecoli alkaline phosphatase mutant D I53H/K328H/Q329A. 182 7.90E-56 EF028-2 W26300 E.coli alkaline phosphatase mutant 182 1.l1OE-55 ________D153H/K3281Q329A1D330H.
EF028-2 WI 11565 -E.coli alkaline phosphatase mutant D IS3H/K328141D330A. 182 3.I1OE-55 EF028-2 WI 11557 E.coli alkaline phosphatase mutant D I 3HID33ON. 182 4.3013-5 EF028-2 WI 1561 E.coli alkaline phosphatase mutant D153H/D330A. 182 4.30E3-55 EF028-21 WI 1555 E.coli alkaline phosphatase mutant D153H/K328E/D330N. 182 4.7013-55 e a a a. a Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF028-2 WI 1566 E.coli alkaline phosphatase mutant DlI 531-11K3281D330L. 182 1 .20E-54 EF028-2 Wi 1569 E.coli alkaline phosphatase mutant K328H1Q329A. 180 1 .7011-54 [EF028-2 WI 1562 E.coli alkaline phosphatase mutant D I53H1D330L. 182 1.7013-54 EF028-21 R26980 Fv(FRP5)-phoA recombinant antibody. 174 1.90E-54 EF028-21 Wi 1567 E.coli alkaline phosphatase mutant Q329A. 179 2.30E-54 EF028-2 Wi11558 E.coli alkaline phosphatase mutant K328H1D330N. 176 6.40E-54 EF028-2 WI 11563 IE.coli alkaline phosphatase mutant K328H1D330A. 176 6.40E-54 EF029-2 R10044 Plasmid pOW360 encoded Human Growth Hormone (HGH) 320 13.50E-40 nuclease A EF029-2 R10041 Plasmid pOW350 nuclease A product. 320 4.3013-40 EF029-2 R73997 Staphylococcus aureus (Foggi) nuclease signal and mature 320 5.60E-40 sequences.___ EF029-2 R10043 Plasmid pOW360 encoding Human Growth Hormone (HGH) 320 2.90E-38 nuclease_____ EF030-2 R48035 Hyaluronic acid synthase of Streptococcus eguisimilis. 277 6.I1OE-47 EFO4O--2 R59077 2-5A-dependent KNA-ase. 105 1.90E- 18 EF040-2 W12703 Mouse 2-5A-dcpendent RNase. 105 1.90E- 18 EF040-2 R82661 Partial murine 2-A-dependent RNase. 105 1.9013-18 EF041-2 R48035 1-lyaluronic acid synthase of Streptococcus euisimilis. 225 6.30E-26 -IQr Q nnA:;--A EF054-2 EF054-2 R26042 R85782 OCIDII f Ir 232 OE-24- Group B Streptococcal mutant beta antigen without IgA binding domain.
GopBStreptococcal wild-type beta antigen.
232 3.30E_24
EFC
-1 1? R 8 1 .1 J 4 *4* 4 U* 9 4 *44 4.4 4 4 4 4 *4 44 4 ~4S** 4. 4* 9 4 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in Genl~ank and Derwent databases.
EF054-2 P91941 Sequence of preprospasmolysin. 204 3.lIOE-19 EF054-2 W32519 Collagen-like polypeptide SEQ ID NO:2. 180 7.50E- 18 EF054-2 Wi 12324 Silver halide emulsion protein monomeric repeat unit 180 7.50E- 18 EF054-21 W32522 Collagen-like polypeptide SEQ ID NO:5. 192 1.60E- 17 EF054-2 W 12327 Silver halide emulsion protein monomeric repeat unit 192 1.60E-17 EF054-2 W32520 Collagen-like polypeptide SEQ ID NO:3. 189 2.40E- 17 EF054-2 W32532 Collagen-like polypeptide, SEQ ID NO: 15. 189 2.40E-17 EF054-2 W 12325 Silver halide emulsion protein monomeric repeat unit 189 2.40E-17 EF054-2 W12337 Silver halide emulsion protein monomeric repeat unit 15. 189 2.40E- 17 EF054-2 W 12341 Silver halide emulsion FLAG(RTM)-tagged protein 189 2.60E- 17 EF054-2 W02098 S. mutans antigen 1/11. 161 5.40Ei15 EF054-2 W02096 S. mutans antigen 1/11 fragment (aa803-1 114). 161 1.90E-13 EF059-2 R.26042 P. yoelii SSP2 antigen. 344 1.90E-39 EF059-2 R85782 Group B Streptococcal mutant beta antigen without IgA binding 232 1. 1 OE-26 domain._____ EF059-2 R85781 Group B Streptococcal wild-type beta antigen. 232 l.70E-26 EF059-2 P91941 Sequence of preprospasmolysin. 200 .1.50E-18 EF059-2 P60570 Sequence of the Falciparuin Interspersed Repeat Antigen 186 4.60E- 18 EF059-2 W02096 S. mutans antigen 1/Il fragment (aa803-1 114). 167 8.20E-16 EF059-2 W02098 S. mutans antigen 1/11. 167 4.90E- EF059-2 R79625 Endocarditis specific antigen region. 147 4.40E- 12 EF059-21 R26049 MSF precursor. 143 1.30E-lIl JEF059-21 R28 150 ISugar beet chitinase 1. 148 l .70E-1I1 400 a a
S
*4* 0 0 a.
04 0 a a a *0* 0 040 *5 9 0 0* a Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF059-2 R26842 Protease from S. Aureus ATCC 12600. 147 2.10E-1 I EF059-2 R79643 Inununodominant antigen of Streptococcus sobrinus. 151 2.IOE-1 I EF059-2 W07539 Collagen like protein 146 3.00E-1I1 EF06 1-2 R26042 P. y~elii SSP2 antigen. 241 1.70E-25_ EFO61:-2 P60570 Sequence of the Falciparum, Interspersed Repeat Antigen 199 1.60E-18 EF061-2 R85782 Group B Streptococcal mutant beta antigen without IgA binding 153 2.40E-14 Sdomain, EF061-2 R85781 Group B Streptococcal wild-type beta antigen. 153 3.60E- 14 EF06 1-2 P91941 Sequence of preprospasmolysin. 163 9.7013-14 EF061-2 P83194 Sequence of a bioadhesive precursor protein encoded by cDNA 156 7.90E- 13 Sclone EF06 1-2 R28 150 Sugar beet chitinase 1. 156 9. 10OF- 3 EF061-2 W02096 S. mutans antigen 1/11 fragment_(aa803-1 114). 148 1.20E-12_ EF061-2 P82971 Bioadhesive precursor protein from cDNA 52. 148 9.70E- 12 EF061-2 W02098 S. mutans antigen 1/1. 148 1.50E-1-1 EF062-2 W02098 S. mutans antigen 1/11. 107 1.20E-36 EF062-2 R79643 Imniunodominant antigen of Streptococcus sobrinus. 132 3.OOE-36_ EF063-2 W02098 S. mutans antigen 1/1l. 107 1.20E-36.
EF063-2 R79643 Immunodominant antigen of Streptococcus sobrinus. 132 3.OOE-36 EF064-21 W02098 S. mutans antigen 1/H1. _107 1.20E-36 EF064-2 R79643 Inmunodominant antigen of Streptococcus sobrinus. 132 3.0013-36 EF071-2 R85294 Phage R1I-t LytR lysin. 127 3.70E-38 EF071-2 R91515 IListeria phage lysin PLY51 1. 273 4.70E-37 gC* CC. C C A
C
*dg
SOC
go
CC
eeg g C C CO C Ce C C C SC C C g CS C SC g
C..
o* Ce Co e 0 0^0 go CC S Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank EF075-2 W 14070 S.thermophilus exopolysaccharide biosynthesis protein EpsR. 239 4.20E-36 EF075-2 W22 169 S.thermophilus exopolysaccharide synthesis operon epsA gene 239 4.OOE-34 EF077-2_ R97280 Helicobac ter-specific ATPase 439. 258 4.10OE-74 EF077-2 R48036 Mycobacteriurn BCG immunogen. 192 2.20E-67 EF077-2 W06712 Helicobacter-specific ATPase 948 (ORF-4). 220 2.50E-67 EF077-2 R70419 Rat homologue of human Wilson disease gene ATP7B. 186 9.80E-54 EF077-2 R72343 Wilson disease protein ATP7B3. 176 6.70E-40 EF077-2 R06376 Product of the ssc 1 gene. 166 3.I1OE-28 EF077-2 R75396 Flea sodium pump alpha subunit. 146 2.40E-25 EF077-2 W20891 pylori transporter protein, l14ce22l9orfl. 15,6 8.60E-14] EF078-2 R56667 Bacteroides fagilis RprX regulatory response protein. 148 8.0-1 EF078-2 R74630 Tomato TGETRI ethylene response protein. 130 7.80E- 13 EF078-2 R69849 Ethylene response (ETR) gene product. 128 1 .70E-1 1 EF078-2 R69850 Ethylene response (ETR) mutant protein etrl-1. 128 1.70E-l I EF078-2 R69851 Ethylene response (ETR) mutant protein etrl-2. 128 1.70E-1lI EF078-2 R69852 Ethylene response (ETR) mutant protein eirl1-3. 128 1 .70E-1 I EF078-2 R69853 Ethylene response (ETR) mutant protein etrl-4. 128 1. 70E- I I EF078-2 R24296 Regulatory protein VanS involved in glycopeptide resistance. 142 2.70E-l I1 EFOSI-2 R27253 Penicillin binding protein PBP2A-epi. 101 4.70E- 16 EF081-21 R27256 Penicillin binding protein PBP2A-27R. 101 6.OOE- EF081-21 R27257 Penicillin binding protein derivative N 1. '101 6.20E- EF081-21 R27258 Penicillin binding protein derivative 101 ,6.20E- and Derwent databases.
S. S
S..
06 S S 6 esS S 5 S S 0 5 S 0 0 Se S. 6 S
S
*5@ 0Se0O@ S 5 0 S 055655 5S @0 0 0 *5 SeeS.
066eOSO 0@ Se 5 5 S Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
E F081-2 R27259 Penicillin binding protein derivative 101 6.20E-15 EF081-2 R27260 Penicillin binding protein derivative 101 6.20E- EF081-2 R27261 Penicillin binding protein derivative 101 6.20E-l15 EF081-2 R.27263 Penicillin binding protein derivative 101 6.20E- EF081-2 .R27264 Penicillin binding protein derivative 101 6.20E-15_ EF081-2 R27262 Penicillin binding protein derivative 101 6.5013-15 EF081-21 R30845 Sequence encoded by the mec A gene. 101 6.90E- EF081-2 R27255 Penicillin binding protein PBP2A-27R. 101 6.90 E- EF081-2 R31216 Penicillin binding protein PBP2A-27R. 101 7.OOE-15 EFI 10-2 R91042 V8 mature protease (aa 1-213). 106 6.60E-l16 EF 10-2 R91043 V8 mature protease (aa 1-214). 106 16 EFl 110-2 R91044 V8 mature protease (aa 1-2 15). 106 7.80E- 16 EFi 10-2 R26842 Protease from S. Aureus ATCCl12600. 106 6.70 E- EF 10-2 R29644 Protease from S. Aureus. 106 1.20E-l14 EFI110-2 W22218 Protein encoded by pV8RPT(-) construct. 106 7.6013-14 EF 110-2 R91033 Beta-galactosidase-V8 protease fusion protein. 106 7.60E- 14 F1 10-2 R91034 Beta-galactosidase-V8 protease fusion protein. 106 l1.70E- 13 EF 110-2 W22219 Protein encoded by pV8D construct. 106 7.60E- 13 EFI 10-2 R91035 Recombinant V8 protease V8D fusion protein. 106 7.60E- 13 EF 10-2 W22220 Protein encoded by pV8F construct. 106 -7.90E- 13 EF 129-2 R14530 Usp 5 protein. 374 2.40E-43 EF129-2, R14 150 MSP encoded by pUCRS (DSM 5803). .372 4.70E-43 EF 131-21 R37495 Pneumococcal fimbrial protein A. 1185 6.80E-163 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
I I I E131-2 EFF131-21 W26367 R79722 Staphylococcus aueus saliva bindin rotein. 418 ROM precursor TROMP 1.
Treronema pallidurn rare outer membrane protein TROMP-i) 171 171 19.OOE-31 W22134 EF 131-2 111 288 TABLE 3. Conservative Amino Acid Substitutions.
Aromatic Hydrophobic Polar Basic Acidic Small Phenylaanine Tryptophan Tyrosine Leucine Isoleucine Valine Glutamine Asparagine Arginine Lysine Histidine Aspartic Acid Glutamic Acid Alanine Serine Threonine Methionine Glycine 9* 289 Table 4. Residues Comprising Antigenic Epitope-Bearing Portion.
EFOO 1-2 from about Asp-i15O to about Lys- 152, from about Ser-256 to about.Tyr- 259, from about Lys-360 to about Lys-363, from about Asn-406 to about EF002-2 from about Asp-80 to about Asp-83, from about Asp-281 to about Gly- 283.
EF003-2 from about Asn-263 to about Gly-266.
EF004-2 from about Asn-23 to about Asn-26, from about Lys-83 to about Ser-87, from about Tyr- 154 to aboiit-Asp- 159.
EF005-2 from about Lys-249 to about Glu-252.
EF006-2 from about Gly-23 to about Asp-28.
EF008-2 from about Thr-92 to about Gly-94, from about Pro- 161 to about Asp- 165, from about Gly-287 to about Thr-289.
EFO 1 0-2 from about Pro- 129 to about Asn- 13 1.
EFO 12-2 from about Asp-77 to about Asp-79, from about Asp-94 to about Lys-98, from about Asp-256 to about Thr-258,- from about Glu-461 to about Asn- 468.
EF013-2 from about Thr-30 to about Asp-32, from about Glu-73 to about from about Gln-1 64 to about Asn- 166, from about Lys- 193 to about Gly- 195.
EFOI14-2 from about Ser-203 to about Asp-206, from about Gin-3 14 to about Gly- 316 EFO 15-2 from about Pro-66 to about Gly-69.
EF016-2 from about Lys-236 to about Asn-239.
EF017-2 from about Ser-90 to about Gly-93, from about Thr-1 97 to about Lys- 199, from about Lys-230 to about Asn-233, from about Ser-428 to about GIY-431.
EF018-2 from about Lys-159 to about Tyr-161, from about Asn-I 65 to about Ser- 167, from about Asn-250 to about Arg-256, from about Asn-392 to about from about Lys-416 to about Tyr-418, from about Asn-428 to a 290 Table 4. Residues Coinprising Antigenic Epitope-Bearing Portion.
Arg-430.
EFOI19-2 from about Arg-209 to about Ser-21 1, from about Lys-287 to about Ser- 290.
EF020-2 from about Lys-57 to about Asn-62.
EF021 -2 from about Ser-33 to about Gly-35, from about Glu-77 to about Gly-8 1, from about Asp- 139 to about Lys- 4 1, fromn about Glu-255 to about Ser- 258, from about Gln-27 I to about Tyr-277.
EF023-2 from about Lys-232 to about Asp-234, from about Arg-304 to about Gly- 306, from about Thr-453 to about Arg-456, from about Ser-478 to about Thr-480.
EF025-2 from about Arg- 18' to about Asp- 18 EF026-2 from about Ser-25 to about Asp-30, from about Asp-90 to about Asp-94, about Gln- 107 to about Asn-1I EF027-2 from about Gln-72 to about Lys-74, from about Lys-229 to about Asp- 231._ EF028-2 from about Asp-I 86 to about Gln- 188.
EF029-2 from about Asp-Il 18 to about Lys- 122, from about Asp- 124 to about Tyr-1 26.
EF031-2 from about Glu-30 to about Gly-33.
EF034-2 from about Glu-25 to about Gly-27, from about Glu-75 to about Thr-77.
EF36-2 from about Gln- 177 to about Ser- 179.
EF037-2 from about Ser-25 to about Asp-30, from about Asp-90 to about Asp-94, about Gin-]107 to about Asn- 110.
EF038-2 from about Asn-77 to about Lys-79, from about Tyr-88 to about Asn-92.
EF040-2 from about Lys-] 67 to about Gly-1 72, from about Lys-240 to about 291 Table 4. Residues Coml-prising Antigenic Epitope-Bearing Portion.
EF044-2 from about Arg- 192 to about Gly- 194, from about Asn-200 to about. Asn- 203.
EF045-2 from about Asp-159 to about Asn-161, from about His-172 to about Gly- 174, from about Tyr-261 to about Gly-264, from about Lys-305 to about E5046-2 from about Ser-IS8 to about Gly-23, from about Gln-41 to about Ser-47, from about Thr-76 to about Asp-78.
EF047-2 from about Asn-28 to about Asp-30, from about Asp-273 to about Asn- 277.
EF048-2 from about Asp- 13 8 to about Lys- 14 1, from about Asp- 15 2 to about 1-2 from about Asp-73 to about Gly- 76 EF053-2 from about Ser-79 to about Gly-82.
EF055-2 from about Asp-26 to about Gly-28, from about Gln-67 to about Asp-69, from about Arg-71 to about Gly-74, from about Arg-87 to about Gly-89.
EF056-2 from about Arg-71 to about Gly-74, from about Arg-87 to about Gly-89 EF058-2 from about Lys-129 to about Gly-133, from about Gin-571 to about Tyrfrom about Pro-586 to about Gly-59 1.
EF065-2 from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 3 52, from about Lys-415 to about Asn-4 18, from about Arg-446 to about Asp-448, from about Asn-489 to about Lys-491, from about Ser-516 to about Asp-S 18, from about Glu-639 to about Lys-642.
EF066-2 from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 352, from about Lys-415 to about Asn-4 18, from about Arg-446 to about Asp-448, from about Asn-489 to about Lys-49 1, from about Ser-5 16 to Asp-S 18, from about Glu-63 9 to about Lys-642.
EF067-2 from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 352, from about Lys-41 5 to about Asn-4 18, from about Arg-446 to about Asp-448, from about Asn-489 to about Lys-49 1, from about Ser-5 16 to Asp-518, from about Glu-639 to about Lys-642.
292 Table 4. Residues Comprising Antigenic Epitope-Bearing Portion.
EF073-2 from about Met-98 to about Arg-100, from about Arg-I 10 to about Asp- 112 EF074-2 from about SCT-53 to about Tyr-59, from about Ser-86 to about Gly-88, from about Pro-97 to about Gln-l100, from about Gln-230 to about Gly- 232.
EF076-2 from about Asn-38 to about Tyr-40, from about Asp-48 to about Asn-53, about Lys-79 to about Gly-8 1.
EF077-2 from about Arg-41 I to about Gly-4 13.
EF078-2 from about. Thr-294 to about Gly-296, from about Asp-366 to about GIn- 368, from about Glu-524 to about Gly-526.
EF080-2 from about Glu- 164 to about Gly-1 66, from about Ser-206 to about Tyr- 208, from about Lys-239 to about Gly-243.
EF081-2 from about Asn-7 to about Ser- 11, from about Lys-77 to about from about Lys-1 12 to about Asn-1 14, from about Gly- 162 to about Aspfrom about Mrg-I 81 to about Gly- 183.
EF083-2 from about Gin-38 to about EF084-2 from about Lys-140 to about Asp-142, from about Gly-164 to about Mrg- 166, from about Arg-262 to about Gly-264.
EF085-2 from about Asn-95 to about Asp-97, from about Arg-1 12 to about Asp- 11 4, from about Asp-258 to about Ser-260, from-about Arg-40 1 to about EF086-2 from about Pro-i 12 to about Gly-1 15, from about Ser-222 to about Ser- 224, from about Asn-296 to about Gly-299, from about Thr-346 to about from about Asp-428 to about Ser-432.
EF087-2 from about Pro- 112 to about Gly- 15, from about Ser-222 to about Ser- 224, from about Asn-296 to. about Gly-299, from about Thr-346 to about Lys-348, from about Asp-428 to about Ser-432.
EF088-2 from about Pro-I 112 to about Gly-1 15, from about Ser-222 to about Ser- 224, from about Asn-296 to about Gly-299, from about Thr-346 to about from about Asp-428 to about Ser-432.
293 Table 4. Residues Comprising Antigenic Epitope-Bearing Portion.
EF090-2 from about Arg-2 to about EF09 1-2 from about Gln-40 to about Asp-43.
EF093-2 from about Lys-95 to about Gly-97.
EF094-2 from about Asp-314 to about Asp-3 16.
EF095-2 from about Ser-328 to abou -t Thr-330, from about Asp-359 to about Asp- 363, from about Glu-637 to abbut Gly-6 3 9 from about Asn-744 to about Gly-746.I EF096-2 from about Pro-128 to about Asn-130, ftrm about Ser-193 to about Asp- 196.
EF097-2 from about Val-357 to about Gly-3 59.
EF099-2 from about Glu-44 to about Asp-47, from about Lys-] 54 to about Gly- 156, from about Asn-286 to about Asp-289.
EFIO1-2 from about Lys-40 to about Asp-42, from about Pro-255 to about Asnfrom about Lys-288 to about Gly-29O.
EF102-2 from about Asp-314 to about Asp-3 16.
EFi 03-2 from about Asn-46 to about Gly-48.
EFi 04-2 from about Pro-232 to about Lys-237, fromn about Ala-362 to about Asn- 366, from about Ser-421 to about Gly-423, from about Lys-488 to about Ser-490, from about Asp-S 50 to about Asn-552, from about Pro-637 to about Lys-640, from about Asp-727 to about Gly-729, from about Asn- 751 to about Ser-754, from about Lys-77 I to about Asn-774, from about lle-835 to about Asn-837, from about Pro-851 to about Gly-853.
EFI 05-2 from about Ser-40 to about Gly-43, from about Asn-94 to about Gln-97, from about Gln-220 to about Gly-222, from about Asn-263 to about Gly- 265.
EF106-2 from about Asp- 72 to about Gly-75, from about Thr-274 to about Asp- 277, from about Asn-3 10 to about Arg-3 13.
a.
EF107-2 EF1O7-2 1 from about Thr- 155 to about Asn- 157, from about Thr- 189 to about Asp- 294 Table 4. Residues Comprising Antigenlic Epitope-Beanng Portion.
191, from about Arg-270 to about Gly-272, from about Thr-330 to about Lys-335, from about Asp-365 to about Asp-368, from about Pro-AS to about Asp-453, from about Gly-485 to about Thr-488.
a a EF 108-2 from about Lys- 142 to about Trp- 145, from about Thr- 147 to about Tyr- 150, from about Arg-212 to about Gly-214, from about Ser-248 to about 1, from about Asp-3 84 to about Asp-3 87, from about Pro-481 to about Arg-483, from about Lys-491 to about Gly-494, from about Thr- 619 to about Gly-624, from about Asp-656 to about Asp-659, from about Lys-717 to about Asn-721, from about Ser-822 to about Gly- 8 2 4 from about Tyr- 137 to about Thr-1l 14 1.
EFI 110-2 from about Pro-123 to about Gly-127, from about Thr-223 to about Gly- 225.
EFI 11-2 from about Lys-207 to about Asn-209, from about Asp-245 to about Asn-248, from about Lys-396 to about Asp-398, from about Giu-429 to about Ser432, from about Thr-470 to about His474.
EFI 19-2 from about Asp-90 to about Asn-92, from about Gln- 142 to about Gly- 144.
EFl121-2 from about Asn-1 59 to about Asp- 16 1, from about Asn-35 1 to about Lys-353, from about Pro-658 to about Gly-660, from about Lys-786 to Ser-789.
EF122-2 from about Asn-159 to about Asp-161, from about Asn-351 to about Lys-353, from about Pro-658 to about Gly-660, from about Lys-786 to Ser-789.
EF123-2 from about Asn-331 to about Arg-336, from about Asp-634 to about Gly- 636, from about Glu-780 to about Ser-782, from about Tyr-909 to about Asn-9 11, from about Lys-939 to about Glu-942, from about Asp- 1074 to about Gly-1076, from about Asp-1367 to about Gly-1369, from about Pro-I433 to about Lys-1435, from about Gly-1516 to about Asp-iS 18, from about Lys-16S6 to about Asp-1 660, from about Lys- 1860 to about Gln-1863, from about Ser-1916 to about Gln-1919, from about Pro-i940 about Gly-1942.
EF124-2 from about Asn-331 to about Arg-336, from about Asp-634 to about Gly- 636, from about Glu-780 to about Ser-782, from about Tyr-909 to about Asn-91 11, from about Lys-939 to about Glu-942, from about Asp- 1074 to ab out (3ly-10 7 6 from ab out Asp- 13 67 to about Gly- 13 69, from about Pro-1433 to about Lys-1435, from about Gly-1516 to about Asp-IS 18, 295 Table 4. Residues Comprising Antigenic Epitope-Bearing Portion.
from about Lys-1656 to about Asp-1660, from about Lys-1860 to about Gln-1863, from about SeT-1916 to about Gln-1919, from about Pro-1940 to about Gly- 1942.
EF 125-2 from about Asn-331 to about Arg-336, from about Asp-634 to about Gly- 636, from about Glu-780 to about Ser-782, from about Tyr-909 to about Asn-91 1, from about Lys-939 to about GIL-942, from about Asp-1074 to about Gly-1076, from about Asp-1367 to about Gly-1369, from about Pro-1433 to about Lys-1435, from about Gly-15l6 to about Asp-iS 18, from about Lys- 165 6 to about Asp- 1660, from about Lys- 18 60 to about Gln-1863, from about Ser--191.6.to about Gln-1919, from about Pro-1940 about Gly-1942.
EF 126-2 from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 352, from about Lys-415 to about Asn-418, from about Arg-446 to about Asp-448, from about Asn-489 to about Lys-491, from about Ser-516 to Asp-S 18, from about Glu-639 to about Lys-642._ EF 127-2 from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 352, from about Lys-4 15'to, about Asn-4 18, from about Arg-446 to about Asp-448, from about Asn-489 to about Lys-491, from about Ser-516 to about Asp-5 18, from about Glu-639 to about Lys-642.
EF128-2 from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 352, from about Lys-415 to about Asn-418, from about Arg-446 to about Asp-448, from about Asn-489 to about Lys-491, from about Ser-516 to Asp-S 18, from about Gu-639 to about Lys-642.
EF 129-2 from about Asn-300 to about Gly-302, from about Ser-3 16 to about Glyfrom about Asn-385 to about His-387 EF 131-2 from about Lys-20 I to about Tyr-204, from about Glu-263 to about Ser- 266.
EF132-2 from about Thr-26 to about Ser-28.
a.
a a.
a a a .aa.
a a..
a a. a a a a.
296 INDICATIONS RELATING TO A DEPOSITED MICROORGANISM (PCT Rule 13bis) A. The indications made below relate to the microorganism referred to int the description on page 10 lIine 12 B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet Nan., of depositary institution American Type Culture Collection Address of depositary institution (including postal code and country) 10801 University Boulevard Manasas, Virginia 20110a-2209 United Siates of America C. ADDITIONAL INDICATIONS (leave blank ifnot applicabl) This information is continued on an additional sheet E D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) E. SEPARATE FURNISHING OF INDICATIONS (leave blank if(not applicable) The indications listed below will be submitted to the International Bureau later (specify the general nature of the indication. 'A4ccession Number ojDspoasit) For receiving Office use only This sheet v~ eeved with the international application -For International Bureau use only W1 This sheet was received by the International Bureau on: Authorized officer
Claims (29)
1. An isolated polynucleotide comprising the nucleic acid sequence of ORF EF122-1, represented by SEQ ID NO: 453.
2. The isolated polynucleotide of claim 1, wherein said polynucleotide comprises a heterologous polynucleotide sequence.
3. The isolated polynucleotide of claim 2, wherein said heterologous polynucleotide sequence encodes a heterologous polypeptide.
4. A method for making a recombinant vector comprising inserting the isolated polynucleotide of claim 1 into a vector.
5. A nucleic acid sequence complimentary to the polynucleotide of claim 1.
6. A recombinant vector comprising the isolated polynucleotide of claim 1.
7. The recombinant vector of claim 6, wherein said polynucleotide is operably S. associated with a heterologous regulatory sequence that controls gene expression.
8. A recombinant host cell comprising the isolated polynucleotide of claim 1.
9. The recombinant host cell of claim 8, wherein said polynucleotide is operably associated with a heterologous regulatory sequence that controls gene expression. An isolated polynucleotide fragment comprising a nucleic acid sequence which hybridizes under hybridization conditions, comprising hybridization in and 50% formamide at 50-65 0 C and washing in a wash buffer consisting of 0.5XSSC at 50-65°C, to the complementary strand of ORF EF122-1, represented by SEQ ID NO:453. 297
11. The isolated polynucleotide of claim 10, wherein said polynucleotide comprises a heterologous polynucleotide sequence.
12. The isolated polynucleotide of claim 11, wherein said heterologous polynucleotide sequence encodes a heterologous polypeptide.
13. A method for making a recombinant vector comprising inserting the isolated polynucleotide of claim 10 into a vector.
14. A nucleic acid sequence complimentary to the polynucleotide of claim A recombinant vector comprising the isolated polynucleotide of claim
16. The recombinant vector of claim 15, wherein said polynucleotide is operably associated with a heterologous regulatory sequence that controls gene expression.
17. A recombinant host cell comprising the isolated polynucleotide of claim i. 18. The recombinant host cell of claim 17, wherein said polynucleotide is operably associated with a heterologous regulatory sequence that controls gene S. expression.
19. An isolated polynucleotide for the detection of Enterococcus faecalis, wherein said isolated polynucleotide comprises at least 15 contiguous nucleotides of the nucleic acid sequence of ORF EF122-1, represented by SEQ ID NO:453. The isolated polynucleotide of claim 19, wherein said polynucleotide comprises a heterologous polynucleotide sequence.
21. The isolated polynucleotide of claim 20, wherein said heterologous polynucleotide sequence encodes a heterologous polypeptide.
22. A method for making a recombinant vector comprising inserting the isolated polynucleotide of claim 19 into a vector. 298
23. A nucleic acid sequence complimentary to the polynucleotide of claim 19.
24. A recombinant vector comprising the isolated polynucleotide of claim 19. The recombinant vector of claim 24, wherein said polynucleotide is operably associated with a heterologous regulatory sequence that controls gene expression.
26. A recombinant host cell comprising the isolated polynucleotide of claim 19.
27. The recombinant host cell of claim 26, wherein said polynucleotide is operably associated with a heterologous regulatory sequence that controls gene expression.
28. The isolated polynucleotide of claim 19, wherein said isolated polynucleotide comprises at least 18 contiguous nucleotides of the nucleic acid sequence of ORF EF122-1, represented by SEQ ID NO: 453.
29. The isolated polynucleotide of claim 19, wherein said isolated polynucleotide comprises at least 20 contiguous nucleotides of the nucleic acid sequence of ORF EF122-1, represented by SEQ ID NO: 453.
30. The isolated polynucleotide of claim 29, wherein said isolated polynucleotide comprises at least 40 contiguous nucleotides of the nucleic acid sequence of ORF EF122-1, represented by SEQ ID NO: 453.
31. A method for detecting Enterococcus faecalis using the isolated polynucleotide of claim
32. A method for detecting Enterococcus faecalis using the isolated polynucleotide of claim 19.
33. A method for detecting Enterococcus faecalis using the isolated polynucleotide of claim 28. 299
34. A method for detecting Enterococcus faecalis using the isolated polynucleotide of claim 29. A method for detecting Enterococcus faecalis using the isolated polynucleotide of claim Dated this fifteenth day of February 2002. HUMAN GENOME SCIENCES INC Applicant Wray Associates Perth, Western Australia Patent Attorneys for the Applicant see. 300
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU55888/01A AU770915B2 (en) | 1997-05-06 | 2001-07-20 | Enterococcus faecalis polynucleotides and polypeptides |
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US60/044031 | 1997-05-06 | ||
US60/046655 | 1997-05-16 | ||
US60/066009 | 1997-11-14 | ||
AU72793/98A AU735444B2 (en) | 1997-05-06 | 1998-05-04 | (Enterococcus faecalis) polynucleotides and polypeptides |
AU55888/01A AU770915B2 (en) | 1997-05-06 | 2001-07-20 | Enterococcus faecalis polynucleotides and polypeptides |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AU72793/98A Division AU735444B2 (en) | 1997-05-06 | 1998-05-04 | (Enterococcus faecalis) polynucleotides and polypeptides |
Publications (2)
Publication Number | Publication Date |
---|---|
AU5588801A AU5588801A (en) | 2001-09-20 |
AU770915B2 true AU770915B2 (en) | 2004-03-04 |
Family
ID=31982944
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AU55888/01A Expired - Fee Related AU770915B2 (en) | 1997-05-06 | 2001-07-20 | Enterococcus faecalis polynucleotides and polypeptides |
Country Status (1)
Country | Link |
---|---|
AU (1) | AU770915B2 (en) |
-
2001
- 2001-07-20 AU AU55888/01A patent/AU770915B2/en not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
AU5588801A (en) | 2001-09-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU735444B2 (en) | (Enterococcus faecalis) polynucleotides and polypeptides | |
AU2016219667B2 (en) | Antibacterial phage, phage peptides and methods of use thereof | |
KR100923598B1 (en) | Surface Proteins of Streptococcus pyogenes | |
KR100886095B1 (en) | Novel Streptococcus pneumoniae open reading frames encoding polypeptide antigens and a composition comprising the same | |
AU754264B2 (en) | Chlamydia trachomatis genomic sequence and polypeptides, fragments thereof and uses thereof, in particular for the diagnosis, prevention and treatment of infection | |
AU762606B2 (en) | Chlamydia pneumoniae genomic sequence and polypeptides, fragments thereof and uses thereof, in particular for the diagnosis, prevention and treatment of infection | |
WO1997043303A1 (en) | Novel compounds | |
JPH09322781A (en) | Staphylococcus aureus polynucleotide and sequence | |
EP0804616A2 (en) | Specific and universal probes and amplification primers to rapidly detect and identify common bacterial pathogens and antibiotic resistance genes from clinical specimens for routine diagnosis in microbiology laboratories | |
AU2015327511B2 (en) | Biomarkers for rheumatoid arthritis and usage thereof | |
CA2378687A1 (en) | Nucleotide sequences of moraxella catarrhalis genome | |
EP1009859A1 (en) | Lyme disease vaccines | |
EP1109911A2 (en) | Staphylococcus aureus genes and polypeptides | |
AU770915B2 (en) | Enterococcus faecalis polynucleotides and polypeptides | |
AU777190B2 (en) | Streptococcus pneumoniae polynucleotides and sequences | |
AU2021240230B2 (en) | Vaccines and vaccine components for inhibition of microbial cells | |
AU713692B2 (en) | Nucleic acid and amino acid sequences relating to helicobacter pylori for therapeutics | |
AU710880B2 (en) | Nucleic acid and amino acid sequences relating to helicobacter pylori for diagnostics and therapeutics | |
AU3796099A (en) | Assays using nucleic acid and amino acid sequences relating to helicobacter pylori | |
AU1546202A (en) | Enterococcus faecalis polynucleotides and polypeptides | |
CA2388445A1 (en) | Genetic markers, metabolic markers, and methods for evaluating pathogenicity of strains of e. coli |