AU5588801A - Enterococcus faecalis polynucleotides and polypeptides - Google Patents

Enterococcus faecalis polynucleotides and polypeptides Download PDF

Info

Publication number
AU5588801A
AU5588801A AU55888/01A AU5588801A AU5588801A AU 5588801 A AU5588801 A AU 5588801A AU 55888/01 A AU55888/01 A AU 55888/01A AU 5588801 A AU5588801 A AU 5588801A AU 5588801 A AU5588801 A AU 5588801A
Authority
AU
Australia
Prior art keywords
polypeptide
faecalis
polypeptides
amino acid
sequence
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
AU55888/01A
Other versions
AU770915B2 (en
Inventor
Camella Bailey
Gil H. Choi
Alex Hromockyj
Charles A. Kunsch
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Human Genome Sciences Inc
Original Assignee
Human Genome Sciences Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from AU72793/98A external-priority patent/AU735444B2/en
Application filed by Human Genome Sciences Inc filed Critical Human Genome Sciences Inc
Priority to AU55888/01A priority Critical patent/AU770915B2/en
Publication of AU5588801A publication Critical patent/AU5588801A/en
Application granted granted Critical
Publication of AU770915B2 publication Critical patent/AU770915B2/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Description

P/00/011 28/5/91 Regulation 3.2
AUSTRALIA
Patents Act 1990
ORIGINAL
COMPLETE SPECIFICATION STANDARD PATENT r Name of Applicant: Actual Inventors: Address for service is: Human Genome Sciences, Inc.
Charles A KUNSCH Gil H CHOI Camella BAILEY Alex HROMOCKYJ WRAY ASSOCIATES 239 Adelaide Terrace Perth, WA 6000 Attorney code: WR Invention Title: "Enterococcus faecalis Polynucleotides and Polypeptides" This application is a divisional application by virtue of Section 39 of Australian Patent Application 72793/98 filed on 4 May 1998.
The following statement is a full description of this invention, including the best method of performing it known to me:- 1/2 Enterococcus faecalis polynucleotides and polypeptides Field of the Invention The present invention relates to novel Enterococcusfaecalis genes faecalis) nucleic acids and polypeptides. Also provided are vectors, host cells and recombinant methods for producing the same. Further provided are diagnostic methods for detecting Enterococcusfaecalis using probes, primers, and antibodies to the E. faecalis nucleic acids and polypeptides of the present invention. The invention further relates to screening methods for identifying agonists and antagonists of E. faecalis i( polypeptide activity and to vaccines using E.faecalis nucleic acids and polypeptides.
Background of the Invention Enterococci have been recognized as being pathogenic for humans since the turn of the century when they were first described by Thiercelin in 1988 as microscopic organisms. The genus Enterococcus includes the species Enterococcus faecalis or E. faecalis which is the most common pathogen in the group, accounting for 80 90 percent of all enterococcal infections. See Lewis et al. (1990) Eur J. Clin Microbiol Infect Dis.9: 111-117.
*.o The incidence of enterococcal infections has increased in recent years and enterococci are now the second most frequently reported nosocomial pathogens.
Enterococcal infection is of particular concern because of its resistance to antibiotics.
Recent attention has focused on enterococci not only because of their increasing role in nosocomial infections, but also because of their remarkable and increasing resistance to antimicrobial agents. These factors are mutually reinforcing since resistance allows enterococci to survive in an environment in which antimicrobial agents are heavily used; the hospital setting provides the antibiotics which eliminate or suppress susceptible bacteria, thereby providing a selective advantage for resistant organisms, and the hospital also provides the potential for dissemination of resistant enterococci via the usual routes of hand and environmental contamination.
Antimicrobial resistance can be divided into two general types, inherent or intrinsic property and that which is acquired. The genes for intrinsic resistance, like other species characteristics, appear to reside on the chromosome. Acquired resistance results from either a mutation in the existing DNA or acquisition of new DNA. The various inherent traits expressed by enterococci include resistance to semisynthetic penicillinase-resistant penicillins, cephalosporins, low levels of aminoglycosides, and low levels of clindamycin. Examples of acquired resistance include resistance to chloramphenicol, erythromycin, high levels of clindamycin, tetracycline, high levels of aminoglycosides, penicillin by means of penicillinase, fluoroquinolones, and vancomycin. Resistance to high levels of penicillin without penicillinase and resistance to fluoroquinolones are not known to be plasmid or transposon mediated and presumably are due to mutation(s).
Although the main reservoir for enterococci in humans is the gastrointestinal tract, the bacteria can also reside in the gallbladder, urethra and vagina.
15 E. faecalis has emerged as an important pathogen in endocarditis, bacteremia, urinary tract infections (UTIs), intraabdominal infections, soft tissue infections, and neonatal sepsis. See Lewis et al. (1990) supra.. In the 1970s and 1980s enterococci became firmly established as major nosocomial pathogens. They are now the fourth Sleading cause of hospital-acquired infection and the third leading cause of bacteremia in the United States. Fatality ratios for enterococcal bactermia range from 12% to 68%, with death due to enterococcal sepsis in 4 to 50% of these cases. See T.G. Emori (1993) Clin. Microbiol. Rev. 6:428-442.
The ability of enterococci to colonize the gastrointestinal tract, plus the many intrinsic and acquired resistance traits, means that these organisms, which usually seem to have relatively low intrinsic virulence, are given an excellent opportunity to become secondary invaders. Since nosocomial isolates of enterococci have displayed resistance to essentially every useful antimicrobial agent, it will likely become increasingly difficult to successfully treat and control enterococcal infections.
Particularly when the various resistance genes come together in a single strain, an event almost certain to occur at some time in the future.
The etiology of diseases mediated or exacerbated by Enterococcus faecalis, involves the programmed expression of E.faecalis genes, and that characterizing these genes and their patterns of expression would dramatically add to our understanding of the organism and its host interactions. Knowledge of the E. faecalis gene and genomic organization would improve our understanding of disease etiology and lead to improved and new ways of preventing, treating and diagnosing diseases. Thus, there is a need to characterize the genome of E.faecalis and for polynucleotides of this organism.
S.Summary of the Invention SThe present invention provides for isolated E. faecalis polynucleotides and polypeptides shown in Table 1 and SEQ ID NO: 1 through SEQ ID NO:496 15 (polynucleotide sequences having odd SEQ ID NOs and polypeptide sequences having even SEQ ID NOs). One aspect of the invention provides isolated nucleic acid molecules comprising polynucleotides having a nucleotide sequence selected from the group consisting of: a nucleotide sequence shown in Table 1; a nucleotide S" sequence encoding any of the amino acid sequences of the polypeptides shown in o Table 1; and a nucleotide sequence complementary to any of the nucleotide sequences in or The invention further provides for fragments of the nucleic acid molecules of(a), above.
Further embodiments of the invention include isolated nucleic acid molecules that comprise a polynucleotide having a nucleotide sequence at least 90% identical, and more preferably at least 95%, 96%, 97%, 98% or 99% identical, to any of the nucleotide sequences in or above, or a polynucleotide which hybridizes under stringent hybridization conditions to a polynucleotide in or above.
Additional nucleic acid embodiments of the invention relate to isolated nucleic acid molecules comprising polynucleotides which encode the amino acid sequences of epitope-bearing portions of a E. faecalis polypeptide having an amino acid sequence in above.
The present invention also relates to recombinant vectors, which include the isolated nucleic acid molecules of the present invention, and to host cells containing the recombinant vectors, as well as to methods of making such vectors and host cells.
The present invention further relates to the use of these vectors in the production of E. faecalis polypeptides or peptides by recombinant techniques.
The invention further provides isolated E. faecalis polypeptides having an amino acid sequence selected from the group consisting of an amino acid sequence of any of the polypeptides described in Table 1 or fragments thereof.
The polypeptides of the present invention also include polypeptides having an amino acid sequence with at least 70% similarity, and more preferably at least 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% similarity to those described in Table 1, as well as polypeptides having an amino acid sequence at least 70% identical, more 15 preferably at least 75% identical, and still more preferably 80%, 85%, 90%, 96%, 97%, 98%, or 99% identical to those above; as well as isolated nucleic acid molecules encoding such polypeptides.
The present invention further provides a single or multi-component vaccine comprising one or more of the E. faecalis polynucleotides or polypeptides described in Table 1, or fragments thereof, together with a pharmaceutically acceptable diluent, carrier, or excipient, wherein the E. faecalis polypeptide(s) are present in an amount effective to elicit an immune response to members of the Enterococcus genus, or at least E. faecalis in an animal. The E. faecalis polypeptides of the present invention may further be combined with one or more immunogens of one or more other Enterococcal or non-Enterococcal organisms to produce a multi-component vaccine intended to elicit an immunological response against members of the Enterococcus genus and, optionally, one or more non-Enterococcal organisms.
The vaccines of the present invention can be administered in a DNA form, e.g., "naked" DNA, wherein the DNA encodes one or more Enterococcal polypeptides and, optionally, one or more polypeptides of a non-Enterococcal organism. The DNA encoding one or more polypeptides may be constructed such that these polypeptides are expressed as fusion proteins.
The vaccines of the present invention may also be administered as a component of a genetically engineered organism or host cell. Thus, a genetically engineered organism or host cell which expresses one or more E. faecalis polypeptides may be administered to an animal. For example, such a genetically engineered organism or host cell may contain one or more E. faecalis polypeptides of the present invention intracellularly, on its cell surface, or in its periplasmic space. Further, such a genetically engineered organism or host cell may secrete one or more E. faecalis polypeptides. The vaccines of the present invention may also be co-administered to an animal with an immune system modulator CD86 and GM-CSF).
S. The invention also provides a method of inducing an immunological response S* in an animal to one or more members of the Enterococcus genus, preferably one or 15 more isolates of the E.faecalis species, comprising administering to the animal a vaccine as described above.
The invention further provides a method of inducing a protective immune response in an animal, sufficient to prevent, attenuate, or control an infection by members of the Enterococcus genus, preferably at least E. faecalis species, 20 comprising administering to the animal a composition comprising one or more of the polynucleotides or polypeptides described in Table 1, or fragments thereof. Further, these polypeptides, or fragments thereof, may be conjugated to another immunogen and/or administered in admixture with an adjuvant.
The invention further relates to antibodies elicited in an animal by the administration of one or more E. faecalis polypeptides of the present invention and to methods for producing such antibodies and fragments thereof. The invention further relates to recombinant antibodies and fragments thereof and to methods for producing such antibodies and fragments thereof.
The invention also provides diagnostic methods for detecting the expression of the polynucleotides of Table 1 by members of the Enterococcus genus in an animal.
One such method involves assaying for the expression of a polynucleotide encoding E. faecalis polypeptides in a sample from an animal. This expression may be assayed either directly by assaying polypeptide levels using antibodies elicited in response to amino acid sequences described in Table 1) or indirectly by assaying for antibodies having specificity for amino acid sequences described in Table The expression of polynucleotides can also be assayed by detecting the nucleic acids of Table 1. An example of such a method involves the use of the polymerase chain reaction (PCR) to amplify and detect Enterococcus nucleic acid sequences.
The present invention also relates to nucleic acid probes having all or part of a nucleotide sequence described in Table 1 (odd SEQ ID NOs) which are capable of hybridizing under stringent conditions to Enterococcus nucleic acids. The invention further relates to a method of detecting one or more Enterococcus nucleic acids in a biological sample obtained from an animal, said one or more nucleic acids encoding 15 Enterococcus polypeptides, comprising: contacting the sample with one or more of the above-described nucleic acid probes, under conditions such that hybridization occurs, and detecting hybridization of said one or more probes to the Enterococcus nucleic acid present in the biological sample.
Other uses of the polypeptides of the present invention include: inter alia, to detect E. faeca.s in immunoassays, as epitope tags, as molecular weight markers on SDS-PAGE gels, as molecular weight markers for molecular sieve gel filtration columns, to generate antibodies that specificaly bind E. faecalis polypeotides of the present invention for the detection E. faecalis in immunoassays, to generate an immune response against E. faecalis and other Enterococcs species, and as vaccines against E. faecalis, other Enterococcus species and other bacteria genuses.
Isolated nucleic acid molecules of the present invention, particularly DNA molecules, are useful as probes for gene mapping and for identifying E. faecalis in a biological samples, for instance, by Southern and Northern blot analysis.
Polynucleotides of the present invention are also useful in detecting E.faecalis by PCR using primers for a particular E. faecalis polynucleotide. Isolated polynucleotides of the present invention are also useful in making the polypeptides of the present invention.
Detailed Description The present invention relates to recombinant E. faecalis nucleic acids and fragments thereof The present invention further relates to recombinant E. faecalis polypeptides and fragments thereof. The invention also relates to methods for using these polypeptides to produce immunological responses aid to confer immunological In protection to disease caused by members of the genus Enterococcus, at least isolates of the E. faecalis genus. The invention further relates to nucleic acid sequences which encode antigenic E. faecalis polypeptides and to methods for detecting E. faecalis nucleic acids and polypeptides in biological samples. The invention also relates to antibodies specific for the polypeptides and peptides of the present invention and 15 methods for detecting such antibodies produced in a host animal.
Definitions The following definitions are provided to clarify the subject matter which the inventors consider to be the present invention.
20 As used herein, the phrase "pathogenic agent" means an agent which causes a disease state or affliction in an animal. Included within this definition, for examples, are bacteria, protozoans, fungi, viruses and metazoan parasites which either produce a disease state or render an animal infected with such an organism susceptible to a disease state a secondary infection). Further included are species and strains of the genus Enterococcus which produce disease states in animals.
As used herein, the term "organism" means any living biological system, including viruses, regardless of whether it is a pathogenic agent.
As used herein, the term "Enlerococcus" means any species or strain of bacteria which is members of the genus Enterococcus. Such species and strains are known to those of skill in the art, and include those that are pathogenic and those that are not.
As used herein, the phrase "one or more E. faecalis polypeptides of the present invention" means polypeptides comprising the amino acid sequence of one or more of the E. faecalis polypeptides described in Table 1 (even SEQ ID NOs). These polypeptides may be expressed as fusion proteins wherein the E. faecalis polypeptides of the present invention are linked to additional amino acid sequences which may be of Enterococcal or non-Enterococcal origin. This phrase further includes polypeptide comprising fragments of the E. faecalis polypeptides of the 1( present invention. Additional definitions are provided throughout the specification.
Explanation of Table 1 Table 1, below, provides information describing genes which encode polypeptides of E.faecalis. The table lists the gene identifier which consists of the 15 letters EF, which denote E.faecalis, followed imrnediately by a three digit numeric code, which arbitrarily number the E.faecalis genes of the present invention. A number from 1 through 4 follows the three digit number. A number 1 represents the full length open reading frame of the gene specified by the preceeding three digit number. A number 2 represents the full length polypeptide encoded by the gene specified the preceeding three digit number. A number 3 represents a polynucleotide fragment, of the gene represented by the preceeding three digit number, used to produce an antigenic polypeptide. A number 4 represents an antigenic polypeptide fragment, of the gene represented by the preceeding three digit number, used to stimulate an immune response or as a vaccine. The nucleotide and amino acid sequences of each gene and fragment are also shown in the Sequence Listing under the SEQ ID NO listed in Table 1.
Explanation of Table 2 Table 2 lists accession numbers for the closest matching sequences between the polypeptides of the present invention and those available through GenBank and Derwent databases. These reference numbers are the database entry numbers commonly used by those of skill in the art, who will be familar with their denominations. The descriptions of the numenclature for GenBank are available from the National Center for Biotechnology Information. Column 1 lists the gene or ORF of the present invention. Column 2 lists the accession number of a "match" gene sequence in GenBank or Derwent databases. Column 3 lists the description of the "match" gene sequence. Columns 4 and 5 are the high score and smallest sum probability, respectively, calculated by BLAST. Polypeptides of the present invention that do not share significant identity/similarity with any polypeptide sequences of GenBank and Derwent are not represented in Table 2. Polypeptides of the present invention that share significant identity/similarity with more than one of the polypeptides of GenBank and Derwent are represented more than once.
15 Explanation of Table 3.
The E. faecalis polypeptides of the present invention may include one or more S•conservative amino acid substitutions from natural mutations or human manipulation as indicated in Table 3. Changes are preferably of a minor nature, such as conservative amino acid substitutions that do not significantly affect the folding or activity of the protein. Residues from the following groups, as indicated in Table 3, may be substituted for one another: Aromatic, Hydrophobic, Polar, Basic, Acidic, and Small,
S
Explanation of Table 4 Table 4 lists residues comprising antigenic epitopes of antigenic epitopebearing fragments present in each of the full length E. faecalis polypeptides described in Table I as predicted by the inventors using the algorithm of Jameson and Wolf, (1988) Comp. Appl. Biosci. 4:181-186. The Jameson-Wolf antigenic analysis was performed using the computer program PROTEAN (Version 3.11 for the Power Macintosh, DNASTAR, Inc., 1228 South Park Street Madison, WI). E. faecalis polypeptide shown in Table 1 may one or more antigenic epitopes comprising residues described in Table 4. It will be appreciated that depending on the analytical criteria used to predict antigenic determinants, the exact address of the determinant may vary slightly. The residues and locations shown described in Table 4 correspond to the amino acid sequences for each full length gene sequence shown in Table I and in the Sequence Listing. Polypeptides of the present invention that do not have antigenic epitopes recognized by the Jameson-Wolf algorithm are not represented in Table 2.
Selection of Nucleic Acid Sequences Encoding Antigenic E. faecalis Polypeptides Sequenced E. faecalis genomic DNA was obtained from the E. faecalis strain V586. The E. faecalis strain V586 was deposited 2 May 1997 at the ATCC, 10801 University Blvd. Manassas, VA 20110-2209, and given accession number 55969.
Some ORFs contained in the subset of fragments of the E. faecalis genome 15 disclosed herein were derived through the use of a number of screening criteria detailed below. The ORFs are bounded at the amino terminus by a methionine or valine residue and usually at the carboxy terminus by a stop codon.
*ooo Most of the selected sequences consist of complete ORFs. The polypeptides that do not comprise a complete ORF can be determined by determining whether the 20 corresponding polynucleotide sequence comprises a stop codon after the codon for the last amino acid residue in the polypeptide sequence. It is not always preferred to express a complete ORF in a heterologous system. It may be challenging to express and purify a highly hydrophobic protein by common laboratory methods. Some of the polypeptide vaccine candidates described herein have been modified slightly to simplify the production of recombinant protein. For example, nucleotide sequences which encode highly hydrophobic domains, such as those found at the amino terminal signal sequence, have been excluded from some constructs used for expression of the polypeptides. Furthermore, any highly hydrophobic amino acid sequences occurring at the carboxy terminus have also been excluded from the recombinant expression constructs. Thus, in one embodiment, a polypeptide which represents a truncated or modified ORF may be used as an antigen.
While numerous methods are known in the art for selecting potentially immunogenic polypeptides, many of the ORFs disclosed herein were selected on the basis of screening Enterococcusfaecalis ORFs for several aspects of potential immnunogenicity. One set of selection criteria are as follows: 1. Type I signal sequence: An amino terminal type I signal sequence generally directs a nascent protein across the plasma and outer membranes to the exterior of the bacterial cell. Experimental evidence obtained from studies with Escherichia coli suggests that the typical type I signal sequence consists of the following biochemical and physical attributes (Izard, J. W. and Kendall, D. A. Mol. Microbiol. 13:765-773 (1994)). The length of the type I signal sequence is approximately 15 to 25 primarily hydrophobic amino acid residues with a net positive charge in the extreme amino terminus. In addition, the central region of the signal sequence adopts an alpha-helical conformation in a hydrophobic environment. Finally, the region surrounding the actual site of cleavage is ideally six residues long, with small side-chain amino acids in the -1 and -3 positions.
2. Type V signal sequence: The type IV signal sequence is an example of the several types of functional signal sequences which exist in addition to the type I signal sequence detailed above. Although functionally related, the type IV signal sequence possesses a unique set of biochemical and physical attributes (Strom, M. S. and Lory, J. Bacteriol. 174:7345-7351 (1992)). These are typically six to eight amino acids with a net basic charge followed by an additional sixteen to thirty primarily hydrophobic residues. The cleavage site of a type IV signal sequence is typically after the initial six to eight amino acids at the extreme amino terminus. In addition, type IV signal sequences generally contain a phenylalanine residue at the +1 site relative to the cleavage site.
3. Lipoprotein: Studies of the cleavage sites of twenty-six bacterial lipoprotein precursors has allowed the definition of a consensus amino acid sequence for lipoprotein cleavage. Nearly three-fourths of the bacterial lipoprotein precursors examined contained the sequence at positions -3 to relative to the point of cleavage (Hayashi, S. and Wu, H. Bioenerg. Biomembr. 22:451-471 (1990)).
4. LPXTG motif- It has been experimentally determined that most anchored proteins found on the surface of gram-positive bacteria possess a highly conserved carboxy terminal sequence. More than fifty such proteins from organisms such as S.
pyogenes, S. mutans, E. faecalis, S. pneumoniae, and others, have been identified based on their extracellular location and carboxy terminal amino 'acid sequence (Fischetti, V.
ASMNews 62:405-410 (1996)). The conserved region consists of six charged amino acids at the extreme carboxy terminus coupled to 15-20 hydrophobic amino acids presumed to function as a transmembrane domain. Immediately adjacent to the transmembrane domain is a six amino acid sequence conserved in nearly all proteins examined. The amino acid sequence of this region is L-P-X-T-G-X, where X is any 15 amino acid.
An algorithm for selecting antigenic and immunogenic Enterococcusfaecalis polypeptides including the foregoing criteria was developed. The algorithm is similar to that described in U.S. patent application 08/781,986, filed January 3, 1997, which is fully incorporated by reference herein. Use of the algorithm by the inventors to select immunologically useful Enterococcusfaecalis polypeptides resulted in the selection of a number of the disclosed ORFs. Polypeptides comprising the polypeptides identified in this group may be produced by techniques standard in the art and as further described herein.
Nucleic Acid Molecules Sequenced E. faecalis genomic DNA was obtained from the E. faecalis strainV586. As discussed elsewhere hererin, polynucleotides of the present invention readily may be obtained by routine application of well known and standard procedures for cloning and sequencing DNA. Detailed methods for obtaining libraries and for sequencing are provided below, for instance. A wide variety of Enierococcusfaecalis strains that can be used to prepare E.faecalis genomic DNA for cloning and for obtaining polynucleotides and polypeptides of the present invention. A wide variety of Enmerococcusfaecalis strains are available to the public from recognized depository institutions, such as the American Type Culture Collection (ATCC). It is recognized that minor variation is the nucleic acid and amino acid sequence may be expected from Efaecalis strain to strain. The present invention provides for genes, including both polynucleotides and polypeptides, of the of the present invention from all the Enterococcusfaecalis strains.
Unless otherwise indicated, all nucleotide sequences determined by sequencing a DNA molecule herein were determined using an automated DNA sequencer (such as the Model 373 from Applied Biosystems, Inc., Foster City, CA), and all amino acid sequences of polypeptides encoded by DNA molecules determined herein were predicted by translation of a DNA sequence determined as above. Therefore, as is known in the art for any DNA sequence determined by this automated approach, any S nucleotide sequence determined herein may contain some errors. Nucleotide sequences detennined by automation are typically at least about 90% identical, more typically at least about 95% to at least about 99.9% identical to the actual nucleotide sequence of the sequenced DNA molecule. The actual sequence can be more 20 precisely determined by other approaches including manual DNA sequencing methods well known in the art. As is also known in the art, a single insertion or deletion in a determined nucleotide sequence compared to the actual sequence will cause a frame shift in translation of the nucleotide sequence such that the predicted amino acid sequence encoded by a determined nucleotide sequence will be completely different from the amino acid sequence actually encoded by the sequenced DNA molecule, beginning at the point of such an insertion or deletion. In case of conflict between Table 1 and either the nucleic acid sequence of the clones listed in Table 1 or the amino acid sequence of the protein expressed by the clones listed in Table 1, the clones listed in Table 1 are controlling. By "nucleotide sequence" of a nucleic acid molecule or polynucleotide is intended to mean either a DNA or RNA sequence.Using the information provided herein, such as the nucleotide sequence in Table 1, a nucleic acid molecule of the present invention encoding a E. faecalis polypeptide may be obtained using standard cloning and screening procedures, such as those for cloning DNAs using genomic DNA as starting material. See, Sambrook et al. MOLECULAR CLONING: A LABORATORY MANUAL (Cold Spring Harbor, N.Y. 2nd ed.
1989); Ausubel et al., CURRENT PROTOCALS IN MOLECULAR BIOLOGY (John Wiley and Sons, N.Y. 1989). Illustrative of the invention, the nucleic acid molecule described in Table 1 was discovered in a DNA library derived from a E.
faecalis genomic DNA.
Nucleic acid molecules of the present invention may be in the form of RNA, such as mRNA, or in the form of DNA, including, for instance, DNA and genomic DNA obtained by cloning or produced synthetically. The DNA may be double-stranded or single-stranded. Single-stranded DNA or RNA may be the coding 15 strand, also known as the sense strand, or it may be the non-coding strand, also referred to as the anti-sense strand.
By "isolated" nucleic acid molecule(s) is intended a nucleic acid molecule, DNA or RNA, which has been removed from its native environment. This includes segments of DNA comprising the E. faecalis polynucleotides of the present invention isolated from the native chromosome. These fragments include both isolated fragments consisting only of E. faecalis DNA and fragments comprising heterologous sequences such as vector sequences or other foreign DNA. For example, recombinant DNA molecules contained in a vector are considered isolated for the purposes of the present invention. Further examples of isolated DNA molecules include recombinant DNA molecules maintained in heterologous host cells or purified (partially or substantially) DNA molecules in solution. Isolated RNA molecules include in vivo or in vitro RNA transcripts of the DNA molecules of the present invention. Isolated nucleic acid molecules according to the present invention further include such molecules produced synthetically.
In addition, isolated nucleic acid molecules of the invention include DNA molecules which comprise a sequence substantially different from those described above but which, due to the degeneracy of the genetic code, still encode a E. faecalis polypeptides and peptides of the present invention polypeptides of Table 1).
That is, all possible DNA sequences that encode the E. faecalis polypeptides of the present invention. This includes the genetic code and species-specific codon preferences known in the art. Thus, it would be routine for one skilled in the art to generate the degenerate variants described above, for instance, to optimize codon expression for a particular host change codons in the bacteria mRNA to those preferred by a mammalian or other bacterial host such as E. coli).
The invention further provides isolated nucleic acid molecules having the nucleotide sequence shown in Table 1 or a nucleic acid molecule having a sequence complementary to one of the above sequences. Such isolated molecules, particularly DNA molecules, are useful as probes for gene mapping and for identifying E.faecalis in a biological sample, for instance, by PCR, Southern blot, Northern blot, or other Sform of hybridization analysis.
The present invention is further directed to nucleic acid molecules encoding portions or fragments of the nucleotide sequences described herein. Fragments include portions of the nucleotide sequences of Table 1, or the E. faecalis nucleotide sequences contained in the plasimd clones listed in Table 1, at least 10 contiguous nucleotides in length selected from any two integers, one of which representing a nucleotide position and a second of which representing a 3' nucleotide position, where the first nucleotide for each nucleotide sequence in Table 1 is position 1. That is, every combination of a 5' and 3' nucleotide position that a fragment at least contiguous nucleotides in length could occupy is included in the invention. At least means a fragment may be 10 contiguous nucleotide bases in length or any integer between 10 and the length of an entire nucleotide sequence of Table 1 minus 1.
Therefore, included in the invention are contiguous fragments specified by any 5' and 3' nucleotide base positions of a nucleotide sequences of Table 1 wherein the contiguous fragment is any integer between 10 and the length of an entire nucleotide sequence minus 1.
Further, the invention includes polynucleotides comprising fragments specified by size, in nucleotides, rather than by nucleotide positions. The invention includes any fragment size, in contiguous nucleotides, selected from integers between 10 and the length of an entire nucleotide sequence minus 1. Preferred sizes of contiguous nucleotide fragments include 20 nucleotides, 30 nucleotides, 40 nucleotides, nucleotides. Other preferred sizes of contiguous nucleotide fragments, which may be useful as diagnostic probes and primers, include fragments 50-300 nucleotides in length which include, as discussed above, fragment sizes representing each integer between 50-300. Larger fragments are also useful according to the present invention corresponding to most, if not all, of the nucleotide sequences shown in Table 1or of the E. faecalis nucleotide sequences of the plasimd clones listed in Table 1. The preferred sizes are, of course, meant to exemplify not limit the present invention as all size fragments, representing any integer between 10 and the length of an entire nucleotide sequence minus 1, are included in the invention. Additional preferred nucleic acid fragments of the present invention include nucleic acid molecules encoding epitope-bearing portions of E.faecalis polypeptides identified in Table 4.
The present invention also provides for the exclusion of any fragment, specified by 5' and 3' base positions or by size in nucleotide bases as described above for any nucleotide sequence of Table 1 or the plasimd clones listed in Table 1. Any number of fragments of nucleotide sequences in Table 1 or the plasimd clones listed in Table 1, specified by 5' and 3' base positions or by size in nucleotides, as described above, may be excluded from the present invention.
In another aspect, the invention provides an isolated nucleic acid molecule comprising a polynucleotide which hybridizes under stringent hybridization conditions to a portion of a polynucleotide in a nucleic acid molecules of the invention described above, for instance, nucleotide sequences of Table 1 or the E. faecalis sequences of the plasimd clones listed in Table 1. By "stringent hybridization conditions" is intended overnight incubation at 42 0 C in a solution comprising: fornamide, 5x SSC (150 mM NaC1, 15 mM trisodium citrate), 50 mM sodium phosphate (pH 5x Denhardt's solution, 10% dextran sulfate, and 20 p-g/ml denatured, sheared salmon sperm DNA, followed by washing the filters in 0. Ix SSC at about 65 0
C.
By a polynucleotide which hybridizes to a "portion" of a polynucleotide is intended a polynucleotide (either DNA or RNA) hybridizing to at least about nucleotides bases, and more preferably ai least about 20 nucleotides bases, still more preferably at least about 30 nucleotides bases, and even more preferably about 30-70 50) nucleotides bases of the reference polynucleotide. These are useful as diagnostic probes and primers as discussed above. By a portion of a polynucleotide of "at least 20 nucleotides bases in length," for example, is intended 20 or more contiguous nucleotides bases nucleotides from the nucleotide sequence of the reference polynucleotide the nucleotide sequence as shown in Table Portions of a polynucleotide which hybridizes to a nucleotide sequence in Table 1, which can be Sused as probes and primers, may also be precisely specified by 5' and 3' base positions or by size in nucleotide bases as described above or precisely excluded in the same manner.
The nucleic acid molecules of the present invention include those encoding the full length E. faecalis polypeptides of Table 1 and portions of the E. faecalis polypeptides of Table 1. Also included in the present invention are nucleic acids encoding the above full length sequences and further comprise additional sequences, such as those encoding an added secretory leader sequence, such as a pre-, or pro- or prepro- protein sequence. Further included in the present invention are nucleic acids encoding the above full length sequences and portions thereof and further comprise additional heterologous amino acid sequences encoded by nucleic acid sequences from a different source.
Also included in the present invention are nucleic acids encoding the above protein sequences together with additional, non-coding sequences, including for example, but not limited to non-coding 5' and 3' sequences. These sequences include transcribed, non-translated sequences that may play a role in transcription, and mRNA processing, for example, ribosome binding and stability of mRNA. Also included in the present invention are additional coding sequences which provide additional functionalities.
Thus, a nucleotide sequence encoding a polypeptide may be fused to a marker sequence, such as a sequence encoding a peptide which facilitates purification of the fused polypeptide. In certain preferred embodiments of this aspect of the invention, the marker amino acid sequence is a hexa-histidine peptidd, such as the tag provided in a pQE vector (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 91311), among others, many of which are commercially available. For instance, hexa-histidine provides for convenient purification of the fusion protein. See Gentz et al. (1989) Proc. Natl. Acad. Sci. 86:821-24. The "HA" tag is another peptide useful for purification which corresponds to an epitope derived from the influenza hemagglutinin protein. See Wilson et al. (1984) Cell 37:767. As discussed below, other such fusion proteins include the E. faecalis polypeptides of the present invention fused to Fc at the N- or C-terminus.
Variant and Mutant Polvnucleotides The present invention further relates to variants of the nucleic acid molecules which encode portions, analogs or derivatives of a E. faecalis polypeptides of Table 1 and variant polypeptides thereof including portions, analogs, and derivatives of the E.
faecalis polypeptides. Variants may occur naturally, such as a natural allelic variant.
By an "allelic variant" is intended one of several alternate forms of a gene occupying a given locus on a chromosome of an organism. See, B. Lewin, Genes IV (1990).
Non-naturally occurring variants may be produced using art-known mutagenesis techniques.
Such nucleic acid variants include those produced by nucleotide substitutions, deletions, or additions. The substitutions, deletions, or additions may involve one or more nucleotides. The variants may be altered in coding regions, non-coding regions, or both. Alterations in the coding regions may produce conservative or non-conservative amino acid substitutions, deletions or additions. Especially preferred among these are silent substitutions, additions and deletions, which do not alter the properties and activities of a E. faecalis protein of the present invention or portions thereof. Also especially preferred in this regard are conservative substitutions.
Such polypeptide variants include those produced by amino acid substitutions, deletions or additions. The substitutions, deletions, or additions may involve one or more residues. Alterations may produce conservative or non-conservative amino acid substitutions, deletions, or additions. Especially preferred among these are silent substitutions, additions and deletions, which do not alter the properties and activities of a E. faecalis protein of the present invention or portions thereof. Also especially preferred in this regard are conservative substitutions.
The present invention also relates to recombinant vectors, which include the isolated nucleic acid molecules of the present invention, and to host cells containing the recombinant vectors, as well as to methods of making such vectors and host cells S* and for using them for production ofE. faecalis polypeptides or peptides by recombinant techniques.
The present application is directed to nucleic acid molecules at least 2* 95%, 96%, 97%, 98% or 99% identical to a nucleic acid sequence shown in Table 1.
The above nucleic acid sequences are included irrespective of whether they encode a polypeptide having E. faecalis activity. This is because even where a particular nucleic acid molecule does not encode a polypeptide having E. faecalis activity, one of skill in the art would still know how to use the nucleic acid molecule, for instance, as a hybridization probe. Uses of the nucleic acid molecules of the present invention that do not encode a polypeptide having E. faecalis activity include, inter alia, isolating an E. faecalis gene or allelic variants thereof from a DNA library, and detecting E. faecalis mRNA expression samples, environmental samples, suspected of containing E.
faecalis by Northern Blot analysis.
Preferred, are nucleic acid molecules having sequences at least 90%, 95%, 96%, 97%, 98% or 99% identical to the nucleic acid sequence shown in Table 1, which do, in fact, encode a polypeptide having E. faecalis protein activity By "a polypeptide having E. faecalis activity" is intended polypeptides exhibiting activity similar, but not necessarily identical, to an activity of the E. faecalis protein of the invention, as measured in a particular biological assay suitable for measuring activity of the specified protein.
Due to the degeneracy of the genetic code, one of ordinary skill in the art will immediately recognize that a large number of the nucleic acid molecules having a sequence at least 90%, 95%, 96%, 97%, 98%, or 99% identical to the nucleic acid sequences shown in Table 1 will encode a polypeptide having E. faecalis protein activity. In fact, since degenerate variants of these nucleotide sequences all encode the same polypeptide, this will be clear to the skilled artisan even without performing the above described comparison assay. It will be further recognized in the art that, for such nucleic acid molecules that are not degenerate variants, a reasonable number will also encode a polypeptide having E. faecalis protein activity. This is because the skilled artisan is fully aware of amino acid substitutions that are either less likely or *o*o not likely to significantly effect protein function replacing one aliphatic amino acid with a second aliphatic amino acid), as further described below.
The biological activity or function of the polypeptides of the present invention are expected to be similar or identical to polypeptides from other bacteria that share a high degree of structural identity/similarity. Tables 2 lists accession numbers and descriptions for the closest matching sequences of polypeptides available through Genbank and Derwent databases. It is therefore expected that the biological activity or function of the polypeptides of the present invention will be similar or identical to those polypeptides from other bacterial genuses, species, or strains listed in Table 2.
By a polynucleotide having a nucleotide sequence at least, for example, "identical" to a reference nucleotide sequence of the present invention, it is intended that the nucleotide sequence of the polynucleotide is identical to the reference sequence except that the polynucleotide sequence may include up to five point mutations per each 100 nucleotides of the reference nucleotide sequence encoding the E. faecalis polypeptide. In other words, to obtain a polynucleotide having a nucleotide sequence at least 95% identical to a reference nucleotide sequence, up to of the nucleotides in the reference sequence may be deleted, inserted, or substituted with another nucleotide. The query sequencelnmay be an entire sequence 1o shown in Table 1, the ORF (open reading frame), or any fragment specified as described herein.
As a practical matter, whether any particular nucleic acid molecule or polypeptide is at least 90%, 95%, 96%, 97%, 98% or 99% identical to a nucleotide sequence of the presence invention can be determined conventionally using known 15 computer programs. A preferred method for determining the best overall match between a query sequence (a sequence of the present invention) and a subject sequence, also referred to as a global sequence alignment, can be determined using the FASTDB computer program based on the algorithm of Brutlag et al. See Brutlag et al. (1990) Comp. App. Biosci. 6:237-245. In a sequence alignment the query and subject sequences are both DNA sequences. An RNA sequence can be compared by first converting U's to T's. The result of said global sequence alignment is in percent identity. Preferred parameters used in a FASTDB alignment of DNA sequences to calculate percent identity are: Matrix=Unitary, k-tuple=4, Mismatch Penalty=l, Joining Penalty=30, Randomization Group Length=0, Cutoff Score=l, Gap Penalty=5, Gap Size Penalty 0.05, Window Size=500 or the lenght of the subject nucleotide sequence, whichever is shorter.
If the subject sequence is shorter than the query sequence because of 5' or 3' deletions, not because of internal deletions, a manual correction must be made to the results. This is because the FASTDB program does not account for 5' and 3' truncations of the subject sequence when calculating percent identity. For subject sequences truncated at the 5' or 3' ends, relative to the query sequence, the percent identity is corrected by calculating the number of bases of the query sequence that are and 3' of the subject sequence, which are not matched/aligned, as a percent of the total bases of the query sequence. Whether a nucleotide is matched/aligned is determined by results of the FASTDB sequence alignment. This percentage is then subtracted from the percent identity, calculated by the above FASTDB program using the specified parameters, to arrive at a final percent identity score. This corrected score is what is used for the purposes of the present invention. Only nucleotides to outside the 5' and 3' nucleotides of the subject sequence, as displayed by the FASTDB alignment, which are not matched/aligned with the query sequence, are calculated for the purposes of manually adjusting the percent identity score.
For example, a 90 nucleotide subject sequence is aligned to a 100 nucleotide query sequence to determine percent identity. The deletions occur at the 5' end of the 15 subject sequence and therefore, the FASTDB alignment does not show a matched/alignment of the first 10 nucleotides at 5' end. The 10 unpaired nucleotides represent 10% of the sequence (number of nucleotides at the 5' and 3' ends not matched/total number of nucleotides in the query sequence) so 10% is subtracted from the percent identity score calculated by the FASTDB program. If the remaining nucleotides were perfectly matched the final percent identity would be 90%. In another example, a 90 nucleotide subject sequence is compared with a 100 nucleotide query sequence. This time the deletions are internal deletions so that there are no nucleotides on the 5' or 3' of the subject sequence which are not matched/aligned with the query. In this case the percent identity calculated by FASTDB is not manually corrected. Once again, only nucleotides 5' and 3' of the subject sequence which are not matched/aligned with the query sequence are manually corrected for. No other manual corrections are to made for the purposes of the present invention.
Vectors and Host Cell The present invention also relates to vectors which include the isolated DNA molecules of the present invention, host cells comprising the recombinant vectors, and the production of E. faecalis polypeptides and peptides of the present invention expressed by the host cells.
Recombinant constructs may be introduced into host cells using well known techniques such as infection, transduction, transfection, transvection, electroporation and transformation. The vector may be, for example, a phage, plasmid, viral or retroviral vector. Retroviral vectors may be replication competent or replication defective. In the latter case, viral propagation generally will occur only in complementing host cells.
The polynucleotides may be joined to a vector containing a selectable marker for propagation in a host. Generally, a plasmid vector is introduced in a precipitate, such as a calcium phosphate precipitate, or in a complex with a charged lipid. If the vector is a virus, it may be packaged in vitro using an appropriate packaging cell line 15 and then transduced into host cells.
S* Preferred are vectors comprising cis-acting control regions to the polynucleotide of interest. Appropriate trans-acting factors may be supplied by the host, supplied by a complementing vector or supplied by the vector itself upon introduction into the host.
In certain preferred embodiments in this regard, the vectors provide for specific expression, which may be inducible and/or cell type-specific. Particularly preferred among such vectors are those inducible by environmental factors that are easy to manipulate, such as temperature and nutrient additives.
Expression vectors useful in the present invention include chromosomal-, episomal- and virus-derived vectors, vectors derived from bacterial plasmids, bacteriophage, yeast episomes, yeast chromosomal elements, viruses such as baculoviruses, papova viruses, vaccinia viruses, adenoviruses, fowl pox viruses, pseudorabies viruses and retroviruses, and vectors derived from combinations thereof, such as cosmids and phagemids.
The DNA insert should be operatively linked to an appropriate promoter, such as the phage lambda PL promoter, the E. coli lac, trp and tac promoters, the early and late promoters and promoters of retroviral LTRs, to name a few.
Other suitable promoters will be known to the skilled artisan. The expression constructs will further contain sites for transcription initiation, termination and, in the transcribed region, a ribosome binding site for translation. The coding portion of the mature transcripts expressed by the constructs will preferably include a translation initiating site at the beginning and a termination codon (UAA, UGA or UAG) appropriately positioned at the end of the polypeptide to:be translated.
As indicated, the expression vectors will preferably include at least one selectable marker. Such markers include dihydrofolate reductase or neomycin resistance for eukaryotic cell culture and tetracycline, kanamycin, or ampicillin resistance genes for culturing in E. coli and other bacteria. Representative examples of appropriate hosts include, but are not limited to, bacterial cells, such as E. coli, 15 Streptomyces and Salmonella typhimurium cells; fungal cells, such as yeast cells; insect cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO, COS and Bowes melanoma cells; and plant cells. Appropriate culture mediums and conditions for the above-described host cells are known in the art.
Among vectors preferred for use in bacteria include pQE70, pQE60 and pQE9, 20 pQE 10 available from Qiagen; pBS vectors, Phagescript vectors, Bluescript vectors, pNHSA, pNH 16a, pNH18A, pNH46A available from Stratagene; pET series of vectors available from Novagen; and ptrc99a, pKK223-3, pKK233-3, pDR540, available from Pharmacia. Among preferred eukaryotic vectors are pWLNEO, pSV2CAT, pOG44, pXTI and pSG available from Stratagene; and pSVK3, pBPV, 25 pMSG and pSVL available from Pharmacia. Other suitable vectors will be readily apparent to the skilled artisan.
Among known bacterial promoters suitable for use in the present invention include the E. coli lacI and lacZ promoters, the T3, T5 and T7 promoters, the gpt promoter, the lambda PR and PL promoters and the trp promoter. Suitable eukaryotic promoters include the CMV immediate early promoter, the HSV thymidine kinase promoter, the early and late SV40 promoters, the promoters of retroviral LTRs, such as those of the Rous sarcoma virus (RSV), and metallothionein promoters, such as the mouse metallothioriein-I promoter.
Introduction of the construct into the host cell can be effected by calcium phosphate transfection, DEAE-dextran mediated transfection, cationic lipid-mediated transfection, electroporation, transduction, infection or other methods. Such methods are described in many standard laboratory manuals (for example, Davis, et al., Basic Methods In Molecular Biology (1986)).
Transcription of DNA encoding the polypeptides of the present invention by higher eukaryotes may be increased by inserting an enhancer sequence into the vector.
Enhancers are cis-acting elements of DNA, usually about from 10 to 300 nucleotides that act to increase transcriptional activity of a promoter in a given host cell-type.
'Examples of enhancers include the SV40 enhancer, which is located on the late side of 15 the replication origin at nucleotides 100 to 270, the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the replication origin, and adenovirus enhancers.
For secretion of the translated polypeptide into the lumen of the endoplasmic reticulum, into the periplasmic space or into the extracellular environment, appropriate secretion signals may be incorporated into the expressed polypeptide, for example, the amino acid sequence KDEL. The signals may be endogenous to the polypeptide or they may be heterologous signals.
The polypeptide may be expressed in a modified form, such as a fusion ***protein, and may include not only secretion signals, but also additional heterologous functional regions. For instance, a region of additional amino acids, particularly charged amino acids, may be added to the N-terminus of the polypeptide to improve stability and persistence in the host cell, during purification, or during subsequent handling and storage. Also, peptide moieties may be added to the polypeptide to facilitate purification. Such regions may be removed prior to final preparation of the polypeptide. The addition of peptide moieties to polypeptides to engender secretion or excretion, to improve stability and to facilitate purification, among others, are familiar and routine techniques in the art. A preferred fusion protein comprises a heterologous region from immunoglobulin that is useful to solubilize proteins. For example, EP-A-O 464 533 (Canadian counterpart 2045869) discloses fusion proteins comprising various portions of constant region of immunoglobulin molecules together with another human protein or part thereof. In many cases, the Fc part in a fusion protein is thoroughly advantageous for fise in therapy and diagnosis and thus results, for example, in improved pharmacokinetic properties (EP-A 0232 262). On the other to hand, for some uses it would be desirable to be able to delete the Fc part after the fusion protein has been expressed, detected and purified in the advantageous manner described. This is the case when Fc portion proves to be a hindrance to use in therapy and diagnosis, for example when the fusion protein is to be used as antigen for immunizations. In drug discovery, for example, human proteins, such as, 15 hIL5-receptor has been fused with Fc portions for the purpose of high-throughput screening assays to identify antagonists of hIL-5. See Bennett, D. et al. (1995) J.
Molec. Recogn. 8:52-58 and Johanson, K. et al. (1995) J. Biol. Chem. 270 (16):9459-9471.
The E. faecalis polypeptides can be recovered and purified from recombinant cell cultures by well-known methods including ammonium sulfate or ethanol precipitation, acid extraction, anion or cation exchange chromatography, phosphocellulose chromatography, hydrophobic interaction chromatography, affinity chromatography, hydroxylapatite chromatography, lectin chromatography and high performance liquid chromatography ("HPLC") is employed for purification.
Polypeptides of the present invention include naturally purified products, products of chemical synthetic procedures, and products produced by recombinant techniques from a prokaryotic or eukaryotic host, including, for example, bacterial, yeast, higher plant, insect and mammalian cells.
Polypeptides and Fragments The invention further provides an isolated E. faecalis polypeptide having an amino acid sequence in Table 1, or a peptide or polypeptide comprising a portion of the above polypeptides.
Variant and Mutant Polypeptides To improve or alter the characteristics of E. faecalis polypeptides of the present invention, protein engineering may be employed. Recombinant DNA technology known to those skilled in the art can be used to create novel mutant o0 proteins or muteins including single or multiple amino acid substitutions, deletions, additions, or fusion proteins. Such modified polypeptides can show, enhanced activity or increased stability. In addition, they may be purified in higher yields and show better solubility than the corresponding natural polypeptide, at least under certain purification and storage conditions.
N-Terminal and C-Terminal Deletion Mutants S. It is known in the art that one or more amino acids may be deleted from the N-terminus or C-terminus without substantial loss of biological function. For instance, Ron et al. J. Biol. Chem., 268:2984-2988 (1993), reported modified KGF 20 proteins that had heparin binding activity even if 3, 8, or 27 N-terminal amino acid residues were missing. Accordingly, the present invention provides polypeptides having one or more residues deleted from the amino terminus of the amino acid sequence of the E. faecalis polypeptides shown in Table 1, and polynucleotides encoding such polypeptides.
25 Similarly, many examples of biologically functional C-terminal deletion muteins are known. For instance, Interferon gamma shows up to ten times higher activities by deleting 8-10 amino acid residues from the carboxy terminus of the protein See, Dobeli, et al. (1988) J. Biotechnology 7:199-216. Accordingly, the present invention provides polypeptides having one or more residues from the carboxy terminus of the amino acid sequence of the E.faecalis polypeptides shown in Table 1. The invention also provides polypeptides having one or more amino acids deleted from both the amino and the carboxyl termini as described below.
The present invention is further directed to polynucleotide encoding portions or fragments of the amino acid sequences described herein as well as to portions or fragments of the isolated amino acid sequences described herein. Fragments include portions of the amino acid sequences of Table 1, are at least 5 contiguous amino acid in length, are selected from any two integers, one of which representing a N-terminal position. The initiation codon of the polypeptides of thd present inventions position 1. Every combination of a N-terminal and C-terminal position that a fragment at least contiguous amino acid residues in length could occupy, on any given amino acid sequence of Table 1 is included in the invention. At least means a fragment may be contiguous amino acid residues in length or any integer between 5 and the number of residues in a full length amino acid sequence minus 1. Therefore, included in the 15 invention are contiguous fragments specified by any N-terminal and C-terminal positions of amino acid sequence set forth in Table 1 wherein the contiguous fragment is any integer between 5 and the number of residues in a full length sequence minus 1.
Further, the invention includes polypeptides comprising fragments specified by size, in amino acid residues, rather than by N-terminal and C-terminal positions.
The invention includes any fragment size, in contiguous amino acid residues, selected from integers between 5 and the number of residues in a full length sequence minus 1.
Preferred sizes of contiguous polypeptide fragments include about 5 amino acid residues, about 10 amino acid residues, about 20 amino acid residues, about 30 amino acid residues, about 40 amino acid residues, about 50 amino acid residues, about 100 amino acid residues, about 200 amino acid residues, about 300 amino acid residues, and about 400 amino acid residues. The preferred sizes are, of course, meant to exemplify, not limit, the present invention as all size fragments representing any integer between 5 and the number of residues in a full length sequence minus 1 are included in the invention. The present invention also provides for the exclusion of any fragments specified by N-terminal and C-terminal positions or by size in amino acid residues as described above. Any number of fragments specified by N-terminal and C-terminal positions or by size in amino acid residues as described above may be excluded.
The above fragments need not be active since they would be useful, for example, in immunoassays, in epitope mapping, epitope tagging, to generate antibodies to a particular portion of the protein, as vaccines, and as molecular weight markers.
Other Mutants In addition to N- and C-terminal deletion forms of the protein discussed above, it also will be recognized by one of ordinary skill in the art that some amino acid sequences of the E. faecalis polypeptide can be varied without significant effect of the structure or function of the protein. If such differences in sequence are contemplated, 15 it should be remembered that there will be critical areas on the protein which Sdetermine activity.
Thus, the invention further includes variations of the E. faecalis polypeptides which show substantial E. faecalis polypeptide activity or which include regions of E.
°faecalis protein such as the protein portions discussed below. Such mutants include deletions, insertions, inversions, repeats, and type substitutions selected according to general rules known in the art so as to have little effect on activity. For example, guidance concerning how to make phenotypically silent amino acid substitutions is provided. There are two main approaches for studying the tolerance of an amino acid sequence to change. See, Bowie, J. U. et al. (1990), Science 247:1306-1310. The first method relies on the process of evolution, in which mutations are either accepted or rejected by natural selection. The second approach uses genetic engineering to introduce amino acid changes at specific positions of a cloned gene and selections or screens to identify sequences that maintain functionality.
These studies have revealed that proteins are surprisingly tolerant of amino acid substitutions. The studies indicate which amino acid changes are likely to be permissive at a certain position of the protein. For example, most buried amino acid residues require nonpolar side chains, whereas few features of surface side chains are generally conserved. Other such phenotypically silent substitutions are described by Bowie et al. (supra) and the references cited therein. Typically seen as conservative substitutions are the replacements, one for another, among the aliphatic amino acids Ala, Val, Leu and Ile; interchange of the hydroxyl residues Ser and Thr, exchange of the acidic residues Asp and Glu, substitution between the amide residues Asn and Gin, exchange of the basic residues Lys and Arg and replacements among the aromatic residues Phe, Tyr.
Thus, the fragment, derivative, analog, or homolog of the polypeptide of Table 1, or that encoded by the plaimds listed in Table 1, may be: one in which one or more of the amino acid residues are substituted with a conserved or non-conserved amino acid residue (preferably a conserved amino acid residue) and such substituted 15 amino acid residue may or may not be one encoded by the genetic code: or (ii) one in which one or more of the amino acid residues includes a substituent group: or (iii) one in which the E. faecalis polypeptide is fused with another compound, such as a compound to increase the half-life of the polypeptide (for example, polyethylene glycol): or (iv) one in which the additional amino acids are fused to the above form of 9 the polypeptide, such as an IgG Fc fusion region peptide or leader or secretory sequence or a sequence which is employed for purification of the above form of the polypeptide or a proprotein sequence. Such fragments, derivatives and analogs are deemed to be within the scope of those skilled in the art from the teachings herein.
Thus, the E. faecalis polypeptides of the present invention may include one or more amino acid substitutions, deletions, or additions, either from natural mutations or human manipulation. As indicated, changes are preferably of a minor nature, such as conservative amino acid substitutions that do not significantly affect the folding or activity of the protein (see Table 3).
Amino acids in the E. faecalis proteins of the present invention that are essential for function can be identified by methods known in the art, such as sitedirected mutagenesis or alanine-scanning mutagenesis. See, Cunningham et al.
(1989) Science 244:1081-1085. The latter procedure introduces single alanine mutations at every residue in the molecule. The resulting mutant molecules are then tested for biological activity using assays appropriate for measuring the finction of the particular protein.
Of special interest are substitutions of charged amino acids with other charged or neutral amino acids which may produce proteins with highly desirable improved characteristics, such as less aggregation. Aggregation may not only reduce activity but also be problematic when preparing pharmaceutical formulations, because aggregates can be immunogenic. See, Pinckard et al., (1967) Clin. Exp. Immunol. 2:331-340; Robbins, et al., (1987) Diabetes 36:838-845; Cleland, et al., (1993) Crit. Rev.
Therapeutic Drug Carrier Systems 10:307-377.
The polypeptides of the present invention are preferably provided in an isolated form, and preferably are substantially purified. A recombinantly produced version of the E. faecalis polypeptide can be substantially purified by the one-step method described by Smith et al. (1988) Gene 67:31-40. Polypeptides of the invention also can be purified from natural or recombinant sources using antibodies .,*directed against the polypeptides of the invention in methods which are well known in the art of protein purification.
°oo ~The invention further provides for isolated E. faecalis polypeptides comprising an amino acid sequence selected from the group consisting of: the amino acid sequence of a full-length E. faecalis polypeptide having the complete amino acid sequence shown in Table 1; the amino acid sequence of a full-length E.
faecalis polypeptide having the complete amino acid sequence shown in Table I excepting the N-terminal methionine; the complete amino acid sequence encoded by the plaimds listed in Table 1; and the complete amino acid sequence excepting the N-terminal methionine encoded by the plaimds listed in Table 1. The polypeptides of the present invention also include polypeptides having an amino acid sequence at least 80% identical, more preferably at least 90% identical, and still more preferably 95%, 96%, 97%, 98% or 99% identical to those described in and above.
Further polypeptides of the present invention include polypeptides which have at least 90% similarity, more preferably at least 95% similarity, and still more preferably at least 96%, 97%, 98% or 99% similarity to those described above.
A further embodiment of the invention relates to a polypeptide which comprises the amino acid sequence of a E. faecalis polypeptide having an amino acid sequence which contains at least one conservative amino acid substitution, but not more than 50 conservative amino acid substitutions, not more than 40 conservative amino acid substitutions, not more than 30 conservative amino acid substitutions, and not more than 20 conservative amino acid substitutions. Also provided are polypeptides which comprise the amino acid sequence of a E. faecalis polypeptide, having at least one, but not more than 10, 9, 8, 7, 6, 5, 4, 3, 2 or 1 conservative amino acid substitutions.
By a polypeptide having an amino acid sequence at least, for example, "identical" to a query amino acid sequence of the present invention, it is intended that the amino acid sequence of the subject polypeptide is identical to the query sequence except that the subject polypeptide sequence may include up to five amino acid alterations per each 100 amino acids of the query amino acid sequence. In other words, to obtain a polypeptide having an amino acid sequence at least 95% identical to a query amino acid sequence, up to 5% of the amino acid residues in the subject sequence may be inserted, deleted, (indels) or substituted with another amino acid.
These alterations of the reference sequence may occur at the amino or carboxy terminal positions of the reference amino acid sequence or anywhere between those terminal positions, interspersed either individually among residues in the reference sequence or in one or more contiguous groups within the reference sequence.
As a practical matter, whether any particular polypeptide is at least 96%, 97%, 98% or 99% identical to, for instance, the amino acid sequences shown in Table 1 or to the amino acid sequence encoded by the plaimds listed in Table I can be detennined conventionally using known computer programs. A preferred method for determining the best overall match between a query sequence (a sequence of the present invention) and a subject sequence, also referred to as a global sequence alignment, can be determined using the FASTDB computer program based on the algorithm of Brutlag et al., (1990) Comp. App. Biosci. 6:237-245. In a sequence alignment the query and subject sequences are both amino acid sequences. The result of said global sequence alignment is in percent identity. Preferred parameters used in a FASTDB amino acid alignment are: Matrix=PAM 0, k-tuple=2, Mismatch to Penalty=l, Joining Penalty=20, Randomization Group Length=0, Cutoff Score=l, Window Size=sequence length, Gap Penalty=5, Gap Size Penalty=0.05, Window Size=500 or the length of the subject amino acid sequence, whichever is shorter.
If the subject sequence is shorter than the query sequence due to N- or Cterminal deletions, not because of internal deletions, the results, in percent identity, 15 must be manually corrected. This is because the FASTDB program does not account for N- and C-terminal truncations of the subject sequence when calculating global percent identity. For subject sequences truncated at the N- and C-termini, relative to the query sequence, the percent identity is corrected by calculating the number of residues of the query sequence that are N- and C-terminal of the subject sequence, which are not matched/aligned with a corresponding subject residue, as a percent of the total bases of the query sequence. Whether a residue is matched/aligned is determined by results of the FASTDB sequence alignment. This percentage is then subtracted from the percent identity, calculated by the above FASTDB program using the specified parameters, to arrive at a final percent identity score. This final percent identity score is what is used for the purposes of the present invention. Only residues to the N- and C-termini of the subject sequence, which are not matched/aligned with the query sequence, are considered for the purposes of manually adjusting the percent identity score. That is, only query amino acid residues outside the farthest N- and C-terminal residues of the subject sequence.
For example, a 90 amino acid residue subject sequence is aligned with a 100 residue query sequence to determine percent identity. The deletion occurs at the Nterminus of the subject sequence and therefore, the FASTDB alignment does not match/align with the first 10 residues at the N-terminus. The 10 unpaired residues represent 10% of the sequence (number of residues at the N- and C- termini not matched/total number of residues in the query sequence) so 10% is subtracted from the percent identity score calculated by the FASTDB program. If the remaining residues were perfectly matched the final percent identity would be 90%. In another example, a 90 residue subject sequence is compared with a 100 residue query sequence. This time the deletions are internal so there are no residues at the N- or Ctermini of the subject sequence which are not matched/aligned with the query. In this case the percent identity calculated by FASTDB is not manually corrected. Once again, only residue positions outside the N- and C-terminal ends of the subject sequence, as displayed in the FASTDB alignment, which are not matched/aligned 15 with the query sequence are manually corrected. No other manual corrections are to made for the purposes of the present invention.
The above polypeptide sequences are included irrespective of whether they have their normal biological activity. This is because even where a particular polypeptide molecule does not have biological activity, one of skill in the art would 20 still know how to use the polypeptide, for instance, as a vaccine or to generate antibodies. Other uses of the polypeptides of the present invention that do not have E. faecalis activity include, inter alia, as epitope tags, in epitope mapping, and as molecular weight markers on SDS-PAGE gels or on molecular sieve gel filtration columns using methods known to those of skill in the art.
25 As described below, the polypeptides of the present invention can also be used to raise polyclonal and monoclonal antibodies, which are useful in assays for detecting E. faecalis protein expression or as agonists and antagonists capable of enhancing or inhibiting E. faecalis protein function. Further, such polypeptides can be used in the yeast two-hybrid system to "capture" E. faecalis protein binding proteins which are also candidate agonists and antagonists according to the present invention.
See, Fields et al. (1989) Nature 340:245-246.
Epilope-Bearing Portions In another aspect, the invention provides peptides and polypeptides comprising epitope-bearing portions of the E. faecalis polypeptides of the present invention. These epitopes are immunogenic or antigenic epitopes of the polypeptides of the present invention. An "immunogenic epitope" is defined as a part of a protein that elicits an antibody response when the whole protein or polypeptide is the immunogen. These immunogenic epitopes are believed to be confined to a few loci on the molecule. On the other hand, a region of a protein molecule to which an antibody can bind is defined as an "antigenic determinant" or "antigenic epitope." The number of immunogenic epitopes of a protein generally is less than the number of antigenic epitopes. See, Geysen, et al. (1983) Proc. Natl. Acad. Sci. USA 81:3998- 4002.
t15 Predicted antigenic epitopes are shown in Table 4, below. It is pointed out that Table 4 only lists amino acid residues comprising epitopes predicted to have the highest degree of antigenicity. The polypeptides not listed in Table 4 and portions of polypeptides not listed in Table 4 are not considered non-antigenic. This is because they may still be antigenic in vivo but merely not recognized as such by the particular algorithm used. Thus, Table 4 lists the amino acid residues comprising preferred antigenic epitopes but not a complete list. Amino acid residues comprising other anigenic epitopes may be determined by algorithms similar to the Jameson-Wolf analysis or by in vivo testing for an antigenic response using the methods described herein or those known in the art.
As to the selection of peptides or polypeptides bearing an antigenic epitope that contain a region of a protein molecule to which an antibody can bind), it is well known in that art that relatively short synthetic peptides that mimic part of a protein sequence are routinely capable of eliciting an antiserum that reacts with the partially mimicked protein. See, Sutcliffe, et al., (1983) Science 219:660-666.
Peptides capable of eliciting protein-reactive sera are frequently represented in the primary sequence of a protein, can be characterized by a set of simple chemical rules, and are confined neither to immunodominant regions of intact proteins immunogenic epitopes) nor to the amino or carboxyl terminals. Peptides that are extremely hydrophobic and those of six or fewer residues generally are ineffective at inducing antibodies that bind to the mimicked protein; longer, peptides, especially those containing proline residues, usually are effective. See, Sutcliffe, et al., supra, p.
661. For instance, 18 of 20 peptides desig ed according to these guidelines, containing 8-39 residues covering 75% of the sequence of the influenza virus hemagglutinin HAl polypeptide chain, induced antibodies that reacted with the HAl protein or intact virus; and 12/12 peptides from the MuLV polymerase and 18/18 from the rabies glycoprotein induced antibodies that precipitated the respective proteins.
Antigenic epitope-bearing peptides and polypeptides of the invention are therefore useful to raise antibodies, including monoclonal antibodies, that bind 15 specifically to a polypeptide of the invention. Thus, a high proportion of hybridomas obtained by fusion of spleen cells from donors immunized with an antigen epitope-bearing peptide generally secrete antibody reactive with the native protein.
See Sutcliffe, et al., supra, p. 663. The antibodies raised by antigenic epitope-bearing peptides or polypeptides are useful to detect the mimicked protein, and antibodies to different peptides may be used for tracking the fate of various regions of a protein precursor which undergoes post-translational processing. The peptides and *oo anti-peptide antibodies may be used in a variety of qualitative or quantitative assays for the mimicked protein, for instance in competition assays since it has been shown that even short peptides about 9 amino acids) can bind and displace the larger 25 peptides in immunoprecipitation assays. See, Wilson, et al., (1984) Cell 37:767-778. The anti-peptide antibodies of the invention also are useful for purification of the mimicked protein, for instance, by adsorption chromatography using methods known in the art.
Antigenic epitope-bearing peptides and polypeptides of the invention designed according to the above guidelines preferably contain a sequence of at least seven, more preferably at least nine and most preferably between about 10 to about amino acids any integer between 7 and 50) contained within the amino acid sequence of a polypeptide of the invention. However, peptides or polypeptides comprising a larger portion of an amino acid sequence of a polypeptide of the invention, containing about 50 to about 100 amino acids, or any length up to and including the entire amino acid sequence of a polypeptide of the invention, also are considered epitope-bearing peptides or polypeptides of the invention and also are useful for inducing antibodies that react with the mimicked protein. Preferably, the amino acid sequence of the epitope-bearing peptide is selected to provide substantial solubility in aqueous solvents the sequence includes relatively hydrophilic residues and highly hydrophobic sequences are preferably avoided); and sequences containing proline residues are particularly preferred.
Non-limiting examples of antigenic polypeptides or peptides that can be used 15 to generate an enterococcal-specific immune response or antibodies include portions of the amino acid sequences identified in Table 1. More specifically, Table 4 discloses a list of non-limiting residues that are involved in the antigenicity of the epitope-bearing fragments of the present invention. Therefore, the present inventions provides for isolatd and purified antigenic epitope-bearing fragements of the polypeptides of the present invention comprising a peptide sequences of Table 4. The antigenic epitopebearing fragments comprising a peptide sequence of Table 4 preferably contain a sequence of at least seven, more preferably at least nine and most preferably between about 10 to about 50 amino acids any integer between 7 and 50) of a polypeptide oo of the present invention. That is, included in the present invention are antigenic polypeptides between the integers of 7 and 50 amino acid in length comprising one or more of the sequences of Table 4. Therefore, in most cases, the polypeptides of Table 4 make up only a portion of the antigenic polypeptide. All combinations of sequences between the integers of 7 and 50 amino acid in length comprising one or more of the sequences of Table 4 are included. The antigenic epitope-bearing fragements may be specified by either the number of contiguous amino acid residues or by specific N-tenninal and C-terminal positions as described above for the polypeptide fragements of the present invention, wherein the initiation codon is residue 1. Any number of the described antigenic epitope-bearing fragements of the present invention may also be excluded from the present invention in the same manner.
The epitope-bearing peptides and polypeptides of the invention may be produced by any conventional means for making peptides or polypeptides including recombinant means using nucleic acid molecules of the invention. For instance, an epitope-bearing amino acid sequence of the present invention may be fused to a larger polypeptide which acts as a carrier during recombinant production and purification, as well as during immunization to produce anti-peptide antibodies. Epitope-bearing peptides also may be synthesized using known methods of chemical synthesis. For instance, Houghten has described a simple method for synthesis of large numbers of peptides, such as 10-20 mg of 248 different 13 residue peptides representing single amino acid variants of a segment of the HA1 polypeptide which were prepared and characterized (by ELISA-type binding studies) in less than four weeks (Houghten, R.
A. Proc. Natl. Acad. Sci. USA 82:5131-5135 (1985)). This "Simultaneous Multiple •Peptide Synthesis (SMPS)" process is further described in U.S. Patent No. 4,631,211 to.Houghten and coworkers (1986). In this procedure the individual resins for the solid-phase synthesis of various peptides are contained in separate solvent-permeable packets, enabling the optimal use of the many identical repetitive steps involved in solid-phase methods. A completely manual procedure allows 500-1000 or more syntheses to be conducted simultaneously (Houghten et al. (1985) Proc. Natl. Acad.
Sci. 82:5131-5135 at 5134.
Epitope-bearing peptides and polypeptides of the invention are used to induce antibodies according to methods well known in the art. See, Sutcliffe, et al., supra;; Wilson, et al., supra;; and Bittle, et al. (1985) J. Gen. Virol. 66:2347-2354.
Generally, animals may be immunized with free peptide; however, anti-peptide antibody titer may be boosted by coupling of the peptide to a macromolecular carrier, such as keyhole limpet hemacyanin (KLH) or tetanus toxoid. For instance, peptides containing cysteine may be coupled to carrier using a linker such as m-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS), while other peptides may be coupled to carrier using a more general linking agent such as glutaraldehyde.
Animals such as rabbits, rats and mice are immunized with either free or carrier-coupled peptides, for instance, by intraperitoneal and/or intradermal injection of emulsions containing about 100 ig peptide or carrier protein and Freund's adjuvant.
Several booster injections may be needed, for instance, at intervals of about two weeks, to provide a useful titer of anti-peptide antibody which can be detected, for example, by ELISA assay using free peptide adsorbed to a solid surface. The titer of anti-peptide antibodies in serum from an immunized animal may be increased by selection of anti-peptide antibodies, for instance, by adsorption to the peptide on a solid support and elution of the selected antibodies according to methods well known 15 in the art.
Sn. Immunogenic epitope-bearing peptides of the invention, those parts of a protein that elicit an antibody response when the whole protein is the immunogen, are identified according to methods known in the art. For instance, Geysen, el al., supra, S• discloses a procedure for rapid concurrent synthesis on solid supports of hundreds of peptides of sufficient purity to react in an ELISA. Interaction of synthesized peptides with antibodies is then easily detected without removing them from the support. In this manner a peptide bearing an immunogenic epitope of a desired protein may be identified routinely by one of ordinary skill in the art. For instance, the immunologically important epitope in the coat protein of foot-and-mouth disease virus was located by Geysen et al. supra with a resolution of seven amino acids by synthesis of an overlapping set of all 208 possible hexapeptides covering the entire 213 amino acid sequence of the protein. Then, a complete replacement set of peptides in which all 20 amino acids were substituted in turn at every position within the epitope were synthesized, and the particular amino acids conferring specificity for the reaction with antibody were determined. Thus, peptide analogs of the epitope-bearing peptides of the invention can be made routinely by this method. U.S. Patent No.
4,708,781 to Geysen (1987) further describes this method of identifying a peptide bearing an immnunogenic epitope of a desired protein.
Further still, U.S. Patent No. 5,194,392, to Geysen (1990), describes a general method of detecting or determining the sequence of monomers (amino acids or other compounds) which is a topological equivalent of the epitope a "mimotope") which is complementary to a particular paratope (antigen binding site) of an antibody of interest. More generally, U.S. Patent No. 4,433,092, also to Geysen (1989), describes a method of detecting or determining a sequence of monomers which is a topographical equivalent of a ligand which is complementary to the ligand binding site of a particular receptor of interest. Similarly, U.S. Patent No. 5,480,971 to Houghten, R. A. et al. (1996) discloses linear Ci-C 7 -alkyl peralkylated oligopeptides and sets and ,libraries of such peptides, as well as methods for using such oligopeptide sets and 15 libraries for determining the sequence of a peralkylated oligopeptide that preferentially binds to an acceptor molecule of interest. Thus, non-peptide analogs of the epitope-bearing peptides of the invention also can be made routinely by these methods. The entire disclosure of each document cited in this section on "Polypeptides and Fragments" is hereby incorporated herein by reference.
20 As one of skill in the art will appreciate, the polypeptides of the present invention and the epitope-bearing fragments thereof described above can be combined with parts of the constant domain of immunoglobulins (IgG), resulting in chimeric polypeptides. These fusion proteins facilitate purification and show an increased half-life in vivo. This has been shown, for chimeric proteins consisting of the first two domains of the human CD4-polypeptide and various domains of the constant regions of the heavy or light chains of mammalian immunoglobulins. (EPA 0,394,827; Traunecker et al. (1988) Nature 331:84-86. Fusion proteins that have a disulfide-linked dimeric structure due to the IgG part can also be more efficient in binding and neutralizing other molecules than a monomeric E. faecalis polypeptide or fragment thereof alone. See Fountoulakis et al. (1995) J. Biochem. 270:3958-3964.
Nucleic acids encoding the above epitopes of E. faecalis polypeptides can also be recombined with a gene of interest as an epitope tag to aid in detection and purification of the expressed polypeptide.
Antibodies E. faecalis protein-specific antibodies for use in the present invention can be raised against the intact E. faecalis protein or an antigenic polypeptide fragment thereof, which may be presented together with a carrier protein, such as an albumin, to an animal system (such as rabbit or mouse) or, if it is long enough (at least about amino acids), without a carrier.
As used herein, the term "antibody" (Ab) or "monoclonal antibody" (Mab) is meant to include intact molecules, single chain whole antibodies, and antibody fragments. Antibody fragments of the present invention include Fab and F(ab')2 and 15 other fragments including single-chain Fvs (scFv) and disulfide-linked Fvs (sdFv).
Also included in the present invention are chimeric and humanized monoclonal antibodies and polyclonal antibodies specific for the polypeptides of the present invention. The antibodies of the present invention may be prepared by any of a variety of methods. For example, cells expressing a polypeptide of the present 20 invention or an antigenic fragment thereof can be administered to an animal in order to induce the production of sera containing polyclonal antibodies. For example, a preparation of E. faecalis polypeptide or fragment thereof is prepared and purified to render it substantially free of natural contaminants. Such a preparation is then introduced into an animal in order to produce polyclonal antisera of greater specific 25 activity.
In a preferred method, the antibodies of the present invention are monoclonal antibodies or binding fragments thereof. Such monoclonal antibodies can be prepared using hybridoma technology. See, Harlow et al., ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory Press, 2nd ed. 1988); Hammerling, et al., in: MONOCLONAL ANTIBODIES AND T-CELL HYBRIDOMAS 563-681 (Elsevier, 1981). Fab and F(ab')2 fragments may be produced by proteolytic cleavage, using enzymes such as papain (to produce Fab fragments) or pepsin (to produce F(ab')2 fragments). Alternatively, E. faecalis polypeptide-binding fragments, chimeric, and humanized antibodies can be produced through the application of recombinant DNA technology or through synthetic chemistry using methods known in the art.
Alternatively, additional antibodies capable of binding to the polypeptide antigen of the present invention may be produced in a two-step procedure through the use of anti-idiotypic antibodies. Such a method makes use of the fact that antibodies are themselves antigens, and that, therefore, it is possible to obtain an antibody which binds to a second antibody. In accordance with this method, E. faecalis polypeptide-specific antibodies are used to immunize an animal, preferably a mouse.
The splenocytes of such an animal are then used to produce hybridoma cells, and the 15 hybridoma cells are screened to identify clones which produce an antibody whose ability to bind to the E. faecalis polypeptide-specific antibody can be blocked by the E. faecalis polypeptide antigen. Such antibodies comprise anti-idiotypic antibodies to the E. faecalis polypeptide-specific antibody and can be used to immunize an animal to induce formation of further E. faecalis polypeptide-specific antibodies.
Antibodies and fragements thereof of the present invention may be described by the portion of a polypeptide of the present invention recognized or specifically bound by the antibody. Antibody binding fragements of a polypeptide of the present invention may be described or specified in the same manner as for polypeptide fragements discussed above., i.e, by N-terminal and C-terminal positions or by size in contiguous amino acid residues. Any number of antibody binding fragments, of a polypeptide of the present invention, specified by N-terminal and C-terminal positions or by size in amino acid residues, as described above, may also be excluded from the present invention. Therefore, the present invention includes antibodies the specifically bind a particuarlly discribed fragement of a polypeptide of the present invention and allows for the exclusion of the same.
Antibodies and fragements thereof of the present invention may also be described or specified in terms of their cross-reactivity. Antibodies and fragements that do not bind polypeptides of any other species of Enterococcus other than E.
faecalis are included in the present invention. Likewise, antibodies and fragements that bind only species of Enterococcus, i.e. antibodies and fragements that do not bind bacteria from any genus other than Enterococcus, are included in the present invention.
Diagnostic Assays The present invention further relates to methods for assaying staphylococcal infection in an animal by detecting the expression of genes encoding staphylococcal polypeptides of the present invention. The methods comprise analyzing tissue or body fluid from the animal for Enterococcus-specific antibodies, nucleic acids, or 15 proteins. Analysis of nucleic acid specific to Enterococcus is assayed by PCR or hybridization techniques using nucleic acid sequences of the present invention as either hybridization probes or primers. See, Sambrook et al. Molecular cloning: A Laboratory Manual (Cold Spring Harbor Laboratory Press, 2nd ed., 1989, page 54 reference); Eremeeva et al. (1994) J. Clin. Microbiol. 32:803-810 (describing differentiation among spotted fever group Rickettsiae species by analysis of restriction fragment length polymorphism of PCR-amplified DNA) and Chen et al. 1994 J. Clin.
Microbiol. 32:589-595 (detecting B. burgdorferi nucleic acids via PCR).
Where diagnosis of a disease state related to infection with Enterococcus has already been made, the present invention is useful for monitoring progression or 25 regression of the disease state whereby patients exhibiting enhanced Enterococcus gene expression will experience a worse clinical outcome relative to patients expressing these gene(s) at a lower level.
By "biological sample" is intended any biological sample obtained from an animal, cell line, tissue culture, or other source which contains Enterococcus polypeptide, mRNA, or DNA. Biological samples include body fluids (such as saliva, blood, plasma, urine, mucus, synovial fluid, etc.) tissues (such as muscle, skin, and cartilage) and any other biological source suspected of containing Enterococcus polypeptides or nucleic acids. Methods for obtaining biological samples such as tissue are well known in the art.
The present invention is useful for detecting diseases related to Enterococcus infections in animals. Preferred animals include monkeys, apes, cats, dogs, birds, cows, pigs, mice, horses, rabbits and humans. Particularly preferred are humans.
Total RNA can be isolated from a biological sample using any suitable technique such as the single-step guanidinium-thiocyanate-phenol-chloroform method described in Chomczynski et al. (1987) Anal. Biochem. 162:156-159. mRNA encoding Enterococcus polypeptides having sufficient homology to the nucleic acid sequences identified in Table 1 to allow for hybridization between complementary sequences are then assayed using any appropriate method. These include Northern blot analysis, S 1 15 nuclease mapping, the polymerase chain reaction (PCR), reverse transcription in combination with the polymerase chain reaction (RT-PCR), and reverse transcription in combination with the ligase chain reaction (RT-LCR).
Northern blot analysis can be performed as described in Harada et al. (1990) Cell 63:303-312. Briefly, total RNA is prepared from a biological sample as described above. For the Northern blot, the RNA is denatured in an appropriate buffer (such as glyoxal/dimethyl sulfoxide/sodium phosphate buffer), subjected to agarose gel electrophoresis, and transferred onto a nitrocellulose filter. After the RNAs have been linked to the filter by a UV linker, the filter is prehybridized in a solution containing formamide, SSC, Denhardt's solution, denatured salmon sperm, SDS, and sodium 25 phosphate buffer. A E. faecalis polynucleotide sequence shown in Table 1 labeled according to any appropriate method (such as the 32 P-multiprimed DNA labeling system (Amersham)) is used as probe. After hybridization overnight, the filter is washed and exposed to x-ray film. DNA for use as probe according to the present invention is described in the sections above and will preferably at least 15 nucleotides in length.
S1 mapping can be performed as described in Fujita et al. (1987) Cell 49:357-367. To prepare probe DNA for use in Sl mapping, the sense strand of an above-described E.faecalis DNA sequence of the present invention is used as a template to synthesize labeled antisense DNA. The antisense DNA can then be digested using an appropriate restriction endonuclease to generate further DNA probes of a desired length. Such antisense probes are useful for visualizing protected bands corresponding to the target mRNA mRNA encoding Enlerococcus polypeptides).
Levels of mRNA encoding Enterococcus polypeptides arc assayed, for e.g., using the RT-PCR method described in Makino ct al. (1990) Technique 2:295-301.
By this method, the radioactivities of the "amplicons" in the polyacrylamide gel bands are linearly related to the initial concentration of the target mRNA. Briefly, this method involves adding total RNA isolated from a biological sample in a reaction 15 mixture containing a RT primer and appropriate buffer. After incubating for primer annealing, the mixture can be supplemented with a RT buffer, dNTPs, DTT, RNase inhibitor and reverse transcriptase. After incubation to achieve reverse transcription of the RNA, the RT products are then subject to PCR using labeled primers.
Alternatively, rather than labeling the primers, a labeled dNTP can be included in the 20 PCR reaction mixture. PCR amplification can be performed in a DNA thermal cycler according to conventional techniques. After a suitable number of rounds to achieve amplification, the PCR reaction mixture is electrophoresed on a polyacrylamide gel.
After drying the gel, the radioactivity of the appropriate bands (corresponding to the mRNA encoding the Enterococcus polypeptides of the present invention) are 25 quantified using an imaging analyzer. RT and PCR reaction ingredients and conditions, reagent and gel concentrations, and labeling methods are well known in the art. Variations on the RT-PCR method will be apparent to the skilled artisan. Other PCR methods that can detect the nucleic acid of the present invention can be found in PCR PRIMER: A LABORATORY MANUAL Dieffenbach et al. eds., Cold Spring Harbor Lab Press, 1995).
The polynucleotides of the present invention, including both DNA and RNA, may be used to detect polynucleotides of the present invention or Enterococcal species including E. faecalis using bio chip technology. The present invention includes both high density chip arrays (>1000 oligonucleotides per cm 2 and low density chip arrays (<1000 oligonucleotides per cm 2 Bio chips comprising arrays of polynucleotides of the present invention may be used to detect Enterococcal species, including E. faecalis, in biological and environmental samples and to diagnose an animal, including humans, with an E. faecalis or other Enterococcal infection. The bio chips of the present invention may comprise polynucleotide sequences of other pathogens including bacteria, viral, parasitic, and fungal polynucleotide sequences, in addition to the polynucleotide sequences of the present invention, for use in rapid diffenertial pathogenic detection and diagnosis. The bio chips can also be used to monitor an E. faecalis or other Enterococcal infections and to monitor the genetic 15 changes (deletions, insertions, mismatches, etc.) in response to drug therapy in the clinic and drug development in the laboratory. The bio chip technology comprising .arrays of polynucleotides of the present invention may also be used to simultaneously monitor the expression of a multiplicity of genes, including those of the present invention. The polynucleotides used to comprise a selected array may be specified in the same manner as for the fragements, i.e, by their 5' and 3' positions or length in contigious base pairs and include from. Methods and particular uses of the polynucleotides of the present invention to detect Enterococcal species, including E.
faecalis, using bio chip technology include those known in the art and those of: U.S.
Patent Nos. 5510270, 5545531, 5445934, 5677195, 5532128, 5556752, 5527681, 25 5451683, 5424186, 5607646, 5658732 and World Patent Nos. WO/9710365, WO/9511995, WO/9743447, WO/9535505, each incorporated herein in their entireties.
Biosensors using the polynucleotides of the present invention may also be used to detect, diagnose, and monitor E. faecalis or other Enterococcal species and infections thereof. Biosensors using the polynucleotides of the present invention may also be used to detect particular polynucleotides of the present invention. Biosensors using the polynucleotides of the present invention may also be used to monitor the genetic changes (deletions, insertions, mismatches, etc.) in response to drug therapy in the clinic and drug development in the laboratory. Methods and particular uses of the polynucleotides of the present invention to detect Enterococcal species, including E.
faecalis, using biosenors include those known in the art and those of: U.S. Patent Nos 5721102, 5658732, 5631170, and World Patent Nos. W097/35011, WO/9720203, each incorporated herein in their entireties.
Thus, the present invention includes both bio chips and biosensors comprising polynucleotides of the present invention and methods of their use.
Assaying Enterococcus polypeptide levels in a biological sample can occur using any art-known method, such as antibody-based techniques. For example, Enterococcus polypeptide expression in tissues can be studied with classical 15 immunohistological methods. In these, the specific recognition is provided by the 9. primary antibody (polyclonal or monoclonal) but the secondary detection system can utilize fluorescent, enzyme, or other conjugated secondary antibodies. As a result, an immunohistological staining of tissue section for pathological examination is obtained.
Tissues can also be extracted, with urea and neutral detergent, for the liberation of Enterococcus polypeptides for Western-blot or dot/slot assay. See, Jalkanen, M.
et al. (1985) J. Cell. Biol. 101:976-985; Jalkanen, M. et al. (1987) J. Cell Biol.
9* 105:3087-3096. In this technique, which is based on the use of cationic solid phases, quantitation of a Enterococcus polypeptide can be accomplished using an isolated Enterococcus polypeptide as a standard. This technique can also be applied to body S" 25 fluids.
Other antibody-based methods useful for detecting Enterococcus polypeptide gene expression include immunoassays, such as the ELISA and the radioimmunoassay (RIA). For example, a Enterococcus polypeptide-specific monoclonal antibodies can be used both as an immunoabsorbent and as an enzyme-labeled probe to detect and quantify a Enterococcus polypeptide. The amount of a Enterococcus polypeptide present in the sample can be calculated by reference to the amount present in a standard preparation using a linear regression computer algorithm. Such an ELISA is described in lacobelli et al. (1988) Breast Cancer Research and Treatment 11:19-30. In another ELISA assay, two distinct specific monoclonal antibodies can be used to detect Enterococcus polypeptides in a body fluid. In this assay, one of the antibodies is used as the immunoabsorbent and the other as the enzyme-labeled probe.
The above techniques may be conducted essentially as a "one-step" or "two-step" assay. The "one-step" assay involves contacting the Enterococcus So polypeptide with irmnobilized antibody and, without washing, contacting the mixture with the labeled antibody. The "two-step" assay involves washing before contacting the mixture with the labeled antibody. Other conventional methods may also be employed as suitable. It is usually desirable to immobilize one component of the a assay system on a support, thereby allowing other components of the system to be 15 brought into contact with the component and readily removed from the sample.
Variations of the above and other immunological methods included in the present invention can also be found in Harlow et al., ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory Press, 2nd.ed. 1988).
Suitable enzyme labels include, for example, those from the oxidase group, 20 which catalyze the production of hydrogen peroxide by reacting with substrate.
Glucose oxidase is particularly preferred as it has good stability and its substrate (glucose) is readily available. Activity of an oxidase label may be assayed by measuring the concentration of hydrogen peroxide formed by the enzyme-labeled antibody/substrate reaction. Besides enzymes, other suitable labels include 25 radioisotopes, such as iodine (1251, 21), carbon 4 sulphur tritium indium and technetium (99Tc), and fluorescent labels, such as fluorescein and rhodamine, and biotin.
Further suitable labels for the Enterococcus polypeptide-specific antibodies of the present invention are provided below. Examples of suitable enzyme labels include malate dehydrogenase, Enterococcal nuclease, delta-5-steroid isomerase, yeast-alcohol dehydrogenase, alpha-glycerol phosphate dehydrogenase, triose phosphate isomerase, peroxidase, alkaline phosphatase, asparaginase, glucose oxidase, beta-galactosidase, ribonuclease, urease, catalase, glucose-6-phosphate dehydrogenase, glucoamylase, and acetylcholine esterase.
Examples of suitable radioisotopic labels include 3 H, 11 n, 251, 131, 32 3 5
S,
14C, SICr, 57 To, 58 Co, 59 Fe, 75 Se, 52 Eu, 9 6 7 Cu, 2 17 Ci, 21 'At, 212 Pb, 47 Sc, etc.
"'In is a preferred isotope where in vivo imaging is used since its avoids the problem of dehalogenation of the 1251 or 31 1-labeled monoclonal antibody by the liver. In addition, this radionucleotide has a more favorable gamma emission energy for imaging.
See, Perkins etal. (1985) Eur. J. Nucl. Med. 10:296-301; Carasquillo et al.
(1987) J. Nucl. Med. 28:281-287. For example,'"In coupled to monoclonal antibodies with I-(P-isothiocyanatobenzyl)-DPTA has shown little uptake in non-tumors tissues, particularly the liver, and therefore enhances specificity of tumor 15 localization. See, Esteban et al. (1987) J. Nucl. Med. 28:861-870.
Examples of suitable non-radioactive isotopic labels include '57Gd, '6Dy, 52 Tr, and 56 Fe.
Examples of suitable fluorescent labels include an 5 2 Eu label, a fluorescein label, an isothiocyanate label, a rhodamine label, a phycoerythrin label, a phycocyanin label, an allophycocyanin label, an o-phthaldehyde label, and a fluorescamine label.
Examples of suitable toxin labels include, Pseudomonas toxin, diphtheria toxin, ricin, and cholera toxin.
Examples of chemiluminescent labels include a luminal label, an isoluminal label, an aromatic acridinium ester label, an imidazole label, an acridinium salt label, an oxalate ester label, a luciferin label, a luciferase label, and an aequorin label.
Examples of nuclear magnetic resonance contrasting agents include heavy metal nuclei such as Gd, Mn, and iron.
Typical techniques for binding the above-described labels to antibodies are provided by Kennedy et al. (1976) Clin. Chim. Acta 70:1-31, and Schurs et al. (1977) Clin. Chim. Acta 81:1-40. Coupling techniques mentioned in the latter are the glutaraldehyde method, the periodate method, the dimaleimide method, the m-maleimidobenzyl-N-hydroxy-succinimide ester method, all of which methods are incorporated by reference herein.
In a related aspect, the invention includes a diagnostic kit for use in screening serum containing antibodies specific against E. faecalis infection. Such a kit may include an isolated E. faecalis antigen comprising an epitope which is specifically immunoreactive with at least one anti-E.faecalis antibody. Such a kit also includes means for detecting the binding of said antibody to the antigen. In specific embodiments, the kit may include a recombinantly produced or chemically synthesized peptide or polypeptide antigen. The peptide or polypeptide antigen may be attached to a solid support.
In a more specific embodiment, the detecting means of the above-described kit includes a solid support to which said peptide or polypeptide antigen is attached.
Such a kit may also include a non-attached reporter-labeled anti-human antibody. In this embodiment, binding of the antibody to the E. faecalis antigen can be detected by binding of the reporter labeled antibody to the anti-E. faecalis polypeptide antibody.
In a related aspect, the invention includes a method of detecting E. faecalis infection in a subject. This detection method includes reacting a body fluid, preferably serum, from the subject with an isolated E.faecalis antigen, and examining the antigen for the presence of bound antibody. In a specific embodiment, the method includes a polypeptide antigen attached to a solid support, and serum is reacted with the support. Subsequently, the support is reacted with a reporter-labeled anti-human antibody. The support is then examined for the presence of reporter-labeled antibody.
The solid surface reagent employed in the above assays and kits is prepared by known techniques for attaching protein material to solid support material, such as polymeric beads, dip sticks, 96-well plates or filter material. These attachment methods generally include non-specific adsorption of the protein to the support or covalent attachment of the protein typically through a free amine group, to a chemically reactive group on the solid support, such as an activated carboxyl, hydroxyl, or aldehyde group. Alternatively, streptavidin coaled plates can be used in conjunction with biotinylated antigen(s).
The polypeptides and antibodies of the present invention, including fragments thereof, may be used to detect Enterococcal species including E.faecalis using bio chip and biosensor technology. Bio chip and biosensors of the present invention may comprise the polypeptides of the present invention to detect antibodies, which specifically recognize Enterococcal species, including E.f tecalis. Bio chip and biosensors of the present invention may also comprise antibodies which specifically recognize the polypeptides of the present invention to detect Enterococcal species, including E. faecalis or specific polypeptides of the present invention. Bio chips or biosensors comprising polypeptides or antibodies of the present invention may be used to detect Enterococcal species, including E. faecalis, in biological and 15 environmental samples and to diagnose an animal, including humans, with an E.
faecalis or other Enterococcal infection. Thus, the present invention includes both bio chips and biosensors comprising polypeptides or antibodies of the present invention and methods of their use.
The bio chips of the present invention may further comprise polypeptide 20 sequences of other pathogens including bacteria, viral, parasitic, and fungal polypeptide sequences, in addition to the polypeptide sequences of the present invention, for use in rapid diffenertial pathogenic detection and diagnosis. The bio chips of the present invention may further comprise antibodies or fragements thereof specific for other pathogens including bacteria, viral, parasitic, and fungal polypeptide sequences, in addition to the antibodies or fragements thereof of the present invention, for use in rapid diffenertial pathogenic detection and diagnosis. The bio chips and biosensors of the present invention may also be used to monitor an E. faecalis or other Enterococcal infection and to monitor the genetic changes (amio acid deletions, insertions, substitutions, etc.) in response to drug therapy in the clinic and drug development in the laboratory. The bio chip and biosensors comprising polypeptides or antibodies of the present invention may also be used to simultaneously monitor the expression of a multiplicity of polypeptides, including those of the present invention.
The polypeptides used to comprise a bio chip or biosensor of the present invention may be specified in the same manner as for the fragements, i.e, by their N-tenninal and C-terminal positions or length in contigious amino acid residue. Methods and particular uses of the polypeptides and antibodies of the present invention to detect Enterococcal species, including E.faecalis, or specific polypeptides using bio chip and biosensor technology include those known in the art, those of the U.S. Patent Nos.
and World Patent Nos. listed above for bio chips and biosensors using polynucleotides of the present invention, and those of: U.S. Patent Nos. 5658732, 5135852, 5567301, 5677196, 5690894 and World Patent Nos. W09729366, W09612957, each incorporated herein in their entireties.
Treatment: Agonists and Antagonists Assays and Molecules.
•The invention also provides a method of screening compounds to identify those which enhance or block the biological activity of the E. faecalis polypeptides of the present invention. The present invention further provides where the compounds kill or slow the growth of E.faecalis. The ability of E.faecalis antagonists, including E. faecalis ligands, to prophylactically or therapeutically block antibiotic resistance may be easily tested by the skilled artisan. See, Straden et al. (1997) J Bacteriol.
179(1):9-16.
An agonist is a compound which increases the natural biological function or which functions in a manner similar to the polypeptides of the present invention, while antagonists decrease or eliminate such functions. Potential antagonists include small organic molecules, peptides, polypeptides, and antibodies that bind to a polypeptide of the invention and thereby inhibit or extinguish its activity.
The antagonists may be employed for instance to inhibit peptidoglycan cross bridge formation. Antibodies against E. faecalis may be employed to bind to and inhibit E. faecalis activity to treat antibiotic resistance. Any of the above antagonists may be employed in a composition with a pharmaceutically acceptable carrier.
Vaccines The present invention also provides vaccines comprising one or more polypeptides of the present invention. Heterogeneity in the composition of a vaccine may be provided by combining E. faecalis polypeptides of the present invention.
Multi-component vaccines of this type are desirable becau'se they are likely to be more effective in eliciting protective immune responses against multiple species and strains of the Enterococcus genus than single polypeptide vaccines.
Multi-component vaccines are known in the art to elicit antibody production to numerous immunogenic components. See, Decker et al. (1996) J. Infect. Dis.
174:S270-275. In addition, a hepatitis B, diphtheria, tetanus, pertussis tetravalent vaccine has recently been demonstrated to elicit protective levels of antibodies in human infants against all four pathogenic agents. See, Aristegui, J. et al. (1997) Vaccine 15:7-9.
The present invention in addition to single-component vaccines includes multi-component vaccines. These vaccines comprise more than one polypeptide, 20 immunogen or antigen. Thus, a multi-component vaccine would be a vaccine comprising more than one of the E. faecalis polypeptides of the present invention.
Further within the scope of the invention are whole cell and whole viral vaccines. Such vaccines may be produced recombinantly and involve the expression of one or more of the E. faecalis polypeptides described in Table 1. For example, the E. faecalis polypeptides of the present invention may be either secreted or localized intracellular, on ihe cell surface, or in the periplasmic space. Further, when a recombinant virus is used, the E. faecalis polypeptides of the present invention may, for example, be localized in the viral envelope, on the surface of the capsid, or internally within the capsid. Whole cells vaccines which employ cells expressing heterologous proteins are known in the art. See, Robinson, K. et al. (1997) Nature Biotech. 15:653-657; Sirard, J. et al. (1997) Infect. Immun. 65:2029-2033; Chabalgoity, J. et al. (1997) Infect. Immun. 65:2402-2412 These cells may be administered live or may be killed prior to administration. Chabalgoity, J. et al., supra, for example, report the successful use in mice of a live attenuated Salmonella vaccine strain which expresses a portion of a platyhelminth fatty acid-binding protein as a fusion protein on its cells surface.
A multi-component vaccine can also be prepared using techniques known in the art by combining one or more E. faecalis polypeptides of the present invention, or fragments thereof, with additional non-Enterococcal components diphtheria toxin or tetanus toxin, and/or other compounds known to elicit an immune response).
Such vaccines are useful for eliciting protective immune responses to both members of the Enterococcus genus and non-Enterococcal pathogenic agents.
The vaccines of the present invention also include DNA vaccines. DNA 15 vaccines are currently being developed for a number of infectious diseases. See, et al., Boyer, et al. (1997) Nat. Med. 3:526-532; reviewed in Spier, R. (1996) Vaccine 14:1285-1288. Such DNA vaccines contain a nucleotide sequence encoding one or more E. faecalis polypeptides of the present invention oriented in a manner that allows for expression of the subject polypeptide. For example, the direct administration of plasmid DNA encoding B. burgdorgeri OspA has been shown to elicit protective immunity in mice against borrelial challenge. See, Luke et al. (1997) J.
Infect. Dis. 175:91-97.
"The present invention also relates to the administration of a vaccine which is co-administered with a molecule capable of modulating immune responses. Kim et al.
(1997) Nature Biotech. 15:641-646, for example, report the enhancement of immune responses produced by DNA immunizations when DNA sequences encoding molecules which stimulate the immune response are co-administered. In a similar fashion, the vaccines of the present invention may be co-administered with either nucleic acids encoding immune modulators or the immune modulators themselves.
These immune modulators include granulocyte macrophage colony stimulating factor (GM-CSF) and CD86.
The vaccines of the present invention may be used to confer resistance to Enterococcal infection by either passive or active immunization. When the vaccines of the present invention are used to confer resistance to Enterococcal infection through active immunization, a vaccine of the present invention is administered to an animal to elicit a protective immune response which either prevents or attenuates a Enterococcal infection. When the vaccines of the present invention are used to confer resistance to Enterococcal infection through passive immunization, the vaccine is provided to a host animal human, dog, or mouse), and the antisera elicited by this antisera is recovered and directly provided to a recipient suspected of having an infection caused by a member of the Enterococcus genus.
The ability to label antibodies, or fragments of antibodies, with toxin molecules provides an additional method for treating Enterococcal infections when passive immunization is conducted. In this embodiment, antibodies, or fragments of antibodies, capable of recognizing the E.faecalis polypeptides disclosed herein, or fragments thereof, as well as other Enterococcus proteins, are labeled with toxin molecules prior to their administration to the patient. When such toxin derivatized antibodies bind to Enterococcus cells, toxin moieties will be localized to these cells and 20 will cause their death.
The present invention thus concerns and provides a means for preventing or attenuating a Enterococcal infection resulting from organisms which have antigens that are recognized and bound by antisera produced in response to the polypeptides of the present invention. As used herein, a vaccine is said to prevent or attenuate a disease if its administration to an animal results either in the total or partial attenuation suppression) of a symptom or condition of the disease, or in the total or partial immunity of the animal to the disease.
The administration of the vaccine (or the antisera which it elicits) may be for either a "prophylactic" or "therapeutic" purpose. When provided prophylactically, the compound(s) are provided in advance of any symptoms of Enterococcal infection.
The prophylactic administration of the compound(s) serves to prevent or attenuate any subsequent infection. When provided therapeutically, the compound(s) is provided upon or after the detection of symptoms which indicate that an animal may be infected with a member of the Enterococcus genus. The therapeutic administration of the compound(s) serves to attenuate any actual infection. Thus, the E. faecalis polypeptides, and fragments thereof, of the present invention may be provided either prior to the onset of infection (so as to prevent or attenuate an anticipated infection) or after the initiation of an actual infection.
The polypeptides of the invention, whether encoding a portion of a native protein or a functional derivative thereof, may be administered in pure form or may be coupled to a macromolecular carrier. Example of such carriers are proteins and carbohydrates. Suitable proteins which may act as macromolecular carrier for "enhancing the immunogenicity of the polypeptides of the present invention include 15 keyhole limpet hemacyanin (KLH) tetanus toxoid, pertussis toxin, bovine serum albumin, and ovalbumin. Methods for coupling the polypeptides of the present invention to such macromolecular carriers are disclosed in Harlow et al., ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory Press, 2nd ed. 1988).
A composition is said to be "pharmacologically or physiologically acceptable" if its administration can be tolerated by a recipient animal and is otherwise suitable for administration to that animal. Such an agent is said to be administered in a "therapeutically effective amount" if the amount administered is physiologically significant. An agent is physiologically significant if its presence results in a detectable change in the physiology of a recipient patient.
While in all instances the vaccine of the present invention is administered as a pharmacologically acceptable compound, one skilled in the art would recognize that the composition of a pharmacologically acceptable compound varies with the animal to which it is administered. For example, a vaccine intended for human use will generally not be co-administered with Freund's adjuvant. Further, the level of purity of the E. faecalis polypeptides of the present invention will normally be higher when administered to a human than when administered to a non-human animal.
As would be understood by one of ordinary skill in the art, when the vaccine of the present invention is provided to an animal, it may be in a composition which may contain salts, buffers, adjuvants, or other substances which are desirable for improving the efficacy of the composition. Adjuvants are substances that can be used to specifically augment a specific immune response. These substances generally perform two functions: they protect the antigen(s) from being rapidly catabolized after administration and they nonspecifically stimulate immune responses.
Normally, the adjuvant and the composition are mixed prior to presentation to the immune system, or presented separately, but into the same site of the animal being immunized. Adjuvants can be loosely divided into several groups based upon their composition. These groups include oil adjuvants (for example, Freund's complete and 15 incomplete), mineral salts (for example, AIK(S0 4 2 AINa(S0 4 2
AINH
4 (SO4), silica, kaolin, and carbon), polynucleotides (for example, poly IC and poly AU acids), and certain natural substances (for example, wax D from Mycobacterium tuberculosis, as well as substances found in Corynebacterium parvum, or Bordetella pertussis, and members of the genus Brucella. Other substances useful as adjuvants are the saponins such as, for example, Quil A. (Superfos A/S, Denmark). Preferred adjuvants for use in the present invention include aluminum salts, such as AlK(SO 4 2 AINa(SO4)2, and S: AINH 4 (SO4). Examples of materials suitable for use in vaccine compositions are provided in REMINGTON'S PHARMACEUTICAL SCIENCES 1324-1341 (A.
Osol, ed, Mack Publishing Co, Easton, PA, (1980) (incorporated herein by reference).
The therapeutic compositions of the present invention can be administered parenterally by injection, rapid infusion, nasopharyngeal absorption (intranasopharangeally), dermoabsorption, or orally. The compositions may alternatively be administered intramuscularly, or intravenously. Compositions for parenteral administration include sterile aqueous or non-aqueous solutions, suspensions, and emulsions. Examples of non-aqueous solvents are propylene glycol, polyethylene glycol, vegetable oils such as olive oil, and injectable organic esters such as ethyl oleate. Carriers or occlusive dressings can be used to increase skin permeability and enhance antigen absorption. Liquid dosage forms for oral administration may generally comprise a liposome solution containing the liquid dosage form. Suitable forms for suspending liposomes include emulsions, suspensions, solutions, syrups, and elixirs containing inert diluents commonly used in the art, such as purified water. Besides the inert diluents, such compositions can also include adjuvants, wetting agents, emulsifying and suspending agents, or sweetening, I flavoring, or perfuming agents.
Therapeutic compositions of the present invention can also be administered in S* encapsulated form. For example, intranasal immunization using vaccines encapsulated in biodegradable microsphere composed of poly(DL-lactide-co-glycolide). See, Shahin, R. et al. (1995) Infect. Immun. 63:1195-1200. Similarly, orally administered encapsulated Salmonella typhimurium antigens can also be used. Allaoui-Attarki, K.
et al. (1997) Infect. Immun. 65:853-857. Encapsulated vaccines of the present ~invention can be administered by a variety of routes including those involving contacting the vaccine with mucous membranes intranasally, intracolonicly, intraduodenally).
20 Many different techniques exist for the timing of the immunizations when a multiple administration regimen is utilized. It is possible to use the compositions of the invention more than once to increase the levels and diversities of expression of the immunoglobulin repertoire expressed by the immunized animal. Typically, if multiple immunizations are given, they will be given one to two months apart.
According to the present invention, an "effective amount" of a therapeutic composition is one which is sufficient to achieve a desired biological effect Generally, the dosage needed to provide an effective amount of the composition will vary depending upon such factors as the animal's or human's age, condition, sex, and extent of disease, if any, and other variables which can be adjusted by one of ordinary skill in the art.
The antigenic preparations of the invention can be administered by either single or multiple dosages of an effective amount. Effective amounts of the compositions of the invention can vary from 0.01-1,000 tg/ml per dose, more preferably 0.1-500 pg/ml per dose, and most preferably 10-300 .g/ml per dose.
Examples Example 1: Isolation of a Selected DNA Clone From the Deposited Sample ofE.
faecalis I Three approaches can be used to isolate a E. faecalis clone comprising a polynucleotide of the present invention from any E.faecalis genomic DNA library.
The E. faecalis strain V586 has been deposited as a convienent source for obtaining a E. faecalis strain although a wide varity of strains E. faecalis strains can be used which are known in the art.
E. faecalis genomic DNA is prepared using the following method. A overnight bacterial culture grown in a rich medium Trypticase Soy Broth, Brain Heart Infusion broth or Super broth), pelleted, washed two times with TES Tris-pH 8.0, 25mM EDTA, 50mM NaCI), and resuspended in 5ml high salt TES NaCI). Lysostaphin is added to final concentration of approx 50ug/ml and the 20 mixture is rotated slowly 1 hour at 37C to make protoplast cells. The solution is then placed in incubator (or place in a shaking water bath) and warmed to 55C. Five hundred micro liter of 20% sarcosyl in TES (final concentration is then added to lyse the cells. Next, guanidine HCI is added to a final concentration of7M (3.69g in ml). The mixture is swirled slowly at 55C for 60-90 min (solution should clear).
A CsCI gradient is then set up in SW41 ultra clear tubes using 2.0ml 5.7M CsCI and overlaying with 2.85M CsCI. The gradient is carefully overlayed with the DNAcontaining GuHCI solution. The gradient is spun at 30,000 rpm, 20C for 24 hr and the lower DNA band is collected. The volume is increased to 5 ml with TE buffer.
The DNA is then treated with protease K (10 ug/ml) overnight at 37 C, and precipitated with ethanol. The precipitated DNA is resuspended in a desired buffer.
In the first method, a plasmid is directly isolated by screening a plasmid E.
faecalis genomic DNA library using a polynucleotide probe corresponding to a polynucleotide of the present invention. Particularly, a specific polynucleotide with 30-40 nucleotides is synthesized using an Applied Biosystems DNA synthesizer according to the sequence reported. The oligonucleotide is labeled, for instance, with 3 2 P-y-ATP using T4 polynucleotide kinase and purified according to routine methods.
(See, Maniatis et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring, NY (1982).) The library is transformed into a suitable host, as indicated above (such as XL-I Blue (Stratagene)) using techniques known to those of skill in the art. See, Sambrook et al. MOLECULAR CLONING: A LABORATORY MANUAL (Cold Spring Harbor, N.Y. 2nd ed. 1989); Ausubel et al., CURRENT PROTOCALS IN MOLECULAR BIOLOGY (John Wiley and Sons, N.Y. 1989). The transformants are plated on 1.5% agar plates (containing the 15 appropriate selection agent, ampicillin) to a density of about 150 transformants (colonies) per plate. These plates are screened using Nylon membranes according to routine methods for bacterial colony screening. See, Sambrook et al.
MOLECULAR CLONING: A LABORATORY MANUAL (Cold Spring Harbor, N.Y. 2nd ed. 1989); Ausubel et al., CURRENT PROTOCALS IN MOLECULAR BIOLOGY (John Wiley and Sons, N.Y. 1989) or other techniques known to those of skill in the art.
Alternatively, two primers of 15-25 nucleotides derived from the 5' and 3' ends of a polynucleotide of Table 1 are synthesized and used to amplify the desired DNA by PCR using a E. faecalis genomic DNA prep as a template. PCR is carried out under routine conditions, for instance, in 25 pl of reaction mixture with 0.5 ug of the above DNA template. A convenient reaction mixture is 1.5-5 mM MgCI 2 0.01% gelatin, 20 tM each of dATP, dCTP, dGTP, dTTP, 25 pmol of each primer and 0.25 Unit of Taq polymerase. Thirty five cycles of PCR (denaturation at 94C for 1 min; annealing at 55 0 C for 1 min; elongation at 72 0 C for 1 min) are performed with a Perkin-Elmer Cetus automated thermal cycler. The amplified product is analyzed by agarose gel electrophoresis and the DNA. band with expected molecular weight is excised and purified. The PCR product is verified to be the selected sequence by subcloning and sequencing the DNA product.
Finally, overlapping oligos of the DNA sequences of Table 1 can be chemically synthesized and used to generate a nucleotide sequence of desired length using PCR methods known in the art.
Example Expression and Purification Enlerococcal polypeptides in E. coli The bacterial expression vector pQE60 was used for bacterial expression of some of the polypeptide fragements used in the soft tissue and systemic infection models discussed below. (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 91311). pQE60 encodes ampicillin antibiotic resistance ("Ampr") and contains a S" bacterial origin of replication an IPTG inducible promoter, a ribosome binding site six codons encoding histidine residues that allow affinity purification using nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin (QIAGEN, Inc., supra) ~and suitable single restriction enzyme cleavage sites. These elements are arranged such that an inserted DNA fragment encoding a polypeptide expresses that polypeptide with the six His residues a "6 X His tag") covalently linked to the carboxyl terminus of that polypeptide.
The DNA sequence encoding the desired portion of a E. faecalis protein of the present invention was amplified from E. faecalis genomic DNA using PCR oligonucleotide primers which anneal to the 5' and 3' sequences coding for the portions of the E. faecalis polynucleotide shown in Table 1. Additional nucleotides containing restriction sites to facilitate cloning in the pQE60 vector are added to the and 3' sequences, respectively.
For cloning the mature protein, the 5' primer has a sequence containing an appropriate restriction site followed by nucleotides of the amino terminal coding sequence of the desired E. faecalis polynucleotide sequence in Table 1. One of ordinary skill in the art would appreciate that the point in the protein coding sequence where the 5' and 3' primers begin may be varied to amplify a DNA segment encoding any desired portion of the complete protein shorter or longer than the mature form.
The 3' primer has a sequence containing an appropriate restriction site followed by nucleotides complementary to the 3' end of the polypeptide coding sequence of Table 1, excluding a stop codon, with the coding sequence aligned with the restriction site so as to maintain its reading frame with that of the six His codons in the pQE60 vector.
The amplified E. faecalis DNA fragment and the vector pQE60 were digested with restriction enzymes which recognize the sites in the primers and the digested 0o DNAs were then ligated together. The E. faecalis DNA was inserted into the restricted pQE60 vector in a manner which places the E. faecalis protein coding region downstream from the IPTG-inducible promoter and in-frame with an initiating AUG and the six histidine codons.
The ligation mixture was transformed into competent E. coli cells using 15 standard procedures such as those described by Sambrook et al., supra.. E. coli strain M15/rep4, containing multiple copies of the plasmid pREP4, which expresses the lac repressor and confers kanamycin resistance was used in carrying out the illustrative example described herein. This strain, which was only one of many that are suitable for expressing a E. faecalis polypeptide, is available commercially 20 (QIAGEN, Inc., supra). Transformants were identified by their ability to grow on LB agar plates in the presence of ampicillin and kanamycin. Plasmid DNA was isolated from resistant colonies and the identity of the cloned DNA confirmed by restriction analysis, PCR and DNA sequencing.
Clones containing the desired constructs were grown overnight in liquid culture in LB media supplemented with both ampicillin (100 .tg/ml) and kanamycin (25 tg/ml). The O/N'culture was used to inoculate a large culture, at a dilution of approximately 1:25 to 1:250. The cells were grown to an optical density at 600 nm ("OD600") of between 0.4 and 0.6. Isopropyl-p-D-thiogalactopyranoside ("IPTG") was then added to a final concentration of 1 mM to induce transcription from the lac repressor sensitive promoter, by inactivating the lad repressor. Cells subsequently were incubated further for 3 to 4 hours. Cells then were harvested by centrifugation.
The cells were then stirred for 3-4 hours at 4 0 C in 6M guanidine-HC1, pH 8.
The cell debris was removed by centrifugation, and the supernatant containing the E.
faecalis polypeptide was loaded onto a nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin column (Q1AGEN, Inc., supra). Proteins with a 6 x His tag bind to the Ni-NTA resin with high affinity were purified in a simple one-step procedure (for details see: The QlAexpressionist, 1995, QIAGEN, Inc., supra). Briefly the supernatant was loaded onto the column in 6 M guanidine-HC1, pH 8, the column was first washed with 10 volumes of 6 M guanidine-HCl, pH 8, then washed with volumes of 6 M guanidine-HCI pH 6, and finally the E.faecalis polypeptide was eluted with 6 M guanidine-HCI, pH The purified protein was then renatured by dialyzing it against 15 phosphate-buffered saline (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM NaCI. Alternatively, the protein could be successfully refolded while immobilized on the Ni-NTA column. The recommended conditions are as follows: renature using a linear 6M-1M urea gradient in 500 mM NaCI, 20% glycerol, 20 mM Tris/HC1 pH 7.4, containing protease inhibitors. The renaturation should be performed over a period of 1.5 hours or more. After renaturation the proteins can be eluted by the addition of 250 mM immidazole. Immidazole was removed by a final dialyzing step against PBS or 50 mM sodium acetate pH 6 buffer plus 200 mM NaCI. The purified protein was stored at 40 C or frozen at -800 C.
Some of the polypeptide of the present invention were prepared using a nondenaturing protein purification method. For these polypeptides, the cell pellet from each liter of culture was resuspended in 25 mis of Lysis Buffer A at 4 0 C (Lysis Buffer A 50 mM Na-phosphate, 300 mM NaCI, 10 mM 2-mercaptoethanol, Glycerol, pH 7.5 with 1 tablet of Complete EDTA-free protease inhibitor cocktail (Boehringer Mannheim #1873580) per 50 ml of buffer). Absorbance at 550 nm was approximately 10-20 O.D./ml. The suspension was then put through three freeze/thaw cycles from -70 0 C (using a ethanol-dry ice bath) up to room temperature.
The cells were lysed via sonication in short 10 sec bursts over 3 minutes at approximately 80W while kept on ice. The sonicated sample was then centrifuged at 15,000 RPM for 30 minutes at 4°C. The supernatant was passed through a column containing 1.0 ml of CL-4B resin to pre-clear the sample of any proteins that may bind to agarose non-specifically, and the flow-through fraction was collected.
The pre-cleared flow-through was applied to a nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin column (Quiagen, Inc., supra). Proteins with a 6 X His tag to bind to the Ni-NTA resin with high affinity and can be purified in a simple one-step procedure. Briefly, the supernatant was loaded onto the column in Lysis Buffer A at 4 0 C, the column was first washed with 10 volumes of Lysis Buffer A until the A280 of the eluate returns to the baseline. Then, the column was washed with 5 volumes of S 40 mM Imidazole (92% Lysis Buffer A 8% Buffer B) (Buffer B 50 mM Na- 15 Phosphate, 300 mM NaCI, 10% Glycerol, 10 mM 2-mercaptoethanol, 500 mM Imidazole, pH of the final buffer should be The protein was eluted off of the column with a series of increasing Imidazole solutions made by adjusting the ratios of Lysis Buffer A to Buffer B. Three different concentrations were used: 3 volumes of 75 mM Imidazole, 3 volumes of 150 mM Imidazole, 5 volumes of 500 mM 20 Imidazole. The fractions containing the purified protein were analyzed using 8 or 14% SDS-PAGE depending on the protein size. The purified protein was then dialyzed 2X against phosphate-buffered saline (PBS) in order to place it into an easily o workable buffer. The purified protein was stored at 40 C or frozen at The following alternative method may be used to purify E.faecalis expressed in E coli when it is present in the form of inclusion bodies. Unless otherwise specified, all of the following steps are conducted at 4-10 0
C.
Upon completion of the production phase of the E. coli fermentation, the cell culture is cooled to 4-10 0 C and the cells are harvested by continuous centrifugation at 15,000 rpm (Heraeus Sepatech). On the basis of the expected yield of protein per unit weight of cell paste and the amount of purified protein required, an appropriate amount of cell paste, by weight, is suspended in a buffer solution containing 100 mM Tris, 50 mM EDTA, pH 7.4. The cells are dispersed to a homogeneous suspension using a high shear mixer.
The cells are then lysed by passing the solution through a microfluidizer (Microfuidics, Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate is then mixed with NaCI solution to a final concentration of 0.5 M NaCI, followed by centrifugation at 7000 x g for 15 min. The resultant pellet is washed again using NaCI, 100 mM Tris, 50 mM EDTA, pH 7.4.
o1 The resulting washed inclusion bodies are solubilized with 1.5 M guanidine hydrochloride (GuHCI) for 2-4 hours. After 7000 x g centrifugation for 15 min., the *e pellet is discarded and the E.faecalis polypeptide-containing supernatant is incubated
S.
at 4 0 C overnight to allow further GuHCl extraction.
Following high speed centrifugation (30,000 x g) to remove insoluble particles, 15 the GuHCI solubilized protein is refolded by quickly mixing the GuHCl extract with volumes of buffer containing 50 mM sodium, pH 4.5, 150 mM NaCI, 2 mM se..: EDTA by vigorous stirring. The refolded diluted protein solution is kept at 4°C without mixing for 12 hours prior to further purification steps.
To clarify the refolded E. faecalis polypeptide solution, a previously prepared tangential filtration unit equipped with 0.16 ptm membrane filter with appropriate surface area Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is employed. The filtered sample is loaded onto a cation exchange resin Poros HS- Perseptive Biosystems). The column is washed with 40 mM sodium acetate, pH and eluted with 250 mM, 500 mM, 1000 mM, and 1500 mM NaCI in the same buffer, in a stepwise manner. The absorbance at 280 mm of the effluent is continuously monitored. Fractions are collected and further analyzed by SDS-PAGE.
Fractions containing the E.faecalis polypeptide are then pooled and mixed with 4 volumes of water. The diluted sample is then loaded onto a previously prepared set of tandem columns of strong anion (Poros HQ-50, Perseptive Biosystems) and weak anion (Poros CM-20, Perseptive Biosystems) exchange resins.
The columns are equilibrated with 40 mM sodium acetate, pH 6.0. Both columns are washed with 40 mM sodium acetate, pH 6.0, 200 mM NaCI. The CM-20 column is then eluted using a 10 column volume linear gradient ranging from 0.2 M NaC1, mM sodium acetate, pH 6.0 to 1.0 M NaC1, 50 mM sodium acetate, pH Fractions are collected under constant A 28 o monitoring of the effluent. Fractions containing the E. faecalis polypeptide (determined, for instance, by 16% SDS-PAGE) are then pooled.
The resultant E. faecalis polypeptide exhibits greater than 95% purity after the above refolding and purification steps. No major contaminant bands are observed from Commassie blue stained 16% SDS-PAGE gel when 5 p-g of purified protein is loaded. The purified protein is also tested for endotoxin/LPS contamination, and typically the LPS content is less than 0.1 ng/ml according to LAL assays.
i 15 Example Alternative Expression and Purification Enterococcalpolypeptides in E.
coli Tthe vector pQE 10 was alternatively used to clone and express some of the polypeptides of the present invention for use in the soft tissue and systemic infection models discussed below. The difference being such that an inserted DNA fragment 20 encoding a polypeptide expresses that polypeptide with the six His residues a "6 X His tag") covalently linked to the amino terminus of that polypeptide. The bacterial expression vector pQE10 (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 91311) was used in this example. The components of the pQE 10 plasmid are arranged such that the inserted DNA sequence encoding a polypeptide of the present invention expresses the polypeptide with the six His residues a "6 X His tag")) covalently linked to the amino terminus.
The DNA sequences encoding the desired portions of a polypeptide of Table 1 were amplified using PCR oligonucleotide primers from genomic E. faecalis DNA.
The PCR primers anneal to the nucleotide sequences encoding the desired amino acid sequence of a polypeptide of the present invention. Additional nucleotides containing restriction sites to facilitate cloning in the pQE 10 vector were added to the 5' and 3' primer sequences, respectively.
For cloning a polypeptide of the present invention, the 5' and 3' primers were selected to amplify their respective nucleotide coding sequences. One of ordinary skill in the art would appreciate that the point in the protein coding sequence where the and 3' primers begins may be varied to amplify a DNA segment encoding any desired portion of a polypeptide of the present invention. The 5' primer was designed so the coding sequence of the 6 X His tag is aligned with the resiriction site so as to maintain its reading frame with that of E. faecalis polypeptide. The 3' was designed to include an stop codon. The amplified DNA fragment was then cloned, and the protein expressed, as described above for the pQE60 plasmid.
The DNA sequences encoding the amino acid sequences of Table 1 may also be cloned and expressed as fusion proteins by a protocol similar to that described 15 directly above, wherein the pET-32b(+) vector (Novagen, 601 Science Drive, Madison, WI 53711) is preferentially used in place of pQElO.
The above methods are not limited to the polypeptide fragements actually produced. The above method, like the methods below, can be used to produce either full length polypeptides or desired fragements therof.
Example Alternative Expression and Purification of Enterococcal polypeptides in E. coli The bacterial expression vector pQE60 is used for bacterial expression in this example (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, CA, 91311). However, in this example, the polypeptide coding sequence is inserted such that translation of the six His codons is prevented and, therefore, the polypeptide is produced with no 6 X His tag.
The DNA sequence encoding the desired portion of the E. faecalis amino acid sequence is amplified from an E. faecalis genomic DNA prep the deposited DNA clones using PCR oligonucleotide primers which anneal to the 5' and 3' nucleotide sequences corresponding to the desired portion of the E. faecalis polypeptides.
Additional nucleotides containing restriction sites to facilitate cloning in the vector are added to the 5' and 3' primer sequences.
For cloning a E. faecalis polypeptides of the present invention, 5' and 3' primers are selected to amplify their respective nucleotide coding sequences. One of ordinary skill in the art would appreciate that the point in the protein coding sequence where the 5' and 3' primers begin may be varied to amplify a DNA segment encoding any desired portion of a polypeptide of the present invention. The 3' and 5' primers contain appropriate restriction sites followed by nucleotides complementary to the and 3' ends of the coding sequence respectively. The 3' primer is additionally designed to include an in-frame stop codon.
The amplified E. faecalis DNA fragments and the vector pQE60 are digested with restriction enzymes recognizing the sites in the primers and the digested DNAs are then ligated together. Insertion of the E.faecalis DNA into the restricted vector places the E. faecalis protein coding region including its associated stop codon downstream from the IPTG-inducible promoter and in-frame with an initiating AUG.
The associated stop codon prevents translation of the six histidine codons downstream of the insertion point.
The ligation mixture is transformed into competent E. coli cells using standard procedures such as those described by Sambrook et al. E. coli strain M 15/rep4, containing multiple copies of the plasmid pREP4, which expresses the lac repressor S* and confers kanamycin resistance is used in carrying out the illustrative example described herein. This strain, which is only one of many that are suitable for expressing E. faecalis polypeptide, is available commercially (QIAGEN, Inc., supra).
Transformants are identified by their ability to grow on LB plates in the presence of ampicillin and kanamycin. Plasmid DNA is isolated from resistant colonies and the identity of the cloned DNA confirmed by restriction analysis, PCR and DNA sequencing.
Clones containing the desired constructs are grown overnight in liquid culture in LB media supplemented with both ampicillin (100 .g/ml) and kanamycin g/ml). The O/N culture is used to inoculate a large culture, at a dilution of approximately 1:25 to 1:250. The cells are grown to an optical density at 600 nm ("OD600") of between 0.4 and 0.6. isopropyl-b-D-thiogalactopyranoside ("IPTG") is then added to a final concentration of 1 mM to induce transcription from the lac repressor sensitive promoter, by inactivating the lad repressor. Cells subsequently are incubated further for 3 to 4 hours. Cells then are harvested by centrifugation.
To purify the E faecalis polypeptide, the cells are then stirred for 3-4 hours at 4 0 C in 6M guanidine-HC1, pH 8. The cell debris is removed by centrifugation, and the supernatant containing the E. faecalis polypeptide is dialyzed against 50 mM Naacetate buffer pH 6, supplemented with 200 mM NaCI. Alternatively, the protein can be successfully refolded by dialyzing it against 500 mM NaCI, 20% glycerol, mM Tris/HC1 pH 7.4, containing protease inhibitors. After renaturation the protein 15 can be purified by ion exchange, hydrophobic interaction and size exclusion chromatography. Alternatively, an affinity chromatography step such as an antibody column can be used to obtain pure E. faecalis polypeptide. The purified protein is stored at 40 C or frozen at -800 C.
The following alternative method may be used to purify E. faecalis 20 polypeptides expressed in E coli when it is present in the form of inclusion bodies.
Unless otherwise specified, all of the following steps are conducted at 4-10 0
C.
Upon completion of the production phase of the E. coli fermentation, the cell culture is cooled to 4-10°C and the cells are harvested by continuous centrifugation at 15,000 rpm (Heraeus Sepatech). On the basis of the expected yield of protein per unit weight of cell paste and the amount of purified protein required, an appropriate amount of cell paste, by weight, is suspended in a buffer solution containing 100 mM Tris, 50 mM EDTA, pH 7.4. The cells are dispersed to a homogeneous suspension using a high shear mixer.
The cells ware then lysed by passing the solution through a microfluidizer (Microfuidics, Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate is then mixed with NaCI solution to a final concentration of 0.5 M NaCI, followed by centrifugation at 7000 x g for 15 min. The resultant pellet is washed again using NaCI, 100 mM Tris, 50 mM EDTA, pH 7.4.
The resulting washed inclusion bodies are solubilized with 1.5 M guanidine hydrochloride (GuHCI) for 2-4 hours. After 7000 x g centrifugation for 15 min., the pellet is discarded and the E. faecalis polypeptide-containing supematant is incubated at 4 0 C overnight to allow further GuHCI extraction.
Following high speed centrifugation (30,000 x g) to remove insoluble particles, in the GuHCI solubilized protein is refolded by quickly mixing the GuHCI extract with volumes of buffer containing 50 mM sodium, pH 4.5, 150 mM NaCI, 2 mM EDTA by vigorous stirring. The refolded diluted protein solution is kept at 4 C without mixing for 12 hours prior to further purification steps.
To clarify the refolded E. faecalis polypeptide solution, a previously prepared tangential filtration unit equipped with 0.16 p.m membrane filter with appropriate surface area Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is employed. The filtered sample is loaded onto a cation exchange resin Poros HS- Perseptive Biosystems). The column is washed with 40 mM sodium acetate, pH 6.0 and eluted with 250 mM, 500 mM, 1000 mM, and 1500 mM NaCI in the same 20 buffer, in a stepwise manner. The absorbance at 280 mm of the effluent is continuously monitored. Fractions are collected and further analyzed by SDS-PAGE.
Fractions containing the E.faecalis polypeptide are then pooled and mixed with 4 volumes of water. The diluted sample is then loaded onto a previously prepared set of tandem columns of strong anion (Poros HQ-50, Perseptive Biosystems) and weak anion (Poros CM-20, Perseptive Biosystems) exchange resins.
The columns are equilibrated with 40 mM sodium acetate, pH 6.0. Both columns are washed with 40 mM sodium acetate, pH 6.0, 200 mM NaCI. The CM-20 column is then eluted using a 10 column volume linear gradient ranging from 0.2 M NaCI, mM sodium acetate, pH 6.0 to 1.0 M NaCI, 50 mM sodium acetate, pH Fractions are collected under constant Azso monitoring of the effluent. Fractions containing the E. faecalis polypeptide (determined, for instance, by 16% SDS-PAGE) are then pooled.
The resultant E. aecalis polypeptide exhibits greater than 95% purity after the above refolding and purification steps. No major contaminant bands are observed from Commassie blue stained 16% SDS-PAGE gel when 5 p g of purified protein is loaded. The purified protein is also tested for endotoxin/LPS contamination, and typically the LPS content is less than 0.1 ng/ml according to LAL assays.
Example Cloning and Expression of E. faecalis in Other Bacteria E.faecalis polypeptides can also be produced in: E. faecalis using the methods of S. Skinner et al., (1988) Mol. Microbiol. 2:289-297 or J. 1. Moreno (1996) Protein Expr. Purif. 8(3):332-340; Lactobacillus using the methods of C. Rush et al., 1997 Appl. Microbiol. Biotechnol. 47(5):537-542; or in Bacillus subtilis using the methods Chang et al., U.S. Patent No. 4,952,508.
Example 3: Cloning and Expression in COS Cells A E. faecalis expression plasmid is made by cloning a portion of the DNA encoding a E. faecalis polypeptide into the expression vector pDNAI/Amp or pDNAIII (which can be obtained from Invitrogen, Inc.). The expression vector pDNAI/amp contains: an E. coli origin of replication effective for propagation in E. coli and other prokaryotic cells; an ampicillin resistance gene for selection of S: plasmid-containing prokaryotic cells; an SV40 origin of replication for propagation in eukaryotic cells; a CMV promoter, a polylinker, an SV40 intron; several codons encoding a hemagglutinin fragment an "HA" tag to facilitate purification) followed by a termination codon and polyadenylation signal arranged so that a DNA can be conveniently placed under expression control of the CMV promoter and operably linked to the SV40 intron and the polyadenylation signal by means of restriction sites in the polylinker. The HA tag corresponds to an epitope derived from the influenza hemagglutinin protein described by Wilson et al. 1984 Cell 37:767.
The fusion of the HA tag to the target protein allows easy detection and recovery of the rccombinant protein with an antibody that recognizes the HA epitope. pDNAIII contains, in addition, the selectable neomycin marker.
A DNA fragment encoding a E. faecalis polypeptide is cloned into the polylinker region of the vector so that recombinant protein expression is directed by the CMV promoter. The plasmid construction strategy is as follows. The DNA from a E. faecalis genomic DNA prep is amplified using primers that contain convenient restriction sites, much as described above for construction of vectors for expression of E. faecalis in E. coli. The 5' primer contains a Kozak sequence, an AUG start codon, and nucleotides of the 5' coding region of the E. faecalis polypeptide. The 3' primer, contains nucleotides complementary to the 3' coding sequence of the E. faecalis DNA, a stop codon, and a convenient restriction site.
**The PCR amplified DNA fragment and the vector, pDNAI/Amp, are digested 15 with appropriate restriction enzymes and then ligated. The ligation mixture is transformed into an appropriate E. coli strain such as SURE T (Stratagene Cloning Systems, La Jolla, CA 92037), and the transformed culture is plated on ampicillin a media plates which then are incubated to allow growth of ampicillin resistant colonies.
Plasmid DNA is isolated from resistant colonies and examined by restriction analysis 20 or other means for the presence of the fragment encoding the E. faecalis polypeptide For expression of a recombinant E. faecalis polypeptide, COS cells are transfected with an expression vector, as described above, using DEAE-dextran, as described, for instance, by Sambrook et al. (supra). Cells are incubated under conditions for expression of E. faecalis by the vector.
Expression of the E. faecalis-HA fusion protein is detected by radiolabeling and immunoprecipitation, using methods described in, for example Harlow et al., supra.. To this end, two days after transfection, the cells are labeled by incubation in media containing "S-cysteine for 8 hours. The cells and the media are collected, and the cells are washed and the lysed with detergent-containing RIPA buffer: 150 mM NaCI, 1% NP-40, 0.1% SDS, 1% NP-40, 0.5% DOC, 50 mM TRIS, pH 7.5, as described by Wilson et al. (supra Proteins are precipitated from the cell lysate and from the culture media using an HA-specific monoclonal antibody. The precipitated proteins then are analyzed by SDS-PAGE and autoradiography. An expression product of the expected size is seen in the cell lysate, which is not seen in negative controls.
Example 4: Cloning and Expression in CHO Cells The vector pC4 is used for the expression of E. fdecalis polypeptide in this example. Plasmid pC4 is a derivative of the plasmid pSV2-dhfr (ATCC Accession No. 37146). The plasmid contains the mouse DHFR gene under control of the early promoter. Chinese hamster ovary cells or other cells lacking dihydrofolate activity that are transfected with these plasmids can be selected by growing the cells in a selective medium (alpha minus MEM, Life Technologies) supplemented with the 15 chemotherapeutic agent methotrexate. The amplification of the DHFR genes in cells resistant to methotrexate (MTX) has been well documented. See, Alt et al., 1978, J. Biol. Chem. 253:1357-1370; Hamlin et al., 1990, Biochem. et Biophys. Acta, 1097:107-143; Page et al., 1991, Biotechnology 9:64-68. Cells grown in increasing concentrations of MTX develop resistance to the drug by overproducing the target 20 enzyme, DHFR, as a result of amplification of the DHFR gene. If a second gene is linked to the DHFR gene, it is usually co-amplified and over-expressed. It is known in the art that this approach may be used to develop cell lines carrying more than S1,000 copies of the amplified gene(s). Subsequently, when the methotrexate is withdrawn, cell lines are obtained which contain the amplified gene integrated into one or more chromosome(s) of the host cell.
Plasmid pC4 contains the strong promoter of the long terminal repeat (LTR) of the Rouse Sarcoma Virus, for expressing a polypeptide of interest, Cullen, et al.
(1985) Mol. Cell. Biol. 5:438-447; plus a fragment isolated from the enhancer of the immediate early gene of human cytomegalovirus (CMV), Boshart, et al., 1985, Cell 41:521-530. Downstream of the promoter are the following single restriction enzyme cleavage sites that allow the integration of the genes: Barn HI, Xba 1, and Asp 718.
Behind these cloning sites the plasmid contains the 3' intron and polyadenylation site of the rat preproinsulin gene. Other high efficiency promoters can also be used for the expression, the human 8-actin promoter, the SV40 early or late promoters or the long terminal repeats from other retroviruses, HIV and HTLVI. Clontech's Tet- Off and Tet-On gene expression systems and similar systems can be used to express the E. faecalis polypeptide in a regulated way in mammalian cells (Gossen et al., 1992, Proc. Natl. Acad. Sci. USA 89:5547-5551. For the polyadenylation of the mRNA other signals, from the human growth hormone or globin genes can be used as well. Stable cell lines carrying a gene of interest integrated into the chromosomes can also be selected upon co-transfection with a selectable marker such as gpt, G418 or hygromycin. It is advantageous to use more than one selectable marker in the beginning, G418 plus methotrexate.
15 The plasmid pC4 is digested with the restriction enzymes and then dephosphorylated using calf intestinal phosphates by procedures known in the art.
The vector is then isolated from a 1% agarose gel. The DNA sequence encoding the E.
faecalis polypeptide is amplified using PCR oligonucleotide primers corresponding to the 5' and 3' sequences of the desired portion of the gene. A 5' primer containing a restriction site, a Kozak sequence, an AUG start codon, and nucleotides of the coding region of the E. faecalis polypeptide is synthesized and used. A 3' primer, containing a restriction site, stop codon, and nucleotides complementary to the 3' coding sequence of the E.faecalis polypeptides is synthesized and used. The amplified fragment is digested with the restriction endonucleases and then purified again on a 1% agarose gel. The isolated fragment and the dephosphorylated vector are then ligated with T4 DNA ligase. E. coli HB 101 or XL-I Blue cells are then transformed and bacteria are identified that contain the fragment inserted into plasmid pC4 using, for instance, restriction enzyme analysis.
Chinese hamster ovary cells lacking an active DHFR gene are used for transfection. Five .g of the expression plasmid pC4 is cotransfected with 0.5 p.g of the plasmid pSVneo using a lipid-mediated transfection agent such as Lipofectin or LipofectAMINE.
T (LifeTechnologies Gaithersburg, MD). The plasmid pSV2-neo contains a dominant selectable marker, the neo gene from Tn5 encoding an enzyme that confers resistance to a group of antibiotics including G418. The cells are seeded in alpha minus MEM supplemented with 1 mg/ml G418. After 2 days, the cells are trypsinized and seeded in hybridoma cloning plates (Greiner, Germany) in alpha minus MEM supplemented with 10, 25, or 50 ng/ml of methotrexate plus 1 mg/ml G418. After about 10-14 days single clones are trypsinized and then seeded in 6-well petri dishes or 10 ml flasks using different concentrations of methotrexate (50 nM, 100 nM, 200 nM, 400 nM, 800 nM). Clones growing at the highest concentrations of methotrexate are then transferred to new 6-well plates containing even higher concentrations ofmethotrexate (1 tpM, 2 j.M, 5 piM, 10 mM, 20 mM). The same :procedure is repeated until clones are obtained which grow at a concentration of 15 100-200 jpM. Expression of the desired gene product is analyzed, for instance, by SDS-PAGE and Western blot or by reversed phase HPLC analysis.
Example 5: Quantitative Murine Soft Tissue Infection Model for E. faecalis Compositions of the present invention, including polypeptides and peptides, are assayed for their ability to function as vaccines or to enhance/stimulate an immune response to a bacterial species E. faecalis) using the following quantitative murine soft tissue infection model. Mice NIH Swiss female mice, approximately 7 weeks old) are first treated with a biologically protective effective amount, or immune enhancing/stimulating effective amount of a composition of the present invention using methods known in the art, such as those discussed above. See,e.g., Harlow et al., ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory Press, 2nd ed. 1988). An example of an appropriate starting dose is per animal.
The desired bacterial species used to challenge the mice, such as E. faecalis, is grown as an overnight culture. The culture is diluted to a concentration of 5 X 108 cfu/ml, in an appropriate media, mixed well, serially diluted, and titered. The desired doses are further diliuted 1:2 with sterilized Cytodex 3 microcarrier beads preswollen in sterile PBS (3g/100ml). Mice are anesthetize briefly until docile, but still mobile and injected with 0.2 ml of the Cytodex 3 bead/bacterial mixture into each animal subcutaneously in the inguinal region. After four days, counting the day of injection as day one, mice are sacrificed and the contents of the abscess is excised and placed in a 15 ml conical tube containing 1.0ml of sterile PBS. The contents of the abscess is then enzymatically treated and plated as follows.
The abscess is first disrupted by vortexing with sterilized glass beads placed in the tubes. 3.0mls of prepared enzyme mixture (1.0ml Collagenase D (4.0 mg/ml), 1.0ml Trypsin (6.0 mg/mi) and 8.0 mis PBS) is then added to each tube followed by a 20 min. incubation at 37C. The solution is then centrifuged and the supernatant drawn off. 0.5 ml dH20 is then added and the tubes are vortexed and then incubated for 10 min. at room temperature. 0.5 ml media is then added and samples are serially diluted and plated onto agar plates, and grown overnight at 37C. Plates with distinct oo and separate colonies are then counted, compared to positive and negative control samples, and quantified. The method can be used to identify composition and 20 determine appropriate and effective doses for humans and other animals by comparing the effective doses of compositions of the present invention with compositions known in the art to be effective in both mice and humans. Doses for the effective treatment of humans and other animals, using compositions of the present invention, are extrapolated using the data from the above experiments of mice. It is appreciated that further studies in humans and other animals may be needed to determine the most effective doses using methods of clinical practice known in the art.
Example 6: Murine Systemic Neutropenic Model for E. faecalis Infection Compositions of the present invention, including polypeptides and peptides, are assayed for their ability to function as vaccines or to enhance/stimulate an immune response to a bacterial species E.faecalis) using the following qualitative murine systemic neutropenic model. Mice NIH Swiss female mice, approximately 7 weeks old) are first treated with a biologically protective effective amount, or immune enhancing/stimulating effective amount of a composition of the present invention using methods known in the art, such as those discussed above. See,e.g., Harlow et al., ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory Press, 2nd ed. 1988). An example of an appropriate starting dose is 20ug per animal.
Mice are then injected with 250 300 mg/kg cyclophosphamide intraperitonially.
Counting the day of C.P. injection as day one, the mice are left untreated for 5 days to begin recovery of PMNL'S.
The desired bacterial species used to challenge the mice, such as E. faecalis, is grown as an overnight culture. The culture is diluted to a concentration of 5 X 15 cfu/ml, in an appropriate media, mixed well, serially diluted, and titered. The desired doses are further diliuted 1:2 in 4% Brewer's yeast in media.
Mice are injected with the bacteria/brewer's yeast challenge intraperitonially. The Brewer's yeast solution alone is used as a control. The mice are then monitered twice daily for the first week following challenge, and once a day for the next week to ascertain morbidity and mortality. Mice remaining at the end of the experiment are sacrificed. The method can be used to identify compositions and determine appropriate and effective doses for humans and other animals by comparing the effective doses of compositions of the present invention with compositions known in the art to be effective in both mice and humans. Doses for the effective treatment of humans and other animals, using compositions of the present invention, are extrapolated using the data from the above experiments of mice. It is appreciated that further studies in humans and other animals may be needed to determine the most effective doses using methods of clinical practice known in the art.
The disclosure of all publications (including patents, patent applications, journal articles, laboratory manuals, books, or other documents) cited herein are hereby incorporated by reference in their entireties.
The present invention is not to be limited in scope by the specific embodiments described herein, which are intended as single illustrations of individual aspects of the invention. Functionally equivalent methods and components are within the scope of the invention, in addition to those shown and described herein and will become apparant to those skilled in the art from the foregoing description and accompanying drawings. Such modifications are intended to fall within the scope of the appended claims.
Throughout the specification, unless the context requires otherwise, the word "comprise" or variations such as "comprises" or "comprising", will be understood to imply the inclusion of a stated integer or group of integers but not the exclusion of any other integer or group of integers.
Page(s)2$?7-3O are claims pages they appear after the sequenc ~sting TABLE 1. Nucleotide and Amniho Acid Seqeuences of E. faecalis Genes.
EFOO1-1 (SEQ ID NO:l)
TGAAAGAATA
AAAAAAATGA
TTGGGGGCTT
AAGGAAGATA
ACAAAATTAA
CAATCTGCTT
TTACCAACAA
TTAGTGGACT
ATTCGTGAAG
AATAAATCGA
GTACCGAAAA
AAAGAAGTCG
AACAAAGGCG
GTGGACTATT
TATTTATCTG
GCTGG ITITG
CCTGAAAAAA
GAACAACGGA
TACTGGGCAC
TACAAAAATT
TTCGCTATCC
GAAAGTATTT
CAATTTGA.AC
TTGCCAGAAC
AGTTCAAAAC
GTGGTAACGG
CCACAATCAC
CGAAAACT
ACCCTGATT
P1'ACGCAACC
TAAAACCATA
TATTGTTAGA
CAGAAATGTT
CATTAGAGGA
'ITGGTGCTGG
'PTGATTAA
ACCGTGATGG
GCCCATTTC
TTCAAAAAGA
TCAACTTACA
CAGCGGCATT
AACAAACAGG
CTAAGACAAC
CAGTAGAAGA
TTGCTTCATC
AAGCATGGAA
GTGGCGAGCA
TCTAGCAACA
TAATGGGGCC
TTCTGGCAT
CATGAAAG2A
ACAAGCCAAA
GTACCCAGGC
TATGGATGAT
CGGCGCCAAA
ATTCTATAAT
A ITAAAAGAA
TT=ACTCG
TAAAGACTTA
TATCGAAGCA
AAACAAAAAG
TGCTGAAGCT
ACAAGGAACA
TGAATTCATG
TTATATGCCA
CAALAGTACCT
AAATGCTGAT
AAATAAAGAC
CCAATAA
AATTrGTITTA
ACAGTGTTAG
AA.AGAATCAA
GCAATG-AATG
AATCCAAAAA
ATCAATTCGA
TGGTTATOGA
GACACAATCG
ATCGACGGCA
GGTETTGT
CCTTCTAAAA
'rrAAATAACT
GATTAACAA
GGTTACTTCC
GTAGCAATGT
GGTGGCTATG
GATATTATA
AAATTCTTAG
ATFIT17AGAAT
GCACAACTTG
TCACCCTATA
ACGAGAAAAT
TAAATTTTT
CAACCGCAGC
ACGATATTGT
GGGTTCAAGA
?I'AAAGTGGA
CTTAACTTC
ATGCTGCACA
GCTGGAAAGA
AACPLATACGG
TGAAAGAATA
CAATTTACCA
ATTACGPA.AT
GCAAAGATTC
GCACAGCT-GG
TTGTCGGTAG
AATACGGTGT
TC'ITCGATAG
CTACTCCTGA
CTGTTTTACA
AAAACGCAGT
ATrGAAATGCG
TATTGAAAGA
TAAGGGAGAG
TATTFCGCA
GAAAGAAGTG
AGAAGCGTTA
AFI'ACAAAAT
ACCAAAAGAT
AGATGAAATG
TGCCAGCCA
CATTCCATTT
TrGGTGTTGAA
AAAATCCAAC
TGGAATGAAA
ACAAGAAGTC
TTCAGATAAA
TATTGCTGGT
TGCACCACGT
TGCTACGCCA
TI'CACAATTG
CAGTGATGAG
AAAAGATTTA
GACAATTATG;
TGCAACATCA
EF001-2 (SEQ ID NO:2) MKFRTLATT VLATAAIFAL GACGNGNGAK ESNDIVKEVK
S.
EDTTITFWHA
PTITQAYPGW
KSTEMLFYNA
KGVDFNKDLD
GFVQKDAEAG
WAQQTGYMP I S IFASSNKDT
MNGVQEEALT
LWNAAQD04L
DLIJKEYGVEV
LTSKDSQEVV
GYEYGVAPRP
LESVLHSDEY
RKr 4
LKDATSQ
KLTKDFMKEN
VDLKPYMDDD
PKTLEELKEA
DYYRDGIEAG
EK INLQQGTD
KNSKTTKVPA
FEQAWNQ
PKIKVELQNQ
TIGWKDAEPI
SKTIYEKSNK
YFRTAGSDKY
IYMFDSATPE
QL.ENAVKDLF
SAYPDLQAKI
REVLLDGARI
EVVGAGFD-SL
LSGPFANKKV
QRTAAFEFMKfl
AIPVEENADS
NSTLTSPKDL
DGKQYGIPFN
NNYYAIGMKN
ANFVGS IAGA
FLATPOSQLY
AYNEMRTIME
EF001-3 (SEQ ID NO:3) 7T GTCCTAACGG TAATGGGGCC AAAGAATCAA ACG-ATMrrGT GAAAG.AAGTG
AAGGAAGATA
ACAAAATTAA
CAATCTGCTT
TTACCAACAA
TTAGMGACT
ATTCGTGAAG
AATAAATCGA
GTACCGAAAA
AAAGAAGTCG
AACAAAGGCG
GTGGACTATI'
TATTATCTG
CGACAATCAC
CGAA.AGACTT
ACCCTCA ITT
TTACGCAAGC
TAAAACCATA
TATTGTTAGA
CAGAAATGTT
CATTAGAGGA
TTGGTGCTGG
TTGATTTAA
ACCGTGATGG
CCCCATTTGC
TTCTGGCAT
CATGAAAGAA
ACAAGCCAAA
GTACCCAGGC
TATGGATGAT
CGGCGCCAAA
ATTCTATAAT
ATTAAAAGAA
TTTTGACTCG
TAAAGACTTA
TATCGAAGCA
AAACAAAAAG
GCAATGAATG
AATCCAAAAA
ATCAATTCGA
TGGTI'ATIGGA
GACACAATCG
ATCGACGGCA
GCTGATrTGT
GCTI'CTAAMA
ITrAAATAACT
GATTTAACAA
GGTTAC'FrCC
GTACCA.ATGT
GGGTTCAAGA AGAAGCGTITA TTAAAGTGGA ATTACAAAAT CITAACTTC ACCAAAAGAT ATGC~f-CACA AGATGAAATG GCTGGAAAGA TGC-AGAGCCA AACAATACGG CATTCCATTT TGAAAGAATA TGGTGTTGAA CAATTIACGA AAAATCCAAC ATTACCCAAT TGGAATGAAA GCAAAGATTC ACAAGAAGTC GCACAGCTGG TTCAGATAAA ?I'GTCGGTAG TATTGCTGGT TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
GCTGGCTTTG
CCTGAAAAAA
GAACAACGGA
TACTGGGCAC
TACAAAAATT
TTCGCTATCC
GAAAGTATTT
CAATTTGAAC
TTCAAAAAGA
TCAACTTACA
CAGCGGCATT
AACAAACAGG
CTAAGACAAC
CAGTAGAAGA
TTGCTTCATC
AAGCATGGAA
TGCTGAAGCT
ACAAGGAACA
TGAATTCATG
TTATATGCCA
CAAAGTACCT
AAATGCTGAT
AAATAAACAC
CCAA
CGTCGCTATC
GATATTTATA
AAATTCTTAG
ATTTAGAAT
GCACAACTIG
TCAGCCTATA
ACGAGAAAAT
AATACGGTGT
TGTTCGATAG
CTACTCCTGA
CTGTrrTACA
A-AAACGCAGT
ATGAAAT'GCG
TATTGAAAGA
TGCACCACGT
TGCTACGCCA
rrCAAATTG
CAGTGATGAG
AAAAGATrITA
GACAATTATG
TGCAACATCA
EF001-4 (SEQ ID NO:4) CGNGNGAX< ESNDIVKEVK EDTTITFWHA MNGVQEEALT PTITQAYPGW LWNAAQDEML KST2EMLFYNA DEJLKEYGVEV KGVDFNKDLD LTSKCSQEVV GFVQKDAEAG GYEYGVAPRP WAQQTGYMP I LESVLHSDEY **SIFASSNXDT RKLLKDATSQ
KLTKDFMK~EN
VDLKPYMDDD
PKTLEELKEA
DYYRDGIEAG
EKINLQQGTD
KNSKFI'KVPA
FEQAWNQ
PK IKVELQNQ
TIGWKDAEPI
SKTIYEKSNK
YFRTAGSDKY
IYMFDSATPE
QLENAVKDLF
SAYPI)LQAKI
REVLLDGAKI
EVVGAGFDSL
LSGPFANKKV
QRTAAFEFMfl Al PVEENADS
NSTLTSPKDL
DGKQYGIPFN
~'NYAIGbM(N AMFVGS EAGA
FLATPDSQLY
AYNEMRTIME
*5
S
S.
S S
S
S.
S S
S
SSSSS
S
5*5S
S
5.5.
S
*555
S
S S *5 EF002-1 (SEQ ID
TAAATAGCGG
TTAGCAGTGG
GAAAAGAGTG
CCAGAATTTG
CCGGTGGACA
GATACGACGG
AATCAATTGG
AGTTACGAGA
TGGGTATTGT
TTAACTTGGG
TATGGTGCCT
AATGCCAATT
AGAATGCAAA
TATCAATCAC
GGGACTTTAT
ATACCACAAC
AATAAAAACA
CAAGGTCCAA
GATAAAATCT
AGGTAGTACA
CGGCACTAAC
AAGATGGCAA
AGAAATTATT
TTGCTTCAGA
ATATTTTAAC
TGGATTTAAC
TGTATGAAAT
ATTACAATAA
ATGAATATGA
ATCAACATAC
TCATI'GAACC
AAGATCAATC
AATTTGAAAA
TAACAAACAT
AAGAAAAAGG
GTAAAAAACA
AACTT1rAGC
ACTTTGCAAC
AATGAAAT'Fr
TTTAACAGCA
AACAAAATTA
CAGAGCTTTT
TGATTATGAC
CATGAAAAAC
CGATCACG LT
CGATGGTAAA
AAAAATCTTT
AGCGTTAGCG
TTGGCGCTCA
AAAATACAAT
ACAAATGGAT
TTCAAA.AGCG
TCATrGATGGC
CAAAGCAACT
AAAAGCTGCT
AGAAGTAGGG
AAAAGGAATG
TGGAAAAAAG
TGTGGTGGTT
ACAGTAACTA
GAAGCGGAAA
ACAAAAGTAA
TTAC=ICAT
AAAGATTTAG
ACCTATGCTC
CATGAAGCCG
AAAAAATTAT
ACCGTTCAAG
TATATGGAAA
TTTGGAACAG
GCGATGATGT
AAAACAAATG
ACCTTTGGCT
CAAAAA'FrCT
GTGGTTCCTT
CCTTCAGACG
GCTTAACAGC
CAAGTGAAAA
CI-TGGAATTA
ATCCTGATAT
CAACGATGCT
'ATTCTA-ATTA
ATATCGAACC
AGCCTTACCG
CAATTGCCTA
CTAAACCACA
CGATI'GCTGC
CTTATTATGA
CAA-AATCAAC
ACATGGGTAG
TCGAATGCGG
CACCGACAAG
TAGACTTTC
CTTATAAAAC
AGTCTCACAA
GGCAGCGCTG
GAAACCAACT
TGACACGACC
CACTATTIAA
rrCATCAGGA
CGCGCTACGC
TGCCAAAGCA
TACAGA ITC
TCCCGATAAC
AGAACAAGTA
TGCTCAAAAC
TCGCGCATTG
AAAAGTAACG
CTGGTACATG
GATTGCCGAA
TTTTGCAATT
TTCAGGTAAA
AGATGAAATT
AAAGCCTTTA
ACCCAGATAC AATTAATTTA G EF002-2 (SEQ ID NO:6) MKFW KXGLTAAALL AVAAVTLTAC GGSSEKKATE KSEDGKTKLT Vr7IMTYDP EFEKLFRAFE AENPDITIEP VDIASDDYDT KVTT~1LSSGD TITD ILTMKNL LSYSNYAIJRN QLVDLTDHVK DLDIEPAKAS YEMYEIDGKT YAQPYRTDFW VLYYNKKMFD EAGIAYPDNL TWDEYFEALAX KLSKPEEQVY GAYQHTWRST VQAIAAAQNN ANLIEPKYNY METYYDRALR MQKDQSQMDF GTAKSTKVTY QSQFENSKAA MMYMGSWYMG TLLTNIDDGK TNVEWGIAEI PQQEKGYKATT FGSPTSFAIN KNSKXQKAAQ KFLDFASGKE GAXLLAEVGV VPSYKTDEID 81 TABLE 1. Nucleotide and Amnino Acid Seqeuences of E. faecalis Genes.
KIYFARKGMP SDESHflKLT QIQLI EF002-3 (SEQ ID NO:7) A TGTGGTGGTT CAAGTGAAAA GA-AAGCAAC GAAAAGAGTG AAGAT.3GCAA AACAAAATTA
CCAGAATTTO
CCGGTGGACA
GATACGACGG
AATCAATTGG
AGTTACGAGA
TG'-TATTGT
TTAACTTGGG
TATGGTGCCT
AATGCCAATI'
AGAATGCAAA
TATCAATCAC
GGGACTTTAT
ATACCACAAC
AATAAAAACA
GAAGGTGCAA
GATAAAATCT
ACCCAGATAC
AGAAATTATT
TTCCTTCAGA
ATAT=AAC
TGGXFFI'AAC
TGTATGAAAT
ATTACAATAA
ATGAATATGA
ATCAACATAC
TGATTGAACC
AAGATCAATC
AATTTGAAAA
TAACAAACAT
AAGAAAAAGG
GTAAAAAACA
AACTTTTAGC
ACTTIrGCAAG
AATTAATT
CAGAGCTT
TGATTATGAC
CATGAAAAAC
CGATCACGTT
CGATGGTAA
AAAAATGTTT
AGCGTTAGCG
TTGGCGCTCA
AAAATACAAT
ACAAATGGAT
TTCAAAAGCG
TGATGATGC
CAAAGCAACT
AAAAGCTGCT
AGAAGTAGGG
AAAAGGAATG
T
ACAGTA-ACTA
GAAGCGGAAA
ACAAAAGTAA
TTACTTCAT
AALAGAT'IAG
ACCTATGCTC
CATGAAGCCG
A.AAAAATTAT
ACCGTTCAAG
TATATLGGAAA
T=~GAACAG
GCGATGATGT
AAAACAAATG
ACCTFIGCT
CAAAAATTFCT
GTGCFTCCTT
CCTTCAGACG
CTGGAATTA
ATCCTGATAT
CAACGATGCT
ATTCTAATTA
ATATCGA.ACC
AGCCTTACCG
GAATTGCCTA
CTAAACCAGA
CGAT GC C~rAkTTATGA
CAAAATCAAC
ACATGGGTAG
TCGAATGGCG
CACCGACAAG
TAGACTTTGC
C'ITATAAAAC
AGTCTCACAA
TGACACGACC
CACTAITGAA
TTCATCAGGA
CGCGCTACGC
TGCCAAAGCA
TACAGATTTC
TCCCGATAAC
AGAACAAGTA
TGCTCAAAAC
TCGCGCATTG
AAAAGTAACG
CTGGTACATG
GATTGCCGAA
=TTGCAATT
TTCAGGTAAA
AGATGAAATT
AAAGCCTTTA
S
EF002-4 (SEQ ID NO:8) C GGSSEKKATE KSEDGKTKLT VTrrdNYDTTP EFEKLFRAFE AENPDITIEP VDIASDDYDT KVTTKLSSGD QLVDLTDHVK OLDIEPAKAS YEMYEIDGKT YAQPYRTDFW TWDEYEALAK KLSKPEEQVY GAYQHTWRST VQAIAAAQNN MQKDQSQMDF GTAKSTKVTY QSQFENSKAA MMYMGSWYMG PQQEKGKATT FGSPTSFAIN KNSKKQKAAQ KFLDFASGKE KIYFARKGMP SDESHKKPLT QIQLI TTD ILThKNL
VLYYNKKMFD
ANL IEP(YNY TLLTMI DDGK
GAI<LLAEVGV
LSYSNYALRN
E-AGIAYPDNL
METYYDRALR
TNVEWGIAEI
VPSYKTOEID
EF003-1. (SEQ ID NO:9) TAGGAGGACA A.AAGAATGAA ATITAGCTG CCTGTGGGGG GTTGCCGTGC AATTGGAATC AAAA.AAGGGT ACAAAATTAA GTGCAACATG ACGAAGCGGA AACAAAGAGA AAAAAGCTGA TTCTATTCAA AAGAATACCA CCTAGCGATC CAACCAATGA AAATTAAAAG AAGGTGTCGG AACATCACIT TTGAAAGCAT ATCGCTATGG TGTTCTGCTA GCGATCTITGT TAGAAGATAA AAAGGCGAAA AAGATAGCGA G'ITGCTGAAT ACATCAAGAA GAAGTT71TAT
AAATAAACAA
TTCAAAAGAT
CATTATGGAA
TGCTAATT
TTTAGTGGCT
AGATGCGAAA
AGGTCGTGCT
CTTAACGGC
TGATTTACTG
CCCAGCCTAC
AGAAGCAAGT
AAAAATcAAr,
AAATTCTAAA
TTAGCNACAT TCGCTGTrAT TGCAACAGTT
GCAGACCAGA
ATCTTGGAGA
GTGAGCGACA
GCGCAACATC
GTGCAACCGA
GATTrACCTG
TTAGCAA=
ACGGTGGCAG
AATI'AGCTA
TTAGAACCTG
AAACATTACG
GTTTTAAAAG
GGCGCCAATA
AAGAAGACAA
TTGCCAAGAA
ATGTTGCCTA
AACCCTTAT
TTTATTATT
AAAATGCCAA
TAA.ATGCAAA
ATGTCGTGGA
AAGCCTATGA
CTGGTTAAC
cATTGCAAGT
AAGCGATGAC
TTCCTGCGT
GGAGATTACC
AGAAGCTGAG
CAACGATGCC
GGAAATGTT
TGCTGGTGGT
AGTO-GGGATT
CGGCGTGATT
AAATCCTAAA
TCAAAALAGAC
AACGAALAGAT
TGTGACACGC
AACAAAAGAA
TTAA
EF003-2 (SEQ ID NO:1O) 82 TABLE 1. Nucleotide and Amino Acid Seqcences of E. faecalis Genes.
MKKFYL ATFAVIATVI LAACGCNXQA DQKEDKEITV AVQLESSKD)I LEIAXXEAEI( KGYKINIMEV SDNVAYNflAV QHDEADANFA QHQPFMEMFN KEKKADLVAV QPIYYFAGGF YSKEYQDAKD LPENAKVGIP SOPTNEGRAL AILNANGVIK LKEGVGFNGT VADVVENPKN ITFESIDLLN L.AKAYDEKDI AMVFCYPAYL EPAGLTTKDA ILLEDKEASK HYALQVVTRK GEKDSEKIK'J LKEAM~TTKEV AEYIKKNSKG ANIPAF EF003-3 (SEQ ID NO:11) CTGTGGGGG AA.ATAAACAA GCAGACCAGA AAGAAGACAA GGAGATIACC
GTTGCCGTGC
AAAAAAGGGT
GTGCAACATG
AACAAAGAGA
TTCTATTCAA
CCTAGCGATC
AAATTAAAAG
AACATCACTT
ATCGCTATGG
GCGATCTTGT
AAAGGCGAAA
GTTGCTGAAT
AATI'GGAATC
ACAAAATTAA
ACGAAGCGGA
AAAAAGCTGA
AAGAATACCA
CAACCAATGA
AAGGTGTCGG
TTGAAAGCAT
TGTTCTGCTA
TAGAAGATAA
AAGATAGCGA
ACATCAAGAA
'TCAAAAGAT
CATTATGGAA
TGCTAATTT
TI'TAGTGGCT
AGATGCGAAA
AGGTCGTGCT
CFI'TAACGGC
TGATTTACTG
CCCAGCCTAC
AGAAGCAAGT
AAAAATCAAG
AAATTCTA.A
ATCTTGGAGA
CTGAGCGACA
6CGCAACATC
GTGCAACCGA
GATTTACCTG
TTAGCAATT
ACGGTGGCAG
AATTIAGCTA
TTAGAACCTG
AAACATTACG
GTTTTAAALAG
GGCGCCAATA
TTGCCAAGAA AGAAGCTGAG ATrGTTGCCTA CAACGATGCC AACCCTTCAT GGAAATGITT T'ITATTATTT TGCTGGTGGT AAA.AltCCAA AGTGGGATT1 TAAATGCAAA CGGCGTGAT ATGTCGTGGA AAATCCTAAA AAGCCTATGA TCAAAAAGAC CTiGGTTrAAC AACGAAAGAT CATTGCAAGT TGTGACACGC AAGCGATGAC AACAAAAGAA TTCCTGCGTT T
S*
S
S.
S
.5
S
S
S
EF003-4 (SEQ ID NO:12) CGGNKQA DQKEDKEITV AVQL.ESSKDI LEIAKKEAE-K KGYKINIMEV SDNVAYNDAV QH.DFADANFA QHQPFMEMFN KEKKADLVAV QPIYYFAGGF YSKEYQDAXD LPENAKVGIP SDPTNEGRAL AILNANGVIX LKEGVGFNGT VADVVENPKN ITFESIDLLN LAKAYDEXDI AMVFCYPAYL EPAGLTTKDA ILLEDKEASK HYALQVVTRK GEKDSEKIKV LKEAMTTKEV AEYIKKNSKG ANIPAF EF004-1 (SEQ ID NO:13)
TAAATCGAAA
ATTIrGCAG
GAAACTG.CTG
ACAGAAAACA
TTAGTAAAAA
GTCATTACGG
GCGATAGAT'
AATGTTGAAG
CCTTACACGA
GAAGGATGAT
CTCTICTCTC
CCCAGTCAAG
TAACCCAAGC
TAGATGIrAA 7rAAGGTAAT
CCAACTCAGG
AAGTAGCCAA
ATGGGA-ACGA
AGAAATGAAA
TGCCTCTAC
CACTA'FrGAA
AGTTAAACAG
AAATAATGTT
TAATCATGAA
TACAGAGGCA
AACATTAGAA
CAGAACCATA
AAAATGATTA
AACGCAAAAA
GCTTrCAGACA
TTAGAAGAAA
AAAGATGACA
GCAAAAAAAA
CAAAAGACTG
AATGATAACC
GCAAAATCAA
AATTTGCAGG CATTGCTCTT ATAATACACA AALAGAAAG"C GTAACGAAAA CGAGCCTAAT AATTTAACTC TGACGAGAAA CATCAGATAA CCCTCACGCT ATATGGAACA AATGCAGACT CCATATACGG AATCAATTA ATGTTATTTC TTrCATCACA CTAAAAATGA AAATA TTATT CCGTTAGTAA AATAA EF004-2 (SEQ ID NO:14) MKK MIKFAGIALI FAALLSACSN AKNNTQKKAE TAAQSSTIEA SDSNENEPNT ENITQAVKQL EE-KFNSDEXL VKIDVKNNVK DDTSDNPIIAV I'I' VINDEA KKNMEEMQTA IDSNSGTEAQ KTAIYGIQLN VEEVAKTLEN DNDVISFITP YTNCNDRTIA KSTKNENIIP
LVK
EF004-3 (SEQ ID 83 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Gcncs.
CTGTAGC AACGCAAAA.A ATAATACACA AAAGAAAGCC
GAAACTGCTG
ACAGAAAACA
ETAGTAAAAA
GTCATTACGG
GCGATAGAT
AATGTTGAAG
CCTTACACGA
CCCAGTCAAG
TAACCCAAGC
TAGATGTI'AA
TTAAGGTAAT
CCAACTCAGG
AAGTAGCCAA
ATGGGAACGA
CACTATI'GAA
AG'ITAAACAG
AAATAATGT
TAXTGATGAA
TACAGAGGCA
AACATTAGAA
CAGAACCATA
GCTTCAGACA
TAGAAGAAA
AAAGATGACA
GCAAAAAAAA
CAAAAGACTG
AATGATAACG
GCAAAATCAA
GTAACGAAA
AATTTAACTC
CATCAGATA.A
ATATGGAAGA
CCATATACGG
ATGTTATTTC
CTA-XAAATGA
CGAGCCTAAT
TGACGAGAAA
CCCTCACGCT
AATGCAGACT
AATTCAAMTA
TTTCATCACA
AAATATTATT
CCGTI'AGTAA AA EF004-4 (SEQ ID NO:16) CSN AXNNTQKXAE TAAQSSTIEA SDSNENEPNT ENITQAVKQL~ EEKFNSDEKL VKIDVKNNVK DDTSDNPHAV TTVKVINDEA KKNNE7EMQTA IDSNSGTEAQ KTAIYGIQLN VEEVAKTLEN DNDVISFITP YTNGNDRTIA KSTKNENIIP
LVK
EF005-1 (SEQ ID NO:17) Not* *000 0* 060 S
TAAAAAATGA
GTTGGTTGtG
CTAACTCTTT
GGATGGTCAG
GGACTGAAAG
ACTGCTCAAC
CCAGATTAC
CAGGATATTG
GAATCCT LTG
TGGCCTGCAG
GTTG.CCACAG
AGTATTI'CAG
A.AGAATGCTA
AAAAACGATT
GTAAAAATCA
ACATTGTGCG
ATGCGGTCCT
ATGTTGCCTT
TTA T=AGA GTGAATT1'AA
CTGATGATCA
CCAATAGTGT
AAGACTATGC
AATCAGCCAA
CGTTGTLTAGC
GTGTCACAAC
CACGATTGTG
GCAAGCAACG
CCACGGAAAA
AACACCAGAA
TCAAAATGCA
TCAAAATAAA
TTTGGTAGC
AGGTGTTCC
AGCTAAACTG
TACAATI'ACT
TCTGGAAT
AACTTTATI
AATTCATTAC
AGCACGCGAA
GGGATGCTTT
ACAAAAGAAA
ACCATGTTAA
GGTGAAAAAG
TATAGTAGTG
GCAGGCAAAG
TATGAAGGGG
TTAGAAGAAT
GATCAACAGC
AAACGTTTGA
GGCAATGTTT
GATGATI'TA
AAAAATGGCG
AAAGAATCAA
TTCTGGCCAT
A.AGAGACAAA
ATACGACGGA
TTGTGACAGC
ATAGTGGCCG
ACCTTGAAGT
ATTTAAATAA
TTATGAAAAA
GCGAGGAAAG
AAAA.AGGCTr
TAGTGGTCTC
A.AGTCCCAGA
AATATAC=II
AATA.A
TTTAGTAATC
ACCTGAAGAA
CCGCGTACAA
AACTGGGAT
CGCCTTGCAA
CGTGCGTGAC
GACAATGTGG
CATGACTCCT
CAAGAATAAC
AGATAAAATT
TCATGGCTTG
AGGCGGTTTG
GGATAAAGTC
AATGATGTCA GCTACTTAGA EF005-2 (SEQ ID NO:18) MKKRLTIVG MLFLAILVMV GCGKNQQATT KEKETKPEEL ThY IVRHGKT MLN'rTDRVQG WSDAVLTPEG EKVVTATGIG LKDVAFQNAY SSDSGRALQT AQLILDQNKA GKDLEVVRDP DLREFNFGSY EGDLNKTMWQ DIADUQGVSL EEFfrflNMTPE SFANSVAKLD QQREESKNNW PAEDYATITK RLKKGLDKIV ATESANSGNG NVLWVSHGLS I SALLATLFD DFKVPEGGLK NASVTTIHYK NGEYTLDKVN DVSYLEAGEK ESK EF005-3 (SEQ ID NO:19) TTGTG GTAAAAATCA GCAAGCAACG ACAAAAGAAA AAGAGACAAA ACCTGAAGAA CTAACTCTT ACATTGTGCG CCACGGAAAA ACCATGTTAA ATACGACGG.A CCGCGTACAA GGATGGTCAG ATGCGGTCCT AACACCAGAA GGTGAAAAAG TTGTGACAGC AACTGGGAT GGACTGAAAG ATGTTGCCTT TCAAAATGCA TATAGTAGTG ATAG;TGGCCG CGCCTTGCAA ACTCCTCAAC TTATITI'AGA TCAAAATAAA GCAGGCAAAG ACCETGAAGT CGTGCGTGAC CCAGA=TAC GCGAT AA TTTIGGTAGC TATGAAGGGG AtTTTAAATAA GACAATGTGG CAGGATATTG CTGATGATCA AGGTGTTTCC TTAGA.AGAAT TTATGAAAAA CATGACTCCT TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes.
GAATCCTTTG
TGGCCTGCAG
GTTGCCACAG
AGTATTTICAG
AAGAATGCTA
AATGAMTCA
CCAATAGTGT
AAGACTATGC
AATCAGCCAA
CGTTGTTAGC
GTGTCACA-AC
GCTACTTAGA
AGCTAACTG
TACAATTACT
I'rCTGGGAAT
AACT'ITATT
AATTCATTAC
AGCAGGCGAA
GATCAACAGC GCGACGAAAG AAACGTTTGA AAA-AAGGCTT GGCAATGTTT TAGI'GCTCTC GATGATTTTA AAGTCCCAGA AAAAATGGCG AATATACTTT AAAGAATCAA AA
CAAGAATAAC
AGATAAAATT
TCATGGCTTG
AGGCGGTTTG
GGATAAAGTC
EF005-4 (SEQ ID CGKNQQATT KEKETKPEEL TLY IVRHGKT MLNTTDRVQG WSOAVLTPEG EKVVTATGIG LKDVAFQNAY SSDSGRALQT AQLILDQNKM GKDLEVVRDP DLREFNFGSY EGDLNKTMWQ DIADDQG'JSL EiE-FMKNMTPE SPANSVAKLD QQREESKNNWA PAEDYATITK RLKKGLDKIV ATESANSGNG NVLVVSHGLS ISALLATLFD DFKVPEGGLR NASVTTIHYK NGEYTLDKVN DVSYLEAGEK ESI( EF006-1 (SEQ ID NO:21) @0 0O S 0E S S
S.
0@ 0O S 0@
S.
S S 0
SO
S S 0 0 505.5.
S
*OS.
0 *050 *0 00@0 0 0000
S
0 *605 60 0 0*
S.
TAAACGATAA
GCAGTAGCTG
AAAG ETGGAG
GAAAAAGAAG
GCGTTGGAAA
GCGGTTAAAG
GGGC'ITACT
GTTAGCTCTT
ATCACGCTGA
ACTAAAAAGT
AATGAAGAAG
CCGAA.AAAAG
GTTCGTAA.AG
AAAGAAGTCC
ATGGAGGGAA
TCTTAGTTT
CTTCACCAGT
GCGTAAAATT
GTGGCGATAT
AAAATGATTA
CGAAAAAATA
CCGTTTCAGA
AAGAAGGGGT
TGAAATTCA-A
GGGCTGCGGT
ATGCGATTGC
AAGACGAAAA
AAGATTGGAT
TAAGATGAA
AGGGGCTTGC
TCCACATGCA
AGAAGTGACG
CGATCCCAAC
TGACTTTGTG,
CAAATCGTTA
TGGCCACGC
AGACCGGACA
TCATGAAAGT
TTTAArrAAC
CTTAGAAAAA
CAACGAAAAT
TACGAAAAAA
AAACGTACAT
GGCAATAA.AA
GAGA 1TTTAG
ACTTATACAG
TATFI'CCAAC
AATGCAGCGTG
CAAGAAATTC:
GTATTAACTA
ACTGCTACTT
GATCCAGC-AA
TCAAACTTTG
CAAAGTTCAC
GTAAAAAAAT
TGGAACGGCG
TATGC-TCAGT
AGAGTGATGA
A.ACATGTAAA
ATTACGTGCT
ATGTOCCGTT
CGATTCAT
CTGATGGTTC
TCTAGAAGA
TCGATGATAT
TCATGSACCAC
CCGTGGATCA
CTTATGCCAA
TAGTCAAAGT
CTATTGTECC
AATI'ACTGTA
CTCGGTCTTG
ACCTTTATTA
ACCTAACAAG
CTTTAATGAA
AGAACCAGTT
AACGATTTAC
TGCTGGTTTA
TGATAAAAAT
TCTTTATCAC
AGGTAAAT
TATTATTGCO
GTTACGTAC
AGTCAATGAA
TAA
EF006-2 (SEQ ID NO:22) MIKK RTLWSVITVA VAVLVLGACG NXKSDDSVLK VGASPVPH~AE ILEHV'KPLLE KEGVKLEVTT YTDYVLPNKA LESGDIDANY FQHVPFFNE-A VKENDYDFVN AGAIHLEPVG LYSKKYKSLQ EIPDGSTIYV SSSVSDWPRV LTILEDAGLI TLKEGVDRTT ATFDDIDKNT KKLKFNHESD PAIMT]TLYDN EEGAAVLINS NFAVDQGLNP KKDAIALEKE SSPYANI IAV RKEDE--NNENV KKLVKVLRSK EVQDWITKKW NGAIVPVNE EF006-3 (SEQ ID 1NO:23) TTGC GGCAATAAAA AGAGTGATGA CTCGGTCTTG
AA.AGTTGGAG
GAAAAAGAAG
GCGTTGGAAA
GCGGTTAAAG
GGGC ITTACT
GTTAGCTCT
ATCACGCTGA
ACTAAAAAGT
AATGAAGAAG
CTTCACCAGT
GCGTAAAATT
GTGGCGATAT
AAAATGATTA
CGA.AAAAATA
CCGTTTCAGA
AAGAAGGGGT
TGAAATTCAA
GGGCTGCGGT
TCCACATGCA
AGAAGTGACG
CCATGCCAAC
TGACTTTGTG
CAAATCGTTA
'rrGGCCACGC
AGACCGCACA
TCATGAAAGT
TTTAATTAAC
GAGXTTTAG
ACTTATACAG
TATTTCCAAC
AATGCAGGTC,
CAAGAAAVTC
GTATTAACTA
ACTGCTAC1'
GATCCAGCAA
TCAAACTT'rG
A.ACATGTAAA
ATTACGTGCT
ATGTGCCGTT
CGA'rTCAT'TT
C-MATGGTC
TCTAGAAGA
TCGATGATAT
TCATGACCAC
CCGTGGATCA
ACCTTTATTA
ACCTAACAAG
CTTTAATGAA
AGAACCACTT
AACGATTTAC
TGCTGGTTTA
TGATAAAAAT
TCITTATGAC
AGGATTAAAT
TABLE 1. Nucleotide and Amino Acid Seqeuences of E.]a~ecajlis Genes.
CCGAAAAAAG ATGCGATTGC CTTAGAAA.AA GAAAGTTCAC CTTATGCCAA TATTAT'TGCG GTTCGTAAAG AAGACGAAAA CAACGAAAAT GTAAAAAAAT TAGTCAAACT GTTACGTAGC AAAGAAGTCC AAGATTGGAT TACGAAAAAA TGGAACGGCG CTATTGTTCC AGTCAATGAA EF006-4 (SEQ ID NO:24) CG NKKSDDSVLK VGASPVPHAE ILEHVKPLLE
KEGVKLEVTT
LYSKKYKSLQ
KXLKFNHESD
RKEDENNENV
YTDYVLPNKA
El PDGSTIYV PAIM ETLYDN
KKLVKVLRSK
LESGDIDANY
SSSVSDWPRV
EEGAAVLINS
EVQDWITKKW
FQHVPFFNEA
LTI LEDAGL I
NFAVDQGLNP
biGAIVPVNE VKENOYDFVN AGAI HLEPVG TLKEGVDRTT ATFDDIDKNT KXDAIALEKE SSPYANI IAV EF008-1 (SEQ ID V00*0 0.00.
.0.0* 0 TA.AAC CGTGA
ACACTTTTAG
AA.ATTAGCAA
GACAAAATTG
TTACCAGAAG
TI'AGAAACAG
AATAAAGAT
CAAGAACAAA
CAAAACATTC
AACGCGAAAA
GCTGATATTC
TCCAAAGCTT
ACACCTGAAC
ITTTGTTIGAAA
CCAATTTACG
TACTACAGCA
GAAAGAAATG
CAGGGTTA-AC
TTGCIGACAAC
AGCTGCATAG
ACATTGCGAA
GCGGAAATG
ACTTTTCTAC
CAGAAGATCC
GTGACGTCTT
ATTATACCGA
CTGATGATAA
ATGATTTAAA
AAATGACCAC
CCAGTGTCGA
ATACACTTTT
TGATGAACTG
GAGGAATCAA
GTTAGCTGCT
GAACTCCATC
TATTGTGCCA
AGCTTCTGAA
CTGTTAAC
AAGCAAAAAT
ACATGCTTGG
AGTAGAAAAA
AAAACTTAGC
AAAATTATTA
TGCCGCTTAT
GATTATTGAT
TAAACGTAGT
CACAGACTCT
GAATTTAACA
CGAATGAAAA
TGCGGGAATC
CTATCTGATT
ATTGGGACAG
GCGGACATT=
AAA'rTAATGA
GTTACGCCAC
TAGACATTG
GATCCAAAAA
AAACTACATG
GTTACAAGTG
ATFI'GGGAAA
ACCATTAAGA
ATGGAACGGG
CTTGCCAAAG
AAAATCCATG
AATTTAGI I
AAGCCGCTGA
TAGTGAAAAA
ACCCTCACGA
TATTCTTI'AA
AAACGGCCAA
AATATT1'AAC AAAATGr.CAT ATAAAGATrT
AGGAAGCCAA
AAGGTGCCTT
.TTAACACAGA
AATCAAAAGC
TCTCAAAAGA
AAGGAACAWA
ATGGCTIAAT
ATITTTTTA
AAkAGAAAGAA
TGTTGGGCAA
ATATGAACCG
CGGCTTGAAC
AAAAGTTGAG
AAGTGCCGGT
TAAATATGTA
CTATACAGAA
AGCTAAXTTT
TAAATATTTC
AAGTCAAGGDN
ACCTGTGTTA
AGTGAAACGA
AGCGCATACG
CGTAAATAA
EF008-2 (SEQ ID NO:26)
M(KFSLFFLT
KIELHSIVPI
KDYFSTSKNV
AKNYTEKLSK
PEQMTTIIDT
YSIOUNWNLTK
LLAGLTEJAAC
GTDPHEYEPL
TPQYLTSAGQ
LHEEAYJAKFA
IKKSKAPVLF
IHDGLMSK
GNQAAEKKEK
PEDIAKASEA
EQTEDPHAWL
DIPDDKKLLV
VETS VDKRSM LAIVTTNS IL SDLVKNVGQD DI LFFNGLNL ETGGNGWFNK DIENGIKYVE NIRDVLVEKD TSEGAFKYFS KAYDLNAAYI ERVSKEVKRP IYDTLFTDSL
LW(TAK<KVEN
PIQNKDFYTEN
WEINTESQGT
AKEGTEGDY
EFOOS-3 (SEQ ID NO:27) T TCGGGAATC AAGCCGCTGA AAAGAAAGAA
AAATTAGCAA
GACAAAATTG
TTACCAGAAG
TTAGAAACAG
AATAAAGATT
CAAGAACAAA
CAAAACATTC
AACGCGAAAA
GCTGATA ETC
TI'GTGACAAC
AGCTCCATAG
ACATTGCGAA
GCGGAAATGG
ACITTTCTAC
CAGAAGATCC
GTGACGTGTT
ATTATACCCA
CTGATGATAA
GAACTCGATC
TATMTGCCA
AGCTTCT'GAA
CTCGTTTAAC
AAGCAAAAAT
ACATGCTTG4G
AGTAGAAAAA
AAAACTI'AC
AAAATTATTA
CTATCTGATT TAGTGAAA ATTGGACAG ACCCTCACGA GCGGACATTT TATTCTTTAA
AAATTAA'IGA
GTTACGCCAC
TTAGACATTG
GATCCAAAAA.
AAACTACATG
GTTACAAGTG
AAACGGCCAA
AATATT-A.AC
AAAATGGCAT
ATAAAGATIT
AGGAAGCCAA
AAGGTGCCTT
TGTTGGCAA
ATATGAACCG
CGGCTTGAAC
AAAAGTTGAG
AAGTCCCGGT
TAAATATGTA
CTATACAGAA
AGcTAAATT
TAAATATTTC
86 TABLE 1. Nucleotide and Am~ino Acid Seqeuences of Efaecalis Genes.
TCCAAAGCTT
ACACCTGAAC
TTTGTTGAAA
CCAATTTACG
TACTACACCA
ATGATTTAAA
AAATGACCAC
CCAGTGTCGA
ATACACITTTT
TGATCAACTG
TGCCGCTTAT
GA'LTATTGAT
TAAACGTAGT
CACAGACTCT
GAAT'IAACA
ATTTGGGAAA TTAACACAGA ACCATTAAGA AATCAAAAGC ATGGAACGGG TCTCAAAAGA CTTGCCAAAG3 AAGGAACAGA AAAATCCATC ATGGCTTAAT
AAGTCAAGGN
ACCTrCTGTTA
AGTGAAACCA
AGGCGATACG
GAGTAAA
EFOOS-4 (SEQ ID N0:28) C GNQAAEKKE( LAIVTTNS IL SOLVKNVGQD KIELHSTVPI GTDPHEYEPL PEDIAKASEA DILFFNGLNL KDYFSTSi(NV TPQYLTSAGQ EQTEDPHAWL DIENGIKYVE AKNYTEKLSK LHEEAKAKFA DIPDDKKLLV TSEGAFKYFS PEQMTTIIDT IKKSKAPVLF VETSVDKRSM E-RVSX EVKRP YSMNWNLTK IHOGLMSK
ETGGNGWFNI(
NIRDVLVEXKD
KAYDLNAAYI
TYDTLFTDSL
LMKTAKKVEN
PKNKDFYTEN
WEINTESQOT
AKEGTEGDTY
EF009-1 (SEQ ID NO:29) 9 9* 9 9* 9 9 9 9 9*9 9 9 9 *9*9 9*9*
TGACAAATGA
GCAGGTTGTG
GTAGGAACAA
GATTIACAAT
GAAATTGA NT
GCAACGAAAT
AAATTGAAAG
ACGAATGGCG
GCGAAACAGC
ACTGAATTAG
AATAGCGGCA
CCTGTAAACG
GAGAATAAAC
ATTGCAGAAA
AAAAATT-TAG
CATCGGGGTC
AAAATGATGA
TGGTAGAATT
TAAATGCCTT
TAGTATCAAT
ATATCACGAA
GGCGGGCGTT
AACTACCGAC
ATGCTACGCA
TGGCTGTCGA
AAAA.AGCGAA
TTTATCAAAA
.CATCAAAAGG
TAAATTAATT
TGTGAAGGAT
ATGG.GAATCG
TACAGACTAT
TCAGCATCAA
TGGCAATACA
AXITAAAGAC
AATITTTATTA
TGTCAGTGAT
AACAGCGCGC
TGCTGGGTAT
ACCTTATGTG
AGTGTAGAA
CGCCAATGTT
GGACTTATTG
ACTAAGACAG
GTCAAAGACC
ACGCAACCAA
ATCT=rTAG
GTCAATGCAC
GGCGGAGAAA
CAAACTGCAG
ATACTGAAA.
rGc'TrrAcAAG
ACACCAGATA
AACATrCTCG
GAATATCAAC
CCAGCCTGGG
GGGTATTAGC
AAACCGTTAA
GTTTAAAA
ACGCAGCATT
ACAATTACAA
CATI'GGGAAT
TTGCTATTCC
GACTGATAAA
ATAA-ACGCCA
ATGTCGATGC
AAGATGCTAT
TGGCCCGAGA
A.AGAAGAAAC
AAACArIG
TTTTACGATT
ACTAGGGGTT
GAAAAATATT
AGCAGAAAAA
TAAAGAGCAT
TTACGCTAAT
TAATGACCCA
AGTAGATCCT
ATTGAAAATA
TTCAGTGATT
TFICTTAGA-A
AGAAGATCAA
GAAAAAGGTC
TAAAAAATAA
EF009-2 (SEQ ID MKKFSKL IG L IGVLAFTIA GCASGSVKDT KTETTKLGVV GTKNDEWdESV KDRLKKKNID LQLVEFTDYT QPNAALAEXE IDLNAFQI4QI FLDNYNKENG TKLVSIGNIV NAPLGIYANK LKDITKIKDG GEIAIPNDPT NGGRALILLQ TAGLIKVDPA KQQLPEVSDI TENKRQLKIT E-LDATQTARA LQDVDASVIN SGMAVDAGYT PDKDAIFLEP VNEKAKPYVN IVVAREEDQE NXLYQKVVEE YQQEETKXVI AETSKGANVP AWETFGKK EF009-3 (SEQ ID NO:31) TTGTG CATCGGGGTC TfGTGAAGGAT ACTAAGACAG AAACCGTTA.A ACTAGGGGT
GTAGGAACAA
GAT'ITACAAT
GAAATTGATT
GGAACGAAAT
AAATTGAAAG
ACGAATGGCG
GCGAAACAGC
ACTGAATTAG
AATAGCGGCA
CCTGTAAACG
AAAATGATGA
TGGTAGAATT
TAAATGCCTT
TAGTATCAAT
ATATCACGAA
GGCGG.GCGTT
AACTACCGAC
ATGCTACGCA
TGGCTGTCGA
AAAAAGCGAA
ATGGGAATCG
TACAGACTAT
TCAGCATCAA
TGGCAATACA
AATTAAAGAC
AATFI'TATTA
TGTCAGTGAT
AACAGCGCGC
TGCTGGGTAT
ACCTTATGTG
GTCAAAGACC
ACGCAACCAA
ATCTFI'TTAG
GTCAATGCAC
GGCGGAGAAA
CAAACTGCAG
ATTACTGAAA
GCTTTACAAG
ACACCAGATA
AACATTGTCG
GTTTGAAAAA
ACGCAGCATT
ACAATTACAA
CATTGGGAAT
TTGCTAMTCC
GACTGATAAA
ATAAACGCCA
ATGTCGATGC
AACA GCTAT
TGGCCCGAGA
GAAAAATATT
AGCAGAAAAA
TAAAGAGCAT
TTAcG~cTAAT
TAATYCACCCA
AGTAGATCCT
AI'GAAAATA
TTcAGTGATT
TTTCTTAGALA
AGAAGATCA
87 TABLE 1. Nucleotidle and Amino Acid Seqeuences of E. faecalis Genes.
GAGAATAAAC TTTATCAAAA AGTTGTAGAA CAATATCAAC A.AGAACAAAC GAAAAAGGTC ATTGCAGAAA CATCAAAAGG CGCCAATGTT CCAGCCTGGG AAACATTTGG TAAAAA EF009-4 (SEQ ID NO:32) CASGSVKDT KTETVKLGVV GTKNDEWESV KORLI(KKNID
LQLVEFTDYT
LKDITKIKDG
ELDATQTARA
NKLYQKVVEE
QPNAALAEKE
GEIAIPNDPT
LQDVDASVIN
YQQEETKI<Vl
IDLNAFQHQI
NGGRALI LLQ SCfrAVDAGYT
AETSKGANVP
FLDNYNKEHG
TAGLIKVDPAk PDKDAI FLETP
AWETFGKK
TKLVS IGNTV NAPLGIYANK KQQLPTIVSDI TENKRQLjKIT VN'EKAKPYVN IVVAREEDQE 00*0 00..
0 0 0 0 EFOIO-1 (SEQ ID NO:33) TGAAAGAATA AAATTGTACA CGTAGCTGGC GCACAGCT T GCAGCGTCAA AAGGTGATGA CCAGATAATT ATGACCAATT GCAAAATTAA AAATGGAATT GTTGCTTCTG GTGAAAGCTA AAACGCGCCT ATGCTCATTT CAATTGCCAG ATAACTATAT ATTTTAGGTA ACTCTTACGG TACAATTAG ATATTAGTAA GAATTCCNTA AAAANGANCC ACAGGTAATT ATGACTTCCC ACTGGCTCAC CAAAAATTAT GTCTTGCATC AATGGTATAA CCATATGACT TAALATACCAA TATGGTGATA ,CA.ATCTTAAC GAACCATTAA AAACAACAC AAAAACAAAG AAAAATCTGT AACGGACTTG TTTATGGTGA AAATrCGTTGA AAGATTACAC TTAATCAT= GGCCAGAAGA ATCGAAGAAG CAAAAGATC ACTGAAATCA CTAACGT1'QC ACTGTTGATC CAGAACAAAC
'GAGGAAATA
kGC'TGTAGGA rAGTACACCA k.ATCGATAAT rGTTGCTMG rGATATTTCA kACTGATTTA
TAAAGGAAAT
TCAACAAGTT
AGTCGATGGT
AAATATTGCT
TATTGGTAAC
TAACCAATAT
AGATGGCTI'G
TACTTGGTTT
ACAAGCTGCT
TCAAGCGCAA
TGAATTGTTA
AGAAGGCAAA
.ACCAACAACT
ATCTGTCACT
ACCAATTCTI'
TACAGTTATG
ACTTCCAAAA
GCAAACACAA
AGGAATGAAA
CTAAGCGCGT
ACGTTATTAA
GCGAATAAAA
GCGATTGCG
TTAGCACAAA
GCACCTAAAT
ACCATTAATC
TTAACTTTTA
AGTTATGAAA
GCTTI'TGCTA
CAATATCCAT
GCCGACAAAG
AITCCAACAG
ATGCGTCAAG
GGCAAACCAC
ATGGCTAACT
GGFFI'ATTAA
CAATATGAAA
CATTTGAGTG
GAAGAAATGG
GG TIrTACTT
AACCGTTACG
TTAATGGATC
TTAGACGAAT
AAATGGCAAA
GCGGTAAATC
TGTATCGTGT
TTATCGAGAA
ACCAAAAAAT
ATTAT'GCAAC
ATGCCAAAGA
GAAAACTGTA
ATAA.AGAATA
GTGCAACGGA
TCGGCCAAAC
TTCAGTAAA
ACATGATITAA
ATGCTCCTAC
A-AACACAAGG
TTGTTI'CTCG
ATGTTGTI'GC
ACAGCAATCC
AAGTTGGCGA
CTTGGAACAC
TTAA.AGAACG
T'TGTAAATGA
CAGCAAGCTT
ACCTAAAAAC
ATATCCAATC
AAGGATTAGC
TTCAAAAGAT
TGGGGACAAA
AAAAATTGGG
CTCAACAATC
GAATGCACAA
AGCCTATGAT
TGCGTTCCCA
TGTCGATAAA
AGI'CTAAAA
ATTCTTTGCA
AACAACTGAT
TA.ACTTAAA-A
AAGTACAACA
ACCTATGGAT
TCCACTAACA
AAACACGTCT
AGAATTGTTA
TGATCGTGTG
AGGAAACAAC
TGATAAGAC
TAAAGTGAAA
AAATACAGGA
AGCTGGCTGG
TCAAAAATAA
GATAAAGTTC AAAAAGAAAT EF010-2 (SEQ ID NO:34) MAKRISR SWRTALAVGL SACGKSSKDA ASKGDDSTPT LLMrYRVGDKP
ONYDQLIDNA
GAYADLTDLA
NLDI SKVDGS GSPKI INQYA
GDTILTQAAG
GLVYGEEGKQ
EEAKDSP ILG
KVQKEMQTQL
NIIIEKKIGA
PKYAI(EAYDQ
YESATEVLKE
DKDMINNLKV
KPLVSRPLTE
YEKVGDIJRVK
FTFVNDKVKT
DEYIQSQK
KLKMEFVGWG
LPDNYIKGNT
FXKXXPNIAA
LHQWYKDGLI
PLKTTrAQAQM LLf(DYTPTTH
EITUVATVMN
DWDQKM4STIV
INGKLYAFPI
FAIGQTFFAT
PTDAATSflrP
ANYVVANTSK
LSAWNTGNNL
RYAASLNTGT
ASGESYDISL
LGNSYGQQVL
GNYDFPIGNQ
YDLNTNTJiFM
NKEKSVELLG
I IWPEESVTE
VDPEETLPKL
AQNYATNAQK
TFNKEYVDKY
YPFAVKTTDT
RQETQGPMDy
LLNSNPELLN
EMVKERDKS I
MDDLKTAGWD
EF010-3 (SEQ ID 88 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. foecalis Genes.
CT GCGGTAAATC TTCAAA.AGAT
GCAGCGTCAA
CCAGATAATT
GCAAAATTAA
GTTGCTTCTO
AAAGCGCCCT
CA.ATTGCCAG
ATTTAGGTA
TACAATTTAG
GA.ATTCCNTA
ACAGGTAATT
ACTGGCTCAC
CTCTTGCATC
CCATATGACT
TATGGTGATA
GAACCATTAA
AAAAACAAAG
AACGGACTTG
AAATTGTTGA
TTAATCATT
ATCGAAGAAG
ACTGAAATCA
ACTGTTG'ATC
GATAAAGTTC
AAGGTGATGA
ATGACCAATT
AAATGGAATT
GTGAAAGCTA
ATGCTGATTT
ATAACTATAT
ACTCTTACGG
ATATTAGTAA
AAAANGANCC
ATGACTTCCC
CAA.AAAWIAT
AATGGTATAA
TAAATACCAA
CAATCTTAAC
AAACAACAGC
AAAAATCTGT
TTTATGGTGA
AAGATTACAC
GCCAGAAGA
CAAAAGATTC
CTAACGTTGC
CAGAAGAAAC
TAGTACACCA
AATCGATAAT
TGTrGGTTGG
TGATATTCA
AACTGATTA
TAAAGGAAAT
TCAACAAGTT
ACTCGATGGT
AAATATTCCT
TATTGGTAAC
TAACCAATAT
AGATGGCTTG
TACTTGTTI
ACAAGCTGCT
TCALAGCGCAA
TCAATTGTTA
AGAAGCCAAA
ACCAACAACT
ATCTCTCACT
ACCAATTCTT
TACAGTTATG
ACTTCCAAAA
ACGTTATTAA
GCGAATAAAA
GGCGATTGGG
ETAGCACAAA
GCACCTAAAT
ACGA=DATG
TTAACTTTTA
AG ITATOAAA
GCTTTTCTA
CAATATCCAT
GCCGACAAAG
ATTCCAACAG
ATGCGTCAAG
GGCAAACCAC
ATGGCTAACT
GGTTTATTAA
CAATATGAAA
CATTTGAGTG
GAAGAAATGG
GGTTTTACTT'
AACCGTTACG
TTAATGGATG
TTAGACGAAT
TGTATCGTGT
TIrATCGAGAA
ACCAAAAAAT
AITATGCAAC
ATGCCAAAGA
CAA.ACTGTA
ATAAAGAATA
GTGCAACGGA
TCGGCCAAAC
TTGCAGTAAA
ACATGATTAA
ATGCTtCTAC
AAACACAAGG
TTGTTTCTCG
ATGTTGTTGC
ACAGCAATCC
AAGTTGGCGA
CTTGGAACAC
TAALAGAACG
ITGTAAATGA
CAGCAAGCT
ACCTAAAAAC
ATATCCAATC
TGGGGACAAA
AAAAATTGGG
GTCAACAATC
GAATrCCACAA
AGCCTATGAT
TGCGTTCCCA
TGTCGATAAA
AGTTCTAAAA
ATTCTTTCCA
AACAACTGAT
TAACTTAAAA
AAGTACAACA
ACCTATGGAT
TCCACTA-ACA
AAACACGTCT
AGAATTGTI'A
TGATCGTGTG
AGGAAACAAC
TGATAAGAC
TAPIAGTGAAA
AAATACAGGA
AGCTGCCTCG
TCAAAA
a a a a a a a a.
AAAAAGAAAT GCAAACACAA EF010-4 (SEQ ID NO:36) CGKSSKDA ASKGDDSTPT LLMYRVGDKP
DNYDQLIDNA
GAYADLTDLA
NLDI SKVDGS GSPKI INQYA GDTI LTQAAG
GLVYGEEGKQ
EEAKOSPILG
NKIIEKKIGA
PKYAKEAYDQ
YESATEVLKE
DKOMINN~LKV
KPLVSRPLTE
YEXVGDDRVK
FTFVNDKVKT
KLKMEFVGWC
LPDNYIKGNT
FXKXXPN IAA
LHQWYKDGLI
PLKTTAQAQM
LLKDYTPTTH
EITNVATVMN
DWDQKMSTIV
INGKLYAFP I
FAIGQTFFAT
PTDAATSTTP
ANYVVANTSK
LSAWNTGNNL
RYAASLNTGT
ASGESYDISL
LGNSYGQQVL
GNYDFPIGNQ
YDLNTNTFM
NKEXSVELLG
I IWPEESVTE
VDPEETLPKL
AQNYATNAQI(
TFNKEYVDKY
YPFAVKTTDT
RQETQGPMDY
LLNSNPELLN
EMVKELRDKS I
MODLKTAGWD
KVQKEMQTQL DEYIQSQK EF011-1 (SEQ ID NO:37) TAACGT'I=rI
ATGGGACTGT
ACAGAAACAA
CCCAAAAATC
CTAGGT-GTCG
AAATACCAAA
CAACTAAAAC
AAAGCCATTG
AAACAAAATA
ATAACTGGCT
AATGCGCT'G
TTCGGTTTAA
CCAGGAAAAG
TAATGTTAAG
CAGCTAAAAC
CTAAGAAAGT
GTGACCGCGT
AAGTTrGAATC
CAGACTTAAT
CGCCAACCAT
TCGAAACGTT
TAGAAAAACA
TTCTGTTAGT
TTCATGATAC
AATCAAAAAG
TGCTTGTCAA
GGAAGTCACA
CGTIG ITTT1
GGTAGGTGCG
AGCAGGCGGC
TATrATTTCT
ITACTTAGCT
AGGCACTATT
AATrGCTGAC
TAACGAAGGA
ATTTGGCTTC
A;ATTFITAG
ACAAATAAA.A
GTCAAAGACA
GATAATGG Tr
CCAACTAAAA
ATTAAAGAAC
GGTCGTCAAC
GTAGATCCCA
TTTrGATAAAG
GTGAAAAAAC
CAACTTCCG
AAAGCAGCAG
CAATGATGGC
AAACAGCAGA
CCAATIGGTCA
CCTTGGATAC
ATATCCCTGC
CAGATITAGA
AAGAT1'ATCA
AAAATCCTTG
AAGAGGTAC
AAGCAGAAGC
CTTACGGAAA
ACGATAAGAT
AGTTI'CAATG
TTCTGCAACA
ATTAACCG1TT
AATGGATGCA
GTATTTGAAA
AAAAATCAAT
AGAACAATI'A
GGCATCAACG
TAAAGAAAAA
TAGCGCGAAT
ACCCTCTCGT
TGAAGCTTCC
89 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Gencs.
ACTCATCCGC
GTGGTAGATC
GAATTGATTC
GTTTGGTATC
AAAGGATTAG
AAAGTGTTTC TTACGAATAT G=TAGAAA AAAATCCTGG GATTCTCTTT GCACCAAAGC AATTGGTCGC CACGATTCA-A AAGATAACGT CGCTGCAAAC AAAAAACCGA TGCTGGTAAA AATGATAAAG TCATrATGCT TCAACCAGAT TAAGCGGTGG TGGATTAGAA TCAM'GCATT TGATGATAGA AGATGTTAAA
AGTAA
EF011-2 (SEQ ID NO:38) bMflKK FLAMMAVSMN GLLMLSACQT NKKTADSATT ETTAKTEVTV KDTNGQLTVP KNPKKWVVFD NGSLDTNDAL GVGDRVVGAP TKNIPAYLKK YQKVESAGGI KEPDLEKTNQ LKPDLIIISG RQQDYQEQLK AIAPTIYLAV DAKNPWASTK QNIETLGTIF OKEEVAXEKI TGLEKEIADV KKQAEASANN ALWVLVNEGQ LSAYGKGSRF GLIHDTFGF< AADDKIEAST HGQSVSYEYV LEKNPGILFV VDRTKAIGGD DSKDNVAANE T.IQKTDAGKN DKVIMLQPDV WYLSGGGLES MHLMIEDVKK GLE EF011-3 (SEQ ID NO:39) TTGTCAA ACAAATAAA.A AAACAGCAGA TTCTCCAACA
ACAGAAACAA
CCCAAAAATC
CTAGGTGTCG
AAATACCAAA
CAACTAAAAC
AAGCCATTG
AAACAAAATA
ATAACTGGCT
AATGCGCTTG
TTCGGTTTAA
ACTCATGGGC
GTGGTAGATC
GAATTGATTC
GTTGGTATC
CAGCTAAAAC
CTAAGAAAGT
GTGACCGCGT
AAGTCAATC
CAGACTTAAT
CGCCAACCAT
TCGAAACCTT
TAGAAAAAGA
TTGTGTTAGT
TTCATCATAC
AAAGTGTTTC
GCACCAAAGC
AAAAAACCGA
TAAGCGGTGG
G;GAAGTCACA
CGTTGTTTI
GGTAGGTGCG
AGCAGGCGGC
TATTATTTCT
TTACITAGCT
AGGCACTATT
AATTGCTGAC
TA-ACGAAGGA
ATTTGGCTTC
TTACGAATAT
AAT1TGGTGGC
TGCTGGTAAA
TGGATTAGAA
GTCAAAGACA
GATAATGGTT
CCAACTAAAA
ATTAAAGAAC
GGTCGTCAAC
GTAGATGCCA
TTGATAAAG
GTGAAAAAAC
CAACTTTCCG
AAAGCAGCAG
GTT'ITAGAAA
GACGATTCAA
AATGATAA.AG
TCAATGCATT
CCAATCGTCA
CCTTGGATAC
ATATCCCTGC
CAGATTTAGA
AAGATTATCA
AAAATCCTTG
AAGAGGTAC
,ACCAGAAGC
CTTACGGAAA
ACGATAAGAT
AAAATCCTCG
AAGATAACGT
TCATTATC.CT
TG-ATGATAGA
ATTAACCGTT
AATGGATGCA
GTATTTGAAA
AAAAATCAAT
AGAACAATTA
GGCATCAACG
TAAAGAAAAA
TAGCGCGAAT
AGGCTCTCGT
TGAAGCTTCC
GATTCTCTTT
CGCTGCAAAC
TCAACCAGAT
AGATGTTAAA
AAAGGATTAG AG EF011-4 (SEQ ID CQT NKKTADSATr EIAKTEVTVJ KDTNGQLTVP KNPKKVVVFD NGSLDTM4DAL GVGDRVVGAP TKNIPAYLKK LKPDLIIISG RQQDYQEQLK AIAPTIYLAV DAXNPWASTK TGLEKEIADV KKQAE-ASANN ALVVLVNEGQ LSAYCKGSRF HGQSVSYEYV LIEKNPG ILFV VDRTKAIGGD DSKDNVAANE WYLSGGGLES MHLMIEDVKK GLE
YQKVESAGGI
QOIETLGTIF
CLIH]DTFGFK
LIQKTDAGKN
KEPDLEKINQ
DKEEVAKEK I
AADDKIEAST
DKVIMLQDV
EF012-1 (SEQ ID NO:41)
TGAGGGGGCA
CTTTITAGCCG
GCAGCTGAAC
CAAACAACAG
TTTGATGATG
ACAACATGAA
CATGTGGCGG
AAAAAATCAG
ATAAAAATAC
ATAGTGCCAC
ATTGGGGAAA
AACCAAAGAA
TA ITAGTTCA C=rACAATC
GGTGCCAGCT
AAAGTAGTAG
GCGGCAGAGA
CCTGCACCAA
GCACAACATT
CTAGCTAAAG
GT=TTGC AACAGGGTITT AAGTAGA'ITC GGGAAATTTA.
TCTCAACATT GGATACAACA TATTTGA-AGG CCTTTATCGG A2'GTCA-AGAT TAGTGACGAT TABLE 1. Nucleotide and Amino Acid Seqeuences of E. [iaecalis Genes.
GGGCGCA.AGT
ACGGCCCAAG
AATGCCTATT
GTCGATGAAT
GCCCAACCTT
TGTCGAAG
CCATTTACGC
GAATACTATG
GATAATACTG
CALATATGTTC
TACTTCTTAG
GCGATTGGCC
AAACCCCTTA
TTCCGAGCTT
AC GAAAC C C
GATCAAGGAA
TTAGAAATTA
AATTATGAGT
AACTTATATO
CAATTC-GTAG
TACAAAGAAG
AGTGCCTCAA
ACCACI=AC
ATTTTGTTTA
TACTAGACAG
TAGGGATTTC
CCTTCTTAGC
CGCAAGGCAA
TAGCCAATTG
ATGCGGATCA
GGATTAACTT
AACAATATCA
ATI'CAACAA
AAGCAATTGA
ACGGATTGAT
ACAGTGGCGA
AAGCGGATGT
AACGAATGC
CCATTCATC
TGTCTCTTC
CAGGAGAATC
AAGAGGCACG
CGGAAGACAT
ATTATCTAAT
CTTGC-GGGAG
TTCTTGGAAA
TGTTAAAAAT
AGCCCCGAAT
ACTCGTTTCG
AGATTACGCC
G'OATGCGACT
AGTGAAACTG
ATATCAAGTG
AGATGATCCA
AAAAGAAGGA
TAAAGAAGCC
TCCAAGTAAA
ATATTTGAAA
CGGTAAAAAA
TGAATATGTT
GCAACCAAGT
AGGATGGATT
AAGTTACAAT
AACGATTAAT
CTTGTTGAAC
CAATCCTAAA
TGCCTATTTA
GGCGATTAAGT
AAACTGGTGA
AGTTTTGAAA
GACAAAGAAT
AT'GCTTGGT
TTGGATAGTG
TCAGATACTT
GAAGAAGTTG
AATGAACTAG
GGCTATGTCA
ACGCCATrAG
TTAACACAA.A
C ITATGCGA
AATGACGTCA
GTGAAA=T'
CAAAGTCAGT
AATAPATGTGA
GCCGGCAGTA
TACGGCAATT
GCCAATAATC
CAAGATGCTG
TTGAAAGGCA
ACAGAATGA
GGAGCAACGG
CACCAGCGAC
TACGCAACGG
TCATTGTTGA
TAGCGOCACA
AACA'ITTACT
GGACATTGAA
CGGTTAGCAC
ACTTAGTTCG
GTCATCCAGA
CGAATGTTCA
GTGTCTTAAA
ATCCAGAAAC
AAAAAGWTCA
CATTGCTGGC
TGCA.AGAAAA
ACCAATCGCG
GTGAATTIAGA
ATCATAATGC
CAGAGAkACA C CCAAG TAGCC
TTAGTTATCA
CGAGCCAATC
GATTGGACCC
TGAAAAGTCA
A'FrAAAACAG
AAATCAAAAA
TTATAGCGGG
AAAAAATCCA
AATCAAAGAA
CATTAACGGA
TGTGGCCAAC
TTTACGAAAA
CGATGGGTCA
GGATGAAGAT
AGCTGAATGG
GGCAGACACA
TCTGCCAGGT
ACGTGAAAAA
CTCTTACTTT
CAAATACGAC
G"LTTGCAGAA
GCTGTATCAA
CT'TGTATGGG
GATTAT=TCC ACrI'GCGCAA EF012-2 (SEQ ID NO:42) IIKLGKI< VVGLIATFL LAACGGTKEA AEKVDSGNLA AEQK I SISSP APT STLDTTQ
TTDKNTFTMA
AQDFVYSWKK
QPSFLAVVSI
Y-YDADQVKLE
FLDFNKKEGT
RAYSGEYLKN
EITISSQPSN
LVEEARTINA
YFHLRNAYLT
QHLFEGLYRP
LVTPATIGPN
AWLAPQNQKF
EVAVSTIKED
PLANVHLRKA
DVKKAQAEW'T
NVNQSRREKN
NNPEKQFAEY
DDDSATVPAL
AYLLDSVKNS
VEAQGKDYAL
NTGINLYQVN
IGQAIDKEAL
KAQADVGKKV
YELSLSGWIA
KEAEDILLNQ
AKDVK I SDDG
FEIRNGEXSV
DSEHLLYSGP
ELDLVRINGQ
TQSVLNDGSK
KLSLLAADTD
GSSELDSYFN
DAAQVPLYQS
RKYHFTLREG
DELG ISAPND
FTLANWDATS
YVQQYQDDPG
PLNGLIPSKL
QGKRIAEYVQ
LYAGES SYNY
ASNYLINPKL
IKWSNGEPIT
KEFIVELKQA
DTWTLKKNPE
YVSH PDVANY
YANPETDEDF
SQLQENLPGL
GNYHNAKYD)Q
KG ISYHLYGD EF012-3 (SEQ ID NO:43) ATGTGCCGG AACCAAAGAA GCGGCAGAGA AAGTAGATTC GGGAAATTTA
GCAGCTGAAC
CAAACAACAG
TI'TGATGATG
GGGCGCAAGT
ACGGCCCAAG
AATGCCTATT
GTCGATGAAT
GCCCAACCTI'
TTT~GTCGAAG
CCAITTACC
GAATACTATG
GATAATACTG
CAATATGTI'C
AAAAAATCAG
ATAAAAATAC
ATAGTGCCAC
ACCACTI'TAC
ATTTGTTTA
TACTAGACAG
TAGGGATTTC
CCTrCTTAGC
CGCAAGGCAA
TAGCCAATG
ATGCGGATCA
GGATTAACTT
AACAATATCA
TATTAGTTCA
CTTTACAATG
GGTGCCAGCT
CTTGCGGGAG
TTCTTGGAAA
TGTTAAAAAT
ACCCCGAAT
AGTCGITCG
AGATTACCCC
GGATGCGACT
AGTGAAACTG
ATATCAAGTG
AGATGATCCA
CCTGCACCAA
GCACAACATT
CTAGCTAAAG
GGGATTAAGT
AAACTGGTGA
AGTI'TTGAAA
GACAAAGAAT
ATTGC~rhGT FrGGATAGTG;
TCAGATACTT
GAAGAAGTG-
AATGAACTAC-
GGCTATGTCA
TCTCAACATT
TA'TMAAGG
ATGTCAAGAT
GGAGCAACGG
CACCAGCGAC
TACGCAACGG
TCM'I'GTTGA
TAGCGCCACA
AACATI'ACT
GGACATTGAA
CGGTTAGCAC
ACTTAGTTCG
GTCATCCAGA
GGATACAACA
CCTTTATCGG
TAGTGACGAT
CGAGCCAATC
GATTGGACCG
TGAAALAGTCA
ATTAAA.ACAG
AAATCAAAAA
TTATAGCGG
AAAAAATCCA
AATCAA.AGAA
CA'ITAACGGA
TGTGGCCAAC
TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
TACTTCI'AG
GCGATTGGCC
AAACCCCTTA
TTCCGAGC IT
ACGAAAGCCC
GATCAAGGAA
TTAGAAATI'A
AATTATGAGT
AACTTATATG
CAATTG-GTAG
TACAAAGAAG
ATTTCAACAA
AAGCAATTGA
ACGGATTGAT
ACAGTGGCGA
AAGCGGATGT
AACGAATTGC
CCATTTCATC
TGTCTCTTC
CAGGAGAATC
AAGAGGCACG
CGGAAGACAT
AAAAGAACGA
TAAAGAACCC
TCCA.AGTAAA
ATATTTGAAA
CGGTAAAAAA
TGAATATGTT
GCAACCAAGT
AGGATGGATT
AAGTTACAAT
AACGATTAAT
CTTGTTGAAC
CAATCCTAAA
TGCCTATTTA
ACGCCATTAG
TTAACACAAA
CTITATGCGA
AATGACGTCA
GTCAAACrT'
CAAAGTCACT
AATAATG~lA
GCCGGCAGTA
TACGGCALAIT
GCCAATAATC
CAAGATGCTC
TTGAAAGGCA
ACAGAA
CGAATGICA
G-TGTCTTAA-A
ATCCAGAAAC
AAAAAGCTCA
CAGCTGGC
TGCAAGAAAA
ACCAATCGCG
GTGA.ATTAGA
ATCATAATGC
CAGAGAAACA
CC CAAGTACC
TTAGTTATCA
=~ACGAAAA
CGATGGGTCA
GGATGAAGAT
AGCTGAATGG
GCCAGACACA
TCTGCCAGCT
ACGTGAAAAA
CTCTTACFT
CAAATACGAC
GTI'TGCAGAA
GCTGTATCAA
CTTGTATGGG
AGTGCCTCALA ATTATCTAAT GATTATTTCC ACTTrCGCAA EF012-4 (SEQ ID NO:44) CGGTKEA AEKVDSGNLA AEQKISISSP APISTLDTTQ
TTDKNTFTMA
AQDFXIYSWKK
QPSFLA'JVSI
YYDADQVKLE
FLDFNKKEGT
RAYSGEYLKN
EITISSQPSN
LVEZARTIN~A
YFHLRNAYLT
QHLFEGLYRF
LVTPATIGPN
AWLAPQNQKF
EVA VSTIKED
?LANVHLRKA
DVKKAQAEWT
NVNQSRREKN
NNPEKQFAEY
DDDSATVPAL
AYLLOSVKNS
VEAQGKDYAL
NTG INLYQVN
IGQAIDKEAL
KAQADVGKYV
YELSLSGWIA
KEAEDILLNQ
AKDVKISDDG
FEIRNGEKSV
DSEHiLLYSGP
ELOLVRINGQ
TQSVLNDGSK
KLSLLAADTD
GSSELDSYFN
DAAQVPLYQS
RKYHFTLREG
DELGISAPND
FTLANDATS
YVQQYQDDPG
PLNGLIPSKL
QGKR IAEYVQ
LYAGESSYNY
ASNYLINPKL
IKWSNGEP IT
KEFIVELKQA
DTWTLKNPE
YVSHPDVANY
YANPETDEDF
SQLQENLPGL
GNYHNAKYDQ
KGISYHLYGD
EF013-1 (SEQ ID
TAACGAAAAA
AGTCTAGCAG
GAAACAAAGA
TCTGTCACGG
CCTTCTGCAG
ATTACAATTA
CCTAATGCAA
TATGCGGATG
TTAATCGCA-A
GGACAAGAAG
TCTGGTAAAA
ATTTTAGCAA
GGCACAAAA.A
GATTTACCAA
TGAAAAAAAT
GATGTGGAAA
AAGCCACGCA
GTCAAAATTC
GA.ACTGATCA
CCAATG'rTGT
CCATTACAGC
CCAATGGCAA
CCGTTGATCA
CAGCGCTTC
CAGCACCGAA
GTTTI'ACTGC
ATCGCTTAGA
ATTAA
TGCTITGTTC
CAAAAAAACA
GAAATCCTCT
TAATGTGACA
AACGAATGAA
TTTCAATCCT
AACGGTAGTC
TTTTACAGTA
AGTATGTTAA
GCAAGCACAA
ACCCAACAAG
GGGGAAAATC
GTCCCTCA-AA
GAAAGAAATG
GGTGATGCTT
ATTAGTCCCA
CGTTCAGTGT
ATGATTCTAA
AAATGAAAAG
CGTCAGAAAA
ACCAAGCACC
AAATTAATGG
CTGCACAAGC
GAGCGGGAGC
AAAFI'GATAT
ATCCGAAACA
GTGCAGATGA
CAGCCAGTAA
TAGGGACACC
ATTGTCTTTA
GCCA-AAGCAA
TAGTCATTCG
TGCCACGCAG
TGATACAAAC
TACTACATTA
AGGTGTITT
GACTACTCAA
TCCAAGTTCA
AGCGACAATT
TGCGCGGGTG
T'ITAGT'C C C
TTACTTGTIT
ACGGAATAGT GCACCTGTCC TTTTAGCAAT ATTACCAT1'C TGCAACTATT TTAGTGTCAC GGATGCCCAA GGGAAITTCA TGTTACGFTA AATGGAGAAA EF013-2 (SEQ ID NQ:46) MKKIALFS MLTFSVLSLS LAGCGNKKTA STNDSKPKQE TKKATQKSSS QQEMKSSHSS VTGQNSNVTG ENPSENATQP SAGTDETNEV PQNQAPDTNI TITNWVFNPE RNEINGTTLP NATITATVVG DASAQAGVFY ADANGNFTVI SPRAGATTQL IATVDQRNSA PVQIDIPSSG QEAALSFSNI TIDPKQGTIS GKTAPNATIL VSRADDARVI LASFTADAQG NFTASNLVPG- TKNRLDVTLN GEIGTPYLFO LPN TABLE 1. Nucleotide and Amino Acid Seqeuences; of E. faecalis Genes.
EF013-3 (SEQ ID NO:47) ATGTGGAAA CAAAAAAACA GCAAGCACAA ATGATTCTAA GCCAAAGCAA.
GAAACAAAGA
TCTGTCACGG
CCTrCTGCAG
ATTACAATTA
CCTAATrCAA
TATGCGGATG
TTAATCGCAA
GGACAAGAAG
TCTGGTAAAA
AT'ITIAGCAA
GGCACAAAAA
AAGCCACGCA
GTCAAAATTC
GAACTGATGA
CCAATGT'rGT
CCATTACAGC
CCA.ATGGCAA
CCGTTGATCA
CAGCGCT~rC
CAGCACCGAA
GTTTTACTGC
ATCGCTTAGA
GAAATCCTCT
TAATGTGACA
AACGAATGAA
TTTCXATCCT
AACGGTAGTC
TTTTACAGTA
ACGGAATAGT
TTTTAGCAAT
TGCAACTATT
GGATGCCCAA
TGTTACGTA
AGCCA.ACAAG
GGGGAA.A-ATC
GTCCCTCAAA
GAAAGAAATG
GGTGATGCCI
ATTAGTCCCA
GCACCTGT'CC
ATTACGATIG
TTAGTGTCAC
GGGAATTTCA
AATGGAGAAA
AAATGAAAAG
CGTCAGAAA-A
ACCAAGCACC
AAATTAATGG
CTGC-ACAAC
GAGCGGGAGC
AAATTGATAT
ATCCGAAACA
GTGC AGATGA CAGC CAGTAA TAGGdACACC
TAGTCAI'TCG
TGCCACGCAG
TGATACAAAC
TACTACATTA
AGGTGTTITT
GACTACTCAA
TCCAAGTTCA
AGGGACAATT
TGCGCGGGTG
?I'TAGTTCCC
TTACTTGTTT
GA'I"I'ACCAA AT EF013-4 (SEQ ID NO:48) CGNKKTA STNDSKPKQE TKKATQKSSS QQENXSSHSS VTGQNSNVTC ENPSENATQP SACTDETNEV PQNQAPDTNI TITNVVFNPE RNEINGTTLP NATITATVVG DASAQAGVFY ADANGNFTVI SPRAGATTQL IATVDQRNSA PVQIDIPSSG QEAALSFSNI TIDPKQGTIS GKTAPNATIL VSRADDARVI LASFTADAQG NFTASNLVPG TKNRLDVTWN GEIGTPYLFD LPN EF014-1 (SEQ ID NO:49)
TGATG--TGGA
AGCTTAATTA
GTAGCGGGTA
TATAATGAAG
ATI'GAACGGA
AAGACGAAGA
AAAATTACC
TCTGTTAACA
TTTI'ATCTT
GCGTTTGATG
CATI'TAGATG
ATGCGTGGAT
CAATCAATCA
GTGGATTCCA
AAACAACAGC
CTATACCCAG
AAGCCAGATG
CAAAGTGATT
GGCAATACGT
ACGGAAAATG
GACTTTTITAA
GTTTAGTCAT
GCTATTTAAA
CGCAAAATAA
AATTAGGCAC
AAATAACCTA
AAGGGATGCA
CAGTI'GAGAA
TTATTAAGTT
GTGTCACCAA
GTACGAAAGC
TCCGACAACA
TCAAA-ATAAT
ATGAATTGAC
T ITCTTTTGA
ATAGTATTGA
AACGAGATCA
ATACCGTTCA
ATATTGGTGT
GCTI'TATAAA
GAGAGAGGAA
CATTTTGGTT
GAAXACAATT
AGATAGTCAA
AACTAGGACT
TTTAAGT'TTG
GCGAATG-AA
ATTATTGAAT
AATTGATGCG
AGAC'-GGCCA
TTTATCTTAT
AGAAATTATT
GGACATTATT
TCATTTAGTC
CTGGCGCACT
AAATGTCCCT
AGACGGCTAT
AGATGAAGCA
TCCTGGTAAT
ATAA
GTACAGCCAA
TTTGTCACAG
GATAAAGGCT
TCGTTTTTGA
GATGCTATGA
CCACGGGATA
CCCGCCTATA
ATTCCAATCA
GTTGGCGGCA
GGATCCATTC
GCCCGTGAAA
CAAGCAGTTG
GATTCGTTAA
AAAGAAGGTT
r=AGTAATG CATCAATTAc
GTCTTCCATA
GCTGAGGAAA
ACACAGACCG
TGAC-TAGGAA
TCGGCTCAGC
ATGTTCCCAT
TTATGGGGCT
TGGTGA'ITAC
GTTTTGTTCA
CCTACGATGG
ATCATI'ACGT
TAGATGTCAA
ATTTTATGC
GACATAGCGA
AAGACAAGTT
ATGGAAACAT
TGACTTCGAC
AAGGGCGCAG
GTGTGTCTTT
CGAACGGTGA
ACGAAATGAC
GCCCGTTGCC
GCGAAAAATC
ATACTTTGCT
AAAAAATGAT
AGACAATACA
CGTGAATAAC
AATTGATGCG
ACCAACACCT
TGTGTTTAAC
TGTCAAGCAG
ACGGAAACAG
TAACGATATT
GAAATCTGGT
TCAAACTGAT
CAATTATGAT
TATGGTTGAA
AAATTTAGAA
ATTTTTATAT
TTCCATCA.AC
ATCAGTTAAA
EF014-2 (SEQ ID MSRKRKIS LISLVIILVF VTVGSAYFAV ACGSYLKKTID KGYVPIKNDY NE-AQNXDSQS FLIMGLDNTI ERKLG~TTRTD ANMVITVNNX TKKITYLSLP NYQGMQRIEA AYTYDCPTAS vNTVEKLLNI PINHYVVFNF LSFIKLIDAV FDGVTKDGPG SIHFDAGKQH IJDGTKALSYA RERHSDNDIM RGFRQQEIIQ S IMKIMDI ID SLNGNIQTDV DSNELTHLVK EGLTWTNYDK QQLSFDWRTF RDSFVQ IDAK
GGIDVNVKQA
AVEDKLKSGQ
SNEGRSMVEL
TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
YPDSIENVRH QLRVSLNLEK PDERDQDGYV FHTNGEFUYQ SOYTVQDTEAA EENEI'TSING NTYIGVPGNT QTGPLPSVKT
ENGFIK
EF014-3 (SEQ ID NO:51)
TGCT
GTAGCGGGTA
TATAATGAAG
ATTGAACGGA
AAGACGAAGA
AAAAkATTACC
TCTCTTAACA
TTTTTATCT
GCGTTTGATG
CATTTAGATG
ATGCGTGGAT
CAATCAATCA
GTGGATTCCA
AAACAACAGC
CTATACCCAG
AAGCCAGATG
CAAAGTGATT
GGCAATACGT
GCTATTTAAA
CGCAAAATAA
AA'ITAGCAC
AAATAACCTA
AAGGGATGCA
CAGTTGAGAA
TTATTAAGTT
GTGTCACCAA
GTACGAAAG-C
TCCGACAACA
TGAAAATAAT
ATGAATTGAC
TTTCTTTTGA
ATAGTATTGA
AACGAGATCA
ATACCG ITCA
ATATTGGTGT
GAAAACAATI' GATAAAGGCT AGATAGTCAA TCCTT'rTTA AACTAGGACT GATG TATGA TTTAAGTTTG CCACGGGATA GCGAATTGAA GCCGCCTATA ATTATTGAAT
ATTCCGAATCA
AATTGATGCG GTTGGCGGCA AGACGOCCA
GGATCCATTC
TTTATCTTAT GCCCGTGAA.A AGAAATTATT CAAGCAGTTG GGACAFTATT CATTCGTTAA TCATTTAGTC AAAGAAGGTT CTGGCGCACT
TTTAGTAATG
AAATGTCCGT CATCAATrAC AGACGGCTAT GTCTTCCATA AGATGAAGCA G-CTGAGGAAA TCCTGGTAAT ACACAGACCG
ATGTTCCCAT
7 kTGGGGCT
TGGT'GATTAC
GTr'rrTTCA
CCTACGATGG
ATCATTACGT
TAGATGTC-AA
ATTT'r1'ATGC
GACATAGCGA
AAGACAAGTT
ATGGAA.ACAT
TGACTTGGAC
AAGGGCGCAG
GTGTGTCTITT
CGAACGGTGA
ACGAAATGAC
GCCCGTTGCC
AAAAAATGAT
AGACAATACA
CGTGAATAAC
AATTGATGCG
ACCAACAGCT
TGTGTTA-AC
TGTCAAGCAG
AGGGAAACAG
TAACGATAT
GAAATCTGGT
TCAAACTGAT
CAATTATGAT
TATGGTTGAA
AAATTTAGAA
ATTTTTATAT
TTC-CATCAAC
ATCAGTTAAA
a. a.
a a a a.
a ACGGAAAATG GCTTTATAAA A EF014-4 (SEQ ID NO:S2) AV AGSYLKKTID KGYVPIKNDY NEAQNKDSQS FLIMCLDNTI
ERKLGTTRTD
NYQGMQRIEA AYTYDGPTAS VNTVEKLLNI FDGVTKDGPG SIHFDAGKQH LDGT-KALSYA SIMKIMDIID SLNGNIQTDV DSNELTHLVK YPDSIENVRH QLRVSLNLEK PDERDQDGYV NTYIGVPGNT QTGPLPSVKT ENGFIK
A&DVITNNK
P INMYVVFNF
RERHSDNDIM
EGLTWTNYIDK
FHTNGEFLYQ
TKK ITYLSLP LSF IKLIDAV RGFRQQE 10I
QQLSFDWRTF
SDYTVQDEAA
REDSFVQ IDAK
GGIDXTNVKQA
AVEDKLKSGQ
SNEGRSMVEL
EENTEMTSING
EF015-1. (SEQ ID NO:53)
TAATTAAAAA
TTAATAGCTA
AAAAGTTTTG
ATAAATTTAG
AATACTATAG
ATTGCGGTGA
AGATTCATT
TGTGTAAAAA
TCTCT'lITTC
ACGACACTAA
ATGA'ITCAAA
AAGITAAAAA
AAACAGGATG
TAGAAATTGT
GGGTCTGATG
TCCATAT=I
ATATGCTGTG
AGAATTTATC
TGGGCACGTT
GATATCAGAA
TCAACA.ATAT
AAAAAAGGAG
A'FFTTL'TTC
GTCAAGATAG
AAAACATATT
CGTGTAAAAA
CCAGGGCNAA
TCTAAGGATT
ACATA.ATAGT
ACAATAATCC
ATGGGAAAGA
ATCCATCAAA
AAGATAATAG
CTAGTATCTC
ATTATAT'ITA
TATTATCTTT
ATI'TAACTCC
GATTrGAGCGT AGGGcAATAT
TCCAGATCAA
TATTcCTCAC
CTAA
EF015-2 (SEQ ID NO:54) MK KGDIIVIIFL IAISFSPYF1 FFHNNPFNSK SFDDTKYAVV KIDGKEIERI NLDDSKEFIK TYYPSKGQYN TIEVKNGHVR VKKONSPDQI AVKTC;WISEP
GXTSICIPHR
FILEIVOQYS KDYYIY EF015-3 (SEQ ID TABLE 1. Nucieotide and Amino Acid Seqeuences of E.f[aecalis Genes.
CAATAATCC ATTTAACTCC AAAAGTTTG ACGACACTAA ATAAATTTAG ATGATTCAAA AATACTATAG AAGTTAAAA-A ATTCCGGTGA AAACAGGATG AGATTCATTT TAGAAATTGT
ATATGCTGTG
AGAATTTATC
TGGGCACGTT
GATATCAGAA
TCAACAATAT
GTCAAGATAG
AAAACATATT
CGTGTAAAAA
CCAGGGCNAA
TCTAAGGATT
ATCGAAAGA
ATCCATCAAA
AAGATAATAG
CTAGTATCTG
ATTATATTTA
GATTGAGCGT
AGGGCAATAT
TCCAGATCAA
TATTCCTCAC
EFO1S-4 (SEQ ID NO:56) NNPFNSK SFDDT-KYAVV KIDGKEIERI NLDDSKEFIK TYYPSXGQYN TIEVKNGHVR FILEIVQQYS KDYYIY VKKDNSPDQI AV-KTGWISEP GXTSICIPHR EF016-1 (SEQ ID NO:57)
TGACCOTTGC
TTGCTGGTTA
GAAGATCAAT
TTTGTGCCCA
GCCAA.AGCGG
ATGAAAGAAA
ACGAGCGAGC
CTTGTTTCTT
GGGGTTCAAA
AAATTTGTTA
TTAAAATCTG
TCCCACGAAG
TTTGCTGTGG
GAAACGI'AC
GTTACAAATA
TCTGTTGTAG
GGACACGGAT
TGGG ITTTCA
FTTTAAACT
CAGAATTACA
GGGCCGAAAA
TAAAAGAAAA
ACGGCTCTTC
AAGACCAAAC
CTCGAATTGA
ATAA TI'AAA
GCGTCCGCAA
GAAAAGATG
ACACAPI.AAT
TTACTCCTA
TAACGAAGAA
AGATAAATCA
TTATGGCATT
AAATCAAACC
AGTTCAATTC
AAACATTGCA
TGGCTATGAT
ACCTA'TrA
CGGACTCCTA
AAACTATACT
ATCAGACAAT
CACCCTTAGT
AAACTAA
CCCCGTCCAA TAGAAAGGAG
TTTATGATGA
TTGCAGGTT
AAACGGATTA
GGCAAAATTG
TCCGITGACT
ATTGATCTTA
ACACAACCTT
ACAGCGAGCG
GGCTTCGAAA
TATGACGGCT
ATCGATCGCG
ATTTCTCATG
CAATTAGTCC
AAAATTTCTC
AAAAGAAATA
GTGGTAAAAG
TTATTGCTT
TCGGCITTGA
TCCAACCGAT
=TGGAACGG
ACATGACGAA
ACATGCALAGG
GTCAGCCTGA
TTAATGAAGC
TrrACCCCAA
TAGGC-TATGA
AAAAAATCAA
AAAAATGG;TT
TTCTTTAGCC
AAAAAGCA.AC
AGATGACTCC
TGTCGACTTA
TGAITTGTCT
CTACACTAAA
CGACCAAGTA
CAAAATTTTA
CGTTTTGAAA
TTTCTTAGAT
CTACTATCTT
CAATGAAGAT
TACTGCCTT
TGGAGAGGAC
EF016-2 (SEQ 10 NO:58) M1VflKKYSLAL
VPMGFQDKSG
SERAEKVQFT
FVKDQTPILY
AVG VRKSDNQ
LVICCSLLLF
KIVGFDVDLA
QPYMTNDQVL
DGFNEAFLDL
LVQKINTAFE
AGCGKRXSNE
KAVFKLYGIS
VSL
1
XEKNIAT
KSGRI DGLL I
TLRKDGTLSK
DQWTRINEEK
VDFQPIDWSM
ASDMQGKILG
ORVYANYYLS
ISQKWFGEDV
RIIIGLDDSF
KETELQNQTI DLIWNGYTKT VQNGSSGYDG FESQPDVLKK HEDNLKNYTI
SHVGYDNEDF
TNNTKIN
EF016-3 (SEQ ID NO:59)
AAGCAAC
GAAGATCAAT
TTGTGCCCA
GCCAAAGCGG
ATGAAAGAAA
ACGAGCGAGC
CTrGTTTCTT
GGGGTTCAAA
AAATrITGTTA
TTAAAATCTG
GGACACGGAT
TGGGTI'TTCA
IT'TIAAACT
CAGAATTACA
GGGCCGAAAA
TAAAAGAAAA
ACGGCTCTTC
AAGACCAAAC
GTCGAMTTGA
TAACGAAGAA
AGATAAATCA
?TATGGCATTr
AAATCAAACC
AGTTCAATTC
AAACATTGCA
TGGCTATGAT
ACCTAITTA
CGGACTCCTA
AAACGGATTA
GGCAAAA'FTG
TCCGTTGACT
ATTGATCTI'A
ACACAACCTT
ACAGCGAGCG
GGCTCGAAA
TATGACGGCT
ATCGATCGCG
TTATTGGCT
TCGGCTTTGA
TCCAACCGAT
TTTGGAACGG^
ACATGACGAA
ACATGCAAGG
GTCAGCCTGA
rTrA.ATGAAGC
=I~ACGCCAA
ACATGACTCC
TGTCGAC ITA
TG;ATTGGTCT
CTACACTAAA
ccGACCAAGTA
CAAAATTTTA
CGTTTGAAA
'FTTCTTAGAT
CTACTATC IT TABLE 1. Nucleofide and Amino Acid Sequences of E.jhecalis Genes.
TCCCACGAAG
T=TCTGTGG
GAAACGTTAC
GTTACAAATA
ATAATTTAAA AAACTATACT ATTTCTCATG GCGTCCGCAA ATCAGACAAT CAAT'TAGTCC: GAAAAGATGG CACC-CTTAGT AAAATTTCTC ACACAAAAAT AAAC TAGGCTATGA CAATGAAGAT AAAAAXTCAA TACTCCCTTT AAAAATGCTT TOGAGAGGAC EF01.6-4 (SEQ ID SNE DQWTRINEEK RIIIGLDDSF VPMGFQDKSG KIVGFDVDLA KAVFKLYGIS SERAEKVQFT QPYMTNDQVL VSLKEKNIAT FVKDQTPILY DGFNE-AFLDL KSGRIDGLLI AVGVRKSDNQ LVQKINTAPE TLRKDGTIJSK
VDFQPIDWSM
ASDMQGKILG
DRVYANYYLS
fSQKWFGEIDV KETELQNQTI DLIWNGYTKT VQNGSSGYDG FESQPDVLXK HEDNLKNYTI SHVGYDNEDF
TNNTKIN
EFC).7-1 (SEQ ID NO:61)
TGAGGTGTTI'
CTACTTCTCT
TTACGGGTCG
ACCGCAGCAG
GA.AGCGAAAC
ACTTTTACGA
TTGAATACT
TTTGA.AGGGT
GGGGTAACAG
TTCAACAAT
ACGGGCAAAA
GAAGGTTGGG
CAAGCGAATG
AAAAATCTTT
CAAGAACAAG
TTAA.ATACGC
TTGAATT'CTG
TTCGTGCCAA
GATTTAAATC
TTAGGAA~rG
ATCACTGACT
TCACCAGTTC
GTTrGGTGGCT
AAAAATITCCA
GCAAACGTAA
CAATTGGTTG
GTGGCCGATT
GTCTCTATCG
TTATGAAPLAG GGCAACAAAG CGGGCTGTOGG ALAGTGTTGGG GGATTGATTC GGAATTATCA ATG-TAATGAG CCAAGTAGGG CAGCATTGGQ AACTGAAAAA TTCGAAAAGA TGCAAAATGG CTTGGAAGCG CACAGTGGAC TAAA.AAATTA TCGTGCTATT CCATTGATGA CCATACCI'G TATTGGCGGT ACCAGCTI ACTATCCTAC ATCAGCTGAG ATGGCACGAA TAATACTTGG T'TTCGCTAGA TAAGGTGGAT TCGAAGGGAA AGAATTAGAT GCAATGCAGC TTTGAAAATT AAAAAGATCT ITTGGCAAAT AGCGTTTAGC TAAAALATGTT CAGGTTTCAC TAATCAAGAA CTAGTGAACC AGAAAAAGCG AAAAAGCGGA GCTAACGATT ATGTTCAAGG AGCTTTAGCA CTTTTAATAA TCGTTTAGAA GGACGCCAGT ATATGCTGAT ATAATITI'GG TAAATGGTCT CTTATGCAAA TAAATATGAA CGGAAGAAGC CCCCCTAGT CTG-TCCAAAA TTTAGTCTAT GCGACAAGTA A
'AAAGGCTGT
A.AAGAAACCA
ACGGCAGACG
.AGGGACTTT
GTACAGCCCT
AGTAACGGCG
CCAAAAACAG
G'ITGACGGTA
GAAGTAGAGC
TATCCTLrAA
TCAACACTTT
TCCTATGTGA
GTCCAAGTAG
GTIGTAAAAA
CGTGAAAITC
AAGAATGCAC
'ITAAATGATG
ACGCAAAAAG
AAAGAG'ITAT
TTAAGT1TCGG GATAATrrAG
AAAAGTCGCA
CCAATCGATT
AATAAGACCT
GAACG~TTGGA
CCTCTTTATC
GGTCCA'ITAG
:TTTGGCAGC
k.AAAGCAAGA
TGTCGTIGGC
TCTCCTrGA
CCAATGATGG
AGCCAATCAC
CTTCCCCGCA
GCAAATCTAA
TAAGCTATCC
ATGAAGCAT
ACAATGGCGC
AGAATAAAAA
TTAAAGAAGT
=ICTGGAGA
CTGGAACGTA
GTCGAGCAAT
GCTCAAAAAA
ATTTTGCAGA
GGCAA-ACGGC
ATACAGAAAA
AAAXT=AAC
GCGGAGAMT
TCTTAAACTT
TTGATCAGT
AAACATTACA
AATTrAACAGA
GTTCAGGCTA
AATCATGGTT
ACAACAGGTA
AALTGrGATAAT
CGAAAAAGGA
TTAAGCTAT
AGCAAATGAT
AGCGTATTAC
AGAAGAGTTA
TATGAGTTAT
TGTCGAAAAA
CTTCACATTA
TATTGGGAT
CAATACTGGG
AATTGTTGCA
'TTATATCCAA
AGCATTATCA
AGCACTTGGC
GGAATTAGGA
TAAAAAAGAA
TGCTAAAAAA
AGTCAATGTT
CGACATTGTG
ACTGCAATCA
GCTTCAAGAA
AAAAGCGGAT
AGCACGCTTA
TTACAAATCA
EF017-2 (SEQ ID NO:62) bflrRATKQ RLSLAAIMVL LLSGCGSVGK ETKKQE-QQVL RVGIDSELST ADVSLANDNT AADVMSQVGE GLFSFDEKGE AKPALATEKV QPSNDGLSYT FTIRKDAKWS NGEPITANDF EYSWKRTVDP KTASPQAYYF EGLKNYRAIV DGSKSI(EFELG VTAIDD{TrE IIELSYPMSYF QQLLAVPAFY PLNEAFVEKT GKNTYGTSAES TLYNGAFMIE: GWDGTIN 1WS YVKNKNYWDQ ANVSLDKVDXJ QVVKEVNTGK NLFE.GKELDV VKISGEIVAQ EQGNAALXIR EIPGTYYIQL NTQKDLLANK NARRAIAIJSL NSERLAKNVL NDGSXI(ALG-F VPTGFTNQET QKDFAEELGD TABLE 1. Nucleotide and Amino Acid Seqeuences; of E.faecalis Genes.
LNPSEPEKAX ELWQTAXKEL GIEKAELTIL SSDTENAKXI SEYVQGALAD NLENLTVNVS PVPFNNRLEK SRSGDFDIVV GGWTPVYADP IDPLNLLQSX NSNNFC: WSN KTFDQLLQEA NVTYANKYEE RWKTLQKADQ LVAEE-APLVP LYQLTEARLV AE)SVQNAJYG PLGSGYYKSV
SIGOK
ET017-3 (SEQ ID NO:63) CTGTGG AAGTGTTGGG AAAGAACCA AAAAGCAACA ACAACAGGTA TTACGGGTCG GGA'ITGATTC GGAATI'ATCA ACGGCAGACG TGTCGTTGGC
C
C
C C
ACCGCAGCAG
GAAGCGAAAC
ACTTTTACGA
TTTGAATACT
TTrGA.AGGGT
GGGGTAACAG
TI'TCAACAAT
ACGGGCAAAA
GAAGGTTGG
CAAGCGAATG
AAAAATCTTT
CAAGAACAAG
TTAAATACGC
TTGAATTCTG
TTCGTGCCA
GATTTAAATC
TTAGGAATTG
ATCAGTGAGT
TCACCAGTTC
GTTGGTGGCT
AAAAATTCCA
GCAAACGTAA
CAATTGGTTG
GTGGCCGATT
GTCTCTATCG
ATGTAATGAG
CAGCATTGGC
TTCGAA-AAGA
CTTGGAAGCG
TAAAAAATTA
CCATTCATGA
TATTGGCGGT
ACTATGGTAC
ATGGCACGAA
TTCGCTAGA
TCGAAGGGAA
GCAATGCAGC
AAAAAGATCT
AGCGTTTAGC
CAGGTTTCAC
CTAGTGAACC
AAAAAGCGGA
ATGTCAAGG
CTTTTAATAA
GGACGCCAGT
ATAATTTTGG
CTTATOCAA.A
CGGAAGAAGC
CTGTCCAAAA
GCGACAAG
CCAAGTAGGG
AACTGAAAAA
TGCAAAATGG
CACAGTGGAC
TCGTGCTATT
CCATACC'ITG
ACCAGCTITT
ATCAGCTGAG
TAATACTTGG
TAAGGTGGAT
AGAATTAGAT
TTGAAA.ATT
TTTGGCAAAT
TAAAAATGTT
TAATCAAGAA
AGAAAAAGCG
GCTAACGATT
AGCTTTAGCA
TCGTTI'AGAA
ATATGCTGAT
TAAATGGTCT
TAAATATCAA
CCCCCTAGTT
TTTAGTCTAT
GAGGGACTTT
GTACAGCCCT
A-GTAACGGCG
CCAAAAACAG
G ITGACGGTA
CAAGTAGAGC
TATCCTTTAA
TCAACACTTT
TCCTATCTCA
GTCCAAGTAG
GTTGTAAAAA
CGTGAA.ATTC
AAGAATGCAC
TTAAATGATG
ACGCAAAAAG
AAAGAGTTAT
TTAAGTTCGG
CATAATTTAG
AAAAGTCGCA
CCAATCGATT
AATAAGACCT
GAACGTTGGA
CCTCITIATC
GGTCCATTAG
TCTCCTTTGA
CCAATGATGG
ACCCAATCAC
CTTCCCCGCA
GCAAATCTAA
TAAGCTATCC
ATGAAGCATT
ACAATGGCGC
AGAATAAAAA
TTAAAGAACT
TTTCTGGAGA
CTGGAAC:GTA
GTCGAGCAAT
GCTCAAAAA
ATTTTGCAGA
GGCAAACCC
ATACAGAAAA
A-AAATTTAAC
GCGGAGATTT
TCIrA.AACTT
'I'IGATCAGTT
AAACATTACA
AATTAACAGA
GTTCAGGCTA
AATGGATAAT
CGAAAAAGGA
TTTAAGCTAT
AGCAAATGAT
AGCGTATTAC
AGAAGAGTTA
TATCAGTTAT
TGTCGAAAAA
CT1'CACATTA
'TATTGGGAT
CAATACTGGG
AATTGTTGCA
TTATATCCA.A
AGCATTATCA
AGCACTTGGC
GGAATTAGGA
TAAAAAAGAA
TGCTAAAAAA
AGTCAATGTT
CGACATTGTG
ACTGCAATCA
GCTTCAAGAA
AAA.AGCGC3AT
AGCACGCTTA
'rrACAAATCA EF017-4 (SEQ ID NO:64).
CGSVGK ETKKQEQQVL RVGIDSELST ADVSLAMDNT
AADVMSQVGE
EYSWKRTVDP
QQLL.AVPAFY
ANVSLDKVDV
NTQKDLLANK
LNPSEPEYAK
PVPFNNRLEK
NVTYANKYEE
SIGDK
GLFSFDEKGE
KTASPQAYYF
PLNEAFVEKT
QVVKEVNTGK
NARRAIALSL
ELWQTAKKEL
SRSGDFDIVV
RWKTLQKADQ
AKPALATEKV
EGLKNYRAIV
GKNY-GTSAE.S
NLFEGKELDV
NSERLAKNVL
GIEKAELTIL
GGWTPVYADP
LVAEE-APLVP
QPSNDGLSYT
DGSKSKEELG
TLYNGAFTLE
VKISGEIVAQ
NDGSKXALGF
SSDTENAKKI
IDFLNLLQSK
LYQLTEA-RLV
FTI RKDAI(WS VTAI DDHTLE
GWIDGTNNTWS
EQGNAALKIR
VPTGFTNQET
SEYVQGALAD
NSNNFGKWSN
ADSVQNLVYG
NGEP ITANDF
VELSYPMSYF
YVKNKNY'WDQ
EIPGTYYIQL
QKDFAEELGD
NLENLTVNVS
KTFDQLLQEA
PLGSGYYKSV
EF018-1 (SEQ ID TGTCATTACA ACGATACCAA TP1'TAATCAT TTATCCATTA CTACAAAAAC ACTTATCGG CGGTATGATG GCCGGTGCAG TAAAAGAATA AAGAAAGTAG GGA.ACAATAT GAAAAAAGTT TABLE 1. Nucleotide and Amino Acid Seqeucnccs of E. faecalis Genes.
TTAGCGCTT
AAGAAAGCTA
GCTTGGA.AGT
TGGACGGCC
GTGGATATTG
CATATGCCTG
TCTTCGGCCT
GTTAACCAAG
AATTACTCTA
GTTATI'CGTG
TTTGAAAA.AG
ACCACAGTCCG
CCTTAGAGG
TGGTTAAAAA
GATGATGGGG
CTGGAAC CA
TATATAGCCA
GGCAT'ITCAG
**ACTCAACTGT
GAAGCAGTTA
AAATTAGAAC
TACTTTAACA
ATGCAAGA AT
GATGCAGGAA
GTTATTITCAA
GCGGAAAACA
TATTCCTGC
GCTCAGATGT
TAGATAAGAA
TCCCFITGC
AATTTATTTC
ATATCGMAC
ATTCTTTAAA
ATACGTTTAA
ATACAAAAGC
AAGATGTCTA
TCATCCAACA
GCGATGGTGC
ATAAAAAZTGG
CATTAATCA
CAACGTTTGA
GTGGTCAAGG
TTGATGGACC
GTTGGTTAAG
TCACATATTT
CTTATCCCTA
AAACAGACAA
ACGACCGTGT
CCCCCAAAGG
CGCCGGAAGC
TGATTCGTGC
TAAAATCAAA
GAGACAAACT
AACGGCGGTC
CTCA-A'TAAG
GA.AAGAACCG
AAAAGACCTG
CGGCGATGAT
ATTAACTGAA
CGAT-TTAGCT
ATGGTATCC
GGATTATGAA
TAATGCATTA
GATTAAAGAA
AGGACCATTT
TAAATACTAT
TGTTTACCGA
TGAAAAkAGTG
TGGGAACTTC
AAGTAGCACT
TAATACATT
AATTGATGAA
CA.ATATCALA
TGATGCCTAC
CAATAAACTA
AAAATTAGTC
TCGTGCGAAT
GAAAGATGAT
TAAATGGGAT
TAAGTAA
GTTAGTTTAG CGGCCTGTAG CGGT-,GGGAA GATCGGTA 1G AATTAGATGA AAAGACGCCT ACCAAGATTA AATGGTATAT TAACTCAGAT ACCACTGCGC ACATTAAAAA AGACTTAAAT TCAAAkATTAA ATGCCATGAT TTICAAGTGGA AAAACTGGAC A-AGCAGCATT GAAAGCAGAT AAAAAATATG ACCCCTATTT AATGAA.AGTT TTAGAGCA 1G GA.AAAACATA TCCTIACCCT AGTGGAAATA TCCCAGTAAA TGATAAT TIT GGCAAGCCAG ACGTITCAAC ACCAGAAAAT AAATATCCTC ACATGACCCC AATGGGCTTC TTAGACAAAT TACAAGACTT CTTAGGTCTT GATCGAAA'IT TAGATAAAGA ATATTTAGAA GCAGGCAATA 'ITAGtCATGA TAGCTTCACA AAACAAGGAA ATTATGCAAC CATGCTCGTT ACAGAATTTA TGAAAAAATC TGGCACACGT TCTGGCCGAA AACCAACATr AAATCAAACC ACGAAAGATG CGAAACATCC AGCAAAACTC CCGGGACAAkA T'ITTAACAAA ATATGGCGTT GGAAAAATTCG ATTATrTACC AGAAGTGAA.A AACAAAAAAT ATGGCArAG TCGTITCCTA AAAGTACCAA TGGAAAGTGC TTTAACGCALA CCACATTTCG TAATTGAAA-A TATTAATCCA GAACCAITT AAACCAAACT AAATACAACC AAAGCCTTTG ACAAATCTTT AGAAGACTAC GCAATTGAAA AAATAAAATC TGAGAAAATG *99*9* 9 9*99 9 99*9 9- 9 9. 9 99 EF018-2 (SEQ ID NO:66) MKKV LGGLLVATAV VSLAACSGGE KKASSDVS 1K
VDIEFISCDD
VNQDTFKWYA
FEKVMQQ IKE
WLKTFNDVYR
YIAIDGPSST
EGVTYAYNDQ
MQEWGKCKLV
KA.FLKSNKWD
DRYELDEKTP
SKLNAMISSC
LEDGKTYGYP
KYPEMTPMGF
ACIJISDDSFT
SGRKPTLNQT
GKIDYLPEVK
PHFVIENINP
AIEKIKSEKM
AWKLDKKKEP
DMPDIVTLTE
NYSN'TKADYE
TTVGDGAGPF
DDCATFDEKV
GISGWLSNY I
KLEQTDNDAY
DACTPEARAN
AENRDKLK
TKIKWYINSD
KTGQAALKAD
SGNI PVNDNF
LDKLQDFLGV
KQGNYATMLV
TKDAI(DPAKV
NKKYGISRFL
EAI ETKLNTT
WTALPFGKDV
SWAYSLUflLA
VIREDVYNAL
PLEDKNGKYY
AGTSGQGGNF
TQLFTLIDE
YFNNDRVNKL
VISMIRAKDD
TTAQIKKDLN
KKYDPYLMKV
GKPDVSTPEN
DRNLDKEYLE
TEFtMKKSGTR
PGQILTKYGV
KVPMESALTQ
KAFOKSLEDY
EF018-3 (SEQ ID NO:67) CTGTAG CGGTGGGGAA AACAAACCTA GCTCAGATGT GCTrrGGAAGT TAGATAAGAA TGGACGGCGC TGCCTTTTGG GTGGATATTG AATrTATrTC GATATGCCTG ATATCGTGAC TCTTGGGCCT ATTCTTrAAA GTTAACCAAG ATACGTTTAA AATTACTCTA ATACAAAAC GTTATTCGTG AAGATGTCTA TTTGAAAAAG TCATGCAACA CTCAA ITAAG
GAAAGAACCG
AAAAGACGTG
CGGCGATGAT
ATI'AACTGAA
CGATTTAGCT
ATGGTATCCC
GGATTATGAA
TAATGCATTA
GATI'AAAGAA
GATCGGTATG
ACCAAGATTA
ACCACTGCGC
TCAAANITrAA
AAAACTGGAC
AAAA.AATATG
TTAGAGGATG
AGTGGAAATA
GCCAAGCCAG
AAATATCCTIG
AATTAGATGA
AATGGTATAT
AGATTAAAAA
ATGCCATGAT
AAGCAGCATT
ACCCCTATTT
GAAAAACATA
TCCCAGTAAA
ACGTiI'CAAC
AGATGACCCC
AAAGACGCCT
TAACTCAGAT
AGACTTAAAT
TTCAAGTGGA
GAAAGCAGAT
AATGAAAGTT
TGGT1'AC CCT
TGATAATTTT
ACCAGAAAAT
AATGG~cTTC TABLE 1. Nucleotide and Amino Acid Seqeuenccs of E. faecalis Genes.
ACCACAGTCG
CCTTTAGAG
TGGTTAAAAA
GCAATrGGG
GCTCGGAACCA
TATATAGCCA
GGCATTTCAG
ACTCAACTGT
GAkAGGAGTTA
AAATTAGAAC
TACTTTAACA
ATGCAAGAAT
GATCCAGGAA
CT'TTCAA
AAAGCATTCT
GCGGAAAACA
GCGATGGTGC
ATAAAAATGG
CATTTAATGA
CAACGITTGCA
GTGG-TCAACG
TI'GATGGACC
GTTGGTTAAG
TCACATAT'T
CTTATGCGTA
AAACAGACAA
ACGACCGTGT
GGGGCAAAGG
CGCCGGAAGC
TGATTCGTGC
TAAAATCAA.A
GAGACAAACT
AGGACCATTT
TAAATACTAT
TGTTTACCGA
TCAAAAAGTG
TGGGAACTTC
AAGTAGCACT
TAATI'ACATT
AATTGAA
CAATrGATCAA
TGATGCCTAC
CAATA.ACTA
AAAATTAGTC
TCGTGCGAAT
GAAAGATGAT
TAAATGGGAT
TAAG
TTAGACAAA-l GATCGAAATr-
GCAGGCAATA
AAACAAGGAA
ACAGAATTTA
TCTGGCCGAA
ACGAAAGATG
CCGGGACAAA
GGAAAAATTG
A.ACAAkAAAAT
AAAGTACCAA
CCkATTTCG
GAAGCGA=T
A.AACCTCr GCAAT1'GAAA TACA.ACAC TI
TAGATAAAGA
TTAGTMATGA
ATTAT13CAAC
TGAAAAA.ATC
AACCAACATT
CGAA.AGATCC
TT'ITAACAAA
ATTA=ACC
ATGGCATT-AG
TGGAAAGT-GC
TAAT'MAAA
AAACCAAACT
ACAAA*TCTTT
A-AATAAA.ATC
CTTAGGTGTT
ATA=IAGAA
TAGCTTCACA
CATGCTCOTT
TGGCACACGT
AAATCAAACC
AGCAAAGTC
ATATGGCCTT
AGAAGTGAAA
TCCTTICCTA
'TTTAACGCAA
TATTAATCCA
AAATACAACC
AGAAGACTAC
TGAGAAAATC
EF018-4 (SEQ ID NO:68)
CSGGE
KKASSOVSIK
VDIEFISGOD
VNQDTFKWYA
FEKVNQQ IKE
WLKTFNDVYR
YIAIDGPSST
EGVTYAYNDQ
MQEWGKGKLV
KAFLKSNKWD
DRYELOEKTP
SKLNAIISSG
LEDGKTYGYP
KYPEMTPMGF
AGNISDDSFT
SGRKPTLNQT
GKIDYLPEVK
PHFVIENINP
AIEKIKSEKN
AWKLDKKKEP
DMPD IVTLTE
NYSNTKADYE
TTVGDGAGPF
DD)GATFDEKV
GISGWLSNYI
KLEQTDNDAY
DAGTPEARAN
AENRDKEA(
TKIKWYINSD
KTGQAALKAD
SGNIPVNDNF
LDKLQDFLGV
KQGNYATMLV
TKDAKDPAK(V
NKKYC-ISRFL
EAIE7TKLNTI'T
WTALPFGKDV
SWAYSLNDLA
\TIREDVYNAL
PLEDKNGKYY
AGTSGQGGNF
TQtjFTYLIDE
YFNNDRVNKL
VISMIRAKDD
TTAQ IKKDLN
KKYDPYLMI(V
GKPDVSTPEN
DRNLDKEYLE
TEFMKKSGTR
PGQILTKYGV
KVPMESALTQ
KAFDKSLEDY
EF019-1 (SEQ ID NO:69)
TAAAGGAGTT
CTI'GGTI'CAT
AAAACACATG
CGGATITATI'G
CAATGGACAG
CCCACTATT
TTAATCAGTT
CCAACTTATG
GCCACTGTTT
ACCAAAGGCG
TGGGTAACCA
TATCAGGACT
GCGGATI'GGA
CTTGTAAACA
GCTGTGAAAA
CCTAr1'GCCA
ACACAATGAA
TCTTACTCGC
AAGTAACAGA
CGAGTI'ATTT
TI'GGACAAGG
CCTATCACTT
CATCTCCTCT
TAGTCAAAAA
TAGATAAAAA
TCCAAGAATA
ACAACCAAGT
TAGGCCTCCA
ATCAAGTTI'C
GCGATGAATC
ATAACCAAGT
ATACTCAAAT
ACITI'TAAAA
AGCTI'GTGGT
TACCTTAGGC
AGAAGATTAT
CAGCAITCAA
GCCATATGAA
AGTI'GAAGGC
CGGCGAAAAT
AGAACAAGCG
TCITGCAAA
CTTIATGGTT
AGTTCCAAAA
TTI'AGAAAAA
AGCACCTCTT
TCATACCTAT
TGTTGAAGAT
AAGACGGTCC
AATACGAATA
AATAAAGTAA
CTAGTTGCAT
GATTATTTrAG
GCGGTTCTAA
GGTAAATACA
GTCACCTGGC
AAAAAAGTGT
AAAGATGCTG
AGCGATAATC
TTAGTGGAAG
TTAGCTGAGC
TTCCAAGAAG
GATAAAAAAA
GTAAAAAAAG
TAATTGGTAC
AAGAAGCCAA
CCGTCCCCGC
TAGGAGAAAA
CGAAAGAATT
AATTTGAACC
AAGAATACAG
GTGATCAAT
TAGA-AGATTA
GCAAATCTGC
GCTCAAGCGG
AAATI'TCTAA
TTGACGCAGA
CAATTTGGAA
GTAGTr'GGTT
CGCTC'IAAA
AACCCTTCTT
CAACGCTGAC
GAAACCCA.AA
ACCAGTCGCA
GAAAGATGTC
TGACTTATI'A
TAAAATTGCG
GGAAGATATT
TGATACCTTA
GGCAGTCTI'A
AACCGTGCTC
AAACGCTACT
CCACATTTI'C
GAACTTACCT
ATACAACGGA
TI'AA
EF019-2 ((SEQ ID NXLLKK TVLIGTI'LLL GSFLLAACGN TNKEANNADK THEVTDTLGN KV'IVPAKPKR IIASYLEDYL VALGEKPVAQ WTVGQGSIQD YLAK'ELKDVP TISYDLPYE-A VLKFEPDLLL 99 TABLE 1. Nucleotide and Amino Acid Seqeucnces of faecalis Genes.
ISSSALVECG
KGVQEYLGKK
DWNQVSLEKL
IANTQ IVEOV KYKEYSKIAP TYWVKNGENJV TWRDQLEDIA TVLDI<KEQAI( DAGKSAAVLW VTNNQVFMVS DNRSSGTVLY QDLGLQVPKL AELDADH IFL VNSDESAPLF Q.A IWXNLPA VKNNQVHTYD
KKALLN
KVLEDYDTLT
VEEI SKNATA
KKSSWLYNGP
EF019-3 (SEQ ID NO:71) TTGTGGT AATACGAATA AAGAAGCCAA CAACGCTC;AC
AAAACACATG
CGGATTATTG'
CAATGGACAG
CCCACTATTT
TTAATCAGTT
CCAACTTATG
GCCACTGTTT
ACCAAACGCG
TGGGTAACCA
TATCAGGACT
CGATTGGA
CTTGTAAACA
GCTGTGAAAA
CCTATTGCGA
AAGTAACAGA
CGAGTTATTT
TTGGACAAGG
CCTATGACTT
CATCTGCTCT
TAGTCAAAAA
TAGATAAAAA
TCCAAGAATA
ACAACCAAGT
TAGGCCTCCA
ATCAAGTTTC
GCGATCAATC
ATAACCAAGT
ATACTCAAAT
TACCI'AGGC
AGAAGATTAT
CAGCATTCAA
GCCATATGAA
AGTiGA.AGGC
CGGCGAAAAT
AGAACAAGCG
TCTTGGCAAA
CTTTATGGTT
AGTTCCAAAA
TITTAGAAAA.A
AGCACCTCTT
TCATACCTAT
TGTTGAAGAT
AATAAAGTAA CCGTCCCCGC CTAGTrTGCAT TAGGAGAAAA GATTATTTAG CGAAAGAATT GCGGTrCTAA
GGTAAATACA
GTCACCTGGC
AAAAAAGTGT
AAAGATGCTG
AGCGATAATC
TTAGT'GGAAG
TTAGCTGAC
TTCCAAGA-AG
GATAAAAAAA
GTAAAAAAAG
AATTTGAACC
AAGAATACAG
GTGMATC6AATT
TAGAAGATTA
GCA.AATCTGC
GCTCAAGCGG
A.AATTTCTAA
TTGACGCAGA
CA-ATTTGGAA
GTAGTTGGFI'
CGCTC'ITAAA
GAAACCCAAA
ACCAGTGGCA
GAAAGATGTC
TGACTTATTA
TAAAATTGCG
GGAAGATATT
TGATACCTTA
GGCAGTCTTA
AACCGTGCTC
AAACCCTACT
CCACATTTTC
GAACTTACCT
ATACAACGGA
T
9 *9 9* 9* 9 9 9 99999*
S
S*S
S
EF019-4 (SEQ ID NO:72) CGN TNKEANNADK THEVTDTLGN KVTVPA.KPKR IIASYLEDYL VALGEKPVAQ WTVGQGSIQD YLAKELKDVP ISSSALVEGG KYKEYSKIAP TYVVKNGENV rtWRDQLEDIA KGVQEYLGKK DAGKSAAVLW VTNNQVFMVS DNRSSGTVLY DWNQVSLEKL AELDADHIFL VNSDESAPLF QEAIWKNLPA IANTQIVEDV KKALLN
TISYDLPYE-A
TVLDKKEQAK
QD)LGLQVPKL
VKNNQVIITYD
VLKFEPDLLL
KVLEDYDTLT
VEEISKNATA
KYSSWLYNGP
EF020-1 (SEQ ID NO:73) S* S .5
TGAGGAGATG
ACATTAACTG
ATAATGAAAG
TATAAAGATA
ACTAGCGCAT
AGTAAAGTAA
GGGAAATFIA
AGAAAATGAA
CATGTAATGG
ATTCTCATA
AAGACACTAA
TGGAGTATTT
CC TTGATGA
AATAA
AAAGGTAGTT
TTCTAALATTA
TCGTGATGAA
TCGTTATTG
TTATTATTAT
TATGAAAGCT
TCAAT=GTT TGATGTGT GATAAAACAG GTGAAGAATT TATTCAGAAG ATG-GT'I=AG GCTGATTrr GGGTTCCTCT GATGAAGATA AGCGATTAGA AGTGGAAACT ATG-AAGTAGT
TCCAGTCTTC
TAAAAATTCT
TTTTTTAATA
TAAAGATGAA
TAGTACTAAA
GTATAAATcA EF020-2 (SEQ ID NO:74) NXKVVS ILLMVVAVFT LTACNGSKLD KTGEEFI(NSI MYtDSSYGDEY SEDCFSFLIY KDKDTNRYLA DVWVPVXDET SALEYFYYYD EDI(RLDSTKS KVTFDDMKAS GNYEvvYKSG
KFK
EF020-3 (SEQ ID ATGTAATGC TTCTAAA'ITA GATAAAACAG GTrAAGAATT TAAAAAITTC TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes.
ATAATGAA.AG ATTCTTCATA TGGTGATGAA TATTCAGAAC TATAAAGATA AAGACACTAA TCCTTA =TG CCATCTT= ACTAGCGCAT TGGAGTAT'r TTATTA'IrAT GATGAAGATA AGTAAAGTAA CCTTTGATGA TATGAA.AGCT AGTGGAAAC'r GGGAA.ATTTA AA
ATGGTTTTAG
GGGTTCCTCGT
AGCGATTAGA
ATGAAGTAGT
TT'ITTAATA
TAAAGATGAA
TAGTACTA.AA
GTATAAATCA
F-F020-4 (SEQ ID NO:76) CNGSKLD KTGEEFKNSI KDSSYGDEY SEOGFSFLIY KDKDTNRYLA DVWVPVKDET SALEYFYYYD EDKRLDSTI(S KVTFDDK(AS GNYEVVYKSG
KFK
EF021-1 (SEQ ID NO:77) *4 4 0* 4.
S 4 .4 4* 4 4 .5 5 5
S
4
S
TAGTTGITA
TI'ATTCG.GTT
GGCAAAGGCA
ATCATTACAG
TTGCAAGCTT
CAATCGAATG
AACACAATCT
AACCCTCATA
TCTGCAACAT
ACAAAAACGA
CAAGCTGGTT
GT'MATACCA
GCAATGTACC
GTCTTCCAAG
GGCGTTGACC
AACTTCACGT
CGTGCGTTAG
GGCGTTGACT
AAAGATAAAG
AATACATTAA
'ITAG IWTCAT
AAACCGCTGA
ATACAGGCGG
GGGGTAAAGA
ATGCAGCTGA
TTGGTATTGG
CAAACITI'GT
TTAGAGATAA
ACAAAGTCGG
TTGAAAAAGG
AATATCGC
AAAACGGCGT
AAGCAAAAGA
GCGATCAAGA
TAACTTCAAC
AAGACAAATT
TAACAGACGC
TAATCTCAGG
ACTATT'rrA
TCCATTAGGT
AAGCGGCGT
CGTGGATGAC
ACATGATTTA
CTATACAACC
CTACTI'GCTA
'ITTAATCGAT
TGAAGCAGCT
TTTTG'FrGGT
TGTGGCTGAT
T TCATTTGCT
TGATATCATC
CTTGAATGAA
TrCTGATGGC
GCITAA.AGGT
CCCTGGTGGC
CTATTTAAAC
TGACGTAAAA
GGAGGCTTTA CAGAAATGAA TTATCAGT'r CACITGCAGC GGCAAAGCGr ATGCAGCGCA A.AGTCGTTCA ACCAATCTTC CCAGAAGGTT CAAAA-GGGTA AATA'FTGACC AACCGTATC AAAGATGCAA TTTCTTCTGC GATCAAATCG. ATGCCAAAA TACTTAGCCG CTCTTC;CTGC GGTGAAGAAG GCTCGTAAT GCTGCGAAAG AA'11AGGTAA
AAAAGCAAAA
ATGTGGTCGGT
TAGTGCTCTA
TTGGGAAGGA
TGCATATATT
A.AGTAAATTC
AGCAGATGCC
GAATGTCGTT
TGCAAATGA-A
TGACCGTTTC
AGAAATTACT
TTTAGCTGCT
TGGACAAGGG
TTGGGTAATC
CAAAGAAGAC
TATTGCCAAC
AAAAGATGGT
TAAAACAGCA
GATCCTGCCA
TTCCATGCTT
TCAGGTTCTG
AAGTACAAAA
GTCGGCACAG
GAACATTTAG
GACAAAACAA
GTCCCAGAAA
AAGGGAAAGC
C'I'GCTGCGAC
GCGACIAAGT
CAAkAAGACGG
CGGTTCAAGA
TTTATGGATT
AAGAAGC 'GT
AACCAGAATA
*44* 4 EF021-2 (SEQ ID NO:78) MKKAKL FGFSLIALGL SVSLAACGGG KGKTAESGGG KGDAAHSAVI ITDTGCVDDK SFNQSSWEGL QAWCKEHDLP EGSKGYAY IQ SNDAADYTTN TIFGIGYLLK DAISSAADAN PDTNFVLIDD QIDGKKNVVS ATFRDNAAY KTNKVGFVGG EEGVVIDRFQ AGFEKGVADA AKELGKEIIV DTKYA.ASFAD MYQNGVDIIF MASGATGQGV FQEAXDL.NES GSGDKVWVIG VDRDQDAflGI FTLTSTLKGV GTAVQDIANR ALEDKFPGGE HLVYGLKDGG VDLTDGYLND DKVISGDViKV PEKPE
IDQAVSSKFN
LAG vAAANET
PAKGKALAAA
YX<TKDGKEDN
KTKEAVKTA-K
EF021-3 (SEQ ID NO:79)
ATGTGGTGGT
GGCAAAGGCA AAACCGCTGA AAGCGGCGGT GCAAAGrGG ATGCAGCGCA ATCATTACAG ATACAGGCGG CGTr.GATGAC AAGTCGTTCA ACCAATCTTC TTGCAAGCTT GGGGTAAAGA ACATGATTTA ccAGAAGr CAAAAGGGTA CAATCGAATG ATGCAGCTGA CTATACAACC AATATTGACC ACGGTATC
TAGTGCTCGTA
?I'GGGAAGGA
TGCATATATT
AAGTAAATTC
101 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. facalis Genes.
AACACAATCT
AACCCTGATA
TCTGCAACAT
ACA.AAAACGA
CAAGCTGGTT
GTTGATACGA
CCAATGTACC
GTCTTCCAAG
GGCGT'rGACC
AACTTCACGT
CGTGCGTTAG
GGCGrrGACT
AAAGATAAAG
TTGGTATTGG
CAAACTT
TTAGAGATAA
ACAAAGTCGG
TTG-rAAAAAGG
AATATGCGGC
AAAACGGCGT
AAGCAAAAGA
GCGATCAAGA
TAACTTCAAC
AAGACAAA T
TAACAGACGG
TAATCTCAGG
CTACTTGCTA
FI'TAATCGAT
TGAAGCAGCT
TTTTTTGGT
TGTGGCTGAT
TTCATTTGCT
TGATATCATC
CTTGA-ATGAA
TGCTGATGGC
GCTTAAAGGT
CCCMGMGC
CTA I=AAAC
TGACGTAAAA
AAAGATGCAA
GATCAAATCG
TACTTAGCCG
GGTGAAGAAG
GCTGCGAAAG
GATCCTGCCA
TTCCATGCTT
TCAGGTTCTG
AAGTACAAAA
GTCGGCACAG
GAACATTrAG G6ACAAAACAA
GTCCCAGAAA
TTTCTTCTrGC
ATGGCAAAAA
GTGTT-ccTGc
GGGTCGTAAT
AATTAGGTAA
A.AGGGAAAGC
CTGGTGCGAC
GCGACAAAGT
CAAAAGACCG
CGGTTCAAGA
TTTATGGATT
AAGAAGCTGT
AACCAGAA
AGCAGATGCC
GAATGTCGTT
TGCAAATGAA
TGACCGTTTC
AGAAATTACT
TTI'AGCTGCT
TGGACAAGGG
'ITGGGTAATC
CAAAGAAGAC
TATIGCCALAC
AAAAGATGGT
TAAAACAGCA
EF021-4 (SEQ ID 6O
S.
S S 8S Sb
S
55
S.
S
S
S.
S S 555 0
S
S
P.S.
5.5.
5 5
S
S.
S.
CGGG KGKTAE-SGGG KGDAAHSAVI ITDTGGVDDK SFNQSSWEGL QAWGKEHDLP TIFGICYLL( DAISSAADAN PDTNFVLIDD KTNKVGFVGG EEGVVIDRFQ AGFEKGVADA MYQNGVDIIF HASGATGQGV FQEAKDLNES FTLTSTLKGV GTAVQDIANR ALEDKFPGGE DKVISGDVKV PEKPE
EGSKGYAYIQ
Q IDGKKNVVS
AKELGKEITV
GSGDKVWVIG
HLVYGLKDrG
SNDAADYTTN
ATFRDNEAAY
DTKYAASFAD
VDRE)QDADGK
VDLTDGYLND
IDQAVSSKFN
LAG VAAANET
PA.KGKALAAA
YKTKDCKEDN
KTKEAVKTAK
E-7022-1 (SEQ ID NO:81) TAAGAGCATA AAAA.AATGAA ACAATGG ITr GTATTTTATTI
AAAAAGAAAC
ACGCTCAACA
GAAGGGTTAT
CCGATGAT
AGTAACGATG
CCTAAAAACG
ATICTCAGCGG
TTrAAAGGTGA
TFITCCCGC
GATAAAGT'CG
TGGCAACTAG
AATTATACAG
GATG.TrGCTA
TCGTATCCAA
ACGCCGCTTG
CTAGTCAATA
TTI'GTGGCGA
TATAACAAAG
GTTAACGrI'G
CAAGGCTCGC
GAAGCTGCAT
CCAGACTATC
TATCAGAACC
CCAGAAAAA.A
AGAAAAATAC
CCTCTGTAT
ATACTTTAGA
CAGAAGATGG
ATCCTGTCAC
GC=TGTTTA
GGAAATTAGC
CGCTCAAAGA
AAAATCNAAA
TCTATAATGG
CAAAAAATAA
T1TATCAAAGA
CACTAAGTGG
CAGCGACAAT
CAAACGAAAA
ATATTA'FrGC
ATCCCACAAC
AAAAAGCGCA
AA'TGrATGGT
TACAAGAATT
TGAACTTTGG
AAGACCCTAT
CTGTCTATGA
GATGGGCGAC
GAGTTATAGG
GGTAGGATTT
CAAAGAAGCC
ATTGGAT TT
TGAACAAGAC
AAAAACCTAC
AGCACATGAT
TAGCTCCTC
ACCCAATGAA
GCCAAAACCG
AGTAGTCGAA
TCCGTTCGTG
TCGCTATTGG
AACATCTACC
TGAACTGGCG
GAACTATTTG
CCTGCGTAAA
AGATGGTTCT
GGGTCTCGAT
AAGTTATTGG
AACAGATGAT
GTTTCCTGGT
GCGAGAAAGT
TTCTACCCTG
CAAA ITA TA
ACTGATTGCA
AGAAAGAAGA
TAGCTGGGT
GTTCAACTGA
CCAGATGCTA
CAATTGGTAC
ACGATTTC=
T TTGAATATG ATCGTrGAA
CTAGGTGTCA
TACTTTACGT
CAATTTGG'?G
GTAAAAGATT
GATCACCACA
GCATTGAATC
CAACAGAATIA
CGCTTAA.ATC
GCA'PTGGCTT
AAAGCGCTAC
TTTCGTCAAG
AAAAAAGCAc
GG'ITCTTACA
=TACAATAG3
GACTATGATT
ATGACTTT
GATGAAGCAG3
GCTGAAAAAG
TGAAAAAGTA
GTACCAATAA
TGTCACCCTC
TIGTCCAAAC
CAGCCGTAGC
TCAGAAAAGA
CTTGGAAAAA
CAA~rCAAAA
CAGCTGTGGA
CC7"lrTTAGC
CGGACTATGG
GGCAGCAAAC
ACGTGCGCTC
TTT=rAAGA
AAAATAATAC
AAAA.ACGGNA
TAGGAATAGA
ATGGTGCGAT
AAGCAGGTAA
AAGCAGAATT
AAAAAAPTGG
AGCTAACCGC
TATTCTTAAT
ACAkAGGGCAA
CCACAACCTA
AAGTIGA'ITGA
TI'TAAAAATC
AAATGAAAAT
GGAATTAACA
TGCAGCGTTT
AAAAGCATTG
AGC-GCTITTG
AATGATTGAT
TGGTGCAGAA
TGATTATACA
TITTCCGACA
AACTGCTAGT
AAAGATGGAC
AGACATTATC
TGGACAATI'A
GTTGTATCAT
AGGGCAAGCN
TAAAGAAAAT
TACGGAAGGC
TTTAATGGTT
AGGAGAAAAG
TGAAAGTTTG
ATTGCCGACT
TTACTGGACA
TGATCGCAAT
TGCCTTAGAG
AACGACTGCT
102 TABLE 1. Nucleotidc and Amino Acid Seqeuence-s or E aecalis Genes.
GGCATCATTC CACTTAGCCA AAATGAACAA ACAGTCCTGC AAAATGATAA AGTCAAACGC TTGAATTTTC ATACCTTTCG CGCTCCATTA ACGTTAAAAA ATGTTI'ATAA GGAAAAATAA EF022-2 (SEQ ID NO:82) M4KIYLR IT M4VC ILLVGFL AGCTNKNENK KKQKNiTKEAV QLMSPSELIT
LNTSVLLDFP
NODPVTAHDF
KVTLKEPKPY
QLAKNNRYWD
YPTATMNYLR
VANPTTGLDF
GSLQELFPGL
QN~PVYDKLLD
NFHTFGAPLT
DAIVQTAAPE
EYAWKKM ID P
FTSLLAFPTF
HQNVRSDIIN
LNQKRXGQAT
RQEAGNLMVY
TIELTALPTE
EAATTYALEP
LKNVYKEK
GLYSLOEQDQ
KNGFVYSFLI
FPQNXKVVEQ
YTVIKETSTA
PLANENLRKA
NKEYXAQSYWK
AALNFGRESD
EKRWATLIAA
LVPAVAKALP
VETIQNGAEI
FGADYGTASD
LNLFEDGQLD
L.ALC IDKE.NL
KAQAELGEXV
YDLFLIYWTP
EKEVIETTAG
MISEDCKTYT
SAGKLAPNEL
KVVv.NGP'.VV
VATLSGELAQ
VNNI IA DCS K
NVELMVTEDDG
DYQDP ISTLM
MIPLSQNEQT
ISLRKE-AVWS
GVTAVIDDYTL
KDWQQTKMDW
QNKNNTLYHS
ALHGAITEGF
SYKKIGESLQ
TLYKGNDRNY
VLQNDKVIKCL
S.
6@
S@
S.
SO S
*O
OS
S 0
S.
0@ S 0
S
S.
5 S OS. S 0
S
0
S.
0550
S
0 *05S
S
0S50 00 0
S.
56 EF022-3 (SEQ ID NO:83) GT GTACCAATAA AAATGAAAAT
AAAAAGAAAC
ACGCTCAACA
GAAGGGTTAT
CCGATGATTT
AGTAACGATG
CCTAAAAACG
ATCTCAGCGG
TTAAAGGTGA
TTTTTCCCGC
GATAAAGTCG
TGGCAACTAG
AATTATACAG
GATGTCCCTA
TCGTATCCAA
ACGCCGCTTG
CTAGTCAATA
ITTTGTGGCGA
TATAACAAAG
GTTAACGTTG
CAAGGCTCGC
GAAGCTGCAT
CCAGACTATC
TATCAGAACC
CCAGAAAAAA
GGCATGATTC
TTGAATTTTC
AGAAAAATAC
CCTCTGTATT
ATAG=TAGA
CAGAAGATGG
ATCCTGTCAC
GCT'ITTrTA
GGAAATTAGC
CGCTCAAAGA
AAAATCNAAA
TCTATAATCG
CAAAAAATAA
TTATCAAAGA
CACTAAGTCG
CAGCGACAAT
CAAACGAAAA
ATA ITATTC
ATCCCACAAC
AAAAAGCGCA
AATTGATGGT
TACAAGAATTr
TGAACTTTGG
AAGACCCTAT
CTGTCTATCA
GATGGGCGAC
CACTrAGCCA
ATACCTTTGG
CAAAGAAGCC
ATTGGAT=
TGAACAACAC
AAAAACCTAC
AGCACATGAT
TAGCCTCCTC
ACCCAATGAA
GCCAAAACCG
AGTAGTCGAA
TCCGTTCGTG
TCGCTATTGG
AACATCTACC
TGAACTGGCG
GAACTATTIrG
CCTGCGTAAA
AGATGGTTCT
GGGTCTCGAT
AAGTTATTGG
AACAGATGAT
GTTTCCTGGT
GCGAGAAAGT
TTCTACCCTG
CAAATTATTA
ACTGATTGCA
AAATGAACAA
CGCTCCATTA
GTTCAACTGA
CCAGATGCTA
CAATrGGTAC
ACGATITCT
TTTGAATATG
ATCGTTGAAA
CTAGGTGTdA
TACTTTACGT
CAATTTGGTG
GTAAAAGATT
GATCACCAGA
GCATTGAATC
CAACAGAATA
CGCTTAAATC
GCATTGGCTT
AAAGCGCTAC
TTTCGTCAAG
AAA.AAAGCAC
GGTTCTTACA
TGACAATAG
GACTATGATT
ATGACTTTAT
GATGAAG CAG
GCTGAAAAAG
ACAGTCCTGC
ACCTTAAAAA
TGTCACCCTC
TI'CTCCAAAC
CAGCCGTAC
TGAGAAAAGA
CT'-GGAAAAA
CAATI'CAAAA
CAGCTGTGOA
CCTTGITAGC
CGGACTATGG
GCCCAAAC
ACGTGCGCTC
T'TTTGAAGA
AAAATAATAC
AAAAACGGNA
TAGGAATAGA
ATGGTGCGAT
AAGCAGGTAA
AAGCAGAA 1T
AAAAAATTGG
AGCTAACCC
TATTCTTAAT
ACAAGGGCA.A
CCACAACCTA
AAGTGATTGA
AAAATGATAA
ATGTI'TATAA
GGAATTAACA
TGCAGCGIT
AAAAGCA'ITG
AGCGGTTTCG
AATGATTGAT
TGGTGCAGAA
TGATTATACA
TTI'TCCGACA
AACTGCTAGT
AAAGATGGAC
AGACATTATC
TGGACAAITA
GTTGTATCAT
AGGGCAAGCN
TAA.AGAAAAT
TACGGAAGGC
TTTA.ATGGTT
AGGAGAAAAG
TGAAAGT'ITG
ATTCCOACT
TTACTGGACA
TGATCGCAAT
TGCCTTAGAG
AACGACTGCT
AGTCAAAGGC
GGAAAA.A
EF022-4 (SEQ ID NO:84) CTNKNENX KKQKNTKEAV QLMS PSELTT LNTSVLLDFP DAIVQTAAFE GLYSLDEQDQ LVPAVAKALP M7ISEDGKTYT ISLRKEAVWS NDDPVTAHDF EYAWKKMIDP KNGFVYSFLI =EIQNCAEI SAGKLAPNEL GVTAVDDYTL TABLE 1. Nucleotide and Amino Acid Seqeuences of E, faecalis Genes.
VTLKEPKPY FTSLLAFPTF QLk-KNNRYWO HQN'VRSDI IN YPTATMNYLR LNQKRXGQAT VANPTTGLDF RQEAGNLI'VY GSLQELFPGL TIELTALPTE QNPVYDKLLD EAATTYALEP NFHTFGAPLT LXNVYKEK
FPQNXKVVEQ
YTVIKETSTA
PLANENLRKA
NKEKAQSYTWI
AALNFGRESO
EKRWATL IAA
FGADYGTASD
LNLFE)GQIJD
rJALG I DENIJ
KAQAELGEKV
YDLFL IYWTP EKEVI ETTAG
KVVYNCPFWV
\IATLSGELAQ
VNNI!IADGSK
.IVELMVTDDG
DYQDP ISTUL1
MIPLSQNEQT
KDWQQTKM]DW
QNKNNTLY14S
ALHGAITEGF
SYKKIGESLQ
TLYKCNDRNY
VLQNIDKVKGL
EF023-1 (SEQ ID 0 4J
TAAAATGGAG
GC=tIAACGG
CAAAAAATTG
GATACGACCT
GAAGATGAAT
GTTATACAT
A.ACGACI'TTG
GCTTTATTTT
AATACTCTTG
ACCCCTTATT
ATCAAAGAAC
TTTAAATTGA
ACGTATTGGG
CCAACGACAG
GGTGAATTTA
ACATAC
4
TTT
AACAATATTC
TTGCAAGATC
GATGGAACAG
GCGAAAGCAA
TTAGAA'ITTTT
CATTGCAAG
CGTGTTGATC
TATCGTGATC
ACGT'TCAAGA
GATCAAGAGG
ACGGITTTAG
CATCTCTATT
GGATCGGTAT
CTTGTCAGGC
CAATTAGTTC
CGACTTTAGT
TAGAGTTGGG
TTAAGATTAG
TATOGCATG
TTGATGOTCAT
GGGTTAAGGC
TGAAATCATT
AAGGGGATAA
AAGAATGGGA
ATGCTAAAA
CGGTGAA'IT
TTCCTGG'FrA
TAAAAATGAA
GTAAAGCGT
AATCAACGGC
AT1'TCACAAA
AAGAATTCTG
TAACAGATGA
AAAATCTAGA
GTGATCAAAC
CATTAACAGT
GTCATACGTA
CTCGTTTAAA
CACCAATCTA
OGCATTCAIT
GAAGAAATTA AAAATGTTAG GGGAACGGGA' AACTCGGCTG TGAAGCCGCT ATTTCGACAA CATCAATCAA GTTTATGAAG GGTCGCTGCC GAAGAACCAG AGAAGATGCC AAATGGTCGA GCAACAAGTT GCTTCCCCTA TAAAAATGCT AAGGAAATTG GCTAGATGAT AAAACGTTAG AC=TCCGF= CCTGTTTTGT ATATGCTACT GATGCAGAAC TAATGCCTCT TCTGATGACT AGTTAAATTA ACAGAAGCGA GITGACTCG AATGAATGG TGTI'GATAAT CCAGCC C CAGCGTTCGT GATGGAAAAG GGCACAACCT TTTGATAAAG TACAGATCA-A GTAATTCCGC ACTAGCTGCT AAGAAAAATA GGAAAAAGGG AAAAAAGAAA TACAGACAGC GCCAAAAAAG TGGATTAGAA GTGAATGTTA CAGACACTAT GAITTAGAAT TATGCGCATC TTTACTTCGG TGATCAATTA AI'CAAGAAA TGAC TrCCT CAACCACAAG CAATCGAAGC ATTTCTGTAT TGGACCCACG TACAGTTTAA
CATGCGTCCG
ATAGTA.ACAA
TGGAACCACA
GAG TcirATGT
CGATTTCTGA
ATGATGATCC
AATCAGGATC
CTTTAGAACGr
AAATAACTT
'ITCCACAAAA
AT=IATTTA
GGACCTACGA
AAGTTTCAGT
ATGTAGTGA
TTTCAATTCC
AAAATCCGGC
AAAGTTT
CGGGACAAAC
ACTACTTAAC
TI'GGGCTGGA
CTGCTGAGTT
CTCAAGTTCC
TATCTGGTTG
ATAGTACC TT
CTAGAACAAC
ATATITTGGT
TAGCTAATCA
AATGGGCTTA
GTTGCTTTTA
AGCAGCGGAA
CACAGCGGGG
TTI'AGGTAAA
AGATGAA.ACC
AGTAACAGCA
GATTCATCAA
CGCAGATGTG
AGAACGGCCC
TGAAAAATAT
TAATGGTCCT
AAAAAATGAT
AATTAAGAGC
TAAGCTAAGT
TCAATTCGTC
ITAGCGAAC
AAAAGAAGTC
GATTGCGCCA
CTACGATACA
TA-AAATCAAA
TTTCCAATT
TI=TACTATT
GGGAACCGAT
GGGCGGCGTA
ACATGCGGCT
GAATCAGGAA
AAAAATCAAG
TGTTAACTAA
EF'023-2 (SEQ ID NO:86) MKKLK MLGCVGLLLA LTACQAGTGN SADSNKAAEQ KIAISSEAAI STMEPH-TAGD
TTSTLVMNQV
DFVYAWQQVA
PYLKSLLSFP
YWDAEKVKLT
YFLKMNSVRD
GTDFTKLAAK
LEENLDGLEV
FKSDTYDQLI
LYWHSFGPTY
YEGLYVLGKE
SPKSGS I}QA VLFPQNEKY I
EAKVSVIKSP
GKENPALANN
KNNYLTYDTA
NVTQVPFTIR
QETRTTHAAD
SLKWAYVN
DELELGVAAE
LFFDVIKNA(
KEQGDKYATD
TTAVNLFDSN
NIRKALAQAF
KAKEFWEKGK
VDRDQTRDYD
QEARLNDFAQ
EPAISEDETV
EIALEGADVN
AEHLIYNGPF
ELDVVNKLSG
DKESFVKEVL
KE IGLDKI(L
LELSGWGTDY
AQDILVNQET
YTFKI REDAK
TLGVKALDDK
KL.KEWDNASS
EFIPGYVDNP
QDQSTATDQV
EFLTDDTDSA
RDP LTVMR IF VtLAPIYNRSI
WSNDDPVTAN
TLEITLERPT
DDWTYEKNDT
AFLSIPQFVT
IPPGQTIAPD
KXAAEFFQFQ
TSDSTLGGVT
SVLANQKIKD
TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
EF023-3 (SEQ ID NO:87)
GGGAACOGGA
CAAAAAATTG
GATACGACCT
GAAGATGAAT
GT'ITATACAT
AACGACTFI'G
GCTTTATTTT
AATACTCTTG
ACCCCTTATT
ATCAAAGAAC
TTTAAAITGA
ACGTATTGG
CCAACGACAG
GGTGAATTTA
ACATACTr'rr
AACAATATTC
TTGCAAGATC
CATCGAACAG
CCGAAAGCAA
TTAGATr CAA'FrCGAAG
CGTGTTCAT'C
TATCGTGATC
ACCTTCAAGA
GATCAAGAGG
ACGGTTTTAG
GATCTGTATTr
AACTCGGCTG
CAATTAC ITC
CGACTTTAGT
TAGAGTTGGG
TTAAGATTAG
TTTATGCATG
TTGATGTCAT
GGGTTAAGC
TGAAATCATT
AAGGGGATAA
AAGAATGGGA
ATGCTGAAAA
CGGTGAATTT
TTCCTGG'ITA
TAAAAATGAA
GTAAAGCGTT
AATCAACGGC
ATTTCACAAA
AAGAATTCTG
TAACAGATGA
AAAATCTAGA
GTGATCAAAC
CAFI'AACAGT
GTGATACGTA
CTCGTTTAAA
CACCAATCTA
GGCATTCATT
ATAGTAACAA
TGAAGCGGCT
CATCAATCAA
GGTCGCTGCC
AGAAGATGCC
GCAACAAGTT
TAAAAATGCT
GCTAGATGAT
ACTTTCGTTT
ATATGCTACT
TAATGCCTCT
AGTTAAAT'rA
GTTTGACTCG
TGTTGATAAT
CAGCGTTCGT
GGCACAAGCT
TACAGATCAA
ACTAGCTGCT
GGAAAAAGG
TACAGACAGC
TGGATTAGAA
GAGAGACTAT
TATGCGCATC
TGATCAATTA
TGACTTTGCT
CAATCGAAGC
TGGACCCACG
AGCAGCGGAA
ATTTCGACAA
G TrATCAAG
GAAGAACCAG
AAATGGTCGA
GCTrCCCCTA
AAGGAAATTG
AAAACGTTAG
CCTG71rTGT 5ATGCAGAAC
TCTGATGACT
ACAGAAGCGA
AATGAATT1GG CCAGCCMtTC
GATGGAAAAG
TTCATAAAG
GTAATTCCGC
AAGAAAAATA
AA.AAAAGAAA
GCCAAAAAAG
GTGAATGTTA
GATIAGAAT
TTTACTTCGG
A'FrCAAGAA.A
CAAGCACAAG
AT=CTGTAT
TACAGTTTAA
TGGAACCACA
GACTCTATGT
CGATI'TCTGA
ATGATGATCC
AATCAGGATC
CTTTAGAAGG
AAATAACTTT
TTCCACAAA.A
ATTI'ATTTA
GGACCTACGA
AAG'FrTCAGT
ATGTAGTGAA
TTTCAATTCC
AAAATCCOGC
AAAGTTITT
CGGGACAMA.C
ACTACTTAAC
TTGGGCTGGA
C'-GCTGAGT
CTCAAGTCC
TATCTGG'ITG
ATAGTACCTT
CTAGAACAAC
ATATGGT
TAGCTAATCA
AATGGGCTTA
CACAGCGGGG
TTTAGGTAAA
AGATGAAACC
AGTAACACCA
GATTCATCAA
CGCAGATGTC
AGAACGGCC
TGAAAAATAT
TAATGGTCCT
AAAAAATGAT
AATTAAGAGC
TAAGCTAAGT
TCA.ATTCGTC
TTTAGCGAAC
AAAAGAAGTC
GATTGCGCCA
CTACGATACA
TAAAATCAAA
TTTCCAATTT
TTTTACTAFr
GGGA.ACCCAT
GGGCGGCGTA
ACATGCGGCT
GAATCAGGAA
AAAAATCAAG
TITTAAC
EF023-4 (SEQ ID NO:88) GTGN SADSNKAAEQ KIAISSEAAI STMEPHTAGD
TI'STLVMNQV
DFVYAWQQVA
PYLKSLLSFP
YWDAEKVKLT
YFLKMNSVRD
GTDFTKLAAK
LEENLDGLEV
FKSDTYDQLI
YEGLJYVLGKE
SPKSGSIHQA
VLFPQNEKYI
EAKVSVIKSP
GKENPALANN
KNNYLTYDTA
NVTQVPFTIR
QETRTTHAAD
DELELGVAAE
LPFDVIKNA<
KEQGDKYATO
TTAVNLFDSN
NIRKALAQAF
KAKEFWEKGK
VDRDQTRDYD
QEARLNDFAQ
EPAISEDETV
EIALEGADVN
AEHLIYNGPF
ELDVVNKLSG
DKESFVKEVL
KEIGLDKIKL
LEL
1
SGWGTDY
AQDILVNQET
YTFKIREDAI(
TLGVKALDDK
KLKEWDNASS
EF IPGYVDNP
QDQSTATDQV
EFLTDDTDSA
RDPLTVMRIF
VLAPIYNRSI
WSNDDPVTAN
TLEITLERPT
EDDWTYEKNDT
AFLS IPQFVT
IPPGQTIAPD
KKAAEFFQFQ
TSDSTLGGVT
svLANQKIKJ LYWHSFGPTY SLKWAYVN EF024-1 (SEQ ID NO:89)
TAATGGCCGT
AACAAGGATC
GTCCGGCTTGT
GGTGGTAAAT
TTCAGCTA
AAAAAAATAA
AANGNTANCA
ACAGAATAA
TTCGTCTACT
ATAAAAAAGG
TATTGT'GTC
GGAAAGTGGA
ATGACTCAGA
CCrI'TGACNT
AGATTACAGG
AATAAAGAGG
AGAAGTGAGC
AGGTTGTGGA
AGAAACACGT
GGATAGTGTT~
TACTAGCAGN
GGAAATTGC
ATGAAGCTAC
ATGAAAAAAG
ACAGATATGA
GCAACTTACA
AGTGGGACAT
AACTCTTTCA
GAAAAACA.AA
TCAAATGGCG
TACTACC'TTT
AAAAGATATT
CTITTTTTGA
ACACTTATGA
TTATGGAAAA
GA.ACACTTAT
TI'GGCAATGG
TATTGCCTTA
GACTGCCGAT
TGACGGTAAA
TGAAAAAAAT
AGTNGANTNC
AAAACAAAAA
105 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
EF024-2 (SEQ ID M1 KKVLPFIALV GLLLLSGCGT DIMflKILTADG GKWKVEETRA TYTFFDDGKF SANDSEDSVS GTYTYDEKNK KITFDXTSXN SFIMEKVXXX XXKITGEICE KQRTLIKQKT E EF024-3 (SEQ ID NO:91) ATr GACTGCCGAT GGTGGTAAAT GGAAAGTGGA AGAAACACGT TTICACCTA ATGACTCAGA GGATAGTGTT AAAAAAATAA CC GACNT TACTAGCAGN AANGNTANCA AGATI'ACAGG GGAAATTGGC
ACAGAA
GCAACTTACA
AGTGGGACAT
AACTCT?1'CA
GAAAAACAAA
CTT'=TTGA
ACAC'tTATrGA
TTATGAAAA
GAACACTTAT
TGACGGTAAA
TGAAAAAAAT
AGTNGANTNC
AAAACAAAAA
55 S S EF024-4 (SEQ ID NO:92)
LTADG
GKWKVEETRA TYTFFDDGKF SANDSEDSVS GTYTYDEKNK KITFDXTSXN SFIMEKVXXX XXXITGEItE KQRTLII(QKT E EF025-1 (SEQ ID NO:93)
TGAATGAAAC
GCGCCAGATC
AACCAACGITA
AATGACGAAC
C'FrACAGGG
GAAAAAAAGA
TATI'ATAAAA
GAACCTGAAA
TATGATACAC
AGCAACACGC
AAACAAATGC
ACTTCTGCGC
CCATCGAAAG
CTTCAAGT3A
GATGAACAGG
ATTCACCAGG
ATATTAA-AGG
AAGAGCCAAC
CCGTCACCAA
TGATTGCTAA
CCACACAAAC
AAAA-AATGTT
ATGAAGGTGT
CGCAAAGGAT
GCTATI'CGC
GTACAGACGA
GTGACGAAAA
GTCAAGATTI
AAACGTATAT
TAACAGAAGA
AAAAAATCAC
ATACAATAAA
AATGTTGGTT
GAAACAAACA
TCAAACGACT
TCAATTGACT
GACATTTrGGA
TTGGTCCAAT
ATTTACTC
TCTGAATGTT
TGTCAACAAA
TACC'ITAGTC
TCGTGTTACA
AATGCCTTTA
CGGTTACGCA
GCAGAAAATA
AGAAACAGCC
CAAACCAACA
TTTATCGGAT
ACAAGTGGTC
TCTGCCGTCO
TTTGATTCTC
ACAACCCCAC
CAACCCCCTT
GCAAATTACG
GAGITTACAG
CCCTGTCGG
ACCGITGTTA
GGTAATT=~
GCAGCAGAGA
GAAGA11'TAG
AAACATGTCA
TGCGGCCT'T
ATTCTTTTAT
TATTTATTI
CGCAAGAGAC
AAAAACA.AGC
ATGAATACAC
CAGCAAAATA
TGGGATTAAT
GCATTG.TAGA
AGTTTGCTAG
ArrATCCTGA
ATC"GTATTAC
ATACGGTACG
TGGACACITG
CCAATGGCCT
GCTATGATCA
'rrATCGTCAA TCAT'FIT TA GC?1'CCTTGT
AAAGCAAGTG
GCCGACTAAA
GTACGAAGTG
TACACCGGAA
GACGGGTAAC
GATTATTACG
TGATCCI'AT
ATTCAAGCA
TTATGTAGAA
CGGTTCATCA
GCTAA.AAGAG
AATCGCTCGA
ATACTrrTCA TCGAA.ATA1T
G
EF025-2 (SEQ ID NO:94) MrKH IKGMLVF DEL IANQLTF
YKN'EGVFTCO
NTRTDDTLVT
SKETYIGYKE
HQDTINKPTI
IGUFILVGCA
DSHEYTYEVV
NYGIVEI ITE VVNG ITYVEK
DLGNGLIARL
LLFIL
PDQEPTKQTT
TGATQTTFGT
PETQRILNVE
QMRDENRVTG
QVITEEQKIK
SGPQETKQVK
TPPAKYTPEE
FT'EFASDPYY
NFY'IVRGSST
HVSYDEYFSD
QV'IVrNQTTS
KKMMFWSNQ
DTRYSGVNKR
SAREGLMPLA
EQEKITETAC
AVEKQAPTKN
PPLGLMTGNY
LSDYPEFQAS
AEMDI'WLKEP
GLFIVNRNI I EF025-3 (SEQ ID 106 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. facalis Genes.
AAC GAAACAAACA ACAAGTGGTC CGCAAGAGAC AAAGCAAGTG
AAGCAAGTTA
AATCACGAAC
G~rACAGGGG
GAAAAAAAGA
TATTATAAAA
GAACCTGAAA
TATGATACAC
AGCAkACACC
AAACAAATGC
ACTTCTGCGC
CCATCCAAAG
CT*TCAAGTGA
GATGAACAGG
ATTCACCAGG
CCGTCACCAA
TGATTGCTAA
CCACACAAAC
AAAAAATGTT
ATGAAGGTGT
CGCAAAGGAT
GCTATTCGGG
GTACAGACGA
GTGACCAAAA
GTGAAGGATT
AA.ACGTATAT
TAACACAAGA
AAAAAATCAC
ATACAATAAA
TCAAACGACT
TCAATTGACT
GACATTTGGA
'ITGGTCCAAT
ATTTACTGC
TCTGAATGTT
TGTCAACAAA
TACGTTAGTC
TCGTGTTACA
AATGCCTTTA
CGGTTACGCA
GCAGAAAATA
AGAAACAGCC
CAAACCAACA
TCTGCGGTGG
TTTGAJTrCTC
ACA.ACCCCAC
CAACCGCcTT
GGAAATTACG
GACTTTACAG
CGCCTGTCGG
ACCGTTrGTTA
GGTAATTTTI'
GCAGCCAGAGA
GAAGATTTAG
AAACATGTCA
TGCGGCC TT
ATTCTTTTAT
AAAAACAAGC
ATGAATACAC
CAGCAAAATA
TGGGATTAAT
GCATTGTAGA
AGTTTCTAG
ATI'ATCCTGA
ATGGTATTAC
ATACCCTACG
TGGACACTI'G
GCAATGGCCT
GCTATG'ATGA
ITATCGTCAA
TCAWT=
GCCGACTAAA
GTACGAAGTG
TACACCGGAA
GACGGGTAAC
GATTATTACG
TGATCCTTAT
AT'rTCAAGCA
TTATGTAGAA
CGGTTCATCA
GCTAAAAGAG
AATCGCTCGA
ATAC=TTCA
TCGAAATA'T
EF025-4 (SEQ ID NO:96).
a.
a.
a TKQT SGPQETKQVK QVTVTNQTTS AVEKQAPTKN DEL IANQLTF DSHEYTYEVV TGATQITFGT TPPAKYTPEE YKNEGVFTGG NYGIVEIITE PETQRILNVE FTEFASDPYY NTRTDDTLVT VVNGITYVEK QMRDENRVTG NFYTVRGSST SKETYIGYAE DLGNGLIARL QVITEEQKIK HVSYDEYFSD HQDTINKPTI LLFIL
KKKMFWSNQ
DTRYSGVNKR
SAREGLMPLA
EQEK ITETAC P PLC LMTGNY
LSDYPEFQAS
AEMDTWL.KEP
CUP IVNRNI I EF026-1 (SEQ ID NO:97)
TGAGTCTATG
ITTTTGGAG
CCTGCTCTTG
AGCAAACAAA
CCTGAAGAC
AATGTCACTT
TCCAACCATG
GCTGAATATG
CACGGGACTG
AAAGIAATC
AAAGATGGTG
ATTACTCATT
GGAT=ICAT
TGTTGCTTAG
CACCTAAITC
TCCAAATGC
TTGATAAAGA
TCTATCCTAA
AAATCACGGT
TCAAAGGTGA
GCAATAITAC
CCACTCTTAC
TCCCTTTGAA
GAAAATGTCT
TGCTTGTTCA
AACTACAGAA
CTTAACTCAT
GGTAACAGTT
ATTAGCACTT
TCCTAAGCTA
TATTGAGGTC
TTTTGATAAA
TCCTCAAGTC
TCAGTTATGA
AAAGTACTCA
TCT'GATAAAA
GTAGTCTCTG
A.AACCAAATT
GCTGTACTT
TATCCCAAG
ATCG=TTL
AAACCA.AAT'G
CAAGAATACA
ACCCTAGCCA
TAAAGGAAGA
CCACTCTTTT
AAACAGATAG
GTGC ErCAAT
CCATTCTCGC
TCCATGATAA
ATCATAATAA
CTGAAAATTT
GCTTACT- T
AAGATTCTGC
ATAATTAA
AATAAATAAA
CACCCCAACT
TAC'ITCTAGT
TAGTGCCAAG
ACCTACTCAC
GGAAAGAT
AAAAGTAACT
CAACATCGTT
A.AATGGTACC
TGACTTAGAA
F026-2 (SEQ ID NO:98) MKMSK VL'ITVLTATA ALVLLSACSS D)KKTDSSSSS KETANSSTEV VSGASISAKP EELEMALSDK G.NWIVAATDN NDVYRKA~y SQDDNI(KVTA EYEITVPKLI VS SENF'NIVH VNGN ITFDKQ EYKDSADLEK DGATVTGEVT VANN VTFDYEVIVA GTFHDKGKDS GTVKGDIEV< ANGFTLNGTK EFO26-3 (SEQ ID NO:99) AACACATAG TAGTTCTAGT ACCAAAGAAA CACCTAATTC AACTACACAA GTAGTCTCTC GTGC FrCAAT TAGTGCCAAG CCTGAAGAGC TCGAAATGGC GTTAACTC-AT AA.ACCAAATT CCATTCTCCC AGCTACTGAC AATCTCACTT TTCATAAAGA GGTAACAGTT GCTGGTAC'rr TCCATGATAA CGCAAAGAT 107 TABLE 1. Nucleotide and Anjino Acid Seqeuences of E. faecalis Genes.
TCCAACGATG
GCTGAATATG
CACGGGACTG
AAAGTTAATG
AAAGATGGTG
TCTATCGTAA
AAATCACGGT
TCAAAGGTGA
GCAATATTAC
CCACTGTTAC
ATTAGCACTT
TCCTAAGCTA
TATIG.AGGTG
TTGATAAA
TGGTGAACTC
TATTCCCAAG
ATCGTTTCTP
AAAGCAAATG
CAAGAATACA
ACCGTAGCCA
ATGATAATAA AAAAGTAACT CTCAAAATT CAACATCGTT GCTTTACT'Fr AAATGGTACC AAGATTCTGC TGAC'ITAGAA
ATAAT
EF026-4 (SEQ ID NO:100) TDSSSS S
KETANSSTEV
NOVYRKLALY
VNGNITFDKQ
VSGASISAKP
SQDDNKKVTA
EYKDSADLEK
EELEMAEJSDK
EYEITVPKLI
OGATVTGEVT
GNWIVA.ATDN
VSSENFNIVHi VANN- VTFDKEVTVA GTFHDKCXDS GTVKGDIEVK ANGFTLNGTI( EF027-1 (SEQ ID NO:1O1)
TTTGGTATGA
GCATGTGGAA
GTCGCATCTG
TCCGATATGA
GAGCTAGCTA
AAGTTACGAG
GCGTTTAGAA
TTTAAAAATG
GCAATCGATG
GTCTTGGTTG
GCCTATGGGA
GATGGCAATr
GTAAAATTGA
TTTGATAATG
GAGTCAAAAC
CGCCGTGAAA
GAAACTTTTG
GCTAATTTTG
TGCCATATAA
AACAGAAAAA
GTGGCGGTTC
GTGGTGAACT
TTGGTCAAGT
TGGCGAAAC
AAGCAAA.ATG
ACGTGGTCGA
GGCGTGCGGT
ACCAGACACT
GGACACCTTT
CTTCTGCAGA
CCGAAACTrG
ATGAAATTGA
GCGACTTAGA
.AGCGCATT
F1'ACCGGCAA
CAAAAGAAAT
CAAAAATCCA
TATTAAAGAA
GTGGTTAATC
GAAAACCACC
CTCGACATTA
AGTTGAAGGC
AGAGCCACAA
GACAAACGGG
TCCAGCATAC
GCGGGAAGGA
AGAACTAACA
TATGCCTAAA
TAIATTTTGTI'
GAAA ITGAAG
TGTI'CAAGTA
?I'ACACTGT
TGTACCTAAA
CGAACATGTT
TI'TAGGAGAT
GATACAGGG
CCCCAAGCTA
GGACTITGTTG
TCAAkACGAAC
GACAGCGCTC
TI'GTATCGAC
G ITAGTGAAG
GATCCAGT=A
GGTTCAAGTA
CAAGCCACGA
TTGGAAAATC
AATGAAGCCT
GGCAATGGCC
ALAGAATGATt
GTGAAAGAAA
TI'AGCAGATA
GCCATGGTGG
CGAAAAGC1T
GGCTCGACAG
AAGATI'TCCG
ACTGGAACAA
CACTGGGCTT
CAGCTACACA
ATTATACAGA
AAGATAAAAA
ACGGGTTACT
ALAGCAGGGGA
GCAGTAATCA
TGGAAGA.ATT
CAATTCCTTA
TT'GCCAAAGA
.CG?11'TAAT
ATTATGGGA
TTGGCACAGG
CF1'ATGCACT
GGTTTTAGCA
GAAAATTAAC
TGTCTATAGT
CGGAGATCCT
CTATACATTC
TTTTGTAGT
AATGGATATT
TGGTGTCAAA
F1TAGCCCAA
AAAAGGTACT
TTCAGGITGG
TAA.AGAACAC
AGCCAATCTT
TCAGTATAAA
G ITATTTAAG CCCCAATCAT T=ACAAGC GATTGACAAA C =TAAATGG NTTTGTACCA CAAAGAAAAT GGTGAITTAT
TT
a a' *a EF027-2 (SEQ ID NO:102) MXQKKWLI GLVALGLVLA ACGSGGISKTT SNEPATQKIN VASGGELSTL DSAHYTDVYS SDMIGQVVEG LYRQDKNGDP E LAMAKAEPQ VSEDGLVYTF KLREAKWqTWG DPVKAGDFVV AFRNVVDPAY GSSSSNQID I FKNGRAVREG QATMEEFGVK AIDDQTLELT LENP IPYL.AQ VLVGTPFMPK NEAFAKEKGT AYGTSADNFV GNGPFVISGW DGNSEIMKrLK KNDHYWDKEH VKLNEIDVQV VKEIGTGANL FDNGDLDYTV LADTYALQYK ESKQAJIFVPK AMVGYLSPNH RREITGNEHV RKAFLQAIDK ETFAXEILGD GSTALNGFVP ANFARIQIQV KISAXKM1VIY CHIILKKPKL TGTI EF027-3 (SEQ ID NO:103) AACGACC TCAAACGAAC CAGCTACACA GAAAATTAAC GTCGCATCTG GTGGTGAACT CTCGACATTA GACAGCGCTC TCCGATATGA TTGGTCAAGT AG ITGAAGGC TTGTATCGAC GAGCTAGCTA TGGCGAAAGC AGACCCACAA GTTAGTGA.AG AAGTTACGAG AAGCAAAATG GACAAACGGG GATCCAGTTA GCGT'ITAGAA ACGTGGTCGA TCCAGCATAC GGTTCAAGTA A~FrATACAGA
AAGATAAAAA
ACGGG~ITAGT
AAGCAGGGCA
GCAGTAATCA
TGTCTATAGT
CGGAGATCCT
CTATACATTC
TTTTGTAGTT
AATGGATATT
108 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
TTTAAAAATG
GCAATCGATC
GTCTTGGTTG
GCCTATGGGA
GATCGCAATT
GTAAAA=1GA
TTITGATAATG
GACTCAAAAC
CGCCGTGAAA
GALAAC'TI'TG
GCTAATTTTG
TGCCATATAA
GGCGTGCGGT
ACCAGACACT
GGACACCTI'
CTTCTGCAGA
CCGAAACTTG
ATCAAATTGA
GCGACTTAGA
AACGCATTT
TTACCGGCAA
CAAA.AGAAAT
CAAAAATCCA
TATTAAACAA
GCGGGAAGGA
AGAACTAACA
TATGCCTAAA
TAATTTTGTT
GAAATTGAAG
TCTTCAAGTA
TTACACTGTT
TGTACCTAAA
CGAACATGTT
'ITTAGGAGAT
GATACAGGTG
GCCCAAGCTA
CAAGCCACGA
TTGGAAAATC
AATGAAGCCT
GGCAATGGGC
AAGAATGATC
GTGAA.AGAA.A
TTAGCAGATA
GCCATGGTGG
CCAAAAGCTT
GGCTCGACAG
AGXI"TTCCG
TGGAAGAATT
CAATTCCTTA
TTGCCAAAGA
CGTTTGTAAT
ATTA'GGA
TTGCGCACAGG
CTTATGCACT
GTTATTTAAG
~TTTTACAAGC
CTTTAAATGG
CAAAGAAAAT
TGGTGTCAAA
TITAGCCCAA
AAAAGGTACT
'ITCAGGTTGG
TAAAGAACAC
AGCCA.ATCTT
TCAGTATAAA
CCCCAATCAT
GATTGACAAA
NTI'TGTACCA
GGTGA=TAT
EF027-4 (SEQ ID NO:104) TIT SNEPATQKIN VASGGELSTL DSAHYTDVYS SDMIGQVVEG LYRQDKNGDP ELAIAKAEPQ VSEDGLVYTF AFRNVVDPAY GSSSSNQMDI FKNCRAVREG QATMEEFGVK VLVGTPFMPK NEAFAKEKGT AYGTSADNFV GNGPFVISGW VKLNE I IVQV VKEIGTGANL FDNGDLDYTV LADTYALQYK RREITGNEHV RKAFLQAIDK ETFAXEILGD GSTALNGFVP
CHIILKKPKL
KLREAKWTNG
AIDDQTLELT
OGNSETWKLK
ESKQAHFVPK
ANFA.KIQIQV
DPVKAGDFVV
LENP IPYLAQ
KNDHYWDKEH
AMVGYLSPNH
KISAKKMVIY
0* EF028-1 (SEQ ID NO:105)
TAACAGAAGC
AAGAC'ITG'rr
AGAGC=IGC
TCTCAACAGA
GAAAAAGCAT
ACAACGGGCT
rTrGATACCT
ACCGATTCAG
ATTGCACTCG
GGGAAATCAA
GGCCACATA
CAAATCGACG
CGGAAAGATC
AAAAAGTCGT
CTACCTAAAA
GCTCTTCAAC
ATTGATTGGG
GAAGCGGCGT
CAACTGCAGA
AATACAACAA
ATAGTCAATG
TAGGGG'I-AC
AAAGCGGCGA
CAGTAAAAAA
ATCGCTA'TTT
AT'ITGGTCGG
C'I'ICCGCAGC
ATAATGACAA
CGGGTCT
ATG'I=CACG
GACAACACAA
GTGA'ITTAGT
TAAATGAGAA
TGA=IACCG
GGTTAGATAA
CCGGGCATAG
'ITGAAAAGGC
'PCATTCAACA
CTTAACACTT
TATGGGTAGA
CTTATTAACA
AAAACAAACA
TGTTA'TTT
CAAAGCCAAT
ACAGCAAGCC
GACAGCGATG
GTCCAAAACA
AGCAACATCT
CAAAAATATG
AGTCGATGTG
CAAAGAATTT
CCAAGACGAC
AACGGAAGAA
AAATGAAAA
TGT ITACTTG
TATGAAGGAG
'ITACAACAT
GAGGTTGCTG
ATGATTGGAG
CACTCAGACA
ACTTATCCAG
CCTGCCGGAG
GAAACAGTC
GAAATAACAC
GCAGAAATCG
TTAC'TGCGCG
TCCCAAGCGG
AAAAT=rAG
GTCCCTTCAT
GG'ITTC'ITTT
TTATTTATCA
GA.AACAAGGA
TAGCGGGTTG
A.AGCGAAGGC
ATGCCATGGG
AGCGTGTTCC
AAGATGAAGA
TGAAAACCTA
TCGAACGTGC
ATGCAACCCC
CCGATGACTA
GCGGCTCCGA
GTTATrCGTCA
GCTTGITTGC
TAGCTCATAT
TAATGG ITGA
TGAGCGAAAT
ATGGTGAACA
GAAATCAACT
AATGAAGAAA
TACAAATTTA
AACTGAATCT
GAATCCGTAT
CCAAACAGCT
AGAGAATGTC
TAATAATGCT
GAAAAAAGTG
TGCTGCATAT
'ITIGATGAT
ATTATTrCC
TGTCACAGAC
ACCAGGCGGG
CACAGAAGCG
AGGTACTCAA
GCAAGACTTC
TTGGTGGTTA
CAATGATATT GTTGGCGCGA CATCGATTTT GCCAAAAAAG GGGGGCTTrGT CTTTAG E-F028-2 (SEQ ID NO:106) MKKR ALLGVTLLTF TTLAGCTNLS EQKSGEKQTE VAFAXATESE KASVKNVIFM DTYLVGQQAT YPEDEEENVT DSASAATAMA KSTGLVATSE ITHATPAAYG AHNVSRKNMA KDRDLVKEFS QAGYCH-VTDK KSLNENQDDK LQRLDKNEKG FFLMVEGSQI DWAGHSMDIV
IGDGMGNPYT
AGVXTYNNAI
E-IADDYFDDQ
ILGLFAPGGL
GAMSEMQDFE
TGYRYFKANH
ALDNDI(SKTE
IDGQHICVDVL
PKMI DR"TIEEV
AA.FEKAIDFA
SDKRVPQTAF
TVLERAKKVG
LGGGSELFAR
PSLADMTEA-A
KKDGEHWLQ
TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
LQI IQQGACL EF028-3 (SEQ ID NO:107) ACAGA AAAGCGGCGA AAAACAAACA GAGGTTGCTG AAGCGAAGGC AACTGAATCT
GAAAAAGCAT
ACA.ACGGGCT
TTTGATACCT
ACCGATTCAG
ATTGCACTCG
GGGAAATCAA
GGCGCACATA
CAAATCGACG
CGGAAAGATC
AAAAAGTCGT
CTACCTAAAA
GCTCTTCAAC
ATTGATTGCG
GAAGCGGCGT
CAACTGCAGA
CAGTAAAAA.A
ATCGCTA=~
ATTTGGTCGG
CTTCCGCAGC
ATAATCACAA
CGGGTC 1TGT
ATG=TCACG
GACAACACAA
GTGATTTAGT
TAAATCAGAA
TGATTCGACCG
GGTTAGATAA
CCGGGCATAG
ETGAAA.AGGC
TCATTCAACA
TGTTArTT
CAAAGCCAAT
ACAGCAAGCC
GACACCGATIS
CTCCAAAACA
AGCAACATCT
CAAAAATATG
AGTCGATGWI
CAAAGAA'I=
CCAAGACGAC
AACGGAAGAA
AAATGAAAAA
CAATGATATT
CATCGATT'F
GGGGGCTTGT
ATGATTGGAG
CACTCACACA
ACTATCCAG
GCTGCCGGAG
GAAACAGTGC
GAAATAACAC
GCAGAAATCG
TTACTTCGCG
TCCCAAGCGG
AAAATTTTAG
GTCCCTTCAT
GGTTTCT
G'FrGGCGCGA
GCCAAAAAAG
CTT
ATGGCATGG
AGCGTCC
AAGATGAAGA
TGAAAACCTA
TCGAACGTC
ATGCAACCCC
CCGATGACTA
GCGGCTCCGA
G~ITATGGTCA
GCTTGTTTGC
TAGCTGATAT
TAATGGTGA
TGAGCGAAAT
ATGGTGAACA
GAATCCGTAT
CCAAACAGCT
AGAGAATGTC
TAATAATGCT
GAAA.AAAGTG
TGCTGCATAT
TTTTGATGAT
ATTATG C C
TCTCACAGAC
ACCAGGCGGG
GACAGAAGCG
AGGTAGTCAA
GCAAGACTTIC
TTGGTGGTTA
EF028-4 (SEQ ID NO:108) QKSGEKQTE VAEAKATESE KASVKNVIFM IGDGMGNPYT TGYRYFKANH SDKRVPQTAF DTYLVGQQAT YPEDEEENVT DSASAATANA AGVKTYNNAI ALDNDKSKTE TVLERAKKVG KSTGLVATSE ITHATPAAYG AHNVSRKNMA EIADDYFDDQ IDGQHKVDVL LGGGSELFAR KDRDLVKEFs QAGYGHVTDK KSLNENQDIJK ILGLFAPCGL PKMIIDRTEEV PSLADMTEAA LQRLDKNEKG FFLMVEGSQI DWAGHSNDIV GAMSEMQDFE AAFEKAIDFA KKDGEHWAWLQ LQI IQQGACL EF029-1 (SEQ ID NO:109)
TGAAGGAGGG
GCCACTTTT
GAAGTTACTG
GTTAGGCACG
CGGT'TAT
GGA'rrGGAAG
GAATATGATA
GATGGAACAT
AAAGAGCCTA
CAGTCACTCG
AGAAAATGAA
TATTATCAGG
AAAATGTTAC
TCATCGCGA
TAArI'GACAC
CTAGCAAACG
AGGGCGATAA
TACTACAAAA
CAACTAAGTA
GAATCTGGAG
AA.AGTTAATC
ATGCGCAAGT
TCAPLACTATI'
TACCACACTA
ACCCGAGACT
CACAAAAGAG
AACAGATCGT
AACGCTTGTA
TTTGGCAGAA
CATACCAGGT
GGTAAAAAGT
CTTGAACAAA
TCAAACGATC:
TTAAAAATTG
GTGAAACCGA
C7=TGTCTA
TACGGACGAG
AGTGAAGGAT
CTAGAGCAAG
TATCGTGACAC
GGCTGCTGCT
AAGCACAGGA
AACGTATACC
ACGGAAAACA
AAACAAACT
CCTI CAGA CGTTGGrGCTA
TAGCTCGTGT
CCCAAGAACA
AACGGGGG'TT
TACACCAGTA
TAGTGTAAAA
AGCTGATT
ACAAAAAGTT
TCAGCCGTTC
AATTACGTTT
CATATI'CGTA
TGCCTATGTA
GGCTAAAAAT
TAGTAAATAA
EF029-2 (SEQ ID NO:11O) MI(KLIG KKWLLLTAVA TFLLSGCASL EQKAQDSVKE VTENVTQTIS NDQRIPADFV RHVDGDTTVL KIDGKEQKVR FLLIDTPETV KPKTKVQPFG L.EASKRTKEL LSTASEITFE YDKGDKTDRY GRALGYIFVD CTLLQKTLVS ECrLARVAYV1( EPTTKYLAEL EQAQEQAKNE SLG IWSIPGY VIQRGFSK EF029-3 (SEQ ID NO:111) AAATGTTAC TCAAACTATT TCAAACGATC AACGTATACC AGCTGATTTT GTTAGCACC TGGATGGCGA TACCACACTA TrAAAAATTG ACGGAAAAGA ACAAAAAGTT 110 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
CGGTTTTTAT
GGATTGGAAG
GAATATCATA
GATGGAACAT
AAAGAGCCTA
GAGTCACTCG
TAATTCACAC
CTAGCAAACG
AGGGCGATAA
TACTACAAAA
CAACTAAGTA
GAATCTGGAG
ACCCGAGACT
CACAAAAGAG
AACAGATCGT
AACGCTTGTA
TTTIGGCAGAA
CATACCAGGT
GTGAAACCGA
CTTTTGTCTA
TACGGACGAG
AGTGAAGOAT
CTAGAGCAAG
TATGTGACAC
AAACAAAAGT
CTGCTTCAGA
CGTTGGGCTA
TAGCTCOTGT
CCCA.AGAACA
AACGGGGGTT
TCAGCCGTrc AATrTACGTTT~
CATATTCGTA
TGCCTATGTA
GGCTAAAAAT
TAGTAAA
EF029-4.(SEQ ID NO:112) NVTQTIS NDQRIPADFV RHVDGDTTVL KIDGXEQKVR FLLIDTPETV KPKTKVQPFC LEASKRTKEL YDKGDKTDRY GRALGY I VD GTLLQKTLVS EGLARVAYVK EPTTKYLAEL SLGIWSIPGY VTQRGFSK LSTASE ITFE
EQAQEQA(NE
EF030-1 (SEQ ID NO:113)
TGATTGACAC
GTTAGTACGG
ACTGAGAAAA
TCAATGGATT
GAAGGGTTAT
CCAGAAATTT
TCCA-ATGGTG
CCCAAAAACC
ATTACAAAAG
TTGGAGGTTA
TTTTTCCCAC
GAAATGATTG
TGGGATTTTIG
CATTTTGAAG
GATGTGGCTG
GCAATCGAAC
TCtATT'IrTG
TTAGTCGATA
TTTGTATATA
ACAGACGCCA
GTAGCCAITG
ATAGGGGGAA
TCTTGGTAGG
CCAGTGTCTC
CTAT'ITTTAC
ATCAATTGGA
CTG.AGGATGG
ATGCTG-TAAC
AAGCCA.ATTA
AGGAAAAAGC
C TTTAGAAAA
AAA.ACGAAGC
TATCTAATGG
TGCGTAATCC
TTCTTAAAGA
TCTTAACAGG
GGTCAAAAGT
CAAATCAGAA
ATATTTTAGC
ACCCAGAAAC
AAAAACCCAA
AACTTCTTTC
TAGTATGAAA
TTGTGGCACA
GAAAA.AAGTT
ACAAGATGAA
TGAAA.AAGAT
CAAACGATAT
AGCCAATGAT
CTTTTTCTTG
ACCAGAGGAA
GCCTGTACCA
ATTCGTGAAA
TCCGTTTTTA
CTACTATTAC
AACCAATACC
AGATTTTGCT
CTATTCCTTA
TGTCCCCAAA
AGATGGCTCA
GAATGAAGAT
AGAGTATTTA
AAGAGATGGT
AAGTTAAAAA TrATO-GGAT ACAGCAGANA CAAAAATAGA TTAAATTTAA TGGAGAACTC GCCAGTA ITA ACGCACAGTC CAACTAATAC CTGCTGCTGC ACCATTAAAC TAAGAGAAGA TCGTT?=G C'ITGCGTAA TTAGAAGGAA CGATTCTGAA TTGGGTGTCA AAGCGCTTGA TArI'TTACGT CG7TATTGGC GAAAAAGGAC AAGCCTATGC ATGAAAAATT GGGATCAGTC GATAAAGAAA AAGTAAAATC G ETATAATT Tr.TACGAATC AAACAAAATC GAGACAACCC CGTTTAAACC AAAAAAGAAA GCTTFIAGCTT ATGCTTTGGA AAAGAAATI'T ATGGGTACAT T'ITCGTCAAG AAGCAGGCGC GATAAAGCAA AACCAGAGCT GATAGTGACC GA
TATGTTATTT
CGAGAAAGCA
GGAAATCGGT
CAATGTCTTT
TAAAGAGATG
TGGCAAGTGG
ATTAGCGAAT
CGGAACAGCT
TGATTATACT
ATTTTCTCCA
CACTTCTAGT
AGCGATGTCG
AGAAACGATT
AGGTGAATTA
AGACTATGA-A
CGAAAAACCA
TAAAAAAAGT
TCCAGAAAAA
TCTTGTCAAA
AAACGGAGAT
EF030-2 (SEQ ID NO:114) MKK LKMMGIMLFV STVLJVGCGTT AXTKIDEKAT EKTSVSKKVL NLMrENSEIGS MDSIFTQDEA SINAQSNVFE CLYQLDEXDQ LIPAAAKEMP EISF.DGKRYT IKLREDGKWS NGDAV'TANDF VFAWRKLANP KNQANTYFFLL EGTILNGTAI TKEEKAPEEL GVKALDDYTL EVTLEKPVPY FTSLL.AFSPF FPQNEAFVKE KGQAYGTSSE MIVSNGPFLM KNWDQSAMSW DFVRNPYYYD KEKVKSETIH FEVLKETNTV YNLYESGELD VAVLTCDFAI( QNRDNPDYEA IERSKVYSLR LNQKRNEKPs IFANENVRKA LAYALDKKSL VDN~ILADGSK EIYGYIPEKF VYNPETNEDF RQE-AGALVKT DAKKAKEYLD KAKAELNGDV AIELLSRDGD SOR EF030-3 (SEQ ID NO:115)
GAGAAAGCA
ACTGAGAAAA CCAGTGTCTC GAAAAAAGTT ITAAATTTAA TCGAGAACTC GGAAATCGGT TCAATGGATT CTATTTTTAC ACAAGATGAA GCCAGTATTA ACGCACAGTC CAATG.TCTT GAAGGGITAT ATCAATTGGA TGAAAAAGAT CAACTAATAC CTGCTGCTGC TAAAGAGATG 1\
III
TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes.
CCACAAATT
TCCAATGTG
CCCAAAAACC
A'ITACAAAAG
TTGGAGGTTA
TTETTCCCAC
CAAATCATTG
TGGGATTTTC
CATI'TTGAAG
GATCTGGCTG
GCAATCGAAC
TCCATTTTG
TIAGTCGATA
TTTGTATATA
ACAGACGCCA
GTAGCCATTG
CTCAGGATGG
ATGCTGTAAC
AAGCCAATTA
AGGAAAAAGC
CTTAGAAAA
AAAACGAAGC
TATCTAATGG
TGCGTAATCC
TTCTTAAAGA
TCTTAACAGG
GGTCAAAAGT
CAAATGAGAA
ATATTrAGC
ACCCAGAAAC
AAAAAGCCAA
A.ACT'TCTTTC
CAAACGATAT
AGCCAATGAT
CTTTTCTTG
ACCAGAGGAA
GCCTCTACCA
ATTCGTGAAA
TCCGTT=TA
CTACTATTAC
AACCAATACC
AGArTTGCT
CTATTCCTTA
TGTCCGCAAA
AGATGGCTCA
CAATrAAGAT
AGAGTA!ITTA
AAGAGATGGT
ACCATTA.AAC
TTCcITT'
TTAGAAGGAA
TTGGGTGTCA
TAT'FrTACGT
GAAAAGGAC
ATGAAAAATT
GATAAAGAAA
GTTTATA.A~
AAACAAAATC
CGTTAA.ACC
TAACAGAAGA
CTGGCGTAA
CGATICTGAA
AAGCGCTTGA
CCTTATTGGC
AAGCCTATGG
GGGATCAGTC
AAGTAAAATC
TGTACGAATC
GAGACAACCC
AAJAAAAGAAA
TGGCAAGTG
ATTACGAAT
CGGAACAGCT
TGATTATACT
ATTTTCTCCA
CACTTCTAGT
AGCGATGTCG
AGAAACIGATT
AGGTGAATTA
AGACTATGAA
CGAAAAACCA
TAAAAAAAGT
TCCAGAAAAA
TCTTGTCAA-A
AAACGGAGAT
GCTTTAGCTrT ATGCTTTCGGA AAAGAAA=I ATGGGTACAT TTTCGTCAAr. AACCACCGCGC GATAAAGCAA AAGCAGAGCT EF030-4 (SEQ ID NO:2.16)
S
EKAT EKTSVSKKVL NLMENSEIGS MDSIFTQDE-A SINAQSNVFE GLYQLDEKDQ NGDAVTANDF VFAWRKLANP KNQANYFFLL EVTLEKPVP Y FTSLLAFSPF FPQNEAFVKE DFVRNPYYYD KEKVKSETIH FEVLXETNTV IERSKVYSLR LN'QKRNEKPS IFANENVRXA VYNPETNvEDF RQEAGALVKT DAXKAXEYLD LIPAAAKEMP EISEDGKRYT EGTILNGTAI TKEEKAPEEL KGQAYGTSSE MIVSNGPFLM YNLYESGELD VAVLTGDFAK LAYALDKKSL VDNILADGSK KAKAELNGDV AIELLSROG
IKLREDGKWS
GVKALDDYTL
KNWDQSAMSW
QNRDNPDYEA
EIYGYIPEKF
EF031-1 (SEQ ID NO:117)
TGAGAAATTA
AAACGCGTAA
AGCGAAGCAA
TTCGGATTAA
GAAGCGAAAG
AATCCCAATG
AAAGATCGGT
CCGGGAGCAA
GGGATTGTTT
TCAGCTGATT
ATGTTATACG
GCT'TGAAG
GACTTAGCNA
GTTGATATTA
GTTATTTTAG
TTTAGGGAC
CTACGAAAAG
GTGAAGATAT
TTACCTTAGA
CGGGAA 1MA
TATTM'AAAA
A.AGAGGTI'T
ACAACAAAGA
TGAAAGGTAA
TT'GCTAGTGA
CGATGAAAGA
ATATGTTCCA
TTCAAGGCGC
AAAAATAAAA
ATTAGTCGCT
CCAGAGCAAA
TGTCAAAAAA
AGTAGGCAAT
TGTCAT1'GAA
AATTACAGAA
TGAAAGTGGT
AAAATTAGGC
AATTTCTGTT
ACATGCTGGT
ATTAAAACCA
ATCTCGTCAA
ACAGAAAACG
ACCATTTGG
GCAACGTTAT
GGTGGAAGTA
GACATI'ATCG
AGCGCAGACC
TTAGCACAAG
AAAOAAGTAC
GCTGGCGTAC
ACGGAAGMTT
TAATGACTGC
ATGCT~AGT
CTCCATMA
GCTTACGAA
CAAATGCAC
CTA-ATITAAC
CAATCGCTGT
AAAAATGAAA
TTGTGGAAAC
CG=TCAACT
AAAAGAGAAT
ATTAAAkAAAT
ACAAGGTGGA
TCAGTTAACG
AAACAGTATC
TGACTTATGG
AGGTCCTTrA
CGGGAAGGCC
AAAATCGTCA
TGAT TTCCG AAAGAAATTA AAAACTGGGA CCAGACCTTG CCACGACGCC CAAGATATTA CAAAAGATAA AACG'ITTTA AAACGTATTWC ATTGAACCAC CTCTGCTrC
TGA
EFOO31-2 (SEQ ID NO:118) MKK RVILGTLVAA TLLMTACGNS EATTKSESKG GSNALVVSTF CLSEDIVKKD I LAPFEKENE AKVTLEVGNS ADRPTKL.KNN PNAGIDVIEL AQANAAQGGK DGLFEKITEK EVPNLSQLTP GAKEVFESGA GVPIAVNSIG IVYNKEKLGK EIKNWDDLWS ADLKGKI SVP DVATTAGPLM LYVASEHAGQ DITKIDNGKAA FEAWCEU(PN VVKTYSKSSD LANMFQSGEI EAAVVADFAV DIIQGAQKT EF031-3 (SEQ ID NO:119) 112 TABLE 1. Nucleotide and Amino Acid Seqeuences of Efaecalis Genes.
AA CTACGAAAAG CGAGAGCAAA GGTGGAAGTA ATGCT=AGT CGTTTCAACT
TTCGGATTAA
GAAGCGAAAG
AATCCCAATG
AAACATGGGT
CCGGGAGCAA
GGGATTGTT
TCAGCTGATT
ATGr1'ATACG GCTTT1TGAAG
GACTTAGCNA
GTTGATAFrA
GTGAAGATAT
TTACCTTAGA
CGGGAATTGA
TATTTGAAAA
AAGAGCTTTT
ACAACAAAGA
TGAAAGGTAA
TGCTAGTGA
CGATGAAAGA
ATATGTTCCA
TTCAACGCC
TOTCAAAAAA
AGTAGGCAAT
TGTCATTGAA
AATTACAGAA
TGAAAGTGGT
AAAATTAGGC
AATTTCTGTT
ACATGCTGGT
ATTAAAACCA
ATCTGGTGAA
ACAGAAAA
GACATTATCC CTCCATTTGA AGCGCAGACC GCTTLTACCAA TTAGCACA-AG CAAATGCAGC AAAGAAGTAC CTAA=tA.AG GCTGGCGTAC CAATCGCTGT AAAGAAATTA AAAACTGGGA CCAGACGTTG CCACGACGGC CAACATATTA CAAAAGATAA AACGTTGTTA AA.ACGTATTC ATGAAGCAG CTGTGGITGC
AAAAGAGAAT
ATTAAAA.AAT
ACA.AGGTGGA
TCAGTTAACG
AAACAGTATC
TGACTTATGG
AGGTCCTTA
CGGGAAGGCC
AAAATCGTCA
TGATTTTGCG
EF031-4 (SEQ ID NO:120) TTKSESXC GSNAL.VVSTF GLSEDIVKKD IIAPFEKENE AKVTLEVGNS ADRFTKLKNN' PNAGIDVIEL DGLFEKITEK EVPNLSQLTP GAXEVFESGA GVPIAVNSIC IVYNXEKLGK ADLKGKISVP DVATTAGPLM LYVASEHAGQ DITKDNGKAA FEAMKELKPN LANNFQSGEI EAAVVADFAV DIIQGAQK
AQANAAQGGK
EIKNWDDLWS
VVKTYSKSSD
S
5*5**
S
C
EF032-1 (SEQ ID NO:121)
TGAATAAATT
GTTTGTGGTA
AAGTCAACCA
TCAGGGGAAT
CAACTAGATG
AACCATGACT
CTCACATTCG
CAAGAATATA
TCTGATGTT
GAAGTAACTA
AATACTAATC
GATACTTTAG
TCAATTATCG
A ETTAGGAGG
TTTCACTACT
GTCAATCTAG
ATTCAGTT'-G
ATAAATCGAG
TATACGGAGT
AAACTGCCGA
TGAAAA-ATCC
CTAAAAGTAG
CTGAAGCGAA
ACATTACTAA
CTAAGCATCA
GCGTCATCCC
A.ATTATGATG
TACTGCTTGT
CAGCACAGTT
AAAAGATATT
CATAGTTCTT
GGGAAACAAG
CAAAGATTr
AGTATCNACT
TAGCCAAGAT
GTCTGATGTA
GCTAGCAGAT
ATTTAATGAT
AACCATGGAC
AAAAA.ATTAA
NCGGGAAATA
AAACAACCGA
GATCCTGGAG
ATTACCGTCA
AAAAAAGTAT
GTTGTTAGAT
ACTGAAACTA
AATAAACAAT
GCTACTAATA
GAGCCAACCT
ATGTATCCTI'
GCAAAAAGAT
TTAG=IAGG
ATGATAATAA
ATTCAAAAGA
ATTACTATGC
AATCAGGCCG
CTCTTAAAAA
TTITAAATGA
GCAAACANAA
CTGATGTATC
CTTTACCGAG
TAGAACAACA
ATAA-AGGAAG
GGTAA
ATTGG'TTGT
AGATACTGAA
CTTTGTTGCG
TGTATITAACT
ACAAAATAGT
GGGAGATACT
AAAAGATTTT
AACACTAAAC
TGAAAAAAA.A
CGAAGATAAA
AACCGTC'TTA
CAAAATGCAT
EF032-2 (SEQ ID NO:122) MK KLISLGLAVCV CGISLLTACX GNNDNKDTEK STSQSSSTVK QPNSKDFVAS GEYSVGKDID PGDYYAVLTQ LDDKSSIVLI TVKSGGENSN HDLYGVGNKK KVSLKKGDTL TFETADKDFV VRFLNEXDFQ EYMKNPVSST ETSKXKTVNS DVSKSSSQDN KQSDVSEKKE VSTEAKSDVA TNTLPSEDKN TNOITKLADE PTL.EQQTVLD TLAKHQFNDM YPYKGSKMHS IIGVIPTMDA KRW EF032-3 (SEQ ID NO:123) TA ATGATAATAA AGATACTGAA AAGTCAACCA GTCAATCTAG CAGCACAGTT AAACAACCGA ATTCAAAAGA CTTrGICCG TCAGGGGAAT ATTCAGTTGG AAAAGATATT GATCCTGGAG ATTACTATGC TGTATrrAACT 1.13 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecais Genes.
CAACTAGATC
AACCATGACT
CTCACATTCG
CAAGAATATA
TCTGATGT
GA.AGTAAGTA
AATACTA.ATG
GATACTTTAG
TCAATTATCG
ATAAATCGAG
TATACGGACT
AAACTGCCGA
TGAAAAATCC
CTAAAAGTAC
CTGAAGCGAA
ACATTACTAA
CTAAGCATCA
GCGTCATCCC
CAkTAGTTCTT
GGGAAACAAG
CAAAAT
AGTATCNAGT
TAGCCAAGAT
GTCTGATGTA
GCTAGCAGAT
ATTTAATGAT
AACCATGGAC
ASACCGTCA
AAAAAAGTAT
GTTAGAT
ACTG;AAACTA
AATAAACAAT
GCTACTAATA
GAGCCAACCT
ATGTATCCTI'
GCAAAAAGAT
AATCAGGCGG
CTCTTAAAAA
TTTITAAATGA
GCA.AACANAA
CTGATGTATC
CTTTACCGAG
TAGAACAACA
ATAAAGGA.AG
GG
AGAAAATAGT
GGGAGATACT
AAAAGATT
AACAGTAAAC
TGAAAAAAAA
CGAAGATAAA
AACCGTCTTA
CAAAATGCAT
EF032-4 (SEQ 10 NO:124) NDNKDTEK STSQSSSTVK QPNSKDFVAS GEYSVGKDID PGDYYAVLTQ LODKSSIVLI TFETADKDFV VRFLNEI(DFQ EYMKNPVSST 'JSTEAKSDVA TNTLPSEDKN TNDITKLADE IIGVIPTMDA KRW TVKSGGEFNSN HDLYGVGNKK XVSLYKDTL ETSKXKTVNS DVSKSSSQDN KQSDVSEKKE PTLEQQTVLD TLA'4QFNDM YPYKGSKMS EF033-1 (SEQ ID NO:125)
TGACTGCTTT
CAAAGAGGT
AAAATGAAAA
GCAGGATGTG
TTACCAACCA
ATTTCCCTAG
ATCGCAGTTG
GATATGATG
AATGACATCA
ACAGTCGTTA
ATCCCTGATA
GAAATCGACG
TTT~TCTATTG
TCkITTCAGA
AAITTTACTTI
GTAAACAGGA
AAGACCGTAG
TGCCATCAAC
ATACTCAAAG
CTCTCGATGC
ATTAGCTAG
ATATCCCCAC
GC PTATCTGA
AGTAG
GAGAAAAAAG
AAA 'rTCCC
AACAATGATG
AAAGAAAGCA
CGGCAAAGAA
AACAGA.AGTG
TAGTACAATG
CGAAAAATTG
CTCAGAAAGT
TAGTACAAGC
ACATGAAAAA
TGGT=ITTTT
CA.AAATAAAA
AC EAGGIT
ACTACCTCTT
ATTACTTTAC
ATIGAAGACT
ATGACTGATT
ATTGCCTTGA
GT LTGGAAGC
ATCAAAGCAA
GGACAA.AAGT
GTATTG7Wfl
TAGACGAATC
TAGTACCAAC
CTGAAAAAAC
CCAAAGAAGC
TAGGTAAAAC
TAAAAAAATT
AACCACAAAT
AAGTrGCAAGA
TCAAAGAAGA
TAATCAAAAC
GACGTTGAGA
CGAGGATCAA
ACTTGGC TTA
AGAAGTAACG
AACCAAAATT
CGACCAA'ITA
ACCACAAATG
TGTTTATGTG
TGCTGGA-AT
CGTCCAATTC
AATGGATCAA
EF033-2 (SEQ ID NO:126) S S MKKFTLTMMT LGLVATLGLA GCGKQEKXAT TSSEKTEVTL AVDTQSSTMM TDLKKLPQMD VVNIPTSTSI KAIKEDVQFI PTKDRSGKEI TLPKEATKII SLVPST.PEVI EDLGKTDQLI MAVDAEKLI ALKPQIVY-VN DINLASSESV WKQVEDAGIT ADSL.SEHEKG QKLIKTMDQE IDE EF033-3 (SEQ ID NO:127) CTCTT CTGAA.AAAAC AGAAGTAACG
TTACCAACCA
ATTTCCCTAG
ATCGCAGTG
GATAT'GATGG
AATGACATCA
ACAGTCGTTA
ATCGCTGATA
AAGACCGTAG
TGCCATCAAC
ATACTCAAAG
CTGTCGATC
ATTTAGCTAG
ATATCCCCAC
GCTTATCTGA
CGGCAAAGAA
AACAGAAGTG
TAGTACAATC
CGAAAAATTG
CTCAGAAAGT
TAGTACAAGC
ACATGAAA.AA
ATI'ACTTTAC
AMrAAGAC'I
ATGACTGATT
ATTGCCTTGA
GTTTGGAAGC
ATCAAAGCAA
GGACAA.AAGT
CCAAAGA.AGC
TAGGTAAAAC
TAAAAAAAT'
AACCACAAAT
A-AGTGGAAGA
TCAAAGAAGA
TAATCAAAAC
AACCAAAAT
CGACCAATTA
ACCACAAATG
TG'ITIATGTG
TGCTGGAATT
CGTCCAATTC
AATCCATCAA
GAA.ATCGACG AGTAG 114 TABLE 1. Nucleotide and Amnino Acid Seqeuences of E. faecalis Genes.
EF033-4 (SEQ ID NO:128) SSEKTEVTL PTKDRSGKEI TLPKE-ATK<II SLVPSTTEVI EDLGKTDQ:LI AVDTQSST2d TDLKKLPQMD MIMAVDAEKLI AL.KPQIVY-VN DINLASSE-SV WKQVEDAGIT VVNIPTSTSI KAIKEDVQFI ADSLSEHEKG QKLIKTMDQE IDE EF034-1 (SEQ ID NO:129)
TAGGAGGGAG
TTGGTAGG'T
CCTTTAATAC
TTCAACAAGG
GACACAAAAA
AATAAAGAGA
AAAAAAGTTG
TAATCATCAA
GTAGTAATAA
TCACCACGAT
ATAAAACCAT
AAACAAGTAG
GCTATGAAAT
ATCAAGGTAA
AAAAATCGGG
CAAAAAAGAA
TATGAAAAA
GACGCTTGAA
AACAGA-AAAA
TATAGGTCAA
ACGTATATCT
TATI=rAG'Pr
AACGGCAATC
GAAGAAGACC
AAAGAATATI'
AAGGTATATA
'ITGGACAAAA
GATGCAGAAG
GTATTATTTT
'-TTTGAATGC
TAACGAAAGG
TAGTTAATCC
AAAATATTAA
AAACGAAAAA
GTAATGTCTA
T'ICATGTI'T
CAGTTCGTI'
TTCAATTTTT
CAATAATGALA
AATACAAGAA
AATAGAGTTT
TGGTGAT=rI GGTGGTAAAT AG EF034-2 (SEQ ID NO:13O) MrKKIGY FSCIIFFMFL VGCSNNKKEN GNLLNASSFP LILTTIIEKE EDLTKGSIFF NKDKTMTLEK EYLVNPNNED TKKTSRTEKX VYKNIKIQEN KESYEIIGQL DKKTKKIEFK KVDEGKRISD AEGNVYGDFG GK EF034-3 (SEQ ID NO:131) AGAA AACGGCAATC TTTIGAATGC CAGTTCGTTT CCTTTAATAC TCACCACGAT TATTGAAAAA GAAGAAGACC TTCAACAAGG ATAAAACCAT GACGCFI'GAA AAAGAATATT GACACAAAAA AAACAAGTAG AACAGA-AAAA kAGGTATATA AATAAAGAGA GCTATGAAAT TATAGGTCAA TTGGACAAAA AAAAAAGTTG ATGAAGGTAA ACGTATATCT GATGCAGAAG GGTGGTAAAT AG TAACGAkAAGG
TAGTTAATCC
AAAATAI'1AA
AAACGAAAAA
GTAATGTGTA
'ITCAATT1TTT
CAATAATGAA
AATACA.AGAA
AATAGAGTT
TIGTGATTTT
0 EF034-4 (SEQ ID NO:132) KEN GNLLNASSFP LILTTIIEKE EDLTKGSIFF NKDKTMTLEI( EYLVNPNNED TKKTSRTEKK VYKNIKIQEN KESYEIIGQL DKKTKXIEFK KVDEGKRISD AEGNVYGDFG GK EF035-1 (SEQ ID NO:133)
TAAACGAGAG
TTATTCACAA
ACAATTGATC
GAAAATTCCA
AA.AATCGATA
GTGAGTTTAT GAAAACAAAA ATCGGAAAAA G'TrTCCT=r ACTGAGTGGT TGTACCTCGG GACAGAAAGA AAAAGTCGAT AAA.ACGGTCG TGGAAAGTTA CGACGAAAAA GTTGACCGTT CTACTGAGTA A
CAGTTATCTI'
CTGGCGAAGA
ATAAGCACAA
CTTTAGATAG
GTCAGCATTT
GATGGAAAA-A
ACATAAAAAT
TCAAGAAGAC
Er035-2 (SEQ ID NQ:134) M~KTKI GKTVILSAFL FTSFLLLSGC TSAGEEET IDRQKEKVDK TLVDKQKHKNE NSMESYDEKV DRSLDSQEDK ID~rE 115 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes.
EF035-3 (SEQ ID NO:135)
GATGGAAAAA
ACAATTGATC GACAGAAAGA AAAAGTCGAT AAAACOCTCG ATAAGCACAA ACATAAAAAT GAAAA LTCCA TGGAAAGTTA CGACGAAAAA GTTGACCGTT CTTTACATAG TCAAGAAGAC AAA.ATCGATA CTACTGAG EF035-4 (SEQ 10 NO:136) MEKT IDRQKEKVDK TSJDKQKHKNE NSMESYDEKV DRSLDSQEDK IDTTE EF036-1 (SEQ ID NO:137) *9
TAATTTTCAA
AAT'GAATCGA
ACAGGATGTG
TTACAACCAT
ATTAATGTCC
GACATTGGTA
ATTGATCATA
AAAGATATCT
GAACTTGGCG
CGTGCGACTT
GATTCCAGCG
GCATTTTCAT
GATGAAAATG
AAAAATCCAA
GAACGTGTGA
TGGCAAGGAA
GTCCTACATA
TGAAAAAAAG
CAAAATGGAT
TAGTAGAGAC
AAGGTGGCGG
AI'CTGAI'T
AAGTTGCTGT
CAATGGAAAA
GGAAAGACCA
TTGAAAAGTG
GCATGGTTCG
ATGTTACTGA
TAATGAACAA
GTGATTTAAC
TTGGTCAA.TT
ATGTCATTAA
TAATGGTAAA
ATTGCTATTA
TGATCGTGGT
AGCGAGTGAG
AAGCGGAACA
ATTTGCAGAA
CGTTGGGATT
TTTAA.AGAAA
AAAAATTGTT
GGTCTTGGGA
TTCCATTGI'
TGAAGTAGCT
TAAATGGATT
CAAAGAGTT
AGGGTATATT
ATAA
ATAGAATGGA
TTTATTrG=
GAATCCATCA
GAATATCAAA
GGTCTGAGTC
GAGAAAAAGG
ACACCAATCG
T'rGA.AA'TAA
TGGCAAGTAT
CAGCGGTAGG
GCCAAAATCC
AAGTCCAATC
GCATCAkAGC
TTAACAAAAA
ATCITITAG GTGAAGTAAC ATTT'GAATA GAGCGGCCGG GATAAAACAG CCATTCGTGC TCTGATACAC CAGGAGCGAT ACGTTAAGTA TTGATGGTGT ATIWGGTCTT ATGA-ACACAT TTAGACTTTA TGTTGTCAGA CCTGTI'CGA AAATGGAAAT
TTGGAGGAAT
ACTTACTT
CTCATCAGCT
GGGAAGATTT
TGGCGCGGTA
GGAAGACTTA
TGTCGGTGTC
AAACTGGAAA
TAGTGGTACG
GCAAGAACAA
TAGTTATACC
TCAGCCA.ACA
GTACACTCGT
TGATATCCAA
TGAACGGGAT
EF-36-2 (SEQ ID NO:138) MKKRLLLF IGLASILTLT GCAXWIDRGE SITAVGSSAL QPLjVETASEE YQSQNPGRFI NVQGGGSGTC LSQVQSGAVD IGNSDLFAEE DHKVAVVGIT P IVNKNVGVK DI SMENLKKI FLGEVTNWKE LGGKDQKIVI ATFEKWVLGD KTAIRAQEQD SSGMVRSIVS DTPGAISYTA FSYVTDEVAT ENVMNNKWII WSYEHMYTRK NPSDLTKEFL DFMLSDDIQE RVIGQLGYIP
KKGIKAEDLI
LNRAAGSGTR
LS IDGVQPTD
VSKMEIERDW
EF036-3 (SEQ ID NO:139) GAT TGATCGTGGT GA.ATCCATCA CAGCGGTAGG CTCATCAGCT
TTACAACCAT
ATTAATGTCC
GACATTGGTA
ATrTCATCATA
AAAGATATCT
GAACTTCCCG
CGTGCGACIT
GATTCCAGCG
TAGTAGAGAC
AAGGTGGCGG
ATTCTGATTT
AAGTTGCTCT
CAATGGAAAA
GGAAAGACCA
=ITAAAAGTG
GCATGGTTCG
AGCGAGTGAG
AAGCGGAACA
ATTTGCAGAA
CGTTGGCATT
TTTAAAGAAA
AAAAATTGTT
GGTCWIGGGA
TTCCATTGTT
GAATATMAAA GCCAAAATCC GGTCTGAGTc AACTCCAATC GAGAAAAAGG GCATCAAAGC ACACCAATCC TTAACAAAAA ATCTTTAG CTGA-AGTAAC ATTTGAATA GAGCGGCCGG GATA.AAACAG CCATITCGTGC TCTCATACAC CAGGAGCGAT CGOAACAT1T
TGCGCGGTA
GGAAGACTTA
TGTCGGTGTC
AA.ACTGGAAA
TAGTGGTACG
GCAAGAACAA
TAGTTATACC
116 TABLE 1. Nucleotide and Amrino Acid Seqeuences of E. tieca lis Genes.
GCATTT'rCAT
CATGAAAATG
AAAA.ATCCAA
GAACGTGTGA
TGGCAAGGAA
ATGTTACTGA TGAAGTAGCT TAATGAACAA TAAATGGATT GT'GATTI'AAC CAAAGAGTI TTGGTCAAI' AGGGTATATT ATGTCATTAA A
ACGTTAAGTA
ATTTGGTC'TT
TTAGACTTTA
CCTGTTT'CGA
TTGATGGTGT
ATGA-ACACAT
TGTTGTCAGA
AAATGGAAAT.
TCAGCCA.ACA
GTACACTCGT
TGATATCCAA
TGAACGGGAT
EF036-4 (SEQ ID NO:140) IDRGE SITAVGSSAL QPLVETASEE YQSQNPGRFI NVQGGGSGTG DHKVAVVGIT PIVNXNVGVK DISMENLKKI ATFEKWVLGD KTAIRAQEQD SSGMVRSIVS ENVMNNKWI I WSYEHMYTRK NPSDLTKEFL
QGNVIK
LSQVQSGAVD
FLGEVTNWKE
DTPGAI SYTA
DFMLSDDIQE
IGNSIDLFAEE
LGGKEDQKIVI
FSYVTDEVAT
RVIGQLCYI P
KKGIKAEDLI
LNRAAGSGTR
LS IDGVQPTD
VSKMEIERDW
EF037-1 (SEQ ID NO:141) a.
a.
a a a a. a
TGAGTGTATG
TTI'TTTGGAG
GCTGCTCTTG
AGCAAAGAAA
CCTGAAGAGC,
AATG-TCACTT
TCCAACGATG
GCTGAATATG
CACGGGACTG
AAAGT'AATG
AAAGATGGTG
ATTACTCAT
GGATT ITCAT TGTTGCTrAG
CAGCTAATTC
TCGAAATGGC
TT1GATAAAGA
TCTATCGTAA
AAATCACGGT
TCAAAGGTGA
GCAATAITAC
CCACTGTTAC
TCCCTTTGAA
GAAAATGTCT
TGCTTGTTCA
AAGTACAGAA
GTTAAGTGAT
GGTAACAGT'
ATTAGCACTT
TCCTAAGCTA
TATTGAGGTG
TTTTGATAAA
TGGTGAAGTC
TCAGTTATGA
AAAGTACTCA
TCTGATAAAA
GTAGTCTCTG
AAAGGAAATr
GCTGGTACT
TATTCCCA.AG
ATCGTTTCTr
AAAGCAAATG
CAAGAATACA
ACCGTAGCCA
TAAAGGAAGA
CCACTGTTTT
AAACAGATAG
GTDCTTCAAT
GGATTGTCGC
TCCATGATAA
ATGATAATAA
CT0AAAA=T
GCTTTACI''
AAGATI'CTGC
ATAA
AATAAATAAA
GACGGCAACT
TAGTTCTAGT
TAGTGCCAAG
AGCTACTGAC
GGGGAAAGAT
AAAAGTAACT
CAACATCGTT
AAATGGTACC
TGACTTAGAA
EF037-2 (SEQ ID NO:142) MKMSK VLITVLTATA ALVLLSACSS DKKTDSSSSS KETANSSTEV VSGASISAKP EE.LEMALSDK GNWIVAATDN VTFDKEVTVA GTFHDKGKDS NDVYRKLALY SQDDNKKVTA EYEITVPKL I VSSENFNIVH GTVKGDIEVK ANGFTLNGTK VNGNITFDKQ EYKDSADLEI( DGATVTGEVT VANN EF037-3 (SEQ ID NO:143) AACAGATAG TAGTTCTAGT AGCAAkAGAAA CAGCTAATI'C CCTGAAGAGC TCGAAATGGC AATGTCAC IT TTATAAAGA TCCAACGATG TCTATCGTAA GCTGAATATG AAATCACGGT CACCGACTO TCAAAGGTGA AAAGTTAATG GCAATATTAC AAAGATGGTG CCACTGTTAC
AAGTACAGAA
GTTAAGTGAT
GGTAACAGTT
ATTAGCACTT
TCCTAAGCTA
TA'ITGAGGTG
TTIATAAA
TGGTGAAGTC
GTAGTCTCTG
AAAGGAAATT
GCTGGTACTT
TATTCCCA.AG
ATCGTITCTT
AAAGCAAATG
CAAGAATACA
ACCGTAGCCA
GTGCTTCAAT
GGATTGTCGC
TCCATGATAA
ATGATAATAA
CTGAAAATTT
GCTTTACTTT
AACA~CTGC
TAGTGCCAAG
AGCTACTGAC
GGGGAAAGAT
AAAAGTAACT
CAACATCGTT
AAATGGTACC
TGACTTAGAA
EF037-4 (SEQ ID NO:144)
TDSSSSS
KETANSSTEV VSGASISAXP EELEMALSDK GNWIVAATDN VTFDKEVIVA GTFHDKGKDS NDVYRKLALY SQDDNKKVTA EYEITVPKLI VSSENFNIVH G IVKGDIEVK ANGFTLNGTK 117 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
VNGNITFDKQ EYKDSADLEK DGATVTGEVT VANN EF038-1 (SEQ ID NO:145)
TAATGGCCAT
AACAAGGATC
GTCGGC2~rGT
CTCGTAAAT
GAAkACT7LT
AATAATAAAA
GAATATAAAG
TTCCTCTACT
ATAAAAAAGG
TATTGTTGTC
GCAACTAGA
CGAGGTATAA
AACTCACTTT
ACGGTAAATT
AATAAAGAGG
AGAAGTGAGC
AGGT'GTGGA
AAATAAAAGT
TTCAAAAATTr
GGATATAAA.A
AAAAGGTGA-A
ATGAAGCTAC
ATGAAAAAAG
ACAGATATGA
CCAACTACTA
AGTrGATAGTG
AATAAAGAAC
ATrGAGGCG
TCAA-ATGGCG
TACTACCTTT
AAAAGATATT
CTTACACTTT
GAACGTACTC
AATTAATA.AT
AGAALGGACTC
TTGGCAATGG
TAT1'GCCTTA
GACTGCCGA*T
TITTGATCAT
TTACGATGAA
GGAAAATGTT
TGATAAAAAA
TNGAATAAGA GTGTCTTTG A EF038-2 (SEQ ID NO:146) M KLLKWRWQWN KDHKXGEVSM KKVLPFIALV GLLLLSGCGT DI~fl(KILTADG GKWELENXSP 'PTTYTFFDDE TFSRYNSKIS DSGTYSYDEN NKKLTLDIKN KEQLIMENVE YKDGKLKGEI GGEKDSDKKX NKRCL EF038-3 (SEQ ID NO:147) TTGTGGA ACAGATATGA AAAAGATAT GACTGCCAT GGTGGTAAAT GGGAACTAGA AAATAAAAGT CCAACTACTA GAAACIT'N CGAGGTATAA TTCAAAAATT AGTGATAGTG AATAATAAAA AACTCACTT GGATATAAAA AATAAAGAAC GA.ATATAAAG ACGGTAAATT AAAAGGTGAA ATTGGAGGCG TNGAATAAGA GGTCTTT=G A
CTTACACTTT
GAACGTACTC
AATI'AATA-AT
AGAAGGACTC
TTTTGATGAT
TTACGATGAA
GGAAAATGTT
TGATAAAAAA
EF038-4 (SEQ ID NO:148) CGT DMKKILTADC GKWELENKSP TTTYTFFDDE TFSRYNSKIS IJSGTYSYDEN N'KKLTLDIKN KEQLIME-NVE YKDGKLKGEI GGEKDSDKKX NKRCL EF039-1 (SEQ ID NO:149)
TAAATATATC
GCGC ITACCT
TCTGCAACAG
GCAGCTAAAG
CAAGGCACAA
A=TG'ITAG
GGTGAAACCC
TTAATTTCGG
AAATTTACCT
GAACATATCC
ACAGAAACTG
TGGAATGCCA
TI'TATI'AGTT
ACGGGTAAAA
GCTCACGTTT
GATAAACGTT
AAAAAGAAAA
TATTAACCTT
ATAAATCAAG
AGCAATCAAA
AAGTTI'ACCA
CAAAATATGA
GTGGCGAGA
ATACACAAAA
ATAAGCGAAT
CTATTCTC.A
GCAAGATTGT
CGAAAGTFr'r TACC.GAAA1T
CACGTGGAGA
CAATTGGTGA
TTACGTATAC
AAGGGGATTA
TAGTACGTTG
TGCAGCTAGC
AGGACAAGAA
CAAAAATNAT
TGGTGAAACA
AGGCACATI'C
CTATCAAGCG
GGGTAAAGAT
CGAGAAA LTA
TACCAATGAA
AGATGAAGAC
TGATAATAAA
TTTTGGTTAC
CAATAAATAT
ACGAATGCGT
CCAACCATGA
TTGATTGCAG
TCAAGCACT13
TAACAGAAA
AATAATTTAA
GGTFI=ATG
TTTGTGACAG
GTGGTCGAT
AAAGACGGGA
ACCT'ITACGA
CCTGGGGATG
GGTAACGATG
ACAAGTAAAT
TI'CCAAGTA-A
GGAGCTGCAT
AAAGACKACA
AAAAGAAAA-A
GCTGTGCTGG
CAGTCTCTAG
TTTTrATCCAG
CAGCAGAAA
AATTTTTCGA
ACGATGGCGA
TAACGGAAGT
AAGATGTAGA
ACGGCCGTAA
ACATTTAGG
TTACTGAAGC
ATGA.ATICTT
TTATAAT;A
TAGAATTAAC
ATGGCAAAGA
AGTTITTAGT
CGGAGCCAAC
TTCAGCAGAA
TACTGATTG
TGCTAATTT
CAAAGAAACA
AAAGCGTATC
GACGAAAGAT
AGTCTITGTA
AGATAGAA
GGCcACATTA
AAATAAAATG
TGATTTAGA-A
CAAAATCCGT
AGAA'ITAAAT
AATTAAAGTC
TABLE 1. Nucleotide and Amino Acid Seqcuences of E.faeca lis Genes.
TTTGTAGAAC ATGAACCATA TGAAGGAGAC TTTACGCCAG ACTTCACGTT CTAA EF039-2 (SEQ ID M~KKKVFSA LTLLTFSTLL IACCAGGANS ATDKSSA.ASS STAVSSSAEA A.KEQSKGQEL TEILSSTDWQ GTKX1YDKNXN NLTAENANFI GLAKYDCGETG ETRGDEGTFF VTDDGEKRIL ISDTQNYQAV VDLTEVTKDK FTYKRMGKOK HIPYSDEKLT FTNGRKDLET ETGKIVTNEP GDOILGATLW NGTKVLDEOG ISLA-KFDNKT SKYEFFDLET GKTRGDFGYF QVIDNNKIRA HVSIGDNKYG KRFTYTRMGK DNNGKEIKVF VEHEPYEGDF TPDFTF
FYEFFDKETG
DGKDVEVFXTE
NDVTEANKMF
AALELTELND
EF039-3 (SEQ ID NO:151) TGCAACAG ATAAATCAAG TGCAGCTAGC TCAAGCACTG CAGTCTCTAG TTCAGCAGAA 0 OS 0O 0*
GCAGCTAAAG
CAAGGCACAA
ATTGGTTTAG
GGTGAAACCC
TTAA'ITCGG
AAATTT1ACCT
GAACATATCC
ACAGAAACTG
TGGAATGGCA
TTTATTAGTT
ACGGGTAAAA
GCTCACG=F
GATAAACGTI'
TTTGTAGAAC
AGCAATCA-AA
AAGTTTACGA
CAAAATATGA
GTCGCGATGA
ATACACAAAA
ATAAGCGAAT
CTTATTCTGA
GCAAGA'ITGT
CGAAAG=TI
TAGCGAAATT
CACGTGGAGA
CAATT-GGTGA
TTACGTATAC
ATAACCATA
AGGACAAGAA
CAAAAATNAT
TGGTGAAACA
ACGCACATTC
CTATCAAGCG
GGGTAAAGAT
CCAGAAATI'A
TACCAATGAA
AGATGAAGAC
TGATAATA.AA
TTTTGGTTAC
CAATAAATAT
ACGAATGGGT
TGAAGGAGAC
AATAATTTAA
GGTTTTTATG
T'PIGTGACAG
GTGGTCGAIT
AAAGACGGGA
ACCTTrACGA
CCTGGCGATG
GGTAACGATG
ACAAGTAAAT
TTCCAAGTAA
GGAGCTGCAT
AAAGACAACA
TTTACGCCAG
CAGCAGAAAA
AATTTTTCGA
ACGATGGCGA
TAACGGAAGT
AAGATGTAGA
ACGGCCGTAA
ACAT=FAGG
rI'ACTGAAGC
ATGAA'ITCTT
TTGATAATAA
TAGAATflAAC
ATGGCAAAGA
ACTTCACGTT
TGCTAATTTT
CAAAGAAACA
AAAGCGTATC
GACGAAACAT
AGTCTTGTA
AGATTAGAA
GCCCACATTA
AAkATAAAATG
TGAITIAGAA
CAAAATCCGT
AGAATIAAAT
AATTAAAGTC
CTAA
TTAACAGAAA TTTATCCAG TACTGATTGG EF039-4 (SEQ ID NO:152) ATDKSSAASS STAVSSSAEA AKEQSKGQEL TEILSSTDWQ GTKVYDKNXN ETRGDEGTFF VTDDGEKRIL ISDTQNYQAV HIPYSDEKLT FTNGRKDLET ETGKIVTNEP ISLAKFDNKT SKYEFFDLET GKTRGDFGYF KRFTYTRMGK DNNCKEIKVF VEHEPYEGDF NLTAENANFI GLAKYDGETG VDLTEVTKDK FTYKRMGKDK GODILGATLW NGTKVLDEDG QVIDNNKIRA HVSIGDNKYG
TPDFTF
FYEFFDKETG
DGKDVEVFVE
NDVTEAkNKM'F
AALELTELND
EF040-1 (SEQ ID NO:153)
TAGATTAGAA
ATGAACA-AAA
TGTCAAGGAG
ACTAGTGCAA
GAGGAAGTGA
CAACGAGATA
CTCGACACAG
GCAAAAGCGT
CCCTATCTT
GCCACCCCAG
GAAAAAGGAC
TrCCAAAATG
CAACTTTACC
GACAATTCTG
CCACTGGAGA
AAATTAAT
GAGAAACACC
AAACTGAAAG
AAGAACCGAT
CGAAGAAGGT
AAGGCAACAC
TGATTGATCG
ATGCGGGAGC
ATTTAAATAA
ATATTGACAA
ACTTT1GGCTA
AAGATA=GT
GTCGAACAGC
AAAATCTCAT
GGGGCTA7TA
TTCCGCAGCG
CACCAGTACA
GAAGACCTAT
CAAGGAAATT
ACCGCTCAAT
GGGTGCCGAT
GCAAGGACGT
GCATAACCCT
TGTGAAGCTC
TACAGCATTG
AAAATM'TTA
AATGGACTAT
ATTTCTCTCG
AGTGTCGTGA
TCAAAAAATA
ACCCGTTCGG
GAAGTGGGTG
'ITACAAGATA
ATCGCTCTTC
ATTAATCTGC
ACGGAGATTT
TACGGTGGCA
rI'CTTAGAAG, A ITGAGGCAG
ATGGAAAATG
GCCAATCAA.A
AGGAAAGGAA
CCATI'CCATT
GTCAA.ACGGT
TAGCTCAAAC
CGCTTTAGA
CTACTTATCA
ACAATAATGA
AAAACAGCAT
TAGCGTATAT
ATGCGTTAAT
ATOGACCGA
TGGGGTACG
GTGCGGATCA
AAGGTTATAC
GTTGAGCACA
ACTTGCTGCG
GACTACTCAA
AACATCAAAA
AGCAGCCAAT
AGTGGATGAA
CATTGAGATTI
TAGTGATAGT
GTAAAACAT
TCCGGCAGCT
AGACATAGAT
TCAAGGGAAC
ATCCATTAAA
GGAAATTAGT
TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
AAAATTTTAG CACAGTACAA CTAA EF040-2 (SEQ ID NO:154) M NKKILMGLLS VVTIPLLAAC QGGETPSAAS KNSQTVTITQS SAXTESTSTT RSVAQTTSKE EVKEPMKlTYE VGALLEAANQ RDTKKVKEIL DTEGNTPLNI AVHNNDIEIA KALIORGADI NLQNSISDSP YIJYAGAQGRT TPDLNKHNRY GGNALIPAAE KCHIDNVKLL LEDGREDIDF QNDFOYTALI LYQDIVKLLM ENGADQSIKD NSGRTAMDYA NQKGYTEISK ILAQYN
QDTTYQVDEV
EILAYMLKHA
EAVGLREGNQ
EF040-3 (SEQ ID NO:155) AGCG TCAAkAAAATA GTCAAACGGT GACTACTCAA
AGTAGTGCAA
CAGGAAGTGA
CAACGAGATA
GTCGACACAG
GCAAAAGCGT
CCCTATCTTT
GCGACCCCAG
GAAAAAGGAC
TTCCAAAATG
CAACTTIACC
GACAATTCTG
AAACTGAAAG
AAGAACCCAT
CGA.AGAAGGT
AAGGCAACAC
TGATTGATCG
ATGCGGGAGC
A'I=AAATAA
ATATTGACAA
ACTTTGGCTA
AAGATA'ITGT
GTCGAACAGC
CACCAGTACA
GAAGACCTAT
CAAGGAA.ATT
ACCGCTCAAT
CGGGTGCCGAT
GCAAGGACGT
GCATAACCGT
TGTGAAGCTC
TACAGCATTG
AAAATTGTTA
AATGGACTAT
ACCCGTTCGG
GAAGTGGGTG
TTACAAGATA
ATCGCTGTTC
ATTAATCTGC
ACGGAGATTT
TACGGTGGCA
TTGTTAGAAG
ATTGAGGCAG
ATGGAAAATG
TAGCTCAPLAC
CGCTTTTAGA
CTACTT1ATCA
ACAATAATGA
AAAACAGCAT
TAGCGTATAT
ATGCGTTAAT
ATGGACGAGA
TGGGTT-ACC
GTGCGOATCA
AACATCAAAA
AGCAGCCAAT
AGTGGATGAA
CATI'GAGAT
TAGTGATAGT
C ITAAA.ACAT
TCCGGCAGCT
AGACATAGAT
TGAAGGGAAC
ATCCATTAAA
GGAAArAGT 0* @0 0 0@ .0 @0 0 0* 0* 0 0 96 @0 9* 0 000 9 00 0 0 000 0
S
*00000 0 0*@0 @000 00 @000 0 0 @000 *000 0S S 00 0S GCCAATCAAA AAGGTTATAC AAAAT'r=AG CACAGTACA.A C EF040-4 (SEQ ID NO:1SG) AS KNSQTVTTQS SAKTESTSTT RSVAQTTSKE EVKEPbETYE DTEGNTPLNI AVHNNDIEIA KALIDRGADI TPDLNKHNRY GGNALIPAAE KGHIDNVKLL LYQDIVKLLM ENGADQSIKD NSGRTAMDYA
VGALLEAANQ
NLQNSISDSP
LEDGREDIDF
NQKGYIEI SK RDTKKVKEI L QDTIYQVDE-V YLYAGAQGRT E ILAYMLXHA QNDFGYTAL I E-A\GLREGNQ
ILAQYN
EF041-1 (SEQ ID NO:157)
TAATTATTAA
ATGAAATrGA
GCGGCTTCTG
AGTCGCGAAC
CTAGCAACAG
'FrACACAAAG
GATGGACTAA
GTGACTGCTA
GAATATGCTr
GATAAATCAG
AAAGCAACAC
GACATTGTGG
GGTCCA'ITCG
AAAAACGATC
GTGAAAGAAT
CTTTCTGGTG
GCATCAACAC
NTETCTGATI'
AAAAGTCAT'
GAGGCGGCGG
AAGTTI'ACG
NCAGAAT'AG
ATAACAAAGT
CATACAAAAT
ATGACTATGT
ATCTGTATGC
AA~rAGGAAT
CATACTTTGA
AAAAATATGG
TCITAGACGG
AATATTGGGA
CACCAACCGC
AATTAGCCCA
AATATATGGA
TTCAGAAAAT
AACATTCGGT
AACGTCAGAT
TGTCACAGAA
'TTTATTGCA
CCAACCTGCA
TAAATTAAAT
TTACGGATGG
CTCTGTAAAA
TAAAGCAGTC
TTAC ITATTA
TAAAAA'ITAT
CTTTGATGGT
TAAAGATACT
GTTGAACTTG
ACAAATGGCC
ACTAAPATCAA
ACAGA'TGCA
GTGAT'ACAT
AGCTCAAGCG
CAACAAGA.AA
TTAAATAATG
GGTGCAGCGG
AAACATGCAA
CAACGAACAG
AATGGTGATG
AGTGATACAG
GCTTTCCCAT
GCATCAAACA
CCTGGTAcAG
GTGAAACTGG
TTCCAAGATG
AATOACCCAG
CGTGATGA
TTAI=TTAGG
TAT'IAGCGT
CGTCTGGTGG
TOCCAACAGC
TATATGAAGG
AAAAAGCAGA
AATGGTCAGA
TTGATCCAGC
CCATTGCTAA
AkATTAGAAAT CATTCTTr-CC
GCGAAAGTGC
ATACAAAATG
ACTCAGTAGA
GACAAACAGA
CTTTTGITAG
AATCACCATT
AGGCAACACT
AAcAACTTTA
CGGTAAGGCA
TGATTTATCA
AATTTATCGT
AGTTTCTGAA
CGGTAAACCA
GACAGCTTCT
AGGGGAAAAA
CACTTTAGAA
GCAACGTCAA
TG.TCTACAAT
GTC-ATTCAAG
TGTGAATGTC
CGATGTcGTT
TCAAAAAGPA
TAG-AAATGCG-
7, 120 TABLE 1. Nucleotide and Amino Acid Scqeuences of E. faecalis Genes.
AACTTACGTA AAGCAATTC TTACTCAATC CACCGTAAAG CGTTAGTTGA ATCAATCC'TT
AGGGGATGG
EF041-2 (SEQ ID NO:158) M KLKKSLTFGV ITLFSV~rLA ACGCGGTSDS SSASGGGKAS GEQVLRVTEQ QEMPTADLSL ATXRISFIAL NNVYEGIYRL DKDNKVQPAG GLTYKIKLNK DAXWSDGKPV TANDYVYGWQ RTVDPATASE YAYLYASVKN KSELGIKAVS DTELEITLEK ATPYFDYLLA FPSFF?QRQD) IVE(YUYA PFVLDGFDGP GTDTKWSFKK NDQYWDKIY' KLOS.VDVNVV KES PT-ALNLF SCELAQOMAN DPAFVSQKEA STQYMELNQR DEXSPFRNAN' LRKAISYSID
GW
AAEKAEVSED
GDAIAI(GEKD
SNSESAVYNG
QDGQTDDVVL
RKALVES ILR EF041-3 (SEQ ID NO:159) TTGTG GAGGCGGCGG AACGTCAGAT AGCTCAAGCG CGTCTGGTGG CGGTAAGGCA AGTGGCGAAC AAGTTTACG CTAGCAACAC NCAGAATTAG 6@ *0 S
SO
04 0@ 0 0@
OS
S
SS
.5 S S 0 @0 S S 000 0
S
@00500 0
SOS.
0 0000
TTAGACAA-AG
GATGGACTAA
GTGACTGCTA
GAATATGCTT
GATAAATCAG
AAAGCAACAC
GACA'FTGG
GGTCCATTCG
AAAAACGATC
GTGAAAGALAT
CTTTCTGGTG
CCATCAACAC
AACTTACGTA
AGGGGATGG
ATAACAAAGT
CATACAAAAT
ATGACT.ATGT
ATCTGTATGC
AATTAGGAAT
CATACTTTGA
AAAAATATGG
TCTTAGACGG
AATATTGGGA
CACCAACCGC
AATTAGCCCA
AATATATGGA
AAGCAATTTC
TGTCACAGAA
TTTTATTGCA
CCAACCTGCA
TAAATTAAAT
TTACGGATCG
CTCTITAAAA
TAA-AGCAGTC
TTACTTATTA
TAAAA.A'TAT
CTrTrTGATGGT
TAAAGATACT
GTTGAACTTG
ACAAATGGCC
ACTAAATCAA
'PrACTCAATC
CAACAAGA.AA
TTAAATAATO
GGTGCAGCGG
AAAGATGCAA
CAACGAACAC
AATGGTGATG
AGTGATACAG
GC7=1'CCAT
GCATCAAACA
CCGGTACAG
GTGAAACTGG
TTCCAAGATG
AATGAC C CAG
CGTGATGAAA
CACCGTA.AAG
TGCCAACAGC
TATATGAAGG
AAAAAGCAGA
AATGGTCAGA
TTGATCCAGC
CCA=GCTAA
AATTAGAAAT
CATTCTI'CCC
GCGAAAGTGC
ATACAAAATG
ACTCACTAGA
GACAAACAGA
CT=TTAG
AATCACCATT
CGTTAG=TA
TGATTTATCA
AATrATCGT
AGTTTCTGAA
CGGTAAACCA
GACAGCTTCT
AGGGGAAAAA
CACTTTAGAA
GCAACGTCAA
TGTCTACAAT
GTCATTCAAG
TGTGAATGTC
CGATGTCGTT
TCAAAAAGAA
TAGAAATGCG
ATCAATCC'Tr EF041-4 (SEQ ID NO:160) 606 5000
S
*000 0* 0 00 CGGGGTSDS SSASGGGKAS GEQVLRVTEQ QEMPTADLSL ATXRISFIAL GLTYKIXLNK DAKWSDGKPV TANDYVYGWQ KSELGIKAVS DTELEITLEK ATPYFDYLLA PFVLDGFDGP GTDTKWSFKK NDQYWDKD'IV SGELAQQMAN DPAFVSQK EA STQYMELNQR Gw EF044-1 (SEQ ID NO:161) NNVYEG IYRL
RTVDPATASE
FPSFFPQRQD
KLDSVDVNVV
DEKSPFR.NAiV
DKDNKVQPAG
YAYLYASVKN
IVEKYGKNYA
KES PTALNLF
LRKAISYSID
AAEKAEVSED
GDAIAKGEKD
SNSESAVYNG
QDGQTDDVVL
RKALVES ILR
TAAGATAAAA
ATTTTAGTTA
GAAAATTCTA
ATAGAAACAA
ACITCGTCAA
GCAACACCTT
GGAATGAATG
TTCCGAACTA
TTACTATAG
ITTTIAAC
CCAATCATAC
GCTCTAATAA
AGTCGAAAAA
ATGCTC.TCGA
TGCCAACTTC
A-ATT~GTTCo
CGTCTATAOG
AGGTI'GTAGT
AACATCAATA
TAAACTAAAA
TGAAGTATCT
TCTrAGTAGC
AATTACGTTA
GGCTGAAAAT
AGGAATAGTA
TCTCAAA.AAG
AAAAGCAGTA
GAAACTTcAG
ACAAATGTCG
TTAAACAATC
GAGAACTTAA
GGTCAAGTGA
TGAAAAATT
CGAATGAkACC
CTAATCATTA
A-AAGTGCCAG
A.AGAAGCAAA
CACTCGTATT
ATTCAACACC
AAGAAGCCAT
AGTTTGTGTT
TAAAAAACAA
CAGTTCTAGC
CACCACTCAA
TTCTI'TAGAA
TAATTITAAA
AAcTGCTACC
TAATAAATAT
121 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. fuecalis Genes.
GAGCTATCTA
AGTATTCGCA
GATGAACAAA
TTAATCACTC
GTAATTCAAT
TAAATACAAT TCCTACAAAA GAGATTAGAA TATITICAGC GGCCGATAAC CCGTTAAAGT AAATACAGAA TTAATI'TAG GALACTAATAT TTCTTAAAC ATAGATCGGG CACTTTATAC TTATTCAACA ATAAAAATGG TrCGATATCT CTAACTACGC TGGCAATGTT ACGGATG-ATC AAAAAGACGT TATGCTAGA-A
AA
EF044-2 (SEQ ID NO:162) MKKLVCVI LVIFLTGCSS QKANEPKKQE NSTNHITSIK SSTNHYSSSI ETSSNNKLKE TSESASTTQT SSKSKNEVST NVEEANSLEA TPYA'JDLSSL NNPLVFNFKG MNVPTS ITLE NLNSTPTATF RTXLFGAENG QVK-A INKYE LS INTl PTKE I RIFSAADNS IRTVKVNTEL ILGTNISSND EQNRSGTLYL FNN~kNGSISL ITPNYAGNVT DDQKDVMLEV
IQ
EF044-3 (SEQ ID NO:163) TTGTAGT TCTCAAAAAC CGAATGAACC TAAAAAACAA
GAAAAITCTA
ATAGAAACAA
ACTTCGTCAA
GCAACACCTT
GGAATGAATG
TTCCGAACTA
GAGCTATCTA
AGTATTCGCA
GATGAACAAA
TTAATCACTC
GTAATTCAA
CCAATC-ATAC
GCTCTAATA-A
AGTCGAAAAA
ATGCTGTCGA
TGCCAACTTC
AATTGT'ITGG
TAAATACAAT
CCGTTAAAGT
ATAGATCGGG
CTAACTACGC
AACATCAATA
TAAACTAAAA
TGAAGTATCT
TCTTAGTAGC
AATTACGTTA
GGCTGAAAAT
TCCTACAAAA
AAATACAGAA
CACT1'ATAC
TGGCAATG'T
AAAAGCAGTA
GAAACTTCAG
ACAAATGTCG
TTAAACAATC
GAGAACTI'AA
GGTCAAGTGA
GAGATTAGAA
TTAAITTTAG
TTA TCAACA
ACGGATGATC
CTAATCATTA
AA.AGTGCCAG
AAGA.AGCAAA
CACTCGTATT
ATTCAACACC
AAGAAGCCAT
TAT'LTCAGC
GAACTAATAT
ATAAAAATGG
AAAAAGACGT
CAG'ITCTAGC
CACCACTCAA
FrTTTAGA.A
TAATTTTAAA
AACTGCTACC
TAATAAATAT
GGCCGATAAC
TTCTTCAAAC
TTCGATATCT
TATGCTAGAA
EF044-4 (SEQ ID NO4:164) CSS QKANEPKKQE NSTNHTTSIK SSTNHYSSSI ETSSNNKLKE TSESASTTQT SSKSKNEVST NVEEANSLEA NNVPTS ITLE NLNSTPTATF RTKLFGAENG QVKEA INKYE IRTVKVNTEL ILGTNI SSND EQN'RSGTLYL FNNKNGS ISL
IQ
TPYAVDLSSL
LS INTIPTKE
ITPNYAGNVT
NNPLVFNFKG
IRIPSAADNS
DDQKDVMLEV
EF045-1 (SEQ ID NO:165)
TAGCCAA.AAA
TTAGCTTCTG
AATA.AACCAA
APLAGCAA771G
CTTTTCCAC
CAGCCATrAT
AAATI'AGATG
GACGGTAAAG
GACTATACAG
CATGATGGTA
ATCACTTATA
GTTTTACCAA
GrrCGTAAAA
TCTGI'TAAT
ATGAGGGAGG
TGTTAGTATT
CGGAAACAGA
ATGGCGGTAC
AAGAATTTTA
TTAACAATGA
AAGATGCCAA
ATGTGACAGC
GGATTCGTrA
AATCGCCAAC
AAGAAGTTCA
AACATG.CCT
ACCCTCTGAC
ACCTACCAAA
AAAAGAGATG
CGGGTTAGCT
AGACATTTCA
ATTAGATGTC
TCAAGACAAC
TGCAGACTTT
TACACCAACC
cGATGACGTG
TGATGATAAC
CATITCTGGC
CCCAGGAATG
TGAAGGAAT
TATTGGACCA
TGAGCATTAC
AACAAGAAAC
GCATGTGGTG-
AAAATGCCAA
GCTGTAGTTA
TATGATGCAC
AAGA'ITGTCG
ATrAAAT'AC ATTrCTCTT TTrACGAATA
ATAGAAAAAG
CAACAATTAG
GCTGTTAAAG
TACTACATGA
TACGGTGGTA
GGATTTTAGG
GCGGCAATAA
TCGCTGTTAA
TGGATACACA
AATACATGCT
ATGGGGGTCC
GTGACAATTT
ATGAAGTCAT
=TTTGGCAT
TCAATGATAA
GTGGCGGTGT
ACATGGAATC
GTAATATTGT
AACCTAAA-T
TGCAATCACG
AGGCGGGGGC
AAATGATAAA
ATTCCAAGGA
TCCAACGG-TA
TGCGGATCTG
GAAATGGTCT
TGGTCATAAA
GGAAGACTAC
AGAAGT'AAA
TFcGGGGCTCA
AAGCGATGCA
GACAGGTGAA
AGATAAATTA
122 TABLE 1. Nucleotide and Amino Acid Seqeuences of Efiaecalis Genes.
GTGTTCAAAT CTGTTCCTTC TGCGAGCATT ATTGCATTAT CAATGCCA-AC ACATACOTAT
ATCTTAGGAC
AAAGAAACAA
CAAGCCATGG
CGAACAGGGG
AAAGGCTATA
GACGTAGACG
GCTTCAATGT
AAAGAAATTG
TATCATAAAT
GGTTCAGATC
TTTGAGTCAG
GAAGAAAAAC
GTAATCCCAA
ACTTGGGCAG
GTCCCGAACA
ATACAGTGAA
GCTATGCAAT
CAACAACGTT
CGCTTGATT
GCGATGGCAT
CAGGCCGCGA
GCTTAAACGT
TGAAAAATGA
CTTCACCAAC
AAGAAAATAC
GTAAAAAAGC
CCCTTTACAG
TTGATACGA.A
AGCCTACACG
ATACAATCCA
TCACAATGAT
AATCCCACCA
AGACAAAGCG
TCGCGAAGAC
AACTGCACAA
PLACGTATACA
TGACCCAGAA
CGGCTTATAT
TAAAFI'ACTT
CTTCTACGAT
AAATGAAGTC
AGATAATCCA
GTAGAAGCGA TGAAAGCGAA CCAACATACA AAGATACTGA TATATTGGCT TTAAAATGGG AAAGCTAAAA TGGCAGATAA GCAGTCGGCC AA.AAATTCTA GTCTTCAAGA GCTTGCATGA AAAAAATTAT TAGACGATGC AAAGAAGGCA AACCACTAGA CCAC'ITGCTG ATTACTAT ACAGGACGCT TAATTGNI~r G-TAGATATCT ATCAAGGCGC GGfCcAAACT CACCT1=AA GATGCGATTG ATTCAAAAGC TGGCAAGAGT ATGCCATTGA TTGCCTGTCA ACGACCGTGT TGGGCAACGG TGGGTGTCAC
ACAATACGAT
AGGGTATCAA
TACGTTTGAC
AAGCTTACGT
CAACGGCTTA
TAGCGAAGCG
TGGTTATAAA
AATCAAGTTT
CCAACAATGG
CCAAGCATTC
GTGGGGCACA
CTATACACGT
ATCATTTGAT
TGAAGCGT
AGTI'GACTTT
AGCAGACTCA
CGGAAATAA
EF045-2 (SEQ ID NO:166) MN KKRILGAITL ASVL.VFGLAA CGGGNKGGGN KATETEDISK MPIAVKNDK
AIDGGTLDVA
LDEDANTATI
DGKSPTISG I
RKNPVTIGPY
ALSMPTDTYP
AMGYAIDNDA
VOGDGIREDK
DKLKNDDPEV
EKRKKAFYDW
K
VVMDTQFQGL
KLRDNLKWS D EKVNDKEVK I
YMSNIVTGES
TYKDTEGYQI
VGQKFYNGLR
EGKPLEIKFA
DIYQGAWGTG
QEYAIDEAFV
FQQEFYQDNY
GKDVTAIDDVI
TYKEVHPGMQ
'EYLPNEHYY
LGRPEQAYTY
TGATTLIPPV
SMSGGETAQP
SDPSPTGLYG
IPTLYRNEVL
DAQYNLPTVQ PLFNNDAE)FK IVDGGPADLK FSYEVIGHKD YTGIRYDE)NF TNIVGMEDYH
QLGGGVWGSV
GGKPKLDKLV
IGFKNGTFIDK
FKSLHDSEAK
LADYYVQQ WK
PNSAFNYTRF
PVNDRVVDFT
LPKHAFEG IA
FKSVPSASIV
ETNTVKYNPK
GYTLDLDKAK
EIGLNVTYTT
ESEENTKLLD
WAVDTKDNPW
EAMKAKQYDI
AKMADKSLRQ
KLLDDAGYKD
GRLIDFQAFY
AIDSKASFDE
ATVGVTADSR
EF045-3 (SEQ ID NO:167) ATGTGGTG GCGGCAATAA AGGCCGGGGC
AATAAAGCAA
AAAGCAATTG
CTTTTCCAGC
CAGCCAITAT
AAA'ITAGATG
GACGGTAAAG
GACTATACAG
CATCATGCTA
ATCACTTATA
GTTTTACCAA
G PTCGTAAAA
TCTGTTGAAT
GTGTTCAAAT
ATTGCATTAT
ATCITAGGAC
AAAGAAACAA
CAAGCCATGG
CGAACAGGGG
AAAGGCTATA
CGGAAACAGA
ATGGCGGTAC
AAGAATTTITA
TTAACAAT'GA
AAGATGCCAA
ATGTGACAGC
GGATTCGTTIA
AATCGCCAAC
AAGAAGTTCA
AACATGCCTT
ACCCTGTGAC
ACCTACCAAA
CTGTCCcTrC
CAATGCCAAC
GTCCCGAACA
ATACAGTLGAA
GCTATGCAAT
CAACAACCTI
CGCTTGAIT
AGACATI'TCA
ATTAGATGTC
TCAAGACAAC
TGCAGACTTT
TACAGCAACC
CGATGACGG
TGATGATAAC
CATTTCTGGC
CCCACGAATG-
TGAAGGAAI~T
TATTGGACCA
TGAGCATTAC
TGCGAGCATT
AGATACGTAT
AGCCTACACG
ATACAATCCA
TGACAATGAT
AATCCCACCA
AGACAA.AGCG
AAAATGCCAA
GCTGTAGTTA
TATGATGCAC
AAGATTGTCG
ATTAAATTAC
ATTTTCTCTT
TTACGAATA
ATAGAAAAAG
CAACAATI'AG
GCTGTTAAAG
TACTACATGA
TACGGTGGTA
GTAGAAGCGA
CCAACATACA
TATATTGGCI
AAAGCTAAAP
GCAGTCGGCC
GTCTITCAAGP
TCGCTGITAA
TGGATACACA
AATACATrGCT
ATGGGGTCC
GTGACAATT
ATGAAGTCAT
'TGTTGGCAT
TCAATGATAA
GTGGCGGTGT
ACATGGAATC
GTAATATI'GT
AACCTAAATT
TGAAAGCGAA
ALAGATACTGA
TTAAAATGGG
TGGCAGATAA
AAAAATTCTA
LGCTTGCATGA
AAATGATAAA
ATTCCAAGGA
TCCAACGGTA
TGCGGATCTG
GAAATGGTCT
TGGTcATAAA
GGAAGACTAC
AGAAGTTAAA
TTGGGGCTCA
AAGCGATGCA
GACAGGTGAA
AGATAAATTA
ACAATACGAT
AGGGTATCAA
TACGTTGAC
AAGccTACGT cAAcGGCTTA
TAGCGAAGCG
AAAAAATTAT TAGACGATGC TGGTTATAAA 123 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes.
GACGTAGACG
GCTTCAATGT
AAAGAAAI'rG
TATGATAAAT
GG~TTCAGATC
TTGAGTCAG
GAAGAAAAAC
GTAATCCCAA
ACTTGGGCAG
CGGAAA
GCGATGGCAT
CAGGCGGCGA
GCTT~AAcG.T
TGAAAAATGA
CTTCACCA.AC
AAGAAAATAC
CTAAAAAAGC
CGC'ITTACAG
TTGATACGAA
TCGCGAAGAC
AACTGCACAA
AACGTATACA
TGACCCAGAA
CGGCTTATAT
TAAATTACTT
CTTCTACCAT
AAATGAAGTC
AGATAATCCA
AA.AGAAGGCA AACCACTAGA AATCAAC'IT CCACTTGCTG ATTACTATGT CCAACAATGG ACAGGACGCT TAAzTTGATI'T CCAAGCATTC GTAGATATCT ATCA-AGGCGC GTGCGGC-ACA GGTCCA.ACT CAGCC'F=AA CTATACACGT GATGCGATTG ATTCAAAAGC ATCATTTGAT TGGCAAGACT ATCCCATTGA TGAAGCTT TTGCCTCTCA ACGACCGTGT ACTTC.ACTT1 TGGGCAACGG TGGGTGTCAC AGCAGACTCA EF045-4 (SEQ ID NO:168)
CGGGNKGGGN
Al DGGTLDVA
LDEDANTATI
DGKSPTISGI
RKNPVTIGPY
ALSMPTDTYP
AMGYAIDN'DA
VDGDGIREDK
DKLKNDDPEV
EKRKKAFYDW
K
KATETED ISK
VVMDTQFQGL
KLRONIJKWSD
EKVNDKEVKI
YMSNIVTGES
TYKDTEGYQ I
VGQKFYNGLR
EGKPLEIKFA
DIYQGAWGTC
QEYAI]DEAFV
MP IAVKNDKK
FQQEFYQDNY
GKDVTADDVI
TYKEVHPGMQ
VEYLPNIEHYY
LGRPEQAYTY
TGATTLIPPV
SMSGGETAQP
SDPSPTGLYG
IPTLYRNEVL
DAQYMLPTVQ
FSYEVIGHKO
QLGGGVWGSV
GGKPKLDKLV
IGFKMGTFIDK
FKSLHDSEAK
LAIDYYVQQWK
PNSAFNYTRF
PVJNDRVVDFT
PLFNNDADFK
YTC IRYDDNF
LPKHAFEGIA
FKSVPSAS IV
ETNTVKYNPK
GYTLDLDKAK
E IGLNVTYTT
ESEENTKLLD
WAVDTKONPW
IVOGGPADLK
TNIVGMEDYH
VKDMESSDAV
EAMKAKQYDI
AKMADKSLRQ
KLLDDAGYKD
GRLIDFQAFY
AIDSKASFDE
ATVGVTADSR
EF046-1 (SEQ ID NO:169)
TAGGAGGATA
TGTAGTAATA
CAGCAAkACTA
ACATCATCTA
GATGATGTT
ACGGATAAAG
AATTATATTA
TAATGAAAAA
ATACTGGGGG
CCCAGTCTTC
TAACAATTGA
CAAAAACTAG
AACTAAAGGA
AGCAAAAA
AAAACTTATT
AAAAAATAGC
TAAAAAAGAT
AACAACCGAG
ACGACAATTG
ATATATATCA
GTACTATTGT
GACGCTTCAT
AGTAGTAATC
AATTTAAAGA
TATGAACAAG
GAGGCTAAAG
TAGCC'Aq-'I
CTACTGAAGT
CGGACACAAC
ATAGAGAATT
GAATTAACAG
AACAAAGPA
TTTAACGGCA
ATCAACTAAG
ACcAAcTTCT
GAATCCAACA
TTCAACAATT
AGATGTCAT'
EF046-2 (SEQ ID N4O:17O) MKKKLIV LLLALFLTAC SYNTGGKNSD ASSTEVSTKQ QTTrQssKKDS SNPDTTPTST SSITIETTEN LKNRELNPTD DVSKTRRQLY EQGINSST IT DKELKEYISE AKEQKKDVIN YI4QK EF046-3 (SEQ ID NO:171)
A
TCTAGTAATA
CAGCAAACTA
ACATCATCTA
GATGATGT
ACGGATAAkAG
AATTATATTA
ATACTGGGGG
CCCAGTCTTC
TAAcAATTGA cAAAA.ACTAG
AACTAAAGGA
AGCAAAAA
AAAAAATAGC
TAAAAAAGAT
AACAACCGAG
ACGACAATTG
ATATATATCA
GACGCTTCAT
AGTAGTAATC
AATTTAAAGA
TATGA.ACAAG
GAGGCTAAAG
CTACTGAAGT
CGGACACAAC
ATAGAGAAT
GAATTAACAG
AACA.AAAGAA
ATCAACTA.AG
ACcAAcTTCT
GAATCCA-ACA
TTcA.AcAATT
AGATGTCATT
TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
EF046-4 (SEQ ID NO:172) C SNNTGGKNSD ASSTEVSTKQ QTTQSSKKDS SNPDTTPTST SSITIETTEN LKNRELNPTD DVSKTRRQLY EQG INSST IT DKELKEYI SE AXEQKKDVIN
YIKQK
EF047-l (SEQ ID NO:173)
TAGGGAAAAC
ITA.ACCCTTG
TCCAGTGCTG
AGTAGCAAAG
TCGAAAGCAA
TCACCAGCGA
AACATGTTTC
TTAACTGCAG
GAAAAAGAAG
=TGAGAAAA
GACAATGGCG
GCAGGTTCTA
AATATCAATG
GAAACATTAC
GACTATAATC
AAGGAGGAAT
CTTTAGCAGC
CGGAAAGTAC
CAACAACAAA
CAGCT1'CTTC
AAAATCGCGAA
CTGGTCAAGG
CGACAACTTC
CGATTCCAGT
AAACATATGG
GTCAACCAGT
GTIACITATC
GTG-AATCCC
CAGCACCGAA
GAAACTCAGT
TCTTATGAAA
ATGCGGCGGC
CACAGTCGAG
ATCTAGTGAT
TACGAAGGAA
TCCAGATGAC
CTTACCCCAG
ACAAGCGGAT
GAATGATGCA
CTCTGATCC
AGATTAGGT
TTGGCAAGAA
TGATGATTTA
TACCGTTGGT
AGTTTGGCAA
AAGATAGGGC TTA=TCTAG TGCTrPTICTT
GGAAAAAGTA
AGTACAAAkAG
GCGAAACCGT
GCGGCAAATA
CAAGCCAACC
GCAATI'TPJ
CAAAACAATT
CGTGTCAATC
GAAGCAAAAA
TACAATATTA
CAGAAAATAC
CATCTCCTAC
CAGGAACAAC
ATGGCTCAGC
AAGTCTAA
CGAGTCAAAC
TCCGTGT=T
AGTTAACGCC
ATGCAGTGAA
CTGGG'-TATAA
GGATAGTCGT
AAAAGAATCA
AACACCTGAT
AGAGAAGCAA
CCAGCTAGCA
GAATAACTT
ATATTATGCA
AATTAG'ITCT
CCAAATCATT
ACAAGGGGCG
ACGGGCCTCA
TTCGAACAA
AGGAACCACT
TGTCCATCAT
GGCAAT'TGGA GTTTAGTCGT GCGAAAAATG TTGTCAACAT CAAATCACAC TGAACGTGGC GCCGGTACAG TCGTT--ACTC ACATCAATGT AA TTTGACCCAA TTCAAGCAGT GAAGATGGCA EF047-2 (SEQ ID NO:174) IMflK IGLISSAFLL T7ALAACGGG KSTENTDSRS SAAESTTVES TKASATKESS SKATTKSSDA KPSGTTTADS KATASSTKEA AN~NGSAEKQS PAKNANPDDQ ANQVLNQLAN MFPGQGLPQA ILTSQTNNFL TAATTSQADQ NNFRVL.YYAE KEAI PVNDAR VNQLTP ISSF EKKTYGSDAE AKNAVNQIID NGGQPVDLGY NITGYKQGAA GSSYLSWJQEG NWSLVVR-ASN INGESPDDLA KNVVNILEQE TLPAPNT-VGQ ITLNVAGTTD YNRNSWVWQA GTVVYSVHHF DPIQAVKMAT SM EF047-3 (SEQ ID NO:175)
ATGCGGCGC
TCCAGTGCTG
AGTAGCAAAG
TCGAAAGCAA
TCACCAGCGA
AACATGT'TC
TTAACTGCAG
GAAAAAGAAG
TTTGAGAAAA
GACAATGC
GCAGGTTCTA
AATATCAATG
GAAACATTAC
GACTATAATC
'TTGACCCAA
GGAAAAAGTA
CGGAAAGTAC
CAACAACAAA
CAGCTTCTTC
AAAATGCGA-A
CTGGTCAAGG
CCACAACTTC
CGATTCCAGT
AAACATATGG
GTCAACCAGT
GTTACTTATC
GTGAATCGCC
CAGCACCGAA
GAAACTCAGT
TTCAAGCAGT
CAGAAAATAC
CACAGTCGAG
ATCTAGTGAT
TACGAAGGAA
TCCAGATGAC
CTTACCGCAG
ACAACGAT
GAATGATGCA
CTCTGATGCC
AGATTTAGGT
TTGGCAAGAA
TGATGATTTA
TACCCI'GGT
AGTTTGGCAA
GAAGATGGCA
CGATACTCGT
AGTACAAAAG
GCGAAACCGT
GCGGCAAATA
CAAGCCA.ACC
GCAATTTTAA
CAAAACAATT
CGTGTCAATC
GAAGCAAAAA
TACAATATTA
GGCAATTGGA
GCGAAAAATG
CAAATCACAC
GCCGGTACAG
ACATCAATGT
CATCTGCTAC
CAGGAACAAC
ATGGCTCAC
AAGTGCTrAA
CGAGTCAAAC
TCCGTCTTTT
AGTTAACCCC
ATGCAGTGAA
CTGGGTATAA
GTTTAGTCGT
TTGTCAACAT
TGAACGTGC
TCCTTIACTC
AA
AAAAGAATCA
AACAGCTGAT
ACAGAAGCAA
CCAGCTAGCA
GAATAACTTT
ATATTATGCA
AATTACTTCT
CCAAATCT
ACAAGGGGCG
AcGGGCCTCA TrTGGAACAA
AGGAACCACT
TGTCCATCAT
EF047-4 (SEQ ID NO:1?76) 125 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
CGGG KSTENTDSRS SAAESTTVES TKASATKESS SKATTKSSIJA KPSGTTTADS KATASSTYE-A ANNGSAE-KQS MFPGQGLPQA ILTSQTNNFL TAAMTSQADQ NNFRVLYYAE EKXTYGSDAE AKNAVNQIID NGGQPVDLGY NITGYKQGAA INGES PDDLA KNVVNI LEQE TLPAPNTVGQ ITLNVAGTTD DPIQAVKMAT SM PAKNANPDDQ ANQVLNQLAN KEAIPVNDAR VNQLTPISSF GSSYLSWQEG NWSLVVRASN YNRNSWVWQA GTVVYSVIIXF EF048-1 (SEQ ID NC:177)
TA.AGGAGAPA
GGATTACAGT
TTGCGCTATG
CAATTAGTAG
TTAGGAGGAG
GTCAGTGGAT
ATTITTTGATA
GTGTATGATT
CGAAGTTTT
ATTCGGGTCA
GTACCTATGGC
GAGAATAATG
TATGATGAGC
TTGACAGCGA
AAAACGGTCT
GTTGTGTTCA
TCATTCAAAA
AGTTCATGAA
TGCTA-AGTGG
CGTATGCTAG
AGGAAAAAAC
AAACAGAACT
CAGCATTAGA
ATGAAAAACA
CTACAAAAAA
ATATGAGCAA
TGCAAAGTGA
GTAGI-TCGGA
AGTTCGTI"IT
ATACTCGAGT
AACAAGAACA
TTAAAGAAGC
ATCAAGTAGA
ATACCTCAGA
AAA;AGAAAG
TGCGGCAAA
TAATAGCCAA
AGATGGTAAA
AATTrGAATTA
AAG=ITCT
TTTTAAA
ATTAGGATT
AGGGCCTGTT
AACCGCCATC
AGTATATACT
ATATACAGCT
GCCAGATATT
AGCGATTGAA
GGTTGAAGAA
CAGTGAACCA
ACATGGCGAA
G'I'TTATTTA
ACAGAAGCTT
CCAGI'ATCG
GTTCAAATTG
ACACAAACAG
AAAGATTATT
GTAATGGATA
GTTGCTI'TAA
ACATCTCCAG
AAAATGGTAG
TCTCTACA.AT
GGTCATGGTG
GTGATTATGA
GAACCAGCAA
GAAAAGAAAA
TTCCAAAAAC
CTGTATCAGG
CAGCAGTTAT
CGGCAAATGA
ATTCTATGAA
AATKTITCC
GTrGCAATTGA CTGTATtGC
ATCAAGCGCT
CTITATTATGA
ATGA=TGAA
A.ACT=~AGG
CTA.ATCTAAT
GTGTGCCTAA
ACGAGGGAAC
AAGAATCGAC
AAGCACAAC
TTGTTCAACC
CTATTCGCCA
GGTATTGGCA
TACGGTAGTC
GAALATTCGGT
AGATGGTCAA
TTTTGCAAAG
CATTCCGTAT
AATGCAACCA
CTCTGGTCAA
AGGTAAAA.AA
GGGTTCGCCA
CAACCGTGCA
GTATIATTCT
AAAAGAACGT
CGCTTTTGAA
AGAATATGGC
GTTGCATGAA
GTTGCCGGAC
EF048-2 (SEQ ID NO:178) MKKRKV LFTAVM4VLAG LQLLSGCGKT EASANDTVVL RYAYASNSQP VIDSM(KFGE LVEEKTDGKV Q IEYFPDGQL FONEKHFFKV NDNQALMQPV RVMQSETAIK MVELLGGSPV DEHTRVPDIV IMNEGTKERL VFNQVDSEPF QKLVQPLHES
GGETELIELT
YDSTKKLGFV
PMGSSEVYTS
TAKQEQAIEE
FKNSSEHGEL
QTGAI DFAKV
GLTYYDSGQR
LQSNLINGAE
AAKESTAFEK
YQAIRQLAD
SGSALESFSK
SFYMSKGPVT
NNEFVLYTAG
TVFKE-AVEEE
DYSVFAIPYI
SPDDLKGKKI
HGGVAKYYSY
KKKAQAEYGV
EF048-3 (SEQ ID NO:179)
TTGCGGCAAA
TTGCGCTATG
GAATTAGTAG
TTAGGAGGAG
GTCAGTGGAT
ATTTTTGATA
GTGTATGAWT
CGAAGTI=T
ATTCGGGTCA
GTACCTATGG
GAGAATAATG
TATGATGAGC
ETGACAGCGA
ACAGAAGCTT
CGTATGCTAG
AGGAAAA.AAC
AAACAGAACT
CAGCATTAGA
ATGAAAAACA
CTACAAAAA-A
ATATGAGCAA
TGCAAAGTGA
GTAGTTCGGA
AGTTCGT=
ATACTCGAGT
AACAAGAACA
CGGCAAATGA
TAATAGCCAA
AGATGGTAAA
AAI'rGAATTA.
AAGTTTCT
TTTTTAAA
AT'rAGGATTT
AGGGCCTGTT
AACCCCATC
AGTATATACT
ATATACAGCT
GCCAGATATT
AGCGATTGAA
TACGGTAGTC
CCAGTTATCG
GTI'CAAATTG
ACACAAACAG
AAAGATI'ATT
GTAATGGATA
G TGGTTTAA
ACATCTCCAG
AAAATGGTAG
TCTCTACAAT
GGTCA=GTG
ATTCTAMAA
AATATTITCC
GTGCAATTGA
CTGTAITIC
ATCAAGCGCT
CTTATI'ATGA
ATGAT7IGAA AAC TTIrAGG
CTAATCTAAT
GTGTGGCTAA
GAAAT1'CGGT
AGATGGTCAA
TTrTGCA;AC
CATTCCGTAT
AATGCAACCA
CTCTGGTCAA
AGGTAAAAAA
GGGTTCGCCA
CAACGGTGCA
r TArATICT GTGATTATGA ACGAGGGAAC AAAAGAACGT GAAGCAGCAA AAGAATCGAC CGCTTI'GAA 126 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
AAAACGGTCT TTAAAGAAGC GG ITGAAGAA GAAAAGAAAA AAGCACAAGC AGAATATGGC GTTGTGTCA ATCAAGTAGA CAGTGAACCA TI'CCAAAAAC TTGTTCAACC GTTGCATGAA TCATTCAAAA ATAGCTCAGA ACATGGCGAA CTGTATCAGG, CTATTCGCCA GTTGGCGGAC
TAA
EF048-4 (SEQ ID NO:180) CGKT EASANDTVVL RYAYASNSQP VIDSMICFGE LVEEKTDGKV QIEYFPDGQL GGETELIELT QTGAIDFAKV FDNEKHFFKV MDNQALMQPV YDSTKKLGFV GLTYYDSGQR RVMQSETAIK HVELLGGSPV PMGSSEVYTS LQSNLINGAE DEHTRVPDIV IMECTKERL TAKQEQAIEE AAXESTAFEI( VFNQVDSEPF QKLVQPLHES FKNSSEHGEL YQAIRQLAO
SGSALESFSK
S FYM SKG PVT
NNEFVLYTAG
TVFKEAVEEZ
DYSVFAIPY I
S?DDLKGKKI
HGGVAKYYSY
KKKAQAEYGV
EF049-1 (SEQ ID NO:181)
TGAGACTCTT
AAAACAGGAG
GCCGT'ITTTA
TCAACAATTA
GCGTTAGCC
GACAAAGANA
AAGTTCTCTG
AAACAAAGAG
TTAAAAACAG
TICAGAAGATG
ATTGCTAAAG
GATTCACAAG
GGCGAAGTGT
GTAGTGAAAA
ATCAAGAAAA
AGTGACGAAT
GCAGGCTACA
AGTGATTCTA
AGCAAAACAA
TCTTT=~AA
TGCATAAGAG
GTTTAGCAGC
CTGTTGATGA
AAATGTAT
TTCAAAAAAA
ATGCATTAAA
CAGCCTATGA
C'TGGGCAAG
ATGCCAAAGC
AAAAATCAAC
CAACAACTGT
CAGAACCAAT
TGACGAAAAA
TTGCTGAAGA
TAA.AAGCGGC
TGCAAACTGA
AAACAAGCGA
CAGAATCTTC
AATGAGGTAT
AATGAAGAAA
GTGTTCAAGC
TTTTTATAAC
TTATA-AATC
C LTGACGAA
ACAAGCTGGT
TGCAGGTCTA
TTTCCATCCA
TGTCAAGAAA
AGATACTGCT
TCCTGCCGAA
TGCTGCAACA
CAAAGCAAAA
AACAAAATTA
CAATGTGAAA
ATCTTCAAGC
TACCAAAACA
TTCTAALATAA
GGTATAGTTA TAACAGANAT AAAACTANAA AAACTAATCT TAGCTGCAGC GGGCGCAATG
GGTTCAAAAG
CAAATTAAAG
TTTGAAGAAA
GCCAAAGAAC
TTAACTGAAA
AAAGCCCACT
GAAGTAGAAG
GAAATCACTG
ACCAAAAAAG
GTTAAAGA.AG
AATATGCAAA
GGCAATGACA
GCCGATCAAA
ATTAAAGATG
GCTTCTTCAG
AGCGACTCT'G
ATATCGCAAC
AACAAAGCAC
AATATGGCGA
AAGTAGAAGC
AAACATTCAA
TAAAAATTAC
CACAAATTAT
ACGGCGGCGA
ATGGCOCTAA
CTGCCTTCA.A
CCTACCA.AAC
TGAAACCTTA
CATTIvrTTC
ATGCCTTCAA
AGAAAAAAGA
AAAAAGCAAC
AATGAAAGGT
TAGCCAACAA
CAAAGTAACT
ACAAGGCGGA
GAAACAGTTA
AGATGAAGAC
CCAAGTTGCT
TTTCACAAAA
AATTAAATTT
ATTAAAAGAT
AACCTACTAT
TGAAAAAGAC
GAA.AGTCATT
GAACGCTTTA
ATCAAAATCA
AGATI'CTTCA
EF049-2 (SEQ ID NO:182) MKKK LILAAAGAMA VFSLAACSSG SKDIATMKGS TITJDDFYNQ IKEQSTSQQA FSQMVIYKVF E:EKYGDKVTD FSDALKQAGL TEKTFKKQLK QRAAYDAGLd( AHLKITDEDL EDDAKAVKKE ITDGGDFTKI AKEKSTDTAT KKDGGKIKFD EVSEPIAATN MQTYQTTYYV VKMTKNYKC NDbM(PYEKEI DELKAANVKI KDDAFKNALA GYMQTESSSA SSEXKESKSS
KTTESSSK
KXIQKNFDEA
KTAWASFHPE
SQATTVPAEV
KKIAEET KLA
DSKTSDTKTS
KEQVEAQGGK
VEAQI IQVAS
KEAAFKLKDG
DQTFVSKVIS
DS EKATDSS S EF049-3 (SEQ ID NO:183)
GTGTTCAAGC
TCAACAAITA
GCGTTTAGCC
GACAAAGANA
AAGTTCTCTG
GCTICAAAAC
CTGTIGATGA
AAATGGTTAT
TTCAAAAAAA
ATGCATTAAA
ATATCGCAAC
ITT'TTATAAC
TrATAAAGTC
CITI'GACGAA
ACAAGCTGGT
AATGAAAGGT
CAAATTAAAG
77TGAAGAAA
GCCAAAGAAC
7r'AACT.AA
AACAAAGCAC
AATATGGCGA
AAGTAGAAGC
AAACATTCAA
TAGCCAACAA
CAAAGTAACT
ACAAGGCGGA
GAAACAG'PrA 127 TABLE 1. Nucleotide and Amino Acid Seqeuetices of E.f[aecalis Gcncs.
AAACAAAGAG
TTAAAAACAG
TCAGAAGAG
ATTCCTAAAG
GATCACA-AG
GGCGAAGTGT
GTAGTGAAAA
ATCAAGAAAA
AGTGACGAAT
GCAGGCTACA
AGTCA 1TCTA
AGCAAAACAA
CAGCCTATCA
CTTGGGCAAG
ATGCCAAAGC
AAAAATCAAC
CAACAACTGT
CAGAACCAAT
TIGACGAAAAA
TTCCTGAAGA
TAAAAGCGGC
TGCAAAC TGA
AAACAAGCGA
CAGAATCTTC
TGCAGGTCTA
T='ICCATCCA
TGTCAAGAAA
AGATACTGCT
TCCTGCCGAA
TGCTGCAACA
CAAAGCAAAA
AACAAAATTA
CA.ATGTGAAA
ATC'ITCA.AGC
AAAGCCCACT
GAAGTAGAAG
GAAATCACTG
ACGAAAAAAG
CTTAAAGAAG
AATATGCAAA
GGCAATGACA
CCGCATCAAA
ATTAAAGATG
GCTTCTTCAG
TAAAAATTAC
CACAAATTAT
ACGGCGGCGA
ATGGCGGTAA
CTGCCTI'CAA
CCTACCAA-AC
TGAAACCTTA
CATTTG.7'TC
ATGCC~TCAA
AGAAAAAAGA
AAAAAGCAAC
AGATGAAGAC
CCAACTTGCT
'rTCACAAA.A
AATTAAATTT
ATTAAkAAGAT AAC CTAG TAT
TGAAAAAGAG
GAAAGTCATT
GAACGCTTTA
ATCAAAATCA
AGATTCTTCA
TACCAAAACA ACGACTCTC
TTCTA-AATAA
EF049-4 (SEQ ID NO:184) GSSG SKDIATMKGS TITVDDFYNQ IKEQSTSQQA FSDALKQAGL TEKTFKKQLK EDDAKAVKKE ITDGGDFTKI EVSEPIAATN MQTYQTTYYV DELKAANVKI KDDAFKNALA
KTTESSSK
FSQMVIYKVF
QRAAYDAGLK
AKEKSTDTAT
VKMTKNKAKG
GYMQTESSSA
EEKYGDKVTD
AHLK ITDEDL
KKDGGKIKFD
NDMKPYEKE I
SSEKKESKSS
KXIQKNFDE.A
KTAWASFHPE
SQATTVPAEV
KKIA=EETKLA
DSKTSDTKTS
KEQVEAQGGK
VEAQI IQVAS
KEAAFKLKDG
D)QTFVSKVIS
OSEKATDSSS
EFOSO-1 (SEQ ID NO:185)
TAGGGTCTGG
AAAGGATGNA
CTTGTTCTAT
GCCACAAAAG
GGTI'ACGCAG
GCCACAACTC
GGAGCTTAA
AACAATGAAT
TATCAAGTGA
TATTTAGGAA
GGAGCACCTT
GAAAACGGCC
GTTCCAGAPA
GATGGA-ACGG
ACTCAAGTCA
AATCCTGAAG
AAAAGCAGTC
AAAAAATGAA
TATTA-AGTGC
AAGCAACTGT
GAACTAAAAA
AAGAATTAGT
GTGGCAAAC
GGAATCAAAC
ACAAAGCCAA
CCTATGGGCA
TCAATGATTT
CCACAGAAAA
AACTGCCCAT
TGCCGTTATC
CTAGCCAGAT
TAGATCAATT
AAGTGACTTC
CATGCCCAA-A
TTGCCAAATT
TCAGTTAAAC
TTCGTTTGGC
GCTACTCGTT
GACGAATCCC
AGAATGGATA
TATTGTCGGG
AGATACATCG
TATTGATACG
AAGTAGCG
TTATTAATT
TAGTGCCTTA
TATTAAAGGA
GCTAATCGAA
TITCCAAGG
AATATCNGTT
GGGGCAACTA
CGCACAACAA
TCGTTACTGC
AAACCTGATG
AGTGTTCAAG
AAAAACACAT
CACTCTATGG
TrACCTAAA
ACTCAACAGC
TATTGGATT
GGTGGTCAAT
GCAGTCAACG
GAAAATGCAC
TI'CTATGGC
CCTT'ITIAG
ATTTITrC~T
CGAAGGATGA
CACCAAGT
ATCGCTTGGA
GGACCGTGGT
TTCTAT1GA
TACTCTATTT
GTGGTGITAG
TTGAAAAA?1'
AAACCATCGA
ATCA.AGAGAT
TAAGTCCAAC
CCTTGCTA.AG
AACATAGTCA
CGAGTAAAAA
TTCATGGCAG
GCTAATGGGT
CAACCAAGCC
TTTTTCAT
GAAACAAGGT
TAAAGAGCGA
AGATAATAAA
ACAAAAAAAT
TGGTIACGT
CGTCAGCATT
AACGGAACTA
GATTAATGTT
AGAT ITAAGT
ACAGCGAGGA
ATI'ACATGAA
ATAG
EF050-2 (SEQ ID NO:186) MNMPKN IXYFSLLMGL VLLLSACQIG ATTKDDNSQAA TKEAIVELNR 'FrTPTLFFHG YAGThNSFGS LLHRLEKQGA TTQELVLLNKy ALSGKATNPS VQVLFEDNKN NEWNQTEWIK NTLLYLQKNY QXTNKANIVGH LGTYGQDTSL PKIEKFVSIG APFNDFIDTS QQQTIETELE NGPTEKSSRY PEKLPILLIG GQLSPTDLSD GTVPLSSALA VNALLRQRGT QVTSQI IKGE PEVDQLLIEF LWPSKX pDCGTVKERG
SMGVSGLRY
LDYQEMINVV
NAQHSQLHEN
EF050-3 (SEQ ID NO:187) 128 TABLE 1. Nucleotide and Anlino Acid Seqeuences of E.faecatis Genes.
TTGCCAAATT
GCCACAAA.AG
GGTTACCCG
GCCACAACTC
GGAGCT'TTA.A
AACAATGAAT
TATCAAGTGA
TATTTAGGAA
CGAGCACCTT
GAAAACGGCC
GTTCCAGAAA
GATGGAACGG
ACTCAAGTCA
AATCCTGAAG
GGGGCPLACTA
AAGCAACTGT
GAACTAAAAA
AAGAATTAGT
GTGGCAAACC
GGAATCAA-AC
ACAAAGCCAA
CCTATGGGCA
TCAATGATTT
CCACAGAAAA
AACTGCCCAT
TGCCGTT'ATC
CTAGCCAGAT
TAGATCAAT
CGAAGGATGA
TGAGTTAAAC
TTCGTTTGGC
GCTACTCGTT
GACGAATCCC
AGAATGGATA
TATTGTCGGG
AGATACATCG
TA=TATACG
AAGTAGCCC
TTTATTAATT
TAGTGCCTTA
TATTAAACCA
GCTAATCGAA
CAACCAAGCC
CGCACAACAA
TCGTTACTC
AAACCTGATG
AGTGTTCAAG
AAAAACACAT
CACTCTATCG
TT1ACCTAAAA
AGTCAACAGC
TAT'rrGGA'Tr
GGTGGTCAAT
GCAGTCAACG
GAAAATGCAC
TWICTATGGC
CACCAACGCT
ATCGCFTTGA
GGACCGTGGT
TTCTATTTGA
TACTCTATTr GTrCGTGTTAG
TTGAAAAATT
AA.ACCATCGA
ATCAAGAGAT
TA.AGTCCAAC
CCTMCTAAG
ALACATACTCA
CGAGTAAAA-A
TTrTTTTCAT
GAAACAAGGT
TAAAGAGCA
AGATAATAAA
ACAAAAAA.AT
TGGTITACGT
CGTCAGCATT
AACGGAACTA
GATTAATGTT
AGATTTAAGT
ACAGCGAGGA
ATTACATGkA
ATAG
EFOSO-4 (SEQ ID NO:188) 9* 9 9* CQIG ATTKDDNQAA TKE-ATVELNR TTTPTLFFHG ALSGKATNPS VQVLFEDNKN LGTYGQDTSL PKIEKFVSIG PEKLPILLIG GQLSPTDLSD PEVDQLLIEF LWPSKK
YAGTKNSFCS
NEWNQTEWIK
APFNDFIDTS
GTVPLSSALA
LL]HRLEKQGA
NTLLYLQKNY
QQQTIETELE
VNAL.LRQRGT
TTIQELVLLVK PDGTVVKERG QVNKANTIVGH SMGCVSGLRY NGPTEKSSRY LDYQEMINVV QVTSQIIKGE NAQHSQLHEN EF051-1 (SEQ ID NO:189) 9 a.
TAAAAG;AAAA
TTCAGTTTAG
ACAAAAAAAG
AAAATTAT'C
AGTTCTGGAA
GCTTCGACAA
CTTCCTTTAT
CATCATTTI'
CCAGCTGGTC
AAACACGCAA
GCAGAAGCTrG
GCGCCCATGC
GCCTCTAAGA
AGAAAATATT
CGGCGTTCA
TTGCCCTAAT
AAGAAATAAC
CAGCCTTTGA
AATI'ACAGAT
AACAAATGAA
TGGAAAACCA
CTGAT'TAA
AATATGCCGA
GCTITGGCAC
GCTTACTTTA
CTGAAGCTT
AACAAAAATC
TTGANAATAT
AATGTCTAAA
TGCTGCATGT
CC'ITCGGCA
AAAAGAGCAT
GCAAATTGAA
TGCATTGGTT
GCTCGTTCTT
AAAACCCCAA
AGAAGGCTTA
GAATGTAACA
TCCCACAGAT
TrTGAAAAAG
AGCAGATGCT
TGGCTTTAAG
CAAAAAAAGG
ACAAATCAGC
GCAGCTAGCT
CCAGATATTC
AAAGGCCTAA
GCAGAAAAAC
AITTCTGCCTA
ATGATAGCAA
AAAGCTTTAG
GAAGTCCTTG
GCAGCA.ACCA
CCAATTATCT
T=mAAATr
TTAACAAAGT
CTGTGTTCCT
CGCAAAAAGA
TAGAATCAGT
AGGTAACTGG
AAGCCGATGT
TAATTAATAA
ACCAAGATCA
TTGGTGATCC
GCGCTT'CTC
AATGGGTAGC
ATTCAAAAGT
ATCCAGTTKG
T'TTTACAGAG
AG
GCTTAGFFI'A
AACAGTTTCA
CATGGAGAAG
AACCTATGAT
AM'TCTCA
AAAAAGTGTC
AGCAAAGTGG
TGCAAGTGTT
TTATGTAGA-A
TAATGCAAGT
AGCGATTGTT
TAAAGTTGCC
TCAACAATGC
EFOSI-2 (SEQ ID NO:190) MSKQ KKAVFLLSLF SLVALIAACT N~QPQKETVST KKEEITLAAA ASLESVMEG( I IPAFEKEHP DIQVTGTYDS SGKLQMQIEK GLKADVFFSA STKQNNALVA EKLINKKSVV PLLENQLVLI VPNQDQAXWH DFSDLKKAQM IAIGDPASVP AGQYAEEG.LK ALGAWSYVEK HAS FGTNVTE VLEWVANASA EAGLVYATDA ATNSKVAIVA AMP EAVLKKP I IYPVGKVAA SKKQKSADAF LNFLQSQQCR KYFXNIGFKL TK EF051-3 (SEQ ID NO:191) ATOCT ACAAATCAGC CGCAAAAAGA AACAGTTTCA ACAAAAAAAG AAGAA.ATAAC CCTTGCGGCA GCAGCTAGCT TAGA.ATCAGT CATGGAGAAG 129 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faeczlis Genes.
AAAATTA'TTC
AGTTCTGG;A
GCTTCGACAA
GTTCCTTTAT
CATCATTTTT
CCAGCTGGTC
AAACACGCAA
GCAGAAGCTG
CC CATCC GC CTCTAAGA
AGAAAATATT
CAGCCTTTGA
AATTACAGAT
AACAAATCAA
TGGAAAACCA
CTGATT~TAA.A
AATATGCCGA
GCT=tGGCAC
GCTTAG=?A
CTCAACCTrCT
AACAAAAATC
T'TGANA.ATAT
AAAAGAGCAT
GCAAAT'IGAA
TGCATTGGTT
GCTCGTTCTT
AAAAGCCCAA
AGAAGGCTTA
GAATGTAACA
TGCGACAGAT
TTTGAAAAAG
AGCAGATOCT
TGGCTWAAG
CCAGATATTC
AAACGCCTAA
GCAGAAAAAC
ATTTGCCTA
ATGATAGCAA
AA.AGCTTTAr.
GAAGTCCTTG
GCAGCAACCA
CCAATI'ATCT
TTTTTAAA'T
T.TAACAAAGT
AGGTAACTGG
AAGCCGATGT
TAATTAATAA
ACCAAGATCA
'ITGGTGATCC
GCGCI'CGTC
AATGGGTAGC
ATTCAAAAGT
ATCCAGTTGG
TTTTACAGAG
AG
AACCTATGAT
ATTTTTCTCA
AAAAAGTGTC
AGCAAAGTGG
TGCAAGTGT
TTATCTAGALA
TAATGCAACT
AGCGATTGTT
TAAAGTTCC
TCAACAATC
EF051-4 (SEQ ID iNO:192) CT NQPQKETVST KKEEITLAAA ASLESVMEKK IIPAFEKEHP DIQVTGTYDS SGKLQMQIEK GLKADVFFSA 2ILLENQLVL I VPNQDQAI(WH DFSDLKKAQM IA IGDPASVP HASFGTNVTE VLEWVANASA EAGLVYATDA ATNSKVAIVA SKKQKSADAF LNFLQSQQCR KYFXNIGFKL TK EF052-1 (SEQ ID NO:193)
STKQMLALVA
AGQYATEEGLK
AMPEAVLKKP
ECKLINKKSVV
ALGAWSYVEK
I IYPVGKVAA
TAAAGTAGGA
CCCACA-ATGG
'ITACTTAGTT
NGAAATCTTC
GCAGAAAATA
TCAAAAAAAA
GTTCGTAGCA
CTAACCTGGT
GAAGCGCAAG
GTACCATGG
CGTCGTGGAG
NGTTT'ITAT
GGGAGACCAC
ATCCAGTI'GT
GAACGCAAAT
ETGGCATATT
CGAAAAAAGT
TACCATTATC
AGAAGTGGAA
ATTGTGGTT
AGAAGTCGGA
GAATGTATT'G
AAAAAGATTA
GTrTTrAATA
GAATCAATCG
TTTGACAGTA
AA.ACGAAGGG
CTATTAATT
ATCCGGTTTA
CCGCAA.ACAA
CCTAAAACTG
AGTAGTI=T
GCAGCGTATC
GCACTTCCTG
GAAGTGACAT
ATATTCCTCA
CAAAAACTTC
CCATTCAATC
GTGACAATCG
GGCTG=I1CT
AAGTAGTGAT
GXITGATTGC
AATGAGANGA
A.ACAACTTAT
AGACATACCA
GCTATCAATC
AATAACTTGG
ATIAGACAA
TTATGTAGAA AAGGAGAATA A EF052-2 (SEQ ID NO:194)
MRXX
NLXFLLLLVL LIYIPQTYA ENRETTEVGI GFTKTSDIPS KKNPWVNVLP QTTIQSLSIV RSRTQIKRLP KTGDNRITWL SWFGILFLIS SFWLFLFRQL CRKGE EFOS2-J (SEQ ID NO:19S) AGAAAATA GGGAGACCAC AGAAGTCGGA ATCGGCITA CAAc'I'rC ACACATACCA TCAAAAAAAA ATCCAGTTGT GAATGTATTG CCGCAAACAA CCATTCAATC GCTATCAATC GTCGTAGCA GAACGCAAAT AAAAAGAT EF052-4 (SEQ ID NO:196) EN'RETTEVG I GFTKTSDI PS KKNPVVNVLP QTTIQSLS IV RSRTQ IKR EF053-1 (SEQ ID NO4:197) TAGTCATGGC ACCATAACAA GGAGGAGAGA AGTGACATGA AAAAATACCT TTTcTTAGT 130 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
TGTTTTAG
GAAAATAGT(3
ACAGCGACTA
TCATCCGTCC
ACCACCAATC
TITATTAAAA
A'ITTAG
GTCTTTTCAG
GTGTI'GTCTC
GTAATTCTTC
TTTCTTCTGG
AAGCACCATT
GCAGAAGAGA
CTTCTGTCAT
CTTTTATGGA
CACAACGACC
CGTATATGGA
TATTTATITG
AGATCAAAAA
TCAGACACTG
ACGTATGAAT
GAACCCACCA
TCGCGACAAG
GGAATCACC
AACAGTATrA
CGT=I-GAGA
ATCCCACAGA
AGCCAGCTGA
GAAGATrACC
TTATCACTAT
GCAGTAGTAG
AGCAGCCTAT
AGAGTCGACA
CGGAGGCGCT
AGCGACAGGT
AGGCATATTA
GGATTGTAGG
0*
S
EF053-2 (SEQ ID NO:198) K4KKYLLLSC FLGLFSFCHS DTAFGEAAYE NSGVVSFYGT YEYPTEESITT ATSNSSTTTE PTKPADGGAS SVLSSGVYGS RQ-GRLJPATrT TNQAPFIYLG ISLITIGILF IKRRREDEKN SISSSROCRI EFOS3-3 (SEQ ID NO:199) TTTGGAGA AGCAGCTTAT GAAAATAGTG GTGTTGTCTC CTT=ATGGA ACGTATGAAT ATCCCACAGA AGAGTCGACA ACAGCGACTA CTAATTC-TTC CACAACCACC GAACCCACCA AGCCAGCTGA CGGAGGCGCT TCATCCGTCC TTTCTTCTGG CGTATATCGA TCGCGACAAC GAAGA EF053-4 (SEQ ID NO:200) FGEAAYE NSGVVSFYGT YEYPTEESTT ATSNSSTTTE PTKPADGGAS SVLSSGVYGS RQGR EF054-1 (SEQ ID NO:201)
TA.AATAAAAA
AGTGCAGTAC
ACAGATCCAG
ACTGAGCCGG
AAGCCAACCG
ATTGATCCTC
CCAACCGAGC
GAACCAACTC
GAACAGCCCA
AATCAATCGG
ACACATGTAC
GTACATAAAG
TATAAGGTTT
TTGCCACATA
ITAATTTCAG
ATTArrGGA
TAGTATI'CGG
CAACTCCACC
TGGATCCTIGC
AGCCTACAAC
CAACGCCGGT
CTACAACACC
AACCAGTGAT
CTAAACCAAT
CAGGAACACA
CAAGTGAAAC
GTTTCCT
TACCTACC
CAGGAGAGAA
CATTCTTrT
GGA.AATTACA
TGGCGGAAGT
AATTACCGAA
AGAACCGCCA
ACCGAGTGAG
TCAACCGACT
AAGCGAACCA
TCCAGAAAAA
AGACGTAGTC
ACCAAGTATT
TACTCCAAI'I
AACCAAAACA
AAACGTTGAG
ATTCACACTC
CTTTAAAAAG
ATGAAAAAAA
ATA.ACAGCAT
CCAACTGAT
GTAATACCAA
CCAGAAAAGC
GAACCAACCG
GAAAAACCTG
CCAGCAGAAC
GTTACACCTA
CCTATTGAAA
ACAACAGAAG
CCAGAAGGAT
GTAAAACCAA
CTTTTCTCTG
AATAAGAAA
TTATTIATC
TCGCTGACGA
CTAGTGAkACC
CTGATCCAAC
CAACAGAACC
AGCCAACAGA
TTACTCCAGA
CAGAAACACC
GTGGAGAAAT
CAAGCAACTT
CTGGGGAAGA
TAAAACCAAT
GTGATGGAAA
TATTGGGAAG
AAGCTTAA
PAGCTTT
TTTAGGACCA
TACGAATCCT
AGAACCAAGC
AACPLACGCCA
ACCTAGTCAA
ACAACCGAA
AAAA.ACTCCT
TGATAAAACG
AGCGGAGGTA
AAITGTAGCA
TAGCAGCTCG
AATGAAAGTA
CTCTTTGTA
EF054-2 (SEQ ID NO:202) M KKIILSSLFS AVLVFGCSI TAFADDLGPT DPATPPITEP TDSSP-PTNPT EPVDPAEPPV IPTDPTEPSK PTEPTrPSEP EKPTEPTTPI DPGTPVEPTE PSEPTEPSQP TEPTTPSEPE KPVTPEQPKE PTQPVIPE-KP AEPETPKTPE QPTKPIDVVV TPSGEIDKTN QSAGTQPSIP IETSNLAEVT HVPSETrPIT TEAGEEIVAV DKGVPLTKTP EGLKPISSSY KVLPSGNVEV KASDGKMKVL PHTCGEKFTLL FSVLCSFFVL ISGFFFFKKN KKKA EF054-3 (SEQ ID NO:203) 131 TABLE 1. Nucleotide and Amidno Acid Sequences of E. faecalis Genes.
A
ACAGATCCAG
ACTGAGCCGG
AACCCAACCG
ATTGATCCTG
CCAACCGAGC
CAACCAACTC
GAACAGCCCA
AATCAATCGG
ACACATGTAC
GTAGATAAAG
TATAAGGT'T
T
CAACTCCACC
TGGATCCTGC
AGCCTACAAC
GAACGCCGGT
CTACA.ACACC
AACCAGTGAT
CTAAACCAAT
CAGGA-ACACA
CAAGTGAAAC
GTGTTCCGTT
TACCTAGCGG
AATrACCGAA
AGAACCGCCA
ACCGAGTGAG
TCAACCGACT
AAGCGAACCA
TCCAGAAAAA
ACACCTACTC
ACCAAGTATT
TACTCCAATT
AACCAAAACA
AAACGT1'GAG
CCAACTGATT
GTAATACCAA
CCAGAAAAGC
GAAC CAAGCG-
GAAAAACCTG
CCAGCAGAAC
G'ITACACCTA
CCTATI'GAA.A
ACAACAGAAG
CdAAGGAT
GTAAAACCAA
CTAGTGAACC
CTGATCCAAC
CAACAGAACC
AGCCAACACA
TI'ACTCCAGA
CAGAAACACC
GTGGAGAAAT
CAAGCAACT
CTGGGGAAGA
TAAAACCAAT
GTGATGGAAA
TACGAATCCT
AGAACCAAGC
AACAACGCCA
ACCTAGTCAA
ACA.ACCGAAA
AAAAACTCCT
TGATAAAACG
AGCGGAGGTA
AATTGTAGCA
TAGCAGCTCG
AATGAAAGTA
a a.
a.
a.
a a a.
a a a a.
a a a EF054-4 (SEQ ID NO:204) DDLG-PT DPATPPITEP TDSSEPTN'PT EPVDPAEPPV IPTOPTEPSK PTEPTTPSEP EKPTEPI~rPI DPGTPVEPTE PSEPTEPSQP TEPTTPSEPE KPVTPEQPKE PTQPVIPEKP AEPETPKTPE QPTKPIDVVV TPSGEIDKTN QSAGTQPSIP IETSNUAEVT HVPSETTPIT TEAGEEIVAV DKGVPLTKTP EGLKPISSSY KVLPSGNVEV KASDGKMKV EF055-1 (SEQ ID NO:205) a
TAACAAAAGG
GGAGGTFTT
AGTT=TTA
TACCAAAATC
GCTGATCCCA
CAGACCACCA
AGCATTCTCN
CATGAAGCAA
TTGTI'TGTC TITCTGTGT CAATG-AAAAA AAAGCGTTAT TAAATGTTGA AGCGTCTGAT CGAGAACACC CGCTCCTAAA AGGAACCAGC TGGTCCTCCG CAACTGGCTC AACGCTCCCT GNTTCGCCTT AATCGGT=
ACTAA
AAAAGGGCAA
TTAATGAITG
GGTGGTTCTA
GATCCCCCAC
CAAGGACATC
GAAAGGCTAG
TGTGTCTACT
GTTCGGTGGG
CGAAAACAGA
AACGAAGTGG
CGAGTTAAAA
ATCTTCTCCT
GATTGAAITTT
TGCGCCAGCT
TGGTTCGACA
GGCAAATTTG
GAAGGGACGA
CGTACAGGGA GCAAGAGTCA GCGGGAATCG TACATAGAAA EF055-2 (SEQ ID NO:206) MrKKKRYL NIVCLLSSPS FFINVEASDG GSSSVGIEFY QNPRTPAPKD PPPKTDAPAA DPKEPAGPPQ GDQRSGGSTQ ILXFA.LIGLA GIVHRKGRH EAN TTTGSTLPR TGSKSQANLS EFOS5-3 (SEQ ID NO:2C7)
AGCGTCTGAT
TACCAAAATC
GCTGATCCCA
CAGACCACCA
GGTGG'rICTA
CGAGAACACC
AGGAACCAGC
CAACTGGCTC
GITCGGTGGG
CGCTCCTAAA
TGGTCCTCCG
AACG
GATTGAA TT
GATCCCCCAC
CAAGGAGATC
CGAAAACAGA TGCGCCAGCT AACGAAGTGG TGGTTCGACA EF055-4 (SEQ ID NO:208) SDG GSSSVGIEFY QNPRTPAPKD PPPKTDAPAA DPKEPAGPPQ CDQRSGGSTQ TTTTGST TABLE 1. Nucleotide and Amino Acid Seqeuences of E. fuecai-% Genes.
EF056-1 (SEQ ID NO:209)
TAAATGAAAA
ATAAATGTTG
CCGGCAACAC
AAGGAACCAG
ACAGCTGGCT
GGTCTTGTCT
AACTAA
AAAAGCGTTA
AAGCATCTGA
CCGCTCCTAA
CTGGTCCTCT
CGCAGCTCCC
TGATTGGTCT
TTTAATAATT
CGTGGTTCT
AGATGCCCCA
OCAAGGAGAT
TCGTACAGGA
TGTCGGAATG
GCGTGTTTAC
AGTTCGGTGG
CCGAAAACAG
CAACGAAGTG
AGCAAGAGTC
CTCCAGAGAA
TA=FTCCCC
GAA=C'A.ATT
ATGAGCCAC
GTGGTTCGAC
AGGCAAAkCCT
AGAAGGGACG
TAGI-PTTTT
TTACCAAA.AT
'TGCGGATCCC
ACAGACCACC
GAGCAI'CPI
ACATGAAGCA
EF056-2 (SEQ ID NO:210) MKXXRYLIIA CLLFSPSFFI NVE-ASEGGSS SVGIEFYQNP ATPAPKDAPP KTDEPAADPK EPAGPLQCDQ RSCCE=QTTT AGSQLPRTG-S KSQANLSILW LVLIGL/CMV QRKKGRHEA.N EF056-3 (SEQ ID NO:211) ATCTGA GGGTGGTTCT AGTTCGGTGG GAATTGAAT TTACCAAAAT CCGGCAACAC CCGCTCCTAA AGATGCCCCA CCGAAAACAG ATGAGCCAGC AAGGAACCAG CTGGTCCTCT GCAAGGAGAT CAACGAAGTG GTGGTTCGAC ACAGCTGGCT CGCAG TGCGGATC CC
ACAGACCACC
EFPO%-4 (SEQ ID NO:212) SEGGSS SVGIEFYQNP ATPAPKDAPP KTIDEPA-ADPK EPAGPLQGDQ RSGGSTQTIT ACSQ EF057-1. (SEQ ID NO:213)
TAATCTTTAT
ATCATAAAAA
AGCGNTATGG
TACGTI'TATG
CAGGGAAGAA
GAAAATCCGC
AATAAAATGA
TGGCTGGCC
GGTTTAGTTT
CTGAAGAAAA
AGAATGAAAA
ACAATTTAGC
TGTATTCCTT
AACGAZAGAA
AGTCAATGTT
GGTATGTTTA
TAATTATGAA
AGAGTCAAAT
TGCTTCTGGA
GATAGGAGTT
GAAAATGGGC
GGGCTATTA
TCAAATGGTC
GACGTAGCCT
CAAGCAGT'I
AGTTTG
A.AGGAGGAAT
TCATTGGTT
AAGCGACT
ATACCCAACA
TACCTAAAAC
GGATAGTCAT
TCAGATCAA
GCNAACAAAA
CTATGGTACC
ATCAGAAGAA
AGGCGAGTCT
TTAT=AAT
GGAC TTATT TALA EF057-2 (SEQ ID NO:214) MXI IKRFSLVCLG LLIIGLXTKS XMAEENNYES NGQASFYGTY VYENEKESND VAYTQQSEEQ GRNNLAASGQ AVLPKTGESE NPLYSLIGVS LLGIVIYLI"
KMKREKEFI
EF057-3 (SEQ ID NO:215) AAA TAMTTATGAA TCAAATCCTC AAGCGACCTT CTATGGTACC TACG ITTATG AGAATGAA.AA AGAGTCAAAT GACCGTAGCGT ATACCCAACA ATCAGPACAA CACGGAAGAA ACAATTTAGC TGCTTCTCGA CAACCAGTTT EF057-4 (SEQ ID NO:216) EENNYES NGQASFYGTY VYENEKESND VAYTQQSEEQ GRNNLAASGQ AV 133 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. foecalis Genes- EF058-1 (SEQ ID NO:217)
C.
C C
C.
C C
C.
C C
C
C
C
CCC.
C
TGAAGAACGT
TITGGCTATTT
TGGTACACCG
ACAACTGCAT
ATGACGGAT
AAATATCAAG
GAGCAACGAG
CCTGGGAAAC
CCTAAAAAAC
GTAGTTGCTG
GATGG'-!TCCT
GTGGTAGCCA
TTAAATGGCG
ATCCAAGCAG
=F~ATTACTG
ACGGGAGAAT
GCTCCAAATA
AACAATCAAA
ACACCAAGCT
GTAAATATTC
AATTTAGTTG
GCTTATGCGT
CAAGCAAATG
GGCACACTAA
'ITTAAAAATG
GAAGTTGTGA
GCCTTGGCGG
AAAATCGATG
rI'TACAACAA
TTAGAAGAA-A
GTCAATGA-AC
AAACACAA.AG
GGCCCTCT
TAA
TCTA'TT-GC-T
TATCAAAA-AC
TTAGTACCTT
TTGCAGAAGA
TACCAGATCC
GACTGGCAGA
CGGCAGGCGC
CTMTTGCTCA
AAAATGGTAA
CTACGAATAT
ATAAATATGG
ATGATGGTAG
CAGAATTTGT
TCAAAGATG
GGAAAAGTTA
TAACAGTTAA
ATGCAGAAT
CACCTGTTGA
TAGATGGTAA
CATTGGGGAT
ATAAACATGA
TATATGATGG
GCTTCACTGT
AkATTCGTTA
AGCGAATGT
CAGGTGGGAA
GAGCTTCCTT
AAACAACGAA
CGGCTGATGG
CTGTAGCTCC
AATCATATGG
GTACCTTACC
TGCTACTTAT
TGACGATTGC
AAAAAAGAGG
GTTACTAAT
AAATGGGGAG
GCCTATCAA.
TGTGACGTT*T
AAGCGTTGAT
AGGAACCACC
AGATGCAGTC
GGTGGTGGCG
AACAGAAGAA
TT-TACATGTG
TATTTCTAAA
ATTATATACA
TGAAATTGGC
AAATCTI'GAG
AA ITCAAAAT
AAAAACAGTC
AGATGTGGCA
TGCAGACAAA
TGCAGCCTTA
GGATACAGTG
CGCCGTTAAT
CTTTATIGCAT
TGATAACGGT
ACGTTTCATT
TGTCCTCCGT
AGCAGCAACT
ATTAGTTGAT
TCATGATTAT
CACAACAGAA
TTCAACAGGT
TGCAGC-AGTC
AGGCCTGCTA
AGAGAGAAAA
TTGCCAC~rT
PGCGCACAGC
AATAGCGGGA
A.GTATTTATA
GCAGCTAAAC
6ATGCAAATG
TATACCATA
TTCCCAGT
TTAGCGGTTG
AAAAAAGTAG
AGCGAAGGCT
TGGACAACGG
GAAAATGATT
GTTGGTTCGT
CAAACAAAAA
AAA.AATCATA
ATTGGCGAAA
GAAGGCGACG
ACTT=TATA
AT1'CTCCTG
CCAGCGTATA
TTAAATGAAA
CATACCGACG
AAAGTCGATG
GATCAAAACA
TGGGTGAAAA
ATCACAGGGC
CTCTTG ITAA
AACCTAGTTT
GGCAAAGGAA
TACTTTGCTA
A.TCATTGGGA
TGAAGCAA T rcACAAGTGT
I'CGTGATTCA
AAGAAATCAG
A.CGTGACGAA
AAGCTGTCCA
GGAATGTCAC
AAGAAGAACC
ACGAAATGAT
'ITCATAMTA
GAACTGCTGA
CAC CAGGCAC
ATA-AAGAACA
TCA.CAGAAGC
ATA'F"IAGA
CAC CATTTAC
CCTCTAA.AGT
AAATrAAATA
CTAATAAATA
ACGTGACTTC
AAAATTATCA
ITrCCTACGCT
AAGCAGATCC
ACCAAACACC
GCGATGTGAC
GCGACACAGC
CAA.AAGCTGA
TTAAATACGG
CAA.ATCGGAT
CACCACAAAA
TCTACGTTA
GACGTAGAAA
E'GGTAGTCAT
kAAAAAAGTT
%.TTAGGGACA
CAAAAAGAAA
CACTTTGAT
CGAATTTTAC
AAGTTTAACT
TGTTCAGTTA
PAAAGAGGGT
CAAGCAAACA
TCCTAAAAAT
AAATGAAGGA
AGTAAAATAT
AGCAAAACC
AGAGAATGGA
AGA.AGTAAA-A
AATTGAAGCA
TGATAAAACA
TCAAATTTCT
CGTCAAATTC
TGGAGAGTAT
AGTGACTGAA
AACACCAGGC
TACGAAAGGC
ACCAACTGTT
AGCGACACAA
AAATTA'ITTG
AGCAACTACT
TACCTATTAT
TCAATTTGTG
AGTACCAAAC
CTTAGGAAGT
AGAAAATGCT
EF058-2 (SEQ ID NO:218) 14KQLKKVW YIVSTLLLIL PLFTSVLG TT
TAFAEENGES
QRAAGASVDA
VAATNMVVAF
NGAEFVISKS
GELTVKNLEV
PSLDGKDVAI
YALYDGDTVhI
KNEANVDNCH
IDETTKAATW
REQSYGTTEN
AQLVIHKKKM
AKQAVQSLTP
PVYEMIKQTD
EGSPGTVKYI
GSYILEEVKA
GEKIKYQISV
APENYQVTEQ
TDDQTPPTVE
vVTKAEATTF LVS PEKVPN( TOLPDPL IQN
GKPVAQGTTD
GSYKYGTEEL
QGVKDGLYW
PNNAEL IENQ
NIPLGIADKE
ANGFTVAVNP
WVTGGKRFIK
'ITTADGLVDI
HKGTLPSTGG
SGKEMSEFDK
ANCNVTVIQLP
AWN IYPKNV
TTDKEQAKRF
TKTPFTIEAN
GDANKYVKFN
AYIPTLTPGG
VDGDVTATQA
TGLKYGTYYL
KGIYVYLGSG
YQGLADVTFS
KKQNGKDAVY
VANIDGSLHVK
ITGKSYEIGE
NQTPVEKTVI(
LVOKHDAALT
TLKFVYFMHL
LAGASFVVRD
EETVAPDDYV
AVLLLIAGVY
IYNVTNEFYE
TIKEEPKEGV
KvGTAENEGL
NDFTEA.ENGT
NDTSKVDKTT
FDN~VTSGEYA
NEyKADPTKGF
QNSDTANYLK
LLTNRI EFVV
FARRRKENA
EF058-3 (SEQ ID NO:219) 134 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecaiis Genes.
ACAAGA AAATGGGGAG AGCGCACAGC TCGTGATI'CA CAAAAAGAAA
ATGACGGATT
AAATATCAAG
GAGCAACGAG
CCTGGGAAAC
CCTAAAAA.AC
GTAGTTGCTG
GATGGTTCCT
GTGGTAGCCA
TTAAATCGCG
ATCCAAGGAG
TTTATTACTG
ACGGGAGAAT
GCTCCAA.ATA
AACAATCAAA
ACACCAAGCT
GTAAATATTC
AA'ITTAGTTG
GCTTATGCGT
CAAGCAAATG
GGCACACTAA
TTTAAAAATG
GAAGTTGTGA
GCCTTGGCGG
AAAATCGATG
TTTACAACAA
TTAGAAGAAA
GTCAATGAAC
AAACACAAAG
TACCAGATCC
GACTGGCAGA
CGGCAGCCC
CTGTTGCTCA
AAAATGGTAA
CTACGAATAT
ATAAATATGG
ATGATGGTAG
CAGAATTTGT
TCAAAGATGG
GGAAAAGTTA
TAACAGTTAA
ATGCAGAATT
CACCTGTI'GA
TAGATGGTAA
CATTGGGGAT
ATAAACATGA
TATATGATGG
CTCACTGT
AATTCGTT-TA
AGGCGAATGT
CAGCTGGGAA
GAGCTTCCTT
AAACAACGAA
CGGCTGATGG
CTGTAGCTCC
AATCATATGG
GTACCTTACC
GCTTATTCA.A
TGTGACGTI'
AAGCGTTGAT
AGGA.ACCACC
AGATGCAGTG
GCGTGGCG
AACAGAAGAA
TTACATGTG
TATTTCTAAA
ATTATATACA
TGAAkATTGGC
AAATCTTGAG
AATITCAAAAT
AAAAACAGTC
AGATGTGGCA
TGOCAGACAA
TGCAGCCTTA
GGATACAGTG
CGCCGTTAAT
CTTTATGCAT
TGATAACGGT
ACGTTCATT
TTCGTCCGT
AGCAGCAACT
AFI'AGTTGAT
TGAMALAT
AATACGA
AGTATI'TATA
GCAGCTAAAC
GATGCAAATG
TATACCA2TA
TTCCCAGTTT
TTAGCGGTTG
AAAAAAGTAG
A.GCGAAGGCT
TGCACAACGG
GAAAATGAr
GTTGGTTCGT
CA-AACAAAAA
AAAA-ATGATA
ATTGGCGAAA
GAAGGCGACG
ACTTTTGATA
A'TTGCTCCTG
CCAGCGTATA
TTAAATGAAA
CATACCGACG
AAAGTCGATG
GATCAAAACA
TGGGTGAAA.A
ATCACAGGC
GTCTTGTrA.A
AAGAAATGAG
ACGTGACGAA
A.AGCTGTCCA
GGA.ATCTCAC
AAGAAGAACC
ACGAAATGAT
TTCATAITT
GAACTIOCTGA
CACCAGGCAC
ATA.AAGAACA
TCACAGAAGC
ATAT=~AGA
CAC CA=FAC
CCTCTAA.AGT
AAATTAAATA
CTAATAAATA
ACGACTTC
AAAATTATCA
TrCCTrACGCT
AAGCAGATCC
ACCAAACACC
GCGATGTGAC
GCGACACAC
CAA.AAGCTGA
TTAAATACGG
CAAATCGGAT
CGAGTI'GAT
CGAATTTTAC
AAGTTTAACT
TGTTCAGITA
AAA.AGAGGGT
CAAGCAAACA
TCCTAAAAAT
AAATGAAGGA
AGTAAAATAT
AGCAAAACC
AGAGAATGGA
AGAAGTAAAA
AATTGAAGCA
TGATAAAACA
TCAAATTTCT
CGTCAAATTC
TGGAGAGTAT
AGTGACTGAA
AACACCAGGC
TACCAAAGCC
ACCAACTGTT
AGCGACACAA
AAATTATTTG
AGCAACTACT
TACCTATTAT
TGAATTTGTG
AGTACCAAAC CACAACAGAA AACCTAG7TT CACCAGAAAA EF058-4 (SEQ ID NO:220) EENGES AQLVIHKXXN TDLPDPLIQN SGKEMSEFDK YQCLADVTFS IYNVTNEFYE
QRAAGASVDA
VAATNMVVAF
NGAEFVISKS
GELTVKNLEV
PSLDGKDVAI
YALYDGDTVI
KNEANVDNGH
IDETTKAATW
NEQSYGTTEN
AKQAVQSLTP
PVYEMIKQTD
EGSPGTVKY I
GSYILEEVKA
GEKII(YQISV
APENYQVTEQ
TDDQTPPTVE
VKTKAE-ATTF
LVSPEKVPNK
GKPVAQGTTO
GSYKYGTEEL
QCVKZDGEJYTW
PNNAEL IENQ
NIPL.CIADKE
ANGFTVAVNP
VVTCGCKRFIK
TITADGLVDI
HKGT
ANGNVTVQLP
AVVI{IYPKUV
TTDKEQAICRF
TKTPFTIEAkN
GDANKYVKFN
AYIPTLTPGG
VDGDVTATQA
TGLKYGTYYL
KKQNGKDAVY
VANDGSLHVK
ITGKSYEIGE
NQTPVEKTVK
LVDXHDAALT
TLKFVYFMHL
LAGASFVNRD
EETVAPDDYV
TIKEEPKEGV
KVGTAENEGL
NDFTEAENGT
NDTSKVDKTT
FDNVTSG'EYA
NEKADPTKGF
QNSDTANYL<
LLTNRIEFVV
EF059-1 (SEQ ID NO:221)
TAGATTGGAA
TTAGCAGCG
ACAACAGGGA
GAGCCAGAGC
ACCGAACCTA
CCAACAGAC
GTACCAGAGC
GAATGAAAAT
GAAGCAGTGT
GTT=ACC
AACCAACAGA
GTGAGCCTTC
CAACAACGCC
AACCAACAGA
GAAAAAAATC
TTCTGCTTAT
AGATCAACCG
GCCAAGTACA
AAAACCGACG
AAGTAAGCCA
GCCAAGTGTA
ATTATTATTG
GCGCAAGAAT
AATGTACCAA
CCAGAGCAAC
GATCCTI'CGT
GAGCAACCAA
CCAGAAAAAC
CCTTATTCAG
CAGAAGGAAA
CTGACCCAAT
CATCGGAACC
TACCAGACGA
CAGAGCCAAC
CAGTAGAACC
TACAACCCTT
TCTTGGTGAA
AACGccAAGT
CTCA.ACACCA
ACCGAGCGTA
AACGCCAAGT
AAATAA.ACCA
135 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. fiaecalis Genes.
ACCGAGCCAG
GAGCAACCAA
CAACAGCCAA
GTGACGACAC
CCTTAACAC
AGTGGCAATG
GAAAAAAG
TTAATTTATA
AAAAGCCTGT
CAGATGTGGT
CTGTTCCAAT
CGATTGAAAC
AA-ACGCCTGA
TACAAGTGAA
GCATAATTGG
AAAAACGTAA
GCCAGTTGTT
GGTAAAGCCA
TGAAACGAAT
AGCAAGCGGA
TGGATTAAAA
AAGTGCTGAC
GTCAATCGCT
AGTCTAG
CCTGAAAAAC
AATGGAGAAA
AACCT=CAG
GAAGCAATrG
CCGATTAAAA
GGAAAAATGA
GGTGTATGTT
CAGrI'GTACC
TTGCAACAGG
AkAGTAACACA
TCGCAGTGGA
GTGAATATAA
AAGTACTTCC
TGACTGT'PT
ACAACAACCA
AGAATCTACA
TGTCCCAACT
TAAGGGCGT
AGTATI'ACCA
TTACACTGGT
ATCAGGAATC
EFOS9-2 (SEQ ID NO:222) MKKNI IIALFSTSLL AGCSSVSAYA QESEGNLGET- TCSVLPDEPN VPTDPITPSE PEQPTEPSTP EQPSEPSTPT EPSEPSKPTD PSLPDEPSVP TEPTTrPSKPE QPTEPTTPSV PEQPTEPSVP EKPVEPNKPT EPEKPVPVVP EICPVVPQQPE QPTDVVVKPN GEIATGESTQ QPTVPIETNN LSEVTHVPTV TTPIETASGE AIVAVDKGVP LTQTADGLKP IKSEYKVLPS GNVQVKSADG KMKVLPYTGE KMyGIIGSIAG VCLTVLSGIL IYKKRKV
C
C.
C.
C
b*
C*
C C
CS
C.
C C
C
C
C
S
C
EF059-3 (SEQ ID NO:223) AGAAGGAAA TCTI'GGTGAA
ACAACAGGGA
GAGCCAGAGC
ACCGAACCTA
CCAACAGAGC
GTACCAGAGC
ACCGAGCCAG
GAGCAACCAA
CAACAGCCAA
GTGACGACAC
CCT'ITAACAC
AGTGGCAATG
GMI TTTACC
AACCAACAGA
GTGAGCCTI'C
CA.ACAACGCC
A.ACCAACAGA
AAAAGCCTGT
CAGATGTGGT
CTGTTCCAAT
CGATTGAAkAC
AAACGGCTGA
TACAAGTGAA
AGATGAACCG
GCCAAGTACA
AAAACCGACG
AAGTAAGCCA
GCCAAGTGTA
GCCAGTTGTT
GGTAAAGCCA
TGAA.ACGAAT
AGCAAGCGGA
TGGATTAAAA
AAGTGCTGAC
AATGTACCAA
CCAGAGCAAC
GATCCTI'CGT
GAGCAACCAA
CCAGAAAAAC
CCTGAAAAAC
AATGGAGAAA
AACCTTI'CAC
GAAGCAATTG
CCGATTAAAA
GGAAAAATGA
CTGACCCAAT
CATCGGAACC
TACCAGACGA
CAGAGCCAAC
CAGTAGA.ACC
CAGTTGTACC
TTGCAACAGG
AAGTAACACA
TCGCAGTGGA
GTGAATATA.A
AAGTAC
AACGCCAAGT
GTCAACACCA
ACCGAGCGTA
AACGCCAAGT
AAATAAACCA
ACAACAACCA
AGAATCTACA
TGTCCCA.ACT
TAAGGGCGTT
AGTATTACCA
EF059-4 (SEQ ID NO:224)
C
0C C 0*
C.
EGNLGET TGSVLPDEPN VPTDPITPSE PEQPTIEPSTP EQPSEPSTPT EPSEPSKPTD PEQPTEPSVP EKPVEPNKPT EPEKPVPVVP QPTVPIETNN LSEVTHVPTV ITPIETASGE GNVQVKSADG KMKV PS LPDEP SVP
EKPVVPQQPE
A IVAVrDKGVP
TEPTTPSKPE
QPTDVVVKPN
LTQTADGLJ(P
QPTEPTTPSV
GEIATGESTQ
IKSEYKVLPS
EF060-1 (SEQ ID NO:225)
TCAAAAATAG
ACGTTGGGAA
GCAGAAGAAG
AAAACGGAAG
TCACAACAAC
CAGACGTCAT
GCCTGTITAC
AAATAA
ACAAGGAGCA
TACTr'IAAC
TAGGGCAAAC
GTGTCAITAA
AAGACAAAGT
TACCAAGGAC
TCGTAGTACT
CGCGATGATG
CGTTGGCATT
GAATATCGGT
GCCAATAGAG
GAGCAGAAAA
AGGTGAACGA
AACAAGTTTC
ACAATGAAAA
GCGAGTCTAA
GTAACGT=C
CAACCAGTCA
ACCACTGCTA
AATAGCACGT
TATATTTGA
GTAAAGGGTC
TTGTTCTC
ATGGAGGAAA
CTGATAAAGA
AAACGAATCC
GGCTTTACAG
ATAAAAAAAG
ACTI'CTGGTG
TGAGAGTTTT
AGAGCCACTA
TAAAAAAACG
GACTAATGCA
CCTTGGTATT
GAAAAAGGAA
EF060-2 (SEQ ID NO:226) MMT MKSKGSLLVT LGILLTVGIA SLIVSSIESFA EEVGQTNIGV TFYGGKEPLK 136 TABLE 1. Nucleotide and Amino Acid Seqcuences of E. faecalis Genes.
'rEGVIKPIEQ PVTDKDKKTS QQQDKVSRKT TAKTNPTNAQ TSLPRTGERN STWLYSLGIA CLIJVVLTSFY YLNKXRKXXEX EF060-3 (SEQ ID NO:227) ACAAGAAG TACCGCAA-AC GAATATCGGT GTAACGTTCT ATGGAGGAAA AGAGCCACTA AAAACGGAAG GTGTCATTAA GCCAATACAG CAACCAGTCA CTGATAAAGA TAAAAAAACG TCACAACAAC AAGACAAAGT GAGCAGAAAA ACCACTGCTA A.AACGAATCC GACTAATGCA
CAGACGTCAT
EF060-4 (SEQ ID NO:228) EEVCQTNIGV TFYGGXEPLK TEGVIKPIEQ PVTDKDKKTS QQQDKVSRKT TA-KTNPTNAQ TS EF061-1. (SEQ ID NO:229) 00 0
SS
SS
0 0 60
S*
S 0 00 0* 0O 0 600 S
S.
S S 000 S
TAATGGAACG
ATAATGATGA
AGTQAAATTT
GAAGTACCAA
CCACCTGTAG
CCGACAACAC
GAGCCAAGTA
GAAAAAACTC
CCAAGCAAGC
GGTACACAAC
CCTAGTGTAA
GGTCTTCCAC
TTGCCTAGCG
ACAGGTGAAG
ACCGACAGAA
AAAAAATI'CT
CTGCTTTTC
CAGAACCAAG
ACCCTGTAGA
CAACAGAACC
AACCAGTAGA
TGACACCAAC
CAATCGACGT
AGCCAACAGT
CAACACCTAT
TTACACAAAC
GAAATGTAGA
AAATGAATAT
GAAGATTT1'G
TTTTGCTAGT
ACAAGAAATT
TACACCAGAA
GCCACCTATT
TACAACTCCT
ACCTGAAAAA
TAAACCAACA
TGTTGTAACG
CCCTATTGA-A
TACAACTACA
AGCAGAAGGG
AGTAAAAGGT
CTT=IATCT
AACTTACAAA
TTATTTAGTC
ATCCCTGATr.
AAGCCAACAG
ACACCA.ACGG
ACAGACCCAA
CCAGTTACAC
GAATCTGAAA
CCAACAGCG
ACAAGTAAT1'
GACGGAGAA.A
TTAAAACCTA
AAGGACGGTA
GCCGTAGCGG
TAAAATTAA AATGGAGGAA CCACACTACT ATTTGGGGGA ATACTACGAC ACCGCCCATT ATCCAACACC GCCAATTGAG AGCCAACAGA ACCGACAGAG GTGAACCAGA ACAACCAACG CAAGCAAACC AGCAGAACCC AACCAGTACA ACCAGCAGA.A AATTAAATCA CCCTCGAAAT TGGCAGAAAT CACGCACGTG ACATTTAGC TGTAGAAAAA TTCAATCNAG TTACAAACTA AAATGAAGGT TTTACCATAC TATCTTGTCT GTAG EF061-2 (SEQ ID NO:230) 0 'See
S
00 S0 MMX I LFASL
VPTEPSTPEK
PSKPVEPEKP
TQQPTVPIET
PSGNVEVKGK
FSATLLFGGS
PTDPTPPIEP
VTPSKPAEPE
SNLAEITHVP
DCKMKVLPYT
EISAFAQEII
PVDPVEPPIT
KTVTPTKPTE
SVTTPITD
GEEMNIFLSA
PDD'TPPIE
PTEPTEPTEP
SE:KPVQPAEP
GENIVAVEXG
VAVSCL
TTPTEP'ITPT
SKP IDVVVTP
VPLTQTAEGL
EPSEPEQPTE
TGELNHAGNG
KPIQSSYKVL
EF061-3 (SEQ ID NO:231) GAAA=T CTGCTTlITGC ACAAGAAATI' ATCCCTGATG ATACTACGAC ACCGCCCATT
GAAGTACCAA
CCACCTGTAG
CCGACAACAC
GAGCCAAGTA
GAAAAAACTG
CCAAGCAAGC
GGTACACAAC
CCTAGTGTAA
GGTCTTCCAC
TTGCCTAGCG
CAGAACCAAG
ACCCTGTAGA
CAACAGAACC
AACCAGTAGA
TGACACCAAC
CAATCGACGT1 AGcCAACAGT
CAACACCTAT
'ITACACAAAC
GAAATGTAGA
TACACCAGAA
GCCACCTATT
TACAACTCCT
ACCTGAAAAA
TAAACCAACA
TGTMGTAACG
CCCTATTGAA
TACAACTACA
AGCAGAAGGG
AGTAAAAGGT
AAGCCA.ACAG
ACACCAACCC
ACAGAGCCAA
CCAGTTACAC
GAATCTGAAA
CCAACAGGGG
ACAAGTAATT
GACGGAGAAA
TTAAAACCTA
AAGGACGGTA
ATCCAACACC
AGCCAACAGA
GTGAACCAGA
CAAGCAAACC
AACCAGTACA
AATTAXATCA
TGGCAGAAAT
ACATTGTAGC
TTCAATCiqAG AAArGAAGGT
GCCAATTGAG
ACCGACAGAG
ACAACCAACG
AGCAGAACCC
ACCAGCACAA
CGCTGGAAAT
CACGCACGTG
TGTAGAAAAA
TTACAAACTA
TT
137 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. facca lis Genes.
EF061-4 (SEQ ID NO:232) QEII PDDTTTPPIE VPTEPSTPEI( PTDPTPPIEP PVDPVEPPIT PTEPTEPTEP TTPT-EP'ITPT EPSEPEQPTE PSKPVEPEKP VTPSKPAEPE KTVTPTKPTE SEXPVQPAEP SK(PIDVVVTP TGELNHAGNG TQQPT'JPIET SNLAEITHVP SVTTPITTTI GENIVAVEKG VLTQTAEGL KPIQSSYKVL PSGNVEVXGK DGIQM(V EF062-. (SEQ ID NO:233) TGATTCTTGA AGCAACAAAT GAAAGCAAAA 0*
CACTGGGTAA
GATAATGTAC
AACCCCGACC
GTACAAAAAG
GGTACTGAAC
GGAGCTGAAA
CTAGGGCACC
CCTAAAGAAT
A.ACATTACCG
GCTGCAAAAC
CAAGCAGCTG
GCAAAAA.ACA
AAGAACGAAA
GTCGTGACGA
GAT=AATA
ATGGGNGGGA
TCTAATGGTG
GGACTAA.ACG
CAATCCTCAC
ACNGCTITA
AAGT-TC LrG
GCGCTGTCTT
GATTTTGGGG
GATGGAAAAT
AkATAGTGATT
GCNAATGGNC
GTATCTAGTG
ATTTCAATT
AAAGCCAATG
NTCAATGATT
ACAGAACTTG
GATGCAGAAA
CTNAACGANA
GTGACTGCTT
AACGCNGACG
GTAGTCAAAA
GAAACGGTAA
AAAGCAGATA
CAAACAAAA
ACNGAAGAAT
TGGCACGCTA
GATATTTCTC
AATCAAGCAT
CTGTCCCTAT
AAGCCGCGGA
TGCAGTCAGA
ACACTACI'C
AAAGTTCAC
AATCAGCACA
CAACAGAAGT
ITGAAACACC
TTGTTAAA
AAAAAGAAGT
AAT'rAAAAGC
AAGCNGAAAA
ACAGCTATGT
AAGACACTAA
AAGTAAATGC
AAGNTACTGG
GGTATr.CCOT
CTAGTTAT
CAAGCCAAAG
TTGGTACAAA
ATGCGTCACG
C ITTAAATTC
CNAACAATGC
TITACTCACC
GGC-ACGCTGT
CTATTTICCTT
CGCAATCGTT
ATGGGAATCC
TCGTTCCTGT
TAAATGTOAA
CCAAAGTAGA
AAGTGACGAT
AAGACTNAAA
CTTATGANCT
GNTCNGTTGT
ATGTAGAAGG
CAAATACAGT
AAAACGGTAC
TTTATTATGA
GGGGCATrGAA
TTACNAANTA
CCTACATTCT
TAITTGCNGC
TCTTTTTCTA
ATTAGATACG
AAAGGAAACA
TCAACCGACC
TACCCCAAAT
AGAACAACCA
TCCACCAGCT
AGACGTTGAT
ACCAGCAGAA
AGACCAACTA
CAAAAATGAA
AGAGCGNTAN
CAATGAAGCG
AATTCGTCG
AGGGGATTCA
CAACTTCCAG
TCTTIIAGAA
AGGACGTAAA
TGGAACGTTA
CAGAGTCAAT
TAA.AGAAGTA
AAGTTTAACG
GTTCAAATAC
GGAAGATATI'
AGGTCACAAG
TTCTI'TTGGT
TCCCTTTAGN
AAAAGAACCA
NCTTGTGCCN
NCGTGCCAT
TCCAAAAACA
TGATTTATCC
AGCTGTTGCT
CAATTTAGAT
TTTAGCAATG
CGATTTTGAA
GATTAACCAT
AGTTGGCAGT
AGTGAAATCT
TGATGTCTTG
TGACCTTAAA
TrTAGAAAAC
NTTAAATGAA
AAACAATATA
P ClrGTGTTAG
CAACCAGAAA
CCTAAAACGG
A~AAGTAGAAC
.ATACCACAA
GTAOTAAGCC
GAAA6ATGAAG
AAAGCAGTIG
GACTTAGGCA
CAAAAAGAAC
AAAATTGCCA
GANAPAAGAAG
AT1'AGTAAAA
ATTAAAGGCG
AA.AGATATCT
AA=TCTTTG
AAAAATAAAC
ATTACAAAAG
AATKGCAGTAT
GGTAACGATG
CTACCAGATA
AATAAAGGTG
ATTAATIGGNT
GACTATGGCA
AATGCC TACT
ATGACAACA.A
ACTAACTTAA
GAAAAAGCAA
AATAAAGAAG
TCTT'TACAAT
GTAACNAAAC
AAACTGAAAG
GCACNATTA
CAAAACACCG
GGGTATAAAT
AATACAG-CTG
GTGCCAGGTA
GTTTCTCTAC
TCCGAACGTC
GACACCACCC
GTAGGGGANA
AAAGACAATA
GGAAGcAATA
AGACATATAA
GAGCCGTAG
CAACGACGGT
CAGT~kTCTGA
AACTAGCGCC
ACGCGCAACA
CTGAAACAAC
TGA.ATAAATC
ATGAAGTAAA
ACG=ICTI'C
kAGCGAAAAA
AAGAAAATGC
TCGCNGAATA
ACCTAGTGTT
GAAAATTTAT
TTACAAAATT
AGCTAAGAkAT
AI'ACCTACT
TCAACCCAAT
AGAAGCAACA
AGAAAATAAA
ACCAACAGTA
CAATGAACCT
AACGTCCATT
AAAAGATCCA
TAAAGA=tTA
GATTGCCCAA
AGAAATTGCG
CAACAAGCAT
CGATCAATCT
CAAAGCAACT
ACGGAAAGAT
TAAAAGACGC AAATCTTGGG CACTCACAGT GACCTATACA CAGAATTTGT TTATGAACTA ITCAAACGA TCCGATATC ?I'AAAACACG CTTAACCA'Fr A.AGATAGTCC ATTTGCGTAT GCCATGCGGA ATTTGTTTCT CNTATGTGAA AAAACAAGCG CAGCACCTTC TGGATTGAA-A TTGGTCAGG TGTAGGTCTA AAGGAAAAAG TAATGTGCCT ATGCGCAATC AGTCAAGCCT CGATTGAAT CAATNGATAC TCACUGATGG NCAGAAAAAT ACATTCTGAC AG-GOGATACC AAGGGATTCG AGATACNTT 'IIATCAAGC AGACGCAAGT ATTCAGGAAN AGCTAAAGAC TCACAGCAAT CATCAAAACC ATTAC'ITGT CTTGCCG=r IrrcAGcTGAC AAANGATGGN GTAATCCTI'C CAAAGATGTA ATGATAAAGA TATTCCGTTA CAGCNAACTA TGGCGGAATN ATGATCGTIT cACAGGNAAA AAACGTTAAA AGCAGGAACA A-AGAC?M'AC GTTTFACNATG A.AGTAGGCAA ACAAGcTI'GG 138 TABLE 1. Nucleotide and Amino Acid Seqcuences of E. faecalis Genes.
TCTGTGTATC
AACTACAACA
AAACCAACCA
CGTGGTGATG
GCCTTTGATA
AANGTGACAC
AACCAGTTCA
CCACAAGCCT
GTCAAAGCCG
CGAATTAAAA
CTTATTAAAG
TTCTTCTATG
TGGTCGATTA
TI'GCCAATT
TCGAAACTAT
TTTI'TNGATG
ATTGGTGTAG
AATGAGAAGA
CCAGAAAAAA
TTAGTGGTAG
GTCTTATTAA
AAACGTAGAA
TGGAAGTCGA ACGGATNAAA AAGAGCTTCT NCG'ITCTAAT AAGCCG=TCA TAACAAGAAA TTCTTTCTTA TGAAATGACN
CACTCGATCT
CAATCAAAGA
CGATCTfCNTG
TTATTCTAGC
ATGT ETCTGG
CCAATACNGT
TNGGTGACAA
AATTTACAAG
GCGATAAACT
CTAA'TTTGT
TCACGATGAC
CGATGAATCT
AACGAATTGC
TTAAAACNAA
CAGTC-ATTGT
AAAAGGCAAG
CGGTAGCTGG
TGCGACACGC
CTTACTTCGT
GGACGATGCC
GNATGGTGGC
NGATGTI'AT
TOTCAACCAT
ACAAAGTCAA
TAGTGACATPI
AGACGTCAAA
TTAGCAGAC
CITTGAACAA
AAAAGAAAAC
GGCAGGAGAC
TACGGTAGTrC
ACCACCAACA
TGTNGTGCCA
TAGTTTAGCC
ACAGGTGACG
ACNCTGGTr.A
GGGGAAGANA
TGGGACTTAA
GTTTCTTTCT
GTCAAAGA'TT
AAAGGCACGG
CAAGAATTGC
AATTCAGCGG
ATTCCAAAAG
AATGGNGCCA
6CTCCAAT
CATGACAAAT
GGAACCAAAG
GGTAGTGA
AAAAACGTTG
GTTTACAACA
ACNCATACGC
CCAAA.AACAC
GAATTGCCGC
GCAATGCTTG
TAGAAAACAC
CGCATACNCC
TAANCATGG
AAGGGTACGA
TO-GATGATTA
CTAAAGGGGN
TGACNATNTC
GTGTAACNCT
AACAAAATAC
MAANCCTAA
CAATCAAATT
ACCCTCGNGT
TTAGTGGCCA
TGA-AMRAGG
AAATCACGGC
CACACTCATG
CAATCGAAGA
CAGAAAAACC
CGCA.AGCACC
AAACAGGCGA
GCTTACCAGG
GCAAACAGAA
TGATGATCCA
AAAAGTNGCT
TXAAGACTTT
CGATGAAACG
AGACATTACO
TGCCAAAGAC
CCCTACAAAA
ATTGGNCAA
AAAAGACGTG
AGGGGAGAAN
TGTGGAAGAA
ATGGTCTGTG
GGACGACATT
CAGTCAAGCC
GAAAGCGTTC
ATCTTTCAAC
ACAAACNCCA
AGTAGAGCCA
AAAACAAAAT
CTTACGCTTT
AAGAAACAAA ATAA 0* 0@
C
EF062-2 (SEQ ID NO:234) MKAXK QYKTYKAKNH WVTVPILFLS VLGAVGLATD NVQAAELDTQ PETrVQPNN
PDLQSEKETP
AEKSAQEQPV
ITVVEKPAED
KNKAEKERXX
FNKVNAGDSK
LNASYLGRKI
FFDASGKEVL
GK FYS PED ID
SSAQWFAFXT
NDLNVXRGDS
NXKDXKAVAA
VKNVEGDFEN
TKIYYEVKSS
ISAYILLENK
YNKELVRSNT
FDTVDLATGV
QAFILAXGGQ
IKVGDKQSQN
ANSNFVLADG
GVERIAAGDV
VVEKASVVPE
KTAVSEE-ATV
VSPETTNEPL
LGNVSSKDLA
KEVAEYNKHK
DIFTKLRKDM
TKAEFVYELQ
PDKDSPFAYA
YGTGPSGLKN
NLNAQSVKP I
LQYIVTGDTT
AINSGXAKDV
TAVQEJTXDGE
ERPANYGGXT
DNKDLTFTMN
VVTHTPDDPK
SFFDDYDETX
ELRVTLPTKV
GATIKLGEXF
TKVNKGDDI S
YNTIEESFN
LPQTGEKQNV
QKDTTSQPTK
GQPTEVAPAE
AKEKEVDQLQ
NENSYVNEAI
GGKXTGNFQN
SSPSQSGTLN
LSSLNSSLTN
SDWDAVGHKN
FNYGNPKEPE
ELAKVDPKTV
TASYXLNLDQ
TVTNTVINH'V
EEWGMNDVLD
QALLAALNEG
PTKAVHNKKG
VrP IKDLLRV
KADVSGDVYN
FYEFTSSDI P
KLFTMTFEQG
EKIKTNTVVT
LLTVAGSLAA
VEEVAPE:NKG
NEVNKSTSIP
KEQAKI IAQO SKNLVFrDQSV
SFVKEANLGS
AVFSNDPIIT
KGGHAEFVSD
AYFGSGVGLA
KATIEFNXYK
TKQGIRDTFO
NTVTAMMY(TN
PGSNPSKDVK
TTHDRFTGKW
SNKVGKQAWS
EXIXHGKVAR
KDSKGXDITN
SAEQNTFGQR
AEYAGVVEEW
VVKITASQAF
HTPEKPQTPP
MLGLAGLGFK
TEQS SATPND
KEFETPDVDK
AAELKAKt-EK V'rKDTKISSI
NGGYAVLLEK
AF IGTNRVNG
FGANNAFKYI
NGRI SFSFGM
ANVVPVLVPN
AEKV'rIDLSK
ADGSWVLAMG
ADKNG'PJGSV
HAITXYDLKV
VYLEVERXKT
GDVLSYEMTW
QFTI SWDDAK
IKTNTVVNHI
S ISDKLDVKH
XDAMNLKENK
EK'IVIVPPTP
RRKETK
'ITNAQQPTVG
AVDEVKKDPN
IAKENAEIAA
KGGKF IKATD
NKPVTVTYTG
KDVKTRLTIK
NGSYVKKQAD
T'rKGKSNVPV KEV'rDGQKNX
VKVYQADASL
YKYLLVLPFV
SLHDKDIPLQ
GXKTLKAGTD
GDVENTQTEN
OLKGYDKDFA
GTVTXSAKDP
PKVXPKKDVV
DKFSGQWSVF
NVAHS WRA I
KTPQAP'JEPL
EF062-3 (SEQ ID NO:235) TGATTCTTGA AGCAACAAAT GAAAGCAAAA AAACAATATA AGACATATAA AGCTAAGAAT CACTGGGTAA CTGTCCCTAT TCTTTTCTA AGTGTGTTAG GAGCCGTAGG ATTAGCTACT 139 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
GATAATGTAC
AACCCCCACC
GTACAAAAAG
GGTACTCAAC
GGAGCTGAAA
CTAGGGCAGC
CCTAAAGAAT
AACATTACCG
GCTGCAAAAG
CAAGCAGCTG
GCAA.AAAACA
AAGAACGAA.A
GTCGTGACGA
GATT=AATA
ATCGGNGGGA
TCTAATGGTG
GGACTAAACG
CAATCCTCAC
ACNGCTT'ITA
AAGTTCTITG
GCGCTGTCTT
GA TI'TGGGG
GATGGAAAAT
AATAGTGATT
GCNAATGGNC
GTATCTAGTG
ATTTCALATT
AAAGCCAATG
NTCAATGATT
ACAGAACTTG
GATGCAGAAA
CTNAACGANA
GTGACTGCTT
AACGCNGACG
GTAGTGA-AAA
GAAACGGTAA
AAAGCAGATA
CAAACAAAAA
ACNGAAGAAT
TGGCACGCTA
GATATTTCTG
AATCAAGCAT
TCTGTGTATC
AACTACAACA
AAACCAACCA
CGTGGTGATG
GCCTN'rATA
AANCTGACAC
AACCAGTCP
CCACAAGCCI
GTCAA.AGCCC
CGAATTAAAP
GTITATI'AAAC
TTCTTCTAT'C
AAGCCGCGGA
TGCAGTCAGA
ACACTACTI'C
AAAGTTCAGC
AATCAGCACA
CAACAGAACT
TTGAAACACC
'rrGTTGAAAA
AAAA.AGAAGT
AATTAAAAGC
AAGCNGAAAA
ACAGCTATGT
AAGACACTAA
AAGTAAATGC
AAGNTACTGG
GGTATIGCGGT
CTAGTTAT
CAAGCCAAAC
TTGGTACAAA
ATGCGTCAGG
CTTTAAATTC
CNAACAATGC
TTTACTCACC
GGGACGCTGT
CTATTTCCTT
CGCAATGGTT
ATGGGAATCC
TCGTTCCTGT
TAAATGTGAA
CCAAAGTAGA
AAGTGACGAT
AAGACTNAAA
CTTATGANCT
GNTCNGTTGT
ATGTAGAAGG
CAAATACAGT
AAAACGGTAC
TTTATTATGA
GGGGCATIGAA
TTACNAANTA
CCTACATTCT
TATTGGCNC
TGGAAGTCGA
AAGAGCTTGT
*AAGCCGTTCA
TTCTITCTTA
*CAGTCGATCT
CAATCAAAGA
*CGATCTCNTG
TTA~rCTAGC
ATGTTTCTGG
LCCAATACNG'I
TNCGTGACAA
AATTTACAACG
AT'TAGATACG C AAAGGAAACA
TCAACCGACC
TACCCCAAATC
AGAACAACCA
TGCACCAGCT
AGACGTTGAT
ACCAGCAGAA
AGACCAACTA
CAAAAATGAA
AGAGCGNTAN
CAATGAAGCG
AATTTCGTCG
AGGGGATTCA
CAACTTCCAG
TC=TTAGA.A
AGGACGTAAA
TGGAACGI!EA
CAGAGTCAAT
TAAAGAAGTA
AAGTTAACG
GTTCAAATAC
GGAAGATATT
AGGTCACAAG
TTCTTTTGGT
TGCCTTTAGN
AAAAGAACCA
NCTTGTGCCN
NCGTGGCGAT
TCCAAAAACA
TGATI1TATCC
AGCTCTTGCT
CAATTTAGAT
TTI'AGCAATG
CGATTFI'GAA
GATTAACCAT
AGTTGGCAGT
AGTGAAATCT
TGATGTCTTG
TGACC~rAAA
TTIAGAAAAC
NTTAAATCAA
ACGGATNAAA
NCGTTCTAAT
TAACAAGAAA
TGAAATGACN
TGCCACAGCC
CTTACTTCGT
GGACGAT--CC
GNATGGTGGG
NAGTTTAT
TGTCAACCAT
ACAAAGTCAA
TAGTCACATT
A.ACCAGAAA C CTAAAACGG C
LAACTAGAAG
;ATAccACAA I~ 3TAGTAACCCC
.AAAATCAAC
VAAGCAGTTG2 'AC'1rAGG3CA2 CAAAAAGAAC2 A.AAATTGCCA2
GANAAAGAAG
C.T'PAGTAAAA
ATTAAAGGCG
~AAGCATATCT
~A ETCCTI=
AAAAATAAAC
ATTACAAAAG
AATGCAGTAT
GGTAAGGATG
CTACCAGATA
A.ATAAAGGTG
ATI'AATCGNT
GACTATGGCA
AATCCCTACT
ATGACAACAA
ACTAACTTAA
GAAAAAGCAA
A.ATAAAGAAC
TCTTTACAAT
GTLAACNAAAC
AA.AGTGAAAG
GCAGCNA TrA
CAAAACACCG
GGGTATAAAT
ALATACAGCTC
GTGCCAGGTA
GTTTCTCTAC
TCCGAACGTC
GACACGACCC
GTAGGGGANA
AAACACAATA
GGAAGCAATA
ACAGGTGACG
ACNGTGGTGA
GGGGAAGANA
TGGGACTTAA
G'TTCTTTCT
CTCAAACATT
AAAGGCACGG
CAAGAATTGC
AATTCAGCGG
ATTCCAAAAG
AATGGNGCCA
CCTGCAGAAT
AACGACGGT TI ACTATCTGA A UkGTACCGCC A CGCGCkACA A :TGAAACAAc C rCAATAAALTC kTGAAGTAAA kCGTTTCTTC k.AGCGAAAAA G %AGAAAATICC A LCGCNGAATA C kCCTAGCTT C 7,AAAATTTAT C rTACAA.ATT I
TAAAAGAGC
CAGTGACAGTC
CAGAA--TTGT
I'rTCAAACGA
FTAAAACACG
AAGATAGTCC
GCCATGCGGA
CNTIATGTGAA
CAGGACCTTC
TTGGTTCAGG
AAGGAAAAAG
ATCCAATC
CGATTGAATT
TCACTGATGG
ACATITGTZAC
AAGGGA TTCG
TTTATCAAGC
ATTCAGGAAN
TCACAGCAAT
ATI'TAC?M~T
TTCAGCTGAC
GTAATCCTTC
ATGATAAAGA
CAGCNAACTA
ATGATCGTTT
AAACGTTAAA
AAGACTTGAC
AACTAGGCAA
TAGAAAACAC
CGCATACNCC
TTAANCATGG
AAGGGTACGA
TCGATCATTA
CTAA.AGGGGN
TGACNATNTC
GTGTAACNCT
AACAkAATAC
TGAANCCTAA
CAATCAAATT
ACGCTGCNGT
CAACCCAAT
GAAGCAACA
GAAAATAAA
.CCAACAGTA
AATGA.ACCT
LACGTCCATT
AAAGATCCA
'AAAGATTTA
ATTCCCCAA
GCAAATTGCG
:AACAAGCAT
:GATCAATCT
:AAAGCAACT
~CGGAAAGAT
LkATC ETGGG
~ACCTATACA
rrATGAACTA
ICCGATTATC
YIAACGATT
kTTTGCGTAT AT'rrGTTTCT
AAAACAAGCG
TGGATTGAAA
TGTAGGTCTA
rAATGTCCCT AGfTGAAGCCT
CAATNGATAC
NCAGAAAAAT
AGGGGATACC
AGATACNTT
AGACGCAAGT
ACCTAAAGAC
GATGAAAACC
CTTGCCGTTr
AAANGATGGN
CAAAGATGTA
TATTCCG ITA
TGCGGAATN
CACAGGNAAA
AGCAGGAACA
cI'rACNATG
ACAAGCTTGG
GCAAACAGAA
TGATGATCCA
AAAAGTNGCT
TAAAGACTTT
CGATGAAACG
AGACATTACG
T~cCAAAGAC
CCCTACAAAA
AFT=GNCAA
AAAAGACGTG
AGGGGAGAAN
TGTGGAAGAA
140 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
TGGTCGATTA
T1'TGCCAATT
TCGAAACTAT
TTrINGATG
ATTGGTGTAG
XATCACAAGA
CCAGAAAAAA
GCGATAAACT
CTAA'T=T
TCACGATCAC
CGATGAATCT
AACGAATTGC
TTAAA-ACNAA
AGACGTCAAA
TTITAGCAGAC
CTTTGAACAA
AAAAGA.AAC
GGCAGGAGAC
TACGGTAGn-G
CATGACAAAT
GGAACCAAAG
GGGGTAGTCA
AAAAACGTTG
GTTACAACA
ACNCATACGC
CCAAAAACAC
TTAGTC-CCCA
TGAATAAAGG
AAATCACGGC
CACACTCATG
CAATCGAAGA
CAGAAAkAACC
CGCA-AGCACC
ATGGTCTGTC
GGACGACATT
CAGTCAACCC
GAAAGCGTTC
ATCTTTCAAC
ACAAACNCCA
AGTAGAGCCA
CAGTGATTGT ACCACCAACA TTAGTGGTAG AAAAGGCAAG MO EF062-4 (SEQ ID NO:236) AELDTQ PETTTVQPNN PDLQSEKETP KTAVSEEATV AENSAQEQPV VSPETTNEPL ITVVEKPAED LGNVSSKDLA KNKAEKERXX KEVAEYNKHK FNKVNAGOSK DIFTKLRKOM LNASYLGRKI TKAEFVYELQ FFDASGKEVL PDKDSPFAYA GKFYSPEDID YGTG?SGLKN SSAQWFAFXT NLNAQSVKPI NDLNVXRGbs LQYIVTGDTT NXKDXKAVAA AINSCXAKDV V-KNVEGDFEN TAVQLTXDGE TKIY-YEvKSS ERPANYGGXT I SAY ILLENK DNKDLTFTMN YNKELVRSNT VVTHTPDDPK FDTVDLATGV SFFDDYDETX QAFILAXGGQ ELRVTLPTKV IKVGDKQSQN GATIKLGEXF ANSNFVLADG TXVNKGDDIS CXTERIAAGDv YNTIEESFNN
VVEKASV
QKDTITSQPTK
GQPTEVAPAE
AKEKEVDQLQ
NENSYVNEAI
OOKXTONFQN
SSPSQSGTLN
LSSLNSSLTN
SDWDAVGHKN
FNYGNPKEPE
ELAKVDPKTV
TASYXLNLDQ
TV'TNTVINHV
EEWGMN~DVLO
QALLAALNEG
PTKAVHNKKG
VTP IKDLLRV
KADVSGOVYN
FYEFTSSDIP
KLFTMTFEQG
EKIKTNTVVT
VEEVAPENKG
NEVNKSTS IP KEQAKK IAQQ
SKNLVFDQSV
SFVXKEANLGS
AVFSNDPI IT
KGGHAEFVSD
AY'FGSGVGLA
KATIEFNXYK
TKQGIRDTFD
NTVTAMMKTN
PGSNPSKDVK
TTHDRFTGKW
SNKVGKQAWt
EXIXHGKVAR
KDSKGXD ITN
SAEQNTFGQR
AEYAGVVEEW
VVXITASQAF
HTPEK(PQTPP
TEQS SATPND
KEFETPOVDK
AAELKAKNEK
VTKDTKISSI
NGCYAVLLEK
AF IGTNRVNG FGANNAFKY I NOR ISFSFOM
ANVVPVLVPN
AEKVTIDLSK
ADGSVLAMG4
ADKNCTVGSV
HAITXYDLKV
VYLEVERXYXT
GDVLSYEMTW
QFTISVIDDAK
IXTNTV/VNH I S ISDKLDVKH XDAI1JLKENK
EKTVIVPPTP
TTNAQQPTVG
AVDEVKKDPN
IAKENAEIAA
KGGKF IKATD
NKPVTVTYTO
KDVKTRLTIK
NGSYVKKQAD
TTKGKSNVPV
KEVTDGQKNX
VKVYQADASL
YKYLLVLPFV
SLHDKDIPLQ
GXKTLKAGTD
GDVENTQTEN
DLKGYDKDFA
GTVTXSA.KDP
PKVXPKKDVV
DKFSGQWSVF
NVAHSWKAF I
KTPQAPVEPL
S
EF063-1 (SEQ ID NO:237) TGA'rTCTTGA
CACTOGGTAA
GATAATGTAC
AACCCCGACC
GTACAAAAAG
GGTACTCAAC
GGAGCTGAAA
CTAGGGCAGC
CCTAAAGAAT
AACATTACCG
GCTGCAAAAG
CAAOCAGCTG
GCAAAAAACA
AAGAACGAAA
GTCGTGACGA
GATTTTAATA
AGCAACAAAT
CTGTCCCTAT
AAGCCGCGGA
TOCAGTCAGA
ACACTACT'C
AAAGETCAGC
AATCAGCACA
CAACAGAAGT
TTGAAACACC
TTO-TTAAAA
AAAAAGAAGT
AATTAAAAGC
AAGCNGAAAA
ACAGCTATGT
AAGACACTAA
AAGTAAATGC
GAAAGCAAAA
TCTTTTTCTA
AJTAGATACG
AAAGGAAACA
TCAACCGACC
TACCCCAAAT
AGAACAACCA
TGCACCAGCT
AGACGTTGAT
ACCAGCAGAA
AGACCAACTA
CAAAAATGAA
AGAOCGNTAN
CAATGAAGCG
AA'TTCGTCG
AGGATTCA
AAkACAATATA
AGTGTGTTAG
CAACCAGAAA
CCTAAAACGG
AAAGTAGAAG
GATACCACAA
CTACTAAGCCC
GAAkAATGAAG AAAGCAG7M
GACTTAGCCA
CAAAAAGAAC
AAAATTGCCA
GANAAAGAAG
ATTAGTAA.AA
ATTAAAGGCG
AAAGATATCT
AGACATATAA
GAGCCGTAGG
CAACGACGGT
CAGTATCTOA
A-AGTAGCCC
ACCCGCAACA
CTCAAACAAC
TGAATAAATC
ATGAAGTAAA
ACr.'ITCTTC
AAGCGAAAAA
AAGAAAATGC
TCGCNGAATA
ACCTAGCTT
GAAA.AMTAT
TTACAAAATT
AGCTAAGA-AT
ATTAGCTACT
TCAACCCAAT
AGAAGCAACA
AGAAAATAAA
ACCAACAGTA
CAATOAACCT
AACGTCCATT
AAAAGATCCA
TAAAGATTTA
GATTOCCCAA
AGAAATTGCG
CAACAAGCAT
CGATCAATCT
CAAAGCAACT
ACGGAAAGAT
TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
p p p p
ATGGGNGGGA
TCTAATGGTG
tGGACTAAACG
CAATCCTCAC
ACNGCTITA
AAGTTCTTTG
GCGCTGTCT
GATTTTGGGG
GAT'GGAAAAT
AATAGTC-ATT
GCNAAT'GGNC
GTATCTAGTG3
AT=~CAATT
AAAGCCAATG
NTCAATGATT
ACAGAACTG
GATGCAGAAA
CTNAACGANA
GTGACTGCTT
AACGCNGACG
GTAGTGAAAA
GAAACGGTAA
AAAGCAGATA
CAAACAAAAA
ACNGAAGAAT
TGGCACGCTA
GATATTTCTG
AATCAAGCAT
TCTGTGTATC
AACTACAACA
AAACCAACCA
CCTGCTCATG
GCCTTTGATA
AANGTGACAC
AACCAGTTCA
CCACAACCCT
GTCAAAGCCG
CGAATTAAA,
GTTATTAAAG
TTCTTCTATG
TCCTCGATT~A
TTTGCCAA'rr
TCGAAACTAT
TITTTNGATIG
AITGrGTG.TAG
AATGAGAAGA
CCAGAAAAAA
TAGTGGTAG
GTCTTATTA.A
AAACGTAGAA
AAGNTACTGG
GGTATGCGGT
CTAGTTATTT
CAAGCCAAAG
TTGGTACAAA
ATGCGTCAGG
CTTTAAATTC
CNAACAATGC
TTTACTCACC
GGGACGCTGT
GTA=IICCTT
CGCAATGGTT
ATGAATCC
TCGTI'CCTGT
TAAATGTGAA
CCAAAGTAGA
AAGTCACGAT
AAGACTNAA-A
CTTATGANCT
GNTCNGTTGT
ATGTAGAAGG
CAAATACAGT
AAAACGGTAC
TTATTATGA
GGGGCATGAA
TTACNAANTA
CCTACAITCT
TATGGCNGC
TGGAAGTCGA
AAGAGCTTGT
AAGCCGTTCA
TTC'ITTCTTA
CAGTCGATCT
CAATCAAAGA
CGATCTCNTG
TTATT:ZTACC
ATG=TCTGG
CCAATACNGT
TNGGTCACAA
AATTACAAG
GCGATAAACT
CTAATTTTGT
TCACGATGAC
CGATGAATCT
AACGAATTGC
TTAAAACNAA
CAGTGATTGT
AAA.AGGCAAG
CGGTAGCTGG
CAACI"TCCAG
TCTTTTAGAA
AGGACGTAAA
TGGAACGTTA
CAGAGTCAAT
TAA.AGAAGTA
AAGTTTAACG
GTTCAAATAC
GGAAGATATT
AGGTCACAAG
TTCTTTTGTGT
TGCCTIAGN
AAAAGAACCA
NCTT~GTGCCN
NCGTGGCGAT
TCCAAAAACA
TGATT1TATCC
AGCTGTTGCT
CAATI'TAGAT
'FITAGCAATG
CGATTTGAA
GAFI'AACCAT
AGTTGGCAGT
AGTGAAATCT
TGATGTCTTG
TCACCTTAAA
TTTAGAAAAC
NTTAAATGAA
ACGGATNAAA
NCGTTCTAAT
TAACAAGAAA
TGAAATGACN
TCGACAGGC
CTTACTTCGT
GGACGATGCC
GNATGGTCGG
NGATG=rAT
TGTCAACCAT
ACAAAGTCAA
TAGTGACATT
AGACGTCAAA
TTTAGCAGAC
CTTGAACAA
AAAAGAAAAC
GOCAGGAGAC
TACGGTAGTG
ACCACCAACA
TGTNGTGCCA
TAGTTTAGCC
AATTCCTTI'G
A.AAAATAAAC
ATTACAAAAG
AATGCAGTAT
GGTAAGGATC
CTACCAGATA
AATAAAGGTG
ATI'AATGGNT
GACTATGGCA
ALATGCCTACT
ATGACA-ACAA
ACTAACTTAA
GAAAAAGCAA
AATAAAGAAG
TCTTTACAAT
GTAACNAAAC
AAAGTGAAAC
GCAGCNATTA
CAAAACACCG
GGGTATAAAT
AATACAGC 1G
GTGCCAGGTA
GTTTCTCTAC
TCCGAACGTC
GACACGACCC
GTAGGGGANA
AAAGACAATA
GGAAGCAATA
ACAGGTGACG
ACNGTGGTGA
GGGGAAGANA
TGGGACTTAA
GTTTCTTCT
GTCAAAGATI'
AAAGGCACGG
CAAGAATTC
AATTCAGCGG
ATTCCAAAAG
AATGGNGCCA
CCTGCAGAAT
CATGACAAAT
GGAACCAAAG
GGGGTAGTGA
AAAAACG~rG
GTI'TACAACA
ACN'CATACGC
CCAAAAACAC
GAATTOCCGC
GCAATGcTrG rAAAAGAGGC
CAGTGACAGT
CAGAA=TGT
rTTCAAACGA rTAAAACACGC
A.AGATAGTCC
GCCATGCGGA
CNTATMTGAA
CAGGACCTTC
TTGGTTCAGG
1AGGCAAAAAG ATGCGCAkATC
CGATTMAAT
TCACTGATGGI
ACATTGTGAC
AAGGGATTCG
TTTATCAAC
ATCAGGAAN
TCACAGCAAT
ATTACTTGT
TTCAGCTGAC
GTAATCCTTC
ATGATAAAGA
CAGCNAACTA
ATGATCGTTT
AAACGTTAAA
AAGACTTGAC
AAGTAGGCAA
TAGAAAACAC
CGCATACNCC
TTAANCATGG
AAGGGTACGA
TCGATGATTA
CTAAAGGGGN
TGACNATNTC
GTGTAACNCT
AACAAAATAC
TGAANCCTAA
CAATCAAATT
ACGCTGGNGT
TTAGTGGCCA
TGAATAAAGG
AAATCACGGC
CACACTCATG
CAATCGAAGA
CAGAAkAAACC
CGCAAGCACC
AAACAGGCGA
GCTTAGCAGG
kAATCTTGGG
;ACCTATACA
rTATGAACTA rCCGATTATC
:TTAACGATT
%.TTTGCGTAT
%.T?1GTTTCT
AIAAACAAGCG
TGGA ITGAAA
TGTAGGTCTA
TAATGTGCCT
A.GTGAAGCCT
CAATNGATAC
NCACAAAAAT
AGGGGATACG
AGATACNTT
A.GACGCAAGT
AGCTAAAGAC
GATGAAAACC
C IPGCCGTTT
AAANGATGGN
CAAAGATGTA
TA PTCCGTTA
TGGCGGAATN
CACAGGNAAA
AGCAGGAACA
GTITACNATG
ACAAGCTTGG
GCAAACAGAA
TCATCATCCA
AAAAGTNGCT
TAAAGACTTT
CGATGAAACG
AGACATTACG
TGCCAAAGAC
CCCTACAA.AA
AITTTCGNCA.A
AAAAGACGTG
AGGGGAGAAN
TrGTGGAAGAA
ATGG.TCTGTG
GGACGACATT
CAGTCAAGCC
GAA.AGCGTTC
ATC'TTTCAAC
ACAAACNCCA
AGTAGAGCCA
AAAACAAAAT
CTTAGGCTTT
AAGAAACAAA ATAA EF063-2 (SEQ ID NO:238) 142 TABLE 1. Nucleotide and Aniino Acid Seqeuences of E. faecalis Genes.
MKAKK QYKTYKAKNH WVTVPILFLS VLGAVGLATD NVQAAELDTQ PETTTVQPNN
PDLQSEKETP
AEKSAQEQPV
ITVVEKPAEID
KNKAEKERXX
FNKVNAGDSK
LNASYLGRXI
FFDASGKEVL.
GKFYSPEDID
SSAQWFAFXT
NDLNVXRCDS
NXKDXKAVAA
VKNVE OF EN TX IYYEVKSS
ISAYILLENK
YNKELVRSNT
FDTVDLATGV
QAF ILAXGGQ
IKVGDKQSQN
ANSNFVLADG
GVERIAAGDV
VVEKASVVPE
KTAVSEEATV
VSPETTNEPL
LGNVSSXDLA
KEVAEYNKH(
DIFTKLRKDM
TKAEFVYELQ
POKDSPFAYA
YGTGPSGLKN
NLNAQSVKP I LQY IVTGDTr
AINSGXAKDV
TAVQLTXOGE
ERPANYGGXT
DNKDLTFTMN
VVT}ITPDDPK
SFFDDYDETX
ELRVTLPTKV
GATIKLGEXF
TKVNI(GDOI S
YNTIEESFNN
LPQTGEKQNV
QKDTTSQPTK
GQTEVAPAE
AKEKEVDQLQ
NENSYVNEAI
GGKXTGNFQN
SSPSQSGTLN
LSSLNSSLTN
SDWDAVGHKN
FNYGNPKEPE
ELAXVOPKTV
TASYXLNLDQ
TVTNTVINHV
EEWNNDVLD
QALLAALREG
PTKAVHNKKG
VTPIKDLLRV
KADVSGDVYN
FYEFTSSDIP
KLFTMTFEQG
EX IKTNTVVT
LLTIVAGSLAA
VEEVAPENXG
NEVNKSTSIP
KEQAKXIAQQ
SKNLVFDQSV
SFVKEANLGS
AVFSNDPIIT
KGGRAEFVSD
AYFGSGVGLA
:(ATIEFNXYI(
TKQGIRDTFO
RItVTAMMKTN
PGSNPSKDVK
TTHDRFTGKW
SNKVGKQAWS
EXIXHGKVAR
KDSKGXDITN
SAEQNTFGQR
AE-YAGVVEEW
VVKITASQAF
HTPEKPQTPP
MLGLAGLGFK
TEQSSATPND
KEFETPDVDK
AAELXAKNEK
V'flCDTKISS I
NGGYAVLJLEK
AFIGTRVNG
FCANNAFKYI
NGRISFSFGM
ANVVPVLVPN
AEXVTIDLSK
ADGSVVLAM'G
ADKNGTVGSV
HAITXYDLKV
VYLEVERXKT
GDVLSYEMTW
QFTI SWDDAI( IKTNM-- VNH I
SISDKLDVKH
XDAIThNLKENK
EKIVIVPPTP
RRKETK
7TNAQQPTSIG
AVDEVKKDPN
IAKENAEIAA
KGGKFIKATD
NKPVTVTYTG
KDVKTRLTIK
NGSYVKKQAD
TTKGKSNVPV
KEVTDGQ-KNX
VKVYQADASL
YKYLLVLPFV
SLHDKDIPLQ
GXXTLKAGTD
GIJVENTQTEN
DLJKGYDKDFA
G'IVTXSAKDP
PKVXPKXD)VV
DKFSGQWSVF
NVAHSWKAF I
KTPQAPVEPL
EF063-3 (SEQ ID NO:239) GGA ATTAGATACG CAACCAGAAA CAACGACGGT TCAACCCAAT
C
AACCCCGACC
GTACAAAA.AG
GGTACTGA.AC
GGAGCTG-AAA
CTAGGGCAGC
CCTAAAGAAT
AACATTACCG
GCTGCAAAAG
CAAGCAGCTI
GCAAAAAACA
AAGAACGAAA
GTCGTGACGA
GAFTTTAATA
ATCOGNGGGA
TCTAATrGGTG
GGACTAAACG
CAATCCTCAC
ACNGCTTTTA
AAGTTCTTTG
GCGCTGTCTT
GATrrTTGGC
GATGGAAAAT
AATAGTGATT
CAATfGGNC
GTATCTAGTG
ATTTTCA-ATT
TGCAGTCAGA
ACACTACT'C
AAAGTTCAGC
AATCAGCACA
CAACAGAAGT
TTGAAACACC
TTGTTGAAAA
AAAAAGAAGT
AATTAAAAGC
AAGCNGAAAA
ACAGCTATGT
AAGACACTAA
AAGTAAATGC
AAGNTACTGG
GGTATCGCT
CTAGTTATT
CAAGCCAAAG
TTGGTACAAA
ATGCGTCAG
CTTTAAAI'C
CNAACAATC
TTTACTCACC
GGGACGCTGT
GTATTTCCTT
CGCAATGGTT
ATGOGGMTCC
AAAGGAA.ACA
TCAACCGACC
TACCCCAAAT
AGAACAACCA
TGCACCAGCT
AGACGTTGAT
ACCAGCAGAA
AGACCAACTA
CAAAAATGAA
AGAGCGNTAN
CAATGAACCC
AATL'CGTCG
AGGGGATICA
CAACTTCCAG
TCTTTTAGAA
AGGACGTAAA
TGGAACGTTA
CAGAGTCAAT
TAAAGAAGTA
AAGTTTAACG
GTTCAAATAC
GGAAGATATT
AGGTCACAAG
TTCTTTTGCT
TGCCTTTAGN
AAAAGAACCA
CCTAAAACGG
AAAGTAGAAG
GATACCACAA
GTAGTAAGCC
GAAAATGAAG
AAAGCAG LTG
GACTTAGGCA
CAAAAAGAAC
AAAATTGCCA
GANAAAGAAG
ATTAGTAAAA
ATTAAAGGCG
AAAGATATCT
AATTCCTITG
AAAAATAAAC
ATI'ACAAAAG
AATGCAGTAT
GGTAAGGATG
CTACCAGATA
AATAAAGGTG
ATTAATGGNT
GACTATGGCA
AATGCCTACT
ATIACAACAA~
ACTAACTTAA
GAAAAAGCAA
CAGTATCTGA
AAGTAGCGCC
ACGCGCAACA
CTGAA.ACAAC
TGAATAAATC
ATGAAGTAAA
ACGTTITTC
AAGCGAAAAA
AAGAAAATGC
TCGCNGAATA
ACCTAGTGTT
GAAAATTTAT
'ITACAAAAT'
TAAAAGAGGC
CAGTGACAGT
CAGALT'rGT
TI'TCAAACGA
TTAAAACACG
AAGATAGTCC
GCCATGCGGA
CNTAT'CTGAA
CAGGACCTTC
TI'GGrI'CAGG
AAGGAAAAAG
ATr-CGCA.ATC
CGATTAAT
AGAAGCAACA
AGAAAATAAA
ACCAACAGTA
CAATGAACCT
AACGTCCATI7
AAAAGATCCA
TAAAGAITTA
GATTGCCCA-A
AGAAATTGCG
CAACAAGCAT
CGATCAATCT
CA.AAGCAACT
ACGGAAAGAT
AAATCTTGGG
GACCTATACA
TTATG-AACTA
TCcGATTATC
CTTAACGATT
ATTTGCGTAT
ATTTGTrCT
AAAACAAGCG
TGGATTGAAA
TGTAGGTCTA
TAATGTGCCT
AGTGAAGCCT
CAATNGATAC
TABLE 1. Nucleotide and Amino Acid Seqeuenccs of E. faecalis Genes.
AAAGCCAATG TCGTTCCTCT NCT'TGTCCCN AA4TAAAGAAG NTCAATGATT TAAATGTGAA NCGTGGCGAT TC=TTACAAT ACAGAACTTG CCAAAGTAGA TCCAAAA.ACA GTAACNAAAC CATGCAGAAA AAGTGACCAT TGAITI'ATCC AAAGTG TCACTGATGG NCAGAAAAAT ACATTGTCAC ACGGGATACG AAGGGATTCG AGATACNTT EF063-4 (SEQ ID NO:240) ELDTQ PEITI'VQPNN
PDLQSEKETP
ARK SAQEQ PV
I'VVEKPAED
KNKAEKE.XX
FNKVNAGDSK
LNASYLGRK I
FFDASGKEVL
CKFYSPEDID
SSAQWFAFXT
NDLNVXRGDS
KTAVSEEATV
VSPETTNEPL
LGNVSSKDLA
KEVAEYNKHK
DIFTKLRKDM
TKAEFVYELQ
PDKDSPFAYA
YGTGPSGLKN
NLNAQSVKP I LQY IVTGDTT
QKDTTSQPTK
GQPTEVAPAE
AKEKEVDQLQ
NE-NSYVNEAI
GGKXTCNFQN
SSPSQSGTLN
LSSLNSSLTN
SDWDAVGHKN
FNYGNPKEPE
EL.AKVDPKTV
VEEVAPENFG
NEVNKSTSIP
K-2QAKK IAQQ
SKNLVFDQSV
SFVKEANLGS
AVFSN)P I IT
KGGHAEFVSD
AYFGSGVGLA
KATIEFNXYK
TKQGIRDTFD
TEQSSATPND
KEFETPOVDK
AAELKAKNEK
VTKDTKISS I
NGGYAV'LLEK
AF IGTNRVNG
FGANNAFKYI
NCRI SFSFGM
ANVVPVLVPN
AEKVTIDLSK
TTNAQQPTVG
AVDEVKKDPN
IAKENA7EIAA KGGKF IKATD
NYPVTVTYTG
KOVKTRLTIK
NGSYVKKQAD
TKGKSNVPV
KEVTD)GQKNX
v1 EF064-1 (SEQ ID NO:241) a a C a a a. a a a.
TGATTCTTGA
CACTGGGTAA
GATAATGTAC
AACCCCGACC
GTACAA.AAAG
GGTACTGAAC
GGAGCTGAAA
CTAGGGCAGC
CCTAAAGAAT
AACA'FrACCG
GCTCAAA.AG
CAAGCAGCTG
GCAAAAAACA
AAGAACGAAA
GTCGTGACGA
GATTTAATA
ATGGGNGGGA
TCTAATCGGTG
GGACTAAACG
CAATCCTCAC
ACNGCTTTTA
AAGTTCTTTG
GCGCTGTCTT
GXT=GGGO
GATGGAAAAT
AATAGTGATI'
GCNA.ATGGNC
GTATCTAGTG
AITIIICAAT'
AAAGCCAATG
ACCAACAA.AT
CTGTCCCTAT
AAGCCGCGGA
TGCAGTCAGA
ACACTACTTC
AAAGTTCAGC
AATCAGCACA
CAACAGAAGT
TTGAAACACC
TTGTTGAAAA
AAAAAGAAGT
AATTAAAAGC
AAGCNGAAAA
ACAGCTATGT
A.AGACACTAA
AAGTAAATGC
AAGNTACTGG
CGTATGCGGT
CTAGTTATIT
CAAGCCAAAG
TTGGTACAAA
ATGCGTCAGG
C'ITTAAATTC
CNA.ACAATC
TTTACTCACC
GGGACGCTGT
GTATTCCTI'
CGCAATGGTT
ATGGGAATCC
TCG'ITCCTGT
GAAAGCAAAA
TCTT=TCTA
A'FrAGATACG
AAAGGAAACA
TCAACCGACC
TACCCCAAAT
AGAACAACCA
TGCACCAGCT
AGACGTTGAT
ACCAGCAGAA
AGACCAACTA
CAAAA.ATGAA
AGAGCGNTAN
CAATGAAGCG
AATTTCCTCG
AGGGGATTCA
CAACTTCCAG
TCTTT'TAGAA
AGGACGTAKA
TGGAACGTTA
CAGAGTCAAT
TAAAGAAGTA
AAGTTTIAACG
GTTCAAATAC
GGAAGATATT
AGGTCACAAG
TTCTTTTGGT
TGCCTTTAGN
AAAACAACCA
NCI'CTCCCN
AAACAATATA
AGTGTG=IAG
CAACCAGAAA
CCTAAAACGG
AAAGTAGAAG
GATACCACAA
GTAGTAAGCC
GAAAATGAAG
AAAGCAGT7G
GACTI'AGGCA
CAAAAAGAAC
AAAATTGCCA
GANAAAGAAG
ATTACTAAAA
A'FIAAAGGCG.
AAAGATATCT
AATTCCTTrG
AAAAATAAAC
ATTACAAAAG
AATGCAGTAT
GGTAAGGATC
CTACCAGATA
AATAAAGGTG
ATTAATGGNT
GACTATGGCA
AATGCCTACT
ATGACAACAA
ACTAACTTAA
GAAAA.AGCAA
AATAAAGAAC
TCITI'ACAAT
GTAACNAAAC
AGACATATA-A
GAGCCCGTAGG
CAACGACGGT
CAGTATCTGA
AAGTAGCCCC
ACGCGCAACA
CTGAAACAkAC
TGAATAAATC
ATGAAGTAAA
ACGTTTCTTC
AAGCGAAAAA
AAGAAAATGC
TCGCNGAATA
ACCTACTGTT
GAAAAT1'TAT
TTACAAAATT
TAAAAGAGGC
CAGTrACAGT
CAGAATTTGT
TTTCAAACGA
TTAAAACACC
AAGATAGTCC
GCCATGCGGA
CNTATGTGAA
CAGGACCTTC
TTGGTTCAGG
AAGGAAAAAG
ATGCGCAATC
CGATTGAATT
TCAC7CATGG ACATrTCTCAC
AAGGGATTCG
AGCTAAGAAT
ATTAGCTACT
TCAACCCAAT
AGAAGCAACA
AGAAAATAAA
ACCAACAGTA
CAATGAACCT
AACGTCCAT
AAAAGATCCA
TAAAGATTTA
GATTGCCCAA
AGAAATTGCG
CAACAAGCAT
CGATCAATCT
CAAACCAACT
ACGGAAAGAT
AAATCrPTGGG
GACCTATACA
rTrATGAACTA
TCCGAITATC
CITAACGATT
AT=rCGTAT A'TrGTTrT
AAAACAAGCG
TGGATTGAAA
TGTAGGTrCTA
TAATGTGCCT
AGTGAAGCCT
CAATNGATAC
NcAGAA.AAAT
AGGGGATACG
AGATACN=I
NTCAATGA'IT TAAATGTGAA NCGTGCCGAT ACAGAACTTG CCAAAGTAGA TCCAAAAACA TABLE 1. Nucleotide and Ami no Acid Seqeuences of E. faecalis Genes.
GATCCAGAAA
CTNAACGANA
GTGACTGC7TT
AACGCNCACG
GTAGTGAAAA
GAAACGGTAA
AAAGCAGATA
CAAACAAAAA
ACNGAAGAAT
TGGCACGCTA
GATATTTCTC
AATCAAGCAT
TCTGTGTATC
AACTACAACA
AAACCAACCA
CGTGGTGATG
CCTTTGATA
AANGTCACAC
AACCAGTTCA
CCACAAGCCT
GTCAAAGCCG
CGAATTAAAA
GrI'ATTAAAC
TTCTTCTATC
TGGTCGATTA
TITGCCAATT
TCGAAACTAT
TTTTNGATG
ATGGTGTAG
AATGAGAAGA
CCAGAAAAAA
TTAGTGGTAG
GTCTTATTAA
AAACGTAGAA
AAGTGACGAT TGA=~ATCC AAGACTNAAA AGCTr.TTGCT CTI'ATGANCT CAATITAGAT GNTCNCTTGT TTTAGCAATG ATCTACAAGG CGA=ITrGAA CAAATACAGT GATTAACCAT AAAACGGTAC AGTIGGCAGT TTTATTATGA AGTCALAATCT GGGCCATGAA TGATGTCTTG ?I'ACNAANTA TGACCTTAAA CCTACATTCT TITAGAAAAC TATTGGCNGC NTTAAATGA-A TGGAAGTCGA ACGGATNAAA AAGACCTTGT NCGTTCTAAT AACCCjTTCA TAACAAGAAA TI'CTTCTTA TGAAATGACN CAGTCGATCT TGCGACAGGC CAATCAAAGA CTTACI1TCGT CGATCTCNTG GGACGATGCC TTATTCTAGC GNATGGTGG ATCTI'CTGG NGATCTITTAT CCAATACNGT TGTCAACCAT TNGGTGACAA ACAAAGTCAA AATTTACAAG TAGTCACATT GCGATAALACT AGACGTCAAA CTAATTTTGT TTTAGCAGAC TCACGATCAC CTTTGAACAA CGATGAATCT AAAAGAAA-AC AACGAATTGC GGCAGGAGAC 'TTAAAACNAA TACCTAGTG CAGTGATTGT ACCACCAACA AAAAGGCAAG TGTNGTGCCA CGGTAGCTGG TAGTF'ACC AAGAAACAAA ATAA
APLAGTGALAAG
GCAGCNATTA
CAAAACACCG
GGGTATAAAT
AATACAGCTG
GTGCCAGGTA
GTTTCTCTAC
TCCGAACGTC
GACACGACCC
GTAGGGGANA
AA.AGACAATA
GA-ACAAT-A
ACAGGTGACG
ACNGTGGTGA
GCGGAAGANA
TGGGACTTAA
GIT7C=TCT
GTCAAAGATT
AAAGGCACGG
CAAGAATTC
AATTCAGCGG
ATTCCAAAAG
AATGGNGCCA
CCTGCAGAAT
CATGACAA.AT
GGAACCAAAG
GGGGTAGTGA
AAAAACG.TTG
GTTTACAACA
ACNCATACGC
CCAAAA.ACAC
GAATTGCCGC
GCAATGCTTG
T'rTATCAAGC
ATTCACOAAN
TCACAGCAAT
ATTTACTTGT
TTCAGCTGAC
GTAATCCTTC
ATGATAAAGA
CACCNAACTA
ATGATCGTTT
AAACGTTAAA
AAGACTTGAC
AAGTACCCAA
TAGAAAACAC
CGCATACNCC
TTAANCATGG
AAGGGTACGA
TCGATGATTA
CTAA-A=GGN
TGACNATNTC
GTGTAACNCT
AACAAAATAC
TGAANCCTAA
CAATCAAAT
ACGCTGGNGT
TTAGTGGCCA
TGAATAAAGG
AAATCACGC
CACACTCATG
CAATCGAAGA
CAGA-AAA6ACC
CGCAAGCACC
AAACAGGCGA
GCTTAGCAGG
AGACGCAAGT
AGCTAA-AGAC
GATGAAAACC
CTTGCCGTTT
AAANGATGGN
CAAAGATGTA
TATTCCGTTA
TGGCGGAATN
CACAGGNAAA
AGCAGGAACA
GTTACNATG
ACAAGCTTGG
GCAA.ACAGAA
TGATGATCCA
AA.AACTNGCT
TAAAGAC=?
CGATGAAACG
AGACAITACG
TGCCAAAGAC
CCCTACAAAA
ATTTGGNCAA
A.AAAGACGTG
AGGGGAGAAN
TGTGGAAGAA
ATGGTCTGTG
GGACGACATT
CAGTCAAGCC
GAAAGCGTTC
ATCTI'CAAC
ACAAACNCCA
AGTAGAGCCA
AAAACAAAAT
CTTAGGCTT
ft.
ft ft.
ft ft.. ft ft.
ft ft ft ft ft ft ft...
ft ft ft...
ft ft ft.. ft ft. ft ft ft ft ft ft.
EF064-2 (SEQ ID NO:242) MKAKK QYKTYKAKNH WVTVPILFLS VLGAVGLATD NVQAAELDTQ PETTI'VQPNN
PDLQSEKETP
AEKSAQEQPV
ITVVEKPAED
KNKAEKERXX
FNKVNAGOSK
LNASYLGRK I
FFDASCKEVL
GKFYSPEDID
SSAQWFAFXT
NDLNVXRGDS
NXKDXKAVAA
VKNVEGDFEN
TKIYYEVKS
I SAYILLENK
YNKELVRSNT
KTAVSEE-ATV
VSPETTNEPL
LGNVSSKDLA
KEVAEYNKHK
D IFTKLRKDM
TKAEFVYELQ
PDKDSPFAYA
YGTGPSGLKN
NLNAQSVKP I
LQYIVTGTT
AINSGXAKDV
TAVQLTXDGE
ERPANYGGXT
DNKDLTFTN
VVTHTPDDPK
QKDTTSQPT(
GQPTEVAPAE
AKEKEVDQLQ
NENSYVNE-AI
GGKXTGNFQN
SSPSQSGTLN
LSSLNSSLTN
SDWDAVGHKN
FNYGNPKEPE
ELAKVDPK'IV
TASYXLNLDQ
TVTN'IVINHV
EEWCMNDVLD
QALLAALNEG
PTKAVHNKKG
VEEVAPENKG
NEVNKSTSI P
KEQAKKIAQQ
SKNLVFD)QSV
SFVKEANLGS
AVFSNDP IIT
KGGHAEFVSD
AYFGSGVGLA
KATIEFNXYK
TKQGIRIDTFD
NTVTAMMKTN
PGSNPSKDVK
TrHDRFTGKW
SNXVCKQAWS
EX IXHGKVAR
TEQSSATPND
KEFETPDVDK
AAELKAKNEI(
VTKDTKISS I
NGGYAVLLEK
AF IGTNRVNG
FGANNAFKYI
NGRI SFSFGM
ANVVPVLVPN
AEKVT:-DLSK
ADGSVVLAMC
ADKNGTVGSV
HAITXYDLKV
VYLEVERXKT
GDVLSYEMTW
TTNAQQPTIVG
AVDEVKKDPN
IAKENAEIAA
KGGKFIKATD
NKPVTVTYTG
KDVKTRLTIK
NGSYVKKQAD
TTKGKSNVPV
KEVTDGQKNX
VKVYQADASL
YKYLLVLPFV
SLHDKDIPLQ
GXKTLKAGTD
GDVENTQTEN
DLKGYDKDFA
145 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. facalis Genes.
FDTVDLATGV
QAFILA.XGGQ
IK'JGDKQSQN'
ANSNFVLADC
GVER IAAGDV
VVEKASVVPE
SFFDDYDETX
ELRVTLPTKV
GATIKLGEXF
TKVNKGDDIS
YNT lEES FNN
LPQTGEKQNV
VTPII(DLLRV
KADVSGDVYN
FYEFTSSDIP
KLFTMTFEQG
EKIKTNTVVT
LLTVAGSLAA
KDSKGXD ITN
SAEQNTFGQR
AEYACVVEEW
VVKITASQAF
IITPEXPQTPP
MLGLAGLGFK
QFTISWDOAK
I KTNTVVNI
SISOKLDVKH
XDAMLKENK
EXKTVIVPPTP
RRKETK
GTVTXSAKDP
PKVXPKX(DVV
DKFSGQWSVF
1NAHSWdKAF I
KTPQAPVEPL
EF064-3. (SEQ ID NO:243) AGTGACGAT TGATTTATCC AAAGTGAAAG TTTATCAAGC AGACGCAAGT C* C
CTNALACGANA
GTGACTGCT
AACGCNGACG
GTAGTGAAAA
GAAACGGTAA
AAAGCAGATA
CAA.ACAAAAA
ACNGAAGAAT
TGGCACGCTA
GATATTTCTG
AATCAAGCAT
TCTGTGTATC
AACTACAACA
AAACCAACCA
CGTGGTGATG
GCCTTTGATA
AANGTGACAC
AACCAGTTCA
CCACAAGCCT
GTCAAAGCCG
CGAATI'AAA.A
G ETATI'AAAG TTC'rrCTATG
TGGTCGATTA
?I'TGCCAATT
TCGAAACTAT
TI TTNGATG
ATTGGTGTAG
AATGAGAAGA
CCAGAAAAAA
'ITAGTGGTAG
GTC ETATTAA
AAACGTAGAA
AAGACTNAAA
CTTATGANCT
GNTCNGTTGT
ATGTAGAAGG
CAA.ATACAGT
AAAACGGTAC
TTTATTATGA
GGGGCATGAA
TTACNAANTA
CCTACATTCT
TATTGGCNGC NTTAAATGA.A TGGAAGTCGA ACGGATNAAA AAGAGCTTGT NCGTTCTAAT AAGCCCTTCA TAACAAGAAA TTCItTICTTA TGAAATGACN CAGTCGATCT TGCGACAGGC CA.ATCAAAGA CTTACTTCGT CGATCTCNTG GGACGATGCC TTATTCTAGC GNATGGTGGG ATGT=CTGG NGATGTTTAT CCA.ATACNGT TGTCAACCAT TNGGTG.ACAA ACAAAGTCAA AATI'TACAAG TAGTGACATT GCGATAAACT AGACGTCAAA CTAATTGT TTTAGCAGAC TCACGATGAC CTTTGAACAA CGATGAATCT AAAAGAAAAC AACGAATTGC GGCAGGAGAC TTAAAACNAA TACGGTAGTG CAGTGATTGT ACCACCA.ACA AAAAGGCAAG TGTNGTGCCA CGGTAGCTGG TAGT=TAGCC AAGAAACAAA ATAA
AGCTGTTGCT
CAA=TAGAT
TTAGCAATG
CGATTTI'GAA
GATTAACCAT
AGTTGGCAGT
AGTGAAATCT
TGATGTCTI'G
TGACCT'rAAA
TTTAGA-AAAC
GCAGCNATI'A
6AAAACACCG
GGGTATAAAT
AATACAGC EG
GTGCCAGGTA
GTTI'CTCTAC
TCCGAACGTC
GACACGACCC
GTAGGGGANA
AAAGACAATA
GGAAGCAATA
ACAGGTGACG
ACNGTGGTGA
GGGGAAGANA
TGGGAC'IWAA
GrrCTT'rCT
CTCAAAGATT
AAAGGCACGO
CAAGAATTGC
AATTAGCGG
ATTCCAAAC
AATGGNGCCA
CCTGCAGAAT
CATGCACAAAT
GGA.ACCAAAG
GGGGTAGTGA
AA-AAACGTM'
GTTTACAACA
ACNCATACGC
CCAAAAACAC
GAATT'C CCGC GCAATGCTrIG
ATTCAGGAAN
TCACAGCAAT
ATTTACTTGT
TTCArGtTGAC
GTAATCCTTC
ATGATAAAGA
CAGCNAACTA
ATGATCGTTT
AAACGTTAAA
AAGACTTGAC
AAGTAGGCAA ACAAGCTTGG TAGAAAACAC CCAAACAGAA CGCATACNCC TGATGATCCA TTAANCATGG AAAAGTNGCT A-AGGGTACGA TAAAGACTITT TCGATGAITA CGATGAAACG CTAAAGGGGN AGACATTACG TGACNATNqTC TGCCAAAGAC GTGTAACNCT CCCTACAAAA AACAAAATAC ATTTGGNCAA TGAANCCTAA AAAAGACGTG CAATCAAATT AGGGAGAAN ACGCTGGNGT TGTGGAAGAA 'ITAGTGGCCA ATCCTCTGTG TGAATAAAGG GGACGACATT AA-ATCACGGC CAGTCAAGCC CACACTCATC GAAAGCGTTC CAA.TCGAAGA ATC'FrrCAAC CAGAAAAACC ACAAACNCCA CGCA.AGCACC AGTAGAGCCA AA ACAGCCA AAAACAAAAT GCTTAGCAGG CTTAGGCTTI
AGCTAAAGAC
GATCAAAACC
CTTGCCGTTT
AAANGATGGN
CAA.AGATGTA
TATTCCGTTA
TGGCGGAATN
CACAGGNAAA
AGCAGGAACA
aTTTACNATG EF064-4 (SEQ ID N0:244) VTIDLSK VKVYQADASL NXKDXKAVAkA AINSGXAKDV VKNVEGDFEN TAVQLTXDGE TKIYYE'VKSS ERPANYGGXT I SAY ILLENK DNKDLTFTMN YNKELVRSNT VVTHTPDDPK FDTVDLATGV SFFDDYDETX QAFILAXGGQ ELRVTLPThV
TASYXLNLOQ
TVTNTVINHV
EEWGMNDVLD
QALLAALNEG
PTKAVHNKKG
VTP IKDLLRV
KADVSGDVYN
NTVTAMMKTN
POS14PSKDVK
TTHDRFTGKW
SNKVGKQAWS
EXIXHGKVAR
Kl:SICCXrJITN
SAEQNTFGQR
ADGSVVLAMG
ADKNG'IVGSV
HA ITXYDLK(V
V'YLEVERXKT
GDVLSYEMW
QFTISWDMDAI(
IKTNTVVNHI
YKYLLVLPFV
SLHDKD IPLQ
GXKTLKAGTD
GDVENTQTEN
DLKGYDKOFA
GTVTXSAKDP
PKVXPKKDVV
146 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
IKVGDXQSQN GATIKLGEXF FYEFTSSDIP ANSNFVLADG TKVNKGDDIS KL.FTMTFEQG GVERIAAGDV YNTIEESFNN EKIKTNIVVT
VVEKASV
AE:YAGVVEE:W
VV-KITASQAF
HTPEKPQTPP
SI SDKLDVKH DKFSGQWSVF XDAMNLXNK NVAi{SWKAPI EKTVIIVPPTP KTPQAPVEPL EF065-1 (SEQ ID NO:245)
TAGCGAAAGA
ATGGTGXITT
GAAGCGGTTA
AGTCTGGCTr,
CAAACGGAAC
TTTCCACATG
GTGATGCTGG
ATTAATTCAG
AGCCAATTTG
GGATTGAAAC
ACGTATTTCC
AAGACCAATA
TCAGTGGAAT
GAAA TTACTA
AGTI'CAGTCA
GAG ITGCAAC
TTTACAACCC
GCTTCA'rrAA
GGAAATGATG
TACGTAGGAA
ACCCTTGTAA
ACGATTCCTA
ACAAAAGATA
TGCCATGTCA
GATGACATTA
GATGTTACAG
AAACAAGCAG
AAAATTAAAA
CCCA.ACCAAG
ACCGTAACAC
TTAGATITAA
GAAACAAGCA
ACTGATGTAA
CAAGAAAATA
AGTGGTCATA
GAATTAGCAC
AACGAAGOTO
GAAGATCCAA
GACCAAGAAT
CAAGCCAGCA
GANGAAAATG
ACTTTTACTA
ATOACTATTA
GAACAAGGCG
CATTCCAACA
AAATAGGGAG
TCGCTGGAAC
AAGCAGGAGA
ATTGTAAACG
CAGTCGATI'
TAAGACAAGC
CTTCATATCG
CTGATTATGA
TCTCTGGTTT
TCGCTTTAGA
TATTAGTGAC
CCAATGATTC
ATAGTAATGA
ACCAAGGCTA
ATTCATACTI'
AAGGGTCTAG
AATTAAAACA
CGATTGCCAA
TrCCCTGTTCA
ACATCACGAT
GTAGTGGGAC
AAAAST;,CAA
TCGAAAATCA
AAACAGCCTT
ATAAAGTGCT
CTAACGGCAC
ACAGCTATGA
CTGACGCAAC
CCGACTTAAA
CACCGCCAGT
CCAACCGCGA
CTTGACCCA
AAGTCACAGA
ACAAAGTAAC
CGTACACAAT
CTTATATTGA
ACGTGTTGCA
CCATTACAAA
TTAAATGGAA
TGGTGGATGA
GCAAACATUT
TGAACAAAAA
CCACTAAAAT
GGA'ITCCCAA
AGCCAACCGT
GATTAAAATG
AGTTGTGGGA
TACAGAAGGA
GATATTGGAA
AGTAGTTCTT
GATTGATGAA
CGGCGGAAAA
TATGAATGTG
TGGAGACGTT
TACGTACAAT
AGATGGGGTC
AATCA.ATCAA
CTACCAAGGT
TGAAATGATT
TGATAAATAT
CACACCAGAA
AATTGTCAAA
TCAATTTGAT
AATTAACGGA
TCACTACGAA
AATCAATCAA
TOCGCATGCG
AGAACAC ETA
TGGCAACGAA
AGATATCATT
AGTAACACAA
CTATI'AAGT
GGACGAAGAA
CTTTGGCAAT
TGATCCAAAT
TCAAGAATTT
AGCCAGCATC
TGAAAATGGT
TTGAAAMG
GACCATTACT
ACAAGGTGGC
TTCCAACAAA
AGATATCGAA
CGTCAAAACA
CATTAATAAA
TACAGATAAT
AGATGACAGC
TAAAACTGAC
CCAAGCCGAC
AACACCGCCT
TTTAAGAAAG
,NATTr-CACTC
ATGACCAATA
GGACAAGCTA
GAAGATGCTA
GTGGTTCAAG
CAATTTATT
CGCGTCA-ATA
CGGACGTATG
CAAACACACG
GCTAATACAC
TATCCAGATC
GCAGCAGCAG
AATGCGTATr
AAAACAGAAG
GATTITATTA
GATCGTCTGG
ATTCAATCTG
CAA-ACCATT'r
GTCAAAGAAA
GGAACAATM'
TGTGACCGTCA
GATTAACCA
ACCAGTACTT
GATGCAAAG
GAAAATAACA
GGTCATACGT
TTAGCGCC'IT
GAAGTGACG
AT1'GCTAAAG
AAATCAACG
GTAGATGACA
AAAGATGTrA
AACAANCAAG
ACTAAAATCA
ATTCCCAACC
CCAACCGTAA
GG.CCAACAC
GCTTTCCGTA
GTG~rAGACA
GGCATAG-TAA
TACTCTTACT
GCA-ACGGATG
TTAAACTTTG
GCACCA.ACGC
CAACGAAATT
CCACATTOGC
CGGTC;AAAGT
C=~CCCAGT
GTGGTkGTTT
GCTTATCTGA
TT-CCTGATGG
CGCAA~rGAC
GTGGTACCC
GAGATf'A.AC
GTTTAGATGG
CAAGACATCC
AAOT'T1AGC
GGGAAAGTGT
TGGGTCCTTT
CAAGCCAATC
CGCAATCGAC
CGACCGCTAC
CAGCAACTAG
ATACAGCGAT
CTAAGGAATT
CGCCAGAAGA
ATCGTIGAAGA
GGACCCAAGC
TCACCGACGA
AAGTAACTTT
ATACAATGAC
ACATIGAACA
TGTTACATTC
ACGTAGAAGG
TCAAAACAGC
TTAATAACT
CAGCTAACGG
CNGACAGCTA
AAGCTAGCGC
AAGCCGACTT
CACCACCTGC
ATr'rAGATTT
ACGAAACAAG
TCACAGACGT
CACAAGAAAA
TAGCTGGTCA
AAGAATTAGC
GCAACGAAGG
CAGA-AGACCC
ATTATCGACA
TTTAGCTGAA
GAAAGACGAC
TCAAGCGGGT
TTCAGATAAT
r-AAGACCGC
AAAGACAAAA
TTATGATAA.A
AACCCCCCCA
GAATCGAAAA
TTACTTGCAT
TCTTCAAGTC
GTTAAACCALA
TGAATCTTTA
TCTAAAACAA
TATTATGAT
ACCAGCAACA
GGACCATr.CT
TACAGAAGGT
TGATGCAGCA
TCCAGAAGCG
TCCAACGATT
TAGTTTCGAT
CAGCATGGTG
AAATGGTAAA
TGAAATGAAC
TATCACCACT
AGGCGGGATT
CAACAAACCA
ACAAGAACAT
TTTCGGTAAC
GTTAGACATC
CAAAGTAACA
TGACTATITA
AACGGACGAA
GAACTTTGGC
ACCAACGCCA
AACCAACCCT
CACATGGACC
GAAAGT1'NCT
TAACAAAGTA
TACATACACA
GCcTTATAT
TGACGTGTTG
AAAAAAACCT
.00.
147 TABLE I. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
GAACCTAAAC AACCGCTAAA ACGAACCCAG TCAATTI'GG ACAACAGTAA ATCCAC ITA TTTGGCATAA CAAAAAATAA ACCGAAA.AAA CCGTTGACGC CTACAAATCA TCAAGCACCA AAAATCAGCA AGTAAAGGAA TTCATIACC AATGACTAAT CATGATCGCA GGTTAATTG TCCTTATAGT GGCTATTAGC AAAAAGAAAA AATAG EF065-2 (SEQ ID NO:246) MF KKATKLLSTM VIVAGTVVGN FSPTLALAEE AVKAGD'rEGM TNTVKVKDDS
LADCKRILEG
MLASYRGGKQ
LKLALDTYNQ
VEYSNDYQGA
LQQGSSTPED
NDVPVQINGQ
I PKNDNAHAC DINKVLDI ID
IKTDATDEEL
DLTNRDQEFK
ENNKVTFEMTh
EGDVLHSNKP
ASMVDDINKV
TITTKIKTDA
PKQPLKPKKP
GITKNKKRKN.,
QATFPVQAGE
FMFPDGKTKI
THGOLTNRKT
AAEVLALNQE
FITSQSIDDF
TISATSTEGY
DVTPEDPTIT
\TKVTDENGKD
APYIEQGGIP
WNVKTAFGNE
XQADSYDYLS
'IVTPPAPTPE
LDITDVKVXX
TDEELAPY IE
LTPTNHQAPT
TEPVDLVVVE
NSADYDMNVR
YFLLVTDGVA
ITNQGYEMI.N
TTQLKQ IVKD
VGNITIHYEV
KDIENQEHLD
VTANGTVTQE
NQADLNFCNE
TSTWTQASMV
GHTYTMTITT
DPTITKD lEG
ENGKDVTDNG
QGGI PNQADL
NPVNFCKSAS
DASGS"FSDNF
VNTQLTYDI<S
NTRLDGYLINK
kYWESVESLS
RLAQSTPATA
KENTAIDAAT
LT.NREDSFOW
NNKVTFEMI'JK
GIDVLHSNKPT
DDINKVLDIT
K IKASATDEE
QEH-LDLTNRD
IVTQENNXVT-
NFGNEGDVLH
KGIHLPMTNT
PHVRQAIDEV
QFVSGFGDVR
TNTNDSINEY
SVNSYFDKYK
SLTIANQFE)I
LVSSG.TMNQG
HVKTAFGNET
QADSYDYLSG
VTPPPVDPNI
DVKVTDENGK
rJAPY IEQGG I Q EFKWNVKTA FTIYIqNKDDSY
SNKPTVTPPA
TVNP LYM lAG
VQGLSDQDRV
TYGGTPTAPG
PDPRHPLQVS
TEVGPFVKQE
QSATATDDAG
TIAKEF PEAT
STWTQASMVD
HTYTMTITTK
AKDSJEGQEHL
DVTANGKVTQ
PNQADLNFGN
FGNETSTWTQ
SYLAGHTYTM
PTPEDPKK?E
L IVLIVAI SF p
S.
EF065-3 (SEQ ID NO:247) GITA AAGCAGGAGA TACAGAACGA ATGACCAATA CGGTGAAAGT GAAAGACGAC
AGTCTGGCTG
GAAACGGAAC
TI'CCACATC
GTGATGCTGG
A'ITAATI'CAG
AGCCAATTTG
GGATTGAAAC
ACGTATTTCC
AAGACCAATA
TCAGTGGAAT
GAAATTACTA
AGTTCACTCA
GAGTTGCAAC
TTTACAACCC
GCTTCATTAA
GrGAAATGATG
TACGTAGGAA
ACCCTT'GTALA
ACGATTCCTA
ACAAAAGATA
TGGCATGTCA
GATGACATI'A
GATGTTACAG
AAACAAGCAG
AAAATTAAAA
ATTGTAAACG
CAGTCGAT'TT
TAAGACAAGC
CTTCATATCG
CTGA'ITATGA
TCTCTCGTTT
TCGCTTITAGA
TATTAGTGAC
CCAATGA ITC
ATAGTAATGA
ACCAAGGCTA
ATTCATACTT
AAGGGTCTAG
AATTAAZAACA
CCAIGCCAA
TGCCTGTT-CA
ACATCACGAT
GTAGTGGGAC
AAAATGACAA
TCGAAAATCA
AAACAGCCTT
ATAAAGTGCT
CTAACGGCAC
ACAGCTATGA
CTGACCCAAC
GATATTGGAA
AGTAG T'GTT GA 1TGATGAA
CGGCGGAAAA
TATGAATGTG
TrGGAGACG TT
TACGTACAAT
AGATGGGGTC
AATCAATGAA
CTACCAAGGT
TGAAATGAT
MATAAATAT
CACACCAGAA
AATT'TCAAA
TCAATTTGAT
AATTAACGGA
TCACTACGAA
AATGAATCAA
TGCGCATGCG
AGAACACTTA
TGGCAACGAA
AGATATCATTI
AGTAACACAA
CTATTTAAGT
GGACGAAGAA
GGACAAG CTA
GAAGATGCTA
GTGGTTCAAG
CAATITATrGT
CGCGTCAATA
CGGACGTATG
CAAACACACG
GCTAATACAC
TATCCAGATC
GCAGCAGCAG
AATGCGTA1T
AAAACAGAAG
GA'ITIrALTTA
GATCGTCTGG
ATTCAATCTG
CAAACCAT!Tll
GTCAAAGAAA
GGAACAATTG
TGTGACGTG.A
GATTtAACCA ACCAGTACTr
GATGTGAAAG
GAAAATAACA
GGTCATACGT
TTAGCGCCTI'
CTTTCCCAGT
GTGGTAGrI
GCTTATCTGA
TTCCTGATGG
CGCAATIGAC
GTrGGTACGCC
GAGATTTAAC
G=AGATIGG
CAAGACATCC
AAC~TT=AGC
GGGAAAGTGT
TGGGTCCTTT
CAAGCCAATC
CGCAATCGAC
CGACCGCTAC
CAGCAACTAG
ATACAGCGAT
CTAAGGAATr
CGCCAGAAGA
ATCGTCAAGA
GGACCCAAGC
TCACCGACGA
AAGTAACTTT
ATACAATGAC
ACATTGAACA
TCAAGCGGGT
F1'CAGATAAT
TCAAGACCGC
AAAGACAAAA
TTATGATAAA
AACCGCCCCA
GAATCGAAAA
TTACTTGCAT
TCTTCAAGTC
GTTAAACCAA
TGAATCITA
TGTAAAACAA
TATrGATGAT
ACCAGCAACA
GGACGATGCT
TACAGAAGGT
TGATGCAGCA
TCCAGAAGCG
TCCAACGATT
TAG 'ITCGAT
CAGCATGGTG
AAATGGTAAA
TGAAATGAAC
TATCACCACT
AGGCGGGATT
TABLE I1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
CCCA-ACCAAG
ACCGTAACAC
ITAGATTTAA
GAAACAAGCA
ACTGATGTAA
CAAGAAAATA
AGTGGTCATA
GAATTAGCAC
AACGAAGGTG
GAAGATCCAA
GACCAAGAAT
CAAGCCAGCA
GANGAAAATG
ACTTTTACTA
ATGACTATTA
GAACAAGGCG
CATTCCAACA
GAACCTAAAC
ACGAACCCAG
C-GACTTAAA
CACCGCCAGT
CCAACCGCGA
CTTGGACCCA
AAGTCACAGA
ACAAAGTAAC
CGTACACAAT
CTTATATTGA
ACGMTGCA
CGATTACAAA
TTAAATGGAA
TGGTGGATGA
GCAAAGATGT
TGAACAAAAA
CCACTAAAAT
GGA'I'CCCAA
AGCCAACCGT
AACCGCTAAA
TCAAT'I=GG
CTI'TGGCAAT
TGATCCAAAT
TCAAGAAT'IW
AGCCAGCATG
TGAAAATGGT
TIT=AAATG
GACCATITACT
ACAAGGTGGC
TTCCAACAAA
AGATATCGAA
CGTCAAAACA
CATAATAAA
TACAGATAAT
AGATGACAGC
TAAAACTGAC
CCAAGCCGAC
AACACCGCCT
ACCGAAAAAA
AAAATCAGCA
GAAGGTGACG
ATTGCTAAAG
AAATGGAACG
CTAGATCACA
AAAGATGTTA
AACAANCAAG
ACTAAAATCA
ATTCCCAACC
CCAACCGTAA
GGCCAAGAAC
G.CTTTCGGTA
GTdITTAGACA
GGCATAGTAA
TACTCT'ACT
GCAACGGA1G
TTAAACTTTG
CCACCAACC
CCGTTGACGC
AGTAAAGGAA
TOTTACATTrC
ACGTAGAACG
TCAAAACAGC
TTALATAAAGT
CAGCTAACCG
CNGACAGCTA
AA(3CTAGCGC
AAGCCGACTT
CACCACCTGC
A'FrAGATTT
ACGAAACAAG
TCACAGACGT
CACAAGAAA.A
TAGCTGGTCA
AAGAATTAGC
GCAACGAAGG
CAGAAGACCC
CTACAAATCA
TT
CAACAAACCA
ACAAGAACAT
'TTTCGGTAAC
GTTAGACATC
CAAAGTAACA
TGACTATTTA
AACGGACGAA
ACCAACGCCA
A.ACCAACCGT
CACATCGACC
GAAAGTTNCT
TAACAAAGTA
TACATACACA
GCCTTATATT
TGACGTGTTG
AAAAAAACCT
TCAAGCACCA
EF065-4 (SEQ ID NO:248) a..
a a
AVKAGDTEGM
LADCKRILE G
MLASYRGGKQ
LKLALDTYNQ
VEYSNDYQGA
LQQGSSTPED
NDVPVQINGQ
IPKNDNAHAC
DINKVLDI ID
IKTDATDEEL
DLTNRDQEFK
ENNKVTFEMN
EGDVLHSNKP
ASMVDDINKV
TITTKIKTDA
PKQPLKPKKP
TNTVKVKDDS
QATFPVQAGE
FMFPDGKTKI
THGDLTNRKT
AAEVLALNQE
FITSQSIDDF
TI SATSTEGY
DVTPEDPTIT
VKVTDENGKD
APYIEQGCIP
WNVKTAFCNE
XQADSYDYLS
TVTPPAPTPE
LDITDVKVXX
TDEELAPYIE
LTPTNHQAPT
TEPVDLVVVE
NSADYDMNVR
YFLLVTDGVA
ITNQGYEM~IN
ITQLKQIVKD
VGNITIHYEV
KOIENQEHLD
VTANGTVTQE
NQADLNFGNE
TSTWTQASMV
GHTYTMTITT
DPTITKDIEG
ENGKDVTDNG
QGG IPNQADL
NPVNFGKSAS
DASGSFSIDNF
VNTQLTYD)KS
NTRLDGYLHK
AYWESVESLS
RLAQSTPATA
KENTAIDAAT
LTNREDSFDW
NNKVTFF M
GDVLHSNKPT
DDINKVLDIT
K IKASATDEE
QEHLDLTNRD
IVTQEiWKVT
NFGNEGDVLH
KGIH
PHVRQAIDEV
QFVSGFCGDVR
TNTNDSINE:Y
SVNSYFODKYK
SLTIANQFDI
LVSSGThMNQG
HVKTAPGNET
QADSYOYLSG
VTPPPVPPNI
DVKVTD2ENGK
LAPYIEQGGI
QEFKWNVKTA
FTMNKKDDSY
SNKPTVTPPA
VQGLSDQDRV
TYGGTPTAPG
POPRHPLQVS
TEVGPFVKQE
QSATATDDAG
TIAKEFPEAT
STWTQASMVD
HTYT.MTITTK
AKDVEGQEHL
DVTANCKVTQ
PNQADLNFGN
FGNETSTWTQ
SYLACHTYTM
PTPEOPKKPE
EF066-1 (SEQ I0 NO:249) TAGCGAAAGA AAATAGCCAG ATGGTGATTG TCGCTGGAAC GAAGCGGTTA AAGCAGGAGA AGTCTGGCTG ATTGTAAACG GAAACGGAAC CAGTCGATTT TTTCCACATG TAAGACAAGC GTGATCCTGG CTTCATATCG ATTAAT-TCAC CTGATTATGA AGCCAATTTG TCTCTGG I=I GGATETGAAAC TCGCTTTAGA ACGTATTTCC TA LTAGTCAC AAGACCAATA CCAATGATTC TCAGTGGAAT ATAGTAATGA
GATTAAAATG
AGT'PGTGGGA
TACAGAAGGA
GATATTGGAA
AGTAG7MTTT
GATTGATGAA
CGGCGGAAAA
TATGAATGr
TGGAGACGTT
TACGTACAAT
AGATVCCGTC
AATCAATC-AA
CTACCAAGGT
TTTAAGAAAG
AAT'IrTCAGTC
ATGACCAATA
GGACAAGCTA
CAAGATGCTA
GTCGGTTCAAG
CAATTTATGT
CG-CGTCAATA
CGGACGTATG
CAAACACACG
GCTAATACAC
TATCCAGATC
CAACGAAATT
CCACATT=C
CGGTGAAAGT
CITICCCAGT
GTGCTACTTT
GCT'rATCTGA
TTCCTGATGG
CGCA.ATTCAC
GTGGTACCC
GAGAT'rrAAC
GTI'TACATGG
CAACACATCC
ATTATCGACA
TTAGCTGAA
GAA.AGACG-AC
TCAAGCGGGT
TTCAGATAAT
TCAAGACCGC
AAAGACAAAA
TTATGATAAA
AACCGCCCCA
GAATCGAAAA
TTACTTGCAT
TC'ITCAAGTC
GCAGCAGCAG AAGT'I=AGC GTTAAACCAA 1 149 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes.
GAAA ITACTA
AGTTCAGTGA
GAGTTGCAAC
TTTACA.ACCC
GCTTCAT'AA
GGAAATGATG
TACGTAGGAA
ACCCTTGTAA
ACGATTCCTA
ACAAAAGATA
TGGCATGTCA
GATGACATTA
GATGTTACAG
AAACAAGCAG
AAAATTAAAA
CCCAACCAAG
ACC.GTAACAC
TTAGATTTAA
GAAACAAGCA
ACTCATGTAA
CAAGAAAATA
AGTGGTCATA
GAATTAGCA.C
AACGAAGGTG
GAAGATCCAA
GACCAAGAAT
CAAGCCAGCA
GANGAAAATG
AC=rTACTA
ATGACTATTA
GAACAAGGCG
CATTCCAACA
GAACCTAAAC
ACGALACCCAG
ACAACAGTAA
TITCGCATAA
ACCAAGGCTA
ATTCATACTT
AAGGGTCTAG
AATTAAA.ACA
CGA'FrGCCAA
TGCCTGTTCA
ACATCACGAT
GTAGTGGGAC
AAAATGACAA
TCGAAAATCA
AAACAGCCTT
ATAAAGTGCT
CTAACGGCAC
ACACCTATGA
CTGACGCAAC
CCGACTTAAA
CACCGCCAGT
CCAACCGCGA
CTTGGACCCA
AAGTCACAGA
ACAAAGTAAC
CGTACACAAT
CTTATA ITGA
ACGTGTTGCA
CGATTACAAA
TTAAATGGAA
TGGTGGATGA
GCAAAGATGT
TGAACAAAAA
CCACTAAALAT
GGATTCCCAA
AGCCAACCGT
AACCGCTAAA
TCAArT-rcc
ATCCACTTTA
CAAAAAATAA
TGAAATGATT AATGCGTATT
TGATAAATAT
CACACCAGAA
AATTGTCAAA
TCAATI'TGAT
AATTAACGGA
TCACTACGAA
AATGAATCAA
TGCGCATGCG
AGAACACTTA
TGGCAACGAA
AGATATCAT'
AGTAACACAA
CTATTTAAGT
GGACGAAGAA
CTTTGGCAAT
TCATCCAAAT
TCAAGAA=T
AGCCAGCATG
TGAAAATGGT
TTTTGAAATG
GACCATTACT
ACAAGGTGGC
TTCCAACAAA
AGATATCGAA
CCTCAAAACA
CATTAATAAA
TACAGATAAT
AGATGACAGC
TAAAACTGAC
CCA.AGCCGAC
AACACCGCCT
ACCGAAAAAA
AAAATCAGCA
CATCATCGCA
AAAAAGAAAA
AAAACAGAAG
GATTT1TATIA
CATCGTCTGG
ATTCAATCTG
CAAACCATTT
GTCAAAGAAA
GGAACAATTG
TGTGACGTGA
GATTTA.ACCA
ACCAGTACTT
GAtGTGAAAG
GAAAATAACA
GCTCATACCT
TTAGCGCCTT
CAAGGTGACG
ATTGCTAAAG
AAATGGAACG
GTAGATGACA
AAAGATGITA
AACAANCAAG
ACTAAAATCA
ATTCCCAACC
CCAACCGTAA
GGCCAAGAAC
GCTTCGGTA
GTGMACACA
GGCATACTAA
TACTCTTACT
GCAACGGATG
TTAAACT=T
GCACCAACGC
CCGTTGACGC
AGTAAAGGAA
GGTTAA=~
AATTAG
GGGAAAGTGT
TGGGTCCTTT
CAAGCCAATC
CGCAATCGAC
CCACCOCTAC
CAGCAACTAG
ATACA.GCGAT.
CTAACGAATT
TGAATCTTTA
TGTAAAACAA
TATTGATGAT
ACCAGCAACA
GGACGATGCT
TACAGAAGGT
TGATGCAGCA
TCCAGAAGCG
CGCCAGAAGA TCCAACGATT ATCGTG.AAGA TAGTI'TCGAT GGACCCAAGC CAGCATGGTG TCACCGACGA AAATGGTAAA AAGTAACTTT TGAAATGAAC ATACAATGAC TATCACCACT ACATTGAACA AGGCGGGATT TGTTACATrC CAACAAACCA ACGTAGAAGG ACAAGAACAT TCAAAACAGC TTTCGGTAAC TTAATAAAGT GTTAGACATC CAGCTAACGG CAALAGTAACA CNGACAGCTA TGACTArI-rA AAGCTAGCGC AACGGACGAA AAGCCGACTT GAACTTTGGC CACCACCTGC ACCAACGCCA ATTTAGATTT AACCAACCGT ACGAAACAAG CACATGGACC TCACAGACGT GAAAGF1'NCT CACAAGAAAA TAACAAAGTA TAGCTGGTCA TACATACACA AAGAATTAGC CCCTTATATT GCA.ACGAAGG TGACGTCTTG CAGA.AGACCC AAAAAAACCT CTACAAATCA TCAAGCACCA TTCATIACC AATGACTAAT TCCTTATAGT GGCTATTAGC a.
a a a a a a a a a.
EF066-2 (SEQ ID NO:250) MF KKATKLLSTM VIVAGTVVCN FSPTLALAEE AVKAGDTEGM TNTVKVKDDS
LADCKRILEG
MLASYRGCKQ
LKLALDTYNQ
VEYSNDYQGA
LQQGSSTPED
NDVPVQ INGQ I PKND1NAHAC DINKVLDI ID
IKTDATDEEL
DLTNRDQEFK
ENNKVTFEM
EGDVLHSNKP
ASMVDDINKV
TITTKIKTDA
QATFPVQAGE
FMFPDGKTK I
THGDLTNRKT
AAEVLALNQE
FITSQSI-DDF
TISATSTEGY
DVTPEDPTIT
VKVTDENGKD
APYIEQGGIP
WNV'KTAFGNE
XQADSYDYLS
TVTPPAPTPE
LDITDVKVXX
TDEELAPY IE
TEPVDLVVVE
NSADYDMNVR
YFLLVTDGVA
ITNQGYEMIN
TTQLKQIVKD
VGNITIHYEV
KDIENQEHLD
VTANGTVTQE
NQADLNFGNE
TSTWTQASMV
GHTYTMTITT
DPTITKDIEG
ENGKDVTDNG
QGGIPNQADL
DASGSFSDNF
VNTQLTYDKS
NTRLDGYLHK
AYWESVESLS
RLAQSTPATA
KENTAI DAAT
LTNREDSFDW
NNKVTFEM
GDVLHSNKPT
DE)INKVLDIT
KIKASATD'EE
QEHLDL'rNRD
IVTQENNXVT
NPGNEGDVLH
PHVRQAIDEV
QFVSGFGDVR
TNTNDSINEY
SVNSYFDKYK
SLTIANQFDI
LVSSGTMNQG
HVKTAFGNET
QADSYDYLSG
VTPPPVDPNI
DVKV'rDENGX
LAPYIEQGGI
QEFKWNVKTA
FTMNXKKDSY
SNKPTIVTPPA
VQGLSDQDRV
IYGGTPTAPG
PDPRIIPLQVS
TE'JGPFVKQE
QSATATDDAG
TIAKEFPE-AT
S1 ATQASMVD
HTYTMTITTK
AKDVEGQEHL
DVTANGKVTQ
PNQADLNFGN
FGNETSTqhTQ
SYLAGHTYTM
PTPEOPKCPE
150 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
PKQPLKPKKP LTPTNHQAPT NPVNFGKSAS KGIHLPMTNT TVNPLYI4IAG LIVL-7VAISF C ITKNKKRKN EF066-3 (SEQ ID NO:251) GGTTA AAGCAGGAGA TACAGAAGGA ATGACCAATA CGGTGAAAGT GAA.AGACGAC
AGTCTGGCTG
GAAACGGAAC
TTTCCACATG
GTGATGCTG
A'ITA-ATTCAG
AGCCAATTTG
GGATTGAAAC
ACGTAICC
A.AGACCAATA
TCAGTGGAAT
GAAATTACTA
AGTTCAGTGA
GAGTTGCAAC
TTTACAACCC
GCTTCATTAA
GGAAATGATG
TrACGTAGGAA
ACCCTICTAA
ACGATTCCTA
ACAA.AAGATA
TCGCATGTCA
GATGACATTA
ATTGTAAACG
CAGTCGATTT
TAAGACAAGC
CTTCATATCG
CTCA ETATGA
TCTCTIGGTTT
TCGCTTTAGA
TATTACTGAC
CCAATGATTC
ATAGTAATGA
ACCAAGGCTA
A ITCATACTT AkAGGGTCTAG
AATTAAAACA
CGATTGCCAA
TGCCTGTTCA
ACATCACGAT
GTAGTCG-AC
AAAATGACAA
TCGAAAATCA
AAACAGCCTT
ATAAAGTGCT
GATATTGGAA
AGTAC ITCTT
GATTGATGAA
CGGCGGAAAA
TATGAATGTG
TGGAGACGTT
TACGTACAAT
AGATCGGGTC
AATCAATGAA
CTACCAAGGT
TGAAATGAT
TGATAAATAT
CACACCAGAA
AATTGTCAAA
TCAATTTCAT
AATTAACGGA
TCACTACGAA
AATGAATCAA
TCCATGCG
AGAACACT'A
TGGCAACGAA
AGATATCATT
GGACAAGCTA
CAAGATC TA
GTGGTTCAAG
CAA'IIA-TGT
CGCGTCAATA
CGtCACGTATG
CAAACACACG
GCTAATACAC
TATCCAGATC
GCAGCAGCAG
AATGCGTATT
AAAACAGAAG
GATFITATTA
GATCGTCGG
ATTCAATCTG
CAAACCATT
GTCAAAGAAA
GGAACAA=P
TGTGACGTGA
GATT-1AACCA
ACCAGTACT
GATGTGAAAG
CTTTCCCAGT
CTCGTAGT
GCTTATCTGA
TTCCTGATGG
CCPATTGAC
GTGGTACGCCC
GAGA=-TAAC
G'ITAGATGG
CA.AGACATCC
AAGTTTTAGC
GGGAAAGTGT
TGGGTCC=T
CAAGCCALATC
CGCAATCGAC
CGACCGCTAC
CAGCAACTAG
ATACAGCGAT
CTAAGGAATT
CGCCAGAAGA
ATCGTGAAGA
GGACCCAAGC
TCA
TCAAGCGGGT
TTCAGATAAT
TCAAGACCGC
AA.AGACAAAA
TTATGATAAA
AACCGCCCCA
GAATCGAAAA
TTACTTGCAT
TCTTCAAGTC
GTTAAACCAA
TGAATCTTITA
TGTAAAACA-A
TATTGATGAT
ACCAGCAACA
GGACGATGCT
TACAGAAGGT
TGATGCAGCA
TCCAGAAC
TCCAACGATT
TAGTTCGAT
CAGCATGGTG
S
S..
*5 5
C.
EF066-4 (SEQ ID. NO:252)
AVKAGDTEGM
LADCKRILEG
MLASYRGGKQ
LKLALDTYNQ
VEYSNDYQGA
LQQGSSTPED
1NDVPVQINGQ I PKNDNAHAC
TNTVKVKDDS
QATFPVQAGE
FMFPDGKTKI
THGDLTNRKT
AAEVLALNQE
FITSQSIDDF
TI SATSTEGY
DVTPEDPTIT
T:EPVDLVVVE
NSADYDMNVR
YFLLVTDGVA
ITNQGYEMIN
TTQLKQIVKD
VGNITIHYEV
KDIENQEHLD
DASGSFSDNF
VNTQLTYDKS
NTRLDGYLHK
AYWESVESLS
RLAQSTPATA
K=NAIDAAT
LTNREDSFDW
PHVRQAIDEV
QFVSGFGDVR
TNTNDS INEY
SVNSYFDKYK
SLTIANQFDI
LVSSGTMNQG
HVKTAFGNET
VQGLSDQDRV
TYGGTPTAPG
POPRHPLQVS
TEVGPFVKQE
QSATATDDAG
TIAKEFPEAT
STWTQASMVD
DIrxCVLDIID VKVT EF067-1 (SEQ ID NO:253)
TAGCGAAAGA
ATGGTGATTG
GAAGCGGITA
AGTCTGGCTC
GAAACGGAAC
T ITCCACATG
GTGATGCTGG
ATTAATTCAG
AGCCAATTC
GGATTGAA-AC
ACGTATTTCC
AAATAGGGAG
TCGCTGGAAC
AAGCAGGAGA
AITGTAAACG
CAGTCGATTT
TAAGACAAGC
CTTCATATCG
CTGA ITATGA TCTCTGGTrr
TCGCTTTAGA
TATTAGTGAC
GATI'AAAATG
AGTI'GTGGGA
TACAGAACCA
GATATTGGAA
AGTAGTTGTT
GATTGATGAA
CGCCGGAAAA
TATGAATGTG
TGGAGACGTT
TACGTACAAT
AGAT'GGGGTC
TTAAAAAG
AATTTCAGTC
ATGACCAATA
GCACAAGCTA
GAAGATGCTA
GTGG ITCAAG CAA TTATCT
CGCGTCA-ATA
CGGACGTATG
CAAACACACG
GCTAATACAC
CAACGAAATT
CCACAMTGC
CGGTGAAAGT
CTTTCCCAGT
GTGGTAGTTT
GCTTATCTGA
TCCTATGG
CGCAA'TMAC
GTGGTACGCC
GAGAMAAC
G ITTAGATGC
ATTATCGACA
TTTAGCTGAA
GAAAGACGAC
TCAAGCGGGT
TTCAGATAAT
TCAAGACCGC
AAAGAcAAAA
TTATGATAAA
AACCGCCCCA
GAATCGAAAA
'FTACTTGCAT
TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
AAGACCAATA
TCACGTGGAAT
GAAA'ITACTA
AGTTCAGTGA
CAGTTGCAAC
TTTACAACCC
GCTTCATTAA
GGAAATCATC
TACGTAGGAA
AC CC TTGTAA
ACGATTCCTA
ACAAAAGATA
TGGCATGTCA
GATGACATTA
GATG'ITACAG
AAACAAGCAG
AAAATTAAAA
CCCAACCAAG
ACCGTAACAC
TTAGATTTAA
GAAACAAGCA
ACTGATGTAA
CAAGAAAATA
AGTGGTCATA
GALATTAGCAC
AACGAAGGTG
GAAGATCCAA
GACCAAGAAT
CAAGCCAGCA
GANGAAAATG
ACTTTTACTA
ATGACTATTA
GAACAAGGCG
CATTCCAACA
GAACCTAAAC
ACGAACCCAG
ACAACAGTAA
TTTCCCATAA
CCAATGATTC
ATACTAATGA
ACCAAGGCTA
ATTCATAC T
AAGGGTCTAG
AAITAAAACA
CGATTGCCAA
TGC^CTGTTCA
ACATCACGAT
GTAGTGGGAC
AAAATGACAA
TCGAAAATCA
AAACAGCCTT
ATAAAGTGCT
CTAACGGCAC
ACAGCTATGA
CTGACGCAAC
CCGACTTAAA
CACCGCCAGT
CCAACCGCGA
CTI'GGACCCA
AAGTCACAGA
ACAALAGTAAC
CGTACACAAT
CTTATATTGA
ACGTG'FrCCA
CGATTACAAA
TTAAATCGAA
TGGTGGATGA
GCAAAGATGT
TGAACAAAAA
CCACTAAAAT
GGATTCCCAA
AGCCAACCGT
AACCGCTAAA
TCAATrTTGG
ATCCACTTTA
CAAAAAATAA
AATCAATGAA
CTACCAAGOT
TGAAATGATT
TrGATAAATAT
CACACCAGAA
AATTGTCAAA
TCAATTAT
AATTA.ACGGA
TC-ACTACGAA
AATGA.ATCAA
TGCGCATGCG
AGAACACTTA
TGGCA.ACGAA
AGATATCATT
AGTAACACAA
CTATIAAGT
GGACGAAGAA
CTTTGGCAAT
TGATCCAAAT
TCAAGAATTT
AGCCAGCATG
TGAA.AATGGT
TTTTGAAATG
GACCATI'ACT
ACAAGGTGGC
TTCCAACAAA
AGATATCGAA
CGTCAAAACA
CATTAATAAA
TACAGATAAT
AGATGACAGC
TAAAACTGAC
CCAAGCCGAC
AACACCGCCT
ACCGAAAAAA
AAAATCAGCA
CATCATCGCA
AAAAAGAAAA
TATCCAGATC CAAGACATCC TCTTCAAGTC CCAGCACCAG AAGT='AGC GTTAAACCAA AATGCGTAIT GGGAAAGTCT TCAATCTTTA AAAACAGAAG TGGGTCCTTT TGTAAAACAA GATTTTAT1'A CAAGCCAATC TATTGATGAT GATCGTCTCG CGCA.ATCGAC ACCAGCAACA ATTCAATCTG CGACCGCTAC GGACGATGCT
CAAACCATI
T CAGCAACTAG TACAGAAGGT GTCAAAGAAA ATACAGCGAT TGATGCACCA GGAACAATTG CTAACCAATT TCCAGAAGCG TGTGACGTGA CGCCAGAAGA TCCAACGATT GATTAACCA ATCGMIAAGA TAGTFTCGAT ACCAGTACTI' GGACCCAAGC CAGCATGGTG GATGTG-AAAG TCACCCACGA AAATGGTAAA GAAAATA.ACA AAGTAACTTT TGAAATGAAC GGTCATACGT ATACAATGAC TATCACCACT TTAGCGCCTT ACATTGAACA AGGCGGGAhTT CAAGGTGACC TCTTACATTC CAACAAACCA ATTGCTAAAG ACGTAGAAGG ACAAGAACAT AAATGGA.ACG TCAAAACAGC TTCGGTAAC GTAGATGACA TTAATAAAGT GTTAGACATC AAAGATGTTA CAGCTAACGG CAAAGTAACA AACAANCAAG CNGACAGCTA TGACTATTTA ACTAAAATCA AAGCTAGCGC AACGGACGPA ATTCCCAACC AAGCCCACTT GAACTGG-C CCAACCGTAA CACCACCTCC ACCAkACGCCA GGCCAAGAAC ATT1TAGA=T AACCAACCGT GC=rCGGTA ACGAAACAAG CACATGGACC GTGTTAGACA TCACAGACGT GAAAGTTNCT GGCATAGTAA CACAAGAAAA TAACAAAGTA TACTCTTACT TACCTCCTCA TACATACACA GCAACGGATG AAGAATTACC GCCTTATAT TTAAACTT7G GCAACGAAGG TGACGTGTTG GCACCAACGC CAGAAGACCC AAAAAAACCT CCGTI'GACGC CTACAAATCA TCAACCACCA AGTAAAGGAA TTCATI'TACC AATGACTAAT GGT EAATTG TCCTTATAGT GGCTAITAGC
AATTAG
*00
S.
S
.00.
5555 55..
S
55..
0
S'S.
S5 S
S.
.5 EF067-2 (SEQ ID NO:254) M~F KKATKLLSTM VIVAGTVVGN FSPTLALAEE AVKAGDTEGM TNTVKVKDDS LADCKRItLEG
MLASYRGGKQ
LKLALDTYNQ
VEYSNDYQGA
LQQGSSTPED
NDVPVQINGQ
I PKNDNAHAC DINKVLDI ID
IKTDATDEEL
DLTNRDQEFK
ENNKVTFEM
EGDVLHSNXP
QATFPVQAGE
FMFPDGKTKI
THGDLTNRKT
AAEVL.ALNQE
FITSQS IDOF
TISATSTEGY
DVTPEDPTIT
VKVTDENGKD
APYIEQGGI P
WNVKTAFGNE-
XQADSYDYLS
TVTPPAPTPE
TEPVDLVVVE
NSADYDMNVR
YFLLVTDGVA
ITNQGYEMIN
TTQLKQIVKD
VGNITIHYEV
KDIENQEHLD
VTANGTVTQE
NQADLNFGNE
TSTWTQASMV
GHTYTMTITTI
DPT ITKD 120
DASGSFSDNF
VNTQLTYDKS
NTRLDGYLHK
AYWESVESLS
RLAQSTPATA
KENTAI DAAT
LTNREDSFDW
NNKVTFEMNKM
GDVLHSNKPT
DDINKVLDIT
KIKASATDEE
QEHLDLTNRD
PHVRQAIDEV
QFVSGFGDVR
TNTNDSINEY
SVNSYFDKYK
SLTLANQFDI
LVSSGTMNQG
HVKTAFGNET
QADSYDYLSG
VTPPPVDPNI
DVKVTDENGK
LAPYIEQGGI
QEFKVINVKTA
VQGLSDQDRV
TYGGTPTAPG
PDPRHPLQVS
TVEVGPFVI<QE
QSATATDDAG
TIAKEFPEAT
STWTQASMVD
HTYTMTITTK
AKDVEGQEHL
DVTANGKVTQ
PNQAOLINFGN
FGNETST'JTQ
152 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
ASMVDDINKV LDITDVKVXX ENGKDVTDNG IVTQENNKVT FTMKKDDSY SYLAGHTYTN TITTKIKTDA TDE-ELAPYIE QGGIPNQADL NFGNEGDVLHi SNK(PTVT-PPA PTPEDPKKPE PKQPLKPKKP LTPTNH-QAPT NPVNFGKSAS KGIHLPMTNT TVNPLYMIAG LIVLIVAISF C ITKNKKRKN EF067-3 (SEQ ID NO:255) GCT AGATATCATT GATCTGAAAG TCACCGACGA AAATCCTAAA
GATGTTACAG
AAACAAGCAC
AAAATTAAAA
CCCAACCAAG
ACCGTAACAC
TTAGATTTAA
GAAACAAGCA
ACTGATGTAA
CAAGAAAATA
AGTGGTCATA
GAATTAGCAC
AACGAAGGTC
GAAGATCCAA
GACCAAGAAT
CAACCCAGCA
GANGAAAAT G
ACTTTTACTA
ATGACTATTA
GAACAAGC
CATTCCAACA
CAACCTAAAC
ACGAACCCAC
CTAACGGCAC
ACAGCTATGA
CTGACGCA.AC
CCGACTTAAA
CACCGCCACT
CCAACCCA
CTTGGACCCA
AAGTCACAGA
ACAA.AGTAAC
CGTACACAAT
CTTATATTGA
ACGTGTTGCA
CGATTACAAA
TTAAATGGAA
TCCTGGATGA
GCAAAGATGT
TGAACAAAAA
CCACTAAAAT
GGATI'CCCAA
AGCCAACCGT
AACCGCTAAA
TCAATTTTC-G
AGTAACACAA
CTATTTAAGT
GGACGAAGAA
CTTTGGCAAT
TGATCCAAAT
TCAAGAATT
AGCCAGCATG
TGA.AAATGGT
TTCAAATC
GACCA PTACT
ACAAGGTGGC
TTCCAACAAA
AGATATCGAA
CGTCAAAACA
CATIAATAAA
TACAGATAAT
AGATGACAGC
TAAAACTGAC
CCAACCCGAC
AACACCGCCT
ACCGAAAAAA
A.AAATCAGCA
GAAAATAACA
GGTCATACGT
TTAGCGCCTT
GAAGGTGACG
ATI'GCTAAAG
AAATGGAACG
GTAGATGACA
AAAGATGTTA
AACAANCAAG
ACTAAAATCA
ATTCCCAACC
CCAACCGTAA
GGCCAAGAAC
GC=ICGGTA
6TGTTAGACA
GGCATAGTAA
TACTCTTACT
CCAACGGATG
TTAAAC=G~
GCACCA.ACC
CCGTTGACGC
AGTAAAGGAA
AAGTAAC'TTT
ATACAAPIGAC
ACATTCAACA
TGTTACATTC
ACGTAGAAGG
TCAAACAGC
TAA.TAAAGT
CAGCTAACGG
CNGACAGCTA
AAGCTAGCC
AAGCCGACTT
CACCACCTGC
ATTTACATTT
ACGAA.ACAAG
TCACAGACGT
CACA.AGAAAA
TAGCTGGTCA
AAGAATTAGC
GC'AACGAAGG
CAGAAGACCC
CTACAAATCA
TT
TGAAATGAAC
TATCACCACT
AGGCGGGATT
CAACAAACCA
ACA.AGAACAT
=TCGGTAAC
GTTAGACATC
CAAAGTAACA
TGACTAT=A
AACGGACGAA
GAAC1TGGC
ACCAACGCCA
AACCAACCCT
CACATGGACC
GAAAGTTNCT
TAACAAAGTA
TACATACACA
GCCTTATATT
TGACGTGTTG
AAAAAAACCT
TCA.AGCACCA
00 00 S S0 0e 0 0
S.
00 0
S.
S.
S S 000 S
SS
S S @06
S
0055 Se
S
*050 0
S.
*550
S
0 0000 0@*0 0 0555 S. S 05 EF067-4 (SEQ ID NO:256) VLDI ID VKVTDENGKO VTANGTVTQE NNKVTFEMINY QADSYDYLSG HTYTMTITI'K
IKTDATDEEL
DL.TNRDQEFK
ENNKVTFEMN
EGDVLH-SNK(P
ASMVDDIN(V
TITTKIKTDA
PKQPLKPKKP
APYIEQGGIP
WNVKTAFGNE
XQADSYDYLS
TVTPPAPTPE
LDITDVKVXX
TDEEL.APYIE
LTPTNHQAPT
NQADLNFGNE
TS IWTQASMV
GHTYTMTITT
DPTITKDIEG
ENGKDVTDNG
QGGIPNQADL
NPVNFGKSAS
GDVLHSNKPT
DDINKVLDIT
KIKASATDEE
QEHLDLTNRD
IVTQENNKVT
NFCNEGDVLH
KG Dl
VTPPPVDPNI
DVKVTDENGK
LAPYIEQGGI
QEFKWNVKTA
FTMNKKDDSY
SNKPTVTPPA
AKDVEGQEHL
DVTANGKVTQ
PNQADLNFGN
FGNETSTWTQ
SYLAGHTYTM
PTPEDPKKPE
EF068-1 (SEQ ID NO:2S7)
TAGGGGAAGC
ATGAAAAAGA
CCCAAGATGC
GTAAGTATAG
CTATTATCAA
CCCCCAA.ACC
TCCGCTGTGT
AATGTAGCTG
TAATGATCT
AAAITGTTGA
TTAATTTAGC
CTGTTACCTC
ATGTTACGTC
AAAATCAACC
FIAGTGGACA
CAGCAGGCAG
GGTATTTATC
GGATTTTAAT
AATATCAAGT
TGGCACAATC
AAATAATGAC
AGTTAATTTC
AAAACAAGCG
CGCAGCAATC
GTTTATTTTA
CGGAAAAGTC
GGTTIATTAT
AGTGCATCAG
AGTGGCACTT
ACGGITPTCT
GTGTTAGTGG
AATACCAATG
AAGAAAAGAG
AGCATAAAAA
'ITACGTCA'Fr CACGCTC IT
CAACGAGTAA
GTGGCGCTT
'TCCTCCTGA
TCACGATTGA
GGACGATCAG
ATGGACAAAA
AGCAATCCCT
GGATATCGAA
TCGTTGGACA
AGCAGATGCT
CTTAAGAGGA
TCTTTCAAAA
153 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.f[accalis Genes.
GTTACTTTTT
ATTACCAGTG
CGTCAATTGG
GAAACGTTAG
GT=TAGCCC
GAGGCAAAAG
CVI'CCTGTCA
GGTGGTTCAG
TACCAACTA
GGAACAGTCG
TCCAACATTT
GTAACAGGTA
GTTGAAATCC
GCGITTACAG
GCGAAAAACG
GCAACCGTAA
GTTAAAGGAA
ATAGGCACAG
GCAGGTGCCA
CCAACAACGT
GTGACAGGTA
GTTGAGATCC
GCATTTACAG
GCGAAAAACG
AATACGCCCG
GAGGTCAAAG
GTCCTTGGTA
ACAGCAACAG
CAACCAGCAA
GGCAACTCTG
GTTCGTGATC
ACTGTAACGC
GATAACGCG
ACAGCACCAA
ACCGCTGATC
GGGACTGCCG
AATGAAACAT
TTCCAAACAC
ACAACACAAG
GCAACAGACG
ACGTTAGCI'
ACTGGACTTG
ATTGGTGACA
AATACCACCA
GATCAAGGAA
ACAGTTGTTC.
GAAGATGCAA
TATCAAGTAA
TTAGTTATTG
GGCACAGCAA
AGTAATCCGA
GTTGACAAAA
GGTACAACAG
ACTGATGGCA
TGACTOCCGT
GGGCGTTAGG
AATTAGTTAA
CAGCTCACGG
AAAATG=T
CTACCAGTAT
AAGGCACGGT
GCGTAAATGA
CCGTTTCAAC
TTCAAACAGA
A=TTGCTGC
ATTCAACAGC
CAAATGCAGG
TTACCGTTCC
CCAGCGGNAC
CCGCACCAAC
CTGCCGATGC
GTACCGCTGA
ATGAAACGTT
TCCAAACACC
A'ITCAACAGC
GAAATGCAGG
TTACCATTCC
CCAGCGGTAC
TGGCGACGCC
GGACTG.CTGA
CTGCAACAAC
CAAATCAAAC
CGGCGACAAC
GTTCGGGTTA
CATCTGGGGC
TACCAACGGG
GAAATGAAAG
CTGTAACAGG
CGAATGCTAC
ATGGGACTOG
TGACAGCGTT
CAGCAGMA
GATATCAAGT
GAACAGTTF
CAGGAAAAGC
AACTCAACC
TTACTGGAGA
TTGAAGTACG
ACTTTACTOT
GAAAAGACCG
CCGTAGCAGC
CCGGCACGGC
CTACCGGAAC
CACCTAACGA
CTCrrCAA
TCACTGGTAG
TTIGAGGTGCG
AATACACAGT
rTTAATGCA
GAATTAACT
TAACATTGAA
CTCATACATT
AAACATCTT-A
CCCAAGTAAT
AAACGTGGCT
GTTAGTGGAT
ACCTICAAAAT
TCT'ITAGAC
AGGCACTACT
AGGTTACGAA
AGGCACCGTA
CGCAGGTGAA
AGAAAGNACG
AATCACAGGA
CAATGCCACG
TGGGACAGGA
AACCGCCGTA
AGCATGA-A
AGGITACGAA
AGGTGC-GTG
CCCAGGTGAA
AGAAAGTACG
AATTGTTGAG
ACTTGCACC
TGGAACTGAC
GCTG-AGCGTT
ACCAGCTGAT
TGAAATTACA
AGTCATTGGT
AACGACCAAT
TCAACCGACT
AGTAACAGGT
CATCGAAATT
T1?CCTTTGCT
AGCCA.AAGAT
AGTAGTIGGCA
GACAGGTACC
AGGCACCGCA
AACAGCTAAA
AACTACAGCT
TTCAACAACT
GAACCCAGAT
GGACCTTCCA
TGACGGCAAT
ACCAACTiGTG
AGAGCCAAAT
GACTGATGGT
AGCCTTAACT
AACACCTGCT
CACGACAAAC
TGACGCCGAT
GAC=TAGAG
GCCAATGAr12
GGTGTTGATA
A.ACTTAGGTG
A.GTGCACCGA
CAAGATTTGA
CTTGTCGCCG
GTTTCACCTG
GCTTCTTTAC
TTATCCAATA
GTTAATTTAT
ACTCA.AGTAA
GTTAAAGGAA
ATAGGCACAG
GCAGGCGCCA
CCAACAACGT
GTGACAGGTA
GTTGAA.ATCC
GCGTTTACAG
GCGAAAAACG
CCAACCGTAA
GTTAAAGGAA
ATAGGTACAG
GCAGGTGCGA
CCAACAACGT
ACTGTAACAG
ACCATTGAGG
GGAAAATATA
GTAGCGAAAAA
GTCACTGCAC
GGAACAGCAG
ACAGGTACCT
CCTGGGGATA
GAAGTCCTTrG
AATTCAG=~
CGTGATGCAG
GTGAACCTTC
CCTGCTGGCA
CCGCCAAGTG
GCTGAACTTG
ACAACTGGAC
CAAACAGTGA
ATGACACCCG
GGTTATGAAA
GGAACAA ITA
GCGGGAGCCG
GAAAGTCAAC
ACGACTGTTA
GTCACCATTG
GCTGCGCAT
GCCAT JXCGA GATCCAGA G
CGCTATCAAG
CCCACAGTCC
CCAGGGAAGG
TAACCAATCT
TTATTTGAC
CTGCTTCATT
TTAGTGATG
ATGCCGGCAGT
CACCTATAAA
CTTTGCCTTTr
TAGGCACAAC
ATTTAGATCC
TAGCAACAGC
CCGCACCA-AC
CTGCCGATGC
GTACCGCTGA
ATGAP ACGTT
TCCAAACNCC
A/FTCAACGGC
GAAATGCAGG
TTACCGTTCC
CC:AGCGGCAC
CCCCACCAAC
CTGCCGATGC
GTACTGCTGA
ATGAAACOTT
TCCAAACGCC
GTAGTACAAC
TTCGCGATGC
CACTGACTTT
ACGCTAGTGG
CAACAGMITA
ACCCTAACAC
CTGATGCGALA
CGTTAACAGT
MICCTGCTGA
CTGGTTATCA
ATGGGAACGT
CAGCTGOGAC
ATACAAGTAC
TCGACAAAGT
GCACCACCAT
CGACTGGCCA
ATGTAG=ITC
CTGATGTI'AC
TCACTGGGAC
TTGGTACAAC
CTAATCCTGG
CAACGGAAGT
CAGGAACAAC
AGATCACAA
TACAATCAC
AAGATGCTGC
CACCAGTCGC
TAGTAGGAGC
TTGGCATGGC
CCTCAGCTAA
GATTACTCAA
GGAAGTGAAT
TACAGCTCCG
TTACGGTTA
TCAAGCTTTG
TGCAGCCTTG
ATTAGCGGTT
CACGGTTACT
TCGTTTTGTA
AGACGGTGTA
AATCACAGGA
CAATGCCACG
TGGGACAGGA
AACCGCCCTA
AGCGGATGA.A
AGGTTACGAA
AGGCACCGTA
CGCAGGTGAA
AGAAAGTACG
AATCACAGGA
CAATGCCACG
TGGGACAGGG
AACCGCCGTA
AGCGGATCCT
AAAAGGCTAT
AGCTGGCACG
AGATTCAGGA
CACGGAAAGT
TAACATCACA
AACAATCGAA
TGGTGATTTT
GATTGGAAAG
TGCCACGGTT
GGTGACAGGC
GATTGCAACA
GGCAAATGCG
ACCGACA-ACC
TACTGGGAAT
TGAAGTTCGT
ATATACTGTG
TAAAAATGAT
CACACCAACA
GGCGGACCCT
GACAACGGAT
TGATACATTA
GACGGTCCCT
TGCCACTGGG
TGAAGCAGGT
TCTTCCGACG
TGGGAAAGAA
GACACCTACT
AGCAGAAGTT
AACTACTGCA
CGAAACAATA
TABLE 1. Nucleotide and Amino Acid Seqeuences of E. *faecalij Genes.
ACTGTCGTAG
GTCGACTTAG
ATCACTGGAA
ATTGCTCCTA
GTGACACCAG
CCAGCTACAG
CAACGAAACA
CAATTTTACA
TTCTCCGTTC
CGAAAAATC
CCACACCAAC
CGGC--GAGCC
CAACTGCTAA
GAGAAACGAT
CCGTTATTCC
AAGCCAATG
ATAGCAGTGA
ATAITGCAGC
AACAGGAAA
CATTGATTCT
AAAA.ACCACT
CGAAACCGGC
TACGATTA2T AGCGGATG TT
CTATACAGTC
ACAAT~A L7G AGGGT1'AGCA ACACATACAC A.AGGAAATGT GAGTCCTAAA
GGAGAACCAG
GCCGGCTACC
CTATTAAGAA
TTAACAGCCG
GGACATTACA
GGCAATGGTA
GCAACTGGCG
GGTTTAGGCA
ACTGGCTACC
GGCTACTTAC
CC 'FTGTCG
AGATITAAA.AT
TCATCAAAGG
GTCTGATTGC
GACAAGCGAC
GTACGGCTAC
ACACTGGCGG
GGAATAATGG
CAACAGGTTC
ACCCTAAACT
CTAGCACACG
CACTCCTTC
TGCGGCACCA
AACAGCTGAA
TGTAGGTGCC
TGCTCAACA-A
AACCTTCATG
AAATAACGGC
GAATGTCA
TGGCTTAGGT
AAGTACCATC
TGAAAAAGAG
GAGCATGGGA
GAAAGTCAGC
ATI'ACCGGAA
ATTrGATGTCC
CAATATACGG
AGCAAAGATG
GTI ITAGCGG
ACTGGAACTG
GCAAGTGGCA
ACAGAAAAAG
ACCACAF1'A
ACTGTTTCTT
CECAAACCGAA
ACCGATGCTG
AGTTTACTTC
ACGCCAGCCG
AATACAGGCG
AACACAGGTT
TCACTAGGCA
AGTTATGGCA
TCTTCAGCCG
ATCATCAAAC
CAGCTACACC
AITCTAGTAA
GTGACGCAGA
TGACTCTACC
CCAGGTAA
CGCCAACTAT
CTGATCCAAA
ATACAACTAC
AAACGTTAAC
TGACGCCAC
CAGTTTTACG
TCA'ITCAAAT
AAGGCAACTT
CGACA:OCTAC
ACCAACGAA
GCAATACAGG
CAAATCCAAA
ATGGCCTCGG
CTGGAAATCA
TCACAACAAG
GCAAACGTAA
AACTACACCA
AGGTTACGAA
CGGAACCATC
AGCTGGCGTA
TGAAAGTICAA
TACGAAGGTT
TrGTCACGGTT
LGGAGGTACC
CGCACTAACC
CGATATTACG
A.ACGTCTAAA
TAGTAACCGA
CGCTATCCA
CGAMGCGCA
TCCMGAGGA
AAACAATGCC
TGGAGGCTCT
TACAAATCCT
CGGGAAAACA
TTTTTTGGC
AAACTAG
o 00% .0.
*.0 0 *0.
1.: EF068-2 (SEQ ID NO:258) M KKKIVEDFNR KSQHKKWTKR KMLNLAISSG LLTSLAIPV S IAVTSGTIS
AVFSGQKQAV
TSGALGNLTG
LAQNWVSNILQ
GSGVNELVDA
NIYFAAGTTS
FTVTVPAGEA
KGTADANATV
TTFQTPADEA
FTVTI PAGEA
VKGTAEVGTT
PATATTPADV
VTLPTGTTNP
ADPNATIEIR
QTPADEVVAP
LASGKATAKQ
TTIEVRNPDG
DATVAAPIVT
TATANEALTA
T'IVEVRDADG
DLATPTIDSI
TPGETITI IS FYNSSEQ4LA
EPEIKIAAPT
TAGQATAQQS
TGGNNGNGSN
YLPSTGE-KES
ASAAVLDIEL
LVVPPELRGN
VDIDLTEVNR
OLNAAVQALE
SLLGTIVTL
EVTAPTITGV
GANETLTAVA
EIRNAGGTVI
TVTAPTITGV
GANETLTAVA
IEVRDAAGTV
TAPTVDNITG
GDTLTVIGKD
DADGNVIATG
PSVDKVTGNT
'IVNVVAKNDT
TI IGTTTqTDD TVTC7TTATGY
IAKDAAGKES
TVLGMATTGT
TNSSKGYEI
KDGAGNESQP
SGNITTGGTF
VSSVLGTSKA
LLATATDGAG
TGSNPNGGSG
SAVTTSLFCA
LSNVTSNNDS GTSTSNRWTA VAAAGSAAIN TNVTIDLSKV
QLELVNNIEN
AKGTS IPSNL
PTTVSTPQNL
TGNSTAGYEV
KNASGTEXTP
CTGTADCTCA
TGNSTAGYEV
KNASGTESTP
LGTXITTCTDG
NSGSGYEITG
NAGNESQPTE
TADCTCSFAV
TQGYQVTGTA
GLESQPTTAM
QCNPTVDLPA
QVTGTAEPNV
NPTAFKTPAD
DGKYTVTLEP
TGTAEPKTTII
ATAVIPADVV
SVHIAAGLAT
GYLIKGTAEP
HYSTATI'FMT
LGTI'GSGLGS
FVALLASMC I
LGAASFTAPE
VAAAINAALL
SNNLDARFVG
KGTADANATV
TTFQTPADEA
FTVTVPAGEA
KGTADANATV
TFQTPADPN
KY IVTLEDSCT
TADPNTTIEV
VLVPADATVT
NLPAGTANAN
ELG'ITIEVRA
TPADVTTPTI
GAANPGD'rLT TIE IRNEAGL
PDAPVATPTV
GKASANETIT
DVRDADGTI I LAAPTiTKvE
EKETLTALTT
NRIIQISNRL
PADPTNPCG
LC-NGLGTNGS
IXRXKhCN
ANQNQPVNFT
TFLTAVLNAA
TLAADGSYIS
PVKG'IVNVAV
TVVQTDLLDV
EIRNAGGTVI
TVTAPT ITO V
GANETLTAVA
EIRNAGGAVI
TPVATP IVET
ATANQTLSVV
RDPSGAVIGT
APTVTGVTGN
ETLTALAKDP
TOG'IVLGTAT
GDITGDS=T
VVGXDGDGNE
VIATGTTDGA
DI(ITGSTTNC
VVAKNATGKE
AATTAN.TCGQ
GNKANGYTVT
DTQGNVSPKT
LRSVIAVGAT
NGN~TGCNNGN
GYHPKLSTIS
VSGGALADAS
NDLTNVITQ I API SDGLGLV
SGALPLLAVG
NLLATADGVS
GTGTADGTGA
TGNSTAGYEV
KNASGTESTP
GTCTADGTGA
VTGSTTKGYE
AKNASGTESQ
GTSDANGDFT
SVAGYQVTGT
AGNTSTPTTF
TGPTGQYTVT
YEITGTADPN
SQPTEVTVPE
GA:FTITLPTG
YQVVGAAEVG
SQPATATTPV
YTVTLPACVV
GTADPNVTV/Q
'rFMTPAD ITO
DAEGNFAIQL
TGGNTGNNGA
YGTGNHGKTOG
155 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
EF068-3 (SEQ ID NO:259) CTC TGGCACAATC AGTGCATCAG CAGCGGTCTT GGATATCGAA
CTATTATCAA
GCCGCAA ACC
TCCGCTGTGT
AATGTAGCTG
GTTACTTT
ATTACCAGTG
CGTCAATTCG
GAAACGTTAG
GTTTAGCCC
GAG-GCAAAAG
CTTCCTGTCA
GGTGGTTCAG
TTACCAACTA
GGAACAGTCG
TCCAACATTT
GTAACAGGTA
GTGAAATCC
GCGTTTACAG
GCGAAAAACG
GCAACCGTAA
GTrAAAGGAA
ATAGGCACAG
GCAGGTGCCA
CCAACAACGT
GTGACAGGTA
GTTGAGATCC
GCAITTTACAG
GCGAAAA-ACG
ATGTTACGTC
AAAATCAACC
TTAGTGGACA
CAGCAGGCAG
TGACTCCCGT
GGGCGTT1AGG
AAT'AGTITAA
CAGCTGACGG
AAAATG'FrTC
GTACCAGTAT
AACCCACGGT
GCGTAAATGA
CCGTTTCAAC
FI'CAAACAGA
ATTTTGCTGC
ATTCAACAGC
GAAATGCAGG
TTACCGTTCC
CCAGCGGNAC
CCGCACCAAC
CTGCCGATGC
GTACCGCTGA
ATGAAACGTT
TCCAA.ACACC
A'ITCAACAGC
GAAATGCAGG
'ITACCATTCC
CCAGCGGTAC
AAATAATGAC
AGTTAATTTC
AAAACAAGCG
CGCAGCAATC
TFTGAATG-CA
GAATTTAACT
TAACATTGALA
CTCATACAT
AAACATCTI'A
CCCAAGTAAT
AAACGGCT
GTTAGTGGAT
ACCTCAAAAT
TCTTTTAGAC
AGGCACTACT
AGGTTACGA6A
AGGCACCGTA
CGCAGGTGA6A
AGAAAGNACG
AATCACAGGA
CAATGCCACG
TGGGACAGGA
AACCCCGTA
AGCGGATGAA
AGGTTACGAA
AGGTGCCGTG
CGCAGGTGAA
AGAAAGTACG
AGTGGCACTT
ACGG'TrCTG
GTGTTAGTGG
AATACCAATG
GCCAATGATT
GGTC'ITGATA
AACI'AGGTG
AGTGCACCGA
CAAGATTTGA
CTTGTCGCCG
G?ITCAGGTG
GCTTCTTTAC
TTATCCAATA
GTTAATTTAT
AGTGAAGTAA
GTTAAAGGAA
ATAGGCACAG
GCAGGCGCCA
CCAACAACGT
CGACACCTA
GTTGAAATCC
GCGTTTACAG
G^CGAAAAACG
GCAACCGTAA*
GTTAAAGGAA
ATAGGTACAG
GCACC.TGCGA
CCAACPLACGT
CAACGAGTA6A
GTGGCGCT
TTCCTCCTGA
TCACGA?1'GA
TAACCAATGT
TTGATTTGAC
CTGC=CATT
TTAGTUATGG
ATGCGGCAGT
CAGCTATAAA
C'LTTGCC'TI
TAGGCACAAC
ATTAGAT-GC
TAGCAACAGC
CCGCACCAAC
CTGCCGATGC
GTACCGCTGA
ATGAAACGTT
TCCA-AACNCC
ATTCAACGGC
GAA-TGCAGG
TTACCGTTCC
CCAGCGGCAC
CCGCACCAAC
CTGCCGATGC
GTACTGCTGA
ATGAAACGT
TCCAAACGCC
TCGTTGGACA
AGCAGATGCT
GTTAAGAGGA
TCTTCAAALA
GATTACTCAA
GGAAGTGAAT
TACAGCTCCG
TTAGGGTTA
TCAAGCTTTG
TGCAGCCTC
ATTAGCGGTT
CACGGTTACT
TCGTTTTGTA
AGACGGTGTA
AATCACAGGA
CAATGCCACG
TGGGACAGGA
AACCGCCGTA
AGCGGATGA6A
AGGTTACGAA
AGCCACCGTA
CGCAGGTGA6A AGAA6AGTACG
AATCACAGGA
CAATGCCACG
TGGGACAGGG
AACCCCGTA
o e 0 o 0 00 0 0 0 0 0 000000 0 0 EF068-4 (SEQ ID NO:260) TSGTIS ASA.AVLDIEL LSNVTSNNDS GTSTSNRIWTA ANQNQPVNFT VSGOALADAS
AVFSGQKQAV
TSGALGNLTG
LAQNVSNILQ
GSGVNELVDA
NIYFAAGTTS
FTV'IVPAGCEA
LVVPPELRGN
VDIDILTEVNR
DLNAAVQALE
SLLGTTTVTL
EVTAPTITGV
GANETLTAVA
VAAAGSAAIN
QLELV14NIEN AXGTS IPSNL
PTTVSTPQNL
TGNSTAGYEV
KNASGTEXTP
TNVTIDLSKV
LGAASFTAPE
VAAAINAALL
SNNLDARFVG
KGTADAMV
TTFQTP
TFLTAVLNAA
TLAADGSYI S
PVKGTNVAV
TVVQTDLLDV
EIRNAGG'IVI
NDLTINV ITQ I APISD3LGLV
SGALPLLAVG
NLLATADGVS
GTGTADGTG.A
EF069-1 (SEQ ID NO:261)
TAGGGGAAGC
ATGAAAAAGA
CGCAAGATGC
GTAAGTATAG
CTATIATCAA
GCCGCAAACC
TCCGCTGTGTT
AATGTAGCTG
GTTACTT=
TAATGATCTT
AATTGI'GA
TTAAT ITAGC
CTCTTACCTC
ATGTTACGTC
AAAATCAACC
'ITAGTGCACA
CACCAGGCAG
TGACTGcCGT
GGTATITATC
GGATTTTAAT
AATATCAAGT
TGGCACAATC
AAATAATGAC
AGTTAATTTC
AAAACAAGCG
CGCAGCAATC
TTTGAATGCA
GTrI'ATTTTA
CGGAAAAGTC
GGTTI'AITAT
AGTCGCATCAG
AGTGGCACTT
ACGGTTTCTG
GTCTTAGTCG
AATACCAATG
GCCAATGAT'
AAGAAAAGAG CGACGATCAG AGCATAAAA ATGGACAAA-A ITACGTCATT AGcAATCCCT CAGCGGTC IT GGATATCGAA CAACGAGTAA TCGTTGGACA GTGGCGCTTT AGCAGATGCT ?I'CCTCCTGA GTTAAGAGGA TCACGA7M'A TCT2I'CAAA-A TAACCAATGT GATTACTCAA 156 TABLE 1. Nucleotide and Am~ino Acid Seqeuences of E. fuecalis Genes.
a.
a a.
a
ATTACCAGTG
CGTCAATTGG
GAAACGTTAG
GT'TTTAGCCC
GAGGCAAAAG
CTTCCTGTCA
GGTCGTTCAG
TTACCAACTA
GGAACAGTCG
TCCAACATT
GTAACAGGTA
GTTGAAATCC
GCGTTI'ACAG
GCGAAAAACG
GCAACCGTAA
GTTAAAGGAA
ATAGGCACAG
GCAGGTGCCA
CCAACAACGT
GTGACAGGTA
GTTGAGATCC
GCATTTACAG
GCGAAAAACG
AATACCCCCG
GAGGTCAAAG
GTCCTTGGTA
ACACCAACAG
CAACCAGCAA
GGCAACTCTG
C'ITCGTGATC
ACTGTAACGC
GATAACGCGG
ACAGCACCAA
ACCGCTCATC
GGGACTGCCG
AATGAAACAT
TTCCAAACAC
ACAACACAAG
GCAACAGACG
ACCTTAGCTT
ACTGGACTTrG
ATTGGTGACA
AATACCACCA
GATCA.AGGAA
ACAGTTGTT'G
GAAGATGCAA
TATCAAGTAA
TTAGTTA=-
GGCACAGCAA
ACTAATCCGP
GTTGACAAAP
GGTACAACAC
ACTGATGC;
ACTGTCGTAC
GGGCGTTAGG
AATTACTTAA
CAGCTGACGG
AAAATGTTTC
GTACCAGTAT
AAGGCACGGT
GCGTAAATGA
CCGTTTCAAC
TTCAA.ACAGA
ATTT'TGCTGC
ATTCAACAC
GAAATGCAGG
TTACCGTTCC
CCAGCGGNAC
CCGCACCAAC
CTGCCGATGC
GTACCGCTGA
ATGAAACGTT
TCCAAACACC
ATTCAACAGC
GAAATGCAGG
ITACCATTCC
CCAGCGGTAC
TGGCGACGCC
GGACMCTGA
CTGCAACAAC
CAAATCAAAC
CGGCGACAAC
G 1TCGGGTTA
CATCTGGGGC
TACCAACGGC
GAAATGAAAG
CTGTAACAGG
CGAATCCTAC
ATGGGACTGG
TGACAGCGTT
CAGCAGATGA
GATATCAAGT
GAACAGTI I
CAGGAA.AAC
AGAGTCAACC
TTACTGGAGA
TI'GAAGTACG
ACTTACTGI
GAAAAGACGG
CCGTAGCAGC
CCGGCACGGC
CTACGGGAAC
CAGCTAACGP
CTGCTTTCAz
TCACTGGTAC
TTGAGGTGCC
LA.ATACACAGJ
;CGAAAAA=G
GAATTTAACT
TAACATTGAA
CTCATACATT
AAACATCTTA
CCCAAGTAAT
AAACGTGGCT
GTT!AGTGGAT
ACCTCAAAAT
TCTTTTAGAC
AGGCACTACT
AGGTTACGA6A
AGGCACCGTA
CGCAGGTGAA
AGAAAGNACG
AATCACAGGA
CAATGCCACG
TGGGACAGGA
AACCGCCGTA
AGCGGATGAA
AGGTTACGAA
AGGTGCCGTG
CGCAGGTGAA
AGAAAGTACG
AATTCTTGAG
AGTTGGCACC
TGGAACTGAC
GCTGCCTT
ACCAGCTGAT
GGTGTTGATA
AAC'?TAGGTG
AGTGCACCGA
CAAGATTTGA
CTTGTCGCCG
GTTTCAGGTG
GCTTCTTAC
TTATCCAATA
GTTAATTTAT
AGTGAAGTAA
GTTAAAGGAA
ATAGGCACAG
GCAGGCGCCA
CCAACAACGT
GTGACAGGTA
GTTGAAATCC
GCGTTTACAG
GCGAAA.AACG
GCAACCGTAA
GTrAA.AGGAA
ATAGGTACAG
GCAGGTGCGA
CCAACAACGT
ACTCTAACAG
ACCATTGAG
GGAAAATATA
GTAGCGAAAA
GTCACTGCAzC rTGA=rrAC G ZTGCITCATT TrAGTCGATGG TI k.TGCGGCAGT T1 CAGCTATAAA TAGGCACAAC C A.TTTAGATGC I3
T'AGCAACAGC
CCGCACCA.AC
CTGCCGATGC C
CTACCGCTGA
A.TGAAACGTI'
rCCAAACNCC
ATTCAACGGC
GA.ATGCAGG
TTACCGTTCC
CCAGCGGCAC
CCGCACCAAC
CTGCCGATGC
CTACTCCTGA
ATGAAACGTT
TCCAAACGCC
GTAGTACAAC
TTCGCGATGC
CAGTGACTTT
ACGCTAGTGG
CAACAGTTGA
ACCCTAACAC
CTGATGCGAA
CGTTAACAGT
TTCCTGCTGA
CTGG'-rATCA
ATGGGAACGT
CAGCTGGGAC
ATACAAGTAC
TCGACAAAGT
GCACCACCAT
CGACTGGCCA
ATGTAGTTGC
CTrGATGTTAC
TCACTGGGAC
TTGGTACAAC
CTAATCCTCG
CAACGGAAGT
CAGGAACAAC
AGATI'CACAA
?I'ACAATCAC
AAGATGCTGC
CACCAGTCGC
TAGTAGGAGC
TTGGCATGGC
CC-TCAGCTAA
CAGCTACAC
GAAGTGAAT
'ACAGCTCCG
~TTAGGGTTA
CA.AGCTTTG
.'ICAGCCTI'G
~TTAGCGGTT
ACGGTTACT
[CGTTGTA
GACGGCTA
ATCACAGGA
AATGCCACG
L'CGGACAGGA
ACCGCCGTA
kGCGGATGA.A k.GGTTACGAA
AGGCACCGTA
CGCAGGTGAA
AGAAAGTACG
A.ATCACAGGA
:AATGCCACG
TCGGACAGG
A.ACCGCCGTA
AGCGGATCCT
AAAAGGCTAT
AGCTGGCACG
AGATTCAGGA
CACGGAAAGT
TAACATCACA
AACAATCGAA
TGGTGATTT
GATTGGAAAG
TGCCACGGTT
GGTGACAGC
GATTGCAACA
GGCAAATGCG
ACCGACAACC
TACTGGGAAT
TGAAGTI'CGT
ATATACTGTG
TAAAAATGAT
CACACCAACA
GGcGGACCCT
CACAACCGAT
TGATACATTA
GACGGTCCCT
TGCCACTGGG
TGAAGCAGGT
TCTTCCGACG
TGGGAAAGAA
GACACCTACT
AGCAGAAGTT
AAcTACTGGA
CGAAACAATA
AACTACACCA
TGAAATTACA GGAACAGCAG AGTCATTGGT ACAGGTACCT AACGACCAAT CCTGGGGATA TCAACCGACT GAAGTCCTTG a .q~
AGTAACAGGT
CATCGAAATT
TTCCTTTGCT
AGCCAAAGAT
AGTAGTGGCA
GACAGGTACC
AGGCACCGCA
AACAGCTAAA
AACTACAGCT
TTCAACAACT
AATTCAG'ITG
CGTGATGCAG
GTGAACC'ITC
CCTGCTGGCA
CCGCCAAGTG
GCTGAACTTG
ACAACTGGAC
CAAACAGTGA
ATGACACCCG
GGTTATGAAA
GAACCCAGAT GGAACAATrA GGACCTTCCA GCGGGAGCCG
TGACGGCAAT
ACCAACTGTG
AGAGCCAAAT
GACTGSATG.GT
AGCCTI'AACT
AACACCTGCT
CACGACAAAC
TGACGCCGAT
GACTTTAGAG
AACAGGAAAA
GAAAGTCAAC
ACCACTOTTA
GTCACCATTG
GC'TGGCGCAT
GCCATTGCGA
GATCCAGATG
GGCTATCAAG
GGCACAGTCC
CCAGGGAAGG
GAAAGTCAGC
157 TABLE 1. Nucleotide and Amnino Acid Seqeuences: of E. faecali-v Genes.
GTCGACTTAG
ATCACTCGGAA
ATTGCTGCTA
GTGACACCAG
CCAGCTACAG
GAAGGAA.ACA
CAATTTTACA
ITCTCCCTTC
ACAGATACAC
GGAGAACCAG
GCCGGCTACC
CTATTAAGAA
TTAACAGCGG
GGACATTACA
GGCAATGGTA
GCAACTGGCG
GGTTTAGGCA
AGTGGCTACC
GGCTACTTAC
GCCTTTGTCG
CCACACCAAC
CGGCGGAGCC
CAACTGCTAA
GAGAAACGAT
CCGTTATTCC
AACCCAATGG
ATAGCAGTGA
ATATTGCAGC
AAGGAAATGT
AGA'FTAAAAT
TCATCAAAGG
GTGMATTGC
GACAAG CGAC
GTACGGCTAC
ACACTGGCGG
GGAATAATGG
CAACAGGTTC
ACCCTAAACT
CTAGCACAGG
CACTCCF1'GC
CATTGATTCT
AAAAACCACT
CGAAACCGCC
TACGATTATT
AGCGGATGTT
CTATACAGTC
ACAATTATTG
AGGGTTAGCA
GAGTCCTAAA
TGCGGCACCA
AACAGCTGAA
TCTAGGTGCC
TGCTCAACAA
AACCTTCATG
AAATAACGGC
GAATGGTTCA
TGGCTTAGGT
AAGTACCATC
TGAAAAAGAG
GAGCATGGGA
A'ITACCGGAA
ATTGCATGTCC
CAATAkTACGG
AGCAAAGATG
GTC''AGCGG
ACTGGAACTC
GCAAGTGGCA
ACAGAAA-AAG
ACCACATTTA
ACTG=TCTT
CCAAACCGAA
ACCdAtC;CTC AGTTTAC PT
ACGCCAGCCG
AATACAGGCG
AACACAGGTI'
TCACTAGGCA
AGTTATrGGCA
TCTCAGCCC
ATCATCAAAC
ATT'TACTAA
GTGACGCAGA
TGACTCTACC
GCGCAGGTAA
CC-AACTAT
CTGATCCAAA
ATACAACTAC
A-AACGTTAAC
TrGACGCCAGC
CAGTTTAGG
TCATTCAAALT
AAGGCAACTT
CGACAGCTAC
ACCCAACGAA
GCAATACACG
CAAATCCAAA
ATGGCCTCGG
CTGGAAA'ICA
TGACAACAAC
GCAAACGT-7AA
AGGTTACGAA
CGGAACCATC
AGCTCGCGTA
TGAAAGTCAA
TACGAAGGTT
TGTCACGGTT
TGGAGGTACC
CGCACTAACC
CCATATTACC
AACGTCTAAA
TAGTAACCGA
CGCTATCCAA
CGATGGCGCA
TCCTGGAGGA
AAACAATGCC
TGGAGTTCT
TACAAATGGT
CGGGAAAACA
TTTGTTTGGC
AAACTAG
S
S
*5 EF069-2 (SEQ ID NO:262) M KKKIVEDFNR KSQHKKWTKR KME..LAISSG LLFTSLAIPV *5*q 5* 5
S*
S IAVTSGTIS
AVFSGQKQAV
TSGALGNLTG
LAQNVSNI LQ
GSGVNELVDA
NIYPAAGTTS
FIVTVPAGEA
KGTADANATV
TTFQTPADEA
FTVTI PAGEA
VKGTAEVGTT
PATATTPADV
VTLPTGETNP
ADPNATIEIR
QTPADEVVAP
LASGKATAKQ
TT-IEVRNPDC
DATVAAPIVT
TATANEALTA
TTIVEVRDADG
DLATPTIDS I
TPGETITIIS
FYNSSEQLLA
EPEIKIAAPT
TAGQATAQQS
TGCNNGNGSN
YLPSTGEKES
ASAAVLEJIEL
LVVPPELRGN
VDI DLTEVNR
DLNAAVQALE
SLLGTTTVTL
EVTAPTITGV
GANETLTAVA
EIRNAGGTVI
TVTAPTITGV
GANETLTAVA
IEVRDAAGTV
TAPIVDNITG
GDTLTVIGKD
DADGNVIATG
PSVDKVTGNT
TVNVVAKNDT
TI IGTTT'D
TVTGTTATGY
IAKDAAGKE-S
TVLGMATTGT
TGNSSKGYEI
KDGAGNESQP
SGNTTTGGTF
VSSVLGTSKA
LLATATDGAG
TGSNPNGGSG
SAVTTSLFGA
LSNVTSNNDS
VAAAGSAAIN
QLELVNNIEN
AKGTSIPSNL
PTTVSTPQNL
TGNSTAGYEV
KNASGTEXTP
GTGTADGTGA
TGNSTAGYEV
KNASGTESTP
LGTATTGTDG
NSGSGYFITG
NAGNESQPTE
TADGTGSFAV
TQGYQVTGTA
GLESQPTTAM
QGNFTVDLPA
QVTG7TAEPNV
NPTAFKTPAD
DGKYTVTLEP
TGTAEPKTTI
ATAVI PADVV SVH IAAGLAT GYL IKGTAEP
HYSTATTFMT
LGTTGSGLGS
F'JA-LASMG I
SNNLDARFVG
KGTADANATV
TTFQTPADEA
FTVTVPAGEA
KGTADANATV
TTFQTPADPN
KY'IVTLDSCT
TADPNTTIE:V
VLVPADATVT
NLPAGTANAN
ELGITIEVRA
TPADVTTPTI
GAANPGDTLT
TIE IHNE-AGL
PDAPVATPTV
GKASANETIT
DVRDADGTI I
LAAPTITKVE
EKETLTALTT
MRIIQISNRL
PADPTNPGGG
LGNGL.TNGS
IKRKRKN
TVVQTDLLDV
EIRNAGGEVI
TVTAPTITGV
CANETLTAVA
EIRNAGGAVI
TPVATP IVET
ATAN'QTLSVV
RDPSGAVIGT
APTVTGVTGN
ETLTALAKIDP
TDGTJLGTAT
GDITGDSTTG
VVGKDGDC;NE
VIATGTTDGA
DKITGS'ITNG
VVAKNATCKE
AATTANETGQ
GNKANGYT
DTQGNVSPKT
LRSVIAVGAT
NGNTGGNNGN
GYHPKLSTIS
GTSTSNRWTA ANQNQPVNFT TNVTI!JLSKV TFLTAVLNA-A LGAASFTAPE. TLAADC-SYIS VAAAINAALL' PVX<GTVNVAV
VSGGALADAS
NDLTNVITQ I AP ISDGLGLV
SCALPLLAVG
NLLATADGVS
GTGTADGTGA
TGNSTAGYEV
KNASGTESTP
GTGTADGTGA
VTGSTTKGYE
AKNASGTESQ
GTSDANGDFT
SVAGYQVTGT
AGNTSTPTrF TGPTGQY'rVT
YEITGTADPN
sQPTEvTvPE
GAFTITLPTG
YQVVGAAEVG
SQPATATTPV
Y IVTLPAGVV
GTADPNVTVQ
TFMTPADITG
DAEGNFAIQL
TGGNTGNNGA
YGTGNHGKTG
EF069-3 (SEQ ID NO:263) 158 TABLE 1. Nucleotide and Amino Acid Seqeuenccs of E. faecalis Genes.
AGGTGAA GCAGGTGCGA ATGAAACGTT AACCCCCGTA
GCGAAAAACG
AATACGCCCG
GGTCAAAG
GTCCTTGGTA
ACAGCAACAG
CAACCAGCAA
GGCAACTCTG
GITCGTGATC
ACTGTAACC
GATAACGCGG
ACAGCACCAA
ACCCCTGATC
GGGACTGCCG
AATGAAACAT
TTCCAAACAC
ACAACACAAG
GCAACAGACG
ACGTTAGCTT
ACTGGACTTG
ATTGGTGACA
AATACCACCA
GATCAAGGAA
ACAGTTGTT6
GAAGATGCAA
CCAGCGGTAC
TGGCGACGCC
GCACTCCTGA
CTGCAACAAC
CAAATCAAAC
CCGCGACAAC
GTTCGGGTTA
CATCTGGGGC
TACCAACGGG
GAAATGAAAG
CTGTAACAGG.
CGAATGCTAC
ATGGGACTC
TGACAGCGTT
CAGCAGATGA
GATATCAAGT
GAACAG'ITT
CAGGAA.AAGC
AGAGTCAACC
TTACTGGAGA,
TTGAAGTACG
ACTTTACTGT
GAAAAGACG
CCGTAGCAGC
AGAAAGTACG
AA'TTIGAG
AOTTCGCACC
TCGAACTGAC
GCTGAGCGTT
ACCACCTGAT
TCAAATITACA
AGTCATTGGT
AACGACCAAT
TCAACCGACT
AGTAACAGGT
CATCGAAAT
TTCCTTTGCT
AGCCAAAGAT
AGTAGTGGCA
GACAGGTACC
AGGCACCGCA
AACAGCTAAA
AACTACAGCT
TTCAACAACT
GAACCCAGAT
GGACCTTCCA
TGACGGCAAT
ACCA.ACTGTG
CCAACAACGT
ACTGTAACAG
ACCATTGAGG
GGAAAATATA
GTAGCGAAAA
GTCACTGCAC
TCCAAAC
GTAGTAC
TTCGCGI
CAGTGAC
ACGCTAC
CAACAG~
GGAACAGCAG ACCCTAJ ACAGGTACCT CTGATG( CCTGGGGATA CGTTAA( CAAGTCCTTG TTCCTG( AATTCAGTTI CTGGITI CGTGATGCAG ATGGGA CTGAACCTTC- CACCT( CCTGCTGCCA ATACAA( CCGCCAAGTC TCGACA GCTGAACTTIG GCACCA( ACAACTGGAC CGACTC CAAACAGTGA ATGTAG' ATGACACCCG CTGATG GGTTATGAAA TCACTG( GGAACAATTA TTGGTA( GCGGGAGCCG CTAATC GAAAGTCAAC CAACGCG ACGACTGTTA CAGGAA GCCc
:AAC
~TGC
:TTT
,TGG
[TGA
.CAC
CAGT
CTGA
.TCA
kCGT
'AC
GTAC
k.AGT
CCAT
XCCA
F1'GC E1AC
GAC
CAAC
CTGG
AAGT
AGCGGATCCT
AAAAGGCTAT
AGCTGGCACG
AGATTCAGGA
CACGGAA.AGT
TAACATCACA
AACAATCGAA
TGGTGATT
GATTGGAAAC
TGCCACGG TT
GGTGACAGGC
GATTGCAACA
GGCAAATGCG
ACCGACA-ACC
TACTGGGAAT
TGAAGTTCGT
ATATACTGTG
TAAAAATGAT
CACACCAACA
GGCGGACCCT
GACAACGGAT
TGATACATTA
GACGGTCCCT
S S
S
*555
S
SS
EF069-4 (SEQ ID NO:264) AGEA GANETLTAVA KNASGTEXTP ?1'FQTPADEA TVTAPTITGV TGNSTAGYEV KGTADANATV EIRNAGGTVI TTFQTPADEA TVTAPTITIGV FTVTIPAGE-A GANETLTAVA VKGTAEVGTT I EVRDAAGTV PATATTPADV TAPTVDNITG VTLPTGTTNP GDTLIVIGKD ADPNATIEIR DADGNVIATG QTPADEWVAP PSVDKVTGNT LASGKATAKQ IVNVVAKNDT TTIEVRNPDG TI IGTT'ITDD DATVAAPTVT TVTGT
GTGTADGTGA
TGNSTAGYEV
KNASGTESTP
LGTATTGTEDG
NSGSGYEITG
NAGNESQPTE
TAD)G LSFAV
TQGYQVTGTA
GLESQPTTAM
QGNFT'JDLPA
FTVTVPAGEA
KGTADANATV
TTFQTPADPN
KYTVTLOSGT
TADPNTTIEV
VLVPADATVT
NLPAGTANAN
ELG'ITIEVRA
TPADVTTPTI
CAANPGDTLT
GA.NETLTAVA
EIRNAGGAVI
TPVATPIVET
ATANQTLSVV
RDPSGAVIGT
APTV'MVTGN
ETLTALAKDP
TDGTVLGTAT
GDITGOSTTG
VVCKDGDGNE
KNASGTESTP
GTGTADGTGA
VTGSTIvTKGYE
AXNASCTESQ
GTSDANGDFT
SVAGYQVTGT
AGNTSTPTTF
TGPTGQYTVT
YEITGTADPN
SQPTEVTVPE
EF070-1 (SEQ ID NO:265)
TAGCGGAAGC
ATGAAAAAGA
CGCAAGATGC
CTAAGTATAG
CTATTATCAA
GCCGCAAACC
TCCGCTGTGT
AATGTAGCTG
GTTACTTr
ATTACCAGTG
TAATGATCTT
AAATITCTTGA
TI'AATTAGC
CTGTTACCTC
ATG-TTACGTC
AAAATCAACC
LTAGTG.GACA
CAGCAGGCAG
TGACTGCCGT
GGGCGTTAGG
GGTA'TrATC
GGAT='IAAT
AATATCAAGT
TGGCACAATC
AAATAATCAC
AGTTAATITTC
AAAACAAGCG
CGCAGCAATC
TFI'AATGCA
GAA IITAACT
GTTATTTTA
CGGAAAAGTC
GGTITATTAT
AGTr.CATCAG
AGTGOCACTT
ACGGI'TCTG
GTGTTAGTGG
AATACcAATG GCCAATGATr
GGTG'ITGATA
AAGAAAAGAG
AGCATAAAAA
TTACGTCAI'
CAGCGGTCT
CAACGAGTAA
GTGGCGCTTT
TTCCTCCTGA
TCACGATTGA
TAACCAATrCT
TTGATTTGAC
GGACGATCAG
ATGGACAAAA
AGCAATCCCT
GGATATCGAA
TCGTTGGACA
AGCAGATGCT
GTTAAGAGGA
TCTTTCAAAA
GATTACTCAA
GGAAGTGAAT
159 TABLE I. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
CGTCAATTGG
GAAACGTTAG
GTTTTAGCCC
GAGGCAAAAG
CT-TCCTGTCA
GGTGGTTCAG
TTACCAACTA
GGA.ACAGTCG
TCCAACATTT
GTAACAGGTA
GTTGAAATCC
GCGTTTACAG
GCGAAAAACG
GCAACCGTAA
GTTAAAGGAA
ATAGGCACAG
CCAGGTGCCA
CCAACAACGT
GTGACAGGTA
GTTGAGATCC
GCATTTACAG
GCGAAAAACG
AATACGCCCC
GAGGTCAAAG
GTCCTTGGTA
ACAGCAACAG
CAACCAGCAA
GGCAACTCTG
GTTCGTGATC
ACTCTAACC
GATAACGCGG
ACAGCACCAA
ACCGCTGATC
GGGACTCCC
AATGAAACAT
TrrCCAAACAC
ACAACACAAG
GCAACAGACG
ACGTTAGCTT
ACTGGACTrG
ATTGGTGACA
AATACCACCA
GATCAAGGAA
ACAGTTGTTG
GAAGATGCAA
TATCAAGTAA
TTAGTTATTG
GGCACAGCAA
AGTAATCCGA
GTTGACAAAA
GGTACAACAG
ACTGATCGCA
ACTGTCGTAG
GTCGAC ETAG AAI'AGTrTAA
CAGCTGACGG
AAAATGTTTC
GTACCACTAT
AAGGCACGGT
GCGTAAATGA
CCCTTICAAC
TTCAAACAGA
AT=~GCTGC
ATTCAACAGC
GAAATGCAGG
TTACCGTTCC
CCAGCGGNAC
CCCCACCAAC
CTGCCGATGC
GTACCGCTGA
ATGAAACGTT
TCCAAACACC
ATTCAACAGC
GAAATGCAGG
TTACCATTCC
CCAGCGGTAC
TGGCGACCCC
GGACTGCT--A
CTGCAACAAC
CAAATCAAAC
CGGCGACAAC
GTTCGGGTTA
CATCTGGGGC
TACCAACGCG
GAAATGAAAG
CTGTAACAGG
CGAATGCTAC
ATGGGACTIGG
TGACAGCGT
CAGCAGATGA
GATATCAAGT
GAACAGT1
CAGGAAAAGC
AGAGTCAACC
TTACTGGAGA
TAAGTACG
ACTTTACTGT
GAAAAGACGG
CCGTAGCAGC
CCGGCACC
CTACGGGAAC
CAGCTAACGA
CTCCTTTCAA
TCACTGGTAG
TTGAGGTGCG
AATACACAG'I
CGAAAAATGc
CCACACCAAC
TAACATTGAA
CTCATACATTr
AAACATCTTA
CCCAAGTAAT
AAACGTGGCT
GTTAGTGGAT
ACCTCAAAAT
TCITTTTAGAC
AGGCACTACT
AGGTTACGAA
AGGCACCGTA
CGCAGGTGA-A
AGAAAGRACG
AATCACAGGA
CAATGCCACG
TGGGACAGGA
AACCCCTA
AGCGGATGAA
AGGTTACGAA
AGGTGCCGTC
CGCAGGTGA-A
AGAAAGTACG
AATTC1'TTAC
AGTTGGCACC
TGGAACTGAC
GCTGAGCGTT
ACCACCTGAT
TGAAATTACA
AGTCATTGGT
AACGACCAAT
TCAACCGACT
AGTAACAGGT
CATCGAAATT
TTCCTTI'CCT
AGCCAAAGAT
AGTAGTCGCA
GACAGGTACC
AGGCACCGCA
AACAGCTAAA
AACTACAGCT
TI'CAACAACT
GAACCCAGAT
GGACCTTCCA
TGACGGCAAT
ACCAACTGTG
AGAGCCAAAT
GACTGATGGT
*AGCCTTAACT
*AACACCTGCT
CACGACAAAC
TGACGCCGAT
GACTFI'AGAG
AACAGGAAAA
CATTGATTCT
AkACTTlACCTGC
AGTGCACCGA
CAAGATTTGA
CTGTCGCCG
GTTTCAGGTG
GCTTCTT'TAC
TTATCCAATA
CTTAATTTAT
AGTGAAGTAA
GTTAAAGGAA
ATAGCCACAG
G&AGGCGCCA
CCAACAACGT
GTCACAGGTA
GTTrGAAATCC
GCGTTTACAG
GCGAAAA.ACC
r-CAACCGTAA
GTTAAAGGAA
ATAGGTACAG
GCAGGTGCGA
CCAACAACGT
ACTGTAACAG
ACCATTGAGG
GGAAAATATA
GTAGCGAAAA
GTCACTrGCAC
GGAACAGCAG
ACAGGTACCT
CCTGGGGATA
GAAGTCCTTG
AATTCAGTTG
CGT'GATGCAG
GTGAACCTTC
CCTGCTGGCA
CCGCCAACTC
GCTGAAC7-rX
ACAACTGGAC
CAAACAGTG.A
ATGACACCCG
GGTATGAAA
GGAACAATTA
GCGGGACCC
GAAAGTCAAC
ACGACTGTrA
GTCACCA=G
GCTGGCG-CAI
GCCATTGCGA
GATCCACATC
GGCTATCAAC
GGCACAGTCC
CCAGGGAAGC.
GAAAGTcAC ATTACCGGA-2 CGCT1CAT
TTAGT'GATGG
ATGCGC-CAGT
CACCTATAAA
CI'TGCCTT
TAGGCACAAC
ATTTAC-ATGC
TAGCA.ACAGC
CCGCACCAkAC
CTGCCGATGC
GTACCCCTGA
ATGAAACGTT
TCCAAACNCC
ATTCAACGGC
GAAATGCACG
TTACCGTTCC
CCAGCGGCAC
CCGCACCAAC
CMCCGATGC
GTACTGCTGA
ATGAAACGTT
TCCAAACGCC
GTAGTACAAC
TTCGCGATGC
CAGTGACTTT
ACGCTAGTGG
CAACAC ?TGA
ACCCTAACAC
CTGATGCGAA
CGTTAACAGT
TICCTGCTrGA
CTGGTTATCA
ATGGGAACGT
CAGCTGGGAC
ATACAAGTAC
TCGACAAAGT
GCACCACCAT
CGACTGGCCA
ATGTAG7MTC
CTGATG'ITAC
TCACTGGGAC
TTGGTACAAC
CTAATCCTGG
CAACGGAAGT
CAGGAACAAC
AGATTCACAA
T1'ACAATCAC
AAGAT-GCTGC
CACCAGTCGC
TAGTAGGAC
*TTGGCATGGC
CCTCAGCTAA
*CAGCTACAGC
ATTCTAGTAA
TACAGCTCCG
TTAGGGTTA
TCA.AGCTTTG
TGCAGCC'ITG
ATTIAGCGGTT
CACGGI'ACT
TCGTTTTGTA
AGACGGTGTA
AATCACAGGA
CAATGCCACG
TGGGACAGGA
AACCGCCGTA
AGCGGATGAA
AGGTTACGAA
AGGCACCGTA
CGCAGGTGAA
AGAAAGTACG
AATCACAGGA
CAATGCCACG
TGGGACAGGG
AACCCCGTA
AGCGGATCCT
AAAAGGCTAT
AGCTGGCACG
AGATTCAGGA
CACGGAAAGT
TAACATCACA
AACAATCGAA
TGGTGATTTT
GAI'GGAAAG
TGCCACGG T
GGTGACAGGC
GATTGCAACA
GGCAAATGCG
ACCGACAACC
TACTCGGAAT
TGAAGTTCGT
ATATACTGTG
TAAAAATGAT
CACACCAACA
GGCGGACCCT
GACAACGGAT
TGATAcA ITA
GACGGTCCCT
TGCCACTGGG
TGAAGCAGGT
TCTTCCGACG
TGGGAAAGAA
GACACCTACT
AGCAGAAGTT
AACTACTGGA
CGAAACAATA
AACTACACCA
AGGTTACGAA
160 TABLE 1. Nucleotide and Amino Acid Seqeuences of £.faecalis Genes.
ATCACTGGA-A
ATTGCTGCTA
GTGACACCAG
CCAGCTACAG
GAAGGAAACA
CAArr'ITACA
TTCTCCGT-TC
ACAGATACAC
GGAGAACCAG
GCCGGCTACC
CTATTAAGAA
TTAACAGCGG
GGACATTACA
GGCAATGGTA
GCAACTGGCG
GGTTTAGCCA
AGTGGCTACC
GGCTACTTAC
GCCTTTGTCG
CGGCGGAGCC
CAACTGCTAA
GAGAAACGAT
CCGTTATTCC
AAGCCAATGG
ATAGCAGTGA
ATAT1'GCAGC
AAGGAAATGT
AGATI'AAAAT
TCATCAAAGG
GTGTGATTGC
GACAAGCGAC
GTACGGCTAC
ACACTGGCGG
GGAATAATGG
CAACAGCTTC
ACCCTAAACT
CTAGCACAGG
CACTCCTTGC
AAAAACCACT ATTGATGTCC
CGAAACCGGC
TACGATI'ATT
AGCGGATGTT
CTATACAGTC
ACAATTATTG
AGGGTTAGCA
GAGTCCTA-AA
TGCGGCACCA
AACAGCTGA-A
TGTAGCTGCC
TGCTCAACAA
AACCTTCATG
AAATAACGGC
GAATGGTTCA
TGGCTTAGGT
A.AGTACCATC
TGAAAAAGAG
GAGCATGGGA
CAATATACGG
AGCAAAGATG
GTTrTTAGCGG
ACTGGAACTG
GCAAGTGGCA
ACACAAAAAG
ACCACATTTA
ACTGTTTCT.P
CCAAACCGAA
ACCGATGCTG
AGT TrACTTG
ACGCCAGCCG
AATACAGGCC
AACACAGG LT
TCACTAGGCA
AGTTATCCCA
TCTTCAGCCG
ATCATCAAAC
GTGACGCAGA
TGACTCTACC
GCGCAGGTAA
CGCCAACTAT
CTGATCCAAA
ATACAACTAC
AAACGTTAAC
TGACGCCAGC
CAGTTTrTAGG
TCATTCAAAT
AAGGCAACTrr
CGACAGCTAC
ACCCAACGAA
GCAATACAGG
CAAATCCAAA
ATGGCCTCGG
CTGGAAATCA
TGACAACAAC
GCAAACGTAA
CGGAACCATC
AGCTGGCGTA
TGAAAGTCAA
TACGAAGG Pr
TGTCACGCTT
TGGAGGTACC
CGCACTAACC
CGATATTACC
AACGTCTAAA
TAGTAACCGA
CGCTATCCA
CGATGGCGCA
TCCTGGAGGA
AAACAATGGC
TGGAGGTTCT
TACAAATGGT
CGGGAAAACA
TTTGTTTGGC
AAACTAG
EF070-2 (SEQ ID NO:266) M KKKIVEDFNR KSQHKKWTKR XMLNLAISSG'LLFTSLAIPV a a
SIAVTSGTIS
AVFSGQKQAV
TSGALGNLTCG
LAQN'VSNI LQ
GSGVNELVDA
NIYFAAGTTS
FTVTVPAGEA
KGTADANATV
TTFQTPADEA
FTVTI PAGEA VKGTAEVGTr
PATATTPADV
VTLPTGTTNP
ADPNATIEIR
QTPADEVVAP
LASGKATAKQ
TTIEVRNPDG
DATIVAAPTVT
TATANE-ALTA'
TTVEVRDADG
DLATPTIDSI
TPGETITI IS
FYNSSEQLLA
EPEIKIAAPT
TAGQATAQQS
TGGNNGNGSN
YLPSTGEKES
ASAAVLDIEL
LVVPPELRGN
VD IDLTEVNR
DLNAAVQALE
SLLGTTTVTL
EVTAPTITGV
GANETLTAVA
EIRNAGGTVI
TVTAPTITCV
GANETLTAVA
IEVRDAAGTV
TAPTVDNITG
CDTLTV IGKD
DADGNVIATG
PSVDKVTGNT
TVNVVAKNDT
TI IGTTTTDD
TVTGTI'ATGY
IAKDAAGKES
TVLGMATrGT
TGNSSKGYEI
KDGACNESQP
SGN'rTTGGT'F
VSSVLGTSKA
LLATATDGAG
TGSNPNGGSG
SAVTTSLFGA
LSNVTSNNDS
VAAAGSAAI N
QLELVNNIEN
AKGTSIPSNL
PTTVSTPQNL
TGNSTAGYEV
KNASGTEXTP
GTGTADGTGA
TGNSTAGYEV
KNASGTESTP
LGTATTGTDG
NSGSGYE ITG
NAGNESQPTE
TADGTGSFAV
TQGYQVTGTA
CLESQPTTAM
QGNFTVDLPA
QVTGTAEPNV
NPTAFKTPAD
DGKYI'JTLEP
TG TAEPKTTI ATAVI PADVV
SVHIAAGLAT
GYLIKGTAEP
HYSTATI'FMT
LGTTGSGLCS
FVALLASMGI
GTSTSNRWTA
TNVTIDLSKV
LGAASFTAPE
VAAAINAALL
SNNLDARFVG
KGTADANATV
TTFQTPADE-A
PIVTVPAGEA
KGTADANATV
TTFQTPADPN
KYTVTLDSGT
TADPNTTIEV
VLVPADATVT
NLPAGTANAN
ELGTTIEVRA
TPADVTTPTI
GAANPGDTLT
TIE IHNEAGL
PDAPVATPTV
GKASANETIT
DVRDAIJCTI I
LAAPTITKVE
EKETLTALTT
NRIIQISNRL
PADPTNPGGG
LGNGLGTNGS
IKRKRKN
ANQNQPVNFT
TFLTAVLNAA
TLAADGSYIS
PVKGTVNVAV
'TVVQTDLLOV
El RNACGIVI
TVTAPTITGV
GANETLTAVA
EIRNAGGAVI
TPVATP IVET
ATANQTLSVV
RDPSGAVIGT
APTVTCVTGN
ETLTALAKDP
TDGTVLGTAT
GDITGDSTTG
VVGKDGDGNE
VIATGTDGA
DKITGSTTNG
VVAKNATGKE
AATTANETGQ
GNKANGYIVT
DTQGNVSPKT
LRSVIAVGAT
NGNTGGNNGN
GYHPKLSTIS
VSGGALADAS
NDLTNVITQI
APISDGLGLV
SGALPLLAVG
NLLATADGVS
GTGTADGTGA
TGNSTAGYEV
KNASGTESTP
GTGTADGTGA
VTGSTTKCYE
AKNASGTESQ
GTSDANGDFT
SVAGYQVTGT
ACNTSTPITF
TGPTGQYTVT
YEITGTADPN
SQPTEVTVPE
GAFTITLPTG
YQVVGAAEVG
SQPATATTPV
YTVTLPAGCW
GTADPNVTVQ
TFMTPADITG
DAEGNFAIQL
TGGNTGNNGA
YGTGN14GKTG EF070-3 (SEQ ID NO:267) CGG TCACGCCAAT GAAAGTCAAC CAACGGAAGT GACGGTCCCT 161 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes.
GAAGATGCAA
TATCAAGTAA
TTAGT'ATTC
GGCACAGCAA
AGTAATCCGA
GTTGACAAAA
GGTACAACAG
ACTGATGGCA
ACTGTCGTAG
GTCGAC'ITAG
ATCACTGGAA
ATTGCTGCTA
GTGACACCAG
CCAGCTACAG
GAAGGAAACA
CAAT=rACA
TTCTCCGTTC
ACAGATACAC
GGAGAACCAG
GCCGGCTACC
CTATTAAGAA
TTAACAGCGG
GGACATTACA
GGCAATGGTA
GCAACTGGCG
GGT'ITAGGCA
AGTGGCTACC
GGCTACT
CCGTAGCAGC
CCGGCACGGC
CTACGGGAAC
CAGCTAACGA
CTGCTTTCAA
TCACTGGTAG
TTGAGGTGCG
AATACACAGT
CGAAAAATGC
CCACACCAAC
CGGCGGAGCC
CAACTGCTAA
GAGAAACCAT
CCGTTATTCC
AAGCCAATGG
ATAGCAGTGA
ATATTGCAC
AAGGAAATCT
AGATTAAAAT
TCATCAAACG
GTGTGATTGOC
GACAAGCGAC
GTACGGCTAC
ACACTGGCGG
GGAATAATGG
CAACAGGT'C
ACCCTAAACT
ACCAACTGTG
AGAGCCAAAT
GACTGATGGT
AGCCTTAACT
AACACCTGCT
CACGACAALAC
TGACGCCCAT
GACTTTAGAG
AACAGGAAAA
CATTGATCT
AAAAACCACT
CGAAACCGGC
TACCATTAT
AGCGGATGTT
CTATACAGTC
ACAATTATTG
AGGGTTAGCA
GAGTCCTAA-A
TCCCCCACCA
AACAGCTGAA
TGTAGGTGCC
TGCTCAACAA
AACCTTCAC
AAATAACGGC
GAATGGTI'CA
TGGCTTAGGT
AAGTACCATC
ACGACTCTTA CAGGA.ACAAC GTCACCATTG AGATTCACAA GCTGGCGCAT TTACAATCAC GCCATTGCGA AAGATrGcTrc
GATCCAGATG
GGCTATCAAG
GGCACAGTCC
CCAGGGAAGG
GAAAGTCAGC
ATTACCGGAA
ATTGATGTCC
CAATATACGG
AGCAAAGATG
GTTT TAGCGG
ACTGGAACTG
GCAAGTGGCA
ACAGAAAAAG
ACCACATTTA
ACTGTTTCTT
CCAAACCr.AA
ACCGATGCTG
AGMTACTI'
ACGCCAGCCG
ALATACAGGCG
AACACAGGTr
TCACTAGGCA
AGTTATGGCA
CACCAGTCGC
TAGTAGGAGC
'ITGGCATCGC
CCTCAGCTAA
CAGCTACAGC
ATI'CTAGTAA
GTGACGCAGA
TGACTCTACC
GCGCAGGTAA
CGCCAACTAT
CTGATCCAAA
ATACAACTAC
AAACGTTAAC
TGACGCCAGC
CAGTTTTAGG
TCATTCAAAT
AAGGCAACTT
CGACAGCTAC
ACCCAACGAA
GCAATACAGG
CAAATCCAAA
ATGGCCTCGG
CTGGA.AATCA
TGCCACTGGG
TGAAGCAGGT
TCTTCCGACG
TGGGAAAGAA
GACACCTACT
AGCAGAAGTT
AACTACTGGA
CGAAACAATA
AACTACACCA
AGGTTACGAA
CGGAACCATC
AGCTGGCGTA
TGAAAGTCAA
TACCAAGGTT
TGTCACGGTT
TGGAGGTACC
CGCACTAACC
CGATATTACG
AACGTCTAAA
TAGTAACCGA
CGCTATCCAA
CGATGGCGCA
TCCTGGAGGA
AAACAATGGC
TGGAGGTTCT
TACAAATGGT
CGGGAAAACA
S.
S
55 S
S*
*S
EF70-4 (SEQ ID NO:268) DGDGNE SQPTEVTVPE DATVAAP'IVT TVTG'FrATGY TATANEALTA IAKDAACKES TTVEVRDADG TVLGMATT DLATPTIDSI TGNSSKGYEI TPGETITIIS KDGAGNESQP FYNSSEQLLA SGN'TITGGTF EPEIKIAAPT VSSVLGTSKA TAGQATAQQS LLATATDGAG TG.GNNGNGSN TGSNPNGGSG
QVTGTAEPNV
NPTAFKTPAD
DGKYTVTLEP
TGTAEPKTTI
ATAVI PADVV
SVHIAAGLAT
GYLIKGTAEP
HYSTATTFMT
LGTTGSGLGS
TIEIHNEAGL
PDAPVATPTV
GKASANETIT
DVRDADGTII
LAAPTITKVE
EKETLTALTr
NRIIQISNRL
PADPTNPGGG
LGNGLOTNCS
VIATGTTOGA
DKITGSI'NG
VVAKNATGKE
AATTANETGQ
GNKANGYTVT
DTQGNVSPKT
LRSVIAVGAT
NGNTGGNNGN
CYIIPKLSTIS
GAFTITLPT'G
YQVVGAAEVG
SQPATATTPV
Y7VTLPAGVV
GT.ADPNVTVQ
TFMTPAD ITG DArEGNFAIQL
TGGNTGNNGA
YGTGNHGKTG-
EF071-1 (SEQ ID NO:269)
TAAGTAGAAC
GTCCCGCCAT
ATGGC'ITTTT
GTGATTGGTT
CCAATCAACT
TCTGGAAkACG
CGAAATTGGT
TGGTCGGGAC
TTATCTGCAG
TAAGAAAGGA
TAAG=TAAC
TTAC'ITATTT
AGCGGAACCT
CAAATGCTTA
AAACGTAGAA
GTrrAAGCCG
GCATGCTATG
GATTCCGATG
TCCCGGCTCT
AGGACCACAC
TCTCTCATAT
CTT7CGCTGA
TGGAAGGGAA
TTTAAAAAAT
ACGGCTTNCG
GCAAGCAATG
AGmAcACA
TTTGTCGGAT
TI'GCCGAAGA
GTTATI I'A
TAATGATTCA
CTACACCAT
AATI'AAMCT
ATGAAGTGGC
cTGGTGGACG
ATACTTCT
CTTTCCTTTC
ACTTGCTrI'A
CGAAGCGGAT
T'ITACATGAA
CTATATGAAA
AGTGAAACAA
162 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes.
TTAGCTCCTIG
GCGCAAATCG
TATGTTAATT
GGAACAGGTT
ACAGATCCTT
ACAACGGGCG
CTGGCCAAAT
AATTGGCTCG
TGGCCCGTGA
ATGGCATAGT
ATGGTTAT-T
T7CNGNAAC TCAATATGGC GCAGGTTCTT AACGAATAAT GCGGCGACAT TTTGGCTCAG AACATTGGTG
CACTCATGAT
AGCGCGTIGGG
AGGTGAGACT
TGGATTACAA
GGATT~AGTAA
GGTCATTATT
TACCTAATCA
TTAAAAAAGA
CTCATTTTTC
A-AAAT"=CTG
AGCZ-CATTGG
CAGCCAGGTA
AAAAGCCTAT
TTATGCTGCC
TCTGGACGAT
GGGAGATCAT
CACNAGATTT
EF071-2 (SEQ ID NO:270) MF KKLNIQLALV IGLSLTIPMT AXAYTIEADP INFTYFPGSA SNELIVLHES GNERNLGPHS LDNEVAYMKflR NWSNAYVSYF VCSCCRVKQL APAGQIQYGA GSLANQKAYA QIELARTNNA ATFK KDYAAY VNLARDLAQN IGAOFSLDDG TG-YGIVTHDW ITXNWWGDHT OPYGYIJARCG LNKRIGTRFT TGVSXTGETC HYSAR EF071-3 (SEQ ID NO:271) G TTTAAAAAAT TAATGATTCA ACTTGCM-=A
V
GTGATTGGTT
CCAATCAACT
TCTGCAAACC
CGAAATTGGT
TTAGCTCCTG
CGCAAATCG
TATGITAATT
GGAACAGGTT
ACAGATCCTT
ACAACGGGCG
TAATTAAC
TTACTTANT
AGCGCAACCT
CAAATGCTTA
CTGGCCAAAT
AAITGGCTCG
TGGCCCGTGA
ATGGCATAGT
ATGGTTATTT
TTTCNGNAAC
GA'ITCCGATG ACGGCTTNCG
TCCCGGCTCT
AGGACCACAC
TGTCTCATAT
TCAATATGGC
AACGAATAAT
TTTGGCTCAG
CACTCATGAT
AGCGCGTGGG
AGGTOAGACT
GCAAGCAATG
AGTTTAGACA
TTTGTCCGAT
GCAGGTTCTT
GCGGCGACAT
AACATTGGTG
TGGAT1'ACAA
GGATTAGTAA
GGTCA PIA
CTTFACACCAT
AATTAATTGT
ATGAAGTGGC
CTGGTGGACG
TAGCTAATCA
TTAAAAAAGA
CTCGATrTTTC
AAAATTGGTG
AGCGCATTGG
CA'GCCAGCT
CGA.AGCGGAT
T=ACATGAA
CTATATGAAA
AGTG-AAACA-A
AAAAGCCTAT
TTATGCTGCC
TCTiGGACGAT
GGGAGATCAT
CACNAGATTI
EF071-4 (SEQ ID NO:272) F KXLMIQLALV IGLSLTIPMT AXAYTIEADP INFTYFPGSA NWSNAYVSYF VGSGGRVKQL APACQIQYGA VNLARDLAQN IGADFSLDDG TGYGIVTHDW TGVSXTGETG HYSAR
SNELIVLHES
GSLANQKAYA
ITKNWWGDHT
GNERNLGPHS
QIELARTNNA
DPYGYLARGG
LDNEVAYMKR
ATFKKDYAAY
LVKR IGTRFT EF072-1 (SEQ ID NO:273)
TAATCAATGA
TTI'CACAGC
'ITATrGTTCC
CTO-CTTCAAA
CCGTTTTATC
GAAACCGGTG
GAAGATGGAG
TATTATT'ro
GTGATTTTGC
AAAAATGAAG
CAAGGCTTTA
ATITTAGGAT
CCAGAATCAA
ACGCAAAAGC
AAAACGCACG
AAGCATTAGC
CTGATGGTCA
ATTATCGGGG
AGCTTCGTI'C
CAACAAATAG
TGGTTTCTT
'ITGAAGCGGA
CTGTI'CAAGA
AAAATGCCTA
ATCAAGGAGA
ATCAGGAA'
TTGAGGTAAA
ATGGA'ITTAC
T1'GGTTAAGT
AGAGGCATCG
ATTACCAGAA
C'I AAATGAC
TGAAGGAAAA
AAAACCGATC
TTCA'ITAGCT
AGCACCAGAA
TCCACAAGGG
TGACTTACCA
GCACATTAAC
CCGTTTGTCA
AGTGGCTGGA
GC'ITGATT
ATT-TGCGTCA
CAAGCAAGCG
CAGCAGCAAA
GTCACT1'ATC
ACGGTCCAAG
GCAGAAGATA
AGCAAAGATT
GTGGTAAAGG
CAATCGTrAA
CCACTTGAAA
TATCAGTTAA
GATAAGG-CGG
AAAACAGTTA
TCAATTAAAG
TGCTACTCGC
TTCAAGTTAC
ACACAGGGGA
AAGTCTATGA
AGGCACAGCG
AAACACAGAC
CGCAGCAACG
AAAA.AGCTAG
CGCATATTCA
AAACGGTACT
CGACTCAGAT
ATACAACGTT
CTACAGGTI'A
ACTI'ACAkA.A
TCTTTITCGGG
GTI'GCACAAA
AGAGGGA.ACG'
TGTGACGGAT
TCAATTAGCA
AATAAATGGA
AGATAAAGCC
CAACCTAGTA
TTTATATCCA
CGATAAGCAA
TCCAGCGAAT
GACACITA
CACACTGACG
CTTTGCAAAT
163 TABLE 1. Nucleotide and Aminio Acid Seqeuences of E.faecalis Genes.
CAAACAATGA
ATTAACAACG
CGTACAGGCG
GAGGCTGTTT
TATCGTTGGC
TI'TCAGTTA
AGG'ITATCTT
GAACGGACAA
CTGTGTCGTA
AAGGACAATT
GCAAGTC'rrT
TTATCGTCAA
AAAAAGAAAA
AAGGNNTTA-A
CTGAATCAAA
CGAACAGCAC
TCAAATGCGT
ACTCACGGAC
TGTCAAAGTT
AAATCAAGCG
AGCATTAGCT
AAGATGGCCA
CAGAAATTCC
CGTTACATGT
TTAGAAAAGA
AAACATACCT
GATAGTGAA.A
GGGGAATACC
AAAAAATTCA
GTACTTCTTG
TTTTACGGTG
AATCAATAA
CCGC-TGAACC
TGACT AAAAG
ATGCGAAAAT
TCAATGAAAC
CGTCTAATCA
GAAGAAATCT
GGAAAAAATT
TGACACTGCG
ACCCACA'GTT
CACCTTGCCA
AGCAAAC3-GG
AGCCZCGTGA.A
CTGCACCAAA
CTTATGCAAC
EF072-2 (SEQ 7D NO:274) MXNARWLSI CVMLLALFGF SQQALAEASQ ASVQVThHKL LFPrDGQLPEQ.QQNTGEEGTL
LQNYRGLNDV
DGVVS FSLAS
NEENAYDLPP
ES EVKVAGI<
NNEGQLVTDK
iWQKEKALAI(
RTTNSTVTCN
TYQVYDVTDP
KDSQQRDKAY
LEKTVLDKQQ
TVTTGYTLTT
HTLTKRATVR
KFTSNQAGEF
FYQLRSEGKT
LFVE.AEAPEV
GFNQGEH INY
QKHGFTLDFS
TGGKSFVYKVD
SVKGXKRWPV
VQEAQRQLAE
VKEKASNLVV
QLTTQI PANI
IKDLQNFANQ
SENAKITLPE
LLGRNLCTKR
TGATNRkPIA
ILPVQDPQGQ
LGYQEFRLSD
TMTVSYQMRL
AVF IVKNQAG
LSSESNRNSF
EDKTQTINGE
SLTHIHLYPK
KADTI'LTLLP
ERTAEPOTAI
EYLNETANGY
YGGKKFLCNE
0@ EFO72-3 (SEQ ID NO:27S)
ATTACCAGAA:
CTGCTTCAAA
CCGTTT1'ATC
GAAACCGCGM
GAAGATGGAG
TATTTAT=r
OTGATTTTGC
AAAAATGAAG
CAAGGCTTTA
ATTTTAGGAT
CCAGAATCA-A
ACGCAAAAGC
CA.AACAATGA
A ITAACAACG
CGTACAGGCG
GAGGCTGTTT
TATCCTTGGC
TTTCAGTTA
AGGTTATCTr
GAACGGACAA
CAGCAGCAAA
ATTATCGGGG
AGCTTCGTTC
CAACAAATAG
TGGTTTCTTr
TTGAAGCGGA
CTGTTCAAGA
AAAANPJCCTA
ATCAAGGAGA
ATCAGGAATT
TTGAGGTAAA
ATGGATTTAC
CTGTGTCGTA
AAGGACAATT
GCAAGTCTT
TTATCGTCAA
AAAAAGAAAA
AAGGNNTTAA
CTGAATCAAA
CGAACAGCAC
ACACAGGGGA
CTAAATGAC
TGAAGGAAAA
AAAACCGATC
'ITCATTAGCT
AGCACCAGAA
TCCACAAGGG
TGACTTACCA
GCACATTAAC
CCGITTTGTCA
AGTCCCTCGGA
GCTTGATITI
TCAAATG.CGT
ACTCACCGAC
TGTCAAAG T
AAATCAAGCG
AGCATI'AGCT
AAGATGGCCA
CAGAAAI'CC
CGTTACATGT
AGACCA.ACG
GTCACTTATC
ACGGTCCAAC
GCAGAAGATA
ACCAAAGATT
GTGGTAAAGG
CAATCGTTAA
CCACTTGAAA
TATCAGTTAA
GATAAGGCGG
AAGTCTATGA
AGGCACAGCG
AAACACAGAC
CGCAGCAACG
A.AAAAGCTAG
CGCATATTCA
A.AACGGTACT
CGACTCAGAT
ATACAACGTT
TGTGACGGAT
TCAATTAGCA
AATAAATGGA
AGATAAAGC C
CAACCTAGTA
?TTATATCCA
CGATAAGCAA
TCCAGCGAAT
GACACTTTTA
AAAACAGTI'A CTACAGGTTA CACACTGACG TCAATTA-AAG ACTTACAAAA CTTGCAAAT
TTAGAAAAGA
AAACATACCT
GATAGTGAAA~
GGGGAATACC
AAAAAATTrCA
GTACITCTTC
TTTTACGGTG
CCGCTGAACC
TGACTAAAAC
ATCCGAAAAT
TCAATGAAAC
CGTCTAATCA
GAAGAA.ATCT
GGAAAAAATT
TGACACTGCG
AGCCACAGTT
CACCTTGCCA
AGCAAACGGG
AGCCGGTGAA
CTCCACCAAA
C ITATGCAAC EF072-4 (SEQ ID NO:276) QLPEQ QQNTGEEGTL LQNYRGLNDV TYQVYDVTDP DCVVSFSLAS KDSQQRDKAY NEENAYDLPP LEKTVLDKQQ ES IEVVAGK TVTTGYTLTT NNEGQLVTDK HTLTKRATVR RWQKEKALAI( KFTSNQAGEF
RTTNSTVTC
FYQLRSEGKT
LFVEAEFAPEV
GFNQGEH INY
QKHGFTLDFS
TGGKSFVKVD
SVKGXKRWPV
VQEAQRQrAE
VKEKASNLVV
QLTTQIPANI
IKDLQNFANQ
SENAKITLPE
LLGRNLCTKR
TGATNRKP IA
ILPVQDPQGQ
LGYQEFRLSD
TM'IVSYQMRL
AVF IVKNQAG
LSS=ESNRNSF
EDKTQTINGE
SLTHIHLYPK
KADTTLTLLP
EKTAEPDTAI
EYLNETANGY
YGGKKFLCNE
164 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes.
EF073-1 (SEQ ID NO:277)
TAAATGAACA
ATTGCTGTCC
TTCGTTCTCT
ACGAAAGATA
TTCAGAAGTA
CGTTCAACTA
CTGTTTAAAG
ACACATTTAC
GTAAAAAACA
TCAAAATTAG
GTCGAAATGT
CAAGTAAACG
AATTAAATAC
CTAGAGAATA
ATCCAAAAAA
AAGAAGCCCA
CAATGAGCAC
AATTACGCC
AGTTAGTGAA
CAAATATCGT
AACAAACGTA
TTAAAAATGA
CGATCGCTAA
AAGACCAGCA
AAAATTACTG
TGGCTTCTflC
GAAAAATGCC
TTATCGTGAG
AGCCAAAAAA
GA'IIGACTTA
AGAACCTAAA
TGACTITAACA
TGAAAAATTA
TTATGAGGAA
AAGCAGCTM'
ATAA
ATTGGCTATA TTCTTTTAGG AGCCTTALATC
GCTTTI'GTGA
GCCGACAAAA
TTGGGGTTAT
CAAATCATAC
CGTAATGATA
AAGTTACACT
4GTAAACATT
GAAGAAAGCG
ATCGTTI'CCG
TCGCAAAAAG
TTCTGGTAGG
GCGATCAAAT
CTCCACAAGA
AATTGCAAGA
CTACGAAGGT
TAGCCAATCA
TAGAA.ATCGA
CACAAATCAT
ATGACTTAGA
CTGCAACTGA
CTTTTTAGTA
GCCTTACTTA
AATTGATT
AAACATGAAT
TTCTAAAGCT
CTITCTCTAT
ACAACACGAA
TGACCAATTG
CGATTTAGAT
GGAATCACCT
EF073-2 (SEQ ID NO:278) MNKLNTKLL I
KDKEAHYREL
FKEL.VKEPKK
KLVXNDYEE I GYILLGALI I GtLSPQEIDFF
LHLANHFLYT
VSDDLDDLDV
AVAREYGFFA
RSTMSTA.KKQ
HLPNIVDLTS
EMS IAKSSLS
FVILVGFLVF
IIQLQENIMR
KHLEIEQHEV
QKAATEESPQ
VLYRKKKNAA
STKLRAIDLR
YKDKQTYEKLE
VNEIDQQ
OKSDQMPYLT
NDTTKVSKAL
ESAQI IDQLS EF073-3 (SEQ ID NO:279) CT ATCGAAAAAA GAAAAATGCC GCCGACAAAA GCGATCAAkAT GCCTrA.-TTA ACGAAAGATA AAGAAGCCCA TTATCGTGAG TTGGGGTTAT CTCCACAAGA AA ITGATT TTCAGAACTA CAATGAGCAC AGCCAAAAAA CAAATCATAC AATI'GCAAGA AAACATGAAT CGF'rCAACTA AATTACGGGC GATTGACTTA CGTAATGATA CTACGAAGGT TTCTAAAGCT CTGTTTAAAG AGITAGTGAA AGAACCTAAA AAGTTACACT TAGCCAATCA CTTTCTCTAT ACACATTTAC CAAATATCGT TGACTTAACA AGTAAACAT1' TAGAAATCGA ACAACACGAA GTAAAAAACA AACAAACGTA TGAAAAATTA GAAGAAAGCG CACAAATCAT TACCAATTG TCAAAAITAG TTAAAAATGA TTATGAGGAA ATCGITrTCCG ATGACTTAGA CGATI'AGAT GTCCAAAP~T CCGATC(GCTAA ACAGCTTC TCGCAAkAAAC CTGCAACTGA GGAATCACCT CAAGTAAACG AAGACCAGCA AT EF073-4 (SEQ ID NO:280) YRKKKNAA DKSDQMPYLT KDKE-AIYREL GLSPQEIDFF FKELVKEPKX LHiLAN}IFLYT KLVKNDYEEI VSDOLDDLDV
RSTMSTAKKQ
HLPNIVDLTS
EMS IAKSSLS I IQLQENNR
KHLEIEQHEV
QKAATEE-SPQ
STKLRAIDLR
KNKQTYEKLE
VNEDQQ
NDTTKVSKAL
ESAQI IDQLS EF0'74-1 (SEQ ID NO:281)
TAAAGGAGT
ACCGTTCCAG
ATGGTAGATA
GGACGCGATG
GCCTACAATCG
TTCAAGCCT
CAAGGTCGCG
GGCGATGAAC
GT'TAGACA
CTCAAAAAAT
TTGGCT=TG
TCTCTGGCAA
GTITACAAACA
TCGTACCGGT
ATAACCAAAC
CAGTITrAT
AAGCCTTTC
TCGACTTAGA
GAAGCTAAA
GTTAACCCCT
AAAkAGTGTTG
AGGAACATCA
[TCCTTCATG
GGACACTCC
GGCACTTGGT
GAATGAAATC
GCAAITGGCG
AAAATAATTC
ACTCAAGC~r G [TGGATATT
GCATCACTAA
AAAAGCGATG
ETCCGACAAG
GGAGCAGATG
ATTCCTCAAG
ATTACTGCTG
CICTcTcCC
CTCCAGATGC
GGCATAACTG
ACCTTTCAGA
GCACGACACG
AAGTCGCACA
CACATATTCA
TGGAAACATA
GCGACAACCA
CCTTCTTTCA
TGCGGATACG
GGCCTCAAAA
AGTAAATCAA
GA'ITCCTACG
ATTAXATAGT
ATAGTCAAA
CGGCTTTGAT
AACCGTCATC
165 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes.
CCTGCTACGT
ACGATGGCAC
TCCCTAAATG
GTCTGGGTTG
TTCCTCTATN
TGAAAATAGT CAAAGAC CAT CAGAATTCCC TTATTTAAAA CTTATTATGA TTACATTGCC ATGAAGTTAT GACTTGGGTT
ATATT
TATCGAGCAC A.AGGAAAAAA TTTCATCATT CCTGGTGCCG CTTATOAAAC ATACATTACT CCACAATTAT ATAACCAAGG CCCCGACGGT GCTCAAAGCA ACGATGCTCT AAAATACGAG EF074-2 (SEQ ID NO:282) MXLKX IIPAFPLLST VAVGLWLTPT QASADAADTM VDISCKKVLV GYWHNWASKG RDGYKQGTSA SLNLSEVNQA YNVVPVSFK SOGTTRIPTF KPYNQTDTAF RQEVAQLNSQ GRAVLLALGG ADAHIQLVKG DEQAPANEII RQVTYPFDG LOIDLEQLAI TAGDNQTVIP ATLXIVKDHY RAQGKNFIIT MAPEFPYLKP GAAYETYITS LNGYYDYIAP QLYNQGGDGV WVDEVMTWVA QSNDALKYEF LYXI EF074-3 (SEQ ID NO:283) TGC TGCGGATACG
S*
S
S
ATGGTAGATA
GGACGCGATG
GCCTACAATG
TTCAAGCCTT
CAAGGTCGCG
GGCGATGAAC
GGTTTAGACA
CCTGCTACGT
ACGATGGCAC
TCCCTAAATG
GTCTGGGTTG
TTCCTCT
TCTCTGGCAA
GTTACAAACA
TCGTACCGGT
ATAACCAAAC
CAGTI'ATT
AAGCCT'ITGC
TCGACTTAGA
TGAAAATAGT
CAGAATTCCC
GTTATTATCA
ATGAAGTTAT
AAAAGTGTTG
AGGA.ACATCA
TTCCTTCATIG
GGACACTGCC
GGCACTTGGT
GAATGAAATC
GCAATTGGCG
CAAAGACCAT
TTA'ITTAAAA
TTACATTGCC
GACTTGGGTT
GTTGGATATT
GCATCACTAA
AAAAGCGATG
TCCGACAAG
GGAGCAGAM
ATTCGTCA.AG
ATTACTGCTG3
TATCCAGCAC
CCTC CCCG
CCACAATTAT
GCTCAAAGCA
GGCATAACTG GGCCTCAAAA ACCTTTCAGA AGTAAATCAA GCACGACACG GATTCCTACG AAGTCGCACA ATTAAATAGT CACATATTCA ATTAGTCAAA TGGAAACATA CGGCTTTGAT GCGACAACCA AACCGTCATC AAGGAAAAAA TTTCATCATT CTTATGAAAC ATACATTACT ATAACCAAGG CGGCGACGGT ACGATGCTCT AAAATACGAC
S.
S*SS
S
EF074-4 (SEQ ID NO:284) AADTM VDiISGKKVLV GYWHNWASKG RDGYKQGTSA SLNLSEVNQA YNVVPVSFMK SDGTTRIPTF KPYNQTDTAF RQEVAQLNSQ GRAVLLALGG ADAH IQLVKG DEQAFANEI I RQVETYGFDG LDI DLEQLAI TAG DNQTVI P ATLKIVKDHY RAQGKNFI IT MAPEFPYLKP GAAYETITS LNGYYDYIAP QLYNQGGDGV WVDEVMTWVA QSNDALKYEF LY EF075-1 (SEQ ID NO:285)
TAACCTATA-A
GGGAAGAAAA
GGAATGGGAG
GTAGAACGAT
TTATTAGGGA
ATTGTTGCAA
ACCTATGTTG
GGTGGCGCAT
TATG'TrCAA
GTGAACAATA
TICGATGGTG
TACGGCCGCC
CTI'AACAGCO
GATTT1AAGTT
GAAAAAAATC
TTTT'GCCAT
CAAAACTTTIA
CTAAAAAAAG
TTGATACAGG
CAGTI'AATCC
ATATrCCAGG CTTTrAGCAAT
TAATATGGC
ATCTGACTTT
A.ACAAGCACT
AAGAACGTCA
TAAGCAACTA
TTGATGACAT
ACAACCTGTG
TATCNTIGGA
TTGGGATGTT
TCAGGTCAAT
CGATGATGGG
TCGTGACAAG
TCAACGAAAA
GGACACAGTT
TGGTTI'AAAA
TTCTCAAGAC
CTCCTATTCA
AAGAAAAGTG
TCAAGAAATT
GAAAAAA.ATT
ATAAATTATNI
ATTATCTTGG
TCTMPATCAA
?I'AAACAATA
CGTGTCGAGC
CAAACAACCT
CAAGATAAAT
GAAAACTATT
GAATTAGTCA
GGATATGATI'
AGAATGcG'TT
ATGAAGGCA
TTAACAGcTlG GCCTTAGA T
GGAGGNAAAA
NTCTATTI'CT
TGGATAAAAC
AGGAGCCTTTr
AAGGTCGTTC
TAGTCAGTCT
TGAATCACC
TAAACATACC
ACGCGGTTGG
TTACGATI'GG
ACGAAGACCC
TCGTCCAAAA
TTTCTGATAA
ATCGCAGTGC
TATGTCAAAA
TGCAGTTGTT
CTATGAAACA
'rTCTG ITTTA
GGATACAACA
TGCTCGCGAT
CTATGCTT
TATTAATCAT
CGGAATCGAA
TAAANTCA
TAATGGTGAC
AGTCIAAGT
TATGAAGACA
CTTTGGTAAA
166 TABLE 1. Nucleotide and Armino Acid Seqeuences of E. faecalis Genes.
GTGAAACAAG ACCAACTTCA AGGTACTGGT TTTATGCAAG ATGGTGTTI'C CTATCAACGT GTGGATGAAC AAGAAI'AAC TCCTGTCCAA CAAGAGTTGA AAAATCAATT GAATACAAAA
TAA
EF075-2 (SEQ ID NO:286) MSKG KKIFAI IXG I ILXLFLAWVG MGAKLYWOVS KSMDKTYETV ERSKKSQVNL NNKEPFSVLL LGIDTCDDCR VEQGRSDTI'I VATVNPRDKQ YVDIPGQGKQ DKLNHAYAFG GASLAMDTVE NYLNIPINHY VSINMAGU(E NNNLTFSQDG YDFTIGKISL DGEQALSYSR MRYEDPNGDY GRQERQRKVI NSVSNYQEIL -iAVSDNMKTD LSFDDbM(XIA LDYRSAFGKV KQDQLQGTGF DEQELTRVQQ ELKNQLNTI(
TTLVSLARDT
LVNAVGG 1EV
EGIVQKVLSL
MQDGVSYQRV
EF075-3 (SEQ ID NO:287) ACTTTA TTGGGATGTT TCTAAATCAA TGGATAAAAC CTATGAAACA a a.
a a a a
GTAGAACGAT
TTA'ITAGGGA
ATTGTI'GCAA
ACCTATGTTG
GGTGGCGCAT
TATGTTTCAA
GTGAACAATA
TTGGATGGTG
TACGGCCGCC
CTTAACAGCG
GATTTAAGT
GTGAAACAAG
GTGGATGAAC
CTAAAAAAAG
TTGATACAGG
CAGTTAATCC
ATATTCCAGO
CTrTAGCAAT
TTAATATGGC
ATCTGACTr
AACAAGCACT
AAGAACGTCA
TAAGCAACTA
TTGATCACAT
ACCAACTTCA
AAGAATTAAC
TCAGGTCAAT
CGATGATGG
TCGTGACAAG
TCAAGGAAAA
GGACACAGT
TGGTTTAAAA
TTCTCAAGAC
CTCCTATTCA
AAGAAAAGTG
TCAAGAAAT
GAAAAAAAT
AGGTACTGGT
TCGTGTCCAA
'ITAAACAATA
CGTGTCGAGC
CAAACAACCT
CAAGATAAAT
GAAAACTATt
GAATTAGTCA
GGATATGA~TT
AGAATGCCI'r
ATTGAAGGCA
TTAACAGCTG
GCCTTAGATt
TTATGCAAG
AGGAGCCTTT
AAGGTCGTTC
TAGTCAGTCT
TGA.ATCACGC
TAAACATACC
ACGCGG'ITGG
TTACGA'FIGG
ACGAAGACCC
TCGTCCAAAA
rTTCTGATAA
ATCGCAGTGC
ATGGTGITC
TTC-TGT=rA
GGATACAACA
TGCTCGCGAT
CTATGCTTT
TATTAATCAT
CGGAATCGAA
TAAAATTCA
TAATGGTGAC
AGTCTTAAGT
TATGAAGACA
CTIGGTAAA
CTATCAACGT
CAAGAG'FTGA AAAATCAATT GAATACAAAA £F07S-4 (SEQ ID 110:288) a a a a.
KLYWDVS KSMDKTYE'IV ERSKKSQVNL NNKEPFSVLL LGIDTGDDGR YVDIPGQCKQ DKLNHAYAFG GASLAMD'IVE NNNLTFSQDG YD)FTIGKISL DGEQALSYSR NSVSNYQEIL TAVSDNMIITD LSFDDMKKIA DEQ)ELTRVQQ ELKNQLNTK
VEQGRSDTTI
NYLNIPINHY
MRYEDPNGDY
LDYRSAFGKV
VATVNPRDKQ
VS INMAGLKE
GRQERQRKVI
KQDQLQGTGF
TTLVSLARDT
LVNAVGG 1EV
EGIVQKVLSL
MQDGVSYQRV
EF076-1 (SEQ ID NO:289)
TAGAAAATAA
AGCATTGCTG
AAGGTATCTC
GGAA.ACCAAA
TTAAATGTTG
G'ITAAAGACA
AAGTTAAAAA
TAA
CAGAGGAGCT
CAG=CAAG
ATGTTTCCAA
AATTATTATC
TGGATCGTGT
ATACAGATTC
AGTGGCCTAG
GAAGGAAATG
TGTCTCTGT
TCGTTATAAA
GATTGTCGAT
GAAAGATGGC
TTTAAAAGAA
GCCATC7 rr'
AAAGCATCAA
GCAGTCATCG
GTTAAAA.AGT
GATTTATCCG
GGTTCAAAAT
CGCTTTITCA
'rITrATAAA CAAkAAATTGG
CTTCTGAAAA
TTGTAGACGA
ATGATGAATT
TAGCTGAATA
CATTTATTGA
ATAATTC=T
TATCGGTTA
AATTATTAAG
TAAATT I-AT
AGATTCTGTT
TGGCGAAAAA
AGATGCAATG
TGTITCAACA
TABLE 1. Nucleotide and Amino Acid Seqeuences of E.Jfaecalis Genes.
EF076-2 (SEQ ID NO:290) MK ASTKICIGLS IAAVASVSVA VIASEKIIKK VSHVSNRYKV KKFVDDKFDC NQKLLSTVDD LSDDELDSVL NVVDRVKDGG SKLAEYGEKV KDNTDSLKER FFTFIEDAK LKX<WPRPSFF YXNNSFVST EF076-3 (SEQ ID NO:291) CATCG CTTCTGAAAA AATTATTAAC AAGGTATCTC ATGTTCCAA TCGTITATAAA G ITAAAAAGT GGAAACCAAA AATTATTATC GATTGTCGATGA GATATCCG ITAAATGTI'G TGGATCGTCT GAAAGATGGC GCTTCAAAAT GTTAAAGACA ATACAGATTC TTTAAAAGAA CGCTT'TrCA AAGTTAAAAA AGTGGCCTAG GCCATCTTTT TTTTATAAAA
TTGTAGACGA
ATGATGAATT
TAGCTGA.ATA
CA'FrTATTGA
ATAATTCTT
TAAATTCAT
AGATTCTGTT
TGGCGAAAAA
AGATGCAATG
EF076-4 (SEQ ID NO:292) VIASEXIIKI( VSHVSNRYKV KKFVDDKFDG NQKLLSIVDD LSODELDSVL NVVDRVKDCG LKKWPRPSFF YKNNS SKLAEYGEKV KDNTDSLKER FFTFIEDAMKD EF077-1 (SEQ ID NO:293)
TAAT-GTAAAG
ACAATTATAA
GCGTATGGCA
ATI'CAAACCT
GCAACCTTAG
GGTGATTCAT
GATAACTCGC
GAGGAMA.TCA
GGCTTGGTAA
CCAATTGAAA
TTCAAAATGG
GTGAAAGAAT
CCTTTTACAC
ACACCTTTG
AITTGC=TAG
ACGATGGTCG
ACGCAAGGAC
GAATTAGTCC
ATTGTTGCTT
OTTTCTGGTG
AATTTI'GTGA
TCACGTAATG
AAAGAGACTA
GATCAAGAAT
GAATGTfl'AC
GTCATCATGG
ATTCCTATG
AAAGATGACT
GCCAAACAAT
ACCGGGATCA
ATCTTATCTG
TGAATGATGG
CAGGAGTTTT
TTATTTITAAT
TACGTGAAGG
CTGTGGGAGA
TAGAAGACTA
CACAAAAAGC
ATGTTCGCGA
AAACCGCGAC
AAAATCCTGG
TTGCTGAAAA
CTGCGCGCCG
TAGTTGCCTA
CGGAAGTCTI'
TGGCAGGGAT
AAAAATTAGC
AACTTTCTGT
GATI'GCCAGC
AT'GCCAGAAA
CTGGCGTGAA
CACAAGAGTC
GCACATATTT
TGGAAAAATT
CCGTI'GCAGA
CACAAGATAA
TAGGAGATGG
GT'CTCATGG
TAAGTAAAGT
CTGTATTAAT
TTCCGGCGCT
CTTTGCGTGC
GAGAGAAAAA
GGCATTATTA
AACAGGTTCT
AAAATATGGT
ATACTGGGCC
TGCCGCTGGA
TCATCGCTTG
TGAATTAGTA
ATCAACAGTC
GGATGAATTA
AACTGTAGCA
TCCAGCTCAT
CCTAATTGCA
AGTTGTTGCT
GGGTCGTTCA
TTCTGCAAAA
TGATCAAGTC
AAGCGTGG;A
GCAAGATGTC
GGCA'T7TG
TCAAGAAACT
AGGCCGAATT
ACACCAATI'A
AACCATTGCT
ATTAACTATT
TGTAAATGAT
AGCTACTGCG
CAGCCAACCG
CGGAATTTT
AATCGCGGCT
TCGTCGAATT
GAGATGAAGC
TTTGAATTTA
GTAATGGCGT
GTCGATATTT
AGT'rTGAT GA
AAAGCTAACC
AATGGCGAAA
GTAAAACCAG
GATGAATCTT
ATGTCGGGTT
GACAGTCAAT
TTTGTACGTT
GGTGTI'CTT
TCGCCGTGTC
AGTCGTCATG
ACGATTGCGT
CAACCAATCA
CAAGAATCAA
CCATTAAAAA
GATGGTGCTG
GAAAAAA'TG
ACTITTACAG
CATCTTCAAC
GCAGAAGTAG
CTAAAAGAAT
GCACCTI'CGC
GCTAGTGAAA
GTCA.ATTG
ATCTGCGT
ATGCTACAAG
GGCCAGTAA
ATGTAACAAA.
'ITTTACATCA
TAATGATGTI'
TAGCGAT1'AC ~TTTrAATI'AT
AAGAGCTGAA
AT7TAGAAGA
GGGAACTAGT
CA'FIAACAGG
CCGTGAATGG
ATCAAACAAT
TAGCAGATCG
GGI=GTTC
CTTTAAITCT
GGGTCGTTAT
TTCATAAAAC
ATGCTGGAAT
CTCATATITT
ATATTACAGA
AGATACGGGT
ATAAAACGAC
ACACTGTACG
GAATITI'AAT
GAATTACCGA
TGCCTAAAGA
TTGCTGCTGC
CTGCTGACGT
CCCAAGATAC
TACTAAGTT
AAGTCGTGGA
ATTrGGGGATT
CCCGAATTG
CTGGGAAATC
CGCTATCGTT
GTITGACTGGT
CTCATTA'ITG
TGTTTCTGTT
TCCAGTI'GAT
AGAATCAAAA
TGACGGCTCT
TGTGAACTTA
CTATCCGGTA
AAAAAGTCCG
ATCTGCCCCA
TAAATCGGGA
AGGCACGATT
AACTGCTGCT
AGCTAGATCA
TCTAGCGGAA
AGGTAAAA-AG
TATTCATATT
CCCAGAAGCA
GCTGACGGGG
AGTACATGCG
AAATCATCCA
AGACGTAGGT
TGTTATTTTA
CATGAAAATT
AATIGCTAGT
CACTGTGTCA
168 TABLE 1. Nucleotidc and Amino Acid Seqeuences of E.f[aecalis Genes.
EF077-2 (SEQ ID NO:294)
MKHVTKLGIT
QTLREGKYGV
NS PQKAHRLN
IEKNPGDELM
FTLVAYL lAG
MVEKLASAKT
VAYARKQDVP
RNGTYLGRIT
CLPQDKLTIL
ODLSKVSQAJ
LSALRARRIG
IITGVLALLF
DILAITAIVA
GENLEDVSVE
SGSVNGDGSL
VAWFVSKSPT
IAFDKTGTIT
LKNITOLAEV
FTDTVRPEA-K
KELPKENH-PV
EIAQDTKA
EFILHQPNWA
TLAVGEYWAS
EINVGDELVV
KI4VAEKTVAD
RFAEIVAS
QGQLSVDQVQ
SGAGVKAFVD
YGIILITGSV
LMILIMLTGG
KPCELVPVDG
SQYQTIVWN
PCPLILSAPI
PINAGITAAE
GAEIRVGKKN
MALMMFXEMI
DSLEDYAAGK
LVKTCTSTVD
KESAARPAHF
ALVAGMGRSS
LVGLAASVEQ
FVTQESQETE
QESVAETIAA
AMGARGATAA
GI IPALIGAM
ANQELKSLLD
ES SLTGESK(P
VRLADRYAVP
RHGVVIKSGT
ESSHILARS I
KIDKTTIHIS
EVGITEVHGE
SETADVVILK
LQEVVDTVS I ETMEKLjHQLHLQRILMLTGD IMVGDGVNDA PSLAA.ADIC I KQSVLIGIFI CVLLMLIAST EF077-3 (SEQ ID NO:295) TCA GCCGAATTGG
S
S.
SW
S
We 55
S
S@
0* S
S
5 0
GCGTATGGCA
ATTCAAACCT
GCAACCTTAG
GGTGATTCAT
GATAACTCC
GAGGAAATCA
GGCTTGGTAA
CCAATTGAAA
TTGAAAATGG
GTGAAAGAAT
CC'=TTACAC
ACACGTTTTG
ArI'GCTTTAG
ACCATGGTCG
ACGCAAGGAC
GAATTAGTGG
ATTCTTGCTI'
G=TCTGGTG
AATITTrGTGA
TCACGTAATG
AAAGAGACTA
GATCAAGAAT
GAATGTIAC
GTCATCAM1G A ITGCTATGG
AAAGATGACT
GCCAAACAAT
ACCGGGATCA
ATCTTATCTG
TI'ATTTTAAT AACAGGTTCT GTAATGGCGT TACGTGAAGG AAAATATGGT GTCGATATTT CTGTGGGAGA ATACTGGGCC AGTTTGATCA TAGAAGACTA TGCCGCTGGA AAAGCTAACC CACAAAAAGC TCATCGCTTG AATGGCGAA.A ATGTTGGCGA TGAATI'AGTA GTAAA.ACCAG AA.ACCGGGAC ATCAACAGTC GATGAATCTT AAAATCCTGG GGATGAATTA ATCGTCGGGTT TTGCTGAAAA AACTGTAGCA GACAGTCAAT CTGCGCCGCG TCCAGCTCAT TTTGTACdTT TAGTTGCCTA CCTAATTSCA GGTGTTGCIT CGGAAGTCTT AGTTGTTGCT TCGCCGTGTC TGGCAGGGAT GGGTCGTI'CA AGTCGTCATG AAAAATITACC TICTGCAAAA ACGA ETGCGT AACTITCTGT TGATCAAGTC CAACCAATCA GATTGGCAGC AAGCGTGGAA CAAGAATCAA ATGCCAGAAA GCAAGATGTC CCATrAAAAA CTGGCGTGAA GGCA7=TTG GATGGTGCTG CACAAGAGTC TCAAGAAACT GAAAAAATTG CCACATATTT AGGCCGAATI ACTTTTACAG TGGAAAAATT ACACCAATTA CATCTTCAAC CCG ITGCAGA AACGATTGCT GCAGAAGTAG CACAAGATAA ATTAACTATr CTAAAAGMAT TAGGAGATG3G TGTAAATGAT GCACCTrCC GTGCTCATGG AGCTACTGCG GCTAGTGAAA TAAGTAAAGT CAGCCAAGCG GTCGAAATTG CTGTATTAAT CGGAATTTI ATCTCCTT 'ITCCGGCGCT AATCGGGGCT ATGCTACAAG CT'FrGCGTGC TCGTCGAAIT GCCC TAATGATGTT CTGGGAAATG TAGCGATTAC CGCTATCGTT TTTTAATTAT GTTGACTGGT AAGAGCTGAA GTCATTATTG ATI'TAGAAGA TGTTTCTGTT GGGAACTAGT TCCAGTTGAT CATTAACAGG AGAATCAAAA CCGTGAATrCG TGACGGCTCT ATCAAACAAT TGTGAACITA TAGCAGATCG CTATGCGGTA GTrTG TrTC AAAAAGTCCG CTTTATTCT ATCTGCCCCA GGGTCGTTAT TAAATCGGGA TTGATAAA.AC AGGCACAT ATGCTGGAAT AACTGCTGCT GTCATAITI= AGCTAGATCA ATAM~CAGA TCTAGCGGAA AGATACGGGT AGGTAAA.AAG ATAAAACGAC TA'ITCATT ACACTGTACG CCCAGAAGCA GAA=I'AAT GCTGACGCGrG GAATTACCGA AGTACATGGG TGCCTAAAGA AAATCATCCA 'ITGCTGCTGC AGACGTAGT CTG-CTGACGT TGTTATTTTA CCCAAGATAC CATGAAAATT TACTAATrTI' AATTGC TAC-T AAGTCGTGGA CACTGTGTCA see* .00.
*0e EF077-4 (SEQ ID NO:296) QPNWA YGIILITCGSV MALMMFWEMI QTLREGKYGV DILAITAIVA TLAVGEYWAS LMILIMLTGG DSLEDYAAGK ANQELKSLLD NSPQKAHRLN GENLEDVSVE EINVGDELVV KPGELVPVDG LVKTGTSTVD ESSLTGESKP I EKNPGDELM SGSVNGDGSL KMVAEKTVAD SQYQTIVNLV KESAARPAMF VRLADRYAVP 169 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genrs.
FTLVAYLIAG
MVEKLASAKT
VAYARKQDVP
RNGTYLGRIT
CLPQDKLTIL
DDLSKVSQAV
LSALRARRIG
VAWFVSKS PT
IAFDKTGTIT
LKNITDLAEV
FTD'IVRPEA(
KELPKENHPV
EIAQDTflIA
RFAEVLVVAS
QGQLSVDQVQ
SGAGVYAFVD
ETMEKLHQLH
IMVGDGVNDA
KQSVLIGIFI
PCPLILSAPI
P INAG ITAAE GAE IRVGKXN
LQRILMLTGD
PSLAAADVG I CVLLKL lAST
ALVAGMGRSS
LVGLAASVEQ
FVTQESQETE
QESVAETIAA
AMGAH'GATA.A
GIl PALIGAM
RHGVVIKSGT
ESSH ILARS I K IDKTT I HIS
EVGITEVHGE
SETADVV ILK LQEVVDTVS I EF079-1 (SEQ ID NO:297) TAATTTCTAG CATCACCGAA
CCCAGGCTCT
ATCATTGATG
TTTGTTIAGCG
AAAGCAAGCC
AACCAAGAAT
AAA.ACAACGA
ATI'CCAAAAA
AAAGGAAGCT
GTCATTTCAG
AAAAAAGGCG
CAAATAAAAA
CTCGTCACTT
GGACATCGTA
CAACAAAATT
ITCATTATCT
CATGCTTTAT
GTTTTATGAT
ATGCATTAAA
AAGAAAACAC
TAGCGAAAAA
AAAAACCAGA
TAA.ATGTCCG
CCTTGTTAGA
GCCATCGTGG
ATGAATTTTA
CCGTTGAACC
TATTAACTTG
TCCCATATCA
TACTATTATG
GGTACAAGCG
GAAATTTTTA
TTTAAGGAG
TCTF1'TACTG
TAACTATCTG
CAAAGAAATG
AGGCAGCAAT
CAAATCCTAT
TTTACCAA ET
AGGAACCTCC
TCTCCCTCAA
TATCGAAGTC
AACTGATACA
CACACCGTAT
ACCAGAAAAA
GACATTACTI
ACGGAAAAAG
GAAAAACAAA
GAAGCAATGA
ATTATTGGAA
GATCAACAAA
GCTGAACTTC
CCTGGATTAG
TTTGAAAGTC
T1'GATAAAA
TATCCTACAG
GCCAAATTAT
AATGGGA.AGA
AAAGATTTAC
ATGATAAACA
GCAGCAG;CGG
TTAATTGCCT
ACGACCAGAA
GAGCCT.GGGC CAATCACTGT AGTCAAAAAA GAAACGTCGT TAGGTGCATT TGCGTATCCT TI'ATCGCTCA ITATCAAGCA AAGAAAAAAT GGAAAAGAAA ATCCTTrTC TGAAACGCA ATACGATTGG TGTTTTAACC CGAkATGCATT GCTATTGGAA GTGGTACGAA TACACATGCG TTACAGATTi GCCAGAATTA CGCTTCCTTA TCAAGTAGAT ACAI'GAGTC TGGCCAAGAT GTCATCGGTTI ATTAGTTCGA GGATGAAAA AGTGGCACAA GTGCGTTAkAT TATTAGCGGC AACCAAAGTA G 0O 0 0S
S@
S 0
S.
0S S S S0 56 0 000 S
S.
0 0
S
S
*0SSeS 0 5*50 0 EF079-2 (SEQ ID NO:298) MKSKXRRI IDGFMI LLI IGIGAFAYPF VSDALNUYLD QQIIAHYQAK ASQENTKEMA ITKKPDKSYF ESHTIGVLTI PKINVRLPIF ISGHRGLPQA KLFTDLPELK KGDEMYIEVN VTIJLTCTPYM INSHRLLVRG HRIPYQPEKA IIWYKRRKKT TRKPK
ELQEIKEKKN
DKTNALLLEK
GKTLAYQVDQ
AAGMKKVAQQ
QELAKKGSNP
GSSLLEGTSY
IKTVEPTDTK
QNLLLWTLLL
GLDPFSETQK
PTGGTNTHAV
DLHIESGQDL
IACALIISGF
EF079-3 (SEQ ID NO:299) @55.
0 0@ 0 00
S.
TCCT
TI'TGTTAGCG
AAAGCAAGCC
AACCAAGAAT
AAAACAACGA
ATTCCAAAAA
AAAGGAAGCT
GTCATTTCAG
AAAAAAGGCG
CAAATAAAAA
CTCGTCACTT
GGACATCGTA
CAACAAAArr TTCATrrATCT
ATGCATI'AAA
AAGAAAACAC
TAGCGAAAAA
AAAAACCAGA
TAAATGTCCG
CCTTGTTAGA
CCCATCGTGG
ATGAATTTTA
CCGTTGAACC
TATTA.ACTTG
TCCCATATCA
TACTA'FrATG GGTACAkAGCG
TAACTATCTG
CAAAGAAATG
AGGCAGCAAT
CAAATCCTAT
TTACCAATT
AGGAACCTCC
TCTCCCTCAA
TATCGAAGTC
AACTGATACA
CACACCGTAT
ACCAGAAAAA
GACATI'ACTT
ACGGAAAAAG
GATCAACAAA
GCTGAACTI'C
CCTGGATTAG
TTTGAAAGTC
TTTGATAAAA
TATCCTACAG
GCCAAATTAT
AATGGGAAGA
AAAGATTrTAC
ATGATAAACA
GCAGCAGCGG
TTAATTGCCT
ACGACCAGAA
TI'ATCGCTCA
AAGAAAAAAT
ATCCTTI'TC
ATACGATTG
CGAATGCATT
GTGGTACGAA
TTACAGATTT
CGCILMCTTA
ACATTGAGTC
GTCATCGTT
GGATGAAAAA
GTGCGTTAAT
AACCAA
TTATCAAGCA
GGAAAAGAAA
TGAAACGCAA
TGTTI'TAACC
GCTATTGGAA
TACACATGCG
GCCAGAATTA
TCAAGTAGAT
T'GGCCAAGAT
ATTAGTTCGA
AGTGGCACAA
TA'ITAGCGGC
EF079-4 (SEQ ID NO:300) 1 170 TABLE 1. Nucleofide and Amino Acid Seqeuences of E. faecalis Genes.
VSDALNNYLD
TTKKPDKSYF
I SGH-RGLPQA
VTLLTCTPYM
IIWYXRRKT
QQIIAHYQAK ASQENTKEMA
ELQEK.\EKKN
ESHTIGVLTI PKINVRLPIF
DKTNALLIJEK
KLFTDLPELK KGDEFYIEVN
GKTLAYQVDQ
INSHRLLVRG HRIPYQPEKA
AAGM(KVAQQ
TRKP
QELAKXGSNP
GSSLLEGTSY
IKTVEPTDTK
QNLLLWTLLL
GLDPFSETQ<
PTGGTNTHAV
DLH IESGQDL IACALI ISGF EF080-1 (SEQ ID NO:301)
TAGTTACACT
TTTATTAGGA
CTI'ACC E1G
CAATTACTGA
AAAACAAAAG
GA.ATATGCAG
CTCATTGATA
ATGACAGATC,
AATTATTTCG
GTTCCTIGGGG
ATTACCCCGC
TTAGGCATTA
GAAAAAACAA
ACCTCGTA
ACCCACTCAA
CGTTTAGGGC
GGTTATTTCC
GCCTTIGCCCT
CACCTGATCA
CCTCTGTCTT
ATCATTATCT
TCGACTTACG
CACGAA'ITGA
CGGCTGCTCA
GCCACTATCG
TGGAAATGGT
ATATTTCTAA
CTTTAAACTT
TTCCTAAAAA
CTGGCGGCGG
TAGCAACGTT-AGGCATT
TTC
TATGAAAAAA
ACCCG ET1TCG
AATCAACCAA
TATTGTAACC
TTAAATAAA
GAAAATCTAC
TGATACCTTA
AACTTGTT
TGTGGACAGC
AATTGCT'I=
ATATCAATTA
AACCTCCCGC
CAATGGCGGC
CACATTCGGT
GCGG.CTGAAA
CTAAAGCAAG
ACAAATAATA
GT~rGGCAACG
ATCTCTACTT
GATAAAATAT
CAGGAAACTC
GAAACAGGTA
OCTGCTGCGC
AAATTTCAA
ACAGATCAGT
AGTGGCGGAA
GGCGGCGGTC
GCTGGACTCT
CTA=TTTT
ATTCAATTGA
AdATACAACC
ATACCTATGG
ACCAAAATGC
CT'GGAAACAT
GGGATAATAT
AAGCATTTGT
AAATCACTCG
TGATACTCAG
GTTATCAATA
TAACCAACTC
TGCCCGGTGG
GAAGTTTTTA
TAGCACTCTT
CCTAATACTT
TGATGGCGCA
TTTAGAAGAA
CGATGAACAA
GATTCT1TTTT
GATTGATTAT
GAGTCAAGGA
TAATAAAGGG
TATAA-AGTC
T1'TGGTCTTC
TCCCTTTAGG
?rI'CATCACT
TGTAGCACC
G
EF080-2 (SEQ ID NO:302) MKKR LLPIFFLILL TFGLALPVSA AENSIDDGAQ LLTPDQINQL KQEIQPLEEI( TXASVFIVTT
NNNTYGDEQE
IDMDLRKIYI STSGNMIDYM TDARIDDTLD
KIWDNMSQGN
PGGHYRVDSE TGKITRYKVI TPLEMVIAFA
AALILSLVFL
KTTLNLTSRT DQLTNSFI'IT RRIPKNNGGS GGMGGGGST
YADHYLLNKV
YFAAAQTFVQ
GINI SKYQLK HiSTGGGTFGG
GKDQNAILFL
ETQAFVNKGV
FSSYQYPFRE
GRSF
EFO80-3 (SEQ ID NO:3 03) GGCTGAAA ATTCAATI'GA
TGATGGCGCA
CAA'ITACTGA CACCTGATCA AATCAACCAA AAAACAAAAG CCTCTGTCTT1 TATTGTAACC GAATATGCAG ATCATTATCT TTTAAATAAA CTCATTGATA TGGACT1'ACG- GAAAATCTAC ATGACAGATG CACGAA'TTGA TGATACCTTA AATTATITCG CGGCTGCTCA AACCT1'IGTT GTCCTGGGG GGCACTATCG TGTGGACAGC ATI'ACCCCGC TGGAAATGGT AATGCTIT TrTAGGCATTA ATATTTCTAA
ATATCAATTA
GAAAAAACAA cTTTAMC AACCTCCCGC ACGCGTCGTA rPTCCTAAAAA CAATGGCGG-C ACCCACTCAA CTCGCGGCGG CACAT1'CGGT
CTAAAGCAAG
ACAAATAATA
GTTGGCA.AGG
ATCTCTACT
GATAAAATAT
CAGGAAACTC
GAAACAGGTA
GCTGCTGCGC
AAATTCAA
ACAGATCAGT
AGTGCGGAA
GGCGGCGGTC
AGATACAACC
ATACCTATGG
ACCAAAATGc
CTGGAAACAT
GGGATAATAT
AAGCATTTGT
AA.ATCACTCG
TGATACTCAG
GTTATCAATA
TAACCAACTC
TGGGCGGT'GG
GAAGT
T1'TAGAAGAA
CGATGAACAA
GA LTCTT'ITTT
GATTCGATTAT
GAGTCAAGGA
TAATAAAGG
TI'ATAAAGTC
TTTGGTCTTC
TCCCTTI'AGG
TTT.LCATCACT
TGGTAGCACC
EF080-4 (SEQ ID NO:304)
AENSIDOGAQ
LLTPDQINQL KQEIQPLEEX TKASVFIVTT NNNTYGDEQE YADHYLLNKV GKD)QNAILFL TA13LE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
IDMDLRKIYI
PGGHYRVDSE
KTTLNLTSR-T
STSGNMIDYM
TGKITRYKVI
DQLTNSFITT
TDARIDDTLD
TPLEMVIAFA
RRI PKNNGGS
KIWDNMSQGN
AALILSLVFL
GGMGGGGSTT
YFAAAQ77FVQ
GINISKYQLK
HSTGGGTFGG
ETQAFVNKGV
FSSYQYPFRE
GGRS
EF081-1 (SEQ ID NO:305)
TGAATGGAAC
TCTGCCTTG
GCCAAACAAG
AAACAGGAAT
TATGATACTA
GGAGTCAAAG
TATAAATTAT
ATCGCCAAAA
ATGTCAGGAA
GATCGTAATG
GGCAAACTCG
GGCGTTTCI
TTTGTACCAA
AAAGATACAC
GAAGCAATCG
ITCTAATCGC
AAT1'AGCCGA TT13ATAAGTT
AATCTGTTGT
CTAGTGATGT
CGATGAGCAC
AAGGCGATAC
ATGATAAAAT
GTAGTGGGCT
GTTCTGGCGC
TTGATGAAAT
TCACAGTCGC
AGTCACGTI'A
TAATAAAAAA
TGCTGCCGGT
AGCGAAGAAA
ACCGTCCGTT
TGAAAAATAC
TCAAGTCAAA
rCCTTAGGC
CTACCAAATC
TTCGATTCAA
AGCAATTAAC
AGAAAAAACA
CAATCAAAAG
TTCTGAACCA
TTATCCGCTG
TCTTCAAAA.A
GGCGGGTAT
ACAGCTACTA
GTTCAAGAAG
C-AAGCAATTT
AAGGCGAAAG
GAAATGAAAG
GCTTGGA.AGC
GTAGATAATG
AAAGTGTTrG
GCCAATATCA
TTAAGCCAAG
GTGACAGAAT
GGGGAAGCAN
AACCACTTAT
ATGCTLTATAG
CATTTTTAAA
CTAGC'ITAAA
ATCAGGGAT
ACAATCAATA
ATTTGTCTTA
CATCTtTAAT
CCAAACGTGG
ACCAAGTGGG
AAGCTTTTAG
GATGGG'TCCA
TACCAACAGG
TGCGCAATTA
TCTTGGTGTT
TCAATGGCAA
CGTATTGTCA
GAAAAATI-GC
TCAACCAGAA
CACATTTACC
TCAATCAAGT
FTTCCAGAT
AGAAATTGTC
CGTAGTGCCT
TGATAAATTC
AGCAGACTCC
GGCTGCGACA
EF081-2 (SEQ ID NO:306)
MERSNRNKKS
QEFDKLPSVV
KLSMSTPLGE
RNCSCLAINK
VP ITVASEPV
SKKPLILGVS
QEASLKKNGY
MXDLSYQSS I
VFDEVGVVPG
TELPTGAATK
AILVL IAAAGG
DTKSVVEKYQ
AKDTYQ IA
KLGSGAEKTA
DTESRYYPLG
GYYAYSQWQA
AIYSGIQAEG
WKPSL IFPDM
NIKAFSDKFG
EAXRN
KQELAF.AKKT
VKASDVQVKK
SGNDKIS IQV
VSVDEINQKL
ATTFLNVLSK
AKDNQYTFTY
DNAKRGEIVD
SQGWVQADSF
0 .0 0000 0 00.
EFO81-3 (SEQ ID NO:307) T GGCGGGTATI' ATGCTTATAG TCAATGCAA
GCCAAACAAG
AAACAGGAAT
TATGATACTA
GGAGTCAAAC
TATAAATI'AT
ATCGCCAAAA
ATGTCAGGAA
GATCGTAATG
GGCAAACTCG
GGCGITTCTG
TTGTACCAA
AAAGATACAG
AATTAGCCGA
TI'GATAAGTT
AATCTGTTGT
CTAGTGATGT
CGATGAGCAC
AAGGCGATAC
ATGATAAAAT
GTAGTGGGCT
GTTCTGGCGC
G-ATGAAAT
TCACAGTCGC
AGTCACGTTA
AGCGAAGAAA
ACCGTCCGTT
TGAAAAATAC
TCAAGTCAAA
GCCTTTAGGC
CTACCAAATC
TT-CGATTCA-A
AGcAATI'AAC
AGAAAAAACA
CAATCAAAAG
TTCTGAACCA
TTATCCGCTG
ACAGCTACTA
GTTCAAGAAG
CAAGCAATT
AAGGCGAAAG
GAA.ATGAAAG
GCTI'CGAACC
GTAGATAATG
AA.AGTGTMT
GCCAATATCA
TTAACCCAAG
GTGACAGAAT
GGGG
cAT'TTIAAA CGTATTGTCA CTAGCTTAAA GAAAAATGC ATTCAGGGAT TCAAGCAGAA ACAATCAATA CACAVITACC ATTT.-TCTTA TCAATCAAGT CATCITI'AAT TTTTCCAGAT CCAAACGTGG AGAAATTGTC ACGAAGTGGG
CGTAGTGCCT
AAGCT TTAG TCATAAATI'C GATCGGTCCA AGCAGACTCC TACCAACAGG
GGCTGCGACA
EFO81-4 (SEQ ID NO:308) G GYYAYSQWQA KQELAEAXXT ATTFLNVLSK QEFDKLPSVV QE-ASLKKNGY DTKSWVEXYQ AIYSGIQAEG vKASDVQVKK AKDNQYTFTY KLSMSTPLGE MKDLSYQSSI AKXGDTYQIA WKPSLIFPDM SGNDKISIQV DNAKRGEIVD RNGSGLAINK VFDEVGVVPG KLCSGAEKTA N~IKAFSDKFG VSVDEINQKL SQGWVQADSF VPITVASEPV TELPTGAATK DTESRYYPLG 172 TABLE 1. Nucleotidc and Amino Acid Seqeuences of E. faecalis Genes.
EF082-1 (SEQ ID NO:309)
TAAAAAATGA
CTT'AAATA
ATLTACGTTITG
CCTG.TGACAC
GACTATG~
TCGGCACAAC
GTTCAAGTAA
GCGCAATTTG
TAGCAG'CAA
ACCTTGGATC
ATGGGCACTT
ACTGTTCCTG
ATT'TCGATG
AAAAGATCGT
GTTCAAAAGT
CTCTTGATAA
CTAATCCTGC
CAAATATCCA
TGGATCAAGT
CTCACAAACG
CTACAAGTGA
CACCCAATI'C
CAACTGCGC
GGACATTAGC
CGACAACGAA
ATACACCACT
GCGCATTTCA
TGAAGCAGCT
TACTGTCACG
TGATCCTCAT
TT1TTGGATCA
GCAAAATAGT
TGGTCTAAAT
TTCAACACCC
A.ACACAGTT-A
CGCCAC7I'CT
TFTTGTAGC
AAAAGTTGCA
'ITAA
AGCATTTTGT
CAAGTCGCI1T
CCACCTGTCA
CAACCTGGTA
AAACAAATTC
ACTCGCCATT
CTTGGCTGGA
GCTGTTTTGG
CmCTTTGG CC Tr.TrGCGA
GGANCGACCG
GCTAAACAAT
TCGTTGCTAC
CTATTCAATC
ACCCGACGAA
CAGCCGGACC
AAGCCGGAAC
TAA ETIAGCGT
AATTATCAGT
ATAATGCATC
CGCCA'ITAAC
CTGCCGCTCT
CTqCTCAAGG AtkAAACAAC
GCCTCTTATG
CAACGCTGAT
CCCTTCTCAG
CCTTAGTAT'
AGCGATCTAT
GCCAAACTAT
TAAACAGAGT
C ETGACCT
GGTCCCAGTA
'TCAACAGGA
CATTCAATTA
GCTTACTTGG
EF082-2 (SEQ ID NO:310) M4KKIVRISS ILFVATPULL LNSSKVEAAQ VASIQSNADI TFALDNTVTP PVNPTNPSQP VTPNPADPHQ PGTAGPLSID YVSNIHFGSK QIQAGTAIYS AQLDQVQNST GDLISVPNYV QVTDKRGL 'NL GWKLSVKQSA QFATSDSTPA VLDNASLTFL AATPNSTQLL SLAPLTVPVT LDPTGAATSP VATAALSTGM GTWTL.AFGSG XTAAQGIQLT VPATTKKVAA KQYKTTLTWI
LDDTPL
EF082-3 (SEQ ID NO:311) AGCT CAAGTCGCTT CTATTCAATC CAACGCTGAT 0 0 0.
0 00* 0* 0.
000 t
ATTACGTTTG
CCMTGACAC
GACTATGTTT
TCGGCACAAC
GTTCAAGTAA
GCGCAATTTG
TTAGCAGCAA
ACCTTGGATC
ATGGGCACTT
ACTGTITCCTG
CTCTI'GATAA
CTAATCCTGC
CAAATATCCA
TGGATCAAGT
CTGAZ:XA.ACG
CTACAAGTGA
CACCCAATTC
CAACTGGTGC
GGACAITAGC
CGACAACGAA
TACTCTCACG
TGATCCTCAT
=FTGGATCA
GCAAAATAGT
TGGTCTAAAT
TTCAACACCC
AACACAGTTA
CGCCACTTCT
TTIrTGGTAGC
AAAAGTITGCA
CCACCTGTCA
CAACCTGGTA
AAACAAATTC
ACTCGCGATT
CTTGGCTGGA
GCTGTTTTGG
CTTTCTTG
CCTGTGGCGA
GGANCGACCG
GCTAAACAAT
ACCCGACGAA
CAGCCGGACC
AAGCCGGAAC
TAATTAGCGT
AATTATCAGT
ATAATGCATC
CGCCATTAAC
CTGCCGCTCT
CTGCTCAAGG
ATAAA.ACAAC
CCCTI'CTCAG
CCTTAGTATT
AGCGATCTAT
GCCAAACTAT
TAAACAGAGT
CTTGACCTTT
GGTCCCAGTA
TTCAACAGGA
CATTCAA'ITA
GCTTACTTGG
ATT =GATG ATACACCACT E'.OB02-4 (SEQ ID NO:312) AQ VASIQSNADI TFALDNTVTP PVNPTNPSQP VTPNPADPHQ PGTAGPLSID X'VSNIHFGS( QIQAGTAIYS AQLDQVQNST GDLISVPNY-V QVTDKRGLNL GWKLSVKQSA QFATSDSTPA VLDNASLTFL AATPNSTQLL SLAPLTVPVT LDPTGAATS P VATAALSTCN GTWTL.AFGSG XTAAQGIQLT VPA'ITKKVAA KQYKTTLTWI
LDDTP
EF083-1 (SEQ ID NO:313) TAA'TrTAAAA
GCTCTGTTTT
GCGATTTATC
CAGA=TGAG
GCAAGTGCTA
GACAAGGAGA
TTATCCCTAC
AAGGTGAACA
GCTACAATC
rrGCCCAAGG
AATAAAAATG
AGCTATG=T
AGGTCGCTTT
TAGCGGTATT
TAAACGTGCG
AAAAAGAAAA
CCCGCAAAAG
CGCTATGCAC
TATGAACAAT
CATACCTATA
T'ITTAGCAGG
GAGACCAAGG
ATGATAAATTr
ACACATATAA
TTG-GTATGA
AGCGCTTGTC
TGTGGATTGG
CGCTATTrCC AACGCAAGTrG
CACTTGGGGA
173 TABLE 1. Nucleotide and Amino Acid Seqeuerices of E. faecalis Genes.
AACATGGACA
AATTCCATCG
GGAGGATATG
ATGCGCAGAA
CTAAATCAT'-
GCGTATCCTA
GGTATTGGTA
GATTTAACAG
CCAGCAACAG
GATACCGTCA
TGGGTAAAAG
GGTATC'ITGT
CCTGATAAGC
GCTTATCAAT
AATCTTATTT
ACGGTTAAAT
GCTTTAGCTG
TTGCGAAAAC
TTGCATTAGA
TAAGTTCAGA
TCAAACAGGC
TAAACTATCA
TCGATGGTGT
TTTGGCAATT
GAATTACGGA
ATGCAGGCGA
AAGTGAAATT
GAAACAGCTA
CATGGATTAG
AACCAGAAGC
ATGGAACAGA
ATCCTGGTCA
ACGGCGATAA
CATTAA-ACGG
AACAATGGAT
TACTTTTTGC
TTTTGAACAT GGAGCGTTrGG TGCCGAAAAA GCAGCAAATA TGCCTATACT CCAATGTATT ACAAATCATC
AAAGAGTTTC
GTCACCATAT
CCATMTATG
CACATCCGCT TATAM3CAG TAGTGGTTAT
ACAGATACCA
AGAAAT~rAA
AAA.ATACCTA
TAATGTAGAT GCTTGGGCAA
CAAAGTGCAA-GA.AGTAACTC
CAA.AGGTGAT AiTTCAATTAT G-ACTCATGTG
GTACAATACG
CTATCAAACG
TTGGCGGCAT
AGTTTTG;AAA CTCAATGGAT TTTATC-TAGT
ATTGCAGCA-A
ATTAGCAAAT
CCTAACTTIGA
:ACGTATTICA
:TAGTGTTCC
CAGAGACAAT
PICAGCTATAA
CTAACTC TI'
CTATTTCCC
GTGG'TTTAGA
ATAAACCAGA
ATTCTGATiGT
CTGGGGAAGC
GAAGCAGAGT
TGCCAGACGC
GAGAA.ACATT
TAAXATGGATT
CGGCA.ACAAG
AACTTGGCAC
TTTATCCAGG
ACGCCTAA-A
AGATIGGATAT
TTTGTACGGT
GCCATTTACA
ATGGATTGCT
ALAGCATGGAT
TGGTAkACGTA
A.ACGGATACG
TAAAGT1'GGC
TATTCCGCAA
ATTGCTTGAA
AACAGTCGTC
ATCAAGTAT
GGCTAATCCA
TAATGTCTAC
TACTTATCAA
TCAAAICATTG
AATTATTAA
EF083-2 (SEQ ID NO:314) MK KKILAGALVA LFFMPTAMFA AKGDQGVDWA IYQGE-QGRFG
YAHDKFAIAQ
IGGYNASGIY EQYTYKTQVA SAIAQGKRAH TYIWYDTJGN MOIAXITFMfY
FLPRIQTPKN
SIVALDFEHG ALASVPDGYG GYVSSDAEKA ANTETILYGM RRIKQAGYTP
MYYSYKPFTL
NHVNYQQIIK EFPNSLWIAA YPIDGVSPYP LYAYFPSMDG IGIWQFTSAY
IAGGLDGNVD
LTGITDSGYT DTNKPETDTP ATDAGEEIEK IPNSOVKVGD TVKVKFNVDA
WATCEAIPQW
VKGNSYKVQE VTGSRVLLEG ILSWISKGDI ELLPDATVVP DKQPEATHVV
QYGETLSSIA
YQYGTDYQTL AALNGLANPN LIYPGQVLKV NGSATSNVYT VKYGDNLSSI
AAKLGTTYQA
LAALNGLANP NLIYPrCQTLN
Y
EF083-3 (SEQ ID NO:]1S) AAAAG GAGACCAAGG
TGTGGAITGG
GCGATTTATC
CAGATTGGAG
GCAAGTGCTA
AACATGGACA
AATTCCATCG
GGAGGATATG
-ATCCGCAGAA
CTAAATCATG
GCGTATCCTA
GGTATTCTA
GAT~IAACAG
CCAGCAACAG
GATACCGTCA
TGCTAAAAG
GGTATCTTGTl
CCTGATAAGC
GcTTATCAAI
AATC'ITA=T
ACGC LTAAl'
GC-TTTAGCTC
AAT
AAGGTGAACA
GCTACAATGC
TCCCAACG
TTGCGAAAAC
TI'GCATTAGA
TAAGTTCAGA
TCAAACAGGC
TAAACTATCA
TCGATGGTGT
TTCG~cAATT
GAATTACGGA
ATCCACGCGA
AAGTGAAATI
GAAACAGCTP
cATcGATTAC
AACCAGAAGC
ATGGAAcAGZ
ATCCTGGTCJ
7ACGGCGATAJ
;CATTAAACGC
AGGTCGCTTT
TAGCGGTATT
TAAACGTGCG
AACAATGGAT
TT1'TCAACAT
TGCCGAAAAA
TGGcTATACT
ACAAATCATC
GTCACCATAT
CACATCCGCT
TAGTCCTTAT
AGAAATTGAA
TAATGTAGAT
CAAkAGTGCAA
CAAAGGTGAT
GACTCATGTG
CTATCAAACC
kAGTTFI'GAAP 'ITATCTAGI1 3 AITAGCAAA]
GGCTATGCAC
TATGAAcAAT
CATACCTATA
TACTTTMC
CGAGCGTTGG
GCAGCAA.ATA
CCAATGTATT
AAAGAGT'ITC
CCATrGTATG
TATAI'GCAG
ACAGATACCA
.AAAATACCTA
GCT1'GGGCAA
GAAGTAACTG
A=TAATTAT
GTACAATACC
TTGGCGGCAI
GTCAATGG.Al
ATTGCAGCA;
7CCTAACTTGI
ATGATAAATT
ACACATATAA
T1'GGTATr.A
CACGTATTCA
CTAGTr-TTCC cAGAGACAAT
ACAGCTATAA
CTAACTCTTT
CTTATTTCCC
GTGGTTTAGA
ATAAACCAGA
ATTCTGATGT
cTGGGAAC
GAAGCAGAGT
TGCCAGACGC
GAGAAACATT
TAAATGGATT
CGGCAAcAAG
SAACTI'CGCAC
k. TrATCCAGG
:GCTATTGCC
%.ACGCAAGTGC
CACTTCGGGA
AACGCCTAAA
AGATGGATAT
T'FrGTACGC-T GCcATTTACA
ATGGATTGCT
AAGCATGGAT
TGGTAACGTA
AACGGATACG
TAAAGTTGGC
TATTCCGCAA
A71'GcTI'GAA
AACAGTCGTC
ATCAAGTAT
GGCTAATCCA
TAATGTCTAC
TACTTATCAA
TCAAACATTG
174 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. fczecalis Genes.
EF083-4 (SEQ ID NO:316) KGDQGVDWA IYQGEQGRFG
YAHDKFAIAQ
IGGYNASGIY EQYTYKTQVA
SAIAQGXRAH
S IVALDFEHG ALASVPDGYG GYVS SDAEKA NHVNYQQIIK EFPNSLWIA-A
YPIDGVSPYP
LTGITDSGYT DTNKPETDTP
ATDAGEEIEK
VKGNSYI{VQE VTGSRVLLEG
ILSWISKGDI
YQYGTDYQTL AALNGLANPN
LIYPGQVLKV
LAALNGLANP
NLIYPGQTLN
TYIW'YDTWGN
AN'TETILYGM
LYAYFPSMOG
I PNSDVKVGD
ELLPDATVVP
NGSATSNVYT
MDIAKTTMDY
RRIYQAGYTP
IGIWQFTSAY
TVKVKFNVDA
DKQPEATHVV
-\KYGDNLSSI
FLPRIQTPKN
MYYSYXPFTL
IAGGLDGNVD
WATGEAIPQW
QYGETLSSIA
AAKLGTTYQA
EF084-1 (SEQ ID NO:317) TAGTCAAACG TTTATTTTrT CTATTGAATT
GGAGGAGAAC
AGCGGTATGA
TGGCCACGAT
AAAAATCGCC
GTGTI'GGCAA
AATAATCGGC
CCGCAGAAGA
AGCGCATTTT
CACAATTTGA
CACTTAA.AAG
AAGAAGATCA
GTTGATGCGC
TAT'ITAACCG
GTCACAA.AAT
TACTACGAAA
ATTTATGCAC
CGTGTGTTGT
ACTGGTGATG -GCTATAAACT ACCGAATCAC
CTATTATTTC
TCT=TACAAG
ATGTTAATTT
CAAATGGATA
TGCTGTTTAC
*AGTTTTATTA
ACCAAGAATT
'*CTCTTTCCGA
CAAAATCTT
*CTTTCCTTTG
TGGAACTACT
GAAAAATC'IT
TTGTCACAGC
GGACGCAA.AT
TAGTCAATGC
ACTCGAGGCT
ACAATGTTAC
:GCAGAAATI'G
CAGAATACAC
CCTTAAATCC
TATGAAGAAA
TGCGGCCGCC
AAAATLTATTA
AATCATTTCA
TAACTATGAT
CGGACGCATG
CAT1'AACGAA
AGACGATCAA
ATTAACAACT
AGcCAAAAAA
TGAAGAACCT
AACTGTTTTG
ACATTTTGC
AACTCGCAAC
TGAAGAAGTG
GAAAGACTTI
CGCTGGGATI
TTTATIT1"1 AGCTGCATT9
CTATTTACCI
AGAAAAAATC C
TTTGATGTAA
GAAGCAGGCG
C
ATGACTGGTG
C
TTTATTCCTG
ATCATGAACT
TTCCCTGTTA
TTAGGAGTCA
ATAAT1'GGCG
GGCGGCCGCA
ATGGGGCATA
'TTTATCCTGG
AACCAAAAAG
ATTTCAGGAC
GGTAATCAAC
CCTGCCAAAC
CAGTTCACTG
TCCCTCAA6AG
GCTGGTGAAC
CTCACTGGAC
kATTGAAGAAG :CGTAATTAT G .rT1TGTCGGI TCAAGTATT
A
CGGccGCTG I 3GAAIGGAAA 1rTT1TGAATC
CAGATAAATC
CTGTITTAC
TTGAAACCGA
CTTATCCTTC
CCATCAGCCC
ATAAAACGTT
GAAAACCT
CTGCCGCGCT
CTGTCACGGT
AACTAACAGA
TTACGAAAAC
AAGTGACCCC
TTr'TAGATAT
ATGTTGCTGG
TCTAA
;GTACACTAC
'GCTGGGACG
TI GATTGAA
AATGTAACC
=TrT'IATAC
:AATGGTATT
3AAGTCAATT
A.AAAACACAG
ACTGGAAAAA
CACAGGAGCA
GCTCTACCCT
GCAAGGTCTC
AGTTAATCAT
CCGCTGTT-CT
AGCCTTGGAT
AAAGCAACGN
ATTGCCTT
TAAAACAATG
TAATGGCTAT
CTCCCATGCC
EF084-2 (SEQ ID NO:31 8 a MXKF DVIIVGAGTS GMMATIAAAE
AGAQVLLIEK
NRRVGKKLLM TGGGRCNVTN NRPAEEIISF
IPGNGKFLYS
LKEEDHGRHF PVTDKSKSIV DALFNRINEL
GVTVFTKTQV
YAPCVVL ITG GRTYPSTGAT GDGYKLAKKM
GHTISPILYPT
LQDVNLTVLN QKCKPLVNHQ MDMLFTHFGI
SGPAALRCSS
FPTKSFEEVP AKQLTEKQRL SFVELLKDFQ
FTVTKTLPLE
SKLVNGLFFA GELLDINGYT GGYNVTAAFV
TGHVAGSHAA
AFSQFDNYDI
TKLLRKDDQI
ESPIISEEPF
FINQELTRNG
KSFv'rGGGIs EIAEYTYLP I
MNFFESNGIH
IGVETELEKI
ILDKTLQGLS
NQPVTVALDV
LKEVTPKTME
EEV
EF084-3 (SEQ ID NO:319) C GAAGcAGGCG CTCAAGTATT
ATTGAIGAA
AAAAATCGCC GTGTTGGGAA AAAATTATTA
ATCACTGGTG,
AATAATC-GGC CCGCAGAAGA AATCA TTCA ?flTA~CCTG AGCGCATTTT CACAATITTGA TAACTATGAT
ATCATGAACT
CACTTAAAAG AAGAAGATCA CGGACGCATG
TTCCCTCTTA
G MATGCGC TATTTAACCG CATrAACGAA
TTAGGAGTCA
GCGGCCGCTG;
GGAATGGAAA
TITTGAATC
CAGATAAATC
CTGTITTTrAC
TAATGTAACC
ATTTT'rATAC
CAATGGTATT
GAAGTcAATT
AAAAACACAG
TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
GTCACAAAAT
XITTATGCAC
ACTGGTGATG
ACCGAATCAC
TCTTTACAAG
CAAATGGATA
AGT=TTAA
GTGTTTCCGA
CTCTTTG
GAAAAATCTT
GAGAGCAAAT
ACTGGAGGCT
GCAGAAATTG
TACTACGAAA
CGTGTGTTGT
GCTATAAACT
CTATTA=TC
ATGTTAATT'I
TGCTGI'TAC
ACCAAGAATI'
cAAAATCTTT
TGGAACTACT
TTGTCACAGG
TAGTCAATCG
ACAATGTTAC
CAGAATACAC
AGACGATCAA ATAAT1'GGCG
ATTAACAACT
AGCCAAAAAA
TGAAGAAC-CT
AAC~rTTTTG ACAITr-TGGC
AACTCGCAAC
TGAAGAAGTC
GAAAGACTI'
CGGTGGGATT
GGCGGCCGCA
ATGGGGCATA
TTrTATCCTIGG AkACCAAAAAG
ATTTCAGGAC
GGTAATCAAC
CCTGCCAAAC
CAGTTCACTG
TCCCTCAA.AG
TTGC~AACCGA
CTTATCCTTC
CCATCACCCC
ATAAAACGTI'
GAAAACCTTTI
CTGCCGCGCT
CTGTCACCGT
AACTAACAGA
TTACGAAAAC
AAGTGACCCC
TTTTAGATAT
ATGTTGCTGG
TC
ACTGGAAAAA
CACAGGAGCA
GCTCTACCCT
GCALAGGTCTC
AGTTAATCAT
CCGCTGTTCT
AGCCTTGGAT
AAAGCAACGN
ATTGCCTTTG
TAA.AACAATG
TAATGGCTAT
CTCCCATGCC
TTATrTTTT GCTGGTGAAC AGCTGCATT GTCACTGGAC CTATTTACCA ATTGAAGAAG EF084-4 (SEQ ID NO:320) E AGAQVLLIEK NRRVGKKLLM TGGGRCNVTN LKEEDHGRMF PVTDKSKS IV YAPCVVLTTG GRTYPSTGAT LQDVNLTVLN QKGKPLVNHQ FPTKSFEEVP AKQLTEKQR.
SKLVNGLFFA GELLDINGYT NRPAEEI 1SF DALFNR INEL
GDGYKLAKM
MDMLFTHFGI
SFVELLKDFQ
GGYi\=AAFV
IPGNGKFLYS
GVTVFTKTQV
GHTISPLYPT
SGPAALRCSS
FTVTKTIJPLE
TGHVAGSHAA
AFSQFDNY-DI
TKLLRKDDQI
ESPIISEEPF
F INQELTRNG KSFVTGrCGIS
EIAEYTYLPI
MNFFESNG IH
IGVETELEKI
ILDKTLQGLS
NQPVTVALDV
LKEVTPKTME
EEV
EF085-1 (SEQ ID NO:321)
S.
S S S S
S
S
S* S
S
55
TAACCCATGA
CAGTCCACAC
CCGCGGTGTA
AAGGTTT T
TTTGCACACC
TATCAGCCAT
GGGCTGTTTC
GTACTCTATG
GTGGTACATA
CTTTATCAAG
CITFIATCCG
AGTGCGAATA
CTTGGCGCAC
AGGCTGGAG
GCGGCAGCCT
ATGCTTGAAA
ATGATGGACA
G-GGAGAATCG
CCCGA'IC
GGACAAATTG
AATGCGGTTC
GAAAAATACC
AAGCTC'ITC;
CGGTTlrcGGC
GGAAAACACC
AATA=TAA
AATcATTTTG
TCATATCAA-A
GG'ITACCGAT
CCTATACGAT
AATTTGTGTC
TGGTGGAACT
ACTACAATAA
CCACACCGAA
GAGATGATAA
GGCTGACAAT
CGTATATCTT
AAATGGCAAA
TGGCTCACGA
ATGAAATCGC
CTCATGAGTT
*ATATCGGTGA
*GGCAGCGCAA
TACCCATAGC
AAACCTTGGC
ITCCTGTCCGA
AGAACACGCC
GTCTTTCCGI
STCCCAT'TCTA
CTTCCCATCG
TCAGATGGCC
TCCCGCATAT
AATCTGCGGG
TGGAGGAATC
CATTGTCCTC
'TATTTCAGA
GATTCAGAAT
CCAATCCTTI'
TGCCGATACA
TATI'TCGAT]T
TCGGGGA.ATT
TCCGGCAA
CCTGAAAGAA
CATGCATC
AAGGGA.ACAT
AAAAACGCCC
CTACAAAGAC
AACCATCC
CGAACCGCTG
AGAGGCAAAC
TCCGAAGCTC
CCAGGGAGGI
TrTGAACATC
TCGCATTGA
TACAAAAAAC
~TrTGTTCT( GGCGATATGA CTFTGACGGT GATGGCAGCA CGAACTGCCG GAAAATATCA
TCAAAACCAT
ATI'ATAATGG AAAGAAAAGG
GATTTI'CATT
TTACTGCTTG TCGGTGTAAC GGCAACACTG GCGATGGAAG CACAGCAAAC
AGTAAAATCC
AGCGATAGGC TM'ATATGCA
AGAGGTGGCA
GAGTTrTATA TTGAAGATAA
AGAGGGAAGC
TCAAATAGT'G TTAGGCCCGA CTTTC TrAT G ITGCTCAAA GCAAGGCAGC TGTCGGATTG GTTATGATTG- CGATAATGT
TCTATTCAGC
ATGACAACGC CGATCAAAGC
CTTAGCGGAC
GTACCCCGC CGCTGGAGCG AAAGGATGAG ATGTATATCA G-GCTGAAAGA
AACCATCGCA
GAGT=GAGG AAA'CACAGCG
ATATI'TCTIT
ATCGCGGCTG TAAGCGTTCT
GTTGGAGGGA
CATTCTAAGT ATCTGCGCGA
ATGCATCA
GAAATAC TOG AGCTTGTCAG
CCTGAACGAT
GACATAGGGC GCACGGTTGC
CGAGCTGCTA
AACCAGCGGT TCGTCACAGA TAT1'CCAGCC ATCCAAAkAGG CGCTATCCAA TGTCATA ITG 7GAGGTACGGA TATGGAGTGA
GCCTGGGGCT
;GGCGTTCACA TIGATIGATAC
TGCACTTTCA
r' CAGGCGCGAA G-CAGCAAAAA GTGGGCGAAG GCTGGATGCC ATGAGCCTCC
AATATGCGCT
3GCTGGATTTA CCGCCCACAT CAACACTATA 176 TABLE 1. Nucleotidc and Amino Acid Seqeuences of E-faecalis Genes.
EF085-2 (SEQ ID NO:322)
MERKGIFIK
VFSYTI IVLL
LFHYNNQSFE
YQGLTIRGIV
GALAHDMHSM
LENIGDYKDH
DFQTLAEANN
KYRLSVLNNG
LLVGVTATLF
FYI EDKEGSV
MIAIMWVFSL
YIRLKETIAR
SKYLREC 1KM
QRFVTDIPAG
VHIIDDTALSK
AQQFVSYFRA
LYATPNADTS
LCAY IFARQM
LEDEIAREIE
MURQGKTISE
QIVTLSDPKLI
LFIPFYRIDQ
tAEAQQTvKsy QPLVELIQNS NSVRPDFLYV
VH-RDDNISIV
Trp IKALADS ANXMANLKEV LEEFTQRYFFA
AASHELXKPI
ILIELVSLNIDC R IVP IAEPLD QKALSNVILN
AVQNTPQGGE
ARSSKKWAKR
FGACHRTKNA
DRLDMQEVAG
AQSKAGVGLL
PPPLERKDEL
AAVSVLLEGM
IGRTVAE-LLP
VRIWSEPGAE
GCHEPP ICAG KHLRWRFVLA GFTAHINTIN
I
EF085-3 (SEQ ID NO:323) GCATTTCGTGTC TITATTTCAGA GCGATGGAAG CACAGCAAAC
AGTAAAAT
cc
TATCAGCCAT
GGGCTGTTTC
GTACTCTATG
GTGGTACATA
C'ITTATCAAG
CT=TATGCG
AGTGCGAATA
CTTGCCCAC*
AGGCTGGAGG
GCGGCAGCCT
ATGCTTGAAA
ATGATGGACA
GGGAGAATCG
CCCGATTTC
GGACAAATG
AATGCGGTTC
GAAAAATACC
PAr.CTCTTCA
CGGTTTGGGG
GGAAAACACC
AATAT
rGGTGGAACT C ACTACAATAA C
CCACACCGAA
GAGATGATAA
GGCTGACAAT
CGTATATC~r
AAATGGCAAA
TGGCTCACGA
ATGAAATCGC
CTCATGAGT
ATATCGGTGA
GGCAGGGCAA
TACCCATAGC
AAACCTTGGC
TCCTGTCCGA
AGAACACGCC
GTCTTTCCGT
TCCCATTCTA
CTTGCCATCG
TCAGATGCC
;A'rCAGAAT
:CAATCCTTT
E'GCCGATACA
L'ATTTCGATT
rCGGGGAATT
I'GCGCGGCAA
CCTGAAAGAA
CATGCATTCC
AAGGGAACAT
AAAAACCCCC
CTACAAAGAC
AACCATrI'CC
CGAACCGCTG
AGAGCCAAAC
TCCGAAGCTG
CCACGGAGGT
TTITGAACATG
TCGCATTGAT
TACAAAAAAC
'rTrTTCTG AGCGATAGGC ?I'GATATCCA GAGT'FTTATA
TTGAA.GATA-A
TCAAATAGTC;
TTAGGCCCGA
GTTGCTCAAA
GCAAGGCAGG
GTTATGAT1G CGAtAATGGT ATCACAACGC
CGATCAAAGC
GTACCGCCGC
CGCTGGAGCG
ATGTATATCA
GGCTGAAAGA
GAGTTGGAGG
AA.ACACAGCG
ATCGCGGCTG
TAAGCGTTCT
CATCTA-AGT
ATCTCCGA
GAAATACTGG,
AGCTTGT'CAG
CGACATAGGGC'
GCACGGTTGC
AACCAGCGGT
TCGTCACAGA
ATCCAAAAGG
CGCTATCCAA
GAGGTACGCGA
TATGGAGTGA
GGCGTTCACA
TTGATCATAC
CAGGCGCGAA GCAGCAAAA GCTGGATGCC
ATGAGCCTCC
GCTGGATTrA
CCGCCCACAT
GAGGTGGCA
krAGGGAAGC
:ITTCTTTAT
TGTGGGA'rTG
I'GTATTCAGC
CTTAGCGGAC
AAAGGAGAG
AACCATCGCA
ATATTTCTT
GTTGGAGGGA
ATGCATCAAA
CCTGAACGAT
CGAdCTGCTA
TATTCCAGCC
TGTCATAITG
GCCTGGGGCT
TGCACTTTCA
GTGGGCGAAG
AATATGCGCT
CAACACTATA
a a.
a.
a a EF085-4 (SEQ ID NO:324) a a, QFVSYFRA ME.AQQTViKSY QPLVELIQNS
DRLDMQEVAG
LFHYNNQSFE FYIEDKEGSV LYATPNADTS
NSVRPD)FLYV
YQGLTIRGIV MIAIMVVFSL LCAYIFA-RQM TrPIKALADS GALAHDMHSM YIRLKETIAR LEDEIAREHE
LEETQRYFFA
LENIGDYKDH SKYLRECIKM MDRQGKTISE
ILELVSLNJG
DFQTLAEANN QRFVTDIPAG QIVLSDPKLI
QKALSNTVILN
KYRLSVLNMG VHIDCTALSK LFIPFYRIDQ
ARSSKKWAKR
KHLRWRFVLA GFTAHINTIN
I
VHRDDNIS IV AQSKAGVGLL ANKMANKEV
PPPLERKDEL
AASHELKTP I AAVSVLLEGM RIVP IAEPLD
IGRTVAEL.LP
AVQNTPQGGE
VRIWSEPGAE
FGACHRTYKNA
GCHEPPICAG
EF086-1 (SEQ ID NO:325)
TAACTGGTGG
CATGATGACC
CTGTATGCAA
ACAGAT=-
GATTGGCAAA TIGGTCCGC GCAGCGCTAA CAGATACATT
GATTTIATA
TATGAATAC AGATGCAGAA AAA'ITAAATA AATTTACTGC
TCCGCTGATG
AAGATCCAAA CATACAATCG CCAATTTATC GTG-CAACAGG
AGCTA.ACTTA
CAATCACCGT TTTAGGTACT GGACTT'F11GT TAGAAGATAA
TCAACCCCTA
I i I I TABLE 1. Nucleotide and Amino Acid Seqeuences of E faecalis Genes.
GTACAAGTAC
TATCCTGATG
GAGTTGCTAA
AATGACCCTA
GTAAATGGAA
AATCCTTTTA
GCAAAATTTG
CAACAAAGTG
GACTTGAAAA
AATGTATATG
AGTATGTAT
TGGCATACAG
GGATACTGGG
TTGGCAAATG
AATGGACAGG
GTGCTAAAA
ACTGGTACGA
GAAGTGAAGC
ANTCCATTGA
ATAGAAGA.AC
TATACAAATA
TACGAATAIT
AAAGGCTATA
GTCATGATGA
ATGTCGGTTA
TTACAAAATA
GCGGACCCAG
GGCAGCTCGC
AAATTAATTC
GTCTACAGCG
GCAGAAGTAG
CCAACTAAAG
CAAGAAAAAG
*CAAACTCTGG
CTACGTTCGG
AAAAATGGGG
AAACAAGTTA
ACAGGCAAA
TGTCrTTAG
PLAGAAGCTGT
GTTCCTTGAT
AAGGGTTTCG
1ACATT1AGTAA
AAATGCCATC
CTACAGAGTT
CACCAGAAAA
GGTCATACTA
ATGTAGTGAA
GTI'CGATGGA
CACAACGTGT
CAGACGGCAT
CAACGATCGA
GTGCTTATAC
TTCCCTCTAT
AATCTTGGTT
CAGATGCTTC
TTAACCAAGG
ATAACATI'GG
GCTCTGGTCG
CATTTGCTAA
TAACAGTCGT
CTGITCTAGA
ATACATGGAA
FI'CAGAAAA
ATGCATCCGT
AAATT"TCTGT
GTTCAATCAT
AGGAACAAAA
AAGCA PTGAA
ACCAAGCAGA
AAGTAGATAA
ACTACACCGC
CAGATCAAAC
CAGTGAAC
GGAATAATG
AGCCATCAAG
AGAAATCGAT
TTTrTCGTAA TCCGTCCGTT TTAAAAAGTG TCAACATGGT TAT'PTTCCGT ACGAATTCAG ACTATITAC TTTATI'TAAT GTTGTG-GATA GATGGTTTCT GCTAGAAGTA TGAATCGGGT AAAGAAACAA 'ITTAAGAAAT GACAT ITATA TCATTTCTTT AAA.AAACCAA TAGTGCCTCA CCTGCCCAAG TCGAGTCCTA CAGA-AAAATA CGGAACTATC-AAT2Y=GA GCTTTAT'ITA TACAATCAAG TCCATATCGA TTACCAGGAA AGGGAAACGC AGTCCCCAGT AGGAATGTTT TTAGATAAAA C'FrATTAGAT GGTCAAATCA GATTGAAACA ATCCTCGATA TTCAGACAAA GATAATTC'TT CTATGTTrTT CCTAATTCNA CTACGGAGAT ATTAACGAAT AkATTAGTAAA AATTATGGCA TGGGAAAACG AATGA-AGAAA AAATACAGCA AACTTACAAG TAATGACCAA GAAA=TCAG AATTGATAAC GGTGTTTATC T1'CTATTGAA TTTGATAAGG TGACCAAAAT ATTATCACTI' TGTTAAAACA ACTCCCAG AGAACACCAA GAAAAAGACT ACAAGCACAA ACTGTGGCAG AACAGAG [TA CGTTCGGCAG AACCAACTTG TITGAAAATCA AAGCAGTTGG AAAGTCTACA AACAGCAACG CAAGCAGAAG AITAACATTG AAAAATAGTG ACACI'AAAT ACTAGTACAG CCAAGCTIGT 'ITCAGAAAAG CGCGCTTGTG AIATTGGTC AAGTAAATCG AAGAAGTAA
TTCCTCTGG
ACA.ACGGCAG
AAGGTTCCGA
AAGGT'IACTT
T'T TCCAGAGC
TAGCTAAFT
CATCTATCCA
GAGA7=~A
CGACACCAAT
ACGAATATGC
ATACCGAAAA
ACITGCTCA
CGACAGTTGA
CATGCTAGC
GTAATGAAGG
TTAAbTTTGGG
ATCGGATGAT
GGATTAGTTT
TGAATACGCT
ACTITGTTAA
AGACTCGFTGA
TCGCAGCTCT
CCATTGAAGC
G-ACTGTATGC
GCTMhACTCT
GCATTCTI'GA
TAAkATACGC
TAACGAAAGA
ACACCGCA.AG
ATCAAACA.AC
TGAAGCAATI'
TCAAAGAAAA
GTG.AAGCATTr
TAGACCAAGC
GGGAAAATAA
GAGTTGATCA
CTACCCAATI'
'PTCTAGTTAT
TGATCGCT-TA
TTACGGGAAT
CTGGGAGATG
ACAATTGATG
CCCAGAAACG
AACCTTAATT
AACGTGGCTT
AGCGTTAXI"I
GCAATCTITA
GGTGGGGATC
TAAAAAAGC
GTTTGATGAA
CAC-AAGAGAA
TGG-CTCAAAT
AATGAACTTA
AAG-TGGCATT
TCATCtACAG
AAGCGCAGCG
TGATGTTCAA
TGATAA-AACC
AAATGGTACT
TTCTAAAAAC
AGGTAATI'AT
GTATCATCCA
TGCGAATCCT
AGTAGTCGCA
GGGGTTAAAT
AGCC ITAGAA
CAGCTGGAAA
AGCAACGCAA
GGTAAAAGTG
CGAGAAACAC
GAAGCAAGCG
AGAAGCAAAA
AAAGGAGCAA
AACTGGTACG
TTTACCGAGC
CGCCAGTGGG
EF086-2 JSEQ ID NO:326)
LVGI.ANWFRA
DIS ITVLCTG
LLKGFGRIQT
PFTTEFESGK
LKNVVNSASP
HTADGMLYLY
GQVAS IGMFL
VKLNQGSDKD
TNTFAKISKN
lOrINNDQE
DPEISVDQNI
YSEALKQAQT
ALTDTLILU4
LLLEDNQRLV
ILQGSDWEN
ETIANLTLIA
AQATPMQSLN
NQDFAQFDEG
DKSNEGMNLV
NSWISLSAAX
YGKTVENGTY
IAGLYAYDPM
ITLNSAGLNG
VADQTTATQA
DDLLNTDAEK
QVQE-AVPSVL
DPNISNELFNV
KFAPENLRND
VYGSMDRVLQ
YWATIDPYRL
AKKSWFLLDG
PLNNIGYVFP
EYLTVVGKTN
SVISEKI DNG
SSRSIIVKTT
EVDQAETE-LR
LNKPTAPLML
KSVSSGDGLY
VDKGYLQLMV
IYTS IQTWLQ KNNEYAVGI S
PGTIVDTREL
QI INLGSGIT NSMNTLIDVQ I
EEIAALSKNK
VYRLTLANPL
PEVTKEALEK
SAVKQLVKVP
YAKDPNIQWP
POGSLIQHGY
NGKMPSMVSG
QSGSYYHFFK
M4YSQRVGNYE
AN'GAYTGKRS
GTrDAS IETI
EERSGRYGDI
GY'NENTAN.
QNNASVSIEF
L IQEQKEIHQE
?KXVDKTNLL
IYRATGANLT
FPYNGSYGNE
RS ISRAPETN
KPRDFEALID
FGNTENKXGW
PQSWVGGSN
LDNRMI HPQE
NEYFVNDKTY
LQAIE-AGNYV
DKGILEVVAA
KDYTASSWKV
KIIKENEKHQ
178 TABLE 1. Nucleotide and Amino Acid Sequences of E. faecalis Genes.
EKDYTASSWK VYSE-ALKQAQ TVADQTTATQ AEVIDQAEAKL RSAVKRL'TLK
NSGENKKEQK
NGGNNGHLNT STGVDQTCTK QVKPSSQGGF RKASQFLPST GEKXSIALVI
IGLLDVIASGC
LLVFRKSI(SK
K
EF086-3 (SEQ ID NO:327) ACCACAAAA TTTAAGAAAT GACATTTATA CATCTATCCA
AACGTGGCITT
CAACAA.AGTG GGTCATACTA TCATI'CTTT AAA.AAACCAA
GAGATTTTG.
GACTTGAAAA ATCTAGTGAA TAGTGCGTCA CCTGCCCAAC
CGACACCAA
AATGTATATG GTTCGATGGA TCGAGTCCTA CAGAAAAATA
ACGAATATG
AGTATGTAT CACAACGTGT CGGAAACTAT- GAATTTGGGA
ATACGGAAA
TGGCATACAG CAGACGGCAT GCTTTATTTA tACAATCA.AG
ACTITTGCTC
GGATACTGGG CALACGATCGA TCCATATCGA TTACCAGGA
CGACAGTTG
TTGGCAPLATG GTGCTTATAC AGCGA.AACGC AGTCCCCAGT
CATGSGGTAG
AAT
A
C
A
A
A
AGCCGTTAATIT
GcAATCTTTA
GGTGGGGATC
TAAAAAAGGC
GTTTGATGAA
CACAAGAGAA
TGGCTCAAAT
EF086-4 (SEQ ID NO:328) PENLRND IYTSIQTWLQ QSGSYYHFFK
KPRDFEALID
LKNVVNSASP AQATPMQSLN VYGSMDRVLQ KNNEYAVGIS
MYSQRVGNYE-
HTADGMLYLY NQDFAQFDEG YWATIDPYRL PGTIVDTREL
ANGAYTGKRS
FGNTENK(KGW
PQSWVGGSNN
EF087-1 (SEQ ID NO:329) TAACTGGTGC GATTGGCAAA
TI
CATGATGACC TATTGAATAC
AC
CTGTATGCAA AAGATCCAA-A Cl ACAGATATIT CAATCACCGT
T
GTACAAGTAC A.AGAAGCTGT
T(
TATCCTGATG GTTCCTTGAT Ti GAGTTGCTAA AAGGGTTTCG
A
AATGACCCTA ACATTAGTAA
T
GTAAATGGAA AAATGCCATC
G
AATCCTTTTA CTACAGAGTT
T
GCAAAAT'ITG CACCAGAAAA CAACAALAGTG GGTCATACTA
TI
GACTTGAAA-A ATGTAGTGAA AATGTATATrG
GTTCGATGGA
AGTATGTATT CACAACGTGT
C
TGGCATACAG
CAGACGGCAT
GGATACTGGG
CAACGATCGA
TTGGCAAATG
GTGCTTATAC
AATG;GACAGG
TTGCCTCTAT
GTTIGCTAAAA
AATCTGGT'
ACTGG.TACGA
CAGATGCTTC
GAAGTGAAGC
TTAACCAAGG
ANTCCATTGA
ATAACMTTGG
ATAGAAGAAC
GCTCTGGTCG
TATACAAATA
CATTTGCTAA
TACGAATATT TAACAGTrGT AAAGGCTATA cTGTTCTAGA GTCATGATGA
ATACATGGAA
ATGTCGGTTA ?I2TCAGAAALA TTACAAAATA
ATGCATCCGT
ATGCAGAA A kTACAATGG C' rTAGGTAcT
G
CCGTCCG T
CAACATGGTT
CGAA'rrCAG A TTATT1AAT
G
ATCGTTTCT
G
GAATCGGGT
TTAAGAAAT C 'CATTTCT?1T
!AGTGCGTCAC
.CGAGTCCTA
:GGAAACTAT
;cTTTATI'TA rcCATATCGA
AGGGAAACC
AGGAATG'Tr cTTATTAGAT
GATTGAAACA
TTCAGACAAA
CTATGTTT
CTACGGAGAT
AATTAGTAAA
TC.GGAAAACG
AAATACAGCA
TAATGACCAA
AATTGATAAC
TTCTAT'rGAA kAITAAATA A.; CAATTTATC G~ GACT'T
TI
TAAA.AAGTG T AT'ITTCCGT A( CT WITTTAC A TTCGTGGATA
A
~GTAGAAGTAT
AAGAA.ACAAT
;ACATTTATA C ~LAAAACCAA C :CTGCCCAAG
C
:AGAAAAATA
P
GAA=rGGGA TACAATCAAG
I
TTACCAGGAAC
AGTCCCCAGT
TTAGATAAAA
GGTCA.AATCA
ATCCTCGATA
GATAALTTC~r ccTAATTCNA A'rrAACGAAT
A.AITATGGCA
AATGAAGAAA
AACTTACAAC
GAAATIGMCAG
GGTGT'P1ATC
TTTGATAAGG
~GGTTCCGC GCAGCGCTAA %GATACAT GP TTTAcTGC TC GCAACAGG AC kGAACATAA TC rrCCTCTGG- T( AACGGCAG T' p1rCrTTcCA C ACGT-TACTT A TTCCAGACC G AGCTAATTT
A
ATCTATCCAA
,AGATTTTGAA
GACACCAAT
G
~CGAATATGC G TACGGAAAA qj ~CTTIICTCA C MACAGTTGAc
CATGGGTAGG
GTAATGAAGG
TTAATTGG
PTCGGATGAT
GGATTAGTTT
TGAATACGCT
ACTTTGTTAA
AGACTGTI'GA
TCGCAGCTCT
CCATTGAAGC
GACTGTAT~c
GCTTAACTCT
GCATTCTI'GA
~TTTATTA
.CGCTGATG
,CTAACTTA
.AACGCCTA
;ATGGCT]TA
rACGGGAAT
TGGGAGATG
CAATTGATG
CcAGAAACG
ACCTTAA'T
ACGTGGCTT
,CCGTTAAT]T
:CAATCTTTA
,GTGGGGATC
'AAAA.AAGGC
TMATGAA
.ACAAGAGAA
rGGCTCAAAT AATGAACT1'A
AAGTGGCATT
TCATCCACAG
AACCGcAGCG
TGATGTTCAA
TGATAAAACC
AAATGGTACT
TTCTAAAAAC
AGGTAATTAT
G;TATGATCCA
TGCGAATCCT
AGTAGTCGCA
179 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
GCGGACCcAG
GGCAGCTCC
AA-ATTAATTC
GTCTACAGCG
GCAGAAGTAG
CCAACTAAAC
cAAGAAAAAG
CAAACTGTGG
CTACGTTCGG
AAAAATGGGG
A).AcAAGTTA
ACAGGAGAAA
TGTCTTTAG
AAATTTCTGT
GTTCAATCAT
AGGAACAAAA
AAGCATTGAA
ACcAAGCAGA kACTAGATAA
ACTACACCGC
CAGATCAAAc cAGTGAAGCG
GGAATAATGG
AGCCATCAAG
AGAA.ATCGAT
TTTTI'CGTAA
TGACC-AAAAT
TGTTAAAACA
AGAACACCAA
ACAAGCACA
AAcAGAGTTA AAccAAC'1r AAGcAGTTGG AAcAGCAACG
ATTAACATI'G
ACAc?1TAAAT
CCAAGGTGGT
CGCGcTTGTG
AAGTAAATCG
ATTATCACTT
ACTCcTGAAG
GAAAAAGACT
ACTGTGGCAG
CGTTCGGCAG
TTGAAAATCA
AAAGTCTACA
CAAGCAGAAG
AAA.AATAGTG
ACTAGTACAG
TTqAGAAAAG
ATTATTGGTC
AAGAAG'TAA
TAkATACTGC
TAACGAAAGA
ACACcGcAAG
ATCAAACAAC
TcAAGCAATT
TCAAAGAAAA
GTGAAGCATT
TAGACcAAGC
GGGAAAATAA
GAGTTGATCA
CTACGCCAATT
TTC:TAGTTAT
cGGGTTAAAT
AGCGTTAGAA
CAGCTGGAAA
ACAACGCA
GGTAA.AAGTrG
CGAGAAACAC
GAACCAAGCG
AGAAGCAAAA
APLAGGAGCAA
AACTGGTACG
'rrTACCGAGC
CGCCAGTGGG
EF0872 (SEQ ID NO:.330)
INGLANWFRA
DIS ITVLGTG
LLKGFGRIQT
PFTTEFESGK
LJKNVVNSASP
HTADGMLYLY
GQVAS IGMFL
VKILNQGSDKD
TNTFAK ISKN
MMNTWNNDQE
DPEISVDQNI
YSEALKQAQT
EKDYTASSWK
NGGNNGHLNT
ALTDTLILLH
DDLLNTDAEI(
LLLEONQRLV
ILQGSDWEMN
ETIANLTLIA
AQATPMQSLN
NQDFAQFDEG
DKSNEGb4NLV
NSWISLSAAX
YG1(IVENG IY
IAGLYAYDPM
ITLN~SAGLNG
VADQTTATQA
vysEAIJKQAQ
STGVDQTGT<
QvQEAVPSVL
DPNISNLFNV
KFAPENLRND
VYGSMDRVLQ
YWATIDPYRL
AKKSWFLLDG
PLNNIGYVFP
7EYL IVVGKTN'
SVISEKIDNG
SSRSI IVKITT
EVDQAETELR
TVAE)QTTATC
QVYPSSQGGE
LtqKFT-APLMli
KSVSSGDGLY
VDKGYLQLKV
IYTSIQTJJLQ
KNNEYAVGIS
PG ITVDTREL QI INLGSGIT NSMN2TLDVQ
I
EEIAALS(NK
VYRLTLANPL
*PEvTKF.ALEI< *SAVKQLVKVrP
AEVDQAEAKI
*RKASQFLPS3
YAKDPNIQWP
PDGSLIQHGY
NGKMPSMVSG
QSGSYYHFFK
MYSQRVGNY-E
ANGAvYTGKRS GTTD ASIETI
EERSGRYGOI
GYTVLMNTAN
QNNASVS IEF
LIQEQKEHQE
TYEVDKTNLL
RSAVKRLTLK
GEKXSIALVI
IyRATGANLT
FPYNGSYGNE
RS ISRAPETN
KPRDFEALID
FGNTENKKGW
PQSWVJGGSNN
LDNRMIHPQE
NEYFVNTJKTY
LQAIEAGNYV~
DKG-ILEVVAA
KDYTASSWKV
KI IKENEKHQ
NSGENKKEQK
IGLLVIASGC
a LLJVFRKSKSK
K
EFOB7-3 (SEQ ID NO:331) A ATCGGATGAT
TCATCCACAG
GAAGTGAAGC
ANTccAI'GA
ATAGAAGAAC
TATAcAAATA
TACGAATATI'
AAAGGCTATA
GTCATGATGA
ATGTCGGTTA
TrAAcCAAGG
ATAACATTGG
rGcTCTGGTCG
CAICTAA
TAACAGTGGT
CTGTTCTAGA
ATACATGGAA
TTCAGAAAA
TTCAGACAAA
CTATGTTTT
CTAcGrGAGAT
AMTTAGTAA.A
TGGGAAAACG
AAATACAGCA
TAATGACCAA
AAT'rGATAAC
GATAATTCTT
ccTAATTCNA
A'FTAACGAAT
AA ETATGGCA
AATGAAGAAA
AACTTACAAG
GAAATTGCAG
GGTGTTITATC
GGATTAGI'T AAGCGCAGCG TGAATACGCT TyGATGTTCAA ACTTGTTAA
TGATAAAACC
AGACT,3TTGA
AAATGGTACT
TCGCAGCTCT
TTCTAAAAAC
CCATTG-AAGC
AGGTAATTAT
GACTGTATGC
CTATGATCCA
GCTTAACTCT
TGCGAATCCT
TTACAAAATA
ATGCATCC
EF087-4 (SEQ ID NO:332)
NRMIHPQE
VKLNQGSDKD NSWISLSAAX PLNNIGYVFP NSMNTLDVQI EERSGRYGDI
NEYFVNDYTY
TNTFAKI SKN YGKTVENGTY EYLTNVGKTN EEIAALSKN3( GYTVLENTAN
LQAIEAGNYV
MMNTWNNDQE IAGLYAYDPM SVISEKIDNG VYRLTLANPL
QNNAS
EF088-1 (SEQ ID NO:333) TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
TAACTGGTGGC
CATGATGACC
CTGTATGCAA
ACAGATA'IT
GTACAAGTAC
TATCCTGATG
GAGTTGCTAA
AATCACCCTA
GTAAATGGAA
AATCCTTIA
GCAAAATTTG
CAACAAAGTG
GACTTGAAAA
AATGTATATG
AGTATGTATT
TGGCATACAG
GGATACTGCG
TTGGCAAATG
AATGGACAGG
GTTCCTAAAA
ACTGGTACGA
GAAGTGAAGC
ANTCCATTGA
XTAGAAGAAC
TATACAAATA
TACGAATTT
AAAGGCTATA
GTCATGATGA
ATGTCGGTI'A
1'rACAAAATA
GCGGACCCAG
GGCAGCTCGC
AAATTAATTC
GTCTACAGCG
GCAGAAGTAG
CCAACTAAAG
CAAGAAAA-AG
CAAACTGTGG
CTACGTTCGG
AAAAATGCGGG
AAACAAGTTA
ACAGGAGAAA
TGTCTTITAG
~A'TGGCAAA TI L'ATTGAATAC k.AGATCCAAA C :AATCACCGT k~AGA.AGCTGT I1 GTTCCT1'CAT ~I ~AGGTTTGG
ACATTAGTAA
AAATGCCATCC
CTACAGAGT
CACCAGAAAA
GGTCATACTA
PTGTAGTGAA
GTTCGATGGA
CACAACGTGT
CAGACGGCAT
CAACGATCGA
GTGCTTATAC
TTGCCTCTAT
AATCTIGGTT-
CAGATGCTTC
TTAACCAAGG
ATAACATTGG
GCTCTGGTCG
CATTI'GCTAA
TAACAGTGGT
CTGTTCTAGA
ATACATGGAA
TTTCAGAAAA
ATGCATCCGT
AAA'TrTCTGT
GTCAATCAT
AGGAACAAAA
AAGCATIGAA
ACCAAGCAGA
AAGTAGATAA
ACTACACCGC
CAGATCAAAC
CAGTGAAGCG
GGAATAATG
AGCCATCAAG
AGAAATCGAT
TTTTCGTAA
'TGGTTCCC
,GATGCAGAA
:ATACAATGG
~TTAGGTACT
~CCGTCCGTT
~CAACATGGT
CCAATTCAG
.FrATITTAAT ;ATGTrrCT
MGAATCGGCT
ETAAGAAAT
ECA=TC=T
[AGTGCGTCA
ICAGTCCTA
:GGAAACTAT
GCTTTATTTA
TCCATATCGA
AGGGAAACGC
AGGATIGT
CTTATTACAT
GATI'GAAACA
TTCAGACAAA
CTATGTTTT'
CTACGGACAT
AATTAGTAAA
TGGGAAAACG
AA.ATACAGCA
TAATGACCAA
AATTGATAAC
TTCTA'ITGAA
TGACCAAAAT
TGTTAAAACA
AGAACACCAA
ACAAGCACAIA.
AACAGAGTTP
AACCAACTTIC.
AAGCAGTTGC
AACAGCAACC
ATTAACATTC
ACACTTAAA]
CCAAGGTGG'
CGCGCI'GTC
AAGTAAATC(
GCAGCGCTAA
AAATTAAATA
CCAATTTATC
GGACTT=TT
TAAAAAGTG
TATTTTCCGT
ACTATTTTAC
GTTGTGGATA
GGTAGAAGTA
AAAGAAACAA
GCAXFITATA
AAAAAACCAA
CCTGCCCAAG
CAGAAAAATA
GAATTTGGGA
TACAATCAAG
TTACCAGGAA
AGTCCCCAGT
TTAGATAAAA
GTrAAATCA
ATCCTCGATA
GATAAITCIT
CCTAATTCNA
ATTAACGAAT
AAkTTATGGCA
AATGAAGAAA
AACTI'ACAAG
GAAATTGCAG
GGTGTTTATC
TTTGATAAGC
ATTATCACTI
ACTCCTGAAC
GAAAAAACI
ACTGTGGCAC
LCGTCGGCAC
TTGAAAATC)
AAAGTCTAC)
CAAGCAGAA(
AAAAATAGT(
ACTAGTACAC
T TCAGAAAA( ;AlrATTGGT(
;AAGAAGTAA
CAGATACATT
AATTTACTGC
GTGCAACAGG
TAGAAGATAA
'ITTCCTCTGG
ACAACGGCAG
AAGGTTCCGA
AAGG ITACTT
TTTCCAGAGC
TAGCTAAM
CATCTATCCA
GAGATTTTGA
CGACACCAAT
ACGAA'VATGC
ATACGGAAAA
ACTT-TGCTCA
CGACAGTTGA
CATGGGTAGG
GTAATGAAGG
TTAA'-TTGG
ATCGGATGAT
GGATTAGTTT
TGAATACGCT
ACITTrGTTAA *AGACTG ITGA
*TCGCAGCTCT
CCATTGAAC
GACTGTATGC
*GCrAACTCI
*GCATTCI'GP
TAAATAGTGC
TAACGAAAG;
SACACCCCAAC
ATCAAACAAC
TGAAGCAATI
k TCAA-AGAAAJ k. GTGAAGCATj
TAGACCAAGC
C GGAAAATAJ
.GAG=ATCJ
3CTAGCCAAT.
TTCTAGTTA.
GATTTTATTA
TCCGCTGATG
AGCTAACITA
TCAACGCCTA
TGATGGCTTA
TTIACGGGAAT
CTGGGAGATG
ACAATTGATG
GCCAGAALACG
AACCTTAATT
AACGTGGCT
AOCGTTAATT
GCAAT'CTTTA
GGTGGGGATC
TAAAAAAGC
GTFI'GATGAA
CACAAGAGAA
TGGCTCAAAT
AATGAACTTA
AAGTGGCATT
TCATCCACAG
AkAGCGCAGCG
TGATGTTCAA
TGATAAAACC
AAATGGTACT
TTCTAAAAAC
AGGTAATTAT
GTATGATCCA
TGCGAATCCT
AGTAGTCGCA
GGGGTTAAAT
AGCGTI'AGAA
CAGCTGGAAA
AGCAACGCAA
7GGTAAAAGTG-
CGAGAAACAC
r GAAGCAAGCG
,AGAAGCUAAA
k AAAGGAGCAA k AACTCCTACG r 1TTACCGAGC r CGCCAGTGCC- EF088-2 (SEQ ID NO:334)
LVGLANWFRA
DISITVLGTG
LLKGFGRIQT
PFTI'EFESGK
LKNVVNSASP
HTADGMLYLY
GQVAS IGMFL
VKLNQGSDKD
ALTDTLILLH
£LLEDNQRLV
ILQGSDWEMN
ETIANLTLIA
AQATPMQSLN
NQDFAQFDEG
DKSNEGMNLV
NSWISLSiAX
D)DLL.NTDAEK
QVQFEAVPSVL
DPNISNLFNV
KFAPENLRND
VYGSMDRVLQ
YWATIDPYRL
AKXSWFLLDG
PLNNIGY'VFP
LNKFTAPLML
KSVSSGDGLY
VDKGYLQLMV
IYTS IQTWLQ
KNNEYAVGIS
PGTTVDTRZEL
QI INLGSGIT
NSMNTLDVQI
YA1KDPNIQWP
PDGSLIQHGY
NGMPSMVSG
QSGSYYHFFK
MYSQRVGNYE
ANGAYTGKRS
GITDAS IETI
EERSGRYGDI
IyRATGANLT
FPYNGSYGNE
RS ISRAPETN KPRDFEALI D
FGNTENICKGW
pQSWVGrSNN
LDNRMIHPQE
NEYFVNDKTY
TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
TNTFAKI SKN YGKTVENGTY bMMII\TWNNDQE IAGLYAYDPM DPEISVDQNI ITLNSAGLNG YSEALKQAQT VADQTTATQA EKDYTASSWK VYSEALKQAQ NGGNNGHLNT STGVDQTG--'K LLVFRKSKSI( K EYLmVVGKTN
SVISEKICNG
SSRSIIVKTT
EVDQAETELR
TVADQTTATQ
QVKPSSQGGF
EEIAkALSKNK
VYRLTLANPL
PEVTKEAkLEK
SAVKQLVKVP
AEVDQAEAKL
RKASQFLPST
GYTWLETAN
QNNASVS I EF
LIQEQKEHQE
TKEVDKTNLL
RSAVKRLTLK
GEKKS IALVI LQAI EAGNY-V DKG ILEVVAA
KDYTASSWKV
KIIKENEKHQ
NSGENKKEQK
IGLLV IASGC EF088-3 (SEQ ID NO:335) A ACTCCTGAAG TAACGAAAGA AGCGTTAGAA
AAATTAATTC
GTCTACAGCG
GCAGAAGTAG
CCAACTAAAG
CAAGAAAAAG
CAAACTGTGG
CTACGTTCGG
AAAAATGGGG
AAACAAGTTA
AGGAACAAAA
AAGCATTGAA
ACCAAGCAGA
AAGTAGATAA
ACTACACCC
CAGATCAAAC
CACTCAAGCG
GGAATAATGG
AGCCATCAAG
AGAACACCAA
ACAAGCACAA
AACAGAGTTA
AACCAACTTG
AAGCAGTTGG
AACAGCAACG
A'ITAACATTG
ACACTTAAAT
CCAAGGTGGT
GAAAAAGACT
ACTGTG.GCAG
CGTTCGGCAG
TTGAAAATCA
AAACTCTACA
CAAGCAGAAG
AAAAATAGTG
ACTACTACAG
TTCAGA.AAAG
ACAC CGCAAG
ATCAAACAAC
TGALAGCAATT
TCAAAGAAAA
GTGkAGCATT TAGACf-AAGC
GGGA-AAATA.A
GAGTTGATCA
CTAGCCAAPT
CAGCTGGAAA
AGCA.ACGCAA
GGTAA.AAGTG
CGAGAAACAC
GAAGCAkAGCG
AGAAGCAAAA
AAAGGAGCAA
AACTGGTACG
=IACCGAGC
ACAGGAGAAA AGAAA EF088-4 (SEQ ID NO:336) T PEVTKEALEK LIQEQKEHQE KDYTASSWKV
YSEALKQAQT
EKDYTASSW<
NGGNNGHLNT
VADQTTATQA
VYSEAILKQAQ
STGVDQTGTK
EVDQAETELR
TVADQTTATQ
QVKPSSQGGF
SAVKQLVKVP
AEVDQAE.AKL
RKASQFLPST
TKEVDKTNLL
RSAVK-RLTLK
GEKK
KIIKENEKHQ
NSGENKKEQK
S
S
S
S
S
S
**SS
S. S S
S*
EF089-1 (SEQ ID NO:337)
TGACAGATAC
TATAGGTCAA
AATGACAGAC
TGTATGTTAT
ACACCAACAA
GCGCCTGGTG
ACCATTGAAA
CAAAACGGGA
CCGAAAGAAA
CCTAAAGATr
GAAACAACGA
GTTGTGGCTA
G4GGGTTAAAC
CAAGCGGCCT
C'TrACCAAT
ATTTCTTTAA
GGTGTAAAAG
CTGTACAAAT
AATGAAAAAG
ATCA ITCTAG
GAACAACAAT
ACCTGCTAAC
AAA'rTT=TG
ATAGGAGAAT
LTGGCTGGAT
'TCCCGAAAA
CCAAACAAAC
ATACGGTGAA
TCAAACCTGA
TCATCTT(GCC
CTlrTATGG
CTTCTGCGGA
TATTCTTCA
CAGGCCAAGT
ATTTAAACCA
CCGATACTGA
AAGGGGAACG
ATGAA.AAGGG
GGGAATTTAC
ACGTAACCAT
CGCTGCTCT
CTGAGCAATA
ACAGGAAACT
GCTTATCTT
GAATATGAAC
TGGCGTGGAG
TCAAGTGGAT
CGTAGAAATT
CTCAGCGACA
CAAAACCTTA
GA.AGCATTCC
CGTGCATGGCT
TCAATCAAAA
GCAAAATGAG
CAACGCGCGA
Al'rACXITIA
GGATATGCAA
AI'TAACGCCA
CACCTATCAA
AAAAGAATT
TAAAGGAACC
ATTGATTITTC
AAGAACGACA
CGGTCTITG
AGATGGAAAG
GCGCACGCTT
AAATCAAAAA
CAGTTACGCA
ACAAATTTAA
CGTrAACT
CAAAAGACCT
GGCGGTATAA
GGGITA~cTA ACAAAGGTT1C
AACGTCATCA
ATCAACACTG
GTGGCGCcAA
GGAAAATATG
GTCAAAGGCG
ACTATTTC~TG
AATTGGTGGT
TCIGTATC
GCATACACGC
GTGCTATAA
TATATGCAAC
CTGAATIIAA
CCTACTFITGA
ATGATACAGA
AT-GGCGTAGT
TAAAAGATTA
TACCTIAAC
CACTCAAAGA
TTAATAATGA
A.ACCAGAITI
ATGTTrCT=
TTCAAAAGG
ACTCTAAC TT
TC'ITGAAATC
CCAATCGTGA
GGGACGTCGC
TGTATCT"ACT
GTAA.AAAGAA
AAGATCGGGA
TACAACAAAG
GGTAATCGCT
'TITTGCGGTC
CTrAAAAATG
TGAAGACATT
AGAATATGGC
TGTGGAAGCA
CATTACGATG-
GAAAAAGAAA
ATACTCCTAT
AAAATTACTG
ACAAAACCCA
AGGCGAAACG
TAGTTACCCA
AACGGCCTAT
AGAACGGTAC
TAAAGAA'ITA
GATTGCAIA
AAAAGAGGAA
EF089-2 (SEQ ID NO:338) 182 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
MNR WKVYATVIAC MLFGWIGVEA HASEFNFAVT IENTVNSATT NLNGVVEYGQ KDSFDGVMAG GITLKEXKI(E VKPGQVNARN VINVSLQNPQ SLKGERLTPG KYVLKSTAYG EKDVTIKGTN WWLYLLIALI
PTIPENQVDK
NG IKPCKTLR
T'ITSALDQSKG
AAYLNQLHLI
VKDEKGTYQV
ILALLLLIFF
SKTYFDLKMA
FNLKDYVE:AP
LAINNEYSYV
NTVSKGGIETL
KGANGEERYL
LYRKKKKEEE
PGAKQ'IVEIQ
KEIILPKHSQ
VAI ILQQNET
YQSDTEDMQV
YKWEFTK'tEFT
QQSEQ
LRNDTDED IT KTLPLT ITMP
KVQPDLKLLG
APNSNFSYP I
ISGDVAKELN
EF089-3 (SEQ ID NO:339) T CTGAXTAA TTTTGCGGTC ACACCAACAA TrCCCGAAAA GCGCCTGGTG CCAAACAAAC ACCATTGAAA ATACGGTGA.A CAAAACGGGA TCAAACCTGA CCGAAAGAAA TCATCTTGCC CCTAAAGATT CTTTTGATGG GAAACAACGA CTTTGCGGA GTTGTGGCTA WI'ATTCTTCA GGGGTTAAAC CAGGCCAAGT CAAGCGGCCT ATI=TAAACCA CTTTACCAAT. CCGATACTGA A TTT CTTTAA AAGGGGAACG
TCAAGTGGAT
CGTAGAAATT
CTCAGCGACA
CAAAACCTTA
GAAGCAT'CC
CGTGATGGCT
TCAATCAAAA
GCA.AAATGAG
CAACGCGCGA
ATTACATTTA
GGATATGCAA
AT
AAATCAAAAA
CAGTTACGCA
ACAAATTTAA
CGTT'ITAACT
CAAAAGACCT
GGCGGTATAA
GGGTTAGCTA
ACAAAGGTTC
AACGTCATCA
ATCAACACTG
GTGGCCCCAA
CCTACTTTGA
ATGATACAGA
ATGGCGTAGT
TA-AAAGATTA
TACCTTTAAC
CACTCAAAGA
TTAATAATGA
A.ACCAGATTT
ATGT=C=I
TTTCAAAAGG
ACTCTAACTT
C'TAAAAATG
TCAAGACAT
AGAATATGGC
TGTGGAAGCA
CATTACGATG
GAAAAAGAAA
ATACTCCTAT
AAAATTACTG
ACAAAACCCA
AGGCGAAACG
TAGTTACCCA
EF089-4 (SEQ ID NO:340) SEFNFAVT PTIPENQVDK SKTYFDLKiMA PGAKQTVEIQ LRNDTDEDIT IENTVNSATT NLNGVVEYGQ NGIKPDKTLR FNLKDYVEAP KEIILPK4SQ KDSFDGVMAG GITLKEKKKE TTTSADQSKG LAINNEYSYV VAIILQQNET VKPGQVNARN VINVSLQNPQ A.AYLNQLHLI NTVSKGGETL YQSDTEDMQV
SL.KGER
S S
S.
S.
S
S
S
S.
S S
S
55 *5
S
S.*S
5**S
S
S
S S
SS
KTLPLTITMP
KVQPDLKLLG
APNSNFSYPI
EF090-1 (SEQ ID NO:341)
TAGTCTCTAA
GAAATCCAAA
ATTATTACTT
AGTCACTITT
GCTGCAGAAG
ACAAAAGAAG
GACCATAC
GAAGCCAAAT
CGTCGAGGCA
CAGCGCTTAA
AATGCCAAAG
ACATTGACAA
AAAGTAGCCT
AAAGAAAAAC
CCAGTTAT
GGCTGGTCGA
GAACAAACGA
GCCAATAATT
AAAAAAATAG
GAAATAAACC
TCCTAATAC
C=TGGTTGG
TNCCAACCAC
AAAAAI'AC
AAAAAATTCA
ACAGTAGCAA
TAGCAACTTT
ATGGCACTT
ACCAGCAGAT
ATTTTTATAA
CGCTCAATAA
TTGATAAAAC
TAACTACTTG
TTACAGATGT
TTGATGGGGC
ATGCAATCAC
ACATTGAAAT
TCACCACAGA
TAAAATTATT
ACCTCCTCAA
ACTAGCTCTG
AAAAGTAAAT
GCAAGTTAAT
ACTTCCTAAA
GGTGAAGCTA
GAGTTTTCCA
TGAAATTGTE'
TATTGCGGAT
ACCCCCTGAA
CAAGTTAAAT
ACAAAI'CAA
GGTGACACAA
TCACGGCACG
CA.AAACGCAA
TG.CTCGTTG
TGATFI'AAA.A
TGTCATTACT
CATATA.AAGG ATCAACAA.AT GAAAAAAGAA AAATCAGGAA AAAA.TAATAA AAAGAAAGTA GTTGCTGGGG GCAGTTATGT TrA'TrrTcAA GGAC'F11CTG TAGGCTGGTT AAATGTAAAT CAA.ACCGAAG AAGTTGGGT TCAAACCGGGG AAATACCAAT TGGATCAKAA ATTTTTAAAA CCGTTAAACG AGGCATTCAA AAAAGAACTA CAGGCGAAAC CAAGCAAAAA TGCGAGTATC CCCGAAGAAC AAGGCACAGT AGTGGACACA GTTGAAGCGG GAAA-AGGCAA CTATCAATAT ATrACAAAAG AGGATCAAAC GTrAA-AGGCA AAJCAATTA CAGTTGATAT TAATGGTGAA AACGTGCTGA ATGATCATGG CACAATCAAC TTACAAACAA CATATGTC TGCTAATCAA ACACGTCGTI TTAAAAACAA CGGAAGTTAT GAACTACTAG TAAACGCGCT GAATAGCCA GTTGGTGATA CCAAAGAAAA TAGTAAAATT GATCAAAAAA TGTATITGTTT CAITCATGGC GGCAGATATA ACAAAGGAAC CGCAACAGTA 183 TABLE 1. Nucleotide and Amino Acid Seqeuences of Efadecalis Genes.
CCAGGATTCC
GATGG'ITCTC
GTTGTCACAC
GAAGCCTTTA
TCAAAAATCT
GATGATGCAC
ATACAATI
GATACAGTGT
AAATCGGGAT
AAACCGATC
TTGATGTATC
CAGGTGAATT
ATATCGGACA
GCCAGTAAAA
TCATGACTCC
TGGTACTAAT
CTATGACGGA
TGATAAACCT
ACCCATCGTGA
TATTCGATGC
GACCATA-AAT
GGCTGTATCA
ATGCCGGTAA
GTAGATTACG
ATTTAGA.AGG
CGTI'ATTAAG
TGGATAAGTA
ATACCCCAGG
TATTTATGG
GCGAAGAAGT
TCAAATGCTT
TCAAGGGGGC
TGGCGATAAA
AACAGAAGTT
ACATATCTAT
ATAA
EF090-2 (SEQ ID NO:342) MRNTRRQK SGKNNKKKVI ITSLVGLALV AGGSYVYFQS
HFXPTTKVNG
HL.HSSKVKLP
RLNQQ I ADV
VAFDKTQIQN
WS IDCAKTQE
KIVTTDVITG
VTQIGIHDSD
VSVGWLNVNA
LNEAFKKELE
EAGKGNYQYN
VLNODGTINK
LLVNALNSQE
RYNKGTATVP
HKLDKYGDKE
AEEKLAQVNQ
AKLATLSFPE
AKDFYKAPEI
EKLTTWVTQL
QTNAITAPLV
GFHTILYRTT
AFKTD-AGSNG
TEEVVVQT
GKPSKNAS IR
TKEDQTLKAT
ETTYGSANQP
GDTKENTSKIA
DVNLEGQMLD
CINTPGTEVS
KEEKIQLPKK
RGNGTFEIVP
LTTLNNkLNIK
VLFTDVHGTT
NNYIEIDLKO
GSRYSVPVKY
K IFOVSYDGM
YQLDQKFLKD
EEQGTVVDTQ
TITVD INGEK
RRFKNNGSYG
QKMYCFIDGK
WMPLLSQGGV
PVIIYGHIYD
DAPGEFDKPV DYGEEV EF090-3 (SEQ ID NO:343) CAC AAAAGTAAAT GGAGTTTCTG TAGGCTGGT' AAATGTAAAT
GCTGCAGAAG
ACAAAAGAAG
GACCATITAC
GAAGCCAAAT
CGTCGAGGCA
CAGCGCTTAA
AATrGCCAAAG
ACATTGACAA
AAAGTAGCCT
AAAGAAAAAC
CCAGT'FTTAT
GGCTGGTCGA
GAACAAACGA
GCCAATAATT
AAAAAAATAG
CCAGGATTCC
GATIGTTCTC
GTTGTCACAC
GAAGCCTTTA
TCAAAAATCT
GATGATGCAC
AAALAATTAC
AAAAAATTCA
ACAGTAGCAA
TAGCAACTTT
ATGGCACTTT
ACCAGCAGAT
fATAA
CGCTCAATAA
TTGATAAAAC
TAACTACI'G
TTACAGATGT
TTGATGGGGC
ATGCAATCAC
ACATTGAAAT
TCACCACAGA
ATACAATTT
GATACAGTGT
AA.ATCGGGAT
AAACCGATOC
TTGATGTATC
CAGGTGAATT
GCAAGTTAAT
ACTTCCTAAA
GGTGAAGCTA
GAGTTTTCCA
TGAAATTGTT
TATTGCGGAT
ACCCCCTGAA
CAAGTTAAAT
ACAAATTCAA
GGTGACACAA
TCACGGCACG
CAAAACGCAA
TGCTCCCTTG
TGATTTAAAA
TGTCATITACT
ATATCGGACA
GCCAGTAAAA
TCATGACTCC
TGGTAGTAAT
CTATGACGGA
TGATAAACCT
CAAACCGAAG
AAATAC CAAT
CCGTTA.AACG
GAGGGGAAAC
CCCGAAGAAC
GTTGAAGCGG
ATTACAAAAG
AAAACAAT'A
AACGTGCTGA
TTAGAAACAA
ACACGTCGTI'
GAACTACTAG
GTTGGTGATA
GATCAAAAAA
GOCAGATATA
ACCGATCTGA
TATTGGATGC
GACCATAAAT
GGCTGTATCA
ATGCCrGGTAA
GTAGATTACG
A.AGTTGTGGT TCAAACGGGG TGGATCAAA-A ATT'TTTAAAA AGGCATI!CAA AAAAGAACTA CAAGCAAAAA TGCGAGTATC ALAGGCACAGT AGTGGACACA GAAAAGGCAA CTATCAATAT AGGATtAAAC GTTAAAGGCA CAGTTGATAT TAATGGTGAA ATGATGATGG CACAATCAAC CATATGTTC TGCTAATCAA TTAAAAACAA CGGAAGTTAT TAAACGCGCT GAATAGCCAA CCAAAGAAAA TAGTAAAATr TGTATIGTTT CATTGATGGC ACAAAGGAAC CGCAACAGTA ATTTAGAAGG TCAAATCCTT CG'ITATTAAG TCAAGGGGGC TGGATAAGTA TGGCGATAAA ATACGCCAGG AACAGAAGT TTAP'FrATGC ACATATCTAT GCGAAGAAGT AT a **a a a
S.
EF090-4 (SEQ ID NO:344) TKVNG VSVGWLNVNA AEEKLAQVNQ TEEVVVQTGT KEEKIQrPKK( YQLDQKFLKD H LHS SKVKL P RLNQQI IADV
VAFDKTQIQN
WS IDGAKTQE K IVTTDVITG
VTQIGIHDSD
LNEAFKKELE
EAGKGNYQYN
VLNDDGT INK
LLVNALTNSQE
RYNKGTATVP
HKLDKYGDKE
AKLATLSFPE
AKDFYKAPE I
EKLTTWVTQL
QTNAITAPLV
GFHTILYRTT
AFKTDAGSNG
GKPSYXNASIR
TKED)QTLKA?
ETTYGSANQP
GDTKENSKIA
DVNLEGQmfLD C INTPGTEVS
RGNGTFEIVP
LT-TLN'NKLNX
VLFTDVHGT]T
NNYIEIDLKD
G~sRYSVPVKY
KIFDVSYDGM
EEQGTVVDTQ
TITVDINGEI(
RRFKNNGSYG
QKMYCFIDGX
WMPLL.SQGGV
PVIIYGHIYD
1 184 TABLE 1. Nucleotide and Amino Acid Seqeuences; of E. faecalis Genes.
DAPGEFDKPV DYGEEV EF091-1 (SEQ ID NO:345)
TAATTGGNGG
GGAACAGCAG
GATGATTTAT
GCTGTTCAAA
GATCAAGCCT
GCACAAGGTG
GATCAATTTA
GCCGAAGATG
GAAACTGrI'T
GCTGCAGAAG
GGGTAA
AGATTTITAT
CAGCCGI'GC
CAAATCAAAC
AAGGAACAGA
CTCXIT='GC
TTTCTGCCGA
AAAAAGCAGC
CAGCAGAAGA
CTGCTGCTCT
CAAAAGAAGA
GGCTAAAAAA
CGCATTATI'A
AGAMTC TA
ATTAACAGAA
AGTCTGTC
CATGCTTGAT
TGACGATGCT
TATCTATATT
CGATrCAAGCA
TGTTA.AAGAT
GGCGGATTTT
CT~1'GCACCAA
AAAAACAAAG
ATCGC:AAAAC
AAAGAAAAAA
AACTTAAAA
CAAGATCACG
&AE:GTTAAAG
ANAGAAACCA
GCAGCGAAAG
TCTTAGGNGC AGTAATTCT AATCAGGTAA AGAAFI'ACGT CCCAACATTA CACAGATTAT AAAAAGCCGG CGTTTTATCA CAAAAGATTC ATTGGATAAA AACAAACAGG TGATTTATCT CAGAAGATTT AGGTGAAATT ATTCTGCGGC AGCGGCCAAA CCAAAGATGT TCCTGAAAAA ACGTAAAAAA AGAATTTAAA EF091-2 (SEQ ID NO:346) MAKKG GFFLGAVIGG TAAAVAALLL APKSGKELRD OLSNQTDDLd( NKAQDYTDYA VQKGTELTEI AKQKAGVLSD QASDLACSVK EKTKDSLDKA QGVSCDMLDN FKKQTGDLSD QFKKAADDAQ DHAEDLGE IA EDAAEDIY ID VKDSAAAAKE TVSAGVDEAX ETTKDSIPEKA AE-AKE-DVKDA AXDVKKEFXG EF091-3 (SEQ ID NO:347) AT CAAATCAAAC AGATGATTTA AAAAACAAAG CGCAAGATTA CACAGATTAT 99 9 9 99 99 9 9.
9.
9 9 .9 99 9 0 9* 9. 9 09* 9 9 9.99 9 9.9.
99 9.99 9 .99.
9 9
S
09 9.
GCTGTTCAAA
GATCAAGCCT
GCACAAGGTG
GATCAATTTA
GCCGAAGATG
GAAACTG~T
GCTGCAGAAG
GGGTAA
AAGGAACAGA
CTGATTTGGC
TTTCTGGCGA
AAAAAGCAGC
CAGCAGAAGA
CTGCTGGTGT
CAAAAGAAGA
ATTAACAGAA
AGGTTCTGTC
CATGCTTGAT
TGACGATGCT
TATCTATATT
CCATGAAGCA
TGTI'AAAGAT
ATCGCAAAAC
AAAGAAAAAA
AACTTTAAAA
CAAGATCACG
GACGTTAAA.G
ANAGAAACCA
GCAGCr-AAAG
AAA.AAGCCGG
CAA.AAGATTIC
AACAAACAGG
CAGAAGATTT
ATTCTGCGGC
CCAAAGATGT
ACGTAAAAAA
CGTTTTATCA
ATTGGATAAA
TGATTTATCT
AGCGTGAAATT
AGCGGCCAAA
TCCTGAAAAA
AGAATTTAAA
EF091-4 (SEQ ID NO:348) SNQTDDLK NKAQDYTDYA VQKGTELTEI AKQKACVLSD QASOLAGSW±( QFKKAADDAQ DHAEDLGEIA EDAAEDIYID AEAXEDVKDA AKDVKKEFKG EKTKDSLDKA QGVSGDMLDN FKKQTGDLSD VKDSAAAAKE TVSAGVDEAX ETIKDVPEKA EF092-1 (SEQ ID NO:349)
TAAGGGGATG
AGCACGAGTT
GACATCTG
GTTTTACTTT
CCAAAAGGGA
AAAGTrAG AAGAAAAAkAT
TATTAGTAAC
CAGTATCGGC
TAGGTCCACA
TTCCT'ITAGA
ATCAAGCAAT
GGCAAAAAAA
AAAAATCCAA
TTCTGAAGCA
AGTTCGTTTC
TGTAATTAAC
CTCATTAATG
ACAATTATGT
AAAGCACCAG
GATACAAACT
ATGAAAGGGC
ATGGCACATT
GGATAA
TAGTTTGTTC
AAGATCGTGG
TG-GAAAATAA
AATTTGAACA
ATGGCATICAT
CGCAGGAATG
CATGGAAGCA
AGAGGTGAAT
AAAATTACAA
GAATGGCGAA
EF092-2 (SEQ ID NO:350) MAKKT IMLVCSAGMS TSLLVTKMQI( AAEDRGMEAD IFAVSASEAD TNLENKEVNV 185 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
LLLGPQVRFN K<GQFEQKLQP KGIPLDVINM ADYGKNIGEY \TLCQAISLMG EF092-3 (SEQ ID NO:351) AC AAGATCGTGG CATCAAGCA GACATCTT-TG CAGTATCGGC '-'CTC;AAGCA GT=PTAC'-TT TAGGTCCACA AGTTCGTTTC CCAAAAGGGA TTCC AGA TGTANITAAC AAAGTTTTAG ATCAAGCAAT CTCATTIAATG GATACAAACT TGGAAAATAA AC2AGCTGAAT ATGAAAGGGC AATTTAACA AAAATTACAA ATGGCAGATT ATGGCATGAT GAATGGCGAA
GGAT
EF092-4 (SEQ ID NO:352) EDRGMEAD IFAVSASE-AD TNLENXEVNV LLLGPQVRFN KGQFEQKLQP KGIPLDVINM ADYGMDGEK VLCQAISLMG EF093-1 (SEQ ID NO:353)
TAGTTT~TT
TTCACGGCCA
CGTCAG ITAA
TTTCA'IITAT
CTCGTATACG
GT'ITTAITAA
AAACGAAA
CCGATAAAGC;
TGGCCATTGT
CACAAAAATT
TTCTTCCACA
CGCAACTGAC
ACAAATCTGCG
CAATACTGGA
GAGAATTTTA
TTATITGATG
TACGCAAGCT
TGTTAAACAA
CCTCGATTAT
TCGAAAATAC
ATAG
ATGAGGCAAA
AGTTTTCTCG
ACCCAGGAAT
AATAGACGAA
ACATACAAAA
CAATATCAAG
AATA17CAGG
CCCTTCAGTT
ACTATGCAGG
AGTTAAAAAC
ATGAACAATT
AGAGAGTTTC
AAACITATTG
ACTAGAAGAA
GAAAAGTATC
AGAAGAAAGG
AAGAATAACT
TCATCAAAAA
EF093-2 (SEQ ID NO:354) M RQKYSGNLLF TAMAIVYLMS FLALQLLEER QLTQKFTQAT QEYYAGKSIF HLFLADVKQN RRKLKTEERL VYAQVTLDYT YKNEQLRITV LLNKSGRXYQ YQERVSHQKK
AETILE
EF093-3 (SEQ ID NO:355)
S.
S. 0
S.
OS
e S 0 Se 05 S C
S.
55 S S .50 0 05 S S 500 6 CCTTCAC'IT ACTAGAACAA CGTCAGTTAA CACAAAAA'rr TACGCAAGCT TTTCATI 2AT TTCTTGCAGA TGTTAXACAA CTCCTATACG CGCAAGTGAC CCTCGATTAT GTTTrTATFAA ACAAATCTGG TCGAAAATAC AAAGCGGAAA CAATACTGG
ACCCAGGAAT
AATAGACGAA
ACATACAAAA
CAATATCAAG
ACTATGCAGC GAAAAGTATC AGTTAAAAAC AGAAGAAAGG ATGAACAATT AAGAATAACT AGACAGTI'C TCATCAAAAA EF093-4 (SEQ ID NO:356) LQLLEER QLTQKFTQAT QEYYAGKSIF HLFLADVKQN RRKLKTEERL VYAQVTLDYT YKNEQLRITV LLNKSGRKYQ YQERVSHQKK
AETI
EF094-1 (SEQ ID N'3:3S7) *055 5 e.g.
S. S 5S 00
TAAACATTTG
ATTAATTATA
TTAAGTCAAA
CACCCTCAAG
=IAGCGATG
GCAAAACAAA
AGACATTCAG
TGAAAAAAAC
CAATTGGCGG
AGGTAACAAT
GCAGCACGCT
CAGTATTqTG AGGTGAATGT CTCT'TTrA AACATTAAA AATTGGTCGT AACGATTGGT CCTACGXITC TCATI'ATGAC GTAAGTAAAc CTCACAACGT 'ACTACGTCAT TATTGAACCA GGTGTTAGTA
TTACTCAAAA
TATTTGCGAC
CTTTTiCCCGA
TGTATIGAAGT
TATATGCAGA
TTCCAACACA
ACGAAAGGC
2TI'GGCTCTA
TGAAATTACT
TGACGGAACT
ATACAATGGT
AGTGACGCAC
186 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
GGTTATCAGA
TGGGAAAAGG
GAAGTGAACG
AAATCTATTG
CATAATACCA
TTAAATTTAT
ATTGGGAATC
AAAAAATCAG
ATGGCTGGTG
GG3TTC'PTTAA
TCCATTTAG
A=TCTATTT
CCAGATCCTT
ATTTCCATGA
GTAGAAACAG
ATTCGTAAAG
CGTGCGGAAA
GAAACTAAAT
TATGCCAATC
ACTGGGGAAA
GCTGAGTITA
TGGAGTGAAG
ACTTTGGCTT
TTCTGGCAAG
TCCATCAAAA
AAAGAACAAG
ACTGCCCAAA
GAAATCACAG
GGCTATGGTA
CCAGAAGGAT
GACCACTATG
ATTAAGATGG
AACCGITGA
TGGAAACACG
AGATATAACT
GAAGCCACAA
ACAACCCCAA
ACCGCTGAAC
GTAGCCTTTG
CAAACCCTCA
GATGGTTCGC
ACCCATGATG
CCAGATGGTA
AAAGAATTTA
ACTAAGTTTA
AATGAAGAT
AGTACGCCAA
ACAGTGAAGA
TTTATCTTGA
AAAACCCTTT
CTGCAACAGA
GTTATAAACT
AAGGAAAAAT
CTGTAAAAAC
CTGAGTTTGA
AATTAGTGCT
CTGGTACTGG
CGCTTGATAA
AGATCAAAAA
ATT'flGGGAA
TGGATGGAAT
ATATGATTGA
CTTCGAAAAA
GTACTGATCT
ACAGCCCAGC
CACCAAAAGA
CTAGTAATGG
AAACCGTGC
CCACTTTGAC
AAGGAGCTGA
CTTTTAAAAC
TAGATGAAAA
AAACCAAAGC
AAGTTGATAA
TATTCGCTT
CTGGArPITAA
GTGCTGAACA
AGTTTGAAAA
TTCAAAAGAT
CGAAGACTGA
TGACCGTI'CC
TGCTTTATGA
GAAATAAAAA
TGCATGAAAT
AAGCTGCCCA
CGTTr.CGAA AAGTrTTGrGA
AAGCAGGCAA
A'rrGTACGAT AAACT LTrAC
TACTGGATGG
CGAACAAAGA
CCAAAACCGT
CTTTITACGGA
TGAAAGAAAA
AACAACCGGA
CATTCCCGCA
Trr TCGAC
GCCATCAATG
TATTGATACA
CCATT-CATA
TAATAAGGCA
AATTTTAGGT
TAAAGTCGTC
TACTCCAAAC
AACTCCAGTC
GCCCAATACC
AATCGATAAA
AGCTTTACC-T
TCCCCATGGrT
TACCACACCC
TATGCGACAA
TTrGGAATCAC
TGGTGAAATT
GCTTGCTAAT
TTTCGTCAAT
TCTTGTTACC
AAAAGAAGAC
ATATACTCTC
AGAATTAGTG
GAACCAAGT
ACCTGAAGGA
TAACGAAAAA
TGGC=TGAT
CGACTTATCT
TAAAGCGACC
TCTTCCTTAT
TACACCACTA
GTATGTCITT
TTACGAGAAA
n=GCCCGAG
ATTCAATACC
CAAAGAAGAC
AGAAAAAGAC
CAAAGAGAAA
TAAAAGCATC
TGAG.CCAGtA
TGAACGCCAT
TCATGGTGAC
CTCAAAAGAA
AATTTGGAAA
ACTTGCGCAA
AATCAATTAc
ATCTCAAACC
AACACCAGCT
AACTGGGGAG
TGCTGGGTAT
TCTOATAAAG
AATATGGTTG
AAAAGATTAG
ATTGAGGAGT
CAATCGACAA
CAAkAATACGG TCTAAtTCCA
GCTTATAAAA
TAC'-CTAETA
GAATCAGGTG
TCAAAAGATG
kWAAGGTAA
ATGGCTGCCA
AAAGGTCAAA
A.ATI'ATTCTC
GTCCAAGA;A
GCITTTGGAAC
ACCI'CAAAC
AGTAACGTAA
AAAGATACCG
TTTACTGCAA
AAGGGA.ACGA
GCCGTTAAAC
TATACT=G
AATGCCGTAA
TTCTTTAAAT
TTTAAAGTGT
ACAGCTI'GTA
GGGGATTATT
GAAATCCGTT
ACCATTACCG
CTAACTAACA
AAAGAAGATA
CTTGAT"TTA
TCCTATGTCG
GAAAA.AGCCA
ACTGGAACTT
GTCTTGTTCA
GCGAAGGATG
GVF1'CCATCC
GTOATGGATA
GCrrCGAAA
TCTGGCAAAA
AAAGTAGATA
GAAAAAGATC
TTAACACCAA
GTTCCAAGTA
AAAAA.TTCCA
TATTTCTGGA
CGAAACTAGT
CACAAAAGAT
GT GCTGCTTC
ATCAAAAAAA
CT=AATAGA
CGAATATAGA
AATCAGGAAC
AAGCAGGACT
AAATTAATGT
ATATTGTACC
TGACAACAGA
CTATTACTGA
CCATTAAAGC
TTCTTTTAGA
TAGCTGGAAA
TAACAACQGA
TGGGAACCTA
CAACAAAAGT
AAGCGCAAAA
GTA-ATGAGAG
AAGATGGTCA
A.AGCTTCTGA
ACCTAGCAAT
ATGAAACGAA
TTACTCGAGA
TTCTGGATC
.CGCCATTGGA
ACGAGCAATT
TACTTAAGA
CTACATTTAA
AAGA.AGGACA
ACGAGIMrT
GTTGACTT'C
CCGAGCTAGT
TAGCTCAAGC
AACCGGTGGT
r-GAAAATTCT AT-TATGTr.TA
CTAGCTTGAA
AAACAAAAC
TGT=rATGA
CAATCTGTA
TTGAGCATGA
CCGGAAAGTA
GAA.ACGTGAA
AAGAAGTGCC
ATTCTCAAGA
ACGIT=-ACT
ATCGCCGCAA
ATCGGTTCTT
GATTGGGAA
AGTTGATATA
ACCAAGTI'TTT
TAAAAATGAA
TTACCGTrGTA
AT'ALACATTG
TCAA.ACTGTG
GGAAACTAAG
AGAAACGGTT
TAAAGATGGG
AAAATCGGTG
GGGCGAGACC
GAAGACTGGG
TACATTTGCC
TGAAAAAGGT
TTACGTGACA
CGAGTTAAAA
CCAAGAAATT
TCAAGGGAAA
AGCTGTTAAA
TrCAAACAGTG
TAACGAGTAT
GTATCCTGTA
TC-TTACGGCA
G'CCTGATGGC
AGGGACCAAN
ACGGI'rTMAT
AATAGAGGCT
GGAAAACAAA
AAAACAACCA
TGTTAGTCTG
TCTTGCGACT
TGATAAATTG
CATTGATCTG
GATTGCCGA.A
GCATAAATTA
TGAAAACAAG
CAATCAAGCA
CCACCTAGAA
TGTGTCGGTT
TGCTTTACTA
AGTGAATGAT
TCCAGAAGGA
TGGAAAACAC
AACCATACCG
ATCTAGTCCC
GTI'AGTTGGC
T1'AA EF094-2 (SEQ ID NO:358) MKXTTFKN WsLFATLALL~ SQTIGGTIGP TIAFADEITH PQEVTIHYDV SKLYEVDGTF SDGSTLSERT TSLYAEYDEA KQTVFCIEPG VSIPTEVTHG TABLE 1. Nucleotide and Amino Acid Seqeuences of Efaecalis Genes.
YQKNPLPSMS
SIEGKINKAI
GNQLVLTPNS
SLK IKIDKE
DPYMIDTTPM
RXDS PAGEIV
ANQTVALVTS
SEAFKTELVK
IKKVDNNEKN
ITGAEDKATT
DYAKSEYVFT
KDGNKKLNTL
TATLANKEKT
TVNCTIER-W
DGTNKEIWKS
EDLKEKSQTL
ILIFSTAGYY
DKAKLVSVLW
EEYQKKPSFH
NSKSGTLTLK
SGDIVPETVF
AATIKAGETI
QEITTDEKGR
NVKGQNQEIT
GTKASDEIVT
AVITRDVTAK
ACNEQLGFDG
ITEEGQKQP I
OFTELVDKLR
GTWKILHKLT
S IQTKAHLED
GKIEHEVNDK
TPKEVPTI PS
FWNRRN
EKAGTDIDTN
NT'IVKTI LGQ
KSAGTGTPVA
HLDFGKALPS
SMTSKNt4RQK
AETPKELANA
GETTLTKEDK
LALDEKNQVA
EQVIRFGFDF
YGKFENLPYG
KIIVTVPYEKL
YNLHEIKEJW
AEQVLDKSIV
GSQTFTHGDV
EFTKTVLAEI<
TPKQP-TTPAV
MVA QKMIWEE ST'LI DKDIEL
YKXAGLQTVM
KDV'ITDKDGI
GQ ILLEKTOV
LELGTYYVTE
DTGNESQGKA
VKRHLAINEYF
FKFAG-SADCT
DYLLEE IEAP
TNNEFSVSLN
YVVAQAI DVE LFNYVYENflV
MOMFDDVSVT
VDTGKYPEGT
PSNSQESSPT
VINGYXLHS 1K RLGGASVDIK NLSEFDKVVQ NTANIDYRVI AGALDKPNTY AIKI NVETXG SILDGIPHGT KVTITEXSVP ETGTDLWND)N YSLAGN'TFAI TrKS SNGFVNT FKPTKVELKY EFKGAEYTLF TAKDGQAVWW WQETXAPEGY TLDETKYPVS AETGFNDLSF KVSPLEGTXE EGFQKITPLE I RSTFKENKD RLMLYDLPEK EDSLTSLATW ATKAAQEI<DE KAXPVV IAET AFEAGNEPVA KDASLNNQAQ HD)VLDGSKEA FETILYALLP YXFTFTEINYE KDGNVNGKH-N VKTFPQTGEK NSNVLLLVGF' EF094-3 (SEQ ID NO:359) CGA TGAAATTACT CACCCTCAAG AGGTAACAAT TTTAGCGATG GCAGCACGCT GCAAACAAA CAGTATTTTG GGTTATCAGA AAAACCCTTT TGGGAAAAGG CTGGAACAGA GAACTCAACG GTTATAAACT AAATCTATTG AACGAAAAAT CATAATACCA CTGTAAAAAC TTAAATTTAT CTGAGTTTGA ATTGGGAATC AATTAGTCCT AAAAAATCAG CTGGTACTG ATGGCTGGTG CGCTTGATAA GGTTCTTTAA ACATCAAAAA TTCCATTTAG ATTTTGGGAA ATTTCTATTT TGGATGGAAT CCAGATCCIT ATATGATTGA ATTTCCATGA CTTCGAAAAA GTAGAAACAG GTACTGATCT ATTCGTAAAG ACAGCCCAGC CGTCCGGAAA CACCAAAACA GAAACTAAAT CTAGTAATGG TATG.CCAATC AAACCGTGGC
TCAITATGAC
CTCAGAACGT
TKPI'GAACCA
GCCATCA-ATG
TATTGATACA
CCATTCCATA
TAATAAGCCA
AA'TTAGGT
TAAAGTCGTC
TACTCCAA-AC
AACTCCAGTC
GCCCAATACC
AATCGATAAA
AGCTTTACCT
TCCCCATGGT
TACCACACCC
TATrGCGACAA
TTGGAATGAC
TGGTGAAATT
GCTTGCTA-AT
TTTCGT13AAT
TCTTGI'IACC
AAAACAAGAC
ATATACTCTC
AGAATTAGTG
GTAAGTAAAC
ACTACGTCAT
GGTGTTAGTA
TCTGATAAAG
AATATGGTTG
AAAAGATTAG
AT1=AGGCT
CAATCGACAA
CAAAATACGG
TCTAATTCCA
GCTTATAAAA
TACGCTATTA
GAATCAGC9.G
TCAAAAGATG
ACAAAGGTAA
ATGGCTGCCA
AAAGGTCAAA
AATTATTCTC
GTCCAAGAAA
GC'rTrGGAAC
ACCTTCAAAC
AGTAACGTAA
AAAGATACCG
TTTACTGCAA
AAGGGAACGA
TG-TATCAACT TGACGGAACT TATATGCAGA ATACALATGGT TTCCAACAGA AGTGACGCAC CGAAACTAGT ATCGGTTCTT CACAAAAGAT GATTTGGGAA GTGGTGCTTC AGTTGATATA ATCAAAAAAA ACCAAGTTTT CITTAATAGA TAAAATGAA CGAATATAGA TTACCGTGTA AATCAGGAAC ATTAACATTG AAGCAGGACT TCAAACTGTG AAATTAATGT GGAAACTAAG ATAF1'GTACC AGAAACGGTT TGACAACAGA TA.AAGATGGG CTATTACTGA AAAATCGGTG CCATTAAAGC GGGCGAGACC ?I'CTI'TAGA CAAGACTGGG TAGCTGGAAA TACATTTCC TAACAACGGA TGAAAAAGGT TGGGAACCTA TTACGTGACA CAACAA-AAGT CGAGTTAAAA AAGC-GCAAAA CCAAGAAATT GTAATIAGAG TCAAGGGAA-A AAGATGGTCA AGCTGTTAAA AAGCT1'CTGA TGAAACAG
ACTGGGGAAA
GCTGAG'FITA
TGGAGTGAAG
CCACTTTGAC
AAGGAGCTGA
CT'ITAAAAC
EF094-4 (SEQ ID NO:360)
DEITH
PQEVTIHYDV
YQKNPLPSMS
S IECKINKAI
GNQLVLTPNS
SKLYEVDGTF SDGSTLSERT DKAKLVSVLW EKAGTDIDTN EEYQKKPSFH NTI'VKTILGQ NSKSGTLTLK KSAGTGTPVA
TSLYAEYNGA
MVAQKMIWEE
STTLIDMTL
YK(AGLQTVM
KQTVFCIEPG VSIPTEV'rHG VNGYKLHS 1K RLCGASVDII( NLSEFDKVVQ NTANI DYRVI AGALDKPNTY" AIKINVETKG 188 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faeca (is Genes.
SLKIRKIDKE
DPYMI DTTPM
RKDSPAGEIV
ANQTVALVTS
SEAFKTELVK
SGDIVPETVF
AATIKAGETI
QEITTDEKGR
NVKGQNQEIT
GTKASDET
HLDFGKALPS
SMTSKNMRQK
AETPKELANA
GETTLTKEDK
KDVTTDKDGI
GQILLEKTGV
LELGTYYVTE
DTGNES QGXA S ILDGI.PHGT
ETGTDLWL\DN
TKSSNGFVNT
EFKGAEYTLF
KVTITEKSVP
YSLAGNTFAI
FKPTKVELKY
TAKDGQAVKW
EF095-1. (SEQ ID NO:361)
TAAGAATT
GAATTGATAG
GTT=PTGCAG
ACAGCAGAAA
ACACTGGCAT
ATTAGAGAAA
GGGCAGCCAG
CCAGATGGCA
AATI'TAAAAG
GTGCTTGCAG
TACAATAAGA
GGAALACCAAT
TATAGTCAGA
TTGCTACAACC
TTTCAGGGC
ACTCAAGGGA
GACTGACC
TCAAAAACGT
AAAAGCGTAA
TATAAGGAGA
CCACCATCTC
TTAGCTGGAA
GGTTGGTATC
TCCCCTGTTT
GCTGAAATGC
CAGATTATGC
AGTGGTTGGT
CCAAACAAAA
GAAGTGCCTA
CATCAAGTCT
GATACTGTCA
ATCAGCACAC
GGTTTAAAGC
AAAAAATCAC
GATCAACTAT
CAGTACAATC
GTTGCCAACG
TATCAATTTG
AAAGATCAAA
TGGATTGTTC TTTACAAAGA
TAACGGGCAT
AAACATTACC
AAGCCGAAAG
TGTCAAAAAG
GANI'rGAGAC
TAAACGCCAA
TAAATGTrGTC
AAGTGGTAAT
CGAGTAATCA
AACGGGAAAT
ATCCAACAAC
AAACAGGAGT
AAGTGAAGGT
CGGTTATTA
AACCAAT'CC
CTTACACC'Fr
ATCAATTTCA
CGCCCAGTTA
TACCTCAAAA
ATTTIATTCA
AAAAACTGCC
AAGATAAAAC
TTAATGAAAT
AAATAGAAGG
7rACAAATGT
CACCAGGTCT
TTGTTCCTAT
TTGGTCAGAC
TACAAGCGGC
GAATCCAACC
CAACTT'ITGA
AGGCAGCAGA
AACCCAATTG
CATCAATGAC
AACCAACGGA
GTGTAGCTAG
ATTTITCTTT
CTTATCAAGC
CTGCCATATA
AAGTACAAAA
TGAACAACCA
TGAGTTAATC
GCCTAACCTA
TGAGATCT
GGAAGGTGAA
TCCAAGTGAG
AACAT=rT-I
TGATCCCAAT
AATTTCGCAA
AACGTTTAAC
TCATTCATCG
TCATGTTACC
TCCACCACCG
TAAACAGAAA
AGGATGGTAT
TGATATTACC
AAATTATACA
GGATCACCAA
ACAACAATAC
NAAACAAGAG
GAACGCTATI'
ACTAGTCAAA
GGGAGAAGTA
AGCTCGGCCA
TACTGATGAA
AGCAACTATT
TGTTOAAATC
TAAAAATCAA
TITTTGGCAAA
'ITA'ITATCAA
GGCACTAACT
AAAG ITAUTG
AACTAAACTT
CCATA=TGTT
TGATCAAATC
AATGGTGACT
Cj .AGTAAGAG
CAGACAAAGG
GATAATrGAGr, ACTTATCG~r
CTACAGTATC
AGTCAACCAG
AAAGTAGCCG
TTACCAAGAT
TATCCGCTC
TTTGAATTGG
ATTAGCGAGA
AATCAATCTC;
AATCGCAAAG
GGGTTTAGAC
GATCTTTTGC
AAAGGGAAAA
TATGACGACA
TTTG;AGGAIM
CAACCAATAA
AGCGTTAACG
AGCTTAGAAA
ACAGCAC=G
GTCATGCCAA
CCGTTAAAAA
ATCCAAGTCA
AAL'IGGCGAG
ATGATGACAA
AATGGAAATT
GAAATTGTGG
GTCGCCAM1
AATGCTCAGC
GCAGAACTAT
TTAGGAACAA
GCTCTTGGCA
GCCAATGGTC
AAATI'AGAAA
TGGAATTAG
AAGGAACCAA TCATTCATTA ATAA-ACTACA TGATGAAGAA CTAATGTTAC AAGTCAAACG ATGGATTAT TAATGAAGAG ATAGTTGGCA AGGCAATTCC TGACAGCATC TACAGTGGCT TCTAITCCGA CATGTCAACG ATTATACTTC TTAAGCTTA CAACTATTTC CGACGCATCA AAAAAATGTC TGCACAACAA GTCAAAAACT AATCG'ITCCT GCCTA'ITGAA TTACTTTAAA TGACAGAACA 'I=TGTGGAT AAGGAAAGCA AACACTTATT CAAGTAGCTA TGAAAWI'GAC CGAAACCTGA AAATTTAGAA ATGATGATTT AACTGTTGTC TCAATGGTGT TGAAATTGCA CTACGGATGG CTTTCGCTAT GTAAAACTTA TTTATATCAA AAACGAAGCG ACCCATAAAC ATAAGCAAAT AACTGCAAAA GTGGTTATAT ACAAATITG AAATAAACTT AAAGCCAGCA CCATTCGTGT TGGATCTGAA TTGGCArTAC TTTAAATACG CAATTCTAC AGGTGAACCA TTTCTCCTGT TCACGCACCT CACCAGATGA GGAAGGTTI' CTAGCAACAC GCAGCAACAT AAAATCCATA =~ACGTTTG CCCCC 1TTCA AGGAAGAGTG CCAATG=ITC AGGTTTTATT AAACAACCGC TATTCAATI'A AGTITGrACGA AAGTGATGTT TTCCAGCAAA TCAAGGTAGA TGACAGGCCC ATAA AGGGACA.ATA TGAAGCGAAG TAAATGGAAA ?I'AGTATTCC CCATACTA-AT ACACACAACT 0 EF095-2 (SEQ 10 NO~:362) MKRSKWKE LIVTGICHIL VFPILIQTTV FAETLPSTKQ VR.EGTN14SLT AEKAESEQPQ TKDKLHDEET LALSKSELID N~EANv'rSQTI RERIETPNLT YRYGFINEEG QPVNANEILL QYHSWQGNSP DCINVWECGES QPVTAS TVAN LKEVVIPSEK VAVYSDMSrTV 189 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.f[aecalis Genes.
LAASNQTFFL
SQKTCVTFNI
QGKPIPPPPG
SVTPSYCITY
AGKKLPQQYS
EMQIEGLVXV
NKIVP ITDEN
TVRIQPKNQE
KSQPNWALTA
ANGVASHIVA
PRYYTSLSLY
SESQKLIVPL
FRQGKQTILIE
DDNDDLTVVY
VNGKTYLYQG
MPSGYIQIWQ
WRVGI TLNTv- IVAPDEEGF I ELS PFEGRVD NGQF[7ESDVY NKXGEI DPNY
YNQVKVDSSN
RDPYTFKQKD
KEI PQKNYTF
WYQDKTKQES
IMLTNVGEVP
VPIGQTATIM
STPTFDFGKV
QLSSMTKLLL
QFDFSFDQIK
PLPTISDASG
QSGLLNYPI(P
LLPSSYEIDS
EDVNGVE lAP LEKTKRP INS
LKKINLKPAS
MTTIATGEPD
AISSNTQQHG
G;TTNVSG;FIQ
LE I PAQGRK
NQYPTTISQF
SC? VYYHVTN
KTYQFQGWYK
PSDFIQCHQQ
PVF-NEL'NAIT
GtAJSPGLARP I
QVLQAAVEI~
LKQA.ADYYEN
YNQPTETKVA
DQTYQAMVTW
ELEKMdSAQQY
RKVTEHFVDT
GKTKPENLEK
P ITI'DGFRYL
AVYKEITAKA
QVTIRVGSEP
GNFSAVHAAD
GQENPYLRLK
LGKTTAIQLV
NLVTGP
EF095-3 (SEQ ID NO:363) *5
S
AAGTACAAA.A
ACAGCAGAAA
ACACTGGCAT
ATTAGAGAAA
GGGCAGCCAG
CCAGATGGCA
AA IIAAAAG
GTGCTTGCAG
TACAATAAGA
GGAAACCAAT
TATAGTCAGA
TTGTACAACC
TTTTCAGGGC
ACTCAAGGGA
GAGCGTGACC
TCAAAAACGT
AAAAGCGTAA
TATAAGGAGA
CCACCATCTG
ITAGCTGGAA
GCTTGGTATC
TCCCCTGTTT
GCTGAAATGC
CAGATTATGC
AGTGGTTGGT
CCAAACAAAA
GAAGTGCCTA
GATCAAGTGT
GATACTGTCA
ATCAGCACAC
GGITI'AAAGC
AAAAAATCAC
GATCAACTAT
CAGTACAATC
GTTGCCAACG
TATCAATTTG
AAAGATCAAA
CAAGTAAGAG
AAGCCGAAAG
TTCAAAAAG
GAATTGAGAC
TAAACGCCAA
TAAATGTGTG
AAGTGGTAAT
CGAGTAATCA
AAGGGGAAAT
ATCCAACAAC
AAACAGGAGT
AAGTGAAGGT
CGG TTA
AACCAATCCC
CTTACACCTT
ATCAATTTCA
CGCCCAGTTA
TACCTCAA;A
ATTTItTTCA
AAAAACTGCC
AAGATAAAAC
TTAATGAAAT
AAATAGAAGG
TTACAAATGT
CACCAGGTCT
TTGTTCCTAT
TI'GGTCAGAC
TACAAGCGGC
GAATCCAACC
CAACTTTTGA
AGGCAGCAGA
AACCCAATTG
CATCAATGAC
AACCAACGGA
GTGTAGCTAG
ATTTTTCTT
CTTATCAAGC
AAGGAACCAA
TGAACAACCA
TGAGTTAATC
GCCTAACCTA
TGAGATCCTT
GGAAGGTGAA
TCCAAGTGAG
AACATTTTT
TGATCCCAAT
AATTCGCAA
AACGTTTAAC
TGATTCATCG
TCATGTrACC
TCCACCACCG
TAAACAGAAA
AGGATGGTAT
TGATATTACC
AAATTATACA
GGATCACCAA
ACAACAATAC
NAAACAAGAG
GAACGCTATT
ACTAGTCAAA
GGGAGAAGTA
AGCTCGGCCA
TACTGATGAA
AGCAACTATT
TGTTGAAATG
TAAAAATCAA
TTTTGGCAAA
TTATTATGAA
GGCACTAACT
AAAG ITATTG
A.ACTAAACTT
CCATATTGTT
TIGATCAAATC
AATGGTGACT
TCATrCATI'A 0 CAGACAAAGG ATAAACTACA TGATGAAGAA G.ATAATGAGG CTAATrGTTAC AAGTCAAACG ACTTATCGTT ATGGArI'TAT TAATGAAGAG CTACAGTATC ATAGTTGGCA AGGCAAITCC AGTCA.ACCAG TGACAGCATC TACAGTGGCT AAAGTAGCCG TCTATTCCGA CATGTCAACG TTACCAAGAT ATTATACTTC TFrAAGCTTA TATCCGCTGC CAACTATTTC CGACGCATCA TTTGAA=IG AAAAAATGTC TGCACAkACAA ATTAGCGAGA GTCAAAAACT AATCGTTCCT A.ATCAATCTG GGCTAITGAA 'ITACTTTAALA AATCGCAAAG TGACAGAACA TTTTGTGG.AT GGGTTTAGAC AAGGAAAGCA AACACTTATT GATC'I"'TTC CAAGTACCTA TGAAATTGAC AAAGGGAAAA CGAAACCTGA AAATTTAGAA TATGACGACA ATG3ATGATTr AACTGTTGTC TTTGAGCATG TCAATGGTGT TGAAATTGCA CAACCAATAA CTACGGATGG CTTTCGCTAT AGCG TTAACG GTAAAACTTA TTTATATCAA AGCTTAGAAA AAACGAACG ACCCATAAAC ACAGCAGTC'r ATAAGGAAAT AACTGCAAA.A GTCATGCCAA GTGTATAT ACAAATTTGG CCG ITAAAAA AAATAAACTT AAAGCCAGCA ATCCAAGTCA CGATTCGTG-T TGGATCTGAA A.ATTGGCGAG TTGCATTAC T=AAATACG ATGATGACAA CAATTGCTAC AGGTGAACCA AATGGAAATT TTTCTGCTGT TCACGCAGCT GAAATTGTGG CACCAGATGA GGAAGGTTTT GTCGCCATTT CTAGCAACAC GCAGCAACAT AATG--TCAGG AAAATCCATA TTrACGTrC GCAGAACTAT CCCCCTTTGA AGGAAGAGTG TTAGGAACAA CCAATGTTTC AGGTTMATT GCTCTTGCCA AAACAACCGC TAT~CAATTA GCCAATGGTC AGT=GACGA AAGTGATGTT AAATTAGAAA TTCCAGCAAA TCAAGGTAGA TGGAATTTAG TGACAGGCCC A
S.
EF095-4 (SEQ ID NO:364) STKQ VREGTNHSLT 190 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. [aecalis Genes.
AEKAESEQPQ
QPVNANEILL
LAASNQTFFL
SQKTGVTFNI
QGKPIPPPPG
SVTPSYDITY
AGKKLPQQYS
EMQ EGLVKV
NKIVPITDEN
TVRIQPKNQE
KSQPNWALTA
ANGVASH IVA
TKDKLHDEET
QYHSWQGNSP
PRYYTSLSLY
SESQKLIVPL
FRQCKQTLIE
DDNDDLTVVY
VNGKTYLYQG
MPSGYIQIWQ
WRVGITLNTE
IVAPDEEGFI
ELSPFECRVD
NGQFDESDVY
LALSKSELID
DC INVWECES
N~KXGEIDPNY
YNQVIKVDSSN
RDPYTFKQKD
KEIPQKNYTF
WYQDKTKQES
IMLTNVGEVP
VP ICQTATIM
STPTFDFGKV
QLSSMTKLLL
QFDFSFDQIK
NEANVTSQTI
QPVTASTVAN
PLPTISDASG
QSGLLNYFKF
LLPSSYEIDS
EDVNGVEIAP
LEKTKRP INS LKK INLKPAS
MTTIATGEPD
AISSNTQQHG
GTTNVSGFIQ
L&I PANQCRK
RERIETPNLT
LKEVVIPSER
NQYPITISQF
SGPVYYHVTN
KTYQFQGWYK
PSDFIQDHQQ
PVFNMAIT
GWSPGLARPI
QVLQAAVEMN
LKQA.ADYYEN
YNQPTEzTKVA
DQTYQAMIVTW
YRYGFINEEG
VA~VYSDMSTV
ELEKMSAQQY
RKVTEHFVDT
GKTKPENLEK
P ITTDGFRYL
AVYKEITAKA
Q VT IR VGSE P
GNFSAVHAAD
GQENPYLRLK
LGKTTAIQLV
ULVTGP
EF096-1 (SEQ ID NO:365)
TGAGGTGCC
ATTGTCATCT
GATACCTTGA
GAAAAAGAAA
ACAAAAATTC
CAAAAAATGA
GAAAACATTG
CCGTACTTCT
ATCCAACATT
GGCGCTCGCCG
AACGACCAAC
CCAATTGTTG
ACTTTCTCTG
CCCAGTGAAG
AAAGAGGGTG
CAGAATATA
GTTGCCGAAG
CA.AGACTTAG
TATCGGAAAT
AAGTTAAAAT
TGTTTITTGG
CCATTTACAA
CAGGCTATAA
AGCAAGGTG
TGAA.AGCGAA
ATG.CACGCTr
GGGGCACGTT
GGGATGATTT
AAGTG ITAGG
AATI'ACGTCA
CCGATCAAAT
CTGAACCTC
GCTCTAATCT
CCTATGCATT
TTGTI'ATTC
ATAAACAATT
GTCAAGAATA
AA
GAAAAAXI'TA
CGTGCGCCAA
TTGGGGGGAC
AGTCAATTAC
CACAGCCTAT
GATGCTTTTA
TI'TAGATCAA
GGGGATTAT
ATGGCGCCCG
ATTATCTIr
GGCTACCGAT
CAAA.ATGTAC
TCAAATCCTA
CTGGTT'GAT
TATGAACTI'
CACACCAAAT
TTATCCAGAT
CTTAGGAAT'
CAGTCACTTT
TTGGAGAAAG
TATATAGATC
GAAACCTTTG
GATATTGCCA
CCACTTGATC
TCCTTTGATC
TATAATGATA
GAATTAAAAA
AACAGTTTAG
AAGTTAAACC
ATGGCTAATG
GAAAACAATG
AACATTGTr.A
ATG'ITACGAC
AAAGAAGCTA
GATGAALACTA
TATALACGATC
TTAT7GGA-AT
CAAGTGGCAT
CGGCCTTGAT
AITCTAATGA
TTCCTTCTGA
ACAGCAAAT
CCAAAAATAA
AATTTATTGA
ATAATGTCAT
GCTATTCGTT
GAT'TAACGAA
AAGAAAGTGC
AACATCTACA
TGCCTAAGAC
CAGAAAATGC
AAAAACTATT
TCA.AACATTT
TGTTC 'rGGA
TATCGCTATT
GGCAGGAGCA
TAAGAAATTT
AGCTATGTAT
ATATATGATT
AAAAGGCT'A
CTTTTCCGTT
CGGCCGTCAG
GCTGATTGAT
AAACACTAA
CAATGTCAAA
AGTTGCTGTA
ITATGTGATI'
AGCCAAAAAT
GGCACAAAAT
ACCAAAAGAA
AGAATT1TAC
GITAAGATG
0.
o 0* *0 0 V EF096-2 (SEQ ID NO:366) MKKLQ SLFIGIIAII VILFFGVRQL EKASGMAGAD TLTIYNWGCY IDPALIKKFE KETGYKVNYE TFDSNEAMYT KIQQGCTAYD IAIPSEYNIQ KMMKAKMLLP LDHSKLKCLE NIDARFLDQS FDPKNKFSVP YFWCTLGIIY NDKFIDGRQI QHWIDDLW.RPE LKNNVMLIDG AREVLGL.SLN SLCYSLNSKN DQQLRQATDK LNRLTNNVKA IVADEIKMYM ANEESAVAVT FSGEAAEMLE NNEHLHYVIP SEGSNLWFDN IVMPXTAKNK EGAYAFMNFM LRPENAAQNA EYICYSTPNK EAKXLLPKEV AEDKQFYPDD ETIKHLEVQ DLGQE-YLGIY NDLFLEFKMY
RK
EF096-3 (SEQ ID NO:367) AAGTGGCAT GGCAGGAGCA CATACCTTGA CCATTTACAA GAAAAAGA.AA CAGGCTATAA ACAAAAATTC AGCAAGGTGG CAAAAAATGA TGAAAGCGAA GAAAACAT1TG ATGCACGCT CCGTACTTCT GGGGCACGT
TT'GGGGGGAC
ACTCAATTAC
CACAGCCTAT
GATGCTTTTA
TTT1AGATCAA
GGGGATTATT
TATATAGATC
GAAACCTIVC
GATATTOCCA
CCACTTGATC
TCCTTTGATC
TATAATGATA
CGGCC=IAT
ATrCTAATGA
TTCCTTCTGA
ACAGCAAATT
CCA.AAAkATA).
AATTTATTGA
TAAGAAATTT
AGCTATGTAT
ATATATGATT
AAAAGGCTTA
GTTTTCCGTT
CCCCCTCAG
191 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. aecalis Genes.
ATCCAACATT
GGCGCTCGCG
AACGACCAAC
GCAATTGTTG
ACITCTCTG
CCCAGTGAAG
AAAGAGGGTG
GCAGAATATA
GTTGCCGAAG
CAAGACTTAG
TATCGGAAA
GGGATGA=T
AAGTG'ITAGG
AATTACGTCA
CCGATGAAAT
GTGAAGCTGC
GCTCTAATCT
CCTATGCATT
TTGGTTATTC
ATAAACAATT
GTCAAGAATA
ATGGCGCCCG
ATTATCTTTG
GGCTACCGAT
CAAAATGTAC
TGAAATGCTA
CTGG=1FGAT TATGAAC'Tr
CACACCAA.AT
TTATCCAGAT
CTTAGGAATT
GAATTAAAAA
AACAGTTTAG
AAGTTAA.AcC
ATGGCTAATG
GAAAACAATG
AACATTGTGA
ATGTTACGAC
A-AAGAAGCTA
GAMA-AACTA
TATAACGATC
ATAATGTCAT
GCTATTCCT
GATTAACGAA
AAGAAAGTGC
AACATCTACA
TGCCTAAGAC
CAGAAAATC
A-AAAACTATT
TCAAACATTT
TGTTCTTGGA
GCTGATTGAT
AAACAGTAAA
CAATGTCA.AA
AGTTGCTGTA
TTATGTGTr
AGCCAAAAAT
GGCACAAAAT
ACCAAAAGAA
AGAAGTTTAC
GTTTAAGATG
EF096-4 (SEQ ID NO:3 68) SGMAGAD TLTIYNWGDY IDPALIKKFE KETGYKVNYE TFDSNEAKYT KIQQGGTAYD NIDARFLDQS FDPKNKFSVP YFWGTLGIIY AREVLGLSLN SLGYSLNSKN DQQLRQATDK FSGEAAE-MLE NNEHLHYVIP SEGSNLWFDN EYIGYSTPNX EAKKLLPKEV AEDKQFYPDD
RK
IAIPS EYMIQ NDK I DGRQI
LNRLTNNV*XA
IVMPKTAKUK
ETIKHLEVYQ
KPMUALLP
QHWD)DLWRPE
IVADEIKMYM
EGAYAFMNFM
D)LGQEYLG IY LDH SKLKGLE LKNNVMLI DG
ANEESAVAVT
LRPENAAQNA
NDLFLEFKMY
EF097-1 (SEQ ID NO:369) a a
TAGAAGTATT
ATGCATTCGC
GGTCATCGTT
ITGTCTTITGG
ACGATAATGA
GAGGAACAAC
GATGTTCCAC
AAAATTGAAC
ITTFTAGCAG
GCTGTTGAA.A
CTTTTACCAT
ATTAACCATG
A~TTTTC
CTGTTTGGGC
GGGGGCATTC
GTAATTGCTG
GCTCCAGCTT
CTGGCGGTTI'
LTATTAAAAC
CAAGTGGAAA
GGCTCAAGTG
ATGCCTGTGA
ArI'CAAGCAG G 1TCAAAATT
TCTTCTCAAG
ATACAGAAGC
GAATTATITGC
CTGGAA.ACAG
GCCTATCATT
AAAGAGTATC
CTAATTATCT
TCTTTTTTAA
TGAGTGGGAT
TGGCTGGCTA
TTCGAGTTGT
GTGGCGGCGT
AGTTGTTGG
AAATTCTCTT
CAATTGTGGG
GCGCTAGTT
TGGTTCACGT
GCTTATTAAC
TATTGGAAAC
CTIGTAGGACA
ATGAAA'PrTA
GAGGAATGAG
CGCCAGGITC
CTCAACGAG
CAGTCACACC
CCATGGGGGC
CTI'ACCAGTC
AATTGAAACA
TTTTAGAA.AT
AGCAATCTTC
TTGTTTTTTT
GGCAACAAC
TCTT'TAC
TAGATCTAAC
AAGAATGGCT
ACATAGAGAG
ACATAAGTTT
GATTATCCCA
TACCACTGGG
TTTACCGAI'
TGTTGCTGCT
TGCTATGTTT
ACCGAAAGTT
AGGACTGCTG
?I'GGCTGTAT
TI'CTTIAGAG
GCCTCTAGGT
AAACCCTGGA
ACGAAAAACA
I-rI-CCG=T
TGGTACGCTT
ATTGGTrGCG 7TI'rGTTAGC
AATTGAACCA
TAACTATCAG
TAGTTTGCTA
CGTTCATCAG
GTITAGCACAA
TAAATCCTAT
ACTTGG'-ITCA
ATATGCCGAG
GGCGAAACAA
CAAGGAGACA
GCAACAAAAT
GGAAGGAGGA
CGAGGGACAA
GTGAAAGTAA
AATTGAGTA
AATCTACGGC
CTAATTrGGTI'
ATTGCGACAG
ATTGGCCCTT
AAAGAAGGCT
TG.^TGT~rTG CAArTTTr
CCCTTAAAAG
TTAGAAGGTG
CCACGCGTOG
GCAGGAGGTG
GTTTTGATG
GTTTTTCAAA
ATTTTAGCCA
TTTCTGTGCT
GTTTCAATGA
CAAATTFrAT
AGCCGACAAT
ATGAAGTGC
AAGTACGTCC
TACCCGCAAG
GAGTCTACTG
AATTCGAG
GGAGTCGCCA
ACCTACGTAG
TTATACGTAG
GCTCATAGAT
GGAATATGAA
CTCCCTATTT
TTTTrTATTGC
TAGCTCTTTC
TTACAGGCGG
TGGGCGTGAT
TAGCAGGATA
ACGAGATGCT
GTATTCTGGT
CTTGGTTAAT
TGTTATTTT
CTAGTCAA-AC
GCGTTI"'IGGT
CCACCATGAT
ACCCGCGCIT
TATTTAATGT
ATGCCCCGAC
C ITTTGCAAT
TAAAGATGAA
TTGTTTGTGA
TAA.AAGCTGT
AGCCTAAGAC
CAGAAAAGGA
1I'TAGCCAA
AAACGAACTC
GATCGCAAAC
TTGAAGTATC
TGACTCGTGA
TTACTAGTTT
GTTTrrAA
GGAAAAAGAA
ACGTCGITPI
GTGGAGCTTA
TGAAGTCGAA
AAAAATGTTC
TGT'ITCCACA
TACI'TCGCC
GACTAAAAAC
TGTAGCTCCG
TGAAGCCAAT
TAATAATGCG
AGGTCAGTCC
TGCTTCTG
TCAACTGATT
ATTTTAGCA
GGGTC'rAAGT
TG-ATGCGAGG
AGCAAGCTTG
GGAGGAAGAC
Tw-CAGGAATG
GAACTTGGAG
ATTAGTGGTC
TATGGTGAGT
ACTGACTG-CT
GACAAAACAA
AATGGGAATG
TAAACAGCCA
ACTGGCGCAA
TTTGAATAAG
a.
192 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
EF097-2 (SEQ ID NQ:370) MLTKNF LAGIVGGLLC CFGILVVAPA
VESASFWLYQ
LFLLETNPGP
IACGMSGTLV
LKRQRGIEPV
PVTYQSVHQM
SQEQSSLGSE
ETVFFTKETT
FSSWL IEANL
GVGVLJVAFLL
FQ IFNVGLSA
SMIKMCEEDQ
KWQPKTLVVI
STETNSTKQI
YVVTRELAQA
LPLVHVFLEP
FGPVGQRK(TA
PASPGSLVAI
VEWVPNYQQ
QAELKQLAQK
QKLVFLYAEN
YHLDLTQQN'L
LKVLFFNNAI
GGATMIQLIG
LANAPTDARL
ILFVCDAGMG
YVPEKDMVSV
VRCSQTMGME
YVVTSFLNKK
N}IGLLTPLC'L
G IHE IYFPFV
AVFSCIFVSF
S SANGASLLS
QNFLEIKSYY
LLRQQAAKQG
EYQEWLEGGA
EGASQTGQS I
LMDPRLFLAV
LCSFAIASLL
RQLKAXTNLEM
PQVLAXIJTAS
VAIEVSKEPL
DRCF
EF097-3 (SEQ ID NO:371) ACGAGG AATTGAACCA GTTTCAATGA TAAAGATGAA GGAGGAAGAC
CAAGTGGAAA
GGCTCAAGTG
ATGCCTGTGA
ATTCAAGCAG
CTTCAAAATT
TCTTCTCAAG
ATACAGAAGC
GAATTATTGC
CTGGA-AACAG
GCCTATCATT
AAAGAGTATC
CAGTCACACC
CCATGGGGC
CTTACCAGTC
AATTGAAACA
TTrAGAAAT
AGCAATCTTC
TTGTTTTTTT
CGCAACAAGC
TCTTTTTTAC
TAGATCTAAC
AAGAATCCCT
TAACTATCAG
TAGTTTCCTA
CGTTCATCAG
GTTAGCACAA
TAAATCCTAT
ACTTGGTTCA
ATATGCCGAG
CGCGAA-ACAA
CAAGGAGACA
GCAACAAAAT
GGAAGGAGGA
CAAATTT'AT
AGCCGACAAT
ATGAAGTGGC
AAGTACGTCC
TACCCGCAAG
GAGTCTACTG
AATGTTCGAG
GGAGTCGCGA
ACCTACGTAG
TTATACGTAG
GCTGATAGAT
TTG3TTGTr.A
TAAAAGCTT
AGCCTA-AGAC
CAGAAAAGGA
ITTAGCCAA
AAACGAACTC
GATCGCAAAC
TTAAGTATC
TGACTCGTGA
TTACTAGITT
GTITT
TGCAGGAATG
GAACTTCGAG
ATTAGTGGTC
TATGGTGAGT
ACTGACTGCT
GACA.AAACAA
A.ATGGGAATG
TAA.AGAGCCA
ACTr.GCGCAA
TTTGAATAAG
q.
EF'097-4 (SEQ ID NO:372) RGIEPV SMIKMKlEEDQ VETVTPNYQQ ILFVCDAGMG SSAMGASLLS RQLKAVNLEM
PVTYQSVHQM
SQEQSSLGSE
ETVFFTKETT
KWQPKTLVVI
STETNSTKQ I
YVVTRELAQA
QAELKQLAQK
QKLVFLYAEN
YHLDLTQQNL
YVPEKDMVSV
VRGSQTMGME
YVVTSFLNKK
QNFLEIKSYY
LLRQQAAKQG
EYQEWLEGGA
PQVLAKLTAS
VAIEVSKEPL
DRCF
EF098-1 (SEQ ID NO:373) p p
TAAATGAAAA
CTTGTACCAA
CAATI'TGCAC
AAACCAATTA
CCCTTATCCA
AATATGACCT
GGCCCAAACT
GTAAAACAAAi
ACTI'AACCA
GCTACGATTG
GAAGGTGCGG
ATTTCATAG
ACTTGGACTT
AGACAAAAGT
TGAGTGCATI'
CAAATACGAA
CACCAG273A
TTIGACTATGC
ACTATGCAGA
ITGTTCAAGT
ATGGTCAAT
ACGGACGCGT
AATTrAAACCC GTACGTAC T
AAGTACCTGG
TGACAGATAC
ATACATTG
AGCAGTCGAC
CCCAACGAAT
TCCAACTGAT
ATCTAGCTITA
AACACAAAAA
CTCAGATAAT
CAAAACTGAA
GCTTTCAGCT
AACTCGCCT
AATGCTGG
TTCAACCACA
ACCTGCTAAC
ATGGCAACCA CAACTTTAGG GGTGGT'GAAT ACCAAACAAA CCACTTGATC CGACGAATCC CCGACAGOC CTAAGCCAGG TCTITTGGGG AACAAACGAT TACAAAGATA ACGCTGGTGC CGTCOGACTG AGACAGCTTG CCCAACCAAG AACTAACAC TCACAATCTG CAA.AGCCAAC GAATCAGTCG TCATGGCTGC GGCGATAGTG TAGATACCGC AAATATGCGA AAAAATACAC ACACCAAACT AA
CGCACTGGCA
CGGAGCGATT
AGACCCAGAT
GACAGCAGGT
TACCTCAAAA
CGACCAAGAA
GACGCTAAAA
GGCCAAAGTA
GACAGCGCCA
TGGCGATAAA
TAAAACAAGT
GAcAcTTT EF098-2 (SEQ ID NO:374) M4KKTKVMTLM ATTTLGALAL VPMSALAVDG GEYQTNGAIQ FAPNTNPTNP VDPTNPDPDK PITPVDPTDP TGPKPGTAGP LSIDYASSLS FGEQTITSKN MTYYAE'TQKY KDNAGADQEG 193 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
PNFVQVSDNR GTETGWTLKV KQNGQFKTE-A NQELTAkAKVT LSNGRVVSAS QSAKPTTAPA TIELNPTGAE SVVMAAGDKE GAGTYLMSWG DSVDTAKTSI SLEVPGSTTK YAKKYTTTFT WTLTDTPANT CN EF098-3 (SEQ ID NO;375) AGTCGAC GOTGGTGAAT ACCAAACAAA CCGAGCGATT
CA-A'TGCAC
AA.ACCAATTA
CCGTTATCCA
AATATGACCT
GGCCCA.AACT
GTAAAACAAA
ACTTTAAGCA
GCTACGATTIG
GAAGCTGCCC
ATTTCATTAG
ACI'GGACTT
CAAATACGAA
CACCACTTGA
7TGACTATC
ACTATGCAGA
ITTGT1CAAGT
ATGCTCAATT
ACGGACGCGT
AATTAAAC CC
CTACGTACTT
AAGTACCTGG
TGACAGATAC
CCCAACGAAT
TCCAACTGAT
ATCTAGCTTA
AACACAAAAA
CTCAGATAAT
CAAAACTCA.A
GGTTTCAGCT
AACTGGGC-CT
AATGAGCTGG
TTCAACCACA
ACCTGCTAAC
CCAGTTGATC
CCGACAGGGC
TCI=TGGG
TACAAAGATA
CGTGGGACTG
GCCAACCAAG
TCACAATCTG
GAATCAGTGG
GGCGATAGTG
AAATATGCGA
ACAGGAAACT
CGACGAATCC
CTAACCCAGG
AACAAACGAT
ACGCTGGTGC
AGACA*GTTG
AACTAACAGC
CAAAGCCAAC
TCATGGCTGC
TAGATACCCC
AAA-AATACAC
AGACCCAGAT
GACAGCAGGT
TACCTCAAAA
CGACCAAGAA
GACGCTAAAA
GGCCAA.AGTA
GACAGCGCCA
TGGCGATAAA
TAAA.ACAAGT
GACAACTF1 EF098-4 (SEQ ID NO:376) \TDC G2YQTNGAIQ FAPNTNPTNP VDPTNPDPDK PITPVDPTDP TGPKPGTAGP LSIDYASSLS FGEQTITSKN MTYYAETQKY KDNAGADQEG PNFVQVSDNR GTETGWTLKV KQNGQFKTEA NQELTAAKVT LSNGRVrVSAS QSAKPTTAPA TIELNPTGAE SVVMAAGDKE GAGTYLNSWG DSVDTAKTSI SLEVPGSTTK YAXKYTTTFT WTLTDTPANT GN EF099-1 (SEQ ID NO:377) C
C.
C
TGATGTTGTA
ATGAAGAAAT
TTTTAG
ACACCACAGG
TTGCAGTT'G
AGAATTCAAA
TATGGAAITG
AA.AAGTCAAA
GATGCTTCTG
GCGGTAGATA
GTAAA.AAGAA
GGAGGTTCAG
ACTATTAATG
CTACAAATGA
AGTTCAI'TG
GTCATTAAIT
GCAAGTGTTG
ATTGTTGCTA
Ar-GAAATAG
GAGGGCTGAT
TAGGCAAGGT
GTGTAITTC
AA.AAAGTAGC
CTTCCGCTTG
GTGAT=ATC
GGTTAGGACA
AAAAGGAATG
ATAGTGACTT
TITGAAGCT
AGGCTAGTGC
CCAATGTTGG
GTGGTCAATG
TGGGTACGGG
GTTGGACAGT
TGTCAAGG
AAGGTAAAAA
AGTATTTITCG
GAAATGTTTA
TTAALTGTT
TTCTAGTGAA
ATI'AGAAGTT
GATTGGCAAT
GTTTAATTCA
ATGGGATTCA
GAAATCAGTA
ACTTAAAAGA
GTGGGAACGA
TAATAATI!GG
TGGGAAAA
TTATGCTA
GCATAM='1
CATAAAGAAT
TGGTGTGGCT
CAAGT'ITACT
GACTI'GGGGA
TCAGTCTTCT
AGTTGT=1A
ACCGGACATT
TCTAACTACG
ATGGAACATG
GCGATAGCTT
GGACGA-AGAG
GCTTTACAAA
ATGTCTAAAT
GCTGGAACA.A
TATAA.ACGAC
ATTGATCCCT
TCTGC PI=
GCGAGTCAAA
CCAAATTATT
ACTAGTATT
ACTTATGAGC
TTAGATTC
TTTATTGAA
TTTTTATTCT
CTTCCCAGTT
TGACGT-CACA
AAAGTGGATT
TTAkATCCTTC
TTAATTTATT
TGGATCTTC
CAA.AAGATCT
AAGATGATCC
TTCTACAGG
TGGAAAAAGT
TTTGAAAA
TTGGTAATGA
CAGATATTAA
ATGGGCATAC
AAA.ACGCTGA
CACATGTGAC
AGGAGAGATC
TCCTTTA
TCAGCCCGCT
TGGCGGAACG
AAATCCTGCT
GTTAGGCGGT
AAATTTTGCA
GTGGAATP.AG
GAATACACI'
CGCAGAACAA
TTCCATGGGC
GATGGGGCAA
ACAAGGACCGT
'ITATCCTTGG
AGCAGGAGAT
TGGTGTAGTG
ACAAGGTCAA
CAGCATAGTA
C
C C C C CC EF099-2 (SEQ ID NO:378) MKCLS VFFL.LKGEIM KKLGKVLIVS CFIFILPFLL FLGVFSSSES GDSSQFQPAT PQEKVALEVS NYVTSHGGTL QFASAWIGNN EHESGLNPAR IQSDLSFNSA IAFNPSLGGY GIGLGQWDSG RRVNLLNFAX SQKKEWKSVA LQMDFAWNKD GSDSDLLKRH SKSKDVNTLA 194 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
VDILKLWERA GTKDDPAEQV KRKASAi\TNWY KRLSTGSMGG INGGQCYGLS AFFVEXQGCL QMMGTGHMFA SEIGNDYPWS I Z'FGQGG VAT S IYGHTGVVA SVEGKNKFTT YEQNAEQGQ I
K
E.7099-3 (SEQ ID NO:379) TAGTGA.A. AGCGAGATT CTTCCCAGTT TCAGCCCGCT
OSANVGGGKI
SIrGWTVIKNP
VA-KYFRTWGL
CALEKVMGQT
NYSDIKAGDV
CFPHVTS IVR
ACACCACAGG
TTGCAGT='
AGAATTCAAA
TATGGAATTG
AAAAGTCAAA
CATCGTTCTG
GCGGTAGATA
CTAAAAAGAA
GGAGGTTCAG
ACTATTA-ATC
CTACAAATCA
AGTTCAATTG
GTCATTAATT
GCAAGTGTTG
ATTGTTGCTA
AGGAAAT
AAAAAGTAGC
CTTCCGCTTG
GTCATTTATC
GGTTAGGACA
AAAAGGAATG
ATAGTGACTT
=ITGAAGCT
AGGCTAGTGC
CCAATGTTGG
GTGGTCAATG
TGGGTACGGG
G TGrGACAGT
TTGGTCAAGG
AAGGTAAAAA
ACTATTTTCG
ATTAGAAGTT
GATTOGCAAT
GTTTIAATTCA
ATGGGA'FrCA
GAAATCAGTA
ACTTAAAAGA
GTGGGAACGA
TAATA-ATTGG
TGGACGAAAA
TT'ATGGCTI'A
GCATATGTTI
CATAAAGAAT
TGGTGTGGCT
CAAGTTTACT
GACTTGGGGA
TCTAACTACG
ATGGAACATG
GCGATAGCTT
GGCGAAGAG
GCTTTACAAA
ATCTCTAAAT
GCTGGAACAA
TATAAACGAC
ATTGATGCCT
TCTrGC=T
GCGAGTGAAA
CCAAATrATT
ACTAGTATTT
ACTTATGAGC
ETAGA'IWC
TGACGTCACA
AAAGTGGATT
TTAATCCTTC
TTAATTTATT
TGGATTTTGC
CAAAAGATGT
AAGATCATCC
TTTCTACAGG
TGGAAAAAGT
TITCTTrAAAA
TTGTAATGA
CAGATATTAA
TGGCCGAACC
AAATCCTGCT
GTTIAGGCGGT
AAATIPrGCA
GTGGAATAAG
GAATACACI'
CCCAGAACAA
TTICCATGGGC
GATGGGGCAA
ACAAGGAGGT
TTATCCTTGG
ACCAGGAGAT
ATGCGCATAC TGGTGTACTG AAAACGCTGA ACAAGGTCAA CACATGTGAC CAGCATAGTA EF099-4 (SEQ ID NO:380) 9 0.
0* 0 9 9.
S
9 SES GDSSQFQPAT PQEKVALEVS NYVTSHGGTL GIGLGQWDSG RRVNLLNFAK VDILKLWERA GTKDDPAEQV INGGQCYGLS AFFVE-KQCGL INFGQCGVAT S IYGHTGVVA
K
QFASAWIGNM
SQKKEWKSVA
KRKASANNWY
QMMGTGHMFA
SVEGKNKFTT
EHESGLNPAR
LQMDFAWNKD
KRLSTGSMGG
SEIGNDYPWS
YEQNAEQGQ I
IQSDLSFNSA
GSD)S:LLKRM
GSA.NLVGGGK I S IGWIVIKNP
VAXY.TRTWGL
IAFNPSLGGY
SKSKDVNTLA
DALEKVMGQT
NYSD IKAGDV
DFPHVTSIVR
EF100-1. (SEQ ID NO:381) 0
TANTTATGGC
ACATI'ATTAG
CCTAACTTAG
ATTGTAGAAT
GAATTACTCA
CAATGA
AATATGGAAG
AAATGTTGAT
CGAAACATAA
CACAAATCGA
ACGAAGGCTA
GAGTTTTATA
TGTCTTATTG
AGAAACAGTT
GCTCTACACA
CATTACTAAA
ATGAAP.AAGA
ATATr~rCG
GATAAAAAAG
CTAGAAAAAA
GAGCAGTI'AG
AACAAAAATA
TAITATTI'
GCAAT--AAGC
ATAAGACGCC
ATAAATATAC
CGCAGGGCTI
ACTTT'ITGTC
AATCGTAAA.A
TTCCTTA-AAT
AGCAGAAAAG
EFI00-2 (SEQ ID NO:382) MKKKQKYAGF TLLENL IVLL II SVL ILLFV PNLAKHXEfrV DKXGiqEAIV'K IVESQIELYT LEKNXTPSLN ELVNEGYITI( EQLDKYTAEK Q EFIQO-3 (SEQ ID NO:383) TAA AGAAACAC'IT GATAAAAAAG GCAATGAAGC AATCCGTAAAA ATTGTAGAAT CACAAATCGA GCTCTACACA CTAGAAAAAA ATAAGACCCC TTCCTTAAAT GAATTAGTCA ACGAAGGCTA CATTACTAAA GAGCAGTTAG ATAAATATAC AGCAGAAAAG 195 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes,
CAAT
EF100-4 (SEQ ID NO:384) KETV DKKGNEAIVK IVESQIELYT LEKNKTPSLN ELVNEGYITK EQLDKYTAEK Q EF100-l (SEQ ID NO:385)
TANTT~ATGGC
ACATI'ATTAG
CCTAACTTAG
ATTGTACAAT
GAATTAGTCA
CAATGA
AATATGGAAG
AAATGTTGAT
CGAAACATAA
CACAAATCGA
ACGAAGGCTA
GAGTTTTATA
TGTCTTATTG
AGAAACAGTT
GCTCTACACA
CATTACTAAA
ATGAAAAAGA
ATI'ATTCCG
GATAAAAAAG
CTAGAAAAAA
GACCAGTTAG
AACAAAAATA
TATTGATTT
GCAATGA.AGC
ATAAGACGCC
ATAAAT
1
ATAC
CGCAGGGIT
ACTTTTTGTC
AATCGTAAPA
TTCCTTAAAT
AGCAGAAAAG
EF100-2 (SEQ ID NO:386) MKKKQKYACF TLLEMLIVLL IISVLILLFV PNLAKHR!ETV DKKGNEAIVK IVESQIELYT LEKNKTPSLN ELVNECYITK EQLDKYTAEK Q EF100-3 (SEQ ID NO:387) TAA AGAAACAGTT GATAAAAAAG GCAATGAAGC ALATCGTAAAA ATTGrTAGAAT CACAAATCGA GCTCTACACA CTAGAAAAA.A ATA.AGACGCC TTCCrrAAAT GAATTAGTCA ACGAAGGCTA CATTACTAA-A GAGCAGTTAG ATAAATATAC AGCAGAAAAG
CAAT
EF100-4 (SEQ ID NO:388) 4.
S.
a KETV DKKGNEAIVK IVESQIELYT LEKNKTPSLN ELVNEGYITK EQLDKYTAEK Q EF101-1 (SEQ ID NO:389)
S
5* TGAGGAGATG AAACGAAGAA GTAATTGCGG TTGGGGGCAT GCTGTCAAGC AAGCGCCTAA CAACAAATTT ATArI'AACGG CAAAAAATAA CAAAGGATCC ACACAATTAT ITACTI'ATGA AGCTTACCCA AATTAGAAAC GATAAATTTA ATAAAACTAA CAATATCAAA CAGAAGTCGA GCGGATI'TAG GAGCGAAGCA ATTCCAGAAG TAAAAGATGC TTAGCTGGAA AAGTGAATGA CTAACTTCTG TTTCCAACAA CCTCCTGAAG GCAACAGCGA AGTTATAGCG TCAAAATTGC CAAGCAACCA TrrGAT1'TAGG AAAGAGGGTG AACAGG-CCTA GTCCAAGTCG GGCAAGAAAA GACCCAGTGG TTAmTCTTC
CGTAACTGTG
AGATGACTGG
TGTCATCCAA
AGAGATTAAG
AGATGAGGCG
GAAGCGGGCG
AGAAGAAGAC
TGCAGTAGAT
ATATATTTCC
CAATTCACCG
AAAGGACTTG
TGTGGITGTG
TGCCGCGAGT
GTTGGCCAAT
CGATTTAGGG
CGTTTTAGG
TGGCGACAAA
AAAAAAACCA
AATGCGTTAA
GGAATTrGACT
CCCGAACAAA
GTGAA-AAACG
GTCACAAAAG
AATATCTATA
CGCACGATGT
GAAGAGXITA
ACAAAGG3CTA
AT=TACGGT
ACTAAAATTA
GATGGCTCAA
GGCAATCCAG
TTAGACAAAG
GCGATTGAGT
AATGA=~G
ATGGCGATTG
GTAAAAGTCG
AAlrGAAGAAG AAAACGATAA TTATATTGGG GGCAGTTGCG ATAAAAATGC ACAACAAGTA ATI'TTGACGT TCCCGACITG TGGAAGCCTT TGCGCGTGAT GCGATGTCGT AGATOGCAGOC AAATTGAGGC ACAACAAAAT ATAAGTGGAA TCGGGCCATT CTGGTGATGA TTTAAATGA-A CCTTCACCAA TGAAACCTTA ATTTCAAAGG TCGTGTATCA TAACTTCAGA AGATCFI'TAT GTGT IGGGCA AAAAGCTAAA TTTCTTACAT CGATGATAAT AGGGCGGCAC AACGATGTCT TCAAAAATGG CTACCATATG TACCGAAAAA AGCGATTCAA GAACCATCAT TCGTCGTGAT AATCMGCTT AGAATCAGCC GTGATATIGT TGAATCAGAT 196 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes.
GCAGCGATTG CTCTATGA ATCAGCAACC AACGAATCA-A TGACAGATGC GTCGAAATAG EF101-2 (SEQ ID NO:390) MKKK TIIILGAVAV IAVGGIVTVN ALNKNAQQVA VKQAPKDDWG IDYFDVPDLQ
QIYINGVIQP
LA.KLETKRAN
DLGAKQYIST
TSVSNNVWVD
ATIDLGDLGA
R'JVISSKKPV
EQMEAFARDQ
IYNKWNRAID
KANFXGRVS I
GSISYIDDNP
IELPKKAIQK
KVGDIVESDA
KITKDPE 1KV
KFNKTKEEDR
P EVKDANS PI
PEGNSOAASG
EGEQAYVLVN
AIASDESATN
KNGDVVDAGT
TMSGDDLNEQ
LRLTSEDLYL
NPEGGTTMSS
OFGTI IRRDV
ESMTDASK
ELFIYEDEAV
YQTEVDAVDE
AGKVNE-KOLT
YSVKIALANL
QVGQENGDKI
TKEI EAQQNS EITFTNrTLA
KISVGQKAKL
DKVKNGYHMQ
AIESGLESAD
EFIO1-3 (SEQ ID NO:391) TAAAAATGC ACAACAAGTA
GCTGTCAAGC
CAACAAATTT
CAAAAAATAA
ACAGAATTAT
ACrAGCCA
GATAAATTTA
CAATATCAAA
G'CGGATTTAG
ATTCCAGAAG
TTAGCTGGAA
CTAACTTCTG
CCTCCTGAAG
AGTTATAGCG
CAAGCAACCA
AAAGAGGGTG
GTCCAAGTCG
GACCGAGTGG
GCAGCGATTG
AAGCGCCTAA
ATATAACGG
CAA.AGGATCC
TTACTTATCA
AATTAGAAkAC
ATAA-AACTAA
CAGAAGTCGA
GAGCGAAGCA
TAAAAGATGC
AAGTGAATGA
TTTrCCAACAA
GCAACAGCGA
TCAAAATTGC
T'TGATTTAGG
AACAGGCCTA
GGCAAGAAAA
TTATTTCTTC
CTTCTCATGA
AGATGACTGG
TGTCATCCAA
AGAGATTAAG
AGATGAGGCG
GAAGCGGGCG
AGAAGAAGAC
TGCAGTAGAT
ATATATTTCC
CAAITCACCG
AAAGGACTTG
TGTGGTTGTG
TGCCGCGAGT
GTTGGCCAAT
CCATTTAGGG
CGTTTTAGTG
TGGCGACAAA
AAAAAAACCA
ATCAGCAACC
GGAATTGACT
CCGGAACAAA
GTGAAAAACG
GTCACAAAAG
AATATCTATA
CGCACGATGT
GAAGAGATTA
AC AAAGGC TA
ATTTTACGGT
ACTAAAATTA
CAT'GGCTCAA
GGCAATCCAG
TTAGACAAAG
GCGATTGACT
AATGATTTTG
ATGGCGATTG
GTAAAAGTCG
AACGAATCAA
ATTTTOACGT
TGGAAGCCTT
GCGATGTCGT
AAA'ITGAGGC
ATA-AGTCGAA
CTGGTGATGA
CCTTCACCAA
ATTTCAAAGG
TAACTTCAGA
GTGTTGGGCA
TTCTTACAT
AGGGCGGCAC
TCAAAAATGG
TACCGAAAA.A
GAACCATCAT
AATCTGGCIT
GTGAkTATTGT
TGACAGATGC
TCCCGACTTG
TGCCTGAT
AGATGCAGGC
ACAACAAAAT
TCGGGCCATT
TTTAAATGAA
TGAAACCTTA
TCGTGTATCA
AGATCTTTAT
AAAAGCTAAA
CGATGATAAT
AACGATGTCT
CTACCATATG
AGC-GATTCAA
TCOTCGTGAT
AGAATCAGCC
TGAATCAGAT
GTCGAA.AT
EF1OI-4 (SEQ ID NO:392) KNAQQVA VKQAPKD)DWG IDYFDVPDLQ QIYINGVIQP EQMEAFARDQ KITKDPEIKV LAKLETKRAN~ IYNKWNRMID KFNKTKEEDR DLGAKQYIST KAN~F'KGRVSI PEVXDANSPI TSVSNNVVVD GSISYIDDNP PEGNSDAASG ATIDLGDLOA IELPKKAIQK EGEQAYVLVN RVVISSKKPV KVGDIVE-SDA AIASDESATN EF102-1 (SEQ ID NO:393) TAAACAT=r
ATTAATTATA
TT1AACTCAAA
CACCCTCAAG
TTTAGCGATG
GCAAAACAAA
GGITATCAGA
TGGGAAAAGG
CAACTGAACG
AGACATTCAG
TGAAAAAAAC
CAA ETGGCGG
AGGTAACAAT
GCAGCACGCT
CAGTATTTTG
AAAACCCTTT
CTGGAACAGA
GTTATAAACT
AGGTGAATGT
AACATTI'AAA
AACGATTGGT
TCATTATGAC
CTCAGAACGT
TATTGAACCA
GCCATCAATG
TATTGATACA
CCATTCCATA
KNGDVVDAGT
TMSGDDLNEQ
LRLTSEDLYL
NPEGGTTMSS
DFGTIIRRDV
ESMTDASK
CTC'I=rTTA
AATTGTCGT
CCTACGATTG
GTAAGTAAAC
ACTACGTCAT
GGTGTTAGTA
TCTGATAAAG
AATATGG=
AAAAGATTAG
ELFTYEDE-AV
YQTEVDAVDE
AGKVNEKDLT
YSVKIALANL
QVGQENGDKM
TKEIEAQQNS
EITFTNETLA
KISVGQKAKL
DKVKNGYHMQ
AIESGLESAD
TACTCAAAA
TAITTOrCGAC
C'ITIGCCGA
TGTATGAAGT
TATATGCAGA
'FTCCAACAGA
CGAAACTAGT
CACAAAACAT
GTGGTGC'
ACGAAAGGGG
TITGGCTCTA
TGAAATTACT
TGACGGAACT
ATACAATGGT
AGTGACGCAC
ATCGGTTCTT
GATTTGGGAA
AGTTGATATA
197 TAB3LE 1. Nucleotide and Amino Acid Seqeuences; of E. faecalis, Genes.
AAATCTATTG
CATAATACCA
TTAAA='AT
ATTGGGAATC
AAAAAATCAG
ATGGCTGGTG
GGTTCTTTAA
TTCCATTTAG
ATTTCTAIT
CCAGATCCTT
ATTTCCATGA
GTAGAAACAG
ATTCGTAAAG
CGTGCGGAAA
GAAACTAAAT
TATGCCAATC
ACTGGGGAAA
GCTGAGTTTA
TGGAGTGAAG
ACTTTGCCTT
TTCTGGCAAG
TCCATCAAAA
AA.AGAACA-AG
ACTGCCGAAA
GAAATCACAG
GGCTATGGTA
CCAGAAGGAT
GACGACTATG
ATTAAGATG
AACCGTTTGA
TGGAAAGACG
AGATATAACT
GAAGCCACAA
ACAACCGCAA
ACCGCTGAAC
GTAGCCT'IG
CAAACCGTCA
GATGGTTCGC
ACCCATCATG
CCAGATGGTA
AAAGAAIM1A
ACTAAGTTA
AATGAAGATT
AGTACGCCAA
ACAGTGAACA
TrTATCTrGCA
AAGGAAAAAT
CTGTAAAAAC
CTGAG'ITGA
AATTAGTGCT
CTGGTACTGG
CGCTTGATAA
AGATCAAAAA
ATTT-GGGAA
TGGATGGAAT
ATATCATTGA
CTTCGAAAAA
GTACTGATCT
ACAGCCCAGC
CACCAAAAGA
CTAGTAATGG
AAACCGTGGC
CCAC'ITGAC
AAGGAGCTGA
C=~TAAAAC
TAGATGAAAA
AAACCAAAGC
AAGTTGATALA
TTATTCCCTT
CTGGATTIALA
GTGCTGAAGA
AG'ITTGAAAA
TTCAAAAGAT
CGAAGAGTGA
TCACCGTTCC
TGCTTTATGA
GAAATAAAAA
TGCATGAAAT
AAGCTC-CCCA
CGTTGGCGAA
AAGTTTTGGA
AAGCAGGCAA
ATTGTACGAT
AAACTTTTAC
TACTGGATCC
CGAACAAAGA
CCAAAACCGT
CTTTACGGA
TGAAAGAAAA
AACAACCGGA
CATTCCCGCA
T'TCGAC
TAATAAGGCA
AATTTTIAGGT
TAAAGTCGTC
TACTCCAAAC
AACTCCAGTC
GCCCAATACC
AATCGATAAA
ACCTTTACCT
TCCCCATOGT
TACCACACCC
TATGCGACAA
TTGGA.ATGAC
TGGTGAA.ATT
GCTTGCTAAT
'TTCGTGAAT
TCTI'GTTACC
AAAAGAAGAC
ATATACTCTC
AGAATTAGTG
CAACCAACTT
ACCTGAAGGA
TAACGAAAAA
TGCTIGAT
CGACTTATCT
TA.AAGCGACC
TC~rCCITAT
TACACCACTA
GTATGTCTTT
LTACGAGAAA
TTTGCCCGAG
ATTGAATACC
CAAAGAAGAC
AGAAAAAGAC
CAAAGAGAAA
TAAAAGCATC
TGAGCCAGTA
TGAACGCCAT
TCATGGTGAC
CTCAAAAGAA
AATTTGGAAA
ACTTGCGGAA
AATCAATTAC
ATCTCAAACC
AACACCAGCT
ATTGAGGAGT
CAATCGACAA
CAAAATACGG
TCTA.ATrCCA.
GCTTATAAAA.
TACGCTATTA
GAATCAGGTC
TCAAAAGATC
ACAAAGGTAA
ATCCCTGCCA
AAAGGTCAA.A
AATT'ATrCTC
GTCCAAGAAA
GCTTTGAAC
ACCTTCAAAC
AGTA.ACGTAA
AAAGATACCG
TrACTGCAA
AAGGGAACGA
GCCGTTAAAC
TATACT7MG
AATGCCGTAA
TTCTTTAAAT
TTAAAGTGT
ACAGCTTGTA
GGGGATTATT
GAAATCCGTT
ACCATTACC.G
CTAACTAACA
AAAGAAGATA
CTTGA=TTA
TGGTATGTCG
GAAAAAGCCA
ACTGGAACTT
GTCI'CTTCA
GCGAAGGATC
GTTTCCATCC
GTGATGGATA
GCTTTCGAAA
TCTGrGCAAAA
AAAGTAGATA
GAAAAAGATG
TTAACACCAA
GTTCCAAGTA
PAICAAAAAA ACCA.AGTT'T CTTTAATAGA TAAAAATGAA cGAATATAGA rTACCGTGTA AATCAGGA-AC AFI'AACATTG A.AGCAGGACT TCAAACTGTG AAA~rAATGT GGAAACTAAG ATATTCTACC AGAAACGGTT TC3ACXACAGA TAMA.GATGGG CTATTACTGA AAAATCGGTG CCATTAAAGC GGGCGAGACC TTCTTTTAGA GAAGACTGGG TAGCTGGAAA TACATTIGCC TA.ACA.ACGGA TGAAAAAGGT TGGGAA~CCTA TTACGTGACA CAACAAAAGT CGAGTTAAAA AAGGGCAAAA CCAAGAAATT GTAATGAGAG TCAAGGGAAA AAGATCGTCA AGCTGTTAAA AkAGCTTCTG.A TGAAACAGTG ACCTAGCAAT TAACGAGTAT ATGAALACGAA GTATCCTGTA TTACTCGAGA TGTTACGGCA TTCCTGGATC GGCTGATGGC CGCCATTGGA AGGC-ACCAAN ACGAGCAALT AGGTTTTGAT TACTTGAAGA AATAGAGGCT CTACATT'AA GGAAAACAAA AAGAAGGACA AAAACAACCA ACGAGTI=C TGTTAGTCTG CTTTAC1'IC TCTTGCGACT CCGAGCTAGT TGATAAAT'G TAGCTCAAGC CATTGATGTG A.ACCGGTGGT GA TCGCCGAA GGAAAATrCT GCATAAATTA ATTATGTGTA TGAAAACAAG CTAGCTTGAA cAATCAAGCA AAACAAAAGC CCACCTAGAA TGT'IGATGA TGTGTCGGT' CAA7MCTGTA TGCTTI'ACTA TTCACCATCGA AGTGAATGAT CCGGAAAGTA TCCAGAAGGA GAAACGTGAA TGGAAAACAC A.AGAAGTGCC AACCATACCG AT1rCTCAAGA ATCTAGTCCC ACGTTCTACT GTTAGITGGC ATCGCCGCAA TTAA
S
S S 555 5
S
S
S 5555 S C AACTGGGGAG .AAAAATTCCA TGCTGGGTAT TATTTCTGGA EF102-2 (SEQ 10 NO:394) MXKTTFKN WSLFATLA.L SQTIGGTIGP TIAE'ADEITH PQEVTIHYDV SKLYEVDGTF SDGSTLSERT TSLYAEYNGA YQKNPLPSMS DKAKLVSVLW EXAGTDIDTN MVAQKIIWEE SIEGKINXAI EEYQKKPSFH NTTVKTILGQ STTLIDKNEL GNQLVLTPNS NSKSGTLTLK KSAGTGTPVA YKKAGLQTVM KQTVFC IEPG VNGYKLHS 1K
NLSEFDKVVQ
AGALDKPNTY
VSIPTEVTHG
RLGGASVDIYK
NTANIDYRVI
AIKINVETKG
198 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
SLKIYKKIDKE
DPYMIDTTPM
RKDS PAGE IV
ANQTVALVTS
SEAFKTELVK
IKKVDNNEKN
ITGAEDKATI
DYARSEYVFT
KDGNKKLNTL
TATLANKEKT
TVNCTIERHV
OGTNKEIWKS
EDLKEKSQTL
ILIFSTAGYY
SGDIVPETVF
AATIKAGETI
QEITTDEKGR
NVKGQNQEIT
GTKASDETVT
AV ITRDVTAK
ACNEQLGFDG
ITEEGQKQ 21
DFTELVOKLR
GTWKILHKLT
SIQTKAHLED
GK IEHEVNDK TPKEVPTI PS
FWNRRN
HLDFGKA.LPS
SMTSKNMRQK
AETPKELANA
GET'LTKEDK
L.AIDEKNQVA
EQVIRFGFDF
YGKFENLPYG
KMVTVPYEKL
YNLH-EIKEDW
AEQVLDKS IV
GSQTFTHGDV
EFTK'IVLAEK
TPKQPETPAV
KDVTTDKDG I GQ ILLEKTGV
LE-LGTYVVTE
DTGNESQGKA
VKHLAINEYF
FKFAGSADGT
DYLLEEIEAP
TN'NEFSVSLN
YVVAQAIDVE
LFNYVYENKV
MOMFDDVSVT
VDTGKYPEGT
PSNSQESSPT
SILDGIPHGT
ETGTDLWNDN
TKSSNGFVNT
EVKGAEYTLF
WQETKAPEGY
AETGFNDLSF
EGFQKITPLE
RLMLYOLPEK
ATKA.AQEKDE
AFEAGNEPVA
HDVLDGSKEA
KFTFTE INYE
VKTFPQTGEK
KVTITEKSVP
YSLAGNTFAI
FKPTKVELKY
TAKDGQAVKW
TLDETKYPVS
KVS PLEGTXE
IRSTFXENXD
EDSLTSLATW
KAKPVVIAE.T
KDASLNNQAQ
FETILYALLP
KDGNVNGKHN
NSNVLLLVGF
EF102-3 (SEQ ID NO:395) TT TAGATGAAAA GAACCAAGTT GCCG ITAAAC ACCTAGCAAT TAACGAGTAT
S
S.
S
S.
S S SSO a.
S S
S
S
TTCTGGCAAG
TCCATCAAAA
AAAGAACAAG
ACTGCCGAAA
GAAATCACAG
GGCTATGGTrA
CCAGA-AGGAT
GACGACTATC
ATTAAGATGG
AACCGT ITGA
TGGA.AAGACC
AGATATAACT
GA.AGCCACAA
ACAACCGCAA
ACCGCTGAAC
GTAGCCTTTG
CAAACCGTCA
GATGGTTCGC
ACCCATGATG
CCAGATGGTA
AAAGAATTTA
ACTAAGTTTA
AATGAAGATT
AGTACGCCAA
AAACCAAAGC
AAGTTGATAA
TTATTCGCTT
CTGGAT1'TAA
GTGCTCAAGA
AGTTTGAAA-A
TTCAAAAGAT
CGAAGAGTGA
TGACCGTTCC
TGCTTTATGA
GAAATAAAAA
TGCATGAAAT
AAGCTGCCCA
CGTTGGCGAA
AAGTTTTGGA
AAGCAGGCAA
ATTGTACGAT
AAACTTTTAC
TACTGGATGG
CGAACAAAGA
CCAAAACCGT
C TITTACGGA
TGAAAGAAAA
AACAACCGGA
ACCTGAAGGA
TAACGAAAAA
TGGCTTTGAT
CGACTTATCT
TAAAGCGACC
TCTTCCTTIAT
TACACCACTA
GTATGTCTTT
TTACGAGAAA
TTTGCCCGAG
ATTGAATACC
CAAAGAAGAC
AGAAAAAGAC
CAAAGAGAAA
TAAAAGCATC
TGAGCCAGTA
TGAACGCCAT
TCATGGTGAC
CTCAAAAGAA
AAITTGGAAA
ACITTCGGAA
AATCAATTAC
ATCTCAAACC
AACACCAGCT
TATACTITTGG
AATGCCGTAA
'TCI'TAAAT
ITTAAAGTGT
ACAGCGTA
GGGGAT'rATT
GAAATCCGTI'
ACCATTACCO
CTAACTAACA
AAAGAAGATA
CTTGAmMTA
TC;GTATGTCG
GAAAAAGCCA
ACTGGAACT
GTCTTCGITCA
GCGAAGGATG
GTTTCCATCC
GTGATGGATA
GCTTTCGAAA
TCTGGCAAAA
AAAGTAGATA
GAAAAAGATG
TTA-ACACCAA
GTTCCAACTA
ATGAAACGAA
TTACTCGAGA
TTCTGATC
CGCCAITG'-A
ACGAGCAATT
TACTTGA-AGA
CTACATTTAA
AAGAAGGACA
ACGAGTTTTC
GTTTGACTTC
CCCAGCTAGT
TAGCTCAAGC
AACCGGTGGT
GGAAAATTCT
ATTATGTGTA
CTAGC'ITGAA
AAACAA.AAGC
TGTTTGATGA
CAATTCTGTA
TTGAGCATGA
CCGCAAAGTA
GAAACGTIGAA
AAGAAGTGCC
ATTCTCAAGA
GTATCCTGTA
TGTTACGGCA
GGCTGATGGC
AGGGACCAAN
AGGTTTTGAT
AATAGAGGCT
GGAAAACAAA
AAAACAACCA
TGT'TAGTCTG
TCTTGCGACT
TCATAAATTIG
CATTGATGTG
GATTGCCGAA
GCATAAATTA
TGAAAACAAG
CAATCAAGCA
CCACCTAGAA
TGTGTCGGTT
TGCTT'rACTA
AGTGAATGAT
TCCAGAAGGA
TGGAAAACAC
AACCATACCG
ATCTAGTCCC
ACAGTGAAGA
EF102-4 (SEQ ID NO:396) LDEKNQVA VKHLAINEYF WQETKAPEGY TLDETKYPVS
IKKVDNNEKN
ITGAEDKATT
DYAKSEYVFT
KDCNKKLNTL
TATLANKEKT
TVNCTIERHV
DGTNKEIWI(s
AVITRDVTAK
ACNEQLGFDG
ITEEGQKQPI
DFTELVDKLR
GTWK ILHKLT S IQTKAHLED
GKIEHEVNDK
EQVI RFGFDF
YGKFENLPYG
KMVTVPYEKL
YNLHE IKEDW
AEQVLOKSIV
GSQTFTHGDV
EFTKTVLAEK
FKFAGSADGT
DYLLEEIEAP
TNNEFSVSLN
YVVAQAIDVE
LFNYVYENXKV
MDI4FDDVSVT
VDTGKYPEGT
A.ETGFNDLSF
EGFQKITPLE
RLMLYDLPEK
ATKAAQEKDE
AFE-AGNEPVA
HDVLDGSKEA
KFTFTEINYE
KVSPLEGTXE
IRSTFKENI<D
EDSLTSLATW
KAKPVVIAET
KDASLNNQAQ
FETILYALLP
KDGNVNGKHN
199 TABLE 1. Nuclectide and Amino Acid Seqeucnces of E. [aecalis Genes.
EDLKEKSQTL TPKEVPTIPS TPKQPETPAV PSNSQESSPT VK, EF103-1 (SEQ ID NO:397)
TAAGATAGGT
ATTACTTTAG
GATCATGCAG
CAAACAAGAG
GTTTATGATG
AAATATGATG
GGTGACGA-AC
ACACAAAATT
AAGCGTTTAG
TATCATGGGA
AAAA'ITGTGA
AAAGTCGTAG
GCGA.AA'11T0
CGCGGCGACT
ATTGGTACGA
ACGTATACTC
GAACCTTACC
?I'ATCAAAGA
TAACATTAAG
CCAATCCAAA
GCGACAAGGC
CTGCTGGTAA
GTGAAACCGG
GAACATTrTTT
ATCA.AGCCGT
GGAAAGATAA
AAAAATI'AGC
CAAATAAATC
ATAAAAACGG
ATCCAAACAC
TTGGGTACTT
ATCGTTACGG
GAATGGGTAA
AAGGCACATA
AAAGGAGCGA
TACGTTACTT
TAGTGCTACA
GACTAAGATT
TGATTAACG
TFT=IACGAG
TGTGACAGGT
AGTGGATTTA
ACTGGGGAAT
'ITIACAAAT
AGG3ATAAA
TAATGATGTG
AAGTAAATAT
CCAAGTGGTG
CGCGGCGCTA
AGATAATGCT
TCATCCAGCC
TIGCTTTATG-A
ATAGCAGGTT
CCAAA21AG
-TATCTGGCA
GCAGAAAATG
T =TTCGACA
GCATGGCACA-A
ACCGAAGTGA
GATGTTGAAG
GGACGTGAAG
ATTTTAGGAA
ACAGCGGCCA
GAATTTCA
GACAATAACA
GAATTAACGG
GGTAATGATA
AAAAGAAA'-T
ATGCAAGTCC
GCAAACATCA
CGGACTGGCA
CTAATTTTAT
AAAATACTGG
AACGAATTTT
GTAAAGACNA
TTTACGTGGA
CATTAACCAA
CA.ACCTTGTr,
ATCAAAA'IT
AKrrA^AAAC
AGATTCGGGC
AACTAAACAA
TTCAAGTGTT
A'ITAAGTTCr-
AGCATTTGCA
AAACAATGGC
AGGAACCCGT
TGGTTTAGCA
GGAAACCCGT
AATTTCCCGG
ATTTACTTAC
ACACATCCCT
TCAAACTGGC
GAATGGCACA
CATTAGTTA
AGGTGAAACC
CCATGTATCT
TGATCGATT
CGTGGAACAT
=ITACITTTCT A-A EF103-2 (SEQ ID NO:398) 9 9 *99.
.99.
9 9999 9 9 9.
99 MXKKVLSSI TLVTLSTLLI AGYASPAFAD HAANPNSATA NLGKHQNNGQ TRGDKATKIL SGTDWQGTRV YDAAGNDLTA ENANFIGLLAK YD)GETGFYEF DEGTFFVTGD GTKRILISRT QNYQAVVDLT EVSKDXFTYK RLGKDKLGND HGKKLAFTNG REALTNQTGK IVTNKSGDKI LGTTLWNGTK VVDKNGNDVT KFDPNTSKYE FFNLQTGETR GDFGYFQVVD NNKIRAHVSI GTNRYGAALE YTRMGKDNAG NDIQVFkVEHE PYQGTYMPAF TF
FDKNTGETRG
VEVY"VEH IPY AANQNFI SLA
LTELNNDRFT
EF103-3 (SEQ ID NO:399) TCATGCAG CCAATCCAAA TAGTGCTACA GCAAXI'rAG GCAAACATCA AAACAATGGC
CAAACAAGAG
GTTTATGATG
AAATATGATG
GGTGACGAAG
ACACAAAATT
AAGCGTTTAG
TATCATGGGA
AAAATTGTGA
AALACTCGTAG
GCGAAATFrG
CGCGGCGACT
ATTGGTACGA
ACGTATACTC
GAACCTTACC
GCGACAAGGC
CTGCTGGTAA
GTGAAACCGG
GA.ACA=rTT
ATCAAGCCGT
GGAAAGATAA
AAAAATTAGC
CWATAAATC
ATAAAAACGG
ATCCAAACAC
TTGGG-'2ACTT
ATCGTTACGG
GAATGGGTAA
AAGGCACATA
GACTAAGATT
TGA'ITTAACG
TT=ACGAG
TGTGACAGGT
AGTGGATITA
ACTGGGGAAT
TTTTACAAAT
AGGGGATAAA
TAATGATGTG
AAGTAAATAt
CCAAGTGGTG
CGCGGCGCTA
AGATAATGCT
TCATCCAGCC
TTATCTGGCA
GCAGAAAkATG
TTITCGACA
GATGGCACAA
ACCGAAGTGA
GATGT7GAAC
GGACGTGAAG
ATTTTAGGAA
ACAGCGGCCA
CGGACTGGCA
CTAATIAT
AAAATACTGG
AACGA2ITT
GTAAAGACNA
TTTACGTGGA
CATTAACCAA
CAACCTTGTG
ATCAAAA=T
AGGAACCCGT
TGG'ITTAGCA
GGAAACCCGT
AATTTCGCGG
ATTTACTTAC
ACACATCCCT
TCAAACTGGC
GAATGGCACA
CATTAGTTTA
GAATTTTCA ATTACAAAC AGGTGAACC GACAATAACA AGATTCGGGC CCATGTATCT GAATTAACGG AACTAAACAA TCATCGATITT GGTAATGATA TTCAAGTGTT CGTGGAACAT
T
EF103-4 (SEQ ID NO:400)
I{AANPNSATA
TRGDKATKIL
DEGTFFVTGD
NLGKHQNNGQ
SGTDWQGTRV YDAAGNOLTA ENANFIGL~j( YDGETGFYEF FDKNTG;ETRG GTKRILI SRT QNYQAVVDLT EVSKDXFTYI', RLGKDKLGND VEVY-VEH IPY 200 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes.
HGKKLAFTNG REALTN'QTGK IVTNKSGDKI LGTTLWNGTK VVDKNGNDVT AANQNFISLA KFDPNTSKYE FFNLQTGIETR GDFGYFQVVD NNIRAHVSI GTNRYGAALE LTELNNDRFT YTRMGKDNAG NDIQVFVEHE PYQGTYHPA EF104-1 (SEQ 10 NC:401) TGAAAGGGGA TTAGTATGAA GCACAAAAT1' TCGGGTGC se 0.
GAGACCACG
TTAGCAACAA
ACAGACGCGA
GAGGACAAGG
GATGAACTAC
GAGGTTCACC
AACACAGCTr
TCAACTGTCA
AAAACATTAA
AAAAGTTATC
AAAAAAACCT
ACCAGAACGA
AATCAAACAT
A.ACTTGGAAC
ACGCAAACCT
ATTGCTGTAA
ACAATTTATT
ACAAAAATG
ACTCCTAAGA
TATATCAGTT
ACAAAAGTGG
AAATrTAccG
AAAACTGAAG
ATTTCTCAT
GTAATTGGTC
GTTGTTAATC
TCATCAAGAA
ATTAAAGTAA
TACAAGTTAG
ACTACGCCAA
TTGCCAAAAG
TI'AACTCCAG
CAAAGTAGTA
GTCCGTACAA
GATCAGGTAC
ATGACGCCAA
ACGTTTGATT
TGGATCGACG
AATGAAGGCT
CAAGCGGCGA
GATAATAAAA
AATGCTCAAA
TTAGAGAAGA
ATTACCATTG
ATCGACTCTG
ATCACATITGA
ATACAGCAALA
CGACTACTrC
GCACAACACC
ACACCTCACT
TTGACAGTAT
GCGAAFI'AAA
ATGCACCTGG
GCGGAGAAGT
GTATTACT
TAACAACAGA
ACTCGTTTGA
CGGGG ETAGA
TAGAATTATI'
CTCAAGTIT
TGTTAACACC
CTGTTCCAAA
TAGAGAGTGC
GGTCAATTTC
CGAGTCAAAC
'ITCAA.AGCAA
GTCAACAAAT
CGTATGGTCC
GTrCGCAACCT
TAACAATGGA
CTAATAAAGC
CTTTGAATGC
CAACTGTCTC
AGCATCCTAA
GAACGGATTA
TAACCAACGA
ATAAAAGTAT
TTGATACGAC
AAAATCAATT
AA.ATTCCAGC
ATTCAAITTA
ACAGCCCTGG
TGGAAAAAC
TCCCAACTCT
CTGCTTCGGT
ATCCAACAT l
CACATCGTGI
TCGATTTGAA
CTACAAACGC
AATACAATAC
AAACATTGAP
AATTCTCAGP
GAAAAAAACT
CCTAAATGCC
AAAAGAGGCA
AGAAATGAAT
TCCAGTGAT
TAATGA.AAAA
GAGTCTTGCG
TACAACACCG
ATCAGTTTAT
GACAGGCATT
TAATCAACGA
GGCGGAACCA
'TTTATATGAA
TGGCGAAT
AACAACAGAG
CAGTTACGAT
TGGCAAAGAT
CATGAATCAA
TTTTC?11'TG
TATCCTGTAA
GAGTTATCGA
CA.ACCAACTG
GCTGCTACAC
GCCfCTGCCAG
CCGATTGGTG
GTAACCGCTA
ACAGTTCAAT
GGCGCAACTT
TGATGFI'GAG
CAACGACAGA
ACTCAACACC
CAACAACTGA
CATCTGAACA
ATCTPCAAGC
GA.ACAGAATA
CGTTCCAATT
TACCAGAACA
GGGCAGTCGA
GTTTCAGATA C'FTCC'ITTAA AGTAGAACTA TTAATCAAAA TTGAAACTCC AGGAAAAAAT
CAAGTGGAAC
TTT-TATAATT
ACGCCACGC
GTCGACATTA
TATACATTAA
CAAAAAGCCT
TTCGGACTAT AACTACTTAT TTTGAAAAGT ACGACAGGAA
TATACTTCTC
AGCACAAACA
ATCTI'TACCT
ATCGCAAACC
ACAAACAACG
GCCAATTACA
CAGCCAAACA
TGCTGTTGGA
T11AGCI'AT
TGCGGCGACC
AAGTAAAGTA
AGGGAAATAT
CGTATTAGAG
TTTATATGAA
CACTTAACG
TTTTGACAAG
CATTCAATTA
TGAAACTGCT
AGTTATGGGA
TTATCTTTCA
TACAGTAACG
AATCCAAATT
CCCAGTCGAT
ACTAACTACT
CCTTGTCAAT
TGGCGCCGAT
TCCACAATAC
ATATCCAACG
CAACAAACGT
TTATATAACA
TTCTGTTCA-A
AAAAAATGTA
G.AAAAATCCA
TTCTATTACC
TGCGAAAGTC
GGTCTCTGCA
CCAGATGTCT
AGGTGATGCG
ATTGCTGAT'C
TATGTAACAA
AGTACAAACG
AATGTAAAAT
GCCACAAAAG
AAGGACATTA
GTATCCGATC
TGGGGTAAT1T
AGCAAAGAGA
TTACGAGCTA
CCAACGTCAG
CCAATTGGTT
ACGATACCGA
AATAGTAAGC
GCACGAAATG
GTTCTTTTTG
TGGGACC.GCG
ATTACTTTTG
TACATTATTG
GGGACAGCGA
~AMAGCC'I
ACAAAAACGA
ACGATTGAAM
GTGAALAGCCG
ATITI'AAAG
AACGCTGGCC
GCTAGCAACA
GAAGGTATM
CAATTCAATA TAACGAACGA TAGACCGGAC GTTAACTGGC CTGTCTTrGG AAAACAAGAT ATGGTCAAAT TTTACCAGAA GCGATAATTC ACTCGGCCGG ATTCCTTATC GATTAATCGG ATTCGCAGCA GTATCCAACA CTAAACAAAC AACCGATTTT GTGAAATGCG TAGTATGTCC TTTATGGCAC GTTAACAGAA GTCAAGAAAT TAAGAATCCT TGGAAGACTA TTTGATATT ATAGCTATT AAGAATAAAT ATCTATCATr AAGTACACCT AATATATTGA ACCAATTAGT ATGATCAAAA TGGTGCCTAT AACCGATTCA AAATTTAGAA CAAAAGAAAT TTATITTTAT ATGGITCAGT TATTAAGTTC TTAATTATrGT GCCAGATACT TAACAATCAG TGCTGAAGGT GTGGTTCTGA ACGAACACTI' AT==1TTGA CTCACTA.AC ACAT=TATGA TGTTTCAAAC GTCAATACTT TGATAAACCA ACGAAAATAC CAATAGTTAC AGTATAAAAA CGCCAATGGC AAGAACCACA ATCGAATAAT TAGACATI' GAGTGCAACA CAGTAACAAC AAAAAATATT TAACACCAAA AGGCACAACC TGUCCAGAAGA TGCTTATTCA ACTATACATI' GACAGAAAAC AAATCTATAC AGAA.ACAACA AAAAAGTCAC CACTGCGCCA TTI'ATTTAGC AACTGCCACA 201 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.fiaecalis Genes.
TICTACACGC
ATTGATAATG
=TGAT-GCCA
TCCGTGGATG
CTAAAAATTC
ATTTATGTCG
GGTCAACACG
GGCGTTTATC
AAACCCGACC
AAACCAGAAG
AAAATTGATG
AAAAATGGCA
GTTAAAGATA
GCAACAGACA
GATACTAGCA
GCTAATGTGA
GTGGGCGATA
GATGTCCCGT
TATGAAATI'G
GACAGTCAGT
GATAACTTCG
CTTCAGGGAA
ACAAAAGAAG
ACAACGATT
AAA.ACAGGTC
ATA.ACGTAGA
TCACGCATAC
TCATGACAGG
AAGAGTATTT
CATTAACGAA
GCGATTCATG
TTCCCTI'CGA
CAA ETA 1TA
AATCTAAGT
ATAATTTCGT
TTCAGGGAAC
NAAAAGAAGC
CAACGATTTA
AAACAGGCCA
AAGCAGGCGT
CTGTCAAACC
AATGGGAACC
TTGAAAAAAT
TCTATAAAAA
TAGAGGTCAA
TTTCCGCGAC
CAGTGAATC-T
CGA.AAGCAAT
ATGTCGGCGA
AAGATGTCCC
GGATGAAAAC
AGCAACCGAA
TGAITATACT
GACAGGAAAA
AACAATTGAT
GAAACCAGA.A
AAAAATCACT
CAGTGACGAA
AGAGGTCAAA
TTCAGCCACA
CAAGCAATTG
TTrTACAACAG
TI'GCGAGTAA
GCCATTAAC
CACAGTCGTT
GAGAACTTTG
GT~rCAGGTC
GGTAA.AGAAG
GATACAACGA
GACAAAACAG
AGTGAATG ET GATAAAATAG GAAAGCAATC GTCATrGTCC TGTTGGTGAT TCGTCGAAAC AGACGTTCCG TTTGAAAAkAA TTATCCAATC GITTACAGTT CGACCAATCT AAGTTAGAGG AGAAGATAAT TTCGTTTCAC TCACGTTCAG GGAACAGTGA TCGCACAAAA GAAGCGAAAC AGATACAACA A=rATGTG AGACAAAACA GGTCAAGACG TCATAAAATA GGCGATTATC CGTTCATGTC CGTGATGATA
CGAAGGTTTC
ATGAAAAAGG
CGAATGTAkCC T'3GTCAAAGG
TACAAGTCAA
=TCAGCAAC
AAGTTGATAA
AAACAGCCTA
'ITTATGTTGG
GTCAAGACT
GCGATTATGA
GTGATCACAG
CAGAAGATAA
TCACT-CTTIC
ACGAAGGTA.A
TTAAAGATAC
CAACAGACAA
ATG=GATAA
CAATCGTTCA
GTGATAAATG
TTCCGITI'A
AAATGTCTA
GTCGTI'TACA
TTTTGAACTA
TCAATACTCC
GCAGGAATAT
AGACAACCAA
AGATTCAACG
AGATAAAACA
CANCAAAGCA
TGTGACCGTC
TGATTCGTGG
NCCGTTTGAA
AATTGTCTAT
TCAGTTAGAG
TTCGTI'TCA
AGGTCAAGTT
AGAAGAAACA
AACGATTTAT
AACAGGTCAA
AATAGGCGAT
TGTCCGTGAT
GGAAGCAGAA
AAA.AATTGAT
TAAAAATGGC
AGTCAAGGAT
TTCNTGGANA
ATTCGAAAAA
CCAGALAGNGA
ATCACTGTT
ACTTTGTTTC AGCNACAGAT 9 9 @9 .9 9 9 99 9 9 .9 99 9. 9 9 99 9 9 9 EF104-2 (SEQ ID NO:402) M4KKKTF SFVMLSILLA QNPGFAVNAY AVTTTEAQTE TTOTAKEAE LSNSTPSLPL 99*9 9 9 *9*9 9. 9 *9 ATTTTSEMNQ PTATTESQTT ELLIJSMSLAP IGGTEYSQTE TVSCEVTG IG ATAVDAATK KTYSFDLYEQ VEPIQYNERT LEPQVFSYDV DINGQILPET IYLESASDYN YLYSQQYPTT ISFQSKGKYY VTIYGTLTET TEGGKLTLTA TKDSYLRINI VNPLNAETAW GNYDQNGAYS KLGTDYTVTP TSDGSVIKFT TPVDTTVTTN SKRGSERTLQ QVDS IYPQYW DRGQYFDKPM IDVPTLYITG TAREPQSNN NKTHRVKNPT IELTPKGTTN TIEYNTVSAN AGQ IYTETTI YTHNVEDENQ AIAKVSFELI VDEEYLTGKA IKLVKGDNQL QDVPFEKITV SGQVDNXKAG PEDNFVSATD KTGQDVPFEK KDTTIYVGDS WKPEDNFVSA NVTVKPDQSK LEVKDTTIYV E IVYKNGTKE AKAIVHVRDD
EASTTASSDA
VHRELNTrPV
TLSITFNQRV
RTTGLOGEIF
QTLLTPGKDY
KIGSISLKST
KVGQQIVLES
SDLTMDFDKK
SRTTVSVMGS
TPITNE IQI P
SSKNQFLVNA
TPNSPGYPTI
EGSASVSVQN
AQIDLNS ITV
DSETLNQMSA
DNVTHTATEF
KI PLTKTIDH VYPI IYSDEG
IDVQGTVNVD
TDKTGQDVPF
GDKWEPEDNF
SQLEVKDTTI
ATPSEQQTI'E
TATFQFAVGN
SDTSFKVE:LK
YNLDRTLMN
TLSDNSLGRI
TGTKQTTDFT
TNGQE IKNPK
DINLSLSTPV
REKP IQNLEI
IGFNYVPDSL
RNDSFDSLSV
TFDENTNSYT
EALDILSATQ
KGVPEIJAYSL
SKXKVTTAP I TrDEKGQYSF
SRLQVKDSTI
KEETA'xVIVK
KIGDYEIVYX
EKITVSGQVD
DKDTSLNEKA
TGYARGSVYT
SYLTTEAEPL
QTLELLTTET
AVTVPNMNQQ
AKTSQTSKVI
FTAYGPLYEN
IGPiWKAIQLV
KVKHPNYLSL
PKDKSIPVDT
RTXI PAGADV
FDFGKTNKRY
AANPTLKNVT
EKT'INGAKVI
TLKFSEGDAE
DAIMTGOYTL
YVGDSWKPEE
PDQSKLEVKD
NGXKEAKAIV
TSICAGVYP IV
LPDVQAPITD
VQLPEHLGYS
IKIETPGKNK
PGAVFGKQDN
KAYSLS INRT
ADREMRSMSY
VX<LEDYFDIK
SDQYIEPISV
RATKEIYFYY
IPITMSAEGL
LFDIYDVSND
I IEYKNANGW KTTVTTKNI D
FKDYTLTENI
GIVYLATATF
RVTNVPQEYS
NFVSATDKTG-
TTIYVGDSWI<
HVRDDSQLEV
YSYEGKEETA
TVNVDKIGDY
QDVPFEKIDV
EXNFVSATDK
VSATDKTGQD VPFEKIDVQG YVGDKWEAED NFVSATDKTG DDSRLQVKDT TIYVGDSWXP QGIVNVDKIG DYE IVYXKNGT KEAKAIVHVR TGQDVPFEKI TV 202 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faccalis Genes.
EF104-3 (SEQ ID NO:403) TGTAA CAACGACAGA AGCACAAACA 0 0 0s 0* 0 0 *0 00 0 0 0S 00 0 0 000 0 00 0 0 000 0 *04000 0 0000 9000 0000 0
GAGACCACTG
TTAGCAACA-A
ACAGAGGCGA
GAGGACAAGG
GATGAACTAC
GAGGTTCACC
AACACAGGTT
TCAACTGTCA
AAAACATTAA
AAAAGTTATC
AAAAAAACCT
ACCAGAACGA
AATCAAACAT
AACTTCGAAC
ACGCAAACCT
ATTGCTGTAA
ACAA'ITTATT
ACAAAAATTG
ACTGCTAkAGA
TATATCAGTT
ACAAAAGTGG
AAATTTACGG
AAAACTGAAG
ATTCTGATT
GTAATTGGTC
GT'rGTTAATC
TCATCAAGAA
A'FIAAAGTAA
TACAAGTTAG
ACTACGCCAA
TTGCCAAAAG
TTAACTCCAG
CAAAGTAGTA
GTCCGTACAA
GATCAGGTAG
ATGACGCCAA
ACGTTTGATT
TGGATCGACG
AATGAAGGCT
CA.AGCGGCGA
GATAATAAAA
AATCCTCAAA
TTAGAGA.AGA
ATTACGATTG
ATCGACTCTG
ATCACATTGA
TTCTACACGC
ATTGATAATC
TTrGCATGCC~b ATACAGCAAA
CGACTACI'C
GCACAACAGC TI ACACCTCACT
TTGACAGTATC
GCGAATTAAA
ATGCACCTGG IZ
GCGGAGAAGTC
GTATTACGTT
TA.ACAACAGA
ACTCGTTTGA
CGGGGTTAGA
TAGAATTATT
CTCAAGTTTT
TGTTAACACC
CqTGTTCCAAA
TAGACTC
GGTCAATIIC
CGAGTCAAAC
TTCAAAGCAA
GTCAACAAAT
CGTATGGTCC
GTGGCAAGCT
TAACAATGGA
CTAATAA-AGC
CTTTGAATGC
CAACTGTCTC
AGCATCCTAA
GAACGGATTA
TAACCAACGA
ATAAAAGTAT
TTGATACGAC
AAAATCAATT
AAATTCCAC
ATTCAATTTA
ACAGCCCTGG
TTrGGAAAAAC
TGCCAACTCT
CTGCTTCGGT
ATCCAACATT
CACATCGTGT
TCGATTTGAA
CTACAAACGG
AATACAATAC
AAACAI'GAA
AA ETCTCAGA
ATAACGTAGA
TCACGCATAC
TCATGACAGG
AAAGAGGCA G iGAAATGAAT C ~TCCAGTGAT G 'AATGAAAAA G ;AGTCTTGCG C ACAACACCG G .TCAG'ITTAT A
ACAGGCATT
['AATCAACGAC
.GCGGAACCA1 IrTATATGAA C rGGCGAAATT I kACAACAGAG CAGTTACGAT C rCGCAAAGAT
:ATGAATCAA
TTCGGACTAT
TTTGAAAAGTI
AAGTAAAGTA
AGGGAAATAT
CGTATTAGAG
T'IATATGAA
CACTT.TAACG
TTTTGACAAC
CATTCAATI'A
TGAAACTGCT
AGTTATGGGA
TTATCTI=CA
TACAGTAACG
AATCCAALATT
CCCAGTCQAT
AGTAACTACT
CCTTGTCAAT
TGGCGCCGAT
TCCACAATAC
ATATCCAACG
CAACAAACGT
TTATATAACA
7rCTGTTCAA
AAAAAATGTA
GAAPLAATCCA
TTCTATTACC
TGCGAAAGTC
GGTCTCTGCA
CCAGATGTCT
AGGTCATGCG
GGATGAAAAC
AGCAACCGAA
TGATTATACT
AGTTATCGA ACTCAACACC ATCTI'ACCT AACCAACTC CAACAACTGA ATCGCAAACC CTMCTACAC CATCTGAACA ACAAACAACG CCCTGCCAG ATGTTCAAGC GCCAATTACA :CGA'ITGGTC. GAACAGAATA CAGCCAAACA ;TAACCGCTA CGTTCCAATT TGCTGTTGGA CAGT-CAAT TACCAGAACA TrAGGTTAT GCGCAACTT GGCCTCGA TCCGGCGACC ;TTCAGATA CTTCCTTTAA AGTAGA.ACTA r'AATCAAAA TTGAAACTCC AGGAAAAAAT .AACTGGAAC CAA=,tAATA TAACGAACG-A ~TTTATAATT TAGACCGGAC GTTAACTG'GC ~CGCCAGGCG CTGTC-TTTGG AAAACAAGAT ;TCGACATTA ATGGTCAAAT TTTACCAGAA CATACATTAA GCGATAATTC ACTCGGGCGG :AAAAAGCCT ATTCCTTATC GATTAATCGG kACTACTTAT ATICG-CAGCA GTATCCAACA kCGACAGGAA CTAAACAAAC AACCGXTIT %TTGCTGATC GTGAAATGCG TAGTATGTCC L'ATGTAACAA T=ATGGCAC GTTAACAGAA AGTACAAACGCGTCALAGAAAT TAAGAATCCT A4ATGTAAAAT TGGAAGACTA TTTTGATATT GCCACAAAAG ATAGCTATTT AAGAATAAAT A.AGGACATTA ATCTATCA'IT AAGTACACCT GTATCCGAtC A.ATATAITTA ACCAATTACT TGGGGTAATT ATGATCAAAA TGGTGCCTAT AGCAAAGAGA AACCG-ATCA AAATTrAGAA TTACGAGCTA CAAAAGAAAT TTAT'TMAT CCAACGTCAG ATGGTTCAGT TATTAAGTTC CCAATTGGTT TTAATTATGT GCCAGATAGT ACGATACCGA TAACA.ATGAG TGCTC-AAGGT AATAGTAAGC GTGGTTCTGA ACGAACACTT GCACGAAATG A IrCTTI'IGA CTCACTAAGC GTTCTT7=T ACATTTATGA TGTTTCAAAC TGGGACCGCG GTCAATACT TGATAAACCA ATTACTTTTG ACGAAAATAC CAATAGTTAC TACATTATTG AGTATAAAAA CGCCAATGGC GGGACAGCGA AAGAACCACA ATCGAATAAT AATGAAGCGT TAGACATII GAGTGCAACA ACAAAAACGA CAGTAACAAC AAAAAATATT ACGATTGAAT TAACACCAAA AGG-CACAACC GTGAAAGGCG TGCCAGAAGA TGCTTATI'CA ATTrTTTAAAG ACTATACATT GACAGA4AAC AACGCTGGCC AAATCTATAC AGAAACAACA GCTAGC:AAGA AAA.AAGTCAC CACTGCGCCA GAAGCTATTG TrATFTAGC AACTGCCACA CAAGCAATTG CGAAGGTTrC I=ITGAkACTA TTTACAACAG ATGAAAAAGG TCAATACTCC TTGCGALGTAA CGAATGTACC GCAGGAATAT GCCATTAACC TGGTCAAAGG AGACAACCAA CACAGTCGTT TACALAGTCAA AGATTCAACG TCCGT'GGATG AAGAGTATTT GACAGGAAAA CTAAAAATTC CATTAACGAA AACAATTGAT TABLE 1. Nucleotide and Amino Acid Sequences of E.jftecalis Genes.
ATTTATGTCG
GGTCAAGACG
GCCGTIATC
AAACCCCACC
AAACCAGAAG
AAAATTGATG
AAAAATCCCA
GTTAAAGATA
GCAACAGACA
CATACTACCA
GCTAATGTGA
GTGGGCGATA
GATGTCCCGT
TATGAAAI-rG
GACAGTCAGT
GATAAC ETCG
GTTCAGCAA
ACA.AAAGAAG
ACA.ACGAT
AA.AACAGGTC
GCGATTCATG
TTCCCTTCGA
CAATTAMTTA
AATCTAAGTT
ATAATTTCGT
TTCAGGGAAC
NAAAAGAAC
CAACGATTA
AAACAGGCCA
AAGCAGGCGT
CTGTCAAACC
AATGGGAACC
TTGAAAAA.AT
TCTATAAAA.A
TAGAGGTCAA
TTTCCGCGAC
CAGTGAATGT
CGA-AAGCAAT
ATGTCGGCGA
AAGATGTCCC
GAAACCAGAA
AAAAATCACT
CAGTGACGAA
AGAGGTCAAA
TTCAGCGACA
AGTGAATGTT
GAAAGCAATC
TGTT-GGTGAT
AGACGTTCCG
TrTATCCAATC
CGACCAATCT
AGAAGATAAT
TGACGI'CAG
TGGCACAAAA
AGATACAACA
AGACAAAACA
TGATAAAATA
CGTTCATGTC
TTCNTGGANA
ATTC
GAGAACTTT
GTCAGGTC
GGTAAAGAAG
GATACAACGA
GACAAAACAG
GATAAAATAG
GTTCAMTCC
TCGTGGAAAC
TTTGAAAAAA
GTT*ACAGTT
AAGTTAGAGG
7T"CGTTTCAG
GGAACAGTGA
GAAGCGAAAG
AT=ATTGG
GGTCAAGACG
GGCGATIATG
CGTGAT--ATA
CCAGAAGNGA
TTTCAGCAAC
AAGTTGATAA
AA.ACAGCCTA
TTTATGTTGG
GTCAAGACGT
GCGATTATGA
GTGATGACAG
CAGAAGATAA
TCACTGTTTC
ACGAAGGTAA
TTAAAGATAC
CAACAGACAA
ATGTTGATAA
CA-ATCdTTCA
GTGATW~TG
TTCCGTTTGA
AAATTGTCTA
GTCGTTTACA
ACT=TTTC
AGATAAAACA
CANCAAAGCA
TGTGACCGTC
TGATTCGTGG
NCCGIT=AA
AATTGTCTAT
TCAGTTAGAG
TTTCGTTTCA
AGGTCAAGTT
AGAAGAAACA
AACGATTI'AT
AACAGGTCAA
AATAGGCGAT
TGTCCGTCAT
GGA.AGCAGAA
AAAAATTGAT
TAAAAATGGC
AGTCAAGGAT
AGCNACAGAT
EF104-4 (SEQ ID NO:404) VTTTE-AQTE TrDTAKKEAE LSNSTPSLPL 0@ 0S S 0S
S.
S. S 0O @0
S
@0 @5 S 0
S
S.
0@ S 0 0
S
OS@9 0
OS..
S.
SSSS
0 *0e0 4 000@
OOSS
0 0S@S 0e 0 00
S.
ATTTTSEMNQ
ELLDSMSLAP
TVSGEVTG IG
KTYSFDLDYEQ
LEPQVFSYDV
IYLESASDYN
ISFQSKGKYY
TEGGKLTLTA
VNPELNAETAW
KLGTDY'IVTP
TPVDTTVTTN
QVDS IYPQYW
IDVPTLYITG
NKTHRVKNPT
TIEYNTVSAN
YTHNVEDENQ
VDEEYLTGKA
QDVPFEKITV
PEDNFVSATD
KDTTIYVGDS
NVTVKPDQSK
EIVYKNGTKE
QGTVNVDK IG
TGQDVPF
PTATTESQTT
IGGTEYSQTE-
ATWAVDAATK
VEPIQYNERT
DINGQILPET
YLYSQQYPTT
VTIYGTLTET
TKDSYLRINI
GNYDQNGAYS
TSDGSVIKFT
SKRGSERTLQ
DRGQYFDKPM
TAKEPQSNNN
IELTPKGTTN
AGQIYTETTI
AIAKVSFELI
IKLVKGDNQL
SGQVDNXKAG
KTGQDVPFEK
WKPEDNSFVSA
LEVKDTTIYV
AYXAIVHVRDD
DYE IVYKNGT
EASTTASSDA
VH-RELNTTPV
TLS ITFNQRV
RTTGLDGEIF
QTLLTPGKDY
KIGSISLKST
KVGQQ IVLES
SOLTMDFDKK
SRTTVSVMGS
TPITNEIQIP
SSKNQFLVNA
TPNSPCYPTI
EGSASVSVQN
AQIDLNSITV
DSETLNQMSA
DNVTHTATEF
KIPLTKTIDH
VYPIIYSDEG
IDVQGTVNVD
TDKTGQDVPF
GDKWEPEDNF
SQLEVKDTTI
KEAKAIVHVR
ATPSEQQTTE
TATFQFAVGN
SDTSFKVELK
YNLDRTLTGN
TLSDNSLGRI
TGTKQ~rDFT
TNGQEIKNPK
DINLSLSTPV
KEKPIQNLEI
IGFNYVPDSL
RNDSFD)SLSV
TFDENTNSYT
EALDILSATQ
KGVPEDAYSL
SKKKVI'TAPI
TTDELKGQYSF
SRLQVKDSTI
KEETAYV'rVK
KIGDYEIVYK
EKITVSGQVD
VSATDKTGQD
YVGDKWEAED
DDSRLQVKDT
DKJ)TSL.NEKA
TGYAPGSVYT
SYLTTE-AEPL
QTLELLTTET
AVTVPNb'NQQ
AI(TSQTSIZVI
FTAYGPLYEN
IGPNKAIQLV
KVKHPNYLSL
PKDJKS IPVDT RTKI PAGADV
FDFGKTNKRY
AANPTLKNVT
EKTTNGAKVI
TLKFSEGDAE
DAIMTGDYTL
YVGDSWKPEE
PDQSKLEVKD
NGXKEAKAIV
TSKAGVYPIV
VPFEKIDVQG
NEVSATDKTG
TIYVGIDSWXP
LPDVQAP lTD VQLiPEHLGYS
IKIETPCKNK
PGAVFGKQDN
KAYSLS INRT
ADREMRSMSY
VKLEDYFDIK
SDQYIEPISV
RATKEIYFYY
IP ITMSAEGL
LFDIYDVSND
I IEYKNANGW KT'IVTTKNI U
FI(DYTLTENI
GIVYLATATF
RVTNVPQEYS
NFVSATD)KTG
TrIYVGDSWK
HVRDDSQLEV
YSYEGKEETA
IVNVDKIGDY
QDVPFE I DV
E-XNFVSATDK
EF105-1 (SEQ ID NO:405)
TAAATGAAAA
G TCCrCTG
AGCACAGATO
AAACAGTCGT
AAGCGGCGAC
GGACGGATCC
CTACTCCTTG TTATTCGGAA CAATGTTGCT TGGCGCCACT GGTCGTTI'T GATAGCGAAC AGTCGATTGT TTTTACCCCA AGTAAATCCA GAAAATCCCG ATCCAGAAAA ACCAGTTCGA 204 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faccalis Genes.
CCAGTCGATC
GATTATGCCT
TTTGCCAGAG
AATTATGTAC
CAAAATGGTC
GCCITACTG
GCAAACATTC
ACCGGCGCCG
CAACAAGGAC
AAAACCCCTA
CCAGTAAATA
CAACGAATCC
CAAGTTTCA
CGCAAACCTA
AkAGTAAGTGA
AATTTCGTAA
AGCCCAGTGT
AATTAGATGC
GAACGTGGAT
AGCAAGTAAA
AGGATGCAGT
ATGGAGGGAA
TGATGGACCT
TTTTGGGAGT
TAGAAATCCA
=TIACGGGGA
TGCAGAAACA
TCGCTCAAAT
TGCGGGCGCA
CACGCTG'GC
TGCCACAATC
ACAATATAAA
ATAA
AATCCAGGTA
AATGAGATAT
GATGGTTCAG
ACCAATGCTG
TTACACA-AAG
GCGACGGACG
GAAACTGTI'
GGGCAAGCAG
ACACGGGCAA
ACAACATTGA
CCCCTGGTCC ACICCATC
CGAATAAGGA
C:AAGTGAATT
GCTGGGTTTT
AATTAACAGG
TA'ITGCCGCC
TCATGCAAGC
AAAAACTGAC
TCTCACTAAC
CT=GCTACT
TCAAACGTAT
GGCAACTGCT
AAAAGTGAAA
CGCCACCOTC
AACTGCTACC
CCCAGAAAAG
CGAAAAAAAT
TGTTCCTGGG
TCAGATGTA
EF1OS-2 (SEQ ID NO:406)
MKKTVVYSLL
VDPTNPDGPN
YVQVSDLRGT
NIQLDAAGAE
TPKDAVQYKT
FGTMLLGATV
PGTPGPLS ID
NAGWVLKVKQ
TVVMQAPEKT
TLTWLLSDVP
PAEAATVVFD
YASSLDFGSN
NGQFRNAETL
GACTWITLWG
VNNGGK
SEQSIVFTPS
El SNXDQTYF
HKELTGAEVA
QAEKVTEKNQ
TOGTDPVNPE
ARAQTYRNPD
FTEPSVRSNA
QGQQVNATIT
NPDPEKPVRP
GSASELATAN
TDVLPPTATA
RAISLTVPGK
EF105-3 (SEQ ID NO:407) GGCGAC GGTCGTITTT GATAGCGAAC AGTCGAT1TGT TTTTACCCCA
ACCACAGATG
CCAGTCGATC
GATTATGCCT
TTGCCAGAG
AATTATGTAC
CAAAATGGTC
GCCTTTACTG
GCAAACAITC
ACCGGCGCCG
CAACAAGGAC
AAA.ACCCCTA
GGACGGATCC
CAACGAATCC
CAAGTTTGGA
CG-CAAACCTA
AAGTAAGTGA
AA=CGTAA
AGCCCAGTGT
AATTAGATC
GAACGTGGAT
AGCAAGTAAA
AGGATGCAGT
AGTAAATCCA
TGATGGACCT
TTTTGGGAGT
TAGAAATCCA
TTTACGGGGA
TGCAGAAACA
TCGCTCAAAT
TGCGGGCGCA
CACGCTGTGG
TGCCACAATC
AC
GAAAATCCCG
AATCCAGGTA
AATCAGATATI
GATGGTTCAG
ACCAATGCTG
TTACACAAAG
GCGACGGACG
GA-AACTG LTG
GGGCAAGCAG
ACACGGGCAA
ATCCAGAAAA
CCCCTGGTCC
CGAATAAGGA
CAACTGAATT
GCTGGGTTTT
AATTAACAGG
TATTGCCGCCC
TCATGCAAGC
AAAAAGTGAC
TCTCACTAAC
AC CAGTTC GA
ACTITTCCATC
TCAAACGTAT
CCCAACTGCT
AAAAGTGAAA
CGCCACCGTC
AACTGCTACC
CCCAGAAAAG
CGAAAAAAAT
TGTTCCTGGG
S EP105-4 (SEQ ID NO:4D8) ATVVFD SEQSIVFTPS TDGTDPVNPE NPDPEKPVRP VDPTNPDGPN PGTPGPLSID YASSLDFGSN EISNXDQTYF ARAQTYRNPD GSASELATAN YVQVSDLRGT NAGWVLKVKQ NGQFRNAETL HXELTGATVA FTEPSVRSNA TDVLPPTATA NIQLDAAGAE TVVM~QAPEKT GAGTWITLWG QAEKVTEKNQ QGQQVNATIT RAISL'IVPGK
TPKDAV
EF106-1 (SEQ ID NO:409)
TAGTCGTTTA
TTAGT'I-GTrG
AATCCCAATA
GAAGGATTAT
ITAACAGATG
TATGAATTI
ACCGCTGGTA
GGTGTATACC
GATAAAAACG
TGAAGAAAAA
GAGCAGGAGG
A=~AGGGGA
CAGAAAAAGA
AAAATCAAAA
TTGATAAAGA
TTACAGATGT
CACI'AGAAC
GAAATGATAT
AATCGTTGGT
G.GCTrTGACG
ITTACCTGAG
'TGGCCTGCA
CCTATTACAT
GACTGGAGAA
TTCCCGTCTT
TTTATACCAA
TGAAGTTTTC
ACAATTACGT
GCAGAAGCAT
TATTTACGTT
ACACGCGTT
GCAATCAAAT
TCAACAGGAG
GTAA'TATT
GATACTTTITA
GTAGAAAACA
TGTGCCTTT
ACGTTCCTCA
CAGTI'GGTAT
ATGATCGAAA
TTGATGCAAC
ATGAAGGAAC
CTGAAACCAA
CGTATAGACA
AAGCAACCTC
AAGTGCGTTA
AAGCGTAGAC
TAGACAAGAT
TGGGAATGAC
CACTAGTTTC
CTTCT=ATG
AAATTATCA.A
GATrOGGGAMA
AGGACCAGTT
205 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
TATGGTCGTC
CCCCGTGCCA
ATTGGTAAAA
CATAAAGATG
GACCA.AGATA
TATGGCTACT
AATCGCTATG
CGAATGGGTA
CAAGGAACTT
CGCAGCCATA
TGACAGAACA
CTTCCTTTGA
GCAATGACGT
CCCAAATA
'rrAAAGTAGG
GCGCTGTCTT
AAGATAACGA
TTAATCCTGA
CCCCAATAAT CGTCCCAGAA AACAGGCCAG ATTGATGTAA TGGGACACCG CAACTTCTTT AACTTCGGCC AACCAAAACT TGAATTTTTC AATTTACAAA AAATCAAAT AAATrCCGTO AGAGTTAACA GAArrGAATG AGGAAACGAT ATCCAAGTCT ATTTACCI'T TAA
CACTAGAAT
ATCGACAAGG
GGAATGCCAC
TATCAGCTT
CTGGTGAAAC
CCCATGTTTC
ATAATCGTTT
AtGTGGAACA
CACGAATGGA
GGATGAAATT
AAAAGTAGTC
AGCGAAATTT
TCGTIGGCGAC
CATTCCAACC
TACGTACACA
TGAACCATAC
EF106-2 (SEQ ID NO:410) MKKKIVGT ITLLALSALL VGGAGGALTA E-AYVPQSVDN PNNLGDLPEY LRSVGIRQDE GLSEKDWAGT RVYDRNGNDL TDENQNLLH-A IKFDATTSFY EFFDK8TGES TGDEGTFFMT AGITDVSRLV IISETKNYQG VYPLRTLYQD TFTYRQMGKD KNG-NDIEVFV ENKATSGPVY GRPQPYPNNR PRTLEFTNGR RANTEQTGQI DVNRQGDEII GKTSFOGTPQ LLWNGTKVVD KDGNDVTSAN QNFISLAXFD QDSSKYEFFN LQTGETRGDY GYFXVGNQNI( FRAHVSIGTN RYCAVLELTE LNDNRFTYTR MGKDNEGNDI QVYVEHEPYQ GTFNPEFTF EF10O6-3 (SEQ ID NO:412.)
S.
S S
S
S.
S
S
555555
S
S
55S5 5555 5555
S
5*55
S
AT ACGTTCCTCA AAGCGTAGAC
AATCCCAATA
GAAGGATTAT
TTAACAGATG
TATGAATTTT
ACCGCTGCTA
GGTGTATACC
GATAAAAACG
TATGGTCGTC
CGCCGTGCCA
ATTGGTAAAA
GATAAAGATG
GACCAAGATA
TATGGCTACT
AATCGCTATG
CGAATCGTA
CAAGGAACTT
ATTTAGGGGA
CAGAAAAAGA
AAAATCAAAA
TTGATAAAGA
TTACAGATGT
CACTTAGAAC
GAAATGATAT
CGCAGCCATA
TGACAGAACA
CTTCCTTTGA
GCAATGACGT
GCAGCAAATA
TTAAAGTAGG
GCGCTGTCTT
AAGATAACGA
TTTACCTGAG
TTGGGCTGGA
CCTATTACAT
GACTGGAGAA
TTCCCGTCTT
TTTATACCAA
TGAAGTTTTC
CCCCAATA.AT
AACAGGCCAG
TGGGACACCG
AACTTCGGCC
TGAATTTTTC
AAATCAAA.AT
AGAG'ITAACA
AGGAAACGAT
TATTTACGTT
ACACGCGTTT
GCAATCAAAT
TCAACAGGAG
GTAATTATTT
GATACTTTTA
GTAGAAA.ACA
CGTCCCAGA.A
ATTGATGTAA
CAACTTCT
AACCAAAACT
AAT ITACAAA
AAATTCCCTG
GAATTGAATG
ATCCAAGTCT
CAGTTGGTAT
ATGATCGAAA
TTC-ATGCAAC
AT--AAGGAAC
CTGAAACCAA
CGTATAGACA
AAGCA.ACCTC
CACTAGAAT
ATCGACAAGG
GGAATGGCAC
TTATCAGCIT
CTGGTGAAAC
CCCATG'TTC
ATAATCGTrT
ATGTGGAACA
TAGACAAGAT
TGGGAATGAC
CACTAGTTTC
CTTCTTTATG
AAATTATCAA
GATGGGGAAA
AGGACCAGTT
CACGAATGGA
GGATGAAATT
AAAAGTAGTG
AGCGAAATTT
TCGTGGCGAC
CATTGGAACC
TACGTACACA
TGAACCATAC
EF106-4 (SEQ ID NO:412) YVPQSVDN PNNLGDLPEY LRSVGIRQDE GLSEKDWACT RVYDRNGNDL TDENQNLLHA AGITDVSRLV IISETKNYQG IJYPLRTLYQD GRPQPYPNNR PRTLEFTNGR RAMTEQTGQI KDGNDVTSAN QUFISLAKFD QDSSKYEFFN RYGAVLELTE LNDNRFTYTR MGKDNEGNDI
IKFDATTSFY
TFTYRQMCKD
DVNRQGDEI I
LQTGETRGDY
QVYVEHEPYQ
EFFDKETGES
KNGNDIEVFV
GKTSFDGTPQ
GYFKVGNQNK
GT
TGDEGTFFMT
ENKATSGPVY
LLWNCGTKVVD
FRAHVS IGTN EF107-1 (SEQ ID NO:413)
TAAAAAACGG
ATANATANAA
CTTTATAGAG
AATTGGAAAA
CACTCAATAT
AAATGCTAGT
ACTATAGATT
GATGGCTACT
GTCAAAA TT
TATCAGTATC
GAATTF1TTAC
TGTTGGGTTA
GAAATrTCAA
GATAATAACA
ATAGAAAGAA
AGTTGTTCTT
GCTGTCGT C?1'TCCTAAA GGATACTGAT TAAGAAAGGA GGAGCAAGAT GAAGCGAGTA TGTTCATGGA TTCAGTrGCTT 206 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
a a a a a a a.
GGTGTGACTG
CAGTCGAOTG
GCTAGTGAAG
GG TATTCT
TTGTTTGATC
GAGAAATTAC
ATTCCAACGT
GGCGTTTCTG
AGTACGTTTT
GGGGGCAAAG
GAAAAAATGC
ACGGCACCAG
ACTAGTTCGC
CGTGTCACGG
ACGCAGGATA
TTGCCTGAAC
GGGAATGCGA
GATCACAATG
CCAGCGAGAG
AAGTTTAAGC
TTTCCCACAA
CAAAAAGTCT
AGTGTTGAAA
GGGGATAAAA
AACA=Tr TA
CGCTATACTG
AGTATCTATT
TATTATTACT
ATCACGCCAC
GTCTACACTG
TTCCA.AGGCI
AGTTTTACAC
CCCACCGCGC
GATTACTOOC
AAGCCAACGC
ACGGGTCAA.P
GTGTCCATC;
GGAACCGCCC
AAC'TrGGC;
ACGAGTCCTC
ATT1CAGGAJ
ACTCGCAAC(
CTATCGCAA(
ACGGCCGCTC
GGCAAGAMC
CAACAGTrC
GAAGTGCCA(
TTAGTGACT(
TGTTAGCGGA A ACGAAGCGAG C
CGAAAACAGT
CTGTGGAAGG G CCGACCCAAT AACGAACCCC C CACCAGC TT' AAGTGTCG1' ATACAGGAGA C TGACGAACTA CCAATAAGCT C GAGGCGTTAC C ACCAAAGACG I
AAAAAITGA
AGAAAACCAT
GTTACACAGG
AACCTAGCAC
ACGATTGCA
AAGCTTTGTT
TAAGTGCGAC
TTGATGGAAT
ACACACGCCC
TTCCTAAGTA
CCAAGTATCC
CTAr-TTTAGT
CCCCCCGTTC
CAGGAACTGC
TAGAAAATCG
CAACTGGCTT
TCGCAAAAGC
GGTTTAAAGG
CAACTTTIAA
1AACTAACGTT
AGGCGCTACT
CAACTTCGC
ACACCAAAGC
CGITGGATCA
TTACAGGAAA
SGTTTAACTGC
ACGGCGACGG
k GTAAGCAACA
:CTTATTI'ACG
CAAAATCAGC
CTGCCAGCTT
k. GCACCGTGAC k CAGGCAGTGA 3CCAACCAAGG 3 GCCCCTAA
CAGACGACG
.CCGCCACAG
CGCGAGATT
'ACGATGGCG
CTI'GAGCCC
,'AAATATGAT
'ACTATTCCT
:ACTACGCAA
MTGGGCGCC
;ACGTTTGGT
TATGCGCTrG
~GGAACAGCG
;AATCCCGCA
L'CTAACAGGG
:AGTGATGGG
3TTGGA.AACC rGTGGTCTAT
CGGCTATGTT
CATGGGTOGAA
:GATATGCCA
AGACGATGGC
F1ACCAGACC
AGTACCAAAM
GGGAANCNCI
TI'ACTGGANC
TGGGGGCAAC
GCGGGTCACIJ
TACACAAGG;
TTrTGCCGAAC
CAAAACCAAC
TG-ATGAGGA(
AACAGGTGCC
GAAGAACAC
GGCTGGCAT(
TTTTCCTGTI
GCCTTTACC,
TCCTGGTCA
CAAAGATAC
C ITTATTAG ATATGrGTTT
CCTGAATAC
CACAGACAG
TACCGATTA
'rrrAACCGC
CGTCTATCA
TATGGCTGC
CAAACAACCG
GAAACGGCAA
CCCTTTTTCT
GTCAATCTTI'
ACCACTG-TGA
ACAAATAACC
AAGGAGTATC
TATCCAGTGC
AATGCGGAGA
CICTA.AAAAGA
ACCTATTT 1T
GGTCCTGTGT
GGCGGGGCGA
GAGGATTTTA
A.AGACGTATI'
ACCAAAACGC
GAAGAAGCAG
GATCAGCAAG
AGTGACGGAG
GCAAGTCAAM
ACAATCGTAP
ATTTCGATT
GA.AGTGCGCC
GCGTATAACI
TGGGGCCCC;
TATAATTTAI
7GAACATTTTC kAATCAGCTA( 3ATCTACCAAC 3 CCAGCAACAr.
-GACATGACC(
:GTTGACATAj P, GGCGAAGCGI
ZGGCGCACCC,
:ACCAAAGAA,
A GCTGGCGGT A GTTTTAACC G GTCCGTATT T ACCCCGACA C AAGAAGGCC T AGCCAAGCC C TTGCCAACA .C AACCAACCA C GACAATACC ,G TTGGACTTC C CAACAATAC ACGATTACT GGGGCGACGG
AGCAALGGAGT
AAGAGTCACA
GAATTGCTTC
TTCTGGAGGA
CCACGTTIC
ATGGCGGAAA
AGAATCCAAG
AAAGCCTGGA
CAACGGT
CGGAATTAGA
CG =TAAATA
ATCGAGATGT
ATTATTATTP
TTCCTGCGCC
CTTTACCCA
TATTTAA-AGG
CTAGTTATGC
TGATGAAAAC
GAAACTTGAT
CCACAGGGGA
TAAAGAAATT
*CITrATGGCCCC
CACCAACTAC
*GTGGCATCGA
7TGACCCAAAA
CGA.AGACACT
CGACCCCTGA
TAGACGAAAG
TGGTGGACAC
CTGGTGAAAA
r' TAAAGACGAC
;CTCTCTACC)
k TCGAAAATGC
:CGTTA-ACCAC
4 ACACCATA' C AATGGACGA( C AATTAAAAAM 0 CTGATGTTGJ A AAGACTTAGj T T1'GA'ITTG( G CAGATTACMi A ATTGGAGTT A CGACCCGCT A CAGAAACCAI ZG CAACAGCCGG .A CGTTTGCTA 'C AAGCCGCCG
AGCAACATCT
GGCAACGGTC
TCGAGCGATT
AGACGGGACG
GITTTATGAA
GTTACTGTCT
TAATA=TAT
CATTCGACCA
TGCGAACGTT
GTTAACCAAT
TACGGTAGCT
CGGCGGTCCA
AACAAAGCGG
AGAAGGTTAT
AGAAGGCACC
TTGGTACAAA
GGTGACCtAT
CTATACGTTG
TAATCCCGCC
AATGACGACT
AGCCATCCCG
GCAAGAAGTG
TGCGTATACA
AAACCCCGAC
AAGTGCCACA
TTACTCAATG
TGGCACCATT
TGGCGCAAAA
TGAAAACTAT
AACCTATATC
AACGACCCCA
AGAAGCGATT
CGCCACAATG
CATTAAAATC
7TGTACAAGGA
'CGGTGCAGGA
r GAACTTATTA k AGTAACGGGC k. TCAAGAAATT
TAAACTAGCA
k. CGGCAATGGC r AAcGCCCAG r GTTGCTAGGA
GACACCACT
r GGTCGCAAAC A cATCAAACTA T CACGTGGAAT EF107-2 (SEQ ID NO:414)
MKRVN
WKRWLVVGLS CSLFADSVVG VTVLAETITG ATEQGVATSQ SSDEASQTTQ rI'EESQATVA SEAKTVPPQE TARIASRAIG YSSVEGREIP FFFVEEDGTL FDPDRITMAV N~LSTFSFYEE 207 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
KLQRTPLJEPT
TFYTGDTTQY
APGGVTYALT
QEDKXTIVTGE
DNDDLHVVYE
PTIDCIDMPA
DKTKYPVPNE
IYSGTAGGNY
YTVAXALPK I
TAELTLTGAV
GQNTKAFPVT
FGSLTAKD)TV
RNPYLRLNTS
KTSTVTLTAD
VTGP
TVNGGI(LLS I
PVPTVFANVG
YFYGDVGGPT
DIrFTQEGTL
E-AVM(TYTLP
SQLKKLAIPQ
VRRG IENPDN
NLSTPDGTIY
YQAGEKTYIF
DIIENGATMD
KEQWTTGAGV
RIKDLDQEIT
QANWSLTAQL
NTATAVVANQ
PTSPAFKYDT
GKVTNYVGAN
S SHQRRGTAC
PERYTGSDG(
AREALFGYVD
KVYTRPDDGT
IVSSLVGXXA
YYLENRRVTE
QGWFKGKTKP
YWEALLKNTG
S ITLDQPLPA.
SPIJCDGF 1ST
SQPKSATDSJ
QFTGSDVYQL
NNQNPSNIYG
AETELELTNE
PVYYYLTKRR
TYLFKGWYKG
EQGNLINPAI(
IVTYGPQEJS
YNLTQKSATR
HFVDESGAKI
ATLKTTTTPS
EAPLTTIXIK
JZGQLKMNhLLG PtEFGKLAl
PTTTRLLLGT
OFTFANIKLE
VSEVSFTIPK
KMPNKLTFGP
VTEKPENPAG
NAKPSTLETTI
FKLSATMGES
VEIPKYYQTI
YTARRSYWXW
TP PTGFTQGN
FTPTFNDEDD
PTATWAAG IG
TAVTGNPGQV
SGSKQQYGLK
AAAAS FTOYN
VPANQGMAGQ
EYQSLDIRPS
KKTFKYTVAT
GAl PAP EGYT
KTPSYAVTYD
DGATGEMTTF
S ISPTTAYTG
GPTKTLYSMS
QLVVDSENYV
MTAVYQEAI P
APNTIFVQGT
L'FADVEVTGN
KAADYYGNGT
QPTETRTPLG
QYQAAvTWNL EF107-3 (SEQ ID NO:415) GG AGCAAGGAGT AGCAACATCT
CAGTCGAGTG
GCTAGTGAAG
GGTTATTCTT
TTGrPTGATC
GAGAAATTAC
ATTCCAACGT
GGCGTTTCTG
AGTACGTTT
GGGGGCAAAG
GAAAAAATGC
ACGGCACCAG
ACTAGTTCGC
CGTGTCACGG
ACGCAGGATA
TTGCCTG-AAC
GGGAATGCGA
CATGACAATG
CCACCGAGAG
AAGTTTAAGC
TTTCCCACAA
CAAAAACTCT
AGTGTTGAAA
GGGGATAAAA
AACATrG=r.
CGCTATACTC
AGTATCTATI
TATTATI'ACJ
ATCACGCCAC
GTCTACACTC
TTCCAAGGC'.
AGTTrACA(
CCCACCGCG(
GATTACTGGC
AAGCCAACCC
ACGGGTCAk.
ACGAAGCGAG
cGAAAACAGT
CTGTGGAAG
CCGACCGAAT
AACGAACCCC
CACCAGCTTT
AAGTGTCGTT
ATACAGGAGA
TGACGAACTA
CCAATAAGCT
GAGGCGTrAC
ACCAAACACG
AAAAATIGA
AGAAAACCAT
GTTACACAGG
AACCTAGCAC
ACGATI'TGCA
AAGCTTW T
TAAGTGCGAC
TTGATGGAAT
*ACACACGCCC
CCTAAGTA
CCAAGTATCC
GTAGTTI'AGT
CCCGCCGTTC
CAGGAACTGC
TAGAAAATCC
CAACTGGCTI
TCGCAAAAGC
P~ GGTI'AAAGC :CAACmTTAI 3 AACTAACGT) 3AGGCGCTAC' 3CAACTTGGG( A ACACCAAAG( CCAGACGACG CAAACAACCG ACCGCCACAG
GAAACGGCAA
GCGCGAGATT
CCCTTITCT~
TACGATGGCG GTCAATCT CCTTGAGCCC ACCACTGTGA TAAATATGAT
ACAAATAACC
TACTATTCCT
CACTACGCAA
TGTGGGCGCC
GACG ITTGGT
GTATGCGCTG
AGGAACAGCG
GAATCCCGCA
TGTAACAGGG
CAGTGATGGG
GTGGAAACC
TGTGGTCTAT
CGGCTATGTT
CATGGGTGAA
CGATATGCCA
AGACGATGGG
TTACCAGACC
AGTACCAAAI
GGGAANCNCI1
TTACTGGANC
TGGGGGCAA(
GCGGGTCACr.
TACACAAGGj TT'TGCC5AAC CAAAACCAAi
TCATGAGGAI
SAACAGGTGC
1GAAGAACAC
-GGCTGGCAT
II7rCCTGT TATCCAGTc3C
AATGCGGAGA
CCTAAAAAGA
AC CTA~TT
GGTCCTGTGT
GGCGGGGCGA
GAGGATTTA
AAGACCTATT
ACCAAAACGC
GAAGAAGCAG
GATGAGCAAG
AGTGACGGAG
GCAA.GTCAAM
ACAATCGTA)
*ATTTCGATT1
GAAGTGCGCC
*GCGTATACJ
TGGGGCCCCJ
TATAATrTA' r' GAACATIT( k. AATCAGCTA( 3ATCTACCAAI 3 CCAGCAACA
"GACATGACC
C GTTGACATA A GGCGAAGCG C GGCGCACCC C ACCAAAGAA AAGAGTCACA G( GAATTGCTTC
TC
ITGTGGAGGA AC CCACGTTTTC
G'
ATGGCGGAAA
G'
AGAATCCAAG
T
AAAGCCTGGA C cAACGGTTrT T CGGAATTAGA
C
CGTITAAATAT
ATGGAGATGT
C
ATTATTATT
TTCCTGCGCC
CT'TTACCCA
TATTAAAGG
CTAGTTATGC C
TGATGAAAAC
GAAACTTGAT
CCACAGGGGA
TAAAGAAATT
CTTATGGCCCC
CACCAACTAC
GTGGCATCGA
7 TGACCCAAAA k CGAAGACACT C? CGACCCCTGA 3 TAGACGA;AG~ 3TGGTGGACAG cTGGTGAAAA r TAAAGACGAC S CTGTGTACCA A TCGAAAATGG- C CGTTlAACCAC A ACACGATATT .C AATGGACGAC 3CAACGGTC
CGAGCGATT
3ACGGGACG r TTATGAA
TTACTGTCT
P.ATA'FITAT
ATTCGACCA
GCGAACG'IT
TTAACCAAT
ACGGTAGCT
GGCGGTCCA
ACAAAGCGG
LGAAGGTTAT
LGAAGGCACC
.TGGTACAAA
;CTGACCTAT
TATACGTTG
L'AATCCCGCC
%ATGACGACT
kGCCATCCCG
GCAAGAAGTG-
rGCGTATACA
A.AACCCCGAC
AAGTGCCACA
ITrACTCAATG
TGGCACCATT
TGGCGCAAAA
TGAAA-ACTAT
AACCTATATC
AACGACCCCA
AGAAGCGATT
CGCCACAATG
CATTAAAATC
TGTACA-AGGA
CGGTGCAGGA
208 TABLE 1. Nucleofide and Amino Acid Seqeuences of E. faccalis Genes.
GTGTCCATCA
GGAACCGCCG
AACTTTWGGCA
ACGAGTCCTG
ATI'CAGGAA
ACTCGCAACC
CTATCGCAAC
ACGGCCGCTG
CGCAAGACCA
CAACAGTTCA
GA.AGTGCCAG
TTAGTGACTG
CGTTGGATCA
TTACAGGAAA
GTTAACTGC
ACGGCGACGG
GTAAGCAACA
CTTATTTACG
CAAAATCAGC
CTGCCAOCTT
GCACCGTGAC
CAGGCAGTGA
CCAACCAAGG
GCCCCT
GCCTTTACCA
TCCTGGTCAA
CAAACATACG
CTTTATTAGT
ATATGGTTTG
CCTGAATACT
CACAGACAGC
TACCGATAC
TTAACCGCC
CGTCTATCAG
TATGGCTGGC
GCTGGCGGTC
GT'TIAACCG
GTCCGTA 1TA
ACCCCGACAT
AAGAAGGCCG
AGCCA.AGCCA
TTGCCAACAA
AACCAACCAA
GACAATACCG
TTGGACTTCA
,CAACAATACC
AATTAAAAAT
CTGATGTTGA
AAGACTTAGA
TTGAT=tTGG
CAGATTACTA
ATTGGAGTTTI
CGA--CGCrT
CAGAAACCAG
CAACAGCGGT
CGTTTGCTAA
AAGCCGCCGT
GAACTTATTA
AGTAACGGGC
TCAAGAAATT
TAAACTAGCA
CGGCAATGGC
AACGGCCCAG
GTT-GCTAGGA
GACACCACT
CGTCGCAAAC
CATCAAACTA
CACGTGGAAT
EF107-4 (SEQ ID NJO:416) EQGVATSQ SSDEASQTTQ FTEESQATVA
B
SEAXTVPPQE
KLQRTPLEPT
TFYTGDTTQY
APGGVTYALT
QDKKTIVTGE
DNDDLHVVYE
PTIDGIDMPA
DKTKYPVPNE
IYSGTAGGNY
YTVAKALPKI
TAELTLTGAV
GQNTKAFPVT
FGSLTA(DTJ
RNPYLRLNTS
KTSTVTLTAD
VTGP
TARIASRAIG
TVNGGKLLS I
PVPTIVFANVG
YFYGDVGGPT
DFTFTQEGTL
EA'JM(TYTLP
SQLKIKLAI PQ
VRRGIE-NPDN
NLSTPDGTIY
YQAGEKTYIF
DIIENGATMD
KEQWTTGAGV
RIKDLDQEIT
QANWSLTAQL
NTATAVVANQ
YSSVEGREI P
PTSPAFKYDT
GKVTNYVGAN
SSHQRRGTAG
PERYTGSDGK
AREALFGYVD
KVYTRPDDGT
IVS SLVGXXA
YYLENRRVTE
QGWFKGKTKP
YWEALLKNTG
SITLDQPLPA
S PDGDGFI ST
SQPKSATDSL
QFTGSDVYQL
FFFVEEDGTL
NNQNPSNIYG
AETELELTNE
PVYYYLTKRR
TYLFKGWYKG
EQGNLINPAK
IVTYGPQEVS
YNLTQKSATR
HFVDESGAK I
ATLKTTTTPS
EAPLTTIKIK
GGQLKMNLLG
PTFDFGKLAI
PT±TTRLL-LGT
DFTFANIKLE
FDPIDR ITMAV
VSTEVSFTIPK
KMPNKLTFGP
VT:-KFENPAG
NAKPSTLETT
FKLSATMGES
VEIPKYYQTI
YTARRSYWXW
TPPTGFTQGN
FTPTFNDEDD
PTATWAAGIG
TAVTGNPGQV
SGSKQQYGLK
AAAASFTDYN
VPANQGMAGQ
NLSTFSFYEE
EYQSLDIRPS
KKTFKYTVAT
GAl PAPEGYT
KTPSYAVTYD
DGATGE.MTTF
SISPTTAYTG
GPTKTLYSMS
QLVVDSENYV
MTAVYQE.AIP
APNTI FVQGT
LTADVEVTGN
KAADYYGNGT
QPTETRTPLG
QYQAAVTWNL
EF108-1 (SEQ ID NO:417)
B
B.
TAATCGGTT
CAACGATTAG
ACGGCAG"TG
GAAGAGACGC
CAAGTAACAA
CAAGCAGAAG
AAGGCAACTr
CAATTAATAA
ACAGATGT
AACTTAAAGA
GAAGGAACTC
GCGAGTCTCT
CTAGCAGGTA
ACCAGTAAAG
GGTGTAAAGT
ATAGGTTATA
GAAAACTTCG
AAACAAACGG
ACCTACA.AG
GGCGGGAATC
CAACCATTGG
CGGAAACCGT
AAGCAAGTAG
CTGACACGAG
TGGAACAAC
ATTCATI'TTC
ATCCAGATAT
ATAATCGAAA
ATGTAATAJJ
AAGAGGTGTA
ACAATAGAGA
CTAGACCGCT
GAGACAATGT
TTACTAATTT
AGACTGTGAG
TAGATACAAG
TCATTAACAG
TTGGCACGAA
CTTGTGTAGT
GACGATTGAA
CGTGAAGGAA
TCAGGAAGAA
AGAAACACCA
TGCAGAAACT
TATACCAATT
GTGGACTAGT
GCCAGCAACT
CACAAACTTT
AGGTAAAAT
ATC'TFGACT
TTATACACCA
rTTGTATCGT
TAGCCCAATC
TGGCGCCAAA
TGATCCTTAC
ATCTTACCGA
TCTI'AGTAA
AGTAGTCCGA
GAAACAACGA
GCAACGAAAG
ATCATTCCTA
TATCAGT'ITG
ACGTATAGCT
ATGGTCAAG
TCTGTAGrTA
TCAATTCGCA
GATCTACTT
CAAAGTAGTG
CGGGAAACAT
ATAAATGACT
TATT!ACCATC
ATCACGCCAC
ACGTTCCAAC
TTCAAAGGCT
TI'AACGCCTT
CCGCCGAAAG
AAGCCAGTAC
AAGCGGAGAA
AACCAAAAAA
GATTTGTGAA
ATGCCAAAGG
GGAGTGCITC
TGCCACCAGG
TACCAAAATA
ATCCGTrACC
TAATTAGTGC
TTI'GGAGG
TTGATGTGAA
TGACCAACCG
CAAGTAATTI
AAAGTGGT
GGTACAAAGG
TTCTGGTGTG
TAGTGCCAAG
GGAAAATAGT
AGAAGA.ACCG
AATCAATATG
TGAATCAGGT
ATCATGGA6AG A6ACCGTAGGA
ACCGTCATAT
TIATGCATCA
TGCTAT1'GCA ATrGGCCTG
AGATCCTGCA
AGGTAATAAC
CCGTGTCACC
CACCCAAGGG
TTTAccCGAG GAAAAcCAAA GTACATAGAA AGAAGGGACG. ACATGA.AGCA AACTAAGTG 209 TABLE 1. Nucleotide and Amino Acid SeqeuenceS of E. faecalis Genes.
a a e *aaa..
a a .a.a a.
ACCGAGccT
GATTTGACGG
TTTGGCTTL'G
TATAATCTTA
ACAGATACAG
GTCAATACGT
ATCAATATTA
AATATAGATC
CTGTCGGGCC
ACGTTATI'TG
CCTGTTTACT
GCTAAAATCA
GCCTACACTT
TACAAGTTCA
GCGCCAAGTT
GAAACAGTTA
GCTTTCACAC
TTAAGAACCG
AGTATTAATA
CAACCAATCA
CAACTAAAA'I
TATAGATATCG
GAGGTTGACC
TT'PTCTAATC
GTTAACTATC
GTCTACAAAC
ATCACTGCAC
AAGTACACAC
TTCCA-AGGG'
ACGTTCAATC
CCAACAGCTA
ATCTGGACA.
AAAGGGCCC,
GGAGAAACG,
CCAAATGCC'
AAACCAAAC
GTGGATAAC
GGCTTCATC
CAALCACAGC
CGGAT'AAG
GCCACAGAC
TTACCAAM
AGCTTAAC;
AATGTTA]
GGTGTTAAJ
TGGCCACCAC
TGGTCTATGA
TGGATGAAGC
cTTrTAA-ATGA
CGACACTGAA
TTTATGGCGC
CCAAATCAGA
AAGTAGAAAT
AAACGTTGG
GGACAGAGTC
ATTATAGA
CACCCCCAAC
TCAAACAAGC
AAGGTTGGTA
ATCAAGTGAC
CGACAGTGTA
CGGCGTTAAC
ATTTATATGA
ATGGTAGTAIl
GTGCTACCAP
ATGTTGACAC
TGTACACGAJ
TTAGTTCAGIt
CAAATAATAC
TCTCGTAAM
TCACTCGCA2 CAACAGGCr
CGGCAAAAG(
r' GGTATAAAG( 3 CGACC TG, D, GTGTCACAT k. CAACGATCA P, ATTGGTCAG A CAAAATCAA G TTCCTATCG A CTGTTTITGA T TCGTGAGAA A GTCTGCCAP T TGAAACAAG- A AAACGCAAC A GCTTGCCTI
ACAATCAAC
iG CCAACAACI .C AGTTGGAC' G GGCAACAA
TAAAACACCT
GGAG~rTTCA
GACGAACAAA
AAATAATAAA
AAAC'LTGTCC
GAGTGACAT'r
TGGCAAA.ACC
GTCACACATG
C CTTAATGCT
TGGCATCGAT
AAACCGCAAA
AGGTTTCACC
AGGCACCTTA
CAAAGGCAAG
CTACGATGAC
TCCATCAGTC
TTTrAGTGGT
CGTGACCTCA
GCCATTGTCC
CAGATTACAG
CATTCAATTP
CAATAGCTCP
ATCTCTTAAC
ACTTTTTAC
C' GT1'TTTTJ
ACAAGTCACC
r CACCCA-AGG'.
TTTACCAGC(
S GAAAACCAA( AL TGGCAATGA, T AACTCGACC.
C GAATACTAG C TGGTCTGAC .T CCCAGTAAA G CAAAA.AGT A AGCAGAAGTl ,T TCGTCCAAA 6C CTTCGAC'TI ;C CGCGGA TP .C CAATTGGAC C AGCGACCCC :C AACCGAGT kC AGCAACGAC rr CACCTICAJ rA CAAGGCCG(
AGCTATAAAG
GGGTACGAGC
TTAATTGCCC
AAAACAGTAA
GTGCCTGTCA
ACGTTTACAT
GACCCAGCTT
CCTGTGACCA
TTAGCCGATC
GACCCAGTCA
GTFCACCGAGA
CA&tAAAA
CCAGACACTT
TCCATACTCA
AATGATGATT
GATATGAACT
AAGTACTATG
AAAAATAATG
CAAGAATTI'I
TTTAATGTTC
GACACAGCTC
CTGG'rT71rCC TTGCTMAT J
ACGCATTTA(
LAACGCCAAA(
GAAAACTI'C(
r~ AACCAAGTA( 3ACGTATACT 3CCAAGTACrG
GATATGACC
A AAAGAAGTG C AAAGCACCC G ATCCCGACC T AGTACACTT T TCAGM1CCI T GTAGTATI XT GATCAAGAP ,c GGCCAAGTIC LC TACGGTAAC ;C TTAACAGC:C ,c TTATTA'I1 M' AAAAATACC ;T ATTATTGC( k.T AATGTCAA :A GTTACATG( TCACGTATGA TG TGCCTGCTTC
GA
CCGACCAAGT
GC
TGAGCAGTA-A C9 ACTATTTrTGA
A(
TGCCCAAACG
G
TTCCTCFI'CC
T~
CTTATAACAA
G~
AACCTGAATT
T
ATI'AITATAC k AC LTCGTAGA
C.
AA-ACGGTGAT
T
ACACAACG
C
ACACATTGACA
TGAATGTGGT
G
TTGTGAATGA CGCAAAGTAC G GTAATGGGCA
A~
TGAAAAAATA ATAA.ATTAG'C
C
AAAGTAGCAA
I
ACCCAAATGT P' TTGATTCAGA 3 GATATAGTGG
:CTGCGGATAA
3TGGATCTCAA Z: CAATGAACAG A CAGGTGGCAA
TGAACAAAAC
G CCATGTATAA A Tr'GATACGAA T TACAAAATCT T TTATGGAAGT ~T GGACAGAGGG T TCACAACTCG 'C GTGGTATTAA LG TAGTCACACC ;G GCGTTGCAGG 'G GCACACGGAA 'C AACTGTCACA kG GGGCGGCGCC 'G TCGGTACCAC :A ACAAGCAATr AC TTGAAGTGCC
ACCTAGTTAC
ACAATCAT
~CCAATCAA
:AGATGAAG
~TAACGGG
AATATCGC
L'ACAAATCA
U.AAATCTAT
I'TGAAACAG
rATACGAAA
ATGAGTGGC
ACCA.ACGGC
ACAAGCGAC
GGTAAGACC
ACTACCAAA
TATGAAGAA
AAAGGCGGG
AGTGCGTAC
~TATACGGTA
.AATAATGGA
:ATCGACCAA
~CTGAAATCC
WGCACCAGCA
LGCCACCTAT
:ACAC-CAGG-A
GTCAAAACTT
CGGTGCCAA
rAACACCTTC
AGTCTATACG
AACAACTCCA
GGAAGAAATA
TACCAATGTA
CACCTTGAAA
GACACCAGAA
GGTTCCTI'TA
CGCAACAGG
AGATACTACA
AACGACCGAA
AACTAAGCAA
TCCGTATCTG
ACcAAAATCA
TGT-CTCTAGC
GAGTGCCATT
CACAGGTAGT
AGCCAATCAA
AGGTCCTTAA
EF108-2 (SE.Q ID NO:418) MKQTKWQ RLATIGLCSS LVINAFSGVT AVAETVTIES
SPTAESSAKE
ETQASSVKEE TTKASTENSQ V'ITDTSQEFEA TKEA.EKEEPQ
AEVEQAETPI
ATYSFSAETY QFGFVNESGQ LINPDI IP IT YSYAI<GSWKT
DGYNRKWTSM
LKNVIMPATS VVMPPGPSYE GTQEVYTNFS IRIPKCYYASA
SLYNREGKID
AGTRPLSLTQ SSVISALALT SKGDNVYTPR ETFFGGDPAG
VKFTNFLYRI
GYK'IVSSPIY YHLTNRRVTE NFVDTSGAIKI TPPS4FTQGK
QTVINSDPYT
IPKPKINM'K
VQGSASTVGN
STYPLPAIAL
NDFDVYKGNNI
FQQSGFLPET
210 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
YKVGTKSYRF
GFVDEATNKL
NTFYGASDIT
SGQTFGFNAL
KITPPTGFTQ
PSYQVTYDDN
RTDLYDVTSK
LKYVDS IQLD
SNANNRLFYT
TAPTGFTQGN
FNATFDGNDD
GPNWSAGLTI
PNTVLKAEVV
HSLJKQAADYY
TNYNQPTELK
KGWYKGKTKT
IAPDQVQM(Y
FTLPXRYKS I
ADQPEFYTKT
GKKTVITSDA
DDLNVVYEEE
NNGNGQYTVS
TAQSSNLKSY
ELGYSGTPGV
QVPMNhSNTFK MTAMYXEE IP
PTFMEVTPEG
VFGC IKO)STV
GNGTRNPYLR
NTVGTTSAIS
EPLATI'XTPS
NLTLNENNKK
NITKSDGKTD
LFGTESGIDD
YTFKQAGTLP
TVTTVYPSVD
INNGSMPLSQ
RYVYTNNSSL
NYLLVMFLFN
YTAAKALPAT
TASVTLTRPK
ETTKSIPVNS
DNFVRIRPND
IKKTQPNWSL
LTANNTATS I
YKVTY)DNIDD
TVMSSNLTGT
PAFPLPKIYN
PVNYYTMSGP
DTYTTGGKTY
NNFVNEKGGA
ELLKKYNNGQ
VFDPNVAPAE
AKPADKSKIJV
YTTGGKVYTF
EVIDTNTNVI
TLWTEGVPLP
QEVVTPTTEG
TAQLSQPKSA
IANKQFTGSN
LTVVYEEFSG
DTATLI(NLSV
IDQVEMSHMP
VYYYLENRKV
KFKGWYKGKS
FTPALTFSGK
P ISATNRLQF
VDLSSESLNL
YKVTRI(QVTE
QGWYKGKTKP
WTTTITNTSK
NAVP IG1GVS
FISVPTFDFG
TOSL.TATRL
VYQLDFTFNN
YELPASTNQF
PVNYFEQYRV
VTTYNKLKQL
TENFVDTNGA
ILNTLTTTKA
YYAQSTSAYL
NVDKLAIDQQ
LNFOSDGTYF
NFVDVNGAK I STrLNKTTTPT
APLQNLTLKK
VAFTTRATGK
QVGVACTKQQ
LLGAAPVSSF
VKLEVPANQG
VKGQQYKAAV T.JNLVTGP EF108-3 (SEQ ID NQ:419) 9.
9* 9 9 9 9, 9 .9.9 9 9*9~ 9* S 9* 9* CGT GACGA PrGAA AGTAGTCCGA CCGCCGAAAG TAGTGCCAAG
GAAGAGACGC
CAAGTAACAA
CAAGCAGAAG
AAGGCAACTT
CAATTAATAA
ACAGATGGTT
AACTTAAAGA
GAAGGAACTC
GCGAGTCTCT
CTAGCAGGTA
ACCAGTAAAG
GGTGTAAAGT
ATAGGTTATA
GAAAACrI'CG
AAACAAACGG
ACCTACAAAG
ACCGAGCC T
GATTTCACCG
TTTGGCTTTG
TATAATCTTA
ACAGATACAG
GTCAATACGT
ATCAATATTA
AATATAGATC
CTGTCGGGCC
ACGTTATTTG
CCTGT'ITAC'I
GCTAAAATCA
GCCTACACTI
TACAAGTTCPA
GCGCCAAG PI
GAAACAGTTP
GCITTCACAC
TTAAGAACCC
AAGCAAGTAG C CTGACACGAG T1 TGGAACAAGC A
ATTCA=TTC
ATCCAGATAT T1 ATAATCGAA-A C ATGTAATAAT C
AAGAGGTGTAC
ACAATAGAGA I
CTAGACCGCT
GAGACAATGT
TrACTAATT
AGACTGTGAG
TAGATACAAG
TCATTAACAG
TTGGCACGAA
TGGCCACCAC
TGGTCTATGA
TGGATGAAGC
CTTTAAATGA
CGACACTGAA
TTrATGGCGC
CCAAATCAGA
AAGTAGAAAT
AAACGTTTIGG
GGACAGAGTC
ATTATTTAGA
*CACCGCCAAC
*TCAAACAAGC
AAGGTTGGTA
ATCAACTGAC
LCGACAGTGTA
CGGCGTTAAC
ATTTATATCA
GTGAAGGAA C CAGGAAGAA C GAAACACCA I ,GCAGAAACT I
~ATACCAA
;TGGACTAGT
;CCAGCAACT
'ACAAACT
~GGTAAAA IT kTCTTTGACT
LTATACACCA
I=rTATCGT
TAGCCCAATC
TGGCGCCAAA
TGATCCTTAC
ATCTTACCGA
TAAAACACCT
GGAGT=CA
GACGAACAAA
AAATAATAAA
AAACTTGTCC
GAGTGACATT
TGGCAAAACC
GTCACACATG
CTITAATGCT
TGGCATCGAT
AAACCGCAAA
AGGT'ITCACC
AGGCACCTTA
CAAAGGCAAG
CTACCATGAC
TCCATCAGTC
TI'TAGTGGT
CGTGACCTCA
,AAACAACGA
CA.ACGAAAG
~TCAT1'CCTA EATCAGTTTG C kCGTATAGCT k.TGG'PTCAAG E'cTGTAGTTA r'cAATTcGcA
"ATTCTACTT
AAAGTAGTG
CGGGAAACAT
A~TAAATGACT
TATTACCATC
ATCACGCCAC
ACGTTCCAAC
TTCAAAGGCT
AGCTATAA.AG
GGGTACGAGC
'ITAATTGCCC
AAAACAGTAA
GTGCCTGTCA
ACGTTTACAT
GACCCAGCTT
CCTGTGACCA
TTAGCCGATC
GACCCAGTCA
GTCACCGAGA
CAAGGTAAAA
CCAGACAC'T
TCCATACTCA
AATtGATGAT1'
GATATGAACT
AAGTACTATG
AAAAATAATG
LAGCCAGTAC G LAGCGGAGAA A~
LACCAAAAAA
;ATTTGTGAA
kTGCCAAAGG P
;GAGTCCTTC
EGCCACCAGG TACCAAAA'TA 9 kTCCGTTACC TA-A TAGTGC rTTTTrGGAGG
TTGATGTGAA
TGACCAACCG
CAAGTAAT
AAAGTGGTFI'
GGTACAAAGG
TCACGTATGA
TGCCTGCTTC
CCGACCAAGT
TGAGCAGTAA
ACTATTI'GA
TGCCCAAACG
TTCCTCTTCC
CTTATAACAA
AACCTGAATT
ATTA7rATAC
ACTICGTAGA
AAACGGTGAT
ACACAACAGG
ACACATTGAC
TGAATGTGGT
TTGTGAATGA
CGCAAAGTAC
GTAATGGGCA
GAAAATAGT
.CAAGAACCG
LATCAATATG
,GAATCAGGT
LTCATGGAAG
LCCGTAGGA
~CCGTCATAT
~TATGCATCA
rGCTATTGCA kTITGGCGCTG kGATCCTGCA kGGTAATAAC
CCGTGTCACC
CAccCAAGGG rTTACCCGAG
CAAA.ACCAAA
TGACAATGAT
GACCAATCAA
GCAGATGAAG
CTTAACGGGG
ACAATATCGC
GTACAAATCA
TAAAATCTAT
GT'GrAAACAG
TTATACGA;A
AATGAGTGGC
CACCAACGGC
TACAAGCGAC
CGGTAAGACC
AACTACCAAA
GTATGAAGAA
AAAAGGCGGG
GAGTGCGTAC
ATATACGGTA
211 TABLE 1. Nucleotidc and Amino Acid Seqeuences of E. faecalis Genes.
AGTATTAATA ATGGTAGTAT CAACCAATCA GTGCTACCAA CAACTAAAAT ATG7r'IACAG TATAGATATG TGTACACGAA GAGGFI'GACC TTAGTTCAGA T'TTTCTAATG CAAATA.ATAG GT~AACTATC TTCTCGTAAT GTCTACAAAG TCACTCGCAA ATCACTGCAC CAACAGGCTT AACTACACAG CGGCAAAAGC TTCCAAGGGT GGTATAAAGG ACGTTCAATG CGACCTTTIGA CCAACAGCTA GTGTCACATT ATCTGGACAA CAACGATCAC AAAGGGCCCA ATTGGTCAGC GGAGAA.ACGA CAAAATCAAT CCAAATGCCG TTCCTATCGG AAACCAAACA CTGTI'GAA GTGGATAACT TCGTCAGAAT **GGCTTCATCA
GTGTGCCAAC
CAACACAGCT TGAAACAAGC **CGGATTAAGA
AAACGCAACC
GCGACAGACA GCTTGCCTAC VTTACCAATTI ACAATCAACC *AGCTTAACAG CCAACAACAC *AATGTTTATC
AGTTGGACTI]
GGTGTTAAAG GGCAACAAT;
GCCATTGTCC
CAGATTACAG
CATTCAATTA
CAATAGCTCA
ATCTCTTAAC
ACTT'TTTTAC
GTTTCTTTTT
ACAAGTCACC
CACCCAAGGT
TTTACCAGCG
GAAAACCAAG
TGGCAATGAC
AACTCG-ACCA
GAATACTAC
TGGTCTGACG
CCCAGTAAAT
CAAAAAAGTr
AGCAGAAGTE
TCOTCCAA-AE
CTLTCGACTTC
CGCGGATTAC
CAATTGGAGC
AGCGACCCGC
AACCGAGTTC
AGCAACGAG
CACCTTCAA'
CAAGGCCGC
CA.AGAATTAT
TTTAATGTTG
GACACAGCTC
CTGGTI'MTCG
TTGCTPAATT
ACGCATTTAG
AACCCCAAAC
GAAAACTTCG
A.ACCAAGTAC
ACGTATACTA
.CCAAGTACGT
GAPfitTGACCG
AAAGAACTGA
AAAGCACCCT
ATCCCGACCT
AGTACACTT
*TCAGTTGCMF
GTAGTATTTC
GATCAAGAAC
GGCCAAGTGCC
TACGGTAACC
TTAACAGCGC
TTATTATTA(
AAAAATACG(
C' ATTATTGCC r' AATGTCAAA( k GTTACATGG, TGAAAAAATA T ATAAATTAGC C AAAGTAGCAA I
ACCCAAATGT
TTGATTCAGA
GATATAGTGG
CTGCGGATAA
TGGATGTCAA
CAATGAACAG
CAGGTGGCAA
TGAACAAAAC
CCATGTATAA
MrATACGAA
TACAAAATCT
TTATGGAAGT
GGACAGAGGG
TCACAACTCG
GTGGTATTAA
TAGTCACACC
GCGTTGCAGG
GCACACGGAA
AACTGTCACA
GGGCGGCCCC
TCGGTACCAC
k. ACAAGCAATT
-TTGAAGTCC
k~ ACCTAGTTAC
~AATPLATGGA
:ATCGACCALA
~CTGAAATCC
=GACCAGCA
LGGCACCTAT
:ACACCAGGA
'TCAAAACTT
:GGTGCCAAA
rA.ACACCTTC
AGTCTATACG
AACAACTCCA
GGAAGAXATA
TAC ATGTA
CACCTTGAAA
GACACCAGAA
GGTTCCT ITA
CGCA.ACAGGG
AGATAGTACA
AACGACCGAA
AACTAAGCAA
TCCGTATCTG
ACCAAAATCA
TGTCTCTAGC
GAGTGCCATT
CACAGGTAGT
AGCCAATCAA
AG
EF108-4 (SEQ ID 110:420) VTIES SPTAESSAKE a -TQAS SVKEE
ATYSFSAETY
LKNVIMPATS
AGTRPLSLTQ
GYKIVSSP IY
YKVGTKSYRF
GFVDE-ATNKL
NTFYGASDIT
SGQTFGFNAL
KITPPTGFTQ
PSYQVTYDDN
RTDLYDVTSK
LKYVDS IQLD
SNANNRLFYT
TAPTGFTQGN
FNATFDGNDD
GPNWSAGLTI
PNTVLKAEV
HSLKQAADYY
TNYNQPTEL<
TTYKASTENSQ
QFGFVNESGQ
VVI4PPGPSYE
SSVISALALT
YHLTNRRVTE
KGWYKGKTKT
IAPOQVQMXY
FTLPKRYKS I
ADQPEFYTKT
GKKTVITSDA
DDLNVVYEEE
NNGNGQYTVS
TAQSSNLKSY
HLGYSGTPGV
QVP4I'SNTFK MTAMYKEEI P
PTFMEVJTPEO
VFGG IKDSTU
GNGTRNPYLP
*N'IVG'TSAIS
VTTDTSQEEA
LINPDI IP IT
GTQEVYTNFS
SKGDNVYTPR
NFVDTSGAKI
EPLATTKTPS
NLTLNENNKK
NITKSDGKTD
LFGTESGIDD
YTFKQAGTLP
TVTTVYPSVD
INNGSMPLSQ
RYVYTNNSSLi
NYLLVMFLFN
YTAAKALPAT
TASVTLTRPK
ETrKSIPVNS
DNFVRIRPNV
IKKTQPNWSL
LTANNTATSI
TKEAEKEEPQ
YSYAKGSWKT
IRIPKYYASA
ETFFGGDPAG
TPPSNFTQGK
YKVTrYDDNDD
TVMSSNLTGT
PAFPLPKIYN
PVNYYTMSGP
DTYTTGGKTY
MNFVNEKGGA
ELLKKYNNGQ
VFDPNAPAE
A.KPADKSKLV
YTTGGKVYTF
EVIDTNrNVI
TLWTEGVPLP
QEVVTPTTEG
TAQLSQPKSA
IANKQFTGSN
AEVEQAETPI
DGYNRKWTSM
SLYNREGKID
VKFTNFLYRI
QIVINSUPYT
LTVVYE-EFSG
DTATLKNLSV
IDQVEMSHMP
VYYYLE-NRKV
KFKGVIYKGKS
FTPALTFSGK
PISATNRLQF
VDEJSSESLNL
YKVTRKQVTE
QGWdYKGKTKP WTM ITNTSK NM?? IGKKVS Fl SVPTFDFG
TDSLPTATRL
IVYQLDFTFNN
IPKPKKIOM~K
VQGSASTVGN
STYPLPAIAL
NDFDVKGNNI
FQQSGFIPET
YELPASTNQF
PVNYFEQYRV
rTytNKL.KQL
TENFVDTNGA
ILNTLTT KA
YYAQSTSAYL
NVDKLAIDQQ
LNFDSDGTYF
NFVDVNGAX
I
STLNKTTTPT
APLQNLTLKK
VAFTTRATGK
QVGVAGTKQQ
LLGAAP VS SF
VKLEVPANQG
VKGQQYKAAV TWNLVT EF109-1 (SEQ ID NO:421) 212 TABLE 1. Nucleotide and Amnino Acid Seqeuences of E. faecalis Genes.
AGGAGTAAAT
GGA=TCTCG
GAAACGGTGG
GAAATGAACC
ACCTCCGAT
NATAGTAGCG
TAG
TAATGAAAAA
TTACCCCTAT
CTTCAGAAAC
CAAGCATCAT
CCACCACTGA
ACGTACTGAA
AAGTGTTATA
TTCTGCTTAC
ATCTCTAACG
AAATTCTCAA
AG=ITCTACA
ACTAC'FTTGG
ACTAGTTCTA
GC TTGGAAC
GAGCGACAAA
GAGGAAACAG
TCAGAAGTAA
NAACATCACN
TGTT-AGCGGT
GCTCTAAGGG
TGAGTAGCGG
AAACAACGTC
CAACTGTTAA
AAGTAATGAG
TTTGTTGTCG
AACTACTGAA
TGTCACTGAA
CACTTCCTCA
TGATACAGAA
GACACACCTA
E.F109-2 (SEQ ID NO:422)- 14KKSVI TSSMLAVLLS GFLVTPISAY ALERSKGTTE ETVASE TSLT ERQMSSGVTE EMNPSIINSQ EETETTSTSS TSDSTTEVST SEVTTVNDTE XSSIDVLKLLW XHHXVMRTHL EF109-3 (SEQ ID NO:423) GGAAC GCTCTAAGGG AACTACTGAA GAAACGGTGG CTTCAGAAAC ATCTCTAACG GAGCGACAAA TGAGTAGCGG TGTCACTGAA GAAATGAACC CAAGCATCAT AAATTCTCAA GAGGAAACAG AAACA.ACGTC CACTTCCTCA ACCTCCGATT CCACCACTGA AGTTTCTACA TCAG EF109-4 (SEQ ID NO:424) ERSKGTTE ETVASETSLT ERQMSSGVTE EMNPSIINSQ EET=E'FTSTSS TSDSTTEVST S EF110-1 (SEQ ID NO:425)
C
TAAATAAAAA
AGTGCTGGTT
AATGCAGAAG
GACCCTGAGG
GGA.AGA.ATCA
GTTGGAACCA
GTATTAAATC
ACACCAT ITG
GTAGTGACTG
ACGCCATI'G
CCAGGTGAGA
ACAGACTTAG
TCACCAATCT
CAAACAGGAA
GTGAATGAAG
TGGATAAGGA
'rrwCTTTCT
AGTATATCGT
ACAGAAGACA
TTTCCCCTGC
ATACAAT
CGAATGCCAA
GCAAATTCAA
TCGGCAAACA
~TTGAAAAA
AAAACCACAC
AAAATCCATT
ATAATGATCA
ATCATGGTCA
AAGAAAkATAA
GTGGCATAAT
GATTTTAGTA
TCCTGCCGAA
AGAAGTGGCA
CAGTAAACCA
CACCAATAAT
AGATGATGCT
AGTGATTGAT
AAACGATCGT
GT'ITGAATCT
ACAATGGTCT
ACTAIFTAT
GG ITGAAGTA
AAGACTAAAT
ACGITTTATCC
CTTATGAAAA
ACTI'TGATCG
AGTCATTCAC
GATACAkACCG
GGCTATATTT
CA.TGTGGCTG
TGGTTTTATC
GTAGCTrTTT
CCAGATGGCC
TCAGATACCC
CATGAAAATG
GATATCGATA
GTTGGTGTTC
GAAGrTGAATT
GCTGTGCCAG
AGTTCTCCAT
CCGGF=AG
GACAAAAAAG
AAGCGCCFI
CTTTAGGAAC
AAAGTTTAA
CAGGTCGAGA
CCCCGAATGC
CAGAGTTGGG
ATGTCACAAT
ATT'GTTTAC
CAACCGGCGG
ATTCCAATGG
ATAACTTTAT
CAGCGTAA
ACGAAAAAT
C'ITTCTGCA
ATCGTTACTG,
TGCGTCAATC
AGGCTI'TGTT
GAATGCCAAA
TGGCAGTGCG
GGATATTGCG
AGAAATTTA
ATCAGGCTAT
ATCTAACTTT
TcAATCTGT
CGGCATTAAG
TGTTAATCGA
EF110-2 (SEQ ID NO:426)
MKKFSIRKIS
PEDRRQEVAD
LNPNAKDDAW
PFVLKKFESS
P IYNDQVEVV AGFLFL ILVT TTEAkPFAS IG
FYPGRDGSAT
DTHVTISGYP
GVHSNGGIKQ
LIAGFSLSAN
RIISPASKPG
PFGKFKVIDV
GEI(NHTQWSH
TGNHGQRLNE
AEEYIVPAES HSRQKRSLLD
YISLGTGFVV
AFsPNADIAV
ENDLFTSNFT
VNYNFIVNRV
GTMTIVTNNH VAESFKNAKV VTVGKQNDRP DGPELGEILT DLENPLLFYD IDTTGGQSGS NEEENKRLSA VPAA EF110-3 (SEQ ID NO:427) AG AGTATATCGT TCC LGCCGA-A AGTCA'FTCAC GACAAAAAAG ATCGTTACTG 213 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes.
GACCCTGAGG
GGAAGAATCA
GTTGGAACCA
GTAT TAAATC
ACACCATTTG
GTAGTGACTG
ACGCCATTTG
CCAGGTGAGA
ACAGACTTAG
TCACCAATCT
CAAACAGGAA
GTGAATGAAG
ACAGAAGACA
TTCCCCTGC
ATACAATTGT
CGAATGCCAA
GCAAkATTCAA
TC'GGCAAACA
TTTTGAAAAA
AAAACCACAC
AAAATCCATI'
ATAATGATCA
ATCATGGTCA
AAGAAAATAA
AGAAGTGGCA
CAGTAAACCA
CACCAATAAT
AGATGATGCT
AGTGATTGAT
AA.ACGATCGT
GTTTGAATCT
ACAATGGTCT
ACTAITI AT
GGTTGAAGTA
AAGACTAAAT
ACGTTTATCC
GATACAACCC
G4GCTATATTT
CATGTGGCTG
TGGT=TATC
GTAGCTITTTT
CCAGATGGCC
TCAGATACCC
CATGAAAATG
GATATCGATA
GTTGGTGIC
GAAGTGAANr GCtGtGCCAG AAGCSCCTTT TGCGTCAATC C'I TAGGAAC AGGCTTTGTT AAAGTTTTAA GALATGCCAAA CAGGTCGAGA TGGCAGTGCG CCCCGAATGC GGATATTGCG CAGAGTTOGG AGAATTTA ATGTCACAAT ATCAGGCTAT ATTTGTTTAC ATCTAACTTT CAACCGGCG-G TCAATCTGGT ATTCCAATG-G cG~cATTAAG ATAACTTITAT
TGTTAATCGA
CAGCGT
EFIJQ-4 (SEQ ID NO:428) EY IVPAES HSRQKRSLLD PEDRRQEVAD TrEAPFAS IG LNPNAKDDAW FYPGRDGSAT PFVLKKFESS DT]-VTISGYP PIYNDQVEVV GVHSNGGIKQ RI ISPASKPG
PFGKFKVIDV
GEKNHTQWSH
TGNHGQRLNE
YISLGTGFVV
AFSPNADIAV
ENDLFTSNFT
VNYNFIVNRV
GTNTIVTNNH
VTVGKQNDRP
DLENPLLFYID
NEEENKRLSA
vAESFKNAKV
DGPELGEILT
IDTTGGQSGS
VPAA
a a.
a a a a.
a a a EF111-l (SEQ ID NO:429) a a a
TGATCAATAC
AAAAAGGGAT
CTATCATTTT
TGGTTAGCGT
AGTGTTGAAA
ACCGAACGGA
GCA I'GCATG
GGGCCATTGA
ACGGTCAAAA
CTGTCTTATT
TTAGC'TITAA
GGTGAAGAC
GAGGGAACGG
GTGAATTI'TG
ACGATI'CATC
GCTGAACCAA
ACTT?1'ACAA
GCCGCGCGCT
GATCAAAAGA
ACGATTAATA
TCCATGTCTI
GCA.ACGGCTC
CAA.ATrcAAP
TTTTACAATC
AAACCACCAI
TTTTTAAAAC
GACCACAAT;
AAAGACGACI
CTGTTATG)-
AAGGTGTGG(
TAG
ACTTCGATAC
ACACTCCATT
ATAGGAGGGG
TAGGAATCAC
GAACAAAAGT
CGAATGGCGT
GCTACTATTT
TTATTGCGGA
ATAAAAAAAC
ATCCTGGGCG
TTTTTGTC-AG
CCTTGTCACT
AAACCTTAGA
GTGAAGTCAG
ATGCGGATAA
TIGAATTGAA
AAGAAGAAGP
ATTTCAAAGP
AAACAGCAG;
CCAATTGGCC
ATAAATGGT]
ATTT1'AATCC
AAGATGATAC
AAGACAGTG(
TGCCTGCAT(
AAATGTATC(
AAATGGGA!
kAGGATCAAA' k AGCAGCCGCI 3CAAAGGCGA
GGTCGCTTTT
TGGCATAGTC
'IT=ATCAAG
CGTAGTCAGC
TGATGAAFI'T
ATACCATACC
ACCTGAAAAA
AGAATATCCA
GGGAGAA-ACC
CCTAGAGCAA
CAATCGAACG
TGGAGCAAGC
TAT'MGCACT
AGAAACGTGG
AGTTTCAACA
GCCTAAACAA
GGCAAAGC-AA
GAACAAGCAA
ATTTCCTGAA
AAGTGCGGCP
TATTGGTATC.
TGACTTAGCI
CGTACGTCC;
GCGTGGTGGI
G GCAGTTTGC
CAAACT
r AGCAGAATA! r CATAAAGA! 7. TGGGAAAGT(
TTTCTAGAGA
CF1'GCTGATA A
GGTTTATCAA
TGTT'ITGCGIT I GCAAATGTT 'I
AAI'ATTTTAI
AGCAATAAAG
GTAAACTTGG
TATGATTTAA(
ACCTATGAAT
GCGCTTATCC
TGGACAGGTG
CGTTTAACTG
AATTAITTG
AAAATTGATA
ACGTACAACA
CAACAACAAG
AGATGGCAAG
TATCAAAATG
GGTGCCTTrA
*TGGGCTTGGG
AAAAATAATA
CAAGATGCAG
GAAGGTGGCA
CATATT1TATC
GCTTATCATA
r' GGAAGCATGG r AAAAATGGCC
-GCATGCATAA
LAGTTGAATC T ~TAAATCAGT G
LAAAGAAACGC
~AAGCAGA
~AGATGTGAG Ti ~TAATTTTT'C TI GCCTACTCG1
:GGCAAGTTT
3CCA-AAGCAA C rAGACGA'ITT I
PLAACGACACT
CGGTCTrGA
CTAAAGACAA
CTACGAAAGA
ATCGGAATTA
CCTATACGAC
CACCCCAATA
GATATCTAGA
CGCTACTCAA
ACCATCACG
A'ITCGTGGAA
TGCGGGCCTT
GAGCGATCAT
ACTGGAATGA
AAGAAACCAA
A'TCGTGCTA
TCAGTGATGC
ACCTAAAGTG
CGCTACACCO
AAACAAAAAT
TTCAATAAT
TATAAAGCG
GTGTCTACT
,GTGCAAGCA
'GCATCACCA
~CATTTAGCT
'CCTTTTGC
LAACAAATTA
CGCATG-AC
k.ACGATCAT
L'GAAAACACT
:AAAATTCAA
TGACATTCAA
CACAAAATAT
rACAGCAAC
AGAAAGCTAT
TACCAAAAAT
TAAAACGTTT
ATCGATTGCAA
GATTCTTCCG
AGCGGATGTA
GTTTGA'ITAT
TGATGCTGTC
ACCAAATTCT
AGATAAGGAA
TACCAACAGA
TCACTCCCAA
GATGATGATG
TTTCACAAAG
AALAGGAAAAG
GTTGGAGTTA AAGTTTTTGA 214 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes.
EF111-2 (SEQ ID NO:430) DM(G LSKKKRVSTW
LALGITVVSC
WHGYYLPEKS
SYY PGRLEQT
GTETLDIGTR
EP IELKPI(QT
QKKTAEFPEY
TADFNPELAK
PPLAAWAVWH
DDKDQI IKDK
FALSREVQAS
NKELLGGFAG
YELDOLTIHL
LTAKONDIQV
VNTYTTESYT
QNALVKS lET
NNMRALFDYQ
IYQETKDKEF
NGI4LKWMMML
VERTKVDEFA
PLIIAEEYPV
ALIFVSNRTA
NFGEVRE.TWN
FTKEEEAKEQ
INTNWRSAAG
IQKDDTVRPQ
LKE4YPKLVA
LLKQPRGKVA
NVLOVSASPT
NLAASLNK:LT
LI QTTLENTG YFATK[YrKYT
QQAPE-YTKNA
AFKHDGIVPS
DAGAI IDAVF YI4RWWTNRD
WITLHGLTKK
E-RTNGVYDTN
VK<NKKTGETY
7-EPLSLGASW
IHHADKVSTK
ARYFKENKQR
MSYKWFIGMW
YNQDSARGG7E HNXNG IAEYG
VAAATLELK
YFNNFSDLGA
DLSQSNRMDL
TGAVFDKIQE
IDNRNYTATA
WQGYLDKTFD
AWDSWKADVA
GGNWNERNSK
SMVSDAHWQK
FLKTK IKEK EF111-3 (SEQ ID NO:431) S S
SS
5 5 0* 5* S S S 5
S.
S S
S
S
S
5
S
55 5 S S
S.
TCYATGAATE
5
I
ACCGAACGGA
GCATGGCATG
GGGCCATTGA
ACGGTCAAAA
CTGTCTTATT
TTAGC=IAA
GGTGAAGAGC
GAGGGAACGG
GTGAATTI'TG
ACGATTCATC
GCTGAACCAA
ACTTTACAA
GCGGCGCGCT
GATCAAAAGA
ACGATTAATA
TCCATGTCTT
GCAACGGCTG
CAAATTCAAA
TTTTACA-ATC
AAACCACCA'I
TrTTTAAAAC
GACCACAATP
AAAGACGAC;
CTGTTATTG;
AAGGTGTGGC
GCAAATGT
CGAATGGCGT
GCTACTATT
TTATTGCGGA
ATAAAAAAAC
ATCCTGGGCG
TTrTTGTCAG
CCTTGTCACT
AAACC'ITAGA
GTGAAGTCAG
ATGCGGATAA
TTGAATTGAA
AAGAAGAAGA
ATTTCAAAGA
AAACAGCAGA
CCAATTGGCG
ATAAATGGTT
ATTTTAATCC
AAGATGATAC
AAGACAGTGC
TGGCTGCATG
AAATGTATCC
LAAAATGGGAT
AGGATCAAAI
AGCAGCCGCC
CAAAGGCGAC
TAGATCTGAG TGCATCA ATACGATACC AATTAT' ACCTGAAAAA
AGCAATA
AGAATATCCA
GTAAACI
GGGAGAAACC
TATGATI
CCTAGAGCAA ACCTATG CAATCGAACC
GCGCTTV
TGGAGCAAGC
TGGACAC
TATTGGCACT CGTTTA-; AGAAACGTGG AAT' AGTTTCAACA
AAAAT
GCCTAAACAA ACGTAC GGCAAAGGAA CAACAA( GAACAAGCAA AGATGG( A=ITCCTGAA TATCA AAGTGCGGCA GGTGCC' TA7TGGTATG TGGGCT TGAGTTAGCT AAAAAT CGTACGTCCA
CAAGAT
GCGTGGTGGT GAAGG T GGCAGTTTGG CATATI CAAACTTGTG GCTTAI AGCAGAATAT GGAAGC CATTAAAGAT AAAAXI TGGGAAAGTG GCATGG G ITGGAGTTA AAGTT
.CCA
LTTA
AAG
TGG
TAA
~AAT
~TCC
;GTG
CTG
ETTG
;ATA
AACA
CAAG
CAAG
AATG
TTA
GCAG
CGCA
'TATC
CATP
ATC
GGCC
ATA7 ATAATTTTTC 9) AGCTACTGGG
CGGCAAGTTT
GCCAAACCAA
TAGACGATTI
AAACGACACT
CGGCTITGA
CTAAAGACAA
CTACGAAAGA
ATCGGAATTA
CCTATACGAC
CACCCGAATA
GATATCTAGA
CGCTAGTCAA
AGCATGACGG
ATTCGTGGAA
*TGCGGGCCTT
*GAGCGATCAT
ACTGGAATGA
AAGAA.ACCAA
ATTGTGGTA
TCAGTGATGC
ACCTAAAGTG
SCGCTACACGG
rGATTIAGGT
,GGTTTTGCG
LAACAAATTA
:CGCATGGAC
kACGATTCAT rGAAAACACT
:AAAATTCAA
rGACATTCAA
CACAAAATAT
TACAGCAACC
AGAAAGCTAT
TACcAAAAAT
TAAAACGTTT
ATCGATTGAA
GATTGTTCCG
AGCGGATGTA
GTTTIGATTAT
TGATGCTGTC
ACGAAATTCT
ACATAAGGAA
TACcAACAGA
TCACTGGCAA
GATGATGATG
'rTTGACAAAG EF111-4 (SEQ ID NO:432) DEFA NVLDVSASPT ERTNGVYDTN YFNNFSDLGA
WHGYYL~PEKS
SYYPGRLEQT
GTE-TLDIGTR
EP IELKPKQT
QKKTAEFPEY
TADFNPELAK
PPLAAWAVWH
DDiXDQI IKDK
NKELLGGFAG
YE-LDDLTIHL
LTAKDNDIQV
YNTYTTESYT
QNALVKSIET
NNMRALFDYQ
IYQETKDKEF
NGHLKW~tha4 PLI IAEEYPV
ALIFVSNRTA
NFGEVRETWN
F'IKEEEAKEQ
INTNWRSAAG
IQKDDTVRPQ
LKEI4YPKLVA
L.LKQPRGKVA
NLAASLNKLT
LIQTTLENTG
YFATKDTKYT
QQAPEYTKNA
AFKHDGIVPS
DACAI IDAVF
YHNWWYTNRD
WITLHGLTKK
VKNI(XTGETY
EEPLSLGASW
IHH-ADKVSTK
ARYFKZENKQR
MSYKWFIGMW
YNQDSARGGE
HNKNG IAEYG
VWAKATLELK
DLSQSNRMDL
TGAVFDKIQE
IDNRNYTATA
WQGYLDKTFD
AWDSWKADVA
GGNWNERNSK
SMVSDAHWQK
215 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
EF117-1 (SEQ ID NO:433)
TAATTCGATIG
GCTTTAACAG
AAGGAAACGN
GAATATA=I
AAAGGTGATC
AAGAAAAAAT
AAAGGGACAT
GAGAAGGTG
GTTGTAAATC
CACAGTCTTA
CTTTGAAAGG
GTTTTATTTT
TGAAGATTGG
TAATTGAAAG
TTTAGTGA-AA
CGGTGAAAAA
NTCAGAAACA
GGAGATTGTT
GAAAAGTGGT
TGACGAAGTG
TCAGGAGAAT
AGATT1'TCAT
GAATTTGATG
GAATTACAAA
GAGAGTGACA
TCTAGTAAAT
ACAGT'ITACG
CATGATI'CAG
'TI=ITTTACT
AAGAATCTCT
ATGGTGACGT
GTCGTTCCAG
ATCAAG=TN
GAGAATATTA
CCACGAATTA
AATTITTACTT
TCAAAATCTA
TCGTTTAAAT
TTTAATAAAA
TAATGAGCAA
CGGCT'TITG
C
EF117-2 (SEQ ID NO:434) VKR FSFFLLILLA LTGCKSGEKE FDEESLQNLK ETXQSXSETE L&NGDVRLNE YISLKGEIVE SDSRSSLIKK GDRF-ILKSGS SKYQVXNEQK KKLKIGDEVT VYGEYYGFLK GTLIESEENH DSATN EF117-3 (SEQ 10 NO:435) S
S*
S.
S.
S S
S
S.
S
S
S
S
5 *55.
*5.S S. S *5
S.
TG AAGAATCTCT TCAAAATCTA AAGGAAACGN CACAGTCTTA NTCAGA.ACA GAATATATI'T CFI'TGAAAGG GGAGATTGT AAAGGTGATC G'FrTTAFTI'
GAAAAGTGGT
AAGAAAAAAT TGAAGATTGG TGACGAAGTG AAAGGGACAT TAATTGAAAG TGAGGAGAAT
GAATTACAAA
GAGAGTGACA
TCTAGTAAAT
ACAGTTTACG
CATGATTCAG
ATGGTGACGT
GTCG'TCCAG
,ATCAAGTTTN
GAGAATATI'A
CCACGA.A
TCGTTTAA-AT
TTTAATAAAA
TAATCAGCAA
CG.GCTrT= EF117-4 (SEQ ID NO:436) EESLQNLK ETXQSXSE-TE LQNGDVRLNE YISLKGEIVE SDSRSSLIKK
CDRFILKSCS
SKYQVXNEQK KKLKIGDEVT VYGEYYGFLK GTLIESEENH
DSATN
EF1.18-1 (SEQ ID NO:437)
TGAGGGGGAA
GCTTTGITAC
AATCANAACG
TTTTCACAAG
AAAGACCCAT
CTTAGAGATG
AGATAA
AAAGTGTGTT
TGACAGCTTG,
CACAAGAATC
AAGATGGTGC
cGCTTTCTGG
AAATTAAMAG
AA.AAAGAA.AA
TGGCAAAAGT
TGGCCTTTGN
ACAAACGAAT
GGAAGATTCA
CGATGGGAAT
GTGGGGATTG
GCGAAAGATG
GATTTCTCTA
CCTATGATTG
AGTAGATGCC
GGATGTACCG
TCGCAGGCGT TTTCTGTTCA AGTTCATTCA
AGGAATCGGC
TGTCAATTAG
TGACATCAA-A
GGATGCTCAT CACGcAAATC AAAAAAGAAA AAGCATTCA ATI'TCATTCG TTGGATCGTT EF118-2 (SEQ ID NO:438) VLKRKV GIVAGVFCSA LLLTGCGKSA KDEFIQGICN XNAQESGVXD FSMSISDM(F SQEDCAQTNP MIGMLITQIK DASLSGEDSS RCQKRKSIQL RDEIKSDGNG CTDFIGWIVR EF118-3 (SEQ ID NO:439) GAAAGATG AGTTCATTCA AGGAATCGC AATCANAACG CACAAGAATC TGGGG=EIGN GA'ITCTCTA TGTCAATTAG TGACATGAAA TTTTCACAAG AAGATGTGC ACAAACGAAT CCTATGATG GCATGCTCAT CACGCAAATC AA.AGACGCAT CGCTTCTGG GGAAGATI'CA AGTAGATCCC AAAAAAGAAA AAGCATTCAA CTTAGAGATG AAAI'AAAAG CGATGGGAAT GGATGTACCG ATTTCATTGG TTGGATCGTT
AGAT
21 6 TABLE 1. Nucleotidc and Amino Acid Seqeuences of E. faecalis Genes.
EF118-4 (SEQ ID NO:440) KDEFIQGIGN XNAQESGVXD FSMSISD1M(F SQE-DGAQTNP MIGMLITQIK
DASLSGEDSS
RCQKRKSIQL RDEIKSDGNG
CTDFIGWIVR
EF11-9-1. (SEQ ID NO:441)
TAAAGAATAC
AGTGCAACGG
GGCGGAGTGT
GTATCGACAG
TTTIACGAGTG
GTAAAACTTC
PLATGAAAAGC
AAAACTG.GAG
CATGAATATG
TTGGCTAAGG
TATTAG
CGAGTAAAAT
AAAGGAGCTC
TGCTTGTAG
AAGAAACGAC
AAGCAACTAA
TAGAAGCTA
TAAAGCCCCT
TAGCGTTAGT
cTTTAcT1TTT
TGAAGAACAA
TTTCGGAAGG
GGAAATCGTG
TGGCTGTCAA
AGTGATTGAA
AAAACAGACA
TGGAAATGCG
CATGACTGAA
TTCCGTAGGA
GGATTGAA
TAAAATTTCT
CqIlTrTTTCA
AATAAGCTAC
&GCCATAAGG
ACAGTAGCAA
GAAACAACGA
TATGCGAACT
AA.ATGTATCA
AAGCTTACAA
CAA.AATGGAA
GAAATGACCT
AAA.ATTGTAT
CTITACTTAT
AAGAAAACAA
GGGAACAATC
AATTAGAAGA
TTACAAGTAT
AAAAAAATGG
CGATC-TATAA
CGCAGACACG
ATAATTCAGT
ATGCAAAAGA
TFI'ATTGTTA
GTCTAGTAA
GAAGGAATCG
ACCAGATCAT
TAATGATCGC
AATTGATGTT
AAATGATCAA
AGTGTTACTT
TAAGCA.AGAG
S S Sb S S
S.
S
S S
S
S
EF119-2 (SEQ TD NO:442) 'ZN KLPLLILLLG GVLLVSGCQS HKEENKSSKV STEETTVIET VAREQSKESF TSEATKKQTE TTKLEEPDHV KLLEAYGNAY ANFTSINDRN EKLKPLMTEK C IKK<NGI DVK TGVALVSVGK VTTIYKNDQH EYALLLDCEQ NGTQTRVL.LL AXVXNNKISE rATYNSVKQEY EF119-3 (SEQ ID NO:443) AGAAAACAA GTCTAGTAAA
GTATCGACAG
T'ITACGAGTC
GTAA.AACTTC
AATGAAAAGC
AAAACTGGAG
CATGAATATG
TTGGCTA.AGG
AAGAAACGAC
AAGCAAkCTAA
TAGAAGCTTA
TAAAGCCCCT
TAGCGTTAGT
CTTACTTTT
TGAAGAACAA
AGTGATTGAA
AAAACAGACA
TGGAAATGCG
CATGACTGAA
TTCCGTAGGA
GGATI'GTGAA
TAAAATTTCT
ACAGTAGCAA
GAAACAACGA
TATGCGAACT
AAATGTATCA
AAGGTTACAA
CAAAATGGAA
GAAATGACCT
GGGAACAATC GAAGGAATCG
AATTAGAAGA
TrACAAGTAT
AAAAAAATGG
CA EATAA
CGCAGACACG
ATAATTCAGT
TAATGATCGC
AATTGATGTT
AAATGATCAA
AGTGTTACT
TAAGCAAGAG 4 5 .45.
S
SS
EF119-4 (SEQ ID NO:44 4 ENKSSKV STEETT'JIET VAREQSKESF TSE-ATKKQTE TTXLEEPDFV KLLEAYGNAY ANFTS INDRN EKLKPLMTEK C IKXNGiDvK TGvALVSVGK VTTIYKNDQH EYALLLDCEQ
NGTQTRVLLL
AXVKNNKI SE MIYNSVKQEY EF120-1 (SEQ ID NO:44 5
TGAATAGGCG
ACTGCTTGTC
GTTGAAGAGA
AGCAGTGTCG
AGTCAGCTAA
TTAGTTGGAA
TGAAAAAGGG
GGGAACCAAA
CGAATGAAAA
AGTCTAACGA
CTGTAGCTGA
TATGGAAAAA
AATGTTAGCG
AGNAAAGAAA
AACGAGTGAA
ATCAGTGAALA
TTTAGATACT
TGGTAAAGGA
TT'TrGTCG GTAACCGC IT
ACAATTGATA
AATGCkAGAGC
ACAGCGATTA
GAGACTTTCA
TGCTACGCT
CAACGGAGGC
AGACAAACGA
CCACACCA
ATGCTGGCGA
TCATTCATCC
TTTATCATTA
ATCCTCTAAA
ACAAGCGAGC
TGGAAACAAT
TTTTACTACT
TGATGGTAGT
217 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes.
ACAAATACCG GAGGAATGAT TACGAAGGAT TCACCTACTG ATGAGTCGCG
ACCAATTACA
AGCTTAAGTA TTAGGTGGGG GCCTACTGGT GCTCCGCTAT TATTATATAA
AATTGGTGTT
EF120-2 (SEQ ID NO:44 6 VKKGMLAF FVVLAVLSLT ACREPKXKKV TASTEASSKV EETNEKTSET
IDKTNEQASS
SVE-SNESVXN EEPTADGNNS QLTVADLDTT AINAGDFTTL VGIWKNGKGE
SLIIHPDGST
NTGGMITKDS PTDESRP ITS LSIRWGPTGA
ALLLYKIGV
EF120-3 (SEQ ID NO:447) AAGAAA GTAACCGCTT CAACGGAGGC
ATCCTCTAAA
GTTGAAGAGA CGAATGAAAA A.ACGAGTGAA
ACAATTGATA
AGCAGTGTCG AGTCTAACGA ATCAGTGAAA
AATGAAGAGC
AGTCAGCTAA CTGTAGCTGA TITAGATACT
ACAGCCATTA
TTAGTTGGAA TATGGAAAAA TGGTAAACGA
GAGAGTTTGA
ACAA.ATACCG GAGGAATGAT TACGAAGGAT
TCACCTACTG
AGCTTAAGTA TTAGGTGCGG GCCTACTGGT
GCTGCGCTAT
AGACAALACGA
CGACA4GCTGA
ATGCTGGCCGA
TCATTCATCC
ATGAGTCGCG
TATTATATAA
ACAAGCGAGC
TGGAAACAAT
TTTTACTACT
TGATGGTAGT
ACCAATTACA
AATTGGTGTT
EF120-4 (SEQ ID NO:448) KKV TASTEASSKV EETNEKTSET
IDKTNEQASS
SVESNESVKN EEPTADGNNS QLTVADLDTT AINAGDFTTL VGIWKNGKGE
SLIIHPDGST
NTGGMITXDS PTDESRPITS LSIRWGPTGA
ALLLYRIGV
EF121-1 (SEQ ID NO:449) 0*
TGAAACACAA
CATTTTTAA
GAAACAACGA
CAAGAACCAG
ACGAGTGACA
GAAACGTCAA
ACGTCAGATG
GTTGGTITT
ACCG=TTAA
AATAAAGCGC
AAGTATGATG
AAAATTCAAC
GGTCTTCGTT
CCAGATTTAA
CCTCGTGTTA
GAGTTGAAAG
AATAGTGATC
TATATTCTCG
ACTGTACCGC
GTTGCTAAAP
ACGAATGTTC
CCAGCGTTT;
ATTAAAGGAJ
GTTCAAAAAC
AAArrrACCTC ACcTTAGTGJ GGAr-GAAATT
TGGCTGTTGC
GTCAACAAAG
TCArrrACACA
GTGTCAAGCA
TCGCTGAAAA
TTCATGGTCA
CCCAAGTAAG
TTGATAATGG
CTTTAGTGAA
CAATGGTTT
AAGAAGCCAC
TTGTTGAAGG
AAGTGGAT
CITCGCTTAA
CTAACGATC-A
CGGCTGCCAG
GTCATGACCA
TAGGGGGACC
ATGCCGATAA
CAGCAGATGA
TTCAGCAGGP
TTCCCGA-AGC
;AAGTAACGGC
;CGCGGAAGAI
'GTGTTCCCAJ
TTGAAAA-AG TI GTTGATAGCG C TTCAGAACCA G GGAAACAACA C GTCACAAGAA
AGAAGAAACG
A ITATGGAAT
TACAGTCGTT
CGACAATATT
TGAAAAGACC
GGGAAATCAT
TTTTCCAATC
GACTACCACG
TATCGGCTTA
TTTTI'TACCT
GGCTGACATT
TGCCGACCAA
CCTI'TCTTT
GAAAGATACG
.GTCGGGAAGTG
AGCAGTTAAG
GATCGGCACA
ACAATTACAA
CGCACAATTA
TTCCTATGCC
r' TALACGGTGAA TGAG-CTTTA AAAAAGTGAA GTGGGGCATG :CAAGTGTTA CTAGTACGGC
ATATGCAGTA
TAACAAGTA CCACCGATTC
AAGTAGAAA
;ACATCAAAC AAGAAGCACC
AAATCAGGCT
CCACAGCAC CAACAGAGAC
GACGAATTTA
kGCACGCCGC AAAAA6ATAAC
AAT'ITTAGGT
rGGTCTTATG AAGATCATA-A
AGAACTACCA
kACCAAGTCC GGGCACAAAA
CCCAGCAGGC
CAAGGCACTA TTTTAACAGA
TGACTTGTAT
CATCCAATGA TCACCGCCAT
GAATGTGATG
GAGTTTAATT TTGG'FTACC
GTTAATCAAA
TTGTCTGCGA ATACCTACAA
TAAGGAAGAT
AAGGAACTTC A'TIT AATCA AGATGGGCAG ACAATTCCGC ACATTCCTIT
GTGGGATGGC
TTGAA-AGALAG AAGCAGAAA6A
AGCAGTTACT
ATTGTTGCCT CGATTCATGC
GGGACAACAA
GTAATTGAAA ATCTCGCGGG CArTTCATGCG ACCAAGCAAG GAGCAGCGCC
GAATGGAA;A
GGGACAGAAG TTGTCAAAAT
TGATCTTTCA
CAAGAAGGTA CAOCAACGAT
TGTACCAACA
GCAGCGACAA AAGAATACCA
TGAAAAAACG
GCAACAGCTG ATITTACC
A.AAACAAGAA
CCAACAGCGA TCATTTC'ITT
AATTAATAAC
AGTGCGGCAG CGCTGTTAA
ATACGACAGT
ACCATTTTTG ATATCTACAA
ATACCCGAAT
AACTTACTGA AGTATTTAGA
AAAACAAG
218 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. foecalis Genes.
j. S
SS
*9 0* S S
SS
S9 9 0 9 S S 9
GCGTACTATA
GTATATAACT
GGTGAACGAA
ACGATTGCTA
GAACCTATTA
AAAAAAGGAA
AATTTITGATG
ATTCCCACTT
CGTAATGCGG
GGCCGACTAG
CAAGAAAACC
AATTT'CTCCA
GCGGTGGGAA
CTGA.ATrrrC
GATCCGTATA
CCAGAAACAG
ATTCCAGAAG
TTCGTGGTTA
ACGCTAGCAG
CATTCGCATA
GGTAATTATT
ACAACAAAAT
GCCATCGTTC
TATAFICCAI
GGGAATTTGP
GATITTTGCAC
GTTGGGGATC
GGCGCCCAAC
AATGGCACAI
CTACATGTT.
GCTAGTGAC(
GAAATTAAA2 ATGGCTACCj
CGAGAAGAA(
TTGCGGTTG'
GGCGAACGAK
ACGCCAAAAI
AATAAACAA
TATGGCGGC'
CTGTTCGTT
TAG
ACCAAACACA
ATGACATGAT
TTGTAGATGC
TGAATAATTA
AAAATTCTCA
CTCTTGATCC
AAAAATGGCG
CTCCTGATGG
GCTTTGAT
AAGCAGGGALA
CAACCTTGAT
AAGGCGCGGA
ATCACGAGTI'
CGATTTITATC
CAATCGTAGA
CAACGAAAAC
TAGAAGCAGT
CTGGGCATTT
AAACCCTTAC
CAGCCGTCGA
TAAATAATGI
TGATTTCAGC
ATGAAGCACC
TCACATTGG;
LTI'GGTGATGC
TAACTAATG(
TCATTGCTG'
TTAAAGAAAr.
TTTTACTAGj r' CAAGCTC CA
'AAGGCAATG,
k ATCGGCAA-Al k ATGATITCT'
GGATTTCAC
r ATCGTGCAG k. TTGTTTCTA
-CAGATCCAA
A ACCAAGCGG T TTTTACCTA G GACTrCT'I
GCCAGATGAT
TTCTGGAGTG
GAAAATTGAC
TCGTTACGG-C
TCCAGAAACC
AGAACAAGAA
TGCCAAAGCA
ACGTACACCA
AGATAATGCA
AGGCGAATTA
GGTGGATGCA
TATGGCCAAA
TGATTTTGGT
TAGTAATACG
AAAATCCGGG
ACACcCGAAA
GATTAAGGAA
AGGCGTAGAT
TCAAACATAT
A.AGTGGCAA-A
TGGGATCGTC
AGCAGAGCTG
TACGAATTTTI
TGGACAACGA
GATTATGTCA
TGGCGGCAT'I
LGTTACCTTTrI C GT'TGAAATC
TGATGCTGGC
r TCGTATCCAC k~ AACAGGACAJ ACAAAAGTTr T AGcTGCTGG'.
T AGATTCTGT( C AACGACGAT' T TTCGGAAGAX A ACCAGACCC( G AGCAAGACAI A AACGGGTAC, 'C TTCTGGCTG' TGACCATTA G
GACTACAAGAT
GGCCCAACCGC T GGTTI'AGCTA G T1TACGAGGAA TI
ATCGAACGAA
ATCGAATTAG AACGCCGCCG C
TATACCATTA
GGTATGGCGC C GGGGATGTTT I GCAATGAATG
TAGAGATT=
TAITACAAAG
AAAAAGTTTG
AACGTAGAGA
AT1'AAAGAGA
GAAACGACGC
CCTGAGTTAG
CGTTATGGCA
ACAGCACCAG
CTAGA.ATTGC
AACGCTGAAA
GAAAATGTGC
TATGGCCAAG
'CGCGCTr.ATA
GGCAATAGCA
TCTGTTCCII'
CAACCAAAAC
TATGATTCCA
SACGATTGTCG
r GAACCATI'GG P GGTGATGGTT1
TTAATTGAAT
r GATTTA~cAC k GCTTACAAAG 3, AAACCAACAC
ACCAATCCAT
A GAAACAGAAA G TATATTTATA
TTTAATCCA
TGACATTTCA
GGATCCTI'-C C CCAAGGGAT T
'GATTGTTFGA
,T'rGGTCAAT G.AATGACGG C TATTACGAA P .'CCACACAAA ;TCTAAAAAC C 'CCAkAGGATT .AGTTGGT1'A
ACTACCTA
MTGCAGTGG2
:CATTGTAGG
kGCGTGACATT
AGTACGCGGA
CGCATATCTG
ATATCACTCGT
AAGTGATCTA
AGAGTGAACC
CAGAAAACCC
ATGAAAAAGT
GCACACGAGA
ACGCGTTAG
TTAAACAACG
TTGCGCAAAT
CCA'ITCCACA
TTGGCGCAAA
CAAAACCAGG
GTAGTCGCGT
ATGAGAAGAA
ACGATATGCT
ACTTGAAAAG
AATATAAAGA
AGTTAATCG
CAGAAACACC
CCGTAACAGA
CGCTTGCATT
AACGACGTAA
AACATTCGT
AAACCAGTG
AAAGAATAT
'CAAGTAGGG
'ATA~rAAG
'ATTGGGACA
ACTCTTrCAA
LCAAGATGTC
G.ACGTTCAT
:'FTAAAGAC
CCAATCTCC
M'ATGCCATG
LAAAGACCAA
kCGGGTTTT
EGTGACGACC
rAAAGACCCG
TATNCAAGCT
GCGTGGTGAT
GATTGATGGA
TGCTCAAACA
AACTAAGAAA
GGCAGTTAAA
AATTGTCGAT
GACCAACTTA
CCAACCTGCT
GCCAATTAAA
TCAAGTAACC
AAAAGATGAG
TGGTGGTTTC
TACTCGCTTG
ATTAGGAATA
ACAATACCGG
AGGTGGTGAA
TGCAACCAGC
ACCATTCCCA
TGGAGGAGAG
AGTAGCAACC
GAAGAAAAAG
ATATGGTTTA
CAAAGCTAGT
.5
OSSS
S
9
S.
EF121-2 (SEQ ID NO:450) V'KKL SFKKVKWGMH FLMAVALIAP SVTSTAYAVE T1TSQQSSE-AV TSTTDS
EPVITQETTD
SDVHGQLWPNW
KAPLVNEKTH
LRFVEGTI'TK
LKANDQADI I
VPVGGPKDTG
AFIQEEIGTA
LPAGKI SYAT YNYDMI SGVD
IKQE-APNQAT
SYEDDKELPV
PMITAMNVMK
ELDFNQDGQP
VAS IHiAGQQN
TEVVKIDLSV
TADFLPKQE I
IFDIYKYPNT
YKIDISKPVG
SDSVKQSQET
GLSQVSTVVN
YDAMVLrGNHE DLKVGIIi3LT
SDPAASADQV
AKNADKWEVQ
KGIPEAQLQP
LVSVPINGEN
ERIVDAKIDG
TAPTETrNLE
QVRAQN~PAGT
FNFGLPLIIK
IPIPLWDGP
IENVAGIDAY
EGTATIVPTT
TAMISLINNV
LLKYLEKQGA
QPLDPAKEYT
TS IAEKEETS
VLIDNGDNIQ
IQQEATFPIL
RVTSLN'FLiPL
ILGHDHLSFT
NVPADEAVKA
QKEVTGAQLS
YYNQTQPDDL
iAMNNYRYGG
:SRKQ
TPQKITILGT
GTILTDDLYN
sANTYNKEDG
KEEAEKAVTE
KQGAAPNGKT
ATI(EY14EKTR
AAALFKYDSK
TISFNPNIRV
LASQGIQVGE
219 TABLE 1. Nucleotide and Amino Acid Seqeuences of Efaecalis Genes.
P IKNSDPETL
PTSPDGRTPN
ENPTLMVDAG
NFPILSSNTY
PEVEAVIXEI
SHTAVESGKR
IVDE-ARTNFN
FAVTNGGGIR
GTILLDDAGQ
IKNRQTQKFE
RLYRAATTID
KQNQACARQS
RGMIVDYIKK
AAAITKQDVR
DVFQGLP ISN
YKDGSGRVFD
KEI(YADXQAF
YGKVIYAQTG
AENEI(VIVDY
ADIKQGPIKV
PKLGANGGFL
PLDEKYKQYRM
LAQYKEPFPG
NPSVTEKKKY
KGTLOPEQEI
NAGFDLDNAY
FSKGADMAKA
PYTIVEKSGK
VVTGHLGVD:E
NYLNNVGIVT
I PFTLDGQR7-
GDVIAVLPFG
HVSSSIRIHY
ATNDFLAAGG
ERIVSISEE-A
GGFLPKTGTE
ERNWSI IGTW
TIMHTNDVHG
MNEVQYDAMA
KFAIVGVTT?
rrTP IWRGDT
APESEPTKKT
NVRTRETNLG
rISIAQIQVTG
DSTKPGTRLA
DGYDMLGGER
YJ(EL IGGGET
TETLALYGLL
FDEKWRAKAI
RLEAGKGELG
VGNHEFDFGL
ETATKTiPK<N
LAETLSQTYP
TKLISAAELL
NLIGDAIMSY
AQVKEMFEMS
S DEC NETGQT
E-EGISLDSVL
PKPDPKPDPK
FVGLSSSGWY
ELVNDGTLQ I
MARLKTFKDQ
EIALCYKDQL
VEKVTFKDP I
ELDITVIDGH
ELPENPAVKA
GQDAFSQPAID
VRSIPQKDEN
IVGSRVLG IE
IEYLKSATSL
PT? ETPVATN
IYKRRNKAS
EF121-3 (SEQ ID NO:4S1) ACAAAG TTCAGAAGCA GTA.ACAAGTA CCACCGATTC AAG I 6e S S 9*
S*
0 0
S*
0@ @0 0 *0
S.
0 S 0 @8
S
0Oe 0
CAAGAACCAG
ACCGTCACA
GAAACGTCAA
ACGTCAGATG
GTTGGTTTGT
ACCGTTTTAA
AATAAAdCGC
AAGTATGATG
AAAATTCAAC
GGTCTTCGTI'
CCAGATTTAA
CCTCGTGTTA
GAGTTGAAAG
AATAGTGATC
TATATTCTGG
ACTGTACCGG
GTTGCTAAAA
ACGAATGTTC
CGAGCGTTTA
ATTAAAGGAA
GTTCAAAAAC
AAATTACCV.
ACCTTAGTG;
GCGTACTAT;
GTATATAACI
GGTGAACGAi
ACGATTGCT)
GAACCTATT2
AAAAAAGGAX
AATTTTGAT
ATTCCGACTr
TCATTACACA
GTGTCAAGCA
TCGCTGAPAAA
TTCATGGTCA
CCCAAGTAAG
TTIGATAATGG
CTTTAGTGAA
CAATGG'TTTT
AAGAAGCCAC
TTGTTGAAGG
AAGTI'GGGAT
CTTCCCTTAA
CTAACGATCA
CGGCTGCCAG
CTCATGACCA
TAGGGGGACC
ATGccGATAA
CAGCAGATGA
TI'CAGGAGGP
ITCCCGAAGC
AAGTAACGGC
CGGGGAAGAI
GTGTTCCCA)
ACCAAACAC;
ATGACATGA
kTTGTAGATGC k. TGAATAATT; k. AAAATTCTGQ
CTCTTGATC(
3 AAAAATGGC( r CTCCTGATG'
GGAAACAACA
CTCACAAGAA
AGAAGAAACG
ATTATGGA.AT
TACAGTCGTT
CGACAATATT
TGAAAAGACC
GGGAAATCAT
TTTCCAATC
GACTACCACG
TATCGGCTTA
TTITTACCT
GGCTGACATT
TGCCGACCAA
CCTT1'CTT'I
GAAAGATACCG
*GTGGGAAGTC
*AGCAG?'A.AC
*GATCGGCAC;
ACAATTACA;
CGCACAATT.'
7TTCCTATGC( 7TA.ACGGTGAJ kGCCAGATGAK TTCTvCGAGIX
GAAAATTGA(
k. TCGTTACGG k. TCCAGAAAC,
:AGAACAAGA
3 TGCCAAAGC
ACCTACACC
GACATCAAAC
ACCACAGCAC
AGCACCCCGC
TGGTCTTATG
AACCAACTCC
CAAGGCACTA
CATCCA-ATGA
GAGTTTA.ATT
TTCTCTGCGA
AAGGAACTTG
ACAATTCCGC
TTGAAAGAAG
ATTGTI'GCCT
GTAAITGAAA
ACCAAGCAAG
GGACAGAAG
CAAGAAGGTA
GCAGCGACA
GCAACAGCT'C
CCAACAGCGP
AGTGCGGCAC
ACGAI
9
'=IC
k. AACTTACTG; r T1'GACCA~rl
GACTACAAGI
GGCCAACCG(
-GGITAGMT
-TTACGAGCA,
A
1
ATCGAACGAK
A ATCGAATTA, A AACGCCG
'AGAAAA
AAGAAGCACC
CAACAGAGAC
AAAAAATA.AC
AAGATGATAA
GGGCACAAAA
TTrAACAGA
TCACCGCCAT
TTGTTACC
ATACCTACAA
ATTTTA.ATCA
ACATI'CCTTI'
AAGCAGAAA
CGATTCATGC
ATGTCGCGGG
GACCAGCGCC
T'rCTCAAAAT
CAGCAACGAT
AAGAATACCA
ATTTTTACC
L MAT'IrC'TI
CGCTGTTTMJ
ATATCTACPJ
SAGTATTTAGI
SGTTTTAATCC
k. FIACATTT(
:TOGATCCTG(
k. GCCAAGGGAr k. TGATTGTTGJ k. AT1'CGTCAA'
TGAATGACGI
AAATCAGGCT
GACGAATITA
AATTTTAGGT
AGAACTACCA
CCCAGCAGGC
TGAC ITGTAT
GAATGTGATG
GTTAATCAAA
TAAGGAAGAT
AGATGGGCAG
GTGGGATGGC
AGCACTTACT
GGGACAACA-A
GATTGATGCG
GAATGGAAAA
TGATCTTTCA
*TGTACCAACA
*TGAAAAAACG
*AAAACAAGAA
*AA'rTAATAAC
ATACGACAGT
ATACCCCAAT
AAAACAAGGG
AAACATTCGT
AAAACCAGTG
CAAAGAATAT
r TCAAGTAGCG k. TrATA'rTAAG r TA E'GGGACA 3 CACTcTTCAA *0
OSSS
0
S
8SOO e.g.
5 *000 00 S 0* *5 EF121-4 (SEQ ID NO:452) QSSEAV TSTTDSSRKQ EPVITQETTD IKQEAPNQAT SDSVKQSQET TAPTETTNLE- TSIAEKEETS TPQKITILGT SDVHGQLWNW SYEDDKELPV GLSQVST'JVN QVRAQNPAGT VLIDNGDNIQ GTILTDDLYN 220 TABLE 1. Nucleotide and Amino Acid Seqeuences; of E. faecalis Genes.
KAPLVNEKTH
LRFVEGTTTK
LKAIWQADI I
VPVGGPKDTG
AFIQEEICTA
LPAGKISYAT
YNYDMI SGVD P IKNSDPETL PMITAMVMKl
ELOFNQDGQP
VAS IHAGQQN
TEVVKIDLSV
TADFLPKQEI
IFDIYKYPNT
YKIDISKPVG
RGMIVDY I.KI
YDAMVLGNHE
OLKVGIIGLT
SDPAASADQV
AKNADKWEVQ
KGIPEAQLQP
LVSVJPINGEN
ERIVDAKIDG
KGTLDPEQEI
FNFGLPLIKK
IPHIPL~WDGP
I ENVAG IDAY
EGTATIVPTT
TAmI SLINNV
LLKYLEKQGA
QPLDPAKEYT
ERNWS IIGTN IQQ EATFPIL
RVTSLNFLPL
ILGHDHLSFT
NVPADEAVKA
QKEVTGAQLS
YYNQTQPDDL
IAMNNYRYGG
FDEKWRAKAI
SANTYNKEDG
KEE-AEKAVTE
KQGAAPNGKT
ATKEYHEKTR
AA.ALFKYDSK
TISFNPNIRV
LASQG IQVGE ELVNDGTLQ I PTSPOGRTPN A EF122-1 (SEQ ID NO:453)
TGAAACACAA
CATTTITTAA
GAAACAACGA
CAAGAACCAG
ACGAGTGACA
GAAACGTCAA
ACGTCAGATG
GTTGOTTTGT
ACCGTTTTAA
AATAAAGCGC
AAGTATGATG
AAAATTCAAC
GGTCTTCGTT
CCAGATI'AA
CCTCGTGTTA
GAGTTGAAAG
AATAGTGATC
TATATTCTCC
ACTGTACCGG
GTTGCTAAAA
ACGAATGTTC
CGAGCGTTTA
ATTAAAGGAA
GTTCAAAAAC
AAATTACCTC
ACCTTAGTGP
GCGTACTAT;
GTATATAACI
GGTGAACGAI
ACGATTGCT)
GAACCTATTJ
AAAAAAGGAk AATT1'TGAT(
ATCCGAC?
CGTAATGCGI
GGCCGACTA,
CAAGAAAAC
AATITrCTCC
GCGGTGGGA
CTGXI'TTT
GATCCGTAT
GGAGGAAATT
TGGCTGTTGC
GTCAACAAAG
TCATTACACA
CTrGTCAAGCA
TCGCTGAAAA
TTCATGGTCA
CCCAAGTAAG
TTGATAATGG
CTI'TAGTGAA
CAATGGTTTI'
AAGAAGCCAC
TTGTTCAAGG
AAGTTGGGAT
CTTICGCTTAA
CTAACCATCA
CGGCTGCCAG
GTCATGACCA
TAGGGGGACC
ATGCCGATAA
CAGCAGATGA
TTCAGGAGGP
TTCCCGAAGC
AAGTAACGGC
CGGGGAAGAI
GTGTTCCCAI
ACCAAACACJ
ATGACATGA'
k rGTAGATG( k. TGAATAATT, k. AAAATTCTG, k. CTCTITGATC 3AAAAATGGC' r CTCCTGATG S GCTTTGAT' 3 AAGCAGGGA C CAACC7I'A A AAGGCGCGG A ATCACGAGIl 'C CGATTTTIT 'A CALATCGTAC
TGTGAAAAAG
GTTGATAGCG
TTCAGAAGCA
GGAAACAACA
GTCACAAGAA
AGAAGAAACG
ATTATGGAAT
TACAGTCGTT
CCACAATAT
TGAAAAGACC
GGGAAATCAT
TTTI'CCAATC
GACTACCACG
TATCGGCTTA
TTTTTTACCT
GGCTGACATrI
TGCCGACCAA
CCTTTCTMr
GAAAGATACC
GTGGGAAGTC
AGCAGTTAAC
GATCCCCACI
AC A 'TACAJ
CGCACAAT]L
TTCCTATGC(
pTAACGGTGAI k. GCCAGATGA! C. TTCTGGAGTI
:GAAAATTGA
k. TCG ITACGG N TCCAGAAAC
CAGAACAAGA
G TGCcAAAGC C ACGTACACC T AGATAATGC A AGGCGAATI .T GGTGGATGC A TATGGCCAJ 'T TG;ATTrI'GC IC TAGTAATA( :A AAAATCCGC
TGAGCTTTA
CCALAGTGTTA
GTAACAAGTA
GACATCAA.AC
ACCACAGCAC
AGCACGCCGC
TGGTCTTATG
AACCAAGTCC
CAAGGCACTA
CATCCAATGA
GAGTT.TAATI'
TTGTCTCGCGA
AAGGAACTTG
ACAATrCCGC
TTGAAAGAAC
ATTGTTGCCI
GTAATTGA-A
ACCAAGCAAC
GGGACAGAAC
CAAGAAGGTI
GCAGCGACAJ
kGCALACAGCT( k CCAACAGCQi k AGTGCGGCA(
:ACGATTTTTI
k AACTI'ACTG.
r TTIGACCATT
GACTACAAG
C GGCCAACCG C GGTT'AGCT C TTACGAGGA A ATCGAACGA A ATCGAATTP 'A AACGCCGCC A TATACCAT" .A GGTATGGCC :A GGGGATGTI LA GCAATGAX7 'T TTAGAGAV1 :G TATTACAAL .G AAAAAGTT AAAAAGTGAA G' CTAGTACGGC A
CCACCGATTCA.
AAGAAGCACC A CAACAGAGAC G~ AAAAAATAAC k AAGATGATAA A' GGGCACAAAA C TTI'TAACAGA T TCACCGCCAT
G
TTGGT1'TACC C ATACCTACAA TI
ATTTTAATCA
ACATTCCTT G
AAGCAGAAAAA
CCATTCATGC G ATGTCGCGGG C GAGCAGCGCC C TTGTCAAAAT kCAGCAACGAT k. AAGAATACCA
'ATI'TTACC
k TGAT'IrCTTT 3 CGCTG'ITTAA 3ATATCTACAA A~ AGTATITAGA A GTT'IAATCC A TrGACATTTC C T'GGATCCTGC A GCCAAGGGAT A TGATT1GTTGA A ATTGGTCAAT LG TGAATGACGG :G CTATTACGAA 'A TG-CACACAAA ;C GTCTAAAAAC TCCAAGGA T LG AAGTI'CG IWA IG CACTAGGCTA kG ATGGCAGTGG rG CCATTGTAGG
[GGGCATG
LATGCAGTA
!GTAGAAAA
kATCAGGCT
ACGAATTIA
.TTITAGGT
GAACTACCA
CCAGCAGGC
GACTTGTAT
AATGTGATG
TTAATCAAA
'AAGGA-AGAT
,GATGGGCAG
TGGGATGGC
~GCAGTTACT
~GGACAACAA
;ATIGATGCG
;AATGGAAAA
['GATCTTTCA
IGTACCAACA
rGAAAAAACG k.AAACAAGAA
AXTTAATAAC
ATACGACAGT
ATACCCGAAT
AAAACAAGGG
AAACATTCGT
AAAACCAGTG
CAAAGAATAT
TCAAGTAGGG
ITATATTAAG
TATTGGGACA
CACTCTTCAA
ACAAGATGTC
TCACGTTCAT
CTTTAAAGAC
ACCAATcTCC
TGATGCCATG
TAA.AGACCAA
ACGGGTT'ITTT
TGTGACGACC
221 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. [accalis Genes.
CCAGAAACAG
ATTCCAGAAG
TTCGTGGTTIA
ACGCTAGCAG
CATTCt3CATA
GGTAATTAT
ACAACAAAAT
GCCATCGII'C
TATATTCCAT
GGGAATTIGA
GATTTGCAG
GTTGGGGATG
GGCGCCCAAG
ALATGGCACA-A
CTACATGTTT
GCTAGTGACG
GAAATTAAAP
ATGGCTACCP
CGAGAAGAAC
'PTGCGGTTGI
GGCGAACGA;
ACGCCAAAA(
AATAAACAAJ
TATGrGCGGCI'
CTGTTCGTT(
CAACGAAAAC
TAGAAGCAGT
CTGGGCATTT
AAACCCTTAG.
CAGCCGTCGA
TAAATAATGT
TGATTTCAGC
ATGAAGCACG
TCACATTGGA
TTGGTGATGC
TmAACTAATGG
TCATTGCTGT
TTAAAGAAAT
T'TTTACTAGA
CAAGCTCCAI
AAGGCAATGAl
ATCGGCAAAC
LATGATI'CT'
GGATTTCACI
ATCGTGCAG(
TTTTCTW~
CAGATCCAAJ
k ACCAAGCGG( r' TITTACCTAK 3 GACTTTCTI
ACACCCGAAA
GATTAAGGAA
AGGCGTAGAT
TCAAACATAT
AAGTGCCAA.A
TGGGATCGTC
AGCAGAGCTG
TACCAATTTT
TGGACAACGA
GATTATGTCA
TGGCGGCATT
GTTACCTTTT
GTTTGAAATG
TGATGCTGGC
TCGTATCCAC
AACAGGACAJA
ACAAAAGTT
AGCTGCTGG'l
AGATTCTGTC
AACGACGAT9J V' TrCGGAAGAJ k. ACCAGACCCC 3AGCAAGACA( k AACGGGTAC) Z TTCTGGCTG(
AACGTAGAGA
ATTAAACAGA
GAAACGACGC
CCTGAG'ITAG
CGI'ATGOCA
ACAGCACCAG
CTAGAAMrC AACGClrGAAA
GAAAATGCC
TATGGCCAkAG
CG.CGCTGATA
GGCAATAGCA
TCTGTTCGII'
CAACCAAAAC
TATGATTCCA
ACGAMTTCG
*GAACCATrGC GGTGATGCGn TTAATCA-A I
SGATTTAGCAC
SCCTTACAAAC
AAACCAACAC
3 AGCAATCCA'
GAAACAGAA)
TATA=~AT
AGGTGACA-T TI
AGTACGCGGA
CGCATATCT--('G
ATATCACTGT G
AAGTGATCTA
AGAGTGAACC2
CAGAAAACCCC
ATIGAAAAAGT
GCACACGAGA
ACGCGTTI'AG
TrAAACAAGG
TTGCGCAAAT
CGATTCCACA
TIGGCbCAAA
CAAAACCAGG
*GTAGTCGCGT
ATGAGAAGAA
ACGATATGCT
ACTTGAAAAG
AATATAAAGA
AGTTAATCGG
CAGAAACACC
r' CCGTAACAGA
CGCTTGCAT
AACGACGTAA
AAAGACCCG
.'ATNCAAGCT
CGTGGTG-AT
ATTGATG-GA
.GCTCAAACA
ACTAAGAAA
;GCAGTTAAA
k.ATTGTCGAT
;ACCAACTTA
CCAACCTGCT
GCCAATTAAA
rCAACTAACC
AAAAGATGAG
TGGTGGTTTC
TACTCCCTTG
ATTAGGAATA
ACAATACCGG
AGGTGGTGA
TGCAACCAGC
ACCATTCCCA
TGGAGGAGAG
AGTAGCAACC
GAAGAAAAAG
ATATGGTTTA
CAAAGCTAGT
TAG
ZF122-2 (SEQ ID NO:454) VKKL SFKKVKWGMH FLMAVALIAP SVTSTAYAVE- TTSQQSSEAV TSTTDSSRKQ EPVITQETTD IKQEAPNQAT SDSVKQSQET TAPTTTNLFE TS IAEKEETS TPQKITILGT SDVHGQLWNW SYEDDKELPV GLSQVSTVVN QVRAQNPAGT VLIDNGD)NIQ GTILTDDLYN KAPLVNEKTH PMITAMNVMK YDAMVLGNHE FNFrGLPLIKK IQQEATFPIL SANTYNKEDG LRFVEGTTTK ELDFNQDGQP DLKVGIIGLT IPHIPLWDGP RVTSLNFLPL KEEAEKAVTE LKANDQADII VASIHAGQQN SDPAASADQV IENVAGIDAY ILGHDHLSFT KQGAAPNGKT VPVGGPKDTG TEVVKIDLSV AXNADKWEVQ EGTATIVPTT NVPADE-AVXA AT1(EYHE(TR AFIQEEIGTA TADFLPKQEI KGIPE-AQLQP TAMISLIN~NV QKEVTGAQLS AAALFKYDSK LPAGKISYAT IFDIYKYPNT LVSVPINGEN LLKYLEKQGA YYNQTQPDDtJ TISFNPNIRV YNYDMISGVD YKIDISKPVG ERIVDAXIDG QPLDPAKEYT IAANNYRYGG
LASQGIQVGE
?IKNSDPETL RGMIVDYIKX KGTLDPEQEI ERNWSIIGTN FDEKWRAKAI ELVNDGTLQ I PTSPDGRTPN AAAITKQDVR NAGFDLDNAY TIMHTNDVH-G RLEAGXGELG MARLKTFKDQ ENPTLMVDAG DVFQGLPISN FSKGADMAKA MNE:VGYDAMA VGNHEFD)FGL
EIAL~GYKD)QL
NFP ILSSNTY YKDG-SGRVFD PYTIVEKSGI( KFAIVGVTTP ETATKTHiPKN VEICVTFXDP I PEVEAVIKEI KEKYADXQAF VVTGHLGVDE TTPI{IWRGE)T LAETLSQTYP
ELDITVIDGN.
SHTAVESGKR YGKVIYAQTG NYLNNVGIVT APESEPTKKT TKLISAAELL ELPENPAVKA IVDEARTNFN AENEKVIVOY IPFTLDGQRE NVRTRETNLG NLIGDAIMSY CQDAFSQPAD FAVTNGGGIR ADIKQGIKV GDVIAVLPFG NSIAQIQVTG AQVKEM4FEM4S VRSIPQKDEN GTILLDDAGQ PKLGANGCFL HVSSSIRIHY DSTKPGTRLA SDEGNETGQT IVGSRVLGIE IKNRQTQKFE PLDEKQYRM ATNDFLA.AGG DGYDMLGGE-R EEGISLDSVL
IEYLKSATSL
RLYRAATTID LAQYKEPFPG ERIVSISEEA YKELIGGGET PKPDPKPDPK
PTPETPVATN
"Y GGFPT(T 1'TTLALYGLL FVGLSSSGWY IYKRRNKAS Lmro EF122-3 (SEQ ID) NO:455) 222 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
TG AAAAAiTGGCG TGCCAALAGCA ATCGAATTAG TGAATGACGG CACTCTTCAA
ATTCCGACTT
CGTAATCCGG
GGCCGACTAG
CAAGAAAACC
AA'TTTCTCCA
GCGGTGGGAA
CTGAAT=TC
GATCCGTATA
CCAGA.AACAG
A 1TCCAGAAG
TTCGTGGTTA
ACGCTAGCAG
CATTCGCATA
GGTAATTAT
ACAACAAAAT
GCCATCGTTG
TATATTCCAT
GGGAATTTGA
GATTTTGCAG
GTTGGGGATG
GGCGCCCAAG
A.ATGGCACAA
CTACATGTT
GCTAGTGACG
GAAATTAAAA
ATGGCTACCA
CGAGAAGAAC
TTGCGGTTGT
GGCGAACGAA
ACGCCAAAAC
AATAAACA-U,
CTCCTGATGG
P~
CCTTTGATTT AAGCAGGGAA CAACCTTGAT C
AAGGCGCGA
ATCACGAGTT
CGA'InTTATC
CAATCGTAGA
CAACGAAAAC
TAGAAGCAGT
CTGGGCAT
AAACCCTTAG
CAGCCGTCGA
TAAATAATGT
TGATTTCAGC
ATGAAGCACG
TCACATTGGA
TTGGTGATGC
TALACTAATGG
TCATTGCTGT
TTAAAGAAAT
TT'rTACTAGA
CAAGCTCCAT
AAGGCAATCA
ATCGGCAAAC
ATGATrITCTT
GGAITTCACT
ATCGTGCAC
TTGTTTCTAT
CAGATCCAAA
ACCAAGCGG
~CGTACACCAA
~GATAATGCAT
~GGCGAATTA G
GTGGATGCA
rATGGCCAAA G rGATTTTGGT TI rAGTAATACG
AAAATCCGGG
PCACCCGAAA
GATTAAGGAA AGGCGTAGAT C
TCAAACATAT
AAGTGGCAAA
TGGGATCGTC
ACCAGAGCTG
TACGAATTTT
TGGACAACGA
GATTATGTCA
TGGCGGCATT
GTTACCT'fl
GTFTTA.AATG
TGATGCTGGC
TCGTATCCAC
AACAGGACAA
ACAAAAGTTT
AGCTGCTGGT
AGA'ITCTGTC
AACGACGATT
TTCGGAAGAA
ACCAGACCCG
AGCAACACAC
ACGCCGCCG
ATACCATTA
.GTATCCGCGC
GGGATGTT
CAATGAATG
'TAGAGA ETG .'ATIrACAAAG
LAAAAGTTTG
ACGTAGAGA
~TTAAAGAGA
.AAACGACGC
:CTGAGTTAG
:GTTATGCcA
%CAGCACCAG
CTAGAA7MTC
AACGCTGAAA
GAAAATGTGC
TATGCCCAAG
CGCGCTGATP
GGCAATAGC.z
TCTGTTCGM'
CAACCAAAAC
TATGATTCCI
ACGATTGTCC
CAACCA ITG( GGTGATGGI L
TTAA'ITGAA'
GATTTAGCA(
GCTTACAMC
AAACCAACA
AGCAATCCA
CTATTACGAA ACAAGATGTC TGCACACAAA TCACGTI'CAT GTCTAAAAAC CTTTAAAGAC .TCCAAGGATT ACCAATCTCC AAGTTGGTTA TGATGCCATG CACTACGTTA TAAAGACCA.A ATGGCAGTGG ACGGTTTT CCATTGTAGG TGTGACGACC AGGTGACATT TAAAGACCCG AGTACGCGGA TATNCAAGCT CGCATATCTG GCGTGGTGAT ATATCIACTGT GATI'GATCGA AAGTGAITCTA TGCTCAAACA AGAGTGAACC AACTAAGAAA CAGAAAACCC GGCAGTTAAA *ATGAAAAAGT AATTGTcGAT GCACACGAGA GACCAACTTA ACGCGTTTAG CCAACCTGCT TTAAACAAGG GCCAATTAAA TTGCGCAAAT TCAAGTAACC CGATTCCACA AAAAGATGAG 'FTGGCGCAAA TGGTGGTTTC SCAAAACCAGG TACTCGCTI'G 3GTAGTCGCGT ATTAGGA-ATA ATGAGAAGAA ACAATACCGG 1ACGATATGCT AGGTGGTGAA r ACTTGAAA.AG TGCAACCAGC CAATATAAAGA ACCAFTCCCA G AGTTAATCGG TGGAGGAGAG CCAGAAACACC AGTAGCALACC TP CrGTAACAGA GAAGAAAAAG
S.
S
S
S
TATGGCGGCT TT EF122-4 (SEQ ID NO:456) EKWRAXAI ELVNDGTLQ I
SS
PTSPOCRTPN
ENPTLMVDAG
NFPILSSNTY
PEVEAVIKEI
SHTAVESGKR
IVDE.ARTNFN
FAVTNGGI R
GTILLDDAGQ
IKNRQTQKFE
RLYRAATI D
AA.AITKQDVR
DVFQGLPISN
YKDGSGRVFD
KEKYADXQAF
YGKVIYAQTG
AENEKVIVDY
ADIKQGPIKV
PKLGANGGFL
PEJDEKRQYRM
L.AQYKEPFPG
NAGFDLDNAY
FSKGADMAKA
PYTIVEKSGK
\TVTGHLGVDE
NYLNNVG IVT
IPFTLDGQRE
GDVIAVLPFG
HVSSSIRIHY
ATNDFLAAGG
ERIVSISEE-A
TIMHTNDVHG
MNEVGYDAMA
KFAIVGVTTP
TTPHIWRGDT
APESEPTKKT
NVRTRETNLG
NS IAQIQVTG
DSTKPGTRLA
DGYDMLGGER
YKELIGGGET
RLEAGKGELG
VGNHEFDFGL
ETATKTHPKN
LAETLSQTYP
TKLISAAELL
NMIGDAIMSY
AQVKEMFEMS
SDEGNETGQT
EECISLDSVL
PKPDPKPDPK
MARLKTFKDQ
EIALGYKDQL
VEKVTFKDPI
ELDITVIDGH
ELPE-NPAVKA
GQDAFSQPAD
VRS IPQKDEN
IVGSRVLGIE
IEYLKSATSL
PTPETPVATN
KQNQAGARQS NPSVTEKIGCY GGF EF123-1 (SEQ ID NO:457) TAAAATAAAA AATTGGTACG AAGTGAACGT TCTCITCTAT GTGTCGTTAG TAGAGGAAGG ATCAAAGAAA TGAGAAAGAA TGGTCCAATG GTAAACCGTT GGCTCTACGG GTI'GATGTGT 223 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. fuecalis Genes.
a a .t
TTGTTACTTG
GATGGCCACT
CTTCAAGGAA
TCAATCAAAG
CAGAAGAAAC
ACGTTGAACC
TTGACGTTAA
CCATCACCGA
TCCGAAGAAA
ACTGTCAAAC
ACAACCATTA
ACACCGCCGT
GATGTGCGAG
AAACCTAATA
TACACAATTA
AGTGACATTC
CCAGGAGATT
ATTCTCCCAG
CCTAGTGCGA
ACTGTGACGC
GTGATGAATC
ACCGTTGATA
GAGTACCAAA
AAAAATCACC.
GTTACCGCCA
CAAGAATTCP.
GCACTCATTA
TATTCAGTGAP
AAAGATTAC;
GTGACTGGCC
GTTCCCGTG]
GCCAGTCAA(
GGTTCGACC'
ACGTACGAA(
ACCACTGGT'
GATGGTGAA,
TTCCACATA.
TTGGACCAT
CATTCAGAAl
AGCCGTGTT
AATAATCGT
GCTGGGAGC
AA.ACCAACG
ACTTGGCGT
GTI'GATGAA
CAACGTTC'I
AAAA.AAGGI
AATGGCGCC
GTCCTACAI
ACGCCAGAI
GATAATGAJ
TATATGGAI
AACCAAGTC
TTCTAAATTA TG TAACGTTAGC AG AAGCACAACC AG CTGCACATTG
GC
ACATTCAAAT TG CTACAGCTAC AC AGACTGGTAC TC ATGAAGGTTC Ad CTGTCGCCAG Cl CGCGCGTAGC
AC
TCCATAATTT
T
ATAAAGAAGA
T(
AACAAATGAA A( AACCAr.CTTC
A(
CTGAACATCG
T.
ACGGGGACTT
T'
GGGTGATTGA
T.
ATACCGAACA
A
TTACTTGGAC
G
AAACATGGCCA
TTCATGGAACA
AAAATGGCAA
*CGACGATGA-C
*CCACGTTAACA
CATATGGCAAA
CTTGGGAAAT
T
CAGACACAAT
C
CATTTGATGA
C
AAGTGGTAAT
C
CAGTCAAGAT
kATAATCGTGT
AAAATATTAT
r' TACCTGAA
:CAGTTCCTGG
3 CTCAGTTGAC k. CAGGCTI'TAA PL CTI'ATACTAC r ATCGAAATAC 3 ATAGTAAACC T ACA.ATCCCCT T TACTCGACGC T TGAAAGTCTA C AACCCTTCAC G T'rGATTTTCC A AAG'ITATCGA "r CACGTOATGT 'C GCGAATACCA !C AATCGGTTTT CCGAGTCA1T 'A AATCTGTTAT A CAGGACAACA kT ATCGTAC'F1T IT CAATTACTGG GCACACCA CT AACTGAAG CA TAACACAA GA CAGCGCCC A.A ;AATTGAAT CA ;AAGATGTG AC ATCCGACA GA CAGAAGAA GC .CGACAAAA C 3GACCAACA G; ~AAGATCCG _A9 :;TGACCATT Cl GCAGGCGAT TI GGTGATTTA
G~
ACGTTCGT T' TCTTTAGAT A( ATICCTACA C~ CAAATTGAT A GTAGATATC A.
ACAGGGAAT A ATrAAAGAA G
GTGACGATTA
'GAGGCGGTTA
,AGTGATAATA
.ATGTTACACA
~AACTACAAC T ;CCGGATAAT T :AAACGAAAT G
AACGGAGACC
L'CATTATAAA ;GAT'FIGGC
;Z
rAAAAACACTC rcAAAATAAT
CTTAACTATG
GTTAGGCAAG
GGTAAG=TT
CTTTTTCCAT
CGCTGCCATT
GITI'AAACCT
CACCAAAGAA
cITTrCACG
TGAAGGCAAT
GGATATCACA
TAATGATAGT
AGG ETCGGCT
GACAGGAAAA
CAAACATGAT
AGACGATGTG
GGTGA'I TAC
TTTAGAACAA
AAAAATTGTC
AGTGACTTCT
AAATGGTTCA
CAGTGGCGGG
CATGGCTT TG( AACCAGCC AG( GGTTGTAG TG( .TAATACGC GC.
.GCAAGCGT TA' C PTTTC~T1 AT LATCAACGG CA ATCTACAA AC :GATAGAAA GT ~TATCAGTG AT ~TTATTTAA AT k=TTGAACT T ~CTTCGAGT TT IWGATGCAG AP rTACCTTTA A) CTCATTI'GA A9] AAGAAGATT IX A.ACAAGGCC AX.
ATCAAGCGA
T
CCTTTAAGT C( TGC'GTCGCG
A.
AAGGTGACA C ,TTCCAGATG
G'
ATCCAA-ATG 0 AGCCCAATA T
'ATGGTGAACA
'TAACG TI'C A- ;AAGTCGTTG
G
;GTI'CCTTTG C CCAAAGTTCG A ~CTIGGTCAGC A
;GTGCAGTTGA
LATITGATGG
3TACCCAATT C 3ATTTCATGG I)
ATAGGGGCGT
GTTACCGAGT
GATTGGACCG
TTACCTGCTIT
ATCACTTCCA
CATCCAATT'
ACAAAGCCAG
ATGGAAGAAC
CGTACGTATG
AGTI'ATGACA
GTTrCTATTC
CCAGATCATG
GTTATTACTG
GGTACCAACG
CGAAAAGATT
GTTAAAATCCG
TCAGCGGCGG
GAAGTCCTTC
CATGCC:ACAG
CGGAAGA
CAAGAAAT
CATTATAG
AAGATTCA
GCAGATAC
GCACAACA
ATCACGAG
TCCTCTGT
AAAACAAC
TATTTTAC
'CCTGATGG
'AACTGGTC
CAATTACC
GCAATGT
rGAGCGAA'I
~GATTCAGP
CCGCCTGI
~TTTGATCC
3AAAGATCI
:CTGAAAG
.CTTAGTC(
CAACAAAG(
CGGCGGCG
~TIAGATG'
AGATTACO.
AACCATI'C
ACCAGATT
AGCAGAAC
'AATTGACT
~TGGAA=~
~T1CAGAAC
LTTATCAAA
U".ATGCCG
GCTCGTTG
~AGAAATAJ
kTGCCAAAI
IAACGCTI
bLTGAAGCAC
TTGATTI'AJ
CGATTGCGC
1'AACCAAT(
ATGGTTCG
CAAGCGAG
TGATTGAA:
ATACCGCA
AACATGGT
TGTACIG
ATACACCC
TAACAGAA
ACACACTG
CCCATATI
GGAGTAC)
ATGGGGAI
GGACTAAl GGT1'AACAGC
GACCTTAGCG
TGCCAATGTC
AGTGGATGAC
GTTAGTCTTA
GCAACGAGCG
TTCGCCAGCC
TC'CTCGTTCG
TCAATCGACG
AGGTGATGAA
AACACCAGCA
GATTCCAGAA
TGGCAATTIG
*CTATGGAACC
*CACGTCTGAA
TGGGCGGGGC
AGTGATTCCA
AACGCCCAAT
AACAAATCCA
r' CTATGAGTTA
:AGATGAATAT
CGTATCGTCTT
A. TGTGCCT
CTGAAGCAACT
A. CCAAGCCAAT C AAAAGACCAA C =FACATTTA T TCTCGAAGGA T TTTACATGAT T CGAAGGCGAT A TCATGCCACA
ATTCAACGATT
T GATTACGGAT T CAAACATACA LC TCGTAATGCA ~C AAGTGATGCC
~ACAATCCTGCG
;G AAACAATCAT
.TGCGCAAAAA
'T TAATTI'AAGT CA AAccTATTTG 3T TlGAAAAAGTG A~A AAACCAAAAT
ITTAACACCTCT
TC TTATACAAAC CC CCAATCAGTG CA TGTAATGATC TC ACCA.AACCAA GA CGGAACTATT GA AGTTACCACC 'A AGCACCTTAT LGA CACGGTATCC CA CAATCGCGAT ~cGAAAATTCAC GTGCTCGTTG
ACATTGATCA
224 TABLE 1. Nucleotide and Amino Acid Seqeuenccs of E. faecalis Genes.
CTGA.AGAAAA
GACCAAGCTA
TTTGGTGGGT
ACAG'TTCGG
GAAGGAGCAC
GATGAGAAGG
CCGTTTACTC
rTAGAGGTGG
GGCTATCT'T
ATI'CGTGATG.
GACCAAGCAG
GCAG'ITCGAG
CCAGGAA.AAT
CCAAGTGCTI'
GCTAACTTTG
F1'ATTAAGTG
TTGACGACAA
TTTGTAGAAA
ATTGCTGAGA
TACAAAGGCG
GT=TGAAT
GATGCCAAAA
GCACCAAAAT
AAAGGACGCG
TTTAAATTGT
AAAGATGG
GATGCAACGG
TCAAATGATA
GGACGTAAAG
GCCGATGAGC
AAAGTTTCTG
GGGCATTACG
CATCCAT'ITG
ATCAATTATC
GGTGCGGTGI
ATGTCTGACP
GTGGAGACAC
ATTGCCAAAC
TACCAAGGTC
GAA'ITAAAC
AACCAAGGC J
AAAGCACCGC
GCCAAAGACI
TCCAAAAAAJ
CTTGG'ITTA(
TTGGTCGG7I
CAGCCATGGA
AAACACAAGT
TGCCACAAGG
ACGANPI'AGC
AACCAACCAT
GTAAAAAGTT
ATTGGGAAGA
ATAGTTTAAA
TAGACACGAC
TTCATCTCA-A
GCAATCCATT
AACACTTAGT
ATCAATTTGT
TCACGATTGC
TTAACTATCA
GTGCCACAT
ATAATCAAGG
CCAAAGCGCC
AAAATGCTGG
CTTTCCAAAT
TATATGATCA
TTATCTTAG-
TACCAGATGG
ATTTCAAAGC
CCGTCTTTAP
ATCGAATCG;
CATI'GGCTA]
aTGGCTATA9 AACAACCAC9
TCAACGAGC.'
AGAATCAGC(
AAATAACAA(
TTITTAGTGGJ
ATGTAACCG(
AAGGAACTG(
TTAAGGTCA!
AGCAAGGCK
AAGCGCCAA,
ACAACCAAG,
CTGCCAAGC'
TGACAGATG
TAGTTCAAG
ATGGTI'ACC
k AACCTGAAC k. CAAAACCAA
:CGAAAACCA
r TGGCAAGTI
TGAGACGACT
CCTACGTGAA
GCAATACA'TT
TAAAGGCCGA
TATTAAAAAC
AGTCAAkTGCT
AGTTCCCCTT
ACCAGGGCTT
CCCCAAACOA
AATGCTTAAT
AGCAGGTGCT
TI'CGGATGCA
GGAAACCAAA
AGCAAGCCAT
AGGCACGGCT
TAAAGTGC'T
GGAAATTGTT
AACAGGCTAT
TAAACCAGCG
CGTGAAAACG
*CAATAA.ACAA
AGACTTGGCG
CGCAGATTAT
TGATCCGGAG
GAAAATTGAT
AAACGGGGAA
GGAGGATTA
CGTCA.ATAAA
7ACATGAGTTA kAGGTC-AAACC
'CCAAGGTTC-A
GATAAGAC'I
AACGAAAGC;
:CCAATTAGG;
ZTCAATTAACC
r TGATGAAAC; k AGTCATTGCC TAGCTATCT9 3CAAACCAGC( T GGTGAAAAT' C AGAGACAGG( T GAACCACTI'i A ACTGTCTAA( T CGTGAATGC( A TCAGCCAAC, A CACACA-AGT, C GCTCTTCTA
ATTTTAGCAG
GGTACAGTAG
TTGGTGGAGA
GTCATTACTA
GATGTCA-ATA
CGCTTTAAAT
GCCCCATC
TATCAGTTCA
TCATCGTGA
TACCAAGGT
GAATT ETCAG
AAC-GGAAAAG
GCGCCAGCAG
CGGGGCAAAC
AAATTAATCA
GATGCGAAGG
GCAGAGCACT
TTATTAAATA
GTCGTGGTI'G
AATAGCGCAC
TCATAGGGA
CCAGGTACCI
ATTA'PTTATC
ATTTTCCAAM
GCCA.ATGCGP
AAAATCTTTC
GGTGCTGGT;
CAACCCATT1
GAGTTTGTA;
TTAGCGGGTC
CCGATAACA'
GGCGAAA~r'
LCCAACAGGC'
AAAGAGCAG(
AAAGAAAAC(
GGGCA.AACCI
AAAAACT1'AI r' CTTAAT-GAA
:ACTGTGGTA
r~ GATCAGCAA
SCAAACTGTC
k. CAACCAGGA
CAAGCTGTC
3 GGCACGTI= A, ACGAAACAA C AATTAC ITC T AAAAAGAGC
GCGCCCA=I
ATGCCACCGG
CAAAAGCACC
TT'-ATGAAGA
AAGTATrTTT
TAGAGC-ATGC
GAACCAACGC
CAGAAATCGA
CACAAAATAC
CTGCTGAACT
TCCTTGACAC
TCACAGTGAC
GGTACCTTT
CAGCAAgCAGT
AAAAAGATGT
GAGAAACGAT
TAGCCCCAGG
CCACGCCAGT
CTAGTGACAA
ACCAACCATT
TTACAGCAAC
ATTATTACAA
CTGAATTAGT
TAGGGGCCT
ACCCACTTCC
AAAGAGAAGT
LGCTATCAATT
ATTTAGT
k ATATCAAGC CAGTTTrGC r TCTTGA.ATCG r ACGCTAAAGG- I' ATCTGTTAGP CAATrC= 3 AAACAGGTG; 3TAGATGGAC;
CACCGGGAAC
A, CGCCAAGCGC C TTAAAGCAC( A AGA-ATGCC' G CTCGTTCM A AATATACCT G CATTCACTA' G ITAACGAGAX .G CAGCTAGAG, T TTGTCTITTAN !A AGAAATAA
CCAAATTTGG
GCTTATCACA
AGAAGGCTAT
AACTTCACC
ACAAAAAATG
CGTAACCACG
GAATGGC CAG
AGCACCGACA
GAGCGGACAA
AATTAAAAA
CACAGGACAA
GGA PTTAGCC
AAACACTGAA
TA'CAGCAACG
GAATGGACAC
TCAA.ACAGGC
AAAATATCGC
CCCATITGAA
CTTTGTGAGT
AGCAGGTG7CT
GAGTGGCAAA
AGAAATCAAA
AAAAGTAGAA
CGCCAATTTC
AGGAACGATT
AACTGCTGAA
AGATGAACTCG
GAAGAAGAAT
AGAAGTAATG
AATTTACAAT
TGCAGGAGAA
GCTAA.ATGAA
CACAACGCTA
AGGCGATCTT
LAGCATTGGCA
AACCAATCTG
GTATCCTI=
AAGCTTTACG
TTTTAT'rAAT rAGCAGOTOCT r' ACGTTCTGAC r' TGTGGAAACA r TcGcccAACA k. ACAACcTGTA k. GACAGGTTGG r CGGCCTCATG EF123-2 (SEQ ID NO:458) MRKNGPMV NRWLYGLMCL LLVLNYGTPL MALAEEVNSD GQLTLGEVKQ TSQQEMTLAL QGXAQPVTQE VVVHYSANVS KKQIQIELNQ QALADTLVLT LNPTATEDVT FSYGQQQRAL SANEGSTEEA STNSSVPRSS EEIVASITKA IESKTTES-, IKAAHWAAPN NTRKIQVDDQ TLXTGTDPTE STAITS SPAA VKPRVAGPTD I SDYFTGDET TABLE 1. Nucleotidc and Amino Acid Seqeuecoces of E. faecalis Genes.
TI IDNFEDPI
PNKPGSGDLV
GDWVIDIPTQ
VTETWPTGNT
YQTTIDEAVI
EFTWEINYNY
DYKVVINGDG
SQQNI IKNTG
TGAQLTLGKD
DHYRNTAAID
NRLVDAFLTO
WRVDFPNDSR
KGGEYHKDDF
PDKSVILEEC
QVS ITGNGSE
QAKTQVLREC
GAQPTI IKNI
ZVDSLKPGL)
QAGNPLAGAJ
SAFTIAASDI
TTNNQGEIV
KGAFQIVKfl
PKLPDGADY
KLYRIENGE
NDKQPLDEL
VS=EITTDKT
NYQGTAQLT
ETQAPTSYL
FKVTDAETC
KDKPELVN.D
YLNPDGTPAT
DAEGNVYTY
EDLPPWVIP I
FKSVKVYELV
PDGGGDVPFK
GEQTIPKDQA
SFAIDFU4DV
AVDYQNSTIG
FMVEITRNAD
WTDEAGNNHH
P1 LTNQTYLA
TYVIEFKTSV
DHVYWHVMIN
KDYTLEVTTO
VVHGDONGDV
TVDATGVITF
\TNKVFLEKMC
QFTEIEAPTG
FSVLDTTGQP
RGKPATVIATP
k. EH-LAPGKYRI >1 SADQPLAGA\ I IYPELVKVE] K IFEREVTAEI E FVNYQAEVMC G EIYAKGLNE( Y, ENETGE-ALA( L NETPSASFT ;Q TVARSLRSD ,G TFVNEKQPV PPYKEDvTIH
TISED--TVRF
VPDTEQQ
IDK
MLDGTIKEV
NHATL.TSONN
VITDTMGDNL
TGAVKI DYKT
WTLAVNQNN
GETGFKVSF
I
SEDSKPFKPL
GSLKVYEGNI
DERVIEGSAS
GAQSVLDDV'J
NETGQQKIV%
VVDIDHSGG'-
GGLPQGQYII
EKGKKLVNAI
YLLDTTPKRI
VREHLVSDAI
NFVNYQGTA2
VETKAPTGY
IFELYDHNKQ
IRGDFKGDPE
DCSLAMEDL
3RKV&EQGQT 3HYVLVETKA 3AVFKVIDEl I AKDNQGKPP' N QGLVQVNHI S KKTKPNQP1 WNFNWS IPED
TFNERITSES
QGHFDRTP.NP
GRELS PDEYT
PNGLDAEXITV
TFEPDSLHLY
KVDG IVEGOV LMENAV ITDT
GAYAKTSDAF
PAFDLNAQKS
KpD-svEK YDaTASYTNQ
ITDTPSPNQV
KMAH IEAPYY I ATCTKGKIQL
.VETKAPEGYT
k FKLEHAVrP FIVTQNTSrCQI q G1(VTVTDLAr K LIKKD)VNGHI
LNTTPVPFE]
S LGITATSGKE I FQL.GAFANFI G AGSYELDELI L AGAVFAIYNj .P TGYLLDTTLI 'G QTVDGQTNLI LT VVLKAPFI9 ,Q PGKYTFVE-r 'T KQAARETGW VREQMflAGDY Fl DIHGDFSLD)T
HI
SAITWTVDIN
Q~
VDKNGNVTIK
GI
TATYGKrLDK
R~
svTFD)DKGNE V AVNNRVDVGT
G
YEPVPGLTMV P HITYTI'FFDV
T
GVYNAVTKEI
PTQPLTUITM E G;SSRDVTGKV S
LDPESLVIYG
M.EYRSLVTSS
KKTAMDETTII
*VSDELAKGRV
FTHWEEVPLA
*RDVHVKMLNY
GKYQFVETKA
LSGATFKVLD
-AEKNAGKPAV
)GKIIFRDLAP
(GRAVFKKIDA
D ATDGYIVNKQ
DEQNQP-.QGSP
Hl PFDVTAQLGK A4 SDIKQGKVIAK -Y GAAKLVKID K APDGYQLSKQ L GLPKTNTQVN
:,FQLPGNLK
LNDSDGRGP
AMKDQTN'PT
OTNKAYRLE
N'IDYDEANQ
VGAELVEGK
QHSEDDGTA
NSLVVKDT
'ELDANNPAL
W(tTIAVNLSN
;EPSEKNQNT
IQHGGESVK
'NVTEDGTIT
AGSTDTVSN
.AGAHFQIWD
ITI DEETSAE
PORTNANGQL
2GSAELIKKD
PAGYLLNTEP
AKGETIQTGL
VVASDNFVSY
GTYYYKEI KA
NANPLPGTIF
P IYFVVKKNS
ITFLNRAGEK
EQP IAILODLI NLAPGT-YRFv
QQKNALAGAE
AVAFTIAATA
YFFVF IGLNL VGLASWLFYK
KSKK
EF123-3 (SEQ ID NO:459) GGAAGA GGTI'AACAGC GATGGCCAGT TAACGTTAGG
AGAAGTGAAG
CTTCAAGGAA AAGCACAACC
AGTAACACAA
TCAATCAAAG CTGCACATTG
GGCAGCGCCC
CAGAAGAAAC AGATTCAAAT
TGAATGAAT
ACGTrGAACC CTACAGCTAC
-AGAAGATGTG
TTGACGTTAA AGACTGGTAC
TGATCCGACA
GCATCAGCGA ATGAAGG7I'
AACAGAAGAA
TCCGAAGAAA CTGTCGCCAG
CACGACAAAA
ACTGTCAAAC CGCGCGTAGC
AGGACCAACA
ACAACGATTA TCGATAATT
TGAAGATCCG
ACACCGCCGT ATAAAGAAGA
TGTGACCATTI
GATGTGCGAG AACAAATGAA
AGCAGGCCXI
AAACCTAATA AACCAGGTTC
AGGTGATTTP
TACACAATA GTGAAGATGG TACGGT'rCG'I AGTGACATC ACGGGGACTT
TTCTTI'AGA]
CCAGGAGATT GGGTGATTGA
TATTCCTAC;'
ATTGTCCCAG ATACCr;AACA
ACAAATTGA)
CCTAGTGCGA TITACTTGGAC
GGTAGATAT(
ACTGTGACGG AAACATGGCC
AACAGGGAA'
CAA.ACCAGCC
GAGGTI'GTAG
AATAATACGC
cAGcAAGCGT
ACGTTITTCTT
GAATCAACGG
GCATCTACAA
GCGATAGAAA
GATATCACTG
ATTTATTAA
CATTGGAACT
AGCAAGAAAT
TGCA7ITATAG GCAAGATrCA
TAGCAGATAC
ATGGACAACA
CAATCACGAG
ACTCCTCTGT
GTAAAACAAC
ATTATITTTAC
ATCCTGATGG
TrAACTGGTC
GACCTTAGCG
TrCCAATGTG
AGTGGATCAC
GTTAGTCTTA
GCAACGAGCG
'rTCGCCAGCC
TCCTCGTTCG
TCAATCGACG
ACGGTGATG-
AACACCAGCA
GATTCCAGAA
TGGcAATTTG
CTATGGAACC
CACGTCTGAA
TGGGCGGGGC
AG;TGATTCCA
AAC-GCCCAAT
AACAAATCCA
CTATGAGTTA
G=rATGCAG
AAGGCAATGT
I'IACC'TI'A ATGAGCGAAT ACTCAIUITA
ATGATTCAGA
CAAGAAGATT TGCCGCCTCGT AAACAA~ccc A'TTIrGATCG AATcAAGCGA
TGAAAGATCA
ACCTLTAAGT
CCGTGAAAGT
226 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
GTGATGAATC
ACCGTTGATA
GAGTACCAAA
AAAA.ATCACG
GTTACCGCCA
CAAGAATTCA
GCAGTCATTA
TATTCACTGA
AAACATTACA
GTGACTGGCG
GTTGCCGTGA
GCCACTCAAC
GGTTGGACGT
ACGTACGAAC
ACCACTGGTG
GATGGTGAAA
TTCCACATAA
TTGGACCATT
TTGATGGAAC
A.AAATGGCAA
CG;ACGATI'GA
CGACGTTAAC
CATATGGCAA
CTTGGGAAAT
CAGACACAAT
CATTTGATGA
AAGTGGTAAT
CAGTCAAGAT
ATAATCGTGT
AAAATATTAT
TAGCTGTGAA
CAGTTCCTGG
CTCAGTTGAC
CAGGCTTTA-A
CTTATACTAC
ATCGAAATAC
AATTAAAGAA
TGTGACGATT
CGAGGCGGT
AAGTGATAAT
AATGTTAGAC
TAACTACAAC
GGGCGATAAT
CAAAGGAAAT
CAACGGAGAC
TGATTATAAA
GGATGTTGGC
TAAAAACACT
TCAAAATAAT
CTTAACTATG
GTTAGGCPLAG
GGTAAGTT
CT1T='CGMr
CGCTGCCAT'I
GTGGGTCGCG
AAAGGTGACA
ATTCCAGATG
AATCCAAATG
AAGCGCAATA
TATGGTGAAC
TTAACG=IIG
GAAGTCG'ITG
GGTTCCTITG
ACCAAAGTG
ACTGGTCAGC
66TGCAGTTG TAmTGAMG
GTACCCAATT
GATI'CATGG
ATAGGGGCGT
GTTACCGAGT
GATTGG
AACTI'AGTCC
CCAACAAAGC
GCGGCGGCGA
GGTTAGATGC
TAGATTACGA
AAACCATTCC
AACCAGATTC
GAGCAGAACT
CAATTGACTT
ATGG-AATTGT
ATTCAGAAGA
ATTATCAAAA
AAAATGCCGT
CGTTGG TTGT
TAGAAATAAC
ATGCCAAAA-C
TAGACGCTAA
AGATGAATAT
GTATCGTC'TT
I'GTGCCTTIWT
TGAAGCAACT
CGAAGCCAAT
AAAAGACCAA
TFITACATTTA
TGTGGAAGGA
TTTACATGAT
CGAAGGCGAT
TMATIGGCACA
'rrCAACGATT
GATTACGGAT
CAAAGATACA
TCGTAATGCA
AAGTGATGCC
CAATCCTGCG
EF123-4 (SEQ ID NO:460) a.
p a
EEVNSD
GQLTLGEVKQ
KKQIQIELNQ
SANEGSTEEA
TI IDNFEDPI
PNKPGSGDLV
GDWVIDIPTQ
VTETWPTGNT
YQTTIDE-AVI
EFTWEINYNY
DYKVV INGDG SQQNI IKNTG
TG.AQLTLGKD
TSQQEMTLAL
QALADTLVLT
STNSSVPRSS
YLNPOGTPAT
DAEGNVYGTY
EDLPPVVI PI
FKSVKVYELV
POGGGDVPFK
GEQTIPKDQA
SFAIDFLHDV
AVDYQNSTIG
FMVEITRNAD
QGKAQPVTQE
LNPTATEDVT
EETVASTTKA
PPYKEDVTIH
TISEDGTVRF
VPDTEQQIDK
MNLDGTIKEV
NHATL.TSDNN
VITDTMGDNL
TGAVI DYKT
WTLAVNQNNY
GETGFKVSFI
VVV}IYSANVS
FSYGQQQRAL
I ESKTT1ESTT
WNFNWSIPED
TFNERITSES
QGHFORTPNP
GRELSPDEYT
PNGLDAEATV
TFEPOSLHLY
KVDGIVEGDV
LMENAVITIDT
GAYAKTSDAF
IKAAHWAAPN
TLKTGTDPTE
VKPRVAGPTD
VREQMXAGDY
DIHGDFSLI'r
SAITWTVDIN
VDKNGNVTIK
TATYGKMUJK
SVTFDDKGNE
AVNNRVDVGT
YEPVPGLTMV
H ITYTTFFDV
NTRKIQVDDQ
STAITSSPAA
ISDYFTGDET
FeEFQLPGNLK
HLNDSDGRGP
QAMZDQTNPT
GDTNKAYRLE
RNIDYDEANQ
VVGAELVEGK
GQHSEDDGTA
PNSLVVKDTT
TELDANNPAL
DHYRNTAAID W EF124-1 (SEQ ID NO:461) a.
TAAAATAAAA
ATGAAAGAAA
TTGTTACTTG
GATGGCCAGT
CTTCAAGGAA
TCAATCA-AAG
CAGAAGAAAC
ACCTTGAACC
TI'GACGTTAA
GCATCAGCGA
TCCCAAGAAA
ACTGTCAAAC
ACAACGATTA
ACACCGCCGT
GATGTGCGAG
AATTGGTACG
TGAGAAAGAA
TITCTAAATTA
TAACGTTAGG
AAGCACAACC
CTGCACATTG
AGATTCAAAT
CTACAGCTAC
AGACTGGTAC
ATGAAGG'ITC
CTGTCGCCAG
CGCGCGTAGC
TCGATAATT
ATAAAGAAGA
AACAAATCAA
AAGTGAACGT
TG.GTCCAATG
TGCACACCA
AGAAGTGAAG
AGTAACACAA
GGCAGCGCCC
TGAATTGAAT
AGAAGATGTG
TGATCCGACA
AACAGAAGAA
CACGACAAAA
AGGACCAACA
TGAAGATCCG
TGTGACCT
AGCAGGCGAT
TCTCTTCTAT
GTAA6ACCGTT
CTCATGGCTT
CAAACCAGCC
GAGGTI'GTAG
AATAATACC
CAGCAAGCGT
ACGTTCIT
GAATCAACGG
GCATCTACAA
GCGATAGAAA
GATATCAGTG
ATITATTTAA
CAITGGAACT
TACTTCGAGT
GTGTCGTTAG
GGCTCTACGG
TrGGCGGAAGA AGCAAGAA6AT
TGCATTATAG
GCAAGATrCA
TAGCAGATAC
ATGGACAACA
CAATCACGAG
ACTCcTCTGT
GTAAAACAAC
ATTATTTrAC ATccTGATGG
TTAACTGGTC
TTCAATTACC
TAGAGGAAGG
GTrGATGTGT
GG'ITAACAGC
GACC ITAGCG
TGCCAATGTG
AGTGGATGAC
GTTAGTCTTA
GCAACGAGCG
TTCGCCAGCC
TCCTCGTTCG
TGAATCGACG
AGGTGATGAA
A6ACACCAGCA GA 1TCCAGAA TGGcAATTTrG 227 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.f[aecalis Genes.
a a.
a a a a a a a *a AAACCTAATA A TACACAATTA G' ACTCACATTC AC CCACCAGATT CC ATTGTCCC-AG A CCTAGTCA T t ACTCTGACGG A
CTCATGAATCT
ACCGTTGATAA
CACTACCAAA C AAAAATCACG C GTTACCGCCA C CAACAATTCA C GCAGTCATTA C TATTCACTGA C
AAACATTACAA
GTCACTCCCG C
CTTGCCGTGAA
GCCACTCAACA
CGTTGGACGTI
ACGTACCAAC C ACCACTCCTG C CATCGTGAAA C TTCCACATAA C
TTGGACCATT
CATTCACAAC I
AGCGGTCTTT
AATAATCGTT
CCTGGCACCT
AAACCAACC
ACTTCGCGTG
CTTGATGAAA
CAAGGTTCTT
AAAAAAGGTC
AATGCCGCCC
CTGCTAGATC
ACGCCAGATA
CATAATGA;A
TATATGGAAT
AACCAAGTGT
CTGGTCGTTG
CTGAAGAA.AA
GACCAAGCTA
TT1TGTGGGT
ACAGTTTCGG
GAAGCAGCAC
GATGAGAAG
CCCITTACTC
TTAGACGTG
GGCTATCTIP
ATTCCTGATC-
CACCAAGCAG
GCAGTTCGAG
CCAGGAAAAT
kCCACCTI'C
MGAAGATGG
CCGGGACTT
GGTGATI'CA
rACCGAACA rACTI'GGAC
;LACATGCC
TCATGGAAC
P.AATGGCAA
GACCATTGA
GACGTTAAC
ATATGGCAA
TCGCGAAAT
ACACACAAT
ATTTIGATCA
ACTGGTAAT
AGTCAACAT
.TAATCGTGT
.AAATA'ITAT
'ACCTCTGAA
:AGTTCCTCC
:TCAGTTGAC
ACGCTTTAA
T'TATACTAC
LTCGAAATAC
~TACTAAACC
kCAATGCCCT
CAGTCGACGC
rGAAACTCT.A kACCGTTCGAC rTAT1TICC kAGTTATCGP
CACGTGATG]
GCCAATACCI
AATCGGTI
CCGAGTCATJ
AATCTGTTA)
CAGGACAAC)
ATCGTACTT"
CAATTACTG(
ACATTIOATC
CACCCATCG4
AA.ACACAAG'
TGCCACAAG,
ACGAATTAGI
AACCAACCA
GTAA.AAAGT
ATI'GGGAAG
ATAGTIAA
TAGACACGA
TTCATCTCA
GCAATCCAI
AACACTTAG
ATCAATT'C
TACGCTTCGT
TTCTTTAGAT
TATI'CCTACA
ACAAkATTGAT
GCTAGATATC
A.ACAGCAAT
AAT1'AAAGAA
TCTCACGATT
CGAGCCG'P1'
AAGTCATAAT
AATCTI'ACAC
TAACTACAAC
GCCCATA.AT
CAAACGAAAT
CAACGGAGAC
TGATTATA-AA
GGATGTTGGC
TTTACCTTTA
ACTCATTTGA
CAAGAAC.ATT
AAACAAGGCC
AATCAAGCGA
ACCT=AACT
CTCCGTCCGCG
AAAGGTGACA
ATTCCAGATG
AATCCAAATG
kAGCGCALATA
TATGGTGA.AC
'rrAACGTITTC
CAAGTCOTTG
GGTTCC'ITTG
ACCAAAGTTC
ACTGGTCAGC
TAAAAACACT GCTGCAGTTG
TCAAAATAAT
CTTAACTATC
G'I"IAGGCAAG
CCTAAGTTT
cTTrTCGAT
CGCTCCCATT
GTTTAAACCT
CACCAAAGAA
CTTTICACG
TGAAGCCAAT
GGATATCACA
TAATGATAGT
ACGTTCCGCT
GACAGCAAAA
CAAAGATGAT
ACACGATGTG
GGTGA=TAC
TTTAGAAGAA
AAAATTGTC
AGTGACTI'CT
AAATGGTTCA
CAGTCCCGGC
TGAGACCACT
CCTACGTGAA
GCAATACATT
TAAAGGCCGA
TATTAAAAAC
AGTCAATCC I
ACTTCCCCTI
LACCAGCTI
CCCCAAACGP
SAATGCTTAAI
ACCAGCTGC9 7 TTCGGATCCI r' GGAAACCAA)
TATTTGATGC
GTACCCAATT
CAT'FrCATGG
ATAGG'CGT
GTTACCGAGT
GATTCGACGG
TTACC [CCTT ATCAC~rGGA
GATCCAATTT
ACAAAGCCAG
ATCCAAGAAC
CGTACGTATG
AGTTATGACA
GTrTCTA TCM
CCAGATCATG
G ITATTACTG
GGTACCAACG
GGAAAAGATIr
GTTAAAATGG
TCAGCGGCGG
CAAGTCCTTC
CATGCCACAC
ATTTTAGCAC
CCTACACTAC
TTGGTGGAGI
GTCATTACT)
CATGTCAATJ
CGCTIAAAN
GCGCCCGAT(
TATCACPTC
LTTCATCGTG.
TACCAAGGT
GAAT'I=CA
AACGGAAAA
GCGCCAGCA
ACGTGATI'TA GTTGAT,-CAG AAGGCAATGT CTATCCAACC kTGACAAT CACGTCTCAA PTGAITCAGA TGGGCGGGGC rCCCGCCTCT AGTCATTCCA A.TTTTCATCG AACCCCCAAT TGAAAGATCA AACAAATCCA CCCTGAAAGT CTATGAGTTA AACTTACTCC AGATCAATAT CCAACAAAGC GTATCGTCTT GCGCCGGCCA TGTGCCTTTT GGTTAGATCGC TGAAGCAACT TACATTACCA CGAAGCCAAT AAACCATTCC AAAAGACCAA AAccCATA'TC TTTACATI'A GAGCAGAACT TGTGGAAGGA CAATTGACTT TTTACATGAT ATCGAATTCT CCAAGCCGAT AT'TCACAACA TGATGCCACA A'PTATCAAAA TTCAACCATT AAAATGCCCT CATTACGCAT CGTTCGTTGT CAAACATACA TAGAAATAAC TCGTAATGCA ATGCCAAAAC AAGTGATGCC TAGACGCTAA CAATCCTGCG ATGAAGCAGG AAACAATCAT TTGATTTAAA TGCCGCAAAAA CGA CGCT TAATTTAACT TAACCAATCA AACCTATTTG ATCCTTCGGT TGAAAAAGTG CAAGCGACAA AAAcCAAAAT TCATTGAATT TAAGACGTCT ATACCCCATC TTATACAAAC AACATCGTCG CGAATCAGTG TCTACTCCCA TGTAATGATC ATACACCCTC ACCAAACCAA TA.ACACAAGA CGGAACTATT ACACACTCGA AGTTACCACC CCCATATTGA AGCACCTTAT CGAGTACAGA CACGGTATCC ATCGGCATGA CAATGCCAT GGACTAAACC
CAAAATTCAG
GCGCCCATTT ccAAATITCCG ATGCCACCGG GG'ITATCACA ~CAAAAGCACC
AGAACCCTAT
STTGATGAAGA AACTTCAGCC ~AAGTATrTTT AGAA.AAAATC rTAGACCATCC CGTAACCACG C AACCAACGC C-AATGCCCAG A CACAAATCCA
ACCACCOACA
A CACAAAATAC CAGCCCACAJA T CTCCTGAACT AATTAAAAAA G TCCITTGACAC CACACGACAA CTCACACTGAC CGAF'TACC GGCTAcC'TI' AAACACTCAA 228 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
C CAAGTGCTT
GCTAACTTTG
TTAI'AAGTG
TTACGACAA
TTI'GTAGA;A
ATTGCTGAGA
TACAAAGGGG
GTTTI'GAAT
GATGGCAAAA
GCACCAAAAT
ATTCGTGGTG
AAAGGACGCG
T'ITAAATT
AAAGATGGTT
GATGCAACGG
TCAAATGATA
GGACGTAAAG
GCCGATGAGC
AAAGTTTCTG
GGGCATTACG
CATCCATTTG
ATCAATTATC
GGTGCGGTGT
ATGTCTGACA
GTGGAGACAC
ATTGCCAA.AG
TACCAAGGTCG
GALATTTAAAC
AACCAAGG I
AAAGCACCGC
GCGAAAGACI
TCCAAAAAAI
CTGGT'TAC
TTGGTCGGTr.
TCACGATFGC
TTAACTATCA
GTGCGACATT
ATAATCA-AGG
CCAAAGCGCC
AAAATGCTGG
CTTTCCAAAT
TATATGATCA
TTATCTTTAG
TACCAGATC
A TrCAAAGG
CCGTCTTTAA
ATCGA.ATCGA
CATTGGCTAT
ATGGCTATAT
AACAACCACT
TCAACGAGCA
AGAATCAGCC
AAATAACAAC
TTTTAGTGGA
ATGTAACCGC
AAGGAACTGC
TTAACCTCAT
AGCAAGGCAA
AAGCGCCAAC
ACAACCAAG
CTGCCAAC
TGACAGATGC
TAGTTCAAGI
ATGGTTACC-P
AACCTGAAC I
SCAAAACCAAJ
CGAAAACCA-1 r' TGGCAAG=T
AGCAAGCGAT
AGGCACGGCT
TAAAGTGCTT
CCAAATTG-T
AACAGGCTAT
TAAACCAGCG
CGTGAAAACG
CAATAAACAA
AGACTMGCG
CGCAGATTAT
TGATCCGGAG
GAAAATTGAT
AAACGGGGAA
GGAGGATTTA
CGTCAATAAA
AGATGAGTTA
AGCTCAAACC
CCAAGGTTCA
GGATAAGACT
AACG-AAAGCA
CCAATTAGGA
TCAATTAACC
TGATGAAACA
AGTCATTGCG
TAGCTATCT'I
CAAACCAGCC
GGTGAAAAT']
AGAGACAGGC
GAACCACTT7
ACTGTCTAAC
CGTGAATGCC
TCAGCCAACI
CACACAAGT(
GCTCTTCTA'
CGGG4GCAAAC
AAATTAATCA
GATGCGAAGG
GCAGAGCACT
TTATTAAATA
GTCGTGGTTG
AATAGCGCAG3
TCAI'AGGGA
CCAGGTACCT
ATTATTTATC
ATTTTCCAAT
G6CCAATGCGA
AAAATCTTTG
GGTGCTGGTA
CAACCCATTT
GAGTTTGTAA
TTAGCGGGTG
CCGATAACAT
GGCGAAATTT
CCAACAGGCT
AAAGAGCAGC
AAAGAA.ACG
*GGGCAAACCC
AAAAACTTAC
*CTTAATGAAP~
*ACTGTGGTAC
GATCAGCAA);
cAAACTGTCC
CAACCAGGPO
CAAGCTGTCC
GGCACGTT(
k. ACGAAACAA(
-AATTACTTCI
r' AAAAAGArGCji
CAGCAACAGT
AAAAAGATGT
GAGAAACGAT
TAGCCCCAGG
CCACGCCAGT
CTAGTGACA
ACCAACCATT
TTACAGCAAC
ATTATTACAA
CTGAATTAGT
TAGGGGCCTT
ACCCACTTCC
AAAGAGAAGT
GCTAIT GATT ArTTGTAGT
ATTATCAAGC
CAGITmGC
TCTTGAATCG
ACGCTAAAGG
ATCTGTTAGA
CAATTcCTTT
AAACAGGTGA
TAGATGGACA
CACCGGGAAC
CGCCAAGCGC
TTAAAGCACC
~AGAATGCCTI
CTCGTTCATI
SAATATACCTI
CATTCACTA]
T TAACGAGAJ; 3 CAGCTAGAGI r TTGTCTT'Ar.
k. AGAAATAA
TATAGCAACG
GAATGGACAC
TCAAACAGGC
AAAATATCGC
CCCATTTGAA
CTTGTGAGT
AGCAGGTGCT
GAGTGGCAAA
AGAAATCAAA
AAAAGTAGAA
CGCCAATTTC
AGGAACGATT
AACTGCTGAA
AGATGAACTG
GAAGAAGA6AT
AGAAGTAATG
AATTflACAAT
TGCAGGAGAA
GcTAAATGAA
CACAACGCTA
AGGCGATCTT
AGCATTGGCA
AACCAATCTG
GTATCGTTTT
AkAGCTTTACG
TITTTATTAAT
AGCAGGTGcT
ACG;TTCTGAC
TGTGGAA6ACA TGCGGCA6ACA
ACAACCTGTA
k. GACAGGTTGG r CGGCCTCATG B B B
B
B
B
B
B
B
B. B B B
B.
EF124-2 (SEQ ID NO:462) MR1QNGPMV NRWLYGLMCL LLVLNYGTPL MALAEEVNSD
GQLTLGEVKQ
KKQIQIELNQ
SANEGSTEEA
TIIDNFEDPI
PNKPGSGDLV
GDWVIDIPTQ
VTETWlPTGNT
YQTTIDE.AVI
EFTWEINYNY
DYKVVINGDG
SQQNI IKNTG
TGAQLTLGKD
DHYRNTAAID
NRLVDAFLTD
WRVDFPNDSR
KGGEYHKDDP
TSQQEMTLAL
QALADTLVLT
STNSSVPRSS
YLNPDGTPAT
DAEGNVYGTY
EDLPPVVIPI
FKSVKVYELV
PDGGGDVPFK
GEQTIPKDQA
SFAIDFLHDV
AVDYQNSTIG
F14VEITRNAD
WTDEAGNNHH
PILTNQTYLA
TYVIEFKTSV
DHVYWHVMIN
QGKAQPVTQE
LNPTATE-DVT
EETVASITKA
PPYYKEDVTIH
TI SEDGTVRF
VPDTEQQIDK
MNLDGTIKEV
NI{ATLTSDNN
VITDTMGDNL
TGAVKIDYXT
WTLAVNQNNY
GETGFKVSF I
SEDSKPFKPL
GSLKVYEGNT
DEKVIEGSAS
GAQSVLDDVV
VVVHYSANVS
FSYGQQQRAL
IESKTI'ESTT
WNFN'WSI PED
TFNERITSES
QGHFDRTPNP
GRELSPDEYT
PN~GLDAE.ATV
TFEPDSLHLY
KVDGIVEGDV
LMENAVITDT
GAYAKTSDAF
PAFDLNAQKS
KPDGSVEKVK
YDNTASYTNQ
ITOTPSPNQV
IKAAHWAAPN
ThKTGTDPTE
VKPRVAGPTD
VREQfrflAGDY
DIHGDFSLDT
SAITWTVD IN
VDKNGNVTIK
TATYCKM~LDK
SVrFDDKGNE
AVNNRVI)VGT
YEPVPGLTMV
H ITYTTFFDV GVYNiAVrKE I
PTQPLTDITM
GSSRDVTGKV
LDPESLVIYG
NTRKIQVDDQ
STAITSSPAA
ISDYFTGIDET
FEFQLPGNLK
HLNDSDGRGP
QAMKDQTNPT
GDTNKAYRLE
RNIDYDEANQ
VVGAELVEGK
GQHSEDDGTA
PNSLvVKDTT
TELDANNPAL
TwJTIAVNLSN
EEPSEKNQNT
S IQHGGESVK
TNVTEDGTIT
TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
PDKSVILEEG
QVS ITGNGSE
QAKTQVLREG
GAQPTI IKND
EVDSLKPGLY
QAGNPLAGAE
SAFTIAASDR
TTNNQGEIVA
KGAFQIVKTN
PKLPDGADY I KLYR IENGEK
I'DKQPLDELE
VSE ITTDKTG
NYQGTAQLTK
ETQAPTSYLL
FKVTDAETGQ
KDKPELVNAG
KDYTLEVTTD
WVHGDDNGDV
TVDATGVITF
VNKVFLEKMD
QFTEIE-APTG-
FSVLDTTGQA
GKPATVIATA
EHLAPGX(YRF
SADQPLAGAV
IYPELVKVEI
IFEREVTAEK
FVNYQAEVMG
EIYAKGLNEG
ENETGEALAG
NETPSASFTI
TVARSLRSDN
TFVNEKQPVS
NETGQQKIVV KQMIPEAPYY VVDIDHSGGH ATGTKGKIQL GGLPQ2GQY IL VETKAPEGYT EKGKXLVNAR FKLEI14AVTTP YLLOTTPKRF IVTQNTSGQ I VREHLVSDAN GKVTIVTDLAP NFVNYQGTAK LIKKDVbJGHL VETKAPTGYL LN'ITPVPFE I FELYDHNKQS LGITATSGKD RGDFKGDPEI FQLGAFANrFX DGSLAMEDLG
AGSYELDELD
RI4VNEQGQTL AGAVFAIYNA HWVLVETKAP TGYLLD-TTLH AVFKv: DETG QTVDGQT-NLM AKDNQGKPAT VVLKAPFINTY QGLVQVNHLQ PGKYTFVETK KI<TKPNQPTT KQAARETGWL MEYRSLVTSS AAGSTDTVSN KYKTAMDETTI LAGAHFQ IWD
VSDELAK(GRV
FTHWEEVPL.A
RDVHVKMLNY
GKYQFVETKA
LSGATF'KLD
AEKNAGKPAV
PDRTNANGQL
QGSAELIKKD
PAGYLLNTEP
AKGE-TIQTGL
VVASDNFVSY
GKI IFRDLAP GTYYYKEIKA GRAVFKK IDA NANPLPGTIF ATDGY IVNKQ
DEQNQPQGSP
PFDVTAQLGK
SDKQdKVIAK
QGAAKLVKID
APDGYQLSKQ
ITFLNRAG:EK
EQPIALGDLI
NLAPGTYRF-V
QQKNALAGAE
AVAFTIAATA
GLPKTNTQVN 'YFFVFIGLML VGLJASWLFYK KSKK EF124-3 (SEQ ID NO:463) TGCCTTCCACATAACTTATACTACCTTT=GACG GATCCAATIT TAACCAATCA AACCTATTTG C C
C
C.
C C
C
C
C
GCTGGGAGCT
AAACCPACGC
ACTTGGCGTG
GTITGATGAAA
CAAGGTTCTT
AAAAAAGGTG
AATGGCCCCC
GTGCTAGATC
ACGCCAGATA
GATAATGAAA
TATATGGAAT
AACCAAGTGT
GTGGTCGTTG
CTGAAGAAAA
GACCAAGCTA
T TTGGTGGGT
ACAGTTTCGC
GAAGGAGCAC
GATGAGAAGG
CCGTTTACTC
TTAGAGGTG
GGCTATCTI'
ATTCGTGATG
GACCAAGCAG
GCAGTTCGAG
CCAGGAAAAT
CCAAGTGCT
GCTAACTI'G
TTATTAAGTG
TTGACGACAA
rGAAACTCTA TGAAGGCAAT .ACCGI'GAC GGATATCACA rTGATTTTCC TAATGATAGT ~ATTATCGA AGGTTCGGCT CACGTGATGT GACAGGAAAA GCGAATACCA CAAAGATGAT AATCGG'TIT AGACGATGTG CCGAGTCA ET GGTGATTI'AC AATCTGTTAT TTTAGAAGAA CAGGACAACA
AAAAATTGTC
ATCGTAGTI' ACTGACTTCT CAATTACTGG AAATGGTTCA ACATTGATCA
CAGTGGCGGG
CAGCCATGGA TGAGACGAcT AAACACAAGT CCTACGTGAA TGCCACAAGG GCAATACATT ACGAATTAGC TAAAGGCCGA AACCAACCAT TATrAAAAAC GTAAAAAGTT AGTCAATGCT ATTGGGAAGA AGTTCCCCTI ATAGITTTAA-A ACCAGGGCTI TAGACACGAC CCCCAAACGP TTCATGTCAA AATGCTTAAI GCAATrCCMTr AGCAGGTGC]l AACAC~rAGT 'rrCGGATGC; ATCAATITGT GGAAACCAAM TCACGATTGC AGCAAGCGAJ TTAACTATCA AGGCACGGC GTGCGACATT TAAAGTGCT.
ATAATCAAGG G
ACAAAGCCAG
ATGGAAGAAC
CGTACGTAT'G
AGTI'ATGACA
GTTTCTATTC
CCAGATCAT G
GTTATITACTG
GGTACCAACG
CAACCAGAA
TGATTGAATT
ATACCGCATC
AACATGGTGG
TGTACTGGCA
ATACACCCTC
TAACAGAAGA
AAACCAAAAT
TAAGACGTCT
TTATACAAAC
CGAATCAGTC
TGTAATGATC
ACCAAACCAA
CGGAACTATT
GGAAAAGA~I? ACACACTGGA
AGTTACCACC
C. C C C
GTTAAAATGG
TCAGCGCCGG
GAAGTCGTTrC CATGCCACAG4
ATT'TAGCAG
GGTACAGTAG
ITGGTGGAGA
GTCATTACTA
GATGTCAATA
CGCTITAAAT
GCGCCCGGATC
TATCAGTTCA
TTCATCGTGA
TACCAAGGTT
GAA~T~rCAG
AACGGAAAAG
GCGCCAGCAG
CCGGCCAAAC
AAATTAATCA
GATGCGAAGG
CCCATATTGA
~GAGTACAGA
ATGGGGATGA
GGACTAAAGG
GCGCCCATT
ATOCCACCOG
CAAAAGCACC
TGATGAAGA
AAGTATTFT
TAGAGCATGC
GAACCA.ACGC
CAGAAATCGA
CACAAAATAC
CTGCTGAACT
TCCITGACAC
TCACAGTGAC
GGTACCT
CAGCAACAGT
AAAAAGATGT
GAGAAACGAT
CACGGTATCC
CAATGGCGAT
CAAAATI'CAG
:cAAATrTGG
GGTTATCACA
AGAAGGCTAT
AACTTCAGCC
AGAAAAAATrG
CGTAACCACG
GAATGGCCAG
AGCACCGACA
GAGCGGACAA
AArrAAAAAAA
C-ACAGGACAA
GGATITAGCC
AAACAcTGAA
TATAGCAACG
GAATGGACAC
TCAAACAGGC
EF124-4 (SEQ ID NO:464) 230 TABLE 1. Nucleofide and Amino Acid Seqeuences of E. faecalis Genes.
AF HITYTFFDV TELDANNPAL
DHYRNTAAID
NRLVDAFLTD
WRVDFPNDSR
KGGEYHKDDP
PDKSVILEEG
QVS ITGNGSE
QAKTQVLREG
GAQPTIIXND
EVDSLKPGLY
QAGNPLAGAE
SAFTIA-ASDR
TTNNQG
WTDEAGNNHH
P ILTN.QTYLA
TYVIEFKTSV
DHVYWHVMIN
KDYTLEVTTD
VVHGDDNGDV
TVDATGVITF
'NKVFLEKMD
QFTEIEAPTG
FSVLDTTI'GQA
GKPA'IVIATA
SEDSKPFKPL
GSLKVYEGNT
DEKVI EGSAS
GAQSVLDDVV
NETGQQKIVV
VVDIDHSCGH
GGLPQGQYIL
EKGKKLVNAR
YLLD'TTPKRF
PAFDLNAQKS
KPDGSVEKVK
YDNTASYTNQ
ITIDTPSPNQV
IQ4AHI EAPYY ATGTKGK IQL
VETKAPEGYT
FKLEHAVTI'P
IVTQNTSGQ I GVYNAVTKE I
PTQPLTDITM
GSSRDVTGKV
LDPESLVIYG
MEYRSLVTSS
KKTAMDETTI
VSDELAKGRV
FTHWEEVPLA
RDVHVKMLNY
GKYQFVETKA
LSGATFKVLD
TWTIAVNLSN
EEPSEKNQNT
S IQHGGESVK
TNVTEDGTIT
AAGSTDTVSN
LAGAHFQ IWD
ITIDEETSAE
PDRTNANGQL
QGSAEL IKKD pAGYLLNTEP
AKGETIQTGL
VREHLVSDAN tOKV-TVTDLAP NFVNYQGTAK LIKKDVNGHL EF125-1 (SEQ ID NO:465)
S.
TAAAATAAAA
ATGAAAGAAA
TTGTTACTTG
GATGGCCAGT
CTTCAAGGAA
TCAATCAAAG
CAGAACAAAC
ACGTTGAACC
ITGACGTTAA
GCATCAGCGA
TCCGAAGAAA
ACTGTCAAAC
ACAACGATTA
ACACCGCCGT
GATGTGCGAG
AAACCTAATA
TACACAATTA
AGTGACATTC
CCAGGAGATT
ATTGTCCCAG
CCTAGTGCGA
ACTGTGACGG
GTGATCAATC
ACCGTTATA
GAGTACCAAA
AAAAATCACG
GTTACCGCCA
CAAGAATTCA
GCAGTCATTA
TATTCAGTGA
AAAGA'ITACA
GTGACTIGGCG
GTTCCGTGA
GCCAGTCAAC
GG L'IGGACGT
ACGTACGAAC
AATTCGTACG
TGAGAAAGAA
TTCTAAATTA
TAACGTTAGG
AAGCACAACC
CTGCACAITG
AOATTCAAAT
CTACAGCTAC
AGACTGGTAC
ATGAAGGTTC
CTGTCGCCAG
CGCGCGTAGC
TCGATAAIT
ATAAAGAAGA
AACAAATGAA
AACCAGGTTC
GTGAAGATGG
ACGGGGACI'
GGGTGA'ITGA
ATACCGAACA
TTACTTGGAC
AAACATGGCC
TGATGGAAC
AAAATGGCAA
CGACGATTGA
CGACGTTAAC
CATATGGCAA
CTTGGGAAAT
CAGACACAAT
CATTTGATGA
AAGTGGTAAT
CAGTCAAGAT
ATAATCGTGT
AAAATATTAT
TAGCTGTGAA
CAG-TTCCTGG
AAGTGAACGT
TGCTCCAATG
TGGCACACCA
AGAAGTGAAG
AGTAACACAA
GGCAGCGCCC
TAATTGAAT
AGAAGATG
TGATCCGACA
AACAGAAGAA
CACGACAAAA
AGGACCAACA
T'GAAGATCCG
TGTGACCATTI
AGCAGGCr-AT AGGTGATrrA
TACGGTTCGT
TTCTTTAGAT
TATTCCTACA
ACAAATGAT
GGTAGATATC
AACAGGGAAT
AATTAAAGAA
TGTGACCATT
CGAGGCGGTT
AAGTGATAAT
AATCTTiAGAC
TAACTACAAC
GGGGGATAAT
CAAAGGAAAT
CAACGGAGAC
TGATTATAAA
GGATGTGC
*TAAAAACACI
*TCAAAATAA I
CTTAACTATG
TCTCTTCTAT
GTAA.ACCGTT
CTCATGGCTT
CAAACCAGCC
GAGGTTGTAG
AATAATACGC
CAGCAAGCGT
ACG'TTETCTT
GAATCAACGG
GCATCTACAA
GCGATAGAAA
GATATCAGC
ATTTAITAA
CATTGGAACT
TACTCGAGT
GTTGATGCAG
TI'ACCTTI'A
ACTCATTGA
CAAGAAGATI
AAACAAGGCC
AATCAAGCGA
ACCTTTAAGT
GTGGGTCGCG
AAAGGTGACA
A'rI'CCAGATG
AATCCAAATG
AAG3CGCAATA
TATGGTGAAC
TTAACGTTTG
GAAGTCGTITC
GGTTCCTTTC
ACCA;AAGTrC
ACTGGTCAGC
GGTGCACTrC TAT ITGAT~c
GTACCCAAT)
GTGTCGTTAG
GGCTCTACGG
TGGCGGAAGA
AGCAAGAAAT
TGCATTATAG
GCAAGAITCA
TAGCAGATAC
ATGOGACAACA
CAATCACGAG
ACTCCTCTGT
GTAAAACA.AC
ATTATTTTAC
ATCCTGATGG
TTAACTGGTC
TTCAATTACC
AAGGCP.ATGT
ATGAGCGAAT
ATG)WI'CAGA
TGCCCCTGT
AT1'TTATCG
TGA.AAGATCA
CCGTGAAAGT
AACTTAGTCC
CCAACAAAGC
GCGCCGGCGA
GGTTAGATIGC
TAGATTACGA
AAACCATTCC
AACCAGATC
GAGCAGAAC'I
CAATTGACTI
ATGGAATTG'I
ATTCAGAAGP
A 'TATCAAAJ
AAAATGCCG!
CGTTGGTG'
TAGAGGAAGG
GTTGATGTGT
GCGTTAACAGC
GACCTTAGCG
TGCCAATGTG
AGTGGATGAC
GTTAGTCTTA
GCAACGAGCG
TTCGCCACCC
TCCTCGTTCG
TGAATCGACG
AGGTCATGAA
AACACCAGCA
GATTCCAGAA
TGGCAATTTG
CTATGGAACC
CACGTCTGAA
TGGGCGGGGC
AGCrATTCCA
AACGCCCAAT
AACAAATCCA
CTATGAGTTA
AGATGAATAT
GTATCGTCTT
~TcTCCTTTT
TGAAGCAACT
CGAAGCCAAT
AA.AAGACCAA
TTTACATTTA
TGTGGAAGGA
TI'TACATGAT
CGAAGGCGAT
LTGATGGCACA
L TCAACGA ET
GATTACCOAT
CAAAGATACA
231 TABLE 1. Nucleotide and Amino Acid Secleuences of E. faecalis Genes.
ACCACTGGTG
GATGGTGAAA
=TCACATAA
TTGGACCATT
CA ITCAGAAG
AGCCGTGTTT
AkATAATCG T
GCTGGGAGCT
AAACCAACGC
ACTTGGCGTG
GTTGATGAAA
CAAGGTTCTT
AAAAAAGGTG
AATGGCGCCC
CTGCTAGATC
ACGCCAGATA
GATAATGAAA
TATATCGAAT
AACCAAGTGT
GTGGTCGTTG
CTCAAGAAAA
GACCAAGCTA
TT'1GGTGGGT
ACAGTTTCGG
GAAGGAGCAC
GATGAGAAGG
CCG'ITTACTC
'ITAGAGGTC
GGCTATCMT
ATCGTGATC
GACCAAGCAC
GCAGTTCGAC
CCAGGAAAA]
CCAACTCT
GCTAACTIT
TTATTAAGTC
TTGACGACAJ
TTTGTAGAAJ
ATTCCTGACj
TACAAAGGG(
G~rlrrCAA!
GATGGCAAA.
GCACCAAAA!
ATTCGTGGT
AAAGGACGCI
TTTAAATTG
AAAGATGGT
GATGCAACG
TCAAATGAT
GGACGTAAA
GCCGATGAG
AAAG I=C'
GGGCATTAC
CATCCA=F
CTCAGTTGAC
CAGGCTTTAA
CTTATACTAC
ATCGAAATAC
ATAGTAAACC
ACAATrGCCGT
TAGTCGACGC
TGAAAGTCTA
AACCGTTGAC
TTGATTTTCC
AAG'ITATCGA
CACGTCATGT
GCGAATACCA
AATCGGT=T
CCGAGTCATT
APATCTGTTAT
CAGGACAACA
ATCGTAGT1"I'
CAA'ITACTGG
ACATTGATCA
CAGCCATGGA
AAACACAAGCr
TGCCACAAGG
ACGAATTAGC
AACCAACCANI
GTAAAA.AGT'l
ATTGGGAAGI
ATAGTTI'AA;
TAGACACCAC
TI'CATGTCA)
GCAATCCAT'
AACACTTAG'
ATCAATI'TG'.
7TCACGATI'G( TrAACTATC,
GTGCGACAT
k. ATAATCAAG k. CCAAAGCC k AAA.ATCTGC 3 CTTTCCAAA r TATATGATC k~ ITATCTI'A r TACCAG;ATO SA=TCAAkAG G CCGTCTIA T ATCGAATCG T CATTGGCTP G ATGGCTATP A AACAACCAC G TCAALCGAGC C AGAATCAC G AAATAACPM 'C FITTAGTCC r. ATCTAACCC
GTTAGGCAAG
GGTAAGTITI
CTTTTTCGAT
CGCTGCCATT
GT~rAAACCT
CACCAAAGA.A
CTrrTTGACG
TGAAGGCAAT
GGATATCACA
TAATGATAGT
AGGTTCGGCT
GACAGGAAAA
CAAAGATGAT
AGACGATGTG
GGTGAT1TTAC
TTTAGAAGAA
AAAAATTGTC
AGTGACTTC7
AAATGGTTCA
*CAGTGGCGGG
TGAGACGACI
CCTACGTGA-Z
GCAATACAT2
TAAAGGCCG;
*TATTAAAAAC
AGTCAATGCI7
AGI'TCCCCT'
ACCAGGGCT'
CCCCAAACGj
SAATG.CTTAAN
C' AGCAGGTGCI C TrCGGATGCC r GGAAACCAA
:AGCAAGCGN
k AGGCACGGC' r TAA.AGTGCT 3 GGAAAITGT 1. AACAGGCTA s TAAACCAGC T CGTGAAAAC A CAATAAACA C AGACTTGGC G CGCAGXIb G TGATCCCGP A GAA.kAArGP ;A AA.ACGGGG-; ,T GGAGGATTI LT CGTCAATA; :T AGATGAGTl :A AGGTCAAAC :C CCAAGGTT( XC GGATAAGA( ;A AACGAAAG( 'C CCAAT]TAG(
GATTTCATGG
ATAGGGGCGT
OTTAC CGCGT
GATTGGACGG
TI'ACCTGC T ATCACT1'GGA
GATCCAATT
ACAAAGCCAG
ATGGAAGAAC
CGTACGTATG
AGTTATGACA
G'1TTC.TATI'C
CCAGATCATC
GTTATTACTG
GGTACCAACG
GGAAAAGAIT
GTTAAAATGG
TCAGCGGCCG
CAAGTCGIPC
CATGCCACAG
A'ITTAGCAG
LCr.TACAGTAG I TCTGAGA
GTCATTACTA
GATGTCAATA
P CGCTI'TAAA'I C' GCGCCGGATC r TATCAGTTCP TTCATCGT1GP r TACCAAGGT'] r GAATTTAc PL AACGGAAAAC PL GCGCCAGCAC T CGGGGCAAAC T AAATTAATC) T GATCGAAG( T GCAGAGCAC' TTATTAAAT4 G GTCG-TGGT'( G AATAGCGCA( A TCA'ITAGGG.
CG CCAGGTACC ,T AT'rATTTAT LG ArITTTCCAA LT CCCAATGCG- ~A AAAATCM1 .A GGTGcTGCT L.A CAACCCATT C'A GAGTTGTA IC TAGCGGG'I 'A CCGATAACP -T GGcGAAATI -A cCAACAGGC -A AAAGAGCAC
TAGAAATAAC
AT'GCCAAAAC
TAGACCCTAA
ATOGAAGCAGG
TITAATAAA
CGATTGCGGT
TAACCAATCA
A 1'CG-CGGT
CAAGCGAGAA
TCATTYCAA?1'
ATACCGCATC
AACATGGTGG
TGTACTGGCA
ATACACCCTC
TAACAGA6AGA
ACACACTGGA
CCCATATTIGA
GGAGTACAGA
ATIGGGATGA
GGACTAAAGG
GCGCCCATT
ATCCCACCGG
*CAAAAGCACC
*TTGATGAAGA
AAGTAT'HLIr'
*TAGAGCATCC
*GAACCAACGC
CAGAAATCG;
CACAAAATAC
*CTGCTG-AACI
TCCTTGACAC
TCACAGTGA(
*GGTACCTT
-CAGCAACAG'.
N6 AAAAAGATG' 1; CAGAAACGAN r TAGccccAG( k CCACGCCAG' SCTAGTGACk
SACCAACCAT
TIACAGCAA
TATTACA
C CGAATTAG T TAGGGGCCT A ACCCACIT G AAACAGAAG 'A GCTATGAA7 IT A'TPIGTAG, -A A~rATCAAC )G CAGTTFFI IT TCTTGAATC T ACCCTAAAC 'T ATCTGTTAC ;C CAAIrMCT TCGTA6ATGCA
AAGTGATGCC
CA6ATCCTGCG
A.AACAATCAT
TGCGCAAAAA
TAATTTA6AGT
AACCTATTTG
TGAAAAAGTC
AAACCAAAAT
TAAGACGTCT
TTATACAAAC
CGAATCAGTG
TGTAATGATC
ACCAAACCAA
CGGAACTATT
AGTTACCACC
AGCACCTTAT
CACGGTATCC
CAATGGCGAT
CAAAATTCAG
CCAAATTTGG
GGTTATCACA
AGA.AGGCTAT
AACTTCAGCC
AGAAAAAATG
CGTAACCACG
GAATGGCCAG
AGCACCGACA
GAGCGGACAA
AATTAAAAAA
CACAGGACAA
GGATTTAGCCC
rAAACACTGAA r* TATAGCAACG r' GAATGGACAC r TCAAACAGGC 3 AAAATATCGC r CCCATTTGAA k~ CTTCTGACT T AGCAGCTGCT C GAGTGGCAAA A AGAAATCAAA T AAAAGTAGAA T cGccAATITC C AGGAACGATT T AACTGCTCAA T AGATGAACTG T GAAGAAGAAT ;C AGAAGTAA-TG ;C AATTTAcAAT :G TGCAGCAGAA ;G GCTAAATGAA ;A CACAACGCTA LrT AGGCGATCTT 232 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
ATCAATTATC
GGTGCGGTGT
ATGTCTGACA
GTGGAGACAC
AT9TGCCAAAG
TACCAAGGTG
GAATTTAAAG
A.ACCAAGGGT
AAAGCACCGG
GCGAA.AGACA
TCCAAMAAA
CTTGGI=rAC T1'GGTCGGTT
AAGGAACTC
TTAAGGTCAT
AGCAAGGCAA
AAGCGCCAAC
ACAACCAAG
CTGCCAAGCT
TGACAGATGC
TAGTTCAAGT
ATGGTTACCA
AACCTGAACT
CAAAACCAAA
CGAAAACCAA
TGGCAAG LTG
TCAATTAACC
TGATGAAACA
AGTCNITGCG
TAGCTATCTT
CAAACCAGCC
GGTGAAAATT
AGAGACAGGG
GAACCACTTA
ACTGTCTAAG
CGTGAATGCG
TCACCCAACA
CACACAAGTC
GCTCTTCTAT
AAAGAAAACG
GGGCAAACCG
AAAAACTTAG
CTTAATGAAA
ACTGTGGTAC
GATCAGCAAA
CAAACTGTCG
CAACCAGGAA
CAAGCTGTCG
GGCACGTTTG
ACGAAkACAAG
KA-TTACTTCT
AAAAAGAGCA
AA.ACAGGTGA
TAGATGGACA
CACCGGGAAC
CGCCAAGCGC
TTAAAGCACC
AGALATGCCTI'
CTCGTTCATT
AATATACCTT
CATTCACTAT
TAACGAGAA
CAGCTAGAGA
TTGTCTTTAT
AGAAATAA
AGCATTGGCA
AACCAATCTG
GTATCGT
AAGCTTITACG
TTTTATTAAT
AGCAGGTGCT
ACGTTCTGAC
TGTGGAAACA
TGCGGCAACA
ACAACCTGTA
GACAGGTTG
C--GCCTCATG
EF125-2 (SEQ ID NO:466) MRKNGPMV NRWLYGLMCL LLVLNYGTPL MALAEEVNSD
S.
S.
S S
S
S.
S
S
S
S
IS S
S
55 5 S S 5*
GQLTLGEVKQ
KKQIQIELNQ
SANEGSTEEA
TI IDNFEDPI
PNKPGSGDLV
GDWVIDIPTQ
VTETWPTGNT
YQTTIDEAVI
EFTWEINYNY
DYKVVINGDG
SQQNI IKNTG
TGAQLTLGKD
DHYRNTAAID
NRLVDAFLTD
WRVDFPNDSR
KGGEYHKDDP
PDKSVILEEG
QVS ITGNGSE
QAXTQVLJREG
GAQPT IIKND
EVDSLKPGLY
QAGNPLAGAE
SAFTIAASDR
TI'NNQGEIVA
KGAFQIVKTN
PKLPDGADY I
KLYRIENGEE
NDKQPLDELE
VSEITTDXTC
NYQGTAQLTF
ETQAPTSYLI
FKVI'DAETGC
KDKPELVNAC
TSQQEMTLAL
QALADTLVLT
STNSSVPRSS
YLNPDGTPAT
DAEGNVYGTY
EDLPPWVIP I
FKSVKVY-ELV
PDGGGDVPFK
GEQTIPKDQA
SFAIDFLHDV
AVDYQNSTIG
FMVEITRNAD
WTDEAGNNHH
P ILTNQTYLA
TYVIEFKTSV
DHVYWHVMIE
KOYTLEVTTE
VVH-GDDNGDM
TVDATGVITI
VNKVFLEKMr
QFTEIEAPTC
FSVLOTTGQI
GKPATVIATI
EHLAPGKYRI
SADQPLAGAI
IYPELVKVE]
IFEREVTAE]
FVNYQAEVM(
EIYAKGLNE'
ENETGEALA,
NETPSASFT
TVARSLRSD
TFVNEQ P V
QGKAQPVTQE
LNPTATEDVT
EETVASTTKA
PPYKEDVTIH
TI SEDGTVRF
VPDTEQQIDK
MLDGTIKEV
NHATLTSDNN
VITDTMGDNL
TGAVKIDYKT
WTLAVNQNNY
GETGFKVSF I
SEDSKPFKPL
GSLKVYEGNT
DEKVIEGSAS
GAQSVLDDVV
NETG;QQKIVV
VVDIDHSGGH
GGLPQGQYIL
EKCI(XLVNAF
YLLDTTPKRI-
SVREHLVSDAD
SNFVNYQGTId
VETK.APTGYI
I FELYDHNKQ! E RGDFKGDPE:
DGSLAHEDL(
3 RKVNEQGQT] HYvLvETKA! 3 AVFKVIDET( I AKDNQGKPA! N QGLVQVNHL, S KKTKPNQPT'
VVVHYSANVS
FSYGQQQRAL
IESKTTEST'
WNFNWSIPED
TFNERITSES
QGHFDRTPNP
GRELSPDEYT
PNGLDAEATV
TFEPDSLHLY
KVDGIVEGDV
L1MENAVITUT
GAYAI(TSDAF
PAFDLNAQKS
KPIDGSVEKVK
YDNTASYTNQ
ITDTPSPNQV
KMAHIEAPYY
ATGTKGKIQL
VETKAPEGY7I FKLEHAVTrl
IVTQNTSGQI
IGKVTVTDLAI
*LII(XDVNGHI
*LNTTPVPFE:
5LGITATSGKI
[FQLGAFANFI
AGSYELDELI
.~AGAVFAIYNA
P TCYLLDTI'L] 3QTVDGQTNLU r VVLKAPFIN Q PGKYTFVET IT KQAARETCGW
IKAAHWAAPN
TLKTGTDPTE
VKPRVAGPTD
VREQMKlAGDY
DIHGDFSLDT
SAITWTVD IN VDKNGNVT 1K
TATYGKMLDK
SVTFDDKGNE
AVNNRVDVGT
YEPVPGLTMV
HITYTTFFDV
GVYNAVTKEI
PTQPLTDITM~
GSSRDVTGKV
*LDPESLVIYG
*MEYRSLVTSS
KKTAM~DETTI
*VSDELAKGRV
FTHWEEVPLA
RDVHVKMLNY
GKYQFVETKA
LSGATFKVu:
IAEKNAGKPA%
GKIIFRDLMI
GRAVFKKIDI
D ATDGYIVNK k DEQNQPQGS] 9 PFDVTAQLGI XI SDKQGKVIA] Y QGAAKLVKII K APDGYQLSK, L GLPKTNTQVJ
NTRKIQVDDQ
STAITSSPAA
ISDYFTGDET
FEFQLPGNLK
HLNDSDGRGP
QAMKlDQTNPT
GDTNI(AYRLE
RNIDYDEANQ
VVGAELVEGK
GQHSEDDGTA
PNSLVVKDTT
TELDANNPAL
TWTIAVNLSN
EEPSEI<NQNT
SIQHGGESVK
TNVTEDGTIT
AAGSTDTVSN
L.AGAIFQIWD
ITIDEETSAE
PDRTNANGQL
QGSAELIKKD
PAGYLLNTEP
AKGETIQTGL
VVASDNFVSY
GTYYYKEIKA
NANPLPGTIF
PIYFVVKKNS
?ITFLNRAGEK
(EQPIALGDLI
(NLAPGTYRFV
)QQKNALAGAE
SAVAFTIAATA
N YFFVFIGLL VGLASWLFYK KSKK EF125-3 (SEQ ID NO:467) .233 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
TAACTTTO TTAC'VATCA AGGCACGG-CT AAATTA-ATCA AAAAAGATGT
GAATGGACAC
TTATTAAGTG G PTGACGACAA A~ TTTGTAGAAA C
ATTGCTGAGA
TACAAAGGGG C
GTTTTTGAAT
GATGGCAAA*
CCACCAAAAT
ATTCCTGGTG2 AAAG3'ACGCG
TTAAATTGT
AAAGATGGTT
GATGCAACGG
TCAAATGATA
GGACGTAAAG
GCCGATGAC
AAAGTTTCTG
GGGCATTACG
CATCCATI'C
ATtAATTATC
GGTGCGGTGT
ATGTCTGACA
GTGGAGACAC
ATTGCCAAAG
TACCAAGGTG
GAATTTAAAG
AACCAAGGGT
AAAGCACCGG
GCCAAAGACA
TCCAAAAAAA
CTTGGT
,TGCGACATT
~TAATCAAGG
CAAAGCGCC
LAAATGCTGG
:TTTCCAAAT
.ATATGATCA
1'TATCTTTAG
PACCAGATGG
kTTTCAA.AGG
:CGTCTI'TAA
ATCGAATCGA
CATTGGCTAT
ATGGCTATAT
AACAACCACT
TCAACGAGCA
AGAATCAGCC
AA.ATAACAAC
T'ITTAGTGGA
ATGTAACCGC
A.AGGAACTGC
TTAAGGTCAI
AGCAAGGCAP
AAGCGCCAAC
ACAACCAAGC
CTGCCAAGCIJ
TGACAGATG(
TAGTTCAAG'
ATGGTTAMC
AACCTGAAC'
CAAAACCAAk TAAAGTGcTTI
GGAAATTGTT
AACIAGGCTAT
TAAACCAGCCG
CGTGAAAACG
CAATA.AACAA
AGACTTGGCG
CGCAGATAT
TGATCCGGAG
GAAAATTGAT
AAACGGAA
GGAGGATTTA
CGTCAATAA-A
AGATGAGTTA
AGGTCAAACC
CCAAGGTTCA
GGATAAGACTl
*AACGAAAGCA
*CCAATTAGGP
TCAATTAACC
TGATGAAACP
AGTCATTGCC
TAGCTATCTJ
CAAACCAGC(
GGTGAAAAT'
AGAGACAGG(
C GAACCACTMh k AcTGTCTAA( r CGTGAATrc( N. TCAGCCAAC
GATGC:GAAG
GCAGAGCACT
TTATI'AAATA
GTCGTGCTGI
AATAGCCCAG
TCATTAGGGA
CCAGGTACCT
ATTATTTATC
ATTTTCCAAT
GCCAATr.CGA
AAATCTTTG
GGTGCTOGTA
CAACCCATT
GAGTTTCTAA
TTAGCGGGTG
CCGATAACA'I
GGCGAAATI
CCAACAGGC I
AAAGAGCAGC
*AAAGAAAACC
GGGCAAACCC
AAAAACTTAC
CTTAATGAA.2
ACTGTGGTA(
C' GATCAGC"A
CAAACTGTC(
PL CAACCACCAX 3 CAAGCTGTC, 3 GGCACGTT A. ACGAAACA
GAGAAACCAT
TAGCCCCAGG
CCACGCCAGT
CTAGTGACAA
ACCAACCATT
ITACAGCAAC
ATTATTACAA
CTGAATTAGT
TAGGGGCCTT
ACCCACTTCC
AAAGAGA-AGT
GCTATGAATT
.kTTT'TGTAGT
ATTATCAAC
CAGTTTTTC
TCTTGAATCG
ACCCTAAACG
*ATCTGTTAGA
*CAATTGCTT-1
AAACAGGTCP
TAGATGGAC;
3CACCGGGAAC
"CGCCAAGCGC
-TTAAAGCACC
k. AGALATGCCT' 3 CTCGTTICAT' P, ALATATACCT 3 CATTCACTA! 3- TTAACGAGAK C; CAGCTAGAG
TCAAACAGGC
AAAATATCC
CCCATTTGPA
CTTGTGAGT
AGCAGGTGCT
GAGTGGCAAA
AGAAATCAAA
AAAAGTAGAA
CGCCAA=TC
AGGAACGATT
AACTGCTGA-A
AGATGAACTG
GA.AGAAGAAT
AGAAGTAATG
AATTTACA-AT
TGCAGGAGAA
GCTAAATGAA
CACAACGCTA
AGGCGATcTT AGCATTrOGCA
AACCAATCTG
GTATCGT=I
AAGCTTTACG
TTTTATTIAAT
C' AGCAGGTGCT r ACGTTCTGAC r' TCTGGAAACA r TGCGGCAACA A, AcAACCTGTA A GACAGGTTGG 9 EF125-4 (SEQ ID NO:468) 99*9 9999 9 .9 0 6e 9.
NFVNYQGTAK
TTNNQGEIVA
KGAFQ IVKTN PKLPDGADY I
KLYRIENGEK
NDKQPLDELE
VSEITTDKTG
NYQGTAQLTK
ETQAPTSYLL
FKVTDAETGQ
KDKPELVNAG
LII(KDVNGHL
EHLAPGKYRF
SADQPLAGAV
IYPELVKVEI
IFEREVTAEK
FVNYQAEVMG
EIYAKGLNEG
ENETGEALAG
NETPSASFTI
TVARSLRSDN
TFVNEKQPVS
LSGATFKVLD
VETKAPTGYL
FELYDHNKQS
RGDFXGDPEI
DGSLAMEDLG-
RKVNEQGQTL
HYVLVETKAP
AVFKVIDETG
AKDNQCKPAT
QGLVQVNHLQ
KKTKPNQPTT
AKGETIQTGL
LNTTPVPFE I
LGITATSGKD)
FQLGAFANFK
AGSYELDELD
AGAVFAIYNA
TGYLLDTTH
QTVDGQTNLM
VVLKAPFIN'Y
PGKYT'JE:TY
KQA.AFETGWL
AEKNAGKPAV
GKI IFRDLAP
GRAVFKKIDA
ATDGYIVNKQ
DEQNQPQGSP
PFDVTAQLGK
SDKQGKV lAK
QCAAI(LVKID
APDGYQLSKQ
VVASDNFVSY
GTyyyKEIKA
NANPLPGTIF
P IYFVVKKNS
ITFLNRAGEK
EQPIALGDLI
NLAPGTYRFV
QQKNALAGAE
AVAFT IAATA EF126-1 (SEQ.ID.NO:469)
TAGCGAAAGA
ATGGTGATTG
GAAGCGGTTA
AGTCTGGCTG
GAAACGGAAC
AAATAGGGAG
TCGCTGGAAC
AAGCAGGAGA
ATTGTAAACG
CAGTCGATT
GATTAAAATG TTTAAGAAAG AGTTGTGGGA AATFTCAGTC TACAGAAGGA ATGACCAATA GATATTGGAA GGACAAGCTA AGTAGTTGTT GAAGATGCTA
CAACGAAAT
CCAC-ATrGC
CGGTGAAAGT
CT'rTCCCAGT
GTGGTAGTTT'
ATTATCGACA
TTTrAGCTGAA
GAAAGACGAC
TCAAGCGGGT
'FrcAGATAAT 234 TABLE 1. Nucleofide and Amino Acid Seqeuenccs of E. faecalis Genes.
TTTCCACATG
GTGATGCTGG
ATTA.ATI'CAG
AGCCAATTTG
GGATTGAAAC
ACGTA=TCC
AAGACCAATA
TCAGTGGAAT
GAAkATTACTA
AGTTCAGTGA
CAGTTGCAAC
TTTACAACCC
GCT'rCATTAA
GGAAATGATG
TACGTAGGAA
ACCCTTGTAA
ACGATTCCTA
ACAAAAGATA
TGGCA L'CTCA
GATGACATTA
CATCT'ACAG
AAACAAGCAG
AAAATTAAAA
cccAAccALAG
ACCGTA-ACAC
TTAGATTTAP
GAAACAAC
ACTGATGTAJ
CAAGAAAATI
AGTGGTCAT)
GAATTACCA(
AACGAAGGTC
GAAGATCCA
GACCAAGAAW
CAAGCCAGC,
GANGAAAAT(
ACFTITACT,
ATGACTATI'.
GAACAAGGC'
CATTCCAAC.
GAACCTAAA
ACGAACCCA
ACAACAGTA
TI'TCGCATA
TAAGACAAGC
CTTCATATCG
CTGATTATGA
TCTCTGGTTT
TCCTAGA
TATTACTGAC
CCAATGA'TTC
ATAGTAATGA
ACCAAGGCTA
ATTCATACTI'
AAGGGTCTAG
AATTAAAACA
CGATTGCCAA
TGCCTGI'CA
ACATCACCAT
GTAGTGGGAC
AAAATGACAA
TCGAAAATCA
AAACAGCCTT
ATAAAGTGJCI
CTAACCGCAC
ACAGCTATGP.
CTGACGCAAC
CCGACTTAAP
CACCGCCAGI
CCAACCGCG;
CTI'GGACCCI
AAGTCACAGI
k. ACAAAGTAAC k CGTACACAW
'CTI'ATAFTTQ
ACGTCTTGC
k. CGATTACAA r TTAAATGGA.
k. TG.GTGGATG, 3 GCAAAGATG' P, TCAACAAAA A CCACTAAAA 3 GGATTCCCA A AGCCAACCG C AACCGCTAA G TCAATTTTG A ATCCACTT I A CAAAAAATP
GATTGATGAA
CGCGAAAA
TATGAATGTG
TGGAGACGTT
TACGTACAAT
AGATGGGGTC
AATCAATGAA
CTACCAAGGT
TGAAATGATT
TGATAAATAT
CACACCAGAA
AATTCTCAAA
TCAATTTGAT
AATTAACGGA
TCACTACGAA
AATGAATCAA
TGCGCATGCG
AGAACAC'ITA
TGGCAACGA-A
AGATATCATT
ACTAACACAA
CTATTTAAG'I
GGACGAAGAA
CTTTCGCAAI
TGATCCAAA1
TCAAGAATTI
AGCCAGCATC
TGAAAATG9
TTTTGAAATC
r GACCA~rACI
ACAACGTCG(
k. TTCCAACAA6 k. AGATATCGAX N. CGTCAAAAC, A. CA ITAATAA T TACAGATAA' A AGATGACAG, T TAAAACTGA A CCAAGCCG-A T AACACCGCC A ACCGAAAAA ,G AAAATCAGC 'A CATGATCGC .A AAAAAGAAA
GTGGTTCAAG
CAA-TTTATGT
CGCGTCAATA
CGGACGTATG
CAAACACACG
CCTAATACAC
TATCCAGATC
GCAGCAGCAG
AATCTA'T
AAAACAGAAG
GATTTTATTA
GATCGTCTCG
ATTCAATCTG
CAAACCATT
CTCAAAGAAA
GGAACAATTG
TGTGACGTGA
GATTTAACCA
ACCAGTACT'I
GATGTGAAAC
GAAAkATAAC,' GGTCATACG9
LTTAGCGCCTJ
GALAGGTGAC(
ALWGCTAAA(
*AA.ATGGAAC(
GTAGATGACJ
I AAAGATGTT,
AACAANCAA,
C' ACTAAAATC.
-ATTCCCAAC
'CCAACCGTA
k. GGCCAAGAA k. GCT'ITCGGT PL GTGTrAGAC T' GGCATAGTA C TACTCTTAC C GCAACGGAE C rTAAACTrI T GCACCAACC A CCGTI'GACC GCTTATCTGA T TCCTGATGG A CGCAATTGAC GTGGTACGCC A.
GAGATTAAC G.
GTrTAGA"CC F CAAGACATCC T A.AGTDTTAGC G GGGAAAGTG'1 T TGGGT!CCTIT T
CAAGCCAATCT
CGCAATCCAC A
CGACCGCTAC
CAGCAACTAG
ATACAGCGATI
CTAAGGAATI' T CGCCAGAAGA T ATCGTGAAGA I1 GGACCCAAGC C TCACCGACGA
AAGTAACTTT
ATACA.ATGAC
'ACATTGAACA
3TGTTACATTC 3ACrTAGAAG 3TCAAAACAGC k. 'TAATAA.AGT
CAGCTAACCG
3 CNGACAGCTA A. AAGCTAGCGC C AAGCCGACTT A CACCACCTC C ATTAGATTT A ACGAAACAAG A TCACAGACGT A CACAAGAAAA T TAGCTGGTCA 103 AAGAATTAGC
GCAACGAAGG
;C CAGAAGACCC ;C CTACAAATCA
AAGACCGC
P.AGACAAAA
rATGATAAA
PCCGCCCCA
PATCGA.AAA
TACTTGCAT
CTTCAAGTC
T'rAAACCAA
CAATC=IA
GTAAAACAA
ATTGATGAT
CCAGCAACA
GACGATGCT
'ACAGAAGGT
rCATGCAGCA
~CCAGAAGCG
CCAACGATT
'AGTTTCGAT
AGCATCG'I'
LkATGGTAAA
GAAATGAAC
E'ATCACCACT
%.GGCGGGATT
CAACAAACCA
P.CAAGAACAT
rTTcCGGTAAC
GTTAGACATC
CAAAGTAACA
TGACTATTTA
AACGGACGAA
GAACTTTGGC
ACCAACGCCA
A.ACCAACCGT
CACATGGACC
GAAAGTTNCT
TAACAAAGTA
TACATACACA
GCCcTATATT TGACGTG ETG
AAAAAAACCT
TCAACCACCA
AATGACTAAT
GGCTA'FTAGC
::66.
AGTAAAGGAA TTCATTTACC GGTTTAATTG TCCTTATAGT
AATTAG
EF126-2 (SEQ ID NO:470) MF KXATKLLSTM VIVAGTVVGN FSPThALAEE AvKAGDTEGm TNTVKVKDDS LADCKRILEG QATFPVQAGE TEPVDLVVVE DASGSFSDN-F PHVRQAIDEV VQGLSDQDRV MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSCFGDVR TYGGTPTAPG LKL.ALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHK TNTNDS INEY PDPRHPLQVS VEYSNDYQGA AAVrALNQE ITNQGYEMIN AYWESVESLS SVNSYFDKYK TEVGPFVKQE LQQCSSTPED FITSQSIDDF TTQLKQIVKD RLAQSTPATA SLTIANQFDI QSATATDDAG NDVPVQINGQ TISATSTEGY VGNITIHYE-V KENTAIDAAT LVS SGTMNQG TIAKEFPE-AT 235 TABLE 1. Nucleotide and Amino Acid Seqeuences of~ E. faecalis Genes.
I PKNDNAHAC
DINKVLDIID
IKTDATDEEL
DLTNRDQEFK
ENN1VTFE2
EGDVLHSNKP
ASMVDDINKV
TITTKIKTDA
PKQPLKPKKP
DVTPEDPTIT
VKVTDENGKD
APYIEQGGIP
WNVKTAFGNE
XQADSYDYLS
TVTPPAPTPE
LDITDVKVXX
TDEELAPYIE
LTPTNHQAPT
KDIENQEHLD
VTANGTVTQE
NQADLNFGNE-
TSTWTQASMV
GHTYTMTITT
OPTITKDIEG
ENGKDVTDNG
QGGIPNQADL
NPVNFGKSAS
LTNREDSFDW
NNKVTFEK
GDVLHSNKPT
DDINXVLDIT
KIKASATDEE
QEHLDLTNRD
IVTQENLZVT
NFGNEGDVLH
KGIHLPM'TNT
HVKTAPGNET
QADSYDYLSG
VTPPPVDPNI
DVKVTDENCK
LAPY IEQGG I
QEFKWNVKTA
FTMNKRDDSY
SNKPTVTPPA
TVNPLYMIAG
STWTQASMVD
HTYTMTITTK
AKDVEGQEHL
DVTANGKVTQ
PNQADLNFGN
FGNETSTWTQ
SYLAGHTYTM
PTPEDPKKPE
LIVLIVAISF
G ITKNKXRKN EF126-3 (SEQ ID NO:471) 0* S S
S.
S S *5
S.
S S
S.
S S
S
S.
S S *55 S
S
S
TGAA
GAAGCGGTTA
AGTCTGGCTG
GAAACGGAAC
TTTCCACATG
GTGATGCrG
ATTAATTCAG
AGCCAATTTG
GGATTGAAAC
ACGTATTTCC
AAGACCAATA
TCAGTGGAAT
GAAATrACTA
AGTI'CAGTGA
GAGTTGCAAC
TTTACAACCC
GCTTCA'rTAA
GGAAATGATG
TACGTAGGAA
AAGCAGGAGA
ATTGTAAACG
CACTCGA'ITT
TAAGACAAGC
CTTCATATCG
CTGATTATGA
TCTCTGG'I=
TCGC ITAGA
TATTAGTGAC
CCAATGATTC
ATAGTAATGA
ACCAAGGCTA
ATTCATACTT
AAGGGTCTAG
AATTAAAACA
CGATTGCCAA
TGCCTGTTCA
ACATCACGAT
TACAGAAGGA
GATATTGGAA
AGTAGTTGTT
GATTGATGAA
CGGCGGAAAA
TATGAATGTG
TGGAGACGTT
TACGTACAAT
AGATGGGGTC
AATCAATGAA
CTACCAAGGT
TGAAATGATT
TGATAAATAT
CACACCAGAA
AATTGTCAAA
TCAMTTTGA7
AATTAACGGA
TCACTACGAA
ATGACCAATA
GGACAAGCTA
GAAGAMCTA
GTGGTTCAAG
CAATTTATGT
CGCGTCAATA
CGGACGTATG
CA.AACACACG
GCTAATACAC
TATCCAGATC
GCAGCAGCAG
AATGCGTATT
AAAACAGAAG
GAT'MlATTA
GATCGTCTGG
ATTCAATCTG
CAAACCATIT
GTCAAAGAAA
CGGTGAAACT
C'TTCCCAGT
GTG3GTAGTTT
GCTTATCTCGA
TTCCTGATGG
CGCAATTGAC
GTGGTACGCC
GAGATTTAAC
GTTTAGATGG
CAAGACATCC
AAGI=AC
GGGAAAGTGT
TGGGTCCTTI'
CAAGCCAATC
CGCAAkTCGAC
CGACCCTAC
CAGCAACTAG
ATACAGCGAT
GAAAGACGAC
rCA-AGCGGGT
TTCAGATAAT
TCAAGACCGC
AAAGACA.AAA
TTATGATAAA
AACCGCCCCA
GAATCGAAAA
7rIACTTGCAT
TCTTCAACTC
GTTAAACCAA
TGAATCTTI'A
TGTAAAACAA
TA ETGATGAT
ACCAGCAACA
GGACGATGCT
TACAGAAGGT
TGAT
EF126-4 (SEQ ID NO:472) EE AVKACDTEGM TNTVKVKDDS LADCKRILEG QATFPVQAGE TEPVDLVVVE MLASYRGGKQ FMFPDGKTKI NSADYDMNVR LKLALDTYNQ THGDLTNRKT YFLLVTDGVA so*. VEYSNDYQGA AAEVLALNQE ITNQGYEMIN LQQGSSTPED FITSQSIDDF TTQLKQIVKD 8: NDVPVQINCQ TISATSTEGY VGNITIHYEV EF127-1 (SEQ ID NO:473)
DASCSFSDNF
VNTQL'IYDKS
NTRLDGYLHK
AYWESVESLS
RLAQSTPATA
KENTAID
PHVRQAIDEV
QFVSGFGDVR
TNTDS INEY
SVNSYFDKYK
SLTIANQFDI
VQGLSDQDRV
TYGGTPTAPG
PDPRHPLQVS
TEVGPFVKQE
QSATATDDAG
TAGCGAAAGA
ATGGTGATTG
GAAGCGGTA
AGTCTGCCTG
GAAACGGAAC
TTTCCACATG
GTGATGCTGG
ATTAATTCAG
AAATAGGGAG
TCGCTGGAAC
AAGCAGGAGA
ATTCTAAACG
CAGTCGAI'T
TAAGA'.AAGC
CTTCATATCG
CTGATTAGA
GATTAAA.ATG
AGTTGTGGGA
TACAGAAGGA
GATATTGGAA
AGTAGTI'CTT
GATTGATGAA
CGCGAAAA
TATGAATGTG
TTTAAGAAAG
AATITCAGTC
ATIGACC1AATA
GGACAAGCTA
GAAGATGCTA
GTGG1rCA-AG CAATITATr.T
CAACGAAATT
CC-ACATTGGC
CGGTGAAAGT
CTT'ccCAGT
GTGGTAGTT
GCTTATCTGA
'ITCCTGATGG
ATTATCGACA
'ITTAGcTG.A
GAAAGACGAC
TCA.AGCGGGT
TTCAGATAAT
TCAAGACCGC
AAAGACAAAA
TTATGATAAA
CrGCCTCAATA CGCAATTGAC 236 TABLE I1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
AGCCAAITTTG
GGATTGAAAC
ACGTATTI'CC
AAGACCAATA
TCAGTGGAAT
GAAATI'ACTA
AGTTCAGTGA
GAGTTGCAAC
GGAAATGATG
TACGTAGGAA
ACCCTTGTAA
ACGATTCCTA
ACAAAAGATA
TGGCATGTCA
GATGACATTA
GATGTI'ACAG
AAACAAGCAG
AAAATTAAAA
CCCAACCAAG
ACCGTAACAC
TAGATTTAA
GAAACAAGCA
ACTGATGTAA
CAAGAAAATA
AGTGGTCATA
GAATTAGCAC
AACGAAGCTG
GAAGATCCAA
GACCAAGAAT
CAACCCAGCA
GAN'GAAA.ATG
ACTT=ACTP.
ATGACTTT,
GA.ACAAGCCC
CATTCCAAC;
GAACCTAPLAC
ACGAACCCAC
ACAACAGTA2 T'FrGGCATAJ
TCTCTGGTTT
TCGCTTTAGA
TATTAGTGAC
CCAATCATTC
ATAGTAATGA
ACCAAGGCTA
ATTCATACTT
AAGGGTCTAG
AATTAAAACA
CGATTGCCAA
TGCCTGTTCA
ACATCACGAT
GTAGTGGGAC
AAAATACAA
TCGAAAATCA
AAACAGCCTIT
ATAAAGTGCT
CTAACGGCAC
ACAGCTATGA
CTCACGCAAC
CCGACTTAA.A
CACCGCCAGT
CCAACCGCGA
CTTGGACCCA
AAGTCACAGA
ACAAAGTAAC
CGTACACAAI
CTIATATTGP
ACGTG.TTGCP
CGATTACAAJ
TTAAATGGA;
TGGTGGATGZ
GCAA-AGATG=
TGAACAAAAJ
CCACTAAAA'
GGATTCCMA
AGCCAACCG'
AACCGCTAAL
TCAATTITG
~ATCCACTTT.
~CAAAAAATA
TGGAGACGTT
TACGTACAAT
AGATGGGGTC
AATCAATGAA
CTACCAAGGT
TGAAATGATTI
TGATAA.ATAT
CACACCAGAA
AATTGTCAAA
TCAArlTTGAT
AATTAACGGA
TCACTACGAA
AATGALATCAA
TGCGCATGCG
AGAACALCTTA
TGGCAACGAA
AGATATCATT
AGTAACACA-A
CTATTTAAGT
GGACGAAGAA
CTTTGGCAAT
TGATCCAAAT
TCAAGA.ATT
*AGCCAGCATC
TGAAAATGG'I
TTTTGA.AATC
*GACCATTAC'I
ACAAGGTGGC
LTTCCAACAA;
AGATATCGAJ
CGTCAAAACI
CATTAATAA3
STACAGATAA'
SAGATGACAC
[TAAAACTGA(
~CCAAGCCGA(
r' AACACCGCC' k~ ACCGAAAAAL
SAAAATCAGC.
~CATGATCGC.
A~ AAAAAGAAA
CGGACGTATG
CAAACAkCACG
GCTAATACAC
TATCCAGATC
GCAGCAGCAG
AATGCGTAT
AAAACAGAAG
GAT=TATTA
GATCGTCTGG
ATTCAATCT'G
CA.A.ACCATTT
GTCAAAGAAA
GGAACAA=
TG~TCACGTGA
GAT=;AACCA
ACCAGTACTT
CATGTGAAAG
GAAAATA.ACA
GGTCATACGI
11'AGCGCCTI
GAAGGTGACC
ATTGCTAAAC
AAATGGAACC
GTAGATGAC;
AAAGATGTT;
AACAANCAAC
ACTAAAATCI
ATrCCCAACC
CCAACCGTAJ
kGGCCAAGAA( k GCTTTCGGT k GTGTTAGACA r' CGCATAGTA,
:TACTCTTAC
:GCAACGGAT
ZTTAAACTTT
r GCACCAACG P, CCGTTGACG A. AGTAAAGGA A GG IMAA77 A AA TAG GTIGTACGCC A.
GAGATTTAAC G.
GTI'TAGATGG T CAAGACATCC T AAGTTTTAGC G GGGAAAGTGT T TCGGTCCTTT T CAAGCCAATC T CGCAATCGAC A CGACCGCTAC G CAGCAACTAG Ti
ATACAGCGAT
CTAAGGAATT 'I CGCCA6AAGA 9 ATCGTGA.AGA 91 GGACCCAAGC C TCACCGACGA
AAGTAACTTT
ATACAATGAC
ACArDTGAACA
TGTTACATTC
ACGTAGAAG
TCAAAACAGC
TTAATAALAGT
CAGCTAACGG
CNGACAGCTA
kAAGCTAGCGC
AACCCACTT
k CACCACCTGC
"ATTTAGATTT
ACGAAACAAG
k TCACAGACGT k. CACAAGAAA.A r TAGCTGGTCA S AAGAATTAGC G GCAACGA.AGG C CAGAAGACCC C CTACAAATCA A TTCATTTACC G TCCTTATAGT
ACCGCCCCA
AATCGAAAA
TACTTGCAT
CTTCAAGTC
TA.A.ACCAA
GAATCT'TA
GTAAAACAA
'ATTGATGAT
CCAGCAACA
GACGATGCT
ACAGALAGGT
C-ATGCAGCA
CCAGAAGCG
~CCAACGATT
'AGTTTCGAT
:AGCATGGTG
kAATGGTAA.A
LGAAATGAAC
E'ATCACCACT
kGGCGGGATT
CAACAAACCA
A.CAAGAACAT
TTTCGGTAAC
CTTAGACATC
CAAAGTAACA
TGACTATTTA
AACGGACGAA
GAACTTTGGC
ACCAACGCCA
AACCAACCGT
CACATGGACC
GAAAGTTNCT
TAACAAAGTA
TACATACACA
GCCTTATATT
TGACGTGTTG
AAA.AAAACCT
TCAAGCACCA
AATGACTAAT
GGCTATTAGC
0O *00* 0 @0 0 0 0 .0 E7127-2 (SEQ ID NO:474) 14F KKATKLLSTM VIVAGTVVGN FSPTLAIAEE, AVKAGDTEGM TNTVKVKDDS LADCKRILEG QATFPVQACE TEPVDLVVVE DASGSFSDNF PHVRQAIDEV VQ( MLASYRGGKQ FMFPDGKTKI NSADYDMNVR VNTQLTYDKS QFVSGFGDVR TYC LKLALDTYNQ THGDLTNRKT YFLLVTDGVA NTRLDGYLHI( TNTNDSINEY PD' VEYSNDYQGA AAEVL.ALNQE ITNQCYEMIN AYWESVESLS SVNSYFDKYK TZ LQQGSSTPED FITSQSIDDF 7rQLKQMVD RLAQSTPATA SLTIANQFDI QS.
NDVPVQINGQ TISATSTEGY VGNITIHYEV KEN'rAIDAAT LVSSGTMNQG TI IPKNDNAHAC DVTPEDPTIT KDIENQEHLD LTNREDSFDW HVKTAFGNET ST DINICVLDIID VKVTDENGKD VTANCTVTQE NN TF24N1( QADSYDYLSG HT IKTDATDEEL APYXEOGGIP NQADL.NFGNE GDVLHSNKPT VTPPPVDPNI AK
GLSDQDRV
GGTPTAPG
PRHPLQVS
GPFVKQE
ATATDDAG
AKEFPEAT
WTQASMVD
YTMTITTK
DVEGQEHL
TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
*DLTWRDQEFK WNVKTAFGNE ENNKVTFEMrN XQADSYDYLS ECDVLHSNKP TVTPPAPTPE ASMVDD INKV LDITDVKVXX TIFI'KIKTDA TDEELAPYIE PKQPLKPKKP LTP1'NHQAPT G ITKNKKRXN
TSTWTQASMV
GHTYTMTITT
DPTITKDIEG
E-NGKDVTDNG
QGG?--NQACL
NPVNTGKSAS
DDINKVLDIT
K IKASATDEE
QEH-LDLTNRD
IVTQENNKVT
NFGNEGDVLI-i
KGIHLPMT-NT
DVKVTDENGK
LAPY IEQGG I
QEFK!RNVKTA
FTMKKDOSY
SNKPTVTPPA
TVNP LYM lAG
DVTANGKVTQ
PNQADLNFGN
FGNETSTWTQ
SYLAGHTYTM
PTPEDPKKPE
LIVLIVAISF
EF127-3 (SEQ ID GAATCAA GGAACAATTG CTAAGAATT TCCAGAAGCG
ACGATTCCTA
ACAAAAGATA
TGGCATGTCA
GATGACATTA
GATGTTACAG
AAACAAGCAG
AAAATTAAAA
CCCAACCAAG
ACCGTAACAC
TTAGATTTAA
AAAATGACA
TCGAAAATCA
AAACAGCCTT
ATAAAGTGCT
CTAACGGCAC
ACAGCTATGA
C~rACGCAAC
CCGACTTAAA
CACCGCCAGT
CCAACCGCGA
TGCGCATGCG
AGAACACTTA
TGGCAACGAA
AGATATCATT
AGTA.ACACAA
CTATTTA-AGT
GGACGAAGAA
CTTTGGCAAT
TGATCCAAAT
TCAAGAATI'T
AGCCAGCATG
TGTGACGTGA
GATTTAACCA
ACCAGTACT
GAT'TGAAAG
GAAAATAACA
GGTCATACGT
TTAGCGCCTT
GAAGGTGACG
ATTGCTAALAG
AAATGGAACG
GTAGATGACA
CGCCAGAAGA
ATCGTbA.AGA
GGACCCAAGC
TCACCGACGA
AkAGTA.ACTTT
ATACAATGAC
ACATTGA-ACA
TGTTACATTC
ACGTAGAAGG
TCAAAACAC
TTAAT
TCCAACGATT
TAGTTTCGAT
CAGCATGGTG
AAATGGTAAA
TGAAATGAAC
TATCACCACT
AGGCGGGATT
C AAC AAACC A
ACAAGAACAT
TTTCGGTAAC
0 00 GAAkACACA CTTGGACCCA EF127-4 (SEQ ID NO:476) NQG TIAXEFPEAT IPKNDNAHAC DVTPEDPTIT DINKVLD 110 VKVTDENGKD IKTDATDEEL APYIEQGGI P DLTNRDQEFK WNVK<TAFGNE.
KEDIENQEHLD
VTAINGTVTQE
NQADLNFGNE
TSTW.TQASMV
LTNREDSFD)W HVKTAFGNET NNXVTFEMNK QADSYDYLSG GDVLHSNKPT VTPPPVDPNI
DOIN
STWTQASMVD
HTYTMTITTK
AKDVEGQEHL
EF128-1 (SEQ ID NO:477) a a a
TAGCGAAAGA
ATGGTGATTG
GAAGCGGTTA
AGTCTGGCTG
GAAACGGAAC
TFI'CCACATG
GTGATGCTGG
ATTAATTCAG
AGCCAATTTG
GGATTGAAAC
ACGTATTTCC
AAGACCAATA
TCAGTGGAAT
GAA.ATTACTA
AGTTCAGTGA
GAGTTGCAAC
TT"TACAACCC
GCT'TCATTAA
GGAAATrGATG
AAATAGGGAG
TCGCTGGAAC
AAGCAGGAGA
ATTGTAAACG
CAGTC GATT
TAAGACAAGC
CTTCATATCG
CTGATTATGA
TCTCTGGTTT
TCGCTTTAGA
TATTAOTGAC
CCAATGATTC
ATAGTAATGA
ACCAAGGCTA
ATTCATAC~TT
AAGGGTCTAG
AATTAAAACA
CGATTGCCAA
TGCCTrGITCA
GATTAAAATG
AGTTGTGGGA
rACAGAAGGA
GATATTGGA
AGTAGTTGTT
GATTATIGAA
CGGCGCAAAA
TATGAkTGTG
TGGAGACGTT
TACGTACAAT.
AGATGGGGTC
AATCAATGAA
CTACCAAGGT
TGAAATGATT
TGATAAATAT
CACACCAGAA
A.ATTGTCAAA
TCAATTTGAT
AATTAACGGA
TTTAAGAAAG
AATI'TCAGTC
ATGACCAATA
GGACAAGCTA
GAAGATGCTA
GTGGTITCAAG
CAATTITATiGT
CGCGTCAATA
CGGACGTATG
CAAACACACG
GCTAATACAC
TATCCAGATC
GCAGCACCAG
AATGCGI'AT
AAAACAGAAG
GATTATTA
GATCGTCTGG
A1'TCAATCTG
CAAACCAT
CAACGAAATT ATTATCGACA
CCACATTGGC
cGGTGAAACT
CTTI'CCCAGT
GTGGTAGTTT
GC LTATCTGA
TTCCTGATGG
CGcAATTGAC
GTGGTACGCC
GAGATTTAAC
G'ITAGATG
CAAGACATCC
A.AGT'TTAGC
GGGAAAGTGT
TGGGTCC=I
CAAGCCAATC
CGCAATCGAC
CGACCGCTAC
CAGCAACTAG
TTTAGCTGAA
GAAAGACCAC
TCAAGCGGGT
TTC AGATAAT
TCAAGACCGC
AAAGACAAA-A
'ITATGATAAA
AACCGCCCCA
GAATCGAAAA
TTACTTGCAT
TcTTCAAGTC
GTTAAACCAA
EYGAATCTTTA
T'TAAAACAA
TAT~cATGAT AccACCAACA
GGACGATGCT
TAcAGAAGGT 238 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faeculis Genes.
TACGTAGGAA
ACATCACGAT
ACCCTTGTAA
GTAGTGGGAC
ACGATTCCTA AAAATGACA-A ACAAAAGATA TCGAAAATCA TGGCATGTCA
AAACAGCCTT
GATGACATTA
ATAAAGTGCT
GATGTTACAG CTAACGGCAC AAACAAGCAG ACAGCTATGA AAAATTAAAA CTGACGCAAC CCCAACCAAG CCGACTTAAA ACCGTAACAC CACCGCCAGT TTAGATFTAA CCAACCGCGA GAAACAAGCA CTTGGACCCA ACTGATGTAA AAGTCACAGA CAAGAAAATA ACAAAGTAAC AGTGGr-TCATA CGTACACALAI GAATTAGCAC CTTATATTGP AACGAArGGTG ACGTGTTCZ.
GAAGATCCAA
CGATTACAAJ
GACCAAGAAT
TTAAATGGMI
CAAGCCAGCA TGGTGGATGI *GANGAAAATG GCAAAGATGr ACTTTACTA TGAACAAAA *ATGACTATTA
CCACTAAAA'
*GAACAAGGC GGATTCCCAk **CATTCCAACA
AGCCAACCG
*:GAACCTAAAC
AACCGCTAA
ACGAACCCAC TCAATTTTG *ACAACAGTAA
ATCCACTT
TTTGG-CATAA CAAAAAATA
TCACTACGAA
AATGAATCAA
TGCGCATGCG
AGAACACTTA
TGGCAACGAA
AGATATCATT
AGTAACACAA
CTATAAGT
GGACGAAGALA
CTTTGGCAAT
TGATCCAA.AT
TCAAGAATTT
AGCCAGCATG
TGAAAATGGT
TTTTGAAATG
GACCATTACT
ACAAGGTGGC
TTCCAACAAA
AGATATCGA-P
SCGTCA-AAACP
~CATTAATAM;
r~ TACAGATAk) 5, AGATG-ACAGC r TAAAACTGAC N. CCAAGCCGA( T AACACCGCC' A ACCGAAAAAk G AAAATCAGC.
A CATGATCGC A AAAAAGA.AA
GTCAAAGAAA
GGAACAATTG
TGTGACGTGA
GA'TTTAACCA
ACCAGTACTT
GATGTGAAAG
GAAAATAACA
GGTCATACGT
TTAGCGCCTT
GALAGGTGACG
ATTGCTAAAG
AAATG;GA.ACG
GTAGATGACA
APLAGATGTTA
AACAANCAAG
ACTAAAATC.A
ATCCCAACC
CCAACCGTAP
GGCCAAGAAC
GCTTTCGG2;
GTGTTAGACI
7GGCATAGTA2
TACTCTTAC'.
GCAACGGAD
TTAAACTTrD r GCACCAACGi kL CCGTTGACG PL AGTAAAGGA A GG TrAATT A A.ATTAG ATACAGCGAT TGATGCAGCA CTAAGGAATT T CGCCAGAAGA T
ATCGTGAAGAI
GGACCCAAGC C
TCACCCACGA
AAGTAAC=T I~ ATACA.ATGAC I ACATTGAACA
TGTTACAITCC
ACGTAGAAGG
TCAkAACAGC
TTAATAAAGT
CAGCTAACGG
CNGACAGCTA
AAGCTAGCGC
*AAGCCGACTT
CACCACCTGC
*ATTTAGATTT
k. ACGAAACAAG k TCACAGACGT k. CACAAGAAAA r~ TAGCTGGTCA 3AA:GAATTAGC G; CAACGAAGG
CCAGAAGACCC
C. CTACAAATCA A TTCATTTACC C; TCCTTATAGT
'CCAACGATT
'AGTTTCGAT
AGCATGGTG
LAATGGTAAA
~GAAATGAAC
~ATCACCACT
kGGCGGGATT
:AACAAACCA
kCAAGAACAT rTTCGGTAAC
GTTAGACATC
CA.AAGTAACA
TGACTATTA
AACGGACGAA
GAACTTT GGC
ACCAACGCCA
AACCAACCCT
CACATGGACC
GAAAG TNCT
TAACAAAGTA
TACATACACA
GCCTTATATT
TGACGTGTTG
AAAAAAACCT
TCAAGCACCA
AATGACTAAT
GGCTATTAGC
EF1.28-2 (SEQ ID NO:478) mrVhqI'LT.TM VIVACTVVCN FSPTLALAEE AVKAGDTEGM TNTVKVKD
DASGSFSD&F
LJADCKRILEG
MLASYRGGKQ
LKLALDTYNQ
VEYSNDYQGA
LQQGSSTPED
ND VP VQINGQ
IPKNDNA.HAC
DINKVLDIID
IKTDATDEEL
D)LTNRDQEFK
ENNKVTFEMN
EGDVLHStNKP
ASMVDDINKV
TIT'1(IKTDA
PKQPLKPKKP
QATFPVQAGE
FMFPDGKTKI
THGDLTNRKT
AAEVLALNQE
FITSQSIDDF
TIsATSTEGY DVTPv-DPTIT
VKVTDEN'GKD
APYIEQGGIP
WNVKTAFGNE
XQADSYDYIJS
TVTPPAPTPE
LDITDVK(VXX
TDEELAPYI E
LTPTNI{QAPI
TEPVDLVV'JE
NSADYDmNVR
YFLLVTDGVA
ITNQGYEMIN
TrQL.KQIVKD
VGNITIHYEV
KDIENQEHLD
VTANGTVTQE
NQADLNFGNE
TSTWTQASMV
GHTYTMTITT
DPTITKDIEG
ENGY(DVTDNG
QGGIPNQADL
NPVNFGKSAS
VNTQLTYDKS
NTRLDGYLHK
AYWESVE-SLS
P-LAQSTPATA
KENTAIDAAT
LTNREDSFDW
NNKVTFENK
GDVT-HSNKPT
DDINKVLDIT
KIKASATDEE
QEHLDLTNRD
IVTQENNKVT
NFGNEGDVLH
KGIHLPMTNT
QFVSGFGDVR
TNTNDSINEY
SVNSYFDKYK
SLTIANQFDI
LVSSGTMNQG
HVrKTAFGNET
QADSYDYLSG
VTPPPVDPNI
DVKVTDENGK
LAPYIEQGGI
QEFKWNVKTA
pFTMNI(DDSY
SNKPTVTPPA
'IVNPLYMIAG
JQGLSDQDRV
rYGGTPTAPG
PDPRHPLQVS
TEVGPFVKQE
QSATATDDAG
T IAXEFP EAT
STWTQASMVD
HTyTMTITTK
AYCDVEGQEHL
DVTANGKVTQ
PNQADLNFGN
FGNETSrtWTQ SYLAGHTYTm
PTPEDPKXPE
LIVLIVAISF
GITKNKXRKN
EF128-3 (SEQ ID NO:479) 239 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
AGA TGAA.AATGGT AAAGATGTTA CAGCTAACGG
CAAAGT~
CAAGAAAATA
AGTGGTCATA
GAATTAGCAC
AACGAAGGTG
GAAGATCCAA
GACCAAGAAT
CAAGCCACCA
CANGAAAATG
ACTTTTACTA
ATGACTATTA
GAACAAGGCG
CATTCCAACA
GAACCTA.AAC
ACGAACCCAG
ACAAACTAAC
CGTACACAAT
C'ITATATTGA
ACGTrTTGCA
CGATTACAAA
TTAAATGGAA
TGGTGGATGA
GCAA.AGATGT
TGAACAAAAA
CCACTAAAAT
GGATTCCCAA
AG CCAACCGT
AACCGCTAAA
TCAATTTTGG
T~IlGAAATG
GACCATTACT
ACAAGGTGGC
TTCCAACAAA
AGATATCGAA
CGTCAAAACA
CA'ITAATAAA
TACAGATAAT
AGATGACAGC
TAAAACTGAC
CCAACCCGAC
AACACCGCCT
ACCGAAAAAA
AAAATCAGCA
ACTAAAATCA
Arl'CCCAACC
CCAACCGTAA
GGCCAAGAAC
GCTTTcGGTA
GTGTTAGACA
GCCATAGTAA
TACTCTTACT
GCAACGGATG
;i'I.AACTTTG
GCACCAACGC
CCGTI'CACGC
LACA'
CNGACAGCTA
AAGCTAGCGC
AAGCCGACTT
CACCACCTGC
ATTAGATTT
ACGAAACAAG
TCACAGACGT
CACAAGAAAA
TAGCT.GGTCA
AAGAATTAGC
GCAACGAAGG
CAGAAGACCC
CTACAAATCA
TGACTA=TA
AACGGACGA.A
GAACTTTGGC
AC CAACGC CA
AACCAACCGT
CACATGGACC
GAAAGTTNCT
TAACAAAGTA
TACATACACA
GCCTTATATT
TGACGTGTTG
AAAAAAACCT
TCAAGCACCA
AGTAAAGGA-A TTCAT EF128-4 (SEQ ID NO:480) DENGI( DVTANGKVTQ ENNKVTFEMN
XQADSYIDYLS
EGDVLHSNKP
TVTPPAPTPE
ASNVDDINKV LDITOVKVXX TITTKIKTDA
TDEELAPYIE
PKQPLKP~kP
LTPTNHQAPT
GHTYTMT ITT
DPTITKDIEG
ENGKDVTDNG
QGGIPNQA.DL
NPVNFGKSAS
KIKASATDEE LAPYIEQGGI QEHLDLTNRD QEFKWiNVKTA IVTQENNKVT FTMThKKDDSY NFGNEGD)VLH sNKPTVTPPA
KGIH
PNQADLNFGN
FGNETSTWTQ
syLAG1XTYTM
PTPEDPKKPE
EF129-1 (SEQ ID NQ:481)
TGACAAGTGA
ATTGCTACCC
CAAGAAATI'
GAAGCAGAAG
CTAAAAGCAA
AAACGTAACG
ACAATGCTAG
GCTGTTTCAA
CAAGCCGITG
GAAGCTGAAT
ATGAAAGCT?
AAACAAA.AAG
GCTGAAAAAG
CCAGTTGCCT
GAATCAAGCA
GAAAATACTG
GGAAATAATG
GCGCC ITCTG
CGTCCAGTAG
GAAGCAGGTG
TGGGCGCCAG
TCAGGAAGCG
TACTCAGGT.P
AGAAACGTCT
CAAGCATCGC
CATCATTAAA
TATC ITCAGT
AATCAGAACA
AAGCAATCAA
ATGCAGTTTT
CAATCGTAAG
TTGATAAAAA
TAGA-AACAA.A
CATTAGCATI
CAGCTGCTGA
CCAA-ACAAGC
CTTCATCAAC
CGCA.ACAAAC
GCTC LTCTTC
GTGGCCAAAC
CTGATCCAAC
TATGGGATGC
GCATTCCAAJ
GTAACTCAG9
GTCACCGTG.'
GCACAATCG'.
NITI'CATCA
'FITGGCGGAC
AGCAAAACAA
ATTTGATGAA
ATTACAACAA
AAATCAAGCA
AGATGCGGAC
TGCCAACAAC
AGCTGAAAAC
ACGTCAAGAT
AGAACAATCA
AGCAGAGCAA
TGCTGCAAA
AACAGAAGCP
AACTGAAACP
ATCAGAACAJ
TGGTGGTGG;
AATCAATGCI
AGGTTTGGCJ
TGATCACTG(
AATCATGGC(
TTGGGAAAT'
r AGGACACTCA
GTATTACTAT
AATGTTGATA
GGGGATTTAG;
AGCATGGC NT
GAAA'ITACAA
CGTGCATG=T
TCAG=TCAG
GACT'AATGC
GAGAAAAAAG
TTACTTTCTA
TCAGCTG3AAA
CACGCTTAG
*CCAGCTAAAG
CAAGCACCAG
ACTACACCAA
CCAGTACAAC
ACAGTTACAC
STTGAACGTC
k. GCTTCTGCAA
TCTCGTGGAC
3TGGTACA.ATC t' AACCCAGGT; k GCCTAA GTTCATTAAC C
AAAAAATTGA
cTrCACAAGT
TACGTGAACA
AC'ITGAATCA
AAGTTAATGG
ATGCAATCAG
AACAACAAAA
TGAAACAACT
AACAATCTGA
GTTCTAA-AGC
CTGCTGAACA
CTGAAGTGAA
CAAGCTCAAG
GTACAGATAA
CTACAACACC
CAACACCAGA
TACGTCAATC
CTGCTCGTGC
ATGAAGTTAT
AAACAAACAT
L TACGCGTGT
CTATCAGCA
k.GAAAAAAAT kTCTTCTTTA k.AAGCAAACA
A.CGTATTGAA
A~CAAAGCACA
CCGTGTTCAA
AGAAGACAAA
TGAAGCAACA
ATTAAACGTA
TGGCTTAGAA
AAAAGCTGCA
AGCAGAAGCA
CTCAGCAACT
TAGTGCAACA
AAG;CGATAAT
ACCAACACCA
ATTAGGTA
AGCACAAGTT
CGCAATTATG
CGTAACAGCT
CGG'ITTTGGT
EF129-2 (SEQ ID NO:482) 1 240 TABLE 1. Nucleotidc and Amino Acid Seqeuences of E.faecalis Genes.
VYK'KRLFASV LLCSLTLSA I ATPS IALACN VDKKIEECNQ LIS SLKAKQG DLASQVSSLE
AEVSSVFDES
MLDAVLDADS
AELETKRQDL
EKAKQAAAKP
NTGSSSSEQP
PVVWDAGLAA
GSGHROWEIN
MALREQKQTL
VADAISRVQA
LSKQSELNJM
AKAEVKAEAP
VQPTTPSDNG
SATARAAQVE
PG ITRVGFGY
KAKSEQLQQE
VSTIVSANND
KASLALEQSS
VASSSTTEAQ
NNGGQTGGGT
AGGIPNDHWS
SGSTIVGHSA
ITNL.NQRIEK
LMQQQKEDKQ
AESSKAGLEK
APASSSSATIE
VTPTPEPTPA
RGDEVIAIMW
RNEAIKNQAR
AVVDXKAENE
QKAAAEAEQA
SSTQQTTETT
PSADPTINAL
APGNSVIMAW
DVQVNGQSTT
KKVKQLE-ATE
RLAAEQKAAA
TPSTONSATE
NVLRQSLGLR
YNETNMVTAS
EF129-3 (SEQ ID NO:483) GGAC AATGTTGATA AAAAAATTGA ACAAAAAAT--
C
C
C.
CAAGAAAT
GAAGCAGAAG
CT-AAAkAGCAA
AAACGTAACG
ACAATGCTAG
GCTGTTTCAA
CAAGCCGTTG
GAAGCCAAT
ATGAAAGCTI'
AAACAAAAAG
GCTGAAAAAG
CCAGTTGCCT
GAATCAAGCA
GAAAATACTG
cGAAATAATG
GCGCCTTCTIG
CGTCCAGTAG
GA.AGCAGGTG
TGGGCGCCAG
TCAGGAAGCG
TACTCAGGTA
CATCATTAAA
TATCTTCAGT
AATCAGAACA
AAGCAATCAA
ATGCAGTTI'T
CAATCGTAAG
'FTGATAAAAA
TAGAAACAAA
CATTAGCATT
CAGCTGCTGA
CCAAACAAGC
CTTCATCAAC
CGCAACAAAC
GCTCTTCTTC
GTGGCCAAAC
CTGATCCAAC
TATGGGATGC
GCATTCCAAA
GTAACTCAGT
GTCACCGTGA
GCACAATCG7
AGCAAAACAA
AT?1'GATGAA
ATTACAACAA
AJAATCAAGCA
AGATGCGGAC
TGCCAACAAC
AGCTGAAAAC
ACGTcAAGAT
AGAACAATCA
AGCAGAGCAA
TGCTGCAAAA
AACAGAAGCA
AACTGAAACA
ATCAGAACAA
TGGTGGTGGA
AATCAATr-CA
AGGTTTGGCA
TGATCACTGG
AATCATGCCG
TTGGGAAATlI
AGGACACTC;
GGGGATTIrAG
AGCATGGCTT
GAAATT.ACAA
CGTGATGTTC
TCAGTTGCAG
GAC "A.ATGCC
GAGAAAAAAG
TTACTTTCTA
TCAGCCAAA
GCACGCTTAG
CCAGCTAAAG
CAAGCACCAG
ACTACACCAA
CCAGTACAAC
ACAGTTACAC
'TCGAACGTIC
*GCTrCTGCAA
TC'ICGTGGAG
TGGTACAATG
AACCCAGGTA
GCC
:TTCACAAGT
1'ACGTGAACA
ACTTGAATCA
AAGTTAATGG
ATGCAATCAG
AACAACAAAA
TGAAACAACT
AACAATCTGA
GTTCTAAAGC
CTGCTGAACA
CTGAAGTGAA
CAAGCTCAAG
GTACAGATAA
CTACAACACC
CAACACCAGA
TACGTCAATC
CTGCTCGTGC
ATGAAGTTAT
AAACAAACAT
TTACGCGTGT
ATCTTCTTTA
AAkAGCAAACA
ACGTATTGAA
ACAAAGCACA
CCGTGTTCAA
ACAAGACAA
TGAAGCAACA
ATTAAACGTA
TGCCTTAGAA
AAAAGCTGCA
AGCAGAAGCA
CTCAGCAACT
TAGTGCAACA
AAGCGATAAT
ACCAACACCA
ATI'AGGTTTA
AGcACAAGTT
CGCAAITATG
GGTAACAGCT
CGGTTTTGGT
EF129-4 (SEQ ID NO:484) ON VDKKIEEKNQ EISSLKAXQC- OLASQVSSLE
AEVSSVFDES
MLDAVLDADS
AELETKRQDL
EFKAKQAAAKP
NTGSSSSEQP
PV'JWDAGLAA
C SGH RDWE IN
MALREQKQTL
VADAISRVQA
LSKQSELNVM
A.KAEVKAEAP
VQPTTPSD4G
SATARAAQVE
PG ITRVGFGY
KAJCSEQLQQE
VSTIVSANND
KASLALEQSS
VASSSTTEAQ
NNGGQTGGGT
AGGIPNDHWS
SGSTIVGHSA
ITNLNQRIEK
LMQQQKEDKQ
AESSKAGLEK
APASSSSATE
VTPTPEPTPA
RCDEV IA 1MW
RNEAIXENQAR
AVVDKKAENE
QKAAAEAEQA
SSTQQTTE-TT
PSADPTINAL
APGNSVIMAW
DVQVNGQSTT
KKVKQLEATE
RLAAEQKAAA
TPSTDNSATE
NVLRQSLGLR
YNETNMVTIAS
EF130-1 (SEQ ID NO:485)
TGATACAITA
TTAATCAACA
CAGATACGTA
CCAGCTGATC
TCCTTGAGCA
GGTGCCATTG
GTCGCT'rTAT
AAAGGAGGGA
GT=ATTAGT
G'ITGGGTGGT
TGAAGAAAAA
CAGAAGCGGT
CGATACCAAG
TAACTGGTGC
AAATATGCGC
TTACTATTT
TCAACAAAAT
TATGGCTCGT
GATGAAAGCC
TGTCGAAATT
CGGGACCATG
CCAAA.AGAGA
ATCATTGGCT
AGCCCCTCGT
GAAACA.ACGT
CAATTTGAAA
AATTGCCCA
AAAGAAGATc
AAAAAAGACG
TAGCCTTAAT
ACGCCGTI'AG
TTGACTTTGA
ACAAAAACT
TTTTTAAACC
AAGTCATGGG
AAAAAATTGG
TTTTAACAAT
CAAGTTGAAA
IT~cAGTTGAG
ACCTGTGATT
ATTGTCcAAT
GAAAAACAAT
TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faeculis Genes.
TATGCCTTGG
ACCAAAAAAG
ACTTCTGTAG
AATATGATTA
GGAACATTIAG
CAATTGGAGC
CTAGTCATCG
ACGAACTCAT
AAAAAATCGA
CCTTAA ETAC
CAGCAACGAC
AAAAAACTTT
AACGAAGAT
TTATATCACT
ACCAACCCGT
CTGTGGCGAT
GCCTATTAAA
AGCCGATTGG
GGCGr'TTCCT
GATI'TATCTA
GTT1GAGTTAA
TTACAAGCAA
GACGCCAACG
GCGCTTAA
TATTTTCACC
CTGTTTATAC
TI'GATGACGT
CGACGCGA-AT
ACGATATGI'
TTTAGAAAGA
ATACAAAATA
TCCTCGTCAA
TGCTGTTCAA
GAAGGCTTTC
EF130-2 (SEQ 10 NO:486)
YIKRRENMRP
ADVKKNMARE
ALLTGAGTb4K
SVEKIEPTRV
LEQKTLADWV
KEKKRGKNWL
TTFDFDSVES
EDQVMGI<NNY
ELIDDVPGQN
INSLLVLLF I IGLAL IFNNQ LSTEAVMYAQ _yENKNLPVIG ALASHRTEDG VSLFSPLERT MITLITCGDL QATTRIAVQG I RSWVVQQNS
AIAIPSVEIN
KKDEL IY lTD TLAA PTP IKD
RSYAVSKLKP
LPIFKGLSNV
LSTVYTYKIT
ANDDMLKAFQ
EF130-3 (SEQ ID NO:487) CGITAG CAAGTTGAAA CCAGCTG-ATG TGAAGAAAAA TCCTTGAGCA CAGAAGCGGT GGTGCCATTG CGATACCAAG GTCGCTTTAT TAACTCGTGC TATGCCTTGG CTAGTCATCG ACCAAAAAAG ACGAACTCAT ACTTCTCTAG AAAAAATCGA A.ATATGATTA CCTTAATTAC GGAACATTAG CAGCAACGAC CAATTGGAGC AAAAAACTTT
TATGGCTCGT
GATGAAACC
TGTCGAAATT
CGGGACCATG
AACGGA-AGAT
TATATCACT
ACCAACCCGT
CTCTGCGAT
GCCTATTAAA
AGCCGATTGG
GAAACAACCT
CAATTTGAAA
AATDGCCCA
AAAGAAGATC
GGCGTTTCCT
GAITTATCTA
GTTGAGTTAA
ErACAAGCAA
GACGCCAACG
GTGGCT
TTGACTI'TGA
ACAAAAACTT
TTTTTAAAGG
AAGTCATGGG
TATTTTCACC
CTGTTTATAC
TGATGACGT
CGACGCGAAT
ACGATATGT'
TTCAGTTGAG
ACCTGTGATT1
ATTGTCCAAT
GAA.AAACAAT
TTTAGAAAGA
ATACAAAATA
TCCTGGTCAA
TGCTG'IICAA
GAAGGCTI'TC
EF130-4 (SEQ ID NO:488)
VSKLKP
ADVKKNMARE TTFDFD)SVES LSTEAVMKAQ FENKNLP'JIG AIAIPSVEIN LPIFKGLSNV ALLTGAGTMK EDQVMGKNNY ALASHRTEDG VSLFSPLERT KKDELIYITD LSTVYTYXIT SVEKIEPTRV ELIDDVPGQN MITLITCGDL QATTRIAVQG TLAATTPIKD ANDDMLKAFQ LEQKTLADTJTV A EF131-1 (SEQ ID NO:489)
TAGGCGGAGG
TGGCTTrTrTA
TTCTITTCA
CGCCGAAGTG
C ITGATCAAA
CAGGTTTTAG
CCAAACACAA
AAAGCAATTG
ATI'CAAGGTG
TTATCAGCAA
AATACCATTC
GCAGGAATGT
CACAAAACGG
AATTCAACA-A
TAACGCGTAT
TAGTATGTTTr
CTAGAGATTC
ATAATTATGC
AAATTCAAGA
TAAATAAAGG
GGTTCAGAT
AAGAAGGTAA
CTGAGGATAT
TGCCTAATAA
AAGTCAATAA
TAGAGAAAAT
CTGGTTAAA
GTTATAAATG
GCGTA-AACGA
GTTrGTGGTG
ACAAGTTAGT
GAATTTAACrG
AACAAATTAT
ATATGGCTTT
TGGCTCAATT
ACTTACATTA
TACGATTAGC
TATCGT1'ACC
AGGAAAATAC
GTATCAACGT
GAATTTTGGC
GACAGAAGAT
CATGCAAAGA
ATI'GGTGGTA
CAAGAATCGA
AAAGAAATAG
ATTGGTI'CGG
GCCAATTTTG
CAAAAATCTT
GATACAAAAC
GATATGTTGA
GATGAAGAAA
AATTATTCC C
ACCTATCAAG
TTCTATGAAA
AATCATATA
AAAGACATGG
GTGGTIATT
AAGTGGTC IT
TTGCACCAGA
CTTTGATCAT
AAAAGCAACA
TTACCACA-AC
TCGCTACGTI'
ATATGACAAG
TTATTCAATT
CACTAAATTT
AATTATTTAA
CCTTATTGGA
ACCAAGTGCT
AGGAGTGAAT
AATAAAAACG
GGAAGAAGAT
TAGTGGCGAA
TAAAGATGAT
AGCCAACACG
CTTGATCTTA
TTATCCGCAA
TGGTT1TAAAG
TGTTAAACAA
TGTCCTTTITA
TAATcTTTAT ACAGcccAAT
CTCAATTCCT
242 TABLE 1. Nucleotide and Amino Acid Seqeuences of E.faecalis Genes.
GCAGCTAGTT
TATTGGTACT
TTATGGACGT
TTACGT1T1AC
AAGACAGGGG
GGTTCGTTGT
TTGCCCATGA
TACATCAATT
CTTCTCAGCA
ACGGCGTTGA
TCATAT'TGCT
TCCATGATGT
ATTGGACT GGTAATGTGG AACGAGTC'GA CAFFTAGTTT AAGCTCTTAT CATGGCGGCA AGCGCGTCAA CAAGCCCTGG AACTAACTGT GTGAATCCAG AACCAATTTA ACTr.TTA.AAT
ATATGACGAC
CCACCGCACT
TCTATGTTCA
T TATT=IC
CGAAATACAA
TTIAA
AGGTGA=IG
TTTGCAAAAA
TGATAATTAT
AAAAGATATG
AGAATTAATT
EF131-2 (SEQ ID NO:490) MRKRH AK.KRHGGVNW LFIVCLLVVI GGSGYLIKTF FFTRDSQVSQ ESKWVLEEDR RSDNYANLTK EIVAPDSGEL DQKIQETNYI GSALIIKDDQ VLVNKGYGFA NFEKQQANTP NTRFQIGSIQ KSFTTTLILK AIEEGKLTLD TKLATFYPQI QGAEDIT'-SD MLNMTSGLKL SAMPNNIVTD EEIIQFVKQN TIQVNKGKYN YSPVNFVLLA GMLEIQ4YQRT YQELFNNLYH KTAGLKNFGF YETLLEQPNN STSYKWTEDN SYNQVLS IPA ASFAHEFGTG NVDMTTGDLY WYLHQLTSGH LVSTALLQKL WTSSQQSSYH GGIYVHDNYL RLHGVEAGQQ ALVLFSKDMK TGVILLTNCV NPAKYKELIG SLFHDVTNLT VKF EF131-3 (SEQ ID NO:491) a a TTT AATAAAAACG TTCTTTTTCA CTAGAGATTC ACAAGFI'AGT CGCCGAAGTG ATAATTATGC GAATTAACG CTTGATCAALA AAA~rCAAGA AACAAATTAT CAGGTTTTAG TAAATAAAGG ATATGGCT CCAAACACAA GGTCAGAT TGGCTCAATT AAAGCAATTG AAGAAGGTAA ACTTACATTA ATCA.AGGTG CTGAGGATAT TACGATTAGC TTATCAGCAA TGCCTAATAA TATCGTIACC AATACCATTC AAGTCAATAA AGGAAAATAC GCAGGAATGT TAGAGAAAAT GTATCAACGT CACAAAACGG CTGGTTTAAA GAAT I"TGGC AATTCAACAA GTTATAAATG GACAGAAGAT GCAGCTAGTT T'TGCCCATGA ATTTGGGACT TATTOCTACT TACATCAATT A.ACGAGTGGA TTATGGACGT CTTCTCACCA AAGCTCTTAT TTACGTTTAC ACGGCGTTGA AGCGGGTCAA AAGACAGGGG TCATATTGCT AACTAACTGT GGTTCGTTIGT TCCATGATGT AACCAATTTA
CAAGAATCGA
1AAGAAATAG
A.TTGGTTCGG
GCCAATTTG
CAAAAATCT
GATACAAAAC
GATATGTTGA
GATGAAGA.A
AATTATTCCC
ACCTATCAAG
'ITCTATGAAA
AA~rCATATA
GGTAATGTGG
CATTAGTI'
CATGGCGGCA
CAAGCCCTGG
GTGAATCCAG
ACTG~rAA-AT
AAGTGGTCTI'
ITCACCAGA
CTrTGATCAT
AAAAGCAACA
TTACCACAAC
TCGCTACGTT
ATATGACAAG
TTA'ITCAAT1'
CAGTAAATI'
AATI'ATTI'AA
CCT1'ATTGGA
ACCAAGTGCT
ATATGACGAC
CCACCGCACT
TCTATGTTCA
ITITATTTTIC
CGAAATACAA
TT
GGAAGAAGAT
TAGTCGCG;A
TAAAGATGAT
AGCCAACACG
CTTGATCTTA
TTATCCGCAA
TGGTTTAA.AG
TGTTAAACAA
TGTCCTTTTA
TAATCTF1'AT
ACAGCCCAAT
CTCAATTCCT
AGGTGATTG
T ITGCAAAAA
TGATAATI'AT
AAAAGATATG
AGAATTA.ATT
EF131-4 (SEQ ID NO:492) LIKTF FFTRDSQVSQ ESKVVtLEEDR RSDNYANLTK EIVAPDSGEL DQKIQETNYI NTRFQIGSIQ I(SFT'ITLILK AIEEGKLTLD SAMPNNIVTD EEIIQFVKQN TIQVNKGKYN KTAGLKNFGF YETLLEQPNN STSY1(WTEDN WYLHQL.TSGH LVSTALLQKL WTSSQQSSYH TGVILLTNCV NPAXYKELIG SLFHDVTNLT GSALI IKDDQ
TKIATFYPQI
YSPVNFVLLA
SYNQVLSIPA
GGIYVH]nNYL
VKF
VLVNKGYGFA
QGAEDITISD
GMLEKMYQRT
ASFAHEFGTG
RLI4GVEAGQQ
NFEKQQANTP
MLNMTSGLKL
YQELFNNLYH
NVDMTTGDLY
ALVLFSKDHK
EF132-1 (SEQ ID NO:493) TAGTTTCTAATCTCACcAAAAAAAATTTTA
ACAAAGAAGGAGAGATCGTATATAGAATGGATAT
rGTGGGAAGTCTGGGAATG TGATTGCTCTTrATATCGGGGCATGTTCAACApLATAGTAA GACAAGTC~r 243 TABLE 1. Nucleotide and Amino Acid Seqeuences of E. faecalis Genes.
GCTTCGAACGAAAAATTAAAGGTAGTAGTACTAATTCGATTI'AGCAGATATTACTGAAA-ATATAGCAAAAGATAAA
ATTGATTTACACAGTATCGTACCTATTGGGAAAGATC CCCACGAATATGAACC tTTGCCTGAAGATGTTCAAAAAACT TCAACACAGATTGATTTTTTATAACGGTGTAACTTGGAmACTGGAGGAAATGCTTGmACAAAATTAGTAAAA
MATGCGAACAAAGAGGAAAACAAGACTATTTTGCAGCAAGTGATGGCATAGATATTACTAGAGGGTCAGACT
GAGAAGGGAAGGAAGATCCCCATGCTTGG ITAAATTTAGAAMACGGTATTATTTACGCTAAAAATATTGAAAAATIG TTAGCGGAAAAAGATCCTATAATAAAAAACTATAAAGAAAATCTAGATAAGTATATTG
TGGATCTCTA
GACAAAGAAGCTAAATCTAAATTCTCAATTCCGAATGATAAA AAAATGATTGTTACAGTGAAGGATCtAAA
TATTTCTCGAAAGCGTATAATGTGCCTTCTGCTTACATTTGGGAAACAACACTGAAGAAGAAGGAACACCAGATCAA
ATAAATATAAATCCCAAACTCTCTTCTGAGATTGCAAAC
ATGAAAACAGTATCAAAAGATACCAATATCCTATCTATTCAAC ATTTACTGATTC GCAGA-AGACAA
GATGGTGATAGTTACTATGCGATGATGAAATGGAACCTGGATAAATTGCTGAAGGCC'I=CGAAATAA
EF132-2 (SEQ ID NO:494) MMRKWXVVVGSLGMLIALF I FCACSTNSKDKDTVASNEKLKVVVTNS I LAD ITENI AXDK I DLHS IVP IGKIDP HEYEP LPEDVQKTSKADL~I FYNGVNLXTGCNAWFTKLVKXANX<EENKDYFAAS DGIDVIYLEGQSEKGKEDPHAWLNLENG I I YAKNI EKWLAEKDP DNKKFYKENLDKYI EKLDSLDKEAKSKFAS I PNDKKMIV'rSEGCFKYFSKAYNVPSAY IWE INT EEEGTPDQ IKH.LVEKLRTTKVPSLFVESSVDDRPMKTVSKDTN t P IYST IFTDS IAEKGQDGDSYYAMMr'KWNLDKIAE
GLSK.
EF132-3 (SEQ ID NO:495) ATTCAATGAAGCAG*...GCTGACAAArA
GATATAT.ATGT
*AGCAGATATTACTGAPATATAGCAAAGATAAAATTGATTACACAGTATCGTACCTATGGAAGATCCCCACGA
*ATATGAACCtT CGAAGTAkAACMAACGTTATTTAACCGTATGAA
TGGAGGAAATGCTTGG=ITACAAATTAGTAAAAMATGCGAACAAAGAGGAAACAAAGACTAITTTGCAGCAAOTGA
TGCTGTTA ATAAGTCGGGGAGGAG.GTCCAGTGTAATAAA *CGGTATTATTTACGCTAAAAATATTGAAAAATTAGCGGAAAATCCTGATAATA
TTCTATGAA
AGAATGA.GTGATTTGCAAAGTATTAATGTC.ATCATAA
AAAATGATTGTTACAAGTGAAGGATGCTTtAAATA=TCTCGAAGCGTATAATGTGCCTTCGCTACATTGGGA AAtCAkACACTGAAGAAAAGGAACACCAGATCAAATAACACTTAGTTGAAAA
ACGCACCAGTCCCTC
.9CTATTCGTAGAAAGTAGTGTGGACGATAGACCATGAAACATATCAAAAGATACCATATCCTATCTATCAAC
AA'FTGCTGAAGGCCTTTCGAAA
EF132-4 (SEQ ID NO:496) .9C CSTNSKDKDTVASNEKLKVVVTNS ILADI TEN IAKDK I DLHS IVP IGKDPHEYEPLPEDVQKTSKADL I FYNGVNLXT CGNAWFTKLVKXANKEENTKDYFAASDG IDVIYLEGQSEKGKEDPHAWLNLENGI IYAKNIEKWLAEKDPDNXKFYKEN 9..LDKY I EIKLDSLDKEAKSKFAS I PNDKKMIVTSEGCFKYFSYAYNVP SAY IWEINTEEEGTPDQ IKHLVEKLRTTKVP S LFVESSVDDRPMKTVSKDTNI PIYSTIFTDS IAEKGQDGDSYYAM4MKWNLDKIAEGLSK :0 *0 .0.
Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
Query GenBank GenBank Gene Description BLAST BLAST Access. No. Score P-Value EF002-2 giJ2338759 (AF0 18073) periplasmic sorbi tol -binding protein; SmoE 113 3.6013-18 Rhodobacter EF003-2 giJ1552773 hypothetical (Escherichia coli] >gn]IPIld 10l12634 hypothetical 278 1.2013-53 29.4 EF003-2 giJ219 6 99 6 lipoprotein homolog [Treponema pallidum] >giJ2 108234 29K 309 3.30E-44 _______protein EF003-2 gi1146649 lipoprotein-28 precursor [Escherichia coli] >gi]2905 10 263 9.2013-40 EFOO3-2 giJ148838 28 3kDa membrane protein [Haemophilus influenzae] 197 2.10OE-39 EF003-2 gi11573 6 1 4 28 kDa membrane protein (hlpA) [Haemophilus influenzae] 197 7.80E-39 EF003-2 giJ23 14748 (A-E000654) outer membrane protein [Helicobacter pyloril 263 4.60E'-37 EF003-2 giJ349530 lipoprotein [Pasteurella haemnolytica] >gill 50508 lipoprotein 189 4.I1OE-29 EF003-2 gnlID~e 1l8435 similar to hypothetical proteins [Bacillus subtilis] 158 2.70E-26 EF003-2 giJ349532 lipoprotein [Pasteurella haemolytica] >pirIJNO753IjN0753 outer 200 1 .2013-25 EF003-2 gi11336657 lipoprotein [Bacillus subtilisl_ 182 2.70E-25 EF003-2 grilIPlDIe233873 hypothetical protein (Bacillus subtilis] >gnIjPlDjel 182900 186 1.3013-23 EF003-2 gi1294071 lipoprotein 3 [Pasteurella haemolytica] 199 76.60E-23 EF0O3-2. gi134953 1 lipoprotein [Pasteurella haernolytical >pirIJNO7521JN0752 outer 198 1.30E-20 EF003-2 gi129 4070 lipoprotein 2 [Pasteurella haemnolytica] 198 1.80E-20 EF005-2 giJ537235 Kenn Rudd identifies as gprmB [Escherichia colij >gill1790856 127 6.20E-12 EF006-2 gij 1552773 hypothetical [Escherichia coli] >gntIPlDjd 10 12634 hypothetical 255 1.40E-60 _29.4 S S 5.5 0 4 0 0 5 5 *50 0 S* 0 S S SO *5 S S Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF006-2 gi1349532 lipoprotein [Pasteurella haemolytica] >pirlJN07531JN0753 outer 221 6.40E-49 EF006-2 gi123 14748 (AE000654) outer membrane protein [1lelicobacter pylori] 283 2.70E-48 EF006-2 g112 196996 lipoprotein hornolog [Treponema pallidum] >giI2 108234 29K 267 4.40E-47 protein EF006-2 gnljPIDI 18435 similar to hypothetical proteins [Bacillus subtilis] 359 1.80E-44 EF006-2 giI3 4 9 53 1 lipoprotein [Pasteurella haemolytica] >pirlYN07521JN0752 outer 218 3.80E-41 EF006-2 gi1294071 lipoprotein 3 [Pasteurella haemolytica] 220 2.30E-38 EF006-2 gil 1466 4 9 lipoprotein-28 precursor [Escherichia coli] >giJ 2 905 10 193 2.60E-38 EF006-2 giJ294070 lipoprotein 2 [Pasteurella, haemolytica] 218 1 .20E-36 EF006-2 gil 14 883 8 28 kA membrane protein [Haemophilus influenzael 112 8.50E-34 EF006-2 gi1 1573 6 1 4 28 kDa membrane protein (hipA) [Haemophilus influenzae] 112 I.50E-33 EF006-2 gil3495 30 lipoprotein [Pasteurella haemolytica] >gill150508 lipoprotein 114 4.30E,-29 EF006-2 gi1294069 lipoprotein 1 [Pasteurella haemolytica] 114 1.30E-27 EFO6-2 gi13 6 7 lipoprotein [Bacillus subtilis] 202 2.10E-26 EF006-2 gnIPIDje233873 hypothetical protein [Bacillus subtilis] >gnhIPIDjeI 182900 200 6.50E-25 EF008-2 giJ49301 7 endocarditis specific antigen [Enterococcus faccalis] 1590 12.70E-21 I EF008-2 giJ 3 9 32 69 adhesion protein [Streptococcus pneurnoniae] 986 1.80E-129 EF008-2 gi1153 8 3 4 adhesin specific for salivary pellicle of dental surfaces 973 l.0013-127 EF008-2 giJI575030 surface adhesin A precursor [Streptococcus pneumoniae] 934 2.90E-126 EF008-2 gil 153826 adhesin B [Streptococcus sanguis] >pir1A435831A43583 adhiesin 916 3.90E- 126
BI
EF008-2 gi1 11849 32 IScbA [Streptococcus crista] 915 3.40E- 125 EF008-21 giL 117994 Isurface antigen A variant precursor [Streptococcus pneurnoniae] 1917 5.0-124 0 000 *00 @0 000 0 000 0 0 0 00 0 *00 000 00* 0 0 0 000*0e 0 0 0 *00000.
00*0000 00 0 0 0 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF008-2 giI3 10633 adhesin [Streptococcus gordonij 891 6.OOE- 122 EF008-2 gnljP1Dje255529 lipoprotein [Staphylococcus epidermnidis] 476 1.20E-99 EF008-2 gil 1573330 adhesin B precursor (fim.A) [Haemophilus influenzae] 380 1 .60E-68 EF008-2, 1i1245464 __YfeA [Yersinia pestis] >gi112 45 464 YfeA [Yersinia pestis] 355 1.20E-64 EF008-2 giJ755075 periplasmic-binding protein [Synechocystis sp.] 321 1 .70E-62 >gnlPIDId10 18652 EF008-2 gil 133 5 91 2 EwIA [Erysipelotlirix rhusiopathiae] 232 4.40E-42 EF008-2 gnlPID el 18595 similar to ABC transporter (membrane protein) [Bacillus 204 4.10E-38 EF008-2 g1177793 3 TroA [Treponema pallidum] 181 2.40E-35 EF009-2 giJ 34 9 53 1 iorti [Pasteurella haemolytical >pirl1N07521JN0752 outer 391 4.OOE-64 EF009-2 gi0 552773 hypothetical [Escherichia coli] >grilIP1DId 10 12634 hypothetical 359 1 .9013-63 129.4 EF009-2 gil29 4 0 7 0 lipoprotein 2 [Pasteurella haemolytica] 391 6.40E-63 EF009-2 gi3 4 95 32 lipoprotein [Pasteurella haenolytica] >pirlJN0753lJN0753 outer 386 1. 1OE-'61 EF009-2 gi11488 38 28 3kDa membrane protein [Haemophilus influenzae] 286 5.60E-60 EF009-2 gil] 5 7 36 1 4 28 kDa membrane protein (hIpA) [HaemopHius influenzae] 286 7.60E-60 EF009-2 gi2940 69 lipoprotein 1 [Pasteurella haernolytical 122 4.70E-59 EF009-2 gi 14 6649 lipoprotein-28 precursor [Eschericbia coli] >gi 12905 10 326 -2.20E-58 EF009-2 gi3495 3 0 lipoprotein IPasteurella haemnolytica] >giIl5 0508 lipoprotein 239 7.80E-57 EF009-2 giJ29 4 07 1 lipoprotein 3 [Pasteurella haemolytica] 344 4.90E-5_6] EF009-2 gi12 3 1 47 48 (AE000654) outer membrane protein [Helicobacter pylori] 319 4.2E-53 EF0O9-2; giJ21969 96 lipoprotein homolog [Treponema pallidum] >giJ2 108234 29K 312 2.60E-41 _protein S. *5* 9*
S
5
S
S A S S See...
S S 55 5. 5 5 5 S S S S S* S S *S S S .5 5 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF009-2 gi1 1336657 lipoprotein [Bacillus subtilis] 234 4.OOE-32 EF009-2 gnlIPIlfe233873 hypothetical protein [Bacillus subtilis] >gnljPlDI 182900 242 1.40E-31 EF009-2 gniljPIDIel 18435 similar to hypothetical proteins [Bacillus subtilis] 102 6.80E-22 EFOl 1-2 gnlIDIdIOO96 ferric anguibactin-binding protein precusor FatB of V. 579 3.10E-98 EFOI1I-2 gnlIDIdlOO96 ferric anguibactin-binding protein precusor Fat3 of V. 579 3. 1OE-98 EFOl 1-2 gnljPlDjeI 85374 ceuE gene product [Campylobacter colil 284-- 1 .30E-89 EFOl 11-2 guI 1PIDje 185374 ceuE gene product [Campylobacter ccli] 284 1.30E-89 EFOI 1-2 gi11507 56 40 kDa, protein [Plasmid piMI] >pirIA299281A29928 222 2.80E-52 membrane-associated EFO 11-2 gi115075 6 40 kDa protein [Plasmid pJMl] >pir1A299281A29928 222 2.8OE-712 membrane-associated_____ EF012-2 giJ309662 pheromone binding protein [Plasmid pCF 10] 266 8.70E-1 16 >tpir1B53309IB53309 EF012-2 gif3882 69 traC [Plasmid pADI) >pir1A533lI jA533 10 pheromone cADi1 252 1.IOE-109 binding EF012-2 gn~jPlDIdlOl 18 TRAC [Enterococcus faecalis] 281 3.60E- 103 EF012-2 gnlPlD~d1O65 TraC [Enterococcus faecalis] 277 2.30E-102 EFO 12-2 giJ3 12940 threonine kinase [Streptococcus equisimilis] 227 1 .90E-67 1531S28 153 EFOl12-2 giJ48 8 0 8 dciA-E [Bacillus subtilis] 228 1 170E-46 9SW 0@ 9 9 0 0 0 *0 SO 0* 0 0@ 0 a 0 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF012-2 pirIS 1665l1IS 1 6 6 dciAE protein Bacillus subtilis, 228 1.OOE-45 EFO 12-2 gnllPlDIel18149 (AJ002571) DppE [Bacillus subtilisi >gnllPl.Dlel 183316 228 3.80E-45 EF012-2 giJ400O5 OppA gene product [Bacillus subtilis] 281 3.90E-44 EFOI12-2 giJ143603 sporulation protein [Bacillus subtilis] >gnI PIDe e1183 163 281 7.70E-44 EFO 12-2 gn.1IPlDId 0156 Periplasmic oligopeptide-binding protein precursor. 152 2.20E-43 3 EFO 12-2 gi1157 4 679 oligopeptide binding protein (oppA) [Haemophilus influenzae] 178 2.20E-42 EFO 12-2 giJ47802 Opp A (AAI1-542) [Salmonella typhim-urium] >giJ4780 8 128 1.OOE-37 -precursor_ EFOI12-2 giJ8 82 SSO ORF -f535 [Escherichia cold] >gi11789 397 (AE000384) f535; 228 5.30E-36 This 535 EF014-2 pirID7O070ID70 transcriptional regulator homolog ywtF Bacillus subtilis 101 1.40E-J7 0__ EFOI14-2 gnjPlDe 11698 8 capsular polysaccharide synthesis protein [Streptococcus 121 9.5.0E-27 EFO 14-2 giJ280 4 7 6 9 (AF030373) putative regulatory protein [Streptococcus 121 9.50E-27 pneumofliae] EF014-2 gn1lPIDfe2891 2 6 unknown [Streptococcus pneumoniael 121 1.OOE-24 EFOI14-2 giJ22 6 7 23 9 ORFi [Staphylococcus epiderrnidis] 234 .1.50E-24 EFOI14-2 gi14852 7 5 putative regulatory protein [Streptococcus pneunoiiiae] 121 3.90E-24 EF014-2 giJ280 4 7 35 (AF030367) putative regulatory protein [Streptococcus 121 3.90E-24 pneumoniae]____ EF014-21 giJ2 8047 47 (AF030369) putative regulatory protein [Streptococcus 121 3.9013-24 9 @000 *9* 0S 0@ 0@0 0 C@t 0 0 S *0 S 0 6 0 00 0 0 @0 9 @6 0 000 'to 0 .6 0*0 0*0 *0000 0 0 0 0 0 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EFO 14-2 giJ 1762327 putative transcriptional regulator [Bacillus subtilis] 1 85 2.80E-22 EF0 14-2 gi143 156 membrane bound protein [Bacillus subtilis] >gnlIPlDjI 184471 116 1.l1OE-21 EF014-2 gdIjP1D~dI189 membrane bound protein LytR [Synechocystis sp.] 113 6.20E-20 EF0I4-2 giJ12768 74 EpsA [Streptococcus thermophilus] 103 4.OOE- 17 EF0 16-2 gnlIlDjel 18566 similar to amino acid ABC transporter (binding protein) 194 3.70E-35_ EFO 16-2 gi1409 3 4 arginine binding protein [Escherichia colil >gi176979 4 arti 121 1 .60E-31I EF016-2 gn]IP1D~dI152 Arginine-binding periplasmic protein 2 precursor [Escherichia 121 4.80E-31 7 EFOI16-2 giJ687 65 2 FINY [Escherichia coli] >gn1lPIDIdt 016464 FINY protein 160 5.70E-31 precursor. EFOI16-2 giJ26504lJ 0 (AE001 090) glutamine ABC transporter, periplasmic glutamine- 122 3.30E-29 _binding EFO 16-2 giJ16 4 9 03 5 high-affinity periplasmic glutam-ine binding protein [Salmonella 104 1.80E7.27 EF016-2 gil 1574634 ,glutamine-binding periplasmic potein (glnH) [Haemnophilus 174 2.50E-27 EF016-2 giI4 1569 GInH precursor (AA -22 to 226) [Escherichia coli] 106 4.70E-27 ~>gnlPIDId1 015250 EF016-2 gn]IP1DjdI0 152 Arginine-binding periplasmic protein I precursor [Escherichia 109 .3.70E-26 EFO 16-2 giJ7697 9 1 artI [Escherichia coli) >gi7 69 79 l arti [Esclerichia coli] 127 2.30E-25 EF016-2 gnljPlDjdlOO89 homologous to Gin-binding periplasmic proteins [Bacillus 117 8.50E-24 2 EFO16-21 gi115 4 12 5 J protein [Salmonella typhimurium] >gi4771 8 reading frame 118 21IOE-23 *b
C
C
C
Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
hisJ EFOI6-2 gnIjP1DdlI168 HISTID[NE-BINDING PERIPLASMIC PROTEIN 117 4.50E-23 8 PRECURSOR EFOI16-2 gill 166636 histidine-binding periplasmic protein HisJ [Escherichia cclii 117 6.60E-23 EF017-2 giJ 3 8 826 9 traC [Plasmid pAD I) >pirlA533l I A533 10 pheromone cADi1 421 4.50E-128 binding EF017-2 gnljPIDjdl0l 18 TRAC [Enterococcufs faecalis] 417 5. 113E-124 EF017-2 gn1jPlDjdl0065 TraC [Enterococcus faecalis] 414 4.4013-123 EFOL17-2 giJ3O96 62 pheromone binding protein [Plasmid pCF 101 415 2.40E-l 119 __________>pirIB53309IB53309 EFO17-2 giJ4 05 OppA gene product [Bacillus subtilis] 294 -6.20EA82 EFO 17-2 giJ143603 sporulation protein [Bacillus subtifis] >gn1jPIDjel 1183163 290 2.80E-79 EF0 17-2 giJ3 12940 threonine kinase [Streptococcus equisimilis] 241 2.40E-71 ~>pirlS28 1531528 153____ EF017-2 ___t4880 8 dciAE [BaciULus subtilis] 270 1. 1OE-61 EFO17-2 gnlPIDjell8I49 (AJ002571) DppE [Bacillus subtilisi >gnljPlDI 183316 270 1.5OE-61 EFO 17-2 pirlS 16651 IS]66 dciAE protein Bacillus subtilis 270 3.I1OE-60 EF017-2 giJ30492 5 periplasmic oligopeptide binding protein [Echerichia coli] 171 2.60E-57 EF017-2 gil 14701 4 oligopeptide binding protein precursor [Escherichia ccli] 171 8.70E-56 EFO 17-2 giJ478 02 Opp A (AAI-542) [Salmonella typhimunium] >giJ47 8 0 8 154 1.30E-52 p p.
p p p p .1 .1 0. 0 .*p Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
F0 17-2 giJ88255O ORF-f535 [Escherichia coli] >gi11 78 93 97 (AE000384) [535; 135 5.50E-52 This 535 aa EFO 17-2 giJ157 4 67 9 oligopeptide binding protein (oppA) [Haemopliilus influenzae] 168 2.90E-43 EFO 19-21 gi1438 45 8 lRely N-terminal signal sequence; mature protein probably 104 2.30E- 17 EF02 1-2 grnlIPID1e611492 unknown [Bacillus subtilis] >gnljPDIel 184232 similar to ABC 317 2.50E-103 EF02 1-2 bbsj 173803 CD4+ T cell-stimulating antigen [Listeria monocytogenes, 476 2.80E-81 8 5 EQ- 1167, EF021-2 giJ58lSO 9 tmbC gene product [Treponema pallidum] >pir1A435951A4359 5 152 3.20E-71 membrane EF02 1-2 giJ26 88280 (AE00 1143) basic membrane protein C (bmpC) [Borrelia 101 5.50E-27 burgdorferi] EF02 1-2 gnl1PlDIel 17283 membrane protein A [Borrelia garinii] 142 6.50E- 2 EF02 1-2 gnilPIDfe 117283 membrane rotein A [Borrelia burgdorferi] 141 -9.20E-'22 EF02 1-2 gnIjPIDje 117283 membrane protein A [Borrelia burgdorferi] >giI5 16592 141 9.20E-22 ,membrane EF021-2 EF021-2 EF021-2 EF02 1-2 gril PDle 117283 gi150 84 21 7-gI-17-53 22 5 gnljPDjel 1172R82 gn1jPIDje 117283 ~gnIPIDI 17283 IgnjjP1Djel 178 bmpA(p39,ORFI) [Borrelia burgdorferil antigen P39 [Borrelia burgdorferi] >gil2 68 82 8 l (AEOO1 1143) I It 1.70E-21 basic____ BmpA protein Borrelia burgdorferij- 141 2.70E-20 membrane protein ALBorrelia afehl 141 8.60E-20_ membrane protein A [Borrelia aftelfii 141 8.60E-20 membrane protein A [Borrelia afzelii] 141 8.60E-20 bmp~p9,RF Borrelia burgdorferi] 141 1 .50E- 19 9e*
S
S
S
S
*.S
S S S S *S 55 5 S S
S
S Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF022-2 giJ3 12940 thieonine kinase [Streptococcus equisimilis] 324 5.9013-66 1531528153 EF022-2 giJ 30 9 662 pheromone binding protein [Plasmid pCF 10] 307 5.60E-60 ___________>pirIB53309IB53309 EF022-2 gnljP1Ddl0l 18 TRAC [Enterococcus faecalis] 301 4.80E-59 EF022-2 gnIPIDI 18149 (AJ002571) DppE [Bacillus subtilis] >gnljPIDI 183316 170 5.10E-59 EF022-2 giJ4 880 8 dciAE [Bacillus subtilis] 170 5.20E-59 EF022-2 gnlIDIdlO065 TraC [Enterococcus faecalis] 299 2.80E-58 EF022-2 pirIS 16651 IS 1 66 dciAE protein Bacillus subfilis 170 1.60E-57 EF022-2 gil38 82 6 9 traC [Plasrnid pAD I >pir1A533 I01A533 10 pheromone cADi1 290 2.70E 53 S~~binding EF022-2 giI40005 OppA gene product [Bacillus subtilis] 154 7.30E-48 EF022-2 gi11436 03 sporulation protein [Bacillus subtilis] >ghlIPIDjel 1183163 154 3.I1OE-47 EF022-2 giJ268 8 2 2 7 (AEOO 1139) oligopeptide ABC transporter, periplasmic 215 1.OOE-36 EF022-2 giJ228l 4
S
8 (AF000366) oligopeptide permease homotog All IiBorrelia 215 l.0013-36 burgdorferil_____ EF022-2 giJ30 4 925 Periplasmic oligopeptide binding protein Escherichia coli] 1 31 1 .30E-35 EF022-2 gil 14701 4 oligopeptide binding protein precursor [Escherichia coli] 131 1.80E-34 EF022-2 giJ47 802 Opp A (AAI-542) [Salmonella typhimurium] >gi147 80 8 138 4.90E-34 EF023-21 gil 30 9 66 2 lpheromone binding protein [Plasmid pCF 10] 231 14.70E-66_ *9 e *4* 4 0 4 4. *4 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in Gen~ank and Derwent databases.
__________>piflB53309IB53309 EF023-2 giJ38 826 9 traC [Plasmid pADI]I >pirIAS33lIOIA533 10 pheromone cADI1 223 4.80E-62 binding EF023-2 gnljP1Djd1O1 18 TRAC [Enterococcus faecalis) 226 1.OOE-58 EF023-2 gnlPDId1065 TraC (Enterococcus faecalis] 226 4.40E-58 EF023-2 giJ4 880 8 dciAE [Bacillus subtilis] 157 1.20E3-7 EF023-2 gnlIPlDje1I8l49 (Ai002571) DppE [Bacillus subtilis] >gnljPIDjel 183316 157 1.20E-57 EF023-2 pir166511 6 6 dciAE protein Bacillus subtilis, 157 3.80E-56_ EF023-2 giI40 0 05 OppA gene product [Bacillus subtilis] 137 2.3013-53 EF023-2 gi11436 03 sporulation protein (Bacillus subtilis] >gnlPIDIel 183163 133 6.90E-43 EF023-2 giJ 4 780 2 Opp A (AAI1-542) [Salmonella typhimuriumi >gil47 8 08 135 2.OOE-4) precursor_____ EF023-2 giJ26882 27 (AEQO 1139) oligopeptide ABC transporter, periplasmic 187 9.4013-41 EF023-2 giJ228l 4 58 (AF000366) oligopeptide permease homolog All [Borrelia 187 1.90E-40 ~~burgdorferil EF023-2 gif 8825 50 ORFf535 [Escherichia coli] >gi117 89 39 7 (AE000384) f535; 155 l-30E-38 IThis 535 EF02-3-2 gi13049 25 1,eriplasrnic oligopeptide binding protein [Escherichia coli] 130 9.QOE-37 EF023-2 gi114701 4 oligopeptide binding protein recursor [Escherichia colil 130 170E-34 EF026-21 2352482 (AF005097) unknown [Lactococcus lactis] 141 1.I1OE-23 F027- 3062 heromone binding protein [Plasmid pCFI 109 8 6.20E-71 *6* 9* 9 9 9
I
9 9* 9 *9 9 9 9** 9 .9 9* Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
__________>pirlB53309IB5 3309 EF027-2 gnlPDIdI0065 TraC [Enterococcus faecalis] 202 1.50E-68 EF027-2 gnlJ1PIDIdl10 118 TRAC [Enterococcus faecalis] 202 1.50E-68 EF027-2 giJ38 8269 traC [Plasmid pAD I1] >pirIA533 I OA533 10 pheromone cADi1 213 8.30E-68 binding EF027-2 gnlPIDIe 181l 49 (AJ00257 1) DppE [Bacillus subtilis] >gnljIlDlel 183316 222 3.70E-41 EF027-2 gi 1 4 88 08 dciAE [Bacillus subtilisl 222 4.90E-41 EF027-2 pirlS 1665 1 S 166 dciABE protein Bacillus subtilis 222 LIlOE-39 EF027-2 giJ4 00 0 5 OppA gene product [Bacillus subtilis] 251 4.10E-39 EF027-2. gi11 436 0 3 sporulation protein [Bacillus subtilisi ]Dle 183163 247 5.80E-S9 EF027-2 EF027-2 EF027-2 gi13 12940 gi2688 227 gi1228 1458 threonine kinase [Streptococcus equtsimii'sj >pirIS28153IS28l53 (AEOOI 139) oigopeptide ABC transporter, peiplasmic 131 2.40E-24 (AF000366) oligopeptide permease homolog All [Borrelia 131 2.40E-24 EF027-21 giJ 22 S 1468 burgdorferi].
(AF000948) OppAIlV [Borrelia burgdorferi] >gil2 6 89 89
I
(AE000792) oligopeptide binding protein oppA) Haemophilus influenzae] B. subtilis alkaline phosphatase iIA; P 19405 secretory 117 13:O0E-20 EF027-21 EF028-2 gil 1574679 gnlIPIDjI 1204 130 996 I982 3 .50E-20 3.60E-131I 2.90E- 129
I
pirlB3W 6n3Ilklfe phosphatase (EC 3.1.3. 1) 111 precursor Bacillus 9** 9 *0e 9* 9.
9 9 9 9 99 9 9 9 9 9 9. C 3* .9 9 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF028-2 EF029-2 EF028-2 EF028-2 EF028-2 EF028-2 EF028-2 EF028-2 EF028-2 EF028-2 EF028-2 EF028-2 0 gi147 0383 giJ 143324 gil 147243 gil 147237 gill 47 23 9 gil 147241 gill1 27 7 127 gi1147 229 gi181 8851 gil 147245 gi1147231 g147235 gijl0l 6 OlO alkaline phosphatase A [Bacillus subtilis] >gnljPIDlel 182942 APase I [Bacillus lichenifonuis] >pir1A44828lA4482 8 alkaline alkaline phosphatase precursor (EC 3.1.3. 1) [Escherichia coli] alkaline phosphatase precursor (EC 3.1.3. 1) Escherichia colij alkaline phiosphiatase precursor (EC 3.1.3. 1) f[Escherichia coli) alkaline phosphatase precursor (EC 3.1.3. 1) [Escherichia coli] phoA gene product [Cloning vector pFWjol]>i1 7 I0 phoA gene a nelkaline hosphatase precursor (EC 3.1.3. 1) [Escherichia coli] alkaline phosphatase [synthetic construct] alkaline phosphatase (phoA) (EC 3.1.3. 1) [Escherichia fergusonii] alkaline phosphatase precursor (EC 3.1.3. 1) [Escheichia coli] alkaline phosphatase precursor (EC 3.1.3. 1) [Escherichia colij alkaline phosphatase with N-terminal PeIB-leader and Cterminal YncB [Bacillus subtilisi >gnillPlDlel 183421 similar to mnicrococcal simiar o hyothtica prteins rBacillus subtilis] nuclease [Staphylococcus aureus] >pirlA00790NCSAF mircoccal heonclease [Staphylococcus hicus] 803 184 183 178 178 178 174 178 174 177 174 174 174 4.80E-1 19 3 .00E-54 8.3013-54 4.40E-53 4.40E-53 4.40E-53 4.9013-53 8.4013-53 1. 1OE-52 1.20E-5,2 1 .6013752 1 .60E-52 1 .60E-52 257 V3:50E--551 EF029-2 Igif 1750126 EF29- EF029-2 Ra[PlDle 18360 giJ673492 263 320 7.80E-53 2.20E-39 15 5 1 9. IOE 39] e Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF029-2 g1147 146 thermonuclease [Staphylococcus intermedius] 145 4.90E-32 >vidS26079IS26079 EF030-2 *i48808 dciAE [Bacillus subtilisl 149 i. 1OE-66 EfO 3 0-2 lP1IDe11l8l49 AJ00257l)D E Bacillus subtilis >nlIP1De118331 6 149 1.50E-66 EF030-2 S16651 S166 dciA.E protein Bacillus subtilis 149 5.90E-66 EFO30-2 giJ3O966 2 pheromone binding protein [Plasmid pCF 101 227 7.40E-52 I>pir 853309 853309 nVII- rnlPIDW1dl1l 18 TRAC [EnterococCUS faecalis] 2377 7.0-2 9* 4* S S S S
S..
S S S S S S S. S S S S S S 55 S S S S S *5
S
Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF030-2 EF033-2 EFO3 3-2 EF033-2 EF03 6-2 EF036-2 [EF036..2 gil 1574679 gnlIDleI 18439 pirIS5443 7
IS
544 gil 1619623 gnllPIDldl 0102 2 gi126 2 2858 giJ2 622859 EF036.21 gi126 88 I 15 oligopeptide binding protein (oppA) [Haemophi lus influenzae] similar to iron-binding protein [Bacillus subtilis] hemin binding protein Yersinia enterocolitica hemnin binding protein [Yersinia enterocolitica] ORLF 108 [Bacillus subtilis) >gnljPIDlel 185766 alternate gene (AE000929) phosphate-binding protein PstS rMethanobacten'um (A.E000929) phosphate-binding protein PsLS homolog I'Methanobacteriufli (AEQO 132) phosphate ABC transporter, periplasmic phosphate-binding (AF05097) unknown [Lactococcus lactis] hypothetical protein [Escherichia colil >gil78 65 1 1 (AEOOO- 139) 2-5A-dependent RNase [Mus musculus] >pirIB45 771 lB4577 1 G9a [Homo sapiens] >pir1S30385IS30385 G9a protein human erythroid ankyrin Mus musculus] >pir.S37771 lS3 77 7 1 ankyrin, ankyrin [Mus musculus] >pil495 0 211 4 9 50 2 ankyrin mouse alt. ankyrin (variant 2.2) [Homo sapiens] ankyrin MHow saies ankyrin [Homo sapiens]__ ankyrin 1, erythrocyte splice form 2 human 148 164 108 108 544 183 1.40E-45 158 2.40E-41 1-I17 1 1.IOE-12 2.60E- 14 1.40E-l II 2.OOE-1 I1 1.20E-96 EF03 7-2 EF040-2 EF040-2 EF040-2 EF040-2 EF040-2 EF040-2 EF040-2 EF040-2 EF040-2 1i123 524 82 gil 16575 16 gi129 326 5 gi1287 86 5 gil3 11817 gill 191940 g17 8646 gil11845265 pirIA3 50491A3 5 0LL 141 208 105 143 119 119 120_ 120 120 120 1.10t23_ 1 .90E-29 1.OOE- 17 8.3 0E- 14 4.80E-13 4-90E- 13 1.50E-12 1.80E-12 1.80E-12 1.80E-12 i q U* S i .9 9 0 *9 S *9 S 0 9 0* 9* 9 9q 9 0 *9 to 9*9 9 9 5 4. S 4 Table 2. Closest matching sequen ces between the polypeptides of the present invention and sequences in GenBank and Derwent, databases.
C
C., C
C
C. C OC9 C C C C
C
C C C C C C eq
C
C
C..
.0 .00 .0 'a 0 0 C C 0 a C a 0 q. 0 C Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF041-2 gil 1 4 7 0 11 4 oligopeptide binding protein precursor [Escherichia coli] 160 1.50E-28 EF041-2 gi12253 286 (AF005657) plasminogen binding protein [Borrelia burgdorferil 134 5.OOE-27 EF045-2 giJ30885 4 oligopeptide binding protein [Lactococcus lactis] 437 3.2013-125 EF045-2 gi)49 5 1 8 1 oligopeptide binding protein [Lactococcus lactis] 426 9.70E- 124 EF045-2 gil6779 45 AppA [Bacillus subtilis] >gnljP1Dlel 183158 oligopeptide ABC 154 2.3013-31.
EF045-2 giJ29301 4 peptide-binding protein [Lactococcus lactis] 158 2.40E-14 >pirlB47098lB34709 8 EF048-2 ,gi1 1574060_ hypothetical [Haemophilus influenzae] >pir1164 1641164164 250 2 .30E-41I EF048-2 dbjjjA.B001488_. (ABOO 1488) SIMILAR TO C4-DICARBOXYLATE- 208 3.6013-34 2 BINDING PERIPLASMIC EF048-2 giJ466717 No defintifon. line found [Escherichia coli] >gill1 7 90 004 199 1.3013-30 EF048-2 gi1460O 6 periplasmic G4-dicarboxylate binding-protein [Rhodobacter 162 1.405'-25 capsulatus] EF048-2 gil157 3 10 2 hypothetical [Haemophilus influenzael >piflH641431H64143 244 3.8013-25 EF048-2 gi12182 530 (AE000085) Y4miM [Rhizobium sp. NGR,234] 114 _5.60E- 18 EF048-2 gi11 5 7 2999 hypothetical [Haemophilus influenzae] >pirE64141 JE64141 116 5-.90E-I15 EF049-2 gi11495 81 maturation protein [Lactobacillus paracasei] 241 2.40E-55 __________>pirIA448581A44858 EF049-2 giJ47 198 ORF (AA I to 299) [Lactococcus lactis cremoris] 239 1.OOE-54 ___________>pirlS080831SO 8 0 8 3 EF049-21 gI14324 02 maturation protein [Lactococcus lactis] >0i162305S5 proteinase 239 6.20E-54
S
0* 9.
S
VO .0 00 6 0 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in Gen.Bank and Derwent databases.
EF049-2 gi147283 5 ORFI [Lactococcus lactis cremoris] 241 1.5013-53 EF049-2 gi(39782 33kDa lipoprotein [Bacillus subtilis] >gnlPIle325 181 33kDa 128 8.90E-40 EF051-2 gnilPIDIdIOl 14 molybdate-binding periplasmic protein [Synechocystis sp.) 173 3.2013-50 2_ EFOS 1-2 gnlIP[DIe 118602 alternate gene name: yvsD; similar to rnolybdate-binding 314 5.9013-50 EF051-2 giJl574S 4 6 lsg locus hypothetical [Haemophilus influenzae] 161 2.20E-43 ~~>pirIA64 1751A64 175 EF051-2 giJ50449 8 periplasmic inolybdate-binding protein [Escherichia coli] 148 1.40EB-30 147817 EF051-2 gij14 8 9 39 ORE 8 [Haemophilus influenzae] >pirIS27583IS27583 150 8.IO13-28 hypothetical_____ EF054-2 giI150556 surface protein [Plasmid pCF 101 >pirA418261A4]826 probable 1490 1.8013-192.
EF054-2 gnlPiDle23657 1 cell wall anchoring signal [Enterococcus faecalis] 515 8.l1OE264 EF054-2 giJ457 3 8 ORFC [Enterococcus faecalis] >pirIJH0204IJHO2O4 372 1 .60E-58 ___________hypothetical 30.5K EF054-2 giJ4965 2 O orf iota [Streptococcus pyogenes] >pirIS68l 2 5IS 4 5O 9 l 362 1.30E-43 hypothetical_____ EF054-2 gill1 6 06 93 sporozoite surface protein [Plasmodium yoelii] 286 4.30E-33 >pir1A45559lA4555 9 EF054-2 gill 8135 2 3 PbTRAP [Plasmodium berghei] 305 1.3013-32 EF054-2 -gnIl PIDIe225 687 1zinc finger protein [Mus musculus] >gnlP1De225688 zinc 246 3.60E-26 A 'I4 I AnY7 ir EF054-2 giJ22903 94 lgG and IgE immnunoreactive antigen recognizea oy sera from patients 0*0 0 000000 0eS 0* 0 *09 0 0 0 0 0 0 0* 4 0 00 0 0 00*090 0 0 0 0 0 0 S. 0 0 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF054-2 EF054-2 EF054-2 EF054-2 EF054- I EF054-2 giJ 229 0 392 giJ 46523 pinIS 153301IFCS 0 g1620100 _gj163 686 1 gi163 689 lgG and IgE immunoreactive antigen recognized by sera from B antigen [Streptococcus agalactiael IgA Fc receptor precursor Streptococcus agalactiae Pro- and Glu-rich, PENPEV (10x); similar to Streptococcus
B
NF-M c-teriu Glugaus N F-M protein [Gallus gallus) >pirlS I 57621S 15762 neurofilament tri let TATA-box like sequence (Us 11) [Human herpesvinius
I)
>gil29I 4 93 18 cell wall anchoring sigal [Enterococcus faecalis] surface protein Plasmid pCFl10 >pin A4 1826 A4 1826 probable ORFC [Enterococcus faecalis] >pirIJHO2O4IJHO20 4 hypothetical 30.5K orf iota [Streptococcus pyogenes] >pirIS68 1251S45091 hypothetical sporzit suface protein [Plasmodium yoeli) 232 228 222 222 2.80E-23 L OOE-22 3I OE-21 6.90E-2 1 1 .5017-21 194j 4.1IOE- 19 EF054-2 1 g07 57867 EF059-2 EF059-2 9-2 gnljPlDe2 67 gil 15 0 5 56 giI457 3 8 418 606 366 5 60E-95 3 .70E,-87 9.30E-50 EF059-2 giJ49 6520 9-2 1n1 6 0 693 367 5.90E-44 344 =11013-38 EF059-2 EF059-2 EF059-2 >pirIA455591A4555 9 1iI813523 PbTRAP [Plasmodiunm ben heij e ysr rr gi12 2 9039 4 lgG and IgE immunoreactive antigen recognizedb eafo iatientIs giJ2 290 39 2 IgG and IgE irnnunoreactive antigen recognized by sera from Datients 295 251 251 3.0013-29 3 .4013-29 0 *0 0* .00 0. 00 .0 00 0 0 0 0 0* Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF059-2 gij 1620100 ]Pro- and Glu-rich, PENPEV (l Ox); similar to Streptococcus B 253 6.40E-27 EF059-2 giJ465 2 1 Fc receptor [Streptococcus agalactiae] >pirjA60234jA60234 IgA 197 2.70E-26 Fe EFOS9-2, giJ4 6 523 13 antigen [Streptococcus agalactiae]__ 232 9.30E-26 1 0-InT SEF059-2 1pirIS 1 53301IFCS IgA Fc receptor precursor Streptococcus agaiactiae EF059-2 EF059-2 EF059-21 EF061-2 1 EF061-2] EF061-2 EFO61-2 EF061-2 EF061-2 EF061-2 EF061-2 0 gil425356 giJ4577 69 gnlPIDje23657 1 gi11 50556 giJ49 6 5 2 0 giJ4573 8 gil813523 gill 60693 giJ6 3 6 8 9 zicfinger protein [Mus musculus] >gnljP1Dje225688 zinc zona pellucida protein [Pseudopleuronectes americanus] Collagen [Bombyx mnon] >pir1S42886IS42886 collagen silkworm cell wall anchoring signal [Enterococcus faecalis] surface protein [Plasmid pCF 10] >pirlA418261A4l826 probable oil' iota [Streptococcus pyogenes] >pirIS681I251S4S09
I
hypothetical OR.FC [Enterococcuts faecalis] >piriJHO2O4IJHO2O4 hypothetical 30.5K PbTRAP [Plasmodium. berghei) sporozoite surface protein [Plasmodium yoeliil 591A4555 9 NF-M c-terminus [Gallus gallus] ,NF-M protein [Gallus gallus] >pirS I 57621S 15762 neurofilainent triplet IlgG and IgE immnunoreactive antigen recognized by sera from 229 209 925 350 308 I- t 234 IA.4E-22 I OOE-21 7.60E- 19 8. IOE-I 18 1.50E-,07 IA.4E-58 322 1 6.40E-50 263 241 232 L OOE-26 -9:.O0E-25 2. 1OE-22 2.60E-22 EF06 1-2 Ig1229 0392 176 jT2.40E-21_ *9* 9** .9 .9 9 999 9 9 9 9 9 9 0 9 9.
9 9 9 9 9 9** 99* *99 9 9 9 9e 9 0@ .9999 9.999 99 .9 9 9 9 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
________patients EF061-2 gil 162 0 100 Pro- and Glu-rich, PENPEV (IlOx); similar to Streptococcus B 165 2.70E-20 EF061-2 gn]IPIDje225687 zinc finger protein [Mus musculus] >gnljPlDje225688 zinc 197 7.80E- 19 EFO6I-2 gi 1601355 interspersed repeat antigen [Plasmodium falciparuml 199 8.20E-l18 EF061-2 gi1410750 interspersed repeat antigen [Plasmodium falciparum] 199 8.90E- 18 EF061-2 giJ229O 388 lIgG and IgE immunoreactive antigen recognized by sera from 182 1.40E- 17 ton 1~17 EF061-2 EF062-2 EF062-2 EF062-2 EF062-21 E F062-2 EF062-2 gi[2 29 0 394 IgG and IgE iinmunoreactive antigen recognizea by sera fromn patients asal gene product (AA 1-1296) [Enterococcus faecalis] giJ470 4 9 gi143324 giI2 109266 gill15 05 55 gill 1009 73 giJ47 248 3716 0 aggegaionsubstance (ASPI1) [Enterococcus faecalis] ragregtio sustace Entrococcus faeciwn] aggregation substance [Plasmid pCF 101 >pirIH416621H4 1662 5523 0 6338 0 SspB precursor [Streptococcus gordonii] PAc protein precursor (AA -38 to 1527) [Streptococcus mutans] ~surface protein antigen precursor [Streptococcus sobrinus) YJUVI-31Y -t S 1.70E-38 -1 5iJ0L-3~ EF062-2 IgnIlPIDld 10 150 5-.00E-36 EF062-2 EF062-2 7 giJ4726 7 bbs1 148453 cell surface antigen 1111 [Streptococcus mutans] >PiTlS068391S06839 t 1 6.5013-36 1.2 E-3 SpaAendocarditis imnmunodominant antigen [Streptococcus sobrinus, 1132 .00% too 4 *90 0* 5 00 0 00 00 *0 SS Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in Genl~ank and Derwent databases.
L9UIZ-i) EF062-2J giJ47620 antigen 1 /11 [Streptococcus sobrinus) >pirIA603381A6033 8 surface 90E-33 -1
-I
109 2.IOE-34 EF062-2IpirIA35186IA35 salivary agglutinin receptor precursor Streptococcus EF06 -2 EF062-2 EF063-2 EF063-2 EF063-2 EF063-2
I
gil 11 097 1 gill00975 giJ47 049 6i1433 24 giI2 109266 .gi1l5 0 5 55 F-gJ1 47248 SspA [Streptococcus gordonii] SspA [Streptococcus gordonii] asal g ene product (AA l- 1296) [Enterococcus faecalis] aggregation substance (ASPI1) [Enterococcus faecalisl aggregation substance [Enterococcus faeciurn] aggregation substance [Plasmid pCFI10] >pirlH41662lH4l66 2 150K mating SspB precursor [Streptococcus gordonii] PAc protein precursor (AA -38 to 1527) [Streptococcus mutansi surface protein antigen precursor [Streptococcus sobrinusi cell surface antigen I/11 [Streptococcus mutans] irlS0683 9 1SO 68 39 SpaAendocarditis inununodominant antigen [Streptococcus sobrinus, antigen 1 /11 [Streptococcus sobrinus] >pirlA60338lA60 33 8 surface salivary agglutinin receptor precursor Streptococcus 109 110 3716 4003 5523 6338 110 107 3.80E-32 2.3012-21 0 0 0 0 9.90E-39 I .70E38 EF063-2 gnllPlDldIO15O 7_ EF063-2 gil47 267 EF063-2 bbs1148453 EF063-2 giJ 4 7 6 20 EF063-2 pirlA35 1861A35 132 5.OOE-36 107 6.5013-36 132 1.20E-35 F13-2 2.90E-35 109 2.I1OE-34 *0* 0*0 *00 0 0 4 0 0* 04 09 4 0 0 0 0 0 0 0 0 000 ~0e *0 *0 0 0 0 00 *0@00 *0 *4 0 0 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF063-2 EF063-2 EF064-2 EF064-2 EF064-2 EF064-2 EF064-2 EF064-2 gi109 71 11~l100975 SspA [Streptococcus gordoniil SspA [Streptococcus gordonij gil470 4 9 giJ43 324 giJ2 109266 gij 150555 gill 100973 gi1472 48 EF06,4-2gn IlPIDjI 1050 EF064-2~ 7_ asalI gene product (AA 1-1296) [Enterococcus faecalisl aggregation substance (ASPI1) [Enteocccus faecali s] aggregation substance [Enterococcus faeciurn] aggregation substance [Plasmid pCFI1O] >pirlH4l6621H41662 150K mating SspB precursor [Streptococcus gordoni] PAc protein precursor (AA -38 to 1527) [Streptococcus mutansi Surface protein antigen precursor [Streptococcus sobninus] cell surface antigen 1/11 [Streptococcus unutans] >p)irlS068 39 1SO 6839 SpaAendocarditis immunodominant antigen [Streptococcus sobrinus, antigen 1 /11 [Streptococcus sobrinus] >pirlA603381A603 3 8 surface salivary agglutinin receptor precursor Streptococcus SspA [Streptococcus gordoniil SspA [Streptococcus gordonii] T06D8. 1 [Caenorhabditis elegans] 110 110 3716 4003 5523 6338 110 107 132 107 3.80E-32 2.30E-21I 0 -0 0 0 9.90E-39 I1.70E-38 5 .OOE-36 [6.so36 EF064-2 1 bbs148453 13 .20E-35 EF064-2 EF064-2 giJ4 7620 pirlA35 186fA35 1321 109 110 110 137 2.90E-35 2. 1OE-34 3 .80E-32 2.30E-21I 8.50E- 17 qI EF064-2, gill1 0 09 7 1 EF064-21 gill 100 97 5 EF068-2 giJ790398 4 000 4 C@S C S 4 0 C C S. S C C
C
C
.o a
C.
4 C 0O 9
C**
C. Se C C **C*C eeC....
.4 4 C Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF068-2 gnIjP1D~d10208 inembrane glycoprotein [Equine herpesvirus 1] 210 5.80E-16 4 EF068-2 gi1228 6 204 (AFOI 1339) unknown [Acinetobacter calcoaceticus) 121 8.40E- 16 EF068-2 gi13 30862 membrane glycoprotein [Equine her-pesvirus 1] 208 >pirIH3 68021VGBEX EF068-2 gi117 0 7 24 7 partial CDS_[Caenorhabditis elegansl 131 3.70E- EF068-2 gn1lPIDId10208 membrane glycoprotein [Equine herpesvirus 1] 203 6.20E- 4 EF068-2 giJ2133 92 antifreeze glycoprotein [Notothenia coriieeps] 102 4.60E- 13 >pirlA3 84201A3 84 20 EF068-2 gnljP1Dje 125464 (AL022022) PORS-family protein [Mycobacteriuin 145 1.SOE- 12 EF068-2 giJ951 4 60 FIM-C.1 gene product [Xenopus laevis] >pir1A451551A45155 109 2.70E-12 mucm EF069-2 giJ79039 8 T06D8.1 [Caenorhabditis elegans] 137 8.50E- 17 EF069-2 gnlPIDIdlO208 membrane glycoprotein [Equine herpesvirus 1] 210 5.80E- 16 4 EF069-2 giJ22 862 04 (AFO 1133 9) unknown [Acinetobacter calcoaceticus] 121 -&.40E-16 EF069-2 gi13308 62 membrane glycoprotein [Equine herpesvirus 11 208 __________>pirIH368021VGBEX EF069-21 EF069-2 EF069g1117 7 247 gn~jPID~d 10208 4 giJ2 13392 partial CDS [Caenorhabditis elegans] zi 1)1 VLI membrane glycoprotein [Equine herpesvirus 1] 203 6.20E- antifreeze glycoprotein [Notothenia coriceps] 102 .4.60E-13 OO00 oS S0 065 0@ 000 S0O 96 00 .0 000 S .0.
0O 400 0 s 0 0 0 SO S0 5 0 0 Table 2. Closest match ing sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
8420 jA3 8420 EF069-2 gnljPIDje 125464 (AL022022) PGRS-family protein [Mycobacterium 145 1.50E-12 tuberculosis] ici EF069-2 giJ95 1460 FIM-C.1I gene product [Xenopus lacvis] >prA. 514 5 mucin I- i I 137 1 8.50E-17 EF070-2 EF070-2 EF070-2 EF070-2 EF07 0-2 EF070-2 EF070-2 gi179 0398 gnlPIDIdIO08 4 giJ228 6204 giJ33O862 gi1 1707247 gnlPIDId 10208 4 giJ2 13392 T06D8. 1 [Caenorhabditis elegans]_ membrane glycoprotein [Equine herpesvirus 1] 1' 210 ]5.80E- 16 I K 121 ~.quL-1c~ (AFOl 1339) unknown [Acinetobacter calcoaceticus] membrane glycoprotein [Equine herpesvirus I1I >pirIH368021VGBEX 1 partial CDS [Caenorhabditis elegans] membrane glycoprotein [Equine herpesvirus I1 208 1.1 0E- 1113.70E,15 6,20E- I t 4.60E-13 antifreeze glycoprotein [Notothenia coriiceps) >pirIA3 84201A3 8420 4.60E-13 EF070-2 guljPIDlel 25464 EF070-2 gi19514 6 0 EF07 1-2 gn1jP1Dje3064 28 EF071-21 giJ853751 EF073-21 gi143830 4 T 145 I .50E-12 (AL022022) PGRS-famnily protein [Mycobacterium 109
I
2.70E- 12 tuberculosis]_ FIM-C.1I gene product [Xenopus laevisl >pir1A451551A45 155 mucin unnamed protein product [Bacteriophage ri ti >gil 1353566 Lysin N-acetylinuramoyl-L- lanine amnidase [Bacteriophage AS1 11] xpaC [Bacillus subtilisi >gntIPlD~dlO05803 hydrolysis of 127 12.OOE-37 273 173 2.60E-36 7.I1OE- 16 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF074-2 giJ125669 8 chitinase [Serratia marcescens] >gill 2 5 669 8 chitinase [Serratia 618 2.60E-104 EF074-2 giJ17 639 85 chitmnase A [Vibrio harveyiJ 526 2.80E-84 EF075-2 giJ143 156 membrane bound protein (Bacillus subtilis] >gn1lPIDl 184471 593 1.70E-91 EF075-2 pirID7007OID7O transcriptional regulator homolog ywtF Bacillus subtilis 11 I 1I.90E-59 0 EF075-2 giJ1762 32 7 putative transcriptional regulator [Bacillus subtilis] 148 9.60E-53 EF075-2 giJ127 68 74 EpsA [Streptococcus thermophilus] 239 2.20E-33 EF075-2 gnlPIle28912 6 unknown [Streptococcus pneumoniael] 150 1.20E-27 EF075-2 giJ4852 75 putative regulatory protein [Streptococcus pneunloniae] 150 2.50E-27 EF075-2 g112 80 4 7 35 (AF030367) putative regulatory protein [Streptococcus 150 2.50E-27 prieumoniael EF075-2 gij280 4 7 47 (AF030369) putative regulatory protein [Streptococcus 150 2.5O -271 pneumfoniae) EF075-2 gnlIDle 116988 capsular polysaccharide synthesis protein [Streptococcus 148 5.30E-27 EF075-2 giJ280 4769 (AF030373) putative regulatory protein [Streptococcus 148 5.30E-27 Spneunioniael EF075-2 gill 147744 PSR [Enterococcus hirael 109 2. 1OE-23 EF075-2 giJ790 43 5 PSR [Enterococcus faeciuniI >pir1S541771S54177 PSR protein 102 4.4011-19 EF07 5-2 gi12 2672 39 ORF I [Staphylococcus epidermidis] 109 8.50E- 19 EF075-2 gnilPlDIdI189 membrane bound protein LytR [Synechocystis sp.] 121 2.80E-16 EF077-21gnlPIDIdl~l 13 cadmium-transporting ATPase [Synechocystis sp.] 396 12.30E-1 13 1 a..
a a a a..
a a. a a. a a. a. a a a Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF077-2 gillS 0 7 19 cadmium resistance protein [Plasmid p125 8 1 373 8.60E-1 12 >pirlA3256 I IA3256 1 EF077-2 gil1143753 cadmium-efflux ATPase [Bacillus firinus] >pirlD42707lD42707 361 8.1IOE-1 I I probable_____ EF077-2 giJ15297 8 ElI-E2 cadmium efflux adenosine triphosphatase 381 4.30E-1I EF077-2 gnljPIDle248808 unknown [Mycobacterium tuberculosis] 298 3.50E-107 EF077-21 giI4956 46 ATPase [Transpo son Tn5422] 361 12.1OE-106 EF077-2 EF077-2 EF077-2 EF077-2 EF077-2 EF077-2 EF077-2 EF077-2 EF077-2 EF078-2 EF078-2 gnllPIDI 18497 gil 1 990 49 gnllDlel 118603 gnllPIDe3O6540 gnljPIDje263525 _gilIPIDle264090 gnljPlDjdlOl 13 giJ177 3 1 66 similar to heavy metal-transporting AI rase LijacIIlus cadmium resistance protein [Latococcus lactis] similar to heavy metal-transportigATPase 352 254 3.60E-100 9.90E- 100
I
1* unknown [Mycobacterium tuberculosis] P-type ATPase [Mycobacteriun tuberculosis] 199 5.50AWS >gnl1PIDle2494 13 unknown [Mycobacterium tuberculosis] cadmium-transporting ATPase [Synechocystis sp.] probable copper-transporting atpase [Escherichia coli] 250 260
I
3 .OOE-84 L OOE-81 212 1-4.70E-80 >17 8 6 6 9 1 gi113 549 35 probable copper-transporting atpase Escherichia coli] -gil 143331 alkaline phosphatase regulatory Protein (Bacillus subtilis giJ4l0l 4 2 OR-FX18 [Bacillus subtilis] >gnljPDlel 185580 two-component sensor 212 257 235 8,5013-79 550E 58 M e
S
S
S
S
S
S..
S S S S S S *5 *S 5 5 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF078-2 gn1jPIDjd1lI 19 homologous to sp:PHORBACSU [Bacillus subtilis] j219 4.20E-44 EF078-2 gi1157 5578 histidine protein kinase [Thermotoga manitima] 191 7.10E-44 b~~~i~~~fbflp~~~IA Ai~ I~coocStcsceosi~ IAAF..d rTfl~O 'I ,.i1 1 Q')oof wstiliinp 1-inase r, actococcus lactis cremonsl EF078-2 6 i2 182992 Ihistidine, kinase [iLatoccu lacti rmsrm)... 152 EF078-2 gnlIDjdlOl 13J EF078-2~ EF078-2 EF078-2 EF078-2 EF078-2 gill 492 96 giJ581 1 88 gn1jP D~d 10 108 7 ffnlP1Dle2665 92 gi121 829 96 gnIlPIDd 101 113 giJ29 4893 gi1288 420 sensory transduction histidine kinase [Synechocystis sp-.
phosphate regulatory protein phoR (gtg start codon) [Kiebsiella phoR gene product (AA 143 1) [Escherichia coli] >gil6575 96 sensory transduction histidine kinase [Synechocystis sp.] 228 226 138 I I 7.
I OE-39 60E-33 60E-32 70E-32 unknown [Mycobacterium tuberculosis] 232 1. 1 Opal histidine kinase [Lactococcus lacti cremoris) 206 i 1.3012-3 1
I
EF079-2 giJ2O9 87 l 9 EF081-2 giJ467 80 6 EF081-2 gi1790 429 EF08 1-2 IgnlPIDtE2083 65 EF08 1-21 gi7 90433 sensory transduction histidine kinase [Synechocystis sp.] 1 phosp ate regulatory protein phoR (gtg start codon) [Shigella drug sensory protein A [Synechocystis PCC6803] >gnlPIDId 1017420 putative fimbrial -associated protein [Actinomyces naeslundii I penicillin-binding protein [Enterococcus faecalis] low affinity penicillin-binding protein 5 (PBP5) [Enterococcus penicillin-binding protein 5 [Enterococcus faccium] low affinity penicillin-binding protein 5 (PBP5) [Enerococcus 225 106 183 1356 1.6013-31 2.50E-31I 8 .60E-26 2. 10E-178 256 1.30E-3 I 607 1 .00E-78 604 604 L.IOE-78 2.70E-78
C
C C C C C C C C. C C C C C C C C C a. Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF081-2 gi(790 4 37 low affinity penicillin-binding protein 5 (PBP5) [Enterococcus 602 5. 1OE-78 EF081-2 giJ790 4 3 1 low affinity penicillin-binding protein 5 (PBP5) _[Enterococcus 591 2.60E-77 EF081-2 gi143342 D-alanyl-D-alanine carboxypeptidase [Enterococcus hirael 587 9.30E-77 EFO8I-2_ giJ49000 D-alanyl-D-alanine carboxypeptidase [Enterococcus hirae] 572 5.20E-74 EF081-2 gn1jPIDIl079 penicillin-binding protein 2 [Bacillus subtilisi 149 7.40E-24 4 EF081-2 gnlIPIle3l5088 MecAl [Staphylococcus sciuril IIl 4.40E- 19 EF081-2 gn1jPDje28665I MlecA protein [Staphylococcus sciuri] 106 2.90E-18 EF081-2 gnljPIDje3I658I MecA protein [Staphylococcus sciuri] Ill 2.90E- 18 EF081-2 gnlIPID16316607 MecA2 protein [Staphylococcus sciuri] 101 3.70E-14 EF081-2 glIjPIDe366l~I63 MecA protein [Staphylococcus sciuri] >gi4 6 6 1 3 mnecA gene 101 3.0-,1 EF083-2 giJ49 62 8 3 lysin [Bacteiophage Tuc2009] 436 6.20E,-176.
EF083-2 g!1530798 LysB [Bacteriophage phi-LC3] 421 3.OOE -175 EF083-21 gil 1 66 183 muramidase [Bacteriophage CP-71 186 1.20E-21 EF083-2 giJ1661 8 8 muramidase [Bacteriophage CP-91 >pirIJQ04381MUBPC9 188 5,OOE-21 EF083-2 giJ6 230 84 muramidase; muramidase [Bacteriophage LL-H] 193 8.40E-20 EF083-2 gi116617 5 muramridase [Bacteriophage CP-11 175 _3.40E- 19 EF083-2.gnlPIDle22 1272 lysozyme [Bacteriophage CP-1] >pirIA31086MUBPCP 175 3.4013-19 EF083-2 piriJQ04371MU N-acetylmuramoyl-L-alanine amidase (EC 3.5.1.28) phage 171 9.5 0E- 19 EF083-2 giJ 4 10502 LysA [Bacteriophage inv4] >pir1538 47 7
IS
384 7 7 lytic enzyme 187 8.90E- 17 jlysA EF083-21 gi793850 Ilysin [Lactobacillus bacteriophage phi adh] >gnlPlDel2l73141 117 5.60E-l15 S S 9*
S
5 9 *9
S
'S S S S S
S..
ra S 0 0 0 0*0 0* 0 0 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in Genl~ank and Derwent databases.
lysin_ EF084-2 gi122933l 2 (A-1008220) YtfP [Bacillus subtilis] >gnIPlDjel 185879 similar 438 1.70E-140 to_ EF084-2, gi123672 34 (AE000425) hypothetical 43.8 kD protein in rhsB-pit intergenic 167 2.2013-51 EF084-2 giJ912464 No definition line found [Escherichia colij 167 6.OOE-51 EF084-2 gJPJ Djdl 1112 hypothetical protein [Synechocystis sp.] >pirlS76678lS76678 151 6.I1OE-42 EF084-2 gill 1573954 __hypothetical [Haemophilus influenzaej >pirjG64 161 IG64 161 142 2.9013-40 EF085-2 .gil 1209527 protein histidine kinase [Enterococcus faecalis] 2023 8.0013-279 EF085-2 giJ467057 phoR; B2168_C3_247 [Mycobacteriuin leprae] 226 8.8013-23 ___________>pirJS72905IS7290S____ EF085-2 gnIPlDjeI 19229 SenX3 [Mycobacterium bovis BCG) 222 3. 1 OM2 EF085-2 gnlIPlDje25 5152 unknown [Mycobacterium tuberculosis] >gn~jPIDje32 1546 222 3. 1 OE-22 EF085-2 giJ177 8 48 5 PcoS homnolog [Escberichia coli] >gi1178 6 78 3 (AE000162) R480; 1ll 3.8013-16 his EF085-2 gil 149296 phosphate regulatory protein phoR (gtg start codon) [Klebsiella 110 1.40E-14 EF085-2 giJ5811 88 plioR gene product (AA 143 1) [Escherichia coli] >gijI1 6 5 7 59 6 103 5.30E-14 EF085-2 giJ143331 alkaline phosphatase regulatory protein [Bacillus subtilisl 118 4.90E- 13 EF085-2. gil537239 alternate gene name phoM; CG Site No. 395 [Escherichia coli] 126 9.50E-13 EF085-2 giJ147 25 1 phoM [Escherichia coli] >giJ809670 phoM protein (I is 3rd 126 9.5013-13 in_ EF085-2 giJ2 182992 histidine kinase [Lactococcus lactis cremoris] '109 ,5.9013-12 S*S 559 5 5
S
9 9
S
9 9 S S 9* S S 9* S S S S S S S S S S S Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF086-2 giJ 4 37 7
O
6 alternative truncated translation product from E.coli 221 3.OOE-54 [Streptococcus EF086-2 giI437 7 05 hyaluronidase [Streptococcus pneumoniae] 221 1.60E-53 EF086-2 gi1595 847 hyaluronate lyase [Streptococcus agalactiae] 203 3.30E-44 1371A55 137 EF086-2 giJ70540 6 hyaluronate lyase [Staphylococcus aureusl 191 3.40E-42 EF086-2 giJ5 62 08 6 hyaluronidase [Propionibacterium acnes] 198 6.OOE-27 EF087-2 gi143 7706 alternative truncated translation product from E.coli 221 3.OOE-54 EF087-2 gil437705 hyaluronidase [Streptococcus pneumoniae] 221 1.60E-53 EF087-2 giJ59 584 7 hyaluronate lyase [Streptococcus agalactiae] 203 3.30E-44 1371A55 EF087-21 EF087-2 EF088-2 EF088-2 EF088-2 EF088-2 EF088-2~ EF091-2 EF091-2 giJ705 406 gil5620 86 gi143 7706 giJ4377 0 giJ595 847 giJ70540 6 giJ56 08 6 i155 60 1 6 hyaluronate lyase [Staphylococcus aureusj hyaluronidase [Propionibacterium acnes] 198 6.OOE-27 alternative truncated translation product from E-coli 221 3.OOE-54 [Streptococcus hyaluronate lyase [Streptococcus agalactiae] 203 5.30E-44 SpirIA55 137IA55 137 hyaluronate lyase [Staphylococcus aureus] 191 3.40E-42 1hyaluronidase (Propionibacteriuni acnes) 198 6.OOE-27 similar to plant water stespoen;OF Bclu utlj 189 2.40E- 17 I giJ235 3 3 j-~653 e-LEA [Caenorhabditis elegans] .1 9*9 9..
*99 9 0 9 9 9 99 9 9 9 9 99 99 9 99 9 9 9 9 9 9 99999 99 99 9 9 9 9999 99* .9 9. 9..
09 99 9 9 9 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF091-2 nIlIDle353216 seed maturation protein homolog [Arabidopsis thalianal 146 13.60E-1 II EFO91-2 gil 161171 late embryogenesis abundant protein [Picea glauca] 132 5.70E-1 I1 EF091-2 pirlS04909 1S049 embryonic protein DC8 (clone 8/10) carrot 127 6.5013-11I EF092-2 giJ268989 8 (AE000792) PTS system, cellobiose-specific IIB3 component 145 4.OOE-27 (celA)_ EF092-2 gn1IPDIdl0204 B. subtilis, cellobiose pho spho trans ferase system, celA; 116 1 .40E-26 EF096-2 giJ147329 transport protein [Escherichia coli] >gn1lPIDId 10 15409 532 2. 1OE-91 EF096-2 giJ1573 4 75 spermidine/putrescine-binding penipiasmic protein precursor 527 1.I1OE-79 EF096-2 giJ 1574803 spermidine/putrescine-binding periplasmic protein precursor 468 1.60E-75 EF096-2 gill 142681 Lpp38 [Pasteurella haemolytica) 446 4.40E' 72 EF096-2 gnlPID~d10152 Putrescine transport protein PotF [Escherichia coli] 216 1.50E-54 E-F096-2 gi11 4 73 34 periplasmic putrescine binding protein [Escherichia col. 216 2. 1OE-53 EF096-2 giJ26885 6 5 (AEOO1 165) spermidine/putrescine ABC transporter, 240 2.OOE-48 EF096-2 gi1188 1 733 PotD [Salmonella typhimurium] 253 !-70E-28 EF096-2 gn~fPID~d10192 spermidine/putrescine-binding penipiasmic protein 243 4.20E-26 6 EF096-2 gnljPlDje152543 potF gene product [Clostridium perfringens] 204 3.30E-21 E072 gii622991 mamiitol transport protein [Bacillus stearothermophilus] 547 4.90E-93 E972 giJ42 034 mannitol permease [Escherichia coli] >gii4 66 737 mannitol- 535 5.50E-85 9 9 999 9 9 9 *9 9 9* p p p a..
9 9 9 9 a 9 9 9 99 9 9 9 9 9 *9 999 99 9 9 9999999 9. 9. a. *9 9 9 9 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF097-2 giJ633650 enzyme 11(mannitol) [Staphylococcus carnosusl 516 2. 10E-82 193 jS22385 EF097-2 giJ882 462 protein-N(pi)-phosphohistidile-sugar phosphotransf'erase 509 3 .OOE-76 [Escherichia EF097-2 giJ3 12763 protein-N(pi)-pho sphohisti dine- ugar phosphotransferase 357 7.50E-70 EF097-2 gn1lPIDId096 homologue of mannitol transport protein of B. 492 3. 1 OE-62 6 EF097-2 gn]IPID~dl0079 mannitol-specific phophotransferase enzyme 11 [Bacillus 484 5.20E-61 2 EF097-2 gi1l 673 8 55 (AEOOOO2O) Mycoplasma pneumoniae, PTS system mannitol- 232 3.50E)59 specific EF097-2 gnllPIDId 10065 phosphotransferase enzymell, mannitol-specific [Mycoplasma 158 8.20E- 18 EF097-2 pirS7775715 7 77 phosphotransferase system enzyme 11 (EC 2.7.1.69), 103 2.OOE- 13 EF 10- 2 giJ2058 54 6 ComYC [Streptococcus gordonii] 193 7.30E-27 EFI100-2 gqZ20 5854 6 ComYC [Streptococcus gordoniii 193 -7.30E-27 EF 100-2 g11142 708 comnG3 gene product [Bacillus subtilis] >gnIlPIDjel 185739 150 2.90E-22 comOC EF 100-2 giJ1427 08 comG3 gene product [Bacillus subtilis] >gnlPlDIel 185739 150 2.90E-22 comGC EF 100-2 gi1148 4 3 7 secretory component [Erwinia chrysantheini] 134 4.40E- ___________>pirIE4702I1E4702 I pectic ft..
ft ft ft ,ftft ft ft ft q ft *4 ft 4 ft ft ft ft ft.
ft. ft ft ft ft ft. ft V ftftft ft.. ft.. ft ft ft ft ft ft. ft. ft ft ft ft ft. *ft ftftft' ft ft. ft. ft. ft ft. ft. ft ft ft Table 2. Closest matching sequences betwveen the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EFI100-2 gil 148 4 3 7 secretory component [Erwinia chrysanthemi] 134 4.40E- >pirIE4702 I IE4702 1 pectic____ EFI100-2 giJ60 62 62 OR.F_o 145 [Escherichia coli] >gif6937 06 HopG [Escherichia 136 9.I101- 13 EF 100-2 giJ6062 6 2 ORF o145 [Escherichia.coli] >gil6937 06 HopG [Escherichia 136 9.1IOE-113 coli] EF 100-2 gi1388 28 ExeG gene product [Aeromonas hydrophila] 132 3.50E-12 101149905 protein____ EFIOO-2 giJ38828 ExeG gene product [Aeromonas hydrophila] 132 3.50E-12 >pirIS2291011 4 9 9 0 5 protein____ EFI100-2 gnlIPIDleI 17259 etpG [Escherichia coli] 131 5.10E-12 EF 100-2 gnlIlDIel 17259 etpG [Escherichia coli] 131 5.101342 EFlQO-2 gil 42 18 9 outG gene product [Erwinia carotovora] >pirIS32 86 1 1S 3 286 1 130 9.90E- 12 outG____ EFIOO-2 gi1421 89 outG gene product [Erwinia carotovorai >PirIS 3 2 8 6
IS
3 2 8 6 1 130 9.90E- 12 outG____ EF 100-2 giJ609 62 8 putative [Vibrio cholerae] 128 1.60E-lI1 EF 100-2 giJ609 6 2 8 putative [Vibrio cholerael 128 1. 60E- I EFIOI-2 gnIjPlDIdlO257 bacG [Enterococcus faecalis] 106 3.60E- 17 [EF 101-2jgnIPIDIe32 1943 hypothetical protein [Enterococcus faecalis] >gnIIPlDje32 1943 105 1 .80E- 16 n :11 11-4 _1 EF 10L-2 EFi 110-2 gnljP1Dje1 18502 giJ4 333 8 similar to hypothetical proteins 11om 2. su"Iiws [L c us Staphylococcal serine proteinase homologue [Enterococcus faecalisl i462 2.30E- 195 1462 2.3013- 195 V V
V
V V 9 V 4 V
V
V
V..
V V V V 4** V V. *V V V V V *qV V V V Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in Genl~ank and Derwent databases.
EFi 10-2 gnIlPIDIdl100 10 glutamic acid specif ic protease prepropeptide [Staphylococcus 106 3.70E- 14 8 EFI 10-2 gi146687 _preproenzyme (AA -68 to 268) [Staphylococcus aureusl 106 6.70E- 14 EFI 11-2 giJ6060l 8 ORF_o783 [Escherichia coli] >gill 7 8 9 4 62 (AE000390) 477 8.10E-80 hypothetical 88.3 EFl121.2 gij2 6 26 826 YfkN [Bacillus subtilis] >gnlIPlDjel 182774 similar to 143 1.30E-96 EF 121-2 gi123 13187 (AE000532) 2',3'-cyclic-nucleotide 2-phosphodi sterase (cpdB) 413 2 .60E-82 EFl121-2 gil484 53 5'-nucleotidase [Vibfio parahaemolyticus] >gnlPIDId 100 1218 279 8.50E-47 EF121-2 giJ757842 UDP-sugar hydrolase [Escherichia coli] 239 1.60E-44 EFI21-2 gi1177 3 162 UDP-sugar hydrolase precursor [Escherichia coli] >gil 7 86 6 87 239 1.60E-44 EFl121-2 giJ 4 795O precursor polypeptide (AA -25 to 525) [Salmonella 229 2. 1OE-41 __________typhimurium] EF121-2 gi17 4 79 1 3 2',3'-cyclic-nucleotide 2'-phosphodiester-ase [Yersmnia, 115 4.70E -36 EF 121-2 gi16277 2 5'-nucleotidase [Discopyge ominata] >pirl S 19564 IS 195 64 137 5 .80E-35 nucleotidase EFl121-2 gi1157 3 5 73 2',3'-cyclic-nucleotide 2'-phospbodiesterase (cpdB) 114 8.90E-34 Haemophilus EFl121-21 gij537 0 54 2',3'-cyclic-nucleotide 2-phosphodiesterase [Eschenichia colid 110 IEIOE-31 EF 121 EF 121 EF 121 JEF 121 -2 bbs1135915 (12nu letds=gy rypos aidlioio-n e rti I. Ivr--Zy I iL -21 gi1l 7 374 43 15'-nuclcotidase [Boophilus microplus] 104 1.60E-28.
2 gi1202551 j5-nucleotidase precursor (EC 3.1.3.5) [Rattus norvegicus] 138 J6.1IOE-28 2 gi13 4 9 783 jecto-5'-nucleotidase [Mus musculus] >piriC2001 IJC2001 136 1. 1OE-27 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EFI121-2 gi123897 5'-nucleotidase [Homosapiens] >pirISlI 10321S1 11032 133 1.60E-27 nucleotidase EF122-2 giJ262 6 826 YfkN [Bacillus subtilis] >gnilPIDjel182774 simiflar to 143 1.30E-96 EF122-2 gil2313 187 (AE000532) 2',3'-cyclic-nucleotide 2'-phosphodiesterase (cpdB) 413 2.60E-82 EF122-2 gil 4 845 3 5'-nucleotidase [Vibrio parahaernolyticus] >gnlPIDIdlO01218 279 8.50E-47 EF 122-2 gi1757 842 UDP-sugar hydrolase [Escherichia coli] 239 1.60E-44 EF 122-2 giI177 3 162 UDP-sugar hydrolase precursor [Escherichida coli] >gil 17 866 87 239 1 .60E-44 EF 122-2 giJ47950 precursor polypeptide (AA -25 to 525) [Salmonella 229 2. 1OE-41 _______typhimurium] EF 122-2 gi174791 3 2',3'-cyclic -nucleo tide 2'-phosphodiesterase [Yersinia 115 4.70E-36 EF 122-2 giJ6277 2 5'-nucleotidase [Discopyge ommata] >pirlS 195641519564 137 5 .80E-35 nucleotidase EF 122-2 giJI57 3
S
7 3 2',3 -cycic-nuc eo tide 2'-phosphodiesterase (cpdB) 114 8.90E-34 EF 122-2 giJ53705 4 2',3'-cyclic-nucleotide 2'-phosphodiesterase [Escherichia coli] 110 1. 1 OE-3 1 EF 122-2 bbsj 135915 5-nucl eotidase=glycosylphosphatidy ifosi tol-ancho red protein 128 7.70E-29 EC_ EF122-2j gi11737 443 5'-nucleotidase [Boophilus microplus] 104 60E-28 EF 122-2 giJ20 2 551 5'-nucleotidase precursor (EC 3.1.3.5) [Rattus norvegicus] 138 6.1 OE-28 EF122-2 giJ3 497 83 ecto-5'-nucleotidase [Mus musculus] >pirJC2001 IJC2001 136 1LIOE-27 EF 122-2 gi123897 5'-nucleotidase [Homo sapiens] >pirjS I 10321S 11032 133 1.60E-27 Inucleotidase (EC 1F2-2~ 334 protein [Enterococcus faeciuml >pirlS055421S05542 630 9.4013-79 400.
90 9 9 9 9q 9 9 9 9. 9 9 .9.
:0 to, 9 t o 9 0 9 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBanlk and Derwent databases.
____________hypothetical EF 129-2 guS 12521 usp 45 gene product [Lactococcus lactis) >pirlJNOO971JNO097 374 1.30E-42 secreted EF 129-2 giJ149525 secreted protein [Lactococcus lactis] 371 3.60E-42 EF129-2 gnIPIDle3 13022 hypothetical protein [Bacillus subtilis] >gnlIPIDIl 186168 317 2.30E-33 EF130-2 gil48 8339 apa-amnylase [unidentified cloning vector] 621 6-70E-81 EF130-2 giJ488336 ORF [unidenti fled cloning vector] 242 8.OOE-27.
EF130-2 bbsll 12518 alpha-amylase {N-terminal region) [Artificial sequence, Peptide 237 4.80E-26 EF 130-2 gnllPIDie28914 4 ,yrpE [Bacillus subtilis] >gnljP1Diel 184540 ywpE [Bacillus 129 5.4013-11 EF131-2 gnlPIDle1185 2 8, penicillin-bindi-ng protein [Bacillus subtilis] 277 7.40E-43 EF 131-2 gii488330 alpha-amylase [unidenti fled cloning vector] 280 1.30E-31 EF 131-2 giJ509249 No definition line found [Lactobacillus plantarum] 274 1. 1 Ol$. EF 131-2 gnlPIDjdlO249 (AB009635) Fmt [Staphylococcus aureus] 170 5.60E-20 EF 131-2 guS 15050 DD-peptidase precursor [Streptomyces'lividans] 131 2.30E- 14 >pirlS48220IS48220_________ EFI131-2 gi11.534 4 8 serine DD-peptidase [Streptomyces lividans] 131 1.20E- 12 EF132-2 giJI53826 adhesin B [Streptococcus sanguis] >pirlA43583lA43583 adhesin 1257 2-.30E- 166 B EF 132-2 gil 1184932 ScbA [Streptococcus crista] 1248 3.70E-165 EF132'-2 gi31 0633 adhesin Streptococcus gordonii) 1247 5.I1OE- 165 EF132-2 gij393 2 69 adhesion protein [Streptococcus pneumoniae] 1204 3,40E- 163 EF 132-2, 1i575030 surface adhesin A precursor [Streptococcus pneumoniae] 1220 2.40E- 161 .6 '0 S
SS
Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF 132-2 giJlS 3834 adhesin specific for salivary pellicle of dental surfaces 120 4.80E-159 EF132-2 gill 117994 surface antigen A variant precursor [Streptococcus pneum niae] 1191 2.OOE-157 EF132-2 giJ49301 7 endocarditis specific antigen [Enterococcus faecalis] 931 3.70E-122 EF132-2,gnlPIDle255529 lipoprotein [Staphylococcus epidermidis] 453 3.20E-92 EF 132-2 gi112 4 54 64 YfeA [Yersinia pestis] >gi1 124 546 4 YfeA [Yersinia pestis] 364 3.60E-64 EF 132-2 gi1157 3330 adhesin B precursor (firnA) [Haemophiilus influenzae] 349 3.50E-63 EF 132-2 giJ755 07 5 periplasmic-bindinfg protein [Synechocystis sp.] 326 6.80E-62 I>gnllPIDld 10 18652 Mn EF 132-2 gnilPIle 118595 similar to ABC transporter (membrane protein) [Bacillus 174 3.10E-32 EF132-2 gi177 7933 TroA [Treponema pallidum]) 171 3.40E-32.
EF 132-2 gi1790 5 46 Tromp I [Treponema pallidum] 171 5.10E-32 Qey Derivent Derwent Gene Description BLAST BLAST IScore P-Value EF003-2 EF003-2 EF006-2 EF006-2 EF008-2 EF008-2 EF008-2 Access. INo.
W20909 W20 166 W20909 W2( 6 H. pylori outer membrane protein 14 ge 10705orfS.
Helicobacter pylori outer membrae potein, 1622 5006.aa.
H. pylori outer membrane protein l4gelO07O5orf5.
Hicobacter pylori outer membrane protein, 16225006.aa.
268 241 283 266 4J
I
R37495 W26367 R79--22 W22134 W20909 Pneumococcal fimbrial protein A. 967, Staphylococcus aureus saliva binding protein. 467 ~7 ROM precursor TROMPI. 181 Treponema pallidurn rare outer membrane protein -(TROM P-i1). 181 i H. pylori outer membrane protein l4gelO07O5orfS. 1319 1 .20E-39 .OOE-27 .20E-48 .10E-30 .2013-127 .50E-100 ~.OOE-36 ~.OOE-36 I .4E-53 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in Gen~ank and Derwent databases.
EF009-2 W20 166 Helicobacter pylori outer membrane protein, 16225006.aa. 278 2.50E-32 EFO 12-2 R48035 Hyaluronic acid synthase of Streptococcus eguisimilis. 227 3.20E-69 EF014-2 W 14070 S.thermophilus exopolysaccharide biosynthesis protein EpsR. 103 5.90E-19 EFO 14-2 W22 169 S.thermophilus exopolysaccharide synthesis operon epsA gene 103 7.30E-18 product.____ EFO 16-2 W 15799 Adherence factor 104R of Lactobacillus fermentum. 157 9.60E-22 EFO 16-2 W 15793 Adherence factor consensus sequence. 103 L.OOE-l 1 EF017-2 R48035 H-yaluronic acid synthase of Streptococcus eguisimilis. 241 8.90E-71 EF021-2 R31013 P39-alpha. 141 1.60E- 19 EF021-2 R33280 P39-beta. 134 7.OOE- 14 EF022-2 R48035 Hyaluronic acid synthase of Streptococcus eguisimilis. 324 2.2 0E -6 S EF03-2 R48035 Hyaluronic acid synthase of Streptococcus eguisimilis._ 155 9.90E 33 EF023-2 R70152 Streptococcus pneumoniae strain SPRU98 PIpA. 125 5.90E- 17 EF027-2 R48035 Hyaluronic acid synthase of Streptococcus equisimilis. 233 2.20E-34 EF028-2 W17830 Thermophilic alkaline phosphatase. 202 7.70E-59 EF028-2 Wi 1568 Excoli alkaline phosphatase mutant D I53FVQ329A. 182 7.90E-56 EF028-2 W 1570 E.coli alkaline phosphatase mutant D1I53H1K328H/Q329A. 182 7.90E-56 EF028-2 W26300 E.coli alkaline phosphatase mutant 182 I. 1OE-55 D I 3H/K3281Q329AJD330H.
I__
EF028-2 WI 11565 E.coli alkaline phosphatase mutant D IS3H/K328H/D330A. 182 3.1OE-55 EF028-2 WI 1557 E.coli alkaline phosphatase mutant D153H1D330N. 182 4.30E-55 EF028-2 W 11561 E.coli alkaline phosphatase mutant D153H/D330A. 182 4.30E-55 EF028-21 WI 1555 rs.coii alkaline phosphatase mutant D I53H1K328F1/D330N. 182 4.70E-55
A
Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF028-2 Wi 1566 Ecoli alkaline phosphatase mutant D1I531-11K3281D330L. 182 1.20E-54 EF028-2 WI 1569 E.coli alkaline phosphatase mutant K3281-/Q329A. 180 1.70E-54 EF028-2 W 11562 E.coli alkaline phosphatase mutant DlI 53H1D330L. 182 1 .7013-54 EF028-21 R26980 Fv(FRP5)-phoA recombinant antibody. 174 1 .90E--54 EF028-2 W 1567 E.coli alkaline phosphatase mutant Q329A. 179 2.30E-54 EF028-2 Wi 1558 E.coli alkaline phosphatase mutant K328H1D330N. 176 6.40E-54 EF028-2 WI 11563 E.coli alkaline phosphatase mutant K328H1D330A. 176 6.40E-54 EF029-2 R10044 Plasmid pO W3 60 encoded Human Growth Hormone (HGH) 320 3.50E-40 nuclease A EF029-2 R10041 Plasmid pOW350 nuclease A product. 320 4.30E-40 EF029-2 R73997 Staphylococcus aureus (Foggi) nuclease signal and mature 320 5.60E-40 _____sequences. EF029-2 R10043 Piasmid pOW36O encoding Human Growth Hormnone (HGH) 320 2.90E-38 nuclease EF030-2 R48035 H-yaluronic acid synthase of Streptococcus equisimilis. 277 6.l1OE-47 EF040-2 R59077 2-5A-dependent RNA-ase. 105 1.9013-18 EF040-2 W12703 Mouse 2-5A-dcpendent RNase. 105 1.90E- 18 EF040-2 R82661 Partial murine 2-5A-dependent RNase. 105 1 .9013-18 EF041-2 R48035 Hyaluronic acid synthase of Streptococcus equisimilis. 225 6.3013-26 EF054-2 R26042 P. yoelii SSP2 antigen. 286 8.0011-34 EF054-2 R85782 Group B Streptococcal mutant beta antigen without IgA binding 232 3.30E-24 Idomain. EF054-21 R85781 IGroup B Streptococcal wild-type beta antigen. 232 15.2 0 E-2 4 4** *49 44 4.
4 4 9 4 4 4 *4 4 0% 0 4*4 44. 4 4 4 4 444444 44 44 4 4 4 4444444 4444444 44 44 4 4 4 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF054-2 P91941 Sequence of preprospasmolysin. 204 3.I1OE- 19 EF054-2 W32519 Collagen-like polypeptide SEQ ID NO:2._ 180 7.50E- 18 EF054-2 Wl12324 Silver halide emulsion protein monomeric repeat unit 180 7.50E-18 EF054-2 W32522 Collagen-like polypeptide SEQ ID NO:5. 192 .I.60E-17 EF054-21 W12327 Silver halide emulsion protein monomeric repeat unit 192 1.60E- 17 EF054-2 W32520 Collagen-like polypeptide SEQ ID NO:3. 189 2.4013-17 EF054-2 W32532 Collagen-like polypeptide SEQ ID NO: 15. 189 2.40E-17 EF054-2 W 12325 Silver halide emulsion protein monomeric repeat unit 189 2.40E- 17 EF054-2 W12337 Silver halide emulsion protein monomeric repeat unit 15. 189 2.40E- 17 EF054-2 WI 12341 Silver halide emulsion FLAG(RTM)-tagged protein 189 2.60E- 17 EF054-2 W02098 S. mutans antigen 1/11. 161 5.40E., EF054-2 W02096 S. mutans antigen 1/11 fragment (aaS03-1 1114). 161 1.90E- 13 EF059-2 R26042 P. yoelii S5P2 antigen. 344 1 .90E-39 EF059-2 R85782 Group B Streptococcal mutant beta antigen without IgA binding 232 1.I1OE-26 EF059-2 R85781I Group B Streptococcal wild-type beta antigen. 232 1 .70E-26 EF059-2 P91941 Sequence of preprospasmolysin. 200 .1.50E-18 EF059-2 P60570 Sequence of the Falciparuin Interspersed Repeat Antigen 186 4.60E- 18 EF059-2 W02096 S. mutans antigen 1/11 fragment (aa803-1 114). 167 8.20E-16_ EF059-2 W02098 S. mutans antigen 1/11. -167 4.90E-l15 EF059-2 R79625 Endocarditis specific antigen region. 147 4.40E- 12 EF059-2 R26049 MSF precursor. 143 1.30E-lIl EF059-21 R28 150 ISugar beet chitinase 1. 148 et i a a a to. a *0 a Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF059-2 R26842 Protease from S. Aureus ATCC 12600. 147 2. 1OE-l II EF059-2 R79643 Inumunodominant antigen of Streptococcus sobrinus. 151 2. IOE-1 I EF059-2 W07539 Collagen like protein (CLP). 146 3.00E-1I1 EFO6I-2, R26042 P. ypelii SSP2 antigen. 241 1.70E-25 EFO61:-2 P60570 Sequence of the Falciparum Interspersed Repeat Antigen 199 1.6013-18 EF061-2 R85782 Group B Streptococcal mutant beta antigen without IgA binding 153 2.40E- 14 domain.
EF061-2 R85781 Group B Streptococcal wild-type beta antigen. 153 3.60E- 14 EF06 1-2 P91941 Sequence of preprospasmolysin. 163 9.70E- 14 EF061-2 P83194 Sequence of a bioadhesive precursor protein encoded by cDNA 156 7.90E- 13 ,clone EF061-2 R28 150 Sugar beet chitinase 1. 156 9. 1 OF-13 EF061-2 W02096 S. mutans antigen 1/11 fragment (aa803-1 114). 148 1.20t- 12 EF061-2 P82971 Bioadhesive precursor protein from cDNA 52. 148 9.70E- 12 EF061-2 W02098 S. mutans antigen 1/11. 148 1.50E-1l EF062-2 W02098 S. mutans antigen 1/11. 107 1.20E-36 EF062-2 R79643 Inunodominant antigen of Streptococcus sobrinus. 132 3.OOE-36 EF063-2 W02098 S. mutans antigen 1/11. 107 1.20E-36 EF063-2 R79643 Immunodominant antigen of Streptococcus sobrinus. 132 3.0012-36 EF064-2 W02098 S. mutans antigen 1/11. 107 1.20E-36 EF064-2 R79643 Immunodominant antigen of Streptococcus sobrinus.- 132 3.OOE-36 EF071-21 R85294 Phage R1-t LytR lysin. 127 3.70E-38_ EFO7I-21 R91515 lListeria phage lysin PLY51 1. 273 4.70E-37 0 £40 000 06 04 000 0 £00 0 00 0 0 0 0 0 00 0 0 0 *0 Ve*40 00600 00 00 0 0 0 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF075-2 W 14070 S.thermnophilus exopolysaccharide biosynthesis, protein EpsR. 239 4.20E-36 EF075-2 W22 169 'S.thermophilus exopolysaccharide synthesis operon epsA gene 239 4.OOE-34 ___product._ EF077-2_ R97280 Helicobacter-specific ATPase 439. 258 4.10E-74 EF077-2 R48036 Mycobacterium BCG immunogen. 192 2.20E-67 EF077-2 W06712 Helicobadter-specific ATPase 948 (ORF-4). 220 2.50E-67 EF077-2 R70419 Rat homologue of human Wilson disease gene ATP7I3. 186 9.80E-54 EF077-2_ R72343 Wilson disease protein ATP7B. 176 6.70E-40 EF077-2 R06376 Product of the ssclI gene. 166 3. IOE-28- EF077-2 R75396 Flea sodium pump alpha subunit. 146 2.40E-25 EF077-2 W20891 H. pylori transporter protein, l4ce22l9orfl. 15,6 8.60E- 14 EF078-2 R56667 Bacteroides fragLs RprX regulatory response protein. 148 9.308-18 EF078-2 R7 4630 Tomato TGETRI ethylene response protein. 130 7.80E- 13 EF078-2 R69849 Ethylene response (ETR) gene product. 128 1 .70E-1 1 EF078-2 R69850 Ethylene response (ETR)_mutant protein etrl 128 1 .70E- 11I EF078-2 R69851 Ethylene response (ETR) mutant protein etrl-2. 128 1. 70E-1 I1 EF078-2 R69852 Ethylene response (ETR) mutant protein etrl-3. 128 1.70E-1lI EF078-2 R69853 Ethylene response (ETR) mutant protein etrl-4. 128 1.70E-1 I EF078-2 R24296 Regulatory protein VanS involved in glycopeptide resistance. 142 2.70 E-1 1 EFO8I1-2 R27253 Penicillin binding protein PBP2A-epi. 101 4.70E- 16 EF081-2 R.27256 Penicillin binding protein PBP2A-27R. 101 6.OOE- EF081-2 R27257 Penicillin binding protein derivative N1. '101 6.20E- EF081-21 R27258 Penicillin binding protein derivative 1101 16.20E-15 0 0@*
OOS
0O 0O 0*e 0 0 0 0O 0 S 0 0 0 0 *0 0 0 0 S S 000 OSO See S S 0 5@eO@e 0@ Ge 0606050 SeeSSee S. Se 0 5 0 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
EF081-2 R27259 Penicillin binding protein derivative 101 6.2013-15 EF081-2 R27260 Penicillin binding protein derivative 101 6.20E- EF081-2 R27261 Penicillin binding protein derivative #15. 101 6.20E-lI5_ EBF081-2 R27263 Penicillin binding protein derivative V7. 101 6.20E- EF081-2 R27264 Penicillin binding protein derivative 101 6.20E- EF08 1-2 R27262 Penicillin binding protein derivative 101 6. 50E- EF08 1-2 R30845 Sequence encoded by the mec A gene.__ 101 6.90E- EF081-2 R27255 Penicillin binding protein PBP2A-27R. 101 6.90E- EF081-2 R31216 Penicillin binding protein PBP2A-27R. 101 7.OOE-15 EFI 110-2 R91042 V8 mature protease (aal-213). 106 6.60E- 16 EF 110-2 R91043 V8 mature protease (aa 1-2 14). 106 7.20,- 16 EFI 110-2 R91044 V8 mature protease (aa 1-215). 106 7.80E- 16 EF 10-2 R26842 Protease from S. Aureus ATCC12600. 106 6.70E- EF 110-2 R29644 Protease from S. Aureus. 106 1.20E- 14 EFI110-2 W2221 8 Protein encoded by pV8PPT(-) construct. 106 7.6013-14 EF 110-2 R91033 Beta-galactosidase-V8 protease fusion protein. 106 7.6013-14 EF 110-2 R9 1034 Beta-galactosidase-V8 protease fusion protein. 106 1 .70E- 13 EFi 10-2 W22219 Protein encoded by pV8D construct. 106 7.60E- 13 EF 110-2 R91035 Recombinant V8 protease V8D fusion protein. 106 7.60E- 13 EF 10-2 W22220 Protein encoded by pV8F construct. 106 7.90E- 13 EF 129-2. R14530 Usp45 protein. 374 2.40E-43 EF 129-21 R14 150 MSP encoded by pUCRS (DSM 5803). .372 4.70E-43 EF 131-21 R37495 Pneumococcal fimbrial protein A. 1185 6.80E-163 9 9 Table 2. Closest matching sequences between the polypeptides of the present invention and sequences in GenBank and Derwent databases.
E 3 121 E131-21 W26367 R79 722 W22 134 Staphylococcus aureus saliva binding protein. 41I8 ROM precursor TROMP 1. 171 Treponema, pallidurn rare outer membrane protein (TROMP-i 171 9.0013-31 288 TABLE 3. Conservative Amino Acid Substitutions.
Aromatic Hydrophobic Polar Basic Acidic Small Phenylalamne Tryptophan Tyrosine Leucine.
Isoleucine Valine Glutamine Asparagine Arginiie.
Lysine Histidine Aspartic Acid Glutainic Acid Alanine Serine Threonine Methionine Glycine a a too.
o.
289 T[able 4. Residues Comipri sing, Antigenic Epitope-Bearing Portion.
EFOO 1-2 from about Asp- 15 0 to about Lys- 15 2, from about Ser-256 to about Tyr- 259, from about Lys-360 to about Lys-363, from about Asn-406 to about EF002-2 from about Asp-8O to about Asp-83, from about Asp-281 to about Gly- 283.
EF003-2 from about Asn-263 to about GI -266.
EF004-2 from about Asn-23 to about Asn-26, from about Lys-83 to about Ser-87, from about Tyr- 154 to aboiteAsp -159.
EF005-2 from about Lys-249 to about Glu-252.
EF006-2 from about Gly-23 to about Asp-28.
EF008-2 from about Thr-92 to about Gly-94, from about Pro-161 to about Asp- 165, from about Gly-287 to about Thr-289.
F0 10-2 from about Pro- 129 to about Asn- 13 1.
F0 12-2 from about Asp-77 to about Asp-79, from about Asp-94 to about Lys-98, from about Asp-256 to about Thr-258, from about Glu-461 to about Asn- 468.
EFO 13-2 from about Thr-30 to about Asp-32, from about Glu-73 to about from about Gln-1 64 to about Asn- 166, from about Lys- 193 to about Gly- 195.
EFOI14-2 from about Ser-203 to about Asp-206, from about Gln-314 to about Gly- 316 F0 15-2 from about Pro-66 to about Gly-69.
EF016-2 from about Lys-236 to about Asn-239.
EFO 17-2 from about Ser-90 to about Gly-93, from about Thr- 197 to about Lys- 199, from about Lys-230 to about Asn-233, from about Ser-428 to about 1.
EF018-2 from about Lys- 159 to about Tyr- 16 1, from about Asn- 165 to about Ser- 167, from about Asn-250 to about Arg-256, from about Asn-392 to about from about Lys-416 to about Tyr-418, from about Asn428 to 290 Table 4. Residues Comprising Antigenic Epitope-Bearing Portion.
Arg-430.
EFO 19-2 from about Arg-209 to about Ser-21 1, from about Lys-287 to about Ser- 290.
EF020-2 from about Lys-57 to about Asn-62.
EF02 1-2 from about Ser-33 to about Gly-35, from about Glu-77 to about Gly-8 I, from about Asp-139 to about Lys-141, from about Glu-255 to about Ser- 258, from about Gln-271 to about Tyr-277.
EF023-2 from about Lys-232 to about Asp-234, from about Arg-304 to about Gly- 306, from about Thr-453 to about Arg-456, from about Ser-478 to about EF025-2 from about Arg- 183 to about Asp-] EF026-2 from about Ser-25 to about Asp-30, from about Asp-90 to about Asp-94, from about Gln- 107 to about Asn- I EF027-2 from about Gln-72 to about Lys-74, from about Lys-229 to about Asp- 23 1.
EF028-2 from about Asp-l 86 to about Gln-1 88.
EF029-2 from about Asp-i 18 to about Lys-1 22, from about Asp-1 24 to about Tyr- 12 6.
EF031-2 from about Glu-30 to about Gly-33.
EF034-2 from about Glu-25 to about Gly-27, from about Glu-75 to about Thr-77.
EF36-2 from about Gln- 177 to about Ser- 179.
EF037-2 from about Ser-25 to about Asp-30, from about Asp-90 to about Asp-94, about Gin-]107 to about Asn- I EF038-2 from about Asn-77 to about Lys-79, from about Tyr-88 to about Asn-92.
EF040-2 from about Lys-167 to about Gly-172, from about Lys-240 to about 291 Table 4. Residues Comprising Antigenic Epitope-Bearing Pordon.
EF044-2 from about Arg- 192 to about Gly- 194, from about Asn-200 to about.Asn- 203.
EF045-2 from about Asp-159 to about Asn-161, from about His-I 72 to about Gly- 174, from about Tyr-261 to about Gly-264, from about Lys-305 to about Glu-308.
EF046-2 from about Ser-I18 to about Gly-23, from about Gln-41 to about Ser-47, about Thr-76 to about Asp-78.
EF047-2 from about Asn-28 to about Asp-30, from about Asp-273 to about Asn- 277.
EF048-2 from about Asp-138 to about Lys-141, from about Asp-152 to about Gly-I 54.
1-2 from about Asp-73 to about Gly-76.
EF053-2 from about Ser-79 to about Gly-82.
EF055-2 from about Asp-26 to about Gly-28, from about Gln-67 to about Asp-69, from about Arg-71 to about Gly-74, from about Arg-87 to about Gly-89.
EF056-2 from about Arg-71 to about Gly-74, from about Arg-87 to about Gly-89.
EF058-2 from about Lys-129 to about Gly-133, from, about Gln-571 to about Tyrfrom about Pro-586 to about Gly-59 1.
EF065-2 from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 352, from about Lys-415 to about Asn-4 18, from about Arg-446 to about Asp-448, from about Asn-489 to about Lys-49 1, from about Ser-S 16 to about Asp-S 18, from about Glu-639 to about Lys-642.
EF066-2 from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 352, from about Lys-415 to about Asn-4 18, from about Arg-44 6 to about Asp-448, from about Asn-489 to about Lys-491, from about Ser-516 to Asp-SI 8, from about Glu-639 to about Lys-642.
EF067-2 from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 352, from about Lys-415 to about Asn-4 18, fronm about Arg-446 to about Asp-448, from about Asn-489 to about Lys-49 I, from about Ser-5 16 to Asp-I 18, from about Gu-63 9 to about Lys-642.
9 Table 4. Residues Comprising Antigenic Epitope-Bearing Portion.
EF073-2 from about Met-98 to about Arg-100, from about Arg-l 10 to about Asp- EF074-2 from about Scr-53 to about Tyr-59, from about Ser-86 to about Gly-88, from about Pro-97 to about Gin- 100, from about Gln-230 to about Gly- 232.
EF076-2 from about Asn-38 to about Tyr-40, from about Asp-48 to about Asn-53, about Lys-79 to about Gly-8 1.
EF077-2 from about Arg-41 I to about Gly-4 13.
EF078-2 from about. Thr-294 to about Gly-296, from about Asp-366 to about Ginfrom about Glu-524 to about Gly-526.
EF080-2 from about Glu-1 64 to about Gly-1 66, from about Ser-206 to about Tyr- 208, from about Lys-239 to about Gi -243.
EF081-2 from about Asn-7 to about Ser- 11, from about Lys-77 to about from about Lys-1 12 to about Asn-1 114, from about Gly- 162 to about Aspfrom about Arg- 181 to about Gly- 183.- EF083-2 from about Gln-38 to about EF084-2 from about Lys-140 to about Asp-142, from about Gly-164 to about Argfrom about Arg-262 to about Gly-264.
EF085-2 from about Asn-95 to about Asp-97, from about Mrg-I 12 to about Asp- 114, from about Asp-258 to about Ser-260, from about Arg-401 to about EF086-2 from about Pro-I 12 to about Gly-I 15, from about Ser-222 to about Ser- 224, from about Asn-296 to about Gly-299, from about Thr-346 to about Lys-348, from about Asp-428 to about Ser-432.
EF087-2 from about Pro-I 12 to about Gly-1 15, from about Ser-222 to about Ser- 224, from about Asn-296 to about Gly-299, from about Thr-34 6 to about from about As -428 to about Ser-432.
EF088-2 from about Pro-I 12 to about Gly-I 15, from about Ser-222 to about Ser- 224, from about Asn-296 to about Gly-299, from about Thr-346 to about from about Asp-428 to about Ser-432.
293 Table 4. Residues Coi-iprising Antigenic Epitope-Bearing Portion.
EF090-2 from about Arg-2 to about EF091-2 from about Gln-40 to about Asp-43.
EF093-2 from about Lys-95 to about Gly- 9 7 EF094-2 from about Asp-3 14 to about Asp-3 16.
EF095-2 from about Ser-328 to about Thr-330, from about Asp-359 to about Asp- 363, from about Glu-637 to about Gly-639, from about Asn-744 to about EF096-2 from about Pro-128 to about Asn-130, from about Ser-193 to about Asp- 196.
EF097-2 from about Val-357 to about Gly-359.
EF099-2 from about Glu-44 to about Asp-47, from about Lys-] 154 to about Gly- 156, from about Asn-286 to about Asp-289.
EF 101-2 from about Lys-40 to about Asp-42, from about Pro-25 5 to about Asnfrom about Lys-288 to about Gly-290.
EF 102-2 from about Asp-314 to about Asp-3 16.
EF103-2 from about Asn-46 to about Gly-48.
EF 104-2 from about Pro-232 to about Lys-237, from about Ala-362 to about Asn- 366, from about Ser-42 1 to about Gly-423, from about Lys-488 to about Ser-490, from about Asp-S 50 to about Asn-5 52, from about Pro- 6 37 to about Lys-640, from about Asp-727 to about Gly-729, from about Asn- 751 to about Ser-754, from about Lys-771 to about Asn-774, from about lle-835 to about Asn-837, from about Pro-851 to about Gly-853.
EFi 05-2 from about Ser-40 to about Gly-43, from about Asn-94 to about Gln-97, from about Gln-220 to about Gly- 2 2 2 from about Asn-263 to about Gly- 265.
EF 106-2 from about Asp-72 to about Gly-75, from about Tbr-274 to about Aspfrom aboutAsn-3 10_to about Arg-3 13.
F77- 2 from about Thr- 15 5 to about Asn- 15 7, from about Thr- 189 to about Asp 294 Table 4. Residues Comprising Antigenic Epitope-Beanng Portion.
191, from about Arg-270 to about Gly-272, from about Thr-330 to about Lys-335, from about Asp-365 to about Asp-368, from about Pro-451 to Asp-453, from about Gly-485 to about Thr-488.
EFI 08-2 from about Lys- 142 to about Trp- 145, from about Thr- 147 to about Tyr- 150, from about Arg-212 to about Gly-214, from about Ser-248 to about Asp-251, from about Asp-384 to about Asp-387, from about Pro-481 to about Arg-483, from about Lys-491 to about Gly-494, from about Thr- 619 to about Gly-624, from about Asp-656 to about Asp-659, from about Lys-717 to about Asn-721, from about Ser-822 to about Gly-824, from about Tvr 1137 to about Thr-A 141.
S
S.
S
S
S
S*SS
S
S. S
S
S.
EFI 10-2 from about Pro-123 to about Gly-127, from about Thr-223 to about Gly- 225.
EF1 11-2 from about Lys-207 to about Asn-209, from about Asp-245 to about Asn-248, from about Lys-396 to about Asp-398, from about Glu-429 to Ser-432, from about Thr-470 to about His-474.
EFI 119-2 from about Asp-90 to about Asn-92, from about Gin- 142 to about Gly- EF121-2 from about Asn-159 to about Asp-161, from about Asn-35 1 to about Lys-353, from about Pro-658 to about Gly-660, from about Lys-786 to Ser-789.
EF122-2 from about Asri-159 to about Asp-161, from about Asn-35 I to about Lys-353, from about Pro-658 to about Gly- 66 O, from about Lys-786 to about Ser-789.
EF123-2 from about Asn-331 to about Arg-336, from about Asp-634 to about Gly- 636, from about Glu-780 to about Ser-782, from about Tyr-909 to about Asn-9 11, from about Lys-939 to about Glu-942, from about Asp- 1074 to about Gly-1076, from about Asp-1367 to about Gly-1369, from about Pro-1433 to about Lys-1435, from about Gly-1516 to about Asp-iS 18, from about Lys- 165 6 to about Asp- 1660, from about Lys- 18 6 0 to about Gln- 1863, from about Ser- 1916 to about Gin- 1919, from ab out Pro-i1940 about Gly-1942.
EF124-2 from about Asn-33 1 to about Arg-336, from about Asp-634 to about (ily- 636, from about Giu-780 to about Ser-782, from about Tyr-909 to about As-n-9 11, from about Lys-939 to about Glu-942, from about Asp- 1074 to About Gly- 107 6, from Aout Asp- 13 67 to about Gly- 13 69, from about to about Lys-1435, from about Gy-i 516 to about Asp-1518, 295 Table 4. Residues Comprising Antigenic Epitope-Bearing Portion.
ftrm about Lys-1 656 to about Asp- 1660, from about Lys- 1860 to about GI-n- 1863, from about SeT- 1916 to about Gln-1 919, fr-om about Pro-I 940 to about Gly- 1942.
EF125-2 from about Asn-331 to about Arg-336, from about Asp-634 to about Gl- 636, from about Glu-780 to about Ser-782, from about Tyr-909 to about Asn-91 1, from about Lys-93 9 to about Glu-942, from about Asp-] 074 to about Gly- 1076, from about Asp- 1367 to about Gly- 1369, from about Pro- 143 3 to about Lys- 143 5, from about Gly- 1516 to about Asp-IS5 18, from about Lys- 1656 to about Asp- 1660, from about Lys- 1860 to about GIn- 1863, from about SerI916. to about Gln- 1919, from about Pro- 1940 about Gly-1942.
EF 126-2 from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 352, from about Lys-415 to about Asn-4 18, from about Arg-446 to about Asp-448, from about Asn-489 to about Lys-49], from about Ser-516 to about Asp-S 18, from about Glu-639 to about Lys-642.
EF127-2 from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 352, from about Lys-4 15'to, about Asn-4 18, from about Arg-446 to about Asp-448, from about Asn-489 to about Lys-49 1, from about Ser-5 16 to about Asp-S 18, from about Glu-639 to about Lys-642.
EF128-2 from about Ser-236 to about Tyr-239, from about Asp-350 to about Gly- 352, from about Lys-415 to about Asn-418, from about Arg-446 to about Asp-448, from about Asn-489 to about Lys-49 1, from about Ser-S 16 to Asp-S 18, from about Glu-639 to about Lys-642.
EF129-2 from about Asn-300 to about Gly-302, from about Ser-316 to about Glyfrom about Asn-385 to about His-387 EF13 1-2 from about Lys-201 to about Tyr-204, from about Glu-263 to about Ser- 266.
EF132-2 from about Thr-26 to about Ser-28.
4* INDICATIONS RELATING TO A DEPOSITED MICROORGANISM (PCT Rule 13bis) Address of depositary institution (including postal code and country) 10801 University Boulevard Manasas, Virginia 201 10-2209 United States of America C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indicarions are nor for all designated States) E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) The indications listed below will be submitted to the International Bureau later (specify thec general nature of the indication. "A ccession Number of Depasit') For receiving Office use only This she Js eived with the international application SFor International Bureau use onlyV W This sheet was received by the international Bureau on: Authorized officer Authorized offiicer

Claims (25)

1. An isolated nucleic acid molecule comprising a polynucleotide having a nucleotide sequence selected from the group consisting of: a nucleotide sequence encoding any one of the amino acid sequences of the polypeptides shown in Table 1; or a nucleotide sequence complementary to any one of the nucleotide sequences in a nucleotide sequence at least 95% identical to any one of the nucleotide sequences shown in Table 1; or, a nucleotide sequence at least 95% identical to a nucleotide sequence complementary to any one of the nucleotide sequences shown in Table
2. An isolated nucleic acid molecule of claim 1 comprising a polynucleotide which hybridizes under stringent hybridization conditions to a polynucleotide having a nucleotide sequence identical to a nucleotide sequence in or of claim 1.
3. An isolated nucleic acid molecule of claim 1 comprising a polynucleotide which encodes an epitope-bearing portion of a polypeptide in of claim 1.
4. The isolated nucleic acid molecule of claim 3. wherein said epitope-bearing portion of a polypeptide comprises an amino acid sequence listed in Table 4.
5. A method for making a recombinant vector comprising inserting an isolated nucleic acid molecule of claim 1 into a vector.
6. A recombinant vector produced by the method of claim
7. A host cell comprising the vector of claim 6.
8. A method of producing a polypeptide comprising: growing the host cell of claim 7 such that the protein is expressed by the cell; and recovering the expressed polypeptide.
9. An isolated polypeptide comprising a polypeptide selected from the group consisting of: a polypeptide consisting of one of the complete amino acid sequences of Table 1; a polypeptide consisting of one the complete amino acid sequences of Table 1 except the N-terminal residue; a fragment of the polypeptide of having biological activity; and a fragment of the polypeptide of which binds to an antibody specific for the polypeptide of An isolated antibody specific for the polypeptide of claim 9.
11. A polypeptide produced according to the method of claim 8.
12. An isolated polypeptide comprising an amino acid sequence at least 95% identical to a sequence selected from the group consisting of an amino acid sequence of any one of the polypeptides in Table 1.
13. An isolated polypeptide antigen comprising an amino acid sequence of an E. faecalis epitope shown in Table 4.
14. An isolated nucleic acid molecule comprising a polynucleotide with a nucleotide sequence encoding a polypeptide of claim 9. A hybridoma which produces an antibody of claim
16. A vaccine, comprising: one or more E. faecalis polypeptides selected from the group consisting of a polypeptide of claim 9; and a pharmaceutically acceptable diluent, carrier, or excipient; wherein said polypeptide is present, in an amount effective to elicit protective antibodies in an animal to a member of the Enterococcus genus.
17. A method of preventing or attenuating an infection caused by a member of the Enterococcus genus in an animal, comprising administering to said animal a polypeptide of claim 9, wherein said polypeptide is administered in an amount effective to prevent or attenuate said infection.
18. A method of detecting Enterococcus nucleic acids in a biological sample comprising: contacting the sample with one or more nucleic acids of claim 1, under conditions such that hybridization occurs, and detecting hybridization of said nucleic acids to the one or more Enterococcus nucleic acid sequences present in the biological sample. -299-
19. A method of detecting Enterococcus faecalis nucleic acids in a biological sample obtained from an animal, comprising: a) amplifying one or more Enterococcus faecalis nucleic acid sequences in said sample using polymerase chain reaction, using primers selected from claim 1; and b) detecting said amplified Enterococcus faecalis nucleic acid. A kit for detecting Enterococcus antibodies in a biological sample obtained from an animal, comprising: a) a polypeptide of claim 9 attached to a solid support; and b) detecting means.
21. A method of detecting Enterococcus antibodies in a biological sample obtained from an animal, comprising: a) contacting the sample with a polypeptide of claim 9; and b) detecting antibody-antigen complexes. *l
22. An isolated nucleic acid molecule of claim 1 substantially as herein before described with reference to the Examples.
23. An isolated polypeptide according to claim 9 substantially as herein before described with reference to the Examples.
24. An isolated polypeptide according to claim 12 substantially as herein before described with reference to the Examples.
25. An isolated polypeptide antigen according to claim 13 substantially as herein before described with reference to the Examples.
26. A method according to claim 19 substantially as herein before described with reference to the Examples. 300
27. A method according to claim 21 substantially as herein before described with reference to the Examples. Dated this Twentieth day of July 2001. Human Genome Sciences, Inc. Wray Associates Perth, Western Australia Patent Attorneys for the Applicant S. 0 S S S. 0S 0S 0 SS OS S. S OS OS 0O S SOS S 5* S C OS. S S 550505 e.g. 0 0555 00 5000 S
555. S .555 e.g. S S *5 0 S Se S 00
AU55888/01A 1997-05-06 2001-07-20 Enterococcus faecalis polynucleotides and polypeptides Expired - Fee Related AU770915B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU55888/01A AU770915B2 (en) 1997-05-06 2001-07-20 Enterococcus faecalis polynucleotides and polypeptides

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US60/044031 1997-05-06
US60/046655 1997-05-16
US60/066009 1997-11-14
AU72793/98A AU735444B2 (en) 1997-05-06 1998-05-04 (Enterococcus faecalis) polynucleotides and polypeptides
AU55888/01A AU770915B2 (en) 1997-05-06 2001-07-20 Enterococcus faecalis polynucleotides and polypeptides

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
AU72793/98A Division AU735444B2 (en) 1997-05-06 1998-05-04 (Enterococcus faecalis) polynucleotides and polypeptides

Publications (2)

Publication Number Publication Date
AU5588801A true AU5588801A (en) 2001-09-20
AU770915B2 AU770915B2 (en) 2004-03-04

Family

ID=31982944

Family Applications (1)

Application Number Title Priority Date Filing Date
AU55888/01A Expired - Fee Related AU770915B2 (en) 1997-05-06 2001-07-20 Enterococcus faecalis polynucleotides and polypeptides

Country Status (1)

Country Link
AU (1) AU770915B2 (en)

Also Published As

Publication number Publication date
AU770915B2 (en) 2004-03-04

Similar Documents

Publication Publication Date Title
AU735444B2 (en) (Enterococcus faecalis) polynucleotides and polypeptides
AU2016219667B2 (en) Antibacterial phage, phage peptides and methods of use thereof
KR100923598B1 (en) Surface Proteins of Streptococcus pyogenes
AU773895B2 (en) Streptococcus pneumoniae antigens and vaccines
JPH09322781A (en) Staphylococcus aureus polynucleotide and sequence
EP0934336A1 (en) Novel compounds
CA2218741A1 (en) Nucleotide sequence of the haemophilus influenzae rd genome, fragments thereof, and uses thereof
EP0804616A2 (en) Specific and universal probes and amplification primers to rapidly detect and identify common bacterial pathogens and antibiotic resistance genes from clinical specimens for routine diagnosis in microbiology laboratories
AU2015327511B2 (en) Biomarkers for rheumatoid arthritis and usage thereof
CA2378687A1 (en) Nucleotide sequences of moraxella catarrhalis genome
WO1998059071A1 (en) Lyme disease vaccines
CN101019123A (en) Computational method for identifying adhesin and adhesin-like proteins of therapeutic potential
US6902893B1 (en) Lyme disease vaccines
WO2000012678A2 (en) Staphylococcus aureus genes and polypeptides
AU770915B2 (en) Enterococcus faecalis polynucleotides and polypeptides
AU777190B2 (en) Streptococcus pneumoniae polynucleotides and sequences
AU2021240230B2 (en) Vaccines and vaccine components for inhibition of microbial cells
AU710880B2 (en) Nucleic acid and amino acid sequences relating to helicobacter pylori for diagnostics and therapeutics
AU713692B2 (en) Nucleic acid and amino acid sequences relating to helicobacter pylori for therapeutics
AU1546202A (en) Enterococcus faecalis polynucleotides and polypeptides
AU3796099A (en) Assays using nucleic acid and amino acid sequences relating to helicobacter pylori
AU8938601A (en) Lyme disease polynucleotides