US20020150902A1 - Polymorphic loci that differentiate escherichia coli 0157:H7 from other strains - Google Patents
Polymorphic loci that differentiate escherichia coli 0157:H7 from other strains Download PDFInfo
- Publication number
- US20020150902A1 US20020150902A1 US09/875,573 US87557301A US2002150902A1 US 20020150902 A1 US20020150902 A1 US 20020150902A1 US 87557301 A US87557301 A US 87557301A US 2002150902 A1 US2002150902 A1 US 2002150902A1
- Authority
- US
- United States
- Prior art keywords
- ala
- gly
- leu
- ile
- glu
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0006—Oxidoreductases (1.) acting on CH-OH groups as donors (1.1)
Definitions
- the present invention relates generally to the field of microbiology and food sciences. More particularly, the inventor has discovered the gnd gene and corresponding 6-phosphogluconate dehydrogenase (6-PGD) protein from fourteen different strains of Escherichia coli and polymorphic sequences therein. Novel biotechnological tools, diagnostics, and food screening techniques are provided.
- 6-PGD 6-phosphogluconate dehydrogenase
- Escherichia coli O157:H7 is an exceptionally virulent food borne, human pathogen that causes a spectrum of illness, including asymptomatic and post-symptomatic carriage, mild diarrhea, bloody diarrhea/hemorrhagic colitis, and the postdiarrheal, potentially lethal, hemolytic uremic syndrome (HUS).
- HUS hemolytic uremic syndrome
- HUS is defined as a triad of non-immune microangiopathic hemolytic anemia, thrombocytopenia, and acute renal failure. HUS is chiefly a disorder of children under age 10, however, the elderly are also susceptible to severe complications of E. coli O157:H7 gastrointestinal infections.
- HUS follows gastrointestinal infection with E. coli O157:H7 in approximately 10-15% of pediatric cases.
- Talr et al. Am J Epidemiol, 129:582-586 (1989);
- Tarr et al. Am J Epidemiol, 129:582-586 (1989)).
- E. coli O157:H7 is often found in food and environmental vehicles that do not always undergo an efficient bacterial killing process.
- Large outbreaks have been caused by the interstate dissemination of contaminated ground beef that was under cooked (Bell et al., JAMA, 272:1349-1353 (1994) and Riley et al., N Engl J Med, 308:681-685 (1983)); salted, fermented, but uncooked salami (Tilden et al., Am J Public Health, 86:1142-1145 (1996)); municipal (Swerdlow et al., Ann Intern Med, 117:812-819 (1992)) and swimming (Keene et al., N Engl J Med, 331:579-584 (1994)) water; unpasteurized apple juice (Anonymous, Morb Mortal Wkly Rep, 45:975 (1996)); unpasteurized milk (Keene et al., J Infect Dis, 176:815-818 (1997)); and lettuce
- E. coli O157:H7 has not been shown to possess a capsular polysaccharide but it expresses an O side chain antigen designated 157, which consists of repeating tetrasaccharide units of variable length. These tetrasaccharide units comprise the antigenic O157 lipopolysaccaride (LPS).
- O157:H7 fails to ferment sorbitol after overnight culture on MacConkey agar into which sorbitol rather than lactose is incorporated as the carbon source. (Wells et al., J Clin Microbiol, 18:512-520 (1983); March et al., J Clin Microbiol, 23:869-872 (1986)).
- E. coli O157:H7 also fails to produce ⁇ -glucuronidase, another metabolic distinguishing factor.
- Sorbitol non-fermenting E. coli almost always express the H7 flagellar antigen, though occasional sorbitol non-fermenting E. coli O157 strains recovered in the United States do not express the H7 antigen.
- Another variant of E. coli O157:H7 has been found in Germany and Czech Republic, which expresses the O157 antigen, but are non-motile pathogens that ferment sorbitol.
- Shiga-toxin a glycosphingolipid ligand (globotriaosylceramide) (Basta et al., J Clin Microbiol, 27:1617-1622 (1989)) (Biocarb, Gaithersburg, Md.) or employ an enzyme immunoassay (Meridian Diagnostics, Cincinnati, Ohio). (Kehl et al., J Clin Microbiol, 35:2051-2054 (1997)); Park et al., Diag Microbiol Infect Dis, 26:69-72 (1996)).
- a glycosphingolipid ligand globotriaosylceramide
- the inventor has discovered the gnd gene and corresponding 6-phosphogluconate dehydrogenase (6-PGD) protein of fourteen strains of E. coli . Within these genes and proteins the inventor has also found several polymorphisms that can be used to identify the presence of a particular strain of E. coli and/or differentiate one strain of E. coli from another.
- One polymorphism in particular which involves a substitution of an isoleucine molecule for a threonine molecule at amino acid position 218, can be used to differentiate highly pathogenic strains of O157:H7 and O55:H7 from less pathogenic strains of O157:H7.
- O55:H7 is only about 82% homologous to O157:H7
- the highly pathogenic strains of O157:H7 can be differentiated from O55:H7 at several different loci.
- identifying the presence and/or absence of the polymorphism at position 218 and identifying the presence or absence of a region of non-homology between O55:H7 and O157:H7 one of skill in the art can rapidly identify the presence of a highly pathogenic strain of E. coli in a sample obtained from a patient or from a food or liquid source.
- identifying the presence or absence of other polymorphisms in the gnd locus one of skill can efficiently differentiate specific strains of E. coli allowing for a more precise diagnosis or screening.
- Embodiments of the invention include an isolated polynucleotide encoding gnd, wherein the polynucleotide comprises one of the E. coli sequences disclosed in the sequence listing. Fragments of these sequences having least 9 consecutive bases and a polymorphism described in Table 1 are also embodiments of the invention. Other embodiments include isolated polynucleotides that encode a polypeptide that corresponds to the E.
- a additional embodiment concerns a nucleic acid probe for detecting the presence of E. coli O157:H7 consisting of an isolated nucleic acid molecule at least 7 nucleotides in length, wherein the nucleic acid molecule hybridizes to DNA of gnd of E. coli O157:H7 and not to DNA of gnd of non-H7 E.
- nucleic acid primer for detecting the presence of E. coli O157:H7 consisting of an isolated nucleic acid molecule at least 7 nucleotides in length, wherein the isolated nucleic acid molecule primes DNA of gnd of E. coli O157:H7 and not DNA of gnd of non-H7 E. coli O157 strains.
- the nucleic acid probes of the invention can be provided on a substrate or in a microarray on a chip.
- SRecombinant constructs and vectors comprising one of the sequences of the sequence listing are also embodiments of the invention.
- a cultured cell line comprising the one of the vectors of the invention is an embodiment.
- the proteins of the invention include an isolated protein comprising one of the sequences found in the sequence listing and an isolated polypeptide comprising at least 3 consecutive amino acids of one of the sequences of the sequence listing, wherein the polypeptide contains at least one polymorphism that can be deduced from Table 1.
- Additional protein embodiments concern an isolated antibody capable of specifically binding to a protein having one of the sequences of the sequence listing, wherein the epitope corresponds to at least one polymorphism that can be deduced from Table 1.
- another embodiment includes an isolated antibody capable of binding to a polypeptide comprising at least 9 consecutive amino acids of one of the sequences of the sequence listing, wherein the epitope corresponds to at least one polymorphism that can be deduced from Table 1.
- the antibody is monoclonal.
- Methods of detecting a polymorphism and detecting or diagnosing the presence of a highly pathogenic E. coli are also embodiments.
- a polymorphism in a gene encoding 6-PGD is detected by obtaining a biological sample containing polynucleotides and analyzing the biological sample for the presence of a diagnostic polynucleotide having at least one polymorphism described in Table 1.
- the presence or absence of the C653T or G653C polymorphism is analyzed and/or the analysis of the biological sample further comprises a DNA amplification step.
- Another method concerns the identification of a pathogenic or non-pathogenic E. coli .
- This approach is practiced by obtaining a biological sample containing polynucleotides, analyzing the biological sample for the presence of a diagnostic polynucleotide having at least one polymorphism described in Table 1, and identifying the E. coli as a pathogenic or non-pathogenic strain based on the presence or absence of at least one polymorphism described in Table 1.
- the presence or absence of the C653T or G653C polymorphism is analyzed and/or the analysis of the biological sample further comprises a DNA amplification step.
- Other methods of the invention include, a method of making a 6-PGD protein comprising the steps of obtaining a cDNA comprising one of the sequences of the sequence listing, inserting the cDNA in an expression vector such that the cDNA is operably linked to a promoter, and introducing the expression vector into a host cell whereby the host cell produces the protein encoded by the cDNA.
- This method can also be used in conjunction with a step involving the isolation of the protein.
- An additional method concerns the construction of a transformed host cell that expresses one of the sequences of the sequence listing. This method includes the steps of transforming a host cell with a recombinant DNA vector suitable for gene expression. Additionally, a method for detecting the presence of E.
- coli O157:H7 in a sample involves the steps of: (a) contacting said sample, under hybridization conditions, with a nucleic acid probe that selectively hybridizes to a nucleic acid sequence from gnd of E. coli O157:H7 and not to nucleic acid sequence from gnd of non-H7 E. coli O157 strains, to form a hybridization complex and (b) detecting formation of said hybridization complex as an indication of the presence of E. coli O157:H7 in the sample.
- FIG. 1 shows a graphical representation of the polymorphisms present at the gnd locus in several strains of E. coli . Bars represent the 1407 bp gnd allele and the vertical lines represent sites of polymorphisms determined by comparison to a consensus sequence.
- FIG. 2 shows the homology between chromosomes of E. coli O55:H7 and E. coli O157:H7 observed 3916 nucleotides downstream of the 3′ terminus of gnd of E. coli O55:H7, and 52 nucleotides downstream of the 3′ terminus of gnd of E. coli O157:H7.
- Elements of interest in the extra DNA in E. coli O55:H7 include a segment of homology to tnpA of S. enterica Typhimurium, an H-repeat protein gene with segments homologous to noncoding parts of the E. coli O157 rfb cluster, wbdJ and wbdK. Orfs are noted as homologous proteins. Loci are oriented chromosomally.
- FIG. 3 is a representation of a chromosome having the gnd locus and flanking regions.
- the inventor describes the discovery of the gnd gene and corresponding 6-phosphogluconate dehydrogenase (6-PGD) protein of fourteen strains of E. coli .
- 6-PGD 6-phosphogluconate dehydrogenase
- the inventor has also found several genetic differences or “polymorphisms” that can be used to identify the presence of a particular strain of E. coli and/or differentiate one strain of E. coli from another.
- One polymorphism in particular involves a substitution of an isoleucine molecule for a threonine molecule at amino acid position 218. This polymorphism is referred to as “T218I” or “Thr218Iso”.
- this form of 6-PGD or a polynucleotide encoding this form of 6-PGD i.e., an isoleucine at amino acid position 218 or a polynucleotide encoding an isoleucine at position 2178
- Iso218 a 6-PGD molecule having a threonine at amino acid position 218 or a polynucleotide encoding a threonine at position 218
- Thr218 a 6-PGD molecule having a threonine at amino acid position 218 or a polynucleotide encoding a threonine at position 218
- the term “Iso218” refers to a polymorphism in a polynucleotide encoding a fragment of 6-PGD (in which case the polymorphism is with reference to codon 218 of the 6-PGD fragment-encoding polynucleotide), or to a fragment of the 6-PGD protein itself (in which case the polymorphism is with reference to amino acid position 218 of the 6-PGD polypeptide sequence provided in the sequence listing.
- This polymorphism can also be referred to by the nucleotide differences that encode the Iso218 polymorphism.
- the Thr218 polymorphism results from the presence of a cytosine and guanine residue at nucleotide positions 653 and 654, respectively; whereas, the Iso2I8 polymorphism has a thymine and cytosine at positions 653 and 654, respectively.
- other ways of referring to the polymorphism at amino acid residue 218 include “C ⁇ T mutation at nucleotide position 653” and/or a “G ⁇ C” mutation at nucleotide position 654” or “C653T” and/or “G654C”.
- the inventor describes the cloning, sequencing, and characterization of fourteen gnd genes and corresponding proteins from different strains of E. coli .
- Evidence is also provided of the existence of one or more mobile DNA element(s) within the gnd-rfb region that has co-transferred among E. coli and accounts for the antigenic changes that resulted in the emergence of pathogenic E. coli that express the O55 and O157 antigens.
- Biological tools, diagnostics, and methods of use of the foregoing are described in the sections that follow. These embodiments are useful for the rapid identification of the presence of a specific strain of E. coli , and the differentiation of one strain of E.
- the inventor describes the cloning, sequencing, and characterization of the fourteen gnd genes and corresponding proteins of different strains of E. coli.
- the rfb cluster of genes occurs at approximately 44 minutes on the E. coli chromosome. These clusters are generally between 8 and 14 kb in length and contain approximately 8 to 12 contiguous genes that act in concert to produce the O side chain lipopolysaccharide. (Reeves, New Compr Biochem, 27:281-314 (1994); Reeves et al., Trends Microbiol, 4:495-503 (1996)). Adjacent to the rfb cluster is the gnd allele that encodes 6-phosphogluconate dehydrogenase (6-PGD) (EC 1.1.1.44), the third enzyme in the pentose-phosphate pathway.
- 6-PGD 6-phosphogluconate dehydrogenase
- gnd encodes a “housekeeping” gene with critical bacterial function
- this allele is highly polymorphic, when compared to other “housekeeping” genes in the E. coli chromosome.
- Woodtam and Ake “Mechanisms of molecular evolution,” Sinauer, Takahata and Clark, eds., Sunderland, Mass.: 1993:223-245. It is believed by some that the polymorphisms at the gnd locus result from inter-strain or interspecies transfers and subsequent recombination with Salmonella.
- the “hitchhiking hypothesis” the rfb region of E. coli is believed to have been acquired via horizontal transfer from other species by virtue of sequence homology and low G+C content. That is, gnd and rfb are thought to co-transfer or “hitchhike” with rfb. (Nelson and Selander, Proc Natl Acad Sci USA, 91:10227-10231 (1994)).
- this hypothesis are the discordant electromorphic appearances of 6-PGD of E. coli O157:H7 and its closest non-O157:H7 relative, E. coli O55:H7.
- the inventor cloned and sequenced the gnd genes of virulent strains of E. coli O157:H7, E. coli O55:H7, and E. coli that express the O157 antigen but are not as pathogenic to humans as E. coli O157:H7 and determined that, indeed, a relationship existed between polymorphisms within genes of the rfb cluster, in particular gnd and pathogenicity.
- E. coli O157:H7 and the other E. coli strains were cloned from purified bacterial DNA.
- bacteria were grown overnight in LB broth (Maniatis et al., Molecular cloning: a laboratory manual, Cold Spring Harbor Laboratory, (1982)) without antibiotics or with ampicillin (200 mg/mL), respectively, at 37° C.
- LB broth Maniatis et al., Molecular cloning: a laboratory manual, Cold Spring Harbor Laboratory, (1982)
- ampicillin 200 mg/mL
- genomic DNA bacteria (3 ml), pelletted by centrifugation, were suspended in 50 millimolar (mM) Tris-HCl (pH8.0) and 50 mM ethylenediamine tetraacetic acid (EDTA).
- the precipitate was centrifuged, washed once with 100% ethanol, air dried, and solubilized in 10 mM Tris-HCl (pH8.0), containing 1 mM ETDA. Plasmids were obtained and prepared using the Qiaprep Spin Miniprep Kit (Qiagen Inc., Valencia, Calif.) and manufacture's instructions.
- the inventor To amplify gnd from E. coli expressing the O157 antigen, the inventor initially used the primer pair (1) 5′CACGGATCCGATCACACCTGACAGGAGTA3′ (SEQ. ID. No.1) (for the rfb side) and 5′CCGGAATTCCGGGCAAAAAAAAGCCCGGTGCAA3′ (SEQ. ID. No. 2) (for the his side), which were derived from published sequences (Bisercic et al., J Bacteriol, 173:3894-3900 (1991)) and were modified to contain BamHI and EcoRI sites for cloning purposes. However, these primers failed to obtain an amplicon from E. coli O55:H7 DNA.
- the consensus oligonucleotides of primer pair (2)-5′CGGAATTCCGCGCTCAACATCGANAGCCGTGG3′ (SEQ. ID. No. 3) and 5′CGGAATTCCGCCTGGATCAGGTTAGCCGG3′ (SEQ. ID. No. 4) (derived from a computerized data base of E. coli gnd sequences and having 5′ EcoRI sites) were used to prime DNA from strain TB 182A (an E. coli O55:H7 strain). (Bokete et al., J Infect Dis, 175:1382-1389 (1997)).
- These primers produced a PCR product of approximately 1.3 kb, consisting of the internal portion of the gnd gene. Sequence analysis of this amplicon determined that the following primer pairs would prime DNA close to the 5′ and 3′ termini, respectively, of this allele:
- 5′TGCCCGCTACATCTCCTC3′ (SEQ. ID. No. 8) were then used to amplify DNA beyond the 5′ and 3′ termini of the E. coli O55:H7 gnd, respectively.
- the resulting sequence data then prompted the design and use of the primer pair (8)-5′CCATCAGTAATAATGAAAAGGAATT3′ (SEQ. ID. No. 11) and 5′ATCATTAGCTCCTCTTAAGATCGC3′ (SEQ. ID. No.12) to amplify the E. coli O55 gnd allele.
- Primer pairs (9)-5′TCGTCGCTTATGCGGTACAGAGCG3′) (SEQ. ID. No.
- PCR was performed using either the ExpandTM Long Template PCR System (Boehringer Mannheim, Indianapolis, Ind.) (“Expand System”) or Taq DNA polymerase (Promega, Madison, Wis.). For initial pan-gnd amplifications, Taq DNA polymerase (Promega) was used. For amplifications using the Expand system, reactions were performed in 50 ⁇ 1 containing BMB buffer 1 supplied by the manufacturer. DNA polymerases used were either Taq DNA polymerase supplied by Promega, catalog number M1865 (5U/ ⁇ 1) (A) or Taq and Pwo DNA polymerases supplied by Boehringer-Mannheim (3.5U/ ⁇ 1) (B).
- Thermocycling conditions included: 35 cycles at 94° C. (1 min), 37° C. (1 min), and 72° C. (1 min), followed by a 7 minute incubation at 72° C.); 30 cycles at 94° C. (1 min), 37° C. (1 min), and 72° C. (1 min), followed by a 7 minute incubation at 72° C.; an initial cycle at 95° C. (3 min), 55° C. (1 min), and 74° C. (1 min), followed by 35 cycles of 95° C. (1 min), 55° C. (1 min), and 74° C. (1 min), and a final incubation at 72° C. (5 min); or an initial incubation at 92° C. (2 min), followed by 10 cycles at 92° C.
- single nucleotide polymorphisms were found in strains 13A81 and 13A83 ( E. coli O157 isolates expressing H antigens 16 and 45, respectively); strains 13A80, 7E, 3005-89, 3004-89, and G5933 ( E. coli O157 expressing H antigens 16, 43, 38, 3, and 12, respectively); and each of the non-H7 E. coli O157 strains.
- the amino acid sequences that correspond to the polymorphisms described in Table 1 i.e., the polymorphisms expressed in terms of the amino acid
- the T218I was discovered in pathonic O157:H7 strains and the O55:H7 strain TB182A but not any of non-pathonic O157:H7 strains.
- the sequence data revealed that the non-pathogenic strains, except O55:H7, have a cytosine and guanine residue at nucleotide positions 653 and 654, respectively; whereas, the pathogenic strains have a thymine and cytosine at positions 653 and 654, respectively.
- a convenient way to distinguish pathogenic O157:H7 strains from non-pathogenic O157:H7 strains involves the identification of a “C ⁇ T” mutation at nucleotide position 653 of gnd and/or a “G ⁇ C” mutation at nucleotide position 654 or the presence of an isoleucine amino acid residue at amino acid position 218. Because the gnd of E. coli O55:H7 is only about 82% homologous to the gnd of E. coli O157:H7 (e.g., strain 86-24), these strains can be easily distinguished at several different loci, as will be described in greater detail below. TABLE 1 Pos.
- 6468 0.0 1 gb
- E. coli O157 Three distinct allele groups were found in E. coli O157. (See Table 3). These alleles differed from one another at about 5% of their nucleotide residues.
- the “gnd allele A” is comprised of gnds of toxigenic E. coli O157:H7 and E. coli O157:NM strains. The gnd sequences of strains 85-07 and 87-16 each differed from that of strain 86-24 at only two of their 1407 nucleotides; the remaining three were identical.
- the “gnd allele B” is found in E.
- E. coli O55:H7 is the closest relative to E. coli O157:H7, their gnd sequences are strikingly different.
- the gnd sequence of E. coli O157:H7, strain 86-24 has only about 82% homology to the gnd sequence of E. coli O55:H7, strain TB182A and there appears to be no readily apparent region of conservation between these two alleles.
- coli O157:H7 contained open reading frames (orfs) encoding UDP glucose-6-dehydrogenase and an O-antigen chain length determining protein. Sequences between positions 3′+1 and 3′+51, and 3′+1 and 3′+3915, relative to the respective E. coli O55:H7 and E. coli O157:H7 gnds, were not found to be homologous.
- the inventor discovered an orf between positions 3′+2817 and 3′+3422 that encodes a protein of 201 amino acids, which is about 98% homologous to H-repeat protein amino acids in RhsB encoded by orf-H (Genbank number LO2370). (Zhao et al., J. Bacteriol., 175:2799-2808 (1993)). Still further, the inventor found that approximately 92% of the 114 inclusive nucleotides between positions 3′+3809 and 3′+3922 relative to gnd of E. coli O55:H7, including 7 nucleotides of the sequence common to E.
- coli O157:H7 are identical to nucleotides adjacent to the 3′ end of tnpA of Salmonella typhimurium LT2, encoding IS200 transposase A (GenBank number AFO93749). DNA between nucleotides at the 3′+478 and 3′+1942 positions relative to gnd of E. coli O55:H7 were also found to be about 75% identical to E. coli O111 wbdJ and wbdK (Genbank number U13629).
- the two orfs corresponding to nucleotides between positions 3′+112 and 3′+1035, and 3′+1032 and 3′+2198 relative to gnd are 67% and 80% identical to WbdJ and Wbd K, respectively. (Bastin and Reeves, Gene, 164:17-23 (1995)). Three segments between nucleotides at positions 3′+2788 and 3′+3806 relative to the E. coli O55:H7 gnd allele are 83-96% homologous to non-coding regions of the E. coli O157:H7 rfb cluster (Genbank numbers AF061251 and AB008676).
- PCR was employed using the primers: 5′GCGTTCTTAAAGAGTCCTGC3′ (SEQ. ID. No. 13) and 5′TGCCCGCTACATCTCCTC3′ (SEQ. ID. No. 8), which correspond to the 3′ end of gnd and downstream regions, so as to obtain a 6.5 kb amplicon from the DNA of 11 E. coli O55 strains. This amplicon was not obtained when PCR was performed with these primers on DNA from E. coli O157:H7.
- E. coli O55 strains from diverse lineages. Genomic DNA or amplicons from E. coli HB101, E. coli O157:H7 strain 86-24, E. coli O55:H7 strains TB156A, TB182A, and 5 A-E, and E. coli O55:H6 strains 1A, 1B, 2A, and 2B were produced using the primers: 5′GCGTTCTTAAAGAGTCCTGC3′ (SEQ. ID. No. 13) and 5′TGCCCGCTACATCTCCTC3′ (SEQ. ID. No. 8).
- the amplicon probe was labeled with the Megaprime DNA system (Amersham) and [ ⁇ 32 P]dATP (New England Nuclear Research Products). This experiment showed a strong signal in the lanes loaded with DNA from an O55 strain but not from a lane loaded with DNA from an O157 strain or the HB101 control.
- the study above not only provides strong evidence that the region 3′ to gnd in E. coli O55 strains contains a conserved element with sequences that are involved in DNA mobility but also teach a rapid method to differentiate E. coli O55:H7 from O157:H7.
- E. coli O55 and O111 O-side chains each contain colitose (Keene et al., Carbohydr. Res., 111:289-296 (1983)), an unusual residue among known bacterial LPS sugars.
- the rfb regions specifying these two serogroups have genes encoding WbdK and WbdJ homologues, though on different sides of gnd.
- WbdK is homologous to RfbH of Yersinia pseudotuberculosis, a CDB- 4-keto-6-deoxy-D-glucose-3-dehydrase in the CDP-abequose pathway.
- WbdK is a putative pyridoxamine 5-phosphate-dependent dehydrase at a corresponding step in the synthesis of the O111 antigen. (Bastin and Reeves, Gene, 164:17-23 (1995)). WbdJ is homologous to Orf1.9 encoded by the E. coli capsular polysaccharide gene cluster, and is believed to perform a related function in the synthesis of the E. coli O111 LPS antigen.
- a computer readable medium having the gnd sequences and/or corresponding proteins of SEQ. ID. Nos. 16-43 are useful for the determination of homologous sequences, design of probes and primers, epitope analysis, elucidation of structural and functional domains, and the construction of protein models for rational drug design.
- the gnd sequences and/or corresponding proteins of SEQ. ID. Nos. 16-43 can be stored, recorded, and manipulated on any medium that can be read and accessed by a computer.
- the words “recorded” and “stored” refer to a process for storing information on computer readable medium.
- a skilled artisan can readily adopt any of the presently known methods for recording information on computer readable medium to generate manufactures comprising the nucleotide or polypeptide sequence information of this embodiment of the invention.
- a variety of data storage structures are available to a skilled artisan for creating a computer readable medium having recorded thereon a nucleotide or polypeptide sequence. The choice of the data storage structure will generally be based on the component chosen to access the stored information.
- Computer readable media include magnetically readable media, optically readable media, or electronically readable media.
- the computer readable media may be a hard disc, a floppy disc, a magnetic tape, CD-ROM, RAM, or ROM as well as other types of other media known to those skilled in the art.
- the computer readable media on which the sequence information is stored may be in a personal computer, a network, a server or other computer systems known to those skilled in the art.
- Embodiments of the invention include systems, particularly computer-based systems that contain the sequence information described herein.
- a computer-based system refers to the hardware, software, and database used to analyze the gnd sequences and/or corresponding proteins of SEQ. ID. Nos. 16-43, or fragments thereof.
- the computer-based system preferably includes the storage media described above, and a processor for accessing and manipulating the sequence data.
- the hardware of the computer-based systems of this embodiment comprise a central processing unit (CPU) and one or more databases.
- CPU central processing unit
- a skilled artisan can readily appreciate that any one of the currently available computer-based systems are suitable.
- the computer system includes a processor connected to a bus which is connected to a main memory (preferably implemented as RAM) and a variety of secondary storage devices, such as a hard drive and removable medium storage device.
- the removable medium storage device may represent, for example, a floppy disk drive, a compact disk drive, a magnetic tape drive, etc.
- a removable storage medium, such as a floppy disk, a compact disk, a magnetic tape, etc. containing control logic and/or data recorded therein (e.g., the gnd sequences and/or corresponding proteins of SEQ. ID. Nos. 16-43) may be inserted into the removable storage device.
- the computer system includes appropriate software for reading the control logic and/or the data from the removable medium storage device once inserted in the removable medium storage device.
- the gnd sequences and/or corresponding proteins of SEQ. ID. Nos. 16-43 may be stored in a well known manner in the main memory, any of the secondary storage devices, and/or a removable storage medium.
- Software for accessing and processing the gnd sequences and/or corresponding proteins of SEQ. ID. Nos. 16-43 (such as search tools, compare tools, and modeling tools etc.) reside in main memory during execution.
- a database refers to memory that can store nucleotide or polypeptide sequence information, and protein model information. Additionally, a “database” refers to a memory access component which can access manufactures having recorded thereon nucleotide or polypeptide sequence information, and/or protein model information. In other embodiments, a database stores an “ E. coli pathogen profile” that comprises nucleotide and/or polypeptide sequence information, and/or protein model information on gnd genes and 6-PGD proteins and the polymorphisms therein.
- an E. coli pathogen profile has recorded or stored in a database a plurality of polymorphisms associated with highly pathogenic and/or less pathogenic E.
- the pathogen profile can be stored such that the sequences therein that correspond to specific organisms are fully searchable by sequence, organism, and/or restriction map and homology, identity and matches to queried sequences can be determined.
- a preferable organization of the database is as provided by NCBI, which allows BLAST-type searching, protein model searching, key word searches, and an interface with Medline. Many other types of databases and organizations are known to those of skill in the art and several will be discussed below.
- the gnd sequences and/or corresponding proteins of SEQ. ID. Nos. 16-43 may be stored and manipulated in a variety of data processor programs in a variety of formats.
- the sequence data may be stored as text in a word processing file, such as MicrosoftWORD or WORDPERFECT or as an ASCII file in a variety of database programs familiar to those of skill in the art, such as DB2, SYBASE, or ORACLE.
- a “search program” refers to one or more programs that are implemented on the computer-based system to compare a nucleotide or polypeptide sequence with other nucleotide or polypeptide sequences stored within the database.
- a search program also refers to one or more programs that compare one or more protein models to several protein models that exist in a database.
- a search program is used, for example, to compare regions of the gnd sequences and/or corresponding proteins of SEQ. ID. Nos. 16-43 that match sequences in nucleic acid and/or protein data base so as to identify homologies and structural or functional motifs.
- a search program is used to compare an E. coli pathogen profile to a queried sequence so as to identify the presence of one or more polymorphisms in the queried sequence and determine the strain of the bacteria from which the queried sequence was derived.
- a “retrieval program” refers to one or more programs that are implemented on the computer based system to identify a homologous nucleic acid sequence, a homologous protein sequence, or a homologous protein model. Further a retrieval program can be used to identify an E. coli pathogen profile that matches a queried sequence, keyword, disease characteristic, or restriction map. Preferably, the retrieval program interfaces with a display format that presents the data from the E. coli pathogen profile in a form that can be rapidly discerned. For example, the “bar code” shown in FIG. 1 is one format that can be obtained by a retrieval program that provides information on the position of polymorphisms that can be used to identify or distinguish a particular strain of E. coli.
- one of the novel sequences disclosed in is compared to a queried sequence and the percent sequence identity is determined.
- Standard methods that are commonly used to compare the similarity and position of the amino acid of two polypeptides can be used to make these comparisons.
- two polypeptides can be aligned for optimal matching of their respective amino acids (either along the full length of one or both sequences, or along a predetermined portion of one or both sequences).
- Such programs provide “default” opening penalty and a “default” gap penalty, and a scoring matrix such as PAM 250 (a standard scoring matrix; see Dayhoff et al., in: Atlas of Protein Sequence and Structure, Vol. 5, Supp. 3 (1978)) can be used in conjunction with the computer program.
- the percent identity can then be calculated as:
- Polypeptides that are at least 70% identical will typically have one or more amino acid substitutions, deletions and/or insertions. Usually, the substitutions will be conservative so as to have little or no effect on the overall net charge, polarity, or hydrophobicity of the protein but optionally may increase the activity of 6-PGD.
- Blast 2 (BlastP 2.0.9) searches on the NCBI data base using the BLOSUM matrix with an opening penalty of 11, a gap extension of 1, and an x_-dropoff of 50. These later search parameters were used to compare 6-PGD encoded by O157:H7, strains 86-24, H8, ADAL233, and 2755 to:
- ORFs encoded by the gnd sequences and/or corresponding proteins of SEQ. ID. Nos. 16-43 and regions within the gnd/rfb gene cluster were also compared to known amino acid sequences found in Swissprot.
- Many computer programs and databases may be used with embodiments of the invention. The following list is intended not to limit the invention but to provide guidance to programs and databases that are useful with the nucleic acid and protein sequence embodiments of the invention.
- the programs and databases that can be used include, but are not limited to: MacPattern (EMBL), DiscoveryBase (Molecular Applications Group), GeneMine (Molecular Applications Group), Look (Molecular Applications Group), MacLook (Molecular Applications Group), BLAST and BLAST2 (NCBI), BLASTN and BLASTX (Altschul et al, J. Mol. Biol. 215: 403 (1990)), FASTA (Pearson and Lipman, Proc. Natl. Acad. Sci.
- aspects of the invention include recombinant vectors, probes, and primers comprising the gnd sequences and/or corresponding proteins of SEQ. ID. Nos. 16-43 and fragments thereof, in particular portions of the gnd gene or corresponding protein that contain a polymorphism described in Table 1.
- the discussion below describes these aspects of the invention.
- Several embodiments of the invention include recombinant vectors, probes, and primers comprising the gnad sequences of SEQ. ID. Nos. 22, 16, 18, 24, 26, 20, 42, 28, 30, 40, 32, 36, 38, and 34 and fragments thereof.
- preferred nucleic acid embodiments include fragments of any gnd gene that have a polymorphism described in Table 1.
- full-length refers to either the entire sequence of genomic gnd or cDNA gnd depending on the context. Further embodiments include nucleic acids that complement the full-length gnd described in SEQ.
- nucleic acid embodiments of the invention can have from 9 to approximately 1,406 consecutive nucleotides of SEQ. ID. Nos.: 22, 16, 18, 24, 26, 20, 42, 28, 30, 40, 32, 36, 38, and 34 or a complement to these sequences of virtually any length so long as the nucleic acid includes at least one polymorphism described in Table 1.
- gnd nucleic acids of the invention can be joined to an exogenous nucleic acid so as create a fusion product, which is within the scope of the invention, having virtually any length.
- a nucleic acid having a portion (i.e., about 9 to about 1,406 consecutive nucleotides) of SEQ. ID. Nos.: 22, 16, 18, 24, 26, 20, 42, 28, 30, 40, 32, 36, 38, and 34 or a complement to these sequences or a full-length gnd of the invention (either genomic or cDNA) are embodiments.
- embodiments include a nucleic acid having at least one polymorphism described in Table 1 and less than or equal to 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450,
- the nucleic acid embodiments comprise at least 12, 13, 14, 15, 16, 17, 18, or 19 consecutive nucleotides from SEQ. ID. Nos.: 22, 16, 18, 24, 26, 20, 42, 28, 30, 40, 32, 36, 38, and 34 or a complement to these sequences, as conditions dictate, so long as the fragment has at least one polymorphism described in Table 1. More preferably, the nucleic acid embodiments comprise at least 20-30 consecutive nucleotides. These nucleic acid oligomers have biotechnological and diagnostic use, e.g., in nucleotide acid hybridization assays, Southern and Northern Blot analysis, etc. and the prognosis of E. coli infection.
- Some embodiments comprise recombinant constructs having all or part of the gnd genes disclosed in SEQ. ID. Nos. 22, 16, 18, 24, 26, 20, 42, 28, 30, 40, 32, 36, 38, and 34 or complements thereof
- a recombinant construct can be capable of replicating autonomously in a host cell. Alternatively, the recombinant construct can become integrated into the chromosomal DNA of the host cell.
- Such a recombinant polynucleotide comprises a polynucleotide of genomic or cDNA, of semi-synthetic or synthetic origin by virtue of human manipulation. Therefore, recombinant nucleic acids comprising sequences otherwise not naturally occurring are provided by embodiments of this invention.
- nucleic acid embodiments of this invention can also be altered by mutation such as substitutions, additions, or deletions that provide for sequences encoding functionally equivalent molecules. Due to the degeneracy of nucleotide coding sequences, other DNA sequences that encode substantially the same 6-PGD amino acid sequence as depicted in SEQ. ID. Nos.: 23, 17, 19, 25, 27, 21, 43, 29, 31, 41, 33, 37, 39, and 35 can be used in some embodiments of the invention. These include, but are not limited to, nucleic acid sequences comprising all or portions of gnd depicted in SEQ. ID. Nos.: 22, 16, 18, 24, 26, 20, 42, 28, 30, 40, 32, 36, 38, and 34 or complements thereof that have been altered by the substitution of different codons that encode a functionally equivalent amino acid residue within the sequence, thus producing a silent change.
- gnd-encoding nucleic acid sequences and their complementary sequences can be engineered so as to modify processing or expression.
- the gnd genes depicted in SEQ. ID. Nos.: 22, 16, 18, 24, 26, 20, 42, 28, 30, 40, 32, 36, 38, and 34 can be combined with a promoter sequence and/or ribosome binding site, or a signal sequence may be inserted upstream of 6-PGD-encoding sequences to permit secretion of 6-PGD and thereby facilitate harvesting or bioavailability.
- a given gnd nucleic acid can be mutated in vitro or in vivo, to create and/or destroy translation, initiation, and/or termination sequences, or to create variations in coding regions and/or form new restriction sites or destroy preexisting ones, or to facilitate further in vitro modification.
- Any technique for mutagenesis known in the art can be used, including but not limited to, in vitro site-directed mutagenesis. (Hutchinson et al., J. Biol. Chem. 253:6551 (1978)).
- nucleic acids encoding other proteins or domains of other proteins can be joined to nucleic acids encoding 6-PGD so as to create a fusion protein.
- the resulting fusion proteins can be used as biotechnological tools to investigate the mobility of regions of the gnd/rfb cluster, for example, or to develop strain specific antibodies.
- the nucleic acid embodiments can also be used as biotechnological tools for isolation procedures and diagnostic assays.
- probes that complement these sequences can be designed and manufactured by oligonucleotide synthesis.
- Preferred hybridization probes comprise at least one polymorphism found in Table 1. These probes can be used to screen cDNA or genomic libraries so as to isolate natural sources of the nucleic acid embodiments of the invention or can be used to identify specific strains or classes of strains of E. coli .
- Nos.: 22, 16, 18, 24, 26, 20, 42, 28, 30, 40, 32, 36, 38, and 34 can be used to make oligonucleotide primers by conventional oligonucleotide synthesis for use in amplification strategies, such as PCR.
- oligonucleotide primers can be used, for example, to isolate the nucleic acid embodiments of this invention by amplifying the sequences resident in genomic DNA or biological samples by using PCR or other enzyme-mediated nucleic acid amplification techniques. Such diagnostic and food or water screening techniques are discussed in greater detail below.
- nucleic acids encoding the gnd sequences disclosed in SEQ. ID. Nos.: 22, 16, 18, 24, 26, 20, 42, 28, 30, 40, 32, 36, 38, and 34, or fragments thereof are manipulated using conventional techniques in molecular biology to create recombinant constructs that express 6-PGD or fragments of 6-PGD.
- the discussion that follows describes some of these expression constructs and protein embodiments.
- the 6-PGD polypeptide embodiments or derivatives thereof include but are not limited to, those molecules having as a primary amino acid sequence all of the amino acid sequence substantially as depicted in SEQ. ID. Nos.: 23, 17, 19, 25, 27, 21, 43, 29, 31, 41, 33, 37, 39, and 35 and fragments of these sequences at least three amino acids in length including altered sequences in which functionally equivalent amino acid residues are substituted for residues within the sequence resulting in a silent change.
- Preferred fragments include at least one of the polymorphisms that can be deduced from Table 1, as described previously. It is to be understood that in the following discussion in this section, references made to 6-PGD in a general sense are intended to encompass the proteins and fragments thereof found in SEQ. ID. Nos. 23, 17, 19, 25, 27, 21, 43, 29, 31, 41, 33, 37, 39, and 35.
- one or more amino acid residues within the 6-PGD polypeptide of SEQ. ID. Nos.: 23, 17, 19, 25, 27, 21, 43, 29, 31, 41, 33, 37, 39, and 35 or fragments thereof can be substituted by another amino acid of a similar polarity that acts as a functional equivalent, resulting in a silent alteration.
- Substitutes for an amino acid within the sequence can be selected from other members of the class to which the amino acid belongs.
- the non-polar (hydrophobic) amino acids include alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan, and methionine.
- the polar neutral amino acids include glycine, serine, threonine, cysteine, tyrosine, asparagine and glutamine.
- the positively charged (basic) amino acids include arginine, lysine, and histidine.
- the negatively charged (acidic) amino acids include aspartic acid and glutamic acid.
- the aromatic amino acids include phenylalanine, tryptophan, and tyrosine.
- the 6-PGD fragments of the invention can be less than or equal to 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, and 468
- the 6-PGD polypeptide of SEQ. ID. Nos.: 23, 17, 19, 25, 27, 21, 43, 29, 31, 41, 33, 37, 39, and 35 or fragments thereof or derivatives thereof are differentially modified during or after translation, e.g., by phosphorylation, glycosylation, cross-linking, acylation, proteolytic cleavage, linkage to an antibody molecule, membrane molecule, or other ligand. (Ferguson et al., Ann. Rev. Biochem. 57:285-320 (1988)).
- the 6-PGD polypeptide of SEQ. ID. Nos.: 23, 17, 19, 25, 27, 21, 43, 29, 31, 41, 33, 37, 39, and 35 or fragments thereof are expressed in a cell line.
- the sequences, constructs, vectors, clones, and other materials comprising the present invention can advantageously be in enriched or isolated form.
- “enriched” means that the concentration of the material is at least about 2, 5, 10, 100, or 1000 times its natural concentration (for example), advantageously 0.01%, by weight, preferably at least about 0.1% by weight.
- Enriched preparations from about 0.5%, 1%, 5%, 10%, and 20% by weight are also contemplated.
- isolated requires that the material be removed from its original environment (e.g., the natural environment if it is naturally occuriang).
- a naturally-occuring polynucleotide or polypeptide present in a living animal is not isolated, but the same polynucleotide or polypeptide, separated from some or all of the coexisting materials in the natural system, is isolated. It is also advantageous that the sequences be in purified form.
- purified does not require absolute purity; rather, it is intended as a relative definition. Purification of starting material or natural material to at least one order of magnitude, preferably two or three orders, and more preferably four or five orders of magnitude is expressly contemplated.
- nucleic acids containing the coding sequence for 6-PGD or fragments of 6-PGD are obtained and cloned into a suitable expression vector such that the coding region is operably linked to a heterologous promoter.
- the nucleic acid encoding the protein or polypeptide to be expressed is operably linked to a promoter in an expression vector using conventional cloning technology.
- the expression vector can be in any of the mammalian, yeast, amphibian, insect, parasite, or bacterial expression systems known in the art.
- the following is provided as one exemplary method to express the proteins encoded by the nucleic acids described above.
- the methionine initiation codon for the gene and the poly A signal of the gene are identified. If the nucleic acid encoding the polypeptide to be expressed lacks a methionine to serve as the initiation site, an initiating methionine can be introduced next to the first codon of the nucleic acid using conventional techniques.
- this sequence can be added to the construct by, for example, splicing out the Poly A signal from pSG5 (Stratagene) using BglI and SalI restriction endonuclease enzymes and incorporating it into the mammalian expression vector pXT1 (Stratagene).
- the vector pXT1 contains the LTRs and a portion of the gag gene from Moloney Murine Leukemia Virus. The position of the LTRs in the construct allow efficient stable transfection.
- the vector includes the Herpes Simplex Thymidine Kinase promoter and the selectable neomycin gene.
- the nucleic acid encoding the polypeptide to be expressed can be obtained by PCR from the bacterial vector using oligonucleotide primers complementary to the nucleic acid and containing restriction endonuclease sequences for Pst I incorporated into the 5′ primer and BglII at the 5′ end of the corresponding cDNA 3′ primer, taking care to ensure that the nucleic acid is positioned in frame with the poly A signal.
- the purified fragment obtained from the resulting PCR reaction is digested with PstI, blunt ended with an exonuclease, digested with Bgl II, purified and ligated to pXT1, now containing a poly A signal and digested with BglII.
- the ligated product is transfected into a suitable cell line, e.g., mouse NIH 3T3 cells, using Lipofectin (Life Technologies, Inc., Grand Island, N.Y.) under conditions outlined in the product specification. Positive transfectants are selected after growing the transfected cells in 600 ⁇ g/ml G418 (Sigma, St. Louis, Mo.). Preferably the expressed protein is released into the culture medium, thereby facilitating purification.
- a suitable cell line e.g., mouse NIH 3T3 cells
- Lipofectin Life Technologies, Inc., Grand Island, N.Y.
- Positive transfectants are selected after growing the transfected cells in 600 ⁇ g/ml G418 (Sigma, St. Louis, Mo.).
- the expressed protein is released into the culture medium, thereby facilitating purification.
- Another embodiment utilizes the “Xpress system for expression and purification” (Invitrogen, San Diego, Calif.).
- the Xpress system is designed for high-level production and purification of recombinant proteins from bacterial, mammalian, and insect cells.
- the Xpress vectors produce recombinant proteins fused to a short N-terminal leader peptide that has a high affinity for divalent cations.
- a nickel-chelating resin Invitrogen
- the recombinant protein can be purified in one step and the leader can be subsequently removed by cleavage with enterokinase.
- One preferred vector for the expression of 6-PGD and fragments of 6-PGD is the pBlueBacHis2 Xpress.
- the pBlueBacHis2 Xpress vector is a Baculovirus expression vector containing a multiple cloning site, an ampicillin resistance gene, and a lac z gene.
- the gnd nucleic acid, or portion thereof is cloned into the pBlueBacHis2 Xpress vector and SF9 cells are infected.
- the expression protein is then isolated or purified according to the maufacturer's instructions.
- Several other cultured cell lines having recombinant constructs or vectors comprising gnd or portions thereof are embodiments of the present invention and their manufacture would be routine given the present disclosure.
- Proteins in the culture medium can also be separated by gel electrophoresis. The separated proteins are then detected using techniques such as Coomassie or silver staining or by using antibodies against the protein. Coomassie, silver staining, and immunolabeling of proteins are techniques familiar to those skilled in the art. If desired, the proteins can also be ammonium sulfate precipitated or separated based on size or charge prior to electrophoresis.
- the protein encoded by gnd or portion thereof can also be purified using standard immunochromatography techniques.
- a solution containing the protein such as the culture medium or a cell extract, is applied to a column having antibodies against the protein attached to the chromatography matrix.
- the protein is allowed to bind the immunochromatography column. Thereafter, the column is washed to remove non-specifically bound proteins.
- the specifically bound protein is then released from the column and recovered using standard techniques.
- gnd or portion therof can be incorporated into expression vectors designed for use in purification schemes employing chimeric polypeptides.
- the coding sequence of gnd or portion therof is inserted in frame with the gene encoding the other half of the chimera.
- the other half of the chimera may be ⁇ -globin or a nickel binding polypeptide encoding sequence.
- a chromatography matrix having antibody to ⁇ -globin or nickel attached thereto is then used to purify the chimeric protein.
- Protease cleavage sites can be engineered between the ⁇ -globin gene or the nickel binding polypeptide and the gnd cDNA such as enterokinase.
- the two polypeptides of the chimera can be separated from one another by protease digestion.
- One useful expression vector for generating ⁇ -globin chimerics is pSG5 (Stratagene), which encodes rabbit ⁇ -globin. Intron II of the rabbit ⁇ -globin gene facilitates splicing of the expressed transcript, and the polyadenylation signal incorporated into the construct increases the level of expression.
- these molecules can be prepared by chemical synthesis methods (such as solid phase peptide synthesis) using methods known in the art such as those set forth by Merrifield et al., J. Am. Chem. Soc. 85:2149 (1964), Houghten et al., Proc. Natl. Acad. Sci. USA, 82:51:32 (1985), and Stewart and Young (solid phase peptide synthesis, Pierce Chem Co., Rockford, Ill. (1984).
- Such polypeptides can be synthesized with or without a methionine on the amino terminus.
- 6-PGD and fragments of 6-PGD can be oxidized using methods set forth in these references to form disulfide bridges.
- 6-PGD and fragments of 6-PGD can be employed as biologically active or immunological substitutes for natural, purified 6-PGD and fragments of 6-PGD.
- Analogs of 6-PGD or fragments of 6-PGD include small molecules modeled on the peptides. These small molecules are also known as peptidomimetics.
- a peptidomimetic is a molecule that has the same effect as a peptide, usually because it has the same critical ‘shape’, but is not itself a peptide and hence is not broken down by proteases and is cheaper to produce.
- peptidomimetics that structurally and/or functionally resemble 6-PGD or fragments of 6-PGD can be made and evaluated for their ability to interact with 6-PGD in a 6-PGD characterization assay (e.g., inhibit the function of natural 6-PGD or fragment thereof) or induce an immune response in a subject.
- 6-PGD characterization assay e.g., inhibit the function of natural 6-PGD or fragment thereof
- Several approaches to make peptidomimetics that resemble polypeptides are described in the art. A vast number of methods, for example, can be found in U.S. Pat. Nos. 5,288,707; 5,552,534; 5,811,515; 5,817,626; 5,817,879; 5,821,231; and 5, 874,529, herein incorporated by reference in their entirety.
- the isolated or purified proteins can be used to generate antibodies and tools for identifying agents that interact with 6-PGD and fragments of 6-PGD.
- Antibodies that recognize 6-PGD and fragments of 6-PGD have many uses including, but not limited to, biotechnological applications, therapeutic/prophylactic applications, and diagnostic applications.
- Such antibodies include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, Fab fragments and fragments produced by a Fab expression library.
- various hosts including goats, rabbits, rats, mice, etc can be immunized by injection with 6-PGD or any portion, fragment or oligopeptide that retains immunogenic properties.
- various adjuvants can be used to increase immunological response.
- adjuvants include but are not limited to Freund's, mineral gels such as aluminum hydroxide, and surface active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet hemocyanin, and dinitrophenol.
- BCG Bacillus Calmette-Guerin
- Corynebacterium parvum are potentially useful adjuvants.
- Peptides used to induce specific antibodies can have an amino acid sequence consisting of at least three amino acids, preferably at least 10 or 15 amino acids that include a polymorphism as can be deduced from Table 1.
- Preferred antibodies include ones that specifically bind to a polypeptide having the T218I polymorphism or a nucleic acid having either the C653T or G654C but not 6-PGD or gnd that has the Thr218 or thymine or cytosine polymorphisms at nucleic acid positions 653 and 654, respectively.
- preferred antibodies recognize an epitope that uniquely identifies the Iso218 polymorphism but not the Thr218 polymorphism or vice versa or the antibodies recognize an epitope that uniquely identifies a cytosine at nucleic acid position 653 and/or a guanine at nucleic acid position 654 or a thymine at position 653 and/or a cytosine at nucleic acid position 654.
- short stretches of amino acids encoding fragments of 6-PGD are fused with those of another protein such as keyhole limpet hemocyanin and antibody is produced against the chimeric molecule.
- While antibodies capable of specifically recognizing 6-PGD can be generated by injecting into mice synthetic 3-mer, 10-mer, and 15-mer peptides that correspond to a protein sequence of 6-PGD, a more diverse set of antibodies can be generated by using recombinant or purified 6-PGD and fragments of 6-PGD.
- substantially pure 6-PGD or a fragment of 6-PGD is isolated from a transfected or transformed cell.
- concentration of the polypeptide in the final preparation is adjusted, for example, by concentration on an Amicon filter device, to the level of a few micrograms/ml.
- Monoclonal or polyclonal antibody to the polypeptide of interest can then be prepared as follows:
- Monoclonal antibodies to 6-PGD or a fragment of 6-PGD can be prepared using any technique that provides for the production of antibody molecules by continuous cell lines in culture. These include but are not limited to the hybridoma technique originally described by Koehler and Milstein (Nature 256:495-497 (1975), the human B-cell hybridoma technique (Kosbor et al. Immunol Today 4:72 (1983); Cote et al Proc Natl Acad Sci 80:2026-2030 (1983), and the EBV-hybridoma technique Cole et al. Monoclonal Antibodies and Cancer Therapy, Alan R. Liss Inc, New York N.Y., pp 77-96 (1985).
- Antibodies can also be produced by inducing in vivo production in the lymphocyte population or by screening recombinant immunoglobulin libraries or panels of highly specific binding reagents as disclosed in Orlandi et al., Proc Natl Acad Sci 86: 3833-3837 (1989), and Winter G. and Milstein C; Nature 349:293-299 (1991).
- Antibody fragments that contain specific binding sites for 6-PGD can also be generated.
- fragments include, but are not limited to, the F(ab′) 2 fragments that can be produced by pepsin digestion of the antibody molecule and the Fab fragments that can be generated by reducing the disulfide bridges of the F(ab′) 2 fragments.
- Fab expression libraries can be constructed to allow rapid and easy identification of monoclonal Fab fragments with the desired specificity. (Huse W. D. et al. Science 256:1275-1281 (1989)).
- monoclonal antibodies to 6-PGD of fragments thereof are made as follows. Briefly, a mouse is repetitively inoculated with a few micrograms of the selected protein or peptides derived therefrom over a period of a few weeks. The mouse is then sacrificed, and the antibody producing cells of the spleen isolated. The spleen cells are fused in the presence of polyethylene glycol with mouse myeloma cells, and the excess unfused cells destroyed by growth of the system on selective media comprising aminopterin (HAT media). The successfully fused cells are diluted and aliquots of the dilution placed in wells of a microtiter plate where growth of the culture is continued.
- HAT media aminopterin
- Antibody-producing clones are identified by detection of antibody in the supernatant fluid of the wells by immunoassay procedures, such as ELISA, as originally described by Engvall, E., Meth. Enzymol. 70:419 (1980), and derivative methods thereof. Selected positive clones can be expanded and their monoclonal antibody product harvested for use. Detailed procedures for monoclonal antibody production are described in Davis, L. et al. Basic Methods in Molecular Biology Elsevier, New York. Section 21-2.
- Polyclonal antiserum containing antibodies to heterogenous epitopes of a single protein can be prepared by immunizing suitable animals with the expressed protein or peptides derived therefrom described above, which can be unmodified or modified to enhance immunogenicity.
- Effective polyclonal antibody production is affected by many factors related both to the antigen and the host species. For example, small molecules tend to be less immunogenic than others and may require the use of carriers and adjuvant.
- host animals vary in response to site of inoculations and dose, with both inadequate or excessive doses of antigen resulting in low titer antisera. Small doses (ng level) of antigen administered at multiple intraderinal sites appears to be most reliable.
- An effective immunization protocol for rabbits can be found in Vaitukaitis, J. et al. J. Clin. Endocrinol. Metab. 33:988-991 (1971).
- Booster injections can be given at regular intervals, and antiserum harvested when antibody titer thereof, as determined semi-quantitatively, for example, by double immunodiffusion in agar against known concentrations of the antigen, begins to fall. See, for example, Ouchterlony, O. et al., Chap. 19 in: Handbook of Experimental Immunology D. Wier (ed) Blackwell (1973). Plateau concentration of antibody is usually in the range of 0.1 to 0.2 mg/ml of serum (about 12 ⁇ M). Affinity of the antisera for the antigen is determined by preparing competitive binding curves, as described, for example, by Fisher, D., Chap. 42 in: Manual of Clinical Immunology, 2d Ed.
- Antibody preparations prepared according to either protocol are useful in quantitative immunoassays that determine concentrations of antigen-bearing substances in biological samples; they are also used semi-quantitatively or qualitatively (e.g., in diagnostic embodiments that identify the presence of 6-PGD in biological samples).
- the diagnostics and screening methods of the invention can be classified according to whether the embodiment is a nucleic acid or protein based assay.
- These assays preferably identify and distinguish the strain and extent of pathogenicity of an E. coli present in a biological sample (e.g., a sample from a patient, food source, or liquid source) by detecting the presence of one or more polymorphisms at the gnd locus. That is, several of the diagnostic and screening embodiments focus on the detection of one or more polymorphisms provided in Table 1 or that can be deduced from Table 1 in a nucleic acid or protein sample.
- kits that incorporate the reagents and methods described in the following embodiments so as to allow for the rapid detection and identification of highly pathogenic O157:H7 E. coli are contemplated.
- the diagnostic kits can include a nucleic acid probe or an antibody or combinations thereof, which specifically detect the one or more polymorphisms described in Table 1 or that can be deduced from Table 1.
- the detection component of these kits will typically be supplied in combination with one or more of the following reagents.
- a support capable of absorbing or otherwise binding DNA, RNA, or protein will often be supplied. Available supports include membranes of nitrocellulose, nylon or derivatized nylon that can be characterized by bearing an array of positively charged substituents.
- One or more restriction enzymes, control reagents, buffers, amplification enzymes, and non-human polynucleotides like calf-thymus or salmon-sperm DNA can be supplied in these kits.
- Useful nucleic acid-based diagnostic techniques include, but are not limited to, direct DNA sequencing, Southern Blot analysis, single-stranded confirmation analysis (SSCA), RNase protection assay, dot blot analysis, nucleic acid amplification, and combinations of these approaches.
- the starting point for these analysis is isolated or purified DNA from a biological sample. Most simply, fecal material is obtained from a subject to be tested or a food or water sample is provided.
- the bactrerial DNA is extracted from the sample and amplified by a DNA amplification technique such as PCR using primers that correspond to regions of the gnd locus and/or the gnd/rfb cluster, preferably regions having a polymorphism listed in Table 1.
- SSCA single-stranded confirmation polymorphism assay
- CDGE clamped denaturing gel electrophoresis
- HA heteroduplex analysis
- CMC chemical mismatch cleavage
- a rapid preliminary analysis to detect polymorphisms and DNA sequences can be performed by looking at a series of Southern Blots of DNA cut with one or more restriction enzymes preferably with a large number of restriction enzymes. Each block contains lanes of DNA from uninfected individuals and the DNA to be tested. Southern Blots displaying hybridizing fragments when probed with sequences corresponding to one or more polymorphisms described in Table 1 indicate the presence of the specific E. coli strain.
- the detection of point mutations can also be accomplished by amplifying the DNA directly from the sample using primers corresponding to the regions flanking one or more polymorphisms described in Table 1 by standard PCR techniques and sequencing the amplicons, as will be discussed in greater detail below.
- nucleic acid-based methods for confirming the presence of one or more polymorphisms described in Table 1 are provided below. Provided for exemplary purposes only and not intended to limit any aspect of the invention, these methods include:
- DGGE denaturing gradient gel electrophoresis
- TTGE temporal temperature gradient gel electrophoresis
- SSCA SSCA
- TTGE TTGE
- RNase protection involves cleavage of the mutant polynucleotide into two or more smaller fragments.
- DGGE detects differences in migration rates of sequences compared to less pathogenic strain gnd sequences, using a denaturing gradient gel.
- ASOs allele-specific oligonucleotide assay
- an oligonucleotide is designed that detects a specific sequence, and an assay is performed by detecting the presence or absence of a hybridization signal.
- the protein binds only to sequences that contain a nucleotide mismatch in a heteroduplex between polymorphic and non-polymorphic sequences.
- Mismatches in this sense of the word refers to hybridized nucleic acid duplexes in which the two strands are not 100% complementary.
- the lack of total homology results from the presence of one or more polymorphisms in an amplicon obtained from a biological sample, for example, that has been hybridized to a non-polymorphic strand.
- Mismatched detection can be used to detect point mutations in the gnd gene or in its mRNA product. While these techniques are less sensitive than sequencing, they are easily performed on a large number of biological samples and are amenable to array technology.
- the nucleic acid embodiments of the present invention are attached to a support in an ordered array wherein a plurality of nucleic acid probes are attached to distinct regions of the support that do not overlap with each other.
- such an ordered array is designed to be “addressable” where the distinct locations of the probe are recorded and can be accessed as part of an assay procedure.
- addressable nucleic acid arrays comprise a plurality of nucleic acid probes that complement a plurality of polymorphisms listed in Table 1. These probes are joined to a support in different known locations. The knowledge of the precise location of each nucleic acid probe makes these “addressable” arrays particularly useful in binding assays.
- the nucleic acids from a preparation of several biological samples are then labeled by conventional approaches (e.g., radioactivity or fluorescence) and the labeled samples are applied to the array under conditions that permit hybridization. If a nucleic acid in the samples hybridizes to a probe on the array, then a signal will be detected at a position on the support that corresponds to the location of the hybrid. Since the identity of each labeled sample is known and the region of the support on which the labeled sample was applied is known, an identification of the presence and polymorphic variant (i.e., the strain of E. coli ) can be rapidly determined. Conventional methods in DNA amplification, as will be discussed below, can also be incorporated so as to detect the presence of less than 10 bacterial cells. These approaches are easily automated using technology known to those of skill in the art of high throughput diagnostic or detection analysis.
- conventional approaches e.g., radioactivity or fluorescence
- Nucleic acids present in biological samples can be disposed on a support so as to create an addressable array.
- the samples are disposed on the support at known positions that do not overlap.
- the presence of nucleic acids having a desired polymorphism in each sample is determined by applying labeled nucleic acid probes that complement nucleic acids that encode the polymorphism and detecting the presence of a signal at locations on the array that correspond to the positions at which the biological samples were disposed. Because the identity of the biological sample and its position on the array is known, the identification of the polymorphic variant can be rapidly determined.
- conventional methods in DNA amplification can be incorporated so as to detect the presence of very few bacterial cells.
- any addressable array technology known in the art can be employed with this aspect of the invention.
- One particular embodiment of polynucleotide arrays is known as GenechipsTM, and has been generally described in U.S. Pat. No. 5,143,854; PCT publications WO 90/15070 and 92/10092. These arrays are generally produced using mechanical synthesis methods or light directed synthesis methods, which incorporate a combination of photolithographic methods and solid phase oligonucleotide synthesis. (Fodor et al., Science, 251:767-777, (1991)).
- VLSIPSTM Very Large Scale Immobilized Polymer Synthesis
- a wide variety of labels and conjugation techniques are known by those skilled in the art and can be used in various nucleic acid assays.
- There are several ways to produce labeled nucleic acids for hybridization or PCR including, but not limited to, oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled nucleotide.
- a nucleic acid encoding 6-PGD, or any portion of it can be cloned into a vector for the production of an mRNA probe.
- RNA polymerase such as T7, T3 or SP6 and labeled nucleotides.
- RNA polymerase such as T7, T3 or SP6 and labeled nucleotides.
- Suitable reporter molecules or labels include those radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents, as well as, substrates, cofactors, inhibitors, magnetic particles and the like.
- RNA mismatch cleavage technique that is amenable to array technology
- the method involves the use of a labeled riboprobe which is complementary to a gnd sequence having a polymorphism (e.g., the C653T and G654C polymorphism that distinguishes highly pathogenic O157:H7 and O55:H7 from less pathogenic E. coli strains).
- the riboprobe and either mRNA or DNA isolated and amplified from a biological sample are annealed (hybridized) and subsequently digested with the enzyme RNase A, which is able to detect mismatches in a duplex RNase structure.
- RNA product will be seen which is much smaller than the full length duplex RNA for the riboprobe and the mRNA or DNA.
- complements to the riboprobe can be dispersed on an array and stringently probed with the products from the Rnase A digestion after denaturing any remaining hybrids.
- RNA probes can be used to detect mismatches, through enzymatic or chemical cleavage. See, e.g., Cotton, et al., Proc. Natl. Acad. Sci.
- mismatches can be detected by shifts in the electrophoretic ability of mismatched duplexes relative to matched duplexes.
- riboprobes or DNA probes the cellular mRNA or DNA that corresponds to regions of gnd containing polymorphisms can be amplified by PCR before hybridization. DNA sequences isolated from biological samples which have been amplified by use of PCR can then be screened using allele-specific probes. These probes are nucleic acid oligomers, each of which contains a region including one or more polymorphisms present in Table 1.
- one oligomer may be about 30 nucleotides in length and corresponds to the C653T and G654C polymorphism.
- PCR amplification products can be screened to identify the presence of specific polymorphisms.
- the most definitive test for the presence of a highly pathogenic E. coli in a sample is to directly compare nucleotide or protein sequences isolated from a biological sample with one or more of the polymorphisms present in Table 1.
- PCR techniques are familiar to those skilled in the art. For a review of PCR technology, see Molecular Cloning to Genetic Engineering White, B. A. Ed. in Methods in Molecular Biology 67: Humana Press, Totowa (1997), the disclosure of which is incorporated herein by reference in its entirety and the publication entitled “PCR Methods and Applications” (1991, Cold Spring Harbor Laboratory Press), the disclosure of which is incorporated herein by reference in its entirety.
- RT-PCR reverse transcribe mRNA into cDNA followed by PCR
- primers on either side of the sequence to be amplified are added to a suitably prepared nucleic acid sample along with dNTPs and a thermostable polymerase such as Taq polymerase, Pfu polymerase, or Vent polymerase.
- a thermostable polymerase such as Taq polymerase, Pfu polymerase, or Vent polymerase.
- the nucleic acid in the sample is denatured and the primers are specifically hybridized to complementary nucleic acid sequences in the sample.
- the hybridized primers are extended. Thereafter, another cycle of denaturation, hybridization, and extension is initiated. The cycles are repeated multiple times to produce an amplified fragment containing the nucleic acid sequence between the primer sites.
- PCR has further been described in several patents including U.S. Pat. Nos. 4,683,195, 4,683,202 and 4,965,188, the disclosure of which is incorporated herein by reference in its entirety.
- the primers are selected to be substantially complementary to a portion of the sequence of gnd DNA or mRNA and a portion of the sequence that complements the sequence of gnd DNA or mRNA, thereby allowing the sequences between the primers to be amplified.
- the length of the primers for use with this aspect of the invention is identical to most of the lengths of the nucleic acid embodiments provided previously.
- primer length can be less than or equal to 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 525, 550, 575, 600, 6
- primers are 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29 and 30 nucleotides in length. Shorter primers tend to lack specificity for a target nucleic acid sequence and generally require cooler temperatures to form sufficiently stable hybrid complexes with the template. Longer primers are expensive to produce and can sometimes self-hybridize to form hairpin structures.
- the formation of stable hybrids depends on the melting temperature (Tm) of the DNA. The Tm depends on the length of the primer, the ionic strength of the solution and the G+C content. The higher the G+C content of the primer, the higher is the melting temperature because G:C pairs are held by three H bonds whereas A:T pairs have only two.
- the G+C content of the amplification primers of the present invention preferably ranges between 10 and 75%, more preferably between 35 and 60%, and most preferably between 40 and 55%.
- the appropriate length for primers under a particular set of assay conditions may be empirically determined by one of skill in the art.
- the spacing of the primers determines the length of the segment to be amplified.
- amplified segments carrying nucleic acid sequence encoding fragments of 6-PGD can range in size from at least about 25 bp to 35 kb.
- Amplification fragments from 25-1407 bp are typical, fragments from 50-1000 bp are preferred and fragments from 100-600 bp are highly preferred.
- amplification primers for the gnd genes of the invention can be of any sequence that allows for specific amplification of a region of the gnd genes disclosed in SEQ. ID. Nos. 22, 16, 18, 24, 26, 20, 42, 28, 30, 40, 32, 36, 38, and 34 and can, for example, include modifications such as restriction sites to facilitate cloning.
- highly pathogenic O157:H7 E. coli are identified and differentiated from less pathogenic E. coli by employing PCR amplification with two sets of primers.
- a first set of primers is designed to produce an amplicon containing at least the C653T and G654C polymorphisms, which distinguish the highly pathogenic O157:H7 and O55:H7 E. coli from less pathogenic strains.
- a second set of primers is designed to produce an amplicon that is unique to the O55:H7 parasite, e.g., the primer pair: 5′GCGTTCTTAAAGAGTCCTGC3′ (SEQ. ID. No. 13) and 5′TGCCCGCTACATCTCCTC3′ (SEQ.
- the presence of a 6-PGD protein of the invention can also be detected by using conventional assays.
- monoclonal antibodies immunoreactive with a polymorphism found on a specific 6-PGD sequence can be used to screen biological samples for the presence of a particular strain of E. coli and can be used to distinguish one strain from another. Because the T218I polymorphism can distinguish highly pathogenic O157:H7 and O55:H7 from less pathogenic O157:H7, diagnostic and screening assays that comprise reagents and methods that involve the detection of the presence or absence of the T218I polymorphism are preferred embodiments.
- diagnostic assays can also include a reagent that specifically differentiates the O55:H7 and O157:H7 parasites, for example, an antibody directed to an epitope found in a region of the O55:H7 6-PGD protein that is not homologous to the 6-PGD protein from an O157:H7 parasite.
- a reagent that specifically differentiates the O55:H7 and O157:H7 parasites for example, an antibody directed to an epitope found in a region of the O55:H7 6-PGD protein that is not homologous to the 6-PGD protein from an O157:H7 parasite.
- Such immunological assays can be done in many convenient formats.
- antibodies are used to immunoprecipitate the 6-PGD of the invention from solution and, in another embodiment, antibodies are used to react with 6-PGD on Western or Immuneblots of a polyacrylamide gel.
- ELISA enzyme-linked immunosorbant assays
- RIA radioimmunoassays
- IRMA immunoradiometric assays
- IEMA immunoenzymatic assays
- Exemplary sandwich assays are described by David et al., in U.S. Pat. Nos. 4,376,110 and 4,486,530, hereby incorporated by reference.
- antibodies of the present invention are attached to a support in an ordered array wherein a plurality of antibodies are attached to distinct regions of the support that do not overlap with each other.
- the protein-based arrays are ordered arrays that are designed to be “addressable” such that the distinct locations are recorded and can be accessed as part of an assay procedure.
- addressable antibody arrays comprise a plurality of antibodies that recognize the 6-PGD polymorphisms that can be deduced from Table 1. These probes are joined to a support in different known locations. The knowledge of the precise location of each probe makes these “addressable” arrays particularly useful in binding assays.
- an addressable array can comprise a support having several regions to which are joined a plurality of antibody probes that recognize the 6-PGD polymorphisms that can be deduced from Table 1. Proteins obtained from biological samples are labeled by conventional approaches (e.g., radioactivity, calorimetrically, or fluorescently) and the labeled samples are applied to the array under conditions that permit binding.
- a protein in the sample binds to an antibody probe on the array, then a signal will be detected at a position on the support that corresponds to the location of the antibody-protein complex. Since the identity of each labeled sample is known and the region of the support on which the labeled sample was applied is known, an identification of the presence, concentration, and/or expression level is rapidly determined. That is, by employing labeled standards of a known concentration of 6-PGD, an investigator can accurately determine the protein concentration of 6-PGD in a sample and from this information can assess the expression level of 6-PGD. Conventional methods in densitometry can also be used to more accurately determine the concentration or expression level of 6-PGD. These approaches are easily automated using technology known to those of skill in the art of high throughput diagnostic analysis.
- Proteins present in biological samples can be disposed on a support so as to create an addressable array.
- the protein samples are disposed on the support at known positions that do not overlap.
- the presence of a protein encoding a specific form of 6-PGD in each sample is then determined by applying labeled antibody probes that recognize epitopes of 6-PGD that correspond to the polymorphisms that can be deduced from Table 1 and detecting a signal at locations on the array that correspond to the positions at which the biological samples were disposed. Because the identity of the biological sample and its position on the array is known, an identification of the presence, concentration, and/or expression level of a particular 6-PGD can be rapidly determined.
- an investigator can accurately determine the concentration of 6-PGD in a sample and from this information can assess the expression level of 6-PGD.
- Conventional methods in densitometry can also be used to more accurately determine the concentration or expression level of 6-PGD.
- These approaches are also easily automated using technology known to those of skill in the art of high throughput diagnostic analysis.
- any addressable array technology known in the art can be employed with this aspect of the invention and display the protein arrays on the chips in an attempt to maximize antibody binding patterns and diagnostic information.
- the presence or detection of one or more polymorphisms in 6-PGD can provide a diagnosis of a subject's disease or indicate the contamination of a food or water supply.
- Additional embodiments include the preparation of diagnostic kits comprising detection components such as antibodies specific for one or more polymorphisms of 6-PGD.
- the detection component will typically be supplied in combination with one or more of the following reagents.
- a support capable of absorbing or otherwise binding RNA or protein will often be supplied. Available supports for this purpose include, but are not limited to, membranes of nitrocellulose, nylon or derivatized nylon that can be characterized by bearing an array of positively charged substituents, and GenechipsTM or their equivalents.
- One or more enzymes such as Reverse Transcriptase and/or Taq polymerase, can be furnished in the kit, as can dNTPs, buffers, or non-human polynucleotides like calf-thymus or salmon-sperm DNA. Results from the kit assays can be interpreted by a healthcare provider or a diagnostic laboratory. Alternatively, diagnostic kits are manufactured and sold to private individuals for self-diagnosis.
- restriction site “A” is present in fragment “B-G” (i.e., “A 1 , A 2 , and A 3 ”), defined below.
- B corresponds to the left-hand border of a pathogenicity or antigenicity island and “G” corresponds to the right hand border of this element.
- a 1 is the first restriction site A site to the left of B
- a 2 is the first restriction site A site to the right of G.
- the BG island flanking this sequence can be determined by using inverse PCR.
- Primers “C”, “D”, “E”, and “F” are derived from the sequence of the unique pathogenicity/antigenicity island. Actual sequence is derived from the raw data, depicted in the 5′ to 3′ direction, as indicated under the line shown in FIG. 3. The primers are in the same (primer D, primer F) or opposite orientation (primer C and primer E).
- E. coli DNA is digested to completion with enzyme A. Ligase is then added and the resulting fragments are re-circularized. Primers are added in separate tubes with a heat stable polymerase and PCR is conducted to obtain amplicons. The amplicons are cloned and sequenced. This approach identifies sequences beyond the 5′ and 3′ ends of the known pathogenicity/antigenicity islands and primers derived from these sequences are used to amplify this region in a variety of pathogens and non-pathogens, as was performed for the gnd allele. The resulting amplicons are then sequenced to identify differentiating polymorphisms.
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- Organic Chemistry (AREA)
- Biomedical Technology (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Medicinal Chemistry (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Peptides Or Proteins (AREA)
- Enzymes And Modification Thereof (AREA)
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/875,573 US20020150902A1 (en) | 1998-12-08 | 2001-06-05 | Polymorphic loci that differentiate escherichia coli 0157:H7 from other strains |
US11/787,195 US8597877B2 (en) | 1998-12-08 | 2007-04-12 | Polymorphic loci that differentiate Escherichia coli O157:H7 from other strains |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11149398P | 1998-12-08 | 1998-12-08 | |
PCT/US1999/029149 WO2000034484A1 (en) | 1998-12-08 | 1999-12-08 | Polymorphic loci that differentiate escherichia coli 0157:h7 from other strains |
US09/875,573 US20020150902A1 (en) | 1998-12-08 | 2001-06-05 | Polymorphic loci that differentiate escherichia coli 0157:H7 from other strains |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US1999/029149 Continuation WO2000034484A1 (en) | 1998-12-08 | 1999-12-08 | Polymorphic loci that differentiate escherichia coli 0157:h7 from other strains |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US11/787,195 Continuation US8597877B2 (en) | 1998-12-08 | 2007-04-12 | Polymorphic loci that differentiate Escherichia coli O157:H7 from other strains |
Publications (1)
Publication Number | Publication Date |
---|---|
US20020150902A1 true US20020150902A1 (en) | 2002-10-17 |
Family
ID=22338866
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/875,573 Abandoned US20020150902A1 (en) | 1998-12-08 | 2001-06-05 | Polymorphic loci that differentiate escherichia coli 0157:H7 from other strains |
US11/787,195 Expired - Lifetime US8597877B2 (en) | 1998-12-08 | 2007-04-12 | Polymorphic loci that differentiate Escherichia coli O157:H7 from other strains |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US11/787,195 Expired - Lifetime US8597877B2 (en) | 1998-12-08 | 2007-04-12 | Polymorphic loci that differentiate Escherichia coli O157:H7 from other strains |
Country Status (10)
Country | Link |
---|---|
US (2) | US20020150902A1 (de) |
EP (1) | EP1147196A4 (de) |
JP (1) | JP2002531130A (de) |
KR (1) | KR20010090879A (de) |
CN (1) | CN1333831A (de) |
AU (1) | AU3115300A (de) |
CA (1) | CA2352534A1 (de) |
MX (1) | MXPA01005728A (de) |
NZ (1) | NZ512088A (de) |
WO (1) | WO2000034484A1 (de) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1766021A1 (de) * | 2004-05-20 | 2007-03-28 | Warnex Research Inc. | Polynukleotide zum nachweis von escherichia coli 157:h7- und escherichia coli 0157:nm-verotoxinproduzenten |
US20110165568A1 (en) * | 2009-12-31 | 2011-07-07 | Life Technologies Corporation | Sequences of e.coli 055:h7 genome |
US20120082989A1 (en) * | 2009-04-10 | 2012-04-05 | Torres Alfredo G | Uses for long polar fimbriae genes of pathogenic escherichia coli strains |
US20140005061A1 (en) * | 2012-06-29 | 2014-01-02 | Life Technologies Corporation | Compositions and methods for detection of multiple microorganisms |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP4074251B2 (ja) | 2001-12-03 | 2008-04-09 | 協和醗酵工業株式会社 | 変異型6−ホスホグルコン酸デヒドロゲナーゼ |
CN1313477C (zh) * | 2004-12-30 | 2007-05-02 | 天津生物芯片技术有限责任公司 | 对大肠杆菌o25型的o-抗原特异的核苷酸 |
KR101109565B1 (ko) * | 2010-06-11 | 2012-01-31 | 서울대학교산학협력단 | 대장균 o157:h7 검출을 위한 배지 조성물 |
CN102520167B (zh) * | 2011-11-15 | 2013-12-04 | 吉林出入境检验检疫局检验检疫技术中心 | 液相芯片检测大肠杆菌o157的方法 |
CN103468605B (zh) * | 2013-08-05 | 2015-04-22 | 浙江大学 | 分离自冷浸田土壤的肠杆菌cj-22及其应用 |
CN108362629B (zh) * | 2018-02-09 | 2021-02-05 | 中国计量科学研究院 | 大肠杆菌o157:h7单个活菌的快速检测方法及试剂盒 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5474796A (en) * | 1991-09-04 | 1995-12-12 | Protogene Laboratories, Inc. | Method and apparatus for conducting an array of chemical reactions on a support surface |
US5580703A (en) * | 1992-09-17 | 1996-12-03 | American Cyanamid Company | Human adeno-associated virus integration site DNA and uses thereof |
US5792833A (en) * | 1994-12-22 | 1998-08-11 | New England Medical Center Hospitals, Inc. | E2 binding proteins |
US5855885A (en) * | 1993-01-22 | 1999-01-05 | Smith; Rodger | Isolation and production of catalytic antibodies using phage technology |
Family Cites Families (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DK6488D0 (da) | 1988-01-07 | 1988-01-07 | Novo Industri As | Enzymer |
IE61148B1 (en) | 1988-03-10 | 1994-10-05 | Ici Plc | Method of detecting nucleotide sequences |
US5082767A (en) | 1989-02-27 | 1992-01-21 | Hatfield G Wesley | Codon pair utilization |
US5126239A (en) | 1990-03-14 | 1992-06-30 | E. I. Du Pont De Nemours And Company | Process for detecting polymorphisms on the basis of nucleotide differences |
US5288707A (en) | 1990-08-13 | 1994-02-22 | Sandoz Ltd. | Borolysine peptidomimetics |
US5552534A (en) | 1991-08-22 | 1996-09-03 | The Trustees Of The University Of Pennsylvania | Non-Peptide peptidomimetics |
US5550251A (en) | 1991-12-12 | 1996-08-27 | The Trustees Of The University Of Pennsylvania | Alicyclic peptidomimetics |
JP2534597B2 (ja) | 1991-12-27 | 1996-09-18 | 財団法人癌研究会 | プロ―ブ |
GB9214857D0 (en) | 1992-07-13 | 1992-08-26 | Medical Res Council | Human nucleic acid fragments and their use |
WO1994010325A1 (en) | 1992-11-05 | 1994-05-11 | Xyrofin Oy | Recombinant method and host for manufacture of xylitol |
WO1994028017A1 (en) | 1993-06-01 | 1994-12-08 | The Scripps Research Institute | Human mutant tissue factor compositions useful as tissue factor antagonists |
US5821231A (en) | 1993-12-06 | 1998-10-13 | Cytel Corporation | CS-1 peptidomimetics, compositions and methods of using same |
IL109943A (en) | 1994-06-08 | 2006-08-01 | Develogen Israel Ltd | Conformationally constrained backbone cyclized peptide analogs |
US5817626A (en) | 1995-03-14 | 1998-10-06 | Praecis Pharmaceuticals Incorporated | Modulators of beta-amyloid peptide aggregation |
US5654417A (en) * | 1995-04-14 | 1997-08-05 | Children's Hospital And Medical Center | Nucleic acid probes for detecting E. coli O157:H7 |
US5811515A (en) | 1995-06-12 | 1998-09-22 | California Institute Of Technology | Synthesis of conformationally restricted amino acids, peptides, and peptidomimetics by catalytic ring closing metathesis |
US5612473A (en) | 1996-01-16 | 1997-03-18 | Gull Laboratories | Methods, kits and solutions for preparing sample material for nucleic acid amplification |
US5747257A (en) | 1996-02-29 | 1998-05-05 | E. I. Du Pont De Nemours And Company | Genetic markers and methods for the detection of escherichia coli serotype-0157:H7 |
US5758293A (en) | 1996-03-06 | 1998-05-26 | Motorola Inc. | Subscriber unit and delivery system for wireless information retrieval |
-
1999
- 1999-12-08 NZ NZ512088A patent/NZ512088A/xx unknown
- 1999-12-08 CA CA002352534A patent/CA2352534A1/en not_active Abandoned
- 1999-12-08 WO PCT/US1999/029149 patent/WO2000034484A1/en not_active Application Discontinuation
- 1999-12-08 JP JP2000586917A patent/JP2002531130A/ja active Pending
- 1999-12-08 MX MXPA01005728A patent/MXPA01005728A/es unknown
- 1999-12-08 AU AU31153/00A patent/AU3115300A/en not_active Abandoned
- 1999-12-08 KR KR1020017007204A patent/KR20010090879A/ko not_active Application Discontinuation
- 1999-12-08 CN CN99815683A patent/CN1333831A/zh active Pending
- 1999-12-08 EP EP99965181A patent/EP1147196A4/de not_active Withdrawn
-
2001
- 2001-06-05 US US09/875,573 patent/US20020150902A1/en not_active Abandoned
-
2007
- 2007-04-12 US US11/787,195 patent/US8597877B2/en not_active Expired - Lifetime
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5474796A (en) * | 1991-09-04 | 1995-12-12 | Protogene Laboratories, Inc. | Method and apparatus for conducting an array of chemical reactions on a support surface |
US5580703A (en) * | 1992-09-17 | 1996-12-03 | American Cyanamid Company | Human adeno-associated virus integration site DNA and uses thereof |
US5855885A (en) * | 1993-01-22 | 1999-01-05 | Smith; Rodger | Isolation and production of catalytic antibodies using phage technology |
US5792833A (en) * | 1994-12-22 | 1998-08-11 | New England Medical Center Hospitals, Inc. | E2 binding proteins |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1766021A1 (de) * | 2004-05-20 | 2007-03-28 | Warnex Research Inc. | Polynukleotide zum nachweis von escherichia coli 157:h7- und escherichia coli 0157:nm-verotoxinproduzenten |
EP1766021A4 (de) * | 2004-05-20 | 2008-05-14 | Warnex Res Inc | Polynukleotide zum nachweis von escherichia coli 157:h7- und escherichia coli 0157:nm-verotoxinproduzenten |
US20090170076A1 (en) * | 2004-05-20 | 2009-07-02 | Gregory Taylor | Polynucleotides for the detection of escherichia coli 0157:h7 and escherichia coli 0157:nm verotoxin producers |
US8088572B2 (en) | 2004-05-20 | 2012-01-03 | Aes Chemunex S.A. | Polynucleotides for the detection of Escherichia coli O157:H7 and Escherichia coli O157:NM verotoxin producers |
US20120082989A1 (en) * | 2009-04-10 | 2012-04-05 | Torres Alfredo G | Uses for long polar fimbriae genes of pathogenic escherichia coli strains |
US20110165568A1 (en) * | 2009-12-31 | 2011-07-07 | Life Technologies Corporation | Sequences of e.coli 055:h7 genome |
WO2011082325A2 (en) | 2009-12-31 | 2011-07-07 | Life Technologies Corporation | Sequences of e.coli 055:h7 genome |
US20140005061A1 (en) * | 2012-06-29 | 2014-01-02 | Life Technologies Corporation | Compositions and methods for detection of multiple microorganisms |
Also Published As
Publication number | Publication date |
---|---|
MXPA01005728A (es) | 2003-07-14 |
US8597877B2 (en) | 2013-12-03 |
EP1147196A1 (de) | 2001-10-24 |
US20080102458A1 (en) | 2008-05-01 |
WO2000034484A1 (en) | 2000-06-15 |
KR20010090879A (ko) | 2001-10-19 |
EP1147196A4 (de) | 2002-10-16 |
JP2002531130A (ja) | 2002-09-24 |
CN1333831A (zh) | 2002-01-30 |
AU3115300A (en) | 2000-06-26 |
NZ512088A (en) | 2004-01-30 |
CA2352534A1 (en) | 2000-06-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8597877B2 (en) | Polymorphic loci that differentiate Escherichia coli O157:H7 from other strains | |
Ng et al. | Regulation of the pspA virulence factor and essential pcsB murein biosynthetic genes by the phosphorylated VicR (YycF) response regulator in Streptococcus pneumoniae | |
AU700752B2 (en) | Nucleic acid probes for detecting E. coli 0157:H7 | |
Harvey et al. | A variable region within the genome of Streptococcus pneumoniae contributes to strain-strain variation in virulence | |
JP4969638B2 (ja) | 単球由来核酸および関連する組成物ならびに方法 | |
Taylor et al. | Cloning and nucleotide sequence of the gyrA gene from Campylobacter fetus subsp. fetus ATCC 27374 and characterization of ciprofloxacin-resistant laboratory and clinical isolates | |
EP1698698B1 (de) | Zytoletal-distendierende-toxin gene als ziele zum nachweis von campylobacter bakterien | |
CN114836531A (zh) | 一种用于检测类孟买血型变异型的snp位点 | |
Klena et al. | Cloning, sequencing, and characterization of the lipopolysaccharide biosynthetic enzyme heptosyltransferase I gene (waaC) from Campylobacter jejuni and Campylobacter coli | |
CN111850141A (zh) | 一种用于检测类孟买血型的基因多态性位点标记 | |
US7601822B2 (en) | Molecular identification of bacteria of genus Streptococcus and related genera | |
EP1424401A1 (de) | Verfahren zum nachweis, zur identifizierung und zum zählen von enteritis vibrio unter verwendung von rna-polymerase sigma-70-faktor codierenden (rpod-) gensequenzen | |
EP0849360A1 (de) | Monofunktionales Glycosyltransferase-Gen aus Staphylococcus aureus | |
WO2007067803A9 (en) | Diagnostic targets against johne's disease | |
WO2003097821A1 (fr) | Synthase de cereulide produite par bacillus cereus, gene codant pour celle-ci et procede de detection de cereulide | |
JP2000512493A (ja) | ピラジンアミド―耐性ミコバクテリアの同定およびミコバクテリア感染の治療方法 | |
Han et al. | Identification of H. pylori strain specific DNA sequences between two clinical isolates from NUD and gastric ulcer by SSH | |
RU2397251C1 (ru) | Набор для детекции возбудителя легионеллеза в биологическом материале и объектах окружающей среды | |
KR20230079107A (ko) | 개선된 특성을 갖는 유전자 변형된 메틸로바실러스 세균 | |
WO2003042655A2 (en) | Diagnostic assay for antibiotic tolerance | |
US6905865B1 (en) | Detection of a gene, vatE, encoding an acetyltransferase inactivating streptogramin | |
US6649394B1 (en) | Topoisomerase IV, corresponding nucleotide sequences and uses thereof | |
TW200411067A (en) | An isolated gene encoding a unique haemolysin of klebsiella pneumoniae and its uses in the detection of infections caused by klebsiella pneumoniae isolates | |
WO2003094860A2 (en) | Use of homo sapiens chromosome 1 open reading frame 28 (c1orf28) in the diagnosis of hyperparathyroidism-jaw tumor syndrome | |
Stanley et al. | Multilocus enzyme electrophoresis typing of clinical campylobacters from outbreak and sporadic sources |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: CHILDREN'S HOSPITAL AND REGIONAL MEDICAL CENTER, W Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:TARR, PHILLIP I.;REEL/FRAME:012198/0920 Effective date: 20010913 |
|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |