EP1328641A2 - Polynucleotides isolated from e. coli of nature b2/d+ a-, and uses thereof - Google Patents

Polynucleotides isolated from e. coli of nature b2/d+ a-, and uses thereof

Info

Publication number
EP1328641A2
EP1328641A2 EP01917119A EP01917119A EP1328641A2 EP 1328641 A2 EP1328641 A2 EP 1328641A2 EP 01917119 A EP01917119 A EP 01917119A EP 01917119 A EP01917119 A EP 01917119A EP 1328641 A2 EP1328641 A2 EP 1328641A2
Authority
EP
European Patent Office
Prior art keywords
polynucleotides
coli
group
sequence
seq
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
EP01917119A
Other languages
German (de)
French (fr)
Inventor
Edouard Bingen
Stéphane BONACORSI
Olivier Clermont
Xavier Nassif
Colin Tinsley
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Assistance Publique Hopitaux de Paris APHP
Institut National de la Sante et de la Recherche Medicale INSERM
Original Assignee
Assistance Publique Hopitaux de Paris APHP
Institut National de la Sante et de la Recherche Medicale INSERM
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from FR0003145A external-priority patent/FR2806096A1/en
Application filed by Assistance Publique Hopitaux de Paris APHP, Institut National de la Sante et de la Recherche Medicale INSERM filed Critical Assistance Publique Hopitaux de Paris APHP
Publication of EP1328641A2 publication Critical patent/EP1328641A2/en
Ceased legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6888Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms
    • C12Q1/689Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms for bacteria
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P13/00Drugs for disorders of the urinary system
    • A61P13/12Drugs for disorders of the urinary system of the kidneys
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P25/00Drugs for disorders of the nervous system
    • A61P25/28Drugs for disorders of the nervous system for treating neurodegenerative disorders of the central nervous system, e.g. nootropic agents, cognition enhancers, drugs for treating Alzheimer's disease or other forms of dementia
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P31/00Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
    • A61P31/04Antibacterial agents
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/195Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
    • C07K14/24Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Enterobacteriaceae (F), e.g. Citrobacter, Serratia, Proteus, Providencia, Morganella, Yersinia
    • C07K14/245Escherichia (G)
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K39/00Medicinal preparations containing antigens or antibodies
    • A61K2039/51Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
    • A61K2039/53DNA (RNA) vaccination
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K48/00Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/158Expression markers
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/16Primer sets for multiplex assays

Definitions

  • the present invention relates, in general, to polynucleotides which are of nature B2/D+ A- and which are isolated from E. coli, and to the biological uses, in particular medical and biotechnological uses, of these polynucleotides and of the polypeptides which they encode.
  • the expression "the nature B2/D+ A-" is intended to mean that the polynucleotide is present with greater frequency such as observed in ECOR E. coli of group B2 (ECOR B2 frequency) and/or in ECOR E. coli of group D (ECOR D frequency) , with respect to the frequency observed in ECOR E. coli of group A (ECOR A frequency) .
  • the ECOR B2 frequency and/or the ECOR D frequency is greater than the ECOR A frequency by. a factor of 2, preferably of 3, more preferably of 3.5, very preferably by a factor of 4.
  • the set of polynucleotides which are of nature B2/D+ A-, provided by the present invention, comprises in particular products the ECOR B2 frequency and/or the ECOR D frequency of which is greater than 10%, and the ECOR A frequency of which is less than 25%, preferably less than 20%, more preferably less than 10%, and very preferably less than 5%, while always remaining less than the ECOR B2 frequency and/or the ECOR D frequency.
  • the ECOR collection is a collection which represents the genetic diversity of the E. coli species; it is available from banks such as the ATCC.
  • the E. coli species is currently divided into four main phylogenic groups termed groups A, Bl, B2 and D.
  • groups A, Bl, B2 and D The E. coli species are currently divided into four main phylogenic groups termed groups A, Bl, B2 and D.
  • MLEE multilocus enzymatic electrophoresis
  • Descriptions of these techniques are in particular described in Herzer et al . 1990 J. Bacteriol. 172:6175-6181 and in Selander et al . 1986 Appl. Environ. Microbiol. 51:873-884 for MLEE, and in Bingen et al . 1994 Clin. Microbiol. Rev. 7:311-317, Bingen et al . Clin. Infect. Dis.
  • MLEE is based on the analysis of the migration polymorphism of bacterial enzymes. For each strain, a large number of enzymes characteristic of the species (often greater than or equal to 20) are characterized by their electrophoretic mobility. The existence of migration variants for these enzymes makes it possible to characterize each strain by an electrophoretic type.
  • ribotyping it. ⁇ s based on the analysis of the restriction polymorphism of the chromosomal regions which include the genes encoding the 16S and 23S RNAs. This polymorphism is revealed using a labelled cDNA probe, prepared from the 16S and 23S RNA of -257. coli, which can be hybridized to the DNA of the strain studied, after digestion of this DNA with various restriction enzymes and Southern transfer.
  • a collection termed ECOR has, moreover, been constituted in such a way as to represent the genetic diversity of the E. coli species. This collection is available in particular from the ATCC (ATCC No. 35320 to No. 35391) . It comprises:
  • E-7- coli strains not assigned to one of the four groups A, Bl, B2 and D.
  • the present invention makes the original and pertinent choice of isolating the entire set of its polynucleotides which are of nature B2/D+ A-, as defined above. Such a solution has never, to the Applicant's knowledge, been proposed. Now, the present invention demonstrates that this particular choice not only makes it possible to solve the problem of the phylogenic determination of a given E. coli strain, but also makes it possible to detect and to treat an undesired development of i57. coli, and more particularly a development of E. coli in a human or animal compartment which is extra-intestinal
  • the present invention in fact provides, in addition to the diagnostic and therapeutic means directed against E. coli in general, specific diagnostic and therapeutic means which have the advantage of distinguishing between E. coli capable of infecting the extra-intestinal compartment (J57. coli of group B2 and D) and -57. coli which are part of the normal physiological flora of humans and animals. Besides the less deleterious effects that such treatments offer patients suffering from an extra-intestinal E. coli infection, these specific diagnostic methods and treatments according to the invention make it possible to limit the development of bacterial resistances which are more and more frequently observed as broad-spectrum antibiotic treatments are used.
  • the inventors have therefore developed means which provide access to the entire set of polynucleotides which are of nature B2/D+ A-, as defined above. To the Applicant's knowledge, this is the first description of such means and the first description of such a set of products. These means also demonstrate that there are genomic regions which are of nature B2/D+ A-.
  • a concept which is common to the various aspects of the invention corresponds to producing phylogenic markers of i57. coli by choosing to isolate the set of the chromosomal and plasmid DNA fragments of E. coli which are of nature B2/D+ A- as defined above, and developing, for this purpose, a method which is capable of allowing the entire set of these fragments to be isolated.
  • the method used consists in subtracting the polynucleotide population of one or more E. coli strain (s) of group A, randomly sheared, from the polynucleotide population of one or more E. coli strain (s) of group B2 or D, cleaved with a restriction endonuclease allowing fragments comprising from 100 to 1 500 bp approximately, with a mean of 300 to 500 bp approximately, to be produced, such as 5au3AI, Tsp509I, Mspl or Maell (any restriction enzyme with short restriction sites, for example for bp) .
  • This method makes it possible to obtain a library of products which are of nature B2/D+ A- as defined above.
  • Example 1 below gives an illustration thereof, for which the i57. coli strain subjected to subtraction is the i57. coli strain C5.
  • This particular strain is classified, according to reference techniques, in the phylogenic group B2, yet it comprises, in particular at the plasmid level, DNAs of D origin.
  • the specific choice of E. coli C5 therefore has the advantage of isolating polynucleotides which are of nature B2+ A- and/or D+ A-, using a single strain.
  • Such a subtraction can be repeated over and over until the desired level of exhaustiveness is obtained (the level of exhaustiveness of the library can be estimated by searching for the presence or absence of products known to have the nature B2/D+ A-, such as sfa, ibelO ) . If the intention is to increase the level of exhaustiveness, it is desirable to use different restriction enzymes between the various subtraction repetitions. Examples of implementation of such a method are in particular described in Example 1 which follows.
  • the present invention proposes, for the first time, to apply a subtractive method to a specific choice of E. coli strains: firstly, an isolate of E. coli of group B2, associated with neonatal meningitis (E. coli C5 assigned to the phylogenic group B2 and comprising, in particular at the plasmid level, DNAs of D origin) , and secondly, E. coli strains of group A from the ECOR collection.
  • the present invention therefore offers the means for obtaining all of the products which are of nature B2/D+ A-.
  • the polynucleotides targeted by the present application correspond to the set of polynucleotides which are of novel nature B2/D+ A- .
  • Some of these polynucleotides correspond to products which are novel in themselves (SEQ ID NO: 1 to NO: 153, sequences corresponding to regions 1, 3, 4 and 5 identified in the following examples, orf sequences containing these sequences) .
  • Others correspond to products which have homologies with known products, but which are of novel nature B2/D+ A- (in particular SEQ ID NO: 170, 171, 174,
  • a subject of the present application is therefore any isolated polynucleotide which is of novel nature B2/D+ A- and which corresponds to a novel product in itself. It is in particular aimed towards any isolated polynucleotide the sequence of which corresponds to a sequence chosen from the group consisting of :
  • polynucleotide sequences which can be obtained by digesting the total DNA of an E. coli of group B2 or group D (such as, for example, an E. coli strain isolated from the blood of a patient suffering from a septicaemic or meningeal infection) with a restriction enzyme chosen from the group consisting of otl and Blnl , selecting those of the fragments obtained which hybridize with at least one sequence chosen from the group consisting of SEQ ID NO: 134, 144, 109, 115, 140, 135, 33, 56, 122, 130, 141, 25, 48, 51, 57, 121, 44, 45, 113, 119, 120, 23, 52, and
  • this polynucleotide group corresponds to the polynucleotides which are of novel nature B2/D+ A- and which are also novel as products.
  • SEQ ID NO:l to NO: 153 it comprises the novel regions 1, 3, 4 and 5 identified according to the invention, and the orfs which can be identified on these polynucleotides (cf. examples) .
  • nature-conserving sequence of a sequence is herein intended to mean any sequence which has conserved the nature B2/D+ A- (as defined above) of the polynucleotide corresponding to this parent sequence.
  • variant is herein intended to mean any sequence having nucleotide insertions and/or deletions and/or substitutions with respect to the parent sequence, this includes any sequence which is complementary to the parent sequence.
  • fractional is herein intended to mean any fragment of the parent sequence.
  • any strain is considered to be E. coli if it can be considered to belong to the E. coli species according to the criteria given by Bergey' s Manual of systemic bacteriology ⁇ cf. in particular Volume 1) .
  • any E. coli strain is considered to belong to the group B2 or to the group A if it is considered as such after carrying out a reference phylogenic test suitable for discriminating between B2/D and A, such as for example multilocus enzymatic electrophoresis (MLEE) and/or ribotyping.
  • MLEE multilocus enzymatic electrophoresis
  • Such phylogenic techniques are well known to those skilled in the art. Examples of suitable protocols are in particular described in Herzer et al . 1990 J. Bacteriol. 172:6175-6181 and in Selander et al . 1986 Appl. Environ. Microbiol. 51:873-884 for MLEE, and in Bingen et al . 1994 Clin. Microbiol. Rev.
  • the fact of being or of not being present in a given bacterial strain or sample corresponds to a notion which is common to those skilled in the art in the field.
  • the determination can in particular be carried out by Southern transfer of the nucleotide population of said E. coli strain, and bringing this transfer into contact with the polynucleotide tested, or with a probe derived from this polynucleotide, under conditions suitable for polynucleotide hybridization reactions. Examples of such probes and such conditions are given in Example 1. A positive hybridization will then be interpreted as the presence of said polynucleotide in said E.
  • the determination can also be carried out by PCR detection of a positive or negative amplification using amplification primers constructed on the basis of said polynucleotide and placed in contact with the nucleotide population of said i57. coli strain, under conditions which are favourable for the amplification, using these primers, of a target out of this population. Illustrations of such PCR procedures are given in the examples which follow.
  • the majority of the polynucleotides which are of nature B2/D+ A- according to the invention are advantageously present in ECOR -57.
  • coli of group B2 with a frequency greater than 10%, particularly greater than 40%, preferably greater than 50%, more preferably greater than 60%, and even more preferably greater than 70% ⁇ cf. Example 2 and Table 3) . Some of them are present in ECOR -57.
  • the polynucleotides which are of nature B2/D+ A- according to the invention can be used for simply detecting the presence of E. coli bacteria of the group B2 or D with a probability of at least 90% (presence of said polynucleotides) . Taken in combination together, or with other products, they make it possible to completely distinguish between the groups A, Bl, B2 and D of i57. coli ⁇ cf. Example 5) . They also have therapeutic, palliative and/or preventive applications of interest. These polynucleotides will be referred to hereinafter as: novel polynucleotides which are of novel nature B2/D+ A-.
  • the present application is also aimed towards any pair of primers which allows the PCR amplification of at least one novel polynucleotide which is of novel nature B2/D+ A- as defined above.
  • the setting up of suitable experimental PCR conditions is accessible to those skilled in the art ⁇ cf. in particular Molecular cloning - a laboratory manual, Sambrook, Fritsch, Maniatis, and in particular Vol. 2, Chap. 14 2nd edition; cf. Ausubel et al . 1989 Current protocols in molecular biology; John Wiley and Sons Ed., and in particular Vol. 2 Chap. 15). Examples thereof are given in the examples which follow.
  • the present application is also aimed towards any nucleotide probe comprising at least one novel polynucleotide which is of novel nature B2/D+ A- as defined above. It is in particular aimed towards any probe as obtained using at least one pair of primers according to the invention, in particular by PCR amplification ⁇ cf. examples), such as (SEQ ID NO: 160; 161), (SEQ ID NO: 162; 163) or (SEQ ID NO: 164; 165) .
  • any pair of primers which allows the PCR amplification, under conditions as mentioned above, of at least one polynucleotide which is of novel nature B2/D+ A-, and also any probe directed against such a polynucleotide, and any antibody directed against a polypeptide encoded by such a polynucleotide can, in accordance with the present invention, be used for the phylogenic determination of a bacterium of the E. coli species.
  • the present invention also demonstrates that the y ' aA gene (SEQ ID NO: 254; protein of SEQ ID NO: 255) is a phylogenic tool of interest for E. coli .
  • coli of any product which makes it possible to detect y aA ⁇ cf. Example 5 therefore enters into the domain of the present application. It is in particular aimed towards any pharmaceutical composition comprising such products, such as a pair of primers (e.g. SEQ ID NO: 162; NO: 163), a probe or a specific antibody.
  • a pair of primers e.g. SEQ ID NO: 162; NO: 163
  • a probe or a specific antibody e.g. SEQ ID NO: 162; NO: 163
  • the present application is also aimed towards any antisense polynucleotide, characterized in that its sequence corresponds to the antisense sequence of at least one novel polynucleotide which is of novel nature B2/D+ A- and any isolated polypeptide, characterized in that its amino acid sequence corresponds to a sequence encoded, according to the universal genetic code and taking into account the degeneracy of this code, by at least one novel polynucleotide which is of novel nature B2/D+ A-.
  • the present application is also aimed towards any combination of the polypeptides encoded by the polynucleotides of the invention with a binding product, capable of binding to at least one novel polypeptide which is of novel nature B2/D+ A-, and which will inhibit an important biological function, such as the extra- intestinal growth by growth in serum, or the multiplication in animal, such as disclosed in example 6.
  • a binding product capable of binding to at least one novel polypeptide which is of novel nature B2/D+ A-, and which will inhibit an important biological function, such as the extra- intestinal growth by growth in serum, or the multiplication in animal, such as disclosed in example 6.
  • Such products correspond in particular to antibodies or monoclonal antibodies and compounds inhibiting the biological function of the polypeptides.
  • Methods for manufacturing such binding products using the polypeptides according to the invention are available to those skilled in the art. They are conventional methods which comprise, in particular, the immunization of animals such as rabbits and the harvesting of the serum produced, followed optionally by the purification of the serum obtained
  • the expression "capable of binding” is intended to mean physiological-type conditions
  • binding product when said binding product is intended to be administered to a human or animal organism, and ELISA-type conditions when said binding product is intended to be used in assays and methods in vi tro, for example for determining the phylogenic group of an E. coli bacterium.
  • the present application is also aimed towards any vector comprising at least one novel polynucleotide which is of novel nature B2/D+ A-, and also any cell transformed by genetic engineering, characterized in that it comprises, by transfection, at least one novel polynucleotide which is of novel nature B2/D+ A- and/or at least one vector according to the invention, and/or in that said transformation induces the production by this cell of at least one polypeptide corresponding to a novel polynucleotide which is of novel nature B2/D+ A-.
  • a subject of the present application is also any composition, in particular any pharmaceutical composition, comprising at least one compound chosen from the group consisting of the novel polynucleotides which are of novel nature B2/D+ A-, the polypeptides corresponding to at least one novel polynucleotide which is of novel nature B2/D+ A-, and the vectors and cells according to the invention, mentioned above.
  • Such compositions are in particular useful for treating and/or for alleviating and/or for preventing E. coli infections, and in particular of infection by extra-intestinal E. coli . (systemic and non-diarrhoeal infections) .
  • said compound is immunogenic or is made immunogenic, these compositions can correspond to vaccines.
  • a subject of the present application is also any composition, in particular any pharmaceutical composition, comprising at least one compound chosen from the group consisting of the novel polynucleotides which are of novel nature B2/D+ A-, the pairs of primers and probes according to the invention, mentioned above, and the binding products according to the invention, mentioned above.
  • Such a composition is in particular useful for the phylogenic determination of an E. coli bacterium. It can therefore correspond to a diagnostic kit or composition.
  • a subject of the present application is also any composition, in particular any pharmaceutical composition, comprising, in association with an inert carrier, at least one compound chosen from the group consisting of the antisense polynucleotides according to the invention, mentioned above, and the binding products according to the invention, mentioned above.
  • Such compositions can also be used for the phylogenic determination of an E. coli bacterium, and can take the form of a diagnostic kit or composition. They can also be used in a therapeutic capacity, in order to treat and/or to alleviate and/or to prevent any undesirable growth of £7. coli, such as a sanitary contamination, an E. coli infection, and in particular the presence of extra-intestinal E. coli ⁇ cf. examples) .
  • a subject of the present application is also any use of these products according to the invention for manufacturing such compositions.
  • the present invention also provides products for which the sequence exhibits homologies with known products, but for which the nature B2/D+ A- is novel.
  • the chuA gene is a known product (Genbank Accession No. U67920) : it has been described in a few E. coli responsible for intestinal and extra-intestinal infections, in Shigella as a gene involved in iron metabolism, and has been described as being homologous to a Yersinia gene. However, it has never been described as a phylogenic marker.
  • the present invention reports, for the first time, its presence in the urophathogenic E. coli strain J96 and reports, for the first time, that it is present in 100% of the B2s and of the Ds and in 0% of the
  • the majority of the polynucleotides according to the invention which exhibit homology but which are of novel nature B2/D+ A- are advantageously present in ECOR E. coli of group B2 with a frequency greater than 40%, preferably greater than 50%, more preferably greater than 60% and even more preferably greater than 70%. Some of them are present in ECOR E. coli of group B2 with a frequency greater than 80%, or even at a frequency equal to 100%, while at the same time remaining present in ECOR E. coli of group A at a frequency of less than 5%, or even less than 3%, or even equal to 0% ( cf. Example 2) .
  • the present application is thus aimed towards any use of a compound which is of novel nature B2/D+ A- for the phylogenic determination of an E. coli bacterium, for the diagnosis of the presence or absence of undesirable J57. coli bacteria, such as contaminant E. coli or extra- intestinal .57. coli, and/or for the diagnosis of an E. coli infection, and/or for the manufacture of a composition intended for such a phylogenic determination and/or for such a diagnosis.
  • polynucleotides the sequence of which corresponds to SEQ ID NO: 170, 171, 174, 175, 178, 179, 183, 185, 186, 190, 191, 193, 194, 195, 196, 199, 200, 202, 205, 206, 208, 209, 211, 214, 218, 220, 221, 233, 234, 235, 241, 244, 246, 247, 248 and 250,
  • polynucleotides which can be obtained by digesting the total DNA of an E. coli of group B2 or D with a restriction enzyme chosen from the group consisting of WotI and Blnl , and selecting those of the fragments obtained which hybridize to at least one sequence chosen from the group consisting of SEQ ID NO: 125, 123, 116, 43, 40, 127, 133, 27, 34, 36, 42, 46, 54, 55, 38, 128 and 151, with the exception of the polynucleotides sfa , hly r cnfl , pap/prs r hra and ibe 10,
  • polynucleotides the sequence of which corresponds to a nature-conserving variant or fractional sequence of the sequence of at least one of these polynucleotides (and in particular SEQ ID NO: 241, 195, 185 and 248),
  • the present invention is also aimed towards the use of at least one of these compounds for the diagnosis of the presence or absence of undesirable E. coli bacteria, such as contaminant J57. coli or extra-intestinal E. coli, and/or for the diagnosis of an E. coli infection. It is also aimed towards any use of at least one of these compounds for the manufacture of a composition intended for such a phylogenic determination and/or for such a diagnosis.
  • the detection of the presence or absence of such compounds can in particular be carried out by nucleotide hybridization, by PCR amplification or by detection of their polypeptide products. Detection of the presence of such compounds makes it possible to conclude that the B2 or D i57. coli bacterium is present.
  • the combined use of the detection of the presence or absence of chuA and of yjaA makes it possible in particular to conclude as to whether £7. coli of group B2 and/or £7. coli of group D are present or absent ( cf. Example 5) .
  • the present application is also aimed towards any use of a compound which is of novel nature B2/D+ A- for the manufacture of a composition, in particular of a pharmaceutical composition, intended to alleviate and/or to prevent and/or to treat an undesirable growth of E. coli, such as an E. coli infection, the presence of extra-intestinal E. coli or a sanitary contamination. It is aimed in particular towards any use of at least one compound chosen from the group consisting of: - the polynucleotides corresponding to the ⁇ uA gene and to its operon,
  • polynucleotides the sequence of which corresponds to SEQ ID NO: 170, 171, 174, 175, 178, 179, 183, 185, 186,
  • polynucleotides which can be obtained by digesting the total DNA of an E. coli of group B2 or D with a restriction enzyme chosen from the group consisting of WotJ and Blnl, and selecting those of the fragments obtained which hybridize to at least one sequence chosen from the group consisting of SEQ ID NO: 125, 123, 116, 43, 40, 127, 133, 27, 34, 36, 42, 46, 54, 55, 38, 128 and 151, with the exception of the polynucleotides sfa , hly r cnfl , pap/prs, hra and ibe 10,
  • polynucleotides the sequence of which corresponds to a nature-conserving variant or fractional sequence of the sequence of at least one of these polynucleotides (and in particular SEQ ID NO: 241, 195, 185 and 248),
  • polypeptides the sequence of which corresponds, according to the universal genetic code and taking into account the degeneracy of this code, to these polynucleotides,
  • binding products capable of binding to a polypeptide encoded by a polynucleotide which is of novel nature B2/D+ A-,
  • the cells transformed by genetic engineering which comprise at least one of these polynucleotides and/or at least one of these vectors, and/or the transformation of which induces the production of at least one of these polypeptides, for the manufacture of a composition, in particular of a pharmaceutical composition, intended to alleviate and/or to prevent and/or to treat an undesirable growth of E. coli, such as an J57. coli infection, the presence of extra-intestinal E. coli or a sanitary contamination.
  • E. coli such as an J57. coli infection, the presence of extra-intestinal E. coli or a sanitary contamination.
  • the present application is aimed towards any method which makes it possible to identify a compound capable of inhibiting the growth of E. coli , and in particular of inhibiting its extra-intestinal development, which comprises the detection of at least one compound:
  • polynucleotide sequences which can be obtained by digesting the total DNA of an E. coli of group B2 or group D (such as, for example, an E. coli strain isolated from the blood of a patient suffering from a septicaemic or meningeal infection) with a restriction enzyme chosen from the group consisting of Wotl and Blnl, selecting those of the fragments obtained which hybridize with at least one sequence chosen from the group consisting of SEQ ID NO: 134, 144, 109, 115, 140, 135, 33, 56, 122, 130, 141, 25, 48, 51, 57, 121, 44, 45, 113, 119, 120, 23, 52 and sequencing selected fragments, and
  • polynucleotides the sequence of which corresponds to SEQ ID NO: 170, 171, 174, 175, 178, 179, 183, 185, 186,
  • polynucleotides which are as obtained by digesting the total DNA of an E. coli of group B2 or D with a restriction enzyme chosen from the group consisting of Wotl and Blnl , and selecting those of the fragments obtained which hybridize to at least one sequence chosen from the group consisting of SEQ ID NO: 125, 123, 116, 43, 40, 127, 133, 27, 34, 36, 42, 46, 54, 55, 38, 128 and 151, with the exception of the polynucleotides sfa , hly r cnfl f pap/prS f hra and ibe 10,
  • polynucleotides the sequence of which corresponds to an orf comprising the sequence of one of the polynucleotides cited in the preceding four paragraphs, - the polynucleotides the sequence of which corresponds to a nature-conserving variant or ' fractional sequence of the sequence of at least one of these polynucleotides and in particular (SEQ ID NO: 241, 195, 185 and 248),
  • the present invention is also aimed towards any method which makes it possible to identify a compound capable of inhibiting the growth of an £7. coli bacterium, and in particular its extra-intestinal development, characterized in that it comprises the detection of at least one compound capable of inhibiting the activity of a protein encoded by a polynucleotide the orf of which comprises a polynucleotide chosen from the group consisting of :
  • polynucleotides the sequence of which corresponds to SEQ ID NO: 170, 171, 174, 175, 178, 179, 183, 185, 186, 190, 191, 193, 194, 195, 196, 199, 200, 202, 205, 206, 208, 209, 211, 214, 218, 220, 221, 233, 234, 235, 241, 244, 246, 247, 248 and 250,
  • polynucleotides which are as obtained by digesting the total DNA of an -57. coli of group B2 or D with a restriction enzyme chosen from the group consisting of Wotl and Blnl , and selecting those of the fragments obtained which hybridize to at least one sequence chosen from the group consisting of SEQ ID NO: 125, 123, 116, 43, 40, 127, 133, 27, 34, 36, 42, 46, 54, 55, 38, 128 and 151, with the exception of the polynucleotides sfa r hly r cnfl , pap/prs, hra and ibe 10,
  • polynucleotides the sequence of which corresponds to an orf (open reading frame) comprising the sequence of one of these polynucleotides
  • polynucleotides the sequence of which corresponds to a nature-conserving variant or fractional sequence of the sequence of at least one of these polynucleotides.
  • Such compounds can in particular be obtained by screening chemical and/or biological libraries.
  • the present application is also directed towards any compound as identified by one or other of these methods, and any composition, in particular any pharmaceutical composition, ' comprising at least one such compound.
  • Such compositions are in particular useful for treating and/or for alleviating and/or for preventing an undesirable growth of E. coli, such as E. coli infections, or an E. coli contamination, and especially useful for treating and/or for alleviating and/or for preventing the presence of extra-intestinal E. coli bacteria.
  • a subject of the present invention is also, according to a notable aspect of the invention, phylogenic identification methods which implement the detection of the presence or absence of at least one of the polynucleotides which are of novel nature B2/D+ A- (whether these polynucleotides are novel as products, or whether they exhibit homologies with known products) : - the novel polynucleotides according to the invention, i.e.
  • polynucleotide sequences which can be obtained by digesting the total DNA of an E. coli of group B2 or group D (such as, for example, an E. coli strain isolated from the blood of a patient suffering from a septicaemic or meningeal infection) with a restriction enzyme chosen from the group consisting of Wotl and Blnl, selecting those of the fragments obtained which hybridize with at least one sequence chosen from the group consisting of SEQ ID NO: 134, 144, 109, 115, 140, 135, 33, 56, 122, 130, 141, 25, 48, 51, 57, 121, 44, 45, 113, 119, 120, 23, 52 and sequencing selected fragments, and
  • polynucleotides the sequence of which corresponds to SEQ ID NO: 170, 171, 174, 175, 178, 179, 183, 185, 186, 190, 191, 193, 194, 195, 196, 199, 200, 202, 205, 206, 208, 209, 211, 214, 218, 220, 221, 233, 234, 235, 241, 244, 246, 247, 248 and 250, - the polynucleotides which can be obtained by digesting the total DNA of an E.
  • coli of group B2 or D with a restriction enzyme chosen from the group consisting of Wotl and Blnl, and selecting those of the fragments obtained which hybridize to at least one sequence chosen from the group consisting of SEQ ID NO: 125, 123, 116, 43, 40, 127, 133, 27, 34, 36, 42, 46, 54, 55, 38, 128 and 151, with the exception of the polynucleotides sfa r hly r cnfl , pap/prs, hra and ibe 10,
  • polynucleotides the sequence of which corresponds to an orf (open reading frame) comprising the sequence of one of these polynucleotides
  • polynucleotides the sequence of which corresponds to a nature-conserving variant or fractional sequence of the sequence of at least one of these polynucleotides.
  • This detection can be carried out by direct detection of said polynucleotide, or of its fragments, or by detection of one or more polypeptide (s) corresponding to it
  • polypeptides encoded by a polynucleotide which is of novel nature B2/D+ A- polypeptides encoded by a polynucleotide which is of novel nature B2/D+ A-.
  • the present application is aimed towards any phylogenic identification method characterized in that it comprises the use of at least one compound chosen from the group consisting of :
  • sequences SEQ ID NO: 1 to NO: 153 • the polynucleotide sequences which can be obtained by digesting the total DNA of an E. coli of group B2 or group D (such as, for example, an E. coli strain isolated from the blood of a patient suffering from a septicaemic or meningeal infection) with a restriction enzyme chosen from the group consisting of Wotl and Blnl, selecting those of the fragments obtained which hybridize with at least one sequence chosen from the group consisting of SEQ ID NO: 134, 144, 109, 115, 140, 135, 33, 56, 122, 130, 141, 25, 48, 51, 57, 121, 44, 45, 113, 119, 120, 23, 52 and sequencing selected fragments, and
  • polynucleotides the sequence of which corresponds to SEQ ID NO: 170, 171, 174, 175, 178, 179, 183, 185, 186, 190, 191, 193, 194, 195, 196, 199, 200, 202, 205, 206, 208, 209, 211, 214, 218, 220, 221, 233, 234, 235, 241, 244, 246, 247, 248 and 250,
  • polynucleotides which are as obtained by digesting the total DNA of an E. coli of group B2 or D with a restriction enzyme chosen from the group consisting of Wotl and Blnl, and selecting those of the fragments obtained which hybridize to at least one sequence chosen from the group consisting of SEQ ID NO: 125, 123, 116, 43, 40, 127, 133, 27, 34, 36, 42, 46, 54, 55, 38, 128, 151 and 211, with the exception of the polynucleotides sfa, hly, cnfl , pap/prs , hra and ibe 10,
  • polynucleotides the sequence of which corresponds to a nature-conserving variant or fractional sequence of the sequence of a polynucleotide of this group (and in particular SEQ ID NO: 241, 195, 185 and 248),
  • polynucleotides the sequence of which corresponds to an orf comprising the sequence of a polynucleotide of this group
  • the probes comprising a polynucleotide which is of novel nature B2/D+ A-, as defined above,
  • binding products which are capable of binding to a polynucleotide which is of novel nature B2/D+ A-, as defined above, - the binding products (for example, antibodies) capable of binding to a polypeptide encoded by a polynucleotide which is of novel nature B2/D+ A- as defined above.
  • This implementation can be carried out according to any technique known to those skilled in the art so as to allow said "at least one compound" to detect, in a biological sample, the presence or absence of the polynucleotide (s) or, where appropriate, the polypeptide (s) which constitute (s) the target thereof: ELISA for reactions of antibody-antigen type, Southern-type hybridization or PCR for reactions of polynucleotide hybridization and/or amplification type.
  • Suitable biological samples comprise, in particular, all samples of human origin originating from sites which are normally sterile (blood, cerebrospinal fluid -CSF-, liquid from an effusion, etc.) or non-sterile (stooTs, oropharynx, skin, etc.), and also samples of environmental origin (soils, water, etc.), of food origin and of animal origin, and microbiological cultures.
  • This method makes it possible to conclude, when said detection is positive, that £7. coli bacteria of group B2 and/or D are present. With only the detection of the presence or absence of chuA, this method makes it possible to conclude, when this detection is positive, that £7. coli bacteria of group B2 or D are present. When this detection is negative, and insofar as the sample effectively contains £7. coli, it makes it possible to conclude that bacteria of group Bl or A are present.
  • This method can also comprise the use of several of said compounds, for example in such a way as to detect the presence or absence of the ch ⁇ K gene, and also that of the fragment SEQ ID NO: 119. It makes it possible then to conclude :
  • the phylogenic identification method according to the invention can also implement the detection of the presence or absence of the yjaA gene.
  • the yjaA gene ⁇ cf. Figure 5 is, in the prior art, known to be present in the £7. coli strain K12 (group A) . There is nothing in the prior art which would suggest that it may constitute a phylogenic marker capable of distinguishing between group B2 and group D.
  • Figure 5 shows the yjaA sequence (SEQ ID NO: 254) and that of the corresponding ORF (SEQ ID NO: 255) .
  • the detection of the presence or absence of the yjaA gene can take place by any technique accessible to those skilled in the art. It can in particular take place by detection of the polynucleotide or of its fragments (SEQ ID NO: 254), or by detection of the polypeptides which correspond to it (SEQ ID NO: 255) .
  • the development of reagents which allow such a detection is accessible to those skilled in the art: amplification probes or primers for the detection of polynucleotides, compound of serum or antibody type for the detection of polypeptides.
  • the polynucleotide hybridization reactions can, for example, be carried out by Southern and/or by PCR, and those of antigen-antibody type can be carried out by ELISA. Examples of implementation of such a detection step are given in the examples which follow.
  • a pair of primers SEQ ID NO: 162 and 163 which makes it possible to amplify yjaA are described in particular.
  • the present invention is also aimed towards any phylogenic identification method which implements at least the detection of the presence or absence of the yjaA gene. This detection makes it possible to distinguish between £7. coli of group B2 and £7. coli of group D ⁇ cf. examples) .
  • the present invention provides a phylogenic detection method which makes it possible to completely distinguish the £7. coli groups A, Bl, B2 and D.
  • This method implements the detection of the presence or absence of the ⁇ uA gene, of the yjaA gene and of SEQ ID NO: 119.
  • This method is described in detail in the examples.
  • Advantageous PCR conditions are in particular are mentioned therein (“simplified PCR”) .
  • this detection can be carried out by PCR amplification.
  • suitable PCR conditions are given in a purely illustrative capacity in the examples.
  • the present application is in particular aimed towards an identification method as defined above which implements the simultaneous detection of one or more of said polynucleotides which is/are of nature B2/D+ A-, and of yjaA, preferably by triplex PCR.
  • this method can be applied directly to the bacteria or to the sample analysed. It does not require the availability of a reference strain collection.
  • the present application is also aimed towards any method for detecting an undesirable growth of £7. coli, any method for diagnosing an £7. coli infection, any method for sanitary control or detection (foodstuffs, liquids intended for consumption, soils) and any method for selecting £7. coli strains suitable for biotechnological manipulations, which implement the phylogenic identification method according to the invention.
  • kits can in particular comprise at least one of the compounds used in said method. They can also comprise instructions for use featuring one or more of the profiles given in Figure 4 ⁇ cf. Example 5), or describing one or more of these profiles.
  • An advantageous kit comprises at least one of the pairs of primers (SEQ ID NO: 160, 161), (SEQ ID NO: 162, 163) and (SEQ ID NO: 164, 165) , preferably two of these pairs, more preferably the three pairs of primers.
  • FIG. 1 represents the distribution of B2/D+ A- clones along the chromosome of the £7. coli strain RS218, - Figure 2 gives the sequence listing for isolated B2/D+ A- fragments, and indicates their respective SEQ ID NOS,
  • FIG. 3 illustrates a decision diagram for the phylogenic analysis of an £7. coli strain
  • FIG. 4 represents the various phylogenic profiles obtained using a triplex PCR method according to the invention (lanes 1 and 2: group A; lane 3: group Bl; lanes 4 and 5: group D; lanes 6 and 7: group B2) ,
  • Figure 5 represents the polynucleotide sequence of the yjaA gene (SEQ ID NO: 254) and the corresponding ORF sequence,
  • Figure 6 represents the sequences of the CTF073+ K12- regions and RS218+ K12- regions obtained using the clones of Figure 2,
  • Figure 7 represents the position of the genomic fragment specific to £7. coli CFT073 and RS218 strains on the chromosome of £7. coll K12 strain.
  • EXAMPLE 1 Production of a C5+ A- library and of a RS218+ K12- library
  • the strains used for this subtractive hybridization are an £7. coli isolated from the cerebrospinal fluid (CSF) of a newborn (£7. coli C5 of serotype 018:K1:H7; group B2) and two £7. coli strains of group A, ECOR4 and ECOR15, which belong to the ECOR collection (ATCC) .
  • £7. coli C5 harbours several virulent factors, such as the Kl capsular antigen, an sfa operon, the ibe 10 gene, Pap pili and the haemolysin ⁇ hly) gene.
  • This strain belongs to the phylogenic group B2.
  • the ECOR4 and ECOR15 strains, which belong to the phylogenic group A express no identified virulence factor.
  • Other £7. coli strains were used: strain RS218
  • Southern transfer The Southern transfers were carried out by capillary transfer onto positively charged nylon membranes. The hybridizations were carried out at 65°C in 1% sodium dodecyl sulphate (SDS) - IM NaCI - 50 mM Tris- HCI (pH 7.5) - 1% blocking reagent (Boehringer Mannheim, Mannheim, Germany) . The membranes were washed, first in
  • 2x SSC (lx SSC corresponds to 0.15M NaCI + 0.015M sodium citrate) for 15 min at room temperature, and then in 2x SSC - 0.1% SDS for 30 min at 65°C and finally, in O.lx SSC for 5 min at room temperature.
  • the ' ' detection by chemi- luminescence was carried out with the DIG luminescence detection kit for nucleic acid (Boehringer Mannheim) according to the manufacturer's instructions.
  • the sfa and ije 10 probes were produced by PCR using primers and an amplification method described previously (Bingen et al . 1997, J. Clin. Microbiol. 35: 2981-2982).
  • the chromosomal DNA of the ECOR strains was randomly sheared by repeatedly pushing it through a hypodermic needle so as to obtain fragments with a length of between approximately 3 and 10 kb. This digested DNA was purified by phenol extraction. The chromosomal DNA of £7. coli C5 was cleaved with the Sau3AI or Tsp5091 restriction endonuclease.
  • This DNA (20 ⁇ g) was ligated with 10 nmol of hybridized oligonucleotides RBaml2 (5' -GATCCTCGGTGA-3' ; SEQ ID NO: 154) and RBam24 (5' -AGGACTCTCCAGCCTCTCACCGAG-3' ; SEQ ID NO: 155) or REcol2 (5' -AATTCTCGGTGA-3' ; SEQ ID NO: 156) and REco24 (5' -AGCACTCTCCAGCCTCTCACCGAG-3' ; SEQ ID NO: 157) when the restriction was carried out with SauAI or Tsp5091, respectively.
  • the DNA was separated from the excess primers by electrophoresis in a 2% low melting point agarose gel. The portion of the gel containing fragments longer than 200 bp was excized and digested with ⁇ -agarase. This DNA was purified by phenol extraction.
  • the solution was covered with mineral oil, and the DNA was denatured by heating to 100°C for 2 min; 2 ⁇ l of 5M NaCI were added and the mixture was left to hybridize at 55°C for 48 h.
  • the reaction mixture was diluted 10 times in preheated 3x EE buffer - IM NaCI and immediately placed in ice. A portion of the dilution (10 ⁇ l) was added to 400 ⁇ l of PCR reaction mixture (10 mM Tris HCl [pH 9.0], 50 mM KCI, 1.5 mM MgCl 2 , 0.1% Triton X-100, a 0.25 mM concentration of each deoxynucleotide triphosphate and 50 U of Taq polymerase per ml) and the whole mixture was incubated for 3 min at 70 °C in order to fill the ends of the re-hybridized £7. coli C5 fragments. After denaturation at 94°C for 5 min an addition of RBam24 or REco24 oligonucleotides (0.1 mmol per 100 ⁇ l) , the hybridizations were amplified by PCR
  • the PCR products were purified on a gel in order to separate the £7. coli C5 fragments from the primer and from the high molecular weight ECOR subtraction DNA.
  • the second round of subtractive hybridization was carried out using 40 ⁇ g of fragmented ECOR4 or ECOR15 £7. coli DNA and 25 ng of DNA ligated to RBam24 or REco24 obtained from the first round.
  • the gels were 1% agarose in 0.5x Tris-borate-EDTA buffer, and they were subjected to electrophoresis at 6 V/cm for 27 h, with pulsed durations varying in a linear manner between 2 and 49 s.
  • the positions on the RS218 chromosome of sequences which are reactive with each of the clones were determined by comparing the Blnl and Wotl recognized restriction fragments with the published macrorestriction map (Rode et al . 1999) .
  • Some of the clones correspond to genes which are already known to be present in the strain C5, such as pap, hly and kps .
  • the 494 clones none proved to be homologous to sfa or ibel O . Additional rounds of subtraction and/or the sequencing of additional clones make it possible to obtain greater exhaustiveness, until this is complete.
  • sequences are found which correspond to virulence factors described in strains which, to date, have never been associated with neonatal meningitis: (i) prs, cnf and hra, all of which form part of a PAI (pathogenicity island) in the uropathogenic £7.
  • each of these clones was used to probe the ral-digested DNA of strains EC0R4, EC0R15 and MG1655: they proved to be nonreactive with respect to these strains.
  • TspE4.A7 was positioned by SauE15.F12 overlap; 2, SauE15.F12 also exhibits reactivity on the plasmid; and 3, TspE4.A8 also exhibits weak reactivity on Notl fragment P.
  • Region 1 is contained in Blnl fragment m (85 kb) .
  • the clones of region 2 were mapped on Notl fragments p and n ( ⁇ 240 kb) .
  • Region 3 is contained in Notl fragment a ( ⁇ 310 kb) and region 4 is contained in Blnl fragment h (210 kb) .
  • Region 5 is located on Blnl fragment j (135 kb) .
  • Region 6 which overlaps Notl fragment b and Blnl fragment b ( ⁇ 550 kb) , was divided into two subregions, regions 6a and 6b, with the Xbal enzyme. This latter subregion contains clones which exhibit homologies with Cnfl, hly, prs and hra . The sfa and ibel O probes hybridize in regions 2 and 6a, respectively. The genes encoding the capsule were not linked to any of these regions. Six clones exhibiting no homology with known sequences and the senB gene were all located on a large plasmid present in the strain RS218.
  • a The prevalence of PAI V was evaluated using clones TspE4.Dll, TspE4.D10 and TspE15.Cl, which are homologous to the hly, cnfl and hra genes, respectively.
  • b p ⁇ 0.05, with respect to the strains of the ECOR collection (the existence of a difference in the distribution of the clones studied was tested using the ⁇ 2 test) .
  • regions 1, 3 and 4 appeared with a frequency which was notably higher (p ⁇ 0.05) among the NMEC than among the other B2 strains, thus suggesting that these regions contain DNAs encoding NMEC-specific factors. These regions, their portions and the clones which they contain therefore constitute a source of polynucleotides involved in neonatal meningitis.
  • any restriction enzyme having short restriction sites for example 4 bp, such as Tsp509I or 5au3AI, and also Mspl or aell
  • two to three rounds of subtraction followed by the elimination of those of the clones which exhibit homology with a strain of group A such as £7. coli K12 gives a very good level of specificity.
  • Varying the nature of the restriction enzymes used, in particular between each subtraction cycle, further increases the level of exhaustiveness and/or of specificity of the set obtained. In order to evaluate the level of exhaustive- ness effectively obtained, this evaluation can be carried out by investigating the presence or absence of known markers such as sfa or ibel O .
  • Region 5 has a high prevalence, but it is similar in both collections and may thus correspond to segments which are highly characteristic of the phylogenic group B2 with respect to the group A.
  • Region 5 of £7. coli thus appears to be an advantageous source of DNA fragments present in a great majority (more than 80%) of £7. coli of group B2 of the ECOR collection, and absent from the majority of £7. coli of group A of the ECOR collection. The same distribution was observed for ⁇ uA, but interestingly, these genes were present, without exception, in all the strains of group B2 tested.
  • regions 1, 3 and 4 they appear to be clearly more common in NMEC than in the other B2 £7. coli . Given that regions 1, 3 and 4 do not, moreover, contain any known virulence factors, these regions appear to correspond to DNA islands associated with invasion of the meninges by £7. coli in infants and newborn.
  • Regions 1, 3, 4 and 5 were not available, that they had never been described or isolated and that no function was known for them. Regions 1, 3, 4 and 5 therefore appear to be novel. As regards regions 2, 6a and 6b, they comprise DNAs which were known as products, but this is the first description of the existence of such regions and the first description of their nature C5+ A-. EXAMPLE 2: Distribution of the C5+ A- fragments among the ECOR strains
  • Example 1 The frequency of presence of the fragments obtained in Example 1 (C5+ A- fragments) among ECOR £7. coli of groups Bl and D was then measured by Southern hybridization as described in Example 1.
  • the results obtained for 14 of them (SEQ ID NO: 56, 116, 43, 51, 141, 130, 45, 50, 52, 119, 127, 125, 55 and 37) are given in Table 3 below.
  • 15 £7. coli strains of group B2 which the ECOR collection contains. More than 75% of the fragments are present in the 15 ECOR £7. coli strains of group B2 at a frequency greater than 40%, 50% of them are present therein at a frequency greater than 70% and approximately 35% of them are present therein at a frequency greater than or equal to 80%.
  • the various results obtained indicate that the set of DNAs isolated according to the invention constitute, taken alone or in combination, particularly suitable tools for the phylogenic identification of £7. coli strains.
  • TspE15.G6 is however present at a frequency of 12% in the £7. coli of group A, at a frequency of 53.3% in the ECOR £7. coli of group B2, and at a frequency of 83.3% in the ECOR £7. coli of group D.
  • the choice of the £7. coli strain C5 as the strain of group B2 for the subtractive hybridization which enabled the isolation of these fragments ⁇ cf. Example 1 above) is probably not unrelated to these results.
  • the fragments isolated in accordance with the invention using a B2/D £7. coli strain such as £7. coli C5 have, in addition to phylogenic diagnostic applications ( cf. Example 5 below) , applications of particular value for preventing, alleviating and combating any extra-intestinal development of £7. coli (systemic and nondiarrhoeal development) .
  • This B2/D+ A- set according to the invention in particular comprises mostly fragments which are present at a frequency greater than 10% in the ECOR £7. coll of group B2 and/or D, and at a frequency of less than 25%, preferably 20%, more preferably 10%, and even more preferably 5%, in the ECOR £7. coli of group A (with the proviso that this A frequency is always lower than that in the B2s and/or Ds) .
  • the present invention also provides means which give access to the entire set of these polynucleotides which are of nature B2/D+ A-.
  • EXAMPLE 3 Example of medical application of the B2/D+ A- fragments isolated (systemic and non- diarrhoeal development of E. col ⁇ in an extra- intestinal compartment)
  • B2/D+A- E. coli are those which are particularly responsible for extra-intestinal infections
  • DTA differential transcriptional analysis
  • the bacteraemia phase is compared to the growth of the bacterium in the presence of human serum
  • the £7. coli strain C5 (serotype 018:K1:H7) was isolated from the CSF of a newborn. This virulent strain belongs to the phylogenic group B2 and exhibits the following virulent factors: The Kl capsular polysaccharide, S adhesin, IbelO invasin, the type P pilus and haemolysin, but does not produce aerobactin.
  • The- subtractive fragment library was obtained using the C5 strain.
  • the nonpathogenic strains used were the £7. coli K12 strain MG1655 and the strain ECOR15 belonging to the so-called nonpathogenic phylogenic group A, originating from the ECOR reference collection.
  • the bacterial inocula were prepared from 18 h cultures on tryptocasein-soybean agar (Sanofi Pasteur) , with the colonies being resuspended in sterile water. After measuring the OD, this bacterial suspension, pure or diluted, was used to prepare the inoculum in the various culture media, adding a volume which was always less than 1/1Oth of the final volume.
  • the bacterial cultures were prepared at 37°C with shaking, using either pure nutrient broth (Sanofi Pasteur) or nutrient broth supplemented with an iron chelator: 2, 2'-dipyridyl (Sigma) at a concentration of 200 ⁇ M.
  • the bacterial strains were also cultured in the presence of human serum. Two types of serum were obtained: one consisting of a pool of 4 sera originating from donor blood, the other corresponding to a single donor. The serum was collected after harvesting the blood in a dry tube, spontaneous coagulation for approximately 3 hours at room temperature, and then decanting. The serum was then stored in the form of 2.5 ml aliquots at -80°C. Decomplemented serum was obtained, from the pool, after incubation at 56°C for 30 minutes.
  • the subcultures of the clones of the B2/D+ A- library were prepared in nutrient broth enriched with ampicillin (50 ⁇ g/ml) at 37°C.
  • the subtractive DNA fragments cloned into the plasmid pUC19 were amplified by PCR reaction, without DNA extraction, directly from a 1/lOth dilution of an 18 h culture broth of each clone. 5 ⁇ l of this bacterial solution were added to the reaction mixture, with a final volume of 50 ⁇ l, comprising: IX PCR buffer (10 mM Tris-HCI pH 9; 1.5 mM MgC12; 50 mM KCI; 1% Triton X100; 0.1% gelatin); 2U SuperTaq polymerase (ATCG Biotechnologie) ; 200 ⁇ M deoxynucleotide triphosphates; 6 ⁇ M specific primers.
  • the primers used were as follows : PI (5'-CATGCCTGCAGGTCGACTCT-3'; SEQ ID NO: 728), P2 (5'-CGTTGTAAAACGACGGCCAG-3' ; SEQ ID NO: 729).
  • the PCR reaction consisted of 30 cycles: 30 s at 95°C (denaturation) , 30 s at 55°C (hybridization) and 30 s at 72°C (elongation) .
  • Membranes comprising the set of specific amplified fragments of the £7. coli strain C5 were manufactured by the company Eurogentec. 180 nl of each of the PCR products were deposited in duplicate, in the form of microspots, onto 6 cm by 11 cm nylon membranes. In order to be able to normalize the signals recorded for each reverse transcript, spots consisting of 4-fold serial dilutions of the chromosomal DNA of the £7. coli strain C5, on the one hand, and of dilutions of the product of a PCR of the 16S rRNA, on the other hand, were deposited. In addition, a negative control corresponding to the PCR product of a subtractive fragment of 17 bp was also deposited. These membranes are conserved sealed, at +4°C.
  • RNA is extracted from a bacterial pellet obtained after centrifugation (4 700 rpm at 4°C for 3 min) of 5 ml of a 4-hour culture (approximately 10 8 CFU/ml) .
  • this reagent consists of a monophasic solution of phenol and of guanidine isothiocyanate, which allows extraction of the total RNA in a single step according to the method developed by Chomczynski and Sacchi.
  • the bacterial cells are lysed by adding the TRIZOL reagent, vortexed for 30 s and subjected to a heat shock (incubation at 65°C followed by freezing at -80°C) .
  • the following steps are those of a conventional phenol-chloroform extraction.
  • the recovered aqueous phase contains the RNA.
  • this kit contains no phenol-based solution. It consists in using a solution which allows the cells to be lysed during an incubation at 65°C for 5 minutes. Next, a solution for precipitating the DNA and the proteins makes it possible to then recover the RNA which is in the aqueous phase.
  • RNA is precipitated using isopropanol: the isopropanol is added to the aqueous solution containing the RNA, volume for volume. This mixture is incubated at -20°C for approximately 15 h, and after centrifugation (13 000 rpm at 4°C for 5 min), the RNA is obtained in the form of a pellet. The RNA is then washed with 70% ethanol and precipitated by centrifugation (13 000 rpm at 4°C for 2 min) . Finally, the RNA is solubilized in water in the first case
  • RNA samples are stored at -80°C.
  • RNA assay and puri ty The RNA is assayed using a spectophotometer, by measuring the absorbance at 260 nm, it being known that an optical density value at 260 nm corresponds to an RNA concentration of 40 ⁇ g/ml.
  • the purity of the sample is estimated by calculating the OD 26 o nm/OD 2 8o nm ratio: a ratio greater than 1.6 reflects an acceptable purity.
  • RNA preparation obtained is of good quality.
  • the cDNA probe is synthesized using random priming. 10 ⁇ g of RNA are mixed together, in a final volume of 50 ⁇ l, with 10 ⁇ g of random hexamers; dATP, dTTP and dGTP, final concentration of 10 ⁇ M (Boehringer Mannheim) ; 40 U of Rnasin ribonuclease inhibitor (Promega) ; IX RT buffer; 10 ⁇ g of bovine serum albumin
  • the membrane is prehybridized in 5 ml of Church and Gilbert hybridization buffer (0.5 M NaPi, pH 7.2; 1 mM EDTA; 0.7% SDS) for a minimum of 30 minutes at 65°C.
  • Church and Gilbert hybridization buffer 0.5 M NaPi, pH 7.2; 1 mM EDTA; 0.7% SDS
  • the 33 P- labelled cDNA probe is directly added to 5 ml of hybridization buffer.
  • the hybridization reaction is carried out at 65°C for 15 to 18 h.
  • the membrane is first rinsed twice with washing buffer (40 mM NaPi, pH 7.2; 1 mM EDTA; 1% SDS). Then, four washing steps are carried out (30 min at 65°C) .
  • washing buffer 40 mM NaPi, pH 7.2; 1 mM EDTA; 1% SDS. Then, four washing steps are carried out (30 min at 65°C) .
  • the membrane is exposed to a 33 P-sensitive screen (Molecular Dynamics) for 48 h to 90 h.
  • the membrane is incubated twice for 15 min at 37 °C in the presence of a solution of 0.2N NaOH and 0.1% SDS, and then it is rinsed with distilled water for 5 min.
  • the exposed 33 P-sensitive screen is scanned using a Phosphorlmager (Storm 840, Molecular Dynamics) with a pixel size of 50 urn, and then the image obtained is analysed using the XdotsReader software (Cose) .
  • This software allows automatic recognition of the spots and is capable of calculating the pixel-density for each of the spots. It also makes it possible to subtract the background noise and to normalize the signal intensity. For each spot, the local background noise was subtracted and the signal intensity was normalized using the signal of the chromosome diluted 400-fold.
  • the disparity between two spots from the same clone was calculated in order to eliminate the spots with a significant disparity, reflecting aberrant signals. Then, the mean of the intensity of the signals- from a same pair was defined, and it is this mean intensity, net (after subtraction of the local background noise) and normalized, which will be considered for the data analysis.
  • the intensity obtained for the 17 bp clone made it possible to define a positivity threshold, the signals lower than this intensity being considered as negative or "undetected".
  • ⁇ £7. coli strain K12 is killed in less than 2 hours and the strain ECOR15 experiences a decrease in growth of more than 2 logs in 2 hours, persisting at a level of between 10 4 and 10 5 CFU/ml.
  • the strain C5 persists without growing for the first two hours of culturing, and then experiences a growth of 1 log in the 4th hour.
  • the inoculum of 10 7 CFU/ml was therefore selected for the transcriptome * study.
  • the growth of the strains K12 and ECOR15 is similar to that of the strain C5, which suggests that the bactericidal effect observed was due to the lytic activity of the complement.
  • RNA extraction step is a fundamental step of the DTA, it appeared to us to be necessary to compare two extraction methods in order to determine whether or not the mode of extraction could have an influence on the transcriptome results.
  • Preparations of good quality are characterized by the presence of the 23S, 16S and 5S rRNAs, detected in the form of clear bands by agarose gel electrophoresis.
  • the bands obtained for the 23S and 16S rRNAs, and a band corresponding to the 5S rRNA and to the tRNAs reflect the good quality of the RNA preparations and of the equivalence of the extraction methods .
  • the detection of RNA of low- molecular weight shows that the extraction method makes it possible to isolate the mRNAs which are small in size, unlike other systems and in particular those using columns which retain only the mRNAs longer than 200 bp.
  • a high molecular weight band corresponding to the chromosomal DNA appears clearly on the lane of the RNA obtained using the Biorad kit.
  • the two transcriptomes obtained from these two types of RNA were compared visually (without integrating the signals), and appear to be identical.
  • the transcriptomes obtained by hybridization of the reverse transcripts originating from £7. coli C5, under two different culture conditions, on the high density membranes comprising the set of clones of the B2/D+ A- library were analysed with the aid of the Cose software, using the 400-fold dilution of the chromosome as the normalization spot, the signal of which is close to the median of all of the signals. It appears clearly that a certain number of spots are lacking in signals.
  • the normalized intensity recorded on the 17 bp clone (0.05) was used. In order to be safe, this threshold was doubled and, therefore, any spot with arr intensity of less than 0.1 was considered to be "undetected".
  • the normalized intensities of the signals obtained for all the clones for which the transcript was detected under at least one of the culturing conditions make it possible to visualize the level of transcription of the set of subtractive clones in the course of the three respective experimental conditions: growth in nutrient broth, in nutrient broth supplemented with 2,2' -dipyridyl (iron chelator) and in serum. It is noted that most of the signals detected have similar intensities whatever the culturing conditions, whereas certain fragments exhibit levels of transcription which vary by a factor of ten according to the culturing conditions. These results therefore suggest good reproducibility of the technique, with a capacity to detect different transcriptional levels.
  • the transcriptome obtained in nutrient broth was considered to reflect the basal level of transcription of the bacterium in favourable medium.
  • the expression profile obtained under conditions of stress consisting of culturing in serum was analysed with respect to this control transcriptome.
  • the transcriptomes obtained when culturing in nutrient broth and iron chelator were also prepared, in order to determine the respective roles of iron deficiency and of complement in the serum.
  • Table 5 gives the ratios of the intensities of the signals obtained for some of the clones of the B2/D+ A- library under the various experimental conditions, with respect to the control condition represented by the growth in nutrient broth (NB) .
  • the transcripts induced in the- serum (ser) are most commonly also induced in the presence of dipyridyl (dip) and of decomplemented serum (DC serum), with the exception of the SauE15.A12, SauE4.E6, SauE4.Cll and TspE15.G6 clones.
  • a new probe was prepared from a culture of £7. coli C5 in a serum originating from a single donor.
  • the transcriptome obtained with this new probe was compared with the transcriptome produced with the pool consisting of several sera.
  • the straight line of regression obtained, and also the regression coefficient with a value of 0.85, indicate the excellent reproducibility of the technique.
  • the intensities obtained for the chromosomal range points consisting of l/100th, l/400th and 1/1 600th dilutions were recorded on a graph.
  • the graph obtained shows the existence of a linear relationship between these values, making it possible to deduce an induction factor directly from the normalized intensities recorded.
  • the DTA technique described herein is a reliable method for selecting, from the B2/D+ A- library obtained from £7. coli C5, DNA fragments the transcription of which is increased in the presence of serum. These fragments make up genes which participate specifically in the systemic and non- diarrhoeal extra-intestinal development of £7. coli in humans and animals. These genes can be isolated using said DNA fragments the transcription of which is increased in the presence of serum, according to conventional techniques of those skilled in the art ⁇ cf. Example 6) .
  • fragments and the genes which bear them can be used as active principles (in the form of naked DNA placed under the control of a eukaryotic promoter or in the form of DNA transfected into a cell) in a vaccine composition intended to prevent, alleviate or combat the systemic and non-diarrhoeal development of £7. coli in a human or animal extra-intestinal compartment.
  • these fragments and genes can, if desired, be modified, for example so as to produce inactivated isogenic mutants.
  • polypeptides which they encode can also be used in such vaccine compositions, in an inactivated and immunogenic form.
  • DNA fragments and these genes can also be used as anti-pathogenicity targets (with the objective of preventing the development of £7. coli in an extra- intestinal area, and not in the intra-intestinal area) : they allow the identification of compounds capable of specifically inhibiting their transcription and/or translation, or of compounds capable of inhibiting the activity of the proteins encoded by these genes.
  • compounds can be used as active principles in pharmaceutical compositions (medicinal products) in order to prevent, alleviate or treat the systemic non- diarrhoeal development of £7. coli in a human or animal extra-intestinal compartment.
  • fragments which are of nature B2/D+ A-, and the transcripts of which are increased in the serum those which are not present in £7. coli which are agents of infections localized in the intestine, such as £7. coli 0157 :H7, are more particularly preferred.
  • a systemic non-diarrhoeal development of £7. coli in an extra-intestinal compartment is in particular observed in the context of diseases such as neonatal meningitis, septiceamias, sepsis or pyelonephritis.
  • Such vaccines and pharmaceutical compositions are particularly useful in the context of hospital-acquired infections.
  • Such vaccines are most particularly valuable for the vaccination of women (from adolescent to adult, and more particularly before and during a gestation period) as a prevention against pyelonephritis, in order to avoid contamination of the newborn during birth.
  • the vaccines and pharmaceutical compositions according to the invention are therefore particularly suitable for such pathologies.
  • the present invention is therefore aimed towards any polynucleotide capable of being obtained :
  • the DNAs given in Table 5 above are examples of such polynucleotides .
  • any vaccine composition comprising such a polynucleotide, or an inactivated isogenic mutant of such a polynucleotide (inactivation of the possible pathogenic potency) , in particular for preventing, alleviating or combating the development (systemic and non-diarrhoeal) of £7. coli in a human or animal extra-intestinal compartment,
  • any pharmaceutical composition comprising a compound capable of inhibiting the transcription and/or translation of such a polynucleotide or mutant, or capable of inhibiting the activity of a polypeptide encoded by such a polynucleotide or mutant.
  • the present invention is also aimed towards a method for identifying compounds which can be used as active principles in a pharmaceutical composition (medicinal product) intended to prevent, alleviate or combat the development (systemic and non-diarrhoeal) of £7. coli in a human or animal extra-intestinal compartment.
  • This method comprises detecting and selecting compounds capable of inhibiting the transcription and/or translation of said polynucleotides, or capable of inhibiting the activity of the polypeptides which they encode.
  • the present invention is also aimed towards any kit suitable for the implementation of this method, said kits comprising at least one of said polynucleotides or polypeptides.
  • Such cells and transgenic animals are particularly useful for selecting active principles of interest.
  • Example 1 the isolation and the sequence of 259 DNA fragments which are of nature B2/D+ A- (153 of which are novel as products) are described, and means for producing the entire set of these products are provided.
  • Example 1 and Figure 1 give the sequence of eight (novel) fragments belonging to region 1 (SEQ ID NO: 134, 144, 109, 115, 140, 135, 33 and 56) .
  • region 2 5 novel fragments (SEQ ID NO: 125, 123, 116, 43 and 40), and the presence of the sfa gene, are indicated.
  • region 3 7 (novel) fragments (SEQ ID NO: 122, 130, 141, 25, 48, 51 and 57) are indicated.
  • region 4 three (novel) fragments (SEQ ID NO: 121, 44 and 45) are indicated.
  • region 5 (novel) fragments SEQ ID NO: 113, 119, 120, 23 and 52) are indicated.
  • region 6a eight novel fragments (SEQ ID NO: 127, 133, 27, 34, 36, 42, 46 and 54), and the presence of the known iielO product, are indicated.
  • region 6b four (novel) fragments (SEQ ID NO: 55, 38, 128 and 151), and the presence of four known products (SEQ ID NO: 212, 226, 201 and 229), are indicated.
  • One of the conventional techniques consists in developing primers for amplifying the desired region, based on the sequences of the fragments of this region, and in carrying out a PCR amplification using various combinations of said primers placed in contact with the polynucleotide population of an £7. coli strain of group B2 or D (which may or not be ECOR) , under conventional PCR conditions.
  • the PCR products which overlap are sequenced on both strands, using the chain termination technique and automated sequencing.
  • the sequence obtained can be extended beyond the limit of the clones available by cloning, for example in lambda DASH-II, partial fragments of approximately 15 kb obtained by restriction carried out on said polynucleotide population.
  • the inserts overlapping the desired region are then identified by hybridization with clones of this region.
  • the inserted DNA is then sequenced from the end of the inserts, and these sequences are used to develop novel primers which" will be used to directly amplify the chromosomal (and non-phage) DNA. Amplification of the chromosomal DNA is then obtained using these novel primers and those of the shorter sequence already obtained.
  • These PCR strands are also sequenced on both strands, which then produces the complete sequence of the desired region.
  • sequence of such regions can be obtained by extension of the sequencing of the chromosome of an £7. coli strain of group B2 or D, from points at which a clone which is of nature B2/D+ A- is located (cf. Example 6 below) .
  • the open reading frames of the sequences obtained for the regions can then be analysed according to conventional techniques for seeking ORFs.
  • a search is carried out in particular for ORFs which begin with ATG or CTG and which have a high codon use index.
  • EXAMPLE 5 Method and kit for identifying the phylogenic group of an E. col ⁇ strain
  • the polynucleotides which are of nature B2/D+ A- can be used to determine the phylogenic group of any £7. coli strain. They can be used alone, in combination together or in combination with other products, depending on the result and the precision of phylogeny desired. In the event of positive detection, the set of DNAs which are of nature B2/D+ A- makes it possible to eliminate the hypothesis that a strain belongs to the group A.
  • probes can in particular be carried out using said polynucleotides as probes (Southern technique) , or by constructing amplification primers capable of amplifying one of said polynucleotides, so as to carry out a PCR.
  • the construction of probes and of primers can be carried out according to any technique known to those skilled in the art; examples of such constructions are given below.
  • a phylogenic identification method which allows a phylogenic precision of at least 99% for £7. coli .
  • This method is based on the PCR detection of two genes (chuA and yjaA) , the sequence of which is known, but which- are of novel nature, and on the detection of the
  • TspE4.C2 novel DNA fragment (SEQ ID NO: 119). The method was evaluated by testing 220 strains which had already been grouped together using reference methods
  • the inventors in fact demonstrated that the known cnuA gene is present in 100% of the ECOR strains of group D, in 100% of the ECOR strains of group B2, and in 0% of the ECOR strains of group A and of the ECOR strains of group Bl. They also demonstrated that the y aA gene, the sequence of which is known, is present in 100% of the ECOR B2 strains and in 0% of the
  • Bacterial strains The 72 strains of the ECOR- collection are available from the ATCC. These reference strains, isolated from various hosts and various geographical locations, are representative of the range of genotypic variation of the species. Sixty-eight of these strains belong to the four main phylogenic groups (A, Bl, B2 and D) , and 4 are unclassified.
  • the other 17 NMEC and the remaining 65 clinical isolates were classified by means of ribotyping as previously described in Binger et al . (ref. above) .
  • the bacteria were cultured at 37°C on Luria Bertani broth medium or agar. If necessary, ampicillin (100 ⁇ g per ml) was used.
  • PCR amplification In a first step, the PCR was carried out according to a standard protocol. The reaction was carried out in a volume of 20 ⁇ l containing 2 ⁇ l of 10X buffer (supplied with Taq polymerase) , 20 pmol of each primer, 2 ⁇ M of each dNTP, 2.5 U of Taq polymerase (ATGC Biotechnologie, noisysy-le-Grand, France) and 200 ng of genomic DNA.
  • the PCR was carried out using a Perkin-Elmer GeneAmp 9600 thermal cycling machine, with MicroAm tubes, under the following conditions ⁇ denaturation for 5 minutes at 94°C, 30 cycles of 30 seconds at 94 °C, 30 seconds at 55°C and 30 seconds at 72°C, and a final extension step of 7 minutes at 72°C, using the pairs of primers :
  • chuA.l (5'-GACGAACCAACGGTCAGGAT-3' ; SEQ ID NO: 160) and ChuA.2 (5'-TGCCGCCAGTACCAAAGACA-3' ; SEQ ID NO: 161), for chuA
  • yjaA.l (5'-TGAAGTGTCAGGAGACGCTG-3' ; SEQ ID NO: 162) and yjaA.2 (5'-ATGGAGAATGCGTTCCTCAAC-3' ; SEQ ID NO: 163), for yjaA and
  • TspE4C2.1 (5'-GAGTAATGTCGGGGCATTCA-3' ; SEQ ID NO: 164) and TspE4C2.2 (5' -CGCGCCAACAAAGTATTACG-3' ; SEQ ID NO: 165) for TspE4C2 (SEQ ID NO: 119) .
  • These pairs of primers generate, by amplification, fragments of 279 bp, of 211 bp and of 152 bp, respectively.
  • the PCR steps were as follows: denaturation for 4 minutes at 94°C, 30 cycles of 5 seconds at 94°C and 10 seconds at 59°C, and a final extension step of 5 minutes at 72°C.
  • the Southern transfer was carried out by transfer by capillarity onto positively charged nylon membranes.
  • the hybridization was carried out at 65°C in 1% SDS/1M NaCl/50 mM Tris HC1, pH 7.5/1% of blocking agent (Boehringer Mannheim, Mannheim, Germany) .
  • the membranes were washed in 2x SSC for 15 minutes at room temperature; then in 2x SSC/0.1% SDS for 30 minutes at 65°C and finally, in O.lx SSC for 5 minutes at room temperature.
  • the detection of chemiluminescence was carried out according to the manufacturer's instructions (DIG Luminescence Detection Kit for nucleic acids, Boehringer Mannheim) .
  • the probes were produced by PCR according to the manufacturer' s instructions (PCR DIG Probe Synthesis Kit, Boehringer Mannheim) and using the primers and the amplification procedure described above for the standard protocol.
  • R.M phylogenic groups assessed by reference methods (MLEE or ribotyping)
  • Table 6 above shows the results obtained with the method according to the invention and the reference methods for the complete set of strains, according to phylogenic group.
  • the chuA gene is present in all the strains belonging to groups B2 and D, and absent from all the strains of groups A and Bl. This makes it possible to effectively separate groups B2/D from groups A/Bl.
  • the yjaA gene allows complete distinction between group B2 (100% positive strains) and group D (100% negative strains) .
  • the novel TspE4.C2 clone is present in all the strains of group Bl except 2, and absent from all the strains of group A. All the PCR results were confirmed by Southern hybridization.
  • the inventors have developed a PCR method for rapidly determining the phylogenic group of £7. coli strains. Using two genes, chuA and yjaA, and a novel DNA fragment named TspE4.C2 (SEQ ID NO: 119, cf. Example 1) , the phylogenic groups of 220 strains which had previously been assigned to phylogenic groups determined using known methods were determined. The precision of analysis obtained according to the invention exceeds 99% with respect to the grouping established with the methods of the prior art. In addition, the same results were observed with a technically very simple triplex PCR method which is used directly on the bacterial colonies.
  • the phylogenic characterization of the £7. coli strains based on a few genotypic or phenotypic characteristics, appeared, in the prior art, to be very difficult.
  • Such genotypic characteristics Presence or absence of a gene, for example
  • the gene must have been acquired, or have been deleted, when the group that i- characterizes emerged.
  • this same gene must have been "stabilized” so as to exclude any phenomenon of subsequent deletion or horizontal transfer towards bacteria belonging to other phylogenic groups.
  • recombination phenomena must be very rare in the candidate gene. In other words, the gene product must not be a target for natural selection, which would favour novel genetic recombinations.
  • ECOR 70 can be considered to be a "hybrid" strain in which a few housekeeping genes have nucleotide sequences which are common with ECOR strains of group A, and in which few other genes haver nucleotide sequences which are common with the ECOR strains of group Bl.
  • the phylogenic membership of ECOR 70 to group Bl is not clearly determined; it is, in any event, considered to be part of group A using the method according to the invention.
  • the invention has the advantage of not requiring the use of a reference collection such as the ECOR strain or another collection, which means that the analysis can be easily carried out in the laboratory and in particular for routine analyses.
  • the allocation to the phylogenic groups is unequivocal. Specifically, the 4 strains which, until now, were unclassified in the ECOR collection (E31, E37, E42 and E43) can be classified using the method of the invention; the first three strains belong to group D, and the fourth to group A. It can also be noted that all the sequences of the housekeeping genes studied in the latter strain appear to be characteristic of those found in the strains of group A.
  • the first use is, of course, the bioclinical use, taking into account the link which can be established between the phylogenic group and the possible virulence or dangerousness.
  • the second use corresponds to a biotechnological screening tool which makes it possible to. eliminate potentially pathogenic strains, i.e. strains which are highly dangerous, when candidate strains are being sought for cloning. It was possible to develop such screening tools in the prior art, for example for identifying the £7. coli K12 strains by PCR- or for detecting £7. coli strains which exhibit none of the virulence genes which were, until then, known due to a reverse dot blot procedure.
  • the method described herein has the notable advantage of allowing the identification of nonpathogenic strains other than £7. coli K12, and of being suitable for screening strains on a large scale.
  • EXAMPLE 6 Production of seventy CFT073+ K12- zones and thirty one RS218+/K12- zones
  • These clones are present with greater frequency in the ECOR £7. coli of group B2 and/or in the ECOR £7. coli of group D with respect to the ECOR £7. coli of group A (preferably 2 times greater, more preferably 3 times greater, even more preferably 3.5 times greater, and very preferably 4 times greater) . They are, in particular, present at a frequency greater than 10% in the ECOR £7. coli of group B2 and/or in the ECOR £7. coli of group D, and at a frequency of less than 25%, preferably less than 20%, more preferably less than 10%, and even more preferably less than 5% in the ECOR £7. coli of group A, the frequency observed in the ECOR £7. coli of group A always remaining less than that observed in the ECOR £7. coli of group B2 and/or in the ECOR £7. coli of group D.
  • the chromosomal position of each of these seventy CFT073+ K12- zones was then determined ("K12 coordinates" column in Table 8 below) with respect to the chromosomal map of £7. coli K12, which constitutes a reference in the field.
  • These seventy CFT073+ K12- zones were then analysed in order to determine the possible presence of open reading frames (orf/ORF) using programs such as CodonUseTM (£7. coli codon use) and ORF Finder (www : _ncbi .nliti•.nih_gov/gqr;f/ orf._html, bacterial genetic code) .
  • ORFs The corresponding protein sequences (ORFs) were extracted compared to various databases in order to search for homology compared to known sequences.
  • the protein sequence (ORF) and polynucleotide sequence (orf) of the open reading frame identified is shown, indicating the start and end positions (Pos.) of these. frames with respect to the complete sequence of the zone concerned.
  • Some of these ORFs are encoded by orfs which are, in fact, located on the strand complementary to the indicated sequence of the region (the "frame start" position then has a number which is higher than that of the "frame end” position) .
  • Table 7 summarizes the SEQ ID NOS assigned to each of the seventy CFT073+ K12- zones and to their ORFs and orfs, and each thirty one RS218+/K12- zones.
  • zone 24 and its orfs/ORFs which, correspond to known products which are of known nature B2+ A-, none of the sequences identified herein is strictly identical to a sequence of the prior art (comparison of the nucleotide series over their entire length, independently of the function of the process) .
  • some homologies are found as it appears from table 9 above.
  • the CFT073+ K12- and RS218+/K12- zones isolated herein all comprise, in the vicinity of their chromosome, at least one DNA fragment which is of nature B2/D+ A- as defined above (ECOR B2 frequency and/or ECOR D frequency higher than ECOR A frequency SEQ ID NO 1 to NO. 550, the zones and orfs isolated herein are excellent candidates as polynucleotides which are of nature B2/D+ A-.
  • Verifying that the CFT073+ K12- or RS218+/K12- zones, ORFs and orfs isolated herein are effectively of nature B2/D+ A- can be easily carried out by those skilled in the art according to techniques which are conventional in the domain of phylogeny.
  • One way of making sure may, for example, consists in applying to at least one other E. coli strain of group B2 or D, for example E. coli C5, the procedure which has been described herein for the E. coli strain CFT073 and RS218, in such ' a way as to identify and isolate the zones, ORFs and orfs which are present in E. coli C5 and absent in E. coli K12 (or another E. coli strain of group A) .
  • sequences of these C5+ K12- zones, ORFs and orfs are then compared with those of the CFT073+ K12- and RS218+ K12- groups described herein, while searching for homologous sequences and sequence fragments.
  • This comparison can, for example, be carried out using the BLAST program (National Centre for Biotechnology Information, NCBI, Altschul et al . 1997, Nucleic Acids Res. 25: 3389- 3402), comparing each sequence of the C5+ K12- group with those of the CFT073+ K12- (or RS218+ K12-) group.
  • polynucleotide sequences, or fragments of polynucleotide sequence, with a low probability of the significant homology (for example, identity of 80% or more) being linked by chance can then be selected as being of nature B2/D+ A-, as defined above. If desired, the process can be repeated on a fourth or fifth E. coli strain of group B2 or D.
  • Such B2/D+ A- products are particularly useful for the phylogenic determination of E. coli .
  • the polynucleotides which are of nature B2/D+ A- and their inactivated isogenic mutants can be used, in the form of isolated DNAs (placed under the control of a eukaryotic promoter or in the ' form of DNA transfected into a cell) , as active principles in a vaccine composition intended to prevent, alleviate or combat an undesirable development of E. coli, and in particular a development of E. coli in an extra-intestinal compartment.
  • the ORFs themselves, administered in an optionally inactivated and immunogenic form are active principles which are very promising for the development of a vaccine composition intended to prevent, alleviate or combat an undesirable development of E. coli , and in particular a development of E. coli in an extra- intestinal compartment.
  • polynucleotides and polypeptides which are of nature B2/D+ A- can also be used as anti-pathogenicity targets (with extra-intestinal and non intra-intestinal targeting) / they make it possible to identify and isolate capable active principles which can be used in pharmaceutical compositions (medicinal products in particular) intended to inhibit the growth of an E. coli bacterium and, notably, its extra-intestinal development, by selecting, from candidate active principles, those which inhibit or block the correct transcription and/or translation of these zones or orfs, or which inhibit or block the activity of the ORFs.
  • the present application is aimed towards each of these CFT073+ K12- and RS218+/K12- zones, numbers 1 to 23, 25 to 48, and 49 to 101, also their ORFs and orfs, as products. It is also aimed towards any vaccine or pharmaceutical composition intended to prevent, alleviate or combat an undesirable development of E. coli, and in particular an extra-intestinal development
  • E. coli which comprises at least one of said zones or at least one of said orfs and ORFs.
  • zones, ORFs and orfs which are both CFT073+ K12- and RS218+ K12-.
  • the applications targeted concern a development of E. coli in an extra-intestinal compartment (systemic and non- diarrhoeal E. coli infection)
  • those of said CFT073+ K12-, RS218+ K12- zones, orfs and ORFs which are also 0157 :H7- are then preferred.
  • sequence or sequence fragment derived from the set obtained in i. and a sequence or sequence fragment obtained in ii. being due to chance are sequence or sequence fragment derived from the set obtained in i. and a sequence or sequence fragment obtained in ii. being due to chance.
  • the present application is also aimed towards the possible orfs identifiable on such a polynucleotide, and the ORFs and polypeptides encoded by these orfs .
  • the B2/D+ A- DNAs isolated according to the invention are useful as active principles in the context of a vaccine composition intended to prevent, alleviate or combat an undesirable development of E. coli , and in particular a development of E. coli in an extra- intestinal compartment (systemic and non-diarrhoeal E. coli infections) . They can then be used in an isolated ⁇ ("naked" DNA) form or in the form of DNA transfected into a cell chosen for its physiological innocuity and its capacity to secrete the polypeptide encoded by the transfected B2/D+ A- fragment.
  • polypeptides encoded by the DNAs which are of nature B2/D+ A- can also be used as active principles in the context of such vaccine compositions. They are then used in an inactivated and immunogenic form.
  • E. coli Examples of extra-intestinal development of E. coli comprise, in particular, septicaemias, pyelonephritis, and meningeal infections in newborns. Recognized or potential hospital-acquired infections are most commonly the product of such extra-intestinal developments of E. coli .
  • Region 5 appears to be an • advantageous source for manufacturing broad- spectrum anti-B2-coli products.
  • regions 1, 3 and 4 they appear to be advantageous sources for manufacturing products against septicaemia with meningitis and in particular against meningitis in newborns and in adults.
  • the invention is aimed in particular towards the compositions, in particular ' • the vaccine and pharmaceutical compositions, comprising at least one of said novel DNAs or at least one of said novel ORFs or at least one polypeptide the sequence of which can be considered to correspond, according to the universal genetic code and taking into account the degeneracy of this code, to one of these DNAs or ORFs. It is also aimed towards the use of the known DNAs which are of novel nature B2/D+ A-, such as chuA, the chuA fragments which are of nature B2/D+ A-, and of the polypeptides corresponding to these DNAs, for manufacturing such compositions .
  • DNAs or polypeptides which are of nature B2/D+ A- can be used for screening chemical and/or biological libraries in such a way as to identify compounds capable of binding to them and of blocking an undesirable development of E. coli bacteria
  • compositions can be developed and adjusted by those skilled in the art as a function of the medical indication targeted, of the method of administration desired, and of the patient under consideration (age, weight, sex, condition) .
  • compositions can also comprise- one or more physiologically inert vehicles, and in particular any excipient suitable for the pharmaceutical formulation and/or for the method of administration desired
  • B2/D+ A- compound can also comprise one or more 'active co-agent (s) in order to modify the intensity of activity of the B2/D+ A- compound according to the invention, or to modify its system of activity, or alternatively to modify the targeting of its activity.
  • s 'active co-agent
  • They can also comprise other agents useful for preventing and/or alleviating and/or treating an infection, without these agents having any interaction with the B2/D+ A- compounds according to the invention.
  • the polynucleotides and polypeptides which are of nature B2/D+ A- according to the invention are used for identifying, for example by screening chemical and/or biological libraries, compounds capable of inhibiting, in vivo or under conditions mimicking as closely as possible the in vivo state, the activity of a protein the ORF of which comprises a polynucleotide which is of nature B2/D+ A- according to the invention.
  • Such compounds can, in particular, be used in compositions, in particular pharmaceutical compositions, intended to inhibit the growth of E. coli , and in particular its extra-intestinal growth.
  • a subject of the present invention is such an identification method, the compounds as obtained by such a method, and such compositions.
  • Example 8 Animal models for studying E. col ⁇ virulence
  • Newborn rats (Sprague Dawley - January breeding) after 24 hours of acclimatisation in animal houses were infected when 5 days old.
  • the injected inoculum was prepared from dilutions in physiological serum in a nutritive culture medium of 2 h.
  • the animals were infected by intra-peritoneal route after anesthesia with ether, then put back to their mother after- randomization by brood of ten.
  • the numeration of the bacteraemia at 18h was obtained by taking of 5 ⁇ l otf blood after incision of the tail.
  • mice Mortality of adult mice (%) at D7 after intra peritoneal injection of 3 strains of E. coli .
  • the inoculum was prepared as above-mentioned and the injection made by the intra peritoneal route.
  • E. coli B2 responsible for neonatal infection ' (C5 strain) are capable inducing a bacteraemia after injection of a weak inoculum in the newborn rat and that the mortality in mice is 10 times higher.
  • CFT073 i.e. the E. coli B2 responsible for sepsis in the adult, it is necessary to use a stronger inoculum. But said coli is still pathogenic in mice.
  • An E. coli of group A is not .pathogenic even in mice.

Landscapes

  • Health & Medical Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Medicinal Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Analytical Chemistry (AREA)
  • Pharmacology & Pharmacy (AREA)
  • General Chemical & Material Sciences (AREA)
  • Animal Behavior & Ethology (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Public Health (AREA)
  • Veterinary Medicine (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Wood Science & Technology (AREA)
  • Biomedical Technology (AREA)
  • Neurology (AREA)
  • Neurosurgery (AREA)
  • Genetics & Genomics (AREA)
  • Zoology (AREA)
  • Molecular Biology (AREA)
  • Biochemistry (AREA)
  • Biophysics (AREA)
  • Urology & Nephrology (AREA)
  • Communicable Diseases (AREA)
  • Psychiatry (AREA)
  • Hospice & Palliative Care (AREA)
  • Physics & Mathematics (AREA)
  • Biotechnology (AREA)
  • Immunology (AREA)
  • Microbiology (AREA)
  • Oncology (AREA)
  • Gastroenterology & Hepatology (AREA)
  • General Engineering & Computer Science (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)

Abstract

The present invention relates to products which are of nature B2+ A-, isolated from E. coli, and to their biological applications, in particular their medical (therapeutic, vaccine and diagnostic) and biotechnological applications. In the present application, the expression 'of nature B2+ A-' is intended to mean presence at a frequency greater than 10 % among the E. coli strains of group B2 of the ECOR collection, and at a frequency of less than 10 % among the strains of group A of the same collection. A phylogenic determination method which makes it possible to rapidly and easily distinguish the groups A, B1, B2 and D of the E. coli species with more than 99 % precision is in particular described.

Description

Polynucleotides which are of nature B2/D+ A- and which are isolated from E. coli , and biological uses of these polynucleotides and of their polypeptides
The present invention relates, in general, to polynucleotides which are of nature B2/D+ A- and which are isolated from E. coli, and to the biological uses, in particular medical and biotechnological uses, of these polynucleotides and of the polypeptides which they encode. In the present invention, the expression "the nature B2/D+ A-" is intended to mean that the polynucleotide is present with greater frequency such as observed in ECOR E. coli of group B2 (ECOR B2 frequency) and/or in ECOR E. coli of group D (ECOR D frequency) , with respect to the frequency observed in ECOR E. coli of group A (ECOR A frequency) . Preferably, the ECOR B2 frequency and/or the ECOR D frequency is greater than the ECOR A frequency by. a factor of 2, preferably of 3, more preferably of 3.5, very preferably by a factor of 4. The set of polynucleotides which are of nature B2/D+ A-, provided by the present invention, comprises in particular products the ECOR B2 frequency and/or the ECOR D frequency of which is greater than 10%, and the ECOR A frequency of which is less than 25%, preferably less than 20%, more preferably less than 10%, and very preferably less than 5%, while always remaining less than the ECOR B2 frequency and/or the ECOR D frequency.
The ECOR collection is a collection which represents the genetic diversity of the E. coli species; it is available from banks such as the ATCC.
The E. coli species is currently divided into four main phylogenic groups termed groups A, Bl, B2 and D. Currently known techniques for determining the phylogenic group of a given E. coli strain comprise multilocus enzymatic electrophoresis, or MLEE, and ribotyping. Descriptions of these techniques are in particular described in Herzer et al . 1990 J. Bacteriol. 172:6175-6181 and in Selander et al . 1986 Appl. Environ. Microbiol. 51:873-884 for MLEE, and in Bingen et al . 1994 Clin. Microbiol. Rev. 7:311-317, Bingen et al . Clin. Infect. Dis. 22:152-156, Bingen et al . J. Infect. Dis. 177:642-650 and in Desjardins et al . 1995 J. Mol. Evol. 41:440-448, for ribotyping. Briefly, MLEE is based on the analysis of the migration polymorphism of bacterial enzymes. For each strain, a large number of enzymes characteristic of the species (often greater than or equal to 20) are characterized by their electrophoretic mobility. The existence of migration variants for these enzymes makes it possible to characterize each strain by an electrophoretic type. With regard to ribotyping, it.άs based on the analysis of the restriction polymorphism of the chromosomal regions which include the genes encoding the 16S and 23S RNAs. This polymorphism is revealed using a labelled cDNA probe, prepared from the 16S and 23S RNA of -257. coli, which can be hybridized to the DNA of the strain studied, after digestion of this DNA with various restriction enzymes and Southern transfer.
The techniques of the prior art are, however, complicated to carry out for industrial-type applications, such as analyses in a medical environment or large scale screening of strains of interest for biotechnological techniques such as cloning. In addition, these techniques are long and require the availability of a collection of reference strains. A collection termed ECOR has, moreover, been constituted in such a way as to represent the genetic diversity of the E. coli species. This collection is available in particular from the ATCC (ATCC No. 35320 to No. 35391) . It comprises:
- 25 E. coli strains of group A, - 16 .57. coli strains of group Bl,
- 15 E. coli strains of group B2, - 12 -57. coli strains of group D, and
- 4 E-7- coli strains not assigned to one of the four groups A, Bl, B2 and D.
In order to provide a more effective solution to the problem of the phylogenic determination of a given E. coli strain, the present invention makes the original and pertinent choice of isolating the entire set of its polynucleotides which are of nature B2/D+ A-, as defined above. Such a solution has never, to the Applicant's knowledge, been proposed. Now, the present invention demonstrates that this particular choice not only makes it possible to solve the problem of the phylogenic determination of a given E. coli strain, but also makes it possible to detect and to treat an undesired development of i57. coli, and more particularly a development of E. coli in a human or animal compartment which is extra-intestinal
(systemic and non-diarrhoeal infections, such as septicaemia, pyelonephritis, or meningitis in the newborn) . The present invention in fact provides, in addition to the diagnostic and therapeutic means directed against E. coli in general, specific diagnostic and therapeutic means which have the advantage of distinguishing between E. coli capable of infecting the extra-intestinal compartment (J57. coli of group B2 and D) and -57. coli which are part of the normal physiological flora of humans and animals. Besides the less deleterious effects that such treatments offer patients suffering from an extra-intestinal E. coli infection, these specific diagnostic methods and treatments according to the invention make it possible to limit the development of bacterial resistances which are more and more frequently observed as broad-spectrum antibiotic treatments are used.
The inventors have therefore developed means which provide access to the entire set of polynucleotides which are of nature B2/D+ A-, as defined above. To the Applicant's knowledge, this is the first description of such means and the first description of such a set of products. These means also demonstrate that there are genomic regions which are of nature B2/D+ A-.
In the scientific literature, a few products have been individually described as being present in one or more -57. coli strains of group B2 or D, and as being generally absent from one or more E. coli strains of A. It is in particular the case of the sfa, ibelO , hly, pap, prs and kps genes, of the genes involved in sorbose metabolism. The reliability of these products as phylogenic markers has, however, never been demonstrated. In any event, there is nothing to indicate that these products are capable of effectively distinguishing between the various phylogenic groups of E. coli . In addition, these products have been isolated separately, with the observation of their presence in one or more J57. coli strains of group B2 or D, and of their absence in the few E. coli strains of group A which have been tested, being made a posteriori. Those skilled in the art did not, therefore, in the prior art, have means for obtaining other products possibly having the nature B2/D+ A-.
A concept which is common to the various aspects of the invention corresponds to producing phylogenic markers of i57. coli by choosing to isolate the set of the chromosomal and plasmid DNA fragments of E. coli which are of nature B2/D+ A- as defined above, and developing, for this purpose, a method which is capable of allowing the entire set of these fragments to be isolated.
The method used consists in subtracting the polynucleotide population of one or more E. coli strain (s) of group A, randomly sheared, from the polynucleotide population of one or more E. coli strain (s) of group B2 or D, cleaved with a restriction endonuclease allowing fragments comprising from 100 to 1 500 bp approximately, with a mean of 300 to 500 bp approximately, to be produced, such as 5au3AI, Tsp509I, Mspl or Maell (any restriction enzyme with short restriction sites, for example for bp) . This method makes it possible to obtain a library of products which are of nature B2/D+ A- as defined above. Example 1 below gives an illustration thereof, for which the i57. coli strain subjected to subtraction is the i57. coli strain C5. This particular strain is classified, according to reference techniques, in the phylogenic group B2, yet it comprises, in particular at the plasmid level, DNAs of D origin. The specific choice of E. coli C5 therefore has the advantage of isolating polynucleotides which are of nature B2+ A- and/or D+ A-, using a single strain.
Such a subtraction can be repeated over and over until the desired level of exhaustiveness is obtained (the level of exhaustiveness of the library can be estimated by searching for the presence or absence of products known to have the nature B2/D+ A-, such as sfa, ibelO ) . If the intention is to increase the level of exhaustiveness, it is desirable to use different restriction enzymes between the various subtraction repetitions. Examples of implementation of such a method are in particular described in Example 1 which follows.
Various subtractive methods for isolating DNAs present in one bacterium and absent in another bacterium have been described in the prior art. Before the present invention, such a subtractive method had, however, never been applied to the .57. coli species, nor had it been envisaged that such a method may provide a solution to the problem of identifying phylogenic markers in general, and to the problem of identifying phylogenic markers of the E. coli species in particular. What is more, the present invention proposes, for the first time, to apply a subtractive method to a specific choice of E. coli strains: firstly, an isolate of E. coli of group B2, associated with neonatal meningitis (E. coli C5 assigned to the phylogenic group B2 and comprising, in particular at the plasmid level, DNAs of D origin) , and secondly, E. coli strains of group A from the ECOR collection.
The present invention therefore offers the means for obtaining all of the products which are of nature B2/D+ A-. As such, the polynucleotides targeted by the present application correspond to the set of polynucleotides which are of novel nature B2/D+ A- . Some of these polynucleotides correspond to products which are novel in themselves (SEQ ID NO: 1 to NO: 153, sequences corresponding to regions 1, 3, 4 and 5 identified in the following examples, orf sequences containing these sequences) . Others correspond to products which have homologies with known products, but which are of novel nature B2/D+ A- (in particular SEQ ID NO: 170, 171, 174,
175, 178, 179, 183, 185, 186, 190, 191, 193, 194, 195,
196, 199, 200, 202, 205, 206, 208, 209, 211, 214, 218,
220, 221, 233, 234, 235, 241, 244, 246, 247, 248, 250, αhuA gene and fragments of the chuA gene which have conserved this nature B2/D+ A- such as SEQ ID NO: 241, 195, 185 or 248) .
A subject of the present application is therefore any isolated polynucleotide which is of novel nature B2/D+ A- and which corresponds to a novel product in itself. It is in particular aimed towards any isolated polynucleotide the sequence of which corresponds to a sequence chosen from the group consisting of :
- the sequences SEQ ID NO: 1 to NO: 153,
- the polynucleotide sequences which can be obtained by digesting the total DNA of an E. coli of group B2 or group D (such as, for example, an E. coli strain isolated from the blood of a patient suffering from a septicaemic or meningeal infection) with a restriction enzyme chosen from the group consisting of otl and Blnl , selecting those of the fragments obtained which hybridize with at least one sequence chosen from the group consisting of SEQ ID NO: 134, 144, 109, 115, 140, 135, 33, 56, 122, 130, 141, 25, 48, 51, 57, 121, 44, 45, 113, 119, 120, 23, 52, and
- the sequences of the orfs (open reading frames) which contain one of these sequences, - the nature-conserving variant or fractional sequences of these sequences.
The production of this polynucleotide group is described in detail in the examples which follow. It corresponds to the polynucleotides which are of novel nature B2/D+ A- and which are also novel as products. Besides the isolated fragments SEQ ID NO:l to NO: 153, it comprises the novel regions 1, 3, 4 and 5 identified according to the invention, and the orfs which can be identified on these polynucleotides (cf. examples) .
The expression "nature-conserving sequence of a sequence" is herein intended to mean any sequence which has conserved the nature B2/D+ A- (as defined above) of the polynucleotide corresponding to this parent sequence. The term "variant" is herein intended to mean any sequence having nucleotide insertions and/or deletions and/or substitutions with respect to the parent sequence, this includes any sequence which is complementary to the parent sequence. The term "fractional" is herein intended to mean any fragment of the parent sequence.
In the present application, any strain is considered to be E. coli if it can be considered to belong to the E. coli species according to the criteria given by Bergey' s Manual of systemic bacteriology { cf. in particular Volume 1) .
Similarly, any E. coli strain is considered to belong to the group B2 or to the group A if it is considered as such after carrying out a reference phylogenic test suitable for discriminating between B2/D and A, such as for example multilocus enzymatic electrophoresis (MLEE) and/or ribotyping. Such phylogenic techniques are well known to those skilled in the art. Examples of suitable protocols are in particular described in Herzer et al . 1990 J. Bacteriol. 172:6175-6181 and in Selander et al . 1986 Appl. Environ. Microbiol. 51:873-884 for MLEE, and in Bingen et al . 1994 Clin. Microbiol. Rev. 7:311-317, Bingen et al . Clin. Infect. Dis. 22:152-156, Bingen et al . J. Infect. Dis. 177:642-650 and in Desjardins et al . 1995 J. Mol. Evol. 41:440-448 for ribotyping.
Examples of such determinations are given in the examples which follow.
The fact of being or of not being present in a given bacterial strain or sample corresponds to a notion which is common to those skilled in the art in the field. In order to determine whether a given polynucleotide is, or is not, present in a given E. coli strain, the determination can in particular be carried out by Southern transfer of the nucleotide population of said E. coli strain, and bringing this transfer into contact with the polynucleotide tested, or with a probe derived from this polynucleotide, under conditions suitable for polynucleotide hybridization reactions. Examples of such probes and such conditions are given in Example 1. A positive hybridization will then be interpreted as the presence of said polynucleotide in said E. coli strain, and a hybridization which is negative or insignificant with respect to the background noise will then be interpreted as the absence of said polynucleotide in said .57. coli strain. The determination can also be carried out by PCR detection of a positive or negative amplification using amplification primers constructed on the basis of said polynucleotide and placed in contact with the nucleotide population of said i57. coli strain, under conditions which are favourable for the amplification, using these primers, of a target out of this population. Illustrations of such PCR procedures are given in the examples which follow.
The majority of the polynucleotides which are of nature B2/D+ A- according to the invention are advantageously present in ECOR -57. coli of group B2 with a frequency greater than 10%, particularly greater than 40%, preferably greater than 50%, more preferably greater than 60%, and even more preferably greater than 70% { cf. Example 2 and Table 3) . Some of them are present in ECOR -57. coli of group B2 with a frequency greater than 80%, or even at a frequency equal to 100%, while at the same time being present in ECOR E. coli of group A at a frequency of less than 5%, or even less than 3%, or even equal to 0%.
The polynucleotides which are of nature B2/D+ A- according to the invention can be used for simply detecting the presence of E. coli bacteria of the group B2 or D with a probability of at least 90% (presence of said polynucleotides) . Taken in combination together, or with other products, they make it possible to completely distinguish between the groups A, Bl, B2 and D of i57. coli { cf. Example 5) . They also have therapeutic, palliative and/or preventive applications of interest. These polynucleotides will be referred to hereinafter as: novel polynucleotides which are of novel nature B2/D+ A-.
The present application is also aimed towards any pair of primers which allows the PCR amplification of at least one novel polynucleotide which is of novel nature B2/D+ A- as defined above. The setting up of suitable experimental PCR conditions is accessible to those skilled in the art { cf. in particular Molecular cloning - a laboratory manual, Sambrook, Fritsch, Maniatis, and in particular Vol. 2, Chap. 14 2nd edition; cf. Ausubel et al . 1989 Current protocols in molecular biology; John Wiley and Sons Ed., and in particular Vol. 2 Chap. 15). Examples thereof are given in the examples which follow.
It is in particular aimed towards the pair of primers corresponding to the pair SEQ ID NO: 164 and NO: 165. This specific pair of primers makes it possible to amplify the fragment SEQ ID NO: 119 (clone TspE4C2) .
The present application is also aimed towards any nucleotide probe comprising at least one novel polynucleotide which is of novel nature B2/D+ A- as defined above. It is in particular aimed towards any probe as obtained using at least one pair of primers according to the invention, in particular by PCR amplification { cf. examples), such as (SEQ ID NO: 160; 161), (SEQ ID NO: 162; 163) or (SEQ ID NO: 164; 165) .
In general, any pair of primers which allows the PCR amplification, under conditions as mentioned above, of at least one polynucleotide which is of novel nature B2/D+ A-, and also any probe directed against such a polynucleotide, and any antibody directed against a polypeptide encoded by such a polynucleotide, can, in accordance with the present invention, be used for the phylogenic determination of a bacterium of the E. coli species. The present invention also demonstrates that the y'aA gene (SEQ ID NO: 254; protein of SEQ ID NO: 255) is a phylogenic tool of interest for E. coli . The use, for the phylogenic determination of E. coli, of any product which makes it possible to detect y aA { cf. Example 5) therefore enters into the domain of the present application. It is in particular aimed towards any pharmaceutical composition comprising such products, such as a pair of primers (e.g. SEQ ID NO: 162; NO: 163), a probe or a specific antibody.
The present application is also aimed towards any antisense polynucleotide, characterized in that its sequence corresponds to the antisense sequence of at least one novel polynucleotide which is of novel nature B2/D+ A- and any isolated polypeptide, characterized in that its amino acid sequence corresponds to a sequence encoded, according to the universal genetic code and taking into account the degeneracy of this code, by at least one novel polynucleotide which is of novel nature B2/D+ A-.
The present application is also aimed towards any combination of the polypeptides encoded by the polynucleotides of the invention with a binding product, capable of binding to at least one novel polypeptide which is of novel nature B2/D+ A-, and which will inhibit an important biological function, such as the extra- intestinal growth by growth in serum, or the multiplication in animal, such as disclosed in example 6. Such products correspond in particular to antibodies or monoclonal antibodies and compounds inhibiting the biological function of the polypeptides. Methods for manufacturing such binding products using the polypeptides according to the invention are available to those skilled in the art. They are conventional methods which comprise, in particular, the immunization of animals such as rabbits and the harvesting of the serum produced, followed optionally by the purification of the serum obtained. A technique suitable for the production of monoclonal antibodies is that of Kohler and Milstein (Nature 1975, 256:495-497) .
In the present invention, the expression "capable of binding" is intended to mean physiological-type conditions
{in vivo or mimicking in vivo) when said binding product is intended to be administered to a human or animal organism, and ELISA-type conditions when said binding product is intended to be used in assays and methods in vi tro, for example for determining the phylogenic group of an E. coli bacterium.
The present application is also aimed towards any vector comprising at least one novel polynucleotide which is of novel nature B2/D+ A-, and also any cell transformed by genetic engineering, characterized in that it comprises, by transfection, at least one novel polynucleotide which is of novel nature B2/D+ A- and/or at least one vector according to the invention, and/or in that said transformation induces the production by this cell of at least one polypeptide corresponding to a novel polynucleotide which is of novel nature B2/D+ A-.
A subject of the present application is also any composition, in particular any pharmaceutical composition, comprising at least one compound chosen from the group consisting of the novel polynucleotides which are of novel nature B2/D+ A-, the polypeptides corresponding to at least one novel polynucleotide which is of novel nature B2/D+ A-, and the vectors and cells according to the invention, mentioned above. Such compositions are in particular useful for treating and/or for alleviating and/or for preventing E. coli infections, and in particular of infection by extra-intestinal E. coli . (systemic and non-diarrhoeal infections) . When said compound is immunogenic or is made immunogenic, these compositions can correspond to vaccines.
A subject of the present application is also any composition, in particular any pharmaceutical composition, comprising at least one compound chosen from the group consisting of the novel polynucleotides which are of novel nature B2/D+ A-, the pairs of primers and probes according to the invention, mentioned above, and the binding products according to the invention, mentioned above. Such a composition is in particular useful for the phylogenic determination of an E. coli bacterium. It can therefore correspond to a diagnostic kit or composition.
A subject of the present application is also any composition, in particular any pharmaceutical composition, comprising, in association with an inert carrier, at least one compound chosen from the group consisting of the antisense polynucleotides according to the invention, mentioned above, and the binding products according to the invention, mentioned above. Such compositions can also be used for the phylogenic determination of an E. coli bacterium, and can take the form of a diagnostic kit or composition. They can also be used in a therapeutic capacity, in order to treat and/or to alleviate and/or to prevent any undesirable growth of £7. coli, such as a sanitary contamination, an E. coli infection, and in particular the presence of extra-intestinal E. coli { cf. examples) .
A subject of the present application is also any use of these products according to the invention for manufacturing such compositions. The present invention also provides products for which the sequence exhibits homologies with known products, but for which the nature B2/D+ A- is novel. There are in particular the sequences SEQ ID NO: 170, 171, 174, 175, 178, 179, 183, 185, 186, 190, 191, 193, 194, 195, 196, 199, 200, 202, 205, 206, 208, 209, 211, 214, 218, 220, 221, 233, 234, 235, 241, 244, 246, 247, 248, 250, the orfs (open reading frames) containing these sequences, the chuA gene, the operon of this gene and of its nature-conserving fragments and variants, and the polynucleotides corresponding to a region 2, 6a and 6b of E. coli as described in Example 1 which follows, with the exception of the polynucleotides sfa, hlyr cnfl f pap/prs, hra and ibe 10. These products will be referred to hereinafter as: products which exhibit homology but which are of novel nature B2/D+ A-.
The chuA gene is a known product (Genbank Accession No. U67920) : it has been described in a few E. coli responsible for intestinal and extra-intestinal infections, in Shigella as a gene involved in iron metabolism, and has been described as being homologous to a Yersinia gene. However, it has never been described as a phylogenic marker. Now, the present invention reports, for the first time, its presence in the urophathogenic E. coli strain J96 and reports, for the first time, that it is present in 100% of the B2s and of the Ds and in 0% of the
As and of the Bis of the ECOR collection. Four fragments of chuA which are of nature B2/D+ A- are also described
(SEQ ID NO: 241, 195, 185 and 248) .
The majority of the polynucleotides according to the invention which exhibit homology but which are of novel nature B2/D+ A- are advantageously present in ECOR E. coli of group B2 with a frequency greater than 40%, preferably greater than 50%, more preferably greater than 60% and even more preferably greater than 70%. Some of them are present in ECOR E. coli of group B2 with a frequency greater than 80%, or even at a frequency equal to 100%, while at the same time remaining present in ECOR E. coli of group A at a frequency of less than 5%, or even less than 3%, or even equal to 0% ( cf. Example 2) .
The present application is thus aimed towards any use of a compound which is of novel nature B2/D+ A- for the phylogenic determination of an E. coli bacterium, for the diagnosis of the presence or absence of undesirable J57. coli bacteria, such as contaminant E. coli or extra- intestinal .57. coli, and/or for the diagnosis of an E. coli infection, and/or for the manufacture of a composition intended for such a phylogenic determination and/or for such a diagnosis.
It is in particular aimed towards any use of at least one compound chosen from the group consisting of:
- the polynucleotides corresponding to the chuA gene and to its operon,
- the polynucleotides the sequence of which corresponds to SEQ ID NO: 170, 171, 174, 175, 178, 179, 183, 185, 186, 190, 191, 193, 194, 195, 196, 199, 200, 202, 205, 206, 208, 209, 211, 214, 218, 220, 221, 233, 234, 235, 241, 244, 246, 247, 248 and 250,
- the polynucleotides which can be obtained by digesting the total DNA of an E. coli of group B2 or D with a restriction enzyme chosen from the group consisting of WotI and Blnl , and selecting those of the fragments obtained which hybridize to at least one sequence chosen from the group consisting of SEQ ID NO: 125, 123, 116, 43, 40, 127, 133, 27, 34, 36, 42, 46, 54, 55, 38, 128 and 151, with the exception of the polynucleotides sfa , hlyr cnfl , pap/prsr hra and ibe 10,
- the polynucleotides the sequence of which corresponds to an orf comprising the sequence of one of the polynucleotides cited in the preceding three paragraphs,
- the polynucleotides the sequence of which corresponds to a nature-conserving variant or fractional sequence of the sequence of at least one of these polynucleotides (and in particular SEQ ID NO: 241, 195, 185 and 248),
- the pair of primers, the probes and the antisense polynucleotides, the sequence of which is as derived from these polynucleotides,
- the binding products which are capable of binding to a polypeptide encoded by a polynucleotide which is of novel nature B2/D+ A-, for the phylogenic determination of an E. coli bacterium.
The present invention is also aimed towards the use of at least one of these compounds for the diagnosis of the presence or absence of undesirable E. coli bacteria, such as contaminant J57. coli or extra-intestinal E. coli, and/or for the diagnosis of an E. coli infection. It is also aimed towards any use of at least one of these compounds for the manufacture of a composition intended for such a phylogenic determination and/or for such a diagnosis. These compounds cήuA, SEQ ID NO: 170, 171, 174, 175, 178, 179, 183, 185, 186, 190, 191, 193, 194, 195, 196, 199, 200, 202, 205, 206, 208, 209, 211, 214, 218, 220, 221/ 233, 234, 235, 241, 244, 246, 247, 248, 250, and the nature-conserving variant or fractional polynucleotides of these products correspond, in fact, to products which are of novel nature B2/D+ A- (as defined above) . The detection of the presence or absence of such compounds can in particular be carried out by nucleotide hybridization, by PCR amplification or by detection of their polypeptide products. Detection of the presence of such compounds makes it possible to conclude that the B2 or D i57. coli bacterium is present. The combined use of these compounds, or the use of one or more of these compounds combined with other products, such as in particular yjaA, makes it possible to refine the phylogenic allocation. The combined use of the detection of the presence or absence of chuA and of yjaA (SEQ ID NO: 254) makes it possible in particular to conclude as to whether £7. coli of group B2 and/or £7. coli of group D are present or absent ( cf. Example 5) .
The present application is also aimed towards any use of a compound which is of novel nature B2/D+ A- for the manufacture of a composition, in particular of a pharmaceutical composition, intended to alleviate and/or to prevent and/or to treat an undesirable growth of E. coli, such as an E. coli infection, the presence of extra-intestinal E. coli or a sanitary contamination. It is aimed in particular towards any use of at least one compound chosen from the group consisting of: - the polynucleotides corresponding to the α uA gene and to its operon,
- the polynucleotides the sequence of which corresponds to SEQ ID NO: 170, 171, 174, 175, 178, 179, 183, 185, 186,
190, 191, 193, 194, 195, 196, 199, 200, 202, 205, 206, 208, 209, 211, 214, 218, 220, 221, 233, 234, 235, 241, 244, 246, 247, 248 and 250,
- the polynucleotides which can be obtained by digesting the total DNA of an E. coli of group B2 or D with a restriction enzyme chosen from the group consisting of WotJ and Blnl, and selecting those of the fragments obtained which hybridize to at least one sequence chosen from the group consisting of SEQ ID NO: 125, 123, 116, 43, 40, 127, 133, 27, 34, 36, 42, 46, 54, 55, 38, 128 and 151, with the exception of the polynucleotides sfa , hlyr cnfl , pap/prs, hra and ibe 10,
- the polynucleotides the sequence of which corresponds to an orf (open reading frame) comprising the sequence of one of the polynucleotides cited in the preceding three paragraphs,
- the polynucleotides the sequence of which corresponds to a nature-conserving variant or fractional sequence of the sequence of at least one of these polynucleotides (and in particular SEQ ID NO: 241, 195, 185 and 248),
- the polynucleotides which are antisense polynucleotides of the polynucleotides of this group,
- the polypeptides the sequence of which corresponds, according to the universal genetic code and taking into account the degeneracy of this code, to these polynucleotides,
- the binding products (for example, antibodies) capable of binding to a polypeptide encoded by a polynucleotide which is of novel nature B2/D+ A-,
- the vectors comprising at least one of these polynucleotides,
- the cells transformed by genetic engineering which comprise at least one of these polynucleotides and/or at least one of these vectors, and/or the transformation of which induces the production of at least one of these polypeptides, for the manufacture of a composition, in particular of a pharmaceutical composition, intended to alleviate and/or to prevent and/or to treat an undesirable growth of E. coli, such as an J57. coli infection, the presence of extra-intestinal E. coli or a sanitary contamination.
According to another applied aspect, the present application is aimed towards any method which makes it possible to identify a compound capable of inhibiting the growth of E. coli , and in particular of inhibiting its extra-intestinal development, which comprises the detection of at least one compound:
i. capable of binding to at least one polynucleotide chosen from the group consisting of :
- the novel polynucleotides which are of novel nature B2/D+ A-, i.e. • the sequences SEQ ID NO: 1 to NO: 153,
• the polynucleotide sequences which can be obtained by digesting the total DNA of an E. coli of group B2 or group D (such as, for example, an E. coli strain isolated from the blood of a patient suffering from a septicaemic or meningeal infection) with a restriction enzyme chosen from the group consisting of Wotl and Blnl, selecting those of the fragments obtained which hybridize with at least one sequence chosen from the group consisting of SEQ ID NO: 134, 144, 109, 115, 140, 135, 33, 56, 122, 130, 141, 25, 48, 51, 57, 121, 44, 45, 113, 119, 120, 23, 52 and sequencing selected fragments, and
• the sequences of the orfs (open reading frames) which contain one of these sequences,
• the nature-conserving variant or fractional sequences of these sequences,
- the polynucleotides corresponding to the chuA gene and to its operon,
- the polynucleotides the sequence of which corresponds to SEQ ID NO: 170, 171, 174, 175, 178, 179, 183, 185, 186,
190, 191, 193, 194, 195, 196, 199, 200, 202, 205, 206, 208, 209, 211, 214, 218, 220, 221, 233, 234, 235, 241, 244, 246, 247, 248 and 250,
- the polynucleotides which are as obtained by digesting the total DNA of an E. coli of group B2 or D with a restriction enzyme chosen from the group consisting of Wotl and Blnl , and selecting those of the fragments obtained which hybridize to at least one sequence chosen from the group consisting of SEQ ID NO: 125, 123, 116, 43, 40, 127, 133, 27, 34, 36, 42, 46, 54, 55, 38, 128 and 151, with the exception of the polynucleotides sfa , hlyr cnfl f pap/prSf hra and ibe 10,
- the polynucleotides the sequence of which corresponds to an orf comprising the sequence of one of the polynucleotides cited in the preceding four paragraphs, - the polynucleotides the sequence of which corresponds to a nature-conserving variant or' fractional sequence of the sequence of at least one of these polynucleotides and in particular (SEQ ID NO: 241, 195, 185 and 248),
and
ii. capable of specifically inhibiting the correct transcription and/or translation of this polynucleotide.
The present invention is also aimed towards any method which makes it possible to identify a compound capable of inhibiting the growth of an £7. coli bacterium, and in particular its extra-intestinal development, characterized in that it comprises the detection of at least one compound capable of inhibiting the activity of a protein encoded by a polynucleotide the orf of which comprises a polynucleotide chosen from the group consisting of :
- the novel polynucleotides which are of novel nature B2/D+ A- according to the invention, i.e.
• the sequences SEQ ID NO: 1 to NO: 153, • the polynucleotide sequences which can be obtained by digesting the total DNA of an E. coli of group B2 or group D (such as, for example, an E. coli strain isolated from the blood of a patient suffering from a septicaemic or meningeal infection) with a restriction enzyme chosen from the group consisting of Wotl and Blnl., selecting those of the fragments obtained which hybridize with at least one sequence chosen from the group consisting of SEQ ID NO: 134, 144, 109, 115, 140, 135, 33, 56, 122, 130, 141, 25, 48, 51, 57, 121, 44, 45, 113, 119, 120, 23, 52 and sequencing selected fragments, and
• the sequences of the orfs (open reading frames) which contain one of these sequences,
• the nature-conserving variant or fractional sequences of these sequences.
- the polynucleotides corresponding to the chuA gene and to its operon,
- the polynucleotides the sequence of which corresponds to SEQ ID NO: 170, 171, 174, 175, 178, 179, 183, 185, 186, 190, 191, 193, 194, 195, 196, 199, 200, 202, 205, 206, 208, 209, 211, 214, 218, 220, 221, 233, 234, 235, 241, 244, 246, 247, 248 and 250,
- the polynucleotides which are as obtained by digesting the total DNA of an -57. coli of group B2 or D with a restriction enzyme chosen from the group consisting of Wotl and Blnl , and selecting those of the fragments obtained which hybridize to at least one sequence chosen from the group consisting of SEQ ID NO: 125, 123, 116, 43, 40, 127, 133, 27, 34, 36, 42, 46, 54, 55, 38, 128 and 151, with the exception of the polynucleotides sfar hlyr cnfl , pap/prs, hra and ibe 10,
- the polynucleotides the sequence of which corresponds to an orf (open reading frame) comprising the sequence of one of these polynucleotides,
- the polynucleotides the sequence of which corresponds to a nature-conserving variant or fractional sequence of the sequence of at least one of these polynucleotides.
Such compounds can in particular be obtained by screening chemical and/or biological libraries.
The present application is also directed towards any compound as identified by one or other of these methods, and any composition, in particular any pharmaceutical composition, ' comprising at least one such compound. Such compositions are in particular useful for treating and/or for alleviating and/or for preventing an undesirable growth of E. coli, such as E. coli infections, or an E. coli contamination, and especially useful for treating and/or for alleviating and/or for preventing the presence of extra-intestinal E. coli bacteria.
A subject of the present invention is also, according to a notable aspect of the invention, phylogenic identification methods which implement the detection of the presence or absence of at least one of the polynucleotides which are of novel nature B2/D+ A- (whether these polynucleotides are novel as products, or whether they exhibit homologies with known products) : - the novel polynucleotides according to the invention, i.e.
• the sequences SEQ ID NO: 1 to NO: 153,
• the polynucleotide sequences which can be obtained by digesting the total DNA of an E. coli of group B2 or group D (such as, for example, an E. coli strain isolated from the blood of a patient suffering from a septicaemic or meningeal infection) with a restriction enzyme chosen from the group consisting of Wotl and Blnl, selecting those of the fragments obtained which hybridize with at least one sequence chosen from the group consisting of SEQ ID NO: 134, 144, 109, 115, 140, 135, 33, 56, 122, 130, 141, 25, 48, 51, 57, 121, 44, 45, 113, 119, 120, 23, 52 and sequencing selected fragments, and
• the sequences of the orfs (open reading frames) which contain one of these sequences,
• the nature-conserving variant or fractional sequences of these sequences,
- the polynucleotides corresponding to the chuA gene and to its operon,
- the polynucleotides the sequence of which corresponds to SEQ ID NO: 170, 171, 174, 175, 178, 179, 183, 185, 186, 190, 191, 193, 194, 195, 196, 199, 200, 202, 205, 206, 208, 209, 211, 214, 218, 220, 221, 233, 234, 235, 241, 244, 246, 247, 248 and 250, - the polynucleotides which can be obtained by digesting the total DNA of an E. coli of group B2 or D with a restriction enzyme chosen from the group consisting of Wotl and Blnl, and selecting those of the fragments obtained which hybridize to at least one sequence chosen from the group consisting of SEQ ID NO: 125, 123, 116, 43, 40, 127, 133, 27, 34, 36, 42, 46, 54, 55, 38, 128 and 151, with the exception of the polynucleotides sfa r hlyr cnfl , pap/prs, hra and ibe 10,
- the polynucleotides the sequence of which corresponds to an orf (open reading frame) comprising the sequence of one of these polynucleotides,
- the polynucleotides the sequence of which corresponds to a nature-conserving variant or fractional sequence of the sequence of at least one of these polynucleotides.
This detection can be carried out by direct detection of said polynucleotide, or of its fragments, or by detection of one or more polypeptide (s) corresponding to it
(polypeptides encoded by a polynucleotide which is of novel nature B2/D+ A-) . In particular, the present application is aimed towards any phylogenic identification method characterized in that it comprises the use of at least one compound chosen from the group consisting of :
- the novel polynucleotides which are of novel nature B2/D+ A-, i.e.
the sequences SEQ ID NO: 1 to NO: 153, • the polynucleotide sequences which can be obtained by digesting the total DNA of an E. coli of group B2 or group D (such as, for example, an E. coli strain isolated from the blood of a patient suffering from a septicaemic or meningeal infection) with a restriction enzyme chosen from the group consisting of Wotl and Blnl, selecting those of the fragments obtained which hybridize with at least one sequence chosen from the group consisting of SEQ ID NO: 134, 144, 109, 115, 140, 135, 33, 56, 122, 130, 141, 25, 48, 51, 57, 121, 44, 45, 113, 119, 120, 23, 52 and sequencing selected fragments, and
• the sequences of the orfs (open reading frames) which contain one of these sequences,
• the nature-conserving variant or fractional sequences of these sequences,
- the polynucleotides corresponding to the chuA gene and to its operon,
- the polynucleotides the sequence of which corresponds to SEQ ID NO: 170, 171, 174, 175, 178, 179, 183, 185, 186, 190, 191, 193, 194, 195, 196, 199, 200, 202, 205, 206, 208, 209, 211, 214, 218, 220, 221, 233, 234, 235, 241, 244, 246, 247, 248 and 250,
- the polynucleotides which are as obtained by digesting the total DNA of an E. coli of group B2 or D with a restriction enzyme chosen from the group consisting of Wotl and Blnl, and selecting those of the fragments obtained which hybridize to at least one sequence chosen from the group consisting of SEQ ID NO: 125, 123, 116, 43, 40, 127, 133, 27, 34, 36, 42, 46, 54, 55, 38, 128, 151 and 211, with the exception of the polynucleotides sfa, hly, cnfl , pap/prs , hra and ibe 10,
- the polynucleotides the sequence of which corresponds to a nature-conserving variant or fractional sequence of the sequence of a polynucleotide of this group (and in particular SEQ ID NO: 241, 195, 185 and 248),
- the polynucleotides the sequence of which corresponds to an orf comprising the sequence of a polynucleotide of this group,
- the pairs of primers which make it possible to amplify a polynucleotide which is of novel nature B2/D+ A-, as defined above,
- the probes comprising a polynucleotide which is of novel nature B2/D+ A-, as defined above,
- the polynucleotides which are antisense polynucleotides of a polynucleotide which is of novel nature B2/D+ A-, as defined above,
- the compounds as identified by a method according to the invention for identifying compounds capable of inhibiting the growth of £7. coli,
- the binding products which are capable of binding to a polynucleotide which is of novel nature B2/D+ A-, as defined above, - the binding products (for example, antibodies) capable of binding to a polypeptide encoded by a polynucleotide which is of novel nature B2/D+ A- as defined above.
This implementation can be carried out according to any technique known to those skilled in the art so as to allow said "at least one compound" to detect, in a biological sample, the presence or absence of the polynucleotide (s) or, where appropriate, the polypeptide (s) which constitute (s) the target thereof: ELISA for reactions of antibody-antigen type, Southern-type hybridization or PCR for reactions of polynucleotide hybridization and/or amplification type.
Suitable biological samples comprise, in particular, all samples of human origin originating from sites which are normally sterile (blood, cerebrospinal fluid -CSF-, liquid from an effusion, etc.) or non-sterile (stooTs, oropharynx, skin, etc.), and also samples of environmental origin (soils, water, etc.), of food origin and of animal origin, and microbiological cultures.
This method makes it possible to conclude, when said detection is positive, that £7. coli bacteria of group B2 and/or D are present. With only the detection of the presence or absence of chuA, this method makes it possible to conclude, when this detection is positive, that £7. coli bacteria of group B2 or D are present. When this detection is negative, and insofar as the sample effectively contains £7. coli, it makes it possible to conclude that bacteria of group Bl or A are present.
This method can also comprise the use of several of said compounds, for example in such a way as to detect the presence or absence of the chυK gene, and also that of the fragment SEQ ID NO: 119. It makes it possible then to conclude :
- when this detection is negative for cl.uA and positive for SEQ ID NO: 119, that £7. coli bacteria of group Bl are present,
- when this detection is negative for chuA and negative for SEQ ID NO: 119, that £7. coli bacteria of group A are present.
Examples of such methods and detection are given in the examples which follow. A pair of primers SEQ ID NO: 160 and NO: 161 which makes it possible to detect the presence or absence of the chuA gene, and a pair of primers SEQ ID NO: 164 and 165 which makes it possible to detect the presence or absence of SEQ ID NO: 119 (TspE4.C2) are described in particular.
The phylogenic identification method according to the invention can also implement the detection of the presence or absence of the yjaA gene. The yjaA gene { cf. Figure 5) is, in the prior art, known to be present in the £7. coli strain K12 (group A) . There is nothing in the prior art which would suggest that it may constitute a phylogenic marker capable of distinguishing between group B2 and group D.
Figure 5 shows the yjaA sequence (SEQ ID NO: 254) and that of the corresponding ORF (SEQ ID NO: 255) .
The detection of the presence or absence of the yjaA gene can take place by any technique accessible to those skilled in the art. It can in particular take place by detection of the polynucleotide or of its fragments (SEQ ID NO: 254), or by detection of the polypeptides which correspond to it (SEQ ID NO: 255) . The development of reagents which allow such a detection is accessible to those skilled in the art: amplification probes or primers for the detection of polynucleotides, compound of serum or antibody type for the detection of polypeptides. The polynucleotide hybridization reactions can, for example, be carried out by Southern and/or by PCR, and those of antigen-antibody type can be carried out by ELISA. Examples of implementation of such a detection step are given in the examples which follow. A pair of primers (SEQ ID NO: 162 and 163) which makes it possible to amplify yjaA are described in particular.
The present invention is also aimed towards any phylogenic identification method which implements at least the detection of the presence or absence of the yjaA gene. This detection makes it possible to distinguish between £7. coli of group B2 and £7. coli of group D { cf. examples) .
Notably, the present invention provides a phylogenic detection method which makes it possible to completely distinguish the £7. coli groups A, Bl, B2 and D. This method implements the detection of the presence or absence of the α uA gene, of the yjaA gene and of SEQ ID NO: 119. This method is described in detail in the examples. Advantageous PCR conditions are in particular are mentioned therein ("simplified PCR") .
Advantageously, this detection can be carried out by PCR amplification. Examples of suitable PCR conditions are given in a purely illustrative capacity in the examples. The present application is in particular aimed towards an identification method as defined above which implements the simultaneous detection of one or more of said polynucleotides which is/are of nature B2/D+ A-, and of yjaA, preferably by triplex PCR. Particularly advantageously, this method can be applied directly to the bacteria or to the sample analysed. It does not require the availability of a reference strain collection.
The present application is also aimed towards any method for detecting an undesirable growth of £7. coli, any method for diagnosing an £7. coli infection, any method for sanitary control or detection (foodstuffs, liquids intended for consumption, soils) and any method for selecting £7. coli strains suitable for biotechnological manipulations, which implement the phylogenic identification method according to the invention.
It is also aimed towards any kit for implementing a phylogenic identification, diagnostic and/or selection method according to the invention, possibly accompanied by instructions for use. Such kits can in particular comprise at least one of the compounds used in said method. They can also comprise instructions for use featuring one or more of the profiles given in Figure 4 { cf. Example 5), or describing one or more of these profiles.
An advantageous kit comprises at least one of the pairs of primers (SEQ ID NO: 160, 161), (SEQ ID NO: 162, 163) and (SEQ ID NO: 164, 165) , preferably two of these pairs, more preferably the three pairs of primers.
The present invention is illustrated by the examples which follow and which are given in a nonlimiting capacity. Other advantages and variants of implementation can be read by those skilled in the art in these examples. Such variants are targeted by the present application.
In these examples, reference is made to the following figures :
- Figure 1 represents the distribution of B2/D+ A- clones along the chromosome of the £7. coli strain RS218, - Figure 2 gives the sequence listing for isolated B2/D+ A- fragments, and indicates their respective SEQ ID NOS,
- Figure 3 illustrates a decision diagram for the phylogenic analysis of an £7. coli strain,
- Figure 4 represents the various phylogenic profiles obtained using a triplex PCR method according to the invention (lanes 1 and 2: group A; lane 3: group Bl; lanes 4 and 5: group D; lanes 6 and 7: group B2) ,
- Figure 5 represents the polynucleotide sequence of the yjaA gene (SEQ ID NO: 254) and the corresponding ORF sequence,
- Figure 6 represents the sequences of the CTF073+ K12- regions and RS218+ K12- regions obtained using the clones of Figure 2,
- Figure 7 represents the position of the genomic fragment specific to £7. coli CFT073 and RS218 strains on the chromosome of £7. coll K12 strain.
EXAMPLE 1: Production of a C5+ A- library and of a RS218+ K12- library
MATERIALS AND METHODS Bacterial strains. The strains used for this subtractive hybridization are an £7. coli isolated from the cerebrospinal fluid (CSF) of a newborn (£7. coli C5 of serotype 018:K1:H7; group B2) and two £7. coli strains of group A, ECOR4 and ECOR15, which belong to the ECOR collection (ATCC) . £7. coli C5 harbours several virulent factors, such as the Kl capsular antigen, an sfa operon, the ibe 10 gene, Pap pili and the haemolysin {hly) gene. This strain belongs to the phylogenic group B2. In addition, the ECOR4 and ECOR15 strains, which belong to the phylogenic group A, express no identified virulence factor. Other £7. coli strains were used: strain RS218
(serotype 018:K1:H7), an isolate from newborn CSF, which has been described in particular by Huang et al . 1995 (Infect. Immun. 63: 4470-4475) and which harbours the same virulence factors as the C5 strain; and the £7. coli K-12 laboratory strain MG1655 (group A) , the genome of which has recently been sequenced (Blattner et al . 1997, Science 277: 1453-1461) .
In addition, we used a set of 54 NMEC £7. coli which could be associated with neonatal meningitis, obtained from the CSF of newborns suffering from meningitis (age range: 1 to 28 days) and belonging to the phylogenic group B2. This population was compared with the 15 £7. coli strains of the phylogenic group B2 of the 72 strains from the ECOR collection, which are, themselves, not associated with meningitis. This collection is available from the ATCC
(ATCC No. 3520 to No. 35391) . These reference strains, isolated from various hosts and from various geographical sites are representative of the variation range in the £7. coli species, and are divided into four main phylogenic groups (A, Bl, B2 and D) , four of them being unclassified. The bacteria were cultured at 37°C in a Luria-Bertani broth or on Luria-Bertani agar. When necessary, ampicillin was used at a concentration of 100 μg per ml.
Southern transfer. The Southern transfers were carried out by capillary transfer onto positively charged nylon membranes. The hybridizations were carried out at 65°C in 1% sodium dodecyl sulphate (SDS) - IM NaCI - 50 mM Tris- HCI (pH 7.5) - 1% blocking reagent (Boehringer Mannheim, Mannheim, Germany) . The membranes were washed, first in
2x SSC (lx SSC corresponds to 0.15M NaCI + 0.015M sodium citrate) for 15 min at room temperature, and then in 2x SSC - 0.1% SDS for 30 min at 65°C and finally, in O.lx SSC for 5 min at room temperature. The ' 'detection by chemi- luminescence was carried out with the DIG luminescence detection kit for nucleic acid (Boehringer Mannheim) according to the manufacturer's instructions. The sfa and ije 10 probes were produced by PCR using primers and an amplification method described previously (Bingen et al . 1997, J. Clin. Microbiol. 35: 2981-2982).
Representational difference analysis. The chromosomal DNA of the ECOR strains was randomly sheared by repeatedly pushing it through a hypodermic needle so as to obtain fragments with a length of between approximately 3 and 10 kb. This digested DNA was purified by phenol extraction. The chromosomal DNA of £7. coli C5 was cleaved with the Sau3AI or Tsp5091 restriction endonuclease. This DNA (20 μg) was ligated with 10 nmol of hybridized oligonucleotides RBaml2 (5' -GATCCTCGGTGA-3' ; SEQ ID NO: 154) and RBam24 (5' -AGGACTCTCCAGCCTCTCACCGAG-3' ; SEQ ID NO: 155) or REcol2 (5' -AATTCTCGGTGA-3' ; SEQ ID NO: 156) and REco24 (5' -AGCACTCTCCAGCCTCTCACCGAG-3' ; SEQ ID NO: 157) when the restriction was carried out with SauAI or Tsp5091, respectively. The DNA was separated from the excess primers by electrophoresis in a 2% low melting point agarose gel. The portion of the gel containing fragments longer than 200 bp was excized and digested with β-agarase. This DNA was purified by phenol extraction.
For the subtractive hybridization (first round), 0.2 μg of £7. coli C5 DNA, ligated to the oligonucleotides, was mixed with 40 μg of fragmented ECOR4 or ECOR15 DNA in a total volume of 8 μl of 3x EE buffer (lx EE buffer corresponds to 10 mM W- (2-hydroxyethyl)piperazine-W- (3-propanesulphonic acid); 1 mM EDTA [pH 8.0]). The solution was covered with mineral oil, and the DNA was denatured by heating to 100°C for 2 min; 2 μl of 5M NaCI were added and the mixture was left to hybridize at 55°C for 48 h. The reaction mixture was diluted 10 times in preheated 3x EE buffer - IM NaCI and immediately placed in ice. A portion of the dilution (10 μl) was added to 400 μl of PCR reaction mixture (10 mM Tris HCl [pH 9.0], 50 mM KCI, 1.5 mM MgCl2, 0.1% Triton X-100, a 0.25 mM concentration of each deoxynucleotide triphosphate and 50 U of Taq polymerase per ml) and the whole mixture was incubated for 3 min at 70 °C in order to fill the ends of the re-hybridized £7. coli C5 fragments. After denaturation at 94°C for 5 min an addition of RBam24 or REco24 oligonucleotides (0.1 mmol per 100 μl) , the hybridizations were amplified by PCR
(30 cycles of 1 min at 94°C, 1 min at 70°C and 3 min at
72°C, and then 1 min at 94°C and 10 min at 72°C, in a GeneAmp 9600 thermocycler [Perkin-Elmer] ) . The PCR products were purified on a gel in order to separate the £7. coli C5 fragments from the primer and from the high molecular weight ECOR subtraction DNA. The second round of subtractive hybridization was carried out using 40 μg of fragmented ECOR4 or ECOR15 £7. coli DNA and 25 ng of DNA ligated to RBam24 or REco24 obtained from the first round. The products of the second round of subtraction were radiolabelled en masse and used as probe in Southern hybridization experiments, in order to verify that the amplified fragments were indeed unique to the DNA of the B2 strain and absent from the strains of group A. Thus, four subtractive libraries were produced.
Analysis of clones from the subtractive libraries. The DNA of the subtractive libraries was cloned into the BamHl
(5au3AI libraries) or £7coRl (_Tsp5091 libraries) sites of pUC19 (New England Biolabs, Beverly, Mass.), and then transformed into Epicurian coli XL2-Blue ultra-competent cells (Stratagene, La Jolla, Calif.). The inserts were amplified by PCR reactions carried out on transformed colonies, using the following primers: PI (5'-CATGCCTGCAGGTCGACTCT-3' ; SEQ ID NO: 158) and P2 (5'-CGTTGTAAAACGACGGCCAG-3'; SEQ ID NO: 159). The clones were named according to the following (in order) : the restriction enzyme used ("Tsp" or "Sau") , the strain used for the subtraction (E4 or E15) and an alphanumeric name.
(i) DNA sequencing. After purification of the PCR products by reversible immobilization on a solid phase, the purified PCR fragments were sequenced using the Big-Dye Terminator Cycle Sequencing Ready Reaction kit with AmpliTaq DNA polymerase (Perkin-Elmer) , on an ABI PRISM 377 XL automatic DNA sequencer (Perkin-Elmer) , according to the manufacturer's instructions. When problems in obtaining a sequence of good quality were encountered with a given primer, the sequencing reaction was carried out with the dGDT Big-Dye Terminator Cycle Sequencing Ready Reaction kit (Perkin-Elmer) . The sequences were screened for homologies with already published sequences, using the BLASTN and BLASTX computer programs (National Centre for Biotechnology Information, NCBI, Altschul et al . 1997, Nucleic Acids Res. 25: 3389-3402).
(ii) Southern-blot hybridization. In order to verify their specificity, the PCR products obtained, using the PI and P2 primers, from the transformant colonies were labelled by incorporating digoxygenin-11-dUTP (Boehringer, Mannheim) , and used as probes for the Southern-blot analysis of the Dral-digested chromosomal DNA originating from the £7. coli C5, EC0R4 and ECOR15 strains and the £7. coli K-12 strain MG1655.
(iii) Pulsed-field gel electrophoresis and mapping of the clones on the chromosomes of the RS218 and C5 strains. The position of the DNA sequences corresponding to the difference products cloned was determined with respect to the map of £7. coli RS218 (Rode et al. 1999, Infect. Immun. 67:230-236) by probing Southern transfers of pulse-field agarose gels. The DNA of the strain RS218 was digested with Blnl, Wotl and' Xbal , and subjected to pulsed-field gel electrophoresis, as was the DNA of the strain C5, which was digested with Blnl and Wotl. The gels were 1% agarose in 0.5x Tris-borate-EDTA buffer, and they were subjected to electrophoresis at 6 V/cm for 27 h, with pulsed durations varying in a linear manner between 2 and 49 s. The positions on the RS218 chromosome of sequences which are reactive with each of the clones were determined by comparing the Blnl and Wotl recognized restriction fragments with the published macrorestriction map (Rode et al . 1999) . RESULTS
Production of libraries of DNA fragments of the strain C5 of group B2, which are not found in the genome of E. col± of group A. Using the technique of representational difference analysis, we subtracted the chromosomes of two strains of group A (EC0R4 and EC0R15) from the chromosome of the strain of group B2 (strain C5) . Four libraries were produced and named SauE4, SauE15, TspE4 and TspE15, according to the enzyme used to digest the chromosome of the strain C5 and the strain used for the subtraction. In each case, the amplified difference product from the second round of subtraction was labelled and used as probe against the Dral-digested DNA of C5, RS218, EC0R4 and ECOR15. Strong reactivity with the chromosome of the strains of group B2 was observed. In addition, there was little or no signal in the lanes corresponding to t"he subtractive strains (group A) . 494 of the clones obtained were isolated for sequencing. Among them, 140 exhibit significant homology with the sequence of £7. coli K-12, and were eliminated. Among the 354 remaining fragments, 259 sequences (SEQ ID NO: 1 to NO: 153, and NO: 170 to NO: 253) were unique. Table 1 below shows all the clones which exhibit significant homology with genes described previously. TABLE 1. Summary of the BLAST search for the C5+ A- clones which exhibit significant homologies£
SauE15.Cl
SauE15.C6 SauE15.C7
SauE15.C9
SauE15.Cll
SauE15.D4 SauE15.D8
SauE15.E4
SauE15.Ell
SauE15.F3
SauE15.F9 SauE15.F10
SauE15.G3
SauE15.G6 SauE15.H5
SauE15.H7
SauE15.H10 SauE15.Hll
SauE15.I3
SauE15.Ill
SauE15. J7
SauE15. J9 SauE15.M4
SauE15.M8
SauE15.M12
SauE15.N7
SauE4.A2
SauE4.A5
SauE4.B4 SauE4.C7
SauE4.Cll
SauE4.D3 SauE4.E3
SauE4.Ell SauE4. F8
SauE4. F9
SauE4. F12
SauE4.H2
SauE4.I2
TspE4.A5
TspE4.A8
TspE4.A9
TspE4.A10
TspE4.Bl
TspE4.B5
TspE4.B9
TspE4.B12 TspE4.C10
TspE4.D2
TspE4.D4 TspE4.D10
TspE4.Dll
TspE4.E3
TspE4.E4
TspE4.E6
TspE4.E8 TspE4.G7 TspE4.G8
TspE4.Hl
TspE4.H9 TspE4.H10
TspE4.Hll TspE4.I10
TspE4. Jl TspE4. J6
TspE4.K3
TspE4.K8
TspE15.Al
TspE15.Cl
TspE15.C3 TspE15.D7
TspE15.D9
TspE15.E7
TspE15.G12
Some of the clones correspond to genes which are already known to be present in the strain C5, such as pap, hly and kps . Among the 494 clones, none proved to be homologous to sfa or ibel O . Additional rounds of subtraction and/or the sequencing of additional clones make it possible to obtain greater exhaustiveness, until this is complete. In addition, sequences are found which correspond to virulence factors described in strains which, to date, have never been associated with neonatal meningitis: (i) prs, cnf and hra, all of which form part of a PAI (pathogenicity island) in the uropathogenic £7. coli strain J96; (ii) chuA, a gene involved in the iron transport system and found in enterohaemorrhagic £7. coli 0157 :H7; and (iii) senB, a gene encoding an enterotoxin on the virulence plasmid of enteroinvasive £7. coli and Shigella . Finally, 153 of the fragments sequenced (SEQ ID NO: 1 to NO: 153) exhibited no significant homology with any published sequence.
Mapping of the sequences specific for NMEC, on the chromosome of E. col± . The availability of a physical map of the chromosome of £7. coli RS218 (Rode et al . 1999, Infect. Immun. 67: 230-235) made possible the investigation of the distribution of the sequences mentioned above. Of the 64 clones which were chosen for this purpose, 7 exhibit homology with known virulence factors {kps, hly, prs, hra, cnfl , chuA and senB) and 57 exhibit no known homology. These latter clones were chosen randomly from the TspE4 and SauE15 libraries. These two libraries were chosen because they contain most of the genes expected in the B2s (for example, pap, hly and kps) , and were therefore considered to be sufficiently complete to be representative. All the clones exhibited homology, by Southern hybridization, with respect to the chromosome of the strain RS218. The PCR products of these clones were labelled and used to probe Southern transfers of RS218 DNA digested with the Blnl , Not! and Xbal enzymes (enzymes which cleave infrequently) . The location of both sfa and ibel O was determined. In order to evaluate the B2+ A- specificity thereof, each of these clones was used to probe the ral-digested DNA of strains EC0R4, EC0R15 and MG1655: they proved to be nonreactive with respect to these strains.
The mapping of these clones revealed a non-random distribution of the C5+ A- sequences. This distribution is illustrated by Figure 1. In this figure, the upper arrows indicate the six regions which were found in this study and which are represented by NotI and Blnl fragments with a high density of clones specific for C5. The clones exhibiting known homologies are indicated, as are the positions of the sfa and ibel O- probes. Region 6 was divided into two subregions according to the mapping of the clones on various Xbal fragments. The exponents next to the names of the clones indicate the following: 1, TspE4.A7 was positioned by SauE15.F12 overlap; 2, SauE15.F12 also exhibits reactivity on the plasmid; and 3, TspE4.A8 also exhibits weak reactivity on Notl fragment P.
Forty-four of the clones were clustered in six distinct groups on the chromosome. One clone encoding a portion of the α uA gene found in enterohaemorrhagic £7. coli was not associated with any of these clusters and remained isolated. Region 1 is contained in Blnl fragment m (85 kb) . The clones of region 2 were mapped on Notl fragments p and n (~240 kb) . Region 3 is contained in Notl fragment a (~310 kb) and region 4 is contained in Blnl fragment h (210 kb) . Region 5 is located on Blnl fragment j (135 kb) . Region 6, which overlaps Notl fragment b and Blnl fragment b (~550 kb) , was divided into two subregions, regions 6a and 6b, with the Xbal enzyme. This latter subregion contains clones which exhibit homologies with Cnfl, hly, prs and hra . The sfa and ibel O probes hybridize in regions 2 and 6a, respectively. The genes encoding the capsule were not linked to any of these regions. Six clones exhibiting no homology with known sequences and the senB gene were all located on a large plasmid present in the strain RS218.
Distribution of the C5+ A- genomic regions among two collections of E. col± strains. In order to refine the relevance of these regions in terms of diagnosis and pathogenesis, we determined the frequency of the appearance of clones located on regions 1, 3, 4, 5 and 6b, and also the clone containing a portion of the chuA gene, in a collection of 54 £7. coli which were associated with neonatal meningitis and which belong to the phylogenic group B2 (54 NMEC) . We excluded regions 2 and 6a from this study since they contain the sfa and ibel O genes, as it has been possible to establish the distribution thereof among the £7. coli strains of group B2. For each region, two to four clones were used independently to screen Southern transfers of the genomic DNA of NMEC isolates. The control group corresponded to the 15 B2 strains of the ECOR collection; the strains have not, to date, been associated with meningitis. The results are given in Table 2 below. TABLE 2. Strains isolated from cases of neonatal meningitis and from the ECOR collection,
C5 which hybridized with the subtractive clones used as probe
©
©α. ω
H Uα.
C5
a: The prevalence of PAI V was evaluated using clones TspE4.Dll, TspE4.D10 and TspE15.Cl, which are homologous to the hly, cnfl and hra genes, respectively. b: p < 0.05, with respect to the strains of the ECOR collection (the existence of a difference in the distribution of the clones studied was tested using the χ2 test) .
Nonsignificant, with respect to the strains of the ECOR collection.
All the regions mentioned above are widely present among the NMEC, except region 6b which is under- represented in the strains associated with meningitis. In addition, regions 1, 3 and 4 appeared with a frequency which was notably higher (p < 0.05) among the NMEC than among the other B2 strains, thus suggesting that these regions contain DNAs encoding NMEC-specific factors. These regions, their portions and the clones which they contain therefore constitute a source of polynucleotides involved in neonatal meningitis. They can, therefore, be used as active principles in anti- neonatal meningitis vaccines, and allow the development of medicinal products intended to prevent, alleviate or treat neonatal meningitis using products which slow down or block the transcription and/or translation of these polynucleotides, or which slow down or block the activity of the polypeptides which they encode.
DISCUSSION
In these studies, we carried out a subtractive hybridization in order to identify the regions of the chromosome which are capable of encoding phylogenic attributes of interest. We carried out two rounds of subtractive hybridization, choosing to subtract the DNA population of a B2 £7. coli (£7. coli C5 associated with neonatal meningitis) from that of £7. coli strains of group A. We thus obtained libraries of C5+ A- clones containing inserts ranging from 100 to 500 bp long in which the specific NMEC clones could also be identified. Among the 259 clones representative of these libraries, 153 are novel as products (SEQ ID NO: 1 to NO: 153), the other fragments exhibit homology with known products. Among the fragments with homology, some are described for the first time as being of nature C5+ A-; this is in particular the case of chuA (ATCC Accession Number U67920) and of the four clones which appear to correspond to it: TspE4J6 (SEQ ID NO: 241, 221 bp, probability of e"102, score of 375), SauE15.Ill (SEQ ID NO: 195, 162 bp, probability of e"82, score of 305), SauE15.F3 (SEQ ID NO: 185, 422 bp, probability of e"20, score of 98) and TspE15.D9 (SEQ ID NO: 248, 230 bp, probability of e"18, score of 89) . It is also the case of the island PAIV, and of regions 1, 6a and 6b which were identified (cf . examples) . The invention also demonstrates that these DNA fragments are not dispersed randomly on the chromosome of £7. coli , and that there are chromosomal regions of £7. coli which are of nature C5+ A- (regions 1, 2, 3, 4, 5, 6a and 6b) .
The specificity of the subtractive libraries was evaluated (i) by Southern transfer with nonpathogenic strains, and (ii) by sequence analyses which showed 72% of the clones exhibit no homology with the published sequence of £7. coli K-12. Some clones correspond to genes associated with virulence {kps, pap and hly) . On the other hand, we have not isolated any clone corresponding to the sfa and ibel O genes. However, clones derived from regions containing these two genes were obtained. Taken together, these data confirm the complete nature of these libraries.
In addition, while, in the examples described above, 494 clones were isolated for sequencing and made it possible to obtain 259 clones which are different from each other and which exhibit no significant homology with £7. coli K12, it will become clearly apparent to those skilled in the art that the initial number of clones initially isolated and sequenced can, if desired, be considerably increased, for example by starting with 3 000 clones or more instead of 494. Automatic sequencing machines make it possible to easily treat such a number of clones. Increasing the number of clones initially sequenced makes it possible, in particular, to increase the final set of clones which are different from each other and which exhibit no significant homology with £7. coli K12. This makes it possible to increase the level of exhaustiveness of the set obtained. Alternatively, or in combination, it is possible to choose to increase the number of strains from which the DNA population originates (B2 £7. coli ) and/or to increase the number of subtractive libraries prepared (using other restriction enzymes) . In order to increase the level of specificity obtained, it is possible to choose to multiply the subtraction cycles: in the example described above, only two cycles were necessary to obtain said 259 clones, but it is possible to choose to carry out a third or a fourth cycle. It is" also possible to choose to use other restriction enzymes, any enzyme allowing fragments of approximately 100 to 1 500 bp, with a mean of approximately 300 to 500 bp, to be obtained being, a priori , the most suitable (i.e. any restriction enzyme having short restriction sites, for example 4 bp, such as Tsp509I or 5au3AI, and also Mspl or aell) . Advantageously, two to three rounds of subtraction followed by the elimination of those of the clones which exhibit homology with a strain of group A such as £7. coli K12 gives a very good level of specificity. Varying the nature of the restriction enzymes used, in particular between each subtraction cycle, further increases the level of exhaustiveness and/or of specificity of the set obtained. In order to evaluate the level of exhaustive- ness effectively obtained, this evaluation can be carried out by investigating the presence or absence of known markers such as sfa or ibel O .
In the C5+ A- set isolated herein, particularly discriminating DNAs can be identified { cf. Table 2 above) : most of them (more than 80% of them) are in fact present in the £7. coli of group B2 of the ECOR collection at a frequency greater than 40% (clones of regions 1, 4, 5 and 6b) , some of them even being present at a frequency greater than 50%, or even greater than 80% (clones of region 5 and clones corresponding to chuA) , reaching a B2 frequency of 100% for approximately 15% of the set obtained (some clones of region 5 and isolated clone corresponding to chuA) .
Seven C5+ A- regions (regions 1, 2, 3, 4, 5, 6a and 6b) were also identified. An epide iological approach was undertaken in order to study the role of these regions in the infectious process of NMEC (£7. coli which it has been possible to associate with neonatal meningitis) . Given that the majority of NMEC belong to the phylogenic group B2, we determined the prevalence of each region, and also of chuA, among NMEC of group B2, and among the strains of group B2 not associated with meningitis (ECOR collection) . Although small, this control group was chosen since it is composed of reference strains from the ECOR collection, which is considered to be representative of the range of genotypic variations of the species. We used two to four clones of each region and the TspE4.J6 clone (SEQ ID NO: 241) , a homologue of chuA, as probes against Southern transfers of genomic DNA prepared from isolates belonging to the two groups. The presence of this clone among the meningitis isolates indicates that all these regions, except region 6b, are widely represented among NMEC, thus suggesting the involvement of genes encoded by these regions in the pathogenesis of these strains. On the other hand, and surprisingly, region 6b, which resembles PAI V, has a low prevalence in NMEC (17%) but is widely represented in the strains of group B2 of the ECOR collection (47%) . Region 5 has a high prevalence, but it is similar in both collections and may thus correspond to segments which are highly characteristic of the phylogenic group B2 with respect to the group A. Region 5 of £7. coli thus appears to be an advantageous source of DNA fragments present in a great majority (more than 80%) of £7. coli of group B2 of the ECOR collection, and absent from the majority of £7. coli of group A of the ECOR collection. The same distribution was observed for αuA, but interestingly, these genes were present, without exception, in all the strains of group B2 tested. As regards regions 1, 3 and 4, they appear to be clearly more common in NMEC than in the other B2 £7. coli . Given that regions 1, 3 and 4 do not, moreover, contain any known virulence factors, these regions appear to correspond to DNA islands associated with invasion of the meninges by £7. coli in infants and newborn.
It may be noted that, in the prior art, the sequences of regions 1, 3, 4 and 5 were not available, that they had never been described or isolated and that no function was known for them. Regions 1, 3, 4 and 5 therefore appear to be novel. As regards regions 2, 6a and 6b, they comprise DNAs which were known as products, but this is the first description of the existence of such regions and the first description of their nature C5+ A-. EXAMPLE 2: Distribution of the C5+ A- fragments among the ECOR strains
The frequency of presence of the fragments obtained in Example 1 (C5+ A- fragments) among ECOR £7. coli of groups Bl and D was then measured by Southern hybridization as described in Example 1. The results obtained for 14 of them (SEQ ID NO: 56, 116, 43, 51, 141, 130, 45, 50, 52, 119, 127, 125, 55 and 37) are given in Table 3 below.
TABLE 3
Between brackets: Frequency of the clone among the ECOR group under consideration (percentage of strains of this ECOR group in which the clone is present) .
It is observed that the vast majority of the fragments tested (more than 90%) are absent (frequency of presence 0%) from the 16 £7. coli strains of group A which the ECOR collection contains, and that they are all present at a frequency greater than 10% in the
15 £7. coli strains of group B2 which the ECOR collection contains. More than 75% of the fragments are present in the 15 ECOR £7. coli strains of group B2 at a frequency greater than 40%, 50% of them are present therein at a frequency greater than 70% and approximately 35% of them are present therein at a frequency greater than or equal to 80%.
It is also observed that some of these fragments are also present in ECOR £7. coli strains of groups Bl and/or in ECOR £7. coli strains of group D. It is in particular noted that the TspE4.C2 clone
(SEQ ID NO: 119) is present in the ECOR £7. coli of group Bl at a frequency greater than 90%, while at the same time being completely absent from the ECOR £7. coli of group A. The SauE15.I12 clone (SEQ ID NO: 37) is, itself, present with a frequency of 100% in the ECOR
£7. coli of group D and with a frequency of 100% in the
ECOR £7. coli of group B2, while at the same time being completely absent from the ECOR £7. coli of group A and barely present in the ECOR £7. coli of group Bl.
All the fragments tested herein have in common the fact that they are present at a frequency greater than 10% in the ECOR £7. coli of group B2 and at a frequency of less than 25%, particularly less than 10% and notably less than 5% in the ECOR £7. coli of group A.
Since the ECOR collection represents the genetic diversity of the £7. coli species, the various results obtained indicate that the set of DNAs isolated according to the invention constitute, taken alone or in combination, particularly suitable tools for the phylogenic identification of £7. coli strains.
In order to further refine knowledge of the phylogenic distribution of the isolated fragments, the epidemiological study was pursued for several other C5+ A- clones. The results are given in Table 4 below (groups A, Bl, B2 and D, group X including the 4 strains of the ECOR collection which are not assigned to any of these 4 groups) .
TABLE 4
It can be observed that, as for the fragments previously tested, the majority of the fragments are present at a frequency greater than 10% in the £7. coli of group B2 and at a frequency of less than 10% in the £7. coli of group A. It can in particular be noted that the SauE15.C12, SauE15.A12 and SauE15.B2 clones are present at a frequency of 100%, 93.3% and 93.3%, respectively, in the ECOR £7. coli of group B2, and that all three of them are present at a frequency of 0% in the ECOR £7. coli of group A.
One clone, TspE15.G6, is however present at a frequency of 12% in the £7. coli of group A, at a frequency of 53.3% in the ECOR £7. coli of group B2, and at a frequency of 83.3% in the ECOR £7. coli of group D.
Four other clones, namely SauE15.B10, SauE4.A2, SauE15.C7 and TspE4.B9 appear, themselves, not to be present in the ECOR £7. coli of group B2; these clones are, however, present at a frequency greater than 10% in the ECOR £7. coli of group D (41.7%), and at a frequency of less than 10% in the ECOR £7. coli of group A (4%) .
The choice of the £7. coli strain C5 as the strain of group B2 for the subtractive hybridization which enabled the isolation of these fragments { cf. Example 1 above) is probably not unrelated to these results. The £7. coli strain C5, although belonging to the phylogenic group B2, in fact comprises a plasmid some sequences of which are also present in £7. coli of group D (frequency of 41%) .
The choice of such a strain for isolating the set of fragments which are very generally absent from the ECOR £7. coll of group A therefore makes it possible to isolate the entire set of fragments which are present with greater frequency in the ECOR £7. coli of group B2 and/or D, with respect to the ECOR £7. coli of group A. The majority of the fragments tested are, moreover, completely absent from the ECOR £7. coli of group A. When their frequency of presence among the ECOR £7. coli of group A is not zero, it remains low (maximum measured for the fragments of this example = 24%) , and it is always less, by a factor of approximately 3 or 4, than the frequency observed either in the ECOR £7. coli of group B2 or in the ECOR £7. coli of group D, or in each of them.
Since the extra-intestinal pathogenicity of £7. coli is associated with strains of group B2 or D, and not with strains of group A, the fragments isolated in accordance with the invention using a B2/D £7. coli strain such as £7. coli C5 have, in addition to phylogenic diagnostic applications ( cf. Example 5 below) , applications of particular value for preventing, alleviating and combating any extra-intestinal development of £7. coli (systemic and nondiarrhoeal development) .
This being so, it will become clearly apparent to those skilled in the art that the examples reported herein with the £7. coli strain C5 can be carried out in a similar manner with another £7. coli strain of B2/D type, such as RS218, such as £7. coli CFT073, and/or with a D strain.
In conclusion, the results obtained show that the
C5+ A- fragments isolated in accordance with the present invention are present in the ECOR £7. coli of group A at a frequency lower than that which can be observed in the ECOR £7. coli of group B2 and/or in the ECOR £7. coli of group D (= nature B2/D+ A-) .
Their frequency of presence in the ECOR £7. coli of group A is generally zero; if this is not the case, it is lower than that observed in the ECOR £7. coli of group B2 and/or D by a factor which is at the very least 2, preferably 3, more preferably 3.5, and very preferably 4. This B2/D+ A- set according to the invention in particular comprises mostly fragments which are present at a frequency greater than 10% in the ECOR £7. coll of group B2 and/or D, and at a frequency of less than 25%, preferably 20%, more preferably 10%, and even more preferably 5%, in the ECOR £7. coli of group A (with the proviso that this A frequency is always lower than that in the B2s and/or Ds) .
The present invention also provides means which give access to the entire set of these polynucleotides which are of nature B2/D+ A-.
EXAMPLE 3: Example of medical application of the B2/D+ A- fragments isolated (systemic and non- diarrhoeal development of E. col± in an extra- intestinal compartment)
Since the B2/D+A- E. coli are those which are particularly responsible for extra-intestinal infections, we attempted to determine whether some of the B2/D+ A- fragments thus isolated were, in fact, involved in a step essential to the extra-intestinal infectious process of £7. coli, namely survival and multiplication in the blood.
The approach used is that of differential transcriptional analysis (DTA) which consists in revealing the transcripts induced during this step. In order to also determine which characteristics of the serum are responsible for the variation of the level o'f gene expression of £7. coli, the DTA was carried out under the following growth conditions :
- the nutrient broth constitutes the control,
- the bacteraemia phase is compared to the growth of the bacterium in the presence of human serum,
- the iron deficiency which the culturing in serum induces is reproduced using a culture in nutrient broth supplemented with an iron chelator,
- the effect of the complement is studied using growth in the presence of decomplemented serum. Comparison of the transcriptomes obtained under each of these culture conditions makes it possible to reveal the genes specifically involved in the growth in serum, and to produce a functional group of the genes subjected to the same regulatory factor, such as iron content or stress induced by the bactericidal activity of the serum.
MATERIALS AND METHODS
1. Bacterial strains, culture media and subtractive clones
The £7. coli strain C5 (serotype 018:K1:H7) was isolated from the CSF of a newborn. This virulent strain belongs to the phylogenic group B2 and exhibits the following virulent factors: The Kl capsular polysaccharide, S adhesin, IbelO invasin, the type P pilus and haemolysin, but does not produce aerobactin. The- subtractive fragment library was obtained using the C5 strain. The nonpathogenic strains used were the £7. coli K12 strain MG1655 and the strain ECOR15 belonging to the so-called nonpathogenic phylogenic group A, originating from the ECOR reference collection.
The bacterial inocula were prepared from 18 h cultures on tryptocasein-soybean agar (Sanofi Pasteur) , with the colonies being resuspended in sterile water. After measuring the OD, this bacterial suspension, pure or diluted, was used to prepare the inoculum in the various culture media, adding a volume which was always less than 1/1Oth of the final volume.
The bacterial cultures were prepared at 37°C with shaking, using either pure nutrient broth (Sanofi Pasteur) or nutrient broth supplemented with an iron chelator: 2, 2'-dipyridyl (Sigma) at a concentration of 200 μM. The bacterial strains were also cultured in the presence of human serum. Two types of serum were obtained: one consisting of a pool of 4 sera originating from donor blood, the other corresponding to a single donor. The serum was collected after harvesting the blood in a dry tube, spontaneous coagulation for approximately 3 hours at room temperature, and then decanting. The serum was then stored in the form of 2.5 ml aliquots at -80°C. Decomplemented serum was obtained, from the pool, after incubation at 56°C for 30 minutes.
The subcultures of the clones of the B2/D+ A- library were prepared in nutrient broth enriched with ampicillin (50 μg/ml) at 37°C.
2. Study of the bactericidal effect of the serum
In order to evaluate the resistance of the £7. coli strain C5 to the bactericidal effect of the serum, on the one hand, and the intact activity of the complement in the serum, on the other hand, growth curves were produced. The bacterial growth was analysed by taking various inocula of the strains ECOR15, K12 and C5, and carrying out bacterial counts of cultures in pure serum or in decomplemented serum. These counts were taken at times corresponding to O , 3 h, 6 h and 24 h, by plating out pure or serially diluted cultures on Petri dishes using a spiral meter" system.
3. Amplification of the subtractive DNA fragments specific for £7. coli Kl and manufacture of the high density membranes a. PCR of the subtractive DNA fragments
The subtractive DNA fragments cloned into the plasmid pUC19 were amplified by PCR reaction, without DNA extraction, directly from a 1/lOth dilution of an 18 h culture broth of each clone. 5 μl of this bacterial solution were added to the reaction mixture, with a final volume of 50 μl, comprising: IX PCR buffer (10 mM Tris-HCI pH 9; 1.5 mM MgC12; 50 mM KCI; 1% Triton X100; 0.1% gelatin); 2U SuperTaq polymerase (ATCG Biotechnologie) ; 200 μM deoxynucleotide triphosphates; 6 μM specific primers. The primers used were as follows : PI (5'-CATGCCTGCAGGTCGACTCT-3'; SEQ ID NO: 728), P2 (5'-CGTTGTAAAACGACGGCCAG-3' ; SEQ ID NO: 729). The PCR reaction consisted of 30 cycles: 30 s at 95°C (denaturation) , 30 s at 55°C (hybridization) and 30 s at 72°C (elongation) .
b. Preparation of the high density membranes
Membranes comprising the set of specific amplified fragments of the £7. coli strain C5 were manufactured by the company Eurogentec. 180 nl of each of the PCR products were deposited in duplicate, in the form of microspots, onto 6 cm by 11 cm nylon membranes. In order to be able to normalize the signals recorded for each reverse transcript, spots consisting of 4-fold serial dilutions of the chromosomal DNA of the £7. coli strain C5, on the one hand, and of dilutions of the product of a PCR of the 16S rRNA, on the other hand, were deposited. In addition, a negative control corresponding to the PCR product of a subtractive fragment of 17 bp was also deposited. These membranes are conserved sealed, at +4°C.
4. Synthesis of the 33P-labelled reverse transcripts
a. Extraction of the total RNA
In order to prevent degradation of the RNA by RNAses, all the extraction steps were carried out in ice, using gloves and RNAse-free material. The RNA is extracted from a bacterial pellet obtained after centrifugation (4 700 rpm at 4°C for 3 min) of 5 ml of a 4-hour culture (approximately 108 CFU/ml) .
For the purpose of evaluating the influence of the extraction method, two techniques were used to extract the total RNA: the TRIZOL reagent and the BIORAD kit.
- TRIZOL reagent (Gibco BRL) : this reagent consists of a monophasic solution of phenol and of guanidine isothiocyanate, which allows extraction of the total RNA in a single step according to the method developed by Chomczynski and Sacchi. The bacterial cells are lysed by adding the TRIZOL reagent, vortexed for 30 s and subjected to a heat shock (incubation at 65°C followed by freezing at -80°C) . The following steps are those of a conventional phenol-chloroform extraction. Finally, the recovered aqueous phase contains the RNA.
- BIORAD ki t : this kit contains no phenol-based solution. It consists in using a solution which allows the cells to be lysed during an incubation at 65°C for 5 minutes. Next, a solution for precipitating the DNA and the proteins makes it possible to then recover the RNA which is in the aqueous phase.
For these two methods, the subsequent steps of precipitation and solubilization of the RNA are- identical. The RNA is precipitated using isopropanol: the isopropanol is added to the aqueous solution containing the RNA, volume for volume. This mixture is incubated at -20°C for approximately 15 h, and after centrifugation (13 000 rpm at 4°C for 5 min), the RNA is obtained in the form of a pellet. The RNA is then washed with 70% ethanol and precipitated by centrifugation (13 000 rpm at 4°C for 2 min) . Finally, the RNA is solubilized in water in the first case
(TRIZOL) or in a rehydration solution in the second case (BIORAD) . The RNA samples are stored at -80°C.
b. Analysis of the RNA samples obtained
- RNA assay and puri ty The RNA is assayed using a spectophotometer, by measuring the absorbance at 260 nm, it being known that an optical density value at 260 nm corresponds to an RNA concentration of 40 μg/ml. The purity of the sample is estimated by calculating the OD26o nm/OD28o nm ratio: a ratio greater than 1.6 reflects an acceptable purity.
- RNA quali ty
Approximately 0.3 μg of RNA is analysed by 2% agarose gel electrophoresis - 80 volts. The image obtained allows mainly the visualization of the bands corresponding to the 23S rRNA and the 16S rRNA, and a band corresponding to the 5S RNA and to the transfer RNAs (cf. figure), if the RNA preparation obtained is of good quality.
c. Synthesis of the 33P-labelled cDNA probe
The cDNA probe is synthesized using random priming. 10 μg of RNA are mixed together, in a final volume of 50 μl, with 10 μg of random hexamers; dATP, dTTP and dGTP, final concentration of 10 μM (Boehringer Mannheim) ; 40 U of Rnasin ribonuclease inhibitor (Promega) ; IX RT buffer; 10 μg of bovine serum albumin
(Promega) . This reaction mixture is incubated at 50°C for 5 minutes so as to linearize the mRNAs, and then brought back to +4°C in ice. Finally, 100 U of M-MuLV
Reverse Transcriptase (New England Biolabs) and 50 μCi of [α-33P]dCTP (Amersham Pharmacia Biotech) are added. Finally, this reaction mixture is incubated at 37°C for 2 hours .
5. Molecular hybridization - Prehybridization
Initially, the membrane is prehybridized in 5 ml of Church and Gilbert hybridization buffer (0.5 M NaPi, pH 7.2; 1 mM EDTA; 0.7% SDS) for a minimum of 30 minutes at 65°C.
- Hybridization
After having been denatured (5 min at 95°C) , the 33P- labelled cDNA probe is directly added to 5 ml of hybridization buffer. The hybridization reaction is carried out at 65°C for 15 to 18 h.
- Washing
The membrane is first rinsed twice with washing buffer (40 mM NaPi, pH 7.2; 1 mM EDTA; 1% SDS). Then, four washing steps are carried out (30 min at 65°C) .
- Exposure
Once wrapped in Saran (food film-wrap) , the membrane is exposed to a 33P-sensitive screen (Molecular Dynamics) for 48 h to 90 h.
- Dehybridization
The membrane is incubated twice for 15 min at 37 °C in the presence of a solution of 0.2N NaOH and 0.1% SDS, and then it is rinsed with distilled water for 5 min.
6. Analysis of the transcriptomes
- Data acquisi tion The exposed 33P-sensitive screen is scanned using a Phosphorlmager (Storm 840, Molecular Dynamics) with a pixel size of 50 urn, and then the image obtained is analysed using the XdotsReader software (Cose) . This software allows automatic recognition of the spots and is capable of calculating the pixel-density for each of the spots. It also makes it possible to subtract the background noise and to normalize the signal intensity. For each spot, the local background noise was subtracted and the signal intensity was normalized using the signal of the chromosome diluted 400-fold.
Data analysis
First, the disparity between two spots from the same clone was calculated in order to eliminate the spots with a significant disparity, reflecting aberrant signals. Then, the mean of the intensity of the signals- from a same pair was defined, and it is this mean intensity, net (after subtraction of the local background noise) and normalized, which will be considered for the data analysis. The intensity obtained for the 17 bp clone made it possible to define a positivity threshold, the signals lower than this intensity being considered as negative or "undetected".
RESULTS
1. Validation of the model for studying the bactericidal effect of the serum
Abili ty of E. coli C5 to mul tiply in serum. The growth curves produced with an inoculum of 107 CFU/ml for the £7. coli strain C5 responsible for neonatal meningitis, and the nonpathogenic £7. coli strains K12 and ECOR15, show that, in the presence of human serum, £7. coli C5 is capable of surviving and of multiplying, whereas as
£7. coli strain K12 is killed in less than 2 hours and the strain ECOR15 experiences a decrease in growth of more than 2 logs in 2 hours, persisting at a level of between 104 and 105 CFU/ml. At a lower inoculum (103 or 105 CFU/ml) , the strain C5 persists without growing for the first two hours of culturing, and then experiences a growth of 1 log in the 4th hour. The inoculum of 107 CFU/ml was therefore selected for the transcriptome* study.
In decomplemented serum, the growth of the strains K12 and ECOR15 is similar to that of the strain C5, which suggests that the bactericidal effect observed was due to the lytic activity of the complement.
In order to determine whether or not the survival of the strain C5 between 2 h and 6 h of culturing was due to the modification of the complement at 37°C, growth curves for the strain K12, in the presence of serum incubated beforehand at 37°C for 2 h, 4 h and 6 h, were produced. The results of this experiment demonstrate that the serum still possesses its bactericidal activity, even after having been pre-incubated for 6 h at 37°C. 2. Isolation of the RNAs and comparison of the two extraction methods
Since the RNA extraction step is a fundamental step of the DTA, it appeared to us to be necessary to compare two extraction methods in order to determine whether or not the mode of extraction could have an influence on the transcriptome results. Preparations of good quality are characterized by the presence of the 23S, 16S and 5S rRNAs, detected in the form of clear bands by agarose gel electrophoresis. The RNAs extracted with the Biorad kit and the Trizol reagent, from a nutrient broth culture supplemented with dipyridyl, were analysed on 2% agarose gel under nondenaturing conditions. The bands obtained for the 23S and 16S rRNAs, and a band corresponding to the 5S rRNA and to the tRNAs reflect the good quality of the RNA preparations and of the equivalence of the extraction methods . In addition, the detection of RNA of low- molecular weight shows that the extraction method makes it possible to isolate the mRNAs which are small in size, unlike other systems and in particular those using columns which retain only the mRNAs longer than 200 bp. However, a high molecular weight band corresponding to the chromosomal DNA appears clearly on the lane of the RNA obtained using the Biorad kit. The two transcriptomes obtained from these two types of RNA were compared visually (without integrating the signals), and appear to be identical. These results suggest the absence of an influence of the extraction methods and, in particular, of the contaminant chromosomal DNA. The latter point was, moreover, confirmed by carrying out a comparison between a probe obtained using Biorad RNA, with and without DNAse treatment. 3. Differential expression of the transcripts corresponding to the DNA fragments specific for £7. coli Kl responsible for neonatal meningitis
The transcriptomes obtained by hybridization of the reverse transcripts originating from £7. coli C5, under two different culture conditions, on the high density membranes comprising the set of clones of the B2/D+ A- library were analysed with the aid of the Cose software, using the 400-fold dilution of the chromosome as the normalization spot, the signal of which is close to the median of all of the signals. It appears clearly that a certain number of spots are lacking in signals. In order to determine objectively the reverse transcripts which will be considered to be negative or "undetected", the normalized intensity recorded on the 17 bp clone (0.05) was used. In order to be safe, this threshold was doubled and, therefore, any spot with arr intensity of less than 0.1 was considered to be "undetected".
The normalized intensities of the signals obtained for all the clones for which the transcript was detected under at least one of the culturing conditions make it possible to visualize the level of transcription of the set of subtractive clones in the course of the three respective experimental conditions: growth in nutrient broth, in nutrient broth supplemented with 2,2' -dipyridyl (iron chelator) and in serum. It is noted that most of the signals detected have similar intensities whatever the culturing conditions, whereas certain fragments exhibit levels of transcription which vary by a factor of ten according to the culturing conditions. These results therefore suggest good reproducibility of the technique, with a capacity to detect different transcriptional levels.
The transcriptome obtained in nutrient broth was considered to reflect the basal level of transcription of the bacterium in favourable medium. The expression profile obtained under conditions of stress consisting of culturing in serum was analysed with respect to this control transcriptome. In addition, the transcriptomes obtained when culturing in nutrient broth and iron chelator were also prepared, in order to determine the respective roles of iron deficiency and of complement in the serum.
Table 5 below gives the ratios of the intensities of the signals obtained for some of the clones of the B2/D+ A- library under the various experimental conditions, with respect to the control condition represented by the growth in nutrient broth (NB) . Overall, it appears that the transcripts induced in the- serum (ser) are most commonly also induced in the presence of dipyridyl (dip) and of decomplemented serum (DC serum), with the exception of the SauE15.A12, SauE4.E6, SauE4.Cll and TspE15.G6 clones.
It is interesting to note that these four clones have transcripts induced by serum factors other than iron deficiency, since their level of transcription is not modified, or even decreased, in the presence of dipyridyl (cf. Table 5) . The transcription of two of these clones (SauE15.A12 and SauE4.Cll) is not induced in decomplemented serum, and they therefore represent genes which are excellent candidates for complement resistance. TABLE 5
Reproducibility :
In order to verify the reproducibility of the technique and to demonstrate that the differences in transcription level detected are not linked to factors specific to the pool of serum used, a new probe was prepared from a culture of £7. coli C5 in a serum originating from a single donor. The transcriptome obtained with this new probe was compared with the transcriptome produced with the pool consisting of several sera. The straight line of regression obtained, and also the regression coefficient with a value of 0.85, indicate the excellent reproducibility of the technique.
Relationship between the normalized intensi ty and the amount of reverse transcripts :
With the aim of verifying that there is indeed a linearity between the normalized intensity and the amount of reverse transcripts, the intensities obtained for the chromosomal range points consisting of l/100th, l/400th and 1/1 600th dilutions were recorded on a graph. The graph obtained shows the existence of a linear relationship between these values, making it possible to deduce an induction factor directly from the normalized intensities recorded.
CONCLUSION
It appears, therefore, that the DTA technique described herein is a reliable method for selecting, from the B2/D+ A- library obtained from £7. coli C5, DNA fragments the transcription of which is increased in the presence of serum. These fragments make up genes which participate specifically in the systemic and non- diarrhoeal extra-intestinal development of £7. coli in humans and animals. These genes can be isolated using said DNA fragments the transcription of which is increased in the presence of serum, according to conventional techniques of those skilled in the art { cf. Example 6) .
These fragments and the genes which bear them can be used as active principles (in the form of naked DNA placed under the control of a eukaryotic promoter or in the form of DNA transfected into a cell) in a vaccine composition intended to prevent, alleviate or combat the systemic and non-diarrhoeal development of £7. coli in a human or animal extra-intestinal compartment. For this purpose, these fragments and genes can, if desired, be modified, for example so as to produce inactivated isogenic mutants.
The polypeptides which they encode can also be used in such vaccine compositions, in an inactivated and immunogenic form.
These DNA fragments and these genes can also be used as anti-pathogenicity targets (with the objective of preventing the development of £7. coli in an extra- intestinal area, and not in the intra-intestinal area) : they allow the identification of compounds capable of specifically inhibiting their transcription and/or translation, or of compounds capable of inhibiting the activity of the proteins encoded by these genes. Such, compounds can be used as active principles in pharmaceutical compositions (medicinal products) in order to prevent, alleviate or treat the systemic non- diarrhoeal development of £7. coli in a human or animal extra-intestinal compartment.
Among the fragments which are of nature B2/D+ A-, and the transcripts of which are increased in the serum, those which are not present in £7. coli which are agents of infections localized in the intestine, such as £7. coli 0157 :H7, are more particularly preferred.
A systemic non-diarrhoeal development of £7. coli in an extra-intestinal compartment is in particular observed in the context of diseases such as neonatal meningitis, septiceamias, sepsis or pyelonephritis. Such vaccines and pharmaceutical compositions are particularly useful in the context of hospital-acquired infections. Such vaccines are most particularly valuable for the vaccination of women (from adolescent to adult, and more particularly before and during a gestation period) as a prevention against pyelonephritis, in order to avoid contamination of the newborn during birth.
The vaccines and pharmaceutical compositions according to the invention are therefore particularly suitable for such pathologies.
The present invention is therefore aimed towards any polynucleotide capable of being obtained :
- by subtractive hybridization of an £7. coli strain of group B2 or D against one or more £7. coli strains of group A,
- isolation of the substraction DNA fragments, and
- selection of those for which transcription is stimulated in the presence of serum, with respect to a standard nutrient medium.
The DNAs given in Table 5 above are examples of such polynucleotides .
The present application is thus aimed towards :
- any vaccine composition comprising such a polynucleotide, or an inactivated isogenic mutant of such a polynucleotide (inactivation of the possible pathogenic potency) , in particular for preventing, alleviating or combating the development (systemic and non-diarrhoeal) of £7. coli in a human or animal extra-intestinal compartment,
- any pharmaceutical composition comprising a compound capable of inhibiting the transcription and/or translation of such a polynucleotide or mutant, or capable of inhibiting the activity of a polypeptide encoded by such a polynucleotide or mutant.
The present invention is also aimed towards a method for identifying compounds which can be used as active principles in a pharmaceutical composition (medicinal product) intended to prevent, alleviate or combat the development (systemic and non-diarrhoeal) of £7. coli in a human or animal extra-intestinal compartment. This method comprises detecting and selecting compounds capable of inhibiting the transcription and/or translation of said polynucleotides, or capable of inhibiting the activity of the polypeptides which they encode. The present invention is also aimed towards any kit suitable for the implementation of this method, said kits comprising at least one of said polynucleotides or polypeptides. Any transgenic cell and any non-human transgenic animal, into which at least one of said polynucleotides or inactivated isogenic mutant has been transfected, also enter into the domain of the present application. Such cells and transgenic animals are particularly useful for selecting active principles of interest.
EXAMPLE : Production of the sequences of the B2/D+ A- regions
In Example 1, the isolation and the sequence of 259 DNA fragments which are of nature B2/D+ A- (153 of which are novel as products) are described, and means for producing the entire set of these products are provided.
Those skilled in the art will appreciate that, using these DNA fragments, the sequence of each of regions 1, 2, 3, 4, 5, 6a and 6b described, and ORFs which correspond to them, can be identified, isolated and sequenced.
Example 1 and Figure 1 give the sequence of eight (novel) fragments belonging to region 1 (SEQ ID NO: 134, 144, 109, 115, 140, 135, 33 and 56) . For region 2, 5 novel fragments (SEQ ID NO: 125, 123, 116, 43 and 40), and the presence of the sfa gene, are indicated. For region 3, 7 (novel) fragments (SEQ ID NO: 122, 130, 141, 25, 48, 51 and 57) are indicated. For region 4, three (novel) fragments (SEQ ID NO: 121, 44 and 45) are indicated. For region 5, 5 (novel) fragments (SEQ ID NO: 113, 119, 120, 23 and 52) are indicated. For region 6a, eight novel fragments (SEQ ID NO: 127, 133, 27, 34, 36, 42, 46 and 54), and the presence of the known iielO product, are indicated. For region 6b, four (novel) fragments (SEQ ID NO: 55, 38, 128 and 151), and the presence of four known products (SEQ ID NO: 212, 226, 201 and 229), are indicated.
The provision of the sequences allows those skilled in the art to obtain the complete sequence of the corresponding region, according to conventional techniques, and to identify, in these regions, the presence of possible open reading frames (ORFs) .
One of the conventional techniques consists in developing primers for amplifying the desired region, based on the sequences of the fragments of this region, and in carrying out a PCR amplification using various combinations of said primers placed in contact with the polynucleotide population of an £7. coli strain of group B2 or D (which may or not be ECOR) , under conventional PCR conditions. The PCR products which overlap are sequenced on both strands, using the chain termination technique and automated sequencing.
If necessary, the sequence obtained can be extended beyond the limit of the clones available by cloning, for example in lambda DASH-II, partial fragments of approximately 15 kb obtained by restriction carried out on said polynucleotide population. The inserts overlapping the desired region are then identified by hybridization with clones of this region. The inserted DNA is then sequenced from the end of the inserts, and these sequences are used to develop novel primers which" will be used to directly amplify the chromosomal (and non-phage) DNA. Amplification of the chromosomal DNA is then obtained using these novel primers and those of the shorter sequence already obtained. These PCR strands are also sequenced on both strands, which then produces the complete sequence of the desired region.
Alternatively, the sequence of such regions can be obtained by extension of the sequencing of the chromosome of an £7. coli strain of group B2 or D, from points at which a clone which is of nature B2/D+ A- is located (cf. Example 6 below) .
The open reading frames of the sequences obtained for the regions can then be analysed according to conventional techniques for seeking ORFs. A search is carried out in particular for ORFs which begin with ATG or CTG and which have a high codon use index.
EXAMPLE 5: Method and kit for identifying the phylogenic group of an E. col± strain
As indicated in Examples 1, 2 and 3, the polynucleotides which are of nature B2/D+ A- can be used to determine the phylogenic group of any £7. coli strain. They can be used alone, in combination together or in combination with other products, depending on the result and the precision of phylogeny desired. In the event of positive detection, the set of DNAs which are of nature B2/D+ A- makes it possible to eliminate the hypothesis that a strain belongs to the group A. More finely, the presence or absence of the cnuA gene, or of the fragments which are of nature B2/D+ A- (in particular SEQ ID NO: 241, 195, 185 and 248), each make it possible to completely distinguish between group ft or Bl, on the one hand, and group B2 or D, on the other hand. The presence or absence of the TspE4.C2 fragment
(SEQ ID NO: 119) , of the gene which corresponds to it or of the other fragments, of this gene, which are of nature B2/D+ A- make it possible to completely distinguish between group A and group Bl. The presence or absence of the SauE15.I12 fragment (SEQ ID NO: 37), of the gene which corresponds to it or of the other fragments, of this gene, which are of nature B2/D+ A- make it possible to completely distinguish between group B2 or D and group A. The detection of such presences or absences can take place by any means available to those skilled in the art. It can in particular be carried out using said polynucleotides as probes (Southern technique) , or by constructing amplification primers capable of amplifying one of said polynucleotides, so as to carry out a PCR. The construction of probes and of primers can be carried out according to any technique known to those skilled in the art; examples of such constructions are given below. When these polynucleotides are coding polynucleotides, the- detection of the corresponding polypeptides (using antibodies directed against these polypeptides) constitutes a variant of implementation.
Using the teaching given in the present application, those skilled in the art can design the decision tree which corresponds to the level of phylogenic precision desired.
More particularly described here, is an example of a phylogenic identification method which allows a phylogenic precision of at least 99% for £7. coli . This method is based on the PCR detection of two genes (chuA and yjaA) , the sequence of which is known, but which- are of novel nature, and on the detection of the
TspE4.C2 novel DNA fragment (SEQ ID NO: 119). The method was evaluated by testing 220 strains which had already been grouped together using reference methods
(multilocus enzymatic electrophoresis, MLEE and/or ribotyping) . The inventors in fact demonstrated that the known cnuA gene is present in 100% of the ECOR strains of group D, in 100% of the ECOR strains of group B2, and in 0% of the ECOR strains of group A and of the ECOR strains of group Bl. They also demonstrated that the y aA gene, the sequence of which is known, is present in 100% of the ECOR B2 strains and in 0% of the
ECOR strains of group D. The y aA gene was, until then,
only known to be present in the £7. coli strain K12
(group A) , but. had no known function. It was also demonstrated that the TspE4.C2 novel fragment is present in approximately 94% of the ECOR strains of group Bl and in 0% of the ECOR strains of group A. The combination of these three phylogenic markers makes it possible to access a level of effectiveness of distinction between the groups A, Bl, B2, B2 and D, which is greater than 99%. A technical procedure for implementing this combination is also described, which is technically very advantageous: triplex PCR. This novel method is rapid and simple, it can be used directly on a bacterial colony, and it does not require having a reference collection, unlike the techniques of the prior art, and MLEE and ribotyping in particular. The method described therefore represents the first method which may constitute a real clinical tool for routine analyses.
Materials and methods
Bacterial strains. The 72 strains of the ECOR- collection are available from the ATCC. These reference strains, isolated from various hosts and various geographical locations, are representative of the range of genotypic variation of the species. Sixty-eight of these strains belong to the four main phylogenic groups (A, Bl, B2 and D) , and 4 are unclassified.
A set of 86 £7. coli strains having caused neonatal meningitis (NMEC) , 34 £7. coli strains responsible for neonatal septicaemia without meningitis, 30 £7. coli strains isolated from healthy newborns, and the uropathogenic £7. coli strain J96 (04:K6) were also tested. The distribution by phylogenic group of 69 of the 86 NMEC strains has already been described (Binger et al . 1998, J. Infect. Dis. 177:642-650). The other 17 NMEC and the remaining 65 clinical isolates were classified by means of ribotyping as previously described in Binger et al . (ref. above) . The laboratory £7. coli K-12 strain MG1655, which belongs to the phylogenic group A, was also used.
The bacteria were cultured at 37°C on Luria Bertani broth medium or agar. If necessary, ampicillin (100 μg per ml) was used.
PCR amplification. In a first step, the PCR was carried out according to a standard protocol. The reaction was carried out in a volume of 20 μl containing 2 μl of 10X buffer (supplied with Taq polymerase) , 20 pmol of each primer, 2 μM of each dNTP, 2.5 U of Taq polymerase (ATGC Biotechnologie, Noisy-le-Grand, France) and 200 ng of genomic DNA. The PCR was carried out using a Perkin-Elmer GeneAmp 9600 thermal cycling machine, with MicroAm tubes, under the following conditions^ denaturation for 5 minutes at 94°C, 30 cycles of 30 seconds at 94 °C, 30 seconds at 55°C and 30 seconds at 72°C, and a final extension step of 7 minutes at 72°C, using the pairs of primers :
chuA.l (5'-GACGAACCAACGGTCAGGAT-3' ; SEQ ID NO: 160) and ChuA.2 (5'-TGCCGCCAGTACCAAAGACA-3' ; SEQ ID NO: 161), for chuA
yjaA.l (5'-TGAAGTGTCAGGAGACGCTG-3' ; SEQ ID NO: 162) and yjaA.2 (5'-ATGGAGAATGCGTTCCTCAAC-3' ; SEQ ID NO: 163), for yjaA and
TspE4C2.1 (5'-GAGTAATGTCGGGGCATTCA-3' ; SEQ ID NO: 164) and TspE4C2.2 (5' -CGCGCCAACAAAGTATTACG-3' ; SEQ ID NO: 165) for TspE4C2 (SEQ ID NO: 119) . These pairs of primers generate, by amplification, fragments of 279 bp, of 211 bp and of 152 bp, respectively.
According to a simplified protocol, a two-step triplex polymerase reaction was used. The components of the reaction are the same as in the standard protocol, except
(i) the DNA was provided directly by 3 μl of bacterial lysate or a colony fraction,
(ii) the six primers mentioned above were mixed together,
(iii) the PCR steps were as follows: denaturation for 4 minutes at 94°C, 30 cycles of 5 seconds at 94°C and 10 seconds at 59°C, and a final extension step of 5 minutes at 72°C.
Southern transfer. The Southern transfer was carried out by transfer by capillarity onto positively charged nylon membranes. The hybridization was carried out at 65°C in 1% SDS/1M NaCl/50 mM Tris HC1, pH 7.5/1% of blocking agent (Boehringer Mannheim, Mannheim, Germany) . The membranes were washed in 2x SSC for 15 minutes at room temperature; then in 2x SSC/0.1% SDS for 30 minutes at 65°C and finally, in O.lx SSC for 5 minutes at room temperature. The detection of chemiluminescence was carried out according to the manufacturer's instructions (DIG Luminescence Detection Kit for nucleic acids, Boehringer Mannheim) . The probes were produced by PCR according to the manufacturer' s instructions (PCR DIG Probe Synthesis Kit, Boehringer Mannheim) and using the primers and the amplification procedure described above for the standard protocol.
Results
Two hundred and twenty strains were analysed. Their phylogenic groups determined by reference methods are as follows: 43 strains belong to group A, 23 to group Bl, 41 to group D and 113 to group B2.
TABLE 6
PCR amplification of genes chuA et yjuA and of TSPE4.C2 DNA fragment in E.coli strains of various collections, depending on their phylogenic group.
R.M : phylogenic groups assessed by reference methods (MLEE or ribotyping)
MLEE : multilocus enzymatic electrophoresis Herzer et al. 1990, J. Bacteriol. 172:6175-6178
Ribotyping : Binzen et al. 1998, J. Infect. Dis . s177 : 642-650
Table 6 above shows the results obtained with the method according to the invention and the reference methods for the complete set of strains, according to phylogenic group. The chuA gene is present in all the strains belonging to groups B2 and D, and absent from all the strains of groups A and Bl. This makes it possible to effectively separate groups B2/D from groups A/Bl. In the same way, the yjaA gene allows complete distinction between group B2 (100% positive strains) and group D (100% negative strains) . Finally, the novel TspE4.C2 clone is present in all the strains of group Bl except 2, and absent from all the strains of group A. All the PCR results were confirmed by Southern hybridization. The results of these three amplifications made it possible to establish a dichotomous decision tree for the phylogenic grouping. This decision tree is in particular illustrated in Figure 3. Following this tree, 218 of the 220 strains tested (99%) are correctly grouped with respect to the reference methods, while only two strains which are considered to belong to group Bl according to the reference methods are identified as being from group A with the technique according to the invention. Identical results were obtained with the standard and simplified PCR protocols. Figure 4 illustrates the various profiles which were obtained by triplex PCR for the four phylogenic groups. These novel profiles therefore constitute analytical references for the phylogenic grouping of £7. coli .
Discussion
The inventors have developed a PCR method for rapidly determining the phylogenic group of £7. coli strains. Using two genes, chuA and yjaA, and a novel DNA fragment named TspE4.C2 (SEQ ID NO: 119, cf. Example 1) , the phylogenic groups of 220 strains which had previously been assigned to phylogenic groups determined using known methods were determined. The precision of analysis obtained according to the invention exceeds 99% with respect to the grouping established with the methods of the prior art. In addition, the same results were observed with a technically very simple triplex PCR method which is used directly on the bacterial colonies.
The phylogenic characterization of the £7. coli strains, based on a few genotypic or phenotypic characteristics, appeared, in the prior art, to be very difficult. Such genotypic characteristics (presence or absence of a gene, for example) must satisfy various criteria in order to be able to be used for phylogenic characterization. First of all, the gene must have been acquired, or have been deleted, when the group that i- characterizes emerged. Secondly, this same gene must have been "stabilized" so as to exclude any phenomenon of subsequent deletion or horizontal transfer towards bacteria belonging to other phylogenic groups. Finally, recombination phenomena must be very rare in the candidate gene. In other words, the gene product must not be a target for natural selection, which would favour novel genetic recombinations. In the prior art, the attempts to identify characteristics of phylogenic groups based on the phenotype or on the genotype have not been shown to be sufficiently discriminating. For the first time, described herein, is the combined use of two genes and of a novel DNA fragment which make it possible, in a technically very simple way, to determine the phylogenic group with good effectiveness . However, two strains (ECOR 70 and an NMEC) belonging, according to conventional techniques, to the phylogenic group Bl were classified, by the method according to the invention, in group A. This analytical difference may be explained by the fact that there may exist an intermediate genetic base common to these two groups of strains, and by the fact that the regions studied using the method according to the invention { chuA, yjaA and TspE4.C2, are located at 78.7 minutes, 90.8 minutes and approximately at 87 minutes, respectively, on the genome of £7. coli K12) might be closer to the group A than the regions studied using the methods of the prior art. Specifically, it has been demonstrated that the groups A and Bl are sister groups. In addition, recent analyses of multiple chromosomal nucleotide sequences show that ECOR 70 can be considered to be a "hybrid" strain in which a few housekeeping genes have nucleotide sequences which are common with ECOR strains of group A, and in which few other genes haver nucleotide sequences which are common with the ECOR strains of group Bl. The phylogenic membership of ECOR 70 to group Bl is not clearly determined; it is, in any event, considered to be part of group A using the method according to the invention.
In addition to the rapidity of our novel PCR method, the invention has the advantage of not requiring the use of a reference collection such as the ECOR strain or another collection, which means that the analysis can be easily carried out in the laboratory and in particular for routine analyses. In addition, unlike the other methods, the allocation to the phylogenic groups is unequivocal. Specifically, the 4 strains which, until now, were unclassified in the ECOR collection (E31, E37, E42 and E43) can be classified using the method of the invention; the first three strains belong to group D, and the fourth to group A. It can also be noted that all the sequences of the housekeeping genes studied in the latter strain appear to be characteristic of those found in the strains of group A.
In conclusion, this simple and rapid phylogenic grouping technique has many practical uses. The first use is, of course, the bioclinical use, taking into account the link which can be established between the phylogenic group and the possible virulence or dangerousness. The second use corresponds to a biotechnological screening tool which makes it possible to. eliminate potentially pathogenic strains, i.e. strains which are highly dangerous, when candidate strains are being sought for cloning. It was possible to develop such screening tools in the prior art, for example for identifying the £7. coli K12 strains by PCR- or for detecting £7. coli strains which exhibit none of the virulence genes which were, until then, known due to a reverse dot blot procedure. The method described herein has the notable advantage of allowing the identification of nonpathogenic strains other than £7. coli K12, and of being suitable for screening strains on a large scale.
EXAMPLE 6: Production of seventy CFT073+ K12- zones and thirty one RS218+/K12- zones
In the previous examples, the production of a library of clones, which are of nature B2/D+ A-, by substrative hybridization between the £7. coli strain C5 (£7. coli strain belonging to group B2) and £7. coli strains of group A (nonpathogenic £7. coli strains EC0R4 and ECOR15) , is described. The sequence of these clones is given in Figure 2 (SEQ ID NO: 1-153 and 169-253) .
These clones are present with greater frequency in the ECOR £7. coli of group B2 and/or in the ECOR £7. coli of group D with respect to the ECOR £7. coli of group A (preferably 2 times greater, more preferably 3 times greater, even more preferably 3.5 times greater, and very preferably 4 times greater) . They are, in particular, present at a frequency greater than 10% in the ECOR £7. coli of group B2 and/or in the ECOR £7. coli of group D, and at a frequency of less than 25%, preferably less than 20%, more preferably less than 10%, and even more preferably less than 5% in the ECOR £7. coli of group A, the frequency observed in the ECOR £7. coli of group A always remaining less than that observed in the ECOR £7. coli of group B2 and/or in the ECOR £7. coli of group D.
The presence of these clones could, moreover, be verified in various £7. coli strains of group B2 or D which are involved in pathologies which are entirely different from that in which the £7. coli strain C5, which was initially used for isolating these clones, participates. The presence of these clones was, for example, verified in £7. coli CFT073 (£7. coli of group B2 involved in adult pyelonephritis) .
In doing so, it was observed that some of these clones lie, at the level of the chromosome of £7. coli CFT073, within polynucleotide zones which are not found in £7. coli K12 MG1655. CFT073+ K12- zones were thus isolated in the vicinity of said fragments which are of nature B2/C+ A- (isolated from £7. coli C5; cf. Example 1) . An illustration is given herein for seventy of them.
The chromosomal position of each of these seventy CFT073+ K12- zones was then determined ("K12 coordinates" column in Table 8 below) with respect to the chromosomal map of £7. coli K12, which constitutes a reference in the field. These seventy CFT073+ K12- zones were then analysed in order to determine the possible presence of open reading frames (orf/ORF) using programs such as CodonUse™ (£7. coli codon use) and ORF Finder (www:_ncbi .nliti•.nih_gov/gqr;f/ orf._html, bacterial genetic code) . The corresponding protein sequences (ORFs) were extracted compared to various databases in order to search for homology compared to known sequences. The sequences of these 70 zones, and of their ORFs, are represented in Figure 6. For each zone, the protein sequence (ORF) and polynucleotide sequence (orf) of the open reading frame identified is shown, indicating the start and end positions (Pos.) of these. frames with respect to the complete sequence of the zone concerned. Some of these ORFs are encoded by orfs which are, in fact, located on the strand complementary to the indicated sequence of the region (the "frame start" position then has a number which is higher than that of the "frame end" position) .
The same protocol was applied to RS218 £7. coli clones. RS218+/K12- zones have also been isolated in the neiboughring of the fragments with a B2/D+ A- nature.
Table 7 below summarizes the SEQ ID NOS assigned to each of the seventy CFT073+ K12- zones and to their ORFs and orfs, and each thirty one RS218+/K12- zones. TABLE 7
822 (824/823) (826/825) (828/8271 88 1425 (1427/1426) (1429/1428) (1431/1430)
(1433/1432) (1435/1434) (1437/1436)
(1439/1438) (1441/1440) (1443/1442)
(1445/1444) (1447/1446) (1449/1448)
(1451/1450) (1453/1452) .
89 1454 (1456/1455) (1458/1457) (1460/1459)
(1162/1461) (1464/1463) (1466/1465)
90 1467 (1469/1468)
91 1470 (1472/1471) (1474/1473)
92 1475 (1477/1476) (1479/1478) (1481/1480)
93 1482 (1484/1483) (1486/1485) (1488/1487)
(1490/1489) (1492/1491) (1494/1493)
(1496/1495)
94 1497 (1499/1498) (1501/1500) (1503/1502)
95 1504 (1506/1505) (1508/1507) (1510/1509)
(1512/1511) (1514/1513) (1516/1515)
96 1517 (1519/1518) (1521/1520) (1523/1522)
97 1524
98 1525 (1527/1526) (1529/1528) (1531/1530) (1533/1532)
99 1534 (1536/1535) (1538/1537) (1540/1539)
100 1541 (1543/1542) (1545/1544) (1547/1546)
(1549/1548) (1551/1550) (1553/1552)
(1555/1554) (1557/1556) (1559/1558)
(1561/1560)
101 1562 (1564/1563) (1566/1565) (1568/1567)
Tables 8 and 9 below summarizes the results obtained on these SEQ IDs; the following are indicated :
- Table 8 ; - the SEQ ID NOs and the names of the clones having allowed the identification of each CFT073+ K12- zone and RS218+/K12- zone, the number which' was assigned to this zone,, the coordinates of each zone with respect to the chromosomal map of E. coli K12, the respective size of each zone, the number of the fragment, the number of the region within which the said zone is located (regions 1, 2, 3, 4, 5, 6a or 6b as identified in Examples 1 and 3 above; or, if the zone is not within a region, indicated in this column is the name of the £7. coli RS218 fragment at the level of which this zone is located, cf. Figure 1 for the £7. coli RS218 fragment names),
- Table 9 : - the homology with £7. coli RS218, £7. coli 0157:H7, £7. coli CFT073.
TABLE 8
Table9
Table9
Tableθ
Tableθ
Tableθ
Apart from zone 24 and its orfs/ORFs which, correspond to known products which are of known nature B2+ A-, none of the sequences identified herein is strictly identical to a sequence of the prior art (comparison of the nucleotide series over their entire length, independently of the function of the process) . However, when comparing the complete sequence of these CFT073+ K12- zones and RS218+K12- zones with those indicated for the E. coli strain 0157 :H7 (strain of phylogenic group not clearly determined, responsible for infections located in the intestine) , some homologies are found as it appears from table 9 above.
With regard now to the ORFs which were identified on each of said zones, the percentage identity (% id) and percentage similarity (% sim) obtained at the polypeptide level by comparison on databases are given in Figure 6.
This being so, for each of the CFT073+ K12- zones isolated herein, and for each of the orfs and ORFs identified herein (except zone 24 and its orfs/ORFs) as well as for each of the isolated RS218+/K12- zones, it is the first description of a nature CFT073+ K12- or RS218+/K12- respectively. These zones, orfs and ORFs therefore have, at the very least, applications in the domain of phylogeny (diagnostic applications) since these products make it possible to distinguish between two E. coli strains with very different pathogenicities. Anti-CFT073 or anti-RS218 vaccine applications might also be envisaged.
In addition, since the CFT073+ K12- and RS218+/K12- zones isolated herein all comprise, in the vicinity of their chromosome, at least one DNA fragment which is of nature B2/D+ A- as defined above (ECOR B2 frequency and/or ECOR D frequency higher than ECOR A frequency SEQ ID NO 1 to NO. 550, the zones and orfs isolated herein are excellent candidates as polynucleotides which are of nature B2/D+ A-. Those of the CFT073+ K12- ORFs and orfs which comprise in their sequence a clone which is of nature B2/D+ A- as defined in Example 1 above (SEQ ID NO 1 to NO 253) have an even higher probability of being products which are of nature B2/D+ A-.
Verifying that the CFT073+ K12- or RS218+/K12- zones, ORFs and orfs isolated herein are effectively of nature B2/D+ A- can be easily carried out by those skilled in the art according to techniques which are conventional in the domain of phylogeny. One way of making sure may, for example, consists in applying to at least one other E. coli strain of group B2 or D, for example E. coli C5, the procedure which has been described herein for the E. coli strain CFT073 and RS218, in such ' a way as to identify and isolate the zones, ORFs and orfs which are present in E. coli C5 and absent in E. coli K12 (or another E. coli strain of group A) . The sequences of these C5+ K12- zones, ORFs and orfs are then compared with those of the CFT073+ K12- and RS218+ K12- groups described herein, while searching for homologous sequences and sequence fragments. This comparison can, for example, be carried out using the BLAST program (National Centre for Biotechnology Information, NCBI, Altschul et al . 1997, Nucleic Acids Res. 25: 3389- 3402), comparing each sequence of the C5+ K12- group with those of the CFT073+ K12- (or RS218+ K12-) group. The polynucleotide sequences, or fragments of polynucleotide sequence, with a low probability of the significant homology (for example, identity of 80% or more) being linked by chance can then be selected as being of nature B2/D+ A-, as defined above. If desired, the process can be repeated on a fourth or fifth E. coli strain of group B2 or D.
Such B2/D+ A- products are particularly useful for the phylogenic determination of E. coli .
The polynucleotides which are of nature B2/D+ A- and their inactivated isogenic mutants can be used, in the form of isolated DNAs (placed under the control of a eukaryotic promoter or in the ' form of DNA transfected into a cell) , as active principles in a vaccine composition intended to prevent, alleviate or combat an undesirable development of E. coli, and in particular a development of E. coli in an extra-intestinal compartment. The ORFs themselves, administered in an optionally inactivated and immunogenic form, are active principles which are very promising for the development of a vaccine composition intended to prevent, alleviate or combat an undesirable development of E. coli , and in particular a development of E. coli in an extra- intestinal compartment.
The polynucleotides and polypeptides which are of nature B2/D+ A- can also be used as anti-pathogenicity targets (with extra-intestinal and non intra-intestinal targeting) / they make it possible to identify and isolate capable active principles which can be used in pharmaceutical compositions (medicinal products in particular) intended to inhibit the growth of an E. coli bacterium and, notably, its extra-intestinal development, by selecting, from candidate active principles, those which inhibit or block the correct transcription and/or translation of these zones or orfs, or which inhibit or block the activity of the ORFs.
The present application is aimed towards each of these CFT073+ K12- and RS218+/K12- zones, numbers 1 to 23, 25 to 48, and 49 to 101, also their ORFs and orfs, as products. It is also aimed towards any vaccine or pharmaceutical composition intended to prevent, alleviate or combat an undesirable development of E. coli, and in particular an extra-intestinal development
■ of E. coli , which comprises at least one of said zones or at least one of said orfs and ORFs.
Particularly targeted are the zones, ORFs and orfs which are both CFT073+ K12- and RS218+ K12-. When the applications targeted concern a development of E. coli in an extra-intestinal compartment (systemic and non- diarrhoeal E. coli infection) , those of said CFT073+ K12-, RS218+ K12- zones, orfs and ORFs which are also 0157 :H7- are then preferred.
The present application is thus directed towards any isolated polynucleotide the sequence of which can be obtained by:
i. isolating a set of polynucleotide sequences of a strain of group B2 or D (such as E. coli RS218, E. coli CFT073 or E. coli C5) by :
- locating, on the chromosome of this E. coli strain of group B2, sequences which exhibit, with the clones of SEQ ■ ID NO 1-153 and 170-253, homology such that the probability of this homology being due to chance is very low, for example an homology above 80% of identity and - sequencing the chromosome of this E. coli strain of group B2 or D in the 5' and 3' directions, using each of the homologous sequences located, and stopping the sequencing as soon as a sequence is • reached which exhibits significant homology above 80% of identity with a sequence contained in the chromosome of an E. coli strain of group A, such as E. coli K12,
ii. repeating the operation indicated in i. above on at . least one other E. coli strain of group B2 or D, then
iii. comparing the sequences obtained for each E. coli of group B2 or D tested in such as way as to search for the sequences and sequence fragments which are homologous, among the various E. coli strains of group B2 or D tested (for example, using a program such as BLAST) .
iv. selecting and isolating the sequences and sequence fragments which exhibit significant homology above 80% of identity, of the homology measured between a
" sequence or sequence fragment derived from the set obtained in i. and a sequence or sequence fragment obtained in ii. being due to chance) .
The present application is also aimed towards the possible orfs identifiable on such a polynucleotide, and the ORFs and polypeptides encoded by these orfs .
The present application is also aimed towards the phylogenic, diagnostic and therapeutic applications and uses of these products, as indicated above. Example 7: Therapeutic application of the novel B2/D+ A- products
The B2/D+ A- DNAs isolated according to the invention are useful as active principles in the context of a vaccine composition intended to prevent, alleviate or combat an undesirable development of E. coli , and in particular a development of E. coli in an extra- intestinal compartment (systemic and non-diarrhoeal E. coli infections) . They can then be used in an isolated ("naked" DNA) form or in the form of DNA transfected into a cell chosen for its physiological innocuity and its capacity to secrete the polypeptide encoded by the transfected B2/D+ A- fragment.
The polypeptides encoded by the DNAs which are of nature B2/D+ A- can also be used as active principles in the context of such vaccine compositions. They are then used in an inactivated and immunogenic form.
Examples of extra-intestinal development of E. coli comprise, in particular, septicaemias, pyelonephritis, and meningeal infections in newborns. Recognized or potential hospital-acquired infections are most commonly the product of such extra-intestinal developments of E. coli .
For extra-intestinal applications, those of the DNAs and of the polypeptides which are of nature B2/D+ A- and which are not present in the E. coli strains responsible for infections located in the intestine, such -as E. coli 0157 :H7, and/or the transcripts of which are increased in the serum by comparison with a culture on standard nutrient medium ( cf. Example 3 above) , are more particularly preferred. Region 5 ( cf. in particular previous examples) appears to be an • advantageous source for manufacturing broad- spectrum anti-B2-coli products. As regards regions 1, 3 and 4, they appear to be advantageous sources for manufacturing products against septicaemia with meningitis and in particular against meningitis in newborns and in adults.
The invention is aimed in particular towards the compositions, in particular ' • the vaccine and pharmaceutical compositions, comprising at least one of said novel DNAs or at least one of said novel ORFs or at least one polypeptide the sequence of which can be considered to correspond, according to the universal genetic code and taking into account the degeneracy of this code, to one of these DNAs or ORFs. It is also aimed towards the use of the known DNAs which are of novel nature B2/D+ A-, such as chuA, the chuA fragments which are of nature B2/D+ A-, and of the polypeptides corresponding to these DNAs, for manufacturing such compositions .
Alternatively, the DNAs or polypeptides which are of nature B2/D+ A- can be used for screening chemical and/or biological libraries in such a way as to identify compounds capable of binding to them and of blocking an undesirable development of E. coli bacteria
(by blocking the correct transcription and/or translation of these DNAs, or by blocking the activity of the polypeptides which they encode) .
The formulation and the dose of such compositions can be developed and adjusted by those skilled in the art as a function of the medical indication targeted, of the method of administration desired, and of the patient under consideration (age, weight, sex, condition) .
These compositions can also comprise- one or more physiologically inert vehicles, and in particular any excipient suitable for the pharmaceutical formulation and/or for the method of administration desired
(tablet, patch, gelatin capsule, powder, spray, drinkable solution, injectable solution, colloid) . They can also comprise one or more 'active co-agent (s) in order to modify the intensity of activity of the B2/D+ A- compound according to the invention, or to modify its system of activity, or alternatively to modify the targeting of its activity. They can also comprise other agents useful for preventing and/or alleviating and/or treating an infection, without these agents having any interaction with the B2/D+ A- compounds according to the invention.
Preferably, the polynucleotides and polypeptides which are of nature B2/D+ A- according to the invention are used for identifying, for example by screening chemical and/or biological libraries, compounds capable of inhibiting, in vivo or under conditions mimicking as closely as possible the in vivo state, the activity of a protein the ORF of which comprises a polynucleotide which is of nature B2/D+ A- according to the invention. Such compounds can, in particular, be used in compositions, in particular pharmaceutical compositions, intended to inhibit the growth of E. coli , and in particular its extra-intestinal growth. A subject of the present invention is such an identification method, the compounds as obtained by such a method, and such compositions. Exemple 8 :Animal models for studying E. col± virulence
1. Newborn rats (Sprague Dawley - January breeding) after 24 hours of acclimatisation in animal houses were infected when 5 days old. The injected inoculum was prepared from dilutions in physiological serum in a nutritive culture medium of 2 h. The animals were infected by intra-peritoneal route after anesthesia with ether, then put back to their mother after- randomization by brood of ten.
The numeration of the bacteraemia at 18h was obtained by taking of 5 μl otf blood after incision of the tail.
Bacteraemia at H18 (%)of 4 days old rats after intra peritoneal injection of various strains of E. coli .
* : ND not done
2. Mortality of adult mice (%) at D7 after intra peritoneal injection of 3 strains of E. coli . The inoculum was prepared as above-mentioned and the injection made by the intra peritoneal route.
It appears from said results that the E. coli B2 responsible for neonatal infection ' (C5 strain) are capable inducing a bacteraemia after injection of a weak inoculum in the newborn rat and that the mortality in mice is 10 times higher. With CFT073, i.e. the E. coli B2 responsible for sepsis in the adult, it is necessary to use a stronger inoculum. But said coli is still pathogenic in mice. An E. coli of group A is not .pathogenic even in mice.
'With the claimed sequences, it is then possible to generate mutants whose virulence can be tested on said animal models.

Claims

1. A library of DNA fragments of E. coli strains consisting polynucleotides having a nature B2/D+A~.
2. The library according to claim 1, selected from the group comprising the E. coli C5 + A" library, the E. coli CFT073+ K12- library, the E. coli RS218+ K12- library, or the E. coli CFT073+ K12- and RS218+ K12- library.
3. The library according to claim 1 or 2, which is devoid of 0157 :H7- polynucleotides.
4. A method for isolating from E. coli strain (s) the entire set of the chromosomal and plasmid polynucleotides having a nature B2/D+ A", comprising subtracting the polynucleotide population of one or more E. coli strain (s) of group A, randomly sheared, from the polynucleotide population of one or more E. coli strain (s) of group B2 or D, cleaved with a restriction endonuclease allowing fragments comprising from 100 to 1 500 bp approximately, with a mean of 300 to 500 bp approximately, to be produced, said subtraction being iteratively repeated, if desired.
5. The method of claim 4, wherein the E. coli strain submitted to said substraction is an E. coli C5 strain or an E. coli RS218 stain, and the E. coli strains of group A are of the ECOR collection.
6. Isolated polynucleotides having a nature B2/D+ A~, with a sequence selected from the group comprising :
- the sequences SEQ ID N°l to 153, the polynucleotide sequences obtainable by digesting the total DNA of an E. coli of group B2 or group D with a restriction enzyme, selecting those of the obtained fragments which hybridize with at least one sequence selected in the group consisting of SEQ ID N° 134, 144, 109, 115, 140, 135, 33, 56, 122, 130, 141, 25, 48, 51, 57, 121,44, 45, 113, 119, 120, 23, 52, and
- the sequences of the orfs containing one of said sequences,
the nature-conserving variant or fractional sequences of said sequences.
7. The polynucleotides of claim 6, the transcription of which is increased in the presence of human or animal serum and the genes bearing such polynucleotides.
8. Pair of primers allowing the amplification of a polynucleotide according to claim 6 or 7.
9. Pair of primers according to claim 8, corresponding to SEQ ID N° 154 and 165.
10. Probes which are elaborated from a polynucleotide according to claim 6 or 7.
11. The use as a probe of a polynucleotide according to claim 6 or 7, such as obtained by amplification with a pair of primers selected from the group comprising SEQ ID N° 160/161/ SEQ ID N°162,163/ SEQ ID N°164,165.
12. The polynucleotides having a sequence corresponding to the antisens of the polynucleotides according to claim 6 or 7.
13. A combination of a polypeptide .encoded by a polynucleotide according to claim 6 or 7 with a ligand which is capable of inhibiting a biological function of E. coli, such as the extra-intestinal growth, said ligand being selected from the group comprising serum, purified serum, antibodies and monoclonal antibodies.
14. Vector comprising at least a polynucleotide according to claim 6, 7 or 11 and cells tranfected by such a vector.
15. The method for the phylogenic determination of an E. coli strain comprising the use of at least a polynucleotide isolated from the library according to anyone of claims 1 to 3.
16. The method of claim 15 wherein said polynucleotide is selected from the group comprising : a polynucleotide according to claim 6 or 7, or a polynucleotide selected from the group comprising :
- the polynucleotides corresponding to the chuA gene and to its operon,
- the polynucleotides the sequence of which corresponds to SEQ ID NO: 170, 171, 174, 175, 178, 179, 183,
185, 186, 190, 191, 193, 194, 195, 196, 199, 200,
202, 205, 206, 208, 209, 211, 214, 218, 220, 221,
233, 234, 235, 241, 244, 246, 247, 248 and 250, - the polynucleotides which can be obtained by digesting the total DNA of an E. coli of group B2 or D with a restriction enzyme chosen from the group consisting of Wotl and Blnl , and selecting those of the fragments obtained which hybridize to at least one sequence chosen from the group consisting of SEQ ID NO: 125, 123, 116, 43, 40, 127, 133, 27, 34, 36, 42, 46, 54, 55, 38, 128 and 151, with the exception of the polynucleotides sfa , hly; cnfl , pap/prs, hra and ibe 10,
- the polynucleotides the sequence of which corresponds to an orf comprising the sequence of one of the polynucleotides cited in the preceding three paragraphs,
- the polynucleotides the sequence of which corresponds to a nature-conserving variant or fractional sequence of the sequence of at least one of these polynucleotides (and in particular SEQ ID NO: 241, 195, 185 and 248) ,
- the pair of primers, the probes and the antisense polynucleotides, the sequence of which is as derived from these polynucleotides,
- the binding products which are capable of binding to a polypeptide encoded by a polynucleotide which is of nature B2/D+ A-,
17. The method of claim 15 or 16, comprising the detection of yjaA.
18. The method of claim 17, wherein yjaA is detected with a probe, a pair of primers such as SEQ ID N°162, 163, or the encoded protein is detected by a specific antibody .
19. A pharmaceutical composition, comprising an effective amount of a polynucleotide of ,the library of anyone of claims 1 to 3, or the encoded proteins in association with an inert carrier.
20. The pharmaceutical composition of claim 19, wherein said polynucleotide is according to claim 6 or 7, a' vector or a cell according to claim 14 or an antisens according to claim 12.
21. The pharmaceutical composition of claim 20, wherein said polynucleotide (s) is (are) selected in the group comprising polynucleotides of SEQ ID NO: 71, 114, 13, 77, 8, 36, 120 and 130.
22. The pharmaceutical composition of anyone of claim 19 to 21, for treating and/or palliating and/or preventing extra-intestinal E. coli infections.
23. The use for making drugs for preventing or treating, E. coli extra-intestinal infections of at least one polynucleotide selected from the group comprising :
- the polynucleotides corresponding to the chuA gene and to its operon,
- the polynucleotides the sequence of which corresponds to SEQ ID NO: 170, 171, 174, 175, 178, 179, 183, 185, 186, 190, 191, 193, 194, 195, 196, 199, 200, 202, 205, 206, 208, 209, 211, 214, 218, 220, 221, 233, 234, 235, 241, 244, 246, 247, 248 and 250, - the polynucleotides which can be obtained by digesting the total DNA of an E. coli of group B2 or D with a restriction enzyme chosen from the group consisting of iVotl and Blnl , and selecting those of the fragments obtained which hybridize to at least one sequence chosen from the group consisting of SEQ ID NO: 125, 123, 116, 43, 40, 127, 133, 27, 34, 36, 42, 46, 54, 55, 38, 128 and 151,' with the exception of the polynucleotides sfa, hly, cnfl , pap/prs, hra and ibe 10,
- the polynucleotides the sequence of which corresponds to an orf comprising the sequence of one of the polynucleotides cited in the preceding three paragraphs,
- the polynucleotides the sequence of which corresponds to a nature-conserving variant or fractional sequence of the sequence of at least one of these polynucleotides (and in particular SEQ ID NO: 241, 195, 185 and 248) ,
- the pair of primers, the probes and the antisense polynucleotides, the sequence of which is as derived from these polynucleotides,
- the peptides encoded by said polynucleotides,
- the binding products which are capable of binding to a polypeptide encoded by a polynucleotide which is of novel nature B2/D+ A- .
24. The use according to claim 23, wherein said polynucleotide (s) is (are) selected in the group comprising the polynucleotides pf SEQ ID NO 174, 202, ■178, 221, 206, 175, 196, 170.
25. The pharmaceutical composition of anyone of claims 19 to 22 or the drug obtained according to claim 23 or
24, for treating septicaemia, pyelonephritis, and meningeal infections in newborns.
26. Use of the polynucleotides of the library of anyone of claims 1 to 3, or antisens thereof for the' diagnostic of the presence or the absence of undesired E. coli such as contaminants E. coli or extra-intestinal E. coli . in a sample.
27. The use according to claim 26, wherein said DNA polynucleotide is according to claim 6 or 7, or is in a vector or a cell according to claim 14.
28. The use according to claim 27, wherein said polynucleotide is selected from the group comprising :
- the polynucleotides corresponding to the chuA gene and to its operon,
- the polynucleotides the sequence of which corresponds to SEQ ID NO: 170, 171, 174, 175, 178, 179, 183, 185, 186, 190, 191, 193, 194, 195, 196, 199, 200, 202, 205, 206, 208, 209, 211, 214, 218, 220, 221, 233, 234, 235, 241, 244, 246, 247, 248 and 250,
- the polynucleotides which can be obtained by digesting the total DNA of an E. coli of group B2 or D with a restriction enzyme chosen from the group consisting of Wotl and Blnl, and selecting those of the fragments obtained which hybridize to at least one sequence chosen from the group consisting of SEQ ID NO: 125, 123, 116, 43, 40, 127, 133, 27, 34, 36, 42, 46, 54, 55, 38, 128 and 151, with the exception of the polynucleotides sfa, hly, cnfl , pap/prs, hra and ibe 10,
- the polynucleotides the sequence of which corresponds to an orf comprising the sequence of one of the
.polynucleotides cited in the preceding three paragraphs,
- the polynucleotides the sequence of which corresponds to a nature-conserving variant or fractional sequence of the sequence of at least one of these polynucleotides (and in particular SEQ ID NO: 241, 195, 185 and 248) ,
- the pair of primers, the probes and the antisense polynucleotides, the sequence of which is as derived from these polynucleotides,
- the binding products which are capable of binding to a polypeptide encoded by a polynucleotide which is of novel nature B2/D+ A- .
29. A method for identifying a compound capable of inhibiting the extra-intestinal growth of E. coli, which comprises the detection of at least one compound :
i. capable of binding to at least one polynucleotide chosen from the group consisting of :
- novel polynucleotides which are of novel nature B2/D+ A-, i.e. • the sequences SEQ ID NO: 1 to NO: 153,
• the polynucleotide sequences which can be obtained by digesting the total DNA of an E. coli of group B2 or group D (such as, for example, an 'E. coli strain isolated from the blood of a patient suffering from a septicaemic or meningeal infection) with a restriction enzyme chosen from the group consisting of Wotl and Blnl, selecting those of the fragments obtained which hybridize with at least one sequence chosen from the group consisting of SEQ ID NO: 134, 144, 109, 115, 140, 135, 33, ' 56, 122, 130, 141, 25, 48, 51, 57, 121, 44, 45, 113, 119, 120, 23, 52 and sequencing selected fragments, and
• the sequences of the orfs (open reading frames) which contain one of these sequences,
• the nature-conserving variant or fractional sequences of these sequences,
- the polynucleotides corresponding to the chuA gene and to its operon,
- the polynucleotides the sequence of which corresponds to SEQ ID NO: 170, 171, 174, 175, 178, 179, 183, 185, 186, 190, 191, 193, 194, 195, 196, 199, 200, 202, 205, 206, 208, 209, 211, 214, 218, 220, 221, 233, 234, 235, 241, 244, 246, 247, 248 and 250,
- the polynucleotides which are as obtained by digesting the total DNA of an E. coli of group B2 or D with a restriction enzyme chosen from the group consisting of Wotl and Blnl, and selecting those of the fragments obtained which hybridize to at least one sequence chosen from the group consisting of SEQ ID NO: 125, 123, 116, 43, 40, 127, 133, 27, 34, 36, 42, 46, 54, 55, 38, 128 and 151, with the exception of the polynucleotides sfa , hly, cnfl,, pap/prs, hra and ibe 10,
• the polynucleotides the sequence of which corresponds to an orf comprising the sequence of one of the polynucleotides cited in the preceding four paragraphs,
• the polynucleotides the sequence of which corresponds to a nature-conserving variant or fractional sequence of the sequence of at least one of these polynucleotides and in particular (SEQ ID NO: 241, 195, 185 and 248), and
ii. capable of specifically inhibiting the correct transcription and/or translation of this polynucleotide .
30. A method for identifying a compound capable of inhibiting the extra-intestinal growth of E. coli, which comprises the detection of at least one compound capable of inhibiting the activity of a protein the orf of which comprises a polynucleotide such as used in the method of claim 29.
31. Kits for implementing a method according to anyone of claims 15 to 18, or the use according to claim 26 to 28 comprising at least a polynucleotide of the library according to anyone of claims 1 to 3.
32. The kits of claim 31, comprising at least one of the pairs of primers (SEQ ID NO: 160, 161), (SEQ ID NO: 162, 163) and (SEQ ID NO: 164, 165) .
33. A vaccinal composition for preventing, alleviating or combatting an extra-intestinal development of E. coli, comprising an immunogenic polynucleotide such as isolated from the library according to anyone of claims 1 to 3 or an inactivated isogenic mutant thereof as active principle, or the corresponding ORFS in' association with an inert carrier-..
34. The vaccinal composition of claim 33, wherein said polynucleotide is polynucleotide selected from the group comprising :
- the novel polynucleotides which are of novel nature B2/D+ A- according to the invention, i.e.
• the sequences SEQ ID NO: 1 to NO: 153,
• the polynucleotide sequences which can be obtained by digesting the total DNA of an E. coli of group B2 or group D (such as, for example, an E. coli strain isolated from the blood of a patient suffering from a septicaemic or meningeal infection) with a restriction enzyme chosen from the group consisting of Wotl and Blnl, selecting those of the fragments obtained which hybridize with at least one sequence chosen from the group consisting of SEQ ID NO: 134, 144, 109, 115, 140, 135, 33, 56, 122, 130, 141, 25, 48, .51, 57, 121, 44, 45, 113, 119, 120, 23, 52 and sequencing selected fragments, and • the sequences of the orfs (open reading frames) which contain one of these sequences,
• the nature-conserving variant or fractional sequences of these sequences.
- the polynucleotides corresponding to the chuA gene and to its operon,
- the polynucleotides the sequence of which corresponds
' to SEQ ID NO: 170, 171, 174, 175, 178, 179, 183,
185, 186, 190, 191, 193, 194, 195, 196, 199, 200,
202, 205, 206, 208, 209, 211, 214, 218, 220, 221,
233, 234, 235, 241, 244, 246, 247, 248 and 250,
- the polynucleotides which are as obtained by digesting the total DNA of an E. coli of group B2 or D with a restriction enzyme chosen from the group consisting of Wotl and Blnl, and selecting those of the fragments obtained which hybridize to at least one sequence chosen from the group consisting of SEQ ID NO: 125, 123, 116, 43, 40, 127, 133, 27, 34, 36, 42, 46, 54, 55, 38, 128 and 151, with the exception of the polynucleotides sfa , hly, cnfl , pap/prs, hra and ibe 10,
- the polynucleotides the sequence of which corresponds to an orf (open reading frame) comprising the sequence of one of these polynucleotides,
the polynucleotides the sequence of which corresponds to a nature-conserving variant or fractional sequence of the sequence of at least one of these polynucleotides .
35. The vaccinal composition of claim 34, wherein said polynucleotide is selected from the group comprising SEQ ID NO 71, 114, 13, 77, 8, 36, 120, 130, 174, 202, 178, 221, 206, 175, 196, 170.
EP01917119A 2000-03-10 2001-03-12 Polynucleotides isolated from e. coli of nature b2/d+ a-, and uses thereof Ceased EP1328641A2 (en)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
FR0003145A FR2806096A1 (en) 2000-03-10 2000-03-10 A library of DNA fragments of Escherichia coli strains for the phylogenic determination of a given strain comprises polynucleotides of nature B2/D+ A-
FR0003145 2000-03-10
FR0101449 2001-02-02
FR0101449 2001-02-02
PCT/EP2001/003445 WO2001066572A2 (en) 2000-03-10 2001-03-12 Polynucleotides isolated from e. coli of nature b2/d+ a-, and uses thereof

Publications (1)

Publication Number Publication Date
EP1328641A2 true EP1328641A2 (en) 2003-07-23

Family

ID=26212251

Family Applications (1)

Application Number Title Priority Date Filing Date
EP01917119A Ceased EP1328641A2 (en) 2000-03-10 2001-03-12 Polynucleotides isolated from e. coli of nature b2/d+ a-, and uses thereof

Country Status (4)

Country Link
EP (1) EP1328641A2 (en)
JP (1) JP2004512817A (en)
CA (1) CA2402602A1 (en)
WO (1) WO2001066572A2 (en)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1342784A1 (en) * 2002-03-06 2003-09-10 Mutabilis S.A. ExPEC-specific proteins, genes encoding them and uses thereof
JP2008530245A (en) 2005-02-18 2008-08-07 ノバルティス ヴァクシンズ アンド ダイアグノスティクス, インコーポレイテッド Antigens from uropathogenic strains
PL1858920T3 (en) * 2005-02-18 2016-12-30 Proteins and nucleic acids from meningitis/sepsis-associated escherichia coli
US8277798B2 (en) * 2006-05-10 2012-10-02 Institut National De La Recherche Agronomique Use of cells containing a specific DNA molecule as cytopathic agents to inhibit the proliferation of cells
US20100166788A1 (en) 2006-08-16 2010-07-01 Novartis Vaccines And Diagnostics Immunogens from uropathogenic escherichia coli
ITMI20090946A1 (en) 2009-05-28 2010-11-29 Novartis Ag EXPRESSION OF RECOMBINANT PROTEINS
PL2464658T3 (en) 2009-07-16 2015-03-31 Novartis Ag Detoxified escherichia coli immunogens
ATE557035T1 (en) * 2009-09-10 2012-05-15 Univ Berlin Freie IDENTIFICATION AND CHARACTERIZATION OF A NOVEL AVIARY PATHOGENIC E. COLI (APEC) FIMBRIAL ADHESIN
EP2736922B1 (en) 2011-07-28 2017-11-01 Universiteit Gent Prevention of salmonella recrudescense
GB2554742A (en) * 2016-10-07 2018-04-11 Bhongir Ravi Treatment
JP2024514074A (en) * 2021-04-01 2024-03-28 ヤンセン ファーマシューティカルズ,インコーポレーテッド E. Production of COLI O18 bioconjugate

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2078716A1 (en) * 1992-09-21 1994-03-22 Joyce De Azavedo Attaching and effacing protein of enterohemorrhagic e. coli
ES2076895B1 (en) * 1994-02-04 1996-08-16 Ind Farma Especial FIMBRIAS-ADHESINAS OF TYPE 1 AND P ISOLATED FROM NEW STRAINS OF E. COLI, PROCEDURE FOR THEIR OBTAINING AND APPLICATIONS.
JPH11506424A (en) * 1995-03-24 1999-06-08 オフィディアン ファーマシューティカルズ インコーポレーテッド Treatment for Verotoxin-producing Escherichia coli
CA2272047A1 (en) * 1996-11-22 1998-05-28 Human Genome Sciences, Inc. Nucleotide sequence of escherichia coli pathogenicity islands

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO0166572A2 *

Also Published As

Publication number Publication date
WO2001066572A2 (en) 2001-09-13
WO2001066572A3 (en) 2003-05-01
WO2001066572A9 (en) 2002-08-15
CA2402602A1 (en) 2001-09-13
JP2004512817A (en) 2004-04-30

Similar Documents

Publication Publication Date Title
Hsu et al. The role of Klebsiella pneumoniae rmpA in capsular polysaccharide synthesis and virulence revisited
Kao et al. Pathogenicity island sequences of pyelonephritogenic Escherichia coli CFT073 are associated with virulent uropathogenic strains
Nicholls et al. Identification of a novel genetic locus that is required for in vitro adhesion of a clinical isolate of enterohaemorrhagic Escherichia coli to epithelial cells
Russo et al. Identification of a new iron-regulated virulence gene, ireA, in an extraintestinal pathogenic isolate of Escherichia coli
Bonacorsi et al. Identification of regions of the Escherichia coli chromosome specific for neonatal meningitis-associated strains
Peigne et al. The plasmid of Escherichia coli strain S88 (O45: K1: H7) that causes neonatal meningitis is closely related to avian pathogenic E. coli plasmids and is associated with high-level bacteremia in a neonatal rat meningitis model
Occhino et al. Vibrio cholerae iron transport: haem transport genes are linked to one of two sets of tonB, exbB, exbD genes
Spitznagel Jr et al. Regulation of leukotoxin in leukotoxic and nonleukotoxic strains of Actinobacillus actinomycetemcomitans
Pupo et al. Evolutionary relationships among pathogenic and nonpathogenic Escherichia coli strains inferred from multilocus enzyme electrophoresis and mdh sequence studies
Taniguchi et al. Cloning and expression in Escherichia coli of Vibrio parahaemolyticus thermostable direct hemolysin and thermolabile hemolysin genes
Newton et al. Shiga toxin–producing Escherichia coli strains negative for locus of enterocyte effacement
Valvano et al. Occurrence of chromosome-or plasmid-mediated aerobactin iron transport systems and hemolysin production among clonal groups of human invasive strains of Escherichia coli K1
EP0355147B1 (en) Dna probe specific for pathogenic listeria
US20080241151A1 (en) Virulence genes, proteins, and their use
JP2003144180A (en) Identification of gene
Hsu et al. Cloning of a beta-hemolysin gene of Brachyspira (Serpulina) hyodysenteriae and its expression in Escherichia coli
Javed et al. Transposon mutagenesis in a hyper-invasive clinical isolate of Campylobacter jejuni reveals a number of genes with potential roles in invasion
Cid et al. Association between intimin (eae) and EspB gene subtypes in attaching and effacing Escherichia coli strains isolated from diarrhoeic lambs and goat kids
EP1328641A2 (en) Polynucleotides isolated from e. coli of nature b2/d+ a-, and uses thereof
Reingold et al. Identification of a new Escherichia coli She haemolysin homolog in avian E. coli
US20090298713A1 (en) Polynucleotides which are of nature b2/d+ a- and which are isolated from e. coli, and biological uses of these polynucleotides and of their polypeptides
Mazur et al. Membrane topology of PssT, the transmembrane protein component of the type I exopolysaccharide transport system in Rhizobium leguminosarum bv. trifolii strain TA1
Cutter et al. Cloning and expression of the damselysin gene from Vibrio damsela
Harrington et al. Epidemiology of Salmonella sofia in Australia
WO2001032697A2 (en) Virulence genes and proteins from brucella melitensis, and their use

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20020909

AK Designated contracting states

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR

REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 1059278

Country of ref document: HK

111L Licence recorded

Free format text: 0100 MUTABILIS S.A.

Effective date: 20060304

111L Licence recorded

Free format text: 0101 CHIRON SRL.

Effective date: 20060304

Free format text: 0100 MUTABILIS S.A.

Effective date: 20060304

17Q First examination report despatched

Effective date: 20070314

REG Reference to a national code

Ref country code: DE

Ref legal event code: R003

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION HAS BEEN REFUSED

18R Application refused

Effective date: 20111029

REG Reference to a national code

Ref country code: HK

Ref legal event code: WD

Ref document number: 1059278

Country of ref document: HK