FR2801609A1

FR2801609A1 - Collection of nucleic acids from environmental samples, useful for identifying e.g. genes encoding polyketide synthases and derived antibiotics

Info

Publication number: FR2801609A1
Application number: FR9915032A
Authority: FR
Inventors: Pascale Jeannin; Jean Luc Pernodet; Michel Guerineau; Pascal Simonet; Sophie Courtois; Carmela Cappellano; Francois Francou; Alain Raynal; Maria Ball; Guennadi Sezonov
Original assignee: Aventis Pharma SA
Current assignee: Aventis Pharma SA
Priority date: 1999-11-29
Filing date: 1999-11-29
Publication date: 2001-06-01
Anticipated expiration: 2019-11-29
Also published as: NZ519883A; ZA200203532B; FR2801609B1

Abstract

Preparation of a collection of nucleic acids (I) from organisms in a soil sample, is new. Preparation of a collection of nucleic acids (I) from organisms in a soil sample comprising: (1) milling a dried sample to produce microparticles (MP); (2) suspending these in liquid buffer; (3) extraction of (I) from MP; (4) passing (I)-containing solution through a molecular sieve; (5) passing (I)-enriched fractions through an anion exchange chromatography material; and (6) recovering fractions containing purified (I), is new. Independent claims are also included for the following: (a) processing an environmental sample by: (i) dispersing in liquid; (ii) homogenization of the resulting suspension by gentle mixing; (iii) separation of organisms from other components by density-gradient centrifugation; (iv) lysing separated organisms; (v) extraction of (I); and (vi) purification on a cesium chloride gradient; (b) preparing a collection of recombinant vectors by inserting (I) into a cloning and/or expression vector; (c) new methods for preparing the cloning and expression vectors; (d) a collection of (I) prepared by the new methods; (e) (I) present in the collections of (d); (f) recombinant vectors containing specific (I), prepared by methods (c); (g) collection of recombinant vectors prepared by methods (b) or (c); (h) recombinant vectors present in the collection of (g); (i) recombinant host cells containing (I) of (e) or the vector of (h); (j) collection of cells of (i); (k) detecting a specific (I), or related sequence, in the collection of (j); (l) identifying production of a selected compound (II) in one or more cells of the collection of (j); (m) selected recombinant host cells producing (II) in the collection of (j); (n) producing (II); (o) (II) produced by method (m); (p) polyketide (PK) produced by expressing specified sequences (Ia); (q) determining diversity of (I) in a collection; (r) nucleic acid (Ib) containing a 16S rDNA sequence at least 99% identical with any of 47 specified sequences as given in the specification; (s) producing a type I PK synthase (PKS-I); (t) PKS having any of about 20 specified amino acid sequences as given in the specification; (u) antibodies (Ab) directed against the PKS of (t); (v) detecting PKS-I or its fragments; and (w) kit for method (v).

Description

La présente invention concerne un procédé de préparation d'acides nucléiques à partir d'un échantillon de l'environnement, plus particulièrement un procédé d'obtention d'une collection d'acides nucléiques à partir d'un échantillon. L'invention est également relative aux acides nucléiques ou aux collections d'acides nucléiques obtenus selon le procédé et leur application à la synthèse de nouveaux composés, notamment de nouveaux composés d'intérêt thérapeutique. The present invention relates to a method for preparing nucleic acids from a sample of the environment, more particularly to a method of obtaining a nucleic acid collection from a sample. The invention also relates to the nucleic acids or collections of nucleic acids obtained according to the process and their application to the synthesis of new compounds, in particular new compounds of therapeutic interest.

L'invention a également pour objet les moyens nouveaux mis en oeuvre dans le procédé d'obtention d'acides nucléiques ci-dessus, tels que de nouveaux vecteurs et des nouveaux procédés de préparation de tels vecteurs ou encore des cellules hôtes recombinantes comprenant un acide nucléique de l'invention. The subject of the invention is also the novel means used in the above process for obtaining nucleic acids, such as novel vectors and methods for preparing such vectors, or recombinant host cells comprising an acid. nucleic acid of the invention.

L'invention concerne encore des procédés pour détecter un acide nucléique d'intérêt au sein d'une collection d'acides nucléiques obtenus selon le procédé ci-dessus, ainsi que les acides nucléiques détectés par un tel procédé et les polypeptides codés par de tels acides nucléiques. The invention also relates to methods for detecting a nucleic acid of interest within a collection of nucleic acids obtained according to the above method, as well as the nucleic acids detected by such a method and the polypeptides encoded by such methods. nucleic acids.

L'invention a également trait à des acides nucléiques obtenus et détectés selon les procédés ci-dessus, en particulier des acides nucléiques codant pour une enzyme participant à la voie de biosynthèse d'antibiotiques tels que les ss-lactames, les aminoglycosides, les nucléotides hétérocycliques ou encore des polykétides ainsi que l'enzyme codée par ces acides nucléiques, les polykétides produits grâce à l'expression de ces acides nucléiques et enfin des compositions pharmaceutiques comprenant une quantité pharmacologiquement active d'un polykétide produit grâce à l'expression de tels acides nucléiques. The invention also relates to nucleic acids obtained and detected according to the above methods, in particular nucleic acids encoding an enzyme participating in the biosynthetic pathway of antibiotics such as ß-lactams, aminoglycosides, nucleotides heterocyclic or polyketides as well as the enzyme encoded by these nucleic acids, polyketides produced through the expression of these nucleic acids and finally pharmaceutical compositions comprising a pharmacologically active amount of a polyketide produced through the expression of such nucleic acids.

Depuis la découverte de la production de la streptomycine par les actinomycètes, la recherche de nouveaux composés d'intérêt thérapeutique, et tout particulièrement de nouveaux antibiotiques, a eu recours de manière accrue à des méthodes de criblage des métabolites produits par les micro-organismes du sol. Since the discovery of streptomycin production by actinomycetes, the search for new compounds of therapeutic interest, and especially new antibiotics, has made greater use of methods for screening the metabolites produced by the microorganisms of the ground.

De telles méthodes consistent principalement à isoler les organismes de la microflore tellurique, à les cultiver sur des milieux nutritifs spécialement adaptés puis à détecter une activité pharmacologique dans les produits retrouvés dans les surnageants de Such methods consist mainly of isolating the organisms from the soil microflora, cultivating them on specially adapted nutrient media and then detecting pharmacological activity in the products found in the supernatants of the soil.

culture ou dans les lysats cellulaires ayant, le cas échéant, subi au préalable une ou plusieurs étapes de séparation et/ou de purification. culture or in cell lysates having, if necessary, previously undergone one or more separation and / or purification steps.

Ainsi, les méthodes d'isolement et de culture in vitro des organismes constituant la microflore tellurique ont permis, à la date d'aujourd'hui, de caractériser environ 40.000 molécules, dont environ la moitié présente une activité biologique. Thus, the methods of isolation and in vitro culture of the organisms constituting the telluric microflora have made it possible, as of today, to characterize about 40,000 molecules, of which about half have a biological activity.

Des produits majeurs ont été caractérisés selon de telles méthodes de culture in vitro, tels que des antibiotiques (pénicilline, érythromycine, actinomycine, tétracycline, céphalosporine), des anticancéreux, des anticholestérolémiants ou encore des pesticides. Major products have been characterized according to such in vitro culture methods, such as antibiotics (penicillin, erythromycin, actinomycin, tetracycline, cephalosporin), anticancer agents, antichololesterolemic agents or even pesticides.

Les produits d'intérêt thérapeutique d'origine microbienne connus à ce jour proviennent majoritairement (environ 70%) du groupe des actinomycètes et plus particulièrement du genre Streptomyces. The products of therapeutic interest of microbial origin known to date come mainly (about 70%) from the group of actinomycetes and more particularly of the genus Streptomyces.

Toutefois, d'autres composés thérapeutiques, tels que les teicoplanines, la gentamycine et les spinosines, ont été isolés à partir de microorganismes de genres plus difficiles à cultiver tels que Micromonospora, Actinomadura, Actinoplanes, Nocardia, Streptosporangium, Kitasatosporia ou encore Saccharomonospora. However, other therapeutic compounds, such as teicoplanins, gentamycin and spinosins, have been isolated from more difficult to cultivate microorganisms such as Micromonospora, Actinomadura, Actinoplanes, Nocardia, Streptosporangium, Kitasatosporia or Saccharomonospora.

Mais la pratique illustre le fait que la caractérisation de nouveaux produits naturels synthétisés par les organismes de la microflore du sol est restée limitée, en partie du fait que l'étape de culture in vitro aboutit le plus souvent à une sélection d'organismes déjà connus antérieurement. But the practice illustrates that the characterization of new natural products synthesized by the organisms of the soil microflora has remained limited, partly because the in vitro culture step most often results in a selection of organisms already known previously.

Les méthodes de séparation et de culture in vitro des organismes telluriques en vue d'identifier de nouveaux composés d'intérêt présentent donc de nombreuses limites. The methods of separation and in vitro culture of telluric organisms to identify new compounds of interest therefore have many limitations.

Chez les actinomycètes, par exemple, le taux de redécouverte d'antibiotiques déjà connus antérieurement est d'environ 99%. En effet, des techniques de microscopie en fluorescence ont permis de dénombrer plus de 1010 cellules bactériennes dans 1g de sol, alors que seulement 0,1à 1 % de ces bactéries peuvent être isolées après ensemencement sur des milieux de culture. In actinomycetes, for example, the antibiotic rediscovery rate already known previously is about 99%. Indeed, fluorescence microscopy techniques made it possible to count more than 1010 bacterial cells in 1 g of soil, whereas only 0.1 to 1% of these bacteria can be isolated after seeding on culture media.

A l'aide de techniques de cinétique de réassociation d'ADN, il a pu être montré qu'entre 12. 000 et 18. 000 espèces bactériennes peuvent être contenues dans 1g de sol, alors qu'à ce jour, seuls 5000 microorganismes non eucaryotes ont été décrits, tout habitat confondu. Using DNA reassociation kinetics techniques, it has been shown that between 12,000 and 18,000 bacterial species can be contained in 1g of soil, whereas to date only 5,000 microorganisms eukaryotes have been described, all habitat confused.

Des études d'écologie moléculaire ont permis d'amplifier et cloner de nombreuses séquences nouvelles d'ADNr 16S à partir d'ADN de l'environnement. Molecular ecology studies have amplified and cloned many new 16S rDNA sequences from environmental DNA.

Les résultats de ces études ont conduit à tripler le nombre de divisions bactériennes caractérisées antérieurement. The results of these studies led to a tripling of the number of bacterial divisions previously characterized.

A la date d'aujourd'hui, les bactéries sont subdivisées en 40 divisions, certaines d'entre elles n'étant constituées que par des bactéries ne pouvant être cultivées. Ces derniers résultats témoignent de l'ampleur de la biodiversité microbienne restée inexploitée à ce jour. As of today, the bacteria are subdivided into 40 divisions, some of which consist only of bacteria that can not be cultivated. These latest results demonstrate the extent of microbial biodiversity that has remained untapped to date.

Des travaux récents ont tenté de surmonter les nombreux obstacles à l'accès à la biodiversité de la microflore du sol, dont notamment l'étape de culture in vitro préalable à l'isolement et la caractérisation de composés d'intérêt industriel, surtout d'intérêt thérapeutique. Recent work has attempted to overcome the many obstacles to access to soil microflora biodiversity, including the in vitro culture step prior to isolation and the characterization of compounds of industrial interest, especially of therapeutic interest.

Des méthodes ont ainsi été mises au point qui incluent une étape d'extraction de l'ADN des organismes telluriques, le cas échéant après un isolement préalable des organismes contenus dans les échantillons de sol. Methods have thus been developed which include a step of extracting the DNA from telluric organisms, if appropriate after prior isolation of the organisms contained in the soil samples.

L'ADN ainsi extrait, après lyse des cellules bactériennes sans étape préalable de culture in vitro, est cloné dans des vecteurs utilisés pour transfecter des organismes hôtes, afin de constituer des banques d'ADN provenant de bactéries du sol. The DNA thus extracted, after lysis of the bacterial cells without prior in vitro culture step, is cloned into vectors used to transfect host organisms, to form DNA libraries from soil bacteria.

Ces banques de clones recombinants sont utilisées pour détecter la présence de gènes codant pour des composés d'intérêt thérapeutique ou alternativement pour détecter la production de composés d'intérêt thérapeutique par ces clones recombinants. These libraries of recombinant clones are used to detect the presence of genes coding for compounds of therapeutic interest or alternatively to detect the production of compounds of therapeutic interest by these recombinant clones.

Toutefois, les méthodes d'accès direct à l'ADN de la microflore du sol, décrites dans l'état de la technique présentent des inconvénients lors de la mise en oeuvre de chacune des étapes décrites ci-dessus, de nature à affecter considérablement la quantité et la qualité du matériel génétique obtenu et exploitable. However, the methods for direct access to the DNA of the soil microflora described in the state of the art have drawbacks in the implementation of each of the steps described above, which can considerably affect the quantity and quality of genetic material obtained and exploitable.

L'état de la technique concernant chacune des étapes de construction de banques d'ADN provenant d'échantillons de sol est détaillé ci-après, ainsi que les inconvénients techniques identifiés par le demandeur et qui ont été surmontés selon la présente invention. The state of the art relating to each of the steps of constructing DNA libraries from soil samples is detailed below, as well as the technical drawbacks identified by the applicant and which have been overcome according to the present invention.

1. Etape d'extraction de l'ADN à partir d'un échantillon du sol. 1. Step of extracting DNA from a soil sample.

1. 1 Extraction directe d'ADN de l'environnement. 1. 1 Direct DNA extraction from the environment.

Il s'agit pour l'essentiel d'un procédé mettant en oeuvre des techniques d'extraction d'ADN réalisées directement sur l'échantillon dans l'environnement, le plus souvent après une lyse in situ préalable des organismes de l'échantillon. It is essentially a process using DNA extraction techniques performed directly on the sample in the environment, usually after a prior in situ lysis of the sample organisms.

De telles techniques ont été mises en oeuvre sur des échantillons provenant de milieux aquatiques, que ce soit d'eau douce ou marine. Elles comprennent une première étape de concentration préalable des cellules présentes librement ou sous forme de particules, consistant en général en une filtration de grands volumes d'eau sur différents dispositifs de filtration, par exemple filtration classique sur membrane, filtration tangentielle ou rotationnelle ou encore ultrafiltration. Such techniques have been implemented on samples from aquatic environments, whether freshwater or marine. They comprise a first stage of prior concentration of the cells present freely or in the form of particles, generally consisting in a filtration of large volumes of water on various filtration devices, for example conventional membrane filtration, tangential or rotational filtration or else ultrafiltration .

La taille des pores est comprise entre 0,22 et 0,45 mm et nécessite souvent une préfiltration dans le but d'éviter des colmatages dus au traitement de grands volumes. The pore size is between 0.22 and 0.45 mm and often requires prefiltration in order to avoid clogging due to the treatment of large volumes.

Dans un second temps, les cellules récoltées sont lysées directement sur les filtres dans des petits volumes de solutions, par traitement enzymatique et/ou chimique. In a second step, the harvested cells are lysed directly on the filters in small volumes of solutions, by enzymatic and / or chemical treatment.

Cette technique est par exemple illustrée par les travaux de STEIN et al. , 1996, Journal of Bacteriology, Vol.178 (3): 591-599 qui décrit le clonage de gènes codant pour de l'ADN ribosomal et pour un facteur d'élongation de la transcription (EF 2) à partir d'Archaebactéries du plancton marin. This technique is for example illustrated by the work of STEIN et al. , 1996, Journal of Bacteriology, Vol.178 (3): 591-599 which describes the cloning of genes coding for ribosomal DNA and for a transcriptional elongation factor (EF 2) from Archaebacteria marine plankton.

Des techniques d'extraction directe d'ADN à partir d'échantillons de sol ou de sédiment ont été également décrites, basées sur des protocoles de lyse physique, chimique ou enzymatique réalisée in situ. Techniques for the direct extraction of DNA from soil or sediment samples have also been described, based on physical, chemical or enzymatic lysis protocols performed in situ.

Par exemple, le brevet US N 5,824,485 (Chromaxome Corporation) décrit une lyse chimique des bactéries directement sur l'échantillon prélevé par addition d'un tampon de lyse chaud à base d'isothiocyanate de guanidium. For example, US Patent No. 5,824,485 (Chromaxome Corporation) discloses chemical lysis of bacteria directly to the sample taken by addition of a hot lysis buffer based on guanidium isothiocyanate.

La demande Internationale n WO 99/20.799 (WISCONSIN ALUMNI RESEARCH FOUNDATION) décrit une étape de lyse des bactéries in situ à l'aide d'un tampon d'extraction contenant une protéase et du SDS. International Application No. WO 99/20799 (WISCONSIN ALUMNI RESEARCH FOUNDATION) describes a step of lysing bacteria in situ using an extraction buffer containing a protease and SDS.

D'autres techniques ont également été utilisées telles que la réalisation de plusieurs cycles de congélation-décongélation de l'échantillon puis pressage de l'échantillon décongelé à haute pression. Other techniques have also been used, such as carrying out several cycles of freezing-thawing of the sample and then pressing the thawed sample at high pressure.

Ont été également utilisées des techniques de lyse des bactéries à l'aide d'une succession d'étapes de sonication, de chauffage par micro-ondes et de chocs thermiques (PICARD et al. (1992). Bacterial lysis techniques have also been used by means of a succession of sonication steps, microwave heating and thermal shocks (PICARD et al (1992).

Toutefois, les techniques d'extraction directe d'ADN de l'état de la technique décrites ci-dessus ont une efficacité très variable du point de vue quantitatif et qualitatif. However, the techniques of direct DNA extraction of the state of the art described above have a very variable efficiency in terms of quantity and quality.

Ainsi, les traitements chimiques ou enzymatiques in situ de l'échantillon ont le désavantage de ne lyser que certaines catégories de micro-organismes du fait de la résistance sélective des différents microorganismes indigènes à l'étape de lyse en raison de leur morphologie hétérogène. Thus, the chemical or enzymatic treatments in situ of the sample have the disadvantage of only lysing certain categories of microorganisms because of the selective resistance of the various indigenous microorganisms to the lysis stage because of their heterogeneous morphology.

Ainsi, les bactéries à Gram-positif résistent à un traitement à chaud au détergent SDS alors que la quasi-totalité des cellules à Gramnégatif sont lysées . Thus, Gram-positive bacteria are resistant to SDS detergent heat treatment while almost all Gram-negative cells are lysed.

En outre, certains des protocoles d'extraction directe décrits cidessus favorisent l'adsorption des acides nucléiques extraits sur les particules minérales de l'échantillon, réduisant ainsi significativement la quantité d'ADN accessible. In addition, some of the direct extraction protocols described above promote the adsorption of the extracted nucleic acids on the mineral particles of the sample, thus significantly reducing the amount of accessible DNA.

Par ailleurs, si certains protocoles de l'état de la technique divulguent une étape de traitement mécanique pour lyser les microorganismes de l'échantillon prélevé, une telle étape de lyse mécanique est systématiquement effectuée en milieu liquide dans un tampon d'extraction, ce qui ne permet pas une bonne homogénéisation de l'échantillon de départ sous la forme de particules fines permettant une accessibilité maximale à la diversité des organismes présents dans l'échantillon. Des essais de broyage ont également été effectués sur échantillon de sol brut à l'aide de billes de verre, mais la quantité d'ADN extrait était faible. Furthermore, if certain protocols of the state of the art disclose a mechanical treatment step for lysing the microorganisms of the sample taken, such a mechanical lysis step is systematically performed in a liquid medium in an extraction buffer, which does not allow a good homogenization of the starting sample in the form of fine particles allowing maximum accessibility to the diversity of organisms present in the sample. Grinding tests were also performed on a raw soil sample using glass beads, but the amount of DNA extracted was small.

Il a été observé selon l'invention qu'une première étape de lyse mécanique in situ en milieu liquide avait des effets négatifs sur la quantité d'ADN susceptible d'être extrait. It has been observed according to the invention that a first step of in situ mechanical lysis in a liquid medium has negative effects on the amount of DNA that can be extracted.

La quantité d'ADN directement utilisable pour le clonage dans des vecteurs recombinants est également tributaire des étapes de purification subséquentes à son extraction. The amount of directly usable DNA for cloning into recombinant vectors is also dependent on the purification steps subsequent to its extraction.

Dans l'état de la technique, l'ADN extrait est ensuite purifié, par exemple par l'utilisation de polyvinylpolypyrrolidone, par une précipitation en présence d'acétate d'ammonium ou de potassium, par des centrifugations sur gradient de chlorure de césium, ou encore des techniques chromatographiques, notamment sur support d'hydroxyapatite, sur colonne échangeuse d'ions ou encore tamisage moléculaire ou par des techniques d'électrophorèse sur gel d'agarose. In the state of the art, the extracted DNA is then purified, for example by the use of polyvinylpolypyrrolidone, by precipitation in the presence of ammonium or potassium acetate, by cesium chloride gradient centrifugations, or else chromatographic techniques, in particular on hydroxyapatite support, ion exchange column or molecular sieving or by agarose gel electrophoresis techniques.

Les techniques de purification d'ADN décrites antérieurement, surtout lorsque celles-ci sont combinées avec les techniques d'extraction d'ADN de l'environnement précitées, sont susceptibles de conduire à une co-purification de l'ADN avec des composés inhibiteurs provenant de l'échantillon initial qui sont difficiles à éliminer. Previously described DNA purification techniques, especially when these are combined with the above-mentioned DNA extraction techniques from the environment, are likely to lead to co-purification of DNA with inhibitory compounds from of the initial sample that are difficult to eliminate.

La co-extraction de composés inhibiteurs avec l'ADN nécessite la multiplication du nombre d'étapes de purification ce qui conduit à des pertes importantes de l'ADN initialement extrait et réduit simultanément la diversité du matériel génétique initialement contenu dans l'échantillon, ainsi que sa quantité. The coextraction of inhibitory compounds with the DNA requires the multiplication of the number of purification steps which leads to significant losses of the DNA initially extracted and simultaneously reduces the diversity of the genetic material initially contained in the sample, as well as than its quantity.

Un autre but de l'invention a été de surmonter les inconvénients des protocoles de purification antérieurs et de mettre au point une étape de purification d'ADN permettant de maintenir de manière optimale la diversité de l'ADN de l'échantillon initial, d'une part, et, de favoriser quantitativement son obtention, d'autre part. Another object of the invention has been to overcome the disadvantages of previous purification protocols and to develop a DNA purification step to optimally maintain the DNA diversity of the original sample, from on the one hand, and, quantitatively, to favor its obtaining, on the other hand.

Tout particulièrement, les améliorations qualitatives et quantitatives à la purification d'ADN sont maximales lorsqu'elles font appel à une combinaison d'un procédé d'extraction direct de l'ADN selon l'invention et d'un procédé de purification ultérieur, comme cela sera décrit ci-après. In particular, the qualitative and quantitative improvements in DNA purification are maximum when they involve a combination of a direct extraction method of the DNA according to the invention and a subsequent purification process, such as this will be described below.

1. 2. Extraction indirecte d'ADN de l'environnement. 1. 2. Indirect DNA extraction from the environment.

De telles techniques ont recours à une première étape de séparation des différents organismes de la microflore tellurique des autres constituants de l'échantillon de départ, préalablement à l'étape d'extraction de l'ADN proprement dite. Such techniques use a first step of separating the different organisms from the telluric microflora from the other constituents of the starting sample, prior to the step of extracting the DNA proper.

Dans l'état de la technique, la séparation préalable d'une fraction microbienne d'un échantillon de sol comprend le plus souvent une dispersion physique de l'échantillon par broyage de ce dernier en milieu liquide, par exemple en utilisant des dispositifs du type Waring Blender ou encore un mortier. In the state of the art, the prior separation of a microbial fraction from a soil sample most often comprises a physical dispersion of the sample by grinding the latter in a liquid medium, for example using devices of the type Waring Blender or a mortar.

Il a également été décrit des dispersions chimiques, par exemple sur des résines échangeuses d'ions ou encore des dispersions à l'aide de détergents non spécifiques tels que le déoxycholate de sodium ou du polyéthylène glycol. Quel que soit le mode de dispersion, l'échantillon solide doit être mis en suspension dans de l'eau, du tampon phosphate ou une solution saline. It has also been described chemical dispersions, for example on ion exchange resins or dispersions with non-specific detergents such as sodium deoxycholate or polyethylene glycol. Whatever the mode of dispersion, the solid sample should be suspended in water, phosphate buffer or saline solution.

L'étape de dispersion physique ou chimique peut être suivie d'une centrifugation sur gradient de densité permettant la séparation des cellules contenues dans l'échantillon et des particules de ce dernier, étant entendu que les bactéries ont des densités inférieures à celles de la plupart des particules du sol. The physical or chemical dispersion step may be followed by a density gradient centrifugation to separate the cells in the sample and particles from the sample, it being understood that the bacteria have densities lower than those of most soil particles.

L'étape de dispersion physique peut aussi être suivie alternativement d'une étape de centrifugation à faible vitesse ou encore une étape d'élutriation cellulaire. The physical dispersion step can also be alternately followed by a low speed centrifugation step or a cell elutriation step.

L'ADN peut ensuite être extrait des cellules séparées par toutes les méthodes de lyse disponibles et être purifié par de nombreuses méthodes, y compris les méthodes de purification décrites au paragraphe 1. 1 précédent. Notamment, l'inclusion des cellules dans de l'agarose à bas point de fusion peut être réalisée afin de ménager la lyse. The DNA can then be extracted from the cells separated by all available lysis methods and purified by many methods, including the purification methods described in the previous paragraph. In particular, the inclusion of the cells in low-melting agarose can be carried out in order to preserve the lysis.

Toutefois, les méthodes décrites dans l'état de la technique connues du demandeur ne donnent pas satisfaction du fait de la présence, dans les fractions contenant l'ADN extrait, de constituants indésirables de l'échantillon de départ ayant une influence significative sur la qualité et la quantité d'ADN final. However, the methods described in the state of the art known to the applicant are unsatisfactory because of the presence, in the fractions containing the extracted DNA, of undesirable constituents of the starting sample having a significant influence on the quality. and the amount of final DNA.

La présente invention se propose de résoudre les difficultés techniques rencontrées dans les procédés de l'art antérieur comme cela sera décrit ci-après. The present invention proposes to solve the technical difficulties encountered in the methods of the prior art as will be described below.

2. Caractérisation moléculaire de l'ADN extrait. 2. Molecular characterization of the extracted DNA.

Lorsque l'on désire construire une banque d'ADN à partir d'un échantillon de l'environnement, en particulier à partir d'un échantillon de sol, il est avantageux de vérifier la qualité et la diversité de la source d'ADN extrait et purifié préalablement à son insertion dans des vecteurs appropriés. When it is desired to construct a DNA library from a sample of the environment, in particular from a soil sample, it is advantageous to check the quality and diversity of the extracted DNA source. and purified prior to insertion into appropriate vectors.

L'objectif d'une telle caractérisation moléculaire de l'ADN extrait et purifié est d'obtenir des profils représentant les proportions des différents taxons bactériens présents dans cet extrait d'ADN. La caractérisation moléculaire de l'ADN extrait et purifié permet de déterminer si des artefacts ont été introduits lors de la mise en oeuvre des différentes étapes d'extraction et de purification et, le cas échéant, si la diversité d'origine de l'ADN extrait et purifié est représentative de la diversité microbienne présente initialement dans l'échantillon, notamment dans l'échantillon de sol. The objective of such a molecular characterization of the extracted and purified DNA is to obtain profiles representing the proportions of the different bacterial taxons present in this DNA extract. The molecular characterization of the extracted and purified DNA makes it possible to determine whether artifacts have been introduced during the implementation of the various steps of extraction and purification and, if appropriate, whether the original diversity of the DNA extracted and purified is representative of the microbial diversity initially present in the sample, especially in the soil sample.

A la connaissance du demandeur, il est recouru dans l'état de la technique à des procédés d'hybridation quantitative mettant en oeuvre des sondes oligonucléotidiques spécifiques de différents groupes bactériens, appliqués directement à l'ADN extrait de l'environnement. To the applicant's knowledge, quantitative hybridization methods using oligonucleotide probes specific for different bacterial groups applied directly to the DNA extracted from the environment are used in the state of the art.

Malheureusement, une telle approche est peu sensible et ne permet pas de détecter des genres ou des groupes taxonomiques présents en faible abondance. Unfortunately, such an approach is insensitive and can not detect taxonomic genera or groups present in low abundance.

L'état de la technique décrit aussi des procédés de PCR quantitative, telle que la MPN-PCR ou encore la PCR quantitative par compétition. Toutefois, ces techniques présentent d'importants inconvénients. The state of the art also describes quantitative PCR methods, such as MPN-PCR or competitive quantitative PCR. However, these techniques have significant disadvantages.

Ainsi, la MPN-PCR est d'une utilisation complexe du fait de la multiplication des dilutions et des répétitions qui la rend inappropriée pour un grand nombre d'échantillons ou de couples d'amorces. Thus, the MPN-PCR is of a complex use because of the multiplication of the dilutions and the repetitions which makes it inappropriate for a large number of samples or couples of primers.

Par ailleurs, la PCR quantitative par compétition est d'une mise en oeuvre difficile du fait de la nécessité de construire un compétiteur spécifique à l'ADN cible qui, en outre, n'induit pas de biais ou d'artefacts dans la compétition proprement dite. Moreover, the quantitative PCR by competition is difficult to implement due to the need to build a competitor specific to the target DNA which, moreover, does not induce any bias or artifacts in the competition itself. called.

Il est ainsi proposé selon l'invention un procédé de précriblage d'une banque d'ADN provenant d'un échantillon de l'environnement qui est à la fois rapide, simple et fiable et permet de tester la qualité de l'ADN préalablement extrait et purifié et de déterminer ainsi l'intérêt de construire une banque de clones préparés à partir de cet ADN purifié de départ. It is thus proposed according to the invention a precranking method of a DNA library from a sample of the environment which is at once fast, simple and reliable and makes it possible to test the quality of the DNA extracted beforehand. and purified and thereby determine the interest of constructing a library of clones prepared from this purified starting DNA.

3. Vecteurs pour le clonage de l'ADN extrait et purifié à partir d'un échantillon de l'environnement. 3. Vectors for cloning extracted and purified DNA from a sample of the environment.

De nombreux vecteurs ont déjà été décrits dans l'état de la technique afin de cloner de l'ADN préalablement extrait d'un échantillon de l'environnement. Many vectors have already been described in the state of the art in order to clone DNA previously extracted from a sample of the environment.

Ainsi, selon la description de la demande internationale n WO 99/20.799, peuvent être utilisés des vecteurs viraux, des phages, des plasmides, des phagemides, des cosmides, des phosmides, des vecteurs du type BAC (chromosome artificiel bactérien) ou encore le bactériophage P1, des vecteurs de type PAC (chromosome artificiel basé sur le bactériophage P1), des vecteurs du type YAC (chromosome artificiel de levure), des plasmides de levure ou tout autre vecteur capable de maintenir et d'exprimer de manière stable un ADN génomique. Thus, according to the description of International Application No. WO 99/20799, it is possible to use viral vectors, phages, plasmids, phagemids, cosmids, phosmides, vectors of the BAC (artificial bacterial chromosome) type, or the bacteriophage P1, PAC-type vectors (artificial chromosome based on bacteriophage P1), YAC (artificial chromosome yeast) vectors, yeast plasmids or any other vector capable of stably maintaining and expressing a DNA genomics.

L'exemple 1 de la demande PCT n WO 99/20.799 décrit la construction d'une banque d'ADN génomique par clonage dans un vecteur du type BAC. Example 1 of PCT Application No. WO 99/20799 describes the construction of a genomic DNA library by cloning into a BAC vector.

A la connaissance du demandeur, aucune banque d'ADN provenant d'un échantillon de l'environnement n'avait encore été effectivement réalisée avec des vecteurs de type conjugatif, une telle technique étant rendue pour la première fois accessible et reproductible par l'homme du métier grâce à l'enseignement de la présente invention. To the applicant's knowledge, no DNA library from a sample of the environment had actually been made with conjugative vectors, such a technique being made accessible for the first time and reproducible by humans. of the art through the teaching of the present invention.

4. Hôtes cellulaires
Dans l'état de la technique, de nombreuses cellules hôtes ont été décrites comme pouvant être utilisées afin d'héberger les vecteurs contenant les inserts d'ADN provenant de l'ADN extrait et purifié à partir d'un échantillon de l'environnement. 4. Cellular hosts
In the state of the art, many host cells have been described as being able to be used to harbor vectors containing DNA inserts from extracted and purified DNA from a sample of the environment.

Ainsi, la demande PCT N WO 99/20.799 cite de nombreux hôtes cellulaires appropriés, tels que Escherichia coli, en particulier la souche DH 10B ou encore la souche 294 (ATCC 31446, la souche E. coli B, E. Thus, PCT application WO 99 / 20.799 cites many suitable cell hosts, such as Escherichia coli, in particular strain DH 10B or strain 294 (ATCC 31446, strain E. coli B, E.

Coli X 1776 (ATCC N 31 537), E.coli DH5 a et E.coli W3110 (ATCC n 27.325). Col. X. 1776 (ATCC N 31,537), E. coli DH5α and E. coli W3110 (ATCC No. 27,325).

Cette demande PCT cite également d'autres cellules hôtes appropriées telles que Enterobacter, Erwinia, Klebsiella, Proteus, Salmonella, Serratia, Schigella ou encore des souches du type bacillus telles que B. subtilis et B. licheniformis ainsi que les bactéries du genre Pseudomonas, Streptomyces ou Actinomyces. This PCT application also cites other appropriate host cells such as Enterobacter, Erwinia, Klebsiella, Proteus, Salmonella, Serratia, Schigella or Bacillus-type strains such as B. subtilis and B. licheniformis as well as bacteria of the genus Pseudomonas. Streptomyces or Actinomyces.

Le brevet US N 5,824,485 cite en particulier la souche de Streptomyces lividans TK66 ou encore des cellules de levure telles que celles de Saccharomyces pombe. US Patent No. 5,824,485 particularly cites the Streptomyces lividans strain TK66 or yeast cells such as those of Saccharomyces pombe.

5. Caractérisation de gènes d'intérêt dans des banques d'ADN provenant d'un échantillon de l'environnement. 5. Characterization of genes of interest in DNA libraries from a sample of the environment.

La demande PCT N WO 99/20.799 décrit une identification du phénotype de différents clones appartenant à la banque d'ADN de B. cereus, respectivement un clone produisant de l'hémolysine, un clone hydrolysant l'esculine ou encore un clone produisant un pigment orange. PCT application WO 99 / 20,799 describes an identification of the phenotype of different clones belonging to the B. cereus DNA library, respectively a clone producing hemolysin, an esculin-hydrolyzing clone or a clone producing a pigment. orange.

Des techniques de mutagenèse basées sur l'utilisation d'un transposon codant pour l'enzyme pho A ont permis subséquemment d'isoler des clones mutés et de caractériser les séquences responsables des phénotypes observés. Mutagenesis techniques based on the use of a transposon coding for the pho A enzyme have subsequently made it possible to isolate mutated clones and to characterize the sequences responsible for the observed phenotypes.

L'article de STEIN et al. (1996) précité décrit l'utilisation d'amorces spécifiques de l'ADN ribosomal afin d'amplifier l'ADN inséré dans les vecteurs hébergés par certains clones d'une banque d'ADN génomique d'Archaebactéries de plancton marin et l'identification de plusieurs séquences codantes dans l'ADN ainsi amplifié. The article by STEIN et al. (1996) supra discloses the use of ribosomal specific DNA primers to amplify DNA inserted into vectors harbored by certain clones of a marine plankton archaebacteria genomic DNA library and identification. of several coding sequences in the DNA thus amplified.

L'article de BORSCHERT S. et al., (1992) décrit le criblage d'une banque d'ADN génomique de Bacillus subtilis à l'aide de couples d'amorces hybridant avec des régions conservées de peptide synthétases connues afin d'identifier un ou plusieurs gènes correspondant dans le génome de Bacillus subtilis. The article by BORSCHERT S. et al. (1992) describes the screening of a Bacillus subtilis genomic DNA library using pairs of primers hybridizing with conserved regions of known peptide synthetases in order to identify one or more corresponding genes in the genome of Bacillus subtilis.

Cette technique a permis de détecter un fragment d'ADN chromosomique d'environ 26 kb portant une partie de l'opéron de biosynthèse de la surfactine. This technique detected a chromosomal DNA fragment of about 26 kb carrying part of the surfactin biosynthesis operon.

L'article de KAH-TONG S. et al.(1997) décrit le criblage d'une banque d'ADN provenant du sol à l'aide d'amorces hybridant avec des séquences conservées de l'opéron responsable de la voie de biosynthèse des polykétides de type Il et montre l'identification, au sein de cette banque d'ADN, de séquences apparentées au gène PKS-p. Cet article décrit aussi la construction de cassettes d'expression hybrides dans lesquelles la séquence de la sous-unité PKS-, retrouvée naturellement dans l'opéron responsable de la biosynthèse des polykétides, a été remplacée par différentes séquences apparentées retrouvées dans la banque d'ADN. The article by KAH-TONG S. et al (1997) describes the screening of a DNA library originating from the soil using primers hybridizing with conserved sequences of the operon responsible for the biosynthetic pathway. type II polyketides and shows the identification, within this DNA library, of sequences related to the PKS-p gene. This article also describes the construction of hybrid expression cassettes in which the sequence of the PKS- subunit, found naturally in the operon responsible for the biosynthesis of polyketides, has been replaced by different related sequences found in the library. DNA.

De même, l'article de HONG-FU et al. , (1995) décrit la construction de cassettes d'expression contenant les différentes phases de lecture ouverte de l'opéron responsable de la biosynthèse des polykétides, les différentes cassettes d'expression ayant été construites artificiellement en combinant les phases de lecture ouverte qui ne sont pas retrouvées ensemble naturellement dans le génome de Streptomyces coelicolor. Cet article montre que la combinaison, dans les cassettes d'expression artificielles, de cadres de lecture ouvert originaires de différentes souches bactériennes permet la production de polykétides ayant différentes caractéristiques structurales et des activités antibiotiques plus ou moins grandes vis-à-vis de Bacillus subtilis et Bacillus cereus. Similarly, the article by HONG-FU et al. , (1995) describes the construction of expression cassettes containing the different open reading phases of the operon responsible for the biosynthesis of polyketides, the different expression cassettes having been artificially constructed by combining the open reading phases which are not not found together naturally in the genome of Streptomyces coelicolor. This article shows that the combination, in artificial expression cassettes, of open reading frames originating from different bacterial strains allows the production of polyketides with different structural characteristics and more or less large antibiotic activities with respect to Bacillus subtilis. and Bacillus cereus.

Les polykétides font partie d'une grande famille de produits naturels de structure variable et possédant une grande diversité d'activités biologiques. Font partie des polykétides par exemple, les tétracyclines et l'érythromycine (antibiotiques), le FK506 (immunosuppresseur), la doxorubicine (agent anti-cancéreux), la monensine (un agent coccidiostatique) ainsi que l'avermectine (un agent antiparasitaire). Polyketides are part of a large family of natural products of variable structure and with a great diversity of biological activities. Examples of polyketides include tetracyclines and erythromycin (antibiotics), FK506 (immunosuppressant), doxorubicin (anti-cancer agent), monensin (a coccidiostatic agent) and avermectin (a pest control agent).

Ces molécules sont synthétisées grâce à des enzymes multifonctionnelles appelées polykétides synthases, qui catalysent des cycles de condensation répétés entre des acyl thioesters (en général des acétyl, propionyl, malonyl ou méthylmalonyl thioesters). Chaque cycle de condensation aboutit à la formation, sur une chaîne croissante carbonée, These molecules are synthesized through multifunctional enzymes called polyketide synthases, which catalyze repeated condensation cycles between acyl thioesters (usually acetyl, propionyl, malonyl or methylmalonyl thioesters). Each cycle of condensation results in the formation, on a growing carbon chain,

d'un groupe ss-kéto qui peut ensuite subir, le cas échéant, une ou plusieurs séries d'étapes réductrices. of a ss-keto group which can then undergo, if necessary, one or more series of reducing steps.

Compte-tenu de l'intérêt clinique important des polykétides, leur mécanisme commun de biosynthèse ainsi que le haut degré de conservation observé entre les groupes de gènes codant pour les polykétides synthases, il s'est développé un intérêt accru pour le développement de nouveaux polykétides par génie génétique. In view of the important clinical interest of polyketides, their common mechanism of biosynthesis, and the high degree of conservation observed between gene clusters encoding polyketide synthases, there has been increased interest in the development of new polyketides. by genetic engineering.

De nouveaux polykétides artificiels ont ainsi été produits par génie génétique, tels que la méderrhodine A ou la dihydrogranatirhodine. La grande majorité des molécules nouvelles de polykétides obtenues par génie génétique sont très différentes, du point de vue structural, des polykétides correspondants naturels. New artificial polyketides have thus been produced by genetic engineering, such as medronhodin A or dihydrogranatirhodine. The vast majority of new molecules of genetically engineered polyketides are structurally very different from the corresponding natural polyketides.

De l'état de la technique, il ressort ainsi qu'il existe un besoin d'obtention de nouveaux polykétides d'intérêt et tout particulièrement de polykétides d'intérêt thérapeutique présentant notamment, par rapport à leurs homologues naturels, un niveau accru d'activité antibiotique ou encore un spectre d'activité antibiotique différent, soit plus large que celui des polykétides connus, soit au contraire plus sélectif. From the state of the art, it thus emerges that there is a need to obtain new polyketides of interest and more particularly polyketides of therapeutic interest having in particular, compared with their natural counterparts, an increased level of antibiotic activity or a spectrum of different antibiotic activity, either wider than that of known polyketides, or rather more selective.

Ce besoin est, comme cela sera décrit ci-après, en partie comblé selon la présente invention. This need is, as will be described hereinafter, partly filled according to the present invention.

DESCRIPTION DE L'INVENTION
L'invention concerne tout d'abord un procédé pour la construction de banques d'ADN provenant d'un échantillon de l'environnement, un tel échantillon pouvant être indifféremment un milieu aquatique (eau douce ou marine), un échantillon de sol (couche superficielle du sol, sous-sol ou sédiments), ou encore un échantillon d'organismes eucaryotes contenant une microflore associée, tel que par exemple un échantillon provenant de plantes, d'insectes ou encore d'organismes marins et possédant une microflore associée. DESCRIPTION OF THE INVENTION
The invention firstly relates to a method for the construction of DNA libraries from a sample of the environment, such a sample can be indifferently an aquatic medium (freshwater or marine), a soil sample (layer superficial soil, subsoil or sediment), or a sample of eukaryotic organisms containing an associated microflora, such as for example a sample from plants, insects or marine organisms and having an associated microflora.

La mise au point d'un procédé de construction d'une banque d'ADN d'un échantillon de l'environnement, et tout particulièrement d'un échantillon de sol, comprend des étapes critiques dont la mise en oeuvre doit être nécessairement optimisée pour l'obtention d'une banque d'ADN The development of a method of constructing a DNA library of a sample of the environment, and particularly of a soil sample, comprises critical steps whose implementation must necessarily be optimized for obtaining a DNA bank

dont le contenu en acides nucléiques d'intérêt répond aux objectifs initialement fixés. whose content in nucleic acids of interest meets the objectives initially set.

Une première étape critique consiste en l'extraction et la purification ultérieure des acides nucléiques contenus initialement dans l'échantillon, c'est-à-dire principalement des acides nucléiques contenus dans les divers organismes composant la microflore de cet échantillon. A first critical step consists in the extraction and subsequent purification of the nucleic acids initially contained in the sample, that is to say mainly nucleic acids contained in the various organisms composing the microflora of this sample.

La qualité de la purification de l'ADN extrait est déterminante sur le résultat obtenu. The quality of purification of the extracted DNA is decisive on the result obtained.

Une seconde étape importante d'un procédé de construction d'une banque d'acides nucléiques provenant d'un échantillon de l'environnement est l'évaluation de la diversité génétique des acides nucléiques extraits et purifiés. La mise au point d'une étape de réalisation simple et fiable de pré-criblage de l'ADN extrait et purifié afin de vérifier qu'il rend compte, au moins partiellement, de la diversité phylogénétique des organismes présents initialement dans l'échantillon de départ, permet en effet de déterminer l'intérêt ou non d'utiliser la source initiale d'ADN extrait et purifié pour la construction de la banque d'acides nucléiques proprement dite ou au contraire de ne pas poursuivre la construction de la banque d'acides nucléiques du fait d'artefacts trop importants introduits au moment de l'extraction et de la purification des acides nucléiques. Il a en outre été identifié selon l'invention que la qualité des inserts introduits dans les vecteurs pour construire la banque est déterminante. Il a ainsi été déterminé que l'utilisation d'enzymes de restriction pour cliver l'ADN extrait et purifié à partir de l'échantillon de l'environnement était de nature à introduire des artefacts ou " biais " dans la structure des inserts obtenus. A second important step in a process for constructing a nucleic acid library from a sample of the environment is the evaluation of the genetic diversity of extracted and purified nucleic acids. The development of a simple and reliable step of pre-screening the extracted and purified DNA to verify that it accounts, at least partially, for the phylogenetic diversity of the organisms initially present in the sample. departure, makes it possible to determine the interest or not to use the initial source of DNA extracted and purified for the construction of the nucleic acid library proper or otherwise not to continue the construction of the bank of nucleic acids because of excessive artifacts introduced at the time of extraction and purification of nucleic acids. It has furthermore been identified according to the invention that the quality of the inserts introduced into the vectors to build the bank is decisive. It has thus been determined that the use of restriction enzymes to cleave the DNA extracted and purified from the sample of the environment was likely to introduce artifacts or "biases" into the structure of the inserts obtained.

En effet, l'ADN extrait du sol ou d'autres environnements, provenant en très grande majorité d'organismes non cultivables, est composé de molécules dont le taux de bases G et C est par définition inconnu et de plus variable en fonction de l'origine de ces organismes
Une troisième étape critique est l'insertion des acides nucléiques extraits et purifiés dans des vecteurs capables d'intégrer des acides nucléiques de longueur choisie, d'une part, et, d'autre part, d'en permettre la transfection ou encore l'intégration dans le génome dans des hôtes cellulaires déterminés ainsi que, le cas échéant, d'en permettre l'expression dans de tels hôtes cellulaires. Indeed, DNA extracted from soil or other environments, mostly from non-culturable organisms, is composed of molecules whose G and C bases are by definition unknown and more variable depending on the origin of these organisms
A third critical step is the insertion of extracted and purified nucleic acids into vectors capable of integrating nucleic acids of selected length, on the one hand, and, on the other hand, to allow transfection thereof or else integration into the genome in specific cellular hosts and, if appropriate, to allow expression in such cellular hosts.

Constituent des vecteurs d'intérêt, les vecteurs capables d'intégrer des acides nucléiques de grande taille, c'est-à-dire de taille supérieure à 100 kb lorsque l'objectif poursuivi consiste en un clonage et en une identification d'un opéron complet capable de diriger une voie complète de biosynthèse d'un composé d'intérêt industriel, en particulier d'un composé d'intérêt pharmaceutique ou agronomique. Vectors of interest are vectors capable of integrating nucleic acids of large size, that is to say larger than 100 kb when the objective pursued consists in cloning and in identifying an operon. A compound capable of directing a complete biosynthetic pathway of a compound of industrial interest, particularly a compound of pharmaceutical or agronomic interest.

DEFINITIONS
Au sens de la présente invention, on entend par " acides nucléiques ", " polynucléotides " et " oligonucléotides " aussi bien des séquences d'ADN, d'ARN, que des séquences hybrides ARN/ADN de plus de 2 nucléotides, indifféremment sous la forme simple brin ou double brin. DEFINITIONS
Within the meaning of the present invention, "nucleic acids", "polynucleotides" and "oligonucleotides" are understood to mean both DNA and RNA sequences, and RNA / DNA hybrid sequences of more than 2 nucleotides, regardless of whether single-stranded or double-stranded form.

Le terme " banque " ou " collection " est utilisé dans la présente description en référence indifféremment à un ensemble d'acides nucléiques extraits et, le cas échéant purifiés, provenant d'un échantillon de l'environnement, à un ensemble de vecteurs recombinants, chacun des vecteurs recombinants de l'ensemble comprenant un acide nucléique provenant de l'ensemble d'acides nucléiques extraits et, le cas échéant purifiés précités, ainsi qu'à un ensemble de cellules hôtes recombinantes comprenant un ou plusieurs acides nucléiques provenant de l'ensemble des acides nucléiques extraits et, le cas échéant, purifiés précités, lesdits acides nucléiques étant soient portés par un ou plusieurs vecteurs recombinants, soit intégrés dans le génome desdites cellules hôtes recombinantes. The term "library" or "collection" is used in the present description with reference equally to a set of nucleic acids extracted and, where appropriate purified, from a sample of the environment, to a set of recombinant vectors, each of the recombinant vectors of the set comprising a nucleic acid from the set of nucleic acids extracted and, where appropriate purified above, and to a set of recombinant host cells comprising one or more nucleic acids from the all nucleic acids extracted and, where appropriate, purified above, said nucleic acids being carried by one or more recombinant vectors, or integrated into the genome of said recombinant host cells.

On désigne par " échantillon de l'environnement " indifféremment un échantillon d'origine aquatique, par exemple d'eau douce ou saline, ou un échantillon tellurique provenant de la couche superficielle d'un sol, de sédiments ou encore de couches inférieures du sol (sous-sol), ainsi que des échantillons d'organismes eucaryotes, le cas échéant multicellulaires, d'origine végétale, provenant d'organismes marins ou encore d'insectes et possédant une microflore associée, cette microflore associée constituant des organismes d'intérêt. The term "environmental sample" refers to a sample of aquatic origin, for example of fresh or saline water, or a telluric sample originating from the superficial layer of a soil, sediments or even lower layers of the soil. (subsoil), as well as samples of eukaryotic organisms, possibly multicellular, of plant origin, from marine organisms or insects and having an associated microflora, this associated microflora constituting organisms of interest .

On entend par " opéron " selon l'invention, un ensemble de cadres ouverts de lecture dont la transcription et/ou la traduction est co- The term "operon" according to the invention means a set of open reading frames whose transcription and / or translation is co-

régulée par un ensemble unique de signaux de régulation de la transcription et/ou de la traduction. Selon l'invention, un opéron peut également comprendre lesdits signaux de régulation de la transcription et/ou de la traduction. regulated by a unique set of transcriptional and / or translational regulatory signals. According to the invention, an operon can also comprise said signals for regulating transcription and / or translation.

Par " voie métabolique " aux fins de l'invention ou encore " voie de biosynthèse " on entend un ensemble de réactions biochimiques anaboliques ou cataboliques réalisant la conversion d'une première espèce chimique en une seconde espèce chimique. By "metabolic route" for purposes of the invention or "biosynthetic pathway" is meant a set of anabolic or catabolic biochemical reactions effecting the conversion of a first chemical species into a second chemical species.

Par exemple, une voie de biosynthèse d'un antibiotique est constituée de l'ensemble des réactions biochimiques convertissant des métabolites primaires en produits intermédiaires des antibiotiques, puis subséquemment en antibiotiques. For example, an antibiotic biosynthesis pathway consists of all biochemical reactions converting primary metabolites into intermediates of antibiotics, and subsequently into antibiotics.

Par séquence de régulation placée " en phase " (en anglais operably linked) par rapport à une séquence nucléotidique dont l'expression est recherchée, on signifie que la ou les séquences de régulation de la transcription sont localisées, par rapport à la séquence nucléotidique d'intérêt dont l'expression est recherchée, de manière à permettre l'expression de ladite séquence d'intérêt, la régulation de la dite expression étant dépendante de facteurs interagissant avec les séquences nucléotidiques régulatrices. By regulatory sequence placed "in phase" (in English operably linked) with respect to a nucleotide sequence whose expression is sought, it is meant that the transcription regulation sequence (s) are localized, with respect to the nucleotide sequence of the nucleotide sequence. interest whose expression is sought, so as to allow the expression of said sequence of interest, the regulation of said expression being dependent on factors interacting with the regulatory nucleotide sequences.

Selon une autre terminologie, on peut dire également que la séquence nucléotidique d'intérêt dont l'expression est recherchée est placée " sous le contrôle " des séquences nucléotidiques régulatrices de la transcription. According to another terminology, it may also be said that the nucleotide sequence of interest whose expression is sought is placed "under the control" of the nucleotide transcription regulatory sequences.

Le terme " isolé " au sens de la présente invention désigne un matériel biologique qui a été soustrait à son environnement originel (l'environnement dans lequel il est localisé naturellement). The term "isolated" in the sense of the present invention refers to biological material that has been removed from its original environment (the environment in which it is naturally located).

Par exemple, un polynucléotide ou un polypeptide présent à l'état naturel dans un organisme (virus, bactérie, champignon, levure, plante ou animal) n'est pas isolé. Le même polypeptide séparé de son environnement naturel ou le même polynucléotide séparé des acides nucléiques adjacents au sein desquels il est naturellement inséré dans le génome de l'organisme, est isolé. For example, a polynucleotide or a polypeptide naturally occurring in an organism (virus, bacteria, fungus, yeast, plant or animal) is not isolated. The same polypeptide separated from its natural environment or the same polynucleotide separated from the adjacent nucleic acids in which it is naturally inserted into the genome of the organism, is isolated.

Un tel polynucléotide peut être inclus dans un vecteur et/ou un tel polynucléotide peut être inclus dans une composition et demeure Such a polynucleotide may be included in a vector and / or such a polynucleotide may be included in a composition and remains

néanmoins à l'état isolé, du fait que le vecteur ou la composition ne constitue pas son environnement naturel. nevertheless in the isolated state, because the vector or composition does not constitute its natural environment.

Le terme " purifié " ne nécessite pas que le matériel soit présent sous une forme de pureté absolue, exclusif de la présence d'autres composés. Il s'agit plutôt d'une définition relative. The term "purified" does not require that the material be present in a form of absolute purity, exclusive of the presence of other compounds. It is rather a relative definition.

Un polypeptide ou un polynucléotide est à l'état purifié après purification du matériel de départ d'au moins un ordre de grandeur, de préférence 2 ou 3 et préférentiellement 4 ou 5 ordres de grandeur. A polypeptide or a polynucleotide is in the purified state after purification of the starting material of at least one order of magnitude, preferably 2 or 3 and preferably 4 or 5 orders of magnitude.

Le " pourcentage d'identité " entre deux séquences de nucléotides ou d'acides aminés, au sens de la présente invention, peut être déterminé en comparant deux séquences alignées de manière optimale, à travers une fenêtre de comparaison. The "percent identity" between two nucleotide or amino acid sequences, within the meaning of the present invention, can be determined by comparing two optimally aligned sequences, through a comparison window.

La partie de la séquence nucléotidique ou polypeptide dans la fenêtre de comparaison peut ainsi comprendre des additions ou des délétions (par exemple des " gaps ") par rapport à la séquence de référence (qui ne comprend pas ces additions ou ces délétions) de manière à obtenir un alignement optimal des deux séquences. The part of the nucleotide sequence or polypeptide in the comparison window may thus comprise additions or deletions (for example "gaps") with respect to the reference sequence (which does not include these additions or deletions) so as to obtain an optimal alignment of the two sequences.

Le pourcentage est calculé en déterminant le nombre de positions auquel une base nucléique ou un résidu d'aminoacide identique est observé pour les deux séquences (nucléique ou peptidique) comparées, puis en divisant le nombre de positions auquel il y a identité entre les deux bases ou résidus d'aminoacides par le nombre total de positions dans la fenêtre de comparaison, puis en multipliant le résultat par 100 afin d'obtenir le pourcentage d'identité de séquence. The percentage is calculated by determining the number of positions at which an identical nucleobase or amino acid residue is observed for the two (nucleic or peptide) sequences compared and then dividing the number of positions at which there is identity between the two bases. or amino acid residues by the total number of positions in the comparison window, then multiplying the result by 100 to obtain the percentage of sequence identity.

L'alignement optimal des séquences pour la comparaison peut être réalisé de manière informatique à l'aide d'algorithmes connus contenus dans le package de la Société WISCONSIN GENETICS SOFTWARE PACKAGE, GENETICS COMPUTER GROUP (GCG), 575 Science Doctor, Madison, WISCONSIN. The optimal alignment of the sequences for the comparison can be performed in a computer manner using known algorithms contained in the package of the company WISCONSIN GENETICS SOFTWARE PACKAGE, GENETICS COMPUTER GROUP (GCG), 575 Science Doctor, Madison, WISCONSIN.

A titre d'illustration, le pourcentage d'identité de séquence pourra être effectué à l'aide du logiciel BLAST (versions BLAST 1. 4.9 de Mars 1996, BLAST 2. 0.4. de Février 1998 et BLAST 2. 0.6. de Septembre 1998), en utilisant exclusivement les paramètres par défaut (S.F. Altschul et al., J. Mol. Biol. 1990 215: 403-410, S. F. Altschul et al., Nucleic Acids Res. 1997 25 : 3389-3402).Blast recherche des séquences similaires/homologues à une séquence " requête " de référence, à l'aide By way of illustration, the percentage of sequence identity can be carried out using the BLAST software (BLAST versions 1. 4.9 of March 1996, BLAST 2. 0.4 of February 1998 and BLAST 2. 0.6 of September 1998. ), using only the default parameters (SF Altschul et al., J. Mol Biol., 1990 215: 403-410, SF Altschul et al., Nucleic Acids Res 1997 25: 3389-3402) .Blast searches for similar sequences / homologues to a reference "query" sequence, using

de l'algorithme d'Altschul et al. La séquence requête et les bases de données utilisées peuvent être peptidiques ou nucléiques, toute combinaison étant possible. Altschul et al. The query sequence and the databases used may be peptide or nucleic, any combination being possible.

EXTRACTION ET PURIFICATION D'ACIDES NUCLEIQUES PROVENANT D'UN ECHANTILLON DE L'ENVIRONNEMENT. EXTRACTION AND PURIFICATION OF NUCLEIC ACIDS FROM AN ENVIRONMENTAL SAMPLE.

1. Extraction directe d'acides nucléiques
Il a été montré selon la présente invention que, pour l'obtention d'une banque d'acides nucléiques provenant d'organismes contenus dans un échantillon du sol, il était important de créer des conditions dans lesquelles, d'une part, les différents organismes de l'échantillon sont rendus accessibles aux étapes ultérieures d'extraction des acides nucléiques et, d'autre part, que l'étape initiale de traitement de l'échantillon de sol permette une lyse mécanique maximale des organismes de l'échantillon de nature à rendre directement accessibles les acides nucléiques de ces organismes, principalement l'ADN génomique et plasmidique, aux tampons utilisés pour les étapes ultérieures d'extraction. 1. Direct extraction of nucleic acids
It has been shown according to the present invention that, in order to obtain a nucleic acid library from organisms contained in a soil sample, it was important to create conditions in which, on the one hand, the different organisms in the sample are made accessible for subsequent nucleic acid extraction steps and that the initial stage of soil sample processing allows maximum mechanical lysis of the organisms in the sample of nature to make the nucleic acids of these organisms, mainly genomic and plasmid DNA, directly accessible to the buffers used for the subsequent extraction steps.

Il a été ainsi démontré selon l'invention qu'une accessibilité maximale des acides nucléiques provenant des micro-organismes d'un échantillon du sol était atteinte par un broyage poussé et à sec de l'échantillon de sol préalablement séché afin d'obtenir des microparticules. Le demandeur a ainsi déterminé que le séchage de l'échantillon de sol préalable à tout traitement ultérieur provoque une diminution significative de la cohésion de l'échantillon de sol brut et favorise en conséquence sa désagrégation ultérieure sous la forme de micro-particules, lorsqu'un traitement par broyage approprié est opéré. It has thus been demonstrated according to the invention that a maximum accessibility of the nucleic acids originating from the microorganisms of a soil sample is reached by a thorough and dry grinding of the previously dried soil sample in order to obtain microparticles. The Applicant has thus determined that the drying of the soil sample prior to any subsequent treatment causes a significant decrease in the cohesion of the raw soil sample and consequently favors its subsequent disintegration in the form of microparticles, when a suitable milling treatment is operated.

De manière surprenante, le demandeur a montré que des microparticules d'échantillons de sol sec réunissaient des propriétés physicochimiques favorables à l'extraction d'une quantité optimale d'acides nucléiques qui, dans leur nature, pouvaient être représentatifs de la diversité génétique des organismes présents initialement dans l'échantillon de sol de départ. Il a été montré en particulier que le procédé d'extraction directe d'acides nucléiques selon l'invention permettait Surprisingly, the applicant has shown that microparticles of dry soil samples have physicochemical properties favorable to the extraction of an optimal quantity of nucleic acids which, in their nature, could be representative of the genetic diversity of the organisms. initially present in the starting soil sample. It has been shown in particular that the method of direct extraction of nucleic acids according to the invention

l'extraction d'ADN provenant de micro-organismes rares, tels certains Streptomyces rares ou des micro-organismes sporulés. the extraction of DNA from rare microorganisms, such as some rare Streptomyces or sporulated microorganisms.

Par " micro-particules " de l'échantillon de sol aux fins de la présente invention, on entend des particules dérivées de l'échantillon ayant une taille moyenne d'environ 50 m, c'est à dire comprise en moyenne entre 45 et 55 m/. By "microparticles" of the soil sample for the purpose of the present invention is meant particles derived from the sample having an average size of about 50 m, that is to say between 45 and 55 on average. m /.

Selon l'invention, les micro-particules sont obtenues à partir d'échantillons de sol préalablement séchés ou dessiqués puis broyés jusqu'à l'obtention de micro-particules de taille moyenne comprise entre 2 m et 50 m, avant remise en suspension dans un milieu tampon liquide des micro-particules obtenus. According to the invention, the microparticles are obtained from previously dried or desiccated soil samples then crushed to obtain micro-particles of average size between 2 m and 50 m, before resuspension in a liquid buffer medium of the microparticles obtained.

Un tel milieu tampon liquide peut consister en un tampon d'extraction d'acides nucléiques, en particulier un tampon d'extraction d'ADN conventionnel bien connu de l'homme du métier. Such a liquid buffer medium may consist of a nucleic acid extraction buffer, in particular a conventional DNA extraction buffer well known to those skilled in the art.

Le broyage de l'échantillon de sol en micro-particules a pour double fonction de lyser mécaniquement la majorité des organismes présents dans l'échantillon de sol initial et de rendre accessibles les organismes non lysés par ce traitement mécanique à des étapes facultatives ultérieures de lyse chimique et/ou enzymatique. Grinding of the soil sample into microparticles has the dual function of mechanically lysing most of the organisms present in the initial soil sample and making non-lysed organisms accessible by this mechanical treatment to optional later stages of lysis. chemical and / or enzymatic.

Ainsi, un premier objet de l'invention consiste en un procédé de préparation d'une collection d'acides nucléiques à partir d'un échantillon de sol contenant des organismes, ledit procédé comprenant une première étape (I-(a)) d'obtention de micro-particules par broyage de l'échantillon de sol préalablement séché ou dessiqué, puis mise en suspension des micro-particules dans un milieu tampon liquide. Thus, a first object of the invention is a method of preparing a nucleic acid collection from a soil sample containing organisms, said method comprising a first step (I- (a)) of obtaining microparticles by grinding the previously dried or desiccated soil sample, and then suspending the microparticles in a liquid buffer medium.

De manière tout à fait préférée, l'étape de broyage est réalisée à l'aide d'un dispositif à billes d'agate ou de tungstène ou encore à l'aide d'un dispositif à anneaux de tungstène. Ces dispositifs sont préférés car la dureté de matériaux comme l'agate ou le tungstène facilite significativement l'obtention des micro-particules de la taille spécifiée cidessus. Pour cette raison, on ne choisira pas préférentiellement, voire on évitera, un recours à un dispositif de broyage à billes de verre, qui s'est révélé beaucoup moins efficace. Most preferably, the grinding step is carried out using an agate ball or tungsten device or using a tungsten ring device. These devices are preferred because the hardness of materials such as agate or tungsten significantly facilitates obtaining microparticles of the size specified above. For this reason, it will not be preferable, or even avoided, recourse to a glass ball grinding device, which has proved much less effective.

Le séchage ou la classification de l'échantillon de sol peut-être réalisée par toute méthode connue de l'homme du métier. Par exemple, Drying or classification of the soil sample can be carried out by any method known to those skilled in the art. For example,

l'échantillon de sol brut peut être séché à température ambiante pendant une durée de 24 à 48 heures. the raw soil sample can be dried at room temperature for a period of 24 to 48 hours.

Comme indiqué précédemment, le milieu tampon liquide peut consister en un milieu d'extraction de l'ADN présent dans les microparticules. On utilisera de manière tout à fait préférée un tampon d'extraction désigné TENP contenant respectivement 50 mM tris, 20 mM EDTA, 100 mM NaCI et 1% (poids/volume) de polyvinylpolypyrrolidone, à pH 9,0. As indicated above, the liquid buffer medium may consist of an extraction medium for the DNA present in the microparticles. An extraction buffer designated TENP, containing 50 mM tris, 20 mM EDTA, 100 mM NaCl and 1% (w / v) polyvinylpolypyrrolidone, at pH 9.0, will be used most preferably.

Le procédé de préparation d'une collection d'acides nucléiques à partir d'un échantillon de sol est en outre caractérisé en ce que l'étape d'obtention de micro-particules par broyage de l'échantillon de sol préalablement séché ou dessiqué est suivie d'une étape I-(b) d'extraction des acides nucléiques présents dans les micro-particules. The process for preparing a nucleic acid collection from a soil sample is further characterized in that the step of obtaining microparticles by grinding the previously dried or desiccated soil sample is followed by a step I- (b) of extraction of the nucleic acids present in the microparticles.

Il est constant que l'extraction des acides nucléiques est accompagnée d'une co-extraction de composés et/ou de constituants du sol indésirables nécessitant la purification ultérieure des acides nucléiques extraits, une telle étape de purification ultérieure devant être à la fois suffisamment sélective pour permettre l'élimination des composés et/ou constituants du sol indésirables et d'un rendement suffisant pour entraîner une perte faible en quantité de l'ADN préalablement extrait. It is common ground that nucleic acid extraction is accompanied by co-extraction of undesirable compounds and / or soil constituents necessitating subsequent purification of the extracted nucleic acids, such a subsequent purification step having to be both selective enough. to allow the removal of undesirable compounds and / or soil components and a yield sufficient to cause a small loss in the amount of DNA previously extracted.

Il a été montré selon l'invention qu'une étape de purification de l'ADN extrait des micro-particules de l'échantillon de sol répondant aux critères de sélectivité et de rendement définis ci-dessus, comprend un traitement de l'ADN extrait par une combinaison de deux étapes successives de chromatographie, respectivement une chromatographie sur tamis moléculaire et une chromatographie d'échange d'anions. It has been shown according to the invention that a purification step of the DNA extracted from the microparticles of the soil sample meeting the selectivity and yield criteria defined above, comprises a treatment of the extracted DNA by a combination of two successive chromatography steps, respectively molecular sieve chromatography and anion exchange chromatography.

Selon une autre caractéristique du procédé ci-dessus, l'étape 1- (b) d'extraction des acides nucléiques est suivie d'une étape I-(c) de purification des acides nucléiques extraits à l'aide des deux étapes de chromatographie suivantes: - passage de la solution contenant les acides nucléiques sur un tamis moléculaire, puis récupération des fractions d'élution enrichies en acides nucléiques; According to another characteristic of the above process, the nucleic acid extraction step 1- (b) is followed by a step I- (c) of purification of the nucleic acids extracted using the two chromatography steps. following: - passage of the solution containing the nucleic acids on a molecular sieve, then recovery of elution fractions enriched in nucleic acids;

- passage des fractions d'élution enrichies en acides nucléiques sur un support de chromatographie d'échange d'anions, puis récupération des fractions d'élution contenant les acides nucléiques. passage of elution fractions enriched in nucleic acids on an anion exchange chromatography support, then recovery of the elution fractions containing the nucleic acids.

La nature et l'ordre des étapes de chromatographie ci-dessus sont essentiels à une bonne sélectivité et un excellent rendement de l'étape de purification de l'ADN préalablement extrait des micro-particules de l'échantillon du sol préalablement séché ou dessiqué. The nature and order of the chromatography steps above are essential to good selectivity and excellent yield of the DNA purification step previously extracted from the microparticles of the previously dried or desiccated soil sample.

De manière très avantageuse, le support chromatographique du type " tamis moléculaire " de l'étape de purification d'acides nucléiques cidessus consiste en un support chromatographique de type Sephacryl# S400 HR ou un support chromatographique de caractéristiques équivalentes. Very advantageously, the chromatographic support of the "molecular sieve" type of the nucleic acid purification step above consists of a Sephacryl # S400 HR type chromatographic support or a chromatographic support of equivalent characteristics.

De manière tout à fait préférée, le support chromatographique d'échange d'anions utilisé lors de la seconde étape de purification de l'ADN extrait est un support de type Elutip d, ou un support chromatographique de caractéristiques équivalentes. Most preferably, the anion exchange chromatographic support used in the second step of purification of the extracted DNA is a support of the type Elutip d, or a chromatographic support of equivalent characteristics.

En combinant les étapes I-(a) d'obtention de micro-particules de l'échantillon de sol sec, I-(b) d'extraction des acides nucléiques présents dans les micro-particules et I-(c) de purification par les étapes chromatographiques décrites ci-dessus, il a été possible selon l'invention d'extraire directement l'ADN du sol sans purification préalable des cellules des organismes contenus initialement dans l'échantillon, tout en évitant la co-extraction de contaminants du sol, tels que par exemple les acides humiques qui est observée avec les procédés de l'état de la technique. By combining steps I- (a) of obtaining microparticles from the dry soil sample, I- (b) extraction of the nucleic acids present in the microparticles and I- (c) purification by the chromatographic steps described above, it was possible according to the invention to directly extract the DNA from the soil without prior purification of the cells of the organisms initially contained in the sample, while avoiding the co-extraction of soil contaminants such as for example humic acids which is observed with the methods of the state of the art.

Les contaminants, tels que les acides humiques affectent sévèrement les analyses et les utilisations subséquentes des acides nucléiques dont la purification est recherchée. Contaminants, such as humic acids, severely affect the subsequent analyzes and uses of the nucleic acids whose purification is sought.

Selon le procédé ci-dessus, il est en outre possible d'accéder aux acides nucléiques contenus dans les organismes qui n'ont pas été lysés mécaniquement au cours de l'étape I-(a) d'obtention de microparticules de l'échantillon de sol, dans le but d'obtenir une collection quasi-exhaustive de la diversité génétique des acides nucléiques présents initialement dans l'échantillon de sol. Ainsi, les micro-particules de l'échantillon de sol peuvent faire l'objet d'étapes ultérieures de traitement de lyse chimique, enzymatique ou physique, ou encore d'une combinaison de traitements chimiques, enzymatiques ou physiques. According to the above method, it is furthermore possible to access the nucleic acids contained in the organisms which have not been mechanically lysed during step I- (a) of obtaining microparticles of the sample of the soil, with the aim of obtaining an almost exhaustive collection of the genetic diversity of the nucleic acids initially present in the soil sample. Thus, the microparticles of the soil sample may be the subject of subsequent chemical, enzymatic or physical lysis treatment steps, or a combination of chemical, enzymatic or physical treatments.

Selon un premier aspect, le procédé de préparation d'une collection d'acides nucléiques à partir d'un échantillon de sol selon l'invention, peut être en outre caractérisé en ce que l'étape I-(a) est suivie des étapes suivantes: # traitement de la suspension de sol dans un milieu tampon liquide par sonication; # extraction et récupération des acides nucléiques. According to a first aspect, the method for preparing a nucleic acid collection from a soil sample according to the invention may be further characterized in that step I- (a) is followed by steps following: # treatment of the soil suspension in a liquid buffer medium by sonication; # extraction and recovery of nucleic acids.

De manière préférée, on aura recours, pour un traitement par sonication, à un dispositif de type à micro-pointe en titane, tel que le dispositif 600 W Vibracell Ultrasonicator commercialisé par la Société Bioblock ou encore un sonicateur de type Cup Horn. Preferably, for a sonication treatment, a titanium microtip type device, such as the 600 W Vibracell Ultrasonicator device marketed by the Bioblock Company or a Cup Horn type of sonicator, will be used.

De manière tout à fait préférée, l'étape de sonication est réalisée à une puissance de 15 W pendant une durée de 7 à 10 min et comprend des cycles successifs de sonication, la sonication proprement dite étant réalisée pendant 50% de la durée de chaque cycle. Most preferably, the sonication step is carried out at a power of 15 W for a period of 7 to 10 min and comprises successive cycles of sonication, the actual sonication being performed for 50% of the duration of each cycle.

Selon un second aspect, le procédé ci-dessus peut être en outre caractérisé en ce que l'étape I-(a) est suivie des étapes suivantes: # traitement de la suspension de sol dans un milieu tampon liquide par sonication; # incubation de la suspension à 37 C après sonication en présence de lysozyme et d'achromopeptidase; # addition de SDS avant centrifugation et précipitation des acides nucléiques; # récupération des acides nucléiques précipités. According to a second aspect, the above method can be further characterized in that step I- (a) is followed by the following steps: # treatment of the soil suspension in a liquid buffer medium by sonication; incubation of the suspension at 37 ° C. after sonication in the presence of lysozyme and of achromopeptidase; # addition of SDS before centrifugation and precipitation of nucleic acids; # recovery of precipitated nucleic acids.

De préférence, l'étape d'incubation en présence de lysozyme et d'achromopeptidase sera réalisée à une concentration finale de 0,3 mg/ml de chacune des deux enzymes, préférentiellement pendant 30 minutes à 37 C. Preferably, the incubation step in the presence of lysozyme and achromopeptidase will be carried out at a final concentration of 0.3 mg / ml of each of the two enzymes, preferably for 30 minutes at 37 ° C.

De manière préférée, le SDS sera utilisé à une concentration finale de 1% et pendant un temps d'incubation de 1 heure à la température de 60 C avant centrifugation et précipitation. Preferably, the SDS will be used at a final concentration of 1% and for an incubation time of 1 hour at a temperature of 60 ° C. before centrifugation and precipitation.

Selon un troisième aspect, le procédé de préparation d'une collection d'acides nucléiques à partir d'un échantillon de sol ci-dessus est en outre caractérisé en ce que l'étape I-(a) est suivie des étapes suivantes : - homogénéisation de la suspension de sol avec une étape de mixage violent (vortex) suivie d'une étape de simple agitation; - congélation de la suspension homogène suivie d'une décongélation ; - traitement par sonication de la suspension après décongélation; - incubation de la suspension à 37 C après sonication en présence de lysozyme et d'achromopeptidase; - addition de SDS avant centrifugation et précipitation des acides nucléiques; - récupération des acides nucléiques. According to a third aspect, the method for preparing a nucleic acid collection from a soil sample above is further characterized in that step I- (a) is followed by the following steps: homogenization of the soil suspension with a violent mixing step (vortex) followed by a simple stirring step; freezing of the homogeneous suspension followed by thawing; treatment by sonication of the suspension after thawing; incubation of the suspension at 37 ° C. after sonication in the presence of lysozyme and of achromopeptidase; addition of SDS before centrifugation and precipitation of the nucleic acids; - recovery of nucleic acids.

De manière préférée, les suspensions de micro-particules de sol sont passées au vortex puis homogénéisées par une agitation douce sur un agitateur à rotation circulaire pendant une durée de deux heures avant d'être congelées à -20 C. Preferably, the suspensions of soil microparticles are vortexed and then homogenized by gentle stirring on a circular rotation stirrer for a period of two hours before being frozen at -20.degree.

Préférentiellement, les suspensions sont à nouveau agitées violemment par vortex pendant 10 minutes, après décongélation et avant l'étape de sonication. Preferably, the suspensions are again violently vortexed for 10 minutes, after thawing and before the sonication step.

Il va sans dire que les acides nucléiques extraits par les modes de réalisation du procédé d'extraction directe d'acides nucléiques décrit cidessus sont préférentiellement purifiés selon l'étape de purification constituée d'un premier passage sur tamis moléculaire puis un passage subséquent des fractions d'élution obtenues à l'issue de la chromatographie sur tamis moléculaire sur un support chromatographique d'échange d'anions. It goes without saying that the nucleic acids extracted by the embodiments of the direct nucleic acid extraction method described above are preferentially purified according to the purification step consisting of a first pass on molecular sieve and then a subsequent passage of the fractions. of elution obtained after the chromatography on molecular sieve on an anion exchange chromatographic support.

2. Extraction indirecte des acides nucléiques 2. Indirect extraction of nucleic acids

Selon un second mode de réalisation du procédé de préparation d'une collection d'acides nucléiques à partir d'un échantillon de l'environnement, selon l'invention, ledit échantillon de l'environnement subit un premier traitement de nature à permettre la séparation des organismes, contenus dans cet échantillon, des autres macro-constituants de l'échantillon. According to a second embodiment of the method for preparing a nucleic acid collection from a sample of the environment, according to the invention, said sample of the environment undergoes a first treatment of such a nature as to allow separation. organisms, contained in this sample, of the other macro-constituents of the sample.

Ce second mode de réalisation du procédé de préparation d'une collection d'acides nucléiques selon l'invention favorise l'obtention d'acides nucléiques de grande taille, qui sont pratiquement impossibles à obtenir selon le premier mode de réalisation du procédé selon l'invention décrit ci-dessus, l'étape de lyse mécanique opérée pour l'obtention des micro-particules ayant également pour effet de casser physiquement les acides nucléiques de l'échantillon de sol ou des acides nucléiques contenus dans les organismes de l'échantillon de sol. This second embodiment of the process for preparing a nucleic acid collection according to the invention promotes the production of nucleic acids of large size, which are practically impossible to obtain according to the first embodiment of the method according to the invention. invention described above, the step of mechanical lysis performed to obtain microparticles also having the effect of physically breaking the nucleic acids of the soil sample or nucleic acids contained in the organisms of the sample of ground.

L'obtention d'acides nucléiques de grande taille a été recherchée par le demandeur dans le but d'isoler et de caractériser les acides nucléiques comprenant, au moins partiellement, l'ensemble des séquences codantes appartenant à un même opéron capable de diriger la biosynthèse d'un composé d'intérêt industriel. The obtaining of large nucleic acids has been sought by the applicant in order to isolate and characterize the nucleic acids comprising, at least partially, all of the coding sequences belonging to the same operon capable of directing the biosynthesis of a compound of industrial interest.

De manière préférée, on obtient, en mettant en oeuvre le second mode de réalisation du procédé de préparation d'une collection d'acides nucléiques à partir d'un échantillon de sol selon l'invention , des acides nucléiques ayant une taille supérieure à 100 kb, de préférence supérieure à 200,250 ou 300 kb, et de manière tout à fait préférée d'acides nucléiques d'une taille supérieure à 400,500 ou encore 600 kb. Preferably, using the second embodiment of the process for preparing a nucleic acid collection from a soil sample according to the invention, nucleic acids having a size greater than 100 are obtained. kb, preferably greater than 200,250 or 300 kb, and most preferably nucleic acids larger than 400,500 or even 600 kb.

Ce second mode de réalisation d'un procédé de préparation d'une collection d'acides nucléiques à partir d'un échantillon de l'environnement selon l'invention est constitué d'une combinaison de quatre étapes successives destinées à l'obtention des acides nucléiques ayant les caractéristiques décrites ci-dessus. This second embodiment of a method for preparing a nucleic acid collection from a sample of the environment according to the invention consists of a combination of four successive stages intended to obtain the acids nucleic acids having the characteristics described above.

Lorsque l'échantillon de l'environnement est un échantillon de sol, il a été montré selon l'invention qu'une première étape d'obtention d'une suspension par dispersion de l'échantillon de sol en milieu liquide favorisait l'accessibilité des organismes contenus dans l'échantillon sans provoquer de lyse mécanique significative des cellules. When the sample of the environment is a soil sample, it has been shown according to the invention that a first step of obtaining a suspension by dispersion of the soil sample in a liquid medium facilitates the accessibility of the samples. organisms contained in the sample without causing significant mechanical lysis of the cells.

La première étape d'obtention d'une dispersion de l'échantillon de sol ci-dessus rend accessibles les organismes de l'échantillon au milieu extérieur et permet également une dissociation partielle des organismes de l'échantillon et des macro-constituants. Elle rend ainsi possible une séparation ultérieure des organismes contenus initialement dans l'échantillon des autres constituants de ce dernier. The first step of obtaining a dispersion of the above soil sample makes the organisms in the sample accessible to the external environment and also allows partial dissociation of the sample organisms and the macro-constituents. It thus makes possible a subsequent separation of the organisms initially contained in the sample of the other constituents of the latter.

Lorsque l'échantillon de l'environnement provient par exemple de végétaux, d'organismes marins ou d'insectes, un traitement préalable par broyage est nécessaire afin de rendre les organismes de la microflore associée accessible aux étapes ultérieures du procédé. When the environmental sample comes for example from plants, marine organisms or insects, a pre-treatment by grinding is necessary in order to make the organisms of the associated microflora accessible at the later stages of the process.

Ainsi, le présent procédé comprend une étape de séparation des organismes des autres constituants minéraux et/ou organiques obtenus précédemment par une centrifugation sur un gradient de densité. Les organismes ainsi séparés sont ensuite soumis à une étape de lyse puis d'extraction des acides nucléiques . Thus, the present process comprises a step of separating the organisms from the other inorganic and / or organic constituents obtained previously by centrifugation on a density gradient. The organisms thus separated are then subjected to a step of lysis and extraction of the nucleic acids.

L'étape de centrifugation sur un gradient de densité a, de manière surprenante, permis de séparer les cellules d'organismes des particules de sol contenues dans la suspension de l'échantillon. On aurait en effet pu s'attendre à ce qu'une proportion des cellules soient entraînées avec les macro-particules au sein de la phase de gradient. En outre, il n'avait jamais été démontré jusqu'à présent qu'une centrifugation sur gradient de densité d'un échantillon de sol permettait de retrouver, à l'interface phase aqueuse/gradient, une population d'organismes représentative de la diversité des organismes présents dans l'échantillon de départ, du fait que ces organismes sont de volume, densité et forme extrêmement variables. On pouvait raisonnablement supposer qu'ils seraient retrouvés indifféremment au sein de la phase aqueuse, à l'interface phase aqueuse/gradient de densité et également au sein du gradient de densité lui-même. The step of centrifugation on a density gradient has, surprisingly, allowed the cells of organisms to be separated from the soil particles contained in the suspension of the sample. It would indeed be expected that a proportion of the cells are driven with the macro-particles within the gradient phase. In addition, it has never before been demonstrated that a density gradient centrifugation of a soil sample allows the discovery at the aqueous phase / gradient interface of a representative population of organisms. organisms present in the starting sample, because these organisms are of extremely variable volume, density and shape. It was reasonable to assume that they would be found indifferently within the aqueous phase, at the aqueous phase / density gradient interface and also within the density gradient itself.

Ainsi, l'homme du métier pouvait s'attendre à ce que des organismes présentant des densités plus faibles ou plus grandes que la densité du gradient de densité utilisé (densité du gradient de densité comprise entre 1,2 et 1,5 g/ml , préférentiellement 1,3 g/ml) ne pouvait être récupérés, ce qui aurait eu pour effet d'introduire un biais dans la représentativité des organismes effectivement séparés et, par voie de conséquence, également dans la diversité des acides nucléiques extraits. Thus, one skilled in the art could expect organisms with densities lower or greater than the density density density used (density gradient density between 1.2 and 1.5 g / ml). preferentially 1.3 g / ml) could not be recovered, which would have had the effect of introducing a bias in the representativity of the organisms actually separated and, consequently, also in the diversity of the nucleic acids extracted.

En outre, dans un mode de réalisation particulier du procédé, une étape de germination des spores, en particulier d'actinomycètes, est réalisée, ce qui a pour effet d'accroître de manière significative la quantité d'ADN d'actinomycètes récupérée. In addition, in a particular embodiment of the method, a step of germinating the spores, in particular actinomycetes, is performed, which has the effect of significantly increasing the amount of actinomycetes DNA recovered.

La dernière étape consiste en une étape de purification des acides nucléiques ainsi extraits sur un gradient de chlorure de césium. The last step consists of a purification step of the nucleic acids thus extracted on a cesium chloride gradient.

De manière surprenante, la purification des acides nucléiques sur le gradient de chlorure de césium permet une élimination substantielle, voire complète, des substances composant le gradient de densité. Cette caractéristique est déterminante en ce qui concerne l'utilisation ultérieure des acides nucléiques purifiés car le gradient de densité est connu comme un puissant inhibiteur enzymatique, capable le cas échéant d'inhiber l'activité catalytique des enzymes utilisées pour préparer l'insertion des acides nucléiques extraits dans des vecteurs. Surprisingly, the purification of the nucleic acids on the cesium chloride gradient allows a substantial or complete elimination of the substances composing the density gradient. This characteristic is decisive with regard to the subsequent use of the purified nucleic acids because the density gradient is known as a potent enzyme inhibitor, capable of inhibiting the catalytic activity of the enzymes used to prepare the insertion of the acids, where appropriate. nucleic extracts in vectors.

Selon ce second mode de réalisation, le procédé de préparation d'une collection d'acides nucléiques à partir d'un échantillon de l'environnement contenant des organismes selon l'invention comprend la succession d'étapes suivantes: (i) obtention d'une suspension par dispersion de l'échantillon de l'environnement en milieu liquide puis homogénéisation de la suspension obtenue par agitation douce; (ii) séparation des organismes des autres constituants minéraux et/ou organiques de la suspension homogène obtenue à l'étape (i) par centrifugation sur un gradient de densité; (iii) lyse des microorganismes séparés à l'étape (ii) et extraction des acides nucléiques ; (iv) purification des acides nucléiques sur un gradient de chlorure de césium . According to this second embodiment, the method for preparing a nucleic acid collection from a sample of the environment containing organisms according to the invention comprises the following succession of steps: (i) obtaining a suspension by dispersion of the sample of the environment in a liquid medium and then homogenization of the suspension obtained by gentle stirring; (ii) separating the organisms from the other inorganic and / or organic constituents of the homogeneous suspension obtained in step (i) by centrifugation on a density gradient; (iii) lysing the separated microorganisms in step (ii) and extracting the nucleic acids; (iv) purification of nucleic acids on a cesium chloride gradient.

Préférentiellement, la suspension de l'échantillon de sol est obtenue par dispersion de cet échantillon par broyage à l'aide d'un dispositif de type Waring Blender ou un dispositif de caractéristiques Preferably, the suspension of the soil sample is obtained by dispersing this sample by grinding with the aid of a Waring Blender type device or a device of characteristics

équivalentes. De manière tout à fait préférée, la suspension d'échantillon est obtenue après trois broyages successifs d'une durée d'une minute chacun dans un dispositif de type Waring Blender. De préférence, l'échantillon broyé sera refroidi dans la glace entre chacun des broyages. equivalent. Most preferably, the sample suspension is obtained after three successive grindings lasting one minute each in a Waring Blender type device. Preferably, the milled sample will be cooled in ice between each of the grindings.

De manière préférée, les organismes sont ensuite séparés des particules du sol par centrifugation sur un coussin de densité du type " Nycodenz ", commercialisé par la Société Nycomed Pharma AS. (Oslo , Norvège). Les conditions préférées de centrifugation sont de 10.000g pendant 40 minutes à 4 C, avantageusement dans un rotor à godets mobiles du type " rotor TST 28.38 " commercialisé par la Société KONTRON. Preferably, the organisms are then separated from the soil particles by centrifugation on a "Nycodenz" type density cushion marketed by the company Nycomed Pharma AS. (Oslo, Norway). The preferred conditions for centrifugation are 10,000 g for 40 minutes at 4 ° C., advantageously in a rotor with moving cups of the "rotor TST 28.38" type marketed by KONTRON.

L'anneau d'organismes localisé, après centrifugation, à l'interphase de la phase supérieure aqueuse et de la phase inférieure de Nycodenz est alors prélevé et lavé par centrifugation avant reprise du culot cellulaire dans un tampon approprié. The ring of organisms located, after centrifugation, at the interphase of the aqueous upper phase and the lower phase of Nycodenz is then removed and washed by centrifugation before taking up the cell pellet in an appropriate buffer.

L'étape (iii) de lyse des organismes séparés à l'étape (ii) décrite ci-dessus peut être réalisée de toute manière connue de l'homme du métier. The step (iii) of lysing the organisms separated in step (ii) described above can be carried out in any manner known to those skilled in the art.

Avantageusement, les cellules sont lysées dans une solution Tris 10 mM-EDTA 100mM à pH 8. 0 en présence de lysozyme et d'achromopeptidase, avantageusement pendant une heure à 37 C. Advantageously, the cells are lysed in a 100 mM Tris 10 mM-EDTA solution at pH 8. 0 in the presence of lysozyme and of achromopeptidase, advantageously for one hour at 37 ° C.

L'extraction proprement dite de l'ADN peut être avantageusement réalisée par addition d'une solution de lauryl sarcosyl (1% du poids final de la solution) en présence de protéinase K et incubation de la solution finale à 37 C pendant 30 minutes. The actual extraction of the DNA can be advantageously carried out by adding a solution of lauryl sarcosyl (1% of the final weight of the solution) in the presence of proteinase K and incubation of the final solution at 37 ° C. for 30 minutes.

Les acides nucléiques extraits à l'étape (iii) sont ensuite purifiés sur un gradient de chlorure de césium. Préférentiellement, l'étape de purification des acides nucléiques sur un gradient de chlorure de césium est réalisée par centrifugation à 35.000 tours/minute pendant 36 heures, par exemple sur un rotor du type Kontron 65.13. The nucleic acids extracted in step (iii) are then purified on a cesium chloride gradient. Preferably, the step of purifying the nucleic acids on a cesium chloride gradient is carried out by centrifugation at 35,000 rpm for 36 hours, for example on a Kontron type rotor 65.13.

Selon un aspect particulier du procédé de préparation d'une collection d'acides nucléiques à partir d'un échantillon de sol contenant des organismes selon l'invention, lesdits acides nucléiques sont constitués majoritairement, sinon exclusivement, de molécules d'ADN. According to a particular aspect of the process for preparing a nucleic acid collection from a soil sample containing organisms according to the invention, said nucleic acids consist mainly, if not exclusively, of DNA molecules.

Selon un autre aspect, les acides nucléiques peuvent être récupérés après inclusion des organismes, séparés sur un gradient de In another aspect, the nucleic acids can be recovered after inclusion of the organisms, separated on a gradient of

densité, dans un bloc d'agarose et lyse, par exemple chimique et/ou enzymatique, des organismes inclus dans le bloc d'agarose. density, in a block of agarose and lysis, for example chemical and / or enzymatic, organisms included in the agarose block.

Un autre objet de l'invention consiste en une collection d'acides nucléiques constitués des acides nucléiques obtenus à l'étape II-(iv) du procédé de préparation d'une collection d'acides nucléiques selon l'invention ou encore obtenue à l'étape (c) ou une étape ultérieure du procédé de préparation d'une collection d'acides nucléiques selon l'invention. Another subject of the invention consists in a nucleic acid collection constituted by the nucleic acids obtained in stage II- (iv) of the process for preparing a nucleic acid collection according to the invention or obtained at the same time. step (c) or a subsequent step of the process for preparing a nucleic acid collection according to the invention.

L'invention est encore relative à un acide nucléique caractérisé en ce qu'il est contenu dans une collection d'acides nucléiques telle que définie ci-dessus. The invention also relates to a nucleic acid characterized in that it is contained in a nucleic acid collection as defined above.

Selon un premier aspect, un tel acide nucléique constitutif d'une collection d'acides nucléiques selon l'invention est caractérisé en ce qu'il comprend une séquence nucléotidique codant au moins un opéron, ou une partie d'un opéron. According to a first aspect, such a nucleic acid constituting a nucleic acid collection according to the invention is characterized in that it comprises a nucleotide sequence encoding at least one operon, or part of an operon.

De manière tout à fait préférée, un tel opéron code pour la totalité ou une partie d'une voie métabolique. Most preferably, such an operon encodes all or part of a metabolic pathway.

L'exemple 9 décrit la construction d'une banque d'ADN génomique à partir d'une souche de Streptomyces alboniger et son clonage respectivement dans les cosmides navettes pOS7001 et pOS700R. Il a été montré selon l'invention que dans la banque d'ADN réalisée dans le vecteur intégratif pOS7001 neuf clones contiennent des séquences nucléotidiques appartenant à l'opéron responsable de la voie de biosynthèse de la puromycine. De même, il a pu être identifié au sein de la banque d'ADN réalisée dans le vecteur réplicatif pOS 700R douze clones contenant des séquences nucléotidiques de l'opéron responsable de la voie de biosynthèse de la puromycine. Example 9 describes the construction of a genomic DNA library from a strain of Streptomyces alboniger and its cloning respectively in shuttle cosmos pOS7001 and pOS700R. It has been shown according to the invention that in the DNA library made in the integrative vector pOS7001 nine clones contain nucleotide sequences belonging to the operon responsible for the puromycin biosynthetic pathway. Similarly, twelve clones containing nucleotide sequences of the operon responsible for the puromycin biosynthetic pathway could be identified within the DNA library made in the replicative vector pOS 700R.

En particulier, certains cosmides intégratifs et replicatifs des banques réalisées présentent, après digestion par les endonucléases de restriction ClaI et EcoRV, un fragment d'une taille de 12 kb susceptible de contenir la totalité des séquences de l'opéron responsable de la voie de biosynthèse de la puromycine. In particular, some integrative and replicative cosmids of the libraries produced have, after digestion with ClaI and EcoRV restriction endonucleases, a fragment of a size of 12 kb capable of containing all the sequences of the operon responsible for the biosynthetic pathway. puromycin.

Ainsi, selon un autre aspect, un acide nucléique selon l'invention contient, au moins en partie, des séquences nucléotidiques de l'opéron responsable de la voie de biosynthèse de la puromicyne. Thus, according to another aspect, a nucleic acid according to the invention contains, at least in part, nucleotide sequences of the operon responsible for the puromicyne biosynthetic pathway.

L'exemple 2 ci-après décrit la construction d'une banque d'ADN selon un procédé conforme à la présente invention dans un vecteur pBluescript SK- à partir d'un sol contaminé par du lindane. Example 2 below describes the construction of a DNA library according to a method according to the present invention in a pBluescript SK- vector from a soil contaminated with lindane.

Les vecteurs recombinants ont été transfectés dans des cellules d'Escherichia coli DH10B puis les cellules transformées ont été cultivées dans un milieu de culture approprié en présence de lindane. Un criblage des clones de cellules transformées de la banque a permis de montrer que, sur 10.000 clones criblés, 35 d'entre eux présentaient un phénotype de dégradation du lindane . La présence du gène linA chez ces clones a pu être confirmée par amplification PCR grâce à des amorces spécifiques de ce gène. The recombinant vectors were transfected into Escherichia coli DH10B cells and the transformed cells were cultured in a suitable culture medium in the presence of lindane. Screening the transformed cell clones of the library showed that out of 10,000 clones screened, 35 had a lindane degradation phenotype. The presence of the linA gene in these clones could be confirmed by PCR amplification using primers specific for this gene.

Ainsi, selon un autre aspect, l'invention concerne également un acide nucléique contenant une séquence nucléotidique de la voie métabolique provoquant la biodégradation du lindane. Thus, according to another aspect, the invention also relates to a nucleic acid containing a nucleotide sequence of the metabolic pathway causing the biodegradation of lindane.

Il est donc clairement démontré, comme décrit plus haut, qu'un procédé de préparation d'une collection d'acides nucléiques à partir d'un échantillon de sol contenant des organismes selon l'invention ainsi qu'un procédé de préparation d'une collection de vecteurs recombinants contenant les acides nucléiques constitutifs de la collection d'acides nucléiques précités était tout à fait apte à l'isolement et à la caractérisation de séquences nucléotidiques incluses dans un opéron. It is thus clearly demonstrated, as described above, a method for preparing a nucleic acid collection from a soil sample containing organisms according to the invention as well as a method for preparing a The collection of recombinant vectors containing the nucleic acids constituting the above-mentioned nucleic acid collection was entirely suitable for the isolation and characterization of nucleotide sequences included in an operon.

Une démonstration supplémentaire de l'aptitude d'un procédé selon l'invention à l'identification de séquences nucléotidiques codantes impliquées dans une voie de biosynthèse régulée sous la forme d'un opéron est en outre décrite plus loin: il s'agit du clonage et de la caractérisation de séquences codant pour des polykétides synthases impliquées dans la voie de biosynthèse des polykétides, qui appartiennent à une famille de molécules dont certains représentants sont d'un intérêt thérapeutique majeur, en particulier antibiotique. A further demonstration of the ability of a method according to the invention to identify coding nucleotide sequences involved in a regulated biosynthetic pathway in the form of an operon is further described below: this is cloning and the characterization of sequences coding for polyketide synthases involved in the polyketide biosynthetic pathway, which belong to a family of molecules, some of whose representatives are of major therapeutic interest, in particular antibiotics.

La présente invention a donc en outre pour objet un acide nucléique constitutif d'une collection d'acides nucléiques selon l'invention, caractérisé en ce qu'il comprend la totalité d'une séquence nucléotidique codant pour un polypeptide. The present invention therefore also relates to a nucleic acid constituting a nucleic acid collection according to the invention, characterized in that it comprises all of a nucleotide sequence encoding a polypeptide.

Selon un premier aspect, un acide nucléique constitutif d'une collection d'acides nucléiques selon l'invention est d'origine procaryote. According to a first aspect, a nucleic acid constituting a collection of nucleic acids according to the invention is of procaryotic origin.

Selon un second aspect, un acide nucléique constitutif d'une collection d'acides nucléiques selon l'invention provient d'une bactérie ou d'un virus. According to a second aspect, a nucleic acid constituting a collection of nucleic acids according to the invention comes from a bacterium or a virus.

Selon un troisième aspect, un acide nucléique constitutif d'une collection d'acides nucléiques selon l'invention est d'origine eucaryote. According to a third aspect, a nucleic acid constituting a collection of nucleic acids according to the invention is of eukaryotic origin.

En particulier, un tel acide nucléique est caractérisé en ce qu'il provient d'un champignon, d'une levure, d'une plante ou d'un animal. In particular, such a nucleic acid is characterized in that it comes from a fungus, a yeast, a plant or an animal.

CARACTERISATION MOLECULAIRE DE LA COLLECTION D'ACIDES NUCLEIQUES EXTRAITS DU SOL. MOLECULAR CHARACTERIZATION OF THE NUCLEIC ACID COLLECTION EXTRACTED FROM THE SOIL.

Afin de surmonter les nombreux inconvénients techniques des méthodes de caractérisation des banques d'ADN extraits et purifiés à partir d'un échantillon de l'environnement qui ont été décrits dans la partie de la description relative à l'état de la technique, le demandeur a mis au point un procédé simple et fiable permettant de caractériser qualitativement et semi-quantitativement les acides nucléiques obtenus à l'issue du procédé décrit ci-dessus. In order to overcome the many technical disadvantages of the methods of characterizing the extracted and purified DNA libraries from a sample of the environment that have been described in the prior art portion of the description, the applicant has developed a simple and reliable method for characterizing qualitatively and semi-quantitatively the nucleic acids obtained at the end of the process described above.

Le procédé selon l'invention consiste ainsi à amplifier universellement un fragment de 700 pb localisé à l'intérieur d'une séquence d'ADN ribosomal de type 16 S, puis d'hybrider l'ADN amplifié avec une sonde oligonucléotidique de spécificité variable et enfin de comparer l'intensité d'hybridation de l'échantillon par rapport à une gamme étalon externe d'ADN de séquence ou d'origine connue. The method according to the invention thus consists in universally amplifying a 700 bp fragment located inside a 16S-type ribosomal DNA sequence, and then hybridizing the amplified DNA with an oligonucleotide probe of variable specificity and finally, to compare the intensity of hybridization of the sample with respect to an external standard range of DNA of sequence or of known origin.

L'amplification préalable à l'hybridation avec la sonde oligonucléotidique permet de quantifier des genres ou des espèces de micro-organismes peu abondants. De plus, l'amplification par des amorces universelles permet, lors de l'hybridation, d'utiliser une large série de sondes oligonucléotidiques. The amplification prior to hybridization with the oligonucleotide probe makes it possible to quantify genera or species of scant microorganisms. In addition, the amplification by universal primers makes it possible, during hybridization, to use a large series of oligonucleotide probes.

Ainsi, l'invention a en outre pour objet un procédé de détermination de la diversité des acides nucléiques contenus dans une collection d'acides nucléiques, et tout particulièrement d'une collection d'acides nucléiques provenant d'un échantillon de l'environnement, préférentiellement d'un échantillon du sol, ledit procédé comprenant les étapes suivantes: Thus, the invention furthermore relates to a method for determining the diversity of nucleic acids contained in a collection of nucleic acids, and more particularly to a collection of nucleic acids from a sample of the environment, preferably a soil sample, said method comprising the following steps:

- mise en contact des acides nucléiques de la collection d'acides nucléiques à tester avec un couple d'amorces oligonucléotidiques hybridant à toute séquence d'ADN ribosomal 16 S bactérien; - réalisation d'au moins trois cycles d'amplification ; - détection des acides nucléiques amplifiés à l'aide d'une sonde oligonucléotidique ou d'une pluralité de sondes oligonucléotidiques, chaque sonde hybridant spécifiquement avec une séquence d'ADN ribosomal 16 S commune à un règne, un ordre, une sous-classe ou un genre bactérien; - le cas échéant, comparaison des résultats de l'étape de détection précédente avec les résultats de détection, à l'aide de la sonde ou de la pluralité de sondes d'acides nucléiques de séquence connue constituant une gamme étalon. contacting the nucleic acids of the collection of nucleic acids to be tested with a pair of oligonucleotide primers hybridizing with any bacterial 16S ribosomal DNA sequence; performing at least three amplification cycles; detection of the amplified nucleic acids using an oligonucleotide probe or a plurality of oligonucleotide probes, each probe specifically hybridizing with a 16S ribosomal DNA sequence common to a kingdom, order, subclass or a bacterial genus; where appropriate, comparing the results of the preceding detection step with the detection results, using the probe or the plurality of nucleic acid probes of known sequence constituting a standard range.

De manière préférée, un premier couple d'amorces hybridant avec des régions universellement conservées du gène de l'ARN ribosomal 16 S est constitué respectivement des amorces FGPS 612 (SEQ ID N 12) et FGPS 669 (SEQ ID N 13). Preferably, a first pair of primers hybridizing with universally conserved regions of the 16S ribosomal RNA gene consists respectively of primers FGPS 612 (SEQ ID No. 12) and FGPS 669 (SEQ ID No. 13).

Un second mode de réalisation d'un couple d'amorces préféré selon l'invention est constitué du couple d'amorces universelles 63 f (SEQ ID N 22) et 1387r (SEQ ID N 23). A second embodiment of a preferred pair of primers according to the invention consists of the pair of universal primers 63 (SEQ ID No. 22) and 1387r (SEQ ID No. 23).

Selon un mode particulier de réalisation d'un procédé de détermination de la diversité des acides nucléiques d'une collection d'acides nucléiques, l'étape d'amplification à l'aide d'un couple d'amorces universelles peut être réalisée sur une collection de vecteurs recombinants dans chacun desquels a été inséré un acide nucléique de la collection d'acides nucléiques considérée, préalablement à l'étape d'hybridation avec les sondes oligonucléotidiques spécifiques d'un règne, d'un ordre, d'une sous-classe ou d'un genre bactérien particulier. According to a particular embodiment of a method for determining the nucleic acid diversity of a nucleic acid collection, the amplification step using a pair of universal primers may be performed on a collection of recombinant vectors in each of which has been inserted a nucleic acid of the relevant nucleic acid collection, prior to the step of hybridization with the oligonucleotide probes specific for a kingdom, an order, a sub- class or a particular bacterial genus.

Un tel procédé de détermination de la diversité des acides nucléiques contenus dans une collection est tout particulièrement applicable aux collections d'acides nucléiques obtenus conformément à l'enseignement de la présente description. Such a method for determining the diversity of nucleic acids contained in a collection is particularly applicable to nucleic acid collections obtained according to the teaching of the present description.

Ainsi, l'exemple 3 détaille un procédé de préparation d'une collection d'acides nucléiques à partir d'un échantillon de sol contenant des organismes comprenant une étape d'extraction indirecte d'ADN par dispersion d'un échantillon du sol préalablement à la séparation des Thus, Example 3 details a process for preparing a collection of nucleic acids from a soil sample containing organisms comprising an indirect DNA extraction step by dispersing a soil sample prior to the separation of

cellules sur gradient de Nycodenz, lyse des cellules puis purification de l'ADN sur gradient de chlorure de césium. cells on a Nycodenz gradient, lysis of the cells and then purification of the DNA on a cesium chloride gradient.

La collection d'acides nucléiques ainsi obtenue a été utilisée telle quelle ou sous la forme d'inserts dans des vecteurs de type cosmide dans un procédé d'amplification à l'aide des amorces universelles de l'ADNr 16 S précitées, puis les ADN amplifiés ont été soumis à une étape de détection à l'aide de sondes oligonucléotidiques de séquences SEQ ID N 14 à SEQ ID N 21 qui sont présentées dans le tableau 4. The nucleic acid collection thus obtained was used as such or in the form of inserts in cosmid type vectors in an amplification process using the above-mentioned universal primers of the 16S rDNA, and then the DNAs. amplified were subjected to a detection step using oligonucleotide probes of SEQ ID N 14 to SEQ ID N 21 which are presented in Table 4.

Les résultats montrent qu'un procédé de préparation d'une collection d'acides nucléiques à partir d'un échantillon de sol contenant des organismes selon l'invention permet d'accéder à l'ADN de plus de 14% de la microflore tellurique totale, soit 2 x 108 cellules par gramme de sol, alors que la microflore totale cultivable ne représente qu'à peine 2% de la population microbienne totale. The results show that a method of preparing a nucleic acid collection from a soil sample containing organisms according to the invention provides access to DNA of more than 14% of the total telluric microflora. that is 2 x 108 cells per gram of soil, whereas the total cultivable microflora represents only 2% of the total microbial population.

Afin de déterminer la diversité phylogénétique d'une collection d'acides nucléiques préparés conformément à l'invention, 47 séquences du gène ARNr 16S ont été isolées et séquencées. Ces séquences correspondent respectivement aux séquences nucléotidiques SEQ ID N 60 à SEQ ID N 106. In order to determine the phylogenetic diversity of a collection of nucleic acids prepared in accordance with the invention, 47 sequences of the 16S rRNA gene were isolated and sequenced. These sequences correspond respectively to the nucleotide sequences SEQ ID N 60 to SEQ ID N 106.

Les acides nucléiques comprenant les séquences SEQ ID N 60 à SEQ ID N 106 font également partie de l'invention, ainsi que les acides nucléiques possédant au moins 99 %, préférentiellement 99,5% ou 99,8% d'identité en acides nucléiques avec les acides nucléiques comprenant les séquences SEQ ID N 60 à SEQ ID N 106. De telles séquences peuvent être utilisées notamment en tant que sondes pour cribler des clones d'une banque d'ADN et identifier ainsi ceux , parmi les clones de la banque, qui contiennent de telles séquences, ces séquences étant suceptibles d'être à proximité de séquences codantes d'intérêt, telles que des séquences codant pour des enzymes impliquées dans la voie de biosynthèse de métabolites antibiotiques, par exemple des polykétides. The nucleic acids comprising the sequences SEQ ID N 60 to SEQ ID N 106 are also part of the invention, as well as the nucleic acids having at least 99%, preferentially 99.5% or 99.8% nucleic acid identity. with the nucleic acids comprising the sequences SEQ ID N 60 to SEQ ID N 106. Such sequences can be used in particular as probes to screen clones of a DNA library and thus identify those among the clones of the library , which contain such sequences, these sequences being likely to be close to coding sequences of interest, such as sequences coding for enzymes involved in the biosynthetic pathway of antibiotic metabolites, for example polyketides.

La comparaison des séquences d'ARNr 16S à partir d'une banque d'ADN réalisée conformément à l'invention avec les séquences répertoriées dans la base données RDP (Maidak B.L., Cole J. R., Parker C. T., Garrity G.M., Larsen N., Li B., Lilburn T. G., McCaughey, M. J., Olsen G.J., Overbeek R., Pramanik S., Schmidt T.M., Tiedje J. M., Woese C. R. The comparison of the 16S rRNA sequences from a DNA library made according to the invention with the sequences listed in the RDP data base (Maidak BL, Cole JR, Parker CT, Garrity GM, Larsen N., Li B., Lilburn TG, McCaughey, MJ, Olsen GJ, R. Overbeek, S. Pramanik, Schmidt TM, Tiedje JM, Woese CR

(1999) " A new projet of the RDP (Ribosomal Database Project) " Nucleic (1999) "A new project of the RDP (Ribosomal Database Project)" Nucleic

Acids Research Vol. 27 : 171-173) ont permis de déterminer que les acides nucléiques contenus dans une collection d'acides nucléiques selon l'invention proviennent d'a-protéobactéries, de -protéobactéries, de 8protéobactéries, de y-protéobactéries, d'actinomycètes ainsi que d'un genre apparenté à acidobactérium. Ces résultats, présentés dans le tableau 7 ainsi que par l'arbre phylogénétique de la figure 7 rendent compte de la grande diversité phylogénétique des acides nucléiques contenus dans une banque d'ADN préparée conformément au procédé selon l'invention. Acids Research Vol. 27: 171-173) have made it possible to determine that the nucleic acids contained in a collection of nucleic acids according to the invention come from a-proteobacteria, -proteobacteria, 8proteobacteria, γ-proteobacteria, actinomycetes as well as of a genus related to acidobacterium. These results, presented in Table 7 as well as by the phylogenetic tree of FIG. 7, account for the great phylogenetic diversity of the nucleic acids contained in a DNA library prepared according to the process according to the invention.

VECTEURS DE CLONAGE ET/OU D'EXPRESSION
Chacun des acides nucléiques contenus dans une collection d'acides nucléiques préparés conformément à l'invention peut être inséré dans un vecteur de clonage et/ou d'expression. CLONING AND / OR EXPRESSION VECTORS
Each of the nucleic acids contained in a collection of nucleic acids prepared in accordance with the invention may be inserted into a cloning and / or expression vector.

A cette fin, tous types de vecteurs connus de l'état de la technique peuvent être utilisés, tels que des vecteurs viraux , des phages, des plasmides, des phagemides, des cosmides, des phosmides, des vecteurs de type BAC, des bactériophages P1 , des vecteurs de type BAC, des vecteurs de type YAC, des plasmides de levure ou encore tout autre vecteur connu de l'état de la technique par l'homme du métier. To this end, all types of vectors known in the state of the art can be used, such as viral vectors, phages, plasmids, phagemids, cosmids, phosmides, BAC type vectors, P1 bacteriophages. , BAC type vectors, YAC type vectors, yeast plasmids or any other vector known to the state of the art by the skilled person.

On aura avantageusement recours selon l'invention à des vecteurs permettant une expression stable des acides nucléiques d'une banque d'ADN. A cette fin, de tels vecteurs incluent préférentiellement des séquences de régulation de la transcription qui sont localisées en phase (" operably linked ") avec l'insert génomique de manière à permettre l'initiation et/ou la régulation de l'expression d'au moins une partie dudit insert d'ADN. Advantageously, according to the invention, vectors will be used which allow stable expression of the nucleic acids of a DNA library. For this purpose, such vectors preferentially include transcriptional control sequences which are located in phase ("operably linked") with the genomic insert so as to allow the initiation and / or regulation of the expression of at least a portion of said DNA insert.

Il résulte de ce qui précède, que l'invention concerne encore un procédé de préparation d'une collection de vecteurs recombinants caractérisé en ce que les acides nucléiques obtenus à l'étape Il-(iv) ou à l'étape I-(c) ou toute autre étape ultérieure d'un procédé de préparation d'une collection d'acides nucléiques à partir d'un échantillon de sol contenant des organismes selon l'invention sont insérés dans un vecteur de clonage et/ou d'expression. As a result of the foregoing, the invention also relates to a process for preparing a recombinant vector collection characterized in that the nucleic acids obtained in step II- (iv) or in step I- (c) or any further step of a method of preparing a nucleic acid collection from a soil sample containing organisms according to the invention are inserted into a cloning and / or expression vector.

Préalablement à leur insertion dans un vecteur de clonage et/ou d'expression, les acides nucléiques constitutifs d'une collection d'acides nucléiques selon l'invention peuvent être séparés en fonction de leur taille, par exemple par électrophorèse sur un gel d'agarose, le cas échéant après digestion à l'aide d'une endonucléase de restriction. Prior to their insertion into a cloning and / or expression vector, the nucleic acids constituting a nucleic acid collection according to the invention can be separated according to their size, for example by electrophoresis on a gel of agarose, if appropriate after digestion with a restriction endonuclease.

Selon un autre aspect, la taille moyenne des acides nucléiques constitutifs d'une collection d'acides nucléiques selon l'invention peut être rendue d'une taille sensiblement uniforme par la mise en oeuvre d'une étape de rupture physique préalablement à leur insertion dans le vecteur de clonage et/ou d'expression. In another aspect, the average size of the nucleic acids constituting a nucleic acid collection according to the invention can be made of a substantially uniform size by the implementation of a physical disruption step prior to their insertion into the cloning and / or expression vector.

Une telle étape de rupture physique ou mécanique des acides nucléiques peut consister en des passages successifs de ces derniers, en solution, dans un canal métallique d'environ 0,4 mm de diamètre, par exemple le canal d'une aiguille de seringue ayant un tel diamètre. Such a step of physical or mechanical breaking of the nucleic acids may consist of successive passages of the latter, in solution, in a metal channel of about 0.4 mm in diameter, for example the channel of a syringe needle having a such diameter.

La taille moyenne des acides nucléiques peut dans ce cas être comprise entre 30 et 40 kb de longueur. The average size of the nucleic acids can in this case be between 30 and 40 kb in length.

La construction des vecteurs préférés selon l'invention est shématisée dans les figures 25 (cosmide intégrarif conjugatif) et 26 (BAC intégratif ). The construction of the preferred vectors according to the invention is shematized in FIGS. 25 (conjugative integrative cosmid) and 26 (integrative BAC).

Des vecteurs de clonage et/ou d'expression pouvant être avantageusement utilisés aux fins d'insertion des acides nucléiques contenus dans une collection ou banque d'ADN selon l'invention sont notamment les vecteurs décrits dans le brevet européen N EP-0 350 341 et dans le brevet US N 5 688 689, de tels vecteurs étant spécialement adaptés à la transformation de souches d'actinomycètes. De tels vecteurs contiennent, outre une séquence d'ADN de l'insert, une séquence d'attachement att ainsi qu'une séquence d'ADN codant pour une intégrase (séquence int) fonctionnelle dans les souches d'actinomycètes. Cloning and / or expression vectors that can be advantageously used for the purpose of inserting the nucleic acids contained in a collection or DNA library according to the invention are, in particular, the vectors described in European Patent No. EP-0 350 341. and in US Patent No. 5,688,689, such vectors being especially adapted for the transformation of actinomycete strains. Such vectors contain, in addition to a DNA sequence of the insert, an att attachment sequence as well as a DNA sequence encoding an integrase (int sequence) functional in the actinomycete strains.

Toutefois, il a été observé selon l'invention que certains vecteurs de clonage et/ou d'expression présentaient des inconvénients et que leur capacité fonctionnelle théorique n'était pas atteinte dans la pratique. However, it has been observed according to the invention that certain cloning and / or expression vectors have disadvantages and that their theoretical functional capacity has not been reached in practice.

Ainsi, il est apparu que le système d'intégration contenu dans des vecteurs de l'état de la technique, et notamment dans les vecteurs décrits dans le brevet européen n EP 0 350 41 ne permettait pas en Thus, it has been found that the integration system contained in vectors of the state of the art, and in particular in the vectors described in European Patent No. EP 0 350 41, did not make it possible

réalité une bonne intégration de l'insert d'ADN de la banque au sein du chromosome bactérien. reality a good integration of the DNA insert of the bank within the bacterial chromosome.

Partant de l'hypothèse que les déficits fonctionnels d'intégration de tels vecteurs au sein du chromosome bactérien étaient dus à un défaut dans l'expression du gène de l'intégrase présent dans ces vecteurs, le demandeur a tout d'abord cherché à augmenter l'expression du gène de l'intégrase en substituant au promoteur de la transcription initial un promoteur de la transcription susceptible d'augmenter significativement le nombre de transcrits de l'intégrase. Assuming that the functional deficits of integration of such vectors within the bacterial chromosome were due to a defect in the expression of the integrase gene present in these vectors, the applicant first sought to increase expression of the integrase gene by substituting for the promoter of the initial transcription a transcription promoter capable of significantly increasing the number of transcripts of the integrase.

Les résultats ont été décevants et la fonction d'intégration au chromosome de ces vecteurs n'a pas été améliorée. The results have been disappointing and the chromosome integration function of these vectors has not been improved.

De manière surprenante, il a été montré selon l'invention que les difficultés d'expression de l'intégrase contenues dans cette famille de vecteurs intégratifs ne se situait pas au niveau de la quantité d'expression des transcrits, mais au niveau de leur stabilité. Surprisingly, it has been shown according to the invention that the difficulties of expression of the integrase contained in this integrative vector family was not at the level of the expression quantity of the transcripts, but at the level of their stability. .

Selon une seconde hypothèse, le demandeur a pu montrer que le défaut de stabilité des transcrits de l'intégrase était causé par des déficits dans la terminaison de la transcription de l'ARN messager correspondant. According to a second hypothesis, the applicant could show that the lack of stability of integrase transcripts was caused by deficits in the transcription termination of the corresponding messenger RNA.

Le demandeur a alors inséré un site terminateur placé en aval de la séquence codant pour l'intégrase du vecteur de manière à obtenir un ARN messager de taille déterminée. L'insertion d'un signal de terminaison additionnel en aval de la séquence nucléotidique codant pour l'intégrase du vecteur a permis l'obtention d'une famille de vecteurs intégratifs de type cosmide et de type BAC . The applicant then inserted a terminator site placed downstream of the sequence encoding the vector integrase so as to obtain a messenger RNA of determined size. The insertion of an additional termination signal downstream of the nucleotide sequence coding for the vector integrase made it possible to obtain a family of integrative vectors of the cosmid type and of the BAC type.

Préférentiellement, le site terminateur est placé en aval du site d'attachement att. Preferably, the terminator site is placed downstream of the att attachment site.

En outre, le demandeur a mis au point de nouveaux vecteurs conjugatifs et de nouveaux vecteurs réplicatifs du type cosmide et de nouveaux vecteurs conjugatifs de type BAC qui peuvent avantageusement être utilisés pour l'insertion des acides nucléiques constitutifs d'une collection d'acides nucléiques préparés selon le procédé de l'invention. In addition, the Applicant has developed novel conjugative vectors and novel cosmid-type replicative vectors and novel BAC-type conjugative vectors which can advantageously be used for the insertion of the nucleic acids constituting a nucleic acid collection. prepared according to the process of the invention.

Lorsque l'insertion de fragments d'ADN de taille moyenne est recherchée, on utilise préférentiellement des vecteurs du type cosmide, capables de recevoir des inserts ayant une taille maximale d'environ 50 kb. When the insertion of medium-sized DNA fragments is desired, cosmid-type vectors are preferably used, capable of receiving inserts having a maximum size of about 50 kb.

De tels vecteurs cosmidiques sont tout particulièrement adaptés pour l'insertion d'acides nucléiques constitutifs d'une collection d'acides nucléiques obtenus selon le procédé de l'invention comprenant une première étape d'extraction directe d'ADN par lyse mécanique des organismes contenus dans l'échantillon de sol initial. Such cosmidic vectors are particularly suitable for the insertion of nucleic acids constituting a collection of nucleic acids obtained according to the method of the invention comprising a first step of direct extraction of DNA by mechanical lysis of the organisms contained in the initial soil sample.

Lorsque l'insertion d'acides nucléiques de grande taille, en particulier d'acides nucléiques d'une taille supérieure à 100 kb, voire supérieure à 200, 300, 400,500 ou 600 kb est recherchée, on aura alors recours préférentiellement à des vecteurs du type BAC capables de recevoir des inserts d'ADN d'une telle taille. When the insertion of large nucleic acids, in particular nucleic acids of a size greater than 100 kb, or even greater than 200, 300, 400,500 or 600 kb, is sought, preference will then be given to vectors of the nucleic acid. BAC type capable of receiving DNA inserts of such size.

De tels vecteurs de type BAC sont tout particulièrement adaptés pour l'insertion des acides nucléiques constitutifs d'une collection d'acides nucléiques obtenus conformément au procédé selon l'invention dans lequel la première étape est constituée d'une extraction indirecte de l'ADN par séparation préalable des organismes contenus dans l'échantillon de sol initial et élimination des macro-constituants dudit échantillon de sol. Such BAC type vectors are particularly suitable for the insertion of the nucleic acids constituting a collection of nucleic acids obtained according to the process according to the invention in which the first step consists of an indirect extraction of the DNA. by first separating the organisms contained in the initial soil sample and removing the macromolecules from said soil sample.

En particulier, des vecteurs du type BAC sont avantageusement mis en oeuvre pour l'insertion d'acides nucléiques de grande taille contenant, au moins partiellement, la séquence nucléotidique d'un opéron. In particular, BAC-type vectors are advantageously used for the insertion of large nucleic acids containing, at least partially, the nucleotide sequence of an operon.

Ainsi, le procédé de préparation d'une collection de vecteurs recombinants de clonage et/ou d'expression selon l'invention est en outre caractérisé en ce que le vecteur de clonage et/ou d'expression est du type plasmide. Thus, the process for preparing a collection of recombinant cloning and / or expression vectors according to the invention is further characterized in that the cloning and / or expression vector is of the plasmid type.

Selon un autre aspect, un tel procédé est caractérisé en ce que le vecteur de clonage et/ou d'expression est du type cosmide. In another aspect, such a method is characterized in that the cloning and / or expression vector is of the cosmid type.

Selon un premier aspect, il peut s'agir d'un cosmide réplicatif chez E.coli et intégratif chez Streptomyces. Un vecteur cosmidique tout à fait préféré répondant à une telle définition est le cosmide pOS7001 décrit à l'exemple 3. According to a first aspect, it may be a replicative cosmid in E. coli and integrative in Streptomyces. A most preferred cosmid vector responding to such a definition is the cosmos pOS7001 described in Example 3.

Selon encore un autre aspect, le vecteur cosmidique est conjugatif et intégratif chez Streptomyces. In yet another aspect, the cosmid vector is conjugative and integrative in Streptomyces.

De manière générale, des vecteurs conjugatifs de type cosmide ou de type BAC, qui comprennent dans leurs séquences nucléotidiques un motif reconnu par la machinerie enzymatique cellulaire appelé " origine de conjugaison " sont utilisés chaque fois que l'on veut éviter un recours à des techniques de transformation lourdes et peu automatisables. In general, conjugative vectors of cosmid type or BAC type, which comprise in their nucleotide sequences a pattern recognized by the cellular enzyme machinery called "origin of conjugation" are used whenever it is desired to avoid recourse to techniques of heavy transformation and little automation.

Par exemple, la transfection de vecteurs initialement hébergés par des cellules de E.coli dans des cellules de Streptomyces nécessite classiquement une étape de récupération du vecteur recombinant contenu dans les cellules de Escherichia coli, et sa purification préalable à l'étape de transformation de protoplastes de Streptomyces. Il est communément admis qu'une transfection d'un ensemble de 1000 clones de Escherichia coli dans Streptomyces requiert l'obtention d'environ 8000 clones pour que chaque clone de E. coli ait une chance d'être représenté. For example, the transfection of vectors initially harbored by E. coli cells in Streptomyces cells conventionally requires a step of recovery of the recombinant vector contained in Escherichia coli cells, and its prior purification at the protoplast transformation stage. of Streptomyces. It is commonly accepted that transfection of a set of 1000 clones of Escherichia coli into Streptomyces requires the production of about 8000 clones for each clone of E. coli to have a chance of being represented.

A l'inverse, une étape de transfection par conjugaison d'un vecteur hébergé par E.coli vers des cellules de Streptomyces nécessite le même nombre de clones de chacun des micro-organismes, l'étape de conjugaison ayant lieu " clone à clone " et ne comprenant en outre pas les difficultés techniques liées à l'étape de transfert de matériel génétique par transformation de protoplastes, par exemple en présence de polyéthylène glycol. In contrast, a step of transfection by conjugation of an E. coli-hosted vector to Streptomyces cells requires the same number of clones of each of the microorganisms, the conjugation step taking place "clone to clone" and furthermore not including the technical difficulties associated with the step of transferring genetic material by protoplast transformation, for example in the presence of polyethylene glycol.

Afin d'optimiser la construction de banque d'ADN chez Streptomyces, il a été mis au point selon l'invention, de nouveaux vecteurs conjugatifs de type cosmide et de type BAC de nature à permettre une efficacité maximale de l'étape de conjugaison. In order to optimize the DNA library construction in Streptomyces, new conjugative vectors of cosmid type and BAC type have been developed according to the invention so as to allow maximum efficiency of the conjugation step.

Notamment, les nouveaux vecteurs conjugatifs selon l'invention ont été construits en plaçant un gène marqueur de sélection à l'extrémité de l'ADN du vecteur qui est transféré à la bactérie réceptrice en dernier lieu.Ce perfectionnement aux vecteurs conjugatifs de l'état de la technique permet de ne sélectionner positivement que les bactéries réceptrices ayant reçu la totalité de l'ADN du vecteur et, en conséquence, la totalité de l'ADN de l'insert d'intérêt. In particular, the novel conjugative vectors according to the invention have been constructed by placing a selection marker gene at the end of the vector DNA which is transferred to the recipient bacterium in the last place. This enhancement to the conjugative vectors of the state of the technique makes it possible to positively select only the receptor bacteria which have received all the DNA of the vector and, consequently, all the DNA of the insert of interest.

Un cosmide conjugatif et intégratif chez Streptomyces préféré selon l'invention est le cosmide pOSV303 décrit à l'exemple 5. A conjugative and integrative cosmid in Streptomyces preferred according to the invention is the cosmid pOSV303 described in Example 5.

Selon un autre aspect, un procédé de préparation d'une collection de vecteurs recombinants selon l'invention est mis en oeuvre à l'aide d'un cosmide réplicatif à la fois chez E.coli et chez Streptomyces. In another aspect, a method of preparing a recombinant vector collection according to the invention is carried out using a replicative cosmid in both E. coli and Streptomyces.

Un tel cosmide est avantageusement le cosmide pOS 700R décrit à l'exemple 6. Such a cosmid is advantageously the cosmos pOS 700R described in Example 6.

Selon encore un autre aspect, le procédé ci-dessus peut être mis en oeuvre avec un cosmide réplicatif chez E. coli et Streptomyces et conjugatif chez Streptomyces. In yet another aspect, the above method can be carried out with a replicative cosmid in E. coli and Streptomyces and conjugative in Streptomyces.

Un tel cosmide réplicatif et conjugatif peut être obtenu à partir d'un cosmide réplicatif conforme à l'invention, par l'insertion d'une origine de transfert appropriée, telle que RK2, comme décrit à l'exemple 5 pour la construction du vecteur pOSV303. Such a replicative and conjugative cosmid can be obtained from a replicative cosmid according to the invention, by inserting an appropriate transfer origin, such as RK2, as described in Example 5 for the construction of the vector. pOSV303.

Selon un autre mode de réalisation avantageux du procédé de préparation d'une collection de vecteurs recombinants selon l'invention, on a recours à un vecteur de clonage et/ou d'expression de type BAC. According to another advantageous embodiment of the process for preparing a recombinant vector collection according to the invention, a BAC-type cloning and / or expression vector is used.

* Selon un premier aspect, le vecteur du type BAC est intégratif et conjugatif chez Streptomyces. According to a first aspect, the BAC-type vector is integrative and conjugative in Streptomyces.

De manière tout à fait préférée, un tel vecteur BAC intégratif et conjugatif chez Streptomyces est le vecteur BAC pOSV 403 décrit à l'exemple 8. Most preferably, such an integrative and conjugative BAC vector in Streptomyces is the BAC vector pOSV 403 described in Example 8.

L'invention a en outre pour objet un vecteur recombinant caractérisé en ce qu'il est choisi parmi les vecteurs recombinants suivants : a) un vecteur comprenant un acide nucléique constitutif d'une collection d'acides nucléiques selon l'invention; b) un vecteur tel qu'obtenu selon un procédé éliminant tout recours à l'action d'une endonucléase de restriction sur le fragment d'ADN à insérer, tel que décrit précédemment. The invention further relates to a recombinant vector characterized in that it is chosen from the following recombinant vectors: a) a vector comprising a nucleic acid constituting a nucleic acid collection according to the invention; b) a vector as obtained by a method eliminating any recourse to the action of a restriction endonuclease on the DNA fragment to be inserted, as described above.

De manière tout à fait préférée, l'invention est également relative à un vecteur choisi parmi les vecteurs suivants: - le cosmide pOS7001; - le cosmide pOSV303; - le cosmide pOS700R; - le vecteur BAC pOSV403;
L'invention est en outre relative à une collection de vecteurs recombinants tels qu'obtenus selon l'un quelconque des procédés selon l'invention. Most preferably, the invention also relates to a vector chosen from the following vectors: cosmid pOS7001; the cosmid pOSV303; the cosmid pOS700R; the BAC vector pOSV403;
The invention further relates to a collection of recombinant vectors as obtained according to any of the methods according to the invention.

Procédé de préparation d'un vecteur recombinant de clonage et/ou d'expression selon l'invention. Process for the preparation of a recombinant cloning and / or expression vector according to the invention

Les techniques conventionnelles d'insertion d'ADN au sein d'un vecteur afin de préparer un vecteur de clonage et/ou d'expression recombinant font classiquement appel à une première étape au cours de laquelle une endonucléase de restriction est incubée à la fois avec l'ADN à insérer et avec le vecteur récepteur créant ainsi des extrémités compatibles entre l'ADN à insérer et l'ADN du vecteur permettant l'assemblage des deux ADN avant une étape de ligation finale permettant l'obtention du vecteur recombinant. Conventional techniques for inserting DNA into a vector to prepare a recombinant cloning and / or expression vector typically use a first step in which a restriction endonuclease is incubated with both the DNA to be inserted and with the receptor vector thus creating compatible ends between the DNA to be inserted and the DNA of the vector allowing the assembly of the two DNAs before a final ligation step making it possible to obtain the recombinant vector.

Toutefois, une telle technique conventionnelle présente des inconvénients notables, tout particulièrement lorsque est recherchée l'insertion d'acides nucléiques de grande taille dans un vecteur de clonage et/ou d'expression. However, such a conventional technique has notable drawbacks, particularly when the insertion of large nucleic acids into a cloning and / or expression vector is desired.

En effet, l'action préalable d'une enzyme de restriction sur les fragments d'ADN destinés à être insérés dans un vecteur est susceptible de réduire notablement la taille de cet ADN préalablement à son insertion dans le vecteur. Il va sans dire qu'une réduction significative de la taille de l'ADN préalablement à son insertion sur un vecteur est une situation particulièrement défavorable lorsqu'est recherché le clonage de fragments d'ADN de grande taille susceptible de contenir l'ensemble des séquences codantes et, le cas échéant, également des séquences régulatrices, d'un opéron dont l'expression constitue une voie de biosynthèse complète d'un métabolite d'intérêt industriel, et tout particulièrement d'un composé d'intérêt thérapeutique. Indeed, the prior action of a restriction enzyme on the DNA fragments to be inserted into a vector is likely to significantly reduce the size of this DNA prior to its insertion into the vector. It goes without saying that a significant reduction in the size of the DNA prior to its insertion on a vector is a particularly unfavorable situation when cloning of large DNA fragments that may contain all the sequences is sought. coding and, where appropriate, also regulatory sequences, of an operon whose expression constitutes a complete biosynthetic pathway of a metabolite of industrial interest, and more particularly of a compound of therapeutic interest.

Pour remédier aux inconvénients des techniques de l'art antérieur, il a été mis au point selon l'invention deux procédés de préparation d'un vecteur recombinant de clonage et/ou d'expression qui ne nécessitent pas le recours à une endonucléase de restriction sur l'ADN à insérer préalablement à son introduction au sein du vecteur. De tels procédés sont en conséquence tout à fait adaptés au clonage de longs fragments d'ADN susceptibles de contenir, au moins partiellement, l'ensemble des séquences codantes et, le cas échéant, également des In order to overcome the drawbacks of the prior art techniques, two methods of preparing a recombinant cloning and / or expression vector which do not require the use of a restriction endonuclease have been developed according to the invention. on the DNA to insert prior to its introduction into the vector. Such methods are therefore quite suitable for cloning long DNA fragments which may contain, at least partially, all the coding sequences and, where appropriate, also

séquences régulatrices, d'un opéron complet responsable d'une voie de biosynthèse. regulatory sequences, of a complete operon responsible for a biosynthetic pathway.

Selon un premier aspect, un procédé de préparation d'un vecteur recombinant de clonage et/ou d'expression selon l'invention est caractérisé en ce que l'insertion d'un acide nucléique dans le vecteur de clonage et/ou d'expression, comprend les étapes suivantes: - ouvrir le vecteur de clonage et/ou d'expression à un site de clonage choisi, à l'aide d'une endonucléase de restriction appropriée; - ajouter un premier acide nucléique homopolymérique à l'extrémité 3' libre du vecteur ouvert; - ajouter un second acide nucléique homopolymérique, de séquence complémentaire au premier acide nucléique homopolymérique, à l'extrémité 3' libre de l'acide nucléique à insérer dans le vecteur ; - assembler l'acide nucléique du vecteur et l'acide nucléique par hybridation du premier et du second acide nucléique homopolymérique de séquences complémentaires l'une de l'autre; - refermer le vecteur par ligation. According to a first aspect, a process for preparing a recombinant cloning and / or expression vector according to the invention is characterized in that the insertion of a nucleic acid into the cloning and / or expression vector , comprises the steps of: - opening the cloning and / or expression vector at a selected cloning site, using an appropriate restriction endonuclease; adding a first homopolymeric nucleic acid to the free 3 'end of the open vector; adding a second homopolymeric nucleic acid, of complementary sequence to the first homopolymeric nucleic acid, to the free 3 'end of the nucleic acid to be inserted into the vector; assembling the nucleic acid of the vector and the nucleic acid by hybridization of the first and second homopolymeric nucleic acid with sequences complementary to one another; - close the vector by ligation.

Un tel procédé est décrit aux exemples 10 et 13 ci-après. Such a method is described in Examples 10 and 13 below.

De manière avantageuse, le procédé ci-dessus peut comporter les caractéristiques suivantes, isolément ou en combinaison: - le premier acide nucléique homopolymérique est de séquence poly(A) ou poly(T); - le second acide nucléique homopolymérique est de séquence poly(T) ou poly(A). Advantageously, the above method may comprise the following characteristics, alone or in combination: the first homopolymeric nucleic acid is of poly (A) or poly (T) sequence; the second homopolymeric nucleic acid is of poly (T) or poly (A) sequence.

De manière tout à fait préférée, les acides nucléiques homopolymériques ont une longueur comprise entre 25 et 100 bases nucléotidiques, préférentiellement entre 25 et 70 bases nucléotidiques. Most preferably, the homopolymeric nucleic acids have a length of between 25 and 100 nucleotide bases, preferably between 25 and 70 nucleotide bases.

Le procédé de préparation d'un vecteur recombinant de clonage et/ou d'expression décrit ci-dessus est particulièrement adapté à la construction de banques d'ADN dans des vecteurs de type BAC. Ainsi, selon un mode de réalisation avantageux du procédé de préparation d'un vecteur recombinant décrit ci-dessus, ledit procédé est en outre caractérisé en ce que la taille de l'acide nucléique à insérer est d'au moins 100 kb, et préférentiellement d'au moins 200,300, 400,500 ou 600 kb. The process for preparing a recombinant cloning and / or expression vector described above is particularly suitable for the construction of DNA libraries in BAC type vectors. Thus, according to an advantageous embodiment of the process for preparing a recombinant vector described above, said method is further characterized in that the size of the nucleic acid to be inserted is at least 100 kb, and preferentially at least 200,300, 400,500 or 600 kb.

Un tel procédé de préparation est donc particulièrement adapté à l'insertion des acides nucléiques contenus dans une collection d'acides nucléiques obtenus selon le procédé de l'invention. Such a preparation method is therefore particularly suitable for the insertion of the nucleic acids contained in a collection of nucleic acids obtained according to the method of the invention.

Afin de permettre l'insertion de fragments d'ADN de grande taille dans des vecteurs de clonage et/ou d'expression, il a été mis au point selon l'invention, un second procédé ayant permis d'éliminer tout recours à l'action d'une endonucléase de restriction sur l'ADN destiné à être inséré au sein du vecteur. In order to allow the insertion of large DNA fragments into cloning and / or expression vectors, it has been developed according to the invention, a second method having made it possible to eliminate any recourse to the action of a restriction endonuclease on the DNA to be inserted into the vector.

Un tel procédé de préparation d'un vecteur recombinant de clonage et/ou d'expression selon l'invention est caractérisé en ce que l'étape d'insertion d'un acide nucléique dans ledit vecteur de clonage et/ou d'expression comprend les étapes suivantes: - création de bouts francs sur les extrémités de l'acide nucléique de la collection par élimination des séquences 3' sortantes et remplissage des séquences 5' sortantes; - ouverture du vecteur de clonage et/ou d'expression à un site de clonage choisi à l'aide d'une endonucléase de restriction appropriée; - adition d'adaptateurs oligonucléotidiques complémentaires ; - création de bouts francs aux extrémités de l'acide nucléique du vecteur par élimination des séquences 3' sortantes et remplissage des séquences 5' sortantes, puis déphosphorylation des extrémités 5' afin de prévenir une recircularisation du vecteur; - insertion de l'acide nucléique de la collection dans le vecteur par ligation. Such a process for preparing a recombinant cloning and / or expression vector according to the invention is characterized in that the step of inserting a nucleic acid into said cloning and / or expression vector comprises the following steps: - creation of blunt ends on the ends of the nucleic acid of the collection by elimination of the 3 'outgoing sequences and filling the 5' outgoing sequences; opening of the cloning and / or expression vector to a cloning site chosen with the aid of an appropriate restriction endonuclease; addition of complementary oligonucleotide adapters; creation of blunt ends at the ends of the nucleic acid of the vector by elimination of the outgoing 3 'sequences and filling of the 5' outgoing sequences, followed by dephosphorylation of the 5 'ends in order to prevent recircularization of the vector; - Insertion of the nucleic acid of the collection into the vector by ligation.

De manière préférée, l'élimination des séquences 3' sortantes est réalisée à l'aide d'une exonucléase, telle que l'enzyme de Klenow. Preferably, the removal of the 3 'outgoing sequences is carried out using an exonuclease, such as the Klenow enzyme.

De manière préférée, le remplissage des séquences 5' sortantes est réalisé à l'aide d'une polymérase, et de manière tout à fait préférée de la T4 polymérase, en présence des quatre nucléotides triphosphates. Preferably, the filling of the 5 'outgoing sequences is carried out using a polymerase, and very preferably T4 polymerase, in the presence of the four nucleotide triphosphates.

Un procédé de préparation d'un vecteur recombinant de clonage et/ou d'expression par élimination des séquences 3' sortantes et remplissage des séquences 5' sortantes tel que décrit ci-dessus est particulièrement adapté à la construction de banques d'ADN à partir de vecteurs de type cosmide. A method for preparing a recombinant cloning and / or expression vector by removing the outgoing 3 'sequences and filling the outgoing 5' sequences as described above is particularly suitable for the construction of DNA libraries from of vectors of cosmid type.

Un tel procédé d'obtention de vecteurs recombinants est décrit à l'exemple 12. Such a process for obtaining recombinant vectors is described in Example 12.

Dans un mode particulier de préparation d'un vecteur recombinant selon l'invention, des oligonucléotides comprenant un ou plusieurs sites de restriction rares sont ajoutés sur le vecteur au niveau du site de clonage de l'ADN à insérer, conformément à l'enseignement de l'exemple 10. Cet ajout d'oligonucléotides facilite la récupération ultérieure des inserts sans clivage de ces derniers. In a particular mode of preparation of a recombinant vector according to the invention, oligonucleotides comprising one or more rare restriction sites are added to the vector at the cloning site of the DNA to be inserted, in accordance with the teaching of Example 10. This addition of oligonucleotides facilitates the subsequent recovery of the inserts without cleavage of the latter.

CELLULES HOTES
Bien que tout type de cellules hôtes puisse être utilisé pour la transfection ou la transformation avec un acide nucléique ou un vecteur recombinant selon l'invention, notamment une cellule hôte procaryote ou eucaryote, on utilisera de préférence des cellules hôtes dont les caractères physiologiques, biochimiques et génétiques sont bien caractérisés, facilement cultivables à grande échelle et dont les conditions de culture pour la production de métabolites soient bien connues. HOST CELLS
Although any type of host cell may be used for transfection or transformation with a nucleic acid or a recombinant vector according to the invention, in particular a prokaryotic or eukaryotic host cell, it will be preferable to use host cells whose physiological, biochemical characteristics and genetic are well characterized, easily cultivable on a large scale and whose culture conditions for the production of metabolites are well known.

De manière préférentielle, la cellule hôte réceptrice d'un acide nucléique ou d'un vecteur recombinant selon l'invention est phylogénétiquement proche des organismes donneurs contenus initialement dans l'échantillon de l'environnement desquels les acides nucléiques sont originaires. Preferably, the host cell receiving a nucleic acid or a recombinant vector according to the invention is phylogenetically close to the donor organisms initially contained in the sample of the environment from which the nucleic acids originate.

De manière tout à fait préférée, une cellule hôte selon l'invention doit posséder un usage des codons similaire, ou du moins proche, des Most preferably, a host cell according to the invention must have a similar or at least a similar codon usage.

organismes donneurs présents initialement dans l'échantillon de l'environnement, tout particulièrement de l'échantillon de sol. donor organisms initially present in the environmental sample, especially the soil sample.

La taille des fragments d'ADN susceptible de porter les séquences nucléotidiques d'intérêt recherchées peut être variable. Ainsi, des enzymes codées par des gènes de taille moyenne de 1 kb pourront être exprimées à partir d'inserts de petite taille alors que l'expression de métabolites secondaires nécessiteront le maintien dans l'organisme hôte de fragments de taille bien supérieure, par exemple de 40 kb à plus de 100 kb, 200 kb, 300 kb, 400 kb ou 600 kb. The size of the DNA fragments likely to carry the desired nucleotide sequences of interest may be variable. Thus, enzymes encoded by 1 kb medium size genes can be expressed from small inserts while expression of secondary metabolites will require maintenance of much larger fragments in the host organism, for example from 40 kb to more than 100 kb, 200 kb, 300 kb, 400 kb or 600 kb.

Ainsi, les cellules hôtes de Escherichia coli constituent un choix privilégié pour le clonage de grands fragments d'ADN. Thus, Escherichia coli host cells are a preferred choice for cloning large DNA fragments.

De manière tout à fait préférée, on aura recours à l'utilisation de la souche de Escherichia coli désignée DH10B et décrite par Shizuya et al; (1992) pour laquelle des protocoles de clonage dans des vecteurs BAC ont été optimisés. Most preferably, use will be made of the strain of Escherichia coli designated DH10B and described by Shizuya et al; (1992) for which cloning protocols in BAC vectors have been optimized.

Toutefois, d'autres souches de Escherichia coli peuvent être avantageusement utilisées pour la construction d'une banque d'ADN selon l'invention, telles que les souches E.coli Sure, E.coli DH5 a, ou encore E.coli 294 (ATCC N 31446). However, other Escherichia coli strains can be advantageously used for the construction of a DNA library according to the invention, such as E. coli Sure strains, E.coli DH5α, or E.coli 294 ( ATCC No. 31446).

En outre, la construction d'une banque d'ADN par transfection de cellules de E.coli avec des vecteurs recombinants selon l'invention est également possible, l'expression de gènes de divers procaryotes tels que Bacillus, Thermotoga, Corynebacterium, Lactobacillus ou Clostridium ayant été décrite dans la demande PCT N WO 99/20799. In addition, the construction of a DNA library by transfection of E. coli cells with recombinant vectors according to the invention is also possible, the expression of genes of various prokaryotes such as Bacillus, Thermotoga, Corynebacterium, Lactobacillus or Clostridium having been described in PCT application WO 99/20799.

De manière générale, des cellules hôtes de E.coli peuvent dans tous les cas constituer des hôtes transitoires dans lesquels des vecteurs recombinants selon l'invention pourront être maintenus avec une grande efficacité, le matériel génétique pouvant être facilement manipulé et archivé et façon stable. In general, E. coli host cells may in all cases be transient hosts in which recombinant vectors according to the invention can be maintained with high efficiency, the genetic material can be easily handled and archived and stable manner.

Dans le but d'exprimer la plus grande diversité moléculaire possible, d'autres hôtes cellulaires pourront être également avantageusement mis en oeuvre tels que des cellules de Bacillus, Pseudomonas, Streptomyces, Myxococcus, Aspergillus nidulans ou encore Neurospora crassa. In order to express the greatest possible molecular diversity, other cellular hosts may also be advantageously used, such as Bacillus, Pseudomonas, Streptomyces, Myxococcus, Aspergillus nidulans or Neurospora crassa cells.

Il a en outre été montré selon la présente invention, que des cellules de Streptomyces lividans peuvent être utilisées avec succès et constituent des systèmes d'expression complémentaires à Escherichia coli. It has furthermore been shown according to the present invention that Streptomyces lividans cells can be used successfully and constitute expression systems complementary to Escherichia coli.

Streptomyces lividans constitue un modèle pour l'étude de la génétique des Streptomyces et a également été utilisé comme hôte d'expression hétérologue de nombreux métabolites secondaires. Streptomyces lividans, possède en commun avec d'autres actinomycètes tels que Streptomyces coelicolor, Streptomyces griseus, Streptomyces fradiae, ainsi que Streptomyces griseochromogenes, les molécules précurseurs et les systèmes de régulation nécessaires à l'expression de tout ou partie des voies de biosynthèses complexes, telles que par exemple la voie de biosynthèse des polykétides ou encore la voie de biosynthèse des polypeptides non ribosomiques représentant des classes de molécules de structures très diverses. Streptomyces lividans is a model for studying the genetics of Streptomyces and has also been used as a heterologous expression host for many secondary metabolites. Streptomyces lividans, together with other actinomycetes such as Streptomyces coelicolor, Streptomyces griseus, Streptomyces fradiae, as well as Streptomyces griseochromogenes, the precursor molecules and regulatory systems necessary for the expression of all or part of complex biosynthetic pathways, such as, for example, the polyketide biosynthetic pathway or the non-ribosomal polypeptide biosynthesis pathway representing classes of molecules of very diverse structures.

Streptomyces lividans présente également l'avantage d'accepter l'ADN étranger avec des efficacités de transformation élevées. Streptomyces lividans also has the advantage of accepting foreign DNA with high transformation efficiencies.

Ainsi, l'invention concerne aussi une cellule hôte recombinante comprenant un acide nucléique selon l'invention, constitutif d'une collection d'acides nucléiques préparée selon un procédé conforme à l'invention, ou encore une cellule hôte recombinante comprenant un vecteur recombinant tel que défini précédemment. Thus, the invention also relates to a recombinant host cell comprising a nucleic acid according to the invention, constituting a nucleic acid collection prepared according to a method according to the invention, or a recombinant host cell comprising a recombinant vector such as than previously defined.

Selon un premier aspect, il peut s'agir d'une cellule hôte recombinante d'origine procaryote ou eucaryote. According to a first aspect, it may be a recombinant host cell of prokaryotic or eukaryotic origin.

Avantageusement, une cellule recombinante selon l'invention est une bactérie, et de manière tout à fait préférée une bactérie choisie parmi E.coli et Streptomyces. Advantageously, a recombinant cell according to the invention is a bacterium, and most preferably a bacterium selected from E. coli and Streptomyces.

Selon un autre aspect, une cellule hôte recombinante selon l'invention est caractérisée en ce qu'il s'agit d'une levure ou encore d'un champignon filamenteux. In another aspect, a recombinant host cell according to the invention is characterized in that it is a yeast or a filamentous fungus.

L'invention a également trait à une collection de cellules hôtes recombinantes, chacune des cellules hôtes constitutive de la collection comprenant un acide nucléique provenant d'une collection d'acides nucléiques réalisée conformément à un procédé de préparation d'une collection d'acides nucléiques à partir d'un échantillon de sol contenant des organismes tel que décrit ci-dessus. The invention also relates to a collection of recombinant host cells, each of the host cells constituting the collection comprising a nucleic acid derived from a collection of nucleic acids made according to a process for preparing a nucleic acid collection. from a soil sample containing organisms as described above.

L'invention est également relative à une collection de cellules hôtes recombinantes, chacune des cellules hôtes constitutives de la collection comprenant un vecteur recombinant selon l'invention. The invention also relates to a collection of recombinant host cells, each of the host cells constituting the collection comprising a recombinant vector according to the invention.

En raison de la grande taille des inserts il est nécessaire d'avoir une efficacité maximale de transformation. Dans ce but, une souche réceptrice de Streptomyces lividans exprimant l'intégrase de pSAM2 de façon constitutive afin de favoriser l'intégration site-spécifique du vecteur est préférée. Pour cela, le gène int sous contrôle d'un promoteur fort est intégré dans le chromosome. La surproduction d'intégrase n'induit pas de phénomènes d'excision (Raynal étal., 1998). Due to the large size of the inserts it is necessary to have maximum efficiency of transformation. For this purpose, a Streptomyces lividans receptor strain constitutively expressing the integrase of pSAM2 to promote site-specific vector integration is preferred. For this, the int gene under the control of a strong promoter is integrated into the chromosome. Overproduction of integrase does not induce excision phenomena (Raynal et al., 1998).

La production d'un nouveau métabolite à partir de l'insert pourrait être toxique pour Streptomyces si l'insert ne contient pas de gènes de résistance à l'antibiotique produit ou si ce gène est peu ou pas exprimé. La capacité des différents gènes permettant à Streptomyces ambofaciens de résister à l'antibiotique qu'il produit est étudiée (Gourmelen et al., 1998; Pernodet et al., 1999). Certains de ces gènes codent des transporteurs de type ABC susceptibles de conférer un large spectre de résistance. Ces gènes peuvent être introduits et surexprimés dans la souche hôte de Streptomyces lividans. The production of a new metabolite from the insert could be toxic to Streptomyces if the insert does not contain genes for resistance to the antibiotic produced or if this gene is little or not expressed. The capacity of the different genes allowing Streptomyces ambofaciens to resist the antibiotic it produces is studied (Gourmelen et al., 1998, Pernodet et al., 1999). Some of these genes encode ABC-type transporters that may confer a broad spectrum of resistance. These genes can be introduced and overexpressed in the host strain of Streptomyces lividans.

A l'inverse, une souche hypersensible aux antibiotiques peut être utilisée (Pernodet et al., 1996), afin de détecter dans la banque la présence de gènes de résistance. En effet, chez les micro-organismes producteurs d'antibiotique, ces gènes de résistance sont souvent associés aux gènes de la voie de biosynthèse de l'antibiotique. La sélection de clones résistants peut permettre d'effectuer simplement un premier tri avant les tests plus complexes de détection d'un nouveau métabolite produit par le clone. Conversely, a strain that is hypersensitive to antibiotics can be used (Pernodet et al., 1996) to detect the presence of resistance genes in the library. Indeed, in antibiotic-producing microorganisms, these resistance genes are often associated with genes in the biosynthetic pathway of the antibiotic. The selection of resistant clones can make it possible to simply perform a first sort before the more complex tests for detecting a new metabolite produced by the clone.

ISOLEMENT ET CARACTERISATION DE NOUVELLES SEQUENCES NUCLEOTIDIQUES CODANT POUR DES POLYKETIDES SYNTHASES. ISOLATION AND CHARACTERIZATION OF NEW NUCLEOTIDE SEQUENCES ENCODING SYNTHASE POLYKETIDES.

Selon l'invention, une collection de cellules hôtes recombinantes a été obtenue après transfection des cellules hôtes par une collection de vecteurs recombinants contenant chacun un insert d'acide nucléique provenant d'une collection d'acides nucléiques préparée conformément au procédé selon l'invention. According to the invention, a collection of recombinant host cells has been obtained after transfection of the host cells by a collection of recombinant vectors each containing a nucleic acid insert from a collection of nucleic acids prepared in accordance with the method according to the invention. .

Plus précisément, les fragments d'ADN obtenus selon le procédé de l'invention dans lequel il est mis en oeuvre une étape d'extraction indirecte d'ADN des organismes contenus dans l'échantillon de sol ont été tout d'abord clonés dans le cosmide intégratif pOS7001. More specifically, the DNA fragments obtained according to the process of the invention in which it is implemented a step of indirect extraction of DNA from the organisms contained in the soil sample were first cloned into the cosmid integrative pOS7001.

L'étape d'insertion des fragments d'ADN dans le cosmide intégratif pOS7001 a été réalisée selon le procédé de l'invention dans lequel des queues de polynucléotides homopolymériques poly(A) et poly(T) ont été ajoutées à l'extrémité 3' respectivement de l'acide nucléique du vecteur et des fragments d'ADN à insérer. The step of inserting the DNA fragments into the integrative cosmid pOS7001 was carried out according to the process of the invention in which poly (A) and poly (T) homopolymeric polynucleotide tails were added at the 3-end. respectively, the nucleic acid of the vector and the DNA fragments to be inserted.

Les vecteurs recombinants ainsi construits ont été encapsidés dans des têtes de phage lambda et les phages obtenus ont été utilisés pour infecter des cellules de E. coli selon des techniques bien connues de l'homme du métier. The recombinant vectors thus constructed were encapsidated in lambda phage heads and the resulting phages were used to infect E. coli cells according to techniques well known to those skilled in the art.

Une banque d'environ 5000 clones de Escherichia coli a été obtenue. A library of approximately 5000 Escherichia coli clones was obtained.

Cette banque de clones a été criblée avec des couples d'amorces spécifiques d'une séquence nucléotidique codant pour une enzyme impliquée dans la voie de biosynthèse des polykétides, l'enzyme PKS de type I, aussi désignée ss-kétoacyl synthase . This library of clones was screened with pairs of primers specific for a nucleotide sequence coding for an enzyme involved in the polyketide biosynthetic pathway, the type I PKS enzyme, also referred to as β-ketoacyl synthase.

On rappelle ici que les polykétides constituent une classe chimique d'une grande diversité structurale comprenant un nombre important de molécules d'intérêt pharmaceutique tels que la tylosine, la monensine, la vermectine, l'érythromycine, la doxorubicine ou encore le FK506. It is recalled here that polyketides constitute a chemical class of great structural diversity comprising a large number of molecules of pharmaceutical interest such as tylosin, monensin, vermectin, erythromycin, doxorubicin or FK506.

Les polykétides sont synthétisés par condensation de molécules d'acétate sous l'action d'enzymes appelées polykétide synthases (PKSs). Polyketides are synthesized by condensation of acetate molecules by the action of enzymes called polyketide synthases (PKSs).

Il existe deux types de polykétide synthases. Les polykétide synthases de type Il sont impliquées en général dans la synthèse des antibiotiques aromatiques polycycliques et catalysent la condensation d'unités acétate de façon itérative. There are two types of polyketide synthases. Type II polyketide synthases are generally involved in the synthesis of polycyclic aromatic antibiotics and catalyze the condensation of acetate units iteratively.

Les polykétide synthases de type I sont impliquées dans la synthèse des polykétides macrocycliques ou macrolides et constituent des enzymes modulaires multifonctionnelles. Type I polyketide synthases are involved in the synthesis of macrocyclic or macrolide polyketides and are multifunctional modular enzymes.

Compte-tenu de leur intérêt thérapeutique, il existe un besoin dans l'état de la technique d'isoler et de caractériser de nouvelles polykétides synthases qui peuvent être utilisées pour la production de In view of their therapeutic interest, there is a need in the state of the art to isolate and characterize novel polyketide synthases that can be used for the production of

nouveaux composés pharmaceutiques, notamment de nouveaux composés pharmaceutiques à activité antibiotique. new pharmaceutical compounds, especially new pharmaceutical compounds with antibiotic activity.

Le criblage de la banque de clones recombinants décrite cidessus à l'aide d'amorces PCR amplifiant sélectivement des séquences nucléotidiques codant pour des polykétide synthases de type # a permis d'identifier des clones recombinants contenant des inserts d'ADN comprenant une séquence nucléotidique codant pour de nouvelles polykétide synthases. Les séquences nucléotidiques codant pour ces nouvelles polykétides synthases sont référencées comme les séquences SEQ ID N 33 à SEQ ID N 44. Screening of the recombinant clone library described above with PCR primers selectively amplifying nucleotide sequences encoding # type polyketide synthases has identified recombinant clones containing DNA inserts comprising a nucleotide sequence encoding for new polyketide synthases. The nucleotide sequences coding for these novel polyketide synthases are referenced as the sequences SEQ ID N 33 to SEQ ID N 44.

Un autre objet de l'invention consiste en un acide nucléique codant pour une nouvelle polykétide synthase I, caractérisé en ce qu'il comprend l'une des séquences nucléotidiques SEQ ID N 34 à SEQ ID N 44. Another subject of the invention consists of a nucleic acid coding for a novel polyketide synthase I, characterized in that it comprises one of the nucleotide sequences SEQ ID N 34 to SEQ ID N 44.

De préférence, un tel acide nucléique se présente sous une forme isolée et/ou purifiée. Preferably, such nucleic acid is in an isolated and / or purified form.

L'invention concerne aussi un vecteur recombinant comprenant un polynucléotide comprenant l'une des séquences SEQ ID N 34 à SEQ ID N 44 . The invention also relates to a recombinant vector comprising a polynucleotide comprising one of the sequences SEQ ID N 34 to SEQ ID N 44.

L'invention a également trait à une cellule hôte recombinante comprenant un acide nucléique choisi parmi les polynucléotides comprenant l'une des séquences nucléotidiques SEQ ID N 34 à SEQ ID N 44 ainsi qu'à une cellule hôte recombinante comprenant un vecteur recombinant dans lequel est inséré un polynucléotide comprenant l'une des séquences nucléotidiques SEQ ID N 34 à SEQ ID N 44. The invention also relates to a recombinant host cell comprising a nucleic acid selected from polynucleotides comprising one of the nucleotide sequences SEQ ID N 34 to SEQ ID N 44 as well as to a recombinant host cell comprising a recombinant vector in which is inserted a polynucleotide comprising one of the nucleotide sequences SEQ ID N 34 to SEQ ID N 44.

Avantageusement, les vecteurs recombinants contenant un insert d'ADN codant pour une nouvelle polykétide synthase de type 1 selon l'invention sont des vecteurs de clonage et d'expression. Advantageously, the recombinant vectors containing a DNA insert coding for a new type 1 polyketide synthase according to the invention are cloning and expression vectors.

De préférence, une cellule hôte recombinante telle que décrite cidessus est une bactérie, une levure ou encore un champignon filamenteux. Preferably, a recombinant host cell as described above is a bacterium, a yeast or a filamentous fungus.

Les séquences en acides aminés de nouvelles polykétide synthases provenant d'organismes contenus dans un échantillon de sol ont été déduites des séquences nucléotidiques SEQ ID N 34 à SEQ ID N 44 ci-dessus. Il s'agit des polypeptides comprenant l'une des séquences en acides aminés SEQ ID N 48 à SEQ ID N 59. The amino acid sequences of novel polyketide synthases from organisms in a soil sample were deduced from the nucleotide sequences SEQ ID N 34 to SEQ ID N 44 above. These are polypeptides comprising one of the amino acid sequences SEQ ID N 48 to SEQ ID N 59.

L'invention concerne encore de nouvelles polykétides synthases comprenant une séquence en acides aminés choisie parmi les séquences SEQ ID N 48 à SEQ ID N 59. The invention also relates to novel polyketide synthases comprising an amino acid sequence selected from the sequences SEQ ID N 48 to SEQ ID N 59.

On a aussi extrait et amplifié selon l'invention de l'ADN génomique provenant de souches bactériennes pures, telles que Streptomyces coelicolor (ATCC N 101.478), Streptomyces ambofaciens (NRRL N 2.420), Streptomyces lactamandurans (ATCC N 27.382), Streptomyces rimosus (ATCC N 109.610), Bacillus subtilis (ATCC N 6633) ou encore Bacillus lichenifornis et Saccharopolyspora erythrea. Genomic DNA from pure bacterial strains, such as Streptomyces coelicolor (ATCC N 101.478), Streptomyces ambofaciens (NRRL N 2.420), Streptomyces lactamandurans (ATCC N 27.382), Streptomyces rimosus ( ATCC N 109.610), Bacillus subtilis (ATCC N 6633) or Bacillus lichenifornis and Saccharopolyspora erythrea.

Une amplification par PCR de l'ADN de chacune des souches bactériennes décrites ci-dessus a été effectuée à l'aide des couples d'amorces spécifiques des séquences nucléiques de polykétide synthase de type 1. PCR amplification of the DNA of each of the bacterial strains described above was carried out using the primer pairs specific for the type 1 polyketide synthase nucleic acid sequences.

De nouveaux gènes de polykétide synthases de type # bactériennes ont ainsi pu être isolés et caractérisés. Il s'agit des séquences nucléiques de séquences SEQ ID N 30 à SEQ ID N 32. New bacterial-type polyketide synthase genes have thus been isolated and characterized. These are the nucleic sequences of sequences SEQ ID N 30 to SEQ ID N 32.

L'invention a donc en outre pour objet des séquences nucléotidiques codant pour de nouvelles polykétides synthases de type # choisies parmi les polynucléotides comprenant l'une des séquences nucléotidiques SEQ ID N 30 à SEQ ID N 32. The invention therefore also relates to nucleotide sequences coding for novel polyketide synthases # selected from polynucleotides comprising one of the nucleotide sequences SEQ ID N 30 to SEQ ID N 32.

Font également partie de l'invention des vecteurs recombinants comprenant les séquences nucléotidiques codant pour de nouvelles polykétides synthases de type # définies ci-dessus. Also included in the invention are recombinant vectors comprising the nucleotide sequences coding for new type # polyketide synthases defined above.

L'invention concerne aussi des cellules hôtes recombinantes caractérisées en ce qu'elles contiennent un acide nucléique codant pour une nouvelle polykétide synthase de type 1 comprenant une séquence nucléotidique choisie parmi les séquences SEQ ID N 30 à SEQ ID N 32 ainsi que des cellules hôtes recombinantes comprenant un vecteur recombinant tel que défini ci-dessus. The invention also relates to recombinant host cells characterized in that they contain a nucleic acid encoding a novel type 1 polyketide synthase comprising a nucleotide sequence chosen from the sequences SEQ ID N 30 to SEQ ID N 32 as well as host cells recombinant comprising a recombinant vector as defined above.

L'invention a également pour objet des polypeptides codés par des séquences comprenant les acides nucléiques SEQ ID N 30 à 32, et plus précisément des polypeptides comprenant les séquences d'acides aminés SEQ ID N 47 à SEQ ID N 50. The subject of the invention is also polypeptides encoded by sequences comprising the nucleic acids SEQ ID N 30 to 32, and more specifically polypeptides comprising the amino acid sequences SEQ ID N 47 to SEQ ID N 50.

L'invention a en outre pour objet un procédé de production d'une polykétide synthase de type 1 selon l'invention, ledit procédé de production comprenant les étapes suivantes: - obtention d'une cellule hôte recombinante comprenant un acide nucléique codant pour une polykétide synthase de type 1 comprenant une séquence nucléotidique choisie parmi les séquences SEQ ID N 33 à SEQ ID N 44 et SEQ ID N 30 à SEQ ID N 32; - culture des cellules hôtes recombinantes dans un milieu de culture approprié ; - récupération et, le cas échéant, purification de la polykétide synthase de type I à partir du surnageant de culture ou du lysat cellulaire. The subject of the invention is furthermore a process for producing a type 1 polyketide synthase according to the invention, said production process comprising the following steps: obtaining a recombinant host cell comprising a nucleic acid coding for a polyketide type 1 synthase comprising a nucleotide sequence selected from the sequences SEQ ID N 33 to SEQ ID N 44 and SEQ ID N 30 to SEQ ID N 32; - culture of recombinant host cells in a suitable culture medium; recovery and, where appropriate, purification of the type I polyketide synthase from the culture supernatant or the cell lysate.

Les nouvelles polykétide synthases de type 1 obtenues selon le procédé décrit ci-dessus peuvent être caractérisées par fixation sur une colonne de chromatographie d'immuno-affinité sur laquelle des anticorps reconnaissant ces polykétides synthases ont été préalablement immobilisés. The new type 1 polyketide synthases obtained according to the method described above can be characterized by attachment to an immunoaffinity chromatography column on which antibodies recognizing these polyketide synthases have been immobilized beforehand.

Les polykétide synthases de type # selon l'invention, et plus particulièrement les polykétide synthases recombinantes décrites cidessus peuvent être aussi purifiées par des techniques de chromatographie liquide à haute performance (HPLC), telles que par exemple des techniques de chromatographie en phase inverse ou de chromatographie d'échanges d'anions ou de cations, bien connues de l'homme du métier. The polyketide synthases of the type # according to the invention, and more particularly the recombinant polyketide synthases described above can also be purified by high performance liquid chromatography (HPLC) techniques, such as, for example, reverse phase chromatography techniques or chromatography of anion exchanges or cations, well known to those skilled in the art.

Les polykétide synthases, recombinantes ou non recombinantes, selon l'invention peuvent être utilisées pour la préparation d'anticorps. The polyketide synthases, recombinant or non-recombinant, according to the invention can be used for the preparation of antibodies.

Selon un autre aspect, l'invention a donc encore pour objet un anticorps reconnaissant spécifiquement une polykétide synthase de type # selon l'invention ou un fragment peptidique d'une telle polykétide synthase. According to another aspect, the subject of the invention is therefore an antibody specifically recognizing a # type polyketide synthase according to the invention or a peptide fragment of such a polyketide synthase.

Les anticorps selon l'invention peuvent être monoclonaux ou polyclonaux . Les anticorps monoclonaux peuvent être préparés à partir The antibodies according to the invention may be monoclonal or polyclonal. Monoclonal antibodies can be prepared from

de cellules d'hybridome selon la technique décrite par KOHLER et MILSTEIN C. (1975), Nature, Vol.256:495. of hybridoma cells according to the technique described by KOHLER and MILSTEIN C. (1975), Nature, Vol.256: 495.

Les anticorps polyclonaux peuvent être préparés par immunisation d'un mammifère, en particulier des souris, des rats ou des lapins avec une polykétide synthase de type 1 selon l'invention, le cas échéant en présence d'un composé adjuvant de l'immunité, tels que l'adjuvant complet de Freund, l'adjuvant incomplet de Freund, l'hydroxyde d'aluminium ou encore un composé de la famille des muramyl peptides. The polyclonal antibodies can be prepared by immunization of a mammal, in particular mice, rats or rabbits with a polyketide synthase type 1 according to the invention, if appropriate in the presence of an adjuvant compound of immunity, such as Freund's complete adjuvant, Freund's incomplete adjuvant, aluminum hydroxide or a compound of the muramyl peptide family.

Constituent également des " anticorps " au sens de la présente invention, les fragments d'anticorps tels que les fragments Fab, Fab', F(ab')2, ou encore les fragments d'anticorps simple chaîne contenant la partie variable (ScFv) décrits par MARTINEAU et al. (1998) J. Mol. Biol., Vol.280 (1):117-127 ou encore dans le brevet US 4,946,778, ainsi que les anticorps humanisés décrits par REINMANN KA et al. (1997), AIDS Res. Also "antibodies" within the meaning of the present invention include antibody fragments such as Fab, Fab ', F (ab') 2 fragments, or even single chain antibody fragments containing the variable part (ScFv). described by MARTINEAU et al. (1998) J. Mol. Biol., Vol.280 (1): 117-127 or in US Pat. No. 4,946,778, as well as the humanized antibodies described by REINMANN KA et al. (1997), AIDS Res.

Hum. Retroviruses, vol. 13(11):933-943 ou par LEGER O. J et al. (1997), Hum. Antibodies, vol.8 (1): 3-16. Hmm. Retroviruses, vol. 13 (11): 933-943 or by LEGER O. J et al. (1997), Hum. Antibodies, vol.8 (1): 3-16.

Les préparations d'anticorps selon l'invention sont utiles notamment dans des tests immunologiques qualitatifs ou quantitatifs visant, soit à simplement détecter la présence d'une polykétide synthase de type 1 selon l'invention, soit à quantifier la quantité de cette polykétide synthase, par exemple dans le surnageant de culture ou le lysat cellulaire d'une souche bactérienne susceptible de produire une telle enzyme. The antibody preparations according to the invention are useful in particular in qualitative or quantitative immunological tests aimed either at simply detecting the presence of a polyketide synthase of type 1 according to the invention, or at quantifying the amount of this polyketide synthase, for example in the culture supernatant or the cell lysate of a bacterial strain capable of producing such an enzyme.

Un autre objet de l'invention consiste en un procédé de détection d'une polykétide synthase de type # selon l'invention ou un fragment peptidique de cette enzyme, dans un échantillon, ledit procédé comprenant les étapes de : a) mettre en contact un anticorps selon l'invention avec l'échantillon à tester; b) détecter le complexe antigène/anticorps éventuellement formé. Another subject of the invention consists in a method for detecting a polyketide synthase of the type # according to the invention or a peptide fragment of this enzyme, in a sample, said process comprising the steps of: a) contacting a antibody according to the invention with the test sample; b) detecting the antigen / antibody complex that may be formed.

L'invention est également relative à un kit ou nécessaire de détection d'une polykétide synthase de type I selon l'invention dans un échantillon, comprenant : The invention also relates to a kit or kit for detecting a type I polyketide synthase according to the invention in a sample, comprising:

a) un anticorps selon l'invention; b) le cas échéant, des réactifs nécessaires à la détection du complexe antigène/anticorps éventuellement formé. a) an antibody according to the invention; b) if necessary, reagents necessary for the detection of the antigen / antibody complex that may be formed.

Un anticorps dirigé contre une polykétide synthase de type # selon l'invention peut être marqué à l'aide d'un marqueur détectable isotopique ou non isotopique, selon des procédés bien connus de l'homme du métier. An antibody directed against a polyketide synthase of the type # according to the invention may be labeled using an isotopic or non-isotopic detectable label, according to methods well known to those skilled in the art.

Le criblage d'une banque d'ADN selon l'invention à l'aide d'une paire d'amorces hybridant avec des séquences cibles dont la présence est recherchée, telles que des séquences de la voie de biosynthèse de la puromycine, des séquences du gène linA impliquées dans la biodégradation du lindane ou encore des séquences codant pour des polykétides synthases de type # ont été détaillées ci-avant. Screening of a DNA library according to the invention using a pair of primers hybridizing with target sequences whose presence is sought, such as sequences of the puromycin biosynthetic pathway, sequences of the linA gene involved in the biodegradation of lindane or sequences encoding # type polyketide synthases have been detailed above.

L'invention a donc pour objet un procédé de détection d'un acide nucléique de séquence nucléotidique déterminée, ou de séquence nucléotidique structuralement apparentée à une séquence nucléotidique déterminée, dans une collection de cellules hôtes recombinantes selon l'invention, caractérisé en ce qu'il comprend les étapes suivantes: - mettre en contact la collection de cellules hôtes recombinantes avec un couple d'amorces hybridant avec la séquence nucléotidique déterminée ou hybridant avec la séquence nucléotidique structurellement apparentée à une séquence nucléotidique déterminée ; - réaliser au moins trois cycles d'amplification ; - détecter l'acide nucléique éventuellement amplifié. The subject of the invention is therefore a method for detecting a nucleic acid with a determined nucleotide sequence, or a nucleotide sequence structurally related to a determined nucleotide sequence, in a collection of recombinant host cells according to the invention, characterized in that it comprises the following steps: bringing the collection of recombinant host cells into contact with a pair of primers hybridizing with the determined nucleotide sequence or hybridizing with the nucleotide sequence structurally related to a determined nucleotide sequence; - perform at least three amplification cycles; detect the optionally amplified nucleic acid.

Pour les conditions d'amplification appropriées en fonction des séquences cibles recherchées, l'homme du métier pourra se référer avantageusement aux exemples ci-dessous. For the appropriate amplification conditions depending on the desired target sequences, those skilled in the art can advantageously refer to the examples below.

Selon un autre aspect, l'invention concerne aussi un procédé de détection d'un acide nucléique, de séquences nucléotidiques déterminées, ou de séquences nucléotidiques structurellement apparentées à une séquence nucléotidique déterminée, dans une collection de cellules hôtes recombinantes selon l'invention, caractérisé en ce qu'il comprend les étapes suivantes: - mettre en contact la collection de cellules hôtes recombinantes avec une sonde hybridant avec la séquence nucléotidique déterminée ou According to another aspect, the invention also relates to a method for detecting a nucleic acid, specific nucleotide sequences, or nucleotide sequences structurally related to a determined nucleotide sequence, in a collection of recombinant host cells according to the invention, characterized in that it comprises the following steps: - bringing the collection of recombinant host cells into contact with a probe hybridizing with the determined nucleotide sequence or

hybridant avec une séquence nucléotidique structurellement apparentée à la séquence nucléotidique déterminée ; - détecter l'hybride éventuellement formé entre la sonde et les acides nucléiques compris dans les vecteurs de la collection. hybridizing with a nucleotide sequence structurally related to the determined nucleotide sequence; detect the hybrid possibly formed between the probe and the nucleic acids included in the vectors of the collection.

Pour effectuer le criblage d'une banque d'ADN selon l'invention en vue de détecter la présence d'une séquence nucléotidique codant pour un polypeptide capable de dégrader le lindane, on a détecté les clones recombinants d'intérêt par leur phénotype correspondant à leur capacité à dégrader le lindane. Dans ce but, les clones isolés et/ou des ensembles de clones de la banque d'ADN préparée ont été mis en culture dans un milieu de culture en présence de lindane et la dégradation du lindane a été observée par la formation d'un halo trouble dans l'environnement immédiat des cellules. To screen a DNA library according to the invention for the presence of a nucleotide sequence coding for a polypeptide capable of degrading lindane, the recombinant clones of interest were detected by their phenotype corresponding to their ability to degrade lindane. For this purpose, the isolated clones and / or sets of clones of the prepared DNA library were cultured in a culture medium in the presence of lindane and the degradation of lindane was observed by the formation of a halo trouble in the immediate environment of the cells.

L'invention concerne aussi un procédé pour identifier la production d'un composé d'intérêt par une ou plusieurs cellules hôtes recombinantes dans une collection de cellules hôtes recombinantes selon l'invention, caractérisé en ce qu'il comprend les étapes suivantes: - culture des cellules hôtes recombinantes de la collection dans un milieu de culture approprié ; - détection du composé d'intérêt dans le surnageant de culture ou dans le lysat cellulaire d'une ou plusieurs des cellules recombinantes cultivées. The invention also relates to a method for identifying the production of a compound of interest by one or more recombinant host cells in a collection of recombinant host cells according to the invention, characterized in that it comprises the following steps: - culture recombinant host cells of the collection in a suitable culture medium; detection of the compound of interest in the culture supernatant or in the cell lysate of one or more of the recombinant cells cultured.

L'invention a en outre pour objet un procédé pour sélectionner une cellule hôte recombinante produisant un composé d'intérêt dans une collection de cellules hôtes recombinantes selon l'invention, caractérisé en ce qu'il comprend les étapes suivantes: - culture des cellules hôtes recombinantes de la collection dans un milieu de culture approprié ; - détection du composé d'intérêt dans le surnageant de culture ou dans le lysat cellulaire d'une ou plusieurs des cellules hôtes recombinantes cultivées; - sélection des cellules hôtes recombinantes produisant le composé d'intérêt. The invention further relates to a method for selecting a recombinant host cell producing a compound of interest in a collection of recombinant host cells according to the invention, characterized in that it comprises the following steps: - culture of the host cells recombinants from the collection in a suitable culture medium; detecting the compound of interest in the culture supernatant or in the cell lysate of one or more of the recombinant cultured host cells; selection of the recombinant host cells producing the compound of interest.

L'invention concerne encore un procédé pour la production d'un composé d'intérêt caractérisé en ce qu'il comprend les étapes suivantes: - cultiver une cellule hôte recombinante sélectionnée selon le procédé décrit ci-dessus; - récupérer et, le cas échéant, purifier, le composé produit par ladite cellule hôte recombinante. The invention also relates to a method for the production of a compound of interest characterized in that it comprises the following steps: cultivating a recombinant host cell selected according to the method described above; recovering and, if necessary, purifying, the compound produced by said recombinant host cell.

L'invention est également relative à un composé d'intérêt caractérisé en ce qu'il est obtenu selon le procédé ci-dessus décrit. The invention also relates to a compound of interest characterized in that it is obtained according to the method described above.

Un composé d'intérêt selon l'invention peut consister en un polykétide produit grâce à l'expression d'au moins une séquence nucléotidique comprenant une séquence choisie parmi les séquences SEQ ID N 33 à 44 et SEQ ID N 30 à 32. A compound of interest according to the invention may consist of a polyketide produced by the expression of at least one nucleotide sequence comprising a sequence chosen from the sequences SEQ ID N 33 to 44 and SEQ ID N 30 to 32.

L'invention concerne encore une composition comprenant un polykétide produit grâce à l'expression d'au moins une séquence nucléotidique comprenant une séquence choisie parmi les séquences SEQ ID N 33 à SEQ ID N 44 et SEQ ID N 30 à SEQ ID N 32. The invention also relates to a composition comprising a polyketide produced by the expression of at least one nucleotide sequence comprising a sequence chosen from the sequences SEQ ID N 33 to SEQ ID N 44 and SEQ ID N 30 to SEQ ID N 32.

Un polykétide produit grâce à l'expression d'au moins une séquence nucléotidique ci-dessus est préférentiellement le produit de l'activité de plusieurs séquences codantes incluses au sein d'un opéron fonctionnel dont les produits de traduction sont les différentes enzymes nécessaires à la synthèse d'un polykétide, l'une des séquences ci-dessus étant comprise et exprimée dans ledit opéron. Un tel opéron comprenant une séquence d'acide nucléique selon l'invention codant pour une polykétide synthase peut être construit par exemple selon l'enseignement de Borchert et al. (1992). A polyketide produced by the expression of at least one nucleotide sequence above is preferably the product of the activity of several coding sequences included within a functional operon whose translation products are the various enzymes necessary for the synthesis of a polyketide, one of the above sequences being included and expressed in said operon. Such an operon comprising a nucleic acid sequence according to the invention encoding a polyketide synthase can be constructed for example according to the teaching of Borchert et al. (1992).

L'invention est encore relative à une composition pharmaceutique comprenant une quantité pharmacologiquement active d'un polykétide selon l'invention, le cas échéant en association avec un véhicule pharmaceutiquement compatible. The invention also relates to a pharmaceutical composition comprising a pharmacologically active amount of a polyketide according to the invention, where appropriate in combination with a pharmaceutically compatible vehicle.

De telles compositions pharmaceutiques seront avantageusement adaptées pour l'administration, par exemple par voie parentérale, d'une quantité d'un polykétide synthétisé par une polykétide synthase de type 1 selon l'invention allant de1ug/kg par jour à 10 mg/kg par jour, de préférence au moins 0,01 mg/kg par jour et de manière tout à fait préférée entre 0,01 et 1 mg/kg par jour. Such pharmaceutical compositions will advantageously be suitable for the administration, for example parenterally, of an amount of a polyketide synthesized by a polyketide synthase type 1 according to the invention ranging from 1 μg / kg per day to 10 mg / kg per preferably at least 0.01 mg / kg per day and most preferably between 0.01 and 1 mg / kg per day.

Les compositions pharmaceutiques selon l'invention peuvent être indifféremment administrées par voie orale, rectale, parentérale, intraveineuse, sous-cutanée ou encore intradermique. The pharmaceutical compositions according to the invention can be indifferently administered orally, rectally, parenterally, intravenously, subcutaneously or intradermally.

L'invention concerne aussi l'utilisation d'un polykétide obtenu grâce à l'expression d'une polykétide synthase de type 1 selon l'invention pour la fabrication d'un médicament, en particulier d'un médicament à activité antibiotique. The invention also relates to the use of a polyketide obtained by the expression of a polyketide synthase type 1 according to the invention for the manufacture of a medicament, in particular a drug with antibiotic activity.

L'invention sera en outre illustrée, sans pour autant être limitée, par les figures et les exemples ci-après. The invention will be further illustrated, without being limited, by the figures and examples below.

La figure 1 illustre le schéma des différentes étapes de lyse effectuées selon les protocoles 1,2, 3n 4a, 4b, 5a, et 5b décrits à l'exemple 1. FIG. 1 illustrates the diagram of the various lysis steps carried out according to the protocols 1,2, 3n 4a, 4b, 5a, and 5b described in Example 1.

La Figure 2 illustre une. électrophorèse sur gel d'agarose 0. 8% des ADN extraits à partir de 300 mg du sol n 3 (Côte St André) après différents traitements de lyse (protocoles 1 à 5, cf. Fig. 1). M : marqueur de poids moléculaire de phage lambda
La Figure 3 illustre la proportion de différents genres d'actinomycètes cultivés à la suite des traitements 1 à 5 (cf. Fig. 1). Le nombre d'ufc (unité formant colonie) a été déterminé sur un milieu sélectif pour ce groupe de bactéries. Un nombre total d'environ 400 colonies a été analysé. Figure 2 illustrates a. agarose gel electrophoresis 0.8% of the DNAs extracted from 300 mg of soil n 3 (Côte St André) after various lysis treatments (protocols 1 to 5, see Fig. 1). M: lambda phage molecular weight marker
Figure 3 illustrates the proportion of different kinds of actinomycetes grown as a result of treatments 1 to 5 (see Fig. 1). The number of cfu (colony-forming unit) was determined on a selective medium for this group of bacteria. A total of about 400 colonies were analyzed.

La Figure 4 illustre la. récupération d'ADN de phage lambda digéré par Hindlll additionné dans les sols à différentes concentrations avant (G) ou après (G*) broyage. Les traitements T (chocs thermiques) et S (sonication) sont des traitements additionnels de lyse. La quantification a été réalisée par analyse au phospho-imageur après hybridation en dotblot. Un échantillon de chaque sol a été utilisé pour chaque concentration de phage lambda ajouté. Les caractéristiques des sol sont reproduites dans le tableau 1. Les échantillons correspondant à 10 et 15 g d'ADN ajouté n'ont pas été traités. Figure 4 illustrates the. recovery of lambda digested lambda phage DNA added to the soil at different concentrations before (G) or after (G *) grinding. T (heat shock) and S (sonication) treatments are additional lysis treatments. Quantification was performed by phosphorimager analysis after dotblot hybridization. One sample of each soil was used for each added lambda phage concentration. The soil characteristics are reproduced in Table 1. Samples corresponding to 10 and 15 g of added DNA were not processed.

La Figure 5 illustre l'amplification par PCR des ADN extraits à partir de sol n 3 selon les protocoles 1,2, 3, 5a et 5b. Les amorces FGPS 122 et FGPS 350 (tableau 2) ont été utilisés afin de cibler Streptosporangium spp. indigènes. Les extraits d'ADN ont été utilisés non dilués ou dilués au 1/10ème et 1/100ème . M : marqueur de poids moléculaires 123 pb ( Gibco BRL), C : d'amplification sans ADN. Figure 5 illustrates the PCR amplification of DNAs extracted from soil n 3 according to the protocols 1,2, 3, 5a and 5b. FGPS 122 and FGPS 350 primers (Table 2) were used to target Streptosporangium spp. native. The DNA extracts were used undiluted or diluted to 1 / 10th and 1 / 100th. M: 123 bp molecular weight marker (Gibco BRL), C: amplification without DNA.

La Figure 6 illustre les quantités d'ADN extrait après inoculation de spores (a) ou de mycélium (b) de S. lividans OS48. 3 inoculés dans les sols à différentes concentrations. La quantités de mycélium ajoutée dans le sol correspond au nombre de spores inoculées dans le milieu de germination. Environ 50% des spores ont germé, le nombre de cellules ou de génomes contenues dans les hyphes des spores germées n'a pas été déterminé. Les quantités de spores et de mycélium inoculées ne sont donc pas directement comparables. Le protocole d'extraction a été mené selon le protocole 6 (cf. section matériel et méthodes). Le symbole (') indique que de l'ARN a été inclus dans le tampon d'extraction. L'ADN cible a été amplifié par PCR avec les amorces FGPS 516 et FGPS 517, la quantification a été réalisée par phosphoimageur après hybridation en dot blot en utilisant le sonde FGPS 518. Un échantillon de chaque sol a été utilisé pour chaque concentration d'hyphes ou de spores. Les caractéristiques des sols sont décrites dans le tableau 1. Figure 6 illustrates the amounts of DNA extracted after inoculation of S. lividans OS48 spores (a) or mycelium (b). 3 inoculated in soils at different concentrations. The amount of mycelium added to the soil corresponds to the number of spores inoculated in the germination medium. About 50% of the spores germinated, the number of cells or genomes contained in the hyphae of germinated spores was not determined. The quantities of inoculated spores and mycelium are therefore not directly comparable. The extraction protocol was carried out according to protocol 6 (see section material and methods). The symbol (') indicates that RNA has been included in the extraction buffer. The target DNA was amplified by PCR with the primers FGPS 516 and FGPS 517, the quantification was carried out by phosphoimageur after dot blot hybridization using the FGPS 518 probe. A sample of each soil was used for each concentration of hyphae or spores. Soil characteristics are described in Table 1.

La figure 7 représente l'arbre phylogénétique obtenu par l'algorithme de Neighbour Joining , positionnant les séquences d'ADNr 16S contenues dans la banque d'ADN du sol, par rapport à des bactéries de références cultivées. Figure 7 shows the phylogenetic tree obtained by the Neighbor Joining algorithm, positioning the 16S rDNA sequences contained in the soil DNA library, relative to cultured reference bacteria.

En grisé:.les séquences issues des pools de clones de la banque. In gray: .the sequences from clone pools of the bank.

Les valeurs de bootstrap sont indiquées au niveau des noeuds, après rééchantillonnage de 100 répétitions. La barre d'échelle indique le Bootstrap values are indicated at the node level, after resampling 100 repetitions. The scale bar indicates the

nombre de substitutions par site. Le numéro d'accès des séquences dans la base de données Genbank est indiqué entre parenthèses. number of substitutions per site. Sequence access numbers in the Genbank database are indicated in parentheses.

La figure 8 représente un schéma du vecteur pOSint 1. FIG. 8 represents a diagram of the vector pOSint 1.

La figure 9 représente un schéma du vecteur pWED1. Figure 9 shows a diagram of the vector pWED1.

La figure 10 représente un schéma du vecteur pWE15 (ATCC N 37503). Figure 10 is a diagram of the vector pWE15 (ATCC N 37503).

La figure 11 représente un schéma du vecteur pOS 7001. Figure 11 shows a diagram of the vector pOS 7001.

La figure 12 représente un schéma du vecteur pOSV010. Figure 12 shows a diagram of the vector pOSV010.

La figure 13 représente le fragment contenant un site " cos inséré dans le plasmide pOSV010 au cours de la construction du vecteur pOSV 303. Figure 13 shows the fragment containing a cos site inserted into the plasmid pOSV010 during the construction of the pOSV 303 vector.

La figure 14 représente un schéma du vecteur pOSV 303. Figure 14 is a diagram of the vector pOSV 303.

La figure 15 représente un schéma du vecteur pE116. Figure 15 shows a diagram of the vector pE116.

La figure 16 représente un schéma du vecteur pOS 700 R. FIG. 16 represents a diagram of the vector pOS 700 R.

La figure 17 représente un schéma du vecteur pOSV 001. Figure 17 shows a diagram of the vector pOSV 001.

La figure 18 représente le schéma du vecteur pOSV 002. Figure 18 shows the scheme of the vector pOSV 002.

La figure 19 représente un schéma du vecteur pOSV 014. Figure 19 is a diagram of the vector pOSV 014.

La figure 20 représente un schéma du vecteur pBAC 11. Figure 20 shows a diagram of the vector pBAC 11.

La figure 21 représente un schéma du vecteur pOSV 403. Figure 21 is a diagram of the vector pOSV 403.

La figure 22 représente les gels d'électrophorèse d'ADN de la banque après digestion par les enzymes BamHl et Oral des clones positifs de la banque criblée avec les oligonucléotides PKS-I. Figure 22 shows the DNA electrophoresis gels of the library after digestion with BamHI and Oral enzymes of the positive clones of the library screened with PKS-I oligonucleotides.

La figure 23 illustre la production de puromycine par les recombinants de S. lividans comparée à la production de la souche sauvage S. alboniger. Figure 23 illustrates the production of puromycin by S. lividans recombinants compared to the production of the S. alboniger wild strain.

La Figure 24 illustre Alignement de PKSs du sol avec les sites actifs conservés d'autres PKSs. Les références pour chaque peptide sont indiquées. Les domaines béta-kétoacyl synthase ont été alignés en utilisant le programme PILEUP de GCG (Wisconsin Package Version 9.1, Genetics Computer Group, Madison, Wisc). Figure 24 illustrates Alignment of soil PKSs with conserved active sites of other PKSs. References for each peptide are indicated. The beta-ketoacyl synthase domains have been aligned using GCG's PILEUP program (Wisconsin Package Version 9.1, Genetics Computer Group, Madison, Wisc).

La Figure 25 illustre la construction d'un cosmide intégratif conjugatif. Figure 25 illustrates the construction of a conjugative integrative cosmid.

La Figure 26 illustre la construction d'un BAC intégratif conjugatif. Figure 26 illustrates the construction of a conjugative integrative BAC.

EXEMPLES : EXAMPLES

EXEMPLE 1 : de préparation d'une collection d'acides nucléiques à partir d'un échantillon de sol contenant des organismes, comprenant une étape d'extraction directe d'ADN à partir de l'échantillon de sol. EXAMPLE 1: Preparation of a nucleic acid collection from a soil sample containing organisms, comprising a step of directly extracting DNA from the soil sample.

1. MATERIEL ET METHODES
1. 1 SOLS : Les caractéristiques des six sols utilisés dans cette étude sont listées dans le tableau 1. 1. MATERIAL AND METHODS
1. 1 SOILS: The characteristics of the six soils used in this study are listed in Table 1.

La teneur en argile et en matière organique va respectivement de 9 à 47% et de 1,7 à 4,7%, le pH variant de 4,3 à 5,8. The clay and organic matter content ranges from 9 to 47% and from 1.7 to 4.7% respectively, the pH ranging from 4.3 to 5.8.

Des échantillons de sol ont été collectés à partir de la couche superficielle de 5 à 10 cm de profondeur. Toutes les racines visibles ont été éliminées et les sols ont été conservés à 4 C pendant quelques jours si nécessaire, après quoi ils ont été séchés pendant 24 heures à la température ambiante et tamisés (taille moyenne de maille 2 mm) avant d'être conservés jusqu'à plusieurs mois à 4 C. Soil samples were collected from the top layer 5 to 10 cm deep. All visible roots were removed and the soils were kept at 4 ° C for a few days if necessary, after which they were dried for 24 hours at room temperature and sieved (average mesh size 2 mm) before being stored. up to several months at 4 C.

1. 2 SOUCHES BACTERIENNES ET CONDITIONS DE CULTURE: L'ADN extracellulaire ainsi que les souches bactériennes fournissant des cellules végétatives, des spores ou des hyphae, utilisées pour innoculer les échantillons de sol, ont été choisies de telle sorte que leur présence puisse être suivie spécifiquement. 1. 2 BACTERIAL STRAINS AND CULTURE CONDITIONS: Extracellular DNA as well as bacterial strains providing vegetative cells, spores or hyphae, used to innoculate soil samples, were chosen so that their presence could be monitored. specifically.

Afin d'obtenir de grandes quantités d'ADN extracellulaire, la souche lysogénique de E.coli 1192 Hfr P4X (metB), contenant le phage lambda C1857 Sam7, a été cultivée sur milieu Luria-Bertani (LB) pendant deux heures à 30 C, puis 30 minutes à 40 C, puis 3 heures à 37 C. L'ADN du phage lambda a été extrait selon la technique décrite par SAMBROOK J. et al. (1989) Molecular Cloning: A Laboratory Manual, 2nd, ed. Cold Spring Harbor Laboratory, Cold Spring Harbor N.Y. In order to obtain large amounts of extracellular DNA, the lysogenic strain of E. coli 1192 Hfr P4X (metB), containing lambda phage C1857 Sam7, was cultured on Luria-Bertani (LB) medium for two hours at 30 ° C. then 30 minutes at 40 ° C. and then 3 hours at 37 ° C. The phage lambda DNA was extracted according to the technique described by SAMBROOK J. et al. (1989) Molecular Cloning: A Laboratory Manual, 2nd, ed. Cold Spring Harbor Laboratory, Cold Spring Harbor N.Y.

La souche avirulente de Bacillus anthracis (STERNE 7700) a été utilisée comme inoculum de cellules bactériennes. Bacillus anthracis a été multiplié sur un bouillon de culture de type " trypticase soy broth " (TSB) (Biomérieux, Lyon, France) pendant environ 6 heures, en vérifiant que la D06oo soit maintenue en dessous de 0,6. Ces conditions permettent le développement des cellules végétatives sans formation de spores (Patra étal., (1996), FEMS Immunol. Medical Microbiology, vol. 15:223-231.). Les The avirulent strain of Bacillus anthracis (STERNE 7700) was used as an inoculum of bacterial cells. Bacillus anthracis was propagated on broth culture type "trypticase soy broth" (TSB) (Biomérieux, Lyon, France) for about 6 hours, verifying that OD600 is kept below 0.6. These conditions allow the development of vegetative cells without spore formation (Patra et al., (1996), FEMS Immunol, Medical Microbiology, 15: 223-231). The

spores de Streptomyces lividans OS48. 3 (CLERC-BARDIN et al. non publié) ont été éliminées mécaniquement des cultures de l'organisme sur un milieu R2YE (HOPWOOD et al., (1985), Genetic Manipulation of Streptomyces-A Laboratory Manual. The John Innes Foundation , Norwich ,United Kingdom). Les hyphae de S.lividans OS48. 3 ont été obtenus à partir des spores en pré-germination, car l'on s'attendait à ce que l'utilisation de hyphae courtes minimise la rupture et la perte subséquente d'ADN. Les spores ont été mises en suspension dans du tampon TES (Acide N-Tris [hydroxyméthyl]méthyl-2-aminoéthanesulfonique ; SigmaAldrich Chimie, France) (0.05M; pH 8) (Holben WE et al., (1988), APPL. Environ. Microbiol. vol.54:703-711, puis ont été soumises à un choc thermique (50 C pendant 10 minutes suivi d'un refroidissement sous un courant d'eau froide puis ajoutées à un volume égal de milieu de prégermination (extrait de levure 1 %, casaminoacides 1 % CaCl2 0,01 M). spores of Streptomyces lividans OS48. 3 (CLERC-BARDIN et al., Unpublished) were mechanically removed from organism cultures on R2YE medium (HOPWOOD et al., (1985), Genetic Manipulation of Streptomyces-A Laboratory Manual, The John Innes Foundation, Norwich , United Kingdom). The hyphae of S.lividans OS48. 3 were obtained from pre-germinated spores, as the use of short hyphae was expected to minimize breakage and subsequent loss of DNA. The spores were suspended in TES buffer (N-Tris [hydroxymethyl] methyl-2-aminoethanesulfonic acid, SigmaAldrich Chemistry, France) (0.05M, pH 8) (Holben WE et al., (1988), APPL. Microbiol, vol.54: 703-711, were then heat-shocked (50 ° C for 10 minutes followed by cooling under a stream of cold water and then added to an equal volume of pregermination medium (extract). yeast 1%, casamino acids 1% CaCl 2 0.01 M).

La solution a été incubée à 37 C sur un agitateur. La proportion de spores germées a été estimée à environ 50%, en accord avec les résultats de HOPWOOD et al. (1985). Après centrifugation, les culots ont été resuspendus dans du tampon TES, ajoutés à 3% de milieu TSB, et incubés à 37 C jusqu'à l'obtention d'une D045o de 0,15 (HOPWOOD et al. , (1985)). Streptomyces hygroscopicus SWN 736 et Streptosporangium fragile AC1296 (Institute Pushino, Moscou) ont été cultivés selon des techniques décrites par HICKEY et TRESNER (1952). The solution was incubated at 37 ° C. on a shaker. The proportion of germinated spores was estimated at about 50%, in agreement with the results of HOPWOOD et al. (1985). After centrifugation, the pellets were resuspended in TES buffer, added to 3% of TSB medium, and incubated at 37 ° C. until a OD 45 0 of 0.15 (HOPWOOD et al., (1985)) was obtained. . Streptomyces hygroscopicus SWN 736 and Streptosporangium fragile AC1296 (Pushino Institute, Moscow) were grown according to techniques described by HICKEY and TRESNER (1952).

L'ADN des spores et des hyphae de S. Lividans a été extrait à partir des cultures pures selon le protocole de lyse 6 décrit ci-dessous (excepté qu'aucun broyage n'a été réalisé), tandis que les spores de S. hygroscopicus et de S. fragile ont été extraites par lyse chimique/enzymatique (Hintermann étal., 1981). Spore and hyphae DNA from S. Lividans was extracted from the pure cultures according to the lysis protocol 6 described below (except that no grinding was performed), whereas S. hygroscopicus and fragile S. were extracted by chemical / enzymatic lysis (Hintermann et al., 1981).

1. 3 CHOIX DU TAMPON D'EXTRACTION : tampon TENP (50 mM Tris, 20 mM EDTA, 100 mM NaCI, 1% pds/vol de polyvinylpolypyrrolidone développé par PICARD (1992) a été utilisé. Des tampons similaires ont été ultérieurement utilisés par d'autres auteurs (CLEGG et al., 1997 ; KUSKE et al., 1998 ; ZHOU et al., 1996). 1. CHOICE OF EXTRACTION BUFFER: TENP buffer (50 mM Tris, 20 mM EDTA, 100 mM NaCl, 1% wt / vol polyvinylpolypyrrolidone developed by PICARD (1992) was used, similar buffers were subsequently used by other authors (CLEGG et al., 1997, KUSKE et al., 1998, ZHOU et al., 1996).

Le Tris et l'EDTA protègent l'ADN de l'activité nucléase, le NaCI apporte un effet dispersant et la PVPP absorbe les acides humiques et les Tris and EDTA protect the DNA from nuclease activity, NaCl provides a dispersing effect and PVPP absorbs humic acids and

autres composés phénoliques (HOLBEN et al. (1988) ; PICARD et al., (1992). other phenolic compounds (HOLBEN et al (1988), PICARD et al., (1992).

Dans cette étude, l'efficacité d'extraction de ce tampon a été évaluée à différents pH (6,0 - 10,0) en utilisant 20 sols différents ayant une gamme de pH de 5,8 à 8,3 et une teneur en matière organique entre 0,2 et 6,3%. Ces vingt sols (les autres caractéristiques ne sont pas indiquées) ont été utilisés uniquement dans cette expérience. La quantité d'ADN a été déterminée de manière colorimétrique comme décrit par RICHARD (1974), et détaillé ci-après. In this study, the extraction efficiency of this buffer was evaluated at different pH (6.0 - 10.0) using 20 different soils with a pH range of 5.8 to 8.3 and a organic matter between 0.2 and 6.3%. These twenty soils (the other characteristics are not indicated) were used only in this experiment. The amount of DNA was determined colorimetrically as described by RICHARD (1974), and detailed below.

1. 4 PROTOCOLE DE LYSE IN SITU ET D'EXTRACTION D'ADN : Plusieurs protocoles utilisant un nombre croissant d'étapes ont été testés afin d'évaluer l'efficacité de différentes techniques pour lyser les microbes du sol in situ. Pour ces expériences, la microflore indigène du sol a été ciblée dans six sols. Des expériences additionnelles ont été conduites afin d'étudier les effets des traitements de lyse sur l'ADN libéré, en analysant les quantités et la qualité d'ADN récupéré provenant d'un ADN de phage lambda préalablement additionné aux sols. 1. 4 IN SITU LYSE AND DNA EXTRACTION PROTOCOL: Several protocols using an increasing number of steps have been tested to evaluate the effectiveness of different techniques for lysing soil microbes in situ. For these experiments, native soil microflora was targeted in six soils. Additional experiments were conducted to investigate the effects of lysis treatments on the released DNA by analyzing the amounts and quality of recovered DNA from lambda phage DNA previously added to the soils.

Une fois qu'un protocole optimisé (désigné protocole 6) a été développé, ce protocole a été utilisé pour quantifier l'ADN provenant d'Actinomycètes indigènes et d'ADN provenant de bactéries Grampositives inoculées dans les sols sélectionnés. Dans tous les cas, les échantillons de sol ont été séchés et passés au tamis comme décrit cidessus. Once an optimized protocol (referred to as protocol 6) was developed, this protocol was used to quantify DNA from native Actinomycetes and DNA from Grampositive bacteria inoculated into selected soils. In all cases, the soil samples were dried and sieved as described above.

Après broyage, 0,5 ml de tampon TENP ont été ajoutés à 200 mg poids sec de sol excepté pour le protocole 1 dans lequel le tampon a été ajouté à un sol non broyé). After grinding, 0.5 ml of TENP buffer was added at 200 mg dry weight of sol except for protocol 1 in which the buffer was added to unmilled soil).

Pour les divers traitements de lyse (voir ci-dessous), les suspensions de sol ont été passées au Vortex pendant dix minutes et centrifugées (4000 g pendant cinq minutes), après quoi une fraction aliquote (25 l) du surnageant a été analysée par électrophorèse sur gel (0,8% d'agarose). For the various lysis treatments (see below), the soil suspensions were vortexed for ten minutes and centrifuged (4000 g for five minutes), after which an aliquot (25 l) of the supernatant was analyzed by gel electrophoresis (0.8% agarose).

Une autre fraction aliquote du surnageant représentant un volume connu, généralement 350 l, a été précipitée avec de l'isopropanol. Another aliquot of the supernatant representing a known volume, typically 350 l, was precipitated with isopropanol.

Cinq fractions aliquotes (représentant de l'ADN dérivé de 1 g de sol) ont été réunies et resuspendues dans 100 l d'un tampon TE stérile (10 mM Tris, 1 mM EDTA, pH 8,0) avant purification (protocole D, voir cidessous) et quantification, soit par hybridation (Dot Blot) de l'ADN total, soit par hybridation (Dot Blot) des produits d'amplification PCR (voir cidessous). Five aliquots (representing DNA derived from 1 g of sol) were pooled and resuspended in 100 μl of sterile TE buffer (10 mM Tris, 1 mM EDTA, pH 8.0) prior to purification (Protocol D, see below) and quantification, either by hybridization (Dot Blot) of the total DNA, or by hybridization (Dot Blot) PCR amplification products (see below).

Les signaux d'hybridation ont été quantifiés par imagerie par phosphorescence (technique de " phospho-imaging " voir ci-dessous). The hybridization signals were quantified by phosphorescence imaging ("phospho-imaging" technique see below).

1. 5 EVALUATION DES METHODES DE LYSE CELLULAIRE IN SITU: La qualité et la quantité de l'ADN extrait après un nombre croissant d'étapes de traitement de lyse (protocole 2-5b) ont été comparées à celles de l'ADN extracellulaire obtenu après lavage du sol avec un tampon d'extraction (protocole 1; voir aussi figure 1). 1. EVALUATION OF IN SITU CELL LYSE METHODS: The quality and quantity of the DNA extracted after an increasing number of lysis treatment steps (protocol 2-5b) were compared with those of the extracellular DNA obtained. after washing the soil with an extraction buffer (protocol 1, see also Figure 1).

Protocole 1 : Pas de traitement de lyse. Protocol 1: No lysis treatment.

Le tampon TENP a été ajouté à un sol non broyé, une étape d'extraction d'ADN a été réalisée comme décrit ci-dessus. TENP buffer was added to unmilled soil, a DNA extraction step was performed as described above.

Protocole 2. Broyage du sol suivi d'une extraction d'ADN. Protocol 2. Soil grinding followed by DNA extraction.

Deux types de dispositifs différents ont été utilisés pour le broyage du sol. Two different types of devices were used for ground grinding.

Afin de comparer leur efficacité respective, 5g de sol sec ont été broyés pendant 30 secondes dans un broyeur contenant des anneaux de tungstène, ou pendant des temps variés jusqu'à 60 minutes dans un broyeur de sol contenant un mortier et des billes en agate (20 mm de diamètre). In order to compare their respective efficiency, 5g of dry soil was milled for 30 seconds in a mill containing tungsten rings, or for various times up to 60 minutes in a soil grinder containing a mortar and agate balls ( 20 mm in diameter).

Le tampon TENP est ensuite ajouté et l'ADN est extrait comme décrit ci-dessus. The TENP buffer is then added and the DNA is extracted as described above.

Les résultats d'électrophorèse sur gel ont montré qu'un broyage de 40 minutes en utilisant des billes en agate étaient nécessaires afin d'obtenir des quantités d'ADN extraits équivalentes à celles obtenues après 30 secondes de broyage en utilisant des anneaux de tungstène. Gel electrophoresis results showed that grinding for 40 minutes using agate beads was necessary in order to obtain extracted amounts of DNA equivalent to those obtained after 30 seconds of grinding using tungsten rings.

La distribution de taille des fragments d'ADN est similaire quelle que soit la méthode employée. The size distribution of the DNA fragments is similar regardless of the method used.

Ainsi, ces traitements ont été considérés comme équivalents et celui qui sera utilisé dans les protocoles décrits ci-dessous ne sera en conséquence pas spécifié. Thus, these treatments have been considered equivalent and the one that will be used in the protocols described below will therefore not be specified.

Dans les protocoles 3 à 5, l'efficacité de plusieurs autres traitements de lyse ultérieure au broyage du sol a été testée, soit séparément, soit dans différentes combinaisons. In Protocols 3 to 5, the effectiveness of several other lysis treatments subsequent to ground grinding was tested, either separately or in different combinations.

Protocole 3 :
Ce protocole est identique au protocole 2 , sauf qu'il comprend une étape d'homogénéisation à l'aide d'un mixeur de type Ultraturrax (Janker et Kunkel, IKA Labortechnik, Allemagne) réglé à la moitié de la vitesse maximale pendant 5 minutes. Protocol 3:
This protocol is identical to protocol 2 except that it comprises a homogenization step using an Ultraturrax type mixer (Janker and Kunkel, IKA Labortechnik, Germany) set at half the maximum speed for 5 minutes. .

PROTOCOLES 4a et 4b:
Ces protocoles sont identiques au protocole 3 à l'exception d'une étape additionnelle de sonication. PROTOCOLS 4a and 4b:
These protocols are identical to protocol 3 with the exception of an additional sonication step.

Deux types de dispositifs sonicateurs ont été comparés : un sonicateur à micropointe de titane (600W Vibracell Ultrasonicator, Bioblock, Illkirch, France) (Protocole 4a) et un sonicateur de type Cup Horn (protocole 4b). Two types of sonicator devices were compared: a titanium microtip sonicator (600W Vibracell Ultrasonicator, Bioblock, Illkirch, France) (Protocol 4a) and a Cup Horn-type sonicator (Protocol 4b).

La micropointe Vibracell produisant des ultrasons est en contact direct avec la solution de sol. The ultrasonic Vibracell microtip is in direct contact with the soil solution.

En ce qui concerne le dispositif de type Cup Horn, la solution de sol est conservée dans des tubes qui sont placés dans un bain d'eau à travers lequel passent les ultrasons. With regard to the Cup Horn device, the soil solution is stored in tubes which are placed in a water bath through which the ultrasound passes.

Des expériences préliminaires ont été réalisées afin de déterminer les conditions optimales pour les deux sonicateurs (résultats non présentés). Preliminary experiments were conducted to determine the optimal conditions for both sonicators (results not shown).

Le meilleur compromis, en terme de quantité d'ADN extrait et de taille de fragments, consiste en une sonication avec la micropointe de titane et le sonicateur de type Cup Horn respectivement pendant 7 et 10 minutes, en réglant la puissance à 15 W et avec des cycles actifs à 50%. The best compromise, in terms of amount of extracted DNA and fragment size, consists of sonication with the titanium micro-tip and the Horn-type sonicator respectively for 7 and 10 minutes, setting the power at 15 W and with active cycles at 50%.

Protocoles 5a et 5b:
Après sonication avec une micropointe de titane ou un dispositif de type Cup Horn (respectivement protocoles 4a et 4b) du lysozyme et de l'achromopeptidase ont été ajoutés, chacune des enzymes à une concentration finale de 0,3 mg/ml. Protocols 5a and 5b:
After sonication with a titanium microtip or a Cup Horn device (respectively Protocols 4a and 4b) lysozyme and achromopeptidase were added, each of the enzymes at a final concentration of 0.3 mg / ml.

Les suspensions de sol ont été incubées pendant 30 minutes à 37 C, après quoi du lauryl sulfate à une concentration finale de 1 % a été ajouté, puis des suspensions ont été incubées pendant 1 heure à 60 C avant centrifugation et précipitation comme décrit ci-dessus. The soil suspensions were incubated for 30 minutes at 37 ° C., after which lauryl sulfate at a final concentration of 1% was added, then suspensions were incubated for 1 hour at 60 ° C. before centrifugation and precipitation as described below. above.

En plus des protocoles décrits ci-dessus, l'effet de la sonication (Cup Horn, voir protocole 4b) et de chocs thermiques (30 secondes dans l'azote liquide suivi de trois minutes dans l'eau bouillante, les traitements étant répétés trois fois ) sur l'ADN de phage lambda digéré par Hindlll préalablement ajouté au sol ont été examinés (voir ci-après). In addition to the protocols described above, the effect of sonication (Cup Horn, see Protocol 4b) and thermal shock (30 seconds in liquid nitrogen followed by three minutes in boiling water, the treatments being repeated three on Hindlll digested lambda phage DNA previously added to the soil were examined (see below).

Des chocs thermiques ont été suggérés dans l'état de la technique comme des moyens de lyse cellulaire in situ (PICARD et al. (1992)).. Cependant, du fait qu'un tel traitement a un effet préjudiciable sur l'ADN libre (voir la section résultats) il n'a pas été inclus dans les protocoles décrits ci-dessus. Thermal shocks have been suggested in the state of the art as means of cell lysis in situ (PICARD et al. (1992)). However, because such a treatment has a detrimental effect on free DNA (see results section) it was not included in the protocols described above.

PROTOCOLE OPTIMISE
Après évaluation des différents traitements de lyse, un protocole optimisé a été défini, désigné protocole 6. Le protocole 6 est identique au protocole 5b excepté que, avant la sonication, les suspensions de sol sont soumises à un traitement par Vortex puis agitées par rotation sur une roue pendant deux heures avant d'être congelées à - 20 C. OPTIMIZED PROTOCOL
After evaluation of the different lysis treatments, an optimized protocol was defined, designated protocol 6. Protocol 6 is identical to protocol 5b except that, before sonication, the soil suspensions are subjected to a Vortex treatment and then agitated by rotation on a wheel for two hours before being frozen at - 20 C.

Après décongélation, les suspensions de sol sont passées au Vortex pendant 10 minutes avant sonication. Le protocole 6 a été utilisé dans les expériences dans lesquelles les sols ont été ensemencés avec des cellules bactériennes ainsi que dans les expériences dans lesquelles les actinomycètes indigènes ont été quantifiés (voir ci-dessous). After thawing, the soil suspensions are vortexed for 10 minutes before sonication. Protocol 6 was used in experiments in which soils were seeded with bacterial cells as well as in experiments in which native actinomycetes were quantified (see below).

1. 6 COMPTAGE AU MICROSCOPE: L'efficacité du broyage du sol comme méthode pour lyser des cellules bactériennes a été examinée au microscope. 1. 6 MICROSCOPE COUNTING: The efficiency of soil crushing as a method for lysing bacterial cells was examined under a microscope.

5g de sol brut séché ont été mélangés dans un dispositif de type Waring Blender avec 50 ml d'eau stérilisée ultrapure pendant 1,5 minutes; simultanément, 1g (poids sec) de sol broyé (protocole n 2) a été mis en suspension dans 10 ml par agitation pendant 10 minutes. Les suspensions de salant fait l'objet de dilutions en séries et de l'acridine orange a été ajoutée à une concentration finale de 0,001%. 5 g of dried crude sol were mixed in a Waring Blender device with 50 ml of ultrapure sterilized water for 1.5 minutes; simultaneously, 1 g (dry weight) of ground ground (Protocol No. 2) was suspended in 10 ml by stirring for 10 minutes. Salt suspensions are serial diluted and orange acridine added to a final concentration of 0.001%.

Après 2 minutes, les suspensions ont été filtrées à travers une membrane de marque NUCLEOPORE de type 0,2 m black. Chaque filtre a été rinçé avec de l'eau stérile lysée, traitée avec 1 ml d'isopropanol pendant 1 minute afin de fixer les cellules bactériennes, puis rincé de nouveau. After 2 minutes, the suspensions were filtered through a membrane brand NUCLEOPORE type 0.2 m black. Each filter was rinsed with lysed sterile water, treated with 1 ml of isopropanol for 1 minute to fix the bacterial cells, and then rinsed again.

Les cellules bactériennes ont été comptées à l'aide d'un microscope a épifluorescence du type Zeiss Universal avec un objectif 100x. Pour chacun des types de sol, trois filtres ont été comptés, et au moins 200 cellules ont été comptées sur chacun des filtres. The bacterial cells were counted using an epifluorescence microscope of the Zeiss Universal type with a 100x objective. For each soil type, three filters were counted, and at least 200 cells were counted on each of the filters.

1. 7 NUMERATION DES ACTINOMYCETES CULTIVABLES ET NOMBRE TOTAL D'UNITES FORMANT COLONIES (CFU) : Les actinomycètes ayant survécu aux traitements de lyse (protocoles 1-5) ont été examinés spécifiquement avec le sol n 3 (Côte Saint André, voir tableau 1). 1. 7 NUMBER OF CULTIVATED ACTINOMYCETS AND TOTAL NUMBER OF COLONY-FORMING UNITS (CFU): Actinomycetes surviving lysis treatments (Protocols 1-5) were examined specifically with soil No. 3 (Côte Saint André, see Table 1). ).

Après une dilution de 10 fois d'une solution d'extrait de levure (6% poids/volume) et de SDS (0,05%) afin d'induire la germination (Hayakawa et al. (1988)), les suspensions de sol ont été diluées en séries dans de l'eau stérile, incubées à 40 C pendant 20 minutes et ensemencées sur du milieu HV (HAYAKAWA et al., 1987). After a 10-fold dilution of a yeast extract (6% w / v) and SDS (0.05%) solution to induce germination (Hayakawa et al (1988)), suspensions of The sol were serially diluted, incubated at 40 ° C. for 20 minutes and seeded on HV medium (HAYAKAWA et al., 1987).

Le milieu HV a été additionné de actidione (50 mg/l) et de nystatine (50 mg/ml). The HV medium was supplemented with actidione (50 mg / l) and nystatin (50 mg / ml).

Les colonies d'actinomycètes ont été comptées après incubation pendant 15 jours à 28 C. Actinomycete colonies were counted after incubation for 15 days at 28 ° C.

Au total, environ 400 colonies ont été examinées. L'identification a été réalisée sur la base des caractéristiques morphologiques macro-et microscopiques ainsi que sur l'analyse de la teneur en acide In total, about 400 colonies were examined. The identification was carried out on the basis of macro-and microscopic morphological characteristics as well as on the analysis of the acid content

diaminopimélique des isolats (SHIRLING et al., 1966); STANECK et al., 1974; WILLIAMS état.,1993). diaminopimelic isolates (SHIRLING et al., 1966); STANECK et al., 1974; WILLIAMS state., 1993).

La quantité totale de bactéries cultivables (CFU totales) a été également déterminée pour chacun des protocoles de lyse 1 à 5. Les suspensions de sol ont été diluées en série et ensemencées en triple sur un milieu agar Bennett (WAKSMAN et al., 1961) additionné de nystatine et d'actidione (chacune à 50 mg/l). The total amount of cultivable bacteria (total CFU) was also determined for each of lysis protocols 1 to 5. The soil suspensions were serially diluted and seeded in triplicate on Bennett agar medium (WAKSMAN et al., 1961). added with nystatin and actidione (each at 50 mg / l).

Chaque boîte de Pétri a été couverte d'un filtre de nitrate de cellulose (Millipore) et incubée pendant trois jours à 28 C. Après la numération des colonies sur les membranes, les filtres ont été retirées et les boîtes de Pétri ont été à nouveau incubées pendant 7 jours à 28 C puis comptées à nouveau. Each petri dish was covered with a cellulose nitrate filter (Millipore) and incubated for three days at 28 C. After counting the colonies on the membranes, the filters were removed and the Petri dishes were again incubated for 7 days at 28 C then counted again.

1. 8 RECUPERATION DE L'ADN DE PHAGE LAMBDA AJOUTE AUX SOLS : L'ADN de phage lambda a été digéré avec Hindlll, extrait par un mélange de phénol-chloroforme, précipité puis resuspendu dans de l'eau stérile ultrapure selon des protocoles standard (SAMBROOK et al., 1989). 1. 8 RECOVERY OF LAMBDA PHAGE DNA ADDED TO THE SOIL: The lambda phage DNA was digested with HindIII, extracted with a mixture of phenol-chloroform, precipitated and then resuspended in sterile ultrapure water according to standard protocols. (SAMBROOK et al., 1989).

Des dilutions correspondant respectivement à 0,2,5, 5,7,5, 10 et 15 g d'ADN/g de poids sec de sol ont été préparées dans des volumes de 60 pl. Ces dilutions d'ADN ont été ajoutées à des lots de 5g de sol sec qui ont été subséquemment vigoureusement mélangés par vortex pendant 5 minutes avant broyage. Dilutions corresponding respectively to 0.2.5, 5.7.5, 10 and 15 g of DNA / g dry weight of sol were prepared in volumes of 60 μl. These DNA dilutions were added to 5 g batches of dry sol which were subsequently vigorously vortexed for 5 minutes before grinding.

L'ADN de phage lambda a aussi été ajouté à un sol avant broyage à des concentrations correspondant à 0, 10 et 15 g d'ADN/g de poids sec du sol. Lambda phage DNA was also added to soil before grinding at concentrations corresponding to 0, 10 and 15 g DNA / g dry weight of the soil.

Après broyage, le tampon d'extraction est ajouté et l'ADN est extrait selon le protocole 2(voir ci-dessus). After grinding, the extraction buffer is added and the DNA is extracted according to protocol 2 (see above).

1. 9 SATURATION DES SITES D'ADSORPTION AVEC DE L'ARN : de déterminer si la saturation des sites d'adsorption d'acides nucléiques des colloïdes du sol pouvait augmenter le taux de récupération de l'ADN, le terreau sablonneux (sol n 4) et le sol argileux (sol n 5) ont été incubés avec une solution d'ARN avant tout autre traitement. 1. 9 SATURATION OF ADSORPTION SITES WITH RNA: to determine whether the saturation of the soil colloid nucleic acid adsorption sites could increase the recovery rate of DNA, the sandy loam (soil n 4) and clay soil (soil 5) were incubated with an RNA solution before further treatment.

De l'ARN commercial de Saccharomyces cerevisiae (BOHRINGER MANNHEIM, MEYLAN, France) a été dilué dans du tampon phosphate (pH 7,1) et ajouté aux échantillons de sol sec et Commercial Saccharomyces cerevisiae RNA (BOHRINGER MANNHEIM, MEYLAN, France) was diluted in phosphate buffer (pH 7.1) and added to the dry soil samples.

tamisés (2 ml/g de sol) à des concentrations finales de 20,50 et 100 mg d'ARN/g de poids sec du sol. sieved (2 ml / g soil) at final concentrations of 20.50 and 100 mg RNA / g dry weight of soil.

Les tubes contenant les suspensions de sol ont été agités par rotation pendant deux heures à température ambiante. Après centrifugation, les culots de sol ont été séchés au four (50 C) pendant la nuit. L'ADN de phage lambda a ensuite été ajouté aux sols (0,20 ou 50 g/g de poids sec du sol) afin de simuler le sort de l'ADN libéré après lyse cellulaire. The tubes containing the soil suspensions were rotated for two hours at room temperature. After centrifugation, the soil pellets were oven dried (50 ° C.) overnight. Lambda phage DNA was then added to the soils (0.20 or 50 g / g dry weight of soil) to simulate the fate of the DNA released after cell lysis.

L'ADN a été extrait selon le protocole n 2. Il a été déterminé par la suite qu'un effet identique de l'addition d'ARN sur la récupération d'ADN pouvait être atteint en ajoutant l'ARN directement au tampon d'extraction. The DNA was extracted according to protocol No. 2. It was subsequently determined that an identical effect of RNA addition on DNA recovery could be achieved by adding RNA directly to the DNA buffer. extraction.

Cette procédure simplifiée a été utilisée pour le sol argileux n 5 dans les expériences dans lesquelles les micro-organismes ont été inoculés dans les sols. This simplified procedure was used for # 5 clay soil in experiments in which microorganisms were inoculated into soils.

L'ARN a ensuite été ajouté à une concentration correspondant à 50 mg d'ARN/g de poids sec du sol. The RNA was then added at a concentration corresponding to 50 mg RNA / g dry weight of the soil.

1. 10 DETERMINATION QUALITATIVE ET QUANTITATIVE DE L'EFFICACITE DES PROTOCOLES D'EXTRACTION : qualité de l'ADN (absence de dégradation) a été estimée sur la base de la taille des fragments d'ADN ou de la position relative des bandes de migration d'ADN après électrophorèse d'une fraction aliquote d'une solution d'ADN sur un gel d'agarose à 0,8%. 1. 10 QUALITATIVE AND QUANTITATIVE DETERMINATION OF THE EFFECTIVENESS OF EXTRACTION PROTOCOLS: DNA quality (no degradation) was estimated on the basis of the size of the DNA fragments or the relative position of the migration bands of DNA after electrophoresis of an aliquot of a DNA solution on a 0.8% agarose gel.

L'intensité de fluorescence a permis une estimation semiquantitative des rendements d'extraction. The fluorescence intensity allowed a semiquantitative estimation of the extraction yields.

Une autre fraction aliquote a été utilisée pour des déterminations quantitatives de la teneur en ADN par hybridation (Dot Blot) et analyse au phospho-Imager. Le protocole d'hybridation sur tache a été décrit par SIMONET et al. (1990). Another aliquot fraction was used for quantitative determination of DNA content by dot blotting and phospho-imager analysis. The stain hybridization protocol has been described by SIMONET et al. (1990).

Les membranes d'hybridation (GeneScreen plus, Life Science Products, Boston, Etats-Unis d'Amérique) ont été préhybridées pendant au moins 2 heures dans 20 ml d'une solution contenant 6 ml de 20 x SSC, 1 ml de solution de DENHARDT's, 1 ml de SDS à 10% et 5 mg d'ADN de sperme de saumon. Hybridization membranes (GeneScreen plus, Life Science Products, Boston, USA) were prehybridized for at least 2 hours in 20 ml of a solution containing 6 ml of 20 x SSC, 1 ml of DENHARDT's, 1 ml of 10% SDS and 5 mg of salmon sperm DNA.

L'hybridation a été réalisée pendant une nuit dans la même solution en présence d'une sonde marquée préalablement à deux lavages des membranes dans un tampon SSC 2 x pendant 5 minutes à température ambiante, puis un troisième lavage dans du tampon SSC 2 x, SDS 0,1% et un quatrième lavage dans du tampon SSC 1 x, SDS 0,1% pendant 30 minutes à la température d'hybridation. Hybridization was carried out overnight in the same solution in the presence of a labeled probe prior to two membrane washes in 2 x SSC buffer for 5 minutes at room temperature, followed by a third wash in 2 x SSC buffer, 0.1% SDS and a fourth wash in 1x SSC buffer, 0.1% SDS for 30 minutes at the hybridization temperature.

Les signaux d'hybridation ont été quantifiés avec un système d'imagerie radioanalytique BIORAD (Molecular Analyst Software, BIORAD, Ivry S/Seine, France). The hybridization signals were quantified with a BIORAD radioanalytical imaging system (Molecular Analyst Software, BIORAD, Ivry S / Seine, France).

Afin de quantifier la quantité totale d'ADN dérivée de la microflore indigène, les différents sols ont été extraits selon les protocoles n 1 à 5. L'ADN non amplifié a été appliqué sur les membranes de Dot Blot et hybridé en utilisant la sonde universelle FGPS431 (tableau 2). In order to quantify the total amount of DNA derived from the native microflora, the different soils were extracted according to the protocols n 1 to 5. The non-amplified DNA was applied on the Dot Blot membranes and hybridized using the universal probe. FGPS431 (Table 2).

Cette sonde, qui hybride aux positions 1392-1406 du gène de l'ADNr 16S de E.coli (Amann et al. (1995)) a été marquée à ses extrémités avec un ATP[alpha]32P en utilisant une polynucléotide kinase T4(BOEHRINGER MANNHEIM, Melan, France). This probe, which hybridizes to positions 1392-1406 of the E. coli 16S rDNA gene (Amann et al (1995)) was labeled at its ends with ATP [alpha] 32P using a T4 polynucleotide kinase ( BOEHRINGER MANNHEIM, Melan, France).

Une courbe de calibration a été préparée à partir de l'ADN de E.coli DH5a. La conversion des calculs aux bactéries du sol a nécessité une simplification, partant de l'hypothèse que le nombre de copies moyen (rrn) est de 7, comme pour E.coli. A calibration curve was prepared from E. coli DH5a DNA. The conversion of stones to soil bacteria necessitated a simplification, assuming that the average number of copies (rrn) was 7, as for E. coli.

L'ADN de phage lambda digéré par Hindlll a été utilisé pour quantifier la récupération de l'ADN extracellulaire. Des extraits non amplifiés à partir de sols, auxquels de l'ADN de phage lambda avait été ajouté, ont été hybridés avec de l'ADN de phage lambda digéré par Hindlll marqué au hasard en utilisant le fragment Klenow (Bôehringer Mannheim, Melan, France). Hindlll digested lambda phage DNA was used to quantify the recovery of extracellular DNA. Unamplified extracts from soils, to which lambda phage DNA had been added, were hybridized with randomly-tagged Hindlll digested phage lambda DNA using the Klenow fragment (Boehringer Mannheim, Melan, France). ).

Les quantités d'ADN ont été calculées par interpolation à partir d'une courbe de calibration préparée avec l'ADN purifié. The amounts of DNA were calculated by interpolation from a calibration curve prepared with the purified DNA.

La quantité totale d'ADN extrait à partir des sols n 1, 2,3, 4 et 6 selon le protocole n 2 (broyage) a également été quantifiée de manière colorimétrique selon la technique décrite par RICHARD (1974). The total amount of DNA extracted from sols 1, 2, 3, 4 and 6 according to protocol No. 2 (grinding) was also quantified in a colorimetric manner according to the technique described by RICHARD (1974).

Brièvement, de l'ADN a été mélangé avec du HClO4 concentré (la concentration finale de HC104 était de 1,5 N). On a mélangé 2,5 volumes de cette solution avec 1,5 volumes de DPA (diphénylamine, Sigma-Aldrich, France) et laissé incuber le mélange à la température Briefly, DNA was mixed with concentrated HClO4 (the final concentration of HC104 was 1.5 N). 2.5 volumes of this solution were mixed with 1.5 volumes of DPA (diphenylamine, Sigma-Aldrich, France) and incubated at room temperature.

ambiante pendant 18 heures, préalablement à la détermination de la DO à 600 nn. Les extraits d'ADN du sol ont été quantifiés par rapport à une courbe standard réalisée par l'ADN extrait à partir de E.coli DH5[alpha] selon les protocoles standards (SAMBROOK et al., (1989)). room temperature for 18 hours, prior to determining the OD at 600 nn. The soil DNA extracts were quantified against a standard curve made by DNA extracted from E. coli DH5 [alpha] according to standard protocols (SAMBROOK et al., (1989)).

1. 11 DEVELOPPEMENT D'UNE TECHNIQUE DE QUANTIFICATION D'ADN EN UTILISANT L'AMPLIFICATION PCR ET L'HYBRIDATION: Pour les amplifications par PCR, de l'ADN polymérase Taq (Appligene Oncor, France) a été utilisé selon les instructions du fabricant. 1. 11 DEVELOPMENT OF A DNA QUANTIFICATION TECHNIQUE USING PCR AMPLIFICATION AND HYBRIDIZATION: For PCR amplifications, Taq DNA polymerase (Appligene Oncor, France) was used according to the manufacturer's instructions. .

Le programme PCR utilisé pour toutes les amplifications est le suivant : dénaturation initiale pendant 3 minutes à 95 C, puis 35 cycles consistant en 1 minute à 95 C, 1 minute à 55 C et 1 minute à 72 C, suivie par une extension finale à 72 C pendant 3 minutes. The PCR program used for all the amplifications is as follows: initial denaturation for 3 minutes at 95 ° C., then 35 cycles consisting of 1 minute at 95 ° C., 1 minute at 55 ° C. and 1 minute at 72 ° C., followed by a final extension to 72 C for 3 minutes.

L'ADN isolé et purifié à partir de Streptosporangium fragile a été utilisé comme témoin à des concentrations allant de 100 fg à 100 ng. DNA isolated and purified from brittle Streptosporangium was used as a control at concentrations ranging from 100 fg to 100 ng.

Afin d'amplifier spécifiquement l'ADN de ce genre bactérien, on a choisi les amorces FGPS122 et FGPS350 (tableau 2), complémentaires à une partie de l'ADNr 16S, après alignement des séquences d'ADNr 16S d'actynomycètes. Leur spécificité a été testée sur une collection de souches d'actynomycètes (Streptomycès, Streptosporangium et d'autres genres fortement apparentés)
Les produits de PCR ont été hybridés avec la sonde oligonucléotidique FGPS643 (tableau 2). Afin de simuler le niveau de pureté obtenu en routine avec de l'ADN extrait à partir du sol, des témoins d'ADN pur de S. fragile ont été mélangés avec les extraits de sol obtenus après des traitements selon les protocoles de lyse 4b et 5b puis purifiés selon le protocole D. In order to specifically amplify the DNA of this bacterial genus, primers FGPS122 and FGPS350 (Table 2), complementary to a portion of the 16S rDNA, were chosen after alignment of the actinomycete 16S rDNA sequences. Their specificity was tested on a collection of actinomycete strains (Streptomyces, Streptosporangium and other closely related genera)
The PCR products were hybridized with the oligonucleotide probe FGPS643 (Table 2). In order to simulate the level of purity obtained routinely with DNA extracted from the soil, pure S. fragile DNA controls were mixed with the soil extracts obtained after treatments according to the lysis protocols 4b and 5b and then purified according to protocol D.

Avant utilisation, les extraits de sol ont été traités avec de la DNase (une unité de DNase/ml, GIBCO BRL) pendant 30 minutes à température ambiante. La DNase a ensuite été inactivée par chauffage à 65 C pendant 10 minutes. Une vérification de l'inactivation a été réalisée par PCR. Les concentrations d'acides humiques ont été mesurées par spectrophotométrie (D028onm) contre une courbe standard d'acides humiques commerciaux (Sigma). Before use, the soil extracts were treated with DNase (one unit of DNase / ml, GIBCO BRL) for 30 minutes at room temperature. The DNase was then inactivated by heating at 65 ° C. for 10 minutes. An inactivation check was performed by PCR. Humic acid concentrations were measured spectrophotometrically (D028onm) against a standard curve of commercial humic acids (Sigma).

Des solutions de sol traitées à la Dnase non diluées, diluées 10x et diluées x 100 ont été mélangées de 100. fg à 100ng d'ADN de S. Undiluted, diluted 10x and diluted x 100 Dnase-treated soil solutions were mixed with 100 μg to 100 μg of S DNA.

fragile avant l'amplification par PCR. Dans une autre série d'expériences, les concentrations croissantes d'ADN de Streptomyces hygroscopicus de (100 pg à 1 g) ont été ajoutées à l'ADN de S. fragile afin de simuler la présence d'ADN non-cible et son influence sur le procédé PCR. fragile before amplification by PCR. In another series of experiments, increasing concentrations of Streptomyces hygroscopicus DNA from (100 μg to 1 g) were added to the fragile S. DNA to simulate the presence of non-target DNA and its influence. on the PCR process.

1. 12 PURIFICATION DES EXTRAITS D'ADN BRUT : méthodes de purification d'ADN ont été comparées. L'ADN a été extrait à partir de 1g (poids sec de sol selon le protocole 4a et remis en suspension dans 100 l de tampon TE8 (50 mM Tris, 20 mM (EDTA, pH 8,0). 1. 12 PURIFICATION OF GROSS DNA EXTRACTS: DNA purification methods were compared. The DNA was extracted from 1 g (dry weight of sol according to protocol 4a and resuspended in 100 l of TE8 buffer (50 mM Tris, 20 mM (EDTA, pH 8.0).

Protocole A
Elution à travers deux colonnes successives Elutip d (SCHLEICHER et SCHUELL, Dassel, Allemagne) (PICARD et al., (1992)). Protocol A
Elution through two successive Elutip columns (SCHLEICHER and SCHUELL, Dassel, Germany) (PICARD et al., (1992)).

Protocole B:
Elution à travers une colonne SEPHACRYL S200 (Pharmacia Biotech, Uppsala, Suède) suivie d'une élution à travers une colonne Elutip d (NESME et al. (1995)). Protocol B:
Elution through a SEPHACRYL S200 column (Pharmacia Biotech, Uppsala, Sweden) followed by elution through an Elutip column (NESME et al (1995)).

Protocole C :
Séparation à l'aide d'un système aqueux à deux phases avec 17,9% (poids/poids) de PEG 8000 (Merck, Darmstadt, Allemagne) et 14,3% (poids/poids) de (NH4)2SO4(ZASLAVSKY,(1995)). Protocol C:
Separation using a two-phase aqueous system with 17.9% (w / w) of PEG 8000 (Merck, Darmstadt, Germany) and 14.3% (w / w) of (NH4) 2SO4 (ZASLAVSKY) (1995)).

Après un mélange vigoureux au vortex, les deux phases ont été laissées à température ambiante pour leur séparation. After vigorous vortex mixing, both phases were left at room temperature for separation.

1 ml de chacune des phases a été transféré dans un autre tube, mélangé avec 100 l de l'échantillon et laissé à 4 C pendant une nuit pour permettre la séparation. 1 ml of each of the phases was transferred to another tube, mixed with 100 l of the sample and left at 4 ° C overnight to allow separation.

La phase inférieure a été dialysée pendant une heure à travers une membrane Millipore en présence d'un excès d'un tampon TE 7,5 (10 mM Tris, 1 mM EDTA à pH 7,5 et 1 M Mg Cl2) afin d'éliminer les sels en excès. The lower phase was dialyzed for one hour through a Millipore membrane in the presence of an excess of a TE 7.5 buffer (10 mM Tris, 1 mM EDTA pH 7.5 and 1 M Mg Cl 2) in order to remove excess salts.

Protocole D :
Elution à travers une colonne de type Microspin Sephacryl S400 HR (Pharmacia Biotech, Uppsala, Suède) , suivie d'une élution à travers une colonne de type Elutip d. Protocol D:
Elution through a Microspin Sephacryl S400 HR column (Pharmacia Biotech, Uppsala, Sweden), followed by elution through an Elutip d column.

Chaque protocole est terminé par une étape de précipitation à l'éthanol, et l'ADN est remis en suspension dans 10 l de tampon TE 7,5. Each protocol is terminated by an ethanol precipitation step, and the DNA is resuspended in 10 l of TE 7.5 buffer.

L'efficacité des protocoles de purification a été vérifiée par amplification PCR de fractions aliquotes non diluées des solutions d'ADN et de fractions aliquotes diluées 10 et x 100 fois, en utilisant des protocoles standard (voir ci-dessous). The efficiency of the purification protocols was verified by PCR amplification of undiluted aliquots of diluted DNA solutions and aliquots 10 and x 100-fold, using standard protocols (see below).

1. 13 RECUPERATION DE L'ADN A PARTIR DE MICROORGANISMES INNOCULES: Les cellules, spores et hyphae ont été lavées deux fois et dénombrées par comptage sur plaque ou comptage microscopique direct. Des lots de 5g de sol sec et tamisé (sols n 2, 3 et 5) ont été inoculés avec 100 l d'une suspension de spores et d'hyphae de S. lividans à des concentrations correspondant à 0,103, 105, 107 et 109 spores/g de poids sec de sol, ou avec des cellules végétatives de B.anthracis à des concentrations correspondant à 0,107 et 109 cellules par gramme de poids sec du sol. 1. 13 RECOVERY OF DNA FROM INNOCULATED MICROORGANISMS: The cells, spores and hyphae were washed twice and counted by plate counting or direct microscopic counting. Lots of 5g of dry, sieved soil (soil types 2, 3 and 5) were inoculated with 100 L of spore and hyphae suspension of S. lividans at concentrations corresponding to 0.103, 105, 107 and 109 spores / g dry weight of soil, or with vegetative cells of B. anthracis at concentrations corresponding to 0.107 and 109 cells per gram of dry soil weight.

Les quantités de hyphae de S. lividans ont été calculées sur la base du nombre de spores desquelles elles sont originaires. Après addition des suspensions bactériennes, les échantillons de sol sont mélangés vigoureusement par vortex pendant 5 minutes avant broyage. The quantities of S. lividans hyphae were calculated on the basis of the number of spores from which they originate. After addition of the bacterial suspensions, the soil samples are vortexed vigorously for 5 minutes before grinding.

L'ADN est extrait selon le protocole n 6 (voir ci-dessous). DNA is extracted according to Protocol No. 6 (see below).

L'amplification PCR suivie d'une hybridation sur tache (Dot Blot) et imagerie par phosphorescence (phospho-imaging) a été utilisée afin de quantifier les quantités d'ADN récupérées à partir des cellules, des spores et du mycélium bactérien inoculé dans les sols. PCR amplification followed by dot blotting and phosphorescence imaging was used to quantify the amounts of DNA recovered from the cells, spores and bacterial mycelium inoculated into the cells. soils.

L'extraction d'ADN a été réalisée selon le protocole de lyse n 6. DNA extraction was performed according to lysis protocol n 6.

L'amplification PCR et l'hybridation ont été réalisées comme décrit cidessus. Les amorces et les sondes sont ciblées sur des régions chromosomiques localisées en dehors de la région 16S, et sont hautement spécifiques des organismes respectifs, de manière à éviter des signaux de bruit de fond. PCR amplification and hybridization were performed as described above. The primers and probes are targeted to chromosomal regions located outside the 16S region, and are highly specific to the respective organisms, so as to avoid background signals.

Pour les sols ensemencés avec B. anthracis, les amorces R499 et R500 ont été utilisées (Patra et al. (1996)) et les produits d'amplification ont été hybridés avec la sonde oligonucléotidique C501 (tableau 2). For soils seeded with B. anthracis, primers R499 and R500 were used (Patra et al. (1996)) and the amplification products were hybridized with oligonucleotide probe C501 (Table 2).

Pour les sols ensemencés avec S. lividans , les réactions PCR ont été réalisées en utilisant les amorces FGPS516 et FGPS517, et les produits d'amplification ont été hybridés avec la sonde oligonucléotidique FGPS518 (tableau 2). For soils seeded with S. lividans, PCR reactions were performed using primers FGPS516 and FGPS517, and the amplification products were hybridized with oligonucleotide probe FGPS518 (Table 2).

La région amplifiée est une partie de la cassette construite spécifiquement pour obtenir la souche OS48.3 (CLERC-BARDIN et al., non publié). The amplified region is a portion of the cassette specifically constructed to obtain strain OS48.3 (CLERC-BARDIN et al., Unpublished).

Les comptes de calibration ont été dans tous les cas obtenus en utilisant l'ADN purifié de l'organisme cible. The calibration counts were in all cases obtained using the purified DNA of the target organism.

2. RESULTATS 2. 1 CHOIX DU TAMPON D'EXTRACTION: 20 sols différents ont été utilisés afin de déterminer le pH optimal du tampon d'extraction d'ADN. 2. RESULTS 2. 1 CHOICE OF THE EXTRACTION STAMP: 20 different soils were used to determine the optimal pH of the DNA extraction buffer.

Pour tous les sols, le rendement en ADN augmente avec les pH croissants du tampon. Le rendement pour chaque pH (+/- sd), calculé comme le pourcentage de la valeur la plus haute pour chacun des sols, est le suivant : pH 6,0 : 31+/- 13; pH 7,0 : 43+/- 16 ; pH 8,0 : 60+/- 14 ; pH 9,0 : 82 +/- 12 ; pH 10,0: 98 +/- 3. For all soils, the yield of DNA increases with the increasing pH of the buffer. The yield for each pH (+/- sd), calculated as the percentage of the highest value for each soil, is as follows: pH 6.0: 31 +/- 13; pH 7.0: 43 +/- 16; pH 8.0: 60 +/- 14; pH 9.0: +/- 12; pH 10.0: 98 +/- 3.

Pour 16 des 20 sols, le rendement le plus élevé a été obtenu à pH 10,0, alors que pour les quatre autres sols le plus haut rendement a été obtenu à pH 9,0. Cependant, à pH 10,0, des quantités plus grandes de matériel humique ont été libérées, comparées à pH 9,0 (résultats non présentés). En conséquence, le pH 9,0 a été choisi pour toutes les expériences présentées ci-dessous. For 16 of the 20 soils, the highest yield was obtained at pH 10.0, while for the other four soils the highest yield was obtained at pH 9.0. However, at pH 10.0, larger amounts of humic material were released, compared to pH 9.0 (results not shown). As a result, pH 9.0 was chosen for all the experiments presented below.

2. 2 EFFICACITE DES PROTOCOLES D'EXTRACTION D'ADN : L'ADNtotal des organismes indigènes du sol a été extrait et quantifié de manière à évaluer l'efficacité de nombreux protocoles de lyse cellulaire in situ. Des échantillons des sols 1-6 (tableau 1) ont été traités selon les protocoles n 1 à 5 décrits dans la section Matériel et Méthodes (figure 1). 2. EFFECTIVENESS OF DNA EXTRACTION PROTOCOLS: The total native soil DNA was extracted and quantified to evaluate the efficacy of many in situ cell lysis protocols. Samples of soils 1-6 (Table 1) were processed according to Protocols 1-5 described in the Materials and Methods section (Figure 1).

Après l'extraction d'ADN, les suspensions de sols ont été précipitées avec de l'isopropanol, et des fractions aliquotes des culots remis en suspension ont été analysées par électrophorèse sur gel , dans une première étape, afin d'estimer la qualité et la quantité de l'ADN libéré. After the DNA extraction, the soil suspensions were precipitated with isopropanol, and aliquots of the resuspended pellets were analyzed by gel electrophoresis, in a first step, to estimate the quality and the amount of DNA released.

Cependant, la couleur de l'extrait d'ADN devenait de plus en plus sombre au fur et à mesure du nombre croissant d'étapes de lyse, du fait de la co-extraction de composés, tels que les acides humiques, avec l'ADN. However, the color of the DNA extract became increasingly dark as the number of lysis steps increased, due to the coextraction of compounds, such as humic acids, with the DNA.

Certains de ces extraits bruts de couleur sombre ne migrent pas de la manière attendue dans les gels d'agarose. Some of these dark-colored crude extracts do not migrate in the expected manner in the agarose gels.

En conséquence, les solutions d'ADN brut ont été purifiées (protocole B) avant quantification. Les électrophorèses sur gel des solutions purifiées obtenues après les différents traitements de lyse sont exemplifiées sur le sol n 3 (figure 2). As a result, the crude DNA solutions were purified (protocol B) before quantification. The gel electrophoresis of the purified solutions obtained after the various lysis treatments are exemplified on the soil n 3 (Figure 2).

Une comparaison visuelle au rayonnement ultra-violet des intensités de l'ADN coloré a permis une estimation semi-quantitative de l'efficacité des traitements. De plus, la présence de profils de migration de tailles multiples de fragments (bandes discrètes) d'ADN et la disparition des fragments longs indique qu'une dégradation de l'ADN a eu lieu. A visual comparison with ultraviolet radiation of the intensities of the colored DNA allowed a semi-quantitative estimation of the effectiveness of the treatments. In addition, the presence of multiple size DNA fragment (discrete band) migration profiles and the disappearance of the long fragments indicates that DNA degradation has occurred.

Aucun ADN n'a pu être extrait du sol argileux n 5. No DNA could be extracted from # 5 clay soil.

* Une quantification plus précise de l'ADN de tous les sols, extrait selon les protocoles n 1 à 5, a été réalisée par hybridation sur tache (Dot Blot) sans étape d'amplification PCR préalable et en utilisant une sonde oligonucléotidique complémentaire d'une séquence hautement conservée de la région d'ADNr 16S (sonde FGPS 431, tableau 2). A more precise quantification of the DNA of all the sols, extracted according to the protocols n 1 to 5, was carried out by hybridization on stain (Dot Blot) without step of preliminary PCR amplification and by using an oligonucleotide probe complementary of a highly conserved sequence of the 16S rDNA region (FGPS 431 probe, Table 2).

L'ADN a été détecté dans les extraits de tous les sols après chacune des différentes étapes de lyse, à l'exception du sol argileux n 5. DNA was detected in the extracts of all soils after each of the different lysis steps except for # 5 clay soil.

Les résultats concordent avec les estimations réalisées après gel d'électrophorèse. The results are consistent with estimates made after electrophoresis gel.

Afin de comparer avec une méthode indépendante pour la quantification, l'ADN extrait selon le protocole n 2 (tous les sols sauf le sol n 5) a été également quantifié en utilisant une méthode colorimétrique de détection de l'ADN (RICHARD, 1974). In order to compare with an independent method for quantification, the DNA extracted according to protocol n 2 (all soils except soil n 5) was also quantified using a colorimetric method of DNA detection (RICHARD, 1974) .

On a trouvé une bonne corrélation (r = 0,88) entre l'ADN quantifié en utilisant cette technique colorimétrique et les résultats obtenus par hybridation de type Dot Blot/radio-imagerie, confirmant A good correlation (r = 0.88) was found between the quantified DNA using this colorimetric technique and the results obtained by Dot Blot / radioimaging hybridization, confirming

l'hypothèse selon laquelle le nombre de copies moyen des bactéries du sol (rrn) est de 7. the assumption that the average copy number of soil bacteria (rrn) is 7.

L'hybridation (Dot Blot) a montré que les quantités d'ADN extracellulaires, comme déterminé par extraction sans traitement de lyse (protocole n 1), allait de 4ug/g pour le sol acide (n 6) à 36 g/g pour le sol n 3 (tableau 3). Hybridization (Dot Blot) showed that the amounts of extracellular DNA, as determined by extraction without lysis treatment (Protocol No. 1), ranged from 4 μg / g for the acidic soil (No. 6) to 36 μg / g for soil no. 3 (Table 3).

Le broyage du sol (protocole n 2 ) a augmenté les quantités d'ADN extrait à partir de tous les sols (p.ex. 26 g/g de sol) pour le sol n 6 et 59 g/g de sol (pour le sol n 3) (tableau 3 ; figure 2). Soil crushing (Protocol 2) increased the amounts of DNA extracted from all soils (eg 26 g / g soil) for soil 6 and 59 g / g soil (for soil 3) (Table 3, Figure 2).

Pour les deux traitements de broyage (voir la section Matériel et Méthodes) la migration discrète d'ADN a été détectée sur les gels d'agarose, indiquant que les molécules d'ADN ont été partiellement dégradées (figure 2). For both grinding treatments (see Materials and Methods section) the discrete migration of DNA was detected on the agarose gels, indicating that the DNA molecules were partially degraded (Figure 2).

La taille des fragments d'ADN est comprise entre 20 et 0,2 kb. The size of the DNA fragments is between 20 and 0.2 kb.

L'intensité de bande des fragments les plus petits est très faible, indiquant que la majeure partie des fragments ont une taille bien supérieure à 1 kb. The band intensity of the smaller fragments is very small, indicating that most of the fragments are much larger than 1 kb.

Le protocole n 3 comprend une étape d'homogénéisation dans un dispositif mixeur de type Ultraturax après l'addition du tampon d'extraction aux échantillons de sol. Cette étape conduit à une augmentation des quantités d'ADN extrait, comme déterminé par hybridation sur tache (Dot Blot) pour deux des sols (le terreau sablonneux n 3 et le sol acide n 6), alors que les deux sols riches en matière organique (sols n 1 et n 2) ont conduit à l'obtention de quantités plus faibles d'ADN. Protocol No. 3 comprises a homogenization step in an Ultraturax type mixing device after the addition of the extraction buffer to the soil samples. This step leads to an increase in the amount of DNA extracted, as determined by dot blotting for two of the soils (sandy soil # 3 and acid soil # 6), while both soils rich in organic matter. (# 1 and # 2 sols) led to lower amounts of DNA.

Les protocoles n 4a et n 4b ont permis d'évaluer l'influence de deux types de sonication sur les rendements en ADN à partir de sols préalablement broyés et homogénéisés . Protocols # 4a and # 4b evaluated the influence of two types of sonication on DNA yields from previously ground and homogenized soils.

La sonication n'a pas eu d'effet positif sur le rendement en ADN, comparé au protocole n 3, excepté pour le sol n 6. Toutefois, l'efficacité de lyse des deux types de sonicateur diffèrent. Pour les sols n 2, 3 et 4, les quantités d'ADN extraits les plus grandes ont été obtenues en utilisant la micropointe de titane (tableau 3 ; 2), alors que pour les sols n 1 et n 6, le rendement en ADN était supérieur en utilisant le dispositif Cup Horn. Sonication did not have a positive effect on DNA yield, compared to protocol # 3, except for soil # 6. However, the lysis efficiency of both types of sonicator differed. For soils # 2, # 3 and # 4, the largest extracted DNA amounts were obtained using titanium microtip (Table 3; 2), whereas for # 1 and # 6 sols, DNA yield was was superior using the Cup Horn device.

Des résultats contradictoires ont été également obtenus lorsque l'on a ajouté une étape de lyse enzymatique/chimique (protocoles n 5a et Conflicting results were also obtained when an enzymatic / chemical lysis step was added (Protocols 5a and 5a).

5b) après l'étape de sonication : certains cas, les quantités d'ADN extraites ont été plus grandes que celles récupérées selon les protocoles n 4a et 4b, alors que dans d'autres cas les rendements étaient moindres (tableau 3). 5b) after the sonication step: in some cases, the quantities of DNA extracted were larger than those recovered according to protocols 4a and 4b, while in other cases the yields were lower (Table 3).

2. 3 COMPTAGE DIRECT DES MICRO-ORGANISMES: Des comptes au microscope du nombre total de cellules bactériennes après coloration à l'acridine orange ont été réalisés pour tous les sols, avant et après broyage. 2. 3 DIRECT COUNTING OF MICROORGANISMS: Microscopic counts of the total number of bacterial cells after staining with acridine orange were performed for all soils, before and after grinding.

Avant broyage, le nombre de bactéries par gramme de poids sec du sol allait de 1,4 x 109 (+/- 0,4) dans le sol tropical n 5 à 10 x 109 (+/- 0,7) dans le sol provenant de la Côte Saint-André (sol n 3) (tableau 1). Before grinding, the number of bacteria per gram of soil dry weight ranged from 1.4 x 109 (+/- 0.4) in tropical soil n 5 to 10 x 109 (+/- 0.7) in soil from Côte Saint-André (soil 3) (Table 1).

Après broyage, les nombres de cellules ont été respectivement de 45, 74, 75, 54, 34 et 75% des valeurs initiales pour les sols n 1 à 6. After grinding, the cell numbers were respectively 45, 74, 75, 54, 34 and 75% of the initial values for soils 1 to 6.

2. 4 NUMERATION DES ACTINOMYCETES CULTIVABLES APPARTENANT A DIFFERENTS GENRES : modification dans les populations d'actinomycètes dans le sol n 3 a été remarquée après les différents traitements de lyse (figure 3). 2. 4 NUMBER OF CULTURABLE ACTINOMYCETES BELONGING TO DIFFERENT GENRES: modification in the populations of actinomycetes in the soil n 3 was noticed after the various lysis treatments (figure 3).

Par exemple, les colonies de Streptomyces sp. dominaient la flore viable d'actinomycètes lorsqu'aucun traitement de lyse n'est appliqué (protocole n 1), et représentaient 65% du nombre total de colonies identifiées. Après broyage, le pourcentage de colonies de Streptomyces a diminué pour atteindre 51%, alors que la proportion de colonies appartenant au genre Micromonospora a augmenté de 14% à 41%. For example, colonies of Streptomyces sp. dominated the viable actinomycete flora when no lysis treatment was applied (Protocol No. 1), and accounted for 65% of the total number of colonies identified. After milling, the percentage of Streptomyces colonies decreased to 51%, while the proportion of colonies belonging to the genus Micromonospora increased from 14% to 41%.

La lyse chimique/enzymatique (protocoles 5a et 5b) est apparue comme particulièrement efficace pour la lyse des streptomycètes. Lorsque tous les traitements de lyse ont été appliqués, y compris une lyse chimique/enzymatique (protocoles 5a et 5b), la microflore d'actinomycètes, qui comprenait encore plus de 106 CFU/g de sol, était dominée par les espèces appartenant au genre Micromonospora, alors qu'aucune ou très peu de colonies de Streptomyces ont été récupérées. Chemical / enzymatic lysis (Protocols 5a and 5b) appeared to be particularly effective for lysis of streptomycetes. When all lysis treatments were applied, including chemical / enzymatic lysis (Protocols 5a and 5b), the actinomycete microflora, which still contained more than 106 CFU / g soil, was dominated by species belonging to the genus. Micromonospora, while none or very few Streptomyces colonies were recovered.

Les organismes appartenant aux genres tels que Streptosporangium, Actinomadura, Microbispora, Dactilosporangium et Actinoplanes sont apparus sur les plaques en faible nombre (2-8% du nombre total de colonies identifiées) après broyage, homogénéisation Organisms belonging to genera such as Streptosporangium, Actinomadura, Microbispora, Dactilosporangium and Actinoplanes appeared on the plates in low numbers (2-8% of the total number of colonies identified) after grinding, homogenization

avec le dispositif Ultraturrax, et sonication, mais étaient généralement absents lorsque ces traitements étaient combinés avec une lyse chimique/enzymatique. with the Ultraturrax device, and sonication, but were generally absent when these treatments were combined with chemical / enzymatic lysis.

Le nombre total de bactéries cultivables restant après chaque traitement de lyse (protocoles 2 à 5) a été aussi recherché pour le sol n 4. Les résultats indiquent que le nombre de bactéries cultivables ne décroît pas avec l'intensité des traitements de lyse (environ 2 x 106 CFU/g de sol dans tous les cas, et également lorsqu'un traitement n'est appliqué, tel que selon le protocole n 1). The total number of cultivable bacteria remaining after each lysis treatment (protocols 2 to 5) was also searched for soil n 4. The results indicate that the number of culturable bacteria does not decrease with the intensity of the lysis treatments (approximately 2 x 106 CFU / g soil in all cases, and also when treatment is applied, as in protocol 1).

L'obtention de ces faibles valeurs de CFU est probablement due au fait que du sol sec a été utilisé et que seules les bactéries les plus résistantes se sont multipliées sur les plaques. Le nombre d'actinomycètes formant colonies était généralement plus grand que celui des CFU total (toutes les bactéries) du fait qu'une étape de germination de spores, comprise dans le protocole de détection des actynomycètes, manquait lors du contrôle des bactéries totales 2. 5 RECUPERATION DE L'ADN DU PHAGE LAMBDA AJOUTE:
Le but de ces expériences était d'estimer de quelle manière des traitements de lyse successifs pouvaient affecter la récupération d'ADN nu , et si ces traitements successifs de lyse contribuaient à sa dégradation. The achievement of these low CFU values is probably due to the fact that dry soil was used and only the most resistant bacteria multiplied on the plates. The number of colony-forming actinomycetes was generally greater than that of total CFUs (all bacteria) because a spore germination step, included in the actinomycete detection protocol, was missing in the control of total bacteria 2. RECOVERY OF LAMBDA ADDED PHAGE DNA:
The purpose of these experiments was to estimate how successive lysis treatments could affect the recovery of naked DNA, and whether these successive lysis treatments contributed to its degradation.

L'ADN pouvait être soit une fraction d'ADN extracellulaire libérée à partir d'organismes déjà morts, qui peuvent persister dans le sol pendant des mois (WARD et al., 1990), soit de l'ADN libéré à partir d'organismes lysés facilement pendant les premières étapes du traitement. Afin de simuler cette situation, de l'ADN de phage lambda digéré par Hindlll a été ajouté, à diverses concentrations, aux sols avant et après broyage. En plus du broyage, une combinaison des autres traitements de lyse a été testée, y compris la sonication (dispositif Cup Horn, voir protocole n 4b) et des chocs thermiques (voir la section Matériel et Méthodes). DNA could be either a fraction of extracellular DNA released from already dead organisms, which can persist in the soil for months (WARD et al., 1990), or DNA released from organisms lysed easily during the first stages of treatment. In order to simulate this situation, HindIII digested lambda phage DNA was added at various concentrations to the soil before and after grinding. In addition to grinding, a combination of other lysis treatments was tested, including sonication (Cup Horn device, see Protocol No. 4b) and thermal shock (see Materials and Methods section).

Après extraction, des fractions aliquotes qui devraient théoriquement contenir de 25 à 150 ng d'ADN de phage lambda ont été analysées par électrophorèse sur gel. Aucun fragment d'ADN spécifique du phage lambda n'a pu être observé lorsque l'ADN a été inoculé dans les After extraction, aliquots which should theoretically contain 25 to 150 ng of lambda phage DNA were analyzed by gel electrophoresis. No DNA fragment specific for lambda phage could be observed when the DNA was inoculated into the

échantillons de sol préalablement au broyage, indépendamment de la dose ou du type de sol. soil samples prior to grinding, regardless of the rate or type of soil.

Lorsque l'ADN a été ajouté après broyage, et extrait sans étape de traitement de lyse additionnelle, les profils spécifiques d'ADN de phage lambda ont été détectés dans les extraits de quatre des cinq sols testés. When the DNA was added after grinding, and extracted without additional lysis treatment step, specific lambda phage DNA profiles were detected in the extracts of four of the five soils tested.

Dans tous ces cas, une relation directe de cause à effet a été obtenue entre la quantité d'ADN ajoutée et l'intensité des signaux sur les gels d'agarose. Les intensités des signaux étaient, cependant, inférieures aux intensités de signaux attendues si on les compare à celles des standards moléculaires. In all these cases, a direct cause-and-effect relationship was obtained between the amount of DNA added and the intensity of the signals on the agarose gels. The signal intensities were, however, less than the expected signal strengths when compared to those of the molecular standards.

De plus, la bande à 23 kb était absente dans plusieurs cas, indiquant que les longs fragments étaient préférentiellement adsorbés aux particules du sol, ou étaient plus sensibles à la dégradation, comparés aux fragments courts. In addition, the 23 kb band was absent in several cases, indicating that long fragments were preferentially adsorbed to soil particles, or were more susceptible to degradation, compared to short fragments.

Aucune bande n'a été détectée dans les échantillons de sol tropical n 5 qui est caractérisé par une très haute teneur en argile (tableau 1). No bands were detected in samples of tropical soil n 5 which is characterized by a very high clay content (Table 1).

Pour une quantification plus précise, la récupération d'ADN a été déterminée sur un dispositif d'imagerie par phosphorescence (phosphoimager) après hybridation en tache (Dot Blot). Selon cette technique, l'ADN a été détecté dans tous les échantillons, y compris ceux qui avaient été inoculés avant broyage, à l'exception du sol n 5 dans lequel aucun ADN n'a pu être détecté. For more precise quantification, DNA recovery was determined on a phosphorescence (phosphoimager) imaging device after dot blotting. According to this technique, DNA was detected in all samples, including those that had been inoculated before grinding, except for sol No. 5 in which no DNA could be detected.

Dans tous les autres sols, la quantité d'ADN extrait augmente avec l'augmentation de taille de l'inoculum (figures 4a-d). In all other soils, the amount of DNA extracted increases with increasing size of the inoculum (Figures 4a-d).

Cependant, les récupérations d'ADN de phage lambda étaient faibles. Lorsque le broyage était le seul traitement de lyse appliqué, les récupérations étaient comprises entre 0,6 et 5,9% de l'ADN ajouté lorsque celui-ci était ajouté avant broyage, et de 3,6 à 24% de l'ADN ajouté lorsque ce dernier était ajouté après broyage. Les plus hauts niveaux de récupération ont été obtenus à partir du sol n 2. However, lambda phage DNA recoveries were low. When grinding was the only lysis treatment applied, the recoveries ranged from 0.6 to 5.9% of the DNA added when it was added before grinding, and from 3.6 to 24% of the DNA. added when the latter was added after grinding. The highest levels of recovery were obtained from soil n 2.

L'électrophorèse sur gel de fractions aliquotes d'échantillons traités par choc thermique et sonication n'a permis d'observer des bandes d'ADN dans aucun des échantillons, y compris l'essai dans lequel l'ADN avait été ajouté après broyage. Les expériences d'hybridation en tache (Dot Blot) ont confirmé ces résultats. Gel electrophoresis of aliquots of heat shock and sonication samples failed to observe DNA bands in any of the samples, including the assay in which the DNA was added after grinding. Spot hybridization experiments (Dot Blot) confirmed these results.

Les signaux d'hybridation obtenus à partir de suspensions de sol qui ont été traitées par chocs thermiques et sonication ont été, tout au plus, faibles. Hybridization signals obtained from soil suspensions that have been treated by thermal shock and sonication have been, at most, low.

L'échantillon présentant la plus forte quantité d'ADN (15 g d'ADN/g de poids sec du sol) était le seul pour lequel le signal obtenu était sensiblement différent du niveau du bruit de fond. The sample with the highest amount of DNA (15 g DNA / g dry weight of soil) was the only one for which the signal obtained was significantly different from the level of the background noise.

Aucune différence,( ou de faibles différences) n'a été observée entre les échantillons traités par choc thermique et ceux traités par chocs thermiques et sonication, indiquant que les chocs thermiques ont un effet préjudiciable sur l'ADN. Les récupérations les meilleures ont été observées pour le sol n 2, qui a la plus forte teneur en matière organique (tableau 1), alors qu'aucun ADN n'a été récupéré à partir du sol argileux n 5. No differences (or small differences) were observed between heat shock treated samples and those treated with thermal shock and sonication, indicating that thermal shocks have a detrimental effect on DNA. The best recoveries were observed for soil # 2, which has the highest organic matter content (Table 1), while no DNA was recovered from # 5 clay soil.

Des expériences additionnelles ont été réalisées avec des échantillons non broyés de sols n 4 et n 5, qui ont été ensemencés avec 20 et 50 g d'ADN de phage lambda par gramme de sol. Additional experiments were performed with unmilled samples of # 4 and # 5 sols, which were inoculated with 20 and 50 g of lambda phage DNA per gram of soil.

Les échantillons ont été extraits immédiatement ou après une période d'incubation d'une heure à 28 C, puis les extraits d'ADN ont été purifiés et analysés par électrophorèse sur gel. The samples were extracted immediately or after an incubation period of one hour at 28 ° C, then the DNA extracts were purified and analyzed by gel electrophoresis.

L'incubation du sol n 4 pendant une heure après l'inoculation n'a pas conduit à des profils qualitativement ou quantitativement différents de ceux obtenus sans incubation ou de ceux observés antérieurement lorsque l'ADN avait ajouté après broyage. Incubation of soil No. 4 for one hour after inoculation did not lead to qualitatively or quantitatively different profiles from those obtained without incubation or from those previously observed when DNA was added after milling.

Ces résultats indiquent que la dégradation enzymatique par les nucléases du sol ne seraient pas impliquée dans le faible taux de récupération d'ADN. De plus, l'absence d'étape de broyage ne permet pas une augmentation de la récupération de l'ADN à partir du sol n 5, indiquant que les modifications de structure du sol dûes au broyage n'augmentent pas significativement l'adsorption des acides nucléiques sur les colloïdes. These results indicate that enzymatic degradation by soil nucleases would not be involved in the low recovery rate of DNA. In addition, the absence of grinding step does not allow an increase in the recovery of DNA from soil n 5, indicating that the changes in soil structure due to grinding do not significantly increase adsorption of the soil. nucleic acids on colloids.

2. 6 SATURATION DES SITES D'ADSORPTION AVEC L'ARN : plupart des profils obtenus sur les gels d'agarose ne diffèrent pas significativement des profils précédents dans lesquels le traitement d'ARN n'a pas été effectué. 2. 6 SATURATION OF ADSORPTION SITES WITH RNA: Most of the profiles obtained on the agarose gels do not differ significantly from the previous profiles in which the RNA treatment was not performed.

Par exemple, aucune bande n'a été détectée à partir du sol riche en argile n 5, indépendamment des concentrations d'ARN et des concentrations d'ADN de phage lambda utilisées. For example, no band was detected from soil rich in # 5 clay, regardless of the RNA concentrations and lambda phage DNA concentrations used.

De plus, les bandes spécifiques d'ADN de phage lambda digérées par Hindlll restaient indétectables dans le terreau sablonneux traité par l'ARN (sol n 4) lorsque l'ARN est ajouté avant le broyage. In addition, the Hindlll-digested lambda phage specific DNA bands remained undetectable in RNA-treated sandy loam (sol 4) when RNA was added prior to grinding.

L'intensité des bandes obtenues à partir d'échantillons ensemencés avec l'ADN après broyage augmente avec la concentration d'ARN, indiquant que le traitement pourrait avoir un effet positif. The intensity of the bands obtained from samples seeded with the DNA after grinding increases with the concentration of RNA, indicating that the treatment could have a positive effect.

Cependant, les résultats après hybridation et analyse par imagerie à phosphorescence n'ont pas confirmé les résultats de l'électrophorèse. Par exemple, l'effet positif du traitement d'ARN sur la récupération d'ADN à partir du terreau argileux, lorsque l'ADN a été ajouté après broyage, n'apparaît pas clairement. However, the results after hybridization and phosphorescence imaging analysis did not confirm the results of electrophoresis. For example, the positive effect of RNA treatment on DNA recovery from clay loam, when DNA has been added after milling, does not appear clearly.

D'un autre côté, un effet positif de l'ARN a été trouvé pour le sol riche en argile (n 5) lorsque l'ADN a été ajouté après broyage. On the other hand, a positive effect of RNA was found for clay-rich soil (# 5) when DNA was added after milling.

Bien que les signaux d'hybridation pour les échantillons contrôle ne diffèrent pas des niveaux de bruit de fond, des quantités significatives d'ADN ont été libérées à partir des échantillons traités par l'ARN, et les signaux ont augmenté avec la quantité d'ADN ajoutée ainsi qu'avec la concentration d'ARN. Although the hybridization signals for the control samples do not differ from the background levels, significant amounts of DNA were released from the RNA-treated samples, and the signals increased with the amount of DNA added as well as the RNA concentration.

Cependant, même pour la plus forte concentration d'ARN (100 mg/g de poids de sol sec) le taux de récupération n'a jamais dépassé 3%. However, even for the highest RNA concentration (100 mg / g dry weight) the recovery rate never exceeded 3%.

2. 7 PURIFICATION DES EXTRAITS BRUTS D'ADN: Des quatre protocoles testés, la meilleure amplification des extraits d'ADN non dilués (1 l d'extrait dans 50 l de mélange PCR) a été observée après l'élution à travers des colonnes de type Microspin S400 suivie d'une élution à travers une colonne de type Elutip d, comme le montre l'électrophorèse sur gel des produits PCR. 2. 7 PURIFICATION OF DRAFT DNA EXTRACTS: Of the four protocols tested, the best amplification of the undiluted DNA extracts (1 l of extract in 50 l of PCR mixture) was observed after elution through columns Microspin S400 followed by elution through a column of Elutip d type, as shown by gel electrophoresis of PCR products.

L'ADN purifié par le système aqueux double phase (protocole C) a donné des quantités plus faibles de produits PCR après amplification à partir d'extrait d'ADN non dilué. DNA purified by the aqueous double-phase system (protocol C) gave smaller amounts of PCR products after amplification from undiluted DNA extract.

Aucun produit d'amplification n'a pu être obtenu à partir des extraits non dilués après amplification à la suite de la mise en oeuvre des protocoles A ou B. En conséquence, le protocole B (voir section Matériels No amplification product could be obtained from undiluted extracts after amplification following the implementation of Protocols A or B. Therefore, Protocol B (see Materials section

et Méthodes) a été utilisé pour toutes les expériences dans lesquelles les amplifications PCR et/ou les hybridations sur tâche (Dot Blot) ont été réalisées. and Methods) was used for all experiments in which PCR amplifications and / or dot blots were performed.

2. 8 QUANTIFICATION PAR PCR ET HYBRIDATION: La première étape était de déterminer si les quantités de produit PCR étaient proportionnelles au nombre de molécules d'ADN cibles initialement présentes dans le tube réactionnel. De l'ADN de Streptosporangium fragile a été utilisé comme cible (voir section Matériels et Méthodes). 2. 8 PCR QUANTIFICATION AND HYBRIDIZATION: The first step was to determine if the amounts of PCR product were proportional to the number of target DNA molecules initially present in the reaction tube. Fragile Streptosporangium DNA was used as a target (see Materials and Methods section).

Les amorces utilisées ont été les amorces FGPS122 et FGPS350 (tableau 2). L'électrophorèse sur gel des produits PCR a montré que l'intensité de bande augmente avec l'accroissement de la concentration des cibles. Les produits PCR ont été hybridés avec la sonde oligonucléotidique FGPS643 (tableau 2), et les signaux ont été quantifiés par imagerie par phosphorescence (phospho-imaging). The primers used were primers FGPS122 and FGPS350 (Table 2). Gel electrophoresis of PCR products showed that band intensity increases with increasing target concentration. The PCR products were hybridized with the oligonucleotide probe FGPS643 (Table 2), and the signals were quantified by phosphorescence (phospho-imaging) imaging.

On a trouvé une bonne corrélation (r2= 0,98) entre le log[nombre de cibles] et le log[intensité du signal d'hybridation]. A good correlation (r2 = 0.98) was found between log [number of targets] and log [intensity of hybridization signal].

On a ensuite recherché si l'efficacité de l'amplification PCR était affectée par les acides humiques et l'ADN non cible. Lorsqu'on l'analyse par électrophorèse sur gel, l'intensité accrue des bandes des produits PCR, correspondant aux différentes quantités d'ADN cible, était conservée lorsque l'amplification était réalisée avec des solutions d'ADN auxquelles on avait ajouté des extraits de sol traités à la DNase, contenant des acides humides à des concentrations allant jusqu'à 8ng dans le mélange PCR d'un volume de 50 l. It was then investigated whether the efficiency of PCR amplification was affected by humic acids and non-target DNA. When analyzed by gel electrophoresis, the increased intensity of the PCR product bands, corresponding to the different amounts of target DNA, was retained when the amplification was performed with DNA solutions to which extracts had been added. of DNase treated soil, containing wet acids at concentrations up to 8 ng in the PCR mixture of 50 l volume.

Avec 20 ng d'acide humique dans le mélange PCR, les bandes correspondant aux faibles niveaux d'ADN cible ont disparu, et à des concentrations d'acide humique de 80 ng et à des concentrations supérieures, aucune bande n'était visible . With 20 ng of humic acid in the PCR mixture, the bands corresponding to low levels of target DNA disappeared, and at humic acid concentrations of 80 ng and at higher concentrations no band was visible.

Les quantités variées d'ADN cible de S.fragile ont permis de fournir les quantités attendues de produit PCR lorsque, avant amplification, l'ADN de S. fragile a été mélangé avec de l'ADN de Streptomyces hygroscopicus et ajouté au mélange PCR de 50 l dans une gamme de 100 pg à 1 g afin de simuler l'ADN non-cible libéré à partir de la microflore du sol. The varied amounts of S.fragile target DNA provided the expected amounts of PCR product when, prior to amplification, the fragile S.sub.S DNA was mixed with Streptomyces hygroscopicus DNA and added to the PCR mixture of 50 l in a range of 100 μg to 1 g to simulate the non-target DNA released from the soil microflora.

2. 9 QUANTIFICATION DES ACTINOMYCETES INDIGENE DU SOL APRES DIFFERENTS TRAITEMENTS DE LYSE : Ona appliqué le protocole de purification D suivi d'une amplification par PCR comme décrit ci-dessus afin de quantifier les actinomycètes appartenant au genre Streptosporangium dans le sol n 3 après extraction conformément aux protocoles n 1, 2,3, 5a et 5b (figure 5). 2. 9 QUANTIFICATION OF INDIGENE ACTINOMYCETES OF SOIL AFTER DIFFERENT LYSE TREATMENTS: The purification protocol D followed by a PCR amplification as described above was used to quantify the actinomycetes belonging to the genus Streptosporangium in soil n 3 after extraction. according to Protocols 1, 2,3, 5a and 5b (Figure 5).

Après broyage, (protocole n 2) la quantité d'ADN cible provenant de cet actinomycète a été estimée par hybridation (Dot Blot) et radioimagerie comme étant de 2,5 +/- 1,3 ng/g de poids de sol sec. After grinding, (Protocol No. 2) the amount of target DNA from this actinomycete was estimated by hybridization (Dot Blot) and radioimaging as 2.5 +/- 1.3 ng / g dry soil weight.

Si l'on postule que le contenu en ADN est de 10 fg par cellule, comme pour Streptomyces (Gladek et al. 1984), cette valeur correspond à approximativement 2,5 x 105 génomes. Des valeurs similaires ont été obtenues après les autres traitements de lyse (respectivement 2,6 +/-1,1 et 1,8 +/- 1,3 ng d'ADN/g de sol sec en utilisant respectivement les protocoles 3 et 4b). Assuming that the DNA content is 10 fg per cell, as for Streptomyces (Gladek et al., 1984), this value corresponds to approximately 2.5 x 105 genomes. Similar values were obtained after the other lysis treatments (respectively 2.6 +/- 1.1 and 1.8 +/- 1.3 ng of DNA / g of dry soil using respectively protocols 3 and 4b ).

2. 10 EFFICACITE DE LA RECUPARATION D'ADN A PARTIR DE SOLS PREALABLEMENT INOCULES AVEC DES BACTERIES: Trois sols (n 2, 3 et 5) ont été inoculés avec des spores ou des hyphae de Streptomyces lividans à différentes concentrations (voir section Matériel et Méthodes). 2. 10 EFFECTIVENESS OF DNA RECOVERY FROM SOILS PREVIOUSLY INOCULATED WITH BACTERIA: Three soils (n 2, 3 and 5) were inoculated with spores or hyphae of Streptomyces lividans at different concentrations (see section Material and Methods).

Les quantités de mycélium ajoutées au sol (figure 6b) correspondent au nombre de spores inoculées dans le milieu de germination. Approximativement 50% de ces spores ont germé. Le nombre exact de cellules dans les hyphae des spores germinées n'a pas été déterminé. En conséquence, les quantités de spores et de mycélium ensemencées dans les sols ne sont pas directement comparables. The amounts of mycelium added to the soil (FIG. 6b) correspond to the number of spores inoculated in the germination medium. Approximately 50% of these spores germinated. The exact number of cells in the hyphae of germ spores has not been determined. As a result, the amounts of spores and mycelia seeded in soils are not directly comparable.

Pour chaque échantillon de sol, le protocole d'extraction n 6, la méthode de purification D, et l'amplification PCR combinée avec l'hybridation sur tache (Dot Blot) et l'imagerie par phosphorescence (phospho-imaging) ont été utilisés pour dénombrer les ADNs cibles spécifiques qui avaient été libérés. L'ADN extrait peut être clairement distingué du bruit de fond seulement lorsque le nombre de spores ajoutées dépasse 105 pour les sols n 3 et n 5 et 107 pour le sol n 2 (figure 6a). For each soil sample, extraction protocol No. 6, purification method D, and PCR amplification combined with dot blotting and phosphorescence imaging were used. to count the specific target DNAs that had been released. The extracted DNA can be clearly distinguished from the background only when the number of spores added exceeds 105 for the n 3 and n 5 soils and 107 for the n 2 sol (Figure 6a).

Lorsque le mycélium est ajouté, l'ADN extrait peut être détecté au-delà d'une quantité correspondant à 103 spores/g de sol pour les sols n 2 et n 3, et au-delà de 107 spores/g pour le sol n 5 (figure b). When the mycelium is added, the extracted DNA can be detected beyond an amount corresponding to 103 spores / g of soil for soils n 2 and n 3, and more than 107 spores / g for soil n 5 (Figure b).

Au-dessus du niveau de détection, le signal d'hybridation augmente avec des quantités croissantes des cellules inoculées. Above the detection level, the hybridization signal increases with increasing amounts of the inoculated cells.

Pour l'inoculum de spores, une augmentation de 100 fois dans le nombre de cellules ensemencées conduit à une augmentation de presque 100 du rendement d'ADN. Cette augmentation est clairement inférieure lorsque les hyphae sont inoculées, particulièrement dans les sols n 2 et n 3 (figure 6). For the spore inoculum, a 100-fold increase in the number of seeded cells leads to an almost 100-fold increase in the yield of DNA. This increase is clearly lower when hyphae are inoculated, particularly in n 2 and n 3 soils (Figure 6).

Au contraire, les résultats obtenus lorsque l'ADN de phage lambda a été utilisé comme inoculum, l'ADN a également été récupéré à partir du sol riche en argile (n 5) lorsque les cellules bactériennes ont été utilisées comme inoculum. Cependant, pour ce dernier aussi, le traitement par l'ARN a augmenté la récupération d'ADN de Streptomyces à partir de ce sol à la fois pour les spores et le mycélium (figure 6). In contrast, the results obtained when lambda phage DNA was used as an inoculum, DNA was also recovered from clay-rich soil (# 5) when bacterial cells were used as inoculum. However, for the latter as well, RNA treatment increased the recovery of Streptomyces DNA from this soil for both spores and mycelium (Figure 6).

Le fait d'ensemencer des sols avec des cellules végétatives de Bacillus anthracis a fourni des taux de récupération similaires à ceux obtenus pour Streptomyces. Seeding of soils with vegetative cells of Bacillus anthracis provided similar recovery rates to those obtained for Streptomyces.

De plus, les taux de récupération d'ADN à partir du sol n 5 ont augmenté après traitement par l'ARN également pour cet inoculum. In addition, DNA recovery rates from soil no. 5 increased after RNA treatment also for this inoculum.

Exemple 2: Construction d'une banque d'ADN de faible poids moléculaire (<10 kb) à partir d'un sol contaminé par du lindane : clonage et expression du gène linA Cet exemple décrit la construction d'une librairie d'ADN du sol dans E. coli. Il permet de démontrer le clonage et l'expression de gènes de petite taille issus d'une microflore non cultivable . Example 2 Construction of a Low Molecular Weight DNA Library (<10 kb) from a Lindane Contaminated Soil: Cloning and Expression of the LinA Gene This example describes the construction of a DNA library of soil in E. coli. It demonstrates the cloning and expression of small genes from a non-culturable microflora.

Le lindane est un pesticide organochloré, récalcitrant à la dégradation et persistant dans l'environnement. En aérobie, sa biodégradation est catalysée par une déhydrochlorinase, codée par le Lindane is an organochlorine pesticide, recalcitrant to degradation and persistent in the environment. Aerobically, its biodegradation is catalyzed by a dehydrochlorinase, coded by the

gène linA, permettant de transformer le lindane en 1,2,4-trichlorobenzène. LinA gene, which transforms lindane into 1,2,4-trichlorobenzene.

Le gène linA n'a été identifié que parmi deux souches isolées du sol : Sphingomonas paucimobilis, isolé au Japon (Seeno et Wada 1989, Imai et al 1991, Nagata et al 1993) et Rhodanobacter lindaniclasticus isolé en France (Thomas et al 1996, Nalin et al 1999). The linA gene has only been identified from two strains isolated from the soil: Sphingomonas paucimobilis, isolated in Japan (Seeno and Wada 1989, Imai et al 1991, Nagata et al 1993) and Rhodanobacter lindaniclasticus isolated in France (Thomas et al 1996, Nalin et al 1999).

Pourtant le potentiel de dégradation du lindane, mis en évidence par dosage des ions chlorures libérés et amplification par PCR du gène linA à partir de sols ayant été en contact ou non avec du lindane, semble être répandu plus largement dans l'environnement (BiesiekierskaGalguen, 1997). However, the potential for degradation of lindane, demonstrated by assaying chloride ions released and PCR amplification of the linA gene from soils that have been in contact with lindane or not, seems to be spread more widely in the environment (BiesiekierskaGalguen, 1997).

1. Extraction directe d'ADN de sol
Les sols secs sont broyé pendant 10 minutes dans un broyeur à force centrifuge Restch équipé 6 billes de tungstène. 10 grammes de sol broyé sont mis en suspension dans 50 ml de tampon TENP pH 9 (Tris 50 mM, EDTA 20 mM, NaCI 100 mM, polyvinylpolypirrolidone 1% w/v), et homogénéisés au vortex pendant 10 min. 1. Direct extraction of soil DNA
Dry soils are milled for 10 minutes in a Restch centrifugal force mill equipped with 6 tungsten balls. 10 grams of ground sol are suspended in 50 ml of TENP buffer pH 9 (50 mM Tris, 20 mM EDTA, 100 mM NaCl, 1% w / v polyvinylpolypyrrolidone) and vortexed for 10 min.

Après centrifugation de 5 minutes, 4000 g à 4 C, le surnageant est précipité à l'acétate de sodium (3M, pH 5. 2) et à l'isopropanol, pour être repris dans du tampon TE stérile (Tris 10 mM, EDTA 1 mM, pH 8.0). After centrifugation for 5 minutes, 4000 g at 4 ° C., the supernatant is precipitated with sodium acetate (3M, pH 5. 2) and isopropanol, to be taken up in sterile TE buffer (10 mM Tris, EDTA 1mM, pH 8.0).

L'ADN extrait est ensuite purifié sur colonne de tamisage moléculaire S400 (Pharmacia) et sur colonne échangeuse d'ions Elutip d (Schleicher et Schuell), selon les instructions des fabricants, puis conservé dans du TE. The extracted DNA is then purified on a S400 molecular sieve column (Pharmacia) and on an Elutip d ion exchange column (Schleicher and Schuell), according to the manufacturers' instructions, and then stored in TE.

2. Construction de la banque d'ADN extrait du sol dans le vecteur pBluescript SK-
Le vecteur pBluescript SK- et l'ADN extrait du sol sont chacuns digérés par les enzymes Hindlll et BamHI (Roche), à raison de 10 unités d'enzymes pour 1 g d'ADN (incubation 2 heures à 37 C). Les ADN sont 2. Construction of the soil-extracted DNA library in the vector pBluescript SK-
The pBluescript SK- vector and the DNA extracted from the soil are each digested with the enzymes HindIII and BamHI (Roche), at the rate of 10 units of enzymes per 1 g of DNA (incubation for 2 hours at 37 ° C.). The DNAs are

ensuite ligués par action de la T4 DNA ligase (Roche), une nuit à 15 C, à raison d'une unité d'enzyme pour 300 ng d'ADN (environ 200 ng d'ADN insert et 100 ng de vecteur digéré). Les cellules d'Escherichia coli électrocompétentes, ElectroMAX DH10B TM (Gibco BRL) sont transformées par le mélange de ligation (2 NI) par électroporation (25 F, 200 et 500 #, 2,5 kV) (Biorad Gene Pulser). then ligated by the action of T4 DNA ligase (Roche) overnight at 15 C, one unit of enzyme per 300 ng of DNA (about 200 ng of DNA insert and 100 ng of digested vector). The electrocompetent Escherichia coli cells, ElectroMAX DH10B TM (Gibco BRL) are transformed by the ligation mixture (2 NI) by electroporation (25 F, 200 and 500 #, 2.5 kV) (Biorad Gene Pulser).

Après une heure d'incubation dans le milieu LB, les cellules transformées sont diluées de façon à obtenir environ 100 colonies par boîte puis sont étalées sur milieu LB (10 g/l Tryptone, 5 g/l extrait de levure, 5 g/ NaCI) additionné d'Ampiciline (100 mg/l), de y-HCH (500 mg/l), de X-gal 60 mg/l (5-bromo-4-chloro-3-indolyl-a-D-galactoside), et d'IPTG 40 mg/l (isopropylthio-ss-D-galactoside), et incubées une nuit à 37 C. Le y- hexachlorocyclohexane (Merck-Schuchardt) étant insoluble dans l'eau, une solution à 50 g/l est préparée dans du DMSO (dimethyl sulfoxyde) (Sigma). After one hour of incubation in LB medium, the transformed cells are diluted to obtain about 100 colonies per dish and are then plated on LB medium (10 g / l Tryptone, 5 g / l yeast extract, 5 g / NaCl ) supplemented with Ampiciline (100 mg / l), γ-HCH (500 mg / l), X-gal 60 mg / l (5-bromo-4-chloro-3-indolyl-α-D-galactoside), and of IPTG 40 mg / l (isopropylthio-s-D-galactoside), and incubated overnight at 37 C. The y-hexachlorocyclohexane (Merck-Schuchardt) being insoluble in water, a solution of 50 g / l is prepared in DMSO (dimethyl sulfoxide) (Sigma).

Une banque de 10 000 clones a ainsi été obtenue. A library of 10,000 clones has thus been obtained.

3.Clonage et expression du gène linA
Le criblage de la banque s'effectue par visualisation d'un halo de dégradation du lindane autour de la colonie (le lindane précipitant dans les milieux de culture). Sur 10 000 clones criblés, 35 présentaient ainsi une activité de dégradation du lindane. La présence du gène linA chez ces clones a pu être confirmée par PCR grâce à des amorces spécifiques, décrites par Thomas et al (1996). Des digestions réalisées sur les inserts ainsi que sur les produits d'amplification ont montré des profils identiques entre tous les clones criblés et le témoin de référence, R. lindaniclasticus. Les clones portant le gène linA présentaient également un insert de même taille (environ 4 kb). 3. Cloning and expression of the linA gene
The library is screened by visualizing a halo of degradation of lindane around the colony (lindane precipitating in the culture media). Of 10,000 clones screened, thus exhibited lindane degrading activity. The presence of the linA gene in these clones could be confirmed by PCR using specific primers described by Thomas et al (1996). Digestions performed on the inserts as well as on the amplification products showed identical profiles between all the clones screened and the reference control, R. lindaniclasticus. Clones carrying the linA gene also had an insert of the same size (about 4 kb).

Il ainsi pu être démontré que l'ADN du sol pouvait être cloné et exprimé chez un hôte hétérologue : E. coli, et que des gènes issus d'une microflore difficilement cultivable pouvaient être exprimés. Des banques réalisées à partir de digestion partielle d'ADN extrait du sol par des enzymes de restriction telles que Sau3AI sont donc aussi envisageables. It could thus be demonstrated that the soil DNA could be cloned and expressed in a heterologous host: E. coli, and that genes from a poorly cultivable microflora could be expressed. Banks made from partial digestion of DNA extracted from the soil by restriction enzymes such as Sau3AI are also possible.

EXEMPLE 3: Procédé de préparation d'une collection d'acides nucléiques à partir d'un échantillon de sol, comprenant une étape d'extraction indirecte de l'ADN. EXAMPLE 3: Process for preparing a nucleic acid collection from a soil sample, comprising a step of indirect extraction of the DNA.

1. MATERIEL ET METHODES. 1. MATERIAL AND METHODS.

1. 1 Extraction de la fraction bactérienne du sol. 1. 1 Extraction of the bacterial fraction of the soil.

5g de sol sont dispersés dans 50 ml de NaCI 0. 8% stérile, par broyage au Waring Blender pendant 3 x 1 minute, avec refroidissement dans la glace entre chaque broyage. les cellules bactériennes sont alors séparées des particules du sol par centrifugation sur un coussin de densité de Nycodenz (Nycomed Pharma AS, Oslo, Norvège). Dans un tube à centrifugation, 11,6 ml d'une solution de Nycodenz de densité de 1. 3 g.ml-1 (8g de Nycodenz suspendu dans 10 ml d'eau stérile) sont placés en dessous de 25 ml de la suspension de sol précédemment obtenue. Après centrifugation à 10. 000 g dans un rotor à godets mobiles (rotor TST 28. 38, Kontron) pendant 40 minutes à 4 C, l'anneau cellulaire, se situant à l'interphase de la phase aqueuse et de la phase Nycodenz, est prélevé, lavé dans 25 ml d'eau stérile et centrifugé à 10. 000 g pendant 20 minutes. Le culot cellulaire est ensuite repris dans une solution Tris 10 mM ; EDTA 100 mMn pH 8.0. 5 g of sol are dispersed in 50 ml of sterile 0. 8% NaCl, by grinding with Waring Blender for 3 x 1 minute, with cooling in ice between each grinding. the bacterial cells are then separated from the soil particles by centrifugation on a Nycodenz density cushion (Nycomed Pharma AS, Oslo, Norway). In a centrifuge tube, 11.6 ml of a solution of Nycodenz with a density of 1. 3 g.ml-1 (8 g of Nycodenz suspended in 10 ml of sterile water) are placed under 25 ml of the suspension. ground previously obtained. After centrifugation at 10,000 g in a rotor with moving cups (TST rotor 28. 38, Kontron) for 40 minutes at 4 ° C., the cell ring lying at the interphase of the aqueous phase and of the Nycodenz phase, is removed, washed in 25 ml of sterile water and centrifuged at 10,000 g for 20 minutes. The cell pellet is then taken up in a 10 mM Tris solution; EDTA 100 mMn pH 8.0.

Préalablement à la dispersion du sol au Waring Blender, une étape d'enrichissement du sol dans une solution d'extrait de levure peut être incluse afin de permettre notamment la germination des spores bactériennes du sol. 5 g de sol sont alors incubés dans 50 ml d'une solution stérile de NaCI 0. 8% - extrait de levure 6%, pendant 30 minutes à Prior to soil dispersion with Waring Blender, a soil enrichment step in a yeast extract solution can be included to allow germination of soil bacterial spores. 5 g of sol are then incubated in 50 ml of a sterile solution of NaCl 0.8% - yeast extract 6%, for 30 minutes at

40 C. L'extrait de levure est éliminé par centrifugation à 5000 rpm pendant 10 minutes afin d'éviter la formation de mousse durant le broyage. C. Yeast extract is removed by centrifugation at 5000 rpm for 10 minutes to prevent foaming during grinding.

1. 2 Lyse des cellules bactériennes du sol. 1. 2 Lyse bacterial cells of the soil.

- Lyse des cellules en milieu liquide et purification sur gradient de chlorure de césium. Lysis of the cells in a liquid medium and purification on a gradient of cesium chloride.

Les cellules sont lysées dans une solution Tris 10 mM, EDTA 100 mM, pH 8. 0 contenant 5 mg/ml-1 de lysozyme et 0.5 mg/ml-1 d'achromopeptidase pendant 1 heure à 37 C . Une solution de lauryl sarcosyl (1% final) et de protéinase K (2 mg/ml-1) est ensuite ajoutée et incubée à 37 C pendant 30 minutes. La solution d'ADN est alors purifiée sur un gradient de densité de chlorure de césium par centrifugation à 35 000 rpm pendant 36 heures sur un rotor Kontron 65. 13. Le gradient de chlorure de césium employé est un gradient à 1g/ml de CsCI, possédant un indice de réfraction de 1,3860 (Sambrook et al., 1989). The cells are lysed in a 10 mM Tris solution, 100 mM EDTA, pH 8.0 containing 5 mg / ml-1 of lysozyme and 0.5 mg / ml-1 of achromopeptidase for 1 hour at 37 ° C. A solution of lauryl sarcosyl (1% final) and Proteinase K (2 mg / ml-1) is then added and incubated at 37 ° C. for 30 minutes. The DNA solution is then purified on a cesium chloride density gradient by centrifugation at 35,000 rpm for 36 hours on a Kontron 65 rotor. 13. The cesium chloride gradient employed is a 1 g / ml gradient of CsCl having a refractive index of 1.3860 (Sambrook et al., 1989).

- Lyse des cellules après inclusion dans un bloc d'agarose. Lysis of the cells after inclusion in an agarose block.

Les cellules sont mélangées à un volume égal d'agarose à 1.5% (poids/volume) Seaplaque (Agarose Seaplaque FMC Products. TEBU, Le Perray en Yvelines, France). à bas point de fusion et coulées dans un bloc de 100 l. Les blocs sont ensuite incubés dans une solution de lyse : EDTA 250 mM, saccharose 10.3%, lysozyme 5 mg/ml-1 et achromopeptidase 0. 5 mg/ml-1 à 37 C pendant 3 heures. Les blocs sont alors lavés dans une solution de Tris 10 mM - EDTA 500 mM et incubés une nuit à 37 C dans de l'EDTA 500 mM contenant 1 mg.ml-1 de protéinase K et du lauryl sarcosyl 1%. Après plusieurs lavages dans du Tris-EDTA, les blocs sont conservés dans de l'EDTA 500 mM. The cells are mixed with an equal volume of 1.5% (weight / volume) agarose Seaplaque (Agarose Seaplaque FMC Products, TEBU, Perray en Yvelines, France). low melting point and cast in a 100 l block. The blocks are then incubated in a lysis solution: 250 mM EDTA, 10.3% sucrose, 5 mg / ml-1 lysozyme and 0.5 mg / ml-1 achromopeptidase at 37 ° C. for 3 hours. The blocks are then washed in a solution of 10 mM Tris-500 mM EDTA and incubated overnight at 37 ° C. in 500 mM EDTA containing 1 mg.ml-1 of proteinase K and 1% lauryl sarcosyl. After several washes in Tris-EDTA, the blocks are stored in 500 mM EDTA.

La qualité des ADN ainsi extraits est contrôlée par électrophorèse en champs pulsés. The quality of the DNAs thus extracted is controlled by pulsed field electrophoresis.

La quantité d'ADN extrait a été évaluée sur gel d'électrophorèse par rapport à une gamme étalon d'ADN de thymus de veau. The amount of extracted DNA was evaluated on electrophoresis gel against a standard range of calf thymus DNA.

1. 3 Caractérisation moléculaire de l'ADN extrait du sol. 1. 3 Molecular characterization of the DNA extracted from the soil.

Les ADN extraits du sol sont caractérisés par hybridation PCR, méthode qui consiste à amplifier dans un premier temps les ADNs à l'aide d'amorces situées sur des régions universellement conservées du gène de l'ARNr16S, puis à hybrider les ADNs amplifiés avec différentes sondes oligonucléotidiques de spécificité connue (tableau 4), dans le but de quantifier l'intensité du signal d'hybridation par rapport à une gamme étalon externe d'ADN génomique. The DNAs extracted from the soil are characterized by PCR hybridization, which method consists in firstly amplifying the DNAs using primers located on universally conserved regions of the 16S rRNA gene, and then hybridizing the amplified DNAs with different Oligonucleotide probes of known specificity (Table 4), in order to quantify the intensity of the hybridization signal relative to an external standard range of genomic DNA.

Les ADN extraits du sol ainsi que les ADN génomiques extraits de cultures pures sont amplifiés avec les amorces FGPS 612-669 (tableau 1) dans les conditions standard d'amplification par PCR. Les produits d'amplification sont ensuite dénaturés par un volume égal de NaOH 1 N, déposés sur une membrane de Nylon (GeneScreen Plus, Life Science Products) et hybridés avec une sonde oligonucléotidique marquée à son extrémité par du g32P ATP par action de la T4 polynucléotide kinase. Après préhybridation de la membrane dans une solution de 20 ml contenant 6 ml de SSC 20X, 1 ml de solution de Denhardt, 1 ml de SDS 10% et 5 mg d'ADN hétérologue de sperme de saumon, les hybridations sont conduites durant une nuit à la température définie par la sonde. Les membranes sont lavées deux fois dans du SSC 2X pendant 5 minutes à température ambiante, puis une fois dans du SSC 2X SDS 0,1% et une seconde fois dans du SSC 1X, SDS 0,1% pendant 30 minutes à la température d'hybridation. Les signaux d'hybridation sont quantifiés à l'aide du logiciel Molecular Analyst (Biorad, Ivry sur Seine, France) et les quantités d'ADN sont estimées par interpolation des courbes étalons obtenues à partir des ADN génomiques. The soil-extracted DNAs as well as genomic DNAs extracted from pure cultures are amplified with FGPS primers 612-669 (Table 1) under standard PCR amplification conditions. The amplification products are then denatured with an equal volume of 1N NaOH, deposited on a nylon membrane (GeneScreen Plus, Life Science Products) and hybridized with an oligonucleotide probe end-labeled with g32P ATP by action of T4. polynucleotide kinase. After prehybridization of the membrane in a 20 ml solution containing 6 ml of 20X SSC, 1 ml of Denhardt's solution, 1 ml of 10% SDS and 5 mg of heterologous salmon sperm DNA, the hybridizations are conducted overnight. at the temperature defined by the probe. The membranes are washed twice in 2X SSC for 5 minutes at room temperature, then once in 2% SDS SDS and a second time in 1X SSC, 0.1% SDS for 30 minutes at room temperature. 'hybridization. The hybridization signals are quantified using the Molecular Analyst software (Biorad, Ivry sur Seine, France) and the quantities of DNA are estimated by interpolation of the standard curves obtained from the genomic DNAs.

2. RESULTATS ET DISCUSSION 2. 1 Extraction et lyse de la fraction bactérienne du sol. 2. RESULTS AND DISCUSSION 2. 1 Extraction and lysis of the bacterial fraction of the soil.

La séparation des cellules microbiennes des particules du sol, préalablement à l'extraction de l'ADN, est une alternative présentant de nombreux avantages par rapport aux méthodes d'extraction directe de l'ADN dans le sol. En effet, l'extraction de la fraction microbienne limite la contamination de l'extrait d'ADN par de l'ADN extracellulaire présent The separation of microbial cells from soil particles, prior to DNA extraction, is an alternative with many advantages over direct DNA extraction methods in the soil. Indeed, the extraction of the microbial fraction limits the contamination of the DNA extract by extracellular DNA present

librement dans le sol ou par de l'ADN d'origine eucaryote. Mais surtout, l'ADN extrait de la fraction microbienne du sol présente des fragments de plus longue taille et une meilleure intégrité que l'ADN extrait par lyse directe JACOBSON et RASMUSSEN (1992). De plus, la séparation des particules de sol permet d'éviter une contamination de l'extrait d'ADN par des composés humiques et phénoliques, composés pouvant, par la suite, nuire gravement aux efficacités de clonage. freely in the soil or by DNA of eukaryotic origin. Most importantly, the DNA extracted from the microbial fraction of the soil has longer fragments and better integrity than directly extracted DNA JACOBSON and RASMUSSEN (1992). In addition, the separation of the soil particles makes it possible to avoid contamination of the DNA extract by humic and phenolic compounds, compounds which can subsequently seriously affect the cloning efficiencies.

Une des étapes déterminantes pour l'extraction des cellules du sol est la dispersion de l'échantillon de sol afin de dissocier les cellules adhérant à la surface ou à l'intérieur des agrégats de particules de sol. One of the decisive steps for the extraction of soil cells is the dispersion of the soil sample in order to dissociate the cells adhering to the surface or inside the aggregates of soil particles.

Trois cycles de broyage successifs d'une minute chacun permettent d'obtenir une meilleure efficacité d'extraction des cellules ainsi qu'une plus grande quantité d'ADN récupéré, par rapport à un unique cycle de broyage d'une minute 30. Three successive milling cycles of one minute each make it possible to obtain a better extraction efficiency of the cells as well as a larger quantity of recovered DNA, compared with a single milling cycle of one minute 30.

Le tableau 5 rapporte les efficacités d'extraction obtenues après centrifugation sur gradient de Nycodenz , sur la microflore totale viable (dénombrée par microscopie après coloration à l'acridine orange), sur la microflore totale cultivable (dénombrée sur milieu solide Trypticase-Soja 10%), et sur la microflore d'actinomycètes cultivables sur milieu HV agar (après incubation à 40 C dans une solution d'extrait de levure 6%-SDS 0,05% afin de provoquer la germination des sprores). D'autre part, l'ADN extrait a été quantifié soit après une lyse des cellules en milieu liquide (sans purification sur gradient de chlorure de césium) soit après une lyse des cellules incluses dans un bloc d'agarose (après digestion de l'agarose par une b-agarase). Table 5 reports the extraction efficiencies obtained after centrifugation on a Nycodenz gradient, on the total viable microflora (counted by microscopy after staining with acridine orange), on the total cultivable microflora (counted on Trypticase-Soy solid medium 10% ), and on the microflora of actinomycetes cultivable on HV agar medium (after incubation at 40 C in a solution of yeast extract 6% -SDS 0.05% in order to cause sprouting sprouts). On the other hand, the extracted DNA was quantified either after lysis of the cells in a liquid medium (without purification on a cesium chloride gradient) or after lysis of the cells included in an agarose block (after digestion of the agarose with a b-agarase).

Les résultats montrent que plus de 14% de la microflore tellurique totale est récupéré par cette méthode (soit 2 108 cellules par gramme de sol), et que la microflore totale cultivable ne représente qu'à peine 2% de la population microbienne totale. The results show that more than 14% of the total soil microflora is recovered by this method (ie 2 108 cells per gram of soil), and that the total cultivable microflora represents only 2% of the total microbial population.

D'autre part, la quantité d'ADN extrait des cellules est de 330 ng par gramme de sol sec. En estimant le contenu d'ADN par cellule microbienne du sol entre 1. 6 et 2. 4 fg, et compte tenu de la quantité de cellules extraites (2 108 cellules par gramme de sol), on peut estimer que la quasi-totalité des cellules ont été lysées et qu'ainsi la lyse n'apporte pas d'important biais à cette approche. On the other hand, the amount of DNA extracted from the cells is 330 ng per gram of dry soil. By estimating the DNA content per microbial cell of the soil between 1. 6 and 2. 4 fg, and taking into account the quantity of cells extracted (2 108 cells per gram of soil), it can be estimated that almost all cells have been lysed and thus the lysis does not bring significant bias to this approach.

Les électrophorèses en champs pulsés ont montré que l'ADN du sol extrait après gradient de Nycodenz et de CsCI pouvait atteindre une taille de 150 kb et que la lyse en bloc d'agarose permettait d'extraire des fragments supérieurs à 600kb. Pulsed-field electrophoresis showed that the soil DNA extracted after the Nycodenz and CsCl gradient could reach a size of 150 kb and that the agarose block lysis allowed fragments larger than 600 kb to be extracted.

Ces résultats confirment l'intérêt de cette approche indépendante de la culture pour la construction de banques d'ADN de l'environnement, en se présentant comme une alternative aux méthodes directes d'extraction d'ADN. These results confirm the interest of this culture-independent approach for the construction of environmental DNA libraries as an alternative to direct DNA extraction methods.

2. 2 Caractérisation moléculaire de l'ADN extrait du sol. 2. 2 Molecular characterization of the DNA extracted from the soil.

Le but de la caractérisation moléculaire de l'ADN extrait du sol est d'obtenir des profils représentant les proportions des différents taxons bactériens présents dans l'extrait d'ADN. Il s'agissait également de connaître les biais d'extraction induits par la séparation préalable de la réaction cellulaire du sol, en comparaison avec une méthode d'extraction directe faute de visualisation directe de la diversité microbienne présente dans les sols. En effet, peu d'informations ont été rassemblées sur l'extraction des cellules sur gradient de Nycodenz en fonction de leur structure morphologique (diamètre des cellules, formes filamenteuses ou sporulées). The purpose of the molecular characterization of the DNA extracted from the soil is to obtain profiles representing the proportions of the different bacterial taxons present in the DNA extract. It was also a question of knowing the extraction biases induced by the preliminary separation of the cellular reaction of the soil, in comparison with a direct extraction method for lack of direct visualization of the microbial diversity present in the soils. Indeed, little information has been collected on the extraction of Nycodenz gradient cells according to their morphological structure (cell diameter, filamentous or sporulated forms).

Les méthodes jusqu'ici en place étaient basées sur des: - hybridations quantitatives utilisant des sondes oligonucléotidiques spécifiques à différents groupes bactériens, appliqués directement d'ADN extrait de l'environnement Malheureusement, cette approche n'est pas très sensible et ne permet pas de détecter des genres ou des groupes taxonomiques présents en faible abondance AMANN (1995). The methods hitherto in place were based on: - quantitative hybridizations using oligonucleotide probes specific to different bacterial groups, applied directly to DNA extracted from the environment Unfortunately, this approach is not very sensitive and does not allow detect genera or taxonomic groups present in low abundance AMANN (1995).

- PCR quantitatives telles que la MPN-PCR (Most Probable Number) SYKES et al. (1992) ou la PCR quantitative par compétition DIVIACCO et al. (1993). Les inconvénients respectifs de chacune de ces approches sont (i) la lourdeur d'utilisation du fait de la multiplication des dilutions et des répétitions qui rend la technique inappropriée pour un grand nombre d'échantillons ou de couples d'amorces, et (ii) la nécessité de construire un compétiteur spécifique à l'ADN cible et n'induisant pas de biais dans la compétition. - Quantitative PCRs such as the MPN-PCR (Most Probable Number) SYKES et al. (1992) or quantitative PCR by competition DIVIACCO et al. (1993). The respective drawbacks of each of these approaches are (i) the heaviness of use due to the multiplication of dilutions and repetitions which makes the technique inappropriate for a large number of samples or primer pairs, and (ii) the need to build a competitor specific to the target DNA and not inducing bias in the competition.

La méthode mise en place selon la présente invention consiste à amplifier universellement un fragment de 700 pb à l'intérieur de la séquence d'ADNr 16S, à hybrider cet amplifiat avec une sonde oligonucléotidique de spécificité variable (au niveau du règne, de l'ordre, de la sous classe ou du genre) et à comparer l'intensité d'hybridation de l'échantillon par rapport à une gamme étalon externe. L'amplification préalable à l'hybridation permet de quantifier des genres ou des espèces de micro-organismes peu abondants. De plus, l'amplification par des amorces universelles permet, lors de l'hybridation, d'utiliser une large série de sondes oligonucléotidiques. Elle permet de comparer entre eux différents modes de lyse (extraction directe ou indirecte) sur des groupes taxonomiques bien définis. The method implemented according to the present invention consists in universally amplifying a fragment of 700 bp inside the 16S rDNA sequence, to hybridize this amplifiat with an oligonucleotide probe of variable specificity (at the level of the kingdom, of the order, subclass or genus) and to compare the intensity of hybridization of the sample with respect to an external standard range. The amplification prior to hybridization makes it possible to quantify genera or species of scant microorganisms. In addition, the amplification by universal primers makes it possible, during hybridization, to use a large series of oligonucleotide probes. It makes it possible to compare different modes of lysis (direct or indirect extraction) with well-defined taxonomic groups.

Les résultats sont rassemblés dans le tableau 6. The results are summarized in Table 6.

Ils montrent des profils similaires entre les deux méthodes d'extraction (directe et indirecte). Ainsi, il apparaît que l'extraction préalable de la fraction microbienne tellurique n'introduit pas de réels biais parmi les taxons testés. La seule différence significative entre les deux approches d'extraction semblerait être la plus grande abondance de séquences d'ADNr appartenant aux y protéobactéries dans l'extrait par la méthode d'extraction indirecte. They show similar profiles between the two extraction methods (direct and indirect). Thus, it appears that the prior extraction of the telluric microbial fraction does not introduce any real bias among the taxa tested. The only significant difference between the two extraction approaches would seem to be the greater abundance of rDNA sequences belonging to the proteobacteria in the extract by the indirect extraction method.

De plus, un effet significatif de l'incubation de l'échantillon de sol dans une solution d'extrait de levure est observé sur les populations sporulées du sol (Gram+, bas pourcentage de GC et Actinomycètes). In addition, a significant effect of incubation of the soil sample in a yeast extract solution was observed on sporulated soil populations (Gram +, low percent GC and Actinomycetes).

Cette étape provoque la germination des spores, et permet d'une part certainement une meilleure récupération de ce type de cellules et d'autre part une plus grande efficacité de la lyse sur des cellules en germination. This step causes germination of the spores, and on the one hand certainly allows a better recovery of this type of cells and, on the other hand, a greater efficiency of the lysis on germinating cells.

Cette approche permet une analyse semi-quantitative, ciblée sur les principaux taxons définis à partir de micro-organismes cultivés et habituellement retrouvés dans les sols. Seuls des outils moléculaires permettent d'estimer l'importance des différents taxons, les méthodes de mise en culture étant trop restrictives et dépendantes de la spécificité du milieu utilisé. This approach allows a semi-quantitative analysis, targeted at the main taxa defined from cultivated micro-organisms and usually found in soils. Only molecular tools make it possible to estimate the importance of the different taxa, the methods of cultivation being too restrictive and dependent on the specificity of the medium used.

Les résultats montrent qu'une grande part de la population microbienne n'est pas représentée dans les groupes phylogénétiques décrits, mettant ainsi en évidence l'existence de nouveaux groupes The results show that a large part of the microbial population is not represented in the phylogenetic groups described, thus highlighting the existence of new groups

composés de micro-organismes non cultivés jusqu'à présent, ou non cultivables. composed of micro-organisms not cultivated until now, or non-cultivable.

Ainsi, de nouvelles sondes peuvent être définies à partir de séquences déterminées à partir d'ADN extrait du sol (nouveaux phylums composés de micro-organismes non cultivés, LUDWIG et al. (1997) afin d'obtenir une image plus exacte de la composition de l'extrait d'ADN. Thus, new probes can be defined from sequences determined from DNA extracted from the soil (new phyla composed of non-cultured microorganisms, LUDWIG et al., (1997) to obtain a more accurate image of the composition of the DNA extract.

Exemple 4 : - CONSTRUCTION DU COSMIDE POS 7001 Caractéristiques de POS 7001:
Réplicatif chez E. coli
Intégratif chez Streptomyces
Sélectionnable chez E. coli AmpR, HygroR et Streptomyces HygroR
Les propriétés du cosmide permettent d'insérer de grands fragments d'ADN entre 30 et 40kb. Example 4: - CONSTRUCTION OF COSMIDE POS 7001 Features of POS 7001:
Replicative in E. coli
Integrative with Streptomyces
Selectable from E. coli AmpR, HygroR and Streptomyces HygroR
The properties of the cosmid allow to insert large DNA fragments between 30 and 40kb.

Il comprend 1 - Le promoteur inductible tipA de Streptomyces lividans 2 - Le système d'intégration spécifique de l'élément pSAM2 3 - Le gène de résistance à l'hygromycine 4- le cosmide pWED1, dérivé de pWED15 1) - Le promoteur inductible du gène tip A de S. lividans
Le gène tipA code une protéine de 19 KD dont la transcription est induite par l'antibiotique thiostrepton ou nosiheptide. Le promoteur de tipA est bien régulé: induction en phase exponentielle et en phase stationnaire (200X) Murakami T, Holt TG, Thompson CJ. J. Bacteriol 1989 ; 171 : 1459-66 2) - Le gène de résistance à l'hygromycine - Hygromycine : produit par S. hygroscopicus - Le gène de résistance code une phosphotransfèrase (hph) - Le gène utilisé provient d'une cassette construite par Blondelet et al dans laquelle le gène hyg est sous contrôle de son propre promoteur It comprises 1 - The inducible promoter tipA of Streptomyces lividans 2 - The specific integration system of the element pSAM2 3 - The hygromycin resistance gene 4- the cosmid pWED1, derived from pWED15 1) - The inducible promoter of the gene tip A from S. lividans
The tipA gene encodes a 19 KD protein whose transcription is induced by the antibiotic thiostrepton or nosiheptide. The tipA promoter is well regulated: exponential and stationary phase induction (200X) Murakami T, Holt TG, Thompson CJ. J. Bacteriol 1989; 171: 1459-66 2) - The hygromycin resistance gene - Hygromycin: produced by S. hygroscopicus - The resistance gene codes a phosphotransferase (hph) - The gene used comes from a cassette built by Blondelet et al in which the hyg gene is under the control of its own promoter

et du promoteur plac inductible par l'IPTG Blondelet-Rouault et al ; . and IPTG-inducible promoter Blondelet-Rouault et al; .

Gene 1997 ;190 :315-7 3) - Le système d'intégration site-spécifique
L'élément pSAM2 s'intègre dans le chromosome par un mécanisme d'intégration site-spécifique. La recombinaison a lieu entre deux séquences identiques de 58 pb présentes sur le plasmide (attp) et sur le chromosome (attB). Gene 1997; 190: 315-7 3) - The site-specific integration system
The pSAM2 element integrates into the chromosome through a site-specific integration mechanism. Recombination takes place between two identical 58 bp sequences present on the plasmid (attp) and on the chromosome (attB).

Le gène int, situé à proximité du site attP, est impliqué dans l'intégration site-spécifique de pSAM2, et son produit présente des similitudes avec les intégrases des bactériophages tempérés d'entérobactéries. Il a été démontré qu'un fragment de pSAM2 ne contenant que le site d'attachement attP ainsi que le gène int était capable de s'intégrer de la même manière que l'élément entier. Voir brevet français n 88 06638 du 18/05/1988, ainsi que Raynal A et al. Mol Microbiol 1998 28 :333-42). The int gene, located near the attP site, is involved in the site-specific integration of pSAM2, and its product has similarities to the integrases of bacteriophages tempered with Enterobacteriaceae. It has been shown that a fragment of pSAM2 containing only the attP attachment site as well as the int gene was able to integrate in the same way as the whole element. See French Patent No. 88 06638 of May 18, 1988, and Raynal A et al. Mol Microbiol 1998 28: 333-42).

4) - Construction du cosmide pOS7001 Etape 1/ Le promoteur TipA a été isolé du plasmide pPM927 (Smokvina et al. Gene 1990 ; 94 :53-9 ) sur un fragment Hindlll-BamHl de 700 paires de bases et cloné dans le vecteur pUC18 (Yannish-Perron et al., 1985) digéré par Hindlll/BamHI Etape 2/ Ce fragment Hindlll-BamHl a ultérieurement été transféré de pUC18 à pUC19 (Yannish-Perron et al., 1985). 4) Construction of Cosmos pOS7001 Step 1 The TipA promoter was isolated from plasmid pPM927 (Smokvina et al., Gene 1990; 94: 53-9) on a 700 base pair HindIII-BamHI fragment and cloned into the pUC18 vector (Yannish-Perron et al., 1985) HindIII / BamHI digested Step 2 / This HindIII-BamHI fragment was subsequently transferred from pUC18 to pUC19 (Yannish-Perron et al., 1985).

Etape 3/ Un insert BamHI-BamHI de 1500 paires de bases portant le gène int et le site attP de pSAM2 a été isolé du plasmide pOSint1, représenté à la Figure 8, (Raynal A et al. Mol Microbiol 1998 28 :333-42) et cloné au site BamHl du vecteur précédent (pUC19/TipA), dans l'orientation permettant de mettre le gène int sous contrôle du promoteur TipA. Step 3: A 1500 base pair BamHI-BamHI insert carrying the int gene and the attP site of pSAM2 was isolated from plasmid pOSint1, shown in Figure 8 (Raynal A et al., Mol Microbiol 1998 28: 333-42 and cloned at the BamHI site of the previous vector (pUC19 / TipA), in the orientation to bring the int gene under control of the TipA promoter.

Etape 4/ Le site BamHl situé en 5' du gène int a été supprimé par digestion partielle BamHl puis traitement par l'enzyme Klenow. Un Step 4 / The BamHl site located 5 'of the int gene was deleted by BamHl partial digestion and treatment with Klenow enzyme. A

fragment Hindlll-BamHl portant TipA-int-attP a ainsi été isolé de pUC19 et transféré dans pBR322 Hindlll/BamHI. HindIII-BamHI fragment carrying TipA-int-attP was thus isolated from pUC19 and transferred into pBR322 HindIII / BamHI.

Etape 5/ La cassette Hygromycine isolée de pHP45Qhyg (BlondeletRouault et al., 1997) sur un fragment Hindlll-Hindlll a été clonée au site Hindlll situé en amont du promoteur TipA. Step 5 / Hygromycin cassette isolated from pHP45Qhyg (BlondeletRouault et al., 1997) on a HindIII-HindIII fragment was cloned at the HindIII site located upstream of the TipA promoter.

Etape 6/ Le site Hindlll situé entre la cassette QHyg et le promoteur TipA a été supprimé par traitement Klenow après digestion partielle Hindlll. Step 6 / The HindIII site located between the QHyg cassette and the TipA promoter was removed by Klenow treatment after HindIII partial digestion.

Etape7/ Le plasmide obtenu à l'issue de l'étape précédente permet d'isoler un fragment unique HindlII-BamHI, portant tous les éléments QHyg/TipA/int attP, qui a été cloné après traitement Klenow au site EcoRV du cosmide pWED1. Le cosmide pWED1, représenté à la Figure 9, dérive du cosmide pWE15, représenté à la Figure 10 (Wahl GM, et al.. Proc Natl Acad Sci U S A 1987 84 :2160-4) pardélétion d'un fragment Hpal-Hpal portant le gène Neomycine et l'origine SV40. Step 7 / The plasmid obtained at the end of the preceding step makes it possible to isolate a single HindIII-BamHI fragment bearing all the QHyg / TipA / int attP elements, which was cloned after Klenow treatment at the EcoRV site of the cosmid pWED1. The cosmid pWED1, represented in FIG. 9, derives from the cosmid pWE15, represented in FIG. 10 (Wahl GM, et al., Proc Natl Acad Sci USA 1987 84: 2160-4) by the deletion of a Hpal-Hpal fragment bearing the Neomycin gene and SV40 origin.

Une carte du vecteur pOS 7001 est représentée à la Figure 11. A map of the vector pOS 7001 is shown in Figure 11.

Exemple 5 : Construction du cosmide conjugatif et intégratif chez Streptomyces, le vecteur pOSV 303. Example 5 Construction of Conjugative and Integrative Cosmid in Streptomyces, Vector pOSV 303

Etant donné que l'empaquetage sélectionne les clones ayant une taille supérieure à 30kb, seuls 10 à 15% des clones ne contiennent pas d'insert, il n'est donc pas vraiment nécessaire d'avoir un système de sélection des recombinants, ce qui permet de construire un vecteur plus petit. Since the packaging selects clones larger than 30kb, only 10-15% of the clones do not contain an insert, so there is no real need for a recombinant selection system, which allows to build a smaller vector.

Construction : Etape 1 : le vecteur pOSV001
Clonage d'un fragment Pstl-Pstl de 800 paires de bases portant l'origine de transfert OriT du réplicon RK2 (Guiney et al., 1983), dans le plasmide pUC19 ouvert par Pstl. Cette étape de clonage permet d'obtenir un vecteur transférable de E. coli à Streptomyces par conjugaison. Construction: Step 1: the vector pOSV001
Cloning of a 800 base pair PstI-PstI fragment carrying the OriT transfer origin of the RK2 replicon (Guiney et al., 1983), in the plasmid pUC19 opened by PstI. This cloning step makes it possible to obtain a transferable vector of E. coli to Streptomyces by conjugation.

La carte du vecteur pOSV 001 est représentée à la Figure 17. The map of the vector pOSV 001 is shown in Figure 17.

Etape 2 : le vecteur pOSV002
Insertion du marqueur Hygromycine (cassette hyg), et sélectionnable chez Streptomyces, de sorte que le gène conférant la résistance à l'hygromycine soit transféré en dernier ce qui permet de s'assurer du transfert complet du BAC avec l'insert d'ADN du sol. Step 2: the vector pOSV002
Insertion of Hygromycin marker (hyg hygienic), and selectable in Streptomyces, so that the gene conferring resistance to hygromycin is transferred last which makes it possible to ensure complete transfer of the BAC with the DNA insert of the ground.

Clonage de la cassette Hygromycine isolée de pHP45Qhyg sur un fragment Hindlll-Hindlll portant le gène de résistance à l'Hygromycine.. Ce fragment est cloné au site Pstl (position 201) du vecteur pOSV001. Ce site Pstl a été choisi, compte tenu du sens du transfert, pour que le marqueur Hygro soit le dernier transféré lors de la conjugaison. Les extrémités Pstl et Hindlll sont rendues compatibles après traitement par le fragment Klenow de l'ADN polymérase permettant de générer des "bouts francs". L'orientation du fragment Qhyg est déterminée en fin de construction. Cloning of the hygromycin cassette isolated from pHP45Qhyg on a HindIII-HindIII fragment carrying the hygromycin resistance gene. This fragment is cloned at the PstI site (position 201) of the vector pOSV001. This Pstl site was chosen, given the direction of the transfer, so that the Hygro marker is the last transferred during the conjugation. The PstI and HindIII ends are made compatible after treatment with the Klenow fragment of the DNA polymerase making it possible to generate "free ends". The orientation of the Qhyg fragment is determined at the end of construction.

La carte du vecteur pOSV002 est représentée à la Figure 18. The map of the vector pOSV002 is shown in Figure 18.

Etape 3 : le vecteur pOSV010
Le fragment Xbal-Hindlll isolé du plasmide pOSV002 et contenant le marqueur de résistance à l'hygromycine et l'origine de transfert est cloné dans le plasmide pOSint1 digéré par Xbal et Hindlll. L'orientaion des sites est telle que le marqueur hygromycine sera toujours transféré en dernier. Step 3: the vector pOSV010
The XbaI-HindIII fragment isolated from the plasmid pOSV002 and containing the hygromycin resistance marker and the transfer origin is cloned into the plasmid pOSint1 digested with XbaI and HindIII. The orientaion of the sites is such that the hygromycin marker will always be transferred last.

Le plasmide pOSint1, représenté à la Figure 8, a été décrit dans l'article de Raynal et al.( Raynal A et al. Mol Microbiol 1998 28 :333-42). Plasmid pOSint1, shown in Figure 8, has been described in the article by Raynal et al (Raynal A et al Mol Microbiol 1998 28: 333-42).

Cette construction permet l'expression de l'intégrase chez E. coli et chez Streptomyces. This construct allows the expression of integrase in E. coli and Streptomyces.

Etape 4 : insertion du site " cos
Le principe est d'insérer un site " cos " dans le plasmide pOSV010 permettant l'empaquetage dans le plasmide pOSV010, représenté à la Figure 12. Step 4: inserting the site "cos
The principle is to insert a "cos" site in the plasmid pOSV010 allowing the packaging in the plasmid pOSV010, represented in FIG. 12.

L'obtention du fragment " cos " est représentée à la Figure 13. Obtaining the "cos" fragment is shown in Figure 13.

Ce fragment est obtenu par PCR. A partir d'un fragment portant les extrémités cohésives (cos) de # (bactériophage lambda ou cosmide pHC79), une amplification par PCR est réalisée à l'aide des This fragment is obtained by PCR. From a fragment bearing the cohesive ends (cos) of # (bacteriophage lambda or cosmid pHC79), a PCR amplification is carried out using the

oligonucléotides correspondant aux séquences -50/+130 par rapport au site cos. Ces oligonucléotides contiennent en outre les sites de clonage Nsil, compatible Pstl, Xhol, compatible Sali, EcoRV, " bout franc ". oligonucleotides corresponding to the -50 / + 130 sequences relative to the cos site. These oligonucleotides additionally contain the Nsil cloning sites, compatible with PstI, XhoI, SalI compatible, EcoRV, "free end".

L'addition des sites rares Swal et Pacl permet d'isoler et/ou de cartographier l'insert cloné. The addition of the rare sites Swal and Pacl makes it possible to isolate and / or map the cloned insert.

Le fragment PCR est borné par un site Pstl à l'extrémité 5' et par un site Hindi à l'extrémité 3', permettant le clonage dans le vecteur pOSV010 (Figure 12) prélablement digéré par les enzymes Nsil et EcoRV, provoquant la délétion du répresseur laclq. The PCR fragment is bounded by a PstI site at the 5 'end and by a Hindi site at the 3' end, allowing cloning into the vector pOSV010 (FIG. 12), which is first digested with the enzymes Nsil and EcoRV, causing the deletion laclq repressor.

La carte du vecteur pOSV303 est représentée sur la Figure 14. The map of the vector pOSV303 is shown in Figure 14.

Le vecteur pOSV303, contient des sites de clonage tels que le site Nsil, compatible Pstl, le site Xhol, compatible Sali ou encore le site EcoRV pour l'obtention de " bouts francs ". The vector pOSV303 contains cloning sites such as the Nsil site, PstI compatible, the XhoI site, Sali compatible or the EcoRV site for obtaining "free ends".

Exemple 6 : - Construction du cosmide réplicatif navette E. coliStreptomyces pOS700R. Example 6: Construction of the replicative cosmid shuttle E. coli Streptomyces pOS700R.

Les fragments du plasmide pEI16 (Volff et al., 1996) représenté à la Figure 15 ont été isolés et traités par Klenow. Ces fragments contiennent les séquences nécessaires à la replication et à la stabilité provenant du plasmide SCP2. Fragments of plasmid pEI16 (Volff et al., 1996) shown in Figure 15 were isolated and treated with Klenow. These fragments contain the sequences necessary for replication and stability from the SCP2 plasmid.

Ces deux fragment sont insérés séparément dans le site EcoRV du cosmide pWED1 conduisant à 2 clones différents. These two fragments are inserted separately into the EcoRV site of cosmid pWED1 leading to 2 different clones.

La cassette Hygromycine isolée de pHP45Qhyg sur un fragment Hindlll-Hindlll a été clonée au site Hindlll des cosmides pWED1 contenant l'insert ScP2 sous forme de fragments Pstl-EcoRI ou Xbal. Elle confère une résistance à l'Hygromycine sélectionnable à la fois chez E. coli et chez Streptomyces. The hygromycin cassette isolated from pHP45Qhyg on a HindIII-HindIII fragment was cloned at the HindIII site of cosmids pWED1 containing the ScP2 insert as PstI-EcoRI or XbaI fragments. It confers resistance to selectable Hygromycin in both E. coli and Streptomyces.

Transformation de S. lividans et détermination de l'efficacité de transformation. Transformation of S. lividans and determination of transformation efficiency.

Il est apparu que le cosmide contenant l'insert Xbal était moins stable que celui contenant le fragment Pstl EcoRI. C'est donc ce dernier qui a été retenu sous le nom de pOS700R. It was found that the cosmid containing the XbaI insert was less stable than that containing the EcoRI PstI fragment. It is therefore the latter which has been retained under the name of pOS700R.

La carte du vecteur pOS 700R est représentée sur la Figure 16. The map of the vector pOS 700R is shown in Figure 16.

Exemple 7 : Efficacité de transformation des vecteurs intégratifs (pOS7001) et réplicatifs Possibilités
Rendre la souche de S. lividans résistante au thiostrepton par intégration du plasmide pT01 portant le marqueur de résistance au thiostrepton
Préparation de protoplastes à partir de S. lividans cultivée en présence de thiostrepton
Avec le vecteur pOS7001, l'efficacité de transformation est d'environ 3000 transformants par g d'ADN. Example 7: Transformative Efficiency of Integrative Vectors (pOS7001) and Replicative Possibilities
Make the S. lividans strain resistant to thiostrepton by integrating the plasmid pT01 carrying the thiostrepton resistance marker
Preparation of protoplasts from S. lividans grown in the presence of thiostrepton
With the vector pOS7001, the transformation efficiency is about 3000 transformants per g of DNA.

Avec le vecteur pOS700R, l'efficacité de transformation est d'environ 30 000 transformants par g d'ADN. With the vector pOS700R, the transformation efficiency is about 30,000 transformants per g of DNA.

Exemple 8: Construction d'un vecteur BAC intégratif chez Streptomyces et conjugatif Caractéristiques: Réplicatif chez E. coli Transférable par conjugaison de E. coli aux Streptomyces Intégratif chez Streptomyces Sélectionnable chez E. coli et Streptomyces Capable d'insérer de grands fragments d'ADN ; il faut souligner qu'il est nécessaire de disposer d'ADN du sol dont la taille est comprise entre 100 et 300kb et non contaminé par des petits fragments. En effet les petits fragments sont très préférentiellement intégrés. Example 8: Construction of an Integrative BAC Vector in Streptomyces and Conjugate Characteristics: Replicative in E.coli Transferable by Conjugation of E. coli to Streptomyces Integrative in Streptomyces Selectable in E. coli and Streptomyces Capable of Inserting Large DNA Fragments ; It must be emphasized that it is necessary to have soil DNAs ranging in size from 100 to 300 kb and not contaminated with small fragments. Indeed the small fragments are very preferentially integrated.

Doté d'un crible permettant de sélectionner les plasmides portant un insert. Ce crible permet en éliminant les vecteurs refermés sur eux même et non digérés de travailler avec un rapport plus élevé entre vecteur et DNA à insérer ce qui permet d'avoir une meilleure efficacité de clonage pour constituer des banques. With a screen for selecting the plasmids carrying an insert. This screen makes it possible by eliminating the closed and undigested vectors to work with a higher ratio between vector and DNA to be inserted, which makes it possible to have a better cloning efficiency for forming banks.

Construction : Construction :

Etape 1 : le vecteur pOSV001
Clonage d'un fragment Pstl-Pstl de 800 paires de bases portant l'origine de transfert OriT du réplicon RK2 (Guiney et al., 1983), dans le plasmide pUC19 ouvert par Pstl. Cette étape de clonage permet d'obtenir un vecteur transférable de E. coli à Streptomyces par conjugaison. Step 1: the vector pOSV001
Cloning of a 800 base pair PstI-PstI fragment carrying the OriT transfer origin of the RK2 replicon (Guiney et al., 1983), in the plasmid pUC19 opened by PstI. This cloning step makes it possible to obtain a transferable vector of E. coli to Streptomyces by conjugation.

Etape 2 : le vecteur pOSV002
Insertion du marqueur Hygromycine (cassette Qhyg), et sélectionnable chez Streptomyces, de sorte que le gène conférant la résistance à l'hygromycine soit transféré en dernier ce qui permet de s'assurer du transfert complet du BAC avec l'insert d'ADN du sol. Step 2: the vector pOSV002
Insertion of Hygromycin marker (Qhyg cassette), and selectable in Streptomyces, so that the gene conferring resistance to hygromycin is transferred last, which ensures the complete transfer of the BAC with the DNA insert of the ground.

Etape 3 : le vecteur pOSV010
Le fragment Xbal-Hindlll isolé du plasmide pOSV002 et contenant le marqueur de résistance à l'hygromycine et l'origine de transfert est cloné dans le plasmide pOSint1 digéré par Xbal et Hindlll. L'orientation des sites est telle que le marqueur hygromycine sera toujours transféré en dernier. Step 3: the vector pOSV010
The XbaI-HindIII fragment isolated from the plasmid pOSV002 and containing the hygromycin resistance marker and the transfer origin is cloned into the plasmid pOSint1 digested with XbaI and HindIII. The orientation of the sites is such that the hygromycin marker will always be transferred last.

Etape 4 : le vecteur pOSV014 Step 4: the vector pOSV014

Addition d'une "cassette" permettant à terme de sélectionner dans la construction finale les plasmides ayant insérés de l'ADN étranger. Addition of a "cassette" allowing ultimately to select in the final construct the plasmids having inserted foreign DNA.

Cette "cassette" porte le gène codant pour le répresseur CI du phage # et le gène conférant la résistance à la tétracycline. Ce gène porte dans sa région 5' non codante la séquence cible du répresseur. L'insertion d'ADN dans le site Hindlll situé dans la séquence codante de CI conduit à la non production du répresseur et donc à l'expression de la résistance à la tétracycline. This "cassette" carries the gene encoding the phage # repressor IC and the tetracycline resistance conferring gene. This gene carries in its 5 'non-coding region the target sequence of the repressor. The insertion of DNA into the HindIII site located in the coding sequence of CI leads to the non-production of the repressor and therefore to the expression of the resistance to tetracycline.

Elle est portée par le plasmide pUN99 décrit dans l'article : Nilsson et al . It is carried by the plasmid pUN99 described in the article: Nilsson et al.

(Nucleic Acids Res 1983, 11:8019-30) Un fragment Pvull-Hindlll isolé de pOSV010 et contenant les séquences Int, attP, Hygro et oriT est cloné au site Mscl de pUN99 . (Nucleic Acids Res 1983, 11: 8019-30) A Pvull-HindIII fragment isolated from pOSV010 and containing the Int, attP, Hygro and oriT sequences is cloned at the Mscl site of pUN99.

La carte du vecteur pOSV014 est représentée sur la Figure 19. The map of the vector pOSV014 is shown in Figure 19.

Etape 5 : le vecteur pOSV 403, vacteur BAC intégratif et conjugatif
Cette dernière étape de clonage dans pBAC11(représenté à la Figure. 20) permet de conférer au plasmide final des caractéristiques de BAC (Bacterial Artificial Chromosome), en particulier l'aptitude à accepter des inserts d'ADN de très grande taille. Step 5: the vector pOSV 403, integrative and conjugative BAC factor
This last step of cloning into pBAC11 (shown in Figure 20) provides the final plasmid with BAC (Bacterial Artificial Chromosome) characteristics, particularly the ability to accept very large DNA inserts.

Le fragment Pstl-Pstl du vecteur pOSV014 portant l'ensemble des éléments et fonctions décrits précédemment est cloné dans le pasmide pBAC11 (pBeloBAC11) digéré par Notl. Les extrémités sont rendues compatibles pat traitement avec l'enzyme de Klenow. The PstI-PstI fragment of the vector pOSV014 carrying all of the elements and functions described above is cloned into the pasmide pBAC11 (pBeloBAC11) digested with NotI. The ends are made compatible with the Klenow enzyme treatment.

La carte du vecteur pOSV403 est représentée sur la Figure 21. Le schéma de la Figure 21 indique l'orientation retenue. The map of the vector pOSV403 is shown in Figure 21. The diagram of Figure 21 indicates the selected orientation.

Etape 6 :
Le vecteur pOSV403 contient les sites Hindlll et Nsil. Le site Nsil est assez rare chez Streptomyces et présente l'avantage d'être compatible avec Pstl. En revanche, le site Pstl est fréquent chez Streptomyces et peut être utilisé pour effectuer des digestions partielles. Step 6:
The vector pOSV403 contains the HindIII and Nsil sites. The Nsil site is quite rare in Streptomyces and has the advantage of being compatible with Pstl. On the other hand, the Pstl site is frequent in Streptomyces and can be used to carry out partial digestions.

Les clones recombinants portant un insert cloné dans le répresseur CI, et donc inactivant ce répresseur deviennent résistants à la tétracycline. Etant donné que les BACs ne sont présents qu'à raison d'une copie par cellule, il faut sélectionner les clones recombinants avec une dose plus faible de tétracycline que la dose habituelle de 20 g/ml, par The recombinant clones carrying an insert cloned in the repressor CI, and thus inactivating this repressor, become resistant to the tetracycline. Since BACs are only present at one copy per cell, recombinant clones should be selected with a lower dose of tetracycline than the usual dose of 20 g / ml per day.

exemple avec une dose de 5 g/ml. Dans ces conditions il n'y a aucun bruit de fond. example with a dose of 5 g / ml. Under these conditions there is no background noise.

Il est aussi possible d'utiliser un système développé et commercialisé par la société InVitrogen, dans lequel l'insertion d'ADN dans le vecteur inactive un inhibiteur de la gyrase dont l'expression est toxique pour E. coli. Le fragment est préférentiellement isolé à partir du vecteur pZErO-2 (http://www.invitrogen.com/). It is also possible to use a system developed and marketed by InVitrogen, in which the insertion of DNA into the vector inactivates a gyrase inhibitor whose expression is toxic to E. coli. The fragment is preferably isolated from the vector pZErO-2 (http://www.invitrogen.com/).

Exemple 9 : Construction d'une banque de S. alboniger dans les 2 cosmides intégratif (pOS7001) et replicatif (pOS700R) 1) - Construction de la Banque
Pour évaluer l'efficacité du système de clonage, la voie de biosynthèse de la puromycine de Streptomyces alboniger, a été clonée dans les deux cosmides navettes pOS7001 et pOS700R. Les gènes de la voie de biosynthèse de la puromycine sont portés par un fragment d'ADN BamHI d'environ 15 Kb. Example 9 Construction of an S. alboniger library in the two cosmids integrative (pOS7001) and replicative (pOS700R) 1) - Construction of the Bank
To evaluate the efficiency of the cloning system, the puromycin biosynthetic pathway of Streptomyces alboniger was cloned into both shuttle cosmos pOS7001 and pOS700R. The genes of the puromycin biosynthetic pathway are carried by a BamHI DNA fragment of about 15 Kb.

L'ADN génomique de Streptomyces alboniger a été isolé. 90% de cet ADN possède un poids moléculaire compris entre 20 et 150 Kb, déterminé par électrophorèse en champ pulsé. Genomic DNA from Streptomyces alboniger was isolated. 90% of this DNA has a molecular weight between 20 and 150 Kb, determined by pulsed field electrophoresis.

Les deux cosmides ont été digérés par l'enzyme BamHl (site unique de clonage). Both cosmids were digested with the BamHI enzyme (unique cloning site).

Les conditions de digestion partielle BamHl de l'ADN génomique ont été déterminées (50 g d'ADN et 12 unités d'enzyme, 5 minutes de digestion). Après vérification de la taille par électrophorèse en gel d'agarose, l'ADN partiellement digéré a été introduit dans les vecteurs. Dans la ligation, 15 g d'ADN génomique + 2 g du vecteur intégratif ou 5 g du vecteur réplicatif ont été utilisés. The BamHl partial digestion conditions of the genomic DNA were determined (50 g of DNA and 12 units of enzyme, 5 minutes of digestion). After checking the size by agarose gel electrophoresis, the partially digested DNA was introduced into the vectors. In the ligation, 15 g of genomic DNA + 2 g of the integrative vector or 5 g of the replicative vector were used.

Chaque mélange de ligation a été utilisé pour l'encapsidation in vitro de l'ADN dans les têtes de bactériophage lambda. Les mélanges d'encapsidation (0,5ml) ont été titrés (Vecteur intégratif pOS7001 = 7,5 x 105 cosmides/ml, Vecteur réplicatif= 5 x 104 cosmides/ml). Each ligation mixture was used for in vitro encapsidation of DNA in bacteriophage lambda heads. The encapsidation mixtures (0.5 ml) were titrated (integrative vector pOS7001 = 7.5 x 105 cosmids / ml, replicative vector = 5 x 104 cosmids / ml).

Les cosmides ont été utilisés pour transfecter E. coli et générer ainsi deux banques d'environ 25000 clones résistant à l'ampicilline. L'ADN de l'ensemble de ces clones a été isolé et quantifié. Cosmids were used to transfect E. coli and thereby generate two libraries of approximately 25,000 ampicillin resistant clones. The DNA of all these clones was isolated and quantified.

Pour tester les banques, plusieurs clones ont été choisis, l'ADN purifié et a été digéré par BamHl, afin de vérifier la présence et la taille des inserts. Les clones testés contiennent entre 20 et 35 Kb d'insert de S. alboniger. To test the libraries, several clones were chosen, the DNA purified and digested with BamHI to verify the presence and size of the inserts. The clones tested contain between 20 and 35 Kb of S. alboniger insert.

2) - Identification des clones contenant la voie de biosynthèse de la puromycine Les clones susceptibles de contenir la voie complète de biosynthèse de la puromycine ont été identifiés par hybridation avec une sonde correspondant au gène de résistance à la puromycine, le gène pac de 1,1 kb. (Lacalle et al. Gene 1989;79, 375-80 ) Banque faite dans le Vecteur Intégratif pOS 7001:
Parmi 2000 clones analysés, 9 clones ont hybridé avec la sonde et ils contiennent des inserts d'environ 40 kb. 2) - Identification of Clones Containing the Puromycin Biosynthetic Pathway Clones capable of containing the complete puromycin biosynthetic pathway were identified by hybridization with a probe corresponding to the puromycin resistance gene, the pac gene of 1, 1 kb. (Lacalle et al., Gene 1989, 79, 375-80) Bank made in the Integrative Vector pOS 7001:
Among 2000 clones analyzed, 9 clones hybridized with the probe and they contain inserts of about 40 kb.

Banque faite dans le Vecteur replicatif pOS 700R: Parmi 2000 clones analysés, 12 clones ont hybridé avec la sonde ; ils contiennent des inserts d'environ 40 kb. Bank made in the replicative vector pOS 700R: Among 2000 clones analyzed, 12 clones hybridized with the probe; they contain inserts of about 40 kb.

En utilisant les données publiées par Tercero et al. (J Biol Chem. Using data published by Tercero et al. (J Biol Chem.

1996; 271,1579-90), les clones contenant la totalité de la voie de biosynthèse ont été identifiés, après hybridation avec des sondes appropriées. Certains cosmides intégratifs et replicatifs présentent après digestion Clal-EcoRV un fragment de 12360 paires de bases, ce qui laisse supposer un insert contenant la totalité de la voie de biosynthèse de la puromycine. 1996; 271,1579-90), clones containing the entire biosynthetic pathway were identified after hybridization with appropriate probes. Some integrative and replicative cosmids have a fragment of 12360 base pairs after Clal-EcoRV digestion, suggesting an insert containing the entire puromycin biosynthesis pathway.

4) - Vérification de la production de puromycine par les clones résistants (Rhône-Poulenc). a) Matériels et Méthodes 4) - Verification of puromycin production by resistant clones (Rhône-Poulenc). a) Materials and Methods

Souches et conditions de culture : Trois clones résistants ont été sélectionnés pour vérifier la production de puromycine. Ils correspondent aux recombinants de S. lividans contenant un insert dans le vecteur intégratif pOS7001 (G 20) ou un insert dans le vecteur réplicatif (G21 et G22). Strains and culture conditions: Three resistant clones were selected to verify the production of puromycin. They correspond to the recombinants of S. lividans containing an insert in the integrative vector pOS7001 (G20) or an insert in the replicative vector (G21 and G22).

Des souches de référence ont été utilisées pour s'assurer que les milieux de culture utilisés permettaient cette production. Il s'agit de la souche sauvage S. alboniger ATCC 12461, productrice de puromycine et de la souche recombinante S. lividans contenant le cluster complet de la puromycine cloné dans le plasmide pRCP11(Lacalle et al, 1992, the EMBO journal, 11, 785-792) (G23). Reference strains were used to ensure that the culture media used allowed this production. It is the wild type S. alboniger ATCC 12461 strain producing puromycin and the S. lividans recombinant strain containing the complete puromycin cluster cloned in the plasmid pRCP11 (Lacalle et al., 1992, the EMBO journal, 11). 785-792) (G23).

Les souches sont ensemencés dans un milieu de culture dont la composition est la suivante : Peptone bactériologique Organotechnie 5 g/l de milieu final Extrait de levure Springer 5 Extrait de viande Liebig 5 Glucose Prolabo 15 CaC03 (1) Prolabo 3 NaCI Prolabo 5 Agar (2) Difco 1 (1) Les 3g de carbonate sont mélangés à 200ml d'eau distillée puis stérilisés à part. L'addition se faisant après stérilisation. The strains are seeded in a culture medium whose composition is as follows: Bacteriological peptone Organotechnie 5 g / l of final medium Springer yeast extract 5 Liebig meat extract 5 Prolabo 15 glucose CaC03 (1) Prolabo 3 NaCI Prolabo 5 Agar ( 2) Difco 1 (1) The 3 g of carbonate are mixed with 200 ml of distilled water and sterilized separately. The addition is done after sterilization.

(2) L'agar est préalablement fondu dans 100ml d'eau distillée avant d'être ajouté aux autres ingrédients du milieu pH ajusté à 7,2 avant stérilisation stérilisation 25 minutes à 121 C (2) The agar is premixed in 100ml of distilled water before being added to the other ingredients of pH medium adjusted to 7.2 before sterilization sterilization 25 minutes at 121 ° C

50 g/l d'hygromycine et 5 g/l de thiostrepton sont ajoutés au milieu après stérilisation de façon à maintenir une pression de sélection des clones contenant un insert grâce au gène marqueur présent sur le vecteur ( le gène de résistance au thiostrepton étant porté par le plasmide pRCP11). 50 g / l of hygromycin and 5 g / l of thiostrepton are added to the medium after sterilization so as to maintain a selection pressure of the clones containing an insert by virtue of the marker gene present on the vector (the thiostrepton resistance gene being worn by the plasmid pRCP11).

50 ml de milieu de culture liquide, répartis en erlenmeyers de 250 ml, sont ensemencés avec 2 ml de suspension aqueuse de spores et de mycelium de chacune des souches. Les cultures sont incubées pendant 4 jours à 28 C avec une agitation de 220 trs/mn.50 ml de milieux de production, répartis en erlenmeyers de 250 ml, sont ensuite ensemencés avec 2 ml de ces pré-cultures. Le milieu de production utilisé est un milieu industriel optimisé pour la production de pristinamycine (milieu RPR 201). Les cultures sont incubées à 28 C, avec une agitation de 220trs/mn. Après différents temps d'incubation, un erlenmeyer de chaque culture est amené à pH 11 puis extrait par 2 fois 1 volume de dichlorométhane. La phase organique est concentrée à sec sous pression réduite, puis l'extrait est repris par 10 l de méthanol. 100 l de la solution méthanolique sont analysés en CLHP munie d'un détecteur à barrette de diodes dans un système gradient eau-acétonitrile 0,05% TFA VA/ sur colonne C18 pour la détection de la puromycine. b) Résultats
Les analyses HPLC comparatives à partir des cultures des différentes souches montrent la production de puromycine dans la culture de la souche sauvage à partir de 24 h d'incubation. Une production, bien que plus faible, est aussi nettement détectée à partir de 48 h dans la culture du clone G20 contenant le cosmide pOS7001 (figure 23). La puromycine a également été détecté à l'état de trace dans le clone G23 contenant l'opéron complet codant pour le composé dans le plasmide pRCP11. Néanmoins, aucune production n'a été observée dans les cultures des 50 ml of liquid culture medium, divided into 250 ml Erlenmeyer flasks, are inoculated with 2 ml of aqueous suspension of spores and mycelium of each of the strains. The cultures are incubated for 4 days at 28 ° C. with stirring of 220 rpm. 50 ml of production media, divided into 250 ml Erlenmeyer flasks, are then inoculated with 2 ml of these pre-cultures. The production medium used is an industrial environment optimized for the production of pristinamycin (RPR 201 medium). The cultures are incubated at 28 ° C. with stirring of 220 rpm. After different incubation times, an Erlenmeyer flask of each culture is brought to pH 11 and then extracted with 2 times 1 volume of dichloromethane. The organic phase is concentrated to dryness under reduced pressure and the extract is taken up in 10 l of methanol. 100 l of the methanolic solution are analyzed by HPLC equipped with a diode array detector in a 0.05% TFA VA / C18 column water-acetonitrile gradient system for the detection of puromycin. b) Results
Comparative HPLC analyzes from the cultures of the different strains show the production of puromycin in the culture of the wild strain from 24 hours of incubation. Production, although lower, is also clearly detected from 48 h in the G20 clone culture containing cosmid pOS7001 (FIG. 23). Puromycin was also detected in trace amounts in clone G23 containing the complete operon encoding the compound in plasmid pRCP11. Nevertheless, no production was observed in

clones G21 et G22 contenant le cosmide pOS700R. Les résultats sont reportés sur la Figure 23. c) Conclusions
Les résultats obtenus permettent de démontrer l'efficacité du système de clonage développé dans le cosmide pOS7001 pour exprimer chez un hôte hétérologue tel que S. lividans une voie de biosynthèse complète sous le contrôle de séquences régulatrices qui lui sont propres. D'autre part, ces données valident également le criblage des banques obtenues sur la base de la résistance des clones à la puromycine puisqu'il a conduit à identifier parmi un petit nombre de clones, un recombinant capable d'exprimer la voie de biosynthèse associée au gène de résistance. L'absence de production de puromycine chez les autres clones peut probablement s"expliquer par le clonage d'une partie seulement de l'opéron contenant le gène de résistance mais dépourvue de certaines séquences de régulation, transduction ou transcription nécessaires à la synthèse du composé. clones G21 and G22 containing the cosmid pOS700R. The results are shown in Figure 23. c) Conclusions
The results obtained make it possible to demonstrate the effectiveness of the cloning system developed in the cosmid pOS7001 to express in a heterologous host such as S. lividans a complete biosynthetic pathway under the control of its own regulatory sequences. On the other hand, these data also validate the screening of the libraries obtained on the basis of the resistance of the clones to puromycin since it led to identify among a small number of clones, a recombinant capable of expressing the associated biosynthetic pathway. to the resistance gene. The absence of puromycin production in the other clones can probably be explained by the cloning of only part of the operon containing the resistance gene but lacking certain regulatory, transduction or transcription sequences necessary for the synthesis of the compound. .

EXEMPLE 10 : - CLONAGE D'ADN DU SOLDANS DES VECTEURS 1) - Préparation de l'ADN du sol à cloner
Les différents fragments d'ADN doivent être purifiées selon leur destination :
Cosmides
La taille des molécules doit être comprise entre 30 et 40kb. Or , l'ADN extrait du sol est hétérogène en taille et comprend des molécules atteignant 200 ou 300kb. Afin d'homogénéiser les tailles, l'ADN est cassé mécaniquement par passage de la solution à travers une aiguille de 0,4mm de diamètre. Les fragments d'une taille voisine de 30kb ne sont pas affectés par ces passages répétés à travers une aiguille et il n'est EXAMPLE 10: CLONING DNA OF THE VECTOR SOLDANS 1) - Preparation of the DNA of the soil to be cloned
The different DNA fragments must be purified according to their destination:
cosmids
The size of the molecules must be between 30 and 40kb. However, the DNA extracted from the soil is heterogeneous in size and comprises molecules up to 200 or 300kb. In order to homogenize the sizes, the DNA is broken mechanically by passing the solution through a needle of 0.4 mm in diameter. Fragments of a size close to 30kb are not affected by these repeated passages through a needle and it is not

donc pas nécessaire de faire une séparation par la taille surtout que l'empaquetage dans les particules élimine automatiquement les inserts courts. therefore not necessary to make a separation by the size especially that the packaging in the particles automatically eliminates the short inserts.

BACs
Préparation de l'ADN
L'ADN du sol est séparé par electrophorèse en champ pulsé (type
CHEF) dans des conditions telles que les fragments compris entre
100 et 300kb sont concentrés dans une bande d'environ 5mm. Ceci est obtenu en réalisant la migration dans un gel à 0,7% d'agarose normal ou 1% d'agarose à bas point de fusion avec un temps de pulsation de 100 secondes pendant 20 heures et à une température de 10 C. BACs
DNA preparation
The soil DNA is separated by pulsed field electrophoresis (type
CHEF) in such conditions as fragments between
100 and 300kb are concentrated in a band of about 5mm. This is achieved by running in a 0.7% normal agarose or 1% low melting agarose gel with a 100 second pulse time for 20 hours and at a temperature of 10 C.

Récupération de l'ADN
Deux méthodes sont utilisées, leur choix dépend de la taille des molécules que l'on veut isoler, soit jusqu'à 150kb soit au dessus. DNA recovery
Two methods are used, their choice depends on the size of the molecules that we want to isolate, up to 150kb or above.

- Jusqu'à 150kb
La porosité d'un gel à 0,7% d'agarose permet la sortie de l'ADN par électroélution à condition d'absence totale de bromure d'ethidium. Cet
ADN est ensuite manipulé avec des instruments de pipetage à orifice agrandi et hydrophobe pour éviter la fragmentation mécanique des molécules. - Up to 150kb
The porosity of a 0.7% agarose gel allows the release of the DNA by electroelution provided complete absence of ethidium bromide. This
DNA is then manipulated with enlarged and hydrophobic orifice pipetting instruments to avoid mechanical fragmentation of the molecules.

- Entre 100 et 300kb
La bande contenant les fragments d'une taille entre 100 et 300kb est découpée. Pour la migration un gel d'agarose à 1% et à bas point de fusion est utilisé. Cette propriété permet de fondre le gel à une température supportable pour l'ADN de 65 C et de le digérer ensuite par l'agarase (Agarase commercialisée par la société Boehringer) à une température de 45 C suivant les prescriptions du fournisseur. - Between 100 and 300kb
The band containing the fragments of a size between 100 and 300 kb is cut. For migration, a 1% agarose gel with a low melting point is used. This property makes it possible to melt the gel at a tolerable temperature for the 65 C DNA and then to digest it with the agarase (Agarase sold by the company Boehringer) at a temperature of 45 ° C. according to the supplier's instructions.

2) - Utilisation des cosmides intégratifs pOS7001 et replicatifs pOS700R Construction par queues polyA polyT Principe Un vecteur cosmide, ouvert à un site de clonage quelconque, est modifié aux extrémités 3' en ajoutant un polynucléotide monotone. D'autre part, l'ADN à cloner est modifié aux extrémités 3' en ajoutant un polynucléotide monotone pouvant s'apparier au précédent. 2) Use of integrative cosmos pOS7001 and replicative pOS700R PolyA polyT tails construction Principle A cosmid vector, open at any cloning site, is modified at the 3 'ends by adding a monotonous polynucleotide. On the other hand, the DNA to be cloned is modified at the 3 'ends by adding a monotonous polynucleotide that can be paired with the preceding one.

L'association vecteur-fragment à cloner se fait par ces polynucléotides et la séquence cos du vecteur permet l'empaquetage in vitro de l'ADN dans des capsides de phage Lamda. The vector-fragment association to be cloned is by these polynucleotides and the cos sequence of the vector allows the in vitro packaging of the DNA in Lamda phage capsids.

Préparation du vecteur Le vecteur utilisé est un vecteur autoréplicatif chez E. coli et intégratif chez Streptomyces. Vector Preparation The vector used is a self-replicating vector in E. coli and integrative in Streptomyces.

Pour E. coli, la sélection se fait sur la résistance à l'ampicilline et pour Streptomyces, elle se fait sur la résistance à l'hygromycine . For E. coli, the selection is on ampicillin resistance and for Streptomyces, it is on resistance to hygromycin.

Le cosmide est ouvert à l'un des 2 sites possibles (BamHl ou Hindlll) et les extrémités 3' sont rallongées par du polyA avec de la terminale transférase dans les conditions où le fournisseur de l'enzyme prévoit l'addition de 50 à 100 nucléotides. The cosmid is opened at one of the two possible sites (BamHI or HindIII) and the 3 'ends are lengthened by polyA with terminal transferase under the conditions where the supplier of the enzyme provides the addition of 50 to 100 nucleotides.

Préparation de l'ADN à insérer. Preparation of the DNA to be inserted

Les extrémités 3' de l'ADN sont rallongées par du polyT avec de la terminale transférase dans les conditions fournissant un allongement comparable à celui du vecteur. Dans les conditions expérimentales décrites par le fabricant les queues polyA polyT sont longues de 30 à 70 bases The 3 'ends of the DNA are extended by polyT with terminal transferase under the conditions providing an elongation comparable to that of the vector. Under the experimental conditions described by the manufacturer, polyA polyT tails are 30 to 70 bases long.

Assemblage des molécules et encapsidation in vitro. Assembly of molecules and encapsidation in vitro.

Pour l'assemblage des molécules, on mélange une molécule de vecteur pour une molécule d'ADN inséré. La concentration de l'ADN en masse est de 500 g.ml-1. For assembly of the molecules, a vector molecule is mixed for an inserted DNA molecule. The concentration of the DNA in mass is 500 g.ml-1.

Le mélange est encapsidé et l'efficacité de transfection dépend de la souche utilisée comme réceptrice et de l'ADN inséré : nulle avec l'ADN test et la souche DH5a, l'efficacité est comparable pour les souches SURE et DH10B ; à l'extraction le rendement en ADN est cependant plus élevé avec la souche DH10B. The mixture is packaged and the transfection efficiency depends on the strain used as the recipient and the inserted DNA: zero with the test DNA and the strain DH5a, the efficiency is comparable for the SURE and DH10B strains; at extraction, however, the yield of DNA is higher with strain DH10B.

Construction par déphosphorylation L'ADN du sol est mis en bouts francs par élimination des séquences 3' sortantes et remplissage des séquences 5' sortantes. Cette opération est faite avec : enzyme de Klenow, T4 polymérase, les 4 nucléotides triphosphates. Le vecteur cosmidique est digéré par BamHI, puis traité par l'enzyme de Klenow pour le rendre bout franc puis déphosphorylé pour éviter qu'il ne se referme sur lui même. Après ligation, le mélange est encapsidé et transfecté comme précédemment décrit. Construction by dephosphorylation The soil DNA is blunt ended by removing the outgoing 3 'sequences and filling the outgoing 5' sequences. This operation is done with: Klenow enzyme, T4 polymerase, the 4 nucleotides triphosphates. The cosmid vector is digested with BamHI, then treated with the Klenow enzyme to make it free and then dephosphorylated to prevent it from closing on itself. After ligation, the mixture is packaged and transfected as previously described.

3) - Utilisation des pBAC Principe. 3) - Use of pBAC Principle.

Le plasmide pBAC conjugatif et intégratif possède les sites Hindlll et Nsil comme sites de clonage L'insertion d'une séquence d'ADN à ces sites inactive le répresseur CI du phage Lambda qui contrôle l'expression du gène de la résistance à la tétracycline. L'inactivation du répresseur rend donc la cellule résistante à cet antibiotique (5 g.ml-1). Le clonage à ces sites est facilité par la modification du vecteur et la préparation de l'ADN à cloner. The conjugative and integrative pBAC plasmid possesses the HindIII and Nsil sites as cloning sites. The insertion of a DNA sequence at these sites inactivates the phage Lambda repressor IC which controls the expression of the tetracycline resistance gene. Inactivation of the repressor therefore makes the cell resistant to this antibiotic (5 g.ml-1). Cloning at these sites is facilitated by vector modification and preparation of the DNA to be cloned.

Préparation du vecteur. Exemple Hindlll Preparation of the vector Hindlll example

Pour que le vecteur ne se referme pas sur lui-même, le site Hind III est modifié : la première base (A) est remise en place pour former une séquence 5' sortante, qui ne peut pas s'apparier avec ses semblables. In order for the vector not to close on itself, the Hind III site is modified: the first base (A) is put back in place to form a 5 'outgoing sequence, which can not be paired with its peers.

L'opération est effectuée par l'enzyme de Klenow en présence de dATP. The operation is performed by the Klenow enzyme in the presence of dATP.

Le succès de l'opération est vérifié en effectuant une ligation du vecteur sur lui-même avant et après traitement à l'enzyme de Klenow. A quantité d'ADN testé identique, on obtient 3000 clones avant traitement et 60 après traitement. The success of the operation is verified by carrying out a ligation of the vector on itself before and after treatment with the Klenow enzyme. At the same amount of tested DNA, 3000 clones are obtained before treatment and 60 after treatment.

Préparation de l'ADN (taille comprise entre 100 et 300kb). Preparation of the DNA (size between 100 and 300 kb).

Mise en bouts francs de l'ADN. Blunt-ended DNA.

L'ADN est mis en bouts francs par élimination des séquences 3' sortantes et remplissage des séquences 5' sortantes. Cette opération est faites avec : enzyme de Klenow, T4 polymérase, les 4 nucléotides triphosphates. The DNA is blunt ended by removing the outgoing 3 'sequences and filling the outgoing 5' sequences. This operation is done with: Klenow enzyme, T4 polymerase, the 4 nucleotides triphosphates.

Préparation des extrémités.Exemple Hindlll
L'addition de l'ADN sur le vecteur se fait au moyen d'oligo-nucléotides reconnaissant la séquence Hindlll modifiée du vecteur. Ils contiennent des sites de restriction rares pour permettre les clonages ultérieurs (Swal ; Notl). cette technique est dérivée de celle de : Elledge SJ,
Mulligan JT, Ramer SW, Spottswood M, Davis RW. Proc Natl Acad
Sci U S A 1991 Mar 1;88(5):1731-5
Deux oligonucléotides complémentaires sont utilisés :
Oligo 1 : 5'-GCTTATTTAAATATTAATGCGGCCGCCCGGG-3' (SEQ ID N 25)
Oligo 2 : 5'-CCCGGGCGGCCGCATTAATATTTAAATA-3' (SEQ ID
N 26)
Ils sont phosphorylés en 5' par la polynucléotide kinase de T4 en présence d'ATP, après leur hybridation. Cette étape de phosphorylation peut être éliminée en utilisant les oligo-nucléotides déjà phosphorylés. Preparation of the extremities.Example Hindlll
The addition of the DNA to the vector is by means of oligonucleotides recognizing the modified HindIII sequence of the vector. They contain rare restriction sites to allow subsequent cloning (Swal, Notl). this technique is derived from that of: Elledge SJ,
Mulligan JT, SW Ramer, Spottswood M, Davis RW. Proc Natl Acad
Sci USA 1991 Mar 1; 88 (5): 1731-5
Two complementary oligonucleotides are used:
Oligo 1: 5'-GCTTATTTAAATATTAATGCGGCCGCCCGGG-3 '(SEQ ID N 25)
Oligo 2: 5'-CCCGGGCGGCCGCATTAATATTTAAATA-3 '(SEQ ID
N 26)
They are phosphorylated in 5 'by the T4 polynucleotide kinase in the presence of ATP, after their hybridization. This phosphorylation step can be eliminated using the already phosphorylated oligonucleotides.

La ligation de cet adaptateur double brin avec l'ADN à insérer dans un vecteur est faite par la ligase de T4 en présence d'un très grand excès d'adaptateur (1000 molécules d'adaptateur pour une molécule d'ADN à insérer), en 15 heures à 14 C. The ligation of this double-stranded adapter with the DNA to be inserted into a vector is made by T4 ligase in the presence of a very large excess of adapter (1000 adapter molecules for a DNA molecule to be inserted), in 15 hours at 14 C.

L'excès d'adaptateur est éliminé par électrophorèse sur un gel d'agarose et les molécules d'intérêt sont récupérées du gel par hydrolyse de celui-ci par de l'agarase ou par électroélution. The excess adapter is removed by electrophoresis on an agarose gel and the molecules of interest are recovered from the gel by hydrolysis thereof with agarase or electroelution.

Ligation vecteur- ADN . Vector-DNA Ligation.

La ligation se fait à 14 C sur 15 heures avec 10 molécules de vecteur pour une molécule d'insert. Ligation is at 14 C over 15 hours with 10 carrier molecules for one insert molecule.

Transformation. Transformation.

La souche réceptrice est la souche DH10B. La transformation se fait par électroporation. Pour exprimer la résistance à la tétracycline, les transformants sont incubés à 37 C pendant 1 heure en milieu sans antibiotique. La sélection des clones se fait par culture pendant une nuit , sur milieu gélosé LB additionné de tétracycline à 5 g.ml-1. The recipient strain is strain DH10B. The transformation is done by electroporation. To express the resistance to tetracycline, the transformants are incubated at 37 ° C. for 1 hour in an antibiotic-free medium. The clones are selected by culturing overnight on LB agar medium supplemented with tetracycline at 5 g.ml-1.

Exemple 11 : CONJUGAISON CLONE A CLONE ENTRE E. COLI ET STREPTOMYCES CONJUGAISON ENTRE E coLI SOUCHE S17.1 CONTENANT PPM803 ET STREPTOMYCES LIVIDANS TK 21 Introduction Il est possible d'effectuer des conjugaisons entre E. coli et Streptomyces (Mazodier et al, 1989). L'adaptation de cette méthode en développant une technique dite en goutte où l'on mélange 10 l d'une culture de E. coli contenant un vecteur recombinant à une goutte de S. lividans récepteur consiste à réaliser une transformation de clone à clone en s'assurant qu'à la fin de l'opération toute la banque construite dans E. coli est introduite dans S. lividans. Une transformation en vrac amènerait obligatoirement à une multiplication des clones de Streptomyces transformants afin d'être Example 11 CLONE CLONE CONJUGATION BETWEEN E. COLI AND STREPTOMYCES CONJUGATION BETWEEN E STAINLESS STEEL S17.1 CONTAINING PPM803 AND STREPTOMYCES LIVIDANS TK 21 Introduction It is possible to carry out conjugations between E. coli and Streptomyces (Mazodier et al, 1989) . Adaptation of this method by developing a so-called droplet technique in which 10 l of a culture of E. coli containing a recombinant vector is mixed with a drop of S. lividans receptor consists in carrying out a cloned clone transformation into ensuring that at the end of the operation the entire bank built in E. coli is introduced into S. lividans. A bulk transformation would necessarily lead to a multiplication of clones of Streptomyces transformants in order to be

pratiquement sûr que la banque dans E. coli est complètement représentée dans S. lividans. practically sure that the library in E. coli is completely represented in S. lividans.

De plus cette méthode est facilement automatisable. Moreover this method is easily automatable.

Essais préliminaires Conjugaison entre E. coli souche S17. 1 contenant le vecteur pOSV303 et S. lividans TK21. Preliminary tests Conjugation between E. coli strain S17. 1 containing the vector pOSV303 and S. lividans TK21.

Dans ces conditions, on mélange 6 x 106cellules de E. coli avec 2 x 106 spores pré-germées de S. lividans dans un volume final de 20 l. Under these conditions, 6 × 10 6 cells of E. coli are mixed with 2 × 10 6 pre-germinated S. lividans spores in a final volume of 20 l.

Mise au point de la méthode Il est connu que l'ADN extrait de certains actinomycètes est modifié et de ce fait ne peut être introduit dans certaines souches de E. coli sans qu'il soit restreint. La souche de E. coli DH10B qui accepte ces ADN n'est pas capable de transférer à Streptomyces un plasmide ne contenant que oriT, et il est donc nécessaire d'en construire une. Il faudrait y introduire par intégration dans le chromosome un dérivé de RP4 capable de fournir en trans toutes les fonctions nécessaires pour assurer le transfert des clones recombinants contenant l'origine de transfert oriT. Development of the method It is known that the DNA extracted from certain actinomycetes is modified and therefore can not be introduced into certain strains of E. coli without being restricted. The strain of E. coli DH10B that accepts these DNAs is not capable of transferring to Streptomyces a plasmid containing only oriT, and it is therefore necessary to construct one. It should be introduced by integration into the chromosome a derivative of RP4 capable of providing in trans all the functions necessary to ensure the transfer of recombinant clones containing the oriT transfer origin.

Exemple 12 : Construction d'une banque cosmidique dans E, coli et Streptomyces lividans : Clonage de l'ADN du sol
L'objectif est la construction d'une librairie d'ADN de grande taille issue de l'environnement, sans étape préalable de culture.des microorganismes, dans le but d'accéder aux gènes métaboliques de bactéries (ou de tout autre organisme) que l'on ne sait pas cultiver dans des conditions standard de laboratoire. Example 12 Construction of a cosmid library in E. coli and Streptomyces lividans: Cloning of soil DNA
The objective is the construction of a large library of DNA from the environment, without prior step of culture.of microorganisms, in order to access the metabolic genes of bacteria (or any other organism) that it is not known to grow under standard laboratory conditions.

La procédure décrite a été utilisée pour générer une banque d'ADN dans Escherichia coli utilisant le cosmide navette E. coli-S. lividans pOS7001 et de l'ADN extrait et purifié de la fraction bactérienne d'un sol . Cette dernière méthode permet d'obtenir de l'ADN d'une grande pureté et The described procedure was used to generate a DNA library in Escherichia coli using E. coli-S shuttle cosmid. lividans pOS7001 and DNA extracted and purified from the bacterial fraction of a soil. This last method makes it possible to obtain DNA of high purity and

d'une taille moyenne de 40 kb. Aussi, afin d'éviter pour le clonage une digestion partielle de l'ADN extrait, a été adoptée une stratégie alternative basée sur l'utilisation de l'enzyme terminale tranférase qui permet d'ajouter des queues de polynucléotides aux extrémités 3' de l'ADN et du vecteur. an average size of 40 kb. Also, in order to avoid for cloning a partial digestion of the extracted DNA, has been adopted an alternative strategy based on the use of the terminal tranferase enzyme which makes it possible to add tails of polynucleotides to the 3 'ends of the DNA and the vector.

5 g d'ADN ont été extraits de 60 mg de sol de " la Côte Saint André " selon le protocole décrit à l'exemple 3 et traités avec de la terminale transférase (Pharmacia) pour rallonger les extrémités 3' avec un polynucléotide monotone (poly T) (Exemple 10). 5 g of DNA were extracted from 60 mg of "Côte Saint André" soil according to the protocol described in Example 3 and treated with terminal transferase (Pharmacia) to lengthen the 3 'ends with a monotonous polynucleotide ( poly T) (Example 10).

Le cosmide intégratif pOS7001 est préparé selon le protocole B1, Orsay. Après une étape classique de purification en présence de phénol/chloroforme, l'ADN et le vecteur sont assemblés en mélangeant une molécule de vecteur et une molécule d'ADN inséré. Le mélange est ensuite encapsidé dans les têtes de bactériophages lambda (kit Amersham) qui servent à transfecter E. coli DH10B. Les cellules transfectées sont ensuite ensemencées sur milieu LB agar en présence d'ampicilline pour sélection des recombinants résistants à cet antibiotique. The integrative cosmid pOS7001 is prepared according to the protocol B1, Orsay. After a conventional phenol / chloroform purification step, the DNA and vector are assembled by mixing a vector molecule and an inserted DNA molecule. The mixture is then packaged in lambda bacteriophage heads (Amersham kit) which serve to transfect E. coli DH10B. The transfected cells are then seeded on LB agar medium in the presence of ampicillin for selection of recombinants resistant to this antibiotic.

Une banque d'environ 5000 clones d'E. coli résistants à l'ampicilline a été obtenue Chaque clone a été ensemencé en milieu LB ou TB + ampicilline dans un puits de microplaque (96 puits) et conservé à -80 C . A library of about 5000 E clones. Each clone was seeded in LB or TB + ampicillin medium in a microplate well (96 wells) and stored at -80 ° C.

La séquence aux sites d'insertions des fragments du sol dans le vecteur, pOS7001, générés pendant la construction de la banque a été analysée. Pour cela 17 cosmides de la banques ont été purifié et séquence avec une amorce, seq. 5' CCGCGAATTCTCATGTTTGACCG 3', qui hybride entre les site BamHl et le site de clonage Hindlll présente dans le vecteur. The sequence at the site of insertions of the soil fragments in the vector, pOS7001, generated during the construction of the library was analyzed. For this purpose, 17 cosmids of the library were purified and sequenced with a primer, seq. 5 'CCGCGAATTCTCATGTTTGACCG 3', which hybridizes between the BamHI site and the HindIII cloning site present in the vector.

Les séquences obtenues ont permis d'estimer que la longueur des queues homopolymériques aux points de jonctions est très variable, entre 13 et 60 poly-dA/dT. Au-delà des queues, les séquences des fragments du sol ainsi générées possèdent un pourcentage en G+C entre 53 et 70 %. Des pourcentages si élevés étaient inattendus, mais des résultas The sequences obtained made it possible to estimate that the length of the homopolymeric tails at the junction points is very variable, between 13 and 60 poly-dA / dT. Beyond the tails, the sequences of the soil fragments thus generated have a percentage in G + C between 53 and 70%. Such high percentages were unexpected, but results

similaires ont été déjà reportés sur des préparation brut d'ADN à partir de sol (Chatzinotas A. étal., 1998). similar have already been reported on crude preparation of DNA from soil (Chatzinotas A. et al., 1998).

Une stratégie de " pooling " de 48 ou 96 clones à été utilisée pour l'analyse de la richesse microbienne et métabolique. L'ADN cosmidique extrait à partir de ces " pools " de clones à été utilisé ensuite pour réaliser des expériences de PCR ou d'hybridation. A pooling strategy of 48 or 96 clones was used for the analysis of microbial and metabolic richness. The cosmid DNA extracted from these "pools" of clones was then used to perform PCR or hybridization experiments.

Exemple 13 : Diversité de l'ADN ribosomique 16S au sein de l'ADN cloné. a) Matériels et méthodes
Les cosmides de la banque sont extraits à partir de pools de clones par lyse alcaline puis sont purifiés sur gradient de chlorure de césium, afin de prélever la bande d'ADN cosmidique sous forme superenroulée et dans le but d'éliminer tout ADN chromosomique d'Escherichia coli pouvant interférer dans l'étude. Example 13: Diversity of 16S ribosomal DNA within the cloned DNA. a) Materials and methods
The cosmids of the library are extracted from pools of clones by alkaline lysis and are then purified on cesium chloride gradient, in order to collect the cosmid DNA band in supercoiled form and in order to eliminate any chromosomal DNA from Escherichia coli may interfere in the study.

Après linéarisation des cosmides par action de la nucléase S1 (50 unités, 30 minutes à 37 C), les séquences d'ADNr 16S contenues dans les pools de clones sont amplifiées dans les conditions standard d'amplification, à partir des amorces universelles 63f (5'CAGGCCTAACACATGCAAGTC-3') et 1387r (5'GGGCGGWGTGTACAAGGC-3') définies par MARCHESI et al.(1998). Les produits d'amplification d'environ 1.5 kilobases sont purifiés à partir du kit Qiaquik gel extraction (Qiagen) puis directement clonés dans le vecteur pCR Il (Invitrogen) chez Escherichia coli TOP10, selon les instructions du fabricant. L'insert est alors amplifié à l'aide des amorces M13 Forward et M13 reverse spécifiques au site de clonage du vecteur pCR II. Les produits d'amplification de taille attendue (environ 1,7 kb) sont analysés par RFLP (Restriction Fragment Length Polymorphism) à l'aide des enzymes Cfol, Mspl et BstUI (0,1 unités) afin de sélectionner les After linearization of cosmids by action of S1 nuclease (50 units, 30 minutes at 37 ° C.), the 16S rDNA sequences contained in the pools of clones are amplified under the standard amplification conditions, starting from the universal primers 63f ( 5'CAGGCCTAACACATGCAAGTC-3 ') and 1387r (5'GGGCGGWGTGTACAAGGC-3') defined by MARCHESI et al (1998). The amplification products of about 1.5 kilobases are purified from the Qiaquik gel extraction kit (Qiagen) and then directly cloned into the vector pCR II (Invitrogen) in Escherichia coli TOP10, according to the manufacturer's instructions. The insert is then amplified using the M13 Forward and M13 reverse primers specific to the pCR II vector cloning site. Amplification products of expected size (about 1.7 kb) are analyzed by RFLP (Restriction Fragment Length Polymorphism) using the enzymes Cfol, Mspl and BstUI (0.1 units) to select the

clones à séquencer. Les profils de restriction obtenus sont séparés sur gel d'agarose Métaphore 2. 5% (FMC Products) contenant 0,4 mg de bromure d'éthidium parmi. clones to sequence. The restriction profiles obtained are separated on agarose gel Metaphore 2.5% (FMC Products) containing 0.4 mg of ethidium bromide among.

Les séquences d'ADNr 16S sont alors déterminées directement en utilisant les produits PCR purifiés par le kit " Qiaquick gel extraction " à l'aide des amorces de séquençage définies par Normand (1995). Les analyses phylogénétiques sont obtenues en comparant les séquences avec les séquences d'ADNr 16S procaryotes rassemblées dans la base de données Ribosomal Database Project (RDP), version 7,0 MAIDAK et al.(1999) grâce au programme SIMILARITY MATCH, permettant d'obtenir les valeurs de similarité par rapport aux séquences de la base de données. b) Résultats
Pour déterminer la diversité phylogénétique représentée dans la banque, 47 séquences du gène ARNr16S ont été isolées à partir de pools de 288 clones et ont été séquencées dans leur quasi-totalité. Les résultats sont rapportés dans le tableau 7. The 16S rDNA sequences are then determined directly using the PCR products purified by the kit "Qiaquick gel extraction" using the sequencing primers defined by Normand (1995). Phylogenetic analyzes are obtained by comparing the sequences with the prokaryotic 16S rDNA sequences collected in the Ribosomal Database Project (RDP) database, version 7.0 MAIDAK et al (1999) using the SIMILARITY MATCH program, allowing obtain the similarity values with respect to the sequences of the database. b) Results
To determine the phylogenetic diversity represented in the library, 47 sequences of the 16S rRNA gene were isolated from pools of 288 clones and were almost completely sequenced. The results are reported in Table 7.

L'analyse des séquences par interrogations des bases de données révèle que la majorité des séquences (>61%) présentent des pourcentages de similarité inférieurs ou égaux à 95% avec des espèces bactériennes identifiées (tableau 7). Sur les 47 séquences analysées, 28 séquences ont pour plus proches voisins des bactéries non cultivées, dont les séquences ont été directement issues d'ADN extrait de l'environnement. La majorité de ces séquences présentent par ailleurs des pourcentages de similarité très faibles (88-95%), 17 séquences sur 28 diffèrent ainsi de plus de 5% par rapport à leurs voisins les plus proches. Analysis of the sequences by interrogation of the databases reveals that the majority of the sequences (> 61%) have percentages of similarity that are less than or equal to 95% with identified bacterial species (Table 7). Of the 47 sequences analyzed, 28 sequences have, for closer neighbors, non-cultured bacteria whose sequences have been derived directly from DNA extracted from the environment. The majority of these sequences also have very low percentages of similarity (88-95%), so 17 out of 28 sequences differ by more than 5% from their nearest neighbors.

Parmi les séquences pouvant être classées dans un groupe phylétique, une majorité de séquences appartiennent à la sous classe a des protéobactéries (18 séquences avec un pourcentage de similarité Of the sequences that can be classified in a phyletic group, a majority of sequences belong to the subclass a proteobacteria (18 sequences with a percentage of similarity

compris entre 89 et 99%). Un second groupe de séquences est représenté par la sous classe g des protéobactéries, comprenant 9 séquences dont les pourcentages de similarité varient entre 84 et 99%). Les groupes des b-protéobactéries, d-protéobactéries, firmicutes à bas G+C% et à haut G+C% comprennent respectivement 1,4, 3 et 5 séquences. Seule une séquence n'a pu être classée au sein des grands groupes taxonomiques bactériens définis: la séquence a22.1 (19), son plus proche voisin Aerothermobacter marianas (avec une similarité de 89%) étant lui même une souche isolée de l'environnement marin et non classifiée à l'heure actuelle.Enfin, 6 séquences peuvent être classées au sein du groupe des Acidobacteriuml Holophaga. Ce groupe présente la particularité de n'être représenté que par deux bactéries cultivées Acidobacterium capsulatum et Holophaga foetida, l'ensemble de ce groupe étant composé par des bactéries dont seul le gène ARNr16S a été détecté par amplification et clonage à partir d'ADN extrait d'échantillon de l'environnement (principalement de sol), Ludwig et al (1997). Les faibles valeurs de similarité entre les différentes séquences composant ce groupe laisse présager une grande hétérogénéité et diversité au sein de ce groupe. between 89 and 99%). A second group of sequences is represented by the subclass g of proteobacteria, comprising 9 sequences whose similarity percentages vary between 84 and 99%). The groups of b-proteobacteria, d-proteobacteria, firmicutes at low G + C% and at high G + C% respectively comprise 1.4, 3 and 5 sequences. Only one sequence could not be classified within the large defined bacterial taxonomic groups: the sequence a22.1 (19), its nearest neighbor Aerothermobacter marianas (with a similarity of 89%) being itself an isolated strain of the Currently, 6 sequences can be classified in the group Acidobacteriuml Holophaga. This group has the particularity to be represented by only two bacteria cultivated Acidobacterium capsulatum and Holophaga foetida, the whole of this group being composed of bacteria of which only the rRNA16S gene was detected by amplification and cloning from extracted DNA. sample of the environment (mainly soil), Ludwig et al (1997). The low similarity values between the different sequences in this group suggest a great heterogeneity and diversity within this group.

* L'ensemble des résultats est représenté sur le tableau 7. * The overall results are shown in Table 7.

Ces résultats montrent que les séquences contenues dans la banque cosmidique proviendraient de micro-organismes non seulement diversifiés phylogénétiquement mais surtout de micro-organismes n'ayant jamais été isolés jusqu'à ce jour. These results show that the sequences contained in the cosmid bank come from microorganisms not only phylogenetically diversified but especially microorganisms that have never been isolated to date.

Les résultats du séquençage des ADN amplifiés ont permis d'établir un arbre phylogénétique des organismes présents dans l'échantillon de sol dont les séquences caractérisées sont originaires. The results of the sequencing of the amplified DNAs made it possible to establish a phylogenetic tree of the organisms present in the soil sample from which the characterized sequences originate.

L'arbre phylogénétique représenté à la figure 7 a été réalisé à partir de l'alignement des séquences par le logiciel MASE (Faulner et Jurak, 1988) etcorrigé par la méthode des 2 paramètres de Kimura (1980), et à The phylogenetic tree shown in Figure 7 was made from the sequence alignment by the MASE software (Faulner and Jurak, 1988) and corrected by the Kimura 2 parameter method (1980), and

l'aide de l'algorithme Neighbour Joining (Saitou et Nei 1987). L'analyse phylogénétique a permis de comparer les séquences ADNr 16S clonées dans la banque d'ADN du sol, avec les séquences d'ADNr 16S procaryotes rassemblées dans les bases de données Ribosomal Database Project (RDP), (version 7. 0, programme SIMILARITY-MATCH, Maidak et al 1999), et dans la base GenBank grâce au logicel BLAST 2.0 (Atschul et al, 1997). using the Neighbor Joining algorithm (Saitou and Nei 1987). Phylogenetic analysis compared the 16S rDNA sequences cloned in the soil DNA library with the prokaryotic 16S rDNA sequences collected in the Ribosomal Database Project (RDP) databases (version 7. 0, program SIMILARITY-MATCH, Maidak et al 1999), and in the GenBank database thanks to the BLAST 2.0 software (Atschul et al, 1997).

Exemple 14 : Présélection génétique de la banque pour l'évaluation de la richesse métabolique
Pour caractériser la banque obtenue en terme de diversité métabolique et identifier les clones contenant des inserts portant des gènes pouvant être impliqués dans des voies de biosynthèse, il a été développé selon l'invention des techniques de criblage génétique basées sur des méthodes PCR afin de détecter et d'identifier des gènes PKS de type I. Example 14: Genetic preselection of the bank for the evaluation of metabolic richness
To characterize the bank obtained in terms of metabolic diversity and to identify clones containing inserts carrying genes that may be involved in biosynthetic pathways, PCR-based genetic screening techniques have been developed according to the invention to detect and identify PKS type I genes.

1 Souches bactériennes, plasmides et conditions de culture
S. coelicolor ATCC101478, S. ambofaciens NRRL2420, S. lactamandurans ATCC27382, S. rimosus ATCC109610, B. Subtilis ATCC6633 et B. licheniformis THE1856 (collection RPR) ont été utilisés comme sources d'ADN pour les expériences de PCR. S. lividans TK24 est la souche hôte utilisée pour le cosmide navette POS1700. 1 Bacterial strains, plasmids and culture conditions
S. coelicolor ATCC101478, S. ambofaciens NRRL2420, S. lactamandurans ATCC27382, S. rimosus ATCC109610, B. Subtilis ATCC6633 and B. licheniformis THE1856 (RPR collection) were used as sources of DNA for PCR experiments. S. lividans TK24 is the host strain used for the cosmic POS1700 shuttle.

Pour la préparation d'ADN génomique, de suspensions de spores, de protoplastes et pour la transformation de S. lividans, on a suivi les protocoles standard décrits dans Hopwood et al.(1986). For the preparation of genomic DNA, spore suspensions, protoplasts and for the transformation of S. lividans, the standard protocols described in Hopwood et al (1986) were followed.

Escherichia coli Top10 (INVITROGEN) a été utilisé comme hôte pour le clonage des produits PCR et E. coli Sure (STRATAGENE) a été utilisé comme hôte pour le cosmide navette pOS7001. Les conditions de culture Escherichia coli Top10 (INVITROGEN) was used as a host for cloning PCR products and E. coli Sure (STRATAGENE) was used as a host for the cosmos shuttle pOS7001. Culture conditions

de E. coli, la préparation de plasmides, la digestion de l'ADN, l'électrophorèse sur gel d'agarose ont été realisées suivant les procédures standard (Sambroock et al., 1996). E. coli, plasmid preparation, DNA digestion, agarose gel electrophoresis were performed according to standard procedures (Sambroock et al., 1996).

2. Amorces PCR:
Les couples d'amorces a1-a2 et b1-b2 ont été définis par l'equipe de N. Bamas-Jacques et leur utilisation a été optimisée pour le criblage de l'ADN des souches pures et de la banque du sol pour la recherche de gènes codant PKSI)
Tableau 8 :Amorces PCR homologues aux gènes PKSI utilisées pour le criblage de la banque.. 2. PCR primers:
The pairs of primers a1-a2 and b1-b2 were defined by the team of N. Bamas-Jacques and their use was optimized for the screening of the DNA of the pure strains and the soil bank for research. of genes encoding PKSI)
Table 8: PCR primers homologous to the PKSI genes used for the screening of the library.

<tb>
<tb> a1 <SEP> (+) <SEP> 5' <SEP> CCSCAGSAGCGCSTSTTSCTSGA <SEP> 3'
<tb> a2 <SEP> (-) <SEP> 5' <SEP> GTSCCSGTSCCGTGSGTSTCSA <SEP> 3'
<tb> b1 <SEP> 5' <SEP> CCSCAGSAGCGCSTSCTSCTSGA <SEP> 3'
<tb> b2 <SEP> 5' <SEP> GTSCCSGTSCCGTGSGCCTCSA <SEP> 3'
<tb>
Conditions d'amplification :
Pour la recherche de PKS I à partir de l'ADN de souches pures, le mélange d'amplification contenait : dans un volume final de 50 l, entre 50 et 150 ng d'ADN génomique, 200 M de dNTP, 5mM de MgCl2 final, 7% de DMSO, tampon 1x Appligene, 0,4 M de chaque primer et 2,5U de Taq Polymerase Appligène. Les conditions d'amplification utilisées sont : dénaturation à 95 C pendant 2 minutes, hybridation à 65 C pendant 1 minute, élongation à 72 C pendant 1 minute, pour le premier cycle, suivi par 30 cycles où la température est diminuée jusqu'à 58 C comme décrit dans K. Seow et al., 1997. L'étape d'extension finale s'effectue à 72 C pendant 10 minutes. <Tb>
<tb> a1 <SEP> (+) <SEP> 5 '<SEP> CCSCAGSAGCGCSTSTTSCTSGA <SEP>3'
<tb> a2 <SEP> (-) <SEP> 5 '<SEP> GTSCCSGTSCCGTGSGTSTCSA <SEP>3'
<tb> b1 <SEP> 5 '<SEP> CCSCAGSAGCGCSTSCTSCTSGA <SEP>3'
<tb> b2 <SEP> 5 '<SEP> GTSCCSGTSCCGTGSGCCTCSA <SEP>3'
<Tb>
Amplification conditions:
In the search for PKS I from DNA of pure strains, the amplification mixture contained: in a final volume of 50 l, between 50 and 150 ng of genomic DNA, 200 M of dNTP, 5 mM of final MgCl 2 , 7% DMSO, 1x Appligene buffer, 0.4 M each primer and 2.5 U of Appligene Taq Polymerase. The amplification conditions used are: denaturation at 95 ° C. for 2 minutes, hybridization at 65 ° C. for 1 minute, elongation at 72 ° C. for 1 minute, for the first cycle, followed by 30 cycles where the temperature is decreased to 58 ° C. C as described in K. Seow et al., 1997. The final extension step is at 72 ° C for 10 minutes.

Pour la recherche de PKS I à partir de l'ADN de la banque, les conditions PCR sont les mêmes que ci-dessus pour le couple a1-a2 en utilisant entre 100 et 500 ng de cosmide extrait de pools de 48 clones. For the search for PKS I from the library DNA, the PCR conditions are the same as above for the α1-α2 pair using between 100 and 500 ng of cosmid extracted from pools of 48 clones.

Pour le couple d'amorces b1-b2 , 500ng de cosmides issus de pools de 96 clones ont été utilisés. Le mélange d'amplification contenait 200 M de dNTP, 2,5mM de MgCl2 final, 7% de DMSO, tampon 1x Quiagen, 0,4 M de chaque primer et 2,5U de Taq polymerase Hot-start (Qiagen). Les conditions d'amplification utilisées sont: dénaturation 15' à 95 C suivie For the primer pair b1-b2, 500 ng of cosmids from pools of 96 clones were used. The amplification mixture contained 200 M dNTPs, 2.5 mM final MgCl 2, 7% DMSO, 1x Quiagen buffer, 0.4 M each primer and 2.5 U Taq polymerase Hot-start (Qiagen). The amplification conditions used are: denaturation 15 'to 95 C followed

par 30 cycles : 1' de dénaturation à 95 C + 1' d'hybridation à 65 C pour le premier cycle et 62 C pour les autres cycles, 1' d'élongation à 72 C, étape d'extension finale de 10' à 72 C. by 30 cycles: 1 'of denaturation at 95 ° C + 1' of hybridization at 65 ° C. for the first cycle and 62 ° C. for the other cycles, elongation at 72 ° C., final extension step of 10 ° to 72 C.

L'identification des clones positifs à partir des pools de 48 ou 96 clones est effectuée à partir des répliques des microplaques mères correspondantes sur milieu solide ou toute autre méthode standard de réplication. Identification of positive clones from pools of 48 or 96 clones is performed from replicates of the corresponding parent microplates on solid media or any other standard replication method.

3 Sous-clonage et séquençage Les produits PCR des clones identifiés ont été séquences selon le protocole suivant : Les fragments sont purifiés sur gel d'agarose (Gel Extraction Kit ( Qiagen))et clones dans E.coli TOP 10 (Invitrogen) à l'aide du kit TOPO TA cloning kit (Invitrogen). L'ADN plasmidique de sous-clones est extrait par lyse alcaline sur un Biorobot (Qiagen) et dialysé durant 2 h sur membrane VS 0,025pm (Millipore). Les échantillons sont séquences avec les amorces M13 " Universal " et " Reverse " sur le séquenceur ABI 377 96( PERKIN ELMER). Subcloning and Sequencing The PCR products of the identified clones were sequenced according to the following protocol: The fragments are purified on agarose gel (Gel Extraction Kit (Qiagen)) and cloned in E.coli TOP 10 (Invitrogen) using the TOPO TA kit cloning kit (Invitrogen). The plasmid DNA of subclones is extracted by alkaline lysis on a Biorobot (Qiagen) and dialyzed for 2 h on VS 0.025pm membrane (Millipore). The samples are sequenced with primers M13 "Universal" and "Reverse" on the sequencer ABI 377 96 (PERKIN ELMER).

4) Résultats Définition et validation des amorces PCR
Deux régions très conservées de PKS du type 1 d'actinomycètes, comprenant le site actif de l'enzyme, ont été ciblées pour l'amplification de gènes homologues avec des amorces dégénérées. Ces deux régions correspondent aux séquences PQQR (L)(L)LE VE(A)HGTGT respectivement. 4) Results Definition and validation of PCR primers
Two highly conserved regions of actinomycete type 1 PKS, including the active site of the enzyme, have been targeted for amplification of homologous genes with degenerate primers. These two regions correspond to the sequences PQQR (L) (L) VE (A) HGTGT respectively.

Des amorces (tableau 8) ont été testées avec l'ADN de souches productrices ou non de macrolides: Streptomyces coelicolor, Primers (Table 8) were tested with the DNA of strains producing or not producing macrolides: Streptomyces coelicolor,

Streptomyces ambofaciens, producteur de spiramycine, et Saccharopolyspora erythraea, producteur de l'erythromycine. Quelles que soient les amorces utilisées, des bandes représentant des fragments d'environ 700 pb et correspondant à la longueur du fragment attendu, ont été obtenues avec toutes les souches. Streptomyces ambofaciens, producer of spiramycin, and Saccharopolyspora erythraea, producer of erythromycin. Whatever the primers used, bands representing fragments of about 700 bp and corresponding to the length of the expected fragment, were obtained with all the strains.

Ces résultats démontrent la spécificité des amorces a et b pour les gènes PKS 1 de souches productrices ou de gènes silencieux chez S. coelicolor. These results demonstrate the specificity of primers a and b for the PKS 1 genes of producer strains or silent genes in S. coelicolor.

Le séquençage des produits PCR obtenus avec le couple d'amorces a1-a2 a permis d'identifier, à partir de la souche S. ambofaciens, la séquence d'un gène KS déjà décrite (Demande de brevet européen n EP0791656) comme appartenant à la voie de biosynthèse du planténolide, précurseur macrolidique de la spiramycine, et deux séquences jamais décrites, Stramb 9 et Stramb12, (voir liste séquences). Sequencing of the PCR products obtained with the pair of primers a1-a2 made it possible to identify, from the S. ambofaciens strain, the sequence of a KS gene already described (European Patent Application No. EP0791656) as belonging to the biosynthetic pathway of plantenolide, macrolidic precursor of spiramycin, and two sequences never described, Stramb 9 and Stramb12, (see sequence listing).

En ce qui concerne, S. erythraea, la méthode de criblage a permis l'identification d'une séquence de KS (sacery17) identique à celle du KS du module 1 déjà publiée dans Genebank (Numéro d'accès M63677), codant pour la synthétase 1 (DEBS1) du 6- deoxyérythronolide B. Une autre séquence non corrélée à la voie de biosynthèse de l'érythromycine a été identifiée et il s'agit de la séquence SEQ ID N 32. With regard to S. erythraea, the screening method allowed the identification of a sequence of KS (sacery17) identical to that of the KS of module 1 already published in Genebank (access number M63677), coding for the synthetase 1 (DEBS1) of 6-deoxyerythronolide B. Another sequence not correlated to the erythromycin biosynthetic pathway has been identified and it is the sequence SEQ ID N 32.

Conclusion
Une méthode pour analyser la présence de gènes codant pour les PKS du type # par PCR à partir de différents micro-organismes a été mise au point. La structure très conservée du domaine de la keto-synthétase du type # a permis de réaliser une méthode PCR basée sur l'utilisation d'amorces dégénérées biaisées en GC pour le choix des codons. Conclusion
A method for analyzing the presence of genes encoding PCR-type PKSs from different microorganisms has been developed. The highly conserved structure of the keto-synthetase domain of type # has made it possible to carry out a PCR method based on the use of degenerate GC-biased primers for the choice of codons.

Cette approche montre la possibilité d'identifier des gènes ou clusters impliqués dans la voie de biosynthèse des polyketides du type I. This approach shows the possibility of identifying genes or clusters involved in the type I polyketide biosynthetic pathway.

Le clonage de ces gènes permet la création d'une collection qui pourra ensuite être utilisés pour construire des hybrides polyketides. Le même principe peut être appliqué à d'autres classes d'antibiotiques. Cloning these genes allows the creation of a collection that can then be used to build polyketide hybrids. The same principle can be applied to other classes of antibiotics.

Les résultats obtenus ici montrent aussi la présence de gènes pouvant appartenir à des clusters silencieux (SEQ ID N 30 à 32). The results obtained here also show the presence of genes that can belong to silent clusters (SEQ ID N 30 to 32).

La présence de clusters silencieux a été déjà documentée dans S. lividans et leurs expressions sont déclenchées par des régulateurs spécifiques ou pleiotropiques (Horinouchi et al. ;Umeyama et al. 1996) . The presence of silent clusters has already been documented in S. lividans and their expressions are triggered by specific or pleiotropic regulators (Horinouchi et al., Umeyama et al., 1996).

Ces résultats suggèrent que la détection de gènes appartenant à des voies dites silencieuses codent en réalité pour des enzymes actives capable de diriger, en association avec les autres enzymes spécifiques de la voie, les étapes enzymatiques nécessaires pour la synthèse des métabolites secondaires. These results suggest that the detection of genes belonging to so-called silent pathways actually encode active enzymes capable of directing, in association with the other pathway-specific enzymes, the enzymatic steps necessary for the synthesis of secondary metabolites.

Criblage de la banque
Le criblage a été effectué dans les conditions décrites dans la section Matériels et Méthodes en utilisant les couples d'amorces validées à partir de souches productrices. Screening of the bank
The screening was carried out under the conditions described in the Materials and Methods section using the primer pairs validated from producing strains.

En présence du couple d'amorces a1-a2 , la taille des produits PCR obtenus à partir de l'ADN cosmidique extrait de pools de 48 ou 96 clones était d'environ 700 bp, donc en accord avec les résultats attendus. In the presence of the pair of primers a1-a2, the size of the PCR products obtained from the cosmid DNA extracted from pools of 48 or 96 clones was about 700 bp, thus in agreement with the expected results.

L'intensité des bandes obtenues était variable, mais une seule bande d'amplification était présente pour chaque pool d'ADN cible. The intensity of the bands obtained was variable, but only one amplification band was present for each target DNA pool.

Dans ces conditions, 8 groupes d'ADN cible ont été détectés, correspondant à 9 clones positifs après déréplication. Under these conditions, 8 groups of target DNA were detected, corresponding to 9 positive clones after dereplication.

Le criblage effectué avec le second couple d'amorces, b1-b2, a donné des résultats d'amplification moins spécifiques puisque de nombreuses bandes satellites étaient observées à côté de la bande de 700 bp. Néanmoins, 9 groupes d'ADN cible ont été détectés, correspondant à 14 clones positifs après déréplication à partir de ces clones positifs, l'ADN a Screening performed with the second pair of primers, b1-b2, gave less specific amplification results since many satellite bands were observed next to the 700 bp band. Nevertheless, 9 groups of target DNA were detected, corresponding to 14 positive clones after dereplication from these positive clones, the DNA

été extrait pour les étapes de séquençage et de transformation de .S. lividans. was extracted for the sequencing and transformation steps of .S. lividans.

Analyse des cosmides
La digestion des cosmides identifiés par PCR avec l'enzyme Dral, reconnaissant un site riche en AT, libère un fragment supérieur à 23 kb (figure 22). Ceci suggère que la méthode PCR cible préférentiellement l'ADN du sol contenant un haut pourcentage en G+C. Ce résultat est la conséquence de la dégénérescence des amorces utilisées, biaisées en GC pour le choix des codons. Les inserts, comme attendu dans le cas de cosmides, ont une taille supérieure à 23 kb, sauf dans un cas ( clone a9B12), ce qui pourrait traduire une certaine instabilité des cosmides. Cosmid analysis
Digestion of the cosmids identified by PCR with the Dral enzyme, recognizing a site rich in AT, releases a fragment greater than 23 kb (FIG. 22). This suggests that the PCR method preferentially targets soil DNA containing a high percentage of G + C. This result is the consequence of the degeneracy of the primers used, biased in GC for the choice of codons. The inserts, as expected in the case of cosmids, have a size greater than 23 kb, except in one case (a9B12 clone), which could reflect a certain instability of the cosmids.

D'autre part, parmi tous les clones sélectionnés, seulement deux d'entre eux, GS.F1 et GS.G11, ont montré le même profil de restriction indiquant un faible taux de redondance dans la banque. On the other hand, of all the clones selected, only two of them, GS.F1 and GS.G11, showed the same restriction profile indicating a low redundancy rate in the bank.

Les cosmides sélectionnés ont été transférés dans Streptomyces lividans par transformation de protoplastes en présence de PEG 1000. The selected cosmids were transferred to Streptomyces lividans by transformation of protoplasts in the presence of PEG 1000.

L'efficacité de transformation varie entre 30 et 1000 transformants par g d'ADN cosmidique utilisé. The transformation efficiency varies between 30 and 1000 transformants per g of cosmid DNA used.

Séquençage et analyse phylogénétique des gènes PKS 1 du sol
La méthode de PCR mise à point sur les souches pures a été utilisée comme décrite sur les cosmides de la banque et 24 clones ont ainsi été identifiés. Sequencing and Phylogenetic Analysis of PKS 1 Soil Genes
The PCR method developed on the pure strains was used as described on the cosmids of the library and 24 clones were thus identified.

Les produits de PCR d'environ 700 bp obtenus à partir de l'ADN de deux pools (48 clones) et de 8 clones uniques, ont été clonés, après purification sur gel d'agarose , et séquences. Cela a permis l'identification de 11 séquences. PCR products of approximately 700 bp obtained from the DNA of two pools (48 clones) and 8 unique clones were cloned, after purification on agarose gel, and sequenced. This allowed the identification of 11 sequences.

L'alignement des séquences protéiques déduites PKSs 1 du sol avec d'autres PKSs # présentes dans différents micro-organismes (figure 24) montre la présence d'une région très conservée qui correspond à la région consensus du site active de la b-kétoacyl synthétase. The alignment of the PKSs 1 deduced protein sequences of the soil with other PKSs # present in different microorganisms (FIG. 24) shows the presence of a highly conserved region which corresponds to the consensus region of the active site of b-ketoacyl. synthetase.

L'analyse des séquences obtenues avec la méthode " Codonpreference " (Gribskov et al., 1984 ; Bibb et al., 1984) a révélé la présence d'un fort biais dans l'usage des codons riches en G+C dans une seule phase de lecture. Les protéines déduites selon cette phase de lecture montrent une forte similarité avec des KSs du type 1 connus ( programme Blast). En particulier, la similarité entre les séquences de KSs du sol et des KSs du cluster de l'érythromycine est d'environ 53%. Codonpreference sequence analysis (Gribskov et al., 1984, Bibb et al., 1984) revealed a strong bias in the use of G + C rich codons in a single reading phase. The proteins deduced according to this reading phase show a strong similarity with known type 1 KSs (Blast program). In particular, the similarity between the soil KSs and KSs sequences of the erythromycin cluster is about 53%.

Après déréplication d'un pool et identification du clone unique, la séquence du produit PCR obtenu à partir de ce clone est identique à celle du pool ce qui confirme la fiabilité de la méthode utilisée. After dereplication of a pool and identification of the single clone, the sequence of the PCR product obtained from this clone is identical to that of the pool which confirms the reliability of the method used.

L'analyse de la séquence du produit PCR d'un clone a permis l'identification probable de 3 gènes KSI différents. Une de ces séquences (SEQ ID N 34) a une similarité de 98,7% avec la séquence d'un autre pool, suggérant qu'elles codent pour la même enzyme. Les deux autres séquences sont différentes mais fortement homologues. The analysis of the sequence of the PCR product of a clone allowed the probable identification of 3 different KSI genes. One of these sequences (SEQ ID N 34) has a similarity of 98.7% with the sequence of another pool, suggesting that they encode the same enzyme. The other two sequences are different but strongly homologous.

Ici, il est décrit pour la première fois le clonage et l'identification dans une banque d'ADN du sol de voies de biosynthèse de métabolites secondaires contenant des gènes codant des KS du type I. Here, it is described for the first time the cloning and identification in a soil DNA library of biosynthetic pathways of secondary metabolites containing genes encoding type I KS.

Le pourcentage élevé en G+C des séquences du sol suggère qu'elles puissent dériver de génomes ayant un usage des codons similaire à ceux d'actinomycètes. The high G + C percentage of soil sequences suggests that they may be derived from genomes with codon usage similar to those of actinomycetes.

Même si les données disponibles dans la littérature sont réduites, on sait que les gènes codant des PKS du type I sont très diversifiés de par leur organisation physique dans le génome, la taille et le nombre de modules contenus dans chaque gène. Although the data available in the literature are reduced, it is known that genes encoding type I PKS are very diverse in their physical organization in the genome, the size and the number of modules contained in each gene.

La présence de plusieurs domaines provenant d'un seul clone est une confirmation de leur appartenance à des clusters de polyketides assymétriques. Dans un seul cas, deux clones semblent former un contigue puisqu'ils partagent la même séquence pour le domaine KS. The presence of several domains from a single clone is a confirmation of their membership in asymmetric polyketide clusters. In one case, two clones seem to form a contigue since they share the same sequence for the KS domain.

La taille des régions génétiques impliquées dans la synthèse des PKSI varie entre quelques kb pour la pénicilline à environ 120 kb pour la rapamycine. La dimension des inserts cosmidiques peut donc ne pas être suffisante pour l'expression des clusters les plus complexes. The size of the genetic regions involved in PKSI synthesis varies from a few kb for penicillin to about 120 kb for rapamycin. The size of the cosmid inserts may therefore not be sufficient for the expression of the most complex clusters.

Des gènes codant pour des PKSs I, capables de travailler de façon itérative comme les PKS Il et de contrôler la synthèse de polykétides aromatiques, ont été décrits (Jae-Hyuk et al., 1995). L'étude des clusters des PKSs 1 du sol pourrait apporter encore des nouveautés dans ce domaine. Genes coding for PKSs I, able to iteratively work as PKS II and to control the synthesis of aromatic polyketides, have been described (Jae-Hyuk et al., 1995). The study of soil PKS 1 clusters could bring new innovations in this field.

J2 au cc Su w c t- "S CD =. -O "M cara micr ava

g- E C cl *N d or

o j3 ation des prélèvement s différentes expérie ation à l'acridine ora

o Loc dan la c@

J2 to cc Su wc t- "S CD =. -O" M cara micr ava

g- EC cl * N d gold

o j3 ation of samples from different experiments with acridine ora

o Loc dan the c @

<tb> Numéro <SEP> Origine <SEP> Texture <SEP> Quantité <SEP> (%) <SEP> de <SEP> Matière <SEP> pH <SEP> Nombre <SEP> de <SEP> Nombre <SEP> de
<tb> sable <SEP> Limon <SEP> Argile <SEP> organique <SEP> cellules <SEP> avant <SEP> cellules <SEP> après
<tb> (g/kg <SEP> de <SEP> sol <SEP> broyage <SEP> broyage'
<tb> sec) <SEP> a(x109/g <SEP> poids <SEP> (x109/g <SEP> poids
<tb> <tb> Number <SEP> Origin <SEP> Texture <SEP> Quantity <SEP> (%) <SEP> of <SEP> Material <SEP> pH <SEP> Number <SEP> of <SEP> Number <SEP> of
<tb> sand <SEP> Limon <SEP> Clay <SEP> organic <SEP> cells <SEP> before <SEP> cells <SEP> after
<tb> (g / kg <SEP> of <SEP> soil <SEP> grinding <SEP> grinding '
<tb> sec) <SEP> a (x109 / g <SEP> weight <SEP> (x109 / g <SEP> weight
<Tb>

~~~~~ ~~~~~~~~~~~ ~~~~~~~~~~~ ~~~~~~~~~~~~~ ~~~~~~~~ sol sec sol sec

~~~~~ ~~~~~~~~~~~ ~~~~~~~~~~~ ~~~~~~~~~~~~~ ~~~~~~~~ dry soil dry soil

<tb> 1 <SEP> Australie <SEP> Argile <SEP> sablonneuse <SEP> 62 <SEP> 22 <SEP> 16 <SEP> 49,7 <SEP> 5,8 <SEP> 6,5 <SEP> (0,9) <SEP> 2,9 <SEP> (1,3)
<tb> 2 <SEP> Peyrat <SEP> le <SEP> Châ- <SEP> Argile <SEP> sablonneuse <SEP> 61 <SEP> 26 <SEP> 13 <SEP> 48,2 <SEP> 4,9 <SEP> 7,3 <SEP> (0,6)
<tb> 5,4(0,8)teau,France
<tb> 3 <SEP> Côte <SEP> St-André, <SEP> Terreau <SEP> sablonneux <SEP> 50 <SEP> 41 <SEP> 9 <SEP> 40,6 <SEP> 5,6 <SEP> 10,0 <SEP> (0,7)
<tb> France
<tb> 4 <SEP> Chazay <SEP> d'Azergue, <SEP> Terreau <SEP> sablonneux <SEP> 34 <SEP> 47 <SEP> 19 <SEP> 13,9 <SEP> 5,8 <SEP> 7,8(1,1) <SEP> 4,2 <SEP> (0,6)
<tb> France <SEP> argileux
<tb> 5 <SEP> Guadeloupe,France <SEP> Argile <SEP> 27 <SEP> 26 <SEP> 47 <SEP> 17,0 <SEP> 4,8 <SEP> 1,4 <SEP> (0,4) <SEP> 0,5 <SEP> (0,1)
<tb> 6 <SEP> Dombes,France <SEP> Terreau <SEP> sablonneux <SEP> 20 <SEP> 67 <SEP> 13 <SEP> 30,3 <SEP> 4,3 <SEP> 7,5(0,5) <SEP> 5,6(0,9)
<tb> Argileux
<tb>
<tb>
re parenthèses.

(1) ation standard

.0 '(1) "C n=3 ; m <tb> 1 <SEP> Australia <SEP> Clay <SEP> sandy <SEP> 62 <SEP> 22 <SEP> 16 <SEP> 49.7 <SEP> 5.8 <SEP> 6.5 <SEP> ( 0.9) <SEP> 2.9 <SEP> (1.3)
<tb> 2 <SEP> Peyrat <SEP> the <SEP> Châ- <SEP> Clay <SEP> sandy <SEP> 61 <SEP> 26 <SEP> 13 <SEP> 48.2 <SEP> 4.9 <SEP> 7.3 <SEP> (0.6)
<tb> 5.4 (0.8) water, France
<tb> 3 <SEP> Coast <SEP> St-André, <SEP> Soil <SEP> sandy <SEP> 50 <SEP> 41 <SEP> 9 <SEP> 40.6 <SE> 5.6 <SEP> 10.0 <SEP> (0.7)
<tb> France
<tb> 4 <SEP> Chazay <SEP> of Azergue, <SEP> Soil <SEP> sandy <SEP> 34 <SEP> 47 <SEP> 19 <SEP> 13.9 <SEP> 5.8 <SEP> 7.8 (1.1) <SEP> 4.2 <SEP> (0.6)
<tb> France <SEP> clay
<tb> 5 <SEP> Guadeloupe, France <SEP> Clay <SEP> 27 <SEP> 26 <SEP> 47 <SEP> 17.0 <SEP> 4,8 <SEP> 1,4 <SEP> (0, 4) <SEP> 0.5 <SEP> (0.1)
<tb> 6 <SEP> Dombes, France <SEP> Soil <SEP> sandy <SEP> 20 <SEP> 67 <SEP> 13 <SEP> 30.3 <SEP> 4.3 <SEP> 7.5 (0 , 5) <SEP> 5.6 (0.9)
<tb> Argileux
<Tb>
<Tb>
re parentheses.

(1) standard item

.0 '(1) "C n = 3; m

ation PCR et

!E li a c

- 'C'IS #<s U 2 pour sur

10 ABLI isées datio

= -0 0 m 0 ZT3 so

-< Amorces

PCR and

! E li ac

- 'C'IS # <s U 2 for on

10 ABLI dations datio

= -0 0 m 0 ZT3 so

- <Amorces

<tb> Amorce <SEP> ou <SEP> sonde <SEP> Cible <SEP> a) <SEP> Séquence <SEP> (5' <SEP> à <SEP> 3') <SEP> Référence <SEP> n
<tb> FGPS431 <SEP> sonde <SEP> Universelle <SEP> (1392-1406) <SEP> ACGGGCGGTGTGT(A/G)C <SEP> Amann <SEP> et <SEP> al., <SEP> 1995 <SEP>
<tb> FGPS122 <SEP> amorce <SEP> Bactéries <SEP> (6-27) <SEP> GGAGAGTTTGATCATGGCTCAG <SEP> Amann <SEP> et <SEP> al <SEP> , <SEP> 1995
<tb> FGPS350 <SEP> amorce <SEP> Streptosporangium <SEP> (616-635) <SEP> CCTGGAGTTAAGCCCCCAAGC <SEP> Cette <SEP> étude
<tb> FGPS643 <SEP> sonde <SEP> Streptosporangium <SEP> (122-142) <SEP> GTGAGTAACCTGCCCC <SEP> (T/C)GACT <SEP> étude
<tb> R499 <SEP> amorce <SEP> Bacillus <SEP> anthracis <SEP> TTAATTCACTTGCAACTGATGGG <SEP> Patra <SEP> et <SEP> al., <SEP> 1996
<tb> R500 <SEP> amorce <SEP> Bacillus <SEP> anthracis <SEP> AACGATAGCTCCTACATTTGGAG <SEP> Patra <SEP> et <SEP> al., <SEP> 1996
<tb> C501 <SEP> sonde <SEP> Bacillus <SEP> anthracis <SEP> TTGCTGATACGGTATAGAACCTGGC <SEP> Patra <SEP> et <SEP> al., <SEP> 1996
<tb> FGPS516 <SEP> amorce <SEP> S. <SEP> lividans <SEP> OS48.3 <SEP> TCCAGATCCTTGACCCGCAG <SEP> Cette <SEP> étude
<tb> FGPS517 <SEP> amorce <SEP> S <SEP> lividans <SEP> OS48.3 <SEP> CACGACATTGCACTCCACCG <SEP> Cette <SEP> étude
<tb> FGPS518 <SEP> sonde <SEP> S. <SEP> lividans <SEP> Os48.3 <SEP> CCGTGAGCCGGATCAG <SEP> Cette <SEP> étude
<tb>
norces et ocalisées ).

1 i fi) M @s, les ont pa on pu

0 .52 m ra c-l

-c" eses. s res st dé

-0) aren nist

en ; es en é <U

ë- "5 #~ -CD U M l'ARNr 16S de E ces chromosomi cassette conten

cr -CD (0 .(D c- 11 Q) 1 1 -I J U (6 <tb> Primer <SEP> or <SEP> probe <SEP> Target <SEP> a) <SEP> Sequence <SEP> (5 '<SEP> to <SEP>3')<SEP> Reference <SEP> n
<tb> FGPS431 <SEP> probe <SEP> Universal <SEP> (1392-1406) <SEP> ACGGGCGGTGTGT (A / G) <SEP> Amann <SEP> and <SEP> al., <SEP> 1995 <SEP >
<tb> FGPS122 <SEP> primer <SEP> Bacteria <SEP> (6-27) <SEP> GGAGAGTTTGATCATGGCTCAG <SEP> Amann <SEP> and <SEP> al <SEP>, <SEP> 1995
<tb> FGPS350 <SEP> Primer <SEP> Streptosporangium <SEP> (616-635) <SEP> CCTGGAGTTAAGCCCCCAAGC <SEP> This <SEP> Study
<tb> FGPS643 <SEP> Probe <SEP> Streptosporangium <SEP> (122-142) <SEP> GTGAGTAACCTGCCCC <SEP> (T / C) GACT <SEP> Study
<tb> R499 <SEP> primer <SEP> Bacillus <SEP> anthracis <SEP> TTAATTCACTTGCAACTGATGGG <SEP> Patra <SEP> and <SEP> al., <SEP> 1996
<tb> R500 <SEP> primer <SEP> Bacillus <SEP> anthracis <SEP> AACGATAGCTCCTACATTTGGAG <SEP> Patra <SEP> and <SEP> al., <SEP> 1996
<tb> C501 <SEP> probe <SEP> Bacillus <SEP> anthracis <SEP> TTGCTGATACGGTATAGAACCTGGC <SEP> Patra <SEP> and <SEP> al., <SEP> 1996
<tb> FGPS516 <SEP> primer <SEP> S. <SEP> lividans <SEP> OS48.3 <SEP> TCCAGATCCTTGACCCGCAG <SEP> This <SEP> study
<tb> FGPS517 <SEP> primer <SEP> S <SEP> lividans <SEP> OS48.3 <SEP> CACGACATTGCACTCCACCG <SEP> This <SEP> study
<tb> FGPS518 <SEP> probe <SEP> S. <SEP> lividans <SEP> Os48.3 <SEP> CCGTGAGCCGGATCAG <SEP> This <SEP> study
<Tb>
norces and ocalised).

1 i) M @ s, have not been able to

0 .52 m ra cl

-c "eses.

-0) aren nist

in ; are in <U

ë- "5 # ~ -CD UM 16S rRNA these chromosomi cassette contained

cr -CD (0. (Dc- 11 Q) 1 1 -IJU (6

(0 '0 emen iation 5b

#> i -0 -f-I -0 ,d) u> ,j " h 0 -0 . 0 15 -≈ ! 1 - II ; n : e n@

#! -0 < Î3 | H (5 " N n - -ro ' x 'o O Bzz <1 Cq .0 CL P V) #4-> O ns </) o (A =' i ! o ! D5 #*#

.0) - E N

(0 0 0ation 5b

## EQU1 ## where: ## STR2 ##

#! -0 <Î3 | H (5 "N n - -ro 'x' o O Bzz <1 Cq .0 CL PV) # 4-> O ns </) o (A = 'i! O! D5 # * #

.0) - EN

<tb> 1. <SEP> Australie <SEP> 17+/-2 <SEP> 52+/- <SEP> 2 <SEP> 32+/-5 <SEP> 16+/-3 <SEP> 33+/- <SEP> 2 <SEP> 59+/-1 <SEP> 27+/-0
<tb> 2. <SEP> Peyrat <SEP> 29+/-2 <SEP> 58+/-1 <SEP> 40+/-2 <SEP> 29+/-2 <SEP> 18+/3 <SEP> 56+/-1 <SEP> 15+/-1
<tb> 3.Côte <SEP> St-André <SEP> 36+/-7 <SEP> 60+/-6 <SEP> 148+/-10 <SEP> 94+/-7 <SEP> 38+/-6 <SEP> 73+/-5 <SEP> 47+/-6
<tb> 4. <SEP> Chazay <SEP> 9 <SEP> 16 <SEP> ND <SEP> 32 <SEP> 15 <SEP> 15 <SEP> 70
<tb> 6. <SEP> Dombes <SEP> 4+/- <SEP> 2 <SEP> 26+/-3 <SEP> 43+/-1 <SEP> 61+/- <SEP> 66+/-1 <SEP> 160+/-7 <SEP> 102+/- <SEP> 5 <SEP>
<tb>
GPS43 que/

.Q"" iver un +

0 1);

x + r tâche avec tion Ultratura (CH); 5a: Cr

CI) idation @ogéné Cup Ho

E 0 ..0 oc -C 0 en '7 L- U C3 c en v. O Q) C'0 en cenc I (G); + H+

C i D..cu::2: 0 Q) 0 0).- fication par imagerie de
2). <tb> 1. <SEP> Australia <SEP> 17 +/- 2 <SEP> 52 +/- <SEP> 2 <SEP> 32 +/- 5 <SEP> 16 +/- 3 <SEP> 33 + / - <SEP> 2 <SEP> 59 +/- 1 <SEP> 27 +/- 0
<tb> 2. <SEP> Peyrat <SEP> 29 +/- 2 <SEP> 58 +/- 1 <SEP> 40 +/- 2 <SEP> 29 +/- 2 <SEP> 18 + / 3 <SEP > 56 +/- 1 <SEP> 15 +/- 1
<tb> 3.Côte <SEP> St-André <SEP> 36 +/- 7 <SEP> 60 +/- 6 <SEP> 148 +/- 10 <SEP> 94 +/- 7 <SEP> 38 + / -6 <SEP> 73 +/- 5 <SEP> 47 +/- 6
<tb> 4. <SEP> Chazay <SEP> 9 <SEP> 16 <SEP> NS <SEP> 32 <SEP> 15 <SEP> 15 <SEP> 70
<tb> 6. <SEP> Dombes <SEP> 4 +/- <SEP> 2 <SEP> 26 +/- 3 <SEP> 43 +/- 1 <SEP> 61 +/- <SEP> 66 +/- 1 <SEP> 160 +/- 7 <SEP> 102 +/- <SEP> 5 <SEP>
<Tb>
GPS43 that /

.Q "" iver a +

0 1);

x + r task with Ultratura (CH); 5a: Cr

CI) idation @ ogen Cup Ho

## STR5 ## in which OQ) C'0 in cenc I (G); + H +

C i D..cu :: 2: 0 Q) 0 0) .- imaging by
2).

In traitement ; 2:broyage H + sonication Microtip tique. Voir aussi figure Non déterminé.

:J:J cuZ C CD " + p a Qua@ (table b1: Au 4a: G c ND : In treatment; 2: grinding H + Microtiptic sonication. See also figure Not determined.

: J: J cuZ C CD "+ pa Qua @ (table b1: At 4a: G c ND:

sation

I 0 (J 0 3 s la raits

C A-J qe m 0 bleau lisées s ADN

-0 Amorces et sondes moléculair

tion

I 0 (J 0 3 s the raits

C AJ q e m a lle s s DNA

-0 Molecular primers and probes

<tb> Cible <SEP> Séquence <SEP> (5' <SEP> - <SEP> 3') <SEP> Position <SEP> a
<tb> (amorce <SEP> ou <SEP> sonde)
<tb> FGPS <SEP> 612 <SEP> Eubactéries <SEP> (amorce) <SEP> C(C/T)AACT(T/C/A)CGTGCCAGCAGCC <SEP> 506 <SEP> - <SEP> 525 <SEP>
<tb> FGPS <SEP> 669 <SEP> Eubactéries <SEP> (amorce) <SEP> GACGTC(A/G)TCCCC(A/C)CCTTCCTC <SEP> 1174 <SEP> - <SEP> 1194 <SEP>
<tb> FGPS <SEP> 618 <SEP> Eubactéries <SEP> (sonde) <SEP> ATGG(T/C)TGTCGTCAGCTCG <SEP> 1056 <SEP> - <SEP> 1073 <SEP>
<tb> FGPS <SEP> 614 <SEP> a-Protéobactéries <SEP> (sonde) <SEP> GTGTAGAGGTGAAATTCGTAG <SEP> 683 <SEP> - <SEP> 703 <SEP>
<tb> FGPS <SEP> 615 <SEP> b-Protéobactéries <SEP> (sonde) <SEP> CGGTGGATGATGTGGATT <SEP> 939 <SEP> - <SEP> 956 <SEP>
<tb> FGPS <SEP> 616 <SEP> g-Protéobactéries <SEP> (sonde) <SEP> AGGTTAAAACTCAAATGA <SEP> 900 <SEP> - <SEP> 917 <SEP>
<tb> FGPS <SEP> 621 <SEP> Gram <SEP> plus <SEP> à <SEP> bas <SEP> GC% <SEP> (sonde) <SEP> ATACGTAGGTGGCAAGCG <SEP> 532 <SEP> - <SEP> 549
<tb> FGPS <SEP> 617 <SEP> Actinomycètes <SEP> (sonde) <SEP> GCCGGGGTCAACTCGGAGG <SEP> 1159 <SEP> - <SEP> 1149
<tb> FGPS <SEP> 680 <SEP> Streptomycètes <SEP> (sonde) <SEP> TGAGTCCCCA(A/C/T)C(T/A)CCCCG <SEP> 1132 <SEP> - <SEP> 1149 <SEP>
<tb> FGPS <SEP> 619 <SEP> Streptosporangium <SEP> (sonde) <SEP> GCTTGGGGCTTAACTCCAGG <SEP> 609 <SEP> - <SEP> 628 <SEP>
<tb>
ia col

(1) -c RNr16S d'Esc

a) c: gè

JU t- :3 a : position s <tb> Target <SEP> Sequence <SEP> (5 '<SEP> - <SEP>3')<SEP> Position <SEP> a
<tb> (primer <SEP> or <SEP> probe)
<tb> FGPS <SEP> 612 <SEP> Eubacteria <SEP> (primer) <SEP> C (C / T) AACT (T / C / A) CGTGCCAGCAGCC <SEP> 506 <SEP> - <SEP> 525 <SEP >
<tb> FGPS <SEP> 669 <SEP> Eubacteria <SEP> (primer) <SEP> GACGTC (A / G) TCCCC (A / C) CCTTCCTC <SEQ> 1174 <SEP> - <SEP> 1194 <SEP>
<tb> FGPS <SEP> 618 <SEP> Eubacteria <SEP> (probe) <SEP> ATGG (T / C) TGTCGTCAGCTCG <SEP> 1056 <SEP> - <SEP> 1073 <SEP>
<tb> FGPS <SEP> 614 <SEP> α-Proteobacteria <SEP> (probe) <SEP> GTGTAGAGGTGAAATTCGTAG <SEP> 683 <SEP> - <SEP> 703 <SEP>
<tb> FGPS <SEP> 615 <SEP> b-Proteobacteria <SEP> (probe) <SEP> CGGTGGATGATGTGGATT <SEP> 939 <SEP> - <SEP> 956 <SEP>
<tb> FGPS <SEP> 616 <SEP> g-Proteobacteria <SEP> (probe) <SEP> AGGTTAAAACTCAAATGA <SEP> 900 <SEP> - <SEP> 917 <SEP>
<tb> FGPS <SEP> 621 <SEP> Gram <SEP> plus <SEP> to <SEP> down <SEP> GC% <SEP> (probe) <SEP> ATACGTAGGTGGCAAGCG <SEP> 532 <SEP> - <SEP> 549
<tb> FGPS <SEP> 617 <SEP> Actinomycetes <SEP> (probe) <SEP> GCCGGGGTCAACTCGGAGG <SEP> 1159 <SEP> - <SEP> 1149
<tb> FGPS <SEP> 680 <SEP> Streptomycetes <SEP> (probe) <SEP> TGAGTCCCCA (A / C / T) C (T / A) CCCCG <SEP> 1132 <SEP> - <SEP> 1149 <SEP >
<tb> FGPS <SEP> 619 <SEP> Streptosporangium <SEP> (probe) <SEP> GCTTGGGGCTTAACTCCAGG <SEP> 609 <SEP> - <SEP> 628 <SEP>
<Tb>
collar

(1) -c RNr16S of Esc

a) c: gè

JU t-: 3 a: position s

ADN extra %,

'a d Mi-.Q) -<D 3 +> 111 2 +* N ci ent de N ne solut ation su xtrait

= 0) N M M 2: -IIS S 5 g S 3 1*5-=: bactérien chantillon persion et ries extraite

=5 t! 0 **#> C 3 lif CI) s o

(0 Effi

Extra DNA%,

'ad Mi-.Q) - <D 3 +> 111 2 + * N ce ent of N solut ation su xtra

= 0) NMM 2: -IS S 5 g S 3 1 * 5- =: bacterial sample persion and extracted rays

= 5 t! 0 ** #> C 3 lif CI) so

(0 Effi

<tb> Microflore <SEP> totale <SEP> Microflore <SEP> cultivable <SEP> Actinomycètes <SEP> Lyse <SEP> directe <SEP> Lyse <SEP> en <SEP> bloc
<tb> bactéries/g <SEP> sol <SEP> sec <SEP> cfu <SEP> /g <SEP> sol <SEP> sec <SEP> cultivables <SEP> d'agarose <SEP> d,e <SEP>
<tb> cfu/g <SEP> sol <SEP> sec <SEP> ng <SEP> ADN/ <SEP> g <SEP> sol <SEP> ng <SEP> ADN/ <SEP> g <SEP> sol
<tb> sec <SEP> sec
<tb> Sans <SEP> incubation
<tb> Suspension <SEP> de <SEP> sol <SEP> 1 <SEP> 3 <SEP> 109 <SEP> (~ <SEP> 0 <SEP> 1) <SEP> 6.9 <SEP> 106 <SEP> (~ <SEP> 0 <SEP> 2) <SEP> 8 <SEP> 6 <SEP> 106 <SEP> (~ <SEP> 1 <SEP> 2)
<tb> Extrait <SEP> cellulaire <SEP> 1.9 <SEP> 108 <SEP> (~ <SEP> 0. <SEP> 2) <SEP> 4.1 <SEP> 106 <SEP> (~ <SEP> 1.5) <SEP> 2.5 <SEP> 106 <SEP> (~ <SEP> 0 <SEP> 7) <SEP> 333 <SEP> (~ <SEP> 35) <SEP> 221 <SEP> (~ <SEP> 70)
<tb> Efficacité <SEP> d'extraction
<tb> 15% <SEP> 59% <SEP> 38%
<tb> Avec <SEP> incubation <SEP> dans <SEP> extrait <SEP> de <SEP> levure <SEP> 6%
<tb> Suspension <SEP> de <SEP> sol <SEP> 1 <SEP> 2 <SEP> 109( <SEP> 0. <SEP> 1) <SEP> 7.6 <SEP> 107(~ <SEP> 1.1) <SEP> 6.6 <SEP> 107(~0.4)
<tb> Extrait <SEP> cellulaire
<tb> 1 <SEP> 6 <SEP> 108( <SEP> 0 <SEP> 3) <SEP> 5.3 <SEP> 106(~ <SEP> 14) <SEP> 3. <SEP> 7 <SEP> 106(~ <SEP> 0.7) <SEP> 344 <SEP> (~ <SEP> 30) <SEP> 341 <SEP> (~ <SEP> 67)
<tb> Efficacité <SEP> d'extraction <SEP> 13% <SEP> 7% <SEP> 5%
<tb>
ure e

> CD "5 U
Z uti

S a I I 1 I -(0 -CD r ra ion

a 1 (0 a5 w (C e - # CL -JE U o .U U7 (0,(D oloratio ase-Soj gar, apr@ e sur ge

.< -8 " 48 ' 48 Q.r -I- CL ffl d) j) *Q) C0 (D'0'0 ) (D #11 cn cn cn 8.1.1 I mil xtr

z (6 U) #<.2 U N N ô -U E m >(D ombrer DS 0,0@ quantité quantifi@

(D G) m m o # CD ' ' a e <tb> Microflora <SEP> total <SEP> Microflora <SEP> cultivable <SEP> Actinomycetes <SEP> Lyse <SEP> direct <SEP> Lyse <SEP> in <SEP> block
<tb> bacteria / g <SEP> soil <SEP> dry <SEP> cfu <SEP> / g <SEP> soil <SEP> sec <SEP> cultivable <SEP> agarose <SEP> d, e <SEP>
<tb> cfu / g <SEP> soil <SEP> dry <SEP> ng <SEP> DNA / <SEP> g <SEP> soil <SEP> ng <SEP> DNA / <SEP> g <SEP> soil
<tb> dry <SEP> sec
<tb> Without <SEP> incubation
<tb> Suspension <SEP> of <SEP> sol <SEP> 1 <SEP> 3 <SEP> 109 <SEP> (~ <SEP> 0 <SEP> 1) <SEP> 6.9 <SEP> 106 <SEP> ( ~ <SEP> 0 <SEP> 2) <SEP> 8 <SEP> 6 <SEP> 106 <SEP> (~ <SEP> 1 <SEP> 2)
<tb> Extract <SEP> Cellular <SEP> 1.9 <SEP> 108 <SEP> (~ <SEP> 0. <SEP> 2) <SEP> 4.1 <SEP> 106 <SEP> (~ <SEP> 1.5) <SEP> 2.5 <SEP> 106 <SEP> (~ <SEP> 0 <SEP> 7) <SEP> 333 <SEP> (~ <SEP> 35) <SEP> 221 <SEP> (~ <SEP> 70)
<tb> Extraction Efficiency <SEP>
<tb> 15% <SEP> 59% <SEP> 38%
<tb> With <SEP> incubation <SEP> in <SEP> extract <SEP> of <SEP> yeast <SEP> 6%
<tb> Suspension <SEP> of <SEP> sol <SEP> 1 <SEP> 2 <SEP> 109 (<SEP> 0. <SEP> 1) <SEP> 7.6 <SEP> 107 (~ <SEP> 1.1) <SEP> 6.6 <SEP> 107 (~ 0.4)
<tb> Cellular Extract <SEP>
<tb> 1 <SEP> 6 <SEP> 108 (<SEP> 0 <SEP> 3) <SEP> 5.3 <SEP> 106 (~ <SEP> 14) <SEP> 3. <SEP> 7 <SEP> 106 (~ <SEP> 0.7) <SEP> 344 <SEP> (~ <SEP> 30) <SEP> 341 <SEP> (~ <SEP> 67)
<tb> Effectiveness <SEP> of extraction <SEP> 13% <SEP> 7% <SEP> 5%
<Tb>
ure e

> CD "5 U
Z uti

S a II 1 I - (0-CD rra ion

a 1 (0 a5 w (C e - # CL -JE U o. U U7 (0, (D oloratio ase-Soj gar, after te w ge

<-8 "48 '48 Qr -I- CL ffl d) j) * Q) C0 (From 0'0) (D # 11 cn cn cn 8.1.1 I mil xtr

z (6 U) # <. 2 UNN ô -UE m> (D shader DS 0,0 @ quantified quantity @

(DG) mmo # CD '' ae

on avec la

03 "C nal d'hyb

O) , et g nomycètes

" < Se ur proportion bas GC% et
00%.

"C 0 #c E <a <DO <D .9 2 .> 3-505 Me .0.....0)"" J--Q)(/)- .::; 0 ,ft O- -0) 1! des es P prod

'0 a> .2 w "2 B 0) r risa asse so

S Ca so

with the

03 "C nal hyb

O), and g nomycetes

"In proportion to low GC% and
00%.

"C 0 #c E <a <DO <D .9 2 .> 3-505 Me .0 ..... 0)""J - Q) (/) -. ::; 0, ft O- -0) 1! Es es prod

'0 a> .2 w "2 B 0) r risa asse so

S Ca so

<tb> a- <SEP> b- <SEP> g- <SEP> Gram+
<tb> Protéobactérie <SEP> Protéobactérie <SEP> Protéobactérie <SEP> bas <SEP> GC% <SEP> Actinomycètes <SEP> Streptomycètes
<tb> s <SEP> s <SEP> s
<tb> Extraction <SEP> directe <SEP> a <SEP> 7.7 <SEP> % <SEP> (1.4) <SEP> 5.3 <SEP> % <SEP> ( <SEP> 0.5) <SEP> 3.3 <SEP> % <SEP> ( <SEP> 0.9) <SEP> 3.1 <SEP> % <SEP> (~ <SEP> 1.7) <SEP> 14.7 <SEP> % <SEP> (0.6) <SEP> 0.8 <SEP> % <SEP> (0.1)
<tb> Extraction <SEP> indirecte
<tb> Lyse <SEP> + <SEP> CsCI <SEP> 10.9% <SEP> (1.4) <SEP> 6.4% <SEP> (1.4) <SEP> 14.3% <SEP> (1.4) <SEP> 7.9% <SEP> (1.4) <SEP> 8.5% <SEP> (1.4) <SEP> 3.0% <SEP> (1.4) <SEP>
<tb> Lyse <SEP> en <SEP> bloc <SEP> 2.9% <SEP> (1.4) <SEP> 5.4 <SEP> % <SEP> (~ <SEP> 1.4) <SEP> 11.1 <SEP> %(1.4) <SEP> 8.0% <SEP> (1.4) <SEP> 11.3% <SEP> (1.4) <SEP> 2.6% <SEP> (1.4) <SEP>
<tb> Lyse <SEP> en <SEP> bloc <SEP> 6.3% <SEP> (1.4) <SEP> 7.5% <SEP> (1.4) <SEP> 17.0% <SEP> (1.4) <SEP> 18.1 <SEP> %(1.4) <SEP> 19.4% <SEP> (1.4) <SEP> 4.6% <SEP> (1.4) <SEP>
<tb> + <SEP> incubation <SEP> YE
<tb>
crit dans article Frostega

>CD ctio

(0 .4#1 X (U fuge (protoco

cn gstene, à force c

-ë i~r , ex

ns

JE vs m LU m 0) >- <tb> a- <SEP> b- <SEP> g- <SEP> Gram +
<tb> Proteobacterium <SEP> Proteobacterium <SEP> Proteobacterium <SEP> Low <SEP> GC% <SEP> Actinomycetes <SEP> Streptomycetes
<tb> s <SEP> s <SEP> s
<tb> Direct <SEP> Extraction <SEP> a <SEP> 7.7 <SEP>% <SEP> (1.4) <SEP> 5.3 <SEP>% <SEP>(<SEP> 0.5) <SEP> 3.3 <SEP> % <SEP>(<SEP> 0.9) <SEP> 3.1 <SEP>% <SEP> (~ <SEP> 1.7) <SEP> 14.7 <SEP>% <SEP> (0.6) <SEP> 0.8 <SEP>% <SEP> (0.1)
<tb> Extraction <SEP> indirect
<tb> Lysis <SEP> + <SEP> CsCI <SEP> 10.9% <SEP> (1.4) <SEP> 6.4% <SEP> (1.4) <SEP> 14.3% <SEP> (1.4) <SEP> 7.9% <SEP> (1.4) <SEP> 8.5% <SEP> (1.4) <SEP> 3.0% <SEP> (1.4) <SEP>
<tb> Lysis <SEP> in <SEP> block <SEP> 2.9% <SEP> (1.4) <SEP> 5.4 <SEP>% <SEP> (~ <SEP> 1.4) <SEP> 11.1 <SEP>% ( 1.4) <SEP> 8.0% <SEP> (1.4) <SEP> 11.3% <SEP> (1.4) <SEP> 2.6% <SEP> (1.4) <SEP>
<tb> Lyse <SEP> in <SEP> block <SEP> 6.3% <SEP> (1.4) <SEP> 7.5% <SEP> (1.4) <SEP> 17.0% <SEP> (1.4) <SEP> 18.1 <SEP>% (1.4) <SEP> 19.4% <SEP> (1.4) <SEP> 4.6% <SEP> (1.4) <SEP>
<tb> + <SEP> incubation <SEP> YE
<Tb>
written in Frostega article

> CD ctio

(0 .4 # 1 X (U fuge (protoco

cn gstene, to force c

-ex, ex

ns

I vs m LU m 0)> -

banque cosmidique

tu c s o
0 Tableau 7 : Nr16S contenu
Q

U) quenc

-0 (0 Q) '0 Diver

cosmic bank

you are
0 Table 7: Nr16S content
Q

U) quenc

-0 (0 Q) '0 Diver

<tb> N <SEP> pool <SEP> Voisin <SEP> identifié <SEP> % <SEP> de <SEP> Voisin <SEP> le <SEP> plus <SEP> proche <SEP> % <SEP> de
<tb> (clone <SEP> n <SEP> le <SEP> plus <SEP> proche <SEP> similarité <SEP> (classification, <SEP> référence) <SEP> similarité
<tb> a-Protéobactéries
<tb> a24. <SEP> 1 <SEP> (2) <SEP> Azospirillum <SEP> brasilense <SEP> 97.7%
<tb> a4-a6-a7 <SEP> (7) <SEP> Azospirillum <SEP> brasilense <SEP> 95.4%
<tb> a4-a6-a7 <SEP> (23) <SEP> Azospirillum <SEP> brasilense <SEP> 88. <SEP> 9% <SEP> Str <SEP> L-87 <SEP> (a-proteobactérie)' <SEP> 89. <SEP> 8%
<tb> a52-a53-a5 <SEP> (15) <SEP> Azospirillum <SEP> lipoferum <SEP> 97.6%
<tb> a49-a50-a51 <SEP> (22) <SEP> Agrobacterium <SEP> tumefaciens <SEP> 95.0% <SEP> Clone <SEP> JN15d <SEP> (non <SEP> publié) <SEP> 95.5%
<tb> a49-a50-a51 <SEP> (11) <SEP> Rhizobium <SEP> sp <SEP> 99.7%
<tb> a4-a6-a7 <SEP> (14) <SEP> Rhizobium <SEP> sp <SEP> 99.7%
<tb> a30-a31-a32 <SEP> (7) <SEP> Bradyrhizobium <SEP> japonicum <SEP> 99.4%
<tb> a19-a20-a26 <SEP> (5) <SEP> Bradyrhizobium <SEP> genosp <SEP> 93 <SEP> 3% <SEP> Clone <SEP> DA122 <SEP> (non <SEP> publié) <SEP> 95.9%
<tb> a37-a38-a39 <SEP> (6) <SEP> Mesorhizobium <SEP> sp. <SEP> 98.9 <SEP> % <SEP>
<tb> a19-a20-a26 <SEP> (9) <SEP> Bradyrhizobium <SEP> sp <SEP> 90.2% <SEP> CloneS-26(aProtéobactérie)2 <SEP> 95.9%
<tb> a46-a47-a48 <SEP> (14) <SEP> Phyllobacterium <SEP> rubiacearum <SEP> 97.6 <SEP> % <SEP>
<tb> <tb> N <SEP> pool <SEP> Neighbor <SEP> identified <SEP>% <SEP> from <SEP> Neighbor <SEP><SEP> plus <SEP> close <SEP>% <SEP> from
<tb> (clone <SEP> n <SEP> the <SEP> more <SEP> close <SEP> similarity <SEP> (classification, <SEP> reference) <SEP> similarity
<tb> a-Proteobacteria
<tb> a24. <SEP> 1 <SEP> (2) <SEP> Azospirillum <SEP> brasilense <SEP> 97.7%
<tb> a4-a6-a7 <SEP> (7) <SEP> Azospirillum <SEP> brasilense <SEP> 95.4%
<tb> a4-a6-a7 <SEP> (23) <SEP> Azospirillum <SEP> brasilense <SEP> 88. <SEP> 9% <SEP> Str <SEP> L-87 <SEP> (a-proteobacterium) '<SEP> 89. <SEP> 8%
<tb> a52-a53-a5 <SEP> (15) <SEP> Azospirillum <SEP> Lipoferum <SEP> 97.6%
<tb> a49-a50-a51 <SEP> (22) <SEP> Agrobacterium <SEP> tumefaciens <SEP> 95.0% <SEP> Clone <SEP> JN15d <SEP> (no <SEP> published) <SEP> 95.5%
<tb> a49-a50-a51 <SEP> (11) <SEP> Rhizobium <SEP> sp <SEP> 99.7%
<tb> a4-a6-a7 <SEP> (14) <SEP> Rhizobium <SEP> sp <SEP> 99.7%
<tb> a30-a31-a32 <SEP> (7) <SEP> Bradyrhizobium <SEP> japonicum <SEP> 99.4%
<tb> a19-a20-a26 <SEP> (5) <SEP> Bradyrhizobium <SEP> genosp <SEP> 93 <SEP> 3% <SEP> Clone <SEP> DA122 <SEP> (no <SEP> published) <SEP> 95.9%
<tb> a37-a38-a39 <SEP> (6) <SEP> Mesorhizobium <SEP> sp. <SEP> 98.9 <SEP>% <SEP>
<tb> a19-a20-a26 <SEP> (9) <SEP> Bradyrhizobium <SEP> sp <SEP> 90.2% <SEP> CloneS-26 (aProteobacterium) 2 <SEP> 95.9%
<tb> a46-a47-a48 <SEP> (14) <SEP> Phyllobacterium <SEP> rubiacearum <SEP> 97.6 <SEP>% <SEP>
<Tb>

ns

l'a *0 e 1)
16S contenues ique

::s Z 0 0 E LEAU 7 ( @ces d'AI @que cos

mI:l: c4 TA Diversité des séque la b

ns

the * 0 e 1)
16S contained

:: s Z 0 0 E LEAU 7 (@ces of AI @que cos

mI: l: c4 TA Diversity of seques

<tb> N <SEP> pool <SEP> Voisin <SEP> identifié <SEP> % <SEP> de <SEP> Voisin <SEP> le <SEP> plus <SEP> proche <SEP> % <SEP> de
<tb> (clone <SEP> n ) <SEP> le <SEP> plus <SEP> proche <SEP> similarité <SEP> (classification, <SEP> référence) <SEP> similarité
<tb> a49-a50-a51 <SEP> (1) <SEP> Caulobacter <SEP> henricii <SEP> 97. <SEP> 0%
<tb> a1-a2-a3 <SEP> (13) <SEP> Caulobacter <SEP> sp <SEP> 96.3%
<tb> a52-a53-a5 <SEP> (8) <SEP> Mesorhyzobium <SEP> mediterraneum <SEP> 92.1% <SEP> Clone <SEP> DA122 <SEP> (non <SEP> publié) <SEP> 94. <SEP> 8%
<tb> a34-a35-a36 <SEP> (3) <SEP> Rhodobium <SEP> orientis <SEP> 91. <SEP> 8% <SEP> Clone <SEP> (non <SEP> publié) <SEP> 95.1%
<tb> a1-a2-a3 <SEP> (4) <SEP> Sphingomonas <SEP> sp. <SEP> 94. <SEP> 7% <SEP> Clone <SEP> PAD23 <SEP> (non <SEP> publié) <SEP> 95.1%
<tb> a8-a9-a10 <SEP> (13) <SEP> Sphingomonas <SEP> sp. <SEP> 94.0%
<tb> [gamma]-Protéobactéries
<tb> a40-a41-a42 <SEP> (13) <SEP> Pseudomonas <SEP> sp <SEP> 98. <SEP> 9% <SEP> clone <SEP> G26(g-Protéobactérie)3 <SEP> 99. <SEP> 7%
<tb> a15-a16-a17 <SEP> (12) <SEP> Lysobacter <SEP> antibioticus <SEP> 94. <SEP> 4% <SEP> clone <SEP> vadinHA77(g-Protéo) <SEP> 93. <SEP> 6%
<tb> a15-a16-a17 <SEP> (5) <SEP> Xanthomonas <SEP> sp <SEP> 93. <SEP> 4% <SEP> clone <SEP> vadinHA77(g-Protéo)4 <SEP> 94. <SEP> 6%
<tb> a19-a20-a26 <SEP> (13) <SEP> Luteimonas <SEP> mephitis <SEP> 92. <SEP> 9% <SEP> Souche <SEP> rJ15 <SEP> (non <SEP> publié) <SEP> 93. <SEP> 5%
<tb> a46-a47-a48 <SEP> (6) <SEP> Methylobacter <SEP> whittenburyi <SEP> 88. <SEP> 3% <SEP> soil <SEP> clone <SEP> S-43(g-Protéo) <SEP> 2 <SEP> 88.9 <SEP> % <SEP>
<tb> a11-a12-a13 <SEP> (11) <SEP> Methylobacterwhittenburyi <SEP> 88. <SEP> 3% <SEP> soil <SEP> clone <SEP> S-43(g-Protéo) <SEP> 88. <SEP> 9%
<tb> <tb> N <SEP> pool <SEP> Neighbor <SEP> identified <SEP>% <SEP> from <SEP> Neighbor <SEP><SEP> plus <SEP> close <SEP>% <SEP> from
<tb> (clone <SEP> n) <SEP> the <SEP> plus <SEP> close <SEP> similarity <SEP> (classification, <SEP> reference) <SEP> similarity
<tb> a49-a50-a51 <SEP> (1) <SEP> Caulobacter <SEP> henricii <SEP> 97. <SEP> 0%
<tb> a1-a2-a3 <SEP> (13) <SEP> Caulobacter <SEP> sp <SEP> 96.3%
<tb> a52-a53-a5 <SEP> (8) <SEP> Mesorhyzobium <SEP> mediterranean <SEP> 92.1% <SEP> Clone <SEP> DA122 <SEP> (no <SEP> published) <SEP> 94. <SEP> 8%
<tb> a34-a35-a36 <SEP> (3) <SEP> Rhodobium <SEP> orientis <SEP> 91. <SEP> 8% <SEP> Clone <SEP> (no <SEP> published) <SEP> 95.1 %
<tb> a1-a2-a3 <SEP> (4) <SEP> Sphingomonas <SEP> sp. <SEP> 94. <SEP> 7% <SEP> Clone <SEP> PAD23 <SEP> (no <SEP> released) <SEP> 95.1%
<tb> a8-a9-a10 <SEP> (13) <SEP> Sphingomonas <SEP> sp. <SEP> 94.0%
<tb> [gamma] -Proteobacteria
<tb> a40-a41-a42 <SEP> (13) <SEP> Pseudomonas <SEP> sp <SEP> 98. <SEP> 9% <SEP> clone <SEP> G26 (g-Proteobacterium) 3 <SEP> 99 <SEP> 7%
<tb> a15-a16-a17 <SEP> (12) <SEP> Lysobacter <SEP> antibioticus <SEP> 94. <SEP> 4% <SEP> clone <SEP> vadinHA77 (g-Proteo) <SEP> 93. <SEP> 6%
<tb> a15-a16-a17 <SEP> (5) <SEP> Xanthomonas <SEP> sp <SEP> 93. <SEP> 4% <SEP> clone <SEP> vadinHA77 (g-Proteo) 4 <SEP> 94 <SEP> 6%
<tb> a19-a20-a26 <SEP> (13) <SEP> Luteimonas <SEP> mephitis <SEP> 92. <SEP> 9% <SEP> Strain <SEP> rJ15 <SEP> (no <SEP> released) <SEP> 93. <SEP> 5%
<tb> a46-a47-a48 <SEP> (6) <SEP> Methylobacter <SEP> whittenburyi <SEP> 88. <SEP> 3% <SEP> soil <SEP> clone <SEP> S-43 (g-Proteo ) <SEP> 2 <SEP> 88.9 <SEP>% <SEP>
<tb> a11-a12-a13 <SEP> (11) <SEP> Methylobacterwhittenburyi <SEP> 88. <SEP> 3% <SEP> soil <SEP> clone <SEP> S-43 (g-Proteo) <SEP> 88. <SEP> 9%
<Tb>

ns

ra *0 ) contenues de

C-4 CI) = (1) (t) CX I: T3 ;:] c E 0 rn S < es ue

0 M 03 0 (0 <3 z
T s séq la

-0 Diversité

ns

ra * 0) contained in

C-4 CI) = (1) (t) CX I: T3;:] c E 0 rn S <es ue

0 M 03 0 (0 <3 z
T s seq la

-0 Diversity

<tb> N <SEP> pool <SEP> Voisin <SEP> identifié <SEP> % <SEP> de <SEP> Voisin <SEP> le <SEP> plus <SEP> proche <SEP> % <SEP> de
<tb> (clone <SEP> n <SEP> le <SEP> plus <SEP> proche <SEP> similarité <SEP> (classification, <SEP> référence) <SEP> similarité
<tb> a34-a35-a36 <SEP> (5) <SEP> Methylococcus <SEP> capsulatus <SEP> 84.9% <SEP> soil <SEP> clone <SEP> S-12 <SEP> (d-Protéo) <SEP> 85 <SEP> 6% <SEP>
<tb> a43-a44-a45 <SEP> (10) <SEP> Legionella <SEP> birminghamensis <SEP> 88.9%
<tb> a8-a9-a10 <SEP> (2) <SEP> Lamprocystis <SEP> roseopersicina <SEP> 87. <SEP> 5% <SEP> Clone <SEP> 2-100C14 <SEP> (non <SEP> publié) <SEP> 95.1%
<tb> p-Protéobactéries
<tb> a27-a28-a29 <SEP> (5) <SEP> Rhodocyclus <SEP> tenuis <SEP> 90. <SEP> 2% <SEP> Clone <SEP> OPB37 <SEP> (b-protéo)5 <SEP> 91%
<tb> 8-Protéobactéries
<tb> a8-a9-a10 <SEP> (18) <SEP> Nannocystis <SEP> exedens <SEP> 92.0%
<tb> a11-a12-a13 <SEP> (5) <SEP> Geobacter <SEP> sulfurreducens <SEP> 91.5%
<tb> a27-a28-a29 <SEP> (8) <SEP> Desulfoacinum <SEP> infernum <SEP> 88. <SEP> 4% <SEP> Clone <SEP> S-31 <SEP> (d-Protéo) <SEP> 89. <SEP> 1 <SEP> %
<tb> a40-a41-a42 <SEP> (6) <SEP> Desulfivibrio <SEP> aminophilus <SEP> 85. <SEP> 3% <SEP> Clone <SEP> S-34 <SEP> (d-Protéo) <SEP> 2 <SEP> 86.2%
<tb> G+ <SEP> bas <SEP> GC%
<tb> a23.1 <SEP> Kurthia <SEP> zopfii <SEP> 97. <SEP> 3%
<tb> a25.1 <SEP> Kurthia <SEP> zopfii <SEP> 97.2%
<tb> a18.1 <SEP> (22) <SEP> Kurthia <SEP> gipsonii <SEP> 94.4% <SEP> G+ <SEP> bas <SEP> GC% <SEP> non <SEP> identifié <SEP> RS19 <SEP> (non <SEP> publié) <SEP> 94.8%
<tb> <tb> N <SEP> pool <SEP> Neighbor <SEP> identified <SEP>% <SEP> from <SEP> Neighbor <SEP><SEP> plus <SEP> close <SEP>% <SEP> from
<tb> (clone <SEP> n <SEP> the <SEP> more <SEP> close <SEP> similarity <SEP> (classification, <SEP> reference) <SEP> similarity
<tb> a34-a35-a36 <SEP> (5) <SEP> Methylococcus <SEP> capsulatus <SEP> 84.9% <SEP> soil <SEP> clone <SEP> S-12 <SEP> (d-Proteo) <SEP> 85 <SEP> 6% <SEP>
<tb> a43-a44-a45 <SEP> (10) <SEP> Legionella <SEP> birminghamensis <SEP> 88.9%
<tb> a8-a9-a10 <SEP> (2) <SEP> Lamprocystis <SEP> roseopersicina <SEP> 87. <SEP> 5% <SEP> Clone <SEP> 2-100C14 <SEP> (no <SEP> published) <SEP> 95.1%
<tb> p-Proteobacteria
<tb> a27-a28-a29 <SEP> (5) <SEP> Rhodocyclus <SEP> tenuis <SEP> 90. <SEP> 2% <SEP> Clone <SEP> OPB37 <SEP> (b-proteo) 5 <SEP> 91%
<tb> 8-Proteobacteria
<tb> a8-a9-a10 <SEP> (18) <SEP> Nannocystis <SEP> exedens <SEP> 92.0%
<tb> a11-a12-a13 <SEP> (5) <SEP> Geobacter <SEP> sulfurreducens <SEP> 91.5%
<tb> a27-a28-a29 <SEP> (8) <SEP> Desulfoacinum <SEP> infernum <SEP> 88. <SEP> 4% <SEP> Clone <SEP> S-31 <SEP> (d-Proteo) <SEP> 89. <SEP> 1 <SEP>%
<tb> a40-a41-a42 <SEP> (6) <SEP> Desulfivibrio <SEP> aminophilus <SEP> 85. <SEP> 3% <SEP> Clone <SEP> S-34 <SEP> (d-Proteo) <SEP> 2 <SEP> 86.2%
<tb> G + <SEP> low <SEP> GC%
<tb> a23.1 <SEP> Kurthia <SEP> zopfii <SEP> 97. <SEP> 3%
<tb> a25.1 <SEP> Kurthia <SEP> zopfii <SEP> 97.2%
<tb> a18.1 <SEP> (22) <SEP> Kurthia <SEP> gipsonii <SEP> 94.4% <SEP> G + <SEP> low <SEP> GC% <SEP> no <SEP> identified <SEP> RS19 <SEP> (no <SEP> released) <SEP> 94.8%
<Tb>

s la banque cosmidiq

<0 '0 M M
EAU 7 (suite
6S contenue

bzz u.1 ^ IJI Diversité des séquences

s the cosmidiq bank

<0 '0 MM
WATER 7 (continued
6S contained

bzz u.1 ^ IJI Variety of sequences

<tb> Actinomycètes
<tb> a33. <SEP> 1 <SEP> Cellulomonas <SEP> sp <SEP> 99.5%
<tb> a <SEP> 14.7 <SEP> Streptosporangium <SEP> longisporum <SEP> 99. <SEP> 8%
<tb> a <SEP> 21.7 <SEP> Arthrobater <SEP> polychromogenes <SEP> 99.2% <SEP> non <SEP> identifié <SEP> RSW1 <SEP> (non <SEP> publié)
<tb> a8-a9-a10 <SEP> (7) <SEP> Arthrobacter <SEP> oxydans <SEP> 983% <SEP> actinomycète <SEP> non <SEP> identifié <SEP> RSW1 <SEP> (non <SEP> publié) <SEP> 98. <SEP> 5%
<tb> a27-a28-a29 <SEP> (3) <SEP> Arthrobacter <SEP> oxydans <SEP> 98.9% <SEP> actinomycète <SEP> non <SEP> identifié <SEP> RSW1 <SEP> (non <SEP> publié) <SEP> 99.3%
<tb> Acidobacterium <SEP> ?
<tb> a43-a44-a45 <SEP> (4) <SEP> Holophaga <SEP> foetida <SEP> 87.3% <SEP> Clone <SEP> 32-10
<tb> (Acidobacterium <SEP> phylum)6 <SEP> 95.0%
<tb> a27-a28-a29 <SEP> (12) <SEP> Desulfuromonas <SEP> acetexigens <SEP> 88. <SEP> 8% <SEP> Clone <SEP> Sva0515 <SEP> (Acidobacterium <SEP> phylum)6 <SEP> 91.0%
<tb> a37-a38-a39 <SEP> (12) <SEP> Desulfuromonas <SEP> palmitatis <SEP> 90. <SEP> 3% <SEP> Clone <SEP> Sva0515 <SEP> (Acidobacterium <SEP> phylum)6 <SEP> 91. <SEP> 5%
<tb> a37-a38-a39 <SEP> (14) <SEP> Halothermothrix <SEP> orenii <SEP> 87.5% <SEP> Clone <SEP> ii3-7 <SEP> 93. <SEP> 3%
<tb> (Acidobacterium <SEP> phylum)6 <SEP> 93.3%
<tb> a8-a9-a10(9) <SEP> Pelobacter <SEP> carbinolicus <SEP> 865% <SEP> Clone <SEP> ii3-15 <SEP> 92.6%
<tb> (Acidobacterium <SEP> phylum)6 <SEP> 92.6%
<tb> a34-a35-a36(10) <SEP> Nitrococcus <SEP> mobilis <SEP> 90.6% <SEP> clone <SEP> RB43 <SEP> Hum) <SEP> 6 <SEP> 93.7%
<tb> (Acidobacterium <SEP> phylum)6 <SEP> 93.7%
<tb> Non <SEP> classifié <SEP>
<tb> a22.1(19) <SEP> Aerothermobacter <SEP> marianas <SEP> 89. <SEP> 1% <SEP> Eubacterie <SEP> non <SEP> identifiée <SEP> (non <SEP> publié) <SEP> 93.4%
<tb>
98

en s# . <tb> Actinomycetes
<tb> a33. <SEP> 1 <SEP> Cellulomonas <SEP> sp <SEP> 99.5%
<tb> a <SEP> 14.7 <SEP> Streptosporangium <SEP> longisporum <SEP> 99. <SEP> 8%
<tb> a <SEP> 21.7 <SEP> Arthrobater <SEP> polychromogenes <SEP> 99.2% <SEP> no <SEP> identified <SEP> RSW1 <SEP> (no <SEP> released)
<tb> a8-a9-a10 <SEP> (7) <SEP> Arthrobacter <SEP> oxidans <SEP> 983% <SEP> actinomycete <SEP> no <SEP> identified <SEP> RSW1 <SEP> (no <SEP > published) <SEP> 98. <SEP> 5%
<tb> a27-a28-a29 <SEP> (3) <SEP> Arthrobacter <SEP> oxidans <SEP> 98.9% <SEP> actinomycete <SEP> no <SEP> identified <SEP> RSW1 <SEP> (no <SEP > published) <SEP> 99.3%
<tb> Acidobacterium <SEP>?
<tb> a43-a44-a45 <SEP> (4) <SEP> Holophaga <SEP> Foetida <SEP> 87.3% <SEP> Clone <SEP> 32-10
<tb> (Acidobacterium <SEP> phylum) 6 <SEP> 95.0%
<tb> a27-a28-a29 <SEP> (12) <SEP> Desulfuromonas <SEP> acetexigens <SEP> 88. <SEP> 8% <SEP> Clone <SEP> Sva0515 <SEP> (Acidobacterium <SEP> phylum) 6 <SEP> 91.0%
<tb> a37-a38-a39 <SEP> (12) <SEP> Desulfuromonas <SEP> palmitatis <SEP> 90. <SEP> 3% <SEP> Clone <SEP> Sva0515 <SEP> (Acidobacterium <SEP> phylum) 6 <SEP> 91. <SEP> 5%
<tb> a37-a38-a39 <SEP> (14) <SEP> Halothermothrix <SEP> orenii <SEP> 87.5% <SEP> Clone <SEP> ii3-7 <SEP> 93. <SEP> 3%
<tb> (Acidobacterium <SEP> phylum) 6 <SEP> 93.3%
<tb> a8-a9-a10 (9) <SEP> Pelobacter <SEP> carbinolicus <SEP> 865% <SEP> Clone <SEP> ii3-15 <SEP> 92.6%
<tb> (Acidobacterium <SEP> phylum) 6 <SEP> 92.6%
<tb> a34-a35-a36 (10) <SEP> Nitrococcus <SEP> mobilis <SEP> 90.6% <SEP> clone <SEP> RB43 <SEP> Hum) <SEP> 6 <SEP> 93.7%
<tb> (Acidobacterium <SEP> phylum) 6 <SEP> 93.7%
<tb> No <SEP> classified <SEP>
<tb> a22.1 (19) <SEP> Aerothermobacter <SEP> marianas <SEP> 89. <SEP> 1% <SEP> Eubacteria <SEP> no <SEP> identified <SEP> (no <SEP> published) <SEP> 93.4%
<Tb>
98

in s #.

N '0 (D erson et al (1996 - 4Godon et al (19

CL en 0) 1,'c' (6 GONZALEZ et al (1996) - Zhou et N o (D erson et al (1996 - 4 Godon et al (19

CL in 0) 1, 'c' (6 GONZALEZ et al (1996) - Zhou and

TABLEAU 9 :

TABLE 9:

<tb>
<tb> SéquencesDésignation <SEP> SEQ <SEP> ID <SEP> N <SEP>
<tb> Sondes <SEP> et <SEP> amorces
<tb> FGPS431 <SEP> 1
<tb> FGPS122 <SEP> 2
<tb> FGPS350 <SEP> 3
<tb> FGPS643 <SEP> (T) <SEP> 4
<tb> FGPS643 <SEP> (C) <SEP> 5
<tb> R499
<tb> R500 <SEP> 7
<tb> C501 <SEP> 8
<tb> FGPS516
<tb> FGPS517 <SEP> 10
<tb> FGPS518 <SEP> 11
<tb> FGPS612 <SEP> 12
<tb> FGPS669 <SEP> 13
<tb> FGPS618 <SEP> 14
<tb> FGPS614 <SEP> 15
<tb> FGPS615 <SEP> 16
<tb> FGPS616 <SEP> 17
<tb> FGPS621 <SEP> 18
<tb> FGPS617 <SEP> 19
<tb> FGPS680 <SEP> 20
<tb> FGPS619 <SEP> 21
<tb> 63f <SEP> 22
<tb> 1387r <SEP> 23
<tb> Oligo-1 <SEP> (Exemple <SEP> 10) <SEP> 24
<tb> Oligo-2 <SEP> (Exemple <SEP> 10) <SEP> 25
<tb> A1 <SEP> 26
<tb> A2 <SEP> 27
<tb> B1 <SEP> 28
<tb> B2 <SEP> 29
<tb> Acides <SEP> nucléiques <SEP> PKS-I
<tb> Amb9 <SEP> 30
<tb> Amb12 <SEP> 31
<tb> Ery19 <SEP> 32
<tb> A9b12 <SEP> 33
<tb> A23G1 <SEP> 1-1 <SEP> 34
<tb> A26G1 <SEP> 1-2 <SEP> 35
<tb> A26G1-10 <SEP> 36
<tb> <Tb>
<tb> SequencesDesignation <SEP> SEQ <SEP> ID <SEP> N <SEP>
<tb> Probes <SEP> and <SEP> primers
<tb> FGPS431 <SEP> 1
<tb> FGPS122 <SEP> 2
<tb> FGPS350 <SEP> 3
<tb> FGPS643 <SEP> (T) <SEP> 4
<tb> FGPS643 <SEP> (C) <SEP> 5
<tb> R499
<tb> R500 <SEP> 7
<tb> C501 <SEP> 8
<tb> FGPS516
<tb> FGPS517 <SEP> 10
<tb> FGPS518 <SEP> 11
<tb> FGPS612 <SEP> 12
<tb> FGPS669 <SEP> 13
<tb> FGPS618 <SEP> 14
<tb> FGPS614 <SEP> 15
<tb> FGPS615 <SEP> 16
<tb> FGPS616 <SEP> 17
<tb> FGPS621 <SEP> 18
<tb> FGPS617 <SEP> 19
<tb> FGPS680 <SEP> 20
<tb> FGPS619 <SEP> 21
<tb> 63f <SEP> 22
<tb> 1387r <SEP> 23
<tb> Oligo-1 <SEP> (Example <SEP> 10) <SEP> 24
<tb> Oligo-2 <SEP> (Example <SEP> 10) <SEP> 25
<tb> A1 <SEP> 26
<tb> A2 <SEP> 27
<tb> B1 <SEP> 28
<tb> B2 <SEP> 29
<tb>SEP> nucleic acids <SEP> PKS-I
<tb> Amb9 <SEP> 30
<tb> Amb12 <SEP> 31
<tb> Ery19 <SEP> 32
<tb> A9b12 <SEP> 33
<tb> A23G1 <SEP> 1-1 <SEP> 34
<tb> A26G1 <SEP> 1-2 <SEP> 35
<tb> A26G1-10 <SEP> 36
<Tb>

TABLEAU 9 (suite 1):Séquences

TABLE 9 (continued 1): Sequences

<tb>
<tb> Désignation <SEP> SEQ <SEP> ID <SEP> N
<tb> A35 <SEP> E4-16 <SEP> 37
<tb> A49F1-32 <SEP> 38
<tb> A17d2-3 <SEP> 39
<tb> A53F11-13 <SEP> 40
<tb> A53F11-14 <SEP> 41
<tb> A22A <SEP> 2-11 <SEP> 42
<tb> A36E8-1 <SEP> 43
<tb> A52E8-2 <SEP> 44
<tb> Séquences <SEP> d'acides <SEP> aminés <SEP> PKS-I
<tb> Amb9 <SEP> 45
<tb> Amb12 <SEP> 46
<tb> Ery19 <SEP> 47
<tb> A9b12 <SEP> 48
<tb> A23G1 <SEP> 1-1 <SEP> 49
<tb> A26G1 <SEP> 1-2 <SEP> 50
<tb> A26G1-10 <SEP> 51
<tb> A35 <SEP> E4-16 <SEP> 52
<tb> A49F1-32 <SEP> 53
<tb> A17d2-3 <SEP> 54
<tb> A53F11-13 <SEP> 55
<tb> A53F11-14 <SEP> 56
<tb> A22A <SEP> 2-11 <SEP> 57
<tb> A36E8-1 <SEP> 58
<tb> A52E8-2 <SEP> 59
<tb> Séquences <SEP> ADNr <SEP> 16S
<tb> a24.1(2), <SEP> 60
<tb> a4. <SEP> a6.a7 <SEP> (7) <SEP> 61
<tb> a52.a53.a5(15) <SEP> 62
<tb> a49.a50.a51(11) <SEP> 63
<tb> a4.a6.a7(14) <SEP> 64
<tb> a30.a31.a32(7) <SEP> 65
<tb> a37.a38.a39 <SEP> 6) <SEP> 66
<tb> a46.a47.a48 <SEP> (14) <SEP> 67
<tb> a49.a50.a51(1) <SEP> 68
<tb> a52.a53.a5(8) <SEP> 69
<tb> a8.a9.a10(13) <SEP> 70
<tb> a1.a2.a3(13) <SEP> 71
<tb> a43.a44.a45(10) <SEP> 72
<tb> a27.a28.a29 <SEP> 5) <SEP> 73
<tb> <Tb>
<tb> Designation <SEP> SEQ <SEP> ID <SEP> N
<tb> A35 <SEP> E4-16 <SEP> 37
<tb> A49F1-32 <SEP> 38
<tb> A17d2-3 <SEP> 39
<tb> A53F11-13 <SEP> 40
<tb> A53F11-14 <SEP> 41
<tb> A22A <SEP> 2-11 <SEP> 42
<tb> A36E8-1 <SEP> 43
<tb> A52E8-2 <SEP> 44
<tb> Sequences <SEP> of <SEP> amino acids <SEP> PKS-I
<tb> Amb9 <SEP> 45
<tb> Amb12 <SEP> 46
<tb> Ery19 <SEP> 47
<tb> A9b12 <SEP> 48
<tb> A23G1 <SEP> 1-1 <SEP> 49
<tb> A26G1 <SEP> 1-2 <SEP> 50
<tb> A26G1-10 <SEP> 51
<tb> A35 <SEP> E4-16 <SEP> 52
<tb> A49F1-32 <SEP> 53
<tb> A17d2-3 <SEP> 54
<tb> A53F11-13 <SEP> 55
<tb> A53F11-14 <SEP> 56
<tb> A22A <SEP> 2-11 <SEP> 57
<tb> A36E8-1 <SEP> 58
<tb> A52E8-2 <SEP> 59
<tb> Sequences <SEP> RNA <SEP> 16S
<tb> a24.1 (2), <SEP> 60
<tb> a4. <SEP> a6.a7 <SEP> (7) <SEP> 61
<tb> a52.a53.a5 (15) <SEP> 62
<tb> a49.a50.a51 (11) <SEP> 63
<tb> a4.a6.a7 (14) <SEP> 64
<tb> a30.a31.a32 (7) <SEP> 65
<tb> a37.a38.a39 <SEP> 6) <SEP> 66
<tb> a46.a47.a48 <SEP> (14) <SEP> 67
<tb> a49.a50.a51 (1) <SEP> 68
<tb> a52.a53.a5 (8) <SEP> 69
<tb> a8.a9.a10 (13) <SEP> 70
<tb> a1.a2.a3 (13) <SEP> 71
<tb> a43.a44.a45 (10) <SEP> 72
<tb> a27.a28.a29 <SEP> 5) <SEP> 73
<Tb>

TABLEAU 9 (suite 2):Séquences

TABLE 9 (cont'd 2): Sequences

<tb>
<tb> Désignation <SEP> SEQ <SEP> ID <SEP> N <SEP>
<tb> a23.1 <SEP> 74
<tb> a25. <SEP> 1 <SEP> 75
<tb> a18.1(22) <SEP> 76
<tb> a33. <SEP> 1 <SEP> 77
<tb> a14.7 <SEP> 78
<tb> a21.7 <SEP> 79
<tb> a8.a9.a10(7) <SEP> 80
<tb> a8.a9.a10(18) <SEP> 81
<tb> a27.a28.a29 <SEP> 3 <SEP> 82
<tb> a34.a35.a36 <SEP> 5 <SEP> 83
<tb> a22.1(19) <SEP> 84
<tb> a11.a12.a13(5) <SEP> 85
<tb> a19.a20.a26 <SEP> 9 <SEP> 86
<tb> a40.a41.a42(6) <SEP> 87
<tb> a27.a28.a29 <SEP> 8 <SEP> 88
<tb> a27.a28.a29(12) <SEP> 89
<tb> a37.a38.a39 <SEP> 12) <SEP> 90
<tb> a46.a47.a48(6) <SEP> 91
<tb> a11.a12.a13(11) <SEP> 92
<tb> a15.a16.a17(12) <SEP> 93
<tb> a15.a16.a17(5) <SEP> 94
<tb> a19.a20.a26(13) <SEP> 95
<tb> a37.a38.a39(14) <SEP> 96
<tb> a8.a9.a10(9 <SEP> 97
<tb> a19.a20.a26(5) <SEP> 98
<tb> a43.a44.a45(4 <SEP> 99
<tb> a1.a2.a3(4) <SEP> 100
<tb> a4.a6.a7(23) <SEP> 101
<tb> a49.a50.a51 <SEP> 22 <SEP> 102
<tb> a8.a9.a10(2) <SEP> 103
<tb> a34.a35.a36 <SEP> 3) <SEP> 104
<tb> a34.a35.a36 <SEP> 10 <SEP> 105
<tb> a40.a41.a42 <SEP> 13 <SEP> 106
<tb> <Tb>
<tb> Designation <SEP> SEQ <SEP> ID <SEP> N <SEP>
<tb> a23.1 <SEP> 74
<tb> a25. <SEP> 1 <SEP> 75
<tb> a18.1 (22) <SEP> 76
<tb> a33. <SEP> 1 <SEP> 77
<tb> a14.7 <SEP> 78
<tb> a21.7 <SEP> 79
<tb> a8.a9.a10 (7) <SEP> 80
<tb> a8.a9.a10 (18) <SEP> 81
<tb> a27.a28.a29 <SEP> 3 <SEP> 82
<tb> a34.a35.a36 <SEP> 5 <SEP> 83
<tb> a22.1 (19) <SEP> 84
<tb> a11.a12.a13 (5) <SEP> 85
<tb> a19.a20.a26 <SEP> 9 <SEP> 86
<tb> a40.a41.a42 (6) <SEP> 87
<tb> a27.a28.a29 <SEP> 8 <SEP> 88
<tb> a27.a28.a29 (12) <SEP> 89
<tb> a37.a38.a39 <SEP> 12) <SEP> 90
<tb> a46.a47.a48 (6) <SEP> 91
<tb> a11.a12.a13 (11) <SEP> 92
<tb> a15.a16.a17 (12) <SEP> 93
<tb> a15.a16.a17 (5) <SEP> 94
<tb> a19.a20.a26 (13) <SEP> 95
<tb> a37.a38.a39 (14) <SEP> 96
<tb> a8.a9.a10 (9 <SEP> 97
<tb> a19.a20.a26 (5) <SEP> 98
<tb> a43.a44.a45 (4 <SEP> 99
<tb> a1.a2.a3 (4) <SEP> 100
<tb> a4.a6.a7 (23) <SEP> 101
<tb> a49.a50.a51 <SEP> 22 <SEP> 102
<tb> a8.a9.a10 (2) <SEP> 103
<tb> a34.a35.a36 <SEP> 3) <SEP> 104
<tb> a34.a35.a36 <SEP> 10 <SEP> 105
<tb> a40.a41.a42 <SEP> 13 <SEP> 106
<Tb>

REFERENCES # Amann, R. L, W. Ludwig, and K.-H. Schleifer. 1995. Phylogenetic identification and in situ détection of individual microbial cells without cultivation. Microbiol. Rev. 59:143-169. REFERENCES # Amann, R. L, W. Ludwig, and K.-H. Schleifer. 1995. Phylogenetic identification and in situ detection of individual microbial cells without cultivation. Microbiol. Rev. 59: 143-169.

# Atschul S. F., Madden T.L., Schâffer A. A., Zhang J., Zhang Z., Miller W., Lipman D.J. (1997) " Gapped BLAST and PSI-BLAST : a nex generation of protein databses search programs " Nucleic Acid Researchs Vol 25 : 3389-3404 # Atschul SF étal., 1990, J. Mol Biol, 215 : 403-410. # Atschul SF, Madden TL, AA Schäffer, Zhang J., Zhang Z., Miller W., Lipman DJ (1997) "Gapped BLAST and PSI-BLAST: a nex generation of protein databses" Nucleic Acid Researchs Vol 25: 3389-3404 # Atschul SF et al., 1990, J. Mol Biol, 215: 403-410.

# Bakken, L. R. 1985. Separation and purification of bacteria from soil. # Bakken, L. R. 1985. Separation and purification of bacteria from soil.

Appl. Environ. Microbiol. 49 :1482-1487. Appl. About. Microbiol. 49: 1482-1487.

# Bibb MJ, Findlay PR, Johnson MW, The relationship between base composition and codon usage in bacterial genes and its use for the simple and reliable identification of protein-coding sequences., Gene 30 : 1-3, 157-66, Oct, 1984. # Bibb MJ, Findlay PR, Johnson MW, The relationship between base composition and codon usage in bacterial genes and its use for the single and reliable identification of protein-coding sequences., Gene 30: 1-3, 157-66, Oct. 1984.

# Biesiekierska-Galguen M. (1997) "Atténuation biologique de contaminants xénobiotiques dans le sol - modèle lindane " Diplôme de DEA National de Toxicologie, Université Claude Bernard Lyon I. # Biesiekierska-Galguen M. (1997) "Biological attenuation of xenobiotic contaminants in soil - lindane model" National Diploma of Toxicology, Claude Bernard Lyon I University.

# Blondelet-Rouault MH, Weiser J, Lebrihi A, Branny P, Pernodet JL. # Blondelet-Rouault MH, Weiser J, Lebrihi A, Branny P, Pernodet JL.

Institut de Genetique et Microbiologie, URA CNRS 2225, Universite Paris XI, Orsay, France. Gene 1997 May 6;190(2):315-7 # Borchert S étal., 1992, Microbiology Letters, 92 : Institute of Genetics and Microbiology, URA CNRS 2225, Paris XI University, Orsay, France. Gene 1997 May 6; 190 (2): 315-7. Borchert S et al., 1992, Microbiology Letters, 92:

175-180' Boccard, F., Smokvina, T. Pernodet, J. L. Friedmann, A. & Guerineau M. (1989) . The integrated conjugative plasmid pSAM2 of Streptomyces ambofaciens is related to temperature bacteriophages. Embo J 8,973-80 # Chatzinotas A., Sandaa R-A., Schônhuber W., Amanna R., Daae F.L., Torsvik V., Zeyer J., Hahn D. (1998) " Analysis of broad-scale differences in microbial community composition of two pristine forest soils " Systematic and Applied Microbiology Vol 21 : 579-587 # Clegg, C. D., K. Ritz, and B. S. Griffiths. 1997. Direct ectraction of microbial community DNA from humified upland soils. Lett. Appl. 175-180 Boccard, F., Smokvina, T. Pernodet, J.L. Friedmann, A. & Guerineau M. (1989). The integrated plasmid conjugate pSAM2 of Streptomyces ambofaciens is related to temperature bacteriophage. Embo J 8,973-80 # Chatzinotas A., Sandaa RA., Schonhuber W., Amanna R., Daae FL, Torsvik V., Zeyer J., Hahn D. (1998) "Analysis of broad-scale differences in microbial community composition of Pristine Forest Soils "Systematic and Applied Microbiology Vol 21: 579-587 # Clegg, CD, Ritz, K. and Griffiths BS. 1997. Direct ectraction of microbial community DNA from humified upland soils. Lett. Appl.

Microbiol. 25 :30-33. Microbiol. 25: 30-33.

# Clerc-Bardin, S., J. -L. Pernodet, A. Frostegârd, and P. Simonet. # Clerc-Bardin, S., J. -L. Pernodet, A. Frostegard, and P. Simonet.

Development of a conditional suicide system for a Streptomyces lividans strain and its use to investigate conjugative transfer in soil. Submitted. Development of a conditional suicide system for Streptomyces lividans strain and its use to investigate conjugative transfer in soil. Submitted.

# Elledge SJ, Mulligan JT, Ramer SW, Spottswood M, Davis RW. # Elledge SJ, Mulligan JT, SW Ramer, Spottswood M, Davis RW.

Department of Biochemistry, Baylor College of Medicine, Houston, TX 77030. Proc Natl Acad Sci U S A 1991 Mar 1;88(5):1731-5 # Engelen, B., K. Meinken, F. Von Wintzingerode, H. Heuer, H.-P. Department of Biochemistry, Baylor College of Medicine, Houston, TX 77030. Proc Natl Acad Sci USA 1991 Mar 1; 88 (5): 1731-5 # Engelen, B., K. Meinken, F. von Wintzingerode, H. Heuer, H.-P.

Malkomes, and H. Backhaus. 1998. Monitoring impact of a pesticide treatment on bacterial soil communities by metabolic and genetic fingerprinting in addition to conventional testing procedures. Appl. Environ. Malkomes, and H. Backhaus. 1998. The impact of pesticide treatment on bacterial soils. Appl. About.

Microbiol. 64 :2814-2821. Microbiol. 64: 2814-2821.

# Farrelly, V., F. A. Rainey, and E. Stackebrandt. 1995. Effect of genome size and rrn gene copy number on PCR amplification of 16S rRNA genes from a mixture of bacterial species. Appl. Environ. Microbiol. # Farrelly, V., F. A. Rainey, and E. Stackebrandt. 1995. Effect of genome size and rnn gene copy number on PCR amplification of 16S rRNA genes from a mixture of bacterial species. Appl. About. Microbiol.

61:2798-2801. 61: 2798-2801.

# Faulkner D. V., Jurka J. (1988) " Multiple Aligned Sequence Editor (MASE) " Trends in Biochemical Sciences Vol 13 : 321-322 # Faulkner D.V., Jurka J. (1988) "Multiple Aligned Sequence Editor (MASE)" Trends in Biochemical Sciences Vol 13: 321-322

#Frostegard, A., Tunlid, A., and Bààth, E. 1991. Microbial biomass measured as total lipid phosphate in soils of différent organic content. J. #Frostegard, A., Tunlid, A., and Baath, E. 1991. Microbial biomass measured as total lipid phosphate in soils of different organic content. J.

Microbiol. Meth. 14:151-163. Microbiol. Meth. 14: 151-163.

#Giddings, G. 1998. The release of genetically engineered microorganisms and viruses into the environment. New Phytol. 140 :173-184. #Giddings, G. 1998. The release of genetically engineered microorganisms and viruses into the environment. New Phytol. 140: 173-184.

#Gladek, A., and J. Zakrzewska. 1984. Genome size of Streptomyces. #Gladek, A., and J. Zakrzewska. 1984. Genome size of Streptomyces.

FEMS Microbiol. Lett. 24 :73-76. FEMS Microbiol. Lett. 24: 73-76.

#Gribskov M, Devereux J, Burgess RR, The codon préférence plot: graphie analysis of protein coding séquences and prediction of gene expression., Nucleic Acids Res 12 : 1Pt 2, 539-49, Jan 11,1984. #Gribskov M, Devereux J, Burgess RR, The codon preference plot: graphie analysis of protein coding sequences and prediction of gene expression., Nucleic Acids Res 12: 1Pt 2, 539-49, Jan 11,1984.

.Guiney et al., 1983, Proc. Natl. Acad. Sci USA, (12) : 3595-3598. Guiney et al., 1983, Proc. Natl. Acad. Sci USA, (12): 3595-3598.

# Gourmelen, A. Biondelet-Rouault, M. H. & Pernodet, J. L. (1998). # Gourmelen, A. Biondelet-Rouault, M.H. & Pernodet, J.L. (1998).

Characterization of a glycosyl transferase inactivating macrolides, encoded by gimA from Streptomyces ambofaciens, Antimicrob Agents Chemother 42, 2612-9. Characterization of a glycosyl transferase inactivating macrolide, encoded by gimA from Streptomyces ambofaciens, Antimicrob Agents Chemother 42, 2612-9.

# Hayakawa, M., and H. Nonomura. 1987. Humic acid-vitamin agar, a new medium for the selective isolation of soil actinomycetes. J. Ferment. # Hayakawa, M., and H. Nonomura. 1987. Humic acid-vitamin agar, a new medium for the selective isolation of soil actinomycetes. J. Ferment.

Technol. 65 :501-509. Technol. 65: 501-509.

# Hayakawa, M., Ishizawa K., and H. Nonomura. 1988. Distribution of rare actinomycetes in Japanese soils. J. Ferment. Technol. 66 :367-373. # Hayakawa, M., Ishizawa K., and H. Nonomura. 1988. Distribution of rare actinomycetes in Japanese soils. J. Ferment. Technol. 66: 367-373.

# Hickey, R. J., and H. D. Tresner. 1952. A cobalt containing medium for sporulation of Streptomyces species. J. Bacteriol. 64 :891-892. # Hickey, R. J., and H. D. Tresner. 1952. A cobalt containing medium for sporulation of Streptomyces species. J. Bacteriol. 64: 891-892.

# Hintermann, G., R., Crameri, Kieser, T., and R. Hütter. 1981. # Hintermann, G., R., Crameri, Kieser, T., and R. Hütter. nineteen eighty one.

Restriction analysis of the Streptomyces glaucescens genome by agarose gel electrophoresis. Arch. Microbiol. 130 :218-222. Restriction analysis of the Streptomyces glaucescens genome by agarose gel electrophoresis. Arch. Microbiol. 130: 218-222.

# Holben, W. E., J. K. Jansson, B. K. Chelm, and J. M. Tiedje. 1988. # Holben, W.E., J.K. Jansson, B.K. Chelm, and J.M.Tiedje. 1988.

DNA probe method for the détection of spécifie microorganisms in the soil bacterial community. Appl. Environ. Microbiol. 54 :703-711. DNA probe method for the detection of microorganisms in the soil bacterial community. Appl. About. Microbiol. 54: 703-711.

# Hong Fu et al., 1995, Molecular diversity, 1 : 121-124 # Hopwood DA, Bibb M J, Chater K F, Kieser T., Bruton C. J., Kieser H. M., Lydiate D. J., Smith C. P., Ward J. M. and Scrempf H. 1985. # Hong Fu et al., 1995, Molecular Diversity, 1: 121-124 # Hopwood DA, Bibb MJ, KF Chater, Kieser T., Bruton CJ, Kieser HM, Lydiate DJ, CP Smith, Ward JM and Scrempf H. 1985 .

Genetic Manipulation of Streptomyces. A Laboratory manual. The John Innes Foundation, Norwich, U.K. Genetic Manipulation of Streptomyces. A Laboratory manual. The John Innes Foundation, Norwich, U.K.

# Hopwood, D. A., M. J. Bibb, K. F. Chater, T. Kieser, C. J. Bruton, H. # Hopwood, D. A., M. J. Bibb, K. F. Chater, T. Kieser, C. J. Bruton, H.

M. Kieser, D. J. Lydiate, C. P. Smith, J. M. Ward, and H. Schrempf. M. Kieser, D. J. Lydiate, C. P. Smith, J. M. Ward, and H. Schrempf.

1985. Genetic manipulation of streptomyces - a laboratory manual. The John Innes Foundation, Norwich, United Kingdom. 1985. Genetic manipulation of streptomyces - a laboratory manual. The John Innes Foundation, Norwich, United Kingdom.

# Horinouchi S., Malpartida F., Hopwood D. et Beppu T., Mol. Gen. # Horinouchi S., Malpartida F., Hopwood D. and Beppu T., Mol. Gen.

Genet. (1989) 215 :355-357. Broom. (1989) 215: 355-357.

# Imai R., Nagata Y., Fukuda M., Takagi M., Yano K. (1991) " Molecular cloning of a Pseudomonas paucimobilis gene encoding a 17-kilodalton plypeptide that eliminates HCI molecules from ?-Hexachlorocyclohexane " Journal of Bacteriology Vol 17 ", No21 : 6811-6819 # Jacobsen, C. S., and O. F. Rasmussen. 1992. Development and application of a new method to extract bacterial DNA from soil based on séparation of bacteria from soil with cation-exchange resin. Appl. Environ. # Imai R., Nagata Y., Fukuda M., Takagi M., Yano K. (1991) "Molecular cloning of a Pseudomonas paucimobilis gene encoding a 17-kilodalton polypeptide that eliminates HCI molecules from? -Hexachlorocyclohexane" Journal of Bacteriology Vol 17, No 21: 6811-6819, Jacobsen, CS, and Rasmussen, 1992. Development and application of a new method to extract bacterial DNA from soil based separation of bacteria from cation exchange resin.

Microbiol. 58 :2458-2462. Microbiol. 58: 2458-2462.

Jae-Hyuk Y.U. and Léonard T.J.,1995. Sterigmetscytin biosynthesis in Aspergilus nidulans requires a ... type I polyketide synthase. J. Bacteriol, (August) : 4792-4800. Jae-Hyuk Y.U. and Leonard T.J., 1995. Sterigmetscytin biosynthesis in Aspergillus nidulans requires a ... type I polyketide synthase. J. Bacteriol, (August): 4792-4800.

# Ka, J. O., W. E. Holben, and J. M. Tiedje. 1994. Analysis of compétition in soil among 2,4-dichlorophenoxyacetic acid-degrading bacteria. Appl. Environ. Microbiol. 60 :1121-1128. # Ka, J. O., W. E. Holben, and J. M. Tiedje. 1994. Analysis of competition in soil among 2,4-dichlorophenoxyacetic acid-degrading bacteria. Appl. About. Microbiol. 60: 1121-1128.

# Kah-Tong S et al., 1997, J Bacteriol, G179(23) : 7360-7368 # Kimura M. (1980) " A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide séquences Journal of Molecular Evolution Vol 16 : 111-120 # Kuske, C. R., K. L. Banton, D. L. Adorada, P. C. Stark, K. K. Hill, and P. J. Jackson. 1998. Small-scale DNA sample preparation method for field PCR détection of microbial cells and spores in soil. Appl. Environ. # Kah-Tong S et al., 1997, J Bacteriol, G179 (23): 7360-7368 # Kimura M. (1980) "A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences" Journal of Molecular Evolution Vol 16: 111-120 # Kuske, CR, KL Banton, DL Adorada, PC Stark, KK Hill, and PJ Jackson 1998. Small-scale DNA sample preparation method for field PCR detection of microbial cells and spores in soil. About.

Microbiol. 64 :2463-2472. Microbiol. 64: 2463-2472.

# Lacalle RA, Pulido D, Vara J, Zalacain M, Jimenez A. Centro de Biologia Molecular (CSIC-UAM), Universidad Autonoma, Canto Blanco, Madrid, Spain. Gene 1989 Jul 15;79(2):375-80 # Lee, S.-Y., J. Bollinger, D. Bezdicek, and A. Ogram. 1996. Estimation of the abundance of an uncultured soil bacterial strain by a competitive quantitative PCR method. Appl. Environ. Microbiol. 62 :3787-3793. # Lacalle RA, Pulido D, Vara J, Zalacain M, Jimenez A. Biologia Molecular Center (CSIC-UAM), Universidad Autonoma, Canto Blanco, Madrid, Spain. Gene 1989 Jul 15; 79 (2): 375-80. Lee, S.-Y., J. Bollinger, D. Bezdicek, and A. Ogram. 1996. Estimation of the abundance of a bacterial strain by a competitive quantitative PCR method. Appl. About. Microbiol. 62: 3787-3793.

# Leff, L. G., J. R. Dana, J. V. McArthur, and L. J. Shimkets. 1995. # Leff, L. G., J. R. Dana, J. V. McArthur, and L. J. Shimkets. 1995.

Comparison of methods of DNA extraction from stream sediments. Appl. Comparison of methods of DNA extraction from stream sediments. Appl.

Environ. Microbiol. 61:1141-1143. About. Microbiol. 61: 1141-1143.

# Liesack, W., and E. Stackebrandt. 1992. Occurrence of novel groups of the domain Bacteria as revealed by analysis of genetic material isolated from an Australian terrestrial environment. J. Bacteriol. 174 :5072-5078. # Liesack, W., and E. Stackebrandt. 1992. Occurrence of novel groups of the world Bacteria as revealed by an Australian terrestrial environment. J. Bacteriol. 174: 5072-5078.

# Liesack, W., P. H. Janssen, F. A. Rainey, N. L. Ward-Rainey, and E. # Liesack, W., H. Janssen, F. Rainey, N., L. Ward-Rainey, and E.

Stackebrandt. 1997. Microbial diversity is soil: the need for a combined approach using molecular and cultivation techniques. In J. D. Van Elsas, J. T. Trevors, and E. M. H. Wellington (ed.), Modern soil microbiology, Marcel Dekker, Inc., New York. (p 375-439) # Lorentz, M. G., and W. Wackernagel. 1994. Bacterial gene transfer by natural genetic transformation in the environment. Microbiol. Reviews 58 :563-602. Stackebrandt. 1997. Microbial diversity is soil: the need for a combined approach using molecular and cultivation techniques. In J. D. Van Elsas, J. T. Trevors, and E. M. H. Wellington (ed.), Modern Soil Microbiology, Marcel Dekker, Inc., New York. (p 375-439) # Lorentz, M. G., and W. Wackernagel. 1994. Bacterial gene transfer by natural genetic transformation in the environment. Microbiol. Reviews 58: 563-602.

# Maidak B. L., Cole J.R., Parker C.T., Garrity G. M., Larsen N., Li B., Lilburn T. G., McCaughey M.J., Olsen G. J., Overbeek R., Pramanik S., Schmidt T. M., Tiedje J. M., Woese C. R. (1999) " A new project of the RDP (Ribosomal Database Project) " Nucleic Acids Research Vol 27 : 171-173 . Mazodier P. et al., 1989, J. Bacteriol., 171(6) : 3583-3585. # Maidak BL, Cole JR, Parker CT, Garrity GM, Larsen N., Li B., Lilburn TG, McCaughey MJ, Olsen GJ, Overbeek R., Pramanik S., Schmidt TM, Tiedje JM, Woese CR (1999) " A new project of the Ribosomal Database Project (RDP) "Nucleic Acids Research Vol 27: 171-173. Mazodier P. et al., 1989, J. Bacteriol., 171 (6): 3583-3585.

. More, M. I., J. B. Herrick, M. C. Silva, W. C. Ghiorse, and E. L. . More, M. I., J. B. Herrick, M. C. Silva, W. C. Ghiorse, and E. L.

Madsen. 1994. Quantitative cell lysis of indigenous microorganisms and rapid extraction of microbial DNA from sediment. Appl. Environ. Microbiol. Madsen. 1994. Quantitative cell lysis of indigenous microorganisms and rapid extraction of microbial DNA from sediment. Appl. About. Microbiol.

60 :1572-1580. 60: 1572-1580.

# Murakami T, Holt TG, Thompson CJ, Unité de Génie Microbiologique, Institut Pasteur, Paris, France. J. Bacteriol 1989 Mar; 171 (3): 1459-66 #Nagata Y., Hatta T., Imai R., Kimbara K., Fukuda M., Yano K., Takagi M. (1993) " Purification and characterization of ?-Hexachlorocyclohexane (?-HCH)dehydrochlorinase (LinA) from Pseudomonas paucimobilis " Bioscience, Biotechnology and Biochemistry Vol 57 No 9 : 1582-1583 # Nalin R., Simonet P., Vogel T. M., Normand P. (1999) " Rhodanobacter lindaniclasticus gen. nov., sp., nov., a lindane-degrading bacterium " International Journal of Systematic Bacteriology Vol 49 : 19-23 # Nesme, X., C. Picard, and P. Simonet. 1995. Spécifie DNA séquences for détection of soil bacteria. ln J. T. Trevors, and J. D. van Elsas (ed.), Nucleic acids in the environment, methods and application. Springer Lab Manual. (p 111-139) # Nilsson B, Uhlen M, Josephson S, Gatenbeck S, Philipson L. Nucleic Acids Res 1983 Nov 25;11 (22):8019-30 # Normand P. et al., 1995, Océanis, 21(1) : 31-56 # Murakami T, Holt TG, CJ Thompson, Microbiological Engineering Unit, Pasteur Institute, Paris, France. J. Bacteriol 1989 Mar; 171 (3): 1459-66 #Nagata Y., Hatta T., Imai R., Kimbara K., Fukuda M., Yano K., Takagi M. (1993) "Purification and characterization of? -Hexachlorocyclohexane (? HCH) dehydrochlorinase (LinA) from Pseudomonas paucimobilis "Bioscience, Biotechnology and Biochemistry Vol 57 No 9: 1582-1583 # Nalin R., Simonet P., VogelTM, Normand P. (1999)" Rhodanobacter lindaniclasticus gen. Nov., lindane-degrading bacterium International Journal of Systematic Bacteriology Vol 49: 19-23 # Nesme, X., C. Picard, and P. Simonet. 1995. Specifies DNA sequences for detection of soil bacteria. J. T. Trevors, J. and D. van Elsas (ed.), Nucleic acids in the environment, methods and application. Springer Lab Manual. (p 111-139) # Nilsson B, Uhlen M, Josephson S, Gatenbeck S, Philipson L. Nucleic Acids Res 1983 Nov 25; 11 (22): 8019-30 # Normand P. et al., 1995, Océanis, 21 (1): 31-56

#Ogram, A. V., M. L. Mathot, J. B. Harsh., J. Boyle, and C. A. #Ogram, A. V., M. Mathot L., J. B. Harsh., J. Boyle, and C. A.

Pettigrew, JR. 1994. Effects of DNA polymer length on its adsorption to soils. Appl. Environ. Microbiol. 60 :393-396. Pettigrew, JR. 1994. Effects of DNA polymer length on adsorption to soils. Appl. About. Microbiol. 60: 393-396.

#Ogram, A., G. S. Sayler, and T. Barkay. 1987. The extraction and purification of microbial DNA from sediments. J. Microbiol. Methods 7:57- 66. #Ogram, A., G. S. Sayler, and T. Barkay. 1987. The extraction and purification of microbial DNA from sediments. J. Microbiol. Methods 7: 57-66.

#Olsen, R. A., and Bakken, L. R. 1987. Viability of soil bacteria: optimization of the plate-counting technique. Microb. Ecol. 13 :59-74. #Olsen, R.A., and Bakken, L.R. 1987. Viability of soil bacteria: optimization of technical plate-counting. Microb. School. 13: 59-74.

#Paget, E., L. Jocteur Monrozier, and P. Simonet. 1992. Adsorption of DNA on clay minerais: protection against DNasel and influence on gene transfer. FEMS Microbiol. Lett. 97 :31-40. #Paget, E., L. Jocteur Monrozier, and P. Simonet. 1992. Adsorption of DNA on clay minerals: protection against DNasel and influence on gene transfer. FEMS Microbiol. Lett. 97: 31-40.

*Patra, G., P. Sylvestre, V. Ramisse, J. Thérasse, and J. -L. Guesdon. * Patra, G., P. Sylvester, V. Ramisse, J. Thérasse, and J. -L. Guesdon.

1996. Isolation of a spécifie chromosomic DNA séquence of Bacillus anthrasis and its possible use in diagnosis. FEMS Immunol. Medical Microbiology 15 :223-231. 1996. Isolation of a specifies chromosomal DNA sequence of Bacillus anthrasis and its possible use in diagnosis. FEMS Immunol. Medical Microbiology 15: 223-231.

# Pernodet J.L. Fish, S. Blondelet-Rouault, M.H. & Cundliffe, E. (1996). # Pernodet J.L. Fish, S. Blondelet-Rouault, M.H. & Cundliffe, E. (1996).

The macrolide-lincosamide-streptogramin B résistance phenotypes characterized by using a specifically deleted, antibiotic-sensitive strain of Streptomyces lividans. Antimicrob Agents Chemother 40, 581, 5. The macrolide-lincosamide-streptogramin B resistance phenotypes characterized by using a specific deleted antibiotic-sensitive strain of Streptomyces lividans. Antimicrob Agents Chemother 40, 581, 5.

# Pernodet J.L. , Gourmelen, A., Blondelet-Rouault, M.H. & Cundliffe, E. (1999). Dispensable ribosomal résistance to spiramycin conferred by srmA in the spiramycin producer Streptomyces ambofaciens. 145,2355- 64. # Pernodet J.L., Gourmelen, A., Blondelet-Rouault, M.H. & Cundliffe, E. (1999). Dispensable ribosomal resistance to spiramycin conferred by srmA in the spiramycin producer Streptomyces ambofaciens. 145.2355-64.

.Picard, C., C. Ponsonnet, X. Nesme, and P. Simonet. 1992. Detection and enumeration of bacteria in soil by direct DNA extraction and polymerase chain reaction. Appl. Environ. Microbiol. 58 :2717-2722. Picard, C., C. Ponsonnet, X. Nesme, and P. Simonet. 1992. Detection and enumeration of bacteria in soil by direct DNA extraction and polymerase chain reaction. Appl. About. Microbiol. 58: 2717-2722.

#Preud'homme, J., Belloc, A., Charpentié, Y., and Tarridec, P. 1965. Un antibiotique formé de deux groupes de composants à synergie d'action : la pristinamycine C. R. Acad. Sci. 260 :1309-1312. # Preud'homme, J., Belloc, A., Charpentier, Y., and Tarridec, P. 1965. An antibiotic formed from two groups of synergistically active components: pristinamycin C. R. Acad. Sci. 260: 1309-1312.

#Priemé, A., J. I. B. Sitaula, A. K. Klemedtsson, and L. R. Bakken. # Priemé, A., J. I. B. Sitaula, A. K Klemedtsson, and L. R. Bakken.

1996. Extraction of methane-oxidizing bacteria from soil particles. FEMS Microbiol. Ecol. 21 : 59-68. 1996. Extraction of methane-oxidizing bacteria from soil particles. FEMS Microbiol. School. 21: 59-68.

'Presser, J. 1994. Molecular marker systems for détection of genetically engineered micro-organisms in the environment. Microbiol. 140 :5-17. Presser, J. 1994. Molecular marker systems for the detection of genetically engineered microorganisms in the environment. Microbiol. 140: 5-17.

# Raynal A, Tuphile K, Gerbaud C, Luther T, Guérineau M, Pernodet JL ; Laboratoire de Biologie et Génétique Moleculaire, Institut de Génétique et Microbiologie, URA CNRS 2225, Université Paris-Sud, Orsay, France. Mol Microbiol 1998 Apr;28(2):333-42 # Raynald A. Tuphile, K. Gerbaud, C., Luther, T. Guerineau, M. & PERNODET, J. L. (1998). Structure of the chromosomal insertion site for pSAM2 : functional analysis in Escherichia coli. Mol. Microbiol 28,333-42. # Raynal A, Tuphile K, Gerbaud C, Luther T, Guerineau M, Pernodet JL; Laboratory of Biology and Molecular Genetics, Institute of Genetics and Microbiology, URA CNRS 2225, University Paris-Sud, Orsay, France. Mol Microbiol 1998 Apr; 28 (2): 333-42 # Raynald A. Tuphile, K. Gerbaud, C., Luther, T. Guerineau, M. & PERNODET, J. L. (1998). Structure of the chromosomal insertion site for pSAM2: functional analysis in Escherichia coli. Mol. Microbiol 28,333-42.

# Richard, G. M. 1974. Modifications of the diphenylamine reaction giving increased sensitivity and simplicity in the estimation of DNA. Analytical Biochem. 57:369-376. # Richard, G. M. 1974. Modifications of the diphenylamine reaction in the estimation of DNA. Analytical Biochem. 57: 369-376.

# Romanowski, G., M. G. Lorentz, and W. Wackernagel. 1993. Use of polymerase chain reaction and electroporation of Escherichia coli to monitor the persistence of extracellular plasmid DNA introduced into natural soils. Appl. Environ. Microbiol. 59 :3438-3446. # Romanowski, G., M. G. Lorentz, and W. Wackernagel. 1993. Use of polymerase chain reaction and electroporation of Escherichia coli to monitor the persistence of extracellular plasmid DNA. Appl. About. Microbiol. 59: 3438-3446.

# Saitou N., Nei M. (1987) " The Neighbour-Joining method : a new method for reconstructing phylogentic trees " Molecular and Biological Evolution Vol 2 : 112-118 # Sambrook J., Fritsch E. F. et Maniatis T. 1996. Molecular cloning : a laboratory manual, 2nd ed. Cold spring Harbor Laboratory Press, Cold Sring Harbor, N.Y. # Saitou N., Nei M. (1987) "The Neighbor-Joining method: a new method for reconstructing phylogentic trees" Molecular and Biological Evolution Vol 2: 112-118 # Sambrook J., Fritsch EF and Maniatis T. 1996. Molecular cloning: a laboratory manual, 2nd ed. Cold Spring Harbor Laboratory Press, Cold Sring Harbor, N.Y.

#Sambrook, J., E. F. Fritsch, and T. Maniatis. 1989. Molecular cloning: a laboratory manual, 2nd ed. Cold Spring Harbor Laboratory, Cold Spring Harbor, N. Y. #Sambrook, J., F. F. Fritsch, and T. Maniatis. 1989. Molecular cloning: a laboratory manual, 2nd ed. Cold Spring Harbor Laboratory, Cold Spring Harbor, N. Y.

# Senoo K ., Wada H. (1989) " Isolation and identification of an aerobic ? - HCH-decomposing bacterium from soil " Soil Science, Plant Nutrition Vol 35, No 1 : 79-87.# Sezonov, G., Blanc, V., Bamas-Jacques, N., Friedmann, A. # Senoo K., Wada H. (1989) "Soil Science, Plant Nutrition Vol 35, No. 1: 79-87." Sezonov, G., White, V., Bamas-Jacques, N., Friedmann, A.

Pernodet, J. L. & Guerineau, M.(1997). Complete conversion of antibiotic precursor to pristinamycin IIA by overexpression of Streptomyces pristinae biosynthetic genes. Nat Biotechnol 15,349-53. Pernodet, J.L. & Guerineau, M. (1997). Complete conversion of precursor antibiotic to pristinamycin IIA by overexpression of Streptomyces pristinae biosynthetic genes. Nat Biotechnol 15,349-53.

' Shirling, E. B., and D. Gottlieb. 1966. Methods for characterization of Streptomyces species. Int. J. Syst. Bacteriol. 16:313-340. Shirling, E. B., and D. Gottlieb. 1966. Methods for characterization of Streptomyces species. Int. J. Syst. Bacteriol. 16: 313-340.

Shizuga et al., 1992, Proc. Natl. Acad. Sci USA, 89 : 8794-8797.# Siefert, J. L., and G. E. Fox. 1998. Phylogenetic mapping of bacterial morphology. Microbiology 144 :2803-2808. Shizuga et al., 1992, Proc. Natl. Acad. Sci USA, 89: 8794-8797, Siefert, J.L., and G.E. Fox. 1998. Phylogenetic mapping of bacterial morphology. Microbiology 144: 2803-2808.

# Simonet, P., P. Normand, A. Moiroud, and R. Bardin. 1990. # Simonet, P., Normand P., A. Moiroud, and R. Bardin. 1990.

Identification of Frankia strains in nodules by hybridization of polymerase chain reaction products with strain-specific oligonucleotide probes. Arch. Identification of Frankia strains in nodules by hybridization of polymerase chain reaction products with strain-specific oligonucleotide probes. Arch.

Microbiol. 153 :235-240. Microbiol. 153: 235-240.

# Smalla, K., N. Cresswell, L. Mendonca-Hagler, A. Wolters, and D. J. van Elsas. 1993. Rapid DNA extraction protocol from soil for polymerase chain reaction-mediated amplification. J. Appl. Bacteriol. 74:78-85. # Smalla, K., N. Cresswell, L. Mendonca-Hagler, A. Wolters, and D. J. van Elsas. 1993. Rapid DNA extraction protocol from soil for polymerase chain reaction-mediated amplification. J. Appl. Bacteriol. 74: 78-85.

# Smit, E., P. Leeflang, and K. Wernars. 1997. Detection of shifts in microbial community structure and diversity in soil caused by copper contamination using amplified ribosomal DNA restriction analysis. FEMS Microbiol. Ecol. 23 :249-261. # Smit, E., P. Leeflang, and K. Wernars. 1997. Detection of shifts in microbial community structure and diversity in soil by copper contamination using amplified ribosomal DNA restriction analysis. FEMS Microbiol. School. 23: 249-261.

# Smokvina T, Mazodier P, Boccard F, Thompson CJ, Guerineau M. # Smokvina T, Mazodier P, Boccard F, Thompson CJ, Guerineau M.

Laboratoire de Biologie et Genetique Moleculaire, Universite Paris-Sud, Orsay, France. Gene 1990 Sep 28;94(1):53-9 # Smolvina, T., Mazodier, P. Boccard, F. Thompson, C. J. & Guerineau, M. (1990). Construction of a series of pSAM2-based integrative vectors for use in actinomycetes. Gene 94,53-9. Laboratory of Biology and Molecular Genetics, University Paris-Sud, Orsay, France. Gene 1990 Sep 28; 94 (1): 53-9. Smolvina, T., Mazodier, P. Boccard, F. Thompson, C.J. & Guerineau, M. (1990). Construction of a series of pSAM2-based integrative vectors for use in actinomycetes. Gene 94.53-9.

# Stackebrandt, E. 1988. Phylogenetic relationships vs. phenotypic diversity : how to achieve a phylogenetic classification system of the eubacteria. Can. J. Microbiol. 34 :552-556. # Stackebrandt, E. 1988. Phylogenetic relationships vs. phenotypic diversity: how to achieve a phylogenetic classification system of eubacteria. Can. J. Microbiol. 34: 552-556.

# Staneck, J. L., and G. D. Roberts. 1974. Simplified approach to identification of aerobic Actinomycetes by thin-layer chromatography. # Staneck, J.L., and G. D. Roberts. 1974. Simplified approach to aerobic identification Actinomycetes by thin-layer chromatography.

Appl. Microbiol. 28 :226-231. Appl. Microbiol. 28: 226-231.

# Stapleton, R. D., S. Ripp, L. Jimenez, S. Cheol-Koh, J. T. Fleming, I. # Stapleton, R. D., S. Ripp, L. Jimenez, S. Cheol-Koh, J. T. Fleming, I.

R. Gregory, and G. S. Sayler. 1998. Nucleic acid analytical approaches in bioremediation : assessment and characterization. J. Microbiol. R. Gregory, and G. S. Sayler. 1998. Nucleic acid analytical approaches in bioremediation: assessment and characterization. J. Microbiol.

Methods 32:165-178. Methods 32: 165-178.

# Steffan, R. J., J. Goksoyr, A. K. Bej, and R. Atlas. 1988. Recovery of DNA from soils and sediments. Appl. Environ. Microbiol. 54 :2908-2915. # Steffan, R.J., J. Goksoyr, A.K. Bej, and R. Atlas. 1988. Recovery of DNA from soils and sediments. Appl. About. Microbiol. 54: 2908-2915.

# Tebbe, C. C., and W. Vahjen. 1993. Interference of humic acids and DNA extracted directly from soil in détection and transformation of recombinant DNA from bacteria and a yeast. Appl. Environ. Microbiol. # Tebbe, C. C., and W. Vahjen. 1993. Interference of Humic Acids and DNA Extract from the Field of Detection and Transformation of Recombinant DNA from Bacteria and Yeast. Appl. About. Microbiol.

59:2657-2665. 59: 2657-2665.

# Tercero JA, Espinosa JC, Lacalle RA, Jimenez A . Centro de Biologia Molecular Severo Ochoa, Consejo Superior de Investigaciones Cientificas, Madrid, Spain. J Biol Chem 1996 Jan 19;271(3):1579-90 # Thomas J-C., Berger F., Jacquier M., Bernillon D., Baud-Grasset F., Truffaut N., Normand P., Vogel T .M., Simonet P. (1996) " Isolation and Characterisation of a novel ?-Hexachlorocyclohexane-degrading bacterium " Journal of Bacteriology Vol 178, No20 : 6049-6055 #Torsvik, V. L. 1980. Isolation of bacterial DNA from soil. Soil Biol. # Tercero JA, JC Espinosa, Lacalle RA, Jimenez A. Centro de Biologia Molecular Severo Ochoa, Consejo Superior de Investigaciones Cientificas, Madrid, Spain. J Biol Chem 1996 Jan 19; 271 (3): 1579-90 # Thomas JC., Berger F., Jacquier M., Bernier D., Baud-Grasset F., Truffaut N., Normand P., Vogel T.M Simonet P. (1996) "Isolation and Characterization of a novel? -Hexachlorocyclohexane-degrading bacterium" Journal of Bacteriology Vol 178, No20: 6049-6055 #Torsvik, VL 1980. Isolation of bacterial DNA from soil. Soil Biol.

Biochem. 12:15-21. Biochem. 12: 15-21.

# Torsvik, V., R. Sorheim, and J. Goksoyr. 1996. Total bacterial diversity in soil and sediment communities - a review. J. Ind. Microbiol. 17 :170- 178. # Torsvik, V., R. Sorheim, and J. Goksoyr. 1996. Total bacterial diversity in soil and sediment communities - a review. J. Ind. Microbiol. 17: 170-178.

# Tsai, Y.-L., and B. Oison. 1991. Rapid method for direct extraction of DNA from soil and sediments. Appl. Environ. Microbiol. 57:1070-1074. # Tsai, Y.-L., and B. Oison. 1991. Rapid method for direct extraction of DNA from soil and sediments. Appl. About. Microbiol. 57: 1070-1074.

# Umeyama T., Tanabe Y., Aigle B. D. et Horinuochi S., FEMS (1996) 144 :177-184. # Umeyama T., Tanabe Y., Eagle B. D. and Horinuochi S., FEMS (1996) 144: 177-184.

# Van Elsas, J. D., G. F. Duarte, A. S. Rosado, and K. Smalla. 1998. # Van Elsas, J. D., F. Duarte G., A. Rosado S., and K. Smalla. 1998.

Microbiological and molecular biological methods for monitoring microbial inoculants and their effects in the soil environment. J. Microbiol. Methods 32:133-154. Microbiological and molecular biological methods for monitoring microbial inoculants and their effects in the soil environment. J. Microbiol. Methods 32: 133-154.

#Van Elsas, J. D., V. Mântynen, and A. C. Wolters. 1997. Soil DNA extraction and assessment of the fate of Mycobacterium chlorophenolicum strain PCP-1 in différent soils by 16S ribosomal RNA gene séquence based most-probable-number PCR and immunofluorescence. Biol. Fert. #Van Elsas, J.D., V. Mântynen, and A. C. Wolters. 1997. Soil DNA extraction and assessment of the fate of Mycobacterium chlorophenolicum strain PCP-1 in different soils by 16S ribosomal RNA gene sequence based most-likely-number PCR and immunofluorescence. Biol. Fert.

Soils 24:188-195. Soils 24: 188-195.

# Volff JN et al., 1996, Mol. Microbiol., 21(5) : 1037-1047. # Volff JN et al., 1996, Mol. Microbiol., 21 (5): 1037-1047.

# Volossiouk, T., E. J. Robb, and R. N. Nazar. 1995. Direct DNA extraction for PCR-mediated assays. Appl. Environ. Microbiol. 61:3972- 3976. # Volossiuk, T., E. J. Robb, and R. N. Nazar. 1995. Direct DNA extraction for PCR-mediated assays. Appl. About. Microbiol. 61: 3972-3976.

. Wahl GM, Lewis KA, Ruiz JC, Rothenberg B, Zhao J, Evans GA. . Wahl GM, Lewis KA, JC Ruiz, Rothenberg B, Zhao J, Evans GA.

Proc Natl Acad Sci U S A 1987 Apr;84(8):2160-4 # Waksman, S. A. 1961. Williams and Wilkins (ed.) The actinomycetes. Proc Natl Acad Sci U S A 1987 Apr; 84 (8): 2160-4 # Waksman, S.A. 1961. Williams and Wilkins (ed.) The actinomycetes.

Classification, identification and description of genera and species.Vol 2. Classification, identification and description of genera and species.Vol 2.

Baltimore. Baltimore.

# Ward, D. M., R. Weller, and M. M. Bateson. 1990.16S rRNA séquences reveal numerous uncultured microorganisms in a natural community. Nature 344:63-65. # Ward, MD, R. Weller, and M. M. Bateson. 1990.16S rRNA sequences reveal numerous uncultured microorganisms in a natural community. Nature 344: 63-65.

# Widmer, F., R. J. Seidler, and L. S. Watrud. 1996. Sensitive détection of transgenic plant marker gene persistence in soil microcosms. Mol. Ecol. # Widmer, F., R. J. Seidler, and L. S. Watrud. 1996. Sensitive detection of transgenic plant marker gene persistence in soil microcosms. Mol. School.

5 :603-613. 5: 603-613.

# Williams, S. T., R. Locci, A. Beswick, D. I. Kurtbôke, V. D. Kuznetsov, F. J. Le Monnier, P. F. Long, K. A. Maycroft, R. A. Palma, B. Petrolini, S. Quaroni, J. I. Todd, and M. West. 1993. Detection and identification of novel actinomycetes. Res. Microbiol. 144 :653-656. # Williams, S. T., R. Locci, A. Beswick, D.I. Kurtbouke, V.D. Kuznetsov, F.M. Monnier, P.F. Long, K.A. Maycroft, R.A. Palma, B.Pilrolini, S.Miloni, J.I. Todd, and M.West. 1993. Detection and identification of novel actinomycetes. Res. Microbiol. 144: 653-656.

# Wilson, I. G. 1997. Inhibition and facilitation of nucleic acid amplification. Appl. Environ. Microbiol. 63 :3741-3751. # Wilson, I. G. 1997. Inhibition and facilitation of nucleic acid amplification. Appl. About. Microbiol. 63: 3741-3751.

#Woese, C. R. 1987. Bacterial evolution. Microbiol. Rev. 51:221-271. #Woese, C.R. 1987. Bacterial evolution. Microbiol. Rev. 51: 221-271.

Yannish-Perron et al., 1985 , Gene, 33(1) : 103-119. Yannish-Perron et al., 1985, Gene, 33 (1): 103-119.

#Zaslavsky, B. Y. 1995. Separation of biomolecules, p. 503-667. ln Aqueous two-phase partitioning. Boris Y. Zaslavsky (ed.) Physical Chemistry and Bioanalytical Applications, Marcel Dekker, Inc., New York. #Zaslavsky, B. Y. 1995. Separation of biomolecules, p. 503-667. Aqueous two-phase partitioning. Boris Y. Zaslavsky (ed.) Physical Chemistry and Bioanalytical Applications, Marcel Dekker, Inc., New York.

# Zhou, J., M. A. Bruns, and J. M. Tiedje. 1996. DNA recovery from soils of diverse composition. Appl. Environ. Microbiol. 62 :316-322. # Zhou, J., M. A. Bruns, and J. M. Tiedje. 1996. DNA recovery from soils of diverse composition. Appl. About. Microbiol. 62: 316-322.

LISTE DE SEQUENCES <110> Rhône Poulenc Rorer S.A. SEQUENCE LIST <110> Rhône Poulenc Rorer S.A.

<120> Procédé d'obtention d'acides nucléiques à partir d'un échantillon de sol, acides nucléiques ainsi obtenus et leur application à la synthèse de composés <130> Banque d'ADN du sol - RPR S.A. <120> Method for obtaining nucleic acids from a soil sample, nucleic acids thus obtained and their application to the synthesis of compounds <130> Soil DNA library - RPR S.A.

<140> <141> <160> 106 <170> PatentIn Ver. 2.1 <210> 1 <211> 15 <212> ADN <213> Séquence artificielle <220> <223> Description de la séquence artificielle:sonde
FGPS431 <220> <221> variation <222> (14) <223> Base A remplacée par G <400> 1 acgggcggtg tgtac 15 <210> 2 <211> 22 <212> ADN <213> Séquence artificielle <220> <223> Description de la séquence artificielle:amorce
FGPS122 <400> 2 ggagagtttg atcatggctc ag 22 <210> 3 <211> 20 <212> ADN <213> Séquence artificielle <220> <223> Description de la séquence artificielle:amorce
FGPS350 <400> 3 cctggagtta agccccaagc 20 <140><141><160> 106 <170> PatentIn Ver. 2.1 <210> 1 <211> 15 <212> DNA <213> Artificial sequence <220><223> Description of the artificial sequence: probe
FGPS431 <220><221> variation <222> (14) <223> Base A replaced by G <400> 1 acggcggtg tgtac <210> 2 <211> 22 <212> DNA <213> Artificial sequence <220><223> Description of the artificial sequence: primer
FGPS122 <400> 2 ggagagtttg atcatggctc ag 22 <210> 3 <211> 20 <212> DNA <213> Artificial sequence <220><223> Description of the artificial sequence: primer
FGPS350 <400> 3 cctggagtta agcccacaagc 20

<210> 4 <211> 24 <212> ADN <213> Séquence artificielle <220> <223> Description de la séquence artificielle:sonde
FGPS643 <220> <221> variation <222> (20) <223> T remplacée par C <400> 4 gtgagtnnna acctgcccct gact 24 <210> 5 <211> 21 <212> ADN <213> Séquence artificielle <220> <223> Description de la séquence artificielle:sonde
FGPS643-2 <400> 5 gtgagtaacc tgcccccgac t 21 <210> 6 <211> 23 <212> ADN <213> Séquence artificielle <220> <223> Description de la séquence artificielle:amorce
R499 <400> 6 ttaattcact tgcaactgat ggg 23 <210> 7 <211> 23 <212> ADN <213> Séquence artificielle <220> <223> Description de la séquence artificielle:amorce
R500 <400> 7 aacgatagct cctacatttg gag 23 <210> 8 <211> 25 <210> 4 <211> 24 <212> DNA <213> Artificial sequence <220><223> Description of the artificial sequence: probe
FGPS643 <220><221> variation <222> (20) <223> T replaced by C <400> 4 gtgagtnnna acctgcccct gact 24 <210> 5 <211> 21 <212> DNA <213> Artificial sequence <220><223> Description of the artificial sequence: probe
FGPS643-2 <400> 5 gtgagtaacc tgcccccgac t 21 <210> 6 <211> 23 <212> DNA <213> Artificial sequence <220><223> Description of the artificial sequence: primer
R499 <400> 6 ttaattcact tgcaactgat ggg 23 <210> 7 <211> 23 <212> DNA <213> Artificial sequence <220><223> Description of the artificial sequence: primer
R500 <400> 7 aacgatagct cctacatttg gag 23 <210> 8 <211> 25

<212> ADN <213> Séquence artificielle <220> <223> Description de la séquence artificielle:sonde C501 <400> 8 ttgctgatac ggtatagaac ctggc 25 <210> 9 <211> 20 <212> ADN <213> Séquence artificielle <220> <223> Description de la séquence artificielle:amorce
FGPS516 <400> 9 tccagatcct tgacccgcag 20 <210> 10 <211> 20 <212> ADN <213> Séquence artificielle <220> <223> Description de la séquence artificielle:amorce
FGPS517 <400> 10 cacgacattg cactccaccg 20 <210> 11 <211> 16 <212> ADN <213> Séquence artificielle <220> <223> Description de la séquence artificielle:sonde
FGPS518 <400> 11 ccgtgagccg gatcag 16 <210> 12 <211> 20 <212> ADN <213> Séquence artificielle <220> <223> Description de la séquence artificielle:FGPS612 <220> <221> variation <222> (2) <212> DNA <213> Artificial sequence <220><223> Description of the artificial sequence: C501 probe <400> 8 ttgctgatac ggtatagaac ctggc <210> 9 <211> 20 <212> DNA <213> Artificial sequence <220 ><223> Description of the artificial sequence: primer
FGPS516 <400> 9 tccagatcct tgacccgcag 20 <210> 10 <211> 20 <212> DNA <213> Artificial sequence <220><223> Description of the artificial sequence: primer
FGPS517 <400> 10 cacgacattg cactccaccg 20 <210> 11 <211> 16 <212> DNA <213> Artificial sequence <220><223> Description of the artificial sequence: probe
FGPS518 <400> 11 ccgtgagccg gatcag 16 <210> 12 <211> 20 <212> DNA <213> Artificial sequence <220><223> Description of the artificial sequence: FGPS612 <220><221> variation <222> (2 )

<223> Base C remplacée par T <220> <221> variation <222> (7) <223> Base T remplacée par C <220> <221> variation <222> (7) <223> Base T remplacée par A <400> 12 ccaacttcgt gccagcagcc 20 <210> 13 <211> 21 <212> ADN <213> Séquence artificielle <220> <223> Description de la séquence artificielle:FGPS669 <220> <221> variation <222> (7) <223> Base A remplacée par G <220> <221> variation <222> (13) <223> Base A remplacée par C <400> 13 gacgtcatcc ccaccttcct c 21 <210> 14 <211> 18 <212> ADN <213> Séquence artificielle <220> <223> Description de la séquence artificielle:FGPS618 <220> <221> variation <222> (5) <223> Base T remplacée par C <400> 14 atggttgtcg tcagctcg 18 <210> 15 <211> 21 <212> ADN <213> Séquence artificielle <223> Base C replaced by T <220> <221> variation <222> (7) <223> Base T replaced by C <220> <221> variation <222> (7) <223> Base T replaced by A <400> 12 ccaacttcgt gccagcagcc 20 <210> 13 <211> 21 <212> DNA <213> Artificial sequence <220> <223> Description of the artificial sequence: FGPS669 <220> <221> variation <222> (7) <223> Base A replaced by G <220> <221> variation <222> (13) <223> Base A replaced by C <400> 13 gacgtcatcc ccaccttcct c 21 <210> 14 <211> 18 <212> DNA < 213> Artificial sequence <220> <223> Description of the artificial sequence: FGPS618 <220> <221> Variation <222> (5) <223> Base T replaced by C <400> 14 atggttgtcg tcagctcg 18 <210> 15 < 211> 21 <212> DNA <213> Artificial sequence

<220> <223> Description de la séquence artificielle:FGPS614 <400> 15 gtgtagaagt gaaattcgat t 21 <210> 16 <211> 18 <212> ADN <213> Séquence artificielle <220> <223> Description de la séquence artificielle:FGPS615 <400> 16 cggtggatga tgtggatt 18 <210> 17 <211> 18 <212> ADN <213> Séquence artificielle <220> <223> Description de la séquence artificielle:FGPS616 <400> 17 aggttaaaac tcaaatga 18 <210> 18 <211> 18 <212> ADN <213> Séquence artificielle <220> <223> Description de la séquence artificielle:FGPS621 <400> 18 atacgtaggt ggcaagcg 18 <210> 19 <211> 19 <212> ADN <213> Séquence artificielle <220> <223> Description de la séquence artificielle:FGPS617 <400> 19 gccggggtca actcggagg 19 <210> 20 <211> 18 <212> ADN <213> Séquence artificielle <220> <223> Description of the artificial sequence: FGPS614 <400> 15 gtgtagaagt gaaattcgat t 21 <210> 16 <211> 18 <212> DNA <213> Artificial sequence <220> <223> Description of the artificial sequence: FGPS615 <400> 16 cggtggatga tgtggatt 18 <210> 17 <211> 18 <212> DNA <213> Artificial sequence <220> <223> Description of the artificial sequence: FGPS616 <400> 17 aggttaaaac tcaaatga 18 <210> 18 < 211> 18 <212> DNA <213> Artificial sequence <220> <223> Description of the artificial sequence: FGPS621 <400> 18 atacgtaggt ggcaagcg 18 <210> 19 <211> 19 <212> DNA <213> Artificial sequence < 220> <223> Description of the artificial sequence: FGPS617 <400> 19 gccggggtca actcggagg 19 <210> 20 <211> 18 <212> DNA <213> Artificial sequence

<220> <223> Description de la séquence artificielle:FGPS680 <220> <221> variation <222> (11) <223> Base A remplacée par C <220> <221> variation <222> (11) <223> Base A remplacée par T <220> <221> variation <222> (13) <223> Base T remplacée par A <400> 20 tgagtcccca actccccg 18 <210> 21 <211> 20 <212> ADN <213> Séquence artificielle <220> <223> Description de la séquence artificielle:FGPS619 <400> 21 gcttggggct taactccagg 20 <210> 22 <211> 21 <212> ADN <213> Séquence artificielle <220> <223> Description de la séquence artificielle:amorce 63f <400> 22 caggcctaac acatgcaagt c 21 <210> 23 <211> 18 <212> ADN <213> Séquence artificielle <220> <223> Description de la séquence artificielle:amorce
1387r <400> 23 gggcggngtg tacaaggc 18 <210> 24 <220><223> Description of the artificial sequence: FGPS680 <220><221> variation <222> (11) <223> Base A replaced by C <220><221> variation <222> (11) <223> Base A replaced by T <220><221> variation <222> (13) <223> Base T replaced by A <400> 20 tgagtcccca actccccg 18 <210> 21 <211> 20 <212> DNA <213> Artificial sequence <220><223> Description of the artificial sequence: FGPS619 <400> 21 gcttggggct taactccagg <210> 22 <211> 21 <212> DNA <213> Artificial sequence <220><223> Description of the artificial sequence: primer 63f <400> 22 caggcctaac acatgcaagt c 21 <210> 23 <211> 18 <212> DNA <213> Artificial sequence <220><223> Description of the artificial sequence: primer
1387r <400> 23 gggcggngtg tacaaggc 18 <210> 24

<211> 30 <212> ADN <213> Séquence artificielle <220> <223> Description de la séquence artificielle:oligo-1 <400> 24 gcttatttaa atattaagcg gccgcccggg 30 <210> 25 <211> 28 <212> ADN <213> Séquence artificielle <220> <223> Description de la séquence artificielle:oligo-2 <400> 25 cccgggcggc cgcattaata tttaaata 28 <210> 26 <211> 23 <212> ADN <213> Séquence artificielle <220> <223> Description de la séquence artificielle:amorce al <400> 26 ccncagnagc gcntnttnct nga 23 <210> 27 <211> 22 <212> ADN <213> Séquence artificielle <220> <223> Description de la séquence artificielle:amorce a2 <400> 27 gtnccngtnc cgtgngtntc na 22 <210> 28 <211> 23 <212> ADN <213> Séquence artificielle <220> <223> Description de la séquence artificielle:amorce bl <400> 28 ccncagnagc gcntnctnct nga 23 <210> 29 <211> 30 <212> DNA <213> Artificial sequence <220> <223> Description of the artificial sequence: oligo-1 <400> 24 gcttatttaa atattaagcg gccgcccggg 30 <210> 25 <211> 28 <212> DNA <213 > Artificial sequence <220> <223> Description of the artificial sequence: oligo-2 <400> 25 cccgggcggc cgcattaata tttaaata 28 <210> 26 <211> 23 <212> DNA <213> Artificial sequence <220> <223> Description of the artificial sequence: primer al <400> 26 ccncagnagc gcntnttnct nga 23 <210> 27 <211> 22 <212> DNA <213> Artificial sequence <220> <223> Description of the artificial sequence: primer a2 <400> 27 gtnccngtnc cgtgngtntc na 22 <210> 28 <211> 23 <212> DNA <213> Artificial sequence <220> <223> Description of the artificial sequence: primer bl <400> 28 ccncagnagc gcntnctnct nga 23 <210> 29

<211> 22 <212> ADN <213> Séquence artificielle <220> <223> Description de la séquence artificielle:amorce b2 <400> 29 gtnccngtnc cgtgngcctc na 22 <210> 30 <211> 672 <212> ADN <213> Streptomyces ambofaciens <400> 30 ccccagcagc acgtgttcct cgagacggtg tgggagacct tcgaatccgc cggagtggac 60 ccgcgcgcgg tacgcggtcg ttccgtcggg atgttcgtcg gcaccaacgg acaggactac 120 ccggtggtgt tggccggatc cgccgacgag ggcctggacg cccacgcggc caccggtaac 180 gcggcggcgg tgctgtccgg ccgggtctcg tacgccttcg gcctggaagg gccggcggtc 240 accgtcgaca cggcgtgttc gtcgtcgctg gtggcccttc acctggccgc gcaggcgctg 300 cggcgcggcg agtgcgatct ggcactcgcc ggcggtgtgt cggagatgtc caccgaggcg 360 gcgttcaccg agttcgcccg gcagggcggc ctggccgacg acggccgctg caaggccttc 420 tcggccgacg ccgacggcac gggctggggc gagggcgtcg gcgtcctgct ggtggagcgg 480 ctggcggacg cccgccgcaa cgggcaccgg gccctcgcgc tggtacgggg cagcgcggtc 540 aaccaggacg gcgcctccaa cggtctgacg gcacccaacg gcccgtccca gcagcgagtc 600 atccggcagg cactggcgga cgcccggctg tcgccgtcgg aggtcgacgc ggtcgagacc 660 cacggcaccg gc 672 <210> 31 <211> 665 <212> ADN <213> Streptomyces ambofaciens <400> 31 ccccagcagc gcgtgttcct ggaagcgtcc tgggaggcgg tcgagcgggc aggcatcgac 60 atgcgcaccc tgcgcggtgg acgcaccggc gtcttcgccg gcgtgatgta ccacgactac 120 ccgtcggtgg tcgaccccga agcgctcgac ggctacctgg gcacggccaa cgccggcagc 180 gttctctccg gccgcatcgc ctacaccttc gggcttcagg gaccggcggt caccgtggac 240 acggcctgct cctcgtccct ggtggcgctg cacctcgccg cccaggcgct gcccgccggc 300 gagtgcgaac tcgccctggt cggtggggtc acggtcatgt ccggcccgat gatgttcgcg 360 ggcttcggcc tggaagacgg ctctgccgcc gacggccgct gcaaggcgtt cgccgccgcc 420 gccgacggca ccggctgggg cgagggtgtc ggtgtgctgc tggtggagcg gctgtcggac 480 gcccggcgcc acgggcaccg ggtgctggcc gtggtgcgcg gtagcgcggt caaccaggac 540 ggtgcctccg gcggcctcac cgcccccaac ggacctgccc agcagcgcgt catccgtcag 600 gccctggcga gcgcggcact cgtaccggcc gaggtcgacg cggtcgagac ccacggcacc 660 gggac 665 <210> 32 <211> 671 <212> ADN <213> Saccharopolyspora erythraea <400> 32 ccgcaggagc gcgtgttcct ggaactcgct tgggaagcac ttgataacgc gggcatcgca 60 ccgcacagcc tcagggacag ccggacgggc gtgttcttcg gagctatgtg gcacggctac 120 gcgcagttcg cagccggagc cgtcgaccgc atcacccagc acaccgcgac cgggcacgac 180 <211> 22 <212> DNA <213> Artificial sequence <220> <223> Description of the artificial sequence: primer b2 <400> 29 gtnccngtnc cgtgngcctc na 22 <210> 30 <211> 672 <212> DNA <213> Streptomyces ambofaciens <400> 30 ccccagcagc acgtgttcct cgagacggtg tgggagacct tcgaatccgc cggagtggac 60 ccgcgcgcgg tacgcggtcg ttccgtcggg atgttcgtcg gcaccaacgg acaggactac 120 ccggtggtgt tggccggatc cgccgacgag ggcctggacg cccacgcggc caccggtaac 180 gcggcggcgg tgctgtccgg ccgggtctcg tacgccttcg gcctggaagg gccggcggtc 240 accgtcgaca cggcgtgttc gtcgtcgctg gtggcccttc acctggccgc gcaggcgctg 300 cggcgcggcg agtgcgatct ggcactcgcc ggcggtgtgt cggagatgtc caccgaggcg 360 gcgttcaccg agttcgcccg gcagggcggc ctggccgacg acggccgctg caaggccttc 420 tcggccgacg ccgacggcac gggctggggc gagggcgtcg gcgtcctgct ggtggagcgg 480 ctggcggacg cccgccgcaa cgggcaccgg gccctcgcgc tggtacgggg cagcgcggtc 540 aaccaggacg gcgcctccaa cggtctgacg gcacccaacg gcccgtccca gcagcgagtc 600 atccggcagg cactggcgga cgcccggctg tcgccgtcgg aggtcgacgc ggtcgagacc 660 cacggcaccg gc 672 <210> 31 <211> 665 <212> DNA <213> Streptomyces ambofaciens <400> 31 ccccagcagc gcgtgttcct ggaagcgtcc tgggaggcgg tcgagcgggc aggcatcgac 60 atgcgcaccc tgcgcggtgg acgcaccggc gtcttcgccg gcgtgatgta ccacgactac 120 ccgtcggtgg tcgaccccga agcgctcgac ggctacctgg gcacggccaa cgccggcagc 180 gttctctccg gccgcatcgc ctacaccttc gggcttcagg gaccggcggt caccgtggac 240 acggcctgct cctcgtccct ggtggcgctg cacctcgccg cccaggcgct gcccgccggc 300 gagtgcgaac tcgccctggt cggtggggtc acggtcatgt ccggcccgat gatgttcgcg 360 ggcttcggcc tggaagacgg ctctgccgcc gacggccgct gcaaggcgtt cgccgccgcc 420 gccgacggca ccggctgggg cgagggtgtc ggtgtgctgc tggtggagcg gctgtcggac 480 gcccggcgcc acgggcaccg ggtgctggcc gtggtgcgcg gtagcgcggt caaccaggac 540 ggtgcctccg gcggcctcac cgcccccaac ggacctgccc agcagcgcgt catccgtcag 600 gccctggcga gcgcggcact cgtaccggcc gaggtcgacg cggtcgagac ccacggcacc 660 gggac 665 <210> 32 <211> 671 <212> DNA <213> Saccharopolyspora erythraea <400> 32 ccgcaggagc gcgtgttcct ggaactcgct tgggaagcac ttgataacgc gggcatcgca 60 ccgcaca gcc tcagggacag ccggacgggc gtgttcttcg gagctatgtg gcacggctac 120 gcgcagttcg cagccggagc cgtcgaccgc atcacccagc acaccgcgac cgggcacgac 180

ctgagcatca tcccggccag gatcgcctac ttcctgggct tgcgcggccc ggacatgacc 240 ctgaacaccg cgtgctcatc ggctttggtg gccatgcacc aggcacgcca aagcatcctg 300 ctgggcgaat cctcggtcgc cttggtcggc gggatcagct tgttggtcgc gctggacagc 360 atggtcgcca tgtcgcggtt cggagcgatg gccccggacg gccggtgcaa ggcattcgac 420 tctcgcgcga acggctacgt gcgcggcgaa ggcggcggtg tcgtggtgct caaaccgctg 480 tcgcgcgctc tggccgatgg caacccggtc tactgcgtcc tgcgcggcag cgcggtcaac 540 aacgacggct tcagcaatgg ccttaccgcg ccgagcccgg cggcgcagga gcaggtactg 600 cgcgacgcct acgccaacgc cggggtcgat ccggcacagg tcgactacgt cgagacccac 660 gggaccggca c 671 <210> 33 <211> 686 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :organisme du sol <400> 33 ccgcaggagc gcgtgttcct cgagtcgtgc tgggaggcgc tggagcatgc tggatacgat 60 actgcacgct accccggccg catcgggctg tgggccggcg cgggcttcaa cagctacctc 120 ctgaccaatc tcatgaacaa ccgcgccttt ttagagagcg tgggcatgta ccagatcttt 180 ctgagcaacg acaaggactt catcgccacc cgcacggctt acaagttaaa cctgcgcggt 240 ccggcgatgg ccgtcggcac cgcctgttcc acatcgctgg tggcggttca cgaagcttgc 300 caggcgctgc ggctgggcga gtgtgacatg gcactggccg gtgctgcgtc tgtcagcacg 360 cccctccggg agggctacct ctaccaggaa ggcatgatta tgagccgtga cggcgtctgc 420 cgcccgtttg acgccgacgc cgatggcacg gtgctgggca atggcgtggc ggtcgtggtg 480 ctcaagcggc tggacgaagc gctccgggac ggtgacacgg tctacgccgt gattcgtggc 540 acggcggtca acaacgacgg ctctgtcaag atcgggttca cggcgcccag cgccgagggg 600 cagagccggg tcgtgcggga cgccctgcgg gcggccgcgg tcccggcgga gagcgtgacc 660 tacgtcgaca cgcacggcac cggcac 686 <210> 34 <211> 689 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :organisme du sol <400> 34 ccccagcagc gcctgttcct cgagtgcgcg tgggaagcga tggagaacgc gggatatgcg 60 gcgcgaagct ataagggttc gatcggcgtt ttcgcgggat gcggcgtcaa tacctacctg 120 ctgaacaacc tcgccaccgc ggagccgttc gatttctcac gcccctccgc gtaccagctg 180 ctgacggcca acgacaagga tttcctggcc acgcgtgtct cttacaagct gaacctccgc 240 gggcccagcc tgacggttca gacggcgtgc tccacctcgc tggtgtcggt ggtgatggca 300 tgcgagagct tgcagcgcgg cgcctcggac attgccttgg ccgggggagt tgccatcaat 360 gttccgcagt ccgtggggta cctgcaccag ccgggcatga tcctgtcgcc cgacgggcgc 420 tgccgcgcct tcgatgagtc cgctcaaggc acggtgccgg gcaacggcgc gggtgtggtc 480 gtcctcaagc gcttgagccg cgctctggcc gatggcgaca cgatctacgc cgtcattcgc 540 ggagcggcta ttaataatga tggcgccgag cgcatggggt ttaccgctcc aggtgtggac 600 ggtcagacgc gattgattcg gcgcactcaa gagatggcgg gcgtgaagcc ggagtccatc 660 ggctacatgg acacccacgg caccggcac 689 <210> 35 <211> 671 <212> ADN ctgagcatca tcccggccag gatcgcctac ttcctgggct tgcgcggccc ggacatgacc 240 ctgaacaccg cgtgctcatc ggctttggtg gccatgcacc aggcacgcca aagcatcctg 300 ctgggcgaat cctcggtcgc cttggtcggc gggatcagct tgttggtcgc gctggacagc 360 atggtcgcca tgtcgcggtt cggagcgatg gccccggacg gccggtgcaa ggcattcgac 420 tctcgcgcga acggctacgt gcgcggcgaa ggcggcggtg tcgtggtgct caaaccgctg 480 tcgcgcgctc tggccgatgg caacccggtc tactgcgtcc tgcgcggcag cgcggtcaac 540 aacgacggct tcagcaatgg ccttaccgcg ccgagcccgg cggcgcagga gcaggtactg 600 cgcgacgcct acgccaacgc cggggtcgat ccggcacagg tcgactacgt cgagacccac 660 gggaccggca c 671 <210> 33 <211> 686 <212> DNA <213> Unknown organism <220> <223> Origin of the sequence: soil organism <400> 33 ccgcaggagc gcgtgttcct cgagtcgtgc tgggaggcgc tggagcatgc tggatacgat 60 actgcacgct accccggccg catcgggctg tgggccggcg cgggcttcaa cagctacctc 120 ctgaccaatc tcatgaacaa ccgcgccttt ttagagagcg tgggcatgta ccagatcttt 180 ctgagcaacg acaaggactt catcgccacc cgcacggctt acaagttaaa cctgcgcggt 240 ccggcgatgg ccgtcggcac cgcctgt cc acatcgctgg tggcggttca cgaagcttgc 300 caggcgctgc ggctgggcga gtgtgacatg gcactggccg gtgctgcgtc tgtcagcacg 360 cccctccggg agggctacct ctaccaggaa ggcatgatta tgagccgtga cggcgtctgc 420 cgcccgtttg acgccgacgc cgatggcacg gtgctgggca atggcgtggc ggtcgtggtg 480 ctcaagcggc tggacgaagc gctccgggac ggtgacacgg tctacgccgt gattcgtggc 540 acggcggtca acaacgacgg ctctgtcaag atcgggttca cggcgcccag cgccgagggg 600 cagagccggg tcgtgcggga cgccctgcgg gcggccgcgg tcccggcgga gagcgtgacc 660 tacgtcgaca cgcacggcac cggcac 686 <210> 34 <211> 689 <212> DNA <213> Unknown organ <220> <223> Origin of the sequence: soil organism <400> 34 ccccagcagc gcctgttcct cgagtgcgcg tgggaagcga tggagaacgc gggatatgcg 60 gcgcgaagct ataagggttc gatcggcgtt ttcgcgggat gcggcgtcaa tacctacctg 120 ctgaacaacc tcgccaccgc ggagccgttc gatttctcac gcccctccgc gtaccagctg 180 ctgacggcca acgacaagga tttcctggcc acgcgtgtct cttacaagct gaacctccgc 240 gggcccagcc tgacggttca gacggcgtgc tccacctcgc tggtgtcggt ggtgatggca 300 tgcgagagct tgcagcgcgg cgcctcggac attgccttg g ccgggggagt tgccatcaat 360 gttccgcagt ccgtggggta cctgcaccag ccgggcatga tcctgtcgcc cgacgggcgc 420 tgccgcgcct tcgatgagtc cgctcaaggc acggtgccgg gcaacggcgc gggtgtggtc 480 gtcctcaagc gcttgagccg cgctctggcc gatggcgaca cgatctacgc cgtcattcgc 540 ggagcggcta ttaataatga tggcgccgag cgcatggggt ttaccgctcc aggtgtggac 600 ggtcagacgc gattgattcg gcgcactcaa gagatggcgg gcgtgaagcc ggagtccatc 660 ggctacatgg acacccacgg caccggcac 689 <210> 35 <211> 671 <212> DNA

<213> Organime Inconnu <220> <223> Origine de la séquence :organisme du sol <400> 35 ccgcagcagc gcctcttcct cgaggtggca tgggaagctt tggagcgtgc gggtcggccg 60 cccgacagtc tcgcgggcag cgacaccgga gtgttcatcg ggatcagcac cgacgactac 120 agccggctga aacctaccga tccggcgctc attgacgcct ataccggtac cggaaccgcg 180 ttcagcactg ccgccggacg gatctcctat ctgctggggt tgcagggacc gaacttcccc 240 gtcgacacgg cgtgctcttc ctcactcgtg gcggttcatc tggcgtgccg cagcttgcag 300 tcgcgagagt gcagcatggc gctggccggc ggcgtgaacc tgattctggc gccggaaagc 360 acgatctact tctgccgcct gcgggccatg gcggccgatg gccgttgcaa aagtttcgct 420 gcctccgccg acggttacgg ccgcggcgag ggatgcggaa tgctggtgct gaagcggctg 480 tccgatgcga cgcgtgacgg cgatcgtatt ctggcgctga ttcgcggatc ggccgtcaac 540 cacggcggcc gcagcaacgg cctcacggcg ccgaacggtc cggcgcagga agccgtgatt 600 cgggcggcgc tcaagaacgc cggcatggcc cccgccgatg tcgattacgt ggacacccac 660 ggcaccggca c 671 <210> 36 <211> 758 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :organisme du sol <400> 36 ccgcaggagc gcgtcttcct cgaacgcatt gacggtttcg atgcggaatt cttcggcatc 60 tccccccgcg aagctctgaa catggatccg cagcagcggc tgctgctgga agtgtgctgg 120 gaagcggcag aggacgccgg catctctccc ggccctctgg cgggcagcgc gaccggcgtc 180 tttgccggct cctgcgccca ggacttcgga ctgtttcagt acgccgaccc tgcccgcatc 240 ggagcttggt cgggttccgg cgtggcgcat agcatgttgg ccaatcgcat ctcctatctg 300 ctcgacctgc gcggtccgag catggcggtc gatacggcct gctcctccgc gctcgtcgcc 360 gtccatctgg cttgccaaag cctgcgccgg cgcgaatgcg atgcggcatt cgccggcgga 420 gtgaacttga tcctgactcc cgagggcatg atcgctttgt cgaaggctcg catgttggcg 480 cccgacggac gctgcaagac gttcgacgcc gcagccgacg gttatgtgcg cggcgagggc 540 tgcggcatcg tgctgctgaa gcggctctcc gatgcgctgg ccgatggcga tgccatctgt 600 gcagtcatcc gcggctcggc aatcaatcag gacggacgga gcaatggcat cacggcgccg 660 aatctgcagg cgcagaaggc ggtcctgcaa gaggcggtgg ccaacgcgca catcgatcca 720 tcccacgtat cgttgatcga cacgcacggc accggcac 758 <210> 37 <211> 704 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :organisme du sol <400> 37 ccgcagcagc gcgtgttcct cgagtgcgcc tgggaggcgg tggaaagcgc gggctacgat 60 cccgaaaaat atcccggcct gatcggagtt ttcgccgggg ccagcatcaa cagctatttc 120 ctttataacc tcgcgcacaa ccgggaattc gtcgcccgca tggcggggga gtaccaagtg 180 ggcgagtacc agacgatcct cggaaacgac aaggactacc tccccactcg cgtctcctac 240 aaattgaacc tgcgcggccc cagcctggcc gtgcagtccg cctgctcgac cggcctcgtc 300 gccgtttgtc aggccattca aaatctgcag acttatcagt gcgatatggc cctcgcgggc 360 ggcatctcga tttcgtttcc gcaaaagcgc gactaccgct tcaccgacga aggaatggtc 420 <213> Organime Unknown <220> <223> Origin of sequence: soil organism <400> 35 ccgcagcagc gcctcttcct cgaggtggca tgggaagctt tggagcgtgc gggtcggccg 60 cccgacagtc tcgcgggcag cgacaccgga gtgttcatcg ggatcagcac cgacgactac 120 agccggctga aacctaccga tccggcgctc attgacgcct ataccggtac cggaaccgcg 180 ttcagcactg ccgccggacg gatctcctat ctgctggggt tgcagggacc gaacttcccc 240 gtcgacacgg cgtgctcttc ctcactcgtg gcggttcatc tggcgtgccg cagcttgcag 300 tcgcgagagt gcagcatggc gctggccggc ggcgtgaacc tgattctggc gccggaaagc 360 acgatctact tctgccgcct gcgggccatg gcggccgatg gccgttgcaa aagtttcgct 420 gcctccgccg acggttacgg ccgcggcgag ggatgcggaa tgctggtgct gaagcggctg 480 tccgatgcga cgcgtgacgg cgatcgtatt ctggcgctga ttcgcggatc ggccgtcaac 540 cacggcggcc gcagcaacgg cctcacggcg ccgaacggtc cggcgcagga agccgtgatt 600 cgggcggcgc tcaagaacgc cggcatggcc cccgccgatg tcgattacgt ggacacccac 660 ggcaccggca c 671 <210> 36 <211> 758 <212> DNA <213> Unknown organ <220> <223> Origin of the sequence: soil organism <400> 36 ccgcaggagc gc gtcttcct cgaacgcatt gacggtttcg atgcggaatt cttcggcatc 60 tccccccgcg aagctctgaa catggatccg cagcagcggc tgctgctgga agtgtgctgg 120 gaagcggcag aggacgccgg catctctccc ggccctctgg cgggcagcgc gaccggcgtc 180 tttgccggct cctgcgccca ggacttcgga ctgtttcagt acgccgaccc tgcccgcatc 240 ggagcttggt cgggttccgg cgtggcgcat agcatgttgg ccaatcgcat ctcctatctg 300 ctcgacctgc gcggtccgag catggcggtc gatacggcct gctcctccgc gctcgtcgcc 360 gtccatctgg cttgccaaag cctgcgccgg cgcgaatgcg atgcggcatt cgccggcgga 420 gtgaacttga tcctgactcc cgagggcatg atcgctttgt cgaaggctcg catgttggcg 480 cccgacggac gctgcaagac gttcgacgcc gcagccgacg gttatgtgcg cggcgagggc 540 tgcggcatcg tgctgctgaa gcggctctcc gatgcgctgg ccgatggcga tgccatctgt 600 gcagtcatcc gcggctcggc aatcaatcag gacggacgga gcaatggcat cacggcgccg 660 aatctgcagg cgcagaaggc ggtcctgcaa gaggcggtgg ccaacgcgca catcgatcca 720 tcccacgtat cgttgatcga cacgcacggc accggcac 758 <210> 37 <211> 704 <212> DNA <213> Unknown organism <220> <223> Origin of the sequence: soil organism <400> 37 ccgcagcagc gc gtgttcct cgagtgcgcc tgggaggcgg tggaaagcgc gggctacgat 60 cccgaaaaat atcccggcct gatcggagtt ttcgccgggg ccagcatcaa cagctatttc 120 ctttataacc tcgcgcacaa ccgggaattc gtcgcccgca tggcggggga gtaccaagtg 180 ggcgagtacc agacgatcct cggaaacgac aaggactacc tccccactcg cgtctcctac 240 aaattgaacc tgcgcggccc cagcctggcc gtgcagtccg cctgctcgac cggcctcgtc 300 gccgtttgtc aggccattca aaatctgcag acttatcagt gcgatatggc cctcgcgggc 360 ggcatctcga tttcgtttcc gcaaaagcgc gactaccgct tcaccgacga aggaatggtc 420

tctcgcgacg gtcactgccg cccgttcgac gccagcgcgc aaggcacggt cttcggcaac 480 ggggccggcg tcgtcctgat gaaaagattg gccgacgcag tgaccgatcg ggacacgatc 540 ctcgccgtga ttaggggcgc tgccgtgaac aacgacggcg gcgtcaaaat gggttacacg 600 gcgcccagtg ccgaaggtca ggcggaggcc atcaccctgg ccctcgcgct cgctggcgtc 660 agcccggaga ccatcacttg catggacacc cacggcaccg gcac 704 <210> 38 <211> 680 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :organisme du sol <400> 38 ccccagcagc gcgtgttcct cgaatgcgcc tgggcggcgc tggagcgccg ccggatatca 60 gggcgacacc ttccacggtg tccatcggcg gtctatgcct caagcggctt taacacctat 120 cttctgaacc tgcatgccaa tgccgcggtg cgccaatcga tcagcccgtt tgaactgttc 180 gtcgccaacg acaaggattt tctggcgacg cgcacggctt acaagctcaa tctgcgcggc 240 ccggccatga cagtgcagac ggcctgctcc tcatcgttgg ttgccgttca tgtcgccgcg 300 caaagcctcc tagcgggcga atgcgatatt gcgctcgcgg gcggcatcac ggtttcccgt 360 tcgcatggat atgtggcgcg cgaaggtgga atattgtctc ctgacgggca ttgccgggcg 420 ttcgatgcgg atgccggcgg aaccgttcca ggcagcggcg tcggcgttgt cgtgctcaag 480 cgtctcgaag atgcgcttgc agacggcgat acgatcgacg ccgtcatcat cggttcggcc 540 atcaacaatg atggcgcgct gaaggcgagc tttaccgcac cgcaggtgga cagccaggcc 600 ttggtcatca gcgaggccca tgcagctgcc ggaatatcgg ccgattccat cggttatatg 660 gacacccacg gcaccgggac 680 <210> 39 <211> 671 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :organisme du sol <400> 39 ccgcagcagc gcctcttcct cgagctcacc tgggaagcgc tggaagatgc cggcatcccg 60 ccgtccacga ttgccggcac gaatgtcggc gttttcatgg gcgcgtcgca ggctgactac 120 ggccacaagt tcttcagcga ccacgccgtc gcggattccc atttcgccac cggcacctcg 180 ctggcggtcg tcgccaatcg catttcctac atctacgacc tgcgcggccc aagcctcact 240 gtagacacgg cgtgctcgtc gtcgctcgtc gcgctgcatc aggcggtgga agcgctccgc 300 tcggggcgga tcgaaacagc cattgtcggc ggcattaacg ttatcgccag cccggcgtcc 360 ttcatcgcct tctcgcaggc ctcgatgctg tcgccgacgg ggttgtgcca ggctttctcc 420 gccaaggccg atggctttgt ccgcggcgag ggcggcacgg ttttcgtcct gcgcaaggcg 480 gcgcatgcgc atggcagccg caacccggtg cgcgggctca ttctcgccac cgacgtcaat 540 tccgacgggc gtaccaacgg catctcgctg ccatcggccg aagcgcagga agtcctcctg 600 caacgcgtct attcacgcgc atcgatcgat ccgaaccgcc tggctttcgt cgacacccac 660 gggaccggca c 671 <210> 40 <211> 764 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :organisme du sol tctcgcgacg gtcactgccg cccgttcgac gccagcgcgc aaggcacggt cttcggcaac 480 ggggccggcg tcgtcctgat gaaaagattg gccgacgcag tgaccgatcg ggacacgatc 540 ctcgccgtga ttaggggcgc tgccgtgaac aacgacggcg gcgtcaaaat gggttacacg 600 gcgcccagtg ccgaaggtca ggcggaggcc atcaccctgg ccctcgcgct cgctggcgtc 660 agcccggaga ccatcacttg catggacacc cacggcaccg CCGA 704 <210> 38 <211> 680 <212> DNA <213> Organime unknown <220> <223> Origin of sequence: soil organism <400> 38 ccccagcagc gcgtgttcct cgaatgcgcc tgggcggcgc tggagcgccg ccggatatca 60 gggcgacacc ttccacggtg tccatcggcg gtctatgcct caagcggctt taacacctat 120 cttctgaacc tgcatgccaa tgccgcggtg cgccaatcga tcagcccgtt tgaactgttc 180 gtcgccaacg acaaggattt tctggcgacg cgcacggctt acaagctcaa tctgcgcggc 240 ccggccatga cagtgcagac ggcctgctcc tcatcgttgg ttgccgttca tgtcgccgcg 300 caaagcctcc tagcgggcga atgcgatatt gcgctcgcgg gcggcatcac ggtttcccgt 360 tcgcatggat atgtggcgcg cgaaggtgga atattgtctc ctgacgggca ttgccgggcg 420 ttcgatgcgg atgccggcgg aaccgttcca ggcagcggcg tcggcgttgt cgtgctca ag 480 cgtctcgaag atgcgcttgc agacggcgat acgatcgacg ccgtcatcat cggttcggcc 540 atcaacaatg atggcgcgct gaaggcgagc tttaccgcac cgcaggtgga cagccaggcc 600 ttggtcatca gcgaggccca tgcagctgcc ggaatatcgg ccgattccat cggttatatg 660 gacacccacg gcaccgggac 680 <210> 39 <211> 671 <212> DNA <213> Organime Unknown <220> <223> Origin of the sequence: soil organism <400> 39 ccgcagcagc gcctcttcct cgagctcacc tgggaagcgc tggaagatgc cggcatcccg 60 ccgtccacga ttgccggcac gaatgtcggc gttttcatgg gcgcgtcgca ggctgactac 120 ggccacaagt tcttcagcga ccacgccgtc gcggattccc atttcgccac cggcacctcg 180 ctggcggtcg tcgccaatcg catttcctac atctacgacc tgcgcggccc aagcctcact 240 gtagacacgg cgtgctcgtc gtcgctcgtc gcgctgcatc aggcggtgga agcgctccgc 300 tcggggcgga tcgaaacagc cattgtcggc ggcattaacg ttatcgccag cccggcgtcc 360 ttcatcgcct tctcgcaggc ctcgatgctg tcgccgacgg ggttgtgcca ggctttctcc 420 gccaaggccg atggctttgt ccgcggcgag ggcggcacgg ttttcgtcct gcgcaaggcg 480 gcgcatgcgc atggcagccg caacccggtg cgcgggctca ttctcgccac 540 cgacgtcaat gt tccgacgggc aqccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccbc

<400> 40 ccgcagcagc gcgtgttcct cgacggcatc gaccggttcg atccgcgtca cttcgcgatc 60 acgccgcgcg aggcgatcag catggacccg cagcagcggc tcctgctcga ggtcacgtgg 120 gaagcgctgg agcgcgccgg cgtggcgccc gatcgcctga ccggatccga caccggcgtc 180 ttcatcggca tcagcaccaa cgactacggc cagatcctgc tgcgcgcctc ggaccagatc 240 gatccgggga tgtacttcgg caccggcaac ctgttgaacg cggcggcggg acgcctctcg 300 tacgtcctcg gccgcaggg tccgagcatg gcggtcgaca ccgcatgtcc gtcgtcgctg 360 gtggcgattc atctcgcgtg tcagagcctg cgcaaccgcg agtgccgcat ggcgctcgcc 420 ggcggcgcca acctggtgct cgtcccggaa gtgacggtca actgctgccg cgccaagatg 480 ctcgcgcctg acgggcgctg caagacgttc gacgccgcgg cggacggcta cgtccgcggc 540 gaaggggccg cggtgatcgt gctgaagcgg ctctccgacg cgctggcgga cggcgatccg 600 atcgtcgcgc tgatccgcgg atccgcggtc aatcaggacg gccgcagcgg cggcttcacc 660 gcgccgaacg aactggcgca gcaggcggtg atccggaccg cgctcgcggc agcgggcgtc 720 gccgcgtccg acatcggcta cgtggacacg cacggcaccg ggac 764 <210> 41 <211> 763 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :organisme du sol <400> 41 ccgcagcagc gcgtgttcct cgacggcatc gaccgcttcg atccgcagtt tttcgggatc 60 gcgccgcgcg aagcggccgg catcgatccg cagcagcggc tgctgctcga gacgacgtgg 120 gaagcgctgg aagacgccgg gacgtcgccg gaaaagctgc agggaacccc ggccggcgtg 180 ttcgtcggca tcaacagcat cgactacgcg acgctgcagc tgcagaactg cgatctggcc 240 agcatcgacg cctattcgct ctccggcagc gcgcacagca tcgcggccgg gcggctcgcc 300 tacgtgctcg gcctgcaggg gccggcgatg gcggtcgaca ccgcctgctc gtcgtcgctg 360 gtcgcgatcc acctggcgtg ccagagcctg cgcaacgacg actgccgcgt cgccgtggcc 420 ggcggcgtgc acgtcacgct gacgccgatc aacatggtcg tgttctcgaa gctgcgcatg 480 ctggcggcgg acggcaagtg caagacgttc gacggccgcg gcgacggatt cgtcgaaggc 540 gagggctgcg cggtcatcgt cctcaagcgg ttgtcgcacg cgcttgccga caaggatcgg 600 atcctcgcgc tggtgcgcgg ttcggcggtc aaccaggacg gcgcgagcag cggtctcacc 660 gcgccgaacg gtccggcgca ggaagcggtc atccgcgcgg cgttgaagcg ggccggcgtg 720 cagccggcgg aggtcggcta cgtggacacc cacggcaccg gca 763 <210> 42 <211> 668 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :organisme du sol <400> 42 ccgcaggagc gcgtgctgct ggaatcctcg tggcatgcgc tggaagacgc cggctatgcc 60 ggcgaaagca tcgccggcgc gcgctgcggc gtgtacatgg gcttcaacgg cggcgactac 120 ggcgacctgc tgtacggcca gccgtcgctg ccgccgcacg cgatgtgggg caacgccgcc 180 tcggtgctgt cggcgcgcat cgcctattac ctggacctgc aaggcccggc gatcaccctc 240 gacaccgcct gttcgagctc gttggtcgcg gtgcatctgg cctgccaggg gctgtggacc 300 ggcgagaccg atctggccct ggccggcggc gtgtggatcc agtgcacgcc cggattcctg 360 atctcctcca gccgcgccgg catgctctcg ccgaccggcc agtgccgcgc gttcggcgcc 420 ggcgccgacg gcttcgtgcc gtccgaaggc gtcggcgtgg tcgtgctcaa gcgcctgcag 480 gacgcgctcg acgccggcga ccacatntac ggcgtgatcc gcggcagcgc gatcaaccag 540 gacggcgcca gcaacggcat caccgcgccg agcgccgccg cccaggagcg cttgcagcgc 600 <400> 40 ccgcagcagc gcgtgttcct cgacggcatc gaccggttcg atccgcgtca cttcgcgatc 60 acgccgcgcg aggcgatcag catggacccg cagcagcggc tcctgctcga ggtcacgtgg 120 gaagcgctgg agcgcgccgg cgtggcgccc gatcgcctga ccggatccga caccggcgtc 180 ttcatcggca tcagcaccaa cgactacggc cagatcctgc tgcgcgcctc ggaccagatc 240 gatccgggga tgtacttcgg caccggcaac ctgttgaacg cggcggcggg acgcctctcg 300 tacgtcctcg gccgcaggg tccgagcatg gcggtcgaca ccgcatgtcc gtcgtcgctg 360 gtggcgattc atctcgcgtg tcagagcctg cgcaaccgcg agtgccgcat ggcgctcgcc 420 ggcggcgcca acctggtgct cgtcccggaa gtgacggtca actgctgccg cgccaagatg 480 ctcgcgcctg acgggcgctg caagacgttc gacgccgcgg cggacggcta cgtccgcggc 540 gaaggggccg cggtgatcgt gctgaagcgg ctctccgacg cgctggcgga cggcgatccg 600 atcgtcgcgc tgatccgcgg atccgcggtc aatcaggacg gccgcagcgg cggcttcacc 660 gcgccgaacg aactggcgca gcaggcggtg atccggaccg cgctcgcggc agcgggcgtc 720 gccgcgtccg acatcggcta cgtggacacg cacggcaccg GGAC 764 <210> 41 <211 > 763 <212> DNA <213> Unknown Organ <220> <223> Origin of the sequence: organism u soil <400> 41 ccgcagcagc gcgtgttcct cgacggcatc gaccgcttcg atccgcagtt tttcgggatc 60 gcgccgcgcg aagcggccgg catcgatccg cagcagcggc tgctgctcga gacgacgtgg 120 gaagcgctgg aagacgccgg gacgtcgccg gaaaagctgc agggaacccc ggccggcgtg 180 ttcgtcggca tcaacagcat cgactacgcg acgctgcagc tgcagaactg cgatctggcc 240 agcatcgacg cctattcgct ctccggcagc gcgcacagca tcgcggccgg gcggctcgcc 300 tacgtgctcg gcctgcaggg gccggcgatg gcggtcgaca ccgcctgctc gtcgtcgctg 360 gtcgcgatcc acctggcgtg ccagagcctg cgcaacgacg actgccgcgt cgccgtggcc 420 ggcggcgtgc acgtcacgct gacgccgatc aacatggtcg tgttctcgaa gctgcgcatg 480 ctggcggcgg acggcaagtg caagacgttc gacggccgcg gcgacggatt cgtcgaaggc 540 gagggctgcg cggtcatcgt cctcaagcgg ttgtcgcacg cgcttgccga caaggatcgg 600 atcctcgcgc tggtgcgcgg ttcggcggtc aaccaggacg gcgcgagcag cggtctcacc 660 gcgccgaacg gtccggcgca ggaagcggtc atccgcgcgg cgttgaagcg ggccggcgtg 720 cagccggcgg aggtcggcta cgtggacacc cacggcaccg gca 763 <210> 42 <211> 668 <212> DNA <213> Unknown Organ <220> <223> Origin of the sequence: organ ism soil <400> 42 ccgcaggagc gcgtgctgct ggaatcctcg tggcatgcgc tggaagacgc cggctatgcc 60 ggcgaaagca tcgccggcgc gcgctgcggc gtgtacatgg gcttcaacgg cggcgactac 120 ggcgacctgc tgtacggcca gccgtcgctg ccgccgcacg cgatgtgggg caacgccgcc 180 tcggtgctgt cggcgcgcat cgcctattac ctggacctgc aaggcccggc gatcaccctc 240 gacaccgcct gttcgagctc gttggtcgcg gtgcatctgg cctgccaggg gctgtggacc 300 ggcgagaccg atctggccct ggccggcggc gtgtggatcc agtgcacgcc cggattcctg 360 atctcctcca gccgcgccgg catgctctcg ccgaccggcc agtgccgcgc gttcggcgcc 420 ggcgccgacg gcttcgtgcc gtccgaaggc gtcggcgtgg tcgtgctcaa gcgcctgcag 480 gacgcgctcg acgccggcga ccacatntac ggcgtgatcc gcggcagcgc gatcaaccag 540 gacggcgcca gcaacggcat caccgcgccg agcgccgccg cccaggagcg cttgcagcgc 600

cacgtctacg acagcttcgg catcgacgcc tcgcgcctgc agatgatcga ggcccacggc 660 accggcac 668 <210> 43 <211> 671 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence : :organisme du sol <400> 43 ccgcaggagc gcgtgctgct ggaggtgact tgggaggcac tcgaagacgc cggccaagac 60 gtggaccgtc tggccgggcg gcccgtcggc gtcttcgtcg ggatctcgtc gaacgattac 120 ggccagcttc agaacggcga cccggccgac gtggacgcct acgtcggcac cggtaacgcg 180 ctgagcatcg ccgccaaccg actcagctac acgtttgact ttcgcggccc gagtctggcg 240 gtggacacgg cgtgctcgtc ttcactcgtc gcgatccatc tcgcctgcca gagcgttcgc 300 cgcggtgaag cggaactcgc cgtcgcggcc ggcgtcaact tgattctgac ccccggcctg 360 acggtgaatt tcacccgcgc cggcatgatg gcgcctgacg gccggtgcaa gacgttcgac 420 gcggccgcca acggctacgt gcgcggcgaa ggcgccggcg tcgtcgtgct caagccgctg 480 gcccaggcta tcgccgacgg cgacccgatc tacgcgatcg tccgtggcag cgccgtcaac 540 caggacggcc gttccaacgg cctcaccgcc ccgaaccgac aggcccaaga ggtcgtgctg 600 cgggccgcgt atcgtgacgc gggcatcagc ccggccgatg tcgacgccgt cgaggcccac 660 ggcaccggca c 671 <210> 44 <211> 707 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :organisme du sol <400> 44 ccccagcagc gcgtgttcct cgaggacgcg actgaggtcg acgtggatgc gctttcagac 60 ggcgaagacg tcgtgatcgc cggcatcatg cagcacatcg aggaggccgg catccactcg 120 ggcgattcat cgtgcgtgct tccgccggtc gacatcccgc cgaaggcgct gcagacgatc 180 cgcgatcaca cgttcaagct cgcgcgcgcg ttgaaggtca tcggcctgat gaacgtgcag 240 tacgcgattc agcgcgacaa ggtctacgtg attgaggtaa accctagggc ttctcgaact 300 gtcccgtatg tctcgaaggc gacaggcgtg ccgctggcga aggtcgcgtc acgcttgatg 360 accggacgca aactgcacga gctgttgccg gaaggggtcg agcgcggctg gatcaccacc 420 gcgggcgaga atttctacgt gaagtcgccg gtcttcccgt ggggtaagtt cccgggcgtt 480 gacactgtgc tcgggccgga gatgaaatcg accggcgaag tcatgggcgt cgccgacaac 540 ttcggcgagg ccttcgccaa ggcacagatc gccgccggca catacctgcc gaccgaaggt 600 accgtcttca tctcggtcaa cgaccgtgac aaaggcaacg tcattcagct ggcgcagcgt 660 ttctccgaac tcggtttcgg cattgtcgac acgcacggca ccgggac 707 <210> 45 <211> 225 <212> PRT <213> Streptomyces ambofaciens <400> 45 Pro Gln Gln His Val Phe Leu Glu Thr Val Trp Glu Thr Phe Glu Ser
1 5 10 15 Ala Gly Val Asp Pro Arg Ala Val Arg Gly Arg Ser Val Gly Met Phe cgtctacg acagcttcgg catcgacgcc tcgcgcctgc agatgatcga ggcccacggc 660 accgcac 668 <210> 43 <211> 671 <212> DNA <213> Unknown Organ <220><223> Origin of the sequence:: soil organism <400> 43 ccgcaggagc gcgtgctgct ggaggtgact tgggaggcac tcgaagacgc cggccaagac 60 gtggaccgtc tggccgggcg gcccgtcggc gtcttcgtcg ggatctcgtc gaacgattac 120 ggccagcttc agaacggcga cccggccgac gtggacgcct acgtcggcac cggtaacgcg 180 ctgagcatcg ccgccaaccg actcagctac acgtttgact ttcgcggccc gagtctggcg 240 gtggacacgg cgtgctcgtc ttcactcgtc gcgatccatc tcgcctgcca gagcgttcgc 300 cgcggtgaag cggaactcgc cgtcgcggcc ggcgtcaact tgattctgac ccccggcctg 360 acggtgaatt tcacccgcgc cggcatgatg gcgcctgacg gccggtgcaa gacgttcgac 420 gcggccgcca acggctacgt gcgcggcgaa ggcgccggcg tcgtcgtgct caagccgctg 480 gcccaggcta tcgccgacgg cgacccgatc tacgcgatcg tccgtggcag cgccgtcaac 540 caggacggcc gttccaacgg cctcaccgcc ccgaaccgac aggcccaaga ggtcgtgctg 600 cgggccgcgt atcgtgacgc gggcatcagc ccggccgatg tcgacgccgt cgaggcccac ggcaccggca c 660 671 <210> 44 <211> 707 <212> DNA <213> Organime Unknown <220><223> Origin of sequence: soil organism <400> 44 ccccagcagc gcgtgttcct cgaggacgcg actgaggtcg acgtggatgc gctttcagac 60 ggcgaagacg tcgtgatcgc cggcatcatg cagcacatcg aggaggccgg catccactcg 120 ggcgattcat cgtgcgtgct tccgccggtc gacatcccgc cgaaggcgct gcagacgatc 180 cgcgatcaca cgttcaagct cgcgcgcgcg ttgaaggtca tcggcctgat gaacgtgcag 240 tacgcgattc agcgcgacaa ggtctacgtg attgaggtaa accctagggc ttctcgaact 300 gtcccgtatg tctcgaaggc gacaggcgtg ccgctggcga aggtcgcgtc acgcttgatg 360 accggacgca aactgcacga gctgttgccg gaaggggtcg agcgcggctg gatcaccacc 420 gcgggcgaga atttctacgt gaagtcgccg gtcttcccgt ggggtaagtt cccgggcgtt 480 gacactgtgc tcgggccgga gatgaaatcg accggcgaag tcatgggcgt cgccgacaac 540 ttcggcgagg ccttcgccaa ggcacagatc gccgccggca catacctgcc gaccgaaggt 600 accgtcttca tctcggtcaa cgaccgtgac aaaggcaacg tcattcagct ggcgcagcgt 660 ttctccgaac tcggtttcgg cattgtcgac acgcacggca ccgggac 707 <210> 45 <211> 225 <212> PRT <213> Streptomyces ambofaciens <400> 45 Pro G ln Gln His Val Phe Glu Thr Thru Glu Thr Phe Glu Ser
1 5 10 15 Ala Gly Val Asp Pro Arg Ala Val Arg Gly Arg Ser Val Gly Met Phe

20 25 30 Val Gly Thr Asn Gly Gln Asp Tyr Pro Val Val Leu Ala Gly Ser Ala
35 40 45 Asp Glu Gly Leu Asp Ala His Ala Ala Thr Gly Asn Ala Ala Ala Val
50 55 60 Leu Ser Gly Arg Val Ser Tyr Ala Phe Gly Leu Glu Gly Pro Ala Val
65 70 " 75 80 Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu Ala
85 90 95 Ala Gln Ala Leu Arg Arg Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly
100 105 110 Val Ser Glu Met Ser Thr Glu Ala Ala Phe Thr Glu Phe Ala Arg Gln
115 120 125 Gly Gly Leu Ala Asp Asp Gly Arg Cys Lys Ala Phe Ser Ala Asp Ala
130 135 140 Asp Gly Thr Gly Trp Gly Glu Gly Val Gly Val Leu Leu Val Glu Arg 145 150 155 160 Leu Ala Asp Ala Arg Arg Asn Gly His Arg Ala Leu Ala Leu Val Arg
165 170 175 Gly Ser Ala Val Asn Gln Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro
180 185 190 Asn Gly Pro Ser Gln Gln Arg Val Ile Arg Gln Ala Leu Ala Asp Ala
195 200 205 Arg Leu Ser Pro Ser Glu Val Asp Ala Val Glu Thr His Gly Thr Gly
210 215 220 Thr 225 <210> 46 <211> 207 <212> PRT <213> Streptomyces ambofaciens <400> 46 Ala Ser Trp Glu Ala Val Glu Arg Ala Gly Ile Asp Met Arg Thr Leu 1 5 5 10 15 Arg Gly Gly Arg Thr Gly Val Phe Ala Gly Val Met Tyr His Asp Tyr
20 25 30 Pro Ser Val Val Asp Pro Glu Ala Leu Asp Gly Tyr Leu Gly Thr Ala
35 40 45 Asn Ala Gly Ser Val Leu Ser Gly Arg Ile Ala Tyr Thr Phe Gly Leu
50 55 60 20 25 30 Val Gly Thrn Asn Gly Asp Gln Asp Val Al Val Val Val Ala Gly Ser Ala
35 40 45 Asp Glu Gly Leu Ala Asp Ala Ala Thr Gly Asn Ala Ala Ala Val
50 55 60 Leu Ser Gly Arg Val Ser Tyr Ala Phe Gly Leu Glu Gly Pro Ala Val
65 70 "75 80 Thr Asp Asp Val Ala Cys Ser Ser Leu Leu Val Ala Leu His Leu Ala
85 90 95 Ala Gln Ala Leu Arg Arg Gly Glu Cys Asp Leu Al Leu Ala Gly Gly
100 105 110 Val Ser Glu Met Ser Thr Glu Ala Ala Phe Thr Glu Phe Ala Arg Gln
115 120 125 Gly Gly Ala Asp Asp Gly Arg Cys Ala Phe Ser Lys Ala Asp Ala
130 135 140 Asp Gly Thr Gly Trp Gly Glu Gly Val Gly Val Leu Leu Val Glu Arg 145 150 155 160 Leu Ala Asp Ala Arg Arg Asn Gly His Arg Ala Leu Ala Leu Val Arg
165 170 175 Gly Ser Ala Val Asn Asp Gln Gly Ala Asn Ser Gly Leu Thr Ala Pro
180 185 190 Asn Gly Pro Ser Gln Gln Arg Val Arg Arg Gln Ala Leu Ala Asp Ala
195 200 205 Arg Leu Ser Ser Ser Glu Val Asp Ala Glu Gl Thr Thr Gly Thr Gly
210 215 220 Thr 225 <210> 46 <211> 207 <212> PRT <213> Streptomyces ambofaciens <400> 46 Ala Ser Trp Glu Ala Val Glu Arg Ala Gly Ile Asp Met Arg Thr Leu 1 5 5 10 15 Arg Gly Gly Arg Thr Gly Val Phe Ala Gly Val Met Tyr Tyr Asp Tyr
20 25 30 Pro Val Val Val Pro Asp Glu Ala Leu Asp Gly Tyr Leu Gly Thr Ala
35 40 45 Asn Ala Gly Ser Val Leu Ser Gly Arg Ala Tyr Thr Phe Gly Leu
50 55 60

Gln Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val
65 70 75 80 Ala Leu His Leu Ala Ala Gln Ala Leu Pro Ala Gly Glu Cys Glu Leu
85 90 95 Ala Leu Val Gly Gly Val Thr Val Met Ser Gly Pro Met Met Phe Ala
100 105 110 Gly Phe Gly Leu Glu Asp Gly Ser Ala Ala Asp Gly Arg Cys Lys Ala
115 120 125 Phe Ala Ala Ala Ala Asp Gly Thr Gly Trp Gly Glu Gly Val Gly Val
130 135 140 Leu Leu Val Glu Arg Leu Ser Asp Ala Arg Arg His Gly His Arg Val 145 150 155 160 Leu Ala Val Val Arg Gly Ser Ala Val Asn Gln Asp Gly Ala Ser Gly
165 170 175 Gly Leu Thr Ala Pro Asn Gly Pro Ala Gln gln Arg Val Ile Arg Gln
180 185 190 Ala Leu Ala Ser Ala Ala Leu Val Pro Ala Glu Val Asp Ala Val
195 200 205 <210> 47 <211> 223 <212> PRT <213> Saccharopolyspora erythraea <400> 47 Pro Gln Glu Arg Val Phe Leu Glu Leu Ala Trp Glu Ala Leu Asp Asn 1 5 10 15 Ala Gly Ile Ala Pro His Ser Leu Arg Asp Ser Arg Thr Gly Val Phe
20 25 30 Phe Gly Ala Met Trp His Gly Tyr Ala Gln Phe Ala Ala Gly Ala Val
35 40 45 Asp Arg Ile Thr Gln His Thr Ala Thr Gly His Asp Leu Ser Ile Ile
50 55 60 Pro Ala Arg Ile Ala Tyr Phe Leu Gly Leu Arg Gly Pro Asp Met Thr
65 70 75 80 Leu Asn Thr Ala Cys Ser Ser Ala Leu Val Ala Met His Gln Ala Arg
85 90 95 Gln Ser Ile Leu Leu Gly Glu Ser Ser Val Ala Leu Val Gly Gly Ile
100 105 110
Ser Leu Leu Val Ala Leu Asp Ser Met Val Ala Met Ser Arg Phe Gly
115 120 125 Gln Gly Pro Ala Thr Thr Val Asp Thr Ser Ala Ser Cys Ser Ser Leu Val
65 70 75 80 Ala Leu Ala Ala Gla Ala Leu Leu Ala Gly Glu Leuk Leu
85 90 95 Ala Leu Val Gly Gly Val Thr Met Met Ser Gly Met Met Phe Ala
100 105 110 Gly Phe Gly Leu Asp Asp Glly Ala Ala Ala Asp Gly Arg Ala Cly Lys
115 120 125 Phe Ala Ala Ala Ala Gly Thr Gly Trly Gly Glly Gly Val Gly Val
130 135 140 Leu Leu Val Glu Arg Leu Ser Asp Ala Arg Arg His Gly His Arg Val 145 150 155 160 Leu Ala Val Val Arg Gly Ser Ala Val Asn Asp Gln Gly Ala Ser Gly
165 170 175 Gly Leu Thr Ala Pro Asn Gly Pro Ala Gln Gln Arg Val Arg Gln Island
180 185 190 Ala Leu Ala Ser Ala Ala Val Leu Pro Ala Glu Asp Val Ala Val
195 200 205 <210> 47 <211> 223 <212> PRT <213> Saccharopolyspora erythraea <400> 47 Pro Gln Glu Arg Val Phe Leu Glu Leu Ala Trp Glu Ala Leu Asp Asn 1 5 10 15 Ala Gly Ile Ala Pro His Ser Arg Arg Ser Ser Arg Thr Gly Val Phe
20 25 30 Phe Gly Ala Met Trp His Gly Ala Gla Phe Ala Ala Gla Ala Val
35 40 45 Asp Arg Thr Thr Thrn His Thr Ala Thr Gly His Asp Leu Ser Ile Ile
50 55 60 Pro Ala Arg Ala Tyr Phe Leu Gly Leu Arg Gly Pro Asp Met Thr
65 70 75 80 Leu Asn Thr Ala Ser Cys Ser Ala Leu Ala Val Met His Gln Ala Arg
85 90 95 Gln Ser Ile Leu Leu Gly Glu Ser Ser Val Ala Leu Val Gly Gly Ile
100 105 110
Ser Leu Leu Ala Val Leu Asp Ser Met Ala Val Met Ser Arg Phe Gly
115 120 125

Ala Met Ala Pro Asp Gly Arg Cys Lys Ala Phe Asp Ser Arg Ala Asn
130 135 140 Gly Tyr Val Arg Gly Glu Gly Gly Gly Val Val Val Leu Lys Pro Leu 145 150 155 160 Ser Arg Ala Leu Ala Asp Gly Asn Pro Val Tyr Cys Val Leu Arg Gly
165 170 175 Ser Ala Val Asn Asn Asp Gly Phe Ser Asn Gly Leu Thr Ala Pro Ser
180 185 190 Pro Ala Ala Gln Glu Gln Val Leu Arg Asp Ala Tyr Ala Asn Ala Gly
195 200 205 Val Asp Pro Ala Gln Val Asp Tyr Val Glu Thr His Gly Thr Gly
210 215 220 <210> 48 <211> 211 <212> PRT <213> Organime Inconnu <220> <223> Origine de la séquence :organisme du sol <400> 48 Ser Cys Trp Glu Ala Leu Glu His Ala Gly Tyr Asp Thr Ala Arg Tyr
1 5 10 15 Pro Gly Arg Ile Gly Leu Trp Ala Gly Ala Gly Phe Asn Ser Tyr Leu
20 25 30 Leu Thr Asn Leu Met Asn Asn Arg Ala Phe Leu Glu Ser Val Gly Met
35 40 45 Tyr Gln Ile Phe Leu Ser Asn Asp Lys Asp Phe Ile Ala Thr Arg Thr
50 55 60 Ala Tyr Lys Leu Asn Leu Arg Gly Pro Ala Met Ala Val Gly Thr Ala
65 70 75 80 Cys Ser Thr Ser Leu Val Ala Val His Glu Ala Cys Gln Ala Leu Arg
85 90 95 Leu Gly Glu Cys Asp Met Ala Leu Ala Gly Ala Ala Ser Val Ser Thr
100 105 110 Pro Leu Arg Glu Gly Tyr Leu Tyr Gln Glu Gly Met Ile Met Ser Arg
115 120 125 Asp Gly Val Cys Arg Pro Phe Asp Ala Asp Ala Asp Gly Thr Val Leu
130 135 140 Gly Asn Gly Val Ala Val Val Val Leu Lys Arg Leu Asp Glu Ala Leu 145 150 155 160 Arg Asp Gly Asp Thr Val Tyr Ala Val Ile Arg Gly Thr Ala Val Asn Ala Met Ala Pro Asp Gly Arg Cys Lilies Ala Phe Asp Ser Arg Ala Asn
130 135 140 Gly Tyr Val Arg Gly Glu Gly Gly Gly Val Val Val Leu Lys Pro Leu 145 150 155 160 Ser Arg Ala Leu Ala Asp Gly Asn Pro Val Tyr Cys Val Leu Arg Gly
165 170 175 Ser Ala Asn Asp Asn Asp Gly Phe Ser Asn Gly Ur Thr Ala Pro Ser
180 185 190 Pro Ala Ala Gln Glu Gln Val Leu Arg Asp Ala Tyr Ala Asn Ala Gly
195 200 205 Val Asp Pro Ala Gln Val Asp Tire Val Glu Thr His Gly Thr Gly
210 215 220 <210> 48 <211> 211 <212> PRT <213> Unknown organism <220><223> Origin of the sequence: soil organism <400> 48 Ser Cys Trp Glu Leu Ala Glu His Ala Gly Tyr Asp Thr Ala Arg Tyr
1 5 10 15 Pro Gly Arg Ile Gly Leu Trp Ala Gly Ala Gly Phe Asn Ser Tyr Leu
Leu Leuk As Leun Asn Asn Arg Ala Phe Leuk Glu Ser Val Gly Met
35 40 45 Tyr Gln Ile Phe Leu Ser Asn Asp Lys Asp Phe Ile Ala Thr Arg Thr
50 55 60 Ala Tyr Lilies Leu Asn Leu Arg Gly Pro Ala Ala Val Gly Thr Ala
65 70 75 80 Cys Ser Thr Ser Leu Leu Val Ala Val His Glu Ala Cys Gln Leu Al Arg
85 90 95 Leu Gly Glu Asp Cys Ala Al Ala Gly Ala Ser Val Ser Thr
100 105 110 Pro Leu Arg Glu Gly Tire Leu Tyr Gln Glu Gly Met Ile Met Ser Arg
115 120 125 Asp Gly Val Cys Arg Pro Phe Asp Ala Asp Ala Asp Gly Thr Val Leu
130 135 140 Gly Asn Gly Val Ala Val Val Val Leu Lys Arg Leu Asp Glu Ala Leu 145 150 155 160 Arg Asp Asp Gly Asp Thr Val Tyr Ala Val Arg Island Gly Thr Ala Val Asn

165 170 175
Asn Asp Gly Ser Val Lys Ile Gly Phe Thr Ala Pro Ser Ala Glu Gly
180 185 190
Gln Ser Arg Val Val Arg Asp Ala Leu Arg Ala Ala Ala Val Pro Ala
195 200 205
Glu Ser Val
210 <210> 49 <211> 229 <212> PRT <213> Organime Inconnu <220> <223> Origine de la séquence :organisme du sol <400> 49
Pro Gln Gln Arg Leu Phe Leu Glu Cys Ala Trp Glu Ala Met Glu Asn 1 5 10 15
Ala Gly Tyr Ala Ala Arg Ser Tyr Lys Gly Ser Ile Gly Val Phe Ala
20 25 30
Gly Cys Gly Val Asn Thr Tyr Leu Leu Asn Asn Leu Ala Thr Ala Glu
35 40 45
Pro Phe Asp Phe Ser Arg Pro Ser Ala Tyr Gln Leu Leu Thr Ala Asn
50 55 60
Asp Lys Asp Phe Leu Ala Thr Arg Val Ser Tyr Lys Leu Asn Leu Arg
65 70 75 80
Gly Pro Ser Leu Thr Val Gln Thr Ala Cys Ser Thr Ser Leu Val Ser
85 90 95
Val Val Met Ala Cys Glu Ser Leu Gln Arg Gly Ala Ser Asp Ile Ala
100 105 110
Leu Ala Gly Gly Val Ala Ile Asn Val Pro Gln Ser Val Gly Tyr Leu
115 120 125
His Gln Pro Gly Met Ile Leu Ser Pro Asp Gly Arg Cys Arg Ala Phe
130 135 140
Asp Glu Ser Ala Gln Gly Thr Val Pro Gly Asn Gly Ala Gly Val Val
145 150 155 160
Val Leu Lys Arg Leu Ser Arg Ala Leu Ala Asp Gly Asp Thr Ile Tyr 165 170 175
Ala Val Ile Arg Gly Ala Ala Ile Asn Asn Asp Gly Ala Glu Arg Met
180 185 190
Gly Phe Thr Ala Pro Gly Val Asp Gly Gln Thr Arg Leu Ile Arg Arg
195 200 205 165 170 175
Asn Asp Gly Ser Val Lys Gly Phe Thr Ala Pro Ser Ala Glu Gly
180 185 190
Gln Ser Arg Val Val Arg Asp Ala Ala Al Arg Ala Ala Val Pro Ala
195 200 205
Glu Ser Val
210 <210> 49 <211> 229 <212> PRT <213> Unknown organism <220><223> Origin of the sequence: soil organism <400> 49
Pro Gln Gln Arg Leu Phe Leu Glu Cys Ala Trp Glu Ala Met Glu Asn 1 5 10 15
Ala Gly Tyr Ala Ala Arg Ser Tyr Lys Gly Ser Gly Val Phe Ala
20 25 30
Gly Cys Gly Val Asn Thr Leu Leu Leu Asn Leu Leu Ala Thr Ala Glu
35 40 45
Pro Phe Asp Phe Ser Arg Pro Ser Ala Tyr Gln Leu Leu Thr Ala Asn
50 55 60
Asp Lys Asp Phe Leu Ala Thr Arg Val Ser Tyr Lys Leu Asn Leu Arg
65 70 75 80
Gly Pro Ser Leu Thr Thrn Gln Thr Ala Ser Cys Ser Ser Ser Val Val Ser
85 90 95
Val Val Met Ala Cys Glu Ser Gln Arg Glly Ala Ser Asp Ala Island
100 105 110
Leu Ala Gly Gly Val Ala Val Asn Val Val Gln Ser Val Gly Tyr Leu
115 120 125
His Gln Pro Gly Met Leu Ser Pro Asp Gly Arg Cys Arg Ala Phe
130 135 140
Asp Glu Ser Ala Gln Gly Thr Pro Val Gly Asn Gly Ala Gly Val Val
145 150 155 160
Val Leu Lys Arg Leu Ser Arg Ala Leu Ala Asp Gly Asp Thr Tyr Island 165 170 175
Ala Val Arg Island Gly Ala Ala Asn Asn Island Asp Gly Ala Glu Arg Met
180 185 190
Gly Phe Thr Ala Pro Gly Val Asp Gly Arg Gln Arg Arg Arg Arg
195 200 205

Thr Gln Glu Met Ala Gly Val Lys Pro Glu Ser Ile Gly Tyr Met Asp
210 215 220 Thr His Gly Thr Gly 225 <210> 50 <211> 223 <212> PRT <213> Organime Inconnu <220> <223> Origine de la séquence :organisme du sol <400> 50 Pro Gln Gln Arg Leu Phe Leu Glu Val Ala Trp Glu Ala Leu Glu Arg 1 5 10 15 Ala Gly Arg Pro Pro Asp Ser Leu Ala Gly Ser Asp Thr Gly Val Phe
20 25 30 Ile Gly Ile Ser Thr Asp Asp Tyr Ser Arg Leu Lys Pro Thr Asp Pro
35 40 45 Ala Leu Ile Asp Ala Tyr Thr Gly Thr Gly Thr Ala Phe Ser Thr Ala
50 55 60 Ala Gly Arg Ile Ser Tyr Leu Leu Gly Leu Gln Gly Pro Asn Phe Pro
65 70 75 80 Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Val His Leu Ala Cys
85 90 95 Arg Ser Leu Gln Ser Arg Glu Cys Ser Met Ala Leu Ala Gly Gly Val
100 105 110 Asn Leu Ile Leu Ala Pro Glu Ser Thr Ile Tyr Phe Cys Arg Leu Arg
115 120 125 Ala Met Ala Ala Asp Gly Arg Cys Lys Ser Phe Ala Ala Ser Ala Asp
130 135 140 Gly Tyr Gly Arg Gly Glu Gly Cys Gly Met Leu Val Leu Lys Arg Leu 145 150 155 160 Ser Asp Ala Thr Arg Asp Gly Asp Arg Ile Leu Ala Leu Ile Arg Gly
165 170 175 Ser Ala Val Asn His Gly Gly Arg Ser Asn Gly Leu Thr Ala Pro Asn
180 185 190 Gly Pro Ala Gln Glu Ala Val Ile Arg Ala Ala Leu Lys Asn Ala Gly
195 200 205 Met Ala Pro Ala Asp Val Asp Tyr Val Asp Thr His Gly Thr Gly
210 215 220 Thr Gln Glu Met Ala Gly Val Lys Pro Glu Ser Gly Tyr Island Met Asp
210 215 220 Thr His Gly Thr Gly 225 <210> 50 <211> 223 <212> PRT <213> Unknown organism <220><223> Origin of the sequence: soil organism <400> 50 Pro Gln Gln Arg Leu Phe Leu Glu Val Ala Trp Glu Ala Leu Glu Arg 1 5 10 15 Ala Gly Arg Pro Pro Ser Ser Ala Gly Ser Ser Asp Thr Gly Val Phe
20 25 30 Gly Island Ser Ser Asp Asp Asp Tyr Ser Ser Arg Leu Lys Pro Thr Asp Pro
35 40 45 Ala Leu Asp Asp Ala Tyr Thr Gly Thr Gly Thr Ala Phe Ser Thr Ala
50 55 60 Ala Gly Arg Ser Ser Tyr Leu Leu Gly Leu Gln Gly Asn Pro Phe Pro
65 70 75 80 Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Val His Leu Ala Cys
85 90 95 Arg Ser Leu Gln Ser Arg Glu Cys Ser Met Ala Leu Ala Gly Gly Val
100 105 110 Asn Leu Leu Leu Ala Pro Glu Ser Thr Ile Tyr Phe Cys Arg Leu Arg
115 120 125 Ala Met Ala Ala Asp Gly Arg Cys Lily Ser Phe Ala Ala Ser Ala Asp
130 135 140 Gly Tyr Gly Arg Gly Glu Gly Cys Gly Met Leu Val Leu Lys Arg Leu 145 150 155 160 Ser Asp Ala Thr Arg Asp Gly Asp Arg Ile Leu Ala Leu Arg Gly Island
165 170 175 Ser Ala Val Asn His Gly Gly Arg Ser Asn Gly Leu Thr Ala Pro Asn
180 185 190 Gly Pro Ala Gln Ala Val Glu Ala Ala Ala Lys Ala Gly
195 200 205 Met Ala Pro Asp Asp Asp Asp Asp Asp Gly Thr Gly
210 215 220

<210> 51 <211> 252 <212> PRT <213> Organime Inconnu <220> <223> Origine de la séquence :organisme du sol <400> 51 Pro Gln Glu Arg Val Phe Leu Glu Arg Ile Asp Gly Phe Asp Ala Glu 1 5 10 15 Phe Phe Gly Ile Ser Pro Arg Glu Ala Leu Asn Met Asp Pro Gln Gln
20 25 30 Arg Leu Leu Leu Glu Val Cys Trp Glu Ala Ala Glu Asp Ala Gly Ile
35 40 45 Ser Pro Gly Pro Leu Ala Gly Ser Ala Thr Gly Val Phe Ala Gly Ser
50 55 60 Cys Ala Gin Asp Phe Gly Leu Phe Gln Tyr Ala Asp Pro Ala Arg Ile
65 70 75 80 Gly Ala Trp Ser Gly Ser Gly Val Ala His Ser Met Leu Ala Asn Arg
85 90 95 Ile Ser Tyr Leu Leu Asp Leu Arg Gly Pro Ser Met Ala Val Asp Thr
100 105 110 Ala Cys Ser Ser Ala Leu Val Ala Val His Leu Ala Cys Gln Ser Leu
115 120 125 Arg Arg Arg Glu Cys Asp Ala Ala Phe Ala Gly Gly Val Asn Leu Ile
130 135 140 Leu Thr Pro Glu Gly Met Ile Ala Leu Ser Lys Ala Arg Met Leu Ala 145 150 155 160 Pro Asp Gly Arg Cys Lys Thr Phe Asp Ala Ala Ala Asp Gly Tyr Val
165 170 175 Arg Gly Glu Gly Cys Gly Ile Val Leu Leu Lys Arg Leu Ser Asp Ala
180 185 190 Leu Ala Asp Gly Asp Ala Ile Cys Ala Val Ile Arg Gly Ser Ala Ile
195 200 205 Asn Gln Asp Gly Arg Ser Asn Gly Ile Thr Ala Pro Asn Leu Gln Ala
210 215 220 Gln Lys Ala Val Leu Gln Glu Ala Val Ala Asn Ala His Ile Asp Pro 225 230 235 240 Ser His Val Ser Leu Ile Asp Thr His Gly Thr Gly
245 250 <210> 51 <211> 252 <212> PRT <213> Unknown organ <220><223> Origin of the sequence: soil organism <400> 51 Pro Gln Glu Arg Val Phe Leu Asp Glu Asp Asp Asp Glu 1 5 10 15 Phe Phe Gly Ile Pro Ser Arg Glu Ala Asn Leu Met Asp Pro Gln Gln
Arg Leu Leu Leu Glu Val Cys Trp Ala Glu Ala Glu Asp Ala Gly Ile
35 40 45 Ser Pro Gly Pro Leu Ala Gly Ser Ala Thr Gly Val Phe Ala Gly Ser
50 55 60 Cys Ala Gin Asp Phe Gly Leu Phe Gln Tire Ala Asp Pro Ala Arg Ile
65 70 75 80 Gly Ala Ser Ser Gly Ser Gly Ser Ala His Ser Ser Leu Ala Asn Arg
85 90 95 Ser Ser Tyr Leu Leu Asp Leu Arg Gly Pro Ser Met Ala Val Asp Thr
100 105 110 Ala Cys Ser Ser Ala Leu Val Ala Val His Leu Ala Cys Gln Ser Leu
115 120 Arg Arg Arg Glu Cys Asp Ala Ala Phe Ala Gly Gly Val Asn Leu Ile
130 135 140 Leu Thr Pro Glu Gly Met Ile Ala Leu Ser Lys Ala Arg Met Leu Ala 145 150 155 160 Pro Asp Gly Arg Cys Lys Thr Asp Asp Ala Ala Asp Gly Tyr Val
165 170 175 Arg Gly Glu Gly Cys Gly Ile Val Leu Leu Lys Arg Leu Ser Asp Ala
180 185 190 Leu Ala Asp Gly Asp Ala Island Cys Ala Val Island Arg Gly Ser Ala Island
195 200 205 Asn Gln Asp Gly Arg Ser Asn Gly Thr Island Ala Pro Asn Leu Gln Ala
210 215 220 Gln Lys Ala Val Leu Gln Glu Ala Val Ala Asn Ala His Asp Asp 225 230 235 240 Ser His Val Ser Leu Asp Asp Thr His Gly Thr Gly
245,250

<210> 52 <211> 234 <212> PRT <213> Organime Inconnu <220> <223> Origine de la séquence : organisme du sol <400> 52 Pro Gln Gln Arg Val Phe Leu Glu Cys Ala Trp Glu Ala Val Glu Ser
1 5 10 15 Ala Gly Tyr Asp Pro Glu Lys Tyr Pro Gly Leu Ile Gly Val Phe Ala
20 25 30 Gly Ala Ser Ile Asn Ser Tyr Phe Leu Tyr Asn Leu Ala His Asn Arg
35 40 45 Glu Phe Val Ala Arg Met Ala Gly Glu Tyr Gln Val Gly Glu Tyr Gln
50 55 60 Thr Ile Leu Gly Asn Asp Lys Asp Tyr Leu Pro Thr Arg Val Ser Tyr
65 70 75 80 Lys Leu Asn Leu Arg Gly Pro Ser Leu Ala Val Gln Ser Ala Cys Ser
85 90 95 Thr Gly Leu Val Ala Val Cys Gln Ala Ile Gln Asn Leu Gln Thr Tyr
100 105 110 Gln Cys Asp Met Ala Leu Ala Gly Gly Ile Ser Ile Ser Phe Pro Gln
115 120 " 125 Lys Arg Asp Tyr Arg Phe Thr Asp Glu Gly Met Val Ser Arg Asp Gly
130 135 140 His Cys Arg Pro Phe Asp Ala Ser Ala Gln Gly Thr Val Phe Gly Asn 145 150 155 160 Gly Ala Gly Val Val Leu Met Lys Arg Leu Ala Asp Ala Val Thr Asp
165 170 175 Arg Asp Thr Ile Leu Ala Val Ile Arg Gly Ala Ala Val Asn Asn Asp
180 185 190 Gly Gly Val Lys Met Gly Tyr Thr Ala Pro Ser Ala Glu Gly Gln Ala
195 200 205 Glu Ala Ile Thr Leu Ala Leu Ala Leu Ala Gly Val Ser Pro Glu Thr
210 215 220 Ile Thr Cys Met Asp Thr His Gly Thr Gly 225 230 <210> 53 <211> 226 <212> PRT <210> 52 <211> 234 <212> PRT <213> Unknown Organ <220><223> Sequence Origin: Soil Organism <400> 52 Pro Gln Gln Arg Val Phe Leu Glu Cys Ala Trp Glu Ala Val Glu Ser
1 5 10 15 Ala Gly Tyr Asp Pro Glu Lys Tire Pro Gly Leu Isle Gly Val Phe Ala
20 25 30 Gly Ala Ser Asn Ser Ser Tyr Phe Leu Tyr Asn Ala His As Asn Arg
35 40 45 Glu Phe Val Ala Arg Ala Gly Glu Gln Gln Val Gly Glu Tyr Gln
50 55 60 Thr Leu Leuk Asn Asp Asp Asp Tyr Tyr Leu Pro Thr Arg Val Ser Tyr
65 70 75 80 Lys Leu Asn Leu Arg Gly Pro Ser Leu Ala Val Gln Ser Ala Cys Ser
85 90 95 Thr Gly Leu Val Ala Valley Cys Gln Ala Gln Island Asn Leu Gln Thr Tyr
100 105 110 Gln Cys Asp Met Ala Leu Ala Gly Gly Ser Island Ser Phe Pro Gln Island
115 120 "125 Lys Arg Asp Tyr Tyr Arg Phe Thr Asp Glu Gly Met Val Ser Arg Asp Gly
130 135 140 His Cys Arg Pro Phe Asp Ala Ser Ala Gln Gly Thr Val Phe Gly Asn 145 150 155 160 Gly Ala Gly Val Val Leu Met Arg Lys Leu Ala Asp Ala Asp Asp Val
165 170 175 Arg Asp Thr Island Leu Ala Val Island Arg Gly Ala Ala Val Asn Asn Asp
180 185 190 Gly Gly Val Lys Met Gly Tyr Thr Ala Ser Ser Ala Glu Gly Gln Ala
195 200 205 Glu Ala Island Thr Leu Ala Leu Ala Leu Ala Gly Ser Val Pro Glu Thr
210 215 220 Thr Cys Thr Met Asp Thr His Gly Thr Gly 225 230 <210> 53 <211> 226 <212> PRT

<213> Organime Inconnu <220> <223> Origine de la séquence : organisme du sol <400> 53 Pro Gln Gln Arg Val Phe Leu Glu Cys Ala Trp Ala Ala Leu Glu Arg 1 5 10 15 Arg Arg Ile Ser Gly Arg His Leu Pro Arg Cys Pro Ser Ala Val Tyr
20 25 30 Ala Ser Ser Gly Phe Asn Thr Tyr Leu Leu Asn Leu His Ala Asn Ala
35 40 45 Ala Val Arg Gln Ser Ile Ser Pro Phe Glu Leu Phe Val Ala Asn Asp
50 55 60 Lys Asp Phe Leu Ala Thr Arg Thr Ala Tyr Lys Leu Asn Leu Arg Gly
65 70 75 80 Pro Ala Met Thr Val Gln Thr Ala Cys Ser Ser Ser Leu Val Ala Val
85 90 95 His Val Ala Ala Gln Ser Leu Leu Ala Gly Glu Cys Asp Ile Ala Leu
100 105 110 Ala Gly Gly Ile Thr Val Ser Arg Ser His Gly Tyr Val Ala Arg Glu
115 120 125 Gly Gly Ile Leu Ser Pro Asp Gly His Cys Arg Ala Phe Asp Ala Asp
130 135 140 Ala Gly Gly Thr Val Pro Gly Ser Gly Val Gly Val Val Val Leu Lys 145 150 155 160 Arg Leu Glu Asp Ala Leu Ala Asp Gly Asp Thr Ile Asp Ala Val Ile
165 170 175 Ile Gly Ser Ala Ile Asn Asn Asp Gly Ala Leu Lys Ala Ser Phe Thr
180 185 190 Ala Pro Gln Val Asp Ser Gln Ala Leu Val Ile Ser Glu Ala His Ala
195 200 205 Ala Ala Gly Ile Ser Ala Asp Ser Ile Gly Tyr Met Asp Thr His Gly
210 215 220 Thr Gly 225 <210> 54 <211> 223 <212> PRT <213> Organime Inconnu <220> <223> Origine de la séquence :organisme du sol <213> Unknown organ <220><223> Origin of the sequence: soil organism <400> 53 Pro Gln Gln Arg Val Phe Leu Glu Cys Ala Trp Ala Ala Leu Glu Arg 1 5 10 15 Arg Arg Ile Ser Gly Arg His Leu Pro Arg Ser Cys Ser Ala Val Tyr
Ala Ser Ser Gly Phe Asn Thr Leu Leu Leu Asn Leu His Ala Asn Ala
35 40 45 Ala Val Arg Gln Ser Ser Ser Pro Phe Glu Phe Leu Val Ala Asn Asp
50 55 60 Lys Asp Phe Leu Ala Thr Arg Thr Ala Tyr Lily Leu Asn Leu Arg Gly
65 70 75 80 Pro Ala Met Thr Val Gln Ser Ala Cys Ser Ser Ser Ala Val Val Val
85 90 95 His Val Ala Ala Gln Leu Leu Leu Ala Gly Glu Asp Cys Ala Leu
100 105 110 Ala Gly Gly Thr Island Ser Val Ser Arg Ser His Gly Tyr Val Ala Arg Glu
115 120 125 Gly Gly Ile Leu Pro Ser Asp Gly His Cys Arg Asp Asp Ala Phe Asp
130 135 140 Ala Gly Gly Thr Val Pro Gly Ser Gly Val Gly Val Val Val Leu Lys 145 150 155 160 Arg Leu Glu Asp Ala Leu Asp Ala Asp Gly Asp Asp Asp Asp Ala Val Ile
165 170 175 Island Gly Ser Ala Island Asn Asn Asp Gly Ala Leu Lys Ala Ser Phe Thr
180 185 190 Ala Pro Gln Asp Val Gln Al Gla Val Al Ser Glu Ala His Ala
195 200 205 Ala Ala Gly Ser Island Ala Asp Ser Gly Tyr Island Met Asp Thr His Gly
210 215 220 Thr Gly 225 <210> 54 <211> 223 <212> PRT <213> Unknown organism <220><223> Origin of the sequence: soil organism

<400> 54 Pro Gln Gln Arg Leu Phe Leu Glu Leu Thr Trp Glu Ala Leu Glu Asp 1 5 10 15 Ala Gly Ile Pro Pro Ser Thr Ile Ala Gly Thr Asn Val Gly Val Phe
20 25 30 Met Gly Ala Ser Gln Ala Asp Tyr Gly His Lys Phe Phe Ser Asp His
35 40 45 Ala Val Ala Asp Ser His Phe Ala Thr Gly Thr Ser Leu Ala Val Val
50 55 60 Ala Asn Arg Ile Ser Tyr Ile Tyr Asp Leu Arg Gly Pro Ser Leu Thr
65 70 75 80 Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Gln Ala Val
85 90 95 Glu Ala Leu Arg Ser Gly Arg Ile Glu Thr Ala Ile Val Gly Gly Ile
100 105 110 Asn Val Ile Ala Ser Pro Ala Ser Phe Ile Ala Phe Ser Gln Ala Ser
115 120 125 Met Leu Ser Pro Thr Gly Leu Cys Gln Ala Phe Ser Ala Lys Ala Asp
130 135 140 Gly Phe Val Arg Gly Glu Gly Gly Thr Val Phe Val Leu Arg Lys Ala 145 150 155 160 Ala His Ala His Gly Ser Arg Asn Pro Val Arg Gly Leu Ile LeuAla
165 170 175 Thr Asp Val Asn Ser Asp Gly Arg Thr Asn Gly Ile Ser Leu Pro Ser
180 185 190 Ala Glu Ala Gln Glu Val Leu Leu Gln Arg Val Tyr Ser Arg Ala Ser
195 200 205 Ile Asp Pro Asn Arg Leu Ala Phe Val Asp Thr His Gly Thr Gly
210 215 220 <210> 55 <211> 254 <212> PRT <213> Organime Inconnu <220> <223> Origine de la séquence :organisme du sol <400> 55 Pro Gln Gln Arg Val Phe Leu Asp Gly Ile Asp Arg Phe Asp Pro Arg 1 5 10 15 His Phe Ala Ile Thr Pro Arg Glu Ala Ile Ser Met Asp Pro Gln Gln
20 25 30 <400> 54 Pro Gln Gln Arg Leu Phe Leu Glu Leu Thr Trp Glu Ala Leu Glu Asp 1 5 10 15 Ala Gly Pro Pro Ser Thr Island Ala Gly Thr Asn Val Gly Val Phe
Met Gly Ala Ser Gln Ala Asp Tyr Gly His Lys Phe Phe Ser Asp His
35 40 45 Ala Val Ala Asp His Ser Phe Ala Thr Ser Gly Thr Ser Ala Val Val
50 55 60 Ala Asn Arg Isle Ser Tyr Island Tyr Asp Leu Arg Gly Pro Ser Leu Thr
65 70 75 80 Val Asp Thr Ala Ser Cys Ser Ser Al Leu Leu Leu Leu Gln Ala Val
85 90 95 Glu Ala Leu Arg Ser Gly Arg Glu Thr Island Ala Island Val Gly Gly Island
100 105 110 Asn Val Ala Ser Pro Ala Ser Phe Ala Phe Ser Ala Ser
115 120 125 Met Leu Ser Gly Leu Cys Gln Pro Ala Phe Ser Ala Lys Ala Asp
130 135 140 Gly Phe Val Arg Gly Glu Gly Gly Thr Val Phe Val Leu Arg Lys Ala 145 150 155 160 Ala His Ala His Gly Ser Arg Asn Pro Val Arg Gly Leu Ile LeuAla
165 170 175 Thr Asp Val Asn Ser Asp Gly Arg Thr Asn Gly Ile Ser Leu Pro Ser
180 185 190 Ala Glu Ala Gln Glu Val Leu Leu Gln Arg Val Tyr Ser Arg Ala Ser
195 200 205 Asp Asp Island Asn Arg Leu Ala Phe Asp Val Asp Thr His Gly Thr Gly
210 215 220 <210> 55 <211> 254 <212> PRT <213> Unknown organism <220><223> Origin of the sequence: soil organism <400> 55 Pro Gln Gln Arg Val Phe Leu Asp Gly Asp Asp Arg Phe Asp Pro Arg 1 5 10 15 His Phe Ala Island Thr Pro Arg Glu Ala Island Ser Met Asp Pro Gln Gln
20 25 30

Arg Leu Leu Leu Glu Val Thr Trp Glu Ala Leu Glu Arg Ala Gly Val
35 40 45 Ala Pro Asp Arg Leu Thr Gly Ser Asp Thr Gly Val Phe Ile Gly Ile
50 55 60 Ser Thr Asn Asp Tyr Gly Gln Ile Leu Leu Arg Ala Ser Asp Gln Ile
65 70 75 80 Asp Pro Gly Met Tyr Phe Gly Thr Gly Asn Leu Leu Asn Ala Ala Ala
85 90 95 Gly Arg Leu Ser Tyr Val Leu Gly Leu Gln Gly Pro Ser Met Ala Val
100 105 110 Asp Thr Ala Cys Pro Ser Ser Leu Val Ala Ile His Leu Ala Cys Gln
115 120 125 Ser Leu Arg Asn Arg Glu Cys Arg Met Ala Leu Ala Gly Gly Ala Asn
130 135 140 Leu Val Leu Val Pro Glu Val Thr Val Asn Cys Cys Arg Ala Lys Met 145 150 155 160 Leu Ala Pro Asp Gly Arg Cys Lys Thr Phe Asp Ala Ala Ala Asp Gly
165 170 175 Tyr Val Arg Gly Glu Gly Ala Ala Val Ile Val Leu Lys Arg Leu Ser
180 185 190 Asp Ala Leu Ala Asp Gly Asp Pro Ile Val Ala Leu Ile Arg Gly Ser
195 200 205 Ala Val Asn Gln Asp Gly Arg Ser Gly Gly Phe Thr Ala Pro Asn Glu
210 215 220 Leu Ala Gln Gln Ala Val Ile Arg Thr Ala Leu Ala Ala Ala Gly Val 225 230 235 240 Ala Ala Ser Asp Ile Gly Tyr Val Asp Thr His Gly Thr Gly
245 250 <210> 56 <211> 254 <212> PRT <213> Organime Inconnu <220> <223> Origine de la séquence :organisme du sol <400> 56 Pro Gln Gln Arg Val Phe Leu Asp Gly Ile Asp Arg Phe Asp Pro Gln 1 5 10 15 Phe Phe Gly Ile Ala Pro Arg Glu Ala Ala Gly Ile Asp Pro Gln Gln
20 25 30 Arg Leu Leu Leu Glu Val Thr Trp Glu Ala Leu Glu Arg Ala Gly Val
35 40 45 Ala Pro Asp Arg Green Thr Gly Asp Asp Thr Gly Val Phe Ile Gly Ile
50 55 60 Ser Thr Asn Asp Tyr Gly Gln Ile Leu Leu Arg Ala Ser Asp Gln Ile
65 70 75 80 Asp Pro Gly Met Tyr Phe Gly Thr Gly Asn Leu Leu Asn Ala Ala Ala
85 90 95 Gly Arg Leu Ser Tyr Leu Val Leu Gly Leu Gln Ser Gly Pro Met Ala Val
100 105 110 Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Island His Leu Ala Cys Gln
115 120 125 Ser Leu Arg Asn Arg Glu Cys Arg Met Ala Leu Ala Gly Gly Ala Asn
130 135 140 Leu Val Leu Val Pro Glu Val Thr Val Asn Cys Cys Arg Ala Lys Met 145 150 155 160 Leu Ala Pro Asp Gly Arg Cys Lily Asp Asp Phe Ala Ala Ala Asp Gly
165 170 175 Tyr Val Arg Gly Glu Gly Ala Ala Val Ile Val Leu Lys Arg Leu Ser
180 185 190 Asp Ala Leu Ala Asp Gly Asp Pro Val Ala Island Leu Arg Gly Ser Island
195 200 205 Ala Val Asn Gln Asp Gly Arg Ser Gly Gly Phe Thr Ala Pro Asn Glu
210 215 220 Leu Ala Gln Gln Ala Val Arg Arg Ala Al Ala Ala Gly Val 225 230 235 240 Ala Ala Ser Asp Gly Tyr Island Val Asp Thr Thr Gly Thr Gly
245 250 <210> 56 <211> 254 <212> PRT <213> Unknown organ <220><223> Origin of the sequence: soil organism <400> 56 Pro Gln Gln Arg Val Phe Leu Asp Gly Ile Asp Arg Phe Asp Pro Gln 1 5 10 15 Phe Phe Gly Ile Ala Pro Arg Glu Ala Ala Gly Ile Asp Pro Gln Gln
20 25 30

Arg Leu Leu Leu Glu Thr Thr Trp Glu Ala Leu Glu Asp Ala Gly Thr
35 40 45 Ser Pro Glu Lys Leu Gin Gly Thr Pro Ala Gly Val Phe Val Gly Ile
50 55 60 Asn Ser Ile Asp Tyr Ala Thr Leu Gln Leu Gln Asn Cys Asp Leu Ala
65 70 75 80 Ser Ile Asp Ala Tyr Ser Leu Ser Gly Ser Ala His Ser Ile Ala Ala
85 90 95 Gly Arg Leu Ala Tyr Val Leu Gly Leu Gin Gly Pro Ala Met Ala Val
100 105 110 Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Ile His Leu Ala Cys Gln
115 120 125 Ser Leu Arg Asn Asp Asp Cys Arg Val Ala Val Ala Gly Gly Val His
130 135 140 Val Thr Leu Thr Pro Ile Asn Met Val Val Phe Ser Lys Leu Arg Met 145 150 155 160 Leu Ala Ala Asp Gly Lys Cys Lys Thr Phe Asp Gly Arg Gly Asp Gly
165 170 175 Phe Val Glu Gly Glu Gly Cys Ala Val Ile Val Leu Lys Arg Leu Ser
180 185 190 His Ala Leu Ala Asp Lys Asp Arg Ile Leu Ala Leu Val Arg Gly Ser
195 200 205 Ala Val Asn Gln Asp Gly Ala Ser Ser Gly Leu Thr Ala Pro Asn Gly
210 215 220 Pro Ala Gln Glu Ala Val Ile Arg Ala Ala Leu Lys Arg Ala Gly Val 225 230 235 240 Gln Pro Ala Glu Val Gly Tyr Val Asp Thr His Gly Thr Gly
245 250 <210> 57 <211> 222 <212> PRT <213> Organime Inconnu <220> <223> Origine de la séquence :organisme du sol <400> 57 Pro Gln Glu Arg Val Leu Leu Glu Ser Ser Trp His Ala Leu Glu Asp 1 5 10 15 Ala Gly Tyr Ala Gly Glu Ser Ile Ala Gly Ala Arg Cys Gly Val Tyr
20 25 30 Met Gly Phe Asn Gly Gly Asp Tyr Gly Asp Leu Leu Tyr Gly Gin Pro Arg Leu Leu Leu Glu Thr Thr Trp Glu Ala Glu Asp Leu Ala Gly Thr
35 40 45 Ser Pro Glu Lilies Leu Gin Gly Thr Pro Ala Gly Val Phe Val Gly Ile
50 55 60 Asn Ser Asp Asp Tyr Ala Thr Leu Gln Gln Leu Asn Cys Asp Leu Ala
65 70 75 80 Ser Asp Ala Ser Ser Ser Ser Ser Ser Gly Ser Ala His Ser Ala Ala Ser
85 90 95 Gly Arg Leu Ala Val Leu Val Leu Gly Leu Gin Val Ala Ala Val
100 105 110 Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Island His Leu Ala Cys Gln
115 120 125 Ser Leu Arg Asp Asn Asp Asp Cys Arg Val Ala Val Ala Gly Gly Val His
130 135 140 Val Thr Leu Thr Pro Ile Asn Met Val Val Phe Ser Lys Leu Arg Met 145 150 155 160 Leu Ala Ala Asp Gly Lys Cys Lys Thr Phe Asp Gly Arg Gly Asp Gly
165 170 175 Phe Val Glu Gly Glu Gly Cys Ala Val Ile Val Leu Lys Arg Leu Ser
180 185 190 His Ala Leu Ala Asp Lys Asp Arg Arg Leu Ala Leu Val Arg Gly Ser
195 200 205 Ala Val Asn Asp Gln Gly Ala Ser Ser Gly Leu Thr Ala Pro Asn Gly
210 215 220 Pro Ala Gln Glu Ala Val Arg Island Ala Ala Leu Lys Arg Ala Gly Val 225 230 235 240 Gln Pro Ala Glu Val Gly Tyr Val Asp Thr His Gly Thr Gly
245 250 <210> 57 <211> 222 <212> PRT <213> Unknown organism <220><223> Origin of the sequence: soil organism <400> 57 Pro Gln Glu Arg Leu Leu Leu Ser Glu Ser Trp His Ala Leu Glu Asp 1 5 10 15 Ala Gly Tyr Ala Gly Glu Ser Ile Ala Gly Ala Arg Cys Gly Val Tyr
20 25 30 Met Gly Phe Asn Gly Gly Asp Tyr Gly Asp Leu Leu Tyr Gly Gin Pro

35 40 45 Ser Leu Pro Pro His Ala Met Trp Gly Asn Ala Ala Ser Val Leu Ser
50 55 ' 60 Ala Arg Ile Ala Tyr Tyr Leu Asp Leu Gln Gly Pro Ala Ile Thr Leu
65 70 75 80 Asp Thr Ala Cys Ser Ser Ser Led Val Ala Val His Leu Ala Cys Gln
85 90 95 Gly Leu Trp Thr Gly Glu Thr Asp Leu Ala Leu Ala Gly Gly Val Trp
100 105 110 Ile Gln Cys Thr Pro Gly Phe Leu Ile Ser Ser Ser Arg Ala Gly Met
115 120 125 Leu Ser Pro Thr Gly Gln Cys Arg Ala Phe Gly Ala Gly Ala Asp Gly
130 135 140 Phe Val Pro Ser Glu Gly Val Gly Val Val Val Leu Lys Arg Leu Gln 145 150 155 160 Asp Ala Leu Asp Ala Gly Asp His Xaa Tyr Gly Val Ile Arg Gly Ser
165 170 175 Ala Ile Asn Gln Asp Gly Ala Ser Asn Gly Ile Thr Ala Pro Ser Ala
180 185 190 Ala Ala Gln Glu Arg Leu Gln Arg His Val Tyr Asp Ser Phe Gly Ile
195 200 205 Asp Ala Ser Arg Leu Gln Met Ile Glu Ala His Gly Thr Gly
210 " 215 220 <210> 58 <211> 223 <212> PRT <213> Organime Inconnu <220> <223> Origine de la séquence organisme du sol <400> 58 Pro Gln Glu Arg Val Leu Leu Glu Val Thr Trp Glu Ala Leu Glu Asp
1 5 10 15 Ala Gly Gln Asp Val Asp Arg Leu Ala Gly Arg Pro Val Gly Val Phe
20 25 30 Val Gly Ile Ser Ser Asn Asp Tyr Gly Gln Leu Gln Asn Gly Asp Pro
35 40 45 Ala Asp Val Asp Ala Tyr Val Gly Thr Gly Asn Ala Leu Ser Ile Ala
50 55 " 60 Ala Asn Arg Leu Ser Tyr Thr Phe Asp Phe Arg Gly Pro Ser Leu Ala
65 70 75 80 35 40 45 Ser Leu Pro Pro His Ala Met Trp Gly Asn Ala Ala Ser Val Leu Ser
50 55 '60 Ala Arg Ala Tyr Tyr Leu Leu Asp Leu Gln Gly Pro Ala Ile Thr Leu
65 70 75 80 Asp Thr Ala Ser Cys Ser Ser Led Ala Val Val His Leu Ala Cys Gln
85 90 95 Gly Leu Trp Thr Gly Glu Asp Asp Leu Al Leu Ala Gly Gly Val Trp
100 105 110 Gln Island Cys Thr Pro Gly Phe Leu Ser Ser Ser Ser Arg Ala Gly Met
115 120 125 Leu Ser Pro Gly Gln Cys Arg Ala Phe Gly Ala Gly Ala Asp Gly
130 135 140 Phe Val Pro Ser Glu Gly Val Gly Val Val Val Leu Lys Arg Leu Gln 145 150 155 160 Asp Ala Leu Asp Ala Gly Asp His Xaa Tyr Gly Val Isle Arg Gly Ser
165 170 175 Ala Asn Island Gln Asp Gly Ala Ser Asn Gly Thr Island Ala Pro Ser Ala
180 185 190 Ala Ala Gln Glu Arg Leu Gln Arg His Val Tyr Asp Ser Phe Gly Ile
195 200 205 Asp Ala Ser Arg Leu Gln Met Glu Island Ala His Gly Thr Gly
210 "215 220 <210> 58 <211> 223 <212> PRT <213> Unknown organism <220><223> Origin of the soil organism sequence <400> 58 Pro Gln Glu Arg Leu Leu Leu Glu Val Thr Trp Glu Ala Leu Glu Asp
1 5 10 15 Ala Gly Asp Gln Val Asp Arg Ala Leu Gly Arg Pro Val Gly Val Phe
20 25 30 Val Gly Ser Ser Asn Asp Asp Gly Gln Gln Leu Gln Asn Gly Asp Pro
35 40 45 Ala Asp Val Asp Ala Tire Val Gly Thr Gly Asn Ala Leu Ser Ala Island
50 55 "60 Ala Asn Arg Ser Ser Tyr Thr Phe Asp Arg Phe Gly Pro Ser Leu Ala
65 70 75 80

Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Ile His Leu Ala Cys
85 90 95 Gin Ser Val Arg Arg Gly Glu Ala Glu Leu Ala Val Ala Ala Gly Val
100 105 110 Asn Leu Ile Leu Thr Pro Gly Leu Thr Val Asn Phe Thr Arg Ala Gly
115 120 125 Met Met Ala Pro Asp Gly Arg Cys Lys Thr Phe Asp Ala Ala Ala Asn
130 135 140 Gly Tyr Val Arg Gly Glu Gly Ala Gly Val Val Val Leu Lys Pro Leu 145 150 155 160 Ala Gin Ala Ile Ala Asp Gly Asp Pro Ile Tyr Ala Ile Val Arg Gly
165 170 175 Ser Ala Val Asn Gin Asp Gly Arg Ser Asn Gly Leu Thr Ala Pro Asn
180 185 190 Arg Gln Ala Gin Glu Val Val Leu Arg Ala Ala Tyr Arg Asp Ala Gly
195 200 205 Ile Ser Pro Ala Asp Val Asp Ala Val Glu Ala His Gly Thr Gly
210 215 220 <210> 59 <211> 235 <212> PRT <213> Organime Inconnu <220> <223> Origine de la séquence :organisme du sol <400> 59 Pro Gln Gin Arg Val Phe Leu Glu Asp Ala Thr Glu Val Asp Val Asp
1 5 10 15 Ala Leu Ser Asp Gly Glu Asp Val Val Ile Ala Gly Ile Met Gin His
20 25 30 Ile Glu Glu Ala Gly Ile His Ser Gly Asp Ser Ser Cys Val Leu Pro
35 40 45 Pro Val Asp Ile Pro Pro Lys Ala Leu Gln Thr Ile Arg Asp His Thr
50 55 60 Phe Lys Leu Ala Arg Ala Leu Lys Val Ile Gly Leu Met Asn Val Gin
65 70 75 80 Tyr Ala Ile Gin Arg Asp Lys Val Tyr Val Ile Glu Val Asn Pro Arg
85 90 95 Ala Ser Arg Thr Val Pro Tyr Val Ser Lys Ala Thr Gly Val Pro Leu
100 105 110 Val Asp Thr Ala Ser Cys Ser Ser Leu Leu Val Ala Ile His Leu Ala Cys
85 90 95 Gin Ser Arg Arg Val Gly Glu Ala Glu Leu Val Ala Ala Ala Gly Val
100 105 110 Asn Leu Leu Leuk Pro Gly Leu Thr Asn Phe Thr Arg Ala Gly
115 120 125 Met Met Ala Pro Asp Gly Arg Cys Lys Asp Thr Phe Asp Ala Ala Ala Asn
130 135 140 Gly Tyr Val Arg Gly Glu Gly Ala Gly Val Val Val Leu Lys Pro Leu 145 150 155 160 Ala Gin Ala Island Ala Asp Gly Asp Pro Island Tyr Ala Island Val Arg Gly
165 170 175 Ser Ala Val Asn Gin Asp Gly Arg Ser Asn Gly Leu Thr Ala Pro Asn
180 185 190 Arg Gln Ala Gin Glu Val Val Leu Arg Ala Ala Tyr Arg Asp Ala Gly
195 200 205 Pro Ser Island Ala Asp Asp Val Ala Val Glu Ala His Gly Thr Gly
210 215 220 <210> 59 <211> 235 <212> PRT <213> Unknown organism <220><223> Origin of the sequence: soil organism <400> 59 Pro Gln Gin Arg Val Phe Leu Glu Asp Ala Thr Glu Val Asp Val Asp
1 5 10 15 Ala Leu Ser Asp Asp Gly Glu Val Val Val Ala Gly Island Met Gin His
20 25 30 Island Glu Glu Ala Gly Island His Ser Gly Asp Ser Ser Cys Val Leu Pro
35 40 45 Pro Val Asp Pro Island Pro Lys Ala Leu Gln Thr Arg Asp Asp Thr Island
50 55 60 Phe Lys Leu Ala Arg Ala Leu Lys Val Ile Gly Leu Met Asn Val Gin
65 70 75 80 Tyr Ala Island Gin Arg Asp Lys Val Tyr Val Glu Val Asn Pro Arg
85 90 95 Ala Ser Arg Thr Val Pro Tire Val Ser Lys Ala Thr Gly Val Pro Leu
100 105 110

Ala Lys Val Ala Ser Arg Leu Met Thr Gly Arg Lys Leu His Glu Leu
115 120 125 Leu Pro Glu Gly Val Glu Arg Gly Trp Ile Thr Thr Ala Gly Glu Asn
130 135 140 Phe Tyr Val Lys Ser Pro Val Phe Pro Trc Gly Lys Phe Pro Gly Val 145 150 155 160 Asp Thr Val Leu Gly Pro Glu Met Lys Ser Thr Gly Glu Val Met Gly
165 170 175 Val Ala Ala Lys Val Ala Ser Arg Leu Met Thr Gly Arg Lys Leu His Glu Leu
115 120 125 Leu Pro Glu Gly Val Glu Arg Gly Trp Thr Thr Island Ala Gly Glu Asn
130 135 140 Phe Tyr Val Lys Ser Pro Val Phe Pro Trc Gly Lys Phe Pro Gly Val 145 150 155 160 Asp Thr Val Leu Gly Pro Glu Met Lys Ser Thr Gly Glu Val Met Gly
165 170 175 Val Ala

* Asp Asn Phe Gly Glu Ala Phe Ala Lys Ala Gln Ile Ala Ala
180 185 190 Gly Thr Tyr Leu Pro Thr Glu Gly Thr Val Phe Ile Ser Val Asn Asp
195 200 205 Arg Asp Lys Gly Asn Val Ile Gln Leu Ala Gln Arg Phe Ser Glu Leu
210 215 220 Gly Phe Gly Ile Val Asp Thr His Gly Thr Gly 225 230 235 <210> 60 <211> 1269 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 60 taacaggaag aagcttgctt ctttgctgac gagtggcgga cgggtgagta acacgtggga 60 acctgcctta tggttcggga taacgtctgg aaacggacgc taacaccgga tgtgcccttc 120 gggggaaagt ttacgccatg agaggggccc gcgtccgatt aggtagttgg tggggtaatg 180 gcccaccaag ccgacgatcg gtagctggtc tgagaggatg atcagccaca ctgggactga 240 gacacggccc agactcctac gggaggcagc agtggggaat attggacaat gggggcaacc 300 ctgatccagc aatgccgcgt gagtgatgaa ggcctaggg ttgtaaagct ctttcgcacg 360 cgacgatgat gacggtagcg tgagaagaag ccccggctaa cttcgtgcca gcagccgcgg 420 taatacgaag ggggcgagcg ttgttcggaa ttactgggcg taaagggcgc gtaggcggcc 480 cgatcagtca gatgtgaaag ccccgggctc aacctgggaa ctgcatttga tactgtcggg 540 cttgagttcc ggagaggatg gtggaattcc cagtgtagag gtgaaattcg tagatattgg 600 gaagaacacc ggtggcgaag gcggccatct ggacggacac tgacgctgag gcgcgaaagc 660 gtggggagca aacaggatta gataccctgg tagtccacgc cgtaaacgat gaatgctaga 720 cgctggggtg catgcacttc ggtgtcgccg ctaacgcatt aagcattccg cctggggagt 780 acggccgcaa ggttaaaact caaaggaatt gacgggggcc cgcacaagcg gtggagcatg 840 tggtttaatt cgaagcaacg cgcagaacct taccaaccct tgacatgtcc attgccggtc 900 cgagagattg gaccttcagt tcggctggat ggaacacagg tgctgcatgg ctgtcgtcag 960 ctcgtgtcgt gagatgttgg gttaagtccc gcaacgagcg caacccctac cgccagttgc 1020 catcattcag ttgggcactc tggtggaact gccggtgaca agccggagga aggcggggat 1080 gacgtcaagt cctcatggcc cttatgggtt gggctacaca cgtgctacaa tagcggtgac 1140 agtgggacgc gaagtcgcaa gatggagcaa atccccaaaa gccgtctcag ttcggattgc 1200 actctgcaac tcgggtgcat gaagttggaa tcgctagtaa tcgcggatca gcacgccgcg 1260 gtgaatacg 1269 <210> 61 * Asp Asn Phe Gly Glu Ala Phe Ala Lys Ala Gln Ala Ala Island
180 185 190 Gly Thr Tyr Leu Pro Thr Glu Gly Thr Val Phe Val Asn Asp Asp
195 200 205 Arg Asp Lys Gly Asn Val Gln Leu Ala Gln Arg Phe Ser Glu Leu
210 215 220 Gly Phe Gly Asp Asp Thr His Gly Thr Gly 225 230 235 <210> 60 <211> 1269 <212> DNA <213> Unknown Organism <220><223> Origin of the sequence: Soil organism <400 > 60 taacaggaag aagcttgctt ctttgctgac gagtggcgga cgggtgagta acacgtggga 60 acctgcctta tggttcggga taacgtctgg aaacggacgc taacaccgga tgtgcccttc 120 gggggaaagt ttacgccatg agaggggccc gcgtccgatt aggtagttgg tggggtaatg 180 gcccaccaag ccgacgatcg gtagctggtc tgagaggatg atcagccaca ctgggactga 240 gacacggccc agactcctac gggaggcagc agtggggaat attggacaat gggggcaacc 300 ctgatccagc aatgccgcgt gagtgatgaa ggcctaggg ttgtaaagct ctttcgcacg 360 cgacgatgat gacggtagcg tgagaagaag ccccggctaa cttcgtgcca gcagccgcgg 420 taatacgaag ggggcgagcg ttgttcggaa ttactgggcg taaagggcgc gtaggcggcc 480 cgatcagtca gatgtgaaag ccccgggctc aacctgggaa ctgcatttga tactgtcggg 540 cttgagttcc ggagaggatg gtggaattcc cagtgtagag gtgaaattcg tagatattgg 600 gaagaacacc ggtggcgaag gcggccatct ggacggacac tgacgctgag gcgcgaaagc 660 gtggggagca aacaggatta gataccctgg tagtccacgc CGTA aacgat gaatgctaga 720 cgctggggtg catgcacttc ggtgtcgccg ctaacgcatt aagcattccg cctggggagt 780 acggccgcaa ggttaaaact caaaggaatt gacgggggcc cgcacaagcg gtggagcatg 840 tggtttaatt cgaagcaacg cgcagaacct taccaaccct tgacatgtcc attgccggtc 900 cgagagattg gaccttcagt tcggctggat ggaacacagg tgctgcatgg ctgtcgtcag 960 ctcgtgtcgt gagatgttgg gttaagtccc gcaacgagcg caacccctac cgccagttgc 1020 catcattcag ttgggcactc tggtggaact gccggtgaca agccggagga aggcggggat 1080 gacgtcaagt cctcatggcc cttatgggtt gggctacaca cgtgctacaa tagcggtgac 1140 agtgggacgc gaagtcgcaa gatggagcaa atccccaaaa gccgtctcag ttcggattgc 1200 actctgcaac tcgggtgcat gaagttggaa tcgctagtaa tcgcggatca gcacgccgcg 1260 gtgaatacg 1269 <210> 61

<211> 1500 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 61 ttttaaaacg acggccagtg aattgtaata cgactcacta tagggcgaat tgggccctct 60 agatgcatgc tcgagcggcc gccagtgtga tggatatctg cagaattcgc ccttcaggcc 120 taacacatgc aagtcgaacg agggcttcgg ccctagtggc gcacgggtga gtaacacgtg 180 ggaacctgcc ttatggttcg ggataacgc tggaaacgga cgctaacacc ggatgtgccc 240 ttcgggggaa agtttacgcc argagagggg cccgcgtccg attaggtagt tggtggggta 300 atggcccacc aagccgacga tcggtagctg gtctgagagg atgatcagcc acactgggac 360 tgagacacgg cccagactcc tacgggaggc agcagtgggg aatattggac aatgggggca 420 accctgatcc agcaatgccg cgtgagtgat gaaggcctta gggttgtaaa gctctttcgc 480 acgcgacgat gatgacggta gcgtgagaag aagccccggc taacttcgtg ccagcagccg 540 cggtaatacg aagggggcga gcgttgttcg gaattactgg gcgtaaaggg cgcgtaggcg 600 gcccgatcag tcagatgtga aagccccggg ctcaacctgg gaactgcatt tgatactgtc 660 gggcttgagt tccggagagg atggtggaat tcccagtgta gaggtgaaat tcgtagatat 720 tgggaagaac accggtggcg aaggcggcca tctggacgga cactgacgct gaggcgcgaa 780 agcgtgggga gcaaacagga ttagataccc tggtagtcca cgccgtaaac gatgaatgct 840 agacgctggg gtgcatgcac ttcggtgtcg ccgctaacgc attaagcatt ccgcctgggg 900 agtacggccg caaggttaaa actcaaagga attgacgggg gcccgcacaa gcggtggagc 960 atgtggttta attcgaagca acgcgcagaa ccttaccaac ccttgacatg tccattgccg 1020 gtccgagaga ttggaccttc agttcggctg gatggaacac aggtgctgca tggctgtcgt 1080 cagctcgtgt cgtgagatgt tgggttaagt cccgcaacga gcgcaacccc taccgccagt 1140 tgccatcatt cagttgggca ctctggtgga actgccggtg acaagccgga ggaaggcggg 1200 gatgacgtca agtcctcatg gcccttagg gttgggctac acacgtgcta caatggcggt 1260 gacagtggga cgcgaagtcg caagatggag caaatcccca aaagccgtct cagttcggat 1320 tgcactctgc aactcgggtg catgaagtg gaatcgctag taatcgcgga tcagcacgcc 1380 gcggtgaata cgttcccggg ccttgtacac accgcccaag ggcgaattcc agcacactgg 1440 cggccgttac tagtggatcc gagctcggta ccaagcttgg cgtaatcatg gtcatagctg 1500 <210> 62 <211> 1366 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 62 acgacggcca gtgaattgta atacgactca ctatagggcg aattgggccc tctagatgca 60 tgctcgagcg gccgccagtg tgatggatat ctgcagaatt cgcccttcag gcctaacaca 120 tgcaagtcga acgaaggctt cggccttagt ggcgcacggg tgagtaacac gtgggaacct 180 gcctttcggt tcggaataac gtctggaaac ggacgctaac accggatacg cccttcgggg 240 gaaagttcac gccgagagag gggcccgcgc cggattaggt agttggtgag gtaatggctc 300 accaagcctt cgatccgtag ctggtctgag aggatgatca gccacactgg gactgagaca 360 cggcccagac tcctacggga ggcagcagtg gggaatattg gacaatgggc gcaagcctga 420 tccagcaatg ccgcgtgagt gatgaaggcc ttagggttgt aaagctcttt cgcacgcgac 480 gatgatgacg gtagcgtgag aagaagcccc ggctaacttc gtgccagcag ccgcggtaat 540 acgaaggggg ctagcgttgt tcggaattac tgggcgtaaa gggcgcgtag gcggcctgct 600 tagtcagaag tgaaagcccc gggctcaacc tgggaatagc ttttgatact ggcaggcttg 660 agttccggag aggatggtgg aattcccag gtagaggtga aattcgtaga tattgggaag 720 aacaccggtg gcgaaggcgg ccatctggac ggacactgac gctgaggcgc gaaagcgtgg 780 ggagcaaaca ggattagata ccctggtagt: ccacgccgta aacgatgaat gctagacgtc 840 ggggtgcatg cacttcggtg tcgccgctaa cgcattaagc attccgcctg gggagtacgg 900 ccgcaaggtt aaaactcaaa ggaattgacg ggggcccgca caagcggtgg agcatgtggt 960 <211> 1500 <212> DNA <213> Organime Unknown <220> <223> Origin of sequence: Floor Organism <400> 61 ttttaaaacg acggccagtg aattgtaata cgactcacta tagggcgaat tgggccctct 60 agatgcatgc tcgagcggcc gccagtgtga tggatatctg cagaattcgc ccttcaggcc 120 taacacatgc aagtcgaacg agggcttcgg ccctagtggc gcacgggtga gtaacacgtg 180 ggaacctgcc ttatggttcg ggataacgc tggaaacgga cgctaacacc ggatgtgccc 240 ttcgggggaa agtttacgcc argagagggg cccgcgtccg attaggtagt tggtggggta 300 atggcccacc aagccgacga tcggtagctg gtctgagagg atgatcagcc acactgggac 360 tgagacacgg cccagactcc tacgggaggc agcagtgggg aatattggac aatgggggca 420 accctgatcc agcaatgccg cgtgagtgat gaaggcctta gggttgtaaa gctctttcgc 480 acgcgacgat gatgacggta gcgtgagaag aagccccggc taacttcgtg ccagcagccg 540 cggtaatacg aagggggcga gcgttgttcg gaattactgg gcgtaaaggg cgcgtaggcg 600 gcccgatcag tcagatgtga aagccccggg ctcaacctgg gaactgcatt tgatactgtc 660 gggcttgagt tccggagagg atggtggaat tcccagtgta gaggtgaaat tcgtagatat 720 tgggaagaac accggtggcg aaggcggcca tctggacgga cactgacgct gaggcgcgaa 780 agcgtgggga gcaaacagga ttagataccc tggtagtcca cgccgtaaac gatgaatgct 840 agacgctggg gtgcatgcac ttcggtgtcg ccgctaacgc attaagcatt ccgcctgggg 900 agtacggccg caaggttaaa actcaaagga attgacgggg gcccgcacaa gcggtggagc 960 atgtggttta attcgaagca acgcgcagaa ccttaccaac ccttgacatg tccattgccg 1020 gtccgagaga ttggaccttc agttcggctg gatggaacac aggtgctgca tggctgtcgt 1080 cagctcgtgt cgtgagatgt tgggttaagt cccgcaacga gcgcaacccc taccgccagt 1140 tgccatcatt cagttgggca ctctggtgga actgccggtg acaagccgga ggaaggcggg 1200 gatgacgtca agtcctcatg gcccttagg gttgggctac acacgtgcta caatggcggt 1260 gacagtggga cgcgaagtcg caagatggag caaatcccca aaagccgtct cagttcggat 1320 tgcactctgc aactcgggtg catgaagtg gaatcgctag taatcgcgga tcagcacgcc 1380 gcggtgaata cgttcccggg ccttgtacac accgcccaag ggcgaattcc agcacactgg 1440 cggccgttac tagtggatcc gagctcggta CCAAGCTTGG cgtaatcatg gtcatagctg 1500 <210> 62 <211> 1366 <212> DNA <213 > Unknown organism <220> <223> Origin of the sequence: Soil organism <400> 62 acgacggcca gtgaattgta atacgactca ctatagggcg aattgggccc tctagatgca 60 tgctcgagcg gccgccagtg tgatggatat ctgcagaatt cgcccttcag gcctaacaca 120 tgcaagtcga acgaaggctt cggccttagt ggcgcacggg tgagtaacac gtgggaacct 180 gcctttcggt tcggaataac gtctggaaac ggacgctaac accggatacg cccttcgggg 240 gaaagttcac gccgagagag gggcccgcgc cggattaggt agttggtgag gtaatggctc 300 accaagcctt cgatccgtag ctggtctgag aggatgatca gccacactgg gactgagaca 360 cggcccagac tcctacggga ggcagcagtg gggaatattg gacaatgggc gcaagcctga 420 tccagcaatg ccgcgtgagt gatgaaggcc ttagggttgt aaagctcttt cgcacgcgac 480 gatgatgacg gtagcgtgag aagaagcccc ggctaacttc gtgccagcag ccgcggtaat 540 acgaaggggg ctagcgttgt tcggaattac tgggcgtaaa gggcgcgtag gcggcctgct 600 tagtcagaag tgaaagcccc gggctcaacc tgggaatagc ttttgatact ggcaggcttg 660 agttccggag aggatggtgg aattcccag gtagaggtga aattcgtaga tattgggaag 720 aacaccggtg gcgaaggcgg ccatctggac ggacactgac gctgaggcgc gaaagcgtgg 780 ggagcaaaca ggattagata ccctggtagt: ccacgccgta aacgatgaat gctagacgtc 840 ggggtgcatg cacttcggtg tcgccgctaa cgcattaagc attccgcctg gggagtacgg 900 ccgcaaggtt aaaactcaaa ggaattgacg ggggcccgca caagcggtgg agcatgtggt 960

ttaattcgaa gcaacgcgca gaaccttacc aacccttgac atgtccatta tgggcttcag 1020 agatgaggtc cttcagttcg gctgggtgga acacaggtgc tgcatggctg tcgtcagctc 1080 gtgtcgtgag atgttgggtt aagtcccgca acgagcgcaa cccctaccgt cagttgccat 1140 cattcagttg ggcactctgg tggaaccgcc ggtgacaagc cggaggaagg cggggatgac 1200 gtcaagtcct catggccctt atgggttggg ctacacacgt gctacaatgg cggtgacagt 1260 gggaagcgaa gtcgcgagat ggagcaaatc cccaaaagcc gtctcagttc ggatcgcact 1320 ctgcaactcg agtgcgtgaa gttggaatcg ctagtaatcg cggatc 1366 <210> 63 <211> 1360 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 63 acagctatga ccatgattac gccaagcttg gtaccgagct cggatccact agtaacggcc 60 gccagtgtgc tggaattcgc ccttcaggcc taacacatgc aagtcgaacg ccccgcaagg 120 ggagtggcag acgggtgagt aacgcgtggg aacataccct ttcctgcgga atagctccgg 180 gaaactggaa ttaataccgc atacgcccta cgggggaaag atttatcggg gaaggattgg 240 cccgcgttgg attagctagt tggtggggta aaggcctacc aaggcgacga tccatagctg 300 gtctgagagg atgatcagcc acattgggac tgagacacgg cccaaactcc tacgggaggc 360 agcagtgggg aatattggac aatgggcgca agcctgatcc agccatgccg cgtgagtgat 420 gaaggcctta gggttgtaaa gctctttcac cggagaagat aatgacggta tccggagaag 480 aagccccggc taacttcgtg ccagcagccg cggtaatacg aagggggcta gtgttgttcg 540 gaattactgg gcgtaaagcg cacgtaggcg gatatttaag tcaggggtga aatcccagag 600 ctcaactctg gaactgcctt tgatactggg tatcttgagt atggaagagg taagtggaat 660 tccgagtgta gaggtgaaat tcgtagatat tcggaggaac accagtggcg aaggcggctt 720 actggtccat tactgacgct gaggtgcgaa agcgtgggga gcaaacagga ttagataccc 780 tggtagtcca cgccgtaaac gatgaatgtt agccgtcggg cagtatactg ttcggtggcg 840 cagctaacgc attaaacatt ccgcctgggg agtacggtcg caagattaaa actcaaagga 900 attgacgggg gcccgcacaa gcggtggagc atgtggttta attcgaagca acgcgcagaa 960 ccttaccagc tcttgacatt cggggtttgg gcagtggaga cattgtcct cagttaggct 1020 ggccccagaa caggtgctgc atggctgtcg tcagctcgcg tcgtgagag ttgggttaag 1080 tcccgcaacg agcgcaaccc tcgcccttag ttgccagcat ttagttgggc actctaaggg 1140 gactgccggt gataagccga gaggaaggtg gggacgacgt caagtcctca tggcccttac 1200 gggctgggct acacacgtgc tacaatggtg gtgacagtgg gcagcgagac agcgatgtcg 1260 agctaatctc caaaagccat ctcagttcgg attgcactct gcaaccgag tgcatgaagt 1320 tggaatcgct agtaatcgca gatcagcatg tgcggtgaat 1360 <210> 64 <211> 1288 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 64 tccaggaaac agctatgacc atgattacgc caagcttggt accgagctcg gatccactag 60 taacggccgc cagtgtgctg gaattcgccc ttcaggccta acacaLgcaa gtcgagcgcc 120 ccgcaagggg agcggcagac gggtgagtaa cgcgtgggaa tctacccatc cctacggaac 180 aactccggga aactggagct aataccgtat acgccctttg ggggaaagat ttatcgggga 240 tggatgagcc cgcgttggat tagctagttg gtggggtaaa ggcctaccaa ggcgacgatc 300 catagctggt ctgagaggat gatcagccac attgggactg agacacggcc caaactccta 360 cgggaggcag cagtggggaa tattggacaa tgggcgcaag cctgatccag ccatgcccgc 420 gtgagtgatg aaggtcttag gattgtaaag ctctttcacc ggagaagata atgacggtat 480 ttaattcgaa gcaacgcgca gaaccttacc aacccttgac atgtccatta tgggcttcag 1020 agatgaggtc cttcagttcg gctgggtgga acacaggtgc tgcatggctg tcgtcagctc 1080 gtgtcgtgag atgttgggtt aagtcccgca acgagcgcaa cccctaccgt cagttgccat 1140 cattcagttg ggcactctgg tggaaccgcc ggtgacaagc cggaggaagg cggggatgac 1200 gtcaagtcct catggccctt atgggttggg ctacacacgt gctacaatgg cggtgacagt 1260 gggaagcgaa gtcgcgagat ggagcaaatc cccaaaagcc gtctcagttc ggatcgcact 1320 ctgcaactcg agtgcgtgaa gttggaatcg ctagtaatcg cggatc 1366 <210 > 63 <211> 1360 <212> DNA <213> Unknown Organ <220> <223> Origin of the sequence: Soil organism <400> 63 acagctatga ccatgattac gccaagcttg gtaccgagct cggatccact agtaacggcc 60 gccagtgtgc tggaattcgc ccttcaggcc taacacatgc aagtcgaacg ccccgcaagg 120 ggagtggcag acgggtgagt aacgcgtggg aacataccct ttcctgcgga atagctccgg 180 gaaactggaa ttaataccgc atacgcccta cgggggaaag atttatcggg gaaggattgg 240 cccgcgttgg attagctagt tggtggggta aaggcctacc aaggcgacga tccatagctg 300 gtctgagagg atgatcagcc acattgggac tgagacacgg cccaaactc c tacgggaggc 360 agcagtgggg aatattggac aatgggcgca agcctgatcc agccatgccg cgtgagtgat 420 gaaggcctta gggttgtaaa gctctttcac cggagaagat aatgacggta tccggagaag 480 aagccccggc taacttcgtg ccagcagccg cggtaatacg aagggggcta gtgttgttcg 540 gaattactgg gcgtaaagcg cacgtaggcg gatatttaag tcaggggtga aatcccagag 600 ctcaactctg gaactgcctt tgatactggg tatcttgagt atggaagagg taagtggaat 660 tccgagtgta gaggtgaaat tcgtagatat tcggaggaac accagtggcg aaggcggctt 720 actggtccat tactgacgct gaggtgcgaa agcgtgggga gcaaacagga ttagataccc 780 tggtagtcca cgccgtaaac gatgaatgtt agccgtcggg cagtatactg ttcggtggcg 840 cagctaacgc attaaacatt ccgcctgggg agtacggtcg caagattaaa actcaaagga 900 attgacgggg gcccgcacaa gcggtggagc atgtggttta attcgaagca acgcgcagaa 960 ccttaccagc tcttgacatt cggggtttgg gcagtggaga cattgtcct cagttaggct 1020 ggccccagaa caggtgctgc atggctgtcg tcagctcgcg tcgtgagag ttgggttaag 1080 tcccgcaacg agcgcaaccc tcgcccttag ttgccagcat ttagttgggc actctaaggg 1140 gactgccggt gataagccga gaggaaggtg gggacgacgt caagtcctca tggcccttac 1200 g ggctgggct acacacgtgc tacaatggtg gtgacagtgg gcagcgagac agcgatgtcg 1260 agctaatctc caaaagccat ctcagttcgg attgcactct gcaaccgag tgcatgaagt 1320 tggaatcgct agtaatcgca gatcagcatg tgcggtgaat 1360 <210> 64 <211> 1288 <212> DNA <213> Unknown Organ <220> <223> Origin of the sequence: Soil organism <400> 64 tccaggaaac agctatgacc atgattacgc caagcttggt accgagctcg gatccactag 60 taacggccgc cagtgtgctg gaattcgccc ttcaggccta acacaLgcaa gtcgagcgcc 120 ccgcaagggg agcggcagac gggtgagtaa cgcgtgggaa tctacccatc cctacggaac 180 aactccggga aactggagct aataccgtat acgccctttg ggggaaagat ttatcgggga 240 tggatgagcc cgcgttggat tagctagttg gtggggtaaa ggcctaccaa ggcgacgatc 300 catagctggt ctgagaggat gatcagccac attgggactg agacacggcc caaactccta 360 cgggaggcag cagtggggaa tattggacaa tgggcgcaag cctgatccag ccatgcccgc 420 gtgagtgatg aaggtcttag gattgtaaag ctctttcacc ggagaagata atgacggtat 480

ccggagaaga agccccggct aactttcgtg ccagcagccg cggtaatacg aagggggcta 540 gcgttgttcg gaattactgg gcgtaaagcg cacgtaggcç gatatttaag tcaggggtga 600 aatcccagag ctcaactctg gaactgcctt tgatactggg tatcttgagt atggaagagg 660 taagtggaat tgcgagtgta gaggtgaaat tcgtagata tcgcaggaac accagtggcg 720 aaggcggctt actggtccat tactgacgct gaggtgcgaa agcgtgggga gcaaacagga 780 ttagataccc tggtagtcca cgccgtaaac gatgaatgtt agccgtcggc aagtttactt 840 gtcggtggcg cagctaacgc attaaacatt ccgcctgggg agtacggtcg caagattaaa 900 actcaaagga attgacgggg gcccgcacaa gcggtggagc atgtggttta attcgaagca 960 acgcgcagaa ccttaccagc ccttgacatg cccggacagc tacagagatg tagtgttccc 1020 ttcggggacc gggacacagg tgctgcatgg ctgtcgtcag ctcgtgtcgt gagatgttgg 1080 gttaagtccc gcaacgagcg caaccctcgc ccttagttgc cagcattcag ttgggcactc 1140 taaggggact gccggtgata agccgagagg aagtgggga-- gacgtcaagt cctnatggcc 1200 cttacgggct gggctacaca cgtgctacaa tgggtggtga cagtgggcag cgaaggaacg 1260 atcccgagct aatctccaaa agccatct 1288 <210> 65 <211> 1386 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 65 cgacggccag tgaattgtaa tacgactcac tatagggcga attgggccct ctagatgcat 60 gctcgagcgg ccgccagtgt gatggatatc tgcagaattc gcccttcagg cctaacacat 120 gcaagtcgag cgggcgtagc aatacgtcag cggcagacgg gtgagtaacg cgtgggaaca 180 taccttttgg ttcggaacaa cacagggaaa cttgtgctaa taccggataa gcccttacgg 240 ggaaagattt atcgccgaaa gattggcccg cgtctgata gctagttggt agggtaatgg 300 cctaccaagg cgacgatcag tagctggtct gagaggatga tcagccacat tgggactgag 360 acacggccca aactcctacg ggaggcagca gtggggaata ttggacaatg ggcgcaagcc 420 tgatccagcc atgccgcgtg agtgatgaag gccctaggg,: tgtaaagctc ttttgtgcgg 480 gaagataatg acggtaccgc aagaataagc cccggctaac ttcgtgccag càgccgcggt 540 aatacgaagg gggctagcgt tgctcggaat cactgggcg-: aaagggtgcg taggcgggtc 600 tttaagtcag gggtgaaatc ctggagctca actccagaac tgcctttgat actgaagatc 660 ttgagttcgg gagaggtgag tggaactgcg agtgtagagg tgaaattcgt agatattcgc 720 aagaacacca gtgggcgaag gcggctcact ggcccgatac tgacgctgag gcacgaaagc 780 gtggggagca aacaggatta gataccctgg tagtccacgc cgtaaacgat gaatgccagc 840 cgttagtggg tttactcact agtggcgcag ctaacgctt aagcattccg cctggggagt 900 acggtcgcaa gattaaaact caaaggaatt gacgggggcc cgcacaagcg gtggagcatg 960 tggtttaatt cgacgcaacg cgcagaacct taccagccc tgacatgtcc aggaccggtc 1020 gcagagatgt gaccttctct tcggagcctg gagcacaggt gctgcatggc tgtcgtcagc 1080 tcgtgtcg=g agatgttggg ttaagtcccg caacgagcgc aacccccgtc cttagttgct 1140 accatttagt tgagcactct aaggagactg ccggtgataa gccgcgagga aggtggggat 1200 gacgtcaagt cctcatggcc cttacgggct gggctacaca cgtgctacaa tggcggtgac 1260 aatgggacgc taaggggcaa cccttcgcaa atctcaaaaa gcccgtctca gttcggattg 1320 ggctctgcaa ctcgagccca tgaagttgga atcgctagta atcgtggatc agcacgccac 1380 ggtgaa 1386 <210> 66 <211> 1223 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 66 ccggagaaga agccccggct aactttcgtg ccagcagccg cggtaatacg aagggggcta 540 gcgttgttcg gaattactgg gcgtaaagcg cacgtaggcç gatatttaag tcaggggtga 600 aatcccagag ctcaactctg gaactgcctt tgatactggg tatcttgagt atggaagagg 660 taagtggaat tgcgagtgta gaggtgaaat tcgtagata tcgcaggaac accagtggcg 720 aaggcggctt actggtccat tactgacgct gaggtgcgaa agcgtgggga gcaaacagga 780 ttagataccc tggtagtcca cgccgtaaac gatgaatgtt agccgtcggc aagtttactt 840 gtcggtggcg cagctaacgc attaaacatt ccgcctgggg agtacggtcg caagattaaa 900 actcaaagga attgacgggg gcccgcacaa gcggtggagc atgtggttta attcgaagca 960 acgcgcagaa ccttaccagc ccttgacatg cccggacagc tacagagatg tagtgttccc 1020 ttcggggacc gggacacagg tgctgcatgg ctgtcgtcag ctcgtgtcgt gagatgttgg 1080 gttaagtccc gcaacgagcg caaccctcgc ccttagttgc cagcattcag ttgggcactc 1140 taaggggact gccggtgata agccgagagg aagtgggga-- gacgtcaagt cctnatggcc 1200 cttacgggct gggctacaca cgtgctacaa tgggtggtga cagtgggcag cgaaggaacg 1260 atcccgagct aatctccaaa agccatct 1288 <210> 65 <211> 1386 <212> DNA <213> Organism Inc Onnu <220> <223> Origin of sequence: Soil Organism <400> 65 cgacggccag tgaattgtaa tacgactcac tatagggcga attgggccct ctagatgcat 60 gctcgagcgg ccgccagtgt gatggatatc tgcagaattc gcccttcagg cctaacacat 120 gcaagtcgag cgggcgtagc aatacgtcag cggcagacgg gtgagtaacg cgtgggaaca 180 taccttttgg ttcggaacaa cacagggaaa cttgtgctaa taccggataa gcccttacgg 240 ggaaagattt atcgccgaaa gattggcccg cgtctgata gctagttggt agggtaatgg 300 cctaccaagg cgacgatcag tagctggtct gagaggatga tcagccacat tgggactgag 360 acacggccca aactcctacg ggaggcagca gtggggaata ttggacaatg ggcgcaagcc 420 tgatccagcc atgccgcgtg agtgatgaag gccctaggg ,: tgtaaagctc ttttgtgcgg 480 gaagataatg acggtaccgc aagaataagc cccggctaac ttcgtgccag càgccgcggt 540 aatacgaagg gggctagcgt tgctcggaat cactgggcg-: aaagggtgcg taggcgggtc 600 tttaagtcag gggtgaaatc ctggagctca actccagaac tgcctttgat actgaagatc 660 ttgagttcgg gagaggtgag tggaactgcg agtgtagagg tgaaattcgt agatattcgc 720 aagaacacca gtgggcgaag gcggctcact ggcccgatac tgacgctgag gcacgaaagc 780 gtggggagca aacaggatta gataccctgg tagtccacgc cgtaaacgat gaatgccagc 840 cgttagtggg tttactcact agtggcgcag ctaacgctt aagcattccg cctggggagt 900 acggtcgcaa gattaaaact caaaggaatt gacgggggcc cgcacaagcg gtggagcatg 960 tggtttaatt cgacgcaacg cgcagaacct taccagccc tgacatgtcc aggaccggtc 1020 gcagagatgt gaccttctct tcggagcctg gagcacaggt gctgcatggc tgtcgtcagc 1080 tcgtgtcg = g agatgttggg ttaagtcccg caacgagcgc aacccccgtc cttagttgct 1140 accatttagt tgagcactct aaggagactg ccggtgataa gccgcgagga aggtggggat 1200 gacgtcaagt cctcatggcc cttacgggct gggctacaca cgtgctacaa tggcggtgac 1260 aatgggacgc taaggggcaa cccttcgcaa atctcaaaaa gcccgtctca gttcggattg 1320 ggctctgcaa ctcgagccca tgaagttgga atcgctagta atcgtggatc agcacgccac 1380 ggtgaa 1386 <210> 66 <211> 1223 <212> DNA <213> Organime Unknown <220> <223> Origin of the sequence: Soil organism <400> 66

agcggcagag ggtgagtaac gcgtgggaat ctacccatct ctacggaaca actccgggaa 60 actggagcta ataccgtata cgtccttcgg gagaaagatt tatcggagat ggatgagccc 120 gcgttggatt agctagttgg tggggtaatg gcctaccaag gcgacgatcc atagctggtc 180 tgagaggatg atcagccaca ctgggactga gacacggccc agactcctac gggaggcagc 240 agtggggaat attggacaat gggcgaaagc ccgatccagc catgccgcgt gagtgatgaa 300 ggccctaggg ttgtaaagct ctttcaacgg tgaggataat gacggtaacc gtagaagaag 360 ccccggctaa cttcgrgcca gcagccgcgg taatacgaag ggggctagcg ttgttcggaa 420 ttactgggcg taaagcgcac gtaggcggac tattaagtca ggggtgaaat cccggggctc 480 aaccccggaa ctgcctttga tactggtagt ctcgagtccg gaagaggtga gtggaattcc 540 gagtgtagag gtgaaattcg tagatattcg gaggaacacc agtggcgaag gcggctcact 600 ggtccggtac tgacgctgag gtgcgaaagc gtggggagca aacaggatta gataccctgg 660 tagtccacgc cgtaaacgat ggaagctagc cgttggcaag tttacttgtc ggtggcgcag 720 ctaacgcatt aagcttcccg cctggggagt acggtcgcaa gattaaaact caaaggaatt 780 gacgggggcc cgcacaagcg gtggagcatg tggtttaatt cgaagcaacg cgcagaacct 840 taccagccct tgacatcccg gtcgcggtta ccagagatgg tatccttcag ttcggctgga 900 ccggtgacag gtgctgcatg gctgtcgtca gctcgtgtcg tgagatgttg ggttaagtcc 960 cgcaacgagc gcaaccctcg cccttagttg ccagcattca gttgggcact ctaaggggac 1020 tgccggtgat aagccgagag gaaggtgggg atgacgtcaa gtcctcatgg cccttacggg 1080 ctgggctaca cacgtgctac aatggtggtg acagtgggca gcgagaccgc gaggtcgagc 1140 taatctccaa aagccatctc agttcggatt gcactctgca actcgagtgc atgaagttgg 1200 aatcgctagt aatcgcggat cag 1223 <210> 67 <211> 1237 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 67 cccgcagggg agtggcagag ggtgagtaac gcgtgggaat ctaccctttt ctacggaaca 60 actgagggaa acttcagcta ataccgtata cggccgagag gcgaaagatt tatcggagaa 120 ggatgagccc gcgttggatt agctagttgg tggggtaaag gcctaccaag gcgacgatcc 180 atagctggtc tgagaggatg atcagccaca ctgggactga gacacggccc agactcctac 240 gggaggcagc agtggggaat attggacaat gggcgcaagc ctgatccagc catgccgcgt 300 gagtgatgaa ggccctaggg ttgtaaagct ctttcaccgg tgaagataat gacggtaacc 360 ggagaagaag ccccggctaa cttcgtgcca gcagccgcgg taatacgaag ggggctagcg 420 ttgttcggat ttactgggcg taaagcgcac gtaggcggac tattaagtca ggggtgaaat 480 cccggggctc aaccccggaa ctgcctttga tactggtagt cttgagttcg aaagaggtga 540 gtggaattcc gagtgtagag gtgaaattcg tagatattcg gaggaacacc agtggcgaag 600 gcggctcact ggctcgatac tgacgctgag gtgcgaaagc gtggggagca aacaggatta 660 gataccctgg tagtccacgc cgtaaactat gagagctagg cgtcgggcag tatactgttc 720 ggtggcgcag ctaacgcatt aagctcttcg cccggggagt acggtcgcaa gattaaaact 780 caaaggaatt gacgggggcc cgcacaagcg gtggagcatg tggtttaatt cgaagcaacg 840 cgcagaacct taccagccct tgacatcccg atcgcggtta ccagagatgg tatccttcag 900 ttaggctgga tcggtgacag gtgctgcatg gctgtcgtca gctcgtgtcg tgagatgttg 960 ggttaagtcc cgcaacgagc gcaaccctcg cccttagttg ccatcattca gttgggcact 1020 ctaaggggac tgccggtgat aagccgagag gaaggtgggg atgacgtcaa gtcctcatgg 1080 cccttacggg ctgggctaca cacgtgctac aatggtggcg acagtgggca gcgagaccgc 1140 gaggtcgagc taatctccaa aagccatctc agttcggatt gcactctgca actcgagtgc 1200 atgaagttgg aatcgctagt aatcgtggat cagaatg 1237 <210> 68 <211> 1346 <212> ADN <213> Organime Inconnu agcggcagag ggtgagtaac gcgtgggaat ctacccatct ctacggaaca actccgggaa 60 actggagcta ataccgtata cgtccttcgg gagaaagatt tatcggagat ggatgagccc 120 gcgttggatt agctagttgg tggggtaatg gcctaccaag gcgacgatcc atagctggtc 180 tgagaggatg atcagccaca ctgggactga gacacggccc agactcctac gggaggcagc 240 agtggggaat attggacaat gggcgaaagc ccgatccagc catgccgcgt gagtgatgaa 300 ggccctaggg ttgtaaagct ctttcaacgg tgaggataat gacggtaacc gtagaagaag 360 ccccggctaa cttcgrgcca gcagccgcgg taatacgaag ggggctagcg ttgttcggaa 420 ttactgggcg taaagcgcac gtaggcggac tattaagtca ggggtgaaat cccggggctc 480 aaccccggaa ctgcctttga tactggtagt ctcgagtccg gaagaggtga gtggaattcc 540 gagtgtagag gtgaaattcg tagatattcg gaggaacacc agtggcgaag gcggctcact 600 ggtccggtac tgacgctgag gtgcgaaagc gtggggagca aacaggatta gataccctgg 660 tagtccacgc cgtaaacgat ggaagctagc cgttggcaag tttacttgtc ggtggcgcag 720 ctaacgcatt aagcttcccg cctggggagt acggtcgcaa gattaaaact caaaggaatt 780 gacgggggcc cgcacaagcg gtggagcatg tggtttaatt cgaagcaacg cgcagaacct 840 taccagccct tgacatcccg gtcgcggtta ccagagatgg tatccttcag ttcggctgga 900 ccggtgacag gtgctgcatg gctgtcgtca gctcgtgtcg tgagatgttg ggttaagtcc 960 cgcaacgagc gcaaccctcg cccttagttg ccagcattca gttgggcact ctaaggggac 1020 tgccggtgat aagccgagag gaaggtgggg atgacgtcaa gtcctcatgg cccttacggg 1080 ctgggctaca cacgtgctac aatggtggtg acagtgggca gcgagaccgc gaggtcgagc 1140 taatctccaa aagccatctc agttcggatt gcactctgca actcgagtgc atgaagttgg 1200 aatcgctagt aatcgcggat cag 1223 <210> 67 <211 > 1237 <212> DNA <213> Organime Unknown <220> <223> Origin of sequence: floor Organism <400> 67 cccgcagggg agtggcagag ggtgagtaac gcgtgggaat ctaccctttt ctacggaaca 60 actgagggaa acttcagcta ataccgtata cggccgagag gcgaaagatt tatcggagaa 120 ggatgagccc gcgttggatt agctagttgg tggggtaaag gcctaccaag gcgacgatcc 180 atagctggtc tgagaggatg atcagccaca ctgggactga gacacggccc agactcctac 240 gggaggcagc agtggggaat attggacaat gggcgcaagc ctgatccagc catgccgcgt 300 gagtgatgaa ggccctaggg ttgtaaagct ctttcaccgg tgaagataat gacggtaacc 360 ggagaagaag ccccggctaa cttcgtgcc has gcagccgcgg taatacgaag ggggctagcg 420 ttgttcggat ttactgggcg taaagcgcac gtaggcggac tattaagtca ggggtgaaat 480 cccggggctc aaccccggaa ctgcctttga tactggtagt cttgagttcg aaagaggtga 540 gtggaattcc gagtgtagag gtgaaattcg tagatattcg gaggaacacc agtggcgaag 600 gcggctcact ggctcgatac tgacgctgag gtgcgaaagc gtggggagca aacaggatta 660 gataccctgg tagtccacgc cgtaaactat gagagctagg cgtcgggcag tatactgttc 720 ggtggcgcag ctaacgcatt aagctcttcg cccggggagt acggtcgcaa gattaaaact 780 caaaggaatt gacgggggcc cgcacaagcg gtggagcatg tggtttaatt cgaagcaacg 840 cgcagaacct taccagccct tgacatcccg atcgcggtta ccagagatgg tatccttcag 900 ttaggctgga tcggtgacag gtgctgcatg gctgtcgtca gctcgtgtcg tgagatgttg 960 ggttaagtcc cgcaacgagc gcaaccctcg cccttagttg ccatcattca gttgggcact 1020 ctaaggggac tgccggtgat aagccgagag gaaggtgggg atgacgtcaa gtcctcatgg 1080 cccttacggg ctgggctaca cacgtgctac aatggtggcg acagtgggca gcgagaccgc 1140 gaggtcgagc taatctccaa aagccatctc agttcggatt gcactctgca actcgagtgc 1200 atgaagttgg aatcgctagt aatcgtggat cagaatg 1237 < 210> 68 <211> 1346 <212> DNA <213> Unknown Organism

<220> <223> Origine de la séquence :Organisme du sol <400> 68 acgacgggcc agtgaattgt aatacgactc actatagggc gaattgggcc ctctagatgc 60 atgctcgagc ggccgccagt gtgatggata tccgcagaat tcgcccttca ggcctaacac 120 atgcaagtcg aacggatccc ttcggattag tggcggacgg gtgagtaaca cgcgggaacg 180 tgccctttgg ttcggaacaa ctcagggaaa cttgagctaa taccggataa gcctttcgag 240 ggaaagattt atcgccattg gagcggcccg cgtaggatta gctagttggt gaggtaaaag 300 ctcaccaagg cgacgatccc tagctggtct gagaggatga tcagccacat tgggactgag 360 acacggccca aactcctacg ggaggcagca gtggggaatc ttgcgcaatg ggcgaaagcc 420 tgacgcagcc atgccgcgtg aatgatgaag gtcttaggat tgtaaaattc tttcaccggg 480 gacgataatg acggtacccg gagaagaagc cccggctaac ttcgtgccag cagccgcggt 540 aatacgaagg gggctagcgt tgctcggaat tactgggcgt aaagggagcg taggcggata 600 gtttagtcag aggtgaaagc ccagggctca accttggaat tgcctttgat actggctatc 660 ttgagtacgg aagaggtatg tggaactccg agtgtagagg tgaaattcgt agatattcgg 720 aagaacacca gtggcgaagg cgacatactg gtccgttact gacgctgagg ctcgaaagcg 780 tggggagcaa acaggattag ataccctggt agtccacgct gtaaacgatg agtgctagtt 840 gtcggcatgc atgcatgtcg gtggcgcagc taacgcatta agcactccgc ctggggagta 900 cggtcgcaag attaaaactc aaaggaattg acgggggccc gcacaagcgg tggagcatgt 960 ggtttaattc gaagcaacgc gcagaacctt accacctttt gacatgcccg gaccgctcca 1020 gagatggagc tttcccttcg gggactggga cacaggtgct gcatggctgt cgtcagctcg 1080 tgtcgtgaga tgttgggtta agtcccgcaa cgagcgcaac cctcgctatt agttgccatc 1140 aggtttggct gggcactcta ataggaccgc cggtggtaag ccggaggaag gtggggatga 1200 cgtcaagtcc tcatggccct tacaaggtgg gctacacacg tgctacaatg gcgactacag 1260 agggctgcaa tcccgcgagg gggagccaat ccctaaaagt cgtctcagtt cggattgcac 1320 tctgcaactc gagtgcatga agttgg 1346 <210> 69 <211> 1500 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 69 acagctatga ccatgattac gccaagcttg gtaccgagct cggatccact agtaacggcc 60 gccagtgtgc tggaattcgc ccttcaggcc taacacatgc aagtcgaacg ccgtagcaat 120 acggagtggc agacgggtga gtaacacgtg ggaacgtgcc ctttggttcg gaacaacaca 180 gggaaacttg tgctaatacc gaataagccc ttacggggaa agattatcg ccaaaggatc 240 ggcccgcgtc tgattagcta gttggtgggg taacggccca ccaaggctac gatcagtagc 300 tggtctgaga ggatgatcag ccacactggg actgagacac ggcccagact cctacgggag 360 gcagcagtta ggaatcttgg acaatgggcg caagcctgat ccagccatgc cgcgtgagtg 420 atgaaggcct tagggttgta aagctctttc agcggggaag ataatgacgg tacccgcaga 480 agaagccccg gctaacttcg tgccagcagc cgcggraata cgaagggggc tagcgttgct 540 cggaatcact gggcgtaaag cgcacgtagg cggatcttta agtcaggggt gaaatcctgg 600 agctcaactc cagaactgcc tttgatactg gggatctcga gtccggaaga ggtgagtgga 660 actccgagtg tagaggtgaa attcgtagat attcggaaga acaccagtgg cgaaggcggc 720 tcactggtcc ggtactgacg ctgaggtgcg aaagcgtggg gagcaaacag gattagatac 780 cctggtagtc cacgccgtaa acgatggatg ctagccgttg gcgggtttac tcgtcagtgg 840 cgcagctaac gcattaagca tcccgcctgg ggagtacggt cgcaagatta aaactcaaag 900 gaattgacgg gggcccgcac aagcggtgga gcatgtggtt caattcgaag caacgcgcag 960 aaccttacca gcccttgaca tgtcccgtat ggacttcaga gatgaggtcc ttcagttcgg 1020 ctggcgggaa cacaggtgct gcatggctgt cgtcagctcg tgtcgtgaga tgttgggtta 1080 agtcccgcaa cgagcgcaac cctcgccctt agttgccatc atttagttgg gcactctaag 1140 gggactgccg gtgataagcc gcgaggaagg tggggatgac gtcaagtcct catggccctt 1200 acgggctggg ctacacacgt gctacaatgg cggtgacagt gggacgcaat ggagcaatcc 1260 tgcgcaaatc tcaaaaagcc gtctcagttc ggattggggt ctgcaactcg accccatgaa 1320 <220> <223> Origin of sequence: Floor Organism <400> 68 acgacgggcc agtgaattgt aatacgactc actatagggc gaattgggcc ctctagatgc 60 atgctcgagc ggccgccagt gtgatggata tccgcagaat tcgcccttca ggcctaacac 120 atgcaagtcg aacggatccc ttcggattag tggcggacgg gtgagtaaca cgcgggaacg 180 tgccctttgg ttcggaacaa ctcagggaaa cttgagctaa taccggataa gcctttcgag 240 ggaaagattt atcgccattg gagcggcccg cgtaggatta gctagttggt gaggtaaaag 300 ctcaccaagg cgacgatccc tagctggtct gagaggatga tcagccacat tgggactgag 360 acacggccca aactcctacg ggaggcagca gtggggaatc ttgcgcaatg ggcgaaagcc 420 tgacgcagcc atgccgcgtg aatgatgaag gtcttaggat tgtaaaattc tttcaccggg 480 gacgataatg acggtacccg gagaagaagc cccggctaac ttcgtgccag cagccgcggt 540 aatacgaagg gggctagcgt tgctcggaat tactgggcgt aaagggagcg taggcggata 600 gtttagtcag aggtgaaagc ccagggctca accttggaat tgcctttgat actggctatc 660 ttgagtacgg aagaggtatg tggaactccg agtgtagagg tgaaattcgt agatattcgg 720 aagaacacca gtggcgaagg cgacatactg gtccgtact gacgctgagg ctcgaaagcg 780 tggggagcaa acaggattag ataccc TGGT agtccacgct gtaaacgatg agtgctagtt 840 gtcggcatgc atgcatgtcg gtggcgcagc taacgcatta agcactccgc ctggggagta 900 cggtcgcaag attaaaactc aaaggaattg acgggggccc gcacaagcgg tggagcatgt 960 ggtttaattc gaagcaacgc gcagaacctt accacctttt gacatgcccg gaccgctcca 1020 gagatggagc tttcccttcg gggactggga cacaggtgct gcatggctgt cgtcagctcg 1080 tgtcgtgaga tgttgggtta agtcccgcaa cgagcgcaac cctcgctatt agttgccatc 1140 aggtttggct gggcactcta ataggaccgc cggtggtaag ccggaggaag gtggggatga 1200 cgtcaagtcc tcatggccct tacaaggtgg gctacacacg tgctacaatg gcgactacag 1260 agggctgcaa tcccgcgagg gggagccaat ccctaaaagt cgtctcagtt cggattgcac 1320 tctgcaactc gagtgcatga agttgg 1346 <210> 69 <211> 1500 <212> DNA <213> Unknown Organ <220> <223> Origin of the sequence: Soil organism <400> 69 acagctatga ccatgattac gccaagcttg gtaccgagct cggatccact agtaacggcc 60 gccagtgtgc tggaattcgc ccttcaggcc taacacatgc aagtcgaacg ccgtagcaat 120 acgagtggc agacgggtga gtaacacgtg ggaacgtgcc ctttggttcg gaacaacaca 180 gggaaacttg tgctaatacc gaataagccc ttacggggaa agattatcg ccaaaggatc 240 ggcccgcgtc tgattagcta gttggtgggg taacggccca ccaaggctac gatcagtagc 300 tggtctgaga ggatgatcag ccacactggg actgagacac ggcccagact cctacgggag 360 gcagcagtta ggaatcttgg acaatgggcg caagcctgat ccagccatgc cgcgtgagtg 420 atgaaggcct tagggttgta aagctctttc agcggggaag ataatgacgg tacccgcaga 480 agaagccccg gctaacttcg tgccagcagc cgcggraata cgaagggggc tagcgttgct 540 cggaatcact gggcgtaaag cgcacgtagg cggatcttta agtcaggggt gaaatcctgg 600 agctcaactc cagaactgcc tttgatactg gggatctcga gtccggaaga ggtgagtgga 660 actccgagtg tagaggtgaa attcgtagat attcggaaga acaccagtgg cgaaggcggc 720 tcactggtcc ggtactgacg ctgaggtgcg aaagcgtggg gagcaaacag gattagatac 780 cctggtagtc cacgccgtaa acgatggatg ctagccgttg gcgggtttac tcgtcagtgg 840 cgcagctaac gcattaagca tcccgcctgg ggagtacggt cgcaagatta aaactcaaag 900 gaattgacgg gggcccgcac aagcggtgga gcatgtggtt caattcgaag caacgcgcag 960 aaccttacca gcccttgaca tgtcccgtat ggacttcaga gatgaggtcc ttcagttcgg 1020 ctggcgggaa cacaggtgct gcatggctgt cgtcagctcg tgtcgtgag 1080 has tgttgggtta agtcccgcaa cgagcgcaac cctcgccctt agttgccatc atttagttgg gcactctaag 1140 gggactgccg gtgataagcc gcgaggaagg tggggatgac gtcaagtcct catggccctt 1200 acgggctggg ctacacacgt gctacaatgg cggtgacagt gggacgcaat ggagcaatcc 1260 tgcgcaaatc tcaaaaagcc gtctcagttc ggattggggt ctgcaactcg accccatgaa 1320

gtcggaatcg ctagtaatcg cagatcagca cgctgcggtg aatacgttcc cgggccttgt 1380 acacaccgcc caagggcgaa ttctgcagat atccatcaca ctggcggccg ctcgagcatg 1440 catctagagg gcccaattcg ccctatagtg agtcgtatta caattcactg gccgtcgttt 1500 <210> 70 <211> 1113 <212> ADN <213> Organimè Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 70 gagctaatac cgtataatga cttcggtcca aagatttatc gcctgaggat gagcccgcgt 60 cggattagct agttggtagg gtaaaagcct accaaggcga cgatccgtag ctggtctgag 120 aggatgatca gccacactgg gactgagaca cggcccagac tcctacggga ggcagcagtg 180 gggaatattg gacaatgggc gcaagcctga tccagcaatg ccgcgtgagt gatgaaggcc 240 ttagggttgt aaagctcttt tacccgggaa gataatgact gtaccgggag aataagcccc 300 ggctaactcc gtgccagcag ccgcggtaat acggaggggg ctagcgttgt tcggaattac 360 tgggcgtaaa gcgcacgtag gcggctttgt aagttagagg tgaaagcccg gggctcaact 420 ccggaattgc ctttaagact gcatcgctcg aattgtggag aggtaagtgg aattccgagt 480 gtagaggtga aattcgtaga tattcggaag aacaccagtg gcgaaggcga cttactggac 540 acatattgac gctgaggtgc gaaagcgtgg ggagcaaaca ggattagata ccctggtagt 600 ccacgccgta aacgatgatg actagctgtc ggggcgctta gcgtttcggt ggcgcagcta 660 acgcgttaag tcatccgcct ggggagtacg gccgcaaggt taaactcaaa gaaattgacg 720 ggggcctgca caagcggtgg agcatgtggt ttaattcgaa gcaacgcgca gaaccttacc 780 agcgtttgac atgccaggac ggtttccaga gatggattcc ttcccttacg ggacctggac 840 acaggtgctg catggctgtc gtcagctcgt gtcgtgagat gttgggttaa gtcccgcaac 900 gagcgcaacc ctcgtcttta gttgctacca tttagttgag cactctagag aaactgccgg 960 tgataagccg gaggaaggtg gggatgacgt caagtcctca tggcccttac gcgctgggct 1020 acacacgtgc tacaatggcg gtgacaacgg gcagcaaact cgcgagagtg agcaaatccc 1080 gaaaagccgt ctcagttcgg attgttctct gca 1113 <210> 71 <211> 1225 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 71 ggagcggcgg acgggtgagt aacgcgtggg aacgtgccct ttggtacgga acaactgagg 60 gaaacttcag ctaataccgt atgtgccctt cgggggaaag atttatcgcc attggagcgg 120 cccgcgttgg attaggtagt tggtggggta aaggcctacc aagcctacga tccatagctg 180 gtctgagagg atgatcagcc acactgggac tgagacacgg cccagactcc tacgggaggc 240 agcagtaggg aatcttgcgc aatgggcgaa agcctgacgc agccatgccg cgtgtatgat 300 gaaggtctta ggattgtaaa atactttcac cggggaagat aatgacggta cccggagaag 360 aagccccggc taacttcgtg ccagcagccg cggtaatacg aagggggcta gcgttgctcg 420 gaattactgg gcgtaaaggg cgcgtaggcg gatatttaag tcgggggtga aagcccaggg 480 ctcaaccctg gaattgcctt cgatactgga tatcttgagt tcgggagagg tgagtggaat 540 gccgagtgta gaggtgaaat tcgtagatat tcggcggaac accagtggcg aaggcgactc 600 actggcccga tactgacgct gaggcgcgaa agcgtgggga gcaaacagga ttagataccc 660 tggtagtcca cgctgtaaac gatgagtgct agttgtcggc atgcatgcat gtcggtgacg 720 cagctaacgc attaagcactccgcctgggg agtacggtcg caagattaaa actcaaagga 780 attgacgggg gcccgcacaa gcggtggagc atgtggttta attcgaagca acgcgcagaa 840 ccttaccacc ttttgacatg ccctgatcgc tggagagatc cagttttccc ttcggggaca 900 gggacacagg tgctgcatgg ctgtcgtcag ctcgtgtcgt gagatgttgg gttaagtccc 960 gtcggaatcg ctagtaatcg cagatcagca cgctgcggtg aatacgttcc cgggccttgt 1380 acacaccgcc caagggcgaa ttctgcagat atccatcaca ctggcggccg ctcgagcatg 1440 catctagagg gcccaattcg ccctatagtg agtcgtatta caattcactg gccgtcgttt 1500 <210> 70 <211> 1113 <212> DNA <213> Organimè Unknown <220> <223> Origin of sequence: Organization soil <400> 70 gagctaatac cgtataatga cttcggtcca aagatttatc gcctgaggat gagcccgcgt 60 cggattagct agttggtagg gtaaaagcct accaaggcga cgatccgtag ctggtctgag 120 aggatgatca gccacactgg gactgagaca cggcccagac tcctacggga ggcagcagtg 180 gggaatattg gacaatgggc gcaagcctga tccagcaatg ccgcgtgagt gatgaaggcc 240 ttagggttgt aaagctcttt tacccgggaa gataatgact gtaccgggag aataagcccc 300 ggctaactcc gtgccagcag ccgcggtaat acggaggggg ctagcgttgt tcggaattac 360 tgggcgtaaa gcgcacgtag A ccggaattgc ggtggggtgt ggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggfggggggggg ca ggattagata ccctggtagt 600 ccacgccgta aacgatgatg actagctgtc ggggcgctta gcgtttcggt ggcgcagcta 660 acgcgttaag tcatccgcct ggggagtacg gccgcaaggt taaactcaaa gaaattgacg 720 ggggcctgca caagcggtgg agcatgtggt ttaattcgaa gcaacgcgca gaaccttacc 780 agcgtttgac atgccaggac ggtttccaga gatggattcc ttcccttacg ggacctggac 840 acaggtgctg catggctgtc gtcagctcgt gtcgtgagat gttgggttaa gtcccgcaac 900 gagcgcaacc ctcgtcttta gttgctacca tttagttgag cactctagag aaactgccgg 960 tgataagccg gaggaaggtg gggatgacgt caagtcctca tcccttac gcgctgggct 1020 acacacgtgc tacaatggcg gtgacaacgg gcagcaaact cgcgagagtg agcaaatccc 1080 gaaaagccgt ctcagttcgg attgttctct gca 1113 <210> 71 <211> 1225 <212> DNA <213> Unknown Organ <220> <223> Origin of the sequence: Soil organism <400> 71 ggagcggcgg acgggtgagt aacgcgtggg aacgtgccct ttggtacgga acaactgagg 60 gaaacttcag ctaataccgt atgtgccctt cgggggaaag atttatcgcc attggagcgg 120 cccgcgttgg attaggtagt tggtggggta aaggcctacc aagcctacga tccatagctg 180 gtctgagagg atgatcagcc acactgggac tgagacacg g cccagactcc tacgggaggc 240 agcagtaggg aatcttgcgc aatgggcgaa agcctgacgc agccatgccg cgtgtatgat 300 gaaggtctta ggattgtaaa atactttcac cggggaagat aatgacggta cccggagaag 360 aagccccggc taacttcgtg ccagcagccg cggtaatacg aagggggcta gcgttgctcg 420 gaattactgg gcgtaaaggg cgcgtaggcg gatatttaag tcgggggtga aagcccaggg 480 ctcaaccctg gaattgcctt cgatactgga tatcttgagt tcgggagagg tgagtggaat 540 gccgagtgta gaggtgaaat tcgtagatat tcggcggaac accagtggcg aaggcgactc 600 actggcccga tactgacgct gaggcgcgaa agcgtgggga gcaaacagga ttagataccc 660 tggtagtcca cgctgtaaac gatgagtgct agttgtcggc atgcatgcat gtcggtgacg 720 cagctaacgc attaagcactccgcctgggg agtacggtcg caagattaaa actcaaagga 780 attgacgggg gcccgcacaa gcggtggagc atgtggttta attcgaagca acgcgcagaa 840 ccttaccacc ttttgacatg ccctgatcgc tggagagatc cagttttccc ttcggggaca 900 gggacacagg tgctgcatgg ctgtcgtcag ctcgtgtcgt gagatgttgg gttaagtccc 960

gcaacgagcg caaccctcgc cattagttgc catcattaag ttgggcactc taatgggacc 1020 gccggtggta agccggagga aggtggggat gacgtcaagt cctcatggcc cttacggggt 1080 gggctacaca cgtgctacaa tggcgactac agagggttgc aaacctgcga aggggagcta 1140 atccctaaaa gtcgtctcag ttcggattgc actctgcaac tcgagtgcat gaagtcggaa 1200 tcgctagtaa tcgcggatca gcatg 1225 <210> 72 <211> 1286 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 72 atgattagta gcaatactaa tcgatgacga gcggcggacg ggtgagtaat acgtaggaac 60 ctgcccttaa gcgggggata actaagggaa actttagcta ataccgcata aactcgagag 120 agaaaagctg cagcaatgtg gcacttgagg aggggcctgc gtcagattag ctagttggtg 180 aggtaatagc tcaccaaggc gatgatctgt aactggtctg agaggacgac cagtcacact 240 gggactgaga cacggcccag actcctacgg gaggcagcag tggggaatat tggacaatgg 300 gggcaaccct gatccagcga tgccgcgtgg gtgaagaagg ccttcgggtt gtaaagccct 360 ttaggtcggg aagaaggtta gtagaggaaa tgctattaac ttgacggtac cgacagaata 420 agcaccggca aactctgtgc cagcagccgc ggtaatacag agggtgcgag cgttaatcgg 480 atttactggg cgtaaagggc gcgtaggcgg tgagatgtgt gtgatgtgaa agccccaggc 540 tcaacctggg aagtgcatcg caaactgtct gactggagta tatgagaggg tggcggaatt 600 tccggtgtag cggtgaaatg cgtagagatc ggaaggaacg tcgatggcga aggcagccac 660 ctggcataat actgacgctg aggcgcgaaa gcgtggggat cgaacaggat tagataccct 720 ggtagtccac gctgtaaact atgagtacta gatgttggta ggggaaccta tcggtatcga 780 agctaacgcg ataagtattc cgcctgggaa gtacggccgc aaggttgaaa ctcaaatgaa 840 ttgacggggg cccgcacaag cggtggagca tgtggtttaa ttcgatgcaa cgcgaagaac 900 cttacctacc cttgacatcc tgagaatctg gcttagtagc tggagtgccg aaaggagctc 960 agagacaggt gctgcatggc tgtcgtcagc tcgtgttgtg agatgttggg ttaagtcccg 1020 taacgagcgc aacccttgcc cttagttgcc atcatttagt tggggactct aaggggaccg 1080 ccagtgatga actggaggaa ggcggggacg acgtcaagtc atcatggcct ttatgggtag 1140 ggccacacac gtgctacaat ggggcgtacg gagggtcgca aacccgcgag ggggagctaa 1200 tctcataaag cgtctcgtag tccggattgg agtctgcaac tcgactccat gaagttggaa 1260 tcgctagtaa tcgcgaatca gcattg 1286 <210> 73 <211> 1288 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence : :Organisme du sol <400> 73 cggggcaacc ctggcggcga gcggcgaacg ggtgagtaat gcatcggaac gtgtcctctt 60 gtgggggata accagtcgaa agactggcta ataccgcatg agatcgaaag atgaaagcag 120 gggaccgcaa ggccttgcgc gagaggagca gccgatgccg gattagctag ttggtggggt 180 aaaagcctac caaggcgacg atccgtagct ggtctgagag gacgaccagc cacactggga 240 ctgagacacg gcccagactc ctacgggagg cagcagtggg gaattttgga cagtgggggc 300 aaccctgatc cagccatgcc gcgtgtgtga agaaggcctt cgggttgtaa agcactttcg 360 gacggaacga aatcgcgcga gttaatagtt cgcgtggatg acggtaccgt aagaagaagc 420 accggctaac tacgtgccag cagccgcggt aatacgtagg gtgcgagcgt taatcggaat 480 tactgggcgt aaagtgtgcg caggcggctt cgcaagtcga gtgtgaaatc cccgagctta 540 acttgggaat tgcgctcgaa actacggagc cggagtgtgg cagaggaagg tggaattcca 600 cgtgtagcgg tgaaatgcgt agagatgtgg aggaacaccg atggcgaagg cggccttctg 660 gcaacgagcg caaccctcgc cattagttgc catcattaag ttgggcactc taatgggacc 1020 gccggtggta agccggagga aggtggggat gacgtcaagt cctcatggcc cttacggggt 1080 gggctacaca cgtgctacaa tggcgactac agagggttgc aaacctgcga aggggagcta 1140 atccctaaaa gtcgtctcag ttcggattgc actctgcaac tcgagtgcat gaagtcggaa 1200 tcgctagtaa tcgcggatca gcatg 1225 <210> 72 <211> 1286 <212> DNA <213> Organime Unknown < 220> <223> sequence Origin: soil Organism <400> 72 atgattagta gcaatactaa tcgatgacga gcggcggacg ggtgagtaat acgtaggaac 60 ctgcccttaa gcgggggata actaagggaa actttagcta ataccgcata aactcgagag 120 agaaaagctg cagcaatgtg gcacttgagg aggggcctgc gtcagattag ctagttggtg 180 aggtaatagc tcaccaaggc gatgatctgt aactggtctg agaggacgac cagtcacact 240 gggactgaga cacggcccag actcctacgg gaggcagcag tggggaatat tgccgcgtg tgccgcgtgg tgccgcgtgg tgccgcgtgg gtgaagaagg ccttcgggtt gtaaagccct 360 ttaggtcggg aagaaggtta gtagaggaaa tgctattaac ttgacggtac cgacagaata 420 agcaccggca aactctgtgc cagcagccgc ggtaatacag agggtgcgag cgttaatcgg 480 atttactg gg cgtaaagggc gcgtaggcgg tgagatgtgt gtgatgtgaa agccccaggc 540 tcaacctggg aagtgcatcg caaactgtct gactggagta tatgagaggg tggcggaatt 600 tccggtgtag cggtgaaatg cgtagagatc ggaaggaacg tcgatggcga aggcagccac 660 ctggcataat actgacgctg aggcgcgaaa gcgtggggat cgaacaggat tagataccct 720 ggtagtccac gctgtaaact atgagtacta gatgttggta ggggaaccta tcggtatcga 780 agctaacgcg ataagtattc cgcctgggaa gtacggccgc aaggttgaaa ctcaaatgaa 840 ttgacggggg cccgcacaag cggtggagca tgtggtttaa ttcgatgcaa cgcgaagaac 900 cttacctacc cttgacatcc tgagaatctg gcttagtagc tggagtgccg aaaggagctc 960 agagacaggt gctgcatggc tgtcgtcagc tcgtgttgtg agatgttggg ttaagtcccg 1020 taacgagcgc aacccttgcc cttagttgcc atcatttagt tggggactct aaggggaccg 1080 ccagtgatga actggaggaa ggcggggacg acgtcaagtc atcatggcct ttatgggtag 1140 ggccacacac gtgctacaat ggggcgtacg gagggtcgca aacccgcgag ggggagctaa 1200 tctcataaag cgtctcgtag tccggattgg agtctgcaac tcgactccat gaagttggaa 1260 tcgctagtaa tcgcgaatca gcattg 1286 <210> 73 < 211> 1288 <212> DNA <213> Unknown Organism <220> <223> Origin of sequence: Floor Organism <400> 73 cggggcaacc ctggcggcga gcggcgaacg ggtgagtaat gcatcggaac gtgtcctctt 60 gtgggggata accagtcgaa agactggcta ataccgcatg agatcgaaag atgaaagcag 120 gggaccgcaa ggccttgcgc gagaggagca gccgatgccg gattagctag ttggtggggt 180 aaaagcctac caaggcgacg atccgtagct ggtctgagag gacgaccagc cacactggga 240 ctgagacacg gcccagactc ctacgggagg cagcagtggg gaattttgga cagtgggggc 300 aaccctgatc cagccatgcc gcgtgtgtga agaaggcctt cgggttgtaa agcactttcg 360 gacggaacga aatcgcgcga gttaatagtt cgcgtggatg acggtaccgt aagaagaagc 420 accggctaac tacgtgccag cagccgcggt aatacgtagg gtgcgagcgt taatcggaat 480 tactgggcgt aaagtgtgcg caggcggctt cgcaagtcga gtgtgaaatc cccgagctta 540 acttgggaat tgcgctcgaa actacggagc cggagtgtgg cagaggaagg tggaattcca 600 cgtgtagcgg tgaaatgcgt agagatgtgg aggaacaccg atggcgaagg cggccttctg 660

ggccaacact gacgctcatg cacgaaagcg tggggagcaa acaggattag ataccctggt 720 agtccacgcc ctaaacgatg atgactagtt gttggaggag ttaaatcctt tagtaacgca 780 gctaacgcgt gaagtcatcc gcctggggag tacggtcgca agattaaaac tcaaaggaat 840 tgacgggggc ccgcacaagc ggtggatgat gtggtttaat tcgatgcaac gcgaaaaacc 900 ttacctaccc ttgacatgct aggaacgctg cagaaatgta gcggtgcccg aaagggaacc 960 tagacacagg tgctgcatgg ctgtcgtcag ctcgrgtcgt gagatgttgg gttaagtccc 1020 gcaacgagcg caacccctgc cattagttgc tacattcagt tgagcactct aatgggactg 1080 ccggtgacaa accggaggaa ggtggggatg acgtcaagtc ctcatggccc ttatgggtag 1140 ggctacacac gtcatacaat ggcgcgtaca gagggttgcc aacccgcgag ggggagccaa 1200 tcccagaaag cgcgtcgtag tccggattgg agtctgcaac tcgactccca tgaagtcgga 1260 atcgctagta atcgcggatc agcatgtc 1288 <210> 74 <211> 600 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 74 cgtgccagca gccgcggtaa tacgtaggtg gcaagcgttg tccggaatta ttgggcgtaa 60 agcgcgcgca ggtggtttct taagtctgat gtgaaagccc acggcttaac cgtggagggt 120 cattggaaac tgggagactt gagtgcagaa gaggaaagtg gaattccaag tgtagcggtg 180 aaatgcgtag agatttggag gaacaccagt ggcgaaggcg actttctggt ctgcaactga 240 cgctgaggcg cgaaagcatg gggagcaaac aggattagat accctggtag tccatgccgt 300 aaacgatgag tgctaagtgt tagggggttt ccgcccctta gtgctgcagc taacgcatta 360 agcactccgc ctggggagta cgaccgcaag gttgaaactc aaaggaattg acgggggccc 420 gcacaagcgg tggagcatgt ggtttaattc gaagcaacgc gaagaacctt accaggtctt 480 gacatcccga tgancgctct agagatagag ttttcccttc ggggacattg gtgacaggtg 540 gtgcatggtt gtcgtcagct cgtgtcgtga gatgttgggt taagtcccgc aacgagcgca 600 <210> 75 <211> 601 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence Organisée du sol <400> 75 cgtgccagca gccgcggtaa tacgtaggtg gcaagcgttg tccggaatta ttgggcgtaa 60 agcgcgcgca ggtggtttct taagtctgat gtgaaagccc acggcttaac cgtggagggt 120 cattggaaac tgggagactt gagtgcagaa gaggaaagtg gaattccaag tgtagcggtg 180 aaatgcgtag agatttggag gaacaccagt ggcgaaggcg actttctggt ctgcaactga 240 cgctgaggcg cgaaagcatg gggagcaaac aggattagat accctggtag tccatgccgt 300 aaacgatgag tgctaagtgt tagggggttt ccgcccctta gtgctgagct aacgcattaa 360 gcactccgcc tggggagtac gaccgcaagg ttgaaactca aaggaattga cgggggcccg 420 cacaagcggt ggagcatgtg gtttaattcg aagcaacgcg aagaacctta ccaggtcttg 480 acatcccgat gacgctctag agatagagtt ttcccttcgg ggacattggt gacaggtggt 540 gcatggttgt cgtcagctcg tgtcgtgaga tgttgggtta agtcccgcaa cgagcgcacc 600 c " 601 <210> 76 <211> 1236 <212> ADN <213> Organime Inconnu ggccaacact gacgctcatg cacgaaagcg tggggagcaa acaggattag ataccctggt 720 agtccacgcc ctaaacgatg atgactagtt gttggaggag ttaaatcctt tagtaacgca 780 gctaacgcgt gaagtcatcc gcctggggag tacggtcgca agattaaaac tcaaaggaat 840 tgacgggggc ccgcacaagc ggtggatgat gtggtttaat tcgatgcaac gcgaaaaacc 900 ttacctaccc ttgacatgct aggaacgctg cagaaatgta gcggtgcccg aaagggaacc 960 tagacacagg tgctgcatgg ctgtcgtcag ctcgrgtcgt gagatgttgg gttaagtccc 1020 gcaacgagcg caacccctgc cattagttgc tacattcagt tgagcactct aatgggactg 1080 ccggtgacaa accggaggaa ggtggggatg acgtcaagtc ctcatggccc ttatgggtag 1140 ggctacacac gtcatacaat ggcgcgtaca gagggttgcc aacccgcgag ggggagccaa 1200 tcccagaaag cgcgtcgtag tccggattgg agtctgcaac tcgactccca tgaagtcgga 1260 atcgctagta atcgcggatc agcatgtc 1288 <210> 74 <211> 600 <212> DNA <213> Organime Unknown <220> <223> Origin of sequence: Soil organism <400> 74 cgtgccagca gccgcggtaa tacgtaggtg gcaagcgttg tccggaatta ttgggcgtaa 60 agcgcgcgca ggtggtttct taagtctgat gtgaaagccc acggcttaac cgtggagggt 120 cattg gaaac tgggagactt gagtgcagaa gaggaaagtg gaattccaag tgtagcggtg 180 aaatgcgtag agatttggag gaacaccagt ggcgaaggcg actttctggt ctgcaactga 240 cgctgaggcg cgaaagcatg gggagcaaac aggattagat accctggtag tccatgccgt 300 aaacgatgag tgctaagtgt tagggggttt ccgcccctta gtgctgcagc taacgcatta 360 agcactccgc ctggggagta cgaccgcaag gttgaaactc aaaggaattg acgggggccc 420 gcacaagcgg tggagcatgt ggtttaattc gaagcaacgc gaagaacctt accaggtctt 480 gacatcccga tgancgctct agagatagag ttttcccttc ggggacattg gtgacaggtg 540 gtgcatggtt gtcgtcagct cgtgtcgtga gatgttgggt taagtcccgc aacgagcgca 600 <210> 75 <211> 601 <212> DNA <213> Organime Unknown <220> <223> Origin of Organized sequence soil <400> 75 cgtgccagca gccgcggtaa tacgtaggtg gcaagcgttg tccggaatta ttgggcgtaa 60 agcgcgcgca ggtggtttct taagtctgat gtgaaagccc acggcttaac cgtggagggt 120 cattggaaac tgggagactt gagtgcagaa gaggaaagtg gaattccaag tgtagcggtg 180 aaatgcgtag agatttggag gaacaccagt ggcgaaggcg actttctggt ctgcaactga 240 cgctgaggcg cgaaagcatg gggagcaaac aggattagat accctgg tag tccatgccgt 300 aaacgatgag tgctaagtgt tagggggttt ccgcccctta gtgctgagct aacgcattaa 360 gcactccgcc tggggagtac gaccgcaagg ttgaaactca aaggaattga cgggggcccg 420 cacaagcggt ggagcatgtg gtttaattcg aagcaacgcg aagaacctta ccaggtcttg 480 acatcccgat gacgctctag agatagagtt ttcccttcgg ggacattggt gacaggtggt 540 gcatggttgt cgtcagctcg tgtcgtgaga tgttgggtta agtcccgcaa cgagcgcacc 600 c '601 <210> 76 <211> 1236 < 212> DNA <213> Unknown Organism

<220> <223> Origine de la séquence :Organisme du sol <400> 76 tgccctgtag acggggataa cttcgggaaa ccggagctaa taccggataa tcctcttccc 60 cacatgggga agagttgaaa ggcgctttcg cgtcactaca ggatgggccc gcggtgcatt 120 agctagttgg tagggtaacg gcctaccaag gcgacgatgc atagccgacc tgagagggtg 180 atcggccaca ttgggactga gacacggccc aaactcctac gggaggcagc agtagggaat 240 cttccacaat ggacgaaagt ctgatggagc aacgccgcgt gagtgatgaa ggttttcgga 300 tcgtaaaact ctgttgtaag ggaagaacca gtacgtcagg caatggacgt accttgacgg 360 taccttatta gaaagccacg gctaactacg tgccagcagc cgcggtaata cgtaggtggc 420 aagcgttgtc cggaattatt gggcgtaaag cgcgcgcagg tggtttctta agtctgatgt 480 gaaagcccac ggcttaaccg tggagggtca ttggaaactg ggagacttga gtgcagaaga 540 ggaaagtgga attccaagtg tagcggcgaa atgcgtagag atttggagga acaccagtgg 600 cgaaggcgac tttctggtct gcaactgacg ctgaggcgcg aaagcatggg gagcaaacag 660 gattagatac cctggtagtc catgctgtaa acgatgagtg ctaagtgtta gggggtttcc 720 gccccttagt gctgcagcta acgcattaag cactccgcct ggggagtacg accgcaaggt 780 tgaaactcaa aggaattgac gggggcccgc acaagcggtg gagcatgtgg tttaattcga 840 agcaacgcga agaaccttac caggtcttga catcccgatg atcgctctgg agatagagtt 900 ttcccttcgg ggacattggt gacaggtggt gcatggttgt cgtcagctcg tgtcgtgaga 960 tgttgggtta agtcccgcaa cgagcgcaac ccttaatctt agttgccatc atttagttgg 1020 gcactctaag gtgactgccg gtgataaacc ggaggaaggt ggggatgacg tcaaatcatc 1080 atgcccctta tgacctgggc tacacacgtg ctacaatgga cggtacaaag agtcgctaac 1140 tcgcgagagt atgctaatct catagaaccg ttctcagttc ggattgtagg ctgcaactcg 1200 cctacatgaa gccggaatcg ctagtaatcg cggatc 1236 <210> 77 <211> 815 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 77 caagcgttgt ccggaattat tgggcgtaaa gagctcgtag gcggtttgtc gcgtctgctg 60 tgaaaactcg aggctcaacc tcgggcttgc agtgggtacg ggcagactag agtgcggtag 120 gggtgactgg aattcctggt gtagcggtgg aatgcgcaga tatcaggagg aacaccgatg 180 gcgaaggcag gtcactgggc cgcaactgac gctgaggagc gaaagcatgg ggagcgaaca 240 ggattagata ccctggtagt ccatgccgta aacgttgggc actaggtgtg gggctcattc 300 cacgagttcc gtgccgcagc aaacgcatta agtgccccgc ctggggagta cggccgcaag 360 gcttaaaact caaagaaatt gacgggggcc cgcacaagcg gcggagcatg cggattaatt 420 cgatgcaacg cgaagaacct taccaaggct tgacatacac cggaaacttc cagagatggt 480 tgccccgcaa ggtcggtgta caggtggtgc atggttgtcg tcagctcgtg tcgtgaagat 540 gttgggttaa gtcccgcaac gagcgcaacc ctcgtcctat gttgccagca cgtgatggtg 600 gggactcata ggagactgcc ggggtcaact cggaggaagg tggggatgac gtcaaatcat 660 catgcccctt atgtcttggg cttcacgcat gctacaatgg ccggtacaaa gggctgcgat 720 accgcaaggt ggagcgaatc ccaaaaagcc ggtctcagtt cggattgggg tctgcaactc 780 gaccccatga agtcggagtc gctagtaatc gcaga 815 <210> 78 <211> 826 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <220> <223> Origin of sequence: Floor Organism <400> 76 tgccctgtag acggggataa cttcgggaaa ccggagctaa taccggataa tcctcttccc 60 cacatgggga agagttgaaa ggcgctttcg cgtcactaca ggatgggccc gcggtgcatt 120 agctagttgg tagggtaacg gcctaccaag gcgacgatgc atagccgacc tgagagggtg 180 atcggccaca ttgggactga gacacggccc aaactcctac gggaggcagc agtagggaat 240 cttccacaat ggacgaaagt ctgatggagc aacgccgcgt gagtgatgaa ggttttcgga 300 tcgtaaaact ctgttgtaag ggaagaacca gtacgtcagg caatggacgt accttgacgg 360 taccttatta gaaagccacg gctaactacg tgccagcagc cgcggtaata cgtaggtggc 420 aagcgttgtc cggaattatt gggcgtaaag cgcgcgcagg tggtttctta agtctgatgt 480 gaaagcccac ggcttaaccg tggagggtca ttggaaactg ggagacttga gtgcagaaga 540 ggaaagtgga attccaagtg tagcggcgaa atgcgtagag atttggagga acaccagtgg 600 cgaaggcgac tttctggtct gcaactgacg ctgaggcgcg aaagcatggg gagcaaacag 660 gattagatac cctggtagtc catgctgtaa acgatgagtg ctaagtgtta gggggtttcc 720 gccccttagt gctgcagcta acgcattaag cactccgcct ggggagtacg accgcaaggt 780 tgaaactcaa aggaattgac gggggc CCGC acaagcggtg gagcatgtgg tttaattcga 840 agcaacgcga agaaccttac caggtcttga catcccgatg atcgctctgg agatagagtt 900 ttcccttcgg ggacattggt gacaggtggt gcatggttgt cgtcagctcg tgtcgtgaga 960 tgttgggtta agtcccgcaa cgagcgcaac ccttaatctt agttgccatc atttagttgg 1020 gcactctaag gtgactgccg gtgataaacc ggaggaaggt ggggatgacg tcaaatcatc 1080 atgcccctta tgacctgggc tacacacgtg ctacaatgga cggtacaaag agtcgctaac 1140 tcgcgagagt atgctaatct catagaaccg ttctcagttc ggattgtagg ctgcaactcg 1200 cctacatgaa gccggaatcg ctagtaatcg cggatc 1236 <210> 77 <211> 815 <212> DNA <213> Unknown Organ <220> <223> Origin of the sequence: Soil organism <400> 77 caagcgttgt ccggaattat tgggcgtaaa gagctcgtag gcggtttgtc gcgtctgctg 60 tgaaaactcg aggctcaacc tcgggcttgc agtgggtacg ggcagactag agtgcggtag 120 gggtgactgg aattcctggt gtagcggtgg aatgcgcaga tatcaggagg aacaccgatg 180 gcgaaggcag gtcactgggc cgcaactgac gctgaggagc gaaagcatgg ggagcgaaca 240 ggattagata ccctggtagt ccatgccgta aacgttgggc actaggtgtg gggctcattc 300 cacgagttcc gtgccgcagc aaa cgcatta agtgccccgc ctggggagta cggccgcaag 360 gcttaaaact caaagaaatt gacgggggcc cgcacaagcg gcggagcatg cggattaatt 420 cgatgcaacg cgaagaacct taccaaggct tgacatacac cggaaacttc cagagatggt 480 tgccccgcaa ggtcggtgta caggtggtgc atggttgtcg tcagctcgtg tcgtgaagat 540 gttgggttaa gtcccgcaac gagcgcaacc ctcgtcctat gttgccagca cgtgatggtg 600 gggactcata ggagactgcc ggggtcaact cggaggaagg tggggatgac gtcaaatcat 660 catgcccctt atgtcttggg cttcacgcat gctacaatgg ccggtacaaa gggctgcgat 720 accgcaaggt ggagcgaatc ccaaaaagcc ggtctcagtt cggattgggg tctgcaactc 780 gaccccatga agtcggagtc gctagtaatc gcaga 815 <210> 78 <211> 826 <212> DNA <213> Unknown Organ <220> <223> Origin of the sequence: Soil organism

<400> 78 tcgtaggtgg cttgtcacgt cgggtgtgaa agcttggggc ttaactccag gtctgcattc 60 gatacgggct ggctagaggt aggtagggga gaacggaatt cctggtgtag cggtgaaatg 120 cgcagatatc aggaggaaca ccggtggcga aggcggttct ctgggcctta cctgacgctg 180 aggagcgaaa gcgtggggag cgaacaggat tagataccct ggtagtccac gctgtaaacg 240 ttgggcgcta ggtgtgggga ccttccacgg tttccgcgcc gtagctaacg cattaagcgc 300 cccgcctggg gagtacggcc gcaaggctaa aactcaaagg aattgacggg ggcccgcaca 360 agcggcggag catgttgctt aattcgacgc aacgcgaaga accttaccaa ggcttgacat 420 cgcccggaaa gcttcagaga tggagccctc ttcggactgg gtgacaggtg gtgcatggct 480 gtcgtcagct cgtgtcgtga gatgttgggt taagtcccgc aacgagcgca acccttgttc 540 aatgttgcca gcaacatcct tcggggtggt tggggactca ttggagactg ccggggtcaa 600 ctcggaggaa ggtggggacg acgtcaagtc atcatgcccc ttatgtcttg ggctgcaaac 660 atgctacaat ggccggtaca gagggttgcg ataccgcaag gtggagcgaa tccctaaaag 720 ccggtctcag ttcggattgg ggtctgcaac tcgaccccat gaagtcggag tcgctagtaa 780 tcgcagatca gcaacgctgc ggtgaatacg ttcccgggcc ttgtac 826 <210> 79 <211> 799 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 79 cgtaggcggt ttgtcgcgtc tgccgtgaaa gtccggggct caactccgga tctgcggtgg 60 gtacgggcag actagagtga tgtaggggag actggaattc ctggtgtagc ggtgaaatgc 120 gcagatatca ggaggaacac cgatggcgaa ggcaggtctc tgggcattaa ctgacgctga 180 ggagcgaaag catggggagc gaacaggatt agataccctg gtagtccatg ccgtaaacgt 240 tgggcactag gtgtggggga cattccacgt tttccgcgcc gtagctaacg cattaagtgc 300 cccgcctggg gagtacggcc gcaaggctaa aactcaaagg aattgacggg ggcccgcaca 360 agcggcggag catgcggatt aattcgatgc aacgcgaaga accttaccaa ggcttgacat 420 gaaccggaaa cacctggaaa caggtgcccc gcttgcggtc ggtttacagg tggtgcatgg 480 ttgtcgtcag ctcgtgtcgt gagatgttgg gttaagtccc gcaacgagcg caaccctcgt 540 tctatgttgc cagcgcgtta tggcggggac tcataggaga ctgccggggt caactcggag 600 gaaggtgggg acgacgtcaa atcatcatgc cccttatgtc ttgggcttca cgcatgctac 660 aatggccggt acaaagggtt gcgatactgt gaggtggagc taatcccaaa aagccggtct 720 cagttcggat tggggtctgc aactcgaccc catgaagtcg gagtcgctag taatcgcaga 780 tcagcaacgc tgcggtgaa 799 <210> 80 <211> 1250 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 80 tgccagcttg ctggtggatt agtggcgaac gggtgagtaa cacgtgagta acctgccctt 60 aactctggga taagcctggg aaactgggtc taatgccgga tatgactcct catcgcatgg 120 tggggggtgg aaagcttttt gtggttttgg atggactcgc ggcctatcag cttgttggtg 180 aggtaatggc tcaccaaggc gacgacgggt agccggcctg agagggtgac cggccacact 240 gggactgaga cacggcccag acttctacgg gaggcagcag tggggaatat tgcacaatgg 300 gcgaaagcct gatgcagcga cgccgcgtga gggatgacgg ccttcgggtt gtaaacctct 360 ttcagtaggg aagaagcgaa agtgacggta cctgcagaag aagcgccggc taactacgtg 420 ccagcagccg cggtaatacg tagggcgcaa gcgttatccg gaattattgg gcgtaaagag 480 <400> 78 tcgtaggtgg cttgtcacgt cgggtgtgaa agcttggggc ttaactccag gtctgcattc 60 gatacgggct ggctagaggt aggtagggga gaacggaatt cctggtgtag cggtgaaatg 120 cgcagatatc aggaggaaca ccggtggcga aggcggttct ctgggcctta cctgacgctg 180 aggagcgaaa gcgtggggag cgaacaggat tagataccct ggtagtccac gctgtaaacg 240 ttgggcgcta ggtgtgggga ccttccacgg tttccgcgcc gtagctaacg cattaagcgc 300 cccgcctggg gagtacggcc gcaaggctaa aactcaaagg aattgacggg ggcccgcaca 360 agcggcggag catgttgctt aattcgacgc aacgcgaaga accttaccaa ggcttgacat 420 cgcccggaaa gcttcagaga tggagccctc ttcggactgg gtgacaggtg gtgcatggct 480 gtcgtcagct cgtgtcgtga gatgttgggt taagtcccgc aacgagcgca acccttgttc 540 aatgttgcca gcaacatcct tcggggtggt tggggactca ttggagactg ccggggtcaa 600 ctcggaggaa ggtggggacg acgtcaagtc atcatgcccc ttatgtcttg ggctgcaaac 660 atgctacaat ggccggtaca gagggttgcg ataccgcaag gtggagcgaa tccctaaaag 720 ccggtctcag ttcggattgg ggtctgcaac tcgaccccat gaagtcggag tcgctagtaa 780 tcgcagatca gcaacgctgc ggtgaatacg ttcccgggcc ttgtac 826 <210> 79 <211> 799 <212> AD N <213> Organime Unknown <220> <223> Origin of sequence: Floor Organism <400> 79 cgtaggcggt ttgtcgcgtc tgccgtgaaa gtccggggct caactccgga tctgcggtgg 60 gtacgggcag actagagtga tgtaggggag actggaattc ctggtgtagc ggtgaaatgc 120 gcagatatca ggaggaacac cgatggcgaa ggcaggtctc tgggcattaa ctgacgctga 180 ggagcgaaag catggggagc gaacaggatt agataccctg gtagtccatg ccgtaaacgt 240 tgggcactag gtgtggggga cattccacgt tttccgcgcc gtagctaacg cattaagtgc 300 cccgcctggg gagtacggcc gcaaggctaa aactcaaagg aattgacggg ggcccgcaca 360 agcggcggag catgcggatt aattcgatgc aacgcgaaga accttaccaa ggcttgacat 420 gaaccggaaa cacctggaaa caggtgcccc gcttgcggtc ggtttacagg tggtgcatgg 480 ttgtcgtcag ctcgtgtcgt gagatgttgg gttaagtccc gcaacgagcg caaccctcgt 540 tctatgttgc cagcgcgtta tggcggggac tcataggaga ctgccggggt caactcggag 600 gaaggtgggg acgacgtcaa atcatcatgc cccttatgtc ttgggcttca cgcatgctac 660 aatggccggt acaaagggtt gcgatactgt gaggtggagc taatcccaaa aagccggtct 720 cagttcggat tggggtctgc aactcgaccc catgaagtcg gagtcgctag taatcgcaga 780 tca gcaacgc tgcggtgaa 799 <210> 80 <211> 1250 <212> DNA <213> Unknown Organ <220> <223> Origin of the sequence: Soil organism <400> 80 tgccagcttg ctggtggatt agtggcgaac gggtgagtaa cacgtgagta acctgccctt 60 aactctggga taagcctggg aaactgggtc taatgccgga tatgactcct catcgcatgg 120 tggggggtgg aaagcttttt gtggttttgg atggactcgc ggcctatcag cttgttggtg 180 aggtaatggc tcaccaaggc gacgacgggt agccggcctg agagggtgac cggccacact 240 gggactgaga cacggcccag acttctacgg gaggcagcag tggggaatat tgcacaatgg 300 gcgaaagcct gatgcagcga cgccgcgtga gggatgacgg ccttcgggtt gtaaacctct 360 ttcagtaggg aagaagcgaa agtgacggta cctgcagaag aagcgccggc taactacgtg 420 ccagcagccg cggtaatacg tagggcgcaa gcgttatccg gaattattgg gcgtaaagag 480

ctcgtaggcg gtttgtcgcg tctgccgtga aagtccgggg ctcaactccg gatctgcggt 540 gggtacgggc agactagagt gatgtagggg agactggaat tcctggtgta gcggtgaaat 600 gcgcagatat caggaggaac accgatggcg aaggcaggtc tctgggcatt aactgacgct 660 gaggagcgaa agcatgggga gcgaacagga ttagataccc tggtagtcca tgccgtaaac 720 gttgggcact aggtgtgggg gacattccac gttttccgcg ccgtagctaa cgcattaagt 780 gccccgcctg gggagtacgg ccgcaaggct aaaactcaaa ggaattgacg ggggcccgca 840 caagcggcgg agcatgcgga ttaattcga-- gcaacgcgag gaaccttacc aaggcttgac 900 atgaaccgga aatacctgga aacaggtgcc ccgcttgcgg tcggtttaca ggtggtgcat 960 ggttgccgtc agctcgtgtc gtgagatgtt gggttaagtc ccgcaacgag cgcaaccctc 1020 gttctatgtt gccagcgcgt tatggcgggg actcatagga gactgccggg gtcaactcgg 1080 aggaaggtgg ggacgacgtc aaatcatcat gccccttatg tcttgggctt cacgcatgct 1140 acaatggccg gtacaaaggg ttgcgatact gtgaggtgga gctaatccca aaaagccggt 1200 ctcagttcgg attggggtct gcaactcgac cccatgaagt cggagtcgct 1250 <210> 81 <211> 1210 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence .-Organisme du sol <400> 81 cgctaatacc ggatacggcg cgagagtctr cggactttcg cgagaaagat tcgcaaggat 60 cactgaggga cgagcctgcg gcccatcagc tagttggtga ggtaagagct caccaaggct 120 aagacgggta gctggtctga gaggatgatc agccacactg gaactgagac acggtccaga 180 ctcctacggg aggcagcagt ggggaatatt gcgcaatggg cgaaagcctg acgcagccac 240 gccgcgtgag cgatgagggc cttcgggtcg caaagctctg tggggagaga cgaataaggc 300 cggtgaagag tcggccttga cggtatctcc ttagcaagca ccggctaact ccgtgccagc 360 agccgcggta atacggaggg tgcaaacgtt gctcggaatc attgggcgta aagcgcacgt 420 aggcggcgtg ataagttggg tgtgaaagcc ctgggctcaa cccaggaagt gcattcaaaa 480 ctgtcacgct tgaatctcgg agggggtcag agaattcccg gtgtagaggt gaaattcgta 540 gatatcggga ggaataccag tggcgaaggc gctggcctgg acgaagattg acgctgaggt 600 gcgaaagcgc ggggagcaaa caggattaga taccctggta gtccgcgctg taaacgatga 660 gtgctagacg ggggaggtat tgaccccttc gctgccgaag ctaacgcgtt aagcactccg 720 cctggggagt acggtcgcaa gactaaaact caaaggaatt gacgggggcc cgcacaagcg 780 gtggagcatg tggtttaatt cgacgcaacg cgcaaaacct tacctgggtt aaatccgccg 840 gaacctggct gaaaggctgg ggtgccctcc ggggaatcgg tgagaaggtg ctgcatggct 900 gtcgtcagct cgtgtcgtga gatgttgggt taagtcccgc aacgagcgca acccctatcg 960 tcagttgcca acattaaggt gggaactctg gcgagactgc cggtctaaac cggaggaagg 1020 tggggacgac gtcaagtcct catggccctt atgcccaggg ctacacacgt gctacaatgg 1080 ctggtacaat gagccgcaaa accgcgaggt caagctaatc tcaaaaaacc agtctcagtt 1140 cggatcggag tctgcaactc gactccgtga agctggaatc gctagtaatc gaagatcagc 1200 acgctttcgg 1210 <210> 82 <211> 1272 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence : :Organisme du sol <400> 82 gatgccagct tgctggtgga ttagtggcga acgggtgagt aacacgtgag taacctgccc 60 ttaactctgg gataagcctg ggaaactggg tctaataccg gatatgactc ctcatcgcat 120 ggtggggggt ggaaagcttt ttgtggtttt ggatggactc gcggcctatc agcttgttgg 180 tgaggtaatg gctcaccaag gcgacgacgg gtagccggcc tgagagggtg accggccaca 240 ctcgtaggcg gtttgtcgcg tctgccgtga aagtccgggg ctcaactccg gatctgcggt 540 gggtacgggc agactagagt gatgtagggg agactggaat tcctggtgta gcggtgaaat 600 gcgcagatat caggaggaac accgatggcg aaggcaggtc tctgggcatt aactgacgct 660 gaggagcgaa agcatgggga gcgaacagga ttagataccc tggtagtcca tgccgtaaac 720 gttgggcact aggtgtgggg gacattccac gttttccgcg ccgtagctaa cgcattaagt 780 gccccgcctg gggagtacgg ccgcaaggct aaaactcaaa ggaattgacg ggggcccgca 840 caagcggcgg agcatgcgga ttaattcga-- gcaacgcgag gaaccttacc aaggcttgac 900 atgaaccgga aatacctgga aacaggtgcc ccgcttgcgg tcggtttaca ggtggtgcat 960 ggttgccgtc agctcgtgtc gtgagatgtt gggttaagtc ccgcaacgag cgcaaccctc 1020 gttctatgtt gccagcgcgt tatggcgggg actcatagga gactgccggg gtcaactcgg 1080 aggaaggtgg ggacgacgtc aaatcatcat gccccttatg tcttgggctt cacgcatgct 1140 acaatggccg gtacaaaggg ttgcgatact gtgaggtgga gctaatccca aaaagccggt 1200 ctcagttcgg attggggtct gcaactcgac cccatgaagt cggagtcgct 1250 <210> 81 <211> 1210 <212> DNA <213> Unknown Organ <220> <223> Origin of the sequence.-Org anismus soil <400> 81 cgctaatacc ggatacggcg cgagagtctr cggactttcg cgagaaagat tcgcaaggat 60 cactgaggga cgagcctgcg gcccatcagc tagttggtga ggtaagagct caccaaggct 120 aagacgggta gctggtctga gaggatgatc agccacactg gaactgagac acggtccaga 180 ctcctacggg aggcagcagt ggggaatatt gcgcaatggg cgaaagcctg acgcagccac 240 gccgcgtgag cgatgagggc cttcgggtcg caaagctctg tggggagaga cgaataaggc 300 cggtgaagag tcggccttga cggtatctcc ttagcaagca ccggctaact ccgtgccagc 360 agccgcggta atacggaggg tgcaaacgtt gctcggaatc attgggcgta aagcgcacgt 420 aggcggcgtg ataagttggg tgtgaaagcc ctgggctcaa cccaggaagt gcattcaaaa 480 ctgtcacgct tgaatctcgg agggggtcag agaattcccg gtgtagaggt gaaattcgta 540 gatatcggga ggaataccag tggcgaaggc gctggcctgg acgaagattg acgctgaggt 600 gcgaaagcgc ggggagcaaa caggattaga taccctggta gtccgcgctg taaacgatga 660 gtgctagacg ggggaggtat tgaccccttc gctgccgaag ctaacgcgtt aagcactccg 720 cctggggagt acggtcgcaa gactaaaact caaaggaatt gacgggggcc cgcacaagcg 780 gtggagcatg tggtttaatt cgacgcaacg cgcaaaacct tacctgggtt aaatccgccg 84 0 gaacctggct gaaaggctgg ggtgccctcc ggggaatcgg tgagaaggtg ctgcatggct 900 gtcgtcagct cgtgtcgtga gatgttgggt taagtcccgc aacgagcgca acccctatcg 960 tcagttgcca acattaaggt gggaactctg gcgagactgc cggtctaaac cggaggaagg 1020 tggggacgac gtcaagtcct catggccctt atgcccaggg ctacacacgt gctacaatgg 1080 ctggtacaat gagccgcaaa accgcgaggt caagctaatc tcaaaaaacc agtctcagtt 1140 cggatcggag tctgcaactc gactccgtga agctggaatc gctagtaatc gaagatcagc 1200 acgctttcgg 1210 <210> 82 < 211> 1272 <212> DNA <213> Organime Unknown <220> <223> Origin of sequence: soil Organism <400> 82 gatgccagct tgctggtgga ttagtggcga acgggtgagt aacacgtgag taacctgccc 60 ttaactctgg gataagcctg ggaaactggg tctaataccg gatatgactc ctcatcgcat 120 ggtggggggt ggaaagcttt ttgtggtttt ggatggactc gcggcctatc agcttgttgg 180 tgaggtaatg gctcaccaag gcgacgacgg gtagccggcc tgagagggtg accggccaca 240

ctgggactga gacacggccc agactcctac gggaggcagc agtggggaat attgcacaat 300 gggcgaaagc ctgatgcagc gacgccgcgt gagggatgac ggccttcggg ttgtaaacct 360 ctttcagtag ggaagaagcg aaagtgacgg tacctgcaga agaagcgccg gctaactacg 420 tgccagcagc cgcggtaata cgtagggcgc aagcgttatc cggaattatt gggcgtaaag 480 agctcgtagg cggtttgtcg cgtctgccgt gaaagtccgg ggctcaactc cggatctgcg 540 gtgggtacgg gcagactaga gtgatgtagg ggagactgga attcctggtg tagcggtgaa 600 atgcgcagat atcaggagga acaccgatgg cgaaggcagg tctctgggca ttaactgacg 660 ctgaggaacg aaagcatggg gagcgaacag gattagatac cctggtagtc catgccgtaa 720 acgttgggca ctaggtgtgg gggacattcc acgttttccg cgccgtagct aacgcattaa 780 gtgccccgcc tggggagtac ggccgcaagg ctaaaactca aaggaattga cgggggcccg 840 cacaagcggc ggagcatgcg gattaattcg atgcaacgcg aagaacctta ccaaggcttg 900 acatgaaccg gaaatacctg gaaacaggtg ccccgcttgc ggtcggttta caggtggtgc 960 atggttgtcg tcagctcgtg tcgtgagatg ttgggttaag tcccgcaacg agcgcaaccc 1020 tcgttctatg ttgccagcgc gttatggcgg ggactcatag gagactgccg gggtcaactc 1080 ggaggaaggt ggggacgacg tcaaatcatc atgcccctta tgtcttgggc ttcacgcatg 1140 ctacaatggc cggtacaaag ggttgcgata ctgtgaggtg gagctgatcc caaaaagccg 1200 gtcccagttc ggattggggt ctgcaactcg accccatgaa gtcggagtcg ctagtaatcg 1260 cagatcagca ac 1272 <210> 83 <211> 1247 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 83 tgtttagtag caatactaaa tgatgacgag cggcggacgg gtgaggaaca cgtaggaacc 60 tgcccaagag agggggacaa ccaagggaaa ctttggctaa taccgcataa tctctacgga 120 gaaaagttgc ccgtaagggt ggcgcttttg gaggggcctg cgtccgatta gttagttggt 180 gaggtaatag ctcaccaaga ctgtgatcgg taactggtct gagaggacga ccagtcacac 240 tgggactgag acacggccca gactcctacg ggaggcagca gtggggaatc ttggacaatg 300 ggggcaaccc tgatccagcg atgccgcgtg ggtgaagaag gccttcgggt tgtaaagccc 360 tttaggcggg gaagaaggat atgggatgaa taagcctgta ttttgacggt acccgcagaa 420 taagcaccgg caaactctgt gccagcagcc gcggtaatac agagggtgcg agcgttaatc 480 ggatttactg ggcgtaaagg gcgcgtaggc ggttgtgtga gtgtgatgtg aaagccccgg 540 gctcaacctg ggaagtgcat cgcaaacgac acaactggag tatatgagag ggtggcggaa 600 tttccggtgt agcggtgaaa tgcgtagaga tcggaaggaa cgtcgatggc gaaggcagcc 660 acctggcata atactggcgc tgaggcgcga aagcgtgggg agcgaacagg attagatacc 720 ctggtagtca cgcccgtaaa cgatgagaac tagatgttgg agggggaacc cttcagtatc 780 gaagctaacg cgataagttc tccgcctggg aagtacagtc gcaagactga aactcaaaag 840 aattgacggg ggcccgcaca agcggtggag catgtggttt aattcgatgc aacgcgaaga 900 accttacctg cccttgacat cctgcgaatc ttgccgagag gtgagagtgc cgcagggagc 960 gcagagacag gtgctgcatg gctgtcgtca gctcgtgttg tgagatgttg ggttaagtcc 1020 cgtaacgagc gcaacccttg tccttagttg ccatcattta gttggggact ctaaggagac 1080 cgccggtgat gaaccggagg aaggcgggga cgacgtcaag tcatcatggc ctttatgggt 1140 agggctacac acgtgctaca atggggcgta cagagggtcg ccaacccgcg agggggagcc 1200 aatctcttaa agcgtctcgt agtccggart ggagtctgca actcgac 1247 <210> 84 <211> 1292 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol ctgggactga gacacggccc agactcctac gggaggcagc agtggggaat attgcacaat 300 gggcgaaagc ctgatgcagc gacgccgcgt gagggatgac ggccttcggg ttgtaaacct 360 ctttcagtag ggaagaagcg aaagtgacgg tacctgcaga agaagcgccg gctaactacg 420 tgccagcagc cgcggtaata cgtagggcgc aagcgttatc cggaattatt gggcgtaaag 480 agctcgtagg cggtttgtcg cgtctgccgt gaaagtccgg ggctcaactc cggatctgcg 540 gtgggtacgg gcagactaga gtgatgtagg ggagactgga attcctggtg tagcggtgaa 600 atgcgcagat atcaggagga acaccgatgg cgaaggcagg tctctgggca ttaactgacg 660 ctgaggaacg aaagcatggg gagcgaacag gattagatac cctggtagtc catgccgtaa 720 acgttgggca ctaggtgtgg gggacattcc acgttttccg cgccgtagct aacgcattaa 780 gtgccccgcc tggggagtac ggccgcaagg ctaaaactca aaggaattga cgggggcccg 840 cacaagcggc ggagcatgcg gattaattcg atgcaacgcg aagaacctta ccaaggcttg 900 acatgaaccg gaaatacctg gaaacaggtg ccccgcttgc ggtcggttta caggtggtgc 960 atggttgtcg tcagctcgtg tcgtgagatg ttgggttaag tcccgcaacg agcgcaaccc 1020 tcgttctatg ttgccagcgc gttatggcgg ggactcatag gagactgccg gggtcaactc 1080 ggaggaaggt ggggacg acg tcaaatcatc atgcccctta tgtcttgggc ttcacgcatg 1140 ctacaatggc cggtacaaag ggttgcgata ctgtgaggtg gagctgatcc caaaaagccg 1200 gtcccagttc ggattggggt ctgcaactcg accccatgaa gtcggagtcg ctagtaatcg 1260 cagatcagca ac 1272 <210> 83 <211> 1247 <212> DNA <213> Organime Unknown <220> <223> Origin of the sequence : floor Organism <400> 83 tgtttagtag caatactaaa tgatgacgag cggcggacgg gtgaggaaca cgtaggaacc 60 tgcccaagag agggggacaa ccaagggaaa ctttggctaa taccgcataa tctctacgga 120 gaaaagttgc ccgtaagggt ggcgcttttg gaggggcctg cgtccgatta gttagttggt 180 gaggtaatag ctcaccaaga ctgtgatcgg taactggtct gagaggacga ccagtcacac 240 tgggactgag acacggccca gactcctacg ggaggcagca gtggggaatc ttggacaatg 300 ggggcaaccc tgatccagcg atgccgcgtg ggtgaagaag gccttcgggt tgtaaagccc 360 tttaggcggg gaagaaggat atgggatgaa taagcctgta ttttgacggt acccgcagaa 420 taagcaccgg caaactctgt gccagcagcc gcggtaatac agagggtgcg agcgttaatc 480 ggatttactg ggcgtaaagg gcgcgtaggc ggttgtgtga gtgtgatgtg aaagccccgg 540 gctcaacctg ggaagtgcat cgcaaacgac acaactgg ag tatatgagag ggtggcggaa 600 tttccggtgt agcggtgaaa tgcgtagaga tcggaaggaa cgtcgatggc gaaggcagcc 660 acctggcata atactggcgc tgaggcgcga aagcgtgggg agcgaacagg attagatacc 720 ctggtagtca cgcccgtaaa cgatgagaac tagatgttgg agggggaacc cttcagtatc 780 gaagctaacg cgataagttc tccgcctggg aagtacagtc gcaagactga aactcaaaag 840 aattgacggg ggcccgcaca agcggtggag catgtggttt aattcgatgc aacgcgaaga 900 accttacctg cccttgacat cctgcgaatc ttgccgagag gtgagagtgc cgcagggagc 960 gcagagacag gtgctgcatg gctgtcgtca gctcgtgttg tgagatgttg ggttaagtcc 1020 cgtaacgagc gcaacccttg tccttagttg ccatcattta gttggggact ctaaggagac 1080 cgccggtgat gaaccggagg aaggcgggga cgacgtcaag tcatcatggc ctttatgggt 1140 agggctacac acgtgctaca atggggcgta cagagggtcg ccaacccgcg agggggagcc 1200 aatctcttaa agcgtctcgt agtccggart ggagtctgca actcgac 1247 <210> 84 <211> 1292 <212> DNA <213> Organime Unknown <220> <223> Origin of the sequence: Soil organism

<400> 84 ggctcgcaag agcaaccggc gaacgggtgc gtaacacgtg aacaacctgc cctcgtgtgg 60 gggatagccg ggctaacgcc cgggtaatac cgcatacgtt ctctctgggg agtcctgggg 120 agaggaaagc tccggcgcac ggggaggggt tcgcggccta tcagctagtt ggcggggtaa 180 tggcccacca aggcgacgac gggtagctgg tctgagagga tggccagcca cattgggact 240 gagagacggc ccagactcct acgggaggca gcagtgggga atcttgcgca atggccgaaa 300 ggctgacgca gcgacgccgc gtgtgggagg acgcctttcg gggtgtaaac cactgttgcc 360 cgggacgaac agcctctttc gagaggtctg acggtaccgg gtgaggaagc accggctaac 420 tccgtgccag cagccgcggt aatacggagg gtgcgagcgt tgtccggaat cattgggcgt 480 aaagggcgcg taggtggccc ggtcagttcg tggtgaaagc gcggggctca accctgcgtc 540 ggccatgaat actgccgcgg ctggagcact gtagaggcag gcggaattcc gggtgtagcg 600 gtggaatgcg tagagatccg gaagaacacc ggtggcgaag gcggcctgct gggcagtagc 660 tgacactgag gcgcgacagc gtggggagca aacaggatta gataccctgg tagtccacgc 720 cgtaaacgat gggcactagg cgcttggggg agcgaccccc cgagggccgg cgctaacgca 780 ttaagtgccc cgcctgggga gtacggccgc aaggctgaaa ctcaaaggaa ttgacggggg 840 cccgcacaag cggtggagca tgtggtttaa ttcgacgcaa cgcgaagaac cttacctagg 900 cttgacatac acgggaaacc ggtcagaaac ggccggccct cttcggagcc cgtgcacagg 960 tgctgcatgg ctgtcgtcag ctcgtgtcgt gagatgttgg gttaagtccc gcaacgagcg 1020 caacccctgt ctctagttgc cagcgcgtca tggcggggac tctagagaga ctgccggtgc 1080 caaaccggag gaaggtgggg atgacgtcaa gtcatcatgg tccttacgtc tagggctaca 1140 cacgtgctac aatggcgggg acagagggtc gcgagccggc aacggcaagc caatcccgta 1200 aaccccgcct cagttcggat tgtcgtctgc aactcgacgg catgaagctg gaatcgctag 1260 taatcgtgga tcagctacgc cacggtgaat ac 1292 <210> 85 <211> 1300 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence : :Organisme du sol <400> 85 tcccttcggg agcaagtaca gcggcgaacg ggtgagtaac acgtaggtaa cctaccctgg 60 agactgggat aacctgccga aaggcgggct aataccagat aagaccacga gggctgcggc 120 ccttggggca aaaggtggcc tctacttgta agctaccact ccgggatggg cctgcgcgcc 180 attagctagt tggcggggta acggcccacc aaggcagaga tggctagctg gtctgagagg 240 atggccagcc acacagggac tgagacacgg cccagactcc tacgggaggc agcagtgggg 300 aatattgcgc aatgggcgaa agcctgacgc agcgacgccg cgtgggtgat gaaggccttc 360 gggtcgtaaa gccctgtcaa gagggacgaa accttgtcga cctaacacgt cggcaacctg 420 acggtacctc tgaaggaagc accggctaac tccgtgccag cagccgcggt aatacggagg 480 gtgcgagcgt tgttcggaat tactgggcgt aaagcgcgtg taggcggcct cttcagtctg 540 gtgtgaaagc ccggggctca accccggaag tgcattggat actgggaggc tggagtaccg 600 gagaggaggg tggaattcct ggtgtagcgg tgaaatgcgt agatatcagg aggaacacct 660 gtggcgaagg cggccctctg gacggatact gacgctgaga cgcgaaagcg tggggagcaa 720 acaggattag ataccctggt agtccacgct gtaaacgatg ggcactaggt gttcggggta 780 ttgaccccct gagtgccgca gctaacgcat taagtgcccc gcctggggaa tacggccgca 840 aggttaaaac tcaaaggaat gacgggggc ccgcacaagc ggtggagcat gtggtttaat 900 tcgacgcaac gcgaagaacc ttacctgggc tagacaacat cggacagcct cagaaatgag 960 gtctccccgc aaggggccgg tggttcaggt gctgcatggc tgtcgtcagc tcgtgtcgtg 1020 agatgttggg ttaagtcccg caacgagcgc aacccctgtc tctagttgct accattcagt 1080 tgagcactct agagagactg cccngtgtta aacgggagga aggtggggac gacgtcaagt 1140 cctcatggcc cttatgtcca gggctacaca cgtgctacaa tgggcgatac aaagggctgc 1200 gaacccgcga ggggaagcca atcccaaaaa gtcgctctca gttcggattg gagtctgcaa 1260 ctcgactcca tgaaggcgga atcgctagta atcgcggatc 1300 <210> 86 <211> 1186 <400> 84 ggctcgcaag agcaaccggc gaacgggtgc gtaacacgtg aacaacctgc cctcgtgtgg 60 gggatagccg ggctaacgcc cgggtaatac cgcatacgtt ctctctgggg agtcctgggg 120 agaggaaagc tccggcgcac ggggaggggt tcgcggccta tcagctagtt ggcggggtaa 180 tggcccacca aggcgacgac gggtagctgg tctgagagga tggccagcca cattgggact 240 gagagacggc ccagactcct acgggaggca gcagtgggga atcttgcgca atggccgaaa 300 ggctgacgca gcgacgccgc gtgtgggagg acgcctttcg gggtgtaaac cactgttgcc 360 cgggacgaac agcctctttc gagaggtctg acggtaccgg gtgaggaagc accggctaac 420 tccgtgccag cagccgcggt aatacggagg gtgcgagcgt tgtccggaat cattgggcgt 480 aaagggcgcg taggtggccc ggtcagttcg tggtgaaagc gcggggctca accctgcgtc 540 ggccatgaat actgccgcgg ctggagcact gtagaggcag gcggaattcc gggtgtagcg 600 gtggaatgcg tagagatccg gaagaacacc ggtggcgaag gcggcctgct gggcagtagc 660 tgacactgag gcgcgacagc gtggggagca aacaggatta gataccctgg tagtccacgc 720 cgtaaacgat gggcactagg cgcttggggg agcgaccccc cgagggccgg cgctaacgca 780 ttaagtgccc cgcctgggga gtacggccgc aaggctgaaa ctcaaaggaa ttgacggggg 840 cccgcacaag c ggtggagca tgtggtttaa ttcgacgcaa cgcgaagaac cttacctagg 900 cttgacatac acgggaaacc ggtcagaaac ggccggccct cttcggagcc cgtgcacagg 960 tgctgcatgg ctgtcgtcag ctcgtgtcgt gagatgttgg gttaagtccc gcaacgagcg 1020 caacccctgt ctctagttgc cagcgcgtca tggcggggac tctagagaga ctgccggtgc 1080 caaaccggag gaaggtgggg atgacgtcaa gtcatcatgg tccttacgtc tagggctaca 1140 cacgtgctac aatggcgggg acagagggtc gcgagccggc aacggcaagc caatcccgta 1200 aaccccgcct cagttcggat tgtcgtctgc aactcgacgg catgaagctg gaatcgctag 1260 taatcgtgga tcagctacgc cacggtgaat ac 1292 <210> 85 <211> 1300 <212> DNA <213> Unknown Organ <220> <223> Origin of the sequence:: Soil organism <400> 85 tcccttcggg agcaagtaca gcggcgaacg ggtgagtaac acgtaggtaa cctaccctgg 60 agactgggat aacctgccga aaggcgggct aataccagat aagaccacga gggctgcggc 120 ccttggggca aaaggtggcc tctacttgta agctaccact ccgggatggg cctgcgcgcc 180 attagctagt tggcggggta acggcccacc aaggcagaga tggctagctg gtctgagagg 240 atggccagcc acacagggac tgagacacgg cccagactcc tacgggaggc agcagtgggg 300 aatattgcg c aatgggcgaa agcctgacgc agcgacgccg cgtgggtgat gaaggccttc 360 gggtcgtaaa gccctgtcaa gagggacgaa accttgtcga cctaacacgt cggcaacctg 420 acggtacctc tgaaggaagc accggctaac tccgtgccag cagccgcggt aatacggagg 480 gtgcgagcgt tgttcggaat tactgggcgt aaagcgcgtg taggcggcct cttcagtctg 540 gtgtgaaagc ccggggctca accccggaag tgcattggat actgggaggc tggagtaccg 600 gagaggaggg tggaattcct ggtgtagcgg tgaaatgcgt agatatcagg aggaacacct 660 gtggcgaagg cggccctctg gacggatact gacgctgaga cgcgaaagcg tggggagcaa 720 acaggattag ataccctggt agtccacgct gtaaacgatg ggcactaggt gttcggggta 780 ttgaccccct gagtgccgca gctaacgcat taagtgcccc gcctggggaa tacggccgca 840 aggttaaaac tcaaaggaat gacgggggc ccgcacaagc ggtggagcat gtggtttaat 900 tcgacgcaac gcgaagaacc ttacctgggc tagacaacat cggacagcct cagaaatgag 960 gtctccccgc aaggggccgg tggttcaggt gctgcatggc tgtcgtcagc tcgtgtcgtg 1020 agatgttggg ttaagtcccg caacgagcgc aacccctgtc tctagttgct accattcagt 1080 tgagcactct agagagactg cccngtgtta aacgggagga aggtggggac gacgtcaagt 1140 cctcatggcc cttatgtcca gggct acaca cgtgctacaa tgggcgatac aaagggctgc 1200 gaacccgcga ggggaagcca atcccaaaaa gtcgctctca gttcggattg gagtctgcaa 1260 ctcgactcca tgaaggcgga atcgctagta atcgcggatc 1300 <210> 86 <211> 1186

<212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 86 caatgggcag cggcggacgg gtgagtaaca cgtgggaatg tacctttcgg tgcggaacaa 60 ctcagggaaa cttgagctaa tgccgcatac gcccttacgg ggaaagattt atcgccgaaa 120 gatcagcccg cgttggatta gctagttggt gaggtaatgg cccaccaagg cgacgatcca 180 tagctggttt gagagaacga ccagcctcac tgggactgag acacggccca gactcctacg 240 ggaggcagca gttgggaatc ttggacaatg ggggaaaccc tgatccagcc atgccgcgtg 300 agtgatgaag gccttcgggt tgtaaaactc tttcgacggg gacgataatg acggtacccg 360 tagaagaagc tccggctaac ttcgtgccag cagccgcggt aatacgaagg gggctagcgt 420 tgttcggaat tactgggcgt aaagcgtgcg caggcggcta tccaagtcag tggtgaaagc 480 ccggagctca actccggaat tgccattgaa actgtttagc ttgagtacga gagaggtgag 540 tggaataccc agtgtagagg tgaaattcgt agatattggg tagaacaccg gtggcgaagg 600 cggctcactg gctcgtaact gacgctcagg cacgacagcg tggggatcaa acaggattag 660 ataccctggt agtccacgcc gtaaacgatg aacgctagcc gttggatagc ttgctattca 720 gtggcgcagc taacgcatta agcgttccgc ctggggagta cggccgcaag gttgagactc 780 agaggaattg acgggggccc gcacaagcgg tggagcatgt ggtttaattc gacgcaacgc 840 gcagaacctt accagggttt gacatcctgt gctcgccggt gaaagccggt tttcccgcaa 900 gggacgcaga gacaggtgct gcatggctgt cgtcagctcg tgtcgtgaga tgttgggtta 960 agtcccgcaa cgagcgcaac cctcgccttt agttgccatc attcagttgg gcactctaga 1020 gggaccgccg gcgacaagcc ggaggaaggt ggggatgacg tcaagtcccc atggccctta 1080 caccctgggc tacacacgtg ctacaatggc ggtgacagtg ggcacgagct cgcgagagtc 1140 agctaatccc aaaaaaccgt cccagttcag attgcactct gcaact 1186 <210> 87 <211> 1454 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 87 cgacggccag tgaattgtaa tacgactcac tatagggcga attgggccct ctagatgcat 60 gctcgagcgg ccgccagtgt gatggatatc tgcagaattc gcccttcagg cctaacacat 120 gcaagtcgag cgagaaaggg cgcttcggcg cctgagtaca gcggcgcacg ggtgcgtaac 180 acgtgggcaa tctgtccttg agatggggat aacccagcga aagttgggct aataccgaat 240 aagactacag gaggcaactc ccgtggttaa agggtgctct ctgcggggag catgcgcttg 300 aggaggagcc cgcggcctat cagctagttg gtagggtcac ggcctaccaa ggcgaagacg 360 ggtagctggt ctgagaggat gaccagccac acggggactg agacacggcc ccgactccta 420 cgggaggcag cagtggggaa tattgggcaa tgggggaaac cctgacccag cgacgccgcg 480 tgggtgatga aggccttcgg gtcgtaaagc cctgtcgggc ggaacgaagg ttctcacggc 540 aaatagccgt gagaggtgac ggtaccgccg aaggaagcac cggccaactc cgtgccagca 600 gccgcggtaa gacggagggt gcaagcgttg ctcggaatca ctgggcgtaa agggtgcgta 660 ggcggtctcg caagtctggc gtgaaagccc aaggctcagc cttggaagtg cgctcgaaac 720 tgcgaggctg gagtgccgga ggggagagtg gaattcccgg tgtagcggtg aaatgcgtag 780 agatcgggag gaataccggt ggcgaaagcg actctctgga cggcaactga cgctgaggca 840 cgaaagcgtg gggagcaaac aggattagat accctggtag tccacgccgt aaacgatgga 900 cactaggtgt cgggggtatc cactccctcg gtgccgccgc taacgcagta agtgtcccgc 960 ctgggaagta cggtcgcaag attaaaactc aaaggaattg acgggggccc gcacaagcgg 1020 tggagcatgt ggttcaattc gatgcaacgc gaagaacctt acctgggttt gacatctggc 1080 gaatctctgg gaaaccagag agtgcccgca ggggagcgcc aagacaggtg ctgcatggct 1140 gtcgtcagct cgtgccgtga ggtgttgggt taagtcccgc aacgagcgca acccttaccc 1200 ttagttgccc ccgggtcaag ccgtggcact ccaagggaac tgcccgtgtt aagcgggagg 1260 aaggtgggga cgacgtcaag tcatcatggc ctttatatcc agggctacac acgtgctaca 1320 <212> DNA <213> Organime Unknown <220> <223> Origin of sequence: Floor Organism <400> 86 caatgggcag cggcggacgg gtgagtaaca cgtgggaatg tacctttcgg tgcggaacaa 60 ctcagggaaa cttgagctaa tgccgcatac gcccttacgg ggaaagattt atcgccgaaa 120 gatcagcccg cgttggatta gctagttggt gaggtaatgg cccaccaagg cgacgatcca 180 tagctggttt gagagaacga ccagcctcac tgggactgag acacggccca gactcctacg 240 ggaggcagca gttgggaatc ttggacaatg ggggaaaccc tgatccagcc atgccgcgtg 300 agtgatgaag gccttcgggt tgtaaaactc tttcgacggg gacgataatg acggtacccg 360 tagaagaagc tccggctaac ttcgtgccag cagccgcggt aatacgaagg gggctagcgt 420 tgttcggaat tactgggcgt aaagcgtgcg caggcggcta tccaagtcag tggtgaaagc 480 ccggagctca actccggaat tgccattgaa actgtttagc ttgagtacga gagaggtgag 540 tggaataccc agtgtagagg tgaaattcgt agatattggg tagaacaccg gtggcgaagg 600 cggctcactg gctcgtaact gacgctcagg CACGACAGCG TAGGAGACCA TAGGAGACTA TAGGAGAGAG TAGGAGAGAG TAGGAGATAG TAGGAGAG TAGGAGAG TAGGAGAG TAGGAGAG TAGGAGAG TAGGAGAG TAGGAGAG TAGGAGAG TAGGAGAG TAGGAGAG TAGGAGAG TAGGAGAG TAGGAGAG TAGGAGAG TAGGAGAG TAGGAGAG 780 agaggaattg acgggggccc gcacaagcgg tggagcatgt ggtttaattc gacgcaacgc 840 gcagaacctt accagggttt gacatcctgt gctcgccggt gaaagccggt tttcccgcaa 900 gggacgcaga gacaggtgct gcatggctgt cgtcagctcg tgtcgtgaga tgttgggtta 960 agtcccgcaa cgagcgcaac cctcgccttt agttgccatc attcagttgg gcactctaga 1020 gggaccgccg gcgacaagcc ggaggaaggt ggggatgacg tcaagtcccc atggccctta 1080 caccctgggc tacacacgtg ctacaatggc ggtgacagtg ggcacgagct cgcgagagtc 1140 agctaatccc aaaaaaccgt cccagttcag attgcactct gcaact 1186 < 210> 87 <211> 1454 <212> DNA <213> Organism Unknown <220> <223> Origin of the sequence: Soil organism <400> 87 cgacggccag tgaattgtaa tacgactcac tatagggcga attgggccct ctagatgcat 60 gctcgagcgg ccgccagtgt gatggatatc tgcagaattc gcccttcagg cctaacacat 120 gcaagtcgag cgagaaaggg cgcttcggcg cctgagtaca gcggcgcacg ggtgcgtaac 180 acgtgggcaa tctgtccttg agatggggat aacccagcga aagttgggct aataccgaat 240 aagactacag gaggcaactc ccgtggttaa agggtgctct ctgcggggag catgcgcttg 300 aggaggagcc cgcggcctat cagctagttg gtagggtcac ggcctac caa ggcgaagacg 360 ggtagctggt ctgagaggat gaccagccac acggggactg agacacggcc ccgactccta 420 cgggaggcag cagtggggaa tattgggcaa tgggggaaac cctgacccag cgacgccgcg 480 tgggtgatga aggccttcgg gtcgtaaagc cctgtcgggc ggaacgaagg ttctcacggc 540 aaatagccgt gagaggtgac ggtaccgccg aaggaagcac cggccaactc cgtgccagca 600 gccgcggtaa gacggagggt gcaagcgttg ctcggaatca ctgggcgtaa agggtgcgta 660 ggcggtctcg caagtctggc gtgaaagccc aaggctcagc cttggaagtg cgctcgaaac 720 tgcgaggctg gagtgccgga ggggagagtg gaattcccgg tgtagcggtg aaatgcgtag 780 agatcgggag gaataccggt ggcgaaagcg actctctgga cggcaactga cgctgaggca 840 cgaaagcgtg gggagcaaac aggattagat accctggtag tccacgccgt aaacgatgga 900 cactaggtgt cgggggtatc cactccctcg gtgccgccgc taacgcagta agtgtcccgc 960 ctgggaagta cggtcgcaag attaaaactc aaaggaattg acgggggccc gcacaagcgg 1020 tggagcatgt ggttcaattc gatgcaacgc gaagaacctt acctgggttt gacatctggc 1080 gaatctctgg gaaaccagag agtgcccgca ggggagcgcc aagacaggtg ctgcatggct 1140 gtcgtcagct cgtgccgtga ggtgttgggt taagtcccgc aacgagcgca acccttaccc 12 00 ttagttgccc ccgggtcaag ccgtggcact ccaagggaac tgcccgtgtt aagcgggagg 1260 aaggtgggga cgacgtcaag tcatcatggc ctttatatcc agggctacac acgtgctaca 1320

atggctggga canagcgtgg ccaacgcgcg agcgggagct aatcgcaaaa ccccagcctc 1380 agttcggatc ggagtctgca actcgactcc gtgaagctgg aatcgctagt aatcgcggat 1440 cagcatgccg cggt 1454 <210> 88 <211> 1307 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 88 cccttcgggg agcgagtaca gcggcgaacg ggtgagtaac acgtaggtaa cctaccctgg 60 tgactgggat aacttgccga aaggcgggct aataccagat aagaccacga gggctgcggc 120 ctttggggta aaagatggcc tctgcttgca tgctatcacg ccgggatggg cctgcgcgcc 180 attagctagt tggtgaggta acggctcacc aaggcagaga tggctagctg gtctgagagg 240 atggccagcc acactgggac tgagacacgg cccagactcc tacgggaggc agcagtgggg 300 aatattgcgc aatgggcgaa agcctgacgc agcgacgccg cgtgggtgat gaaggccttc 360 gggtcgtaaa gccctgtcaa gagggacgaa acctcgccga cccaatacgt cggcgacctg 420 acggtacctc tgaaggaagc accggctaac tccgtgccag cagccgcggt aatacggagg 480 gtgcaagcgt tgttcggaat cactgggcgt aaagcgcgtg taggcggcct tcttagtctg 540 gtgtgaaagc ccggggctca accccggaag agcattggat actggaaggc tggagtaccg 600 gagaggaggg tggaattcct ggtgtagcgg tgaaatgcgt agatatcagg aggaacaccg 660 gtggcgaagg cggccctctg gacggatact gacgctgaga cgcgacagcg tggggagcaa 720 acaggattag ataccctggt agtccacgcc gtaaacgatg ggtactaggt gttcggggta 780 ttgaccccct gagtgccgca gctaacgcat taagtacccc gcctggggac tacggccgca 840 aggctaaaac tcaaaggaat tgacgggggc ccgcacaagc ggtggagcat gtggtttaat 900 tcgacgcaac gcgaagaacc ttacctgggc tagacaacac tggacagccc cagaaatggg 960 gtcttcccgc aagggactgg tggttcaggt gctgcatggc tgtcgtcagc tcgtgtcgtg 1020 agatgttggg ttaagtcccg caacgagcgc aacccctgtc tctagttgct accattaagt 1080 tgagcactct agagagactg cccgtgttaa acgggaggaa ggtggggacg acgtcaagtc 1140 ctcatggccc ttatgtccag ggctacacac gtgctacaat ggacagtaca aagggctgcg 1200 aacccgtgag ggggagccaa tcccaaaaag ctgttctcag ttcggattgg agtctgcaac 1260 tcgactccat gaaggcggaa tcgctagtaa tcgcggaca gcatgcc 1307 <210> 89 <211> 1305 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 89 gggagcaatc cccaagtaga gcggcgaacg ggtgagtaac gcgtgggtaa tctgcctccg 60 agtggggaac aacatcggga aactggtgct aataccgcat aacatcgttg ggtcttcgga 120 tctgacgatc aaagccgggg accgcaaggc ctggcgcttg gagaggagcc cgcgtccgat 180 tagctagttg gtggggtaat ggcccaccaa ggcttcgatc ggtagccggc ctgagagggc 240 ggacggccac actgggactg agacacggcc cagactccta cgggaggcag cagtggggaa 300 tttttcgcaa tgggcgaaag cctgacgaag caacgccgcg tggaggatga gggccttcgg 360 gtcgtaaact cctgtcgacc gggacgaaag taggatggcc taatacgccg atctattgac 420 tgtaccggtg gaggaagcca cggctaactc tgtgccagca gccgcggtaa tacagaggtg 480 gcaagcgttg ttcggaatta ctgggcgtaa agggcgcgta ggcggcttgg tcagtcccgt 540 gtgaaatccc tcggctcaac tgaggaactg cacgggaaac tgcctggctt gagttcggga 600 gagggaagtg gaattccggg tgtagcggtg aaatgcgtag atatccggag gaacaccggt 660 ggcgaaggcg gcttcctgga ccgacactga cgctgaggcg cgaaagctag gggagcaaac 720gggattagat accccggtag tcctagctgt aaacgatgag tgctgggtgt agggggtatc 780 atggctgggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggggg > 88 cccttcgggg agcgagtaca gcggcgaacg ggtgagtaac acgtaggtaa cctaccctgg 60 tgactgggat aacttgccga aaggcgggct aataccagat aagaccacga gggctgcggc 120 ctttggggta aaagatggcc tctgcttgca tgctatcacg ccgggatggg cctgcgcgcc 180 attagctagt tggtgaggta acggctcacc aaggcagaga tggctagctg gtctgagagg 240 atggccagcc acactgggac tgagacacgg cccagactcc tacgggaggc agcagtgggg 300 aatattgcgc aatgggcgaa agcctgacgc agcgacgccg cgtgggtgat gaaggccttc 360 gggtcgtaaa gccctgtcaa gagggacgaa acctcgccga cccaatacgt cggcgacctg 420 acgtacctc tgaaggaagc accggctaac tccgtgccag cagccgcggt aatacggagg 480gtgcaagcgt tgttcggaat cactgggcgt aaagcgcgtg taggcggcct tcttagtctg 540gtgtgaaagc ccggggctca accccggaag agcattgat actggaaggc tggagtaccg 600 gagaggaggg tggaattcct ggtgtagcgg tgaaatgcgt agatatcagg aggaacaccg 660 gtggcgaagg cggccctctg gacggatact gacgctgaga cgcgacagcg tggggagcaa 720 acaggattag ataccctggt agtccacgcc gtaaacgatg ggtactaggt gttcggggta 780 ttgaccccct gagtgccgca gctaacgcat taagtacccc gcctggggac tacggccgca 840 aggctaaaac tcaaaggaat tgacgggggc ccgcacaagc ggtggagcat gtggtttaat 900 tcgacgcaac gcgaagaacc ttacctgggc tagacaacac tggacagccc cagaaatggg 960 gtcttcccgc aagggactgg tggttcaggt gctgcatggc tgtcgtcagc tcgtgtcgtg 1020 agatgttggg ttaagtcccg caacgagcgc aacccctgtc tctagttgct accattaagt 1080 tgagcactct agagagactg cccgtgttaa acgggaggaa ggtggggacg acgtcaagtc 1140 ctcatggccc ttatgtccag ggctacacac gtgctacaat ggacagtaca aagggctgcg 1200 aacccgtgag ggggagccaa tcccaaaaag ctgttctcag ttcggattgg agtctgcaac 1260 tcgactccat gaaggcggaa tcgctagtaa tcgcggaca gcatgcc 1307 <210> 89 <211> 1305 <212> DNA <213> Organime Unknown <220 > <223> Origin of the sequence: Soil organism <400> 89 gggagcaatc cccaagtaga gcggcgaacg ggtgagtaac gcgtgggtaa tctgcctccg 60 agtggg GAAC aacatcggga aactggtgct aataccgcat aacatcgttg ggtcttcgga 120 tctgacgatc aaagccgggg accgcaaggc ctggcgcttg gagaggagcc cgcgtccgat 180 tagctagttg gtggggtaat ggcccaccaa ggcttcgatc ggtagccggc ctgagagggc 240 ggacggccac actgggactg agacacggcc cagactccta cgggaggcag cagtggggaa 300 tttttcgcaa tgggcgaaag cctgacgaag caacgccgcg tggaggatga gggccttcgg 360 gtcgtaaact cctgtcgacc gggacgaaag taggatggcc taatacgccg atctattgac 420 tgtaccggtg gaggaagcca cggctaactc tgtgccagca gccgcggtaa tacagaggtg 480 gcaagcgttg ttcggaatta ctgggcgtaa agggcgcgta ggcggcttgg tcagtcccgt 540 gtgaaatccc tcggctcaac tgaggaactg cacgggaaac tgcctggctt gagttcggga 600 gagggaagtg gaattccggg tgtagcggtg aaatgcgtag atatccggag gaacaccggt 660 ggcgaaggcg gcttcctgga ccgacactga cgctgaggcg cgaaagctag gggagcaaac 720gggattagat accccggtag tcctagctgt aaacgatgag tgctgggtgt agggggtatc 780

aaccccccct gtgccgaagc taacgcatta agcactccgc ctggggagta cggtcgcaag 840 gctgaaactc aaaggaattg acgggggccc gcacaagcgg tggagcatgt ggttcaattc 900 gacgcaacgc gaagaacctt accggggttt gaactgtacg ggacagctct agagatagag 960 tcttccttcg ggacccgtac agaggtgctg catggctgtc gtcagctcgt gtcgtgagat 1020 gttgggttaa gtcccgcaac gagcgcaacc cttgcctcct gttgccatca ggtaaagctg 1080 ggcactctgg agagactgcc ggtgataaac cggaggaagg tggggatgac gtcaagtcct 1140 catggccttt atgccccggg ctacacacgt gctacaatgg ccggtacaaa gggtcgcaaa 1200 accgcgaggt ggagctaatc ccaaaaagcc ggtcccagtt cggattgcag tctgcaactc 1260 gactgcatga agttggaatc gctagtaatc gcggatcagc atgcc 1305 <210> 90 <211> 1299 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 90 gggctttcgg gtcctgagta aagtggcgaa cgggtgagta acgcgtaggt aacctgacct 60 cgagtgtgga ataacctggc gaaagccggg ctaataccgc atgacgtctt cgggtcttcg 120 gacttgagga ccaaaggtgg cgagctttga gcgctgtcgc tcgagaaggg gcctgcgtcc 180 cattagctag ttggtgggg gatggcctac caaggcgacg atgggtagcc gggctgagag 240 gctgtccggc cacactggaa ccgagacacg gtccagactc ctacgggagg cagcagtggg 300 gaatcttgcg caatggggga aaccctgacg caacgacgcc gcgtgggcga tgaaggcctt 360 cgggtcgtaa agccctgtcg agcgggacga accgtgcgag ctctaacata gctcgtgcct 420 gacggtaccg ctagaggaag ccccggctaa ctccgtgcca gcagccgcgg taatacggag 480 ggggctagcg ttattcggaa ttattgggcg taaagggcgt gtaggcggct ctgtgtgtcc 540 catgtgaaag ccctcggctc aaccggggaa ctgcatggga aactgcggag cttgagtccg 600 ggagaggtga gtggaattcc cagtgtagcg gtgaaatgcg tagatattgg gaggaacacc 660 agtggcgaag gcggctcac ggaccggtac tgacgctgag acgcgaaagc caggggagca 720 aacgggatta gataccccgg tagtcctggc tgtaaacgat gagcacttgg tgtggcgggt 780 atcgacccct gccgtgctga agctaacgca ttaagtgctc cgcctgggga gtacggccgc 840 aaggctgaaa ctcaaaggaa ttgacggggg cccgcacaag cggtggagca tgtggttcaa 900 ttcgacgcaa cgcgaagaac cttacctggg tttgaactgc aggtgacagc ccctgaaagg 960 gggtcttcct tcgggacacc tgtagaggtg ccgcatggct gtcgtcagct cgtgtcgtga 1020 gatgttgggt taagtcccgc aacgagcgca acccctactc ctagttgcca gcggctcggc 1080 cgggaactct agggggaccg ccggtgataa accggaggaa ggtggggatg acgtcaagtc 1140 ctcatggcct ttatgtccag ggctacacac gtgctacaac ggacggtaca aagggctgcg 1200 aaggcgcgag ccggagccaa tcccaaaaag ccgttctcca gtgcggattg cagtctgcaa 1260 ctcgactgca tgaaggtgga atcgctagta atcgcggat 1299 <210> 91 <211> 1296 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 91 atgtctggta gcaataccag atgatggcaa gtggcggacg ggtgagtaat acgtagggat 60 ctgcccagaa gagggggaca acccggggaa actcgggcta ataccgcata ctattctgag 120 gaagaaagct tggcgcaagc caggcgcttt tggaggaacc tacgtccgat tagctagttg 180 gtgaggtaaa ggctcaccaa ggcagagatc ggtagctggt ctgagaggat gaccagccac 240 actgggactg agacacggcc cagactccta cgggaggcag cagtggggaa tattggacaa 300 tgggggcaac cctgatccag cgatgccgcg tgtgtgaaga aggccttcgg gttgtaaagc 360 actttagttg gggaagaagt aatgtttttt aaLagagagc attgttgacg gtacccaaag 420 aaccccccct gtgccgaagc taacgcatta agcactccgc ctggggagta cggtcgcaag 840 gctgaaactc aaaggaattg acgggggccc gcacaagcgg tggagcatgt ggttcaattc 900 gacgcaacgc gaagaacctt accggggttt gaactgtacg ggacagctct agagatagag 960 tcttccttcg ggacccgtac agaggtgctg catggctgtc gtcagctcgt gtcgtgagat 1020 gttgggttaa gtcccgcaac gagcgcaacc cttgcctcct gttgccatca ggtaaagctg 1080 ggcactctgg agagactgcc ggtgataaac cggaggaagg tggggatgac gtcaagtcct 1140 catggccttt atgccccggg ctacacacgt gctacaatgg ccggtacaaa gggtcgcaaa 1200 accgcgaggt ggagctaatc ccaaaaagcc ggtcccagtt cggattgcag tctgcaactc 1260 gactgcatga agttggaatc gctagtaatc gcggatcagc atgcc 1305 <210> 90 <211> 1299 <212> DNA <213> Unknown Organ <220> <223> Origin of the sequence: Soil organism <400> 90 gggctttcgg gtcctgagta aagtggcgaa cgggtgagta acgcgtaggt aacctgacct 60 cgagtgtgga ataacctggc gaaagccggg ctaataccgc atgacgtctt cgggtcttcg 120 gacttgagga ccaaaggtgg cgagctttga gcgctgtcgc tcgagaaggg gcctgcgtcc 180 cattagctag ttggtgggg gatggcctac caaggcgacg atgggtagcc g ggctgagag 240 gctgtccggc cacactggaa ccgagacacg gtccagactc ctacgggagg cagcagtggg 300 gaatcttgcg caatggggga aaccctgacg caacgacgcc gcgtgggcga tgaaggcctt 360 cgggtcgtaa agccctgtcg agcgggacga accgtgcgag ctctaacata gctcgtgcct 420 gacggtaccg ctagaggaag ccccggctaa ctccgtgcca gcagccgcgg taatacggag 480 ggggctagcg ttattcggaa ttattgggcg taaagggcgt gtaggcggct ctgtgtgtcc 540 catgtgaaag ccctcggctc aaccggggaa ctgcatggga aactgcggag cttgagtccg 600 ggagaggtga gtggaattcc cagtgtagcg gtgaaatgcg tagatattgg gaggaacacc 660 agtggcgaag gcggctcac ggaccggtac tgacgctgag acgcgaaagc caggggagca 720 aacgggatta gataccccgg tagtcctggc tgtaaacgat gagcacttgg tgtggcgggt 780 atcgacccct gccgtgctga agctaacgca ttaagtgctc cgcctgggga gtacggccgc 840 aaggctgaaa ctcaaaggaa ttgacggggg cccgcacaag cggtggagca tgtggttcaa 900 ttcgacgcaa cgcgaagaac cttacctggg tttgaactgc aggtgacagc ccctgaaagg 960 gggtcttcct tcgggacacc tgtagaggtg ccgcatggct gtcgtcagct cgtgtcgtga 1020 gatgttgggt taagtcccgc aacgagcgca acccctactc ctagttgcca gcggctcggc 1080 cggga actct agggggaccg ccggtgataa accggaggaa ggtggggatg acgtcaagtc 1140 ctcatggcct ttatgtccag ggctacacac gtgctacaac ggacggtaca aagggctgcg 1200 aaggcgcgag ccggagccaa tcccaaaaag ccgttctcca gtgcggattg cagtctgcaa 1260 ctcgactgca tgaaggtgga atcgctagta atcgcggat 1299 <210> 91 <211> 1296 <212> DNA <213> Organime Unknown <220> <223> Origin of the sequence: floor Organism <400> 91 atgtctggta gcaataccag atgatggcaa gtggcggacg ggtgagtaat acgtagggat 60 ctgcccagaa gagggggaca acccggggaa actcgggcta ataccgcata ctattctgag 120 gaagaaagct tggcgcaagc caggcgcttt tggaggaacc tacgtccgat tagctagttg 180 gtgaggtaaa ggctcaccaa ggcagagatc ggtagctggt ctgagaggat gaccagccac 240 actgggactg agacacggcc cagactccta cgggaggcag cagtggggaa tattggacaa 300 tgggggcaac cctgatccag cgatgccgcg tgtgtgaaga aggccttcgg gttgtaaagc 360 actttagttg gggaagaagt aatgtttttt aaLagagagc attgttgacg gtacccaaag 420

aataagcacc ggctaactct gtgccagcag ccgcggtaat acagagggtg caagcgttaa 480 tcggagttac tgggcgtaaa gggcgcgtag gcggtgttgc aagtgagatg tgaaatccct 540 gggcttaacc taggaaccgc attttagact gcaatgctag agtacagtag agggtagtgg 600 aatttccggt gtagcggtga aatgcgtaga gatcggaagg aacaccagtg gcgaaggcga 660 ctacctggac tgacactgac gctgaggcgc gagagcgtgg ggagcaaaca ggattagata 720 ccctggtagt ccacgctgta aacgatgaga actagatgtt ggtgcgcgcg agcgcacaag 780 tatcgaagct aacgcgataa gttctccgcc tggggagtac ggccgcaagg ttaaaactca 840 aaggaattga cgggggcccg cacaagcggt ggagcatgtg gtttaattcg atgcaacgcg 900 aggaacctta cctacccttg acatccacag aatttgatag agatatcgaa gtgccgaaag 960 gaactgtgag acaggtgctg catggctgtc gtcagctcgt gttgtgagat gttgggttaa 1020 gtcccgtaac gagcgcaacc cttatcctta gttgccaaca cgtaatggtg gggactctaa 1080 ggagactgcc ggtgaagaac cggaggaagg tggggacgac gtcaagtcat catggccttt 1140 atgggtaggg ctacacacgt gctacaatgg ggcgtacaga gggttgccaa cctgcgaagg 1200 ggagccaatc ccggaaagcg cctcgtagtc cagattgaag tctgcaactc gacttcatga 1260 agtcggaatc gctagtaatc gcgaatcaga acgtcc 1296 <210> 92 <211> 1250 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 92 gtctggtagc aataccagat gatggcaagt ggcggacggg tgagtaatac gtagggatct 60 gcccagaaga gggggacaac ccggggaaac tcgggctaat accgcatact attctgagga 120 aaaaagcttg gcgcaagcca ggcgcttttg gaggaaccta cgtccgatta gctagttggt 180 gaggtaaagg ctcaccaagg cagagatcgg tagctggtct gagaggatga ccagccacac 240 tgggactgag acacggccca gactcctacg ggaggcagca gtggggaata ttggacaatg 300 ggggcaaccc tgatccagcg atgccgcgtg tgtgaagaag gccttcgggt tgtaaagcac 360 tttagttggg gaagaagtaa tgttttttaa tagagagcat tgttgacggt acccaaagaa 420 taagcaccgg ctaactctgt gccagcagcc gcggtaatac agagggtgca agcgttaatc 480 ggagttactg ggcgtaaagg gcgcgtaggc ggtgttgcaa gtgagatgtg aaatccctgg 540 gcttaaccta ggaaccgcat tttagactgc aatgctagag tacagtagag ggtagtggaa 600 tttccggtgt agcggtgaaa tgcgtagaga tcggaaggaa caccagtggc gaaggcgact 660 acctggactg acactgacgc tgaggcgcga gagcgtgggg agcaaacagg attagatacc 720 ctggtagtcc acgctgtaaa cgatgagaac tagatgttgg tgcgcgcgag cgcacaagta 780 tcgaagctaa cgcgataagt tctccgcctg gggagtacgg ccgcaaggtt aaaactcaaa 840 ggaattgacg ggggcccgca caagcggtgg agcatgtggt ttaattcgat gcaacgcgaa 900 gaaccttacc tacccttgac atccacagaa tttgatagag atatcgaagt gccgaaagga 960 actgtgagac aggtgctgca tggctgtcgt cagctcgtgt tgtgagatgt tgggttaagt 1020 cccgtaacgg gcgcaaccct tatccttagt tgccaacacg taatggtggg gactctaagg 1080 agactgccgg tgaagaaccg gaggaaggtg gggacgacgt caagtcatca tggcctttat 1140 gggtagggct acacacgtgc tacaatgggg cgtacagagg gttgccaacc tgcgaagggg 1200 agccaatccc ggaaagcgcc tcgtagtcca gattgaagtc tgcaactcga 1250 <210> 93 <211> 1545 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 93 ccaggaaaca gctatgacca tgattacgcc aagcttggta ccgagctcgg atccactagt 60 aacggccgcc agtgtgctgg aattcgccct tcaggcctaa cacatgcaag tcgaacggca 120 aataagcacc ggctaactct gtgccagcag ccgcggtaat acagagggtg caagcgttaa 480 tcggagttac tgggcgtaaa gggcgcgtag gcggtgttgc aagtgagatg tgaaatccct 540 gggcttaacc taggaaccgc attttagact gcaatgctag agtacagtag agggtagtgg 600 aatttccggt gtagcggtga aatgcgtaga gatcggaagg aacaccagtg gcgaaggcga 660 ctacctggac tgacactgac gctgaggcgc gagagcgtgg ggagcaaaca ggattagata 720 ccctggtagt ccacgctgta aacgatgaga actagatgtt ggtgcgcgcg agcgcacaag 780 tatcgaagct aacgcgataa gttctccgcc tggggagtac ggccgcaagg ttaaaactca 840 aaggaattga cgggggcccg cacaagcggt ggagcatgtg gtttaattcg atgcaacgcg 900 aggaacctta cctacccttg acatccacag aatttgatag agatatcgaa gtgccgaaag 960 gaactgtgag acaggtgctg catggctgtc gtcagctcgt gttgtgagat gttgggttaa 1020 gtcccgtaac gagcgcaacc cttatcctta gttgccaaca cgtaatggtg gggactctaa 1080 ggagactgcc ggtgaagaac cggaggaagg tggggacgac gtcaagtcat catggccttt 1140 atgggtaggg ctacacacgt gctacaatgg ggcgtacaga gggttgccaa cctgcgaagg 1200 ggagccaatc ccggaaagcg cctcgtagtc cagattgaag tctgcaactc gacttcatga 1260 agtcggaatc GCTA gtaatc gcgaatcaga acgtcc 1296 <210> 92 <211> 1250 <212> DNA <213> Unknown Organ <220> <223> Origin of the sequence: Soil organism <400> 92 gtctggtagc aataccagat gatgcaagt ggcggacggg tgagtaatac gtagggatct 60 gcccagaaga gggggacaac ccggggaaac tcgggctaat accgcatact attctgagga 120 aaaaagcttg gcgcaagcca ggcgcttttg gaggaaccta cgtccgatta gctagttggt 180 gaggtaaagg ctcaccaagg cagagatcgg tagctggtct gagaggatga ccagccacac 240 tgggactgag acacggccca gactcctacg ggaggcagca gtggggaata ttggacaatg 300 ggggcaaccc tgatccagcg atgccgcgtg tgtgaagaag gccttcgggt tgtaaagcac 360 tttagttggg gaagaagtaa tgttttttaa tagagagcat tgttgacggt acccaaagaa 420 taagcaccgg ctaactctgt gccagcagcc gcggtaatac agagggtgca agcgttaatc 480 ggagttactg ggcgtaaagg gcgcgtaggc ggtgttgcaa gtgagatgtg aatccctgg 540 gcttaaccta ggaaccgcat tttagactgc aatgctagag tacagtagag ggtagtggaa 600 tttccggtgt agcggtgaaa tgcgtagaga tcggaaggaa caccagtggc gaaggcgact 660 acctggactg acactgacgc tgaggcgcga gagcgtgggg agcaaacagg attagatacc 720 ctggtagtcc acgc tgtaaa cgatgagaac tagatgttgg tgcgcgcgag cgcacaagta 780 tcgaagctaa cgcgataagt tctccgcctg gggagtacgg ccgcaaggtt aaaactcaaa 840 ggaattgacg ggggcccgca caagcggtgg agcatgtggt ttaattcgat gcaacgcgaa 900 gaaccttacc tacccttgac atccacagaa tttgatagag atatcgaagt gccgaaagga 960 actgtgagac aggtgctgca tggctgtcgt cagctcgtgt tgtgagatgt tgggttaagt 1020 cccgtaacgg gcgcaaccct tatccttagt tgccaacacg taatggtggg gactctaagg 1080 agactgccgg tgaagaaccg gaggaaggtg gggacgacgt caagtcatca tggcctttat 1140 gggtagggct acacacgtgc tacaatgggg cgtacagagg gttgccaacc tgcgaagggg 1200 agccaatccc ggaaagcgcc tcgtagtcca gattgaagtc tgcaactcga 1250 <210> 93 <211> 1545 <212> DNA <213> Unknown Organ <220> <223> Origin of the sequence: Soil organism <400> 93 ccaggaaaca gctatgacca tgattacgcc aagcttggta ccgagctcgg atccactagt 60 aacggccgcc agtgtgctgg aattcgccct tcaggcctaa cacatgcaag tcgaacggca 120

gcacagggga gcttgctccc tgggtggcga gtggcggacg ggtgaggaat acatcggaat 180 ctgcccagtc gtgggggata acctcgggaa accgggacta ataccgcata cgaccttagg 240 gtgaaagcgg aggaccgcaa ggcttcgcgc gattggatga gccgatgtcg gattagcttg 300 ttggcggggt aacggcccac caaggcgacg atccgtagct ggtctgagag gatgatcagc 360 cacactggaa ctgagacacg gtccagactc ctacgggagg cagcagtggg gaatattgga 420 caatgggcgc aagcctgatc cagccatgcc gcgtgagtga agaaggcctt cgggttgtaa 480 agctcttttg tccggaaaga aaagctttcg gttaataccc ggaagtcctg acggtaccgg 540 aagaataagc accggctaac ttcgtgccag cagccgcggt aatacgaagg gtgcaagcgt 600 tactcggaat tactgggcgt aaagcgtgcg taggtggttt gttaagtctg atgtgaaagc 660 cctgggctca acctgggaat tgcactggat actggcaggc tagagtgcgg tagaggatgg 720 cggaattccc ggtgtagcag tgaaatgcgt agagatcggg aggaacatct gtggcgaagg 780 cggccatctg gaccagcact gacactgagg cacgaaagcg tggggagcaa acaggattag 840 ataccctggt agtccacgcc ctaaacgatg cgaactggat gttgggagca actaggctct 900 cagtatcgaa gctaacgcgt taagttcgcc gcctggggag tacggtcgca agactgaaac 960 tcaaaggaat tgacgggggc ccgcacaagc ggtggagtat gtggtttaat tcgatgcaac 1020 gcgaagaacc ttacctggcc ttgacatcca cggaacttac cagagatggt ttggtgcctt 1080 cggnaaccgt gagacaggtg ctgcatggct gtcgtcagct cgtgtcgtga gatgttgggt 1140 taagtcccgc aacgagcgca acccttgtcc ttagttgcca gcacgtaatg gtgggaactc 1200 taaggagact gccggtgaca aaccggagga aggtggggat gacgtcaagt catcatggcc 1260 cttacggcca gggctacaca cgtactacaa tggtcggtac agagggttgc aaagccgcga 1320 ggtagagcca atcccagaaa accgatccca gtccggatcg aagtctgcaa ctcgacttcg 1380 tgaagtcgga atcgctagta atcgcggatc agaatgccgc ggtgaatacg ttcccgggcc 1440 ttgtacacac cgcccaaggg cgaattctgc agatatccat cacactggcg gccgctcgag 1500 catgcatcta gagggcccaa ttcgccctat agtgagtcgt attac 1545 <210> 94 <211> 1549 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 94 ttttaaaccg acggccagtg aattgtaata cgactcacta tagggcgaat tgggccctct 60 agatgcatgc tcgagcggcc gccagtgtga tggatatctg cagaattcgc ccttcaggcc 120 taacacatgc aagtcgagcg gcagcgcggg gcaacctggc ggcgagcggc ggacgggtga 180 ggaatgcatc ggaatctacc ctgtcgtggg ggataacgta gggaaactta cgctaatacc 240 gcatacgacc gagaggtgaa agtgggggac cgcaaggcct cacgcgatag gatgagccga 300 tgccggatta gctagttggt gaggtaaagg ctcaccaagg cgacgatccg tagctggtct 360 gagaggatga tcagccacat tgggactgag acacggccca aactcctacg ggaggcagca 420 gtggggaata ttggacaatg ggcgcaagcc tgatccagcc atgccgcgtg tgtgaagaag 480 gccttcgggt tgtaaagcac ttttgttcgg gaagaaatcg tgcgggttaa tacccagtac 540 ggatgacggt accgaaagaa taagcaccgg ctaacttcgt gccagcagcc gcggtaatac 600 gaagggtgca agcgttactc ggaatcactg ggcgtaaagc gtgcgtaggc ggttggttaa 660 gtctgctgtg aaagccctgg gctcaacctg ggaactgcag tggatactgg ccagctagag 720 tgtgatagag gatggtggaa ttcccggtgt agcggtgaaa tgcgtagaga tcgggaggaa 780 caccagtggc gaaggcggcc atctggatca acactgacgc tgaggcacga aagcgtgggg 840 agcaaacagg attagatacc ctggtagtcc acgccctaaa cgatgcgaac tggacgttgg 900 gagcaacttg gctctcagtg tcgaagctaa cgcgctaagt tcgccgcctg gggagtacgg 960 tcgcaagact gaaactcaaa ggaattgacg ggggcccgca caagcggtgg agtatgtggt 1020 ttaattcgat gcaacgcgaa gaaccttacc tggccttgac atccacggaa cttaccagag 1080 atggtttggt gccttcggaa ccgtgagaca ggtgctgcat ggctgtcgtc agctcgtgtc 1140 gtgagatgtt gggttaagtc ccgcaacgag cgcaaccctt gtccttagtt gccagcacgt 1200 aatggtggga actctaagga gactgccggt gacaaaccgg aggaaggtgg ggatgacgtc 1260 aagtcatcat ggcccttacg gccagggcta cacacgtact acaatggtcg gtacaagagg 1320 gttgcaaagc ccgcgaggta gagccaatcc cagaaaaccc gatcccagtc ccggatcgaa 1380 gtctgcaact cgacttcgtg aagtcggaat cgctagtaat cgcggatcag aatgccgcgg 1440 tgaatacgtt cccgggcctt gtacacaccg cccaagggcg aattccagca cactggcggc 1500 gcacagggga gcttgctccc tgggtggcga gtggcggacg ggtgaggaat acatcggaat 180 ctgcccagtc gtgggggata acctcgggaa accgggacta ataccgcata cgaccttagg 240 gtgaaagcgg aggaccgcaa ggcttcgcgc gattggatga gccgatgtcg gattagcttg 300 ttggcggggt aacggcccac caaggcgacg atccgtagct ggtctgagag gatgatcagc 360 cacactggaa ctgagacacg gtccagactc ctacgggagg cagcagtggg gaatattgga 420 caatgggcgc aagcctgatc cagccatgcc gcgtgagtga agaaggcctt cgggttgtaa 480 agctcttttg tccggaaaga aaagctttcg gttaataccc ggaagtcctg acggtaccgg 540 aagaataagc accggctaac ttcgtgccag cagccgcggt aatacgaagg gtgcaagcgt 600 tactcggaat tactgggcgt aaagcgtgcg taggtggttt gttaagtctg atgtgaaagc 660 cctgggctca acctgggaat tgcactggat actggcaggc tagagtgcgg tagaggatgg 720 cggaattccc ggtgtagcag tgaaatgcgt agagatcggg aggaacatct gtggcgaagg 780 cggccatctg gaccagcact gacactgagg cacgaaagcg tggggagcaa acaggattag 840 ataccctggt agtccacgcc ctaaacgatg cgaactggat gttgggagca actaggctct 900 cagtatcgaa gctaacgcgt taagttcgcc gcctggggag tacggtcgca agactgaaac 960 tcaaaggaat tgacggggg c ccgcacaagc ggtggagtat gtggtttaat tcgatgcaac 1020 gcgaagaacc ttacctggcc ttgacatcca cggaacttac cagagatggt ttggtgcctt 1080 cggnaaccgt gagacaggtg ctgcatggct gtcgtcagct cgtgtcgtga gatgttgggt 1140 taagtcccgc aacgagcgca acccttgtcc ttagttgcca gcacgtaatg gtgggaactc 1200 taaggagact gccggtgaca aaccggagga aggtggggat gacgtcaagt catcatggcc 1260 cttacggcca gggctacaca cgtactacaa tggtcggtac agagggttgc aaagccgcga 1320 ggtagagcca atcccagaaa accgatccca gtccggatcg aagtctgcaa ctcgacttcg 1380 tgaagtcgga atcgctagta atcgcggatc agaatgccgc ggtgaatacg ttcccgggcc 1440 ttgtacacac cgcccaaggg cgaattctgc agatatccat cacactggcg gccgctcgag 1500 catgcatcta gagggcccaa ttcgccctat agtgagtcgt attac 1545 <210> 94 <211> 1549 <212> DNA <213> Unknown Organ <220> <223> Origin of the sequence: Soil organism < 400> 94 ttttaaaccg acggccagtg aattgtaata cgactcacta tagggcgaat tgggccctct 60 agatgcatgc tcgagcggcc gccagtgtga tggatatctg cagaattcgc ccttcaggcc 120 taacacatgc aagtcgagcg gcagcgcggg gcaacctggc ggcgagcggc ggacgggtga 180 g gaatgcatc ggaatctacc ctgtcgtggg ggataacgta gggaaactta cgctaatacc 240 gcatacgacc gagaggtgaa agtgggggac cgcaaggcct cacgcgatag gatgagccga 300 tgccggatta gctagttggt gaggtaaagg ctcaccaagg cgacgatccg tagctggtct 360 gagaggatga tcagccacat tgggactgag acacggccca aactcctacg ggaggcagca 420 gtggggaata ttggacaatg ggcgcaagcc tgatccagcc atgccgcgtg tgtgaagaag 480 gccttcgggt tgtaaagcac ttttgttcgg gaagaaatcg tgcgggttaa tacccagtac 540 ggatgacggt accgaaagaa taagcaccgg ctaacttcgt gccagcagcc gcggtaatac 600 gaagggtgca agcgttactc ggaatcactg ggcgtaaagc gtgcgtaggc ggttggttaa 660 gtctgctgtg aaagccctgg gctcaacctg ggaactgcag tggatactgg ccagctagag 720 tgtgatagag gatggtggaa ttcccggtgt agcggtgaaa tgcgtagaga tcgggaggaa 780 caccagtggc gaaggcggcc atctggatca acactgacgc tgaggcacga aagcgtgggg 840 agcaaacagg attagatacc ctggtagtcc acgccctaaa cgatgcgaac tggacgttgg 900 gagcaacttg gctctcagtg tcgaagctaa cgcgctaagt tcgccgcctg gggagtacgg 960 tcgcaagact gaaactcaaa ggaattgacg ggggcccgca caagcggtgg agtatgtggt 1020 ttaattcgat gcaacgcga has gaaccttacc tggccttgac atccacggaa cttaccagag 1080 atggtttggt gccttcggaa ccgtgagaca ggtgctgcat ggctgtcgtc agctcgtgtc 1140 gtgagatgtt gggttaagtc ccgcaacgag cgcaaccctt gtccttagtt gccagcacgt 1200 aatggtggga actctaagga gactgccggt gacaaaccgg aggaaggtgg ggatgacgtc 1260 aagtcatcat ggcccttacg gccagggcta cacacgtact acaatggtcg gtacaagagg 1320 gttgcaaagc ccgcgaggta gagccaatcc cagaaaaccc gatcccagtc ccggatcgaa 1380 gtctgcaact cgacttcgtg aagtcggaat cgctagtaat cgcggatcag aatgccgcgg 1440 tgaatacgtt cccgggcctt gtacacaccg cccaagggcg aattccagca cactggcggc 1500

cgttactagt ggatccgagc tcggtaccaa gcttggcgta atcatggtc 1549 <210> 95 <211> 1276 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 95 ctggcggcga gcggcggacg ggtgaggaat acatcggaat ctacccagtc gtgggggata 60 acgtagggaa acttacgcta ataccgcata cgacctgagg gtgaaagcag gggatcgcaa 120 gaccttgcgc gattggatga gccgatgcc gattagctag ttggtgaggt aaaggctcac 180 caaggcgacg atcggtagct ggtctgagag ggtgatcagc cacactggaa ctgagacacg 240 gtccagactc ctacgggagg cagcagggg gaatattgga caatgggcgc aagcctgatc 300 cagccatgcc gcgtgtgtga agaaggcctt cgggttgtaa agcacttttg ttcgggaaga 360 aatcttccga gttaatacct cgggaggatg acggtaccgg aagaataagc accggctaac 420 ttcgtgccag cagccgcggt aatacgaagg gtgcaagcgt tactcggaat tactgggcgt 480 aaagcgtgcg taggtggttc gttaagctg ccgtgaaagc cccgggctca acctgggaat 540 tgcggtggat actggcggac tagagtgcgg tagagggtgg tggaattccc ggtgtagcag 600 tgaaatgcgt agagatcggg aggaacact gtggcgaagc ggccacctgg accagcactg 660 acactgaggc acgaaagcgt ggggagcaaa caggattaga taccctggta gtccacgccc 720 taaacgatgc gaactggacg ttgggagcaa ctaggctctc agtgtcgaag ctaacgcgtt 780 aagttcgccg cctggggagt acggtcgcaa gactgaaact caaaggaatt gacgggggcc 840 cgcacaagcg gtggagtgtg tggtttaatt cgatgcaacg cgaagaacct tacctggcct 900 tgacatccac ggaatccttt agagatagag gagtgccttc gggaaccgtg agacaggtgc 960 tgcatggctg tcgtcagctc gtgtcggag atgttgggtt aagtcccgca acgagcgcaa 1020 cccttgtcct tagttgccag cgcgtaatgg cgggaactct aaggagactg ccggtgacaa 1080 accggaggaa ggtggggatg acgtcaagtc atcatggccc ttacggccag ggctacacac 1140 gtactacaat ggtggggaca gagggtcgcg aagccgcgag gtggagccaa tcccagaaac 1200 cccatcctag tccggatcgg agtctgcaac tcgactccgt gaagtcggaa tcgctagtaa 1260 tcgcggtcag catgcc 1276 <210> 96 <211> 1306 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 96 cagggatcag tagagtggca aacggggag taacgcgtgg gcgacctacc ttcgagtggg 60 ggataacctt ccgaaaggag ggctaaacc gcatgacatc ccgtgtttgg atacacggac 120 atcaaagccg gggatcgcaa gacctggcgc ttggagaggg gcccgcgtcc gattagctag 180 ttggtgaggt cacggctcac caaggcccg atcggtatcc ggcctgagag ggcggacgga 240 cacactggga ctgagacacg gcccagactc ctacgggagg cagcagtggg gaattgttcg 300 caatgggcgc aagcctgacg acgcaacgcc gcgtggagga tgaagacctt cgggtcgtaa 360 actcctttcg accgagatga agacccçccg gcctaatacg ccggcggatt gacagtatcg 420 agggaagaag ccccggctaa ctccgtgcca gcagccgcgg taatacgggg ggggcaagcg 480 ttgttcggaa ttactgggcg taaagggttc gtaggtggct cgctaagtca gacgtgaaat 540 ccctcagctc aactggggaa ctgcgtctga gactggcaag cttgagtgca ggagaggaac 600 gcggaattcc aggtgtagcg gtgaaagcg tagatatctg gaggaacacc ggtggcgaag 660 gcggcgttct ggactgcaac tgacactgag gaacgaaagc taggggagca aacgggatta 720 gataccccgg tagtcctagc cctaaacgat gaatgcttgg tgtggcgggt atcgatccct 780 gccgtgccgc agttaacgcg ataagcatc cgcctgggga gtacggtcgc aaggctgaaa 840 ctcaaaggaa ttgacggggg cccgcacaag cggtggagca tgtggttcaa ttcgacgcaa 900 1549 <210> 95 <211> 1276 <212> DNA <213> Unknown organism <220> <223> Origin of the sequence: Soil organism <400> 95 ctggcggcga ataccgcata cgacctgagg gtgaaagcag gggatcgcaa 120 gaccttgcgc gattggatga gccgatgcc gattagctag ttggtgaggt aaaggctcac 180 caaggcgacg atcggtagct ggtctgagag ggtgatcagc cacactggaa ctgagacacg 240 gtccagactc ctacgggagg cagcagggg gaatattgga caatgggcgc aagcctgatc 300 cagccatgcc gcgtgtgtga agaaggcctt cgggttgtaa agcacttttg ttcgggaaga 360 aatcttccga gttaatacct cgggaggatg acggtaccgg aagaataagc accggctaac 420 ttcgtgccag cagccgcggt aatacgaagg gtgcaagcgt tactcggaat tactgggcgt 480 aaagcgtgcg taggtggttc gttaagctg ccgtgaaagc cccgggctca acctgggaat 540 tgcggtggat actggcggac tagagtgcgg tagagggtgg tggaattccc ggtgtagcag 600 tgaaatgcgt agagatcggg aggaacact gtggcgaagc ggccacctgg accagcactg 660 acactgaggc acgaaagcgt ggggagcaaa caggattaga taccctggta gtcca cgccc 720 taaacgatgc gaactggacg ttgggagcaa ctaggctctc agtgtcgaag ctaacgcgtt 780 aagttcgccg cctggggagt acggtcgcaa gactgaaact caaaggaatt gacgggggcc 840 cgcacaagcg gtggagtgtg tggtttaatt cgatgcaacg cgaagaacct tacctggcct 900 tgacatccac ggaatccttt agagatagag gagtgccttc gggaaccgtg agacaggtgc 960 tgcatggctg tcgtcagctc gtgtcggag atgttgggtt aagtcccgca acgagcgcaa 1020 cccttgtcct tagttgccag cgcgtaatgg cgggaactct aaggagactg ccggtgacaa 1080 accggaggaa ggtggggatg acgtcaagtc atcatggccc ttacggccag ggctacacac 1140 gtactacaat ggtggggaca gagggtcgcg aagccgcgag gtggagccaa tcccagaaac 1200 cccatcctag tccggatcgg agtctgcaac tcgactccgt gaagtcggaa tcgctagtaa 1260 tcgcggtcag catgcc 1276 <210> 96 <211> 1306 <212> DNA <213> Unknown Organ <220> <223> Origin of the sequence: Soil organism < 400> 96 cagggatcag tagagtggca aacggggag taacgcgtgg gcgacctacc ttcgagtggg 60 ggataacctt ccgaaaggag ggctaaacc gcatgacatc ccgtgtttgg atacacggac 120 atcaaagccg gggatcgcaa gacctggcgc ttggagaggg gcccgcgtcc gattagctag 180 ttggtgaggt cacggctcac caaggcccg atcggtatcc ggcctgagag ggcggacgga 240 cacactggga ctgagacacg gcccagactc ctacgggagg cagcagtggg gaattgttcg 300 caatgggcgc aagcctgacg acgcaacgcc gcgtggagga tgaagacctt cgggtcgtaa 360 actcctttcg accgagatga agacccçccg gcctaatacg ccggcggatt gacagtatcg 420 agggaagaag ccccggctaa ctccgtgcca gcagccgcgg taatacgggg ggggcaagcg 480 ttgttcggaa ttactgggcg taaagggttc gtaggtggct cgctaagtca gacgtgaaat 540 ccctcagctc aactggggaa ctgcgtctga gactggcaag cttgagtgca ggagaggaac 600 gcggaattcc aggtgtagcg gtgaaagcg tagatatctg gaggaacacc ggtggcgaag 660 gcggcgttct ggactgcaac tgacactgag gaacgaaagc taggggagca aacgggatta 720 gataccccgg tagtcctagc cctaaacgat gaatgcttgg tgtggcgggt atcgatccct 780 gccgtgccgc agttaacgcg ataagcatc cgcctgggga gtacggtcgc aaggctgaaa 840 ctcaaaggaa ttgacggggg cccgcacaag cggtggagca tgtggttcaa ttcgacgcaa 900

cgcgaagaac cttacctagg ctcgaagtgc agatgaccat cggtgaaagc cgactttcgc 960 aagaacatct gtagaggtgc tgcatggctg tcgtcagctc gtgtcgtgag atgttgggtt 1020 aagtcccgca acgagcgcaa cccttgtttc ctgttgccat caggttaagc tgggcactct 1080 ggagagactg ccggtgacaa accggaggaa ggtggggatg acgtcaagtc agcatggcct 1140 ttatgtctag ggctacacac gtgctacaat ggccggtaca aagcgctgca aacccgcgag 1200 ggtgagccaa tcgcagaaag ccggtctcag ttcggatagc aggctgcaac tcgcctgctt 1260 gaagttggaa tcgctagtaa tcgcggatca gcatgccgcg gtgaat 1306 <210> 97 <211> 1300 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 97 cccgcagggt gagtagatgg caaacgggtg agtaacacgt gggtgacctg cctcagagtg 60 ggggataacg acccgaaagg gtcgctaata ccgcataaca tcctgtcttt ggatagacgg 120 agatcaaagc cggggatcgc aagacctggc gcttagagag gggcccgcgg ccgattagct 180 agttggtgag gtaacggctc accaaggcaa cgatcggtat ccggcctgag agggcggacg 240 gacacactgg gactgagaca cggcccagac tcctacggga ggcagcagtg gggaattgtt 300 cgcaatgggc gcaagcctga cgacgcaacg ccgcgtggag gatgaagatc ttcgggtcgt 360 aaactccttt cgatcgggaa gaacgccrct ggtgtgaaca ccatcagagg gtgacggtac 420 cgagagaaga agccccggct aactctgtgc cagcagccgc ggtaatacag ggggggcaag 480 cgttgttcgg aattactggg cgtaaagggc tcgtaggcgg ccggctaagt ccgacgtgaa 540 atccccaggc ttaacctggg aactgcgtcg gatactggcg ggcttgaatc cgggagaggg 600 atgcggaatt ccaggtgtag cggtgaaatg cgtagatatc tggaggaaca ccggtggcga 660 aggcggcatc ctggaccggt attgacgctg aatagcgaaa gccaggggag caaacgggat 720 tagatacccc ggtagtcctg gccctaaacg atgaatgttt ggtgtggcgg gtatcgatcc 780 ctgccgtgcc gaagctaacg cattaaacat tccgcctggg gagtacggtc gcaaggctga 840 aactcaaagg aattgacggg ggcccgcaca agcggtggag catgtggttc aattcgacgc 900 aacgcgaaga accttaccca ggctcgaacg gcattggaca tccggcgaaa gccggctccc 960 gcaagggccg atgtcgaggt gctgcatggc tgtcgtcagc tcgtgtcgtg agatgttggg 1020 ttaagtcccg caacgagcgc aacccttgtc cgctgttgcc atcacgttat ggtgggcact 1080 ctgcggagac tgccggtgat aaaccggagg aaggtgggga tgacgtcaag tcagcatggc 1140 ctttatgtct ggggctacac acgtgctaca atggccggta caaaccgttg cgatctcgca 1200 agagtgagct aatcggagaa agccggtctc agttcggatt gcaggctgca actcgcctgc 1260 atgaagttgg aatcgctagt aatcgcggat cagcacgccg 1300 <210> 98 <211> 1233 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 98 acggagcggc agacgggaga gtaacacgtg ggaacgtgcc ctttggttcg gaacaacaca 60 gggaaacttg tgctaatacc ggataagccc ttacggggaa agatttatcg ccaaaggatc 120 ggcccgcgtc tgattagcta gttggtgagg taacggctca ccaaggcgac gatcagtagc 180 tggtctgaga ggatgatcag cctcactggg actgagacac ggcccagact cctacgggag 240 gcagcagtgg ggaatattgg acaatgggcg caagcctgat ccagccatgc cgcgtggatg 300 atgaaggccc tagggttgta aagtcctttc ggcggggaag ataatgacgg tacccgcaga 360 agaagccccg gctaacttcg tgccagcagc cgcggtaata cgaagggggc tagcgttgct 420 cggaatcact gggcngtaaa gcgcacgtag gcggcttttt aagtcagggg tgaaatcctg 480 gagctcaact ccagaactgc ctttgatact gagaagcttg agtccgggag aggtgagtgg 540 cgcgaagaac cttacctagg ctcgaagtgc agatgaccat cggtgaaagc cgactttcgc 960 aagaacatct gtagaggtgc tgcatggctg tcgtcagctc gtgtcgtgag atgttgggtt 1020 aagtcccgca acgagcgcaa cccttgtttc ctgttgccat caggttaagc tgggcactct 1080 ggagagactg ccggtgacaa accggaggaa ggtggggatg acgtcaagtc agcatggcct 1140 ttatgtctag ggctacacac gtgctacaat ggccggtaca aagcgctgca aacccgcgag 1200 ggtgagccaa tcgcagaaag ccggtctcag ttcggatagc aggctgcaac tcgcctgctt 1260 gaagttggaa tcgctagtaa tcgcggatca gcatgccgcg gtgaat 1306 <210 > 97 <211> 1300 <212> DNA <213> Unknown Organ <220> <223> Origin of the sequence: Soil organism <400> 97 cccgcagggt gagtagatgg caaacgggtg agtaacacgt gggtgacctg cctcagagtg 60 ggggataacg acccgaaagg gtcgctaata ccgcataaca tcctgtcttt ggatagacgg 120 agatcaaagc cggggatcgc aagacctggc gcttagagag gggcccgcgg ccgattagct 180 agttggtgag gtaacggctc accaaggcaa cgatcggtat ccggcctgag agggcggacg 240 gacacactgg gactgagaca cggcccagac tcctacggga ggcagcagtg gggaattgtt 300 cgcaatgggc gcaagcctga cgacgcaacg ccgcgtggag gatgaagatc ttcgggtcgt 360 aaactccttt cgatcgggaa gaacgccrct ggtgtgaaca ccatcagagg gtgacggtac 420 cgagagaaga agccccggct aactctgtgc cagcagccgc ggtaatacag ggggggcaag 480 cgttgttcgg aattactggg cgtaaagggc tcgtaggcgg ccggctaagt ccgacgtgaa 540 atccccaggc ttaacctggg aactgcgtcg gatactggcg ggcttgaatc cgggagaggg 600 atgcggaatt ccaggtgtag cggtgaaatg cgtagatatc tggaggaaca ccggtggcga 660 aggcggcatc ctggaccggt attgacgctg aatagcgaaa gccaggggag caaacgggat 720 tagatacccc ggtagtcctg gccctaaacg atgaatgttt ggtgtggcgg gtatcgatcc 780 ctgccgtgcc gaagctaacg cattaaacat tccgcctggg gagtacggtc gcaaggctga 840 aactcaaagg aattgacggg ggcccgcaca agcggtggag catgtggttc aattcgacgc 900 aacgcgaaga accttaccca ggctcgaacg gcattggaca tccggcgaaa gccggctccc 960 gcaagggccg atgtcgaggt gctgcatggc tgtcgtcagc tcgtgtcgtg agatgttggg 1020 ttaagtcccg caacgagcgc aacccttgtc cgctgttgcc atcacgttat ggtgggcact 1080 ctgcggagac tgccggtgat aaaccggagg aaggtgggga tgacgtcaag tcagcatggc 1140 ctttatgtct ggggctacac acgtgctaca atggccggta caaaccgttg cgatctcgca 1200agagtgagct aatcggagaa agccggtctc agttcggatt gcaggctgca actcgcctgc 1260 atgaagttgg aatcgctagt aatcgcggat cagcacgccg 1300 <210> 98 <211> 1233 <212> DNA <213> Unknown Organ <220> <223> Origin of the sequence: Soil organism <400> 98 acggagcggc agacgggaga gtaacacgtg ggaacgtgcc ctttggttcg gaacaacaca 60 gggaaacttg tgctaatacc ggataagccc ttacggggaa agatttatcg ccaaaggatc 120 ggcccgcgtc tgattagcta gttggtgagg taacggctca ccaaggcgac gatcagtagc 180 tggtctgaga ggatgatcag cctcactggg actgagacac ggcccagact cctacgggag 240 gcagcagtgg ggaatattgg acaatgggcg caagcctgat ccagccatgc cgcgtggatg 300 atgaaggccc tagggttgta aagtcctttc ggcggggaag ataatgacgg tacccgcaga 360 agaagccccg gctaacttcg tgccagcagc cgcggtaata cgaagggggc tagcgttgct 420 cggaatcact gggcngtaaa gcgcacgtag gcggcttttt aagtcagggg tgaaatcctg 480 gagctcaact ccagaactgc ctttgatact gagaagcttg agtccgggag aggtgagtgg 540

aactgcgagt gtagaggtga aattcgtaga tattcgcaag aacaccagtg gcgaaggcgg 600 ctcactggcc cggtactgac gctgaggtgc gaaagcgtgg ggagcaaaca ggattagata 660 ccctggtagt ccacgctgta aacgatggat gctagccgtt gtcgggttta ctcgtcagtg 720 gcgcagctaa cgcattaagc atcccgcctg gggagtacgg tcgcaagatt aaaactcaaa 780 ggaattgacg ggggcccgca caagcggtgg agcatgtggt tcaattcgaa gcaacgcgca 840 gaaccttacc agcccttgac atgtcccgta tgagtaccag agatggaact cttcagttcg 900 gctggcggga acacaggtgc tgcatggctg tcgtcagctc gtgtcgtgag atgttgggtt 960 aagtcccgca acgagcgcaa ccctcgccct tagttgccat catttagttg ggcactctaa 1020 ggggactgcc ggtgataagc cgcgaggaag gtggggatga cgtcaagtcc tcatggccct 1080 tacgggctgg gctacacacg tgctacaatg gcggtgacag tgggatgcag aggggtaacc 1140 ccgagcaaat ctcaaaaagc cgtctcagtt cggattgtgc tctgcaactc gagcacatga 1200 agttggaatc gctagtaatc gcagatcagc acg 1233 <210> 99 <211> 1304 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 99 cgaaatcccg cagggatcag tagagtggca aacgggtgag taacacgtgg gtgacctgcc 60 ttcgagtggg ggataacgtc ccgaaaggga cgctaatacc gcatgacatc ctgctcttga 120 acgagtggag atcaaagctg gggatcgcaa gacctagcgc tcaaagaggg gcccgcgcct 180 gattagctag ttggtggggt aacggctcac caaggcgacg atcagtatcc ggcctgagag 240 ggcggacgga cacactggga ctgagacacg gcccagactc ctacgggagg cagcagtggg 300 gaattgttcg caatgggcgc aagcctgacg acgcaacgcc gcgtggagga tgaagatctt 360 cgggtcgtaa actcctttcg atcgagacga acggcctccg ggtgaacaat ccggaggagt 420 gacggtaccg agagaagaag ccccggctaa ctccgtgcca gcagccgcgg taatacgggg 480 ggggcaagcg ttgttcggaa ttactgggcg taaagggctc gtaggcggcc aactaagtca 540 gacgtgaaat ccctcggctt aaccggggaa ctgcgtctga tactggatgg ctagaggttg 600 ggagagggat gcggaattcc aggtgtagcg gtgaaatgcg tagatatctg gaggaacacc 660 ggtggcgaag gcggcatcct ggaccaattc tgacgctgag gagcgaaagc caggggagca 720 aacgggatta gataccccgg tagtcctggc cctaaacgat gaatgcttgg tgtggcgggt 780 atcgatccct gccgtgccga agctaacgca ttaagcattc cgcctgggga gtacggtcgc 840 aaggctgaaa ctcaaaggaa ttgacggggg cccgcacaag cggtggagca tgtggttcaa 900 ttcgacgcaa cgcgaagaac cttacccagg cttgaacagc gagtgaccac tcctgaaaag 960 gagcttccgc aaggacactc gtagaggtgc tgcatggctg tcgtcagctc gtgtcgtgag 1020 atgttgggtt aagtcccgca acgagcgcaa cccttgtttg ctgttgccat cacgttatgg 1080 tgggcactct gcaaagactg ccggtgataa accggaggaa ggtggggatg acgtcaagtc 1140 agcatggcct ttatgtctgg ggctacacac gtgctacaat ggccggtaca aaccgtcgca 1200 aaaccgtaag gtcgagctaa tcggagaaag ccggtctcag ttcggatcgt cggctgcaac 1260 tcgccggcgt gaagttggaa tcgctagtaa tcgcggatca gcac 1304 <210> 100 <211> 1197 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 100 tctagtggcg cacgggtgcg taacgcgtgg gaatctgccc ttgggttcgg gataacagtt 60 ggaaacgact gctaataccg gatgatgtct tcggaccaaa gatttatcgc ccagggatga 120 gcccgcgtcg gattagctag ttggtgaggt aaaggctcac caaggcgacg atccgtagct 180 ggtctgagag gatgatcagc cacactggga ctgagacacg gcccagactc ctacgggagg 240 aactgcgagt gtagaggtga aattcgtaga tattcgcaag aacaccagtg gcgaaggcgg 600 ctcactggcc cggtactgac gctgaggtgc gaaagcgtgg ggagcaaaca ggattagata 660 ccctggtagt ccacgctgta aacgatggat gctagccgtt gtcgggttta ctcgtcagtg 720 gcgcagctaa cgcattaagc atcccgcctg gggagtacgg tcgcaagatt aaaactcaaa 780 ggaattgacg ggggcccgca caagcggtgg agcatgtggt tcaattcgaa gcaacgcgca 840 gaaccttacc agcccttgac atgtcccgta tgagtaccag agatggaact cttcagttcg 900 gctggcggga acacaggtgc tgcatggctg tcgtcagctc gtgtcgtgag atgttgggtt 960 aagtcccgca acgagcgcaa ccctcgccct tagttgccat catttagttg ggcactctaa 1020 ggggactgcc ggtgataagc cgcgaggaag gtggggatga cgtcaagtcc tcatggccct 1080 tacgggctgg gctacacacg tgctacaatg gcggtgacag tgggatgcag aggggtaacc 1140 ccgagcaaat ctcaaaaagc cgtctcagtt cggattgtgc tctgcaactc gagcacatga 1200 agttggaatc gctagtaatc gcagatcagc acg 1233 <210> 99 <211> 1304 <212> DNA <213> Organime Unknown < 220> <223> Origin of the sequence: Soil organism <400> 99 cgaaatccg cagggatcag tagagtggca aacgggtgag taacacgtgg gtgacctgcc 60 ttcgagtggg ggataacgtc ccgaaaggga cgctaatacc gcatgacatc ctgctcttga 120 acgagtggag atcaaagctg gggatcgcaa gacctagcgc tcaaagaggg gcccgcgcct 180 gattagctag ttggtggggt aacggctcac caaggcgacg atcagtatcc ggcctgagag 240 ggcggacgga cacactggga ctgagacacg gcccagactc ctacgggagg cagcagtggg 300 gaattgttcg caatgggcgc aagcctgacg acgcaacgcc gcgtggagga tgaagatctt 360 cgggtcgtaa actcctttcg atcgagacga acggcctccg ggtgaacaat ccggaggagt 420 gacggtaccg agagaagaag ccccggctaa ctccgtgcca gcagccgcgg taatacgggg 480 ggggcaagcg ttgttcggaa ttactgggcg taaagggctc gtaggcggcc aactaagtca 540 gacgtgaaat ccctcggctt aaccggggaa ctgcgtctga tactggatgg ctagaggttg 600 ggagagggat gcggaattcc aggtgtagcg gtgaaatgcg tagatatctg gaggaacacc 660 ggtggcgaag gcggcatcct ggaccaattc tgacgctgag gagcgaaagc caggggagca 720 aacgggatta gataccccgg tagtcctggc cctaaacgat gaatgcttgg tgtggcgggt 780 atcgatccct gccgtgccga agctaacgca ttaagcattc cgcctgggga gtacggtcgc 840 aaggctgaaa ctcaaaggaa ttgacggggg cccgcacaag cggtggagca tgtggttcaa 900 ttcgacgcaa cgcgaaga ac cttacccagg cttgaacagc gagtgaccac tcctgaaaag 960 gagcttccgc aaggacactc gtagaggtgc tgcatggctg tcgtcagctc gtgtcgtgag 1020 atgttgggtt aagtcccgca acgagcgcaa cccttgtttg ctgttgccat cacgttatgg 1080 tgggcactct gcaaagactg ccggtgataa accggaggaa ggtggggatg acgtcaagtc 1140 agcatggcct ttatgtctgg ggctacacac gtgctacaat ggccggtaca aaccgtcgca 1200 aaaccgtaag gtcgagctaa tcggagaaag ccggtctcag ttcggatcgt cggctgcaac 1260 tcgccggcgt gaagttggaa tcgctagtaa tcgcggatca CCGA 1304 <210> 100 <211> 1197 <212> DNA <213> Organime Unknown <220> <223> Origin of sequence: soil Organism <400> 100 tctagtggcg cacgggtgcg taacgcgtgg gaatctgccc ttgggttcgg gataacagtt 60 ggaaacgact gctaataccg gatgatgtct tcggaccaaa gatttatcgc ccagggatga 120 gcccgcgtcg gattagctag ttggtgaggt aaaggctcac caaggcgacg atccgtagct 180 ggtctgagag gatgatcagc cacactggga ctgagacacg gcccagactc ctacgggagg 240

cagcagtggg gaatattgga caatgggcga aagcctgatc cagcaatgcc gcgtgagtga 300 tgaaggcctt agggttgtaa agctcttttg cccgggatga taatgacagt accgggagaa 360 taagccccgg ctaactccgt gccagcagcc gcggtaatac ggagggggct agcgttgttc 420 ggaattactg ggcgtaaagc gcacgtaggc ggcttgtaa gttagaggtg aaagcccgga 480 gctcaactcc ggaactgcct ttaagactgc atcgcttgaa cgtcggagag gtaagtggaa 540 ttccgagtgt agaggtgaaa ttcgtagata ttcggaagaa caccagtggc gaaggcgact 600 tactggacga ctgttgacgc tgaggtgcga aagcgtgggg agcaaacagg attagatacc 660 ctggtagtcc acgccgtaaa cgatgatgac tagctgtcgg ggctcatgga gtttcggtgg 720 cgcagctaac gcgttaagtc atccgcctgg ggagtacggc cgcaaggtta aaactcaaag 780 aaattgacgg gggcctgcac aagcggtgga gcagtggtt taattcgaag caacgcgcag 840 aaccttacca gcgtttgaca tggtaggacg gtttccagag atggattcct tcccttacgg 900 gacctacaca caggtgctgc atggctgtcg tcagctcgtg tcgtgagatg ttgggttaag 960 tcccgcaacg agcgcaaccc tcgtcttag ttgctaccat ttagttgggc actctaaaga 1020 aactgccggt gataagccgg aggaaggtgg ggatgacgtc aagtcctcat ggcccttacg 1080 cgctgggcta cacacgtgct acaatggcgg tgacagtggg cagcaaactc gcgagagtga 1140 gcaaatcccc aaaaaccgtc tcagttcgga ttgttctctg caactcgaga gcatgaa 1197 <210> 101 <211> 1352 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence .-Organisme du sol <400> 101 cgacggccag tgaattgtaa tacgactcac taagggcga attgggccct ctagatgcat 60 gctcgagcgg ccgccagtgt gatggatatc tgcagaattc gcccttcagg cctaacacat 120 gcaagtcgca cgagaaaggg cttcggcccc ggacagtgg cgcacgggtg agtaacacgt 180 aggcaatctc ccccgagtg gtggataacc ttccgaaagg agggctaata cagcatgaga 240 ccacgagctc gcagagcttg tggccaaagc ggacctcttc ttgaaagttc gcgcttgagg 300 atgagcctgc ggcccatcag ctagttggta gggtaatggc ctaccaaggc taagacgggt 360 agctggtctg agaggacgga cagccacact ggaactgaga cacggtccag actcctacgg 420 gaggcagcag tggggaatct tgcgcaatgg acgaaagtct gacgcagcga cgccgcgtga 480 gcgatgaagg ccttcgggtt gtaaagctct gtggggagag acgaataagg tgcagctaat 540 acctgcatcg atgacggtat ctccttagca agcaccggct aactctgtgc cagcagccgc 600 ggtaagacag agggtgcaaa cgttgttcgg aattactggg cgtaaagcgc gtgtaggcgg 660 ctgtgtaagt cgggcgtgaa atcccatggc tcaaccatgg aagtgcaccc gaaactgcgt 720 agctagagtc ctggagagga aggtggaatg cttggtgtag aggtgaaatt cgtagatatc 780 aagcggaaca ccggtggcga agcggccttc tggacagtga ctgacgctga gacgcgaaag 840 cgtggggagc aaacaggatt agataccctg gtagtccacg ccgtaaacga tgaatgctag 900 acgctggggt gcatgcactt cggtgtcgcc gctaacgcat taagcattcc gcctggggag 960 tacggccgca aggttaaaac tcaaaggaat tgacgggggc ccgcacaagc ggtggagcat 1020 gtggtttaat tcgaagcaac gcgcaaacct taccaaccct tgacatgtcc attgccggtc 1080 cgagagattg gaccttcagt tcggctggat ggaacacagg tgctgcatgg ctgtcgtcag 1140 ctcgtgtcgt gagatgttgg gttaagtccc gcaacgagcg caacccctac cgccagttgc 1200 catcattcag ttgggcactc tggtggaact gccggtgaca agccggagga agcggggatg 1260 acgtcaagtc ctcatggccc ttatgggttg ggctacacac gtgctacaat ggcggtgaca 1320 gtgggacgcg aagccaaga tggacaaatc cc 1352 <210> 102 <211> 1361 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence : Organisme du sol cagcagtggg gaatattgga caatgggcga aagcctgatc cagcaatgcc gcgtgagtga 300 tgaaggcctt agggttgtaa agctcttttg cccgggatga taatgacagt accgggagaa 360 taagccccgg ctaactccgt gccagcagcc gcggtaatac ggagggggct agcgttgttc 420 ggaattactg ggcgtaaagc gcacgtaggc ggcttgtaa gttagaggtg aaagcccgga 480 gctcaactcc ggaactgcct ttaagactgc atcgcttgaa cgtcggagag gtaagtggaa 540 ttccgagtgt agaggtgaaa ttcgtagata ttcggaagaa caccagtggc gaaggcgact 600 tactggacga ctgttgacgc tgaggtgcga aagcgtgggg agcaaacagg attagatacc 660 ctggtagtcc acgccgtaaa cgatgatgac tagctgtcgg ggctcatgga gtttcggtgg 720 cgcagctaac gcgttaagtc atccgcctgg ggagtacggc cgcaaggtta aaactcaaag 780 aaattgacgg gggcctgcac aagcggtgga gcagtggtt taattcgaag caacgcgcag 840 aaccttacca gcgtttgaca tggtaggacg gtttccagag atggattcct tcccttacgg 900 gacctacaca caggtgctgc atggctgtcg tcagctcgtg tcgtgagatg ttgggttaag 960 tcccgcaacg agcgcaaccc tcgtcttag ttgctaccat ttagttgggc actctaaaga 1020 aactgccggt gataagccgg aggaaggtgg ggatgacgtc aagtcctcat ggcccttacg 1080 cgctgggcta cacacgtgct acaatggcgg tgacagtggg cagcaaactc gcgagagtga 1140 gcaaatcccc aaaaaccgtc tcagttcgga ttgttctctg caactcgaga gcatgaa 1197 <210> 101 <211> 1352 <212> DNA <213> Unknown Organ <220> <223> Origin of the Sequence-Soil Organism <400> 101 cgacggccag tgaattgtaa tacgactcac taagggcga attgggccct ctagatgcat 60 gctcgagcgg ccgccagtgt gatggatatc tgcagaattc gcccttcagg cctaacacat 120 gcaagtcgca cgagaaaggg cttcggcccc ggacagtgg cgcacgggtg agtaacacgt 180 aggcaatctc ccccgagtg gtggataacc ttccgaaagg agggctaata cagcatgaga 240 ccacgagctc gcagagcttg tggccaaagc ggacctcttc ttgaaagttc gcgcttgagg 300 atgagcctgc ggcccatcag ctagttggta gggtaatggc ctaccaaggc taagacgggt 360 agctggtctg agaggacgga cagccacact ggaactgaga cacggtccag actcctacgg 420 gaggcagcag tggggaatct tgcgcaatgg acgaaagtct gacgcagcga cgccgcgtga 480 gcgatgaagg ccttcgggtt gtaaagctct gtggggagag acgaataagg tgcagctaat 540 acctgcatcg atgacggtat ctccttagca agcaccggct aactctgtgc cagcagccgc 600 ggtaagacag agggtgcaaa cgttgttcgg aattactggg cgtaaagcgc gtgtaggcgg 6 60 ctgtgtaagt cgggcgtgaa atcccatggc tcaaccatgg aagtgcaccc gaaactgcgt 720 agctagagtc ctggagagga aggtggaatg cttggtgtag aggtgaaatt cgtagatatc 780 aagcggaaca ccggtggcga agcggccttc tggacagtga ctgacgctga gacgcgaaag 840 cgtggggagc aaacaggatt agataccctg gtagtccacg ccgtaaacga tgaatgctag 900 acgctggggt gcatgcactt cggtgtcgcc gctaacgcat taagcattcc gcctggggag 960 tacggccgca aggttaaaac tcaaaggaat tgacgggggc ccgcacaagc ggtggagcat 1020 gtggtttaat tcgaagcaac gcgcaaacct taccaaccct tgacatgtcc attgccggtc 1080 cgagagattg gaccttcagt tcggctggat ggaacacagg tgctgcatgg ctgtcgtcag 1140 ctcgtgtcgt gagatgttgg gttaagtccc gcaacgagcg caacccctac cgccagttgc 1200 catcattcag ttgggcactc tggtggaact gccggtgaca agccggagga agcggggatg 1260 acgtcaagtc ctcatggccc ttatgggttg ggctacacac gtgctacaat ggcggtgaca 1320 gtgggacgcg aagccaaga tggacaaatc cc 1352 <210> 102 <211> 1361 <212> DNA <213> Organime Unknown <220> <223> Origin of the sequence: Soil organism

<400> 102 aacagctatg accatgatta cgccaagctt ggtaccgagc tcggatccac tagtaacggc 60 cgccagtgtg ctggaattcg cccttcaggc ctaacacatg caagtcgaac ggatccttcg 120 ggattagtgg cggacgggtg agtaacacgt gggaacgtgc cctttggttc ggaacaactc 180 agggaaactt gagctaatac cggataagcc tttcgaggga aagatttatc gccattggag 240 cggcccgcgt aggattagct agttggtgag gtaaaagctc accaaggcga cgatccttag 300 ctggtctgag aggatgatca gccacattgg gactgagaca cggcccaaac tcctacggga 360 ggcagcagtg gggaatcttg cgcaatgggc gcaagcctga tccagccatg ccgcgtgagt 420 gatgaaggcc ttagggttgt aaagctcttt caccggagaa gataatgacg gtatccggag 480 aagaagcccc ggctaacttc gtgccagcag ccgcggtaat acgaaggggg ctagcgttgt 540 tcggaattac tgggcgtaaa gcgcacgtag gcggatattt aagtcagggg tgaaatccca 600 gagctcaact ctggaactgc ctttgatact gggtatcttg agtatggaag aggtaagtgg 660 aattccgagt gtagaggtga aattcgtaga tattcggagg aacaccagtg gcgaaggcgg 720 cttactggtc cattactgac gctgaggtgc gaaagcgtgg ggagcaaaca ggattagata 780 ccctggtagt ccacgccgta aacgatgaat gttagccgtc gggcagtata ctgttcggtg 840 gcgcagctaa cgcattaaac attccgcctg gggagtacgg tcgcaagatt aaaactcaaa 900 ggaattgacg ggggcccgca caagcggtgg agcatgtggt ttaattcgaa gcaacgcgca 960 gaaccttacc agctcttgac attcggggtt tgggcagtgg agacattgtc cttcagttag 1020 gctggcccca gaacaggtgc tgcatggctg tcgtcagctc gtgtcgtgag atgttgggtt 1080 aagtcccgca acgagcgcaa ccctcgccct tagttgccag catttagttg ggcactctaa 1140 ggggactgcc ggtgataagc cgagaggaag gtggggatga cgtcaagtcc tcatggccct 1200 tacgggctgg gctacacacg tgctacaatg gtggtgacag tgggcagcga gacagcgatg 1260 tcgagctaat ctccaaaagc catctcagtt cggattgcat ctgcaactcg agtgcatgaa 1320 gttggaatcg ctagtaatcg cagatcagca tgctgcggtg a 1361 <210> 103 <211> 1300 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence : :Organisme du sol <400> 103 catgtttagt agcaatacta aatgatgacg agcggcggac gggtgaggaa cacgtaggaa 60 cctgcccaag agagggggac aaccaaggga aactttggct aataccgcat aatctctacg 120 gagaaaagtt gcccgtaagg gtggcgcttt tggaggggcc tgcgtccgat tagttagttg 180 gtgaggtaat agctcaccaa gactgtgatc ggtaactggt ctgagaggac gaccagtcac 240 actgggactg agacacggcc cagactccta cgggaggcag cagtggggaa tcttggacaa 300 tgggggcaac cctgatccag cgatgccgcg tgggtgaaga aggccttcgg gttgtaaagc 360 cctttaggcg gggaagaagg atatgggatg aataagcctg tattttgacg gtacccgcag 420 aataagcacc ggcaaactct gtgccagcag ccgcggtaat acagagggtg cgagcgttaa 480 tcggatttac tgggcgtaaa gggcgcgtag gcggttgtgt gagtgtgatg tgaaagcccc 540 gggctcaacc tgggaagtgc atcgcaaacg acacaactgg agtatatgag agggtggcgg 600 aatttccggt gtagcggtga aatgcgtaga gatcggaagg aacgtcgatg gcgaaggcag 660 ccacctggca taatactgac gctgaggcgc gaaagcgtgg ggagcgaaca ggattagata 720 ccctggtagt ccacgccgtaaacgatgaga actagatgtt ggagggggaa cccttcagta 780 tcgaagctaa cgcgataagt tctccgcctg ggaagtacag tcgcaagact gaaactcaaa 840 agaattgacg ggggcccgca caagcggtgg agcatgtggt ttaattcgat gcaacgcgaa 900 gaaccttacc Lacccttgac atcctgcgaa tcttgccgag aggtgagagt gccgcaagga 960 gcgcagagac aggtgctgca tggctgtcgt cagctcgtgt tgtgagatgt tgggttaagt 1020 cccgtaacga gcgcaaccct tgtccttagt tgccatcatt tagttgggga ctctaaggag 1080 accgccggtg atgaaccgga ggaaggcggg gacgacgtca agtcatcatg gcctttatgg 1140 gtagggctac acacgtgcta caatggggcg tacagagggt cgccaacccg cgagggggag 1200 ccaatctctt aaagcgtctc gtagtccgga ttggagtctg caactcgact ccatgaagtc 1260 ggaatcgcta gtaatcgcgg atcagcagtg ccgcggtgaa 1300 <210> 104 <400> 102 aacagctatg accatgatta cgccaagctt ggtaccgagc tcggatccac tagtaacggc 60 cgccagtgtg ctggaattcg cccttcaggc ctaacacatg caagtcgaac ggatccttcg 120 ggattagtgg cggacgggtg agtaacacgt gggaacgtgc cctttggttc ggaacaactc 180 agggaaactt gagctaatac cggataagcc tttcgaggga aagatttatc gccattggag 240 cggcccgcgt aggattagct agttggtgag gtaaaagctc accaaggcga cgatccttag 300 ctggtctgag aggatgatca gccacattgg gactgagaca cggcccaaac tcctacggga 360 ggcagcagtg gggaatcttg cgcaatgggc gcaagcctga tccagccatg ccgcgtgagt 420 gatgaaggcc ttagggttgt aaagctcttt caccggagaa gataatgacg gtatccggag 480 aagaagcccc ggctaacttc gtgccagcag ccgcggtaat acgaaggggg ctagcgttgt 540 tcggaattac tgggcgtaaa gcgcacgtag gcggatattt aagtcagggg tgaaatccca 600 gagctcaact ctggaactgc ctttgatact gggtatcttg agtatggaag aggtaagtgg 660 aattccgagt gtagaggtga aattcgtaga tattcggagg aacaccagtg gcgaaggcgg 720 cttactggtc cattactgac gctgaggtgc gaaagcgtgg ggagcaaaca ggattagata 780 ccctggtagt ccacgccgta aacgatgaat gttagccgtc gggcagtata ctgttcggtg 840 gcgcagctaa cgcattaaac attccgcctg gggagtacgg tcgcaagatt aaaactcaaa 900 ggaattgacg ggggcccgca caagcggtgg agcatgtggt ttaattcgaa gcaacgcgca 960 gaaccttacc agctcttgac attcggggtt tgggcagtgg agacattgtc cttcagttag 1020 gctggcccca gaacaggtgc tgcatggctg tcgtcagctc gtgtcgtgag atgttgggtt 1080 aagtcccgca acgagcgcaa ccctcgccct tagttgccag catttagttg ggcactctaa 1140 ggggactgcc ggtgataagc cgagaggaag gtggggatga cgtcaagtcc tcatggccct 1200 tacgggctgg gctacacacg tgctacaatg gtggtgacag tgggcagcga gacagcgatg 1260 tcgagctaat ctccaaaagc 1325 <210> 103 <211> 1300 <212> DNA <213> Unknown organism <220> <223> Origin of the sequence:: Soil organism <400> 103 catgttagt agcaatacta aatgatgacg catctcagtt cggattgcat ctgcaactcg agtgcatgaa 1320 gttggaatcg ctagtaatcg cagatcagca tgctgcggtg a agcggcggac gggtgaggaa cacgtaggaa 60 cctgcccaag agagggggac aaccaaggga aactttggct aataccgcat aatctctacg 120 gagaaaagtt gcccgtaagg gtggcgcttt tggaggggcc tgcgtccgat tagttagttg 180 gtgaggtaat agctcaccaa gactgtgatc ggtaactggt ctgagaggac gaccagtcac 240 actgggactg agacacggcc cagactccta cgggaggcag cagtggggaa tcttggacaa 300 tgggggcaac cctgatccag cgatgccgcg tgggtgaaga aggccttcgg gttgtaaagc 360 cctttaggcg gggaagaagg atatgggatg aataagcctg tattttgacg gtacccgcag 420 aataagcacc ggcaaactct gtgccagcag ccgcggtaat acagagggtg cgagcgttaa 480 tcggatttac tgggcgtaaa gggcgcgtag gcggttgtgt gagtgtgatg tgaaagcccc 540 gggctcaacc tgggaagtgc atcgcaaacg acacaactgg agtatatgag agggtggcgg 600 aatttccggt gtagcggtga aatgcgtaga gatcggaagg aacgtcgatg gcgaaggcag 660 ccacctggca taatactgac gctgaggcgc gaaagcgtgg ggagcgaaca ggattagata 720 ccctggtagt ccacgccgtaaacgatgaga actagatgtt ggagggggaa cccttcagta 780 tcgaagctaa cgcgataagt tctccgcctg ggaagtacag tcgcaagact gaaactcaaa 840 agaattgacg ggggcccgca caagcggtgg agcatgtggt ttaattcgat gcaacgcgaa 900 gaaccttacc Lacccttgac atcctgcgaa tcttgccgag aggtgagagt gccgcaagga 960 gcgcagagac aggtgctgca tggctgtcgt cagctcgtgt tgtgagatgt tgggttaagt 1020 cccgtaacga gcgcaaccct tgtccttagt tgccatcatt tagttgggga ctctaaggag 1080 accgccggtg atg aaccgga ggaaggcggg gacgacgtca agtcatcatg gcctttatgg 1140 gtagggctac acacgtgcta caatggggcg tacagagggt cgccaacccg cgagggggag 1200 ccaatctctt aaagcgtctc gtagtccgga ttggagtctg caactcgact ccatgaagtc 1260 ggaatcgcta gtaatcgcgg atcagcagtg ccgcggtgaa 1300 <210> 104

<211> 1250 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 104 tgtagcaata catcagtggc agacgggtga gtaacacgtg ggaaccttcc tcgttgtacg 60 ggacaactca gggaaacttg agctaatacc gtatacgtcc gagaggagaa agatttatcg 120 caatgagacg ggcccgcgtc ggattagcta gttggtaagg taacggctta ccaaggcgac 180 gatccgtagc tgatctgaga ggatgatcag ccacactggg actgagacac ggcccagact 240 cctacgggag gcagcagtgg ggaatcttgg acaatgggcg caagcctgat ccagccatgc 300 cgcgtgagtg aagaaggcct tagggttgta aagctctttt gccagggacg ataatgacgg 360 tacctgagaa taagccccgg caaacttcgt gccagcagcc gcggtaatac gaagggggct 420 agcgttgttc ggatttactg ggcgtaaagc gcacgtaggc gggtcgttaa gtcaggggtg 480 aaatcccgga gctcaactcc ggaactgcct ttgatactgg cgaccttgag gctggaagag 540 gttagtggaa ttcccagtgt agaggtgaaa ttcgtagata ttgggaagaa caccagtggc 600 gaaggcggct aactggtcca gatctgacgc tgaggtgcga aagcgtgggg agcaaacagg 660 attagatacc ctggtagtcc acgccgtaaa ctatgggtgc tagctgtcag cgggcttgct 720 cgttggtggc gcagctaacg cattaagcac cccgcctggg gagtacggtc gcaagattaa 780 aacttaaagg aattgacggg ggcccgcaca agcggtggag catgtggttt aattcgaagc 840 aacgcgcaga accttaccaa cccttgacat cccgatcgcg gacaccagag atggagtcct 900 tcagttcggc tggatcggag acaggtgctg catggctgtc gtcagctcgt gtcgtgagat 960 gttgggttaa gtcccgcaac gagcgcaacc ctcgccttta gttgccatca tttagttggg 1020 cactctaaag ggactgccgg tgataagccg gaggaaggtg gggatgacgt caagtcctca 1080 tggcccttac gggttgggct acacacgtgc tacaatggcg gtgacaatgg gcagctactt 1140 cgcaaggaga agctaatccc aaaaagccgt ctcagttcag attgcactct gcaactcggg 1200 tgcatgaagt tggaatcgct agtaatcgct aatcagcagg tagcggtgaa 1250 <210> 105 <211> 1302 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 105 ggcttcggct ccccggtaga gggcggacg ggtgagtaac acgtgggtaa tctgcctttg 60 ggtggggaat aacccttcga aagaggggct aataccgcat aacgcagcgg caccgaatgg 120 tgacagttgt taaagtgggg gatcgcaaga cctcacgcct gaagaggagc ccgcgcccga 180 ttagctagtt ggtgcggtaa tggcgtacca aggcggcgat cggtagccgg cctgagaggg 240 cggacggcca cactggcact gagagacggg ccagactcct acgggaggca gcagtgggga 300 attttgggca atgggcgcaa gcctgaccca gcaacgccgc gtgaaggacg aaatccctct 360 gggatgtaaa cttcgaaagt tggggaagaa atccgtgtga ggataatgca cacgggatga 420 cggtacccaa cgtaagcccc ggctaactac gtgccagcag ccgcggtaat acgtaggggg 480 caagcgttgt tcggaattac tgggcgtaaa gggcgcgtag gcggtacgac aagtctggag 540 tgaaagcccg gggctcaacc ccggaatgtc tttggaaact gtcgaacttg agtgcggaag 600 aggcatctgg aattcccagt gtagcggtga aatgcgtaga tattgggaag aacacctgag 660 gcgaaggcgg gatgctgggc cgacactgac gctgaggcgc gaaagccagg ggagcgaacg 720 ggattagata ccccggtagt cctggcccta aacgatggat acttggtgtg tggggttctc 780 gaagtccccg cgtgccggag caacgcggt aagtatcccg cctggggagt acggtcgcaa 840 ggctgaaact caaaggaatt gacggggacc cgcacaagcg gtggagcatg tggttcaatt 900 cgacgcaacg cgaagaacct tacctgggtt aaatcctacc tcgtcgcctc agagatgagg 960 tttcccttcg ggggaggtag gacggtgctg catggctgtc gtcagctcgt gccgtgaggt 1020 gttgggttaa gtcccgcaac gagcgcaacc cttaccacta gttgccagcg gttcggccgg 1080 gcactctatt gggactgccg gtgacaaacc ggaggaaggt ggggatgacg tcaagtcatc 1140 atggccttta tgtccagggc tacacacgtg ctacaatggc cggaacaaag cgcagcaaac 1200 <211> 1250 <212> DNA <213> Unknown Organ <220> <223> Origin of the sequence: Soil organism <400> 104 tgtagcaata catcagtggc agacgggtga gtaacacgtg ggaaccttcc tcgttgtacg 60 ggacaactca gggaaacttg agctaatacc gtatacgtcc gagaggagaa agatttatcg 120 caatgagacg ggcccgcgtc ggattagcta gttggtaagg taacggctta ccaaggcgac 180 gatccgtagc tgatctgaga ggatgatcag ccacactggg actgagacac ggcccagact 240 cctacgggag gcagcagtgg ggaatcttgg acaatgggcg caagcctgat ccagccatgc 300 cgcgtgagtg aagaaggcct tagggttgta aagctctttt gccagggacg ataatgacgg 360 tacctgagaa taagccccgg caaacttcgt gccagcagcc gcggtaatac gaagggggct 420 agcgttgttc ggatttactg ggcgtaaagc gcacgtaggc gggtcgttaa gtcaggggtg 480 aaatcccgga gctcaactcc ggaactgcct ttgatactgg cgaccttgag gctggaagag 540 gttagtggaa ttcccagtgt agaggtgaaa ttcgtagata ttgggaagaa caccagtggc 600 gaaggcggct aactggtcca gatctgacgc tgaggtgcga aagcgtgggg agcaaacagg 660 attagatacc ctggtagtcc acgccgtaaa ctatgggtgc tagctgtcag cgggcttgct 720 cgttggtggc gcagctaacg cattaagcac cccgcctggg gagtacggt c gcaagattaa 780 aacttaaagg aattgacggg ggcccgcaca agcggtggag catgtggttt aattcgaagc 840 aacgcgcaga accttaccaa cccttgacat cccgatcgcg gacaccagag atggagtcct 900 tcagttcggc tggatcggag acaggtgctg catggctgtc gtcagctcgt gtcgtgagat 960 gttgggttaa gtcccgcaac gagcgcaacc ctcgccttta gttgccatca tttagttggg 1020 cactctaaag ggactgccgg tgataagccg gaggaaggtg gggatgacgt caagtcctca 1080 tggcccttac gggttgggct acacacgtgc tacaatggcg gtgacaatgg gcagctactt 1140 cgcaaggaga agctaatccc aaaaagccgt ctcagttcag attgcactct gcaactcggg 1200 tgcatgaagt tggaatcgct agtaatcgct aatcagcagg tagcggtgaa 1250 <210> 105 <211> 1302 <212> DNA <213> Unknown Organ <220> <223> Origin of the sequence: Soil organism <400> 105 ggcttcggct ccccggtaga gggcggacg ggtgagtaac acgtgggtaa tctgcctttg 60 The invention relates to the following: The invention relates to the following aspects of the application of the present invention: - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -. ccagactcct acgggaggca gcagtgggga 300 attttgggca atgggcgcaa gcctgaccca gcaacgccgc gtgaaggacg aaatccctct 360 gggatgtaaa cttcgaaagt tggggaagaa atccgtgtga ggataatgca cacgggatga 420 cggtacccaa cgtaagcccc ggctaactac gtgccagcag ccgcggtaat acgtaggggg 480 caagcgttgt tcggaattac tgggcgtaaa gggcgcgtag gcggtacgac aagtctggag 540 tgaaagcccg gggctcaacc ccggaatgtc tttggaaact gtcgaacttg agtgcggaag 600 aggcatctgg aattcccagt gtagcggtga aatgcgtaga tattgggaag aacacctgag 660 gcgaaggcgg gatgctgggc cgacactgac gctgaggcgc gaaagccagg ggagcgaacg 720 ggattagata ccccggtagt cctggcccta aacgatggat acttggtgtg tggggttctc 780 gaagtccccg cgtgccggag caacgcggt aagtatcccg cctggggagt acggtcgcaa 840 ggctgaaact caaaggaatt gacggggacc cgcacaagcg gtggagcatg tggttcaatt 900 cgacgcaacg cgaagaacct tacctgggtt aaatcctacc tcgtcgcctc agagatgagg 960 tttcccttcg ggggaggtag gacggtgctg catggctgtc gtcagctcgt gccgtgaggt 1020 gttgggttaa gtcccgcaac gagcgcaacc cttaccacta gttgccagcg gttcggccgg 1080 gcactctatt gggactgccg gtgacaaacc ggaggaaggt ggggatga cg tcaagtcatc 1140 atggccttta tgtccagggc tacacacgtg ctacaatggc cggaacaaag cgcagcaaac 1200

ccgcgagggg gagccaatcg caaaaatccg gtctcagttc ggattggagt ctgcaactcg 1260 actccatgaa gttggaatcg ctagtaatcg cggatcagca tg 1302 <210> 106 <211> 1281 <212> ADN <213> Organime Inconnu <220> <223> Origine de la séquence :Organisme du sol <400> 106 tgcttctctt gagagcggcg gacgggtgag taatgcctag gaatctgcct ggtagtgggg 60 gataacgttc ggaaacggac gctaataccg catacgtcct acgggagaaa gcaggggacc 120 ttcgggcctt gcgctatcag atgagcctag gtcggattag ctagttggtg aggtaatggc 180 tcaccaaggc gacgatccgt aactggtctg agaggatgat cagtcacact ggaactgaga 240 cacggtccag actcctacgg gaggcagcag tggggaatat tggacaatgg gcgaaagcct 300 gatccagcca tgccgcgtgt gtgaagaagg tcttcggatt gtaaagcact ttaagttgga 360 aggaagggca gtaaattaat actttgctgt tttgacgtta ccgacagaat aagcaccggc 420 taactctgtg ccagcagccg cggtaataca gagggtgcaa gcgttaatcg gaattactgg 480 gcgtaaagcg cgcgtaggtg gtttgttaag ttggatgtga aatccccggg ctcaacctgg 540 gaactgcatt caaaactgac tgactagagt atggtagagg gtggtggaat ttcctgtgta 600 gcggtgaaat gcgtagatat aggaaggaac accagtggcg aaggcgacca cctggactaa 660 tactgacact gaggtgcgaa agcgtgggga gcaaacagga ttagataccc tggtagtcca 720 cgccgtaaac gatgtcaact agccgttgga agccttgagc ttttagtggc gcagctaacg 780 cattaagttg accgcctggg gagtacggcc gcaaggttaa aactcaaatg aattgagggg 840 ggcccgcaca agcggtggag catgtggttt aattcgaagc aacgcgaaga accttaccag 900 gccttgacat ccaatgaact ttctagagat agattggtgc cttcgggaac attgagacag 960 gtgctgcatg gctgtcgtca gctcgtgtcg tgagatgttg ggttaagtcc cgtaacgagc 1020 gcaacccttg tccttagtta ccagcacgac atggtgggca ctctaaggag actgccggtg 1080 acaaaccgga ggaaggtggg gatgacgtca agtcatcatg gcccttacgg cctgggctac 1140 acacgtgcta caatggtcgg tacagagggt tgccaagccg cgaggtggag ctaatcccac 1200 aaaaccgatc gtagtccgga tcgcagtctg caactcgact gcgtgaagtc ggaatcgcta 1260 gtaatcgcga atcagaaatg t 1281 ccgcgagggg gagccaatcg caaaaatccg gtctcagttc ggattggagt ctgcaactcg 1260 actccatgaa gttggaatcg ctagtaatcg cggatcagca tg 1302 <210> 106 <211> 1281 <212> DNA <213> Unknown Organ <220> <223> Origin of the sequence: Soil organism <400> 106 tgcttctctt gagagcggcg gacgggtgag taatgcctag gaatctgcct ggtagtgggg 60 gataacgttc ggaaacggac gctaataccg catacgtcct acgggagaaa gcaggggacc 120 ttcgggcctt gcgctatcag atgagcctag gtcggattag ctagttggtg aggtaatggc 180 tcaccaaggc gacgatccgt aactggtctg agaggatgat cagtcacact ggaactgaga 240 cacggtccag actcctacgg gaggcagcag tggggaatat tggacaatgg gcgaaagcct 300 gatccagcca tgccgcgtgt gtgaagaagg tcttcggatt gtaaagcact ttaagttgga 360 aggaagggca gtaaattaat actttgctgt tttgacgtta ccgacagaat aagcaccggc 420 taactctgtg ccagcagccg cggtaataca The invention relates to the following: The invention relates to a method for controlling the operation of a vehicle. actaa 660 tactgacact gaggtgcgaa agcgtgggga gcaaacagga ttagataccc tggtagtcca 720 cgccgtaaac gatgtcaact agccgttgga agccttgagc ttttagtggc gcagctaacg 780 cattaagttg accgcctggg gagtacggcc gcaaggttaa aactcaaatg aattgagggg 840 ggcccgcaca agcggtggag catgtggttt aattcgaagc aacgcgaaga accttaccag 900 gccttgacat ccaatgaact ttctagagat agattggtgc cttcgggaac attgagacag 960 gtgctgcatg gctgtcgtca gctcgtgtcg tgagatgttg ggttaagtcc cgtaacgagc 1020 gcaacccttg tccttagtta ccagcacgac atggtgggca ctctaaggag actgccggtg 1080 acfaaccgga ggaaggtggg gatgacgtca agtcatcatg gcccttacgg cctgggctac 1140 acacgtgcta caatggtcgg tacagagggt tgccaagccg cgaggtggag ctaatcccac 1200 aaaaccgatc gtagtccgga tcgcagtctg caactcgact gcgtgaagtc ggaatcgcta 1260 gtaatcgcga atcagaaatg t 1281

Claims

A method of preparing a nucleic acid collection from a soil sample containing organisms, said method comprising the following sequence of steps: (a) Obtaining micro-particles by grinding a sample of previously dried or desiccated soil, and then suspending the microparticles in a liquid buffer medium; and (b) extracting the nucleic acids present in the microparticles; and (c) - passing the nucleic acid-containing solution over a molecular sieve, then recovering the nucleic acid-enriched elution fractions and passing the nucleic acid-enriched elution fractions on an exchange media chromatography medium. anions and then recovering the elution fractions containing the purified nucleic acids.

2. Method according to claim 1, characterized in that step I- (a) is followed by the complementary steps of: - treatment of microparticles in suspension in a liquid buffer medium by sonication and - extraction and recovery of nucleic acids .

3. Method according to claim 1, characterized in that step I- (a) is followed by the following additional steps: - treatment of microparticles in suspension in a liquid buffer medium by sonication; incubation of the suspension at 37 ° C. after sonication in the presence of lysozyme and of achromopeptidase; - addition of SDS - recovery of nucleic acids.

4. Method according to claim 1, characterized in that step - (a) is followed by the following complementary steps: homogenization of the microparticles by means of a violent mixing stage (vortex) followed by a stage of simple agitation; freezing of the homogeneous suspension followed by thawing; treatment by sonication of the suspension after thawing;

incubation of the suspension at 37 ° C. after sonication in the presence of lysozyme and of achromopeptidase; - addition of SDS; - centrifugation and precipitation of nucleic acids; and - recovery of nucleic acids.

5. Method according to one of claims 1 to 4, characterized in that the nucleic acids are DNA molecules.

6. Process for preparing a recombinant vector collection, characterized in that the nucleic acids obtained by the method according to one of claims 1 to 5 are inserted into a cloning and / or expression vector.

7. Method according to claim 6, characterized in that the nucleic acids are separated according to their size prior to their insertion into the cloning and / or expression vector.

8. Method according to claim 6, characterized in that the average size of the nucleic acids is made substantially uniform by physical rupture, prior to their insertion into the cloning and / or expression vector.

9. Process according to claim 6, characterized in that the cloning and / or expression vector is of the plasmid type.

10. Process according to claim 6, characterized in that the cloning and / or expression vector is of the cosmid type.

11. The method of claim 10, characterized in that it is a replicative cosmid in E. coli and integrative Streptomyces.

12. The method of claim 11, characterized in that it is cosmos pOS7001.

13. The method of claim 10, characterized in that it is a cosmid conjugative and integrative Streptomyces.

14. The method of claim 13, characterized in that it is the cosmid pOSV303.

15. The method of claim 10, characterized in that it is a cosmid replicative both in E. coli and Streptomyces.

16. The method of claim 15, characterized in that it is cosmos pOS 700R.

17. The method of claim 10, characterized in that it is a replicative cosmid in E. coli and Streptomyces and conjugative Streptomyces.

18. Process according to claim 6, characterized in that the cloning and / or expression vector is of the BAC type.

19. Method according to claim 18, characterized in that it is an integrative and conjugative BAC vector in Streptomyces.

20. The method of claim 19, characterized in that it is the BAC vector pOSV403.

21. Nucleic acid collection consisting of the nucleic acids obtained by the method according to one of claims 1 to 5.

22. A nucleic acid characterized in that it is contained in the nucleic acid collection according to claim 21.

23. Nucleic acid according to claim 22, characterized in that it comprises a nucleotide sequence encoding at least one operon, or part of an operon.

24. Nucleic acid according to claim 23, characterized in that the operon encodes all or part of a metabolic pathway.

25. Nucleic acid according to claim 24, characterized in that the metabolic pathway is the synthetic route of the polyketides.

Nucleic acid according to Claim 25, characterized in that it is chosen from polynucleotides comprising the sequences SEQ ID N 30 to 44.

27. Nucleic acid according to claim 22, characterized in that it comprises all of a nucleotide sequence coding for a polypeptide.

28. Nucleic acid according to one of claims 22 to 27, characterized in that it is of prokaryotic origin.

29. Nucleic acid according to claim 28, characterized in that it comes from a bacterium or a virus.

30. Nucleic acid according to one of claims 22 to 24 and 27, characterized in that it is of eukaryotic origin.

31. Nucleic acid according to claim 30, characterized in that it comes from a fungus, a yeast, a plant or an animal.

32. Recombinant vector characterized in that it is chosen from vectors comprising a nucleic acid according to one of claims 26 to 31;

33. Vector according to claim 32 characterized in that it is cosmos pOS 7001.

34. The vector of claim 32 characterized in that it is the cosmid pOSV303.

35. Vector according to claim 32 characterized in that it is the cosmid pOS 700R.

36. Vector according to claim 32 characterized in that it is the BAC vector pOSV403.

37. Recombinant vector collection as obtained according to the method of one of claims 6 to 20.

38. Recombinant cloning and / or expression vector characterized in that it is contained in the recombinant vector collection according to claim 37.

Recombinant host cell comprising a nucleic acid according to one of claims 22 to 31 or a recombinant vector according to claim 38.

40. Recombinant host cell according to claim 39, characterized in that it is a prokaryotic or eukaryotic cell.

41. Recombinant host cell according to claim 40, characterized in that it is a bacterium.

42. Recombinant host cell according to claim 41, characterized in that it is a bacterium selected from E. coli and Streptomyces.

43. Recombinant host cell according to claim 40, characterized in that it is a yeast or a filamentous fungus.

44. Collection of recombinant host cells, each of the host cells constituting the collection comprising a nucleic acid of the nucleic acid collection according to claim 21.

45. Collection of recombinant host cells, each of the host cells constituting the collection comprising a recombinant vector according to one of claims 32 or 38.

46. A method for detecting a nucleic acid of a given nucleotide sequence, or of a nucleotide sequence structurally related to a determined nucleotide sequence, in a recombinant host cell collection according to one of claims 44 or 45, characterized in that comprises the following steps: - bringing the collection of recombinant host cells into contact with a pair of primers hybridizing with the determined nucleotide sequence or hybridizing with the nucleotide sequence structurally related to a determined nucleotide sequence; - perform at least three amplification cycles; detect the optionally amplified nucleic acid.

47. A method for detecting a nucleic acid of a given nucleotide sequence, or of a nucleotide sequence structurally related to a determined nucleotide sequence, in a recombinant host cell collection according to one of claims 44 or 45, characterized in that includes the following steps:

bringing the collection of recombinant host cells into contact with a probe hybridizing with the determined nucleotide sequence or hybridizing with a nucleotide sequence structurally related to the determined nucleotide sequence; detect the hybrid possibly formed between the probe and the nucleic acids included in the vectors of the collection.

48. A method for identifying the production of a compound of interest by one or more recombinant host cells in a collection of recombinant host cells according to one of claims 44 or 45, characterized in that it comprises the following steps: culturing the recombinant host cells of the collection in a suitable culture medium; detection of the compound of interest in the culture supernatant or in the cell lysate of one or more of the recombinant cultured host cells.

A method for selecting a recombinant host cell producing a compound of interest in a recombinant host cell collection according to one of claims 44 or 45, characterized in that it comprises the following steps: - culture of the recombinant host cells of the collection in a suitable culture medium; detection of the compound of interest in the culture supernatant or in the cell lysate of one or more of the recombinant cultured host cells.

selection of the recombinant host cells producing the compound of interest.

50. Process for the production of a compound of interest characterized in that it comprises the following steps: cultivating a recombinant host cell selected according to the method of claim 49; recovering and, if necessary, purifying, the compound produced by said recombinant host cell.

51. A compound of interest characterized in that it is obtained according to the process of claim 50.

52. A compound according to claim 51, characterized in that it is a polyketide.

53. Polyketide characterized in that it is produced by the expression of at least one nucleotide sequence comprising a sequence chosen from the sequences SEQ ID N 30 to 44.

54. A composition comprising a polyketide according to claim 52 or 53.

A pharmaceutical composition comprising a pharmacologically active amount of a polyketide according to claim 52 or 53, in association with a pharmaceutically compatible carrier.