CA2363938A1 - Method for identifying microorganisms based on sequencing gene fragments - Google Patents

Method for identifying microorganisms based on sequencing gene fragments Download PDF

Info

Publication number
CA2363938A1
CA2363938A1 CA 2363938 CA2363938A CA2363938A1 CA 2363938 A1 CA2363938 A1 CA 2363938A1 CA 2363938 CA2363938 CA 2363938 CA 2363938 A CA2363938 A CA 2363938A CA 2363938 A1 CA2363938 A1 CA 2363938A1
Authority
CA
Canada
Prior art keywords
sequence
nucleic acid
seq
relevant
gene
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
CA 2363938
Other languages
French (fr)
Inventor
Jon Jonasson
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Pyrosequencing AB
Original Assignee
Pyrosequencing AB
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Pyrosequencing AB filed Critical Pyrosequencing AB
Priority to CA 2363938 priority Critical patent/CA2363938A1/en
Priority to US10/303,199 priority patent/US20040023209A1/en
Publication of CA2363938A1 publication Critical patent/CA2363938A1/en
Abandoned legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6888Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms
    • C12Q1/689Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms for bacteria

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Analytical Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Organic Chemistry (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Microbiology (AREA)
  • Immunology (AREA)
  • Molecular Biology (AREA)
  • Biotechnology (AREA)
  • Biophysics (AREA)
  • Physics & Mathematics (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The present invention relates to a method of identifying a microorganism in a sample, based upon sequencing, and analysing, using a sequencing-by-synthesis procedure, short stretches, or fragments of a gene.

Accordingly, the present invention provides a method of identifying a microorganism in a sample, said method comprising:

determining the sequence of a region of up to 50 nucleotides in a predetermined site in a gene of said microorganism, thereby to obtain a signature sequence;
and analysing sequencing information in said signature sequence to identify said microorganism, wherein said sequence is determined by detecting the nucleotides incorporated in a primer extension reaction performed using a primer binding at a pre-determined site in said gene.

Description

76010/001.615 Method for identifying microorganisms based on sequencinq~g~ene frarnnents The present invention relates to a method of identifying a microorganism in a sample, advantageously a clinical sample, based upon sequencing, and analysing, using a sequencing-by-synthesis procedure, short stretches, or fragments of a gene.
Microbial infections, namely the infection of a host organism by a microorganism, are one of the major causes of morbidity in general populations. In order to make an effective diagnosis of the disease or infection and to determine an appropriate treatment, it is important to identify rapidly and accurately the etiologic (i.e. causative) agent of the infection, namely to identify (or "type") the microorganism involved in the infection.
In epidemiology, species information is also extremely important to determine the source and mode of transmission.
Conventional methods of diagnosing or typing microbial infections involve culturing a sample taken from the patient (e. g. blood sample), and re-culturing on selective growth medium. Biochemical characterization of the microorganism involved may then take place. Suitable methods of biochemical characterization include gram staining, colonial morphology, indole production testing and O-F reaction (testing whether an organism utilises glucose fermentively, oxidatively, or not at all) and other tests. These assays result in the identification of the species of microorganism involved in the infection, and provide no further information regarding the infection.
The problems with conventional methods of typing microorganisms are multiple and can severely hinder prompt diagnosis of infection. Culturing microorganisms can be time-consuming, especially when the organism is slow growing or even non-cultivatable. For newer species there is a lack of accurate methods for typing.
Classical identification methods based on biochemical, serological, morphological and phenotypic characteristics are traditionally used to identify microorganism infections. However, as more information becomes available regarding microorganisms at the genetic level, the emphasis of diagnostic studies is shifting towards molecular methods, particularly those based on deletion and analysing nucleic acids, or genes, such as sequencing of the 16S rRNA (ribosomal RNA) genes of bacteria or the RNase P RNA gene. One advantage of molecular biology based identification or typing of microorganisms is that there is no need to culture samples. However, conventional sequencing methods used for typing (such as pulse field electrophoresis, hybridization or gel-based sequencing) can be time consuming, days or weeks may be required, and some methods are difficult to perform. Thus, even though nucleic acid sequence analysis is increasingly used for research purposes it is still considered too costly and time-consuming for use in large-scale molecular identification of microorganisms in a routine clinical diagnostic laboratory setting.
Identification of the species of microorganism involved in the infection does not always provide all the information required for the diagnosis, treatment and/or prognosis of the infection in the patient.
For accurate diagnosis, it would be advantageous not only to determine the general "class" (or genus or species) of infecting microorganism present, but also to determine which of the sub-types (e.g. strains) is present. For many infections, the infecting microorganism may occur in a number of different sub-types (strains or genotypes). The advantage of using ' CA 02363938 2003-02-27 molecular biology based techniques is that the sub-type (strain or genotype) of the infection microorganism can be identified. Molecular biology based analysis of the microorganism involved in the infection thus offers some advantages over standard techniques.
A need thus exists not only for a method which offers accurate and quick nucleic acid analysis and hence diagnosis of the infection, but which can be applied to a high number of samples in a high throughput setting in a cost-effective manner. Such information is vital, especially with life-threatening infections and epidemics of infection. Furthermore, such a method which may allow not only genus identification, but also species and strain typing information to be obtained would be highly advantageous. The present invention addresses this need.
The invention is thus based on deriving typing, or identification, information from relatively short nucleotide sequences contained in microorganism genes.
This sequence information is derived using particular sequencing protocols which rely on specific priming and detection of the event of nucleotide incorporation (or non-incorporation) in such specific primer extension reactions. Such sequencing techniques enable valuable and discriminatory sequence information to be obtained from only short nucleotide sequences.
The potential of using genes, particularly the RNA
genes and in particular the bacterial 16S rRNA gene or the RNase P RNA gene, as taxonomic tools have become increasingly evident in recent years. RNase P RNA is part of a ribonucleoprotein, Rnase P which is responsible for the maturation of the 5'-termini of tRNA
molecules. The RNA subunit is approximately 400 nucleotides in length and is responsible for the catalytic activity of the RNase P. At the nucleotide level there are 4 regions of hypervariable nucleotide sequence known as the P3, P12, P17 and P19 loops. In bacteria, the major interspecies differences can be found in the P3 and P19 loops. The remaining 'core structure' which is thought to be essential for catalysis, is conserved across different species. The utility of the variable regions in detecting pathogenic organisms is discussed in WO01/51662.
The 16S rRNA is a structural part of the 30S
ribosomal small subunit, whose functions are essential in the living cell. At the nucleotide level 16S rRNA
consists of eight highly conserved regions, U1-U8, which are invariant across the bacterial domain. In between those conserved regions, nine variable regions can be distinguished, Vl-V9, which are presumed to be segments of less importance for ribosomal function. These regions show a spectrum of different nucleotide substitution rates, which forms a favourable basis for phylogenetic analysis; the expression "rRNAs, the ultimate molecular chronometer" has been coined.
Depending on species, bacterial chromosomes carry from 1 to 15 copies of rRNA genes. The individual rrn operons are monophyletic but heterogeneous 16S rRNA genes within a single microorganism are not rare. It is generally agreed that 16S rRNA and other ribosomal gene sequences are an unusually stable genotypic feature. Tens of thousands such molecules have been catalogued with sequences, structures and taxonomy in public molecular databases, e.g. GenBank at NCBI
(http:///www.ncbi.nlm.nih.gov/). It has been proposed that these data can advantageously be used for identifying unknown bacteria by 16S rRNA gene sequencing (Relman, D.A., Schmidt, T.M., MacDermott, R.P.O and Falkow, S., 1992, New Engl. J. Med., 327, 293-301).
However, as mentioned above such sequencing involved the use of conventional sequencing techniques, with all their attendant drawbacks, to sequence relatively long gene fragments making them unsuitable for use in a clinical diagnostic setting. However, we have now shown that highly accurate provisional classification or identification of commonly encountered clinically important bacteria and other micro-organisms can be obtained on a large scale using a sequencing-by-synthesis based technique for real-time DNA sequence analysis to obtain and analyse the sequence information content of that "signature" nucleotide sequences of selected gene sequences. This concept of "signature matching" is described further below.
Automated microbial identification in a clinical setting generally requires a fast and reliable, generally applicable identification system for approximately 1000 different, but sometimes closely related, pathogens. Most molecular diagnostic kits are narrow in scope and could not possibly fulfil this requirement. However, we have shown that a genotyping method of the present invention as described above, enables such analyses and is sufficiently discriminative to allow the rapid molecular identification, and even subtying, of a range of clinically important bacteria.
Accordingly, the present invention provides a method of identifying a microorganism in a sample, said method comprising:
determining the sequence of a region of up to 50 nucleotides in a predetermined site in a gene of said microorganism, thereby to obtain a signature sequence;
and analysing sequencing information in said signature sequence to identify said microorganism, wherein said sequence is determined by detecting the nucleotides incorporated in a primer extension reaction performed using a primer binding at a pre-determined site in said gene.
The term "identifying" as used herein includes all forms of detecting, determining and/or characterising the identity of the target microorganism. Thus, identify may be detected, determined or characterised at the genus, species, strain or particular genotype level, and different levels or degrees of information pertaining to the identity of the microorganism in question are encompassed by the present invention. The invention thus includes all methods of detecting a microorganism, and discriminating or distinguishing a microorganism. Thus, for example, the present invention allows pathogenic microorganisms to be distinguished from commensals or saprophytes in the same sample (e. g.
in the same environment or habitat). Since sequence information is derived, the method of the invention permits molecular identification of microorganisms (e. g.
microbial isolates) and hence it can be seen that genotyping may be achieved. The method of the invention thus includes methods of typing, and sub-typing, and classifying microorganisms. The methods of the invention may be used for general microorganism classification, characterisation, genotyping, epidemiological typing and phylogenetic analysis. The methods of the invention may particularly be used to ascribe an identity to (i.e. to identify) an unknown microorganism in a sample, including methods of provisional identification and provisional classification.
The "microorganism" according to the present invention may be any microorganism, and can be eukaryotic or prokaryotic. Such microorganisms are generally uni-cellular but need not be so limited.
Advantageously however, the invention is performed on bacteria, which represent a significant class of microbial pathogens, although other organisms such as fungi, algae and protozoa are not excluded. The invention finds particular utility in the identification of pathogenic microorganisms.
The "sample" may be any sample or specimen which contains microorganisms and includes not only biological samples which may contain microorganisms e.g. samples of _ 7 _ cellular or tissue material or body fluids, and microbial isolates or cultures, but also any cell cultures, suspensions or preparations, lysates, etc.
which may contain microbial material, environmental samples (e. g. soil and water samples), food samples (e. g. from food manufacturers, caterers, restaurants, including testing utensils and cooking areas) etc. As mentioned above, the samples may contain microorganisms the identity of which is unknown. The samples may be freshly prepared or prior-treated in any convenient way e.g. for storage. Especially advantageously however, the sample will be a clinical sample, and this may thus include any tissue, cell or fluid sample which may be taken from a patient, to determine the presence or identity of a microbial infection. Representative samples include whole blood and blood-derived products such as plasma, serum and buffy coat, lymph, urine, cerebrospiral fluid, saliva, semen or any other body fluid, faeces tissues, biopsy samples or swabs.
Microorganisms from such samples and specimen may be cultured, and such cultures may be used directly in the procedure e.g. a microbial cell suspension or other cell preparation or indeed a microbial colony (e.g. a bacterial colony). Alternatively, if desired nucleic acid may be extracted or isolated from the sample or microbial material in the sample.
The "patient" may be human, or a veterinary patient, such as farm animals including cattle, horses, sheep, pigs or chickens, companion animals such as dogs and cats, primates such as chimpanzees and gorillas, or any other animal. Herein, the term "animal" includes fish and birds.
It will be seen, therefore, that the method of the invention may be applied to any situation requiring the identification of a microorganism. Particularly advantageously, the method finds utility in identifying microorganisms in clinical samples, and hence in one aspect the invention can be seen as providing a method of diagnosis, for example, wherein the identity of a microbial pathogen causing an infection in a patient or subject is determined. However, the methods of the invention may equally be applied to any microbial classification study, e.g. phylogenetic or taxonomic studies, environmental monitoring, contamination testing, forensic analysis etc.
As explained above, the method involves sequencing a short stretch of nucleotides in a gene to obtain a "signature" sequence, the sequence information content of which may be used to identify the microorganism.
The gene may be any gene i.e. any gene encoding a product which may be an RNA molecule or a protein molecule. If the gene encodes a protein molecule, it will be understood that messenger mRNA is produced as an intermediate product.
Preferably the gene is an RNA gene. The RNA gene may be any RNA gene i.e. any gene encoding RNA as its final product. Such RNA genes include ribosomal RNA
(rRNA) genes (e. g. 5S RNA, 16S rRNa, 18S rRNA, 23S rRNA
and 26S rRNA), transfer RNA (tRNA) genes, ribozymal RNA
genes (e. g. the RNA component of RNase P) and the genes encoding the RNA components of telomerases, splicesomes and other RNA-protein complexes.
Preferably however, the gene will be a ribosomal RNA (rRNA) gene, namely a gene encoding a ribosomal RNA
molecule. The rRNA may be of any ribosomal subunit, i.e. including both the large (50s) and small (30s) subunits. The rRNA may thus be the 16S molecule deriving from the 30s subunit or the 23S and 5S rRNAs deriving from the 50s subunit. Preferably, the rRNA
gene is the 16S rRNA gene.
Alternatively however, the gene will encode a ribozyme RNA product, which may or may not associate with protein subunits to form a ribozyme. Preferably, the ribozyme RNA gene is the gene for RNase P RNA.

A surprising feature of the present invention is that sufficient sequence information to identify a microorganism may be derived from a relatively short nucleotide sequence, namely a sequence of not more than 50 nucleotides. Indeed it has been found that discriminatory information sufficient to identify a microorganism (e. g. at a provisional level or at genus level) may be obtained from a nucleotide sequence as short as 6 nucleotides, e.g., 10 nucleotides. Thus, the region sequenced may be from 6, 10, 12, 15, 20 or 25, nucleotides long, and up to e.g., 30, 35, 40, 45 to 50 nucleotides long and any combination derived therefrom, e.g. 6 to 50, 6 to 40, 10 to 40, 12 to 40, 15 to 40, 10 to 30, 10 to 25, 10 to 20 or 10 to 15, nucleotides long.
In some cases a longer stretch may be sequenced e.g. 15 to 50, 20 to 50, 25 to 40 nucleotides etc. It is possible to combine different sequences from different regions to yield further discriminatory or identificatory information, and this may in certain cases enable shorter sequences to be used. Thus, the method of the invention may be performed by sequencing one or more (i.e. multiple) regions of up to 50 nucleotides of a gene e.g. 2, 3, 4, 5 or more e.g. 1 to 9 or 1 to 6 (e.g. 2 to 6) nucleotides. For example a region from each of the nine variable regions (V1 to V9) of the 16S rRNA gene may be sequenced, or a particular combination thereof, e.g. V1 and V3.
In order for the sequenced region to provide discriminatory information, it will be appreciated that it needs to be variable or distinguishable, as between different microorganisms.
It can thus be viewed as a "discriminatory" or "variable" region. As mentioned above, the sequenced region lies in a pre-determined site in the gene. Thus, the region may be selected to lie in or overlap with a region or site (or locus) of sequence variability (i.e.
genetic variation), namely a site or region which is not conserved as between different microorganisms. As mentioned above, ribosomal genes contain regions of variability e.g. V1 to V9 for the 16S RNA gene, or P3, P12, P17 and P19 for the RNase P RNA gene, and such variable regions or sequences within them, may be used as the variable region according to the present invention.
On the other hand, in order to be able to obtain a primer extension product from a range of different microorganisms (i.e. from any microorganism which may be present in the sample) it will be understood that the primer needs to bind at site which is common (i.e.
conserved or semi-conserved) as between different microorganisms. Thus, in order to perform the invention the primer binding site should be available in all individual microorganisms which may be present in the sample. Such primer binding sites will therefore advantageously lie in regions which are common to, or substantially conserved between different microorganisms. This may readily be achieved by selecting the primer binding site to lie in conserved/semi-conserved regions as discussed above.
Thus, the extension primer (i.e. the sequencing primer) is designed or selected to bind at a pre-determined site which is common to (a conserved or semi-conserved) different microorganisms. Such a primer may be regarded as a universal primer i.e. a primer capable of binding to the selected gene of a range of different microorganisms i.e. of binding non-selectively insofar as the microorganism is concerned, although of course binding primer is specific as regards its binding site.
Such conserved regions may e.g. be or lie within the regions U1 to U8 of the 16S rRNA gene or the conserved core structure of the RNase P RNA mentioned above. The primer is further designed or selected so that when the primer extension reaction is performed the primer is extended over the "variable" or "discriminatory" region to be sequenced. In other words, the extension primer is designed or selected so that its extension product overlaps (or comprises) a region of sequence variability. Thus, the primer binds to the target gene at, or near to, (e.g. within 1 to 40, 1 to 20, 1 to 10, or 1 to 6 bases of) a variable region or site. It will be seen therefore that primer binding sites may be selected which flank a variable region. Where more than one region is to be sequenced, two or more primers are provided, each binding at a different pre-determined site.
From the above it will be appreciated that to design or select the predetermined sites of the variable region and the primer binding site, knowledge of the sequence of the target gene is required.
Primers suitable for use as extension primers of the invention may be publically available, for example primers known for sequencing ribosomal genes e.g. pJBS-V3.SE, B-V3.A5 and pBR.-Vl.A5 sequencing primers for V3 and V1 regions in the 16S rRNA gene (Monstein et al., 2001, FEMS Microbiology Letters, 199, 103-107 and Jonasson et al., APMIS 2002, in press).
The sequencing, or primer extension, step results in the obtention of a "signature" sequence for the target microorganism. In other words, the sequence interpreted from detecting nucleotide incorporation in the primer extension step may be used as the signature of the gene of the target microorganism. This signature sequence may thus be viewed as an identificatory or characterising sequence or "tag" or "motif" for a microorganism. The signature sequence may contain a range of sequence information or data which may be used to identify the microorganism. This may include both full sequence information, identification of particular substitutions or base identity at defined positions (i.e. "landmark" sequence data), combinations or substitutions or of base identity at particular positions, uniqueness of the signature sequence or of base identity at particular positions within it, detection of matches and/or mismatches, insertions, deletions etc. Thus, a signature sequence may have multiple signature attributes.
The information content (i.e. sequence data) in the signature sequence is analysed to identify the microorganism. This analysis step may be accomplished in any known or desired manner for assessing or evaluating sequence information. Thus, the analysis may involve comparing the signature sequence obtained against one or more reference or standard sequences (e.g. a panel or catalogue or database of sequences or a consensus sequence or "template" sequence).
A reference or standard sequence may readily be obtained using publically available information, for example the rRNA sequences and sequence databases mentioned above, or by determining the sequence of one or more known genes using the same sequence procedure (e.g. the same extension primer) as the method of the invention.
The comparison may involve determining sequence identity or similarity using known procedures, comparing particular positions, substitutions, or other sequence features etc., determining the presence of matches, mismatches etc. Thus, a matching step may be performed, wherein it is determined whether or not the signature sequence, or any positions or combinations of positions within it, match a known sequence. Sequence alignments may be performed, again using known procedures. The pattern of nucleotide incorporation detected in the primer extension step may be analysed. Where multiple (i.e. 2 or more) signature sequences are obtained, the sequence information may be analysed combinatorially (e.g. aspects of particular sequence information, or particular attributes may be combined, or assessed together).

Alternatively, the "reference" sequence can be theoretically derived from knowledge of the selected variable region. It may then not be necessary actually to compare the signature sequence obtained with a reference sequence, and the desired typing/sequence information can be read from the sequence obtained.
Once the extension primers for each variable region have been selected and the order of addition of nucleotides determined, it is possible to determine a theoretical output from a primer extension reaction. Thus, by identifying (or recognising) the sequence obtained for a target microorganism molecule may be identified (or recognised). Conveniently, test sequences or patterns and reference sequences or patterns may be compared using sequence recognition software. All such analysis procedures are regarded herein as a step of "matching"
the signature sequence.
Such matching or analysis procedures may be performed in any convenient or desired manner, for example manually, or in an automated fashion using e.g.
appropriate computer software (e. g. computer algorithms). various software for sequence analysis is available publically, for example the BLAST advanced option tools available at NCBI
(http://www.ncbi.nlm.nih.gov/).
As described further below, the present invention is based on a method of "sequencing-by-synthesis" (see e.g. US-A-4,863,849 of Melamede). This is a term used in the art to define sequencing methods which rely on the detection of nucleotide incorporation (or non-incorporation) during a primer-directed polymerase extension reaction. The four different nucleotides (i.e. A, G, T or C nucleotides) are added cyclically or sequentially (conveniently in a known order), and the event of incorporation can be detected in various ways, directly or indirectly, This detection reveals which nucleotide has been incorporated, and hence sequencing information; when the nucleotide (base) which forms a pair (according to the normal rules of base pairing, A-T
and C-G) with the next base in the template target sequence is added, it will be incorporated into the growing complementary strand (i.e. the extended primer) by the polymerase, and this incorporation will trigger a detectable signal, the nature of which depending upon the detection strategy selected.
The primer extension reaction in the sequencing step conveniently may be performed by sequentially adding nucleotides to the reaction mixture (i.e. a polymerase, and primer/template mixture).
Advantageously the different nucleotides are added in known order, and preferably in a pre-determined order.
In a convenient embodiment of the invention, the 4 different nucleotides (i.e. A, G, T and C nucleotides) are added sequentially in a predetermined order of addition. It thus forms a preferred aspect of the invention that the nucleotides are added sequentially in a predetermined order of addition. If desired, the order of addition can be tailored to the microorganism to be identified or to the ribosomal gene in question and the primers used. It will therefore be seen that the order of addition will not necessarily be cyclical e.g. A T G C A T G C but can be e.g. C G C T A G A.
Indeed, it may not be necessary to add all four nucleotides, (i.e. all of A, T, C or G) but a desired selection thereof.
As each nucleotide is added, it may be determined whether or not nucleotide incorporation takes place.
Advantageously, as described in more detail below, it may further be determined the amount (i.e. how many) of each nucleotide incorporated. In this manner, the sequence or a pattern of nucleotide incorporation may be determined. In other words, the step of determining the sequence may comprise determining (or detecting) whether or not, and which, nucleotide is incorporated. If desired, this step also includes determining the amount of each nucleotide incorporated.
In this manner, a "signature" may be obtained for the target microorganism. This "signature" may comprise the base identity (i.e. sequence) of the particular variable sites identified in the variable region for that microorganism.
In order to perform the invention, it may be advantageous or convenient first to amplify the nucleic acid molecule by any suitable amplification method known in the art. The target region to be sequenced would then be an amplicon. Suitable in vitro amplification techniques include any process which amplifies the nucleic acid present in the reaction under the direction of appropriate primers. The amplification method may thus preferably be PCR, or any of the various modifications thereof e.g. the use of nested primers, although it is not limited to this method. Those skilled in the art will appreciate that other amplification procedures may also be used, such as Self-sustained Sequence Replication (3SR), NASBA, the Q-beta replicase amplification system and Ligase chain reaction (LCR) (see for example Abramson and Myers (1993) Current Opinion in Biotech., 4: 41-47).
If PCR is used to amplify the nucleic acid, suitable primers, as discussed previously, are designed or selected to ensure that the region of interest within the nucleic acid sequence (i.e. the variable region), is amplified. PCR can also be used for indiscriminate amplification of all DNA sequences, allowing amplification of essentially all sequences within the sample for study (i.e. total DNA). Linker-primer PCR is particularly suitable for indiscriminate amplification, and uses double stranded oligonucleotide linkers with a suitable overhanging end, which are ligated to the ends of target DNA fragments. Amplification is then conducted using oligonucleotide primers which are specific for the linker sequences. Alternatively, completely random oligonucleotide primers may be used in conjunction with DOP-PCR (degenerate oligonucleotide-primed) to amplify all the DNA within a sample.
Preferably, however amplification is conducted using primers having binding sites which are common or conserved as between different organisms i.e. universal primers designed or selected along the principles set out above for the extension, or sequencing primer.
Conveniently, broad-range amplification primers may be used.
In the method of the invention, several sequences may need to be amplified, to allow several regions to be analysed. Therefore, several appropriate amplification primers may need to be synthesized or selected.
In a preferred embodiment of the invention, one or more of the amplification primers used in the amplification reaction, may be subsequently used as an "extension primer" in the sequencing step. This has the advantage that an amplicon will always yield a primer extension product in the sequencing step.
It will be appreciated that the sequence and length of the oligonucleotide amplification and extension primers to be used in the amplification and extension (sequencing) steps, respectively, will depend on the sequence of the target gene, the desired length of amplification or extension product, the further functions of the primer (i.e. for immobilization) and the method used for amplification and/or extension.
Appropriate primers may readily be designed applying principles and techniques well known in the art.
Advantageously, as mentioned above, an extension primer will bind near (e.g. within 1-40, 1-20, 1-10 or 1-6, preferably within 1-3 bases), substantially adjacent or exactly adjacent to the variable region of the gene and will be complementary to a conserved or semi-conserved region of the gene.

In order for the method of the invention to be performed, knowledge of the sequence of the conserved or semi-conserved region is required in order to design an appropriate complementary extension primer. An extension primer is provided for each of the variable regions, each being specific for a site at or near to the variable site. The specificity is achieved by virtue of complementary base pairing. For all embodiments of the invention, primer design may be based upon principles well known in the art. It is not necessary for the extension or amplification primer to have absolute complementarily to the binding site, but this is preferred to improve the specificity of binding.
The extension primer may be designed to bind to the sense or anti-sense strand of the target gene.
In a preferred embodiment of the invention, the extension primers are designed to bind to the target gene near to the variable region in such a way that upon the addition of nucleotides in a predetermined manner, the sequencing of particular positions or sites in the variable region or a particular variable region takes place discretely.
The "primer extension" reaction according to the invention includes all forms of template-directed polymerase-catalysed nucleic acid synthesis reactions.
Conditions and reagents for primer extension reactions are well known in the art, and any of the standard methods, reagents and enzymes etc. may be used in this step (see e.g. Sambrook et al., (eds), Molecular Cloning: a laboratory manual (1989), Cold Spring Harbor Laboratory Press). Thus, the primer extension reaction at its most basic, is carried out in the presence of primer, deoxynucleotides (dNTPs) and a suitable polymerase enzyme e.g. T7 polymerase, Klenow or Sequenase Ver 2.0 (USB USA), or indeed any suitable available polymerase enzyme. As mentioned above, for an RNA template, reverse transcriptase may be used.

Conditions may be selected according to choice, having regard to procedures well known in the art.
The primer is thus subjected to a primer-extension reaction in the presence of a nucleotide, whereby the nucleotide is only incorporated if it is complementary to the base immediately adjacent (3') to the primer position. The nucleotide may be any nucleotide capable of incorporation by a polymerase enzyme into a nucleic acid chain or molecule. Thus, for example, the nucleotide may be a deoxynucleotide (dNTP, deoxynucleoside triphosphate) or dideoxynucleotide (ddNTP, dideoxynucleoside triphosphate). Thus, the following nucleotides may be used in the primer-extension reaction: guanine (G), cytosine (C), thymine (T) or adenine (A) deoxy- or dideoxy-nucleotides.
Therefore, the nucleotide may be dGTP (deoxyguanosine triphosphate), dCTP (deoxycytidine triphosphate), dTTP
(deoxythymidine triphosphate) or dATP (deoxyadenosine triphosphate). As discussed further below, suitable analogues of dATP, and also for dCTP, dGTP and dTTP may also be used. Thus, modified nucleotides, or nucleotide derivatives may be used so long as they are capable of incorporating and including an activated or detectably-labelled nucleotides (e. g. radio or fluoroscently labelled nucleotide triphosphates for example, a suitable fluorescently labelled nucleotide triphosphate is cyanine 5 S-S-d NTP available from NEN Life Sciences, Boston, USA and as described in WO 00/53812).
Dideoxynucleotides may also be used in the primer-extension reaction. The term "dideoxynucleotide" as used herein includes all 2'-deoxynucleotides in which the 3' hydroxyl group is modified or absent.
Dideoxynucleotides are capable of incorporation into the primer in the presence of the polymerase, but cannot enter into a subsequent polymerisation reaction, and thus function as a "chain terminator". It will therefore be appreciated that in embodiments of the invention which rely on sequential nucleotide addition the use of chain terminating nucleotides is to be avoided (although so-called "false" or "labile"
terminators might be used in which the 3'blocking group may be removed following incorporation. Such modified nucleotides are known and described in the art).
However, in some embodiments of the invention it may be advantageous to use chain terminating nucleotides whereby it is desired to terminate sequencing of one variable region after incorporation of the chain terminating nucleotide, but more sequence information is required for another region.
If the nucleotide is complementary to the target base, the primer is extended by one nucleotide, and inorganic pyrophosphate is released. As discussed further below, in a preferred method, the inorganic pyrophosphate may be detected in order to detect the incorporation of the added nucleotide. The extended primer can serve in exactly the same way in a repeated procedure to determine the next base in the variable region, thus permitting the whole variable region to be sequenced. Different nucleotides may be added sequentially, advantageously in known order, as discussed above, to reveal the nucleotides which are incorporated for each extension primer. Furthermore, in the case where the variable region is homopolymeric or contains a homopolymer site (i.e. contains 2 or more identical bases), the number of nucleotides incorporated of the complementary base will reflect the number present in the homopolymeric region. Accordingly, determining the number of nucleotides incorporated for each nucleotide addition, will reveal this information.
Hence, a primer extension protocol may involve annealing a primer as described above, adding a nucleotide, performing a polymerase-catalysed primer extension reaction, detecting the presence or absence of incorporation of said nucleotide (and advantageously also determining the amount of each nucleotide incorporated) and repeating the nucleotide addition and primer extension steps etc. one or more times. As discussed above, single (i.e. individual) nucleotides may be added successively to the same primer-template mixture.
In order to permit the repeated or successive (iterative) addition of nucleotides in a primer-extension procedure, the previously-added nucleotide must be removed. This may be achieved by washing, or more conveniently, by using a nucleotide-degrading enzyme, for example as described in detail in W098/28440.
Accordingly, in a principal embodiment of the present invention, a nucleotide degrading enzyme is used to degrade any unincorporated or excess nucleotide.
Thus, if a nucleotide is added which is not incorporated (because it is not complementary to the target base), or any added nucleotide remains after an incorporation event (i.e. excess nucleotides) then such unincorporated nucleotides may readily be removed by using a nucleotide-degrading enzyme. This is described in detail in W098/28440.
The term "nucleotide degrading enzyme" as used herein includes any enzyme capable of specifically or non-specifically degrading nucleotides, including at least nucleoside triphosphates (NTPs), but optionally also di- and mono-phosphates, and any mixture or combination of such enzymes, provided that a nucleoside triphosphatase or other NTP-degrading activity is present. Where a chain terminating nucleotide is used (e. g. a dideoxy nucleotide is used), the nucleotide degrading enzyme should also degrade such a nucleotide.
Although nucleotide-degrading enzymes having a phosphatase activity may conveniently be used according to the invention, any enzyme having any nucleotide or nucleoside degrading activity may be used, e.g. enzymes which cleave nucleotides at positions other than at the phosphate group, for example at the base or sugar residues. Thus, a nucleoside triphosphate degrading enzyme is essential for the invention. Nucleoside di-and/or mono-phosphate degrading enzymes are optional and may be used in combination with a nucleoside tri-phosphate degrading enzyme.
The preferred nucleotide degrading enzyme is apyrase, which is both a nucleoside diphosphatase and triphosphatase, catalysing the reactions NTP -~ NDP + Pi and NDP ~ NMP + Pi (where NTP is a nucleoside triphosphate, NDP is a nucleoside diphosphate, NMP is a nucleotide monophosphate and Pi is inorganic phosphate).
Apyrase may be obtained from the Sigma Chemical Company.
Other possible nucleotide degrading enzymes include Pig Pancreas nucleoside triphosphate diphosphorydrolase (Le Bel et al., 1980, J. Biol. Chem.,255, 1227-1233).
Further enzymes are described in the literature.
The nucleotide-degrading enzyme may conveniently be included during the polymerase (i.e. primer extension) reaction step. Thus, for example the polymerase reaction may conveniently be performed in the presence of a nucleotide-degrading enzyme. Although less preferred, such an enzyme may also be added after nucleotide incorporation (or non-incorporation) has taken place, i.e. after the polymerase reaction step.
Thus, the nucleotide-degrading enzyme (e. g.
apyrase) may be added to the polymerase reaction mixture (i.e. target nucleic acid, primer and polymerase) in any convenient way, for example prior to or simultaneously with initiation of the reaction, or after the polymerase reaction has taken place, e.g. prior to adding nucleotides to the sample/primer/polymerase to initiate the reaction, or after the polymerase and nucleotide are added to the sample/primer mixture.
Conveniently, the nucleotide-degrading enzyme may simply be included in the reaction mixture for the polymerase reaction, which may be initiated by the addition of the nucleotide.
According to the present invention, detection of nucleotide incorporation can be performed in a number of ways, such as by incorporation of labelled nucleotides which may subsequently be detected.
As explained above, the invention uses a sequencing-by-synthesis method, and such methods are disclosed extensively in US-A-4,863,849, which discloses a number of ways in which nucleotide incorporation may be determined or detected, e.g. spectrophotometrically or by fluorescent detection techniques, for example by determining the amount of nucleotide remaining in the added nucleotide feedstock, following the nucleotide incorporation step. In a sequencing-by-synthesis reaction, determination of the pattern of nucleotide incorporation may occur simultaneously with primer extension. One working definition of sequencing by synthesis is a method in which a single nucleotide is or is not incorporated into a primed template, incorporation being detected by any suitable means.
This step is repeated by addition of a different nucleotide and incorporation is again detected. These steps are repeated and from the sum of incorporated nucleic acids the sequence can be deduced.
Thus, in the method of the invention it may be directly determined whether or not incorporation of a given nucleotide has taken place. Contrary to conventional sequencing methods (e. g. dideoxy sequencing), sequencing-by-synthesis allows the ordinal numbering of bases to be determined, and it is known exactly where the sequencing primer binds.
Consequently, it is possible readily to derive position used sequence data or information (e.g. which bases are incorporated in which position). Conveniently, sequencing may start from either end of an amplicon.
One method of sequencing-by-synthesis is a method based on the detection of incorporation of fluorescently labelled nucleotides.
The preferred method of sequencing-by-synthesis is a pyrophosphate detection-based method.
Preferably, therefore, nucleotide incorporation is detected by detecting PPi release, preferably by luminometric detection, and especially by bioluminometric detection.
PPi can be determined by many different methods and a number of enzymatic methods have been described in the literature (Reeves et al., (1969), Anal. Biochem., 28, 282-287; Guillory et al., (1971), Anal. Biochem., 39, 170-180; Johnson et al., (1968), Anal. Biochem., 15, 273; Cook et al., (1978), Anal. Biochem. 91, 557-565;
and Drake et al., (1979), Anal. Biochem. 94, 117-120).
It is preferred to use luciferase and luciferin in combination to identify the release of pyrophosphate since the amount of light generated is substantially proportional to the amount of pyrophosphate released which, in turn, is directly proportional to the amount of nucleotide incorporated. The amount of light can readily be estimated by a suitable light sensitive device such as a luminometer. Thus, luminometric methods offer the advantage of being able to be quantitative.
Luciferin-luciferase reactions to detect the release of PPi are well known in the art. In particular, a method for continuous monitoring of PPi release based on the enzymes ATP sulphurylase and luciferase has been developed (Nyren and Lundin, Anal.
Biochem., 151, 504-509, 1985; Nyren P., Enzymatic method for continuous monitoring of DNA polymerase activity (1987) Anal. Biochem Vol 167 (235-238)) and termed ELIDA
(Enzymatic Luminometric Inorganic Pyrophosphate Detection Assay). The use of the ELIDA method to detect PPi is preferred according to the present invention.
The method may however be modified, for example by the use of a more thermostable luciferase (Kaliyama et al., 1994, Biosci. Biotech. Biochem., 58, 1170-1171) and/or ATP sulfurylase (Onda et al., 1996, Bioscience, Biotechnology and Biochemistry, 60:10, 1740-42). This method is based on the following reactions:
ATP sulphurylase 2_ PPi + APS ---------------> ATP + SOq luciferase ATP + luciferin + OZ ----------> AMP + PPi +
oxyluciferin + COz + by (APS = adenosine 5'-phosphosulphate) Reference may also be made to WO 98/13523 and WO
98/28448, which are directed to pyrophosphate detection-based sequencing procedures, and disclose PPi detection methods which may be of use in the present invention.
In a PPi detection reaction based on the enzymes ATP sulphurylase and luciferase, the signal (corresponding to PPi released) is seen as light. The generation of the light can be observed as a curve known as a PyrogramTM. Light is generated by luciferase action on the product, ATP (produced by a reaction between PPi and APS (see below) mediated by ATP sulphurylase) and, where a nucleotide-degrading enzyme such as apyrase is used, this light generation is then "turned off" by the action of the nucleotide-degrading enzyme, degrading the ATP which is the substrate for luciferase. The slope of the ascending curve may be seen as indicative of the activities of DNA polymerase (PPi release) and ATP
sulphurylase (generating ATP from the PPi, thereby providing a substrate for luciferase). The height of the signal is dependent on the activity of luciferase, and the slope of the descending curve is, as explained above, indicative of the activity of the nucleotide-degrading enzyme. In a PyrogramTM in the context of a homopolymeric region, peak height is also indicative of the number of nucleotides incorporated for a given nucleotide addition step. Thus, when a nucleotide is added, the amount of PPi released will depend upon how many nucleotides (i.e. the amount) are incorporated, and this will be reflected in the slope height.
Advantageously, by including the PPi detection enzymes) (i.e. the enzyme or enzymes necessary to achieve PPi detection according to the enzymatic detection system selected, which in the case of ELIDA, will be ATP sulphurylase and luciferase) in the polymerase reaction step, the method of the invention may readily be adapted to permit extension reactions to be continuously monitored in real-time, with a signal being generated and detected, as each nucleotide is incorporated.
Thus, the PPi detection enzymes (along with any enzyme substrates or other reagents necessary for the PPi detection reaction) may simply be included in the polymerase reaction mixture.
A potential problem which has previously been observed with PPi-based sequencing methods is that dATP, used in the chain extension reaction, interferes in the subsequent luciferase-based detection reaction by acting as a substrate for the luciferase enzyme. This may be reduced or avoided by using, in place of deoxyadenosine triphosphate (ATP), a dATP analogue which is capable of acting as a substrate for a polymerase but incapable of acting as a substrate for a PPi-detection enzyme. Such a modification is described in detail in W098/13523.
The term "incapable of acting" includes also analogues which are poor substrates for the detection enzymes, or which are substantially incapable of acting as substrates, such that there is substantially no, negligible, or no significant interference in the PPi detection reaction.

Thus, a further preferred feature of the invention is the use of a dATP analogue which does not interfere in the enzymatic PPi detection reaction but which nonetheless may be normally incorporated into a growing DNA chain by a polymerase. By "normally incorporated"
is meant that the nucleotide is incorporated with normal, proper base pairing. In the preferred embodiment of the invention where luciferase is a PPi detection enzyme, the preferred analogue for use according to the invention is the [1-thio]triphosphate (or a-thiotriphosphate) analogue of deoxy ATP, preferably deoxyadenosine [1-thio]triphospate, or deoxyadenosine a-thiotriphosphate (dATPaS) as it is also known. dATPaS, along with the a-thio analogues of dCTP, dGTP and dTTP, may be purchased from Amersham Pharmacia.
Experiments have shown that substituting dATP with dATPaS allows efficient incorporation by the polymerase with a low background signal due to the absence of an interaction between dATPaS and luciferase. False signals are decreased by using a nucleotide analogue in place of dATP, because the background caused by the ability of dATP to function as a substrate for luciferase is eliminated. In particular, an efficient incorporation with the polymerase may be achieved while the background signal due to the generation of light by the luciferin-luciferase system resulting from dATP
interference is substantially decreased. The dNTPaS
analogues of the other nucleotides may also be used in place of the other dNTPs.
Another potential problem which has previously been observed with sequencing-by-synthesis methods is that false signals may be generated and homopolymeric stretches (i.e. CCC) may be difficult to sequence with accuracy. This may be overcome by the addition of a single-stranded nucleic acid binding protein (SSB) once the extension primers have been annealed to the template nucleic acid. The use of SSB in sequencing-by-synthesis is discussed in WO 00/43540 of Pyrosequencing AB.
In order for the primer-extension reaction to be performed, the nucleic acid molecule to the sequenced (i.e. the ribosomal gene), regardless of whether or not it has been amplified, is conveniently provided in a single-stranded format. The nucleic acid may be subjected to strand separation by any suitable technique known in the art (e.g. Sambrook et al., supra), for example by heating the nucleic acid, or by heating in the presence of a chemical denaturant such as formamide, urea or formaldehyde, or by use of alkali.
However, this is not absolutely necessary and a double-stranded nucleic acid molecule may be used as template, e.g. with a suitable polymerase having strand displacement activity.
Where a preliminary amplification step is used, regardless of how the nucleic acid has been amplified, all components of the amplification reaction need to be removed, to obtain pure nucleic acid, prior to carrying out the typing assay of the invention. For example, unincorporated nucleotides, PCR primers, and salt from a PCR reaction need to be removed. Methods for purifying nucleic aids are well known in the art (Sambrook et al., supra), however a preferred method is to immobilize the nucleic acid molecule, removing the impurities via washing and/or sedimentation techniques.
Optionally, therefore, the nucleic acid to be sequenced may be provided with a means for immobilization, which may be introduced during amplification, either through the nucleotide bases or the primers used to produce tile amplified nucleic acid.
To facilitate immobilization, the amplification primers used according to the invention may carry a means for immobilization either directly or indirectly.
Thus, for example the primers may carry sequences which are complementary to sequences which can be attached directly or indirectly to an immobilizing support or may carry a moiety suitable for direct or indirect attachment to an immobilizing support through a binding partner.
Numerous suitable supports for immobilization of DNA and methods of attaching nucleotides to them, are well known in the art and widely described in the literature. Thus for example, supports in the form of microtitre wells, tubes, dipsticks, particles, fibres or capillaries may be used, made for example of agarose, cellulose, alginate, teflon, latex or polystyrene.
Advantageously, the support may comprise magnetic particles e.g. the superparamagnetic beads produced by Dynal Biotech ASA (Oslo, Norway) and sold under the trademark DYNABEADS. Chips may be used as solid supports to provide miniature experimental systems as described for example in Nilsson et al. (Anal. Biochem.
(1995), 224:400-408).
The solid support may carry functional groups such as hydroxyl, carboxyl, aldehyde or amino groups for the attachment of the primer or capture oligonucleotide.
These may in general be provided by treating the support to provide a surface coating of a polymer carrying one of such functional groups, e.g. polyurethane together with a polyglycol to provide hydroxyl groups, or a cellulose derivative to provide hydroxyl groups, a polymer or copolymer of acrylic acid or methacrylic acid to provide carboxyl groups or an amino alkylated polymer to provide amino groups. US patent No. 4,654,267 describes the introduction of many such surface coatings.
Alternatively, the support may carry other moieties for attachment, such as avidin or streptavidin (binding to biotin on the nucleotide sequence), DNA binding proteins (e.g. the lac I repressor protein binding to a lac operator sequence which may be present in the primer or oligonucleotide), or antibodies or antibody fragments (binding to haptens e.g. digoxigenin on the nucleotide sequence). The streptavidin/biotin binding system is very commonly used in molecular biology, due to the relative ease with which biotin can be incorporated within nucleotide sequences, and indeed the commercial availability of biotin-labelled nucleotides. This represents one preferred method for immobilisation of target nucleic acid molecules according to the present invention. Streptavidin-coated DYNABEADS are commercially available from Dynal Biotech ASA.
As mentioned above, immobilization may conveniently take place after amplification. To facilitate post amplification immobilisation, one or both of the amplification primers are provided with means for immobilization. Such means may comprise as discussed above, one of a pair of binding partners, which binds to the corresponding binding partner carried on the support. Suitable means for immobilization thus include biotin, haptens, or DNA sequences (such as the lac operator) binding to DNA binding proteins.
When immobilization of the amplification products is not performed, the products of the amplification reaction may simply be separated by for example, taking them up in a formamide solution (denaturing solution) and separating the products, for example by electrophoresis or by analysis using chip technology.
Immobilization provides a ready and simple way to generate a single-stranded template for the extension reaction. As an alternative to immobilization, other methods may be used, for example asymmetric PCR, exonuclease protocols or quick denaturation/annealing protocols on double stranded templates may be used to generate single stranded DNA. Such techniques are well known in the art.
The method of the present invention is particularly advantageous in the diagnosis of pathological conditions characterised by the presence of a particular or specific microorganism, particularly infectious diseases. The method can be used to characterise or type and quantify microbial (e. g. bacterial, protozoal and fungal) infections where samples of an infecting organism may be difficult to obtain or where an isolated organism is difficult to grow in vitro for subsequent characterisation (e. g. as in the case of P. falciparum or Chlamydia species). Due to the simplicity and speed of the method it may also be used to detect or identify a wide range of pathological agents which cause diseases such as of clinical importance. Even in cases where samples of the injecting organism may be easily obtained, the speed of this method compared with overnight incubation of a culture may make the method according to the invention preferable over conventional techniques.
The high capacity and convenience of the method also make it particularly suitable for screening large numbers of samples, or for screening for the presence of a large number of organisms. A large number of samples may be simultaneously analysed.
The invention also comprises kits for carrying out the method of the inventlOIl. These will normally include one or more of the following components:
optionally primers) for in vitro amplification;
one or more primers for the primer extension reaction;
nucleotides for amplification and/or for the primer extension reaction (as described above); a polymerase enzyme for the amplification and/or primer extension reaction; and means for detecting primer extension (e. g.
means of detecting the release of pyrophosphate as outlined and defined above or means for detecting the incorporation of fluorescently labelled nucleotides).
In certain embodiments, the kit will also include instructions for the order of addition of the nucleotides.
The invention will now be described by way of non-limiting examples with reference to the drawings in which:-Figure 1 shows two panels. The upper panel shows sequence alignment of 16S rDNA variable V1 region of H.
pylori isolates HP-HJM 1-25 and reference strains H.
pylori 26695 and J99. Gaps indicate deletions, and dashes indicate positions at which the sequences were homologous to that of reference strain H. pylori 26695.
Lineages A to F indicate six individual 16S rDNA V1 alleles (signature sequences) at positions 75 to 99 (E.
coli nomenclature). The 16S rDNA broad-range sequencing primer pBR-V1/as corresponds to a consensus sequence between positions 120 and 100 of many clinically important bacteria.
The lower panel shows sequence alignment of the variable V3 region of H. pylori isolates HP-HJM 1-25, reference strain H. pylori 26695 (AE000620/644), H.
pylori J99 (AE001534/56), and the type strain H. pylori CCUG 17878T(U01331). Gaps indicate deletions, and dashes indicate DNA sequence homologies compared to the type strain. The HP-V3T/as sequencing primer corresponds to the sequence of type strain H. pylori CCUG 17874T.
For clarity, the corresponding sequences of H.
pylori-related strains H. heilmanii (Y18028), H. bills (AF047847), H. hepaticus (L39122) and H. cholecystus (U46129) are included; and Figure 2 shows Pyrosequencing~' of 16S rDNA variable V1 region of H. pylori isolates performed as described in Example 1 with cyclic dispensation of the nucleotides (Dispensation order: ACGT). Each pyrogram represents ari individual H. pylori lineage (A - F). The corresponding nucleotide signature sequences as interpreted by a custom-made application program are shown in Figure 1 (upper panel). The plots show nucleotide addition versus light emitted.
Figure 3 shows PyrosequencingTM obtained for 3 isolates obtained with the CONS sequence 5'-AACGTCAGAGGAGCAAGCTCCTCGT-3' using the pBR-Vl primer SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT: Pyrosequencing AB
(ii) TITLE OF INVENTION: Method for Identifying Microrganisms based on Sequencing Gene Fragments (iii) NUMBER OF SEQUENCES: 87 (iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: SMART & BIGGAR
(B) STREET: 650 WEST GEORGIA STREET, SUITE 2200 (C) CITY: VANCOUVER
(D) STATE: BRITISH COLUMBIA
(E) COUNTRY: CANADA
(F) ZIP: V6B 4N8 (v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk (B) COMPUTER: IBM PC compatible (C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: CA 2,363,938 (B) FILING DATE: 28-NOV-2001 (C) CLASSIFICATION: C12Q-1/68 (viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: ROBINSON, J. CHRISTOPHER
(C) REFERENCE/DOCKET NUMBER: 40745-6 (ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (604) 682-7780 (B) TELEFAX: (604) 682-0274 (2) INFORMATION FOR SEQ ID NO:1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = Streptomyces sp.
(iii) HYPOTHETICAL: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
aacgtcagag gagcaagctc ctcgt 25 (2) INFORMATION FOR SEQ ID N0:2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = Primer : bio-pBR-5'/se (xi) SEQUENCE DESCRIPTION: SEQ ID N0:2:
gaagagtttg atcatggctc ag 22 (2) INFORMATION FOR SEQ ID N0:3:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = Primer - pBR-V1/as (xi) SEQUENCE DESCRIPTION: SEQ ID N0:3:
ttactcaccc gtccgccact 20 (2) INFORMATION FOR SEQ ID N0:4:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = Primer: HP-V3T/asb (xi) SEQUENCE DESCRIPTION: SEQ ID N0:4:
agctctggca agccagaca 19 (2) INFORMATION FOR SEQ ID N0:5:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = Primer: bio-pJB-1/se (xi) SEQUENCE DESCRIPTION: SEQ ID N0:5:
attcgatgca acgcgaagaa ccttacc 27 (2) INFORMATION FOR SEQ ID N0:6:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = Primer: bio-pJBS-V3. SE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6:
gcaacgcgaa gaaccttacc 20 (2) INFORMATION FOR SEQ ID N0:7:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = Primer: B-V3. AS
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7:
acgacagcca tgcagcacct 20 (2) INFORMATION FOR SEQ ID N0:8:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = Staphylococcus sp.
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:8:
aaatcttgac atcctctgac ccctctagag atagagtttt 40 (2) INFORMATION FOR SEQ ID N0:9:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = Staphylococcus sp.
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:9:
aaatcttgac atcctctgac cctcctagag atagagtttt 40 (2) INFORMATION FOR SEQ ID NO:10:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = STAPHYLOCOCCUS SAPROPHYTICUS
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:
aaatcttgac atcctttgaa aactctagag atagagcctt 40 (2) INFORMATION FOR SEQ ID NO:11:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = STAPHYLOCOCCUS AUREUS
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:
aaatcttgac atcctttgac aactctagag atagagcctt 40 (2) INFORMATION FOR SEQ ID N0:12:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = Fusobacterium sp.
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:12:
agcgtttgac atcctacgaa cggagcagag atgcgccggt 40 (2) INFORMATION FOR SEQ ID N0:13:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = STREPTOCOCCUS PYOGENES
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:13:
aggtcttgac atcccgatgc ccgctctaga gatagagttt 40 (2) INFORMATION FOR SEQ ID N0:14:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = STREPTOCOCCUS PNEUMONIAE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:14:
aggtcttgac atccctctga ccgctctaga gatagagttt 40 (2) TNFORMATION FOR SEQ ID N0:15:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = STREPTOCOCCUS AGALACTIAE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:15:
aggtcttgac atccttctga ccggcctaga gataggcttt 40 (2) INFORMATION FOR SEQ ID N0:16:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 32 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = ENTEROCOCCUS GALLINARUM
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:16:
aggtcttgac atcctttgac cactctagag at 32 (2) INFORMATION FOR SEQ ID N0:17:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 39 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = ENTEROCOCCUS FAECIUM
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:17:
aggtcttgac atcctttgac cactctagag atagagctt 39 (2) INFORMATION FOR SEQ ID N0:18:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 39 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = ENTEROCOCCUS FAECALIS
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:18:
aggtcttgac atcctttgac cactctagag atagagctt 39 (2) INFORMATION FOR SEQ ID N0:19:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 39 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = LISTERIA MONOCYTOGENES
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:19:
aggtcttgac atcctttgac cactctggag acagagctt 39 (2) INFORMATION FOR SEQ ID N0:20:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = BACTERIOIDES FRAGILIS
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:20:
cgggcttaaa ttgcagtgga atgatgtgga aacatgtcag 40 (2) INFORMATION FOR SEQ ID N0:21:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = CLOSTRIDIUM PERFRINGENS
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:21:
tacacttgac atcccttgca ttactcttaa tcgaggaaat 40 (2) INFORMATION FOR SEQ ID N0:22:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = Yersinia sp.

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:22:
tactcttgac atccacggaa tttagcagag atgctttagt 40 (2) INFORMATION FOR SEQ ID N0:23:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 39 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = HAEMOPHILUS PARAINFLUENZAE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:23:
tactcttgac atccagagaa cattccagag atggattgg 39 (2) INFORMATION FOR SEQ ID N0:24:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = ENTEROBACTER CLOACAE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:24:
tactcttgac atccagagaa cttaccagag atggtttggt 40 (2) INFORMATION FOR SEQ ID N0:25:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = KLEBSIELLA OXYTOCA
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:25:
tactcttgac atccagagaa cttagcagag atgctttggt 40 (2) INFORMATION FOR SEQ ID N0:26:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = KLEBSIELLA PNEUMONIAE

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:26:
tactcttgac atccagagaa cttagcagag atgctttggt 40 (2) INFORMATION FOR SEQ ID N0:27:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = CITROBACTER FREUNDII
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:27:
tactcttgac atccagagaa cttagcagag atgctttggt 40 (2) INFORMATION FOR SEQ ID N0:28:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 29 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = MORGANELLA MORGANII
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:28:
tactcttgac atccagagaa cttcagaga 29 (2) INFORMATION FOR SEQ ID N0:29:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 39 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = Serratia Sp.
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:29:
tactcttgac atccagagaa ctttccagag atggattgg 39 (2) INFORMATION FOR SEQ ID N0:30:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C} STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = ENTEROBACTER CLOACAE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:30:
tactcttgac atccagagaa ctttccagag atggattggt 40 (2) INFORMATION FOR SEQ ID N0:31:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRA3~EDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = PROTEUS MIRABILIS
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:31:
tactcttgac atccagcgaa tcctttagag atagaggagt 40 (2) INFORMATION FOR SEQ ID N0:32:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = HAEMOPHILUS PARAINFLUEN?AF
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:32:
tactcttgac atccatggaa tcttgtagag atatgagagt 40 (2) INFORMATION FOR SEQ ID N0:33:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = HAEMOPHILUS INFLUENZAE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:33:
tactcttgac atcctaagaa gagctcagag atgagcttgt 40 (2) INFORMATION FOR SEQ ID N0:34:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = Clostridium sp.
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:34:
tagacttgac atctcctgca ttactcttaa tcgaggaagt 40 (2) INFORMATION FOR SEQ ID N0:35:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = Acinetobacter sp.
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:35:
tggccttgac atagtaagaa ctttccagag atggattggt 40 (2) INFORMATION FOR SEQ ID N0:36:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = PSEUDOMONAS AERUGINOSA
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:36:
tggccttgac atgctgagaa ctttccagag atggattggt 40 (2) INFORMATION FOR SEQ ID N0:37:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = STENOTROPHOMONAS MALTOPHILIA
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:37:
tggccttgac atgtcgagaa ctttccagag atggattggt 40 (2) INFORMATION FOR SEQ ID N0:38:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = CAMPYLOBACTER JEJUNI
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:38:
tgggcttgat atcctaagaa ccttatagag atatgagggt 40 (2) INFORMATION FOR SEQ ID N0:39:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = Acinetobacter sp.
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:39:
tggtcttgac atagtaagaa ctttccagag atggattggt 40 (2) INFORMATION FOR SEQ ID N0:40:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 36 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = MORAXELLA CATARRHALIS
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:40:
tggtcttgac atagtgagaa tcttgcagag atgcga 36 (2) INFORMATION FOR SEQ ID N0:41:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = Salmonella sp.
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:41:
tggtcttgac atccacagaa ctttccagag atggactggt 40 (2) INFORMATION FOR SEQ ID N0:42:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = Escherichia coli (xi) SEQUENCE DESCRIPTION: SEQ ID N0:42:
tggtcttgac atccacagaa ctttccagag atggattggt 40 (2) INFORMATION FOR SEQ ID N0:43:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = KLEBSIELLA OXYTOCA
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:43:
tggtcttgac atccacagaa ctttccagag atggattggt 40 (2) INFORMATION FOR SEQ ID N0:44:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = KLEBSIELLA PNEUMONIAE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:44:
tggtcttgac atccacagaa ctttccagag atggattggt 40 (2) INFORMATION FOR SEQ ID N0:45:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = Salmonella sp.
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:45:
tggtcttgac atccacagaa gaatccagag atggatttgt 40 (2) INFORMATION FOR SEQ ID N0:46:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 38 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = Citrobacter sp.
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:46:
tggtcttgac atccacgaag attgcagaga tggctgga 38 (2) INFORMATION FOR SEQ ID N0:47:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 36 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = Escherichia coli (xi) SEQUENCE DESCRIPTION: SEQ ID N0:47:
tggtcttgac atccacgaag atttacgaga tgatga 36 (2) INFORMATION FOR SEQ ID N0:48:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = Shigella sp.
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:48:
tggtcttgac atccacgaag atttccagag atg 33 (2) INFORMATION FOR SEQ ID N0:49:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = Shigella sp.
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:49:
tggtcttgac atccacggaa gttttcagag atgagaatgt 40 (2) INFORMATION FOR SEQ ID N0:50:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = Escherichia coli (xi) SEQUENCE DESCRIPTION: SEQ ID N0:50:
tggtcttgac atccacggaa gttttcagag atgagaatgt 40 (2) INFORMATION FOR SEQ ID N0:51:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = NEISSERIA GONORRHOEAE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:51:
tggttttgac atgtgcggaa tcctccggag acggaggagt 40 (2) INFORMATION FOR SEQ ID N0:52:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = SIGNATURE SEQUENCE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:52:
aacatcagag 10 (2) INFORMATION FOR SEQ ID N0:53:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = SIGNATURE SEQUENCE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:53:
aacgtcaaag 10 (2) INFORMATION FOR SEQ ID N0:54:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION. /desc = SIGNATURE SEQUENCE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:54:
aacgtcagag 10 (2) INFORMATION FOR SEQ ID N0:55:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = SIGNATURE SEQUENCE
(xi) SEQUET?CE DESCRIPTION: SEQ ID N0:55:
aactttggaa 10 (2) INFORMATION FOR SEQ ID N0:56:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = SIGNATURE SEQUENCE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:56:
aagatcagta 10 (2) INFORMATION FOR SEQ ID N0:57:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = SIGNATURE SEQUENCE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:57:
aagtatcaga 10 (2) INFORMATION FOR SEQ ID N0:58:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = SIGNATURE SEQUENCE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:58:
aatccttccg 10 (2) INFORMATION FOR SEQ ID N0:59:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = SIGNATURE SEQUENCE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:59:
agatttgttc 10 (2) INFORMATION FOR SEQ ID N0:60:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = SIGNATURE SEQUENCE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:60:
caagtccgaa 10 (2) INFORMATION FOR SEQ ID N0:61:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = SIGNATURE SEQUENCE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:61:
catcagtcta 10 (2) INFORMATION FOR SEQ ID N0:62:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = SIGNATURE SEQUENCE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:62:
catccagaga 10 (2) INFORMATION FOR SEQ ID N0:63:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = SIGNATURE SEQUENCE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:63:
cctctttcca 10 (2) INFORMATION FOR SEQ ID N0:64:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = SIGNATURE SEQUENCE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:64:
cctctttttc 10 (2) INFORMATION FOR SEQ ID N0:65:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = SIGNATURE SEQUENCE
(xi) SEQUENCE DESCRIPTION: SEQ TD N0:65:
ccttgaaccg 10 (2) INFORMATION FOR SEQ ID N0:66:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = SIGNATURE SEQUENCE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:66:
cgccacccaa 10 (2) INFORMATION FOR SEQ ID N0:67:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = SIGNATURE SEQUENCE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:67:
cgccacccga 10 (2) INFORMATION FOR SEQ ID N0:68:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = SIGNATURE SEQUENCE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:68:
cgccggcaaa 10 (2) INFORMATION FOR SEQ ID N0:69:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = SIGNATURE SEQUENCE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:69:
cgtcacccaa 10 (2) INFORMATION FOR SEQ ID N0:70:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = SIGNATURE SEQUENCE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:70:
cgtcacccag 10 (2) INFORMATION FOR SEQ ID N0:71:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = SIGNATURE SEQUENCE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:71:
cgtcacccga 10 (2) INFORMATION FOR SEQ ID N0:72:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = SIGNATURE SEQUENCE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:72:
cgtcagcaaa 10 (2) INFORMATION FOR SEQ ID N0:73:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = SIGNATURE SEQUENCE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:73:
cgtcagcaag 10 (2) INFORMATION FOR SEQ ID N0:74:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = SIGNATURE SEQUENCE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:74:
cgtcagcaga 10 (2) INFORMATION FOR SEQ ID N0:75:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = SIGNATURE SEQUENCE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:75:
cgtcagcgaa 10 (2) INFORMATION FOR SEQ ID N0:76:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = SIGNATURE SEQUENCE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:76:
cgtcatcaaa 10 (2) INFORMATION FOR SEQ ID N0:77:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = SIGNATURE SEQUENCE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:77:
ctcaagagaa 10 (2) INFORMATION FOR SEQ ID N0:78:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = SIGNATURE SEQUENCE
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:78:
ctttcttcgg 10 (2) INFORMATION FOR SEQ ID N0:79:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = SIGNATURE SEQUENCE

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:79:
gaatccagga 10 (2) INFORMATION FOR SEQ ID N0:80:
SEQUENCE CHARACTERISTICS:
(A) LENGTH: 23 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = Primer - JB1 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:80:
cgaactaatc ggaagagtaa ggc 23 (2) INFORMATION FOR SEQ ID N0:81:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = Primer - JB2 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:81:
gagcgagtaa gccggrttct gt 22 (2) INFORMATION FOR SEQ ID N0:82:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = Primer - MK forward (xi) SEQUENCE DESCRIPTION: SEQ ID N0:82:
aagagtaagg carccgc 17 (2) INFORMATION FOR SEQ ID N0:83:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs (B) TYPE; nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = Primer- MK2 reverse (xi) SEQUENCE DESCRIPTION: SEQ ID N0:83:
agtcckgact ttcctct 17 (2) INFORMATION FOR SEQ ID N0:84:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 tease pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = Primer - MK3 forward (xi) SEQUENCE DESCRIPTION: SEQ ID N0:84:
tagakgaatg rytgc 15 (2) INFORMATION FOR SEQ ID N0:85:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: Z6 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = Primer - MK4 Reverse (xi) SEQUENCE DESCRIPTION: SEQ ID N0:85:
taagccggut tctgtc 16 (2) INFORMATION FOR SEQ ID N0:86:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = HELICOBACTER PYLORI
(1x) FEATURE:
(A) NAME/KEY: misc_feature (B) LOCATION: 6..6 (D) OTHER INFORMATION: /note= n is any nucleotide (x1) SEQUENCE DESCRIPTION: SEQ ID N0:86:
aatcangcac tctagcaagc tagaa 25 (2) INFORMATION FOR SEQ ID N0:87:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 48 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: not relevant (D) TOPOLOGY: not relevant (ii) MOLECULE TYPE: other nucleic acid (A) DESCRIPTION: /desc = HELICOBACTER PYLORI
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:87:
agctctggca agccagacac tccactattt ctagcggatt ctctcaat 48

Claims (18)

1. A method of identifying a microorganism in a sample, said method comprising:

determining the sequence of a region of up to 50 nucleotides in a predetermined site in a gene of said microorganism, thereby to obtain a signature sequence;
and analysing sequencing information in said signature sequence to identify said microorganism, wherein said sequence is determined by detecting the nucleotides incorporated in a primer extension reaction performed using a primer binding at a pre-determined site in said gene.
2. The method of claim 1 wherein said gene is an RNA
gene.
3. The method of claim 1 wherein said gene encodes the RNA components of telomerases, splicesomes and/or other RNA-protein complexes.
4. The method of claim 2 wherein said RNA gene is a ribosomal RNA (rRNA gene).
5. The method of claim 4 wherein the rRNA gene is 5S
rRNA, 16S rRNA, 18S rRNA, 23S rRNA and/or 26S rRNA.
6. The method of claim 5 wherein the rRNA gene is the 16S rRNA gene.
7. The method of claim 6 wherein said predetermined site in the 16S rRNA gene is selected from one or more of the nine variable regions, V1 to V9.
8. The method of claim 2 wherein said gene is a ribozymal RNA gene.
9. The method of claim 8 wherein the ribozymal RNA
gene is the RNA component of RNase P.
10. The method of claim 9 wherein said predetermined site is selected from one or more of the variable regions P3, P12, P17 and P19 loops.
11. The method of claim 1 wherein the region sequenced is 10 to 40 nucleotides long.
12. The method of claim 1 wherein the region sequenced is 10 to 15 nucleotides long.
13. The method of claim 1 wherein the pre-determined primer binding site lies in a conserved or semi-conserved region.
14. The method of claim 1 wherein one or more further regions of up to 50 nucleotides of a gene are sequenced.
15. The method of claim 1 wherein the primer extension reaction is performed by sequentially adding nucleotides in a predetermined order of addition in the presence of a polymerase.
16. The method of claim 1 or 8 wherein as each nucleotide is added, it is determined whether or not the nucleotide is incorporated into the extended primer by the polymerase.
17. The method of claim 9 wherein the nucleotide incorporation is detected by detecting PPi release.
18. The method of claim 1 wherein the strain of said microorganism is identified.
CA 2363938 2001-11-28 2001-11-28 Method for identifying microorganisms based on sequencing gene fragments Abandoned CA2363938A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CA 2363938 CA2363938A1 (en) 2001-11-28 2001-11-28 Method for identifying microorganisms based on sequencing gene fragments
US10/303,199 US20040023209A1 (en) 2001-11-28 2002-11-25 Method for identifying microorganisms based on sequencing gene fragments

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CA 2363938 CA2363938A1 (en) 2001-11-28 2001-11-28 Method for identifying microorganisms based on sequencing gene fragments

Publications (1)

Publication Number Publication Date
CA2363938A1 true CA2363938A1 (en) 2003-05-28

Family

ID=4170679

Family Applications (1)

Application Number Title Priority Date Filing Date
CA 2363938 Abandoned CA2363938A1 (en) 2001-11-28 2001-11-28 Method for identifying microorganisms based on sequencing gene fragments

Country Status (1)

Country Link
CA (1) CA2363938A1 (en)

Similar Documents

Publication Publication Date Title
US20040023209A1 (en) Method for identifying microorganisms based on sequencing gene fragments
US20200263168A1 (en) High throughput transcriptome analysis
EP1322782B1 (en) Method of nucleic acid typing or sequencing
US6238866B1 (en) Detector for nucleic acid typing and methods of using the same
JP3514630B2 (en) Amplification and detection of nucleic acid sequences
Sharkey et al. Detection and quantification of gene expression in environmental bacteriology
CA2233079C (en) Method for characterizing nucleic acid molecules
CN1703521B (en) Quantification of gene expression
US20030157499A1 (en) Method of assessing the amount of nucleic acid in a sample
CN111979303A (en) Nucleic acid detection kit, method and application thereof
CN111218529B (en) Primer composition, kit and method for detecting novel coronaviruses
US20040197794A1 (en) Nucleic acid detection method
CN108184327A (en) For identifying the composition of drug resistant M and method
US20090023151A1 (en) Method For The Labeling And Detection Of Small Polynucleotides
Hadidi et al. Polymerase chain reaction
US6261773B1 (en) Reagent for nucleic acid amplification and process for nucleic acid amplification
CN110628925B (en) Primer, kit and method for detecting clostridium difficile
KR101955074B1 (en) Snp markers for discrimination of raphanus sativus
EP1275734A1 (en) Method for random cDNA synthesis and amplification
KR20100012319A (en) Methods for classifying and identifying sepsis-causing microorganisms
CA2363938A1 (en) Method for identifying microorganisms based on sequencing gene fragments
CN116042928B (en) Primer group for amplifying and detecting nucleic acid sequence of digestive tract virus
US10093988B2 (en) Universal primers and the use thereof for the detection and identification of amphibia/fish species
WO2024023510A1 (en) Method and kit for detecting single nucleotide polymorphisms (snp) by loop-mediated isothermal amplification (lamp)
CN116355896A (en) Primer probe for detection, primer probe group and application thereof

Legal Events

Date Code Title Description
FZDE Dead