EP4298242A1 - Détection améliorée de séquences génomiques et molécules sondes associées - Google Patents

Détection améliorée de séquences génomiques et molécules sondes associées

Info

Publication number
EP4298242A1
EP4298242A1 EP22705127.3A EP22705127A EP4298242A1 EP 4298242 A1 EP4298242 A1 EP 4298242A1 EP 22705127 A EP22705127 A EP 22705127A EP 4298242 A1 EP4298242 A1 EP 4298242A1
Authority
EP
European Patent Office
Prior art keywords
probe
sample
nucleic acids
genome
signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
EP22705127.3A
Other languages
German (de)
English (en)
Inventor
Holger Klapproth
Sonja Bednar
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Safeguard Biosystems Holdings Ltd
Original Assignee
Safeguard Biosystems Holdings Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Safeguard Biosystems Holdings Ltd filed Critical Safeguard Biosystems Holdings Ltd
Priority claimed from PCT/IB2022/050494 external-priority patent/WO2022157672A1/fr
Publication of EP4298242A1 publication Critical patent/EP4298242A1/fr
Pending legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6813Hybridisation assays
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6888Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms
    • C12Q1/689Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms for bacteria
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/156Polymorphic or mutational markers

Definitions

  • the present disclosure provides improved methods of identifying genomic sequences and/or distinguishing homologous genomic sequences that may be present in a sample.
  • the methods utilize combinations of probe molecules that cannot individually but can collectively distinguish homologous genomic sequences from closely related species of microorganisms and may also identify genomic sequences that are likely to be present in a sample. Such combinations of probe molecules are referred to herein as “virtual probes” for convenience.
  • Virtual probes can comprise a plurality (e.g., two, three, or more than three) of individual probe molecules.
  • the hybridization of a genomic DNA (or a PCR product amplified from the genomic DNA) to an individual probe molecule may not be sufficient, without sequencing, to differentiate the genomic DNA from a homologous genomic DNA of a genetically related species that may be present in the same sample, particularly when using universal primers that target conserved sequences in related species to amplify the genomic DNA.
  • the genomic DNA or corresponding PCR amplicon is probed with a virtual probe comprising two or more probe molecules, the difference in hybridization patterns to the virtual probes can differentiate the genomic DNA from two related species or homologous amplicons amplified therefrom.
  • the virtual probes of the disclosure can in combination distinguish a genomic sequence from a homologous genomic sequence (or amplicons prepared therefrom) and identify the microbial species present in a biological sample.
  • a combination of two or three oligonucleotide probe molecules can in combination form a virtual probe that distinguishes between amplicons from related species such as coagulase negative Staphylococcus sp. (e.g., S. hominis) and coagulase positive Staphylococcus sp. (e.g., S. aureus).
  • coagulase negative Staphylococcus sp. e.g., S. hominis
  • coagulase positive Staphylococcus sp. e.g., S. aureus
  • a probe in the virtual probe that is capable of hybridizing to homologous genomic sequence from multiple closely related species (e.g., a coagulase negative Staphylococcus sp. and a coagulase positive Staphylococcus sp.) such that the signal from the probe can provide a measure of the relative amount of the homologous genomic sequences in a sample.
  • probes are referred to herein as “meter probes” for convenience.
  • a first formula can be used to analyze hybridization signals for the probes of the virtual probe, and when the signal for the meter probe is below the threshold value, a second formula can be used.
  • a meter probe can be particularly useful for improving the accuracy of microorganism identification when using a virtual probe having a probe molecule that reaches saturation before one or more other probe molecules of the virtual probe.
  • the signal pattern obtained when probing a sample can vary depending on concentration of the target sequences in the sample.
  • the signal pattern obtained for the test sample can be analyzed using a formula specific for the relative concentration of target sequences, e.g., the hybridization pattern can be analyzed using a first formula if the relative concentration is high (e.g., where the signal for a meter probe is above a threshold value), and analyzed using a second formula if the relative concentration is low (e.g., where the signal for a meter probe is below a threshold value).
  • Example 6 exemplifies the use of a meter probe to increase the accuracy of identifying a sample as containing a coagulase negative Staphylococcus sp. or coagulase positive S. aureus.
  • the improved accuracy of microorganism identification provided by a meter probe represents a significant improvement over existing microorganism detection methods which do not make use of a meter probe.
  • the present disclosure provides methods of detecting if a first microorganism (or corresponding first genome) and/or a second microorganism (or corresponding second genome) is present in a sample.
  • the sample is a blood sample.
  • Methods of the disclosure can comprise probing the sample with a virtual probe for the first organism and the second organism to determine the presence or absence of one or more target nucleic acids corresponding to the first genome or second genome.
  • the target nucleic acids can be, for example, genome fragments or amplicons produced in a DNA amplification reaction such as PCR.
  • the virtual probe comprises two or more probe molecules, each of which is capable of specifically hybridizing to one or more of the target nucleic acids corresponding to the first genome and/or one or more homologous target nucleic acids corresponding to the second genome.
  • the virtual probe can distinguish between the target nucleic acids corresponding to the first genome and the target nucleic acids corresponding to the second genome.
  • the virtual probe preferably includes a meter probe.
  • An exemplary method comprises the steps of:
  • PCR polymerase chain reaction
  • each amplicon set contains only a single amplicon, and when a plurality of PCR primer pairs is used, an amplicon set can contain two or more amplicons (e.g., a plurality of single amplicons).
  • any resulting PCR amplification products are probed with a virtual probe to determine the presence or absence of the first amplicon set and second amplicon set.
  • the virtual probe comprises two or more probe molecules (e.g., two or more oligonucleotide probe molecules) capable of specifically hybridizing to the first amplicon set and second amplicon set in a distinct manner, the virtual probe can distinguish between the first amplicon set and the second amplicon set.
  • the probe molecules within each virtual probe can be distinguished by virtue of having different labels (e.g., fluorescent labels, for example molecular beacons labeled with different fluorescent labels) or being positioned at discrete locations on an array.
  • hybridization of the PCR amplification products of the PCR reaction to the virtual probe can distinguish between the first and second genomes, thereby identifying the presence of the first and/or second organisms in the sample.
  • the reference to the presence of an organism in the sample does not mean that the sample had a live organism, merely that sufficient genomic DNA from the organism was present in the sample to be detected or to serve as a template for an amplification reaction such as a PCR reaction.
  • the reference to the presence of a genome in a sample does not mean that the sample had an intact genome, merely that sufficient DNA from the genome was present in the sample to be detected or to serve as a template for an amplification reaction such as a PCR reaction.
  • the first amplicon set and the second amplicon set can each comprise one amplicon (referred to as a “first amplicon” and a “second amplicon,” respectively), for example when a single set of primers is used in a PCR amplification reaction.
  • the first amplicon set and/or the second amplicon set can comprise more than one amplicon (each amplicon in the first amplicon set referred to as a “first amplicon” and each amplicon in the second amplicon set referred to as a “second amplicon”), for example when more than one set of primers is used in a PCR amplification reaction.
  • Further exemplary methods for distinguishing homologous amplicons from homologous genomic sequences are described in Section 6.2 and numbered embodiments 1 to 98, 142 to 147, and 151 , infra.
  • the disclosure further provides arrays for distinguishing homologous genomic sequences, systems for distinguishing homologous genomic sequences, and oligonucleotide probe molecules which are useful, for example, in the methods, arrays, and systems of the disclosure.
  • the present disclosure provides addressable arrays for distinguishing a first genomic sequence from a first genome and a second, homologous genomic sequence from a second genome.
  • the addressable arrays of the disclosure can be used, for example, in the methods described herein.
  • An addressable array of the disclosure can comprise a group of positionally addressable oligonucleotide probe molecules, each at a discrete location on the array, where each probe molecule in the group of oligonucleotide probe molecules comprises a nucleotide sequence that is 90% to 100% complementary to 15 to 40 consecutive nucleotides in the first genomic sequence or second genomic sequence.
  • the addressable array typically comprises one or more meter probes and may further optionally comprise one or more control probe molecules.
  • Exemplary addressable arrays of the disclosure are described in Section 6.3 and numbered embodiments 99 to 141 and 173 to 175, infra.
  • one or more steps of methods of the disclosure are computer- implemented. Exemplary computer implemented methods are described in Section 6.4 and numbered embodiments 97, 98, 146, and 147, infra.
  • the present disclosure provides systems for distinguishing between a first genomic sequence and a second, homologous genomic sequence if present in a sample.
  • An exemplary system can comprise: (a) an optical reader for generating signal data for each probe molecule location of an array of the disclosure; and
  • (i) is configured to receive signal data from the optical reader
  • (ii) is configured to analyze the signal data for one or more virtual probes (e.g., a virtual probe having features as described herein); and
  • (iii) has an interface to a storage or display device or network for outputting a result of the analysis.
  • the present disclosure provides exemplary oligonucleotide probe molecules suitable for use in a virtual probe and kits comprising two or more of such oligonucleotide probe molecules.
  • the oligonucleotide probe molecules of the disclosure can be included on an addressable array of the disclosure and/or used in the methods of the disclosure.
  • Exemplary oligonucleotide probe molecules and virtual probes are described in Section 6.2.4 and numbered embodiments 156 to 172, infra.
  • Exemplary kits are described in Section 6.6 and numbered embodiments 176 to 187, infra.
  • FIG. 1A-1B is a dendrogram (split between FIG. 1A and FIG. 1 B) prepared from the 16S rRNA genes of several Staphylococcus species obtained from GenBank.
  • the tree was constructed using the CLC sequence viewer software with bootstrap analysis to show the confidence in the data.
  • Each species of the Staphylococcus group is represented with two sequences which are shown by Gen Bank accession numbers.
  • the bootstrap values at each node of the tree define the percentage of confidence in the data across the replicates. The higher the bootstrap value the more supportive is the data for respective taxa.
  • the horizontal lines for a respective species are called branches and represent the amount of genetic change in that species.
  • the unit of branch length is shown as the number of changes or substitutions divided by the length of the sequence.
  • FIG. 2A-2B represents a dendrogram (split between FIG. 2A and FIG. 2B) of species from the Streptococcus viridians group created from multiple alignment of 16s rRNA and 16s-23s rRNA genome sequences.
  • I Streptococcus bovis group
  • II Streptococcus angi nosus group
  • III Streptococcus salivarius group
  • IV Streptococcus mitis group
  • V Streptococcus mutans group.
  • the numbers represent bootstrap values which indicate the percentage of confidence in the dataset.
  • the species are addressed as three groups - I, II and III.
  • FIG. 3 is a diagram of 16S rRNA and 16S-23S rRNA ITS genomic sequences, and 16S forward (16S Fw), 16S reverse (16S Rv), ITS forward (ITS Fw), and ITS reverse (ITS Rv) primers.
  • FIG. 4 illustrates a primer pair useful for the asymmetric PCR methods described in Section 6.2.3.4, comprising an Unextended Primer (which can be a traditional primer used in symmetric PCR processes) and an Extended Primer composed of an “A” region, which is complementary to the target nucleic acid, a “B” region which includes a Direct Repeat or Inverted Repeat of at least a portion of the “A” region, and an optional “C” region, which can include a spacer sequence and/or part of all of a restriction endonuclease recognition site.
  • an Unextended Primer which can be a traditional primer used in symmetric PCR processes
  • an Extended Primer composed of an “A” region, which is complementary to the target nucleic acid
  • B which includes a Direct Repeat or Inverted Repeat of at least a portion of the “A” region
  • an optional “C” region which can include a spacer sequence and/or part of all of a restriction endonuclease recognition site.
  • FIG. 5A-5C illustrates intermolecular hybridization of Extended Primers that occurs when the “B” region contains an Inverted Repeat of at least a portion of the “A” region.
  • Fig. 5B illustrates intermolecular hybridization of Extended Primers that occurs when the “B” region contains a Direct Repeat of at least a portion of the “A” region.
  • Fig. 5C illustrates intramolecular hybridization of Extended Primers that occurs when the “B” region contains an Inverted Repeat of at least a portion of the “A” region.
  • the region of complementarity between the “A” region and the “B” region is at or near the 5' end of the “A” region.
  • FIG. 6 illustrates the denaturation step in an asymmetric PCR reaction described in Section 6.2.3.4.
  • a PCR reaction mixture typically containing a biological sample containing or at risk of containing target nucleic acid, an Asymmetric Primer Pair, a thermostable DNA polymerase, and PCR Reagents
  • a PCR reaction mixture typically containing a biological sample containing or at risk of containing target nucleic acid, an Asymmetric Primer Pair, a thermostable DNA polymerase, and PCR Reagents
  • a PCR reaction mixture typically containing a biological sample containing or at risk of containing target nucleic acid, an Asymmetric Primer Pair, a thermostable DNA polymerase, and PCR Reagents
  • FIG. 7 illustrates the annealing step of the exponential phase of the asymmetric PCR reactions described in Section 6.2.3.4, which occurs below the melting temperature of the Unextended Primer. Both the Unextended Primer and Extended Primer in the Asymmetric Primer Pair hybridize to their respective complementary strands.
  • Fig. 7 shows annealing (also referred to as hybridization or binding) to target DNA, as occurs in the initial cycles of PCR, but in subsequent cycles annealing is likely to occur between primers and complementary sequences in PCR products. Because of the “B” and optional “C” regions in the Extended Primer, the PCR products will have those sequences or their complements, as depicted in Fig. 8B and Fig. 9.
  • FIGS. 8A-8B Fig. 8A and Fig. 8B illustrate the extension step of the exponential phase of the asymmetric PCR reactions described in Section 6.2.3.4, during which the thermostable DNA polymerase extends the primer DNA using the complementary DNA as template.
  • the region of extension is depicted in dashed lines.
  • the template in Fig. 8A is a strand of target DNA
  • in Fig. 8B is a strand of PCR product produced using the Asymmetric Primer Pair and the target DNA.
  • FIG. 9 illustrates the simultaneous annealing and extension step of the linear phase of the asymmetric PCR reactions described in Section 6.2.3.4, which occurs above the melting temperature of the Unextended Primer and below the melting temperature of the Extended Primer, using the PCR Product Strand 2 as template. This results in asymmetric amplification of PCR Product Strand 2, resulting in an excess of PCR Product Strand 2 molecules relative to PCR Product Strand 1 molecules by the end of the PCR reaction.
  • FIG. 10A-B illustrate how two (FIG. 10A) or three (FIG. 10B) probe molecules can be used in a virtual probe for coagulase negative Staphylococcus (CNS).
  • the signals from the hybridization of a PCR amplification product to the two or three probe molecules can be combined using Boolean operators to determine if a CNS is present in a sample.
  • FIG. 11A-B show signals for various oligonucleotide probe molecules when bound to 16S rRNA amplicons from Streptococcus mitis (Fig. 11 A) and Streptococcus pneumoniae (Fig. 11 B).
  • FIG. 12 shows signal intensities for various oligonucleotide probe molecules when bound to PCR amplicons from Streptococcus pneumoniae, Streptococcus mitis and Streptococcus oralis.
  • FIG. 13 shows signal intensities for various oligonucleotide probe molecules when bound to PCR amplicons from Salmonella enterica and Escherichia coll.
  • FIG. 14 shows signal intensities for various oligonucleotide probe molecules when bound to PCR amplicons from Klebsiella pneumoniae and Klebsiella oxytoca.
  • FIG. 15 shows signal intensities for various oligonucleotide probe molecules when bound to PCR amplicons from Enterobacter cloacae, Enterobacter asburiae, and Enterobacter hormaechei.
  • FIG. 16A-16B show AIIStaph-146abp/Sau-71 p signal ratios (Y-axis) for increasing concentrations (CFU/ml) of a coagulase negative Staphylococcus (S. hominis) and coagulase positive S. aureus.
  • FIG. 16A shows ratios plotted on a linear Y-axis
  • FIG. 16B shows ratios plotted on a logarithmic Y-axis.
  • FIG. 17 shows AIIStaph-146abp signal ratios (X-axis) and AIIStaph-146abp/Sau-71 p signal ratios (Y-axis) for samples containing coagulase positive S. aureus.
  • Amplicon An amplicon is a nucleic acid molecule produced by a PCR amplification reaction.
  • Asymmetric Primer Pair A Primer Pair consisting of an Extended Primer and an Unextended Primer.
  • corresponding refers to the region of sequence overlap or complementarity present in both strands, as the context dictates.
  • Direct Repeat In the context of the “B” region of an Extended Primer, “Direct Repeat” means a nucleotide sequence that is the direct complement to a portion of the “A” region (/'.e., has the complementary sequence in the same 5' to 3' order).
  • Extended Primer A PCR primer that contains (a) an “A” region at its 3' end that has at least 75% sequence identity to a corresponding region Target Strand 1 or at least 75% sequence complementarity to a corresponding region in Target Strand 2; (b) a “B” region at its 5' end that comprises a sequence that is complementary to at least a portion of the “A” region; and (c) an optional “C” region positioned between the “A” and “B” regions.
  • Homologous genomic sequences are genomic sequences found in different species or strains which have shared ancestry but which are not identical in nucleotide sequence. Exemplary homologous genomic sequences include 16S rRNA genes, 23S rRNA genes, and 16S - 23S internal transcribed spacer region (ITS) sequences.
  • Inverted Repeat In the context of the “B” region of an Extended Primer, “Inverted Repeat” means a nucleotide sequence that is the reverse complement to a portion of the “A’ region (/.e., has the complementary sequence in the opposite 5' to 3' order).
  • Melting temperature (T m ) the temperature at which a one half of a DNA duplex will dissociate to become single stranded.
  • the T m ’s of linear primers comprised of deoxyribonucleotides (DNA) have been commonly calculated by the “percent GC” method (PCR PROTOCOLS, a Guide to Methods and Applications, Innis et al. eds., Academic Press (San Diego, Calif.
  • the T m of a DNA is calculated according to the “Nearest Neighbor” method, and non-naturally occurring bases (e.g., 2-deoxyinosine) are treated as adenines.
  • PCR Product Strand 1 refers to the strand in a doublestranded PCR product produced from target nucleic acid and an Asymmetric Primer Pair which is complementary to the Unextended Primer of the Asymmetric Primer Pair.
  • PCR Product Strand 1 refers to the strand in a doublestranded PCR product produced from target nucleic acid and an Asymmetric Primer Pair which is complementary to the Extended Primer of the Asymmetric Primer Pair.
  • PCR Reagents refers to components of a PCR reaction other than template nucleic acid, thermostable polymerase and primers.
  • PCR Reagents typically include dNTPs (and may include labeled, e.g., fluorescently labeled, dNTPs in addition to unlabeled dNTPs), buffers, and salts containing divalent cations (e.g., MgCI 2 ).
  • Primer A DNA oligonucleotide of at least 12 nucleotides that has a free hydroxyl group at its 3' terminus.
  • Primers can include naturally and non-naturally occurring nucleotides (e.g., nucleotides containing universal bases such as 3-nitropyrrole, 5-nitroindole or 2-deoxyinosine, 2-deoxyinosine being preferred).
  • the term “primer” also refers to a mixture of primer molecules that is created when mixed bases are included in the primer design and construction to allow them to hybridize to variant sequences in the target nucleic acid molecules.
  • the target sequence variants can be inter- or intra-species variants. Standard nomenclature for mixed bases is shown in Table 1 :
  • each primer contains no more than three mixed bases in the region of complementarity to a target nucleic acid.
  • a primer contains zero, one, two or three mixed bases in the region of complementarity to a target nucleic acid.
  • Primer Pair A forward and reverse primer pair (each of which can be a mixture of primers with sequence variations to account for possible variations in the target sequence) that is capable of hybridizing with and initiating a DNA polymerization reaction from different strands of the same nucleic acid molecule within a region of less than 5,000 base pairs.
  • the primer pair is capable of hybridizing with an initiating a DNA polymerization reaction from different strands of the same nucleic acid molecule within a region of less than 2,500 base pairs or less than 1 ,500 base pairs.
  • sample refers to any sample containing or suspected of containing a nucleic acid of interest, for example a genome, a genome fragment, an amplicon corresponding to a region of a genome, or another target nucleic acid.
  • a sample can be subjected to one or more processes and still be considered a “sample.” For example, a sample that is subjected to a PCR amplification reaction remains a “sample” after the PCR amplification reaction.
  • Single amplicon refers to a nucleic acid molecule or group of nucleic acid molecules produced by a PCR amplification reaction from a single organism with a single primer pair.
  • a “single amplicon” refers to a PCR product with a unique sequence, but can also refer to a PCR product with a group, e.g., a pair, of unique sequences, for example when the organism is heterozygous for the sequence being amplified.
  • a probe molecule can, for example, be capable of hybridizing to an amplicon comprising a first genomic sequence and an amplicon comprising a second, homologous genomic sequence that differs by one or more nucleotides from the first genomic sequence.
  • Target Strand 1 refers to the strand in a double-stranded target nucleic acid to which an Unextended Primer in an Asymmetric Primer Pair is complementary.
  • Target Strand 2 refers to the strand in a double-stranded target nucleic acid to which the “A” region in an Extended Primer in an Asymmetric Primer Pair is complementary.
  • Unextended Primer A PCR primer that consists essentially of a nucleotide sequence having at least 75% sequence identity to a corresponding region in Target Strand 2 or at least 75% sequence complementarity to a corresponding region in Target Strand 1 .
  • the term “consisting essentially of’ in reference to the Unextended Primer means that the nucleotide sequence may contain no more than 3 additional nucleotides 5' to the region of (at least 75%) complementarity to the target sequence.
  • the present disclosure provides methods of distinguishing between a first genomic sequence from a first organism and a second, homologous genomic sequence from a second organism.
  • the methods allow the identification of an organism present in a sample using virtual probes.
  • a virtual probe for a genomic sequence generally comprises two or more probe molecules that can be distinguished, e.g., by virtue of their discrete locations on an addressable array, or by differential labeling, e.g., with different fluorescent moieties.
  • the readout from an individual probe molecule within a virtual probe is sometimes referred to herein as a “signal.”
  • a probe molecule need not be labeled to generate a “signal”.
  • the absence of hybridization to a fluorescently labeled amplicon can constitute a “signal”.
  • Each virtual probe for a genomic sequence contains at least one probe molecule (of the plurality of probe molecules that make up the virtual probe) that is capable of specifically hybridizing to a target nucleic acid (e.g., an amplicon) corresponding to the genomic sequence.
  • a target nucleic acid e.g., an amplicon
  • two or more probe molecules in the virtual probe are capable of hybridizing to a target nucleic acid (e.g., an amplicon) corresponding to the genomic sequence.
  • the hybridization patterns of the probe molecules in a virtual probe to different target nucleic acids (e.g., amplicons) from related genomic sequences are sufficiently different so as to distinguish target nucleic acids from the related genomic sequences, for example distinguish between a first amplicon set from a first genome and a second amplicon set from a second genome with a homologous genomic sequence.
  • the methods can be used, for example, to determine the presence of specific species or strains of bacteria in a sample from which probed amplicons were amplified, directly (e.g., where the sample is directly utilized in PCR) or indirectly (e.g., through an intermediate purification or enrichment step, such as a bead beating method described in Section 6.2.1).
  • Various embodiments disclosed herein describe probing the product of a DNA amplification reaction such as a PCR reaction with a virtual probe; however, it should be understood that probing can alternatively be performed using a method capable of detecting non-amplified genomic DNA.
  • Exemplary methods for detecting non-amplified genomic DNA are described in Detection of Non-Amplified Genomic DNA, 2012, Spoto and Corradini (eds) doi.org/10.1007/978-94-007- 1226-3, the contents of which are incorporated by reference in their entirety.
  • Such methods include optical detection methods (see, e.g., Li and Fan, 2012, “Optical Detection of Nonamplified Genomic DNA,” pp.
  • probing of a sample is performed in the absence of a DNA amplification step (e.g., where the sample contains or is suspected of containing a target nucleic acid which is a genome fragment).
  • Methods of determining the presence of a first genome from a first organism or a second genome from a second organism, if either is present in a sample can comprise a step of performing a PCR amplification reaction (e.g., as described in Section 6.2.3) on the sample using PCR primers capable of hybridizing to, and initiating a PCR amplification from, the first genome and the second genome.
  • Amplification from the first genome, if present in the sample results in a first amplicon set.
  • Amplification from the second genome, if present in the sample results in a second amplicon set.
  • the PCR amplification products can be probed with a virtual probe to determine the presence or absence of the first amplicon set and second amplicon set.
  • the probing can be performed during the PCR amplification reaction (e.g., when using real-time PCR, for example as described in Section 6.2.3.5) or after the PCR amplification reaction ⁇ e.g., by using an array comprising oligonucleotide probe molecules, for example as described in Section 6.3).
  • the probing is performed after the PCR reaction, for example on an array, it is useful to include fluorescently labeled nucleotides in the PCR mixture to label the resulting PCR amplicons.
  • the location(s) of the fluorescent label on the addressable array and in some instances its intensity can constitute the signals for the probe molecules that make up the virtual probe.
  • first amplicon set is determined to be present, it can be concluded that the sample contains the first genome.
  • second amplicon set is determined to be present, it can be concluded that the sample contains the second genome.
  • Virtual probes can be used to distinguish between a first amplicon set and a second amplicon set prepared from related microorganisms, for example coagulase negative and coagulase positive Staphylococcus species ⁇ e.g., as described in Section 6.2.5.1), S. gordonii and S. anginosus ⁇ e.g., as described in Section 6.2.5.2), or S. mitis and S. pneumoniae ⁇ e.g., as described in 6.2.5.3).
  • related microorganisms for example coagulase negative and coagulase positive Staphylococcus species ⁇ e.g., as described in Section 6.2.5.1), S. gordonii and S. anginosus ⁇ e.g., as described in Section 6.2.5.2),
  • Samples can be, for example, biological samples, environmental samples, or food products.
  • the samples are infected with, or at risk of infection with one or more microorganisms. Exemplary samples are described in Section 6.2.1.
  • a virtual probe capable of distinguishing between target nucleic acids ⁇ e.g., amplicons) corresponding to genomic sequences encoding rRNA ⁇ e.g., 16S rRNA or 23S rRNA), or intergenic spacer regions between rRNA genes ⁇ e.g. a 16S rRNA - 23S rRNA intergenic spacer region) can be used.
  • target nucleic acids e.g., amplicons
  • rRNA e.g., 16S rRNA or 23S rRNA
  • intergenic spacer regions between rRNA genes ⁇ e.g. a 16S rRNA - 23S rRNA intergenic spacer region
  • Amplicons for probing with virtual probes can be produced by performing a PCR amplification reaction on a sample containing or suspected or at risk of containing a first organism and/or second organism using PCR primers capable of hybridizing to, and initiating a PCR amplification from, the genome of the first organism and the genome of the second organism.
  • the PCR amplification reaction can be performed with a single set of primers (which should produce a first amplicon and second amplicon, respectively, when the first and second organisms are present in the sample).
  • the PCR amplification reaction can be performed with more than one set of primers to produce multiple amplicons corresponding to the first genome and multiple amplicons corresponding to the second genome, when the first and second organisms, respectively, are present in the sample.
  • Exemplary PCR amplification reactions that can be used in the methods of the disclosure are described in Section 6.2.3.
  • Nucleic acid amplification techniques other than PCR e.g., isothermal amplification techniques
  • LAMP loop mediated isothermal amplification
  • NASBA nucleic acid sequence based amplification
  • SDA strand displacement amplification
  • RCA rolling circle amplification
  • the probing of PCR amplification products comprises the steps of contacting the PCR amplification products with an array, e.g., as described in Section 6.3, washing unbound nucleic acid molecules from the array, and measuring the signal intensity of a label (e.g., a fluorescent label) at each probe molecule location on the array.
  • the array comprises one or more meter probes.
  • the probing of the PCR amplification products comprises measuring signals from oligonucleotide probe molecules used in a real-time PCR reaction.
  • Kits that can be used in the methods of the disclosure are described in Section 6.6.
  • the sample used in the methods of the disclosure can be any type or form of sample that contains genomic DNA that is in a condition, or can be prepared to be in a condition, suitable for PCR amplification.
  • the sample is at risk of infection with one or more microorganisms, for example, one or more species of microorganisms.
  • the sample is suspected of having an infection with one or more microorganisms, for example, one or more species of microorganisms.
  • the sample can be, for example, a biological sample, an environmental sample, or a food product.
  • samples include various fluid samples.
  • the sample can be a bodily fluid sample from a subject.
  • the sample can include tissue collected from a subject.
  • the sample can include a bodily fluid, secretion, and/or tissue of a subject.
  • the sample can be a biological sample.
  • the biological sample can be a bodily fluid, a secretion, and/or a tissue sample.
  • biological samples include but are not limited to, blood, serum, saliva, urine, gastric and digestive fluid, tears, stool, semen, vaginal fluid, interstitial fluids derived from tumorous tissue, ocular fluids, sweat, mucus, earwax, oil, glandular secretions, breath, spinal fluid, hair, fingernails, skin cells, plasma, nasal swab or nasopharyngeal wash, spinal fluid, cerebrospinal fluid, tissue, throat swab, wound swab, biopsy, placental fluid, amniotic fluid, cord blood, emphatic fluids, cavity fluids, sputum, pus or other wound exudate, infected tissue sampled by wound debridement or excision, cerebrospinal fluid, lavage, leucopoiesis specimens, peritoneal dialysis fluid, milk and/or other excretions.
  • a subject can provide a sample, and/or a sample can be collected from a subject.
  • the subject can be a human or a non-human animal.
  • the sample can be collected from a living or dead subject.
  • the animal can be a mammal, such as a farm animal (e.g., cow, pig, sheep), a sport animal (e.g., horse), or a pet (e.g., dog or cat).
  • the subject can be a patient, clinical subject, or pre-clinical subject.
  • a subject can be undergoing diagnosis, treatment, and/or disease management or lifestyle or preventative care. The subject may or may not be under the care of a health care professional.
  • the sample can be an environmental sample.
  • environmental samples include air samples, water samples (e.g., groundwater, surface water, or wastewater), soil samples, and plant samples.
  • Additional samples include food products, beverages, manufacturing materials, textiles, chemicals, and therapies.
  • the sample is a sample containing or suspected of containing a pathogen such as, for example, one or more of Mycobacterium tuberculosis, Mycobacterium avium subsp paratuberculosis, Staphylococcus aureus (including methicillin sensitive and methicillin resistant Staphylococcus aureus (MRSA)), Staphylococcus epidermidis, Staphylococcus lugdunensis, Staphylococcus maltophilia, Streptococcus pyogenes, Streptococcus pneumoniae, Streptococcus agalactiae, Haemophilus influenzae, Haemophilus parainfuluezae, Moraxella catarrhalis, Klebsiella pneumoniae, Klebsiella oxytoca, Escherichia coll, Pseudomonas aeruginosa, Acinetobacter sp., Bordetella pathogen such as, for example,
  • a sample can be pre-processed prior to performing PCR amplification.
  • the sample subjected to PCR amplification in the methods of the disclosure can be a sample which is, for example, processed, extracted, or fractionated from any of the types of samples described in this Section or elsewhere in the disclosure (e.g., a sample processed, extracted or fractionated from urine, sputum, a wound swab, blood, or peritoneal dialysis fluid).
  • pre-processing steps examples include filtration, distillation, extraction, concentration, centrifugation, inactivation of interfering components, addition of reagents, and the like, as discussed herein or otherwise as is known in the art.
  • a “filter,” as used herein, is a membrane or device that allows differential passage of particles and molecules based on size. Typically this is accomplished by having pores in the filter of a particular nominal size. For instance, filters of particular interest for bacterial detection applications have pores sufficiently large to allow passage of bacteria but small enough to prevent passage of eukaryotic cells that present in a sample of interest.
  • bacterial cells range from 0.2 to 2 pm (micrometers or microns) in diameter
  • most fungal cells range from 1 to 10 pm in diameter
  • platelets are approximately 3 pm diameter
  • most nucleated mammalian cells are typically 10 to 200 pm in diameter. Therefore, filter pore sizes of less than 2 pm or less than 1 pm are particularly suitable for removing non-bacterial cells from a biological sample if detection of bacteria is intended.
  • a biological sample can be subject to centrifugation to remove cells and debris from a sample. Centrifugation parameters that precipitate eukaryotic but not bacterial cells are known in the art. The supernatant can then be filtered if desired.
  • Samples can be prepared for PCR amplification using any of the various processes for preparing samples comprising genomic DNA for PCR which are known in the art (e.g., following one or more of the pre-processing steps described above).
  • a commercially available DNA extraction reagent, kit, and/or instrument can be used, e.g., a QIAamp DNA Mini Kit (Qiagen), a MagMAXTM DNA Multi-Sample Kit (ThermoFisher Scientific), a Maxwell® RSC Instrument (Promega), etc.
  • a sample is prepared for PCR by a process comprising beadbeating, for example as described in U.S. Patent No. 10,036,054, the contents of which are incorporated herein by reference in their entireties.
  • Blood can be directly subjected to bead beating after being collected in a commercially available blood collection tube, for example by adding bead beating beads to the collection tube and subjecting the collection tube to agitation.
  • commercially available collection tubes that can be used to collect blood samples include lavender-top tubes containing EDTA, light blue-top tubes containing sodium citrate, gray-top tubes containing potassium oxalate, or green-top tubes containing heparin.
  • the methods of the disclosure can be used to identify and/or distinguish first and second homologous genomic sequences (and target nucleic acids such as amplicons corresponding to the first and second homologous genomic sequences).
  • Homologous genomic sequences are genomic sequences found in species or strains which have shared ancestry but which are not identical in nucleotide sequence. Thus, for example, homologous genomic sequences are found in closely related species or strains of bacteria.
  • the first genomic sequence and the second genomic sequence are generally genomic sequences from a first microorganism and a second microorganism (e.g., bacteria, viruses, or fungi).
  • the first and/or second microorganisms can be, for example, a human pathogen and/or an animal pathogen.
  • the microorganisms can be from the same order, the same family, the same genus, the same group, or even the same species.
  • the first and second microorganism are bacteria.
  • NCBI National Center for Biotechnology Information
  • EMBL European Molecular Biology Laboratory
  • DDBJ DNA Databank of Japan
  • 16S rRNA sequences Due to the slow rate of sequence evolution observed among these genes, 16S rRNA sequences have been used in constructing phylogenic trees for a number of bacterial species.
  • An exemplary phylogenic tree prepared from the 16S rRNA genes of several Staphylococcus species obtained from GenBank is shown in Fig. 1.
  • the bacterial genome contains a second ribosomal rRNA gene, the 23S rRNA gene.
  • the 16S rRNA and the 23S rRNA genes are separated from each other by a spacer region known as the 16S - 23S internal transcribed spacer region (ITS) or the 16S - 23S intergenic spacer region.
  • the 16S - 23S rRNA ITS region comprises hypervariable regions comprising species and inter-species specific sequences that can be used for distinguishing and identifying particular bacterial species (K. Okamura, et al., 2012).
  • An exemplary phylogenic tree for the Streptococcus viridians group created from multiple alignment of 16s rRNA and 16s-23s rRNA genome sequences is shown in Fig. 2.
  • the first genomic sequence and the second genomic sequence each comprises a nucleotide sequence of a gene encoding rRNA. In other embodiments, the first genomic sequence and the second genomic sequence each comprise a nucleotide sequence of an intergenic spacer region between rRNA genes.
  • the first genomic sequence and the second genomic sequence can each comprise, for example, a nucleotide sequence of a 16S rRNA gene or a 23S rRNA gene.
  • the first genomic sequence and the second genomic sequence each comprises a nucleotide sequence of a 16S rRNA gene.
  • the first genomic sequence and the second genomic sequence each comprises a nucleotide sequence of a 23S rRNA gene.
  • the genomic sequence comprises a nucleotide sequence found in a 16S-23S intergenic spacer region.
  • the first genomic sequence and/or the second homologous genomic sequence are genomic sequences from pathogens, e.g., bacteria, viruses or fungi, that can be found in human blood, urine or peritoneal fluid.
  • pathogens include, but are not limited to, Mycobacterium tuberculosis, Mycobacterium avium subsp paratuberculosis, Staphylococcus aureus (including methicillin sensitive and methicillin resistant Staphylococcus aureus (MRSA)), Staphylococcus epidermidis, Staphylococcus lugdunensis, Staphylococcus maltophilia, Streptococcus pyogenes, Streptococcus pneumoniae, Streptococcus agalactiae, Haemophilus influenzae, Haemophilus parainfuluezae, Moraxella catarrhalis, Klebsiella pneumoniae, Klebsi
  • PCR amplification is performed on a sample using PCR primers capable of hybridizing to, and initiating a PCR amplification from, a genome that may be present in the sample.
  • the PCR amplification reaction can be a “symmetric” PCR reaction, that is, the reaction makes double-stranded copies of template DNA by utilizing a forward primer and a reverse primer designed to have “melting temperatures,” or “T m ’s” that equal or within a few °C of one another. Commonly used computer software programs for primer design warns users to avoid high T m difference, and have automatic T m matching features.
  • “Asymmetric” PCR reactions that can make single-stranded DNA amplicons can also be used.
  • Real-time PCR reactions can also be used.
  • a genomic sequence amplified by the reaction can be referred to as a “target” nucleic acid, a “template” nucleic acid, or the like.
  • PCR amplification reactions can use a single primer set or multiple primer sets (e.g., when the PCR amplification is a multiplex PCR). Multiplex PCR can be useful, for example, to produce an amplicon corresponding to a first genomic sequence (e.g., a 16S rRNA gene) and/or a different amplicon corresponding to a second genomic sequence (e.g., a 23S rRNA gene). As an alternative to multiplex PCR, amplicons produced by separate PCR amplification reactions performed with different primer sets can be pooled for subsequent analysis.
  • a first genomic sequence e.g., a 16S rRNA gene
  • amplicon corresponding to a second genomic sequence e.g., a 23S rRNA gene
  • a single primer set can be used to make amplicons corresponding to homologous genomic sequences found in the genome of multiple strains or species, e.g., 2, 3, 4, or more than 4 species, which can be, for example, members of the same genus.
  • PCR amplification conditions can be selected (whether symmetric or asymmetric, singleplex or multiplex), for example, such that the DNA amplification products are 100 to 1000 nucleotides in length. In some embodiments, the PCR reactions are selected so that the DNA amplification products are 300 to 800 nucleotides in length. In other embodiments, the PCR conditions are selected so that the DNA amplification products are 400 to 600 nucleotides in length.
  • PCR amplification reactions used in the methods of the disclosure incorporate a label that produces a measurable signal into any amplicons produced by the reaction.
  • the label can be, for example, a fluorescent label, an electrochemical label, or a chemiluminescent label.
  • Fluorescent labeling can be achieved by fluorescently labeled nucleotide incorporation during PCR and/or by the use of labeled primers for PCR.
  • Electrochemical labeling can be achieved by redox-active labeled nucleotide incorporation during PCR and/or by the use of redeox-active labeled primers for PCR (see, e.g., Hocek and Fojta, 2011 , Chem. Soc.
  • Chemiluminescent labeling can be achieved, for example, by the use of biotin labeled primers for PCR, binding a streptavidin-alkaline phosphatase conjugate, then incubating with a chemiluminescent 1 ,2-dioxetane substrate.
  • fluorescent moieties include FITC, EDANS, Texas red, 6-joe, TMR, Alexa 488, Alexa 532, BODIPY FL/C3, BODIPY R6G, BODIPY FL, Alexa 532, BODIPY FL/C6, BODIPY TMR, 5-FAM, BODIPY 493/503, BODIPY 564, BODIPY 581 , Cy3, Cy5, R110, TAMRA, Texas red, and x-Rhodamine.
  • Fluorescent moieties can be attached to dNTPs, particularly those containing cytosine as a base (cytidylic acid, cytidine 5'-phosphate, cytidine 5'-diphosphate, cytidine 5'- triphosphate, or a polymer thereof, or a polymer containing cytidylic acid).
  • the position of the dNTP labeling can be at the base (amino group), phosphate group (OH group), or deoxyribose moiety (2'- or 3'-OH group).
  • the preferred position is at the base.
  • fluorescently labeled dNTPs can be incorporated into both strands of a PCR amplicon at random sites, typically dC sites, and extended by DNA polymerase.
  • Fluorescent dNTPs are commercially available in highly concentrated form and can be added to the PCR reaction mixture without adjusting the concentration of each unlabeled dNTP.
  • the typical ratio of dNTP to fluorescent dNTPs is between 100:1 and 1000:1.
  • fluorescently labeled dNTPs can be included among the PCR Reagents at 0.1% to 1% the (molar) quantity of the unlabeled dNTPs.
  • Detection of fluorescently labeled PCR products can be achieved through hybridization to probe molecules, for example probe molecules bound to a microarray.
  • a suitable microarray system takes advantage of three-dimensional crosslinked polymer networks, as described in U.S. Patent No. 9,738,926, the contents of which are incorporated by reference herein in their entireties
  • the primers utilized in a PCR reaction are designed to recognize and hybridize to the sequence of a given nucleic acid template(s), e.g., a target genomic sequence(s).
  • Mismatches in the sequence of a primer and a target nucleic acid template may result in reduced efficiency of the PCR reaction and/or amplification of a sequence other than the desired sequence.
  • Parameters for successful primer design are well known in the art (see for example, Dieffenbach, et al., 1993) and include primer length, melting temperature, GC content, and the like.
  • PCR primers do not need to share 100% sequence identity with a given target nucleic acid template, and PCR primers having at least 75%, e.g., 80%, e.g., 85%, e.g., 90%, e.g., 95%, e.g., 96%, e.g., 97%, e.g., 98%, e.g., 99%, or 99.5% identity with a target sequence may function to hybridize to and allow amplification of a target sequence.
  • the present disclosure provides for additional parameters suitable for preparing a unique primer system with high specificity and good amplification efficiency.
  • Primers are typically 18 to 24 bases in length, but can be longer, e.g., 25 to 50 bases in length, e.g., 25 to 45 bases in length, e.g., 30 to 45 bases in length, e.g., 35 to 45 bases in length, e.g., 40 to 45 bases in length, or e.g., 40 to 50 bases in length.
  • Primers used in PCR amplification are often designed in pairs, with one primer referred to as the “forward” primer and one primer referred to as the “reverse” primer.
  • Forward primers of the present disclosure can be designed with G and/or C residues at the 3’ end so as to provide a “GC-clamp”.
  • G and C nucleotide pairs exhibit stronger hydrogen bonding than A-T nucleotide pairs; as such, a GC- clamp at the 3’ end of a primer may aid in increasing sequence specificity, increasing the likelihood of hybridization, and increasing the overall efficiency of the PCR reaction.
  • a set of primers can be designed to amplify two genomic regions, for example, a set of primers can include one primer pair specific to the 16S rRNA gene and a second primer pair specific to the 16S-23S rRNA ITS region (see Fig. 3). Such primer sets can be used, for example, to produce multiple amplicons in a single PCR reaction.
  • PCR primer pairs can be designed to amplify a sequence conserved across a number of species, for example, to amplify the 16S rRNA genes of multiple bacterial species. Thus, it can be possible to produce amplicons corresponding to homologous genomic sequences using a single PCR primer pair, which is advantageous when performing PCR on a sample containing or suspected of containing one of a number of possible organisms.
  • Parameters for primers designed to amplify a conserved sequence can include identifying a conserved region across the various species, optionally verifying as correct any sequence differences in the conserved region (e.g., if there is uncertainty whether a published sequence is correct), and selecting a sequence that is at least 75%, e.g., 80%, e.g., 85%, e.g., 90%, e.g., 95%, e.g., 96%, e.g., 97%, e.g., 98%, e.g., 99%, or even 100% conserved across the sequences.
  • Primers exhibiting less than 100% sequence identity may simply contain one or more single nucleotide bases that are different from a given template, that is, all primers in the preparation contain the same sequence to each other.
  • primers can be prepared to contain alternate nucleotide residues at a particular location in the sequence.
  • a reverse primer for the amplification of the 16S region of several species can comprise a pool of oligonucleotides, a percentage of which, e.g., 50%, contain a first nucleotide at a position in the primer and a percentage of which, e.g., 50%, contain a second nucleotide at the position.
  • the primers utilized in the methods of the disclosure are labeled with a detectable label (e.g., a fluorescent label).
  • a detectable label e.g., a fluorescent label
  • at least one primer is 5’ fluorescently labeled.
  • more than one primer is 5’ fluorescently labeled.
  • Fluorescent labels suitable for labeling primers are known in the art, and include Cy5, FAM, JOE, ROX and TAMRA.
  • a typical three-step PCR protocol that can be used in the methods of the disclosure (see PCR PROTOCOLS, a Guide to Methods and Applications, Innis et al. eds., Academic Press (San Diego, Calif. (USA) 1990, Chapter 1) may include denaturation, or strand melting, at 93-95 °C for more than 5 sec, primer annealing at 55-65 °C for 10-60 sec, and primer extension for 15-120 sec at a temperature at which the polymerase is highly active, for example, 72°C for Taq DNA polymerase.
  • a typical two-step PCR protocol may differ by having the same temperature for primer annealing as for primer extension, for example, 60°C or 72°C.
  • amplification involves cycling the reaction mixture through the foregoing series of steps numerous times, typically 25-40 times.
  • the times and temperatures of individual steps in the reaction may remain unchanged from cycle to cycle, or they may be changed at one or more points in the course of the reaction to promote efficiency or enhance selectivity.
  • a PCR reaction mixture typically contains each of the four deoxyribonucleotide 5' triphosphates (dNTPs), typically at equimolar concentrations, a thermostable polymerase, a divalent cation (typically Mg 2+ ), and a buffering agent.
  • dNTPs deoxyribonucleotide 5' triphosphates
  • a thermostable polymerase typically at equimolar concentrations
  • a divalent cation typically Mg 2+
  • a buffering agent typically 20-100 pl.
  • Multiple target sequences can be amplified in the same reaction. The number of cycles for a particular PCR amplification depends on several factors including: a) the amount of the starting material, b) the efficiency of the reaction, and c) the method and sensitivity of detection or subsequent analysis of the product.
  • asymmetric PCR methods are described in Gyllensten and Erlich, 1988, Proc. Natl. Acad. Sci. (USA) 85: 7652-7656 (1988); and Gyllensten and Erlich, 1991 , U.S. Pat. No. 5,066,584.
  • Traditional asymmetric PCR differs from symmetric PCR in that one of the primers is added in limiting amount, typically 1/100 th to 1/5 th of the concentration of the other primer.
  • Double-stranded amplicon accumulates during the early temperature cycles, as in symmetric PCR, but one primer is depleted, typically after 15-25 PCR cycles, depending on the number of starting templates. Linear amplification of one strand takes place during subsequent cycles utilizing the undepleted primer.
  • Primers used in asymmetric PCR reactions reported in the literature are often the same primers known for use in symmetric PCR.
  • Poddar (Poddar, 2000, Mol. Cell Probes 14: 25-32) compared symmetric and asymmetric PCR for amplifying an adenovirus substrate by an end-point assay that included 40 thermal cycles. He reported that a primers ratio of 50:1 was optimal and that asymmetric PCR assays had better sensitivity that, however, dropped significantly for dilute substrate solutions that presumably contained lower numbers of target molecules. 6.2.3.4. Improved Asymmetric PCR Amplification
  • Improved asymmetric PCR methods are described in U.S. patent no. 10,513,730, the contents of which are incorporated herein by reference in their entireties.
  • the improved asymmetric PCR methods include both an exponential phase and a linear phase. During the exponential phase, both strands of the target nucleic acid are amplified. During the linear phase, only one of the strands is amplified, resulting in an excess of a single strand of target nucleic acid.
  • the improved asymmetric PCR methods achieve the excess of a single strand though the use of primer pairs of different lengths and melting temperatures, with the longer primer referred to as the “Extended Primer” and the shorter primer referred to as the “Unextended Primer”.
  • the Extended Primer has a higher melting temperature than the Unextended Primer and can be used to selectively amplify a single strand of the target nucleic acid using PCR cycles in which the annealing step is performed at a temperature greater than the melting temperature of the Unextended Primer but lower than the melting temperature of the Extended Primer.
  • the selective amplification gives rise to a PCR product mixture that is enriched in the target strand which can be probed in a subsequent detection assay.
  • the Extended Primers contain in addition to the sequence complementary to the target nucleic acid a 5' extension containing a sequence that is complementary to the targetbinding portion of the same primer. Without being bound by theory, it is believed that the use of the 5' extension allows intra- or inter-molecular hybridization of Extended Primer molecules and prevents arbitrary or non-specific binding of these longer primers to DNA molecules present in the PCR reaction at the beginning of the PCR reaction. This in turns prevents non-specific DNA amplification and prevents “noise” in the PCR product, which can be problematic when amplifying a target that is present in low quantities in a biological sample.
  • the initial PCR reaction mixture includes
  • the initial concentration of the Extended Primer and the Unextended Primer in the PCR reaction can each range from 200 nM to 8 pM.
  • the Extended Primer and Unextended Primer can be included in equimolar quantities in the initial PCR reaction, e.g., at concentrations ranging between about 200 nM and 1 pM each, for instance at concentrations of 500 nM each.
  • the Extended Primer and Unextended Primer can be included in non-equimolar quantities in the initial PCR reaction.
  • the initial concentration of the Extended Primer is preferably in an excess of the concentration of Unextended Primer, for example about a 2-fold to 30-fold molar excess. Accordingly, in certain aspects, the concentration the Extended Primer ranges between about 1 pM and 8 pM and the concentration of the Unextended Primer ranges between about 50 nM and 200 nM.
  • the Asymmetric Primer Pair can be designed to amplify nucleic acid from any source, and for diagnostic applications the Asymmetric Primer Pair can be design to amplify DNA from pathogens such as those identified in Section 6.2.1 .
  • the Asymmetric Primer Pair can be designed so as to be capable of amplifying homologous nucleic acid sequences present in many species simultaneously, for example the highly conserved 16S ribosomal sequence in bacteria.
  • thermostable polymerases that can be used in the asymmetric PCR reactions of the disclosure includes, but are not limited to, Vent
  • thermoautotrophicum Pab (Pyrococcus abyssi), Pho (Pyrococcus horikosihi, B103 (Picovirinae Bacteriophage B103), Bst (Bacillus stearothermophilus), Bst Large Fragment, Bst 2.0, Bst 2.0 WarmStart, Bsu, TherminatorTM, TherminatorTM II, TherminatorTM III, and TherminatorTM T.
  • the DNA polymerase is a Taq polymerase, such as Taq, Hot Start Taq, Hot Start Ex Taq, Hot Start LA Taq, DreamTaqTM, TopTaq, RedTaq, Taq urate, NovaTaqTM or
  • thermostable DNA polymerase is active at 72°C.
  • the cycle times can be varied according to the melting temperatures of the primers and the length of the PCR product, with longer PCR products calling for longer extension times.
  • a rule of thumb is that the extension step should be at least 60 seconds per 1 ,000 bases of amplicon.
  • the extension step can be extended in the linear phase to provide additional time for annealing. 6.2.3.4.1. Extended Primer
  • the “A” region of the Extended Primer has at least 75% sequence identity to a corresponding region in Target Strand 1 .
  • the “A” region of the primer has at least 80%, at least 85%, at least 90%, or at least 95% identical to the corresponding region in Target Strand 1 .
  • the “A” region of the primer has 100% sequence identity to the corresponding region of Target Strand 1 .
  • the “A” region of the Extended Primer has at least 75%, at least 80%, at least 85%, at least 90%, or at least 95% or 100% sequence identity to the complement of the corresponding region in Target Strand 2.
  • One of skill in the art can readily design primer sequences that have less than 100% sequence identity to the target strand but can still efficiently amplify target DNA.
  • the sequence in the “B” region that is complementary to at least a portion of the “A” region can be a Direct Repeat or Inverted Repeat.
  • different Extended Primer molecules can hybridize to one another intermolecularly, as shown in Fig. 5B.
  • Extended Primer molecules can hybridize intramolecularly, as shown in Fig. 5C, or to one another intermolecularly, as shown in Fig. 5A.
  • the portion of the “A” region to which a sequence in the “B” region is complementary is preferably at or near (e.g., within 1 , 2, or 3 nucleotides from) the 5' end of the “A” region, i.e., at or near where the “A” region adjoins the “B” region (or the “C” region when a “C” region is present).
  • the “B” region of the Extended Primer is preferably 6 to 12 nucleotides in length, i.e., is preferably 6, 7, 8, 9, 10, 11 or 12 nucleotides in length. In specific embodiments, the “B” region of the Extended Primer is 8 to 10 nucleotides in length, i.e., is 8, 9 or 10 nucleotides in length.
  • the “C” region when present in an Extended Primer, is preferably 1 to 6 nucleotides in length, i.e., is preferably 1 , 2, 3, 4, 5, or 6 nucleotides in length.
  • the T m of the Extended Primer is preferably (but not necessarily) between approximately 68°C and approximately 80°C.
  • the T m of the Unextended Primer is between approximately 72°C and approximately 78°C, for example approximately 72°C, approximately 73°C, approximately 74°C, approximately 75°C, approximately 76°C, approximately 77°C, or approximately 78°C.
  • the optional region “C” positioned between regions “A” and “B” can act as a spacer between the “A” and “B” regions to allow the Extended Primer to form a hairpin loop and/or introduce a restriction endonuclease sequence (preferably a 6-cutter sequence) into the PCR product.
  • the restriction endonuclease sequence can be within the “C” region in its entirety or be formed from all or a portion of the “C” region together with flanking 5' and/or 3' sequences from the “B” and “A” regions, respectively.
  • the “C” region is preferably not complementary to Target Strand 1 or Target Strand 2.
  • the T m of the Extended Primer is preferably at least approximately 6°C greater than the T m of the Unextended Primer.
  • the Extended Primer has a T m that is at approximately 15 °C to 30°C greater than the T m of the Unextended Primer.
  • the T m of the “A” region of the Extended Primer is preferably no more than approximately 3°C higher or lower than the T m of the portion of the Unextended Primer (at least 75%) complementary to the target (exclusive of any 5' extensions), i.e., the T m of region in the forward primer that hybridizes to the target is preferably no more than approximately 3°C higher or lower than the T m of the region in the reverse primer that that hybridizes to the target, and vice versa.
  • the “A” region of the Extended Primer is preferably at least 12 nucleotides in length, and preferably ranges from 12 to 30 nucleotides and more preferably from 14-25 nucleotides. In certain embodiments, the “A” region of the Extended Primer is 14, 15, 16, 17, 18, 19 or 20 nucleotides in length.
  • the Unextended Primer has a nucleotide sequence at least 75% sequence identity to a corresponding region in Target Strand 2. In certain embodiments, the Unextended Primer has a nucleotide sequence with at least 80%, at least 85%, at least 90%, or at least 95% sequence identity to the corresponding region in Target Strand 2. In yet other embodiments, the Unextended Primer has a nucleotide sequence with 100% sequence identity to the corresponding region of Target Strand 2.
  • Unextended Primer has a nucleotide sequence having least 75%, at least 80%, at least 85%, at least 90%, or at least 95% or 100% sequence identity to the complement of the corresponding! region in Target Strand 2.
  • One of skill in the art can readily design primer sequences that have less than 100% sequence identity to the target strand but can still efficiently amplify target DNA.
  • the Unextended Primer may further have a 5' tail of 1 , 2 or 3 nucleotides.
  • the T m of the Unextended Primer is preferably (but not necessarily) between approximately 50°C and approximately 62°C.
  • the T m of the Unextended Primer is between approximately 59°C and approximately 62°C, for example approximately 59°C, approximately 60°C, approximately 61 °C, or approximately 62°C.
  • the T m of the Unextended Primer is preferably at least approximately 6°C lower than the T m of the Extended Primer.
  • the Unextended Primer has a T m that is at approximately 15°C to 30°C lower than the T m of the Extended Primer.
  • the T m of the region of the Unextended Primer (at least 75%) complementary to the target (exclusive of any 5' extensions) is preferably no more than approximately 3°C higher or lower than the T m of the “A” region of the Extended Primer, i.e., the T m of region in the forward primer that hybridizes to the target is preferably no more than approximately 3°C higher or lower than the T m of the region in the reverse primer that that hybridizes to the target, and vice versa.
  • the Unextended Primer is preferably at least 12 nucleotides in length, and preferably ranges from 12 to 30 nucleotides and more preferably from 14-25 nucleotides. In certain embodiments, the Unextended Primer is 14, 15, 16, 17, 18, 19 or 20 nucleotides in length.
  • a third, “generic” primer is used that has a sequence that is similar to a 5' oligonucleotide tail added to one of the primers.
  • the generic primer is intended to participate in the amplification reaction after the initial PCR cycle to “balance” the amplification efficiency of different targets in a multiplex amplification reaction.
  • the improved asymmetric DNA amplification methods described herein can utilize a single Asymmetric Primer Pair per target region, i.e., do not include any additional primers, recognizing that an individual primer may be a mixture of primer molecule with closely related sequences resulting from the inclusion of mixed bases at certain positions in the primer.
  • this embodiment does not preclude that use of a plurality of Asymmetric Primer Pairs in a multiplex amplification reaction, provided that a single Asymmetric Primer Pair is used for each amplicon.
  • the PCR amplification reaction used in the methods of the disclosure can be a realtime PCR amplification reaction.
  • Real-time PCR refers to a growing set of techniques in which the buildup of amplified DNA products can be measured as the reaction progresses, typically once per PCR cycle. Monitoring the accumulation of products overtime allows for the determination of the efficiency of the reaction, as well as to estimate the initial concentration of DNA template molecules.
  • Real-Time PCR An Essential Guide, K. Edwards et al., eds., Horizon Bioscience, Norwich, U.K. (2004).
  • FRET fluorescence resonance energy transfer
  • Hydrolysis probes use the polymerase enzyme to cleave a reporter dye molecule from a quencher dye molecule attached to an oligonucleotide probe.
  • Conformation probes (such as molecular beacons) utilize a dye attached to an oligonucleotide, whose fluorescence emission changes upon the conformational change of the oligonucleotide hybridizing to the target DNA (see for example, Tyagi S et al., 1996, Molecular beacons: probes that fluoresce upon hybridization. Nat Biotechnol 14, 303-308).
  • Real-time PCR can be symmetric or asymmetric, e.g., performed with a hydrolysis probe molecule in the reaction mixture of a symmetric or asymmetric PCR amplification reaction described in Section 6.2.3.3 or 6.2.3.4.
  • probe molecules e.g., oligonucleotide probe molecules, suitable for the sequence specific detection of amplicons produced in a PCR reaction.
  • Parameters for successful oligonucleotide probe molecule design include, but are not limited to, probe molecule length, cross-hybridization efficiency, melting temperature, GC-content, self-annealing, and the ability to form secondary structures.
  • oligonucleotide probe molecules in a microarray, e.g., an addressable array, in which the probe molecules are anchored to a substrate, e.g., a membrane, e.g., a glass substrate, e.g., a plastic substrate, e.g., a polymer-matrix substrate, and exposed to nucleic acids under conditions allowing hybridization of the oligonucleotide probe molecule with amplicons having similar to identical sequences, e.g., the sequences share at least 75%, e.g., 80%, e.g., 85%, e.g., 90%, e.g., 95%, e.g., 96%, e.g., 97%, e.g., 98%, e.g., 99%, or even 100% similarity or identity.
  • a substrate e.g., a membrane, e.g., a glass substrate, e.g., a
  • oligonucleotide probe molecules used in the methods of the disclosure comprise a nucleotide sequence that is 90% to 100% complementary (e.g., 90% to 95% or 95% to 100%) to 15 to 40 consecutive nucleotides in a first genomic sequence and/or second genomic sequence.
  • oligonucleotide probe molecules are described in the Examples, and include probe molecules comprising nucleotide sequences of SEQ ID NOs: 1-7.
  • the oligonucleotide probe molecules are present on an array. Each probe molecule can be at a discrete at a discrete location on the array and distinguishable by its location on the array such that the oligonucleotide probe molecules are positionally addressable probe molecules present on the array.
  • the oligonucleotide probe molecules comprise a polythymidine tail, for example, a poly-thymidine tail comprising up to 10 nucleotides, or for example, a poly-thymidine tail comprising up to 15 nucleotides, or, for example, a polythymidine tail comprising up to 20 nucleotides.
  • the poly-thymidine tail comprises 10 to 20 nucleotides, e.g., 15 nucleotides.
  • Poly-thymidine tails can be useful when probe molecules are attached to an array, with the poly-thymidine tail acting as a spacer between the array substrate and the region of the probe molecule having partial or full complementarity to a target sequence(s).
  • Oligonucleotide probe molecules can be labeled or unlabeled. In some embodiments, oligonucleotide probe molecules are labeled. In other embodiments, oligonucleotide probe molecules are unlabeled. Oligonucleotide probe molecules can be labeled, for example with a fluorescent reporter, which can be a fluorescent dye such as those described in Section 6.2.3 or 6.2.3.1. Labeled oligonucleotide probe molecules can be used, for example, in realtime PCR reactions.
  • Labeled oligonucleotide probe molecules for real-time PCR can comprise a fluorescent reporter at one end of the probe molecule and a quencher moiety at the other end of the probe molecule that quenches fluorescence of the reporter.
  • the probe molecule can hybridize to its target sequence during the annealing stage, and once the polymerase reaches the probe molecule during the extension stage, its 5'-3'- exonuclease degrades the probe, physically separating the fluorescent reporter from the quencher, resulting in an increase in fluorescence, which can be measured.
  • the locations of the fluorescent label and quencher moiety on the probe molecule may be such that FRET may occur between the two moieties.
  • the fluorescent label can be, for example, at or near the 5' end of the probe molecule and the quencher moiety at or near the 3' end of the probe.
  • the separation distance between the fluorescent label and quencher is about 14 to about 22 nucleotides, although other distances, such as from about 6, about 8, about 10, or about 12 nucleotides may be used. Additional distances that can be used include about 14, about 16, about 18, about 20, or about 22 nucleotides.
  • An exemplary fluorescent label that can be used in an oligonucleotide probe molecule for real-time PCR is FAM or 6-FAM, and a representative quencher moiety is MGB.
  • a reporter moiety include fluorescein, HEX, TET, TAM, ROX, Cy3, Alexa, and Texas Red while non- limiting examples of a quencher or acceptor fluorescent moiety include TAMRA, BHQ (black hole quencher), LC RED 640, and cyanine dyes such as CY5.
  • any pair of reporter and quencher/acceptor moieties may be used as long as they are compatible such that transmission may occur from the donor to the quencher/acceptor.
  • pairs of suitable donors and quenchers/acceptors are known in the art and are provided herein. The selection of a pair may be made by any means known in the art. Custom real-time PCR probe molecules are commercially available, for example from ThermoFisher Scientific, Sigma- Aldrich and others.
  • this Section refers to amplicons and amplicon sets that can be probed with virtual probes.
  • virtual probes can likewise be used to probe samples containing or suspected of containing non-amplified target nucleic acids such as genome fragments.
  • the number of nucleotide mismatches between a first amplicon set (containing a single first amplicon corresponding to a region in a first genome or a plurality of first amplicons corresponding to different regions in the first genome) and a second amplicon set (containing a single second amplicon corresponding to a region in a second genome or a plurality of second amplicons corresponding to different regions in the second genome) can be relatively small, and individual oligonucleotide probe molecules may not be capable of individually distinguishing between the first and second amplicon sets.
  • the inventors have unexpectedly discovered that it is possible to nevertheless distinguish between the first and second amplicon sets in such situations by using a virtual probe.
  • the probe molecules of the virtual probe cannot individually, but can collectively distinguish between the two amplicon sets by virtue of the different hybridization patterns observed when the first and second amplicon sets are probed with the probe molecules of the virtual probe.
  • a meter probe in a virtual probe can help to improve the accuracy of species identification.
  • a meter probe that can hybridize with an amplicon in the first amplicon set and a homologous amplicon in the second amplicon set can be used to provide a measure of the relative concentration such amplicons in a sample.
  • a first formula or a second formula can be used to analyze the observed hybridization pattern for all probe molecules of the virtual probe.
  • a first formula can be used to analyze the hybridization pattern when the signal from the meter probe is at or above a pre-determined threshold level (indicating a relatively high concentration of amplicon in the sample), while a second formula can be used to analyze the hybridization pattern when the signal from the meter probe is below the threshold level (indicating a relatively low concentration of amplicon in the sample).
  • Threshold levels can be determined empirically, for example from signal data obtained from a series of samples containing known or unknown concentrations of microorganisms (see, e.g., Example 6).
  • Signal data for oligonucleotide probes can be plotted and the plots examined for patterns or trends, with a threshold value being selected on the basis of an observed pattern or trend, e.g., as described in Example 6.
  • a plot can contain signal data for a first oligonucleotide probe on the X-axis and signal ratio data (e.g., ratios of signals for the first oligonucleotide probe to signals for a second oligonucleotide probe) on the Y-axis.
  • value X can be selected as a threshold.
  • the meter probe used in a virtual probe is a genus probe, e.g., a probe that can be used to identify a microorganism as a member of a genus, but that cannot by itself identify an individual species of the genus from another species of the genus.
  • An exemplary meter probe useful in virtual probes for Staphylococci comprises the nucleotide sequence of SEQ ID NO:1 .
  • Nucleotide sequences of the first amplicon and the second amplicon should have at least 1 (e.g., 1 , at least 2, 2, at least 3, or 3) nucleotide mismatch in the regions of the amplicons capable of being bound by the probe molecules used in a virtual probe so that there is a difference in the signal pattern for the two or more probe molecules that make up the virtual probe when the probe molecules are hybridized to the first amplicon set and when the probe molecules are hybridized to the second amplicon set (e.g., on an array or during a real-time PCR reaction).
  • the difference in the signal pattern can be used to identify and/or distinguish the first and second amplicon sets.
  • the sample from which the first amplicon set was produced contains the genome corresponding to the first amplicon set (and, by extension, the organism whose genome is contained in the sample).
  • the sample from which the second amplicon set was produced contains the genome corresponding to the second amplicon set (and, by extension, the organism whose genome is contained in the sample).
  • Signals for individual probe molecules of a virtual probe when hybridized to PCR amplification products can be combined, for example, by one or more Boolean operators, by one or more relational operators, by one or more signal ratios (e.g., the signal for a first probe can be divided by the signal for a second probe), or by any of the foregoing, in any combination, to distinguish between first and second amplicon sets.
  • the signals are combined by one or more Boolean operators.
  • the signals are combined by one or more relational operators.
  • the signals are combined by one or more Boolean operators and one or more relational operators.
  • the signals are combined by a signal ratio.
  • the Boolean operators “AND”, “OR”, and “NOT” can be used to combine the signals from individual probe molecules of a virtual probe to distinguish between a first amplicon set and a second amplicon set.
  • a virtual probe for two homologous amplicons (“Amplicon A” and “Amplicon B” in this example) consists of two probe molecules (“Probe 1” and “Probe 2” in this example). Both Probe 1 and Probe 2 are capable of specifically hybridizing to Amplicon A, while Probe 1 , but not Probe 2, is capable of specifically hybridizing to Amplicon B.
  • PCR amplification products are probed with the virtual probe and the signal from hybridization of Probe 1 and the signal from hybridization of Probe 2 to the PCR amplification products are both positive (which can be represented using the Boolean operator “AND” as “Probe 1 AND Probe 2”), it can be determined that Amplicon A is present in the PCR amplification products.
  • PCR amplification products are probed with the virtual probe and the signal from hybridization of Probe 1 to the PCR products is positive, while the signal from hybridization of Probe 2 to the PCR products is not positive (which can be represented using the Boolean operator “NOT” as “Probe 1 NOT Probe 2”), it can be determined that Amplicon B is present in the PCR amplification products.
  • a hybridization signal can be considered positive, for example, if the hybridization signal is above a background level.
  • a hybridization signal can be considered not positive, for example, when no signal is observed or the observed signal is not above a background level.
  • the relational operators “greater than” (“>”), “greater than or equal to” (“>”), “less than” (“ ⁇ ”) and “less than or equal to” (“ ⁇ ”) can be used to combine the signals from individual probe molecules of a virtual probe to distinguish between a first amplicon set and a second amplicon set.
  • a virtual probe for two homologous amplicons (“Amplicon C” and “Amplicon D” in this example) consists of two probe molecules (“Probe 3” and “Probe 4” in this example). Both Probe 3 and Probe 4 are capable of specifically hybridizing to Amplicon C and Amplicon D.
  • Probe 3 and Probe 4 are hybridized to Amplicon C, the signal for Probe 3 is greater than the signal for Probe 4 (which can represented using the “greater than” relational operator as “Probe 3 > Probe 4”.
  • Probe 3 and Probe 4 are hybridized to Amplicon D, the signal for Probe 3 is less than the signal for Probe 4 (which can be represented using the “less than” relational operator as “Probe 3 ⁇ Probe 4”).
  • signals from individual probe molecules can be combined by a signal ratio, e.g., the signal for Probe 1 divided by the signal for Probe 2, or vice versa.
  • Signal ratios can in some embodiments be compared to a pre-determined cutoff value to determine the presence or absence of a microorganism in a sample or an initial sample from which the test sample was prepared.
  • the signals can be, for example, absolute signals, normalized signals, or fractional signals (e.g., the value of a signal for a probe molecule used in a virtual probe can be scaled using a predetermined function, for example as described in Example 3).
  • a signal for a probe molecule can be considered positive, for example, when it is above a predetermined cut-off.
  • a cut-off can be, for example, set at or above a background signal observed for a given probe molecule (e.g., a background signal due to non-specific hybridizing). Thus, for example, if a signal for a probe molecule is observed, but the signal is not above a background level, the signal can be considered not positive.
  • a virtual probe comprises two or more oligonucleotide probe molecules (e.g., 2 oligonucleotide probe molecules). In another embodiment, a virtual probe comprises three or more oligonucleotide probe molecules (e.g., 3 oligonucleotide probe molecules or 4 oligonucleotide probe molecules).
  • a virtual probe for a first organism and a second organism consists of two probe molecules.
  • the two probe molecules comprise a first probe molecule capable of specifically hybridizing a first amplicon in a first amplicon set (corresponding to the first organism) and a second amplicon in a second amplicon set (corresponding to the second organism), and a second probe molecule that is capable of specifically hybridizing to an amplicon in the second amplicon set but not an amplicon in the first amplicon set.
  • it can be determined that the first organism is present in a sample if the signal for the first probe molecule is positive and the signal for the second probe molecule is not positive when probing PCR amplification products prepared from the sample.
  • a virtual probe for a first organism and a second organism consists of three probe molecules.
  • the three probe molecules comprise a first probe molecule capable of specifically hybridizing a first amplicon in a first amplicon set (corresponding to the first organism) and a second amplicon in a second amplicon set (corresponding to the second organism), a second probe molecule capable of specifically hybridizing to an amplicon in the first amplicon set and an amplicon in the second amplicon set, and which is different from the first probe, and a third probe molecule capable of specifically hybridizing to an amplicon in the first amplicon set and an amplicon in the second amplicon set, and which is different from the first and second probe molecules.
  • the relative signals for the three probe molecules observed when probing a PCR amplification product can be used to determine whether the sample used to prepare the PCR amplification products contains the first organism or the second organism.
  • virtual probes can be used to distinguish homologous genomic sequences
  • virtual probes can be used to distinguish closely related organisms, for example closely related microorganisms.
  • virtual probes can be used to distinguish microorganisms from the same order, the same family, the same genus, the same group, or even the same species (e.g., different strains of the same species).
  • virtual probes can be used to distinguish between Lactobacillus and Listeria species, distinguish between Corynebacterium and Propionibactium species, distinguish between Micrococcus and Kocuria species, distinguish between Pasturella and Haemophilias species, distinguish between coagulase negative Staphylococcus species and coagulase positive Staphylococcus species, distinguish Streptococcus species (e.g., S. anginosus, S. gordonii, S. mitis, S. pneumoniae, S. agalactiae, S. pyogenes. S. gallolyticus, S. infantarius, S. vestibularis, S. salivarius, S. hyointestinalis, S.
  • Streptococcus species e.g., S. anginosus, S. gordonii, S. mitis, S. pneumoniae, S. agalactiae, S. pyogenes. S. gallolytic
  • S. intermedius e.g., S. oralis, S. sanguinis, S. parasanguinis
  • Staphylococcus species e.g., S. lugdunensis, S. epidermidis
  • Enterococcus species e.g., E. fecalis, E. faecium
  • Clostridium species e.g., C. perfringens, C. clostridiiforme, C. innocuum
  • Bacillus species e.g., B. cereus, B. coagulans
  • Pseudomonas species e.g., P. aeruginosa, P. putida, P. stutzeri, P.
  • Acinetobacter species e.g., A. baumannii, A. Iwoffii, A. ursingii, A. haemolyticus, A. junii).
  • Sections 6.2.5.1 to 6.2.5.3 and Examples 1-5 in Section 7 describe exemplary virtual probes for identifying and/or distinguishing different types of closely related bacteria.
  • the disclosure provides virtual probes that can be used to determine if a coagulase negative Staphylococcus sp. is present in a sample, and that can be used to distinguish a sample comprising a coagulase negative Staphylococcus sp. from a sample comprising a coagulase positive Staphylococcus sp.
  • a first exemplary probe molecule that can be used in a virtual probe for coagulase negative Staphylococcus sp. comprises or consists of the nucleotide sequence CCAGTCTTATAGGTAGGTTAYCCACG (SEQ ID NO:1).
  • a second exemplary probe molecule that can be used in a virtual probe for coagulase negative Staphylococcus sp. comprises or consists of the nucleotide sequence GCTTCTCGTCCGTTCGCTCG (SEQ ID NO:2).
  • the nucleotide sequences of SEQ ID NO:1 and SEQ ID NO:2 are designed to probe 16S RNA amplicons.
  • probe molecules having nucleotide sequences of SEQ ID NO:1 or SEQ ID NO:2 can be used to probe amplicons produced by a PCR amplification reaction performed using primers designed to amplify coagulase negative Staphylococcus sp. 16S rRNA genomic sequences.
  • the probe molecules can be, for example, included on an array or used in a real-time PCR reaction.
  • a sample contains a coagulase negative Staphylococcus sp. if, when probing PCR amplification products prepared from the sample, the signal for the first oligonucleotide probe (“Probe 1”) is positive and the signal for the second oligonucleotide probe (“Probe 2”) is not positive (which can be represented using the “NOT” operator as “Probe 1 NOT Probe 2”).
  • signal ratios for Probe 1 and Probe 2 can be used. For example, when the signal for Probe 1 divided by the signal for Probe 2 is greater than or equal to a predetermined cutoff value, it can be determined that a sample contains a coagulase negative Staphylococcus sp. Different formulas having different cutoff values can be used when the signal from a meter probe indicates a relatively high amount or a relatively low amount of target nucleic acids in a sample. Probe 1 , a genus probe, can be used as a meter probe.
  • first formula having a first cutoff can be used, and when the signal from the meter probe indicates a relatively low concentration of target nucleic acids (e.g., when the signal for the meter probe is below a pre-determined threshold value), a second formula having a second cutoff can be used.
  • first and second formulas can increase the accuracy of coagulase negative Staphylococcus sp. and coagulase positive Staphylococcus sp. identification, for example, as described in Example 6.
  • Exemplary virtual probes for coagulase negative Staphylococcus sp. are further described in Examples 1 and 6. 6.2.5.2.
  • the disclosure provides virtual probes that can be used to determine if Streptococcus gordonii or Streptococcus anginosus is present in a sample, and that can be used to distinguish a sample comprising Streptococcus gordonii from a sample comprising Streptococcus anginosus.
  • a first exemplary probe molecule that can be used in a virtual probe for S. gordonii and S. anginosus comprises or consists of the nucleotide sequence CAGTCTATGGTGTAGCAAGCTACGGTAT (SEQ ID NO:3).
  • a second exemplary probe molecule that can be used in a virtual probe for S. gordonii and S.
  • anginosus comprises or consists of the nucleotide sequence TATCCCCCTCTAATAGGCAGGTTA (SEQ ID NO:4).
  • the nucleotide sequences of SEQ ID NO:3 and SEQ ID NO:4 are designed to probe 16S RNA amplicons.
  • oligonucleotide probe molecules having nucleotide sequences of SEQ ID NO:3 or SEQ ID NO:4 can be used to probe amplicons produced by a PCR amplification reaction performed using primers designed to amplify 16S rRNA genomic sequences from S. gordonii and S. anginosus.
  • the probe molecules can be, for example, included on an array or used in a real-time PCR reaction.
  • a sample contains S. gordonii if, when probing PCR amplification products prepared from the sample, the signal for the first probe (“Probe 1”) is positive and the signal for the second probe (“Probe 2”) is not positive (which can be represented using the “NOT” operator as “Probe 1 NOT Probe 2”). It can be determined that a sample contains S. anginosus if, when probing PCR amplification products prepared from the sample, the signal for Probe 1 is positive and the signal for Probe 2 is also positive (which can be represented using the “AND” operator as “Probe 1 AND Probe 2”). Exemplary virtual probes for S. gordonii and S. anginosus are further described in Example 2.
  • the disclosure virtual probes that can be used to determine if Streptococcus mitis or Streptococcus pneumoniae is present in a sample, and that can be used to distinguish a sample comprising Streptococcus mitis from a sample comprising Streptococcus pneumoniae.
  • a first exemplary probe molecule that can be used in a virtual probe for S. mitis and S. pneumoniae comprises or consists of the nucleotide sequence AGCTAATACAACGCAGGTCCATCT (SEQ ID NO:5).
  • a second exemplary probe molecule that can be used in a virtual probe for S. mitis and S. pneumoniae comprises or consists of the nucleotide sequence GATGCAAGTGCACCTTTTAAGCAA (SEQ ID NO:6).
  • a third exemplary probe molecule that can be used in a virtual probe for S. mitis and S. pneumoniae comprises or consists of the nucleotide sequence GATGCAAGTGCACCTTTTAAGTAA (SEQ ID NO:7).
  • the nucleotide sequences of SEQ ID NO:5, SEQ ID NO:6, and SEQ ID NO:7 are designed to probe 16S RNA amplicons.
  • probe molecules having nucleotide sequences of SEQ ID NO:5, SEQ ID NO:6, or SEQ ID NOT can be used to probe amplicons produced by a PCR amplification reaction performed using primers designed to amplify 16S rRNA genomic sequences of S. mitis and S. pneumoniae.
  • the probe molecules can be, for example, included on an array or used in a real-time PCR reaction.
  • a sample contains S. mitis if, when probing PCR amplification products prepared from the sample, the signals for the second probe (“Probe 2”) and/or third probe (“Probe 3”) is/are less than a scaled signal for the first probe (“Probe 1”).
  • the relationship between the signals for determining whether the sample contains S. mitis can be represented using Boolean and relational operators as “(Probe 2 OR Probe 3) ⁇ (Probe 1 )/n,” where n is a pre-determined value used to scale the Probe 1 signal. It can be determined that a sample contains S.
  • the relationship between the signals for determining whether the sample contains S. pneumoniae can be represented using Boolean and relational operators as “(Probe 2 OR Probe 3) > (Probe 1 )/n.”
  • a suitable value for “n” can be determined, for example, by probing PCR products produced from a sample known to contain S. mitis and probing PCR products from a sample known to contain S. pneumoniae.
  • a sample contains S. mitis if, when probing PCR amplification products prepared from the sample, the signal for Probe 3 divided by the signal for Probe 1 is less than a predetermined value “n”. It can be determined that a sample contains S. pneumoniae if, when probing PCR amplification products prepared from the sample, the signal for Probe 3 divided by the signal for probe 1 is greater than “n”.
  • a suitable value for “n” can be determined, for example, by probing PCR products produced from a sample known to contain S. mitis and probing PCR products from a sample known to contain S. pneumoniae.
  • the present disclosure provides addressable arrays comprising one or more virtual probes that each can be useful for distinguishing a first genomic sequence from a second, homologous genomic sequence.
  • the addressable arrays of the disclosure can be used in the methods described herein.
  • An addressable array of the disclosure can comprise a group of positionally addressable oligonucleotide probe molecules, each at a discrete location on the array.
  • each probe molecule in the group of oligonucleotide probe molecules making up a virtual probe comprises a nucleotide sequence that is 90% to 100% (e.g., 90% to 95% or 95% to 100%) complementary to 15 to 40 consecutive nucleotides (e.g., 15 to 20, 15 to 30, 20 to 40, 20 to 30, or 30 to 40 consecutive nucleotides) in the first genomic sequence or second genomic sequence that the virtual probe is intended to distinguish.
  • the addressable array may further optionally comprise one or more control probe molecules (e.g., an extraction and amplification control for useful for evaluating the efficiency of DNA extraction and amplification steps and/or a hybridization control useful for evaluating the efficiency of DNA hybridization to the array).
  • control probe molecules e.g., an extraction and amplification control for useful for evaluating the efficiency of DNA extraction and amplification steps and/or a hybridization control useful for evaluating the efficiency of DNA hybridization to the array.
  • the probe molecules of the array comprise a poly-thymidine tail, for example, a poly-thymidine tail comprising up to 10 nucleotides, or for example, a poly-thymidine tail comprising up to 15 nucleotides, or, for example, a poly-thymidine tail comprising up to 20 nucleotides.
  • the poly-thymidine tail is a 10-mer to a 20-mer, e.g., a 15 mer.
  • the addressable array comprises 12 or more probe molecules, for example, 12 to 100 probe molecules, or for example, 12 to 50 probe molecules, or for example, 25 to 75 probe molecules, or for example, 50 to 100 probe molecules. In some embodiments, the addressable array comprises 12 probe molecules. In other embodiments, the addressable array comprises 14 probe molecules. In still other embodiments, the addressable array comprises 84 probe molecules.
  • the addressable array comprises oligonucleotide probes for at least 2 virtual probes, for example, for at least 3 virtual probes, or for example, at least 5 virtual probes, or for example at least 10 virtual probes, or for example, the addressable array comprises oligonucleotide probes for up to 10 or up to 15 virtual probes.
  • the virtual probes can be overlapping such that a probe molecule can be a component of two or more virtual probes.
  • the virtual probes can also be non-overlapping.
  • the addressable array comprises virtual probes capable of distinguishing between at least 5 different types of microorganisms, for example bacteria. In other embodiments, the addressable array comprises virtual probes capable of distinguishing between at least 10 different types, for example, at least 20 different types, for example at least 30 different types, for example, at least 40 different types, or for example up to 50 different types of microorganisms, for example bacteria.
  • the addressable array contains at least 5 virtual probes, for example, at least 10 virtual probes, for example at least 15 virtual probes, or for example, at least 20 virtual probes, each of which is capable of identifying different types of microorganisms, for example bacteria, for example, different strains or species of bacteria that might be present in a sample.
  • an addressable array of the disclosure comprises one or more virtual probes for differentiating a genomic sequence from a species of eubacteria from a genomic sequence of a microorganism that is not a species of eubacteria.
  • the addressable array comprises one or more virtual probes suitable for differentiating a genomic sequence from a gram positive bacteria and a genomic sequence from a gram negative bacteria.
  • the addressable array comprises one or more virtual probes suitable for differentiating genomic sequences from microorganisms of different orders.
  • the virtual probes are suitable for differentiating genomic sequences from microorganisms of different families.
  • the virtual probes are suitable for differentiating genomic sequences from microorganisms of different genera, of different groups, and/or of different species.
  • Suitable microarray system that can be used to make an array of the disclosure are described in U.S. Patent No. 9,738,926 and U.S. Patent Application Publication no. 2018/0362719 A1 , the contents of which are incorporated by reference herein in their entireties.
  • the microarray systems described in U.S. Patent No. 9,738,926 and U.S. Patent Application Publication no. 2018/0362719 A1 take advantage of three-dimensional crosslinked polymer networks.
  • the array of the disclosure comprises an array as described in U.S. Patent No. 9,738,926, wherein the probe molecules of the array comprise a group of oligonucleotide probe molecules as described herein.
  • the array of the disclosure comprises an array as described in U.S.
  • the present disclosure provides methods for determining if a first organism or a second organism is present in a sample using an array of the disclosure.
  • An exemplary method comprises the steps of: performing a PCR amplification reaction on a sample using PCR primers capable of hybridizing to, and initiating a PCR amplification from, both the genome of the first organism (the “first genome”) and the genome of the second organism (the “second genome”), resulting in a first amplicon set and a second amplicon set, respectively, when the first genome and the second genome are present in the sample, and wherein the PCR amplification reaction incorporates a label which produces a measurable signal into any PCR amplification products produced by the reaction; contacting the PCR amplification products to an array of the disclosure having one or more virtual probes comprising two or more oligonucleotide probe molecules each of which is capable of specifically hybridizing to one or more amplicons in the first amplicon set and/or the second amplicon set,
  • the disclosure provides computer-implemented methods for detecting if a first microorganism having a first genome or a second microorganism of the same genus having a second genome is present in a test sample or an initial sample from which the test sample was prepared.
  • the methods typically comprise executing, in a computer system having one or more processors coupled to a memory storing one or more computer readable instructions for execution by the one or more processors, the one or more computer readable instructions comprising instructions for: (a) receiving signal data from hybridization of probe molecules to nucleic acids, if any, in a test sample, and (d) analyzing the signal data according to one or more formulas to determine whether a first microorganism having a first genome or a second microorganism having a second genome is present in a test sample or an initial sample from which the test sample was prepared.
  • the one or more formulas can comprise, for example, a first formula that is used to analyze the signal data if the signal from a meter probe is at or above a predetermined threshold value, and a second formula that is used to analyze the signal data if the signal data from the meter probe is less than a predetermined threshold value.
  • the computer-implemented methods can include providing notifications to the user, for example concerning the identity of a microorganism in a sample and/or concerning the absence of a microorganism of interest in a sample.
  • the present disclosure provides systems useful, for example, for determining if an organism is present in a sample.
  • the systems can comprise, for example: (i) an optical reader for generating signal data for each probe molecule location of an array having oligonucleotide probe molecules (e.g., an array of the disclosure); and (ii) at least one processor which is configured to receive signal data from the optical reader and is configured to analyze the signal data using a virtual probe (e.g., a virtual probe having features as described herein), and which has an interface to a storage or display device or network for outputting a result of the analysis.
  • a virtual probe e.g., a virtual probe having features as described herein
  • Optical readers that can be used in the systems of the disclosure include commercially available microplate readers (e.g., GloMax® Discover (Promega), ArrayPixTM (Arrayit), VarioskanTM LUX (Thermo Scientific), Infinite® 200 PRO (Tecan)).
  • microplate readers e.g., GloMax® Discover (Promega), ArrayPixTM (Arrayit), VarioskanTM LUX (Thermo Scientific), Infinite® 200 PRO (Tecan)).
  • the systems can further include (as a component of an optical reader or as a separate component) a light source light source capable of exciting fluorescently labeled molecules (e.g., probe molecules and/or amplicons hybridized thereto) on an array.
  • a light source light source capable of exciting fluorescently labeled molecules (e.g., probe molecules and/or amplicons hybridized thereto) on an array.
  • the system can include a non-transient storage medium (e.g., a hard disk, flash drive, CD or DVD) including processor executable instructions for implementing the analysis of the signal data.
  • the system can include a general purpose or a special purpose computing system environment or configuration. Examples of well-known computing systems, environments, and/or configurations that can be used with the systems of the disclosure include, but are not limited to, personal computers, server computers, smartphones, tablets, hand-held or laptop devices, multiprocessor systems, microprocessor-based systems, network PCs, minicomputers, mainframe computers, distributed computing environments that include any of the above systems or devices, and the like.
  • Systems of the disclosure can execute computer-executable instructions, such as program modules.
  • program modules include routines, programs, objects, components, data structures, etc. that perform particular tasks or implement particular abstract data types.
  • Some embodiments may also be practiced in distributed computing environments where tasks are performed by remote processing devices that are linked through a communications network. These distributed systems may be what are known as enterprise computing systems or, in some embodiments, may be “cloud” computing systems.
  • program modules may be located in both local and/or remote computer storage media including memory storage devices.
  • a computing environment may include one or more input/output devices. Some such input/out devices may provide a user interface. A user may enter commands and information into the computer through input devices such as a keyboard and pointing device, such as a mouse. However, other forms of pointing devices may be used, including a trackball, touch pad or touch screen.
  • Systems of the disclosure can include one or more output devices, including an output device that may form a portion of a user interface, for example a monitor.
  • Systems of the disclosure can be operated in a networked environment using logical connections to one or more remote computers.
  • the remote computer may be a personal computer, a server, a router, a network PC, a peer device or other common network node.
  • Logical connections include a local area network (LAN) and a wide area network (WAN), but may also include other networks.
  • LAN local area network
  • WAN wide area network
  • Such networking environments are commonplace in offices, enterprise-wide computer networks, intranets and the Internet.
  • the WAN may include a cellular network.
  • a system of the disclosure can be connected to the LAN through a network interface or adapter.
  • a system can include a modem or other means for establishing communications over the WAN, such as the Internet.
  • program modules for analyzing signal data using a virtual probe can be stored in a remote memory storage device (e.g., hard drive or flash drive).
  • a remote memory storage device e.g., hard drive or flash drive.
  • a system of the disclosure can further comprise a plate handling robot capable of adding the product of a PCR amplification reaction to the array and capable of washing unbound nucleic acid molecules from the array.
  • a plate handling robot capable of adding the product of a PCR amplification reaction to the array and capable of washing unbound nucleic acid molecules from the array.
  • Numerous plate handling robots are commercially available and such robots can be used in the systems of the disclosure (e.g., a Tecan MSP 9000, MSP 9250 or MSP 9500, a Tecan Cavro® Omni Flex, a Tricontinent TriTon (XYZ), or an Aurora VersaTM).
  • kits suitable for use in the methods of the disclosure are provided.
  • a kit can comprise, for example, a set of two or more labeled probe molecules (e.g., 2 to 20 probe molecules, 2 to 10 probe molecules, 2 to 5 probe molecules, 5 to 10 probe molecules, or 10 to 20 probe molecules) suitable for use in a real-time PCR reaction as described herein.
  • a set of two or more labeled probe molecules e.g., 2 to 20 probe molecules, 2 to 10 probe molecules, 2 to 5 probe molecules, 5 to 10 probe molecules, or 10 to 20 probe molecules
  • a kit can comprise (1) a probe molecule whose nucleotide sequences comprises SEQ ID NO:1 and a probe molecule whose nucleotide sequence comprises SEQ ID NO:2; (2) a probe molecule whose nucleotide sequences comprises SEQ ID NO:3 and a probe molecule whose nucleotide sequence comprises SEQ ID NO:4; or (3) a probe molecule whose nucleotide sequences comprises SEQ ID NO:5, a probe molecule whose nucleotide sequence comprises SEQ ID NO:6, and a probe molecule whose nucleotide sequence comprises SEQ ID NO:7.
  • a kit comprises a combination of the probe molecules of (1) and (2).
  • a kit comprises a combination of the probe molecules of (1) and (3). In other embodiments, a kit comprises a combination of the probe molecules of (2) and (3). In yet other embodiments, a kit comprises a combination of the probe molecules of (1), (2), and (3).
  • a kit can comprise, for example, a set of two or more probe molecules (e.g., 2 to 20 probe molecules, 2 to 10 probe molecules, 2 to 5 probe molecules, 5 to 10 probe molecules, or 10 to 20 probe molecules) suitable for use on an array as described herein (e.g., unlabeled probe molecules).
  • probe molecules e.g., 2 to 20 probe molecules, 2 to 10 probe molecules, 2 to 5 probe molecules, 5 to 10 probe molecules, or 10 to 20 probe molecules
  • a kit can comprise (1) a probe molecule whose nucleotide sequences comprises SEQ ID NO:1 and a probe molecule whose nucleotide sequence comprises SEQ ID NO:2; (2) a probe molecule whose nucleotide sequences comprises SEQ ID NO:3 and a probe molecule whose nucleotide sequence comprises SEQ ID NO:4; or (3) a probe molecule whose nucleotide sequences comprises SEQ ID NO:5, a probe molecule whose nucleotide sequence comprises SEQ ID NO:6, and a probe molecule whose nucleotide sequence comprises SEQ ID NO:7.
  • a kit comprises a combination of the probe molecules of (1) and (2).
  • a kit comprises a combination of the probe molecules of (1) and (3). In other embodiments, a kit comprises a combination of the probe molecules of (2) and (3). In yet other embodiments, a kit comprises a combination of the probe molecules of (1), (2), and (3).
  • kits can comprise an array as described herein.
  • Kits as described herein can further comprise one or more reagents for performing a PCR reaction e.g., one or more (e.g., two) primers for amplifying homologous genomic sequences, and/or one or more reagents for performing a hybridization reaction, e.g., a wash buffer.
  • one or more reagents for performing a PCR reaction e.g., one or more (e.g., two) primers for amplifying homologous genomic sequences
  • a hybridization reaction e.g., a wash buffer.
  • Kits as described herein can further comprise one or more reagents and/or one or more devices for preparing a sample for a PCR amplification reaction, e.g., a lysis buffer or a bead beating system.
  • Kits as described herein can further comprise one or more containers and/or instructions for using the components of the kit to perform some or all of the steps of a method as described herein.
  • Example 1 Virtual probes for Coagulase Negative Staphylococci (CNS)
  • Staphylococcus aureus is a coagulase positive species and is a normal member of the microbiota of the body.
  • S. aureus can become an opportunistic pathogen, causing skin infections, respiratory infections, and food poisoning.
  • There are a few other coagulase positive Staphylococci but they usually play no major role in disease and therefore can be neglected for most analytical purposes.
  • AIIStaph-146abp having the nucleotide sequence CCAGTCTTATAGGTAGGTTAYCCACG (SEQ ID NO:1)
  • AIIStaph-146abp is a genus probe molecule and cannot by itself distinguish S. aureus from coagulase negative species in a sample.
  • the numbers present in the probe molecule names used in the examples refer to the distance in number of nucleotides between a forward PCR primer for making an amplicon that can be probed with the probe molecule and the start of the probe.
  • a second oligonucleotide probe “Sau-71 p” (having the nucleotide sequence GCTTCTCGTCCGTTCGCTCG (SEQ ID NO:2)) is a 16S rRNA probe molecule that provides a positive signal with amplicons from S. aureus but does not provide a positive signal with amplicons from coagulase negative Staphylococcus.
  • an exemplary virtual probe for coagulase negative Staphylococcus species can consist of AIIStaph-146abp and Sau-71 p.
  • the sample from which the PCR amplification product was made contains a coagulase negative Staphylococcus species (see, Fig. 10A).
  • the virtual probe can include additional probe molecules.
  • the sample can be determined to contain a coagulase negative Staphylococcus species when a probe specific for S. hyicus is also not positive (which can be represented as “AIIStaph-146abp NOT Sau71 P NOT S. hyicus’”) (see, Fig. 10B).
  • Example 2 Virtual Probes for Differentiating Streptococcus anginosus and Streptococcus gordonii
  • Streptococcus gordonii is a bacterium normally found in the human mouth. S. gordonii is usually harmless in the mouth, but can cause acute bacterial endocarditis upon entry to the bloodstream. Streptococcus anginosus is also a member of the human microbiota, and is known to cause infections in immunocompromised individuals.
  • oligonucleotide probe molecules have been made that can be used in virtual probes for distinguishing between S. anginosus and S. gordonii in a sample.
  • An exemplary virtual probe for S. gordonii and S. anginosus consists of Stango85p and Sang156p.
  • Streptococcus mitis and Streptococcus pneumoniae are almost identical in their 16S rRNA, thereby making it difficult to distinguish between the two species using single oligonucleotide probe molecules for 16S rRNA.
  • the first probe “AllStrep- 261 p,” having the nucleotide sequence AGCTAATACAACGCAGGTCCATCT (SEQ ID NO:5), is a genus probe molecule that cannot distinguish between different Streptococci species.
  • the second probe “Spneu-229p,” having the nucleotide sequence GATGCAAGTGCACCTTTTAAGCAA (SEQ ID NO:6), although comprising a genomic sequence from S. pneumoniae, cannot by itself be used to distinguish between S. mitis and S. pneumoniae.
  • the third probe “Spneu-229bp,” having the nucleotide sequence GATGCAAGTGCACCTTTTAAGTAA (SEQ ID NO:7), differs from Spneu-229p by a single nucleotide to account for a SNP in S. pneumoniae.
  • mitis produced a signal pattern that can be represented as “(Spneu-229p OR Spneu-229bp) ⁇ (AllStrep- 261 p)/3,” while probing PCR amplification products produced from a sample containing S. pneumoniae produced a signal pattern that can be represented as “(Spneu-229p AND Spneu-229bp) > (AIIStrep-261 p)/3”.
  • VGS The viridians Streptococci group
  • the viridians Streptococci group is one of the major group of clinically relevant gram positive bacteria with over 24 species which are arranged in five sub-groups, the Streptococcus bovis group, Streptococcus anginosus group, Streptococcus salivarius group, Streptococcus mitis group and the Streptococcus mutans group.
  • VGS group bacteria can cause pneumonia and sepsis in immunocompromised patients.
  • VGS species as a group appear genetically heterogeneous, suggesting that different species can be detected with a single probe.
  • Multiple probes were designed for the different VGS group bacteria, but a few of the species showed cross-reactivity with probes for other VGS sub-groups (see, FIG. 12, which shows cross-reactivity of S. pneumoniae, with S. mitis probe Smit-79p, and which shows S. mitis and S. oralis cross-reactivity with S. pneumoniae probes Spneu-229p and Spneu-229bp).
  • FIG. 12 shows cross-reactivity of S. pneumoniae, with S. mitis probe Smit-79p, and which shows S. mitis and S. oralis cross-reactivity with S. pneumoniae probes Spneu-229p and Spneu-229bp.
  • a virtual probe for S. mitis group bacteria that can distinguish between S. pneumoniae and S. mitis/S. oralis was designed:
  • S. mitis sub-group (Spar-205p AND AIIStrep-261 p) OR (Smit-79p AND AHStrep- 261 p AND NOT Shyo-193p) OR (Ssang-193p AND AIIStrep-261 p AND NOT Stmu-86p) OR (Stango-85p AND NOT Sang-156p AND AIIStrep-261 p>0.01) AND IF Smit-79p THEN (Spneu-229bp / AIIStrep-261 p) / AIIStrep-261 p ⁇ 3.
  • Example 5 Virtual Probe for Detecting species from Enterobacteriaceae group
  • Enterobacteriaceae is a large family of gram negative bacteria that includes both pathogenic and non-pathogenic species.
  • Pathogenic family members include Klebsiella species, Enterobacter species, Escherichia species, Citrobacter species, Serratia species and Salmonella species.
  • the 16s rRNA region for members of the family has minimal genetic sequence variation among the species making it difficult to design 16S probes capable of distinguishing between species.
  • a common probe, “Entb-132p,” was designed which can identify most family members as Enterobacteriace, except Pantoea species which can be identified the probe “Entb-299p.”
  • Two group probes were designed following the clustering pattern of the species from Enterobacteriace species in the 16S rRNA genomic region.
  • Species from Enterobacter and Klebsiella genera can be identified by the probe “Enklspss-95p” and species from Citrobacter, Salmonella and Escherichia genera can be identified by the probe “SaEsCi- 91 p.” Because of the difficulty in designing single probes capable of distinguishing between Enterobacteriaceae species, a combination of 16S rRNA and ITS probes were designed (see FIG. 13 and FIG. 14) for hierarchical identification and differentiation of Enterobacteriaceae species.
  • 16s-23s ITS region made it possible to differentiate the species of Enterobacter cloacae complex, which includes E. cloacae, E. asburiae, and E. hormaechei.
  • This example describes improved formulas for distinguishing and identifying CNS and coagulase positive S. aureus using a virtual probe comprising the AIIStaph-146abp and Sau-71 p probes described in Example 1 .
  • AIIStaph-146abp/Sau-71 p signal ratios can be used to distinguish samples containing CNS from samples containing S. aureus. For example, it was found that a signal ratio cutoff value of 2 could be used to distinguish samples containing CNS from samples containing S. aureus with good accuracy (e.g., identifying a sample as containing CNS when the AllStaph- 146abp/Sau-71 p signal ratio is > 2, and identifying a sample containing S. aureus when the AIIStaph-146abp/Sau-71 p signal ratio is ⁇ 2) (data not shown).
  • a signal ratio cutoff value of 2 could be used to distinguish samples containing CNS from samples containing S. aureus with good accuracy (e.g., identifying a sample as containing CNS when the AllStaph- 146abp/Sau-71 p signal ratio is > 2, and identifying a sample containing S. aureus when the AIIStaph-146a
  • FIGS. 16A-16B show that as the concentration of a CNS (S. hominis) and S.
  • an AIIStaph-146abp/Sau-71 p signal ratio cutoff value (e.g., 2) that may accurately distinguish between CNS and S. aureus at a low sample concentration may lead to an incorrect identification at a high sample concentration.
  • AIIStaph-146abp/Sau-71 p signal ratios were observed to decrease as sample concentration was increased, it was also observed that for all studied concentrations the AIIStaph-146abp/Sau-71 p ratio remained higher for CNS than for S. aureus.
  • an appropriate cutoff value for the AIIStaph-146abp/Sau-71 p signal ratio can be used to distinguish CNS and S. aureus, even at high sample concentrations.
  • the genus probe AIIStaph-146abp can be used a meter probe to provide a relative measure of target nucleic acid concentration. Formulas for identifying CNS and S. aureus that account for this relative measure of target nucleic acid concentration were designed:
  • a raw AIIStaph-146abp signal of 0.2 is a threshold value (which was determined empirically from the raw signal data; see, FIG. 17).
  • FIG. 17 plots AllStaph- 146abp signal data on the X-axis and AIIStaph-146abp/Sau-71 signal ratios on the Y-axis for samples containing S. aureus.
  • a threshold value of 0.2 was selected because AHStaph- 146abp/Sau-71 signal ratios were observed to generally be relatively high when the AIIStaph-146abp signal was below 0.2 and relatively low when the AIIStaph-146abp signal was above 0.2.
  • cutoff values for the AHStaph- 146abp/Sau-71 signal ratio that are indicative of the presence of CNS or S. aureus were selected for use when AIIStaph-146abp signal levels are at or above 0.2 and for use when AIIStaph-146abp signal levels are below 0.2. Cutoff values of 1 .5 (for use when the AllStaph- 146abp signal is greater than or equal to 0.2) and 3 (for use when the AIIStaph-146 signal is less than 0.2) were selected. Data used for the selection of the threshold and cutoff values are shown in Tables 3A-3B.
  • CNS when the signal from AIIStaph-146abp is greater than or equal to the threshold value of 0.2 (indicative of a relatively high concentration of target nucleic acids), CNS is determined to be present when the ratio of raw signals for Allstaph- 146abp/Sau-71 p is greater than or equal to 1.5; if the ratio is less than 1.5, S. aureus is determined to be present.
  • CNS is determined to be present in the sample when the ratio of raw signals for AIIStaph-146abp/Sau-71 p is greater than 3; if the ratio is less than or equal to 3, S. aureus is determined to be present.
  • the above formula represents the combination of a first formula (AllStaph- 146abp/Sau-71 p>1.5) and a second formula (AIIStaph-146abp/Sau-71 p>3), with the signal for AIIStaph-146abp (specifically, how the signal compares to the threshold value) determining which formula is used to identify a sample as containing CNS or S. aureus.
  • Using formulas that account for the relative concentration of target nucleic acids in the sample can increase the accuracy of CNS and S. aureus identification compared to a formula based simply on a single AIIStaph-146abp/Sau-71 p signal ratio cutoff.
  • the virtual probe comprises at least two probe molecules each capable of specifically hybridizing to one or more target nucleic acids corresponding to the first genome and/or one or more homologous target nucleic acids corresponding to the second genome,
  • At least one of the probe molecules is a meter probe capable of hybridizing to a target nucleic acid corresponding to the first genome and a homologous target nucleic acid corresponding to the second genome, such that the hybridizing of the meter probe to such target nucleic acids can provide a measure of the relative amount of target nucleic acids in the test sample,
  • the probe molecules of the virtual probe cannot individually distinguish the target nucleic acids corresponding to the first and second genomes
  • the probe molecules hybridize non-identically to the target nucleic acids corresponding to the first and second genomes, such that the hybridizing of the probe molecules to the target nucleic acids corresponding to the first and second genomes can distinguish between the target nucleic acids corresponding to the first and second genomes;
  • step (i) analyzing the signals detected and/or quantified in step (b) according to a first formula if the signal from hybridization of the meter probe to nucleic acids in the sample is greater than or equal to a threshold value
  • step (ii) analyzing the signals detected and/or quantified in step (b) according to a second formula if the signal from hybridization of the meter probe to nucleic acids in the sample is less than the threshold value.
  • each probe molecule in the virtual probe is capable of specifically hybridizing to one or more amplicons in the first amplicon set and/or the second amplicon set, and wherein the probe molecules hybridize non-identically to the amplicons in the first amplicon set and the amplicons in the second amplicon set, such that the hybridizing of the probe molecules to the amplicons in the first amplicon set and the second amplicon set can distinguish between the first amplicon set and the second amplicon set.
  • PCR primers comprise more than one primer pair and wherein the first amplicon set comprises a plurality of first amplicons and/or the second amplicon set comprises a plurality of second amplicons.
  • the method of embodiment 5, which further comprises preparing the test sample by (a) performing a first PCR amplification reaction on the initial sample using a first set of PCR primers capable of hybridizing to, and initiating a PCR amplification from, both the first genome and the second genome, (b) performing a second PCR amplification reaction on the initial sample using a second set of PCR primers which is different from the first set of PCR primers and which is capable of hybridizing to, and initiating a PCR amplification from, both the first genome and the second genome, and (c) combining the amplicons produced in the first and second PCR reactions, resulting in a first amplicon set comprising a plurality of first amplicons and a second amplicon set comprising a plurality of second amplicons, respectively, when the first genome and second genome are present in the initial sample.
  • the plurality of first amplicons corresponds to different regions in the first genome and/or the plurality of second amplicons correspond
  • PCR primers comprise a single primer pair and the first amplicon set consists of a single first amplicon and the second amplicon set consists of a single second amplicon.
  • nucleotide sequence of the first amplicon and the nucleotide sequence of the second amplicon have at least 1 nucleotide mismatch in the regions of the amplicons capable of hybridizing to at least one probe molecule in the virtual probe.
  • nucleotide sequence of the first amplicon and the nucleotide sequence of the second amplicon have at least 2 nucleotide mismatches in the regions of the amplicons capable of hybridizing to at least one probe molecule in the virtual probe.
  • nucleotide sequence of the first amplicon and the nucleotide sequence of the second amplicon have at least 3 nucleotide mismatches in the regions of the amplicons capable of hybridizing to at least one probe molecule in the virtual probe.
  • each probe molecule comprises a nucleotide sequence that is 90% to 100% complementary to 15 to 40 consecutive nucleotides in the first genome and/or second genome.
  • step (b) comprises:
  • the PCR amplification reaction is a real-time PCR amplification reaction.
  • each probe molecule comprises a distinguishable label and a quencher moiety that inhibits detection of the label when the label and quencher moiety are both attached to the probe;
  • the label produces a measurable signal upon cleavage of the probe molecule during the real-time PCR amplification reaction
  • each label is distinguishable from each other label.
  • first amplicon set and the second amplicon set each comprise a nucleotide sequence corresponding to a 16S rRNA gene.
  • the virtual probe comprises (i) a first probe molecule capable of specifically hybridizing to a first target nucleic acid (e.g., a first amplicon in the first amplicon set when the target nucleic acids are PCR products) and a second target nucleic acid (e.g., a second amplicon in the second amplicon set when the target nucleic acids are PCR products) and (ii) a second probe molecule that is capable of specifically hybridizing to the second target nucleic acid but not the first target nucleic acid.
  • a first target nucleic acid e.g., a first amplicon in the first amplicon set when the target nucleic acids are PCR products
  • a second target nucleic acid e.g., a second amplicon in the second amplicon set when the target nucleic acids are PCR products
  • a second probe molecule that is capable of specifically hybridizing to the second target nucleic acid but not the first target nucleic acid.
  • the virtual probe comprises (i) a first probe molecule capable of specifically hybridizing to a first target nucleic acid (e.g., a first amplicon in the first amplicon set when the target nucleic acids are PCR products) and a second target nucleic acid (e.g., a second amplicon in the second amplicon set when the target nucleic acids are PCR products) and (ii) a second probe molecule that is capable of specifically hybridizing to the first target nucleic acid and the second target nucleic acid.
  • a first target nucleic acid e.g., a first amplicon in the first amplicon set when the target nucleic acids are PCR products
  • a second target nucleic acid e.g., a second amplicon in the second amplicon set when the target nucleic acids are PCR products
  • a second probe molecule that is capable of specifically hybridizing to the first target nucleic acid and the second target nucleic acid.
  • the virtual probe comprises a probe molecule having a nucleotide sequence comprising CCAGTCTTATAGGTAGGTTAYCCACG (SEQ ID NO:1).
  • the probe molecule having a nucleotide sequence comprising CCAGTCTTATAGGTAGGTTAYCCACG (SEQ ID NO:1) is a meter probe.
  • the virtual probe comprises a probe molecule having a nucleotide sequence comprising GCTTCTCGTCCGTTCGCTCG (SEQ ID NO:2).
  • the virtual probe comprises a first probe molecule having a nucleotide sequence comprising CCAGTCTTATAGGTAGGTTAYCCACG (SEQ ID NO:1) and a second probe molecule having a nucleotide sequence comprising GCTTCTCGTCCGTTCGCTCG (SEQ ID NO:2), wherein the first formula and second formula combines signals from the first probe and the second probe in a signal ratio.
  • the virtual probe comprises a probe molecule having a nucleotide sequence comprising CAGTCTATGGTGTAGCAAGCTACGGTAT (SEQ ID NO:3).
  • the virtual probe comprises a probe molecule having a nucleotide sequence comprising TATCCCCCTCTAATAGGCAGGTTA (SEQ ID NO:4).
  • the virtual probe comprises a probe molecule having a nucleotide sequence comprising GATGCAAGTGCACCTTTTAAGTAA (SEQ ID NOT).
  • the virtual probe comprises a probe molecule having a nucleotide sequence comprising AGCTAATACAACGCAGGTCCATCT (SEQ ID NO:5).
  • the virtual probe comprises (i) a first probe molecule capable of specifically hybridizing to a first target nucleic acid (e.g., a first amplicon in the first amplicon set when the target nucleic acids are PCR products) and a second target nucleic acid (e.g., a second amplicon in the second amplicon set when the target nucleic acids are PCR products), (ii) a second probe molecule which is different from the first probe molecule and that capable of specifically hybridizing to the first and second target nucleic acids, and (iii) a third probe molecule which is different from the first and second probe molecules and that is capable of specifically hybridizing to the first and second target nucleic acids.
  • a first target nucleic acid e.g., a first amplicon in the first amplicon set when the target nucleic acids are PCR products
  • a second target nucleic acid e.g., a second amplicon in the second amplicon set when the target nucleic acids are
  • the virtual probe comprises a probe molecule having a nucleotide sequence comprising GATGCAAGTGCACCTTTTAAGCAA (SEQ ID NO:6).
  • the virtual probe comprises a probe molecule having a nucleotide sequence comprising AGCTAATACAACGCAGGTCCATCT (SEQ ID NO:5).
  • the probe molecule having a nucleotide sequence comprising AGCTAATACAACGCAGGTCCATCT (SEQ ID NO:5) is a meter probe.
  • the initial sample or test sample is a biological sample selected from blood, serum, saliva, urine, gastric fluid, digestive fluid, tears, stool, semen, vaginal fluid, interstitial fluid, fluid derived from tumorous tissue, ocular fluid, sweat, mucus, earwax, oil, glandular secretions, breath, spinal fluid, hair, fingernails, skin cells, plasma, fluid obtained from a nasal swab, fluid obtained from a nasopharyngeal wash, cerebrospinal fluid, a tissue sample, fluid or tissue obtained from a throat swab, fluid or tissue obtained from a wound swab, biopsy tissue, placental fluid, amniotic fluid, peritoneal dialysis fluid, cord blood, lymphatic fluids, cavity fluids, sputum, pus, microbiota, meconium, breast milk, or a sample processed, extracted or fractionated from any of the foregoing.
  • the initial sample or test sample is a biological sample selected from blood, serum, saliva
  • the initial sample or test sample is an environmental sample selected from soil, groundwater, surface water, wastewater, or a sample processed, extracted or fractionated from any of the foregoing.
  • step (c) is computer- implemented and wherein step (c) comprises executing, in a computer system having one or more processors coupled to a memory storing one or more computer readable instructions for execution by the one or more processors, the one or more computer readable instructions comprising instructions for: (i) receiving signal data from hybridization of the probe molecules in the virtual probe to nucleic acids, if any, in the test sample and (ii) analyzing the signal data according to the first formula if the signal data from hybridization of the meter probe to nucleic acids in the sample is greater than or equal to a threshold value and analyzing the signal data according to the second formula if the signal from hybridization of the meter probe to nucleic acids in the sample is less than the threshold value.
  • An addressable array comprising:
  • each virtual probe comprising a group of positionally addressable oligonucleotide probe molecules, each at a discrete location on the array, wherein each probe molecule in the one or more virtual probes comprises a nucleotide sequence that is 90% to 100% complementary to 15 to 40 consecutive nucleotides in the first genomic sequence or second genomic sequence, wherein one or more of the oligonucleotide probe molecules are meter probes;
  • control probe molecules optionally, one or more control probe molecules.
  • the addressable array of embodiment 99 or embodiment 100 which comprises at least two virtual probes.
  • 102 The addressable array of embodiment 99 or embodiment 100, which comprises at least three virtual probes.
  • each virtual probe comprises a meter probe.
  • each virtual probe comprises 2-4 oligonucleotide probe molecules.
  • each virtual probe comprises 2-3 oligonucleotide probe molecules.
  • the addressable array of embodiment 111 which comprises 12 to 100 probe molecules.
  • invention 111 which comprises 12 to 50 probe molecules.
  • the addressable array of embodiment 111 which comprises 50 to 100 probe molecules.
  • 116. The addressable array of embodiment 111 which comprises 12 probe molecules.
  • each probe molecule is capable of specifically hybridizing to one or more target nucleic acids corresponding to the first genome and/or one or more homologous target nucleic acids corresponding to the second genome,
  • At least one of the probe molecules of the virtual probe is a meter probe capable of hybridizing to a target nucleic acid corresponding to the first genome and a homologous target nucleic acid corresponding to the second genome, such that the hybridizing of the meter probe to such target nucleic acids can provide a measure of the relative amount of target nucleic acids in the test sample,
  • the probe molecules hybridize non-identically to the target nucleic acids corresponding to the first and second genomes, such that the hybridizing of the probe molecules to the one or more target nucleic acids corresponding to the first and second genomes can distinguish between the target nucleic acids corresponding to the first and second genomes;
  • target nucleic acids that hybridize to the probe molecules of the array are present in the test sample, analyzing the signals to determine if target nucleic acids corresponding to the first genome or target nucleic acids corresponding to the second genome are present in the sample, thereby determining if the first organism or second organism are present in the initial sample or the test sample; or
  • step (ii) no target products that hybridize to the probe molecules of the virtual probe are produced in step (a), determining that that initial sample or test sample does not contain the first organism or the second organism.
  • step (d)(i) comprises (i) analyzing the signals detected and/or quantified in step (c) according to a first formula if the signal from the meter probe is greater than or equal to a threshold value, and (ii) analyzing the signals detected and/or quantified in step (c) according to a second formula if the signal from the meter probe is less than the threshold value
  • each probe molecule in the virtual probe is capable of specifically hybridizing to one or more amplicons in the first amplicon set and/or the second amplicon set, and wherein the probe molecules hybridize non-identically to the amplicons in the first amplicon set and the amplicons in the second amplicon set, such that the hybridizing of the probe molecules to the amplicons in the first amplicon set and the second amplicon set can distinguish between the first amplicon set and the second amplicon set.
  • the method of embodiment 144 which further comprises preparing the test sample by performing a PCR amplification reaction on the initial sample using PCR primers capable of hybridizing to, and initiating a PCR amplification from, both the first genome and the second genome, resulting in the first amplicon set and a second amplicon set, respectively, when the first genome and second genome are present in the sample.
  • PCR primers capable of hybridizing to, and initiating a PCR amplification from, both the first genome and the second genome, resulting in the first amplicon set and a second amplicon set, respectively, when the first genome and second genome are present in the sample.
  • step (d) is computer-implemented and wherein step (d) comprises executing, in a computer system having one or more processors coupled to a memory storing one or more computer readable instructions for execution by the one or more processors, the one or more computer readable instructions comprising instructions for: (i) receiving signal data from hybridization of the probe molecules in the virtual probe to nucleic acids, if any, in the test sample and (ii) analyzing the signal data according to the first formula if the signal data from hybridization of the meter probe to nucleic acids in the sample is greater than or equal to a threshold value and analyzing the signal data according to the second formula if the signal from hybridization of the meter probe to nucleic acids in the sample is less than the threshold value.
  • a system comprising:
  • At least one of the processors is configured to receive signal data from the optical reader
  • At least one of the processors is configured to analyze signal data for the one or more virtual probes generated following hybridization of the probe molecules in the virtual probes to nucleic acid molecules, if present, in a test sample, optionally wherein the analysis comprises determining whether the test sample contains the first genomic sequence and/or the second genomic sequence;
  • At least one of the processors has an interface to a storage or display device or network for outputting a result of the analysis.
  • invention 149 The system of embodiment 148, which comprises a light source capable of exciting fluorescently labeled molecules on the array.
  • invention 150 The system of embodiment 148 or embodiment 149, further comprising a plate handling robot capable of adding the product of a PCR amplification reaction to the array and capable of washing unbound nucleic acid molecules from the array.
  • a method for preparing a test sample for detecting if a first microorganism having a first genome or a second microorganism of the same genus having a second genome is present in an initial sample comprising:
  • step (b) contacting the PCR amplification product of step (a) with an array according to any one of embodiments 99 to 141 comprising a virtual probe for distinguishing a first genomic sequence of the first microorganism from a second, homologous genomic sequence of the second microorganism, wherein the virtual probe comprises a meter probe;
  • step (i) analyzing the signals detected and/or quantified in step (d) according to a first formula if the signal from the meter probe is greater than or equal to a threshold value
  • step (ii) analyzing the signals detected and/or quantified in step (d) according to a second formula if the signal from the meter probe is less than the threshold value.
  • An oligonucleotide probe molecule whose nucleotide sequence comprises CAGTCTATGGTGTAGCAAGCTACGGTAT (SEQ ID NO:3).
  • An oligonucleotide probe molecule whose nucleotide sequence comprises TATCCCCCTCTAATAGGCAGGTTA (SEQ ID NO:4).
  • An oligonucleotide probe molecule whose nucleotide sequence comprises AGCTAATACAACGCAGGTCCATCT (SEQ ID NO:5).
  • An oligonucleotide probe molecule whose nucleotide sequence comprises GATGCAAGTGCACCTTTTAAGCAA (SEQ ID NO:6).
  • An oligonucleotide probe molecule whose nucleotide sequence comprises GATGCAAGTGCACCTTTTAAGTAA (SEQ ID NO:7).
  • a virtual probe comprising a plurality of oligonucleotide probe molecules, wherein at least one oligonucleotide probe molecule in the virtual probe has a nucleotide sequence comprising CCAGTCTTATAGGTAGGTTAYCCACG (SEQ ID NO:1) and another oligonucleotide molecule in the virtual probe has a nucleotide sequence comprising GCTTCTCGTCCGTTCGCTCG (SEQ ID NO:2).
  • a virtual probe comprising a plurality of oligonucleotide probe molecules, wherein at least one oligonucleotide probe molecule in the virtual probe has a nucleotide sequence comprising CAGTCTATGGTGTAGCAAGCTACGGTAT (SEQ ID NO:3) and another oligonucleotide molecule in the virtual probe has a nucleotide sequence comprising TATCCCCCTCTAATAGGCAGGTTA (SEQ ID NO:4).
  • a virtual probe comprising a plurality of oligonucleotide probe molecules, wherein at least one oligonucleotide probe molecule in the virtual probe has a nucleotide sequence comprising AGCTAATACAACGCAGGTCCATCT (SEQ ID NO:5), another oligonucleotide molecule in the virtual probe has a nucleotide sequence comprising GATGCAAGTGCACCTTTTAAGCAA (SEQ ID NO:6) and another oligonucleotide molecule in the virtual probe has a nucleotide sequence comprising GATGCAAGTGCACCTTTTAAGTAA (SEQ ID NOT).
  • each oligonucleotide probe molecule comprises a poly-thymidine tail.
  • An addressable array comprising:
  • control probe molecules optionally, one or more control probe molecules.
  • An addressable array comprising the virtual probe of any one of embodiments 167 to 172, wherein each probe molecule in the virtual probe is at a discrete location in the array.
  • a kit comprising two or more probe molecules selected from probe molecules whose nucleotide sequence comprises SEQ ID NO:1 , SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, or SEQ ID NO:7.
  • kit of embodiment 176 which comprises an oligonucleotide probe molecule whose nucleotide sequence comprises SEQ ID NO:1 and an oligonucleotide probe molecule whose nucleotide sequence comprises SEQ ID NO:2.
  • kit of embodiment 176 which comprises an oligonucleotide probe molecule whose nucleotide sequence comprises SEQ ID NO:3 and an oligonucleotide probe molecule whose nucleotide sequence comprises SEQ ID NO:4.
  • kit of embodiment 176 which comprises an oligonucleotide probe molecule whose nucleotide sequence comprises SEQ ID NO:5, an oligonucleotide probe molecule whose nucleotide sequence comprises SEQ ID NO:6, and an oligonucleotide probe molecule whose nucleotide sequence comprises SEQ ID NOT. 180.
  • the kit of embodiment 176 which comprises an oligonucleotide probe molecule whose nucleotide sequence comprises SEQ ID NO:1 , an oligonucleotide probe molecule whose nucleotide sequence comprises SEQ ID NO:2, an oligonucleotide probe molecule whose nucleotide sequence comprises SEQ ID NO:3 and an oligonucleotide probe molecule whose nucleotide sequence comprises SEQ ID NO:4.
  • the kit of embodiment 176 which comprises an oligonucleotide probe molecule whose nucleotide sequence comprises SEQ ID NO:1 , an oligonucleotide probe molecule whose nucleotide sequence comprises SEQ ID NO:2, an oligonucleotide probe molecule whose nucleotide sequence comprises SEQ ID NO:5, an oligonucleotide probe molecule whose nucleotide sequence comprises SEQ ID NO:6, and an oligonucleotide probe molecule whose nucleotide sequence comprises SEQ ID NOT.
  • kit of embodiment 176 which comprises an oligonucleotide probe molecule whose nucleotide sequence comprises SEQ ID NO:3, an oligonucleotide probe molecule whose nucleotide sequence comprises SEQ ID NO:4, an oligonucleotide probe molecule whose nucleotide sequence comprises SEQ ID NO:5, an oligonucleotide probe molecule whose nucleotide sequence comprises SEQ ID NO:6, and an oligonucleotide probe molecule whose nucleotide sequence comprises SEQ ID NOT.
  • the kit of embodiment 176 which comprises an oligonucleotide probe molecule whose nucleotide sequence comprises SEQ ID NO:1 , an oligonucleotide probe molecule whose nucleotide sequence comprises SEQ ID NO:2, an oligonucleotide probe molecule whose nucleotide sequence comprises SEQ ID NO:3, an oligonucleotide probe molecule whose nucleotide sequence comprises SEQ ID NO:4 an oligonucleotide probe molecule whose nucleotide sequence comprises SEQ ID NO:5, an oligonucleotide probe molecule whose nucleotide sequence comprises SEQ ID NO:6, and an oligonucleotide probe molecule whose nucleotide sequence comprises SEQ ID NOT.
  • kit of any one of embodiments 176 to 186 which further comprises one or more PCR primer pairs capable of amplifying a first genomic sequence, and a second, homologous genomic sequence.

Abstract

L'invention concerne des procédés d'identification de séquences génomiques homologues qui peuvent être présentes dans un échantillon à l'aide de sondes virtuelles, des réseaux pour distinguer des séquences génomiques homologues, des systèmes pour distinguer des séquences génomiques homologues, et des molécules de sonde utiles dans les procédés, les réseaux et les systèmes de l'invention.
EP22705127.3A 2021-01-20 2022-01-20 Détection améliorée de séquences génomiques et molécules sondes associées Pending EP4298242A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US202163139643P 2021-01-20 2021-01-20
PCT/IB2022/050494 WO2022157672A1 (fr) 2021-01-20 2022-01-20 Détection améliorée de séquences génomiques et molécules sondes associées

Publications (1)

Publication Number Publication Date
EP4298242A1 true EP4298242A1 (fr) 2024-01-03

Family

ID=80218492

Family Applications (1)

Application Number Title Priority Date Filing Date
EP22705127.3A Pending EP4298242A1 (fr) 2021-01-20 2022-01-20 Détection améliorée de séquences génomiques et molécules sondes associées

Country Status (4)

Country Link
EP (1) EP4298242A1 (fr)
CN (1) CN116745434A (fr)
TW (1) TW202246525A (fr)
WO (1) WO2022157248A1 (fr)

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5066584A (en) 1988-09-23 1991-11-19 Cetus Corporation Methods for generating single stranded dna by the polymerase chain reaction
US20030134293A1 (en) * 1999-11-16 2003-07-17 Zhiping Liu Method for rapid and accurate identification of microorganisms
US7205104B2 (en) * 2000-03-24 2007-04-17 Eppendorf Array Technologies Sa (Eat) Identification of biological (micro) organisms by detection of their homologous nucleotide sequences on arrays
CN1280422C (zh) 2004-08-26 2006-10-18 北京博奥生物芯片有限责任公司 一种不对称pcr扩增方法及其应用
US20110151453A1 (en) * 2007-07-11 2011-06-23 Bergeron Michel G Nucleic acid sequences and combination thereof for sensitive amplification and detection of bacterial and fungal sepsis pathogens
EP3181700A1 (fr) 2015-12-18 2017-06-21 Safeguard Biosystems Holdings Ltd. Réseaux polymères tridimensionnels avec des canaux situés à l'intérieur
US10036054B2 (en) 2016-01-30 2018-07-31 Safeguard Biosystems Holdings Ltd. Bead beating tube and method for extracting deoxyribonucleic acid and/or ribonucleic acid from microorganisms
EP3418741A1 (fr) 2017-06-19 2018-12-26 Safeguard Biosystems Holdings Ltd. Réseaux polymères tridimensionnels et leur utilisation
US10513730B2 (en) 2017-11-22 2019-12-24 Safeguard Biosystems Holdings Ltd. Asymmetric PCR methods, primers and kits

Also Published As

Publication number Publication date
CN116745434A (zh) 2023-09-12
TW202246525A (zh) 2022-12-01
WO2022157248A1 (fr) 2022-07-28

Similar Documents

Publication Publication Date Title
Rahman et al. Polymerase chain reaction (PCR): a short review
US20220267833A1 (en) Detection of genomic sequences using combinations of probes, probe molecules and arrays comprising the probes for the specific detection of organisms
Novais et al. The evolution of Pyrosequencing® for microbiology: from genes to genomes
JP2013531983A (ja) 多重生物検出のための核酸ならびにその使用および製造方法
BRPI0517131B1 (pt) Métodos para simplificar ácidos nucléicos microbiais por modificação química de citosinas
WO2014106076A2 (fr) Séquençage sanger universel à partir d'amplicons de séquençage de prochaine génération
EP2850205B1 (fr) Technique combiné du pcr et d'amplification isotherme à médiation par boucle pour la détéction des acides nucléiques
EP2344673B1 (fr) Essai diagnostique pour streptococcus equi
WO2013173774A2 (fr) Sondes d'inversion moléculaire
US20080228406A1 (en) System and method for fungal identification
JP5916740B2 (ja) 核酸標的の定量的多重同定
US20210017582A1 (en) Detection of genomic sequences and probe molecules therefor
Mianzhi et al. Contemporary nucleic acid-based molecular techniques for detection, identification, and characterization of Bifidobacterium
WO2022157672A1 (fr) Détection améliorée de séquences génomiques et molécules sondes associées
EP4298242A1 (fr) Détection améliorée de séquences génomiques et molécules sondes associées
CN101092645B (zh) 诊断性传染病致病原的新颖方法
US8304184B2 (en) Genotyping using multiple variant-specific primer pools
KR20100012319A (ko) 패혈증­유발 미생물의 분류 및 동정 방법
WO2013173795A1 (fr) Système de biosurveillance basé sur une séquence en temps réel
Nazir Polymerase Chain Reaction
WO2024030342A1 (fr) Procédés et compositions pour l'analyse d'acides nucléiques
JP2023511909A (ja) 細菌定量化方法
WO2021094601A1 (fr) Procédé pour déterminer la présence d'une souche hyper-virulente de clostridioides difficile du groupe b1/nap1/027 dans un échantillon
CN116356049A (zh) 一种检测艰难梭状芽孢杆菌的试剂盒
CN116356075A (zh) 一种用于检测的引物探针、引物探针组及其应用

Legal Events

Date Code Title Description
STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: UNKNOWN

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE

PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE

17P Request for examination filed

Effective date: 20231123

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

DAV Request for validation of the european patent (deleted)
DAX Request for extension of the european patent (deleted)