US20110059436A1 - Methods for sequence determination - Google Patents

Methods for sequence determination Download PDF

Info

Publication number
US20110059436A1
US20110059436A1 US12/419,660 US41966009A US2011059436A1 US 20110059436 A1 US20110059436 A1 US 20110059436A1 US 41966009 A US41966009 A US 41966009A US 2011059436 A1 US2011059436 A1 US 2011059436A1
Authority
US
United States
Prior art keywords
polymerase
monomer
tag
tags
dna
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US12/419,660
Inventor
Susan Hardin
Xiaolian Gao
James Briggs
Richard Willson
Shiao-Chun Tu
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Life Technologies Corp
Original Assignee
Life Technologies Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
First worldwide family litigation filed litigation Critical https://patents.darts-ip.com/?family=22807697&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=US20110059436(A1) "Global patent litigation dataset” by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.
Application filed by Life Technologies Corp filed Critical Life Technologies Corp
Priority to US12/419,660 priority Critical patent/US20110059436A1/en
Publication of US20110059436A1 publication Critical patent/US20110059436A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869Methods for sequencing

Definitions

  • the present invention relates to a single-molecule sequencing apparatus and methods.
  • the present invention relates to a single-molecule sequencing apparatus and methods using tagged polymerizing agents and/or tagged monomers where the tagged polymerizing agent and/or the tagged monomers undergo a change in a detectable property before, during and/or after monomer insertion into a growing polymer chain.
  • the apparatus and methods are ideally-suited for sequencing DNA, RNA, polypeptide, carbohydrate or similar bio-molecular sequences under near real-time or real-time conditions.
  • the present invention also relates to a single-molecule sequencing apparatus and methods using tagged depolymerizing agents and/or tagged depolymerizable polymer where the tagged depolymerizing agent and/or the tagged depolymerizable polymer undergo a change in a detectable property before, during and/or after monomer removal from the depolymerizable polymer chain.
  • the apparatus and methods are ideally-suited for sequencing DNA, RNA, polypeptide, carbohydrate or similar bio-molecular sequences.
  • the present invention also relates to detecting a signal evidencing interactions between the tagged polymerizing agent or depolymerizing agent and a tagged or untagged polymer subunit such as a monomer or collection of monomers, where the detected signal provides information about monomer order.
  • the methods are carried out in real-time or near real-time.
  • DNA sequence information provides scientists with information critical to a wide range of biological processes.
  • the order of bases in DNA specifies the order of bases in RNA, the molecule within the cell that directly encodes the informational content of proteins.
  • DNA sequence information is routinely used to deduce protein sequence information. Base order dictates DNA structure and its function, and provides a molecular program that can specify normal development, manifestation of a genetic disease, or cancer.
  • DNA sequence information may provide a way to uniquely identify individuals.
  • a DNA molecule is comprised of four bases, adenine (A), guanine (G), cytosine (C), and thymine (T). These bases interact with each other in very specific ways through hydrogen bonds, such that A interacts with T, and G interacts with C. These specific interactions between the bases are referred to as base-pairings. In fact, it is these base-pairings (and base stacking interactions) that stabilize double-stranded DNA.
  • the two strands of a DNA molecule occur in an antiparallel orientation, where one strand is positioned in the 5′ to 3′ direction, and the other strand is positioned in the 3′ to 5′ direction.
  • 5′ and 3′ refer to the directionality of the DNA backbone, and are critical to describing the order of the bases.
  • the convention for describing base order in a DNA sequence uses the 5′ to 3′ direction, and is written from left to right. Thus, if one knows the sequence of one DNA strand, the complementary sequence can be deduced.
  • Sanger sequencing is currently the most commonly used method to sequence DNA (Sanger et al., 1977). This method exploits several features of a DNA polymerase: its ability to make an exact copy of a DNA molecule, its directionality of synthesis (5′ to 3′), its requirement of a DNA strand (a ‘primer’) from which to begin synthesis, and its requirement for a 3′ OH at the end of the primer. If a 3′ OH is not available, then the DNA strand cannot be extended by the polymerase.
  • ddNTP dideoxynucleotide
  • ddNTP dideoxynucleotide
  • ddATP dideoxynucleotide
  • ddTTP dideoxynucleotide
  • ddGTP dideoxynucleotide
  • ddCTP dideoxynucleotide
  • the automated sequencer is used to separate sequencing reaction products, detect and collect (via computer) the data from the reactions, and analyze the order of the bases to automatically deduce the base sequence of a DNA fragment.
  • Automated sequencers detect extension products containing a fluorescent tag. Sequence read lengths obtained using an automated sequencer are dependent upon a variety of parameters, but typically range between 500 to 1,000 bases (3-18 hours of data collection). At maximum capacity an automated sequencer can collect data from 96 samples in parallel.
  • dye-labeled terminator chemistry When dye-labeled terminator chemistry is used to detect the sequencing products, base identity is determined by the color of the fluorescent tag attached to the ddNTP. After the reaction is assembled and processed through the appropriate number of cycles (3-12 hours), the extension products are prepared for loading into a single lane on an automated sequencer (unincorporated, dye-labeled ddNTPs are removed and the reaction is concentrated; 1-2 hours).
  • An advantage of dye-terminator chemistry is that extension products are visualized only if they terminate with a dye-labeled dNTP; prematurely terminated products are not detected. Thus, reduced background noise typically results with this chemistry.
  • dRhodamine dichlororhodamine
  • the d-Rhodamine acceptor dyes associated with the terminators are dichloro[R110], dichloro[R6G], dichloro[TAMRA] or dichloro[ROX], for the G-, A-, T- or C-terminators, respectively.
  • the donor dye (6-FAM) efficiently absorbs energy from the argon ion laser in the automated sequencing machine and transfers that energy to the linked acceptor dye.
  • the linker connecting the donor and acceptor portions of the terminator is optimally spaced to achieve essentially 100% efficient energy transfer.
  • the fluorescence signals emitted from these acceptor dyes exhibit minimal spectral overlap and are collected by an ABI PRISM 377 DNA sequencer using 10 nm virtual filters centered at 540, 570, 595 and 625 nm, for G-, A-, T- or C-terminators, respectively.
  • energy transfer dye-labeled terminators produce brighter signals and improve spectral resolution.
  • the predominant enzyme used in automated DNA sequencing reactions is a genetically engineered form of DNA polymerase I from Therms aquaticus .
  • This enzyme, AmpliTaq DNA Polymerase, FS was optimized to more efficiently incorporate ddNTPs and to eliminate the 3′ to 5′ and 5′ to 3′ exonuclease activities.
  • Replacing a naturally occurring phenylalanine at position 667 in T. aquaticus DNA polymerase with a tyrosine reduced the preferential incorporation of a dNTP, relative to a ddNTP (Tabor and Richardson, 1995; Reeve and Fuller, 1995).
  • a single hydroxyl group within the polymerase is responsible for discrimination between dNTPs and ddNTPs.
  • the 3′ to 5′ exonuclease activity which enables the polymerase to remove a mis-incorporated base from the newly replicated DNA strand (proofreading activity), was eliminated because it also allows the polymerase to remove an incorporated ddNTP.
  • the 5′ to 3′ exonuclease activity was eliminated because it removes bases from the 5′ end of the reaction products. Since the reaction products are size separated during gel electrophoresis, interpretable sequence data is only obtained if the reaction products share a common endpoint.
  • the primer defines the 5′ end of the extension product and the incorporated, color-coded ddNTP defines base identity at the 3′ end of the molecule.
  • conventional DNA sequencing involves analysis of a population of DNA molecules sharing the same 5′ endpoint, but differing in the location of the ddNTP at the 3′ end of the DNA chain.
  • a large DNA fragment typically one larger than 20,000 base pairs
  • a cloning vector typically one larger than 20,000 base pairs
  • Numerous smaller clones are randomly selected, DNA templates are prepared for sequencing reactions, and primers that will base-pair with the vector DNA sequence bordering the insert are used to begin the sequencing reaction (2-7 days for a 20 kbp insert).
  • a directed or primer-walking sequencing strategy can be used to fill-in gaps remaining after the random phase of large-fragment sequencing, and as an efficient approach for sequencing smaller DNA fragments.
  • This strategy uses DNA primers that anneal to the template at a single site and act as a start site for chain elongation. This approach requires knowledge of some sequence information to design the primer. The sequence obtained from the first reaction is used to design the primer for the next reaction and these steps are repeated until the complete sequence is determined.
  • a primer-based strategy involves repeated sequencing steps from known into unknown DNA regions, the process minimizes redundancy, and it does not require additional cloning steps. However, this strategy requires the synthesis of a new primer for each round of sequencing.
  • This method requires that the DNA strand is synthesized to contain the flourescently-labeled base(s). This requirement limits the length of sequence that can be determined, and increases the number of manipulations that must be performed before any sequence data is obtained.
  • a related approach proposes to sequentially separate single (unlabeled) nucleotides from a strand of DNA, confine them in their original order in a solid matrix, and detect the spectroscopic emission of the separated nucleotides to reconstruct DNA sequence information (Ulmer, 1997; Mitsis and Kwagh, 1999; Dapprich, 1999). This is the approach that is being developed by Praelux, Inc., a company with a goal to develop single-molecule DNA sequencing. Theoretically, this latter method should not be as susceptible to length limitations as the former enzymatic degradation method, but it does require numerous manipulations before any sequence information can be obtained.
  • Li-cor, Inc. is developing an enzyme synthesis based strategy for single-molecule sequencing as set forth in PCT application WO 00/36151.
  • the Li-cor method involves multiply modifying each dNTP by attaching a fluorescent tag to the ⁇ -phosphate and a quenching moiety to another site on the dNTP, preferably on the base.
  • the quenching moiety is added to prevent emission from the fluorescent tag attached to an unincorporated dNTP.
  • the fluorescent tag and quenching moiety are separated, resulting in emission from the tag.
  • the tag (contained on the pyrophosphate) flows away from the polymerase active site, but the modified (quenched) base becomes part of the DNA polymer.
  • the A-form DNA may be important for maximizing minor groove contacts between the enzyme and the DNA. If the DNA structure is affected due to base modification, enzyme fidelity and/or function may be altered. Thus, there is still a need in the art for a fast and efficient enzymatic DNA sequencing system for single molecular DNA sequences.
  • the present invention provides a polymerizing agent modified with at least one molecular or atomic tag located at or near, associated with or covalently bonded to a site on the polymerizing agent, where a detectable property of the tag undergoes a change before, during and/or after monomer incorporation.
  • the monomers can be organic, inorganic or bio-organic monomers such as nucleotides for DNA, RNA, mixed DNA/RNA sequences, amino acids, monosaccharides, synthetic analogs of naturally occurring nucleotides, synthetic analogs of naturally occurring amino acids or synthetic analogs of naturally occurring monosaccharides, synthetic organic or inorganic monomers, or the like.
  • the present invention provides a depolymerizing agent modified with at least one molecular or atomic tag located at or near, associated with or covalently bonded to a site on the depolymerizing agent, where a detectable property of the tag undergoes a change before, during and/or after monomer removal.
  • the polymers can be DNA, RNA, mixed DNA/RNA sequences containing only naturally occurring nucleotides or a mixture of naturally occurring nucleotides and synthetic analogs thereof, polypeptide sequences containing only naturally occurring amino acids or a mixture of naturally occurring amino acids and synthetic analogs thereof, polysaccharide or carbohydrate sequences containing only naturally occurring monosaccharides or a mixture of naturally occurring monosaccharides and synthetic analogs thereof, or polymers containing synthetic organic or inorganic monomers, or the like.
  • the present invention also provides a system that enables detecting a signal corresponding to a detectable property evidencing changes in interactions between a synthesizing/polymerizing agent or a depolymerizing agent (molecule) and its substrates (monomers or depolymerizable polymers) and decoding the signal into monomer order specific information or monomer sequence information, preferably in real-time or near real-time.
  • the present invention provides a polymerase modified with at least one molecular or atomic tag located at or near, associated with, or covalently bonded to a site on the polymerase, where a detectable property of the tag undergoes a change before, during and/or after monomer incorporation.
  • the monomers can be nucleotides for DNA, RNA or mixed DNA/RNA monomers or synthetic analogs polymerizable by the polymerase.
  • the present invention provides an exonuclease modified with at least one molecular or atomic tag located at or near, associated with, or covalently bonded to a site on the exonuclease, where a detectable property of the tag undergoes a change before, during and/or after monomer release.
  • the polymers can be DNA, RNA or mixed DNA/RNA sequences comprised of naturally occurring monomers or synthetic analogs depolymerizable by the exonuclease.
  • the present invention provides a polymerase modified with at least one molecular or atomic tag located at or near, associated with, or covalently bonded to a site that undergoes a conformational change before, during and/or after monomer incorporation, where the tag has a first detection propensity when the polymerase is in a first conformational state and a second detection propensity when the polymerase is in a second conformational state.
  • the present invention provides a polymerase modified with at least one chromophore located at or near, associated with, or covalently bonded to a site that undergoes a conformational change before, during and/or after monomer incorporation, where an intensity and/or frequency of emitted light of the chromophore has a first value when the polymerase is in a first conformational state and a second value when the polymerase is in a second conformational state.
  • the present invention provides a polymerase modified with at least one fluorescently active molecular tag located at or near, associated with, or covalently bonded to a site that undergoes a conformational change before, during and/or after monomer incorporation, where the tag has a first fluorescence propensity when the polymerase is in a first conformational state and a second fluorescence propensity when the polymerase is in a second conformational state.
  • the present invention provides a polymerase modified with a molecular tag located at or near, associated with, or covalently bonded to a site that undergoes a conformational change before, during and/or after monomer incorporation, where the tag is substantially detectable when the polymerase is in a first conformational state and substantially non-detectable when the polymerase is in a second conformational state or substantially non-detectable when the polymerase is in the first conformational state and substantially detectable when the polymerase is in the second conformational state.
  • the present invention provides a polymerase modified with at least one molecular or atomic tag located at or near, associated with, or covalently bonded to a site that interacts with a tag on the released pyrophosphate group, where the polymerase tag has a first detection propensity before interacting with the tag on the released pyrophosphate group and a second detection propensity when interacting with the tag on the released pyrophosphate group.
  • this change in detection propensity is cyclical occurring as each pyrophosphate group is released.
  • the present invention provides a polymerase modified with at least one chromophore located at or near, associated with, or covalently bonded to a site that interacts with a tag on the released pyrophosphate group, where an intensity and/or frequency of light emitted by the chromophore has a first value before the chromophore interacts with the tag on the released pyrophosphate and a second value when interacting with the tag on the released pyrophosphate group.
  • this change in detection propensity is cyclical occurring as each pyrophosphate group is released.
  • the present invention provides a polymerase modified with at least one fluorescently active molecular tag located at or near, associated with, or covalently bonded to a site that interacts with a tag on the released pyrophosphate group, where the polymerase tag changes from a first state prior to release of the pyrophosphate group and a second state as the pyrophosphate group diffuses away from the site of release.
  • this change in detection propensity is cyclical occurring as each pyrophosphate group is released.
  • the present invention provides a polymerase modified with a molecular tag located at or near, associated with, or covalently bonded to a site that interacts with a tag on the released pyrophosphate group, where the polymerase tag changes from a substantially detectable state prior to pyrophosphate release to a substantially non-detectable state when the polymerase tag interacts with the tag on the pyrophosphate group after group release, or changes from a substantially non-detectable state prior to pyrophosphate release to a substantially detectable state when the polymerase tag interacts with the tag on the pyrophosphate group after group release.
  • the present invention provides a monomer polymerizing agent modified with at least one pair of molecular and/or atomic tags located at or near, associated with, or covalently bonded to sites on the polymerizing agent, where a detectable property of at least one tag of the pair undergoes a change before, during and/or after monomer incorporation or where a detectable property of at least one tag of the pair undergoes a change before, during and/or after monomer incorporation due to a change in inter-tag interaction.
  • the present invention provides a depolymerizing agent modified with at least one pair of molecular and/or atomic tags located at or near, associated with, or covalently bonded to sites on the depolymerizing agent, where a detectable property of at least one tag of the pair undergoes a change before, during and/or after monomer release or where a detectable property of at least one tag of the pair undergoes a change before, during and/or after monomer release due to a change in inter-tag interaction.
  • the present invention provides a monomer polymerizing agent modified with at least one pair of molecular and/or atomic tags located at or near, associated with, or covalently bonded to sites on the polymerizing agent, where a detectable property of at least one tag of the pair has a first value when the polymerizing agent is in a first state and a second value when the polymerizing agent is in a second state, where the polymerizing agent changes from the first state to the second state and back to the first state during a monomer incorporation cycle.
  • the present invention provides a depolymerizing agent modified with at least one pair of molecular and/or atomic tags located at or near, associated with or covalently bonded to sites on the polymerizing agent, where a detectable property of at least one tag of the pair has a first value when the depolymerizing agent is in a first state and a second value when the depolymerizing agent is in a second state, where the depolymerizing agent changes from the first state to the second state and back to the first state during a monomer release cycle.
  • the first and second states are different so that a change in the detected signal occurs.
  • a no-change result may evidence other properties of the polymerizing media or depolymerizing media.
  • the present invention provides a polymerase modified with at least one pair of molecular tags located at or near, associated with, or covalently bonded to sites at least one of the tags undergoes a change during monomer incorporation, where a detectable property of the pair has a first value when the polymerase is in a first state and a second value when the polymerase is in a second state, where the polymerase changes from the first state to the second state and back to the first state during a monomer incorporation cycle.
  • the present invention provides a polymerase modified with at least one pair of molecular tags located at or near, associated with or covalently bonded to sites at least one of the tags undergoes conformational change during monomer incorporation, where the detectably property of the pair has a first value when the polymerase is in a first conformational state and a second value when the polymerase is in a second conformational state, where the polymerase changes from the first state to the second state and back to the first state during a monomer incorporation cycle.
  • the present invention provides a polymerase modified with at least one pair of molecules or atoms located at or near, associated with or covalently bonded to sites at least one of the tags undergoes conformational change during monomer incorporation, where the pair interact to form a chromophore when the polymerase is in a first conformational state or a second conformational state, where the polymerase changes from the first state to the second state and back to the first state during a monomer incorporation cycle.
  • the present invention provides a polymerase modified with at least one pair of molecular tags located at or near, associated with or covalently bonded to sites at least one of the tags undergoes conformational change during monomer incorporation, where the tags have a first fluorescence propensity when the polymerase is in a first conformational state and a second fluorescence propensity when the polymerase is in a second conformational state, where the polymerase changes from the first state to the second state and back to the first state during a monomer incorporation cycle.
  • the present invention provides a polymerase modified with at least one pair of molecular tags located at or near, associated with or covalently bonded to sites at least one of the tags undergoes conformational change during monomer incorporation, where the pair is substantially active when the polymerase is in a first conformational state and substantially inactive when the polymerase is in a second conformational state or substantially inactive when the polymerase is in the first conformational state and substantially active when the polymerase is in the second conformational state, where the polymerase changes from the first state to the second state and back to the first state during a monomer incorporation cycle.
  • the present invention provides a polymerase modified with at least one pair of molecular tags located at or near, associated with, or covalently bonded to sites at least one of the tags undergoes a change during and/or after pyrophosphate release during the monomer incorporation process, where a detectable property of the pair has a first value when the tag is in a first state prior to pyrophosphate release and a second value when the tag is in a second state during and/or after pyrophosphate release, where the tag changes from its first state to its second state and back to its first state during a monomer incorporation cycle.
  • the present invention provides a polymerase modified with at least one pair of molecular tags located at or near, associated with or covalently bonded to sites at least one of the tags undergoes a change in position due to a conformational change in the polymerase during the pyrophosphate release process, where the detectably property of the pair has a first value when the tag is in its first position and a second value when the tag is in its second position, where the tag changes from its first position to its second position and back to its first position during a release cycle.
  • the present invention provides a polymerase modified with at least one pair of molecules or atoms located at or near, associated with or covalently bonded to sites, where the tags change relative separation due to a conformational change in the polymerase during pyrophosphate release, where the tags interact to form a chromophore having a first emission profile when the tags are a first distance apart and a second profile when the tags are a second distance apart, where the separation distance changes from its first state to its second state and back to its first state during a pyrophosphate release cycle.
  • the present invention provides a polymerase modified with at least one pair of molecular tags located at or near, associated with or covalently bonded to sites, where the tags change relative separation due to a conformational change in the polymerase during pyrophosphate release, where the tags have a first fluorescence propensity when the polymerase is in a first conformational state and a second fluorescence propensity when the polymerase is in a second conformational state, where the propensity changes from its the first value to its second value and back again during a pyrophosphate release cycle.
  • the present invention provides a polymerase modified with at least one pair of molecular tags located at or near, associated with or covalently bonded to sites, where the tags change relative separation due to a conformational change in the polymerase during pyrophosphate release, where the pair is substantially fluorescently active when the tags have a first separation and substantially fluorescently inactive when the tags have a second separation or substantially fluorescently inactive when the tags have the first separation and substantially fluorescently active when the tags have the second separation, where the fluorescence activity undergoes one cycle during a pyrophosphate release cycle.
  • the first and second states are different so that a change in the detected signal occurs.
  • a no-change result may evidence other properties of the polymerizing medium or depolymerizing medium.
  • the present invention provides a method for determining when a monomer is incorporated into a growing molecular chain comprising the steps of monitoring a detectable property of an atomic or molecular tag, where the tag is located at or near, associated with, or covalently bonded to a site on a polymerizing agent, where the detectable property of the tag undergoes a change before, during and/or after monomer incorporation.
  • the present invention provides a method for determining when a monomer is incorporated into a growing molecular chain comprising the steps of monitoring a detectable property of an atomic or molecular tag, where the tag is located at or near, associated with, or covalently bonded to a site on a polymerizing agent, where the detectable property has a first value when the agent is in a first state and a second value when the agent is in a second state, where the agent changes from the first state to the second state and back to the first state during a monomer incorporation cycle.
  • the first and second states are different so that a change in the detected signal occurs.
  • a no-change result may evidence other properties of the polymerizing medium.
  • the present invention provides a method for determining when or whether a monomer is incorporated into a growing molecular chain comprising the steps of monitoring a detectable property of a tag, where the tag is located at or near, associated with, or covalently bonded to a site on a polymerase, where the site undergoes a change during monomer incorporation and where the detectable property has a first value when the polymerase is in a first state and a second value when the polymerase is in a second state, where the values signify that the site has undergone the change and where the polymerase changes from the first state to the second state and back to the first state during a monomer incorporation cycle.
  • the present invention provides a method for determining when or whether a monomer is incorporated into a growing molecular chain comprising the steps of monitoring a detectable property of a tag, where the tag is located at or near, associated with, or covalently bonded to a site on a polymerase, where the site undergoes a conformational change during monomer incorporation and where the detectable property has a first value when the polymerase is in a first conformational state and a second value when the polymerase is in a second conformational state, where the values signify that the site has undergone the change and where the polymerase changes from the first state to the second state and back to the first state during a monomer incorporation cycle.
  • the present invention provides a method for determining when or whether a monomer is incorporated into a growing molecular chain comprising the steps of exposing a tagged polymerase to light, monitoring an intensity and/or frequency of fluorescent light emitted by the tagged polymerase, where the tagged polymerase comprises a polymerase including a tag located at or near, associated with, or covalently bonded to a site that undergoes conformational change during monomer incorporation and where the tag emits fluorescent light at a first intensity and/or frequency when the polymerase is in a first conformational state and a second intensity and/or frequency when the polymerase is in a second conformational state, where the change in intensities and/or frequencies signifies that the site has undergone the change and where the polymerase changes from the first state to the second state and back to the first state during a monomer incorporation cycle.
  • the present invention also provides the above methods using a plurality of tagged polymerases permitting parallel and/or massively parallel sequencing simultaneously. Such parallelism can be used to ensure confidence. Such parallelism can also be used to quickly detect the degree of homology in DNA sequences for a given gene across species or to quickly screen patient DNA for specific genetic traits or to quickly screen DNA sequences for polymorphisms.
  • the present invention also provides a method for determining if or when a monomer is incorporated into a growing DNA chain associated with a polymerase, where a tag is located on the polymerase so that as the pyrophosphate group is released after base incorporation and prior to its diffusion away from the polymerase, the polymerase tag interacts with the tag on the pyrophosphate causing a change in a detectable property of one of the tags or a detectable property associated with both tags in the case of a fluorescent pair.
  • the first and second states are different so that a change in the detected signal occurs.
  • a no-change result may evidence other properties of the polymerizing media.
  • the present invention provides a single-molecule sequencing apparatus comprising a substrate having deposited thereon at least one tagged polymerizing agent.
  • the tagged polymerizing agent can be placed on the surface of the substrate in an appropriate polymerizing medium or the polymerizing agent can be confined in a region, area, well, groove, channel or other similar structure on the substrate.
  • the substrate can also include a monomer region, area, well, groove, channel, reservoir or other similar structure on the substrate connected to the polymerizing agent confinement structure by at least one connecting structure capable of supporting molecular transport of monomer to the polymerizing agent such as a channel, groove, or the like.
  • the substrate can include structures containing each monomer, where each structure is connected to the polymerizing agent confinement structure by a connecting structure capable of supporting molecular transport of monomer to the polymerizing agent.
  • the substrate can also be subdivided into a plurality of polymerizing agent confinement structures, where each structure is connected to a monomer reservoir.
  • each polymerizing agent confinement structure can have its own monomer reservoir or sufficient monomer reservoirs so that each reservoir contains a specific monomer.
  • the present invention also provides a single-molecule sequencing apparatus comprising a substrate having at least one tagged polymerizing agent attached to the surface of the substrate by a molecular tether or linking group, where one end of the tether or linking group is bonded to a site on the surface of the substrate and the other end is bonded to a site on the polymerizing agent or bonded to a site on a molecule strongly associated with the polymerizing agent.
  • a single-molecule sequencing apparatus comprising a substrate having at least one tagged polymerizing agent attached to the surface of the substrate by a molecular tether or linking group, where one end of the tether or linking group is bonded to a site on the surface of the substrate and the other end is bonded to a site on the polymerizing agent or bonded to a site on a molecule strongly associated with the polymerizing agent.
  • bonded to means that chemical and/or physical interactions sufficient to maintain the polymerizing agent within a given region of the substrate under normal polymerizing conditions.
  • the chemical and/or physical interactions include, without limitation, covalent bonding, ionic bonding, hydrogen bonding, apolar bonding, attractive electrostatic interactions, dipole interactions, or any other electrical or quantum mechanical interaction sufficient in toto to maintain the polymerizing agent in a desired region of the substrate.
  • the substrate having tethered tagged polymerizing agent attached thereon can be placed in container containing an appropriate polymerizing medium.
  • the tagged polymerizing agent can be tethered or anchored on or within a region, area, well, groove, channel or other similar structure on the substrate capable of being filled with an appropriate polymerizing medium.
  • the substrate can also include a monomer region, area, well, groove, channel or other similar structure on the substrate connected to the polymerizing agent structure by at least one a connecting structure capable of supporting molecular transports of monomer to the polymerizing agent.
  • the substrate can include structures containing each monomer, where each structure is connected to the polymerizing agent structure by a connecting structure capable of supporting molecular transports of monomer to the polymerizing agent.
  • the substrate can also be subdivided into a plurality of polymerizing agent structures each having at least one tethered polymerizing agent, where each structure is connected to a monomer reservoir.
  • each polymerizing agent structure can have its own monomer reservoir or sufficient monomer reservoirs, one reservoir of each specific monomer.
  • the monomers for use in these apparatus including, without limitation, dNTPs, tagged dNTPs, ddNTPs, tagged ddNTPs, amino acids, tagged amino acids, mono saccharides, tagged monosaccharides or appropriate mixtures or combinations thereof depending on the type of polymer being sequenced.
  • the present invention provides a single-molecule sequencing apparatus comprising a substrate having deposited thereon at least one tagged polymerase.
  • the tagged polymerase can be placed on the surface of the substrate in an appropriate polymerizing medium or the polymerase can be confined in a region, area, well, groove, channel or other similar structure on the substrate capable of being filled with an appropriate polymerizing medium.
  • the substrate can also include a monomer region, area, well, groove, channel or other similar structure on the substrate connected to the polymerase confinement structure by at least one connecting structure capable of supporting molecular transports of monomer to the polymerase.
  • the substrate can include structures containing each monomer, where each structure is connected to the polymerase confinement structure by a connecting structure capable of supporting molecular transports of the monomer to the polymerase in the polymerase confinement structures.
  • the substrate can also be subdivided into a plurality of polymerase confinement structures, where each structure is connected to a monomer reservoir.
  • each polymerase confinement structure can have its own monomer reservoir or four reservoirs, each reservoir containing a specific monomer.
  • the present invention also provides a single-molecule sequencing apparatus comprising a substrate having at least one tagged polymerase attached to the surface of the substrate by a molecular tether or linking group, where one end of the tether or linking group is bonded to a site on the surface of the substrate and the other end is bonded (either directly or indirectly) to a site on the polymerase or bonded to a site on a molecule strongly associated with the polymerase.
  • bonded to means that chemical and/or physical interactions sufficient to maintain the polymerase within a given region of the substrate under normal polymerizing conditions.
  • the chemical and/or physical interactions include, without limitation, covalent bonding, ionic bonding, hydrogen bonding, apolar bonding, attractive electrostatic interactions, dipole interactions, or any other electrical or quantum mechanical interaction sufficient in toto to maintain the polymerase in its desired region.
  • the substrate having tethered tagged polymerizing agent attached thereon can be placed in container containing an appropriate polymerizing medium.
  • the tagged polymerizing agent can be tethered or anchored on or within a region, area, well, groove, channel or other similar structure on the substrate capable of being filled with an appropriate polymerizing medium.
  • the substrate can also include a monomer region, area, well, groove, channel or other similar structure on the substrate connected to the polymerase structure by at least one channel.
  • the substrate can include structures containing each monomer, where each structure is connected to the polymerase structure by a connecting structure that supports molecular transports of the monomer to the polymerase in the polymerase confinement structures.
  • the substrate can also be subdivided into a plurality of polymerase structures each having at least one tethered polymerase, where each structure is connected to a monomer reservoir.
  • each polymerase structure can have its own monomer reservoir or four reservoirs, each reservoir containing a specific monomer.
  • the monomers for use in these apparatus including, without limitation, dNTPs, tagged dNTPs, ddNTPs, tagged ddNTPs, or mixtures or combinations thereof.
  • the present invention provides a method for single-molecule sequencing comprising the step of supplying a plurality of monomers to a tagged polymerizing agent confined on or tethered to a substrate and monitoring a detectable property of the tag over time.
  • the method can also include a step of relating changes in the detectable property to the occurrence (timing) of monomer addition and/or to the identity of each incorporated monomer and/or to the near simultaneous determination of the sequence of incorporated monomers.
  • the present invention provides a method for single-molecule sequencing comprising the step of supplying a plurality of monomers to a tagged polymerizing agent confined on or tethered to a substrate, exposing the tagged polymerizing agent to light either continuously or periodically and measuring an intensity and/or frequency of fluorescent light emitted by the tag over time.
  • the method can further comprise relating the changes in the measured intensity and/or frequency of emitted fluorescent light from the tag over time to the occurrence (timing) of monomer addition and/or to the identity of each incorporated monomer and/or to the near simultaneous determination of the sequence of the incorporated monomers.
  • the present invention provides a method for single-molecule sequencing comprising the step of supplying a plurality of monomers to a tagged polymerase confined on or tethered to a substrate and monitoring a detectable property of the tag over time.
  • the method can also include a step of relating changes in the detectable property over time to the occurrence (timing) of monomer addition and/or to the identity of each incorporated monomer and/or to the near simultaneous determination of the sequence of the incorporated monomers.
  • the present invention provides a method for single-molecule sequencing comprising the step of supplying a plurality of monomers to a tagged polymerase confined on a substrate, exposing the tagged polymerase to light continuously or periodically and measuring an intensity and/or frequency of fluorescent light emitted by the tagged polymerase over time.
  • the method can further comprise relating changes in the measured intensity and/or frequency of emitted fluorescent light from the tag over time to the occurrence (timing) of monomer addition and/or to the identity of each incorporated monomer and/or to the near simultaneous determination of the sequence of the incorporated monomers.
  • the present invention provides cooperatively tagged polymerizing agents and tagged monomers, where a detectable property of at least one of the tags changes when the tags interact before, during and/or after monomer insertion.
  • the tag on the polymerase is positioned such that the tags interact before, during and/or after each monomer insertion.
  • tags that are released from the monomers after monomer insert such as of ⁇ and/or ⁇ phosphate tagged dNTPs, i.e., the tags reside on the ⁇ and/or ⁇ phosphate groups
  • the tag on the polymerizing agent can be designed to interact with the tag on the monomer only after the tag is released from the polymerizing agent after monomer insertion.
  • Tag placement within a polymerizing agent can be optimized to enhance interaction between the polymerase and dNTP tags by attaching the polymerase tag to sites on the polymerase that move during an incorporation event changing the relative separation of the two tags or optimized to enhance interaction between the polymerase tag and the tag on the pyrophosphate as it is release during base incorporation and prior to its diffusion away from the polymerizing agent.
  • the present invention provides cooperatively tagged polymerizing agents and tagged monomers, where a detectable property of at least one of the tags changes when the tags are within a distance sufficient to cause a measurable change in the detectable property. If the detectable property is fluorescence induced in one tag by energy transfer to the other tag or due to one tag quenching the fluorescence of the other tag or causing a measurable change in the fluorescence intensity and/or frequency, the measurable change is caused by bringing the tags into close proximity to each other, i.e., decrease the distance separating the tags.
  • the distance needed to cause a measurable change in the detectable property is within (less than or equal to) about 100 ⁇ , preferably within about 50 ⁇ , particularly within about 25 ⁇ , especially within about 15 ⁇ and most preferably within about 10 ⁇ .
  • a distance sufficient to cause a measurable change in a detectable property of a tag will depend on many parameters including the location of the tag, the nature of the tag, the solvent system, external fields, excitation source intensity and frequency band width, temperature, pressure, etc.
  • the present invention provides a tagged polymerizing agent and tagged monomer precursor(s), where an intensity and/or frequency of fluorescence light emitted by at least one tag changes when the tags interact before, during and/or after monomer insertion.
  • the present invention provides cooperatively tagged depolymerizing agents and tagged depolymerizable polymer, where a detectable property of at least one of the tags changes when the tags interact before, during and/or after monomer release.
  • the tag on the depolymerizing agent can be designed so that the tags interact before, during and/or after each monomer release.
  • the present invention provides cooperatively tagged depolymerizing agents and tagged polymers, where a detectable property of at least one of the tags changes when the tags are within a distance sufficient to cause a change in measurable change in the detectable property. If the detectable property is fluorescence induced in one tag by energy transfer to the other tag or due to one tag quenching the fluorescence of the other tag or causing a measurable change in the fluorescence intensity and/or frequency, the measurable change is caused by bringing two tags into close proximity to each other, i.e., decrease the distance separating the tags.
  • the distance needed to cause a measurable change in the detectable property is within (less than or equal to) about 100 ⁇ preferably within about 50 ⁇ , particularly within about 25 ⁇ , especially within about 15 ⁇ and most preferably within about 10 ⁇ .
  • a distance sufficient to cause a measurable change in a detectable property of a tag will depend on many parameters including the location of the tag, the nature of the tag, the solvent system, external fields, excitation source intensity and frequency band width, temperature, pressure, etc.
  • the present invention provides a tagged depolymerizing agents and a tagged polymer, where an intensity and/or frequency of fluorescence light emitted by at least one tag changes when the tags interact before, during and/or after monomer release.
  • the present invention provides cooperatively tagged polymerase and tagged monomers, where a detectable property of at least one of the tags changes when the tags interact before, during and/or after monomer insertion.
  • the tag on the polymerase can be designed so that the tags interact before, during and/or after each monomer insertion.
  • tags that are released from the monomers after monomer insert such as of ⁇ and/or ⁇ phosphate tagged dNTPs, i.e., the tags reside on the ⁇ and/or ⁇ phosphate groups
  • the tag on the polymerizing agent can be designed to interact with the tag on the monomer only after the tag is released from the polymerizing agent after monomer insertion.
  • the polymerase tag In the first case, the polymerase tag must be located on a site of the polymerase which allows the polymerase tag to interact with the monomer tag during the monomer insertion process—initial binding and bonding into the growing polymer. While in the second case, the polymerase tag must be located on a site of the polymerase which allows the polymerase tag to interact with the monomer tag now on the released pyrophosphate prior to its diffusion away from the polymerase and into the polymerizing medium.
  • the present invention provides cooperatively tagged polymerase and tagged monomers, where a detectable property of at least one of the tags changes when the tags are within a distance sufficient or in close proximity to cause a measurable change in the detectable property.
  • the detectable property is fluorescence induced in one tag by energy transfer to the other tag or due to one tag quenching the fluorescence of the other tag or causing a measurable change in the fluorescence intensity and/or frequency
  • the measurable change is caused by bringing two tags into close proximity to each other, i.e., decrease the distance separating the tags.
  • the distance or close proximity is a distance between about 100 ⁇ and about 10 ⁇ .
  • the distance is less than or equal to about 100 ⁇ , preferably less than or equal to about 50 ⁇ , particularly less than or equal to about 25 ⁇ , especially less than or equal to about 15 ⁇ and most preferably less than or equal to about 10 ⁇ .
  • a distance sufficient to cause a measurable change in a detectable property of a tag will depend on many parameters including the location of the tags, the nature of the tags, the solvent system (polymerizing medium), external fields, excitation source intensity and frequency band width, temperature, pressure, etc.
  • the present invention provides a tagged polymerase and tagged monomer precursors, where the tags form a fluorescently active pair such as a donor-acceptor pair and an intensity and/or frequency of fluorescence light emitted by at least one tag (generally the acceptor tag in donor-acceptor pairs) changes when the tags interact.
  • the present invention provides a tagged polymerase and a tagged monomer precursors, where the tags form a fluorescently active pair such as a donor-acceptor pair and an intensity and/or frequency of fluorescence light emitted by at least one tag (generally the acceptor tag in donor-acceptor pairs) changes when the tags are a distance sufficient or in close proximity to change either the intensity and/or frequency of the fluorescent light.
  • the distance or close proximity is a distance between about 100 ⁇ and about 10 ⁇ .
  • the distance is less than or equal to about 100 ⁇ , preferably less than or equal to about 50 ⁇ , particularly less than or equal to about 25 ⁇ , especially less than or equal to about 15 ⁇ and most preferably less than or equal to about 10 ⁇ .
  • a distance sufficient to cause a measurable change in a detectable property of a tag will depend on many parameters including the location of the tag, the nature of the tag, the solvent system, external fields, excitation source intensity and frequency band width, temperature, pressure, etc.
  • the present invention provides a single-molecule sequencing apparatus comprising a container having at least one tagged polymerase confined on or tethered to an interior surface thereof and having a solution containing a plurality of tagged monomers in contact with the interior surface.
  • the present invention provides a method for single-molecule sequencing comprising the step of supplying a plurality of tagged monomers to a tagged polymerase confined on an interior surface of a container, exposing the tagged polymerase to light and measuring an intensity and/or frequency of fluorescent light emitted by the tagged polymerase during each successive monomer addition or insertion into a growing polymer chain.
  • the method can further comprise relating the measured intensity and/or frequency of emitted fluorescent light to incorporation events and/or to the identification of each inserted or added monomer resulting in a near real-time or real-time readout of the sequence of the a growing nucleic acid sequence—DNA sequence, RNA sequence or mixed DNA/RNA sequences.
  • the present invention also provides a system for retrieving stored information comprising a molecule having a sequence of known elements representing a data stream, a single-molecule sequencer comprising a polymerase having at least one tag associated therewith, an excitation source adapted to excite at least one tag on the polymerase, and a detector adapted to detect a response from the excited tag on the polymerase, where the response from the at least one tag changes during polymerization of a complementary sequence of elements and the change in response represents a content of the data stream.
  • the present invention also provides a system for determining sequence information from a single-molecule comprising a molecule having a sequence of known elements, a single-molecule sequencer comprising a polymerase having at least one tag associated therewith, a excitation source adapted to excite at least one tag on the polymerase, and a detector adapted to detect a response from the excited tag on the polymerase, where the response from at least one tag changes during polymerization of a complementary sequence of elements representing the element sequence of the molecule.
  • the present invention also provides a system for determining sequence information from a single-molecule comprising a molecule having a sequence of known elements, a single-molecule sequencer comprising a polymerase having at least one fluorescent tag associated therewith, an excitation light source adapted to excite at least one fluorescent tag on the polymerase and/or monomer and a fluorescent light detector adapted to detect at least an intensity of emitted fluorescent light from at least one fluorescent tag on the polymerase and/or monomer, where the signal intensity changes each time a new nucleotide or nucleotide analog is polymerized into a complementary sequence and either the duration of the emission or lack of emission or the wavelength range of the emitted light evidences the particular nucleotide or nucleotide analog polymerized into the sequence so that at the completion of the sequencing the data stream is retrieved.
  • the present invention also provides a system for storing and retrieving data comprising a sequence of nucleotides or nucleotide analogs representing a given data stream; a single-molecule sequencer comprising a polymerase having at least one fluorescent tag covalently attached thereto; an excitation light source adapted to excite the at least one fluorescent tag on the polymerase and/or monomer; and a fluorescent light detector adapted to detect emitted fluorescent light from at least one fluorescent tag on the polymerase and/or monomer, where at least one fluorescent tag emits or fails to emit fluorescent light each time a new nucleotide or nucleotide analog is polymerized into a complementary sequence and either the duration of the emission or lack of emission or the wavelength range of the emitted light evidences the particular nucleotide or nucleotide analog polymerized into the sequence so that at the completion of the sequencing the data stream is retrieved.
  • monomer as used herein means any compound that can be incorporated into a growing molecular chain by a given polymerase.
  • monomers include, without limitations, naturally occurring nucleotides (e.g., ATP, GTP, TTP, UTP, CTP, dATP, dGTP, dTTP, dUTP, dCTP, synthetic analogs), precursors for each nucleotide, non-naturally occurring nucleotides and their precursors or any other molecule that can be incorporated into a growing polymer chain by a given polymerase. Additionally, amino acids (natural or synthetic) for protein or protein analog synthesis, mono saccharides for carbohydrate synthesis or other monomeric syntheses.
  • naturally occurring nucleotides e.g., ATP, GTP, TTP, UTP, CTP, dATP, dGTP, dTTP, dUTP, dCTP, synthetic analogs
  • precursors for each nucleotide e.g.,
  • polymerase as used herein means any molecule or molecular assembly that can polymerize a set of monomers into a polymer having a predetermined sequence of the monomers, including, without limitation, naturally occurring polymerases or reverse transcriptases, mutated naturally occurring polymerases or reverse transcriptases, where the mutation involves the replacement of one or more or many amino acids with other amino acids, the insertion or deletion of one or more or many amino acids from the polymerases or reverse transcriptases, or the conjugation of parts of one or more polymerases or reverse transcriptases, non-naturally occurring polymerases or reverse transcriptases.
  • polymerase also embraces synthetic molecules or molecular assembly that can polymerize a polymer having a pre-determined sequence of monomers, or any other molecule or molecular assembly that may have additional sequences that facilitate purification and/or immobilization and/or molecular interaction of the tags, and that can polymerize a polymer having a pre-determined or specified or templated sequence of monomers.
  • the present invention provides a composition
  • a composition comprising a polymerizing agent including at least one molecular and/or atomic tag located at or near, associated with or covalently bonded to a site on the agent, where a detectable property of the tag undergoes a change before, during and/or after monomer incorporation.
  • the present invention provides a composition
  • a composition comprising a polymerizing agent including at least one molecular and/or atomic tag located at or near, associated with or covalently bonded to a site on the agent, where a detectable property has a first value when the polymerase is in a first state and a second value when the polymerase is in a second state during monomer incorporation.
  • the present invention provides a composition
  • a depolymerizing agent including at least one molecular and/or atomic tag located at or near, associated with or covalently bonded to a site on the agent, where a detectable property of the tag undergoes a change before, during and/or after monomer removal.
  • the present invention provides a composition
  • a composition comprising a polymerizing agent including at least one molecular and/or atomic tag located at or near, associated with or covalently bonded to a site on the agent, where a detectable property has a first value when the polymerase is in a first state and a second value when the polymerase is in a second state during monomer removal.
  • the present invention provides a composition
  • a composition comprising a polymerase including at least one molecular and/or atomic tag located at or near, associated with or covalently bonded to a site on the polymerase, where a detectable property of the tag undergoes a change before, during and/or after monomer incorporation.
  • the present invention provides a composition
  • a composition comprising a polymerase including at least one molecular and/or atomic tag located at or near, associated with or covalently bonded to a site on the polymerase, where a detectable property has a first value when the polymerase is in a first state and a second value when the polymerase is in a second state during monomer incorporation.
  • the present invention provides a composition
  • a composition comprising an exonuclease including at least one molecular and/or atomic tag located at or near, associated with or covalently bonded to a site on the agent, where a detectable property of the tag undergoes a change before, during and/or after monomer removal.
  • the present invention provides a composition
  • a composition comprising an exonuclease including at least one molecular and/or atomic tag located at or near, associated with or covalently bonded to a site on the agent, where a detectable property has a first value when the polymerase is in a first state and a second value when the polymerase is in a second state during monomer removal.
  • the present invention provides a composition comprising an enzyme modified to produce a detectable response prior to, during and/or after interaction with an appropriately modified monomer, where the monomers are nucleotides, nucleotide analogs, amino acids, amino acid analogs, monosaccharides, monosaccharide analogs or mixtures or combinations thereof.
  • the present invention provides a composition
  • a composition comprising a polymerase including at least one molecular tag located at or near, associated with or covalently bonded to a site that undergoes conformational change during monomer incorporation, where the tag has a first detection propensity when the polymerase is in a first conformational state and a second detection propensity when the polymerase is in a second conformational state.
  • the present invention provides a composition
  • a composition comprising a polymerase including at least one chromophore located at or near, associated with or covalently bonded to a site that undergoes conformational change during monomer incorporation, where an intensity and/or frequency of emitted light of the tag has a first value when the polymerase is in a first conformational state and a second value when the polymerase is in a second conformational state.
  • the present invention provides a composition
  • a composition comprising a polymerase including at least one molecular tag located at or near, associated with or covalently bonded to a site that undergoes conformational change during monomer incorporation, where the tag has a first fluorescence propensity when the polymerase is in a first conformational state and a second fluorescence propensity when the polymerase is in a second conformational state.
  • the present invention provides a composition
  • a composition comprising a polymerase including a molecular tag located at or near, associated with or covalently bonded to a site that undergoes conformational change during monomer incorporation, where the tag is substantially active when the polymerase is in a first conformational state and substantially inactive when the polymerase is in a second conformational state or substantially inactive when the polymerase is in the first conformational state and substantially active when the polymerase is in the second conformational state.
  • the present invention provides a composition
  • a composition comprising a polymerizing agent including at least one pair of molecular tags located at or near, associated with or covalently bonded to a site of the agent, where a detectable property of at least one of the tags undergoes a change before, during and/or after monomer incorporation.
  • the present invention provides a composition
  • a composition comprising a polymerizing agent including at least one pair of molecular tags located at or near, associated with or covalently bonded to a site of the agent, where a detectable property has a first value when the polymerase is in a first state and a second value when the polymerase is in a second state during monomer incorporation.
  • the present invention provides a composition
  • a depolymerizing agent including at least one pair of molecular tags located at or near, associated with or covalently bonded to a site of the agent, where a detectable property of at least one of the tags undergoes a change before, during and/or after monomer removal.
  • the present invention provides a composition
  • a composition comprising a depolymerizing agent including at least one pair of molecular tags located at or near, associated with or covalently bonded to a site of the agent, where a detectable property has a first value when the polymerase is in a first state and a second value when the polymerase is in a second state during monomer removal.
  • the present invention provides a composition
  • a composition comprising a polymerase including at least one pair of molecular tags located at or near, associated with or covalently bonded to a site of the polymerase, where a detectable property of at least one of the tags undergoes a change before, during and/or after monomer incorporation.
  • the present invention provides a composition
  • a composition comprising a polymerase including at least one pair of molecular tags located at or near, associated with or covalently bonded to a site of the polymerase, where a detectable property has a first value when the polymerase is in a first state and a second value when the polymerase is in a second state during monomer incorporation.
  • the present invention provides a composition
  • a composition comprising an exonuclease including at least one pair of molecular tags located at or near, associated with or covalently bonded to a site of the polymerase, where a detectable property of at least one of the tags undergoes a change before, during and/or after monomer removal.
  • the present invention provides a composition
  • a composition comprising an exonuclease including at least one pair of molecular tags located at or near, associated with or covalently bonded to a site of the polymerase, where a detectable property has a first value when the polymerase is in a first state and a second value when the polymerase is in a second state during monomer removal.
  • the present invention provides a composition
  • a composition comprising a polymerase including at least one pair of molecular tags located at or near, associated with or covalently bonded to a site that undergoes conformational change during monomer incorporation, where the detectable property of the pair has a first value when the polymerase is in a first conformational state and a second value when the polymerase is in a second conformational state.
  • the present invention provides a composition
  • a composition comprising a polymerase including at least one pair of molecules or atoms located at or near, associated with or covalently bonded to a site that undergoes conformational change during monomer incorporation, where the pair interact to form a chromophore when the polymerase is in a first conformational state or a second conformational state.
  • the present invention provides a composition
  • a composition comprising a polymerase including at least one pair of molecular tags located at or near, associated with or covalently bonded to a site that undergoes conformational change during monomer incorporation, where the tags have a first fluorescence propensity when the polymerase is in a first conformational state and a second fluorescence propensity when the polymerase is in a second conformational state.
  • the present invention provides a composition
  • a composition comprising a polymerase including at least one pair of molecular tags located at or near, associated with or covalently bonded to a site that undergoes conformational change during monomer incorporation, where the pair is substantially active when the polymerase is in a first conformational state and substantially inactive when the polymerase is in a second conformational state or substantially inactive when the polymerase is in the first conformational state and substantially active when the polymerase is in the second conformational state.
  • the present invention provides a method for determining when a monomer is incorporated into a growing molecular chain comprising the steps of monitoring a detectable property of a tag, where the tag is located at or near, associated with or covalently bonded to a site on a polymerase or associated with or covalently bonded to a site on the monomer, where the site undergoes a change during monomer incorporation and where the detectable property has a first value when the polymerase is in a first state and a second value when the polymerase is in a second state and cycles from the first value to the second value during each monomer addition.
  • the present invention provides a method for determining when a monomer is incorporated into a growing molecular chain comprising the steps of monitoring a detectable property of a tag, where the tag is located at or near, associated with or covalently bonded to a site on a polymerase or associated with or covalently bonded to a site on the monomer, where the site undergoes a conformational change during monomer incorporation and where the detectable property has a first value when the polymerase is in a first conformational state and a second value when the polymerase is in a second conformational state and cycles from the first value to the second value during each monomer addition.
  • the present invention provides a method for determining when a monomer is incorporated into a growing molecular chain comprising the steps of exposing a tagged polymerase to light, monitoring an intensity and/or frequency of fluorescent light emitted by the tagged polymerase and/or monomer, where the tagged polymerase comprises a polymerase including a tag located at or near, associated with or covalently bonded to a site that undergoes conformational change during monomer incorporation or associated with or covalently bonded to a site on the monomer and where the tag emits fluorescent light at a first intensity and/or frequency when the polymerase is in a first conformational state and a second intensity and/or frequency when the polymerase is in a second conformational state and cycles from the first value to the second value during each monomer addition.
  • the present invention provides a composition comprising a single-molecule sequencing apparatus comprising a substrate having a chamber or chip surface in which at least one tagged polymerase is confined therein and a plurality of chambers, each of which includes a specific monomer and a plurality of channels interconnecting the chambers, where each replication complex is sufficiently distant to enable data collection from each complex individually.
  • the present invention provides a method for single-molecule sequencing comprising the steps of supplying a plurality of monomers to a tagged polymerase confined on a substrate, exposing the tagged polymerase to light and measuring an intensity and/or frequency of fluorescent light emitted by the tagged polymerase.
  • the method can further comprise the step of relating the measured intensity and/or frequency of emitted fluorescent light to incorporation of a specific monomer into a growing DNA chain.
  • the present invention provides a composition comprising a cooperatively tagged polymerizing agent and tagged monomers, where a detectable property of at least one of the tags changes when the tags interact.
  • the present invention provides a composition comprising a cooperatively tagged depolymerizing agent and tagged depolymerizable monomers, where a detectable property of at least one of the tags changes when the tags interact.
  • the present invention provides a composition comprising a cooperatively tagged polymerase and tagged monomers, where a detectable property of at least one of the tags changes when the tags interact.
  • the present invention provides a composition comprising a cooperatively tagged polymerase and tagged monomers, where a detectable property of at least one of the tags changes when the tag are within a distance sufficient to cause a change in the intensity and/or frequency of emitted fluorescent light.
  • the present invention provides a composition comprising a tagged polymerase and tagged monomer precursors, where an intensity and/or frequency of fluorescence light emitted by at least one tag changes when the tags interact.
  • the present invention provides a composition comprising a tagged polymerase and a tagged monomer precursors, where an intensity and/or frequency of fluorescence light emitted by at least one tag changes when the tags are within a distance sufficient to cause a change in the intensity and/or frequency of emitted fluorescent light.
  • the present invention provides a single-molecule sequencing apparatus comprising a container having at least one tagged polymerase confined on an interior surface thereof and having a solution containing a plurality of tagged monomers in contact with the interior surface or a subset of tagged monomers and a subset of untagged monomers which together provide all monomers precursor for polymerization.
  • the present invention provides a method for single-molecule sequencing comprising the steps of supplying a plurality of tagged monomers to a tagged polymerase confined on an interior surface of a container, exposing the tagged polymerase to light and measuring an intensity and/or frequency of fluorescent light emitted by the tagged polymerase.
  • the method can further comprise relating the measured intensity and/or frequency of emitted fluorescent light to incorporation of a specific monomer into a growing DNA chain.
  • the present invention provides a system for retrieving stored information comprising: (a) a molecule having a sequence of elements representing a data stream; (b) a single-molecule sequencer comprising a polymerase having at least one tag associated therewith; (c) an excitation source adapted to excite the at least one tag on the polymerase; and (d) a detector adapted to detect a response from the tag on the polymerase or on the monomers; where the response from at least one tag changes during polymerization of a complementary sequence of elements and the change in response represents a data stream content.
  • the present invention provides a system for determining sequence information from a single-molecule comprising: (a) a molecule having a sequence of elements; (b) a single-molecule sequencer comprising a polymerase having at least one tag associated therewith; (c) an excitation source adapted to excite at least one tag on the polymerase or on the monomers; and (d) a detector adapted to detect a response from the tag on the polymerase; where the response from at least one tag changes during polymerization of a complementary sequence of elements representing the element sequence of the molecule.
  • the present invention provides a system for determining sequence information from an individual molecule comprising: (a) a molecule having a sequence of elements; (b) a single-molecule sequencer comprising a polymerase having at least one fluorescent tag associated therewith; (c) an excitation light source adapted to excite the at least one fluorescent tag on the polymerase or on the monomers; and (d) a fluorescent light detector adapted to detect at least an intensity of emitted fluorescent light from the at least one fluorescent tag on the polymerase; where the intensity change of at least one fluorescent tag emits or fails to emit fluorescent light each time a new nucleotide or nucleotide analog is polymerized into a complementary sequence and either the duration of the emission or lack of emission or the wavelength range of the emitted light evidences the particular nucleotide or nucleotide analog polymerized into the sequence so that at the completion of the sequencing the data stream is retrieved.
  • the present invention provides a system for storing and retrieving data comprising: (a) a sequence of nucleotides or nucleotide analogs representing a given data stream; (b) a single-molecule sequencer comprising a polymerase having at least one fluorescent tag covalently attached thereto; (c) an excitation light source adapted to excite at least one fluorescent tag on the polymerase; and (d) a fluorescent light detector adapted to detect emitted fluorescent light from at least one fluorescent tag on the polymerase; where at least one fluorescent tag emits or fails to emit fluorescent light each time a new nucleotide or nucleotide analog is polymerized into a complementary sequence and either the duration of the emission or lack of emission or the wavelength range of the emitted light evidences the particular nucleotide or nucleotide analog polymerized into the sequence so that at the completion of the sequencing the data stream is retrieved.
  • the present invention provides a system for storing and retrieving data comprising: (a) a sequence of nucleotides or nucleotide analogs representing a given data stream; (b) a single-molecule sequencer comprising a polymerase having at least one fluorescent tag covalently attached thereto; (c) an excitation light source adapted to excite the at least one fluorescent tag on the polymerase or the monomers; and (d) a fluorescent light detector adapted to detect emitted fluorescent light from at least one fluorescent tag on the polymerase or the monomers; where at least one fluorescent tag emits or fails to emit fluorescent light each time a new nucleotide or nucleotide analog is polymerized into a complementary sequence and either the duration of the emission or lack of emission or the wavelength range of the emitted light evidences the particular nucleotide or nucleotide analog polymerized into the sequence so that at the completion of the sequencing the data stream is retrieved.
  • the present invention provides a method for sequencing a molecular sequence comprising the steps of: (a) a sequenced of nucleotides or nucleotide analogs representing a given data stream; (b) a single-molecule sequencer comprising a polymerase having at least one fluorescent tag covalently attached thereto; (c) an excitation light source adapted to excite at least one fluorescent tag on the polymerase or the monomers; and (d) a fluorescent light detector adapted to detect emitted fluorescent light from at least one fluorescent tag on the polymerase; where at least one fluorescent tag emits or fails to emit fluorescent light each time a new nucleotide or nucleotide analog is polymerized into a complementary sequence and either the duration of the emission or lack of emission or the wavelength range of the emitted light evidences the particular nucleotide or nucleotide analog polymerized into the sequence so that at the completion of the sequencing the data stream is retrieved.
  • the present invention provides a method for synthesizing a ⁇ -phosphate modified nucleotide comprising the steps of attaching a molecular tag to a pyrophosphate group and contacting the modified pyrophosphate with a dNMP to produce a ⁇ -phosphate tagged dNTP.
  • the present invention provides a method for 5′ end-labeling a biomolecule comprising the step of contacting the biomolecule with a kinase able to transfer a ⁇ -phosphate of a ⁇ -phosphate labeled ATP to the 5′ end of the biomolecule resulting in a covalently modified biomolecule.
  • the present invention provides a method for end-labeling a polypeptide or carbohydrate comprising the step of contacting the polypeptide or carbohydrate with an agent able to transfer an atomic or molecular tag to either a carboxy or amino end of a protein or polypeptide or to either the ⁇ -phosphate of a ⁇ -phosphate labeled ATP to the 5′ end of the biomolecule resulting in a covalently modified biomolecule.
  • FIG. 1 depicts FRET activity as a function of distance separating the fluorescent donor and acceptor
  • FIG. 2 depicts the open and closed ternary complex forms of the large fragment of Taq DNA pol I (Klentaq 1);
  • FIGS. 3A-C depicts an overlay between 3ktq (closed ‘black’) and 1tau (open ‘light blue’), the large fragment of Taq DNA polymerase I;
  • FIG. 4 depicts an image of a 20% denaturing polyacrylamide gel containing size separated radiolabeled products from DNA extension experiments involving ⁇ -ANS-phosphate-dATP;
  • FIG. 5 depicts an image of (A) the actual gel, (B) a lightened phosphorimage and (C) an enhanced phosphorimage of products generated in DNA extension reactions using ⁇ -ANS-phosphate-dNTPs;
  • FIG. 6 depicts an image of (A) 6% denaturing polyacrylamide gel, (B) a lightened phosphorimage of the actual gel, and (C) an enhanced phosphorimage of the actual gel containing products generated in DNA extension reactions using ⁇ -ANS-phosphate-dNTPs;
  • FIG. 7 depicts an image of (A) the actual gel, (B) a lightened phosphorimage of the actual gel, and (C) an enhanced phosphorimage of the actual gel;
  • FIG. 8 depicts data for the Klenow fragment from E. coli DNA polymerase I incorporation of gamma-modified nucleotides
  • FIG. 9 depicts data for the Pfu DNA polymerase incorporation of gamma-modified nucleotides
  • FIG. 10 depicts data for the HIV-1 reverse transcriptase incorporation of gamma-tagged nucleotides
  • FIG. 11 depicts experimental results for the native T7 DNA polymerase and SEQUENASE® incorporation of gamma-tagged nucleotides
  • FIG. 12 depicts reaction products produced when the four natural nucleotides (dATP, dCTP, dGTP and dTTP) are used in the synthesis reaction (solid line) and reaction products produced when base-modified nucleotides are used in the synthesis reaction.
  • the inventors have devised a methodology using tagged monomers such as dNTPs and/or tagged polymerizing agents such as polymerase and/or tagged agents associated with the polymerizing agent such as polymerase associated proteins or probes to directly readout the exact monomer sequence such as a base sequence of an RNA or DNA sequence during polymerase activity.
  • the methodology of this invention is adaptable to protein synthesis or to carbohydrate synthesis or to the synthesis of any molecular sequence where the sequence of monomers provides usable information such as the sequence of a RNA or DNA molecule, a protein, a carbohydrate, a mixed biomolecule or an inorganic or organic sequence of monomers which stores a data stream.
  • the methods and apparatuses using these methods are designed to create new ways to address basic research questions such as monitoring conformation changes occurring during replication and assaying polymerase incorporation fidelity in a variety of sequence contexts.
  • the single-molecule detection systems of this invention are designed to improve fluorescent molecule chemistry, computer modeling, base-calling algorithms, and genetic engineering of biomolecules, especially for real-time or near real-time sequencing.
  • the inventors have also found that the methodology can be adapted to depolymerizing agents such as exonucleases where the polymer sequence is determined by depolymerization instead of polymerization.
  • the single-molecule systems of this invention are amendable to parallel and/or massively parallel assays, where tagged polymerases are patterned in arrays on a substrate. The data collected from such arrays can be used to improve sequence confidence and/or to simultaneously sequence DNA regions from many different sources to identify similarities or differences.
  • the pattern of emission signals is collected, either directly, such as by an Intensified Charge Coupled Devise (ICCD) or through an intermediate or series of intermediates to amplify signal prior to electronic detection, where the signals are decoded and confidence values are assigned to each base to reveal the sequence complementary to that of the template.
  • ICCD Intensified Charge Coupled Devise
  • the present invention also provides techniques for amplifying the fluorescent light emitted from a fluorescent tag using physical light amplification techniques or molecular cascading agent to amplify the light produced by single-molecular fluorescent events.
  • the single-molecule DNA sequencing systems of this invention have the potential to replace current DNA sequencing technologies, because the methodology can decrease time, labor, and costs associated with the sequencing process, and can lead to highly scalable sequencing systems, improving the DNA sequence discovery process by at least one to two orders of magnitude per reaction.
  • the single-molecule DNA sequencing technology of this invention can: (1) make it easier to classify an organism or identify variations within an organism by simply sequencing the genome or a portion thereof; (2) make rapid identification of a pathogen or a genetically-modified pathogen easier, especially in extreme circumstances such as in pathogens used in warfare; and (3) make rapid identification of persons for either law enforcement and military applications easier.
  • One embodiment of the single-molecule sequencing technology of this invention involves strategically positioning a pair of tags on a DNA polymerase so that as a dNTP is incorporated during the polymerization reaction, the tags change relative separation.
  • This relative change causes a change in a detectable property, such as the intensity and/or frequency of fluorescence from one or both of the tags.
  • a time profile of these changes in the detectable property evidences each monomer incorporation event and provides evidence about which particular dNTP is being incorporated at each incorporation event.
  • the pair of tags do not have to be covalently attached to the polymerase, but can be attached to molecules that associate with the polymerase in such a way that the relative separation of the tags change during base incorporation.
  • Another embodiment of the single-molecule sequencing technology of this invention involves a single tag strategically positioned on a DNA polymerase that interacts with a tag on a dNTP or separate tags on each dNTP.
  • the tags could be different for each dNTP such as color-coded tags which emit a different color of fluorescent light.
  • the identity of the base is indicated by a signature fluorescent signal (color) or a change in a fluorescent signal intensity and/or frequency.
  • the rate of polymerase incorporation can be varied and/or controlled to create an essentially “real-time” or near “real-time” or real-time readout of polymerase activity and base sequence. Sequence data can be collected at a rate of >100,000 bases per hour from each polymerase.
  • the tagged polymerases each include a donor tag and an acceptor tag situated or located on or within the polymerase, where the distance between the tags changes during dNTP binding, dNTP incorporation and/or chain extension. This change in inter-tag distance results in a change in the intensity and/or wavelength of emitted fluorescent light from the fluorescing tag. Monitoring the changes in intensity and/or frequency of the emitted light provides information or data about polymerization events and the identity of incorporated bases.
  • the tags on the polymerases are designed to interact with the tags on the dNTPs, where the interaction changes a detectable property of one or both of the tags.
  • Each fluorescently tagged polymerase is monitored for polymerization using tagged dNTPs to determine the efficacy of base incorporation data derived therefrom.
  • Specific assays and protocols have been developed along with specific analytical equipment to measure and quantify the fluorescent data allowing the determination and identification of each incorporated dNTP.
  • the inventors have identified tagged dNTPs that are polymerized by suitable polymerases and have developed software that analyze the fluorescence emitted from the reaction and interpret base identity.
  • appropriate fluorescently active pairs are well-known in the art and commercially available from such vendors as Molecular Probes located in Oregon or Biosearch Technologies, Inc. in Novato, Calif.
  • the tagged DNA polymerase for use in this invention are genetically engineered to provide one or more tag binding sites that allow the different embodiments of this invention to operate.
  • a suitable polymerase candidate is identified, specific amino acids within the polymerase are mutated and/or modified such reactions well-known in the art; provided, however, that the mutation and/or modification do not significantly adversely affect polymerization efficiency.
  • the mutated and/or modified amino acids are adapted to facilitate tag attachment such as a dye or fluorescent donor or acceptor molecule in the case of light activated tags.
  • the engineered polymerase can be contacted with one or more appropriate tags and used in the apparatuses and methods of this invention.
  • the single-molecule DNA sequencing system of this invention can significantly reduce time, labor, and costs associated with the sequencing process and is highly scalable.
  • the single-molecule DNA sequencing system of this invention (1) can improve the sequence discovery process by at least two orders of magnitude per reaction; (2) is not constrained by the length limitations associated with the degradation-based, single-molecule methods; and (3) allows direct sequencing of desired (target) DNA sequences, especially genomes without the need for cloning or PCR amplification, both of which introduce errors in the sequence.
  • the systems of this invention can make easier the task of classifying an organism or identifying variations within an organism by simply sequencing the genome in question or any desired portion of the genome.
  • the system of this invention is adapted to rapidly identify pathogens or engineered pathogens, which has importance for assessing health-related effects, and for general DNA diagnostics, including cancer detection and/or characterization, genome analysis, or a more comprehensive form of genetic variation detection.
  • the single-molecule DNA sequencing system of this invention can become an enabling platform technology for single-molecule genetic analysis.
  • the single-molecule sequencing systems of this invention have the following advantages: (1) the systems eliminates sequencing reaction processing, gel or capillary loading, electrophoresis, and data assembly; (2) the systems results in significant savings in labor, time, and costs; (3) the systems allows near real-time or real-time data acquisition, processing and determination of incorporation events (timing, duration, etc.), base sequence, etc.; (4) the systems allows parallel or massively parallel sample processing in microarray format; (5) the systems allows rapid genome sequencing, in time frames of a day or less; (6) the systems requires very small amount of material for analysis; (7) the systems allows rapid genetic identification, screening and characterization of animals including humans or pathogen; (8) the systems allows large increases in sequence throughput; (9) the system can avoid error introduced in PCR, RT-PCR, and transcription processes; (10) the systems can allow accurate sequence information for allele-specific mutation detection; (11) the systems allows rapid medical diagnostics, e.g., Single Nucleotide Polymorphism (SNP) detection; (12) the systems allows improvement in basic research
  • a single tag is attached to an appropriate site on a polymerase and a unique tag is attached to each of the four nucleotides: dATP, dTTP, dCTP and dGTP.
  • the tags on each dNTPs are designed to have a unique emission signature (i.e., different emission frequency spectrum or color), which is directly detected upon incorporation.
  • a characteristic fluorescent signal or base emission signature is emitted due to the interaction of polymerase tag and the dNTP tag.
  • the fluorescent signals i.e., the emission intensity and/or frequency, are then detected and analyzed to determine DNA base sequence.
  • One criteria for selection of the tagged polymerase and/or dNTPs for use in this invention is that the tags on either the polymerase and/or the dNTPs do not interfere with Watson-Crick base-pairing or significantly adversely impact polymerase activity.
  • the inventors have found that dNTPs containing tags attached to the terminal (gamma) phosphate are incorporated by a native Taq polymerase either in combination with untagged dNTPs or using only tagged dNTPs. Tagging the dNTPs on the ⁇ and/or ⁇ phosphate group is preferred because the resulting DNA strands do not include any of the dNTP tags in their molecular make up, minimizing enzyme distortion and background fluorescence.
  • One embodiment of the sequencing system of this invention involves placing a fluorescent donor such as fluorescein or a fluorescein-type molecule on the polymerase and unique fluorescent acceptors such as a d-rhodamine or a similar molecule on each dNTP, where each unique acceptor, when interacting with the donor on the polymerase, generates a fluorescent spectrum including at least one distinguishable frequency or spectral feature.
  • a fluorescent donor such as fluorescein or a fluorescein-type molecule
  • unique fluorescent acceptors such as a d-rhodamine or a similar molecule on each dNTP
  • Another embodiment of the sequencing system of this invention involves a fluorescent tag on the polymerase and unique quenchers on the dNTPs, where the quenchers preferably have distinguishable quenching efficiencies for the polymerase tag. Consequently, the identity of each incoming quencher tagged dNTP is determined by its unique quenching efficiency of the emission of the polymerase fluorescent tag. Again, the signals produced during incorporation are detected and analyzed to determine each base incorporated, the sequence of which generates the DNA base sequence.
  • Suitable polymerizing agents for use in this invention include, without limitation, any polymerizing agent that polymerizes monomers relative to a specific template such as a DNA or RNA polymerase, reverse transcriptase, or the like or that polymerizes monomers in a step-wise fashion.
  • Suitable polymerases for use in this invention include, without limitation, any polymerase that can be isolated from its host in sufficient amounts for purification and use and/or genetically engineered into other organisms for expression, isolation and purification in amounts sufficient for use in this invention such as DNA or RNA polymerases that polymerize DNA, RNA or mixed sequences, into extended nucleic acid polymers.
  • Preferred polymerases for use in this invention include mutants or mutated variants of native polymerases where the mutants have one or more amino acids replaced by amino acids amenable to attaching an atomic or molecular tag, which have a detectable property.
  • Exemplary DNA polymerases include, without limitation, HIV1-Reverse Transcriptase using either RNA or DNA templates, DNA pol I from T. aquaticus or E. coli, Bateriophage T 4 DNA pol, T7 DNA pol or the like.
  • Exemplary RNA polymerases include, without limitation, T7 RNA polymerase or the like.
  • Suitable depolymerizing agents for use in this invention include, without limitation, any depolymerizing agent that depolymerizes monomers in a step-wise fashion such as exonucleases in the case of DNA, RNA or mixed DNA/RNA polymers, proteases in the case of polypeptides and enzymes or enzyme systems that sequentially depolymerize polysaccharides.
  • Suitable monomers for use in this invention include, without limitation, any monomer that can be step-wise polymerized into a polymer using a polymerizing agent.
  • Suitable nucleotides for use in this invention include, without limitation, naturally occurring nucleotides, synthetic analogs thereof, analog having atomic and/or molecular tags attached thereto, or mixtures or combinations thereof.
  • Suitable atomic tag for use in this invention include, without limitation, any atomic element amenable to attachment to a specific site in a polymerizing agent or dNTP, especially Europium shift agents, nmr active atoms or the like.
  • Suitable atomic tag for use in this invention include, without limitation, any atomic element amenable to attachment to a specific site in a polymerizing agent or dNTP, especially fluorescent dyes such as d-Rhodamine acceptor dyes including dichloro[R110], dichloro[R6G], dichloro[TAMRA], dichloro[ROX] or the like, fluorescein donor dye including fluorescein, 6-FAM, or the like; Acridine including Acridine orange, Acridine yellow, Proflavin, pH 7, or the like; Aromatic Hydrocarbon including 2-Methylbenzoxazole, Ethyl p-dimethylaminobenzoate, Phenol, Pyrrole, benzene, toluene, or the like; Arylmethine Dyes including Auramine O, Crystal violet, H2O, Crystal violet, glycerol, Malachite Green or the like; Coumarin dyes including 7-Methoxycoumarin-4-acetic acid, Cou
  • Taq DNA polymerase I Therms aquaticus —is ideally suited for use in the single-molecule apparatuses, systems and methods of this invention.
  • Taq DNA Polymerase sometimes simply referred to herein as Taq, has many attributes that the inventors can utilize in constructing tagged polymerases for use in the inventions disclosed in this application. Of course, ordinary artisans will recognize that other polymerases can be adapted for use in the single-molecule sequencing systems of this invention.
  • Taq DNA polymerase I tolerates so many mutations within or near its active site (as reviewed in Patel et al, J. Mol. Biol., volume 308, pages 823-837, and incorporated herein by reference), the enzyme is more tolerant of enzyme tagging modification(s) and also able to incorporate a wider range of modified nucleotide substrates.
  • the program compares structural data associated with the Taq polymerase in its open and closed form to identify regions in the polymerase structure that are optimally positioned to optimize the difference in conformation extremes between a tag on the polymerase and the dNTP or to optimize a change in separation between two tags on the polymerase, thereby increasing or maximizing changes in a detectable property of one of the tags or tag pair.
  • Taq DNA Polymerase is Efficiently Expressed in E. Coli
  • the Taq DNA polymerase is efficiently expressed in E. coli allowing efficient production and purification of the nascent polymerase and variants thereof for rapid identification, characterization and optimization of an engineered Taq DNA polymerase for use in the single-molecule DNA sequencing systems of this invention.
  • the Taq DNA polymerase contains no cysteines, which allows the easy generation of cysteine-containing mutants in which a single cysteine is placed or substituted for an existing amino acid at strategic sites, where the inserted cysteine serves as a tag attachment site.
  • the Processivity of the Enzyme can be Modified
  • native Taq DNA polymerase may not represent an optimal polymerase for sequencing system of this invention because it is not a very processive polymerase (50-80 nucleotides are incorporated before dissociation), the low processivity may be compensated for by appropriately modifying the base calling software.
  • the processivity of the Taq DNA Polymerase can be enhanced through genetic engineering by inserting into the polymerase gene a processivity enhancing sequence. Highly processive polymerases are expected to minimize complications that may arise from template dissociation effects, which can alter polymerization rate.
  • the processivity of Taq can be genetically altered by introducing the 76 amino acid ‘processivity domain’ from T7 DNA polymerase between the H and H 1 helices (at the tip of ‘thumb’ region within the polymerase) of Taq.
  • the processivity domain also includes the thioredoxin binding domain (TBD) from T7 DNA polymerase causing the Taq polymerase to be thioredoxin-dependent increasing both the processivity and specific activity of Taq polymerase.
  • TBD thioredoxin binding domain
  • Taq DNA Polymerase Possesses a 5′ to 3′ Exonuclease Activity and is Thermostable
  • Single-stranded M13 DNA and synthetic oligonucleotides are used in the initial studies. After polymerase activity is optimized, the sequencing system can be used to directly determine sequence information from an isolated chromosome—a double-stranded DNA molecule. Generally, heating a sample of double-stranded DNA is sufficient to produce or maintain the double-stranded DNA in stranded DNA form for sequencing.
  • the 5′ to 3′ exonuclease activity of the native Taq DNA polymerase in the enzyme engineered for single-molecule DNA sequencing is retained. This activity of the polymerase is exploited by the ‘TaqMan’ assay.
  • the exonuclease activity removes a duplex strand that may renature downstream from the replication site using a nick-translation reaction mechanism. Synthesis from the engineered polymerase is initiated either by a synthetic oligonucleotide primer (if a specific reaction start is necessary) or by a nick in the DNA molecule (if multiple reactions are processed) to determine the sequence of an entire DNA molecule.
  • the Polymerase is Free from 3′ to 5′ Exonuclease Activity
  • the Taq DNA polymerase is does not contain 3′ to 5′ exonuclease activity, which means that the polymerase cannot replace a base, for which fluorescent signal was detected, with another base which would produce another signature fluorescent signal.
  • the error rate of engineered polymerases of this invention is assayed by determining their error rates in synthesizing known sequences.
  • the error rate determines the optimal number of reactions to be run in parallel so that sequencing information can be assigned with confidence.
  • the optimal number can be 1 or 10 or more.
  • base context influences polymerase accuracy and reaction kinetics, and this information is used to assign confidence values to individual base calls.
  • Taq DNA Polymerase is the Enzyme of Choice for Single-Molecule DNA Sequencing
  • Taq DNA polymerase is the optimal enzyme to genetically modify and adapt for single-molecule DNA sequencing. Additionally, basic research questions concerning DNA polymerase structure and function during replication can be addressed using this technology advancing single-molecule detection systems and molecular models in other disciplines. The inventors have found that native Taq DNA polymerase incorporates gamma-tagged dNTPs, yielding extended DNA polymers. Importantly, incorporation of a modified nucleotide is not detrimental to polymerase activity and extension of primer strands by incorporation of a _-tagged nucleotide conforms to Watson-Crick base pairing rules.
  • FRET-based fluorescence resonance energy transfer-based
  • a FRET-based method exists when the emission from an acceptor is more intense than the emission from a donor, i.e., the acceptor has a higher fluorescence quantum yield than the donor at the excitation frequency.
  • the efficiency of FRET method can be estimated form computational models. See, e.g., Furey et al., 1998; Clegg et al., 1993; Mathies et al., 1990.
  • the efficiency of energy transfer (E) is computed from equation (1) as follows:
  • R 0 is calculated from equation (2):
  • ⁇ 2 is a geometric orientation factor related to the relative angle of the two transition dipoles ( ⁇ 2 is generally assumed to be 2 ⁇ 3)
  • J DA [M ⁇ 1 cm 3 ] is the overlap integral representing the normalized spectral overlap of the donor emission and acceptor absorption
  • Q D is the quantum yield.
  • the overlap integral is computed from equation (3):
  • I D and I RF are the fluorescence intensities of donor and a reference compound (fluorescein in 0.1 N NaOH), and A RF and A D are the absorbances of the reference compound and donor.
  • Q RF is the quantum yield of fluorescein in 0.1N NaOH and is taken to be 0.90.
  • R the distance between the donor and acceptor
  • R is measured by looking at different configurations (e.g., conformations) of the polymerase in order to obtain a conformationally averaged value. If both tags are on the polymerase, then R is the distance between the donor and acceptor in the open and closed conformation, while if the donor is on the polymerase and the acceptor on the dNTP, R is the distance between the donor and acceptor when the dNTP is bound to the polymerase and the polymerase is its closed form.
  • the distance between the tagged ⁇ -phosphate and the selected amino acid sites for labeling in the open versus closed polymerase conformation delineates optimal dye combinations. If the distance (R) between the donor and acceptor is the same as R 0 (R 0 is the Forster critical distance), FRET efficiency (E) is 50%. If R is more than 1.5 R 0 , the energy transfer efficiency becomes negligible (E ⁇ 0.02). Sites within the enzyme at which R/R 0 differ by more than 1.6 in the open versus closed forms are identified and, if necessary, these distances and/or distance differences can be increased through genetic engineering. A plot of FRET efficiency verses distance is shown in FIG. 1 .
  • Dye sets are chosen to maximize energy transfer efficiency between a tagged dNTP and a tag on the polymerase when the polymerase is in its closed configuration and to minimize energy transfer efficiency between the tag on the DNTP (either non-productively bound or in solution) and the tag on the polymerase when the polymerase is in its open configuration.
  • a molarity of each nucleotide in the reaction medium of no more than about 1 ⁇ M
  • an average distance between tagged nucleotides is calculated to be greater than or equal to about 250 ⁇ . Because this distance is several fold larger than the distance separating sites on the polymerase in its open to closed conformational, minimal FRET background between the polymerase and free dNTPs is observed.
  • nucleotide concentrations are reduced below 1 ⁇ M.
  • Reducing dNTP concentrations to levels of at least ⁇ 10% of the K m further minimizes background fluorescence and provides a convenient method for controlling the rate of the polymerase reaction for the real-time monitoring. Under such conditions, the velocity of the polymerization reaction is linearly proportional to the dNTP concentration and, thus, highly sensitive to regulation.
  • the use of a single excitation wavelength allows improved identification of unique tags on each dNTP. A single, lower-wavelength excitation laser is used to achieve high selectivity.
  • a fluorescence donor is attached to a site on the polymerase comprising a replaced amino acid more amenable to donor attachment such as cysteine and four unique fluorescence acceptors are attached to each dNTP.
  • fluorescein is attached to a site on the polymerase and rhodamine, rhodamine derivatives and/or fluorescein derivatives are attached to each dNTP.
  • Each donor-acceptor fluorophore pair is designed to have an absorption spectrum sufficiently distinct from the spectra of other pairs to allow separate identification after excitation.
  • the donor is selected such that the excitation light activates the donor, which then efficiency transfers the excitation energy to one of the acceptors.
  • the acceptor After energy transfer, the acceptor emits it unique fluorescence signature.
  • the emission of the fluorescence donor must significant overlap with the absorption spectra of the fluorescence acceptors for efficient energy transfer.
  • the methods of this invention can also be performed using two, three or four unique fluorescence donor-acceptor pairs, by running parallel reactions.
  • Fluorophore choice is a function of not only its enzyme compatibility, but also its spectral and photophysical properties. For instance, it is critical that the acceptor fluorophore does not have any significant absorption at the excitation wavelength of the donor fluorophore, and less critical (but also desirable) is that the donor fluorophore does not have emission at the detection wavelength of the acceptor fluorophore. These spectral properties can be attenuated by chemical modifications of the fluorophore ring systems.
  • the dNTPs are amenable to tagging at several sites including the base, the sugar and the phosphate groups, the dNTPs are preferably tagged at either the ⁇ and/or ⁇ phosphate. Tagging the terminal phosphates of dNTP has a unique advantage. When the incoming, tagged dNTP is bound to the active site of the polymerase, significant FRET from the donor on the polymerase to the acceptor on the dNTP occurs. The unique fluorescence of the acceptor identifies which dNTP is incorporated.
  • the fluorescence acceptor which is now attached to the pyrophosphate group, is released to the medium with the cleaved pyrophosphate group.
  • the growing DNA chain includes no fluorescence acceptor molecules at all.
  • FRET occurs only between the donor on the polymerase and incoming acceptor-labeled dNTP, one at a time. This approach is better than the alternative attachment of the acceptor to a site within the dNMP moiety of the dNTP or the use of multiply-modified dNTPs. If the acceptor is attached to a site other than the ⁇ or ⁇ phosphate group, it becomes part of the growing DNA chain and the DNA chain will contain multiple fluorescence acceptors. Interference with the polymerization reaction and FRET measurements would likely occur.
  • collisional quenchers can be added to the polymerizing medium that do not covalently interact with the acceptors on the dNTPs and quench fluorescence from the tagged dNTPs in the medium.
  • the quenchers are also adapted to have insignificant contact with the donor on the polymerase.
  • the polymerase tag is preferably localized internally and shielded from the collisional quenchers or the collisional quencher can be made sterically bulky or associate with a sterically bulky group to decrease interaction between the quencher and the polymerase.
  • Another preferred method for detecting polymerase-nucleotide interactions involves using nucleotide-specific quenching agents to quench the emission of a fluorescent tag on the polymerase.
  • the polymerase is tagged with a fluorophore, while each dNTP is labeled with a quencher for the fluorophore.
  • DABCYL 4-(4′-dimethylaminophe-nylazo) benzoic acid is a universal quencher, which absorbs energy from a fluorophore, such as 5-(2′-aminoethyl)aminonaphthalene-1-sulfonic acid (AEANS) and dissipates heat.
  • AEANS 5-(2′-aminoethyl)aminonaphthalene-1-sulfonic acid
  • a quencher is selected for each dNTP so that when each quencher is brought into close proximity to the fluorophore, a distinguishable quenching efficiency is obtained. Therefore, the degree of quenching is used to identify each dNTP as it is being incorporated into the growing DNA chain.
  • One advantage of this preferred detection method is that fluorescence emission comes from a single source rendering background noise negligible.
  • two or three suitable quenchers are identified, then two or three of the four dNTPs are labeled and a series of polymerization reaction are made each time with a different pair of the labeled dNTPs. Combining the results from these runs generates a complete sequence of the DNA molecule.
  • the present invention is directed to attaching any type of atomic and/or molecular tag that has a detectable property
  • the processes for site selection and tag attachment are illustrated using a preferred class of tags, namely fluorescent tags.
  • the fluorescence probes or quenchers attached to the polymerase or dNTPs are designed to minimize adverse effects on the DNA polymerization reaction.
  • the inventors have developed synthetic methods for chemically tagging the polymerase and dNTPs with fluorescence probes or quenchers.
  • the polymerase is tagged by replacing a selected amino acid codon in the DNA sequence encoding the polymerase with a codon for an amino acid that more easily reacts with a molecular tag such as cysteine via mutagenesis.
  • a mutated DNA sequence is prepared, the mutant is inserted into E. coli for expression. After expression, the mutant polymerase is isolated and purified. The purified mutant polymerase is then tested for polymerase activity. After activity verification, the mutant polymerase is reacted with a slight molar excess of a desired tag to achieve near stoichiometric labeling.
  • the polymerase can be treated with an excess amount of the tag and labeling followed as a function of time. The tagging reaction is than stopped when near stoichiometric labeling is obtained.
  • the tagging reaction can also be carried out under special reaction conditions such as using a protecting group or competitive inhibitor and a reversible blocking group, which are later removed.
  • a protecting group or competitive inhibitor is first added to protect the target residue and a reversible blocking group is subsequently added to inactivate non-target residues.
  • the protecting group or competitive inhibitor is then removed from the target residue, and the mutant polymerase is treated with the desired tag to label the target residue.
  • the blocking groups are chemically removed from non-target residues in the mutant polymerase and removed to obtain a tagged mutant polymerase with the tag substantially to completely isolated on the target residue.
  • the polymerase can be treated with a blocking group to inactivate non-target residues. After removal of unreacted blocking group, the mutant polymerase is treated with the desired tag for labeling the target residue. Finally, the blocking groups are chemically removed from the non-target residues in the mutant polymerase and removed to obtain the tagged mutant polymerase.
  • the inventors have identified amino acids in the Taq polymerase that are likely to withstand mutation and subsequent tag attachment such as the attachment of a fluorescent tag. While many sites are capable of cysteine replacement and tag attachment, preferred sites in the polymerase were identified using the following criteria: (1) they are not in contact with other proteins; (2) they do not alter the conformation or folding of the polymerase; and (3) they are not involved in the function of the protein. The selections were accomplished using a combination of mutational studies including sequence analyses data, computational studies including molecular docking data and assaying for polymerase activity and fidelity. After site mutation, computational studies will be used to refine the molecular models and help to identify other potential sites for mutation.
  • Regions of the protein surface that are not important for function were identified, indirectly, by investigating the variation in sequence as a function of evolutionary time and protein function using the evolutionary trace method. See, e.g., Lichtarge et al., 1996. In this approach, amino acid residues that are important for structure or function are found by comparing evolutionary mutations and structural homologies.
  • the polymerases are ideal systems for this type of study, as there are many crystal and co-crystal structures and many available sequences. The inventors have excluded regions of structural/functional importance from sites selection for mutation/labeling.
  • visual inspection and overlays of available crystal structures of the polymerase in different conformational states provided further assistance in identifying amino acid sites near the binding site for dNTPs.
  • Some of the chosen amino acids sites are somewhat internally located and preferably surround active regions in the polymerase that undergo changes during base incorporation, such as the dNTP binding regions, base incorporation regions, pyrophosphate release regions, etc. These internal sites are preferred because a tag on these sites show reduced background signals during detection, i.e., reduce interaction between the polymerase enzyme and non-specifically associated tagged dNTPs, when fluorescently tagged dNTPs are used.
  • Another aspect of this invention involves the construction of molecular mechanics force field parameters for atomic and/or molecular tags such as fluorescent tags used to tag the dNTPs and the polymerase and parameters for the fluorescent tagged amino acid on the polymerase and/or dNTP.
  • Force field parameters are using quantum mechanical studies to obtain partial charge distributions and energies for relevant intramolecular conformations (i.e., for the dihedral angle definitions) derived from known polymerase crystal structures.
  • Ionization states of each ionizable residue are estimated using an electrostatic model in which the protein is treated as a low dielectric region and the solvent as a high dielectric, using the UHBD program. See, e.g., Antosiewicz et al., 1994; Briggs and Antosiewicz, 1999; Madura et al., 1995.
  • the electrostatic free energies of ionization of each ionizable residue are computed by solving the Poisson-Boltzmann equation for each residue. These individual ionization free energies are modified to take into account coupled titration behavior resulting in a set of self-consistent predicted ionization states.
  • an electrostatic potential map is generated from properties of the molecular surface of the Taq polymerase/DNA complex, screened by solvent and, optionally, by dissolved ions (i.e., ionic strength) using mainly the UHBD program.
  • the map provides guidance about binding locations for the dNTPs and the electrostatic environment at proposed mutation/labeling sites.
  • the molecular models generated are designed to be continually refined taking into account new experimental data, allowing the construction of improved molecular models, improved molecular dynamics calculations and improved force field parameters so that the models better predict system behavior for refining tag chemistry and/or tag positioning, predicting new polymerase mutants, base incorporation rates and polymerase fidelity.
  • HyperChem v6.0
  • the HyperChem software runs on PCs under a Windows operating system.
  • a number of computer programs for data analysis or for FRET prediction are and will be written on a PC using the Linux operating system and the UHBD program running under Linux.
  • Co-crystal structures solved for DNA polymerase I (DNA pol I) from E. coli, T. aquaticus, B. stearothermophilus , T7 bacteriophage, and human pol ⁇ demonstrate that (replicative) polymerases share mechanistic and structural features.
  • the structures that capture Taq DNA polymerase in an ‘open’ (non-productive) conformation and in a ‘closed’ (productive) conformation are of particular importance for identifying regions of the polymerase that undergo changes during base incorporation.
  • the addition of the nucleotide to the polymerase/primer/template complex is responsible for the transition from its open to its closed conformation.
  • the open and closed ternary complex forms of the large fragment of Taq DNA pol I (Klentaq 1) are shown in a superimposition of their C ⁇ tracings.
  • the ternary complex contains the enzyme, the ddCTP and the primer/template duplex DNA.
  • the open structure is shown in magenta and the closed structure is shown in yellow.
  • the disorganized appearance in the upper left portion of the protein shows movement of the ‘fingers’ domain in open and closed conformations.
  • amino acids represent preferred amino acid sites for cysteine replacement and subsequent tag attachment, because these sites represent the sites in the Taq polymerase the undergo significant changes in position during base incorporation.
  • amino acid site selection visualization of the polymerase in its open and closed conformational extremes for these identified amino acid sites is used so that the final selected amino acid sites maximize signal and minimize background noise, when modified to carry fluorescent tags for analysis using the FRET methodology.
  • Amino acid changes that are not predicted to significantly affect the protein's secondary structure or activity make up a refined set of amino acid sites in the Taq polymerase for mutagenesis and fluorescent modification so that the tag is shielded from interaction with free dNTPs.
  • the following three panels illustrate the protocol used in this invention to refine amino acid site selection from the about list of amino acids that undergo the largest change in position relative to a bound ddGTP as the polymerase transitions from the open to the closed form.
  • FIGS. 3A-C an overlay between 3 ktq (closed ‘black’) and 1 tau (open ‘light blue’), the large fragment of Taq DNA polymerase I is shown.
  • the bound DNA from 3 ktq is shown in red while the ddCTP bound to 3 ktq is in green.
  • Three residues were visually identified as moving the most when the polymerase goes from open (1 tau) to closed (3 ktq), namely, Asp655, Pro656, and Leu657. Based on further analyses of the structures, Pro656 appears to have the role of capping the O-helix. Leu657's side chain is very close to another part of the protein in the closed (3 ktq) form.
  • FIG. 3B a close-up view of the active site from the overlay of the 3 ktq (closed) and 1 tau (open) conformations of Taq polymerase is shown. The large displacements between the open and closed conformations are evident.
  • FIG. 3C a close-up view of a molecular surface representation of 3 ktq (in the absence of DNA and ddCTP). The molecular surface is colored in two areas, blue for Asp655 and green for Leu657.
  • Leu657 is in close proximity to another part of the protein, because the green part of the molecular surface, in the thumb domain, is “connected” to a part of the fingers domain.
  • This view shows this region of the polymerase looking into the palm of the hand with fingers to the right and thumb to the left.
  • the gene encoding Taq DNA polymerase was obtained and will be expressed in pTTQ 18 in E. coli strain DH1. See, e.g., Engelke et al., 1990.
  • the inventors have identified candidate amino acids for mutagenesis including the amino acids in Tables I-IV, the refined lists or mixtures or combinations thereof.
  • the inventors using standard molecular methods well-known in the art introduced a cysteine codon, individually, at each of target amino acid sites. See, e.g., Sambrook et al., 1989 and Allen et al., 1998.
  • DNA is purified from isolated colonies expressing the mutant polymerase, sequenced using dye-terminator fluorescent chemistry, detected on an ABI PRISM 377 Automated Sequencer, and analyzed using SequencherTM available from GeneCodes, Inc.
  • the inventors have demonstrated that the Taq polymerase is capable of incorporating ⁇ -tagged dNTPs to synthesize extended DNA sequences.
  • the next step involves the construction of mutants capable of carrying a tag designed to interact with the tags on the dNTPS and optimization of the polymerase for single-molecule sequencing.
  • the mutants are constructed using standard site specific mutagenesis as described above and in the experimental section.
  • the constructs are then inserted into and expressed in E. coli .
  • Mutant Taq polymerase is then obtained after sufficient E. coli is grown for subsequence polymerase isolation and purification.
  • E. coli can be grown to optical densities exceeding 100 by computer-controlled feedback-based supply of non-fermentation substrates, the resulting three kg of E. coli cell paste will be excessive during polymerase optimization.
  • optimized polymerases construct are prepared, then this large scale production will be used.
  • the mutants are derived from E. coli cell masses grown in 10 L well-oxygenated batch cultures using a rich medium available from Amgen.
  • the mutants are prepared by growing E. coli in 2 L baffled shake glasses. Cell paste are then harvested using a 6 L preparative centrifuge, lysed by French press, and cleared of cell debris by centrifugation. To reduce interference from E.
  • nucleic acid sequences it is preferably to also remove other nucleic acids. Removal is achieved using either nucleases (with subsequent heat denaturation of the nuclease) or, preferably using a variation of the compaction agent-based nucleic acid precipitation protocol as described in Murphy et al., Nature Biotechnology 17, 822, 1999.
  • Taq polymerase Because the thermal stability of Taq polymerase is considerably greater than typical E. coli proteins, purification of Taq polymerase or its mutants from contaminating Taq polymerase proteins is achieved by a simple heat treatment of the crude polymerase at 75° C. for 60 minutes, which reduces E. coli protein contamination by approximately 100-fold. This reduction in E. coli protein contamination combined with the high initial expression level, produces nearly pure Taq polymerase or its mutants in a convenient initial step; provide, of course, that the mutant polymerase retains the thermal stability of the native polymerase.
  • a single anion-exchange step typically on Q Sepharose at pH 8.0, is generally sufficient to produce a product pure enough to these tests.
  • a second purification step will also be performed to insure that contamination does not cloud the results of subsequent testing.
  • the second purification step involves SDS-PAGE and CD-monitored melting experiments.
  • the dNTPs are tagged either by reacting a dNTP with a desired tag or by reacting a precursor such as the pyrophosphate group or the base with a desired tag and then completing the synthesis of the dNTP.
  • the inventors have developed syntheses for modifying fluorophore and fluorescence energy transfer compounds to have distinct optical properties for differential signal detection, for nucleotide/nucleoside synthons for incorporation of modifications on base, sugar or phosphate backbone positions, and for producing complementary sets of four deoxynucleotide triphosphates (dNTPs) containing substituents on nucleobases, sugar or phosphate backbone.
  • dNTPs deoxynucleotide triphosphates
  • the native Taq polymerase is capable of polymerizing phosphate-modified dNTPs or ddNTPs. Again, tagging the dNPTs or ddNTPs at the beta and/or gamma phosphate groups is a preferred because the replicated DNA contains no unnatural bases, polymerase activity is not significantly adversely affected and long DNA strands are produced.
  • the inventors have synthesized ⁇ -ANS-phosphate dNTPs, where the ANS is attached to the phosphate through a phosphamide bond. Although these tagged dNTPs are readily incorporated by the native Taq polymerase and by HIV reverse transcriptase, ANS is only one of a wide range of tags that can be attached through either the ⁇ and/or ⁇ phosphate groups.
  • the present invention uses tagged dNTPs or ddNTPs in combination with polymerase for signal detection.
  • the dNTPs are modified at phosphate positions (alpha, beta and/or gamma) and/or other positions of nucleotides through a covalent bond or affinity association.
  • the tags are designed to be removed from the base before the next monomer is added to the sequence.
  • One method for removing the tag is to place the tag on the gamma and/or beta phosphates.
  • the tag is removed as pyrophosphate dissociates from the growing DNA sequence.
  • Another method is to attach the tag to a position of on the monomer through a cleavable bond.
  • the tag is then removed after incorporation and before the next monomer incorporation cleaving the cleavable bond using light, a chemical bond cleaving reagent in the polymerization medium, and/or heat.
  • FR is a fluorescent tag
  • L is a linker group
  • X is either H or a counterion depending on the pH of the reaction medium
  • Z is a group capable of reaction with the hydroxyl group of the pyrophosphate and Z′ is group after reaction with the dNMP.
  • Z is Cl, Br, I, OH, SH, NH 2 , NHR, CO 2 H, CO 2 R, SiOH, SiOR, GeOH, GeOR, or similar reactive functional groups, where R is an alkyl, aryl, aralkyl, alkaryl, halogenated analogs thereof or hetero atom analogs thereof and Z′ is O, NH, NR, CO 2 , SiO, GeO, where R is an alkyl, aryl, aralkyl, alkaryl, halogenated analogs thereof or hetero atom analogs thereof.
  • the synthesis involves reacting Z terminated fluorescent tag, FR-L-Z with a pyrophosphate group, P 2 O 6 X 3 H, in DCC and dichloromethane to produce a fluorescent tagged pyrophosphate.
  • a fluorescent tagged pyrophosphate is prepared, it is reacted with a morpholine terminated dNMP in acidic THF to produce a dNTP having a fluorescent tag on its ⁇ -phosphate. Because the final reaction bears a fluorescent tag and is larger than starting materials, separation from unmodified starting material and tagged pyrophosphate is straight forward.
  • Fluorescein (FR) is first reacted with isobutyryl anhydride in pyridine in the presence of diisopropyl amine to produce a fluorescein having both ring hydroxy groups protected for subsequent linker attachment.
  • the hydroxy protected fluorescein is then reacted with N-hydroxylsuccinimide in DCC and dichloromethane to produce followed by the addition of 1-hydroxy-6-amino hexane to produce an hydroxy terminated FR-L group.
  • This group can then be reacted either with pyrophosphate to tag the dNTPs at their ⁇ -phosphate group or to tag amino acids. See, e.g., Ward et al., 1987; Engelhardt et al., 1993; Little et al., 2000; Hobbs, 1991.
  • tags can be designed so that each tag emits a distinguishable emission spectrum.
  • the emission spectra can be distinguished by producing tags with non-overlapping emission frequencies—multicolor—or each tag can have a non-overlapping spectral feature such a unique emission band, a unique absorption band and/or a unique intensity feature.
  • Systems that use a distinguishable tag on each dNTP improves confidence values associated with the base calling algorithm.
  • fluorescein is adaptable to other dyes as well such as tetrachlorofluorescein (JOE) or N,N,N′,N′-tetramethyl-6-carboxyrhodamine (TAMRA).
  • JE tetrachlorofluorescein
  • TAMRA N,N,N′,N′-tetramethyl-6-carboxyrhodamine
  • the gamma phosphate tagged reactions are carried out in basic aqueous solutions and a carbodiimide, such as DEC.
  • DEC carbodiimide
  • Other fluorophore molecules and dNTPs can be similarly modified.
  • the dNTPs can also be tagged on the base and/or sugar moieties while maintaining their polymerase reaction activity.
  • the sites for modifications are preferably selected to not interfere with Watson-Crick base pairing. A generalized scheme for base modification is shown below:
  • polymerase variants The activities of polymerase variants are monitored throughout polymerase development. Polymerase activity is assayed after a candidate amino acid is mutated to cysteine and after fluorescent tagging of the cysteine.
  • the assay used to monitor the ability of the native Taq polymerase to incorporate fluorescently-tagged dNTPs is also used to screen polymerase variants. Since the mutant Taq polymerases have altered amino acid sequences, the assays provide mutant characterization data such as thermostability, fidelity, polymerization rate, affinity for modified versus natural bases.
  • Mutant Taq polymerase activity assays are carried out under conditions similar to those used to examine the incorporation of fluorescently-tagged dNTPs into DNA polymers by the native Taq polymerase.
  • the purified mutant Taq polymerase is incubated in polymerase reaction buffer with a 5′- 32 P end-labeled primer/single-stranded template duplex, and appropriate tagged dNTP(s).
  • the polymerase's ability to incorporate a fluorescently-tagged dNTP is monitored by assaying the relative amount of fluorescence associated with the extended primer on either an AB1377 DNA Sequencer (for fluorescently tagged bases), a Fuji BAS 1000 phosphorimaging system, or other appropriate or similar detectors or detection systems.
  • This assay is used to confirm that the mutant polymerase incorporates tagged dNT ⁇ and to confirm that fluorescent signatures are obtained during base incorporation.
  • These assays use an end-labeled primer, the fluorescently-tagged dNT ⁇ and the appropriate base beyond the fluorescent tag.
  • the products are then size separated and analyzed for extension. Reactions are either performed under constant temperature reaction conditions or thermocycled, as necessary.
  • Taq DNA polymerase to incorporate a ⁇ -phosphate dNTP variant is assayed using conditions similar to those developed to examine single base incorporation by a fluorescently-tagged DNA polymerase. See, e.g., Furey et al., 1998. These experiments demonstrate that polymerases bearing a fluorescent tag do not a priori have reduced polymerization activity. The inventors have demonstrated that the native Taq polymerase incorporates ⁇ -tagged dNTP, singly or collectively to produce long DNA chains.
  • polymerase reaction buffer such as Taq DNA polymerase buffer available from Promega Corporation of Madison, Wis. with either a 5′- 32 P or a fluorescently end-labeled primer (TOP)/single-stranded template (BOT-‘X’) duplex, and appropriate dNTP(s) as shown in Table V. Reactions are carried out either at constant temperature or thermocycled, as desired or as is necessary. Reaction products are then size-separated and quantified using a phosphorimaging or fluorescent detection system. The relative efficiency of incorporation for each tagged dNTP is determined by comparison with its natural counterpart.
  • polymerase reaction buffer such as Taq DNA polymerase buffer available from Promega Corporation of Madison, Wis.
  • TOP fluorescently end-labeled primer
  • BOT-‘X’ single-stranded template
  • TOP represents the primer strand of an assay DNA duplex.
  • Variants of the template strand are represented by ‘BOT’.
  • the relevant feature of the DNA template is indicated after the hyphen.
  • BOT-T, BOT-C, BOT-G, BOT-A are used to monitor polymerase incorporation efficiency and fidelity for either nucleotides or nucleotide variants of dA, dG, dC, and dT, respectively.
  • Preliminary assays are performed prior to exhaustive purification of the tagged dNTP to ensure that the polymerase is not inhibited by a chemical that co-purifies with the tagged dNTP, using the ‘BOT-Sau’ template.
  • the ‘BOT-Sau’ template was designed to monitor incorporation of natural dGTP prior to tagged dATP (i.e., a positive control for polymerase activity). More extensive purification is then performed for promising tagged nucleotides.
  • experiments are carried out to determine whether the polymerase continues extension following incorporation of the tagged dNTPs, individually or collectively, using the same end-labeled ‘TOP’ primer, the appropriate ‘BOT’ primer, the fluorescently-tagged dNTP, and the appropriate base 3′ of the tagged nucleotide.
  • the products are then size-separated and analyzed to determine the relative extension efficiency.
  • the Taq DNA polymerase lacks 3′ to 5′ exonuclease activity (proofreading activity). If the polymerase used in single-molecule DNA sequencing possessed a 3′ to 5′ exonuclease activity, the polymerase would be capable of adding another base to replace one that would be removed by the proofreading activity. This newly added base would produce a signature fluorescent signal evidencing the incorporation of an additional base in the template, resulting in a misidentified DNA sequence, a situation that would render the single-molecule sequencing systems of this invention problematic.
  • the sequencing reactions are preferably run in parallel, with the optimal number required to produce sequence information with a high degree of confidence for each base call determined by the error rate. Larger error rates require more parallel run, while smaller error rates require fewer parallel runs. In fact, if the error rate is low enough, generally less than 1 error in 1,000, preferably 1 error in 5,000 and particularly 1 error in 10,000 incorporated base, then no parallel runs are required. Insertions or deletions are, potentially, more serious types of errors and warrant a minimal redundancy of 3 repeats per sample. If 2 reactions were run, one could not be certain which was correct. Thus, 3 reactions are needed for the high quality data produced by this system.
  • the BOT-variant templates are used to characterize the accuracy at which each ⁇ -tagged dNTP is incorporated by an engineered polymerase as set forth in Table V.
  • Oligonucleotides serve as DNA templates, and each differing only in the identity of the first base incorporated. Experiments using these templates are used to examine the relative incorporation efficiency of each base and the ability of the polymerase to discriminate between the tagged dNTPs.
  • experiments with polymerase variants are carried out using relatively simple-sequence, single-stranded DNA templates.
  • a wide array of sequence-characterized templates is available from the University of Houston in Dr. Hardin's laboratory, including a resource of over 300 purified templates.
  • one series of templates contains variable length polyA or polyT sequences. Additional defined-sequence templates are constructed as necessary, facilitating the development of the base-calling algorithms.
  • Direct detection of polymerase action on the tagged dNTP is obtained by solution fluorescence measurements, using SPEX 212 instrument or similar instrument.
  • This instrument was used to successfully detect fluorescent signals from ANS tagged ⁇ -phosphate dNTPs, being incorporated by Taq polymerase at nanomolar concentration levels.
  • the SPEX 212 instrument includes a 450 watt xenon arc source, dual emission and dual excitation monochromators, cooled PMT (recently upgraded to simultaneous T-format anisotropy data collection), and a Hi-Tech stopped-flow accessory.
  • This instrument is capable of detecting an increase in fluorescence intensity and/or change in absorption spectra upon liberation of the tagged pyrophosphate from ANS tagged ⁇ -phosphate dNTPs, as was verified for ANS-pyrophosphate released by Taq and RNA polymerase and venom phosphodiesterase.
  • the detection of fluorescence from single molecules is preferably carried out using microscopy. Confocal-scanning microscopy can be used in this application, but a non-scanning approach is preferred.
  • a microscope useful for detecting fluorescent signals due to polymerase activity include any type of microscope, with oil-immersion type microscopes being preferred.
  • the microscopes are preferably located in an environment in which vibration and temperature variations are controlled, and fitted with a highly-sensitive digital camera. While many different cameras can be to record the fluorescent signals, the preferred cameras are intensified CCD type cameras such as the iPentaMax from Princeton Instruments.
  • the method of detection involves illuminating the samples at wavelengths sufficient to induce fluorescence of the tags, preferably in an internal-reflection format. If the fluorescent tags are a donor-acceptor pair, then the excitation frequency must be sufficient to excite the donor.
  • the preferred light source is a laser. It will often be advantageous to image the same sample in multiple fluorescence emission wavelengths, either in rapid succession or simultaneously. For simultaneous multi-color imaging, an image splitter is preferred to allow the same CCD to collect all of the color images simultaneously. Alternatively, multiple cameras can be used, each viewing the sample through emission optical filters of different wavelength specificity.
  • Tag detection in practice, of course, depends upon many variables including the specific tag used as well electrical, fluorescent, chemical, physical, electrochemical, mass isotope, or other properties.
  • Single-molecule fluorescence imaging is obtainable employing a research-grade Nikon Diaphot TMD inverted epifluorescence microscope, upgraded with laser illumination and a more-sensitive camera.
  • single-molecule technology is a well-developed and commercially available technology.
  • the epifluorescence microscope can be retrofitted for evanescent-wave excitation using an argon ion laser at 488 nm.
  • the inventors have previously used this illumination geometry in assays for nucleic acid hybridization studies.
  • the existing setup has also been upgraded by replacement of the current CCD camera with a 12-bit 512.times.512 pixel Princeton Instruments I-PentaMAX generation IV intensified CCD camera, which has been used successfully in a variety of similar single-molecule applications. This camera achieves a quantum efficiency of over 45% in the entire range of emission wavelengths of the dyes to be used, and considerably beyond this range.
  • the vertical alignment of their existing microscope tends to minimize vibration problems, and the instrument is currently mounted on an anti-vibration table.
  • a preferred high-sensitivity imaging system is based on an Olympus IX70-58F inverted epifluorescence microscope.
  • the system incorporates low-background components and enables capture of single molecule fluorescence images at rates of greater than 80 frames per second with quantum efficiency between 60-70% in the range of emission wavelengths of the fluorescently active tags.
  • a data collection channel such as a single pixel or pixel-bin of the viewing field of the CCD or other digital imaging system.
  • a finite number of data collection channels such as pixels are available in any given digital imaging apparatus. Randomly-spaced, densely-positioned fluorescent emitters generally produce an increased fraction of pixels or pixel bins that are multiply-occupied and problematic in data analysis. As the density of emitters in the viewing field increases so does the number of problematic data channels. While multiple occupancy of distinguishable data collection regions within the viewing field can be reduced by reducing the concentration of emitters in the viewing field, this decrease in concentration of emitters increases the fraction of data collection channels or pixels that see no emitter at all, therefore, leading to inefficient data collection.
  • a preferred method for increasing and/or maximizing the data collection efficiency involves controlling the spacing between emitters (tagged polymerase molecules). This spacing is achieved in a number of ways.
  • the polymerases can be immobilized on a substrate so that only a single polymerase is localized within each data collection channel or pixel region within the viewing field of the imaging system.
  • the immobilization is accomplished by anchoring a capture agent or linking group chemically attached to the substrate.
  • Capture or linking agents can be spaced to useful distances by choosing inherently large capture agents, by conjugating them with or bonding them to molecules which enhance their steric bulk or electrostatic repulsion bulk, or by immobilizing under conditions chosen to maximize repulsion between polymerizing molecular assembly (e.g., low ionic strength to maximize electrostatic repulsion).
  • the polymerase can be associated with associated proteins that increase the steric bulk of the polymerase or the electrostatic repulsion bulk of the polymerizing system so that each polymerizing molecular assembly cannot approach any closer than a distance greater than the data channel resolution size of the imaging system.
  • assays are performed essentially as described in for polymerase activity assays described herein.
  • the primary difference between assaying polymerase activity for screening purposes involves the immobilization of some part of the polymerizing assembly such as the polymerase, target DNA or a primer associated protein to a solid support to enable viewing of individual replication events.
  • immobilization options are available, including, without limitation, covalent and/or non-covalent attachment of one of the molecular assemblies on a surface such as an organic surface, an inorganic surface, in or on a nanotubes or other similar nano-structures and/or in or on porous matrices.
  • a preferred data collection method for single-molecule sequencing is to ensure that the fluorescently tagged polymerases are spaced apart within the viewing field of the imagining apparatus so that each data collection channel sees the activity of only a single polymerase.
  • the raw data generated by the detector represents between one to four time-dependent fluorescence data streams comprising wavelengths and intensities: one data stream for each fluorescently labeled base being monitored. Assignment of base identities and reliabilities are calculated using the PHRED computer program. If needed, the inventors will write computer programs to interpret the data streams having partial and overlapping data. In such cases, multiple experiments are run so that confidence limits are assigned to each base identity according to the variation in the reliability indices and the difficulties associated with assembling stretches of sequence from fragments. The reliability indices represent the goodness of the fit between the observed wavelengths and intensities of fluorescence compared with ideal values. The result of the signal analyses is a linear DNA sequence with associated probabilities of certainty.
  • the data is stored in a database for dynamic querying for identification and comparison purposes.
  • a search term sequence of 6-10, 11-16, 17-20, 21-30 bases can be compared against reference sequences to quickly identify perfectly matched sequences or those sharing a user defined level of identity.
  • Multiple experiments are run so that confidence limits can be assigned to each base identity according to the variation in the reliability indices and the difficulties associated with assembling stretches of sequence from fragments.
  • the reliability indices represent the goodness of the fit between the observed wavelengths and intensities of fluorescence compared with the ideal values.
  • the result of the signal analyses is a linear DNA sequence with associated probabilities of certainty.
  • Data collection allows data to be assembled from partial information to obtain sequence information from multiple polymerase molecules in order to determine the overall sequence of the template or target molecule.
  • An important driving force for convolving together results obtained with multiple single-molecules is the impossibility of obtaining data from a single molecule over an indefinite period of time.
  • a typical dye photobleaching efficiency of 2*10 ⁇ 5 a typical dye molecule is expected to undergo 50,000 excitation/emission cycles before permanent photobleaching.
  • Data collection from a given molecule may also be interrupted by intersystem crossing to an optically inactive (on the time scales of interest) triplet state. Even with precautions against photobleaching, therefore, data obtained from any given molecule is necessarily fragmentary for template sequences of substantial length, and these subsequences are co-processed in order to derive the overall sequence of a target DNA molecule.
  • reaction with reference controls, similar to microarray assays. Comparison of signal(s) between the reference sequence and the test sample are used to identify differences and similarities in sequences or sequence composition. Such reactions can be used for fast screening of DNA polymers to determine degrees of homololgy between the polymers, to determine polymorphisms in DNA polymers, or to identity pathogens.
  • Taq pol I Bacteriophage lambda host strain Charon 35 harboring the full-length of the Thermus aquaticus gene encoding DNA polymerase I (Taq pol I) was obtained from the AMERICAN TYPE CULTURE COLLECTION (ATCC; Manassas, Va.). Taq pol I was amplified directly from the lysate of the infected E. coli host using the following DNA oligonucleotide primers:
  • Taq Pol I forward (SEQ ID NO: 8) 5′-gcgaattcat gagggggatg ctgcccctct ttgagccc-3′
  • Taq Pol I reverse (SEQ ID NO: 9) 5′-gcgaattcac cctccttggc ggagcgccag tcctcc-3′
  • each synthetic DNA oligonucleotide represents engineered EcoRI restriction sites immediately preceding and following the Taq pol I gene.
  • PCR amplification using the reverse primer described above and the following forward primer created an additional construct with an N-terminal deletion of the gene:
  • the underlined segment corresponds to an engineered NcoI restriction site with the first codon encoding for an alanine (ATG start representing an expression vector following the ribosome binding site).
  • ATG start representing an expression vector following the ribosome binding site.
  • the full-length and truncated constructs of the Taq pol I gene is ligated to a single EcoRI site (full-length) and in an Ncoo/EcoRI digested pRSET-b expression vector.
  • E. coli strain JM109 is used as host for all in vivo manipulation of the engineered vectors.
  • Overlapping primers are used to introduce point mutations into the native gene by PCR based mutagenesis (using Pfu DNA polymerase).
  • Complementary forward and reverse primers each contain a codon that encodes the desired mutated amino acid residue.
  • PCR using these primers results in a nicked, non-methylated, double-stranded plasmid containing the desired mutation.
  • DpnI restriction enzyme cuts at methylated guanosines in the sequence GATC.
  • ALANINE 643 TO CYSTEINE REPLACEMENT Taq Pol I_Ala643Cys_fwd (SEQ ID NO: 13) 5′-C CAC ACG GAG ACC tgC AGC TGG ATG TTC GGC G-3′ Taq PolI_Ala643Cys_rev (SEQ ID NO: 14) 5′-C GCC GAA CAT CCA CGA Gca GGT CTC CGT GTG G-3′ PHENYLALANINE 647 TO CYSTEINE REPLACEMENT Taq Pol I_Phe647Cys_fwd (SEQ ID NO: 15) 5′-CC GCC AGC TGG ATG TgC GGC GTC CCC CGG GAG GCC-3′ Tag Pol I_Phe647Cys_rev (SEQ ID NO: 16) 5′-GGC CTC CCG GGG GAC GCC GcA CAT CCA CGT GGC GG-3′ VALINE 649 TO CYSTEINE REPL
  • the resulting mutant Taq polymerases are then reacted with a desired atomic or molecular tag to tag the cysteine in the mutant structure through the SH group of the cysteine residue and screened for native and/or tagged dNTP incorporation and incorporation efficiency.
  • the mutant polymerases are also screened for fluorescent activity during base incorporation.
  • the present invention also relates to mutant Taq polymerase having a cysteine residue added one or more of the sites selected from the group consisting of 513-518, 643, 647, 649 and 653-661. After cysteine replacement and verification of polymerase activity using the modified dNTPs, the mutant Taq polymerases are reacted with a tag through the SH group of the inserted cysteine residue.
  • the resulting amino acid replacement for the positions 513-518, 643, 647, 649 and 653-661 are shown below:
  • Nucleotide analogs which contain fluorophore 1-aminonaphalene-5-sulfonate attached to the ⁇ -phosphate bond were synthesized (J. Biol. Chem. 254, 12069-12073, 1979).
  • dATP analog-( ⁇ -AmNS) dATP was synthesized according to the procedures slightly altered from what was described by Yarbrough and co-workers for ( ⁇ -AmNS)ATP with some modifications.
  • This example illustrates the preparation of gamma ANS tagged dATP, shown graphically in FIG. 4 .
  • reaction was diluted to 50 mL and adjusted to a solution of 0.05 M triethylammonium bicarbonate buffer (TEAB, pH ⁇ 7.5).
  • TEAB triethylammonium bicarbonate buffer
  • the reaction product was chromatographed on a 50 mL DEAE-SEPHADEX® ion exchanger (A-25-120) column at low temperature that was equilibrated with ⁇ pH 7.5 1.0 M aqueous TEAB (100 mL), 1.0 M aqueous sodiumbicarbonate (1100 mL), and ⁇ pH 7.5, 0.05 Maqueous TEAB (100 mL).
  • the column was eluted with a linear gradient of ⁇ pH 7.5 aqueous TEAB from 0.05 to 0.9 M.
  • the first example illustrates the incorporation of ANS- ⁇ -phosphate DATP to produce extended DNA products from primer templates.
  • the reactions were carried out in extension buffer and the resulting radiolabeled products were size separated on a 20% denaturing polyacrylamide gel. Data was collected using a phosphorimaging system.
  • Lane 1 contains 5′ radiolabeled ‘TOP’ probe in extension buffer.
  • Lane 2 contains Taq DNA polymerase, 50 ⁇ M dGTP incubated with a DNA duplex (radiolabeled TOP with excess ‘BOT-Sau’).
  • Lane 3 contains Taq DNA polymerase, 50 ⁇ M dATP incubated with a DNA duplex (radiolabeled TOP with excess ‘BOT-Sau’).
  • Lane 4 contains Taq DNA polymerase, 50 ⁇ M ANS- ⁇ -dATP incubated with a DNA duplex (radiolabeled TOP with excess ‘BOT-Sau’).
  • Lane 5 contains Taq DNA polymerase, 50 ⁇ M dGTP incubated with a DNA duplex (radiolabeled TOP with excess ‘BOT-T’).
  • Lane 6 contains spill-over from lane 5.
  • Lane 7 contains Taq DNA polymerase, 50 ⁇ M dATP incubated with a DNA duplex (radiolabeled TOP with excess ‘BOT-T’).
  • Lane 8 contains Taq DNA polymerase, 50 ⁇ M ANS- ⁇ -dATP incubated with a DNA duplex (radiolabeled TOP with excess ‘BOT-T’).
  • Lane 9 contains Taq DNA polymerase, 50 ⁇ M dGTP incubated with a DNA duplex (radiolabeled TOP with excess ‘BOT-3T’).
  • Lane 10 contains Taq DNA polymerase, 50 ⁇ M dATP incubated with a DNA duplex (radiolabeled TOP with excess ‘BOT-3T’).
  • Lane 11 contains Taq DNA polymerase, ANS- ⁇ -dATP incubated with a DNA duplex (radiolabeled TOP with excess ‘BOT-3T’).
  • Lane 12 contains 5′ radiolabeled ‘TOP’ probe in extension buffer.
  • Lane 13 contains 5′ radiolabeled ‘TOP’ probe and Taq DNA polymerase in extension buffer. Oligonucleotide sequences are shown in Table V.
  • Quantitative comparison of lane 1 with lane 4 demonstrates that very little non-specific, single-base extension was detected when ANS- ⁇ -dATP was included in the reaction, but the first incorporated base should be dGTP (which was not added to the reaction).
  • Quantitative analysis of lanes 1 and 8 demonstrates that approximately 71% of the TOP primer are extended by a template-directed single base when ANS- ⁇ -dATP was included in the reaction and the first incorporated base should be dATP.
  • Taq DNA polymerase incorporates ⁇ -tagged nucleotides. Equally important to the polymerase's ability to incorporate a ⁇ -tagged nucleotide is its ability to extend the DNA polymer after the modified DATP was incorporated.
  • This next example illustrates the synthesis of extended DNA polymers using all four ANS tagged ⁇ -phosphate dNTPs. Products generated in these reactions were separated on a 20% denaturing polyacrylamide gel, the gel was dried and imaged following overnight exposure to a Fuji BAS 1000 imaging plate. Referring now to FIG. 6 , an image of (A) the actual gel, (B) a lightened phosphorimage and (C) an enhanced phosphorimage. Lane descriptions for A, B, and C follow: Lane 1 is the control containing purified 10-base primer extended to 11 and 12 bases by template-mediated addition of ⁇ - 32 P dCTP. Lane 2 includes the same primer that was incubated with double-stranded plasmid DNA at 96° C.
  • Lane 3 includes the same labeled primer that was incubated with double-stranded DNA plasmid at 96° C. for 3 minutes, the reaction was DNA polymerase and all four gamma-modified DNTPs (100 uM, each) were added and the reaction was incubated at 37° C. for 60 minutes.
  • Lane 4 includes the control, purified 10-base primer that was extended to 11 and 12 bases by the addition of ⁇ - 32 P-dCTP was cycled in parallel with lanes 5-8 reactions.
  • Lane 5 includes the same 32 P-labeled primer that was incubated with double-stranded plasmid DNA at 96° C. for 3 minutes, the reaction was brought to 37° C. for 10 minutes, during which time Taq DNA polymerase and all four natural dNTPs (100 uM, each) were added. The reaction was cycled 25 times at 96° C. for 10 seconds, 37° C. for 1 minute, and 70° C. for 5 minutes.
  • Lane 6 includes the same 32 P-labeled primer that was incubated with double-stranded plasmid DNA at 96° C.
  • the reaction was brought to 37° C. for 10 minutes, during which time Taq DNA polymerase and all four gamma-modified dNTPs (100 uM, each) were added.
  • the reaction was cycled 25 times at 96° C. for 10 seconds, 37° C. for 1 minute, and 70° C. for 5 minutes.
  • Lane 7 includes nonpurified, 10-base, 32 P-labeled primer that was incubated with double-stranded DNA plasmid at 96° C. for 3 minutes, the reaction was brought to 37° C. for 10 minutes, during which time Taq DNA polymerase and all four natural dNTPs (100 uM, each) were added.
  • the reaction was cycled 25 times at 96° C. for 10 seconds, 37° C.
  • Lane 8 includes nonpurified, 10-base, 32 P-labeled primer that was incubated with double-stranded DNA plasmid at 96° C. for 3 minutes, the reaction was brought to 37° C. for 10 minutes, during which time Taq DNA polymerase and all four gamma-modified dNTPs were added. The reaction was cycled 25 times at 96° C. for 10 seconds, 37° C. for 1 minute, and 70° C. for 5 minutes.
  • Evident in the reactions involving tagged dNTPs is a substantial decrease in pyrophosphorolysis as compared to reactions involving natural nucleotides.
  • This next example illustrates the synthesis of long DNA polymers using all four ANS tagged ⁇ -phosphate dNTPs.
  • Each primer extension reaction was split into two fractions, and one fraction was electrophoresed through a 20% denaturing gel (as described above), while the other was electrophoresed through a 6% denaturing gel to better estimate product lengths.
  • the gel was dried and imaged (overnight) to a Fuji BAS 1000 imaging plate.
  • FIG. 7 an image of (A) the actual gel, (B) a lightened phosphorimage of the actual gel, and (C) an enhanced phosphorimage of the actual gel.
  • Lane descriptions for A, B, and C follow: Lane 1 includes 123 Marker with size standards indicated at the left of each panel.
  • Lane 2 contains the control, purified 10-base primer extended to 11 and 12 bases by template-mediated addition of ⁇ - 32 P dCTP.
  • Lane 3 contains the same 32 P-labeled primer that was incubated with double-stranded plasmid DNA at 96° C. for 3 minutes (to denature template), the reaction was brought to 37° C. (to anneal primer-template), Taq DNA polymerase and all four natural dNTPs (100 uM, each) were added and the reaction was incubated at 37° C. for 60 minutes.
  • Lane 4 includes the same 32 P-labeled primer that was incubated with double-stranded DNA plasmid at 96° C.
  • Lane 5 includes the control, purified 10-base primer that was extended to 11 and 12 bases by the addition of ⁇ - 32 P-dCTP was cycled in parallel with lanes 5-8 reactions.
  • Lane 6 includes the same 32 P-labeled primer that was incubated with double-stranded plasmid DNA at 96° C. for 3 minutes, the reaction was brought to 37° C. for 10 minutes, during which time Taq DNA polymerase and all four natural dNTPs (100 uM, each) were added.
  • the reaction was cycled 25 times at 96° C. for 10 seconds, 37° C. for 1 minute, and 70° C. for 5 minutes.
  • Lane 7 includes the same 32 P-labeled primer that was incubated with double-stranded plasmid DNA at 96° C. for 3 minutes, the reaction was brought to 37° C. for 10 minutes, during which time Taq DNA polymerase and all four gamma-modified dNTPs (100 uM, each) were added.
  • the reaction was cycled 25 times at 96° C. for 10 seconds, 37° C. for 1 minute, and 70° C. for 5 minutes.
  • Lane 8 includes nonpurified, 10-base, 32 P-labeled primer that was incubated with double-stranded DNA plasmid at 96° C. for 3 minutes, the reaction was brought to 37° C. for 10 minutes, during which time Taq DNA polymerase and all four natural dNTPs (100 uM, each) were added. The reaction was cycled 25 times at 96° C. for 10 seconds, 37° C. for 1 minute, and 70° C. for 5 minutes. Lane 9 includes nonpurified, 10-base, 32 P-labeled primer that was incubated with double-stranded DNA plasmid at 96° C. for 3 minutes, the reaction was brought to 37° C.
  • extension products in this reaction are several hundred bases long for both natural and ⁇ -modified dNTPs, and a significant percentage of these products are too large to enter the gel.
  • gamma phosphate tagged dNTPs are used by Taq polymerase to generate long DNA polymers that are non-tagged or native DNA polymer chains.
  • the indicated enzyme (Taq DNA Polymerase, SEQUENASE®, HIV-1 Reverse Transcriptase, T7 DNA Polymerase, Klenow Fragment, Pfu DNA Polymerase) were incubated in the manufacturers suggested reaction buffer, 50 ⁇ M of the indicated nucleotide at 37° C. for 30-60 minutes, and the reaction products were analyzed by size separation through a 20% denaturing gel.
  • Taq DNA polymerase efficiently uses the gamma-modified nucleotides to synthesize extended DNA polymers at increased accuracy as shown in FIGS. 4-6 .
  • the Klenow fragment from E. coli DNA polymerase I efficiently uses the gamma-modified nucleotides, but does not exhibit the extreme fidelity improvements observed with other enzymes as shown in FIG. 8 .
  • Pfu DNA polymerase does not efficiently use gamma-modified nucleotides and is, thus, not a preferred enzyme for the single-molecule sequencing system as shown in FIG. 9 .
  • HIV-1 reverse transcriptase efficiently uses the gamma-tagged nucleotides, and significant fidelity improvement results as shown in FIG. 10 .
  • the fidelity of nucleic acid synthesis is a limiting factor in achieving amplification of long target molecules using PCR.
  • the misincorporation of nucleotides during the synthesis of primer extension products limits the length of target that can be efficiently amplified.
  • the effect on primer extension of a 3′-terminal base that is mismatched with the template is described in Huang et al., 1992, Nucl. Acids Res. 20:4567-4573, incorporated herein by reference.
  • the presence of misincorporated nucleotides may result in prematurely terminated strand synthesis, reducing the number of template strands for future rounds of amplification, and thus reducing the efficiency of long target amplification.
  • the present invention provides an improved PCR system for generating increased extension length PCR amplified DNA products comprising contacting a native Taq polymerase with gamma tagged dNTPs of this invention under PCR reaction conditions.
  • the extended length PCR products are due to improved accuracy of base incorporation, resulting from the use of the gamma-modified dNTPs of this invention.
  • the solid line corresponds to reaction products produced when the four natural nucleotides (dATP, dCTP, dGTP and dTTP) are included in the synthesis reaction.
  • the dashed or broken line corresponds to reaction products produced when proprietary, base-modified nucleotides are included in the reaction.
  • sequence context and base modification(s) influence reaction product intensity and/or kinetics, and these identifying patterns are incorporated into proprietary base-calling software to provide a high confidence value for base identity at each sequenced position.

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Organic Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Genetics & Genomics (AREA)
  • Microbiology (AREA)
  • Molecular Biology (AREA)
  • Biotechnology (AREA)
  • Biophysics (AREA)
  • Analytical Chemistry (AREA)
  • Biochemistry (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Immunology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Apparatus Associated With Microorganisms And Enzymes (AREA)
  • Enzymes And Modification Thereof (AREA)
  • Soil Working Implements (AREA)
  • Investigating, Analyzing Materials By Fluorescence Or Luminescence (AREA)
  • Investigating Or Analysing Biological Materials (AREA)
  • Investigating Or Analysing Materials By The Use Of Chemical Reactions (AREA)
  • Saccharide Compounds (AREA)
  • Error Detection And Correction (AREA)
  • Color Television Systems (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)
  • Oscillators With Electromechanical Resonators (AREA)

Abstract

A sequencing methodology is disclosed that allows a single DNA or RNA molecule or portion thereof to be sequenced directly and in substantially real time. The methodology involves engineering a polymerase and/or dNTPs with atomic and/or molecular tags that have a detectable property that is monitored by a detection system.

Description

  • This application is a continuation of U.S. patent application Ser. No. 11/648,191, filed Dec. 29, 2006 and published as United States Patent Application Publication No. 2007/017286y A1 on Jul. 26, 2007, which is a continuation of U.S. application Ser. No. 09/901,782, filed Jul. 9, 2001 and published as United States Patent Application Publication No. 2003/0064366 A1 on Apr. 3, 2003, which claims provisional priority to U.S. Provisional Patent Application No. 60/216,594, filed Jul. 7, 2000, all of which are incorporated herein by reference in their entirety.
  • BACKGROUND OF THE INVENTION
  • The present invention relates to a single-molecule sequencing apparatus and methods.
  • More particularly, the present invention relates to a single-molecule sequencing apparatus and methods using tagged polymerizing agents and/or tagged monomers where the tagged polymerizing agent and/or the tagged monomers undergo a change in a detectable property before, during and/or after monomer insertion into a growing polymer chain. The apparatus and methods are ideally-suited for sequencing DNA, RNA, polypeptide, carbohydrate or similar bio-molecular sequences under near real-time or real-time conditions. The present invention also relates to a single-molecule sequencing apparatus and methods using tagged depolymerizing agents and/or tagged depolymerizable polymer where the tagged depolymerizing agent and/or the tagged depolymerizable polymer undergo a change in a detectable property before, during and/or after monomer removal from the depolymerizable polymer chain. The apparatus and methods are ideally-suited for sequencing DNA, RNA, polypeptide, carbohydrate or similar bio-molecular sequences. The present invention also relates to detecting a signal evidencing interactions between the tagged polymerizing agent or depolymerizing agent and a tagged or untagged polymer subunit such as a monomer or collection of monomers, where the detected signal provides information about monomer order. In a preferred embodiment, the methods are carried out in real-time or near real-time.
  • DESCRIPTION OF THE RELATED ART Overview of Conventional DNA Sequencing
  • The development of methods that allow one to quickly and reliably determine the order of bases or ‘sequence’ in a fragment of DNA is a key technical advance, the importance of which cannot be overstated. Knowledge of DNA sequence enables a greater understanding of the molecular basis of life. DNA sequence information provides scientists with information critical to a wide range of biological processes. The order of bases in DNA specifies the order of bases in RNA, the molecule within the cell that directly encodes the informational content of proteins. DNA sequence information is routinely used to deduce protein sequence information. Base order dictates DNA structure and its function, and provides a molecular program that can specify normal development, manifestation of a genetic disease, or cancer.
  • Knowledge of DNA sequence and the ability to manipulate these sequences has accelerated development of biotechnology and led to the development of molecular techniques that provide the tools to ask and answer important scientific questions. The polymerase chain reaction (PCR), an important biotechnique that facilitates sequence-specific detection of nucleic acid, relies on sequence information. DNA sequencing methods allow scientists to determine whether a change has been introduced into the DNA, and to assay the effect of the change on the biology of the organism, regardless of the type of organism that is being studied. Ultimately, DNA sequence information may provide a way to uniquely identify individuals.
  • In order to understand the DNA sequencing process, one must recall several facts about DNA. First, a DNA molecule is comprised of four bases, adenine (A), guanine (G), cytosine (C), and thymine (T). These bases interact with each other in very specific ways through hydrogen bonds, such that A interacts with T, and G interacts with C. These specific interactions between the bases are referred to as base-pairings. In fact, it is these base-pairings (and base stacking interactions) that stabilize double-stranded DNA. The two strands of a DNA molecule occur in an antiparallel orientation, where one strand is positioned in the 5′ to 3′ direction, and the other strand is positioned in the 3′ to 5′ direction. The terms 5′ and 3′ refer to the directionality of the DNA backbone, and are critical to describing the order of the bases. The convention for describing base order in a DNA sequence uses the 5′ to 3′ direction, and is written from left to right. Thus, if one knows the sequence of one DNA strand, the complementary sequence can be deduced.
  • Sanger DNA Sequencing (Enzymatic Synthesis)
  • Sanger sequencing is currently the most commonly used method to sequence DNA (Sanger et al., 1977). This method exploits several features of a DNA polymerase: its ability to make an exact copy of a DNA molecule, its directionality of synthesis (5′ to 3′), its requirement of a DNA strand (a ‘primer’) from which to begin synthesis, and its requirement for a 3′ OH at the end of the primer. If a 3′ OH is not available, then the DNA strand cannot be extended by the polymerase. If a dideoxynucleotide (ddNTP; ddATP, ddTTP, ddGTP, ddCTP), a base analog lacking a 3′ OH, is added into an enzymatic sequencing reaction, it is incorporated into the growing strand by the polymerase. However, once the ddNTP is incorporated, the polymerase is unable to add any additional bases to the end of the strand. Importantly, ddNTPs are incorporated by the polymerase into the DNA strand using the same base incorporation rules that dictate incorporation of natural nucleotides, where A specifies incorporation of T, and G specifies incorporation of C (and vice versa).
  • Fluorescent DNA Sequencing
  • A major advance in determining DNA sequence information occurred with the introduction of automated DNA sequencing machines (Smith et al., 1986). The automated sequencer is used to separate sequencing reaction products, detect and collect (via computer) the data from the reactions, and analyze the order of the bases to automatically deduce the base sequence of a DNA fragment. Automated sequencers detect extension products containing a fluorescent tag. Sequence read lengths obtained using an automated sequencer are dependent upon a variety of parameters, but typically range between 500 to 1,000 bases (3-18 hours of data collection). At maximum capacity an automated sequencer can collect data from 96 samples in parallel.
  • When dye-labeled terminator chemistry is used to detect the sequencing products, base identity is determined by the color of the fluorescent tag attached to the ddNTP. After the reaction is assembled and processed through the appropriate number of cycles (3-12 hours), the extension products are prepared for loading into a single lane on an automated sequencer (unincorporated, dye-labeled ddNTPs are removed and the reaction is concentrated; 1-2 hours). An advantage of dye-terminator chemistry is that extension products are visualized only if they terminate with a dye-labeled ddNTP; prematurely terminated products are not detected. Thus, reduced background noise typically results with this chemistry.
  • State-of-the-art dye-terminator chemistry uses four energy transfer fluorescent dyes (Rosenblum et al., 1997). These terminators include a fluorescein donor dye (6-FAM) linked to one of four different dichlororhodamine (dRhodamine) acceptor dyes. The d-Rhodamine acceptor dyes associated with the terminators are dichloro[R110], dichloro[R6G], dichloro[TAMRA] or dichloro[ROX], for the G-, A-, T- or C-terminators, respectively. The donor dye (6-FAM) efficiently absorbs energy from the argon ion laser in the automated sequencing machine and transfers that energy to the linked acceptor dye. The linker connecting the donor and acceptor portions of the terminator is optimally spaced to achieve essentially 100% efficient energy transfer. The fluorescence signals emitted from these acceptor dyes exhibit minimal spectral overlap and are collected by an ABI PRISM 377 DNA sequencer using 10 nm virtual filters centered at 540, 570, 595 and 625 nm, for G-, A-, T- or C-terminators, respectively. Thus, energy transfer dye-labeled terminators produce brighter signals and improve spectral resolution. These improvements result in more accurate DNA sequence information.
  • The predominant enzyme used in automated DNA sequencing reactions is a genetically engineered form of DNA polymerase I from Therms aquaticus. This enzyme, AmpliTaq DNA Polymerase, FS, was optimized to more efficiently incorporate ddNTPs and to eliminate the 3′ to 5′ and 5′ to 3′ exonuclease activities. Replacing a naturally occurring phenylalanine at position 667 in T. aquaticus DNA polymerase with a tyrosine reduced the preferential incorporation of a dNTP, relative to a ddNTP (Tabor and Richardson, 1995; Reeve and Fuller, 1995). Thus, a single hydroxyl group within the polymerase is responsible for discrimination between dNTPs and ddNTPs. The 3′ to 5′ exonuclease activity, which enables the polymerase to remove a mis-incorporated base from the newly replicated DNA strand (proofreading activity), was eliminated because it also allows the polymerase to remove an incorporated ddNTP. The 5′ to 3′ exonuclease activity was eliminated because it removes bases from the 5′ end of the reaction products. Since the reaction products are size separated during gel electrophoresis, interpretable sequence data is only obtained if the reaction products share a common endpoint. More specifically, the primer defines the 5′ end of the extension product and the incorporated, color-coded ddNTP defines base identity at the 3′ end of the molecule. Thus, conventional DNA sequencing involves analysis of a population of DNA molecules sharing the same 5′ endpoint, but differing in the location of the ddNTP at the 3′ end of the DNA chain.
  • Genome Sequencing
  • Very often a researcher needs to determine the sequence of a DNA fragment that is larger than the 500-1,000 base average sequencing read length. Not surprisingly, strategies to accomplish this have been developed. These strategies are divided into two major classes, random or directed, and strategy choice is influenced by the size of the fragment to be sequenced.
  • In random or shotgun DNA sequencing, a large DNA fragment (typically one larger than 20,000 base pairs) is broken into smaller fragments that are inserted into a cloning vector. It is assumed that the sum of information contained within these smaller clones is equivalent to that contained within the original DNA fragment. Numerous smaller clones are randomly selected, DNA templates are prepared for sequencing reactions, and primers that will base-pair with the vector DNA sequence bordering the insert are used to begin the sequencing reaction (2-7 days for a 20 kbp insert). Subsequently, the quality of each base call is examined (manually or automatically via software (PHRED, Ewing et al., 1998); 1-10 minutes per sequence reaction), and the sequence of the original DNA fragment is reconstructed by computer assembly of the sequences obtained from the smaller DNA fragments. Based on the time estimates provided, if a shotgun sequencing strategy is used, a 20 kbp insert is expected to be completed in 3-10 days. This strategy was extensively used to determine the sequence of ordered fragments that represent the entire human genome (see the United States Government web site nhgri.nih, the HGP sublink at http://www.nhgri.nih.gov/HGP/). However, this random approach is typically not sufficient to complete sequence determination, since gaps in the sequence often remain after computer assembly. A directed strategy (described below) is usually used to complete the sequence project.
  • A directed or primer-walking sequencing strategy can be used to fill-in gaps remaining after the random phase of large-fragment sequencing, and as an efficient approach for sequencing smaller DNA fragments. This strategy uses DNA primers that anneal to the template at a single site and act as a start site for chain elongation. This approach requires knowledge of some sequence information to design the primer. The sequence obtained from the first reaction is used to design the primer for the next reaction and these steps are repeated until the complete sequence is determined. Thus, a primer-based strategy involves repeated sequencing steps from known into unknown DNA regions, the process minimizes redundancy, and it does not require additional cloning steps. However, this strategy requires the synthesis of a new primer for each round of sequencing.
  • The necessity of designing and synthesizing new primers, coupled with the expense and the time required for their synthesis, has limited the routine application of primer-walking for sequencing large DNA fragments. Researchers have proposed using a library of short primers to eliminate the requirement for custom primer synthesis (Studier, 1989; Siemieniak and Slightom, 1990; Kieleczawa et al., 1992; Kotler et al., 1993; Burbelo and ladarola, 1994; Hardin et al., 1996; Raja et al., 1997; Jones and Hardin, 1998a,b; Ball et al., 1998; Mei and Hardin, 2000; Kraltcheva and Hardin, 2001). The availability of a primer library minimizes primer waste, since each primer is used to prime multiple reactions, and allows immediate access to the next sequencing primer.
  • One of the original goals of the Human Genome Project was to complete sequence determination of the entire human genome by 2005 (see the United States Government website nhgri.nih, the HGP sublink at http://www.nhgri.nih.gov/HGP/). However, the plan is ahead of schedule and a ‘working draft’ of the human genome was published in February 2001 (Venter er al., 2001, “International Human Genome Sequencing Consortium 2001”). Due to technological advances in several disciplines, the completed genome sequence is expected in 2003, two years ahead of schedule. Progress in all aspects involving DNA manipulation (especially manipulation and propagation of large DNA fragments), evolution of faster and better DNA sequencing methods (see the abrg.org at http://www.abrLorg), development of computer hardware and software capable of manipulating and analyzing the data (bioinformatics), and automation of procedures associated with generating and analyzing DNA sequences (engineering) are responsible for this accelerated time frame.
  • Single-Molecule DNA Sequencing
  • Conventional DNA sequencing strategies and methods are reliable, but time, labor, and cost intensive. To address these issues, some researchers are investigating fluorescence-based, single-molecule sequencing methods that use enzymatic degradation, followed by single-dNMP detection and identification. The DNA polymer containing fluorescently-labeled nucleotides is digested by an exonuclease, and the labeled nucleotides are detected and identified by flow cytometry (Davis et al., 1991; Davis et al., 1992; Goodwin et al., 1997; Keller et al., 1996; Sauer et al., 1999; Werner et al., 1999). This method requires that the DNA strand is synthesized to contain the flourescently-labeled base(s). This requirement limits the length of sequence that can be determined, and increases the number of manipulations that must be performed before any sequence data is obtained. A related approach proposes to sequentially separate single (unlabeled) nucleotides from a strand of DNA, confine them in their original order in a solid matrix, and detect the spectroscopic emission of the separated nucleotides to reconstruct DNA sequence information (Ulmer, 1997; Mitsis and Kwagh, 1999; Dapprich, 1999). This is the approach that is being developed by Praelux, Inc., a company with a goal to develop single-molecule DNA sequencing. Theoretically, this latter method should not be as susceptible to length limitations as the former enzymatic degradation method, but it does require numerous manipulations before any sequence information can be obtained.
  • Li-cor, Inc. is developing an enzyme synthesis based strategy for single-molecule sequencing as set forth in PCT application WO 00/36151. The Li-cor method involves multiply modifying each dNTP by attaching a fluorescent tag to the γ-phosphate and a quenching moiety to another site on the dNTP, preferably on the base. The quenching moiety is added to prevent emission from the fluorescent tag attached to an unincorporated dNTP. Upon incorporation the fluorescent tag and quenching moiety are separated, resulting in emission from the tag. The tag (contained on the pyrophosphate) flows away from the polymerase active site, but the modified (quenched) base becomes part of the DNA polymer.
  • Although some single-molecular sequencing systems have been disclosed, many of them anticipate or require base modification. See, e.g., Patent Application Serial Numbers WO 01/16375 A2, WO 01/23610 A2, WO 01/25480, WO 00/06770, WO 99/05315, WO 00/60114, WO 00/36151, WO 00/36512, and WO 00/70073, incorporated herein by reference. Base modifications may distort DNA structure (which normally consists of A-form DNA nearest the enzyme active site; Li et al., 1998a). Since the dNTP and approximately 7 of the 3′-nearest bases in the newly synthesized strand contact internal regions of the polymerase (Li et al., 1998a), the A-form DNA may be important for maximizing minor groove contacts between the enzyme and the DNA. If the DNA structure is affected due to base modification, enzyme fidelity and/or function may be altered. Thus, there is still a need in the art for a fast and efficient enzymatic DNA sequencing system for single molecular DNA sequences.
  • SUMMARY OF THE INVENTION Single-Molecule Sequencing
  • The present invention provides a polymerizing agent modified with at least one molecular or atomic tag located at or near, associated with or covalently bonded to a site on the polymerizing agent, where a detectable property of the tag undergoes a change before, during and/or after monomer incorporation. The monomers can be organic, inorganic or bio-organic monomers such as nucleotides for DNA, RNA, mixed DNA/RNA sequences, amino acids, monosaccharides, synthetic analogs of naturally occurring nucleotides, synthetic analogs of naturally occurring amino acids or synthetic analogs of naturally occurring monosaccharides, synthetic organic or inorganic monomers, or the like.
  • The present invention provides a depolymerizing agent modified with at least one molecular or atomic tag located at or near, associated with or covalently bonded to a site on the depolymerizing agent, where a detectable property of the tag undergoes a change before, during and/or after monomer removal. The polymers can be DNA, RNA, mixed DNA/RNA sequences containing only naturally occurring nucleotides or a mixture of naturally occurring nucleotides and synthetic analogs thereof, polypeptide sequences containing only naturally occurring amino acids or a mixture of naturally occurring amino acids and synthetic analogs thereof, polysaccharide or carbohydrate sequences containing only naturally occurring monosaccharides or a mixture of naturally occurring monosaccharides and synthetic analogs thereof, or polymers containing synthetic organic or inorganic monomers, or the like.
  • The present invention also provides a system that enables detecting a signal corresponding to a detectable property evidencing changes in interactions between a synthesizing/polymerizing agent or a depolymerizing agent (molecule) and its substrates (monomers or depolymerizable polymers) and decoding the signal into monomer order specific information or monomer sequence information, preferably in real-time or near real-time.
  • Single Site Tagged Polymerase
  • The present invention provides a polymerase modified with at least one molecular or atomic tag located at or near, associated with, or covalently bonded to a site on the polymerase, where a detectable property of the tag undergoes a change before, during and/or after monomer incorporation. The monomers can be nucleotides for DNA, RNA or mixed DNA/RNA monomers or synthetic analogs polymerizable by the polymerase.
  • The present invention provides an exonuclease modified with at least one molecular or atomic tag located at or near, associated with, or covalently bonded to a site on the exonuclease, where a detectable property of the tag undergoes a change before, during and/or after monomer release. The polymers can be DNA, RNA or mixed DNA/RNA sequences comprised of naturally occurring monomers or synthetic analogs depolymerizable by the exonuclease.
  • The present invention provides a polymerase modified with at least one molecular or atomic tag located at or near, associated with, or covalently bonded to a site that undergoes a conformational change before, during and/or after monomer incorporation, where the tag has a first detection propensity when the polymerase is in a first conformational state and a second detection propensity when the polymerase is in a second conformational state.
  • The present invention provides a polymerase modified with at least one chromophore located at or near, associated with, or covalently bonded to a site that undergoes a conformational change before, during and/or after monomer incorporation, where an intensity and/or frequency of emitted light of the chromophore has a first value when the polymerase is in a first conformational state and a second value when the polymerase is in a second conformational state.
  • The present invention provides a polymerase modified with at least one fluorescently active molecular tag located at or near, associated with, or covalently bonded to a site that undergoes a conformational change before, during and/or after monomer incorporation, where the tag has a first fluorescence propensity when the polymerase is in a first conformational state and a second fluorescence propensity when the polymerase is in a second conformational state.
  • The present invention provides a polymerase modified with a molecular tag located at or near, associated with, or covalently bonded to a site that undergoes a conformational change before, during and/or after monomer incorporation, where the tag is substantially detectable when the polymerase is in a first conformational state and substantially non-detectable when the polymerase is in a second conformational state or substantially non-detectable when the polymerase is in the first conformational state and substantially detectable when the polymerase is in the second conformational state.
  • The present invention provides a polymerase modified with at least one molecular or atomic tag located at or near, associated with, or covalently bonded to a site that interacts with a tag on the released pyrophosphate group, where the polymerase tag has a first detection propensity before interacting with the tag on the released pyrophosphate group and a second detection propensity when interacting with the tag on the released pyrophosphate group. In a preferred embodiment, this change in detection propensity is cyclical occurring as each pyrophosphate group is released.
  • The present invention provides a polymerase modified with at least one chromophore located at or near, associated with, or covalently bonded to a site that interacts with a tag on the released pyrophosphate group, where an intensity and/or frequency of light emitted by the chromophore has a first value before the chromophore interacts with the tag on the released pyrophosphate and a second value when interacting with the tag on the released pyrophosphate group. In a preferred embodiment, this change in detection propensity is cyclical occurring as each pyrophosphate group is released.
  • The present invention provides a polymerase modified with at least one fluorescently active molecular tag located at or near, associated with, or covalently bonded to a site that interacts with a tag on the released pyrophosphate group, where the polymerase tag changes from a first state prior to release of the pyrophosphate group and a second state as the pyrophosphate group diffuses away from the site of release. In a preferred embodiment, this change in detection propensity is cyclical occurring as each pyrophosphate group is released.
  • The present invention provides a polymerase modified with a molecular tag located at or near, associated with, or covalently bonded to a site that interacts with a tag on the released pyrophosphate group, where the polymerase tag changes from a substantially detectable state prior to pyrophosphate release to a substantially non-detectable state when the polymerase tag interacts with the tag on the pyrophosphate group after group release, or changes from a substantially non-detectable state prior to pyrophosphate release to a substantially detectable state when the polymerase tag interacts with the tag on the pyrophosphate group after group release.
  • Multiple Site Tagged Polymerizing or Depolymerizing Agents
  • The present invention provides a monomer polymerizing agent modified with at least one pair of molecular and/or atomic tags located at or near, associated with, or covalently bonded to sites on the polymerizing agent, where a detectable property of at least one tag of the pair undergoes a change before, during and/or after monomer incorporation or where a detectable property of at least one tag of the pair undergoes a change before, during and/or after monomer incorporation due to a change in inter-tag interaction.
  • The present invention provides a depolymerizing agent modified with at least one pair of molecular and/or atomic tags located at or near, associated with, or covalently bonded to sites on the depolymerizing agent, where a detectable property of at least one tag of the pair undergoes a change before, during and/or after monomer release or where a detectable property of at least one tag of the pair undergoes a change before, during and/or after monomer release due to a change in inter-tag interaction.
  • The present invention provides a monomer polymerizing agent modified with at least one pair of molecular and/or atomic tags located at or near, associated with, or covalently bonded to sites on the polymerizing agent, where a detectable property of at least one tag of the pair has a first value when the polymerizing agent is in a first state and a second value when the polymerizing agent is in a second state, where the polymerizing agent changes from the first state to the second state and back to the first state during a monomer incorporation cycle.
  • The present invention provides a depolymerizing agent modified with at least one pair of molecular and/or atomic tags located at or near, associated with or covalently bonded to sites on the polymerizing agent, where a detectable property of at least one tag of the pair has a first value when the depolymerizing agent is in a first state and a second value when the depolymerizing agent is in a second state, where the depolymerizing agent changes from the first state to the second state and back to the first state during a monomer release cycle.
  • Preferably, the first and second states are different so that a change in the detected signal occurs. However, a no-change result may evidence other properties of the polymerizing media or depolymerizing media.
  • Multiple Site Tagged Polymerase
  • The present invention provides a polymerase modified with at least one pair of molecular tags located at or near, associated with, or covalently bonded to sites at least one of the tags undergoes a change during monomer incorporation, where a detectable property of the pair has a first value when the polymerase is in a first state and a second value when the polymerase is in a second state, where the polymerase changes from the first state to the second state and back to the first state during a monomer incorporation cycle.
  • The present invention provides a polymerase modified with at least one pair of molecular tags located at or near, associated with or covalently bonded to sites at least one of the tags undergoes conformational change during monomer incorporation, where the detectably property of the pair has a first value when the polymerase is in a first conformational state and a second value when the polymerase is in a second conformational state, where the polymerase changes from the first state to the second state and back to the first state during a monomer incorporation cycle.
  • The present invention provides a polymerase modified with at least one pair of molecules or atoms located at or near, associated with or covalently bonded to sites at least one of the tags undergoes conformational change during monomer incorporation, where the pair interact to form a chromophore when the polymerase is in a first conformational state or a second conformational state, where the polymerase changes from the first state to the second state and back to the first state during a monomer incorporation cycle.
  • The present invention provides a polymerase modified with at least one pair of molecular tags located at or near, associated with or covalently bonded to sites at least one of the tags undergoes conformational change during monomer incorporation, where the tags have a first fluorescence propensity when the polymerase is in a first conformational state and a second fluorescence propensity when the polymerase is in a second conformational state, where the polymerase changes from the first state to the second state and back to the first state during a monomer incorporation cycle.
  • The present invention provides a polymerase modified with at least one pair of molecular tags located at or near, associated with or covalently bonded to sites at least one of the tags undergoes conformational change during monomer incorporation, where the pair is substantially active when the polymerase is in a first conformational state and substantially inactive when the polymerase is in a second conformational state or substantially inactive when the polymerase is in the first conformational state and substantially active when the polymerase is in the second conformational state, where the polymerase changes from the first state to the second state and back to the first state during a monomer incorporation cycle.
  • The present invention provides a polymerase modified with at least one pair of molecular tags located at or near, associated with, or covalently bonded to sites at least one of the tags undergoes a change during and/or after pyrophosphate release during the monomer incorporation process, where a detectable property of the pair has a first value when the tag is in a first state prior to pyrophosphate release and a second value when the tag is in a second state during and/or after pyrophosphate release, where the tag changes from its first state to its second state and back to its first state during a monomer incorporation cycle.
  • The present invention provides a polymerase modified with at least one pair of molecular tags located at or near, associated with or covalently bonded to sites at least one of the tags undergoes a change in position due to a conformational change in the polymerase during the pyrophosphate release process, where the detectably property of the pair has a first value when the tag is in its first position and a second value when the tag is in its second position, where the tag changes from its first position to its second position and back to its first position during a release cycle.
  • The present invention provides a polymerase modified with at least one pair of molecules or atoms located at or near, associated with or covalently bonded to sites, where the tags change relative separation due to a conformational change in the polymerase during pyrophosphate release, where the tags interact to form a chromophore having a first emission profile when the tags are a first distance apart and a second profile when the tags are a second distance apart, where the separation distance changes from its first state to its second state and back to its first state during a pyrophosphate release cycle.
  • The present invention provides a polymerase modified with at least one pair of molecular tags located at or near, associated with or covalently bonded to sites, where the tags change relative separation due to a conformational change in the polymerase during pyrophosphate release, where the tags have a first fluorescence propensity when the polymerase is in a first conformational state and a second fluorescence propensity when the polymerase is in a second conformational state, where the propensity changes from its the first value to its second value and back again during a pyrophosphate release cycle.
  • The present invention provides a polymerase modified with at least one pair of molecular tags located at or near, associated with or covalently bonded to sites, where the tags change relative separation due to a conformational change in the polymerase during pyrophosphate release, where the pair is substantially fluorescently active when the tags have a first separation and substantially fluorescently inactive when the tags have a second separation or substantially fluorescently inactive when the tags have the first separation and substantially fluorescently active when the tags have the second separation, where the fluorescence activity undergoes one cycle during a pyrophosphate release cycle.
  • It should be recognized that when a property changes from a first state to a second state and back again, then the property undergoes a cycle. Preferably, the first and second states are different so that a change in the detected signal occurs. However, a no-change result may evidence other properties of the polymerizing medium or depolymerizing medium.
  • Methods Using Tagged Polymerizing Agent
  • The present invention provides a method for determining when a monomer is incorporated into a growing molecular chain comprising the steps of monitoring a detectable property of an atomic or molecular tag, where the tag is located at or near, associated with, or covalently bonded to a site on a polymerizing agent, where the detectable property of the tag undergoes a change before, during and/or after monomer incorporation.
  • The present invention provides a method for determining when a monomer is incorporated into a growing molecular chain comprising the steps of monitoring a detectable property of an atomic or molecular tag, where the tag is located at or near, associated with, or covalently bonded to a site on a polymerizing agent, where the detectable property has a first value when the agent is in a first state and a second value when the agent is in a second state, where the agent changes from the first state to the second state and back to the first state during a monomer incorporation cycle.
  • Preferably, the first and second states are different so that a change in the detected signal occurs. However, a no-change result may evidence other properties of the polymerizing medium.
  • Methods Using Tagged Polymerase
  • The present invention provides a method for determining when or whether a monomer is incorporated into a growing molecular chain comprising the steps of monitoring a detectable property of a tag, where the tag is located at or near, associated with, or covalently bonded to a site on a polymerase, where the site undergoes a change during monomer incorporation and where the detectable property has a first value when the polymerase is in a first state and a second value when the polymerase is in a second state, where the values signify that the site has undergone the change and where the polymerase changes from the first state to the second state and back to the first state during a monomer incorporation cycle.
  • The present invention provides a method for determining when or whether a monomer is incorporated into a growing molecular chain comprising the steps of monitoring a detectable property of a tag, where the tag is located at or near, associated with, or covalently bonded to a site on a polymerase, where the site undergoes a conformational change during monomer incorporation and where the detectable property has a first value when the polymerase is in a first conformational state and a second value when the polymerase is in a second conformational state, where the values signify that the site has undergone the change and where the polymerase changes from the first state to the second state and back to the first state during a monomer incorporation cycle.
  • The present invention provides a method for determining when or whether a monomer is incorporated into a growing molecular chain comprising the steps of exposing a tagged polymerase to light, monitoring an intensity and/or frequency of fluorescent light emitted by the tagged polymerase, where the tagged polymerase comprises a polymerase including a tag located at or near, associated with, or covalently bonded to a site that undergoes conformational change during monomer incorporation and where the tag emits fluorescent light at a first intensity and/or frequency when the polymerase is in a first conformational state and a second intensity and/or frequency when the polymerase is in a second conformational state, where the change in intensities and/or frequencies signifies that the site has undergone the change and where the polymerase changes from the first state to the second state and back to the first state during a monomer incorporation cycle.
  • The present invention also provides the above methods using a plurality of tagged polymerases permitting parallel and/or massively parallel sequencing simultaneously. Such parallelism can be used to ensure confidence. Such parallelism can also be used to quickly detect the degree of homology in DNA sequences for a given gene across species or to quickly screen patient DNA for specific genetic traits or to quickly screen DNA sequences for polymorphisms.
  • The present invention also provides a method for determining if or when a monomer is incorporated into a growing DNA chain associated with a polymerase, where a tag is located on the polymerase so that as the pyrophosphate group is released after base incorporation and prior to its diffusion away from the polymerase, the polymerase tag interacts with the tag on the pyrophosphate causing a change in a detectable property of one of the tags or a detectable property associated with both tags in the case of a fluorescent pair.
  • Preferably, the first and second states are different so that a change in the detected signal occurs. However, a no-change result may evidence other properties of the polymerizing media.
  • Apparatuses Using Tagged Polymerizing Agent
  • The present invention provides a single-molecule sequencing apparatus comprising a substrate having deposited thereon at least one tagged polymerizing agent. The tagged polymerizing agent can be placed on the surface of the substrate in an appropriate polymerizing medium or the polymerizing agent can be confined in a region, area, well, groove, channel or other similar structure on the substrate. The substrate can also include a monomer region, area, well, groove, channel, reservoir or other similar structure on the substrate connected to the polymerizing agent confinement structure by at least one connecting structure capable of supporting molecular transport of monomer to the polymerizing agent such as a channel, groove, or the like. Alternatively, the substrate can include structures containing each monomer, where each structure is connected to the polymerizing agent confinement structure by a connecting structure capable of supporting molecular transport of monomer to the polymerizing agent. The substrate can also be subdivided into a plurality of polymerizing agent confinement structures, where each structure is connected to a monomer reservoir. Alternatively, each polymerizing agent confinement structure can have its own monomer reservoir or sufficient monomer reservoirs so that each reservoir contains a specific monomer.
  • The present invention also provides a single-molecule sequencing apparatus comprising a substrate having at least one tagged polymerizing agent attached to the surface of the substrate by a molecular tether or linking group, where one end of the tether or linking group is bonded to a site on the surface of the substrate and the other end is bonded to a site on the polymerizing agent or bonded to a site on a molecule strongly associated with the polymerizing agent. In this context, the term “bonded to” means that chemical and/or physical interactions sufficient to maintain the polymerizing agent within a given region of the substrate under normal polymerizing conditions. The chemical and/or physical interactions include, without limitation, covalent bonding, ionic bonding, hydrogen bonding, apolar bonding, attractive electrostatic interactions, dipole interactions, or any other electrical or quantum mechanical interaction sufficient in toto to maintain the polymerizing agent in a desired region of the substrate. The substrate having tethered tagged polymerizing agent attached thereon can be placed in container containing an appropriate polymerizing medium. Alternatively, the tagged polymerizing agent can be tethered or anchored on or within a region, area, well, groove, channel or other similar structure on the substrate capable of being filled with an appropriate polymerizing medium. The substrate can also include a monomer region, area, well, groove, channel or other similar structure on the substrate connected to the polymerizing agent structure by at least one a connecting structure capable of supporting molecular transports of monomer to the polymerizing agent. Alternatively, the substrate can include structures containing each monomer, where each structure is connected to the polymerizing agent structure by a connecting structure capable of supporting molecular transports of monomer to the polymerizing agent. The substrate can also be subdivided into a plurality of polymerizing agent structures each having at least one tethered polymerizing agent, where each structure is connected to a monomer reservoir. Alternatively, each polymerizing agent structure can have its own monomer reservoir or sufficient monomer reservoirs, one reservoir of each specific monomer.
  • The monomers for use in these apparatus including, without limitation, dNTPs, tagged dNTPs, ddNTPs, tagged ddNTPs, amino acids, tagged amino acids, mono saccharides, tagged monosaccharides or appropriate mixtures or combinations thereof depending on the type of polymer being sequenced.
  • Apparatus Using Tagged Polymerase
  • The present invention provides a single-molecule sequencing apparatus comprising a substrate having deposited thereon at least one tagged polymerase. The tagged polymerase can be placed on the surface of the substrate in an appropriate polymerizing medium or the polymerase can be confined in a region, area, well, groove, channel or other similar structure on the substrate capable of being filled with an appropriate polymerizing medium. The substrate can also include a monomer region, area, well, groove, channel or other similar structure on the substrate connected to the polymerase confinement structure by at least one connecting structure capable of supporting molecular transports of monomer to the polymerase. Alternatively, the substrate can include structures containing each monomer, where each structure is connected to the polymerase confinement structure by a connecting structure capable of supporting molecular transports of the monomer to the polymerase in the polymerase confinement structures. The substrate can also be subdivided into a plurality of polymerase confinement structures, where each structure is connected to a monomer reservoir. Alternatively, each polymerase confinement structure can have its own monomer reservoir or four reservoirs, each reservoir containing a specific monomer.
  • The present invention also provides a single-molecule sequencing apparatus comprising a substrate having at least one tagged polymerase attached to the surface of the substrate by a molecular tether or linking group, where one end of the tether or linking group is bonded to a site on the surface of the substrate and the other end is bonded (either directly or indirectly) to a site on the polymerase or bonded to a site on a molecule strongly associated with the polymerase. In this context, the term “bonded to” means that chemical and/or physical interactions sufficient to maintain the polymerase within a given region of the substrate under normal polymerizing conditions. The chemical and/or physical interactions include, without limitation, covalent bonding, ionic bonding, hydrogen bonding, apolar bonding, attractive electrostatic interactions, dipole interactions, or any other electrical or quantum mechanical interaction sufficient in toto to maintain the polymerase in its desired region. The substrate having tethered tagged polymerizing agent attached thereon can be placed in container containing an appropriate polymerizing medium. Alternatively, the tagged polymerizing agent can be tethered or anchored on or within a region, area, well, groove, channel or other similar structure on the substrate capable of being filled with an appropriate polymerizing medium. The substrate can also include a monomer region, area, well, groove, channel or other similar structure on the substrate connected to the polymerase structure by at least one channel. Alternatively, the substrate can include structures containing each monomer, where each structure is connected to the polymerase structure by a connecting structure that supports molecular transports of the monomer to the polymerase in the polymerase confinement structures. The substrate can also be subdivided into a plurality of polymerase structures each having at least one tethered polymerase, where each structure is connected to a monomer reservoir. Alternatively, each polymerase structure can have its own monomer reservoir or four reservoirs, each reservoir containing a specific monomer.
  • The monomers for use in these apparatus including, without limitation, dNTPs, tagged dNTPs, ddNTPs, tagged ddNTPs, or mixtures or combinations thereof.
  • Methods Using the Single-Molecule Sequencing Apparatuses
  • The present invention provides a method for single-molecule sequencing comprising the step of supplying a plurality of monomers to a tagged polymerizing agent confined on or tethered to a substrate and monitoring a detectable property of the tag over time. The method can also include a step of relating changes in the detectable property to the occurrence (timing) of monomer addition and/or to the identity of each incorporated monomer and/or to the near simultaneous determination of the sequence of incorporated monomers.
  • The present invention provides a method for single-molecule sequencing comprising the step of supplying a plurality of monomers to a tagged polymerizing agent confined on or tethered to a substrate, exposing the tagged polymerizing agent to light either continuously or periodically and measuring an intensity and/or frequency of fluorescent light emitted by the tag over time. The method can further comprise relating the changes in the measured intensity and/or frequency of emitted fluorescent light from the tag over time to the occurrence (timing) of monomer addition and/or to the identity of each incorporated monomer and/or to the near simultaneous determination of the sequence of the incorporated monomers.
  • The present invention provides a method for single-molecule sequencing comprising the step of supplying a plurality of monomers to a tagged polymerase confined on or tethered to a substrate and monitoring a detectable property of the tag over time. The method can also include a step of relating changes in the detectable property over time to the occurrence (timing) of monomer addition and/or to the identity of each incorporated monomer and/or to the near simultaneous determination of the sequence of the incorporated monomers.
  • The present invention provides a method for single-molecule sequencing comprising the step of supplying a plurality of monomers to a tagged polymerase confined on a substrate, exposing the tagged polymerase to light continuously or periodically and measuring an intensity and/or frequency of fluorescent light emitted by the tagged polymerase over time. The method can further comprise relating changes in the measured intensity and/or frequency of emitted fluorescent light from the tag over time to the occurrence (timing) of monomer addition and/or to the identity of each incorporated monomer and/or to the near simultaneous determination of the sequence of the incorporated monomers.
  • Cooperatively Tagged Systems
  • The present invention provides cooperatively tagged polymerizing agents and tagged monomers, where a detectable property of at least one of the tags changes when the tags interact before, during and/or after monomer insertion. In one preferred embodiment, the tag on the polymerase is positioned such that the tags interact before, during and/or after each monomer insertion. In the of case tags that are released from the monomers after monomer insert such as of β and/or γ phosphate tagged dNTPs, i.e., the tags reside on the β and/or γ phosphate groups, the tag on the polymerizing agent can be designed to interact with the tag on the monomer only after the tag is released from the polymerizing agent after monomer insertion. Tag placement within a polymerizing agent can be optimized to enhance interaction between the polymerase and dNTP tags by attaching the polymerase tag to sites on the polymerase that move during an incorporation event changing the relative separation of the two tags or optimized to enhance interaction between the polymerase tag and the tag on the pyrophosphate as it is release during base incorporation and prior to its diffusion away from the polymerizing agent.
  • The present invention provides cooperatively tagged polymerizing agents and tagged monomers, where a detectable property of at least one of the tags changes when the tags are within a distance sufficient to cause a measurable change in the detectable property. If the detectable property is fluorescence induced in one tag by energy transfer to the other tag or due to one tag quenching the fluorescence of the other tag or causing a measurable change in the fluorescence intensity and/or frequency, the measurable change is caused by bringing the tags into close proximity to each other, i.e., decrease the distance separating the tags. Generally, the distance needed to cause a measurable change in the detectable property is within (less than or equal to) about 100 Å, preferably within about 50 Å, particularly within about 25 Å, especially within about 15 Å and most preferably within about 10 Å. Of course, one skilled in the art will recognize that a distance sufficient to cause a measurable change in a detectable property of a tag will depend on many parameters including the location of the tag, the nature of the tag, the solvent system, external fields, excitation source intensity and frequency band width, temperature, pressure, etc.
  • The present invention provides a tagged polymerizing agent and tagged monomer precursor(s), where an intensity and/or frequency of fluorescence light emitted by at least one tag changes when the tags interact before, during and/or after monomer insertion.
  • The present invention provides cooperatively tagged depolymerizing agents and tagged depolymerizable polymer, where a detectable property of at least one of the tags changes when the tags interact before, during and/or after monomer release. The tag on the depolymerizing agent can be designed so that the tags interact before, during and/or after each monomer release.
  • The present invention provides cooperatively tagged depolymerizing agents and tagged polymers, where a detectable property of at least one of the tags changes when the tags are within a distance sufficient to cause a change in measurable change in the detectable property. If the detectable property is fluorescence induced in one tag by energy transfer to the other tag or due to one tag quenching the fluorescence of the other tag or causing a measurable change in the fluorescence intensity and/or frequency, the measurable change is caused by bringing two tags into close proximity to each other, i.e., decrease the distance separating the tags. Generally, the distance needed to cause a measurable change in the detectable property is within (less than or equal to) about 100 Å preferably within about 50 Å, particularly within about 25 Å, especially within about 15 Å and most preferably within about 10 Å. Of course, one skilled in the art will recognize that a distance sufficient to cause a measurable change in a detectable property of a tag will depend on many parameters including the location of the tag, the nature of the tag, the solvent system, external fields, excitation source intensity and frequency band width, temperature, pressure, etc.
  • The present invention provides a tagged depolymerizing agents and a tagged polymer, where an intensity and/or frequency of fluorescence light emitted by at least one tag changes when the tags interact before, during and/or after monomer release.
  • Cooperatively Tagged Systems Using a Polymerase
  • The present invention provides cooperatively tagged polymerase and tagged monomers, where a detectable property of at least one of the tags changes when the tags interact before, during and/or after monomer insertion. The tag on the polymerase can be designed so that the tags interact before, during and/or after each monomer insertion. In the of case tags that are released from the monomers after monomer insert such as of β and/or γ phosphate tagged dNTPs, i.e., the tags reside on the β and/or γ phosphate groups, the tag on the polymerizing agent can be designed to interact with the tag on the monomer only after the tag is released from the polymerizing agent after monomer insertion. In the first case, the polymerase tag must be located on a site of the polymerase which allows the polymerase tag to interact with the monomer tag during the monomer insertion process—initial binding and bonding into the growing polymer. While in the second case, the polymerase tag must be located on a site of the polymerase which allows the polymerase tag to interact with the monomer tag now on the released pyrophosphate prior to its diffusion away from the polymerase and into the polymerizing medium.
  • The present invention provides cooperatively tagged polymerase and tagged monomers, where a detectable property of at least one of the tags changes when the tags are within a distance sufficient or in close proximity to cause a measurable change in the detectable property. If the detectable property is fluorescence induced in one tag by energy transfer to the other tag or due to one tag quenching the fluorescence of the other tag or causing a measurable change in the fluorescence intensity and/or frequency, the measurable change is caused by bringing two tags into close proximity to each other, i.e., decrease the distance separating the tags. Generally, the distance or close proximity is a distance between about 100 Å and about 10 Å. Alternatively, the distance is less than or equal to about 100 Å, preferably less than or equal to about 50 Å, particularly less than or equal to about 25 Å, especially less than or equal to about 15 Å and most preferably less than or equal to about 10 Å. Of course, one skilled in the art will recognize that a distance sufficient to cause a measurable change in a detectable property of a tag will depend on many parameters including the location of the tags, the nature of the tags, the solvent system (polymerizing medium), external fields, excitation source intensity and frequency band width, temperature, pressure, etc.
  • The present invention provides a tagged polymerase and tagged monomer precursors, where the tags form a fluorescently active pair such as a donor-acceptor pair and an intensity and/or frequency of fluorescence light emitted by at least one tag (generally the acceptor tag in donor-acceptor pairs) changes when the tags interact.
  • The present invention provides a tagged polymerase and a tagged monomer precursors, where the tags form a fluorescently active pair such as a donor-acceptor pair and an intensity and/or frequency of fluorescence light emitted by at least one tag (generally the acceptor tag in donor-acceptor pairs) changes when the tags are a distance sufficient or in close proximity to change either the intensity and/or frequency of the fluorescent light. Generally, the distance or close proximity is a distance between about 100 Å and about 10 Å. Alternatively, the distance is less than or equal to about 100 Å, preferably less than or equal to about 50 Å, particularly less than or equal to about 25 Å, especially less than or equal to about 15 Å and most preferably less than or equal to about 10 Å. Of course, one skilled in the art will recognize that a distance sufficient to cause a measurable change in a detectable property of a tag will depend on many parameters including the location of the tag, the nature of the tag, the solvent system, external fields, excitation source intensity and frequency band width, temperature, pressure, etc.
  • The present invention provides a single-molecule sequencing apparatus comprising a container having at least one tagged polymerase confined on or tethered to an interior surface thereof and having a solution containing a plurality of tagged monomers in contact with the interior surface.
  • Molecular Data Stream Reading Methods and Apparatus
  • The present invention provides a method for single-molecule sequencing comprising the step of supplying a plurality of tagged monomers to a tagged polymerase confined on an interior surface of a container, exposing the tagged polymerase to light and measuring an intensity and/or frequency of fluorescent light emitted by the tagged polymerase during each successive monomer addition or insertion into a growing polymer chain. The method can further comprise relating the measured intensity and/or frequency of emitted fluorescent light to incorporation events and/or to the identification of each inserted or added monomer resulting in a near real-time or real-time readout of the sequence of the a growing nucleic acid sequence—DNA sequence, RNA sequence or mixed DNA/RNA sequences.
  • The present invention also provides a system for retrieving stored information comprising a molecule having a sequence of known elements representing a data stream, a single-molecule sequencer comprising a polymerase having at least one tag associated therewith, an excitation source adapted to excite at least one tag on the polymerase, and a detector adapted to detect a response from the excited tag on the polymerase, where the response from the at least one tag changes during polymerization of a complementary sequence of elements and the change in response represents a content of the data stream.
  • The present invention also provides a system for determining sequence information from a single-molecule comprising a molecule having a sequence of known elements, a single-molecule sequencer comprising a polymerase having at least one tag associated therewith, a excitation source adapted to excite at least one tag on the polymerase, and a detector adapted to detect a response from the excited tag on the polymerase, where the response from at least one tag changes during polymerization of a complementary sequence of elements representing the element sequence of the molecule.
  • The present invention also provides a system for determining sequence information from a single-molecule comprising a molecule having a sequence of known elements, a single-molecule sequencer comprising a polymerase having at least one fluorescent tag associated therewith, an excitation light source adapted to excite at least one fluorescent tag on the polymerase and/or monomer and a fluorescent light detector adapted to detect at least an intensity of emitted fluorescent light from at least one fluorescent tag on the polymerase and/or monomer, where the signal intensity changes each time a new nucleotide or nucleotide analog is polymerized into a complementary sequence and either the duration of the emission or lack of emission or the wavelength range of the emitted light evidences the particular nucleotide or nucleotide analog polymerized into the sequence so that at the completion of the sequencing the data stream is retrieved.
  • The present invention also provides a system for storing and retrieving data comprising a sequence of nucleotides or nucleotide analogs representing a given data stream; a single-molecule sequencer comprising a polymerase having at least one fluorescent tag covalently attached thereto; an excitation light source adapted to excite the at least one fluorescent tag on the polymerase and/or monomer; and a fluorescent light detector adapted to detect emitted fluorescent light from at least one fluorescent tag on the polymerase and/or monomer, where at least one fluorescent tag emits or fails to emit fluorescent light each time a new nucleotide or nucleotide analog is polymerized into a complementary sequence and either the duration of the emission or lack of emission or the wavelength range of the emitted light evidences the particular nucleotide or nucleotide analog polymerized into the sequence so that at the completion of the sequencing the data stream is retrieved.
  • The term monomer as used herein means any compound that can be incorporated into a growing molecular chain by a given polymerase. Such monomers include, without limitations, naturally occurring nucleotides (e.g., ATP, GTP, TTP, UTP, CTP, dATP, dGTP, dTTP, dUTP, dCTP, synthetic analogs), precursors for each nucleotide, non-naturally occurring nucleotides and their precursors or any other molecule that can be incorporated into a growing polymer chain by a given polymerase. Additionally, amino acids (natural or synthetic) for protein or protein analog synthesis, mono saccharides for carbohydrate synthesis or other monomeric syntheses.
  • The term polymerase as used herein means any molecule or molecular assembly that can polymerize a set of monomers into a polymer having a predetermined sequence of the monomers, including, without limitation, naturally occurring polymerases or reverse transcriptases, mutated naturally occurring polymerases or reverse transcriptases, where the mutation involves the replacement of one or more or many amino acids with other amino acids, the insertion or deletion of one or more or many amino acids from the polymerases or reverse transcriptases, or the conjugation of parts of one or more polymerases or reverse transcriptases, non-naturally occurring polymerases or reverse transcriptases. The term polymerase also embraces synthetic molecules or molecular assembly that can polymerize a polymer having a pre-determined sequence of monomers, or any other molecule or molecular assembly that may have additional sequences that facilitate purification and/or immobilization and/or molecular interaction of the tags, and that can polymerize a polymer having a pre-determined or specified or templated sequence of monomers.
  • Single Site Tagged Polymerizing or Depolymerizing Agents
  • The present invention provides a composition comprising a polymerizing agent including at least one molecular and/or atomic tag located at or near, associated with or covalently bonded to a site on the agent, where a detectable property of the tag undergoes a change before, during and/or after monomer incorporation.
  • The present invention provides a composition comprising a polymerizing agent including at least one molecular and/or atomic tag located at or near, associated with or covalently bonded to a site on the agent, where a detectable property has a first value when the polymerase is in a first state and a second value when the polymerase is in a second state during monomer incorporation.
  • The present invention provides a composition comprising a depolymerizing agent including at least one molecular and/or atomic tag located at or near, associated with or covalently bonded to a site on the agent, where a detectable property of the tag undergoes a change before, during and/or after monomer removal.
  • The present invention provides a composition comprising a polymerizing agent including at least one molecular and/or atomic tag located at or near, associated with or covalently bonded to a site on the agent, where a detectable property has a first value when the polymerase is in a first state and a second value when the polymerase is in a second state during monomer removal.
  • Single Site Tagged Polymerase
  • The present invention provides a composition comprising a polymerase including at least one molecular and/or atomic tag located at or near, associated with or covalently bonded to a site on the polymerase, where a detectable property of the tag undergoes a change before, during and/or after monomer incorporation.
  • The present invention provides a composition comprising a polymerase including at least one molecular and/or atomic tag located at or near, associated with or covalently bonded to a site on the polymerase, where a detectable property has a first value when the polymerase is in a first state and a second value when the polymerase is in a second state during monomer incorporation.
  • The present invention provides a composition comprising an exonuclease including at least one molecular and/or atomic tag located at or near, associated with or covalently bonded to a site on the agent, where a detectable property of the tag undergoes a change before, during and/or after monomer removal.
  • The present invention provides a composition comprising an exonuclease including at least one molecular and/or atomic tag located at or near, associated with or covalently bonded to a site on the agent, where a detectable property has a first value when the polymerase is in a first state and a second value when the polymerase is in a second state during monomer removal.
  • The present invention provides a composition comprising an enzyme modified to produce a detectable response prior to, during and/or after interaction with an appropriately modified monomer, where the monomers are nucleotides, nucleotide analogs, amino acids, amino acid analogs, monosaccharides, monosaccharide analogs or mixtures or combinations thereof.
  • The present invention provides a composition comprising a polymerase including at least one molecular tag located at or near, associated with or covalently bonded to a site that undergoes conformational change during monomer incorporation, where the tag has a first detection propensity when the polymerase is in a first conformational state and a second detection propensity when the polymerase is in a second conformational state.
  • The present invention provides a composition comprising a polymerase including at least one chromophore located at or near, associated with or covalently bonded to a site that undergoes conformational change during monomer incorporation, where an intensity and/or frequency of emitted light of the tag has a first value when the polymerase is in a first conformational state and a second value when the polymerase is in a second conformational state.
  • The present invention provides a composition comprising a polymerase including at least one molecular tag located at or near, associated with or covalently bonded to a site that undergoes conformational change during monomer incorporation, where the tag has a first fluorescence propensity when the polymerase is in a first conformational state and a second fluorescence propensity when the polymerase is in a second conformational state.
  • The present invention provides a composition comprising a polymerase including a molecular tag located at or near, associated with or covalently bonded to a site that undergoes conformational change during monomer incorporation, where the tag is substantially active when the polymerase is in a first conformational state and substantially inactive when the polymerase is in a second conformational state or substantially inactive when the polymerase is in the first conformational state and substantially active when the polymerase is in the second conformational state.
  • Multiple Site Tagged Polymerizing and Depolymerizing Agents
  • The present invention provides a composition comprising a polymerizing agent including at least one pair of molecular tags located at or near, associated with or covalently bonded to a site of the agent, where a detectable property of at least one of the tags undergoes a change before, during and/or after monomer incorporation.
  • The present invention provides a composition comprising a polymerizing agent including at least one pair of molecular tags located at or near, associated with or covalently bonded to a site of the agent, where a detectable property has a first value when the polymerase is in a first state and a second value when the polymerase is in a second state during monomer incorporation.
  • The present invention provides a composition comprising a depolymerizing agent including at least one pair of molecular tags located at or near, associated with or covalently bonded to a site of the agent, where a detectable property of at least one of the tags undergoes a change before, during and/or after monomer removal.
  • The present invention provides a composition comprising a depolymerizing agent including at least one pair of molecular tags located at or near, associated with or covalently bonded to a site of the agent, where a detectable property has a first value when the polymerase is in a first state and a second value when the polymerase is in a second state during monomer removal.
  • Multiple Site Tagged Polymerase
  • The present invention provides a composition comprising a polymerase including at least one pair of molecular tags located at or near, associated with or covalently bonded to a site of the polymerase, where a detectable property of at least one of the tags undergoes a change before, during and/or after monomer incorporation.
  • The present invention provides a composition comprising a polymerase including at least one pair of molecular tags located at or near, associated with or covalently bonded to a site of the polymerase, where a detectable property has a first value when the polymerase is in a first state and a second value when the polymerase is in a second state during monomer incorporation.
  • The present invention provides a composition comprising an exonuclease including at least one pair of molecular tags located at or near, associated with or covalently bonded to a site of the polymerase, where a detectable property of at least one of the tags undergoes a change before, during and/or after monomer removal.
  • The present invention provides a composition comprising an exonuclease including at least one pair of molecular tags located at or near, associated with or covalently bonded to a site of the polymerase, where a detectable property has a first value when the polymerase is in a first state and a second value when the polymerase is in a second state during monomer removal.
  • The present invention provides a composition comprising a polymerase including at least one pair of molecular tags located at or near, associated with or covalently bonded to a site that undergoes conformational change during monomer incorporation, where the detectable property of the pair has a first value when the polymerase is in a first conformational state and a second value when the polymerase is in a second conformational state.
  • The present invention provides a composition comprising a polymerase including at least one pair of molecules or atoms located at or near, associated with or covalently bonded to a site that undergoes conformational change during monomer incorporation, where the pair interact to form a chromophore when the polymerase is in a first conformational state or a second conformational state.
  • The present invention provides a composition comprising a polymerase including at least one pair of molecular tags located at or near, associated with or covalently bonded to a site that undergoes conformational change during monomer incorporation, where the tags have a first fluorescence propensity when the polymerase is in a first conformational state and a second fluorescence propensity when the polymerase is in a second conformational state.
  • The present invention provides a composition comprising a polymerase including at least one pair of molecular tags located at or near, associated with or covalently bonded to a site that undergoes conformational change during monomer incorporation, where the pair is substantially active when the polymerase is in a first conformational state and substantially inactive when the polymerase is in a second conformational state or substantially inactive when the polymerase is in the first conformational state and substantially active when the polymerase is in the second conformational state.
  • Methods Using Tagged Polymerase
  • The present invention provides a method for determining when a monomer is incorporated into a growing molecular chain comprising the steps of monitoring a detectable property of a tag, where the tag is located at or near, associated with or covalently bonded to a site on a polymerase or associated with or covalently bonded to a site on the monomer, where the site undergoes a change during monomer incorporation and where the detectable property has a first value when the polymerase is in a first state and a second value when the polymerase is in a second state and cycles from the first value to the second value during each monomer addition.
  • The present invention provides a method for determining when a monomer is incorporated into a growing molecular chain comprising the steps of monitoring a detectable property of a tag, where the tag is located at or near, associated with or covalently bonded to a site on a polymerase or associated with or covalently bonded to a site on the monomer, where the site undergoes a conformational change during monomer incorporation and where the detectable property has a first value when the polymerase is in a first conformational state and a second value when the polymerase is in a second conformational state and cycles from the first value to the second value during each monomer addition.
  • The present invention provides a method for determining when a monomer is incorporated into a growing molecular chain comprising the steps of exposing a tagged polymerase to light, monitoring an intensity and/or frequency of fluorescent light emitted by the tagged polymerase and/or monomer, where the tagged polymerase comprises a polymerase including a tag located at or near, associated with or covalently bonded to a site that undergoes conformational change during monomer incorporation or associated with or covalently bonded to a site on the monomer and where the tag emits fluorescent light at a first intensity and/or frequency when the polymerase is in a first conformational state and a second intensity and/or frequency when the polymerase is in a second conformational state and cycles from the first value to the second value during each monomer addition.
  • Single-molecule Sequencing Apparatus Using Tagged Polymerase
  • The present invention provides a composition comprising a single-molecule sequencing apparatus comprising a substrate having a chamber or chip surface in which at least one tagged polymerase is confined therein and a plurality of chambers, each of which includes a specific monomer and a plurality of channels interconnecting the chambers, where each replication complex is sufficiently distant to enable data collection from each complex individually.
  • The present invention provides a method for single-molecule sequencing comprising the steps of supplying a plurality of monomers to a tagged polymerase confined on a substrate, exposing the tagged polymerase to light and measuring an intensity and/or frequency of fluorescent light emitted by the tagged polymerase. The method can further comprise the step of relating the measured intensity and/or frequency of emitted fluorescent light to incorporation of a specific monomer into a growing DNA chain.
  • Cooperatively Tagged Monomers and Tagged Polymerizing Agent
  • The present invention provides a composition comprising a cooperatively tagged polymerizing agent and tagged monomers, where a detectable property of at least one of the tags changes when the tags interact.
  • The present invention provides a composition comprising a cooperatively tagged depolymerizing agent and tagged depolymerizable monomers, where a detectable property of at least one of the tags changes when the tags interact.
  • Cooperatively Tagged Monomers and Tagged Polymerase
  • The present invention provides a composition comprising a cooperatively tagged polymerase and tagged monomers, where a detectable property of at least one of the tags changes when the tags interact.
  • The present invention provides a composition comprising a cooperatively tagged polymerase and tagged monomers, where a detectable property of at least one of the tags changes when the tag are within a distance sufficient to cause a change in the intensity and/or frequency of emitted fluorescent light.
  • The present invention provides a composition comprising a tagged polymerase and tagged monomer precursors, where an intensity and/or frequency of fluorescence light emitted by at least one tag changes when the tags interact.
  • The present invention provides a composition comprising a tagged polymerase and a tagged monomer precursors, where an intensity and/or frequency of fluorescence light emitted by at least one tag changes when the tags are within a distance sufficient to cause a change in the intensity and/or frequency of emitted fluorescent light.
  • The present invention provides a single-molecule sequencing apparatus comprising a container having at least one tagged polymerase confined on an interior surface thereof and having a solution containing a plurality of tagged monomers in contact with the interior surface or a subset of tagged monomers and a subset of untagged monomers which together provide all monomers precursor for polymerization.
  • The present invention provides a method for single-molecule sequencing comprising the steps of supplying a plurality of tagged monomers to a tagged polymerase confined on an interior surface of a container, exposing the tagged polymerase to light and measuring an intensity and/or frequency of fluorescent light emitted by the tagged polymerase. The method can further comprise relating the measured intensity and/or frequency of emitted fluorescent light to incorporation of a specific monomer into a growing DNA chain.
  • The present invention provides a system for retrieving stored information comprising: (a) a molecule having a sequence of elements representing a data stream; (b) a single-molecule sequencer comprising a polymerase having at least one tag associated therewith; (c) an excitation source adapted to excite the at least one tag on the polymerase; and (d) a detector adapted to detect a response from the tag on the polymerase or on the monomers; where the response from at least one tag changes during polymerization of a complementary sequence of elements and the change in response represents a data stream content.
  • The present invention provides a system for determining sequence information from a single-molecule comprising: (a) a molecule having a sequence of elements; (b) a single-molecule sequencer comprising a polymerase having at least one tag associated therewith; (c) an excitation source adapted to excite at least one tag on the polymerase or on the monomers; and (d) a detector adapted to detect a response from the tag on the polymerase; where the response from at least one tag changes during polymerization of a complementary sequence of elements representing the element sequence of the molecule.
  • The present invention provides a system for determining sequence information from an individual molecule comprising: (a) a molecule having a sequence of elements; (b) a single-molecule sequencer comprising a polymerase having at least one fluorescent tag associated therewith; (c) an excitation light source adapted to excite the at least one fluorescent tag on the polymerase or on the monomers; and (d) a fluorescent light detector adapted to detect at least an intensity of emitted fluorescent light from the at least one fluorescent tag on the polymerase; where the intensity change of at least one fluorescent tag emits or fails to emit fluorescent light each time a new nucleotide or nucleotide analog is polymerized into a complementary sequence and either the duration of the emission or lack of emission or the wavelength range of the emitted light evidences the particular nucleotide or nucleotide analog polymerized into the sequence so that at the completion of the sequencing the data stream is retrieved.
  • The present invention provides a system for storing and retrieving data comprising: (a) a sequence of nucleotides or nucleotide analogs representing a given data stream; (b) a single-molecule sequencer comprising a polymerase having at least one fluorescent tag covalently attached thereto; (c) an excitation light source adapted to excite at least one fluorescent tag on the polymerase; and (d) a fluorescent light detector adapted to detect emitted fluorescent light from at least one fluorescent tag on the polymerase; where at least one fluorescent tag emits or fails to emit fluorescent light each time a new nucleotide or nucleotide analog is polymerized into a complementary sequence and either the duration of the emission or lack of emission or the wavelength range of the emitted light evidences the particular nucleotide or nucleotide analog polymerized into the sequence so that at the completion of the sequencing the data stream is retrieved.
  • The present invention provides a system for storing and retrieving data comprising: (a) a sequence of nucleotides or nucleotide analogs representing a given data stream; (b) a single-molecule sequencer comprising a polymerase having at least one fluorescent tag covalently attached thereto; (c) an excitation light source adapted to excite the at least one fluorescent tag on the polymerase or the monomers; and (d) a fluorescent light detector adapted to detect emitted fluorescent light from at least one fluorescent tag on the polymerase or the monomers; where at least one fluorescent tag emits or fails to emit fluorescent light each time a new nucleotide or nucleotide analog is polymerized into a complementary sequence and either the duration of the emission or lack of emission or the wavelength range of the emitted light evidences the particular nucleotide or nucleotide analog polymerized into the sequence so that at the completion of the sequencing the data stream is retrieved.
  • The present invention provides a method for sequencing a molecular sequence comprising the steps of: (a) a sequenced of nucleotides or nucleotide analogs representing a given data stream; (b) a single-molecule sequencer comprising a polymerase having at least one fluorescent tag covalently attached thereto; (c) an excitation light source adapted to excite at least one fluorescent tag on the polymerase or the monomers; and (d) a fluorescent light detector adapted to detect emitted fluorescent light from at least one fluorescent tag on the polymerase; where at least one fluorescent tag emits or fails to emit fluorescent light each time a new nucleotide or nucleotide analog is polymerized into a complementary sequence and either the duration of the emission or lack of emission or the wavelength range of the emitted light evidences the particular nucleotide or nucleotide analog polymerized into the sequence so that at the completion of the sequencing the data stream is retrieved.
  • The present invention provides a method for synthesizing a γ-phosphate modified nucleotide comprising the steps of attaching a molecular tag to a pyrophosphate group and contacting the modified pyrophosphate with a dNMP to produce a γ-phosphate tagged dNTP.
  • The present invention provides a method for 5′ end-labeling a biomolecule comprising the step of contacting the biomolecule with a kinase able to transfer a γ-phosphate of a γ-phosphate labeled ATP to the 5′ end of the biomolecule resulting in a covalently modified biomolecule.
  • The present invention provides a method for end-labeling a polypeptide or carbohydrate comprising the step of contacting the polypeptide or carbohydrate with an agent able to transfer an atomic or molecular tag to either a carboxy or amino end of a protein or polypeptide or to either the γ-phosphate of a γ-phosphate labeled ATP to the 5′ end of the biomolecule resulting in a covalently modified biomolecule.
  • DESCRIPTION OF THE DRAWINGS
  • The invention can be better understood with reference to the following detailed description together with the appended illustrative drawings in which like elements are numbered the same:
  • FIG. 1 depicts FRET activity as a function of distance separating the fluorescent donor and acceptor;
  • FIG. 2 depicts the open and closed ternary complex forms of the large fragment of Taq DNA pol I (Klentaq 1);
  • FIGS. 3A-C depicts an overlay between 3ktq (closed ‘black’) and 1tau (open ‘light blue’), the large fragment of Taq DNA polymerase I;
  • FIG. 4 depicts an image of a 20% denaturing polyacrylamide gel containing size separated radiolabeled products from DNA extension experiments involving γ-ANS-phosphate-dATP;
  • FIG. 5 depicts an image of (A) the actual gel, (B) a lightened phosphorimage and (C) an enhanced phosphorimage of products generated in DNA extension reactions using γ-ANS-phosphate-dNTPs;
  • FIG. 6 depicts an image of (A) 6% denaturing polyacrylamide gel, (B) a lightened phosphorimage of the actual gel, and (C) an enhanced phosphorimage of the actual gel containing products generated in DNA extension reactions using γ-ANS-phosphate-dNTPs;
  • FIG. 7 depicts an image of (A) the actual gel, (B) a lightened phosphorimage of the actual gel, and (C) an enhanced phosphorimage of the actual gel;
  • FIG. 8 depicts data for the Klenow fragment from E. coli DNA polymerase I incorporation of gamma-modified nucleotides;
  • FIG. 9 depicts data for the Pfu DNA polymerase incorporation of gamma-modified nucleotides;
  • FIG. 10 depicts data for the HIV-1 reverse transcriptase incorporation of gamma-tagged nucleotides;
  • FIG. 11 depicts experimental results for the native T7 DNA polymerase and SEQUENASE® incorporation of gamma-tagged nucleotides; and
  • FIG. 12 depicts reaction products produced when the four natural nucleotides (dATP, dCTP, dGTP and dTTP) are used in the synthesis reaction (solid line) and reaction products produced when base-modified nucleotides are used in the synthesis reaction.
  • DETAILED DESCRIPTION OF THE INVENTION
  • The inventors have devised a methodology using tagged monomers such as dNTPs and/or tagged polymerizing agents such as polymerase and/or tagged agents associated with the polymerizing agent such as polymerase associated proteins or probes to directly readout the exact monomer sequence such as a base sequence of an RNA or DNA sequence during polymerase activity. The methodology of this invention is adaptable to protein synthesis or to carbohydrate synthesis or to the synthesis of any molecular sequence where the sequence of monomers provides usable information such as the sequence of a RNA or DNA molecule, a protein, a carbohydrate, a mixed biomolecule or an inorganic or organic sequence of monomers which stores a data stream. The methods and apparatuses using these methods are designed to create new ways to address basic research questions such as monitoring conformation changes occurring during replication and assaying polymerase incorporation fidelity in a variety of sequence contexts. The single-molecule detection systems of this invention are designed to improve fluorescent molecule chemistry, computer modeling, base-calling algorithms, and genetic engineering of biomolecules, especially for real-time or near real-time sequencing. The inventors have also found that the methodology can be adapted to depolymerizing agents such as exonucleases where the polymer sequence is determined by depolymerization instead of polymerization. Moreover, the single-molecule systems of this invention are amendable to parallel and/or massively parallel assays, where tagged polymerases are patterned in arrays on a substrate. The data collected from such arrays can be used to improve sequence confidence and/or to simultaneously sequence DNA regions from many different sources to identify similarities or differences.
  • The pattern of emission signals is collected, either directly, such as by an Intensified Charge Coupled Devise (ICCD) or through an intermediate or series of intermediates to amplify signal prior to electronic detection, where the signals are decoded and confidence values are assigned to each base to reveal the sequence complementary to that of the template. Thus, the present invention also provides techniques for amplifying the fluorescent light emitted from a fluorescent tag using physical light amplification techniques or molecular cascading agent to amplify the light produced by single-molecular fluorescent events.
  • The single-molecule DNA sequencing systems of this invention have the potential to replace current DNA sequencing technologies, because the methodology can decrease time, labor, and costs associated with the sequencing process, and can lead to highly scalable sequencing systems, improving the DNA sequence discovery process by at least one to two orders of magnitude per reaction.
  • The single-molecule DNA sequencing technology of this invention can: (1) make it easier to classify an organism or identify variations within an organism by simply sequencing the genome or a portion thereof; (2) make rapid identification of a pathogen or a genetically-modified pathogen easier, especially in extreme circumstances such as in pathogens used in warfare; and (3) make rapid identification of persons for either law enforcement and military applications easier.
  • One embodiment of the single-molecule sequencing technology of this invention involves strategically positioning a pair of tags on a DNA polymerase so that as a dNTP is incorporated during the polymerization reaction, the tags change relative separation. This relative change causes a change in a detectable property, such as the intensity and/or frequency of fluorescence from one or both of the tags. A time profile of these changes in the detectable property evidences each monomer incorporation event and provides evidence about which particular dNTP is being incorporated at each incorporation event. The pair of tags do not have to be covalently attached to the polymerase, but can be attached to molecules that associate with the polymerase in such a way that the relative separation of the tags change during base incorporation.
  • Another embodiment of the single-molecule sequencing technology of this invention involves a single tag strategically positioned on a DNA polymerase that interacts with a tag on a dNTP or separate tags on each dNTP. The tags could be different for each dNTP such as color-coded tags which emit a different color of fluorescent light. As the next dNTP is incorporated during the polymerization process, the identity of the base is indicated by a signature fluorescent signal (color) or a change in a fluorescent signal intensity and/or frequency. The rate of polymerase incorporation can be varied and/or controlled to create an essentially “real-time” or near “real-time” or real-time readout of polymerase activity and base sequence. Sequence data can be collected at a rate of >100,000 bases per hour from each polymerase.
  • In another embodiment of the single-molecule sequencing technology of this invention, the tagged polymerases each include a donor tag and an acceptor tag situated or located on or within the polymerase, where the distance between the tags changes during dNTP binding, dNTP incorporation and/or chain extension. This change in inter-tag distance results in a change in the intensity and/or wavelength of emitted fluorescent light from the fluorescing tag. Monitoring the changes in intensity and/or frequency of the emitted light provides information or data about polymerization events and the identity of incorporated bases.
  • In another embodiment, the tags on the polymerases are designed to interact with the tags on the dNTPs, where the interaction changes a detectable property of one or both of the tags. Each fluorescently tagged polymerase is monitored for polymerization using tagged dNTPs to determine the efficacy of base incorporation data derived therefrom. Specific assays and protocols have been developed along with specific analytical equipment to measure and quantify the fluorescent data allowing the determination and identification of each incorporated dNTP. Concurrently, the inventors have identified tagged dNTPs that are polymerized by suitable polymerases and have developed software that analyze the fluorescence emitted from the reaction and interpret base identity. One skilled in the art will recognize that appropriate fluorescently active pairs are well-known in the art and commercially available from such vendors as Molecular Probes located in Oregon or Biosearch Technologies, Inc. in Novato, Calif.
  • The tagged DNA polymerase for use in this invention are genetically engineered to provide one or more tag binding sites that allow the different embodiments of this invention to operate. Once a suitable polymerase candidate is identified, specific amino acids within the polymerase are mutated and/or modified such reactions well-known in the art; provided, however, that the mutation and/or modification do not significantly adversely affect polymerization efficiency. The mutated and/or modified amino acids are adapted to facilitate tag attachment such as a dye or fluorescent donor or acceptor molecule in the case of light activated tags. Once formed, the engineered polymerase can be contacted with one or more appropriate tags and used in the apparatuses and methods of this invention.
  • Engineering a polymerase to function as a direct molecular sensor of DNA base identity provides a route to a fast and potentially real-time enzymatic DNA sequencing system. The single-molecule DNA sequencing system of this invention can significantly reduce time, labor, and costs associated with the sequencing process and is highly scalable. The single-molecule DNA sequencing system of this invention: (1) can improve the sequence discovery process by at least two orders of magnitude per reaction; (2) is not constrained by the length limitations associated with the degradation-based, single-molecule methods; and (3) allows direct sequencing of desired (target) DNA sequences, especially genomes without the need for cloning or PCR amplification, both of which introduce errors in the sequence. The systems of this invention can make easier the task of classifying an organism or identifying variations within an organism by simply sequencing the genome in question or any desired portion of the genome. The system of this invention is adapted to rapidly identify pathogens or engineered pathogens, which has importance for assessing health-related effects, and for general DNA diagnostics, including cancer detection and/or characterization, genome analysis, or a more comprehensive form of genetic variation detection. The single-molecule DNA sequencing system of this invention can become an enabling platform technology for single-molecule genetic analysis.
  • The single-molecule sequencing systems of this invention have the following advantages: (1) the systems eliminates sequencing reaction processing, gel or capillary loading, electrophoresis, and data assembly; (2) the systems results in significant savings in labor, time, and costs; (3) the systems allows near real-time or real-time data acquisition, processing and determination of incorporation events (timing, duration, etc.), base sequence, etc.; (4) the systems allows parallel or massively parallel sample processing in microarray format; (5) the systems allows rapid genome sequencing, in time frames of a day or less; (6) the systems requires very small amount of material for analysis; (7) the systems allows rapid genetic identification, screening and characterization of animals including humans or pathogen; (8) the systems allows large increases in sequence throughput; (9) the system can avoid error introduced in PCR, RT-PCR, and transcription processes; (10) the systems can allow accurate sequence information for allele-specific mutation detection; (11) the systems allows rapid medical diagnostics, e.g., Single Nucleotide Polymorphism (SNP) detection; (12) the systems allows improvement in basic research, e.g., examination of polymerase incorporation rates in a variety of different sequence contexts; analysis of errors in different contexts; epigenotypic analysis; analysis of protein glycosylation; protein identification; (13) the systems allows the creation of new robust (rugged) single-molecule detection apparatus; (14) the systems allows the development of systems and procedures that are compatible with biomolecules; (15) the systems allows the development genetic nanomachines or nanotechnology; (16) the systems allows the construction of large genetic databases and (17) the system has high sensitivity for low mutation event detection.
  • Brief Overview of Single-Molecule DNA Sequencing
  • In one embodiment of the single-molecule DNA sequencing system of this invention, a single tag is attached to an appropriate site on a polymerase and a unique tag is attached to each of the four nucleotides: dATP, dTTP, dCTP and dGTP. The tags on each dNTPs are designed to have a unique emission signature (i.e., different emission frequency spectrum or color), which is directly detected upon incorporation. As a tagged dNTP is incorporated into a growing DNA polymer, a characteristic fluorescent signal or base emission signature is emitted due to the interaction of polymerase tag and the dNTP tag. The fluorescent signals, i.e., the emission intensity and/or frequency, are then detected and analyzed to determine DNA base sequence.
  • One criteria for selection of the tagged polymerase and/or dNTPs for use in this invention is that the tags on either the polymerase and/or the dNTPs do not interfere with Watson-Crick base-pairing or significantly adversely impact polymerase activity. The inventors have found that dNTPs containing tags attached to the terminal (gamma) phosphate are incorporated by a native Taq polymerase either in combination with untagged dNTPs or using only tagged dNTPs. Tagging the dNTPs on the β and/or γ phosphate group is preferred because the resulting DNA strands do not include any of the dNTP tags in their molecular make up, minimizing enzyme distortion and background fluorescence.
  • One embodiment of the sequencing system of this invention involves placing a fluorescent donor such as fluorescein or a fluorescein-type molecule on the polymerase and unique fluorescent acceptors such as a d-rhodamine or a similar molecule on each dNTP, where each unique acceptor, when interacting with the donor on the polymerase, generates a fluorescent spectrum including at least one distinguishable frequency or spectral feature. As an incoming, tagged dNTP is bound by the polymerase for DNA elongation, the detected fluorescent signal or spectrum is analyzed and the identity of the incorporated base is determined.
  • Another embodiment of the sequencing system of this invention involves a fluorescent tag on the polymerase and unique quenchers on the dNTPs, where the quenchers preferably have distinguishable quenching efficiencies for the polymerase tag. Consequently, the identity of each incoming quencher tagged dNTP is determined by its unique quenching efficiency of the emission of the polymerase fluorescent tag. Again, the signals produced during incorporation are detected and analyzed to determine each base incorporated, the sequence of which generates the DNA base sequence.
  • Reagents
  • Suitable polymerizing agents for use in this invention include, without limitation, any polymerizing agent that polymerizes monomers relative to a specific template such as a DNA or RNA polymerase, reverse transcriptase, or the like or that polymerizes monomers in a step-wise fashion.
  • Suitable polymerases for use in this invention include, without limitation, any polymerase that can be isolated from its host in sufficient amounts for purification and use and/or genetically engineered into other organisms for expression, isolation and purification in amounts sufficient for use in this invention such as DNA or RNA polymerases that polymerize DNA, RNA or mixed sequences, into extended nucleic acid polymers. Preferred polymerases for use in this invention include mutants or mutated variants of native polymerases where the mutants have one or more amino acids replaced by amino acids amenable to attaching an atomic or molecular tag, which have a detectable property. Exemplary DNA polymerases include, without limitation, HIV1-Reverse Transcriptase using either RNA or DNA templates, DNA pol I from T. aquaticus or E. coli, Bateriophage T4 DNA pol, T7 DNA pol or the like. Exemplary RNA polymerases include, without limitation, T7 RNA polymerase or the like.
  • Suitable depolymerizing agents for use in this invention include, without limitation, any depolymerizing agent that depolymerizes monomers in a step-wise fashion such as exonucleases in the case of DNA, RNA or mixed DNA/RNA polymers, proteases in the case of polypeptides and enzymes or enzyme systems that sequentially depolymerize polysaccharides.
  • Suitable monomers for use in this invention include, without limitation, any monomer that can be step-wise polymerized into a polymer using a polymerizing agent. Suitable nucleotides for use in this invention include, without limitation, naturally occurring nucleotides, synthetic analogs thereof, analog having atomic and/or molecular tags attached thereto, or mixtures or combinations thereof.
  • Suitable atomic tag for use in this invention include, without limitation, any atomic element amenable to attachment to a specific site in a polymerizing agent or dNTP, especially Europium shift agents, nmr active atoms or the like.
  • Suitable atomic tag for use in this invention include, without limitation, any atomic element amenable to attachment to a specific site in a polymerizing agent or dNTP, especially fluorescent dyes such as d-Rhodamine acceptor dyes including dichloro[R110], dichloro[R6G], dichloro[TAMRA], dichloro[ROX] or the like, fluorescein donor dye including fluorescein, 6-FAM, or the like; Acridine including Acridine orange, Acridine yellow, Proflavin, pH 7, or the like; Aromatic Hydrocarbon including 2-Methylbenzoxazole, Ethyl p-dimethylaminobenzoate, Phenol, Pyrrole, benzene, toluene, or the like; Arylmethine Dyes including Auramine O, Crystal violet, H2O, Crystal violet, glycerol, Malachite Green or the like; Coumarin dyes including 7-Methoxycoumarin-4-acetic acid, Coumarin 1, Coumarin 30, Coumarin 314, Coumarin 343, Coumarin 6 or the like; Cyanine Dye including 1,1′-diethyl-2,2′-cyanine iodide, Cryptocyanine, Indocarbocyanine (C3)dye, Indodicarbocyanine (C5)dye, Indotricarbocyanine (C7)dye, Oxacarbocyanine (C3)dye, Oxadicarbocyanine (C5)dye, Oxatricarbocyanine (C7)dye, Pinacyanol iodide, Stains all, Thiacarbocyanine (C3)dye, ethanol, Thiacarbocyanine (C3)dye, n-propanol, Thiadicarbocyanine (C5)dye, Thiatricarbocyanine (C7)dye, or the like; Dipyrrin dyes including N,N′-Difluoroboryl-1,9-dimethyl-5-(4-iodophenyl)-dipyrrin, N,N′-Difluoroboryl-1,9-dimethyl-5-[(4-(2-trimethylsilylethynyl), N,N′-Difluoroboryl-1,9-dimethyl-5-phenydipyrrin, or the like; Merocyanines including 4-(dicyanomethylene)-2-methyl-6-(p-dimethylaminostyryl)-4H-pyran (DCM), acetonitrile, 4-(dicyanomethylene)-2-methyl-6-(p-dimethylaminostyryl)-4H-pyran (DCM), methanol, 4-Dimethylamino-4′-nitrostilbene, Merocyanine 540, or the like; Miscellaneous Dye including 4′,6-Diamidino-2-phenylindole (DAPI), 4′,6-Diamidino-2-phenylindole (DAPI), dimethylsulfoxide, 7-Benzylamino-4-nitrobenz-2-oxa-1,3-diazole, Dansyl glycine, H2O, Dansyl glycine, dioxane, HOECHST® 33258, DMF, HOECHST® 33258, H2O, Lucifer yellow CH, Piroxicam, Quinine sulfate, 0.05 M H2SO4, Quinine sulfate, 0.5 M H2SO4, Squarylium dye III, or the like; Oligophenylenes including 2,5-Diphenyloxazole (PPO), Biphenyl, POPOP, p-Quaterphenyl, p-Terphenyl, or the like; Oxazines including Cresyl violet perchlorate, Nile Blue, methanol, Nile Red, Nile blue, ethanol, Oxazine 1, Oxazine 170, or the like; Polycyclic Aromatic Hydrocarbons including 9,10-Bis(phenylethynyl)anthracene, 9,10-Diphenylanthracene, Anthracene, Naphthalene, Perylene, Pyrene, or the like; polyene/polyynes including 1,2-diphenylacetylene, 1,4-diphenylbutadiene, 1,4-diphenylbutadiyne, 1,6-Diphenylhexatriene, Beta-carotene, Stilbene, or the like; Redox-active Chromophores including Anthraquinone, Azobenzene, Benzoquinone, Ferrocene, Riboflavin, Tris(2,2′-bipyridyl)ruthenium(II), Tetrapyrrole, Bilirubin, Chlorophyll a, diethyl ether, Chlorophyll a, methanol, Chlorophyll b, Diprotonated-tetraphenylporphyrin, Hematin, Magnesium octaethylporphyrin, Magnesium octaethylporphyrin (MgOEP), Magnesium phthalocyanine (MgPc), PrOH, Magnesium phthalocyanine (MgPc), pyridine, Magnesium tetramesitylporphyrin (MgTMP), Magnesium tetraphenylporphyrin (MgTPP), Octaethylporphyrin, Phthalocyanine (Pc), Porphin, Tetra-t-butylazaporphine, Tetra-t-butylnaphthalocyanine, Tetrakis(2,6-dichlorophenyl)porphyrin, Tetrakis(o-aminophenyl)porphyrin, Tetramesitylporphyrin (TMP), Tetraphenylporphyrin (TPP), Vitamin B12, Zinc octaethylporphyrin (ZnOEP), Zinc phthalocyanine (ZnPc), pyridine, Zinc tetramesitylporphyrin (ZnTMP), Zinc tetramesitylporphyrin radical cation, Zinc tetraphenylporphyrin (ZnTPP), or the like; Xanthenes including Eosin Y, Fluorescein, basic ethanol, Fluorescein, ethanol, Rhodamine 123, Rhodamine 6G, Rhodamine B, Rose bengal, Sulforhodamine 101, or the like; or mixtures or combination thereof or synthetic derivatives thereof or FRET fluorophore-quencher pairs including DLO-FB1 (5′-FAM/3′-BHQ-1) DLO-TEB1 (5′-TET/3′-BHQ-1), DLO-JB1 (5′-JOE/3′-BHQ-1), DLO-HB1 (5′-HEX/3′-BHQ-1), DLO-C3B2 (5′-Cy3/3′-BHQ-2), DLO-TAB2 (5′-TAMRA/3′-BHQ-2), DLO-RB2 (5′-ROX/3′-BHQ-2), DLO-C5B3 (5′-Cy5/3′-BHQ-3), DLO-C55B3 (5′-Cy5.5/3′-BHQ-3), MBO-FB1 (5′-FAM/3′-BHQ-1), MBO-TEB1 (5′-TET/3′-BHQ-1), MBO-JB1 (5′-JOE/3′-BHQ-1), MBO-HB1 (5′-HEX/3′-BHQ-1), MBO-C3B2 (5′-Cy3/3′-BHQ-2), MBO-TAB2 (5′-TAMRA/3′-BHQ-2), MBO-RB2 (5′-ROX/3′-BHQ-2); MBO-05B3 (5′-Cy5/3′-BHQ-3), MBO-C55B3 (5′-Cy5.5/3′-BHQ-3) or similar FRET pairs available from Biosearch Technologies, Inc. of Novato, Calif., tags with nmr active groups, tags with spectral features that can be easily identified such as IR, far IR, visible UV, far UV or the like.
  • Enzyme Choice
  • The inventors have found that the DNA polymerase from Therms aquaticus—Taq DNA polymerase I—is ideally suited for use in the single-molecule apparatuses, systems and methods of this invention. Taq DNA Polymerase, sometimes simply referred to herein as Taq, has many attributes that the inventors can utilize in constructing tagged polymerases for use in the inventions disclosed in this application. Of course, ordinary artisans will recognize that other polymerases can be adapted for use in the single-molecule sequencing systems of this invention.
  • Since Taq DNA polymerase I tolerates so many mutations within or near its active site (as reviewed in Patel et al, J. Mol. Biol., volume 308, pages 823-837, and incorporated herein by reference), the enzyme is more tolerant of enzyme tagging modification(s) and also able to incorporate a wider range of modified nucleotide substrates.
  • Crystal Structures are Available for Taq DNA Polymerase
  • There are 13 structures solved for Taq DNA polymerase, with or without DNA template/primer, dNTP, or ddNTP, which allows sufficient information for the selection of amino acid sites within the polymerase to which an atomic and/or molecular tag such as a fluorescent tag can be attached without adversely affecting polymerase activity. See, e.g., Eom et al., 1996; Li et al., 1998a; Li et al., 1998b. Additionally, the inventors have a written program to aid in identifying optimal tag addition sites. The program compares structural data associated with the Taq polymerase in its open and closed form to identify regions in the polymerase structure that are optimally positioned to optimize the difference in conformation extremes between a tag on the polymerase and the dNTP or to optimize a change in separation between two tags on the polymerase, thereby increasing or maximizing changes in a detectable property of one of the tags or tag pair.
  • Taq DNA Polymerase is Efficiently Expressed in E. Coli
  • The Taq DNA polymerase is efficiently expressed in E. coli allowing efficient production and purification of the nascent polymerase and variants thereof for rapid identification, characterization and optimization of an engineered Taq DNA polymerase for use in the single-molecule DNA sequencing systems of this invention.
  • No Cysteines are Present in the Protein Sequence
  • The Taq DNA polymerase contains no cysteines, which allows the easy generation of cysteine-containing mutants in which a single cysteine is placed or substituted for an existing amino acid at strategic sites, where the inserted cysteine serves as a tag attachment site.
  • The Processivity of the Enzyme can be Modified
  • Although native Taq DNA polymerase may not represent an optimal polymerase for sequencing system of this invention because it is not a very processive polymerase (50-80 nucleotides are incorporated before dissociation), the low processivity may be compensated for by appropriately modifying the base calling software. Alternatively, the processivity of the Taq DNA Polymerase can be enhanced through genetic engineering by inserting into the polymerase gene a processivity enhancing sequence. Highly processive polymerases are expected to minimize complications that may arise from template dissociation effects, which can alter polymerization rate. The processivity of Taq can be genetically altered by introducing the 76 amino acid ‘processivity domain’ from T7 DNA polymerase between the H and H1 helices (at the tip of ‘thumb’ region within the polymerase) of Taq. The processivity domain also includes the thioredoxin binding domain (TBD) from T7 DNA polymerase causing the Taq polymerase to be thioredoxin-dependent increasing both the processivity and specific activity of Taq polymerase. See, e.g., Bedford et al., 1997; Bedford et al., 1999.
  • Taq DNA Polymerase Possesses a 5′ to 3′ Exonuclease Activity and is Thermostable
  • Single-stranded M13 DNA and synthetic oligonucleotides are used in the initial studies. After polymerase activity is optimized, the sequencing system can be used to directly determine sequence information from an isolated chromosome—a double-stranded DNA molecule. Generally, heating a sample of double-stranded DNA is sufficient to produce or maintain the double-stranded DNA in stranded DNA form for sequencing.
  • To favor the single-stranded state, the 5′ to 3′ exonuclease activity of the native Taq DNA polymerase in the enzyme engineered for single-molecule DNA sequencing is retained. This activity of the polymerase is exploited by the ‘TaqMan’ assay. The exonuclease activity removes a duplex strand that may renature downstream from the replication site using a nick-translation reaction mechanism. Synthesis from the engineered polymerase is initiated either by a synthetic oligonucleotide primer (if a specific reaction start is necessary) or by a nick in the DNA molecule (if multiple reactions are processed) to determine the sequence of an entire DNA molecule.
  • The Polymerase is Free from 3′ to 5′ Exonuclease Activity
  • The Taq DNA polymerase is does not contain 3′ to 5′ exonuclease activity, which means that the polymerase cannot replace a base, for which fluorescent signal was detected, with another base which would produce another signature fluorescent signal.
  • All polymerases make replication errors. The 3′ to 5′ exonuclease activity is used to proofread the newly replicated DNA strand. Since Taq DNA polymerase lacks this proofreading function, an error in base incorporation becomes an error in DNA replication. Error rates for Taq DNA polymerase are 1 error per ˜100,000 bases synthesized, which is sufficiently low to assure a relatively high fidelity. See, e.g., Eckert and Kunkel, 1990; Cline et al., 1996. It has been suggested and verified for a polymerase that the elimination of this exonuclease activity uncovers a decreased fidelity during incorporation. Thus, Taq polymerase must—by necessity—be more accurate during initial nucleotide selection and/or incorporation, and is therefore an excellent choice of use in the present inventions.
  • The error rate of engineered polymerases of this invention is assayed by determining their error rates in synthesizing known sequences. The error rate determines the optimal number of reactions to be run in parallel so that sequencing information can be assigned with confidence. The optimal number can be 1 or 10 or more. For example, the inventors have discovered that base context influences polymerase accuracy and reaction kinetics, and this information is used to assign confidence values to individual base calls. However, depending on the goal of a particular sequencing project, it may be more important to generate a genome sequence as rapidly as possible. For example, it may be preferable to generate, or draft, the genome sequence of a pathogen at reduced accuracy for initial identification purposes or for fast screening of potential pathogens.
  • Taq DNA Polymerase is the Enzyme of Choice for Single-Molecule DNA Sequencing
  • Engineering the polymerase to function as a direct molecular sensor of DNA base identity provides the fastest enzymatic DNA sequencing system possible. For the reasons detailed above, Taq DNA polymerase is the optimal enzyme to genetically modify and adapt for single-molecule DNA sequencing. Additionally, basic research questions concerning DNA polymerase structure and function during replication can be addressed using this technology advancing single-molecule detection systems and molecular models in other disciplines. The inventors have found that native Taq DNA polymerase incorporates gamma-tagged dNTPs, yielding extended DNA polymers. Importantly, incorporation of a modified nucleotide is not detrimental to polymerase activity and extension of primer strands by incorporation of a _-tagged nucleotide conforms to Watson-Crick base pairing rules.
  • Detecting Tagged Polymerase-Nucleotide Interactions
  • One preferred method for detecting polymerase-nucleotide interactions involves a fluorescence resonance energy transfer-based (FRET-based) method to maximize signal and minimize noise. A FRET-based method exists when the emission from an acceptor is more intense than the emission from a donor, i.e., the acceptor has a higher fluorescence quantum yield than the donor at the excitation frequency. The efficiency of FRET method can be estimated form computational models. See, e.g., Furey et al., 1998; Clegg et al., 1993; Mathies et al., 1990. The efficiency of energy transfer (E) is computed from equation (1) as follows:

  • E=1/(1+[R/R 0]6)  (1)
  • where R0 is the Forster critical distance at E=0.5. R0 is calculated from equation (2):

  • R 0=(9.79×103)(κ2η−4 Q D J DA)1/6  (2)
  • where ηi is the refractive index of the medium (η=1.4 for aqueous solution), κ2 is a geometric orientation factor related to the relative angle of the two transition dipoles (κ2 is generally assumed to be ⅔), JDA [M−1 cm3] is the overlap integral representing the normalized spectral overlap of the donor emission and acceptor absorption, and QD is the quantum yield. The overlap integral is computed from equation (3):

  • J DA =[∫F D(λ)∈A(λ)∈A(λ)λ4 dλ]/[∫F D(λ)dλ]  (3)
  • where FD is the donor emission, ∈A is the acceptor absorption. QD is calculated from equation (4):

  • Q D =Q RF(I D /I RF)(A RF /A D)  (4)
  • where ID and IRF are the fluorescence intensities of donor and a reference compound (fluorescein in 0.1 N NaOH), and ARF and AD are the absorbances of the reference compound and donor. QRF is the quantum yield of fluorescein in 0.1N NaOH and is taken to be 0.90.
  • R, the distance between the donor and acceptor, is measured by looking at different configurations (e.g., conformations) of the polymerase in order to obtain a conformationally averaged value. If both tags are on the polymerase, then R is the distance between the donor and acceptor in the open and closed conformation, while if the donor is on the polymerase and the acceptor on the dNTP, R is the distance between the donor and acceptor when the dNTP is bound to the polymerase and the polymerase is its closed form.
  • The distance between the tagged γ-phosphate and the selected amino acid sites for labeling in the open versus closed polymerase conformation delineates optimal dye combinations. If the distance (R) between the donor and acceptor is the same as R0 (R0 is the Forster critical distance), FRET efficiency (E) is 50%. If R is more than 1.5 R0, the energy transfer efficiency becomes negligible (E<0.02). Sites within the enzyme at which R/R0 differ by more than 1.6 in the open versus closed forms are identified and, if necessary, these distances and/or distance differences can be increased through genetic engineering. A plot of FRET efficiency verses distance is shown in FIG. 1.
  • Fluorescent Dye Selection Process
  • Dye sets are chosen to maximize energy transfer efficiency between a tagged dNTP and a tag on the polymerase when the polymerase is in its closed configuration and to minimize energy transfer efficiency between the tag on the DNTP (either non-productively bound or in solution) and the tag on the polymerase when the polymerase is in its open configuration. Given a molarity of each nucleotide in the reaction medium of no more than about 1 μM, an average distance between tagged nucleotides is calculated to be greater than or equal to about 250 Å. Because this distance is several fold larger than the distance separating sites on the polymerase in its open to closed conformational, minimal FRET background between the polymerase and free dNTPs is observed. Preferably, nucleotide concentrations are reduced below 1 μM. Reducing dNTP concentrations to levels of at least <10% of the Km further minimizes background fluorescence and provides a convenient method for controlling the rate of the polymerase reaction for the real-time monitoring. Under such conditions, the velocity of the polymerization reaction is linearly proportional to the dNTP concentration and, thus, highly sensitive to regulation. Additionally, the use of a single excitation wavelength allows improved identification of unique tags on each dNTP. A single, lower-wavelength excitation laser is used to achieve high selectivity.
  • In one preferred embodiment, a fluorescence donor is attached to a site on the polymerase comprising a replaced amino acid more amenable to donor attachment such as cysteine and four unique fluorescence acceptors are attached to each dNTP. For example, fluorescein is attached to a site on the polymerase and rhodamine, rhodamine derivatives and/or fluorescein derivatives are attached to each dNTP. Each donor-acceptor fluorophore pair is designed to have an absorption spectrum sufficiently distinct from the spectra of other pairs to allow separate identification after excitation. Preferably, the donor is selected such that the excitation light activates the donor, which then efficiency transfers the excitation energy to one of the acceptors. After energy transfer, the acceptor emits it unique fluorescence signature. The emission of the fluorescence donor must significant overlap with the absorption spectra of the fluorescence acceptors for efficient energy transfer. However, the methods of this invention can also be performed using two, three or four unique fluorescence donor-acceptor pairs, by running parallel reactions.
  • Fluorophore choice is a function of not only its enzyme compatibility, but also its spectral and photophysical properties. For instance, it is critical that the acceptor fluorophore does not have any significant absorption at the excitation wavelength of the donor fluorophore, and less critical (but also desirable) is that the donor fluorophore does not have emission at the detection wavelength of the acceptor fluorophore. These spectral properties can be attenuated by chemical modifications of the fluorophore ring systems.
  • Although the dNTPs are amenable to tagging at several sites including the base, the sugar and the phosphate groups, the dNTPs are preferably tagged at either the β and/or γ phosphate. Tagging the terminal phosphates of dNTP has a unique advantage. When the incoming, tagged dNTP is bound to the active site of the polymerase, significant FRET from the donor on the polymerase to the acceptor on the dNTP occurs. The unique fluorescence of the acceptor identifies which dNTP is incorporated. Once the tagged DNTP is incorporated into the growing DNA chain, the fluorescence acceptor, which is now attached to the pyrophosphate group, is released to the medium with the cleaved pyrophosphate group. In fact, the growing DNA chain includes no fluorescence acceptor molecules at all. In essence, FRET occurs only between the donor on the polymerase and incoming acceptor-labeled dNTP, one at a time. This approach is better than the alternative attachment of the acceptor to a site within the dNMP moiety of the dNTP or the use of multiply-modified dNTPs. If the acceptor is attached to a site other than the β or γ phosphate group, it becomes part of the growing DNA chain and the DNA chain will contain multiple fluorescence acceptors. Interference with the polymerization reaction and FRET measurements would likely occur.
  • If the fluorescence from the tagged dNTPs in the polymerizing medium (background) is problematic, collisional quenchers can be added to the polymerizing medium that do not covalently interact with the acceptors on the dNTPs and quench fluorescence from the tagged dNTPs in the medium. Of course, the quenchers are also adapted to have insignificant contact with the donor on the polymerase. To minimize interaction between the collisional quenchers and the donor on the polymerase, the polymerase tag is preferably localized internally and shielded from the collisional quenchers or the collisional quencher can be made sterically bulky or associate with a sterically bulky group to decrease interaction between the quencher and the polymerase.
  • Another preferred method for detecting polymerase-nucleotide interactions involves using nucleotide-specific quenching agents to quench the emission of a fluorescent tag on the polymerase. Thus, the polymerase is tagged with a fluorophore, while each dNTP is labeled with a quencher for the fluorophore. Typically, DABCYL (4-(4′-dimethylaminophe-nylazo) benzoic acid is a universal quencher, which absorbs energy from a fluorophore, such as 5-(2′-aminoethyl)aminonaphthalene-1-sulfonic acid (AEANS) and dissipates heat. Preferably, a quencher is selected for each dNTP so that when each quencher is brought into close proximity to the fluorophore, a distinguishable quenching efficiency is obtained. Therefore, the degree of quenching is used to identify each dNTP as it is being incorporated into the growing DNA chain. One advantage of this preferred detection method is that fluorescence emission comes from a single source rendering background noise negligible. Although less preferred, if only two or three suitable quenchers are identified, then two or three of the four dNTPs are labeled and a series of polymerization reaction are made each time with a different pair of the labeled dNTPs. Combining the results from these runs generates a complete sequence of the DNA molecule.
  • Site Selection for Labeling the Taq Polymerase and dNTPs
  • Although the present invention is directed to attaching any type of atomic and/or molecular tag that has a detectable property, the processes for site selection and tag attachment are illustrated using a preferred class of tags, namely fluorescent tags.
  • Fluorescent Labeling of Polymerase and/or dNTPs
  • The fluorescence probes or quenchers attached to the polymerase or dNTPs are designed to minimize adverse effects on the DNA polymerization reaction. The inventors have developed synthetic methods for chemically tagging the polymerase and dNTPs with fluorescence probes or quenchers.
  • In general, the polymerase is tagged by replacing a selected amino acid codon in the DNA sequence encoding the polymerase with a codon for an amino acid that more easily reacts with a molecular tag such as cysteine via mutagenesis. Once a mutated DNA sequence is prepared, the mutant is inserted into E. coli for expression. After expression, the mutant polymerase is isolated and purified. The purified mutant polymerase is then tested for polymerase activity. After activity verification, the mutant polymerase is reacted with a slight molar excess of a desired tag to achieve near stoichiometric labeling. Alternatively, the polymerase can be treated with an excess amount of the tag and labeling followed as a function of time. The tagging reaction is than stopped when near stoichiometric labeling is obtained.
  • If the mutant polymerase includes several sites including the target residue that can undergo tagging with the desired molecular tag, then the tagging reaction can also be carried out under special reaction conditions such as using a protecting group or competitive inhibitor and a reversible blocking group, which are later removed. If the target amino acid residue in the mutant polymerase is close to the active dNTP binding site, a saturating level of a protecting group or a competitive inhibitor is first added to protect the target residue and a reversible blocking group is subsequently added to inactivate non-target residues. The protecting group or competitive inhibitor is then removed from the target residue, and the mutant polymerase is treated with the desired tag to label the target residue. Finally, the blocking groups are chemically removed from non-target residues in the mutant polymerase and removed to obtain a tagged mutant polymerase with the tag substantially to completely isolated on the target residue.
  • Alternatively, if the target residue is not near the active site, the polymerase can be treated with a blocking group to inactivate non-target residues. After removal of unreacted blocking group, the mutant polymerase is treated with the desired tag for labeling the target residue. Finally, the blocking groups are chemically removed from the non-target residues in the mutant polymerase and removed to obtain the tagged mutant polymerase.
  • Amino Acid Site Selection for the Taq Polymerase
  • The inventors have identified amino acids in the Taq polymerase that are likely to withstand mutation and subsequent tag attachment such as the attachment of a fluorescent tag. While many sites are capable of cysteine replacement and tag attachment, preferred sites in the polymerase were identified using the following criteria: (1) they are not in contact with other proteins; (2) they do not alter the conformation or folding of the polymerase; and (3) they are not involved in the function of the protein. The selections were accomplished using a combination of mutational studies including sequence analyses data, computational studies including molecular docking data and assaying for polymerase activity and fidelity. After site mutation, computational studies will be used to refine the molecular models and help to identify other potential sites for mutation.
  • Regions of the protein surface that are not important for function were identified, indirectly, by investigating the variation in sequence as a function of evolutionary time and protein function using the evolutionary trace method. See, e.g., Lichtarge et al., 1996. In this approach, amino acid residues that are important for structure or function are found by comparing evolutionary mutations and structural homologies. The polymerases are ideal systems for this type of study, as there are many crystal and co-crystal structures and many available sequences. The inventors have excluded regions of structural/functional importance from sites selection for mutation/labeling. In addition, visual inspection and overlays of available crystal structures of the polymerase in different conformational states, provided further assistance in identifying amino acid sites near the binding site for dNTPs. Some of the chosen amino acids sites are somewhat internally located and preferably surround active regions in the polymerase that undergo changes during base incorporation, such as the dNTP binding regions, base incorporation regions, pyrophosphate release regions, etc. These internal sites are preferred because a tag on these sites show reduced background signals during detection, i.e., reduce interaction between the polymerase enzyme and non-specifically associated tagged dNTPs, when fluorescently tagged dNTPs are used.
  • Once tagged mutant polymerases are prepared and energy minimized in a full solvent environment, estimates of the effect on the structure of the polymerase due to the mutation and/or labeling are generated to provide information about relative tag positioning and separation. This data is then used to estimate FRET efficiencies prior to measurement. Of course, if the dNTPs are tagged with quenchers, then these considerations are not as important.
  • Another aspect of this invention involves the construction of molecular mechanics force field parameters for atomic and/or molecular tags such as fluorescent tags used to tag the dNTPs and the polymerase and parameters for the fluorescent tagged amino acid on the polymerase and/or dNTP. Force field parameters are using quantum mechanical studies to obtain partial charge distributions and energies for relevant intramolecular conformations (i.e., for the dihedral angle definitions) derived from known polymerase crystal structures.
  • Ionization states of each ionizable residue are estimated using an electrostatic model in which the protein is treated as a low dielectric region and the solvent as a high dielectric, using the UHBD program. See, e.g., Antosiewicz et al., 1994; Briggs and Antosiewicz, 1999; Madura et al., 1995. The electrostatic free energies of ionization of each ionizable residue are computed by solving the Poisson-Boltzmann equation for each residue. These individual ionization free energies are modified to take into account coupled titration behavior resulting in a set of self-consistent predicted ionization states. These predicted ionization free energies are then recalculated so that shifts in ionization caused by the binding of a DNTP are taken into account. Unexpected ionization states are subject to further computational and experimental studies, leading to a set of partial charges for each residue in the protein, i.e., each ionizable residue in the protein can have a different charge state depending on the type of attached tag or amino acid substitution.
  • To further aid in amino acid site selection, an electrostatic potential map is generated from properties of the molecular surface of the Taq polymerase/DNA complex, screened by solvent and, optionally, by dissolved ions (i.e., ionic strength) using mainly the UHBD program. The map provides guidance about binding locations for the dNTPs and the electrostatic environment at proposed mutation/labeling sites.
  • The molecular models generated are designed to be continually refined taking into account new experimental data, allowing the construction of improved molecular models, improved molecular dynamics calculations and improved force field parameters so that the models better predict system behavior for refining tag chemistry and/or tag positioning, predicting new polymerase mutants, base incorporation rates and polymerase fidelity.
  • Molecular docking simulations are used to predict the docked orientation of the natural and fluorescently labeled dNTPs, within the polymerase binding pocket. The best-docked configurations are energy minimized in the presence of an explicit solvent environment. In conjunction with amino acid sites in the polymerase selected for labeling, the docking studies are used to analyze how the tags interact and to predict FRET efficiency for each selected amino acid site.
  • With the exception of the electrostatics calculations, all docking, quantum mechanics, molecular mechanics, and molecular dynamics calculations are and will be performed using the HyperChem (v6.0) computer program. The HyperChem software runs on PCs under a Windows operating system. A number of computer programs for data analysis or for FRET prediction (as described below) are and will be written on a PC using the Linux operating system and the UHBD program running under Linux.
  • Analysis of Polymerase Structures
  • Co-crystal structures solved for DNA polymerase I (DNA pol I) from E. coli, T. aquaticus, B. stearothermophilus, T7 bacteriophage, and human pol α demonstrate that (replicative) polymerases share mechanistic and structural features. The structures that capture Taq DNA polymerase in an ‘open’ (non-productive) conformation and in a ‘closed’ (productive) conformation are of particular importance for identifying regions of the polymerase that undergo changes during base incorporation. The addition of the nucleotide to the polymerase/primer/template complex is responsible for the transition from its open to its closed conformation. Comparison of these structures provides information about the conformational changes that occur within the polymerase during nucleotide incorporation. Specifically, in the closed conformation, the tip of the fingers domain is rotated inward by 46.degree., thereby positioning the dNTP at the 3′ end of the primer strand in the polymerase active site. The geometry of this terminal base pair is precisely matched with that of its binding pocket. The binding of the correct, complementary base facilitates formation of the closed conformation, whereas incorrect dNTP binding does not induce this conformational change. Reaction chemistry occurs when the enzyme is in the closed conformation.
  • Referring now to FIG. 2, the open and closed ternary complex forms of the large fragment of Taq DNA pol I (Klentaq 1) are shown in a superimposition of their Cα tracings. The ternary complex contains the enzyme, the ddCTP and the primer/template duplex DNA. The open structure is shown in magenta and the closed structure is shown in yellow. The disorganized appearance in the upper left portion of the protein shows movement of the ‘fingers’ domain in open and closed conformations.
  • Using a program to determine the change in position of amino acids in the open and closed conformation of the polymerase relative to the gamma phosphate of a bound ddGTP from two different crystal structures of the Taq polymerase containing the primer and bound ddGTP, lists of in 20 amino acid sites that undergo the largest change in position for mutation and labeling were identified. The distances were calculated for each amino acid between their alpha and beta carbon atoms and the gamma phosphate group of the bound ddGTP. Lists derived from the two different sets of crystallographic data for the Taq polymerase are given in Tables I, II, III and IV below.
  • TABLE I
    The 20 Amino Acid Sites Undergoing the Largest Positional
    Change in 2ktq Data Between the Open Form of the Polymerase
    to the Closed Form of the Polymerase Relative to the Alpha Carbon
    of the Residue
    Residue Changes in
    Residue Location Identity Distance (Å)
    517 Alanine 9.10
    516 Alanine 6.86
    515 Serine 6.53
    513 Serine 6.40
    518 Valine 5.12
    514 Threonine 3.94
    488 Asparagine 3.73
    487 Arginine 3.50
    489 Glutamine 3.13
    495 Phenylalanine 3.05
    491 Glutamic acid 2.90
    486 Serine 2.78
    490 Leucine 2.62
    586 Valine 2.61
    492 Arginine 2.60
    462 Glutamic acid 2.59
    483 Asparagine 2.47
    685 Proline 2.46
    587 Arginine 2.44
    521 Alanine 2.38
  • TABLE II
    The 20 Amino Acid Sites Undergoing the Largest Positional Change
    in 2ktq Data Between the Open Form of the Polymerase to the Closed
    Form of the Polymerase Relative to the Beta Carbon of the Residue
    Residue Changes in
    Residue Location Identity Distance (Å)
    517 Alanine 10.98
    516 Alanine 9.05
    515 Serine 8.02
    513 Serine 7.46
    518 Valine 5.47
    685 Proline 5.16
    487 Arginine 4.24
    495 Phenylalanine 3.94
    488 Aspartic acid 3.88
    520 Glutamic acid 3.66
    491 Glutamic acid 3.41
    587 Arginine 3.39
    521 Alanine 3.33
    498 Leucine 3.21
    489 Glutamine 3.08
    514 Threonine 2.97
    581 Leucine 2.93
    483 Asparagine 2.92
    497 Glutamic acid 2.91
    462 Glutamic acid 2.83
  • TABLE III
    The 20 Amino Acid Sites Undergoing the Largest Positional Change
    in 3ktq Data Between the Open Form of the Polymerase to the Closed
    Form of the Polymerase Relative to the Alpha Carbon of the Residue
    Residue Changes in
    Residue Location Identity Distance (Å)
    517 Alanine 8.95
    656 Proline 8.75
    657 Leucine 8.59
    655 Aspartic acid 8.05
    660 Arginine 7.35
    658 Methionine 7.06
    659 Arginine 6.69
    654 Valine 6.60
    513 Serine 6.59
    516 Alanine 6.57
    515 Serine 6.36
    653 Alanine 6.16
    661 Alanine 5.94
    652 Glutamic acid 5.44
    647 Phenylalanine 5.25
    649 Valine 5.22
    518 Valine 5.15
    644 Serine 5.08
    643 Alanine 5.01
    650 Proline 4.72
  • TABLE IV
    The 20 Amino Acid Sites Undergoing the Largest Positional Change
    in 3ktq Data Between the Open Form of the Polymerase to the Closed
    Form of the Polymerase Relative to the Beta Carbon of the Residue
    Residue Changes in
    Residue Location Identity Distance (Å)
    517 Alanine 10.85
    656 Proline 9.05
    657 Leucine 8.75
    516 Alanine 8.68
    655 Aspartic acid 8.24
    515 Serine 7.92
    660 Arginine 7.89
    513 Serine 7.60
    659 Arginine 6.98
    658 Methionine 6.77
    654 Valine 6.25
    653 Alanine 6.14
    661 Alanine 6.04
    643 Alanine 5.74
    649 Valine 5.55
    647 Phenylalanine 5.45
    518 Valine 5.42
    652 Glutamic acid 5.13
    644 Serine 4.89
    487 Arginine 4.77
  • The above listed amino acids represent preferred amino acid sites for cysteine replacement and subsequent tag attachment, because these sites represent the sites in the Taq polymerase the undergo significant changes in position during base incorporation.
  • To further refine the amino acid site selection, visualization of the polymerase in its open and closed conformational extremes for these identified amino acid sites is used so that the final selected amino acid sites maximize signal and minimize background noise, when modified to carry fluorescent tags for analysis using the FRET methodology. Amino acid changes that are not predicted to significantly affect the protein's secondary structure or activity make up a refined set of amino acid sites in the Taq polymerase for mutagenesis and fluorescent modification so that the tag is shielded from interaction with free dNTPs. The following three panels illustrate the protocol used in this invention to refine amino acid site selection from the about list of amino acids that undergo the largest change in position relative to a bound ddGTP as the polymerase transitions from the open to the closed form.
  • Referring now to FIGS. 3A-C, an overlay between 3 ktq (closed ‘black’) and 1 tau (open ‘light blue’), the large fragment of Taq DNA polymerase I is shown. Looking at FIG. 3A, the bound DNA from 3 ktq is shown in red while the ddCTP bound to 3 ktq is in green. Three residues were visually identified as moving the most when the polymerase goes from open (1 tau) to closed (3 ktq), namely, Asp655, Pro656, and Leu657. Based on further analyses of the structures, Pro656 appears to have the role of capping the O-helix. Leu657's side chain is very close to another part of the protein in the closed (3 ktq) form. Addition of a larger side chain/tag is thought to diminish the ability of the polymerase to achieve a fully closed, active conformation. Conversely, Asp655 is entirely solvent exposed in both the closed and open conformations of the polymerase. Looking at FIG. 3B, a close-up view of the active site from the overlay of the 3 ktq (closed) and 1 tau (open) conformations of Taq polymerase is shown. The large displacements between the open and closed conformations are evident. Looking at FIG. 3C, a close-up view of a molecular surface representation of 3 ktq (in the absence of DNA and ddCTP). The molecular surface is colored in two areas, blue for Asp655 and green for Leu657. In this representation, it is evident that Leu657 is in close proximity to another part of the protein, because the green part of the molecular surface, in the thumb domain, is “connected” to a part of the fingers domain. This view shows this region of the polymerase looking into the palm of the hand with fingers to the right and thumb to the left.
  • Mutagenesis and Sequencing of Polymerase Variants
  • The gene encoding Taq DNA polymerase was obtained and will be expressed in pTTQ 18 in E. coli strain DH1. See, e.g., Engelke et al., 1990. The inventors have identified candidate amino acids for mutagenesis including the amino acids in Tables I-IV, the refined lists or mixtures or combinations thereof. The inventors using standard molecular methods well-known in the art introduced a cysteine codon, individually, at each of target amino acid sites. See, e.g., Sambrook et al., 1989 and Allen et al., 1998. DNA is purified from isolated colonies expressing the mutant polymerase, sequenced using dye-terminator fluorescent chemistry, detected on an ABI PRISM 377 Automated Sequencer, and analyzed using Sequencher™ available from GeneCodes, Inc.
  • Expression and Purification of Enzyme Variants
  • The inventors have demonstrated that the Taq polymerase is capable of incorporating γ-tagged dNTPs to synthesize extended DNA sequences. The next step involves the construction of mutants capable of carrying a tag designed to interact with the tags on the dNTPS and optimization of the polymerase for single-molecule sequencing. The mutants are constructed using standard site specific mutagenesis as described above and in the experimental section. The constructs are then inserted into and expressed in E. coli. Mutant Taq polymerase is then obtained after sufficient E. coli is grown for subsequence polymerase isolation and purification.
  • Although E. coli can be grown to optical densities exceeding 100 by computer-controlled feedback-based supply of non-fermentation substrates, the resulting three kg of E. coli cell paste will be excessive during polymerase optimization. Of course, when optimized polymerases construct are prepared, then this large scale production will be used. During the development of optimized polymerases, the mutants are derived from E. coli cell masses grown in 10 L well-oxygenated batch cultures using a rich medium available from Amgen. For fast polymerase mutant screening, the mutants are prepared by growing E. coli in 2 L baffled shake glasses. Cell paste are then harvested using a 6 L preparative centrifuge, lysed by French press, and cleared of cell debris by centrifugation. To reduce interference from E. coli nucleic acid sequences, it is preferably to also remove other nucleic acids. Removal is achieved using either nucleases (with subsequent heat denaturation of the nuclease) or, preferably using a variation of the compaction agent-based nucleic acid precipitation protocol as described in Murphy et al., Nature Biotechnology 17, 822, 1999.
  • Because the thermal stability of Taq polymerase is considerably greater than typical E. coli proteins, purification of Taq polymerase or its mutants from contaminating Taq polymerase proteins is achieved by a simple heat treatment of the crude polymerase at 75° C. for 60 minutes, which reduces E. coli protein contamination by approximately 100-fold. This reduction in E. coli protein contamination combined with the high initial expression level, produces nearly pure Taq polymerase or its mutants in a convenient initial step; provide, of course, that the mutant polymerase retains the thermal stability of the native polymerase.
  • For routine sequencing and PCR screening, further limited purification is generally required. A single anion-exchange step, typically on Q Sepharose at pH 8.0, is generally sufficient to produce a product pure enough to these tests. Preferably, a second purification step will also be performed to insure that contamination does not cloud the results of subsequent testing. The second purification step involves SDS-PAGE and CD-monitored melting experiments.
  • Selection of site in dNTP to accept fluorescent tag
  • Molecular docking simulations were carried out to predict the docked orientation of the natural and fluorescently labeled dNTPs using the AutoDock computer program (Morris et al., 1998; Soares et al., 1999). Conformational flexibility is permitted during the docking simulations making use of an efficient Lamarckian Genetic algorithm implemented in the AutoDock program. A subset of protein side chains is also allowed to move to accommodate the DNTP as it docks. The best docked configurations is then energy minimized in the presence of a solvent environment. Experimental data are available which identify amino acids in the polymerase active site that are involved in catalysis and in contact with the template/primer DNA strands or the dNTP to be incorporated. The computer-aided chemical modeling such as docking studies can be used identify and support sites in the dNTP that can be labeled and to predict the FRET efficiency of dNTPs carrying a specific label at a specific site.
  • In general, the dNTPs are tagged either by reacting a dNTP with a desired tag or by reacting a precursor such as the pyrophosphate group or the base with a desired tag and then completing the synthesis of the dNTP.
  • Chemical Modification of Nucleotides for DNA Polymerase Reactions
  • The inventors have developed syntheses for modifying fluorophore and fluorescence energy transfer compounds to have distinct optical properties for differential signal detection, for nucleotide/nucleoside synthons for incorporation of modifications on base, sugar or phosphate backbone positions, and for producing complementary sets of four deoxynucleotide triphosphates (dNTPs) containing substituents on nucleobases, sugar or phosphate backbone.
  • Synthesis of γ-Phosphate Modified dNTPs
  • The inventors have found that the native Taq polymerase is capable of polymerizing phosphate-modified dNTPs or ddNTPs. Again, tagging the dNPTs or ddNTPs at the beta and/or gamma phosphate groups is a preferred because the replicated DNA contains no unnatural bases, polymerase activity is not significantly adversely affected and long DNA strands are produced. The inventors have synthesized γ-ANS-phosphate dNTPs, where the ANS is attached to the phosphate through a phosphamide bond. Although these tagged dNTPs are readily incorporated by the native Taq polymerase and by HIV reverse transcriptase, ANS is only one of a wide range of tags that can be attached through either the β and/or γ phosphate groups.
  • The present invention uses tagged dNTPs or ddNTPs in combination with polymerase for signal detection. The dNTPs are modified at phosphate positions (alpha, beta and/or gamma) and/or other positions of nucleotides through a covalent bond or affinity association. The tags are designed to be removed from the base before the next monomer is added to the sequence. One method for removing the tag is to place the tag on the gamma and/or beta phosphates. The tag is removed as pyrophosphate dissociates from the growing DNA sequence. Another method is to attach the tag to a position of on the monomer through a cleavable bond. The tag is then removed after incorporation and before the next monomer incorporation cleaving the cleavable bond using light, a chemical bond cleaving reagent in the polymerization medium, and/or heat.
  • One generalized synthetic routine to synthesizing other γ-tagged dNTPs is given below:
  • Figure US20110059436A1-20110310-C00001
  • where FR is a fluorescent tag, L is a linker group, X is either H or a counterion depending on the pH of the reaction medium, Z is a group capable of reaction with the hydroxyl group of the pyrophosphate and Z′ is group after reaction with the dNMP. Preferably, Z is Cl, Br, I, OH, SH, NH2, NHR, CO2H, CO2R, SiOH, SiOR, GeOH, GeOR, or similar reactive functional groups, where R is an alkyl, aryl, aralkyl, alkaryl, halogenated analogs thereof or hetero atom analogs thereof and Z′ is O, NH, NR, CO2, SiO, GeO, where R is an alkyl, aryl, aralkyl, alkaryl, halogenated analogs thereof or hetero atom analogs thereof.
  • The synthesis involves reacting Z terminated fluorescent tag, FR-L-Z with a pyrophosphate group, P2O6X3H, in DCC and dichloromethane to produce a fluorescent tagged pyrophosphate. After the fluorescent tagged pyrophosphate is prepared, it is reacted with a morpholine terminated dNMP in acidic THF to produce a dNTP having a fluorescent tag on its γ-phosphate. Because the final reaction bears a fluorescent tag and is larger than starting materials, separation from unmodified starting material and tagged pyrophosphate is straight forward.
  • A generalized synthesis of a FR-L group is shown below:
  • Figure US20110059436A1-20110310-C00002
  • Fluorescein (FR) is first reacted with isobutyryl anhydride in pyridine in the presence of diisopropyl amine to produce a fluorescein having both ring hydroxy groups protected for subsequent linker attachment. The hydroxy protected fluorescein is then reacted with N-hydroxylsuccinimide in DCC and dichloromethane to produce followed by the addition of 1-hydroxy-6-amino hexane to produce an hydroxy terminated FR-L group. This group can then be reacted either with pyrophosphate to tag the dNTPs at their γ-phosphate group or to tag amino acids. See, e.g., Ward et al., 1987; Engelhardt et al., 1993; Little et al., 2000; Hobbs, 1991.
  • By using different fluorescent tags on each dNTP, tags can be designed so that each tag emits a distinguishable emission spectrum. The emission spectra can be distinguished by producing tags with non-overlapping emission frequencies—multicolor—or each tag can have a non-overlapping spectral feature such a unique emission band, a unique absorption band and/or a unique intensity feature. Systems that use a distinguishable tag on each dNTP improves confidence values associated with the base calling algorithm.
  • The synthetic scheme shown above for fluorescein is adaptable to other dyes as well such as tetrachlorofluorescein (JOE) or N,N,N′,N′-tetramethyl-6-carboxyrhodamine (TAMRA). Typically, the gamma phosphate tagged reactions are carried out in basic aqueous solutions and a carbodiimide, such as DEC. Other fluorophore molecules and dNTPs can be similarly modified.
  • Synthesis of dNTP Tagged at on the Base
  • Although tagging the dNTPs at the beta and/or gamma phosphate is preferred, the dNTPs can also be tagged on the base and/or sugar moieties while maintaining their polymerase reaction activity. The sites for modifications are preferably selected to not interfere with Watson-Crick base pairing. A generalized scheme for base modification is shown below:
  • Figure US20110059436A1-20110310-C00003
  • Polymerase Activity Assays Using a Fluorescently-Tagged Enzyme
  • The activities of polymerase variants are monitored throughout polymerase development. Polymerase activity is assayed after a candidate amino acid is mutated to cysteine and after fluorescent tagging of the cysteine. The assay used to monitor the ability of the native Taq polymerase to incorporate fluorescently-tagged dNTPs is also used to screen polymerase variants. Since the mutant Taq polymerases have altered amino acid sequences, the assays provide mutant characterization data such as thermostability, fidelity, polymerization rate, affinity for modified versus natural bases.
  • Mutant Taq polymerase activity assays are carried out under conditions similar to those used to examine the incorporation of fluorescently-tagged dNTPs into DNA polymers by the native Taq polymerase. To examine mutant Taq polymerase activity, the purified mutant Taq polymerase is incubated in polymerase reaction buffer with a 5′-32P end-labeled primer/single-stranded template duplex, and appropriate tagged dNTP(s). The polymerase's ability to incorporate a fluorescently-tagged dNTP is monitored by assaying the relative amount of fluorescence associated with the extended primer on either an AB1377 DNA Sequencer (for fluorescently tagged bases), a Fuji BAS 1000 phosphorimaging system, or other appropriate or similar detectors or detection systems. This assay is used to confirm that the mutant polymerase incorporates tagged dNTβ and to confirm that fluorescent signatures are obtained during base incorporation. These assays use an end-labeled primer, the fluorescently-tagged dNTβ and the appropriate base beyond the fluorescent tag. The products are then size separated and analyzed for extension. Reactions are either performed under constant temperature reaction conditions or thermocycled, as necessary.
  • Primer Extension Assays
  • The ability of Taq DNA polymerase to incorporate a γ-phosphate dNTP variant is assayed using conditions similar to those developed to examine single base incorporation by a fluorescently-tagged DNA polymerase. See, e.g., Furey et al., 1998. These experiments demonstrate that polymerases bearing a fluorescent tag do not a priori have reduced polymerization activity. The inventors have demonstrated that the native Taq polymerase incorporates γ-tagged dNTP, singly or collectively to produce long DNA chains.
  • To examine polymerase activity, the polymerase is incubated in polymerase reaction buffer such as Taq DNA polymerase buffer available from Promega Corporation of Madison, Wis. with either a 5′-32P or a fluorescently end-labeled primer (TOP)/single-stranded template (BOT-‘X’) duplex, and appropriate dNTP(s) as shown in Table V. Reactions are carried out either at constant temperature or thermocycled, as desired or as is necessary. Reaction products are then size-separated and quantified using a phosphorimaging or fluorescent detection system. The relative efficiency of incorporation for each tagged dNTP is determined by comparison with its natural counterpart.
  • TABLE V
    Primer Strand:
    TOP 5′ GGT ACT AAG CGG CCG CAT G 3′ (SEQ ID NO: 1)
    Template Strands:
    BOT-T 3′ CCA TGA TTC GCC GGC GTA CTC 5′ (SEQ ID NO: 2)
    BOT-C 3′ CCA TGA TTC GCC GGC GTA CCC 5′ (SEQ ID NO: 3)
    BOT-G 3′ CCA TGA TTC GCC GGC GTA CGC 5′ (SEQ ID NO: 4)
    BOT-A 3′ CCA TGA TTC GCC GGC GTA CAC 5′ (SEQ ID NO: 5)
    BOT-3T 3′ CCA TGA TTC GCC GGC GTA CTT TC 5′ (SEQ ID NO: 6)
    BOT-Sau 3′ CCA TGA TTC GCC GGC GTA CCT AG 5′ (SEQ ID NO: 7)
  • In Table V, ‘TOP’ represents the primer strand of an assay DNA duplex. Variants of the template strand are represented by ‘BOT’. The relevant feature of the DNA template is indicated after the hyphen. For example, BOT-T, BOT-C, BOT-G, BOT-A are used to monitor polymerase incorporation efficiency and fidelity for either nucleotides or nucleotide variants of dA, dG, dC, and dT, respectively.
  • Preliminary assays are performed prior to exhaustive purification of the tagged dNTP to ensure that the polymerase is not inhibited by a chemical that co-purifies with the tagged dNTP, using the ‘BOT-Sau’ template. The ‘BOT-Sau’ template was designed to monitor incorporation of natural dGTP prior to tagged dATP (i.e., a positive control for polymerase activity). More extensive purification is then performed for promising tagged nucleotides. Similarly, experiments are carried out to determine whether the polymerase continues extension following incorporation of the tagged dNTPs, individually or collectively, using the same end-labeled ‘TOP’ primer, the appropriate ‘BOT’ primer, the fluorescently-tagged dNTP, and the appropriate base 3′ of the tagged nucleotide. The products are then size-separated and analyzed to determine the relative extension efficiency.
  • Assay Fidelity of γ-phosphate Tagged Nucleotide Incorporation
  • The Taq DNA polymerase lacks 3′ to 5′ exonuclease activity (proofreading activity). If the polymerase used in single-molecule DNA sequencing possessed a 3′ to 5′ exonuclease activity, the polymerase would be capable of adding another base to replace one that would be removed by the proofreading activity. This newly added base would produce a signature fluorescent signal evidencing the incorporation of an additional base in the template, resulting in a misidentified DNA sequence, a situation that would render the single-molecule sequencing systems of this invention problematic.
  • If the error rate for the incorporation of modified dNTPs exceeds a threshold level of about 1 error in 100, the sequencing reactions are preferably run in parallel, with the optimal number required to produce sequence information with a high degree of confidence for each base call determined by the error rate. Larger error rates require more parallel run, while smaller error rates require fewer parallel runs. In fact, if the error rate is low enough, generally less than 1 error in 1,000, preferably 1 error in 5,000 and particularly 1 error in 10,000 incorporated base, then no parallel runs are required. Insertions or deletions are, potentially, more serious types of errors and warrant a minimal redundancy of 3 repeats per sample. If 2 reactions were run, one could not be certain which was correct. Thus, 3 reactions are needed for the high quality data produced by this system.
  • The BOT-variant templates are used to characterize the accuracy at which each γ-tagged dNTP is incorporated by an engineered polymerase as set forth in Table V. Oligonucleotides serve as DNA templates, and each differing only in the identity of the first base incorporated. Experiments using these templates are used to examine the relative incorporation efficiency of each base and the ability of the polymerase to discriminate between the tagged dNTPs. Initially, experiments with polymerase variants are carried out using relatively simple-sequence, single-stranded DNA templates. A wide array of sequence-characterized templates is available from the University of Houston in Dr. Hardin's laboratory, including a resource of over 300 purified templates. For example, one series of templates contains variable length polyA or polyT sequences. Additional defined-sequence templates are constructed as necessary, facilitating the development of the base-calling algorithms.
  • Relative Fluorescence Intensity Assays
  • Direct detection of polymerase action on the tagged dNTP is obtained by solution fluorescence measurements, using SPEX 212 instrument or similar instrument. This instrument was used to successfully detect fluorescent signals from ANS tagged γ-phosphate dNTPs, being incorporated by Taq polymerase at nanomolar concentration levels. The SPEX 212 instrument includes a 450 watt xenon arc source, dual emission and dual excitation monochromators, cooled PMT (recently upgraded to simultaneous T-format anisotropy data collection), and a Hi-Tech stopped-flow accessory. This instrument is capable of detecting an increase in fluorescence intensity and/or change in absorption spectra upon liberation of the tagged pyrophosphate from ANS tagged γ-phosphate dNTPs, as was verified for ANS-pyrophosphate released by Taq and RNA polymerase and venom phosphodiesterase.
  • Experiments have been and are being performed by incubating γ-phosphate tagged dATP or TTP (Control: non-modified dATP and TTP) in an appropriate buffer (e.g., buffers available from Promega Corporation) in the presence of polymerase (Control: no enzyme) and DNA primer/template [poly(dA). poly(dT)] (Control: no primer/template DNA). When the polymerase incorporates a tagged dNTP, changes in fluorescence intensity and/or frequency, absorption and/or emission spectra, and DNA polymer concentration are detected. Changes in these measurables as a function of time and/or temperature for experimental versus control cuvettes allows for unambiguous determination of whether a polymerase is incorporating the γ-phosphate tagged dNTP. Excitation and fluorescence emission can be optimized for each tagged dNTP based on changes in these measurables.
  • Development of a Single-Molecule Detection System
  • The detection of fluorescence from single molecules is preferably carried out using microscopy. Confocal-scanning microscopy can be used in this application, but a non-scanning approach is preferred. A microscope useful for detecting fluorescent signals due to polymerase activity include any type of microscope, with oil-immersion type microscopes being preferred. The microscopes are preferably located in an environment in which vibration and temperature variations are controlled, and fitted with a highly-sensitive digital camera. While many different cameras can be to record the fluorescent signals, the preferred cameras are intensified CCD type cameras such as the iPentaMax from Princeton Instruments.
  • The method of detection involves illuminating the samples at wavelengths sufficient to induce fluorescence of the tags, preferably in an internal-reflection format. If the fluorescent tags are a donor-acceptor pair, then the excitation frequency must be sufficient to excite the donor. Although any type of light source can be used, the preferred light source is a laser. It will often be advantageous to image the same sample in multiple fluorescence emission wavelengths, either in rapid succession or simultaneously. For simultaneous multi-color imaging, an image splitter is preferred to allow the same CCD to collect all of the color images simultaneously. Alternatively, multiple cameras can be used, each viewing the sample through emission optical filters of different wavelength specificity.
  • Tag detection in practice, of course, depends upon many variables including the specific tag used as well electrical, fluorescent, chemical, physical, electrochemical, mass isotope, or other properties. Single-molecule fluorescence imaging is obtainable employing a research-grade Nikon Diaphot TMD inverted epifluorescence microscope, upgraded with laser illumination and a more-sensitive camera. Moreover, single-molecule technology is a well-developed and commercially available technology. See, e.g., Peck et al., 1989; Ambrose et al., 1994; Goodwin et al., 1997; Brouwer et al., 1999; Castro and Williams, 1997; Davis et al., 1991; Davis et al., 1992; Goodwin et al., 1997; Keller et al., 1996; Michaelis et al., 2000; Orrit and Bernard, 1990; Orrit et al., 1994; Sauer et al., 1999; Unger et al., 1999; Zhuang et al., 2000.
  • The epifluorescence microscope can be retrofitted for evanescent-wave excitation using an argon ion laser at 488 nm. The inventors have previously used this illumination geometry in assays for nucleic acid hybridization studies. The existing setup has also been upgraded by replacement of the current CCD camera with a 12-bit 512.times.512 pixel Princeton Instruments I-PentaMAX generation IV intensified CCD camera, which has been used successfully in a variety of similar single-molecule applications. This camera achieves a quantum efficiency of over 45% in the entire range of emission wavelengths of the dyes to be used, and considerably beyond this range. The vertical alignment of their existing microscope tends to minimize vibration problems, and the instrument is currently mounted on an anti-vibration table.
  • A preferred high-sensitivity imaging system is based on an Olympus IX70-58F inverted epifluorescence microscope. The system incorporates low-background components and enables capture of single molecule fluorescence images at rates of greater than 80 frames per second with quantum efficiency between 60-70% in the range of emission wavelengths of the fluorescently active tags.
  • In imaging the fluorescence of multiple single molecules, it is preferable to minimize the occurrence of multiple fluorescent emitters within a data collection channel such as a single pixel or pixel-bin of the viewing field of the CCD or other digital imaging system. A finite number of data collection channels such as pixels are available in any given digital imaging apparatus. Randomly-spaced, densely-positioned fluorescent emitters generally produce an increased fraction of pixels or pixel bins that are multiply-occupied and problematic in data analysis. As the density of emitters in the viewing field increases so does the number of problematic data channels. While multiple occupancy of distinguishable data collection regions within the viewing field can be reduced by reducing the concentration of emitters in the viewing field, this decrease in concentration of emitters increases the fraction of data collection channels or pixels that see no emitter at all, therefore, leading to inefficient data collection.
  • A preferred method for increasing and/or maximizing the data collection efficiency involves controlling the spacing between emitters (tagged polymerase molecules). This spacing is achieved in a number of ways. First, the polymerases can be immobilized on a substrate so that only a single polymerase is localized within each data collection channel or pixel region within the viewing field of the imaging system. The immobilization is accomplished by anchoring a capture agent or linking group chemically attached to the substrate. Capture or linking agents can be spaced to useful distances by choosing inherently large capture agents, by conjugating them with or bonding them to molecules which enhance their steric bulk or electrostatic repulsion bulk, or by immobilizing under conditions chosen to maximize repulsion between polymerizing molecular assembly (e.g., low ionic strength to maximize electrostatic repulsion).
  • Alternatively, the polymerase can be associated with associated proteins that increase the steric bulk of the polymerase or the electrostatic repulsion bulk of the polymerizing system so that each polymerizing molecular assembly cannot approach any closer than a distance greater than the data channel resolution size of the imaging system.
  • Polymerase Activity Assays Using a Single-Molecule Detection System
  • These assays are performed essentially as described in for polymerase activity assays described herein. As stated above, the primary difference between assaying polymerase activity for screening purposes involves the immobilization of some part of the polymerizing assembly such as the polymerase, target DNA or a primer associated protein to a solid support to enable viewing of individual replication events. A variety of immobilization options are available, including, without limitation, covalent and/or non-covalent attachment of one of the molecular assemblies on a surface such as an organic surface, an inorganic surface, in or on a nanotubes or other similar nano-structures and/or in or on porous matrices. These immobilization techniques are designed to provide specific areas for detection of the detectable property such as fluorescent, NMR, or the like, where the spacing is sufficient to decrease or minimize data collection channels having multiple emitters. Thus, a preferred data collection method for single-molecule sequencing is to ensure that the fluorescently tagged polymerases are spaced apart within the viewing field of the imagining apparatus so that each data collection channel sees the activity of only a single polymerase.
  • Analysis of Fluorescent Signals from Single-Molecule Sequencing System
  • The raw data generated by the detector represents between one to four time-dependent fluorescence data streams comprising wavelengths and intensities: one data stream for each fluorescently labeled base being monitored. Assignment of base identities and reliabilities are calculated using the PHRED computer program. If needed, the inventors will write computer programs to interpret the data streams having partial and overlapping data. In such cases, multiple experiments are run so that confidence limits are assigned to each base identity according to the variation in the reliability indices and the difficulties associated with assembling stretches of sequence from fragments. The reliability indices represent the goodness of the fit between the observed wavelengths and intensities of fluorescence compared with ideal values. The result of the signal analyses is a linear DNA sequence with associated probabilities of certainty. Additionally, when required, the data is stored in a database for dynamic querying for identification and comparison purposes. A search term (sequence) of 6-10, 11-16, 17-20, 21-30 bases can be compared against reference sequences to quickly identify perfectly matched sequences or those sharing a user defined level of identity. Multiple experiments are run so that confidence limits can be assigned to each base identity according to the variation in the reliability indices and the difficulties associated with assembling stretches of sequence from fragments. The reliability indices represent the goodness of the fit between the observed wavelengths and intensities of fluorescence compared with the ideal values. The result of the signal analyses is a linear DNA sequence with associated probabilities of certainty.
  • Informatics: Analysis of Fluorescent Signals from the Single-Molecule Detection System
  • Data collection allows data to be assembled from partial information to obtain sequence information from multiple polymerase molecules in order to determine the overall sequence of the template or target molecule. An important driving force for convolving together results obtained with multiple single-molecules is the impossibility of obtaining data from a single molecule over an indefinite period of time. At a typical dye photobleaching efficiency of 2*10−5, a typical dye molecule is expected to undergo 50,000 excitation/emission cycles before permanent photobleaching. Data collection from a given molecule may also be interrupted by intersystem crossing to an optically inactive (on the time scales of interest) triplet state. Even with precautions against photobleaching, therefore, data obtained from any given molecule is necessarily fragmentary for template sequences of substantial length, and these subsequences are co-processed in order to derive the overall sequence of a target DNA molecule.
  • Additionally, in certain instances it is useful to perform reactions with reference controls, similar to microarray assays. Comparison of signal(s) between the reference sequence and the test sample are used to identify differences and similarities in sequences or sequence composition. Such reactions can be used for fast screening of DNA polymers to determine degrees of homololgy between the polymers, to determine polymorphisms in DNA polymers, or to identity pathogens.
  • EXAMPLES Cloning and Mutagenesis of Taq Polymerase
  • Cloning
  • Bacteriophage lambda host strain Charon 35 harboring the full-length of the Thermus aquaticus gene encoding DNA polymerase I (Taq pol I) was obtained from the AMERICAN TYPE CULTURE COLLECTION (ATCC; Manassas, Va.). Taq pol I was amplified directly from the lysate of the infected E. coli host using the following DNA oligonucleotide primers:
  • Taq Pol I forward
    (SEQ ID NO: 8)
    5′-gcgaattcat gagggggatg ctgcccctct ttgagccc-3′
    Taq Pol I reverse
    (SEQ ID NO: 9)
    5′-gcgaattcac cctccttggc ggagcgccag tcctccc-3′
  • The underlined segment of each synthetic DNA oligonucleotide represents engineered EcoRI restriction sites immediately preceding and following the Taq pol I gene. PCR amplification using the reverse primer described above and the following forward primer created an additional construct with an N-terminal deletion of the gene:
  • Tag Pal I_A293_trunk
    (SEQ ID NO: 10)
    5′-aatccatggg ccctggagga ggccccctggcccccgc-3′
  • The underlined segment corresponds to an engineered NcoI restriction site with the first codon encoding for an alanine (ATG start representing an expression vector following the ribosome binding site). Ideally, the full-length and truncated constructs of the Taq pol I gene is ligated to a single EcoRI site (full-length) and in an Ncoo/EcoRI digested pRSET-b expression vector. E. coli strain JM109 is used as host for all in vivo manipulation of the engineered vectors.
  • Mutagenesis
  • Once a suitable construct is generated, individual cysteine mutations are introduced at preferred amino acid positions including positions 513-518, 643, 647, 649 and 653-661 of the native Taq polymerase having the following amino acid sequence (SEQ ID NO:11):
  • (SEQ ID NO. 11)
    Met Arg Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu
    1               5                   10                  15
    Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys Gly
                20                  25                  30
    Leu Thr Thr Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly Phe Ala
            35                  40                  45
    Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val Ile Val
        50                  55                  60
    Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly Gly
    65                  70                  75                  80
    Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gln Leu
                    85                  90                  95
    Ala Leu Ile Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu Glu
                100                 105                 110
    Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys Lys
            115                 120                 125
    Ala Glu Lys Glu Gly Tyr Glu Val Arg Ile Leu Thr Ala Asp Lys Asp
        130                 135                 140
    Leu Tyr Gln Leu Leu Ser Asp Arg Ile His Val Leu His Pro Glu Gly
    145                 150                 155                 160
    Tyr Leu Ile Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro
                    165                 170                 175
    Asp Gln Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp Asn
                180                 185                 190
    Leu Pro Gly Val Lys Gly Ile Gly Glu Lys Thr Ala Arg Lys Leu Leu
            195                 200                 205
    Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu
        210                 215                 220
    Lys Pro Ala Ile Arg Glu Lys Ile Leu Ala His Met Asp Asp Leu Lys
    225                 230                 235                 240
    Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu Val
                    245                 250                 255
    Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala Phe
                260                 265                 270
    Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu
            275                 280                 285
    Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly
        290                 295                 300
    Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala Asp
    305                 310                 315                 320
    Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala Pro
                    325                 330                 335
    Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu Leu
                340                 345                 350
    Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu Pro
            355                 360                 365
    Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn
        370                 375                 380
    Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu
    385                 390                 395                 400
    Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn Leu
                    405                 410                 415
    Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg Glu
                420                 425                 430
    Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr Gly
            435                 440                 445
    Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val Ala
        450                 455                 460
    Glu Glu Ile Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly His
    465                 470                 475                 480
    Pro Phe Asn Leu Asn Ser Arg Asp Gln Leu Glu Arg Val Leu Phe Asp
                    485                 490                 495
    Glu Leu Gly Leu Pro Ala Ile Gly Lys Thr Glu Lys Thr Gly Lys Arg
                500                 505                 510
    Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro Ile
            515                 520                 525
    Val Glu Lys Ile Leu Gln Tyr Arg Glu Leu Thr Lys Leu Lys Ser Thr
        530                 535                 540
    Tyr Ile Asp Pro Leu Pro Asp Leu Ile His Pro Arg Thr Gly Arg Leu
    545                 550                 555                 560
    His Thr Arg Phe Asn Gln Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser
                    565                 570                 575
    Ser Asp Pro Asn Leu Gln Asn Ile Pro Val Arg Thr Pro Leu Gly Gln
                580                 585                 590
    Arg Ile Arg Arg Ala Phe Ile Ala Glu Glu Gly Trp Leu Leu Val Ala
            595                 600                 605
    Leu Asp Tyr Ser Gln Ile Glu Leu Arg Val Leu Ala His Leu Ser Gly
        610                 615                 620
    Asp Glu Asn Leu Ile Arg Val Phe Gln Glu Gly Arg Asp Ile His Thr
    625                 630                 635                 640
    Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp Pro
                    645                 650                 655
    Leu Met Arg Arg Ala Ala Lys Thr Ile Asn Phe Gly Val Leu Tyr Gly
                660                 665                 670
    Met Ser Ala His Arg Leu Ser Gln Glu Leu Ala Ile Pro Tyr Glu Glu
            675                 680                 685
    Ala Gln Ala Phe Ile Glu Arg Tyr Phe Gln Ser Phe Pro Lys Val Arg
        690                 695                 700
    Ala Trp Ile Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val
    705                 710                 715                 720
    Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala Arg
                    725                 730                 735
    Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro
                740                 745                 750
    Val Gln Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu
            755                 760                 765
    Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gln Val His
        770                 775                 780
    Asp Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val Ala
    785                 790                 795                 800
    Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val Pro
                    805                 810                 815
    Leu Glu Val Glu Val Gly Ile Gly Glu Asp Trp Leu Ser Ala Lys Glu
                820                 825                 830

    The following amino acid residues correspond to the amino acids between amino acid 643 and 661, where xxx represents intervening amino acid residues in the native polymerase:
  • (SEQ ID NO: 12)
    643-Ala xxx xxx xxx Phe xxx Val xxx xxx Glu Ala
    Val Asp Pro Leu Met Arg Arg Ala-661
  • Overlapping primers are used to introduce point mutations into the native gene by PCR based mutagenesis (using Pfu DNA polymerase).
  • Complementary forward and reverse primers each contain a codon that encodes the desired mutated amino acid residue. PCR using these primers results in a nicked, non-methylated, double-stranded plasmid containing the desired mutation. To remove the template DNA, the entire PCR product is treated with DpnI restriction enzyme (cuts at methylated guanosines in the sequence GATC). Following digestion of the template plasmid, the mutated plasmid is transformed and ligation occurs in vivo.
  • The following synthetic DNA oligonucleotide primers are used for mutagenesis as described below, where the letters designated in lowercase have been modified to yield the desired Cysteine substitution at the indicated position. Mutants are then screened via automated sequencing.
  • ALANINE 643 TO CYSTEINE REPLACEMENT
    Taq Pol I_Ala643Cys_fwd
    (SEQ ID NO: 13)
    5′-C CAC ACG GAG ACC tgC AGC TGG ATG TTC GGC G-3′
    Taq PolI_Ala643Cys_rev
    (SEQ ID NO: 14)
    5′-C GCC GAA CAT CCA CGA Gca GGT CTC CGT GTG G-3′
    PHENYLALANINE 647 TO CYSTEINE REPLACEMENT
    Taq Pol I_Phe647Cys_fwd
    (SEQ ID NO: 15)
    5′-CC GCC AGC TGG ATG TgC GGC GTC CCC CGG GAG
    GCC-3′
    Tag Pol I_Phe647Cys_rev
    (SEQ ID NO: 16)
    5′-GGC CTC CCG GGG GAC GCC GcA CAT CCA CGT GGC
    GG-3′
    VALINE 649 TO CYSTEINE REPLACEMENT
    Taq Pol I_Val649Cys_fwd
    (SEQ ID NO: 17)
    5′-GCC AGC TGG ATG TTC GGC tgC CCC CGG GAG GCC
    GTG G-3′
    Taq Pol I_Val649Cys_rev
    (SEQ ID NO: 18)
    5′-C CAC GGC CTC CCG GGG Gca GCC GAA CAT CCA 
    GCT GGC-3′
    GLUTAMIC ACID 652 TO CYSTEINE REPLACEMENT
    Taq Pol I_Glu652Cys_fwd
    (SEQ ID NO: 19)
    5′-GGC GTC CCC CGG tgc GCC GTG GAC CCC CTG ATG 
    CGC-3′
    Taq PolI_Glu652Cys_rev
    (SEQ ID NO: 20)
    5′-GCG CAT CAG GGG GTC CAC GGC gca CCG GGG GAC 
    GCC-3′
    ALANINE 653 TO CYSTEINE REPLACEMENT
    Taq Pol I_Ala653Cys_fwd
    (SEQ ID NO: 21)
    5′-GGC GTC CCC CGG GAG tgC GTG GAC CCC CTG ATG 
    CGC-3′
    Taq Pol I_Ala653Cys_rev
    (SEQ ID NO: 22)
    5′-GCG CAT CAG GGG GTC CAC Gca CTC CCG GGG GAC 
    GCC-3′
    VALINE 654 TO CYSTEINE REPLACEMENT
    Taq Pol I_Val6S4Cys_fwd
    (SEQ ID NO: 23)
    5′-GTC CCC CGG GAG GCC tgt GAC CCC CTG ATG CGC-3′
    Taq PolI_Val654Cys_rev
    (SEQ ID NO: 24)
    5′-GCG CAT CAG GGG GTC aca GGC CTC CCG GGG GAC-3′
    ASPARTIC ACID 655 TO CYSTEINE REPLACEMENT
    Taq Pol I_D655C_fwd
    (SEQ ID NO: 25)
    5′-CCC CGG GAG GCC GTG tgC CCC CTG ATG CGC CGG-3′
    Taq Pol I_D655C_rev
    (SEQ ID NO: 26)
    5′-CCG GCG CAT CAG GGG Gca CAC GGC CTC CCG GGG-3′
    PROLINE 656 TO CYSTEINE REPLACEMENT
    Taq Pol I_Pro656Cys_fwd
    (SEQ ID NO: 27)
    5′-CGG GAG GCC GTG GAC tgC CTG ATG CGC CGG GCG-3′
    Taq Pol I_Pro656Cys_rev
    (SEQ ID NO: 28)
    5′-CGC CCG GCG CAT CAG Gca GTC CAC GGC CTC CCG-3′
    LEUCINE 657 TO CYSTEINE REPLACEMENT
    Taq Pol I_Leu657Cys_fwd
    (SEQ ID NO: 29)
    5′-GCC GTG GAC CCC tgc ATG CGC CGG GCG GCC-3′
    Taq Pol I_Leu657Cys_rev
    (SEQ ID NO: 30)
    5′-GGC CGC CCG GCG CAT gca GGG GTC CAC GGC-3′
    METHIONINE 658 TO CYSTEINE REPLACEMENT
    Taq Pol I_Met658Cys_fwd
    (SEQ ID NO: 31)
    5′-GCC GTG GAC CCC CTG tgt CGC CGG GCG GCC-3′
    Taq Pol I_Met658Cys_rev
    (SEQ ID NO: 32)
    5′-GGC CGC CCG GCG aca CAG GGG GTC CAC GGC-3′
    ARGININE 659 TO CYSTEINE REPLACEMENT
    Taq Pol I_Arg659Cys_fwd
    (SEQ ID NO: 33)
    5′-GCC GTG GAC CCC CTG ATG tGC CGG GCG GCC AAG 
    ACC-3′
    Taq Pol I_Arg659Cys_rev
    (SEQ ID NO: 34)
    5′-GGT CTT GGC CGC CCG GCa CAT CAG GGG GTC CAC 
    GGC-3′
    ARGININE 660 TO CYSTEINE REPLACEMENT
    Taq Pol I_Arg660Cys_fwd
    (SEQ ID NO: 35)
    5′-GAC CCC CTG ATG CGC tGc GCG GCC AAG ACC ATC-3′
    Taq Pol I_Arg660Cys_rev
    (SEQ ID NO: 36)
    5′-GAT GGT CTT GGC CGC gCa GCG CAT CAG GGG GTC-3′
    ALANINE 661 TO CYSTEINE REPLACEMENT
    Taq Pol I_Ala661Cys_fwd
    (SEQ ID NO: 37)
    5′-CCC CTG ATG CGC CGG tgc GCC AAG ACC ATC AAC-3′
    Taq Pol I_Ala661Cys_rev
    (SEQ ID NO: 38)
    5′-GTT GAT GGT CTT GGC gca CCG GCG CAT CAG GGG-3′
  • The resulting mutant Taq polymerases are then reacted with a desired atomic or molecular tag to tag the cysteine in the mutant structure through the SH group of the cysteine residue and screened for native and/or tagged dNTP incorporation and incorporation efficiency. The mutant polymerases are also screened for fluorescent activity during base incorporation. Thus, the present invention also relates to mutant Taq polymerase having a cysteine residue added one or more of the sites selected from the group consisting of 513-518, 643, 647, 649 and 653-661. After cysteine replacement and verification of polymerase activity using the modified dNTPs, the mutant Taq polymerases are reacted with a tag through the SH group of the inserted cysteine residue. The resulting amino acid replacement for the positions 513-518, 643, 647, 649 and 653-661 are shown below:
  • (SEQ ID NO: 39)
    Cys Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp Pro Leu Met Arg Arg Ala
    643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661
    (SEQ ID NO: 40)
    Ala Ser Trp Met Cys Gly Val Pro Arg Glu Ala Val Asp Pro Leu Met Arg Arg Ala
    643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661
    (SEQ ID NO: 41)
    Ala Ser Trp Met Phe Gly Cys Pro Arg Glu Ala Val Asp Pro Leu Met Arg Arg Ala
    643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661
    (SEQ ID NO: 42)
    Ala Ser Trp Met Phe Gly Val Pro Arg Cys Ala Val Asp Pro Leu Met Arg Arg Ala
    643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661
    (SEQ ID NO: 43)
    Ala Ser Trp Met Phe Gly Val Pro Arg Glu Cys Val Asp Pro Leu Met Arg Arg Ala
    643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661
    (SEQ ID NO: 44)
    Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Cys Asp Pro Leu Met Arg Arg Ala
    643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661
    (SEQ ID NO: 45)
    Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Cys Pro Leu Met Arg Arg Ala
    643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661
    (SEQ ID NO: 46)
    Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp Cys Leu Met Arg Arg Ala
    643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661
    (SEQ ID NO: 47)
    Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp Pro Cys Met Arg Arg Ala
    643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661
    (SEQ ID NO: 48)
    Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp Pro Leu Cys Arg Arg Ala
    643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661
    (SEQ ID NO: 49)
    Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp Pro Leu Met Cys Arg Ala
    643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661
    (SEQ ID NO: 50)
    Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp Pro Leu Met Arg Cys Ala
    643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661
    (SEQ ID NO: 51)
    Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp Pro Leu Met Arg Arg Cys
    643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661
    (SEQ ID NO: 52)
    Cys Thr Ser Ala Ala Val
    513 514 515 516 517 518
    (SEQ ID NO: 53)
    Ser Cys Ser Ala Ala Val
    513 514 515 516 517 518
    (SEQ ID NO: 54)
    Ser Thr Cys Ala Ala Val
    513 514 515 516 517 518
    (SEQ ID NO: 55)
    Ser Thr Ser Cys Ala Val
    513 514 515 516 517 518
    (SEQ ID NO: 56)
    Ser Thr Ser Ala Cys Val
    513 514 515 516 517 518
    (SEQ ID NO: 57)
    Ser Thr Ser Ala Ala Cys
    513 514 515 516 517 518
  • Synthesis of Modified dNTPs
  • Synthesis of (γ-AmNS)dATP
  • Nucleotide analogs which contain fluorophore 1-aminonaphalene-5-sulfonate attached to the γ-phosphate bond were synthesized (J. Biol. Chem. 254, 12069-12073, 1979). dATP analog-(γ-AmNS) dATP was synthesized according to the procedures slightly altered from what was described by Yarbrough and co-workers for (γ-AmNS)ATP with some modifications.
  • This example illustrates the preparation of gamma ANS tagged dATP, shown graphically in FIG. 4.
  • 1-Aminonaphthalene-5-sulphonic acid (447 mg, 2 mmol, 40 eq., from Lancaster) was added to 10 mL of H2O, and the pH was adjusted to 5.8 with 1 N NaOH. The insoluble material was removed by syringe filter, yielding a solution which was essentially saturated for this pH value (˜0.18 to 0.2 M). 4 mL of 12.5 mM 5′ triphosphate-2′-deoxyadenosine disodium salt (0.05 mmol, 1 eq.) and 2 mL of 1 M 1-(3-dimethylaminopropyl-)-3-ethyl-carbodiimide hydrochloride (DEC, 2 mmol, 40 eq., from Lancaster) were added to a reaction vessel at 22° C. The reaction was initiated by adding 10 mL of the 1-aminonaphthalene-5-sulfonate solution and allowed to continue for 2.5 h. The pH was kept between 5.65-5.75 by periodic addition of 0.1 N HCl. After 2.5 h, the reaction was diluted to 50 mL and adjusted to a solution of 0.05 M triethylammonium bicarbonate buffer (TEAB, pH ˜7.5). The reaction product was chromatographed on a 50 mL DEAE-SEPHADEX® ion exchanger (A-25-120) column at low temperature that was equilibrated with ˜pH 7.5 1.0 M aqueous TEAB (100 mL), 1.0 M aqueous sodiumbicarbonate (1100 mL), and ˜pH 7.5, 0.05 Maqueous TEAB (100 mL). The column was eluted with a linear gradient of ˜pH 7.5 aqueous TEAB from 0.05 to 0.9 M. Approximately 10 mL fractions were collected. Absorbance and fluorescence profiles (UV 366 nm) of the fractions were obtained after appropriate dilution. The fluorescent fraction eluted at ˜0.7 M buffer after the peak of the unreacted DATβ and showed a brilliant blue fluorescence. The product-containing fractions were pooled, dried by lyophilizer and co-evaporated twice with H2O/ethanol (70/30). The residue was taken up in water and lyophilized. 31P NMR (1H decoupled, 600 MHz, D2O, Me3PO4 external reference, 293 K, pH 6.1) δ (ppm) −12.60, −14.10, −25.79. The reference compound dATP gave the following resonance peaks: 31P NMR (dATP, Na+) in D2O 293 K, δ (ppm)-11.53 (γ), −13.92 (α), −24.93 (β).
  • Using diode array UV detection HPLC, the fraction containing the desired product was easily identified by the distinct absorption of the ANS group at 366 nm. Additionally, 31P NMR spectra were recorded for the γ-phosphate tagged dATP and regular dATP in an aqueous solution. For each compound, three characteristic resonances were observed, confirming the triphosphate moiety in the γ-tagged dATP. The combined analyses—1H-NMR, HPLC, and UV spectra—provide supporting information for the formation of the correct compound.
  • The same synthetic procedure was used to prepare γ-ANS-phosphate modified dGTP, dTTP and dCTP.
  • γ-Phosphate-tagged dNTP Incorporation by Tag Polymerase
  • The following examples illustrate that commercially available Taq DNA polymerase efficiently incorporates the ANS-γ-phosphate dNTPs, the syntheses and characterization as described above.
  • The first example illustrates the incorporation of ANS-γ-phosphate DATP to produce extended DNA products from primer templates. The reactions were carried out in extension buffer and the resulting radiolabeled products were size separated on a 20% denaturing polyacrylamide gel. Data was collected using a phosphorimaging system. Referring now the FIG. 5, Lane 1 contains 5′ radiolabeled ‘TOP’ probe in extension buffer. Lane 2 contains Taq DNA polymerase, 50 μM dGTP incubated with a DNA duplex (radiolabeled TOP with excess ‘BOT-Sau’). Lane 3 contains Taq DNA polymerase, 50 μM dATP incubated with a DNA duplex (radiolabeled TOP with excess ‘BOT-Sau’). Lane 4 contains Taq DNA polymerase, 50 μM ANS-γ-dATP incubated with a DNA duplex (radiolabeled TOP with excess ‘BOT-Sau’). Lane 5 contains Taq DNA polymerase, 50 μM dGTP incubated with a DNA duplex (radiolabeled TOP with excess ‘BOT-T’). Lane 6 contains spill-over from lane 5. Lane 7 contains Taq DNA polymerase, 50 μM dATP incubated with a DNA duplex (radiolabeled TOP with excess ‘BOT-T’). Lane 8 contains Taq DNA polymerase, 50 μM ANS-γ-dATP incubated with a DNA duplex (radiolabeled TOP with excess ‘BOT-T’). Lane 9 contains Taq DNA polymerase, 50 μM dGTP incubated with a DNA duplex (radiolabeled TOP with excess ‘BOT-3T’). Lane 10 contains Taq DNA polymerase, 50 μM dATP incubated with a DNA duplex (radiolabeled TOP with excess ‘BOT-3T’). Lane 11 contains Taq DNA polymerase, ANS-γ-dATP incubated with a DNA duplex (radiolabeled TOP with excess ‘BOT-3T’). Lane 12 contains 5′ radiolabeled ‘TOP’ probe in extension buffer. Lane 13 contains 5′ radiolabeled ‘TOP’ probe and Taq DNA polymerase in extension buffer. Oligonucleotide sequences are shown in Table V.
  • Quantitative comparison of lane 1 with lane 4 demonstrates that very little non-specific, single-base extension was detected when ANS-γ-dATP was included in the reaction, but the first incorporated base should be dGTP (which was not added to the reaction). Quantitative analysis of lanes 1 and 8 demonstrates that approximately 71% of the TOP primer are extended by a template-directed single base when ANS-γ-dATP was included in the reaction and the first incorporated base should be dATP. Thus, Taq DNA polymerase incorporates γ-tagged nucleotides. Equally important to the polymerase's ability to incorporate a γ-tagged nucleotide is its ability to extend the DNA polymer after the modified DATP was incorporated. Comparison of lane 1 with lane 11 demonstrated that a DNA strand was extended after a γ-tagged nucleotide was incorporated. Thus, incorporation of a modified nucleotide was not detrimental to polymerase activity. Note, too, that extension of the primer strand by incorporation of an ANS-γ-nucleotide depended upon Watson-Crick base-pairing rules. In fact, the fidelity of nucleotide incorporation was increased at least 15-fold by the addition of this tag to the γ-phosphate.
  • This next example illustrates the synthesis of extended DNA polymers using all four ANS tagged γ-phosphate dNTPs. Products generated in these reactions were separated on a 20% denaturing polyacrylamide gel, the gel was dried and imaged following overnight exposure to a Fuji BAS 1000 imaging plate. Referring now to FIG. 6, an image of (A) the actual gel, (B) a lightened phosphorimage and (C) an enhanced phosphorimage. Lane descriptions for A, B, and C follow: Lane 1 is the control containing purified 10-base primer extended to 11 and 12 bases by template-mediated addition of α-32P dCTP. Lane 2 includes the same primer that was incubated with double-stranded plasmid DNA at 96° C. for 3 minutes (to denature template), the reaction was brought to 37° C. (to anneal primer-template), Taq DNA polymerase and all four natural dNTPs (100 uM, each) were added and the reaction was incubated at 37° C. for 60 minutes. Lane 3 includes the same labeled primer that was incubated with double-stranded DNA plasmid at 96° C. for 3 minutes, the reaction was DNA polymerase and all four gamma-modified DNTPs (100 uM, each) were added and the reaction was incubated at 37° C. for 60 minutes. Lane 4 includes the control, purified 10-base primer that was extended to 11 and 12 bases by the addition of α-32P-dCTP was cycled in parallel with lanes 5-8 reactions. Lane 5 includes the same 32P-labeled primer that was incubated with double-stranded plasmid DNA at 96° C. for 3 minutes, the reaction was brought to 37° C. for 10 minutes, during which time Taq DNA polymerase and all four natural dNTPs (100 uM, each) were added. The reaction was cycled 25 times at 96° C. for 10 seconds, 37° C. for 1 minute, and 70° C. for 5 minutes. Lane 6 includes the same 32P-labeled primer that was incubated with double-stranded plasmid DNA at 96° C. for 3 minutes, the reaction was brought to 37° C. for 10 minutes, during which time Taq DNA polymerase and all four gamma-modified dNTPs (100 uM, each) were added. The reaction was cycled 25 times at 96° C. for 10 seconds, 37° C. for 1 minute, and 70° C. for 5 minutes. Lane 7 includes nonpurified, 10-base, 32P-labeled primer that was incubated with double-stranded DNA plasmid at 96° C. for 3 minutes, the reaction was brought to 37° C. for 10 minutes, during which time Taq DNA polymerase and all four natural dNTPs (100 uM, each) were added. The reaction was cycled 25 times at 96° C. for 10 seconds, 37° C. for 1 minute, and 70° C. for 5 minutes. Lane 8 includes nonpurified, 10-base, 32P-labeled primer that was incubated with double-stranded DNA plasmid at 96° C. for 3 minutes, the reaction was brought to 37° C. for 10 minutes, during which time Taq DNA polymerase and all four gamma-modified dNTPs were added. The reaction was cycled 25 times at 96° C. for 10 seconds, 37° C. for 1 minute, and 70° C. for 5 minutes. Evident in the reactions involving tagged dNTPs is a substantial decrease in pyrophosphorolysis as compared to reactions involving natural nucleotides.
  • This next example illustrates the synthesis of long DNA polymers using all four ANS tagged γ-phosphate dNTPs. Each primer extension reaction was split into two fractions, and one fraction was electrophoresed through a 20% denaturing gel (as described above), while the other was electrophoresed through a 6% denaturing gel to better estimate product lengths. The gel was dried and imaged (overnight) to a Fuji BAS 1000 imaging plate. Referring now to FIG. 7, an image of (A) the actual gel, (B) a lightened phosphorimage of the actual gel, and (C) an enhanced phosphorimage of the actual gel. Lane descriptions for A, B, and C follow: Lane 1 includes 123 Marker with size standards indicated at the left of each panel. Lane 2 contains the control, purified 10-base primer extended to 11 and 12 bases by template-mediated addition of α-32P dCTP. Lane 3 contains the same 32P-labeled primer that was incubated with double-stranded plasmid DNA at 96° C. for 3 minutes (to denature template), the reaction was brought to 37° C. (to anneal primer-template), Taq DNA polymerase and all four natural dNTPs (100 uM, each) were added and the reaction was incubated at 37° C. for 60 minutes. Lane 4 includes the same 32P-labeled primer that was incubated with double-stranded DNA plasmid at 96° C. for 3 minutes, the reaction was brought to 37° C., Taq DNA polymerase and all four gamma-modified dNTPs (100 uM, each) were added and the reaction was incubated at 37° C. for 60 minutes. Lane 5 includes the control, purified 10-base primer that was extended to 11 and 12 bases by the addition of α-32P-dCTP was cycled in parallel with lanes 5-8 reactions. Lane 6 includes the same 32P-labeled primer that was incubated with double-stranded plasmid DNA at 96° C. for 3 minutes, the reaction was brought to 37° C. for 10 minutes, during which time Taq DNA polymerase and all four natural dNTPs (100 uM, each) were added. The reaction was cycled 25 times at 96° C. for 10 seconds, 37° C. for 1 minute, and 70° C. for 5 minutes. Lane 7 includes the same 32P-labeled primer that was incubated with double-stranded plasmid DNA at 96° C. for 3 minutes, the reaction was brought to 37° C. for 10 minutes, during which time Taq DNA polymerase and all four gamma-modified dNTPs (100 uM, each) were added. The reaction was cycled 25 times at 96° C. for 10 seconds, 37° C. for 1 minute, and 70° C. for 5 minutes. Lane 8 includes nonpurified, 10-base, 32P-labeled primer that was incubated with double-stranded DNA plasmid at 96° C. for 3 minutes, the reaction was brought to 37° C. for 10 minutes, during which time Taq DNA polymerase and all four natural dNTPs (100 uM, each) were added. The reaction was cycled 25 times at 96° C. for 10 seconds, 37° C. for 1 minute, and 70° C. for 5 minutes. Lane 9 includes nonpurified, 10-base, 32P-labeled primer that was incubated with double-stranded DNA plasmid at 96° C. for 3 minutes, the reaction was brought to 37° C. for 10 minutes, during which time Taq DNA polymerase and all four gamma-modified dNTPs were added. The reaction was cycled 25 times at 96° C. for 10 seconds, 37° C. for 1 minute, and 70° C. for 5 minutes.
  • The majority of extension products in this reaction are several hundred bases long for both natural and γ-modified dNTPs, and a significant percentage of these products are too large to enter the gel. Thus, demonstrating the gamma phosphate tagged dNTPs are used by Taq polymerase to generate long DNA polymers that are non-tagged or native DNA polymer chains.
  • Different Polymerases React Differently to the Gamma-Modified Nucleotides
  • The indicated enzyme (Taq DNA Polymerase, SEQUENASE®, HIV-1 Reverse Transcriptase, T7 DNA Polymerase, Klenow Fragment, Pfu DNA Polymerase) were incubated in the manufacturers suggested reaction buffer, 50 μM of the indicated nucleotide at 37° C. for 30-60 minutes, and the reaction products were analyzed by size separation through a 20% denaturing gel.
  • Taq DNA polymerase efficiently uses the gamma-modified nucleotides to synthesize extended DNA polymers at increased accuracy as shown in FIGS. 4-6.
  • The Klenow fragment from E. coli DNA polymerase I efficiently uses the gamma-modified nucleotides, but does not exhibit the extreme fidelity improvements observed with other enzymes as shown in FIG. 8.
  • Pfu DNA polymerase does not efficiently use gamma-modified nucleotides and is, thus, not a preferred enzyme for the single-molecule sequencing system as shown in FIG. 9.
  • HIV-1 reverse transcriptase efficiently uses the gamma-tagged nucleotides, and significant fidelity improvement results as shown in FIG. 10.
  • Polymerization activity is difficult to detect in the reaction products generated by native T7 DNA polymerase (due to the presence of the enzymes exonuclease activity). However, its genetically modified derivative, SEQUENASE®, shows that the gamma-modified nucleotides are efficiently incorporated, and that incorporation fidelity is improved, relative to non-modified nucleotides. The experimental results for native T7 DNA polymerase and SEQUENASE® are shown in FIG. 11.
  • Thus, for the Taq polymerase or the HIV1 reverse transcriptase, improved fidelity, due to the use of the gamma-modified dNTPs of this invention, enables single-molecule DNA sequencing. However, not all polymerases equally utilize the gamma-modified nucleotides of this invention, specifically, Klenow, SEQUENASE®, HIV-1 reverse transcriptase and Taq polymerases incorporate the modified nucleotides of this invention, while the Pfu DNA polymerase does not appear to incorporate the modified nucleotides of this invention.
  • Improved PCR-Generation of Long DNA Sequences
  • The fidelity of nucleic acid synthesis is a limiting factor in achieving amplification of long target molecules using PCR. The misincorporation of nucleotides during the synthesis of primer extension products limits the length of target that can be efficiently amplified. The effect on primer extension of a 3′-terminal base that is mismatched with the template is described in Huang et al., 1992, Nucl. Acids Res. 20:4567-4573, incorporated herein by reference. The presence of misincorporated nucleotides may result in prematurely terminated strand synthesis, reducing the number of template strands for future rounds of amplification, and thus reducing the efficiency of long target amplification. Even low levels of nucleotide misincorporation may become critical for sequences longer than 10 kb. The data shown in FIG. 4 shows that the fidelity of DNA synthesis using gamma tagged dNTPs is improved for the native Taq polymerase making longer DNA extension possible without the need for adding polymerases with 3′- to 5′ exonuclease, or “proofreading”, activity as required in the long-distance PCR method developed by Cheng et al., U.S. Pat. No. 5,512,462, incorporated herein by reference. Thus, the present invention provides an improved PCR system for generating increased extension length PCR amplified DNA products comprising contacting a native Taq polymerase with gamma tagged dNTPs of this invention under PCR reaction conditions. The extended length PCR products are due to improved accuracy of base incorporation, resulting from the use of the gamma-modified dNTPs of this invention.
  • Signal Intensity and Reaction Kinetics Provide Information Concerning Base Identity
  • Signal intensities for each nucleotide in the extended DNA strand are used to determine, confirm or support base identity data. Referring now to FIG. 12, the solid line corresponds to reaction products produced when the four natural nucleotides (dATP, dCTP, dGTP and dTTP) are included in the synthesis reaction. The dashed or broken line corresponds to reaction products produced when proprietary, base-modified nucleotides are included in the reaction. As is clearly demonstrated, sequence context and base modification(s) influence reaction product intensity and/or kinetics, and these identifying patterns are incorporated into proprietary base-calling software to provide a high confidence value for base identity at each sequenced position.
  • All references cited herein and listed in are incorporated by reference. While this invention has been described fully and completely, it should be understood that, within the scope of the appended claims, the invention maybe practiced otherwise than as specifically described. Although the invention has been disclosed with reference to its preferred embodiments, from reading this description those of skill in the art may appreciate changes and modifications that may be made which do not depart from the scope and spirit of the invention as described above and claimed hereafter.

Claims (21)

1. A system for determining sequence information at the single molecule level, comprising:
a detector comprising a viewing field,
at least one active complex confined within the viewing field, where the complex comprises:
a reaction buffer,
a polymerizing agent,
a template,
a primer adapted to duplex with a portion of the template to form an extendable duplex, and
monomer types for the polymerizing agent,
where the polymerizing agent and/or the monomer types comprising tags having detectable properties that undergo changes before, during and/or after a monomer incorporation event, where the tags and the detectable properties are same or different; and
an excitation source adapted to excite the polymerizing agent and/or monomer tags within the viewing field,
where the detector is adapted to detect changes in the detectable properties of the tags before, during and/or after one or a series of monomer incorporation events and to convert the incorporation event or events into an identity of one monomer or identities of a plurality of monomers complementary to a sequence of monomers in the template.
2. The system of claim 1, wherein the complex does not comprise a quencher.
3. The system of claim 1, wherein the detector further comprises data channels within the viewing field, each data channel comprising one pixel or a plurality of pixels.
4. The system of claim 1, further comprising a plurality of active complexes confined within the viewing field and where the detector further comprises data channels within the viewing field, each data channel comprising one pixel or a plurality of pixels and where the active complexes are distributed within the viewing field to minimize data channels having a multiple active complexes therein.
5. The system of claim 1, wherein the monomer types comprise tags and where the tags on each monomer type are all different and where the detectable properties are fluorescence from each tag.
6. The system of claim 5, wherein the polymerizing agent tag is a donor fluorophore and the monomer tags are acceptor fluorophores, each monomer type having a different fluorophore having a different fluorescent signature and the acceptors fluoresce after fluorescent resonance energy transfer (FRET) with the donor.
7. The system of claim 6, wherein the tag on the polymerizing agent is positioned so that during monomer incorporation, the donor and acceptor tags are brought into close proximity of between 100 Å and 10 Å.
8. An apparatus for determining sequence information at the single molecule level comprising:
a substrate,
a detector comprising a viewing field focused on a region of the substrate,
at least one active sequencing complex confined within the viewing field and comprising a reaction buffer, a polymerizing agent, a template, a primer and monomer types for the polymerizing agent, where the polymerizing agent and/or the monomer types comprise tags having detectable properties that undergo changes before, during and/or after a monomer incorporation event, where the tags and the detectable properties are same or different; and
an excitation source adapted to excite the tags within the viewing field,
where the detector is adapted to detect changes in the detectable properties of the tags before, during and/or after one monomer incorporation event or a series of monomer incorporation events and to convert the event or events into an identity of one monomer or identities of a plurality of monomers complementary to a sequence of monomers in the template.
9. The apparatus of claim 8, wherein the complex does not comprise a quencher.
10. The apparatus of claim 8, wherein the detector further comprises data channels within the viewing field, each data channel comprising one pixel or a plurality of pixels.
11. The apparatus of claim 8, further comprising a plurality of active complexes confined within the viewing field and where the detector further comprises data channels within the viewing field, each data channel comprising one pixel or a plurality of pixels and where the active complexes are distributed within the viewing field to minimize data channels having a multiple active complexes therein.
12. The apparatus of claim 8, wherein the monomer types comprise tags and where the tags on each monomer type are all different and where the detectable properties are fluorescence from each tag.
13. The apparatus of claim 12, wherein the polymerizing agent tag is a donor fluorophore and the monomer tags are acceptor fluorophores, each monomer type having a different fluorophore having a different fluorescent signature and the acceptors fluoresce after fluorescent resonance energy transfer (FRET) with the donor.
14. The apparatus of claim 13, wherein the tag on the polymerizing agent is positioned so that during monomer incorporation, the donor and acceptor tags are brought into close proximity of between 100 Å and 10 Å.
15. A method for determining sequence information at the single molecule level comprising:
providing a detector comprising a viewing field;
providing at least one active sequencing complex confined within the viewing field, where the complex comprises:
a reaction buffer,
a polymerizing agent,
a template,
a primer adapted to duplex with a portion of the template to form an extendable duplex, and
monomer types for the polymerizing agent,
where the polymerizing agent and/or the monomer types comprise tags having detectable properties that undergo changes before, during and/or after a monomer incorporation event, where the tags and the detectable properties are same or different;
an excitation source adapted to excite the tags within the viewing field,
detecting changes in the detectable properties of the tags before, during and/or after one monomer incorporation event or a series of monomer incorporation events; and
converting the event or events into an identity of one monomer or identities of a plurality of monomers complementary to a sequence of monomers in the template.
16. The method of claim 15, wherein the complex does not comprise a quencher.
17. The method of claim 15, wherein the detector further comprises data channels within the viewing field, each data channel comprising one pixel or a plurality of pixels.
18. The method of claim 15, further comprising a plurality of active complexes confined within the viewing field and where the detector further comprises data channels within the viewing field, each data channel comprising one pixel or a plurality of pixels and where the active complexes are distributed within the viewing field to minimize data channels having a multiple active complexes therein.
19. The method of claim 15, wherein the monomer types comprise tags and where the tags on each monomer type are all different and where the detectable properties are fluorescence from each tag.
20. The method of claim 19, wherein the polymerizing agent tag is a donor fluorophore and the monomer tags are acceptor fluorophores, each monomer type having a different fluorophore having a different fluorescent signature and the acceptors fluoresce after fluorescent resonance energy transfer (FRET) with the donor.
21. The method of claim 20, wherein the tag on the polymerizing agent is positioned so that during monomer incorporation, the donor and acceptor tags are brought into close proximity of between 100 Å and 10 Å.
US12/419,660 2000-07-07 2009-04-07 Methods for sequence determination Abandoned US20110059436A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US12/419,660 US20110059436A1 (en) 2000-07-07 2009-04-07 Methods for sequence determination

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US21659400P 2000-07-07 2000-07-07
US09/901,782 US20030064366A1 (en) 2000-07-07 2001-07-09 Real-time sequence determination
US11/648,191 US20070172867A1 (en) 2000-07-07 2006-12-29 Methods for sequence determination
US12/419,660 US20110059436A1 (en) 2000-07-07 2009-04-07 Methods for sequence determination

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US11/648,191 Continuation US20070172867A1 (en) 2000-07-07 2006-12-29 Methods for sequence determination

Publications (1)

Publication Number Publication Date
US20110059436A1 true US20110059436A1 (en) 2011-03-10

Family

ID=22807697

Family Applications (22)

Application Number Title Priority Date Filing Date
US09/901,782 Abandoned US20030064366A1 (en) 2000-07-07 2001-07-09 Real-time sequence determination
US11/007,642 Abandoned US20050266424A1 (en) 2000-07-07 2004-12-08 Real-time sequence determination
US11/007,797 Expired - Lifetime US7329492B2 (en) 2000-07-07 2004-12-08 Methods for real-time single molecule sequence determination
US11/648,184 Abandoned US20100255463A1 (en) 2000-07-07 2006-12-29 Compositions and methods for sequence determination
US11/648,174 Abandoned US20070172865A1 (en) 2000-07-07 2006-12-29 Sequence determination in confined regions
US11/648,856 Abandoned US20070275395A1 (en) 2000-07-07 2006-12-29 Tagged monomers for use in sequence determination
US11/648,136 Abandoned US20070172862A1 (en) 2000-07-07 2006-12-29 Data stream determination
US11/648,137 Abandoned US20070292867A1 (en) 2000-07-07 2006-12-29 Sequence determination using multiply tagged polymerizing agents
US11/648,106 Abandoned US20070172858A1 (en) 2000-07-07 2006-12-29 Methods for sequence determination
US11/648,713 Abandoned US20070184475A1 (en) 2000-07-07 2006-12-29 Sequence determination by direct detection
US11/648,138 Abandoned US20070172863A1 (en) 2000-07-07 2006-12-29 Compositions and methods for sequence determination
US11/648,115 Abandoned US20070172861A1 (en) 2000-07-07 2006-12-29 Mutant polymerases
US11/648,191 Abandoned US20070172867A1 (en) 2000-07-07 2006-12-29 Methods for sequence determination
US11/648,722 Abandoned US20070172868A1 (en) 2000-07-07 2006-12-29 Compositions for sequence determination using tagged polymerizing agents and tagged monomers
US11/648,164 Abandoned US20070172864A1 (en) 2000-07-07 2006-12-29 Composition for sequence determination
US12/410,370 Abandoned US20110014604A1 (en) 2000-07-07 2009-03-24 Methods for sequence determination
US12/412,208 Abandoned US20100304367A1 (en) 2000-07-07 2009-03-26 Compositions for real-time, single molecule sequence determination
US12/411,997 Abandoned US20110021383A1 (en) 2000-07-07 2009-03-26 Apparatuses for real-time, single molecule sequence determination
US12/414,417 Abandoned US20090275036A1 (en) 2000-07-07 2009-03-30 Systems and methods for real time single molecule sequence determination
US12/419,214 Abandoned US20090305278A1 (en) 2000-07-07 2009-04-06 Sequence determination in confined regions
US12/419,660 Abandoned US20110059436A1 (en) 2000-07-07 2009-04-07 Methods for sequence determination
US12/724,392 Abandoned US20100317005A1 (en) 2000-07-07 2010-03-15 Modified Nucleotides and Methods for Making and Use Same

Family Applications Before (20)

Application Number Title Priority Date Filing Date
US09/901,782 Abandoned US20030064366A1 (en) 2000-07-07 2001-07-09 Real-time sequence determination
US11/007,642 Abandoned US20050266424A1 (en) 2000-07-07 2004-12-08 Real-time sequence determination
US11/007,797 Expired - Lifetime US7329492B2 (en) 2000-07-07 2004-12-08 Methods for real-time single molecule sequence determination
US11/648,184 Abandoned US20100255463A1 (en) 2000-07-07 2006-12-29 Compositions and methods for sequence determination
US11/648,174 Abandoned US20070172865A1 (en) 2000-07-07 2006-12-29 Sequence determination in confined regions
US11/648,856 Abandoned US20070275395A1 (en) 2000-07-07 2006-12-29 Tagged monomers for use in sequence determination
US11/648,136 Abandoned US20070172862A1 (en) 2000-07-07 2006-12-29 Data stream determination
US11/648,137 Abandoned US20070292867A1 (en) 2000-07-07 2006-12-29 Sequence determination using multiply tagged polymerizing agents
US11/648,106 Abandoned US20070172858A1 (en) 2000-07-07 2006-12-29 Methods for sequence determination
US11/648,713 Abandoned US20070184475A1 (en) 2000-07-07 2006-12-29 Sequence determination by direct detection
US11/648,138 Abandoned US20070172863A1 (en) 2000-07-07 2006-12-29 Compositions and methods for sequence determination
US11/648,115 Abandoned US20070172861A1 (en) 2000-07-07 2006-12-29 Mutant polymerases
US11/648,191 Abandoned US20070172867A1 (en) 2000-07-07 2006-12-29 Methods for sequence determination
US11/648,722 Abandoned US20070172868A1 (en) 2000-07-07 2006-12-29 Compositions for sequence determination using tagged polymerizing agents and tagged monomers
US11/648,164 Abandoned US20070172864A1 (en) 2000-07-07 2006-12-29 Composition for sequence determination
US12/410,370 Abandoned US20110014604A1 (en) 2000-07-07 2009-03-24 Methods for sequence determination
US12/412,208 Abandoned US20100304367A1 (en) 2000-07-07 2009-03-26 Compositions for real-time, single molecule sequence determination
US12/411,997 Abandoned US20110021383A1 (en) 2000-07-07 2009-03-26 Apparatuses for real-time, single molecule sequence determination
US12/414,417 Abandoned US20090275036A1 (en) 2000-07-07 2009-03-30 Systems and methods for real time single molecule sequence determination
US12/419,214 Abandoned US20090305278A1 (en) 2000-07-07 2009-04-06 Sequence determination in confined regions

Family Applications After (1)

Application Number Title Priority Date Filing Date
US12/724,392 Abandoned US20100317005A1 (en) 2000-07-07 2010-03-15 Modified Nucleotides and Methods for Making and Use Same

Country Status (9)

Country Link
US (22) US20030064366A1 (en)
EP (3) EP1975251A3 (en)
JP (3) JP2004513619A (en)
CN (2) CN100462433C (en)
AT (1) ATE377093T1 (en)
AU (2) AU8288101A (en)
CA (1) CA2415897A1 (en)
DE (1) DE60131194T2 (en)
WO (1) WO2002004680A2 (en)

Cited By (93)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110003343A1 (en) * 2009-03-27 2011-01-06 Life Technologies Corporation Conjugates of biomolecules to nanoparticles
WO2013130512A3 (en) * 2012-02-27 2013-10-24 The University Of North Carolina At Chapel Hill Methods and uses for molecular tags
WO2014210225A1 (en) 2013-06-25 2014-12-31 Prognosys Biosciences, Inc. Methods and systems for determining spatial patterns of biological targets in a sample
WO2015042708A1 (en) 2013-09-25 2015-04-02 Bio-Id Diagnostic Inc. Methods for detecting nucleic acid fragments
US9290809B2 (en) 2009-12-15 2016-03-22 Cellular Research, Inc. Digital counting of individual molecules by stochastic attachment of diverse labels
US9315857B2 (en) 2009-12-15 2016-04-19 Cellular Research, Inc. Digital counting of individual molecules by stochastic attachment of diverse label-tags
US9482615B2 (en) 2010-03-15 2016-11-01 Industrial Technology Research Institute Single-molecule detection system and methods
US9567646B2 (en) 2013-08-28 2017-02-14 Cellular Research, Inc. Massively parallel single cell analysis
US9582877B2 (en) 2013-10-07 2017-02-28 Cellular Research, Inc. Methods and systems for digitally counting features on arrays
EP3150750A2 (en) 2011-04-08 2017-04-05 Prognosys Biosciences, Inc. Peptide constructs and assay systems
US9670243B2 (en) 2010-06-02 2017-06-06 Industrial Technology Research Institute Compositions and methods for sequencing nucleic acids
US9727810B2 (en) 2015-02-27 2017-08-08 Cellular Research, Inc. Spatially addressable molecular barcoding
US9778188B2 (en) 2009-03-11 2017-10-03 Industrial Technology Research Institute Apparatus and method for detection and discrimination molecular object
US9995683B2 (en) 2010-06-11 2018-06-12 Industrial Technology Research Institute Apparatus for single-molecule detection
US10202641B2 (en) 2016-05-31 2019-02-12 Cellular Research, Inc. Error correction in amplification of samples
US10301677B2 (en) 2016-05-25 2019-05-28 Cellular Research, Inc. Normalization of nucleic acid libraries
US10338066B2 (en) 2016-09-26 2019-07-02 Cellular Research, Inc. Measurement of protein expression using reagents with barcoded oligonucleotide sequences
US10480022B2 (en) 2010-04-05 2019-11-19 Prognosys Biosciences, Inc. Spatially encoded biological assays
US10619186B2 (en) 2015-09-11 2020-04-14 Cellular Research, Inc. Methods and compositions for library normalization
US10640763B2 (en) 2016-05-31 2020-05-05 Cellular Research, Inc. Molecular indexing of internal sequences
US10669570B2 (en) 2017-06-05 2020-06-02 Becton, Dickinson And Company Sample indexing for single cells
US10697010B2 (en) 2015-02-19 2020-06-30 Becton, Dickinson And Company High-throughput single-cell analysis combining proteomic and genomic information
US10722880B2 (en) 2017-01-13 2020-07-28 Cellular Research, Inc. Hydrophilic coating of fluidic channels
US10822643B2 (en) 2016-05-02 2020-11-03 Cellular Research, Inc. Accurate molecular barcoding
US10941396B2 (en) 2012-02-27 2021-03-09 Becton, Dickinson And Company Compositions and kits for molecular counting
US11124823B2 (en) 2015-06-01 2021-09-21 Becton, Dickinson And Company Methods for RNA quantification
US11162132B2 (en) 2015-04-10 2021-11-02 Spatial Transcriptomics Ab Spatially distinguished, multiplex nucleic acid analysis of biological specimens
US11164659B2 (en) 2016-11-08 2021-11-02 Becton, Dickinson And Company Methods for expression profile classification
US11208684B2 (en) 2010-04-05 2021-12-28 Prognosys Biosciences, Inc. Spatially encoded biological assays
US11319583B2 (en) 2017-02-01 2022-05-03 Becton, Dickinson And Company Selective amplification using blocking oligonucleotides
US11332790B2 (en) 2019-12-23 2022-05-17 10X Genomics, Inc. Methods for spatial analysis using RNA-templated ligation
US11352659B2 (en) 2011-04-13 2022-06-07 Spatial Transcriptomics Ab Methods of detecting analytes
US11365409B2 (en) 2018-05-03 2022-06-21 Becton, Dickinson And Company Molecular barcoding on opposite transcript ends
US11371076B2 (en) 2019-01-16 2022-06-28 Becton, Dickinson And Company Polymerase chain reaction normalization through primer titration
US11390914B2 (en) 2015-04-23 2022-07-19 Becton, Dickinson And Company Methods and compositions for whole transcriptome amplification
US11397882B2 (en) 2016-05-26 2022-07-26 Becton, Dickinson And Company Molecular label counting adjustment methods
US11408029B2 (en) 2020-06-25 2022-08-09 10X Genomics, Inc. Spatial analysis of DNA methylation
US11407992B2 (en) 2020-06-08 2022-08-09 10X Genomics, Inc. Methods of determining a surgical margin and methods of use thereof
US11434524B2 (en) 2020-06-10 2022-09-06 10X Genomics, Inc. Methods for determining a location of an analyte in a biological sample
US11492660B2 (en) 2018-12-13 2022-11-08 Becton, Dickinson And Company Selective extension in single cell whole transcriptome analysis
US11512308B2 (en) 2020-06-02 2022-11-29 10X Genomics, Inc. Nucleic acid library methods
US11519033B2 (en) 2018-08-28 2022-12-06 10X Genomics, Inc. Method for transposase-mediated spatial tagging and analyzing genomic DNA in a biological sample
US11535887B2 (en) 2020-04-22 2022-12-27 10X Genomics, Inc. Methods for spatial analysis using targeted RNA depletion
US11535882B2 (en) 2015-03-30 2022-12-27 Becton, Dickinson And Company Methods and compositions for combinatorial barcoding
US11560592B2 (en) 2020-05-26 2023-01-24 10X Genomics, Inc. Method for resetting an array
US11592447B2 (en) 2019-11-08 2023-02-28 10X Genomics, Inc. Spatially-tagged analyte capture agents for analyte multiplexing
US11608497B2 (en) 2016-11-08 2023-03-21 Becton, Dickinson And Company Methods for cell label classification
US11608520B2 (en) 2020-05-22 2023-03-21 10X Genomics, Inc. Spatial analysis to detect sequence variants
US11618897B2 (en) 2020-12-21 2023-04-04 10X Genomics, Inc. Methods, compositions, and systems for capturing probes and/or barcodes
US11624086B2 (en) 2020-05-22 2023-04-11 10X Genomics, Inc. Simultaneous spatio-temporal measurement of gene expression and cellular activity
US11639517B2 (en) 2018-10-01 2023-05-02 Becton, Dickinson And Company Determining 5′ transcript sequences
US11649485B2 (en) 2019-01-06 2023-05-16 10X Genomics, Inc. Generating capture probes for spatial analysis
US11649497B2 (en) 2020-01-13 2023-05-16 Becton, Dickinson And Company Methods and compositions for quantitation of proteins and RNA
US11661625B2 (en) 2020-05-14 2023-05-30 Becton, Dickinson And Company Primers for immune repertoire profiling
US11661631B2 (en) 2019-01-23 2023-05-30 Becton, Dickinson And Company Oligonucleotides associated with antibodies
US11692218B2 (en) 2020-06-02 2023-07-04 10X Genomics, Inc. Spatial transcriptomics for antigen-receptors
US11702693B2 (en) 2020-01-21 2023-07-18 10X Genomics, Inc. Methods for printing cells and generating arrays of barcoded cells
US11702698B2 (en) 2019-11-08 2023-07-18 10X Genomics, Inc. Enhancing specificity of analyte binding
US11733238B2 (en) 2010-04-05 2023-08-22 Prognosys Biosciences, Inc. Spatially encoded biological assays
US11732299B2 (en) 2020-01-21 2023-08-22 10X Genomics, Inc. Spatial assays with perturbed cells
US11732300B2 (en) 2020-02-05 2023-08-22 10X Genomics, Inc. Increasing efficiency of spatial analysis in a biological sample
US11739443B2 (en) 2020-11-20 2023-08-29 Becton, Dickinson And Company Profiling of highly expressed and lowly expressed proteins
US11739381B2 (en) 2021-03-18 2023-08-29 10X Genomics, Inc. Multiplex capture of gene and protein expression from a biological sample
US11753673B2 (en) 2021-09-01 2023-09-12 10X Genomics, Inc. Methods, compositions, and kits for blocking a capture probe on a spatial array
US11761038B1 (en) 2020-07-06 2023-09-19 10X Genomics, Inc. Methods for identifying a location of an RNA in a biological sample
US11768175B1 (en) 2020-03-04 2023-09-26 10X Genomics, Inc. Electrophoretic methods for spatial analysis
US11773436B2 (en) 2019-11-08 2023-10-03 Becton, Dickinson And Company Using random priming to obtain full-length V(D)J information for immune repertoire sequencing
US11773441B2 (en) 2018-05-03 2023-10-03 Becton, Dickinson And Company High throughput multiomics sample analysis
US11821035B1 (en) 2020-01-29 2023-11-21 10X Genomics, Inc. Compositions and methods of making gene expression libraries
US11827935B1 (en) 2020-11-19 2023-11-28 10X Genomics, Inc. Methods for spatial analysis using rolling circle amplification and detection probes
US11835462B2 (en) 2020-02-11 2023-12-05 10X Genomics, Inc. Methods and compositions for partitioning a biological sample
US11891654B2 (en) 2020-02-24 2024-02-06 10X Genomics, Inc. Methods of making gene expression libraries
US11898205B2 (en) 2020-02-03 2024-02-13 10X Genomics, Inc. Increasing capture efficiency of spatial assays
US11926867B2 (en) 2019-01-06 2024-03-12 10X Genomics, Inc. Generating capture probes for spatial analysis
US11926822B1 (en) 2020-09-23 2024-03-12 10X Genomics, Inc. Three-dimensional spatial analysis
US11926863B1 (en) 2020-02-27 2024-03-12 10X Genomics, Inc. Solid state single cell method for analyzing fixed biological cells
US11932901B2 (en) 2020-07-13 2024-03-19 Becton, Dickinson And Company Target enrichment using nucleic acid probes for scRNAseq
US11932849B2 (en) 2018-11-08 2024-03-19 Becton, Dickinson And Company Whole transcriptome analysis of single cells using random priming
US11933957B1 (en) 2018-12-10 2024-03-19 10X Genomics, Inc. Imaging system hardware
US11939622B2 (en) 2019-07-22 2024-03-26 Becton, Dickinson And Company Single cell chromatin immunoprecipitation sequencing assay
US11946095B2 (en) 2017-12-19 2024-04-02 Becton, Dickinson And Company Particles associated with oligonucleotides
US11965213B2 (en) 2019-05-30 2024-04-23 10X Genomics, Inc. Methods of detecting spatial heterogeneity of a biological sample
US11965208B2 (en) 2019-04-19 2024-04-23 Becton, Dickinson And Company Methods of associating phenotypical data and single cell sequencing data
US11981958B1 (en) 2020-08-20 2024-05-14 10X Genomics, Inc. Methods for spatial analysis using DNA capture
US11981960B1 (en) 2020-07-06 2024-05-14 10X Genomics, Inc. Spatial analysis utilizing degradable hydrogels
US12031177B1 (en) 2020-06-04 2024-07-09 10X Genomics, Inc. Methods of enhancing spatial resolution of transcripts
USRE50065E1 (en) 2012-10-17 2024-07-30 10X Genomics Sweden Ab Methods and product for optimising localised or spatial detection of gene expression in a tissue sample
US12071655B2 (en) 2021-06-03 2024-08-27 10X Genomics, Inc. Methods, compositions, kits, and systems for enhancing analyte capture for spatial analysis
US12071617B2 (en) 2019-02-14 2024-08-27 Becton, Dickinson And Company Hybrid targeted and whole transcriptome amplification
US12076701B2 (en) 2020-01-31 2024-09-03 10X Genomics, Inc. Capturing oligonucleotides in spatial transcriptomics
US12098985B2 (en) 2021-02-19 2024-09-24 10X Genomics, Inc. Modular assay support devices
US12110541B2 (en) 2020-02-03 2024-10-08 10X Genomics, Inc. Methods for preparing high-resolution spatial arrays
US12117439B2 (en) 2019-12-23 2024-10-15 10X Genomics, Inc. Compositions and methods for using fixed biological samples

Families Citing this family (623)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB9213691D0 (en) * 1992-06-27 1992-08-12 Hurlstone Gary Personal alarm torch
US5823720A (en) * 1996-02-16 1998-10-20 Bitmoore High precision cutting tools
US6780591B2 (en) 1998-05-01 2004-08-24 Arizona Board Of Regents Method of determining the nucleotide sequence of oligonucleotides and DNA molecules
US7875440B2 (en) 1998-05-01 2011-01-25 Arizona Board Of Regents Method of determining the nucleotide sequence of oligonucleotides and DNA molecules
US6232075B1 (en) 1998-12-14 2001-05-15 Li-Cor, Inc. Heterogeneous assay for pyrophosphate detection
US7056661B2 (en) 1999-05-19 2006-06-06 Cornell Research Foundation, Inc. Method for sequencing nucleic acid molecules
US6818395B1 (en) * 1999-06-28 2004-11-16 California Institute Of Technology Methods and apparatus for analyzing polynucleotide sequences
US6982146B1 (en) * 1999-08-30 2006-01-03 The United States Of America As Represented By The Department Of Health And Human Services High speed parallel molecular nucleic acid sequencing
US20070172866A1 (en) * 2000-07-07 2007-07-26 Susan Hardin Methods for sequence determination using depolymerizing agent
US20030064366A1 (en) * 2000-07-07 2003-04-03 Susan Hardin Real-time sequence determination
WO2002044425A2 (en) * 2000-12-01 2002-06-06 Visigen Biotechnologies, Inc. Enzymatic nucleic acid synthesis: compositions and methods for altering monomer incorporation fidelity
CA2440754A1 (en) * 2001-03-12 2002-09-19 Stephen Quake Methods and apparatus for analyzing polynucleotide sequences by asynchronous base extension
US7668697B2 (en) * 2006-02-06 2010-02-23 Andrei Volkov Method for analyzing dynamic detectable events at the single molecule level
US7223541B2 (en) * 2001-08-29 2007-05-29 Ge Healthcare Bio-Sciences Corp. Terminal-phosphate-labeled nucleotides and methods of use
DE60235376D1 (en) * 2001-08-29 2010-04-01 Ge Healthcare Bio Sciences LABELED NUCLEOSIDE POLYPHOSPHATES
US7727722B2 (en) * 2001-08-29 2010-06-01 General Electric Company Ligation amplification
US7052839B2 (en) * 2001-08-29 2006-05-30 Amersham Biosciences Corp Terminal-phosphate-labeled nucleotides and methods of use
US7033762B2 (en) * 2001-08-29 2006-04-25 Amersham Biosciences Corp Single nucleotide amplification and detection by polymerase
US6852492B2 (en) * 2001-09-24 2005-02-08 Intel Corporation Nucleic acid sequencing by raman monitoring of uptake of precursors during molecular replication
US20050009124A1 (en) * 2001-11-26 2005-01-13 Echelon Biosciences Incorporated Assays for detection of phosphoinositide kinase and phosphatase activity
EP1497304B1 (en) * 2002-04-12 2014-06-25 Catalyst Assets LLC Dual-labeled nucleotides
ATE547535T1 (en) * 2002-08-29 2012-03-15 Ge Healthcare Bio Sciences NUCLEOSIDE POLYPHOSPHATES WITH BLOCKED TERMINAL PHOSPHATE
US7563600B2 (en) 2002-09-12 2009-07-21 Combimatrix Corporation Microarray synthesis and assembly of gene-length polynucleotides
AU2003290925A1 (en) * 2002-11-14 2004-06-15 John Wayne Cancer Institute Detection of micro metastasis of melanoma and breast cancer in paraffin-embedded tumor draining lymph nodes by multimaker quantitative rt-pcr
US20090186343A1 (en) * 2003-01-28 2009-07-23 Visigen Biotechnologies, Inc. Methods for preparing modified biomolecules, modified biomolecules and methods for using same
WO2004072304A1 (en) * 2003-02-05 2004-08-26 Amersham Biosciences Corp Nucleic acid amplification
EP1592779A4 (en) * 2003-02-05 2007-12-12 Ge Healthcare Bio Sciences Terminal-phosphate-labeled nucleotides with new linkers
TW200500290A (en) * 2003-02-10 2005-01-01 Reveo Inc Micro-nozzle, nano-nozzle, manufacturing methods therefor, applications therefor
WO2004092331A2 (en) * 2003-04-08 2004-10-28 Li-Cor, Inc. Composition and method for nucleic acid sequencing
GB0324456D0 (en) * 2003-10-20 2003-11-19 Isis Innovation Parallel DNA sequencing methods
US7169560B2 (en) 2003-11-12 2007-01-30 Helicos Biosciences Corporation Short cycle methods for sequencing polynucleotides
US7981604B2 (en) 2004-02-19 2011-07-19 California Institute Of Technology Methods and kits for analyzing polynucleotide sequences
EP1758981A4 (en) 2004-05-28 2013-01-16 Wafergen Inc Apparatus and methods for multiplex analyses
US7692219B1 (en) 2004-06-25 2010-04-06 University Of Hawaii Ultrasensitive biosensors
WO2006044078A2 (en) * 2004-09-17 2006-04-27 Pacific Biosciences Of California, Inc. Apparatus and method for analysis of molecules
US7170050B2 (en) 2004-09-17 2007-01-30 Pacific Biosciences Of California, Inc. Apparatus and methods for optical analysis of molecules
US7785785B2 (en) 2004-11-12 2010-08-31 The Board Of Trustees Of The Leland Stanford Junior University Charge perturbation detection system for DNA and other molecules
US7405434B2 (en) * 2004-11-16 2008-07-29 Cornell Research Foundation, Inc. Quantum dot conjugates in a sub-micrometer fluidic channel
JP2006197832A (en) * 2005-01-19 2006-08-03 Tohoku Univ Multi-purpose method for detecting substrate affinity of abc transporter having cysteine residue induced by variation by using environment-sensitive fluorescent probe
WO2006086673A2 (en) * 2005-02-09 2006-08-17 Pacific Biosciences Of California, Inc. Nucleotide compositions and uses thereof
EP1901057A1 (en) * 2005-07-07 2008-03-19 Sony Corporation Substance information acquisition method and substance information measurement device using evanescent light, and base arrangement decision method and device
GB0514935D0 (en) 2005-07-20 2005-08-24 Solexa Ltd Methods for sequencing a polynucleotide template
US7666593B2 (en) 2005-08-26 2010-02-23 Helicos Biosciences Corporation Single molecule sequencing of captured nucleic acids
WO2007041621A2 (en) * 2005-10-03 2007-04-12 Xingsheng Sean Ling Hybridization assisted nanopore sequencing
US9156004B2 (en) 2005-10-17 2015-10-13 Stc.Unm Fabrication of enclosed nanochannels using silica nanoparticles
US10060904B1 (en) 2005-10-17 2018-08-28 Stc.Unm Fabrication of enclosed nanochannels using silica nanoparticles
US7825037B2 (en) * 2005-10-17 2010-11-02 Stc.Unm Fabrication of enclosed nanochannels using silica nanoparticles
US8017331B2 (en) * 2005-11-04 2011-09-13 Mannkind Corporation IRE-1α substrates
US8703734B2 (en) 2005-12-12 2014-04-22 The United States Of America, As Represented By The Secretary, Department Of Health And Human Services Nanoprobes for detection or modification of molecules
WO2007070572A2 (en) * 2005-12-12 2007-06-21 The Government Of The United States Of America, As Represented By The Secretary Of The Department Of Health And Human Services Probe for nucleic acid sequencing and methods of use
US20080309926A1 (en) * 2006-03-08 2008-12-18 Aaron Weber Systems and methods for reducing detected intensity non uniformity in a laser beam
US7397546B2 (en) * 2006-03-08 2008-07-08 Helicos Biosciences Corporation Systems and methods for reducing detected intensity non-uniformity in a laser beam
US20080091005A1 (en) * 2006-07-20 2008-04-17 Visigen Biotechnologies, Inc. Modified nucleotides, methods for making and using same
US20080241938A1 (en) * 2006-07-20 2008-10-02 Visigen Biotechnologies, Inc. Automated synthesis or sequencing apparatus and method for making and using same
US20080241951A1 (en) * 2006-07-20 2008-10-02 Visigen Biotechnologies, Inc. Method and apparatus for moving stage detection of single molecular events
US7897737B2 (en) 2006-12-05 2011-03-01 Lasergen, Inc. 3′-OH unblocked, nucleotides and nucleosides, base modified with photocleavable, terminating groups and methods for their use in DNA sequencing
US8349167B2 (en) 2006-12-14 2013-01-08 Life Technologies Corporation Methods and apparatus for detecting molecular interactions using FET arrays
US8262900B2 (en) 2006-12-14 2012-09-11 Life Technologies Corporation Methods and apparatus for measuring analytes using large scale FET arrays
US11339430B2 (en) 2007-07-10 2022-05-24 Life Technologies Corporation Methods and apparatus for measuring analytes using large scale FET arrays
EP2639578B1 (en) 2006-12-14 2016-09-14 Life Technologies Corporation Apparatus for measuring analytes using large scale fet arrays
US20100035268A1 (en) * 2007-02-21 2010-02-11 Joseph Beechem Materials and methods for single molecule nucleic acid sequencing
EP3543357A1 (en) 2007-05-08 2019-09-25 Trustees of Boston University Chemical functionalization of solid-state nanopores and nanopore arrays and applications thereof
WO2008147879A1 (en) * 2007-05-22 2008-12-04 Ryan Golhar Automated method and device for dna isolation, sequence determination, and identification
WO2008151023A2 (en) 2007-06-01 2008-12-11 Ibis Biosciences, Inc. Methods and compositions for multiple displacement amplification of nucleic acids
US8703422B2 (en) 2007-06-06 2014-04-22 Pacific Biosciences Of California, Inc. Methods and processes for calling bases in sequence by incorporation methods
EP2155855B1 (en) * 2007-06-06 2016-10-12 Pacific Biosciences of California, Inc. Methods and processes for calling bases in sequence by incorporation methods
EP2201136B1 (en) 2007-10-01 2017-12-06 Nabsys 2.0 LLC Nanopore sequencing by hybridization of probes to form ternary complexes and variable range alignment
EP2203568A1 (en) * 2007-10-22 2010-07-07 Life Technologies Corporation A method and system for obtaining ordered, segmented sequence fragments along a nucleic acid molecule
GB0721340D0 (en) * 2007-10-30 2007-12-12 Isis Innovation Polymerase-based single-molecule DNA sequencing
WO2009117031A2 (en) * 2007-12-18 2009-09-24 Advanced Analytical Technologies, Inc. System and method for nucleotide sequence profiling for sample identification
CN101519698B (en) * 2008-03-10 2013-05-22 周国华 Method for quantitatively measuring nucleic acid with sequence tags
CA2991818C (en) * 2008-03-28 2022-10-11 Pacific Biosciences Of California, Inc. Compositions and methods for nucleic acid sequencing
JP4586081B2 (en) * 2008-03-31 2010-11-24 株式会社日立ハイテクノロジーズ Fluorescence analyzer
US20090269746A1 (en) * 2008-04-25 2009-10-29 Gil Atzmon Microsequencer-whole genome sequencer
BRPI0915173B8 (en) 2008-06-11 2021-05-25 Agilent Technologies Inc nucleotide and nucleoside compounds and method of sequencing a target nucleic acid
JP5268444B2 (en) * 2008-06-23 2013-08-21 株式会社日立ハイテクノロジーズ Single molecule real-time sequence device, nucleic acid analyzer, and single molecule real-time sequence method
JP5667049B2 (en) 2008-06-25 2015-02-12 ライフ テクノロジーズ コーポレーション Method and apparatus for measuring analytes using large-scale FET arrays
WO2010002939A2 (en) * 2008-06-30 2010-01-07 Life Technologies Corporation Methods for real time single molecule sequencing
US20100036110A1 (en) * 2008-08-08 2010-02-11 Xiaoliang Sunney Xie Methods and compositions for continuous single-molecule nucleic acid sequencing by synthesis with fluorogenic nucleotides
US20100227327A1 (en) * 2008-08-08 2010-09-09 Xiaoliang Sunney Xie Methods and compositions for continuous single-molecule nucleic acid sequencing by synthesis with fluorogenic nucleotides
US20100035252A1 (en) * 2008-08-08 2010-02-11 Ion Torrent Systems Incorporated Methods for sequencing individual nucleic acids under tension
US8262879B2 (en) 2008-09-03 2012-09-11 Nabsys, Inc. Devices and methods for determining the length of biopolymers and distances between probes bound thereto
US9650668B2 (en) 2008-09-03 2017-05-16 Nabsys 2.0 Llc Use of longitudinally displaced nanoscale electrodes for voltage sensing of biomolecules and other analytes in fluidic channels
CN102186989B (en) * 2008-09-03 2021-06-29 纳伯塞斯2.0有限责任公司 Use of longitudinally displaced nanoscale electrodes for voltage sensing of biomolecules and other analytes in fluidic channels
US8383345B2 (en) 2008-09-12 2013-02-26 University Of Washington Sequence tag directed subassembly of short sequencing reads into long sequencing reads
US20100184045A1 (en) * 2008-09-23 2010-07-22 Helicos Biosciences Corporation Methods for sequencing degraded or modified nucleic acids
US20100137143A1 (en) * 2008-10-22 2010-06-03 Ion Torrent Systems Incorporated Methods and apparatus for measuring analytes
US20100301398A1 (en) 2009-05-29 2010-12-02 Ion Torrent Systems Incorporated Methods and apparatus for measuring analytes
AU2009319907B2 (en) 2008-11-03 2015-10-01 The Regents Of The University Of California Methods for detecting modification resistant nucleic acids
WO2010075188A2 (en) 2008-12-23 2010-07-01 Illumina Inc. Multibase delivery for long reads in sequencing by synthesis protocols
US8455260B2 (en) * 2009-03-27 2013-06-04 Massachusetts Institute Of Technology Tagged-fragment map assembly
US20100243449A1 (en) * 2009-03-27 2010-09-30 Oliver John S Devices and methods for analyzing biomolecules and probes bound thereto
WO2010129019A2 (en) * 2009-04-27 2010-11-11 Pacific Biosciences Of California, Inc. Real-time sequencing methods and systems
US20100279882A1 (en) * 2009-05-01 2010-11-04 Mostafa Ronaghi Sequencing methods
US8246799B2 (en) * 2009-05-28 2012-08-21 Nabsys, Inc. Devices and methods for analyzing biomolecules and probes bound thereto
US20120261274A1 (en) 2009-05-29 2012-10-18 Life Technologies Corporation Methods and apparatus for measuring analytes
US8673627B2 (en) 2009-05-29 2014-03-18 Life Technologies Corporation Apparatus and methods for performing electrochemical reactions
US8776573B2 (en) 2009-05-29 2014-07-15 Life Technologies Corporation Methods and apparatus for measuring analytes
WO2010141390A2 (en) 2009-06-05 2010-12-09 Life Technologies Corporation Nucleotide transient binding for sequencing methods
WO2010144151A2 (en) * 2009-06-12 2010-12-16 Pacific Biosciences Of California, Inc. Single-molecule real-time analysis of protein synthesis
US9416409B2 (en) 2009-07-31 2016-08-16 Ibis Biosciences, Inc. Capture primers and capture sequence linked solid supports for molecular diagnostic tests
CN102625838B (en) * 2009-08-14 2016-01-20 阿霹震中科技公司 For generating sample that rRNA eliminates or for from the method for sample separation rRNA, composition and test kit
WO2011028296A2 (en) * 2009-09-07 2011-03-10 Caerus Molecular Diagnostics Incorporated Sequence determination by use of opposing forces
WO2011078897A1 (en) * 2009-09-15 2011-06-30 Life Technologies Corporation Improved sequencing methods, compositions, systems, kits and apparatuses
WO2011040996A1 (en) 2009-09-30 2011-04-07 Quantapore, Inc. Ultrafast sequencing of biological polymers using a labeled nanopore
EP2488656B1 (en) 2009-10-15 2015-06-03 Ibis Biosciences, Inc. Multiple displacement amplification
US9127313B2 (en) 2009-12-01 2015-09-08 Oxford Nanopore Technologies Limited Biochemical analysis instrument
US8911972B2 (en) 2009-12-16 2014-12-16 Pacific Biosciences Of California, Inc. Sequencing methods using enzyme conformation
US8547533B2 (en) * 2009-12-28 2013-10-01 The United States Of America, As Represented By The Secretary, Department Of Health And Human Services Composite probes and use thereof in super resolution methods
WO2011093939A1 (en) 2010-02-01 2011-08-04 Illumina, Inc. Focusing methods and optical systems and assemblies using the same
US8603741B2 (en) 2010-02-18 2013-12-10 Pacific Biosciences Of California, Inc. Single molecule sequencing with two distinct chemistry steps
DE202011003570U1 (en) 2010-03-06 2012-01-30 Illumina, Inc. Systems and apparatus for detecting optical signals from a sample
CN103003449A (en) * 2010-05-06 2013-03-27 艾比斯生物科学公司 Integrated sample preparation systems and stabilized enzyme mixtures
US8865077B2 (en) 2010-06-11 2014-10-21 Industrial Technology Research Institute Apparatus for single-molecule detection
WO2011159942A1 (en) 2010-06-18 2011-12-22 Illumina, Inc. Conformational probes and methods for sequencing nucleic acids
US20120001646A1 (en) 2010-06-30 2012-01-05 Life Technologies Corporation Methods and apparatus for testing isfet arrays
EP2589084B1 (en) 2010-06-30 2016-11-16 Life Technologies Corporation Transistor circuits for detection and measurement of chemical reactions and compounds
US8858782B2 (en) 2010-06-30 2014-10-14 Life Technologies Corporation Ion-sensing charge-accumulation circuits and methods
US11307166B2 (en) 2010-07-01 2022-04-19 Life Technologies Corporation Column ADC
EP2589065B1 (en) 2010-07-03 2015-08-19 Life Technologies Corporation Chemically sensitive sensor with lightly doped drains
US11203786B2 (en) 2010-08-06 2021-12-21 Ariosa Diagnostics, Inc. Detection of target nucleic acids using hybridization
US8700338B2 (en) 2011-01-25 2014-04-15 Ariosa Diagnosis, Inc. Risk calculation for evaluation of fetal aneuploidy
US20140342940A1 (en) 2011-01-25 2014-11-20 Ariosa Diagnostics, Inc. Detection of Target Nucleic Acids using Hybridization
US20120034603A1 (en) 2010-08-06 2012-02-09 Tandem Diagnostics, Inc. Ligation-based detection of genetic variants
US20130261003A1 (en) 2010-08-06 2013-10-03 Ariosa Diagnostics, In. Ligation-based detection of genetic variants
US11031095B2 (en) 2010-08-06 2021-06-08 Ariosa Diagnostics, Inc. Assay systems for determination of fetal copy number variation
US20130040375A1 (en) 2011-08-08 2013-02-14 Tandem Diagnotics, Inc. Assay systems for genetic analysis
US10533223B2 (en) 2010-08-06 2020-01-14 Ariosa Diagnostics, Inc. Detection of target nucleic acids using hybridization
WO2012027623A2 (en) 2010-08-25 2012-03-01 Pacific Biosciences Of California, Inc. Cyanine dyes
EP2617061B1 (en) 2010-09-15 2021-06-30 Life Technologies Corporation Methods and apparatus for measuring analytes
US20120070830A1 (en) 2010-09-16 2012-03-22 lbis Biosciences, Inc. Stabilization of ozone-labile fluorescent dyes by thiourea
EP2616555B1 (en) 2010-09-16 2017-11-08 Gen-Probe Incorporated Capture probes immobilizable via l-nucleotide tail
CN103299182A (en) 2010-09-24 2013-09-11 生命科技公司 Matched pair transistor circuits
US8715933B2 (en) 2010-09-27 2014-05-06 Nabsys, Inc. Assay methods using nicking endonucleases
US8753816B2 (en) 2010-10-26 2014-06-17 Illumina, Inc. Sequencing methods
EP3388532B1 (en) 2010-11-01 2021-03-10 Gen-Probe Incorporated Integrated capture and amplification of target nucleic acid for sequencing
US9074251B2 (en) 2011-02-10 2015-07-07 Illumina, Inc. Linking sequence reads using paired code tags
CA2821299C (en) 2010-11-05 2019-02-12 Frank J. Steemers Linking sequence reads using paired code tags
EP2640849B1 (en) 2010-11-16 2016-04-06 Nabsys 2.0 LLC Methods for sequencing a biomolecule by detecting relative positions of hybridized probes
WO2012074855A2 (en) 2010-11-22 2012-06-07 The Regents Of The University Of California Methods of identifying a cellular nascent rna transcript
WO2012092265A1 (en) 2010-12-27 2012-07-05 Ibis Biosciences, Inc. Nucleic acid sample preparation methods and compositions
US8951781B2 (en) 2011-01-10 2015-02-10 Illumina, Inc. Systems, methods, and apparatuses to image a sample for biological or chemical analysis
US8756020B2 (en) 2011-01-25 2014-06-17 Ariosa Diagnostics, Inc. Enhanced risk probabilities using biomolecule estimations
US11270781B2 (en) 2011-01-25 2022-03-08 Ariosa Diagnostics, Inc. Statistical analysis for non-invasive sex chromosome aneuploidy determination
US9994897B2 (en) 2013-03-08 2018-06-12 Ariosa Diagnostics, Inc. Non-invasive fetal sex determination
US10131947B2 (en) 2011-01-25 2018-11-20 Ariosa Diagnostics, Inc. Noninvasive detection of fetal aneuploidy in egg donor pregnancies
WO2012104851A1 (en) 2011-01-31 2012-08-09 Yeda Research And Development Co. Ltd. Methods of diagnosing disease using overlap extension pcr
CN103403188B (en) 2011-01-31 2016-03-30 伊鲁米那股份有限公司 For reducing the method for nucleic acid damaging
JP6017458B2 (en) 2011-02-02 2016-11-02 ユニヴァーシティ・オブ・ワシントン・スルー・イッツ・センター・フォー・コマーシャリゼーション Mass parallel continuity mapping
WO2012109315A1 (en) * 2011-02-08 2012-08-16 Life Technologies Corporation Linking methods, compositions, systems, kits and apparatuses
US11274341B2 (en) 2011-02-11 2022-03-15 NABsys, 2.0 LLC Assay methods using DNA binding proteins
ES2538694T3 (en) 2011-04-01 2015-06-23 F. Hoffmann-La Roche Ag Variants of the T7 RNA polymerase with Cysteine-Serine substitutions
US8778848B2 (en) 2011-06-09 2014-07-15 Illumina, Inc. Patterned flow-cells useful for nucleic acid analysis
GB201113430D0 (en) 2011-08-03 2011-09-21 Fermentas Uab DNA polymerases
WO2013022778A1 (en) 2011-08-05 2013-02-14 Ibis Biosciences, Inc. Nucleic acid sequencing by electrochemical detection
CN103857805B (en) 2011-08-10 2017-07-18 生命技术公司 It polymerize enzymatic compositions, prepares and using the method for the polymerization enzymatic compositions
US11208636B2 (en) 2011-08-10 2021-12-28 Life Technologies Corporation Polymerase compositions, methods of making and using same
US8712697B2 (en) 2011-09-07 2014-04-29 Ariosa Diagnostics, Inc. Determination of copy number variations using binomial probability calculations
CN104024269B (en) 2011-09-13 2017-05-24 激光基因公司 5-methoxy. 3'-oh unblocked, fast photocleavable terminating nucleotides and methods for nucleic acid sequencing
US9164053B2 (en) * 2011-09-26 2015-10-20 The Regents Of The University Of California Electronic device for monitoring single molecule dynamics
WO2013049135A1 (en) 2011-09-26 2013-04-04 Gen-Probe Incorporated Algorithms for sequence determinations
WO2013063308A1 (en) 2011-10-25 2013-05-02 University Of Massachusetts An enzymatic method to enrich for capped rna, kits for performing same, and compositions derived therefrom
AU2012328662B2 (en) 2011-10-28 2015-12-17 Illumina, Inc. Microarray fabrication system and method
JP5863396B2 (en) * 2011-11-04 2016-02-16 株式会社日立製作所 DNA sequence decoding system, DNA sequence decoding method and program
US8637242B2 (en) 2011-11-07 2014-01-28 Illumina, Inc. Integrated sequencing apparatuses and methods of use
US9970984B2 (en) 2011-12-01 2018-05-15 Life Technologies Corporation Method and apparatus for identifying defects in a chemical sensor array
DK2788499T3 (en) 2011-12-09 2016-03-21 Illumina Inc Enhanced root for polymer tags
US9279154B2 (en) 2011-12-21 2016-03-08 Illumina, Inc. Apparatus and methods for kinetic analysis and determination of nucleic acid sequences
EP3211100A1 (en) 2011-12-22 2017-08-30 Ibis Biosciences, Inc. Amplification primers and methods
WO2013096799A1 (en) 2011-12-22 2013-06-27 Ibis Biosciences, Inc. Systems and methods for isolating nucleic acids from cellular samples
EP3269820A1 (en) 2011-12-22 2018-01-17 Ibis Biosciences, Inc. Kit for the amplification of a sequence from a ribonucleic acid
WO2013096819A2 (en) 2011-12-22 2013-06-27 Ibis Biosciences, Inc. Macromolecule positioning by electrical potential
WO2013102091A1 (en) 2011-12-28 2013-07-04 Ibis Biosciences, Inc. Nucleic acid ligation systems and methods
WO2013102081A2 (en) 2011-12-29 2013-07-04 Ibis Biosciences, Inc. Macromolecule delivery to nanowells
US9855559B2 (en) 2011-12-30 2018-01-02 Abbott Molecular Inc. Microorganism nucleic acid purification from host samples
US9822417B2 (en) 2012-01-09 2017-11-21 Oslo Universitetssykehus Hf Methods and biomarkers for analysis of colorectal cancer
CN104334739A (en) 2012-01-13 2015-02-04 Data生物有限公司 Genotyping by next-generation sequencing
US8747748B2 (en) 2012-01-19 2014-06-10 Life Technologies Corporation Chemical sensor with conductive cup-shaped sensor surface
US8821798B2 (en) 2012-01-19 2014-09-02 Life Technologies Corporation Titanium nitride as sensing layer for microwell structure
EP3511426B1 (en) 2012-02-01 2022-08-17 Gen-Probe Incorporated Asymmetric hairpin target capture oligomers
BR112014019168B1 (en) 2012-02-03 2021-10-05 California Institute Of Technology METHOD CAPABLE OF DETECTING IN A NON-DEGENERATED PRESENCE OR ABSENCE OF ANALYTES IN A SINGLE SAMPLE VOLUME AND METHOD OF DETECTION OF THE PRESENCE OR ABSENCE OF EACH ANALYTE AMONG A PLURALITY OF ANALYTES
NO2694769T3 (en) 2012-03-06 2018-03-03
US20130261984A1 (en) 2012-03-30 2013-10-03 Illumina, Inc. Methods and systems for determining fetal chromosomal abnormalities
EP4219012A1 (en) 2012-04-03 2023-08-02 Illumina, Inc. Method of imaging a substrate comprising fluorescent features and use of the method in nucleic acid sequencing
EP2834370B1 (en) 2012-04-03 2019-01-02 The Regents Of The University Of Michigan Biomarker associated with irritable bowel syndrome and crohn's disease
US20130274148A1 (en) 2012-04-11 2013-10-17 Illumina, Inc. Portable genetic detection and analysis system and method
EP3783111A1 (en) 2012-05-02 2021-02-24 Ibis Biosciences, Inc. Dna sequencing
ES2683978T3 (en) 2012-05-02 2018-10-01 Ibis Biosciences, Inc. DNA sequencing
WO2013166303A1 (en) 2012-05-02 2013-11-07 Ibis Biosciences, Inc. Dna sequencing
EP2844767A4 (en) 2012-05-02 2015-11-18 Ibis Biosciences Inc Nucleic acid sequencing systems and methods
US9315864B2 (en) 2012-05-18 2016-04-19 Pacific Biosciences Of California, Inc. Heteroarylcyanine dyes with sulfonic acid substituents
EP2850086B1 (en) 2012-05-18 2023-07-05 Pacific Biosciences Of California, Inc. Heteroarylcyanine dyes
US10289800B2 (en) 2012-05-21 2019-05-14 Ariosa Diagnostics, Inc. Processes for calculating phased fetal genomic sequences
US8786331B2 (en) 2012-05-29 2014-07-22 Life Technologies Corporation System for reducing noise in a chemical sensor array
US9012022B2 (en) 2012-06-08 2015-04-21 Illumina, Inc. Polymer coatings
US8895249B2 (en) 2012-06-15 2014-11-25 Illumina, Inc. Kinetic exclusion amplification of nucleic acid libraries
WO2014005076A2 (en) 2012-06-29 2014-01-03 The Regents Of The University Of Michigan Methods and biomarkers for detection of kidney disorders
WO2014008448A1 (en) 2012-07-03 2014-01-09 Sloan Kettering Institute For Cancer Research Quantitative assessment of human t-cell repertoire recovery after allogeneic hematopoietic stem cell transplantation
CN104583421A (en) 2012-07-19 2015-04-29 阿瑞奥萨诊断公司 Multiplexed sequential ligation-based detection of genetic variants
NL2017959B1 (en) 2016-12-08 2018-06-19 Illumina Inc Cartridge assembly
AU2013306373B2 (en) 2012-08-20 2017-09-07 Illumina, Inc. Method and system for fluorescence lifetime based sequencing
JP6510978B2 (en) 2012-10-16 2019-05-08 アボツト・モレキユラー・インコーポレイテツド Method and apparatus for sequencing nucleic acids
US9181583B2 (en) 2012-10-23 2015-11-10 Illumina, Inc. HLA typing using selective amplification and sequencing
US9651539B2 (en) 2012-10-28 2017-05-16 Quantapore, Inc. Reducing background fluorescence in MEMS materials by low energy ion beam treatment
US9605309B2 (en) * 2012-11-09 2017-03-28 Genia Technologies, Inc. Nucleic acid sequencing using tags
US9914966B1 (en) 2012-12-20 2018-03-13 Nabsys 2.0 Llc Apparatus and methods for analysis of biomolecules using high frequency alternating current excitation
US9080968B2 (en) 2013-01-04 2015-07-14 Life Technologies Corporation Methods and systems for point of use removal of sacrificial material
US9841398B2 (en) 2013-01-08 2017-12-12 Life Technologies Corporation Methods for manufacturing well structures for low-noise chemical sensors
US9683230B2 (en) 2013-01-09 2017-06-20 Illumina Cambridge Limited Sample preparation on a solid support
EP2956550B1 (en) 2013-01-18 2020-04-08 Nabsys 2.0 LLC Enhanced probe binding
US9805407B2 (en) 2013-01-25 2017-10-31 Illumina, Inc. Methods and systems for using a cloud computing environment to configure and sell a biological sample preparation cartridge and share related data
US9512422B2 (en) 2013-02-26 2016-12-06 Illumina, Inc. Gel patterned surfaces
US8963216B2 (en) 2013-03-13 2015-02-24 Life Technologies Corporation Chemical sensor with sidewall spacer sensor surface
WO2014142850A1 (en) 2013-03-13 2014-09-18 Illumina, Inc. Methods and compositions for nucleic acid sequencing
WO2014142841A1 (en) 2013-03-13 2014-09-18 Illumina, Inc. Multilayer fluidic devices and methods for their fabrication
WO2014160117A1 (en) 2013-03-14 2014-10-02 Abbott Molecular Inc. Multiplex methylation-specific amplification systems and methods
EP2971070B2 (en) 2013-03-14 2021-03-03 Illumina, Inc. Modified polymerases for improved incorporation of nucleotide analogues
WO2014152937A1 (en) 2013-03-14 2014-09-25 Ibis Biosciences, Inc. Nucleic acid control panels
WO2014149779A1 (en) 2013-03-15 2014-09-25 Life Technologies Corporation Chemical device with thin conductive element
ES2716094T3 (en) 2013-03-15 2019-06-10 Ibis Biosciences Inc Methods to analyze contamination in DNA sequencing
US9193998B2 (en) 2013-03-15 2015-11-24 Illumina, Inc. Super resolution imaging
US9835585B2 (en) 2013-03-15 2017-12-05 Life Technologies Corporation Chemical sensor with protruded sensor surface
US20140274747A1 (en) 2013-03-15 2014-09-18 Illumina, Inc. Super resolution imaging
CA2905410A1 (en) 2013-03-15 2014-09-25 Abbott Molecular Inc. Systems and methods for detection of genomic copy number changes
KR102230831B1 (en) 2013-03-15 2021-03-22 일루미나 케임브리지 리미티드 Modified nucleosides or nucleotides
JP2016510895A (en) 2013-03-15 2016-04-11 ライフ テクノロジーズ コーポレーション Chemical sensor with consistent sensor surface area
US20140336063A1 (en) 2013-05-09 2014-11-13 Life Technologies Corporation Windowed Sequencing
WO2014190322A2 (en) 2013-05-24 2014-11-27 Quantapore, Inc. Nanopore-based nucleic acid analysis with mixed fret detection
US10458942B2 (en) 2013-06-10 2019-10-29 Life Technologies Corporation Chemical sensor array having multiple sensors per well
PT3017065T (en) 2013-07-01 2018-12-18 Illumina Inc Catalyst-free surface functionalization and polymer grafting
CA2914248C (en) 2013-07-03 2021-09-07 Illumina, Inc. Sequencing by orthogonal synthesis
CN103333680B (en) * 2013-07-16 2015-05-27 北京化工大学 Dinyl oxazole eutectic material with multi-color fluorescence characteristic and preparation method of dinyl oxazole eutectic material
KR101869090B1 (en) 2013-08-08 2018-06-20 일루미나, 인코포레이티드 Fluidic system for reagent delivery to a flow cell
WO2015026853A2 (en) 2013-08-19 2015-02-26 Abbott Molecular Inc. Next-generation sequencing libraries
CA2920390A1 (en) 2013-08-30 2015-03-05 Illumina, Inc. Manipulation of droplets on hydrophilic or variegated-hydrophilic surfaces
WO2015048763A1 (en) 2013-09-30 2015-04-02 Life Technologies Corporation Polymerase compositions, methods of making and using same
US10174381B2 (en) 2013-10-18 2019-01-08 The Regents Of The University Of Michigan Systems and methods for determining a treatment course of action
US10540783B2 (en) * 2013-11-01 2020-01-21 Illumina, Inc. Image analysis useful for patterned objects
PL3077943T3 (en) 2013-12-03 2020-11-30 Illumina, Inc. Methods and systems for analyzing image data
AU2014364006B2 (en) 2013-12-10 2019-07-11 Illumina, Inc. Biosensors for biological or chemical analysis and methods of manufacturing the same
US10768181B2 (en) 2013-12-17 2020-09-08 The Brigham And Women's Hospital, Inc. Detection of an antibody against a pathogen
WO2015095226A2 (en) 2013-12-20 2015-06-25 Illumina, Inc. Preserving genomic connectivity information in fragmented genomic dna samples
AU2014369879B2 (en) 2013-12-23 2020-12-24 Illumina, Inc. Structured substrates for improving detection of light emissions and methods relating to the same
KR102001554B1 (en) 2014-01-16 2019-07-18 일루미나, 인코포레이티드 Amplicon preparation and sequencing on solid supports
CA2935720A1 (en) * 2014-01-16 2015-07-23 Illumina, Inc. Gene expression panel for prognosis of prostate cancer recurrence
US9677132B2 (en) 2014-01-16 2017-06-13 Illumina, Inc. Polynucleotide modification on solid support
WO2015107430A2 (en) 2014-01-16 2015-07-23 Oslo Universitetssykehus Hf Methods and biomarkers for detection and prognosis of cervical cancer
WO2015126766A1 (en) 2014-02-18 2015-08-27 Illumina, Inc. Methods and compositions for dna profiling
EP3116651B1 (en) 2014-03-11 2020-04-22 Illumina, Inc. Disposable, integrated microfluidic cartridge and methods of making it
SG11201609053YA (en) 2014-04-29 2016-11-29 Illumina Inc Multiplexed single cell gene expression analysis using template switch and tagmentation
SG11201610168YA (en) 2014-05-16 2017-01-27 Illumina Inc Nucleic acid synthesis techniques
EP3148697A1 (en) 2014-05-27 2017-04-05 Illumina, Inc. Systems and methods for biochemical analysis including a base instrument and a removable cartridge
WO2015187670A2 (en) 2014-06-03 2015-12-10 Illumina, Inc. Compositions, systems, and methods for detecting events using tethers anchored to or adjacent to nanopores
WO2015188178A1 (en) 2014-06-06 2015-12-10 The Regents Of The University Of Michigan Compositions and methods for characterizing and diagnosing periodontal disease
US20150353989A1 (en) 2014-06-09 2015-12-10 Illumina Cambridge Limited Sample preparation for nucleic acid amplification
EP3155125A1 (en) 2014-06-13 2017-04-19 Illumina Cambridge Limited Methods and compositions for preparing sequencing libraries
CA2953362A1 (en) 2014-06-23 2015-12-30 The General Hospital Corporation Genomewide unbiased identification of dsbs evaluated by sequencing (guide-seq)
CN114214314A (en) 2014-06-24 2022-03-22 生物辐射实验室股份有限公司 Digital PCR barcoding
US10017759B2 (en) 2014-06-26 2018-07-10 Illumina, Inc. Library preparation of tagged nucleic acid
WO2015200693A1 (en) 2014-06-27 2015-12-30 Illumina, Inc. Modified polymerases for improved incorporation of nucleotide analogues
WO2015200833A2 (en) 2014-06-27 2015-12-30 Abbott Laboratories COMPOSITIONS AND METHODS FOR DETECTING HUMAN PEGIVIRUS 2 (HPgV-2)
WO2016003814A1 (en) 2014-06-30 2016-01-07 Illumina, Inc. Methods and compositions using one-sided transposition
EP3828279B1 (en) 2014-07-15 2022-11-16 Illumina, Inc. Biochemically activated electronic device
EP3172321B2 (en) 2014-07-21 2023-01-04 Illumina, Inc. Polynucleotide enrichment using crispr-cas systems
ES2962636T3 (en) 2014-07-24 2024-03-20 Abbott Molecular Inc Methods for the detection and analysis of mycobacterium tuberculosis
GB201414098D0 (en) 2014-08-08 2014-09-24 Illumina Cambridge Ltd Modified nucleotide linkers
EP3183577B1 (en) 2014-08-21 2020-08-19 Illumina Cambridge Limited Reversible surface functionalization
WO2016040602A1 (en) 2014-09-11 2016-03-17 Epicentre Technologies Corporation Reduced representation bisulfite sequencing using uracil n-glycosylase (ung) and endonuclease iv
US10633694B2 (en) 2014-09-12 2020-04-28 Illumina, Inc. Compositions, systems, and methods for detecting the presence of polymer subunits using chemiluminescence
JP6802154B2 (en) 2014-09-18 2020-12-16 イラミーナ インコーポレーテッド Methods and systems for analyzing nucleic acid sequencing data
WO2016054096A1 (en) 2014-09-30 2016-04-07 Illumina, Inc. Modified polymerases for improved incorporation of nucleotide analogues
WO2016153999A1 (en) 2015-03-25 2016-09-29 Life Technologies Corporation Modified nucleotides and uses thereof
EP3204148B1 (en) 2014-10-09 2020-07-08 Illumina, Inc. Method and device for separating immiscible liquids to effectively isolate at least one of the liquids
CN107109472B (en) 2014-10-10 2021-05-11 昆塔波尔公司 Nanopore-based polymer analysis using mutually quenching fluorescent labels
US9897791B2 (en) 2014-10-16 2018-02-20 Illumina, Inc. Optical scanning systems for in situ genetic analysis
RU2709655C2 (en) 2014-10-17 2019-12-19 Иллумина Кембридж Лимитед Transposition with preservation of gene adhesion
AU2015335616B2 (en) 2014-10-24 2019-09-12 Quantapore, Inc. Efficient optical analysis of polymers using arrays of nanostructures
CN107108822B (en) 2014-10-31 2020-02-07 伊鲁米纳剑桥有限公司 Polymer and DNA copolymer coating
GB201419731D0 (en) 2014-11-05 2014-12-17 Illumina Cambridge Ltd Sequencing from multiple primers to increase data rate and density
DK3215616T3 (en) 2014-11-05 2020-03-02 Illumina Cambridge Ltd Reducing DNA damage during sample preparation and sequencing using siderophore chelators
CN107208019B (en) 2014-11-11 2021-01-01 伊鲁米纳剑桥有限公司 Methods and arrays for the generation and sequencing of nucleic acid monoclonal clusters
EP3218513B1 (en) 2014-11-11 2018-10-31 Illumina, Inc. Polynucleotide amplification using crispr-cas systems
RU2582198C1 (en) * 2014-11-20 2016-04-20 Федеральное государственное бюджетное учреждение науки Лимнологический институт Сибирского отделения Российской академии наук (ЛИН СО РАН) Analogues of natural deoxyribonucleoside triphosphates and ribonucleoside triphosphates containing reporter fluorescent groups, for use in analytical bioorganic chemistry
CN104458686B (en) * 2014-12-02 2017-01-18 公安部第一研究所 DNA fluorescence spectrum collecting method based on characteristic molecular weight interior label quantitative analysis
EP3882356A1 (en) 2014-12-15 2021-09-22 Illumina, Inc. Compositions and methods for single molecular placement on a substrate
EP3795681B1 (en) 2014-12-16 2024-09-18 Life Technologies Corporation Polymerase compositions and methods of making and using same
EP4354131A3 (en) 2014-12-18 2024-06-26 Life Technologies Corporation High data rate integrated circuit with transmitter configuration
US10077472B2 (en) 2014-12-18 2018-09-18 Life Technologies Corporation High data rate integrated circuit with power management
CN107407656B (en) 2014-12-18 2020-04-07 生命科技公司 Method and apparatus for measuring analytes using large scale FET arrays
KR20200020997A (en) 2015-02-10 2020-02-26 일루미나, 인코포레이티드 The method and the composition for analyzing the cellular constituent
US10208339B2 (en) 2015-02-19 2019-02-19 Takara Bio Usa, Inc. Systems and methods for whole genome amplification
EP3822361A1 (en) 2015-02-20 2021-05-19 Takara Bio USA, Inc. Method for rapid accurate dispensing, visualization and analysis of single cells
EP3271073B1 (en) 2015-03-20 2019-06-12 Illumina, Inc. Fluidics cartridge for use in the vertical or substantially vertical position
CA3225867A1 (en) 2015-03-24 2016-09-29 Illumina, Inc. Methods, carrier assemblies, and systems for imaging samples for biological or chemical analysis
EP3277839B1 (en) 2015-03-31 2020-12-02 Illumina Cambridge Limited Surface concatamerization of templates
CN112229834A (en) 2015-04-14 2021-01-15 亿明达股份有限公司 Structured substrate for improved detection of light emission and method relating thereto
KR20170136555A (en) 2015-04-15 2017-12-11 더 제너럴 하스피탈 코포레이션 LNA-based mutant enrichment next-gen sequencing assays
US10844428B2 (en) 2015-04-28 2020-11-24 Illumina, Inc. Error suppression in sequenced DNA fragments using redundant reads with unique molecular indices (UMIS)
DK3822365T3 (en) 2015-05-11 2023-02-06 Illumina Inc Platform for discovery and analysis of therapeutics
CA2987397C (en) 2015-05-29 2022-03-01 Illumina, Inc. Sample carrier and assay system for conducting designated reactions
US10640809B2 (en) 2015-05-29 2020-05-05 Epicentre Technologies Corporation Methods of analyzing nucleic acids
JP6609641B2 (en) 2015-05-29 2019-11-20 イルミナ ケンブリッジ リミテッド Advanced use of surface primers in clusters
IL255758B (en) 2015-06-03 2022-07-01 Illumina Inc Compositions, systems, and methods for sequencing polynucleotides using tethers anchored to polymerases adjacent to nanopores
EP3878974A1 (en) 2015-07-06 2021-09-15 Illumina Cambridge Limited Sample preparation for nucleic acid amplification
WO2017007757A1 (en) 2015-07-06 2017-01-12 Illumina, Inc. Balanced ac modulation for driving droplet operations electrodes
US10808282B2 (en) 2015-07-07 2020-10-20 Illumina, Inc. Selective surface patterning via nanoimprinting
EP3988658A1 (en) 2015-07-14 2022-04-27 Abbott Molecular Inc. Purification of nucleic acids using copper-titanium oxides
EP3322483A4 (en) 2015-07-14 2019-01-02 Abbott Molecular Inc. Compositions and methods for identifying drug resistant tuberculosis
EP3325648B1 (en) 2015-07-17 2023-03-29 Illumina, Inc. Polymer sheets for sequencing applications
US10077470B2 (en) 2015-07-21 2018-09-18 Omniome, Inc. Nucleic acid sequencing methods and systems
CN107849605B (en) * 2015-07-21 2021-08-20 欧姆尼欧美公司 Nucleic acid sequencing methods and systems
DK3329012T3 (en) 2015-07-27 2021-10-11 Illumina Inc Spatial mapping of nucleic acid sequence information
JP6712606B2 (en) 2015-07-30 2020-06-24 イラミーナ インコーポレーテッド Orthogonal deblocking of nucleotides
AU2016308460B2 (en) 2015-08-14 2020-02-06 Illumina, Inc. Systems and methods using magnetically-responsive sensors for determining a genetic characteristic
US10976334B2 (en) 2015-08-24 2021-04-13 Illumina, Inc. In-line pressure accumulator and flow-control system for biological or chemical assays
EP4086357A1 (en) 2015-08-28 2022-11-09 Illumina, Inc. Nucleic acid sequence analysis from single cells
EP3344389B1 (en) 2015-09-02 2020-06-10 Illumina Cambridge Limited Method of fixing defects in a hydrophobic surface of a droplet actuator
WO2017044843A1 (en) 2015-09-11 2017-03-16 The General Hospital Corporation Full interrogation of nuclease dsbs and sequencing (find-seq)
US10450598B2 (en) 2015-09-11 2019-10-22 Illumina, Inc. Systems and methods for obtaining a droplet having a designated concentration of a substance-of-interest
CA3000762A1 (en) 2015-09-30 2017-04-06 The General Hospital Corporation Comprehensive in vitro reporting of cleavage events by sequencing (circle-seq)
CN116218815A (en) 2015-10-01 2023-06-06 生命技术公司 Polymerase compositions and kits and methods of use and manufacture thereof
CA3002983A1 (en) 2015-10-22 2017-04-27 Illumina, Inc. Filler fluid for fluidic devices
US11203016B2 (en) 2015-12-01 2021-12-21 Illumina, Inc. Digital microfluidic system for single-cell isolation and characterization of analytes
AU2016370835B2 (en) 2015-12-17 2020-02-13 Illumina, Inc. Distinguishing methylation levels in complex biological samples
US10730030B2 (en) 2016-01-08 2020-08-04 Bio-Rad Laboratories, Inc. Multiple beads per droplet resolution
KR20180108578A (en) 2016-01-11 2018-10-04 일루미나, 인코포레이티드 A microfluorescent meter, a fluid system, and a flow cell latch clamp module
WO2017139260A1 (en) 2016-02-08 2017-08-17 RGENE, Inc. Multiple ligase compositions, systems, and methods
US20190062827A1 (en) 2016-03-14 2019-02-28 RGENE, Inc. HYPER-THERMOSTABLE LYSINE-MUTANT ssDNA/RNA LIGASES
CN108474744B (en) 2016-03-24 2019-11-05 伊鲁米那股份有限公司 Equipment and constituent and its application method used in the luminescence imaging based on photon superlattices
DK3377226T3 (en) 2016-03-28 2021-04-26 Illumina Inc Micro rows in several levels
WO2017177017A1 (en) 2016-04-07 2017-10-12 Omniome, Inc. Methods of quantifying target nucleic acids and identifying sequence variants
CA3016221C (en) 2016-04-07 2021-09-28 Illumina, Inc. Methods and systems for construction of normalized nucleic acid libraries
US10982264B2 (en) 2016-04-22 2021-04-20 Omniome, Inc. Nucleic acid sequencing method and system employing enhanced detection of nucleotide-specific ternary complex formation
US11579336B2 (en) 2016-04-22 2023-02-14 Illumina, Inc. Photonic structure-based devices and compositions for use in luminescent imaging of multiple sites within a pixel, and methods of using the same
US10294514B2 (en) 2016-04-29 2019-05-21 Omniome, Inc. Sequencing method employing ternary complex destabilization to identify cognate nucleotides
EP4269611A3 (en) 2016-05-11 2024-01-17 Illumina, Inc. Polynucleotide enrichment and amplification using argonaute systems
EP3458913B1 (en) 2016-05-18 2020-12-23 Illumina, Inc. Self assembled patterning using patterned hydrophobic surfaces
JP7133473B2 (en) * 2016-05-20 2022-09-08 クアンタム-エスアイ インコーポレイテッド Labeled Nucleotide Compositions and Methods for Determining the Sequence of Nucleic Acids
US10823721B2 (en) 2016-07-05 2020-11-03 Quantapore, Inc. Optically based nanopore sequencing
WO2018013509A1 (en) 2016-07-11 2018-01-18 Arizona Board Of Regents On Behalf Of The University Of Arizona Compositions and methods for diagnosing and treating arrhythmias
EP3875603A1 (en) 2016-07-12 2021-09-08 Life Technologies Corporation Compositions and methods for detecting nucleic acid regions
JP7075394B2 (en) 2016-07-21 2022-05-25 タカラ バイオ ユーエスエー, インコーポレイテッド Multi-Z imaging and dispensing using a multi-well device
US11535883B2 (en) 2016-07-22 2022-12-27 Illumina, Inc. Single cell whole genome libraries and combinatorial indexing methods of making thereof
US10428378B2 (en) 2016-08-15 2019-10-01 Omniome, Inc. Sequencing method for rapid identification and processing of cognate nucleotide pairs
CA3033741A1 (en) 2016-08-15 2018-02-22 Omniome, Inc. Method and system for sequencing nucleic acids
US11543417B2 (en) 2016-08-29 2023-01-03 Oslo Universitetssykehus Hf ChIP-seq assays
WO2018064116A1 (en) 2016-09-28 2018-04-05 Illumina, Inc. Methods and systems for data compression
MY194951A (en) 2016-10-14 2022-12-28 Illumina Inc Cartridge assembly
AU2017345562B2 (en) 2016-10-19 2024-01-25 Illumina Singapore Pte. Ltd. Methods for chemical ligation of nucleic acids
BR112019009949A2 (en) 2016-11-16 2019-08-20 Illumina Inc computer-implemented method for validating variant calls and system for validating variant calls
GB201619458D0 (en) 2016-11-17 2017-01-04 Spatial Transcriptomics Ab Method for spatial tagging and analysing nucleic acids in a biological specimen
DK3551753T3 (en) 2016-12-09 2022-09-05 Broad Inst Inc DIAGNOSTICS BASED ON CRISPR EFFECTOR SYSTEM
CN110139932B (en) 2016-12-19 2024-05-17 生物辐射实验室股份有限公司 Drop-on labeled DNA with maintained adjacency
CA3046532A1 (en) 2016-12-22 2018-06-28 Illumina, Inc. Arrays including a resin film and a patterned polymer layer
JP7126501B2 (en) 2016-12-22 2022-08-26 イラミーナ インコーポレーテッド Array with quality control tracer
CA3046533A1 (en) 2016-12-22 2018-06-28 Illumina, Inc. Array including sequencing primer and non-sequencing entity
AU2017385424B2 (en) * 2016-12-27 2024-06-27 Qingdao MGI Tech Co., Ltd. Single fluorescent dye based sequencing method
EP3562962B1 (en) 2016-12-30 2022-01-05 Omniome, Inc. Method and system employing distinguishable polymerases for detecting ternary complexes and identifying cognate nucleotides
GB201704754D0 (en) 2017-01-05 2017-05-10 Illumina Inc Kinetic exclusion amplification of nucleic acid libraries
KR102385560B1 (en) 2017-01-06 2022-04-11 일루미나, 인코포레이티드 Paging Correction
EP3568490B1 (en) 2017-01-10 2021-03-10 Omniome, Inc. Polymerases engineered to reduce nucleotide-independent dna binding
BR112019014042A2 (en) 2017-01-17 2020-02-04 Illumina Inc determination of oncogenic splice variant
EP3889962A1 (en) 2017-01-18 2021-10-06 Illumina, Inc. Methods and systems for generation and error-correction of unique molecular index sets with heterogeneous molecular lengths
WO2018136487A1 (en) 2017-01-20 2018-07-26 Omniome, Inc. Process for cognate nucleotide detection in a nucleic acid sequencing workflow
SG11201906567YA (en) 2017-01-20 2019-08-27 Omniome Inc Allele-specific capture of nucleic acids
AU2017394645B2 (en) 2017-01-20 2020-01-23 Pacific Biosciences Of California, Inc. Genotyping by polymerase binding
EP3354746B1 (en) 2017-01-30 2019-05-29 Gregor Mendel Institute of Molecular Plant Biology GmbH Novel spike-in oligonucleotides for normalization of sequence data
GB201701689D0 (en) 2017-02-01 2017-03-15 Illumia Inc System and method with fiducials of non-closed shapes
GB201701686D0 (en) 2017-02-01 2017-03-15 Illunina Inc System & method with fiducials having offset layouts
GB201701688D0 (en) 2017-02-01 2017-03-15 Illumia Inc System and method with fiducials in non-recliner layouts
US11492666B2 (en) 2017-02-15 2022-11-08 Pacific Biosciences Of California, Inc. Distinguishing sequences by detecting polymerase dissociation
ES2933806T3 (en) 2017-02-21 2023-02-14 Illumina Inc Tagmentation using linker-immobilized transposomes
US11104937B2 (en) 2017-03-15 2021-08-31 The Broad Institute, Inc. CRISPR effector system based diagnostics
US11174515B2 (en) 2017-03-15 2021-11-16 The Broad Institute, Inc. CRISPR effector system based diagnostics
CA3056411A1 (en) 2017-03-15 2018-09-20 The Broad Institute, Inc. Crispr effector system based diagnostics for virus detection
US11021740B2 (en) 2017-03-15 2021-06-01 The Broad Institute, Inc. Devices for CRISPR effector system based diagnostics
EP3601560A1 (en) 2017-03-20 2020-02-05 Illumina, Inc. Methods and compositions for preparing nucleic acid libraries
CN110446787A (en) 2017-03-24 2019-11-12 生物辐射实验室股份有限公司 General clamp primers
WO2018187013A1 (en) 2017-04-04 2018-10-11 Omniome, Inc. Fluidic apparatus and methods useful for chemical and biological reactions
CA3060979C (en) 2017-04-23 2023-07-11 Illumina Cambridge Limited Compositions and methods for improving sample identification in indexed nucleic acid libraries
CA3059952C (en) 2017-04-23 2023-04-18 Illumina Cambridge Limited Compositions and methods for improving sample identification in indexed nucleic acid libraries
WO2018197945A1 (en) 2017-04-23 2018-11-01 Illumina Cambridge Limited Compositions and methods for improving sample identification in indexed nucleic acid libraries
US9951385B1 (en) 2017-04-25 2018-04-24 Omniome, Inc. Methods and apparatus that increase sequencing-by-binding efficiency
US10161003B2 (en) 2017-04-25 2018-12-25 Omniome, Inc. Methods and apparatus that increase sequencing-by-binding efficiency
SG11201909697TA (en) 2017-05-01 2019-11-28 Illumina Inc Optimal index sequences for multiplex massively parallel sequencing
DK3622089T3 (en) 2017-05-08 2024-10-14 Illumina Inc PROCEDURE FOR SEQUENCE USING UNIVERSAL SHORT ADAPTERS FOR INDEXING POLYNUCLEOTIDE SAMPLES
KR102628035B1 (en) 2017-06-07 2024-01-22 오레곤 헬스 앤드 사이언스 유니버시티 Single cell whole genome library for methylation sequencing
US11834653B2 (en) 2017-06-08 2023-12-05 The Brigham And Women's Hospital, Inc. Methods and compositions for identifying epitopes
WO2018236631A1 (en) 2017-06-20 2018-12-27 Illumina, Inc. Methods and compositions for addressing inefficiencies in amplification reactions
EP3642363A1 (en) 2017-06-20 2020-04-29 Bio-Rad Laboratories, Inc. Mda using bead oligonucleotide
JP7390285B2 (en) 2017-06-26 2023-12-01 ウニヴェルズィテート・フューア・ボーデンクルトゥーア・ウィーン Novel biomarker for detecting senescent cells
WO2019018366A1 (en) 2017-07-18 2019-01-24 Omniome, Inc. Method of chemically modifying plastic surfaces
CN111148850A (en) 2017-07-24 2020-05-12 宽腾矽公司 High intensity labeled reactant compositions and methods for sequencing
WO2019027767A1 (en) 2017-07-31 2019-02-07 Illumina Inc. Sequencing system with multiplexed biological sample aggregation
JP6998404B2 (en) 2017-08-01 2022-02-04 深▲セン▼恒特基因有限公司 Method for enriching and determining the target nucleotide sequence
NZ759924A (en) 2017-08-01 2023-07-28 Illumina Inc Hydrogel beads for nucleotide sequencing
US11352668B2 (en) 2017-08-01 2022-06-07 Illumina, Inc. Spatial indexing of genetic material and library preparation using hydrogel beads and flow cells
US10858701B2 (en) 2017-08-15 2020-12-08 Omniome, Inc. Scanning apparatus and method useful for detection of chemical and biological analytes
US11447818B2 (en) 2017-09-15 2022-09-20 Illumina, Inc. Universal short adapters with variable length non-random unique molecular identifiers
WO2019060640A1 (en) 2017-09-20 2019-03-28 Guardant Health, Inc. Methods and systems for differentiating somatic and germline variants
KR102526103B1 (en) 2017-10-16 2023-04-26 일루미나, 인코포레이티드 Deep learning-based splice site classification
AU2018350891B9 (en) 2017-10-16 2022-05-19 Illumina, Inc. Deep learning-based techniques for training deep convolutional neural networks
CA3079411C (en) 2017-10-19 2023-12-05 Omniome, Inc. Simultaneous background reduction and complex stabilization in binding assay workflows
WO2019089959A1 (en) 2017-11-02 2019-05-09 Bio-Rad Laboratories, Inc. Transposase-based genomic analysis
WO2019136376A1 (en) 2018-01-08 2019-07-11 Illumina, Inc. High-throughput sequencing with semiconductor-based detection
DK3622272T3 (en) 2018-01-08 2021-11-15 Illumina Inc FACILITIES FOR HIGH-PURPOSE SEQUENCE WITH SEMICONDUCTOR-BASED DETECTION
SG11201911805VA (en) 2018-01-15 2020-01-30 Illumina Inc Deep learning-based variant classifier
EP3746568B1 (en) 2018-01-29 2023-11-15 The Broad Institute, Inc. Crispr effector system based diagnostics
WO2019152395A1 (en) 2018-01-31 2019-08-08 Bio-Rad Laboratories, Inc. Methods and compositions for deconvoluting partition barcodes
CA3089267A1 (en) 2018-02-06 2019-08-15 Omniome, Inc. Compositions and techniques for nucleic acid primer extension
DK3752634T3 (en) 2018-02-13 2022-10-24 Illumina Inc DNA Sequencing Using Hydrogel Beads / DNA sequencing using hydrogel beads
JP6974504B2 (en) 2018-04-02 2021-12-01 イルミナ インコーポレイテッド Compositions and Methods for Making Controls for Sequence-Based Genetic Testing
US20190318806A1 (en) 2018-04-12 2019-10-17 Illumina, Inc. Variant Classifier Based on Deep Neural Networks
US11512002B2 (en) 2018-04-18 2022-11-29 University Of Virginia Patent Foundation Silica materials and methods of making thereof
US20190338352A1 (en) 2018-04-19 2019-11-07 Omniome, Inc. Accuracy of base calls in nucleic acid sequencing methods
CA3067181A1 (en) 2018-04-20 2019-10-24 Illumina, Inc. Methods of encapsulating single cells, the encapsulated cells and uses thereof
CA3098296A1 (en) 2018-04-26 2019-10-31 Omniome, Inc. Methods and compositions for stabilizing nucleic acid-nucleotide-polymerase complexes
EP3788377A1 (en) 2018-05-04 2021-03-10 Abbott Laboratories Hbv diagnostic, prognostic, and therapeutic methods and products
FI3794012T3 (en) 2018-05-15 2024-01-08 Illumina Inc Compositions and methods for chemical cleavage and deprotection of surface-bound oligonucleotides
BR112020005982A2 (en) 2018-05-17 2020-12-08 Illumina, Inc. HIGH PERFORMANCE SINGLE CELL SEQUENCING WITH REDUCED AMPLIFICATION BIAS
AU2019275457B2 (en) 2018-05-25 2021-11-04 Illumina, Inc. Circulating RNA signatures specific to preeclampsia
JP2021525078A (en) 2018-05-31 2021-09-24 オムニオム インコーポレイテッドOmniome, Inc. Increased signal vs. noise in nucleic acid sequencing
SG11202000905PA (en) 2018-06-04 2020-02-27 Illumina Inc High-throughput single-cell transcriptome libraries and methods of making and of using
WO2019236478A1 (en) 2018-06-04 2019-12-12 Guardant Health, Inc. Methods and systems for determining the cellular origin of cell-free nucleic acids
US12073922B2 (en) 2018-07-11 2024-08-27 Illumina, Inc. Deep learning-based framework for identifying sequence patterns that cause sequence-specific errors (SSEs)
CA3107983A1 (en) 2018-07-23 2020-01-30 Guardant Health, Inc. Methods and systems for adjusting tumor mutational burden by tumor fraction and coverage
EP3827097A1 (en) 2018-07-24 2021-06-02 Omniome, Inc. Serial formation of ternary complex species
EP3833761A1 (en) 2018-08-07 2021-06-16 The Broad Institute, Inc. Novel cas12b enzymes and systems
WO2020036991A1 (en) 2018-08-15 2020-02-20 Illumina, Inc. Compositions and methods for improving library enrichment
CN118406747A (en) 2018-08-20 2024-07-30 生物辐射实验室股份有限公司 Nucleotide sequence generation by co-localization of barcode beads in partitions
CN109161536B (en) * 2018-08-20 2022-04-08 天津科技大学 Enzyme preparation for preparing uridylic acid and method for preparing uridylic acid by enzyme catalysis
JP2021536232A (en) 2018-08-30 2021-12-27 ガーダント ヘルス, インコーポレイテッド Methods and systems for detecting contamination between samples
AU2019328344A1 (en) 2018-08-31 2021-04-08 Guardant Health, Inc. Microsatellite instability detection in cell-free DNA
US20200075124A1 (en) 2018-09-04 2020-03-05 Guardant Health, Inc. Methods and systems for detecting allelic imbalance in cell-free nucleic acid samples
EP3853358A1 (en) 2018-09-17 2021-07-28 Omniome, Inc. Engineered polymerases for improved sequencing
PL3814533T3 (en) 2018-09-20 2022-01-31 Tamirna Gmbh Micro-rna signatures for the prediction of liver dysfunction
WO2020072816A1 (en) 2018-10-03 2020-04-09 The Broad Institute, Inc. Crispr effector system based diagnostics for hemorrhagic fever detection
CN113705585A (en) 2018-10-15 2021-11-26 因美纳有限公司 Method and system based on neural network implementation
WO2020086843A1 (en) 2018-10-26 2020-04-30 Illumina, Inc. Modulating polymer beads for dna processing
MX2020013347A (en) 2018-10-31 2021-04-28 Illumina Inc Polymerases, compositions, and methods of use.
SG11202103486YA (en) 2018-10-31 2021-05-28 Guardant Health Inc Methods, compositions and systems for calibrating epigenetic partitioning assays
NL2022043B1 (en) 2018-11-21 2020-06-03 Akershus Univ Hf Tagmentation-Associated Multiplex PCR Enrichment Sequencing
EP3887540B1 (en) 2018-11-30 2023-08-23 Illumina, Inc. Analysis of multiple analytes using a single assay
EP3891304A1 (en) 2018-12-04 2021-10-13 Omniome, Inc. Mixed-phase fluids for nucleic acid sequencing and other analytical assays
SG11202012762WA (en) 2018-12-05 2021-01-28 Illumina Cambridge Ltd Methods and compositions for cluster generation by bridge amplification
BR112020026655A2 (en) 2018-12-05 2021-04-06 Illumina, Inc. POLYMERASES, COMPOSITIONS AND METHODS OF USE
GB201820341D0 (en) 2018-12-13 2019-01-30 10X Genomics Inc Method for transposase-mediated spatial tagging and analysing genomic DNA in a biological specimen
GB201820300D0 (en) 2018-12-13 2019-01-30 10X Genomics Inc Method for spatial tagging and analysing genomic DNA in a biological specimen
SG11202012748TA (en) 2018-12-14 2021-01-28 Illumina Cambridge Ltd Decreasing phasing with unlabeled nucleotides during sequencing
CA3103744A1 (en) 2018-12-17 2020-06-25 Pietro GATTI-LAFRANCONI Compositions for use in polynucleotide sequencing
CA3103750A1 (en) 2018-12-17 2020-06-25 Illumina Cambridge Limited Primer oligonucleotide for sequencing
WO2020126602A1 (en) 2018-12-18 2020-06-25 Illumina Cambridge Limited Methods and compositions for paired end sequencing using a single surface primer
JP2022512264A (en) 2018-12-19 2022-02-03 イラミーナ インコーポレーテッド Methods for improving the clonality of polynucleotide clusters
WO2020132350A2 (en) 2018-12-20 2020-06-25 Omniome, Inc. Temperature control for analysis of nucleic acids and other analytes
CA3119980A1 (en) 2018-12-20 2020-06-25 Guardant Health, Inc. Methods, compositions, and systems for improving recovery of nucleic acid molecules
US11293061B2 (en) 2018-12-26 2022-04-05 Illumina Cambridge Limited Sequencing methods using nucleotides with 3′ AOM blocking group
US11685946B2 (en) 2019-01-11 2023-06-27 Illumina Cambridge Limited Complex surface-bound transposome complexes
CA3127102A1 (en) 2019-01-23 2020-07-30 Quantum-Si Incorporated High intensity labeled reactant compositions and methods for sequencing
CN113661249A (en) 2019-01-31 2021-11-16 夸登特健康公司 Compositions and methods for isolating cell-free DNA
EP3924513B1 (en) 2019-02-14 2023-04-12 Pacific Biosciences of California, Inc. Mitigating adverse impacts of detection systems on nucleic acids and other biological analytes
EP3927467A4 (en) 2019-02-20 2022-12-14 Pacific Biosciences of California, Inc. Scanning apparatus and methods for detecting chemical and biological analytes
WO2020176659A1 (en) 2019-02-27 2020-09-03 Guardant Health, Inc. Methods and systems for determining the cellular origin of cell-free dna
US20220033805A1 (en) 2019-03-01 2022-02-03 Sanjay Srivatsan High-throughput single-nuclei and single-cell libraries and methods of making and of using
NL2023311B9 (en) 2019-03-21 2021-03-12 Illumina Inc Artificial intelligence-based generation of sequencing metadata
NL2023312B1 (en) 2019-03-21 2020-09-28 Illumina Inc Artificial intelligence-based base calling
NL2023310B1 (en) 2019-03-21 2020-09-28 Illumina Inc Training data generation for artificial intelligence-based sequencing
NL2023314B1 (en) 2019-03-21 2020-09-28 Illumina Inc Artificial intelligence-based quality scoring
US11210554B2 (en) 2019-03-21 2021-12-28 Illumina, Inc. Artificial intelligence-based generation of sequencing metadata
WO2020191387A1 (en) 2019-03-21 2020-09-24 Illumina, Inc. Artificial intelligence-based base calling
US11347965B2 (en) 2019-03-21 2022-05-31 Illumina, Inc. Training data generation for artificial intelligence-based sequencing
NL2023316B1 (en) 2019-03-21 2020-09-28 Illumina Inc Artificial intelligence-based sequencing
US11593649B2 (en) 2019-05-16 2023-02-28 Illumina, Inc. Base calling using convolutions
US11423306B2 (en) 2019-05-16 2022-08-23 Illumina, Inc. Systems and devices for characterization and performance analysis of pixel-based sequencing
EP3976822A1 (en) 2019-05-31 2022-04-06 Guardant Health, Inc. Methods and systems for improving patient monitoring after surgery
US11644406B2 (en) 2019-06-11 2023-05-09 Pacific Biosciences Of California, Inc. Calibrated focus sensing
JP2022539630A (en) 2019-07-12 2022-09-13 イルミナ ケンブリッジ リミテッド Preparation of nucleic acid libraries using electrophoresis
BR112021012755A2 (en) 2019-07-12 2022-04-26 Illumina Cambridge Ltd Compositions and methods for preparing nucleic acid sequencing libraries using crispr/cas9 immobilized on a solid support
WO2021011803A1 (en) 2019-07-16 2021-01-21 Omniome, Inc. Synthetic nucleic acids having non-natural structures
US10656368B1 (en) 2019-07-24 2020-05-19 Omniome, Inc. Method and system for biological imaging using a wide field objective lens
WO2021050681A1 (en) 2019-09-10 2021-03-18 Omniome, Inc. Reversible modification of nucleotides
WO2021050962A1 (en) 2019-09-11 2021-03-18 The United States Of America, As Represented By The Secretary, Department Of Health And Human Services Cancer detection and classification
CN118186063A (en) 2019-10-18 2024-06-14 加利福尼亚太平洋生物科学股份有限公司 Methods and compositions for capping nucleic acids
EP3812472B1 (en) 2019-10-21 2022-11-23 Albert-Ludwigs-Universität Freiburg A truly unbiased in vitro assay to profile off-target activity of one or more target-specific programmable nucleases in cells (abnoba-seq)
US20210139867A1 (en) 2019-11-08 2021-05-13 Omniome, Inc. Engineered polymerases for improved sequencing by binding
CA3131748A1 (en) 2019-11-22 2021-05-27 Sarah E. SHULTZABERGER Circulating rna signatures specific to preeclampsia
CA3157560A1 (en) 2019-11-26 2021-06-03 Dustin Howard HITE Methods, compositions and systems for improving the binding of methylated polynucleotides
DE202019106694U1 (en) 2019-12-02 2020-03-19 Omniome, Inc. System for sequencing nucleic acids in fluid foam
DE202019106695U1 (en) 2019-12-02 2020-03-19 Omniome, Inc. System for sequencing nucleic acids in fluid foam
US20210172012A1 (en) 2019-12-04 2021-06-10 Illumina, Inc. Preparation of dna sequencing libraries for detection of dna pathogens in plasma
CA3134746A1 (en) 2019-12-19 2021-06-24 Illumina, Inc. High-throughput single-cell libraries and methods of making and of using
WO2021152586A1 (en) 2020-01-30 2021-08-05 Yeda Research And Development Co. Ltd. Methods of analyzing microbiome, immunoglobulin profile and physiological state
WO2021158511A1 (en) 2020-02-04 2021-08-12 Omniome, Inc. Flow cells and methods for their manufacture and use
CN111311265B (en) * 2020-02-13 2023-07-25 布比(北京)网络技术有限公司 Blockchain private transaction proving method, blockchain private transaction proving device, computer equipment and storage medium
US20210265018A1 (en) 2020-02-20 2021-08-26 Illumina, Inc. Knowledge Distillation and Gradient Pruning-Based Compression of Artificial Intelligence-Based Base Caller
US20210265015A1 (en) 2020-02-20 2021-08-26 Illumina, Inc. Hardware Execution and Acceleration of Artificial Intelligence-Based Base Caller
CN115136244A (en) 2020-02-20 2022-09-30 因美纳有限公司 Many-to-many base interpretation based on artificial intelligence
US20210265016A1 (en) 2020-02-20 2021-08-26 Illumina, Inc. Data Compression for Artificial Intelligence-Based Base Calling
AU2021230282A1 (en) 2020-03-03 2022-09-22 Pacific Biosciences Of California, Inc. Methods and compositions for sequencing double stranded nucleic acids
AU2021249046A1 (en) 2020-03-30 2022-10-20 Illumina, Inc. Methods and compositions for preparing nucleic acid libraries
WO2021214766A1 (en) 2020-04-21 2021-10-28 Yeda Research And Development Co. Ltd. Methods of diagnosing viral infections and vaccines thereto
CA3177127A1 (en) 2020-04-30 2021-11-04 Guardant Health, Inc. Methods for sequence determination using partitioned nucleic acids
CN115836135A (en) 2020-05-05 2023-03-21 加利福尼亚太平洋生物科学股份有限公司 Compositions and methods for modifying polymerase-nucleic acid complexes
US20230203592A1 (en) 2020-05-05 2023-06-29 Akershus Universitetssykehus Hf Compositions and methods for characterizing bowel cancer
US11188778B1 (en) 2020-05-05 2021-11-30 Illumina, Inc. Equalization-based image processing and spatial crosstalk attenuator
JP2023526062A (en) 2020-05-12 2023-06-20 イルミナ インコーポレイテッド Generation of Nucleic Acids with Modified Bases Using Recombinant Terminal Deoxynucleotidyl Transferase
EP4150113A1 (en) 2020-05-14 2023-03-22 Guardant Health, Inc. Homologous recombination repair deficiency detection
WO2021252617A1 (en) 2020-06-09 2021-12-16 Illumina, Inc. Methods for increasing yield of sequencing libraries
EP4231174A3 (en) 2020-06-11 2023-11-01 Nautilus Subsidiary, Inc. Methods and systems for computational decoding of biological, chemical, and physical entities
JP2023531009A (en) 2020-06-22 2023-07-20 イルミナ ケンブリッジ リミテッド Nucleosides and nucleotides with 3' acetal blocking groups
CN115843315A (en) 2020-07-02 2023-03-24 伊鲁米纳公司 Method for calibrating nucleic acid library inoculation efficiency in flow cell
JP2023532905A (en) 2020-07-08 2023-08-01 イルミナ インコーポレイテッド Beads as transposome carriers
EP4407042A3 (en) 2020-07-10 2024-09-18 Guardant Health, Inc. Methods of detecting genomic rearrangements using cell free nucleic acids
WO2023282916A1 (en) 2021-07-09 2023-01-12 Guardant Health, Inc. Methods of detecting genomic rearrangements using cell free nucleic acids
US12116637B2 (en) 2020-07-24 2024-10-15 The Regents Of The University Of Michigan Compositions and methods for detecting and treating high grade subtypes of uterine cancer
US11946044B2 (en) 2020-07-30 2024-04-02 Guardant Health, Inc. Methods for isolating cell-free DNA
IL300112A (en) 2020-08-06 2023-03-01 Illumina Inc Preparation of rna and dna sequencing libraries using bead-linked transposomes
WO2022040176A1 (en) 2020-08-18 2022-02-24 Illumina, Inc. Sequence-specific targeted transposition and selection and sorting of nucleic acids
WO2022046947A1 (en) 2020-08-25 2022-03-03 Guardant Health, Inc. Methods and systems for predicting an origin of a variant
US20220067489A1 (en) 2020-08-28 2022-03-03 Illumina, Inc. Detecting and Filtering Clusters Based on Artificial Intelligence-Predicted Base Calls
WO2022053610A1 (en) 2020-09-11 2022-03-17 Illumina Cambridge Limited Methods of enriching a target sequence from a sequencing library using hairpin adaptors
EP4222278A1 (en) 2020-09-30 2023-08-09 Guardant Health, Inc. Compositions and methods for analyzing dna using partitioning and a methylation-dependent nuclease
IL302207A (en) 2020-10-21 2023-06-01 Illumina Inc Sequencing templates comprising multiple inserts and compositions and methods for improving sequencing throughput
EP4267757A1 (en) 2020-12-23 2023-11-01 Guardant Health, Inc. Methods and systems for analyzing methylated polynucleotides
EP4277740A1 (en) 2021-01-13 2023-11-22 Pacific Biosciences of California, Inc. Surface structuring with colloidal assembly
CN117043352A (en) 2021-01-29 2023-11-10 伊鲁米纳公司 Methods, compositions and kits for improving the seeding efficiency of a flow cell using polynucleotides
EP4284751A1 (en) 2021-01-29 2023-12-06 Armonica Technologies, Inc. Enhancement structures for surface-enhanced raman scattering
MX2023009046A (en) 2021-02-04 2023-08-10 Illumina Inc Long indexed-linked read generation on transposome bound beads.
WO2022174054A1 (en) 2021-02-13 2022-08-18 The General Hospital Corporation Methods and compositions for in situ macromolecule detection and uses thereof
WO2022187862A1 (en) 2021-03-05 2022-09-09 Guardant Health, Inc. Methods and related aspects for analyzing molecular response
US20220344004A1 (en) 2021-03-09 2022-10-27 Guardant Health, Inc. Detecting the presence of a tumor based on off-target polynucleotide sequencing data
WO2022197754A1 (en) 2021-03-16 2022-09-22 Illumina Software, Inc. Neural network parameter quantization for base calling
CN117222747A (en) 2021-03-22 2023-12-12 伊鲁米纳剑桥有限公司 Methods for improving clonality of nucleic acid clusters
JP2024511766A (en) 2021-03-29 2024-03-15 イルミナ インコーポレイテッド Improved library preparation method
EP4314328A1 (en) 2021-03-29 2024-02-07 Illumina, Inc. Compositions and methods for assessing dna damage in a library and normalizing amplicon size bias
IL307172A (en) 2021-03-30 2023-11-01 Illumina Inc Improved methods of isothermal complementary dna and library preparation
AU2022248999A1 (en) 2021-03-31 2023-02-02 Illumina, Inc. Artificial intelligence-based base caller with contextual awareness
JP2024511760A (en) 2021-03-31 2024-03-15 イルミナ インコーポレイテッド Method for preparing directional tagmentation sequencing libraries using transposon-based technology with unique molecular identifiers for error correction
CA3214148A1 (en) 2021-04-02 2022-10-06 Brandon Tyler WESTERBERG Machine-learning model for detecting a bubble within a nucleotide-sample slide for sequencing
US20220336054A1 (en) 2021-04-15 2022-10-20 Illumina, Inc. Deep Convolutional Neural Networks to Predict Variant Pathogenicity using Three-Dimensional (3D) Protein Structures
US12070744B2 (en) 2021-04-22 2024-08-27 Illumina, Inc. Valve assemblies and related systems
US20220356515A1 (en) 2021-05-10 2022-11-10 Pacific Biosciences Of California, Inc. Dna amplification buffer replenishment during rolling circle amplification
US20220356519A1 (en) 2021-05-10 2022-11-10 Pacific Biosciences Of California, Inc. Single-molecule seeding and amplification on a surface
BR112023024130A2 (en) 2021-05-20 2024-01-30 Illumina Cambridge Ltd COMPOSITIONS AND METHODS FOR SEQUENCING BY SYNTHESIS
US20230016633A1 (en) 2021-06-15 2023-01-19 Illumina, Inc. Hydrogel-free surface functionalization for sequencing
US20220411864A1 (en) 2021-06-23 2022-12-29 Illumina, Inc. Compositions, methods, kits, cartridges, and systems for sequencing reagents
CN117546245A (en) 2021-06-29 2024-02-09 因美纳有限公司 Machine learning model for generating confidence classifications of genomic coordinates
WO2023278184A1 (en) 2021-06-29 2023-01-05 Illumina, Inc. Methods and systems to correct crosstalk in illumination emitted from reaction sites
JP2024527306A (en) 2021-06-29 2024-07-24 イルミナ インコーポレイテッド A self-learning base collab trained using oligo sequences
JP2024527307A (en) 2021-06-29 2024-07-24 イルミナ インコーポレイテッド Signal-to-Noise Ratio Metrics for Determining Nucleotide Base Calls and Base Call Quality
US20230005253A1 (en) 2021-07-01 2023-01-05 Illumina, Inc. Efficient artificial intelligence-based base calling of index sequences
US20230027409A1 (en) 2021-07-13 2023-01-26 Illumina, Inc. Methods and systems for real time extraction of crosstalk in illumination emitted from reaction sites
US11455487B1 (en) 2021-10-26 2022-09-27 Illumina Software, Inc. Intensity extraction and crosstalk attenuation using interpolation and adaptation for base calling
KR20240031968A (en) 2021-07-19 2024-03-08 일루미나, 인코포레이티드 Intensity extraction with interpolation and adaptation to base calling
CN117813391A (en) 2021-07-23 2024-04-02 因美纳有限公司 Method for preparing substrate surface for DNA sequencing
US20230021577A1 (en) 2021-07-23 2023-01-26 Illumina Software, Inc. Machine-learning model for recalibrating nucleotide-base calls
KR20240037882A (en) 2021-07-28 2024-03-22 일루미나, 인코포레이티드 Quality score calibration of the base calling system
KR20240035413A (en) 2021-08-03 2024-03-15 일루미나, 인코포레이티드 Base calling using the multi-base caller model
US12077789B2 (en) 2021-08-14 2024-09-03 Illumina, Inc. Polymerases, compositions, and methods of use
AU2022331421A1 (en) 2021-08-17 2024-01-04 Illumina, Inc. Methods and compositions for identifying methylated cytosines
US20230093253A1 (en) 2021-09-17 2023-03-23 Illumina, Inc. Automatically identifying failure sources in nucleotide sequencing from base-call-error patterns
KR20240072970A (en) 2021-09-21 2024-05-24 일루미나, 인코포레이티드 Graph reference genome and base determination approaches using imputed haplotypes.
WO2023049215A1 (en) 2021-09-22 2023-03-30 Illumina, Inc. Compressed state-based base calling
US20230096386A1 (en) 2021-09-30 2023-03-30 Illumina Cambridge Limited Polynucleotide sequencing
WO2023056328A2 (en) 2021-09-30 2023-04-06 Illumina, Inc. Solid supports and methods for depleting and/or enriching library fragments prepared from biosamples
AU2022367166A1 (en) 2021-10-11 2024-04-04 Nautilus Subsidiary, Inc. Highly multiplexable analysis of proteins and proteomes
WO2023069927A1 (en) 2021-10-20 2023-04-27 Illumina, Inc. Methods for capturing library dna for sequencing
WO2023081485A1 (en) 2021-11-08 2023-05-11 Pacific Biosciences Of California, Inc. Stepwise sequencing of a polynucleotide with a homogenous reaction mixture
WO2023102354A1 (en) 2021-12-02 2023-06-08 Illumina Software, Inc. Generating cluster-specific-signal corrections for determining nucleotide-base calls
US20230215515A1 (en) 2021-12-23 2023-07-06 Illumina Software, Inc. Facilitating secure execution of external workflows for genomic sequencing diagnostics
WO2023122363A1 (en) 2021-12-23 2023-06-29 Illumina Software, Inc. Dynamic graphical status summaries for nucelotide sequencing
US20230207050A1 (en) 2021-12-28 2023-06-29 Illumina Software, Inc. Machine learning model for recalibrating nucleotide base calls corresponding to target variants
CA3242624A1 (en) 2021-12-29 2023-07-06 Shannon Whitmore Automatically switching variant analysis model versions for genomic analysis applications
WO2023141154A1 (en) 2022-01-20 2023-07-27 Illumina Cambridge Limited Methods of detecting methylcytosine and hydroxymethylcytosine by sequencing
CA3224595A1 (en) 2022-02-25 2023-08-31 Steven Norberg Machine-learning models for detecting and adjusting values for nucleotide methylation levels
WO2023164660A1 (en) 2022-02-25 2023-08-31 Illumina, Inc. Calibration sequences for nucelotide sequencing
US20230343414A1 (en) 2022-03-25 2023-10-26 Illumina, Inc. Sequence-to-sequence base calling
AU2023241893A1 (en) 2022-03-29 2024-09-26 Nautilus Subsidiary, Inc. Integrated arrays for single-analyte processes
CA3223722A1 (en) 2022-04-07 2023-10-12 Illumina, Inc. Altered cytidine deaminases and methods of use
US20230360732A1 (en) 2022-04-25 2023-11-09 Nautilus Subsidiary, Inc. Systems and methods for assessing and improving the quality of multiplex molecular assays
WO2023212601A1 (en) 2022-04-26 2023-11-02 Illumina, Inc. Machine-learning models for selecting oligonucleotide probes for array technologies
US20230348967A1 (en) 2022-04-29 2023-11-02 Illumina Cambridge Limited Methods and systems for encapsulating lyophilised microspheres
US20230360725A1 (en) 2022-05-09 2023-11-09 Guardant Health, Inc. Detecting degradation based on strand bias
US20230368866A1 (en) 2022-05-10 2023-11-16 Illumina Software, Inc. Adaptive neural network for nucelotide sequencing
WO2023235353A2 (en) 2022-06-03 2023-12-07 Illumina, Inc. Circulating rna biomarkers for preeclampsia
US20230407386A1 (en) 2022-06-09 2023-12-21 Illumina, Inc. Dependence of base calling on flow cell tilt
WO2023240494A1 (en) * 2022-06-15 2023-12-21 深圳华大智造科技股份有限公司 Sequencing buffer and method for improving stability of dntp modified with reversible blocking group
WO2023250504A1 (en) 2022-06-24 2023-12-28 Illumina Software, Inc. Improving split-read alignment by intelligently identifying and scoring candidate split groups
US20230420082A1 (en) 2022-06-27 2023-12-28 Illumina Software, Inc. Generating and implementing a structural variation graph genome
WO2024006779A1 (en) 2022-06-27 2024-01-04 Illumina, Inc. Accelerators for a genotype imputation model
WO2024006705A1 (en) 2022-06-27 2024-01-04 Illumina Software, Inc. Improved human leukocyte antigen (hla) genotyping
CN116024320A (en) * 2022-07-13 2023-04-28 上海翔琼生物技术有限公司 Fluorescent quantitative PCR method for detecting nucleic acid
WO2024015962A1 (en) 2022-07-15 2024-01-18 Pacific Biosciences Of California, Inc. Blocked asymmetric hairpin adaptors
US20240038327A1 (en) 2022-07-26 2024-02-01 Illumina Software, Inc. Rapid single-cell multiomics processing using an executable file
WO2024039516A1 (en) 2022-08-19 2024-02-22 Illumina, Inc. Third dna base pair site-specific dna detection
US20240094215A1 (en) 2022-09-15 2024-03-21 Nautilus Subsidiary, Inc. Characterizing accessibility of macromolecule structures
WO2024073516A1 (en) 2022-09-29 2024-04-04 Illumina, Inc. A target-variant-reference panel for imputing target variants
WO2024073047A1 (en) 2022-09-30 2024-04-04 Illumina, Inc. Cytidine deaminases and methods of use in mapping modified cytosine nucleotides
WO2024073043A1 (en) 2022-09-30 2024-04-04 Illumina, Inc. Methods of using cpg binding proteins in mapping modified cytosine nucleotides
WO2024073519A1 (en) 2022-09-30 2024-04-04 Illumina, Inc. Machine-learning model for refining structural variant calls
WO2024069581A1 (en) 2022-09-30 2024-04-04 Illumina Singapore Pte. Ltd. Helicase-cytidine deaminase complexes and methods of use
US20240141427A1 (en) 2022-09-30 2024-05-02 Illumina, Inc. Polymerases, compositions, and methods of use
WO2024077096A1 (en) 2022-10-05 2024-04-11 Illumina, Inc. Integrating variant calls from multiple sequencing pipelines utilizing a machine learning architecture
WO2024077152A1 (en) 2022-10-06 2024-04-11 Illumina, Inc. Probes for depleting abundant small noncoding rna
WO2024077162A2 (en) 2022-10-06 2024-04-11 Illumina, Inc. Probes for improving coronavirus sample surveillance
WO2024077202A2 (en) 2022-10-06 2024-04-11 Illumina, Inc. Probes for improving environmental sample surveillance
US20240127906A1 (en) 2022-10-11 2024-04-18 Illumina, Inc. Detecting and correcting methylation values from methylation sequencing assays
WO2024107599A1 (en) 2022-11-15 2024-05-23 Guardant Health, Inc. Method of predicting non-small cell lung cancer (nsclc) patient drug response or time until death or cancer progression from circulating tumor dna (ctdna) utilizing signals from both baseline ctdna level and longitudinal change of ctdna level over time
WO2024118903A1 (en) 2022-11-30 2024-06-06 Illumina, Inc. Chemoenzymatic correction of false positive uracil transformations
US20240177802A1 (en) 2022-11-30 2024-05-30 Illumina, Inc. Accurately predicting variants from methylation sequencing data
WO2024129672A1 (en) 2022-12-12 2024-06-20 The Broad Institute, Inc. Trafficked rnas for assessment of cell-cell connectivity and neuroanatomy
WO2024129969A1 (en) 2022-12-14 2024-06-20 Illumina, Inc. Systems and methods for capture and enrichment of clustered beads on flow cell substrates
WO2024130031A1 (en) 2022-12-16 2024-06-20 Illumina, Inc. Boranes on solid supports
WO2024137682A1 (en) 2022-12-21 2024-06-27 Guardant Health, Inc. Detecting homologous recombination deficiences based on methylation status of cell-free nucleic acid molecules
WO2024137765A1 (en) 2022-12-22 2024-06-27 Illumina, Inc. Transition-metal catalyst compositions and methods for sequencing by synthesis
US20240247311A1 (en) 2022-12-22 2024-07-25 Illumina, Inc. Palladium catalyst compositions and methods for sequencing by synthesis
AU2023419253A1 (en) 2022-12-27 2024-10-03 Illumina, Inc. Methods of sequencing using 3´ allyl blocked nucleotides
WO2024147904A1 (en) 2023-01-06 2024-07-11 Illumina, Inc. Reducing uracils by polymerase
WO2024151373A1 (en) 2023-01-12 2024-07-18 Nautilus Subsidiary, Inc. Characterization of glycans and glycoconjugates
US20240266003A1 (en) 2023-02-06 2024-08-08 Illumina, Inc. Determining and removing inter-cluster light interference
WO2024173756A1 (en) 2023-02-17 2024-08-22 Illumina, Inc. Cell-free dna signals as biomarkers of preeclampsia
US20240327909A1 (en) 2023-03-30 2024-10-03 Illumina, Inc. Compositions and methods for nucleic acid sequencing
WO2024206848A1 (en) 2023-03-30 2024-10-03 Illumina, Inc. Tandem repeat genotyping
CN117809757B (en) * 2023-12-28 2024-10-01 苏州腾迈医药科技有限公司 Molecular force field fitting method and device

Family Cites Families (91)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5241060A (en) * 1982-06-23 1993-08-31 Enzo Diagnostics, Inc. Base moiety-labeled detectable nucleatide
US4994373A (en) * 1983-01-27 1991-02-19 Enzo Biochem, Inc. Method and structures employing chemically-labelled polynucleotide probes
US5200313A (en) * 1983-08-05 1993-04-06 Miles Inc. Nucleic acid hybridization assay employing detectable anti-hybrid antibodies
US6086737A (en) * 1984-03-29 2000-07-11 Li-Cor, Inc. Sequencing near infrared and infrared fluorescence labeled DNA for detecting using laser diodes and suitable labels therefor
US5230781A (en) * 1984-03-29 1993-07-27 Li-Cor, Inc. Sequencing near infrared and infrared fluorescence labeled DNA for detecting using laser diodes
US5366603A (en) * 1984-03-29 1994-11-22 Li-Cor, Inc. Sequencing near infrared and infrared fluorescence labeled DNA for detecting useing laser diodes
US5360523A (en) * 1984-03-29 1994-11-01 Li-Cor, Inc. DNA sequencing
US5571388A (en) * 1984-03-29 1996-11-05 Li-Cor, Inc. Sequencing near infrared and infrared fluorescense labeled DNA for detecting using laser diodes and suitable labels thereof
US6207421B1 (en) * 1984-03-29 2001-03-27 Li-Cor, Inc. DNA sequencing and DNA terminators
US4997928A (en) * 1988-09-15 1991-03-05 E. I. Du Pont De Nemours And Company Fluorescent reagents for the preparation of 5'-tagged oligonucleotides
US5547839A (en) * 1989-06-07 1996-08-20 Affymax Technologies N.V. Sequencing of surface immobilized polymers utilizing microflourescence detection
US5302509A (en) * 1989-08-14 1994-04-12 Beckman Instruments, Inc. Method for sequencing polynucleotides
US5401847A (en) * 1990-03-14 1995-03-28 Regents Of The University Of California DNA complexes with dyes designed for energy transfer as fluorescent markers
US5232075A (en) * 1991-06-25 1993-08-03 New Venture Gear, Inc. Viscous coupling apparatus with coined plates
US5405747A (en) * 1991-09-25 1995-04-11 The Regents Of The University Of California Office Of Technology Transfer Method for rapid base sequencing in DNA and RNA with two base labeling
US6048690A (en) * 1991-11-07 2000-04-11 Nanogen, Inc. Methods for electronic fluorescent perturbation for analysis and electronic perturbation catalysis for synthesis
JP3227189B2 (en) * 1991-12-25 2001-11-12 キヤノン株式会社 Apparatus provided with flexible cable and ink jet recording apparatus provided with the apparatus
US5403708A (en) * 1992-07-06 1995-04-04 Brennan; Thomas M. Methods and compositions for determining the sequence of nucleic acids
US5503980A (en) * 1992-11-06 1996-04-02 Trustees Of Boston University Positional sequencing by hybridization
US5547835A (en) * 1993-01-07 1996-08-20 Sequenom, Inc. DNA sequencing by mass spectrometry
US5677196A (en) * 1993-05-18 1997-10-14 University Of Utah Research Foundation Apparatus and methods for multi-analyte homogeneous fluoro-immunoassays
WO1995006138A1 (en) * 1993-08-25 1995-03-02 The Regents Of The University Of California Microscopic method for detecting micromotions
US5654419A (en) * 1994-02-01 1997-08-05 The Regents Of The University Of California Fluorescent labels and their use in separations
US5512462A (en) 1994-02-25 1996-04-30 Hoffmann-La Roche Inc. Methods and reagents for the polymerase chain reaction amplification of long DNA sequences
US6593148B1 (en) * 1994-03-01 2003-07-15 Li-Cor, Inc. Cyanine dye compounds and labeling methods
US5601982A (en) * 1995-02-07 1997-02-11 Sargent; Jeannine P. Method and apparatus for determining the sequence of polynucleotides
US5961923A (en) * 1995-04-25 1999-10-05 Irori Matrices with memories and uses thereof
US5856174A (en) * 1995-06-29 1999-01-05 Affymetrix, Inc. Integrated nucleic acid diagnostic device
US5661028A (en) * 1995-09-29 1997-08-26 Lockheed Martin Energy Systems, Inc. Large scale DNA microsequencing device
US6165765A (en) * 1995-10-18 2000-12-26 Shanghai Institute Of Biochemistry, Chinese Academy Of Sciences DNA polymerase having ability to reduce innate selective discrimination against fluorescent dye-labeled dideoxynucleotides
US6027890A (en) * 1996-01-23 2000-02-22 Rapigene, Inc. Methods and compositions for enhancing sensitivity in the analysis of biological-based assays
US5972603A (en) * 1996-02-09 1999-10-26 President And Fellows Of Harvard College DNA polymerase with modified processivity
US5723298A (en) * 1996-09-16 1998-03-03 Li-Cor, Inc. Cycle labeling and sequencing with thermostable polymerases
US5858671A (en) * 1996-11-01 1999-01-12 The University Of Iowa Research Foundation Iterative and regenerative DNA sequencing method
US6027709A (en) * 1997-01-10 2000-02-22 Li-Cor Inc. Fluorescent cyanine dyes
US5804386A (en) * 1997-01-15 1998-09-08 Incyte Pharmaceuticals, Inc. Sets of labeled energy transfer fluorescent primers and their use in multi component analysis
US6403311B1 (en) * 1997-02-12 2002-06-11 Us Genomics Methods of analyzing polymers using ordered label strategies
EP1009802B1 (en) * 1997-02-12 2004-08-11 Eugene Y. Chan Methods for analyzimg polymers
EP0983364B1 (en) * 1997-03-12 2002-06-12 PE Corporation (NY) Dna polymerases having improved labeled nucleotide incorporation properties
ES2183394T5 (en) 1997-07-28 2012-05-03 Gen-Probe Incorporated Nucleic Acid Sequence Analysis
JP3813818B2 (en) * 1998-05-01 2006-08-23 アリゾナ ボード オブ リージェンツ Method for determining the nucleotide sequence of oligonucleotides and DNA molecules
IL141148A0 (en) 1998-07-30 2002-02-10 Solexa Ltd Arrayed biomolecules and their use in sequencing
US6210896B1 (en) * 1998-08-13 2001-04-03 Us Genomics Molecular motors
US6263286B1 (en) * 1998-08-13 2001-07-17 U.S. Genomics, Inc. Methods of analyzing polymers using a spatial network of fluorophores and fluorescence resonance energy transfer
US6280939B1 (en) * 1998-09-01 2001-08-28 Veeco Instruments, Inc. Method and apparatus for DNA sequencing using a local sensitive force detector
DE19844931C1 (en) * 1998-09-30 2000-06-15 Stefan Seeger Procedures for DNA or RNA sequencing
US6221592B1 (en) * 1998-10-20 2001-04-24 Wisconsin Alumi Research Foundation Computer-based methods and systems for sequencing of individual nucleic acid molecules
US6044744A (en) * 1998-10-29 2000-04-04 At&T Corp. Fiber optic cable sheath removal tool
US6232075B1 (en) * 1998-12-14 2001-05-15 Li-Cor, Inc. Heterogeneous assay for pyrophosphate detection
WO2000036512A1 (en) 1998-12-15 2000-06-22 Matsushita Electric Industrial Co., Ltd. Clock phase adjustment method, and integrated circuit and design method therefor
US6558945B1 (en) * 1999-03-08 2003-05-06 Aclara Biosciences, Inc. Method and device for rapid color detection
JP2002537858A (en) * 1999-03-10 2002-11-12 エーエスエム サイエンティフィック, インコーポレイテッド Methods for direct sequencing of nucleic acids
GB9907812D0 (en) 1999-04-06 1999-06-02 Medical Biosystems Ltd Sequencing
US7056661B2 (en) * 1999-05-19 2006-06-06 Cornell Research Foundation, Inc. Method for sequencing nucleic acid molecules
US6818395B1 (en) * 1999-06-28 2004-11-16 California Institute Of Technology Methods and apparatus for analyzing polynucleotide sequences
AU7086800A (en) * 1999-08-30 2001-03-26 Government Of The United States Of America, As Represented By The Secretary Of The Department Of Health And Human Services, The High speed parallel molecular nucleic acid sequencing
US6982146B1 (en) * 1999-08-30 2006-01-03 The United States Of America As Represented By The Department Of Health And Human Services High speed parallel molecular nucleic acid sequencing
EP1218543A2 (en) 1999-09-29 2002-07-03 Solexa Ltd. Polynucleotide sequencing
GB9923644D0 (en) 1999-10-06 1999-12-08 Medical Biosystems Ltd DNA sequencing
US6399335B1 (en) * 1999-11-16 2002-06-04 Advanced Research And Technology Institute, Inc. γ-phosphoester nucleoside triphosphates
US6917726B2 (en) * 2001-09-27 2005-07-12 Cornell Research Foundation, Inc. Zero-mode clad waveguides for performing spectroscopy with confined effective observation volumes
US6869764B2 (en) * 2000-06-07 2005-03-22 L--Cor, Inc. Nucleic acid sequencing using charge-switch nucleotides
US6936702B2 (en) * 2000-06-07 2005-08-30 Li-Cor, Inc. Charge-switch nucleotides
US20070172866A1 (en) * 2000-07-07 2007-07-26 Susan Hardin Methods for sequence determination using depolymerizing agent
US20030064366A1 (en) * 2000-07-07 2003-04-03 Susan Hardin Real-time sequence determination
CA2424454C (en) * 2000-09-19 2010-07-20 Li-Cor, Inc. Cyanine dyes
WO2002044425A2 (en) * 2000-12-01 2002-06-06 Visigen Biotechnologies, Inc. Enzymatic nucleic acid synthesis: compositions and methods for altering monomer incorporation fidelity
EP1417794A4 (en) * 2001-04-25 2009-09-23 Tekelec Us Methods and systems for load sharing signaling messages among signaling links
US20040161741A1 (en) * 2001-06-30 2004-08-19 Elazar Rabani Novel compositions and processes for analyte detection, quantification and amplification
US20030064400A1 (en) * 2001-08-24 2003-04-03 Li-Cor, Inc. Microfluidics system for single molecule DNA sequencing
US7223541B2 (en) * 2001-08-29 2007-05-29 Ge Healthcare Bio-Sciences Corp. Terminal-phosphate-labeled nucleotides and methods of use
DE60235376D1 (en) * 2001-08-29 2010-04-01 Ge Healthcare Bio Sciences LABELED NUCLEOSIDE POLYPHOSPHATES
US7033762B2 (en) * 2001-08-29 2006-04-25 Amersham Biosciences Corp Single nucleotide amplification and detection by polymerase
US7052839B2 (en) * 2001-08-29 2006-05-30 Amersham Biosciences Corp Terminal-phosphate-labeled nucleotides and methods of use
US7005518B2 (en) * 2002-10-25 2006-02-28 Li-Cor, Inc. Phthalocyanine dyes
WO2004072304A1 (en) * 2003-02-05 2004-08-26 Amersham Biosciences Corp Nucleic acid amplification
EP1592779A4 (en) * 2003-02-05 2007-12-12 Ge Healthcare Bio Sciences Terminal-phosphate-labeled nucleotides with new linkers
WO2004092331A2 (en) * 2003-04-08 2004-10-28 Li-Cor, Inc. Composition and method for nucleic acid sequencing
KR100558528B1 (en) * 2003-09-25 2006-03-10 동부아남반도체 주식회사 CMOS Image sensor and its fabricating method
US7170050B2 (en) * 2004-09-17 2007-01-30 Pacific Biosciences Of California, Inc. Apparatus and methods for optical analysis of molecules
WO2006044078A2 (en) * 2004-09-17 2006-04-27 Pacific Biosciences Of California, Inc. Apparatus and method for analysis of molecules
US20070048748A1 (en) * 2004-09-24 2007-03-01 Li-Cor, Inc. Mutant polymerases for sequencing and genotyping
US7482120B2 (en) * 2005-01-28 2009-01-27 Helicos Biosciences Corporation Methods and compositions for improving fidelity in a nucleic acid synthesis reaction
WO2006086673A2 (en) * 2005-02-09 2006-08-17 Pacific Biosciences Of California, Inc. Nucleotide compositions and uses thereof
US7130041B2 (en) * 2005-03-02 2006-10-31 Li-Cor, Inc. On-chip spectral filtering using CCD array for imaging and spectroscopy
US8227621B2 (en) * 2005-06-30 2012-07-24 Li-Cor, Inc. Cyanine dyes and methods of use
US7509836B2 (en) * 2005-09-01 2009-03-31 Li-Cor, Inc. Gas flux system chamber design and positioning method
WO2007064597A2 (en) * 2005-11-28 2007-06-07 Pacific Biosciences Of California, Inc. Uniform surfaces for hybrid material substrates and methods for making and using same
JP2009519717A (en) * 2005-12-16 2009-05-21 アプレラ コーポレイション Method and system for fixed phase sequencing
US20080076189A1 (en) * 2006-03-30 2008-03-27 Visigen Biotechnologies, Inc. Modified surfaces for the detection of biomolecules at the single molecule level
US20080091005A1 (en) * 2006-07-20 2008-04-17 Visigen Biotechnologies, Inc. Modified nucleotides, methods for making and using same

Cited By (204)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10996166B2 (en) 2009-03-11 2021-05-04 Industrial Technology Research Institute Apparatus and method for detection and discrimination molecular object
US9778188B2 (en) 2009-03-11 2017-10-03 Industrial Technology Research Institute Apparatus and method for detection and discrimination molecular object
US9365839B2 (en) 2009-03-27 2016-06-14 Life Technologies Corporation Polymerase compositions and methods
US9695471B2 (en) 2009-03-27 2017-07-04 Life Technologies Corporation Methods and apparatus for single molecule sequencing using energy transfer detection
US8603792B2 (en) 2009-03-27 2013-12-10 Life Technologies Corporation Conjugates of biomolecules to nanoparticles
US10093974B2 (en) 2009-03-27 2018-10-09 Life Technologies Corporation Methods and apparatus for single molecule sequencing using energy transfer detection
US10093973B2 (en) 2009-03-27 2018-10-09 Life Technologies Corporation Polymerase compositions and methods
US9932573B2 (en) 2009-03-27 2018-04-03 Life Technologies Corporation Labeled enzyme compositions, methods and systems
US10093972B2 (en) 2009-03-27 2018-10-09 Life Technologies Corporation Conjugates of biomolecules to nanoparticles
US9365838B2 (en) 2009-03-27 2016-06-14 Life Technologies Corporation Conjugates of biomolecules to nanoparticles
US11453909B2 (en) 2009-03-27 2022-09-27 Life Technologies Corporation Polymerase compositions and methods
US20110003343A1 (en) * 2009-03-27 2011-01-06 Life Technologies Corporation Conjugates of biomolecules to nanoparticles
US9567629B2 (en) 2009-03-27 2017-02-14 Life Technologies Corporation Labeled enzyme compositions, methods and systems
US11542549B2 (en) 2009-03-27 2023-01-03 Life Technologies Corporation Labeled enzyme compositions, methods and systems
US11015220B2 (en) 2009-03-27 2021-05-25 Life Technologies Corporation Conjugates of biomolecules to nanoparticles
US11008612B2 (en) 2009-03-27 2021-05-18 Life Technologies Corporation Methods and apparatus for single molecule sequencing using energy transfer detection
US9816137B2 (en) 2009-12-15 2017-11-14 Cellular Research, Inc. Digital counting of individual molecules by stochastic attachment of diverse labels
US11970737B2 (en) 2009-12-15 2024-04-30 Becton, Dickinson And Company Digital counting of individual molecules by stochastic attachment of diverse labels
US10392661B2 (en) 2009-12-15 2019-08-27 Becton, Dickinson And Company Digital counting of individual molecules by stochastic attachment of diverse labels
US10059991B2 (en) 2009-12-15 2018-08-28 Cellular Research, Inc. Digital counting of individual molecules by stochastic attachment of diverse labels
US9708659B2 (en) 2009-12-15 2017-07-18 Cellular Research, Inc. Digital counting of individual molecules by stochastic attachment of diverse labels
US11993814B2 (en) 2009-12-15 2024-05-28 Becton, Dickinson And Company Digital counting of individual molecules by stochastic attachment of diverse labels
US10619203B2 (en) 2009-12-15 2020-04-14 Becton, Dickinson And Company Digital counting of individual molecules by stochastic attachment of diverse labels
US12060607B2 (en) 2009-12-15 2024-08-13 Becton, Dickinson And Company Digital counting of individual molecules by stochastic attachment of diverse labels
US10047394B2 (en) 2009-12-15 2018-08-14 Cellular Research, Inc. Digital counting of individual molecules by stochastic attachment of diverse labels
US9845502B2 (en) 2009-12-15 2017-12-19 Cellular Research, Inc. Digital counting of individual molecules by stochastic attachment of diverse labels
US10202646B2 (en) 2009-12-15 2019-02-12 Becton, Dickinson And Company Digital counting of individual molecules by stochastic attachment of diverse labels
US9315857B2 (en) 2009-12-15 2016-04-19 Cellular Research, Inc. Digital counting of individual molecules by stochastic attachment of diverse label-tags
US9290808B2 (en) 2009-12-15 2016-03-22 Cellular Research, Inc. Digital counting of individual molecules by stochastic attachment of diverse labels
US9290809B2 (en) 2009-12-15 2016-03-22 Cellular Research, Inc. Digital counting of individual molecules by stochastic attachment of diverse labels
US9482615B2 (en) 2010-03-15 2016-11-01 Industrial Technology Research Institute Single-molecule detection system and methods
US9777321B2 (en) 2010-03-15 2017-10-03 Industrial Technology Research Institute Single molecule detection system and methods
US11634756B2 (en) 2010-04-05 2023-04-25 Prognosys Biosciences, Inc. Spatially encoded biological assays
US10662467B2 (en) 2010-04-05 2020-05-26 Prognosys Biosciences, Inc. Spatially encoded biological assays
US11542543B2 (en) 2010-04-05 2023-01-03 Prognosys Biosciences, Inc. System for analyzing targets of a tissue section
US11549138B2 (en) 2010-04-05 2023-01-10 Prognosys Biosciences, Inc. Spatially encoded biological assays
US11156603B2 (en) 2010-04-05 2021-10-26 Prognosys Biosciences, Inc. Spatially encoded biological assays
US11401545B2 (en) 2010-04-05 2022-08-02 Prognosys Biosciences, Inc. Spatially encoded biological assays
US11208684B2 (en) 2010-04-05 2021-12-28 Prognosys Biosciences, Inc. Spatially encoded biological assays
US11733238B2 (en) 2010-04-05 2023-08-22 Prognosys Biosciences, Inc. Spatially encoded biological assays
US11293917B2 (en) 2010-04-05 2022-04-05 Prognosys Biosciences, Inc. Systems for analyzing target biological molecules via sample imaging and delivery of probes to substrate wells
US11384386B2 (en) 2010-04-05 2022-07-12 Prognosys Biosciences, Inc. Spatially encoded biological assays
US11313856B2 (en) 2010-04-05 2022-04-26 Prognosys Biosciences, Inc. Spatially encoded biological assays
US11371086B2 (en) 2010-04-05 2022-06-28 Prognosys Biosciences, Inc. Spatially encoded biological assays
US11519022B2 (en) 2010-04-05 2022-12-06 Prognosys Biosciences, Inc. Spatially encoded biological assays
US10480022B2 (en) 2010-04-05 2019-11-19 Prognosys Biosciences, Inc. Spatially encoded biological assays
US11365442B2 (en) 2010-04-05 2022-06-21 Prognosys Biosciences, Inc. Spatially encoded biological assays
US11479810B1 (en) 2010-04-05 2022-10-25 Prognosys Biosciences, Inc. Spatially encoded biological assays
US11732292B2 (en) 2010-04-05 2023-08-22 Prognosys Biosciences, Inc. Spatially encoded biological assays correlating target nucleic acid to tissue section location
US11560587B2 (en) 2010-04-05 2023-01-24 Prognosys Biosciences, Inc. Spatially encoded biological assays
US11767550B2 (en) 2010-04-05 2023-09-26 Prognosys Biosciences, Inc. Spatially encoded biological assays
US11866770B2 (en) 2010-04-05 2024-01-09 Prognosys Biosciences, Inc. Spatially encoded biological assays
US11761030B2 (en) 2010-04-05 2023-09-19 Prognosys Biosciences, Inc. Spatially encoded biological assays
US9670243B2 (en) 2010-06-02 2017-06-06 Industrial Technology Research Institute Compositions and methods for sequencing nucleic acids
US10112969B2 (en) 2010-06-02 2018-10-30 Industrial Technology Research Institute Compositions and methods for sequencing nucleic acids
US9995683B2 (en) 2010-06-11 2018-06-12 Industrial Technology Research Institute Apparatus for single-molecule detection
EP3150750A2 (en) 2011-04-08 2017-04-05 Prognosys Biosciences, Inc. Peptide constructs and assay systems
US11788122B2 (en) 2011-04-13 2023-10-17 10X Genomics Sweden Ab Methods of detecting analytes
US11479809B2 (en) 2011-04-13 2022-10-25 Spatial Transcriptomics Ab Methods of detecting analytes
US11795498B2 (en) 2011-04-13 2023-10-24 10X Genomics Sweden Ab Methods of detecting analytes
US11352659B2 (en) 2011-04-13 2022-06-07 Spatial Transcriptomics Ab Methods of detecting analytes
US11634708B2 (en) 2012-02-27 2023-04-25 Becton, Dickinson And Company Compositions and kits for molecular counting
WO2013130512A3 (en) * 2012-02-27 2013-10-24 The University Of North Carolina At Chapel Hill Methods and uses for molecular tags
US10941396B2 (en) 2012-02-27 2021-03-09 Becton, Dickinson And Company Compositions and kits for molecular counting
US11177020B2 (en) 2012-02-27 2021-11-16 The University Of North Carolina At Chapel Hill Methods and uses for molecular tags
USRE50065E1 (en) 2012-10-17 2024-07-30 10X Genomics Sweden Ab Methods and product for optimising localised or spatial detection of gene expression in a tissue sample
US11618918B2 (en) 2013-06-25 2023-04-04 Prognosys Biosciences, Inc. Methods and systems for determining spatial patterns of biological targets in a sample
US11821024B2 (en) 2013-06-25 2023-11-21 Prognosys Biosciences, Inc. Methods and systems for determining spatial patterns of biological targets in a sample
WO2014210225A1 (en) 2013-06-25 2014-12-31 Prognosys Biosciences, Inc. Methods and systems for determining spatial patterns of biological targets in a sample
EP4219745A2 (en) 2013-06-25 2023-08-02 Prognosys Biosciences, Inc. Spatially encoded biological assays using a microfluidic device
US11286515B2 (en) 2013-06-25 2022-03-29 Prognosys Biosciences, Inc. Methods and systems for determining spatial patterns of biological targets in a sample
EP4234716A2 (en) 2013-06-25 2023-08-30 Prognosys Biosciences, Inc. Methods for determining spatial patterns of biological targets in a sample
US11359228B2 (en) 2013-06-25 2022-06-14 Prognosys Biosciences, Inc. Methods and systems for determining spatial patterns of biological targets in a sample
US11753674B2 (en) 2013-06-25 2023-09-12 Prognosys Biosciences, Inc. Methods and systems for determining spatial patterns of biological targets in a sample
US9567646B2 (en) 2013-08-28 2017-02-14 Cellular Research, Inc. Massively parallel single cell analysis
US10208356B1 (en) 2013-08-28 2019-02-19 Becton, Dickinson And Company Massively parallel single cell analysis
US11618929B2 (en) 2013-08-28 2023-04-04 Becton, Dickinson And Company Massively parallel single cell analysis
US10954570B2 (en) 2013-08-28 2021-03-23 Becton, Dickinson And Company Massively parallel single cell analysis
US11702706B2 (en) 2013-08-28 2023-07-18 Becton, Dickinson And Company Massively parallel single cell analysis
US9598736B2 (en) 2013-08-28 2017-03-21 Cellular Research, Inc. Massively parallel single cell analysis
US9637799B2 (en) 2013-08-28 2017-05-02 Cellular Research, Inc. Massively parallel single cell analysis
US10131958B1 (en) 2013-08-28 2018-11-20 Cellular Research, Inc. Massively parallel single cell analysis
US10151003B2 (en) 2013-08-28 2018-12-11 Cellular Research, Inc. Massively Parallel single cell analysis
US10253375B1 (en) 2013-08-28 2019-04-09 Becton, Dickinson And Company Massively parallel single cell analysis
US10927419B2 (en) 2013-08-28 2021-02-23 Becton, Dickinson And Company Massively parallel single cell analysis
US9567645B2 (en) 2013-08-28 2017-02-14 Cellular Research, Inc. Massively parallel single cell analysis
WO2015042708A1 (en) 2013-09-25 2015-04-02 Bio-Id Diagnostic Inc. Methods for detecting nucleic acid fragments
US9905005B2 (en) 2013-10-07 2018-02-27 Cellular Research, Inc. Methods and systems for digitally counting features on arrays
US9582877B2 (en) 2013-10-07 2017-02-28 Cellular Research, Inc. Methods and systems for digitally counting features on arrays
US11098358B2 (en) 2015-02-19 2021-08-24 Becton, Dickinson And Company High-throughput single-cell analysis combining proteomic and genomic information
US10697010B2 (en) 2015-02-19 2020-06-30 Becton, Dickinson And Company High-throughput single-cell analysis combining proteomic and genomic information
USRE48913E1 (en) 2015-02-27 2022-02-01 Becton, Dickinson And Company Spatially addressable molecular barcoding
US9727810B2 (en) 2015-02-27 2017-08-08 Cellular Research, Inc. Spatially addressable molecular barcoding
US10002316B2 (en) 2015-02-27 2018-06-19 Cellular Research, Inc. Spatially addressable molecular barcoding
US11535882B2 (en) 2015-03-30 2022-12-27 Becton, Dickinson And Company Methods and compositions for combinatorial barcoding
US11613773B2 (en) 2015-04-10 2023-03-28 Spatial Transcriptomics Ab Spatially distinguished, multiplex nucleic acid analysis of biological specimens
US11390912B2 (en) 2015-04-10 2022-07-19 Spatial Transcriptomics Ab Spatially distinguished, multiplex nucleic acid analysis of biological specimens
US11739372B2 (en) 2015-04-10 2023-08-29 Spatial Transcriptomics Ab Spatially distinguished, multiplex nucleic acid analysis of biological specimens
US11162132B2 (en) 2015-04-10 2021-11-02 Spatial Transcriptomics Ab Spatially distinguished, multiplex nucleic acid analysis of biological specimens
US11299774B2 (en) 2015-04-10 2022-04-12 Spatial Transcriptomics Ab Spatially distinguished, multiplex nucleic acid analysis of biological specimens
US11390914B2 (en) 2015-04-23 2022-07-19 Becton, Dickinson And Company Methods and compositions for whole transcriptome amplification
US11124823B2 (en) 2015-06-01 2021-09-21 Becton, Dickinson And Company Methods for RNA quantification
US11332776B2 (en) 2015-09-11 2022-05-17 Becton, Dickinson And Company Methods and compositions for library normalization
US10619186B2 (en) 2015-09-11 2020-04-14 Cellular Research, Inc. Methods and compositions for library normalization
US10822643B2 (en) 2016-05-02 2020-11-03 Cellular Research, Inc. Accurate molecular barcoding
US11845986B2 (en) 2016-05-25 2023-12-19 Becton, Dickinson And Company Normalization of nucleic acid libraries
US10301677B2 (en) 2016-05-25 2019-05-28 Cellular Research, Inc. Normalization of nucleic acid libraries
US11397882B2 (en) 2016-05-26 2022-07-26 Becton, Dickinson And Company Molecular label counting adjustment methods
US10202641B2 (en) 2016-05-31 2019-02-12 Cellular Research, Inc. Error correction in amplification of samples
US11525157B2 (en) 2016-05-31 2022-12-13 Becton, Dickinson And Company Error correction in amplification of samples
US11220685B2 (en) 2016-05-31 2022-01-11 Becton, Dickinson And Company Molecular indexing of internal sequences
US10640763B2 (en) 2016-05-31 2020-05-05 Cellular Research, Inc. Molecular indexing of internal sequences
US11782059B2 (en) 2016-09-26 2023-10-10 Becton, Dickinson And Company Measurement of protein expression using reagents with barcoded oligonucleotide sequences
US11467157B2 (en) 2016-09-26 2022-10-11 Becton, Dickinson And Company Measurement of protein expression using reagents with barcoded oligonucleotide sequences
US11460468B2 (en) 2016-09-26 2022-10-04 Becton, Dickinson And Company Measurement of protein expression using reagents with barcoded oligonucleotide sequences
US10338066B2 (en) 2016-09-26 2019-07-02 Cellular Research, Inc. Measurement of protein expression using reagents with barcoded oligonucleotide sequences
US11164659B2 (en) 2016-11-08 2021-11-02 Becton, Dickinson And Company Methods for expression profile classification
US11608497B2 (en) 2016-11-08 2023-03-21 Becton, Dickinson And Company Methods for cell label classification
US10722880B2 (en) 2017-01-13 2020-07-28 Cellular Research, Inc. Hydrophilic coating of fluidic channels
US11319583B2 (en) 2017-02-01 2022-05-03 Becton, Dickinson And Company Selective amplification using blocking oligonucleotides
US12084712B2 (en) 2017-06-05 2024-09-10 Becton, Dickinson And Company Sample indexing for single cells
US10669570B2 (en) 2017-06-05 2020-06-02 Becton, Dickinson And Company Sample indexing for single cells
US10676779B2 (en) 2017-06-05 2020-06-09 Becton, Dickinson And Company Sample indexing for single cells
US11946095B2 (en) 2017-12-19 2024-04-02 Becton, Dickinson And Company Particles associated with oligonucleotides
US11773441B2 (en) 2018-05-03 2023-10-03 Becton, Dickinson And Company High throughput multiomics sample analysis
US11365409B2 (en) 2018-05-03 2022-06-21 Becton, Dickinson And Company Molecular barcoding on opposite transcript ends
US11519033B2 (en) 2018-08-28 2022-12-06 10X Genomics, Inc. Method for transposase-mediated spatial tagging and analyzing genomic DNA in a biological sample
US11639517B2 (en) 2018-10-01 2023-05-02 Becton, Dickinson And Company Determining 5′ transcript sequences
US11932849B2 (en) 2018-11-08 2024-03-19 Becton, Dickinson And Company Whole transcriptome analysis of single cells using random priming
US11933957B1 (en) 2018-12-10 2024-03-19 10X Genomics, Inc. Imaging system hardware
US11492660B2 (en) 2018-12-13 2022-11-08 Becton, Dickinson And Company Selective extension in single cell whole transcriptome analysis
US11926867B2 (en) 2019-01-06 2024-03-12 10X Genomics, Inc. Generating capture probes for spatial analysis
US11753675B2 (en) 2019-01-06 2023-09-12 10X Genomics, Inc. Generating capture probes for spatial analysis
US11649485B2 (en) 2019-01-06 2023-05-16 10X Genomics, Inc. Generating capture probes for spatial analysis
US11371076B2 (en) 2019-01-16 2022-06-28 Becton, Dickinson And Company Polymerase chain reaction normalization through primer titration
US11661631B2 (en) 2019-01-23 2023-05-30 Becton, Dickinson And Company Oligonucleotides associated with antibodies
US12071617B2 (en) 2019-02-14 2024-08-27 Becton, Dickinson And Company Hybrid targeted and whole transcriptome amplification
US11965208B2 (en) 2019-04-19 2024-04-23 Becton, Dickinson And Company Methods of associating phenotypical data and single cell sequencing data
US11965213B2 (en) 2019-05-30 2024-04-23 10X Genomics, Inc. Methods of detecting spatial heterogeneity of a biological sample
US11939622B2 (en) 2019-07-22 2024-03-26 Becton, Dickinson And Company Single cell chromatin immunoprecipitation sequencing assay
US11702698B2 (en) 2019-11-08 2023-07-18 10X Genomics, Inc. Enhancing specificity of analyte binding
US11808769B2 (en) 2019-11-08 2023-11-07 10X Genomics, Inc. Spatially-tagged analyte capture agents for analyte multiplexing
US11592447B2 (en) 2019-11-08 2023-02-28 10X Genomics, Inc. Spatially-tagged analyte capture agents for analyte multiplexing
US11773436B2 (en) 2019-11-08 2023-10-03 Becton, Dickinson And Company Using random priming to obtain full-length V(D)J information for immune repertoire sequencing
US11332790B2 (en) 2019-12-23 2022-05-17 10X Genomics, Inc. Methods for spatial analysis using RNA-templated ligation
US11981965B2 (en) 2019-12-23 2024-05-14 10X Genomics, Inc. Methods for spatial analysis using RNA-templated ligation
US11505828B2 (en) 2019-12-23 2022-11-22 10X Genomics, Inc. Methods for spatial analysis using RNA-templated ligation
US12117439B2 (en) 2019-12-23 2024-10-15 10X Genomics, Inc. Compositions and methods for using fixed biological samples
US11795507B2 (en) 2019-12-23 2023-10-24 10X Genomics, Inc. Methods for spatial analysis using RNA-templated ligation
US11560593B2 (en) 2019-12-23 2023-01-24 10X Genomics, Inc. Methods for spatial analysis using RNA-templated ligation
US11649497B2 (en) 2020-01-13 2023-05-16 Becton, Dickinson And Company Methods and compositions for quantitation of proteins and RNA
US11732299B2 (en) 2020-01-21 2023-08-22 10X Genomics, Inc. Spatial assays with perturbed cells
US11702693B2 (en) 2020-01-21 2023-07-18 10X Genomics, Inc. Methods for printing cells and generating arrays of barcoded cells
US11821035B1 (en) 2020-01-29 2023-11-21 10X Genomics, Inc. Compositions and methods of making gene expression libraries
US12076701B2 (en) 2020-01-31 2024-09-03 10X Genomics, Inc. Capturing oligonucleotides in spatial transcriptomics
US11898205B2 (en) 2020-02-03 2024-02-13 10X Genomics, Inc. Increasing capture efficiency of spatial assays
US12110541B2 (en) 2020-02-03 2024-10-08 10X Genomics, Inc. Methods for preparing high-resolution spatial arrays
US11732300B2 (en) 2020-02-05 2023-08-22 10X Genomics, Inc. Increasing efficiency of spatial analysis in a biological sample
US11835462B2 (en) 2020-02-11 2023-12-05 10X Genomics, Inc. Methods and compositions for partitioning a biological sample
US11891654B2 (en) 2020-02-24 2024-02-06 10X Genomics, Inc. Methods of making gene expression libraries
US11926863B1 (en) 2020-02-27 2024-03-12 10X Genomics, Inc. Solid state single cell method for analyzing fixed biological cells
US11768175B1 (en) 2020-03-04 2023-09-26 10X Genomics, Inc. Electrophoretic methods for spatial analysis
US11535887B2 (en) 2020-04-22 2022-12-27 10X Genomics, Inc. Methods for spatial analysis using targeted RNA depletion
US11773433B2 (en) 2020-04-22 2023-10-03 10X Genomics, Inc. Methods for spatial analysis using targeted RNA depletion
US11661625B2 (en) 2020-05-14 2023-05-30 Becton, Dickinson And Company Primers for immune repertoire profiling
US11624086B2 (en) 2020-05-22 2023-04-11 10X Genomics, Inc. Simultaneous spatio-temporal measurement of gene expression and cellular activity
US11608520B2 (en) 2020-05-22 2023-03-21 10X Genomics, Inc. Spatial analysis to detect sequence variants
US11866767B2 (en) 2020-05-22 2024-01-09 10X Genomics, Inc. Simultaneous spatio-temporal measurement of gene expression and cellular activity
US11959130B2 (en) 2020-05-22 2024-04-16 10X Genomics, Inc. Spatial analysis to detect sequence variants
US11560592B2 (en) 2020-05-26 2023-01-24 10X Genomics, Inc. Method for resetting an array
US11608498B2 (en) 2020-06-02 2023-03-21 10X Genomics, Inc. Nucleic acid library methods
US11512308B2 (en) 2020-06-02 2022-11-29 10X Genomics, Inc. Nucleic acid library methods
US12098417B2 (en) 2020-06-02 2024-09-24 10X Genomics, Inc. Spatial transcriptomics for antigen-receptors
US11859178B2 (en) 2020-06-02 2024-01-02 10X Genomics, Inc. Nucleic acid library methods
US11692218B2 (en) 2020-06-02 2023-07-04 10X Genomics, Inc. Spatial transcriptomics for antigen-receptors
US11845979B2 (en) 2020-06-02 2023-12-19 10X Genomics, Inc. Spatial transcriptomics for antigen-receptors
US11840687B2 (en) 2020-06-02 2023-12-12 10X Genomics, Inc. Nucleic acid library methods
US12031177B1 (en) 2020-06-04 2024-07-09 10X Genomics, Inc. Methods of enhancing spatial resolution of transcripts
US11407992B2 (en) 2020-06-08 2022-08-09 10X Genomics, Inc. Methods of determining a surgical margin and methods of use thereof
US11492612B1 (en) 2020-06-08 2022-11-08 10X Genomics, Inc. Methods of determining a surgical margin and methods of use thereof
US11781130B2 (en) 2020-06-08 2023-10-10 10X Genomics, Inc. Methods of determining a surgical margin and methods of use thereof
US11624063B2 (en) 2020-06-08 2023-04-11 10X Genomics, Inc. Methods of determining a surgical margin and methods of use thereof
US11434524B2 (en) 2020-06-10 2022-09-06 10X Genomics, Inc. Methods for determining a location of an analyte in a biological sample
US11408029B2 (en) 2020-06-25 2022-08-09 10X Genomics, Inc. Spatial analysis of DNA methylation
US12060604B2 (en) 2020-06-25 2024-08-13 10X Genomics, Inc. Spatial analysis of epigenetic modifications
US11661626B2 (en) 2020-06-25 2023-05-30 10X Genomics, Inc. Spatial analysis of DNA methylation
US11952627B2 (en) 2020-07-06 2024-04-09 10X Genomics, Inc. Methods for identifying a location of an RNA in a biological sample
US11761038B1 (en) 2020-07-06 2023-09-19 10X Genomics, Inc. Methods for identifying a location of an RNA in a biological sample
US11981960B1 (en) 2020-07-06 2024-05-14 10X Genomics, Inc. Spatial analysis utilizing degradable hydrogels
US11932901B2 (en) 2020-07-13 2024-03-19 Becton, Dickinson And Company Target enrichment using nucleic acid probes for scRNAseq
US11981958B1 (en) 2020-08-20 2024-05-14 10X Genomics, Inc. Methods for spatial analysis using DNA capture
US11926822B1 (en) 2020-09-23 2024-03-12 10X Genomics, Inc. Three-dimensional spatial analysis
US11827935B1 (en) 2020-11-19 2023-11-28 10X Genomics, Inc. Methods for spatial analysis using rolling circle amplification and detection probes
US11739443B2 (en) 2020-11-20 2023-08-29 Becton, Dickinson And Company Profiling of highly expressed and lowly expressed proteins
US11618897B2 (en) 2020-12-21 2023-04-04 10X Genomics, Inc. Methods, compositions, and systems for capturing probes and/or barcodes
US11959076B2 (en) 2020-12-21 2024-04-16 10X Genomics, Inc. Methods, compositions, and systems for capturing probes and/or barcodes
US11873482B2 (en) 2020-12-21 2024-01-16 10X Genomics, Inc. Methods, compositions, and systems for spatial analysis of analytes in a biological sample
US11680260B2 (en) 2020-12-21 2023-06-20 10X Genomics, Inc. Methods, compositions, and systems for spatial analysis of analytes in a biological sample
US12098985B2 (en) 2021-02-19 2024-09-24 10X Genomics, Inc. Modular assay support devices
US11970739B2 (en) 2021-03-18 2024-04-30 10X Genomics, Inc. Multiplex capture of gene and protein expression from a biological sample
US11739381B2 (en) 2021-03-18 2023-08-29 10X Genomics, Inc. Multiplex capture of gene and protein expression from a biological sample
US12071655B2 (en) 2021-06-03 2024-08-27 10X Genomics, Inc. Methods, compositions, kits, and systems for enhancing analyte capture for spatial analysis
US11753673B2 (en) 2021-09-01 2023-09-12 10X Genomics, Inc. Methods, compositions, and kits for blocking a capture probe on a spatial array
US11840724B2 (en) 2021-09-01 2023-12-12 10X Genomics, Inc. Methods, compositions, and kits for blocking a capture probe on a spatial array

Also Published As

Publication number Publication date
US20100255463A1 (en) 2010-10-07
ATE377093T1 (en) 2007-11-15
US20030064366A1 (en) 2003-04-03
WO2002004680A3 (en) 2003-10-16
US20050260614A1 (en) 2005-11-24
CA2415897A1 (en) 2002-01-17
US20070172863A1 (en) 2007-07-26
US20100304367A1 (en) 2010-12-02
US20070184475A1 (en) 2007-08-09
JP2009118847A (en) 2009-06-04
US20070172868A1 (en) 2007-07-26
US20070172862A1 (en) 2007-07-26
WO2002004680A2 (en) 2002-01-17
US20070172861A1 (en) 2007-07-26
DE60131194T2 (en) 2008-08-07
CN1553953A (en) 2004-12-08
CN101525660A (en) 2009-09-09
US20100317005A1 (en) 2010-12-16
US20070172858A1 (en) 2007-07-26
EP1368460A2 (en) 2003-12-10
EP1975251A2 (en) 2008-10-01
US20070172865A1 (en) 2007-07-26
AU2001282881B2 (en) 2007-06-14
US20090275036A1 (en) 2009-11-05
CN100462433C (en) 2009-02-18
US20090305278A1 (en) 2009-12-10
US20070172867A1 (en) 2007-07-26
JP2009240318A (en) 2009-10-22
US20110021383A1 (en) 2011-01-27
US20110014604A1 (en) 2011-01-20
US20070172864A1 (en) 2007-07-26
EP2100971A2 (en) 2009-09-16
US20050266424A1 (en) 2005-12-01
EP2100971A3 (en) 2009-11-25
EP1975251A3 (en) 2009-03-25
JP2004513619A (en) 2004-05-13
EP1368460B1 (en) 2007-10-31
DE60131194D1 (en) 2007-12-13
US20070275395A1 (en) 2007-11-29
US20070292867A1 (en) 2007-12-20
AU8288101A (en) 2002-01-21
US7329492B2 (en) 2008-02-12

Similar Documents

Publication Publication Date Title
US7329492B2 (en) Methods for real-time single molecule sequence determination
US20070172866A1 (en) Methods for sequence determination using depolymerizing agent
AU2001282881A1 (en) Real-time sequence determination
US9382584B2 (en) Methods and systems for direct sequencing of single DNA molecules
US8927211B2 (en) Nucleotide compositions and uses thereof
US20030235854A1 (en) Methods for analyzing a nucleic acid
AU2007216737A1 (en) Real-time sequence determination
EP2203568A1 (en) A method and system for obtaining ordered, segmented sequence fragments along a nucleic acid molecule

Legal Events

Date Code Title Description
STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION